- 👋 Hi, I’m @hucodelab and my hobbies are: theater, reading, and studying.
- 👀 I’m interested in Data Engineering, Analytics, Data Science, Business Intelligence, Logistics and Stock Markets.
- 📫 How to reach me: hugovangulo@gmail.com
Skills: Python, SQL, PySpark, Cloud Computing, AI.
A solution that empowers global investors to stay ahead of potential crises worldwide. The solution followed the Medallion Architecture and was built on Databricks and PySpark, it was used Databricks Workflows to orchestrate the pipelines. The solution was deployed as a web-based application using Streamlit, enabling users to interact with the data in near real-time and access the features. https://latamfusionapp.azurewebsites.net/
This project gathers data from multiple sources, including the Yahoo Finance API, web scraping of Twitch data, and collecting fundamental data on video game companies. We then organize and streamline this information through data pipelines. Finally, the data is processed using machine learning regression models to predict future stock prices, providing actionable insights for investors.
This project collects data from multiple sources, including web scraping of Reddit, macroeconomic data from Central Bank API, and the Yahoo Finance API (stock prices). We streamline and integrate this data through automated pipelines, then apply regression models to predict future stock prices of PETROBRAS, empowering investors with data-driven insights.
This project shows the ETL and Machine Learning model building process to predict whether a client will accept an e-commerce offer. It was also made a deploy of the app.
This project shows the process of manipulating data and building a Machine Learning model to predict whether a US citizen has an income higher than $50,000 per year. The project compares the performance of 3 models: Random Forest, Random Forest with Grid Search (hyperparameter optimization), and logistic regression.
This project shows the process of data manipulation and construction of a LSTM model to predict whether a stock it's going to increase its market price or not. It was built a feature selection model to select the best features for the LSTM model.
This project shows the process of data manipulation and construction of a Deep Learning regression model trained with climatological data from the 20 largest soybean-producing municipalities in the state of Paraná - BR to make predictions of soybean productivity.
This project shows the process of data manipulation and Deep Learning classification model building trained to recognize Street numbers. The project compares the performance of two neural network models: Multilayer Perceptron (MLP) and Convolutional Neural Network (CNN).
This project shows metrics and indicators related to the twitter accounts of the main candidates of the Brazilian 2022 elections. The visualizations created in this project were deployed to a web application using DASH and REACT. This project was developed by Turing USP and I contributed to the data processing and visuals generation.
This project shows the process of extracting, manipulating, and analyzing data from the 2020 StackOverflow survey. The project contains visuals of the number of respondents by programming language, salaries, and developers' job satisfaction.
This project shows the scrapping (extraction) of Airbnb data from Airbnb Brazil. It was possible to make a comparative analysis between accommodation's prices three different brazilian cities as well. This software was developed using the python library: BeautifulSoup.
This project shows the scrapping (extraction) of Rotten Tomatoes data and it was possible to build a dataset using the data. The project was developed by using the python library: Selenium.