Skip to content
View hvpachisia's full-sized avatar

Block or report hvpachisia

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
hvpachisia/README.md

Hi, I'm Harsh ๐Ÿ‘‹

Thanks for stopping by! Iโ€™m a data scientist and program evaluator with over 6 years of experience in the United States and India. I've led and worked in data science teams working on building predictive ML models for government clients, creating GIS-based systems, and developing widely-used data products in the urban and tech space, among other things. I hold an MSc in Computer Science and Public Policy (MS-CAPP) at the University of Chicago. I am passionate about applying my technical skills to build & improve products and programs that improve the lives of people.

Feel free to take a look at my resume and to connect with me via LinkedIn.

  • ๐Ÿ: The programming languages I use are: Python, R, and SQL.
  • โ˜๏ธ: Cloud Computing Stack: AWS and GCP
  • ๐Ÿ“Š: Other skills: Power BI, PySpark, Tableau, QGIS, Computer Vision.
  • ๐Ÿ“–: Currently working on: Improving expertise in LLMs and NLP.

Some of the projects I have worked on, among others, are:

  • Conducted a comprehensive analysis of climate change impacts using NLP models, computer vision, and dashboards.
  • Developed supervised & unsupervised ML models to evaluate the impact of transit programs and analyze crime patterns.
  • Deployed boosted trees ML model (in R) to optimize methane leak inspections at oil & gas facilities.
  • Used OpenCV and QGIS to classify and georeference urban land use images into interactive raster maps to show environmental degradation over time.
  • Built NLP-based techniques and models to investigate fake news.

You can see some of my work in the following repositories:

  1. ๐Ÿš“ Rides to Safety: Chicago Crime, UChicago, and the Lyft Smart Ride Program: This repository presents a comprehensive analysis of transportation and crime in Chicago. The projects utilize big data processing, machine learning, and interactive visualization to evaluate the impact of safety programs and analyze crime patterns. Technologies used include GCP and PySpark for handling large datasets, AWS for cloud computing, and R for data visualization and interactive applications. These skills are showcased through various components, including exploratory data analysis, supervised and unsupervised machine learning, and the development of Shiny applications for interactive data exploration.

  2. ๐Ÿ๏ธ Climate Dynamics Decoded: Analyzing Impact, Opinion, and Change: This repository is an integrated analysis of climate change impacts through disaster frequency and cost, public sentiment, and urban land use changes, utilizing techniques such as NLP models, PyTorch, OpenCV, and Dash for interactive data analysis.

  3. ๐Ÿ‡ฎ๐Ÿ‡ณ The economic and environmental costs of congestion: In this repository, I developed a novel method to estimate the economic & social costs of congestion in cities using half a billion Uber data points. I also built an interactive dashboard & published a paper highlighting the impacts of removing bottlenecks on labor markets & overall economic productivity. Techniques utilized included big data processing, PostGres for data management, with PostGIS, Python, & R for analysis.

Please get in touch, Iโ€™m keen to chat. I always look forward to meeting enthusiastic and interesting new people.

Pinned Loading

  1. climate_impact_analysis climate_impact_analysis Public

    An integrated analysis of climate change impacts through disaster frequency and cost, public sentiment, and urban land use changes, showcasing advanced data science and visualization techniques.

    Jupyter Notebook

  2. chicago_transport_crime chicago_transport_crime Public

    A capstone repository containing a holistic analysis of transport and crime in the University of Chicago area and across Chicago.

    Jupyter Notebook

  3. datapartnership/covid-19-maharashtra datapartnership/covid-19-maharashtra Public

    Understanding societal responses to policies undertaken during emergencies: Lessons from COVID-19's Second Wave in Maharashtra

    Jupyter Notebook 1

  4. fake-news-detection fake-news-detection Public

    Utilizing NLP-based techniques and models to investigate fake news

    Jupyter Notebook 1

  5. Urban-Mobility Urban-Mobility Public

    Using Uber Movement and OSRM data to calculate the economic & environmental costs of congestion in a city

    HTML 2 3