Here are some datasets that you could use for the project:
- Kaggle
- AWS datasets
- UK government open data
- Github archive
- Awesome public datasets
- Million songs dataset
- Some random datasets
- COVID Datasets
- Datasets from Azure
- Datasets from BigQuery
- Dataset search engine from Google
- Public datasets offered by different GCP services
- European statistics datasets
- Datasets for streaming
- Dataset for Santander bicycle rentals in London
- Common crawl data (copy of the internet)
- NASA's EarthData (May require introductory geospatial analysis)
- Collection Of Data Repositories
PRs with more datasets are welcome!