Tutorials for STAT4609 (2023) Big Data Analytics
- introduction to python
- Object-Oriented Programming (OOP) in Python 3
- PCA
- Never use 'print()' to debug again! Pysnooper is all you need!
- Lasso Regression using scikit-learn :)
- Application of tree-based methods with SKlearn, including, decision tree, random forest, boosting.
- Implementation of Decision Tree from scratch
- Kernel regression review
- Kernel regression implementation from scratch [Simplified example for HW3]
-
A1 & A2 extension: Implementation of Lasso Logistic Regression for Binary Classification from scratch
-
GAM
-
Implementation of Naive Bayes from scratch
-
Implementation of K-Means Clustering from scratch
- Non-negative Matrix Factorization using Alternating Least Squares [Important for HW5]
- User-based Collaborative Filtering from scratch, which is easy to extend for the item-based case.
- User-based Collaborative Filtering using Surprise [Surprise is an easy-to-use Python scikit for recommender systems.]
- Pytorch Introduction I
- Graph Basics with NetworkX
- Spectral Clustering Implementation from Scratch
- Comparison with K-means Clustering using Scikit-Learn
- Image Segmentation using Spectral Clustering
- Hierarchical Clustering with Scikit-Learn on both graph and tabular data
- TF-IDF:
- Implementation from Scratch
- Use SKLearn
- Text Classification with TF-IDF
- using Naive Bayes Classifier
- using Logistic Regression
- SVM implementation from scratch
- A gentle introduction to torch.autograd.
- Perceptron_and_neural_networks.