Machine Learning Algorithms From Scratch

This is was mostly a 'practice' repository, containing some ML algorithms which I have implemented from scratch. I no longer update or maintain this.

K Nearest Neighbours: Recommends movies from the TMDB 5000 movies dataset based on the list of genres given as input.

Logistic Regression: Predicts how likely peope are to buy a product based on their gender, age, and salary.

Simple Neural Network: 2-layered neural network which mimics the XOR gate, implemented(vectorized) from scratch using NumPy.

Digit Classification: Dataset used: MNIST
- Contains a binary classifier that labels all 0s as 1 and rest all digits as 0.
- Also contains an extension of the above classifier that classifies all 10 digits with an accuracy of 94%.
- Both of the above networks are 2-layered and are implemented(vectorized) from scratch using NumPy.

Decison Trees: Decision Tree classifier implemented from scratch in python. Dataset used: Banknote authentication dataset

Support Vector Machine: A simple C-SVM binary classifier. Dataset used: Breast Cancer Wisconsin Dataset

K-Means Clustering:
- Dataset used: Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap
- Implemented K-Means clustering algorithm. Used matplotlib to visualize clusters and centroids.

Principal Component Analysis:
- Dataset used: AT&T Database of Faces
- Applied the Principal Component Analysis (PCA) algorithm for dimensionality reduction on face images.

Moving Averages
- Dataset used: Air Quality Data Set
- Applied Simple Moving Average (SMA), Cumulative Moving Average (CMA), Weighted Moving Average (WMA), Exponentially Weighted Average (EWMA) on the dataset, all functions are written in NumPy.

Convolutions
- Edge Detection using kernels in NumPy
- Automated Canny Edge Detector

Contributers:

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
Convolutions		Convolutions
Decision-Trees		Decision-Trees
Digit-Classification		Digit-Classification
Histogram-Equalization		Histogram-Equalization
K-Means-Clustering		K-Means-Clustering
K-Nearest-Neighbours		K-Nearest-Neighbours
Logistic-Regression		Logistic-Regression
Moving-Averages		Moving-Averages
PCA		PCA
SVM		SVM
Simple-Neural-Network		Simple-Neural-Network
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md