Examples of applying unsupervised machine learning using K-means clustering.
Read the tutorial: Intelligent Topic Detection with Unsupervised Learning
Unsupervised learning is applied to a data set of randomly generated colors. The red, green, and blue values are used as features to categorize each color under a specific parent category.
For example, purple might be categories as Red or Blue. Likewise, Sky Blue would be categorized under Blue.
- Red
- Green
- Blue
The following graphs show the results of clustering and categorizing colors by their red, green, and blue values.
The following three colors were used as new data for predicting the category for.
red green blue hex x y group label
1 241 52 11 #F1340B 1 15807499 2 red
2 80 187 139 #50BB8B 2 5290891 3 green
3 34 15 194 #220FC2 3 2232258 1 blue
Unsupervised learning is applied to a data set of exchange traded funds. The percentage values for "Year to Date", "1 Year", "5 Year", and "10 Year" returns are used as features to categorize each ETF under a specific parent category. Example code is provided in R and JavaScript.
- International
- StockBigGain
- Stock
- Bond
- SmallMidLargeCap
The following output shows the results of clustering and categorizing ETF funds based on their percentage returns.
MIT
Kory Becker