Pre-processing high cardinality categorical (HCC) variables
The notebook explaines various methods to deal with high cardinality categorical values like ZIP/PIN codes, profession type etc. Usually there are hundreds of unique values. These are too many to create dummies for each. Each method is followed by source of the method.