Skip to content

Latest commit

 

History

History
50 lines (35 loc) · 925 Bytes

README.md

File metadata and controls

50 lines (35 loc) · 925 Bytes

Data Mining

For development

  1. Create virtual environment with virtualenv .venv.
  2. Activate venv with .venv\Scripts\activate.bat.
  3. Launch jupyter lab with jupyter-lab.

Lab 1

  • basic data selection
  • visualization
  • pandas
  • iris, zoo and autos datasets

Lab 2

  • k-nearest neighbors
  • kd-tree and ball tree
  • generating n-dimensional data linearly separable
  • generating checkerboard

Lab 3

  • k-means
  • fixing permutations - clusterization
  • jaccard
  • PCA visualization
  • Gaussian Mixture
  • Agglomerative Clustering
  • zoo dataset
  • image compression with clusterization

Lab 4

  • markov model
    • words as states
    • letters as states
  • prime ministers exposes dataset

Lab 5