Skip to content

chseifert/data-science-tutorials

 
 

Repository files navigation

data-science-tutorials

This repository contains jupyter-notebooks to accompany the tutorials for our data science lectures. The following topics are covered (each within a separate folder).

  1. Dataset Visualization (Boston Housing minus the linear regression; also other datasets like Flower, MNIST-digits, 20newsgroups) working/visualizing one dataset (incl. Matplotlib; .describe attribute; box-plot, min-max-normilization; boston housing; linear reg c/o dsP)
  2. Clustering
  3. Association Rule Learning (dataset yet to be determined; preferably from scikit learn)
  4. Regression (linear regression from Boston Housing and Car Prices)
  5. Bayes Learning (for spam filtering/text classification)
  6. Classification with Decision Trees (start with small 5-line dataset)
  7. Neural Networks (use keras.io to build a neural network for MNIST-digit classification) keras (for MNIST class); OPT gensim (for word2vec; pick dataset from tensorflow); then auto-encoder for representatino learning
  8. OPTIONAL MapReduce

Packages

See our python-tutorials on instructions how to set this up on your machine.

required

optional

  • Pandas; [documentation] also as pdf

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.7%
  • Python 0.3%