Coded by Weicheng Zhang.
This is the course project for Security Analytics, Fall 2017.
-
Homework 1: using Numpy, Pandas, sklearn, matplotlib, seaborn to decribe a csv chart.
-
Homework 2: using Pandas, matplotlib, subprocess, numpy to describe data from a TCP flow data file (.pcap), which is got by using WIRESHARK and output file using Tcpflow.
-
Homework 3: using decision tree and logistic model regression model to classify large data and see the accuracy of classification. Also I use information gain and entropy to screen all feature and get ten feature with most information of the data.
-
Homework 4: using K-means to devide data to clusters.
- Homework 1: the result is shown in /hw1/hw1_WeichengZhang.ipynb.
- Homework 2: the result is shown in /hw2/Hw2_WeichengZhang.ipynb.
- Homework 3: the result is shown in /hw3.Hw3_WeichengZhang.ipynb.
- Homework 3: the result is shown in /hw4.hw4_WeichengZhang.ipynb.