Big Data Analytics

Coursework for CS5344 Big Data Analytics Technology @ NUS, Spring 2021

Topics

  • Big data algorithms and scalability
  • Cluster computing architecture
  • Market basket analysis
  • Customer segmentation
  • Recommender engines
  • MapReduce and related algorithms
  • Finding similar items
  • Clustering high-dimensional data
  • Frequent itemsets
  • Recommender systems
  • Link analysis

Tools and Techniques

  • Python
  • Hadoop ecosystem (HDFS, MapReduce)
  • Apache Spark
  • Search engine technology (e.g. Google's PageRank, link-spam detection, hubs and authorities)

Resources