Skip to content
#

mapreduce-python

Here are 91 public repositories matching this topic...

《大数据挖掘技术》@复旦 课程项目,试图从搜狗实验室用户查询日志数据(2008)中找出搜索记录中有较高支持度关键词的频繁二项集。在实现层面上,我搭建了一个由五台服务器组成的微型 Hadoop 集群,并且用 Python 实现了 Parallel FP-Growth 算法中的三个 MapReduce 过程。

  • Updated Mar 29, 2021
  • Python

MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a reduce function that merges all intermediate values associated with the same intermediate key. Many real-world tasks…

  • Updated Apr 24, 2024
  • Python

Improve this page

Add a description, image, and links to the mapreduce-python topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mapreduce-python topic, visit your repo's landing page and select "manage topics."

Learn more