Apache Spark™ and Scala Workshops
-
Updated
Jul 29, 2024 - HTML
Apache Spark™ and Scala Workshops
Kinesis Connector for Structured Streaming
A real-time reddit data streaming pipeline for sentiment analysis of various subreddits
Custom state store providers for Apache Spark
A curated list of Pulsar tools, integrations and resources.
Azure Databricks - Advent of 2020 Blogposts
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
Extensible streaming ingestion pipeline on top of Apache Spark
Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python
Spark structured streaming examples with using of version 3.5.1
spark structured streaming via HTTP communication
This repository includes supervised and unsupervised machine learning methods which are used to detect anomalies on network datasets. Decision Tree, Random Forest, Gradient Boost Tree, Naive Bayes, and Logistic Regression were used for supervised learning. K-Means was used for unsupervised learning.
Spark1.6和spark2.2的示例,包含kafka,flume,structuredstreaming,jedis,elasticsearch,mysql,dataframe
Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps
Spark Structured Streaming data pipeline that processes movie ratings data in real-time.
An naive anomaly detection and data visualization tool for F1 on board telemetry data.
Real-time streaming data pipeline for Twitter Tweets
Samples for using Kafka within Spark Streaming and Akka Actors, Akka Streams
A library having Java and Scala examples for Spark 2.x
Analytics for IoT devices using Apache Spark Structured Streaming 2.4.0
Add a description, image, and links to the spark-structured-streaming topic page so that developers can more easily learn about it.
To associate your repository with the spark-structured-streaming topic, visit your repo's landing page and select "manage topics."