big-data
Here are 4,261 public repositories matching this topic...
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Updated Mar 20, 2024 - Python
ClickHouse® is a real-time analytics DBMS
Updated Nov 7, 2024 - C++
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Updated Oct 19, 2024
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Updated Nov 7, 2024 - Java
The Data Engineering Cookbook
Updated Aug 1, 2024
CMAK is a tool for managing Apache Kafka clusters
Updated Aug 2, 2023 - Scala
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Updated Nov 7, 2024 - Jupyter Notebook
PredictionIO, a machine learning server for developers and ML engineers.
Updated Jan 9, 2021 - Scala
Apache Ignite
Updated Nov 2, 2024 - Java
Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.
Updated Nov 7, 2024 - Java
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
Updated Nov 7, 2024 - Java
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Updated Nov 6, 2024 - Scala