Skip to content
@DataRockLabs

DataRock Labs

DataRock Labs

Your data, our passion.

At DataRock Labs, we believe that the journey of information from raw data to business value should be transparent. By combining cutting-edge data platform technologies with AI solutions, we ensure that you get the most out of your data.

Our services

Data engineering 💻

Data rocks. But it's often dirty, fragmented, and difficult to analyze. We can help you get the most out of your data by cleaning it up, organizing it, and making it easy to understand.

Artifical intelligence 🤖

AI rocks. And the first steps are easy, but valuable AI services in production is a whole different ballgame, especially at enterprise scale. Our AI solutions are designed to provide reliable long-term support for your business processes.

Business intelligence 📊

BI rocks. But it's completely useless if it isn't in the right place, at the right time, in the right format. We excel in extracting valuable insights from data, enabling informed decision-making and enhancing your business's efficiency.

Our open-source contributions 🌐

Great Expectations is a Python library that helps you write tests to validate data in your pipelines, ensuring that it meets your expectations. It allows you to define expectations for various data sources and formats, such as databases, CSV files, or JSON files, and then run those expectations as tests. This can be particularly useful in data engineering and data science workflows to ensure data quality and consistency throughout the pipeline. Great Expectations provides a flexible and expressive way to specify these expectations and integrate them seamlessly into your existing codebase.

We have 50 merged Pull Requests in the Great Expectations Project 🚀 View our merged Pull Requests

Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code. Benefits include:

  • Run dbt projects against Airflow connections instead of dbt profiles
  • Native support for installing and running dbt in a virtual environment to avoid dependency conflicts with Airflow
  • Run tests immediately after a model is done to catch issues early
  • Utilize Airflow’s data-aware scheduling to run models immediately after upstream ingestion
  • Turn each dbt model into a task/task group complete with retries, alerting, etc.

We have 3 merged Pull Requests in the Astronomer Cosmos Project View our merged Pull Requests

CodiumAI is leveraging its know-how in order to provide developers with an AI agent tool aiming to help developers review PRs faster and more efficiently. It automatically analyzes the commits and the PR and can provide several types of feedback:

Auto-Description: Automatically generating PR description - name, type, summary, and code walkthrough.
PR Review: Feedback about the PR main theme, type, relevant tests, security issues, focused PR, and various suggestions for the PR content.
Question Answering: Answering free-text questions about the PR.
Code Suggestion: Committable code suggestions for improving the PR.

We have 2 closed Pull Requests in the Codium PR-Agent Project View our merged Pull Requests

Popular repositories Loading

  1. lida lida Public

    Forked from microsoft/lida

    Automatic Generation of Visualizations and Infographics using Large Language Models

    Jupyter Notebook

  2. .github .github Public

  3. Budapest-PowerBI-Meetup-10th Budapest-PowerBI-Meetup-10th Public

    C#

Repositories

Showing 3 of 3 repositories
  • DataRockLabs/Budapest-PowerBI-Meetup-10th’s past year of commit activity
    C# 0 0 0 0 Updated Apr 23, 2024
  • .github Public
    DataRockLabs/.github’s past year of commit activity
    0 0 0 0 Updated Feb 24, 2024
  • lida Public Forked from microsoft/lida

    Automatic Generation of Visualizations and Infographics using Large Language Models

    DataRockLabs/lida’s past year of commit activity
    Jupyter Notebook 0 MIT 299 0 0 Updated Jan 19, 2024

Top languages

Loading…

Most used topics

Loading…