Bruin: Build Data Pipelines with SQL and Python
2024-12-17
Bruin is a data pipeline tool that combines data ingestion, data transformation with SQL and Python, and data quality checks in a single framework. It works with major data platforms and runs on your local machine, an EC2 instance, or GitHub Actions. Beyond these core capabilities, it offers Jinja templating, end-to-end validation, and support for multiple environments. Each pipeline is defined in a pipeline.yml file.
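To make the ingest, transform, and check flow concrete, here is a minimal sketch in plain Python using only the standard library's sqlite3 module. It is not Bruin's API or configuration format: the table names (raw_orders, revenue_by_country), the function names, and the sample data are all hypothetical, chosen only to illustrate the three stages that a tool of this kind ties together.

```python
import sqlite3


def ingest(conn, rows):
    """Ingestion step: load raw records into a staging table."""
    conn.execute("CREATE TABLE raw_orders (id INTEGER, amount REAL, country TEXT)")
    conn.executemany("INSERT INTO raw_orders VALUES (?, ?, ?)", rows)


def transform(conn):
    """SQL transformation step: aggregate revenue per country."""
    conn.execute(
        """
        CREATE TABLE revenue_by_country AS
        SELECT country, SUM(amount) AS revenue
        FROM raw_orders
        GROUP BY country
        """
    )


def check_not_null(conn, table, column):
    """Quality check step: return True when the column has no NULL values.

    The table and column names are interpolated directly, so they must come
    from trusted code, never from user input.
    """
    (nulls,) = conn.execute(
        f"SELECT COUNT(*) FROM {table} WHERE {column} IS NULL"
    ).fetchone()
    return nulls == 0


def run_pipeline(rows):
    """Run ingest -> transform -> quality check against an in-memory database."""
    conn = sqlite3.connect(":memory:")
    try:
        ingest(conn, rows)
        transform(conn)
        if not check_not_null(conn, "revenue_by_country", "country"):
            raise ValueError("quality check failed: NULL country")
        return dict(conn.execute("SELECT country, revenue FROM revenue_by_country"))
    finally:
        conn.close()
```

Calling run_pipeline with a few (id, amount, country) tuples returns a dictionary of revenue per country, and raises ValueError when a record has no country. The point of the sketch is the ordering: the quality check runs on the transformed table, so bad input stops the run before results are returned.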
Development
data pipeline