
Bauplan – Git-for-data pipelines on object storage


A Python-first Serverless Lakehouse

Bauplan is a Pythonic data platform that provides functions as a service for large-scale data pipelines and git-for-data over S3 data lakes. Bauplan handles tasks that would typically require an entire infrastructure team.

We are a team of ML and data engineers, and we built Bauplan because we’ve experienced firsthand the frustration of spending too much time wrestling with cloud infrastructure. Using Git-for-data and our unique system of Refs, we make sure that every pipeline run, table, and model is automatically versioned, reproducible, and auditable. You can run interactive or async SQL queries across branches and tables in S3, with full support for versioned data.
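To make the git-for-data idea concrete, here is a minimal in-memory sketch of branches and refs over versioned tables. It is a toy model only and does not use or represent Bauplan's API: `ToyCatalog`, `Commit`, and every method name are hypothetical, and the fast-forward-only merge is a simplifying assumption.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple, Union

Rows = Tuple[tuple, ...]


@dataclass(frozen=True)
class Commit:
    """An immutable snapshot of every table at one point in time."""
    id: int
    parent: Optional[int]
    tables: Dict[str, Rows]


class ToyCatalog:
    """Hypothetical in-memory catalog; an illustration, not Bauplan's API."""

    def __init__(self) -> None:
        self.commits: Dict[int, Commit] = {0: Commit(0, None, {})}
        self.branches: Dict[str, int] = {"main": 0}

    def resolve(self, ref: Union[str, int]) -> int:
        """A ref is either a branch name or a commit id."""
        if isinstance(ref, int):
            if ref not in self.commits:
                raise KeyError(f"unknown commit {ref}")
            return ref
        return self.branches[ref]

    def create_branch(self, name: str, from_ref: Union[str, int] = "main") -> None:
        # Branching copies a pointer, not the data.
        self.branches[name] = self.resolve(from_ref)

    def write_table(self, branch: str, table: str, rows) -> int:
        # Every write creates a new commit; older commits are never mutated.
        head = self.commits[self.branches[branch]]
        tables = dict(head.tables)
        tables[table] = tuple(rows)
        new_id = len(self.commits)
        self.commits[new_id] = Commit(new_id, head.id, tables)
        self.branches[branch] = new_id
        return new_id

    def read_table(self, ref: Union[str, int], table: str) -> list:
        return list(self.commits[self.resolve(ref)].tables[table])

    def merge(self, source: str, into: str = "main") -> None:
        """Fast-forward only: `into` must be an ancestor of `source`."""
        target = self.branches[into]
        cursor: Optional[int] = self.branches[source]
        while cursor is not None and cursor != target:
            cursor = self.commits[cursor].parent
        if cursor is None:
            raise ValueError("branches have diverged; toy merge is fast-forward only")
        self.branches[into] = self.branches[source]


catalog = ToyCatalog()
first = catalog.write_table("main", "orders", [(1, "paid")])
catalog.create_branch("dev")
catalog.write_table("dev", "orders", [(1, "paid"), (2, "refunded")])
main_before_merge = catalog.read_table("main", "orders")  # dev work is isolated
catalog.merge("dev", into="main")
main_after_merge = catalog.read_table("main", "orders")
pinned = catalog.read_table(first, "orders")  # an old commit id still reads the old data
```

Because commits are never modified, reading by commit id always returns the same rows, which is what makes a run reproducible and auditable in this model; a branch is only a movable name for a commit.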


