Get the latest tech news

Apache DataFusion


¶ DataFusion is an extensible query engine written in Rust that uses Apache Arrow as its in-memory format. The documentation on this site is for the core DataFusion project, which contains libraries and binaries for developers building fast and feature rich database and analytic systems, customized to particular workloads.

The documentation on this site is for the core DataFusion project, which contains libraries and binaries for developers building fast and feature rich database and analytic systems, customized to particular workloads. “Out of the box,” DataFusion offers SQL and Dataframe APIs, excellent performance, built-in support for CSV, Parquet, JSON, and Avro, extensive customization, and a great community. DataFusion features a full query planner, a columnar, streaming, multi-threaded, vectorized execution engine, and partitioned data sources.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Apache DataFusion

Apache DataFusion