Get the latest tech news

Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB


Knowledge-intensive analytical applications retrieve context from both structured tabular data and unstructured, text-free documents for effective decision-making. Large language models (LLMs) have made it significantly easier to prototype such retrieval and reasoning data pipelines. However, implementing these pipelines efficiently still demands significant effort and has several challenges. This often involves orchestrating heterogeneous data systems, managing data movement, and handling low-level implementation details, e.g., LLM context management. To address these challenges, we introduce FlockMTL: an extension for DBMSs that deeply integrates LLM capabilities and retrieval-augmented generation (RAG). FlockMTL includes model-driven scalar and aggregate functions, enabling chained predictions through tuple-level mappings and reductions. Drawing inspiration from the relational model, FlockMTL incorporates: (i) cost-based optimizations, which seamlessly apply techniques such as batching and caching; and (ii) resource independence, enabled through novel SQL DDL abstractions: PROMPT and MODEL, introduced as first-class schema objects alongside TABLE. FlockMTL streamlines the development of knowledge-intensive analytical applications, and its optimizations ease the implementation burden.

View a PDF of the paper titled Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB, by Anas Dorbani and 3 other authors View PDFHTML (experimental) Abstract:Knowledge-intensive analytical applications retrieve context from both structured tabular data and unstructured, text-free documents for effective decision-making. To address these challenges, we introduce FlockMTL: an extension for DBMSs that deeply integrates LLM capabilities and retrieval-augmented generation (RAG).

Get the Android app

Or read this on Hacker News

Read more on:

Photo of DuckDB

DuckDB

Photo of language models

language models

Photo of RAG

RAG

Related news:

News photo

My Browser WASM't Prepared for This. Using DuckDB, Apache Arrow and Web Workers

News photo

Beyond RAG: How Articul8’s supply chain models achieve 92% accuracy where general AI fails

News photo

Circuit Tracing: Revealing Computational Graphs in Language Models (Anthropic)