Nvidia-Ingest: Multi-modal data extraction


NVIDIA Ingest is an early-access set of microservices for parsing hundreds of thousands of complex, messy, unstructured PDFs and other enterprise documents into metadata and text to embed into retri...

NVIDIA Ingest supports parsing PDFs, Word documents, and PowerPoint documents. It uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts, and images for use in downstream generative applications. It supports multiple extraction methods for each document type to balance trade-offs between throughput and accuracy. It also supports various pre- and post-processing operations, including text splitting and chunking, transformation and filtering, embedding generation, and image offloading to storage.
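To make the "text splitting and chunking" step concrete, here is a minimal sketch of word-based chunking with overlap, the kind of operation typically applied to extracted text before embedding. It is not NVIDIA Ingest's API or implementation: the function name `chunk_text` and the parameters `max_words` and `overlap` are hypothetical, chosen only for illustration.

```python
def chunk_text(text: str, max_words: int = 50, overlap: int = 10) -> list[str]:
    """Split text into chunks of up to max_words words.

    Consecutive chunks share `overlap` words so that a sentence cut at a
    chunk boundary still appears with some context in the next chunk.
    Hypothetical helper for illustration; not part of NVIDIA Ingest.
    """
    if max_words <= 0:
        raise ValueError("max_words must be positive")
    if not 0 <= overlap < max_words:
        raise ValueError("overlap must satisfy 0 <= overlap < max_words")

    words = text.split()
    step = max_words - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        # Stop once the current window has reached the end of the text.
        if start + max_words >= len(words):
            break
    return chunks


if __name__ == "__main__":
    sample = " ".join(f"w{i}" for i in range(12))
    for chunk in chunk_text(sample, max_words=5, overlap=2):
        print(chunk)
```

Larger chunks with less overlap mean fewer embeddings to compute, while smaller, overlapping chunks tend to preserve more local context for retrieval. This is a small-scale version of the throughput-versus-accuracy trade-off described above.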
