Databricks acquires Lilac to supercharge data quality efforts for gen AI apps
Lilac’s entire tech stack will come under Databricks Mosaic AI tooling to give developers a way to better curate their unstructured datasets for custom generative AI systems.
Lilac, founded in 2023 by former Google engineers Daniel Smilkov and Nikhil Thorat, addresses the challenge of curating unstructured datasets with a scalable open-source solution that offers an intuitive UI and AI-driven features to analyze, understand and modify unstructured text data. “The team behind Lilac specifically built their product to enable an analysis of model outputs for bias or toxicity, and preparation of data for RAG and fine-tuning or pre-training LLMs,” Databricks executives Matei Zaharia, Naveen Rao, Jonathan Frankle, Hanlin Tang and Akhil Gupta wrote in a joint blog post. While the specifics of the integration remain undisclosed at this stage, Lilac will do the same job inside Databricks: simplify data tailoring to make it easier for teams to evaluate and monitor the outputs of their LLMs, as well as prepare datasets for RAG, fine-tuning and pre-training.
Read the full story on VentureBeat.