
RAG architecture for SaaS – Learnings from building an AI code assistant

Building a RAG-based AI SaaS isn't hard, but there are gotchas and decisions you need to be mindful of. Here's how to build a code assistant from scratch, including the design decisions and the implementation details.

Scalability: As the set of embeddings grows, we can move some tenants to new physical machines, which allows us to use more memory and CPU for our applications over time.

When we get to the conversation phase, I'll show you how this data model allows me to find the relevant code snippets for a user question with a single query.

This made it easy to load the data: I could clone the repositories I wanted to embed and work with local files, rather than figure out the GitHub APIs.
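To make the "single query" idea concrete, here is a minimal sketch of tenant-scoped retrieval. It is an illustration under assumptions, not the article's implementation: the in-memory list stands in for whatever store the author used, the embeddings are tiny hand-written vectors rather than model output, and every name (`Snippet`, `top_snippets`, `STORE`) is hypothetical.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class Snippet:
    """One embedded chunk of code, tagged with the tenant that owns it (hypothetical model)."""
    tenant_id: str
    file_path: str
    text: str
    embedding: List[float]


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_snippets(store: List[Snippet], tenant_id: str,
                 query_embedding: List[float], k: int = 2) -> List[Snippet]:
    """Filter by tenant and rank by similarity in one pass over the data."""
    # Only the asking tenant's snippets are candidates, so one tenant's
    # code never appears in another tenant's results.
    candidates = [s for s in store if s.tenant_id == tenant_id]
    candidates.sort(
        key=lambda s: cosine_similarity(s.embedding, query_embedding),
        reverse=True,
    )
    return candidates[:k]


# Toy data: two tenants, made-up 2-dimensional embeddings.
STORE = [
    Snippet("tenant-a", "auth.py", "def login(user): ...", [0.9, 0.1]),
    Snippet("tenant-a", "db.py", "def connect(url): ...", [0.1, 0.9]),
    Snippet("tenant-b", "billing.py", "def charge(card): ...", [1.0, 0.0]),
]

if __name__ == "__main__":
    for snippet in top_snippets(STORE, "tenant-a", [1.0, 0.0], k=1):
        print(snippet.file_path)
```

Keeping the tenant identifier on every embedding row is what lets the filter and the ranking happen together; it is also what would make it possible to relocate one tenant's rows to another machine without touching the others.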
