Ask your WhatsApp: build a private RAG with LlamaIndex

Why build a WhatsApp RAG? I have a very active group chat with my friends on WhatsApp. At the time of writing, it is a bit over half a million messages. Since LLMs became a thing, I always wondered how I could use this data for something useful—or at the very least, prank my friends. Last year I tried a few different approaches to fine tune a model using the chat data, but it didn’t work all that well. Fine‑tuning a model on commodity hardware is a challenge in itself and the results were underwhelming. So I dropped that idea for a while. While going through the material for the HuggingFace Agents Course though, it became very clear that RAG (Retrieval Augmented Generation) would be a perfect fit for what I was trying to do. ...

September 5, 2025 · Fabio