RAG without using any framework: The Upgrade 🛠️ I recently ditched LangChain to build a RAG engine from scratch. The goal was total control over the pipeline. In January 2026, I made the initial prototype. I just pushed a major update including Reranking and a Web UI via Streamlit. Tech Stack: ➡️ Python 3.12 + asyncio ➡️ Google Gemini 2.5 Flash ➡️ FAISS + FlashRank (ms-marco-TinyBERT-L-2-v2) ➡️ Streamlit & Docker Why manual ? By implementing the ingestion, chunking, and retrieval manually, I optimized the memory management (auto-summarization) and added local reranking without fighting against library abstractions. The result is a fast, async RAG app that I actually understand line-by-line. 🚧 Current Status & Roadmap: Next, I plan to make a fast offline RAG with more features added on top of the current ones. #RAG #Python #GenerativeAI #Streamlit #Docker #GoogleGemini #Reranking #MachineLearning #VectorSearch #asyncio

To view or add a comment, sign in

Explore content categories