A 10 million document RAG dataset occupies 31 GB of RAM at float32. turbovec fits it in just 4 GB, and now searches it faster than FAISS.

I just shipped a new release of turbovec: a Rust vector index with Python bindings, built on Google Research's TurboQuant algorithm. Data-oblivious 2-4 bit quantization that matches the Shannon lower bound on distortion: zero training, and no rebuilds when the corpus grows.

What's in the box:
→ Hand-written SIMD kernels: 12-20% faster than FAISS FastScan on ARM; match-or-beat on x86.
→ O(1) stable-id delete and save/load. The corpus is live and mutable, not a static snapshot.
→ Drop-in integrations for LangChain, LlamaIndex, and Haystack.
→ Published benchmarks (recall, speed, compression) at d=200/1536/3072, every number reproducible from the repo.

If you're building RAG where memory, latency, or privacy matters, give it a spin.

GitHub: https://lnkd.in/e5M4dVRk
Paper: https://lnkd.in/eHRmpYms

#RAG #VectorSearch #OpenSource #Rust #Python #LLM #Gemma4
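The headline figures imply embeddings of roughly 768 dimensions (my assumption; the post does not state a dimension). A quick back-of-envelope check of the 31 GB → 4 GB claim:

```python
# Back-of-envelope memory math for the figures in the post.
# ASSUMPTION (not stated in the post): 768-dim embeddings, a common size
# for sentence-embedding models.
N_DOCS = 10_000_000
DIM = 768

# float32 storage: 4 bytes per dimension.
float32_bytes = N_DOCS * DIM * 4
print(f"float32: {float32_bytes / 1e9:.1f} GB")  # ~30.7 GB, matching the quoted ~31 GB

# At 4 bits per dimension (the top of TurboQuant's 2-4 bit range):
quant_bytes = N_DOCS * DIM * 4 / 8
print(f"4-bit quantized: {quant_bytes / 1e9:.1f} GB")  # ~3.8 GB, within the quoted 4 GB
```

So the quoted 4 GB is consistent with ~4 bits per dimension plus a small amount of index overhead.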
Looks amazing, I love it.
I gotta check to see if it has native mps support because if so, I might be making the move from FAISS…
Amazing work. Thank you for sharing here
Great work 👏
This guy just can't stop building!
This is really cool, will try this. Thanks for sharing!
Amazing work !!!
Definitely gonna try it!!!
Man, times have moved on since I wrote fast sentence embeddings. Fantastic job, will give it a spin!
This looks very solid on the infra side. What we have been seeing is that once compression and latency are under control, the hard problems show up in ranking stability rather than raw recall, especially in dense regions where top-k becomes very sensitive to small perturbations. That part tends to matter more in production than the average metrics.
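The top-k sensitivity described in the comment above can be sketched in a few lines: in a dense region, where candidate scores are tightly packed, a small perturbation of the query (e.g. on the scale of quantization error) can reshuffle the top-k set. This is an illustrative toy, not turbovec code; all parameters below are arbitrary choices.

```python
import random

# Illustrative sketch: top-k overlap under a small query perturbation
# in a dense region, where all points cluster around one center and
# their similarity scores are nearly tied.
random.seed(0)

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def top_k(query, points, k):
    # Indices of the k points with the highest inner-product score.
    return set(sorted(range(len(points)), key=lambda i: -dot(query, points[i]))[:k])

dim, n, k = 32, 1000, 10
center = [random.gauss(0, 1) for _ in range(dim)]
# Dense region: every point is the center plus small noise.
points = [[c + random.gauss(0, 0.05) for c in center] for _ in range(n)]

query = [c + random.gauss(0, 0.05) for c in center]
# Perturb the query with noise on the same scale as the data spread.
perturbed = [q + random.gauss(0, 0.05) for q in query]

overlap = len(top_k(query, points, k) & top_k(perturbed, points, k)) / k
print(f"top-{k} overlap after small perturbation: {overlap:.0%}")
```

With well-separated clusters the overlap stays near 100%; in a dense region like this one, rank order near the k-th position is fragile, which is why averaged recall can look fine while per-query results churn.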