Reducing API Latency by 35% with Redis

I reduced API latency by 35% using Redis. But the interesting part wasn't the caching itself; it was the decisions around it. Here's what I actually learned:

1. Choosing what to cache is harder than how to cache
Not every endpoint deserves a cache. I only cached data that was read frequently and changed rarely. Wrong caching = stale data in production.

2. Cache invalidation is the real problem
Redis TTL handles expiry. But what if the data changes before the TTL expires? I had to think about an invalidation strategy before writing a single line of caching code.

3. Eviction policy matters more than memory size
I used allkeys-lru, so when Redis memory filled up, the least recently used keys were evicted automatically. Without this, Redis returns errors on writes under memory pressure.

4. Redis is not just a cache
The same Redis instance in my system served three jobs:
→ Cache layer (API response caching)
→ Message broker (Celery async job queue)
→ Session store (user session data)
One tool. Three completely different responsibilities.

Result: 35% latency reduction on critical endpoints, without touching a single database query.

Stack: Redis · Django · Celery

#Redis #BackendEngineering #Python #SystemDesign #Django #Celery #async
More Relevant Posts
"Why not just use a HashMap?" Every backend dev has heard this about Redis. Here's the truth they're missing: Redis isn't just a cache. It's a distributed data structure server. And that changes everything.

HashMap vs Redis:

HashMap:
→ Lives inside one JVM
→ Lost on restart
→ Invisible to other services
→ Dies when your app crashes

Redis:
→ Runs independently
→ Shared across ALL services
→ Survives restarts with persistence
→ Built-in TTL and expiration
→ Sub-millisecond operations

Where Redis actually shines:

1. Leaderboards: sorted sets handle real-time rankings at scale. No SQL queries, just O(log N) operations.
2. Rate limiting: track requests per user with atomic counters. Block abusers before they hit your DB.
3. Distributed locks: ensure only ONE instance runs critical jobs. No race conditions across replicas.
4. Session storage: stateless microservices behind load balancers? Redis keeps sessions alive across instances.
5. Pub/Sub: instant messaging between services. No polling, no delay.
6. Event streaming: a lightweight alternative when Kafka is overkill. Perfect for audit logs and notifications.

The mindset shift:
HashMap = memory inside one app.
Redis = memory shared across your entire system.

You don't add Redis because you need a cache. You add it when your architecture needs a fast, shared state layer. Once you understand this, you stop comparing Redis to HashMaps and start treating it as distributed infrastructure.

What's the most creative way you've used Redis in production?

#SpringBoot #Java #Microservices #BackendDevelopment #SoftwareArchitecture
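The rate-limiting use case above is commonly built on Redis INCR plus EXPIRE. A minimal fixed-window sketch, with an in-memory class standing in for Redis so it runs without a server (all names are illustrative):

```python
import time

class FakeRedis:
    """In-memory stand-in for the Redis INCR/EXPIRE pattern."""
    def __init__(self):
        self.store = {}   # key -> (count, window_expires_at)

    def incr_with_expiry(self, key, ttl, now=None):
        # Atomic in real Redis (INCR + EXPIRE, or a small Lua script);
        # simulated here as one step.
        now = time.time() if now is None else now
        count, expires_at = self.store.get(key, (0, now + ttl))
        if now >= expires_at:                 # window expired: reset
            count, expires_at = 0, now + ttl
        count += 1
        self.store[key] = (count, expires_at)
        return count

def allow_request(r, user_id, limit=5, window=60, now=None):
    """Fixed-window limit: at most `limit` requests per `window` seconds."""
    count = r.incr_with_expiry(f"rate:{user_id}", window, now=now)
    return count <= limit
```

The same shape works against a real Redis client because INCR is atomic across all callers, which is exactly what a per-process HashMap cannot give you.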
"Just add Redis" is not scaling. 🚫 It's a shortcut. And at scale, it breaks.

Imagine this 👇
10,000 concurrent requests (hello, Java Virtual Threads 👋), all fetching the same config. What happens?
➡️ 10,000 network calls to Redis
➡️ Same data
➡️ Same latency
➡️ Same bottleneck, just relocated

You didn't fix the problem. You moved it.

🧠 Senior engineers think differently. They don't ask "Where do I cache?" They ask "Where should this data live?"

⚡ The real pattern: multi-tier caching

L1 cache (in-process)
→ ultra-fast (no network)
→ perfect for hot, immutable data
→ e.g., Caffeine

L2 cache (distributed, i.e. Redis)
→ shared across instances
→ handles changing state
→ consistency matters here

💥 The mistake most systems make: using Redis for everything, even when the data rarely changes, every request hits the same key, and latency matters more than consistency.

🔑 Rule of thumb. If your data is:
read-heavy ✅
rarely changing ✅
identical across requests ✅
👉 it belongs in L1, not Redis.

📉 What changes when you fix this?
Fewer network hops
Lower tail latency (p99 improves BIG time)
Less Redis load
Better horizontal scaling

Most systems don't need more infra. They need a better cache hierarchy.

💬 Question for you: what % of your "cached" requests are still making a network call?

#SystemDesign #BackendEngineering #DistributedSystems #Caching #Redis #ScalableSystems #PerformanceEngineering #LowLatency #HighThroughput #TechLeadership #ArchitectureMatters #DeveloperCommunity
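The L1/L2 hierarchy described above can be sketched in a few lines. This is an illustrative toy, not any library's API: two dicts stand in for the in-process cache and for Redis, and counters make the hop savings visible:

```python
class TwoTierCache:
    """Minimal L1 (in-process) over L2 (shared, here a dict standing in
    for Redis) over the source of truth."""
    def __init__(self, loader):
        self.l1 = {}
        self.l2 = {}          # stand-in for a shared Redis instance
        self.loader = loader  # e.g. a DB query or config fetch
        self.l2_hits = 0
        self.loads = 0

    def get(self, key):
        if key in self.l1:            # no network hop at all
            return self.l1[key]
        if key in self.l2:            # one network hop in real life
            self.l2_hits += 1
            value = self.l2[key]
        else:                         # cold: hit the source of truth
            self.loads += 1
            value = self.loader(key)
            self.l2[key] = value
        self.l1[key] = value          # promote hot data into L1
        return value
```

A real setup would put TTLs and size bounds on L1 (Caffeine does this in the JVM); the point here is only the lookup order: L1, then L2, then the loader.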
Most engineers hear "Redis" and think "just a cache." It's so much more than that. 🧵

Redis is the database your backend calls 1,000 times a second and never waits for. Here's everything you need to know as an engineer 👇
✅ How it works (RAM, not disk)
✅ 5 data structures and when to use them
✅ 6 real-world use cases
✅ When to use Redis vs Postgres

Sub-millisecond latency. 1M+ ops/second. Built-in Pub/Sub.

Save this for the next time someone asks you to "add a cache." 🔖
🚀 The Day Our Database Almost Melted: The Thundering Herd Problem

Ever had a high-traffic "hot key" in Redis expire, only to see your database latency skyrocket seconds later? Welcome to the Thundering Herd.

📉 The Scenario
Imagine you have a microservice deployed on Kubernetes. You're caching a heavy database query in Redis to keep things snappy. Everything is fine until that one critical cache key hits its TTL (Time to Live) and expires. Suddenly:
Thousands of concurrent requests find a cache miss.
Instead of waiting, every single thread attempts to re-compute the data.
Your database is slammed with identical, expensive queries.
Latency spikes, pods start failing health checks, and you're in a full-blown incident.

🔒 The Evolution of the Lock
How do we stop the stampede? It depends on your scale.

1. The single-pod approach (local locking)
If you're running a single instance, you can handle this within the JVM. Using CompletableFuture combined with ConcurrentHashMap#computeIfAbsent, you can ensure that only one thread triggers the expensive DB call while the others wait for the result. No need to over-engineer!

2. The multi-pod reality (distributed locking)
In a modern K8s environment with multiple pods, local locks aren't enough: Pod A doesn't know Pod B is already fetching the data. This is where a distributed lock (using Redis/Redlock) becomes mandatory.

🛠️ Why distributed locking is a game changer:
Efficiency: only one thread across your entire cluster gains the right to "warm up" the cache.
Resource protection: you prevent the Thundering Herd from ever reaching your DB.
CPU savings: while one thread computes, the others wait or retry gracefully without burning CPU cycles on redundant calculations.

💬 Over to you...
Distributed locking adds complexity, but it's often the only thing standing between a smooth experience and a database meltdown. Have you ever faced a Thundering Herd problem in production? How did you solve it: a distributed lock, or something like cache-aside with background refreshing? Let's discuss in the comments! 👇

#SystemDesign #Redis #Microservices #SoftwareEngineering #Backend #Kubernetes #Java #DistributedSystems
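The single-pod fix (one thread computes, the rest wait) is language-agnostic. A minimal Python sketch of the same idea using a per-key lock with a double-check, analogous in spirit to the ConcurrentHashMap#computeIfAbsent approach (class and method names are made up):

```python
import threading

class SingleFlightCache:
    """Only one thread recomputes a missing key; the rest wait for its
    result. A sketch only: real code would also handle TTLs, errors,
    and lock cleanup."""
    def __init__(self, loader):
        self.loader = loader
        self.cache = {}
        self.locks = {}
        self.locks_guard = threading.Lock()
        self.load_count = 0

    def _lock_for(self, key):
        with self.locks_guard:
            return self.locks.setdefault(key, threading.Lock())

    def get(self, key):
        if key in self.cache:                 # fast path: cache hit
            return self.cache[key]
        with self._lock_for(key):             # stampede protection
            if key not in self.cache:         # double-check under the lock
                self.load_count += 1
                self.cache[key] = self.loader(key)
            return self.cache[key]
```

The multi-pod version replaces the threading.Lock with a distributed lock (e.g. Redis SET NX with an expiry), but the shape — acquire, double-check, compute once — stays the same.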
Redis solves problems. But not all problems are Redis problems.

Last week I reviewed an architecture: Redis between the API and Postgres, for caching user profiles. I asked: "How many requests per second?" Answer: "About 20."

Redis is a powerful tool. But it's not free:

Operational overhead: setup, monitoring, backups, failover, memory limits, eviction policies.
Cache invalidation: one of the two hardest problems in CS. Is the data in Redis stale? How do you know? How do you update it?
Another failure point: Redis down? Does the API stop responding, or degrade gracefully with a Postgres fallback?

When Redis is actually needed:
1. High read load. Thousands of identical requests per second that Postgres can't handle even with connection pooling.
2. Expensive computations. The result of a heavy calculation that's cheaper to cache than to recompute every time.
3. Leaderboards, counters, sessions. What Redis was built for: sorted sets, atomic increments, TTL out of the box.

When Redis isn't needed:
1. Low traffic. 20-100 RPS? Properly configured Postgres will handle it. Connection pool + index tuning.
2. Data changes often. If invalidation fires more often than cache hits, you just added latency.
3. Fresh data is critical. Finance, healthcare, any data where stale = problem.

Simple rule: first optimize Postgres (indexes, query optimization, connection pooling). Then add an in-memory cache at the application level (sync.Map in Go, a dict in Python). Only when that doesn't help, reach for Redis.

Complexity should be earned by pain, not added "just in case".

Do you use Redis? For what?

#golang #backend #redis #caching #performance
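The intermediate step recommended above — an app-level dict cache before reaching for Redis — needs only a TTL to be usable. A minimal sketch (the injectable `clock` parameter is an assumption for testability, not part of any standard API):

```python
import time

class TTLCache:
    """App-level dict cache with per-entry expiry: the step the post
    recommends trying before adding Redis. Not thread-safe; a real
    version would add a lock and a size bound."""
    def __init__(self, ttl_seconds, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.clock = clock
        self.store = {}  # key -> (value, expires_at)

    def get(self, key):
        entry = self.store.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if self.clock() >= expires_at:   # expired: drop and report a miss
            del self.store[key]
            return None
        return value

    def set(self, key, value):
        self.store[key] = (value, self.clock() + self.ttl)
```

At 20 RPS this kind of cache costs nothing to operate, which is exactly the point: no new failure mode, no new infrastructure.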
Redis cache invalidation is trickier than it looks. 🔴 Here's a mistake I made, and the fix that saved me.

I was using @CacheEvict to clear cached data after every update. Looks good in theory. But in practice, stale data kept appearing. 😤

The problem? I evicted by a SPECIFIC key. The same underlying data had MULTIPLE cache keys, one per filter combination. Evicting one left all the others dirty.

✅ The fix:
@CacheEvict(value = "policies", allEntries = true)
This clears the ENTIRE namespace, not just one key. Not surgical. But safe, correct, and easy to reason about.

💡 Rule of thumb:
Use targeted key eviction → only when you fully control the key space.
Use allEntries = true → when multiple key variants exist for the same data.

Simple. But it took me an embarrassing amount of time to figure out. 😅

What Redis gotchas have caught you off guard? Drop them below 👇

#Java #SpringBoot #Redis #BackendDevelopment #TechLead #JavaDeveloper #SoftwareEngineering
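The pitfall is easy to reproduce outside Spring: one logical record fans out into several cache keys (one per filter combination), so evicting a single key leaves the others stale. A toy illustration, with all names made up:

```python
class NamespacedCache:
    """Toy cache keyed by (namespace, key), mimicking the shape of
    Spring's cache-name + key model."""
    def __init__(self):
        self.store = {}

    def put(self, namespace, key, value):
        self.store[(namespace, key)] = value

    def get(self, namespace, key):
        return self.store.get((namespace, key))

    def evict(self, namespace, key):
        # Targeted eviction: the @CacheEvict(key = ...) analog.
        self.store.pop((namespace, key), None)

    def evict_all(self, namespace):
        # Namespace-wide eviction: the allEntries = true analog.
        for full_key in [k for k in self.store if k[0] == namespace]:
            del self.store[full_key]
```

The trade-off is cost: evict_all throws away warm entries too, so hit rates dip after every write, which is the price of correctness when the key space isn't fully under your control.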
Engineering for Scale: Why I Implemented Redis in My Project

Even though my current project doesn't have thousands of concurrent users yet, I wanted to tackle a very real-world problem: API latency and database load. I noticed that routes like GET /projects/:id/tasks require heavy SQL joins and filtering. In a production environment, hitting the DB for the same data every few seconds is a bottleneck waiting to happen. To see how tech companies solve this, I decided to implement Redis as a caching layer.

Solving real-world challenges. I didn't just "add a cache"; I treated this as a deep dive into distributed systems:
- Read-aside pattern: I built a "withCache" utility that prioritizes fast Redis hits but falls back to the database if the cache is empty or the Redis server is unreachable (graceful degradation).
- Auth-first approach: one crucial takeaway was ensuring authentication always happens before checking the cache. Speed should never come at the cost of security.
- Filter-aware caching: I learned how to design dynamic cache keys that encode filters like status or priority. Without this, the system would accidentally serve "To-Do" tasks to a user asking for "Done" tasks.

The "aha!" moments and errors. Implementation taught me things a tutorial never could:
- The ACL trap: I learned the hard way that my Redis ACL config file didn't support comments. A single "#" at the top caused a startup crash — a small detail that taught me a lot about production-ready configuration.
- Invalidation logic: I had to ensure that cache keys are deleted after a successful DB write. If you delete them before, you open a race condition where the cache might be re-populated with old data.

The goal: for me, this wasn't just about making the API faster. It was about learning how to design systems that balance performance, consistency, and failure handling.

The link to the full implementation (GitHub) and documentation (inside the /docs folder) is in the comments below 👇👇 — don't forget to check it out.

#SoftwareEngineering #Redis #BackendDevelopment #SystemArchitecture #Postgres #NodeJS #WebPerformance
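The read-aside-with-graceful-degradation pattern described above can be sketched generically. This is not the author's actual `withCache` utility (that's in their repo, and theirs is Node.js); it's an illustrative Python sketch of the same control flow, with all names assumed:

```python
def with_cache(cache_get, cache_set, key, load_from_db):
    """Read-aside with graceful degradation: if the cache is down or
    empty, fall back to the database; a failed cache write must never
    fail the request."""
    try:
        cached = cache_get(key)
        if cached is not None:
            return cached
    except ConnectionError:
        # Cache unreachable: degrade gracefully, serve from the DB.
        return load_from_db()
    value = load_from_db()
    try:
        cache_set(key, value)
    except ConnectionError:
        pass  # best-effort write; the response is already in hand
    return value
```

Filter-aware keys slot in naturally: the caller builds something like `f"tasks:{project_id}:status={status}"` so that different filter combinations never collide.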
🚀 Day 16 – Redis Explained (Backend Caching Basics)

Welcome to Day 16 of the Backend Engineering series. One of the most popular tools for caching is Redis.

🧠 What is Redis?
Redis is an in-memory data store. Instead of reading data from disk-based databases, Redis stores data directly in RAM. The result? ⚡ Extremely fast performance.

Common use cases:
• Caching database queries
• Storing session data
• Leaderboards
• Rate limiting
• Message queues

Example:
Without Redis: App → Database → Response
With Redis: App → Redis Cache → Response
If the data exists in Redis, it's returned instantly. That's why Redis is widely used in high-performance backend systems.

📅 Tomorrow: What is a Load Balancer? Follow along for backend engineering concepts explained simply. 🚀

#BackendEngineering #Redis #Caching #SoftwareEngineering #LearningInPublic
Why your "fast" Redis cache is actually killing your API performance 🛑📉

We've all been there: the feature is built, the Redis cache is connected, and everything seems perfect. But as the system scales, the P95 latency starts climbing. I saw a scenario recently that perfectly highlights a hidden bottleneck in high-scale Spring Boot applications.

🔍 The scenario
You have an API returning massive JSON responses (100KB – 2MB). You decide to cache the entire response in Redis to "speed things up." Sounds smart, right? But here is the hidden tax:
Network saturation: every time a request hits, your server has to fetch a 2MB blob from Redis over the network. Do this 500 times a second and you've saturated your network bandwidth.
CPU exhaustion: once you get that 2MB string, your app has to deserialize it back into a Java object. That is a heavy, CPU-intensive operation.

✅ The fix: don't cache "blobs," cache "intelligence"
If you are caching massive objects, you aren't optimizing; you are just moving the bottleneck from the database to the network/CPU. Here is how you fix it:
Cache fragments: break that 2MB object into smaller, logical pieces. Cache the pieces that don't change often.
Compression: if you must cache large data, compress it before storing it in Redis.
Binary serialization: switch from JSON (text-based) to Protobuf or Kryo (binary-based). They are significantly smaller and faster to serialize/deserialize.
L1/L2 caching: keep the most frequently accessed "hot" data in local memory (using Caffeine) to avoid the network hop to Redis entirely.

The lesson: performance is about more than just "using Redis." It's about being intentional with your data transfer and serialization overhead.

How do you handle large responses in your microservices? Let's talk in the comments! 👇

#Java #SpringBoot #Microservices #PerformanceTuning #Redis #SoftwareEngineering #Scalability #BackendDevelopment
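The compression fix is the cheapest to try, and repetitive JSON (the usual shape of a big API response) compresses dramatically. A minimal sketch of compress-before-cache using stdlib zlib (function names are illustrative):

```python
import json
import zlib

def cache_serialize(payload):
    """Compress-before-caching: JSON-encode, then zlib-compress the
    bytes before they go over the wire to Redis."""
    return zlib.compress(json.dumps(payload).encode("utf-8"))

def cache_deserialize(blob):
    """Inverse: decompress, then JSON-decode."""
    return json.loads(zlib.decompress(blob).decode("utf-8"))
```

The trade: you spend a little CPU at write/read time to cut the bytes moved per request, which is usually the right trade once network bandwidth (not CPU) is the saturated resource.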
⚡ Redis Caching: Sometimes the Fastest Query is the One You Don't Make

One lesson I learned building backend systems: performance is not only about writing better queries. Sometimes it's about avoiding repeated queries altogether. That's where caching helps 👇

What is caching?
Caching stores frequently accessed data in memory for faster retrieval.
Instead of: ❌ hitting the database on every request
Use: ✅ returning from the cache when possible

Why Redis?
🚀 Super fast (in-memory)
🚀 Reduces DB load
🚀 Improves API response time
🚀 Great for scalable systems

Good use cases for caching:
✔ Product/menu data
✔ Frequently accessed user profiles
✔ API responses
✔ Sessions / tokens
✔ Rate limiting

Simple flow:
1. Request comes in
2. Check the Redis cache
3. Cache hit → return fast
4. Cache miss → query the DB + store the result in the cache

Example idea (Spring Boot) — note that Spring Data's findById returns an Optional, so it needs unwrapping:

@Cacheable("users")
public User getUserById(Long id) {
    return repository.findById(id).orElseThrow();
}

💡 In my project, using caching helped improve performance for repeated reads.

🧠 Key insight: optimization is not only database tuning. Sometimes it starts with smarter data access.

Have you used Redis only for caching, or also for rate limiting / sessions?

#Java #SpringBoot #Redis #Caching #BackendDevelopment #PerformanceOptimization
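The declarative style of @Cacheable — annotate a method, get caching for free — can be mimicked in other languages with a decorator. A toy Python analog (illustrative only; Spring's real key generation and proxying work differently):

```python
import functools

def cacheable(cache_name, caches):
    """Tiny decorator mimicking the shape of Spring's @Cacheable:
    results are stored per cache name, keyed by the call arguments."""
    def decorator(fn):
        store = caches.setdefault(cache_name, {})
        @functools.wraps(fn)
        def wrapper(*args):
            if args not in store:        # miss: run the real function
                store[args] = fn(*args)
            return store[args]           # hit: skip the function body
        return wrapper
    return decorator
```

The appeal in both languages is the same: the caching policy lives in one place, and the business method body stays oblivious to it.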