The "ThreadLocal" Trap: Why Your Session Logic Fails in Vert.x and Kafka
Transitioning from traditional synchronous Java development to an asynchronous, event-driven architecture with Vert.x and Apache Kafka is a rewarding journey, but it comes with a major wake-up call: your traditional session mechanisms are probably obsolete.
After a deep dive into development and debugging today, I’ve consolidated a few critical architectural shifts that every team must consider before writing the first line of code.
1. The Death of the Context-Thread Bond
In a classic Servlet-based world, we rely heavily on ThreadLocal to store user sessions, security contexts, or trace IDs. It's easy: one thread per request. In Vert.x, the Event Loop is king. A single request may jump across multiple threads, or a single thread may handle thousands of interleaved requests. The moment you cross an asynchronous boundary (an await, a future callback, or a Kafka send), execution may resume on a different thread and your ThreadLocal context vanishes.
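You can reproduce the trap with nothing but the JDK. This is a minimal sketch (class and variable names are mine, and a plain ExecutorService stands in for a Vert.x worker or event-loop thread): the value set on the "request" thread is simply not there when the continuation runs elsewhere, and the fix is to capture it explicitly.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class ThreadLocalTrap {
    private static final ThreadLocal<String> USER = new ThreadLocal<>();

    public static void main(String[] args) throws Exception {
        // Stand-in for a Vert.x worker/event-loop thread.
        ExecutorService pool = Executors.newSingleThreadExecutor();
        USER.set("alice"); // set on the "request" thread

        // The continuation runs on a different thread: the ThreadLocal is gone.
        String onWorker = pool.submit(() -> USER.get()).get();
        System.out.println("on worker: " + onWorker); // null

        // Fix: capture the value explicitly and pass it into the task.
        String captured = USER.get();
        String passed = pool.submit(() -> captured).get();
        System.out.println("passed explicitly: " + passed); // alice

        pool.shutdown();
    }
}
```

In real Vert.x code the same idea applies: attach data to the message or to the Vert.x Context object rather than to the thread.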
2. Statelessness is Not Optional
In a Kafka-driven processor, the "session" doesn't exist in memory. If your processor needs to call a long-running remote API (like a heavy PDF parser), you cannot simply "wait" and expect the environment to stay the same. You must explicitly pass state through Message Headers or Metadata Objects.
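One way to make that explicit is an envelope: the business payload travels together with a small metadata map, which is what Kafka message headers give you on the wire. Here is a hedged pure-JDK sketch (the `Envelope` record and `enrich` helper are illustrative names, and a `Map` stands in for real Kafka headers):

```java
import java.util.Map;

public class EnvelopeDemo {
    // A minimal envelope: business payload plus explicit context metadata.
    record Envelope(String payload, Map<String, String> metadata) {}

    static Envelope enrich(String payload, String userId, String traceId) {
        return new Envelope(payload, Map.of("userId", userId, "traceId", traceId));
    }

    public static void main(String[] args) {
        Envelope e = enrich("invoice-42.pdf", "u-123", "trace-abc");
        // Downstream consumers read context from the message, never from memory.
        System.out.println(e.metadata().get("userId"));  // u-123
        System.out.println(e.metadata().get("traceId")); // trace-abc
    }
}
```

With the real Kafka client you would put the same key/value pairs into the record's headers so every consumer, on any instance, can reconstruct the context.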
3. Rethinking the "Long-Running" Task
Synchronous systems "block." Asynchronous systems "flow." If a task takes 30 seconds, a traditional system hangs a thread. In an event-driven system, you should be looking at:
Asynchronous Callbacks: Trigger the task and let the result flow back into a different Kafka topic.
Context Propagation: Explicitly carrying userId and traceId within the payload metadata.
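Both points above can be sketched in one small program. This is a simulation under stated assumptions: `BlockingQueue`s stand in for Kafka topics, an executor stands in for a worker consumer, and the `Msg` record is a hypothetical envelope. The long-running task's result flows into a separate "topic", and userId/traceId travel inside the message, not in a ThreadLocal.

```java
import java.util.Map;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.TimeUnit;

public class FlowDemo {
    record Msg(String payload, Map<String, String> meta) {}

    public static void main(String[] args) throws Exception {
        // BlockingQueues stand in for Kafka topics in this sketch.
        BlockingQueue<Msg> requests = new LinkedBlockingQueue<>();
        BlockingQueue<Msg> results  = new LinkedBlockingQueue<>();
        ExecutorService worker = Executors.newSingleThreadExecutor();

        // Worker: consume, run the long task, publish to the result "topic",
        // copying the context metadata forward instead of using ThreadLocal.
        worker.submit(() -> {
            Msg in = requests.take();
            String parsed = "parsed:" + in.payload();   // the "heavy PDF parser"
            results.put(new Msg(parsed, in.meta()));    // context travels with the message
            return null;
        });

        requests.put(new Msg("report.pdf", Map.of("userId", "u-123", "traceId", "t-9")));
        Msg out = results.poll(5, TimeUnit.SECONDS);
        System.out.println(out.payload() + " for " + out.meta().get("userId"));
        worker.shutdown();
    }
}
```

The design point: no thread ever "owns" the request, so no thread ever has to wait for it.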
🔑 Key Takeaway for Architects:
Don't try to retrofit synchronous patterns into an asynchronous world. If you don't design your context propagation strategy (how session data travels across the event bus or Kafka topics) during the blueprint phase, you will spend weeks debugging NullPointerExceptions and lost sessions.
Design for the flow, not for the thread. 🦑
#Java #Vertx #ApacheKafka #SoftwareArchitecture #BackendDevelopment #Microservices #AsyncProgramming
The Claim Check Pattern is criminally underused. Most teams try to shove large payloads directly through Kafka, which kills broker performance and creates all sorts of message-size configuration headaches. Offloading to object storage and passing just the reference is the right call. That 32s to 12ms improvement makes total sense once you remove the serialization and memory overhead from the hot path. We use a similar approach for document-processing pipelines: store the raw file in S3, push a lightweight event with the reference, and let workers pull what they need. One thing worth watching with MinIO in this setup is making sure your bucket lifecycle policies clean up temporary objects, or you will end up with storage creep over time.
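The shape of the pattern fits in a few lines. A hedged sketch, assuming an in-memory `Map` as a stand-in for S3/MinIO (the `checkIn`/`checkOut` names are illustrative, not a real SDK): the event carries only a small key, and the consumer fetches the heavy payload on demand.

```java
import java.util.Map;
import java.util.UUID;
import java.util.concurrent.ConcurrentHashMap;

public class ClaimCheckDemo {
    // In-memory stand-in for an object store such as S3 or MinIO.
    static final Map<String, byte[]> objectStore = new ConcurrentHashMap<>();

    // "Check in" the large payload; only the key travels through the broker.
    static String checkIn(byte[] largePayload) {
        String key = "tmp/" + UUID.randomUUID();
        objectStore.put(key, largePayload);
        return key;
    }

    // "Check out" and delete: a naive stand-in for a bucket lifecycle policy.
    static byte[] checkOut(String key) {
        return objectStore.remove(key);
    }

    public static void main(String[] args) {
        byte[] bigFile = new byte[10 * 1024 * 1024];  // a 10 MB "document"
        String ref = checkIn(bigFile);                // the event carries ~40 bytes, not 10 MB
        byte[] fetched = checkOut(ref);
        System.out.println("fetched bytes: " + fetched.length);
        System.out.println("store cleaned: " + objectStore.isEmpty());
    }
}
```

In production the delete-on-read above would be replaced by an actual bucket lifecycle or TTL policy, which is exactly the storage-creep concern raised above.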