🚀 SQL Optimization Case Study: Fixing Concurrency & Performance in Series Generation

Worked on optimizing a stored procedure responsible for generating unique reference numbers in a high-concurrency system.

Before (Problem)
• Separate SELECT + UPDATE → race condition risk
• Multiple IF blocks → duplicate code
• NOLOCK → dirty reads / inconsistent data
👉 Result: Duplicate IDs, slow performance, and unreliable behaviour under load.

After (Solution)

🔹 Atomic Update (Key Fix)
UPDATE Series WITH (UPDLOCK, ROWLOCK)
SET CurrentSeries = CurrentSeries + 1
OUTPUT inserted.CurrentSeries
✔ Single operation → no race condition
✔ Ensures thread-safe sequence generation

🔹 Removed Redundant Queries
• Eliminated repeated SELECT blocks
• Used OUTPUT to fetch updated values directly
✔ Reduced query count
✔ Improved execution speed

🔹 Improved Locking Strategy
• Used UPDLOCK → prevents concurrent updates
• Removed NOLOCK → avoids dirty reads
✔ Better data consistency + reliability

🔹 Index Optimization
CREATE NONCLUSTERED INDEX IX_Series_Type_Active
ON Series (SeriesType, IsActive)
INCLUDE (CurrentSeries, SeriesUpto, Prefix);
✔ Faster lookup
✔ Reduced table scans

📊 Impact
🚫 Eliminated duplicate reference numbers
⚡ Improved performance under concurrency
🔒 Stronger data integrity
🧩 Cleaner & maintainable code

💡 Takeaway: For high-volume systems, always ensure:
• Atomic operations > separate SELECT + UPDATE
• Proper locking > NOLOCK shortcuts
• Efficient functions > convenience functions
👉 Small SQL changes can create big performance gains.

#SQLServer #DatabaseOptimization #Concurrency #PerformanceTuning #BackendEngineering #SystemDesign
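For readers who want the pattern in one place, here is a minimal end-to-end sketch of the atomic update described above. Only the Series table and its columns come from the post; the @SeriesType parameter, the WHERE clause, and the table variable are assumptions added for illustration.

-- Sketch only: parameter, WHERE clause, and table variable are illustrative assumptions.
DECLARE @SeriesType VARCHAR(50) = 'INVOICE';     -- hypothetical series type
DECLARE @NewSeries TABLE (CurrentSeries BIGINT);

UPDATE Series WITH (UPDLOCK, ROWLOCK)
SET CurrentSeries = CurrentSeries + 1
OUTPUT inserted.CurrentSeries INTO @NewSeries    -- capture the new value in the same statement
WHERE SeriesType = @SeriesType
  AND IsActive = 1;

SELECT CurrentSeries FROM @NewSeries;            -- the freshly issued reference number

Because the read and the increment happen in a single statement, two concurrent callers can never observe and reuse the same value.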
More Relevant Posts
Query Optimization Mistakes (Final Synthesis)

Most database performance problems are self-inflicted. Not because databases are slow, but because queries are poorly designed. After working with production systems, I see the same mistakes appear repeatedly:

❌ Fetching more data than needed (SELECT * everywhere)
❌ Missing or wrong indexes
❌ Ignoring execution plans
❌ N+1 query patterns
❌ Using OFFSET pagination at scale (a keyset alternative is sketched below)
❌ Long-running transactions

Individually, each mistake seems small. Combined, they destroy performance.

Real scenario: an API with
• inefficient queries
• no indexing strategy
• excessive joins
works fine in development. It fails under production load.

Here's the truth: databases don't get slower. Workloads get heavier.

Optimization is not about tricks. It's about:
• reducing I/O
• minimizing round trips
• understanding execution plans

The biggest shift happens when you stop asking "Why is this query slow?" and start asking "What unnecessary work is happening?"

That's where real performance gains come from.

#Databases #SQL #Performance #BackendEngineering #SystemDesign
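On the OFFSET point specifically, the usual fix is keyset (seek) pagination. A minimal sketch, assuming an Orders table with an indexed Id column (names are illustrative, not from the post):

DECLARE @LastSeenId BIGINT = 1000;   -- last Id returned on the previous page (hypothetical)

SELECT TOP (50) Id, CustomerId, CreatedAt
FROM Orders
WHERE Id > @LastSeenId               -- seek straight to where the previous page ended
ORDER BY Id;

Unlike OFFSET, the cost stays flat as the page number grows, because the engine never reads and discards earlier rows.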
🚀 𝗦𝗼𝗹𝘃𝗶𝗻𝗴 𝗦𝗤𝗟 𝗖𝗼𝗻𝗰𝘂𝗿𝗿𝗲𝗻𝗰𝘆 𝗕𝗼𝘁𝘁𝗹𝗲𝗻𝗲𝗰𝗸𝘀 𝗶𝗻 𝗛𝗶𝗴𝗵-𝗩𝗼𝗹𝘂𝗺𝗲 𝗦𝘆𝘀𝘁𝗲𝗺𝘀

In one of my recent projects, I faced a critical issue while handling bulk data updates (thousands of records in parallel) at the database level.

🔍 What Was Going Wrong?
The system used a single shared staging table, which looked simple, but broke under scale:
• Parallel jobs were blocking each other (locks & contention)
• Table truncation caused cross-job failures
• Stored procedures were tightly coupled to fixed tables
• ROW_COUNT() was unreliable in dynamic execution
• Adding new processing flows required code changes
👉 Clearly, this wasn't a query problem — it was a database design flaw.

🛠️ What I Changed (SQL Architecture Shift)
Instead of patching queries, I redesigned the staging layer:
✔️ Introduced one staging table per execution line (isolation-first)
✔️ Built config-driven table mapping (DB-controlled, no hardcoding)
✔️ Used dynamic SQL with controlled validation
✔️ Captured ROW_COUNT() immediately after execution (fixing inconsistency)
✔️ Optimized bulk inserts using MySqlBulkCopy + batching

⚙️ Production-Grade Safeguards
• Transaction handling with rollback support
• Validation layer for dynamic SQL (prevent unsafe execution)
• Timezone-consistent timestamps
• Zero-code onboarding via configuration

📈 Result
• Eliminated concurrency issues completely
• Parallel processing became stable and predictable
• Accurate row tracking restored
• System became scalable without redeployment
• Clean separation between application logic and DB layer

💡 Key Learning
Most performance issues are not solved by writing better queries.
👉 They are solved by designing better data flow and isolation strategies.

#SQL #SystemDesign #BackendEngineering #DotNet #Performance #DatabaseArchitecture
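To make the ROW_COUNT() point concrete, here is a minimal MySQL sketch of capturing it immediately after a dynamically prepared statement. The table, column, and variable names are assumptions for illustration, and the dynamic values would need the validation layer described above before being concatenated.

-- Assumes @staging_table and @batch_id were already validated against the config mapping.
SET @sql = CONCAT('UPDATE ', @staging_table, ' SET processed = 1 WHERE batch_id = ', @batch_id);
PREPARE stmt FROM @sql;
EXECUTE stmt;
SET @affected_rows = ROW_COUNT();    -- read it here, before any later statement can reset it
DEALLOCATE PREPARE stmt;
SELECT @affected_rows;

The key detail is that ROW_COUNT() reflects only the most recently executed data-changing statement, so capturing it one statement too late is what produces the "unreliable" counts.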
Your database is not slow. Your queries are.

One of the most common things I hear is: "The database is slow." But most of the time… it isn't.

A while ago, I had to analyze a query that was taking several minutes to run. At first glance, nothing "too wrong". But digging deeper, the pattern was clear:
* Looping through records
* Nested subqueries executed per row
* Repeated reads over the same tables

Classic row-by-row processing. So instead of trying to "tune" the query… I rewrote the approach.

From this mindset:

FOREACH record
    RUN subquery

To this:

WITH AggregatedData AS (
    SELECT EventId, SUM(Value) AS Total
    FROM Items
    GROUP BY EventId
)
SELECT e.Id, a.Total
FROM Events e
LEFT JOIN AggregatedData a ON a.EventId = e.Id

The result:
* Query time dropped from minutes to milliseconds
* Massive reduction in IO
* Stable performance even with data growth

That's when it becomes very clear: SQL is not about how to iterate. It's about how to describe the result.

Another common issue I still see: developers relying on ORM-generated queries without ever checking what is actually executed. ORMs are great. But the database only understands SQL.

The real shift happens when you start looking at:
* Execution plans
* Index usage
* Logical reads

Because that's where performance actually lives.

The database is rarely the problem. Data access patterns are.

Curious to hear: have you ever rewritten a query and seen a massive performance gain?

#SQLServer #DatabasePerformance #BackendDevelopment #SoftwareEngineering #DotNet #SystemDesign #DataEngineering
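If you want to see the logical reads mentioned above, SQL Server can report them per query with standard session settings. A quick sketch, reusing the post's table names:

SET STATISTICS IO ON;    -- logical reads per table
SET STATISTICS TIME ON;  -- CPU and elapsed time

WITH AggregatedData AS (
    SELECT EventId, SUM(Value) AS Total
    FROM Items
    GROUP BY EventId
)
SELECT e.Id, a.Total
FROM Events e
LEFT JOIN AggregatedData a ON a.EventId = e.Id;

SET STATISTICS IO OFF;
SET STATISTICS TIME OFF;

The messages output then shows logical reads per table, which is usually the first number worth chasing when replacing a row-by-row pattern with a set-based one.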
When NOT to Normalize Your Database

Normalization is good. Until it isn't.

Database normalization reduces redundancy. It keeps data clean. It enforces consistency. That's why it's taught as best practice.

But at scale, normalization can hurt performance. Highly normalized schemas require:
• multiple joins
• more queries
• more I/O
Each join adds cost.

Real scenario: an analytics system joins 6 tables for every request. Each query becomes expensive. Latency increases. Throughput drops.

Denormalization solves this:
• duplicate data intentionally
• reduce joins
• improve read performance

But now you introduce:
• data duplication
• update complexity
• consistency challenges

Normalization favors correctness. Denormalization favors performance.

The mistake is treating normalization as a rule. It's not. It's a starting point.

Good engineers normalize first. Then denormalize strategically based on real performance needs.

Database design is not theory. It's trade-offs under load.

#Databases #SQL #Performance #BackendEngineering #SystemDesign
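As a deliberately simplified sketch of "denormalize strategically": duplicate one hot value onto the read-heavy table and accept the write-side cost of keeping it in sync. Table and column names here are illustrative, not from the post.

-- Duplicate the customer's name onto Orders so the hot read path skips a join.
ALTER TABLE Orders ADD CustomerName NVARCHAR(200) NULL;

-- Read path: one table, no join.
SELECT Id, CustomerName, Total
FROM Orders
WHERE Id = 42;

-- Write path now carries the consistency cost: refresh the copy whenever the name changes.
UPDATE o
SET o.CustomerName = c.Name
FROM Orders o
JOIN Customers c ON c.Id = o.CustomerId
WHERE c.Id = 7;

The trade-off is exactly the one described above: faster reads in exchange for an extra update path that must never be forgotten.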
SQL Optimization from "Theoretical" to "Performant" | Tuning the SQL Table for our application (MRRS)

We've all heard that indices are the answer to slow SQL performance. But as anyone working on enterprise-scale data knows: it's never that simple.

In my recent work on the MRRS project for Network Rail, I ran into a classic bottleneck where standard indexing wasn't just "not helping", it was actually adding overhead to our ETL cycles. When you're dealing with the scale and precision required for rail infrastructure data, "good enough" isn't an option. I took a deep dive into the execution plans to move beyond the basics.

Here's what worked for us.

The Practical Approach:
1. SARGability Over Everything: We found several legacy filters wrapping columns in functions (like YEAR() or TRIM()), which completely blinded our indices. Refactoring these to be SARG-compliant dropped our execution time significantly.
2. Index Pruning: We realized we had "index bloat": too many overlapping indices that were slowing down our INSERT and UPDATE operations during the daily data refresh. By consolidating into a few high-impact composite indices, we improved write throughput.
3. Covering Indices: Instead of just indexing the WHERE clause, we included the SELECT columns in the index itself. This allowed the engine to fulfill the query entirely from the index without ever touching the heavy base tables.

The Result? By moving away from "blanket indexing" and focusing on precision tuning, we achieved a much leaner, more resilient data pipeline that can handle the complexities of the MRRS project without breaking a sweat.

The Lesson: Don't just add an index because it's there. Read the execution plan, understand the write penalty, and optimize for the specific data shape you're working with.

#DataEngineering #SQLOptimization #AzureDataFactory #MRRS #DatabaseTuning #SeniorDataEngineer
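For anyone who hasn't hit the SARGability trap before, here is a generic before/after sketch of the pattern described in point 1, plus a covering index as in point 3. Table and column names are made up for illustration, not from MRRS.

-- Before: wrapping the column in a function blinds the index, forcing a scan.
SELECT AssetId, Measurement
FROM Readings
WHERE YEAR(RecordedAt) = 2024;

-- After: an open-ended range on the bare column lets the optimizer seek.
SELECT AssetId, Measurement
FROM Readings
WHERE RecordedAt >= '2024-01-01' AND RecordedAt < '2025-01-01';

-- Covering index: the query is answered from the index alone, never touching the base table.
CREATE NONCLUSTERED INDEX IX_Readings_RecordedAt
ON Readings (RecordedAt)
INCLUDE (AssetId, Measurement);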
Most developers learn SQL. Very few learn what happens underneath it when 10,000 transactions hit the same row at once.

That gap is exactly why some engineers get called into production incidents at 2 AM, and others design systems that never need that call.

Database locks are the invisible arbiters of concurrency. Get them wrong and you get deadlocks, dirty reads, and silent data corruption. Get them right and your system handles massive load without flinching.

Here's what the lock layer actually looks like across all 9 types, and the specific problem each one was built to solve.

A Shared Lock allows multiple transactions to read simultaneously but blocks any write; your read-heavy APIs live and die by this. An Exclusive Lock goes further: one writer takes full control and everyone else waits, no exceptions. The Update Lock is the underrated one: it acts as an intention gate that reserves write rights before committing, which is the only reliable way to prevent the S→X conversion deadlock that quietly causes 80% of production incidents.

Schema Locks protect structural changes, so your ALTER TABLE doesn't corrupt live queries mid-flight. Bulk Update Locks drop row-level friction entirely during mass inserts, which is why ETL pipelines need them. Row-Level Locks give you surgical precision: lock exactly one row, leave the entire table free for everything else. Page-Level Locks trade some granularity for lower overhead. Table-Level Locks are the blunt instrument: simple to implement, devastating to concurrency at scale. And the Transaction Coordinator Hub orchestrates all of it, routing lock keys and validation pulses across every concurrent operation.

Most deadlocks aren't bugs. They're design decisions made without knowing which lock type to reach for.

Save this for the next time your DBA asks about lock escalation.

Which lock type has burned you worst in production? Drop it below. 👇
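The update-lock "intention gate" is easiest to see in code. A minimal T-SQL sketch with an illustrative Accounts table: take the update lock on the read so that two sessions can never both hold shared locks on the row and then deadlock while upgrading to exclusive.

BEGIN TRAN;

-- UPDLOCK on the read reserves write rights up front, so the S→X conversion never happens.
SELECT Balance
FROM Accounts WITH (UPDLOCK, ROWLOCK)
WHERE AccountId = 42;

UPDATE Accounts
SET Balance = Balance - 100
WHERE AccountId = 42;

COMMIT;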
𝗪𝗵𝘆 𝗱𝗼 𝗿𝗮𝗰𝗲 𝗰𝗼𝗻𝗱𝗶𝘁𝗶𝗼𝗻𝘀 𝘀𝘁𝗶𝗹𝗹 𝗯𝗿𝗲𝗮𝗸 𝗦𝗤𝗟 𝗰𝗼𝗱𝗲 𝘁𝗵𝗮𝘁 𝗹𝗼𝗼𝗸𝘀 𝗰𝗼𝗺𝗽𝗹𝗲𝘁𝗲𝗹𝘆 𝗰𝗼𝗿𝗿𝗲𝗰𝘁?

Reading and updating data is not the same as protecting data under concurrency.

This article explains SQL locks and isolation levels in a simpler, more practical way, using a real-world situation. It covers concepts like cell-level, row-level, and table-level locks, phantom reads, and isolation levels.

If you work on transactional workflows, this should help make concurrency issues much easier to reason about.

#SQL #Databases #SystemDesign #Transactions #Concurrency #DatabaseLocks #IsolationLevels #RaceConditions
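A small illustration of the kind of pattern such articles usually cover: a check-then-insert run under SERIALIZABLE, which closes the phantom window that READ COMMITTED leaves open. The table and values here are hypothetical, not taken from the linked article.

SET TRANSACTION ISOLATION LEVEL SERIALIZABLE;

BEGIN TRAN;

-- The range read is protected, so no concurrent insert can slip a phantom row
-- in between the existence check and the insert.
IF NOT EXISTS (SELECT 1 FROM Bookings WHERE RoomId = 7 AND BookedOn = '2024-06-01')
    INSERT INTO Bookings (RoomId, BookedOn) VALUES (7, '2024-06-01');

COMMIT;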
I absolutely love this post. I've seen a lot of databases where it seemed that the people designing them did not realize they were making performance decisions they would later regret.

Once people feel the pain of those decisions, the database and app have often been around so long that it's too late to convince them to change, because changing the database schema means app-layer changes they don't want to make. These folks will frequently say the recommended changes can't be done "because it would be too much work." Then those same people will often ask, "Is there anything else we can do to make it perform better?"

I understand the desire for some other answer that involves less work. But sometimes there are no other answers. A house sitting on a busted foundation can only be "prettied up" so much.

#database
Database Performance Tuning Rule #8: Your schema is a performance decision you made.

What that means: every data model decision has a query cost attached to it.

Storing status as a VARCHAR instead of a smallint:
→ Every index on that column is larger.
→ Every comparison takes more CPU.
→ Trivial individually. Expensive at 500M rows.

Normalizing a user profile into six tables:
→ Every profile page load needs six joins.
→ At 10,000 users/day: fine.
→ At 10,000 users/minute: those joins become the bottleneck.

Putting created_at as a nullable column:
→ Now your time-range queries need IS NOT NULL checks.
→ The planner estimates null proportion from statistics.
→ Bad statistics → wrong plan → wrong index.

Ask these questions at schema design time:
• What is the most frequent query this table will serve?
• What is the write rate at peak?
• How will this table look at 10x current row count?
• Which columns will appear in WHERE clauses? Are they index-friendly types?
• Are there any nullable FK columns? (Each one is a potential lock incident waiting to happen in MySQL.)

Schema reviews are performance reviews. Most teams treat them as correctness reviews. Correctness is table stakes. Performance is what you'll be pinged about.

─────────────────────────────────────────────────
♻️ Repost to every engineer who designs tables.
Follow Haider Z.
📩 Real incidents, RCAs, SQL queries every week → https://lnkd.in/d3M5-pJA
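A tiny sketch of what those choices look like at DDL time. The schema is illustrative only (written in T-SQL syntax), not part of the original post:

CREATE TABLE Orders (
    Id         BIGINT       NOT NULL PRIMARY KEY,
    StatusId   SMALLINT     NOT NULL,   -- smallint, not VARCHAR: smaller indexes, cheaper comparisons
    CreatedAt  DATETIME2(3) NOT NULL,   -- NOT NULL, so time-range queries need no IS NOT NULL guard
    CustomerId BIGINT       NOT NULL    -- non-nullable FK column
);

-- Indexed for the most frequent query this table will serve (time-range reads).
CREATE NONCLUSTERED INDEX IX_Orders_CreatedAt
ON Orders (CreatedAt)
INCLUDE (StatusId);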
A quick note on when to consider NoSQL over SQL.

First, what SQL gives you:
1 - ACID guarantees (no data loss, no inconsistency), required for:
💰 Financial systems - payment processing, banking, orders, transactions
2 - Complex joins and relationships
👉 Relational modeling fits naturally
📈 Reporting systems and data analytics, not just storage
3 - Data integrity (constraints)
👉 Foreign keys, unique constraints, transactions: the database itself protects data from bad logic
4 - A strict schema, for when your data model is stable and doesn't change frequently

Now, what NoSQL gives you 🤪 — reach for it when you have:
- Millions of users and high write/read throughput (horizontal scaling)
- Frequently changing fields (no migrations every week)
- Caching (Redis), real-time data streams (logging, gaming, chats), where high performance and low latency matter more than strict consistency
- Social media likes 👍, notifications, logs (eventual consistency is acceptable)
- Big data / distributed systems

Finally, the kick-start question is "What problem am I solving?" That decides whether SQL is the right fit for the problem or not. That's it.

Reference book for this: Designing Data-Intensive Applications by Martin Kleppmann.
🚀 Indexing in SQL — The Key to Faster Queries

Today I explored one of the most important performance concepts in SQL, indexing, and it completely changed how I think about query optimization.

💡 What is Indexing?
Indexing is a technique used to speed up data retrieval by creating a data structure (like a B-Tree) that allows the database to find data quickly without scanning the entire table.

📊 Why It Matters:
• ⚡ Speeds up SELECT queries significantly
• 🔍 Enables faster searching and filtering
• 🔗 Improves performance of JOIN operations
• 📈 Essential for handling large datasets

🔑 Types of Indexes I Learned:
• Primary Index (automatically created on the primary key)
• Unique Index (ensures no duplicate values)
• Single & Composite Index
• Clustered vs Non-Clustered Index

⚠️ But It's Not Always Perfect:
• ❌ Slows down INSERT, UPDATE, DELETE
• ❌ Takes extra storage
• ❌ Too many indexes can hurt performance

📌 Key Insight: Index only those columns that are frequently used in WHERE, JOIN, ORDER BY, or GROUP BY clauses.

💭 Next step: Testing query performance with and without indexes on large datasets to see the real impact.

#SQL #Database #QueryOptimization #DataEngineering #LearningJourney #TechSkills #SoftwareEngineering #PlacementPreparation
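A small, hedged example of the composite-index idea above (table and column names are invented for illustration):

-- Composite index on the columns that appear together in WHERE,
-- with the selected column included so the base table is never touched.
CREATE NONCLUSTERED INDEX IX_Orders_Customer_Status
ON Orders (CustomerId, OrderStatus)
INCLUDE (Total);

-- This query can now be answered entirely from the index.
SELECT Total
FROM Orders
WHERE CustomerId = 1001 AND OrderStatus = 'SHIPPED';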