Shift to Modular SQL for Readable Code

Stop writing SQL for the database engine. Start writing it for the human who has to maintain it (probably you). We’ve all inherited that query. You know the one: 1,000 lines of monolithic code, nested subqueries seven levels deep, and zero comments. It runs, but modifying it feels like playing Jenga with production data. The engine doesn't care about your messy code, but your team's agility does. The shift every Data Analyst needs to make is toward Modular SQL. Modular code is readable code. Readable code is enhanceable code. Here is the blueprint for SQL that survives schema changes and business logic updates: ✅ DO: 1. Use CTEs (Common Table Expressions) to break complex logic into isolated steps. 2. Select explicit columns, never SELECT * in production. 3. Leverage Window Functions over messy self-joins. 4. Comment on WHY the logic exists, not how it works. ❌ DON'T: 1. Nest subqueries deeper than three levels. (Convert them to CTEs!) 2. Use SELECT * (protect your query from table schema evolution). 3. Perform raw date manipulation in WHERE clauses (isolate it in a CTE). 4. Adopt modular SQL. Save future-you hours of debugging. Less firefighting = More analysis. Check out the cheat sheet below. What’s the worst SQL anti-pattern you've encountered in code review? Share your pain below. 👇 #SQL #DataAnalytics #DataEngineering #CodingBestPractices #Analytics #DataScience #CareerGrowth

12 Comments

Patrick Walling 2w

I don't know that I've ever written a query for the database engine or another person tbh. I mean, it is a structured query language, isn't it? Maintain the structure and, surprise! It's written for the database engine and another person! Amazing! Seriously, it's STRUCTURED. If you understand the structure, and build it based on the structure (like there would be a way to do it otherwise).... sometimes I think I'm talking to a wall here. Please double check your work before posting (or committing code you've written because *red flag*) your 4th DON'T should actually be a DO, and you failed to mention using clear table aliases in your DOs despite it clearly being #3 on the cheat sheet. QA is sending that work back from where it came my friend. Now your whole project has fallen behind because you didn't bother reviewing your work! Anyway, aside from that the picture is solid on the benefits of the best practices it mentions.

1 Reaction

Shashank Garewal 2w

In the era of AI/LLMs assistance, the 'why' in comments seems to matter far more than the 'what' — which is what we typically write for every block, regardless of the query/code language. Small note on the post — the first two DON'Ts are already called out within the DOs themselves, feels a bit redundant. Also, DON'T #4 reads opposite of post intent — likely a typo, but as written it appears ≈ don't adopt modular SQL!

2 Reactions

Rida Khan 2w

Love this perspective Manpreet Singh! Even just correcting smaller queries, I've seen how quickly SQL can become a maze. Modular SQL and focusing on 'WHY' in comments are absolute game-changers for readability and long-term sanity. My biggest pain point is always queries that try to do way too much in one go (thats a nightmare) Thanks for sharing this blueprint!

1 Reaction

Jason Murray 2w

Writing SQL for humans at the expense of the engine increases cost and slows reporting. It is possible to do both and optimize queries while also improving security, but not for those who can't be bothered to add code comments, learn to leverage DDL, or model data for efficiency.

1 Reaction

Asmita Kaushal 2w

Writing SQL for humans instead of just the engine is such a game changer, future you will thank you every time you revisit that query 😄 Manpreet Singh

3 Reactions

Dewank Mahajan 2w

Strong point, poorly structured SQL doesn’t just slow queries, it slows teams readability is a scalability problem, not just a style choice. Manpreet Singh

2 Reactions

See more comments

To view or add a comment, sign in

More Relevant Posts

Manaswini Nagidi
4w
Report this post
🚀 𝗠𝘆 𝗦𝗤𝗟 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 𝗝𝗼𝘂𝗿𝗻𝗲𝘆 🗓️𝗗𝗮𝘆 𝟯𝟭 📌𝗧𝗼𝗽𝗶𝗰:𝗜𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻 𝘁𝗼 𝗖𝗧𝗘𝘀 (Common Table Expressions) Today, I explored 𝗖𝗧𝗘𝘀 (𝗖𝗼𝗺𝗺𝗼𝗻 𝗧𝗮𝗯𝗹𝗲 𝗘𝘅𝗽𝗿𝗲𝘀𝘀𝗶𝗼𝗻𝘀) — one of the most useful SQL features for writing clean, readable, and maintainable queries. 📌𝗪𝗵𝗮𝘁 𝗶𝘀 𝗮 𝗖𝗧𝗘? A Common Table Expression (CTE) is a temporary result set created using the "WITH" clause, which can be referenced within a query. 📌𝗪𝗵𝘆 𝘂𝘀𝗲 𝗖𝗧𝗘𝘀 𝗶𝗻𝘀𝘁𝗲𝗮𝗱 𝗼𝗳 𝘀𝘂𝗯𝗾𝘂𝗲𝗿𝗶𝗲𝘀? - Improves readability - Makes queries easier to debug and maintain - Helps organize complex logic in a step-by-step way - Can improve performance in some scenarios - Useful when the same result needs to be referenced clearly within a query 📌𝗖𝗼𝗺𝗺𝗼𝗻 𝗨𝘀𝗲 𝗖𝗮𝘀𝗲𝘀 𝗼𝗳 𝗖𝗧𝗘𝘀 - Simplifying complex queries - Replacing nested subqueries - Performing step-by-step data transformations - Working with hierarchical or recursive data ✨𝗞𝗲𝘆 𝗧𝗮𝗸𝗲𝗮𝘄𝗮𝘆 CTEs make SQL queries more structured, professional, and easier to manage, especially when dealing with complex logic #SQL #SQLLearning #CTE #CommonTableExpressions #DataAnalytics #DataAnalyst #LearningJourney #40DaysOfCode #SQLJourney
Like Comment
To view or add a comment, sign in
Sahil Alam
6d
Report this post
SQL Optimization isn't about writing less code. It's about understanding what happens AFTER you hit run. Most engineers I know can write SQL. Very few understand what it costs. Here's everything that actually matters: 1. The Query Optimizer isn't magic It builds an execution plan based on statistics. Old or missing statistics = bad plan = slow query. Update your stats. Trust the plan less. 2. SARGability is everything SARG = Search ARGument Able. If your filter can't use an index, it scans the whole table. This breaks SARGability: WHERE YEAR(created_at) = 2024 This doesn't: WHERE created_at >= '2024-01-01' AND created_at < '2025-01-01' Same result. Completely different cost. 3. Implicit conversions are silent killers ISNULL(Amount, 0) when Amount is decimal? The engine converts everything to int quietly. Your index? Ignored. 4. Execution Plans > Gut Feeling Before optimizing anything read the plan. Look for: Table Scans, Key Lookups, Sort operators. These are your cost red flags. 5. Indexes aren't free Every index you add speeds up reads. But slows down writes. Design for your actual workload. The real lesson? Writing SQL is a skill. Understanding SQL cost is a discipline. One gets the query working. The other keeps the system alive at 3AM. Which of these did nobody teach you formally?👇 Found Insightful? ♻️ Repost in your network and follow Sahil Alam for more. #SQL #DataEngineering #Analytics #Debugging #DataQuality #Learning

2 Comments
Like Comment
To view or add a comment, sign in
Rakesh D L
2w
Report this post
🚀 𝗦𝗤𝗟 𝗛𝗮𝗻𝗱𝗯𝗼𝗼𝗸 – 𝗪𝗵𝗮𝘁 𝗬𝗼𝘂 𝗥𝗲𝗮𝗹𝗹𝘆 𝗡𝗲𝗲𝗱 𝗧𝗼 𝗞𝗻𝗼𝘄 Most people learn SQL as queries… but strong SQL comes from understanding data, relationships, and logic. 🧠 𝗖𝗢𝗥𝗘 𝗖𝗢𝗡𝗖𝗘𝗣𝗧𝗦 → What data, database, and DBMS really mean → Relational vs non-relational databases → SQL as a declarative language for CRUD operations → Tables, rows, columns, datatypes, and primary keys 💻 𝗤𝗨𝗘𝗥𝗬 𝗙𝗢𝗨𝗡𝗗𝗔𝗧𝗜𝗢𝗡 → CREATE, INSERT, SELECT, UPDATE, DELETE → WHERE, comparison operators, LIKE, IN, BETWEEN → ORDER BY, DISTINCT, LIMIT, OFFSET → Aliases, expressions, and built-in SQL functions 📊 𝗔𝗚𝗚𝗥𝗘𝗚𝗔𝗧𝗜𝗢𝗡 𝗦𝗞𝗜𝗟𝗟𝗦 → COUNT, SUM, MIN, MAX, AVG → GROUP BY and HAVING logic → Filter first, then aggregate when needed → Understand NULL handling in aggregates 🔗 𝗥𝗘𝗟𝗔𝗧𝗜𝗢𝗡𝗦𝗛𝗜𝗣𝗦 & 𝗝𝗢𝗜𝗡𝗦 → ER model basics: entity, attribute, key attribute → One-to-one, one-to-many, many-to-many relationships → Natural join, inner join, left join, right join → Full join, cross join, self join, and junction tables ⚡ 𝗥𝗘𝗔𝗟 𝗪𝗢𝗥𝗟𝗗 𝗗𝗜𝗦𝗖𝗜𝗣𝗟𝗜𝗡𝗘 → Build queries step by step instead of guessing → Use views for reusable logic → Use transactions for ACID behavior → Use indexes to improve search performance 🎯 𝗪𝗛𝗔𝗧 𝗥𝗘𝗖𝗥𝗨𝗜𝗧𝗘𝗥𝗦 𝗔𝗖𝗧𝗨𝗔𝗟𝗟𝗬 𝗟𝗢𝗢𝗞 𝗙𝗢𝗥 → Can you explain query logic clearly? → Do you know when to use WHERE vs HAVING? → Can you choose the right join for the problem? → Do you understand schema design, not just syntax? This handbook is a strong SQL foundation for interviews, analytics, and real project work because it moves from basics to joins, modeling, transactions, and optimization. #SQL #Database #Joins #Aggregation #ERModel #Transactions #InterviewPrep

31 Comments
Like Comment
To view or add a comment, sign in
Luciana Machado
1mo
Report this post
Your database is not slow. Your queries are. One of the most common things I hear is: “The database is slow.” But most of the time… it isn’t. A while ago, I had to analyze a query that was taking several minutes to run. At first glance, nothing “too wrong”. But digging deeper, the pattern was clear: * Looping through records * Nested subqueries executed per row * Repeated reads over the same tables Classic row-by-row processing. So instead of trying to “tune” the query… I rewrote the approach. From this mindset: FOREACH record RUN subquery To this: WITH AggregatedData AS ( SELECT EventId, SUM(Value) AS Total FROM Items GROUP BY EventId ) SELECT e.Id, a.Total FROM Events e LEFT JOIN AggregatedData a ON a.EventId = e.Id The result: * Query time dropped from minutes to milliseconds * Massive reduction in IO * Stable performance even with data growth That’s when it becomes very clear: * SQL is not about how to iterate It’s about how to describe the result Another common issue I still see: Developers relying on ORM-generated queries without ever checking what is actually executed. ORMs are great. But the database only understands SQL. The real shift happens when you start looking at: * Execution plans * Index usage * Logical reads Because that’s where performance actually lives. The database is rarely the problem. Data access patterns are. Curious to hear: Have you ever rewritten a query and seen a massive performance gain? #SQLServer #DatabasePerformance #BackendDevelopment #SoftwareEngineering #DotNet #SystemDesign #DataEngineering

8 Comments
Like Comment
To view or add a comment, sign in
Amit Kumar Mishra
2w
Report this post
𝗚𝗼𝗼𝗱 𝗦𝗤𝗟 𝘄𝗼𝗿𝗸𝘀. 𝗖𝗹𝗲𝗮𝗻 𝗦𝗤𝗟 𝗶𝘀 𝗲𝗮𝘀𝘆 𝘁𝗼 𝗿𝗲𝗮𝗱, 𝗱𝗲𝗯𝘂𝗴, 𝗮𝗻𝗱 𝘀𝗰𝗮𝗹𝗲. When you start learning SQL, the main focus is usually getting the correct result. But in real-world projects, writing clean and readable SQL is just as important. Because your queries will be read by: • teammates • analysts • engineers • your future self Here are 4 simple practices that instantly improve your SQL quality 👇 1️⃣ Use aliases for readability Aliases make queries shorter and easier to understand. Instead of repeating long table names, use meaningful aliases. Example: SELECT u.id, u.name, SUM(o.amount) AS total_spent FROM users AS u JOIN orders AS o ON u.id = o.user_id GROUP BY u.id, u.name; 2️⃣ Format queries properly Well-formatted SQL is much easier to debug and maintain. Best practices: • Use uppercase for SQL keywords • Place each clause on a new line • Align JOIN conditions 3️⃣ Follow naming conventions Consistent naming makes databases easier to navigate. Common convention: • snake_case for tables and columns • descriptive column names Example: customer_id order_date total_amount 4️⃣ Avoid SELECT * It might feel convenient, but it can: • slow down queries • retrieve unnecessary data • break code when schema changes Better approach: SELECT order_id, order_date, total_amount FROM orders; 💡 Key takeaway Clean SQL isn't just about style — It makes your queries faster to understand, easier to maintain, and more production-ready. Small habits like these make a big difference in real data projects. Curious to know 👇 What’s one SQL habit that improved your queries the most? #SQL #DataAnalytics #LearningInPublic #SQLTips #DataAnalyticsJourney
Like Comment
To view or add a comment, sign in
MadhuKanth Kella
1mo
Report this post
SQL Journey – Day 27: Subqueries Deep Dive (Advanced Practice) Today’s focus: Understanding how subqueries work internally and how to use them effectively for real-world problem solving. This was not just theory — practiced multiple scenarios to understand execution flow and logic building. ⸻ 🔹 What I Explored Subqueries inside SELECT, WHERE: • Using subqueries to fetch intermediate results • Comparing values using nested queries • Writing conditions based on dynamic results ⸻ 🔹 Types of Subqueries Practiced ✅ Single Row Subquery • Returns one value • Used with operators (=, >, <, etc.) ✅ Multi Row Subquery • Returns multiple values • Used with IN, ANY, ALL • Executes row by row ✅ Correlated Subquery • Depends on outer query • Executes row by row ⸻ 🔹 Key Concepts Understood • Subqueries execute inside → outside • Outer query depends on inner query results • Must maintain data type compatibility • Can be nested multiple levels ⸻ 🔹 Real Practice Scenarios • Example: Finding average value using subquery • Correlated subqueries are powerful but expensive • Poor usage can impact performance • Sometimes JOINs are a better alternative ⸻ 💡 Day 27 Realization • Subqueries are not just a concept — they are a thinking pattern • They help break complex problems into smaller logical steps • Mastering them = writing smarter SQL, not longer SQL ⸻ 🔖 Hashtags #SQL #Subqueries #AdvancedSQL #DataAnalytics #LearningJourney #RDBMS #TechCSE
1 Comment
Like Comment
To view or add a comment, sign in
Swathi Chippada
4w
Report this post
🚀𝗗𝗮𝘆 𝟮𝟵 – 𝗤𝘂𝗲𝗿𝘆 𝗘𝘅𝗲𝗰𝘂𝘁𝗶𝗼𝗻 𝗙𝗹𝗼𝘄 & 𝗔𝗱𝘃𝗮𝗻𝗰𝗲𝗱 𝗙𝗶𝗹𝘁𝗲𝗿𝗶𝗻𝗴 Today I focused on something most people ignore — how SQL actually executes internally. Not just writing queries, but understanding the execution flow that decides whether your query is correct or completely wrong. 🔹 𝗖𝗼𝗿𝗲 𝗖𝗼𝗻𝗰𝗲𝗽𝘁: Query Execution Order Even though we write: SELECT → FROM → WHERE → GROUP BY → HAVING → ORDER BY 𝗔𝗰𝘁𝘂𝗮𝗹 𝗲𝘅𝗲𝗰𝘂𝘁𝗶𝗼𝗻 𝗵𝗮𝗽𝗽𝗲𝗻𝘀 𝗮𝘀: FROM → WHERE → GROUP BY → HAVING → SELECT → ORDER BY That one shift changes everything. If you don’t get this, you’ll keep making mistakes and won’t even know why. 🔹 𝗪𝗛𝗘𝗥𝗘 𝘃𝘀 𝗛𝗔𝗩𝗜𝗡𝗚 (𝗖𝗹𝗲𝗮𝗿 𝗗𝗶𝗳𝗳𝗲𝗿𝗲𝗻𝗰𝗲) • WHERE → filters raw data before grouping • HAVING → filters aggregated data after GROUP BY 𝗦𝗶𝗺𝗽𝗹𝗲 𝗿𝘂𝗹𝗲: If you’re filtering before calculation → WHERE If you’re filtering after calculation → HAVING 🔹 𝗪𝗵𝗮𝘁 𝗜 𝗣𝗿𝗮𝗰𝘁𝗶𝗰𝗲𝗱 • Combining WHERE with subqueries • Using HAVING with aggregation • Nested filtering for real-world scenarios 🔹 𝗞𝗲𝘆 𝗥𝗲𝗮𝗹𝗶𝘇𝗮𝘁𝗶𝗼𝗻 Most SQL mistakes are not syntax errors — they come from misunderstanding execution order. Once you understand how the database thinks, debugging becomes much easier. 𝘊𝘰𝘯𝘴𝘪𝘴𝘵𝘦𝘯𝘤𝘺 𝘤𝘰𝘯𝘵𝘪𝘯𝘶𝘦𝘴. 𝘖𝘯𝘦 𝘥𝘢𝘺 𝘢𝘵 𝘢 𝘵𝘪𝘮𝘦. #𝗦𝗤𝗟 #𝗔𝗱𝘃𝗮𝗻𝗰𝗲𝗱𝗦𝗤𝗟 #𝗗𝗮𝘁𝗮𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 #𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴𝗝𝗼𝘂𝗿𝗻𝗲𝘆
Like Comment
To view or add a comment, sign in
VIVEK SAIRAM
1w
Report this post
𝐃𝐞𝐜𝐫𝐲𝐩𝐭𝐢𝐧𝐠 𝐒𝐐𝐋 𝐞𝐱𝐞𝐜𝐮𝐭𝐢𝐨𝐧: 𝐇𝐨𝐰 𝐭𝐡𝐞 𝐝𝐚𝐭𝐚𝐛𝐚𝐬𝐞 𝐀𝐂𝐓𝐔𝐀𝐋𝐋𝐘 𝐰𝐨𝐫𝐤𝐬. We all write SQL queries in this "Coding Order": SELECT ... FROM ... WHERE ... GROUP BY ... HAVING ... ORDER BY ... TOP ... (or LIMIT) It feels intuitive, right? We start with what we want (SELECT), then where to get it, and how to filter it. But here's the thing: This is NOT how the SQL database engine executes it. If you want to write optimized queries, you must understand the "Execution Order." It’s fundamentally different. The database engine's logical process is: 1. FROM - First, it needs the data source. 2. WHERE - Then, it filters the base rows (before any grouping). 3. GROUP BY - It groups the remaining rows. 4. HAVING - It filters those groups (not individual rows). 5. SELECT - Finally, it calculates the specific expressions (and aggregates like SUM). 6. ORDER BY - Then, it sorts the result set. 7. TOP / LIMIT - And last, it truncates the final, sorted result. Knowing this order is a game-changer. It explains why you can't use a column alias defined in the SELECT clause within your WHERE clause the WHERE is processed before the SELECT even knows about the alias. Check out this visualization I created that maps the Coding Order (how we write it) to the Execution Order (how the DB processes it), step-by-step. Understanding this will help you: 💡 Write logically sound queries. 💡 Debug performance issues. 💡 Stop making common SQL mistakes . Do you write your queries based on the execution order, or do you still think in coding order? Let me know in the comments! #SQL #Database #Performance #DataScience #Coding #CareerGrowth #Learning #SQLQuery #DataAnalysis
Like Comment
To view or add a comment, sign in
Irfan Ullah
1w
Report this post
Stop jumping between SQL topics Follow a clear path This roadmap shows how to go from zero to advanced SQL ⬇️ Step 1 Basics • What is SQL • Tables and databases • Data types and NULL • CRUD operations • DDL vs DML ⬇️ Step 2 Queries • SELECT • WHERE with AND OR NOT • ORDER BY • GROUP BY • LIMIT and DISTINCT ⬇️ Step 3 Functions • COUNT SUM AVG MIN MAX • UPPER LOWER CONCAT • Date functions • COALESCE ⬇️ Step 4 Joins • INNER • LEFT • RIGHT • FULL • SELF • CROSS ⬇️ Step 5 Subqueries • SELECT FROM WHERE • Correlated queries ⬇️ Step 6 Constraints • PRIMARY KEY • FOREIGN KEY • UNIQUE • NOT NULL • CHECK ⬇️ Step 7 Indexes and views • Index basics • Performance tradeoffs • Views ⬇️ Step 8 Normalization • 1NF 2NF 3NF • Remove redundancy • When to denormalize ⬇️ Step 9 Transactions • BEGIN COMMIT ROLLBACK • ACID • Isolation levels ⬇️ Step 10 Advanced SQL • Window functions • CTEs • Stored procedures • Triggers ⬇️ Step 11 Practice • Build projects • Prepare for interviews • Optimize queries Rule Learn then apply immediately #SQL #DataAnalytics #LearnSQL #Database #ProgrammingValley
Like Comment
To view or add a comment, sign in
Korupolu Siri
3w
Report this post
🚀 SQL Journey – Day 32: Recursive CTE (Hierarchical Queries) Today’s focus was on Recursive CTEs, one of the most powerful SQL concepts used to work with hierarchical or repeating data. 🔹 What is a Recursive CTE? A Recursive CTE is a CTE that calls itself repeatedly to process hierarchical or sequential data. 👉 Used when data has parent-child relationships 🔹 Basic Structure WITH cte_name AS ( -- Anchor Query (starting point) SELECT ... UNION ALL -- Recursive Query (calls itself) SELECT ... FROM table t JOIN cte_name c ON condition ) SELECT * FROM cte_name; 🔹 Key Components ✔ Anchor Query → Starting rows ✔ Recursive Query → Repeats logic ✔ UNION ALL → Combines results ✔ Stops when no new rows are returned 🔹 Where is it Used? • Employee → Manager hierarchy • Category → Subcategory structure • Organizational charts • Tree-like data traversal 🔹 Concept Understanding (From Today’s Notes) Recursive CTE works step-by-step: 1️⃣ Start with base data (Anchor) 2️⃣ Use result to fetch next level 3️⃣ Repeat until condition fails 👉 Like traversing a tree or graph 🔹 Important Rules • Must use UNION ALL (not UNION) • Recursive part must reference CTE name • Be careful with infinite loops • Can control depth using conditions 🔹 Interview Insight 💡 If a problem involves: • Hierarchy • Levels • Parent-child relationships 👉 Think Recursive CTE immediately 💡 Day 32 Realization Recursive CTE is not just SQL — it’s logic + iteration inside queries. Once you understand this, you can solve complex hierarchical problems easily. SQL is getting deeper. Thinking is getting sharper. HAPPY LEARNING!✨ #SQL #CTE #RecursiveCTE #DataAnalytics #LearningJourney #SQLPractice #RDBMS #TechJourney #CSE
Like Comment
To view or add a comment, sign in

1,746 followers

92 Posts

View Profile Follow

Shift to Modular SQL for Readable Code

More Relevant Posts

Explore related topics

Explore content categories