Nikita Tarasov-Gaevskii’s Post

WHERE and HAVING, NOT IN and NOT EXISTS do similar things in SQL, but when should you use which? WHERE and HAVING: The main difference WHERE → works on individual rows. It is applied before GROUP BY — it filters the ‘raw’ data. HAVING → works with the results of aggregate functions (COUNT, SUM, AVG, …), in simpler terms, with groups. It is applied after GROUP BY — filtering already aggregated groups. When to use: WHERE → whenever possible (faster, reduces the volume of data in advance) HAVING → only when filtering by aggregates or by GROUP BY results is required Together, when WHERE comes before GROUP BY and HAVING comes after. NOT IN and NOT EXISTS: The main difference NOT IN → comparison with a list. Checks that the value is not in the list. But there is a critical nuance: if there is at least one NULL in the subquery, the result may be empty. The reason is the peculiarities of SQL’s three-valued logic (TRUE / FALSE / UNKNOWN). May be less efficient on large datasets. NOT EXISTS → checking for the existence of rows Checks that no row exists that satisfies the condition. Not affected by NULL values. Often optimises better (especially with indexes). When to use: Use NOT EXISTS by default if there is a subquery, NULL values are possible, and reliability is important. You can use NOT IN if you are 100% certain there are no NULL values and the list of values is simple. #Java #Backend #Developer #JavaDevelompent #Software #Programming #SQL #Relational #Database #DB #PostgreSQL #Oracle #MySQL #MariaDB

4 Comments

Carsten Saastamoinen-Jakobsen 3w

If it is possible to learn to use NOT EXISTS instead of NOT IN, it should also be possible to learn to add the condition 'WHERE ReturnColumn IS NOT NULL' when NOT IN is used. In SQL Server, both constructs are changed to INNER JOIN by the compiler.

1 Reaction

Jeff Moden 3w

Nicely done especially on the NULL thing with NOT IN. Most people don't even bring that up.

2 Reactions

Dhruval Vaishnav 3w

Great breakdown. especially calling out the NULL behavior in NOT IN, that’s something many overlook. 👍 Clear, practical, and directly applicable in real-world queries.

3 Reactions

See more comments

To view or add a comment, sign in

More Relevant Posts

Ibraheem Tuffaha 🥛
6d
Report this post
Today I learned you don't need to branch your code for single-item IN clauses I assumed that splitting the query would be more optimized: one path when the list has a single ID, another path when it has many Turns out that's not needed In ActiveRecord, "where(column: value)" works whether you pass a single item or an array And if the array has a single element, Rails itself collapses it to "=" in Ruby before any SQL is generated, so the database never even sees an "IN" with one value And even if it did, every major engine treats "WHERE column IN (1)" and "WHERE column = 1" as the same predicate. Same plan, same execution Postgres, MySQL, SQLite, ClickHouse, SQL Server, Oracle.. all of them So write one method, pass it whatever you have, and let the database do its job. Your code stays simple, the query stays fast The real edge case isn't single-value IN, it's how databases handle parameterized IN lists. But for everyday ActiveRecord code? Just use "where" and move on
Like Comment
To view or add a comment, sign in
PRASANNA KOTKAR
2w
Report this post
🚀 Optimized a High-Volume Bulk Update Workflow in .NET + MySQL Recently worked on improving a large-scale bulk update workflow where thousands of records are processed in parallel across multiple execution lines. 🔍 Challenge The existing design relied on a single staging table for all transactions, which introduced: concurrency issues during parallel execution table truncation conflicts fixed stored procedure dependencies inaccurate updated row counts in dynamic mySQL execution scaling challenges as new lines were introduced ✅ What I Implemented Redesigned the workflow into a database-driven dynamic staging table architecture: ✔️ Physical staging table mapping from master configuration ✔️ One transaction-specific staging table per execution line ✔️ Bulk load using MySqlBulkCopy ✔️ Dynamic stored procedure joins using configurable table names ✔️ Fixed MySQL ROW_COUNT() issue by capturing it immediately after EXECUTE ✔️ Timezone-safe timestamp updates ✔️ Regex validation for dynamic table names ✔️ Transaction rollback support ✔️ New line onboarding through DB config only (no code deployment) ⚡ Outcome Significant performance gain in bulk updates Safe parallel transaction processing No staging table collision Accurate affected row tracking Highly scalable architecture for future expansion A good example of how database-driven staging design can greatly improve concurrency and maintainability in enterprise systems. #dotnet #mysql #csharp #database #storedprocedure #performance #softwareengineering #backenddevelopment #optimization
Like Comment
To view or add a comment, sign in
Rajasekar Su
2w
Report this post
Most .NET devs overthink SQL Server integer types. 🤔 tinyint, smallint, int, bigint — picking the "right" one feels like good engineering. It usually isn't. Here's what actually happens 👇 The advice sounds reasonable: "Use the smallest type that fits the domain." tinyint for status. smallint for lookup IDs. Save those bytes. The problem isn't the theory. It's what SQL Server does quietly underneath. When your FK column type doesn't exactly match the PK it references, SQL Server inserts a 𝗖𝗢𝗡𝗩𝗘𝗥𝗧_𝗜𝗠𝗣𝗟𝗜𝗖𝗜𝗧 in the execution plan. Your index becomes useless. A seek turns into a scan. And your EF Core query looks completely fine. 👨💻 𝗘𝘅𝗮𝗺𝗽𝗹𝗲: ❌ Type mismatch — hidden scan: -- Departments.Id → int (PK) -- Users.DepartmentId → smallint (FK) SELECT u.Name, d.Name FROM Users u INNER JOIN Departments d ON d.Id = u.DepartmentId -- CONVERT_IMPLICIT on DepartmentId -- Index on DepartmentId ignored → table scan ✅ Same types — index works correctly: -- Departments.Id → int (PK) -- Users.DepartmentId → int (FK) SELECT u.Name, d.Name FROM Users u INNER JOIN Departments d ON d.Id = u.DepartmentId -- No conversion. Index seek. Fast. This bug won't throw an error. It won't fail a unit test. It shows up only when you open the execution plan — or when the table grows. 😬 And the storage difference that caused all this? 𝘀𝗺𝗮𝗹𝗹𝗶𝗻𝘁 𝘃𝘀 𝗶𝗻𝘁 = 2 bytes per row. 🔎 𝗠𝘆 𝗮𝗰𝘁𝘂𝗮𝗹 𝗿𝘂𝗹𝗲: 𝗶𝗻𝘁 → default for all IDs and FKs 𝗯𝗶𝗴𝗶𝗻𝘁 → audit logs, events, high-churn tables 𝗯𝗶𝘁 → booleans 𝗱𝗲𝗰𝗶𝗺𝗮𝗹 → money, precise values Everything else? Question whether you need it. Always match your FK type to the PK it references. That single rule avoids more performance bugs than any type of micro-optimization. ♻️ Repost if this will help someone on your team. 💬 Have you ever caught a CONVERT_IMPLICIT in your execution plans?
Like Comment
To view or add a comment, sign in
Batool Hammoud
3w
Report this post
I just wrapped up a performance tuning marathon where we took a routine preview of 68 loan installments from a painful 12.5 seconds down to just 362ms (random test case). The "villain" wasn't a missing index or a lack of RAM—it was the way Hibernate and the Oracle Optimizer were talking to each other. Here’s the breakdown for my fellow devs: 1️⃣ The "Lazy" N+1 Killer: We found that a simple Java .filter() on a list was triggering 68 individual hidden queries. Even if each query takes 100ms, your users are waiting 7 seconds before they see a single byte. Lesson: Always use JOIN FETCH or DTOs for metadata lookups. 2️⃣ When Indexes are ignored: We had the right index, but Oracle stayed stubbornly slow. Why? Complex Window Functions (ROW_NUMBER()) inside Views can sometimes prevent "Predicate Pushdown," forcing the DB to scan millions of rows before filtering for your 68 IDs. 3️⃣ The "Merge" Strategy: The breakthrough came when we stopped asking for the "Penalty Data" separately. By merging the logic into the main fetch as a Correlated Scalar Subquery, we forced the database into a "Nested Loop" lookup. 4️⃣ The Results: ❌ Before: 12,551ms (User walks away for coffee) ✅ After: 362ms (Instant response) Performance tuning isn't just about adding indexes—it's about understanding the "conversation" between your application and your data. #Java #SpringBoot #Oracle #SQL #DatabasePerformance #BackendDevelopment #SoftwareEngineering #Hibernate #developer #Damascus

10 Comments
Like Comment
To view or add a comment, sign in
Abinaya V
4d
Report this post
Learned something new about INSERT Today I solved MCQs on CodeChef and learned something unexpected. I thought INSERT syntax was strict. Turns out, it's flexible. All of these are correct for integer columns: INSERT INTO subscribers(subscriber_id, first_name, last_name, course_id) VALUES ('13', 'Ella', 'Williams', '12'); INSERT INTO subscribers(subscriber_id, first_name, last_name, course_id) VALUES (13, 'Ella', 'Williams', '12'); INSERT INTO subscribers(subscriber_id, first_name, last_name, course_id) VALUES ('13', 'Ella', 'Williams', 12); INSERT INTO subscribers(subscriber_id, first_name, last_name, course_id) VALUES (13, 'Ella', 'Williams', 12); Why? Because many SQL databases automatically convert string numbers to integers when the column expects INT. Single quotes around integers? Works. No quotes? Works. Mixed? Works. The database handles the conversion internally. Where this works: - MySQL - PostgreSQL - SQLite Where this does NOT work (strict): - SQL Server - Oracle Best practice: No quotes for integers. Quotes only for text. This works everywhere and keeps your code clean and portable. Small lesson. But important.
Like Comment
To view or add a comment, sign in
Dhiraj Kumar
2w
Report this post
Stop Googling "How to do a LEFT JOIN in MySQL" every single day. 🛑 I used to waste hours digging through pages of database documentation just to find the exact syntax for a subquery or a window function. So, here is the ultimat MySQL Cheat Sheet. As a software developer, whether I'm architecting the database for a new startup feature or optimizing queries for performance, these are the commands I actually use. No fluff, just the real-world syntax. I’ve attached the full high-res PDF below. It covers: 🟢 Database & Table Operations: (Create, Alter, Keys, Indexes) 🟡 Advanced CRUD & Aggregations: (Group By, Having, Offset) 🔴 Joins & Subqueries: (Inner, Left, Correlated, Exists) 🟣 Conditional Logic: (Case, Coalesce, Window Functions) ⚙️ Admin & Backups: (User management & mysqldump) 💡 Pro-Tip before you swipe: Always run EXPLAIN to understand your queries before they hit production, and always back up before you DROP DATABASE. Flip through the document below and save this post for your next backend interview or late-night debugging session. 📌 What is the one SQL command you still have to Google no matter how many times you use it? Let me know in the comments! 👇 #MySQL #BackendDevelopment #DatabaseDesign #SoftwareEngineering #InterviewPrep #dhirajkumar #sde #coer #kimblylabs

2 Comments
Like Comment
To view or add a comment, sign in
Mounika Thouda
1w Edited
Report this post
🤯 𝙋𝙧𝙚𝙥𝙖𝙧𝙚𝙙𝙎𝙩𝙖𝙩𝙚𝙢𝙚𝙣𝙩 𝙞𝙣 𝙅𝘿𝘽𝘾 — 𝙒𝙝𝙮 𝙬𝙖𝙨 𝙞𝙩 𝙞𝙣𝙩𝙧𝙤𝙙𝙪𝙘𝙚𝙙 ? We already have Statement in JDBC Then why do we need PreparedStatement? Let’s understand the real reason 👇 Problem with Statement 📃 𝐖𝐡𝐞𝐧 𝐰𝐞 𝐮𝐬𝐞 "𝐒𝐭𝐚𝐭𝐞𝐦𝐞𝐧𝐭" ? Statement st = con.createStatement(); st.executeQuery("SELECT * FROM users WHERE id = " + id); ❌ Query is created every time ❌ Compilation happens again and again ❌ Vulnerable to SQL Injection ⚠️ 𝐖𝐡𝐲 𝐏𝐫𝐞𝐩𝐚𝐫𝐞𝐝𝐒𝐭𝐚𝐭𝐞𝐦𝐞𝐧𝐭 𝐢𝐬 𝐢𝐧𝐭𝐫𝐨𝐝𝐮𝐜𝐞𝐝? ➡️ To solve these problems: ✔ Improve performance ✔ Prevent SQL Injection ✔ Handle dynamic values safely ⚙️ 𝐇𝐨𝐰 𝐏𝐫𝐞𝐩𝐚𝐫𝐞𝐝𝐒𝐭𝐚𝐭𝐞𝐦𝐞𝐧𝐭 𝐰𝐨𝐫𝐤𝐬 PreparedStatement ps = con.prepareStatement( "SELECT * FROM users WHERE id = ?" ); ps.setInt(1, id); ResultSet rs = ps.executeQuery(); 🔍 What happens internally? ➡️ Query is precompiled once ➡️ Only values are replaced at runtime ✔ Faster execution ✔ Safer queries 🔐 🔐 SQL Injection Prevention ❌ Statement: SELECT * FROM users WHERE id = '1 OR 1=1' Can break your database SELECT * FROM users WHERE id = '1 OR 1=1' Can break your database 📝 PreparedStatement: Treats input as data, not SQL ➡️ Prevents injection ✅ 💭 One Line Summary 🖇️PreparedStatement = Precompiled + Secure + Efficient Have you used PreparedStatement in your projects? 🤔 Or still using Statement? Let’s discuss 👇💬 #Java #JDBC #PreparedStatement #JavaDeveloper #BackendDevelopment #Programming #TechJourney #LearnBySharing #Database #InterviewPrep
Like Comment
To view or add a comment, sign in
Bhanurekha Chintha
1w
Report this post
🚀 Day 7 of MySQL Journey Today’s focus: Core SQL Concepts (Before LIKE Operator) 🔹 Execution Order → FROM → WHERE → SELECT 🔹 Comparison Operators → > < = != 🔹 Logical Operators → AND | OR | NOT 🔹 Arithmetic Operations → Real-time calculations (Discounts 💸) 🔹 BETWEEN & IN → Handling ranges & multiple values 🔹 DISTINCT → Removing duplicates 🔹 IS NULL / IS NOT NULL → Handling missing data 🔹 SELECT Basics & Aliasing 💡 Practiced writing queries using real-time product tables 💡 Understood how SQL actually executes behind the scenes Consistency matters 💯 Day 7 done — getting stronger step by step. #MySQL #SQL #Database #LearningJourney #Consistency #BackendDevelopment #FullStackJava
Like Comment
To view or add a comment, sign in
Alekhya Bandu
1w
Report this post
Every day, I used to open a SQL application just to access and manage my database. 🤔One day, I asked myself: “Why am I doing it this way? Can’t I access it directly through a browser (URL)? That simple question pushed me to explore and learn something new. During my research, I discovered *phpMyAdmin* — a web-based tool that allows you to manage databases easily through a browser. It completely changed my approach. Now, instead of relying on installed applications, I can access and manage my database anytime, anywhere using just a URL. 💡 Lesson learned: Sometimes, asking the right question can lead to smarter and more efficient solutions. #MySQL #phpMyAdmin #QA #LearningJourney
Like Comment
To view or add a comment, sign in
Vivek Patel
3w
Report this post
🚀 5 SQL Tips That Improved My Backend Performance As a Backend Developer working with SQL Server, I realized that writing queries is easy… but writing optimized queries is a skill. Here are 5 SQL tips that helped me improve performance: ✔ Use SELECT only required columns (avoid SELECT *) ✔ Add proper indexes on frequently used columns ✔ Use JOINs wisely instead of nested queries ✔ Avoid unnecessary subqueries ✔ Always analyze execution plan Small improvements in queries can make a huge difference in application performance ⚡ Still learning and exploring better ways every day! 👉 What’s your favorite SQL optimization tip? #sqlserver #backenddeveloper #database #performance #optimization #dotnet #webdevelopment #coding #softwaredeveloper
Like Comment
To view or add a comment, sign in

1,240 followers

9 Posts

View Profile Connect

Nikita Tarasov-Gaevskii’s Post

More Relevant Posts

Explore content categories