Dinesh Kumar’s Post

🚀 Day 2/30 — SQL + Python Deep Dive Subqueries (Correlated vs Non-Correlated) 👉 Basics are done. Pipelines are built. 👉 Now we go deeper — into how SQL really executes and how Python scales. You’ve used subqueries before… But do you know: 👉 how they actually run? 👉 why some are slow? 🔹 What is a Subquery? 👉 A query inside another query 🔹 1. Non-Correlated Subquery 👉 Runs once, result is reused SELECT name FROM employees WHERE salary > ( SELECT AVG(salary) FROM employees ); 👉 Inner query runs once → outer query uses result 🔹 2. Correlated Subquery 👉 Runs for each row of outer query SELECT name FROM employees e1 WHERE salary > ( SELECT AVG(salary) FROM employees e2 WHERE e1.department = e2.department ); 👉 Inner query runs again & again 😐 🔹 Key Difference Non-correlated → runs once ⚡ Correlated → runs per row 🐢 🔹 Why This Matters Big impact on performance Correlated queries can be slow Often replaced using JOIN or CTE 🔹 Real Insight 👉 If your query is slow… 👉 check if it’s a correlated subquery 💡 Quick Summary Subqueries are powerful… But execution matters more than syntax 💡 Something to remember Same logic… Different execution → different performance. #SQL #Python #DataEngineering #LearningInPublic #TechLearning

To view or add a comment, sign in

More Relevant Posts

Preethi Ravula
3w
Report this post
Most analysts know SQL. Most analysts know Python. Very few know how to combine them efficiently. That’s why many stay average. Here are a few things I wish I learned earlier: In SQL: → WHERE cannot filter aggregated results If you're filtering grouped data, use HAVING. → Window functions save messy subqueries Use RANK(), ROW_NUMBER(), SUM() OVER() for ranking and running totals. → LAG() and LEAD() beat self-joins Comparing current vs previous period? One line does what multiple joins often can’t. In Python: → Do not load unnecessary data Filter in SQL before bringing it into pandas. → Avoid for loops in pandas Vectorized operations and apply functions are significantly faster. → Stop hardcoding dates Use datetime so your scripts stay dynamic and reusable. The real power comes when you combine both: → Pull data with SQL → Transform it in Python → Push results back with to_sql() That workflow alone will make you more efficient than most analysts around you. Knowing SQL or Python is useful. Knowing how to use both together is what separates strong analysts from average ones. #DataAnalytics #SQL #Python #AnalyticsEngineering #CareerGrowth

4 Comments
Like Comment
To view or add a comment, sign in
Tejasvi Parate
2w
Report this post
🚀 **SQL vs Python: Data Cleaning Cheat Sheet** Data cleaning is one of the most important steps in any data workflow. I came across this simple yet powerful cheat sheet that compares how to handle common data issues using both SQL and Python (Pandas). From handling missing values and duplicates to formatting data and detecting outliers — this visual makes it easy to understand both approaches side by side. 📌 A great quick reference for anyone working in Data Analytics or Data Engineering. 💡 Clean data = better insights = smarter decisions. #DataCleaning #SQL #Python #Pandas #DataAnalytics #DataEngineering #Learning #DataScience
Like Comment
To view or add a comment, sign in
Ratnajit Chakraborty
2w
Report this post
🚀 Time Series Analysis in SQL & Python — Real-World Challenges & Solutions Time series calculations in SQL can be surprisingly frustrating… At first, it feels simple — but once you start working on real business problems, things get tricky: When to use > vs >= Defining last 7 days vs last week correctly Identifying users who haven’t ordered in the last 30 days Rolling vs calendar-based calculations Even a small mistake in date logic can completely change your insights. While working with product and sales teams, I came across multiple such scenarios where accurate time-based logic was critical for decision-making. 👉 To organize my learning, I’ve created a small project where I’ve documented: Practical SQL time-based problems Clear and correct approaches Python (Pandas) validation using Jupyter Notebook 📂 I’ve shared: SQL queries Jupyter Notebook A quick reference guide on my GitHub: 👉 https://lnkd.in/gn5kg-xh I’ll continue adding more real-world tasks as I come across them while working on different use cases. 👉 Follow me for more practical tasks and insights like this. #SQL #Python #DataAnalytics #TimeSeries #DataScience #BusinessAnalytics #LearningInPublic #Analytics
Like Comment
To view or add a comment, sign in
Chiemela Chilaka
4w
Report this post
Most people rush into Python for data analysis… But skip the foundation that actually makes them effective. This is where many get stuck. Before writing a single line of Python, ask yourself: Can you confidently work with data in SQL? Because these 6 concepts are not optional — they are the building blocks of real analysis: ✔ Joins – Can you combine datasets correctly? ✔ Aggregations – Can you summarize data meaningfully? ✔ Window Functions – Can you analyze trends over time? ✔ Subqueries & CTEs – Can you break down complex logic? ✔ Data Cleaning – Can you trust your data? ✔ Filtering Logic – Can you extract the right insights? Here’s the truth 👇 Python doesn’t replace these skills… it amplifies them. If your SQL foundation is weak, your Python analysis will also be weak. But if you master these? You don’t just analyze data — you think like a data professional. 💡 The real question is: Are you learning tools… or building analytical thinking? #DataAnalytics #SQL #Python #DataSkills #LearningJourney #AnalyticsMindset
1 Comment
Like Comment
To view or add a comment, sign in
Eduardo Brandão
3w
Report this post
Why does SQL feel harder than Python? 🤔 → Because it forces you to deal with reality. In Python/R: • Data is often already shaped • You focus mostly on analysis 🛠️📦 In SQL: • Data is fragmented across tables • You have to rebuild it before analyzing 🧩 And more importantly: → You see how your query impacts performance⚡💸 → You think about joins, structure, and efficiency → You start asking the right questions (more business-driven💼) That’s exactly what makes SQL so valuable in industry. It doesn’t just help you analyze data; it helps you understand how data is structured, how systems work, and how to think closer to real business problems. #DataAnalytics #DataScience #SQL #Python #BusinessIntelligence #DataAnalyst #DataScientist #Analytics #DataCareers
Like Comment
To view or add a comment, sign in
Udhaya Kumar
3w
Report this post
📊 Mastering DataFrame selection in Pandas! From basic indexing to advanced filtering techniques, this guide helps you efficiently extract and manipulate data with ease. A valuable resource for anyone working with data in Python. 🚀 https://lnkd.in/dP_Wwm-r #Python #Pandas #DataScience #Analytics #MachineLearning

Mastering DataFrame Selection in Pandas: A Complete Guide medium.com
Like Comment
To view or add a comment, sign in
Sathiyanarayanan A
2w
Report this post
Week 14(notes) Python Pandas Essentials for Data Analysis ✨ 🐍 Python + Pandas = Powerful Data Analysis some fundamental Pandas operations that every data analyst should know: 📌 1. View First Rows Use head() to display the first 5 rows of a dataset. df.head() 📌 2. View Last Rows Use tail() to display the last 5 rows. df.tail() 📌 3. Statistical Summary Get quick insights like count, mean, std, min, max using: df.describe() 📌 4. Select Single Column df['Name'] 📌 5. Select Multiple Columns df[['Name', 'Age']] 📌 6. Add New Column df['Salary'] = df['Age'] * 1000 📌 7. Basic Filtering Filter rows based on a condition: df[df['Age'] > 25] 💡 Pandas makes data cleaning and analysis fast, simple, and efficient. #Python #Pandas #DataAnalysis #Data #Aspiring #LinkedInLearning #100DaysOfCode #Analytics #CareerTransition #Techdatacommunity #LearningJourney.
Like Comment
To view or add a comment, sign in
Sahina Rayeesa
1w
Report this post
🧠 Python Concept: dataclasses (Clean Data Models) Write less boilerplate code 😎 ❌ Traditional Class class User: def __init__(self, name, age): self.name = name self.age = age def __repr__(self): return f"User(name={self.name}, age={self.age})" 👉 More boilerplate 👉 Repetitive code ✅ Pythonic Way (dataclass) from dataclasses import dataclass @dataclass class User: name: str age: int 👉 Automatically generates: __init__ __repr__ __eq__ 🧒 Simple Explanation Think of it like a shortcut ➡️ You define data ➡️ Python builds the rest 💡 Why This Matters ✔ Cleaner code ✔ Less boilerplate ✔ Easier to maintain ✔ Used in real-world apps ⚡ Bonus Example @dataclass class User: name: str age: int = 18 👉 Default values supported 😎 🧠 Real-World Use ✨ API models ✨ Config objects ✨ Data handling 🐍 Write less code 🐍 Let Python do the work #Python #AdvancedPython #CleanCode #SoftwareEngineering #BackendDevelopment #Programming #DeveloperLife
Like Comment
To view or add a comment, sign in
Rajesh P
3w
Report this post
Working with Python and SQL together — a few things that made a difference for me In most projects, SQL handles data well, and Python helps in controlling the flow and processing around it. While working with both, a few patterns consistently worked better. 🔹 Always push filtering to SQL Instead of fetching everything and filtering in Python: rows = cursor.execute("SELECT * FROM orders") filtered = [row for row in rows if row["status"] == "COMPLETE"] Better to push it into SQL: SELECT * FROM orders WHERE status = 'COMPLETE'; 🔹 Use parameterized queries Avoid building queries using string formatting: query = f"SELECT * FROM emp WHERE emp_id = {emp_id}" Use bind variables instead: cursor.execute( "SELECT * FROM emp WHERE emp_id = :1", [emp_id] ) 🔹 Fetch data in manageable batches Instead of loading everything at once: rows = cursor.fetchall() Fetch in batches: rows = cursor.fetchmany(1000) 🔹 Let SQL handle data, Python handle flow cursor.execute("SELECT dept_id, COUNT(*) FROM emp GROUP BY dept_id") for row in cursor: process(row) SQL does aggregation, Python handles the next step. 💡 What worked for me Using Python and SQL together is less about replacing one with the other, and more about letting each do what it does best. Curious to know — how do you usually split work between SQL and Python in your projects? #Python #SQL #DataEngineering #OracleSQL #DatabaseDevelopment #CodingPractices
Like Comment
To view or add a comment, sign in
Real Python

206,520 followers
1w
Report this post
🐍📰 pandas GroupBy: Your Guide to Grouping Data in Python In this tutorial, you'll learn to work adeptly with the pandas GroupBy facility while mastering ways to manipulate, transform, and summarize data with real-world datasets. #python

pandas GroupBy: Your Guide to Grouping Data in Python realpython.com
Like Comment
To view or add a comment, sign in

72 followers

65 Posts

View Profile Connect

Dinesh Kumar’s Post

More Relevant Posts

Explore content categories