How to Use SQL Window Functions

Explore top LinkedIn content from expert professionals.

Summary

SQL window functions are a powerful feature that let you perform calculations across multiple rows while keeping each row visible, making them essential for data analysis and reporting. These functions help solve common analytics challenges like ranking, running totals, and comparing values over time—without complex joins or subqueries.

Use running totals: Apply window functions like SUM() OVER() to calculate cumulative metrics for sales or revenue without losing row-level details.
Compare periods: Utilize LAG() and LEAD() to analyze changes from one period to the next, such as month-over-month or day-over-day growth.
Rank and deduplicate: Assign ranks or unique numbers to rows within groups using ROW_NUMBER(), RANK(), and DENSE_RANK() to simplify leaderboards and remove duplicates.

Summarized by AI based on LinkedIn member posts

Poornachandra Kongara

Data Analyst | SQL, Python, Tableau | $100K+ Revenue Impact & 50% Efficiency Gains through ETL Pipelines & Analytics

20,376 followers 3mo
Report this post
𝗬𝗼𝘂 𝗹𝗲𝗮𝗿𝗻 𝗦𝗤𝗟. Then you hit your first analytics problem. Suddenly, window functions are the only thing that make sense. Because real analytics isn’t about rows, it’s about context. And window functions give you exactly that. Here’s how they show up in real business problems: 𝗥𝘂𝗻𝗻𝗶𝗻𝗴 𝗧𝗼𝘁𝗮𝗹𝘀 Track cumulative revenue, orders, or growth without losing row detail. Functions: SUM() OVER 𝗠𝗼𝗻𝘁𝗵-𝗼𝘃𝗲𝗿-𝗠𝗼𝗻𝘁𝗵 𝗚𝗿𝗼𝘄𝘁𝗵 Compare current values with previous periods directly in SQL. Functions: LAG() 𝗣𝗲𝗿𝗶𝗼𝗱-𝗼𝘃𝗲𝗿-𝗣𝗲𝗿𝗶𝗼𝗱 𝗖𝗼𝗺𝗽𝗮𝗿𝗶𝘀𝗼𝗻 See weekly, monthly, or quarterly changes without collapsing rows. Functions: LAG(), LEAD() 𝗥𝗮𝗻𝗸𝗶𝗻𝗴 & 𝗟𝗲𝗮𝗱𝗲𝗿𝗯𝗼𝗮𝗿𝗱𝘀 Find top customers, products, or regions with proper ranking logic. Functions: RANK(), DENSE_RANK(), ROW_NUMBER() 𝗖𝘂𝘀𝘁𝗼𝗺𝗲𝗿 / 𝗘𝗻𝘁𝗶𝘁𝘆 𝗦𝗲𝗴𝗺𝗲𝗻𝘁𝗮𝘁𝗶𝗼𝗻 Run separate calculations for regions, categories, or user groups. Functions: PARTITION BY 𝗧𝗶𝗺𝗲 𝗕𝗲𝘁𝘄𝗲𝗲𝗻 𝗘𝘃𝗲𝗻𝘁𝘀 Measure gaps between user actions - logins, clicks, purchases. Functions: LAG() 𝗠𝗼𝘃𝗶𝗻𝗴 𝗔𝘃𝗲𝗿𝗮𝗴𝗲𝘀 Smooth noisy data to spot real trends. Functions: AVG() OVER (ROWS BETWEEN …) 𝗗𝗲𝗱𝘂𝗽𝗹𝗶𝗰𝗮𝘁𝗶𝗼𝗻 𝗟𝗼𝗴𝗶𝗰 Pick the correct record when duplicates sneak in. Functions: ROW_NUMBER() 𝗣𝗲𝗿𝗰𝗲𝗻𝘁𝗮𝗴𝗲 𝗖𝗼𝗻𝘁𝗿𝗶𝗯𝘂𝘁𝗶𝗼𝗻 See each row’s contribution to the whole or a subtotal. Functions: SUM() OVER () 𝗙𝘂𝗻𝗻𝗲𝗹 𝗗𝗿𝗼𝗽-𝗢𝗳𝗳 𝗔𝗻𝗮𝗹𝘆𝘀𝗶𝘀 Understand where users fall off across stages. Functions: ROW_NUMBER(), LAG() 𝗧𝗼𝗽-𝗡 𝗽𝗲𝗿 𝗚𝗿𝗼𝘂𝗽 Top 3 sellers per category. Top 5 customers per region. Functions: ROW_NUMBER() OVER (PARTITION BY … ORDER BY …) 𝗥𝗼𝗹𝗹𝗶𝗻𝗴 𝗠𝗲𝘁𝗿𝗶𝗰𝘀 Track rolling sums, averages, and trends over time. Functions: SUM() OVER (ROWS BETWEEN …) 𝗦𝗹𝗼𝘄𝗹𝘆 𝗖𝗵𝗮𝗻𝗴𝗶𝗻𝗴 𝗗𝗶𝗺𝗲𝗻𝘀𝗶𝗼𝗻 𝗗𝗲𝘁𝗲𝗰𝘁𝗶𝗼𝗻 Spot attribute changes over time for the same entity. Functions: LAG() 𝗙𝗶𝗿𝘀𝘁 & 𝗟𝗮𝘀𝘁 𝗩𝗮𝗹𝘂𝗲 𝗔𝗻𝗮𝗹𝘆𝘀𝗶𝘀 Find the first purchase, last login, or most recent status. Functions: FIRST_VALUE(), LAST_VALUE() 𝗠𝗲𝗱𝗶𝗮𝗻 & 𝗣𝗲𝗿𝗰𝗲𝗻𝘁𝗶𝗹𝗲 𝗔𝗻𝗮𝗹𝘆𝘀𝗶𝘀 Understand distributions without distorting detail. Functions: PERCENTILE_CONT() 𝗔𝘂𝗱𝗶𝘁 & 𝗖𝗵𝗮𝗻𝗴𝗲 𝗧𝗿𝗮𝗰𝗸𝗶𝗻𝗴 See how values changed across consecutive records. Functions: LAG(), LEAD() 𝗔𝗻𝗼𝗺𝗮𝗹𝘆 𝗗𝗲𝘁𝗲𝗰𝘁𝗶𝗼𝗻 Detect sudden spikes or drops in metrics. Functions: AVG() OVER, LAG() Window functions aren’t advanced SQL, they’re essential analytics. If you want to think like a data analyst or analytics engineer, mastering them is non-negotiable.
No more previous content

No more next content
31 Comments
Like Comment
Farizat Tabora

Microsoft MVP | Maximizing Efficiency in Business Processes with Excel and AI

3,998 followers 2w
Report this post
Everyone says "learn SQL." Nobody gives you the actual cheat sheet. I analyzed 50+ analyst job descriptions and real queries from production dashboards. The same 10 functions appeared in almost every single one. Here they are. Syntax included. Copy and use today. 🟢 FILTERING ① WHERE SELECT * FROM orders WHERE status = 'completed' Filter rows before aggregation. This is your first instinct for any query. ② HAVING SELECT city, COUNT(*) FROM users GROUP BY city HAVING COUNT(*) > 100 Filter AFTER aggregation. WHERE filters rows. HAVING filters groups. Know the difference. ③ CASE WHEN SELECT order_id, CASE WHEN amount > 1000 THEN 'high' WHEN amount > 100 THEN 'medium' ELSE 'low' END AS tier FROM orders IF/ELSE logic inside SQL. Use it to create categories, flags, labels on the fly. 🟢 AGGREGATION ④ GROUP BY + COUNT / SUM / AVG SELECT region, COUNT(*) AS users, AVG(revenue) AS avg_rev FROM sales GROUP BY region The foundation of every report. Group rows → calculate metrics per group. ⑤ DISTINCT SELECT COUNT(DISTINCT customer_id) FROM orders Count unique values. Without DISTINCT you count duplicates and your numbers are wrong. 🟢 JOINS ⑥ LEFT JOIN SELECT o.order_id, c.name FROM orders o LEFT JOIN customers c ON o.customer_id = c.id Keep ALL rows from left table. Match what you can from right. Unmatched → NULL. This is 80% of all joins you will ever write. ⑦ INNER JOIN SELECT p.name, s.quantity FROM products p INNER JOIN stock s ON p.id = s.product_id Keep ONLY rows that match in BOTH tables. Use when you need strict matches with no NULLs. 🟢 WINDOW FUNCTIONS ⑧ ROW_NUMBER() SELECT *, ROW_NUMBER() OVER (PARTITION BY user_id ORDER BY created_at DESC) AS rn FROM orders Number rows within each group. Filter WHERE rn = 1 to get the latest order per user. Use this daily. ⑨ SUM() OVER / AVG() OVER SELECT date, revenue, SUM(revenue) OVER (ORDER BY date) AS running_total FROM daily_sales Running totals and moving averages without GROUP BY. Your row-level data stays intact. ⑩ LAG() / LEAD() SELECT date, revenue, LAG(revenue) OVER (ORDER BY date) AS prev_day FROM daily_sales Access the previous or next row's value. Calculate day-over-day change in one line. 📌 QUERY TEMPLATE (covers 90% of tasks): SELECT dimension, COUNT(*) AS cnt, SUM(metric) AS total FROM table_a a LEFT JOIN table_b b ON a.id = b.a_id WHERE a.date >= '2024-01-01' GROUP BY dimension HAVING COUNT(*) > 10 ORDER BY total DESC LIMIT 20 This single template handles: → Segmented reports → Top-N analysis → Filtered aggregations → Multi-table analytics The learning path: → Week 1: ①–③ (filtering) → Week 2: ④–⑤ (aggregation) → Week 3: ⑥–⑦ (joins) → Week 4: ⑧–⑩ (window functions) 4 weeks. 10 functions. 90% of analyst SQL covered. Save this cheat sheet. Share with someone starting their analytics journey. 👇 #sql #dataanalytics #analytics #cheatsheet #career #datascience #programming
No more previous content

No more next content
1 Comment
Like Comment
Adarsh Reddy

Sr. Big Data Engineer | Specializing in Cloud Data Platforms (AWS, Azure, Fabric, GCP) | PySpark, Databricks, Snowflake, Kafka, ETL, Data Warehousing, Data Visualization, CI/CD & DevOps

2,462 followers 1mo
Report this post
Stop overcomplicating your SQL. 🛑 If you’re still using self-joins and messy subqueries to compare rows or calculate trends, you’re working harder, not smarter. The gap between a "SQL beginner" and a "Data Pro" is usually one thing: Window Functions. These functions allow you to perform calculations across a set of rows that are related to the current row—without collapsing them into a single output like a GROUP BY does. Here is the "Cheat Sheet" for the functions that actually move the needle in interviews and real-world projects: 🛠 The Power Players 🔹 ROW_NUMBER() → Your go-to for deduplication. Assigns a unique ID to every row. 🔹 RANK() vs. DENSE_RANK() → Essential for leaderboards. RANK() leaves gaps (1, 1, 3), while DENSE_RANK() keeps it tight (1, 1, 2). 🔹 LAG() & LEAD() → The "Time Travelers." Pull data from the previous or next row to calculate Month-over-Month growth effortlessly. 🔹 SUM() OVER() → Create running totals and cumulative sums in a single line of code. 💡 Why this changes the game: 1️⃣ Readability: Your code goes from 50 lines of nested logic to 10 lines of clean, declarative SQL. 2️⃣ Performance: Most modern engines optimize window functions better than complex self-joins. 3️⃣ Interview Gold: Almost every Senior Data Analyst or Data Engineer interview will test your ability to use PARTITION BY. ✅ Practical Tip: Next time you need to compare "Current Month Sales" vs "Last Month Sales," don't join the table to itself. Use LAG(sales) OVER (ORDER BY month). Master these, and you stop being a "user" and start being an "architect" of data. 🚀 Which Window Function saved you the most time this week? Let's discuss in the comments! 👇 #SQL #DataEngineering #DataAnalytics #DataScience #CodingTips #CareerGrowth #TechSkills #Database
Like Comment
Arpit Bhatnagar

Senior Manager, Data Science

9,535 followers 8mo
Report this post
SQL is easy to learn, but hard to master. The bar keeps getting higher each year in data science and analytics interviews. Here are 8 powerful SQL window functions that go beyond the basics 👇 1️⃣ LEAD() 📦 Use Case: Get next day's sales SELECT date, amount AS today_amount, LEAD(amount) OVER (ORDER BY date) AS next_day_amount FROM Sales; 2️⃣ LAG() 📦 Use Case: Compare with previous day SELECT date, amount AS today_amount, LAG(amount) OVER (ORDER BY date) AS previous_day_amount FROM Sales; 3️⃣ ROWS BETWEEN 1 PRECEDING AND CURRENT ROW 📦 Use Case: Rolling sum over 2 days SELECT id, value, SUM(value) OVER ( ORDER BY id ROWS BETWEEN 1 PRECEDING AND CURRENT ROW ) AS sum_prev_and_current FROM Numbers; 4️⃣ ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW 📦 Use Case: Running total SELECT id, value, SUM(value) OVER ( ORDER BY id ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW ) AS running_total FROM Numbers; 5️⃣ PERCENT_RANK() 📦 Use Case: Percentile rank of test scores SELECT student_id, score, PERCENT_RANK() OVER (ORDER BY score) AS percentile_rank FROM Scores; 6️⃣ ROW_NUMBER() 📦 Use Case: Deduplicate users by latest record SELECT *, ROW_NUMBER() OVER (PARTITION BY email ORDER BY created_at DESC) AS rn FROM Users; 7️⃣ RANK() 📦 Use Case: Find salary rank (with gaps for ties) SELECT *, RANK() OVER (PARTITION BY department_id ORDER BY salary DESC) AS salary_rank FROM Employees; 8️⃣ DENSE_RANK() 📦 Use Case: Top 3 salaries per department (no gaps) SELECT * FROM ( SELECT *, DENSE_RANK() OVER (PARTITION BY department_id ORDER BY salary DESC) AS rnk FROM Employees ) sub WHERE rnk <= 3; Comment if you found this helpful. Reach out via DM if you need help with stats. Repost if you think someone in your network should read this. #SQL #DataAnalytics #DataScience #WindowFunctions #InterviewPrep #LearningByDoing

1 Comment
Like Comment
Nishant Kumar

Data Engineer @ IBM | AWS · Spark · Kafka · PySpark · Airflow | RAG · LLMs · GenAI | Event-Driven Data Platforms | 110K DE Community

113,194 followers 7mo
Report this post
This little-known SQL Window function trick replaced 5 JOINs in my query — and recruiters love it in interviews. I’ll be honest: early in my career, I was obsessed with JOINs. Need ranking? → JOIN. Need running totals? → JOIN. Need first/last values? → JOIN. The problem? My queries got slower, messier, and nearly impossible to debug. Then I started using Window Functions — and everything changed. Here’s how they saved me: ➊ Ranking & Deduplication → Instead of joining on a subquery, I used ROW_NUMBER(), DENSE_RANK() over partitions. → Clean, efficient, and no messy joins. ➋ Running Totals / Moving Averages → SUM() OVER(ORDER BY date) gave me rolling totals instantly. → No need for multiple self-joins. ➌ First & Last Records → FIRST_VALUE() and LAST_VALUE() cut down entire subqueries. → Perfect for event-based data like logins or transactions. The impact? → One query went from 200+ lines (with nested joins) to 40 lines. → Execution time dropped by ~70%. → In interviews, whenever I bring up Window Functions, recruiters nod. They know it’s the mark of someone who writes scalable SQL. Join the group: https://lnkd.in/giE3e9yH - 𝐌𝐨𝐜𝐤 𝐈𝐧𝐭𝐞𝐫𝐯𝐢𝐞𝐰𝐬 𝐟𝐨𝐫 𝐃𝐚𝐭𝐚 𝐄𝐧𝐠𝐢𝐧𝐞𝐞𝐫𝐬: https://lnkd.in/g8Pqypt5 - 𝐈𝐧𝐭𝐞𝐫𝐯𝐢𝐞𝐰 𝐩𝐫𝐞𝐩 & 𝐏𝐫𝐨𝐯𝐞𝐧 𝐓𝐢𝐩𝐬: https://lnkd.in/gUEVYCGy - 𝐑𝐞𝐬𝐮𝐦𝐞 𝐑𝐞𝐯𝐢𝐞𝐰 𝐚𝐧𝐝 𝐎𝐩𝐭𝐢𝐦𝐢𝐳𝐚𝐭𝐢𝐨𝐧: https://lnkd.in/gp3yZsfW Follow for more 👋
No more previous content

No more next content
16 Comments
Like Comment
Sumit Gupta

Data & AI Creator | EB1A | GDE | International Speaker | Ex-Notion, Snowflake, Dropbox | Brand Partnerships

42,089 followers 3mo
Report this post
You think you know SQL… until window functions show up. That’s where most queries start to break. Because once the questions become: – What changed over time? – Who’s improving faster? – What’s the latest state per user? – Where did the trend shift? GROUP BY isn’t enough. This Guide is a cheat sheet of the Top 30 SQL Window Functions that analysts, data engineers, and interviewers actually expect you to know. It covers: • Ranking logic (ROW_NUMBER, RANK, DENSE_RANK) • Time-based comparisons (LAG, LEAD) • Running totals and rolling metrics • Percentiles, variance, and distributions • ROWS vs RANGE frames • QUALIFY for cleaner queries • Real analytics patterns used in production The key shift: Window functions don’t summarize data. They help you analyze it without losing context. If SQL is part of your job or your next role, this belongs in your toolkit. Save it. Practice it. And stop flattening insights too early.

49 Comments
Like Comment
Dawn Choo

Data Scientist (ex-Meta, ex-Amazon)

194,275 followers 1y
Report this post
I almost failed a Google SQL interview. Because I didn't know Window Functions. Even though I had learned about Window Functions... They never "clicked" for me. Because I couldn't grok their real-world application. So don't make the same mistake as me. Here are 5 key window functions & their applications: 𝟭/ 𝗥𝗢𝗪_𝗡𝗨𝗠𝗕𝗘𝗥() ROW_NUMBER() assigns a sequential integer to each row within a partition. 𝗙𝗼𝗿 𝗲𝘅𝗮𝗺𝗽𝗹𝗲: We can use ROW_NUMBER() to assign a unique identifier to each transaction per customer. This allows for easy tracking and referencing of transactions within a customer's history. 𝟮/ 𝗥𝗔𝗡𝗞() RANK() assigns rankings within a partition of a result set, leaving gaps in the ranking when there are ties. 𝗙𝗼𝗿 𝗲𝘅𝗮𝗺𝗽𝗹𝗲: In a e-commerce company, RANK() can be used to rank products by sales volume. We can use this to identify top-selling items within categories. 𝟯/ 𝗟𝗔𝗚() 𝗮𝗻𝗱 𝗟𝗘𝗔𝗗() LAG() accesses data from previous rows, while LEAD() accesses data from subsequent rows within a partition. 𝗙𝗼𝗿 𝗲𝘅𝗮𝗺𝗽𝗹𝗲: We can use LAG() to calculate month-over-month changes in revenue. This allows for easy tracking of growth trends and identification of significant changes. 𝟰/ 𝗙𝗜𝗥𝗦𝗧_𝗩𝗔𝗟𝗨𝗘() 𝗮𝗻𝗱 𝗟𝗔𝗦𝗧_𝗩𝗔𝗟𝗨𝗘() FIRST_VALUE() returns the first value in an ordered partition, while LAST_VALUE() returns the last value. 𝗙𝗼𝗿 𝗲𝘅𝗮𝗺𝗽𝗹𝗲: In analyzing stock prices, FIRST_VALUE() can be used to compare daily stock prices to the price at month's start, so we can measure price changes relative to the month's opening price. 𝟱/ 𝗦𝗨𝗠(), 𝗖𝗢𝗨𝗡𝗧() 𝗮𝗻𝗱 𝗔𝗩𝗚() These aggregate functions, when used with OVER(), allow for running calculations within a window. They're useful for computing cumulative totals, moving averages, or other rolling calculations. 𝗙𝗼𝗿 𝗲𝘅𝗮𝗺𝗽𝗹𝗲: In a analytics system, these functions can be used to calculate a 7-day moving average of daily active users (DAU), to smooth out daily fluctuations and identify trends in user engagement. ——— 𝗪𝗮𝗻𝘁 𝘁𝗼 𝘂𝘀𝗲 𝗪𝗶𝗻𝗱𝗼𝘄 𝗙𝘂𝗻𝗰𝘁𝗶𝗼𝗻𝘀 𝗼𝗻 𝗿𝗲𝗮𝗹 𝗯𝘂𝘀𝗶𝗻𝗲𝘀𝘀 𝗽𝗿𝗼𝗯𝗹𝗲𝗺𝘀? We got you. Check out these questions on Interview Master! • Window Functions about Creators on Meta: https://lnkd.in/g3Rt_tcH • Window Functions related to Amazon Sellers: https://lnkd.in/gic9TseR • Window Functions on Microsoft Windows Updates: https://lnkd.in/gCbSpZ9i • Window Functions and Google Play store: https://lnkd.in/gajf_u2q • Window Functions on LinkedIn Skills Endorsements: https://lnkd.in/gExPn9bb ——— ♻️ Found this useful? Repost it so others can see it too!

83 Comments
Like Comment
Sravya Kariyavula

Data Analyst | EX-Data Analyst at IBM | Python | SQL | Power BI | Tableau | Microsoft Excel | AWS | KPI Dashboards | Snowflake | A/B Testing | Health Care, Finance | Data Modeling | Turning Data into Decisions

3,215 followers 6mo
Report this post
Mastering SQL Window Functions — A Game-Changer for Data Analysis If you’re working with SQL and haven’t explored window functions yet, you’re missing out on one of the most powerful tools for data analysis and reporting. Unlike regular aggregate functions that collapse rows, window functions allow you to perform calculations across rows while keeping them intact. This makes them incredibly useful for ranking, running totals, moving averages, and comparing rows. Popular Window Functions: - ROW_NUMBER() — Assigns a unique number to rows - RANK() / DENSE_RANK() — Great for leaderboards or sorting by value - LAG() / LEAD() — Look back or ahead at other rows - SUM() OVER() — Running totals without grouping - AVG() OVER(PARTITION BY ...) — Group-wise calculations while retaining row-level detail Use Cases: - Sales trend comparisons month over month - Identifying top N products within each category - Calculating customer lifetime value over time - Creating dynamic cohort analyses Pro Tip: Window functions always work with OVER() — that’s where the magic happens! Whether you’re a Data Analyst, Engineer, or Scientist, mastering window functions will enhance your SQL skills.
No more previous content

No more next content
8 Comments
Like Comment
Dane Wade

Author at DataCeps

1,888 followers 3mo
Report this post
Window functions are where SQL stops being “query language” and starts being a modeling language. They don’t just calculate. They declare how you believe the world works. A window has three levers: Partition. Order. Frame. Most people learn the functions. Senior analysts learn the levers. Because every real use of window functions is one of these moves: 1) Declare identity PARTITION BY is you saying: “these rows belong to the same entity.” If that choice is wrong, everything downstream is beautifully wrong. 2) Declare time ORDER BY is you choosing the timeline you trust. Event time vs load time vs update time isn’t a technical detail. It’s the difference between behavior and pipeline artifacts. 3) Declare context The frame is the boundaries of what a row is allowed to “know.” Default frames are silent assumptions. Silent assumptions are where metrics go to drift. That’s why window functions are so powerful in production analytics: They let you answer questions that GROUP BY can’t touch: “What changed, compared to last time?” “What’s the running state as of each event?” “Who are the top performers within each cohort?” “What does ‘normal’ look like inside this segment?” And they do it without collapsing the data. That last part matters. In most organizations, the biggest reporting mistakes come from collapsing too early, then trying to reconstruct nuance later with joins and filters. Window functions keep the grain intact. They let you compute and keep the row-level truth visible. If you want a mental model: GROUP BY summarizes. Window functions contextualize. Summaries are for reporting. Context is for decisions. Want to learn SQL without overwhelm? Start with FREE 7-Day SQL Blueprint. Link In The Top Comment 👇🏻🔗✅
No more previous content

No more next content
3 Comments
Like Comment

How to Use SQL Window Functions

Summary

More in SQL Skills for Data Roles

Explore categories