How is data stored in SQL databases?

Kaushal Pahwani

Published Dec 11, 2022

Today we are going to take a look at how a SQL database stores the data, this understanding can be really helpful in analyzing database queries.

At a logical level, the data is stored in the form of rows and columns but at the physical level, it is stored in the form of data pages which are often of defined size. In a SQL server by default, the pages are 8KB and stored in a tree-like structure called B-Tree or Indexed B-Tree or Clustered index structure.

The nodes at the bottom of the B-Tree are called the leaf nodes storing the actual table data. Let's assume we have a Student table indexed at the id column and for the sake of simplicity consider 100 students' data sum up for a data page of 8 KB. The Student table data indexed at the id column will be stored in the database as below.

Recommended by LinkedIn

What is Index in Mssql

Lâm Quang Vinh 1 year ago

A Layman's Guide to Handling NULL Values in SQL

Sana Farooqui 2 years ago

Book review: Effective SQL: 61 Specific Ways to write…

Ken Ho 3 years ago

The node at the top of the tree is called the root node and the nodes between the root node and the leaf nodes are called the intermediate levels. The number of intermediate levels depends upon the number of data rows in the underlying table. The root and the intermediate nodes contain the index nodes implying it will either contain the pointer to an intermediate-level page or a data row in the leaf node. This tree-like structure helps the database engine retrieve the data quickly.

The above diagram also shows the path followed by the database to retrieve data when queried upon the indexed column for a specific student id. But how about when we run a query on the non-indexed columns the engine will have to search all the data rows which can be very inefficient. Here non-clustered indexes come to play.

Unlike clustered indexes, in non-clustered indexes, we have key values at the root and intermediate levels and row locators at the bottom level. When queried on a non-clustered column both non-clustered and clustered index work together to output the result efficiently.

How is data stored in SQL databases?

Kaushal Pahwani

Recommended by LinkedIn

Scoop of Software Engineering

487 followers

More articles by Kaushal Pahwani

Others also viewed

Azure SQL Temporal tables Database and What is a system-versioned temporal table?

SQL Server Data Type Conversion Methods and performance comparison

PART 2-SQL Server Table Management: A DDL Command Reference

SQL Data Cleaning

What elements of SQL do data scientists need to know?

Step-by-Step Guide to Creating SQL Hierarchical Queries

SQL Queries

How to extract data using SQL Server and BCP command into multiple files iterating from a table!

Connecting to SQL Database in Fabric: A Step-by-Step Guide

SQL Query Optimization: Hierarchical Queries

Explore content categories

Recommended by LinkedIn

Scoop of Software Engineering

487 followers

More articles by Kaushal Pahwani

A helper to choose the right database?

An overview of databases

Others also viewed

Azure SQL Temporal tables Database and What is a system-versioned temporal table?

SQL Server Data Type Conversion Methods and performance comparison

PART 2-SQL Server Table Management: A DDL Command Reference

SQL Data Cleaning

What elements of SQL do data scientists need to know?

Step-by-Step Guide to Creating SQL Hierarchical Queries

SQL Queries

How to extract data using SQL Server and BCP command into multiple files iterating from a table!

Connecting to SQL Database in Fabric: A Step-by-Step Guide

SQL Query Optimization: Hierarchical Queries

Explore content categories