Does agentic systems benefit from normalization?

Vito Chin

Published Apr 11, 2026

I’d been building many versions of agentic systems, in various environments (Agent Builder, Copilot Studio, Microsoft Foundry, GitHub Copilot) with various front-end “sockets”, i.e. Microsoft Teams, Microsoft 365 Copilot, Visual Studio Code. MCP and the CLI too.

All this in the pursuit of giving the best experience to the users of these agentic systems.

The reason for all these experimentations, frameworks, tools and platforms is really because of the diversity of questions / prompts / use-cases that the “system” is expected to cater for.

Software development, or shall I call it systems creation, these days can no longer cuddle in the comfort of a defined front-end, responding to the finite possibilities of a “form”.

One instance of a question that I was trying to solve for came from Richard Koh . In our work with partners, the prompt was:

“Top 5 partners in terms of technical capabilities for these 8 specializations [redacted] for each Area in the Asia region”

The other parts of this prompt justify the agentic part of this approach (which I might do another post on!) but the “for each Area” part is what I want to focus on in this post.

Area from a database design point of view is best as a 3NF normalized table. But while I was wrapping a MCP-ed sub-agent around this, I wasn’t sure if this is best for agentic systems, so of course, asked Copilot.

The answer turns out is a tour of the architectural difference between traditional OLTP database and agentic systems. Here’s the conversation verbatim:

Q: Does agentic systems benefit from normalization or is it better not to normalize?

Copilot:

Ah — this is where classical database theory and agentic/LLM system design start to diverge.

Short answer:

✅ Transactional agent systems benefit from normalization ✅ Reasoning‑heavy agent systems often work better with intentional denormalization 🚫 Fully normalized schemas are usually NOT optimal for agent orchestration layers

🧠 Traditional DB Goal vs Agentic System Goal

Normalization helps databases.

Agents operate on:

vector retrieval
context assembly
graph traversal
semantic chunking

Which behave very differently from relational algebra.

📌 Why Highly Normalized Schemas Hurt Agents

Let's say you modelled this in perfect 3NF:

Partner

--------

PartnerID

PartnerName

AreaID

 

Area

--------

AreaID

AreaName

Now a user asks your agent:

"Which ASEAN partners are Kubernetes specialized?"

The agent must now:

Understand AreaName = ASEAN
Resolve AreaID
Join Partner → Area
Possibly join Specialization table

For a SQL engine? ✅ trivial

For an LLM‑grounded reasoning agent?

You’ve just created:

retrieval fragmentation
multi-hop semantic dependency
join planning requirements
symbolic mapping tasks inside the prompt space

Which leads to:

hallucinated joins
missed lookup resolution
tool over-calling
larger context windows needed
grounding loss

📉 The Agent Cost of Normalization

In agentic pipelines:

Agents reason better over:

✅ self-contained entity descriptions

Instead of:

SQL

{

"PartnerID": 101,

"AreaID": 3

}

Prefer:

JSON

{

"PartnerName": "ABC",

"Area": "ASEAN",

"Specializations": ["AKS", "AI Platform"]

}

📌 Agentic Design Pattern Emerging in 2025–2026

Modern agentic architectures now use:

✅ Hybrid Modeling

Bottom Line and Example

In short, when creating relational data layers for AI, we can think about designing the tables to make it easier for AI to understand.

Table (schema) design makes a difference.

For example, if I went with the full 3NF design shown above, with Partner and Area tables separate, GPT-4.1 was not able intuit this properly…

Naturally, a more costly model tried harder and was able to produce results:

Claude Sonnet 4.6 is 3x the cost of GPT-4.1 on GitHub Copilot though, which perfectly exemplifies the case for optimizing for AI’s understanding, i.e. to produce more accurate results with lower cost. If we have area as text within the Partners table, no doubt GPT-4.1 would had been able to find it in the first pass.

Summary (and what happens in practice)

This has been an interesting diversion, exploration and thinking session.

So, what happened to what I was trying to solve for, i.e. the modelling around the partner ecosystem?

Well, my eventual schema is 3NF 😊

Why?

Because, well, I already cut the data into that shape and imported it into my database already.

I reckon my time and the amount of time my agent is going to be used with Claude Sonnet 4.6 or better models in tow is still better optimized that way…

And all this comes around to my favorite principle of respecting the diversity of situations that are there in the world.

There’s always the ideal architecture or perfect whatever, but for what context?

Does agentic systems benefit from normalization?

Vito Chin

Recommended by LinkedIn

Bottom Line and Example

Summary (and what happens in practice)

Vito's Tech Kitchen

422 followers

More articles by Vito Chin

Others also viewed

Ruby Script for ICM InfoWorks and SWMM Networks to make any number of sides Polygons for Subcatchments

The .NET Insider – Issue #7 ✨

Step-by-Step Guide to Create a Timer Trigger Azure Function

Advancements in .NET 8, .NET 9, and .NET 10: Features, Changes, and Best Practices for Developers

21 - Documenting Power BI bookmarks: From PBIR JSON to clear documentation

The .NET Insider – Issue 19

Automated Testing U-SQL In Visual Studio

Using Rest Api with AX 2012

FactoryTalk Optix Widget Development

Mastering GraphQL with HotChocolate in C#: A Comprehensive Tutorial

Explore content categories

Recommended by LinkedIn

Bottom Line and Example

Summary (and what happens in practice)

Vito's Tech Kitchen

422 followers

More articles by Vito Chin

Why Visual Studio Code Release Notes are so exciting these days

Flow and constraints

Using GitHub Copilot for data analysis

Bank statement vision with GPT-4o & Tesseract

From cow paths to cobbled road: A Pythonic data journey

Quantum musings with LLMs and Mermaid: Bridging text to visual diagrams

Happy days with Copilot

GPT-4, my AI, my thinking companion

Benefits of Argo CD’s declarative approach to GitOps via state synching

Fun with Cloudflare Zero Trust

Others also viewed

Ruby Script for ICM InfoWorks and SWMM Networks to make any number of sides Polygons for Subcatchments

The .NET Insider – Issue #7 ✨

Step-by-Step Guide to Create a Timer Trigger Azure Function

Advancements in .NET 8, .NET 9, and .NET 10: Features, Changes, and Best Practices for Developers

21 - Documenting Power BI bookmarks: From PBIR JSON to clear documentation

The .NET Insider – Issue 19

Automated Testing U-SQL In Visual Studio

Using Rest Api with AX 2012

FactoryTalk Optix Widget Development

Mastering GraphQL with HotChocolate in C#: A Comprehensive Tutorial

Explore content categories