Why Agent Observability !== Agentic Analytics
Agents were the buzz of 2025 (and probably the hot topic of this year too). As businesses start to contemplate integrating them into their operations, onsite customer-facing agents are poised to be the next interaction layer between businesses and consumers.
While in the past customers may have interacted with your website/mobile app while pondering your business proposition, businesses are now starting to prototype customer-facing agents hosted on their own digital estate to better retain control of their experience, optimise the customer journey, help solve customer frustrations, and potentially nudge customers to spend some money.
These customer-facing agents are now an abstraction layer between your customer and you as a business, becoming a new customer touchpoint and with that, opens a world of new possibilities and challenges.
What are agents?
So let's start off with defining what an agent actually is. Like all definitions in the world, everyone has their own definition of an agent, but there are some common themes.
Google defines AI agents as "software systems that use AI to pursue goals and complete tasks on behalf of users. They show reasoning, planning, and memory and have a level of autonomy to make decisions, learn, and adapt".
AWS on the other hand defines AI agents as "software programs that can interact with their environment, collect data, and use that data to perform self-directed tasks that meet predetermined goals. Humans set goals, but an AI agent independently chooses the best actions it needs to perform to achieve those goals."
Last but not least, Anthropic defines agents as systems where LLMs dynamically direct their own processes and tool usage, maintaining control over how they accomplish tasks.
So in short, agents are called agents because they have the agency to decide how best to achieve a predefined goal on behalf of users independently, without having to follow a strict one-directional workflow.
The current challenge
The current biggest challenge for businesses is working out how to move agents from a PoC phase and prototype environment into a real-world environment filled with unexpected edge cases and how to build guardrails to ensure that customer-facing agents perform as expected. The industry is currently flooded with platforms, frameworks and different approaches on how best to build and deploy agents (which are all important topics).
The less talked about discussion however is what happens after you have deployed an agent once it's out there in the wild. You probably want to know if the agent is working as designed, outputs are expected and that it hasn't just gone off the rails and given your customer a massive discount that is now eating into your profit margin. You also want to ensure there is governance and oversight into how an agent performs to ensure there are no compliance risks and that the customer experience is not adversely impacted. This all sounds reasonable right?
What can be used to solve this challenge
There are already quite a few mature and sophisticated agent observability platforms and tools like LangSmith and Langfuse which are designed to help monitor and optimise agent performance. These platforms do what they say on the box, they allow you to observe how your agent is doing.
Recommended by LinkedIn
Now if we go back to the start, agents are becoming an additional middle abstraction layer between the customer and you as the business. So do these Agent Observability tools allow you to understand your customer intent, their current sentiment and what they are actually looking to do right here and right now? Short answer No. These Agent Observability tools are designed to help you monitor the end-to-end behaviours of your agentic system, such as what LLMs or external tools the agents access.
Fundamentally agent observability is about "tracing" and evaluating the agent's performance by looking at its internal state and telemetry data to better understand how its operating. Common dimensions and metrics returned from these tools include request logs, prompts used, model's responses, token usage, reply latency and any external tools used.
Here is an example of what these structured logs look like from Langfuse:
While these observability tools definitely have their place in the world, they are designed for developers, ML engineers and data scientists to build better agents. What they lack however is providing an avenue to let the business team tell stories about their customers and how wonderfully awesome their agent(s) have been in influencing the customer journey.
Some things change, some things don't…
For those who have been around the neighbourhood for a while, you probably know what digital analytics is. Digital analytics is the process of collecting, measuring, analysing and interpreting data from across your digital estate to understand user behaviour so you can achieve business goals so that ultimately you can call yourselves "data-driven".
Before the explosion of the modern digital analytics platforms today, how did businesses try to understand their customers?
They did this by analysing log files on a web server and this is where the concept of page views and sessions were first introduced. The issue with log analysis is while it tells you how requests reach your website, they don't really tell you much about how a customer is interacting with the website content itself and lacks granularity when it comes to better understanding customer behaviour. In the same vein that you wouldn't use logs to do digital analytics, then agent observability tools are probably not the best-suited for understanding customer interactions with agents and so maybe a new paradigm is needed.
How the new paradigm of "Agentic Analytics" could look like
Let's call this new paradigm "Agentic Analytics" (please don't call it AA) and the purpose of Agentic Analytics is not necessarily to determine how performant an agent is but more about understanding how interacting with the agent is influencing the brand perception by the customer and whether it is helping to improve or deteriorate their current experience.
And to be honest, no one really knows how Agentic Analytics should be implemented or even work in principle - best practices here are still to be determined. My hypothesis however is that Agentic Analytics will have some of the following core themes (with probably more added as things evolve):
All in all, nothing is set in stone yet and things are moving fast in this space! More to come...