The problem with A.I. and Documents

Alen Burger

Published Jan 29, 2019

Adobe’s VP Engineering for Document Cloud, Phil Ydens, estimates there may be up to 2.5 trillion PDF documents in the world. (2015-2016)

Can voice assistants such as Siri and Alexa help us declutter our lives by understanding our documents?

The history of humanity is etched in billions of documents. The footprints of our lives manifest in a bread crumb trail of paper, in digital and physical form. As part of our lives, we generate and receive an unprecedented volume of documents.

Technology has delivered us practical solutions to digitize many of these statements-of-record. Humanity has made incredible strides digitizing our history. A Google search will show just how far. There is barely a subject or topic, current or historical that cannot be found with a few keystrokes.

A.I. has given us voice-powered assistants. Alexa and Siri are powerful assistants that can answer our questions and control our homes. Voice assistants allow us to speak instead of type to get an answer. The rub is that this only works for question and answer scenarios and not when you want to go really in depth into a topic.

As far as digital assistants have come, they cannot effectively help us with our documents. Sure, they can read an email or document out aloud. As much you may enjoy the dulcet tones of Alexa, nobody wants to spend time being read a long document, unless this a strategic move to cure insomnia. It is still easier to read the document yourself and let our non-artificial intelligence engine, our brain, do the processing. Your brain can make sense and understand a document in seconds and guide you to the relevant parts.

As simple as documents are for people to consume, A.I. still does not understand the meaning of documents. Strides have been made in Text Summarization, but they still operate at a superficial level and do not intrinsically understand the content.

The challenge is to use machine learning, not to just read documents verbatim but to understand the document. Voice assistants should not only be able to give a quick summary of a document but also answer any questions you may have.

Consider these questions that can readily be answered by most people after reading the corresponding document:

What does my insurance policy cover?
Does my insurance cover my mobile phone?
When did we sign the contract?
When do I get my pension?
Why is my mobile bill so high?
How long until I pay off my mortgage?
Did the proposal include exclusivity for the North American market?
What is the main idea behind the thesis?

What we find easy, is no cake walk for A.I. today.

While this might sound like the perfect challenge for machine learning, it becomes a gargantuan task once you understand that many different types of documents contain information on every domain and subject known - past to present.

It is not insurmountable; maybe the way forward is not only improved machine learning but new methods of authoring documents. Automated systems generate many of the documents that we receive today.

Consider your bills, contracts, policies they all created automatically by systems that have a lot more information ("metadata") than is just contained in that document. Enriching documents with metadata already available in our document production workflows makes it possible for machine learning to get to grips with your documents.

Companies such as Pryon (https://www.pryon.ai) and InfoSlips (https://www.infoslips.com) are already turning the dream to reality. Asking your document a question and getting a relevant answer is definitely in our future.

Personally, I cannot wait for the day when I do not have to personally read every document that makes it way to me. The day when voice assistants make us more productive; free up time and improve the quality of our lives.

To view or add a comment, sign in

The problem with A.I. and Documents

Alen Burger

More articles by Alen Burger

Others also viewed

AI and the Future of Financial Services Software

From Buzzsaw to Rocketship - IRS AI Edition

The Token Economy: Why Invoicing by AI Tokens Is the New “Lines of Code” Metric

Next up for SAAS stocks disruption...

The Observability Tax: A Forensic Audit of How Your Unified Platform Became a Liability.

From Technical Jargon to Executive Buy-in: The Art of Stakeholder Communication That Actually Moves Projects Forward

You're Paying Rent on Your Own Intelligence. You Just Don't Know It Yet.

Your POVs are losing you money

Cloud, AI and Humans: Why do we call it Bookkeeping when we no longer use Books?

Intelligent Orchestration: Payment Posting From RPA to Agentic AI

Explore content categories

More articles by Alen Burger

Hyper-Personalization Is Overrated: Why Most “Personalized” Experiences Still Fail Customers

The End of Human-Readable Code as the Source of Truth

The Quiet Erasure

When Code Stops Being the Product

The Federal Reserve Just Forced the U.S. to Confront the Future of Checks

Pit Crew Precision for Pediatric Care: What F1 & Children’s Hospitals Teach Us About Better Communication

Under the Lights: What a Night Race in a Vegas Taught Me About Communication

Dear Healthcare, It’s 2025. We Need to Talk!

My Credit Card Renewal Letter

Rebutting Key Points from “30 Years Later: Why the World Still Runs on Digital Paper"