When Documents Become Data, Everything Changes

Jim Thompson

Published Feb 23, 2026

For the past 30 years, I’ve had the privilege of working on some genuinely exciting challenges — healthcare systems, consumer goods, payments, and e-commerce — much of it centered around complex Salesforce implementations. So when I found myself spending time in the world of document ingestion and forms processing, I’ll admit: I wasn’t thrilled. PDFs? Paper forms? Attachments? It felt like plumbing.

But the deeper I’ve gone with today’s technology, the more I’ve realized something important: documents still quietly run the world. Tax returns. Contracts. Grant applications. Proposals. Intake forms. Compliance documents. Statements of work. Entire cloud platforms exist to modernize these workflows — yet even in cloud-first enterprises, critical business processes still begin life as a PDF. Emailed. Scanned. Downloaded from cloud storage. Filled out and uploaded again. And most organizations are still manually retyping that information into systems like Salesforce.

What’s changed isn’t the existence of documents. What’s changed is what we can do with them. Platforms like Microsoft’s Azure AI Document Intelligence and the newer Azure AI Content Understanding now make it possible to extract structured meaning from unstructured documents at enterprise scale. Instead of relying on brittle, position-based templates, we can classify, extract, summarize, and score documents based on semantic understanding rather than pixel coordinates.

A few recent standouts genuinely surprised me. With Sheepdocs, we recently processed IRS forms and schedules, extracting structured financial data and mapping it directly into Salesforce records, without rigid template configuration or heavy custom training. Just intelligent document capture, field mapping, and review before commit. In another case, we analyzed responses to an RFP: summarizing scope, extracting key dates and pricing, identifying risk indicators, and recommending next steps. What would have taken hours of reading became structured, reportable data inside Salesforce.
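To make "field mapping, and review before commit" a little more concrete, here is a minimal Python sketch of the idea. It is not how Sheepdocs or the Azure services actually work internally; the field names, the `FIELD_MAP` dictionary, the 0.85 threshold, and the `map_for_review` helper are all hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass

# Hypothetical mapping from extracted field names to Salesforce-style API names.
FIELD_MAP = {
    "taxpayer_name": "Name",
    "total_income": "Total_Income__c",
    "filing_year": "Filing_Year__c",
}

# Assumed confidence cutoff; in practice this would be tuned per document type.
REVIEW_THRESHOLD = 0.85


@dataclass
class ExtractedField:
    name: str
    value: str
    confidence: float  # 0.0 to 1.0, as reported by an extraction service


def map_for_review(fields):
    """Build a draft record and list the mapped fields a human should check."""
    record = {}
    needs_review = []
    for f in fields:
        target = FIELD_MAP.get(f.name)
        if target is None:
            continue  # unmapped fields are simply skipped in this sketch
        record[target] = f.value
        if f.confidence < REVIEW_THRESHOLD:
            needs_review.append(target)
    return record, needs_review


# Made-up sample values, not real extraction output.
fields = [
    ExtractedField("taxpayer_name", "Jane Doe", 0.99),
    ExtractedField("total_income", "84,120", 0.62),
    ExtractedField("preparer_note", "n/a", 0.90),
]
record, needs_review = map_for_review(fields)
print(record)
print(needs_review)
```

Nothing is written to the system of record here. The draft record and the list of low-confidence fields would go to a reviewer first, and only the approved result would be committed.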

The more I talk with people in the Salesforce ecosystem, the more I’m convinced this need is larger than many acknowledge. Nonprofits issuing grants. Enterprises reviewing proposals from consultants and systems integrators. Healthcare organizations ingesting intake forms. Risk managers reviewing contracts. Financial services teams handling compliance documentation. Paper and PDFs haven’t gone away. They’ve simply been waiting for better tools.

What excites me now isn’t "scanning forms" or "processing PDFs." It’s the idea that documents can become actionable assets. When Microsoft-powered AI and machine learning are combined with Salesforce as the system of record, something interesting happens: documents stop being static files and start becoming structured intelligence. Could a chat agent extract insights from a handful of documents? Absolutely. But processing hundreds or thousands per hour — reliably, securely, and with validation — requires more than a prompt. It requires structured workflows, governed models, scalable infrastructure, and deliberate system design.

I didn’t expect to find document processing interesting. But when you zoom out, it’s less about documents — and more about unlocking trapped data at scale. And that’s a much bigger opportunity than it first appears.
