How the Copilot Analyst Agent Turns Consultants into Python Developers

How the Copilot Analyst Agent Turns Consultants into Python Developers

If you work in professional services - whether audit, tax, consulting, or law - you are intimately familiar with the specific brand of misery that is the Portable Document Format.

The PDF was designed to preserve the layout of a document, regardless of the software used to view it. It was seemingly not designed to be useful to anyone who actually needs the data contained within it. It is somewhat of a rite of passage for a professional services worker to spend a late night highlighting rows in a PDF bank statement, copying them, pasting them into Excel, and watching in horror as 40 rows of data paste into a single cell.

For years, the solution to this problem was either:

  • An expensive OCR tool that requires three levels of internal approval to purchase.
  • A junior associate with a high pain tolerance.
  • Python.

Python is the gold standard for data cleaning. It is fast, repeatable, and precise. It is also completely inaccessible to the average employee whose only experience with Python was a LinkedIn Learning course they took 3 years ago and only half paid attention to.

This is where the new iteration of Copilot (specifically the Analyst Agent) changes things. It allows non-technical staff to utilise Python without writing a single line of syntax. (I should heavily caveat that this works best for simple standalone scripts, software developers managing multi-file client facing software will keep their jobs for the foreseeable)

Here is a practical workflow we used recently to convert a messy PDF into clean Excel rows.

The Problem

You've received a transaction log from a legacy system. The sender of this log file, naturally, could only provide it as a PDF. It was hundreds of pages of unstructured text, headers, and footers.

Step 1: The Low-Tech Prep

While Copilot can read PDFs, I have found that for particularly large or messy documents, it helps to meet the AI halfway. We opened the PDF in Adobe Acrobat (or Word) and simply did a "Save As > Text (Plain)."" (Note. if your PDF is scanned a lob-sided fashion your out of luck, as a rule of thumb if your out txt file doesn’t look “right” your probably best of finding another solution)

This strips away the formatting step and leaves us with the raw data.

 

Article content
Post "Save As > Text (Plain)" in Adobe Acrobat

Step 2: The Prompt (No Code Required)

This is where the utility for professional services becomes apparent. Previously, to parse this text file, I would need to define some regex, import pandas, set up a loop, test, iterate and eventually have an output deemed sufficient.

With Copilot, I simply uploaded the .txt file and gave it instructions in plain English. I treated the AI like a very fast, very literal intern.

The Prompt:

"I have uploaded a text file containing transaction logs. Please write a Python script to parse this text file. I need you to identify every row that begins with a date in DD/MM format. Extract the Date, the Number/Name, and the Amount. Ignore page headers and footers. Output the result as an Excel file."

Article content
prompt

Step 3: The Execution

This is the part that really impressed me. Copilot does not just "guess" the answer. It writes and executes a Python script in a sandboxed environment to process the file.

You can actually see the logic it creates. It identifies the patterns, splits the text lines, and structures the data. If it encounters an error (for example, if the date format changes halfway through), it often self-corrects or asks for clarification.

Article content
Generated Python Logic

Step 4: The Output

In less than 90 seconds (87 to be exact), Copilot provided a download link.

The result was a perfectly formatted Excel table. No manual spacing, no copy-paste errors, and no merged cells.

Article content
Output excel


Why This Matters for Professional Services

The value here is not just that we converted a file. It is that we democratised a technical skillset.

In a traditional firm structure, there is often a wall between the "business side" and the "data team." If a Tax Director wanted to run a complex data extraction, they had to ticket it to the data group and wait three days.

With tools like the Analyst Agent acting as the interface for Python:

  • Latency improves. The analysis happens in real-time during the engagement.
  • Accuracy improves. Python scripts are less prone to the "fat finger" errors inherent in manual Excel data entry.
  • Audit trails exist. Because Copilot provides the code it used, the methodology can be reviewed. We can see exactly how the data was filtered.

We are moving toward a reality where "Python" is no longer a noun referring to a programming language you need to learn, but a verb that the software does for you.

For those of us in professional services, this creates a necessary shift in how we allocate our time. We are trading hours of data wrangling for hours of strategic interpretation.

Ultimately, clients hire us for our insights, not our ability to manually resize Excel rows. It is refreshing to finally have a workflow that allows us to focus entirely on the former.

Excellent article and use case Éimhín. You are so right, our Clients want our insights not our time spent on manual data transfer.

To view or add a comment, sign in

More articles by Éimhín Ward

Others also viewed

Explore content categories