Automating CSV Data for Reliable API Testing

Sadia Islam

Published Jun 25, 2025

Recently, I tackled a practical challenge in my API automation work and wanted to share both the problem and my solution — in case it helps someone out there in a similar boat!

🚧 The Problem

In my current project, one of our APIs is responsible for onboarding users through the upload of CSV files. This API checks the uploaded data against existing backend records and rejects the entire upload if it detects any duplicates

Now, here comes the tricky part: My test needs to run repeatedly (automated), but each time it runs with the same CSV file, the API fails because duplicate entries already exist in the system.

Two workarounds were initially proposed:

Manually upload a new CSV file every time (not scalable)
Reset the database daily (risky and not practical for CI/CD)

I knew we could do better.

💡 My Thought Process

Instead of changing the whole database or managing tons of files manually… Why not generate unique data programmatically for every test run?

That way:

The API sees fresh data every time
The test runs on schedule (hands-free!)
No need to reset the DB or manage uploads manually

🛠️ The Solution I Built

I created an R&D project to build an automated solution that:

Create a dummy API locally that mimics the real endpoint behavior (CSV upload, duplicate checks). Later, I deployed it on a dummy server to test the flow in a CI/CD environment.
Used faker.js to dynamically generate unique user data (ID, Name, etc..)
Write that data into a temp.csv file using Node.js
Use Postman to send a form-data request with the CSV
Executed the API request via Newman to upload the CSV file.
Runs in GitHub Actions every 30 minutes or is triggered manually as needed.
Sends me the test result report in Telegram, with a full HTML report with pass/fail status

⚙️ Technologies Used:

Node.js for scripting
faker.js for generating fake (but unique) data
Newman to run Postman tests in CI
GitHub Actions for automation
Telegram Bot API for real-time notifications

Recommended by LinkedIn

The Journey to TDD-JSON Workbench: Simplifying API…

Prashant T. 1 year ago

From Idea to Working Parser: How AI helped me to build…

Marcelo Caldas 2 months ago

Stop Writing Buggy APIs: Why Pydantic Should Be Your…

Hargurjeet Singh Ganger 3 months ago

The Result

✅ Fully automated test
📂 Always fresh and valid data
🧪 No duplicates = no failed tests due to data
📲 Real-time result delivery
💥 100% hands-off CI/CD experience

My Key Learnings

Automating dynamic test data prevents flaky API test failures due to duplicates
Manual work, like resetting databases or uploading new files daily, isn't scalable
Real-time Telegram alerts provide quick visibility into test outcomes [you can use Gmail, Slack, or any other platform to get the real-time update]
A simple R&D initiative can lead to valuable production-ready solutions

What’s Next?

This R&D setup is now complete and validated. I’ll be integrating it into our production test automation pipeline soon.

LIVE API + Repository Link

☑️API Endpoint: https://playing-with-api.onrender.com/docs/

📂Github Repo: https://github.com/Sisadia/playing-with-api

Why I'm Sharing

This might seem like a small challenge, but it's a real-world pain point in test automation — and solving it with code gave me the flexibility to focus on testing logic rather than data cleanup.

Happy testing! ✨

Rahik T. 10mo

My two cents: if I understand the code correctly, you save the data to the database during each test run and delete it in the teardown process. Therefore, using Faker.js to generate fresh data each time seems unnecessary to me. Reimporting the same CSV/data already gives you a unique dataset for every test execution.

See more comments

To view or add a comment, sign in

Automating CSV Data for Reliable API Testing

Sadia Islam

Recently, I tackled a practical challenge in my API automation work and wanted to share both the problem and my solution — in case it helps someone out there in a similar boat!

🚧 The Problem

💡 My Thought Process

🛠️ The Solution I Built

⚙️ Technologies Used:

Recommended by LinkedIn

The Result

My Key Learnings

What’s Next?

LIVE API + Repository Link

Why I'm Sharing

More articles by Sadia Islam

Others also viewed

KUBERNETES CLUSTER HEALTH CHECKS USING ROBOT DATA DRIVEN LIBRARY

Git's Delta Compression Algorithm: Technical Deep Dive

GraphQL API Automation Testing

DataOps: Building Trust in Data through Automated Testing

GitHub Actions vs Claude Skills: Event-Driven Automation vs AI-Driven Execution

Automating dbt Best Practices: SQLFluff, dbt_project_evaluator, and more

Git for Data: Formal Semantics of Branching, Merging, and Rollbacks (Part 1)

Applying QA or Unit testing for Data Science projects

DevOpsify REST API Webserver: API Design, CRUD Implementation & Validation 🚀

Explore content categories

Recently, I tackled a practical challenge in my API automation work and wanted to share both the problem and my solution — in case it helps someone out there in a similar boat!

🚧 The Problem

💡 My Thought Process

🛠️ The Solution I Built

⚙️ Technologies Used:

Recommended by LinkedIn

The Result

My Key Learnings

What’s Next?

LIVE API + Repository Link

Why I'm Sharing

More articles by Sadia Islam

Why Your API Call is Hitting Twice: Understanding Preflight & XHR

Host your Allure Report on GitHub Pages with GitHub Actions

Improving Your Software Testing Workflow with Chrome Dev Tools: Learn Essential Features

Dynamic data for Test Automation

findElement() and findElements() in Selenium WebDriver

Others also viewed

KUBERNETES CLUSTER HEALTH CHECKS USING ROBOT DATA DRIVEN LIBRARY

Git's Delta Compression Algorithm: Technical Deep Dive

GraphQL API Automation Testing

DataOps: Building Trust in Data through Automated Testing

GitHub Actions vs Claude Skills: Event-Driven Automation vs AI-Driven Execution

Automating dbt Best Practices: SQLFluff, dbt_project_evaluator, and more

Git for Data: Formal Semantics of Branching, Merging, and Rollbacks (Part 1)

Applying QA or Unit testing for Data Science projects

DevOpsify REST API Webserver: API Design, CRUD Implementation & Validation 🚀

Explore content categories