Build software, orchestrating from a CLI - Problem Engineering Part 3

Vicky Biswas

Published Mar 26, 2026

Target Audience: Anyone who develops software, tests it, or is into DevOps.

In my last post, we explored how to break free from PPP (Prompt, Paste, Pray) by acting as a "Problem Engineer." We used specialized agents (ChatPRD, v0, Supabase) to build a full-stack app, showing that engineering constraints are more effective than writing isolated features.

But there was a catch: You were still the coordinator. You were switching tools, copying context, and manually driving the flow

Let's take the next step. We are moving from manual agent handoffs to central orchestration. We will use one central tool to run multiple agents, building a React frontend and Python backend, entirely from the command line. We will then use GitHub issues and PRs to enhance the software, mirroring what you do in your daily work.

Here is how you operationalise Problem Engineering - at the orchestration layer.

The Insight: Mapping the "Goldilocks Zone" - to Code

To orchestrate agents effectively, we must map our theoretical framework directly into the environment:

High-level outcomes become your Instructions to the Agents or Orchestrator.
Constraining details become parts of your Agents and Config (e.g., CLAUDE.md files).
Validation methods go into your Tools, Skills & Hooks (e.g., automated formatters and MCPs).

Note: I am using claude but this can easily be codex, or copilot, or aider, or other tools, some of which I showed on this graphic.

Article content — You could use any tool with slight modification

We are establishing a contract-first pipeline for new projects, existing codebases, and continuous improvement. This is not to establish best practice, or even near what I would call great quality, but good as a next step for developers on the AI journey. More will follow...

Let’s get our toes wet - The Walkthrough: The "Stranger Things" Calculator

We will build a themed calculator using a React/Next.js SPA (Port 3004) and a Python FastAPI backend (Port 8004), containerised with Docker. I choose Stranger Things, you may go with Bridgerton or Spider-Man, or whatever makes you tick.

Phase 1: The Setup

Instead of bouncing between browser tabs, we set up our environment to govern the agents for us:

# Initialize: Create an empty GitHub repo and install Claude Code 
git clone https://github.com/vickybiswas/agent-demo
cd agent-demo
curl -fsSL https://claude.ai/install.sh | bash

# Equip Plugins: Install specialized knowledge (I suggest frontend-design, pr-review-toolkit, security-guidance)
claude plugin install @anthropic/[plugin-name]

# Bind Tools (MCP): Give the agent access to real tools:
# playwright for UI testing
npm install @playwright/mcp && claude mcp add playwright npx @playwright/mcp@latest
# code-review-graph for architecture review
claude plugin marketplace add tirth8205/code-review-graph && claude plugin install code-review-graph@code-review-graph
# github to read/write PRs directly
claude mcp add-json github '{"type":"http","url":"https://api.githubcopilot.com/mcp","headers":{"Authorization":"Bearer '"$(grep GITHUB_PAT .env | cut -d '=' -f2)"'"}}'

I orchestrated a skill that would help set up hooks, Five Agents, Three Validators (nextjs, fastapi-validator, docker-validator), and Orchestration Files (like CLAUDE.md, REGRESSION.md, CREATE.md, STARTUP.md). In real life, these should be handmade and revised by the team periodically.

I am using Claude code pro for this demo; you can install and use it for free(see Appendix for instructions). However, it is important to understand that the free versions limit context and speed based on the models you choose. For this article, I used Haiku with Pro in yolo mode -> claude --dangerously-skip-permissions --model haiku

Note: Ensure docker is running so claude can use it for testing

Tell Claude: Use the repo-setup skill and INSTRUCTIONS.md to set up Agents, skills, hooks, all CLAUDE.md, etc. for agentic development.

Phase 2: Execution (The New Project)

With constraints locked, we issue high-level instructions:

Tell Claude: Use INSTRUCTIONS.md, the 3 Claude.md files, skills, agents, etc., to build/test a Stranger Things themed calculator with a animated and interactive UI. Use subagents to parallize work.

This will take roughly 20 minutes to build and run the entire code, test cases, documentation, etc. If it asks, answer to your best knowledge, or just say 'follow instructions given'. Once it finishes you will see something like:

Phase 3: Day 2 Operations (Established Projects)

Problem Engineering isn't just for greenfield apps. Because our constraints are defined, we can instruct the agent to orchestrate updates safely. Let's use GitHub Issues to initiate new features and changes, do a prelim RCA, make the fixes, raise a PR:

GitHub Issues: Update Calculator - Update the calculator to a Scientific one. Also, ensure you add elements, framer animations, interactions, colors, and sounds to change UI to a new Spiderman based theme.

Now let's ask Claude to get this done for us:

Tell Claude: Using fix-github-issue implement issue [issue id or url] and test it E2E thoroughly.

You should soon see something like:

The Takeaway

We only covered a frontend and backend here, but the possibilities are endless. Understanding the Problem Engineering is critical here covering concisely:

What to create (Specific requirements)
What to read (Source docs)
Where to put it (Directory constraints)
Format/Style (Lean vs. detailed)
How to run/test (Sample commands)
Success criteria (Definition of Done)

Are you ready to stop typing and start orchestrating? Try it yourself and let me know how "Problem Engineering" changes your workflow.

Pro-Tip: Use git worktrees to run multiple Claude sessions in parallel, acting as an Agent Team, or explore the agent team feature.

Appendix

1. Claude Code for free (Severely dependent on model)

create in Githuv > Settings > Developer > Personal access tokens
save in a .env file with -> GITHUB_PAT=your-github-pat ()
signup for openrouter https://openrouter.ai./ for free LLM access. Create a Key.
setup claude code - https://github.com/anthropics/claude-code
install claude cli -> curl -fsSL https://claude.ai/install.sh | bash
run -> alias claude-f='CLAUDE_CONFIG_DIR=~/.claude-f1 ANTHROPIC_BASE_URL="https://openrouter.ai/api" ANTHROPIC_AUTH_TOKEN="sk-..." ANTHROPIC_API_KEY="" CLAUDE_CODE_MAX_OUTPUT_TOKENS=2048 OPENCODE_EXPERIMENTAL_OUTPUT_TOKEN_MAX=2048 ANTHROPIC_MODEL="openrouter/free" ~/.local/bin/claude && source ~/.zshrc && claude-f (not persisting env to keep different versions running).
replace the key before running above
claude should be run as -> claude (pro if you buy) and claude-f (free)

2. Time and Cost

0% / 52% - Usage at Start Session/Weekly limit
7m - Setup
4% / 53% - Usage after Setup Session/Weekly limit
25 m - Application
12% / 54% - Usage after App Session/Weekly limit
15 m - Enhance to Scientific
23% / 55% - Usage after Enhance Session/Weekly limit

3. Quality of files and prompts

The prompts and files used here are shortcuts taken for demo purposes and are not anything near to what I would use in production. In coming posts I will approach on that as well. To start I would want the md files to be between 100 and 200 lines.

4. Ask claude for status if it waits too long

Vinit Sankhe 1mo

Lovely. Exact and precise.

See more comments

To view or add a comment, sign in

Build software, orchestrating from a CLI - Problem Engineering Part 3

Vicky Biswas

Phase 1: The Setup

Phase 2: Execution (The New Project)

Recommended by LinkedIn

Phase 3: Day 2 Operations (Established Projects)

The Takeaway

Are you ready to stop typing and start orchestrating? Try it yourself and let me know how "Problem Engineering" changes your workflow.

More articles by Vicky Biswas

Others also viewed

DEVOPS TASK3:KUBERNETES+JENKINS

Automating Continuous Deployment in DevOps using Jenkins

Day16-Docker Project for DevOps Engineers. 90Days of DevOps Challenge

[Test Engineering Weekly] Test Automation Portfolio, testing serverless apps, bugs in NASA software, the internals of databases

Building Log Archiver CLI Tool: A DevOps Project Implementation

DevOps Assembly Line Task 3

Bot-Driven Development: When Code Starts to Code Back

Better understanding of Git Rebase and Merge

DevOps Digest: June

Explore content categories

Phase 1: The Setup

Phase 2: Execution (The New Project)

Recommended by LinkedIn

Phase 3: Day 2 Operations (Established Projects)

The Takeaway

Are you ready to stop typing and start orchestrating? Try it yourself and let me know how "Problem Engineering" changes your workflow.

More articles by Vicky Biswas

Build a Full-stack Multi-tech app using Agents, start Problem Engineering

"Problem Engineering" - Where software development is heading

Is your phone eating Oreo? What's New?

Building Technology for Startups

Probable Launch of IPhone 7, Iphone 7 Plus and Apple Watch 2 with IOS 10 today

Coding or Architecting an Application

Why? and is it correct...

Marketing Revolution : Realizing the Power of Technology

Others also viewed

DEVOPS TASK3:KUBERNETES+JENKINS

Automating Continuous Deployment in DevOps using Jenkins

Day16-Docker Project for DevOps Engineers. 90Days of DevOps Challenge

[Test Engineering Weekly] Test Automation Portfolio, testing serverless apps, bugs in NASA software, the internals of databases

Building Log Archiver CLI Tool: A DevOps Project Implementation

DevOps Assembly Line Task 3

Bot-Driven Development: When Code Starts to Code Back

Better understanding of Git Rebase and Merge

DevOps Digest: June

Explore content categories