Gregg Cochran’s Post

🏭 I built an agentic dark factory for AI building. Imagine telling AI agents what you want to build, turning the lights off, and the agentic factory gets to work. That future isn’t hypothetical anymore. Dark Factory is an experiment in spec‑to‑software automation, using the GitHub Copilot CLI. (repo and website link in comments) At the core: Six specialist agents, each with its own prompt, model assignment, and governance rules. They’re stateless and only see what the Factory Manager explicitly passes forward. Here’s how it works: 1. You give Dark Factory a short, natural‑language goal in the GitHub Copilot CLI. 2. The system spins up an isolated, disposable git worktree so every build is clean and contained. 3. Specialist agents (Product, Architecture, Build, QA) move through a checkpoint‑gated pipeline. 4. Each phase must pass before the next begins. 5. A sealed acceptance test suite is generated from the spec before any code is written. 6. The building agents never see these tests, which prevents “teaching to the test.” 7. The output is a review‑ready pull request. This project is about exploring what AI systems can do when agnet orchestration, verification, and governance are designed intentionally. Have an idea you’re curious to see tested? Post it below. I’ll choose one, run it through the Dark Factory, and share the outcome. #AI #GitHub #Copilot #CopilotCLI

  • text

This is awesome Gregg Cochran! "Have an idea you’re curious to see tested?"Idea: Factory Communication Latency & Reliability Profiler Design and validate a system that determines the optimal communication architecture between factory machines and MES systems under real-world conditions. Objective: Given a simulated battery manufacturing line (200+ stations), identify which protocol and topology (TCP, MQTT, or hybrid) minimizes latency, maximizes throughput, and maintains reliability under load. Requirements: Support multiple communication protocols: Raw TCP sockets MQTT (brokered pub/sub) Simulate factory nodes generating test data: Payload sizes ranging from 1 KB to 1 MB Variable message frequency (steady-state and burst traffic) Include MES endpoint simulation with configurable response latency Measure: End-to-end latency (p50, p95, p99) Throughput (messages/sec) Packet loss and retry rates System degradation under scale (50 → 200+ nodes)

5/6) *chefs kiss* - Great additions!

Like
Reply

Very cool to see you building, Gregg! Is the goal here to create new stuff, test models on their ability to deliver, or just explore?

See more comments

To view or add a comment, sign in

Explore content categories