Lukas Nießen’s Post

The biggest risk in software right now isn’t downtime. It’s letting AI coding agents quietly erode your architecture one "fix" at a time.

When an LLM gets stuck, it usually doesn’t stop and ask:
- “Should this layer even know about that one?”
- “Is this dependency direction allowed?”
- “Are we introducing a circular dependency here?”

It just makes the code work.

So routers start importing database code directly. Service layers begin depending on framework internals. Circular dependencies creep in. And six weeks later the codebase still “runs”, but nobody wants to touch it anymore.

That’s exactly why I built ArchUnitPython. It lets you enforce architectural rules in Python projects by writing them as simple unit tests. So instead of *hoping* humans or LLMs respect your architecture, you can make those rules executable and enforce them in CI.

Example:

    rule = (
        project_files("src/")
        .in_folder("**/presentation/**")
        .should_not()
        .depend_on_files()
        .in_folder("**/database/**")
    )
    assert_passes(rule)

A few things it can do:
- enforce dependency direction rules
- detect circular dependencies
- validate naming conventions
- validate PlantUML diagrams against code
- calculate architecture/code quality metrics
- support custom rules
- special support for FastAPI and Django

The goal is simple: if your team has architectural decisions, they should live in tests, not just in wiki pages, PR comments, or one senior engineer’s head. Once they run as tests in your CI/CD pipeline, they are checked on every change.

Feedback and PRs are highly welcome!

Repo: https://lnkd.in/dMGDBGkP

#Python #SoftwareArchitecture #OpenSource #Testing #FastAPI #Django #Pytest #CodeQuality #AIEngineering #LLM
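To make the idea of an "executable architecture rule" concrete, here is a minimal standard-library sketch of the two checks the post mentions first: a forbidden dependency direction and a circular import. It is not ArchUnitPython's implementation or API. The module names, the `EXAMPLE` sources, and the helpers `imported_modules`, `forbidden_imports`, and `find_cycle` are all hypothetical, and the sketch only looks at absolute imports.

```python
from __future__ import annotations

import ast

# Hypothetical project: module name -> source text (illustration only).
EXAMPLE = {
    "app.presentation.router": "from app.database import models\nimport app.services.users\n",
    "app.services.users": "from app.database.models import User\n",
    "app.database.models": "import app.services.users\n",
}


def imported_modules(source: str) -> set[str]:
    """Return the absolute module names imported by a piece of Python source."""
    modules: set[str] = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            modules.update(alias.name for alias in node.names)
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            modules.add(node.module)
    return modules


def forbidden_imports(sources: dict[str, str], layer: str, banned: str) -> list[tuple[str, str]]:
    """List (module, imported) pairs where a module inside `layer` imports from `banned`."""
    hits = []
    for name, source in sorted(sources.items()):
        if not name.startswith(layer + "."):
            continue
        for dep in sorted(imported_modules(source)):
            if dep == banned or dep.startswith(banned + "."):
                hits.append((name, dep))
    return hits


def find_cycle(sources: dict[str, str]) -> list[str] | None:
    """Return one import cycle as a list of module names, or None if there is none."""
    graph = {
        name: sorted(dep for dep in imported_modules(src) if dep in sources)
        for name, src in sources.items()
    }
    visiting: list[str] = []  # current depth-first path
    done: set[str] = set()    # modules fully explored without finding a cycle

    def visit(node: str) -> list[str] | None:
        if node in done:
            return None
        if node in visiting:
            return visiting[visiting.index(node):] + [node]
        visiting.append(node)
        for dep in graph[node]:
            cycle = visit(dep)
            if cycle:
                return cycle
        visiting.pop()
        done.add(node)
        return None

    for name in sorted(graph):
        cycle = visit(name)
        if cycle:
            return cycle
    return None
```

In a test suite, a rule then becomes an ordinary assertion, for example `assert forbidden_imports(sources, "app.presentation", "app.database") == []`, which fails the CI run as soon as a router imports database code.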

11 Comments

Lukas Nießen 6d

Repo link: https://github.com/LukasNiessen/ArchUnitPython

ArchUnitPython is basically "ArchUnit for Python". For those unaware, ArchUnit is a famous Java library that lets you do the exact same thing as described in my post, just for Java projects instead of Python.

1 Reaction

Aqdas Malik 4d

The boundary enforcement approach in ArchUnitPython solves exactly the problem that catches teams off guard six months into working with AI agents on a codebase. We hit a similar wall on a Phoenix project where Credo and mix quality checks caught individual style issues but not dependency direction violations across bounded contexts. Making architectural rules first-class test citizens that run in CI is the only reliable way to give AI agents a hard constraint they cannot bypass through clever code generation. How does the performance hold up when running these rules against larger codebases, say 100,000 lines or more?

1 Reaction

Shing Chan 6d

Has anyone tried to produce said architectural rules with an LLM? I think this project has huge upsides as part of a complex A.I. developer workflow.

1 Reaction

Julien Zaegel 5d

I've had this issue on a codebase recently: the design progressively became a mess, with agents adding hacks on top of hacks to work around architecture issues. Fortunately, we had enough post-deployment tests to provide a safety net. I've been able to completely restructure the project in a couple of weeks. Now the project makes sense and AGENTS.md points to a clear architecture description. Agents are now following the path of least resistance and keeping changes on the good (or at least better) architecture.

Lessons that I learnt:
* Invest in good structure and documentation early. Agents make the code rot 10x faster.
* Reviewing only diffs in PRs is not sufficient. Taking some time to look at the project structure as a whole and assess the fitness of the patterns should be part of team hygiene.
* Defining what good architecture is remains a matter of taste and judgement. Yes, we can delegate a lot of implementation to agents, but architecture needs scrutiny and human ownership.

1 Reaction

Frank Peters 1d

Lukas, the 'codebase still runs but nobody wants to touch it' line is painfully accurate. Great approach making rules executable!

1 Reaction

Jean-Francois René 5d

Stateful global singletons, sometimes with public properties... Opus tries to write functional code, but it still does this. The problem is that models learned from so much public code where pattern violations exist. I like the idea of architectural design unit tests.

1 Reaction

Tomáš S.

Senior Principal Software Engineer · Java / Jakarta EE · Enterprise platform architecture — Coherence over orthodoxy.

Agree, one of the issues with AI coding isn't that agents can't write good architecture, but that developers can't explain it and enforce it. Sometimes just a single file living in the repo can do that, but your rule is a really nice addition on top.

1 Reaction


More Relevant Posts

souvik dutta
1w
🚀 Built & Open-Sourced: A Production-Ready LLM Workflow Engine in Python

Over the past few months, I kept running into the same problem:
➡️ LLM apps don’t fail at prompts. They fail at orchestration.

What starts simple quickly becomes:
– multi-step pipelines
– retry logic everywhere
– branching flows
– zero visibility into execution

So I built something focused on that layer.

protokol-core 👉 https://lnkd.in/dfA5wZWq

💡 What this actually solves:
• Orchestrating multi-step LLM workflows cleanly
• Managing retries, failures, and branching logic
• Keeping execution fully explicit (no hidden state)
• Designing systems that are easy to debug and extend

⚙️ What makes it different:
• Minimal, dependency-light core
• Composable workflow primitives
• Deterministic execution model
• First-class support for nested flows / subflows
• Clear separation of logic vs execution
• Built for real-world systems, not toy demos

🧠 Key idea: LLMs are not the hard part anymore. Workflow + control flow + reliability is. And most frameworks abstract that away… until you actually need control.

🎯 The gap:
Most people can build: “prompt → prototype” ✅
Very few can reliably build: “multi-step system → production”
That’s where things break. This project is focused entirely on that transition.

If you’re working on:
• AI platforms
• LLM infra
• Developer tools
• Production AI systems
You’ll probably relate to this.

Try it. Break it. Extend it. Curious to hear how others are solving this.

⭐ https://lnkd.in/dfA5wZWq

#Python #AIEngineering #SystemDesign #LLM #BackendEngineering #BuildInPublic
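As a rough illustration of what "explicit retries with visible execution" can look like, here is a small plain-Python sketch. It does not use or reproduce the protokol-core API. The names `run_with_retries`, `run_pipeline`, `make_flaky`, and `add_exclamation` are hypothetical, and the retry policy (retry on any exception, no backoff) is an assumption made to keep the example short.

```python
from __future__ import annotations

from typing import Any, Callable

Step = Callable[[Any], Any]


def run_with_retries(step: Step, value: Any, max_attempts: int = 3) -> tuple[Any, int]:
    """Call step(value), retrying on any exception; return (result, attempts used)."""
    for attempt in range(1, max_attempts + 1):
        try:
            return step(value), attempt
        except Exception as error:
            if attempt == max_attempts:
                raise RuntimeError(
                    f"{step.__name__} failed after {attempt} attempts"
                ) from error
    raise ValueError("max_attempts must be at least 1")


def run_pipeline(steps: list[Step], value: Any, max_attempts: int = 3) -> tuple[Any, list[tuple[str, int]]]:
    """Run steps in order and return the final value plus a trace of (step name, attempts)."""
    trace = []
    for step in steps:
        value, attempts = run_with_retries(step, value, max_attempts)
        trace.append((step.__name__, attempts))
    return value, trace


def make_flaky(failures: int) -> Step:
    """Build a step that raises `failures` times before succeeding (stands in for an LLM call)."""
    remaining = [failures]

    def flaky_upper(text: str) -> str:
        if remaining[0] > 0:
            remaining[0] -= 1
            raise ConnectionError("transient failure")
        return text.upper()

    return flaky_upper


def add_exclamation(text: str) -> str:
    return text + "!"
```

Calling `run_pipeline([make_flaky(1), add_exclamation], "ok")` returns the final value together with a per-step attempt count, so the execution history lives in a returned value rather than in hidden state.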

GitHub - souvik666/protokol-core: The Ultra-Lightweight, Enterprise-Grade LLM Orchestration Framework for Python. github.com
Michael Bennett
4w
What the Architecture Reveals 🔍

512,000 lines of leaked Claude Code told us something important. The most powerful AI coding agent in the world isn't built on magic. It's built on surprisingly minimal architecture. Here's what the claw-code analysis revealed about how production agentic systems actually work:

One agent loop. 40+ discrete tools. No rigid workflows. No hardcoded task sequences. The harness creates conditions for reasoning. The model does the work.

Subagent spawning on context overflow. When a task risks filling the primary context window, Claude Code spawns independent agent instances with their own context and scope. Exploratory work doesn't contaminate the main thread. This is how you build agents that can actually run for hours without losing coherence.

Permission-gated tools. Deny list always wins. Every tool (bash, file reads, web fetch, git ops) is individually permission-gated. Compound bash commands are evaluated sub-command by sub-command. If any part gets denied, the whole chain is blocked. This is the right design for anything executing real shell commands.

44 hidden feature flags. The most strategically sensitive part of the leak. These are features Anthropic has built but hasn't shipped. Competitors now have a product roadmap they weren't supposed to see.

This architecture validates everything we've built at SELARIX. One founder. A cabinet of specialized agents. Tool permissions scoped by role. Context managed by design. The blueprint was always sound. Now everyone can see it.

🔗 claw-code.codes
🔗 https://lnkd.in/dJaA59Gt

#AIArchitecture #AgenticAI #ClaudeCode #ClawCode #MultiAgent #SELARIX #OpenSource
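The "deny list always wins" rule for compound commands can be sketched in a few lines of Python. This is only an illustration of the rule as the post describes it, not Claude Code's implementation. The `DENIED_PROGRAMS` set and both function names are hypothetical, and the splitting is deliberately naive: it ignores quoting, subshells, and redirections, which a real permission gate would have to parse properly.

```python
from __future__ import annotations

import re

# Hypothetical deny list, for illustration only.
DENIED_PROGRAMS = {"rm", "curl"}


def split_subcommands(command: str) -> list[str]:
    """Naively split a shell command on &&, ||, ; and | (quoting is ignored)."""
    parts = re.split(r"&&|\|\||;|\|", command)
    return [part.strip() for part in parts if part.strip()]


def is_allowed(command: str, denied: set[str] = DENIED_PROGRAMS) -> bool:
    """Allow the chain only if no sub-command starts with a denied program."""
    for sub in split_subcommands(command):
        program = sub.split()[0]
        if program in denied:
            return False  # one denied part blocks the whole chain
    return True
```

With this sketch, `is_allowed("ls -la && git status")` passes, while `is_allowed("git status && rm -rf build")` is rejected even though the first half of the chain is harmless.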

GitHub - mbennett-labs/claw-code-parity: claw-code Rust port parity work - it is temporary work while claw-code repo is doing migration github.com
Amber Lopez
1w
𝐀𝐈 𝐢𝐬 𝐰𝐫𝐢𝐭𝐢𝐧𝐠 𝐧𝐞𝐚𝐫𝐥𝐲 𝐡𝐚𝐥𝐟 𝐨𝐟 𝐲𝐨𝐮𝐫 𝐝𝐞𝐯𝐞𝐥𝐨𝐩𝐞𝐫𝐬' 𝐜𝐨𝐝𝐞. That number isn't hypothetical anymore. GitHub Copilot now generates an average of 46% of code written by active users, with Java developers reaching 61%. The productivity case is clear, but the security case is getting complicated. Nearly 30% of Copilot-generated Python code contains potential security weaknesses, and most of it lands in CI/CD pipelines before anyone's looked closely at it. The problem isn't the AI. It's the gap between where the code gets written and where it gets scrutinized. Pipeline gates, policy enforcement, and automated security scans need to move closer to the source, not sit at the end of the delivery chain waiting to become a blocker. That's the architecture shift Opsera was built for: connecting the intelligence of your AI coding tools to the governance your pipelines actually need. What's your team doing to validate AI-generated code before it merges?
1 Comment
Sheik Nomaan
4w
Just published a new Medium article on using 𝑺𝑲𝑰𝑳𝑳.𝒎𝒅 in production LLM systems. At its core, 𝑺𝑲𝑰𝑳𝑳.𝒎𝒅 enables 𝐦𝐨𝐝𝐮𝐥𝐚𝐫 𝐩𝐫𝐨𝐦𝐩𝐭𝐬, 𝐫𝐮𝐧𝐭𝐢𝐦𝐞 𝐬𝐤𝐢𝐥𝐥 𝐬𝐞𝐥𝐞𝐜𝐭𝐢𝐨𝐧, and 𝐬𝐜𝐚𝐥𝐚𝐛𝐥𝐞 𝐛𝐮𝐬𝐢𝐧𝐞𝐬𝐬-𝐫𝐮𝐥𝐞 𝐦𝐚𝐧𝐚𝐠𝐞𝐦𝐞𝐧𝐭. Instead of burying prompt logic inside Python strings, it allows rules to be organized clearly: • extraction rules • negative examples • domain constraints • validation instructions • exception handling At runtime, the system dynamically routes and loads only the relevant skill instead of sending a monolithic prompt with every LLM call. This directly improves 𝐭𝐨𝐤𝐞𝐧 𝐞𝐟𝐟𝐢𝐜𝐢𝐞𝐧𝐜𝐲, 𝐦𝐨𝐝𝐮𝐥𝐚𝐫𝐢𝐭𝐲, 𝐦𝐚𝐢𝐧𝐭𝐚𝐢𝐧𝐚𝐛𝐢𝐥𝐢𝐭𝐲, 𝐚𝐧𝐝 𝐩𝐫𝐨𝐦𝐩𝐭 𝐯𝐞𝐫𝐬𝐢𝐨𝐧𝐢𝐧𝐠, while making business-rule-heavy LLM workflows much easier to scale. In the article, I walk through a practical invoice extraction pipeline example and how this architectural boundary improved code reviews, evaluation, and iteration speed. #LLM #GenAI #PromptEngineering #RAG #AIEngineering #MLOps

SKILL.md: Treating Prompts as First-Class Artifacts in Production LLM Systems medium.com
Hiếu Nguyễn
1w
Developers are finding new ways to tame the complexity of LLM and agent workflows. At the heart of this effort is hieuchaydi/RepoBrain, a local-first codebase memory engine for AI coding assistants. RepoBrain indexes repositories, retrieves grounded evidence, traces logic flows, and ranks the safest files to inspect or edit before code generation. This is a critical step forward because teams are trying to make agent behavior more reliable, not just more powerful. What sets RepoBrain apart is its ability to provide actionable insights without requiring a hosted backend or API key. This is achieved through a combination of local index + evidence-backed retrieval, route/service/job flow hints for faster codebase orientation, and ranked edit targets with confidence and warnings. RepoBrain's capabilities include: - local index + evidence-backed retrieval - route/service/job flow hints for faster codebase orientation - ranked edit targets with confidence and warnings - built with Python The momentum behind RepoBrain looks earned because the project is easy to place inside a real workflow, not just admire from a distance. It lands in high-interest areas like agent, ai-agents, llm, and recent commits make it feel active instead of abandoned. The project still feels early, which gives it some discovery momentum. Repo: https://lnkd.in/ggAjSMGY #GitHub #OpenSource #GitHubTrending #LinkedInForDevelopers #Python #RepoBrain #Agent #AiAgents
Misael Difo
2w Edited
Big thanks to the creator of Gentle AI (by Gentleman Programming – https://lnkd.in/eNEBdPjY) for making my latest migration project a breeze!

I started by generating the SDD (Spec-Driven Development) plan in a Python/Django workspace, storing all specs and context in Engram. When I switched to the Go workspace, I simply told the AI to “search the Python information in Engram DB”, and it instantly brought all the context and requirements over. From there, I executed the entire implementation, following the same specs and architecture and seamlessly bridging both stacks.

The new project now includes:
- Robust RabbitMQ integration for messaging and notifications
- Strict TDD (Test-Driven Development) with deterministic, in-memory tests
- Structured logging with logrus for traceability
- Full separation of concerns (entity, service, contract) for maintainability

One of the best parts: these tools help you consume far fewer LLM context tokens, since all your specs and history are always available in Engram, with no need to repeat yourself!

Gentleman Programming has several other great AI/open-source projects worth checking out:
- Gentle AI: https://lnkd.in/eZQQNtwZ
- Engram: https://lnkd.in/e7CDbafP

If you’re interested in reproducible, cross-language workflows and persistent project memory, definitely check out these tools!

#SDD #GoLang #Python #Django #SoftwareArchitecture #GentleAI #DevWorkflow #Automation #OpenSource #RabbitMQ #TDD #Logging
ShipIt

36 followers
3w Edited
🚀 ShipIt Agent v1.0.2 — The most powerful open-source Python agent framework After weeks of deep engineering, I'm releasing SHIPIT Agent v1.0.2 — a complete agent framework that goes beyond others. What's new: 🎯 Deep Agents — GoalAgent decomposes objectives and tracks success criteria. ReflectiveAgent evaluates and revises its own output. Supervisor delegates to workers and reviews quality. AdaptiveAgent creates new tools at runtime. 📊 Structured Output — One parameter: agent.run(prompt, output_schema=MyPydanticModel). Returns typed, validated instances. No chain wrapping needed. 🔗 Pipeline Composition — Sequential, parallel, conditional routing. Cleaner than Other LCEL. Full streaming support. 🧠 Advanced Memory — Conversation memory (4 strategies), semantic search with embeddings, entity tracking. AgentMemory.default() for zero-config. 📡 Real-Time Streaming — Every deep agent, pipeline, and team supports .stream(). Watch goal decomposition, reflections, worker delegations, and quality scores in real time. The numbers: 285 tests, 12 examples, 8 notebooks, 13 doc pages, 10 LLM providers, 30+ tools. pip install shipit-agent GitHub: https://lnkd.in/dpUiYqzF Docs: https://lnkd.in/dTxQtvF7 #AI #Python #LLM #AgentFramework #OpenSource

SHIPIT Agent - SHIPIT Agent shipiit.github.io
Raghav Thakur
1mo Edited
Claude Code's source code has been leaked and it's breaking the internet! It's not just an API wrapper around Claude but a tool with a multi-level architecture, which sets a very high bar for shipping AI coding tools.

So how did this happen? Source maps! Source maps are used for debugging. The code that ships is usually minified and bundled, and a source map connects that bundled code back to the original source. The package published to npm accidentally included the source map, which effectively shipped the entire source code in human-readable format.

How to prevent this mistake in your apps:
- Audit your npm package before every release using "npm pack --dry-run"
- Never include source maps in production packages
- Don't overlook .gitignore before pushing changes to production

Malicious actors and developers can now better understand the data flow of Claude Code rather than brute-forcing through prompt injections. Developers can better understand "Claude Code's four-stage context management pipeline and craft payloads designed to survive compaction" (source).

The source code was quickly taken down by Anthropic, but some were lucky enough to see it before it was ;)

Source: https://lnkd.in/g_UYpWfG
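The "no source maps in the package" check from the list above is easy to automate once you have the list of files that would be published. The sketch below assumes you already collected those paths (for example by reading the output of `npm pack --dry-run`); how you collect them is left out on purpose. `find_source_maps` and the example paths are hypothetical names used for illustration.

```python
from __future__ import annotations

from pathlib import PurePosixPath


def find_source_maps(paths: list[str]) -> list[str]:
    """Return every path in a package file list that looks like a source map."""
    return [path for path in paths if PurePosixPath(path).suffix == ".map"]


# Hypothetical file list of a package about to be published.
EXAMPLE_FILES = ["package.json", "dist/cli.js", "dist/cli.js.map", "README.md"]
```

A release script could fail the build whenever `find_source_maps(files)` returns a non-empty list, so the check runs on every release instead of relying on someone remembering it.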

GitHub - instructkr/claw-code: The fastest repo in history to surpass 100K stars ⭐. Better Harness Tools that make real things done. Built in Rust using oh-my-codex. github.com
Inamul Hasan
1w Edited
As a developer, I got tired of the standard code review loop: write code -> push -> wait for a cloud bot (like CodeRabbit) to run -> context switch back to fix the issues -> repeat. I wanted an enterprise-grade AI auditor that worked on my *local staged files* before I even committed.

Pika Review shifts the audit process entirely to your terminal. It concurrently analyzes your git diffs and generates rich Markdown reports right in your IDE (.pika-reports/).

Here is exactly how it helps our daily workflow:
- Shift Left Security: Flags SQL Injection, RCE, and Path Traversal flaws in your terminal, long before you hit "Push."
- Performance Audits: Detects O(N^2) bottlenecks and N+1 query patterns before they hit production.
- Multi-Language Support: A polyglot engine that understands idiomatic risks in TS, Python, Go, Rust, and more.
- Local Markdown Reports: Generates structured, syntax-highlighted reports directly in your project root.
- Always Free (BYOK): Unlike expensive SaaS tools, Pika Review is Bring-Your-Own-Key.

You can try it out now: https://lnkd.in/gnW3Fp3w

I’ve made the tool completely open source and I’m actively looking for contributions and feedback.

⭐ If you find the project useful, please consider dropping a star on the GitHub repo to support its development: https://lnkd.in/gTp_httf

Feel free to raise an issue if you spot a bug or want to suggest a feature. Let me know what you think in the comments! 👇

#OpenSource #CodeReview #AI #SoftwareEngineering #DeveloperTools #CLI #GitWorkflow #Programming #CodingBestPractices #pikareview
Hiếu Nguyễn
5d
Developers are constantly seeking ways to streamline their workflows and make the most of their time. In the realm of LLM and agent workflows, teams often struggle to balance reliability and power. Most rely on cumbersome server-side solutions that are difficult to scale and maintain. This is where ComposioHQ/awesome-codex-skills comes in – a curated list of practical Codex skills for automating workflows across the Codex CLI and API. At its core, this repository provides a collection of Python-based skills that can be used to improve the reliability and efficiency of agent behavior. What stands out is the variety of skills available, including bernstein – a multi-agent orchestrator with Codex CLI adapter, and what Are Codex Skills? – a fundamental question that gets to the heart of how these skills work. What makes this repository particularly interesting is how it addresses a common pain point in the development process. By providing a list of practical skills that can be easily integrated into existing workflows, ComposioHQ/awesome-codex-skills makes it easier for developers to make agent behavior more reliable, not just more powerful. Here are some key highlights: - bernstein – Multi-agent orchestrator with Codex CLI adapter. Runs parallel Codex agents in isolated git worktrees with quality gates. - what Are Codex Skills? - a curated list of practical Codex skills for automating workflows across the Codex CLI and API. - built with Python The traction makes sense: a repository sitting at #3 with around 637 new stars in the current trending window is usually solving a problem people can feel immediately. With its focus on making fast-moving AI workflows easier to steer and reuse in real projects, it's no wonder that ComposioHQ/awesome-codex-skills is getting attention. Repo: https://lnkd.in/eTmpF-UT #GitHub #OpenSource #GitHubTrending #LinkedInForDevelopers #Python #AwesomeCodexSkills #Awesome #AwesomeLists

4,820 followers


More from this author

Idempotence in System Design: Full example

What is GitOps: A Full Example with Code

Why Infrastructure as Code is a MUST have
