I let agents execute arbitrary Python code on my computer. (no, I haven’t lost my mind.) Most developers would call that a security nightmare. I call it the future of local AI and agentic engineering. Isolated containers spin up on demand, agent runs whatever Python it wants inside, nothing leaks out. We've been using it internally for a while, it powers code execution across Onyx agents doing data analysis, file processing, and tool generation. We pulled it out into a standalone repo with no Onyx dependency because this felt like something the whole agent ecosystem is missing. If you're building agents that need code execution, you shouldn't have to spend three weeks on sandboxing infrastructure before you can get to the interesting part. Day 4 of launch week. Look out for tomorrow as we’re taking this sandbox a whole lot further 👀

Congratulations Chris Weaver 🥂 This is interesting.. I have a question for you.. A lot of finance teams still rely on Excel drill-down workflows. In accounts receivable, double-clicking a summary cell often opens a separate sheet with the underlying PO and payment rows. Curious whether Onyx agents can automate that kind of extraction?

Like
Reply

Security first is the right call for agent execution. A reusable isolated sandbox plus auditable runs removes a major blocker for teams that want autonomous workflows in production

Like
Reply

Controlled execution is the real unlock here. Without proper sandboxing, agents can’t safely scale.

Like
Reply

This is great Chris Weaver ! Quick question, are there any restrictions on the file size for analysis?

Like
Reply
See more comments

To view or add a comment, sign in

Explore content categories