Magentic-UI: A Multi-Agent Web Interface for Complex Task Automation
Overview
Magentic-UI is a pioneering research prototype that introduces an agentic approach to automating complex web-based tasks. Designed for seamless collaboration between human users and AI agents, this system combines multiple specialized agents under the coordination of an intelligent Orchestrator. It offers transparency, control, and a high degree of flexibility in task execution.
Core Architecture
Magentic-UI is built on a team of five agents that work together in a modular architecture:
Key Features
🧑🤝🧑 Co-Planning
Users and the Orchestrator collaborate to define a step-by-step execution plan. The interface allows users to add, modify, delete, or regenerate steps for optimal task planning.
🤝 Co-Tasking
Execution of tasks is a cooperative process. Agents carry out subtasks while continuously integrating real-time feedback from users.
🛡️ Action Guards
Sensitive or potentially destructive operations require user approval, ensuring full transparency and control over actions performed by agents.
🧠 Plan Learning
The system adapts over time by learning from previous plans and user interactions, improving the efficiency of future executions.
How Magentic-UI Works
The entire process is interactive, visual, and modifiable by the user at any time.
Getting Started
Prerequisites
Installation (PyPI)
python3 -m venv .venv
source .venv/bin/activate
pip install magentic-ui
export OPENAI_API_KEY=<YOUR API KEY>
magentic ui --port 8081
Visit http://localhost:8081 to launch the UI.
Advanced Configuration
Using a Config File
To configure custom API keys or switch to Azure OpenAI, create a config.yaml file in ~/.magentic_ui. Here’s an example:
yaml
model_config: &client
provider: autogen_ext.models.openai.OpenAIChatCompletionClient
config:
model: gpt-4o
api_key: <YOUR API KEY>
max_retries: 10
Recommended by LinkedIn
orchestrator_client: *client
coder_client: *client
web_surfer_client: *client
file_surfer_client: *client
action_guard_client: *client
For Azure integration, replace the provider and include your endpoint, deployment name, and authentication method.
Building from Source
cd magentic-ui
uv venv --python=3.12 .venv
uv sync --all-extras
source .venv/bin/activate
# Install node via nvm
nvm install node
# Install dependencies
cd frontend
npm install -g gatsby-cli
npm install --global yarn
yarn install
yarn build
cd ..
magentic ui --port 8081
For frontend development, launch it separately:
cd frontend
cp .env.default .env.development
npm run start
Contributing
Magentic-UI is an open-source project under the Microsoft Open Source Code of Conduct. Contributions are welcome through pull requests or issue reviews.
Before contributing:
poe check
Marked issues such as "open for contribution" are good starting points.
Conclusion
Magentic-UI redefines how AI systems and humans collaborate on the web. With powerful agent orchestration, clear task segmentation, and user-controlled transparency, it serves as a blueprint for future human-AI co-working interfaces. Whether you're a developer, researcher, or enthusiast, Magentic-UI provides a robust and extensible foundation for building intelligent web automation tools.