GitHub Adds “Rubber Duck” Review Agent to Copilot CLI

GitHub has launched an experimental “Rubber Duck” mode in Copilot CLI, bringing a second AI model into the loop to review, challenge, and validate the primary agent’s work before execution. What’s interesting isn’t just the feature, it’s the pattern.

🔹 Second Opinion by Design: A separate model from a different AI family evaluates plans before they run.
🔹 Focused Review Layer: It flags missed assumptions, edge cases, and hidden risks.
🔹 Better Outcomes on Complex Tasks: Especially effective on multi-file, high-step problems where errors compound.
🔹 Agent + Reviewer Pattern: Introduces a structured “builder + critic” dynamic inside AI workflows.

As agents become more autonomous, the risk isn’t that they can’t execute; it’s that they execute flawed plans too confidently. Rubber Duck introduces friction in the right place: before things break.

At GlenFlow, we see this as a natural next step in agentic development: not just more powerful agents, but systems of agents that challenge each other. Because in an AI-native workflow, quality won’t come from a single smarter model; it’ll come from orchestrated disagreement.

Read more: https://lnkd.in/dUwd5dms

#AI #GitHubCopilot #AICoding #AgenticAI #DevTools #SoftwareEngineering #FutureOfWork #GlenFlow
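The “builder + critic” dynamic described above can be sketched in a few lines of Python. This is a minimal, illustrative control-flow sketch only: `builder_model` and `critic_model` are hypothetical stand-ins for calls to two different AI model families, not Copilot CLI APIs.

```python
# Hypothetical sketch of the builder + critic pattern: the primary agent
# drafts a plan, a second model reviews it, and execution is gated on a
# clean review. Both "models" are stubbed as plain functions.

def builder_model(task: str) -> list[str]:
    # The primary agent drafts a step-by-step plan (stubbed).
    return [f"analyze {task}", f"implement {task}", "run tests"]

def critic_model(plan: list[str]) -> list[str]:
    # The reviewer returns a list of concerns (stubbed): here it only
    # flags plans that skip testing, standing in for a real critique.
    return [] if any("tests" in step for step in plan) else ["no test step in plan"]

def run_with_review(task: str) -> tuple[list[str], list[str]]:
    """Draft a plan, have the critic review it, and gate execution on the result."""
    plan = builder_model(task)
    concerns = critic_model(plan)
    # In a real system the builder would revise the plan while concerns remain;
    # here we simply surface them to the caller.
    return plan, concerns

plan, concerns = run_with_review("refactor auth module")
print(concerns)  # prints [] — the stubbed plan includes a test step
```

The point of the pattern is the gate, not the stubs: the critic sits between planning and execution, which is exactly where overconfident but flawed plans would otherwise slip through.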
GlenFlow’s Post
More Relevant Posts
The next big step for AI agents might be self-critique, not just generation. 🦆

We are introducing Rubber Duck in experimental mode for GitHub Copilot CLI: a second model from a different AI family that reviews the agent’s plan and work at key moments.

What stood out to me is that this is not positioned as “more AI for the sake of more AI”. It is a targeted reviewer that steps in at high-value moments, such as after drafting a plan, after a complex implementation, and after writing tests before execution. That feels like a very practical way to reduce compounding errors early, especially in long-running or multi-file tasks.

I also like the product thinking here. Rubber Duck is invoked sparingly, either automatically at the right checkpoints or on demand when the user asks Copilot to critique its own work, which keeps the workflow focused instead of noisy.

For anyone building with AI agents, this is a useful reminder that better outcomes may come not just from a stronger model, but from better system design around review and correction.

If you want to try it, run the /experimental command in Copilot CLI (another great reason to take a closer look at terminal-first software development!). It works when a Claude family model is selected as the orchestrator and access to GPT-5.4 is enabled.

More details: https://msft.it/6041vDPAL

#GitHubCopilot #GitHubCopilotCLI #CopilotCLI #DeveloperTools #AIAgents #CopilotRubberDuck #msftadvocate
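The “invoked sparingly at checkpoints” idea generalizes beyond this feature. A minimal sketch of checkpoint-gated review, with purely illustrative event names (not Copilot CLI internals):

```python
# Sketch of checkpoint-gated review: the reviewer runs only at designated
# high-value moments (plan drafted, implementation done, tests written),
# never on every minor event. All names here are hypothetical.
from typing import Callable, Optional

REVIEW_CHECKPOINTS = {"plan_drafted", "implementation_done", "tests_written"}

def maybe_review(event: str, artifact: str,
                 reviewer: Callable[[str], str]) -> Optional[str]:
    """Invoke the reviewer only when the event is a designated checkpoint."""
    if event in REVIEW_CHECKPOINTS:
        return reviewer(artifact)
    return None  # skip low-value events to keep the workflow quiet

def stub_reviewer(artifact: str) -> str:
    # Stand-in for a call to the second model family.
    return f"reviewed: {artifact}"

print(maybe_review("plan_drafted", "migration plan", stub_reviewer))  # prints reviewed: migration plan
print(maybe_review("file_opened", "main.py", stub_reviewer))          # prints None
```

The design choice to filter by checkpoint, rather than review everything, is what keeps a second model from turning the loop noisy and slow.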