By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
  • Home
  • Products
  • Agents
  • Capital
  • Commerce
Reading: Copilot Studio Computer-Using Agent: Build One in 2026
Sign In
  • Join US
Font ResizerAa
  • Home
  • Products
  • Agents
Search
  • Home
  • Products
  • Agents
  • Capital
  • Commerce
Have an existing account? Sign In
Follow US
> Blog > Copilot Studio Computer-Using Agent: Build One in 2026
A Copilot Studio computer-using agent driving a web browser and desktop app through a virtual mouse and keyboard

Copilot Studio Computer-Using Agent: Build One in 2026

Surya Koritala
Last updated: June 2, 2026 11:27 pm
By Surya Koritala
25 Min Read
Share
SHARE

A guided, click-by-click build of a generally available Copilot Studio computer-using agent that automates web and desktop UIs, with real config, model choice, credentials, and cost math.

Contents
  • What is a Copilot Studio computer-using agent?
  • What do you need before building a computer-using agent?
  • How do you add the computer use tool to an agent?
  • Which model should you pick for computer use in 2026?
  • Where does a computer-using agent actually run?
        • Pros
        • Cons
  • How do you handle credentials and keep the agent safe?
  • How do you write instructions and test the agent?
  • How much does a computer-using agent cost to run?
    • Worth building — for the narrow, API-less workflows it’s actually for
  • Builder’s take
  • Frequently asked questions
    • Is the Copilot Studio computer-using agent generally available?
    • What’s the difference between computer use and a regular Copilot Studio connector?
    • Which models can a computer-using agent use?
    • Do I need my own machine to run a computer-using agent?
    • How much does it cost to run computer use?
    • How do I keep a computer-using agent secure?
  • Primary sources

What is a Copilot Studio computer-using agent?

A Copilot Studio computer-using agent is an AI agent that operates websites and Windows desktop apps the way a person does — moving a virtual mouse, typing on a virtual keyboard, and reading the screen — instead of calling an API. Microsoft made this capability, branded as the “computer use” tool, generally available as part of the Copilot Studio May 2026 update, expanding it from preview to all commercial Power Platform geographies. It first appeared in preview in September 2025 for Windows desktop apps; the 2026 release adds model choice, enterprise credential storage, and audit logging.

Under the hood, computer use is powered by Computer-Using Agents (CUA): models that pair vision with reasoning to navigate graphical interfaces. The key design decision is that it does not rely on hard-coded selectors. When a vendor moves a button or redesigns a page, a brittle RPA script breaks; a computer-using agent looks at the new screen and figures out the next click. That resilience is the entire pitch — and the reason it is metered per step rather than per run.

This tutorial walks through an actual build: adding the tool, picking a model, wiring up a machine, storing credentials, testing, and publishing. Where the experience is genuinely low-code, I’ll name the exact clicks. Where it has real teeth — cost, security, machine setup — I’ll be blunt about what you’re signing up for.

A Copilot Studio computer-using agent driving a web browser and desktop app through a virtual mouse and keyboard
Image.

What do you need before building a computer-using agent?

Before you build a Copilot Studio computer-using agent you need three things: an agent with generative orchestration turned on, a licensing plan that supplies Copilot Credits, and a Windows machine (or a Microsoft-hosted one) for the agent to drive. Computer use is only available on agents that have generative orchestration enabled — it’s the setting that lets the model choose tools dynamically rather than follow a fixed topic tree.

On licensing, computer use bills through the Agent action feature and consumes Copilot Credits per step. A “step” is one unit of the model’s plan — launch the browser, click a button, fill a field — and each may bundle several low-level actions. Standard models cost 5 Credits per step; premium models cost 15. Budget in steps, not tasks, because a real-world form fill is rarely fewer than four or five steps.

On compute, the agent needs a Windows surface. Microsoft now offers three options, two of which are still in preview, covered in the machine section below. If you want to use an Anthropic model, your Power Platform admin must also turn on access to external models for the environment.

RequirementDetailWhere to set it
Generative orchestrationMust be ON for the agent — computer use is unavailable otherwiseAgent Settings > Orchestration
Copilot Credits5 Credits/step (standard) or 15/step (premium); billed via Agent actionTenant licensing / capacity
A Windows machineHosted browser, Cloud PC pool, or bring-your-own (BYO)Tool config > Machine
External model accessRequired to use Anthropic Claude models in the pickerPower Platform admin center
Computer-using agent prerequisites at a glance

How do you add the computer use tool to an agent?

To add computer use, open your agent’s Tools page, select Add tool, choose New tool, then pick Computer use — and configure four required fields: Name, Description, Model, and Instructions. This is the genuinely low-code part of a Copilot Studio computer-using agent, and Microsoft ships instruction templates so you’re not starting from a blank box.

Here is the exact path, click by click, straight from the product:

Generative orchestration is the gate. If the Computer use tile is missing from the New tool list, your agent almost certainly has generative orchestration turned off. Flip it on in the agent’s orchestration settings and the tile appears.

Step-by-step: add and configure the tool1. Go to the Tools page for your agent and select Add tool. 2. In Add tool, select New tool. 3. Select Computer use. 4. Provide the natural-language instructions describing the task (or start from a template). 5. Select Add and configure. 6. On the configuration page, fill the four required fields: Name (a display name to tell this tool apart from others), Description (a short note on what it does and when to use it — the agent reads this to decide when to call the tool), Model (the picker is covered below), and Instructions (the numbered steps, including full URLs and exact app names). 7. Select Save.

Which model should you pick for computer use in 2026?

At GA, two models are generally available for computer use — OpenAI’s Computer-Using Agent (CUA) and Anthropic’s Claude Sonnet 4.5, both standard tier at 5 Credits per step — while Claude Sonnet 4.6 (standard) and Claude Opus 4.6 (premium) are listed as Experimental. The Model field in the tool config is a dropdown; the choice is a direct cost-and-capability tradeoff for your Copilot Studio computer-using agent.

The practical guidance: start on a generally available standard model for anything you intend to ship. Reserve the Experimental Opus 4.6 — at triple the Credit cost — for genuinely hard, ambiguous UIs where reasoning quality earns back the spend. Don’t put an Experimental model on a production run you can’t babysit.

Status labels move fast. Sonnet 4.6 and Opus 4.6 are Experimental as of the late-May 2026 docs. Re-check the live model table in the tool before promoting anything Experimental into a production agent.

ProviderModelTierStatusCredits per step
OpenAIComputer-Using Agent (CUA)StandardGenerally available5
AnthropicClaude Sonnet 4.5StandardGenerally available5
AnthropicClaude Sonnet 4.6StandardExperimental5
AnthropicClaude Opus 4.6PremiumExperimental15
Computer use models in Copilot Studio (per Microsoft Learn, May 2026)

Where does a computer-using agent actually run?

A Copilot Studio computer-using agent runs on a Windows surface you select in the Machine field: a Microsoft-managed Hosted browser (preview), a Cloud PC pool (preview), or a bring-your-own machine you register in Power Automate. This is the single most consequential setup choice, because it determines security posture, what resources the agent can reach, and whether you’re production-ready.

The Hosted browser, powered by Windows 365 for Agents, is the zero-setup option: Microsoft-managed Edge plus built-in Windows apps. It is excellent for experimentation but explicitly not recommended for production — it isn’t Entra-joined to your tenant, isn’t governed by your Intune policies, and may be throttled. The Cloud PC pool, also preview and powered by Windows 365 for Agents, fixes that: it auto-scales, is Entra-joined and Intune-enrolled, and can reach Microsoft 365, SharePoint, and Azure resources tied to your org.

The bring-your-own-machine path is the generally available, production-grade route. You install Power Automate for desktop (version 2.61.132.25266 or later, including the web extension), register the machine via the Power Automate machine runtime app, then enable it for computer use.

Pros
  • Hosted browser: zero setup, instant experimentation
  • Cloud PC pool: auto-scaling, Entra-joined, Intune-governed, reaches org resources
  • BYO machine: generally available, full control, production-ready
  • Dedicating a machine avoids interruptions and run-queue contention
Cons
  • Hosted browser is preview, not for production, and can be throttled
  • Cloud PC pool is still preview as of May 2026
  • BYO requires Power Automate for desktop install and version management
  • Same-machine runs queue sequentially — plan capacity for concurrency
Bring-your-own machine: register and enableRegister: 1) Install the latest Power Automate for desktop, checking ‘Install the machine-runtime app to connect to the Power Automate cloud portal.’ 2) Open the Power Automate machine runtime app and sign in. 3) Register the machine to the target environment. Enable: 1) Sign in to make.powerautomate.com. 2) Go to Machines. 3) Select your registered machine. 4) On the machine detail page, select Settings. 5) Turn on ‘Enable for computer use.’ 6) Select Save. Runs targeting the same machine are queued and executed sequentially — watch the Run queue tab to monitor backlog.

How do you handle credentials and keep the agent safe?

Store sign-in credentials in the tool’s Stored credentials section — either Power Platform internal encrypted storage or your own Azure Key Vault — and lock the agent down with an access-control allow list, a dedicated least-privilege machine, and human supervision routing. Security is not a footnote for a computer-using agent; it is a live identity with a mouse, so treat it like one.

For secrets, internal storage needs no preconfiguration: pick a Type (Website or Desktop app), enter Username and Password, and a Login domain (e.g. *.contoso.com, wildcards allowed) or desktop app process name (e.g. msedge — find it in Task Manager under Processes). The Azure Key Vault option requires the PowerPlatform resource provider registered on the subscription, then you supply subscription ID, resource group, vault name, the username, and the Azure secret name.

Two more controls matter. Access control lets you restrict the agent to specific URLs and apps — but note its limit: it blocks actions on non-allowed sites, it does not stop the model from opening them. And Human supervision routes an email (via Outlook) to a designated reviewer if the agent detects potentially harmful instructions; set a response time limit, after which the request expires and the run stops.

Don’t confuse the two. ‘Credentials to use’ (maker-provided vs. end-user) controls who the agent acts AS on the machine. ‘Stored credentials’ are the website/app logins it types during a run. Maker-provided credentials on a shared agent mean anyone using it acts with YOUR access — choose deliberately.

“A computer-using agent is a live identity with a mouse. Give it a dedicated machine, a least-privilege account, and an allow list — or don’t give it your password at all.”

Build a computer-using agent the safe way

How do you write instructions and test the agent?

Write instructions like you’re briefing a careful colleague — full URLs, exact app names, explicit actions, numbered steps — then hit Test to watch the agent reason and act in real time before you publish. The quality of your Copilot Studio computer-using agent is mostly the quality of these instructions; vague prompts produce vague clicking.

Microsoft’s own best practices: be specific about sites and apps (include the full URL), state actions explicitly (“Once you fill in the form, select Submit. No need to ask for permission”), break down complex UI navigation step by step, and use list formatting for longer tasks. You can also define Inputs — dynamic values merged into the instructions at run time, so the same agent fills a form with different data on each run. For extraction, you can tell the agent to return values as plain text or as structured JSON to hand off to another tool.

When you test, the experience splits the screen: the left panel shows your instructions plus a step-by-step log of the agent’s reasoning and actions; the right panel previews the live machine. A Test completed message marks the end, and Stop testing halts all machine actions immediately. Iterate on the instructions until the log matches your intent — that’s the loop.

Scenario: extract portfolio data and return JSON (Microsoft sample)

Navigate to https://computerusedemos.blob.core.windows.net/web/Portfolio/index.html,
retrieve the portfolio details for all Contoso entities, and return the
results as a valid JSON object.

Structure the output so that:
*  Each top-level key is the client name
*  Each value contains the client's portfolio ID, portfolio value,
   portfolio manager, and last updated date (format: YYYY-MM-DD)
*  Return only the JSON, with no additional text.

# Tip: pair this computer use tool with an email tool, then in the AGENT
# instructions say: "Use computer use to extract the portfolio data and
# email the extracted JSON." Both tools must be added to the agent.
Copilot Studio Tools page Add tool New tool
Copilot Studio Tools page Add tool New tool
Copilot Studio computer use configuration page model picker
Copilot Studio computer use configuration page model picker
Copilot Studio computer use test experience split screen reasoning log
Copilot Studio computer use test experience split screen reasoning log
Power Automate machine runtime Enable for computer use setting
Power Automate machine runtime Enable for computer use setting

How much does a computer-using agent cost to run?

Worth building — for the narrow, API-less workflows it’s actually for

Copilot Studio’s computer-using agent is genuinely useful and genuinely generally available (May 2026) for automating web and desktop UIs that have no API. The build is low-code and fast: add the tool, pick a GA model, point it at a machine, store credentials, test, publish. But it is not a cheap, fire-and-forget RPA replacement. Per-step Credit metering, preview-status hosted compute, and a model that can click the wrong thing mean you should scope it to high-value, API-less tasks, harden the machine, and budget in steps. Build it where there’s no API. Reach for a connector or MCP everywhere else.

Each computer use step costs 5 Copilot Credits on a standard model or 15 on a premium model — so a four-step form fill is 20 Credits standard, 60 premium, and a typical multi-screen task can run 30+ Credits before you’ve automated anything substantial. Microsoft’s own worked example: launch the browser and open a portal, click Create new time sheet, fill three fields, and click Submit — four steps, 20 Credits standard.

The math compounds with task complexity, not task count. A login flow, a search, a few clicks of navigation, and an extraction can be a dozen steps before the ‘real’ work starts. That’s why model choice and instruction quality are cost levers, not just quality levers: tighter instructions mean fewer wasted steps, and reserving premium models for genuinely hard UIs keeps the multiplier off your routine runs.

Before you scale a computer-using agent across a team, estimate steps-per-run for your top three workflows, multiply by your daily volume, and put a real number against it. UI automation is powerful precisely because it works where APIs don’t — but you pay for every screen the model has to look at.

Copilot Credits per computer use run, by steps and model tier
Credit cost scales linearly with steps; premium models triple the bill. Source: Microsoft Learn computer use licensing.

Builder’s take

I build agent runtimes for a living (Cyntr) and a discussion platform (Loomfeed), so I read this release as a pipeline person, not a Microsoft fan. Three honest reactions:

  • The pricing model is the real story. Five Copilot Credits per step (15 on premium) means a chatty 30-step run is a budget line, not a rounding error. Vision-driven UI automation is inherently step-heavy. Cost your runs in steps before you cost them in tasks.
  • The ‘no API needed’ pitch is true and dangerous. Computer use is the right tool only when there’s genuinely no API. If a REST endpoint exists, a connector or MCP call is cheaper, faster, and far more deterministic than a model squinting at a screenshot.
  • Treat the agent like a junior contractor with your password. Dedicated machine, least-privilege account, an access-control allow list, and Key Vault for secrets are not optional hardening — they’re the baseline that makes this safe to publish.

Frequently asked questions

Is the Copilot Studio computer-using agent generally available?

Yes. Microsoft made computer use in Copilot Studio generally available as part of the May 2026 update, expanding it to all commercial Power Platform geographies. It launched in preview in September 2025. Note that some related pieces — the hosted browser, Cloud PC pool, the redesigned workflows experience, and the new orchestration layer — were still labeled preview or early-release at the time of writing.

What’s the difference between computer use and a regular Copilot Studio connector?

A connector or MCP tool calls an application’s API directly — fast, deterministic, and cheap. Computer use drives the application’s user interface with a virtual mouse and keyboard, using vision and reasoning, and is the right tool only when there is no API to connect to. If an API exists, prefer the connector; computer use costs Credits per step and is less deterministic.

Which models can a computer-using agent use?

Per Microsoft Learn (May 2026), OpenAI’s Computer-Using Agent (CUA) and Anthropic’s Claude Sonnet 4.5 are generally available at the standard tier (5 Credits/step). Claude Sonnet 4.6 (standard) and Claude Opus 4.6 (premium, 15 Credits/step) are listed as Experimental. Using Anthropic models requires your admin to enable external model access for the environment.

Do I need my own machine to run a computer-using agent?

Not necessarily. You can use the Microsoft-managed Hosted browser (preview, not for production) or a Cloud PC pool (preview, Entra-joined and Intune-enrolled) powered by Windows 365 for Agents. For production today, the generally available path is bring-your-own-machine: install Power Automate for desktop, register the machine, and turn on ‘Enable for computer use.’

How much does it cost to run computer use?

Computer use bills through the Agent action feature in Copilot Credits. Each step costs 5 Credits on a standard model or 15 on a premium model. Microsoft’s example of a four-step time-sheet submission costs 20 Credits on a standard model or 60 on a premium one. Budget by estimating steps per run, since complex UIs are step-heavy.

How do I keep a computer-using agent secure?

Use a dedicated, least-privilege machine; store secrets in Azure Key Vault or Power Platform internal storage; enable access control with an allow list of approved URLs and apps; and configure human supervision so a reviewer is emailed if the agent detects potentially harmful instructions. Remember access control blocks actions on non-allowed sites but doesn’t stop the model from opening them, and ‘maker-provided credentials’ on a shared agent let others act with your access.

Primary sources

  • What’s new in Copilot Studio: May 2026 updates and features — Microsoft Copilot Blog
  • Computer-using agents in Microsoft Copilot Studio are now generally available — Microsoft Community Hub
  • Automate web and desktop apps with computer use — Microsoft Learn
  • Configure where computer use runs — Microsoft Learn
  • What’s new in Power Platform: May 2026 feature update — Microsoft Power Platform Blog

Last updated: June 2, 2026. Related: Products.

LangChain alternatives 2026 — answers to what every builder is asking
Microsoft Agent 365 Cost: The Real All-In Per-Agent Math
MCP Server Statistics 2026: How Many Actually Exist
AI Agents by Industry: Where Adoption Is Landing in 2026
Best AI Agent Authentication Platforms 2026
TAGGED:AI AgentsComputer UseCopilot StudioMicrosoftPower PlatformTutorialUI Automation
Share This Article
Facebook Email Copy Link Print
Leave a Comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

More Popular from Alatirok

Reference architecture diagram showing an AI agent calling a website's NLWeb /ask endpoint, which extracts Schema.org JSON-LD into a vector store and exposes an MCP server
Agent Infrastructure

What Is NLWeb? Microsoft’s Agentic Web Protocol Explained

By Surya Koritala
28 Min Read
What Is Cognition Devin? The Enterprise Guide for

What Is Cognition Devin? The Enterprise Guide for 2026

By Surya Koritala
An AI agent connected to a virtual credit card with a spending limit gauge, illustrating agentic commerce controls in 2026
Commerce

How to Give an AI Agent a Credit Card With a Spending Limit

By Surya Koritala
31 Min Read
Agent Infrastructure

Azure Agent Mesh Tutorial: Deploy a Federated Agent

This azure agent mesh tutorial is the first hands-on deploy: target the Mesh with Agent Framework…

By Surya Koritala
Capital

LLM Long-Context Pricing Surcharge 2026: The Cliff Mapped

Long-context pricing surcharge: The LLM long context pricing surcharge 2026 doubles your whole request the moment…

By Surya Koritala

What Is Claude Cowork? Architecture, Cost, and Limits

What is Claude Cowork? A technical, vendor-neutral guide to its sandbox architecture, real per-seat plus API…

By Surya Koritala
Commerce

Best AI Agent Marketplaces 2026: Where to Sell Agents

The best AI agent marketplaces 2026 ranked by audience, listing model, and revenue share — AgentExchange,…

By Surya Koritala

Best AI Coding CLI 2026: Claude Code vs Codex vs Antigravity

The best AI coding CLI 2026 comes down to Claude Code, Codex CLI, and Antigravity CLI.…

By Surya Koritala

what’s actually being built in AI agents, who’s building it, and why it matters. Independent. Opinionated.

Categories

  • Home
  • Products
  • Agents
  • Capital
  • Commerce

Quick Links

  • Home
  • Products
  • Agents

© Alatirok by Loomfeed. All Rights Reserved.

Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?