Get Your Brand Cited by LLMs With Wellows Try Now!

Google Launches Gemini 2.5 Computer Use Model for Browser Automation

  • October 9, 2025
    Updated
google-launches-gemini-2-5-computer-use-model-for-browser-automation

Google DeepMind unveiled the Gemini 2.5 Computer Use model, a browser-ready agent that can click, type, scroll, and operate behind logins. It’s now in public preview through the Gemini API in Google AI Studio and Vertex AI.

The model introduces a new computer_use tool running in a perception–action loop: analyzing screenshots, proposing actions, requesting confirmation for high-risk steps, and iterating until completion.


📌 Key Takeaways

  • Public preview via the Gemini API in Google AI Studio and Vertex AI.
  • Built on Gemini 2.5 Pro for visual reasoning and UI control.
  • Beats competitors on Online-Mind2Web, WebVoyager, and AndroidWorld.
  • Focused on web browsers, with mobile support in progress.
  • Safety guardrails include step checks and user confirmations.


What Google Announced

The model enables agents to handle real web UIs when APIs are missing. It can fill forms, use filters, and navigate authenticated flows to complete tasks end-to-end.


“We are releasing the Gemini 2.5 Computer Use model… built on Gemini 2.5 Pro’s visual understanding and reasoning capabilities.”

How It Works

Developers call the Computer Use tool inside an agent loop. Inputs are the user request, screenshot, and action history, returning function calls like click or type. Sensitive steps can force user confirmation.


Performance And Benchmarks

Google reports top performance on Online-Mind2Web, WebVoyager, and AndroidWorld. On Browserbase, it shows lower latency with higher accuracy, supported by benchmark tables and latency–quality plots.


Safety Model And Controls

The System Card outlines risks like misuse or prompt-injection. Guardrails include a safety service reviewing each step and developer options to require confirmations for high-stakes actions.


Safety runs alongside inference: every action is checked, and sensitive steps require explicit consent.

Early Uses And Integrations

The model powers Project Mariner, the Firebase Testing Agent, and AI Mode in Search. Internally, teams use it for UI testing to shorten development cycles.


How To Try It

You can test it now in Google AI Studio or Vertex AI. Google provides a Browserbase demo to see it in action, plus sample code for Playwright or cloud VM setups.


Why It Matters

Many business workflows exist only in web UIs. This model offers a safe screenshot-to-action loop, helping developers build production agents when APIs fall short.


Conclusion

Gemini 2.5 Computer Use moves AI beyond text and APIs into hands-on UI control. With safety checks, speed, and public access, it’s a step toward reliable, governable agents ready for real work.


For the recent AI News, visit our site.


If you liked this article, be sure to follow us on X/Twitter and also LinkedIn for more exclusive content.

Was this article helpful?
YesNo
Generic placeholder image
Articles written 878

Khurram Hanif

Reporter, AI News

Khurram Hanif, AI Reporter at AllAboutAI.com, covers model launches, safety research, regulation, and the real-world impact of AI with fast, accurate, and sourced reporting.

He’s known for turning dense papers and public filings into plain-English explainers, quick on-the-day updates, and practical takeaways. His work includes live coverage of major announcements and concise weekly briefings that track what actually matters.

Outside of work, Khurram squads up in Call of Duty and spends downtime tinkering with PCs, testing apps, and hunting for thoughtful tech gear.

Personal Quote

“Chase the facts, cut the noise, explain what counts.”

Highlights

  • Covers model releases, safety notes, and policy moves
  • Turns research papers into clear, actionable explainers
  • Publishes a weekly AI briefing for busy readers

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *