By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
  • Home
  • Products
  • Agents
  • Capital
  • Commerce
Reading: What Is Gemini Spark? Google’s Always-On AI Agent Explained
Sign In
  • Join US
Font ResizerAa
  • Home
  • Products
  • Agents
Search
  • Home
  • Products
  • Agents
  • Capital
  • Commerce
Have an existing account? Sign In
Follow US
> Blog > What Is Gemini Spark? Google’s Always-On AI Agent Explained
Google Gemini Spark always-on AI agent running on cloud virtual machines, illustrated on a phone and laptop

What Is Gemini Spark? Google’s Always-On AI Agent Explained

Surya Koritala
Last updated: June 3, 2026 11:36 pm
By Surya Koritala
28 Min Read
Share
SHARE

Google’s 24/7 agent runs on cloud VMs even with your devices off. Here is what Gemini Spark actually is, what it costs, and how it beats ChatGPT Agent.

Contents
  • What is Gemini Spark, in one paragraph?
  • How does Gemini Spark work? Tasks, Skills, and Schedules
  • How much does Gemini Spark cost? The AI Ultra price
  • Is Gemini Spark safe? The confirmation and audit-trail model
        • Pros
        • Cons
  • Gemini Spark vs the Gemini app: what’s the difference?
  • Gemini Spark vs ChatGPT Agent vs Claude Managed Agents
  • Where Gemini Spark fits: enterprise platform and the Project Mariner lineage
  • Gemini Spark availability: who can use it and when?
  • Builder’s take
  • Frequently asked questions
    • What is Gemini Spark in simple terms?
    • How much does Gemini Spark cost?
    • How does Gemini Spark work?
    • Is Gemini Spark safe to let spend money?
    • What is the difference between Gemini Spark and the Gemini app?
    • How is Gemini Spark different from ChatGPT Agent and Claude Managed Agents?
  • Primary sources

What is Gemini Spark, in one paragraph?

Gemini Spark is Google’s 24/7 personal AI agent, announced at Google I/O 2026 on May 19, that runs continuously on dedicated virtual machines in Google Cloud and takes multi-step actions across your apps even while your phone and laptop are turned off. Unlike the chatbot in the Gemini app, which answers a question and stops, Spark is a persistent agent: you delegate a goal, it works in the background, and it checks in when it needs you. That is the whole pitch in one line, and it is why the question ‘what is Gemini Spark’ has a different answer from ‘what is Gemini.’

Under the hood, Spark runs on the Gemini 3.5 Flash model and Google‘s Antigravity agent harness, which handles long-running planning loops and can spawn sub-agents in parallel. Because the work happens on a cloud VM rather than on your device, a task like ‘scan my Drive every Monday and rebuild this tracker’ keeps running with your laptop closed. That always-on cloud execution is the single biggest thing that separates Spark from the browser-tab agents most competitors shipped this year.

The thin explainers floating around since the keynote stop at ‘it is a 24/7 assistant that costs $100.’ That is true but useless. The rest of this guide covers what they skip: how Spark is wired into your apps, the exact safety model for spending your money, where it overlaps Google’s separate enterprise agent platform, and its lineage from the discontinued Project Mariner.

Google Gemini Spark always-on AI agent running on cloud virtual machines, illustrated on a phone and laptop
Image.

Gemini Spark = a cloud-hosted, always-on agent built on Gemini 3.5 Flash + the Antigravity harness. It connects to Google Workspace through structured APIs (not screen-reading), is organized around Tasks, Skills, and Schedules, requires a Google AI Ultra subscription starting at $100/month, and cannot spend money or email outsiders without your explicit confirmation.

How does Gemini Spark work? Tasks, Skills, and Schedules

Gemini Spark works by connecting to your Google apps through structured API integrations and organizing everything you delegate into three primitives: Tasks, Skills, and Schedules. Understanding those three is understanding Spark, because every workflow you build is some combination of them.

Tasks are the one-off or multi-step jobs you hand off, like ‘find and track interior-design internships in New Orleans’ or ‘scan my Drive and organize files into a tagged spreadsheet.’ A task can chain across Gmail, Docs, and Sheets from a single prompt. Skills are reusable behavioral templates you define once and invoke repeatedly: teach Spark your email tone or how you format a weekly report, and it reuses that pattern instead of forcing you to re-prompt. Per Google, Spark can even distill a Skill by analyzing your past outputs. Schedules are time-based or conditional triggers that fire a task automatically, like a Monday-morning meeting brief or ‘when a contract PDF lands in my inbox, summarize it.’

The mechanism that makes this reliable is the integration method. Spark reaches Gmail, Calendar, Drive, Docs, Sheets, Slides, YouTube, and Maps through structured Workspace APIs, not by reading pixels off the screen. That means it asks the Gmail API for your messages rather than visually parsing a rendered inbox. Structured calls are more predictable, faster, and cheaper than pixel-level automation, and they are far less likely to break when an app’s UI changes. Where Google lacks a native connector, Spark can fall back to acting through Chrome on the web and on Android via a system called Halo, and it can reach third-party tools over MCP.

PrimitiveWhat it isExample
TasksOne-off or multi-step jobs delegated to the agent“Scan my Drive and organize files into a tagged sheet”
SkillsReusable behavioral templates defined once, invoked repeatedly“Always draft replies in my voice and CC my assistant”
SchedulesTime-based or conditional triggers that fire a task“Every Monday 7am, build my week’s meeting brief”
Gemini Spark’s three primitives at a glance
The structured-API choice is the architectural decision that defines Spark. Screen-reading agents see what a human sees; Spark talks to the app’s API directly. That is more reliable but only works whe

How much does Gemini Spark cost? The AI Ultra price

$100/mo

Gemini Spark price floor

via Google AI Ultra (cheapest access tier)

20TB

Storage bundled with AI Ultra

plus YouTube Premium and 5x Gemini Pro limits

US, 18+

Launch availability

trusted testers + AI Ultra subscribers

Gemini Spark requires a Google AI Ultra subscription, which starts at $100 per month and is the cheapest way to get access. There is no standalone Spark tier and no free version at launch. The $100 AI Ultra plan also bundles 20TB of storage, YouTube Premium, and 5x the usage limits of the Gemini Pro plan, so the price is partly a Spark fee and partly a premium-everything bundle.

Google also offers a higher tier at roughly $200/month (reported with a 30TB storage allotment and 20x usage limits) that adds capacity and access to additional experimental features. But for the specific question of the Gemini Spark price, the answer is simple: $100/month via AI Ultra is the floor. If you are comparing that against rivals, ChatGPT’s Workspace Agents are rolling out through business and enterprise plans on a credit-based consumption model rather than a flat consumer fee, and Anthropic’s Claude Managed Agents bill at $0.08 per session-hour plus standard token rates. Those are different pricing philosophies, which we break down in the comparison below.

One caveat worth stating plainly: because Spark runs on dedicated cloud VMs that you do not pay for per-hour, the $100 is an all-you-can-reasonably-use subscription rather than metered compute. That is friendlier for predictable budgeting than pay-per-session pricing, but it also means heavy automation does not get cheaper at scale the way a usage-metered agent might.

Is Gemini Spark safe? The confirmation and audit-trail model

Yes, Gemini Spark is built so it cannot complete a high-stakes action like spending money or emailing an outsider without your explicit confirmation, and every such action is logged in a full audit trail. This is the part the keynote-parroting explainers gloss over, and it is the most important thing to understand before you let an always-on agent touch your inbox or your card.

Concretely: Spark cannot finish a purchase on its own. The confirmation can be a tap in the Gemini app, a spoken ‘yes,’ or a desktop notification you approve, but it has to happen. The same gate applies to sending external email. App connections are off by default, so you explicitly choose which of Gmail, Calendar, Drive and the rest the agent may touch, and you can revoke any of them or disable Spark entirely. On the spending side, the model supports a per-transaction cap, a daily cap, and a category allowlist, so you can tell it groceries are fine, subscriptions need a prompt, and everything else is forbidden.

The audit trail is the other half of the safety story. Every transaction and high-stakes action creates a durable record of what Spark did, when, and with whose approval. If a charge looks wrong, you have a complete log to review and dispute against. Google’s own guidance still recommends caution: do not paste passwords into chat, and think twice before scheduling high-stakes recurring actions, because the model can make mistakes. That candor is healthy. An agent that runs while you sleep should be judged on its guardrails, not its demos.

Pros
  • Cannot spend money or email outsiders without an explicit tap, voice, or desktop ‘yes’
  • Full, durable audit trail of every action: what, when, and who approved it
  • Connections off by default; per-transaction, daily, and category spending limits
  • Structured API access is more auditable than opaque screen-reading
Cons
  • Still experimental and US-only at launch, so real-world failure modes are not fully mapped
  • Google warns against scheduling high-stakes recurring actions due to possible model errors
  • Always-on cloud execution means more standing access to your data than a session-based agent
  • No standalone safety controls outside the $100 AI Ultra subscription

Set a low per-transaction cap and a strict category allowlist before connecting any payment method. Treat the audit trail as your reconciliation log, and never schedule an unattended recurring purchase you would not approve manually each time.

Gemini Spark vs the Gemini app: what’s the difference?

The Gemini app is a chatbot that responds to you in a single session; Gemini Spark is a persistent agent that runs 24/7 on cloud VMs and takes actions across your apps without you watching. They share the Gemini brand and underlying models, but they are different products with different jobs.

When you ask the Gemini app a question, it answers and the interaction ends. It can use tools within that conversation, but it does not keep working after you close the tab, and it does not own a long-lived workspace. Spark does. You delegate a goal, it spins up background work on a Google Cloud VM, and it persists across days through Schedules and Skills. The Gemini app is reactive; Spark is proactive and durable.

Practically, you reach for the Gemini app for research, drafting, and quick answers, and you reach for Spark when you want a standing process: an inbox that triages itself, a tracker that rebuilds weekly, a brief that lands every morning. Spark lives inside the broader Gemini experience and can be emailed directly at a dedicated Gmail address, but it is the agentic layer, not the chat layer.

DimensionGemini app (chatbot)Gemini Spark (agent)
Runs when you’re awayNo — session ends when you stopYes — 24/7 on cloud VMs, device can be off
Primary jobAnswer questions, draft contentExecute multi-step tasks across apps
PersistencePer-conversationDurable via Tasks, Skills, Schedules
Takes real-world actionsLimited, in-sessionYes, with confirmation gates
PriceIncluded in free/Pro tiersRequires AI Ultra ($100/mo)
Gemini Spark vs the Gemini app

Gemini Spark vs ChatGPT Agent vs Claude Managed Agents

The defining difference is how each agent reaches your apps: Gemini Spark uses structured Workspace APIs, ChatGPT Agent/Operator drives a browser by reading and clicking the screen, and Claude Managed Agents run code in a sandboxed cloud runtime. That one architectural fork shapes reliability, cost, and what each can actually do.

Spark’s structured-API approach is the most predictable for Google Workspace work and the only one of the three that runs always-on on dedicated VMs with your device off. ChatGPT’s agent (the lineage that began with Operator) is the most universal because it can operate any website a human can, but pixel-level screen-reading is slower and more brittle when UIs change. Claude Managed Agents sit in the middle for consumers but shine for developers: a single API call spins up a sandboxed, policy-controlled runtime billed at $0.08 per session-hour, which is the cleanest model if you are building rather than delegating personal chores.

On price, the three reflect three philosophies. Spark is a flat $100/month bundle (via AI Ultra). ChatGPT’s Workspace Agents use credit-based, pay-per-use consumption aimed at business and enterprise plans rather than a flat consumer price. Claude Managed Agents meter at $0.08/session-hour plus token costs. None is strictly cheapest; it depends on whether you want predictable subscription billing or pay-for-what-you-use metering.

“The agent wars of 2026 aren’t about who has the smartest model. They’re about who reads your app’s API versus who reads your app’s pixels.”

Alatirok analysis
CapabilityGemini SparkChatGPT Agent / OperatorClaude Managed Agents
Integration methodStructured Workspace APIs (+ browser/MCP fallback)Pixel-level screen-reading of a browserSandboxed cloud runtime running code/tools
Always-on (cloud VM, device off)YesNo — session-basedNo — session-based
Best atGoogle Workspace automationAny website a human can useDeveloper-built, policy-controlled agents
Price modelFlat $100/mo via AI UltraCredit-based pay-per-use (business/enterprise)$0.08/session-hour + token rates
Confirmation for spending/emailRequired, with audit trailPer-product guardrailsDeveloper-configured guardrails
Gemini Spark vs ChatGPT Agent vs Claude Managed Agents (2026)

Where Gemini Spark fits: enterprise platform and the Project Mariner lineage

Gemini Spark is the consumer face of a much larger stack: the same Antigravity harness underpins Google’s separate Gemini Enterprise Agent Platform and its Managed Agents API, which is the right product to evaluate for work, not Spark. Confusing the two is the most common mistake people make after the keynote.

Spark is a polished, opinionated consumer agent for one person’s digital life, gated behind AI Ultra. The Gemini Enterprise Managed Agents API (the developer-facing core, formerly associated with Vertex AI’s agent tooling) is REST-first and configuration-driven: a single API call spins up a custom agent in a Google-hosted secure environment with sandboxed runtimes, configurable network policy, and centralized governance. Teams declare agents in config files, commit them to version control like any infrastructure-as-code artifact, and inherit enterprise data privacy automatically. If your question is ‘should my company use Google for agents,’ the answer lives in that platform, not in Spark.

Spark’s capabilities also have a clear ancestry. Its web-navigation muscle was inherited from Project Mariner, Google DeepMind’s browser-controlling research agent that hit a state-of-the-art 83.5% on the WebVoyager benchmark as a single agent. Mariner was a screen-reading, Gemini 2.0-era prototype that Google quietly shut down on May 4, 2026, folding its tech into Gemini Agent, AI Mode in Search, and Chrome. So Spark is structured-API-first by design, but when it must act on a site without a connector, it leans on the very browser-control IP Mariner pioneered. The pixel-reading approach did not win; it became the fallback under a more reliable architecture.

Spark = personal, always-on, structured-API agent for your own Google life ($100/mo). Gemini Enterprise Managed Agents API = the governed, IaC-style platform for building agents at work. Project Mariner = the discontinued screen-reading ancestor whose 83.5%-WebVoyager browser skills now serve as Spark’s fallback.

Gemini Spark availability: who can use it and when?

At launch, Gemini Spark is available to trusted testers and to Google AI Ultra subscribers aged 18+ in the United States, with broader availability rolling out over time. It is an experimental beta, not a finished general-availability product, so expect rough edges and a moving feature set.

Day-one functionality centers on inbox management, meeting briefs, recurring task automation, and a small set of third-party MCP integrations (Canva, OpenTable, and Instacart were named), with Google promising to progressively add more connectors. You access Spark through gemini.google or the Gemini app once your AI Ultra subscription is active. There is no Android-only or iOS-only gate beyond the regional and subscription requirements, though the Android experience adds the Halo progress-tracking system.

If you are outside the US or under 18, you are waiting for now. And if you are an organization rather than an individual, the relevant rollout to watch is the Gemini Enterprise Agent Platform, which targets business users on a different track than the consumer Spark beta.

Availability snapshot: US only, 18+, requires Google AI Ultra ($100/mo), experimental beta. Reach it at gemini.google or in the Gemini app. International and younger-user access had not been announced

Builder’s take

I build agent infrastructure for a living at Cyntr and Loomfeed, so the part of the Spark launch that matters to me is not the demo. It is the architecture choice. Most consumer agents this year ship a screen-reading harness because it is fast to demo and works on any app. Google did the harder thing.

  • The structured-API bet is the real story. Reading the Gmail API beats reading pixels of Gmail every time on reliability, latency, and cost. The trade is coverage: Spark only goes where Google has a connector, which is why MCP and the Mariner-style browser fallback still matter.
  • Spark is the consumer skin on a much bigger stack. The same Antigravity harness powers the Gemini Enterprise Managed Agents API. If you are evaluating Google for agents at work, do not evaluate Spark. Evaluate the Managed Agents API, where you get sandboxed runtimes, network policy, and infrastructure-as-code.
  • The confirmation-plus-audit-trail model is the right default and I wish more vendors copied it. An agent that physically cannot spend money without a tapped ‘yes’ and that logs every action is far easier to trust than one that promises to behave.
  • Watch the lineage. Spark inherited Project Mariner’s web-navigation IP after Mariner hit 83.5% on WebVoyager and then got shut down. The browser-control muscle did not disappear; it became the fallback under a structured-first agent. That is the pattern to copy.

Frequently asked questions

What is Gemini Spark in simple terms?

Gemini Spark is Google’s always-on AI agent, announced at Google I/O 2026. It runs 24/7 on cloud virtual machines and completes multi-step tasks across your apps (like Gmail, Docs, and Sheets) even when your phone or laptop is off. Unlike the regular Gemini chatbot, which answers and stops, Spark keeps working in the background using Tasks, Skills, and Schedules.

How much does Gemini Spark cost?

Gemini Spark requires a Google AI Ultra subscription starting at $100 per month, which is the cheapest way to access it. That plan also bundles 20TB of storage and YouTube Premium. A higher tier around $200/month adds more capacity and usage. There is no free Spark tier at launch.

How does Gemini Spark work?

Spark runs on the Gemini 3.5 Flash model and Google’s Antigravity harness on dedicated Google Cloud VMs. It connects to your Google apps through structured APIs rather than screen-reading, and organizes work into three primitives: Tasks (jobs you delegate), Skills (reusable templates), and Schedules (time-based or conditional triggers that fire tasks automatically).

Is Gemini Spark safe to let spend money?

Spark cannot complete a purchase or send an external email without your explicit confirmation, via a tap, voice ‘yes,’ or desktop approval. App connections are off by default, you can set per-transaction, daily, and category spending limits, and every action is recorded in a full audit trail you can review and dispute against.

What is the difference between Gemini Spark and the Gemini app?

The Gemini app is a chatbot that answers within a single session and stops. Gemini Spark is a persistent agent that runs 24/7 on cloud VMs and takes actions across your apps without you watching. Use the app for research and drafting; use Spark for standing processes like a self-triaging inbox or a weekly tracker rebuild.

How is Gemini Spark different from ChatGPT Agent and Claude Managed Agents?

Spark reaches your apps through structured Workspace APIs and runs always-on on cloud VMs. ChatGPT Agent/Operator drives a browser by reading the screen pixel-by-pixel, which is universal but more brittle. Claude Managed Agents run in a sandboxed cloud runtime billed at $0.08 per session-hour, aimed more at developers building agents than at delegating personal tasks.

Primary sources

  • Gemini Spark — official overview — Google
  • Google introduces Gemini Spark, a 24/7 agentic assistant with Gmail integration — TechCrunch
  • Gemini Spark: Google’s Always-On AI Agent Explained — DataCamp
  • 100 things we announced at Google I/O 2026 — Google
  • Google’s agentic AI tool Gemini Spark is now available — here’s how to try it — Yahoo Tech
  • Project Mariner WebVoyager 83.5% state-of-the-art result — Google DeepMind
  • Project Mariner — Wikipedia
  • Claude Managed Agents Pricing and Beta Limits — WaveSpeed

Last updated: June 3, 2026. Related: Products.

Foundry Agent Service vs Bedrock AgentCore (2026)
AI Agent Consolidation: Why Most Startups Will Merge by 2028
What Happens to QA Engineers When Devin Is Good Enough
Open-Weight Models for Agents: 2026 Comparison
AI Agent Audit Logs: The Fields Compliance Requires
TAGGED:AI AgentsAI UltraAntigravityChatGPT AgentClaude Managed AgentsGemini SparkGoogle GeminiGoogle I/O 2026Project Mariner
Share This Article
Facebook Email Copy Link Print
Leave a Comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

More Popular from Alatirok

Dashboard visualizing token consumption per agentic coding task across frontier AI models
Observability

Tokens Per Agentic Coding Task: The 2026 Variance Data

By Surya Koritala
21 Min Read
What Is Cognition Devin? The Enterprise Guide for

What Is Cognition Devin? The Enterprise Guide for 2026

By Surya Koritala
Diagram of an AI agent holding a USDC wallet with spending-limit guardrails enforced before an onchain transfer
Commerce

What Is Circle Agent Stack? USDC Wallets for AI Agents

By Surya Koritala
24 Min Read
Identity & Provenance

AI Agent Identity: Entra Agent ID vs Okta vs SailPoint

AI agent identity governance, Entra vs Okta vs SailPoint: a 2026 buyer matrix on what each…

By Surya Koritala
Observability

Why Does My AI Agent Context Window Fill Up So Fast?

Why does my AI agent context window fill up so fast? Tool definitions eat two-thirds of…

By Surya Koritala
Agent Infrastructure

Migrate OpenAI Agent Builder to Agents SDK Before Nov 30

A hands-on tutorial to migrate OpenAI Agent Builder to Agents SDK before the Nov 30, 2026…

By Surya Koritala
Agent Infrastructure

Best Voice AI Agent Framework 2026: Vapi vs LiveKit vs Pipecat

The best voice AI agent framework 2026 depends on your call volume. Our neutral ranking covers…

By Surya Koritala

Purpose-Built Legal AI vs General LLM: 2026 Verdict

Purpose-built legal AI vs general LLM, settled with real 2026 benchmark data: where ChatGPT and Claude…

By Surya Koritala

what’s actually being built in AI agents, who’s building it, and why it matters. Independent. Opinionated.

Categories

  • Home
  • Products
  • Agents
  • Capital
  • Commerce

Quick Links

  • Home
  • Products
  • Agents

© Alatirok by Loomfeed. All Rights Reserved.

Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?