Skip to main content
The AI can control any browser session in your space, navigating to URLs, clicking elements, typing text, taking screenshots, and reading page content. Once a tab is open, the same actions work whether the tab lives in the Kazzle Browser, a cloud browser session, or Chrome/Brave connected via the extension.

How it works

  1. The AI opens (or reuses) a browser session on a specific computer
  2. It navigates to the target URL
  3. It reads the page content or takes a screenshot
  4. It clicks, types, scrolls, or waits as needed
  5. Results flow back into the thread
You can watch the AI work in real time. Cloud browser sessions expose a live view URL so you see exactly what the AI sees; on your device, you watch directly in the built-in browser tab.

What the AI can do

ActionDescription
NavigateGo to any URL
Read contentGet page text (preferred, cheap and fast)
ScreenshotCapture the visible page (use sparingly, expensive)
ClickClick by CSS selector or x/y coordinates
TypeType into the focused element
ScrollScroll up or down
WaitPause up to 10 seconds (fixed delay)
Read elementsExtract interactive elements for structured navigation, then click with the returned selector

Handoff between you and the AI

Sessions are shared. The AI can target a tab you opened, and you can pick up where the AI left off.
  • You open a tab. The AI can see it and interact with it.
  • The AI opens a session. Watch in the built-in browser tab, or via the live view URL for cloud sessions.
  • Extension sessions. Your Chrome window is the AI session. The AI navigates the same browser you see.

Which browser the AI uses

The AI targets a specific computer, and that determines the browser:
  • Your device - the built-in Kazzle Browser, or your connected Chrome/Brave extension session when one is active
  • A browser computer - cloud browser sessions, useful for automation that should keep running without your device