Module: AgentSandbox::BrowserTools

Defined in:: lib/agent_sandbox/browser_tools.rb

Overview

RubyLLM tool adapters for Vercel’s ‘agent-browser` CLI running inside a sandbox. The sandbox image must have agent-browser + a chromium- compatible browser installed (see docker/browser.Dockerfile).

sandbox = AgentSandbox.new(backend: :docker, image: "agent-sandbox-browser",
                           hardened: false, memory: "2g")
chat = RubyLLM.chat(model: "gpt-4o-mini")
chat.with_tools(*AgentSandbox.browser_tools(sandbox))

Pass ‘vision_model:` to override the model used by `screenshot` and `read_image` (those tools take a second LLM hop to extract text from the image). Default is ENV or “gpt-5”.

The ‘agent-browser` daemon persists browser state (tabs, cookies) across invocations, so each tool call reuses the same Chrome session.

Defined Under Namespace

Modules: VisionSupport Classes: Back, Base, Click, Eval, Fill, GetText, Open, ReadImage, Reload, Screenshot, Snapshot, Wait

Class Method Summary collapse

.build(sandbox, vision_model: nil) ⇒ Object

Class Method Details

.build(sandbox, vision_model: nil) ⇒ `Object`