Browser

Domain: browser

Primary browser control and DOM interaction domain; the usual entry point for most workflows.

Profiles

  • workflow
  • full

Typical scenarios

  • Navigate pages
  • Interact with the DOM and capture screenshots
  • Work with tabs and storage

Common combinations

  • browser + network
  • browser + hooks
  • browser + workflow

Representative tools

  • get_detailed_data — Retrieve detailed data using detailId token.
  • browser_attach — Attach to an existing browser instance via Chrome DevTools Protocol (CDP).
  • browser_list_tabs — List all open tabs/pages in the connected browser.
  • browser_select_tab — Switch the active tab/page by index or URL/title pattern.
  • browser_launch — Launch browser instance.
  • browser_close — Close browser instance
  • browser_status — Get browser status (running, pages count, version)
  • page_navigate — Navigate to a URL
  • page_reload — Reload current page
  • page_back — Navigate back in history
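
Several tool descriptions on this page recommend verifying that an element exists before clicking it. The following is a minimal Python sketch of that query-before-click pattern. Only the tool names come from this page: the `call_tool` callable, the argument names (`url`, `selector`), the `found` result key, and the `FakeBrowser` stand-in are assumptions made for illustration, not the server's documented schema.

```python
from typing import Any, Callable, Dict, List, Tuple

# Hypothetical transport: sends (tool_name, arguments) and returns a result dict.
# The real client and its argument/result shapes may differ.
ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def click_if_present(call_tool: ToolCaller, url: str, selector: str) -> bool:
    """Navigate to `url`, then click `selector` only if it is found."""
    call_tool("page_navigate", {"url": url})  # "url" is an assumed argument name
    result = call_tool("dom_query_selector", {"selector": selector})
    if not result.get("found"):  # "found" is an assumed result key
        return False
    call_tool("page_click", {"selector": selector})
    return True


class FakeBrowser:
    """In-memory stand-in for the server so the sketch runs without a browser."""

    def __init__(self, existing_selectors: List[str]) -> None:
        self.existing = set(existing_selectors)
        self.calls: List[Tuple[str, Dict[str, Any]]] = []

    def __call__(self, name: str, args: Dict[str, Any]) -> Dict[str, Any]:
        # Record every call so the order of operations can be inspected.
        self.calls.append((name, args))
        if name == "dom_query_selector":
            return {"found": args["selector"] in self.existing}
        return {"ok": True}


if __name__ == "__main__":
    fake = FakeBrowser(["#submit"])
    print(click_if_present(fake, "https://example.com", "#submit"))
    print([name for name, _ in fake.calls])
```

Passing the transport in as a callable keeps the pattern independent of whichever client actually talks to the server; swapping `FakeBrowser` for a real caller should not require changing `click_if_present`.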

Full tool list (63)

Tool | Description
get_detailed_data | Retrieve detailed data using detailId token.
browser_attach | Attach to an existing browser instance via Chrome DevTools Protocol (CDP).
browser_list_tabs | List all open tabs/pages in the connected browser.
browser_select_tab | Switch the active tab/page by index or URL/title pattern.
browser_launch | Launch browser instance.
browser_close | Close browser instance
browser_status | Get browser status (running, pages count, version)
page_navigate | Navigate to a URL
page_reload | Reload current page
page_back | Navigate back in history
page_forward | Navigate forward in history
dom_query_selector | Query single element (like document.querySelector). AI should use this BEFORE clicking to verify element exists.
dom_query_all | Query all matching elements (like document.querySelectorAll)
dom_get_structure | Get page DOM structure (for AI to understand page layout).
dom_find_clickable | Find all clickable elements (buttons, links). Use this to discover what can be clicked.
page_click | Click an element. Use dom_query_selector FIRST to verify element exists.
page_type | Type text into an input element
page_select | Select option(s) in a <select> element
page_hover | Hover over an element
page_scroll | Scroll the page
page_wait_for_selector | Wait for an element to appear
page_evaluate | Execute JavaScript code in page context and get result.
page_screenshot | Take a screenshot of the page, a specific DOM element, multiple elements, or a pixel region.
get_all_scripts | Get list of all loaded scripts on the page
get_script_source | Get source code of a specific script.
console_enable | Enable console monitoring to capture console.log, console.error, etc.
console_get_logs | Get captured console logs
console_execute | Execute JavaScript expression in console context
dom_get_computed_style | Get computed CSS styles of an element
dom_find_by_text | Find elements by text content (useful for dynamic content)
dom_get_xpath | Get XPath of an element
dom_is_in_viewport | Check if element is visible in viewport
page_get_performance | Get page performance metrics (load time, network time, etc.)
page_inject_script | Inject JavaScript code into page
page_set_cookies | Set cookies for the page
page_get_cookies | Get all cookies for the page
page_clear_cookies | Clear all cookies
page_set_viewport | Set viewport size
page_emulate_device | Emulate mobile device (iPhone, iPad, Android)
page_get_local_storage | Get all localStorage items
page_set_local_storage | Set localStorage item
page_press_key | Press a keyboard key (e.g., "Enter", "Escape", "ArrowDown")
page_get_all_links | Get all links on the page
captcha_detect | Detect CAPTCHA on the current page using AI vision analysis.
captcha_wait | Wait for the user to manually solve a CAPTCHA.
captcha_config | Configure CAPTCHA detection behavior.
stealth_inject | Inject modern stealth scripts to bypass bot detection.
stealth_set_user_agent | Set a realistic User-Agent and browser fingerprint for the target platform.
stealth_configure_jitter | Configure CDP command timing jitter to mimic natural network latency.
stealth_generate_fingerprint | Generate a realistic browser fingerprint using real-world datasets.
stealth_verify | Run offline anti-detection checks on the current page.
camoufox_server_launch | Launch a Camoufox WebSocket server for multi-process / remote connections.
camoufox_server_close | Close the Camoufox WebSocket server. Connected clients are disconnected.
camoufox_server_status | Get the current status of the Camoufox WebSocket server (running, wsEndpoint).
framework_state_extract | Extract component state from the live page. Supports React, Vue 2/3, Svelte 3/4/5, Solid.js, and Preact. Also detects Next.js/Nuxt meta-framework metadata (routes, build info, payload). Useful for debugging frontend applications, reverse-engineering SPA state, and finding hidden data.
indexeddb_dump | Dump all IndexedDB databases and their contents. Useful for analyzing PWA data, stored tokens, or offline application state.
js_heap_search | Search the browser JavaScript heap for string values matching a pattern. This is the CE (Cheat Engine) equivalent for web: it scans the JS runtime memory to find tokens, API keys, signatures, or any string stored in JS objects.
tab_workflow | Cross-tab coordination for multi-page automation flows.
human_mouse | Move the mouse along a natural Bezier curve path with random jitter.
human_scroll | Scroll the page with human-like behavior: variable speed, micro-pauses, and deceleration.
human_typing | Type text with human-like patterns: variable speed, occasional typos, and natural corrections.
captcha_vision_solve | Attempt to solve a CAPTCHA using an external solving service or AI vision.
widget_challenge_solve | Solve an embedded widget challenge.
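
The interaction tools in the list are typically chained: wait for an element, type into it, press a key, then capture the result. As a rough sketch under stated assumptions, the Python below builds such a sequence as plain data rather than executing it. The tool names and the "Enter" key name come from this page; the `build_search_plan` helper and every argument name (`selector`, `text`, `key`) are hypothetical and may not match the server's actual schema.

```python
from typing import Any, Dict, List, Tuple

# One planned call: (tool name, arguments). Argument names are assumptions.
ToolCall = Tuple[str, Dict[str, Any]]


def build_search_plan(input_selector: str, text: str) -> List[ToolCall]:
    """Return an ordered list of tool calls that types `text` and presses Enter."""
    return [
        # Wait until the input exists before interacting with it.
        ("page_wait_for_selector", {"selector": input_selector}),
        ("page_type", {"selector": input_selector, "text": text}),
        ("page_press_key", {"key": "Enter"}),
        # Capture the resulting page state for inspection.
        ("page_screenshot", {}),
    ]


if __name__ == "__main__":
    for name, args in build_search_plan("#q", "hello"):
        print(name, args)
```

Keeping the plan as data makes it easy to review or log the intended steps before handing them to whichever client dispatches the calls.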

Released under AGPL-3.0-only