Stagehand is a tool that enables both extraction of structured data from webpages and autonomous web automation using Browserbase and modern LLMs (OpenAI or Anthropic).
Stagehand offers two main capabilities in Sim:
-
stagehand_extract: Extract structured data from a single webpage. You specify what you want (a schema), and the AI retrieves and parses the data in that shape from the page. This is best for extracting lists, fields, or objects when you know exactly what information you need and where to get it.
-
stagehand_agent: Run an autonomous web agent capable of completing multi-step tasks, interacting with elements, navigating between pages, and returning structured results. This is much more flexible: the agent can do things like log in, search, fill forms, gather data from multiple places, and output a final result according to a requested schema.
Key Differences:
- stagehand_extract is a rapid “extract this data from this page” operation. It works best for direct, one-step extraction tasks.
- stagehand_agent performs complex, multi-step autonomous tasks on the web — such as navigation, searching, or even transactions — and can dynamically extract data according to your instructions and an optional schema.
In practice, use stagehand_extract when you know what you want and where, and use stagehand_agent when you need a bot to think through and execute interactive workflows.
By integrating Stagehand, Sim agents can automate data gathering, analysis, and workflow execution on the web: updating databases, organizing information, and generating custom reports—seamlessly and autonomously.
Usage Instructions
Integrate Stagehand into the workflow. Can extract structured data from webpages or run an autonomous agent to perform tasks.
Tools
stagehand_extract
Extract structured data from a webpage using Stagehand
Input
| Parameter | Type | Required | Description |
|---|---|---|---|
url | string | Yes | URL of the webpage to extract data from |
instruction | string | Yes | Instructions for extraction |
provider | string | No | AI provider to use: openai or anthropic |
apiKey | string | Yes | API key for the selected provider |
schema | json | Yes | JSON schema defining the structure of the data to extract |
Output
| Parameter | Type | Description |
|---|---|---|
data | object | Extracted structured data matching the provided schema |
stagehand_agent
Run an autonomous web agent to complete tasks and extract structured data
Input
| Parameter | Type | Required | Description |
|---|---|---|---|
startUrl | string | Yes | URL of the webpage to start the agent on |
task | string | Yes | The task to complete or goal to achieve on the website |
variables | json | No | Optional variables to substitute in the task (format: {key: value}). Reference in task using %key% |
format | string | No | No description |
provider | string | No | AI provider to use: openai or anthropic |
apiKey | string | Yes | API key for the selected provider |
outputSchema | json | No | Optional JSON schema defining the structure of data the agent should return |
Output
| Parameter | Type | Description |
|---|---|---|
agentResult | object | Result from the Stagehand agent execution |
Notes
- Category:
tools - Type:
stagehand