Prerequisites
Before you start, make sure you have:- The BrowserWire CLI installed (
npm install -g browserwire) and an LLM provider configured - The BrowserWire Chrome extension loaded in developer mode (run
npx browserwire --extension-pathto get the path, then load it viachrome://extensions) - Chrome open and ready
Discovery walkthrough
Start the BrowserWire server
In your terminal, run:The server starts on
http://127.0.0.1:8787. You should see a confirmation message with the address. Leave this terminal running throughout your session.Open Chrome and navigate to the target site
Open Chrome and go to the site you want to discover. Log in if needed — BrowserWire can capture authenticated pages just as well as public ones.
Open the BrowserWire side panel
Click the BrowserWire extension icon in the Chrome toolbar. This opens the side panel. If the icon isn’t visible, click the puzzle-piece icon and pin BrowserWire.
Click "Start Exploring"
In the side panel, click Start Exploring.Once active, the extension begins capturing everything it can see on the page: visible UI elements, form fields, buttons, navigation links, and interactive controls. A snapshot counter in the panel shows how many page states have been captured so far.
The extension only captures data while a session is active. It does not run in the background between sessions or send data anywhere except your local BrowserWire server.
Navigate the site to expose actions and UI
Browse the site the way an agent would need to use it. The more ground you cover, the richer the manifest will be.Focus on:
- Login and signup forms — navigate to the form, but you don’t need to submit it
- Primary navigation — click through the main sections of the site
- Key actions — open modals, expand menus, trigger search, click call-to-action buttons
- Data-heavy views — tables, listings, dashboards that agents might need to read
Click "Stop Exploring"
When you’ve covered the pages you care about, click Stop Exploring in the side panel.BrowserWire sends the captured snapshots to the CLI server, which uses your configured LLM to analyze the page content, synthesize reliable element locators, and compile the manifest. This processing happens in the background — you’ll see progress in the terminal.
Verify the manifest was built
Once processing finishes, confirm your site appears in the API:You should see a response listing your site with its slug, origin URL, and counts of discovered states, views, and actions:The slug is derived from the site’s origin (e.g.,
https://example.com → example-com).What gets captured
A discovery session builds a manifest containing:| Type | Description |
|---|---|
| Entities | Data types the site works with (users, products, orders, etc.) |
| Views | Read operations — pages or components that return structured data |
| Actions | Write operations — buttons, form submissions, and other UI interactions |
| Workflows | Multi-step sequences that combine actions in order (e.g., fill and submit a login form) |