navvi

Navvi provides your AI agent with a persistent browser identity for seamless web interactions.

<p align="center"> <img src="https://raw.githubusercontent.com/fellowship-dev/navvi/main/docs/navvi-logo.png" alt="Navvi" width="120" /> </p> <h1 align="center">Navvi</h1> <p align="center"> <strong>Give your AI agent a real browser identity.</strong> <br /> MCP server with persistent personas, anti-detection browser, and credential vault. <br /> Works with Claude Code, Cursor, and other MCP clients. <br /> Open-source alternative to <a href="https://www.browserbase.com/">Browserbase</a> and <a href="https://www.hyperbrowser.ai/">Hyperbrowser</a>. </p> <p align="center"> <a href="https://pypi.org/project/navvi/"><img src="https://img.shields.io/pypi/v/navvi" alt="PyPI" /></a> <a href="https://github.com/fellowship-dev/navvi"><img src="https://img.shields.io/github/stars/fellowship-dev/navvi" alt="GitHub stars" /></a> <a href="https://github.com/fellowship-dev/navvi/blob/main/LICENSE"><img src="https://img.shields.io/github/license/fellowship-dev/navvi" alt="License" /></a> </p> <p align="center"> <a href="#quick-start">Quick Start</a> &middot; <a href="#use-cases">Use Cases</a> &middot; <a href="#how-it-works">How It Works</a> &middot; <a href="#mcp-tools">MCP Tools</a> </p> <p align="center"> <img src="https://raw.githubusercontent.com/fellowship-dev/navvi/main/docs/navvi-demo.gif" alt="Navvi demo — searching and finding Navvi on GitHub" width="800" /> </p>

The Problem

Every time your AI agent needs to use the web, it starts from scratch. No cookies, no saved passwords, no history. It has to log in again and again — and half the time the automation gets detected and blocked.

  • Agent fills a login form → site detects Selenium/Playwright → blocked
  • Agent stores a password in a variable → session ends → password gone
  • Agent tries to reuse a browser → cookies wiped → logged out again
  • You paste credentials into the chat → now they're in your conversation history

Your agent has no identity. Every session is a stranger.

The Solution

Navvi gives your agent a persistent browser with its own identity. It runs Camoufox, an anti-detect build of Firefox, which remembers where it has been, stays logged in, and manages its own credentials without ever exposing passwords to the AI.

  • Persistent sessions — cookies, logins, and history survive restarts
  • Credential vault — passwords generated and stored inside the container, auto-filled into forms without the AI ever seeing them
  • Doesn't get blocked — anti-detect browser with OS-level input that passes bot detection where Selenium and Playwright fail
  • CAPTCHA handling — auto-clicks through common bot checks, with VNC handoff to a human when it can't
  • Multi-persona — each persona runs in its own isolated container with dedicated cookies, credentials, and history
  • Keeps your context clean — 11 high-level tools by default, 12 more unlock on demand so your agent isn't overwhelmed by options

Quick Start

1. Add to Claude Code

Add to your project's .mcp.json:

{
  "mcpServers": {
    "navvi": {
      "command": "uvx",
      "args": ["navvi@latest"],
      "env": {
        "NAVVI_GPG_PASSPHRASE": "pick-any-random-string-here"
      }
    }
  }
}

NAVVI_GPG_PASSPHRASE enables the credential vault (gopass). On first boot, Navvi generates a GPG key automatically. The key persists in a Docker volume across restarts.

Keep your passphrase safe. If you lose it and the Docker volume is deleted, all stored passwords are unrecoverable.
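
As a rough sketch (not part of Navvi itself), the configuration above could be generated with a random passphrase using only Python's standard library. The helper name build_mcp_config is hypothetical; the JSON keys mirror the .mcp.json shown above.

```python
import json
import secrets
from typing import Optional


def build_mcp_config(passphrase: Optional[str] = None) -> dict:
    """Return the .mcp.json structure shown above (hypothetical helper)."""
    if passphrase is None:
        # Any random string works; this one is 32 random bytes, URL-safe encoded.
        passphrase = secrets.token_urlsafe(32)
    return {
        "mcpServers": {
            "navvi": {
                "command": "uvx",
                "args": ["navvi@latest"],
                "env": {"NAVVI_GPG_PASSPHRASE": passphrase},
            }
        }
    }


if __name__ == "__main__":
    # Print the config so it can be redirected into .mcp.json.
    print(json.dumps(build_mcp_config(), indent=2))
```

If you generate the passphrase this way, store a copy somewhere safe before relying on the vault, for the reason given above.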

3. Use

Just tell your agent what to do:

"Log into Tutanota with stored credentials"
"Search DuckDuckGo for 'navvi browser' and list the top results"
"Sign up for a new Outlook account"

Navvi's journey tools (navvi_browse, navvi_login) handle navigation, element finding, clicking, typing, and screenshots internally. No manual step-by-step needed.

<details> <summary>For fine-grained control: atomic tools</summary>

Atomic tools are hidden by default. Unlock them when you need precise control:

navvi_atomic(enable=true)                          -> unlock low-level tools
navvi_open url=https://example.com                 -> navigate
navvi_find selector="input[type=email]"            -> locate element -> (x, y)
navvi_fill x=512 y=498 value="[email protected]"     -> type into it
navvi_screenshot                                   -> see what happened
</details>
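
If you drive Navvi from your own MCP client rather than a chat agent, the atomic sequence above would travel as MCP tools/call requests. The sketch below only builds the JSON-RPC payloads and sends nothing. The argument names (enable, url, selector) are taken from the example above, but whether Navvi's tool schemas match them exactly is an assumption, and tool_call is a hypothetical helper.

```python
import itertools
import json

_ids = itertools.count(1)  # monotonically increasing JSON-RPC request ids


def tool_call(name: str, **arguments) -> dict:
    """Build an MCP `tools/call` request (JSON-RPC 2.0) for one tool."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


# navvi_fill is omitted: its x/y values come from the navvi_find response.
calls = [
    tool_call("navvi_atomic", enable=True),
    tool_call("navvi_open", url="https://example.com"),
    tool_call("navvi_find", selector="input[type=email]"),
    tool_call("navvi_screenshot"),
]

for call in calls:
    print(json.dumps(call))
```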

4. Optional: Install skills

Skills give your AI agent dedicated browsing capabilities and keep browser work isolated from your main conversation. They work with Claude Code, Cursor, and 40+ other agents.

npx skills add fellowship-dev/navvi

Or install manually:

curl -fsSL https://raw.githubusercontent.com/fellowship-dev/navvi/main/install-companions.sh | bash

Included skills:

  • navvi-browse — autonomous web browsing with vision-driven navigation
  • navvi-login — login with stored credentials, handles reCAPTCHA and 2FA
  • navvi-signup — create new accounts with auto-generated credentials

If Navvi is useful to you, please ⭐ star the repo — it helps others discover it.


Use Cases

Persistent logins. Log into a service once — your agent stays logged in across sessions. No more re-entering credentials, no more expired sessions.

Secure credential management. Passwords are generated and stored inside the container. The autofill action types them into forms, so the AI never sees the raw password.

Account signup. Your agent creates accounts on services — generates passwords inside the container, fills forms, and persists the credentials for future logins.

Multi-persona workflows. Run multiple browser identities simultaneously — each persona gets its own container with isolated cookies and credentials.

Form automation on protected sites. Fill complex forms with dropdowns, date pickers, and multi-step wizards. OS-level input passes bot detection that blocks Selenium and Playwright.

Visual evidence for PRs. Screenshot your staging app before and after a code change. Record a user flow as a GIF. Attach it to the pull request.

How It Works

Each persona runs in its own Docker container with a dedicated Firefox instance, cookies, and profile. Your agent speaks MCP, and Navvi translates its requests into browser actions.

Anti-detection uses Camoufox — a patched Firefox with fingerprint masking at the C++ level. Sites that detect and block Selenium, Playwright, and headless Chrome don't detect Navvi.

All input goes through xdotool, which generates OS-level mouse and keyboard events that websites cannot distinguish from a real person's input.

Credentials are stored in gopass inside the container:

  • generate — creates a random password, stores in gopass. The password never leaves the container or appears in AI context.
  • autofill — reads gopass and types directly into the browser. The password never travels through the AI.
  • import — bulk-import existing credentials from a JSON file.

Multi-persona — each persona runs in its own container (navvi-{name}) with an isolated Firefox profile, cookies, and history. Gopass credentials are namespaced per persona (navvi/{persona}/{service}) in a shared vault.

MCP Tools

By default, Navvi shows 11 high-level tools. Atomic tools unlock on demand via navvi_atomic.

Journey tools (default)

| Tool | What it does |
| --- | --- |
| navvi_browse | Primary tool: give it an instruction + URL, it handles everything |
| navvi_login | Log into a service using stored credentials |

Lifecycle

| Tool | What it does |
| --- | --- |
| navvi_start | Start container + persona's Firefox instance |
| navvi_stop | Stop container (profiles preserved) |
| navvi_status | Show running containers, personas, and health |

Observation

| Tool | What it does |
| --- | --- |
| navvi_screenshot | Capture the screen |
| navvi_vnc | Get live VNC URL for human handoff |

Persona management

| Tool | What it does |
| --- | --- |
| navvi_persona | Create, update, list, delete browser personas |
| navvi_account | Track accounts per persona (service, email, credential ref) |

Progressive disclosure

| Tool | What it does |
| --- | --- |
| navvi_atomic | Unlock/hide 12 low-level tools (click, find, fill, etc.) |

<details> <summary>Atomic tools (hidden by default)</summary>

| Tool | What it does |
| --- | --- |
| navvi_open | Navigate to a URL |
| navvi_find | Find element by CSS selector → screen (x, y) |
| navvi_click | Click at coordinates |
| navvi_fill | Click + type text |
| navvi_press | Press a key |
| navvi_scroll | Scroll the page |
| navvi_drag | Drag between two points |
| navvi_mousedown/up/move | Low-level mouse control |
| navvi_url | Get current page URL |
| navvi_creds | Manage credentials: list, get, generate, import, autofill |
| navvi_list | List available Codespaces (remote mode) |

</details>

<details> <summary>Recording tools (hidden by default)</summary>

| Tool | What it does |
| --- | --- |
| navvi_record_start | Start recording screenshots |
| navvi_record_stop | Assemble MP4 |
| navvi_record_gif | Convert to GIF |

</details>

<details> <summary>MCP Resources and Prompts (for developers)</summary>

Resources

Read persona state without tool calls:

| URI | What it returns |
| --- | --- |
| personas://list | All personas with account counts |
| persona://{name}/state | Config, accounts, recent actions |
| persona://{name}/accounts | Account details |
| audit://{name}/log | Last 20 actions |
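
A client could fetch these resources with MCP's resources/read method. The sketch below fills in the URI patterns listed above for one persona and wraps a URI in a JSON-RPC request. Both resource_uris and read_request are hypothetical helpers, and nothing is sent over the wire.

```python
import json


def resource_uris(persona: str) -> dict:
    """Resource URIs for one persona, following the patterns listed above."""
    return {
        "state": f"persona://{persona}/state",
        "accounts": f"persona://{persona}/accounts",
        "audit_log": f"audit://{persona}/log",
    }


def read_request(uri: str, request_id: int = 1) -> dict:
    """Build an MCP `resources/read` request (JSON-RPC 2.0)."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "resources/read",
        "params": {"uri": uri},
    }


print(json.dumps(read_request(resource_uris("mybot")["state"])))
```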

Prompts

Structured workflows available as prompt templates:

| Prompt | What it does |
| --- | --- |
| signup_flow | Step-by-step account creation on a service |
| login_flow | Log in using stored credentials |
| qa_walk | Walk a page for QA: screenshot, find issues, report |
</details>

Personas

Each persona is a separate browser identity with its own container, Firefox instance, cookies, credentials, and history.

navvi_persona(action="create", name="mybot", description="GitHub admin", stealth="high")
navvi_start(persona="mybot")           -> launches container navvi-mybot
navvi_persona(action="list")
navvi_account(action="add", persona="mybot", service="github.com", email="[email protected]")

Persona config and state live in ~/.navvi/navvi.db. Each persona's browser profile persists in its own Docker volume (navvi-profile-{name}). Credentials share a common gopass vault.
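
To check which persona profiles exist on a machine, one option is to filter Docker's volume list by the navvi-profile- prefix. This is a sketch that assumes the docker CLI is on PATH; personas_from_volumes and list_profile_personas are hypothetical names, and Navvi itself may track personas differently (for example through navvi.db).

```python
import subprocess

PREFIX = "navvi-profile-"


def personas_from_volumes(volume_names):
    """Extract persona names from volumes named navvi-profile-{name}."""
    return sorted(
        name[len(PREFIX):] for name in volume_names if name.startswith(PREFIX)
    )


def list_profile_personas():
    """Ask Docker for all volume names and keep the Navvi profile volumes."""
    result = subprocess.run(
        ["docker", "volume", "ls", "--format", "{{.Name}}"],
        capture_output=True,
        text=True,
        check=True,  # raise if the docker command fails
    )
    return personas_from_volumes(result.stdout.splitlines())
```

Calling list_profile_personas() on a host with a navvi-profile-mybot volume would include mybot in the returned list.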

Requirements

  • Docker — the browser runs in a container
  • uv: install with curl -LsSf https://astral.sh/uv/install.sh | sh (or brew install uv)
  • NAVVI_GPG_PASSPHRASE — any random string, enables the gopass credential vault. Set in .mcp.json env.
  • ffmpeg (optional) — only needed for video recording
  • ANTHROPIC_API_KEY (optional) — enables Haiku vision for navvi_browse ($0.002/step). Without it, falls back to claude -p CLI or heuristics. For best results, install the skills instead — they use Claude Code's native vision at no extra cost.
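
For budgeting, the quoted $0.002 per step makes the vision cost straightforward to estimate. A small sketch follows; the step counts are made-up examples, vision_cost is a hypothetical helper, and the per-step price is simply the figure quoted above, which may change.

```python
from decimal import Decimal

COST_PER_STEP = Decimal("0.002")  # USD per step, as quoted above


def vision_cost(steps: int) -> Decimal:
    """Estimated Haiku vision cost in USD for a run of navvi_browse steps."""
    if steps < 0:
        raise ValueError("steps must be non-negative")
    return COST_PER_STEP * steps


print(vision_cost(25))    # 0.050
print(vision_cost(1000))  # 2.000
```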

Contributing

See CONTRIBUTING.md for development setup and guidelines.

License

MIT