donobu

# Donobu SDK Create, run, and heal AI-assisted Playwright flows with a single dependency. The `donobu` package ships the Playwright fixture, Page.AI orchestration layer, CLI wrapper, failure triage, and plugin system to comprehensively test websites. ## Highlights - **Typed Playwright fixture** - `import { test } from 'donobu'` to extend Playwright with `page.ai` helpers, smart selectors, and persistence. - **Autonomous Page.AI** - run `page.ai()` with optional Zod schemas, cached tool-call replays, custom tool allow-lists, and env-var controls. - **Prebuilt tools** - call keyboard, mouse, accessibility, cookie, and analysis tools via friendly wrappers (`page.runAccessibilityTest`, etc). - **Failure triage & auto-heal** - `npx donobu test --auto-heal` captures screenshots, GPT reasoning, structured treatment plans, and can re-run fixes automatically. ## Prerequisites - Node.js 18+ and a package manager (npm 8+, pnpm 10+, or yarn). - Playwright browsers (`npx playwright install`). - At least one LLM credential (OpenAI, Anthropic, Google Gemini, AWS Bedrock, or Donobu API). ## Installation ```bash npm install --save-dev donobu @playwright/test npx playwright install # downloads browsers if needed ``` ## Quick Start 1. **Author a test using the Donobu fixture** ```ts import { test } from 'donobu'; test('Test for https://www.starbucks.com', async ({ page }) => { await page.goto('https://www.starbucks.com'); await page.ai('Go to the featured menu page'); await page.ai.assert( `Assert that the featured menu page has a seasonally appropriate vibe for ${new Date()}`, ); await page.ai('Find a Starbucks store in Stowe, Vermont'); await page.ai.assert( 'Assert that a store in Stowe, Vermont is found and the map shows Mt. Mansfield close by.', ); }); ``` 2. **Run the test with Page.AI enabled** ```bash OPENAI_API_KEY=sk-*** npx donobu test ``` `npx donobu test` proxies Playwright while wiring Donobu-specific env vars (triage directories, Page.AI cache clearing, auto-heal retries, etc.). ### Page.AI API Surface | Method | Description | | ---------------------------------------- | -------------------------------------------------------------------- | | `await page.ai(instruction, opts?)` | Launches an autonomous Donobu flow that can call browser tools. | | `await page.ai.assert(assertion, opts?)` | AI assertion against DOM text, screenshot, title, and URL. | | `await page.ai.extract(schema, opts?)` | Produce JSON data shaped by a Zod schema using screenshot + history. | - Every invocation of `page.ai()` is cached in `<spec directory>/.cache-lock/<spec-file>.cache.js`. Run `npx donobu test --clear-ai-cache` to regenerate the cache. ## Page.AI Caching, Env Vars, and Secrets - Per-spec cache: Page.AI cache entries are saved next to the spec inside `.cache-lock/<spec-file>.cache.js`. Commit them to stabilise selectors or delete to regenerate. - CLI toggles: `--clear-ai-cache` (or `DONOBU_PAGE_AI_CLEAR_CACHE=1`) clears cache before each `page.ai.act`. - Allow specific env vars by explicitly referencing them by name in the `page.ai()` instruction or by passing them as options: ```ts test('uses secret', async ({ page }, testInfo) => { await page.ai('Log in using {{$.env.MY_SECRET}} credentials', { envVars: ['SOME_OTHER_SECRET'], }); }); ``` In the above example, the `page.ai` agent will have access to the `MY_SECRET` and `SOME_OTHER_SECRET` env vars. ## CLI Usage `npx donobu` mirrors Playwright subcommands and adds Donobu-specific tooling. | Command | What it does | | ------------------------------------------------- | ----------------------------------------------------------------------------------------------------- | | `npx donobu test [playwright args]` | Runs Playwright tests with Donobu fixtures, triage, optional auto-heal, and Page.AI caching controls. | | `npx donobu test --auto-heal` | After failures, generate treatment plans and automatically retry tests whose plans recommend it. | | `npx donobu test --no-triage` | Skip evidence gathering (faster but no treatment plans). | | `npx donobu test --triage-output-dir ./artifacts` | Persist evidence outside `test-results/donobu-triage`. | | `npx donobu test --clear-ai-cache` | Clear Page.AI cache before every `act()` invocation for the run. | | `npx donobu heal --plan path/to/plan.json` | Re-run a previously generated treatment plan with matching Playwright args. | | `npx playwright-json-to-markdown report.json` | Convert Playwright JSON reports into human-friendly Markdown. | | `npx playwright-json-to-slack-json report.json` | Produce Slack-ready payloads from Playwright reports. | ### Failure Evidence & Auto-Heal - During `donobu test`, failure evidence (flow metadata, screenshots, DOM dumps, GPT summaries) is stored under `test-results/donobu-triage/<timestamp>-<runId>/`. - `triageTestFailure` builds a structured treatment plan containing failure reason, remediation steps, and automation directives. Plans are written next to the evidence (prefixed with `treatment-plan-`). - Passing `--auto-heal` lets Donobu run an autonomous flow that attempts to fix selectors/code. Successful fixes attach regenerated tests (`fixed-test.ts`) and annotate runs with `@self-healed`. ## GPT Configuration Donobu selects a GPT backend in the following priority order: 1. `BASE64_GPT_CONFIG` - Base64 JSON matching `GptConfigSchema`. 2. `DONOBU_API_KEY` - use Donobu hosted models. 3. Anthropic via AWS Bedrock (`AWS_BEDROCK_MODEL_NAME`, `AWS_REGION`, `AWS_ACCESS_KEY_ID`, `AWS_SECRET_ACCESS_KEY`). 4. Anthropic direct (`ANTHROPIC_API_KEY`, optional `ANTHROPIC_MODEL_NAME`). 5. Google Gemini (`GOOGLE_GENERATIVE_AI_API_KEY`, optional `GOOGLE_GENERATIVE_AI_MODEL_NAME`). 6. OpenAI (`OPENAI_API_KEY`, optional `OPENAI_API_MODEL_NAME`). Additional runtime env vars: | Env var | Purpose | | ------------------------------------------------ | ---------------------------------------------------------------- | | `DONOBU_PAGE_AI_CLEAR_CACHE` | Force cache invalidation for every `page.ai()` call. | | `BASE_WORKING_DIR` | Override the platform-specific Donobu data directory. | | `BROWSERBASE_API_KEY` / `BROWSERBASE_PROJECT_ID` | Run flows inside BrowserBase sessions instead of local Chromium. | ## Additional Resources - Example flows and generated tests: <https://github.com/donobu-inc/playwright-flows> - Support: <https://donobu.com>