UNPKG

vibe-tools

Version:
1,026 lines (755 loc) 68.9 kB
<div align="center"> <img height="72" src="https://github.com/user-attachments/assets/45eff178-242f-4d84-863e-247b080cc6f5" /> </div> <div align=center><h1>Give AI Agents an AI team and advanced skills</h1></div> | Summary | Prompt it | |---------|-----------| | Essential information to understand what vibe-tools is and how to get started using it | [![](https://b.lmpify.com/getting_started)](https://lmpify.com?q=https%3A%2F%2Fuuithub.com%2Feastlondoner%2Fcursor-tools%2Ftree%2Fmain%3FpathPatterns%3DREADME.md%26pathPatterns%3DCONFIGURATION.md%26pathPatterns%3Dpackage.json%26pathPatterns%3Dvibe-tools.config.json%26pathPatterns%3D.cursor-tools.env.example%26pathPatterns%3Dsrc%252Fvibe-rules.ts%0A%0AI'm%20new%20to%20vibe-tools.%20Can%20you%20explain%20what%20it%20is%2C%20how%20to%20install%20it%2C%20and%20how%20to%20get%20started%20with%20basic%20commands%3F) | | Overview of available commands and their basic functionality | [![](https://b.lmpify.com/command_overview)](https://lmpify.com?q=https%3A%2F%2Fuuithub.com%2Feastlondoner%2Fcursor-tools%2Ftree%2Fmain%3FpathPatterns%3Dsrc%252Fcommands%252Findex.ts%26pathPatterns%3Dsrc%252Ftypes.ts%26pathPatterns%3Dsrc%252Fvibe-rules.ts%26pathPatterns%3DREADME.md%0A%0AWhat%20commands%20are%20available%20in%20vibe-tools%20and%20what%20does%20each%20one%20do%3F) | | Browser automation commands and capabilities | [![](https://b.lmpify.com/browser_commands)](https://lmpify.com?q=https%3A%2F%2Fuuithub.com%2Feastlondoner%2Fcursor-tools%2Ftree%2Fmain%3FpathPatterns%3Dsrc%252Fcommands%252Fbrowser%252F**%252F*.ts%26pathPatterns%3Dtests%252Fcommands%252Fbrowser%252F*.html%26excludePathPatterns%3Dsrc%252Fcommands%252Fbrowser%252Fstagehand%252FstagehandScript.ts%0A%0AHow%20do%20I%20use%20the%20browser%20commands%20in%20vibe-tools%3F%20What%20browser%20automation%20capabilities%20are%20available%3F) | | LLM provider integration and configuration | [![](https://b.lmpify.com/llm_integration)](https://lmpify.com?q=https%3A%2F%2Fuuithub.com%2Feastlondoner%2Fcursor-tools%2Ftree%2Fmain%3FpathPatterns%3Dsrc%252Futils%252Ftool-enabled-llm%252F**%26pathPatterns%3Dsrc%252Fproviders%252F**%26pathPatterns%3Dsrc%252Fllms%252F**%26pathPatterns%3D.cursor-tools.env.example%0A%0AHow%20do%20I%20configure%20different%20LLM%20providers%20with%20vibe-tools%3F%20What%20providers%20are%20supported%3F) | | Model Context Protocol (MCP) commands and tools | [![](https://b.lmpify.com/mcp_commands)](https://lmpify.com?q=https%3A%2F%2Fuuithub.com%2Feastlondoner%2Fcursor-tools%2Ftree%2Fmain%3FpathPatterns%3Dsrc%252Fcommands%252Fmcp%252F**%252F*.ts%0A%0AHow%20do%20I%20use%20the%20MCP%20commands%20in%20vibe-tools%3F%20What%20is%20MCP%20and%20how%20does%20it%20work%3F) | | Testing framework and capabilities | [![](https://b.lmpify.com/testing)](https://lmpify.com?q=https%3A%2F%2Fuuithub.com%2Feastlondoner%2Fcursor-tools%2Ftree%2Fmain%3FpathPatterns%3Dsrc%252Fcommands%252Ftest%252F**%252F*.ts%26pathPatterns%3Dtests%252Ffeature-behaviors%252F**%252F*.md%26pathPatterns%3DTESTING.md%0A%0AHow%20do%20I%20use%20the%20testing%20capabilities%20in%20vibe-tools%3F%20How%20can%20I%20create%20and%20run%20tests%3F) | | Configuration options and customization | [![](https://b.lmpify.com/configuration)](https://lmpify.com?q=https%3A%2F%2Fuuithub.com%2Feastlondoner%2Fcursor-tools%2Ftree%2Fmain%3FpathPatterns%3Dsrc%252Fconfig.ts%26pathPatterns%3Dvibe-tools.config.json%26pathPatterns%3D.cursor-tools.env.example%26pathPatterns%3DCONFIGURATION.md%26pathPatterns%3Dsrc%252Fvibe-rules.ts%0A%0AHow%20do%20I%20configure%20vibe-tools%3F%20What%20configuration%20options%20are%20available%3F) | | Telemetry implementation and infrastructure | [![](https://b.lmpify.com/telemetry)](https://lmpify.com?q=https%3A%2F%2Fuuithub.com%2Feastlondoner%2Fcursor-tools%2Ftree%2Fmain%3FpathPatterns%3Dsrc%252Ftelemetry%252F**%26pathPatterns%3Dinfra%252F**%26pathPatterns%3DTELEMETRY.md%0A%0AHow%20does%20telemetry%20work%20in%20vibe-tools%3F%20What%20data%20is%20collected%20and%20how%20is%20it%20used%3F) | | Example usage | [![](https://b.lmpify.com/examples)](https://lmpify.com?q=https%3A%2F%2Fuuithub.com%2Feastlondoner%2Fcursor-tools%2Ftree%2Fmain%3FpathPatterns%3Dsrc%252Fvibe-rules.ts%26pathPatterns%3DREADME.md%26pathPatterns%3DCONFIGURATION.md%0A%0ACan%20you%20show%20me%20some%20examples%20of%20how%20to%20use%20vibe-tools%20commands%20effectively%3F) | ## Table of Contents - [The AI Team](#the-ai-team) - [New Skills](#new-skills-for-your-existing-agent) - [How to Use](#how-do-i-use-it) - [Example: Using Perplexity](#asking-perplexity-to-carry-out-web-research) - [Example: Using Gemini](#asking-gemini-for-a-plan) - [What is vibe-tools](#what-is-vibe-tools) - [Installation](#installation) - [Requirements](#requirements) - [Telemetry & Privacy](#telemetry--privacy) - [Tips](#tips) - [Additional Examples](#additional-examples) - [GitHub Skills](#github-skills) - [Gemini Code Review](#gemini-code-review) - [Detailed Cursor Usage](#detailed-cursor-usage) - [Tool Recommendations](#tool-recommendations) - [Command Nicknames](#command-nicknames) - [Web Search](#use-web-search) - [Repository Search](#use-repo-search) - [Documentation Generation](#use-doc-generation) - [GitHub Integration](#use-github-integration) - [Browser Automation](#use-browser-automation) - [Direct Model Queries](#use-direct-model-queries) - [Authentication and API Keys](#authentication-and-api-keys) - [AI Team Features](#ai-team-features) - [Perplexity: Web Search & Research](#perplexity-web-search--research) - [Gemini 2.0: Repository Context & Planning](#gemini-20-repository-context--planning) - [Stagehand: Browser Automation](#stagehand-browser-automation) - [Browser Command Options](#browser-command-options) - [Video Recording](#video-recording) - [Console and Network Logging](#console-and-network-logging) - [Complex Actions](#complex-actions) - [Troubleshooting Browser Commands](#troubleshooting-browser-commands) - [YouTube Video Analysis](#youtube-video-analysis) - [Skills](#skills) - [GitHub Integration](#github-integration) - [Linear Integration](#linear-integration) - [Xcode Tools](#xcode-tools) - [Documentation Generation](#documentation-generation-uses-gemini-20) - [Wait Command](#wait-command) - [Configuration](#configuration) - [vibe-tools.config.json](#vibe-toolsconfigjson) - [GitHub Authentication](#github-authentication) - [Repomix Configuration](#repomix-configuration) - [Model Selection](#model-selection) - [Cursor Configuration](#cursor-configuration) - [Cursor Agent Configuration](#cursor-agent-configuration) - [vibe-tools cli](#vibe-tools-cli) - [Command Options](#command-options) - [Execution Methods](#execution-methods) - [Troubleshooting](#troubleshooting) - [Examples](#examples) - [Web Search Examples](#web-search-examples) - [Repository Context Examples](#repository-context-examples) - [Documentation Examples](#documentation-examples) - [GitHub Integration Examples](#github-integration-examples) - [Xcode Command Examples](#xcode-command-examples) - [Browser Command Examples](#browser-command-examples) - [open subcommand examples](#open-subcommand-examples) - [act, extract, observe subcommands examples](#act-extract-observe-subcommands-examples) - [YouTube Command Examples](#youtube-command-examples) - [Node Package Manager](#node-package-manager-npm) - [Contributing](#contributing) - [Sponsors](#sponsors) - [License](#license) ### The AI Team - Perplexity to search the web and perform deep research - Gemini 2.0 for huge whole-codebase context window, search grounding and reasoning - Stagehand for browser operation to test and debug web apps (uses Anthropic, OpenAI, Gemini, or OpenRouter models) - OpenRouter for access to a variety of models through a unified API (for MCP commands) ### New Skills for your existing Agent - Work with GitHub Issues and Pull Requests - Access Linear issues with full context and comments - Generate local agent-accessible documentation for external dependencies - Analyze YouTube videos to extract insights, summaries, and implementation plans `vibe-tools` is optimized for Cursor Composer Agent but it can be used by any coding agent that can execute commands ### How do I use it? After installation, to see AI teamwork in action just ask Cursor Composer to use Perplexity or Gemini. Here are two examples: <div align="center"> <div> <h3>Asking Perplexity to carry out web research</h3> </div> <div style="display: flex;"> <img width="350" alt="image" src="https://github.com/user-attachments/assets/d136c007-387b-449c-9737-553b34e71bbd" /> </div> <details> <summary>see what happens next...</summary> <img width="350" alt="image" src="https://github.com/user-attachments/assets/06566162-fbaa-492a-8ce8-1a51e0713ee8" /> <details> <summary>see what happens next...</summary> <img width="350" alt="image" src="https://github.com/user-attachments/assets/fbca8d46-0e0e-4752-922e-62cceec6c12b" /> <details> <summary>see what happens next...</summary> <img width="1172" alt="image" src="https://github.com/user-attachments/assets/4bdae605-6f6c-43c3-b10c-c0263060033c" /> </details> </details> </details> see the spec composer and perplexity produced together: <a href="https://github.com/eastlondoner/pac-man/blob/main/specs/pac-man-spec.md">pac-man-spec.md</a> (link out to the example repo) <br/> <br/> </div> </div> <div align="center"> <div> <h3>Asking Gemini for a plan</h3> </div> <div style="display: flex;"> <img width="350" src="https://github.com/user-attachments/assets/816daee4-0a31-4a6b-8aac-39796cb03b51" /> </div> <details> <summary>see what happens next...</summary> <img width="350" alt="image" src="https://github.com/user-attachments/assets/b44c4cc2-6498-42e8-bda6-227fbfed0a7c" /> <details> <summary>see what happens next...</summary> <img width="350" alt="image" src="https://github.com/user-attachments/assets/dcfcac67-ce79-4cd1-a66e-697c654ee986" /> <details> <summary>see what happens next...</summary> <img width="350" alt="image" src="https://github.com/user-attachments/assets/8df7d591-f48b-463d-8d9b-f7e9c1c9c95b" /> </details> </details> </details> see the spec composer and perplexity produced together: <a href="https://github.com/eastlondoner/pac-man/blob/main/specs/pac-man-plan.md">pac-man-plan.md</a> (link out to the example repo) <br/> <br/> </div> </div> ## What is vibe-tools `vibe-tools` provides a CLI that your **AI agent can use** to expand its capabilities. `vibe-tools` is designed to be installed globally, providing system-wide access to its powerful features. When you run `vibe-tools install`, it configures instruction files tailored to your chosen development environment: - **Supported IDEs/Environments**: Cursor, Claude Code, Codex, Windsurf, Cline, Roo. - **Instruction File Setup**: The installer automatically creates or updates relevant configuration files: - For **Cursor**: `.cursorrules` or `.cursor/rules/vibe-tools.mdc`. - For **Claude Code**: `CLAUDE.md` (local or global `~/.claude/CLAUDE.md`). - For **Codex**: `codex.md` (local or global `~/.codex/instructions.md`). - For **Windsurf**: `.windsurfrules`. - For **Cline/Roo**: `.clinerules` directory (with `vibe-tools.md`) or legacy file. `vibe-tools` supports multiple AI instruction sources including Claude code, Codex, and IDE-specific rules, ensuring compatibility across various AI-powered development setups. `vibe-tools` integrates with multiple AI providers including OpenAI, Anthropic, Gemini, Perplexity, OpenRouter, ModelBox, and xAI (Grok). `vibe-tools` requires a Perplexity API key and a Google AI API key. `vibe-tools` is a node package that should be installed globally. ## Installation Install vibe-tools globally: ```bash npm install -g vibe-tools ``` Then run the setup: ```bash vibe-tools install . ``` This command will: 1. Guide you through API key configuration for the AI providers you choose. 2. Automatically install Playwright browsers (Chromium) for browser automation commands. 3. Create or update AI instruction files based on your selected IDE (e.g., setting up `.cursorrules` for Cursor, `CLAUDE.md` for Claude Code, `.windsurfrules` for Windsurf, etc.). ### Non-Interactive Installation (CI/CD) For automated environments, `vibe-tools install` automatically detects CI environments and runs in non-interactive mode: ```bash # CI environments - automatically detected and runs without prompts CI=true vibe-tools install . # Or explicitly set non-interactive mode NONINTERACTIVE=true vibe-tools install . ``` In non-interactive mode, vibe-tools will: - Auto-detect your package manager and IDE environment - Use existing configurations (local takes precedence over global) - Apply sensible defaults for new installations - Skip writing API keys to files (uses environment variables only) - Enable telemetry by default (can be disabled with `VIBE_TOOLS_NO_TELEMETRY=1`) ## Requirements - Node.js 18 or later - Perplexity API key - Google Gemini API key - For browser commands: - Playwright browsers are automatically installed during `vibe-tools install` - OpenAI API key or Anthropic API key (for `act`, `extract`, and `observe` commands) `vibe-tools` uses Gemini-2.5 models by default, which provide excellent performance with large context windows up to 2 million tokens - enough to handle an entire codebase in one shot. Available Gemini models include `gemini-2.5-flash` (default for speed), `gemini-2.5-pro` (default for quality), and `gemini-2.5-flash-lite-preview-06-17` (lightweight option). Gemini models are currently free to use on Google and you need a Google Cloud project to create an API key. `vibe-tools` uses Perplexity because Perplexity has the best web search api and indexes and it does not hallucinate. Perplexity Pro users can get an API key with their pro account and recieve $5/month of free credits (at time of writing). Support for Google search grounding is coming soon but so far testing has shown it still frequently hallucinates things like APIs and libraries that don't exist. ## Telemetry & Privacy `vibe-tools` collects **anonymous usage telemetry** to help improve the tool. You will be prompted during installation to enable or disable telemetry, and you can opt out at any time. No code, queries, file contents, or personal data are ever collected—only high-level command usage and error types (see [TELEMETRY.md](TELEMETRY.md) for full details). - Telemetry is **opt-in**: you choose during install. - You can change your choice later by setting the `VIBE_TOOLS_NO_TELEMETRY=1` environment variable. - For details on what is (and is not) collected, and how telemetry works, see [TELEMETRY.md](TELEMETRY.md). ## Tips: - Ask Cursor Agent to have Gemini review its work - Ask Cursor Agent to generate documentation for external dependencies and write it to a local-docs/ folder If you do something cool with `vibe-tools` please let me know on twitter or make a PR to add to this section! ## Additional Examples ### GitHub Skills To see vibe-tools GitHub and Perplexity skills: Check out [this example issue that was solved using Cursor agent and vibe-tools](https://github.com/eastlondoner/cursor-tools/issues/1) ### Gemini code review See cursor get approximately 5x more work done per-prompt with Gemini code review: <img width="1701" alt="long view export" src="https://github.com/user-attachments/assets/a8a63f4a-1818-4e84-bb1f-0f60d82c1c42" /> ## Detailed Cursor Usage Use Cursor Composer in agent mode with command execution (not sure what this means, see section below on Cursor Agent configuration). If you have installed the vibe-tools prompt to your .cursorrules (or equivalent) just ask your AI coding agent/assistant to use "vibe-tools" to do things. ### Tool Recommendations - `vibe-tools ask` allows direct querying of any model from any provider. It's best for simple questions where you want to use a specific model or compare responses from different models. - `vibe-tools web` uses an AI teammate with web search capability to answer questions. `web` is best for finding up-to-date information from the web that is not specific to the repository such as how to use a library to search for known issues and error messages or to get suggestions on how to do something. Web is a teammate who knows tons of stuff and is always up to date. - `vibe-tools repo` uses an AI teammate with large context window capability to answer questions. `repo` sends the entire repo as context so it is ideal for questions about how things work or where to find something, it is also great for code review, debugging and planning. With the `--with-diff` flag, it can also include git diff information for focused code review that keeps the AI focused on current changes while maintaining full codebase understanding. is a teammate who knows the entire codebase inside out and understands how everything works together. - `vibe-tools plan` uses an AI teammate with reasoning capability to plan complex tasks. Plan uses a two step process. First it does a whole repo search with a large context window model to find relevant files. Then it sends only those files as context to a thinking model to generate a plan it is great for planning complex tasks and for debugging and refactoring. Plan is a teammate who is really smart on a well defined problem, although doesn't consider the bigger picture. - `vibe-tools doc` uses an AI teammate with large context window capability to generate documentation for local or github hosted repositories by sending the entire repo as context. `doc` can be given precise documentation tasks or can be asked to generate complete docs from scratch it is great for generating docs updates or for generating local documentation for a libary or API that you use! Doc is a teammate who is great at summarising and explaining code, in this repo or in any other repo! - `vibe-tools browser` uses an AI teammate with browser control (aka operator) capability to operate web browsers. `browser` can operate in a hidden (headless) mode to invisibly test and debug web apps or it can be used to connect to an existing browser session to interactively share your browser with Cursor agent it is great for testing and debugging web apps and for carrying out any task that can be done in a browser such as reading information from a bug ticket or even filling out a form. Browser is a teammate who can help you test and debug web apps, and can share control of your browser to perform small browser-based tasks. - `vibe-tools youtube` uses an AI teammate with video analysis capability to understand YouTube content. `youtube` can generate summaries, extract transcripts, create implementation plans from tutorials, and answer specific questions about video content. It's great for extracting value from technical talks, tutorials, and presentations without spending time watching the entire video. YouTube is a teammate who can watch and analyze videos for you, distilling the key information. Note: For repo, doc and plan commands the repository content that is sent as context can be reduced by filtering out files in a .repomixignore file. ### Command Nicknames When using vibe-tools with Cursor Composer, you can use these nicknames: - "Gemini" is a nickname for `vibe-tools repo` - "Perplexity" is a nickname for `vibe-tools web` - "Stagehand" is a nickname for `vibe-tools browser` ### Use web search "Please implement country specific stripe payment pages for the USA, UK, France and Germany. Use vibe-tools web to check the available stripe payment methods in each country." Note: in most cases you can say "ask Perplexity" instead of "use vibe-tools web" and it will work the same. ### Use repo search "Let's refactor our User class to allow multiple email aliases per user. Use vibe-tools repo to ask for a plan including a list of all files that need to be changed." "Use vibe-tools repo to analyze how authentication is implemented in the Next.js repository. Use --from-github=vercel/next.js." "Use vibe-tools repo to explain this React component with documentation from the official React docs. Use --with-doc=https://react.dev/reference/react/useState or a local file path" "Use vibe-tools repo to review my recent changes and suggest improvements. Use --with-diff to include the git diff." "Use vibe-tools repo to check if my changes are compatible with the main branch. Use --with-diff --base=main." Note: in most cases you can say "ask Gemini" instead of "use vibe-tools repo" and it will work the same. ### Use doc generation "Use vibe-tools to generate documentation for the Github repo https://github.com/kait-http/kaito" and write it to docs/kaito.md" Note: in most cases you can say "generate documentation" instead of "use vibe-tools doc" and it will work the same. ### Use github integration "Use vibe-tools github to fetch issue 123 and suggest a solution to the user's problem" "Use vibe-tools github to fetch PR 321 and see if you can fix Andy's latest comment" Note: in most cases you can say "fetch issue 123" or "fetch PR 321" instead of "use vibe-tools github" and it will work the same. ### Use linear integration "Use vibe-tools linear to set up authentication with Linear" "Use vibe-tools linear to fetch issue ITE-123 and provide a summary of the current status" "Use vibe-tools linear to get issue ABC-456 and explain what the team is discussing in the comments" Note: in most cases you can say "fetch Linear issue ITE-123" or "get Linear issue ABC-456" instead of "use vibe-tools linear" and it will work the same. ### Use browser automation "Use vibe-tools to open the users page and check the error in the console logs, fix it" "Use vibe-tools to test the form field validation logic. Take screenshots of each state" "Use vibe-tools to open https://example.com/foo the and check the error in the network logs, what could be causing it?" Note: in most cases you can say "Use Stagehand" instead of "use vibe-tools" and it will work the same. ### Use direct model queries "Use vibe-tools ask to compare how different models answer this question: 'What are the key differences between REST and GraphQL?'" "Ask OpenAI's o3-mini model to explain the concept of dependency injection." "Use vibe-tools ask to analyze this complex algorithm with high reasoning effort: 'Explain the time and space complexity of the Boyer-Moore string search algorithm' --provider openai --model o3-mini --reasoning-effort high" Note: The ask command requires both --provider and --model parameters to be specified. This command is generally less useful than other commands like `repo` or `plan` because it does not include any context from your codebase or repository. **Ask Command Options:** - `--provider=<provider>`: AI provider to use (openai, anthropic, perplexity, gemini, modelbox, openrouter, xai, or groq) - `--model=<model>`: Model to use (required for the ask command) - `--max-tokens=<number>`: Maximum tokens for response - `--reasoning-effort=<low|medium|high>`: Control the depth of reasoning for supported models (OpenAI o1/o3-mini models, Claude 4 Sonnet, and XAI Grok models). Higher values produce more thorough responses for complex questions. - `--with-doc=<doc_url>`: Fetch content from one or more document URLs and include it as context. Can be specified multiple times (e.g., `--with-doc=<url1> --with-doc=<url2>`). ## Authentication and API Keys `vibe-tools` requires API keys for Perplexity AI, Google Gemini, and optionally for OpenAI, Anthropic, OpenRouter, and xAI. These can be configured in two ways: 1. **Interactive Setup**: Run `vibe-tools install` and follow the prompts 2. **Manual Setup**: Create `~/.vibe-tools/.env` in your home directory or `.vibe-tools.env` in your project root: ```env PERPLEXITY_API_KEY="your-perplexity-api-key" GEMINI_API_KEY="your-gemini-api-key" OPENAI_API_KEY="your-openai-api-key" # Optional, for Stagehand ANTHROPIC_API_KEY="your-anthropic-api-key" # Optional, for Stagehand and MCP OPENROUTER_API_KEY="your-openrouter-api-key" # Optional, for MCP XAI_API_KEY="your-xai-api-key" # Optional, for xAI Grok models GROQ_API_KEY="your-groq-api-key" # Optional, for Groq models GITHUB_TOKEN="your-github-token" # Optional, for enhanced GitHub access LINEAR_API_KEY="your-linear-api-key" # Optional, for Linear integration ``` - At least one of `ANTHROPIC_API_KEY` and `OPENROUTER_API_KEY` must be provided to use the `mcp` commands. **CI/CD Environments**: In non-interactive mode (automatically detected in CI environments), vibe-tools uses only environment variables for API keys and skips writing them to filesystem for enhanced security. ### Google Gemini API Authentication `vibe-tools` supports multiple authentication methods for accessing the Google Gemini API, providing flexibility for different environments and security requirements. You can choose from the following methods: 1. **API Key (Default)** - This is the simplest method and continues to be supported for backward compatibility. - Set the `GEMINI_API_KEY` environment variable to your API key string obtained from Google AI Studio. - **Example:** ```env GEMINI_API_KEY="your-api-key-here" ``` 2. **Service Account JSON Key File** - For enhanced security, especially in production environments, use a service account JSON key file. - Set the `GEMINI_API_KEY` environment variable to the **path** of your downloaded service account JSON key file. - **Example:** ```env GEMINI_API_KEY="./path/to/service-account.json" ``` - This method enables access to the latest Gemini models available through Vertex AI, such as `gemini-2.5-flash`. 3. **Automatic Doppler Secrets Manager Integration** (new in 0.63.x) - If the [Doppler](https://www.doppler.com/) CLI is installed and your working directory has been configured with `doppler setup`, vibe-tools will automatically run `doppler secrets --json` at startup and load any secrets whose names end with `_API_KEY` into the current process **before** it evaluates which providers are available. - This means you can keep all of your provider keys (e.g. `OPENAI_API_KEY`, `ANTHROPIC_API_KEY`, `GEMINI_API_KEY`, etc.) in Doppler and skip copying them into `.env` files. - Doppler integration is **on by default**. To turn it off add the following to `vibe-tools.config.json`: ```json { "disableDoppler": true } ``` - Doppler secrets are only loaded if the variable is *not already defined* in the environment, so explicit environment variables always win. - The integration is read-only: secrets are never written back to Doppler or logged. 5. **Environment Variable Precedence with VIBE_TOOLS_ Prefix** - You can prefix any environment variable with `VIBE_TOOLS_` to ensure it takes precedence over all other sources - Example: `VIBE_TOOLS_OPENAI_API_KEY` will override `OPENAI_API_KEY` from any source (environment, .env files, or Doppler) - This works for all API keys and configuration variables - Useful for CI/CD environments or when you want different API keys specifically for vibe-tools - **Example usage:** ```bash # This will use the prefixed key instead of the regular one VIBE_TOOLS_OPENAI_API_KEY="vibe-specific-key" OPENAI_API_KEY="regular-key" vibe-tools ask "Hello" ``` 4. **Application Default Credentials (ADC) for Gemini models (Recommended for Google Cloud Environments)** - **Note:** This is an _alternative_ to setting the `GEMINI_API_KEY` environment variable for Gemini models. - ADC is ideal when running `vibe-tools` within Google Cloud environments (e.g., Compute Engine, Kubernetes Engine) or for local development using `gcloud`. - Set the `GEMINI_API_KEY` environment variable to `adc`. - **Example:** ```env GEMINI_API_KEY="adc" ``` - **Setup Instructions:** - For Google Cloud environments no further steps are required. - To use vibe-tools locally with ADC, authenticate locally using gcloud: ```bash gcloud auth application-default login ``` ## AI Team Features ### Perplexity: Web Search & Research Use Perplexity AI to get up-to-date information directly within Cursor: ```bash vibe-tools web "What's new in TypeScript 5.7?" ``` ### Gemini 2.0: Repository Context & Planning Leverage Google Gemini 2.0 models with 1M+ token context windows for codebase-aware assistance and implementation planning: ```bash # Get context-aware assistance vibe-tools repo "Explain the authentication flow in this project, which files are involved?" # Generate implementation plans vibe-tools plan "Add user authentication to the login page" ``` The plan command uses multiple AI models to: 1. Identify relevant files in your codebase (using Gemini by default) 2. Extract content from those files 3. Generate a detailed implementation plan (using o3-mini by default) **Plan Command Options:** - `--fileProvider=<provider>`: Provider for file identification (gemini, openai, anthropic, perplexity, modelbox, openrouter, xai, or groq) - `--thinkingProvider=<provider>`: Provider for plan generation (gemini, openai, anthropic, perplexity, modelbox, openrouter, xai, or groq) - `--fileModel=<model>`: Model to use for file identification - `--thinkingModel=<model>`: Model to use for plan generation - `--fileMaxTokens=<number>`: Maximum tokens for file identification - `--thinkingMaxTokens=<number>`: Maximum tokens for plan generation - `--debug`: Show detailed error information - `--with-doc=<doc_url>`: Fetch content from one or more web URLs and include it as context during plan generation. Can be specified multiple times (e.g., `--with-doc=<url1> --with-doc=<url2>`). Repository context is created using Repomix. See repomix configuration section below for details on how to change repomix behaviour. Above 1M tokens vibe-tools will always send requests to Gemini 2.0 Pro as it is the only model that supports 1M+ tokens. The Gemini 2.0 Pro context limit is 2M tokens, you can add filters to .repomixignore if your repomix context is above this limit. ### Stagehand: Browser Automation Automate browser interactions for web scraping, testing, and debugging: **Note:** Playwright browsers are automatically installed when you run `vibe-tools install`. No additional setup is required for browser commands. 1. `open` - Open a URL and capture page content: ```bash # Open and capture HTML content, console logs and network activity (enabled by default) vibe-tools browser open "https://example.com" --html # Take a screenshot vibe-tools browser open "https://example.com" --screenshot=page.png # Debug in an interactive browser session vibe-tools browser open "https://example.com" --connect-to=9222 ``` 2. `act` - Execute actions using natural language - Agent tells the browser-use agent what to do: ```bash # Single action vibe-tools browser act "Login as 'user@example.com'" --url "https://example.com/login" # Multi-step workflow using pipe separator vibe-tools browser act "Click Login | Type 'user@example.com' into email | Click Submit" --url "https://example.com" # Record interaction video vibe-tools browser act "Fill out registration form" --url "https://example.com/signup" --video="./recordings" ``` 3. `observe` - Analyze interactive elements: ```bash # Get overview of interactive elements vibe-tools browser observe "What can I interact with?" --url "https://example.com" # Find specific elements vibe-tools browser observe "Find the login form" --url "https://example.com" ``` 4. `extract` - Extract data using natural language: ```bash # Extract specific content vibe-tools browser extract "Get all product prices" --url "https://example.com/products" # Save extracted content vibe-tools browser extract "Get article text" --url "https://example.com/blog" --html > article.html # Extract with network monitoring vibe-tools browser extract "Get API responses" --url "https://example.com/api-test" --network ``` 5. `mac-chrome` - Start a Chrome instance with remote debugging (macOS only): ```bash # Launch Chrome with remote debugging on port 9222 vibe-tools browser mac-chrome # Launch with debug output to see the full command vibe-tools browser mac-chrome --debug # Fast start-up with a minimal flag set vibe-tools browser mac-chrome --lite ``` This command: - Only works on macOS (shows clear error on other platforms) - Creates an isolated temporary profile for clean testing - Launches Chrome with comprehensive automation-optimized flags (or minimal flags with `--lite`) - Enables remote debugging on port 9222 - Provides connection instructions for Playwright/CDP tools - Uses proven Chrome configuration for reliable automation - `--lite` option launches Chrome with a reduced set of flags for quicker startup and fewer side-effects #### Browser Command Options All browser commands (`open`, `act`, `observe`, `extract`) support these options: - `--console`: Capture browser console logs (enabled by default, use `--no-console` to disable) - `--html`: Capture page HTML content (disabled by default) - `--network`: Capture network activity (enabled by default, use `--no-network` to disable) - `--screenshot=<file path>`: Save a screenshot of the page - `--timeout=<milliseconds>`: Set navigation timeout (default: 120000ms for Stagehand operations, 30000ms for navigation) - `--viewport=<width>x<height>`: Set viewport size (e.g., 1280x720) - `--headless`: Run browser in headless mode (default: true) - `--no-headless`: Show browser UI (non-headless mode) for debugging - `--connect-to=<port>`: Connect to existing Chrome instance. Special values: 'current' (use existing page), 'reload-current' (refresh existing page) - `--wait=<time:duration or selector:css-selector>`: Wait after page load (e.g., 'time:5s', 'selector:#element-id') - `--video=<directory>`: Save a video recording (1280x720 resolution, timestamped subdirectory). Not available when using --connect-to - `--url=<url>`: Required for `act`, `observe`, and `extract` commands - `--evaluate=<string>`: JavaScript code to execute in the browser before the main command **Notes on Connecting to an existing browser session with --connect-to** - DO NOT ask browser act to "wait" for anything, the wait command is currently disabled in Stagehand. - When using `--connect-to`, viewport is only changed if `--viewport` is explicitly provided - Video recording is not available when using `--connect-to` - Special `--connect-to` values: - `current`: Use the existing page without reloading - `reload-current`: Use the existing page and refresh it (useful in development) #### Video Recording All browser commands support video recording of the browser interaction in headless mode (not supported with --connect-to): - Use `--video=<directory>` to enable recording - Videos are saved at 1280x720 resolution in timestamped subdirectories - Recording starts when the browser opens and ends when it closes - Videos are saved as .webm files Example: ```bash # Record a video of filling out a form vibe-tools browser act "Fill out registration form with name John Doe" --url "http://localhost:3000/signup" --video="./recordings" ``` #### Console and Network Logging Console logs and network activity are captured by default: - Use `--no-console` to disable console logging - Use `--no-network` to disable network logging - Logs are displayed in the command output #### Complex Actions The `act` command supports chaining multiple actions using the pipe (|) separator: ```bash # Login sequence with console/network logging (enabled by default) vibe-tools browser act "Click Login | Type 'user@example.com' into email | Click Submit" --url "http://localhost:3000/login" # Form filling with multiple fields vibe-tools browser act "Select 'Mr' from title | Type 'John' into first name | Type 'Doe' into last name | Click Next" --url "http://localhost:3000/register" # Record complex interaction vibe-tools browser act "Fill form | Submit | Verify success" --url "http://localhost:3000/signup" --video="./recordings" ``` #### Troubleshooting Browser Commands Common issues and solutions: 1. **Element Not Found Errors** - Use `--no-headless` to visually debug the page - Use `browser observe` to see what elements Stagehand can identify - Check if the element is in an iframe or shadow DOM - Ensure the page has fully loaded (try increasing `--timeout`) 2. **Stagehand API Errors** - Verify your OpenAI or Anthropic API key is set correctly - Check if you have sufficient API credits - Try switching models using `--model` 3. **Network Errors** - Check your internet connection - Verify the target website is accessible - Try increasing the timeout with `--timeout` - Check if the site blocks automated access 4. **Video Recording Issues** - Ensure the target directory exists and is writable - Check disk space - Video recording is not available with `--connect-to` 5. **Performance Issues** - Use `--headless` mode for better performance (default) - Reduce the viewport size with `--viewport` - Consider using `--connect-to` for development 6. **Browser Installation Issues** - Playwright browsers are automatically installed during `vibe-tools install` - If browser installation fails, you can skip it by setting `SKIP_PLAYWRIGHT=1` and install manually - To manually install browsers: `npx playwright install chromium` - Browser installation uses the exact Playwright version that vibe-tools depends on ### YouTube Video Analysis Use Gemini-powered YouTube video analysis to extract insights, summaries, and implementation plans: ```bash # Generate a video summary vibe-tools youtube "https://www.youtube.com/watch?v=VIDEO_ID" --type=summary # Get a detailed transcript vibe-tools youtube "https://www.youtube.com/watch?v=VIDEO_ID" --type=transcript # Create an implementation plan based on tutorial content vibe-tools youtube "https://www.youtube.com/watch?v=VIDEO_ID" --type=plan # Ask specific questions about the video vibe-tools youtube "https://www.youtube.com/watch?v=VIDEO_ID" "How does the authentication flow work?" # Save summary to a file vibe-tools youtube "https://www.youtube.com/watch?v=VIDEO_ID" --type=summary --save-to=video-summary.md ``` The YouTube command leverages Gemini models' native ability to understand video content, enabling you to: - Extract key insights and summaries from technical talks, tutorials, and presentations - Generate complete transcripts of video content - Create implementation plans based on tutorial videos - Perform quality reviews of educational content - Get answers to specific questions about the video content **YouTube Command Options:** - `--type=<summary|transcript|plan|custom>`: Type of analysis to perform (default: summary) **Note:** The YouTube command requires a `GEMINI_API_KEY` to be set in your environment or .vibe-tools.env file as the Gemini API is currently the only interface that reliably supports YouTube video analysis. ## Skills ### GitHub Integration Access GitHub issues and pull requests directly from the command line with rich formatting and full context: ```bash # List recent PRs or issues vibe-tools github pr vibe-tools github issue # View specific PR or issue with full discussion vibe-tools github pr 123 vibe-tools github issue 456 ``` The GitHub commands provide: - View of 10 most recent open PRs or issues when no number specified - Detailed view of specific PR/issue including: - PR/Issue description and metadata - Code review comments grouped by file (PRs only) - Full discussion thread - Labels, assignees, milestones and reviewers - Support for both local repositories and remote GitHub repositories - Markdown-formatted output for readability - **Modular output filtering** with flags like `--review-only`, `--discussion-only`, `--metadata-only`, `--no-links`, and `--hide-resolved` for AI agent optimization **Authentication Methods:** The commands support multiple authentication methods: 1. GitHub token via environment variable: `GITHUB_TOKEN=your_token_here` 2. GitHub CLI integration (if `gh` is installed and logged in) 3. Git credentials (stored tokens or Basic Auth) Without authentication: - Public repositories: Limited to 60 requests per hour - Private repositories: Not accessible With authentication: - Public repositories: 5,000 requests per hour - Private repositories: Full access (with appropriate token scopes) ### Linear Integration Access Linear issues directly from the command line with rich formatting and full context: ```bash # Set up authentication (interactive prompts) vibe-tools linear connect # View specific issue with full details vibe-tools linear get-issue ITE-123 vibe-tools linear issue ABC-456 # Alternative command name ``` The Linear commands provide: - **Authentication Setup**: Support for both personal API keys and OAuth2 flow with PKCE - **Issue Details**: Complete issue information including: - Issue title, description, and status - Priority level and assignee information - Creation and update timestamps - Creator information - **Comments**: Full discussion thread with timestamps and authors - **Attachments**: List of attached files with links - **Flexible Identifiers**: Support for both Linear identifiers (e.g., `ITE-123`) and UUID format **Authentication Methods:** The Linear integration supports two authentication approaches: 1. **Personal API Key** (Recommended for individual use): - Simple setup with guided browser navigation - Direct API key entry for existing keys - Choice of local or global storage 2. **OAuth2 with PKCE** (Recommended for organizational use): - Secure browser-based authentication flow - Automatic token management - Enhanced security with PKCE (Proof Key for Code Exchange) **Authentication Setup:** Run the interactive setup command: ```bash vibe-tools linear connect ``` This will guide you through: - Choosing between personal API key or OAuth authentication - Setting up browser access to Linear (if needed) - Configuring token storage (project-local vs global) **Environment Variable:** Alternatively, you can set the `LINEAR_API_KEY` environment variable in your `.vibe-tools.env` file: ```env LINEAR_API_KEY="your-linear-api-key" ``` **Note:** Linear personal API keys should be used directly without a "Bearer" prefix when set as environment variables. ### Xcode Tools Automate iOS app building, testing, and running in the simulator: ```bash # Available subcommands vibe-tools xcode build # Build Xcode project and report errors vibe-tools xcode run # Build and run app in simulator vibe-tools xcode lint # Analyze code and offer to fix warnings ``` **Build Command Options:** ```bash # Specify custom build path (derived data) vibe-tools xcode build buildPath=/custom/build/path # Specify target device vibe-tools xcode build destination="platform=iOS Simulator,name=iPhone 15" ``` **Run Command Options:** ```bash # Run on iPhone simulator (default) vibe-tools xcode run iphone # Run on iPad simulator vibe-tools xcode run ipad # Run on specific device with custom build path vibe-tools xcode run device="iPhone 16 Pro" buildPath=/custom/build/path ``` The Xcode commands provide: - Automatic project/workspace detection - Dynamic app bundle identification - Build output streaming with error parsing - Simulator device management - Support for both iPhone and iPad simulators - Custom build path specification to control derived data location ### Documentation Generation (uses Gemini 2.0) Generate comprehensive documentation for your repository or any GitHub repository: ```bash # Document local repository and save to file vibe-tools doc --save-to=docs.md # Document remote GitHub repository (both formats supported) vibe-tools doc --from-github=username/repo-name@branch vibe-tools doc --from-github=https://github.com/username/repo-name@branch # Save documentation to file (with and without a hint) # This is really useful to generate local documentation for libraries and dependencies vibe-tools doc --from-github=eastlondoner/cursor-tools --save-to=docs/MY_DOCS.md vibe-tools doc --from-github=eastlondoner/cursor-tools --save-to=docs/MY_DOCS.md --hint="only information about the doc command" # Document dependencies vibe-tools doc --from-github=expressjs/express --save-to=docs/EXPRESS.md --quiet # Document with additional web documentation as context vibe-tools doc --from-github=reactjs/react-redux --with-doc=https://redux.js.org/tutorials/fundamentals/part-5-ui-and-react --save-to=docs/REACT_REDUX.md # Document using multiple web documents as context vibe-tools doc --from-github=some/repo --with-doc=https://example.com/spec1 --with-doc=https://example.com/spec2 --save-to=docs/MULTI_DOC.md ``` ### Wait Command - `vibe-tools wait <seconds>`: Pauses execution for the specified number of seconds. Useful for simple timing needs within scripts or chained commands. ## Configuration ### vibe-tools.config.json Customize `vibe-tools` behavior by creating a `vibe-tools.config.json` file. This file can be created either globally in `~/.vibe-tools/vibe-tools.config.json` or locally in your project root. The vibe-tools.config file configures the local default behaviour for each command and provider. Here is an example of a typical vibe-tools.config.json file, showing some of the most common configuration options: ```json { // Commands "repo": { "provider": "openrouter", "model": "google/gemini-2.5-pro" }, "doc": { "provider": "openrouter", "model": "anthropic/claude-sonnet-4", "maxTokens": 4096 }, "web": { "provider": "gemini", "model": "gemini-2.5-pro" }, "plan": { "fileProvider": "gemini", "thinkingProvider": "perplexity", "thinkingModel": "r1-1776" }, "browser": { "headless": false }, //... // Providers "stagehand": { "model": "claude-sonnet-4-20250514", // For Anthropic provider "provider": "anthropic", // or "openai" "timeout": 90000 }, "openai": { "model": "gpt-4o" } //... } ``` For details of all configuration options, see [CONFIGURATION.md](CONFIGURATION.md). This includes details of all the configuration options and how to use them. ### GitHub Authentication The GitHub commands support several authentication methods: 1. **Environment Variable**: Set `GITHUB_TOKEN` in your environment: ```env GITHUB_TOKEN=your_token_here ``` 2. **GitHub CLI**: If you have the GitHub CLI (`gh`) installed and are logged in, vibe-tools will automatically use it to generate tokens with the necessary scopes. 3. **Git Credentials**: If you have authenticated git with GitHub (via HTTPS), vibe-tools will automatically: - Use your stored GitHub token if available (credentials starting with `ghp_` or `gho_`) - Fall back to using Basic Auth with your git credentials To set up git credentials: 1. Configure git to use HTTPS instead of SSH: ```bash git config --global url."https://github.com/".insteadOf git@github.com: ``` 2. Store your credentials: ```bash git config --global credential.helper store # Permanent storage # Or for macOS keychain: git config --global credential.helper osxkeychain ``` 3. The next time you perform a git operation requiring authentication, your credentials will be stored Authentication Status: - Without authentication: - Public repositories: Limited to 60 requests per hour - Private repositories: Not accessible - Some features may be restricted - With authentication (any method): - Public repositories: 5,000 requests per hour - Private repositories: Full access (if token has required scopes) vibe-tools will automatically try these authentication methods in order: 1. `GITHUB_TOKEN` environment variable 2. GitHub CLI token (if `gh` is installed and logged in) 3. Git credentials (stored token or Basic Auth) If no authentication is available, it will fall back to unauthenticated access with rate limits. ### Repomix Configuration When generating documentation, vibe-tools uses Repomix to analyze your repository. By default, it excludes certain files and directories that are typically not relevant for documentation: - Node modules and package directories (`node_modules/`, `packages/`, etc.) - Build output directories (`dist/`, `build/`, etc.) - Version control directories (`.git/`) - Test files and directories (`test/`, `tests/`, `__tests__/`, etc.) - Configuration files (`.env`, `.config`, etc.) - Log files and temporary files - Binary files and media files You can customize the files and folders to exclude using two methods, both can be combined together: 1. **Create a `.repomixignore` file** in your project root to specify files to exclude. Example `.repomixignore` file for a Laravel project: ``` vendor/ public/ database/ storage/ .idea .env ``` 2. **Create a `repomix.config.json` file** in your project root for more advanced configuration options: Example `repomix.config.json` to enable compression and specify what to include: ```json { "include": ["src/**/*", "README.md", "package.json"], "output": { "compress": true } } ``` This configuration will be detected and used automatically by the `repo`, `plan`, and `doc` commands, allowing for precise control over which files are included in the repository analysis. If both a .repomixignore and an ignore section in `repomix.config.json` are present then the ignore patterns from both are combined. #### Model Selection The `browser` commands support different AI models for processing from multiple provide