@onkernel/create-kernel-app

Version:

Create Kernel sample applications

45 lines (29 loc) • 1.68 kB

Markdown

# Kernel TypeScript SDK + Stagehand + Gemini Computer Use Agent A Kernel application that demonstrates Computer Use Agent (CUA) capabilities using Google's Gemini 2.5 model with Stagehand for browser automation. ## What It Does This app uses [Gemini 2.5's computer use model](https://blog.google/technology/google-deepmind/gemini-computer-use-model/) capabilities to autonomously navigate websites and complete tasks. The example task searches for Kernel's company page on YCombinator and writes a blog post about their product. ## Setup 1. **Add your API keys as environment variables:** - `KERNEL_API_KEY` - Get from [Kernel dashboard](https://dashboard.onkernel.com/sign-in) - `GOOGLE_API_KEY` - Get from [Google AI Studio](https://aistudio.google.com/apikey) - `OPENAI_API_KEY` - Get from [OpenAI platform](https://platform.openai.com/api-keys) ## Running Locally Execute the script directly with tsx: ```bash npx tsx index.ts ``` This runs the agent without a Kernel invocation context and provides the browser live view URL for debugging. ## Deploying to Kernel 1. **Deploy the application:** ```bash kernel deploy index.ts --env GOOGLE_API_KEY=XXX --env OPENAI_API_KEY=XXX ``` 2. **Invoke the action:** ```bash kernel invoke ts-gemini-cua gemini-cua-task ``` The action creates a Kernel-managed browser and associates it with the invocation for tracking and monitoring. ## Documentation - [Kernel Documentation](https://docs.onkernel.com/quickstart) - [Kernel Stagehand Guide](https://www.onkernel.com/docs/integrations/stagehand) - [Gemini 2.5 Computer Use](https://blog.google/technology/google-deepmind/gemini-computer-use-model/)