opendia
Version:
π― OpenDia - The open alternative to Dia. Connect your browser to AI models with anti-detection bypass for Twitter/X, LinkedIn, Facebook
209 lines (155 loc) β’ 8.23 kB
Markdown
# OpenDia <img src="opendia-extension/icon-128.png" alt="OpenDia" width="32" height="32">
> **The open alternative to Dia**
> Connect your browser to AI models. No browser switching neededβworks seamlessly with any Chromium browser including Chrome & Arc.
[](https://badge.fury.io/js/opendia)
[](https://opensource.org/licenses/MIT)
## πΊ See it in Action

## π What is OpenDia?
OpenDia lets AI models control your browser automatically. **The key advantage? It leverages everything you already have**βyour logged-in accounts, saved passwords, cookies, wallets, and browsing history. No need to start from scratch or switch contexts.
**π Use Your Existing Digital Life:**
- β
**Logged-in accounts**: Post to Twitter / X, LinkedIn, Facebook with your existing sessions
- β
**Browser data**: Access your bookmarks, history, and saved passwords
- β
**Extensions & wallets**: Use MetaMask, password managers, or any installed extensions
- β
**Cookies & sessions**: Stay authenticated across all your favorite sites
- β
**Local testing**: Perfect for development with Cursor - test with real user sessions
**β¨ Key Benefits:**
- π **Universal AI Support**: Works with Claude, ChatGPT, Cursor and even local models
- π― **Anti-Detection**: Specialized bypasses for Twitter/X, LinkedIn, Facebook
- π± **Smart Automation**: AI understands your pages and finds the right elements
- π‘οΈ **Privacy-First**: Everything runs locally, your data stays with you
- β‘ **Zero Setup**: Get started with one command
## π Browser Support
Works with **any Chromium-based browser**:
- β
**Google Chrome**
- β
**Arc Browser**
- β
**Microsoft Edge**
- β
**Brave Browser**
- β
**Opera**
- β
**Vivaldi**
- β
**Any Chromium variant**
Perfect for **Cursor users** who want to automate their local testing and development workflows!
## π¬ What You Can Do
**Real workflows you can try today:**
### π° Content & Social Media
- **"Summarize all the articles I read today and post a Twitter thread about the key insights"**
- **"Find interesting articles related to AI from my bookmarks and create a reading list"**
- **"Read this article and post a thoughtful comment on the LinkedIn version"**
- **"Check my recent Twitter bookmarks and summarize the main themes"**
### π§ Productivity & Research
- **"Browse my latest emails and tell me what needs urgent attention"**
- **"Find all the GitHub repos I visited this week and create a summary report"**
- **"Extract the main points from this research paper and save them to my notes"**
- **"Search my browsing history for that article about AI safety I read last month"**
### π€ Development & Testing (Perfect for Cursor!)
- **"Test my web app's signup flow and take screenshots at each step"**
- **"Fill out this form with test data and check if validation works"**
- **"Navigate through my app and check if all the buttons work properly"**
- **"Use my connected wallet to test this DeFi interface"**
### π Advanced Automation
- **"Open tabs for all my daily news sources and summarize the top stories"**
- **"Draft replies to my unread messages based on the context"**
- **"Monitor this webpage and notify me when the content changes"**
- **"Automatically bookmark interesting articles I'm reading"**
## β‘ Quick Start
### 1. Start the Server
```bash
npx opendia
```
### 2. Install the Browser Extension
1. Download from [releases](https://github.com/aaronjmars/opendia/releases)
2. Go to `chrome://extensions/` (or your browser's extension page)
3. Enable "Developer mode"
4. Click "Load unpacked" and select the extension folder
### 3. Connect to Your AI
**For Claude Desktop**, add to your configuration:
```json
{
"mcpServers": {
"opendia": {
"command": "npx",
"args": ["opendia"]
}
}
}
```
**For Cursor or other AI tools**, use the same configuration or follow their specific setup instructions.
## π οΈ Capabilities
OpenDia gives AI models **17 powerful browser tools**:
### π― Smart Page Understanding
- **Analyze any webpage** - AI automatically finds buttons, forms, and interactive elements
- **Extract content intelligently** - Get clean text from articles, social posts, or search results
- **Understand context** - AI knows what type of page it's looking at and how to interact with it
### π±οΈ Natural Interactions
- **Click anything** - Buttons, links, menus - AI finds and clicks the right elements
- **Fill forms smartly** - Works even on complex sites like Twitter, LinkedIn, Facebook
- **Navigate seamlessly** - Go to pages, scroll, wait for content to load
- **Handle modern web apps** - Bypasses detection on social platforms
### π Tab & Window Management
- **Multi-tab workflows** - Open, close, switch between tabs automatically
- **Organize your workspace** - Let AI manage your browser tabs efficiently
- **Coordinate complex tasks** - Work across multiple sites simultaneously
### π Access Your Browser Data
- **Bookmarks & History** - Find that article you read last week
- **Current page content** - Get selected text, links, or full page content
- **Real-time information** - Work with whatever's currently on your screen
### π‘οΈ Anti-Detection Features
- **Social media posting** - Bypass automation detection on Twitter/X, LinkedIn, Facebook
- **Natural interactions** - Mimics human behavior to avoid triggering security measures
- **Reliable automation** - Works consistently even on sites that block typical automation tools
## π¬ Example Prompts to Try
Once everything is set up, try asking your AI:
**Content Creation:**
> *"Read the article on this page and create a Twitter thread summarizing the main points"*
**Research & Analysis:**
> *"Look through my browser history from this week and find articles about machine learning. Summarize the key trends."*
**Social Media Management:**
> *"Check my Twitter bookmarks and organize them into categories. Create a summary of each category."*
**Productivity:**
> *"Open tabs for my usual morning reading sites and give me a briefing of today's top stories"*
**Development Testing:**
> *"Fill out this contact form with test data and check if the submission works properly"*
**Personal Assistant:**
> *"Find that GitHub repo I was looking at yesterday about React components and bookmark it for later"*
## ποΈ How It Works
```mermaid
graph LR
A[AI Model] --> B[OpenDia Server]
B --> C[Browser Extension]
C --> D[Your Browser]
D --> E[Any Website]
```
1. **You ask** your AI to do something browser-related
2. **AI calls** OpenDia tools to understand and interact with pages
3. **OpenDia controls** your browser through the extension
4. **You get results** - AI can see what happened and respond intelligently
## π Security & Privacy
**Your data stays private**:
- β
**Everything runs locally** - No cloud processing of your browsing data
- β
**You control access** - Extension only works when you want it to
- β
**Open source** - Full transparency of what the code does
- β
**No tracking** - We don't collect or store any of your information
**Important**: This tool requires broad browser permissions to function. Only use with AI models you trust, and in environments where you're comfortable with browser automation.
## π€ Contributing
Love to have your help making OpenDia better!
### Quick Development Setup
```bash
git clone https://github.com/aaronjmars/opendia.git
cd opendia
# Start the server
cd opendia-mcp
npm install
npm start
# Load extension in your browser
# Go to chrome://extensions/ β Developer mode β Load unpacked: ./opendia-extension
```
### Ways to Contribute
- π **Report bugs** via [GitHub Issues](https://github.com/aaronjmars/opendia/issues)
- π‘ **Share it on social medias**
- π§ **Add new browser capabilities**
- π **Improve documentation**
- π§ͺ **Test with different AI models**
## π License
MIT License - see [LICENSE](LICENSE) for details.
---
**Ready to supercharge your browser with AI? Get started with `npx opendia`! π**