mcp-smart

# 🎯 MCP Smart Advisor

A powerful MCP (Model Context Protocol) server that provides intelligent AI routing and multi-advisor consultations through the OpenRouter API. Get expert coding advice from 6 premium AI models with smart routing, advanced caching, rate limiting, and security features.

## ✨ Features

- **🧠 Smart Routing System** - Intelligent provider selection based on task requirements
- **🎲 Random Mode** - NEW! Randomly selects from all available providers for unpredictable results
- **🤖 6 Premium AI Providers** - Claude Sonnet 4.5, OpenAI GPT-5 Pro, xAI Grok 4, Google Gemini 3 Pro, DeepSeek v3.2, Moonshot Kimi-K2 Thinking
- **⚡ Intelligent Caching** - LRU cache with configurable TTL to reduce API costs
- **🛡️ Security First** - Input validation, prompt injection detection, and rate limiting
- **📊 Comprehensive Logging** - Structured logging with cache metrics and health monitoring
- **⚙️ Fully Configurable** - Environment variables for all settings
- **🔄 Retry Logic** - Exponential backoff with Promise.allSettled for resilient API calls
- **🔄 Circuit Breakers** - Individual provider protection with automatic failover

## 🚀 Quick Start

### Install via npx

```bash
npx mcp-smart@1.5.7
```

### Install globally

```bash
npm install -g mcp-smart@1.5.7
```

### Run directly

```bash
mcp-smart
```

### 🎲 Try Random Mode

```bash
# Install and test the new random routing feature
npx mcp-smart@1.5.7

# In your MCP client, try:
# model: "random" - for unpredictable AI provider selection
```

## 🔧 Configuration

Set your OpenRouter API key and configure the server:

```bash
export OPENROUTER_API_KEY="your-api-key-here"
export MAX_RETRIES=3
export REQUEST_TIMEOUT=30000
export CACHE_TTL=300000
export MAX_TOKENS=4000
export MAX_CACHE_SIZE=100
export RATE_LIMIT_REQUESTS=10
export RATE_LIMIT_WINDOW=60000

# Circuit Breaker Configuration
export CIRCUIT_BREAKER_FAILURE_THRESHOLD=5
export CIRCUIT_BREAKER_RECOVERY_TIMEOUT=60000
export CIRCUIT_BREAKER_MONITORING_PERIOD=300000
export CIRCUIT_BREAKER_HALF_OPEN_MAX_CALLS=3
```

### Environment Variables

| Variable | Default | Description |
|----------|---------|-------------|
| `OPENROUTER_API_KEY` | **Required** | Your OpenRouter API key |
| `MAX_RETRIES` | `3` | Maximum retry attempts for failed requests |
| `REQUEST_TIMEOUT` | `30000` | Request timeout in milliseconds |
| `CACHE_TTL` | `300000` | Cache time-to-live in milliseconds (5 min) |
| `MAX_TOKENS` | `4000` | Maximum tokens per API request |
| `MAX_CACHE_SIZE` | `100` | Maximum number of cached responses |
| `MAX_TASK_LENGTH` | `10000` | Maximum task input length |
| `MAX_CONTEXT_LENGTH` | `20000` | Maximum context input length |
| `RATE_LIMIT_REQUESTS` | `10` | Requests per rate limit window |
| `RATE_LIMIT_WINDOW` | `60000` | Rate limit window in milliseconds (1 min) |
| `CIRCUIT_BREAKER_FAILURE_THRESHOLD` | `5` | Consecutive failures before opening circuit breaker |
| `CIRCUIT_BREAKER_RECOVERY_TIMEOUT` | `60000` | Time to wait before attempting recovery (ms) |
| `CIRCUIT_BREAKER_MONITORING_PERIOD` | `300000` | Circuit breaker monitoring window (ms) |
| `CIRCUIT_BREAKER_HALF_OPEN_MAX_CALLS` | `3` | Max calls allowed in half-open state |

## 🎯 Usage with Claude Code

Add to your `claude-code-config.json`:

```json
{
  "mcpServers": {
    "smart-advisor": {
      "command": "npx",
      "args": ["mcp-smart@1.5.7"],
      "env": {
        "OPENROUTER_API_KEY": "your-openrouter-api-key-here"
      }
    }
  }
}
```

### 🎯 Claude Code Integration with Agent Hooks

To enable automatic Smart advisor access in Claude Code, add this to your `~/.claude/CLAUDE.md` file (or a local project `CLAUDE.md`). This hooks the MCP server into Claude's agent system:

```markdown
When facing uncertainty and needing advice, you have exclusive access to an L7 Google programmer named Smart. Ask him when appropriate using these terms: smart_advisor, code_review, get_advice, expert_opinion, smart_llm.

You can specify what you're looking for, these are the modes available:
- auto - GPT-5 Mini intelligently selects the best provider
- intelligence - Routes to Claude Sonnet 4.5 (ultimate reasoning)
- premium - Routes to OpenAI GPT-5 Pro (high-end reasoning)
- speed - Routes to xAI Grok 4 (fast responses)
- balance - Routes to Google Gemini 3 Pro (cost/performance balance)
- cost - Routes to DeepSeek v3.2 (budget-friendly)
- all - Multi-provider consultation (all providers)
```

This integration allows Claude Code to automatically consult the Smart advisor when encountering complex coding problems or needing expert opinions.

## 🤖 AI Providers & Intelligence Ranking

### Provider Capabilities (Ranked by Intelligence)

| Provider | Intelligence | Cost | Speed | Context | Strengths |
|----------|-------------|------|-------|---------|-----------|
| **Claude Sonnet 4.5** | 🔥 Ultimate | High | Medium | Very High | Supreme reasoning, ethical coding, comprehensive solutions |
| **OpenAI GPT-5 Pro** | 🏆 Highest | Very High | Slow | Medium | Complex reasoning, creativity, advanced coding |
| **xAI Grok 4** | ⭐ Very High | Medium | Fast | High | Real-time data, creative thinking, fast responses |
| **Google Gemini 3 Pro** | ⭐ Very High | Low | Fast | Highest (2M) | Multimodal, research, long-context, speed |
| **DeepSeek v3.2** | ✅ High | Low | Fast | Medium | Cost-effective, coding/logic/math, analysis |
| **Moonshot Kimi-K2 Thinking** | ⭐ Very High | Medium | Fast | Highest (2M) | Chinese language, reasoning, coding, long-context |

## 🎛️ Smart Routing Strategies

### Routing Options

| Strategy | Provider | Use Case | Description |
|----------|----------|----------|-------------|
| **`auto`** ⚡ | GPT-5 Mini decides | Default smart routing | Intelligent provider selection based on task |
| **`intelligence`** 🔥 | Claude Sonnet 4.5 | Ultimate reasoning | Most capable model for complex problems |
| **`premium`** 🏆 | OpenAI GPT-5 Pro | Premium alternative | High-end reasoning and creativity |
| **`speed`** 🚀 | xAI Grok 4 | Fast responses | Quick turnaround with real-time data |
| **`balance`** ⚖️ | Google Gemini 3 Pro | Cost/performance | Optimal balance of speed, cost, and capability |
| **`cost`** 💰 | DeepSeek v3.2 | Budget-friendly | Maximum cost efficiency |
| **`random`** 🎲 | Random provider | Unpredictable | Randomly selects from all available providers |
| **`all`** 🌟 | All providers | Comprehensive | Multi-provider consultation |

### Direct Provider Access

- **`claude`** - Force Claude Sonnet 4.5
- **`openai`** - Force OpenAI GPT-5 Pro
- **`xai`** - Force xAI Grok 4
- **`google`** - Force Google Gemini 3 Pro
- **`deepseek`** - Force DeepSeek v3.2
- **`moonshot`** - Force Moonshot Kimi-K2 Thinking

## 💡 Example Usage

### Smart Auto-Routing (Recommended)

```typescript
// Let GPT-5 Mini choose the best provider
await smart_advisor({
  model: "auto",
  task: "Optimize this React component for performance",
  context: "Component renders 10,000+ items with complex state"
});
```

### Strategy-Based Routing

```typescript
// Maximum intelligence for complex problems
await smart_advisor({
  model: "intelligence", // Routes to Claude Sonnet 4.5
  task: "Design a fault-tolerant distributed system architecture",
  context: "Microservices with 99.99% uptime requirement"
});

// Speed-optimized responses
await smart_advisor({
  model: "speed", // Routes to xAI Grok 4
  task: "Quick debugging help for this JavaScript error",
  context: "TypeError in production, need fast solution"
});

// Cost-effective solutions
await smart_advisor({
  model: "cost", // Routes to DeepSeek v3.2
  task: "Write a simple sorting algorithm",
  context: "Basic coding task for learning"
});

// Random provider selection
await smart_advisor({
  model: "random", // Randomly selects from all providers
  task: "Refactor this function for better readability",
  context: "Legacy code that needs modernization"
});
```

### 🎲 Random Mode Benefits

The random routing strategy offers unique advantages:

- **🔍 Provider Testing** - Compare different AI approaches to the same problem
- **⚖️ Load Balancing** - Distribute requests across providers automatically
- **🎯 Bias Reduction** - Avoid over-reliance on a single provider
- **🚀 Discovery** - Uncover unexpected solutions from different AI models
- **🔄 Experimentation** - Perfect for A/B testing AI provider performance

```typescript
// Great for testing different perspectives
for (let i = 0; i < 5; i++) {
  const result = await smart_advisor({
    model: "random",
    task: "Explain this complex algorithm",
    context: "University-level computer science"
  });
  console.log(`Attempt ${i + 1}: Different AI perspective`);
}
```

### Multi-Advisor Consultation

```typescript
// Get perspectives from all providers
await smart_advisor({
  model: "all",
  task: "Review this security-critical authentication system",
  context: "OAuth2 implementation handling sensitive user data"
});
```

### Direct Provider Access

```typescript
// Use specific provider directly
await smart_advisor({
  model: "claude",
  task: "Ethical considerations for AI system design",
  context: "Building recommendation engine for social media"
});
```

## 🏗️ Architecture

The server implements a structured 4-persona prompt system:

1. **Manager** - Defines clear requirements and ensures understanding
2. **Smart Technical Advisor** - Creates detailed technical architecture with deep insights
3. **QA** - Implements comprehensive tests covering edge cases and bottlenecks
4. **Engineer** - Provides production-ready, secure, and efficient code

### Smart Routing Flow

```
User Request → GPT-5 Mini Analysis → Provider Selection → Response
                      ↓
        [Task Complexity Assessment]
                      ↓
   [Cost/Performance/Speed Requirements]
                      ↓
        [Optimal Provider Routing]
```

## 🛡️ Security Features

- **Prompt Injection Detection** - Advanced pattern recognition for malicious inputs
- **Script Injection Prevention** - Blocks XSS and code injection attempts
- **Input Validation** - Length limits and comprehensive sanitization
- **Rate Limiting** - Configurable request limits per time window
- **Security Logging** - Detailed audit trail for security events

## 🔄 Resilience & Fault Tolerance

### Circuit Breaker Pattern

- **Provider Protection** - Individual circuit breakers for each AI provider
- **Automatic Failover** - Smart fallback to healthy providers when others fail
- **Self-Healing** - Automatic recovery testing with configurable timeouts
- **State Management** - CLOSED, OPEN, and HALF_OPEN states with proper transitions
- **Failure Thresholds** - Configurable consecutive failure limits before opening
- **Monitoring** - Real-time circuit breaker status and metrics tracking

## 📈 Performance & Monitoring

### Caching System

- **LRU Caching** - Intelligent cache eviction based on usage patterns
- **Cache Metrics** - Real-time hit/miss ratios and performance tracking
- **TTL Management** - Configurable cache expiration

### Resilience Features

- **Circuit Breakers** - Individual protection for each AI provider with automatic failover
- **Promise.allSettled** - Graceful handling of provider failures
- **Intelligent Fallback** - Hierarchical fallback to healthy providers based on capability ranking
- **Exponential Backoff** - Smart retry logic for transient failures
- **Health Monitoring** - Comprehensive system health checks including circuit breaker status
- **Provider Recovery** - Automatic testing and recovery of failed providers

### Monitoring Dashboard

```typescript
// Access real-time metrics
const health = server.getHealthCheck();
const cacheMetrics = server.getCacheMetrics();
```

## 🔍 Advanced Features

### Health Monitoring

- System uptime tracking
- Cache performance metrics
- Rate limit monitoring
- Provider failure tracking
- Version information

### Cache Management

- Hit/miss ratio tracking
- Eviction monitoring
- Size management
- Performance optimization

### Security Monitoring

- Injection attempt detection
- Rate limit violations
- Input validation failures
- Security event logging

## 🛠️ Development

```bash
# Clone the repository
git clone https://github.com/sliday/mcp-smart.git
cd mcp-smart

# Install dependencies
npm install

# Build the project
npm run build

# Run in development mode
npm run dev

# Run tests (44/45 passing)
npm test

# Run tests with coverage
npm run test:coverage
```

## 🧪 Testing

The project maintains high test coverage with 44/45 tests passing:

- Unit tests for all providers and routing strategies
- Integration tests for end-to-end workflows
- Security tests for injection detection
- Performance tests for caching and rate limiting
- Error handling and fallback testing

## 📊 Version History

### v1.5.7 (Latest)

- 🚀 **Major Model Updates** - All providers upgraded to latest versions
  - ✨ Claude Sonnet 4 → Claude Sonnet 4.5 (enhanced reasoning)
  - ✨ OpenAI o3 → OpenAI GPT-5 Pro (next-generation intelligence)
  - ✨ xAI Grok-3-beta → xAI Grok 4 (improved speed and accuracy)
  - ✨ Google Gemini Flash → Google Gemini 3 Pro (enhanced capabilities)
  - ✨ DeepSeek v3-0324 → DeepSeek v3.2 (improved coding performance)
  - ✨ Moonshot Kimi-K2 → Moonshot Kimi-K2 Thinking (reasoning enhancement)
  - ✨ Router: GPT-4o-mini → GPT-5 Mini (smarter routing decisions)
- 📚 Updated all documentation with new model names
- 🎯 Improved routing prompts with latest model capabilities

### v1.5.3

- ✨ Added random routing strategy for unpredictable provider selection
- 🎲 New `random` mode randomly selects from all available providers
- 📚 Enhanced documentation with random mode benefits and use cases
- 🎯 Added detailed examples for provider testing and load balancing
- 🧪 **Perfect test coverage - 47/47 tests passing (100%)**
- 🔧 Improved routing logic with better error handling
- ✨ Highlighted random mode feature in Quick Start section
- 🛠️ Enhanced test isolation and mock setup for reliability

### v1.4.0

- ✨ Added Claude Sonnet 4 (ultimate intelligence)
- ✨ Added xAI Grok-3-beta (speed optimization)
- ✨ Updated Google to Gemini Flash (cost-effective)
- ✨ Enhanced smart routing with 7 strategies
- ✨ Improved intelligence hierarchy and provider rankings
- 🔧 Enhanced auto-routing decision logic
- 📈 44/45 test coverage

### v1.3.0

- ✨ Smart routing system with auto-selection
- 🔧 Cost/performance-aware provider selection
- 📈 Comprehensive test coverage

### v1.2.0

- 🛡️ Enhanced security with prompt injection detection
- ⚡ Improved caching with metrics tracking
- 🔄 Promise.allSettled for better error handling

## 📄 License

MIT License - see [LICENSE](LICENSE) file for details.

## 🤝 Contributing

Contributions are welcome! Please read our contributing guidelines and submit pull requests.

## 🐛 Issues

Report bugs and request features at [GitHub Issues](https://github.com/sliday/mcp-smart/issues).

## 🌟 Support

If you find this project helpful, please give it a star on GitHub!

---

**Built with ❤️ for the MCP ecosystem**

**Powered by Claude Sonnet 4.5, OpenAI GPT-5 Pro, xAI Grok 4, Google Gemini 3 Pro, DeepSeek v3.2, and Moonshot Kimi-K2 Thinking**
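
For readers curious how the CLOSED, OPEN, and HALF_OPEN states described under Circuit Breaker Pattern could fit together, here is a minimal sketch. It is not mcp-smart's actual implementation: the `CircuitBreaker` class, its method names, and the injected `now` clock are hypothetical and exist only for illustration. The three options are assumed to mirror `CIRCUIT_BREAKER_FAILURE_THRESHOLD`, `CIRCUIT_BREAKER_RECOVERY_TIMEOUT`, and `CIRCUIT_BREAKER_HALF_OPEN_MAX_CALLS`.

```typescript
// Hypothetical sketch: names below are illustrative, not mcp-smart's API.
type BreakerState = "CLOSED" | "OPEN" | "HALF_OPEN";

interface BreakerOptions {
  failureThreshold: number;  // assumed to mirror CIRCUIT_BREAKER_FAILURE_THRESHOLD
  recoveryTimeoutMs: number; // assumed to mirror CIRCUIT_BREAKER_RECOVERY_TIMEOUT
  halfOpenMaxCalls: number;  // assumed to mirror CIRCUIT_BREAKER_HALF_OPEN_MAX_CALLS
}

class CircuitBreaker {
  private state: BreakerState = "CLOSED";
  private consecutiveFailures = 0;
  private openedAt = 0;
  private halfOpenCalls = 0;
  private options: BreakerOptions;
  private now: () => number;

  // `now` is injectable so the clock can be controlled in tests.
  constructor(options: BreakerOptions, now: () => number = () => Date.now()) {
    this.options = options;
    this.now = now;
  }

  getState(): BreakerState {
    // Lazily move OPEN -> HALF_OPEN once the recovery timeout has elapsed.
    if (
      this.state === "OPEN" &&
      this.now() - this.openedAt >= this.options.recoveryTimeoutMs
    ) {
      this.state = "HALF_OPEN";
      this.halfOpenCalls = 0;
    }
    return this.state;
  }

  // Returns true if a call to the provider may be attempted right now.
  canRequest(): boolean {
    const state = this.getState();
    if (state === "CLOSED") return true;
    if (state === "OPEN") return false;
    // HALF_OPEN: admit only a limited number of trial calls.
    if (this.halfOpenCalls < this.options.halfOpenMaxCalls) {
      this.halfOpenCalls += 1;
      return true;
    }
    return false;
  }

  recordSuccess(): void {
    // Ignore late successes that arrive while the breaker is still OPEN.
    if (this.getState() === "OPEN") return;
    this.consecutiveFailures = 0;
    this.state = "CLOSED";
  }

  recordFailure(): void {
    this.consecutiveFailures += 1;
    // A failed trial call, or too many consecutive failures, opens the breaker.
    if (
      this.state === "HALF_OPEN" ||
      this.consecutiveFailures >= this.options.failureThreshold
    ) {
      this.state = "OPEN";
      this.openedAt = this.now();
    }
  }
}
```

A caller would keep one breaker per provider, check `canRequest()` before each API call, and report the outcome with `recordSuccess()` or `recordFailure()`; a provider whose breaker refuses the call would be skipped in favor of a fallback.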