@namastexlabs/speak

Open source voice dictation for everyone

# Why Speak? Open Source Voice Dictation

## Speak vs Wispr Flow: A Transparent Comparison

Speak is an open-source voice dictation application that puts **you in control** of your data and workflow. Unlike closed-source alternatives, Speak gives you full transparency, privacy, and customization options.

| Feature | Speak | Wispr Flow |
|---------|-------|------------|
| **Open Source** | ✅ Full transparency, community-driven | ❌ Closed source, proprietary |
| **Linux Support** | ✅ Native support, first-class citizen | ❌ Not available |
| **Privacy** | ✅ Local-first processing, no data sharing by default | ⚠️ Cloud processing, broad usage rights |
| **Offline Mode** | 🔄 Roadmap (local Whisper models) | ❌ Requires internet connection |
| **Free Tier** | ✅ Unlimited usage | ⚠️ 2,000 words/week limit |
| **Self-Hosting** | ✅ Full self-hosting capabilities | ❌ Not available |
| **Customization** | ✅ Full control, extensible architecture | ⚠️ Limited customization |
| **Data Ownership** | ✅ You own all your data | ⚠️ Platform retains usage rights |
| **Cross-Platform** | ✅ Windows, macOS, Linux | ❌ Windows, macOS only |
| **Performance in Noise** | ✅ Advanced preprocessing, noise handling | ⚠️ Known issues in noisy environments |
| **Long Sessions** | ✅ Optimized for extended dictation | ⚠️ Performance degradation reported |

## 🎯 Our Mission: Voice Dictation for Everyone

**Speak exists to democratize voice dictation.** We believe everyone should have access to fast, accurate, and private voice-to-text technology, regardless of their platform or privacy requirements.
### What Makes Speak Different

#### 🔓 **Open Source First**

- **Full transparency**: Every line of code is open for inspection
- **Community driven**: Features are developed based on real user needs
- **No vendor lock-in**: You're never trapped in a proprietary ecosystem
- **Extensible**: Build your own features, integrations, and workflows

#### 🛡️ **Privacy by Design**

- **Local processing**: Your voice stays on your machine by default
- **No mandatory data collection**: We don't track you unless you opt in
- **Self-hosting ready**: Run everything on your own infrastructure
- **No data licensing**: Your data remains yours forever

#### 🐧 **True Cross-Platform**

- **Linux native**: First-class support for Linux distributions
- **Consistent experience**: Same features and performance across all platforms
- **Open architecture**: The community can port Speak to new platforms
- **No platform tax**: Equal features on all supported operating systems

#### ⚡ **Built for Real-World Use**

- **Noisy environments**: Advanced audio preprocessing for challenging conditions
- **Long sessions**: Optimized for extended dictation without performance degradation
- **Unlimited usage**: No artificial word limits or usage caps
- **Professional features**: Speaker diarization, custom vocabulary, domain-specific prompting

## 📊 Performance That Matters

### Speed & Accuracy

- **<2 second latency**: From voice to text insertion
- **>95% accuracy**: Powered by OpenAI's Whisper models
- **Multi-language**: 50+ languages with automatic detection
- **Real-time streaming**: Text appears as you speak, with no need to finish a complete thought first

### Reliability You Can Trust

- **Offline capable (roadmap)**: Local models coming soon
- **No internet required**: For core functionality, once local models ship
- **Robust audio handling**: Works in noisy environments
- **Enterprise-grade**: HIPAA-compliant options available

## 🚀 What's Next

We're just getting started.
Here's what's on our roadmap:

### Coming Soon (Q1 2026)

- **Offline mode**: Local Whisper models for complete privacy
- **Advanced voice commands**: "New paragraph," "format as list," etc.
- **Translation features**: Dictate in one language, output in another
- **Team collaboration**: Shared dictionaries and workflows

### Future Vision

- **Mobile apps**: iOS and Android companions
- **Browser extension**: Voice dictation in web applications
- **API access**: Integrate Speak into your own applications
- **Plugin ecosystem**: Community-built extensions and integrations

## 💡 Why Choose Speak?

### For Individuals

- **Privacy conscious**: Keep your voice data private and secure
- **Linux users**: Finally, professional voice dictation on Linux
- **Power users**: Full customization and extensibility
- **Students & writers**: Unlimited usage for long-form content

### For Teams & Organizations

- **Self-hosting**: Keep everything on your infrastructure
- **HIPAA compliance**: Medical and healthcare applications
- **Custom workflows**: Build organization-specific features
- **No usage limits**: Scale without worrying about costs

### For Developers

- **Open source**: Contribute to the codebase
- **API access**: Integrate voice features into your apps
- **Plugin system**: Build and share extensions
- **Community**: Join a growing ecosystem of voice technology

## 🔗 Get Started

Ready to try Speak? It's simple:

1. **Download**: Get the installer for your platform
2. **Configure**: Add your OpenAI API key (not required for the planned offline mode)
3. **Dictate**: Hold Ctrl+Win and start speaking

[Download Speak](./getting-started.md) | [Read the Docs](./) | [Contribute on GitHub](https://github.com/yourusername/speak)

---

*Speak is free, open source, and built for the future of voice interaction.*