@namastexlabs/speak
Open source voice dictation for everyone
# Why Speak? Open-Source Voice Dictation
## Speak vs Wispr Flow: A Transparent Comparison
Speak is an open-source voice dictation application that puts **you in control** of your data and workflow. Unlike closed-source alternatives, Speak gives you full transparency, privacy, and customization options.
| Feature | Speak | Wispr Flow |
|---------|-------|------------|
| **Open Source** | ✅ Full transparency, community-driven | ❌ Closed source, proprietary |
| **Linux Support** | ✅ Native support, first-class citizen | ❌ Not available |
| **Privacy** | ✅ Local-first processing, no data sharing by default | ⚠️ Cloud processing, broad usage rights |
| **Offline Mode** | 🔄 Roadmap (local Whisper models) | ❌ Requires internet connection |
| **Free Tier** | ✅ Unlimited usage | ⚠️ 2,000 words/week limit |
| **Self-Hosting** | ✅ Full self-hosting capabilities | ❌ Not available |
| **Customization** | ✅ Full control, extensible architecture | ⚠️ Limited customization |
| **Data Ownership** | ✅ You own all your data | ⚠️ Platform retains usage rights |
| **Cross-Platform** | ✅ Windows, macOS, Linux | ⚠️ Windows, macOS only |
| **Performance in Noise** | ✅ Advanced preprocessing, noise handling | ⚠️ Known issues in noisy environments |
| **Long Sessions** | ✅ Optimized for extended dictation | ⚠️ Performance degradation reported |
## 🎯 Our Mission: Voice Dictation for Everyone
**Speak exists to democratize voice dictation.** We believe everyone should have access to fast, accurate, and private voice-to-text technology, regardless of their platform or privacy requirements.
### What Makes Speak Different
#### 🔓 **Open Source First**
- **Full transparency**: Every line of code is open for inspection
- **Community-driven**: Features developed based on real user needs
- **No vendor lock-in**: You're never trapped in a proprietary ecosystem
- **Extensible**: Build your own features, integrations, and workflows
#### 🛡️ **Privacy by Design**
- **Local-first processing**: Audio goes only to the transcription provider you configure; fully local models are on the roadmap
- **No mandatory data collection**: We don't track you unless you opt in
- **Self-hosting ready**: Run everything on your own infrastructure
- **No content licensing**: We claim no usage rights over what you dictate; your data remains yours
#### 🐧 **True Cross-Platform**
- **Linux native**: First-class support for Linux distributions
- **Consistent experience**: Same features and performance across all platforms
- **Open architecture**: Community can port to new platforms
- **No platform tax**: Equal features on all supported operating systems
#### ⚡ **Built for Real-World Use**
- **Noisy environments**: Advanced audio preprocessing for challenging conditions
- **Long sessions**: Optimized for extended dictation without performance degradation
- **Unlimited usage**: No artificial word limits or usage caps
- **Professional features**: Speaker diarization, custom vocabulary, domain-specific prompting
## 📊 Performance That Matters
### Speed & Accuracy
- **<2 second latency**: From voice to text insertion
- **>95% accuracy**: Powered by OpenAI's advanced Whisper models
- **Multi-language**: 50+ languages with automatic detection
- **Real-time streaming**: No waiting for complete thoughts
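
This README does not say how the accuracy figure above is measured. One common convention is to report accuracy as 1 minus the word error rate (WER), where WER is the word-level edit distance between a reference transcript and the recognized text, divided by the number of reference words. The sketch below assumes that convention. `tokenize` and `wordErrorRate` are hypothetical helper names for illustration and are not part of Speak.

```typescript
// Split a transcript into comparable words.
// Simplification (assumption): ASCII letters and digits only, so this
// normalization suits English test sentences, not all 50+ languages.
function tokenize(text: string): string[] {
  return text
    .toLowerCase()
    .replace(/[^a-z0-9'\s]/g, " ")
    .split(/\s+/)
    .filter((word) => word.length > 0);
}

// WER = (substitutions + deletions + insertions) / reference word count,
// computed with a standard word-level Levenshtein distance.
function wordErrorRate(reference: string, hypothesis: string): number {
  const ref = tokenize(reference);
  const hyp = tokenize(hypothesis);
  if (ref.length === 0) {
    throw new Error("reference transcript must contain at least one word");
  }

  // prev[j] holds the distance between the reference words processed so far
  // and the first j hypothesis words.
  let prev: number[] = [];
  for (let j = 0; j <= hyp.length; j++) {
    prev.push(j);
  }

  for (let i = 1; i <= ref.length; i++) {
    const curr: number[] = [i];
    for (let j = 1; j <= hyp.length; j++) {
      const substitution = prev[j - 1] + (ref[i - 1] === hyp[j - 1] ? 0 : 1);
      const deletion = prev[j] + 1;
      const insertion = curr[j - 1] + 1;
      curr.push(Math.min(substitution, deletion, insertion));
    }
    prev = curr;
  }

  return prev[hyp.length] / ref.length;
}

// One wrong word out of ten: WER 0.1, i.e. 90% word accuracy.
console.log(
  wordErrorRate(
    "Open the file and start a new paragraph here now",
    "open the file and start a new paragraph hear now"
  )
);
```

Under this convention, a >95% claim corresponds to at most one word error per twenty reference words, averaged over whatever test set is used.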
### Reliability You Can Trust
- **Offline capable**: Local models coming soon (roadmap)
- **No internet required**: For core functionality once local models ship
- **Robust audio handling**: Works in noisy environments
- **Enterprise-grade**: HIPAA-compliant options available
## 🚀 What's Next
We're just getting started. Here's what's on our roadmap:
### Coming Soon (Q1 2026)
- **Offline mode**: Local Whisper models for complete privacy
- **Advanced voice commands**: "New paragraph," "format as list," etc.
- **Translation features**: Dictate in one language, output in another
- **Team collaboration**: Shared dictionaries and workflows
### Future Vision
- **Mobile apps**: iOS and Android companions
- **Browser extension**: Voice dictation in web applications
- **API access**: Integrate Speak into your own applications
- **Plugin ecosystem**: Community-built extensions and integrations
## 💡 Why Choose Speak?
### For Individuals
- **Privacy-conscious users**: Keep your voice data private and secure
- **Linux users**: Finally, professional voice dictation on Linux
- **Power users**: Full customization and extensibility
- **Students & writers**: Unlimited usage for long-form content
### For Teams & Organizations
- **Self-hosting**: Keep everything on your infrastructure
- **HIPAA compliance**: Medical and healthcare applications
- **Custom workflows**: Build organization-specific features
- **No usage limits**: Scale without word caps imposed by the app
### For Developers
- **Open source**: Contribute to the codebase
- **API access**: Integrate voice features into your apps
- **Plugin system**: Build and share extensions
- **Community**: Join a growing ecosystem of voice technology
## 🔗 Get Started
Ready to try Speak? It's simple:
1. **Download**: Get the installer for your platform
2. **Configure**: Add your OpenAI API key (not needed once offline mode ships)
3. **Dictate**: Hold Ctrl+Win and start speaking
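
To make the API key step concrete, here is a minimal sketch of what a cloud transcription request carries. This README does not document how Speak calls OpenAI, so treat the whole block as an assumption: it targets OpenAI's public audio transcription endpoint with the `whisper-1` model, and `TranscriptionRequest` and `buildTranscriptionRequest` are hypothetical names for illustration, not Speak's code.

```typescript
// Hypothetical descriptor type, introduced for this sketch only.
interface TranscriptionRequest {
  url: string;
  method: "POST";
  headers: { Authorization: string };
  // Form fields sent alongside the audio, which goes in a multipart "file" field.
  fields: { model: string; language?: string };
}

// Describe the HTTP request without sending it, so the sketch runs offline.
function buildTranscriptionRequest(
  apiKey: string,
  language?: string
): TranscriptionRequest {
  const key = apiKey.trim();
  if (key.length === 0) {
    throw new Error("an OpenAI API key is required for cloud transcription");
  }

  const fields: { model: string; language?: string } = { model: "whisper-1" };
  if (language !== undefined) {
    // Optional language hint (ISO 639-1 code); omit it to let the model detect.
    fields.language = language;
  }

  return {
    url: "https://api.openai.com/v1/audio/transcriptions",
    method: "POST",
    headers: { Authorization: "Bearer " + key },
    fields: fields,
  };
}

// Placeholder key for illustration; a real key should come from your own config.
console.log(buildTranscriptionRequest("sk-your-key", "en").url);
```

The key travels only in the `Authorization` header, which is why it should stay in local configuration and out of shared files or logs.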
[Download Speak](./getting-started.md) | [Read the Docs](./) | [Contribute on GitHub](https://github.com/yourusername/speak)
---
*Speak is free, open source, and built for the future of voice interaction.*