echogarden

Version:

An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

github.com/echogarden-project/echogarden

echogarden-project/echogarden

44 lines (37 loc) • 3.58 kB

Markdown

View Raw

# Echogarden components licensing ## Engines and libraries * `onnxruntime-node`: [MIT License](https://github.com/microsoft/onnxruntime/blob/main/LICENSE) * `espeak`: [GNU GPL v3](https://github.com/espeak-ng/espeak-ng/blob/master/COPYING) * `flite`: [BSD License](https://github.com/festvox/flite/blob/master/COPYING) * `pico`: [Apache License 2.0](https://github.com/gmn/nanotts/blob/master/LICENSE) * `gnuspeech`: [GNU GPL v3](https://github.com/mym-br/gnuspeech_sa/blob/master/COPYING.txt) * `sam`: [Abandonware / fair-use / unknown](https://github.com/discordier/sam#license) * `vosk` [Apache License 2.0](https://github.com/alphacep/vosk-api/blob/master/COPYING) * `tinyld` [MIT License](https://github.com/komodojp/tinyld/blob/develop/license) * `fasttext`: [MIT License](https://github.com/loretoparisi/fasttext.js/blob/master/LICENSE) * `fvad` [BSD-3-Clause](https://github.com/cpuimage/WebRTC_VAD/blob/master/LICENSE) * `kissfft`: [BSD-3-Clause](https://github.com/mborgerding/kissfft/blob/master/COPYING) * `pffft`: [BSD-3-Clause](https://github.com/marton78/pffft/blob/master/LICENSE.txt) * `rnnoise`: [BSD-3-Clause](https://github.com/xiph/rnnoise/blob/master/COPYING) * `sonic`: [Apache License 2.0](https://github.com/waywardgeek/sonic/blob/master/LICENSE) * `speex-resampler`: [BSD-3-Clause](https://github.com/xiph/speexdsp/blob/master/COPYING) * `rubberband`: [GNU GPL v2](https://github.com/breakfastquay/rubberband/blob/default/COPYING) * `jieba-wasm`: [MIT License](https://github.com/fengkx/jieba-wasm/blob/master/LICENSE) * `kuromoji`: [Apache License 2.0](https://github.com/takuyaa/kuromoji.js/blob/master/LICENSE-2.0.txt) * `tiktoken`: [MIT License](https://github.com/openai/tiktoken/blob/main/LICENSE) ## Package repository A large variety of voices, models and binaries are served from the repository. All are freely distributable, with varying licenses: * Kokoro model and voices (`kokoro-`) [Apache License 2.0](https://huggingface.co/hexgrad/Kokoro-82M) * Flite voices (`flite-`): [BSD License](https://github.com/festvox/flite/blob/master/COPYING) * SVOX Pico resources (`pico-`): [Apache License 2.0](https://github.com/gmn/nanotts/blob/master/LICENSE) * Silero VAD (`silero-vad`) and Silero language classifier (`silero-lang-classifier-95`): [MIT License](https://github.com/snakers4/silero-vad/blob/master/LICENSE) * Silero speech recognition models (`silero-en-`, `silero-de-`, `silero-es-`, `silero-ua-`): [BY-NC-SA](https://github.com/snakers4/silero-models/blob/master/LICENSE) * VITS pre-trained models (`vits-`): licensed under various creative commons licenses: [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/), [CC-BY](https://creativecommons.org/licenses/by/4.0/) and [BY-NC-SA](https://creativecommons.org/licenses/by-nc-sa/4.0/), and few are public domain. You can view the individual license for each model in the model cards on the [Piper samples page](https://rhasspy.github.io/piper-samples/) * Whisper pre-trained models (`whisper-`): [MIT License](https://github.com/openai/whisper/blob/main/LICENSE) * MDX-NET source separation models (`mdxnet-`): [MIT License](https://github.com/kuielab/mdx-net/blob/main/LICENSE) * NSNet2: [Attribution 4.0 International](https://github.com/NeonGeckoCom/nsnet2-denoiser/blob/main/LICENSE) Tool binary distributions * FFmpeg: [LGPL, GPL v2 and GPL v3 Licenses](https://github.com/FFmpeg/FFmpeg) * SoX: [GPL v2 License](https://github.com/chirlu/sox/blob/master/LICENSE.GPL) * whisper.cpp: [MIT License](https://github.com/ggerganov/whisper.cpp/blob/master/LICENSE)