@picovoice/leopard-react-native
Version:
Picovoice Leopard React Native binding
110 lines (76 loc) • 3.95 kB
Markdown
# Leopard Binding for React Native
## Leopard Speech-to-Text Engine
Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
Leopard is an on-device speech-to-text engine. Leopard is:
- Private; All voice processing runs locally.
- [Accurate](https://picovoice.ai/docs/benchmark/stt/)
- [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
- Cross-Platform:
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
- Android and iOS
- Chrome, Safari, Firefox, and Edge
- Raspberry Pi (3, 4, 5)
## Compatibility
This binding is for running Leopard on **React Native 0.62.2+** on the following platforms:
- Android 5.0+ (SDK 21+)
- iOS 11.0+
## Installation
To start install, be sure you have installed yarn and cocoapods. Then, add the following native modules to your react-native project:
```console
yarn add /leopard-react-native
```
or
```console
npm i /leopard-react-native --save
```
Link the iOS package
```console
cd ios && pod install && cd ..
```
**NOTE**: Due to a limitation in React Native CLI auto-linking, the native module cannot be included as a
transitive dependency. If you are creating a module that depends on leopard-react-native,
you will have to list these as peer dependencies and require developers to install it alongside.
## AccessKey
Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
## Usage
Create an instance of `Leopard`:
```typescript
import { Leopard, LeopardErrors } from '/leopard-react-native'
const accessKey = "${ACCESS_KEY}" // AccessKey obtained from Picovoice Console (https://console.picovoice.ai/)
const modelPath = "${LEOPARD_MODEL_PATH}" // path relative to the assets folder or absolute path to file on device
try {
const leopard = await Leopard.create(accessKey, modelPath)
} catch (err: any) {
if (err instanceof LeopardErrors) {
// handle error
}
}
```
Transcribe an audio file by passing the file path to Leopard:
```typescript
try {
const { transcript, words } = await leopard.processFile("${AUDIO_FILE_PATH}")
console.log(transcript)
} catch (err: any) {
if (err instanceof LeopardErrors) {
// handle error
}
}
```
Finally, when done be sure to explicitly release the resources using `leopard.delete()`.
### Language Model
Create a custom model using the [Picovoice Console](https://console.picovoice.ai/) or use one of the default language models found in [lib/common](../../lib/common).
#### Adding to Android
To add a Leopard model file to your Android application, add the file as a bundled resource by placing it under the `assets` directory of your Android application.
#### Adding to iOS
To add a Leopard model file to your iOS application, add the file as a bundled resource by selecting Build Phases in `Xcode` and adding it to the `Copy Bundle Resources` step.
### Word Metadata
Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
- **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
- **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
- **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
- **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
## Demo App
For example usage refer to our [React Native demo application](https://github.com/Picovoice/leopard/tree/master/demo/react-native).