@corti/dictation-web

# Corti Dictation SDK ## Overview The **Corti Dictation SDK** is a web component that enables real-time speech-to-text dictation using Corti's Dictation API. It provides a simple interface for capturing audio, streaming it to the API, and handling transcripts. > **Note:** OAuth 2.0 authentication is not handled by this SDK. The client must provide an API key or authorization token before using the component. ## Installation Include the SDK in your project by importing the JavaScript module: ```html npm i @corti/dictation-web ``` Then import the module like so: ```js // Import the Corti Dictation SDK import '@corti/dictation-web'; ``` Alternatively, use a CDN to start quickly (not recommended). ```html <script src="https://cdn.jsdelivr.net/npm/@corti/dictation-web/dist/bundle.min.js" preload type="module" ></script> ``` ## Usage ### Demo 🚀 [Hosted Demo](https://codepen.io/hccullen/pen/OPJmxQR) ### Basic Example ```html <!DOCTYPE html> <html lang="en"> <body> <corti-dictation></corti-dictation> <textarea id="transcript" placeholder="Transcript will appear here..." ></textarea> <script> import '@corti/dictation-web'; const dictation = document.getElementById('transcript'); dictation.setAccessToken('YOUR_AUTH_TOKEN'); // Note: Never hardcode tokens // Listen for events dictationEl.addEventListener('transcript', e => { document.getElementById('transcript').value += e.detail.data.text + ' '; }); </script> </body> </html> ``` ## API Reference ### Properties | Property | Type | Description | | -------------------- | ------- | ------------------------------------------------------------------------------------------------------------------------- | | `devices` | Array | List of available recording devices. | | `selectedDevice` | Object | The selected device used for recording (MediaDeviceInfo). | | `recordingState` | String | Current state of recording (`stopped`, `recording`, `initializing` and `stopping`, ). | | `dictationConfig` | Object | Configuration settings for dictation. | | `debug_displayAudio` | Boolean | Overrides any device selection and instead uses getDisplayMedia to stream system audio. Should only be used for debugging | ### Methods | Method | Description | | -------------------------------------- | ---------------------------------------------------------------- | | `toggleRecording()` | Starts or stops recording. | | `setAccessToken(access_token: string)` | Set the latest access token. This will return the server config. | ### Events | Event | Description | | --------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | `ready` | Fired once the component is ready. | | `recording-state-changed` | Fired when the recording state changes. `detail.state` contains the new state. | | `recording-devices-changed` | Fired when the user switches recording devices or the list of recording devices changes. `detail.devices` contains the full devices list. `detail.selectedDevice` contains the current selected device. | | `transcript` | Fired when a new transcript is received. `detail.data.text` contains the transcribed text. | | `command` | Fired whenever a new command is detected. | | `audio-level-changed` | Fired when the input audio level changes. `detail.audioLevel` contains the new level. | | `error` | Fired on error. `detail` contains the full error. | ## Authentication This SDK does not handle OAuth 2.0 authentication. The client must provide an API key or access token as a string using `setAccessToken`. ## Usage Examples Explore practical implementations and usage examples in the [Demo Folder](https://github.com/corticph/dictation-web-sdk/tree/main/demo). These demos can also be run locally. ## Styling ![UI Overview](https://github.com/corticph/dictation-web-sdk/blob/main/docs/ui.png) The default UI is designed to be slotted into existing applications seamlessly, however, it also supports custom styling as well as theming. The UI can be fully customized using CSS properties. Refer to our [Styling Guide](https://github.com/corticph/dictation-web-sdk/blob/main/docs/styling.md) for detailed instructions. ## License This SDK is licensed under MIT. ## Support For issues or questions, contact **Corti Support** at [support@corti.ai](mailto:help@corti.ai).