UNPKG

watson-speech

Version:

IBM Watson Speech to Text and Text to Speech SDK for web browsers.

2,267 lines (727 loc) 36.1 kB
<!DOCTYPE html> <html lang="en"> <head> <meta charset="utf-8"> <title>JSDoc: Global</title> <script src="scripts/prettify/prettify.js"> </script> <script src="scripts/prettify/lang-css.js"> </script> <!--[if lt IE 9]> <script src="//html5shiv.googlecode.com/svn/trunk/html5.js"></script> <![endif]--> <link type="text/css" rel="stylesheet" href="styles/prettify-tomorrow.css"> <link type="text/css" rel="stylesheet" href="styles/jsdoc-default.css"> </head> <body> <div id="main"> <h1 class="page-title">Global</h1> <section> <header> <h2></h2> </header> <article> <div class="container-overview"> <dl class="details"> </dl> </div> <h3 class="subsection-title">Methods</h3> <h4 class="name" id="getContentTypeFromFile"><span class="type-signature"></span>getContentTypeFromFile<span class="signature">(file)</span><span class="type-signature"> &rarr; {Promise}</span></h4> <div class="description"> <p>Reads the first few bytes of a binary file and resolves to the content-type if recognized &amp; supported</p> </div> <h5>Parameters:</h5> <table class="params"> <thead> <tr> <th>Name</th> <th>Type</th> <th class="last">Description</th> </tr> </thead> <tbody> <tr> <td class="name"><code>file</code></td> <td class="type"> <span class="param-type">File</span> | <span class="param-type">Blob</span> </td> <td class="description last"></td> </tr> </tbody> </table> <dl class="details"> <dt class="tag-source">Source:</dt> <dd class="tag-source"><ul class="dummy"><li> <a href="speech-to-text_file-player.js.html">speech-to-text/file-player.js</a>, <a href="speech-to-text_file-player.js.html#line58">line 58</a> </li></ul></dd> </dl> <h5>Returns:</h5> <dl> <dt> Type </dt> <dd> <span class="param-type">Promise</span> </dd> </dl> <h4 class="name" id="module:watson-speech/speech-to-text/get-models"><span class="type-signature"></span>module:watson-speech/speech-to-text/get-models<span class="signature">(options)</span><span class="type-signature"> &rarr; {Promise.&lt;T>}</span></h4> <div class="description"> <p>Returns a promise that resolves to an array of objects representing the available voice models. Example:</p> <pre class="prettyprint source lang-js"><code> [{ &quot;url&quot;: &quot;https://stream.watsonplatform.net/speech-to-text/api/v1/models/en-UK_BroadbandModel&quot;, &quot;rate&quot;: 16000, &quot;name&quot;: &quot;en-UK_BroadbandModel&quot;, &quot;language&quot;: &quot;en-UK&quot;, &quot;description&quot;: &quot;UK English broadband model.&quot; }, //... ]</code></pre><p> Requires fetch, pollyfill available at https://github.com/github/fetch</p> </div> <h5>Parameters:</h5> <table class="params"> <thead> <tr> <th>Name</th> <th>Type</th> <th class="last">Description</th> </tr> </thead> <tbody> <tr> <td class="name"><code>options</code></td> <td class="type"> <span class="param-type">Object</span> </td> <td class="description last"> <h6>Properties</h6> <table class="params"> <thead> <tr> <th>Name</th> <th>Type</th> <th class="last">Description</th> </tr> </thead> <tbody> <tr> <td class="name"><code>token</code></td> <td class="type"> <span class="param-type">String</span> </td> <td class="description last"><p>auth token</p></td> </tr> </tbody> </table> </td> </tr> </tbody> </table> <dl class="details"> <dt class="tag-source">Source:</dt> <dd class="tag-source"><ul class="dummy"><li> <a href="speech-to-text_get-models.js.html">speech-to-text/get-models.js</a>, <a href="speech-to-text_get-models.js.html#line43">line 43</a> </li></ul></dd> <dt class="tag-todo">To Do:</dt> <dd class="tag-todo"> <ul> <li>define format in @return statement</li> </ul> </dd> </dl> <h5>Returns:</h5> <dl> <dt> Type </dt> <dd> <span class="param-type">Promise.&lt;T></span> </dd> </dl> <h4 class="name" id="module:watson-speech/speech-to-text/recognize-file"><span class="type-signature"></span>module:watson-speech/speech-to-text/recognize-file<span class="signature">(options)</span><span class="type-signature"> &rarr; {<a href="RecognizeStream.html">RecognizeStream</a>|<a href="SpeakerStream.html">SpeakerStream</a>|<a href="FormatStream.html">FormatStream</a>|<a href="ResultStream.html">ResultStream</a>|<a href="TimingStream.html">TimingStream</a>}</span></h4> <div class="description"> <p>Create and return a RecognizeStream from a File or Blob (e.g. from a file <input>, a dragdrop target, or an ajax request)</p> </div> <h5>Parameters:</h5> <table class="params"> <thead> <tr> <th>Name</th> <th>Type</th> <th class="last">Description</th> </tr> </thead> <tbody> <tr> <td class="name"><code>options</code></td> <td class="type"> <span class="param-type">Object</span> </td> <td class="description last"><p>Also passed to {MediaElementAudioStream} and to {RecognizeStream}</p> <h6>Properties</h6> <table class="params"> <thead> <tr> <th>Name</th> <th>Type</th> <th>Attributes</th> <th>Default</th> <th class="last">Description</th> </tr> </thead> <tbody> <tr> <td class="name"><code>token</code></td> <td class="type"> <span class="param-type">String</span> </td> <td class="attributes"> </td> <td class="default"> </td> <td class="description last"><p>Auth Token - see https://github.com/watson-developer-cloud/node-sdk#authorization</p></td> </tr> <tr> <td class="name"><code>file</code></td> <td class="type"> <span class="param-type">Blob</span> | <span class="param-type">FileString</span> </td> <td class="attributes"> </td> <td class="default"> </td> <td class="description last"><p>String url or the raw audio data as a Blob or File instance to be transcribed (and optionally played). Playback may not with with Blob or File on mobile Safari.</p></td> </tr> <tr> <td class="name"><code>play</code></td> <td class="type"> <span class="param-type">Boolean</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> false </td> <td class="description last"><p>If a file is set, play it locally as it's being uploaded</p></td> </tr> <tr> <td class="name"><code>format</code></td> <td class="type"> <span class="param-type">Boolena</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> true </td> <td class="description last"><p>pipe the text through a {FormatStream} which performs light formatting. Also controls smart_formatting option unless explicitly set.</p></td> </tr> <tr> <td class="name"><code>realtime</code></td> <td class="type"> <span class="param-type">Boolena</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> options.play </td> <td class="description last"><p>pipe the text through a {TimingStream} which slows the output down to real-time to match the audio playback.</p></td> </tr> <tr> <td class="name"><code>outputElement</code></td> <td class="type"> <span class="param-type">String</span> | <span class="param-type">DOMElement</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> </td> <td class="description last"><p>pipe the text to a WriteableElementStream targeting the specified element. Also defaults objectMode to true to enable interim results.</p></td> </tr> <tr> <td class="name"><code>extractResults</code></td> <td class="type"> <span class="param-type">Boolean</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> false </td> <td class="description last"><p>pipe results through a ResultExtractor stream to simplify the objects. (Default behavior before v0.22) Automatically enables objectMode.</p></td> </tr> <tr> <td class="name"><code>resultsBySpeaker</code></td> <td class="type"> <span class="param-type">Boolean</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> false </td> <td class="description last"><p>pipe results through a SpeakerStream. Causes each data event to include multiple results, each with a speaker field. Automatically enables objectMode and speaker_labels. Adds some delay to processing.</p></td> </tr> </tbody> </table> </td> </tr> </tbody> </table> <dl class="details"> <dt class="tag-source">Source:</dt> <dd class="tag-source"><ul class="dummy"><li> <a href="speech-to-text_recognize-file.js.html">speech-to-text/recognize-file.js</a>, <a href="speech-to-text_recognize-file.js.html#line49">line 49</a> </li></ul></dd> </dl> <h5>Returns:</h5> <dl> <dt> Type </dt> <dd> <span class="param-type"><a href="RecognizeStream.html">RecognizeStream</a></span> | <span class="param-type"><a href="SpeakerStream.html">SpeakerStream</a></span> | <span class="param-type"><a href="FormatStream.html">FormatStream</a></span> | <span class="param-type"><a href="ResultStream.html">ResultStream</a></span> | <span class="param-type"><a href="TimingStream.html">TimingStream</a></span> </dd> </dl> <h4 class="name" id="module:watson-speech/speech-to-text/recognize-microphone"><span class="type-signature"></span>module:watson-speech/speech-to-text/recognize-microphone<span class="signature">(options)</span><span class="type-signature"> &rarr; {<a href="RecognizeStream.html">RecognizeStream</a>|<a href="SpeakerStream.html">SpeakerStream</a>|<a href="FormatStream.html">FormatStream</a>|<a href="ResultStream.html">ResultStream</a>}</span></h4> <div class="description"> <p>Create and return a RecognizeStream sourcing audio from the user's microphone</p> </div> <h5>Parameters:</h5> <table class="params"> <thead> <tr> <th>Name</th> <th>Type</th> <th class="last">Description</th> </tr> </thead> <tbody> <tr> <td class="name"><code>options</code></td> <td class="type"> <span class="param-type">Object</span> </td> <td class="description last"><p>Also passed to {RecognizeStream}, and {FormatStream} when applicable</p> <h6>Properties</h6> <table class="params"> <thead> <tr> <th>Name</th> <th>Type</th> <th>Attributes</th> <th>Default</th> <th class="last">Description</th> </tr> </thead> <tbody> <tr> <td class="name"><code>token</code></td> <td class="type"> <span class="param-type">String</span> </td> <td class="attributes"> </td> <td class="default"> </td> <td class="description last"><p>Auth Token - see https://github.com/watson-developer-cloud/node-sdk#authorization</p></td> </tr> <tr> <td class="name"><code>format</code></td> <td class="type"> <span class="param-type">Boolean</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> true </td> <td class="description last"><p>pipe the text through a FormatStream which performs light formatting. Also controls smart_formatting option unless explicitly set.</p></td> </tr> <tr> <td class="name"><code>keepMicrophone</code></td> <td class="type"> <span class="param-type">Boolean</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> false </td> <td class="description last"><p>keeps an internal reference to the microphone stream to reuse in subsequent calls (prevents multiple permissions dialogs in firefox)</p></td> </tr> <tr> <td class="name"><code>outputElement</code></td> <td class="type"> <span class="param-type">String</span> | <span class="param-type">DOMElement</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> </td> <td class="description last"><p>pipe the text to a <a href="WritableElementStream.html">WriteableElementStream</a> targeting the specified element. Also defaults objectMode to true to enable interim results.</p></td> </tr> <tr> <td class="name"><code>extractResults</code></td> <td class="type"> <span class="param-type">Boolean</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> false </td> <td class="description last"><p>pipe results through a ResultStream stream to simplify the objects. (Default behavior before v0.22) Requires objectMode.</p></td> </tr> <tr> <td class="name"><code>resultsBySpeaker</code></td> <td class="type"> <span class="param-type">Boolean</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> false </td> <td class="description last"><p>Pipe results through a SpeakerStream. Forces speaker_labels and objectMode to be true.</p></td> </tr> <tr> <td class="name"><code>mediaStream</code></td> <td class="type"> <span class="param-type">MediaStream</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> </td> <td class="description last"><p>Optionally pass in an existing MediaStream</p></td> </tr> </tbody> </table> </td> </tr> </tbody> </table> <dl class="details"> <dt class="tag-source">Source:</dt> <dd class="tag-source"><ul class="dummy"><li> <a href="speech-to-text_recognize-microphone.js.html">speech-to-text/recognize-microphone.js</a>, <a href="speech-to-text_recognize-microphone.js.html#line57">line 57</a> </li></ul></dd> </dl> <h5>Returns:</h5> <dl> <dt> Type </dt> <dd> <span class="param-type"><a href="RecognizeStream.html">RecognizeStream</a></span> | <span class="param-type"><a href="SpeakerStream.html">SpeakerStream</a></span> | <span class="param-type"><a href="FormatStream.html">FormatStream</a></span> | <span class="param-type"><a href="ResultStream.html">ResultStream</a></span> </dd> </dl> <h4 class="name" id="module:watson-speech/text-to-speech/get-voices"><span class="type-signature"></span>module:watson-speech/text-to-speech/get-voices<span class="signature">(options)</span><span class="type-signature"> &rarr; {Promise.&lt;T>}</span></h4> <div class="description"> <p>Returns a promise that resolves to an array of objects representing the available voices. Example:</p> <pre class="prettyprint source lang-js"><code> [{ &quot;name&quot;: &quot;en-US_MichaelVoice&quot;, &quot;language&quot;: &quot;en-US&quot;, &quot;customizable&quot;: true, &quot;gender&quot;: &quot;male&quot;, &quot;url&quot;: &quot;https://stream.watsonplatform.net/text-to-speech/api/v1/voices/en-US_MichaelVoice&quot;, &quot;description&quot;: &quot;Michael: American English male voice.&quot; }, //... ]</code></pre><p> Requires fetch, pollyfill available at https://github.com/github/fetch</p> </div> <h5>Parameters:</h5> <table class="params"> <thead> <tr> <th>Name</th> <th>Type</th> <th class="last">Description</th> </tr> </thead> <tbody> <tr> <td class="name"><code>options</code></td> <td class="type"> <span class="param-type">Object</span> </td> <td class="description last"> <h6>Properties</h6> <table class="params"> <thead> <tr> <th>Name</th> <th>Type</th> <th class="last">Description</th> </tr> </thead> <tbody> <tr> <td class="name"><code>token</code></td> <td class="type"> <span class="param-type">String</span> </td> <td class="description last"><p>auth token</p></td> </tr> </tbody> </table> </td> </tr> </tbody> </table> <dl class="details"> <dt class="tag-source">Source:</dt> <dd class="tag-source"><ul class="dummy"><li> <a href="text-to-speech_get-voices.js.html">text-to-speech/get-voices.js</a>, <a href="text-to-speech_get-voices.js.html#line44">line 44</a> </li></ul></dd> <dt class="tag-todo">To Do:</dt> <dd class="tag-todo"> <ul> <li>define format in @return statement</li> </ul> </dd> </dl> <h5>Returns:</h5> <dl> <dt> Type </dt> <dd> <span class="param-type">Promise.&lt;T></span> </dd> </dl> <h4 class="name" id="module:watson-speech/text-to-speech/synthesize"><span class="type-signature"></span>module:watson-speech/text-to-speech/synthesize<span class="signature">(options)</span><span class="type-signature"> &rarr; {Audio}</span></h4> <div class="description"> <p>Synthesize and play the supplied text over the computers speakers.</p> <p>Creates and returns a HTML5 <code>&lt;audio&gt;</code> element</p> </div> <h5>Parameters:</h5> <table class="params"> <thead> <tr> <th>Name</th> <th>Type</th> <th class="last">Description</th> </tr> </thead> <tbody> <tr> <td class="name"><code>options</code></td> <td class="type"> <span class="param-type">Object</span> </td> <td class="description last"> <h6>Properties</h6> <table class="params"> <thead> <tr> <th>Name</th> <th>Type</th> <th>Attributes</th> <th>Default</th> <th class="last">Description</th> </tr> </thead> <tbody> <tr> <td class="name"><code>token</code></td> <td class="type"> <span class="param-type">String</span> </td> <td class="attributes"> </td> <td class="default"> </td> <td class="description last"><p>auth token</p></td> </tr> <tr> <td class="name"><code>text</code></td> <td class="type"> <span class="param-type">String</span> </td> <td class="attributes"> </td> <td class="default"> </td> <td class="description last"><p>text to speak</p></td> </tr> <tr> <td class="name"><code>voice</code></td> <td class="type"> <span class="param-type">String</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> en-US_MichaelVoice </td> <td class="description last"><p>what voice to use - call getVoices() for a complete list.</p></td> </tr> <tr> <td class="name"><code>accept</code></td> <td class="type"> <span class="param-type">String</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> </td> <td class="description last"><p>specify desired audio format. Leave unset to allow (most) browsers to automatically negotiate an ideal format.</p></td> </tr> <tr> <td class="name"><code>X-Watson-Learning-Opt-Out</code></td> <td class="type"> <span class="param-type">Number</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> 0 </td> <td class="description last"><p>set to 1 to opt-out of allowing Watson to use this request to improve it's services</p></td> </tr> <tr> <td class="name"><code>autoPlay</code></td> <td class="type"> <span class="param-type">Boolean</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> true </td> <td class="description last"><p>automatically play the audio</p></td> </tr> <tr> <td class="name"><code>element</code></td> <td class="type"> <span class="param-type">DOMAudioElement</span> </td> <td class="attributes"> &lt;optional><br> </td> <td class="default"> </td> <td class="description last"><p><audio> element - will be used instead of creating a new one if provided</p></td> </tr> </tbody> </table> </td> </tr> </tbody> </table> <dl class="details"> <dt class="tag-source">Source:</dt> <dd class="tag-source"><ul class="dummy"><li> <a href="text-to-speech_synthesize.js.html">text-to-speech/synthesize.js</a>, <a href="text-to-speech_synthesize.js.html#line42">line 42</a> </li></ul></dd> <dt class="tag-see">See:</dt> <dd class="tag-see"> <ul> <li><a href="module-watson-speech_text-to-speech_get-voices.html">module:watson-speech/text-to-speech/get-voices</a></li> </ul> </dd> </dl> <h5>Returns:</h5> <dl> <dt> Type </dt> <dd> <span class="param-type">Audio</span> </dd> </dl> <h4 class="name" id="playFile"><span class="type-signature"></span>playFile<span class="signature">(file)</span><span class="type-signature"> &rarr; {Promise.&lt;<a href="FilePlayer.html">FilePlayer</a>>}</span></h4> <div class="description"> <p>Determines the file's content-type and then resolves to a FilePlayer instance</p> </div> <h5>Parameters:</h5> <table class="params"> <thead> <tr> <th>Name</th> <th>Type</th> <th class="last">Description</th> </tr> </thead> <tbody> <tr> <td class="name"><code>file</code></td> <td class="type"> <span class="param-type">File</span> | <span class="param-type">Blob</span> | <span class="param-type">String</span> </td> <td class="description last"><p>binary data or URL of audio file (binary data playback may not work on mobile Safari)</p></td> </tr> </tbody> </table> <dl class="details"> <dt class="tag-source">Source:</dt> <dd class="tag-source"><ul class="dummy"><li> <a href="speech-to-text_file-player.js.html">speech-to-text/file-player.js</a>, <a href="speech-to-text_file-player.js.html#line81">line 81</a> </li></ul></dd> </dl> <h5>Returns:</h5> <dl> <dt> Type </dt> <dd> <span class="param-type">Promise.&lt;<a href="FilePlayer.html">FilePlayer</a>></span> </dd> </dl> </article> </section> </div> <nav> <h2><a href="index.html">Home</a></h2><h3>Modules</h3><ul><li><a href="module-watson-speech.html">watson-speech</a></li><li><a href="module-watson-speech_speech-to-text.html">watson-speech/speech-to-text</a></li><li><a href="module-watson-speech_speech-to-text_get-models.html">watson-speech/speech-to-text/get-models</a></li><li><a href="module-watson-speech_speech-to-text_recognize-file.html">watson-speech/speech-to-text/recognize-file</a></li><li><a href="module-watson-speech_speech-to-text_recognize-microphone.html">watson-speech/speech-to-text/recognize-microphone</a></li><li><a href="module-watson-speech_text-to-speech.html">watson-speech/text-to-speech</a></li><li><a href="module-watson-speech_text-to-speech_get-voices.html">watson-speech/text-to-speech/get-voices</a></li><li><a href="module-watson-speech_text-to-speech_synthesize.html">watson-speech/text-to-speech/synthesize</a></li></ul><h3>Classes</h3><ul><li><a href="FilePlayer.html">FilePlayer</a></li><li><a href="FormatStream.html">FormatStream</a></li><li><a href="RecognizeStream.html">RecognizeStream</a></li><li><a href="ResultStream.html">ResultStream</a></li><li><a href="SpeakerStream.html">SpeakerStream</a></li><li><a href="TimingStream.html">TimingStream</a></li><li><a href="UrlPlayer.html">UrlPlayer</a></li><li><a href="WebAudioL16Stream.html">WebAudioL16Stream</a></li><li><a href="WritableElementStream.html">WritableElementStream</a></li></ul><h3>Events</h3><ul><li><a href="RecognizeStream.html#event:close">close</a></li><li><a href="RecognizeStream.html#event:data">data</a></li><li><a href="RecognizeStream.html#event:error">error</a></li><li><a href="RecognizeStream.html#event:listening">listening</a></li><li><a href="RecognizeStream.html#event:message">message</a></li><li><a href="RecognizeStream.html#event:open">open</a></li><li><a href="RecognizeStream.html#event:send-data">send-data</a></li><li><a href="RecognizeStream.html#event:send-json">send-json</a></li><li><a href="RecognizeStream.html#event:stop">stop</a></li><li><a href="SpeakerStream.html#event:data">data</a></li></ul><h3>Global</h3><ul><li><a href="global.html#getContentTypeFromFile">getContentTypeFromFile</a></li><li><a href="global.html#playFile">playFile</a></li></ul> </nav> <br class="clear"> <footer> Documentation generated by <a href="https://github.com/jsdoc3/jsdoc">JSDoc 3.4.3</a> on Tue Feb 21 2017 17:41:51 GMT+0000 (UTC) </footer> <script> prettyPrint(); </script> <script src="scripts/linenumber.js"> </script> </body> </html>