UNPKG

qminer

Version:

A C++ based data analytics platform for processing large-scale real-time streams containing structured and unstructured data

403 lines (392 loc) 16.9 kB
<!doctype html> <html> <head> <meta name="generator" content="JSDoc 3"> <meta charset="utf-8"> <title>Class: BiasedGk</title> <link rel="stylesheet" href="https://brick.a.ssl.fastly.net/Karla:400,400i,700,700i" type="text/css"> <link rel="stylesheet" href="https://brick.a.ssl.fastly.net/Noto+Serif:400,400i,700,700i" type="text/css"> <link rel="stylesheet" href="https://brick.a.ssl.fastly.net/Inconsolata:500" type="text/css"> <link href="css/baseline.css" rel="stylesheet"> </head> <body onload="prettyPrint()"> <nav id="jsdoc-navbar" role="navigation" class="jsdoc-navbar"> <div id="jsdoc-navbar-container"> <div id="jsdoc-navbar-content"> <a href="index.html" class="jsdoc-navbar-package-name">QMiner JavaScript API v9.4.0</a> </div> </div> </nav> <div id="jsdoc-body-container"> <div id="jsdoc-content"> <div id="jsdoc-content-container"> <div id="jsdoc-main" role="main"> <header class="page-header"> <div class="symbol-detail-labels"><span class="label label-kind">class</span>&nbsp;<span class="label label-static">static</span></div> <h1><small><a href="module-analytics.html">analytics</a>.<wbr></small><span class="symbol-name">BiasedGk</span></h1> <p class="source-link">Source: <a href="analyticsdoc.js.html#source-line-2683">analyticsdoc.<wbr>js:2683</a></p> <div class="symbol-classdesc"> <p>The CKMS (GK adapted for biased quantiles) algorithm for online biased quantile estimation. Given a probability p the algorithm returns the approximate value of the p-th quantile. The algorithm is most accurate in one of the extremes (which extreme depends on the parameters).</p> <p>The algorithm works by keeping a summary of buckets, each summarizing a range of values. Through the run of the algorithm new buckets are created and periodically merged if possible.</p> <p>It was first explained in: &quot;Effective Computation of Biased Quantiles over Data Streams&quot; https://www.cs.rutgers.edu/~muthu/bquant.pdf</p> <p>Only the biased version is implemented (the targeted version is flawed).</p> <p>The error is bounded by the rank of the element (not the absolute value). Specifically, the worst case relative error is bounded by max(eps<em>p, eps</em>p0) where eps is an accuracy paramter, p0 is the <code>targetProb</code> and p is the p-value set as the parameter of function <code>quantile</code>.</p> </div> <dl class="dl-compact"> </dl> </header> <section id="summary"> <div class="summary-callout"> <h2 class="summary-callout-heading">Properties</h2> <div class="summary-content"> <div class="summary-column"> <dl class="dl-summary-callout"> <dt><a href="module-analytics.BiasedGk.html#.init">init</a></dt> <dd> </dd> <dt><a href="module-analytics.BiasedGk.html#.memory">memory</a></dt> <dd> </dd> </dl> </div> <div class="summary-column"> <dl class="dl-summary-callout"> <dt><a href="module-analytics.BiasedGk.html#.samples">samples</a></dt> <dd> </dd> <dt><a href="module-analytics.BiasedGk.html#.size">size</a></dt> <dd> </dd> </dl> </div> <div class="summary-column"> </div> </div> </div> <div class="summary-callout"> <h2 class="summary-callout-heading">Methods</h2> <div class="summary-content"> <div class="summary-column"> <dl class="dl-summary-callout"> <dt><a href="module-analytics.BiasedGk.html#.compress">compress(val)</a></dt> <dd> </dd> <dt><a href="module-analytics.BiasedGk.html#.compress">compress()</a></dt> <dd> </dd> </dl> </div> <div class="summary-column"> <dl class="dl-summary-callout"> <dt><a href="module-analytics.BiasedGk.html#.save">save(fout)</a></dt> <dd> </dd> <dt><a href="module-analytics.BiasedGk.html#getParams">getParams()</a></dt> <dd> </dd> </dl> </div> <div class="summary-column"> <dl class="dl-summary-callout"> <dt><a href="module-analytics.BiasedGk.html#quantile">quantile(pVals)</a></dt> <dd> </dd> </dl> </div> </div> </div> </section> <section> <h2 id="BiasedGk">new&nbsp;<span class="symbol-name">BiasedGk</span><span class="signature"><span class="signature-params">([arg])</span></span></h2> <section> <h3> Example </h3> <div> <pre class="prettyprint"><code>// import modules var qm &#x3D; require(&#x27;qminer&#x27;); var fs &#x3D; require(&#x27;qminer&#x27;).fs; var quants &#x3D; qm.analytics.quantiles; // create the BiasedGk object var gk &#x3D; new quants.BiasedGk({ eps: 0.1, targetProb: 0.99, compression: &#x27;periodic&#x27;, useBands: true }); // create the data used for calculating quantiles var inputs &#x3D; [10, 1, 2, 8, 9, 5, 6, 4, 7, 3]; // fit the model for (var i &#x3D; 0; i &lt; inputs.length; i++) { gk.insert(inputs[i]); } // make the estimation for the 0.1 quantile var quant &#x3D; gk.quantile(0.1); // save the model gk.save(fs.openWrite(&#x27;gk.bin&#x27;)).close(); // open the gk model under a new variable var gk2 &#x3D; new analytics.quantiles.BiasedGk(fs.openRead(&#x27;gk.bin&#x27;));</code></pre> </div> </section> <section> <h3>Parameter</h3> <table class="jsdoc-details-table"> <thead> <tr> <th>Name</th> <th>Type</th> <th>Optional</th> <th>Description</th> </tr> </thead> <tbody> <tr> <td> <p>arg</p> </td> <td> <p>(module:analytics.quantiles~BiasedGkParam or module:fs:FIn)</p> </td> <td> <p>Yes</p> </td> <td> <p>Constructor arguments. There are 2 ways of constructing: <br>1. Using the module:analytics.quantiles~BiasedGkParam object, <br>2. using the file input stream <a href="module-fs.FIn.html">module:fs.FIn</a>. </p> </td> </tr> </tbody> </table> </section> <dl class="dl-compact"> </dl> </section> <section> <h2>Properties</h2> <section> <div class="symbol-detail-labels"><span class="label label-static">static</span></div> <h3 id=".init"><span class="symbol-name">init</span></h3> <p>Indicates whether the model is initialized (has seen at least one value).</p> <dl class="dl-compact"> </dl> <div class="symbol-detail-labels"><span class="label label-static">static</span></div> <h3 id=".memory"><span class="symbol-name">memory</span></h3> <p>Returns the models current memory consumption.</p> <dl class="dl-compact"> </dl> <div class="symbol-detail-labels"><span class="label label-static">static</span></div> <h3 id=".samples"><span class="symbol-name">samples</span></h3> <p>Returns the number of samples seen by the model.</p> <dl class="dl-compact"> </dl> <div class="symbol-detail-labels"><span class="label label-static">static</span></div> <h3 id=".size"><span class="symbol-name">size</span></h3> <p>Returns the current size of the algorithms summary in number of tuples.</p> <dl class="dl-compact"> </dl> </section> <h2>Methods</h2> <section> <div class="symbol-detail-labels"><span class="label label-static">static</span></div> <h3 id=".compress"><span class="symbol-name">compress</span><span class="signature"><span class="signature-params">(val)</span>&nbsp;&rarr; <span class="signature-returns"> module:analytics.quantiles.Gk</span></span></h3> <p>Adds a new value to the summary.</p> <section> <h4> Example </h4> <div> <pre class="prettyprint"><code>var qm &#x3D; require(&#x27;qminer&#x27;); var gk &#x3D; new qm.analytics.quantiles.BiasedGk(); gk.insert(1.0); gk.insert(2.0);</code></pre> </div> </section> <section> <h4>Parameter</h4> <table class="jsdoc-details-table"> <thead> <tr> <th>Name</th> <th>Type</th> <th>Optional</th> <th>Description</th> </tr> </thead> <tbody> <tr> <td> <p>val</p> </td> <td> <p>number</p> </td> <td> <p>&nbsp;</p> </td> <td> <p>the value</p> </td> </tr> </tbody> </table> </section> <dl class="dl-compact"> <dt>Returns</dt> <dd> <p><code>module:analytics.quantiles.Gk</code>B reference to self</p> </dd> </dl> <div class="symbol-detail-labels"><span class="label label-static">static</span></div> <h3 id=".compress"><span class="symbol-name">compress</span><span class="signature"><span class="signature-params">()</span></span></h3> <p>Manually runs the compression procedure.</p> <dl class="dl-compact"> <dt>Returns</dt> <dd> <p>reference to self</p> </dd> </dl> <div class="symbol-detail-labels"><span class="label label-static">static</span></div> <h3 id=".save"><span class="symbol-name">save</span><span class="signature"><span class="signature-params">(fout)</span>&nbsp;&rarr; <span class="signature-returns"> <a href="module-fs.FOut.html">module:fs.FOut</a></span></span></h3> <p>Saves the objects state into the output stream.</p> <section> <h4>Parameter</h4> <table class="jsdoc-details-table"> <thead> <tr> <th>Name</th> <th>Type</th> <th>Optional</th> <th>Description</th> </tr> </thead> <tbody> <tr> <td> <p>fout</p> </td> <td> <p><a href="module-fs.FOut.html">module:fs.FOut</a></p> </td> <td> <p>&nbsp;</p> </td> <td> <p>the output stream</p> </td> </tr> </tbody> </table> </section> <dl class="dl-compact"> <dt>Returns</dt> <dd> <p><code><a href="module-fs.FOut.html">module:fs.FOut</a></code>B - the output stream</p> </dd> </dl> <h3 id="getParams"><span class="symbol-name">getParams</span><span class="signature"><span class="signature-params">()</span>&nbsp;&rarr; <span class="signature-returns"> module:analytics.quantiles~BiasedGkParam</span></span></h3> <p>Returns the models' parameters as a JavaScript object (JSON). These parameters are the same as are set through the constructor.</p> <dl class="dl-compact"> <dt>Returns</dt> <dd> <p><code>module:analytics.quantiles~BiasedGkParam</code>B The construction parameters.</p> <p>var analytics = qm.analytics; var gk = new analytics.quantiles.BiasedGk(); var params = gk.getParams();</p> <p>console.log(params.targetProb); console.log(params.eps); console.log(params.autoCompress); console.log(params.useBands);</p> </dd> </dl> <h3 id="quantile"><span class="symbol-name">quantile</span><span class="signature"><span class="signature-params">(pVals)</span>&nbsp;&rarr; <span class="signature-returns"> (number or Array)</span></span></h3> <p>Given an input cumulative probability, returns a quantile associated with that probability (e.g. for input 0.5 it will return the median).</p> <section> <h4> Example </h4> <div> <pre class="prettyprint"><code>var qm &#x3D; require(&#x27;qminer&#x27;); var gk &#x3D; new qm.analytics.quantiles.BiasedGk({ eps: 0.1, targetProb: 0.01 }); gk.insert(1.0); gk.insert(2.0); gk.insert(1.0); gk.insert(3.0); gk.insert(2.0); console.log(gk.quantile(0.01)); // prints the first percentile console.log(gk.quantile(0.25)); // prints the first quartile console.log(gk.quantile(0.5)); // prints the median</code></pre> </div> </section> <section> <h4>Parameter</h4> <table class="jsdoc-details-table"> <thead> <tr> <th>Name</th> <th>Type</th> <th>Optional</th> <th>Description</th> </tr> </thead> <tbody> <tr> <td> <p>pVals</p> </td> <td> <p>(number or Array)</p> </td> <td> <p>&nbsp;</p> </td> <td> <p>the p-values which we a querying</p> </td> </tr> </tbody> </table> </section> <dl class="dl-compact"> <dt>Returns</dt> <dd> <p><code>(number or Array)</code>B quantiles - depending whether the input was a single value or array the method returns a quantile or array of quantiles</p> </dd> </dl> </section> </section> </div> </div> <nav id="jsdoc-toc-nav" role="navigation"></nav> </div> </div> <footer id="jsdoc-footer" class="jsdoc-footer"> <div id="jsdoc-footer-container"> <p> </p> </div> </footer> <script src="scripts/jquery.min.js"></script> <script src="scripts/tree.jquery.js"></script> <script src="scripts/prettify.js"></script> <script src="scripts/jsdoc-toc.js"></script> <script src="scripts/linenumber.js"></script> <script src="scripts/scrollanchor.js"></script> </body> </html>