@arcjet/redact-wasm
Version:
Arcjet sensitive information redaction detection engine
92 lines (70 loc) • 3.27 kB
Markdown
<a href="https://arcjet.com" target="_arcjet-home">
<picture>
<source media="(prefers-color-scheme: dark)" srcset="https://arcjet.com/logo/arcjet-dark-lockup-voyage-horizontal.svg">
<img src="https://arcjet.com/logo/arcjet-light-lockup-voyage-horizontal.svg" alt="Arcjet Logo" height="128" width="auto">
</picture>
</a>
# `@arcjet/redact-wasm`
<p>
<a href="https://www.npmjs.com/package/@arcjet/redact-wasm">
<picture>
<source media="(prefers-color-scheme: dark)" srcset="https://img.shields.io/npm/v/%40arcjet%2Fredact-wasm?style=flat-square&label=%E2%9C%A6Aj&labelColor=000000&color=5C5866">
<img alt="npm badge" src="https://img.shields.io/npm/v/%40arcjet%2Fredact-wasm?style=flat-square&label=%E2%9C%A6Aj&labelColor=ECE6F0&color=ECE6F0">
</picture>
</a>
</p>
[Arcjet][arcjet] sensitive information redaction detection engine.
## Installation
```shell
npm install -S @arcjet/redact-wasm
```
## Example
```ts
import { initializeWasm } from "@arcjet/redact-wasm";
import type { SensitiveInfoEntity } from "@arcjet/redact-wasm";
function noOpDetect(_tokens: string[]): Array<SensitiveInfoEntity | undefined> {
return [];
}
function noOpReplace(_input: SensitiveInfoEntity): string | undefined {
return undefined;
}
const wasm = await initializeWasm(noOpDetect, noOpReplace);
// If WebAssembly isn't available in the environment then it will be undefined.
if (typeof wasm !== "undefined") {
const config = {
entities: [],
contextWindowSize: 1,
skipCustomDetect: true,
skipCustomRedact: true,
};
const entities = wasm.redact("I am a string", config);
// Do something with entities that should be redacted.
} else {
throw new Error(
"redact failed to run because Wasm is not supported in this environment",
);
}
```
## Implementation
This package provides sensitive information identification and redaction logic implemented as a
WebAssembly module which will run local analysis on the provided string.
The generated `_virtual/arcjet_analyze_bindings_redact.component.js` file contains the binary inlined as
a base64 [Data URL][mdn-data-url] with the `application/wasm` MIME type.
This was chosen to save on storage space over inlining the file directly as a
Uint8Array, which would take up ~3x the space of the Wasm file. See
[Better Binary Batter: Mixing Base64 and Uint8Array][wasm-base64-blog] for more
details.
It is then decoded into an ArrayBuffer to be used directly via WebAssembly's
`compile()` function in our entry point file.
This is all done to avoid trying to read or bundle the Wasm asset in various
ways based on the platform or bundler a user is targeting. One example being
that Next.js requires special `asyncWebAssembly` webpack config to load our
Wasm file if we don't do this.
In the future, we hope to do away with this workaround when all bundlers
properly support consistent asset bundling techniques.
## License
Licensed under the [Apache License, Version 2.0][apache-license].
[arcjet]: https://arcjet.com
[mdn-data-url]: https://developer.mozilla.org/en-US/docs/Web/HTTP/Basics_of_HTTP/Data_URLs
[wasm-base64-blog]: https://blobfolio.com/2019/better-binary-batter-mixing-base64-and-uint8array/
[apache-license]: http://www.apache.org/licenses/LICENSE-2.0