@allemandi/embed-utils
Version:
Fast, type-safe utilities for vector embedding comparison and search.
114 lines (86 loc) โข 3.37 kB
Markdown
# ๐ @allemandi/embed-utils
[](https://www.npmjs.com/package/@allemandi/embed-utils)
[](https://github.com/allemandi/embed-utils/blob/main/LICENSE)
> **Fast, type-safe utilities for vector embedding comparison and search.**
>
> Works in Node.js, browsers โ supports ESM, CommonJS, and UMD
<!-- omit from toc -->
## ๐ Table of Contents
- [โจ Features](#-features)
- [๐ ๏ธ Installation](#๏ธ-installation)
- [๐ Quick Usage Examples](#-quick-usage-examples)
- [๐งช Tests](#-tests)
- [๐ Related Projects](#-related-projects)
- [๐ค Contributing](#-contributing)
## โจ Features
- ๐ Find nearest neighbors by cosine similarity, or Euclidean/Manhattan distance
- ๐ Compute, normalize, and verify vector similarity
- โก Lightweight and fast vector operations
## ๐ ๏ธ Installation
```bash
# Yarn
yarn add @allemandi/embed-utils
# NPM
npm install @allemandi/embed-utils
```
## ๐ Quick Usage Examples
> ๐ For a complete list of methods and options, see [the API docs](https://github.com/allemandi/embed-utils/blob/main/docs/API.md).
**ESM**
```js
import { computeCosineSimilarity } from '@allemandi/embed-utils';
```
**CommonJS**
```js
const { findNearestNeighbors } = require('@allemandi/embed-utils');
const samples = [
{ embedding: [0.1, 0.2, 0.3], label: 'sports' },
{ embedding: [0.9, 0.8, 0.7], label: 'finance' },
{ embedding: [0.05, 0.1, 0.15], label: 'sports' },
];
const query = [0.09, 0.18, 0.27];
// Find top 2 neighbors with similarity โฅ 0.5
// (default method: cosine similarity)
const resultsCosine = findNearestNeighbors(query, samples, { topK: 2, threshold: 0.5 });
console.log(resultsCosine);
// [ { embedding: [0.1, 0.2, 0.3], label: "sports", similarityScore: 1 },
// { embedding: [0.05, 0.1, 0.15], label: "sports", similarityScore: 1 } ]
// Find top 3 neighbors with Euclidean distance โค 1.1
const resultsEuclidean = findNearestNeighbors(query, samples, {
topK: 3,
threshold: 1.1,
method: 'euclidean',
});
console.log(resultsEuclidean.length);
// 2
// only 2 results that pass threshold conditions
```
**UMD (Browser)**
```js
<script src="https://unpkg.com/@allemandi/embed-utils"></script>
<script>
const vectorsToNormalize = [3, 4];
const result = window.allemandi.embedUtils.normalizeVector(vectorsToNormalize);
console.log(result);
</script>
```
## ๐งช Tests
> Available in the GitHub repo only.
```bash
# Run the test suite with Jest
yarn test
# or
npm test
```
## ๐ Related Projects
Check out these related projects that might interest you:
**[Embed Classify CLI](https://github.com/allemandi/embed-classify-cli)**
- Node.js CLI tool for local text classification using word embeddings.
**[Vector Knowledge Base](https://github.com/allemandi/vector-knowledge-base)**
- A minimalist command-line knowledge system with semantic memory capabilities using vector embeddings for information retrieval.
## ๐ค Contributing
If you have ideas, improvements, or new features:
1. Fork the project
2. Create your feature branch (git checkout -b feature/amazing-feature)
3. Commit your changes (git commit -m 'Add some amazing feature')
4. Push to the branch (git push origin feature/amazing-feature)
5. Open a Pull Request