UNPKG

@thi.ng/text-analysis

Version:

Text tokenization, transformation & analysis transducers, utilities, stop words, porter stemming, vector encodings, similarities

70 lines (44 loc) 2.94 kB
# Change Log - **Last updated**: 2025-08-04T09:13:01Z - **Generator**: [thi.ng/monopub](https://thi.ng/monopub) All notable changes to this project will be documented in this file. Only versions published since **2022-01-01** are listed here. Please consult the Git history for older version information. See [Conventional Commits](https://conventionalcommits.org/) for commit guidelines. **Note:** Unlisted _patch_ versions only involve non-code or otherwise excluded changes and/or version bumps of transitive dependencies. ### [0.4.7](https://github.com/thi-ng/umbrella/tree/@thi.ng/text-analysis@0.4.7) (2025-07-20) #### 🩹 Bug fixes - fix [#532](https://github.com/thi-ng/umbrella/issues/532), fix centralTermsVec() ([6e19974](https://github.com/thi-ng/umbrella/commit/6e19974)) - update non-zero check ## [0.4.0](https://github.com/thi-ng/umbrella/tree/@thi.ng/text-analysis@0.4.0) (2025-06-18) #### 🚀 Features - add `filterDocsFrequency()` ([6ac1f90](https://github.com/thi-ng/umbrella/commit/6ac1f90)) #### ⏱ Performance improvements - minor update kmeansDense() ([ebd5618](https://github.com/thi-ng/umbrella/commit/ebd5618)) - internal use `lookupUnsafe()` ### [0.3.1](https://github.com/thi-ng/umbrella/tree/@thi.ng/text-analysis@0.3.1) (2025-06-15) #### 🩹 Bug fixes - update pkg exports ([ea72b9f](https://github.com/thi-ng/umbrella/commit/ea72b9f)) ## [0.3.0](https://github.com/thi-ng/umbrella/tree/@thi.ng/text-analysis@0.3.0) (2025-06-15) #### 🚀 Features - update kmeansDense ([d35b6bd](https://github.com/thi-ng/umbrella/commit/d35b6bd)) - update results to include original `docs` for each cluster ## [0.2.0](https://github.com/thi-ng/umbrella/tree/@thi.ng/text-analysis@0.2.0) (2025-06-14) #### 🚀 Features - add/migrate refactored tf-idf functions ([d311acc](https://github.com/thi-ng/umbrella/commit/d311acc)) - add/update vocab & vector encoding helpers, restructure ([9e4f60c](https://github.com/thi-ng/umbrella/commit/9e4f60c)) - add filterDocsIDF() ([f682b58](https://github.com/thi-ng/umbrella/commit/f682b58)) - add k-mean clustering fns ([3533843](https://github.com/thi-ng/umbrella/commit/3533843)) #### ♻️ Refactoring - update imports/exports ([a44be87](https://github.com/thi-ng/umbrella/commit/a44be87)) ## [0.1.0](https://github.com/thi-ng/umbrella/tree/@thi.ng/text-analysis@0.1.0) (2025-06-09) #### 🚀 Features - import as new pkg ([89fe9bb](https://github.com/thi-ng/umbrella/commit/89fe9bb)) - add re-exports from transducers/vectors pkgs - add porter stemmer, contractions & stop words from old [@thi.ng/notes](https://github.com/thi-ng/umbrella/tree/main/packages/notes) repo - add porter fixtures - update defVocab() to use [@thi.ng/bidir-index](https://github.com/thi-ng/umbrella/tree/main/packages/bidir-index) - add sparse vector support - add/port cosine/jaccard similarities from vectors/sparse pkgs - add doc strings & examples