intertext
Version:
Services for Recurrent Text-related Tasks
58 lines (38 loc) • 3.02 kB
Markdown
# ⒾⓃⓉⒺⓇⓉⒺⓍⓉ
<!-- START doctoc generated TOC please keep comment here to allow auto update -->
<!-- DON'T EDIT THIS SECTION, INSTEAD RE-RUN doctoc TO UPDATE -->
**Table of Contents** *generated with [DocToc](https://github.com/thlorenz/doctoc)*
- [InterText: Services for Recurrent Text-related Tasks](#intertext-services-for-recurrent-text-related-tasks)
- [Related Links](#related-links)
- [To Do](#to-do)
<!-- END doctoc generated TOC please keep comment here to allow auto update -->
## InterText: Services for Recurrent Text-related Tasks
InterText provides pre-packaged solutioons for a number of tasks in text formatting and typesetting that
tend to show up frequently. I'm aiming at conducing comparative [benchmarks](./README-benchmarks.md) and
soundness checks for all solutions. Areas covered so far include:
* [**InterText HYPH**](./README-hyphenation.md) for hyphenating text in multiple languages (only en-US
covered so far, but underlying software is multilingual and configurable).
* [**InterText SLABS**](./README-slabs.md) for segmenting and re-assembling text according to *Unicode
Standard Annex #14: Unicode Line Breaking Algorithm* (UAX#14); this is useful to determine line breaking
opportunities (LBOs) for running text.
See also the (rough) [list of planned features](./README-planned.md).
* [**InterText UCD**](./README-ucd.md)
## Related Links
* [regexpu-core/property-escapes.md at master · mathiasbynens/regexpu-core](https://github.com/mathiasbynens/regexpu-core/blob/master/property-escapes.md)
* [Unicode property escapes in JavaScript regular expressions · Mathias Bynens](https://mathiasbynens.be/notes/es-unicode-property-escapes)
* [https://unicode.org/Public/UNIDATA/PropertyValueAliases.txt](https://unicode.org/Public/UNIDATA/PropertyValueAliases.txt)
* [https://unicode.org/Public/UNIDATA/PropertyAliases.txt](https://unicode.org/Public/UNIDATA/PropertyAliases.txt)
* [UAX #24: Unicode Script Property](https://unicode.org/reports/tr24/#Script_Extensions)
* [UTS #18: Unicode Regular Expressions](https://unicode.org/reports/tr18/#RL1.2)
* [UAX #44: Unicode Character Database](https://unicode.org/reports/tr44/#Ideographic)
* [Unicode Utilities: UnicodeSet](https://unicode.org/cldr/utility/list-unicodeset.jsp?a=%5B%5B:name=/CJK/:%5D-%5B:ideographic:%5D%5D)
## To Do
* [X] use `INTERTEXT.rpr()` for tabulation instead of `JSON.stringify()`
* [ ] implement path manipulation, integrate [`pathmap`](https://github.com/jeremyruppel/pathmap)
* [ ] integrate color-related code from [DataMill
colorizer](https://github.com/loveencounterflow/datamill/blob/2d0ca3a784c8f3f9ba8d9fd6277d18c4ee859fb1/src/experiments/colorizer.coffee)
* [ ] implement number formatting using `Intl.NumberFormat`, including percentages, rounding
* [ ] CupOfHTML: make compatible with Paragate HTMLish parser
* [ ] CupOfHTML: consider using template strings as in ``H`div#id.class` 'content'``
* [ ] turn into monorepo
* [ ] integrate `jzr-old/timetunnel`