@hydre/disk

<h1 align=center>@hydre/disk</h1> <p align=center> <img src="https://img.shields.io/github/license/hydreio/disk.svg?style=for-the-badge" /> <img src="https://img.shields.io/codecov/c/github/hydreio/disk/edge?logo=codecov&style=for-the-badge"/> <a href="https://www.npmjs.com/package/@hydre/disk"> <img src="https://img.shields.io/npm/v/@hydre/disk.svg?logo=npm&style=for-the-badge" /> </a> <img src="https://img.shields.io/npm/dw/@hydre/disk?logo=npm&style=for-the-badge" /> <img src="https://img.shields.io/github/workflow/status/hydreio/disk/CI?logo=Github&style=for-the-badge" /> </p> <h3 align=center>Store and query your GraphQL types in Redis</h3> <p align=center> <img src="https://i.imgur.com/SxjQfSU.png" /> </p> - [Requirements](#requirements) - [Install](#install) - [Usage](#usage) - [Schema](#schema) - [With Node](#with-node) - [CREATE](#create) - [DELETE](#delete) - [KEYS](#keys) - [GET](#get) - [SET](#set) ## Requirements Hydre/disk leverage Graphql AST to provide an ORM kind tool which map your schema to Redisearch indexes and allows you to easily query your datas. - Redis - Redisearch module enabled - Node > 14.4 - Advanced GraphQL knowledge ## Install ```sh npm install @hydre/disk ``` ## Usage `hydre-disk` cli synchronize your GraphQL schema to Redisearch indexes and reindex all hashes uppon changes. Use it manually or through your CI `@hydre/disk` module provide you a query interface for usage in node ### Schema > It is advised to use this tool in an advanced environement where > you use schema generation and manage the AST yourself which would > allow you to remove any trace of configuration mapping. ```sh hydre-disk --file ./schema.gql ``` | flag | default value | description | | --- | ---|---| | `--file` | Error (mandatory) | direct path to a graphql schema file which contain your types | | `--redis` | `'redis://localhost:6379'` | redis url | | `--sync` | | if an index exist it will be dropped and recreated to allow reindexing Synchronization steps (current version) 1. The provided schema is parsed into a graphql ast 2. The graphql ast is parsed into a hydre/disk ast 3. Each graphql `type` definition represent a redisearch index 4. `FT.INFO` is used to check for index existence 5. If the index exist we skip it (unless --sync is passed) >> _go to 3_ - when `--sync` is passed, any found index will be deleted `FT.DROP idx KEEPDOCS` 6. If the index doesn't exist, the ast is serialized into a `FT.CREATE` command and executed for the `type` (see examples below) 7. After creating the index, the hashes are reindexed serialy with `FT.ADDHASH` with a `SCAN <cursor> MATCH <namespace:*>` - `namespace` is the graphql `type` name, which is also the index name > WARN: Reindexed hashes will loose their document `WEIGHTS`, the scope of this tool > has no interest in using this redisearch feature so if you need documents > weight, you'll need to reindex them manually hydre/disk is a 1-1 mapping with [available schema options](https://oss.redislabs.com/redisearch/Commands/#ftcreate) ```gql type {index} @MAXTEXTFIELDS @TEMPORARY(seconds: Int!) @NOOFFSETS @NOHL @NOFIELDS @NOFREQS @STOPWORDS(words: [String!]!) { {field}: ... @TEXT @NOSTEM @WEIGHT(weight: Int! | Float!) @PHONETIC(matcher: String!) {field}: ... @NUMERIC {field}: ... @GEO {field}: ... @TAG @SEPARATOR(sep: String!) {field}: ... @{type} @SORTABLE @NOINDEX } ``` The tool detect fields through their **primary positionned** directives. If none of them is detected the `type` will simply be ignored. ```js const FIELD_TYPES = new Set(['TEXT', 'NUMERIC', 'GEO', 'TAG']) ``` Redisearch is an interface on top of redis hashes which are independants, that mean you can use a real graphql schema and specify redisearch directives only on field which needs to be queried by secondary indexes (more on the example below) About shipping the schema it is cleaner if you can `visit` the ast to remove directives before production to keep them internally, the second best option is to extract the types from your API schema and keep a graphql file dedicated only to redis, the third option is to declare all directives at the top of your schema to make them valid. > I myself only write types and generate multiple things from it, including > the production schema which contains no directives at all and is used remotly. **Real usecase :** ```gql type User @MAXTEXTFIELDS { uuid : ID! name : String! @TEXT @NOSTEM address : String! posts : [Post!]! cities : [String!]! @TAG @SEPARATOR(sep: ",") } type Post @STOPWORDS(words: ["i", "know", "right"]) { uuid : ID! date : Int! @NUMERIC text : String @TEXT @WEIGHT(weight: 12.3) } ``` Will yield ```sh FT.CREATE User MAXTEXTFIELDS SCHEMA name TEXT NOSTEM cities TAG SEPARATOR , FT.CREATE Post STOPWORDS "i" "know" "right" SCHEMA date NUMERIC text TEXT WEIGHT 12.3 ``` > I assume you're an adult and know what you are doing, no validation is done. > That mean if you specify an invalid set of directives or wrong types > you'll simply get a raw redis error. ### With Node The master client will be used for write operations, and the slave client will be used to read. ```js import Redis from 'ioredis' import Mount from '@hydre/disk' import events from 'events' const master = new Redis() const Disk = Mount({ master_client : master, slave_client : master, events_enabled: true, events_name : '__disk__', }) await events.once(master, 'ready') ``` If events are enabled Disk will publish to redis every CREATE, SET, and DELETE operations in a similar way as keyevent-notifications `PUBLISH "<events_name>:<operation_type>:<index_name>" <uuid>` ex: `PUBLISH __disk__:CREATE:User User:xxxx-xxxx-xxxx-xxxx` which you can listen all through `PSUBSCRIBE __disk__:*:*` here is a example of connexion retry backoff client ```js const client = new Redis({ host : 'redis://localhost:6379', retryStrategy: attempt => { if (attempt > 10) return new Error(`Can't connect to redis after ${ attempt } tries..`) return 250 * 2 ** attempt }, }) ``` When using the disk you must follow this pattern ```js Disk[<operation>][<type>](query) ``` You may often only need to retrieve documents by ids, to allow for better performance Disk is only using redisearch if you include the `search` option. When it is not specified it only use regular `HSET` and `HMGET` commands. - **operation** is one of - `CREATE` - `DELETE` - `KEYS` - `GET` - `SET` - **type** is the namespace, which is the graphql type name. It is used for the collection name (index) and the node namespace. A `type User` would yield an `User` redisearch index and `User:<uuid>` documents. Basically `FT.ADD User User:xxxx 1 FIELDS ...` - **query** is an option object containing the following ```js { // array of uuids to use instead of the whole database (INKEYS) // if not specified it will use the whole index keys : [''], // array of field names to return // if not specified it will return all fields fields : [''], limit : Infinity, // limit results offset : 0, // paginate search : '', // redisearch query string (https://oss.redislabs.com/redisearch/Query_Syntax/) // object to use for CREATE and SET operations // it use `REPLACE PARTIAL` for a SET operation document: {} } ``` #### CREATE > Creating a document execute a regular HSET and then try to index > it in redisearch, because we may not want an index for every types. Create will generate an uuid v4 and use it with the type as a namespace ```js const uuid = await Disk.CREATE.User({ document: pepeg }) // User:xxxx-xxxx-xxx.. ``` #### DELETE Delete matching documents from redis (DD) ```js const result = await Disk.DELETE.User({ search: '*' }) // result = raw result with an array of 'OK' ``` #### KEYS Return an array of matching keys (NOCONTENT), can be used to count element or check for existence ```js const [key1, key2, ...keys] = await Disk.KEYS.User({ search: '*' }) ``` #### GET Return an array of javascript object > note that redis only returns strings, Disk will convert back > so any `'1'` will be converted to a Number, `'true'` to a boolean. > Big numbers will be converted to native BigInts ```js const Paul = await Disk.GET.User({ fields: ['name', 'age'], search: '@name:Paul', limit : 1 }) ``` #### SET > Setting a field to `undefined` remove it (HDEL) Merge a document into all matching results ```js const result = await Disk.SET.User({ search : '*', document: { group : 'communism' } }) // result = raw redis array result (FT.ADD) ```