neptune-gremlin

# neptune-gremlin The `neptune-gremlin` package is an SDK for querying an Amazon Neptune graph database using gremlin. Amazon Neptune is is a fast, reliable, fully managed graph database service that makes it easy to build and run applications. It allows you to build connections between identities, build knowledge graphs, detect fraud patterns, and make predictions. It can also simply be used as a general purpose database, which is made easier by this package. The source for this package includes an AWS CDK application that creates a Neptune cluster and a Lambda function in the same VPC to facilitate integration testing. *NOTICE* _This is an experimental package that is not supported in any way by AWS. Please do not use it for mission critical production workloads!_ ## Installation ```sh npm install neptune-gremlin ``` ## Usage ### Connect to Neptune: ```Javascript const gremlin = require("neptune-gremlin") // Get configuration values from the environment const host = process.env.NEPTUNE_ENDPOINT const port = process.env.NEPTUNE_PORT const useIam = process.env.USE_IAM === "true" // Create a new connection to the Neptune database const connection = new gremlin.Connection(host, port, {useIam}) await connection.connect() ``` ### Save a node (vertex): ```Javascript const node1 = { "unique-id-1", properties: { name: "Test Node", a: "A", b: "B", }, labels: ["label1", "label2"], } await connection.saveNode(node1) ``` ### Save an edge ```JavaScript const edge1 = { id: uuid.v4(), label: "points_to", to: node2.id, from: node1.id, properties: { "a": "b", }, } await connection.saveEdge(edge1) ``` ### Get all nodes and edges in the graph ```JavaScript const searchResult = await connection.search({}) ``` ### Run a custom traversal: ```Javascript const f = (g) => { return await g.V() .has("person", "name", "Eric") .bothE().bothV().dedup() .valueMap(true).toList() } const result = await connection.query(f) ``` ### Partition the graph A Neptune cluster does not have a native paritioning concept. All nodes are in the same database. Gremlin has a feature called a partition strategy that adds a property to each node and edge automatically to segment your graph into different sub graphs. All you have to do with this library is set the partition on the connection: ```JavaScript connection.setPartition("test_partition") ``` All subsequent calls using that connection will have the `_partition` property added by default. ## Development ### Building the project This package is all Javascript, so the build script just runs eslint, copies `neptune-gremlin.js` into the cdk app's lambda folder, and then synthesizes the cdk app. ```sh npm run build ``` ### Sample application There is a sample app at [https://github.com/aws-samples/cdk-neptune-knowledge-graph](https://github.com/aws-samples/cdk-neptune-knowledge-graph) where you can see how to incorporate this library into a REST API. ### Making changes If you want to make a change or an addition to the package (contributions welcome!), please add a test to `cdk-test-app/lambda/integration-test.js`. Deploy the stack to your AWS account and invoke the function to make sure everything works as expected. ### Deploying the cdk test app Make sure you run the top level build script, since it copies the latest `neptune-gremlin.js` file into the cdk app's lambda folder. Also keep in mind that the very first call to a new Neptune cluster tends to fail with a 500, so if that happens, just try again. _Note that this app will result in charges in your AWS account! Be sure to destroy the stack when you are done!_ ```sh npm run build cd cdk-test-app npm install cd lambda npm install cd .. npx cdk bootstrap npx cdk synth npx cdk diff npx cdk deploy ``` ### Cleaning up the CDK stack to avoid charges to your AWS account ```sh cdk destroy ```