@contiamo/dev
Version:
Dev enviromnent for contiamo
242 lines (151 loc) • 8.96 kB
Markdown
# Contiamo Local Dev Environment
Get the dev environment fast!
## Quick overview
Get started:
- `make docker-auth`
- `make pull`
Get the latest versions:
- `git pull`
- `make pull`
Start everything in normal mode:
- `make start`
Stop everything:
- `make stop`
Stop everything and clean up:
- `make clean`
Prepare for Pantheon-external mode (only do this once):
- `make build`
- `sudo bash -c 'echo "127.0.0.1 metadb" >> /etc/hosts'`
Start everything in Pantheon-external mode:
- `make pantheon-start`
- (In Pantheon directory) `env METADB_URL="jdbc:postgresql://localhost:5433/pantheon?user=pantheon&password=test" DATASTORE_TYPE=external sbt run`
Enable TLS `verify-full` mode on port 5435:
- Download the private key for `*.dev.contiamo.io`: `make get-pg-key`
- `echo "127.0.0.1 pg-localhost.dev.contiamo.io" | sudo tee -a /etc/hosts`
- `make build`
- `make pantheon-start`
- You may need to tell your local `psql` about the IdenTrust root we happen to be using: `curl https://letsencrypt.org/certs/trustid-x3-root.pem.txt > ~/.postgresql/root.crt`
- `psql "user=lemon@example.com password=<token> dbname=<project UUID> sslmode=verify-full" -h pg-localhost.dev.contiamo.io -p 5435`
## Getting started
### Prerequisites
Local development is supported via Docker Compose.
Before you start, you must install [Docker](https://docs.docker.com/install/) and [Docker-Compose](https://docs.docker.com/install/).
Additionally, the development requires access to our private docker registry. To access this ask the Ops team for permissions. Once permissions have been granted you must [install the `gcloud` CLI](https://cloud.google.com/sdk/docs/).
Once installed, run
```sh
make docker-auth pull
```
This will attempt to
1. authenticate with Google,
2. configure your Docker installation to use the new Google credentials, and
3. pull the required Docker images.
### Starting The Environment
Finally, to start the development environment, run
```sh
make start
```
Once the environment has started, you should see a message with a URL and credentials, like this
```
Dev ui: http://localhost:9898/contiamo/profile
Email: lemon@example.com
Password: localdev
```
### Start and add an external datasource
We have a dataset available on GCR for internal testing. It is a Postgres database that contains a single table `liftdata`. This can be used for testing against a known external dataset.
After starting the local dev environment, run
```sh
docker run --name liftdata --rm --network dev_default eu.gcr.io/dev-and-test-env/deutschebahn-liftdata-postgres:v1.0.0
```
In the Data Hub, you can now add a external Postgres datasource using:
| field | value |
|--------|------------|
| `HOST` | `liftdata` |
| `PORT` | `5432` |
| `DATABASE` | `liftdata` |
| `USER` | `pantheon` |
| `PASS` | `contiamodatahub19` |
when you are done, run
```sh
docker kill liftdata
```
to stop and cleanup the database container.
## Stopping
You can always cleanly stop the environment using
```sh
make stop
```
Any data in the databases will be preserved between `stop` and `start`.
## Cleaning up
If you need to reclaim space or want to restart your environment from scratch use
```sh
make clean
```
This will stop your current environment and remove any Docker volumes related to it. This includes any data and metadata in the databases.
As time goes on, Docker will download new images, but it does not automatically garbage collect old images. To do so, run `docker system prune`.
On Mac, all Docker filesystem data is stored in a single file of a fixed size, which is 16GB or 32GB by default. You can configure the size of this file by clicking on the Docker Desktop tray icon -> Preferences -> Disk -> move the slider.
## Tips
* Run `make` or `make help` to see all available commands.
* You can also run these commands from a different directory, with e.g. `make -C /path/to/dev start`.
* The commands in the Makefile are very useful, but there's some extra stuff available if you use `docker-compose` straight. For instance, get all logs with `docker-compose logs --follow`, or only datastore worker logs with `docker-compose logs --follow ds-worker`. Refer to `docker-compose.yml` for the definitions of the services.
* To use `docker-compose` without `cd`'ing to this directory, use e.g. `docker-compose -f /path/to/dev/docker-compose.yml logs --follow`.
## Custom Images
The Compose file supports overriding the Docker tag used for a service by setting several environment variables:
| Server | Environment Variable | Default |
|-------------|----------------------|----------|
| datastore | `DATASTORE_TAG` | `dev` |
| idp | `IDP_TAG` | `dev` |
| pantheon | `PANTHEON_TAG` | `latest` |
| contiamo-ui | `CONTIAMOUI_TAG` | `latest` |
## Options to Postgres
In environment variable `POSTGRES_ARGS`, you can pass extra arguments to the PostgreSQL daemon. By defaults, this is set to `-c log_connections=on`. To log modification statements in addition to connections, start the dev environment with
env POSTGRES_ARGS="-c log_connections=on -c log_statement=mod" make start
You can inspect these logs with `docker-compose logs --follow metadb`. The four acceptable values for `log_statement` are `none`, `ddl`, `mod`, and `all`. Further Postgres options can be found here: https://www.postgresql.org/docs/11/runtime-config.html .
## Setting up Pantheon Local Development
Local Pantheon debug development is supported by port redirection. To set this up, you first need to run two extra steps.
1. Run
```sh
make build
```
This builds the `eu.gcr.io/dev-and-test-env/pantheon:redir` Docker image, a "pseudo-Pantheon" that forwards everything to your local Pantheon on `127.0.0.1` port `4300`. _Do not push this image!_
2. Modify your `/etc/hosts` file to add
```
127.0.0.1 metadb
```
You can easily do this with `sudo bash -c 'echo "127.0.0.1 metadb" >> /etc/hosts'`.
This ensures that Pantheon can correctly resolve the storage database service.
## Running the Pantheon Local Development
Make sure you first set up the prerequisites, and also set up for Pantheon local development.
To start the Pantheon dev environment use
```sh
make pantheon-start
```
This will replace the Pantheon image with a simple port redirection image that will enable transparent redirect of
- http://localhost:9898/pantheon/api/v1/* to http://localhost:4300/api/v1/* ,
- http://localhost:9898/pantheon/jdbc/* to http://localhost:8765/* .
You can then start your local Pantheon debug build, e.g. from your IDE, and have it bind to those ports on localhost. To configure the meta-DB and enable data store from Pantheon, run SBT with
```sh
env METADB_URL="jdbc:postgresql://localhost:5433/pantheon?user=pantheon&password=test" DATASTORE_TYPE=external sbt
```
or set the same environment variables in IntelliJ. You can also use `export METADB_URL="jdbc:postgresql://localhost:5433/pantheon?user=pantheon&password=test" DATASTORE_TYPE=external`, to set the environment variables in the current terminal.
----
The docker-compose configuration will expose the following ports for use from local Pantheon:
- Nginx web server at `127.0.0.1` port `9898` <-- Use this to access Data Hub including UI, IDP, Pantheon, Datastore.
- PostgreSQL meta-DB at `127.0.0.1` port `5433`, username `pantheon`, password `test`.
- Datastore manager at `127.0.0.1` port `9191`
- Minio (for ingested files) at `127.0.0.1` port `9000`
When accessing Pantheon via Nginx on port 9898, you need to prepend `/pantheon` to Pantheon URLs, for instance: http://localhost:9898/pantheon/api/v1/status . Nginx will strip off the `/pantheon`, authenticate the request with IDP, and forward the request to Pantheon as `/api/v1/status`.
Using the `pantheon`/`test` credentials for Postgres, you also have access to
- the `metadb` database, for datastore,
- collection databases corresponding to a managed DB,
- collection databases corresponding to materializations for a project,
- the `simpleidp` database.
## Running a custom Pantheon in prod mode
You can also run Pantheon in prod mode locally, as follows.
1. In `sbt` shell, run `dist`.
2. From a console, run `docker build -t eu.gcr.io/dev-and-test-env/pantheon:local .` This will download dependencies if they are not cached yet, build a Docker image for Pantheon, and tag it `local`.
3. Run `env PANTHEON_TAG=local make start`.
Now datastore and metadb will still be available on the usual ports, but Nginx will proxy to a prod-mode Pantheon which runs inside Docker. Pantheon will automatically be run with appropriate environment variables (https://github.com/contiamo/dev/blob/master/docker-compose.yml#L81).
**Warning!** Do not push this image to GCR. It may accidentally end up being deployed on dev.contiamo.io .
## Profiler Server
The Profiler currently lives at http://localhost:8383.
## Enjoy!