US Department of Transportation (USDOT) Intelligent Transportation Systems (ITS) Joint Program Office (JPO) Utilities
The JPO ITS utilities repository serves as a central location for deploying open-source utilities used by other JPO-ITS repositories.
Minimum Requirements
- Minimum RAM: 16 GB
- Supported operating systems:
  - Ubuntu 22.04 Linux (Recommended)
  - Windows 10/11 Professional (the Professional edition is required for Docker virtualization)
  - OSX 10 Mojave
    - NOTE: Not all images have ARM64 builds (they can still be run through a compatibility layer)
- Docker Compose V2 (compose file format version 3.4 or newer)
The jpo-utils repository is intended to be run with Docker Compose V2, as it uses functionality added in the V2 release.
Read the following guides to familiarize yourself with the jpo-utils Docker configuration.
Important!
You must rename `sample.env` to `.env` for Docker to automatically read the file. Do not push this file to source control.
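As a minimal sketch of this one-time setup (a stand-in `sample.env` is created here so the snippet is self-contained; in the real repository the file already exists at the repo root):

```shell
# Stand-in for the repo's sample.env, for illustration only.
printf 'DOCKER_HOST_IP=\nCOMPOSE_PROFILES=\n' > sample.env

# Docker Compose only auto-loads a file named exactly ".env".
cp sample.env .env
```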
The docker-compose-mongo file configures a MongoDB instance that is initialized as a standalone replica set with configured users. To use a different `setup_mongo.sh` or `create_indexes.js` script, pass in the relative path of the new script by overriding the `MONGO_SETUP_SCRIPT_RELATIVE_PATH` or `MONGO_CREATE_INDEXES_SCRIPT_RELATIVE_PATH` environment variable. These scripts initialize the MongoDB database and create its indexes.
The `COMPOSE_PROFILES` options in your `.env` file are as follows:

- `mongo_full` - deploys all resources in the docker-compose-mongo.yml file
- `mongo` - only deploys the `mongo` and `mongo-setup` services
- `mongo_express` - only deploys the `mongo-express` service
1. Create a copy of `sample.env` and rename it to `.env`.
2. Update the `DOCKER_HOST_IP` variable to the local IP address of the system running Docker, which can be found by running the `ifconfig` command.
   - Hint: look for "inet addr:" within "eth0" ("en0" for OSX)
3. Set the `MONGO_ADMIN_DB_PASS` and `MONGO_READ_WRITE_PASS` environment variables to a secure password.
4. Set the `COMPOSE_PROFILES` variable to `mongo_full`.
5. Run the following command: `docker-compose up -d`
6. Go to `localhost:8082` in your browser and verify that `mongo-express` can see the created database.
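The configuration steps above can be sketched as shell commands. The variable names come from this repo's `sample.env`; the passwords are placeholders and the IP lookup shown is Linux-specific:

```shell
# Stand-in for the copied sample.env; the real file ships with the repo.
printf 'DOCKER_HOST_IP=\nMONGO_ADMIN_DB_PASS=\nMONGO_READ_WRITE_PASS=\nCOMPOSE_PROFILES=\n' > .env

# Look up the host IP (Linux); on OSX try: ipconfig getifaddr en0
ip=$(hostname -I 2>/dev/null | awk '{print $1}')

# Fill in the variables in place (use real secrets, not these placeholders).
sed -i.bak \
  -e "s/^DOCKER_HOST_IP=.*/DOCKER_HOST_IP=${ip}/" \
  -e "s/^MONGO_ADMIN_DB_PASS=.*/MONGO_ADMIN_DB_PASS=example-admin-pass/" \
  -e "s/^MONGO_READ_WRITE_PASS=.*/MONGO_READ_WRITE_PASS=example-rw-pass/" \
  -e "s/^COMPOSE_PROFILES=.*/COMPOSE_PROFILES=mongo_full/" \
  .env
grep '^COMPOSE_PROFILES' .env
```

After this, `docker-compose up -d` picks up the `.env` file from the current directory.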
The Bitnami Kafka image is used as a hybrid controller and broker in the docker-compose-kafka file. To use a different `kafka_init.sh` script, pass in the relative path of the new script by overriding the `KAFKA_INIT_SCRIPT_RELATIVE_PATH` environment variable. This can help initialize new topics at startup.
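For example, a custom init script could be wired in through the `.env` file; the script path below is purely illustrative:

```shell
# Point kafka-setup at a custom topic-init script.
# "my-scripts/custom_kafka_init.sh" is a made-up example path,
# relative to the jpo-utils repository root.
echo 'KAFKA_INIT_SCRIPT_RELATIVE_PATH=my-scripts/custom_kafka_init.sh' >> .env
grep '^KAFKA_INIT_SCRIPT_RELATIVE_PATH' .env
```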
An optional `kafka-init`, `schema-registry`, and `kafka-ui` instance can be deployed by configuring `COMPOSE_PROFILES` as follows:

- `kafka_full` - deploys all resources in the docker-compose-kafka.yml file
- `kafka` - only deploys the `kafka` service
- `kafka_setup` - deploys a `kafka-setup` service that creates topics in the `kafka` service
- `kafka_schema_registry` - deploys a `kafka-schema-registry` service that can be used to manage schemas for Kafka topics
- `kafka_ui` - deploys a web interface to interact with the Kafka cluster
The Kafka topics created by the `kafka-setup` service are configured in the kafka-topics-values.yaml file. The topics in that file are organized by application and sorted into "Stream Topics" (those with `cleanup.policy` = `delete`) and "Table Topics" (those with `cleanup.policy` = `compact`).

The following environment variables can be used to configure Kafka topic creation.
| Environment Variable | Description |
|---|---|
| `KAFKA_TOPIC_CREATE_ODE` | Whether to create topics for the ODE |
| `KAFKA_TOPIC_CREATE_GEOJSONCONVERTER` | Whether to create topics for the GeoJSON Converter |
| `KAFKA_TOPIC_CREATE_CONFLICTMONITOR` | Whether to create topics for the Conflict Monitor |
| `KAFKA_TOPIC_CREATE_DEDUPLICATOR` | Whether to create topics for the Deduplicator |
| `KAFKA_TOPIC_PARTITIONS` | Number of partitions |
| `KAFKA_TOPIC_REPLICAS` | Number of replicas |
| `KAFKA_TOPIC_MIN_INSYNC_REPLICAS` | Minimum number of in-sync replicas (for use with `acks=all`) |
| `KAFKA_TOPIC_RETENTION_MS` | Retention time for stream topics, in milliseconds |
| `KAFKA_TOPIC_DELETE_RETENTION_MS` | Tombstone retention time for compacted topics, in milliseconds |
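As a sketch, the table's variables might be set in `.env` like this (the values shown are illustrative, not the repo's defaults):

```shell
# Append example topic-creation settings to the .env file.
cat >> .env <<'EOF'
KAFKA_TOPIC_CREATE_ODE=true
KAFKA_TOPIC_CREATE_GEOJSONCONVERTER=false
KAFKA_TOPIC_PARTITIONS=1
KAFKA_TOPIC_REPLICAS=1
KAFKA_TOPIC_MIN_INSYNC_REPLICAS=1
KAFKA_TOPIC_RETENTION_MS=300000
KAFKA_TOPIC_DELETE_RETENTION_MS=3600000
EOF
grep -c '^KAFKA_TOPIC' .env
```

Single-broker deployments need `KAFKA_TOPIC_REPLICAS=1`, since replicas cannot exceed the broker count.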
1. Create a copy of `sample.env` and rename it to `.env`.
2. Update the `DOCKER_HOST_IP` variable to the local IP address of the system running Docker, which can be found by running the `ifconfig` command.
   - Hint: look for "inet addr:" within "eth0" ("en0" for OSX)
3. Set the `COMPOSE_PROFILES` variable to `kafka_full`.
4. Run the following command: `docker-compose up -d`
5. Go to `localhost:8001` in your browser and verify that `kafka-ui` can see the created Kafka cluster and initialized topics.
The mongo-connector service connects to specified Kafka topics (as defined in the mongo-connector/connect_start.sh script) and deposits those messages into separate collections in the MongoDB database. This functionality comes from Confluent's community-licensed cp-kafka-connect image. Documentation for this image can be found here.

Provided in the mongo-connector directory is a sample configuration shell script (connect_start.sh) that can be used to create Kafka connectors to MongoDB. The connectors in Kafka Connect are defined in the following format:
```shell
declare -A config_name=([name]="topic_name" [collection]="mongo_collection_name"
  [convert_timestamp]=true [timefield]="timestamp" [use_key]=true [key]="key" [add_timestamp]=true)
```
The format above describes the basic configuration of a sink connector; it should be placed at the beginning of the connect_start.sh file. In general, we recommend keeping the MongoDB collection name the same as the topic name to avoid confusion. Additionally, if there is a top-level time field, set `convert_timestamp` to true and specify the time field name that appears in the message. This allows MongoDB to transform that field into a date object, which enables TTL creation and reduces message size. To override MongoDB's default message `_id` field, set `use_key` to true and set the `key` property to "key". The `add_timestamp` field defines whether the connector will add an auto-generated timestamp to each document, which allows for the creation of Time To Live (TTL) indexes on the collections to help limit collection size growth.
After the sink connector is configured as above, make sure to call the createSink function with the config_name of the configuration, like so:

```shell
createSink config_name
```

This call needs to be placed after the createSink function definition. To use a different `connect_start.sh` script, pass in the relative path of the new script by overriding the `CONNECT_SCRIPT_RELATIVE_PATH` environment variable.
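Putting the pieces together, a hypothetical connector entry in the connect_start.sh style might look like this (the topic, collection, and time field names are illustrative, not taken from the repo's script; requires bash for the associative array):

```shell
# Hypothetical sink-connector entry (bash associative array, as in connect_start.sh).
# Collection name matches the topic name, per the recommendation above.
declare -A SampleSinkConfig=([name]="topic.OdeBsmJson" [collection]="OdeBsmJson"
  [convert_timestamp]=true [timefield]="recordGeneratedAt" [use_key]=true
  [key]="key" [add_timestamp]=true)

# In the real script this declaration is followed by: createSink SampleSinkConfig
echo "${SampleSinkConfig[collection]}"
```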
Set the `COMPOSE_PROFILES` environment variable as follows:

- `kafka_connect` will only spin up the `kafka-connect` service in docker-compose-connect
  - NOTE: This implies that you will be using a separate Kafka and MongoDB cluster
- `kafka_connect_standalone` will run the following:
  - `kafka-connect` service from docker-compose-connect
  - `kafka` service from docker-compose-kafka
  - `mongo` and `mongo-setup` services from docker-compose-mongo
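As a sketch, selecting the standalone profile combination in `.env` (the profile string matches this repo's quick-run steps; everything else about your `.env` is left untouched):

```shell
# Select the standalone profile set: kafka-connect plus local Kafka and MongoDB,
# with the kafka-ui and mongo-express web interfaces for verification.
echo 'COMPOSE_PROFILES=kafka_connect_standalone,mongo_express,kafka_ui,kafka_setup' >> .env
grep '^COMPOSE_PROFILES' .env
```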
1. Create a copy of `sample.env` and rename it to `.env`.
2. Update the `DOCKER_HOST_IP` variable to the local IP address of the system running Docker.
3. Set the `MONGO_ADMIN_DB_PASS` and `MONGO_READ_WRITE_PASS` environment variables to a secure password.
4. Set the `COMPOSE_PROFILES` variable to `kafka_connect_standalone,mongo_express,kafka_ui,kafka_setup`.
5. Navigate back to the root directory and run the following command: `docker compose up -d`
6. Produce a sample message to one of the sink topics by using `kafka-ui`:
   1. Go to `localhost:8001`
   2. Click local -> Topics
   3. Select `topic.OdeBsmJson`
   4. Select `Produce Message`
   5. Leave the defaults except set the `Value` field to `{"foo":"bar"}`
   6. Click `Produce Message`
7. View the synced message in `mongo-express`:
   1. Go to `localhost:8082`
   2. Click `ode` - or whatever value you set `MONGO_DB_NAME` to
   3. Click `OdeBsmJson`, and now you should see your message!
8. Feel free to test this with other topics or by producing to these topics using the ODE.