The aim of this project is to read data from Azure Storage and store it in a Cosmo Tech Twin Cache instance:
- Read Azure Storage blob files according to the environment variables
- Store the data into the Cosmo Tech Twin Cache instance
Here is the list of environment variables:
- AZURE_CLIENT_ID: the Azure client ID (can be found under the App registrations screen)
- AZURE_TENANT_ID: the Azure tenant ID (can be found under the App registrations screen)
- AZURE_CLIENT_SECRET: the app client secret (an existing secret cannot be retrieved after creation, so either ask its creator for it or create a new one)
- ACCOUNT_NAME: the targeted storage account name
- CONTAINER_NAME: the targeted container name
- STORAGE_PATH: the targeted file path
- TWIN_CACHE_HOST: the twin cache host
- TWIN_CACHE_PORT: the twin cache port
- TWIN_CACHE_NAME: the twin cache key name where data will be stored
- TWIN_CACHE_PASSWORD: the default account/user password (defaults to None)
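For reference, here is a minimal Python sketch of how these variables can drive the Azure SDK (azure-identity and azure-storage-blob). It illustrates the configuration above and is not necessarily the connector's actual code:

```python
import os

from azure.identity import EnvironmentCredential
from azure.storage.blob import BlobServiceClient

# EnvironmentCredential reads AZURE_CLIENT_ID, AZURE_TENANT_ID and
# AZURE_CLIENT_SECRET directly from the environment.
credential = EnvironmentCredential()

service = BlobServiceClient(
    account_url=f"https://{os.environ['ACCOUNT_NAME']}.blob.core.windows.net",
    credential=credential,
)
container = service.get_container_client(os.environ["CONTAINER_NAME"])

# List and download every blob stored under STORAGE_PATH.
for blob in container.list_blobs(name_starts_with=os.environ["STORAGE_PATH"]):
    data = container.download_blob(blob.name).readall()
```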
The default log level is "INFO". Logging relies on Python's standard logging module. You can change the log level by setting an environment variable named LOG_LEVEL. Log levels identify the severity of an event and are ordered from most severe to least severe:
- CRITICAL
- ERROR
- WARNING
- INFO
- DEBUG
- NOTSET
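As an illustration, the mapping from LOG_LEVEL to the logging module can be as simple as the following sketch (not necessarily the connector's exact code):

```python
import logging
import os

# Fall back to the connector's default level, INFO, when LOG_LEVEL is unset.
logging.basicConfig(level=os.environ.get("LOG_LEVEL", "INFO"))
logging.getLogger(__name__).info("Connector starting")
```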
Build the Docker image:
docker build -t azstorage-twincache-connector .
Fill the file docker_envvars with your information:
AZURE_CLIENT_ID=<azure_client_id>
AZURE_TENANT_ID=<azure_tenant_id>
AZURE_CLIENT_SECRET=<azure_client_secret>
ACCOUNT_NAME=<storage_account_name>
CONTAINER_NAME=<storage_container_name>
STORAGE_PATH=<storage_path>
TWIN_CACHE_HOST=<twin_cache_host>
TWIN_CACHE_NAME=<twin_cache_name>
TWIN_CACHE_PORT=<twin_cache_port>
TWIN_CACHE_PASSWORD=<twin_cache_password>
LOG_LEVEL=DEBUG
Then run:
./run.sh
N.B.:
- The default log level is set to 'INFO'
The only authorized data format is CSV (BOM encoding is not supported).
The azStorage-twincache-connector will read all CSV files under the storage location specified by <ACCOUNT_NAME>, <CONTAINER_NAME> and <STORAGE_PATH>.
The connector reads each CSV file's header and checks whether the column pair ('src', 'dest') or ('source', 'target') is present, in order to distinguish twin CSV files from relationship CSV files (a sketch of this check follows the notes below).
Bulk insert is based on the redisgraph-bulk-loader package.
N.B.: In the current version (1.1.5), schema enforcement is not handled, so you should respect the default format, i.e.:
- For twin CSV files (see node-identifiers in the redisgraph-bulk-loader documentation):
  - the first column of the file must be 'id' or 'name'; if both are detected, the import will be canceled
- For relationship CSV files (see relationships-identifiers in the redisgraph-bulk-loader documentation):
  - the first and second columns must be ('src', 'dest') or ('source', 'target'); if both pairs are detected, the import will be canceled
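To make these rules concrete, here is a minimal Python sketch of the header check described above; the function name and the exact error handling are assumptions for illustration, not the connector's actual code:

```python
import csv
import io

REL_PAIRS = [("src", "dest"), ("source", "target")]

def classify_csv(payload: bytes) -> str:
    """Return 'relationship' or 'twin' depending on the header columns,
    following the rules above. Illustrative sketch only."""
    header = next(csv.reader(io.StringIO(payload.decode("utf-8"))))
    columns = {c.strip().lower() for c in header}

    rel_matches = [pair for pair in REL_PAIRS if set(pair) <= columns]
    if len(rel_matches) > 1:
        # Both ('src', 'dest') and ('source', 'target') are present.
        raise ValueError("ambiguous relationship header, import canceled")
    if rel_matches:
        return "relationship"

    if {"id", "name"} <= columns:
        # Both 'id' and 'name' are present.
        raise ValueError("ambiguous twin header, import canceled")
    return "twin"
```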