Skip to content

A command line porter for Vanilla Knowledge.

Notifications You must be signed in to change notification settings

vanilla/knowledge-porter

Repository files navigation

knowledge-porter

A command line porter for Vanilla Knowledge.

Example:

./bin/knowledge-porter import --config="./customer1.zendesk.json"

Where ./customer1.zendesk.json is relative path to your configuration file.

Configuration file contains set of feature flags for porter to configure source and destination (data & api). Example:

{
    "source": {
        "type": "zendesk",
        "protocol": "http", // For local setup only
        "prefix": "{PREFIX}", // prefix to set foreign id on destination records
        "domain": "{ZENDESK_API_DOMAIN}", // eg: help.gink.com
        "token": "{EMAIL_TOKEN}", // eg: dev@mail.com/token:xxQWERTYptodnoL
        "targetDomain": "{NEW_DOMAIN_PATH}", // eg: dev.vanilla.localhost
        "perPage": 2,
        "pageFrom": 1,
        "pageTo": 100,
        "sourceLocale": {Zendesk locale} // eg: "en-us"
        "destinationLocale": {locale} // eg: "en-us"
        "import": {
            "categories": true,
            "retrySections": true,
            "sections": true,
            "authors": true,
            "articles": true,
            "translations": true,
            "attachments": true,
            "helpful": true,
            "fetchPrivateArticles": false,
            "fetchDraft": false
        },
        "api": {
            "cache": true,
            "log": true
        }
    },
    "destination": {
        "type": "vanilla",
        "domain": "{VANILLA_API_DESTINATION_DOMAIN}", // ex: dev.vanilla.localhost
        "token": "va.{VANILLA_API_TOKEN}",
        "update": "onChange",
         "api": {
            "cache": false,
            "log": true,
            "verbose": false // Display more informations for each request
        }
        // by default we don't want KB to be patched after 1st sync 
        // that will allow to avoid kb-url-slug update if edited on vanilla side
        "patchKnowledgeBase": false, 
        "syncUserByEmailOnly": false
    }
}

Alternatively, you can set the porter configuration into an environmental variable and tell the porter to use the variable instead of the configuration file.

Example, assuming your configuration is set on the variable CONFIG:

./bin/knowledge-porter import --config="ENV:CONFIG"

Note: The environmental variable must contain a valid JSON configuration in the same way that a file configuration should be.

Rate Limit Bypass for Vanilla's Infrastructure

If the destination is type vanilla, you might want to use the rate_limit_bypass_token (optional) configuration parameter to set your rate limit bypass token.

Users and Authors

When porter import articles it is searching for existing user by email and uses it if found.

If user is not found by email porter checks configuration feature flag syncUserByEmailOnly and if that flag is false (dafault) search user by name.

If user still can not be found porter will create new user.

Import from multiple domains

In case we need to import data from different domains and they all could be reached using same api credentials/token there is another tool mulit-import.sh.

This script get one argument which is folder name with special configuration file. For example gink.

That folder should have 2 files inside: template.json and domains. Screen Shot 2020-03-25 at 5 22 12 PM

template.json is regular config json file for this porter. but with 2 special values prefix and source-domain :

{
 "source": {
        "type": "zendesk",
        "foreignIDPrefix": "{prefix}",
        "domain": "{source-domain}",
        ...
  },
 "destination": {
        "type": "vanilla",
        ...
    }
}

and domains file has very flat structure domain.com=prefix like:

diamond.gink.com=diamond
digger.gink.com=digger
betty.gink.com=betty

This bash script will:

  • read domains and loop trough
  • get each domain and prefix value
  • copy template.json to /conf subfolder with new name. Eg: diamond.gink.com.json
  • substitute prefix and source-domain in that new configuration file
  • run knowledge-porter command in background with this per domain prepared config

Example:

./bin/multi-import.sh gink

This multi-import script starts many php processes in background (one process per domain).

We can check progress using various log files created by this tool in targeted {folder}/log.

There is one special log file {folder}/log/import.log.

It has PIDs of running php processes in case developer need to stop them or investigate any incident.