diff --git a/docs/dev/developer-guide.md b/docs/dev/developer-guide.md
index 73234ba69..783e74e16 100644
--- a/docs/dev/developer-guide.md
+++ b/docs/dev/developer-guide.md
@@ -1,155 +1,261 @@
# Developer guide
-This guide describes how to setup a development environment for building and running
+This guide describes:
-- Nextclade CLI executable
-- Nextalign CLI executable
-- Nextclade Web and Nextclade WebAssembly module
+- how to download source code for Nextclade CLI and Nextclade Web
+- how to setup a development environment
+- how to build and run Nextclade CLI and Nextclade Web
+- how the official distributions are maintained, released and deployed
+
+This is only useful if you know programming at least a little or is curations about how Nextclade is built.
+
+> ⚠️ If you are Nextclade user or is looking to familiarize yourself with Nextclade usage and features, then refer to [Nextclade user documentation](https://docs.nextstrain.org/projects/nextclade/en/stable/index.html) instead.
+
+> ⚠️ This guide assumes basic familiarity with Nextclade Web and/or Nextclade CLI as well as certain technical skills.
+
+> ⚠️ Datasets are managed in a [separate repository](https://github.com/nextstrain/nextclade_data)
## Developing locally
-### Nextclade CLI and Nextalign CLI
+### Nextclade CLI
-Nextclade CLI and Nextalign CLI are written in Rust. The usual `rustup` & `cargo` workflow can be used:
+Nextclade CLI is written in Rust programming language. The usual `rustup` & `cargo` workflow can be used.
-```bash
-# Clone Nextclade git repository
-git clone https://github.com/nextstrain/nextclade
-cd nextclade
-
-# Install Rustup, the Rust version manager (https://www.rust-lang.org/tools/install)
-curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
-
-# Add Rust tools to the $PATH
-export PATH="$PATH:$HOME/.cargo/bin"
-
-# [Linux only] install openssl and pkgconfig. Example for Ubuntu:
-sudo apt-get update
-sudo apt-get install --yes libssl-dev pkg-config
-
-# Prepare dotenv file with default values
-cp .env.example .env
-
-# Build Nextclade and use it to download a dataset to ./data_dev/
-# In later commands, a dataset is expected to be in ./data_dev/, refer to Nextclade dataset information
-# (https://docs.nextstrain.org/projects/nextclade/en/stable/user/datasets.html#download-a-dataset)
-cargo run --bin=nextclade -- dataset \
- get --name=sars-cov-2 \
- --output-dir=data_dev/
-
-# Build and run Nextclade in debug mode (convenient for development, fast to build, slow to run, has debug info)
-# Refer to the user documentation for explanation of Nextclade CLI flags (https://docs.nextstrain.org/projects/nextclade/en/stable/)
-cargo run --bin=nextclade -- run \
- data_dev/sequences.fasta \
- --input-dataset=data_dev/ \
- --output-fasta='out/nextclade.aligned.fasta' \
- --output-tsv='out/nextclade.tsv' \
- --output-tree='out/nextclade.tree.json' \
- --in-order \
- --include-reference
-
-# Build Nextclade in release mode (slow to build, fast to run, no debug info)
-cargo build --release --bin=nextclade
-
-# Run Nextclade release binary
-./target/release/nextclade run \
- data_dev/sequences.fasta \
- --input-dataset=data_dev/ \
- --output-fasta='out/nextclade.aligned.fasta' \
- --output-tsv='out/nextclade.tsv' \
- --output-tree='out/nextclade.tree.json' \
- --in-order \
- --include-reference
-
-# Add -v flags to increase verbosity of output
-# nextclade run ... -vv
-
-# To build Nextalign, replace 'nextclade' with 'nextalign'
-# cargo build --release --bin=nextalign
+If you are not familiar with Rust, please refer to documentation:
-```
+- [Rust](https://www.rust-lang.org/learn) - the programming language itself
+- [Rustup](https://rust-lang.github.io/rustup/) - Rust toolchain installer and version manager
+- [Cargo](https://doc.rust-lang.org/cargo/) - Rust package manager
+
+as well as to the `--help` text for each tool.
+
+#### Steps
+
+1. Obtain source code (once)
+
+ Make sure you have [git](https://git-scm.com/) installed.
+
+ Clone Nextclade git repository:
+
+ ```bash
+ git clone https://github.com/nextstrain/nextclade
+ cd nextclade
+ ```
+
+ > 💡 We accept pull requests on GitHub. If you want to submit a with new feature or a bug fixe, then make a GitHub account, [make a fork](https://docs.github.com/en/get-started/quickstart/fork-a-repo) of the [origin Nextclade repository](https://github.com/nextstrain/nextclade) and clone your forked repository instead. Refer to [GitHub documentation "Contributing to projects"](https://docs.github.com/en/get-started/quickstart/contributing-to-projects) for more details.
+
+ > 💡 Make sure you [keep your local code up to date](https://github.com/git-guides/git-pull) with the origin repo, [especially if it's forked](https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork).
+
+ > 💡 If you are a member of Nextstrain team, then you don't need a fork and you can contribute directly to the origin repository. Still, in most cases, please submit pull requests for review, rather than pushing changes to branches directly.
+
+2. Install Rust if not already (https://www.rust-lang.org/tools/install):
+
+ ```bash
+ # [once] Install Rustup, the Rust version manager
+ curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
+
+ # [once] Add Rust tools to the $PATH
+ export PATH="$PATH:$HOME/.cargo/bin"
+
+ # [once] [Linux only] install openssl and pkgconfig. Example for Ubuntu:
+ sudo apt-get update
+ sudo apt-get install --yes libssl-dev pkg-config
+
+ # Check your installed versions of Rust compiler, Cargo and Rustup
+ $ rustc -V
+ $ cargo -V
+ $ rustup -V
+ ```
+
+ > ⚠️ We don't support Rust installations deviating from the [officially recommended steps](https://doc.rust-lang.org/book/ch01-01-installation.html). If you decide to go that route install Rust from Linux OS package repositories, Homebrew, Conda etc., things may or may not work, or they may work sometimes. Nextclade team don't have bandwidth to try every platform and config, so you are on your own here. But feel free to open pull requests with fixes, if necessary.
+
+ > 💡 Note, Rustup allows to install multiple versions of Rust and to switch between them. This repository contains a [rust-toolchain.toml](../../rust-toolchain.toml) file, which describes which version of Rust is currently in use by Nextclade official build. Cargo and Rustup should be able to [pick it up automatically](https://rust-lang.github.io/rustup/overrides.html#the-toolchain-file), install the required toolchain and use it when you type `cargo` commands.
+
+3. Prepare environment variables which configure Nextclade build-time settings (once)
+
+ ```bash
+ # [once] Prepare dotenv file with default values
+ cp .env.example .env
+ ```
+
+4. Build and run Nextclade CLI in debug mode (convenient for development - faster to build, slow to run, has more debug info in the executable, error messages are more elaborate):
+
+ ```bash
+ # (Re-)build Nextclade in debug mode.
+ # By default, the resulting executable will be in `target/debug/nextclade`.
+ cargo build --bin=nextclade
+
+ # (Re-)build Nextclade in debug mode and run `nextclade --help` to print Nextclade CLI main help screen. The arguments after the `--` are passed to nextclade executable, as if you'd run it directly. Refer to Nextclade CLI user documentation instead (https://docs.nextstrain.org/projects/nextclade/en/stable/index.html) for explanation of arguments.
+ cargo run --bin=nextclade -- --help
+
+ # (Re-)build Nextclade in debug mode and use it to download a dataset to `data_dev/` directory.
+ cargo run --bin=nextclade -- dataset get \
+ --name='nextstrain/sars-cov-2/MN908947' \
+ --output-dir='data_dev/nextstrain/sars-cov-2/MN908947/'
+
+ # (Re-)build Nextclade in debug mode and run the analysis using the dataset we just downloaded (to `data_dev/`) and output results to the `out/` directory.
+ cargo run --bin=nextclade -- run \
+ 'data_dev/nextstrain/sars-cov-2/MN908947/sequences.fasta' \
+ --input-dataset='data_dev/nextstrain/sars-cov-2/MN908947/' \
+ --output-dir='out/'
+ ```
+
+ > 💡 Note, depending on your computer hardware and internet speed, your first build can take significant amount of time, because the Rust toolchain and all dependency packages will be downloaded and compiled. Next time the existing toolchain and cached packages are used, so the repeated builds should be much faster.
+
+ > 💡 Add `-v` to Nextclade arguments to make console output more verbose. Add more occurrences, e.g. `-vv`, for even more verbose output.
+
+5. Build and run Nextclade CLI in release mode (slow to build, fast to run, very little debug info):
+
+ ```bash
+ # Build Nextclade in release mode.
+ # By default, the resulting executable will be in `target/release/nextclade`.
+ cargo build --bin=nextclade --release
+
+ # Run Nextclade release binary
+ ./target/release/nextclade run \
+ 'data_dev/nextstrain/sars-cov-2/MN908947/sequences.fasta' \
+ --input-dataset='data_dev/nextstrain/sars-cov-2/MN908947' \
+ --output-fasta='out/nextclade.aligned.fasta' \
+ --output-tsv='out/nextclade.tsv' \
+ --output-tree='out/nextclade.tree.json' \
+ --in-order \
+ --include-reference
+
+ ```
+
+ > 💡 Debug builds are incremental, i.e. only the files that have changed since last build are compiled. But release builds are not. If you need to quickly iterate on features, then use debug builds. If you are measuring performance, always use release builds.
### Nextclade Web
-Nextclade Web is a React Typescript application, which relies on Nextclade WebAssembly (wasm) module to perform the computation.
-The WebAssembly module shares the algorithms Rust code with Nextclade CLI. So building Nextclade Web involves 2 steps: building WebAssembly module and building the app itself.
+Nextclade Web is a React Typescript application, which relies on Nextclade WebAssembly (wasm) module to perform the computation. This WebAssembly module shares the same the Rust code for algorithms with Nextclade CLI. So building Nextclade Web involves 2 steps:
+
+- building WebAssembly module
+- building the web application itself
Install Node.js version 14+ (latest LTS release is recommended), by either downloading it from the official website: https://nodejs.org/en/download/, or by using [nvm](https://github.com/nvm-sh/nvm). We don't recommend using Node.js from the package manager of your operating system, and neither from conda or other sources.
-Let's build the WebAssembly module:
+> ⚠️ We don't have bandwidth to support Node.ks installations deviating from the officially recommended setup, e.g. from Linux OS package repositories, Homebrew, Conda etc. If you decide to go that route - things may work or they may not - you are on your own. But feel free to open pull requests with fixes if necessary.
-
-Show extra requirements for macOS
+#### Steps
-For macOS, you will also have to install llvm:
+1. Obtain source code (once)
-```bash
-brew install llvm
-```
+ Make sure you have [git](https://git-scm.com/) installed.
-Furthermore, you will need to set the following environment variables before invoking `yarn wasm-prod`:
+ Clone Nextclade git repository:
-```bash
-export CC=/opt/homebrew/opt/llvm/bin/clang
-export AR=/opt/homebrew/opt/llvm/bin/llvm-ar
-```
-
+ ```bash
+ git clone https://github.com/nextstrain/nextclade
+ cd nextclade
+ ```
-```bash
-# Clone Nextclade git repository
-git clone https://github.com/nextstrain/nextclade
-cd nextclade
+ > 💡 We accept pull requests on GitHub. If you want to submit a with new feature or a bug fixe, then make a GitHub account, [make a fork](https://docs.github.com/en/get-started/quickstart/fork-a-repo) of the [origin Nextclade repository](https://github.com/nextstrain/nextclade) and clone your forked repository instead. Refer to [GitHub documentation "Contributing to projects"](https://docs.github.com/en/get-started/quickstart/contributing-to-projects) for more details.
-# Install Rustup, the Rust version manager (https://www.rust-lang.org/tools/install)
-curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
+ > 💡 Make sure you [keep your local code up to date](https://github.com/git-guides/git-pull) with the origin repo, [especially if it's forked](https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork).
-# Add Rust tools to the $PATH
-export PATH="$PATH:$HOME/.cargo/bin"
+ > 💡 If you are a member of Nextstrain team, then you don't need a fork and you can contribute directly to the origin repository. Still, in most cases, please submit pull requests for review, rather than pushing changes to branches directly.
-# Prepare dotenv file with default values
-cp .env.example .env
+2. Install Rust if not already (https://www.rust-lang.org/tools/install):
-# Install wasm-pack
-cargo install wasm-pack
-cargo install wasm-opt
+ ```bash
+ # [once] Install Rustup, the Rust version manager
+ curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
+
+ # [once] Add Rust tools to the $PATH
+ export PATH="$PATH:$HOME/.cargo/bin"
+
+ # [once] [Linux only] install openssl and pkgconfig. Example for Ubuntu:
+ sudo apt-get update
+ sudo apt-get install --yes libssl-dev pkg-config
+
+ # Check your installed versions of Rust compiler, Cargo and Rustup
+ $ rustc -V
+ $ cargo -V
+ $ rustup -V
+ ```
-cd packages_rs/nextclade-web
+ > ⚠️ We don't support Rust installations deviating from the [officially recommended steps](https://doc.rust-lang.org/book/ch01-01-installation.html). If you decide to go that route install Rust from Linux OS package repositories, Homebrew, Conda etc., things may or may not work, or they may work sometimes. Nextclade team don't have bandwidth to try every platform and config, so you are on your own here. But feel free to open pull requests with fixes, if necessary.
-# Install dependency packages
-yarn install
+ > 💡 Note, Rustup allows to install multiple versions of Rust and to switch between them. This repository contains a [rust-toolchain.toml](../../rust-toolchain.toml) file, which describes which version of Rust is currently in use by Nextclade official build. Cargo and Rustup should be able to [pick it up automatically](https://rust-lang.github.io/rustup/overrides.html#the-toolchain-file), install the required toolchain and use it when you type `cargo` commands.
-# Run wasm-pack to build the WebAssembly module in release mode
-yarn wasm-prod
+3. Prepare environment variables which configure Nextclade build-time settings (once)
-# Alternatively, to build in debug mode (much much slower to run, but more debug info)
-# yarn wasm-dev
+ ```bash
+ # [once] Prepare dotenv file with default values
+ cp .env.example .env
+ ```
-```
+4. Install other required tools (once)
-The WebAssembly module and accompanying Typescript code should have been generated into `packages_rs/nextclade-web/src/gen/`,
-and now the web app should be able to find it.
+ ```bash
+ cargo install wasm-pack
+ ```
-Let's build build and run the web app (it's convenient to do it in a separate terminal instance from WebAssembly module build):
+
+ 🍏 Extra requirements for macOS [click to expand]
-```bash
-# Build and run web app in dev mode
-cd packages_rs/nextclade-web
-yarn install
-yarn dev
-```
+ > For macOS, you will also have to install llvm:
+ >
+ > ```bash
+ > brew install llvm
+ > ```
+ >
+ > Furthermore, you will need to set the following environment variables before invoking `yarn wasm-prod`:
+ >
+ > ```bash
+ > export CC=/opt/homebrew/opt/llvm/bin/clang
+ > export AR=/opt/homebrew/opt/llvm/bin/llvm-ar
+ > ```
-Open `http://localhost:3000/` in the browser. Code changes should trigger rebuild and fast refresh of the app.
+
-The optimized production version of the web app can be built with
+5. Install NPM dependencies (once)
-```bash
-yarn prod:build
-yarn prod:serve
-```
+ ```bash
+ cd packages_rs/nextclade-web
+ yarn install
+ ```
+
+ > ⚠️ Nextclade uses `yarn` to manage NPM dependencies. While you could try `npm` or other tools instead, we don't support this.
+
+6. Build the WebAssembly module
+
+ ```bash
+ cd packages_rs/nextclade-web
+ yarn wasm-prod
+ ```
+
+ This step might take a lot of time. The WebAssembly module and accompanying Typescript code should have been generated into `packages_rs/nextclade-web/src/gen/`. The web application should be able to find it there.
+
+ Repeat this step every time you are touching Rust code.
+
+7. Build and serve the web app
+
+ We are going to run a development web server, which runs continuously (it does not yield terminal prompt until you stop it). It's convenient to do it in a separate terminal instance from WebAssembly module build to allow rebuilding the app and the module independently.
+
+ The development version can be built using:
+
+ ```bash
+ cd packages_rs/nextclade-web
+ yarn dev
+ ```
+
+ Open `http://localhost:3000/` in the browser. Typescript code changes should trigger rebuild and fast refresh of the app. If you rebuild the WebAssembly module (ina separate terminal instance), it should also pick up the changes automatically.
-Open `http://localhost:8080/` in the browser.
+ The optimized production version of the web app can be built and served with
-The resulting files should be available under `packages_rs/nextclade-web/.build/production/web`. They can be served by any static webserver or static file hosting (`https://clades.nextstrain.org` uses AWS S3 + Cloudfront). `yarn prod:serve` is just an example of a simple webserver that serves files locally.
+ ```bash
+ yarn prod:build
+ yarn prod:serve
+ ```
+
+ Open `http://localhost:8080/` in the browser.
+
+ The resulting HTML, CSS and JS files should be available under `packages_rs/nextclade-web/.build/production/web`.
+
+ Production build does not have automatic rebuild and reload. You need to do full rebuild.
+
+ The `yarn prod:serve` command runs Express underneath and it is just an example of a simple (also slow and insecure) local file web server. But they can be served by any static file web server or static file hosting service. The official deployment uses AWS S3 + Cloudfront.
### Linting (static analysis)
@@ -171,29 +277,45 @@ Clippy is configured in `clippy.toml` and in `.cargo/config.toml`.
For routine development, it is recommended to configure your text editor to see the Rust compiler and linter errors.
-##### In [VSCode](https://code.visualstudio.com/):
-
-Make sure you have ["Rust Analyzer" extension](https://marketplace.visualstudio.com/items?itemName=rust-lang.rust-analyzer) (and not deprecated "Rust" extension), and configure it to use clippy: hit Ctrl+Shit+P, then find "Preferences: Open user settings (JSON)", then add:
-
-```
-"rust-analyzer.check.command": "clippy",
-```
-
-Now the warnings and errors will be shown as yellow and red squiggles. If you mouse hover a squiggle, there will appear a tooltip with explanation and a link to even more details. Sometimes there will be a link in the bottom of the tooltip to apply a "Quick fix" for this particular mistake. And there is also a light bulb in the editor to do the same.
+
+ 💡 In VSCode [click to expand]
+
+(these instructions can go out of date with time, so make sure you check VSCode community for what's latest and greatest)
+
+> Make sure you have ["Rust Analyzer" extension](https://marketplace.visualstudio.com/items?itemName=rust-lang.rust-analyzer) (and not deprecated "Rust" extension), and configure it to use clippy: hit Ctrl+Shit+P, then find "Preferences: Open user settings (JSON)", then add:
+>
+> ```
+> "rust-analyzer.check.command": "clippy",
+> ```
+>
+> Now the warnings and errors will be shown as yellow and red squiggles. If you mouse hover a squiggle, there will appear a tooltip with explanation and a link to even more details. Sometimes there will be a link in the bottom of the tooltip to apply a "Quick fix" for this particular mistake. And there is also a light bulb in the editor to do the same.
+>
+> You can disable the pesky inline type hints (for all languages) by adding this to your preferences JSON:
+>
+> ```
+> "editor.parameterHints.enabled": false,
+> "editor.inlayHints.enabled": "off",
+> ```
+>
+> An extension ["Error lens"](https://marketplace.visualstudio.com/items?itemName=usernamehw.errorlens) allows to see error and warning text inline in the editor.
-You can disable the pesky inline type hints (for all languages) by adding this to your preferences JSON:
+
-```
-"editor.parameterHints.enabled": false,
-"editor.inlayHints.enabled": "off",
-```
+
+ 💡 In Jetbrains CLion [click to expand]
-An extension ["Error lens"](https://marketplace.visualstudio.com/items?itemName=usernamehw.errorlens) allows to see error and warning text inline in the editor.
+(these instructions can go out of date with time, so make sure you check Jetbrains docs for what's latest and greatest)
-##### In Intellij CLion:
-In main menu, "File | Settings | Languages & Frameworks | Rust | External Linters", set "External tool" to "Clippy" and check the checkbox "Run external linter to analyze code on the fly".
+> Install [Intellij Rust plugin](https://intellij-rust.github.io/).
+>
+> In main menu, "File | Settings | Languages & Frameworks | Rust | External Linters", set "External tool" to "Clippy" and check the checkbox "Run external linter to analyze code on the fly".
+>
+> You should now see red and yellow squiggles if there are problems. Mouse hover to read the message and recommendations.
+>
+> Install [Inspection Lens plugin](https://plugins.jetbrains.com/plugin/19678-inspection-lens) to see the messages inline in the code.
+
#### Linting Typescript and JavaScript
@@ -206,52 +328,113 @@ yarn lint
The `eslint` configuration is in `.eslintrc.js`. `tsc` configuration is in `tsconfig.json`.
+Modern text editors should be able to display ESLint warnings out of the box as soon as you install NPM dependencies (the `yarn install` command in the build steps). Refer to the documentation of you text editor if it does not.
+
### Formatting (code style)
+Rust:
+
```bash
cargo fmt --all
```
+Typescript:
+
```bash
cd packages_rs/nextclade-web
yarn format:fix
```
+## Maintenance
+
+### Continuous integration (CI)
+
+Nextclade build and deployment process is automated using GitHub Actions:
+
+- Nextclade Web build and deployment: [.github/workflows/web.yml](../../.github/workflows/web.yml)
+- Nextclade CLI build and GitHub releases: [.github/workflows/cli.yml](../../.github/workflows/cli.yml)
+- Nextclade CLI Bioconda release: [.github/workflows/bioconda.yml](../../.github/workflows/bioconda.yml)
+
+The workflows run on every pull request on GitHub and every push to a major branch.
+
+### Deployment environments
+
+Nextclade GitHub repository contains 3 major branches with special meaning: `master`, `staging` and `release`, each has a corresponding domain name for Nextclade Web. Nextclade built from one of these branches fetches datasets from the corresponding dataset deployment environment (See [Dataset server maintenance guide](https://github.com/nextstrain/nextclade_data/blob/master/docs/dataset-server-maintenance.md))
+
+Other branches are built in the context of GitHub pull requests. If you submit a pull request, then Vercel bot will automatically post a comment message with a URL to the preview deployment of Nextclade Web. After CLI GitHub Actions workflow finishes, you can find the resulting Nextclade CLI executables in the "Artifacts" section of the workflow.
+
+Here is a list of environments:
+
+| Nextclade repo branch | Nextclade Web domain name | Dataset server | Meaning |
+|-----------------------|----------------------------|--------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------|
+| release | clades.nextstrain.org | data.clades.nextstrain.org | Final release, targeting all end users |
+| staging | staging.nextstrain.org | data.staging.nextstrain.org | Staging release, for last-minute testing and fixes before a final release is made, to not block progress on master branch |
+| master | master.nextstrain.org | data.master.nextstrain.org | Main development branch - accumulates features and bug fixes from pull requests |
+| other branches | temporary domain on Vercel | branch with the same name in dataset GitHub repo if exists, otherwise data.master.nextstrain.org | Pull requests - development of new features and bug fixes |
+
+Preview versions of Nextclade Web built from pull requests will first try to fetch data from GitHub, from the branch with the same name in the [dataset GitHub repository](https://github.com/nextstrain/nextclade_data), if such branch exists. If not, the it will fetch from `master` environment. This is useful during development, when you need to modify both software and data: if you have branches with the same name in both repos, Nextclade Web will fetch the datasets from that branch.
+
+Nextclade CLI built from pull requests in Nextclade repository is always using `master` deployment.
+
+If you build Nextclade Web or Nextclade CLI locally, you can configure the data environment by setting `DATA_FULL_DOMAIN` variable in your local `.env` file. Note that despite the name, variable should contain fUll URL to the dataset server root. This is a build-time setting. You need to rebuild Nextclade every time you set it.
+
+For example, for Nextclade v3 the default setting (`master` environment) is:
+
+```
+DATA_FULL_DOMAIN=https://data.master.clades.nextstrain.org/v3
+```
+
+You can serve datasets locally and tell Nextclade to use your local server:
+
+```
+DATA_FULL_DOMAIN=http://localhost:3001
+```
+
+You can turn on fetching from the same branch from the dataset repo by setting:
+
+```
+DATA_TRY_GITHUB_BRANCH=1
+```
+
+If you are deploying your own Nextclade instance, although it might be tempting to fetch datasets from GitHub directly, without deploying them to a file server, this is not recommended, because your users will probably hit GitHub's usage limits. i.e. we don't recommend to enable this setting for your major branches and end-user releases.
+
+## Trying custom dataset server
+
+There are multiple ways to make Nextclade to use a custom dataset server instead of the default one. This is useful for local testing, when developing datasets or Nextclade software itself.
+In all cases you need to have a dataset server directory ready (contained datasets and all the required index files).
-## Trying custom datasets locally
+### Prepare and serve a dataset server directory locally
-### Prepare and serve datasets locally
+- Build a fresh dataset server directory as described in the [nextstrain/nextclade_data](https://github.com/nextstrain/nextclade_data) repo. At the time of writing it simply means to run `./scripts/rebuild` and to observe the `data_output/` directory created, containing the dataset files and associated index files
-- Build a fresh dataset directory as described in the [nextstrain/nextclade_data](https://github.com/nextstrain/nextclade_data) repo. At the time of writing it simply means to run `./scripts/rebuild` and to observe the `data_output/` created, containing the dataset files and associated index files
-- Serve datasets directory locally using any static file server. [CORS](https://developer.mozilla.org/en-US/docs/Web/HTTP/CORS) should be enabled on the server. For example, using `serve` package from NPM:
+- Serve the output directory locally using any static file server. [CORS](https://developer.mozilla.org/en-US/docs/Web/HTTP/CORS) should be enabled on the server. For example, using `serve` package from NPM:
```bash
- npx serve@latest --cors --listen=tcp://0.0.0.0:27722 data_output/
+ npx serve@latest --cors --listen=tcp://0.0.0.0:3001 data_output/
```
- In this example, files should be available at
- ```
- http://localhost:27722/
- ````
- in particular, you should be able to download `index_v2.json` using
+
+ Now you should be able to fetch dataset index file with `curl`:
```bash
- curl http://localhost:27722/index_v2.json
+ curl http://localhost:3001/index.json
```
- or when navigating to this address in the browser.
+ and to see some JSON when navigating to `http://localhost:3001/index.json` in a web browser.
### Temporarily use custom dataset server with Nextclade CLI
-Run the usual `dataset list` and dataset `get commands`, with an additional flag:
+Run the usual `dataset list` and `dataset get`, with an additional flag:
```
---server=http://localhost:27722
+--server=http://localhost:3001
```
-This will tell Nextclade to use the local server for dataset queries.
+This will tell Nextclade to use the local dataset server instead of the default one.
+
+See Nextclade CLI user documentation for more details about available command ine arguments. You can type type `nextclade --help` for help screen. Each subcommand has it's own help screen, e.g `nextclade dataset get --help`.
### Temporarily use custom dataset server with Nextclade Web
-Add the `dataset-server` URL parameter with value set to URL of the custom dataset server:
+To provide Nextclade with the alternative location of the dataset server, add the `dataset-server` URL parameter with value set to URL of the custom dataset server:
```url
https://clades.nextstrain.org?dataset-server=http://example.com
@@ -260,20 +443,29 @@ https://clades.nextstrain.org?dataset-server=http://example.com
Local URLs should also work:
```url
-https://clades.nextstrain.org?dataset-server=http://localhost:27722
+https://clades.nextstrain.org?dataset-server=http://localhost:3001
```
-> ⚠️The linked resources should be available for fetching by a web browser on the client machine. Make sure [Cross-Origin Resource Sharing (CORS)](https://developer.mozilla.org/en-US/docs/Web/HTTP/CORS) is enabled on your file server as well as that all required authentication (if any) is included into the file URL itself.
+Combining locally built Nextclade Web and local dataset server too:
-> ⚠️The URLs might get quite complex, so don't forget to [encode the special characters](https://en.wikipedia.org/wiki/Percent-encoding), to keep the URLs valid.
+```url
+https://localhost:3000?dataset-server=http://localhost:3001
+```
+
+This instructs Nextclade to disregard the default dataset server URL and fetch data and index files from this custom location instead.
+
+> ⚠️ Web browser should be able to reach the dataset server address provided. Additionally, make sure [Cross-Origin Resource Sharing (CORS)](https://developer.mozilla.org/en-US/docs/Web/HTTP/CORS) is enabled on your server as well as that all required authentication (if any) is included into the file URL itself.
+> ⚠️The URLs might get quite complex, so don't forget to [encode special characters](https://en.wikipedia.org/wiki/Percent-encoding), to keep the URLs valid.
+
+See Nextclade Web user documentation for more details about available URL parameters.
### Permanently configure Nextclade CLI and Nextclade Web to use custom dataset server
Open `.env` file in the root of the project (if you don't have it, create it based on `.env.example`) and set the `DATA_FULL_DOMAIN` variable to the address of your local dataset server. In the example above it would be:
```
-DATA_FULL_DOMAIN=http://localhost:27722
+DATA_FULL_DOMAIN=http://localhost:3001
```
Rebuild Nextclade CLI and it will use this address by default for all dataset requests (without need for the additional `--server` flag).
@@ -286,13 +478,12 @@ Any network location can be used, not only localhost.
The same mechanism is used during CI builds for master/staging/production environments, to ensure they use their corresponding dedicated dataset server.
-
## Maintenance
There are 2 release targets, which are released and versioned separately:
-- CLI (Nextclade CLI and Nextalign CLI are released together)
-- Web application
+- Nextclade CLI
+- Nextclade Web
### Versioning
@@ -318,5 +509,4 @@ Nextclade project tries hard to adhere to [Semantic Versioning 2.0.0](https://se
- Follow comments in the script `./scripts/releases` on how to install dependencies for this script.
- Run `./scripts/releases web `, where `bump_type` signifies by how much you want to increment the version. It should be one of: `major`, `minor`, `patch`, `rc`, `beta`, `alpha`. It is advised against releasing `rc`, `beta`, `alpha` to release environment.
-
If you want to deploy the same version to multiple environments, then release to one environment (on one branch) and then promote it to other environments: manually fast-forward other branch(es) to this commit and push.