[UA] Low hanging fruit to improve reindexing performance #201605

jloleysens · 2024-11-25T14:19:26Z

Today when upgrading indices through reindexing we make the following requests to (1) create an index and (2) request a reindex:

Index creation request

      createIndex = await esClient.indices.create({
        index: newIndexName,
        body: {
          settings,
          mappings,
        },
      });

Regardless of current index settings, create indices with:

number_of_replicas: 0 (docs)
refresh_interval: -1 (docs) turn off refresh since we don't expect to be servicing search on the new index at this time

After reindexing is done, before creating aliases, we can use the index update API to make the settings the same as the original index (or default)

Reindexing request

    const startReindexResponse = await esClient.reindex({
      refresh: true,
      wait_for_completion: false,
      body: {
        source: { index: indexName },
        dest: { index: reindexOp.attributes.newIndexName },
      },
    });

slices: auto - (docs) possibly less low hanging fruit but could speed up reindexing by handling (more) shards in parallel. "Indexing performance scales linearly across available resources with the number of slices"

This has a few potential issues we should consider:

No longer 1 task: 1 task that splits into N tasks. Completed tasks will by reported by the "parent" task returned from the original request. How much impact will this have on existing code? It appears that "cancel" logic will remain unchanged but the contents of the "reindex" task my be quite different.
"Reindexing from remote clusters does not support manual or automatic slicing." - will this be an issue for UA?

Resources

Tune for indexing speed

The text was updated successfully, but these errors were encountered:

elasticmachine · 2024-11-25T14:22:30Z

Pinging @elastic/kibana-core (Team:Core)

botelastic bot added the needs-team Issues missing a team label label Nov 25, 2024

jloleysens removed the needs-team Issues missing a team label label Nov 25, 2024

botelastic bot added the needs-team Issues missing a team label label Nov 25, 2024

jloleysens added Team:Core Core services & architecture: plugins, logging, config, saved objects, http, ES client, i18n, etc Feature:Upgrade Assistant labels Nov 25, 2024

botelastic bot removed the needs-team Issues missing a team label label Nov 25, 2024

afharo self-assigned this Dec 11, 2024

jloleysens mentioned this issue Dec 18, 2024

Add action to create index from a source index elastic/elasticsearch#118890

Merged

afharo linked a pull request Dec 20, 2024 that will close this issue

[Upgrade Assistant] Reindexing optimizations #205055

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[UA] Low hanging fruit to improve reindexing performance #201605

[UA] Low hanging fruit to improve reindexing performance #201605

jloleysens commented Nov 25, 2024 •

edited

Loading

elasticmachine commented Nov 25, 2024

[UA] Low hanging fruit to improve reindexing performance #201605

[UA] Low hanging fruit to improve reindexing performance #201605

Comments

jloleysens commented Nov 25, 2024 • edited Loading

Index creation request

Reindexing request

Resources

elasticmachine commented Nov 25, 2024

jloleysens commented Nov 25, 2024 •

edited

Loading