[BUG] Cluster dead because detector shard stuck initializing #1235

Open
mvanderlee opened this issue Jul 24, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@mvanderlee

What is the bug?
Upgraded a cluster from 2.11.1 to 2.15.0 and the cluster is in red status because the .opensearch-sap-network-detectors-queries-000007 shard is stuck initializing.

{
    'current_node': {
        'attributes': {'shard_indexing_pressure_enabled': 'true'},
        'id': 'zLNHMkpeReSxfbVn99y1Sw',
        'name': 'opensearch-node1',
        'transport_address': '172.19.0.3:9300',
    },
    'current_state': 'initializing',
    'explanation': 'the shard is in the process of initializing on node [opensearch-node1], wait until initialization has completed',
    'index': '.opensearch-sap-network-detectors-queries-000007',
    'primary': True,
    'shard': 0,
    'unassigned_info': {
        'at': '2024-07-24T12:00:27.551Z',
        'last_allocation_status': 'throttled',
        'reason': 'CLUSTER_RECOVERED',
    },
}
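For anyone triaging a similar report, here is a minimal sketch of how output like the above can be condensed into a one-line status. The dict is copied from this report and trimmed to the fields used. The request body matches what the `_cluster/allocation/explain` API accepts. The helper names `explain_request_body` and `summarize` are hypothetical, chosen for illustration, and are not part of any OpenSearch client.

```python
import json

# Allocation-explain response copied from this report, trimmed to the fields used below.
explain = {
    'current_node': {'name': 'opensearch-node1'},
    'current_state': 'initializing',
    'index': '.opensearch-sap-network-detectors-queries-000007',
    'primary': True,
    'shard': 0,
    'unassigned_info': {
        'at': '2024-07-24T12:00:27.551Z',
        'last_allocation_status': 'throttled',
        'reason': 'CLUSTER_RECOVERED',
    },
}


def explain_request_body(index, shard, primary):
    """Build the JSON body for GET _cluster/allocation/explain (hypothetical helper)."""
    return json.dumps({"index": index, "shard": shard, "primary": primary})


def summarize(resp):
    """Condense an allocation-explain response into one line (hypothetical helper)."""
    kind = "primary" if resp["primary"] else "replica"
    info = resp.get("unassigned_info", {})
    node = resp.get("current_node", {}).get("name", "unassigned")
    return (
        f"{resp['index']}[{resp['shard']}] ({kind}) is {resp['current_state']} on {node}; "
        f"reason={info.get('reason')}, "
        f"last_allocation_status={info.get('last_allocation_status')}"
    )


print(explain_request_body(explain["index"], explain["shard"], explain["primary"]))
print(summarize(explain))
```

The summary only restates what the response already says (primary shard 0 initializing on `opensearch-node1` after `CLUSTER_RECOVERED`, last status `throttled`); it does not diagnose why initialization never finishes.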

How can one reproduce the bug?
Steps to reproduce the behavior:
No exact steps known. Roughly: create a cluster on 2.11.1, add detection rules, upgrade to 2.15.0, and observe the error.

What is the expected behavior?
For my cluster not to die because a stupid feature can't start.
If anomaly detection is broken, then only let that feature be broken, not my entire cluster!!!!!! FFS
Separate user indices from system indices. The fact that this isn't done, and that they are treated identically, is a super stupid decision.

What is your host/environment?

  • Docker container for 2.15

Do you have any screenshots?
If applicable, add screenshots to help explain your problem.

Do you have any additional context?
Add any other context about the problem.

@mvanderlee mvanderlee added bug Something isn't working untriaged labels Jul 24, 2024
@kaituo

kaituo commented Aug 7, 2024

@kaituo kaituo removed the untriaged label Aug 7, 2024
@mvanderlee
Author

@kaituo That's right.

I can't tell you the exact config because we've stopped using Detectors and created our own alerting system.

@kaituo

kaituo commented Aug 8, 2024

@opensearch-project/admin -- Can we please move this to https://github.com/opensearch-project/security-analytics ?

@prudhvigodithi prudhvigodithi transferred this issue from opensearch-project/anomaly-detection Aug 8, 2024
@dblock
Member

dblock commented Aug 12, 2024

[Catch All Triage - 1, 2, 3]

@dblock dblock removed the untriaged label Aug 12, 2024