Table of Contents generated with DocToc
- Troubleshoot a Manatee Shard in Distress.
- Manatee-adm
- Healthy Manatee
- Symptoms
- Useful Manatee Commands
Table of Contents
This guide will list the most common failure scenarios of Manatee and steps to resuscitate the Shard.
You should read the user guide first to ensure you understand how Manatee works. Some of the recovery steps could potentially be destructive and result in data loss if performed incorrectly.
The primary way an operator should be interacting with manatee is via the
manatee-adm
CLI. The main tool for assessing the cluster's overall state is
manatee-adm pg-status
.
The output of # manatee-adm pg-status
of a healthy manatee shard will look like
this:
ROLE PEER PG REPL SENT WRITE FLUSH REPLAY LAG
primary bb348824 ok sync 0/671FEDF8 0/671FEDF8 0/671FEDF8 0/671FE9F0 -
sync a376df2b ok async 0/671FEDF8 0/671FEDF8 0/671FEDF8 0/671FE9F0 -
async 09957297 ok - - - - - 0m00s
There are 3 peers in this shard, a primary
, sync
, and async
. The "PG"
column indicates that Postgres is online.
If anything is wrong with the cluster, errors or warnings will be printed below this output. Errors indicate problems that likely affect service, while warnings indicate problems that should not directly be impacting service at the moment.
The REPL column (and subsequent columns) describes the replication state of each peer to its immediate downstream peer. For example, the primary's REPL column represents the state of replication to the synchronous peer. A primary peer with "-" in the REPL column means replication is offline between the primary and the sync. A sync peer with "-" in the REPL field means that replication is offline between the sync and the async.
A 3-node manatee shard is healthy only if there are 3 peers in the shard, all three peers are marked "ok" in the "PG" column, the primary's REPL is "sync", and the sync's REPL is "async". As mentioned above, the tool checks these conditions and prints out when something is wrong.
"manatee-adm pg-status" should automatically identify problems with the cluster. In previous versions of Manatee, operators were required to use the "manatee-adm status" command and verify these conditions manually. The rest of this guide describes the process of identifying problems based on the "manatee-adm status" output, but it should no longer be required on newer versions of Manatee.
Here are some commonly seen outputs of manatee-adm status
when the shard is
unhealthy. Before you start: click here.
{
"sdc": {
"primary": {
"id": "10.99.99.16:5432:12345",
"ip": "10.99.99.16",
"pgUrl": "tcp://postgres@10.99.99.16:5432/postgres",
"zoneId": "ebe96dfb-2533-459b-964b-a5ad7fb3b566",
"backupUrl": "http://10.99.99.16:12345",
"online": true,
"repl": {}
}
}
}
This assumes you have an HA manatee shard setup. Some installations of SDC such
as COAL by default only come with 1 manatee peer. In those cases, this is the
expected output of # manatee-adm status
.
- Check that no manatee peers are out of disk space.
- Find the set of manatee peers in the DC. If this returns only 1 peer, then you do not have an HA manatee setup.
- Log on to a non primary peer.
- Restart the manatee-sitter process.
# svcadm disable manatee-sitter; svcadm enable manatee-sitter
- Check via
# manatee-adm status
that you see 2 peers with replication information. If this peer is unable replicate e.g. the sitter service goes back into maintenance, postgres dumps core, or you still don't see replication state, Then you'll need to rebuild this peer via# manatee-adm rebuild
- Continue to here
{
"sdc": {
"primary": {
"id": "10.1.0.140:5432:12345",
"ip": "10.1.0.140",
"pgUrl": "tcp://postgres@10.1.0.140:5432/postgres",
"zoneId": "45d023a7-116e-44c9-ae4e-28c7b10202ce",
"backupUrl": "http://10.1.0.140:12345",
"online": true,
"repl": {
"pid": 97110,
"usesysid": 10,
"usename": "postgres",
"application_name": "tcp://postgres@10.1.0.144:5432/postgres",
"client_addr": "10.1.0.144",
"client_hostname": "",
"client_port": 57443,
"backend_start": "2014-07-24T17:49:05.149Z",
"state": "streaming",
"sent_location": "F2/B4489EB8",
"write_location": "F2/B4489EB8",
"flush_location": "F2/B4489EB8",
"replay_location": "F2/B4488B78",
"sync_priority": 1,
"sync_state": "sync"
}
},
"sync": {
"id": "10.1.0.144:5432:12345",
"ip": "10.1.0.144",
"pgUrl": "tcp://postgres@10.1.0.144:5432/postgres",
"zoneId": "72e76247-7e13-40e2-8c44-6f47a2b37829",
"backupUrl": "http://10.1.0.144:12345",
"online": true,
"repl": {}
}
}
}
- Check that no manatee peers are out of disk space.
- Find the set of manatee peers in the DC and verify that an async peer exists.
You should see 3 instances -- the missing async is the instance that is missing from
# manatee-adm status
- Log on to the async peer.
- Restart the manatee-sitter process.
# svcadm disable manatee-sitter; svcadm enable manatee-sitter
- Check via
# manatee-adm status
that you see 3 peers with replication information. If this peer is unable replicate e.g. the sitter service goes back into maintenance, postgres dumps core, or you still don't see replication state, Then you'll need to rebuild this peer via# manatee-adm rebuild
{
"sdc": {}
}
- Check that no manatee peers are out of disk space.
- Find the set of manatee peers in the DC. If this returns nothing, then no manatee peers are deployed.
- Log on to any manatee peer.
- Check the manatee state:
[root@b35e12da (postgres) ~]$ manatee-adm zk-state | json
{
"generation": 1,
"primary": {
"id": "10.77.77.8:5432:12345",
"ip": "10.77.77.8",
"pgUrl": "tcp://postgres@10.77.77.8:5432/postgres",
"zoneId": "b35e12da-7359-4799-a3f7-8d3906f9160f",
"backupUrl": "http://10.77.77.8:12345"
},
"sync": {
"id": "10.77.77.20:5432:12345",
"zoneId": "8433685b-a1c0-4b19-8cad-c2f2e65dad1f",
"ip": "10.77.77.20",
"pgUrl": "tcp://postgres@10.77.77.20:5432/postgres",
"backupUrl": "http://10.77.77.20:12345"
},
"async": [
{
"id": "10.77.77.7:5432:12345",
"zoneId": "3dc65ac3-2977-40da-a948-3c72b9359884",
"ip": "10.77.77.7",
"pgUrl": "tcp://postgres@10.77.77.7:5432/postgres",
"backupUrl": "http://10.77.77.7:12345"
}
],
"initWal": "0/4405D30"
}
- Log on to the mantaee primary.
- Make sure that the
manatee-sitter
process is running. If not, start it. - Check
# manatee-adm status
and ensure that the primary is online. - Log on to the mantaee sync and make sure it is running.
- Log on to asyncs and make sure they are running.
- You can check the set of manatee peers that are connected to zk with the
# manatee-adm zk-active
command:
[root@b35e12da (postgres) ~]$ manatee-adm zk-active | json
{
"10.77.77.7:5432:12345-0000000164": {
"zoneId": "3dc65ac3-2977-40da-a948-3c72b9359884",
"ip": "10.77.77.7",
"pgUrl": "tcp://postgres@10.77.77.7:5432/postgres",
"backupUrl": "http://10.77.77.7:12345"
},
"10.77.77.20:5432:12345-0000000162": {
"zoneId": "8433685b-a1c0-4b19-8cad-c2f2e65dad1f",
"ip": "10.77.77.20",
"pgUrl": "tcp://postgres@10.77.77.20:5432/postgres",
"backupUrl": "http://10.77.77.20:12345"
},
"10.77.77.8:5432:12345-0000000163": {
"zoneId": "b35e12da-7359-4799-a3f7-8d3906f9160f",
"ip": "10.77.77.8",
"pgUrl": "tcp://postgres@10.77.77.8:5432/postgres",
"backupUrl": "http://10.77.77.8:12345"
}
}
{
"sdc": {
"primary": {
"id": "10.1.0.140:5432:12345",
"ip": "10.1.0.140",
"pgUrl": "tcp://postgres@10.1.0.140:5432/postgres",
"zoneId": "45d023a7-116e-44c9-ae4e-28c7b10202ce",
"backupUrl": "http://10.1.0.140:12345",
"online": true,
"repl": {
"pid": 97110,
"usesysid": 10,
"usename": "postgres",
"application_name": "tcp://postgres@10.1.0.144:5432/postgres",
"client_addr": "10.1.0.144",
"client_hostname": "",
"client_port": 57443,
"backend_start": "2014-07-24T17:49:05.149Z",
"state": "streaming",
"sent_location": "F2/B4489EB8",
"write_location": "F2/B4489EB8",
"flush_location": "F2/B4489EB8",
"replay_location": "F2/B4488B78",
"sync_priority": 1,
"sync_state": "sync"
}
},
"sync": {
"id": "10.1.0.144:5432:12345",
"ip": "10.1.0.144",
"pgUrl": "tcp://postgres@10.1.0.144:5432/postgres",
"zoneId": "72e76247-7e13-40e2-8c44-6f47a2b37829",
"backupUrl": "http://10.1.0.144:12345",
"online": true,
"repl": {}
},
"async": {
"id": "10.1.0.139:5432:12345",
"ip": "10.1.0.139",
"pgUrl": "tcp://postgres@10.1.0.139:5432/postgres",
"zoneId": "852a2e33-93f0-4052-b2e4-c61ea6c8b0fd",
"backupUrl": "http://10.1.0.139:12345",
"online": true,
"repl": {},
"lag": {
"time_lag": null
}
}
}
}
- Check that no manatee peers are out of disk space.
- Log on to the async.
- Rebuild via
# manatee-adm rebuild
- Depending on the output of
# manatee-adm status
at this point, you'll want to follow the steps described in the other troubleshooting sections of this document.
{
"sdc": {
"primary": {
"id": "10.1.0.140:5432:12345",
"ip": "10.1.0.140",
"pgUrl": "tcp://postgres@10.1.0.140:5432/postgres",
"zoneId": "45d023a7-116e-44c9-ae4e-28c7b10202ce",
"backupUrl": "http://10.1.0.140:12345",
"online": true,
"repl": {
"pid": 97110,
"usesysid": 10,
"usename": "postgres",
"application_name": "tcp://postgres@10.1.0.144:5432/postgres",
"client_addr": "10.1.0.144",
"client_hostname": "",
"client_port": 57443,
"backend_start": "2014-07-24T17:49:05.149Z",
"state": "streaming",
"sent_location": "F2/B4489EB8",
"write_location": "F2/B4489EB8",
"flush_location": "F2/B4489EB8",
"replay_location": "F2/B4488B78",
"sync_priority": 1,
"sync_state": "sync"
}
},
"sync": {
"id": "10.1.0.144:5432:12345",
"ip": "10.1.0.144",
"pgUrl": "tcp://postgres@10.1.0.144:5432/postgres",
"zoneId": "72e76247-7e13-40e2-8c44-6f47a2b37829",
"backupUrl": "http://10.1.0.144:12345",
"online": true,
"repl": {}
},
"deposed": {
"id": "10.1.0.139:5432:12345",
"ip": "10.1.0.139",
"pgUrl": "tcp://postgres@10.1.0.139:5432/postgres",
"zoneId": "852a2e33-93f0-4052-b2e4-c61ea6c8b0fd",
"backupUrl": "http://10.1.0.139:12345"
}
}
}
- Log on to the deposed peer.
- Rebuild via
# manatee-adm rebuild
, follow the prompts.
Another really common scenario we've seen in the past is where the manatee zones runs out of space.
The symptoms are usually:
- You can't login to the zone at all.
[Connected to zone 'e3ab01c5-d5d0-423c-97de-2c71183302d9' pts/2] No utmpx entry. You must exec "login" from the lowest level "shell". [Connection to zone 'e3ab01c5-d5d0-423c-97de-2c71183302d9' pts/2 closed]
- Manatee logs show errors with
ENOSPC
[2014-05-16T14:51:32.957Z] TRACE: manatee-sitter/ZKPlus/10377 on 5878969f-de0f-4538-b883-bdf416888136 (/opt/smartdc/manatee/node_modules/manatee/node_modules/zkplus/lib/client.js:361): mkdirp: completed Uncaught Error: ENOSPC, no space left on device FROM Object.fs.writeSync (fs.js:528:18) SyncWriteStream.write (fs.js:1740:6) Logger._emit (/opt/smartdc/manatee/node_modules/manatee/node_modules/bunyan/lib/bunyan.js:742:22) Logger.debug (/opt/smartdc/manatee/node_modu
You can check that the zone is indeed out of memory by checking the space left
on the zone's ZFS dataset in the GZ. Here we assume the manatee zone's uuid is
e3ab01c5-d5d0-423c-97de-2c71183302d9
[root@headnode (emy-10) ~]# zfs list | grep e3ab01c5-d5d0-423c-97de-2c71183302d9
zones/cores/e3ab01c5-d5d0-423c-97de-2c71183302d9 138M 99.9G 138M /zones/e3ab01c5-d5d0-423c-97de-2c71183302d9/cores
zones/e3ab01c5-d5d0-423c-97de-2c71183302d9 302M 0 986M /zones/e3ab01c5-d5d0-423c-97de-2c71183302d9
zones/e3ab01c5-d5d0-423c-97de-2c71183302d9/data 66.5M 0 31K /zones/e3ab01c5-d5d0-423c-97de-2c71183302d9/data
zones/e3ab01c5-d5d0-423c-97de-2c71183302d9/data/manatee 66.5M 0 42.0M /manatee/pg
Notice that the zone's delegated dataset has no more space.
- Increase the zone's ZFS dataset. In the GZ, run:
# zfs set quota=1G zones/e3ab01c5-d5d0-423c-97de-2c71183302d9
Verify there's free space in the dataset.
[root@headnode (emy-10) ~]# zfs list | grep e3ab01c5-d5d0-423c-97de-2c71183302d9 zones/cores/e3ab01c5-d5d0-423c-97de-2c71183302d9 138M 99.9G 138M /zones/e3ab01c5-d5d0-423c-97de-2c71183302d9/cores zones/e3ab01c5-d5d0-423c-97de-2c71183302d9 302M 722M 986M /zones/e3ab01c5-d5d0-423c-97de-2c71183302d9 zones/e3ab01c5-d5d0-423c-97de-2c71183302d9/data 66.5M 722M 31K /zones/e3ab01c5-d5d0-423c-97de-2c71183302d9/data zones/e3ab01c5-d5d0-423c-97de-2c71183302d9/data/manatee 66.5M 722M 42.0M /manatee/pg
You should be able to login to the zone at this point.
- Update the manatee service's quota in SAPI. Once you've made the local changes to the ZFS quota, you'll want to update the manatee service in SAPI such that future manatee zones will be deployed with the quota you've set. You can do this on the HN via sapiadm. In this example the quota size is set to 500G.
# sapiadm update $(sdc-sapi services?name=manatee | json -Ha uuid) params.quota=500
This lists the manatee peers in each DC by zone_uuid, alias, CN_uuid, zone_ip
.
From the headnode, run:
[root@headnode (staging-1) ~]# sdc-sapi /instances?service_uuid=$(sdc-sapi /services?name=manatee | json -Ha uuid) | json -Ha uuid | xargs -n1 sdc-vmadm get | json -ga uuid alias server_uuid nics.0.ip
4243db81-4453-42b3-a241-f26395712d19 manatee2 445aab6c-3048-11e3-9816-002590c3f3bc 172.25.3.65
8372b732-007c-400c-9642-9eb63d169cf2 manatee0 cf4414d0-3047-11e3-8545-002590c3f2d4 172.25.3.16
cce218f8-6ad9-45c6-bd98-d9d0b840b56a manatee1 aac3c402-3047-11e3-b451-002590c57864 172.25.3.59
This lists the manatee peers in each DC in Manta. If you have a multi-dc deployment, this will not find x-dc peers. From the headnode, run:
[root@headnode (staging-1) ~]# manta-adm show postgres
SERVICE SH ZONENAME GZ ADMIN IP
postgres 1 70d44638-f4fb-4cbd-8611-0f7c83d8502f 172.25.3.38
postgres 2 a5223321-600b-43eb-b66d-ebc0ab500046 172.25.3.38
postgres 3 ef318383-c5fb-4381-8416-a1636d8efa95 172.25.3.38