Open
Description
pgo 4.5.0
We are getting an intermittent error on backups as follows:
time="2021-06-07T06:00:15Z" level=info msg="pgo-backrest starts"
time="2021-06-07T06:00:15Z" level=info msg="debug flag set to false"
time="2021-06-07T06:00:15Z" level=info msg="backrest backup command requested"
time="2021-06-07T06:00:15Z" level=info msg="backrest command will be executed for both local and s3 storage"
time="2021-06-07T06:00:15Z" level=info msg="command to execute is [pgbackrest backup --stanza=db --type=full --repo1-retention-full=10 --db-host=10.244.140.50 --db-path=/pgdata/retroelk-prod-kdca && pgbackrest backup --stanza=db --type=full --repo1-retention-full=10 --db-host=10.244.140.50 --db-path=/pgdata/retroelk-prod-kdca --repo1-type=s3 --no-repo1-s3-verify-tls]"
time="2021-06-07T06:00:15Z" level=info msg="command is pgbackrest backup --stanza=db --type=full --repo1-retention-full=10 --db-host=10.244.140.50 --db-path=/pgdata/retroelk-prod-kdca && pgbackrest backup --stanza=db --type=full --repo1-retention-full=10 --db-host=10.244.140.50 --db-path=/pgdata/retroelk-prod-kdca --repo1-type=s3 --no-repo1-s3-verify-tls "
time="2021-06-07T06:22:50Z" level=error msg="command terminated with exit code 56"
time="2021-06-07T06:22:50Z" level=info msg="output=[]"
time="2021-06-07T06:22:50Z" level=info msg="stderr=[WARN: unable to check pg-1: [UnknownError] remote-0 process on '10.244.140.50' terminated unexpectedly [255]: kex_exchange_identification: Connection closed by remote host\nERROR: [056]: unable to find primary cluster - cannot proceed\n]"
time="2021-06-07T06:22:50Z" level=error msg="command terminated with exit code 56"
This is from a scheduled backup job ^
When we run the backup manually it works.