Re: Recovery Verification

From: Ron Johnson <ronljohnsonjr(at)gmail(dot)com>
To: pgsql-general <pgsql-general(at)postgresql(dot)org>
Subject: Re: Recovery Verification
Date: 2026-02-24 13:40:38
Message-ID: CANzqJaDO2LOaAGiyvyQN54AJBs8BRZ1CkWXWTwPdG1WM7ttbBg@mail.gmail.com
Lists: pgsql-general

On Tue, Feb 24, 2026 at 1:12 AM <dolan(at)directdemocracysolutions(dot)com> wrote:

> Hi folks,
>
> When performing database recovery tests, after restoring from backup is
> complete, what SOPs and tools do you use to sample your database contents
> and verify the data looks correct? Do you have a list of queries to run?
> What metadata do you capture and where do you save the test report? Do you
> use automation? Is it built in-house, off-the-shelf, or open-source?
>
> Thanks, I'm not a DBA but no one else works at my company so any pointers
> would be appreciated.
>

If using pg_dump/pg_restore, then something like this is perfectly
adequate:
pg_dump ... $DB 2> backup_$(date +"%F_%T").log \
    || mail -s "ERROR: backup failed at $(date +"%F %T")" dolan(at)example(dot)com
pg_restore --exit-on-error ... $DB 2> restore_$(date +"%F_%T").log \
    || mail -s "ERROR: restore failed at $(date +"%F %T")" dolan(at)example(dot)com

If either step fails, you get an email and know which log file to check
to see what happened.
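
For what it's worth, the same pattern can be wrapped in a small shell
function so that both steps share the logging and alerting logic. This is
only a sketch: run_logged, notify_failure and LOG_DIR are made-up names, and
notify_failure just prints to stderr here, so you would swap in mail or
whatever alerting you already use.

```shell
#!/bin/sh
# Hypothetical helper (not from any PostgreSQL tool): run a command, capture
# its stderr in a timestamped log file, and raise an alert if it fails.

LOG_DIR="${LOG_DIR:-.}"

# Placeholder alert hook. $1 is the subject line, $2 is the log file path.
# Replace the body with e.g. a mail command for real use.
notify_failure() {
    echo "ALERT: $1 (see $2)" >&2
}

# Usage: run_logged LABEL COMMAND [ARGS...]
run_logged() {
    label=$1
    shift
    log="$LOG_DIR/${label}_$(date +%F_%T).log"
    if "$@" 2> "$log"; then
        return 0
    else
        status=$?    # exit status of the command that just failed
        notify_failure "ERROR: $label failed at $(date +'%F %T')" "$log"
        return "$status"
    fi
}
```

Assuming a custom-format dump, usage might look like
run_logged backup pg_dump -Fc -f "$DB.dump" "$DB"
followed by
run_logged restore pg_restore --exit-on-error -d "$RESTORE_DB" "$DB.dump"
where RESTORE_DB is a hypothetical variable naming the target database.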

My business users don't trust that, so I created a simple, fast, imperfect
script which I run at the same time as the backup:
BEGIN;
SELECT COUNT(*) FROM table_1;
SELECT COUNT(*) FROM table_2;
...
SELECT COUNT(*) FROM table_N;
COMMIT;

Run the same script on the restored database and compare its output with
the output from the source database. So far, the two log files have always
been identical.
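
A rough sketch of how the count script and the comparison could be
automated. The function names are hypothetical, and one thing differs from
the script above: the generated script opens the transaction at REPEATABLE
READ so that every COUNT(*) sees the same snapshot, whereas a plain BEGIN
uses the default READ COMMITTED. Table names are pasted in unquoted, so this
assumes simple identifiers.

```shell
#!/bin/sh
# Hypothetical helpers; names and log format are illustrative only.

# Print a row-count script for the table names given as arguments.
# Assumes plain, unquoted identifiers (no spaces or mixed case).
make_count_script() {
    echo "BEGIN ISOLATION LEVEL REPEATABLE READ;"
    for t in "$@"; do
        echo "SELECT '$t' AS table_name, COUNT(*) FROM $t;"
    done
    echo "COMMIT;"
}

# Compare two count logs. Prints a unified diff of any differences and
# returns non-zero if the files do not match.
compare_counts() {
    if diff -u "$1" "$2"; then
        echo "OK: row counts match"
    else
        echo "MISMATCH: row counts differ" >&2
        return 1
    fi
}
```

One way to use it would be to write the script once with
make_count_script table_1 table_2 > counts.sql
then run it against each database with something like
psql -X -q -At -f counts.sql -d "$DB" > source_counts.log
and finish with compare_counts source_counts.log restored_counts.log.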

--
Death to <Redacted>, and butter sauce.
Don't boil me, I'm still alive.
<Redacted> lobster!
