Quick Links

Re: pg_waldump: support decoding of WAL inside tarfile

From:	Amul Sul <sulamul(at)gmail(dot)com>
To:	Robert Haas <robertmhaas(at)gmail(dot)com>
Cc:	Chao Li <li(dot)evan(dot)chao(at)gmail(dot)com>, Jakub Wartak <jakub(dot)wartak(at)enterprisedb(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject:	Re: pg_waldump: support decoding of WAL inside tarfile
Date:	2026-01-27 12:06:34
Message-ID:	CAAJ_b94=gtCeUKkGPUmPj_2SwHV+PiXQ8Mx-1RqfYj3pP3OwpA@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Mon, Jan 26, 2026 at 10:52 PM Robert Haas <robertmhaas(at)gmail(dot)com> wrote:
>
> On Fri, Jan 23, 2026 at 7:27 AM Amul Sul <sulamul(at)gmail(dot)com> wrote:
> > Another option I previously considered was adding the filtration logic
> > inside the archive streamer itself. However, since the very first read
> > is required to calculate the WAL segment size, the filter check cannot
> > be performed immediately. However, we could send a signal to the
> > archive streamer via privateInfo (e.g., a read_any_wal or
> > skip_wal_check boolean flag) to disable the filtration check until the
> > size is calculated. But that approach isn't very elegant; if the first
> > WAL page we read belongs to a segment we actually want to skip, we
> > would still have to run the filter check and handle the skip/removal
> > logic outside of the streamer (i.e., inside init_archive_reader()).
> > This would result in performing the same filtration check in two
> > different places.
>
> I mean, I don't really buy this logic. If the information added to
> privateInfo is "here's the LSN before which you can remove stuff," and
> it starts out initialized to 0/0, then the read of the first WAL page
> causes no problem at all, because nothing is before 0/0. After it gets
> updated to some non-zero value, the next call to
> astreamer_waldump_content() can handle discarding any data we don't
> need.
>
> IMHO, the best argument for keeping things are is that in some cases,
> that decision might result in a bit of delay in discarding old data,
> but I don't think that really matters. I think all that we care about
> is the peak memory utilization of an operation, and I don't think that
> an explicit signaling system should increase that at all.
>
> That said, I'm certainly willing to consider other ideas about how
> this can work. However, I feel strongly that the logic needs to be not
> only correct, but clear and well-explained. Setting cur_wal to NULL to
> make the astreamer skip without adequate comments doesn't meet that
> standard. Maybe with some better comments it's all right, but frankly
> I'm a bit skeptical. Right now, you're using whether or not cur_wal is
> NULL as a signal to skip data or not skip data. How is that better
> than passing down the LSN and TLI that you want to read next and
> letting the astreamer figure out what to do itself? It's a signaling
> mechanism either way, but it seems a lot easier to figure out whether
> we always keep the LSN and TLI updated properly than to figure out
> whether cur_wal is always NULL at exactly the right times.
>

I am not sure how to calculate the LSN of the very first page it reads
at this point, since we don't have the segment size yet. Furthermore,
the previous implementation using the segment number as the hash key
was incorrect, as extracting the segment number and TLI is impossible
without knowing the segment size, IIUC.

In the attached version, I am using the WAL segment name as the hash
key, which is much more straightforward. I have rewritten
read_archive_wal_page(), and it looks much cleaner than before. The
logic to discard irrelevant WAL files is still within
get_archive_wal_entry. I added an explanation for setting cur_wal to
NULL, which is now handled in the separate function I mentioned
previously.

Kindly have a look at the attached version; let me know if you are
still not happy with the current approach for filtering/discarding
irrelevant WAL segments. It isn't much different from the previous
version, but I have tried to keep it in a separate routine for better
code readability, with comments to make it easier to understand. I
also added a comment for ArchivedWALFile.

I have included your patch (removing WalSegSz) to the attached patch
set as 0001; all my subsequent patches are now bumped by one.

Regards,
Amul

Attachment	Content-Type	Size
v11-0001-Remove-file-level-global-WalSegSz.patch	application/octet-stream	4.6 KB
v11-0002-Refactor-pg_waldump-Move-some-declarations-to-ne.patch	application/octet-stream	2.2 KB
v11-0003-Refactor-pg_waldump-Separate-logic-used-to-calcu.patch	application/octet-stream	2.4 KB
v11-0004-Refactor-pg_waldump-Restructure-TAP-tests.patch	application/octet-stream	5.6 KB
v11-0005-pg_waldump-Add-support-for-archived-WAL-decoding.patch	application/octet-stream	39.3 KB
v11-0006-pg_waldump-Remove-the-restriction-on-the-order-o.patch	application/octet-stream	13.4 KB
v11-0007-pg_verifybackup-Delay-default-WAL-directory-prep.patch	application/octet-stream	1.7 KB
v11-0008-pg_verifybackup-Rename-the-wal-directory-switch-.patch	application/octet-stream	5.9 KB
v11-0009-pg_verifybackup-enabled-WAL-parsing-for-tar-form.patch	application/octet-stream	9.9 KB

In response to

Re: pg_waldump: support decoding of WAL inside tarfile at 2026-01-26 17:22:33 from Robert Haas

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Amul Sul	2026-01-27 12:23:19	Re: pg_waldump: support decoding of WAL inside tarfile
Previous Message	Jakub Wartak	2026-01-27 12:06:13	Re: pg_stat_io_histogram