Quick Links

Re: [PING] [PATCH v2] parallel pg_restore: avoid disk seeks when jumping short distance forward

From:	Dimitrios Apostolou <jimis(at)gmx(dot)net>
To:	Nathan Bossart <nathandbossart(at)gmail(dot)com>
Cc:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-hackers(at)lists(dot)postgresql(dot)org
Subject:	Re: [PING] [PATCH v2] parallel pg_restore: avoid disk seeks when jumping short distance forward
Date:	2025-06-11 23:25:00
Message-ID:	50d0e587-3c6c-fec8-4937-efee4a59a6cf@gmx.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Wed, 11 Jun 2025, Nathan Bossart wrote:

> On Wed, Jun 11, 2025 at 12:32:58AM +0200, Dimitrios Apostolou wrote:
>> what read-seek pattern do you see on the system call level (as shown by
>> strace)? In pg_restore it was a constant loop of read(4K)-lseek(8-16K).
>
> For fseeko(), sizes less than 4096 produce a repeating pattern of read()
> calls followed by approximately (4096 / size) lseek() calls. For greater
> sizes, it's just a stream of lseek().

This is the opposite of what the link you shared before describes, so
maybe glibc has changed its behaviour to improve performance.

Anyway, the fact that fseek(>4096) produces a stream of lseek()s, means
that most likely no I/O is happening. You need to issue a getc() after
each fseeko(), like pg_restore is doing.

Dimitris

In response to

Re: [PING] [PATCH v2] parallel pg_restore: avoid disk seeks when jumping short distance forward at 2025-06-11 21:14:53 from Nathan Bossart

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Noboru Saito	2025-06-11 23:49:11	Re: [PATCH] Proposal: Improvements to PDF stylesheet and table column widths
Previous Message	Tatsuo Ishii	2025-06-11 22:52:35	Re: Add RESPECT/IGNORE NULLS and FROM FIRST/LAST options