Quick Links

Re: Add pg_stat_vfdcache view for VFD cache statistics

From:	Jakub Wartak <jakub(dot)wartak(at)enterprisedb(dot)com>
To:	KAZAR Ayoub <ma_kazar(at)esi(dot)dz>
Cc:	Pg Hackers <pgsql-hackers(at)postgresql(dot)org>, "tomas(at)vondra(dot)me" <tomas(at)vondra(dot)me>
Subject:	Re: Add pg_stat_vfdcache view for VFD cache statistics
Date:	2026-03-23 11:22:59
Message-ID:	CAKZiRmzs99+q55pgBJ0-RHxL-z=ZRD-22cGR2w9jscOyxY-vfA@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Sat, Mar 21, 2026 at 5:59 PM KAZAR Ayoub <ma_kazar(at)esi(dot)dz> wrote:
>
> Hello hackers,
>
> This comes from Tomas's patch idea from his website[1], i thought this patch makes sense to have.
>
> PostgreSQL's virtual file descriptor (VFD) maintains a
> per-backend cache of open file descriptors, bounded by
> max_files_per_process (default 1000). When the cache is full, the
> least-recently-used entry is evicted so its OS fd is closed, so a new
> file can be opened. On the next access to that file, open() must be
> called again, incurring a syscall that a larger cache would have
> avoided.
>
> A trivial example is with partitioned tables: a table with 1500
> partitions requires even more than 1500 file descriptors per full scan (main
> fork, vm ...), which is more than the default limit, causing potential evictions and reopens.
>
> The problem is well-understood and the fix is straightforward: raise
> max_files_per_process. Tomas showed a 4-5x throughput
> improvement in [1] sometimes, on my end i see something less than that, depending on the query itself, but we get the idea.
>
> AFAIK there is currently no way from inside PostgreSQL to know whether fd cache pressure is occurring.
>
> Implementation is trivial, because the VFD cache is strictly per-backend, the counters are also
> per-backend and require no shared memory or locking. Three macros (pgstat_count_vfd_hit/miss/eviction) update fields in PendingVfdCacheStats directly from fd.c.
>
> I find this a bit useful, I would love to hear about anyone's thoughts whether this is useful or not.

Hi,

My $0.02, for that for that to being useful it would need to allow viewing
global vfd cache picture (across all backends), not just from *current* backend.
Applicaiton wouldn't call this function anyway, because they would have to be
modified.

In order to get that you technically should collect the hits/misses in local
pending pgstat io area (see e.g. pgstat_io or simpler pgstat_bgwriter/
checkpointer) like you do already with PendingVfdCacheStats, but then copy them
to shared memory pgstat area (with some LWLock* protection) that would be
queryable.

-J.

In response to

Add pg_stat_vfdcache view for VFD cache statistics at 2026-03-21 16:58:36 from KAZAR Ayoub

Responses

Re: Add pg_stat_vfdcache view for VFD cache statistics at 2026-03-23 12:35:14 from David Geier

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Jakub Wartak	2026-03-23 11:29:50	Re: log XLogPrefetch stats at end of recovery
Previous Message	Ranier Vilela	2026-03-23 11:12:08	Re: Avoid multiple calls to memcpy (src/backend/access/index/genam.c)