Re: Vacuum statistics

From: Андрей Зубков <zubkov(at)moonset(dot)ru>
To: Alena Rybakina <lena(dot)ribackina(at)yandex(dot)ru>, Andrey Borodin <x4mmm(at)yandex-team(dot)ru>
Cc: Andrei Lepikhov <lepihov(at)gmail(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>, Alexander Korotkov <aekorotkov(at)gmail(dot)com>, Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>, Jim Nasby <jnasby(at)upgrade(dot)com>, Bertrand Drouvot <bertranddrouvot(dot)pg(at)gmail(dot)com>, Kirill Reshke <reshkekirill(at)gmail(dot)com>, Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com>, Melanie Plageman <melanieplageman(at)gmail(dot)com>, jian he <jian(dot)universality(at)gmail(dot)com>, Sami Imseih <samimseih(at)gmail(dot)com>, vignesh C <vignesh21(at)gmail(dot)com>, Ilia Evdokimov <ilya(dot)evdokimov(at)tantorlabs(dot)com>
Subject: Re: Vacuum statistics
Date: 2026-03-16 08:34:04
Message-ID: 54cbe53b-d8cf-495f-aa33-3501bec72780@moonset.ru
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Hi!

On 3/16/26 02:27, Alena Rybakina wrote:
>
> Hi Andrey, thank you for taking a look and for the review!
>
+1
>> But the column naming for rev_all_visible_pages and rev_all_frozen_pages
>> seems strange to me. I've skimmed the thread but could not figure out what
>> "rev_" stands for. Revisions? Revolutions? Reviews?
>
> We meant "revision", but after looking at our documentation I realized
> the confusion - the term is not explained there.
>
> I've renamed them to visible_pages_vm_cleared and frozen_pages_vm_cleared.
>
> Does this naming make more sense?
>
Really it was "revocations", but I'm agree with Andrey that naming isn't
clear. *_vm_cleared looks better, but talking about naming here "vm"
meaning is not clear. I think it will be understood as visibility map,
but it is "mark" really. Maybe "*_pages_marks_cleared" will be better?

Also a macro in pgstat.h:733 and pgstat.h:738 still holds "_rev_".

I think the docs description needs a little correction:

- visible_pages_vm_cleared. I think listing of possible DML operations
  is not needed here, also it seems a high rate of this counter has no
  direct relation to the index only scans because we can have very
  agressive vacuum on a table that will do the opposite. It will hold
  few pages without visibility marks constantly but with the cost
  of high visible_pages_vm_cleared rate. My proposition follows:

  Number of times the all-visible bit in the
   <link linkend="storage-vm">visibility map</link> was cleared for a
  pages of this table. The all-visible bit of a heap page is cleared
  every time backend process modifies a page previously marked
  all-visible by vacuum. Vacuum process must process page once again
  on the next run. A high rate of change of this counter means that
  vacuum should re-do its work on this table.

- frozen_pages_vm_cleared:

  Number of times the all-frozen bit in the
   <link linkend="storage-vm">visibility map</link> was cleared for a
  pages of this table.  The all-frozen bit of a heap page is cleared
  every time backend process modifies a page previously marked
  all-frozen by vacuum. Vacuum process must process page once again on
  the next freeze run on this table.

--

best regards, Andrei Zubkov

Postgres Professional

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Peter Smith 2026-03-16 08:38:21 Re: Skipping schema changes in publication
Previous Message Xuneng Zhou 2026-03-16 08:31:45 Re: Odd code around ginScanToDelete