Re: BUG #16662: pgbench: error: client 418 script 0 aborted in command 5 query 0: ERROR: invalid page in block 4830

From: 两个孩子的爹 <1726002692(at)qq(dot)com>
To: Michael Paquier <michael(at)paquier(dot)xyz>, pgsql-bugs <pgsql-bugs(at)lists(dot)postgresql(dot)org>
Subject: Re: BUG #16662: pgbench: error: client 418 script 0 aborted in command 5 query 0: ERROR: invalid page in block 4830
Date: 2020-10-09 01:53:17
Message-ID: tencent_CEA7C42EAB1412828B93F459A9706E441006@qq.com
Lists: pgsql-bugs

When huge_pages is set to off or try, no block corruption occurs during the pgbench stress test. When it is set to on, the problem occurs.

sysctl -p
vm.nr_hugepages = 9000
vm.swappiness = 0
vm.overcommit_memory = 2
vm.overcommit_ratio = 98
vm.min_free_kbytes = 1024000
vm.dirty_background_ratio = 10
vm.dirty_ratio = 95
vm.vfs_cache_pressure = 150
fs.file-max = 6815744
fs.aio-max-nr = 1048576
net.core.rmem_max = 16777216
net.core.wmem_max = 16777216
net.core.wmem_default = 8388608
net.core.rmem_default = 8388608
net.ipv4.tcp_syncookies = 1
net.ipv4.tcp_tw_reuse = 1
net.ipv4.tcp_tw_recycle = 1
net.ipv4.tcp_fin_timeout = 30
net.ipv4.tcp_keepalive_time = 1200
net.ipv4.ip_local_port_range = 20000 60999
net.ipv4.tcp_max_syn_backlog = 8192
net.ipv4.tcp_max_tw_buckets = 5000
net.ipv4.conf.all.rp_filter = 0
net.ipv4.conf.all.arp_filter = 0
net.ipv4.conf.default.rp_filter = 0
net.ipv4.conf.default.arp_filter = 0
net.ipv4.conf.lo.rp_filter = 0
net.ipv4.conf.lo.arp_filter = 0
kernel.sem = 250 32000 100 128
kernel.shmmni = 4906
kernel.shmall = 41231686041
kernel.shmmax = 25769803776
kernel.sysrq = 0
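
As a rough cross-check of the settings above, the following sketch computes how much memory the huge page pool reserves and compares it with kernel.shmmax. The 2 MiB huge page size is an assumption (the usual x86_64 default; the real value is the Hugepagesize line in /proc/meminfo), and the helper names are made up for illustration. Whether the pool covers the server's shared memory request depends on shared_buffers, which is not stated in this message.

```python
# Values copied from the sysctl output above.
NR_HUGEPAGES = 9000
SHMMAX_BYTES = 25769803776

# Assumption: 2 MiB huge pages. Check Hugepagesize in /proc/meminfo
# on the actual host, since other sizes are possible.
HUGEPAGE_SIZE_BYTES = 2 * 1024 * 1024


def hugepage_pool_bytes(nr_pages: int, page_size: int) -> int:
    """Total memory reserved by the huge page pool, in bytes."""
    return nr_pages * page_size


def to_gib(n_bytes: int) -> float:
    """Convert bytes to GiB."""
    return n_bytes / 1024**3


pool = hugepage_pool_bytes(NR_HUGEPAGES, HUGEPAGE_SIZE_BYTES)
print(f"huge page pool: {to_gib(pool):.2f} GiB")
print(f"kernel.shmmax:  {to_gib(SHMMAX_BYTES):.2f} GiB")
```

Under the 2 MiB assumption the pool is roughly 17.6 GiB, below the 24 GiB allowed by kernel.shmmax.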

------------------ Original ------------------
From: "Michael Paquier" <michael(at)paquier(dot)xyz>;
Date: Fri, Oct 9, 2020 08:32 AM
To: "两个孩子的爹" <1726002692(at)qq(dot)com>; "pgsql-bugs" <pgsql-bugs(at)lists(dot)postgresql(dot)org>;

Subject: Re: BUG #16662: pgbench: error: client 418 script 0 aborted in command 5 query 0: ERROR: invalid page in block 4830

On Thu, Oct 08, 2020 at 03:49:58PM +0000, PG Bug reporting form wrote:
> During the pressure test using pgbench, the following error occurred, but
> the object with oid=16396 was queried through pg_class, and it did not
> exist. Is this a bug?

Unlikely one in Postgres itself, I would recommend to be very careful
with this instance :(

A data checksum failure, as the one you are seeing here, means that an
8k page of a relation file that Postgres has flushed out to disk in
the past has been loaded back with some unexpected data.  This means
that a source external to Postgres has changed this data.  I have seen
this class of failures with problems involving either the kernel, the
file system, the hardware, or even some layer in charge of the host
virtualization, if your host is a VM of course.  So something has
likely managed the flush request thought as completed by Postgres in a
non-durable way, or something could have directly changed the on-disk
data with the flush request actually done correctly, which is even a
worse problem.  What this error tells is that the problem does not
come from Postgres itself.
--
Michael
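
The reasoning in the quoted reply (a page was written with a checksum, something outside the writer changed it, and the check fails on read) can be sketched as follows. This is only an illustration: it uses CRC-32 from Python's zlib as a stand-in and a made-up page layout, not PostgreSQL's actual checksum algorithm or page format. The function names are hypothetical.

```python
import zlib

PAGE_SIZE = 8192      # 8k page, as mentioned in the message
CHECKSUM_BYTES = 4    # illustrative layout: checksum stored in the first 4 bytes


def write_page(payload: bytes) -> bytes:
    """Build a page: checksum of the body followed by the zero-padded body."""
    body_size = PAGE_SIZE - CHECKSUM_BYTES
    if len(payload) > body_size:
        raise ValueError("payload does not fit in one page")
    body = payload.ljust(body_size, b"\x00")
    checksum = zlib.crc32(body)  # stand-in checksum, NOT PostgreSQL's algorithm
    return checksum.to_bytes(CHECKSUM_BYTES, "big") + body


def page_is_valid(page: bytes) -> bool:
    """Recompute the checksum on read and compare it with the stored one."""
    stored = int.from_bytes(page[:CHECKSUM_BYTES], "big")
    return zlib.crc32(page[CHECKSUM_BYTES:]) == stored


# The writer flushes a page; at this point the check passes.
page = write_page(b"pgbench_accounts row data")
print(page_is_valid(page))

# Something outside the writer flips one byte of the stored page.
corrupted = bytearray(page)
corrupted[100] ^= 0xFF
print(page_is_valid(bytes(corrupted)))
```

The first check succeeds and the second fails, because the stored checksum no longer matches the page contents. The writer did nothing wrong in either case, which is the point of the reply.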
