BUG #14171: Wrong FSM file after switching hot standby to master

From: timofeid(at)outlook(dot)com
To: pgsql-bugs(at)postgresql(dot)org
Subject: BUG #14171: Wrong FSM file after switching hot standby to master
Date: 2016-06-01 13:48:19
Message-ID: 20160601134819.30392.85324@wrigleys.postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

The following bug has been logged on the website:

Bug reference: 14171
Logged by: Timofei Dynikov
Email address: timofeid(at)outlook(dot)com
PostgreSQL version: 9.4.4
Operating system: RHEL 6.6
Description:

Hi

We have an installation of Postgres 9.4.4(PostgreSQL 9.4.4 on
x86_64-unknown-linux-gnu, compiled by gcc (GCC) 4.4.7 20120313 (Red Hat
4.4.7-11), 64-bit) on RHEL 6.6. DB installed on 2 nodes, 1 node is master,
another node is hot standby(streaming replication). DB is monitored by
pacеmaker pgsql agent.
Sometimes we have troubles with fsm-files. In case:
• master instance is switching to another node(failover or switchover) on
highload
• Hot standby node restart and run as master succesfully.
• After that we sometimes get FSM files pointing to non-existent blocks in
the table, so subsequent insert operations on such tables fails with error
message like following: 'could not read block XX in file "base/YYYY/ZZZZZ"'.
The issue can be resolved by either deleting of wrong FSM file (while
database is stopped) or performing VACUUM FULL on erroneous table. The
problem is usually observed on relatively small tables (e.g. up to 30
blocks) which are often cleaned out (having most rows deleted).
Does anybody already faced such behavior? What can be the root cause of such
problems? Are there any recommendations on how to avoid them?

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message David G. Johnston 2016-06-01 14:20:51 Re: BUG #14169: Incorrect merge join result in 9.5
Previous Message Michael Meskes 2016-06-01 13:43:00 Re: BUG #14167: ecpg parser cann't ignore code in #ifdef ?