Quick Links

回复：The same 2PC data maybe recovered twice

From:	蔡梦娟(玊于) <mengjuan(dot)cmj(at)alibaba-inc(dot)com>
To:	"pgsql-hackers" <pgsql-hackers(at)postgresql(dot)org>
Cc:	"pgsql-bugs" <pgsql-bugs(at)postgresql(dot)org>
Subject:	回复：The same 2PC data maybe recovered twice
Date:	2023-07-07 09:48:39
Message-ID:	0706bec1-80ae-4f99-8cf8-c89734978770.mengjuan.cmj@alibaba-inc.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-bugs pgsql-hackers

Hi, all
I add a patch for pg11 to fix this bug, hope you can check it.
Thanks & Best Regard
------------------------------------------------------------------
发件人：蔡梦娟(玊于) <mengjuan(dot)cmj(at)alibaba-inc(dot)com>
发送时间：2023年7月6日(星期四) 10:02
收件人：pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
抄　送：pgsql-bugs <pgsql-bugs(at)postgresql(dot)org>
主　题：The same 2PC data maybe recovered twice
Hi, all. I want to report a bug about recovery of 2pc data, in current implementation of crash recovery, there are two ways to recover 2pc data:
1、before redo, func restoreTwoPhaseData() will restore 2pc data those xid < ShmemVariableCache->nextXid, which is initialized from checkPoint.nextXid;
2、during redo, func xact_redo() will add 2pc from wal;
The following scenario may cause the same 2pc to be added repeatedly：
1、start creating checkpoint_1, checkpoint_1.redo is set as curInsert;
2、before set checkPoint_1.nextXid, a new 2pc is prepared, suppose the xid of this 2pc is 100, and then ShmemVariableCache->nextXid will be advanced as 101;
3、checkPoint_1.nextXid is set as 101;
4、in CheckPointTwoPhase() of checkpoint_1, 2pc_100 won't be copied to disk because its prepare_end_lsn > checkpoint_1.redo;
5、checkPoint_1 is finished, after checkpoint_timeout, start creating checkpoint_2;
6、during checkpoint_2, data of 2pc_100 will be copied to disk;
7、before UpdateControlFile() of checkpoint_2, crash happened;
8、during crash recovery, redo will start from checkpoint_1, and 2pc_100 will be restored first by restoreTwoPhaseData() because xid_100 < checkPoint_1.nextXid, which is 101;
9、because prepare_start_lsn of 2pc_100 > checkpoint_1.redo, 2pc_100 will be added again by xact_redo() during wal replay, resulting in the same 2pc data being added twice;
10、In RecoverPreparedTransactions() -> lock_twophase_recover(), lock the same 2pc will cause panic.
Is the above scenario reasonable, and do you have any good ideas for fixing this bug?
Thanks & Best Regard

Attachment	Content-Type	Size
v1-0001-Fix-a-2PC-transaction-maybe-recovered-twice_11.patch	application/octet-stream	3.3 KB

In response to

The same 2PC data maybe recovered twice at 2023-07-06 02:02:15 from 蔡梦娟 (玊于)

Responses

Re: The same 2PC data maybe recovered twice at 2023-07-12 02:57:44 from Andy Fan

Browse pgsql-bugs by date

	From	Date	Subject
Next Message	PG Bug reporting form	2023-07-07 16:46:46	BUG #18017: configure --with-ldap fails when openldap is installed
Previous Message	Daniel Gustafsson	2023-07-07 07:23:38	Re: pg_basebackup: errors on macOS on directories with ".DS_Store" files

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Matthias van de Meent	2023-07-07 09:55:28	Re: Disabling Heap-Only Tuples
Previous Message	Peter Smith	2023-07-07 09:37:54	Re: [PATCH] Reuse Workers and Replication Slots during Logical Replication