Re: BUG #17438: Logical replication hangs on master after huge DB load

From: Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>
To: Sergey Belyashov <sergey(dot)belyashov(at)gmail(dot)com>
Cc: PostgreSQL mailing lists <pgsql-bugs(at)lists(dot)postgresql(dot)org>
Subject: Re: BUG #17438: Logical replication hangs on master after huge DB load
Date: 2022-04-01 04:39:37
Message-ID: CAA4eK1+F84tYPZ1CZJVf7PcyM8qZG+c4bmsBSq+w_Hce5Of5zg@mail.gmail.com
Lists: pgsql-bugs

On Thu, Mar 31, 2022 at 6:23 PM Sergey Belyashov
<sergey(dot)belyashov(at)gmail(dot)com> wrote:
>
> On Tue, Mar 29, 2022 at 15:03, Amit Kapila <amit(dot)kapila16(at)gmail(dot)com> wrote:
> > Is my understanding correct that this problem occurs only when you
> > are performing large operations on tables that are not published
> > (like C or D in your case)?
>
> I do not do anything on the published tables themselves: I work on
> partitions, and I detach them from the published tables first.
>
> > It is difficult for me to deduce anything from the errors given. I
> > think we need some more information from the logs to find the reason.
> > One idea is to run the subscriber with log_min_messages set to
> > debug1, as that might give some more information in the logs.
>
> Most of the errors are:
...
>
> The log file is attached.
>

From the logs, I see the below errors:
...
2022-03-31 13:26:47.234 MSK [887763] ERROR: terminating logical replication worker due to timeout
2022-03-31 13:26:47.241 MSK [888012] ERROR: terminating logical replication worker due to timeout
...

Based on these errors and the information you provided (operations on
non-published tables), I think you are facing the same problem that is
being discussed in thread [1]. It would be great if you could test the
patch submitted in email [2] and let us know your findings.

[1] - https://www.postgresql.org/message-id/CAA5-nLARN7-3SLU_QUxfy510pmrYK6JJb%3Dbk3hcgemAM_pAv%2Bw%40mail.gmail.com
[2] - https://www.postgresql.org/message-id/OS3PR01MB6275E0C2B4D9E488AD7CBA209E1F9%40OS3PR01MB6275.jpnprd01.prod.outlook.com

--
With Regards,
Amit Kapila.
