| From: | Bala M <krishna(dot)pgdba(at)gmail(dot)com> |
|---|---|
| To: | Adrian Klaver <adrian(dot)klaver(at)aklaver(dot)com> |
| Cc: | Greg Sabino Mullane <htamfids(at)gmail(dot)com>, Francisco Olarte <folarte(at)peoplecall(dot)com>, chris+google(at)qwirx(dot)com, pgsql-general(at)lists(dot)postgresql(dot)org |
| Subject: | Re: Index corruption issue after migration from RHEL 7 to RHEL 9 (PostgreSQL 11 streaming replication) |
| Date: | 2025-11-06 17:04:20 |
| Message-ID: | CAJ4rSwuMDcsvXNfxBefWfDknoJMkdZoDmOJ_8pmo8ut_h_V57g@mail.gmail.com |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-general |
Hi Adrian, Thank you for your response. Please find the requested details
below:
*PostgreSQL Version:*
Source: PostgreSQL 11.15
Target: PostgreSQL 16.9
*Operating System:*
Source: RHEL 7.9
Target: RHEL 9.6
*Network Distance:*
Both servers are in the same data center, connected through a high-speed
internal network (low latency).
Logical Replication Settings:
*Source - Postgres 11.15.*
-- ==== WAL & Replication Settings ====
wal_level = 'logical'
max_wal_senders = '30'
max_replication_slots = '20'
wal_keep_segments = '800'
wal_sender_timeout = '300s'
max_worker_processes = '32'
max_logical_replication_workers = '16'
max_sync_workers_per_subscription = '8'
==== WAL & Checkpoint ====
max_wal_size = '40GB'
min_wal_size = '4GB'
checkpoint_timeout = '45min'
checkpoint_completion_target = '0.9'
==== Memory ====
shared_buffers = '18GB'
work_mem = '128MB'
maintenance_work_mem = 4GB'
effective_cache_size = '275GB'
*Target DB Postgres 16.10*
==== Logical Replication Settings ====
max_worker_processes = '32'
max_logical_replication_workers = '16'
max_sync_workers_per_subscription = '8'
wal_receiver_timeout = '300s'
==== WAL & Checkpoint ====
checkpoint_timeout = '45min'
checkpoint_completion_target = '0.9'
max_wal_size = '40GB'
min_wal_size = '4GB'
==== Memory ====
shared_buffers = '18GB'
work_mem = '128MB'
maintenance_work_mem = '4GB'
effective_cache_size = '275GB'
synchronous_commit = 'off'
Since you have already started is that not already to late for this?
Yes We are currently in the *testing phase* and validating with the above
parameters. However, the replication process has been *extremely slow —
it’s been running for the past 5 days* with limited progress.
Any specific tuning recommendations or best practices to improve
performance at this stage would be greatly appreciated.
Thanks & Regards
Krishna.
On Wed, 5 Nov 2025 at 21:07, Adrian Klaver <adrian(dot)klaver(at)aklaver(dot)com>
wrote:
> On 11/4/25 22:27, Bala M wrote:
> > Thank you all for your suggestions,
> >
> > Thanks for your quick response and for sharing the details.
> > After reviewing the options, the logical replication approach seems to
> > be the most feasible one with minimal downtime.
> >
> > However, we currently have 7 streaming replication setups running from
> > production, with a total database size of around 15 TB. Out of this,
> > there are about 10 large tables ranging from 1 TB (max) to 50 GB (min)
> > each, along with approximately 150+ sequences.
> >
> > Could you please confirm if there are any successful case studies or
> > benchmarks available for a similar setup?
>
> Since you have given minimal information in this post, I doubt there is
> really a way to compare to other situations. Collect the details you
> provided earlier in the thread for those folks getting to it just now.
>
> That would be:
>
> 1) Postgres versions on both ends
>
> 2) OS and versions on both ends.
>
> 3) Network distance between 'machines'.
>
> 4) The logical replication settings.
>
> > Additionally, please share any recommended parameter tuning or best
> > practices for handling logical replication at this scale.
>
> Since you have already started is that not already to late for this?
>
>
>
> >
> > Current server configuration:
> >
> > CPU: 144 cores
> >
> > RAM: 512 GB
> >
> >
> > Thanks & Regards
> > Krishna.
> >
>
>
>
> --
> Adrian Klaver
> adrian(dot)klaver(at)aklaver(dot)com
>
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Adrian Klaver | 2025-11-06 18:23:15 | Re: Index corruption issue after migration from RHEL 7 to RHEL 9 (PostgreSQL 11 streaming replication) |
| Previous Message | Jan Wieremjewicz | 2025-11-06 09:29:04 | Re: Enquiry about TDE with PgSQL |