Re: pg_basebackup error: replication slot "pg_basebackup_2194" already exists

From: Magnus Hagander <magnus(at)hagander(dot)net>
To: Ludovic Vaugeois-Pepin <ludovicvp(at)gmail(dot)com>
Cc: PostgreSQL mailing lists <pgsql-general(at)postgresql(dot)org>
Subject: Re: pg_basebackup error: replication slot "pg_basebackup_2194" already exists
Date: 2017-05-30 19:32:30
Message-ID: CABUevEyqc-9ETegRvB2C+CVbRFjHv1aDqMAQF=qZvcggWL1ubg@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general pgsql-hackers

On Tue, May 30, 2017 at 9:14 PM, Ludovic Vaugeois-Pepin <ludovicvp(at)gmail(dot)com
> wrote:

> I ran into the issue described below with 10.0 beta. The error I got is:
>
> pg_basebackup: could not create temporary replication slot
> "pg_basebackup_2194": ERROR: replication slot "pg_basebackup_2194"
> already exists
>
> A race condition? Or maybe I am doing something wrong.
>
>
>
>
>
> Release:
> Name : postgresql10-server
> Version : 10.0
> Release : beta1PGDG.rhel7
>
>
> Test Type:
> Functional testing of a pacemaker resource agent
> (https://github.com/ulodciv/pgha)
>
>
> Test Detail:
> During context/environement setup, pg_basebackup is invoked (in
> parallel) from multiple virtual machines. The backups are then started
> as asynchronously replicated hot standbies.
>
>
> Platform:
> Centos 7.3
>
>
> Installation Method:
> yum -y install
> https://download.postgresql.org/pub/repos/yum/testing/10/
> redhat/rhel-7-x86_64/pgdg-redhat10-10-1.noarch.rpm
> yum -y install postgresql10-server postgresql10-contrib
>
>
> Platform Detail:
>
>
> Test Procedure:
>
> Have pg_basebackup run simultaneously on multiple hosts against
> the same instance eg:
>
> pg_basebackup -h test4 -p 5432 -D /var/lib/pgsql/10/data -U repl1
> -Xs
>
>
> Failure?
>
> E deploylib.deployer_error.DeployerError:
> postgres(at)test5: got exit status 1 for:
> E pg_basebackup -h test4 -p 5432 -D
> /var/lib/pgsql/10/data -U repl1 -Xs
> E stderr: pg_basebackup: could not create temporary
> replication slot "pg_basebackup_2194": ERROR: replication slot
> "pg_basebackup_2194" already exists
> E pg_basebackup: child process exited with error 1
> E pg_basebackup: removing data directory
> "/var/lib/pgsql/10/data"
>
>
> Test Results:
>
>
> Comments:
> This seems to be new with 10. I recently began testing the
> pacemaker resource agent against PG 10. I never had (or noticed) this
> failure with 9.6.1 and 9.6.2.
>

Hah, that's an interesting failure. In the name of the slot, the 2194 comes
from the pid -- but it's the pid of pg_basebackup.

I assume you're not running the two pg_basebackup processes on the same
machine? Is it predictable when this happens (meaning that the pid value is
actually predictable), or do you have to run it a large numbe rof times
before it happens?

--
Magnus Hagander
Me: https://www.hagander.net/ <http://www.hagander.net/>
Work: https://www.redpill-linpro.com/ <http://www.redpill-linpro.com/>

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Joshua Ma 2017-05-30 20:56:02 9.5 "chained equality" behavior
Previous Message Kenneth Marshall 2017-05-30 19:25:05 Re: pg_basebackup error: replication slot "pg_basebackup_2194" already exists

Browse pgsql-hackers by date

  From Date Subject
Next Message Daniel Gustafsson 2017-05-30 19:54:22 Re: [JDBC] Channel binding support for SCRAM-SHA-256
Previous Message Masahiko Sawada 2017-05-30 19:28:05 Re: "create publication..all tables" ignore 'partition not supported' error