RE: Barman 2.3 errors

From: "Ahmed, Nawaz" <Nawaz(at)fast(dot)au(dot)fujitsu(dot)com>
To: GALLIANO Nicolas <Nicolas(dot)GALLIANO(at)dsi(dot)cnrs(dot)fr>, "pgsql-general(at)postgresql(dot)org" <pgsql-general(at)postgresql(dot)org>
Subject: RE: Barman 2.3 errors
Date: 2018-02-13 07:07:02
Message-ID: BAAE097528E6CB4FB2BE65AEFA6EEE04929F9CCD@SYD1217
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general


Hi Nicolas,

I would like to take stab at this one, as i had recently worked on a demo of barman. But like Michael Paquier said, it is better to check the information i provide and the situation you are facing with the maintainers of the project. Here is what i found.

The first thing is to check if you can see the replication slot named "barman" created on the target database using the below command. It should return a slot named "barman" with the slot_type as "physical". The "replication slot: OK" line of the check command shows the slot is available, however, please double check if it is true with the below query.

select * from pg_replication_slots;

Now let us target the line "WAL archive: FAILED", I faced this issue when i had killed the "receive-wal" process and restarted it. First look for the receive-wal process with the ps command

ps -ef|grep receive-wal

if it is not running then start it up in the background using the command

$ barman receive-wal tcgepg96ddm &

If the receive-wal process is running but you still face that error in the check command, then I suggest you switch the xlog using the below command.

$ barman switch-xlog tcgepg96ddm

if the above command fails to switch the xlog, then try to force it with the below command.

$ barman switch-xlog --force tcgepg96ddm

if you still cannot get it to work, then try to reset the status of the receive-wal process using the --reset option as below.

$ barman receive-wal --reset tcgepg96ddm

If you can successfully run the above reset command, then try to switch the log file and run the check command again to see if everything looks fine.

$ barman switch-xlog tcgepg96ddm

Hope that helps, again, these are the steps i had taken to resolve a similar issue. You might still want to get in touch with the project maintainers about the validity of the above commands.

Best Regards,

Nawaz Ahmed
Software Development Engineer

Fujitsu Australia Software Technology Pty Ltd
14 Rodborough Road, Frenchs Forest NSW 2086, Australia
T +61 2 9452 9027
Nawaz(at)fast(dot)au(dot)fujitsu(dot)com<mailto:Nawaz(at)fast(dot)au(dot)fujitsu(dot)com>
fastware.com.au<http://fastware.com.au/>

[cid:image001(dot)jpg(at)01D3A4F5(dot)6FE3CFE0]
[cid:image002(dot)jpg(at)01D3A4F5(dot)6FE3CFE0]

From: GALLIANO Nicolas [mailto:Nicolas(dot)GALLIANO(at)dsi(dot)cnrs(dot)fr]
Sent: Tuesday, 13 February 2018 2:23 AM
To: pgsql-general(at)postgresql(dot)org
Subject: Barman 2.3 errors

Hi,

I’m trying to backup a remote DB (9.6.6) using barman 2.3 but backup failed start.
In barman.log i’ve such errors :

2018-02-12 16:18:23,852 [57691] barman.server ERROR: Check 'replication slot' failed for server 'tcgepg96ddm'
2018-02-12 16:18:23,857 [57691] barman.server ERROR: Check 'receive-wal running' failed for server 'tcgepg96ddm'
2018-02-12 16:18:30,815 [57779] barman.wal_archiver INFO: No xlog segments found from streaming for tcgepg96ddm.
2018-02-12 16:18:30,824 [57781] barman.server INFO: Starting receive-wal for server tcgepg96ddm
2018-02-12 16:18:30,905 [57781] barman.wal_archiver INFO: Synchronous WAL streaming for barman_receive_wal: True
2018-02-12 16:18:30,906 [57781] barman.wal_archiver INFO: Activating WAL archiving through streaming protocol
2018-02-12 16:18:30,922 [57781] barman.cli ERROR: 'utf8' codec can't decode byte 0xe0 in position 47: invalid continuation byte
See log file for more details.
Traceback (most recent call last):
File "/usr/lib/python2.7/site-packages/barman/cli.py", line 1126, in main
p.dispatch(pre_call=global_config)
File "/usr/lib/python2.7/site-packages/argh/helpers.py", line 55, in dispatch
return dispatch(self, *args, **kwargs)
File "/usr/lib/python2.7/site-packages/argh/dispatching.py", line 174, in dispatch
for line in lines:
File "/usr/lib/python2.7/site-packages/argh/dispatching.py", line 277, in _execute_command
for line in result:
File "/usr/lib/python2.7/site-packages/argh/dispatching.py", line 231, in _call
result = function(namespace_obj)
File "/usr/lib/python2.7/site-packages/barman/cli.py", line 792, in receive_wal
server.receive_wal(reset=args.reset)
File "/usr/lib/python2.7/site-packages/barman/server.py", line 1708, in receive_wal
archiver.receive_wal(reset)
File "/usr/lib/python2.7/site-packages/barman/wal_archiver.py", line 751, in receive_wal
receive.execute()
File "/usr/lib/python2.7/site-packages/barman/command_wrappers.py", line 418, in execute
self.pipe_processor_loop(processors)
File "/usr/lib/python2.7/site-packages/barman/command_wrappers.py", line 484, in pipe_processor_loop
eof = stream.process()
File "/usr/lib/python2.7/site-packages/barman/command_wrappers.py", line 82, in process
self._buf += data.decode('utf-8')
File "/usr/lib64/python2.7/encodings/utf_8.py", line 16, in decode
return codecs.utf_8_decode(input, errors, True)
UnicodeDecodeError: 'utf8' codec can't decode byte 0xe0 in position 47: invalid continuation byte
2018-02-12 16:18:32,373 [58491] barman.server ERROR: Check 'WAL archive' failed for server 'tcgepg96ddm'
2018-02-12 16:18:32,468 [58491] barman.wal_archiver INFO: Synchronous WAL streaming for barman_receive_wal: True
2018-02-12 16:18:32,485 [58491] barman.server ERROR: Check 'receive-wal running' failed for server 'tcgepg96ddm'

My remote Db server conf is :

[myclient]
conninfo = host=myclient user=barman dbname=dbclient
streaming_conninfo = host= myclient user=streaming_barman
backup_method = postgres
streaming_backup_name = barman_streaming_backup
streaming_archiver = on
slot_name = barman

A barman check gives :

$barman check tcgepg96ddm
Server tcgepg96ddm:
WAL archive: FAILED (please make sure WAL shipping is setup)
PostgreSQL: OK
is_superuser: OK
PostgreSQL streaming: OK
wal_level: OK
replication slot: OK
directories: OK
retention policy settings: OK
backup maximum age: OK (no last_backup_maximum_age provided)
compression settings: OK
failed backups: OK (there are 0 failed backups)
minimum redundancy requirements: OK (have 0 backups, expected at least 0)
pg_basebackup: OK
pg_basebackup compatible: OK
pg_basebackup supports tablespaces mapping: OK
pg_receivexlog: OK
pg_receivexlog compatible: OK
receive-wal running: FAILED (See the Barman log file for more details)
archiver errors: OK

Have you ever had same problem with barman ?
Thanks for you help.
nicolas
Disclaimer

The information in this e-mail is confidential and may contain content that is subject to copyright and/or is commercial-in-confidence and is intended only for the use of the above named addressee. If you are not the intended recipient, you are hereby notified that dissemination, copying or use of the information is strictly prohibited. If you have received this e-mail in error, please telephone Fujitsu Australia Software Technology Pty Ltd on + 61 2 9452 9000 or by reply e-mail to the sender and delete the document and all copies thereof.

Whereas Fujitsu Australia Software Technology Pty Ltd would not knowingly transmit a virus within an email communication, it is the receiver’s responsibility to scan all communication and any files attached for computer viruses and other defects. Fujitsu Australia Software Technology Pty Ltd does not accept liability for any loss or damage (whether direct, indirect, consequential or economic) however caused, and whether by negligence or otherwise, which may result directly or indirectly from this communication or any files attached.

If you do not wish to receive commercial and/or marketing email messages from Fujitsu Australia Software Technology Pty Ltd, please email unsubscribe(at)fast(dot)au(dot)fujitsu(dot)com

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Durumdara 2018-02-13 07:35:36 Re: Connection loosing at some places - caused by firewall
Previous Message Michael Paquier 2018-02-13 04:08:17 Re: Barman 2.3 errors