Unaccent extension python script Issue in Windows

From: Ramanarayana <raam(dot)soft(at)gmail(dot)com>
To: PostgreSQL Developers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Cc: Hugh Ranalli <hugh(at)whtc(dot)ca>, Thomas Munro <thomas(dot)munro(at)enterprisedb(dot)com>, Michael Paquier <michael(at)paquier(dot)xyz>
Subject: Unaccent extension python script Issue in Windows
Date: 2019-03-11 16:24:45
Message-ID: CAKm4Xs7_61XMyOWmHs3n0mmkS0O4S0pvfWk=7cQ5P0gs177f7A@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Hi Hackers,

In master branch, unaccent extension is having issue with the below python
script.This issue is only in windows 10 and python 3.

python generate_unaccent_rules.py --unicode-data-file UnicodeData.txt
--latin-ascii-file Latin-ASCII.xml > unaccent.rules

I am getting the following error

UnicodeEncodeError: 'charmap' codec can't encode character '\u0100' in
position 0: character maps to <undefined>

I went through the python script and found that the stdout encoding is set
to utf-8 only if python version is <=2. The same needs to be done for
python 3
--
Cheers
Ram 4.0

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Alvaro Herrera 2019-03-11 16:27:47 Re: move hash_any to utils/hash/hashfn.c
Previous Message Andrew Dunstan 2019-03-11 15:59:50 Re: pgbench MAX_ARGS