Re: BUG #5944: COPY FROM doesn't work with international characters

From: John R Pierce <pierce(at)hogranch(dot)com>
To: Nathan Davalos <n(dot)davalos(at)sharedmarketing(dot)com>
Cc: pgsql-bugs(at)postgresql(dot)org
Subject: Re: BUG #5944: COPY FROM doesn't work with international characters
Date: 2011-03-23 23:49:03
Message-ID: 4D8A86EF.6020502@hogranch.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

On 03/23/11 4:32 PM, Nathan Davalos wrote:
> ...
> SET CLIENT_ENCODING TO 'WIN1251';
> copy tmpintermediate from 'thefile.txt';
>
>
> Sample contents of thefile:
> 230002 Alto Desempeño, S.A. De C.V.
>
> When using WIN1251 or WIN1252 I get nothing in the second field, it just
> ignores the data. Same thing for LATIN-1.
>
> When using UTF8 for client encoding I get this message:
> ERROR: invalid byte sequence for encoding "UTF8": 0xf16f2c20
> CONTEXT: COPY tmpintermediate , line 1

what is the byte (binary) encoding of the file? in hex,

ñ in win1251 == (no such character. win1251 is cyrillic)
ñ in win1252 == F1
ñ in UTF-8 == C3 B1

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Tom Lane 2011-03-24 00:07:53 Re: Index Ignored Due To Use Of View
Previous Message Nathan Davalos 2011-03-23 23:32:58 BUG #5944: COPY FROM doesn't work with international characters