Re: Unicode is not UTF-8. was :psqlODBC-Driver Test / text

From: Johann Zuschlag <zuschlag2(at)online(dot)de>
To:
Cc: Hiroshi Inoue <inoue(at)tpf(dot)co(dot)jp>, Dave Page <dpage(at)vale-housing(dot)co(dot)uk>, pgsql-odbc(at)postgresql(dot)org
Subject: Re: Unicode is not UTF-8. was :psqlODBC-Driver Test / text
Date: 2006-03-31 16:58:57
Message-ID: 442D5FD1.4010909@online.de
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-odbc

Johann Zuschlag schrieb:
> Let's have two examples:
> 1.
> backend-1 = ISO8859-1
> backend-2 = UTF-8
>
> 'A' = U+0041 (does windows use big-endian?)
>
> Win UCS-2: U+0041
> ODBC UTF-8: U+41
> backend-1 stores = 0x41
> backend-2 stores = U+41
>
> 2.
> 'Ä' = U+00C4 (german A-Umlaut)
>
> Win UCS-2: U+00C4
> ODBC UTF-8: U+C384
> backend-1 stores = 0xC4
> backend-2 stores = U+C384
>
> Did I get that right? So I have to be really careful when testing.
>
No, again wrong. Or is it more like this:

1.
a) locale = ISO8859-1
backend-1 = LATIN1

b) locale = UTF-8
backend-2 = Unicode

'A' = U+0041 (does windows use big-endian?)

Win UCS-2: U+0041
ODBC UTF-8: U+41
backend-1 stores = U+41
backend-2 stores = U+0041

2.
'Ä' = U+00C4 (german A-Umlaut)

Win UCS-2: U+00C4
ODBC UTF-8: U+C384
backend-1 stores = 0xC4
backend-2 stores = U+00C4

Did I get that right?

Regards,
Johann

In response to

Responses

Browse pgsql-odbc by date

  From Date Subject
Next Message Marc Herbert 2006-03-31 18:47:05 Re: Unicode is not UTF-8. was :psqlODBC-Driver Test / text
Previous Message Johann Zuschlag 2006-03-31 16:51:07 Re: Unicode is not UTF-8. was :psqlODBC-Driver Test / text