Re: 7.3.2 incorrectly counts characters for unicode varchar field

From: "Matthew Cooper" <matty(at)cloverworxs(dot)com>
To: "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: <pgsql-bugs(at)postgresql(dot)org>
Subject: Re: 7.3.2 incorrectly counts characters for unicode varchar field
Date: 2003-09-15 14:50:49
Message-ID: 000701c37b98$c0a0b970$6600030a@gateway01
Lists: pgsql-bugs

Attached is the UTF-8 encoded SQL file, in case the statement got mangled in
the mail transfer.

And here it is, pasted in directly from the window that was displaying the
Chinese characters.

insert into mgc values ('分钟练习分钟练习练习');

Looking at the UTF-8 documentation, 10 Chinese characters could take up any
number of bytes, each character being, say, 2 or 3 bytes.
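
A minimal sketch of the character/byte distinction, written in Python rather
than SQL and assuming the string above reaches the server as UTF-8. Each of
these ten characters happens to encode to three bytes, so anything that counts
bytes, or that reads each byte as a separate single-byte character, arrives at
30 rather than 10. The helper names are made up for illustration, and Latin-1
is only a stand-in for "some single-byte encoding".

```python
# The string from the INSERT statement above.
text = "分钟练习分钟练习练习"


def char_count(s: str) -> int:
    """Number of Unicode characters (code points) in s."""
    return len(s)


def utf8_byte_count(s: str) -> int:
    """Number of bytes s occupies once encoded as UTF-8."""
    return len(s.encode("utf-8"))


def misread_as_single_byte(s: str) -> str:
    """Decode the UTF-8 bytes of s as if each byte were one Latin-1 character.

    Latin-1 is an assumption made for this sketch; it maps every byte to
    exactly one character, so the result has one character per UTF-8 byte.
    """
    return s.encode("utf-8").decode("latin-1")


if __name__ == "__main__":
    print("characters:", char_count(text))
    print("UTF-8 bytes:", utf8_byte_count(text))
    print("characters if misread:", len(misread_as_single_byte(text)))
```

If the three numbers come out as 10, 30 and 30, a reported length of 30 is
consistent with the bytes being counted one by one instead of being decoded as
UTF-8 first.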

Matty.
----- Original Message -----
From: "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: "Matthew Cooper" <matty(at)cloverworxs(dot)com>
Cc: <pgsql-bugs(at)postgresql(dot)org>
Sent: Saturday, September 13, 2003 5:51 PM
Subject: Re: [BUGS] 7.3.2 incorrectly counts characters for unicode varchar field

> > insert into mgc values ('Ã¥Ë?â? éâ?TŸç»Æ'ä¹ Ã¥Ë?â?
éâ?TŸç»Æ'ä¹ ç»Æ'ä¹ ');
>
> I don't think this string is correctly unicode-encoded. Anyway "length"
> claims it is 30 characters.
>
> regards, tom lane
>

Attachment  Content-Type              Size
mgc.sql     application/octet-stream  60 bytes
