Re: Unicode combining characters

From:	Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp>
To:	phede-ml(at)islande(dot)org
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Unicode combining characters
Date:	2001-09-26 01:03:13
Message-ID:	20010926100313X.t-ishii@sra.co.jp
Lists:	pgsql-hackers

> > > - length() on the server side doesn't handle correctly Unicode [I
> > > have the same result with char_length()], and returns the number
> > > of chars (as it is however advertised to do), rather the length
> > > of the string.
> >
> > This is a known limitation.
>
> To solve this, we could use wcwidth() (there is a custom
> implementation for the systems which don't have it in the glibc). I'll
> have a look at it later.

And wcwidth() depends on the locale. That is another reason we
could not use it.
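
To make the distinction in this thread concrete, here is a minimal,
locale-independent sketch in C. It is not PostgreSQL code: the function
names (utf8_codepoints, utf8_base_chars) are hypothetical, the input is
assumed to be NUL-terminated UTF-8, and the combining-mark table is a
small hard-coded subset of Unicode blocks rather than the full character
database. It only shows that bytes, code points, and "characters as a
user sees them" can all differ, without calling wcwidth() or setlocale().

```c
#include <stddef.h>
#include <stdint.h>

/* Decode one UTF-8 sequence at s into *cp and return the bytes consumed.
 * Malformed or truncated sequences are consumed one byte at a time and
 * reported as U+FFFD. Each continuation byte is checked in order, so the
 * terminating NUL is never read past. */
static size_t utf8_decode(const unsigned char *s, uint32_t *cp)
{
    if (s[0] < 0x80) {
        *cp = s[0];
        return 1;
    }
    if ((s[0] & 0xE0) == 0xC0 && (s[1] & 0xC0) == 0x80) {
        *cp = ((uint32_t)(s[0] & 0x1F) << 6) | (uint32_t)(s[1] & 0x3F);
        return 2;
    }
    if ((s[0] & 0xF0) == 0xE0 && (s[1] & 0xC0) == 0x80 &&
        (s[2] & 0xC0) == 0x80) {
        *cp = ((uint32_t)(s[0] & 0x0F) << 12) |
              ((uint32_t)(s[1] & 0x3F) << 6) | (uint32_t)(s[2] & 0x3F);
        return 3;
    }
    if ((s[0] & 0xF8) == 0xF0 && (s[1] & 0xC0) == 0x80 &&
        (s[2] & 0xC0) == 0x80 && (s[3] & 0xC0) == 0x80) {
        *cp = ((uint32_t)(s[0] & 0x07) << 18) |
              ((uint32_t)(s[1] & 0x3F) << 12) |
              ((uint32_t)(s[2] & 0x3F) << 6) | (uint32_t)(s[3] & 0x3F);
        return 4;
    }
    *cp = 0xFFFD;
    return 1;
}

/* Assumption: only these combining-mark blocks are recognized. A real
 * implementation would consult the Unicode character database. */
static int is_combining(uint32_t cp)
{
    return (cp >= 0x0300 && cp <= 0x036F) ||  /* Combining Diacritical Marks */
           (cp >= 0x1AB0 && cp <= 0x1AFF) ||  /* ... Extended */
           (cp >= 0x1DC0 && cp <= 0x1DFF) ||  /* ... Supplement */
           (cp >= 0x20D0 && cp <= 0x20FF) ||  /* ... for Symbols */
           (cp >= 0xFE20 && cp <= 0xFE2F);    /* Combining Half Marks */
}

/* Number of code points in a NUL-terminated UTF-8 string. */
size_t utf8_codepoints(const char *str)
{
    const unsigned char *s = (const unsigned char *)str;
    size_t n = 0;
    uint32_t cp;

    while (*s != 0) {
        s += utf8_decode(s, &cp);
        n++;
    }
    return n;
}

/* Number of code points that are not combining marks (per the table above). */
size_t utf8_base_chars(const char *str)
{
    const unsigned char *s = (const unsigned char *)str;
    size_t n = 0;
    uint32_t cp;

    while (*s != 0) {
        s += utf8_decode(s, &cp);
        if (!is_combining(cp))
            n++;
    }
    return n;
}
```

For the string "e" followed by U+0301 (bytes 65 CC 81), strlen() gives 3,
utf8_codepoints() gives 2, and utf8_base_chars() gives 1. The precomposed
form U+00E9 (bytes C3 A9) gives 2, 1, and 1. Display width is yet another
question (East Asian wide characters, for instance) that this sketch does
not attempt to answer.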

> As Oleg suggested, I will try to aim for 7.3, first with a version in
> contrib, and later, if the implementation is fine, it could be moved
> to the core (or not ? Though it would be nice to make sure every
> PostgreSQL installation which supports unicode has it, so that users
> won't need to have administrative rights to use the functionality).

I would like to see SQL99's charset and collation functionality in 7.3
(or later). If that happens, the current multibyte implementation would
change dramatically. That would be a good time to merge your Unicode
work into the main source tree.
--
Tatsuo Ishii

In response to

Re: Unicode combining characters at 2001-09-25 18:14:20 from Patrice Hédé

Responses

Re: Unicode combining characters at 2001-09-26 06:17:36 from Thomas Lockhart