Re: Unicode combining characters

From: Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp>, ZeugswetterA(at)spardat(dot)at, phede-ml(at)islande(dot)org, pgsql-hackers(at)postgresql(dot)org
Subject: Re: Unicode combining characters
Date: 2001-10-03 16:05:34
Message-ID: 200110031605.f93G5ZI29159@candle.pha.pa.us
Lists: pgsql-hackers pgsql-patches

> Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp> writes:
> > ... There seem to be some problems in the
> > implementation. Considering REGEX is not so slow, maybe we should
> > employ the same design as REGEX, i.e. using wide characters, not
> > multibyte streams...
>
> Seems like a good thing to put on the to-do list. In the meantime,
> we still have the question of whether to enable multibyte in the
> default configuration. I'd still vote YES, as these results seem
> to me to demonstrate that there is no wide-ranging performance penalty.
> A problem confined to LIKE on long strings isn't a showstopper IMHO.
>

Added to TODO:

* Use wide characters to evaluate regular expressions, for performance
(Tatsuo)
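
To illustrate the idea behind this item, here is a rough sketch (not the
backend code, and not how REGEX or LIKE are actually implemented): decode
the multibyte string to wide characters once, so that every element the
matcher sees is one whole character. The names to_wide and like_match are
made up for this example, and UTF-8 is assumed as the encoding.

```python
def to_wide(data: bytes) -> list:
    """Decode a UTF-8 byte string once into a list of code points.

    Hypothetical helper: after this step each list element is one
    whole character, regardless of how many bytes it occupied.
    """
    return [ord(ch) for ch in data.decode("utf-8")]


ANY_RUN = ord("%")   # LIKE wildcard: any run of elements, possibly empty
ANY_ONE = ord("_")   # LIKE wildcard: exactly one element


def like_match(text: list, pattern: list) -> bool:
    """Match a LIKE-style pattern against a sequence of integers.

    The matcher has no notion of encoding: '_' consumes exactly one
    element, whether that element is a byte or a wide character.
    """
    t = p = 0
    star_p, star_t = -1, 0   # last '%' seen, and the text index to retry from
    while t < len(text):
        if p < len(pattern) and pattern[p] == ANY_RUN:
            star_p, star_t = p, t
            p += 1
        elif p < len(pattern) and pattern[p] in (ANY_ONE, text[t]):
            p += 1
            t += 1
        elif star_p >= 0:
            # Backtrack: let the last '%' absorb one more element.
            star_t += 1
            t = star_t
            p = star_p + 1
        else:
            return False
    # Any trailing '%' in the pattern can match the empty string.
    while p < len(pattern) and pattern[p] == ANY_RUN:
        p += 1
    return p == len(pattern)


text = "café".encode("utf-8")      # 'é' occupies two bytes in UTF-8
pattern = "caf_".encode("utf-8")

# Byte by byte, '_' consumes only the first byte of 'é'.
bytewise = like_match(list(text), list(pattern))        # False

# After a single up-front conversion, '_' consumes the whole character.
wide = like_match(to_wide(text), to_wide(pattern))      # True
```

With wide characters the matching loop needs no per-step knowledge of
character lengths; the cost of the encoding is paid once, in the
conversion.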

--
Bruce Momjian | http://candle.pha.pa.us
pgman(at)candle(dot)pha(dot)pa(dot)us | (610) 853-3000
+ If your life is a hard drive, | 830 Blythe Avenue
+ Christ can be your backup. | Drexel Hill, Pennsylvania 19026
