Re: Unicode combining characters

From: Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To: Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp>
Cc: PostgreSQL-development <pgsql-hackers(at)postgreSQL(dot)org>, henry(at)zoo(dot)utoronto(dot)ca
Subject: Re: Unicode combining characters
Date: 2001-10-04 04:30:13
Message-ID: 200110040430.f944UDf16383@candle.pha.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers pgsql-patches

> > Ok. I ran the modified test (now the iteration is reduced to 100000 in
> > liketest()). As you can see, there's huge difference. MB seems up to
> > ~8 times slower:-< There seems some problems existing in the
> > implementation. Considering REGEX is not so slow, maybe we should
> > employ the same design as REGEX. i.e. using wide charcters, not
> > multibyte streams...

Let me add I think our regex code is very slow. It is the standard BSD
regex library by Henry Spencer. He rewrote it a few years ago for TCL
8.X and said he was working on a standalone library version. I have
asked him several times via email over the years but he still has not
released a standalone version of the new optimized regex code. It is on
our TODO list.

--
Bruce Momjian | http://candle.pha.pa.us
pgman(at)candle(dot)pha(dot)pa(dot)us | (610) 853-3000
+ If your life is a hard drive, | 830 Blythe Avenue
+ Christ can be your backup. | Drexel Hill, Pennsylvania 19026

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Thomas Lockhart 2001-10-04 04:39:19 Re: Timestamp, fractional seconds problem
Previous Message Tom Lane 2001-10-04 04:25:19 Re: Problem on AIX with current

Browse pgsql-patches by date

  From Date Subject
Next Message Vsevolod Lobko 2001-10-04 05:56:35 Re: [PATCH] Patch for more verbose error reporting in
Previous Message Tatsuo Ishii 2001-10-04 04:11:21 Re: Unicode combining characters