On Thu, Dec 1, 2011 at 12:30 AM, Robert Haas <robertmhaas(at)gmail(dot)com> wrote:
> On Mon, Nov 21, 2011 at 11:49 AM, Alexander Korotkov
> <aekorotkov(at)gmail(dot)com> wrote:
> > I've a question about pg_mb2wchar function. Is there any way for inverse
> > convertion pg_wchar* to char*?
> > I've looked to pg_wchar_tbl table definition, and I didn't find anything
> > about inverse transformation. So, any change to get inverse convertion?
> > I'm experimenting with index support for regexp search and I'm trying to
> > some characters back from color map.
> Well, any char can presumably also be represented as a wchar, but the
> reverse isn't necessarily true...
So, if wchar can't be presented as sequence of chars, it means that it
can't occurs in string in server encoding. In this situation it's enough
for me to know that it is so for paticular wchar.
I found that for UTF8 uncoding wchar is unicode. For single-bytes encodings
wchar just hold original value in it's lower byte. And there are some
conversions for PG_EUC_JP, PG_EUC_CN, PG_EUC_KR, PG_EUC_TW, PG_EUC_JIS_2004
which are not clear for me, but it's seems to be feasible to write inverse
conversion code using existing code of direct conversion.
> (What's a color map?)
Regexp engine translates regexp to finite state automatum. In order to
automatum don't have too much arc, characters are grouped to colors.
Colormap maps wchar -> color number. Since analyze automatum produced by
regexp engine, I need to convert wchar to original character.
With best regards,
In response to
pgsql-hackers by date
|Next:||From: Alexander Korotkov||Date: 2011-12-01 13:34:58|
|Subject: Re: WIP: index support for regexp search|
|Previous:||From: Kohei KaiGai||Date: 2011-12-01 13:25:21|
|Subject: Re: Refactoring on DROP/ALTER SET SCHEMA/ALTER RENAME TO statement|