Re: [GENERAL] invalid multibyte character for locale

From: Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To: Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp>
Cc: ftm(dot)van(dot)vugt(at)foxi(dot)nl, PostgreSQL-patches <pgsql-patches(at)postgresql(dot)org>
Subject: Re: [GENERAL] invalid multibyte character for locale
Date: 2005-03-15 23:53:15
Message-ID: 200503152353.j2FNrFJ26428@candle.pha.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general pgsql-patches

Tatsuo Ishii wrote:
> Apparently your hack does not kill #define USE_WIDE_UPPER_LOWER.
>
> BTW, the current code for upper/lower etc. seems to be broken. The
> exact problem you have are happening in Japanese encodings too(EUC_JP)
> too. PostgreSQL should not use wide-character method if LC_CTYPE is C.

Here is the patch I made. I will apply it to CVS HEAD and 8.0.X because
it is a bug fix.

---------------------------------------------------------------------------

> --
> Tatsuo Ishii
>
> > L.S.
> >
> > I have a database created on:
> >
> > db=# select version();
> > version
> > ---------------------------------------------------------------------
> > PostgreSQL 8.0.1 on i686-pc-linux-gnu, compiled by GCC egcs-2.91.66
> > (1 row)
> >
> >
> > The initdb was done using no-locale and unicode as default encoding, the
> > particular database itself is indeed encoded as UNICODE.
> >
> >
> > Due to a buggy glibc, the following patch was applied to this install in order
> > to avoid a crash on things like 'upper(<string>)':
> >
> > --- oracle_compat.c_orig Mon Dec 6 22:14:11 2004
> > +++ oracle_compat.c Mon Dec 6 22:14:24 2004
> > @@ -43,7 +43,7 @@
> > * We assume if we have these two functions, we have their friends too, and
> > * can use the wide-character method.
> > */
> > -#if defined(HAVE_WCSTOMBS) && defined(HAVE_TOWLOWER)
> > +#if defined(HAVE_WCSTOMBS) && defined(HAVE_TOWLOWER) && FALSE
> > #define USE_WIDE_UPPER_LOWER
> > #endif
> >
> >
> > The database on this machine was dumped and then restored on another, which
> > has a more recent installation of Slack on it:
> >
> >
> > db=# select version();
> > version
> > ------------------------------------------------------------------------
> > PostgreSQL 8.0.1 on i586-pc-linux-gnu, compiled by GCC gcc (GCC) 3.2.3
> > (1 row)
> >
> >
> > Again, the initdb on this machine was done using no-locale and unicode as
> > default encoding, the particular database obviously is also encoded as
> > UNICODE.
> >
> >
> >
> > On the second machine, I'm now getting the following:
> >
> > db=# select 'J?TERBOG';
> > ?column?
> > ----------
> > J?TERBOG
> > (1 row)
> >
> > db=# select lower('J?TERBOG');
> > ERROR: invalid multibyte character for locale
> > HINT: The server's LC_CTYPE locale is probably incompatible with the database
> > encoding.
> >
> >
> >
> > As far as I can tell, this didn't happen with v8.0.0, but I'm afraid I can't
> > be totally sure about that. Obviously, the error doesn't occur on the first
> > machine due to the hack needed for the buggy glibc.
> >
> >
> > I'd appreciate a pointer as to what is causing this. It 'shouldn't' be the
> > hack nor the dump/restore cycle, but.......?
> >
> >
> > TIA.
> >
> >
> >
> > --
> > Best,
> >
> >
> >
> >
> > Frank.
> >
> > ---------------------------(end of broadcast)---------------------------
> > TIP 7: don't forget to increase your free space map settings
> >
>
> ---------------------------(end of broadcast)---------------------------
> TIP 2: you can get off all lists at once with the unregister command
> (send "unregister YourEmailAddressHere" to majordomo(at)postgresql(dot)org)
>

--
Bruce Momjian | http://candle.pha.pa.us
pgman(at)candle(dot)pha(dot)pa(dot)us | (610) 359-1001
+ If your life is a hard drive, | 13 Roberts Road
+ Christ can be your backup. | Newtown Square, Pennsylvania 19073

Attachment Content-Type Size
unknown_filename text/plain 3.7 KB

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Mark Rae 2005-03-15 23:56:36 Re: prelimiary performance comparison pgsql vs mysql
Previous Message Bruce Momjian 2005-03-15 23:46:50 Re: prelimiary performance comparison pgsql vs mysql

Browse pgsql-patches by date

  From Date Subject
Next Message Alvaro Herrera 2005-03-16 00:21:02 Re: COPY CSV header line feature
Previous Message Tom Lane 2005-03-15 22:42:23 WIP: avoiding tuple construction/deconstruction overhead