Re: What is the maximum encoding-conversion growth rate, anyway?

From: Tatsuo Ishii <ishii(at)postgresql(dot)org>
To: mike(at)fuhr(dot)org
Cc: tgl(at)sss(dot)pgh(dot)pa(dot)us, pgsql-hackers(at)postgresql(dot)org
Subject: Re: What is the maximum encoding-conversion growth rate, anyway?
Date: 2007-05-29 04:02:35
Message-ID: 20070529.130235.15250995.t-ishii@sraoss.co.jp
Lists: pgsql-hackers

> On Mon, May 28, 2007 at 10:23:42PM -0400, Tom Lane wrote:
> > Tatsuo Ishii <ishii(at)postgresql(dot)org> writes:
> > > I'm afraid we have to make it larger, rather than smaller, for 8.3. For
> > > example, 0x82f5 in SHIFT_JIS_2004 (new in 8.3) becomes a *pair* of 3-byte
> > > UTF_8 sequences (0x00e3818b and 0x00e3829a). See
> > > util/mb/Unicode/shift_jis_2004_to_utf8_combined.map for more details.
> >
> > > So the worst case is now 6, rather than 3.
> >
> > Yipes.
>
> Isn't MAX_CONVERSION_GROWTH a multiplier? Doesn't 2 bytes becoming
> 2 * 3 bytes represent a growth of 3, not 6? Or does that 2-byte
> SHIFT_JIS_2004 sequence have a 1-byte sequence in another supported
> encoding? Or am I missing something?

Oops. You are right. MAX_CONVERSION_GROWTH should be 3 (= (2*3)/2),
rather than 6, for this case.

So it seems we could safely lower MAX_CONVERSION_GROWTH to 3 for the
moment.
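
To make the arithmetic concrete, here is a minimal sketch in Python. It only
uses the byte values quoted above and does not call any PostgreSQL code. The
helper name growth_multiplier is hypothetical, and it assumes, as Mike
suggests, that MAX_CONVERSION_GROWTH is a per-byte multiplier on the input
length.

```python
# Byte values are the ones quoted in this thread; nothing here touches
# the backend's conversion routines.
src = bytes.fromhex("82f5")              # one SHIFT_JIS_2004 character, 2 bytes
dst = bytes.fromhex("e3818b" "e3829a")   # its UTF_8 form: two 3-byte sequences


def growth_multiplier(src_len, dst_len):
    """Hypothetical helper: smallest integer m with src_len * m >= dst_len."""
    return -(-dst_len // src_len)        # ceiling division


# The six UTF_8 bytes decode to two code points (a base character plus a
# combining mark), which is why one input character yields a pair.
decoded = dst.decode("utf-8")
print([hex(ord(c)) for c in decoded])

# 2 input bytes -> 6 output bytes, so the multiplier is 6 / 2 = 3, not 6.
print(len(src), len(dst), growth_multiplier(len(src), len(dst)))
```

The ratio is taken per input byte, so the count of output bytes alone (6) is
not the growth factor.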
--
Tatsuo Ishii
SRA OSS, Inc. Japan
