Quick Links

Why do we let CREATE DATABASE reassign encoding?

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	pgsql-hackers(at)postgreSQL(dot)org
Subject:	Why do we let CREATE DATABASE reassign encoding?
Date:	2009-04-23 17:46:41
Message-ID:	865.1240508801@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

If I have locale set to C, I can do this:

regression=# create database u8 encoding 'utf8';
CREATE DATABASE
regression=# create database l1 encoding 'latin1' template u8;
CREATE DATABASE

Had I had any actual utf8 data in u8, l1 would now contain
encoding-corrupt information. Given that we've tried to
clamp down on encoding violations in recent releases, I wonder
why this case is still allowed.

(In non-C locales, this will typically fail because the two
different encodings can't both match the locale. But I don't
believe it's our policy to enforce encoding validity only for
non-C locales.)

We should presumably let the encoding be changed when cloning
from template0, and probably it's reasonable to trust the user
if either source or destination DB encoding is SQL_ASCII.
In other cases I'm thinking it should fail.

regards, tom lane

Responses

Re: Why do we let CREATE DATABASE reassign encoding? at 2009-04-23 18:09:25 from Heikki Linnakangas
Re: Why do we let CREATE DATABASE reassign encoding? at 2009-04-23 18:46:49 from Bill Moran
Re: Why do we let CREATE DATABASE reassign encoding? at 2009-04-23 18:50:47 from Andrew Dunstan

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Heikki Linnakangas	2009-04-23 18:09:25	Re: Why do we let CREATE DATABASE reassign encoding?
Previous Message	David E. Wheeler	2009-04-23 16:29:06	Re: citex regression fails with de.UTF8 locale