Re: converting databases form SQL_ASCII to UTF8

From: Jasen Betts <jasen(at)xnet(dot)co(dot)nz>
To: pgsql-general(at)postgresql(dot)org
Subject: Re: converting databases form SQL_ASCII to UTF8
Date: 2011-05-03 13:05:14
Message-ID: ipouia$83r$2@reversiblemaps.ath.cx
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On 2011-04-22, Geoffrey Myers <geof(at)serioustechnology(dot)com> wrote:
> Vick Khera wrote:
>> On Fri, Apr 22, 2011 at 11:00 AM, Geoffrey Myers
>> <lists(at)serioustechnology(dot)com <mailto:lists(at)serioustechnology(dot)com>> wrote:
>>
>> Here's our problem. We planned on moving databases a few at a time.
>> Problem is, there is a process that pushes data from one database to
>> another. If this process attempts to push data from a SQL_ASCII
>> database to a new UTF8 database and it has one of these characters
>> mentioned above, the process fails.
>>
>>
>> The database's enforcement of the encoding should be the last layer that
>> does so. Your applications should be enforcing strict utf-8 encoding
>> from start to finish. Once this is done, and the old data already in
>> the DB is properly encoded as utf-8, then there should be no problems
>> switching on the utf-8 encoding in postgres to get that final layer of
>> verification.
>
> Totally agree. Still, the question remains, why not leave it as SQL_ASCII?

perhaps you want sorted output in some locale other than 'C'?
or maybe want to take a substring in the database...

utf8 in SQL-ASCII is just a string of octets

utf8 in a utf8 database is a string of unicode characters.

--
⚂⚃ 100% natural

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message raghu ram 2011-05-03 13:30:46 Re: Can we Flush the Postgres Shared Memory ?
Previous Message Raghavendra 2011-05-03 12:31:11 Re: Can we Flush the Postgres Shared Memory ?