| From: | "Iain" <iain(at)mst(dot)co(dot)jp> | 
|---|---|
| To: | "Christian Fowler" <spider(at)viovio(dot)com>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us> | 
| Cc: | "pgsql-admin list" <pgsql-admin(at)postgresql(dot)org> | 
| Subject: | Re: evil characters #bfef cause dump failure | 
| Date: | 2004-11-16 01:17:54 | 
| Message-ID: | 007901c4cb7a$1cbddcb0$7201a8c0@mst1x5r347kymb | 
| Views: | Whole Thread | Raw Message | Download mbox | Resend email | 
| Thread: | |
| Lists: | pgsql-admin | 
It seems that this kind of thing pops up from time to time. I don't have v8 
available right now to check, but is SQL_ASCII still the default DB 
encoding? I'm wondering is unicode wouldn't be a better choice these days.
regards
Iain
----- Original Message ----- 
From: "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: "Christian Fowler" <spider(at)viovio(dot)com>
Cc: "pgsql-admin list" <pgsql-admin(at)postgresql(dot)org>
Sent: Tuesday, November 16, 2004 6:00 AM
Subject: Re: [ADMIN] evil characters #bfef cause dump failure
> Christian Fowler <spider(at)viovio(dot)com> writes:
>>   server_encoding
>> -----------------
>>   SQL_ASCII
>
>> whoa! yikes, I bet this has a lot to do with it? I really wanted to keep
>> everything UNICODE end-to-end.  I must have forgotten --encoding on my
>> initdb? Anything I can do at this point?
>
> Hmm ... the safe way would be dump-n-reload but that's not working for
> you.  What you can try is to alter the pg_database.encoding value for
> that database, then start fresh backends (any existing ones won't notice
> the change).  Worst case if that doesn't make life good is to change it
> back.
>
> The real problem is that you've got invalid unicode data in the database
> (I'm not an expert, but I think that #bf is a 1-byte UTF8 sequence and
> then #ef starts a 3-byte sequence, so if this comes within 2 characters
> of end-of-line that would explain your dump problem).  You had better
> fix the data first before trying to lock down the encoding.  Once you
> change the encoding, backend internal operations will start spitting up
> on any stored bad data, whereas right now it's just passing it through
> unchanged.
>
> The safest way might be a dump-n-reload in any case, since reloading
> into a fresh UNICODE database will catch bad data.  If you try manual
> repairs you're likely to miss some places :-(
>
> regards, tom lane
>
> ---------------------------(end of broadcast)---------------------------
> TIP 5: Have you checked our extensive FAQ?
>
>               http://www.postgresql.org/docs/faqs/FAQ.html 
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Tom Lane | 2004-11-16 01:34:51 | Re: evil characters #bfef cause dump failure | 
| Previous Message | Markus Bertheau | 2004-11-15 23:13:21 | Re: evil characters #bfef cause dump failure |