Re: how to ignore invalid byte sequence for encoding without using sql_ascii?

From: "detrox yang" <detrox(at)gmail(dot)com>
To: "Martijn van Oosterhout" <kleptog(at)svana(dot)org>, "detrox(at)gmail(dot)com" <detrox(at)gmail(dot)com>, pgsql-general(at)postgresql(dot)org
Subject: Re: how to ignore invalid byte sequence for encoding without using sql_ascii?
Date: 2007-10-10 03:33:29
Message-ID: f9d504d90710092033u68b1aac4rc2b4b20429256056@mail.gmail.com
Lists: pgsql-general

Got it. Thanks very much.

On 10/2/07, Martijn van Oosterhout <kleptog(at)svana(dot)org> wrote:
>
> On Thu, Sep 27, 2007 at 02:28:27AM -0700, detrox(at)gmail(dot)com wrote:
> > I am importing a Wikipedia dump file into my PostgreSQL database using
> > maintenance/importDump.php. It fails with 'ERROR: invalid byte sequence
> > for encoding UTF-8'. Is there any way to make PostgreSQL simply ignore the
> > invalid characters (that is, drop the invalid ones) so that the
> > script keeps going instead of dying on this error?
>
> No, postgres does not destroy data. If you want bits of your data
> removed you need to write your own tool to do it.
>
> That said, are you sure that the data you're importing is UTF-8?
>
> Have a nice day,
> --
> Martijn van Oosterhout <kleptog(at)svana(dot)org> http://svana.org/kleptog/
> From each according to his ability. To each according to his ability to
> litigate.
>
>
>
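
For anyone finding this thread later: the "write your own tool" suggestion above could look something like the following. This is only a minimal sketch, assuming the dump is meant to be UTF-8 and that silently dropping the bad bytes is acceptable. The names drop_invalid_utf8 and clean_stream are made up for illustration and are not part of PostgreSQL or MediaWiki.

```python
import codecs


def drop_invalid_utf8(raw: bytes) -> bytes:
    """Return raw with every byte that is not part of valid UTF-8 removed."""
    # errors="ignore" skips undecodable bytes instead of raising an error.
    return raw.decode("utf-8", errors="ignore").encode("utf-8")


def clean_stream(src, dst, chunk_size=1 << 16):
    """Copy binary stream src to dst, dropping invalid UTF-8 on the way.

    An incremental decoder is used so that a multi-byte character split
    across two chunks is kept rather than mistaken for garbage.
    """
    decoder = codecs.getincrementaldecoder("utf-8")(errors="ignore")
    while True:
        chunk = src.read(chunk_size)
        if not chunk:
            break
        dst.write(decoder.decode(chunk).encode("utf-8"))
    # Flush the decoder; a truncated sequence at end of input is dropped.
    dst.write(decoder.decode(b"", final=True).encode("utf-8"))
```

One way to use it would be to open the dump and a new output file in binary mode, pass both to clean_stream, and then feed the cleaned file to the import script. Note that this destroys data by design, so it is worth first checking whether the file is really in some other encoding, as asked above.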
