Re: plsql gets "out of memory"

From: Scott Marlowe <scott(dot)marlowe(at)gmail(dot)com>
To: Rural Hunter <ruralhunter(at)gmail(dot)com>
Cc: pgsql-admin(at)postgresql(dot)org, Kevin Grittner <Kevin(dot)Grittner(at)wicourts(dot)gov>, Julio Leyva <jcleyva(at)hotmail(dot)com>
Subject: Re: plsql gets "out of memory"
Date: 2011-09-03 04:02:35
Message-ID: CAOR=d=1Eayc8=NZkd-Ux9r4N7XfPjiJxPtJY6bzYk3r_qyK+dQ@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-admin

On Fri, Sep 2, 2011 at 9:30 PM, Rural Hunter <ruralhunter(at)gmail(dot)com> wrote:
> Hi Kevin,
>
> I did another try with following additional changes based on our discussion:
> 1. use the tcp connection
> 2. turn off autovacuum
> 3. turn off full_page_writes
>
> I could import more than 30G data in about 2 hours. That's totally
> acceptable performance to me with the current server capability.  There is a
> minor issue though. I saw a few errors during the import:
> ERROR:  invalid byte sequence for encoding "UTF8": 0xe6272c
> ERROR:  invalid byte sequence for encoding "UTF8": 0xe5272c
> ERROR:  invalid byte sequence for encoding "UTF8": 0xe5272c
> ERROR:  invalid byte sequence for encoding "UTF8": 0xe5272c
> ERROR:  invalid byte sequence for encoding "UTF8": 0xe68e27
> ERROR:  invalid byte sequence for encoding "UTF8": 0xe7272c
> ERROR:  invalid byte sequence for encoding "UTF8": 0xe5272c
> ERROR:  invalid byte sequence for encoding "UTF8": 0xe5a427
>
> My data was exported from an UTF8 MySQL database and my pgsql db is also
> UTF8. I got 8 errors above only with about 3 million records imported. The
> strange thing is, I usually see the problematic SQL output in the log if
> there is any error for that SQL so I have a chance to fix the data manually.
> But for the errors above, I don't see any SQL logged. The pgsql log just
> output error log same as above with no additional info:
> 2011-09-01 11:26:32 CST ERROR:  invalid byte sequence for encoding "UTF8":
> 0xe6272c
> 2011-09-01 11:26:47 CST ERROR:  invalid byte sequence for encoding "UTF8":
> 0xe5272c
> 2011-09-01 11:26:53 CST ERROR:  invalid byte sequence for encoding "UTF8":
> 0xe5272c
> 2011-09-01 11:26:58 CST ERROR:  invalid byte sequence for encoding "UTF8":
> 0xe5272c
> 2011-09-01 11:26:58 CST ERROR:  invalid byte sequence for encoding "UTF8":
> 0xe68e27
> 2011-09-01 11:27:01 CST ERROR:  invalid byte sequence for encoding "UTF8":
> 0xe7272c
> 2011-09-01 11:27:06 CST ERROR:  invalid byte sequence for encoding "UTF8":
> 0xe5272c
> 2011-09-01 11:27:15 CST ERROR:  invalid byte sequence for encoding "UTF8":
> 0xe5a427
>
> What could be the cause of that?

MySQL probably has looser checking of proper UTF-8 encodings.

In response to

Browse pgsql-admin by date

  From Date Subject
Next Message Lukasz Brodziak 2011-09-05 11:06:33 Re: Schema with OID 0 does not exist in pg_dump
Previous Message Rural Hunter 2011-09-03 03:30:26 Re: plsql gets "out of memory"