Re: [PERFORM] multi-threaded pgloader makes it in version 2.3.0

From: Simon Riggs <simon(at)2ndquadrant(dot)com>
To: Dimitri Fontaine <dfontaine(at)hi-media(dot)com>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: [PERFORM] multi-threaded pgloader makes it in version 2.3.0
Date: 2008-03-10 17:14:23
Message-ID: 1205169263.4269.169.camel@ebony.site
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general pgsql-performance

On Mon, 2008-03-10 at 17:18 +0100, Dimitri Fontaine wrote:

> Le samedi 01 mars 2008, Simon Riggs a écrit :
> > On Tue, 2008-02-26 at 13:08 +0100, Dimitri Fontaine wrote:
> > > I'd like to have some feedback about the new version, in term of bugs
> > > encountered and performance limitations (is pgloader up to what you would
> > > expect a multi-threaded loader to be at?)
> >
> > Maybe post to general as well if you don't get any replies here.
> > New feature is very important for us.
>
> So, here's yet another mail about pgloader new 2.3.0 version, please forgive
> me for being over zealous here if that's how I appear to be to you...
>
> Those links will give you detailed information about the new release.
> http://pgfoundry.org/projects/pgloader
> http://pgfoundry.org/forum/forum.php?forum_id=1283
> http://pgloader.projects.postgresql.org/#_parallel_loading

Sounds good.

Not sure when or why I would want an rrqueue_size larger than
copy_every, and less sounds very strange. Can we get away with it being
the same thing in all cases?

Do you have some basic performance numbers? It would be good to
understand the overhead of the parallelism on a large file with 1, 2 and
4 threads. Would be good to see if synchronous_commit = off helped speed
things up as well.

Presumably -V and -T still work when we go parallel, but just issue one
query?

--
Simon Riggs
2ndQuadrant http://www.2ndQuadrant.com

PostgreSQL UK 2008 Conference: http://www.postgresql.org.uk

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message LARC/J.L.Shipman/jshipman 2008-03-10 17:33:04 searching using indexes 8.3
Previous Message Alvaro Herrera 2008-03-10 16:50:26 Re: gsoc ideas

Browse pgsql-performance by date

  From Date Subject
Next Message Dimitri Fontaine 2008-03-10 17:57:38 Re: [PERFORM] multi-threaded pgloader makes it in version 2.3.0
Previous Message Rainer Pruy 2008-03-10 16:55:43 Re: Very slow (2 tuples/second) sequential scan after bulk insert; speed returns to ~500 tuples/second after commit