Re: Support Parallel Query Execution in Executor

From: Josh Berkus <josh(at)agliodbs(dot)com>
To: pgsql-hackers(at)postgresql(dot)org
Cc: "Mike Rylander" <mrylander(at)gmail(dot)com>, "Qingqing Zhou" <zhouqq(at)cs(dot)toronto(dot)edu>
Subject: Re: Support Parallel Query Execution in Executor
Date: 2006-04-08 00:52:15
Message-ID: 200604071752.16202.josh@agliodbs.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers pgsql-patches

Qingquing,

> First, I want to second Jonah's enthusiasm. This is very exciting!

Me, three! I didn't think this was ever going to come to Postgres absent
major corporate funding.

> This is really only a gut feeling for me (it can't be otherwise, since
> we can't yet test), but I think parallelizing a single seqscan is
> pretty much guaranteed to do nothing, because seqscans, especially on
> large tables, are IO bound.

Actuall, not true. Our current seqscan performance suffers from
produce-consumer fluctuation. GreenPlum and Sun did a whole bunch of
testing on this.

Basically reading a large table off disk does this:

read some table while not processing
process in cpu while not reading
read some more table while not processing
process some more in cpu while not reading
etc.
resulting in an I/O througput graph that looks like:

* * *
* * * * * *
* * * * * *
* * * *

The really annoying part about this, for me personally, is that the peaks
are significantly faster than comparable commercial DBMSes ... but our
average is far less. So even on a single seq scan, parallel query
execution would make a significant difference in performance, possibly as
much as +75% on seq scans of large tables.

--
--Josh

Josh Berkus
Aglio Database Solutions
San Francisco

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Jonah H. Harris 2006-04-08 01:00:13 Re: Support Parallel Query Execution in Executor
Previous Message Pavel Stehule 2006-04-07 20:10:06 please actualize FAQ, broken urls

Browse pgsql-patches by date

  From Date Subject
Next Message Jonah H. Harris 2006-04-08 01:00:13 Re: Support Parallel Query Execution in Executor
Previous Message Pavel Stehule 2006-04-07 20:10:06 please actualize FAQ, broken urls