Re: Parallel query execution

From:	Stephen Frost <sfrost(at)snowman(dot)net>
To:	Claudio Freire <klaussfreire(at)gmail(dot)com>
Cc:	Bruce Momjian <bruce(at)momjian(dot)us>, Gavin Flower <GavinFlower(at)archidevsys(dot)co(dot)nz>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Parallel query execution
Date:	2013-01-16 13:33:54
Message-ID:	20130116133354.GG16126@tamriel.snowman.net
Lists:	pgsql-hackers

* Claudio Freire (klaussfreire(at)gmail(dot)com) wrote:
> Well, there's the fault in your logic. It won't be as linear.

I really don't see how this has become so difficult to communicate.

It doesn't have to be linear.

We're currently doing massive amounts of parallel processing by hand
using partitioning, tablespaces, and client-side logic to split up the
jobs. It's certainly *much* faster than doing it in a single thread.
It's also faster with 10 processes going than 5 (we've checked). With
10 going, we've hit the FC fabric limit (and these are spinning disks in
the SAN, not SSDs). I'm also sure it'd be much slower if all 10
processes were trying to read data through a single process that's
reading from the I/O system. We've got some processes which essentially
end up doing that and we don't come anywhere near the total FC fabric
bandwidth when just scanning through the system because, at that point,
you do hit the limits of how fast the individual drive sets can provide
data.
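
As a rough illustration of why the speedup need not be linear to be worthwhile, here is a toy model in Python. It is only a sketch: the function name and every number in it are hypothetical, not measurements from the system described above. It assumes each process streams at a fixed rate set by its own drive set until the shared fabric saturates.

```python
def aggregate_throughput(workers, per_worker_mb_s, fabric_cap_mb_s):
    """Toy model: total read rate for `workers` independent readers.

    Each reader is limited by its own drive set (per_worker_mb_s);
    all readers together are limited by the shared fabric (fabric_cap_mb_s).
    """
    if workers < 0:
        raise ValueError("workers must be non-negative")
    return min(workers * per_worker_mb_s, fabric_cap_mb_s)


# Hypothetical figures, chosen only to show the shape of the curve.
PER_WORKER_MB_S = 100.0
FABRIC_CAP_MB_S = 800.0

for n in (1, 5, 10, 20):
    rate = aggregate_throughput(n, PER_WORKER_MB_S, FABRIC_CAP_MB_S)
    print(f"{n:2d} workers: {rate:.0f} MB/s")
```

Under these assumed numbers, going from 5 to 10 readers still helps even though the gain is less than 2x, and adding readers beyond the fabric limit gains nothing.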

To be clear, I'm not suggesting that we would parallelize a SeqScan node
and have the nodes above it be single-threaded. As I said upthread, we
want to parallelize both reading and processing the data coming in. Perhaps
at some level that works out to not change how we actually *do* seqscans
at all, and instead something higher in the plan tree just creates
several of them on independent threads, but it's still going to end up
being parallel I/O in the end.
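
The shape being described, where each worker both scans and processes its own slice and only small partial results are combined at the top, can be sketched in Python as follows. This is an illustration of the structure only, not PostgreSQL code: the in-memory "partitions", the function names, and the filter are all assumptions made for the example, and Python threads will not show a real CPU speedup here.

```python
from concurrent.futures import ThreadPoolExecutor


def scan_and_process(partition):
    """Hypothetical worker: scan one partition and aggregate it locally.

    The filter (even values) and the aggregate (sum) stand in for whatever
    the plan nodes above the scan would do.
    """
    return sum(row for row in partition if row % 2 == 0)


def parallel_query(partitions, max_workers=4):
    """Run one scan-and-process worker per partition, then combine."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        partials = list(pool.map(scan_and_process, partitions))
    # Only the per-partition partial results flow through this single point.
    return sum(partials)


# Stand-in data: three independent "partitions" of integer rows.
partitions = [range(0, 1000), range(1000, 2000), range(2000, 3000)]
total = parallel_query(partitions)
print(total)
```

The point of the design is that no single process sits between the I/O system and the workers; the only serial step is combining the partial aggregates.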

I'm done with this thread for now; as was brought up, we need to focus on
getting 9.3 out the door.

Thanks,

Stephen

In response to

Re: Parallel query execution at 2013-01-16 04:47:21 from Claudio Freire

Responses

Re: Parallel query execution at 2013-01-16 15:23:14 from Claudio Freire
