Quick Links

Re: Sorting Improvements for 8.4

From:	Jeff Davis <pgsql(at)j-davis(dot)com>
To:	Dann Corbit <DCorbit(at)connx(dot)com>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Sorting Improvements for 8.4
Date:	2007-12-20 01:01:28
Message-ID:	1198112488.10057.42.camel@dogma.ljc.laika.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Wed, 2007-12-19 at 15:19 -0800, Dann Corbit wrote:
> The algorithm that I am suggesting will take exactly one pass to merge
> all of the files.
>

>From tuplesort.c:

"In the current code we determine the number of tapes M on the basis of
workMem: we want workMem/M to be large enough that we read a fair amount
of data each time we preread from a tape, so as to maintain the locality
of access described above. Nonetheless, with large workMem we can have
many tapes."

It seems like you are just choosing M to be equal to the number of
initial runs, whereas the current code takes into account the cost of
having workMem/M too small.

We do want to increase the number of runs that can be merged at once;
that's what dynamic run handling and forecasting are all about. But we
want to avoid unnecessary seeking, also.

Regards,
Jeff Davis

In response to

Re: Sorting Improvements for 8.4 at 2007-12-19 23:19:43 from Dann Corbit

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Tom Lane	2007-12-20 01:02:30	Re: Sorting Improvements for 8.4
Previous Message	Tom Lane	2007-12-20 00:50:29	Re: pgwin32_open returning EINVAL