Quick Links

Re: GIN fast insert

From:	Robert Haas <robertmhaas(at)gmail(dot)com>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: GIN fast insert
Date:	2009-02-11 03:59:27
Message-ID:	603c8f070902101959i1c021e4agd69e7657cd30dbff@mail.gmail.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Tue, Feb 10, 2009 at 10:38 PM, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> wrote:
>> I think this code needs to be somehow rewritten to make things degrade
>> gracefully when the pending list is long - I'm not sure what the best
>> way to do that is. Inventing a new data structure to store TIDs that
>> is never lossy seems like it might work, but you'd have to think about
>> what to do if it got too big.
>
> What would be wrong with letting it degrade to lossy? I suppose the
> reason it's trying to avoid that is to avoid having to recheck all the
> rows on that page when it comes time to do the index insertion; but
> surely having to do that is better than having arbitrary, unpredictable
> failure conditions.

No, I don't think that's it. See here, beginning with "the problem
with lossy tbm has two aspects":

http://archives.postgresql.org/message-id/4974B002.3040202@sigaev.ru

> It strikes me that part of the issue here is that the behavior of this
> code is much better adapted to the bitmap-scan API than the traditional
> indexscan API. Since GIN doesn't support ordered scan anyway, I wonder
> whether it wouldn't be more sensible to simply allow it to not offer
> the traditional API. It should be easy to make the planner ignore plain
> indexscan plans for an AM that didn't support them.

If that doesn't lead to a performance degradation, I think it would be
a good idea. It certainly seems like it would allow this patch to be
a LOT simpler, cleaner, and more robust.

...Robert

In response to

Re: GIN fast insert at 2009-02-11 03:38:47 from Tom Lane

Responses

Re: GIN fast insert at 2009-02-11 04:18:24 from Tom Lane
Re: GIN fast insert at 2009-02-11 15:02:07 from Teodor Sigaev

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Lawrence, Ramon	2009-02-11 04:04:16	Re: The testing of multi-batch hash joins with skewed data sets patch
Previous Message	Lawrence, Ramon	2009-02-11 03:51:13	Re: The testing of multi-batch hash joins with skewed data sets patch