Re: Better tracking of free space during SP-GiST index build

From: Tomas Vondra <tomas(dot)vondra(at)2ndquadrant(dot)com>
To: obartunov(at)gmail(dot)com
Cc: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org>, Emre Hasegeli <emre(at)hasegeli(dot)com>
Subject: Re: Better tracking of free space during SP-GiST index build
Date: 2016-09-25 19:28:51
Message-ID: 77ac3599-e445-e0aa-6add-e6acad07afa3@2ndquadrant.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On 09/25/2016 08:33 PM, Oleg Bartunov wrote:
> On Sat, Sep 24, 2016 at 11:32 PM, Tomas Vondra
> <tomas(dot)vondra(at)2ndquadrant(dot)com> wrote:
>> On 09/22/2016 07:37 PM, Tom Lane wrote:
>>>
>>> Tomas Vondra <tomas(dot)vondra(at)2ndquadrant(dot)com> writes:
>>>
>>>> ... I've tried increasing the cache size to 768
>>>> entries, with vast majority of them (~600) allocated to leaf pages.
>>>> Sadly, this seems to only increase the CREATE INDEX duration a bit,
>>>> without making the index significantly smaller (still ~120MB).
>>>
>>>
>>> Yeah, that's in line with my results: not much further gain from a
>>> larger cache. Though if you were testing with the same IRRExplorer
>>> data, it's not surprising that our results would match. Would be
>>> good to try some other cases...
>>>
>>
>> Agreed, but I don't have any other data sets at hand. One possibility would
>> be to generate something randomly (e.g. it's not particularly difficult to
>> generate random IP addresses), but I'd much rather use some real-world data
>> sets.
>
> Tomas, I have one real dataset, which I used for testing spgist
> (https://www.postgresql.org/message-id/CAF4Au4zxd2XOV0A__FU7xoHxSiwJzm1z2xhs-FFaT1DzB9ub3Q@mail.gmail.com)
> Let me know if you need it.
>

Sure, that would be useful.

I think it would be useful to make repository of such data sets, so that
patch authors & reviewers can get a reasonable collection of data sets
if needed, instead of scrambling every time. Opinions?

regards

--
Tomas Vondra http://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Enrique Meneses 2016-09-25 19:36:00 Re: Allowing GIN array_ops to work on anyarray
Previous Message Petr Jelinek 2016-09-25 18:48:41 Re: PATCH: two slab-like memory allocators