Quick Links

Re: Parallel CREATE INDEX for GIN indexes

From:	Tomas Vondra <tomas(dot)vondra(at)enterprisedb(dot)com>
To:	Andy Fan <zhihuifan1213(at)163(dot)com>
Cc:	pgsql-hackers(at)lists(dot)postgresql(dot)org
Subject:	Re: Parallel CREATE INDEX for GIN indexes
Date:	2024-05-13 09:19:08
Message-ID:	f902c19e-efcf-45b3-9d1e-3669658ec6ff@enterprisedb.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 5/13/24 10:19, Andy Fan wrote:
>
> Tomas Vondra <tomas(dot)vondra(at)enterprisedb(dot)com> writes:
>
>> ...
>>
>> I don't understand the question. The blocks are distributed to workers
>> by the parallel table scan, and it certainly does not do that block by
>> block. But even it it did, that's not a problem for this code.
>
> OK, I get ParallelBlockTableScanWorkerData.phsw_chunk_size is designed
> for this.
>
>> The problem is that if the scan wraps around, then one of the TID lists
>> for a given worker will have the min TID and max TID, so it will overlap
>> with every other TID list for the same key in that worker. And when the
>> worker does the merging, this list will force a "full" merge sort for
>> all TID lists (for that key), which is very expensive.
>
> OK.
>
> Thanks for all the answers, they are pretty instructive!
>

Thanks for the questions, it forces me to articulate the arguments more
clearly. I guess it'd be good to put some of this into a README or at
least a comment at the beginning of gininsert.c or somewhere close.

regards

--
Tomas Vondra
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

In response to

Re: Parallel CREATE INDEX for GIN indexes at 2024-05-13 08:19:43 from Andy Fan

Responses

Re: Parallel CREATE INDEX for GIN indexes at 2024-05-28 09:29:48 from Andy Fan

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Daniel Gustafsson	2024-05-13 09:22:08	Re: explain format json, unit for serialize and memory are different.
Previous Message	jian he	2024-05-13 09:16:24	explain format json, unit for serialize and memory are different.