Quick Links

Re: [HACKERS] Parallel tuplesort (for parallel B-Tree index creation)

From:	Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>
To:	Peter Geoghegan <pg(at)bowt(dot)ie>
Cc:	Robert Haas <robertmhaas(at)gmail(dot)com>, Thomas Munro <thomas(dot)munro(at)enterprisedb(dot)com>, Rushabh Lathia <rushabh(dot)lathia(at)gmail(dot)com>, Heikki Linnakangas <hlinnaka(at)iki(dot)fi>, Pg Hackers <pgsql-hackers(at)postgresql(dot)org>, Corey Huinker <corey(dot)huinker(at)gmail(dot)com>
Subject:	Re: [HACKERS] Parallel tuplesort (for parallel B-Tree index creation)
Date:	2018-01-17 13:47:24
Message-ID:	CAA4eK1KcqrRyVEvn6PBsR=MeY1C8NUO0o3jmgOWhZEUOvRkaig@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Tue, Jan 16, 2018 at 6:24 AM, Peter Geoghegan <pg(at)bowt(dot)ie> wrote:
> On Fri, Jan 12, 2018 at 10:28 AM, Robert Haas <robertmhaas(at)gmail(dot)com> wrote:
>> More comments:
>
> Attached patch has all open issues worked through, including those
> that I respond to or comment on below, as well as the other feedback
> from your previous e-mails. Note also that I fixed the issue that Amit
> raised,
>

I could still reproduce it. I think the way you have fixed it has a
race condition. In _bt_parallel_scan_and_sort(), the value of
brokenhotchain is set after you signal the leader that the worker is
done (by incrementing workersFinished). Now, the leader is free to
decide based on the current shared state which can give the wrong
value. Similarly, I think the value of havedead and reltuples can
also be wrong.

You neither seem to have fixed nor responded to the second problem
mentioned in my email upthread [1]. To reiterate, the problem is that
we can't assume that the workers we have launched will always start
and finish. It is possible that postmaster fails to start the worker
due to fork failure. In such conditions, tuplesort_leader_wait will
hang indefinitely because it will wait for the workersFinished count
to become equal to launched workers (+1, if leader participates) which
will never happen. Am I missing something due to which this won't be
a problem?

Now, I think one argument is that such a problem can happen in a
parallel query, so it is not the responsibility of this patch to solve
it. However, we already have a patch (there are some review comments
that needs to be addressed in the proposed patch) to solve it and this
patch is adding a new path in the code which has similar symptoms
which can't be fixed with the already proposed patch.

[1] - https://www.postgresql.org/message-id/CAA4eK1%2BizMyxzFD6k81Deyar35YJ5qdpbRTUp9cQvo%2BniQom7Q%40mail.gmail.com

--
With Regards,
Amit Kapila.
EnterpriseDB: http://www.enterprisedb.com

In response to

Re: [HACKERS] Parallel tuplesort (for parallel B-Tree index creation) at 2018-01-16 00:54:40 from Peter Geoghegan

Responses

Re: [HACKERS] Parallel tuplesort (for parallel B-Tree index creation) at 2018-01-17 17:27:10 from Peter Geoghegan

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Robert Haas	2018-01-17 13:49:26	Re: [HACKERS] Proposal: Local indexes for partitioned table
Previous Message	Fabien COELHO	2018-01-17 13:10:10	Re: Setting BLCKSZ 4kB