Quick Links

Re: Making all nbtree entries unique by having heap TIDs participate in comparisons

From:	Claudio Freire <klaussfreire(at)gmail(dot)com>
To:	Peter Geoghegan <pg(at)bowt(dot)ie>
Cc:	Robert Haas <robertmhaas(at)gmail(dot)com>, PostgreSQL-Dev <pgsql-hackers(at)postgresql(dot)org>, Alexander Korotkov <a(dot)korotkov(at)postgrespro(dot)ru>, Thomas Munro <thomas(dot)munro(at)enterprisedb(dot)com>, Anastasia Lubennikova <a(dot)lubennikova(at)postgrespro(dot)ru>
Subject:	Re: Making all nbtree entries unique by having heap TIDs participate in comparisons
Date:	2018-06-18 14:57:12
Message-ID:	CAGTBQpZozrc6oqxCypHvdHVmb-OQV3u-mK9cMU8-U4WuhHwAhg@mail.gmail.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Fri, Jun 15, 2018 at 8:47 PM Peter Geoghegan <pg(at)bowt(dot)ie> wrote:

> > I think it would be helpful if you could talk more about these
> > regressions (and the wins).
>
> I think that the performance regressions are due to the fact that when
> you have a huge number of duplicates today, it's useful to be able to
> claim space to fit further duplicates from almost any of the multiple
> leaf pages that contain or have contained duplicates. I'd hoped that
> the increased temporal locality that the patch gets would more than
> make up for that. As far as I can tell, the problem is that temporal
> locality doesn't help enough. I saw that performance was somewhat
> improved with extreme Zipf distribution contention, but it went the
> other way with less extreme contention. The details are not that fresh
> in my mind, since I shelved this patch for a while following limited
> performance testing.
>
> The code could certainly use more performance testing, and more
> general polishing. I'm not strongly motivated to do that right now,
> because I don't quite see a clear path to making this patch useful.
> But, as I said, I have an open mind about what the next step should
> be.

Way back when I was dabbling in this kind of endeavor, my main idea to
counteract that, and possibly improve performance overall, was a
microvacuum kind of thing that would do some on-demand cleanup to
remove duplicates or make room before page splits. Since nbtree
uniqueification enables efficient retail deletions, that could end up
as a net win.

I never got around to implementing it though, and it does get tricky
if you don't want to allow unbounded latency spikes.

In response to

Re: Making all nbtree entries unique by having heap TIDs participate in comparisons at 2018-06-15 23:46:44 from Peter Geoghegan

Responses

Re: Making all nbtree entries unique by having heap TIDs participate in comparisons at 2018-06-18 17:03:20 from Peter Geoghegan

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Joe Conway	2018-06-18 15:06:20	Re: [Proposal] Table-level Transparent Data Encryption (TDE) and Key Management Service (KMS)
Previous Message	Tom Lane	2018-06-18 14:52:14	Re: [Proposal] Table-level Transparent Data Encryption (TDE) and Key Management Service (KMS)