Quick Links

Re: Frequent Update Project: Design Overview of HOT Updates

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	"Simon Riggs" <simon(at)2ndquadrant(dot)com>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Frequent Update Project: Design Overview of HOT Updates
Date:	2006-11-09 23:28:37
Message-ID:	12825.1163114917@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

"Simon Riggs" <simon(at)2ndquadrant(dot)com> writes:
> As more UPDATEs take place these tuple chains would grow, making
> locating the latest tuple take progressively longer.

This is the part that bothers me --- particularly the random-access
nature of the search. I wonder whether you couldn't do something
involving an initial table fill-factor of less than 50%, and having
the first updated version living on the same heap page as its parent.
Only when the active chain length was more than one (which you
hypothesize is rare) would it actually be necessary to do a random
access into the overflow table.

More generally, do we need an overflow table at all, rather than having
these overflow tuples living in the same file as the root tuples? As
long as there's a bit available to mark a tuple as being this special
not-separately-indexed type, you don't need a special location to know
what it is. This might break down in the presence of seqscans though.

Actually, you omitted to mention the locking aspects of moving tuples
around --- exactly how are you going to make that work without breaking
concurrent scans?

> This allows the length of a typical tuple chain to be extremely short in
> practice. For a single connection issuing a stream of UPDATEs the chain
> length will no more than 1 at any time.

Only if there are no other transactions being held open, which makes
this claim a lot weaker.

> HOT can only work in cases where a tuple does not modify one of the
> columns defined in an index on the table, and when we do not alter the
> row length of the tuple.

Seems like "altering the row length" isn't the issue, it's just "is
there room on the page for the new version". Again, a generous
fillfactor would give you more flexibility.

> [We'll be able to do that more efficiently when
> we have plan invalidation]

Uh, what's that got to do with it?

regards, tom lane

In response to

Frequent Update Project: Design Overview of HOT Updates at 2006-11-09 17:13:16 from Simon Riggs

Responses

Re: Frequent Update Project: Design Overview of HOT Updates at 2006-11-09 23:33:37 from Josh Berkus
Re: Frequent Update Project: Design Overview of HOT at 2006-11-10 07:06:35 from Hannu Krosing
Re: Frequent Update Project: Design Overview of HOT Updates at 2006-11-10 11:32:19 from Zeugswetter Andreas ADI SD
Re: Frequent Update Project: Design Overview of HOTUpdates at 2006-11-10 12:19:50 from Simon Riggs

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Josh Berkus	2006-11-09 23:33:37	Re: Frequent Update Project: Design Overview of HOT Updates
Previous Message	Tom Lane	2006-11-09 23:17:33	Re: Introducing an advanced Frequent Update Optimization