Quick Links

Re: WIP: Avoid creation of the free space map for small tables

From:	John Naylor <jcnaylor(at)gmail(dot)com>
To:	Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>
Cc:	pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: WIP: Avoid creation of the free space map for small tables
Date:	2018-10-22 06:44:27
Message-ID:	CAJVSVGWxNu7gdxU6-QyUv=7ooSF5ujXVFu+gstKV62MbbdCHuA@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 10/16/18, Amit Kapila <amit(dot)kapila16(at)gmail(dot)com> wrote:
> I think we can avoid using prevBlockNumber and try_every_block, if we
> maintain a small cache which tells whether the particular block is
> tried or not. What I am envisioning is that while finding the block
> with free space, if we came to know that the relation in question is
> small enough that it doesn't have FSM, we can perform a local_search.
> In local_seach, we can enquire the cache for any block that we can try
> and if we find any block, we can try inserting in that block,
> otherwise, we need to extend the relation. One simple way to imagine
> such a cache would be an array of structure and structure has blkno
> and status fields. After we get the usable block, we need to clear
> the cache, if exists.

Here is the design I've implemented in the attached v6. There is more
code than v5, but there's a cleaner separation between freespace.c and
hio.c, as you preferred. I also think it's more robust. I've expended
some effort to avoid doing unnecessary system calls to get the number
of blocks.
--

For the local, in-memory map, maintain a static array of status
markers, of fixed-length HEAP_FSM_CREATION_THRESHOLD, indexed by block
number. This is populated every time we call GetPageWithFreeSpace() on
small tables with no FSM. The statuses are

'zero' (beyond the relation)
'available to try'
'tried already'

Example for a 4-page heap:

01234567
AAAA0000

If we try block 3 and there is no space, we set it to 'tried' and next
time through the loop we'll try 2, etc:

01234567
AAAT0000

If we try all available blocks, we will extend the relation. As in the
master branch, first we call GetPageWithFreeSpace() again to see if
another backend extended the relation to 5 blocks while we were
waiting for the lock. If we find a new block, we will mark the new
block available and leave the rest alone:

01234567
TTTTA000

On the odd chance we still can't insert into the new block, we'll skip
checking any others and we'll redo the logic to extend the relation.

If we're about to successfully return a buffer, whether from an
existing block, or by extension, we clear the local map.

Once this is in shape, I'll do some performance testing.

-John Naylor

Attachment	Content-Type	Size
v6-0001-Avoid-creation-of-the-free-space-map-for-small-ta.patch	text/x-patch	26.6 KB

In response to

Re: WIP: Avoid creation of the free space map for small tables at 2018-10-16 03:48:49 from Amit Kapila

Responses

Re: WIP: Avoid creation of the free space map for small tables at 2018-10-23 13:42:02 from John Naylor
Re: WIP: Avoid creation of the free space map for small tables at 2018-11-16 03:00:00 from Amit Kapila

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Michael Paquier	2018-10-22 07:00:21	Buildfarm failures for hash indexes: buffer leaks
Previous Message	Amit Langote	2018-10-22 06:35:36	Re: Side effect of CVE-2017-7484 fix?