Quick Links

Re: reducing the footprint of ScanKeyword (was Re: Large writable variables)

From:	Joerg Sonnenberger <joerg(at)bec(dot)de>
To:	John Naylor <jcnaylor(at)gmail(dot)com>
Cc:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Andres Freund <andres(at)anarazel(dot)de>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: reducing the footprint of ScanKeyword (was Re: Large writable variables)
Date:	2019-01-03 16:33:40
Message-ID:	20190103163340.GA15803@britannica.bec.de
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Sun, Dec 16, 2018 at 11:50:15AM -0500, John Naylor wrote:
> A few months ago I was looking into faster search algorithms for
> ScanKeywordLookup(), so this is interesting to me. While an optimal
> full replacement would be a lot of work, the above ideas are much less
> invasive and would still have some benefit. Unless anyone intends to
> work on this, I'd like to flesh out the offset-into-giant-string
> approach a bit further:

Hello John,
I was pointed at your patch on IRC and decided to look into adding my
own pieces. What I can provide you is a fast perfect hash function
generator. I've attached a sample hash function based on the current
main keyword list. hash() essentially gives you the number of the only
possible match, a final strcmp/memcmp is still necessary to verify that
it is an actual keyword though. The |0x20 can be dropped if all cases
have pre-lower-cased the input already. This would replace the binary
search in the lookup functions. Returning offsets directly would be easy
as well. That allows writing a single string where each entry is prefixed
with a type mask, the token id, the length of the keyword and the actual
keyword text. Does that sound useful to you?

Joerg

Attachment	Content-Type	Size
hash2.c	text/plain	8.1 KB

In response to

reducing the footprint of ScanKeyword (was Re: Large writable variables) at 2018-12-16 16:50:15 from John Naylor

Responses

Re: reducing the footprint of ScanKeyword (was Re: Large writable variables) at 2019-01-04 20:20:07 from John Naylor

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Andrew Alsup	2019-01-03 17:39:02	Re: SQL/JSON: functions
Previous Message	Andrew Alsup	2019-01-03 16:27:34	Re: Unable to `make install` on MacOS in the latest master (68a13f28be)