Re: Hash function for numeric (WIP)

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Neil Conway <neilc(at)samurai(dot)com>
Cc:	pgsql-patches <pgsql-patches(at)postgresql(dot)org>
Subject:	Re: Hash function for numeric (WIP)
Date:	2007-04-30 04:04:10
Message-ID:	13030.1177905850@sss.pgh.pa.us
Lists:	pgsql-patches

Neil Conway <neilc(at)samurai(dot)com> writes:
> On Fri, 2007-04-27 at 10:02 -0400, Tom Lane wrote:
>> Perhaps a sufficiently robust way would be to form the hash as the
>> XOR of each supplied digit, circular-shifted by say 3 times the
>> digit's weight.
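
A minimal sketch of the quoted XOR-and-rotate idea, for illustration only. It assumes a numeric value is held as a list of base-10000 digits, most significant first, plus the weight (power of 10000) of the first digit; the names rotl32 and xor_rotate_hash are hypothetical, and the sign is ignored. Because a zero digit contributes nothing to the XOR, leading or trailing zero digits do not change the result.

```python
MASK32 = 0xFFFFFFFF


def rotl32(x, k):
    """Rotate the 32-bit value x left by k bits (k may be negative or > 31)."""
    k %= 32
    return ((x << k) | (x >> (32 - k))) & MASK32


def xor_rotate_hash(digits, weight):
    """XOR each digit, rotated by 3 times that digit's own weight.

    digits: base-10000 digits, most significant first (assumed layout).
    weight: weight of digits[0]; digit i then has weight (weight - i).
    """
    h = 0
    for i, d in enumerate(digits):
        h ^= rotl32(d, 3 * (weight - i))
    return h


# 1.5 stored with and without padding zero digits hashes identically.
plain = xor_rotate_hash([1, 5000], 0)
padded = xor_rotate_hash([0, 1, 5000, 0], 1)
print(plain == padded)
```

The rotation depends on each digit's absolute weight rather than its position in the array, which is what makes the result independent of padding.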

> The only objection I have to this is that it means we need to have
> another hash function in the backend. The Jenkins hash we use in
> hash_any() has been studied and we can have at least some confidence in
> its collision-resistance, etc.

I'm still not very comfortable with that. You're proposing to add a
pretty obvious failure mechanism --- any numeric-returning function
that failed to "normalize" its output would now create a subtle,
hard-to-find bug. Even if you can promise that all the functions in
numeric.c get it right, what of user-written add-ons? And the only
return for taking this risk is speculation that the performance of the
hash function might be better.
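
To make the failure mechanism concrete, here is a hedged sketch. It assumes the approach under discussion feeds the stored digit array directly to a general-purpose byte hash; zlib.crc32 is only a stand-in for hash_any(), and the (weight, digits) layout and the helper names are hypothetical. Two representations of the same value that differ only by a trailing zero digit then hash differently unless something strips the zeros first.

```python
import struct
import zlib


def raw_digit_hash(weight, digits):
    """Hash exactly the bytes that are stored (crc32 as a stand-in hash)."""
    payload = struct.pack(f"<h{len(digits)}H", weight, *digits)
    return zlib.crc32(payload)


def strip_zero_digits(weight, digits):
    """Drop leading and trailing zero digits, adjusting the weight."""
    digits = list(digits)
    while digits and digits[0] == 0:
        digits.pop(0)
        weight -= 1
    while digits and digits[-1] == 0:
        digits.pop()
    if not digits:
        weight = 0  # canonical form for zero (an assumption of this sketch)
    return weight, digits


normalized = (0, [1, 5000])        # 1.5 in base 10000
unnormalized = (0, [1, 5000, 0])   # same value, one trailing zero digit

# Equal values, different stored bytes, so the raw hashes disagree.
print(raw_digit_hash(*normalized) == raw_digit_hash(*unnormalized))

# Stripping zeros before hashing restores agreement.
print(raw_digit_hash(*strip_zero_digits(*unnormalized))
      == raw_digit_hash(*normalized))
```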

I think if you want to go this way, you should at least provide some
evidence that we get a hashing performance benefit in exchange for
adding a new restriction on numeric-value validity. Perhaps a suitable
test would be to compare the number of hash collisions in a large set of
randomly-chosen-but-distinct numeric values.
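
One way such a comparison could be set up, as a rough sketch rather than a real benchmark. The value generator, the bucket count, and both hash functions are assumptions made for illustration: crc32_hash stands in for a general-purpose byte hash and xor_rotate_hash for the XOR-and-rotate idea. Every generated digit is nonzero, so distinct (weight, digits) pairs are distinct values.

```python
import random
import struct
import zlib

MASK32 = 0xFFFFFFFF


def distinct_random_values(n, seed=0):
    """Return n distinct (weight, digits) pairs with only nonzero digits."""
    rng = random.Random(seed)
    values = set()
    while len(values) < n:
        weight = rng.randint(-2, 2)
        ndigits = rng.randint(1, 4)
        digits = tuple(rng.randrange(1, 10000) for _ in range(ndigits))
        values.add((weight, digits))
    return sorted(values)


def count_collisions(hash_fn, values, nbuckets):
    """Count values that land in a bucket some earlier value already used."""
    seen = set()
    collisions = 0
    for v in values:
        bucket = hash_fn(v) % nbuckets
        if bucket in seen:
            collisions += 1
        else:
            seen.add(bucket)
    return collisions


def crc32_hash(value):
    """Stand-in for a general-purpose byte hash over the stored digits."""
    weight, digits = value
    return zlib.crc32(struct.pack(f"<h{len(digits)}H", weight, *digits))


def xor_rotate_hash(value):
    """XOR of each digit rotated left by 3 times its weight (32-bit)."""
    weight, digits = value
    h = 0
    for i, d in enumerate(digits):
        k = (3 * (weight - i)) % 32
        h ^= ((d << k) | (d >> (32 - k))) & MASK32
    return h


values = distinct_random_values(100_000)
for name, fn in [("crc32", crc32_hash), ("xor-rotate", xor_rotate_hash)]:
    print(name, count_collisions(fn, values, nbuckets=1 << 20))
```

The counts depend on the value distribution chosen here, so they say little about real workloads; the point is only the shape of the measurement.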

regards, tom lane

In response to

Re: Hash function for numeric (WIP) at 2007-04-28 22:47:10 from Neil Conway

Responses

Re: Hash function for numeric (WIP) at 2007-05-04 03:03:59 from Neil Conway
