Quick Links

index structure for 114-dimension vector

From:	Andrew Lazarus <andrew(at)pillette(dot)com>
To:	pgsql-performance(at)postgresql(dot)org
Subject:	index structure for 114-dimension vector
Date:	2007-04-20 19:07:29
Message-ID:	1217328326.20070420120729@pillette.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-performance

I have a table with 2.5 million real[] arrays. (They are points in a
time series.) Given a new array X, I'd like to find, say, the 25
closest to X in some sense--for simplification, let's just say in the
usual vector norm. Speed is critical here, and everything I have tried
has been too slow.

I imported the cube contrib package, and I tried creating an index on
a cube of the last 6 elements, which are the most important. Then I
tested the 2.5MM rows for being contained within a tolerance of the
last 6 elements of X, +/- 0.1 in each coordinate, figuring that would
be an indexed search (which I CLUSTERED on). I then ran the sort on
this smaller set. The index was used, but it was still too slow. I
also tried creating new columns with rounded int2 values of the last 6
coordinates and made a multicolumn index.

For each X the search is taking about 4-15 seconds which is above my
target at least one order of magnitude. Absolute numbers are dependent
on my hardware and settings, and some of this can be addressed with
configuration tweaks, etc., but first I think I need to know the
optimum data structure/indexing strategy.

Is anyone on the list experienced with this sort of issue?

Thanks.
Andrew Lazarus andrew(at)pillette(dot)com

In response to

Re: Basic Q on superfluous primary keys at 2007-04-19 00:16:18 from Merlin Moncure

Responses

Re: index structure for 114-dimension vector at 2007-04-20 21:41:24 from Jeff Davis
Re: index structure for 114-dimension vector at 2007-04-23 17:49:31 from C Storm
Re: index structure for 114-dimension vector at 2007-04-26 22:34:36 from Alexander Staubo

Browse pgsql-performance by date

	From	Date	Subject
Next Message	Jeff Davis	2007-04-20 21:41:24	Re: index structure for 114-dimension vector
Previous Message	chrisj	2007-04-20 14:23:58	seeking advise on char vs text or varchar in search table