Quick Links

Re: caching query results

From:	Karel Zak <zakkr(at)zf(dot)jcu(dot)cz>
To:	Jan Wieck <wieck(at)debis(dot)com>
Cc:	pghackers <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: caching query results
Date:	2000-04-05 11:33:56
Message-ID:	Pine.LNX.3.96.1000405122918.31506A-100000@ara.zf.jcu.cz
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Tue, 4 Apr 2000, Jan Wieck wrote:

The nice picture. Thanks, I undestend you now. A question is where create
this key - create a specific function that look at to querytree and return
key or calculate it during statement transformation (analyze.c ..etc.).
Or is any other idea?

> > > These keys could be managed in a shared LRU table, and if the
> >
> > My current code is based on HASH table with keys and query&plan is
> > saved in special for a plan created MemoryContext (it is good for
> > a example SPI_freeplan()).

I thought about it, and what for SPI and PREPARE/EXECUTE query cache
use shared memory too? I'm vote for one query cache in postgresql. IMHO
is not good create a specific cache for SPI_saveplan()+PREPARE and second
for your suggested query cache.

If plans saved via SPI (under defined key - 'by_key' interface) will shared
under all backends a lot of features will faster (FK, PLangs ..etc) and
shared plans cached via PREPARE will persistent across more connetions.
(Some web developers will happy :-)

But I not sure with this...

> IIRC our hash table code insists on using global, per backend
> memory. I thought about managing the entire querycache with
> a new type of memory context, using different routines for
> palloc()/pfree(), working in a shared memory area only and
> eventually freeing longest unused plans until allocation
> fits. Let's see if using hash tables here would be easy or
> not.

I look at the current shmem routines - create specific space and hash
table for a query cache is not a problem, hash routines are prepared
for usage under shmem. The current lock management code is very simular.
With hash is not a problem here.

A problem is how store (copy) query & plan tree to this (shared) memory.
The current copyObject() is based on palloc()/pfree() and as you said
we haven't memory management routines (like palloc()) that working in
shmem.

Would be nice have MemoryContext routines for shmem - example
CreateGlobalMemory_in_shmem() and palloc() that knows work with this
specific context. It is a dream?

A solution is convert query & plan tree to string (like pg_rewrite (views))
and save to cache this string, (and what a speed during (vice versa) parsing?).
IMHO for this solution we not need a hash table, we can use a standard system
table and a syscache.

But more nice is variant with non-string and full plan-tree-structs in a
shmem.

> > Good. It is solution for 'known-query' and allow it skip any steps in the
> > query path. But we still not have any idea for cached plans validity. What
> > if user changes oid for any operator, drop column (etc)?
>
> That's why the key is only good to find "candidates". The
> cacheing has to look very close to the nodes in the tree and
> maybe compare down to pg_attribute oid's etc. to decide if
> it's really the same query or not.

OK.

Karel

In response to

Re: caching query results at 2000-04-04 14:27:37 from Jan Wieck

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Jan Wieck	2000-04-05 11:50:43	Re: 7.0 FK trigger question
Previous Message	Hiroshi Inoue	2000-04-05 08:05:19	RE: postgres crash on CURSORS