Quick Links

Re: query optimization with UDFs

From:	Neil Conway <neilc(at)samurai(dot)com>
To:	jungmin shin <jungmin(dot)shin(at)gmail(dot)com>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: query optimization with UDFs
Date:	2006-10-10 04:30:30
Message-ID:	1160454630.5716.56.camel@localhost.localdomain
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general pgsql-hackers

On Mon, 2006-10-09 at 22:49 -0400, jungmin shin wrote:
> Does anybody know what the Postgres does for optimizing the queries
> with UDFs?

The optimizer considers function volatility to avoid reevaluating UDFs
needlessly, and to use index scans on predicates involving a function.
Also, functions defined in the SQL language will be inlined into the
function call site, when possible. That's all that comes to mind at the
moment...

Notably, the optimizer doesn't have support for realistic costing of
UDFs: it can't tell how expensive evaluating a UDF is going to be, nor
the number and distribution of the rows that will be produced by a
set-returning function. The Berkeley release of Postgres supported
"expensive function optimization" (a.k.a xfunc), which was an
implementation of Hellerstein and Stonebraker's work on "Predicate
Migration"[1]. That code is no longer in the current Postgres source.

BTW, I think it would make sense to implement a limited subset of the
xfunc ideas: add options to CREATE FUNCTION to allow cost information to
be specified, and then take advantage of this information instead of
using the existing constant kludges. This would be a tangible
improvement, and would have minimal impact on the planner. A further
improvement would be predicate migration: the xfunc code modified the
optimizer to consider pulling predicates up above joins in the query
plan. The traditional assumption is that all predicates are cheap, so
they are always pushed down in the hopes that this will reduce the size
of intermediate result sets. Naturally, this is not always wise with
expensive UDFs. After the basics have been done, it might be interesting
to implement this, provided it can be done without increasing the
planner's search space too much.

-Neil

[1] http://db.cs.berkeley.edu/jmh/miscpapers/sigmod93.pdf

In response to

query optimization with UDFs at 2006-10-10 02:49:40 from jungmin shin

Responses

Re: query optimization with UDFs at 2006-10-10 23:06:57 from Tom Lane

Browse pgsql-general by date

	From	Date	Subject
Next Message	Peter Bauer	2006-10-10 05:49:46	Re: Major Performance decrease after some hours
Previous Message	Chris	2006-10-10 04:18:57	Re: postgres query log analysis?

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Merlin Moncure	2006-10-10 05:48:20	Re: array_accum aggregate
Previous Message	jungmin shin	2006-10-10 02:49:40	query optimization with UDFs