Quick Links

Re: Thoughts on statistics for continuously advancing columns

From:	Chris Browne <cbbrowne(at)acm(dot)org>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Thoughts on statistics for continuously advancing columns
Date:	2009-12-30 21:15:05
Message-ID:	87pr5w2mg6.fsf@dba2.int.libertyrms.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

jd(at)commandprompt(dot)com ("Joshua D. Drake") writes:
> On the other hand ANALYZE also:
>
> 1. Uses lots of memory
> 2. Lots of processor
> 3. Can take a long time
>
> We normally don't notice because most sets won't incur a penalty. We got a
> customer who
> has a single table that is over 1TB in size... We notice. Granted that is
> the extreme
> but it would only take a quarter of that size (which is common) to start
> seeing issues.

I find it curious that ANALYZE *would* take a long time to run.

After all, its sampling strategy means that, barring having SET
STATISTICS to some ghastly high number, it shouldn't need to do
materially more work to analyze a 1TB table than is required to analyze
a 1GB table.

With the out-of-the-box (which may have changed without my notice ;-))
default of 10 bars in the histogram, it should search for 30K rows,
which, while not "free," doesn't get enormously more expensive as tables
grow.
--
"cbbrowne","@","gmail.com"
http://linuxfinances.info/info/linuxdistributions.html
Rules of the Evil Overlord #179. "I will not outsource core
functions." <http://www.eviloverlord.com/>

In response to

Re: Thoughts on statistics for continuously advancing columns at 2009-12-30 16:31:26 from Joshua D. Drake

Responses

Re: Thoughts on statistics for continuously advancing columns at 2009-12-30 22:35:02 from Tom Lane

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Robert Haas	2009-12-30 21:37:55	Re: quoting psql varible as identifier
Previous Message	Robert Haas	2009-12-30 21:13:21	Re: pg_read_file() and non-ascii input file