Quick Links

Re: COUNT(*) and index-only scans

From:	Robert Haas <robertmhaas(at)gmail(dot)com>
To:	Kevin Grittner <Kevin(dot)Grittner(at)wicourts(dot)gov>
Cc:	Bruce Momjian <bruce(at)momjian(dot)us>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: COUNT(*) and index-only scans
Date:	2011-10-11 10:37:43
Message-ID:	CA+TgmoaOT4xenvZQgBeUAuefn12QV_od5WPYQ+CVJPHo9MSYsw@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Mon, Oct 10, 2011 at 3:15 PM, Kevin Grittner
<Kevin(dot)Grittner(at)wicourts(dot)gov> wrote:
> Robert Haas <robertmhaas(at)gmail(dot)com> wrote:
>
>> Right now, our costing model for index-only scans is pretty dumb.
>> It assumes that using an index-only scan will avoid 10% of the
>> heap fetches. That could easily be low, and on an insert-only
>> table or one where only the recently-updated rows are routinely
>> accessed, it could also be high.
>
> As a reality check, I just ran this query on a table in a statewide
> copy of our data:
>
> select count(*),
> sum(case when xmin = '2'::xid then 0 else 1 end) as read_heap
> from "CaseHist";
>
> and got:
>
> count | read_heap
> -----------+-----------
> 205765311 | 3934924
>
> So on our real-world database, it would skip something on the order
> of 98% of the heap reads, right?

Yeah, if it's scanning the whole table.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

In response to

Re: COUNT(*) and index-only scans at 2011-10-10 19:15:07 from Kevin Grittner

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Robert Haas	2011-10-11 11:00:49	Re: table/index options \| was: COUNT(*) and index-only scans
Previous Message	Robert Haas	2011-10-11 10:36:48	Re: index-only scans