Quick Links

Re: Indexes and statistics

From:	"David Witham" <davidw(at)unidial(dot)com(dot)au>
To:	"Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	<pgsql-sql(at)postgresql(dot)org>
Subject:	Re: Indexes and statistics
Date:	2004-02-18 06:23:27
Message-ID:	CFA248776934FD43847E740E43C346D199DC18@ozimelb03.ozicom.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-sql

Tom,

I'm running PostgreSQL 7.3.2 on Red Hat Linux 7.3 with 512Mb RAM.

The table definition is:

Does this make it a "wide" table?

The data arrives ordered by service_num, day, time. This customer has one primary service_num that most of the calls are made from. Therefore each day a clump of CDRs will be loaded for that customer, interspersed with CDRs from all the other customers. Therefore the distribution of records for a service_num is clumpy but evenly distributed throughout the table. For a customer with a single primary number, this result applies to the customer as a whole. For a customer with many service_num's the result is a little more doubtful depending on whether their service_num's arrive sequentially or not. This would not necessarily be the case.

I hope this makes sense. Does it help any?

Thanks,
David

-----Original Message-----
From: Tom Lane [mailto:tgl(at)sss(dot)pgh(dot)pa(dot)us]
Sent: Wednesday, 18 February 2004 16:10
To: David Witham
Cc: pgsql-sql(at)postgresql(dot)org
Subject: Re: [SQL] Indexes and statistics

"David Witham" <davidw(at)unidial(dot)com(dot)au> writes:
> One of the customers is quite large (8.3% of the records):

Hmm. Unless your rows are quite wide, a random sampling of 8.3% of the
table would be expected to visit every page of the table, probably
several times. So the planner's cost estimates do not seem out of line
to me; an indexscan *should* be slow. The first question to ask is why
the deviation from reality. Are the rows for that customer ID likely to
be physically concentrated into a limited number of physical pages?
Do you have so much RAM that the whole table got swapped in, eliminating
the extra I/O that the planner is expecting?

regards, tom lane

Responses

Re: Indexes and statistics at 2004-02-18 15:21:31 from Tom Lane

Browse pgsql-sql by date

	From	Date	Subject
Next Message	Dana Hudes	2004-02-18 06:44:52	Re: bytea or blobs?
Previous Message	sad	2004-02-18 06:14:29	Re: bytea or blobs?