Quick Links

Re: Handling data in libpq

From:	"Jeroen T(dot) Vermeulen" <jtv(at)xs4all(dot)nl>
To:	am(at)fx(dot)ro
Cc:	pgsql-interfaces(at)postgresql(dot)org
Subject:	Re: Handling data in libpq
Date:	2002-10-19 11:01:36
Message-ID:	20021019110135.GX11228@xs4all.nl
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-interfaces

On Sat, Oct 19, 2002 at 01:23:52PM +0300, am(at)fx(dot)ro wrote:

> I'm looking for a new approach regarding fetching the data.
> Usage of cursors is probably the path to follow. I would be deeply
> grateful to anyone who would share a bit of his/her experience:
>
> - i assume the table has 50000 tuples
> - what is better: to fetch one tuple at one time, or to fetch
> a certain number of tuples (let's say 100).

Obviously there is some query and communications overhead associated with
fetching anything, regardless of size, and the odds are it's going to be
larger than the cost of sending a tuple of actual data across. So if
you're going to pay that overhead anyway, you'll want to get some decent
amount of data for it.

However, that also implies that fetching 500 blocks of 100 tuples each is
always going to be slower than getting 50000 tuples in one go. The two
advantages of fetching smaller blocks are: (1) you can start processing
some data right away, which might be useful for e.g. user interaction
(or you can use asynchronous communication so you can process your last
block of data while fetching the next); and (2) you may be running out of
memory when fetching and processing 50000 tuples at once. Swapping can
be a real drag on performance.

> - if i fetch 100 tuples, PQntuples returns 100? how do i know how
> many rows has the table?

Well, if you assume your table has 50000 rows, why do you need to ask?

Seriously though, you don't get this information. But if an estimate
is good enough, you can "select count(*) from table" before you start
fetching data.

> - i have the impression that keeping track of the tuple's index i
> nside the RecordSet and the index inside the whole table is quite
> tricky, but manageable. How do you guys handle this kind of situations?

It's not so hard. Just an extra nested loop.

Jeroen

In response to

Handling data in libpq at 2002-10-19 10:23:52 from am

Responses

Re: Handling data in libpq at 2002-10-19 15:47:25 from am

Browse pgsql-interfaces by date

	From	Date	Subject
Next Message	John L. Turner	2002-10-19 14:48:04	Re: Handling data in libpq
Previous Message	am	2002-10-19 10:23:52	Handling data in libpq