Re: Improve lseek scalability v3

From: Benjamin LaHaise <bcrl(at)kvack(dot)org>
To: Andres Freund <andres(at)anarazel(dot)de>
Cc: Matthew Wilcox <matthew(at)wil(dot)cx>, Andi Kleen <andi(at)firstfloor(dot)org>, viro(at)zeniv(dot)linux(dot)org(dot)uk, linux-fsdevel(at)vger(dot)kernel(dot)org, linux-kernel(at)vger(dot)kernel(dot)org, robertmhaas(at)gmail(dot)com, pgsql-hackers(at)postgresql(dot)org
Subject: Re: Improve lseek scalability v3
Date: 2011-09-16 20:08:17
Message-ID: 20110916200817.GD28519@kvack.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Fri, Sep 16, 2011 at 07:27:33PM +0200, Andres Freund wrote:
> many tuples does the table have. Those statistics are only updated every now
> and then though.
> So it uses those old stats to check how many tuples are normally stored on a
> page and then uses that to extrapolate the number of tuples from the current
> nr of pages (which is computed by lseek(SEEK_END) over the 1GB segements of a
> table).
>
> I am not sure how interested you are on the relevant postgres internals?

For such tables, can't Postgres track the size of the file internally? I'm
assuming it's keeping file descriptors open on the tables it manages, in
which case when it writes to a file to extend it, the internally stored size
could be updated. Not making a syscall at all would scale far better than
even a modified lseek() will perform.

Granted, I've not looked at the Postgres code at all.

-ben

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Tom Lane 2011-09-16 20:40:06 Re: force_not_null option support for file_fdw
Previous Message Joshua Berkus 2011-09-16 19:22:53 Re: unite recovery.conf and postgresql.conf