Re: Gsoc2012 idea, tablesample

From: Qi Huang <huangqiyx(at)hotmail(dot)com>
To: <heikki(dot)linnakangas(at)enterprisedb(dot)com>, <josh(at)agliodbs(dot)com>
Cc: <pgsql-hackers(at)postgresql(dot)org>, <andres(at)anarazel(dot)de>, <alvherre(at)commandprompt(dot)com>, <neil(dot)conway(at)gmail(dot)com>, <daniel(at)heroku(dot)com>, <cbbrowne(at)gmail(dot)com>, <kevin(dot)grittner(at)wicourts(dot)gov>
Subject: Re: Gsoc2012 idea, tablesample
Date: 2012-04-17 12:14:20
Message-ID: BAY159-W366C8A7525F7CF0A88533FA33F0@phx.gbl
Lists: pgsql-hackers


Also, I see that editing on the GSoC site has been closed. Should I just submit it to this mailing list as an attachment?

Best Regards and Thanks
Huang Qi Victor
Computer Science of National University of Singapore

> Date: Tue, 17 Apr 2012 09:16:29 +0300
> From: heikki(dot)linnakangas(at)enterprisedb(dot)com
> To: josh(at)agliodbs(dot)com
> CC: huangqiyx(at)hotmail(dot)com; pgsql-hackers(at)postgresql(dot)org; andres(at)anarazel(dot)de; alvherre(at)commandprompt(dot)com; neil(dot)conway(at)gmail(dot)com; daniel(at)heroku(dot)com; cbbrowne(at)gmail(dot)com; kevin(dot)grittner(at)wicourts(dot)gov
> Subject: [HACKERS] Gsoc2012 idea, tablesample
>
> On 24.03.2012 22:12, Joshua Berkus wrote:
> > Qi,
> >
> > Yeah, I can see that. That's a sign that you had a good idea for a project, actually: your idea is interesting enough that people want to debate it. Make a proposal on Monday and our potential mentors will help you refine the idea.
>
> Yep. The discussion withered, so let me try to summarize:
>
> 1. We probably don't want the SQL syntax to be added to the grammar.
> This should be written as an extension, using custom functions as the
> API, instead of extra SQL syntax.
>
> 2. It's not very useful if it's just a dummy replacement for "WHERE
> random() < ?". It has to be more advanced than that. Quality of the
> sample is important, as is performance. There was also an interesting
> idea of implementing monetary unit sampling.
>
> I think this would be a useful project if those two points are taken
> care of.
>
> Another idea that Robert Haas suggested was to add support for doing a TID
> scan for a query like "WHERE ctid < '(501,1)'". That's not enough work
> for a GSoC project on its own, but could certainly be a part of it.
>
> --
> Heikki Linnakangas
> EnterpriseDB http://www.enterprisedb.com
>
> --
> Sent via pgsql-hackers mailing list (pgsql-hackers(at)postgresql(dot)org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-hackers
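
To make point 2 above concrete, here is a rough Python sketch (not PostgreSQL code, and not taken from the discussion) contrasting per-row Bernoulli sampling, which is roughly what "WHERE random() < ?" amounts to, with a simple systematic form of monetary unit sampling. The function names, the in-memory lists, and the choice of the systematic variant are all assumptions made for illustration only.

```python
import random
from bisect import bisect_right
from itertools import accumulate


def bernoulli_sample(rows, fraction, rng):
    """Keep each row independently with probability `fraction`.

    This mirrors the effect of a "WHERE random() < fraction" filter:
    every row is still visited, and the sample size varies from run to run.
    """
    return [row for row in rows if rng.random() < fraction]


def monetary_unit_sample(amounts, n, rng):
    """Systematic monetary unit sampling (illustrative variant).

    Each item is treated as a run of `amount` monetary units. Starting from
    a random offset, every `interval`-th unit is picked, and the item that
    contains the picked unit is selected. Larger items are therefore more
    likely to be chosen. Returns the indexes of the selected items.
    """
    cumulative = list(accumulate(amounts))
    total = cumulative[-1]
    interval = total / n
    start = rng.random() * interval

    picked = []
    for k in range(n):
        unit = start + k * interval
        # Item i covers the units in [cumulative[i-1], cumulative[i]).
        idx = min(bisect_right(cumulative, unit), len(amounts) - 1)
        # An item wider than the interval can be hit more than once.
        if not picked or picked[-1] != idx:
            picked.append(idx)
    return picked


if __name__ == "__main__":
    rng = random.Random(2012)
    print(bernoulli_sample(list(range(20)), 0.25, rng))
    print(monetary_unit_sample([10, 1000, 5, 20, 500], 3, rng))
```

In the second call the item worth 1000 is wider than the sampling interval (1535 / 3), so it is always selected, while the small items are picked only rarely. That weighting is the main difference from the uniform per-row filter.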
