Quick Links

Re: Postgres for a "data warehouse", 5-10 TB

From:	Robert Klemme <shortcutter(at)googlemail(dot)com>
To:	pgsql-performance(at)postgresql(dot)org
Subject:	Re: Postgres for a "data warehouse", 5-10 TB
Date:	2011-09-12 17:15:35
Message-ID:	j4lens$hp0$1@dough.gmane.org
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-performance

On 11.09.2011 19:02, Marti Raudsepp wrote:
> On Sun, Sep 11, 2011 at 17:23, Andy Colson<andy(at)squeakycode(dot)net> wrote:
>> On 09/11/2011 08:59 AM, Igor Chudov wrote:
>>> By the way, does that INSERT UPDATE functionality or something like this exist in Postgres?
>> You have two options:
>> 1) write a function like:
>> create function doinsert(_id integer, _value text) returns void as
>> 2) use two sql statements:
>
> Unfortunately both of these options have caveats. Depending on your
> I/O speed, you might need to use multiple loader threads to saturate
> the write bandwidth.
>
> However, neither option is safe from race conditions. If you need to
> load data from multiple threads at the same time, they won't see each
> other's inserts (until commit) and thus cause unique violations. If
> you could somehow partition their operation by some key, so threads
> are guaranteed not to conflict each other, then that would be perfect.
> The 2nd option given by Andy is probably faster.
>
> You *could* code a race-condition-safe function, but that would be a
> no-go on a data warehouse, since each call needs a separate
> subtransaction which involves allocating a transaction ID.

Wouldn't it be sufficient to reverse order for race condition safety?
Pseudo code:

begin
insert ...
catch
update ...
if not found error
end

Speed is another matter though...

Kind regards

robert

In response to

Re: Postgres for a "data warehouse", 5-10 TB at 2011-09-11 17:02:06 from Marti Raudsepp

Responses

Re: Postgres for a "data warehouse", 5-10 TB at 2011-09-12 17:22:48 from Andy Colson

Browse pgsql-performance by date

	From	Date	Subject
Next Message	Andy Colson	2011-09-12 17:22:48	Re: Postgres for a "data warehouse", 5-10 TB
Previous Message	Robert Klemme	2011-09-12 17:04:22	Re: Postgres for a "data warehouse", 5-10 TB