Re: [GSOC] Eliminate O(N^2) scaling from rw-conflict tracking in serializable transactions

From: Robert Haas <robertmhaas(at)gmail(dot)com>
To: Mengxing Liu <liu-mx15(at)mails(dot)tsinghua(dot)edu(dot)cn>
Cc: Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>, kgrittn <kgrittn(at)gmail(dot)com>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [GSOC] Eliminate O(N^2) scaling from rw-conflict tracking in serializable transactions
Date: 2017-07-28 18:46:47
Message-ID: CA+Tgmoa4qoxwdDvKpE1=HSw4Xk2u3y7MZYksnMxJ9LTRxMWHrw@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Wed, Jul 26, 2017 at 11:41 AM, Mengxing Liu <
liu-mx15(at)mails(dot)tsinghua(dot)edu(dot)cn> wrote:

> Hi, all. There was a very strange phenomenon I couldn't explain. So I was
> wondering if you can help me.
>
> I was trying to replace the linked list with a skip list in serializable
> transaction object for faster conflict tracking. But the performance is bad.
> So I used the instruction "rdtsc" to compare the speed of my skip list and
> the original linked list. The skip list was about 1.5x faster.
>
> The interesting thing is that if I added the instruction "rdstc" at the
> end of the function "RWConflictExists",
> the performance of the whole system was increased by at most 3 times!
> Here is the result.
>
> benchmarks without rdtsc with rdtsc
> simpe read/write 4.91 14.16
> ssibench 9.72 10.24
> tpcb 26.45 26.38
>
> ( The simple read/write benchmark has the most number of conflicts. )
>
> The patch is attached. All the difference of the two columns is
> with/without a simple line of code:
> __asm__ __volatile__ ("rdtsc");
> But I don't know why this instruction will influence the performance so
> much!
>

Lock contention is really expensive, so a slight delay that is just long
enough to prevent the contention from happening can sometimes improve
performance. This example is surprisingly dramatic, though. Of course, we
can't commit it this way -- it will break on non-x86.

I would suggest that you gather information on what wait events are
occurring in the "without rdtsc" case. Like this:

$ script
$ psql
psql=> select wait_event from pg_stat_activity;
psql=> \watch 0.5
...run test in another window...
^C
\q
^D
...use awk or perl or something to count up the wait events and see where
the contention is happening...

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Robert Haas 2017-07-28 18:54:21 Re: map_partition_varattnos() and whole-row vars
Previous Message Robert Haas 2017-07-28 18:42:06 Re: segfault in HEAD when too many nested functions call