Re: Proposal for CSN based snapshots

From: Heikki Linnakangas <hlinnaka(at)iki(dot)fi>
To: Andres Freund <andres(at)anarazel(dot)de>
Cc: Robert Haas <robertmhaas(at)gmail(dot)com>, Alexander Korotkov <a(dot)korotkov(at)postgrespro(dot)ru>, Simon Riggs <simon(at)2ndquadrant(dot)com>, Dilip Kumar <dilipbalaut(at)gmail(dot)com>, Jeff Davis <pgsql(at)j-davis(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>, Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>, Greg Stark <stark(at)mit(dot)edu>, Andres Freund <andres(at)2ndquadrant(dot)com>, Rajeev rastogi <rajeev(dot)rastogi(at)huawei(dot)com>, Markus Wanner <markus(at)bluegap(dot)ch>, Ants Aasma <ants(at)cybertec(dot)at>, Bruce Momjian <bruce(at)momjian(dot)us>, obartunov <obartunov(at)postgrespro(dot)ru>, Teodor Sigaev <teodor(at)postgrespro(dot)ru>, Borodin Vladimir <root(at)simply(dot)name>
Subject: Re: Proposal for CSN based snapshots
Date: 2016-08-24 08:54:32
Message-ID: 0c4fb3ae-a828-5f07-79e3-0728e31e31b1@iki.fi
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On 08/23/2016 06:18 PM, Heikki Linnakangas wrote:
> On 08/22/2016 08:38 PM, Andres Freund wrote:
>> On 2016-08-22 20:32:42 +0300, Heikki Linnakangas wrote:
>>> I
>>> remember seeing ProcArrayLock contention very visible earlier, but I can't
>>> hit that now. I suspect you'd still see contention on bigger hardware,
>>> though, my laptop has oly 4 cores. I'll have to find a real server for the
>>> next round of testing.
>>
>> Yea, I think that's true. I can just about see ProcArrayLock contention
>> on my more powerful laptop, to see it really bad you need bigger
>> hardware / higher concurrency.
>
> As soon as I sent my previous post, Vladimir Borodin kindly offered
> access to a 32-core server for performance testing. Thanks Vladimir!
>
> I installed Greg Smith's pgbench-tools kit on that server, and ran some
> tests. I'm seeing some benefit on "pgbench -N" workload, but only after
> modifying the test script to use "-M prepared", and using Unix domain
> sockets instead of TCP to connect. Apparently those things add enough
> overhead to mask out the little difference.
>
> Attached is a graph with the results. Full results are available at
> https://hlinnaka.iki.fi/temp/csn-4-results/. In short, the patch
> improved throughput, measured in TPS, with >= 32 or so clients. The
> biggest difference was with 44 clients, which saw about 5% improvement.
>
> So, not phenomenal, but it's something. I suspect that with more cores,
> the difference would become more clear.
>
> Like on a cue, Alexander Korotkov just offered access to a 72-core
> system :-). Thanks! I'll run the same tests on that.

And here are the results on the 72 core machine (thanks again,
Alexander!). The test setup was the same as on the 32-core machine,
except that I ran it with more clients since the system has more CPU
cores. In summary, in the best case, the patch increases throughput by
about 10%. That peak is with 64 clients. Interestingly, as the number of
clients increases further, the gain evaporates, and the CSN version
actually performs worse than unpatched master. I don't know why that is.
One theory that by eliminating one bottleneck, we're now hitting another
bottleneck which doesn't degrade as gracefully when there's contention.

Full results are available at
https://hlinnaka.iki.fi/temp/csn-4-72core-results/.

- Heikki

Attachment Content-Type Size
image/png 5.6 KB

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Magnus Hagander 2016-08-24 09:03:06 Re: [RFC] Change the default of update_process_title to off
Previous Message Simon Riggs 2016-08-24 08:44:45 Re: WAL consistency check facility