Quick Links

Re: Sync Rep: First Thoughts on Code

From:	Mark Mielke <mark(at)mark(dot)mielke(dot)cc>
To:	Markus Wanner <markus(at)bluegap(dot)ch>
Cc:	Heikki Linnakangas <heikki(dot)linnakangas(at)enterprisedb(dot)com>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Robert Haas <robertmhaas(at)gmail(dot)com>, Simon Riggs <simon(at)2ndquadrant(dot)com>, Fujii Masao <masao(dot)fujii(at)gmail(dot)com>, aidan(at)highrise(dot)ca, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Sync Rep: First Thoughts on Code
Date:	2008-12-20 15:29:16
Message-ID:	494D0F4C.5050000@mark.mielke.cc
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Good answers, Markus. Thanks.

I've bought the thinking of several here that the user should have some
control over what they expect (and what optimizations they are willing
to accept as a good choice), but that commit should still be able to
have a capped time limit.

I can think of many of my own applications where I would choose one mode
vs another mode, even within the same application, depending on the
operation itself. The most important requirement is that transactions
are durable. It becomes convenient, though, to provide additional
guarantees for some operation sequences.

I still see the requirement for seat reservation, bank account, or stock
trading, as synchronizing using read-write locks before starting the
select, rather than enforcing latest on every select.

For my own bank, when I do an online transaction, operations don't
always immediately appear in my list of transactions. They appear to
sometimes be batched, sometimes in near real time, and sometimes as part
of some sort of day end processing.

For seat reservation, the time the seat layout is shown on the screen is
not usually locked during a transaction. Between the time the travel
agent brings up the seats on the plane, and the time they select the
seat, the seat could be taken. What's important is that the reservation
is durable, and that conflicts are not introduced. The commit must fail
if another person has chosen the seat already already. The commit does
not need to wait until the reservation is pushed out to all systems
before completing. The same is true of stock trading.

However, it can be very convenient for commits to be immediately visible
after the commit completes. This allows for lazier models, such as a web
site that reloads the view on the reservations or recent trades and
expects to see recent commits no matter which server it accesses, rather
than taking into account that the commit succeeded when presenting the
next view.

If I look at sites like Google - they take the opposite extreme. I can
post a message, and it remembers that I posted the message and makes it
immediately visible, however, I might not see other new messages in a
thread until a minute or more later.

So it looks like there is value to both ends of the spectrum, and while
I feel the most value would be in providing a very fast system that
scales near linear to the number of nodes in the system, even at the
expense of immediately visible transactions from all servers, I can
accept that sometimes the expectations are stricter and would appreciate
seeing an option to let me choose based upon my requirements.

Cheers,
mark

Markus Wanner wrote:
> Hi,
>
> Mark Mielke wrote:
>
>> Where does the expectation come from?
>>
>
> I find the seat reservation, bank account or stock trading examples
> pretty obvious WRT user expectations.
>
> Nonetheless, I've compiled some hints from the documentation and sources:
>
> "Since in Read Committed mode each new command starts with a new
> snapshot that includes all transactions committed up to that instant" [1].
>
> "This [SERIALIZABLE ISOLATION] level emulates serial transaction
> execution, as if transactions had been executed one after another,
> serially, rather than concurrently." [1]. (IMO this implies, that a
> transaction "sees" changes from all preceding transactions).
>
> "All changes made by the transaction become visible to others and are
> guaranteed to be durable if a crash occurs." [2]. (Agreed, it's not
> overly clear here, when exactly the changes become visible. OTOH,
> there's no warning, that another session doesn't immediately see
> committed transactions. Not sure where you got that from).
>
>
>> I don't recall ever reading it in
>> the documentation, and unless the session processes are contending over
>> the integers (using some sort of synchronization primitive) in memory
>> that represent the "latest visible commit" on every single select, I'm
>> wondering how it is accomplished?
>>
>
> See the transaction system's README [3]. It documents the process of
> snapshot taking and transaction isolation pretty well. Around line 226
> it says: "What we actually enforce is strict serialization of commits
> and rollbacks with snapshot-taking". (So the outcome of your experiment
> is no surprise at all).
>
> And a bit later: "This rule is stronger than necessary for consistency,
> but is relatively simple to enforce, and it assists with some other
> issues as explained below.". While this implies, that an optimization is
> theoretically possible, I very much doubt it would be worth it (for a
> single node system).
>
> In a distributed system, things are a bit different. Network latency is
> an order of magnitude higher than memory latency (for IPC). So a similar
> optimization is very well worth it. However, the application (or the
> load balancer or both) need to know about this potential lag between
> nodes. And as you've outlined elsewhere, a limit for how much a single
> node may lag behind needs to be established.
>
> (As a side note: for a multi-master system like Postgres-R, it's
> beneficial to keep the lag time as low as possible, because the larger
> the lag, the higher the probability for a conflict between two
> transactions on different nodes.)
>
> Regards
>
> Markus Wanner
>
>
> [1]: Pg 8.3 Docu: Concurrency Control:
> http://www.postgresql.org/docs/8.3/static/transaction-iso.html
>
> [2]: Pg 8.3 Docu: COMMIT command:
> http://www.postgresql.org/docs/8.3/static/sql-commit.html
>
> [3]: README of transam (src/backend/access/transam/README):
> https://projects.commandprompt.com/public/pgsql/browser/trunk/pgsql/src/backend/access/transam/README#L224
>
>

--
Mark Mielke <mark(at)mielke(dot)cc>

In response to

Re: Sync Rep: First Thoughts on Code at 2008-12-20 14:23:59 from Markus Wanner

Responses

Re: Sync Rep: First Thoughts on Code at 2008-12-20 23:16:59 from Markus Wanner

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Markus Wanner	2008-12-20 15:30:28	Re: Sync Rep: First Thoughts on Code
Previous Message	Martin Pihlak	2008-12-20 15:27:30	Re: [COMMITTERS] pgsql: SQL/MED catalog manipulation facilities This doesn't do any