Re: Memory Errors

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Merlin Moncure <mmoncure(at)gmail(dot)com>
Cc: Sam Nelson <samn(at)consistentstate(dot)com>, pgsql-general <pgsql-general(at)postgresql(dot)org>
Subject: Re: Memory Errors
Date: 2010-09-08 20:58:27
Message-ID: 10245.1283979507@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

Merlin Moncure <mmoncure(at)gmail(dot)com> writes:
> On Wed, Sep 8, 2010 at 4:03 PM, Sam Nelson <samn(at)consistentstate(dot)com> wrote:
>> So ... yes, it seems that those four id's are somehow part of the problem.
>> They're on amazon EC2 boxes (yeah, we're not too fond of the EC2 boxes
>> either), so memtest isn't available, but no new corruption has cropped up
>> since they stopped killing the waiting queries (I just double checked - they
>> were getting corrupted rows constantly, and we haven't gotten one since that
>> script stopped killing queries).

> That's actually a startling indictment of ec2 -- how were you killing
> your queries exactly? You say this is repeatable? What's your
> setting of full_page_writes?

I think we'd established that they were doing kill -9 on backend
processes :-(. However, PG has a lot of track record that says that
backend crashes don't result in corrupt data. What seems more likely
to me is that the corruption is the result of some shortcut taken while
shutting down or migrating the ec2 instance, so that some writes that
Postgres thought got to disk didn't really.

regards, tom lane

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message tamanna madaan 2010-09-08 21:10:23 Re: error while autovacuuming
Previous Message Thom Brown 2010-09-08 20:48:23 Text search parser's treatment of URLs and emails