Re: "Select * " on 12-18M row table from remote machine thru JDBC - Performance nose-dives after 10M-ish records

From: Deron <fecastle(at)gmail(dot)com>
To: antthelimey <antthelimey(at)gmail(dot)com>
Cc: pgsql-performance(at)postgresql(dot)org
Subject: Re: "Select * " on 12-18M row table from remote machine thru JDBC - Performance nose-dives after 10M-ish records
Date: 2012-09-28 15:10:26
Message-ID: CAF3Lvs5temEvkkao-Cs2giU6MUqm=X7N+t0DJjLqsq2X4T+i4g@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

I think the best advice I can think of is to go back to the basics. Tools
like sar and top and look at logs. Changing random settings on both the
client and server seems like guessing. I find it unlikely that the changes
you made (jdbc and shared buffers) had the effects you noticed. Determine
if it is I/O, CPU, or network. Put all your settings back to the way they
were. If the DB did not change, then look at OS and network.

Deron
On Sep 28, 2012 6:53 AM, "antthelimey" <antthelimey(at)gmail(dot)com> wrote:

> On machine 1 - a table that contains between 12 and 18 million rows
> On machine 2 - a Java app that calls Select * on the table, and writes it
> into a Lucene index
>
> Originally had a fetchSize of 10,000 and would take around 38 minutes for
> 12
> million, 50 minutes for 16ish million to read it all & write it all back
> out
> as the lucene index
>
> One day it started taking 4 hours. If something changed, we dont know what
> it was
>
> We tracked it down to, after 10 million or so rows, the Fetch to get the
> next 10,000 rows from the DB goes from like 1 second to 30 seconds, and
> stays there
>
> After spending a week of two devs & DBA trying to solve this, we
> eventually
> "solved" it by upping the FetchRowSize in the JDBC call to 50,000
>
> It was performing well enough again for a few weeks
>
> then...one day... it started taking 4 hours again
>
> we tried upping the shared_buffer from 16GB to 20GB
>
> And last night... it took 7 hours
>
> we are using PGSQL 9.1
>
> does anyone have ANY ideas?!
>
> thanks much
>
>
>
> --
> View this message in context:
> http://postgresql.1045698.n5.nabble.com/Select-on-12-18M-row-table-from-remote-machine-thru-JDBC-Performance-nose-dives-after-10M-ish-records-tp5725853.html
> Sent from the PostgreSQL - performance mailing list archive at Nabble.com.
>
>
> --
> Sent via pgsql-performance mailing list (pgsql-performance(at)postgresql(dot)org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-performance
>

In response to

Browse pgsql-performance by date

  From Date Subject
Next Message ktm@rice.edu 2012-09-28 15:38:23 Re: hardware advice
Previous Message antthelimey 2012-09-28 13:52:30 "Select * " on 12-18M row table from remote machine thru JDBC - Performance nose-dives after 10M-ish records