High %SYS CPU usage

From: Cesar Martin <cmartinp(at)gmail(dot)com>
To: pgsql-performance(at)postgresql(dot)org
Subject: High %SYS CPU usage
Date: 2012-10-30 12:54:23
Message-ID: CAMAsR=5F45+kj+hw9q+zE7zo=Qc0yBEB1sLXCF0QL+dWt_7KqQ@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

Hello there,

I have PostgreSQL 8.3.18 server running on Centos 6.2 (2.6.32-220.7.1) with
this specs:

2x CPU AMD Opteron 6282
128GB RAM
Raid 10 (12HD 15k rpm 1GB cache) with data
Raid 10 (4HD 15k rpm 1GB cache) with xlog
Raid 1 (15k rpm 1GB cache shared with xlog) with system

On this server I have only one database with 312GB of data. The database
had run fine during 4 months, but from two months ago, during high work
load periods, the server is collapsed by "%sys" type load.

For example "dstat -ar --socket --tcp" during %sys load problem:
http://pastebin.com/7zfDNvPh

Reboot the server mitigates the problem during few days, but always
reappear.
Server not is swapping, don't have excessive I/O, don't have %IRQ load.

I don't have any ideas...

Thank you very much for your help.

My sysctl and postgres.conf:

sysclt -a:
http://pastebin.com/EEVnNxsZ

My Postgres.conf:
max_connections = 500 # (change requires restart)
unix_socket_directory = '/var/run/postgres' # (change requires restart)
shared_buffers = 18GB # min 128kB or max_connections*16kB
work_mem = 30MB # min 64kB
maintenance_work_mem = 1GB # min 1MB
max_fsm_pages = 8553600 # min max_fsm_relations*16, 6 bytes each
max_fsm_relations = 409000 # min 100, ~70 bytes each
fsync = on # turns forced synchronization on or off
synchronous_commit = off # immediate fsync at commit
wal_buffers = 8MB # min 32kB
checkpoint_segments = 64 # in logfile segments, min 1, 16MB each
checkpoint_completion_target = 0.9 # checkpoint target duration, 0.0 - 1.0
archive_mode = on # allows archiving to be done
archive_command = 'exit 0'
effective_cache_size = 100GB
constraint_exclusion = on
default_text_search_config = 'pg_catalog.spanish'
max_locks_per_transaction = 100

--
César Martín Pérez
cmartinp(at)gmail(dot)com

Responses

Browse pgsql-performance by date

  From Date Subject
Next Message ktm@rice.edu 2012-10-30 13:05:33 Re: Replaying 48 WAL files takes 80 minutes
Previous Message Vincenzo Melandri 2012-10-30 12:15:10 Seq scan on 10million record table.. why?