Re: Performance on new 64bit server compared to my 32bit desktop

From: Yeb Havinga <yebhavinga(at)gmail(dot)com>
To: Jose Ildefonso Camargo Tolosa <ildefonso(dot)camargo(at)gmail(dot)com>
Cc: "pgsql-performance(at)postgresql(dot)org" <pgsql-performance(at)postgresql(dot)org>
Subject: Re: Performance on new 64bit server compared to my 32bit desktop
Date: 2010-08-30 07:58:16
Message-ID: 4C7B6498.5040800@gmail.com
Lists: pgsql-performance

Jose Ildefonso Camargo Tolosa wrote:
> Also, nowadays, Intel has better performance than AMD, at least when
> comparing Athlon 64 vs Core2, I'm still saving to get a Phenom II
> system in order to benchmark them and see how it goes (does anyone
> have one of these for testing?).
root(at)p:~/ff/www.cs.virginia.edu/stream/FTP/Code# cat /proc/cpuinfo
processor : 0
vendor_id : AuthenticAMD
cpu family : 16
model : 4
model name : AMD Phenom(tm) II X4 940 Processor
stepping : 2
cpu MHz : 3000.000
cache size : 512 KB
physical id : 0
siblings : 4
core id : 0
cpu cores : 4
apicid : 0
initial apicid : 0
fpu : yes
fpu_exception : yes
cpuid level : 5
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nonstop_tsc extd_apicid pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt
bogomips : 6020.46
TLB size : 1024 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 48 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate

STREAM compiled with -O3, single instance:

root(at)p:~/ff/www.cs.virginia.edu/stream/FTP/Code# ./a.out
-------------------------------------------------------------
STREAM version $Revision: 5.9 $
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 5031 microseconds.
(= 5031 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 5056.0434 0.0064 0.0063 0.0064
Scale: 4950.4916 0.0065 0.0065 0.0065
Add: 5322.0173 0.0091 0.0090 0.0091
Triad: 5395.1815 0.0089 0.0089 0.0089
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------
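
As a sanity check on the table above, the reported rates can be tied back to the array size. This is a minimal sketch, assuming the byte counts used by the reference stream.c (Copy and Scale touch two arrays, Add and Triad touch three, 8-byte doubles, and 1 MB = 10^6 bytes for the rate). The names `ARRAYS_TOUCHED`, `implied_min_time` and `total_mib` are illustrative, not part of STREAM.

```python
# Recompute the best ("Min") time implied by each reported rate.
N = 2_000_000          # "Array size = 2000000"
BYTES_PER_WORD = 8     # "8 bytes per DOUBLE PRECISION word"

# Assumption: arrays read or written per kernel, as in the reference stream.c.
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# Single-instance rates (MB/s) copied from the output above.
reported_rate = {
    "Copy": 5056.0434,
    "Scale": 4950.4916,
    "Add": 5322.0173,
    "Triad": 5395.1815,
}


def implied_min_time(kernel: str) -> float:
    """Seconds needed to move the kernel's bytes at the reported rate."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_WORD * N
    return bytes_moved / (reported_rate[kernel] * 1e6)


# Three arrays of N doubles, expressed in units of 2**20 bytes.
total_mib = 3 * BYTES_PER_WORD * N / 2**20

if __name__ == "__main__":
    print(f"total memory: {total_mib:.1f} MB")
    for kernel in reported_rate:
        print(f"{kernel}: {implied_min_time(kernel):.4f} s")
```

Under these assumptions the implied times round to the "Min time" column (0.0063, 0.0065, 0.0090, 0.0089) and the memory figure rounds to the 45.8 MB that STREAM prints.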

Two instances in parallel:
root(at)p:~/ff/www.cs.virginia.edu/stream/FTP/Code# ./a.out & ./a.out

-------------------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 2984.2741 0.0108 0.0107 0.0108
Scale: 2945.8261 0.0109 0.0109 0.0110
Add: 3282.4631 0.0147 0.0146 0.0149
Triad: 3321.2893 0.0146 0.0145 0.0148
-------------------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 2981.4898 0.0108 0.0107 0.0108
Scale: 2943.3067 0.0109 0.0109 0.0109
Add: 3283.8552 0.0147 0.0146 0.0149
Triad: 3313.9634 0.0147 0.0145 0.0148

Four instances in parallel:
root(at)p:~/ff/www.cs.virginia.edu/stream/FTP/Code# ./a.out & ./a.out & ./a.out & ./a.out

-------------------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 1567.4880 0.0208 0.0204 0.0210
Scale: 1525.3401 0.0211 0.0210 0.0213
Add: 1739.7735 0.0279 0.0276 0.0282
Triad: 1763.4858 0.0274 0.0272 0.0276
-------------------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 1559.0759 0.0208 0.0205 0.0210
Scale: 1536.2520 0.0211 0.0208 0.0212
Add: 1740.4503 0.0279 0.0276 0.0283
Triad: 1758.4951 0.0276 0.0273 0.0279
-------------------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 1552.7271 0.0208 0.0206 0.0210
Scale: 1527.5275 0.0211 0.0209 0.0212
Add: 1737.9263 0.0279 0.0276 0.0282
Triad: 1757.3439 0.0276 0.0273 0.0278
-------------------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 1515.5912 0.0213 0.0211 0.0214
Scale: 1544.7033 0.0210 0.0207 0.0212
Add: 1754.4495 0.0278 0.0274 0.0281
Triad: 1856.3659 0.0279 0.0259 0.0284
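
To compare the three runs, the per-process Triad rates can be summed into an aggregate figure. This is only a rough sketch: it assumes the parallel instances overlapped for the whole measurement, and because each instance reports its own best time, the sum is an optimistic estimate. The names `triad_rates` and `aggregate` are illustrative.

```python
# Aggregate Triad bandwidth (MB/s) per number of parallel STREAM instances.
# Values are copied from the Triad rows of the outputs above.
triad_rates = {
    1: [5395.1815],
    2: [3321.2893, 3313.9634],
    4: [1763.4858, 1758.4951, 1757.3439, 1856.3659],
}


def aggregate(rates: list[float]) -> float:
    """Sum of per-process rates; assumes the processes ran concurrently."""
    return sum(rates)


baseline = aggregate(triad_rates[1])

if __name__ == "__main__":
    for instances, rates in triad_rates.items():
        total = aggregate(rates)
        print(f"{instances} instance(s): {total:.1f} MB/s total, "
              f"{total / baseline:.2f}x the single-instance rate")
```

With these numbers the total rises from about 5.4 GB/s for one instance to roughly 6.6 GB/s for two and 7.1 GB/s for four, while the per-process rate drops to about a third of the single-instance figure.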
