Actuall row count of Parallel Seq Scan in EXPLAIN ANALYZE .

From: Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com>
To: PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject: Actuall row count of Parallel Seq Scan in EXPLAIN ANALYZE .
Date: 2016-06-20 06:18:56
Message-ID: CAD21AoCoe2tFyttiwYD+k7-_78z5Q2+9uh780XUjLn_86bt+6Q@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Hi all,

My colleague noticed that the output of EXPLAIN ANALYZE doesn't work
fine for parallel seq scan.

postgres(1)=# explain analyze verbose select count(*) from pgbench_accounts ;
QUERY PLAN
-----------------------------------------------------------------------------------------------------------------------------------------------------
Finalize Aggregate (cost=217018.55..217018.56 rows=1 width=8)
(actual time=2640.015..2640.015 rows=1 loops=1)
Output: count(*)
-> Gather (cost=217018.33..217018.54 rows=2 width=8) (actual
time=2639.064..2640.002 rows=3 loops=1)
Output: (PARTIAL count(*))
Workers Planned: 2
Workers Launched: 2
-> Partial Aggregate (cost=216018.33..216018.34 rows=1
width=8) (actual time=2632.714..2632.715 rows=1 loops=3)
Output: PARTIAL count(*)
Worker 0: actual time=2632.583..2632.584 rows=1 loops=1
Worker 1: actual time=2627.517..2627.517 rows=1 loops=1
-> Parallel Seq Scan on public.pgbench_accounts
(cost=0.00..205601.67 rows=4166667 width=0) (actual
time=0.042..1685.542 rows=3333333 loops=3)
Worker 0: actual time=0.033..1657.486 rows=3457968 loops=1
Worker 1: actual time=0.039..1702.979 rows=3741069 loops=1
Planning time: 1.026 ms
Execution time: 2640.225 ms
(15 rows)

For example, the above result shows,
Parallel Seq Scan : actual rows = 3333333
worker 0 : actual rows = 3457968
worker 1 : actual rows = 3741069
Summation of these is 10532370, but actual total rows is 10000000.
I think that Parallel Seq Scan should show actual rows =
10000000(total rows) or actual rows = 2800963(rows collected by
itself). (10000000 maybe better)

After spent time to investigate this behaviour, ISTM that the problem
is nloops of Parallel Seq Scan.
Parallel Seq Scan is done only once, but nloops is incremented to 3.
So its "actual rows" is calculated 3333333(10000000 / 3) at explain.c:L1223.
Is it a bug?

Regards,

--
Masahiko Sawada

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Amit Kapila 2016-06-20 06:42:54 Re: Actuall row count of Parallel Seq Scan in EXPLAIN ANALYZE .
Previous Message Paul A Jungwirth 2016-06-20 05:29:43 Re: Review: GiST support for UUIDs