Re: BUG #17619: AllocSizeIsValid violation in parallel hash join

From:	Thomas Munro <thomas(dot)munro(at)gmail(dot)com>
To:	dastapov(at)gmail(dot)com, pgsql-bugs(at)lists(dot)postgresql(dot)org
Subject:	Re: BUG #17619: AllocSizeIsValid violation in parallel hash join
Date:	2022-09-22 08:44:01
Message-ID:	CA+hUKGKu3xSP7JsRGHw0d2Lxe_e4Y-3bf_1dkgAZj7xcsG=q1w@mail.gmail.com
Lists:	pgsql-bugs

On Thu, Sep 22, 2022 at 7:46 PM PG Bug reporting form
<noreply(at)postgresql(dot)org> wrote:
> (gdb) p size
> $2 = 1702125924

Thanks for the detailed report. Hmm. That size, on a little-endian
system, is equivalent to the byte sequence "date\0\0\0\0", which looks
pretty suspiciously like the inside of a tuple, and not its size. We
must have got out of sync somehow.
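
The byte-sequence reading can be checked with a short sketch. It assumes the `size` variable is a 64-bit `size_t` on a little-endian machine, which the report does not state explicitly; the variable names are illustrative only. It also compares the value against PostgreSQL's `MaxAllocSize` (0x3fffffff, one byte under 1 GB), which is the limit that `AllocSizeIsValid` enforces.

```python
# Value gdb printed for "size" in the bug report.
size = 1702125924

# Reinterpret the integer as 8 little-endian bytes
# (assumption: size_t is 64 bits wide on the reporter's machine).
raw = size.to_bytes(8, "little")
print(raw)  # b'date\x00\x00\x00\x00'

# MaxAllocSize in PostgreSQL is 0x3fffffff; anything larger fails
# the AllocSizeIsValid() check.
MAX_ALLOC_SIZE = 0x3FFFFFFF
exceeds_limit = size > MAX_ALLOC_SIZE
print(exceeds_limit)  # True
```

So the value is both too large to be a plausible tuple size and, byte for byte, spells ASCII text followed by zero padding.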

> Potentially interesting piece of the puzzle is that there are some long
> outliers in rhs.payload and rhs.source, but the rest of the columns have
> values that are exactly of avg_width bytes:
>
> # select log_len, count(*) from (select log(length(payload))::int as log_len
> from rhs) foo group by 1 order by 2 desc;
> log_len │ count
> ─────────┼────────
> 3 │ 840852
> 4 │ 77776
> 5 │ 8003
> 6 │ 1317
> 7 │ 20
> (5 rows)

So there are some strings up to order 10^7 in length in there. The
file format consists of chunks, with a special case for tuples that
don't fit in one chunk. Perhaps there is a bug in that logic. It is
exercised in our regression tests, but perhaps not enough. I'll try
to repro this from your clues.
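
For reference, the `log_len` buckets in the quoted histogram can be turned back into length ranges. This is only a sketch: it assumes `log()` is base 10 (as it is for a single argument in PostgreSQL) and that the cast to `int` rounds to the nearest integer; `bucket_range` is a hypothetical helper, not anything from the report.

```python
import math

def bucket_range(k):
    """Smallest and largest integer lengths n with round(log10(n)) == k."""
    lo = math.ceil(10 ** (k - 0.5))
    hi = math.ceil(10 ** (k + 0.5)) - 1
    return lo, hi

# Print the approximate payload length range behind each bucket.
for k in range(3, 8):
    lo, hi = bucket_range(k)
    print(f"log_len {k}: {lo} .. {hi} characters")
```

Under those assumptions, the 20 rows in bucket 7 hold payloads of roughly 3 to 32 million characters, far larger than the typical row in bucket 3.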

In response to

BUG #17619: AllocSizeIsValid violation in parallel hash join at 2022-09-21 15:57:21 from PG Bug reporting form

Responses

Re: BUG #17619: AllocSizeIsValid violation in parallel hash join at 2022-09-22 12:51:27 from Dmitry Astapov
