From: Andrey Borodin <x4mmm(at)yandex-team(dot)ru>
To: Tomas Vondra <tomas(dot)vondra(at)2ndquadrant(dot)com>
Cc: Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>, Binguo Bao <djydewang(at)gmail(dot)com>, Paul Ramsey <pramsey(at)cleverelephant(dot)ca>, Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: Optimize partial TOAST decompression
Date: 2019-09-30 16:20:22
Message-ID: E11354D1-6980-408D-AC4B-E147EB7CCDED@yandex-team.ru
Lists: pgsql-hackers

> On 30 Sep 2019, at 20:56, Tomas Vondra <tomas(dot)vondra(at)2ndquadrant(dot)com> wrote:
>
> I mean this:
>
> /*
> * Use int64 to prevent overflow during calculation.
> */
> compressed_size = (int32) ((int64) rawsize * 9 + 8) / 8;
>
> I'm not very familiar with pglz internals, but I'm a bit puzzled by
> this. My first instinct was to compare it to this:
>
> #define PGLZ_MAX_OUTPUT(_dlen) ((_dlen) + 4)
>
> but clearly that's a very different (much simpler) formula. So why
> shouldn't pglz_maximum_compressed_size simply use this macro?

compressed_size accounts for the possible increase in size during compression: in the worst case, pglz emits one control byte for every 8 bytes of data. Even if the data as a whole compresses well, a prefix of it may compress extremely ineffectively. Thus, if you are going to decompress rawsize bytes, you need at most compressed_size bytes of compressed input.
