Re: Collecting statistics about contents of JSONB columns

From: Mahendra Singh Thalor <mahi6run(at)gmail(dot)com>
To: Greg Stark <stark(at)mit(dot)edu>
Cc: Tomas Vondra <tomas(dot)vondra(at)enterprisedb(dot)com>, Nikita Glukhov <n(dot)gluhov(at)postgrespro(dot)ru>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>, Mahendra Thalor <mahendra(dot)thalor(at)enterprisedb(dot)com>, Oleg Bartunov <obartunov(at)postgrespro(dot)ru>, Thomas Munro <thomas(dot)munro(at)gmail(dot)com>
Subject: Re: Collecting statistics about contents of JSONB columns
Date: 2022-05-10 11:19:04
Message-ID: CAKYtNAr9JvjhpeKwJ_svvu7BwTUsSC1kUn5+bBOqJ2gRpgmpJw@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Fri, 1 Apr 2022 at 20:21, Greg Stark <stark(at)mit(dot)edu> wrote:
>
> This patch has bitrotted, presumably after the other JSON patchset was
> applied. It looks like it's failing in the json header file so it may
> be as simple as additional functions added on nearby lines.
>
> Please rebase. Reminder, it's the last week of the commitfest so time
> is of the essence....

Thanks, Greg for the report.

Here, I am attaching re-based patches of the v05 series. These patches
are re-based on the commit 7dd3ee508432730d15c5.

> I noticed some typos.

> diff --git a/src/backend/utils/adt/jsonb_selfuncs.c b/src/backend/utils/adt/jsonb_selfuncs.c
> index f5520f88a1d..d98cd7020a1 100644

Thanks, Justin for the review. We will fix these comments in the next version.

> Next, I am going to try to disable all-paths collection and implement
> collection of most common paths (and/or hashed paths maybe).

Thanks, Nikita for the v04 series of patches. I tested on the top of
your patches and verified that time taken by analyse is reduced for
large complex json docs.

In v03 patches, it was more than 2 hours, and in v04 patches, it is 39
sec only (time for Tomas's test case).

I am waiting for your patches (disable all-paths collection and
implement collection of most common paths)

Just for testing purposes, I am posting re-based patches here.

--
Thanks and Regards
Mahendra Singh Thalor
EnterpriseDB: http://www.enterprisedb.com

Attachment Content-Type Size
v05_0005-Export-scalarineqsel.patch application/octet-stream 1.5 KB
v05_0003-Add-symbolic-names-for-some-jsonb-operators.patch application/octet-stream 7.1 KB
v05_0001-Add-pg_operator.oprstat-for-derived-operator-statist.patch application/octet-stream 11.9 KB
v05_0002-Add-stats_form_tuple.patch application/octet-stream 3.9 KB
v05_0004-Add-helper-jsonb-functions-and-mac.patch application/octet-stream 9.4 KB
v05_0006-Add-jsonb-statistics.patch application/octet-stream 136.9 KB

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Ajin Cherian 2022-05-10 11:33:49 Re: Support logical replication of DDLs
Previous Message John Naylor 2022-05-10 09:58:31 Re: [PoC] Improve dead tuple storage for lazy vacuum