Re: list of extended statistics on psql

From: Tomas Vondra <tomas(dot)vondra(at)enterprisedb(dot)com>
To: Julien Rouhaud <rjuju123(at)gmail(dot)com>, Tatsuro Yamada <tatsuro(dot)yamada(dot)tf(at)nttcom(dot)co(dot)jp>
Cc: Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>, Tomas Vondra <tomas(dot)vondra(at)2ndquadrant(dot)com>, Michael Paquier <michael(at)paquier(dot)xyz>, Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject: Re: list of extended statistics on psql
Date: 2021-01-15 16:19:46
Message-ID: 2eb51458-7475-17d3-cb13-b3c950b2f930@enterprisedb.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On 1/15/21 9:47 AM, Julien Rouhaud wrote:
> On Wed, Jan 13, 2021 at 10:22:05AM +0900, Tatsuro Yamada wrote:
>> Hi Tomas,
>>
>> On 2021/01/13 7:48, Tatsuro Yamada wrote:
>>> On 2021/01/12 20:08, Tomas Vondra wrote:
>>>> On 1/12/21 2:57 AM, Tatsuro Yamada wrote:
>>>>> On 2021/01/09 9:01, Tomas Vondra wrote:
>>>> ...>
>>>>>> While working on that, I realized that 'defined' might be a bit
>>>>>> ambiguous, I initially thought it means 'NOT NULL' (which it does not).
>>>>>> I propose to change it to 'requested' instead. Tatsuro, do you agree, or
>>>>>> do you think 'defined' is better?
>>>>>
>>>>> Regarding the status of extended stats, I think the followings:
>>>>>
>>>>>   - "defined": it shows the extended stats defined only. We can't know
>>>>>                whether it needs to analyze or not. I agree this name was
>>>>>                 ambiguous. Therefore we should replace it with a more suitable
>>>>>                name.
>>>>>   - "requested": it shows the extended stats needs something. Of course,
>>>>>                we know it needs to ANALYZE because we can create the patch.
>>>>>                However, I feel there is a little ambiguity for DBA.
>>>>>                To solve this, it would be better to write an explanation of
>>>>>                the status in the document. For example,
>>>>>
>>>>> ======
>>>>> The column of the kind of extended stats (e. g. Ndistinct) shows some statuses.
>>>>> "requested" means that it needs to gather data by ANALYZE. "built" means ANALYZE
>>>>>   was finished, and the planner can use it. NULL means that it doesn't exists.
>>>>> ======
>>>>>
>>>>> What do you think? :-D
>>>>>
>>>>
>>>> Yes, that seems reasonable to me. Will you provide an updated patch?
>>>
>>>
>>> Sounds good. I'll send the updated patch today.
>>
>>
>>
>> I updated the patch to add the explanation of the extended stats' statuses.
>> Please feel free to modify the patch to improve it more clearly.
>>
>> The attached files are:
>> 0001: Add psql \dx and the fixed document
>> 0002: Regression test for psql \dX
>> app-psql.html: Created by "make html" command (You can check the
>> explanation of the statuses easily, probably)
>
> Hello Yamada-san,
>
> I reviewed the patch and don't have specific complaints, it all looks good!
>
> I'm however thinking about the "requested" status. I'm wondering if it could
> lead to people think that an ANALYZE is scheduled and will happen soon.
> Maybe "defined" or "declared" might be less misleading, or even "waiting for
> analyze"?
>

Well, the "defined" option is not great either, because it can be
interpreted as "NOT NULL" - that's why I proposed "requested". Not sure
about "declared" - I wouldn't use it in this context, but I'm not a
native speaker so maybe it's OK.

regards

--
Tomas Vondra
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Tom Lane 2021-01-15 16:30:45 Re: pg_preadv() and pg_pwritev()
Previous Message Amit Langote 2021-01-15 15:34:09 Re: POC: postgres_fdw insert batching