Re: pageinspect patch, for showing tuple data

From: Michael Paquier <michael(dot)paquier(at)gmail(dot)com>
To: Nikolay Shaplov <n(dot)shaplov(at)postgrespro(dot)ru>
Cc: PostgreSQL mailing lists <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: pageinspect patch, for showing tuple data
Date: 2015-09-30 04:49:00
Message-ID: CAB7nPqSU0hFKnZXzEqRvum10nWYnv7Cj7x7Per5A=QpoR9BKJw@mail.gmail.com
Lists: pgsql-hackers

On Tue, Sep 29, 2015 at 11:39 PM, Nikolay Shaplov wrote:
> But since now we actually parse data with tuple_data_split, we can provide a
> precisely formed fake information, so you are not limited with how it is
> actually stored in page. You just pass any arguments you want. So you does not
> need warning mode anymore.

Yeah, I agree with you here, let's simplify it then. One could as well
catch the error in a plpgsql wrapper function if that's really necessary
and log the failed events at the same time in a custom way.

- errmsg("input page too small (%d bytes)", raw_page_size)));
+ errmsg("input page too small (%d bytes)", raw_page_size)));
Please be careful of spurious diffs. Those just make the life of committers
more difficult than it already is.

+ <para>
+ General idea about output columns: <function>lp_*</function> attributes
+ are about where tuple is located inside the page;
+ <function>t_xmin</function>, <function>t_xmax</function>,
+ <function>t_field3</function>, <function>t_ctid</function> are about
+ visibility of this tuplue inside or outside of the transaction;
+ <function>t_infomask2</function>, <function>t_infomask</function> has some
+ information about properties of attributes stored in tuple data.
+ <function>t_hoff</function> sais where tuple data begins and
+ <function>t_bits</function> sais which attributes are NULL and which are
+ not. Please notice that t_bits here is not an actual value that is stored
+ in tuple data, but it's text representation with '0' and '1' charactrs.
+ </para>
I would remove that as well. htup_details.h contains all this information.
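
As an aside on the "text representation" wording in the quoted paragraph, here is a minimal sketch of how a null bitmap can be rendered as '0' and '1' characters. It assumes bits are read least significant bit first within each byte, which is how the att_isnull() macro addresses the bitmap, and that a set bit marks an attribute that is present (not NULL). The function name bitmap_to_text is hypothetical and this is not the code from the patch.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Hypothetical helper: write the first nbits bits of a null bitmap into dst
 * as '0'/'1' characters, followed by a terminating NUL.
 * Bit i is taken from byte i / 8 at position i % 8 (least significant first).
 * dst must have room for nbits + 1 bytes.
 */
void
bitmap_to_text(const unsigned char *bits, size_t nbits, char *dst)
{
    size_t i;

    assert(bits != NULL && dst != NULL);

    for (i = 0; i < nbits; i++)
        dst[i] = (bits[i / 8] & (1 << (i % 8))) ? '1' : '0';

    dst[nbits] = '\0';
}
```

With a one-byte bitmap of 0x05 and three attributes, this yields "101": the first and third attributes are present and the second is NULL.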

+ <para>
+ For more detailed information see documentation:
+ <xref linkend="storage-page-layout">, <xref linkend="ddl-system-columns">
+ and source code: <filename>src/include/storage/itemid.h</>,
+ <filename>src/include/access/htup_details.h</>.
+ </para>
This looks cool to me though.

+<screen>
+test=# select * from heap_page_item_attrs(get_raw_page('pg_range', 0),'pg_range'::regclass);
+[loooooong tuple data]
This example has too many characters per line. I think that you should
cut most of the fields; why not keep just lp and t_attrs, for example?

+ <tbody>
+ <row>
+ <entry><structfield>rel_oid</structfield></entry>
+ <entry><type>oid</type></entry>
+ <entry>OID of the relation, of the tuple we want to split</entry>
+ </row>
+
+ <row>
+ <entry><structfield>tuple_data</structfield></entry>
+ <entry><type>bytea</type></entry>
+ <entry>tuple raw data to split
+ </entry>
+ </row>
In the description of tuple_data_split, I am not sure it is worth listing
all the arguments of the function like that. IMO, we should just say that
those are the fields returned by "heap_page_items". tuple_data should also
be renamed to t_data.

+ tuple attributes instead of one peace of raw tuple data. All other return
This is not that "peaceful" to me. It should be "piece" :)

+ values[13] = PointerGetDatum(tuple_data_bytea);
+ nulls[13] = false;
There is no point in setting nulls[13] here.
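
One plausible reading of this remark, sketched below, assumes that the nulls array has already been zeroed with memset() before the columns are filled in, as heap_page_items does. In that case every entry is already false and only the NULL case needs an assignment. The names build_row and NCOLS, and the have_data flag, are hypothetical and exist only for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

#define NCOLS 14    /* illustrative column count, so that index 13 is valid */

/*
 * Hypothetical stand-in for the code that fills one result row.
 * nulls[] is zeroed first, so each column starts out as "not NULL".
 */
void
build_row(bool nulls[NCOLS], bool have_data)
{
    assert(nulls != NULL);

    memset(nulls, 0, sizeof(bool) * NCOLS);

    if (!have_data)
        nulls[13] = true;   /* only the NULL case needs an assignment */

    /* otherwise nulls[13] is already false; setting it again is redundant */
}
```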

+<screen>
+test=# select tuple_data_split('pg_range'::regclass, '\x400f00001700000000000000ba0700004a0f0000520f0000'::bytea, 2304, 6, null);
+ tuple_data_split
+---------------------------------------------------------------------------------------
+ {"\\x400f0000","\\x17000000","\\x00000000","\\xba070000","\\x4a0f0000","\\x520f0000"}
+(1 row)
This would be more demonstrative if combined with heap_page_items, for
example like this:
SELECT tuple_data_split('pg_class'::regclass, t_data, t_infomask,
t_infomask2, t_bits) FROM heap_page_items(get_raw_page('pg_class', 0));
And actually this query crashes.
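
For reference, the split shown in the quoted pg_range example (24 bytes of raw data returned as six 4-byte values) can be mimicked by hand with the rough sketch below. It assumes every attribute is fixed-width with the same size, with no NULLs, no varlena attributes and no alignment padding, which is far less than what tuple_data_split has to handle. The function name split_fixed_width and its signature are hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>
#include <string.h>

/*
 * Hypothetical helper, not part of pageinspect: cut a raw tuple data buffer
 * into attributes of "width" bytes each and hex-encode every attribute.
 * Returns the number of attributes written, or -1 if the input does not
 * divide evenly or does not fit into the output array.
 */
int
split_fixed_width(const unsigned char *data, size_t len, size_t width,
                  char out[][32], size_t max_out)
{
    size_t natts, i, j;

    assert(data != NULL && out != NULL);

    /* Each attribute needs 2 * width hex digits plus a terminating NUL. */
    if (width == 0 || width * 2 + 1 > 32 || len % width != 0)
        return -1;

    natts = len / width;
    if (natts > max_out)
        return -1;

    for (i = 0; i < natts; i++)
    {
        memset(out[i], 0, sizeof(out[i]));
        for (j = 0; j < width; j++)
            snprintf(out[i] + j * 2, 3, "%02x", data[i * width + j]);
    }

    return (int) natts;
}
```

Fed the 24 bytes from the quoted example with a width of 4, it produces six strings, starting with "400f0000" and ending with "520f0000".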
--
Michael
