Re: Problem on pg_dump RANGE partition with expressions

From: Amit Langote <Langote_Amit_f8(at)lab(dot)ntt(dot)co(dot)jp>
To: Yugo Nagata <nagata(at)sraoss(dot)co(dot)jp>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: Problem on pg_dump RANGE partition with expressions
Date: 2018-07-12 08:44:48
Message-ID: 03767a65-b7c9-e848-d47c-898f2952e1b6@lab.ntt.co.jp
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Nagata-san,

On 2018/07/12 16:59, Yugo Nagata wrote:
> Hi,
>
> During looking into other thread[1], I found a problem on pg_dump of range
> partition table using expressions. When we create a range partitioned table,
> we cannot use a column more than once in the partition key.
>
> postgres=# create table t (i int) partition by range(i,i);
> ERROR: column "i" appears more than once in partition key
>
> On the other hand, the following query using expression is allowed.
>
> postgres=# create table test (i int) partition by range(i,(i));
> CREATE TABLE
>
> However, when we use pg_dump for this, we get
>
> CREATE TABLE public.test (
> i integer
> )
> PARTITION BY RANGE (i, i);
>
> , and we cannot restore this due to the error.

Oops.

> I can consider three approaches to resolve this.
>
>
> 1) Allow to appear more than once in range partition key
>
> I don't understand why there is this restriction. If we have no clear reason,
> can we rip out this restrition?

I can't recall exactly, but back when I wrote this code, I might have been
thinking that such a feature is useless and would actually just end up
being a foot gun for users.

Although, I'm open to ripping it out if it's seen as being overly restrictive.

> 2) Forbid this kind of partition key
>
> If we can make more checks and forbid such partition keys as RANGE(i,(i)),
> we can avoid the problem though it is a bit rigorous.

If there is a way for transformPartitionSpec to conclude that i and (i)
are in fact the same thing and error out, which is perhaps not that hard
to do, then maybe we could do that. That is, if we don't want to rip the
error out as your proposed solution (1) above.

> 3) Treat expressions as it is
>
> For some reasons, expressions like "(column)" or "(column COLLATE something)"
> is treated like simple attributes in the current implementation
> (in ComputePartitionAttr() ).
>
> If we treat these expressions as it is, pg_dump result will be the same
> expression as when the partition table is created, and we can restore this
> successfully.

Hmm, I think it's better to simplify (column) into a simple column
reference instead of uselessly maintaining it as an expression. There's
no other good reason to do this.

Thanks,
Amit

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Marina Polyakova 2018-07-12 08:57:01 Re: [HACKERS] WIP Patch: Pgbench Serialization and deadlock errors
Previous Message Marina Polyakova 2018-07-12 08:28:34 Re: [HACKERS] WIP Patch: Pgbench Serialization and deadlock errors