Re: [HACKERS] Case Preservation disregarding case

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: "Chuck McDevitt" <cmcdevitt(at)greenplum(dot)com>
Cc: "Stephan Szabo" <sszabo(at)megazone(dot)bigpanda(dot)com>, "beau hargis" <beauh(at)bluefrogmobile(dot)com>, pgsql-sql(at)postgresql(dot)org, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [HACKERS] Case Preservation disregarding case
Date: 2006-11-01 06:38:25
Message-ID: 14059.1162363105@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers pgsql-sql

"Chuck McDevitt" <cmcdevitt(at)greenplum(dot)com> writes:
> Equivalent, yes. But I can interpret that clause it mean I can show
> either the case folded or non-case-folded value in the information
> schema, as they are equivalent.

Well, that's an interesting bit of specs-lawyering, but I don't see
how you can defend it against these rules in SQL99 5.2:

21) For every <identifier body> IB there is exactly one
corresponding case-normal form CNF. CNF is an <identifier body>
derived from IB as follows.

Let n be the number of characters in IB. For i ranging from 1
(one) to n, the i-th character M(i) of IB is translated into the
corresponding character or characters of CNF as follows.

Case:

a) If M(i) is a lower case character or a title case character
for which an equivalent upper case sequence U is defined by
Unicode, then let j be the number of characters in U; the
next j characters of CNF are U.

b) Otherwise, the next character of CNF is M(i).

22) The case-normal form of the <identifier body> of a <regular
identifier> is used for purposes such as and including
determination of identifier equivalence, representation in
the Definition and Information Schemas, and representation in
diagnostics areas.

NOTE 44 - Any lower-case letters for which there are no upper-
case equivalents are left in their lower-case form.

Again, obviously we are not compliant because we fold to lower rather
than upper case, but I do not see how you can read (22) as not requiring
the information schema to show the upper-cased form. The output of
functions such as PQfname() might be considered closer to diagnostics
info than information schema, but that's covered too.

But the really serious problem with what you propose is that it would
allow two table columns with names that the system considers distinct
to show as the same string in the information schema and diagnostic
outputs. That can't be acceptable --- it's going to break any
application that does any nontrivial analysis of what it sees there,
not to mention that it violates various primary key constraints in
the information schema specification.

regards, tom lane

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Henry B. Hotz 2006-11-01 06:58:08 Re: Design Considerations for New Authentication Methods
Previous Message jungmin shin 2006-11-01 05:03:17 invoking Web service in user-defined function in a dynamic way.

Browse pgsql-sql by date

  From Date Subject
Next Message roopa perumalraja 2006-11-01 06:49:15 Re: Add calculated fields from one table to other table
Previous Message Richard Broersma Jr 2006-11-01 05:51:12 Re: Add calculated fields from one table to other table