Quick Links

Re: Unicode Normalization

From:	pg(at)thetdh(dot)com
To:	"David E(dot) Wheeler" <david(at)kineticode(dot)com>, "PG Hackers" <pgsql-hackers(at)postgresql(dot)org>
Cc:	"Hudson, T(dot) David" <pg1(at)thetdh(dot)com>
Subject:	Re: Unicode Normalization
Date:	2009-09-24 13:24:07
Message-ID:	W979322959270161253798647@webmail34
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

In a context using normalization, wouldn't you typically want to store a normalized-text type that could perhaps (depending on locale) take advantage of simpler, more-efficient comparison functions? Whether you're doing INSERT/UPDATE, or importing a flat text file, if you canonicalize characters and substrings of identical meaning when trivial distinctions of encoding are irrelevant, you're better off later. User-invocable normalization functions by themselves don't make much sense. (If Postgres now supports binary- or mixed-binary-and-text flat files, perhaps for restore purposes, the same thing applies.)

David Hudson

Responses

Re: Unicode Normalization at 2009-09-24 15:36:37 from David E. Wheeler

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Jan Urbański	2009-09-24 14:21:47	Re: [PATCH] DefaultACLs
Previous Message	ning	2009-09-24 13:17:12	Re: "BEGIN TRANSACTION" and "START TRANSACTION": different error handling