Re: character_not_in_repertoire vs. untranslatable_character

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Chapman Flack <chap(at)anastigmatix(dot)net>
Cc: PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: character_not_in_repertoire vs. untranslatable_character
Date: 2016-03-06 22:56:32
Message-ID: 3701.1457304992@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Chapman Flack <chap(at)anastigmatix(dot)net> writes:
> So there's an ISO error 22021 "character not in repertoire" and
> a PostgreSQL error 22P05 "untranslatable character" that seem
> very similar.

> If I look in backend/utils/mb/wchar.c, it looks as if PostgreSQL
> uses the first for the case of a corrupted encoding (bytes that
> can't be decoded to a character at all), and the second for the
> case of a valid character that isn't available in a conversion's
> destination encoding.

Yeah, that's the intended distinction I believe, though I would not
want to swear that we've been 100% consistent. 22021 means "this
character is bad in isolation", AFAICT, so it didn't seem appropriate
for the conversion scenario.

regards, tom lane

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Thomas Munro 2016-03-06 23:12:03 Re: Proposal: "Causal reads" mode for load balancing reads without stale data
Previous Message Thom Brown 2016-03-06 21:30:55 Re: The plan for FDW-based sharding