Skip site navigation (1) Skip section navigation (2)

Re: SET client_encoding = 'UTF8'

From: Oliver Jowett <oliver(at)opencloud(dot)com>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: Daniel Migowski <dmigowski(at)ikoffice(dot)de>, Kris Jurka <books(at)ejurka(dot)com>, pgsql-jdbc(at)postgresql(dot)org
Subject: Re: SET client_encoding = 'UTF8'
Date: 2008-05-19 15:39:47
Message-ID: 48319F43.5070008@opencloud.com (view raw or flat)
Thread:
Lists: pgsql-jdbc
Tom Lane wrote:
> Daniel Migowski <dmigowski(at)ikoffice(dot)de> writes:
>> Kris Jurka schrieb:
>>> On Sun, 18 May 2008, Daniel Migowski wrote:
>>>> The command SET client_encoding = 'UTF8'
>>> throws an exception in the driver, because the driver expects UNICODE.
>>> This has been discussed before and the problem is that there are a too 
>>> many ways to say UTF8 [1].  You can say UTF8, UTF-8, UTF -- 8, and so 
>>> on. Perhaps we should strip all spaces and dashes prior to comparison?
> 
> Perhaps we should make the backend return the values of client_encoding
> and server_encoding in canonical form (ie, "UTF8") regardless of the
> spelling variant the user used.  I'm not thrilled with having JDBC
> thinking it knows the conversion algorithm the backend uses.
> 
> Of course, such a change would break code relying on the older behavior
> :-(

Not sure if this is a big enough issue to warrant a server change. It 
only happens when a JDBC client issues a manual SET client_encoding to 
an encoding that's UTF8 but isn't spelled "UNICODE". That's going to be 
a no-op anyway, so I'm not entirely clear why the client needs to be 
sending it in the first place.

It sounds like the root cause might be something like "let's feed 
pg_dump output to JDBC". So we could add a special case in the driver to 
allow exactly "UTF8" as well as "UNICODE", if that's the canonical way 
the server spells it these days.

-O


In response to

Responses

pgsql-jdbc by date

Next:From: Tom LaneDate: 2008-05-19 15:52:07
Subject: Re: SET client_encoding = 'UTF8'
Previous:From: Tom LaneDate: 2008-05-19 14:03:44
Subject: Re: SET client_encoding = 'UTF8'

Privacy Policy | About PostgreSQL
Copyright © 1996-2014 The PostgreSQL Global Development Group