Re: Impact of UNICODE encoding on performance

From: Harry Mantheakis <harry(at)mantheakis(dot)freeserve(dot)co(dot)uk>
To: <pgsql-novice(at)postgresql(dot)org>
Subject: Re: Impact of UNICODE encoding on performance
Date: 2004-03-18 16:42:03
Message-ID: BC7F81DB.C4F3%harry@mantheakis.freeserve.co.uk
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-novice

Marc, thanks for your input.

The more I think about it, the more it seems like Unicode is the right
answer.

In the meantime I have not seen anyone screaming out about performance
issues, so I am not going to worry about it any more.

Unicode also supports the Euro currency symbol, which helps a lot :-)

Kind regards

Harry

> With UNICODE UTF-8 the basic (a-z, A-Z, 0-9, ...) 128 characters
> (there are actually less than 128) are single byte characters
> identical to the original ASCII specification. All other characters
> might have multiple bytes.
>
> This means that as long you are transferring roman alphabet based
> text, the impact will be very low since the text will mostly consist
> of those 128 characters.
>
> for other languages more characters consisting of multiple bytes
> would be transferred.
>
> I don't know about PostgreSQL's internal treatement of multi-byte
> characters and whether this woud require more CPU time.
>
> After weighing pro and cons, I'd definitely go with UNICODE.
>
> Marc

In response to

Browse pgsql-novice by date

  From Date Subject
Next Message Robert Morgan 2004-03-19 09:48:45 connect from network PC
Previous Message Tom Lane 2004-03-18 16:09:50 Re: books