Re: invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

From: "Albe Laurenz" <laurenz(dot)albe(at)wien(dot)gv(dot)at>
To: "Rural Hunter *EXTERN*" <ruralhunter(at)gmail(dot)com>, <pgsql-admin(at)postgresql(dot)org>
Subject: Re: invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?
Date: 2012-04-17 10:06:53
Message-ID: D960CB61B694CF459DCFB4B0128514C207C3392F@exadv11.host.magwien.gv.at
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-admin

Rural Hunter wrote:
>>>>> My db is in utf-8, I have a row in my table say tmp_article and I wanted
>>>>> to generate ts_vector from the article content:
>>>>> select to_tsvector(content) from tmp_article;
>>>>> But I got this error:
>>>>> ERROR: invalid byte sequence for encoding "UTF8": 0xf481

>> Do you use any nonstandard text search parsers or dictionaries?
>> What is the text search configuration you use (parameter
>> default_text_search_config)?

> Yes, I'm using a Chinese tsearch2 plugin called nlpbamboo and the
> default default_text_search_config is 'chinesecfg'. I tested "select
> to_tsvector('english',content) from tmp_article" and this works. So it's
> the problem of nlpbamboo?

That's probably the problem - it seems to emit something that is
not proper UTF-8 sometimes.

Do you get the error if you try Chinese settings without nlpbamboo?

> btw, what's the meaning of "Please don't top post."? I
> don't understand but sorry if I caused any problem.

http://en.wikipedia.org/wiki/Posting_style

Yours,
Laurenz Albe

In response to

Responses

Browse pgsql-admin by date

  From Date Subject
Next Message Igor Neyman 2012-04-17 13:43:59 Re: Recreate primary key without dropping foreign keys?
Previous Message videanuadrian 2012-04-17 05:42:50 Re: How to auto swtich the roles of primary and standby