Re: invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?

From: "Albe Laurenz" <laurenz(dot)albe(at)wien(dot)gv(dot)at>
To: "Rural Hunter *EXTERN*" <ruralhunter(at)gmail(dot)com>,<pgsql-admin(at)postgresql(dot)org>
Subject: Re: invalid byte sequence for encoding "UTF8": 0xf481 - how could this happen?
Date: 2012-04-17 10:06:53
Lists: pgsql-admin
Rural Hunter wrote:
>>>>> My db is in utf-8, I have a row in my table say tmp_article and I wanted
>>>>> to generate ts_vector from the article content:
>>>>> select to_tsvector(content) from tmp_article;
>>>>> But I got this error:
>>>>> ERROR:  invalid byte sequence for encoding "UTF8": 0xf481

>> Do you use any nonstandard text search parsers or dictionaries?
>> What is the text search configuration you use (parameter
>> default_text_search_config)?

> Yes, I'm using a Chinese tsearch2 plugin called nlpbamboo and the
> default default_text_search_config is 'chinesecfg'. I tested "select
> to_tsvector('english',content) from tmp_article" and this works. So it's
> the problem of nlpbamboo?

That's probably the problem - it seems to emit something that is
not proper UTF-8 sometimes.

Do you get the error if you try Chinese settings without nlpbamboo?
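
To illustrate what "not proper UTF-8" means for the two bytes in the error message, here is a minimal Python sketch. It runs outside the database and says nothing about how nlpbamboo works internally; the helper name check_utf8 and the sample strings are made up for this example. The byte 0xf4 opens a four-byte sequence, so 0xf4 0x81 on its own is incomplete. Cutting a string in the middle of a multi-byte character is one plausible way, assumed here and not confirmed in this thread, for a parser to produce such a fragment.

```python
def check_utf8(data: bytes) -> str:
    """Return 'ok' if data is valid UTF-8, else describe where decoding fails.

    Hypothetical helper for illustration only; not part of PostgreSQL or nlpbamboo.
    """
    try:
        data.decode("utf-8")
        return "ok"
    except UnicodeDecodeError as exc:
        bad = data[exc.start:exc.end].hex()
        return f"invalid at byte {exc.start}: 0x{bad} ({exc.reason})"


# 0xF4 announces a four-byte sequence. 0x81 is a legal second byte,
# but two more continuation bytes (0x80-0xBF) would have to follow.
truncated = bytes([0xF4, 0x81])
complete = bytes([0xF4, 0x81, 0x80, 0x80])  # encodes code point U+101000

# Assumed failure mode: slicing bytes without respecting character boundaries.
# Each of the two characters below takes three bytes in UTF-8.
cut_mid_character = "中文".encode("utf-8")[:-1]

print(check_utf8(truncated))
print(check_utf8(complete))
print(check_utf8(cut_mid_character))
```

The first and third checks report a decoding failure and the second passes, which fits the idea that the stored content can be fine while a fragment produced from it is not.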

> btw, what's the meaning of "Please don't top post."? I
> don't understand but sorry if I caused any problem.

Laurenz Albe
