Re: full-text search question

From: Andrew Sullivan <ajs(at)commandprompt(dot)com>
To: pgsql-admin(at)postgresql(dot)org
Subject: Re: full-text search question
Date: 2008-06-18 13:55:48
Message-ID: 20080618135548.GD433@commandprompt.com
Lists: pgsql-admin
On Wed, Jun 18, 2008 at 02:49:48PM +0200, Sabbiolina wrote:
> Is www.google.com treated only as a single word? Why not produce multiple
> tokens like www.google.com, www, ., google, ., com? (Obviously www and . can
> be nulled or stopworded.)

You wouldn't want to get the token ".".  It's not a token, but a label
boundary.  So in your analogy of treating the labels in an FQDN as
"words", the "." needs to be treated the way spaces are between words.

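To make the point about "." concrete, here is a minimal sketch in Python.
It is not how the PostgreSQL text search parser works; fqdn_tokens is a
hypothetical helper, and lowercasing and dropping a trailing root dot are
assumptions made for the illustration.  It only shows "." acting as a
separator between labels rather than being emitted as a token itself.

```python
def fqdn_tokens(name):
    """Return the whole name plus each label, treating "." as a boundary.

    Hypothetical helper for illustration only; it does not reproduce
    the behaviour of any real full-text search parser.
    """
    # Assumption: normalise case and drop a trailing root dot.
    whole = name.lower().rstrip(".")
    # Splitting on "." plays the role that splitting on spaces plays
    # for ordinary words; the separator itself is never emitted.
    labels = [label for label in whole.split(".") if label]
    return [whole] + labels


print(fqdn_tokens("www.google.com"))
```

Whether "www" is then discarded as a stopword is a separate step and is
not shown here.
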
A

-- 
Andrew Sullivan
ajs(at)commandprompt(dot)com
+1 503 667 4564 x104
http://www.commandprompt.com/
