Re: Full text search on partial URLs

From: bricklen <bricklen(at)gmail(dot)com>
To: Zev Benjamin <zev-pgsql(at)strangersgate(dot)com>
Cc: "pgsql-general(at)postgresql(dot)org" <pgsql-general(at)postgresql(dot)org>
Subject: Re: Full text search on partial URLs
Date: 2013-11-06 18:47:38
Message-ID: CAGrpgQ-46WBaHT=nf7ms3X2LQPiWQYYwNt3B2XT1dVUXJNKMnA@mail.gmail.com
Lists: pgsql-general

On Wed, Nov 6, 2013 at 10:45 AM, Zev Benjamin
<zev-pgsql(at)strangersgate(dot)com> wrote:

> Hi,
>
> I have Postgres full text search set up for my application and it's been
> working great! However, my users would like their searches to turn up
> parts of URLs. For example, they would like a search for "foobar" to turn
> up a document that contains the string "http://example.com/foobar/blah"
> (and similarly for queries like "example" and "blah"). With the default
> dictionaries for host, url, and url_path, the search query would have to
> contain the complete host or url path.
>
> What is the best way to accomplish this? Should I be looking at building
> a custom dictionary that breaks down hosts and urls or is there something
> simpler I can do?
>

Have you looked into trigrams?
http://www.postgresql.org/docs/current/static/pgtrgm.html
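
For what it's worth, here is a rough sketch of how that might look. The
table "documents" and its columns "id" and "body" are made-up names, and I
am assuming PostgreSQL 9.1 or later with the pg_trgm contrib module
installed, so adjust to your schema before trying it:

```sql
-- Enable the trigram module (ships in contrib).
CREATE EXTENSION IF NOT EXISTS pg_trgm;

-- Hypothetical table and column names; substitute your own.
-- A GIN index with gin_trgm_ops lets LIKE/ILIKE use the index
-- even when the pattern is not anchored at the start.
CREATE INDEX documents_body_trgm_idx
    ON documents USING gin (body gin_trgm_ops);

-- Substring search: matches a body containing
-- "http://example.com/foobar/blah".
SELECT id
FROM documents
WHERE body ILIKE '%foobar%';
```

This would sit alongside your existing tsvector search rather than replace
it. Note that patterns shorter than three characters contain no complete
trigram, so the index will not help much for those.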
