Re: text search: restricting the number of parsed words in headline generation

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: sushant354(at)gmail(dot)com
Cc: pgsql-hackers(at)postgresql(dot)org
Subject: Re: text search: restricting the number of parsed words in headline generation
Date: 2011-08-23 18:59:18
Message-ID: 1659.1314125958@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Sushant Sinha <sushant354(at)gmail(dot)com> writes:
> Given a document and a query, the goal of headline generation is to
> produce text excerpts in which the query appears.

... right ...

> Here is a simple patch that limits the number of words during the
> tokenization phase and puts an upper-bound on the headline generation.

Doesn't this force the headline to be taken from the first N words of
the document, independent of where the match was? That seems rather
unworkable, or at least unhelpful.

regards, tom lane

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Tom Lane 2011-08-23 19:05:20 Re: skip WAL on COPY patch
Previous Message Peter Eisentraut 2011-08-23 18:57:28 Re: Why doesn't psql use the information schema to get ACL description ?