Re: Searching for pgweb

From: Oleg Bartunov <obartunov(at)gmail(dot)com>
To: Magnus Hagander <magnus(at)hagander(dot)net>
Cc: PostgreSQL WWW <pgsql-www(at)postgresql(dot)org>
Subject: Re: Searching for pgweb
Date: 2017-03-24 07:56:22
Message-ID: CAF4Au4xEo1ZddDFvjkJHBXE3_VOvp9rZfSOywKA0eUW9m=DAdw@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-www

On Wed, Mar 22, 2017 at 7:51 PM, Magnus Hagander <magnus(at)hagander(dot)net>
wrote:

> Right now our main website search uses plainto_tsquery() to generate the
> searches.
>
> Should we consider switching that to phraseto_tsquery() now that we have
> phrase searching?
>

+1

Also, I suggest to use new parser, which better works _ and -, for example:

1.
select ts_parse('tsparser', 'btree_gin');
ts_parse
----------------
(16,btree_gin)
(11,btree)
(12,_)
(11,gin)
(4 rows)

select ts_parse('default', 'btree_gin');
ts_parse
-----------
(1,btree)
(12,_)
(1,gin)
(3 rows)

Default parser produces too much noise, just check the difference:

https://postgrespro.ru/search/?area=version&q=btree_gin&product=postgresql&version=9.6

https://www.postgresql.org/search/?u=%2Fdocs%2F9.6%2F&q=btree_gin

2.
select ts_parse('tsparser', 'utc-5');
ts_parse
------------
(15,utc-5)
(11,utc)
(12,-)
(9,5)
(4 rows)

select ts_parse('default', 'utc-5');
ts_parse
----------
(1,utc)
(21,-5)
(2 rows)

again, compare

https://postgrespro.ru/search/?area=version&q=utc-5&product=postgresql&version=9.6

https://www.postgresql.org/search/?u=%2Fdocs%2F9.6%2F&q=utc-5

We have also better parsing of email, but I'm not sure we need it on
postgres site.

We'll publish soon on github, let me know if you know it.

Oleg

> //Magnus
>
>
>
> --
> Magnus Hagander
> Me: http://www.hagander.net/
> Work: http://www.redpill-linpro.com/
>

In response to

Responses

Browse pgsql-www by date

  From Date Subject
Next Message Oleg Bartunov 2017-03-24 08:44:41 Re: Searching for pgweb
Previous Message Amit Langote 2017-03-24 06:16:53 Re: No hits when search devel documentation