Re: once more: documentation search indexing

From: Daniel Gustafsson <daniel(at)yesql(dot)se>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: Magnus Hagander <magnus(at)hagander(dot)net>, Robert Treat <rob(at)xzilla(dot)net>, Peter Geoghegan <pg(at)bowt(dot)ie>, "Jonathan S(dot) Katz" <jkatz(at)postgresql(dot)org>, Michael Christofides <michael(at)pgmustard(dot)com>, Andres Freund <andres(at)anarazel(dot)de>, PostgreSQL WWW <pgsql-www(at)lists(dot)postgresql(dot)org>
Subject: Re: once more: documentation search indexing
Date: 2022-04-19 09:17:59
Message-ID: F9DEE225-C182-4DCC-8AB5-B49D7D2AC71B@yesql.se
Lists: pgsql-www

> On 18 Apr 2022, at 20:04, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> wrote:
>
> Magnus Hagander <magnus(at)hagander(dot)net> writes:
>> What would be the actual *advantage* of excluding them?
>
> The immediate problem is that Google is still preferentially returning old
> pages in some cases, e.g. top hit for "postgres gist gin index" is still
>
> https://www.postgresql.org/docs/9.1/textsearch-indexes.html
>
> Now maybe that just means they've not completely reindexed since we made
> the canonical-version change, so I'm content to wait awhile longer
> before concluding that that change wasn't sufficient. But we should be
> considering the possibility that it wasn't.

That particular 9.1 page is the second hit for "postgres gin index", after the
/current/ page for the GIN index chapter. (I first thought it was the first
hit, since I dismissed the "featured snippet" result as an ad.) DuckDuckGo
returns the 9.1 page or the current page seemingly at random for "postgres gin
gist index".

Searching for "postgres gist gin index <version>" on Google returns the correct
page for versions 8.3 through 9.4; for any other version (including older ones)
it returns /current/.

Removing the old content might improve search results, but it might also just
drop postgresql.org from those results altogether, bumping non-postgresql.org
content higher.

--
Daniel Gustafsson https://vmware.com/
