Re: [GENERAL] Feature Request: bigtsvector

From: Bruce Momjian <bruce(at)momjian(dot)us>
To: Ildus Kurbangaliev <i(dot)kurbangaliev(at)postgrespro(dot)ru>
Cc: CPT <cpt(at)novozymes(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgreSQL(dot)org>
Subject: Re: [GENERAL] Feature Request: bigtsvector
Date: 2015-09-09 15:17:18
Message-ID: 20150909151718.GA10124@momjian.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general pgsql-hackers

On Wed, Sep 9, 2015 at 06:14:28PM +0300, Ildus Kurbangaliev wrote:
> On Wed, 9 Sep 2015 10:52:02 -0400
> Bruce Momjian <bruce(at)momjian(dot)us> wrote:
>
> > On Wed, Jun 17, 2015 at 07:58:21AM +0200, CPT wrote:
> > > Hi all;
> > >
> > > We are running a multi-TB bioinformatics system on PostgreSQL and
> > > use a denormalized schema in
> > > places with a lot of tsvectors aggregated together for centralized
> > > searching. This is
> > > very important to the performance of the system. These aggregate
> > > many documents (sometimes tens of thousands), many of which contain
> > > large numbers of references to other documents. It isn't uncommon
> > > to have tens of thousands of lexemes. The tsvectors hold mixed
> > > document id and natural language search information (all f which
> > > comes in from the same documents).
> > >
> > > Recently we have started hitting the 1MB limit on tsvector size. We
> > > have found it possible to
> > > patch PostgreSQL to make the tsvector larger but this changes the
> > > on-disk layout. How likely is
> > > it that either the tsvector size could be increased in future
> > > versions to allow for vectors up to toastable size (1GB logical)? I
> > > can't imagine we are the only ones with such a problem. Since, I
> > > think, changing the on-disk layout might not be such a good idea,
> > > maybe it would be worth considering having a new bigtsvector type?
> > >
> > > Btw, we've been very impressed with the extent that PostgreSQL has
> > > tolerated all kinds of loads we have thrown at it.
> >
> > Can anyone on hackers answer this question from June?
> >
>
> Hi, I'm working on patch now that removes this limit without changes (or
> small changes) of on-disk layout. I think it'll be ready during this
> month.

Oh, great, thanks.

--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com

+ Everyone has their own god. +

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Teodor Sigaev 2015-09-09 16:27:05 Re: jsonb value retrieval performance
Previous Message Ildus Kurbangaliev 2015-09-09 15:14:28 Re: [GENERAL] Feature Request: bigtsvector

Browse pgsql-hackers by date

  From Date Subject
Next Message dinesh kumar 2015-09-09 15:37:48 Re: [PATCH] SQL function to report log message
Previous Message Ildus Kurbangaliev 2015-09-09 15:14:28 Re: [GENERAL] Feature Request: bigtsvector