Re: tsvector limitations

From: Tim <elatllat(at)gmail(dot)com>
To: Kevin Grittner <Kevin(dot)Grittner(at)wicourts(dot)gov>
Cc: pgsql-admin(at)postgresql(dot)org
Subject: Re: tsvector limitations
Date: 2011-06-14 15:08:14
Message-ID: BANLkTikz+MY1DoO5UcxzJbG+VNgGKn0HFg@mail.gmail.com
Lists: pgsql-admin

Hi Kevin,

Thanks for the reply.
I suspect there must have been some testing when the tsvector was created,
and I would be surprised if there is no general "how big is this object"
method in PostgreSQL.
That said, perhaps this is the wrong mailing list for this question.
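
On the "how big is this object" point: PostgreSQL does provide a general size function, pg_column_size(), which reports the number of bytes used to store a value, so the questions quoted below can be answered by measurement. A minimal sketch, assuming a Python client with a DB-API driver such as psycopg2; the helper names build_size_query and measure are hypothetical, and an input that is too large may simply make to_tsvector() raise an error instead of returning a size.

```python
# Sketch only. pg_column_size(), to_tsvector() and octet_length() are
# standard PostgreSQL functions; the helper names and the choice of a
# psycopg2-style connection are assumptions made for illustration.

SIZE_QUERY = (
    "SELECT pg_column_size(to_tsvector(%s::regconfig, %s::text)) AS tsvector_bytes, "
    "octet_length(%s::text) AS text_bytes"
)


def build_size_query(text, config="english"):
    """Return (sql, params) that measure the tsvector built from `text`."""
    # The text is passed twice: once to build the tsvector, once to report
    # the size of the raw input for comparison.
    return SIZE_QUERY, (config, text, text)


def measure(conn, text, config="english"):
    """Run the query on an open DB-API connection and return
    (tsvector_bytes, text_bytes). Requires a live PostgreSQL server."""
    sql, params = build_size_query(text, config)
    with conn.cursor() as cur:
        cur.execute(sql, params)
        return cur.fetchone()
```

Reading a file such as a word list or a novel into a string and passing it to measure() would give the figures asked about, without storing the document in a table first.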

If it's "bad design" to store large text documents (PDF, DOCX, etc.) as BLOBs
or on a filesystem
and make them searchable with tsvectors, can you suggest a good design?
If making your own search implementation is "better", what is the point of
tsvectors?
Maybe I'm missing something here?

On Tue, Jun 14, 2011 at 10:18 AM, Kevin Grittner <
Kevin(dot)Grittner(at)wicourts(dot)gov> wrote:

> Tim <elatllat(at)gmail(dot)com> wrote:
>
> > How many bytes of a tsvector would a 32MB ascii english unique
> > word list make?
> > How many bytes of a tsvector would something like "The Lord of the
> > Rings.txt" make?
>
> It would appear that nobody has run into this as a limit, nor done
> those specific tests. Storing a series of novels as a single value
> in a database seems to be bad design, of course, which might explain
> the lack of research.
>
> If you get some numbers through testing, please post them for future
> reference.
>
> -Kevin
>
