Re: Difference between array column type and separate table

From: Mike Christensen <mike(at)kitchenpc(dot)com>
To: Alban Hertroys <dalroi(at)solfertje(dot)student(dot)utwente(dot)nl>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: Difference between array column type and separate table
Date: 2009-05-02 18:03:14
Message-ID: 7aa638e00905021103g5fae8f47v16bb25d4cdb3cda1@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

Yeah I talked with some other SQL guru friends of mine and they all agree
the separate table is the way to go for a number of reasons, so that's what
I'll stick with. It was just one of those things where you see a new
feature and try to find an excuse to try it out <g>
Thanks!
Mike

On Sat, May 2, 2009 at 3:14 AM, Alban Hertroys <
dalroi(at)solfertje(dot)student(dot)utwente(dot)nl> wrote:

> On May 2, 2009, at 9:33 AM, Mike Christensen wrote:
>
> Using this, I could completely get rid of ThreadTags and have a table like
>> this:
>>
>> create table Threads (
>> Id uuid not null,
>> Posted timestamp not null,
>> Subject varchar(255) not null,
>> Replies int4 not null,
>> PosterId uuid not null,
>> Tags int2[],
>> primary key (Id)
>> );
>>
>> and then find threads using the ANY function:
>>
>> select * from Threads where 5 = ANY (Tags);
>>
>> To me this seems cleaner, but I'm wondering about performance. If I had
>> millions of threads, is a JOIN going to be faster? I guess what I'm asking
>> about is the underlying implementation of ANY. Is it doing a sequential
>> search? Can I index Tags and will ANY() then use that index? Any other
>> opinions on what option is better?
>>
>
> If you modify the array the entire array needs to be rewritten. I don't
> think you'd want that with millions of threads in it. I don't think array
> values are indexable either. So while they're probably faster to query for
> small amounts of threads, the join is likely faster to query for large
> amounts (provided they're indexed properly, of course).
>
> If you want to be sure, play around with explain analyse with both
> implementations.
>
> Alban Hertroys
>
> --
> If you can't see the forest for the trees,
> cut the trees and you'll see there is no forest.
>
>
> !DSPAM:880,49fc1d1e129741592332518!
>
>
>

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Terry Lee Tucker 2009-05-02 21:15:09 Re: Two Questions Re: Warm Backup
Previous Message Daniel Verite 2009-05-02 17:08:00 Re: Two Questions Re: Warm Backup