Re: Netflix Prize data

From: "Mark Woodward" <pgsql(at)mohawksoft(dot)com>
To: "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: "Gregory Stark" <stark(at)enterprisedb(dot)com>, "Greg Sabino Mullane" <greg(at)turnstep(dot)com>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: Netflix Prize data
Date: 2006-10-05 00:28:34
Message-ID: 21629.24.91.171.78.1160008114.squirrel@mail.mohawksoft.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

> "Mark Woodward" <pgsql(at)mohawksoft(dot)com> writes:
>> The rating, however, is one char 1~9. Would making it a char(1) buy
>> anything?
>
> No, that would actually hurt because of the length word for the char
> field. Even if you used the "char" type, which really is only one byte,
> you wouldn't win anything because of alignment issues. Personally I'd
> just go for three ints and a date, rather than trying to be cute with
> the rating.

Actually, the date is just days, right? I don't actualy need it too much.
So, create a small int for date and do this: smalldate =
date('1970-01-01') - rdate. And use small int for rating.

Column | Type | Modifiers
--------+----------+-----------
movie | integer |
client | integer |
day | smallint |
rating | smallint |

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Tom Lane 2006-10-05 00:40:03 Re: 8.2beta1 failure on IRIX
Previous Message Tom Lane 2006-10-05 00:18:11 Re: Netflix Prize data