Re: Slow duplicate deletes

From: Michael Wood <esiotrot(at)gmail(dot)com>
To: DrYSG <ygutfreund(at)draper(dot)com>
Cc: pgsql-novice(at)postgresql(dot)org
Subject: Re: Slow duplicate deletes
Date: 2012-03-06 08:14:30
Message-ID: CAP6d-HWTDgsbf2-Rr+umdm4ZboEFYf2RMLNeLebZ-OhueaCa=Q@mail.gmail.com
Lists: pgsql-novice

On 5 March 2012 22:43, DrYSG <ygutfreund(at)draper(dot)com> wrote:
> One point I might not have made clear. The reason I want to remove duplicates
> is that the column "data_object.unique_id" became non-unique (someone added
> duplicate rows). So I added a bigserial column (idx) to uniquely identify the
> rows, and I was using SELECT MIN(idx) with GROUP BY to pick just one of each
> set of duplicated rows.
>
> I am going to try out some of your excellent suggestions. I will report back
> on how they are working.
>
> One idea that was given to me was the following (what do you think, Merlin?)
>
> CREATE TABLE portal.new_metadata AS
> SELECT DISTINCT ON ("data_object.unique_id") *
> FROM portal.metadata
> ORDER BY "data_object.unique_id", idx;
>
> Or something of this ilk should be faster because it only needs to do a
> sort on "data_object.unique_id" and then an insert. After you have
> verified the results you can do:
>
> BEGIN;
> ALTER TABLE portal.metadata RENAME TO metadata_old;
> ALTER TABLE portal.new_metadata RENAME TO metadata;
> COMMIT;

This sounds like a good way to go, but if you have foreign keys
pointing at portal.metadata I think you will need to drop and recreate
them after the rename.
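
You can see which foreign keys reference portal.metadata by querying
the system catalogs, e.g. (a sketch, untested):

SELECT conrelid::regclass AS referencing_table,
       conname,
       pg_get_constraintdef(oid) AS definition
FROM pg_constraint
WHERE contype = 'f'                             -- foreign keys only
  AND confrelid = 'portal.metadata'::regclass; -- that point at metadata

Then drop each one before the swap and re-add it afterwards. The
table, constraint and column names here are made up for illustration:

ALTER TABLE portal.some_table
  DROP CONSTRAINT some_table_metadata_fkey;
-- ... rename the tables ...
ALTER TABLE portal.some_table
  ADD CONSTRAINT some_table_metadata_fkey
  FOREIGN KEY (metadata_idx) REFERENCES portal.metadata (idx);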
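
Also, before swapping the tables it would be worth a quick sanity
check that the new table really does have one row per id, e.g.
(untested):

-- the two counts here should be equal
SELECT count(*) AS total_rows,
       count(DISTINCT "data_object.unique_id") AS distinct_ids
FROM portal.new_metadata;

-- and distinct_ids should match the count in the original table
SELECT count(DISTINCT "data_object.unique_id") FROM portal.metadata;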
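
For comparison, I take it the delete you were running was roughly
along these lines (a sketch based on your description, not your
actual query):

DELETE FROM portal.metadata
WHERE idx NOT IN (
    SELECT MIN(idx)
    FROM portal.metadata
    GROUP BY "data_object.unique_id"
);

With a large table that NOT IN tends to be expensive, which is why
rebuilding the table with DISTINCT ON (ordered by idx, so it also
keeps the lowest idx per id) should work out faster.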

--
Michael Wood <esiotrot(at)gmail(dot)com>
