Re: How to find double entries

From: "Tena Sakai" <tsakai(at)gallo(dot)ucsf(dot)edu>
To: "Andreas" <maps(dot)on(at)gmx(dot)net>, <pgsql-sql(at)postgresql(dot)org>
Subject: Re: How to find double entries
Date: 2008-04-16 05:05:58
Message-ID: FE44E0D7EAD2ED4BB2165071DB8E328C0378F1E8@egcrc-ex01.egcrc.org
Lists: pgsql-sql

Hi,

A recent Linux Magazine article (http://www.linux-mag.com/id/5679)
mentioned Full-Text Search Integration. I know nothing about it,
but it sounded interesting to me. You might want to check it out.

Regards,

Tena Sakai
tsakai(at)gallo(dot)ucsf(dot)edu

-----Original Message-----
From: pgsql-sql-owner(at)postgresql(dot)org on behalf of Andreas
Sent: Tue 4/15/2008 8:15 PM
To: pgsql-sql(at)postgresql(dot)org
Subject: [SQL] How to find double entries

Hi,

How can I find double entries in varchar columns where the content is
not 100% identical, either because of a spelling error or because the
person who entered it thought it "looked nicer" that way?

I'd like to identify and then merge records of e.g. 'google', 'gogle',
'guugle'

Then I want to match abbreviations like 'A-Company Ltd.', 'a company
ltd.', 'A-Company Limited'

Is there a way to do this?
It would be OK just to list candidates to be checked manually afterwards.

Regards
Andreas
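
For illustration only, and not part of the original thread: a minimal
sketch of the "list candidates for manual checking" idea, written in
Python with the standard-library difflib module. The helper names
normalize and candidate_pairs, the 'ltd' to 'limited' rule, and the 0.6
threshold are all assumptions chosen for the example names above, not
anything proposed in the thread. Inside PostgreSQL, the contrib modules
pg_trgm and fuzzystrmatch provide comparable similarity functions.

```python
import re
from difflib import SequenceMatcher
from itertools import combinations


def normalize(name):
    """Lowercase, turn punctuation into spaces, expand 'ltd' to 'limited'.

    The abbreviation rule is a hypothetical example; a real list of
    abbreviations would have to come from the data being cleaned.
    """
    s = name.lower()
    s = re.sub(r"[^a-z0-9]+", " ", s).strip()
    s = re.sub(r"\bltd\b", "limited", s)
    return s


def candidate_pairs(names, threshold=0.6):
    """Return (score, a, b) for every pair whose normalized similarity
    is at least `threshold`, best matches first.

    The threshold is an assumed starting point and needs tuning.
    """
    pairs = []
    for a, b in combinations(names, 2):
        score = SequenceMatcher(None, normalize(a), normalize(b)).ratio()
        if score >= threshold:
            pairs.append((round(score, 2), a, b))
    return sorted(pairs, reverse=True)


if __name__ == "__main__":
    names = ["google", "gogle", "guugle",
             "A-Company Ltd.", "a company ltd.", "A-Company Limited"]
    for score, a, b in candidate_pairs(names):
        print(score, repr(a), repr(b))
```

Comparing every pair is quadratic in the number of rows, so this only
suits small tables or a pre-filtered subset. The output is a list of
candidates to review by hand, not a decision about which records to merge.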
