From: | "Tena Sakai" <tsakai(at)gallo(dot)ucsf(dot)edu> |
---|---|
To: | "Andreas" <maps(dot)on(at)gmx(dot)net>, <pgsql-sql(at)postgresql(dot)org> |
Subject: | Re: How to find double entries |
Date: | 2008-04-16 05:05:58 |
Message-ID: | FE44E0D7EAD2ED4BB2165071DB8E328C0378F1E8@egcrc-ex01.egcrc.org |
Lists: | pgsql-sql |
Hi,
A recent Linux Magazine article (http://www.linux-mag.com/id/5679)
mentioned Full-Text Search Integration. I know nothing about it,
but it sounded interesting to me. You might want to
check it out.
Regards,
Tena Sakai
tsakai(at)gallo(dot)ucsf(dot)edu
-----Original Message-----
From: pgsql-sql-owner(at)postgresql(dot)org on behalf of Andreas
Sent: Tue 4/15/2008 8:15 PM
To: pgsql-sql(at)postgresql(dot)org
Subject: [SQL] How to find double entries
Hi,
How can I find double entries in varchar columns where the content is
not 100% identical, either because of a spelling error or because the
person entering it thought it "looked nicer" that way?
I'd like to identify and then merge records such as 'google', 'gogle',
'guugle'.
Then I want to match abbreviations like 'A-Company Ltd.', 'a company
ltd.', 'A-Company Limited'.
Is there a way to do this?
It would be OK just to list candidates to be checked manually afterwards.
Regards
Andreas
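
The kind of candidate listing asked for above can be sketched outside the database. The following is only an illustration and is not taken from the thread: it uses Python's standard `difflib` module rather than anything in PostgreSQL, the abbreviation table in `normalize` is a hypothetical example, and the 0.6 threshold is an arbitrary starting point that would need tuning against real data. Names are first normalized (lowercased, punctuation collapsed, a few abbreviations expanded), then every pair is scored and the pairs above the threshold are listed for manual review.

```python
import re
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical abbreviation table; extend it to fit the actual data.
ABBREVIATIONS = {"ltd": "limited", "inc": "incorporated", "corp": "corporation"}


def normalize(name):
    """Lowercase, turn punctuation into spaces, expand known abbreviations."""
    cleaned = re.sub(r"[^a-z0-9]+", " ", name.lower()).strip()
    return " ".join(ABBREVIATIONS.get(word, word) for word in cleaned.split())


def candidate_pairs(names, threshold=0.6):
    """Return (name_a, name_b, score) for pairs whose normalized forms are similar.

    The score is difflib's similarity ratio in the range 0.0 to 1.0.
    Comparing all pairs is quadratic, so this only suits modest table sizes.
    """
    normalized = [(name, normalize(name)) for name in names]
    pairs = []
    for (a, norm_a), (b, norm_b) in combinations(normalized, 2):
        score = SequenceMatcher(None, norm_a, norm_b).ratio()
        if score >= threshold:
            pairs.append((a, b, score))
    # Most similar pairs first, so the likeliest duplicates are reviewed first.
    return sorted(pairs, key=lambda pair: pair[2], reverse=True)


if __name__ == "__main__":
    sample = ["google", "gogle", "guugle",
              "A-Company Ltd.", "a company ltd.", "A-Company Limited"]
    for a, b, score in candidate_pairs(sample):
        print(f"{score:.2f}  {a!r}  <->  {b!r}")
```

The output is a list of candidates only; merging still needs a human decision. Inside PostgreSQL itself, the contrib modules `fuzzystrmatch` and `pg_trgm` provide comparable string-similarity functions and may be a better fit for large tables.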