Re: How to find double entries

From: Craig Ringer <craig(at)postnewspapers(dot)com(dot)au>
To: Andreas <maps(dot)on(at)gmx(dot)net>
Cc: pgsql-sql(at)postgresql(dot)org
Subject: Re: How to find double entries
Date: 2008-04-16 04:22:57
Message-ID: 48057F21.30607@postnewspapers.com.au
Lists: pgsql-sql

Andreas wrote:
> Hi,
>
> how can I find double entries in varchar columns where the content is
> not 100% identical because of a spelling error or the person considered
> it "looked nicer" that way?

When doing some near-duplicate elimination as part of converting a
legacy data set to PostgreSQL, I found the `fuzzystrmatch' contrib module
immensely helpful.

http://www.postgresql.org/docs/current/static/fuzzystrmatch.html
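
To give a rough idea of what this kind of matching does, here is a
minimal Python sketch of edit-distance comparison, the same idea as the
levenshtein() function that fuzzystrmatch provides. It is only an
illustration, not the module's implementation and not the queries used
in the conversion mentioned above. The sample names, the near_duplicates
helper, and the threshold of 2 are all made up for the example.

```python
from itertools import combinations


def levenshtein(a: str, b: str) -> int:
    """Edit distance where each insert, delete, or substitute costs 1."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def near_duplicates(values, max_distance=2):
    """Hypothetical helper: list pairs whose lowercased forms are close."""
    pairs = []
    for x, y in combinations(values, 2):
        d = levenshtein(x.lower(), y.lower())
        if d <= max_distance:
            pairs.append((x, y, d))
    return pairs


# Made-up sample data standing in for a varchar column.
names = ["Mueller GmbH", "Muller GmbH", "Mueller Gmbh", "Schmidt AG"]
for pair in near_duplicates(names):
    print(pair)
```

In the database itself the equivalent would presumably be a self-join
on the table, comparing the column with levenshtein() and keeping pairs
under some threshold. Comparing every pair grows quadratically with the
row count, so on a large table it may be worth narrowing the candidates
first, for example by grouping on soundex() or metaphone() output.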

--
Craig Ringer
