how can I find double entries in varchar columns where the content is
not 100% identical because of a spelling error or the person considered
it "looked nicer" that way?
I'd like to identify and then merge records of e.g. 'google', 'gogle',
Then I want to match abbrevations like 'A-Company Ltd.', 'a company
ltd.', 'A-Company Limited'
Is there a way to do this?
It would be OK just to list candidats up to be manually checked afterwards.
pgsql-sql by date
|Next:||From: Tom Lane||Date: 2008-04-16 03:23:32|
|Subject: Re: How to find double entries |
|Previous:||From: Steve Midgley||Date: 2008-04-15 17:13:46|
|Subject: Re: export CSV file through Java JDBC|