daitch_mokotoff module

From: Dag Lem <dag(at)nimrod(dot)no>
To: PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject: daitch_mokotoff module
Date: 2021-12-03 20:07:29
Message-ID: yger1atbgfy.fsf@sid.nimrod.no
Lists: pgsql-hackers

Hello,

Please find attached a patch for the daitch_mokotoff module.

This implements the Daitch-Mokotoff Soundex System, as described in
https://www.avotaynu.com/soundex.htm

The module is used in production at Finance Norway.

In order to verify correctness, I have compared generated soundex codes
with corresponding results from the implementation by Stephen P. Morse
at https://stevemorse.org/census/soundex.html

Where soundex codes differ, the daitch_mokotoff module has been found
to be correct. The Morse implementation uses a few unofficial rules,
and also has an error in the handling of adjacent identical code
digits. Please see daitch_mokotoff.c for further references and
comments.
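
To illustrate what "adjacent identical code digits" refers to, here is a
minimal Python sketch. It is not taken from daitch_mokotoff.c; the function
name collapse_adjacent and the six-digit zero padding are assumptions made
for illustration, and the published rules include exceptions that this
sketch does not model.

```python
def collapse_adjacent(digits: str, length: int = 6) -> str:
    """Drop every code digit that repeats the digit immediately before it,
    then truncate or zero-pad the result to a fixed length.

    Hypothetical helper for illustration only; it ignores the special
    cases in the real Daitch-Mokotoff rules.
    """
    out = []
    for d in digits:
        # Keep a digit only if it differs from the previously kept one.
        if not out or out[-1] != d:
            out.append(d)
    return "".join(out)[:length].ljust(length, "0")


if __name__ == "__main__":
    # Raw per-letter digits 3, 7, 7 collapse to 3, 7 and are padded.
    print(collapse_adjacent("377"))  # 370000
```

An implementation that skips this step, or applies it at the wrong point,
would produce a different code for the same input, which is the kind of
difference the comparison above is meant to expose.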

For reference, detailed instructions for the soundex code comparison are
attached (soundex-comparison.sh).

Best regards

Dag Lem

Attachment              Content-Type        Size
daitch_mokotoff.patch   text/x-patch        51.2 KB
soundex-comparison.sh   application/x-sh    1.0 KB
