| From: | Dag Lem <dag(at)nimrod(dot)no> |
|---|---|
| To: | PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org> |
| Subject: | daitch_mokotoff module |
| Date: | 2021-12-03 20:07:29 |
| Message-ID: | yger1atbgfy.fsf@sid.nimrod.no |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-hackers |
Hello,
Please find attached a patch for the daitch_mokotoff module.
This implements the Daitch-Mokotoff Soundex System, as described in
https://www.avotaynu.com/soundex.htm
The module is used in production at Finance Norway.
In order to verify correctness, I have compared generated soundex codes
with corresponding results from the implementation by Stephen P. Morse
at https://stevemorse.org/census/soundex.html
Where soundex codes differ, the daitch_mokotoff module has been found
to be correct. The Morse implementation uses a few unofficial rules,
and also has an error in the handling of adjacent identical code
digits. Please see daitch_mokotoff.c for further references and
comments.
For reference, detailed instructions for soundex code comparison are
attached.
Best regards
Dag Lem
| Attachment | Content-Type | Size |
|---|---|---|
| daitch_mokotoff.patch | text/x-patch | 51.2 KB |
| soundex-comparison.sh | application/x-sh | 1.0 KB |
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Andrew Dunstan | 2021-12-03 20:11:11 | Re: The "char" type versus non-ASCII characters |
| Previous Message | Melanie Plageman | 2021-12-03 20:02:24 | Re: pg_stat_bgwriter.buffers_backend is pretty meaningless (and more?) |