From: | "Don Walker" <don(dot)walker(at)versaterm(dot)com> |
---|---|
To: | <pgsql-general(at)postgresql(dot)org> |
Subject: | Re: Use of ISpell dictionaries with tsearch2 - what is the point? |
Date: | 2006-04-28 20:15:59 |
Message-ID: | 001901c66b00$90c35410$dbd849c6@donxp |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general |
I realized shortly after I sent this email that I could use a synonym
dictionary to solve problem #2. To construct it myself I'd have to determine
the common misspellings and create synonyms for them. So I have two more
questions:
2.1 Are there any canned synonym dictionaries available the deal with
misspellings in English and/or French?
2.2 Are there any clever linguistic algorithms that can partly solve the
same problem?
-----Original Message-----
From: pgsql-general-owner(at)postgresql(dot)org
[mailto:pgsql-general-owner(at)postgresql(dot)org] On Behalf Of Don Walker
Sent: April 28, 2006 15:11
To: pgsql-general(at)postgresql(dot)org
Subject: [GENERAL] Use of ISpell dictionaries with tsearch2 - what is the
point?
I'm new to using tsearch2 and am trying to understand why I would want to
use an ISpell dictionary before the stemming dictionary. I'd originally
hoped that ISpell would suggest corrections for misspelled words as the
documents that I will be indexing will contain a lot of spelling mistakes.
>From what I now understand ISpell dictionaries only recognize properly
spelled words. This means that any misspelled word will be handled by the
stemming dictionary and usually just passed as is.
1. If I am correct about this then what is the point of using the ISpell
dictionary in the first place?
2. Is there a solution for correcting spelling mistakes in the documents you
index? I have seen the readme files for pg_trgm,
http://www.sai.msu.su/~megera/postgres/gist/, which would allow me to
suggest other terms for a query if the misspellings were common enough. I'd
rather fix the problem at index time so that querying with the proper term
would find any misspelled terms (within reason).
---------------------------(end of broadcast)---------------------------
TIP 6: explain analyze is your friend
From | Date | Subject | |
---|---|---|---|
Next Message | Geoffrey | 2006-04-28 20:34:32 | Re: How to define + operator for strings |
Previous Message | Bruce Momjian | 2006-04-28 19:56:19 | Re: Autovacuum Logging |