BUG #5021: ts_parse doesn't recognize email addresses with underscores

From: "Dan O'Hara" <danarasoftware(at)gmail(dot)com>
To: pgsql-bugs(at)postgresql(dot)org
Subject: BUG #5021: ts_parse doesn't recognize email addresses with underscores
Date: 2009-08-28 13:59:41
Message-ID: 200908281359.n7SDxfaf044556@wwwmaster.postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs pgsql-hackers


The following bug has been logged online:

Bug reference: 5021
Logged by: Dan O'Hara
Email address: danarasoftware(at)gmail(dot)com
PostgreSQL version: 8.3.7
Operating system: win32
Description: ts_parse doesn't recognize email addresses with
underscores
Details:

In the following example,

select distinct token as email
from ts_parse('default', ' first_last(at)yahoo(dot)com ' )
where tokid = 4

ts_parse returns last(at)yahoo(dot)com rather than first_last(at)yahoo(dot)com It seems
that any text prior to the underscore is truncated. If the portion
following the underscore is only numeric, such as this example,

select distinct token as email
from ts_parse('default', ' bill_2000(at)yahoo(dot)com ' )
where tokid = 4

then ts_parse returns nothing at all.

section 3.2.3 of RFC 5322 indicates that underscores are valid characters in
an email address.

http://tools.ietf.org/html/rfc5322

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Merlin Moncure 2009-08-28 16:10:39 inconsistent composite type null handling in plpgsql out variable
Previous Message Alvaro Herrera 2009-08-28 13:38:14 Re: BUG #5019: Nao funciona

Browse pgsql-hackers by date

  From Date Subject
Next Message Tom Lane 2009-08-28 14:00:46 Re: Memory context usage
Previous Message Alvaro Herrera 2009-08-28 13:44:02 Re: 8.5 release timetable, again