Andrew Dunstan <andrew(at)dunslane(dot)net> writes:
> I ended up not using a regex, which seemed to be a little heavy handed,
> but just writing a small custom recognition function, that should (and I
> think does) mimic the pattern recognition for these tokens used by the
> backend lexer.
I looked at this and realized that it still doesn't do very well at
distinguishing $foo$ from other random uses of $. The problem is that
looking back at just the immediately preceding character isn't enough
context to tell whether a $ is part of an identifier. Consider the
This is a legal identifier according to PG 7.4. But how about
This is a syntax error in 7.4, and we propose to redefine it as an
integer literal '42' followed by a dollar-quote start symbol.
There's no way to tell these apart with a single-character lookback,
or indeed any fixed number of characters of lookback.
I begin to think that we'll really have to bite the bullet and convert
psql's input parser to use flex. If we're not scanning with exactly the
same rules as the backend uses, we're going to get the wrong answers.
regards, tom lane
In response to
pgsql-hackers by date
|Next:||From: Tom Lane||Date: 2004-02-14 17:00:48|
|Subject: Re: Persistent main memory Storage Manager |
|Previous:||From: Tom Lane||Date: 2004-02-14 16:10:08|
|Subject: Re: pg_restore problems and suggested resolution |
pgsql-patches by date
|Next:||From: Andrew Dunstan||Date: 2004-02-14 20:04:31|
|Subject: Re: [HACKERS] dollar quoting|
|Previous:||From: Nick Barr||Date: 2004-02-14 11:57:45|
|Subject: Re: TODO : Multiple inserts in a single statement|