jed-users mailing list

[2007 Date Index] [2007 Thread Index] [Other years]
[Thread Prev] [Thread Next]      [Date Prev] [Date Next]

Re: UTF-8 and Regular Expressions


Hi John,

"John E. Davis" <davis@xxxxxxxxxxxxx> wrote:
> For those of you that use jed with UTF-8 encoded text, has the lack of
> true UTF-8 support by the regular expression functions been much of an
> impediment?

No, I had no problems or better said, I stumbled over none. But I found
another problem that might be related to this. Try to match the
following line with Test.*x (Menu->Search->Regexp Search). You see get
also the string “ ab” marked as match, which is confusing.

Test äöü x ab

> The reason I ask is that slang 3.0 will use PCRE as its regular
> expression library.  In anticipation of this, I plan to integrate PCRE
> with jed in the near future.  In fact, I already have a version that
> uses PCRE.

Cool. I really need it for my latex mode.

> My inclination is that if the lack of UTF-8 support by the current
> regular expression engine is not much of a problem, then I think that
> by default, regular expressions will be compiled using byte-semantics,
> independent of whether or not jed is running in UTF-8 mode.

If it really has no implact than you might do so. But can you add a
switch to configure to easily change it on compile time?

Bye, Jörg.
-- 
Was der Bauer nicht kennt, das frisst er nicht. Würde der Städter kennen,
was er frisst, er würde umgehend Bauer werden.
                                                       Oliver Hassencamp

--------------------------
To unsubscribe send email to <jed-users-request@xxxxxxxxxxx> with
the word "unsubscribe" in the message body.
Need help? Email <jed-users-owner@xxxxxxxxxxx>.


[2007 date index] [2007 thread index]
[Thread Prev] [Thread Next]      [Date Prev] [Date Next]