Summary: | doxygen.xml [and others] have invalid "[]" in regex | ||
---|---|---|---|
Product: | [Frameworks and Libraries] frameworks-syntax-highlighting | Reporter: | Gene Thomas <gene> |
Component: | syntax | Assignee: | KWrite Developers <kwrite-bugs-null> |
Status: | RESOLVED NOT A BUG | ||
Severity: | normal | CC: | jonathan.poelen, walter.von.entferndt |
Priority: | NOR | ||
Version First Reported In: | unspecified | ||
Target Milestone: | --- | ||
Platform: | Other | ||
OS: | Other | ||
Latest Commit: | Version Fixed In: | ||
Sentry Crash Report: |
Description
Gene Thomas
2021-02-23 06:58:25 UTC
[]] is valid in PCRE (the regex engine used): a ] as the first character of a character class does not close the class (the same applies to [^]]). ICU regex does not seem to support all PCRE syntax; for example, it lacks (?|...) and \R, which are also used.

Thanks, I've switched from ICU to PCRE, which is much faster. Part of the problem is that ICU jumps through hoops to be correct. For example, in German the case-insensitive regex "^ẞ$" matches "SS" (2 code points); no other regex implementation I have seen does this. ICU was getting into an internal infinite loop and throwing a "regex out of stack space" error after 0.5 sec, many times over, which made a .sh file take 30 seconds to syntax highlight!
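
The two points above can be illustrated with a small sketch. It uses Python's re module as a stand-in, which is an assumption: it is neither PCRE nor ICU, but it follows the same convention that a leading ] inside a character class is a literal. The helper names are hypothetical and exist only for this illustration. The second half shows why "^ẞ$" matching "SS" is unusual: an engine that compares character by character cannot match one code point against two, whereas full Unicode case folding maps both strings to "ss".

```python
import re

# Assumption: Python's re is used only for illustration; it is a different
# engine from PCRE and ICU but shares the leading-"]" rule.
CLOSE_BRACKET = re.compile(r"[]]")       # class containing only "]"
NOT_CLOSE_BRACKET = re.compile(r"[^]]")  # any character except "]"


def close_brackets(text):
    """Return every "]" found in text."""
    return CLOSE_BRACKET.findall(text)


def other_chars(text):
    """Return every character of text that is not "]"."""
    return NOT_CLOSE_BRACKET.findall(text)


def regex_matches_ss(text):
    """Case-insensitive match of "^ẞ$" using per-character comparison."""
    return re.search(r"^ẞ$", text, re.IGNORECASE) is not None


def full_fold_equal(a, b):
    """Compare two strings under full Unicode case folding."""
    return a.casefold() == b.casefold()


if __name__ == "__main__":
    print(close_brackets("a[0]"))      # [']']
    print(other_chars("a]b"))          # ['a', 'b']
    print(regex_matches_ss("SS"))      # False
    print(full_fold_equal("ẞ", "SS"))  # True
```

An engine that wants the last comparison to succeed inside a regex has to consider expansions of one code point into several, which is one plausible reason for the extra work described above.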