Skip to content

[RegExp] case-insensitive matching misses characters #517

@goyakin

Description

@goyakin

When we generate the character casing equivalence sets for RegExp using UnicodeData.txt, we start from lowercase letters and then add the characters that they map to. However, this misses non-letter characters. For example, '\u0345', which is in the "Mn" (Mark, Nonspacing) category, should match '\u0399' when the case-insensitive flag is passed (even without the "unicode" flag), but we fail to do so.

When this is fixed, the following test should pass:

var didMatch = /\u0345/i.test('\u0399');
WScript.Echo(didMatch);

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions