Commit 11d7f93d authored by Joe DeBlasio, committed by Commit Bot

Add ы (U+044B) and ԍ (U+050D) to set of Cyrillic look-alikes.

This CL adds ы and ԍ to the set of Cyrillic characters that look like
Latin characters, as well as a test case to verify.

Bug: 884693
Change-Id: I0329b1c2bce2b733463346d69a2bae5d65234085
Reviewed-on: https://chromium-review.googlesource.com/c/chromium/src/+/1823320
Reviewed-by: Mustafa Emre Acer <meacer@chromium.org>
Commit-Queue: Mustafa Emre Acer <meacer@chromium.org>
Commit-Queue: Joe DeBlasio <jdeblasio@chromium.org>
Auto-Submit: Joe DeBlasio <jdeblasio@chromium.org>
Cr-Commit-Position: refs/heads/master@{#699536}
parent bfde1c0b
@@ -147,7 +147,7 @@ IDNSpoofChecker::IDNSpoofChecker() {
   // These Cyrillic letters look like Latin. A domain label entirely made of
   // these letters is blocked as a simplified whole-script-spoofable.
   cyrillic_letters_latin_alike_ = icu::UnicodeSet(
-      icu::UnicodeString::fromUTF8("[асԁеһіјӏорԗԛѕԝхуъЬҽпгѵѡ]"), status);
+      icu::UnicodeString::fromUTF8("[аысԁеԍһіјӏорԗԛѕԝхуъЬҽпгѵѡ]"), status);
   cyrillic_letters_latin_alike_.freeze();
   cyrillic_letters_ =
......
@@ -391,6 +391,9 @@ const IDNTestCase kIdnCases[] = {
     // ѕсоԗе.com is Cyrillic with Latin lookalikes.
     {"xn--e1ari3f61c.com", L"\x0455\x0441\x043e\x0517\x0435.com", false},
+    // ыоԍ.com is Cyrillic with Latin lookalikes
+    {"xn--n1az74c.com", L"\x044b\x043e\x050d.com", false},
     // Combining Diacritic marks after a script other than Latin-Greek-Cyrillic
     {"xn--rsa2568fvxya.com", L"\xd55c\x0301\xae00.com", false},  // 한́글.com
     {"xn--rsa0336bjom.com", L"\x6f22\x0307\x5b57.com", false},  // 漢̇字.com
......
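
The comment in the first hunk says that a domain label made entirely of the look-alike letters is blocked. The following is a minimal, ICU-free sketch of that "every character is in the set" idea, not Chromium's actual implementation: the names LatinAlikeCyrillic and IsMadeOfLatinAlikeCyrillic are hypothetical, the set is written out as code points matching the updated string in the diff, and the real checker may apply further steps (for example, handling digits or TLD-specific rules) that are not shown in this CL.

```cpp
#include <algorithm>
#include <string>
#include <unordered_set>

// Hypothetical stand-in for cyrillic_letters_latin_alike_ after this CL.
// Code points correspond to "аысԁеԍһіјӏорԗԛѕԝхуъЬҽпгѵѡ".
const std::unordered_set<char32_t>& LatinAlikeCyrillic() {
  static const std::u32string kLetters =
      U"\u0430\u044b\u0441\u0501\u0435\u050d\u04bb\u0456\u0458\u04cf"
      U"\u043e\u0440\u0517\u051b\u0455\u051d\u0445\u0443\u044a\u042c"
      U"\u04bd\u043f\u0433\u0475\u0461";
  static const std::unordered_set<char32_t> kSet(kLetters.begin(),
                                                 kLetters.end());
  return kSet;
}

// Returns true when a non-empty label consists only of look-alike letters.
// Assumption: the label has already been decoded from Punycode to UTF-32.
bool IsMadeOfLatinAlikeCyrillic(const std::u32string& label) {
  if (label.empty()) return false;
  const auto& set = LatinAlikeCyrillic();
  return std::all_of(label.begin(), label.end(),
                     [&set](char32_t c) { return set.count(c) > 0; });
}
```

Under these assumptions, the label from the new test case (U"\u044b\u043e\u050d", that is, ыоԍ) is flagged, while a label containing a Cyrillic letter outside the set, such as ж (U+0436), is not.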