Fix handling of same-skeleton domains in IDN spoof checker

A previous fix for bug 898343 attempted to handle skeletons that are the same as their domains (e.g. test[.]net). The fix introduced an early return which caused skeleton lookups to incorrectly match subtrings: If a domain name contained a top domain as its prefix, spoof checker assumed a skeleton match without checking if it reached the end of the search string (i.e. the skeleton being looked up). E.g. Skeleton of atést[.]net (atest[.]net) matched the skeleton of test[.]net (test[.]net). This CL properly checks that all of the search string is consumed before returning a match. Bug: 925199 Change-Id: Ia548ea686915463e0f93e6bce1bad5721f324821 Reviewed-on: https://chromium-review.googlesource.com/c/1437716 Commit-Queue: Mustafa Emre Acer <meacer@chromium.org> Reviewed-by: Tommy Li <tommycli@chromium.org> Cr-Commit-Position: refs/heads/master@{#626867}

Fix handling of same-skeleton domains in IDN spoof checker
A previous fix for bug 898343 attempted to handle skeletons that are the same as their domains (e.g. test[.]net). The fix introduced an early return which caused skeleton lookups to incorrectly match subtrings: If a domain name contained a top domain as its prefix, spoof checker assumed a skeleton match without checking if it reached the end of the search string (i.e. the skeleton being looked up). E.g. Skeleton of atést[.]net (atest[.]net) matched the skeleton of test[.]net (test[.]net). This CL properly checks that all of the search string is consumed before returning a match. Bug: 925199 Change-Id: Ia548ea686915463e0f93e6bce1bad5721f324821 Reviewed-on: https://chromium-review.googlesource.com/c/1437716 Commit-Queue: Mustafa Emre Acer <meacer@chromium.org> Reviewed-by: Tommy Li <tommycli@chromium.org> Cr-Commit-Position: refs/heads/master@{#626867}
3035169b · Mustafa Emre Acer · Commit Bot · a0202087 · 3035169b · 3035169b
Commit 3035169b authored Jan 29, 2019 by Mustafa Emre Acer Committed by Commit Bot Jan 29, 2019
Showing with 28 additions and 17 deletions

components/url_formatter/idn_spoof_checker.cc components/url_formatter/idn_spoof_checker.cc +14 -16

components/url_formatter/url_formatter_unittest.cc components/url_formatter/url_formatter_unittest.cc +14 -1

No files found.
--- a/components/url_formatter/idn_spoof_checker.cc
+++ b/components/url_formatter/idn_spoof_checker.cc
@@ -35,24 +35,22 @@ class TopDomainPreloadDecoder : public net::extras::PreloadDecoder {
    if (!reader->Next(&is_same_skeleton))
      return false;

-    if (is_same_skeleton) {
-      *out_found = true;
-      result_ = search;
-      return true;
-    }
-
-    bool has_com_suffix = false;
-    if (!reader->Next(&has_com_suffix))
-      return false;
-
    std::string top_domain;
-    for (char c;; top_domain += c) {
-      huffman_decoder().Decode(reader, &c);
-      if (c == net::extras::PreloadDecoder::kEndOfTable)
-        break;
+    if (is_same_skeleton) {
+      top_domain = search;
+    } else {
+      bool has_com_suffix = false;
+      if (!reader->Next(&has_com_suffix))
+        return false;
+
+      for (char c;; top_domain += c) {
+        huffman_decoder().Decode(reader, &c);
+        if (c == net::extras::PreloadDecoder::kEndOfTable)
+          break;
+      }
+      if (has_com_suffix)
+        top_domain += ".com";
    }
-    if (has_com_suffix)
-      top_domain += ".com";

    if (current_search_offset == 0) {
      *out_found = true;

--- a/components/url_formatter/url_formatter_unittest.cc
+++ b/components/url_formatter/url_formatter_unittest.cc
@@ -1012,8 +1012,21 @@ const IDNTestCase idn_cases[] = {

    // Test that top domains whose skeletons are the same as the domain name are
    // handled properly. In this case, tést.net should match test.net top
-    // domain.
+    // domain and not be converted to unicode.
    {"xn--tst-bma.net", L"t\x00e9st.net", false},
+    // Variations of the above, for testing crbug.com/925199.
+    // some.tést.net should match test.net.
+    {"some.xn--tst-bma.net", L"some.t\x00e9st.net", false},
+    // The following should not match test.net, so should be converted to
+    // unicode.
+    // ést.net (a suffix of tést.net).
+    {"xn--st-9ia.net", L"\x00e9st.net", true},
+    // some.ést.net
+    {"some.xn--st-9ia.net", L"some.\x00e9st.net", true},
+    // atést.net (tést.net is a suffix of atést.net)
+    {"xn--atst-cpa.net", L"at\x00e9st.net", true},
+    // some.atést.net
+    {"some.xn--atst-cpa.net", L"some.at\x00e9st.net", true},

    // Modifier-letter-voicing should be blocked (wwwˬtest.com).
    {"xn--wwwtest-2be.com", L"www\x02ectest.com", false},