Rework disabled tests doc.

In retrospect, my prior attempt at documenting ways of disabling tests was too ambiguous. This rewrite collapses a couple of cases and provides clearer examples of the various mechanisms. Hopefully this will be more useful. Change-Id: I024ef5398c9a1fe9024e923a367a1b2ad1e23daa Reviewed-on: https://chromium-review.googlesource.com/c/chromium/src/+/2443632Reviewed-by: Chan Li <chanli@chromium.org> Commit-Queue: Dirk Pranke <dpranke@google.com> Cr-Commit-Position: refs/heads/master@{#814771}

Rework disabled tests doc.
In retrospect, my prior attempt at documenting ways of disabling tests was too ambiguous. This rewrite collapses a couple of cases and provides clearer examples of the various mechanisms. Hopefully this will be more useful. Change-Id: I024ef5398c9a1fe9024e923a367a1b2ad1e23daa Reviewed-on: https://chromium-review.googlesource.com/c/chromium/src/+/2443632Reviewed-by: Chan Li <chanli@chromium.org> Commit-Queue: Dirk Pranke <dpranke@google.com> Cr-Commit-Position: refs/heads/master@{#814771}
8b9103aa · Dirk Pranke · Commit Bot · 954feaf1 · 8b9103aa
Commit 8b9103aa authored Oct 07, 2020 by Dirk Pranke Committed by Commit Bot Oct 07, 2020
Hide whitespace changes
Inline Side-by-side

Showing with 50 additions and 51 deletions

docs/testing/on_disabling_tests.md docs/testing/on_disabling_tests.md +50 -51

No files found.
--- a/docs/testing/on_disabling_tests.md
+++ b/docs/testing/on_disabling_tests.md
 # On disabling tests
 Sometimes you don't want to run a test that you've written (or that
-you've imported, like conformance tests).
+you've imported, like conformance tests). The test might not be possible to
+run in a particular configuration, or be temporarily broken by another
+change, or be flaky, or simply not work yet. In these cases (and perhaps others),
+you should disable the test :).
-There are a number of different ways to "disable" a test.
+There are a number of different ways to do so:
 *   If the test is an entire binary or test suite, the first (and
-    simplest) first way is to simply not build (or build, but not run)
+    simplest) way is to simply not build (or build, but not run)
-    the test binary, of course.
+    the test binary, of course. This makes sense for binaries that
+    are specific to particular build configurations (e.g., Android JUnit
+    tests don't need to be built on Windows).
-*   The second way (for tests in C++) is to not compile a test in a
+*   A second way (for tests in C++) is to not compile a test in a
-    given configuration, e.g., #ifndef WIN. In this situation, the only
+    given configuration, e.g., `#ifndef WIN`. In this situation, the only
    way you would know the test existed and was disabled would be to
-    examine the source code.  In most cases today, we use this path for
+    parse the source code. We often do this today for tests that will
-    tests that will never be enabled, but sometimes we do this to
+    never be enabled in particular build configurations, but sometimes we do
-    temporarily skip tests as well.
+    this to temporarily skip tests as well.
-*   The third way, for GTest-based tests, is a variant of the second
+*   A third way is to take advantage of features in your testing framework to
-    way: instead of compiling it out completely, you change the name, so
+    skip over tests. Examples include involve adding `DISABLED_` to the test
-    that you simply don't run the test by default. But, at least in this
+    method name for GTest-based tests, `@unittest.skip` for Python-based tests,
-    case, you can potentially determine at runtime the list of disabled
+    or using the
-    tests, because the code is still in the binary. And, potentially you
+    [DisabledTest](../../base/test/android/javatests/src/org/chromium/base/test/DisabledTest.java)
-    can still force the test to be run via a command line flag.
+    annotation for JUnit-based Java tests.  In these cases, you don't run the
+    test by default, but you can determine the list of disabled tests at
-*   A fourth way is for a test harness to skip over a test at runtime
+    runtime because the tests are present in the executable, and you may still
-    for some reason, e.g., the harness determines that you're running on
+    be able to force the test to be run via a command-line flag.
-    a machine w/ no GPU and so the GPU tests are never invoked. Here you
-    can also ask the harness which tests are being skipped.
+*   Fourth, for test frameworks that support
+    [expectations files or filter files](https://bit.ly/chromium-test-list-format),
-*   A fifth way is for a test harness to run the test, but then have the
+    you can use them to decide what to run and what to skip. This moves
-    test detect at runtime that it should skip or exit early (e.g., the
+    the mechanisms out of the source code and into separate files; there are
-    test itself could detect there was no GPU). Depending on how the
+    advantages and disadvantages to this. The main advantage is that it
-    test does this, it may be impossible for you to really detect that
+    can make it easier to write tooling to disable tests, and the main
-    this happened, and you'd just view the test as 'passing'.
+    disadvantage is that it moves the mechanism away from the code it affects,
+    potentially making it harder to understand what's going on.
-*   A sixth way is to use [expectations files and filter
-    files](https://bit.ly/chromium-test-list-format), and have the test
+*   Finally, the test harness can run the test, but the test itself
-    harness use that file to decide what to run and what to skip.
+    might detect at runtime that it should exit early for some reason
+    rather than actually executing the code paths you'd normally want to
-In theory, we should eventually consistently have either or both of
+    test. For example, if you have a test for some code path that requires
-expectations files and filter files for all test steps. We still don't
+    a GPU, but there's no GPU on the machine, the test might check for a
-have this consistently everywhere in Chrome (as of 2020-09-18), but
+    GPU and exit early with "success".
-folks are working on them expanding the number of kinds of tests that do
-have them. Once we do have them, we can expect people to stop using at
+If you want to be able to determine a global picture of which tests
-least the third path.
+were disabled, you can either parse BUILD files, expectations and filter
+files, and source code to try and figure that out, or require the tests be
-As you can see from the above, it's difficult if not impossible to
+present in test binaries (i.e., not compiled out) and then run the test
-determine "all of the disabled tests" at any point in time. At best,
+binaries in order to collect the lists of disabled tests and report them
-you'd have to decide what subsets of disabled tests that you're
+to a central system.
-targeting, and which you'd like to ignore.
+Parsing code can be straightforward for some types of tests, but
-You could also choose to "ban" certain approaches, but those bans might
+difficult-to-impractical to do correctly for others.
-be hard to enforce, and some approaches may practically be necessary in
-some cases.
-Ultimately, the more temporary disabling we can do via the sixth path,
-the better off we probably are: the sixth path is the easiest for us to
-write tooling to support and the most generic of all of the approaches.