We have a few tests that are intermittently flaky. Let's try this to see if we can get them a bit more reliable.
Remove pending tests that are never run on CI (i.e. require `--online`), remove fixtures for those tests and just make `--official-cmd-taps` run by `--online` instead.
brew services