This is so that we also get records for tests that are already marked as flaky. It should probably be done only for the walk command.