Collect metrics at 2am and 2pm UTC #15
Merged
We're using PyPIStats, which in turn relies on PyPI, which publishes download records as a public dataset on Google BigQuery.
According to https://pypistats.org/faqs, the daily data update begins at 01:00:00 UTC and should take about 10 minutes.
Instead of collecting data at 00:00 and 12:00 UTC, we should collect it at 02:00 and 14:00 UTC so that we pick up new data soon after it becomes available.
Note that we collect twice a day so that no data is missed if one of the two workflow runs fails. Both runs are expected to collect exactly the same data if neither fails.
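
To make the timing argument concrete, here is a rough sketch that compares the two schedules. It assumes the update starts at 01:00 UTC and finishes about 10 minutes later, as stated above; the function names and the reference date are hypothetical and are not part of the workflow itself.

```python
from datetime import datetime, timedelta, timezone

# Figures taken from the description above (per the pypistats FAQ).
UPDATE_START_HOUR_UTC = 1
UPDATE_DURATION = timedelta(minutes=10)  # approximate, not guaranteed


def delay_after_update(collect_hour_utc: int) -> timedelta:
    """Time between the daily update finishing and the next run at this hour."""
    day = datetime(2024, 1, 1, tzinfo=timezone.utc)  # arbitrary reference day
    ready = day + timedelta(hours=UPDATE_START_HOUR_UTC) + UPDATE_DURATION
    run = day + timedelta(hours=collect_hour_utc)
    if run < ready:
        # This run happens before the update, so it only sees the new data
        # on the following day.
        run += timedelta(days=1)
    return run - ready


def first_delay(collect_hours_utc) -> timedelta:
    """Shortest wait until any scheduled run sees the freshly updated data."""
    return min(delay_after_update(h) for h in collect_hours_utc)


if __name__ == "__main__":
    print("old schedule (0, 12):", first_delay((0, 12)))
    print("new schedule (2, 14):", first_delay((2, 14)))
```

Under these assumptions, the old schedule first sees fresh data about 10 hours 50 minutes after the update finishes, while the new schedule sees it after about 50 minutes. The second run of the day acts only as a fallback.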