Skip to content

Periodically scan access & resolution logs for requested but non-existent pages and DOIs #50

Closed
@adam3smith

Description

@adam3smith

We want to capture both unpublished/non-existent DOIs that are linked to and unpublished URLs, so we need two different data sources:

  1. For DOIs, rely on Datacite's resolution reports: https://stats.datacite.org/resolutions.html
  2. For URLs query our Dataverse's log

Let's try to get something approximately along these lines:

  • On the 15th of each month, query both sources above
  • Turn the DOIs into QDR dataverse URLs using the DOI suffix
  • Discard URLs that aren't valid (e.g. I'm seeing things like 10.5064/F68G8HMM. and 10.5064/ABCD in the Datacite resolution logs -- those aren't relevant to us)
  • Create a table that includes the URL and the number of failed resolutions/access attempts and email it to [email protected] as well as Dessi and my email.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions