Skip to content

mantis - Multiple Time Series Scanner #710

Open
@phuongquan

Description

@phuongquan

Submitting Author Name: T. Phuong Quan
Submitting Author Github Handle: @phuongquan
Other Package Authors Github handles: (comma separated, delete if none)
Repository: https://github.com/phuongquan/mantis
Version submitted: 0.3.0
Submission type: Standard
Editor: @beatrizmilz
Reviewers: TBD

Archive: TBD
Version accepted: TBD
Language: en


  • Paste the full DESCRIPTION file inside a code block below:
Package: mantis
Type: Package
Title: Multiple Time Series Scanner
Version: 0.3.0
Authors@R: c(
    person(c("T.", "Phuong"), "Quan", email = "[email protected]",
        role = c("aut", "cre"), comment = c(ORCID = "0000-0001-8566-1817")),
    person(family = "University of Oxford", role = "cph"),
    person(family = "National Institute for Health Research (NIHR)", role = "fnd")
    )
Description: Generate interactive html reports that enable quick visual review of multiple 
    related time series stored in a data frame. For static datasets, this can help to identify any temporal 
    artefacts that may affect the validity of subsequent analyses. For live data feeds, 
    regularly scheduled reports can help to pro-actively identify data feed problems
    or unexpected trends that may require action. The reports are self-contained and shareable 
    without a web server. 
URL: https://github.com/phuongquan/mantis,
    https://phuongquan.github.io/mantis/
BugReports: https://github.com/phuongquan/mantis/issues
License: GPL (>= 3)
Encoding: UTF-8
LazyData: true
Depends: 
    R (>= 4.1.0)
Imports:
    rmarkdown,
    knitr,
    reactable,
    dplyr (>= 1.1.1),
    tidyr,
    dygraphs,
    xts,
    ggplot2,
    scales,
    purrr,
    htmltools
Suggests: 
    covr,
    testthat (>= 3.0.0)
Config/testthat/edition: 3
RoxygenNote: 7.3.2
VignetteBuilder: knitr
Roxygen: list(markdown = TRUE)

Scope

  • Please indicate which category or categories from our package fit policies this package falls under: (Please check an appropriate box below. If you are unsure, we suggest you make a pre-submission inquiry.):

    • data retrieval
    • data extraction
    • data munging
    • data deposition
    • data validation and testing
    • workflow automation
    • version control
    • citation management and bibliometrics
    • scientific software wrappers
    • field and lab reproducibility tools
    • database software bindings
    • geospatial data
  • Explain how and why the package falls under these categories (briefly, 1-2 sentences):

It helps to identify any temporal artefacts or unexpected trends in collections of related time series. It does this by generating html reports that enable quick visual review of the data. This is important for ensuring the validity of any subsequent analyses based on the data.

  • Who is the target audience and what are scientific applications of this package?

There are two main target audiences:

  1. researchers who analyse data from large, temporal datasets, particularly routinely-collected data such as electronic health records. The package helps them to quickly check for temporal biases in their data before embarking on their main analyses, increasing the quality of their studies as well as trust in the scientific process.

  2. managers of live data feeds that are used as a data source for downstream analyses. Regular inspection of data that is frequently updated will help to identify any issues early and enable a timely response to rectify issues such as missing data. While a set of validation checks in a data pipeline could also work in this circumstance, the benefit of this package is the ability to check for temporal anomalies that are obvious to the human eye but that are difficult to capture programmatically.

I previously created a related package daiquiri which was accepted at ROpenSci in 2022, which creates data quality reports for temporal, record-level datasets. This new package was developed as a response to users who wanted to visually review datasets that were essentially a collection of time series in a data frame, as opposed to record-level, non-numeric data that requires aggregation, (which is what daiquiri is designed for).

There are other packages that do similar things, such as visually inspecting data frames (visdat) or running validation checks on them (assertr, pointblank), but to my knowledge there are none which assist in identifying anomalous temporal changes.

Yes

  • If you made a pre-submission inquiry, please paste the link to the corresponding issue, forum post, or other discussion, or @tag the editor you contacted.

#693

  • Explain reasons for any pkgcheck items which your package is unable to pass.

All passed

Technical checks

Confirm each of the following by checking the box.

This package:

Publication options

  • Do you intend for this package to go on CRAN?

  • Do you intend for this package to go on Bioconductor?

  • Do you wish to submit an Applications Article about your package to Methods in Ecology and Evolution? If so:

MEE Options
  • The package is novel and will be of interest to the broad readership of the journal.
  • The manuscript describing the package is no longer than 3000 words.
  • You intend to archive the code for the package in a long-term repository which meets the requirements of the journal (see MEE's Policy on Publishing Code)
  • (Scope: Do consider MEE's Aims and Scope for your manuscript. We make no guarantee that your manuscript will be within MEE scope.)
  • (Although not required, we strongly recommend having a full manuscript prepared when you submit here.)
  • (Please do not submit your package separately to Methods in Ecology and Evolution)

Code of conduct

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions