Description
Submitting Author Name: Anatoly Tsyplenkov
Submitting Author Github Handle: @atsyplenkov
Repository: https://github.com/atsyplenkov/tidyhydro
Submission type: Pre-submission
Language: en
- Paste the full DESCRIPTION file inside a code block below:
Package: tidyhydro
Type: Package
Title: Tidy Metrics for Assessing Hydrological Models Performance
Version: 0.1.1
Authors@R:
person(given = "Anatoly", family = "Tsyplenkov", email = "[email protected]", role = c("cre", "aut", "cph"), comment = c(ORCID = "0000-0003-4144-8402"))
Maintainer: Anatoly Tsyplenkov <[email protected]>
Description: Provides tidy tools for comparing simulated and observed hydrological time series. Includes compatibility with the 'yardstick' package for model performance evaluation using commonly used metrics such as the Nash–Sutcliffe Efficiency (NSE), Kling–Gupta Efficiency (KGE), percent bias (pBIAS) and etc.
License: MIT + file LICENSE
Depends: R (>= 4.1.0)
Imports:
Rcpp (>= 1.0.12),
rlang (>= 1.1.0),
yardstick (>= 1.3.1)
LinkingTo: Rcpp
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.3.2
Encoding: UTF-8
Language: en-US
Suggests:
hydroGOF,
testthat (>= 3.0.0),
quickcheck (>= 0.1.3),
quarto
Config/testthat/edition: 3
URL: https://github.com/atsyplenkov/tidyhydro, https://atsyplenkov.github.io/tidyhydro/
BugReports: https://github.com/atsyplenkov/tidyhydro/issues
LazyData: true
Config/Needs/website: bench, ggplot2, quarto, lubridate, dplyr
Scope
-
Please indicate which category or categories from our package fit policies or statistical package categories this package falls under. (Please check one or more appropriate boxes below):
Data Lifecycle Packages
- data retrieval
- data extraction
- data munging
- data deposition
- data validation and testing
- workflow automation
- version control
- citation management and bibliometrics
- scientific software wrappers
- field and lab reproducibility tools
- database software bindings
- geospatial data
- text analysis
Statistical Packages
- Bayesian and Monte Carlo Routines
- Dimensionality Reduction, Clustering, and Unsupervised Learning
- Machine Learning
- Regression and Supervised Learning
- Exploratory Data Analysis (EDA) and Summary Statistics
- Spatial Analyses
- Time Series Analyses
- Probability Distributions
-
Explain how and why the package falls under these categories (briefly, 1-2 sentences). Please note any areas you are unsure of:
tidyhydro
fits within the data validation and testing category / regression , as it provides automated, quantitative metrics (e.g., NSE, KGE, pBIAS) for evaluating the quality and performance of hydrological models compatible with the tidymodels
infrastructure.
- If submitting a statistical package, have you already incorporated documentation of standards into your code via the srr package?
No, I just learned about the srr
from the issue template. Feel a bit confused regarding that standards and not sure if it is applicable to my package.
- Who is the target audience and what are scientific applications of this package?
The target audience comprises the hydrological/ecological community and the general data science community who uses regression models within their workflows.
- Are there other R packages that accomplish the same thing? If so, how does yours differ or meet our criteria for best-in-category?
Yes, there is a package hydroGOF
, which has some overlap with tidyhydro
in terms of functionality. For example, metrics such as MSE, pBIAS, NSE, and KGE are implemented in both packages. However, tidyhydro
provides a tidyverse
-style interface, giving full compatibility and interoperability with the popular tidymodels
framework (via yardstick
). Additionally, tidyhydro
uses C++ via Rcpp for performance, which helps in validating large time series efficiently (see benchmarks).
- (If applicable) Does your package comply with our guidance around Ethics, Data Privacy and Human Subjects Research?
Not Applicable
- Any other questions or issues we should be aware of?:
- I raised this issue out of curiosity, as I have been thinking about submitting some package to rOpenSci for several years. However, I am not 100% sure that
tidyhydro
falls within the aims and scope of rOpenSci. I am interested to hear your thoughts about it. - The local
pkgcheck
fails to find continuous integration checks, which confuses me, as I do have a very comprehensive CI workflow set up. I am curious to see the results of the rOpenSci-hostedpkgcheck
. - I do need some guidance regarding
srr
standards. As far as I can understand from thesrr
manual, it is not really applicable to my package, astidyhydro
is not introducing new methods but rather provides a re-implementation of a well-known ones in a tidy way.