Skip to content

Handling large collections of datasets #125

Open
@diazrenata

Description

@diazrenata

If we go by route, the BBS data will really be many communities.
If we continue to handle each community separately (i.e. name the object and label everything with that name in the drake plans), it will be a lot of code and a lot of names.

I've already encountered this in BBS and made a temporary workaround:

  • when subsetting by route in the by-route branch, there needs to be some way of keeping track of which route & region the resulting abundance timeseries came from
  • I stuck those on as columns in the covariates table.

For a more systematic solution, my initial thought is we could allow datasets to exist as either the currently specified format or as a list of datasets in that format?
That would allow us to continue to refer to the whole lot of them as "bbs", but it would mean we'd need to build in some way of checking the format and handling them appropriately.

Alternatively, we could change the way we handle the individual communities, to accommodate wild cards/iterating over a list?

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions