subgenus = Incertae sedis then name string doesn't parse, also strange looking quality values #277

Open
@debpaul


Raw data (unparsed): beulah-first-5000-name-strings-unparsed.csv

Modified GNParsed Data Set: beulath-taxonnames-gnparsed-first-5000-rows.txt

  • added a family column, value = Carabidae
  • opened the file in Notepad++
  • changed CRLF line endings to UNIX (LF), because the TW batch upload requires this
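
For reference, the same two edits (adding the family column and converting CRLF to LF) could be scripted instead of done by hand. This is only a minimal sketch: the function name `add_family_and_normalize`, the tab delimiter, and the assumption that the first row is a header are mine, not something confirmed by the file itself.

```python
import csv


def add_family_and_normalize(src_path, dst_path, family="Carabidae", delimiter="\t"):
    """Append a constant 'family' column and write LF-only line endings.

    Assumptions (not confirmed by the issue): the file is delimited text
    with a header row, and the delimiter is a tab.
    """
    # newline="" lets the csv module handle CRLF/LF itself when reading and
    # stops Python from translating "\n" to CRLF when writing on Windows.
    with open(src_path, newline="", encoding="utf-8") as src, \
         open(dst_path, "w", newline="", encoding="utf-8") as dst:
        reader = csv.reader(src, delimiter=delimiter)
        writer = csv.writer(dst, delimiter=delimiter, lineterminator="\n")
        header_written = False
        for row in reader:
            if not row:
                continue  # skip completely empty lines
            if not header_written:
                writer.writerow(row + ["family"])  # header gets the column name
                header_written = True
            else:
                writer.writerow(row + [family])  # data rows get the value
```

Doing this in a script also avoids any chance of an editor or spreadsheet changing field values along the way.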

Noticed

  • the Quality values look strange. Maybe on import into Excel I need to select a specific data type for this field?
    Image

  • see also line 11 in the image above, where the value pseudoflavipes appears changed to pseudoflavipe0s in the CanonicalFull column (also lines 116, 117)

    • I don't know where that 0 comes from
  • see also the leading and trailing 0s in Author Year. Not sure where they are coming from either
    Image

  • More 0 issues (and a delimiter issue?), origin uncertain
    Image

  • Some names did not parse (not sure why). See the next screenshot. Maybe because all these names have subgenus = (Incertae sedis) and GN doesn't recognize this value at this rank?

Image

  • In general, subgenus is missing from all parsed values.
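
To narrow down whether the stray 0s are in the file itself or only appear after the Excel import, the text file could be inspected directly with every field read as a plain string. The sketch below is a rough diagnostic under stated assumptions: the column names `Verbatim` and `CanonicalFull`, the tab delimiter, and the rule "an empty CanonicalFull means the name did not parse" are my guesses at the file layout, and `inspect_parsed` is a hypothetical helper name. Reported line numbers assume one record per physical line.

```python
import csv
import re


def inspect_parsed(path, delimiter="\t"):
    """Return (digit_rows, unparsed, unparsed_incertae) as lists of
    (line_number, value) tuples.

    Assumed column names: 'Verbatim' and 'CanonicalFull'.
    Assumed rule: a row with an empty CanonicalFull did not parse.
    """
    digit_rows, unparsed, unparsed_incertae = [], [], []
    with open(path, newline="", encoding="utf-8") as fh:
        reader = csv.DictReader(fh, delimiter=delimiter)
        # start=2 because line 1 is the header row
        for line_no, row in enumerate(reader, start=2):
            canonical = (row.get("CanonicalFull") or "").strip()
            verbatim = (row.get("Verbatim") or "").strip()
            # A canonical name should not normally contain digits.
            if re.search(r"\d", canonical):
                digit_rows.append((line_no, canonical))
            if not canonical:
                unparsed.append((line_no, verbatim))
                # Check the "(Incertae sedis)" hypothesis on unparsed rows.
                if "incertae sedis" in verbatim.lower():
                    unparsed_incertae.append((line_no, verbatim))
    return digit_rows, unparsed, unparsed_incertae
```

If `digit_rows` comes back empty, the 0s are likely introduced on import into Excel rather than by the parser. If `unparsed` and `unparsed_incertae` have the same length, that would be consistent with the (Incertae sedis) explanation, though it would not prove it.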

Maybe in future?

  • option to parse (further atomize) down to the lowest rank provided
