Skip to content

Latest commit

 

History

History
116 lines (91 loc) · 2.78 KB

README.md

File metadata and controls

116 lines (91 loc) · 2.78 KB

Textmining rules

Filters, remapped lists, subset labels, etc. used in textmining connector using the Pensoft Annotator.

  1. For all resources
  2. For specific resource(s)

For all resources

Remove across all resources:

https://github.com/EOL/textmine_rules/blob/main/terms_to_remove.txt
https://github.com/EOL/textmine_rules/blob/main/geo_synonyms.txt

Delete MoF with these labels:

https://github.com/EOL/textmine_rules/blob/main/del_MoF_with_these_labels.tsv
https://github.com/EOL/textmine_rules/blob/main/blacklist_labels.txt

Patterns for all textmined resources:

life history ontology https://github.com/EOL/textmine_rules/blob/main/life_history.tsv

Exclude traits and its descendants (inclusive):

https://github.com/EOL/textmine_rules/blob/main/exclude_descendants.tsv

Others:

Re-mapped across all resources: https://github.com/EOL/textmine_rules/blob/main/Terms_remapped/DATA_1841_terms_remapped.tsv
mRemark assignments https://github.com/EOL/textmine_rules/blob/main/mRemarks_assignments.tsv

For specific resource(s)

Wikipedia (inferred records)

Remove traits for specific taxon ranks https://github.com/EOL/textmine_rules/blob/main/Wikipedia_excluded_ranks.tsv

Wikipedia (inferred records) AND TreatmentBank

Soil compositions: https://github.com/EOL/textmine_rules/blob/main/soil_composition.tsv
Set measurementType to: http://purl.obolibrary.org/obo/ENVO_09200008

WoRMS

Re-mapped terms: https://github.com/EOL/textmine_rules/blob/main/Terms_remapped/WoRMS_only_terms_remapped.tsv

AntWeb

Exclude descendants of: “aquatic” https://github.com/EOL/textmine_rules/blob/main/AmphibiaWeb/descendants_of_aquatic.tsv

AmphibiaWeb articles

Exclude descendants of: “saline water” https://github.com/EOL/textmine_rules/blob/main/AmphibiaWeb/descendants_of_salt_water.tsv