- link: https://github.com/blmoistawinde/HarvestText
- author: blmoistawinde
- note: 一个专注无(弱)监督方法,能够整合领域知识(如类型,别名)对特定领域文本进行简单高效地处理和分析的库。
- link: https://github.com/opensemanticsearch
- web: https://www.opensemanticsearch.org/
- author: opensemanticsearch.org
- note: Free Software for your own Search Engine, Explorer for Discovery of large document collections, Media Monitoring, Text Analytics, Document Analysis & Text Mining platform based on Apache Solr or Elasticsearch open-source enterprise-search and Open Standards for Linked Data, Semantic Web & Linked Open Data integration.
- link: https://github.com/impira/docquery
- author: impira
- note: an easy way to extract information from documents.
- link: https://github.com/Filimoa/open-parse
- author: Filimoa
- note: a tool designed to fill this gap by providing a flexible, easy-to-use library capable of visually discerning document layouts and chunking them effectively.