Skip to content

dieterich-lab/medication_information_extraction_using_llms

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Medication information extraction using local large language models

This repository contains code for the manuscript: "Medication information extraction using local large language models". The project uses fine-tuned local open-source large language models for extracting structured medication infromation from unstructured clinical texts in two language: EN and DE.

Citation

If you use this code or the findings from our work, please cite

  @article{richter2025medication,
  title={Medication information extraction using local large language models},
  author={Richter-Pechanski, Phillip and Seiferling, Marvin and Kiriakou, Christina and Schwab, Dominic M and Geis, Nicolas A and Dieterich, Christoph and Frank, Anette},
  journal={medRxiv},
  pages={2025--03},
  year={2025},
  publisher={Cold Spring Harbor Laboratory Press}
}

Repository Structure

  • medication_extraction/ – Core scripts for data preparation, model fine-tuning, inference and evaluation (incl. confidence intervals).
  • further_experiments/ – Additional experiments including investigating data leakage of Llama 3.1 and an experimental JSON2BRAT tool, to apply the official n2c2 2018 track 2 evaluation script on the results.
  • shapley_experiments/ – Scripts to Investigate implicit knowledge of Llama using Shapley values for false negative predictions

See the respective folders for required libraries and additional information.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published