BADREX: Biomedical Abbreviation Expander with Dynamic Regular Expressions
BADREX uses dynamically generated regular expressions to annotate term definition–term abbreviation pairs, and corefers unpaired acronyms and abbreviations back to their initial (or most recent) definition in the text. BADREX achieves precision and recall of 98% and 97% on the Medstract corpus, and 90% and 85% on the BioText corpus. Against these corpora, BADREX yields improved performance over previous approaches, requires no training data and allows runtime customisation of its input parameters.
BADREX also offers the option of annotating and classifying common medical abbreviations extracted from Wikipedia.
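To illustrate what "dynamically generated regular expressions" can mean in practice, the following is a minimal Python sketch and not the BADREX implementation. It assumes the simplest possible rule: an abbreviation appears in parentheses, and its definition is the run of words immediately before it whose initial letters spell the abbreviation. The function name `find_pairs`, the length limits and the pattern itself are illustrative assumptions.

```python
import re


def find_pairs(text):
    """Return (definition, abbreviation) pairs found in text.

    Hypothetical helper: it only handles definitions whose word
    initials match the abbreviation characters one-to-one.
    """
    pairs = []
    # Candidate abbreviations: 2 to 10 letters/digits inside parentheses.
    for m in re.finditer(r"\(([A-Za-z][A-Za-z0-9]{1,9})\)", text):
        abbr = m.group(1)
        # Build the pattern at run time from the abbreviation itself:
        # one word per character, each word starting with that character.
        words = [re.escape(ch) + r"\w*" for ch in abbr]
        pattern = r"\b" + r"[\s-]+".join(words) + r"\s*$"
        # Only look at the text directly preceding the parenthesis.
        before = text[:m.start()]
        hit = re.search(pattern, before, flags=re.IGNORECASE)
        if hit:
            pairs.append((hit.group(0).strip(), abbr))
    return pairs


sentence = "Patients with chronic obstructive pulmonary disease (COPD) were enrolled."
print(find_pairs(sentence))
```

Because the pattern is built from each candidate abbreviation, no training data is needed; a fuller system would also have to handle abbreviations that skip words, use inner letters, or precede their definition, which this sketch does not attempt.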