AIMed Corpora

The AIMed corpus consists of 225 Medline abstracts: 200 describe interactions between human proteins, and 25 do not refer to any interaction. The data set contains 4084 protein references and around 1000 tagged interactions. It makes no distinction between genes and proteins, and the relations are symmetric.
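Because the relations are symmetric, a tool consuming the corpus may want to treat an interaction between A and B as identical to one between B and A. The following is a minimal sketch of one way to do that in Python, assuming protein mentions have already been extracted as strings. The function names and protein names are hypothetical illustrations and are not part of the corpus or its distribution format.

```python
from itertools import combinations


def interaction_key(protein_a, protein_b):
    """Return an order-independent key for a symmetric interaction.

    A frozenset ignores argument order, so (A, B) and (B, A) map to
    the same key.
    """
    return frozenset((protein_a, protein_b))


def candidate_pairs(proteins):
    """Enumerate every unordered pair of distinct protein mentions."""
    return [interaction_key(a, b) for a, b in combinations(proteins, 2)]


# Hypothetical example data, not taken from the corpus.
tagged = {interaction_key("proteinA", "proteinB")}
is_tagged = interaction_key("proteinB", "proteinA") in tagged
```

With this representation, looking up a pair does not depend on the order in which the two proteins appear in the sentence.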

Resource URL:


Raymond J. Mooney
Razvan Bunescu

Associated Institutions

University of Texas

Application Domains
  • Biology
  • Genomics
  • Proteomics
Dataset Subtype
  • Human annotated
Intended User Types
  • Informatics researcher
  • NLP researcher or developer
  • Software developer
Available Documentation
  • PDF documentation
  • Web page/HTML documentation