The AImed corpus consists of 225 Medline abstracts. 200 abstracts describe interactions between human proteins, 25 do not refer to any interaction. There are 4084 protein references and around 1000 tagged interactions in this data set. In this data set there is no distinction between genes and proteins and the relations are symmetric.
Resource URL: ftp://ftp.cs.utexas.edu/pub/mooney/bio-data/
Raymond J. Mooney
University of Texas