Tags: Structured data

Word sense ambiguity is a pervasive characteristic of natural language. For example, the word "cold" has several senses and may refer to a disease, a temperature sensation, or an environmental condition. The specific sense intended is determined by the te ...

Category Structured Data

 "DrugDDI: an annotated corpus for drug-drug interactions" submitted for publication.   The DrugDDI corpus is part of a larger study about automatic Drug-Drug Interaction Extraction. The corpus provides data for the development and automatic evaluatio ...

Category Human Annotated
The Arrowsmith Project Home Page

 Arrowsmith Two Node Search tool,  Anne O'Tate value-added PubMed search tool,  Author-ity Author Name Disambiguation tools, ADAM abbreviation database,    WETLAB prototype electronic lab notebook,   Compendium of Biomedical Text Mining tools,   ...

Structured abstracts contain distinct labeled sections (e.g., “RESULTS”) for key information from articles they summarize.  If English-language structured abstracts appear in journals that the US National Library of Medicine (NLM) indexes, the labels in t ...

Category Structured Data

RxNorm, a standardized nomenclature for clinical drugs and drug delivery devices, is produced by the National Library of Medicine (NLM). In this context, a clinical drug is a pharmaceutical product given to (or taken by) a patient with a therapeutic or di ...

Category Structured Data
NeuroLex - A dynamic lexicon of neuroscience terms

 The NeuroLex project, supported by the Neuroscience Information Framework project, is a dynamic lexicon of neuroscience terms. Unlike an encyclopedia, a lexicon provides the meaning of a term, and not all there is to know about it. The NeuroLex is being ...

Category Human Annotated

This package provides machine learning algorithms optimized for large text categorization tasks and is able to combine several text categorization solutions. The advantages of this package compared to existing approaches are: 1) its speed, 2) it is able t ...

The MIMIC-II research database (Multiparameter Intelligent Monitoring in Intensive Care) is notable for three factors:  it is publicly and freely available; it encompasses a diverse and very large population of ICU patients; and it contains high temporal ...

Category Relational

Researchers have requested the ability to have available MEDLINE citations in the state they were at a given moment in time without the MeSH vocabulary updates and other revisions that occur during the year. The MEDLINE/PubMed Baseline Repository was set ...

Category Structured Data

DrugNerAr Corpus: a corpus annotated with drug anaphoras. Text were collected from the Drugbank database. There is no corpus dedicated to the resolution of the anaphoric expressions occurring in drug interaction descriptions in pharmacological documents, ...

Category Human Annotated

See link to CMC resource catalog on upper right side of page.  All resources are fully open-access, but a registration is needed.  We simply take user and dowload counts and report back to our benifactors.  No data are shared. You will need to download t ...

Category Human Annotated