LLL Corpora

The dataset was prepared for the Genic Interaction Extraction Challenge. Extracting genic interactions means identifying, for every pair of interacting entities mentioned in a sentence, the agent (a protein) and the target (a gene). MIG-INRA has annotated hundreds of such interactions with the XML editor CADIXE. For the challenge, only a simple subset of them is provided as the training corpus.
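To make the task concrete, the following is a minimal sketch of how one annotated sentence and its agent/target pairs might be held in memory. The class names, field names, and the example sentence (with the placeholder entities ProtA, geneB, and geneC) are assumptions made for illustration; they do not reproduce the corpus file format or the CADIXE annotation scheme.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GenicInteraction:
    """One directed interaction (hypothetical structure, not the corpus format)."""
    agent: str   # the protein that acts
    target: str  # the gene that is acted upon


@dataclass
class AnnotatedSentence:
    """A sentence together with the interactions annotated in it."""
    text: str
    interactions: list[GenicInteraction]

    def pairs(self) -> list[tuple[str, str]]:
        # Return every (agent, target) pair in annotation order.
        return [(i.agent, i.target) for i in self.interactions]


# Invented example sentence with placeholder entity names.
example = AnnotatedSentence(
    text="ProtA activates geneB and represses geneC.",
    interactions=[
        GenicInteraction(agent="ProtA", target="geneB"),
        GenicInteraction(agent="ProtA", target="geneC"),
    ],
)

print(example.pairs())
```

An extraction system for the challenge would be expected to produce pairs like these from the raw sentence text alone.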

Associated Institutions

Institut National de la Recherche Agronomique

Application Domains
  • Biology
  • Genomics
Intended User Types
  • Informatics researcher
  • NLP researcher or developer
  • Software developer
Available Documentation
  • PDF documentation
  • Web page/HTML documentation