Input data and outputs for the CoverageMetric project

Author: Abad-Navarro, Francisco (1, 2)

  1. Universidad de Murcia, Murcia, Spain. ROR: https://ror.org/03p3aeb86
  2. Instituto Murciano de Investigación Biosanitaria Virgen de la Arrixaca, Murcia, Spain
Publisher: Zenodo

Year of publication: 2024

Type: Dataset

License: CC BY 4.0

Abstract

This repository contains a zip file with the input and output data for an experiment with CoverageMetric (https://github.com/fanavarro/CoverageMetric):

- input:
  - ontologies: the ontologies used as input (FoodOn, LKIF, GeneOntology), together with their normalized forms.
  - text:
    - food_text: natural language text corpus about food, including the original and the processed text.
    - gene_text: natural language text corpus about genetics, including the original and the processed text.
    - legal_text: natural language text corpus about legal topics, including the original and the processed text.
- results: the results of comparing each ontology with each natural language text corpus using CoverageMetric.
- analysis.R: R script that generates figures summarizing the results.

The SNOMED ontology and the medical text corpus used as input are not included due to licensing issues; however, the results obtained with them are included in this repository.
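To check a downloaded copy of the archive against the contents described in the abstract, a minimal Python sketch such as the following may help. It only groups the archive's member paths by their top-level entry. The function names are illustrative, and the exact file name of the zip and the folder names inside it are not stated here, so the path passed to `summarize_archive` is whatever the downloaded file is called.

```python
import zipfile
from collections import defaultdict


def group_by_top_level(names):
    """Group archive member paths by their first path component."""
    groups = defaultdict(list)
    for name in names:
        parts = name.strip("/").split("/")
        if len(parts) > 1:
            # Nested member: record its path relative to the top-level entry.
            groups[parts[0]].append("/".join(parts[1:]))
        else:
            # Top-level file or bare directory entry: make sure it is listed.
            groups.setdefault(parts[0], [])
    return dict(groups)


def summarize_archive(zip_path):
    """Return {top-level entry: [relative member paths]} for a zip file."""
    with zipfile.ZipFile(zip_path) as archive:
        return group_by_top_level(archive.namelist())
```

Calling `summarize_archive` on the downloaded zip returns a dictionary whose keys can be compared with the items listed in the abstract (input data, results, and `analysis.R`).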