PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

PREGO : A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types. / Zafeiropoulos, Haris; Paragkamian, Savvas; Ninidakis, Stelios; Pavlopoulos, Georgios A.; Jensen, Lars Juhl; Pafilis, Evangelos.

In: Microorganisms, Vol. 10, No. 2, 293, 2022.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Zafeiropoulos, H, Paragkamian, S, Ninidakis, S, Pavlopoulos, GA, Jensen, LJ & Pafilis, E 2022, 'PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types', Microorganisms, vol. 10, no. 2, 293. https://doi.org/10.3390/microorganisms10020293

APA

Zafeiropoulos, H., Paragkamian, S., Ninidakis, S., Pavlopoulos, G. A., Jensen, L. J., & Pafilis, E. (2022). PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types. Microorganisms, 10(2), [293]. https://doi.org/10.3390/microorganisms10020293

Vancouver

Zafeiropoulos H, Paragkamian S, Ninidakis S, Pavlopoulos GA, Jensen LJ, Pafilis E. PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types. Microorganisms. 2022;10(2). 293. https://doi.org/10.3390/microorganisms10020293

Author

Zafeiropoulos, Haris ; Paragkamian, Savvas ; Ninidakis, Stelios ; Pavlopoulos, Georgios A. ; Jensen, Lars Juhl ; Pafilis, Evangelos. / PREGO : A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types. In: Microorganisms. 2022 ; Vol. 10, No. 2.

Bibtex

@article{347a366f6db74e9da58b5bd62772ac6c,
title = "PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types",
abstract = "To elucidate ecosystem functioning, it is fundamental to recognize what processes occur in which environments (where) and which microorganisms carry them out (who). Here, we present PREGO, a one-stop-shop knowledge base providing such associations. PREGO combines text mining and data integration techniques to mine such what-where-who associations from data and metadata scattered in the scientific literature and in public omics repositories. Microorganisms, biological processes, and environment types are identified and mapped to ontology terms from established community resources. Analyses of comentions in text and co-occurrences in metagenomics data/metadata are performed to extract associations and a level of confidence is assigned to each of them thanks to a scoring scheme. The PREGO knowledge base contains associations for 364,508 microbial taxa, 1090 environmental types, 15,091 biological processes, and 7971 molecular functions with a total of almost 58 million associations. These associations are available through a web portal, an Application Programming Interface (API), and bulk download. By exploring environments and/or processes associated with each other or with microbes, PREGO aims to assist researchers in design and interpretation of experiments and their results. To demonstrate PREGO{\textquoteright}s capabilities, a thorough presentation of its web interface is given along with a meta-analysis of experimental results from a lagoon-sediment study of sulfur-cycle related microbes.",
keywords = "Biological processes, Comention statistics, Literature-derived associations, Microbiome data, Text mining",
author = "Haris Zafeiropoulos and Savvas Paragkamian and Stelios Ninidakis and Pavlopoulos, {Georgios A.} and Jensen, {Lars Juhl} and Evangelos Pafilis",
note = "Publisher Copyright: {\textcopyright} 2022 by the authors. Licensee MDPI, Basel, Switzerland.",
year = "2022",
doi = "10.3390/microorganisms10020293",
language = "English",
volume = "10",
journal = "Microorganisms",
issn = "2076-2607",
publisher = "M D P I AG",
number = "2",

}

RIS

TY - JOUR

T1 - PREGO

T2 - A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types

AU - Zafeiropoulos, Haris

AU - Paragkamian, Savvas

AU - Ninidakis, Stelios

AU - Pavlopoulos, Georgios A.

AU - Jensen, Lars Juhl

AU - Pafilis, Evangelos

N1 - Publisher Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland.

PY - 2022

Y1 - 2022

N2 - To elucidate ecosystem functioning, it is fundamental to recognize what processes occur in which environments (where) and which microorganisms carry them out (who). Here, we present PREGO, a one-stop-shop knowledge base providing such associations. PREGO combines text mining and data integration techniques to mine such what-where-who associations from data and metadata scattered in the scientific literature and in public omics repositories. Microorganisms, biological processes, and environment types are identified and mapped to ontology terms from established community resources. Analyses of comentions in text and co-occurrences in metagenomics data/metadata are performed to extract associations and a level of confidence is assigned to each of them thanks to a scoring scheme. The PREGO knowledge base contains associations for 364,508 microbial taxa, 1090 environmental types, 15,091 biological processes, and 7971 molecular functions with a total of almost 58 million associations. These associations are available through a web portal, an Application Programming Interface (API), and bulk download. By exploring environments and/or processes associated with each other or with microbes, PREGO aims to assist researchers in design and interpretation of experiments and their results. To demonstrate PREGO’s capabilities, a thorough presentation of its web interface is given along with a meta-analysis of experimental results from a lagoon-sediment study of sulfur-cycle related microbes.

AB - To elucidate ecosystem functioning, it is fundamental to recognize what processes occur in which environments (where) and which microorganisms carry them out (who). Here, we present PREGO, a one-stop-shop knowledge base providing such associations. PREGO combines text mining and data integration techniques to mine such what-where-who associations from data and metadata scattered in the scientific literature and in public omics repositories. Microorganisms, biological processes, and environment types are identified and mapped to ontology terms from established community resources. Analyses of comentions in text and co-occurrences in metagenomics data/metadata are performed to extract associations and a level of confidence is assigned to each of them thanks to a scoring scheme. The PREGO knowledge base contains associations for 364,508 microbial taxa, 1090 environmental types, 15,091 biological processes, and 7971 molecular functions with a total of almost 58 million associations. These associations are available through a web portal, an Application Programming Interface (API), and bulk download. By exploring environments and/or processes associated with each other or with microbes, PREGO aims to assist researchers in design and interpretation of experiments and their results. To demonstrate PREGO’s capabilities, a thorough presentation of its web interface is given along with a meta-analysis of experimental results from a lagoon-sediment study of sulfur-cycle related microbes.

KW - Biological processes

KW - Comention statistics

KW - Literature-derived associations

KW - Microbiome data

KW - Text mining

U2 - 10.3390/microorganisms10020293

DO - 10.3390/microorganisms10020293

M3 - Journal article

C2 - 35208748

AN - SCOPUS:85123370627

VL - 10

JO - Microorganisms

JF - Microorganisms

SN - 2076-2607

IS - 2

M1 - 293

ER -

ID: 291301207