Large-scale extraction of gene regulation for model organisms in an ontological context

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Large-scale extraction of gene regulation for model organisms in an ontological context. / Saric, Jasmin; Jensen, Lars J; Rojas, Isabel.

In: In Silico Biology, Vol. 5, No. 1, 2005, p. 21-32.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Saric, J, Jensen, LJ & Rojas, I 2005, 'Large-scale extraction of gene regulation for model organisms in an ontological context', In Silico Biology, vol. 5, no. 1, pp. 21-32.

APA

Saric, J., Jensen, L. J., & Rojas, I. (2005). Large-scale extraction of gene regulation for model organisms in an ontological context. In Silico Biology, 5(1), 21-32.

Vancouver

Saric J, Jensen LJ, Rojas I. Large-scale extraction of gene regulation for model organisms in an ontological context. In Silico Biology. 2005;5(1):21-32.

Author

Saric, Jasmin ; Jensen, Lars J ; Rojas, Isabel. / Large-scale extraction of gene regulation for model organisms in an ontological context. In: In Silico Biology. 2005 ; Vol. 5, No. 1. pp. 21-32.

Bibtex

@article{90d0f156c0844c0b9a3cb3ed3c9b2322,
title = "Large-scale extraction of gene regulation for model organisms in an ontological context",
abstract = "This paper presents an approach using syntactosemantic rules for the extraction of relational information from biomedical abstracts. The results show that by overcoming the hurdle of technical terminology, high precision results can be achieved. From abstracts related to baker's yeast, we manage to extract a regulatory network comprised of 441 pairwise relations from 58,664 abstracts with an accuracy of 83 - 90%. To achieve this, we made use of a resource of gene/protein names considerably larger than those used in most other biology related information extraction approaches. This list of names was included in the lexicon of our retrained partof- speech tagger for use on molecular biology abstracts. For the domain in question an accuracy of 93.6 - 97.7% was attained on Part-of-speech-tags. The method can be easily adapted to other organisms than yeast, allowing us to extract many more biologically relevant relations. The main reason for the comparable precision rates is the ontological model that was built beforehand and served as a guiding force for the manual coding of the syntactosemantic rules.",
author = "Jasmin Saric and Jensen, {Lars J} and Isabel Rojas",
year = "2005",
language = "English",
volume = "5",
pages = "21--32",
journal = "In Silico Biology",
issn = "1386-6338",
publisher = "I O S Press",
number = "1",

}

RIS

TY - JOUR

T1 - Large-scale extraction of gene regulation for model organisms in an ontological context

AU - Saric, Jasmin

AU - Jensen, Lars J

AU - Rojas, Isabel

PY - 2005

Y1 - 2005

N2 - This paper presents an approach using syntactosemantic rules for the extraction of relational information from biomedical abstracts. The results show that by overcoming the hurdle of technical terminology, high precision results can be achieved. From abstracts related to baker's yeast, we manage to extract a regulatory network comprised of 441 pairwise relations from 58,664 abstracts with an accuracy of 83 - 90%. To achieve this, we made use of a resource of gene/protein names considerably larger than those used in most other biology related information extraction approaches. This list of names was included in the lexicon of our retrained partof- speech tagger for use on molecular biology abstracts. For the domain in question an accuracy of 93.6 - 97.7% was attained on Part-of-speech-tags. The method can be easily adapted to other organisms than yeast, allowing us to extract many more biologically relevant relations. The main reason for the comparable precision rates is the ontological model that was built beforehand and served as a guiding force for the manual coding of the syntactosemantic rules.

AB - This paper presents an approach using syntactosemantic rules for the extraction of relational information from biomedical abstracts. The results show that by overcoming the hurdle of technical terminology, high precision results can be achieved. From abstracts related to baker's yeast, we manage to extract a regulatory network comprised of 441 pairwise relations from 58,664 abstracts with an accuracy of 83 - 90%. To achieve this, we made use of a resource of gene/protein names considerably larger than those used in most other biology related information extraction approaches. This list of names was included in the lexicon of our retrained partof- speech tagger for use on molecular biology abstracts. For the domain in question an accuracy of 93.6 - 97.7% was attained on Part-of-speech-tags. The method can be easily adapted to other organisms than yeast, allowing us to extract many more biologically relevant relations. The main reason for the comparable precision rates is the ontological model that was built beforehand and served as a guiding force for the manual coding of the syntactosemantic rules.

M3 - Journal article

C2 - 15972005

VL - 5

SP - 21

EP - 32

JO - In Silico Biology

JF - In Silico Biology

SN - 1386-6338

IS - 1

ER -

ID: 40749505