Automatic extraction of biomolecular interactions: an empirical approach

dc.contributor.author Zhang, Lifeng
dc.contributor.author Berleant, Daniel
dc.contributor.author Ding, Jing
dc.contributor.author Wurtele, Eve
dc.contributor.department Department of Genetics, Development, and Cell Biology (LAS)
dc.date 2018-02-18T04:05:10.000
dc.date.accessioned 2020-06-30T04:02:39Z
dc.date.available 2020-06-30T04:02:39Z
dc.date.copyright Tue Jan 01 00:00:00 UTC 2013
dc.date.issued 2013-01-01
dc.description.abstract <p><h3>Background</h3></p> <p>We describe a method for extracting data about how biomolecule pairs interact from texts. This method relies on empirically determined characteristics of sentences. The characteristics are efficient to compute, making this approach to extraction of biomolecular interactions scalable. The results of such interaction mining can support interaction network annotation, question answering, database construction, and other applications. <h3>Results</h3></p> <p>We constructed a software system to search MEDLINE for sentences likely to describe interactions between given biomolecules. The system extracts a list of the interaction-indicating terms appearing in those sentences, then ranks those terms based on their likelihood of correctly characterizing how the biomolecules interact. The ranking process uses a <em>tf-idf</em> (term frequency-inverse document frequency) based technique using empirically derived knowledge about sentences, and was applied to the MEDLINE literature collection. Software was developed as part of the MetNet toolkit (<a href="http://www.metnetdb.org/">http://www.metnetdb.org</a>). <h3>Conclusions</h3></p> <p>Specific, efficiently computable characteristics of sentences about biomolecular interactions were analyzed to better understand how to use these characteristics to extract how biomolecules interact.</p> <p>The text empirics method that was investigated, though arising from a classical tradition, has yet to be fully explored for the task of extracting biomolecular interactions from the literature. The conclusions we reach about the sentence characteristics investigated in this work, as well as the technique itself, could be used by other systems to provide evidence about putative interactions, thus supporting efforts to maximize the ability of hybrid systems to support such tasks as annotating and constructing interaction networks.</p>
dc.description.comments <p>This article is from <em>BMC Bioinformatics </em>14 (2013): 234, doi: <a href="http://dx.doi.org/10.1186/1471-2105-14-234" target="_blank">10.1186/1471-2105-14-234</a>. Posted with permission.</p>
dc.format.mimetype application/pdf
dc.identifier archive/lib.dr.iastate.edu/gdcb_las_pubs/63/
dc.identifier.articleid 1064
dc.identifier.contextkey 9639851
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath gdcb_las_pubs/63
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/37974
dc.language.iso en
dc.source.bitstream archive/lib.dr.iastate.edu/gdcb_las_pubs/63/2013_Wurtele_AutomaticExtraction.pdf|||Sat Jan 15 01:20:05 UTC 2022
dc.source.uri 10.1186/1471-2105-14-234
dc.subject.disciplines Cell and Developmental Biology
dc.subject.disciplines Computational Biology
dc.subject.disciplines Genetics and Genomics
dc.subject.keywords Biomolecular interactions
dc.subject.keywords Information extraction
dc.subject.keywords Text mining
dc.subject.keywords Networks
dc.title Automatic extraction of biomolecular interactions: an empirical approach
dc.type article
dc.type.genre article
dspace.entity.type Publication
relation.isAuthorOfPublication a7de6326-d86c-4395-b9e6-51187c7f1782
relation.isOrgUnitOfPublication 9e603b30-6443-4b8e-aff5-57de4a7e4cb2
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
2013_Wurtele_AutomaticExtraction.pdf
Size:
709.4 KB
Format:
Adobe Portable Document Format
Description:
Collections