Identification of new drug classification terms in textual resources

Corinna Kolárik; Martin Hofmann-Apitius; Marc Zimmermann; Juliane Fluck
July 2007
Bioinformatics; Jul 2007, Vol. 23 Issue 13, p. i264
Academic Journal
Knowledge about the biological effects of small molecules aids the understanding of biological processes and supports the development of new therapeutic agents. DrugBank is a high-quality database that provides such information about drugs, including annotations of drug effects and classifications of therapeutic effects. However, broadening the scope of such a database in classifying and annotating drugs requires systems that automatically extract classification terms and the corresponding drug annotations. We have developed an approach for identifying new terms in unstructured text that provide information about drug properties. It is based on identifying and extracting phrases that correspond to lexico-syntactic patterns, so-called Hearst patterns, and that contain drug names and directly related drug annotation terms. Such phrases could be identified with high performance in DrugBank text (0.89 F-score) and in Medline abstracts (0.83 F-score). Compared with the DrugBank annotation terminology, a large number of new drug annotation terms were found. The evaluation of terms extracted from Medline showed that 29–53% of them are new, valid drug property terms. They could be assigned to existing drug property classes and to new classes not provided by the DrugBank drug annotation. We conclude that our system can support database content updates by providing additional drug descriptions of pharmacological effects not yet found in databases such as DrugBank. Moreover, we propose that automatic normalization of terms improves the annotation and the retrieval of relevant database entries.

Contact: corinna.kolarik@scai.fraunhofer.de

Supplementary information: Supplementary data are available at Bioinformatics online.
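
To make the idea of Hearst patterns concrete, the following is a minimal sketch of pattern-based extraction of drug and annotation-term pairs. It is not the authors' system: the tiny drug lexicon, the two patterns chosen ("X such as/including DRUGS" and "DRUGS and/or other X"), the single-word approximation of the annotation term, and the function name `extract_pairs` are all assumptions made for illustration. A real system would presumably use a full drug dictionary and proper noun-phrase detection.

```python
import re

# Hypothetical mini-lexicon; a real system would use a full drug name dictionary.
DRUG_NAMES = ["aspirin", "ibuprofen", "warfarin", "heparin"]

# One drug name, and a coordinated list such as "a, b and c".
DRUG = "(?:" + "|".join(re.escape(d) for d in DRUG_NAMES) + ")"
DRUG_LIST = rf"{DRUG}(?:\s*,\s*{DRUG})*(?:\s*,?\s*(?:and|or)\s+{DRUG})?"

# Crude stand-in for a noun phrase: a single (possibly hyphenated) word.
TERM = r"[a-z][a-z-]*"

# Two classic Hearst-style patterns (an assumed, illustrative selection).
PATTERNS = [
    # "anticoagulants such as warfarin and heparin"
    re.compile(
        rf"\b(?P<term>{TERM})\s*,?\s+(?:such as|including)\s+(?P<drugs>{DRUG_LIST})\b"
    ),
    # "aspirin, ibuprofen and other analgesics"
    re.compile(
        rf"\b(?P<drugs>{DRUG_LIST})\s*,?\s+(?:and|or)\s+other\s+(?P<term>{TERM})\b"
    ),
]


def extract_pairs(sentence):
    """Return (drug, annotation term) pairs found by the patterns above."""
    text = sentence.lower()
    pairs = []
    for pattern in PATTERNS:
        for match in pattern.finditer(text):
            # Split the matched drug list back into individual drug names.
            drugs = re.findall(DRUG, match.group("drugs"))
            pairs.extend((drug, match.group("term")) for drug in drugs)
    return pairs


if __name__ == "__main__":
    print(extract_pairs("Anticoagulants such as warfarin and heparin prevent clotting."))
    print(extract_pairs("Aspirin, ibuprofen and other analgesics reduce pain."))
```

Because the term is approximated by a single word, multi-word classes such as "anti-inflammatory agents" would be cut down to their head word; this is a limitation of the sketch, not a claim about the published method.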


