Status of text-mining techniques applied to biomedical text

Drug Discov Today. 2006 Apr;11(7-8):315-25. doi: 10.1016/j.drudis.2006.02.011.

Abstract

Scientific progress is increasingly based on knowledge and information. Knowledge is now recognized as the driver of productivity and economic growth, leading to a new focus on the role of information in the decision-making process. Most scientific knowledge is registered in publications and other unstructured representations that make it difficult to use and to integrate the information with other sources (e.g. biological databases). Making a computer understand human language has proven to be a complex achievement, but there are techniques capable of detecting, distinguishing and extracting a limited number of different classes of facts. In the biomedical field, extracting information has specific problems: complex and ever-changing nomenclature (especially genes and proteins) and the limited representation of domain knowledge.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Abstracting and Indexing
  • Biomedical Research
  • Databases, Bibliographic
  • Dictionaries as Topic
  • Humans
  • Information Storage and Retrieval* / methods
  • Language
  • Natural Language Processing*
  • Pattern Recognition, Automated / methods
  • Periodicals as Topic
  • Semantics
  • Terminology as Topic
  • Vocabulary, Controlled