Sequential result refinement for searching the biomedical literature

J Biomed Inform. 2009 Aug;42(4):678-84. doi: 10.1016/j.jbi.2009.02.009. Epub 2009 Mar 9.

Abstract

Information overload is a problem for users of MEDLINE, the database of biomedical literature that indexes over 17 million articles. Various techniques have been developed to retrieve high quality or important articles. Some techniques rely on using the number of citations as a measurement of an article's importance. Unfortunately, citation information is proprietary, expensive, and suffers from "citation lag." MEDLINE users have a variety of information needs. Although some users require high recall, many users are looking for a "few good articles" on a topic. For these users, precision is more important than recall. We present and evaluate a method for identifying articles likely to be highly cited by using information available at the time of listing in MEDLINE. The method uses a score based on Medical Subject Headings (MeSH) terms, journal impact factor (JIF), and number of authors. This method can filter large MEDLINE result sets (>1000 articles) returned by actual user queries to produce small, highly cited result sets.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Abstracting and Indexing
  • Algorithms*
  • Biomedical Research
  • Information Storage and Retrieval / methods*
  • Journal Impact Factor
  • MEDLINE*
  • Regression Analysis