Unsupervised knowledge discovery in medical databases using relevance networks

A J Butte; I S Kohane

Unsupervised knowledge discovery in medical databases using relevance networks

Proc AMIA Symp. 1999:711-5.

Authors

A J Butte¹, I S Kohane

Affiliation

¹ Children's Hospital Informatics Program, Boston, Massachusetts, USA.

PMID: 10566452
PMCID: PMC2232846

Abstract

Increasing amounts of data exist in medical databases. When multiple variables are measured for each case in a data set, there exists an underlying relationship between all pairs of variables, some highly correlated and some not. This report describes a technique that creates networks of related variables, or relevance networks, by dropping links with either too weak correlation or too few data points to defend the relationship. The paper describes how applying this methodology to the domain of laboratory results allows the generation of meaningful relations between types of laboratory tests. These relations could be used as the basis of further exploratory research.

Publication types

Research Support, U.S. Gov't, P.H.S.

MeSH terms

Clinical Laboratory Techniques / statistics & numerical data*
Clinical Pharmacy Information Systems
Data Interpretation, Statistical
Databases as Topic*
Health Services Research
Humans
Information Storage and Retrieval / methods*
Models, Theoretical

Grants and funding

5T15 LM07092-07/LM/NLM NIH HHS/United States