HGDP and HapMap analysis by Ancestry Mapper reveals local and global population relationships

PLoS One. 2012;7(11):e49438. doi: 10.1371/journal.pone.0049438. Epub 2012 Nov 26.

Abstract

Knowledge of human origins, migrations, and expansions is greatly enhanced by the availability of large datasets of genetic information from different populations and by the development of bioinformatic tools used to analyze the data. We present Ancestry Mapper, which we believe improves on existing methods, for the assignment of genetic ancestry to an individual and to study the relationships between local and global populations. The principle function of the method, named Ancestry Mapper, is to give each individual analyzed a genetic identifier, made up of just 51 genetic coordinates, that corresponds to its relationship to the HGDP reference population. As a consequence, the Ancestry Mapper Id (AMid) has intrinsic biological meaning and provides a tool to measure similarity between world populations. We applied Ancestry Mapper to a dataset comprised of the HGDP and HapMap data. The results show distinctions at the continental level, while simultaneously giving details at the population level. We clustered AMids of HGDP/HapMap and observe a recapitulation of human migrations: for a small number of clusters, individuals are grouped according to continental origins; for a larger number of clusters, regional and population distinctions are evident. Calculating distances between AMids allows us to infer ancestry. The number of coordinates is expandable, increasing the power of Ancestry Mapper. An R package called Ancestry Mapper is available to apply this method to any high density genomic data set.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cluster Analysis
  • Computational Biology* / methods
  • Evolution, Molecular*
  • Genetics, Population
  • HapMap Project*
  • Human Genome Project*
  • Human Migration
  • Humans
  • Internet
  • Polymorphism, Single Nucleotide
  • Population Groups / genetics

Grants and funding

This work was funded with internal funding from University College of Dublin, Ireland. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.