Comparative genomic analysis of sequences sampled from a small region on soybean (Glycine max) molecular linkage group G

Genome. 2002 Aug;45(4):634-45. doi: 10.1139/g02-027.

Abstract

Eight DNA markers spanning an interval of approximately 10 centimorgans (cM) on soybean (Glycine max) molecular linkage group G (MLG-G) were used to identify bacterial artificial chromosome (BAC) clones. Twenty-eight BAC clones in eight distinct contiguous groups (contigs) were isolated from this genome region, along with 59 BAC clones on 17 contigs homoeologous to those on MLG-G. BAC clones in four of the MLG-G contigs were also digested to produce subclones and detailed physical maps. All of the BAC-ends were sequenced, as were the subclones, to estimate proportions in different sequence categories, compare similarities among homoeologs, and explore microsynteny with Arabidopsis. Homoeologous BAC contigs were enriched in repetitive sequences compared with those on MLG-G or the soybean genome as a whole. Fingerprint and cross-hybridization comparisons between MLG-G and homoeologous contigs revealed cases of highly similar physical organization between soybean duplicates, as did DNA sequence comparisons. Twenty-seven out of 78 total sequences on soybean MLG-G showed significant similarity to Arabidopsis. The homologs mapped to six compact genome segments in Arabidopsis, with the longest containing seven homologs spanning two million base pairs. These results extend previous observations of large-scale duplication and selective gene loss in Arabidopsis, suggesting that networks of conserved synteny between Arabidopsis and other angiosperm families can stretch over long physical distances.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Arabidopsis / genetics
  • Chromosomes, Artificial, Bacterial
  • Gene Duplication
  • Genetic Markers
  • Genome, Plant*
  • Glycine max / genetics*
  • Molecular Sequence Data
  • Physical Chromosome Mapping
  • Repetitive Sequences, Nucleic Acid
  • Sequence Analysis, DNA
  • Synteny

Substances

  • Genetic Markers

Associated data

  • GENBANK/AX196295