Chromosome-level assembly of the water buffalo genome surpasses human and goat genomes in sequence contiguity

Nat Commun. 2019 Jan 16;10(1):260. doi: 10.1038/s41467-018-08260-0.

Abstract

Rapid innovation in sequencing technologies and improvement in assembly algorithms have enabled the creation of highly contiguous mammalian genomes. Here we report a chromosome-level assembly of the water buffalo (Bubalus bubalis) genome using single-molecule sequencing and chromatin conformation capture data. PacBio Sequel reads, with a mean length of 11.5 kb, helped to resolve repetitive elements and generate sequence contiguity. All five B. bubalis sub-metacentric chromosomes were correctly scaffolded with centromeres spanned. Although the index animal was partly inbred, 58% of the genome was haplotype-phased by FALCON-Unzip. This new reference genome improves the contig N50 of the previous short-read based buffalo assembly more than a thousand-fold and contains only 383 gaps. It surpasses the human and goat references in sequence contiguity and facilitates the annotation of hard to assemble gene clusters such as the major histocompatibility complex (MHC).

Publication types

  • Comparative Study
  • Research Support, N.I.H., Intramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Buffaloes / genetics*
  • Chromatin / chemistry
  • Chromatin / genetics
  • Chromosomes, Mammalian / genetics*
  • Contig Mapping / methods*
  • Female
  • Genome / genetics*
  • Genomics / methods
  • Goats / genetics*
  • Haplotypes
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Major Histocompatibility Complex / genetics
  • Molecular Sequence Annotation / methods
  • Multigene Family / genetics
  • Repetitive Sequences, Nucleic Acid / genetics
  • Whole Genome Sequencing

Substances

  • Chromatin