Journal of Molecular Biology
Volume 354, Issue 4, 9 December 2005, Pages 994-1007
Journal home page for Journal of Molecular Biology

SVA Elements: A Hominid-specific Retroposon Family

https://doi.org/10.1016/j.jmb.2005.09.085Get rights and content

SVA is a composite repetitive element named after its main components, SINE, VNTR and Alu. We have identified 2762 SVA elements from the human genome draft sequence. Genomic distribution analysis indicates that the SVA elements are enriched in G+C-rich regions but have no preferences for inter- or intragenic regions. A phylogenetic analysis of the elements resulted in the recovery of six subfamilies that were named SVA_A to SVA_F. The composition, age and genomic distribution of the subfamilies have been examined. Subfamily age estimates based upon nucleotide divergence indicate that the expansion of four SVA subfamilies (SVA_A, SVA_B, SVA_C and SVA_D) began before the divergence of human, chimpanzee and gorilla, while subfamilies SVA_E and SVA_F are restricted to the human lineage. A survey of human genomic diversity associated with SVA_E and SVA_F subfamily members showed insertion polymorphism frequencies of 37.5% and 27.6%, respectively. In addition, we examined the amplification dynamics of SVA elements throughout the primate order and traced their origin back to the beginnings of hominid primate evolution, approximately 18 to 25 million years ago. This makes SVA elements the youngest family of retroposons in the primate order.

Introduction

Transposons and transposon-like repetitive elements collectively occupy about 44% of the human genome. Alu and L1 (long interspersed element-1) elements account for ∼30% of the genome sequence and are the most abundant transposable elements in humans,1 while human endogenous retroviruses (HERVs) represent another ∼1% of the human genome. In addition to the major retrotransposon families, there are smaller families of transposons such as SVA, which are receiving increased attention lately due to their recent retrotransposition activity in the human genome.2, 3

The SVA element was originally named SINE-R, with the R indicating its retroviral origin.4 In 1994, Shen et al. identified a new composite retroposon when they studied the structure of the RP gene.5 This new retroposon consisted of the SINE-R element together with a stretch of sequence that shares sequence similarity with Alu sequences. Thus, it was named “SVA” after its main components, SINE, VNTR and Alu.5

SVA elements contain the hallmarks of retrotransposons, in that they are flanked by target site duplications (TSDs), terminate in a poly(A) tail and they are occasionally truncated and inverted during their integration into the genome.2 In addition, they can transduce 3′ sequences during their movement. Therefore, it has been proposed that SVA elements are non-autonomous retrotransposons that are mobilized by L1 encoded proteins in trans.2

SVA elements remain active in the human genome, as demonstrated by their involvement in the creation of various diseases. To date, at least four diseases have been reported related to SVA insertions.2, 6, 7, 8, 9, 10 This makes SVA the third known category of retrotransposons currently expanding in the human lineage, along with L1 and Alu elements.11, 12

To assess the distribution and impact of SVA elements in the human genome, we examined all the SVA elements in the human genome reference sequence.1 Six SVA subfamilies were identified and characterized. For the two human-specific subfamilies, the associated insertion presence/absence human genomic diversity was analyzed. Furthermore, we traced the origin of the entire SVA family back to the beginning of the hominid primate radiation and determined the copy number of SVA elements in different non-human primate genomes. The overall distribution of SVA elements showed a significant correlation with genomic G+C content and gene density.

Section snippets

Copy number of SVA elements

In total, 2762 SVA elements were identified in the human genome draft sequence (hg17; May 2004 freeze). Together, they account for 4.2 Mb of the genome, with an average density of one element per 1.03 Mb. Among them, 1752 elements are full-length, composing 63% of the group. The copy number of SVA elements in the chimpanzee genome (panTro1; Nov 2003 freeze) was also determined. A total of 2637 elements were obtained, with 42% being full-length. After examining the truncated SVA elements in the

Copy number and subfamily composition of SVA elements

The total number of SVA elements reported here was based on a whole-genome analysis; therefore it is more likely to represent the actual number of SVA elements in the human genome compared to previous studies.2, 4 It should be noted here that due to the polymorphic nature of some of the young SVA elements, a proportion of the total number of elements will not be retrieved from any single reference genome that has been sequenced.29 Therefore, the copy number reported here represents the lower

Conclusion

The SVA family of retroposons represents the third known category of mobile elements whose de novo mobilization results in human genetic disorders. Until now, little was known about the distribution and properties of these elements in the human genome as compared to Alu and L1 elements. In this study, we identified and characterized all of the SVA elements from the draft sequence of the human genome. In addition, the human genomic diversity associated with polymorphic SVA elements as well as

Genome analysis

The RepeatMasker annotations of the human (hg17; May 2004 freeze) and chimpanzee genomes (panTro1; Nov. 2003 freeze) were obtained from the UCSC Genome Bioinformatics Site†. The locations of SVA elements were then extracted and inspected manually. Due to the highly variable VNTR region in SVA elements, some of the annotations did not recognize the composition of the elements correctly. In those cases, the annotations were edited manually.

Next, the SVA elements, along with

Acknowledgements

We thank everybody in the Batzer lab for critical reading and inspirational discussion during the preparation of the manuscript. This research was supported by the National Science Foundation grants BCS-0218338 (to M.A.B.) and EPS-0346411 (to M.A.B.); Louisiana Board of Regents Millennium Trust Health Excellence Fund HEF (2000-05)-05 (to M.A.B.), (2000-05)-01 (to M.A.B.) and (2001-06)-02 (to M.A.B.), National Institutes of Health RO1 GM59290 (to M.A.B.) and the State of Louisiana Board of

References (47)

  • R. Cordaux et al.

    Retrotransposition of Alu elements: how many sources?

    Trends Genet.

    (2004)
  • S.F. Altschul et al.

    Basic local alignment search tool

    J. Mol. Biol.

    (1990)
  • A. Pavlicek et al.

    Similar integration but different stability of Alus and LINEs in the human genome

    Gene

    (2001)
  • Z. Gu et al.

    Densities, length proportions, and other distributional features of repetitive sequences in the human genome estimated from 430 megabases of genomic sequence

    Gene

    (2000)
  • A.V. Rynditch et al.

    The regional integration of retroviral sequences into the mosaic genomes of mammals

    Gene

    (1998)
  • J.F. Brookfield

    Selection on Alu sequences?

    Curr. Biol.

    (2001)
  • P.L. Deininger et al.

    Mobile elements and mammalian genome evolution

    Curr. Opin. Genet. Dev.

    (2003)
  • A. Alonso et al.

    Real-time PCR designs to estimate nuclear and mitochondrial DNA copy number in forensic and ancient DNA studies

    Forensic Sci. Int.

    (2004)
  • J.A. Walker et al.

    Human DNA quantitation using Alu element-based polymerase chain reaction

    Anal. Biochem.

    (2003)
  • E.S. Lander et al.

    Initial sequencing and analysis of the human genome

    Nature

    (2001)
  • E.A. Bennett et al.

    Natural genetic variation caused by transposable elements in humans

    Genetics

    (2004)
  • M. Ono et al.

    A novel human nonviral retroposon derived from an endogenous retrovirus

    Nucl. Acids Res.

    (1987)
  • P.A. Callinan et al.

    Transposable elements and human disease

    Genome Dynam

    (2005)
  • Cited by (279)

    • Transposable Elements: Parasites that Shape Genome Evolution

      2024, Function and Evolution of Repeated Dna Sequences
    View all citing articles on Scopus

    H.W., J.X. and D.G. contributed equally to this work.

    View full text