VSGdb: a database for trypanosome variant surface glycoproteins, a large and diverse family of coiled coil proteins

Lucio Marcello, Suraj Menon, Pauline Ward, Jonathan M Wilkes, Nicola G Jones, Mark Carrington, J David Barry

Research output: Contribution to journalArticle

20 Citations (Scopus)

Abstract

BACKGROUND:
Trypanosomes are coated with a variant surface glycoprotein (VSG) that is so densely packed that it physically protects underlying proteins from effectors of the host immune system. Periodically cells expressing a distinct VSG arise in a population and thereby evade immunity. The main structural feature of VSGs are two long alpha-helices that form a coiled coil, and sets of relatively unstructured loops that are distal to the plasma membrane and contain most or all of the protective epitopes. The primary structure of different VSGs is highly variable, typically displaying only ~20% identity with each other. The genome has nearly 2000 VSG genes, which are located in subtelomeres. Only one VSG gene is expressed at a time, and switching between VSGs primarily involves gene conversion events. The archive of silent VSGs undergoes diversifying evolution rapidly, also involving gene conversion. The VSG family is a paradigm for alpha helical coiled coil structures, epitope variation and GPI-anchor signals. At the DNA level, the genes are a paradigm for diversifying evolutionary processes and for the role of subtelomeres and recombination mechanisms in generation of diversity in multigene families. To enable ready availability of VSG sequences for addressing these general questions, and trypanosome-specific questions, we have created VSGdb, a database of all known sequences.

DESCRIPTION:
VSGdb contains fully annotated VSG sequences from the genome sequencing project, with which it shares all identifiers and annotation, and other available sequences. The database can be queried in various ways. Sequence retrieval, in FASTA format, can deliver protein or nucleotide sequence filtered by chromosomes or contigs, gene type (functional, pseudogene, etc.), domain and domain sequence family. Retrieved sequences can be stored as a temporary database for BLAST querying, reports from which include hyperlinks to the genome project database (GeneDB) CDS Info and to individual VSGdb pages for each VSG, containing annotation and sequence data. Queries (text search) with specific annotation terms yield a list of relevant VSGs, displayed as identifiers leading again to individual VSG web pages.

CONCLUSION:
VSGdb http://www.vsgdb.org/ is a freely available, web-based platform enabling easy retrieval, via various filters, of sets of VSGs that will enable detailed analysis of a number of general and trypanosome-specific questions, regarding protein structure potential, epitope variability, sequence evolution and recombination events.
Original languageEnglish
Number of pages8
JournalBMC Bioinformatics
Volume8
Issue number1
DOIs
Publication statusPublished - 2 May 2007

    Fingerprint

Keywords

  • Antigenic Variation
  • Coiled Coil
  • Trypanosoma Brucei
  • Temporary Database
  • Source File

Cite this