-
Notifications
You must be signed in to change notification settings - Fork 0
ERV Project Data
Putative ancestral open reading frames (ORFs) encoding accessory genes have been identified in many endogenous lentiviruses. In some cases these represent clear homologs of the accessory genes found in exogenous lentiviruses, but in others it remains unclear whether they are homologs of previously described accessory genes or distinct genes. We therefore extended the set of lentivirus genome features defined in our core project to include these putatively distinct genes.
For each lentiviral paleovirus species, we have defined a "master" reference sequence. These represent reconstructed genomes of ancestral lentiviruses, which gave rise to ERV lineages.
Click to download each sequence in FASTA format:
- Rabbit endogenous lentivirus K (RELIK): Download RELIK-con
- Prosimian immunodeficiency virus 1 (PSIV1): Download PSIV1-con
- Prosimian immunodeficiency virus 2 (PSIV2): Download PSIV2-con
- Mustelid endogenous lentivirus (MELV): Download MELV-con
- Dermopteran endogenous lentivirus (DELV): Download DELV-con
- Springhare endogenous lentivirus (SpELV): Download SpELV-con
- Genome feature annotations: View feature annotations
- Auxiliary metadata in tabular format: View table
Figures illustrating snnotated paleovirus reference sequences, as collated from published manuscripts, are available in PDF format for each species:
- Rabbit endogenous lentivirus K: RELIK
- Prosimian immunodeficiency virus 1: pSIV-1
- Prosimian immunodeficiency virus 2: pSIV-2
- Mustelid endogenous lentivirus: MELV
- Dermopteran endogenous lentivirus: DELV
- Springhare endogenous lentivirus: SpELV
-
FASTA sequences: these sequences represent lentiviral ERVs identified in the whole genome sequences of vertebrate species. ERVs were identified by screening genomes using the database-Integrated Genome Screening Tool. A complete list of genome assemblies screened is available here.
-
GenBank XML sequences: ERV sequences that were generated via PCR and submitted to GenBank.
- Genomic location data: View table
- Input parameters for DIGS: View parameters
We have applied a systematic approach to naming lentivirus ERV lineages and loci.
Each element was assigned a unique identifier (ID) constructed from a defined set of components.
The first component is the classifier ‘ERV’ (endogenous retrovirus).
The second component is a composite of two distinct subcomponents separated by a period:
(i) the name of lentivirus the ERV derives from;
(ii) a numeric ID that identifies a unique insertion locus. Orthologous copies in different species are given the same number.
The third component of the ID defines the set of host species in which the ortholog occurs.