Repbase Reports |
---|
2003, Volume 3, Issue 4 |
April 30, 2003 |
Copyright © 2001-2016 - Genetic Information Research Institute |
ISSN# 1534-830X |
Page 79 |
GYPSY5-I_AG |
|||
---|---|---|---|
GYPSY5-I_AG is the consensus sequence of the internal portion of the GYPSY5_AG LTR retrotransposon. |
|||
Submitted: 30-Apr-2003 |
Accepted: 30-Apr-2003 |
||
Key Words: LTR retrotransposon; Gypsy clade; 4-bp TSD; gag; AP protease; reverse transcriptase; integrase; GYPSY5_AG; GYPSY5-LTR_AG; GYPSY5-I_AG |
|||
Source: consensus |
Organism: Anopheles gambiae str. PEST |
Taxonomy: Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles |
|
[1] |
Authors: Pavlicek,A., Kapitonov,V.V. and Jurka,J. |
||
Title: GYPSY5_AG, a family of LTR retrotransposons from African malaria mosquito. |
|||
Journal: Repbase Reports 3:(4) p. 79 (2003) |
|||
Abstract: GYPSY5_AG is a family of autonomous gypsy-like LTR retrotransposons. GYPSY5-I_AG, the internal portion of GYPSY5_AG, is flanked by GYPSY5-LTR_AG LTRs. The GYPSY5-I_AG consensus sequence was reconstructed from a multiple alignment of 15 copies, which are less than 1% divergent from the consensus sequence. Two elements are 100% identical to the consensus and contain an intact ORF, so the family appears to be still active. The consensus sequence encodes the 1469-aa GYPSY5_AGp protein (pos. 35-4441), composed of gag (zinc finger, pos. 309-357), protease (pos. 403-492), reverse transcriptase (pos. 668-837) and integrase (pos. 1181-1332) domains. GYPSY5_AGp: MLTKEELLCALEVANIEVPPKATLPQLRMLYEQSVPKNKMEEQSTQNFIPQRVCADEDEVTNNGNHVAAA AILMKDKAAPTSSTQGASFDATSHQLELMALRAKIMEMEQRQTFTDGRLVHPEELKHLIPEFSDGLGINK WINTIRYNSELYGWQDRTMLLYAGSRLTGAASEWYNGFRNTLKTFDEFADTIKKAFPDRCNEAVIHSQLA SVYKKISESYTSYVYRVNALGMSGHVSEEAIITYVIRGLSRDPLYDSLVTKDYRDIYDLIDNIKRYESHL LLRKNPERRSPSHINTISPRPIPPRQTTTEPLRCYNCSNHGHHSSQCTQPRRAPGSCFRCGSTSHVIRNC PVPDRRQLTVAAVQGNDNETAHLDSGENGNFVQLEAYQEVSVAFKRNNVWSPGLIITSLFDSGSSKSFIN EAIVPVTKLSAPQPSGFRGIGNVNLQTLGTVQLKLSFRNQTFIHNFYILPKSYMSLSMIVGRDLLSEFNI TLAQFRKHYSKLMLMNLNKDKILNLKKPGFYHKLQTLGLLRSSIQAPLPEVCKDSKLSDPNISSKFISIS DKSEHKQQYDTTFSEMCSINISDEASTINVGEHLSKEQGIALRSIVSNNYINFPDKHIIPSAHKMRISLT HDTPIFTKPRRLSFDERNKVKVIVKDLLEKNIIRPSNSPYASALVLVRKKNGEIRKCVDYRPLNKVTIRD NYPLPLIETCLEHLGNKKFFTLLDLKSGFHQVAMDEDSIKFTAFVTPDGQYEYTRMPFGLKNAPAEFQRF INTILRKFIENEKLVVYIDDILIASQDFKEHLEIVSEVLHTLRNNGLELRLDKCKFAYDELDYLGYKANH SGICPSDNHVKIIKNYPVPQNTKQVQQCLGLFSFFRRFVPHFSSIAKPLTNLLKNNVPFIFDDECKKAFE TLRDKLIVAPVLAIYDPKRETELHCDASSIAFGSVLLQKQDDGRYHPISYFSKTTSADEAKLHSYELETL AVIYALKRFHTYVHGIPIKIVTDCNSLVETLKNRNSSAKIARWSLFLENYNYIIQHRPGLAMSHVDALSR LEHLAAFDDVDIDFQIRVAQARDPLIQTLKKELETTDVEGYQLQDGIVFKRSPSNRLKLYVPTEMVNNLI RSIHEQIGHLGAEKCCNQIDQNYWFPNRKTRIINFINNCLKCIIHSAPSRVNNRNLHSIQKEPYPFDTLH IDHFGPLPTSSLKKKYLLVVIDAFTKFVKLYPTTSTSTKEVCNALKQYFSYYSRPKRIISDRGTCFTSAA 
FSNFLSSRGISHVLNATGSPQANGQVERVNRIIRPILSKLSNQTDHVDWVTHLLSTEYALNNTIHSSTRF SPSMLLFGVNQRGPSVDILTEYLEDKNKAFSDLETIRAEASFNILKSQQNNEKQHAKHHRPAPVFNEGEF VVIKNVDNTPNSNKKLIARYKGPYVIHKRLPNDRYVIRDIDGIQMTQIPYDGVLESDKLRRWIVPLGGA
|
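The coordinates quoted in the abstract can be cross-checked with a little arithmetic. The Python sketch below is only an illustration, not part of the Repbase record: it assumes that the ORF range 35-4441 is given in 1-based, inclusive nucleotide positions on the consensus, and that the domain ranges are amino-acid positions on GYPSY5_AGp (the abstract labels them only as "pos."). The function and variable names are hypothetical.

```python
# Values quoted in the abstract.
ORF_START, ORF_END = 35, 4441   # assumed 1-based, inclusive nucleotide positions
PROTEIN_LENGTH = 1469           # amino acids

# Assumed to be amino-acid coordinates on the GYPSY5_AGp protein.
DOMAINS = {
    "gag zinc finger": (309, 357),
    "protease": (403, 492),
    "reverse transcriptase": (668, 837),
    "integrase": (1181, 1332),
}


def orf_codons(start, end):
    """Return the number of codons in a 1-based inclusive nucleotide range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"ORF length {length} is not a multiple of 3")
    return length // 3


def domains_in_order(domains, protein_length):
    """Return True if the domains lie inside the protein, in listed order,
    without overlapping one another."""
    previous_end = 0
    for start, end in domains.values():
        if not (previous_end < start <= end <= protein_length):
            return False
        previous_end = end
    return True


codons = orf_codons(ORF_START, ORF_END)
print(codons, codons == PROTEIN_LENGTH)            # 1469 True
print(domains_in_order(DOMAINS, PROTEIN_LENGTH))   # True
```

Under these assumptions the 4407-nt span corresponds to exactly 1469 codons, matching the stated protein length, and the four domains fall within the protein in the order gag, protease, reverse transcriptase, integrase.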
|||
Derived: [1] (Consensus) |
|||
|||