Repbase Reports |
---|
2004, Volume 4, Issue 3 |
March 31, 2004 |
Copyright © 2001-2016 - Genetic Information Research Institute |
ISSN# 1534-830X |
Page 96 |
GYPSY52-I_AG |
|||
---|---|---|---|
GYPSY52-I_AG is an internal portion of retrotransposon GYPSY52_AG - a consensus sequence. |
|||
Submitted: 31-Mar-2004 |
Accepted: 31-Mar-2004 |
||
Key Words: LTR retrotransposon; Gypsy clade; CsRn1 lineage; 4-bp TSD gag; AP protease; Reverse Transcriptase; RNase-H; integrase GYPSY52_AG; GYPSY52-LTR_AG; GYPSY52-I_AG |
|||
Source: consensus |
Organism: Anopheles gambiae str. PEST |
Taxonomy: Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles |
|
[1] |
Authors: Tubio,J.M.C., Costas,J.C. and Naveira,H.F. |
||
Title: GYPSY52_AG, a member of the CsRn1 lineage of the Ty3/gypsy group of LTR retrotransposons in Anopheles gambiae. |
|||
Journal: Repbase Reports 4:(3) p. 96 (2004) |
|||
Abstract: GYPSY52_AG is a family of gypsy-like LTR retrotransposons that, according to the aminoacid sequence of its Reverse Transcriptase, RNase and Integrase is phylogenetically grouped with representatives of the CsRn1 lineage of other organisms. GYPSY48_AG, GYPSY49_AG, GYPSY50_AG, GYPSY51_AG and GYPSY53_AG are other members of this same lineage in Anopheles gambiae. The GYPSY52-I_AG consensus was reconstructed after multiple alignment of 7-8 copies. The consensus encodes the 327-aa GYPSY52_AG1p gag-like poliprotein (pos. 922-1902) and the 1137?aa GYPSY52_AG2p pol-like poliprotein (pos. 1906-5316). The sequence of the LTRs flanking GYPSY52-I_AG is deposited as GYPSY52-LTR_AG. GYPSY52_AG1p: MQRSPQQGAAPVASPAVAISPAVAASPAVVASPAAGLPTPSPYLIDITPLARDASTDLPE FQVETVNAMRLKPPELDTADIHTFFYALENWFDAWNISPHHHVRRFNILKTQIPTRIPPE LRPILDSVPSTDRYESAKKAIIQHFEESQRSRLHRLLSEMSLGDRKPSQLLAEMRRTANG AMTDSMLIDLWIGRLPPYVQSAVIASSQNADEKVKVADSVVDSFALYNRSGPYQTIAEVR NEEVNRLSRQVAELSQRLETLMNQNQARERSRARSRSRNRNTNQATNPNTNGYCFYHDRY GQQARNCRAPCSFNSRPQNNGTPTTSA GYPSY52_AG2p: RNAGDVQLLVDSISSASHRLIIKDYKTNQPFLIDTGADVSVIPRQHSSVPCKPSTMKLFA ANSTPIQVYGESLYTLDLGLRRAFLWNFVIADVGTAIIGADFLQHFHLLVDLRDKCLVDA LTNLRTPGVPDPHPGEPTVKVCDSTSPIATLLREFPGLTARTAPGTLLQSDVTHRIETTG QPTFARPRRLSPEKYAAARAEFESLVQLGVCRPSNSSWASPLHMTKKADGTWRPCGDYRA LNAKTIPDRYPLPFLQDFTMHLQGKTIFSKVDLHKAYHQIPIHPEDIPKTAITTPFGLFE FTTMPFGLRNAAQTFQRLIHDVLRGLDFVFPYIDDMIVASSSEEEHHEHLRQLFQRLEQH HLAINPAKCEFNRSEIAFLGHLVNAEGIRPLPERVRAISELSKPATIMELKKFLAMINYY RRFLPHALTTQSILLEMTPGNKKKDKTPLKWTPESSEAFDRCKEQLQQAALLAHPALNAE LSLWTDASDFAAGAVLHQRIDGQLQPLGFFSKKFEKAQLNYSTYDRELTAIYLAVRHFRY QLEGREFCIYTDHKPLTFAFRQTLDSSSPRRARQLDFIGQFSTDIRHVSGEENITADLLS RIEIVNASPAIDFERLAEEQTNDPELADILSGKTRTDLFLQKTPIPGSTQSLYADCPGGI IRPYITRSFRNQLLHAVHDLSHPGARATAKLMTERFVWLDIKRDAQEFARNCLACQRAKI GRHTKSPLVPYPATQDRFSHINIDIIGPFPISNGNRYCLTIIDRYTRWPEAIPIPDITAT TVVSALLYHWIARFGVPSHVTTDQGRQFESALFKELTRALGTKHIRTTAYHPQANGLIER WHRTLKAAICCKDTSKWSEHLPLILLGLRTTFKNDINASPAELVYGTTLTIPAEFLIEKP QPAMVNQSDFAKTLREAMSKIRPSNTAWHTNRTSFVHSDLNKCSHVFVRNDTVRPALTTP YHGPYQVLTRNSKSFQILLNEHPSLVSVDRIKPACTTEGIISSAPQQPSPDQLLTSQGYA TPVAQPPSTQQSTMSQRLTTPMPGPSTLDQLPSYNQSSLPDQQPTATQSPSTIQLPSTNQ PPMSRNRATRTSPMPSLQPARATTSFAPPPPILRKDQNVSTGVTRSQRRVVIPLRYR
|
|||
Derived: [1] (consensus) |
|||
Download Sequence - Format: IG, EMBL, FASTA |
|||
References: |