Repbase Reports |
---|
2003, Volume 3, Issue 5 |
May 31, 2003 |
Copyright © 2001-2016 - Genetic Information Research Institute |
ISSN# 1534-830X |
Page 87 |
GYPSY7-I_AG |
|||
---|---|---|---|
GYPSY7-I_AG is an internal portion of the GYPSY7_AG LTR retrotransposon - a consensus sequence. |
|||
Submitted: 31-May-2003 |
Accepted: 31-May-2003 |
||
Key Words: LTR retrotransposon; Gypsy clade; Gypsy group; 4-bp TSD; gag; protease; reverse transcriptase; integrase; env; GYPSY7_AG; GYPSY7-LTR_AG; GYPSY7-I_AG |
|||
Source: consensus |
Organism: Anopheles gambiae str. PEST |
Taxonomy: Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles |
|
[1] |
Authors: Kapitonov,V.V. and Jurka,J. |
||
Title: GYPSY7_AG, a family of LTR retrotransposons from African malaria mosquito. |
|||
Journal: Repbase Reports 3:(5) p. 87 (2003) |
|||
Abstract: GYPSY7_AG is a young family of gypsy-like LTR retrotransposons. GYPSY7_AG belongs to the Gypsy group of the Gypsy superfamily. GYPSY7-I_AG, an internal portion of GYPSY7_AG, is flanked by GYPSY7-LTR_AG LTRs. The GYPSY7-I_AG consensus sequence was reconstructed based on multiple alignment of 5 copies; they are ~0.4% divergent from the consensus sequence. The consensus sequence encodes the Gypsy7_AG1p 434-aa gag-like protein (pos. 507-1808), the 1061-aa Gypsy7_AG2p pol-like protein (pos. 1756-4939), composed of the protease, reverse transcriptase and integrase domains, and the Gypsy7_AG3p env protein (pos.4962- 6506). GYPSY7_AG1p: MEALAGRIAALEARFSESNVTDDFQDPPLFFTKQDGSAVDPESFEKIPGVVKDLPIFCGDPSELNSWIND VDGIIRLYQTISSHSLEKQNKFHMICKFIRRKIRGEANDALVASNVGINWNMMRKTLITYYGEKRDLETL DFQLMSVYQKGRTLEVYYDEVNRLLSLIANQIQTDDRFNHPEASKAMIGTYNKKAIDAFIRGLDGDVYKF IRNYEPTSLAAAYSYCISFQNLECRKMLTKPKHFNTPPSAPRNQIPLPTPHLPPRVFQHQQRPMTANNVR PHFAHHPPIQNFAGNFTQRPVWNQPNQQRPIFQRTNFNQPNQMKNFTQQRNNFRQNGPEPMEIDPSIRSH QVNYANRPNSSNIRPLKRQRAFNIEAVPRRELEPTSYEDNLYDDDVESQASYERYMRNVEKQEKLNENSH YDEISREAELNFLG GYPSY7_AG2p: KFSLRRNFSRSRIKFFRLKSALPYFIYHGKAGQQIKILIDTGSNKNFINPLHAKISHDVIKPFFVSSVGG DLLITKYSQAQIFAPYSDVNVKFYHLQGLKSFDAIIGYDTIKEMGAFVDAKRDNLVLENFIIPLSLHPLQ EVNRIEIRDTHLNHQEKEKLHLFLNKFQDLFQPPDEKLPFTTKVEATIATNDTEPIYCKSYPYPLSLKQE VETQIKKLLNNGIIRPSRSPYNSPVWIVPKKVDASNEKKYRLVIDYRKINLKTKSDRYPIPDTSTVLANL GNNKYFTTLDLASGFHQIRLAEKDIEKTAFSINNGKYEFLRLPFGLKNAPSIFQRVMDDVLREHIGKICH VYIDDIIVFGKTFDEHLKNLEIVLNTLREANFKIQPDKSEFLRTEVEFLGFIVSEYGLKPNEKKIESILK YPEPQTIRELRSFLGLSGYYRRFVKNYAALAKPLTKLLRGEDGQGHCKITKNQSKNFPIKLDDDAKRAFK TLKEVLSSDDVLAYPDFDHDFILTTDASDKAIGAVLSQNVNGVEKPITFISRTLSKTEENYATNEKEMLA IVWALHSLRNYIYGAKIIILTDHQPLTYAMSPKNNNAKLKRWKAFIEEHNYELSYKPGKTNVVADALSRI QINSLTPTQHSAEEDDLSFIPSTEAPINVFRNQLIFQKGTISSYEFVNPFPKFKRHTFIEPQFSIDFIKD KLKRFMIPGIINGIFTDEPTMGIIQETFKNLFNISTMKARFSQTQVQDICDQEQQIEEIRKIHNFAHRNA KENSLQAIKKFYFPSMRNKIEQYVKNCETCKVEKYERRPPEYIPVKTPIPKYPGEIVHVDIFAYNANFLF ISSMDKFSKYLKLKPIKSKSIADVKEVLLQLLYDWNLPRQIIFDNECTFVSNVIEQSILNLGVSIFKTPV NRSESNGQVERCHSTIREIARCTKGLNPDMSLITLIQQAVYKYNNTIHSFTKETPRKVYIGEQSEELSFR DRSKLKEKIESKIIKIFEEKNEKIKDDKYQDYEPNQFAYEKNKTMNKRDSRYKTVVVKENHPTYIIDSNN RKIHKINLRKN GYPSY7_AG3p: LIFYYRIAFFTLYGVLQASINIFDLTNNPLAIVPLGQAKIRIGYLRTIHPIDLTELEEIISRVFENSTNS TGKSPLQSLINLKLEKLNATISKIRPRRLRTKRWNSIGTAWKWIAGSPDAEDLTIINTTLNSLILQNNEQ LLINNGLSRRFQETTNIANHVIDLQNRIQREHQTEIQQIIKIANLDALQAHIKTLQEAILAAKHGIPNSE LLSIEDLNTVAEFLAQNGIYYTSVEEMLTQATAQVTMNSTHVIFMLKFPRLSYETYEYNYIDSIIQNDKR ILIKHNYIIRNLTHMFELPQPCIDQSSHQLCESKDLEEPSRCIRQLVQGEHTECMYEKVYSTGLVKHINN ANILLNDATAEISSNCSNINHILNGSYLIQFHNCNIFINGELFPSTEVSITGKPYISTLGLIAKEDGIRD EPSIEHLRNITLQHREKLHTISLVNNSLTWKLHIFGSIGLTTIVLITIAILYFITSIRRTKISLNIPTNN TNRQDVHHIETFVKKPTTFHALGRL
|
|||
Derived: [1] (Consensus) |
|||
Download Sequence - Format: IG, EMBL, FASTA |
|||
References: |