Repbase Reports

2003, Volume 3, Issue 9
September 30, 2003
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 175

GYPSY17-I_AG

GYPSY17-I_AG is the internal portion of the retrotransposon GYPSY17_AG (consensus sequence).

Submitted:
30-Sep-2003
Accepted:
30-Sep-2003
Key Words:
LTR retrotransposon; Gypsy clade; mdg1 lineage; 4-bp TSD; gag; AP protease; reverse transcriptase; RNase-H; integrase; GYPSY17_AG; GYPSY17-LTR_AG; GYPSY17-I_AG
Source:
consensus
Organism:
Anopheles gambiae str. PEST
Taxonomy:
Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles
[1] Authors:
Tubio,J.M.C., Costas,J.C. and Naveira,H.F.
Title:
GYPSY17_AG, a member of the mdg1 lineage of the Ty3/gypsy group of LTR retrotransposons
Journal:
Repbase Reports 3:(9) p. 175 (2003)
Abstract:
GYPSY17_AG is a family of gypsy-like LTR retrotransposons that, according to the amino acid sequence of its ORF2, groups phylogenetically with Drosophila representatives of the mdg1 lineage. GYPSY8_AG, GYPSY9_AG, GYPSY10_AG, GYPSY11_AG, GYPSY12_AG, GYPSY13_AG, GYPSY14_AG, GYPSY15_AG, and GYPSY16_AG are other members of this lineage in Anopheles gambiae. The GYPSY17-I_AG consensus was reconstructed from a multiple alignment of 6 copies. The consensus encodes the 454-aa GYPSY17_AG1p gag-like protein (pos. 1761-3122) and the 1260-aa GYPSY17_AG2p (pos. 2966-6745). The sequence of the LTRs flanking GYPSY17-I_AG is deposited as GYPSY17-LTR_AG.

GYPSY17_AG1p:
NPNFDHMTKLKEIVRRLELIHKTLQQNKGNIRQCALATYRLQVDEIYSVFRKEIETNYDKYSDSEIKFYN
NIIQNLITNIVEKVNNETINTDNNTSDLNETLKSHKKLTLKTTAHVIISILSIYKRQKQIVPTANIDHTA
IKTNTSVDSSYKPNMDALEILKTATSLIPTFSGRYDEAEAMLAALETMKEAVDEQHHRLIMRVVQSKLKG
KGRKIIGKTVTNIEDALAKIGAYVKKTESPEDIATAIHALKQKTTPKDFGEEIQALAEELEQAYLGEDVA
PALATAKTNKIAMAAFGKGLKKEIHQAIVLSGTIPTLDAAIRAIISIDKTNQSTQDKKSDNRQNGQENRY
SSNQRQTNTRGIDQRQNNNNWRFPPQQNNNSWRSQPQQNNNNWRSPQTQNGRQGAGFNTRNRQPAGPNFL
GQREPQRAILYTQMETQNPQVDQPSTSHYGQHTQ

GYPSY17_AG2p:
TGSRVQHPKQATRRPKFFRATRTAKGNPLHANGDPEPTGGPALHKPLRATYTINVQKSNFIRTRLGLADS
ICNLFVDSGSDISIIKGNKVRPTQIYKPKDIVDIISVGEGTITTHGSTITDVIVEGKKIQQLFHIVPDNF
KIPADGILGRDFFMNHRCIINYDTWIFSVKHNGEFLEAPIEDTINGKTLIPPRCEVIRKLDKLKELDTDA
VVCAEQLQEDVLVGNCIVNKNYPFIKIINTSNKAKLVNISHIKTIPLNEFEIVKTSNHKDENRLAIIKDL
IRKENISEDTDKSFEQLLLSYNDIFHLPNDHLTTNNFYEQDIKLEDKRPVYIPNYKQNHSQGPEIKKQIE
KMLQDDVIEHSVSHYNSPILLVPKKSSDEKKWRLVVDFRQLNKKLLPDKFPLPRIDSILDQLGRAKFFST
LDLMSGFHQIPLEESSKKYTAFSSTDGHYQFKRLPFGLNISPNSFQRMMTIAMTGLTPECAFVYVDDIVV
VGASENHHLKNLEKVFERLRHYNLKLNPEKSCFFKKEVTYLGHKITDKGILPDDSKYDSIKNYPIPQNAD
DARRYVAFCNYYRKFIPNFALKAKPLNSLLKKNTKFEWTQECQEAFEYLKNTLISPQILQYPDFSKQFIL
TTDASTIACGAVLAQEHDGIDMPICFASRTFTKGEANKAIIEKELAAIHWAIMHFKHYLYGTKFTVKTDH
RPLVYLFGMKNPSSKLTRMRLDLEEFDFTVEFVKGKQNVVADALSRIKITSDEIKSINVITRSMNKPVTS
DNVLGNTSESDQLKMFHALAYDEVKDLPKLETSVKRNENTIELIGKILNKRKSKELLSVRDIHLNTDIGL
QEPLLVKDFQKRKEKSAIVQFIKNIEKKLVMKSITQLAVSETDEIFKEVNPNEFKQIANNHLKNIQILIY
TKPQTINDEKTINDILDKVHNTPTGGHIGQYRMYKKIRKEYVWNKMKKSIKDFLDKCITCKLNKHLIKTV
EPFVKTDTPNIPFEVVSIDTVGPFQKTNNNNRYAVTLQCNLTKHVTVIAIPNKEANTVARAVIEKFMLIY
GTNIKEFRTDMGTEYKNEIFKNISEILRIEHKFSTPYHPQTIGALERNHRCLNEYLRIFTNEHKDDWDDW
INYYSFAYNTTPNLDHGYTPFELVFGRNEKISTNITEKSTPLYNYDDYSKEFKYRLKLAHDRTRKHIEQE
KMKLLKEQQNINQVNFQIGDQIALTNENRTKLDPVYKGPYKVKEINGPNMIIENTDGVTQNIHKNRAIKI
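The coordinates quoted in the abstract can be cross-checked against the stated protein lengths with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the positions are 1-based and inclusive, and that each range covers exactly the translated codons. The helper name orf_length_aa and the variable names are hypothetical.

```python
def orf_length_aa(start: int, end: int) -> int:
    """Return the number of codons in a 1-based, inclusive coordinate range.

    Assumption: the range contains only whole codons.
    """
    span_nt = end - start + 1
    if span_nt % 3 != 0:
        raise ValueError(f"range {start}-{end} is not a multiple of 3 nt")
    return span_nt // 3


# Coordinates as given in the abstract (positions on the GYPSY17-I_AG consensus).
orf1 = (1761, 3122)  # GYPSY17_AG1p, gag-like protein
orf2 = (2966, 6745)  # GYPSY17_AG2p

len1 = orf_length_aa(*orf1)
len2 = orf_length_aa(*orf2)

# Nucleotides shared by the two ranges (ORF2 starts before ORF1 ends).
overlap_nt = orf1[1] - orf2[0] + 1

# Reading-frame offset of ORF2 relative to ORF1 (0 would mean the same frame).
frame_offset = (orf2[0] - orf1[0]) % 3

print(len1, len2, overlap_nt, frame_offset)
```

Under these assumptions the ranges give 454 and 1260 codons, matching the lengths stated in the abstract. The two ranges overlap by 157 nt, and the second range starts in a different reading frame from the first.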
Derived:
[1] (consensus)