Repbase Reports |
---|
2002, Volume 2, Issue 11 |
November 30, 2002 |
Copyright © 2001-2016 - Genetic Information Research Institute |
ISSN# 1534-830X |
Page 2 |
CR1-2_AG |
|||
---|---|---|---|
CR1-2_AG is a CR1-like non-LTR retrotransposon - a consensus sequence. |
|||
Submitted: 00- - 0 |
Accepted: 30-Nov-2002 |
||
Key Words: non-LTR retrotransposon; CR1 clade; DNA/RNA-binding; PHD finger; endonuclease; reverse transcriptase; CR1-2_AG |
|||
Source: Anopheles gambiae str. PEST |
Organism: Anopheles gambiae str. PEST |
Taxonomy: Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles |
|
[1] |
Authors: Kapitonov,V.V. and Jurka,J. |
||
Title: CR1-2_AG, a family of CR1-like non-LTR retrotransposons from African malaria mosquito. |
|||
Journal: Repbase Reports 2:(11) p.2 (2002) |
|||
Abstract: CR1-2_AG is a family of CR1-like non-LTR retrotransposons. The CR1-2_AG consensus sequence was reconstructed based on multiple alignment of ~100 copies identified in the sequenced portion of the genome. Given the ~2% divergence of these copies from the consensus sequence, transposition of CR1-2_AG occurred less than 1 million years ago. Integrations of CR1-2_AG have not produced target site duplications. The consensus sequence encodes two proteins: a 416-aa CR1-2_AG-ORF1p (positions 256 1503) and 996-aa CR1-2_AG-ORF2p (positions 1559-4546). CR1-2_AG_ORF1p is a putative DNA/RNA binding protein, which includes the PDH domain. CR1-2_AG-ORF2p is composed of the AP endonuclease and reverse transcriptase domains. The 3' terminus is composed of the AATA microsatellite. CR1-2_AG-ORF1p: MASVICKKCEGAISNDPIPCFGLCEHYYHDKCIGLSTPLLRDFKKSQNLFWACADCAQRLRAVDTLRFSH GLSRDAAYLLESLQSDFRDTSRSVQAASAGLRLELSSSLDCFRNEIALMKQESASSIRSVKDFIDSLTAS HSMERNYSQAPLLTTLDEVKHGIKELDLMHRELLTSFNSLMNKLNSHLATHTTTSSAHHSAIPATHSTTT IPVAASKLTHQAVGENPSKRRLLDRSPDPSPTNTVTRAMLSSGTGLSCNNITTVPERPPRTWVFISRIAP DTPIEAIREMACSNIGTDDILVYSLVRRDRDLSTLSYVSFKIGVPDSHRAIALAASTWPRGISFKEFIDL NPRSVNVWRPTTAASHAPSAPVTRESDHHSSPPSINHTTQLRNADFTISPDHGPMSLPYTQYFQQA CR1-2_AG-ORF2p: MEANPASNTHNPSDSLPPLITCSESTPGASRSLIPSIDRLNIYYQNVRGLRTKLDELRLSLSELDMDVLV LTETWLDGSIPSSLISEDAYVIYRCDRNSLNSNRCRGGGVLIACSSVLNTSTLSLPFDSLESVWTIVKLQ NLAIYIGAVYIPPDLRSSEVVLDDLHESVSFVAGKLKPNDLMVLLGDFNSPSLSWQPSASCVNQFIPTGV SRENVSLLDGMSVNGLLQLSGIKNIRGRQLDLLFANAAFLECCSPVIASPVPLVALDNHHPALETSVLLT HSRPASSQRIPTARMFNFRKLDYQKLHRILADTDWSFIDADCDINQAVAAFTNVITSAFPSCCPLLKPAP NPKWSNRALRLLKSDKNRAQRAYRLNNTLHNLCVYKYAAKAYRLLNRHLYRRYVRRLQMRFTIDPGSFFR FANSRRGSASLPSTLFLDLSSATSNPDICNLFAKHFSSVFVDPSTFKVPLDVGLSYTPSDVISCNSVVVS ESLVKSALSKLKTSFSPGPDGIPACVLKKCGNTLTPILTRLFSRSLSVGIFPSQWKLAWLVPIYKKGDRT LASNYRGISIICACSKILESIVHLSVMPCVKNYISTEQHGFMPNRSVSTNLMCFLSSLYHYLSSGKQVDT IYTDFKAAFDSIPLSLLVAKLRKLGFGGSILPWFNSYLENRSYAVKICGSFSECFLSSSGVPQGSVLSPL LFILFLNDCTSILPPNGFLLYADDVKIFLPVSSTADCLVLQSWLCKFSTWCASNGLVLCPEKCSVLSFFR SSTSITHAYSVCDAPIPRASLSKDLGVFFDPSLSFKEHTDYVINKANKSLGYICRMSTEIRDPFCLKSLY CCWVRSVLEYACVIWSPVQLSLLQRIERIQRRFTRIVFRRSLGHHSIPLPSYDDRCTLLGLAKLEHRLSV AQASFVAGILLNTIDTPSLLSRLHLYAPCRTLRYRFRLQLPICRTRFARNEPFVRAMSSFNSTSDLFDFN ISYPVYRSRLRSFSVP.
|
|||
Derived: [1] (Consensus) |
|||
Download Sequence - Format: IG, EMBL, FASTA |
|||
References: |