Repbase Reports |
---|
2003, Volume 3, Issue 3 |
March 31, 2003 |
Copyright © 2001-2016 - Genetic Information Research Institute |
ISSN# 1534-830X |
Page 57 |
CR1-6_AG |
|||
---|---|---|---|
CR1-6_AG is a CR1-like non-LTR retrotransposon - a consensus sequence. |
|||
Submitted: 31-Mar-2003 |
Accepted: 31-Mar-2003 |
||
Key Words: non-LTR retrotransposon; CR1 clade; DNA/RNA-binding; PHD finger; AP endonuclease; reverse transcriptase; CR1-6_AG |
|||
Source: consensus |
Organism: Anopheles gambiae str. PEST |
Taxonomy: Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles |
|
[1] |
Authors: Kapitonov,V.V. and Jurka,J. |
||
Title: CR1-6_AG, a family of CR1-like non-LTR retrotransposons from African malaria mosquito. |
|||
Journal: Repbase Reports 3:(3) p. 57 (2003) |
|||
Abstract: CR1-6_AG is a young family of CR1-like non-LTR retrotransposons. The CR1-6_AG consensus sequence was reconstructed based on multiple alignment of ~10 copies identified in the sequenced portion of the genome. Given the ~2% divergence of these copies from the consensus sequence, transposition of CR1-6_AG occurred less than 1 million years ago. The 3' terminus of CR1-6_AG is composed of the ATAAAC microsatellite. CR1-6_AG encodes two protein sequences: a 358-aa CR1-6_AG-ORF1p (positions 225-1299) and 998-aa CR1-6_AG-ORF2p (positions 1300-4293). CR1-6_AG_ORF1p is DNA/RNA binding protein composed of the PDH domain (aa positions 4-57). CR1-6_AG-ORF2p is composed of the AP endonuclease (aa positions 50-230) and reverse transcriptase (aa positions 550-780) domains. CR1-6_AG-ORF1p: MAAICFACAVSLDAADCIVGCAYCEATFHRGCCRLPSELIDAVLTHIDLHWSCTGCTNILKNPRCRSVKE IGAQVGFQAALTSTVAAVGKLIEPIIAEVRSGFTLLQNAPIPQLCNTDPRPVAGRKRRRIIEDSMSPDVN KNVNIRQNNMFAASSPSAYTNTTVGIPPSSTLPEELMGTDSLSSPLRAAFPQPATDRIWIRLSRLSTAVT VEQVVASVKRRLATDDVLAYCLLRKGVSVDSVNWLSFKVRVPAALRDAALAPSSWPVGIGVREFVQSRQR EHGHSSSPITIKHRSLTRTPVVIDRRSMPRTPTSTVYHAPAHASTSQAQTLTSPQLGEHTLNDTTHGPNS TLIDGPLLIRRTSNTNLQQTTLDRFFHE CR1-6_AG-ORF2p: FAALPTPTYNRPHLTVSFTNRRTSVKSAQTPTCYVPAANALRTARSTASVYYQNVRGLRTKVDEFRLSVL ESNFDVIVLTETWLDPSLPSALLFDDSFRVYRCDRSVDNSTCSRGGGVLIACSQSLTSREHTTVHPSLEL VCVVIQLGNSRLFIIAAYLPPRLAANAATLREIENCIRSLCSTMHPGDGLLLLGDFNQPLVSWSAAQHDP DLPFLHYEPRTRSALSALFMDEMHHSGLFQINGHLNTSGRVLDLVFANNAVASVCLPLELCLTPLLAIDT YHPALELAIPLPREESAVPALTSRLDYARTDFNRLLPMIASFANVFDCSHYATLDLAVKDFERFMLQALN ECTPVKRPKRGPPWGDRTLRRLKTAKAAAYRDYLLRRCPAALRNYNTAHSLYRRYNRFRYLGHVRRTVLR CRGNSRVLWNFANNRRKSSGFPSSVSYNGRNGNNPSAVCDIFASRFAATFLPAVTDERQIADALSNVPVD AMAPNLPIIDEYSVSKAIDRLKSSFAPGPDGIPASTLKRCGTTIAPILASIFRDSLRSGIYPACWKTSWL VPVHKKGDKSNACNYRGITSLCACAKVFELLVYEPLLAAASNYISTAQHGFVPQRSTTTNLVEFVSLCHR TIDAGSQIDAVYTDIKAAFDSVPHALLLAKLETLGLPVQLLAWMRSYLTGRTYCVKMGPHTSRRIDASSG VPQGSNLGPLLFVIFLNDVTRLLPPDGHLLYADDAKLFLPIRDRSDQLRLQATLSAFQSWCSLNGLELCV EKCVVVTFARKRCPLVYDYALNGSTIGRKSCVTDLGVLLDEKLSFHDQLEHVVTKGNQLIGLLKQIARDI TDPICIKTLYCALVRPVLEYASVVWWPTAARPLARLESIQRKFTRFALRSWSVQLDYEGRCALLGIETLK QRNCNAQRLFVAGLLDNRIDSPALLSRLNMYVPPRSLRARSLLDVEERRTRFGSSDPFIRMCREFNVICD RHQPDMSRTALLNSIRVV
|
|||
Derived: [1] (Consensus) |
|||
Download Sequence - Format: IG, EMBL, FASTA |
|||
References: |