Repbase Reports

2003, Volume 3, Issue 12
December 31, 2003
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 207

I-1_AN

Non-LTR retrotransposon from the I clade.

Submitted:
31-Dec-2003
Accepted:
31-Dec-2003
Key Words:
Non-LTR retrotransposon; I clade; endonuclease; reverse transcriptase; RNaseH; zinc knuckle; I-1_AN
Source:
consensus
Organism:
Emericella nidulans (Aspergillus nidulans)
Taxonomy:
cellular organisms; Eukaryota; Fungi/Metazoa group; Fungi; Ascomycota; Pezizomycotina; Eurotiomycetes; Eurotiales; Trichocomaceae; Emericella
[1] Authors:
Kapitonov,V.V. and Jurka,J.
Title:
I-1_AN, a non-LTR retrotransposon from Aspergillus nidulans.
Journal:
Repbase Reports 3:(12) p. 207 (2003)
Abstract:
Non-LTR retrotransposon. I superfamily. I-1_AN is a non-LTR retrotransposon from the I clade. This family was active during the last one million of years, or so. Some copies are less than 1% divergent from the consensus sequence. I-1_AN encodes two proteins: I-1_AN-1p (pos. 103-1583) and I-1_AN-2p (pos. 1581-5141). I-1_AN-1p is a putative DNA/RNA-binding protein similar to the ORF1 proteins encoded by non-LTR retrotransposons from the I superfamily. The protein includes the zinc knuckle-like motif (pos. 360-425). I-1_AN-2p includes endonuclease, reverse transcriptase and RNaseH domains. I-1_AN-1p: MEVDISPPGGTRPATPLLGENSDPPSGPTTPTPLPRNSLKRRALFSPQKTPTAAPVPVSHTPQAPSICEQ VGMVADDQLALLHDWKLAMTSLAKALDLTVSSLQGRPRDLARELAARFVTLAKQDSPQQISQMPVVAPPQ PPRQMEQPNHPPTPEASKSPLNRQTSQPTTWASLTAPRTGQGNWQTIAPEHHMQAKQTAQRRLKQSNKTD HRIFLRLPASSSLRAIGPHGIRVTLAGKVPDGITQVQVISTGYAITTTEQGKAFLLSEKAASLAGDGYFE IPTEYHQVIVSRIPKQLWSLDGWIDTTIADISMEAERITGIKPLMAKLSKHPVERDSITAVIAFPKKLQH PLQLFGLSGLSRPTRPKQRPLQCTRCHRFYDTRACRSSERCISCGSSKQEHNCRVQCINCCGPHAADFQK YPARPHIQRNTITRLSKDALAAICKAGRLAFQQEQKKAEESSKQQTDNTHTTNQPTRQLTQELLNQTLTS PEL I-1_AN-2p: MKILQANIGRGGAVYDLLLSFEADIILVQEPWTNTAKHLTKTHPQYQLFSPPTRWTARPRTLTYVQRDLP AHSLPEPISPDITTIYTAGLTIINVYRPPNNPVAPAGAGSTPSTLSTLLGYAPPENTILAGDFNTRHPFW QPDTESHAVTPGATGLLDWLDAHELELRLEPGTPTRGPNTLDLVFSNLPLRALVEDHLKTPSDHATIGII LEQEEPPPIYKLGSTNWEKARALASPPDPTLPIDLLAKQLVQISQLAIQGASRYNTRRLPRTPWWTPELT DILHQTRQQQNPDYKQLRKAIVQAKAEYWKQRIEQATAPIDAFKLAKWIQYPDQLAAPPLNIQGAQVTTP QGKADAFLNHLLEKGALLPNQTEEGPPNKPLGLLHLPTKEHCWAALCAPPPSAPGEDGLATTAWRELWPV LGDTITQLYYRCMEEGCFPLSLKSAKVIMLPKPGKRGYTQLNAWRPISLLSTLGKGLERLLAQQIAVRAI QADVLAPCHFRALPGRSAIDLVQVLVHRVEEAFQQGKDASLLLLDVKGAFDAVIHQQLLSHLRLQGWHKG LLQLLKDWLTGRSVSVHIKEGTATAPIKGRLPQGSPLSPILFLLYAARIVSTLEGSFCYADDMGILLTGN TLEESSQQLVEAYKQITALGTETGLPFSIEKTEIQYFSRKQQQHLPTVTLPGIGEITPSLYTQWLGVLLD TKLTFKAHINLVFSRGKRLAQHLKRLSNTQHSCPVASMQAAVIQYILPTALYRAEVFYTGKRQKGVVNSL LSLFCTAALAIIPAYKTTPTAALLREADLPDPEALLNSILQRAAVRYMSLDTKHPIAQIAAETTAGRPKT RLKRILQLLLSPLPERAIIELPLPPLCMLPTDNKGYSPAPLQISVYSDGSRTSQGAGYGYAIYFGPILVT KGHGPAGPRTEVYDAEIMGAVEGLRAALGQPCVGYSTQLVILLDNLAAASLLASYRPTPHRHGLSETFSQ LAAQWMESPSILTMQRKPLQVRWIPGHSGIAGNELADKLAKLGSSIYSPDIPPSPAYLQREAKQWLRTET YTAYANKAPETYKALNIRPHTKESRSREHKLPWWVLGRLVAARTGHGDFTAYHQRFNHSDYLESCSCGRT KTPVHFFFCPYTRKRWKDRWRCIRDGPSKTIDWLLSTAAGAEEFSRIVQESSFFKDICPNWARRSA
Derived:
[1] (Consensus)
Download Sequence - Format:
IG, EMBL, FASTA
References:

© 2001-2024 - Genetic Information Research Institute