Repbase Reports |
---|
2002, Volume 2, Issue 3 |
March 31, 2002 |
Copyright © 2001-2016 - Genetic Information Research Institute |
ISSN# 1534-830X |
Page 2 |
G5_DM |
|||
---|---|---|---|
G5_DM is a non-LTR retrotransposon - a consensus sequence. |
|||
Submitted: 31-Mar-2002 |
Accepted: 31-Mar-2002 |
||
Key Words: Non-LTR retrotransposon; ORF1; ORF2; DNA binding protein; AP endonuclease; reverse transcriptase; RNase H; JOCKEY clad; G5_DM |
|||
Source: consensus |
Organism: Drosophila melanogaster |
Taxonomy: Eukaryota; Metazoa; Arthropoda; Tracheata; Hexapoda; Insecta; Pterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila |
|
[1] |
Authors: Kapitonov,V.V. and Jurka,J. |
||
Title: G5_DM, an ancient family of non-LTR retrotransposons from the Jockey clad. |
|||
Journal: Repbase Reports 2:(3) p. 2 (2002) |
|||
Abstract: G5_DM belongs to the JOCKEY clad of non-LTR retrotransposons. Its copies have poly(A) 3' tail and they are flanked by ~10-bp direct repeats generated upon integration into genome. G5_DM forms a separate family of retrotransposons that were recently (several million years ago) active in the Drosophila melanogaster genome. There are about 15 copies of G5_DM in the genome, they are ~4% divergent from the consensus sequence and ~10% divergent from each other. Although the G5 copies have accumulated multiple mutations, the consensus sequence contains two ORFs (positions 411-1976 and 1966-4683) that encode the 522-aa G5p1 and 906-aa G5p2 proteins, correspondingly. G5p1 is a putative DNA/RNA binding protein: MDNGPQQKSIDYESMMLIAGAGPKEIREQMKKSWKSETPASATTGISNTYCSSTFTLTSTSSTISSAYIF TSSLSGNICSTAASTIHKSASVSSIAKPQLATGWHDVSFKSRNGGIKKRAGAVFSPKMAKKVATDMPAKI SANCFQVLSDDEDMVVGEASSSDDDEPCSSNTALKRAARKAGPKQQLKGAHQTVPAAPKSSRHSKVPRMM FPNVVNFTAFRSELDALVGDSYTIKVLNSGDCAVQCNSPDSYRLVARHFLDKGSLFHHHQLPEDRPYKIV MRNIHHGVPSEDIIATLQNEGHNVVRIYTPRNKATSLPLNMRFIDLKKAENNNQIKGISVVCRHRVIWEK PRKQSEPIQCHRCQGYGHTKAYCSRHYICRECGENHPTAECKLEQDEARFCFHCGGPHAANFKGCKKYLL EASNRKNQRKVNEPSGSGPARGPHQPCPPAHMSGKPSFANFVRGSQPVAKPAVIVPHASANLESKLEQLF IRLDRMMSLVETLMQLLLQTRTFPSAAQNGSS G5p2 is composed of the endonuclease, reverse transcriptase and RNase H domains: MGPLKVAAWNANGASSKTNEILAFIELHEIDILLLSETHFVSRSTFRVPGFTLHTANHPDDSKRGGAAIL IRSLISHLPFSTLSENHIQTAVIQLTASRGTFNIASVYCPPNLRWTEADVELIIAQFGTKFLAAGDWNAK HRWWGNYRMCTRGRVLFSALAGEGIDIVATGEATCYPFRASATPSAIDFGISKGFRQQEINVQLLTELSS DHLPLLFELDEDAQLFKGVTKMLSPTANTVAFKEHIEATVDLNIPIDTCNSLEAYVDYLAATIAEAARRA TPPPHQARHTTARRAPILSLEARELLSHKRRLRRRYIATGDPSIKQLYSSTTNKLHRLLARTRRENLDTL LEGTGPDNNSHFSLWRLTRGIKRQPLFQSPVQSHSGLWLKTDDEKARAFASHLTSTFMPFNLTDDSNRVA IINFLDTPTAPARPIRHTTPQEVIMQLKALQIKKTPGYDGIDNRAAKSLPRKGVLALVKIFNAMLRLGHF PRQWKRARIIMIPKAGKPPTKIDAYRPISLLSTFFKIFERILLARLMELPQVVNHIPRHQFGFRKSHGCP EQIHRLVNQVTHGFEHKLYTVGVFLDVKQAFDRVWHEGLLYKMKALLPAPYYAILRSFISHRTFDVAVRD ARSSLEEIHAGVPQGSVLGPFLYTLYTADLPSPANNTEVSPDQLLLATYADDTAMLASHPVLQTASNAVQ EWLHAVEKWTAKWNVAINSSKSACVTFTLRPGTCTDLTFDGNPINNVTSHCYLGVHLDRRLTWRAHITSV KFKSLAKLKKLDWLFHSSKLQMSSKALLIKAILAPTWSYAIQVWGTAAKSQLNRLRVVQSRAARHASGLP WYVTNQVIERDLKVTPLGDQINFHSSRYADRLMVHPNRLANILANPISLRRLKRVHPTDLPTRRIV
|
|||
Derived: [1] (Consensus) |
|||
Download Sequence - Format: IG, EMBL, FASTA |
|||
References: |