Repbase Reports |
---|
2004, Volume 4, Issue 11 |
November 30, 2004 |
Copyright © 2001-2016 - Genetic Information Research Institute |
ISSN# 1534-830X |
Page 291 |
Gypsy7-I_DR |
|||
---|---|---|---|
Gypsy7-I_DR is an internal portion of the Gypsy7_DR LTR retrotransposon - a consensus sequence. |
|||
Submitted: 00-Nov-2004 |
Accepted: 30-Nov-2004 |
||
Key Words: LTR retrotransposon; endogenous retrovirus; Gypsy superfamily; gag; protease; reverse transcriptase; integrase; Gypsy7_DR; Gypsy7-LTR_DR; Gypsy7-I_DR |
|||
Source: Danio rerio |
Organism: Danio rerio |
Taxonomy: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; Cyprinidae; Danio |
|
[1] |
Authors: Kapitonov,V.V. and Jurka,J. |
||
Title: Gypsy7_DR LTR retrotransposon from zebrafish |
|||
Journal: Repbase Reports 4:(11) p.291 (2004) |
|||
Abstract: Gypsy7-I_DR is a consensus sequence of the internal portion of Gypsy7_DR LTR retrotransposons. Its long terminal repeat is deposited in Repbase as Gypsy7-LTR_DR. The internal portion encodes the 1658-aa Gypsy7_DRp polyprotein (pos. 476-5449) composed of gag, protease, reverse transcriptase, and integrase domains. Given that some Gypsy7_DR genomic copies are flanked by 100% identical LTRs, it is possible that Gypsy7_DR elements are still transpositionally active. The gag domain is similar to the Arc protein important for long-term spatial memory in vertebrates (mammals, birds). Presumably, Arc was derived some 300-400 million years ago from a gypsy-encoded gag protein. Gypsy7-I_DRp: MANVNPSPSTSVDIDPPDIATPVWPPVQQRQFSSPSNIPTYHSTPTQLDPYGRTQVHFHTTTPG VTSTVQPDPMQLCTSASTVESPPSTATQHALPGYLPTPGREIHQLTAHVQGNWDRVFDCLKRQD KAVKELTEKSSKSFSLHEAKLAKMESTHQQLLNTLTAQRKDDTETADQLTKAVKVMVTQEIQRS ESTLISEIRFMVEQAQLELQKDIQATKEHSDKNFERLSSDLNHCSTEINAIKNQLDNLQTEISD VIPPIKQVSDPPSSAPVSVSTQSSSSVTAPMPFQTPVIKSDHLKLTFPTFGRPSDDADPLLYVT RCKDFLALHPLDDPDILATFRTVLYGTARDWWEVARSAISTWSEFETAFLSAFLSEDYEDELAE RVRTRTQAEKESIRDFAFTYRAMCKRWKPTLTESELVKMILKNIKPHLASQLRSRVHTVDELVK LGLQLEKDYVQQLHYVEHVTQPSPQRIAPNRVEKPPVLCWRCKGLHPPGSCPHYSSSVQTTQSS SHPPPTGNKRYFQTQKHGGNPSNNAMSVTLPSKSLPKSTVTKSVVIPQQLIVPIYIGAWRGKAI LDTGASYTLLHESLWKEIDPQASLHPWTLGPLYLANGEAEVPLGWTNFEIILHDKVFPTQAAIL TPKALAYSVVLGLDFIYSSGLQINVVDQTYSFKSNPNEEYPFQPGHASVPVGRSQHLNKNAQTQ HSSKTLSLLSSIPPPLPFPVVSQLAPSSDDQALIEMAVAEAHLPLESKPQLLHLLQSNPKVCTL QLGRTTVLQHCIYTTHPVPVKQRPYRLTPGKQAIVEEQIEEMLKAGVIEQSCSPWASPVVLVPK KDNSLRFCVDYRKLNAMTESDAYPIPNITEILESLSGASTFSSLDLNCGFWQVPMDDKSKLMTA FITSRGLYHFNVMPFGLKNAPATFQRLMEIVLRDLLGKICYVYIDDIVIYSPTLTQHLHDIQTI LERLEKAGLTLNLKKCSFCLPEITFLGHVVSHQGVAADPKKVEVIHAYPVPQNLKDVQRFLGLA GWYHRFVPNFSRIAEPLNNLKKKGRQFKWDSLCQQAFDNLKFCLTTPPILGHPDLNIPFTVYTD ASDSGLGAVLTQRKEQGGEEVIAYASRTLTKAEVNYSTTEKECLAVVWALDKWQHYLEPRMFTV VTDHSALQWVMNSTKPASRLMRWALRLQRYDFVIEYRKGRLNVAPDALSRMYSMPGCNLYTTEK DLPDFPVTPQTIWEEQHQDTDIMKIFQALAKNEQQEQAQYTVLEDKLYHITHLADETVHYKVVI PSTLRPTVLEWYHDTPLSGHLGIYKTYKRIQDVAYWPGMWTDIKKYVKNCAKCQVTKWDNRKPA GKLQQVTTSRPNEMWGVDIMGPMPKSGKQNEYLLVFVDYFSKWVELFPMRHATAQTIATILRQE MLTRWGVPDFILSDRGAQFVSSLFTELCGKWNITPKLTTAYHPQTNMTERVNRTLKSMIAGFVE DNHKTWDTYLPELRFALNSAIQESIGMTPAELHLGRKIHSPMDKLLHRRDLSPTKPAYDMVHKI TQLQRQAKENYTKAQKRQLRSYDKNRRDVFFRERERVWVRNFPISSAQHHFSAKLAPKWKGPYR IIQQLGPVNYQVSLEDTGEDVRNVHVCNLKPCFPTAEELEAREKNCTKILPQQDQKRF
|
|||
Derived: [1] (consensus) |
|||
Download Sequence - Format: IG, EMBL, FASTA |
|||
References: |