Repbase Reports |
---|
2003, Volume 3, Issue 3 |
March 31, 2003 |
Copyright © 2001-2016 - Genetic Information Research Institute |
ISSN# 1534-830X |
Page 35 |
BEL14-I_AG |
|||
---|---|---|---|
BEL14-I_AG is an internal portion of the BEL14_AG LTR retrotransposon - a consensus sequence. |
|||
Submitted: 31-Mar-2003 |
Accepted: 31-Mar-2003 |
||
Key Words: LTR retrotransposon; Bel clade; 5-bp TSD; PHD domain; protease; reverse transcriptase; integrase; BEL14_AG; BEL14-LTR_AG; BEL14-I_AG |
|||
Source: consensus |
Organism: Anopheles gambiae str. PEST |
Taxonomy: Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles |
|
[1] |
Authors: Kapitonov,V.V., Pavlicek,A. and Jurka,J. |
||
Title: BEL14_AG, a family of Bel/Pao-like LTR retrotransposons from African malaria mosquito. |
|||
Journal: Repbase Reports 3:(3) p. 35 (2003) |
|||
Abstract: BEL14_AG is a young family of Bel/Pao-like LTR retrotransposons. BEL14-I_AG, an internal portion of BEL1_AG is flanked by BEL14-LTR_AG LTRs. The BEL14-I_AG consensus sequence was reconstructed based on multiple alignment of 16 copies; they are ~1.5% divergent from the consensus sequence. The consensus sequence encodes a 1898-aa BEL14_AGp Bel-like protein (pos. 57-5750). BEL14_AGp is composed of the PHD (pos. 9-55), protease (pos. 277-400), reverse transcriptase (pos. 900-1060) and integrase (pos. 1600-1765) domains. BEL14_AGp: MGPKKARGCKACGNQVDDTLYVQCDECDAWWHFSCAGITASVEAVEKCAWLCEECARKTLREQSSPREGN KEPKEGTSKHVDGDLVRNLSLEAATDGGARPVTNPQRRPLLSLDEANDEIAPGTSTHVAGGPVHNLNQDA ATEGGVRPVMTPKRRPLSSLDEADRGKTSVSSNIVHRGSCPNLNLDAASDDVARQLAVLKRRQEVEKRRM ELELQLKFVQEEEALLGFGENKSFSISPQLNSFQTEKRTVKRSEEEKEEPDLTPRQEAARHMVSKELPVF SGDPAEWPIFISHYEYTTRRCGYSNWENMLRLQKCLKGPALEAVRSRLVLPDVVPQVIEKLRSKYGRPVH LIKTFIEKVRKIPAPQTDKLDSLVEYGEAVQCMVDHMVAAGERAHITNPLLLQEVVGKLPTDQQLRWSHH IRGMTSVDLSTFSDYMEDLAEDAARLTTIDSPSVRGTSKGRPTKGYVHAHVDPDGATTSSAAERQCVSCN VAGHVLSTCTNFRGLPVKDRWRRARELSVCFSCLEKHNWRSCKNRSRCGINDCAFRHHALLHDPDAIESP STADRERRHFPRTSGSQTHQVINNYHQSNPMSALFRIVPVTAYGPGVMIKTFAFLDEGSSMTLMDEDLAK QLGVKGDRRPLCIKWTGDTTRVEPASMMIDLQIGPVTSTKRFTLKAVRTVTSLSLPQQTFTMDDKRWDHL KQLPLPEYRDARPQLLIGLDNLRLAVPLKTREGLAGEPVAVKTRLGWCVYGKTAGSQIGRVLHMCECGAS DENSTIQGALRKFYELEQLGTVSSDVPDPDERRALTILETTTVRIGNRFESGLLWKTDNVELPSSLGMAR RRLECLERRMERDPKLKTVVHHHIADMMEKGYIHKATSAELAECNSKRIWYLPLGVVTNPKKPGKVRIIW DAAAKVQGTSLNDMLLKGPDELISLPGVLFRFRMYGIAVCADVKEMFLQIRMRDEDKHAQRFLWREDPAD DIATYFVDVVTFGSACSPATAQYVKNRNAKEHAEKYPRAVRGILTSTYVDDYLDSFGTFEEASRVSREVR GIFSNGGFVLRNWVSNNPVVLERLGGESSSPGMKSLTSTADDGERVLGLRWNPSSDQLSFYTQACVGMAE IFETECTPTKREVLKCVMSLFDPLGLLANFTIHGRILIQDLWRAGTGWDEAISPSQMRDWRRWVDVFPLI AQLRIPRCYFPEAREKVYENAELHLFVDASQLAYACVLYLRVVDSEGEPHCTMLCGKAKVAPLKPLTIPK MELQACLLGARLLKSTEQHHPISVKKRVLWTDSTVALSWIHADPRNYRPFVANRVAEIQENTNVNEWRWV PTQDNPADEATKWKGRANFNWDGIWFQGPSFLLQDEESWPTRRLVSTTPEEEIRRVNLHREKLNPGLLPL KAERFSRLERMIRTLAWIVRYVDNLMRKVGGAPLHLGILSQDELERAETIAWKQAQGEYFQDEVRVLSVG EGTGRSTVPKESPIYGLLPYADERGVLRMRGRIGAAPELPYAARYPIVLPRDAWITHLLVDKFHRRFRHA NNETVVNELRQYFQIPKMRRLVSKVVRQCVFCHIRRTLPQIPPMAPLPKQRLTAFVRPFTFVGLDYFGPL LVRRGRAQEKRWVALFTCLTIRAIHLEVVSSLSTDSCILAVRRFVARRGAPVEVFSDNGTNFVGASQQLR KEIDERNDALAATFTNANTRWTFNPPGAPHMGGVWERMVRSVKAAMSTMTELQRTPDDETLLTVIVEAEG MINTRPLTYIPLESADQESLTPNHFLLGSSSGVKQRPVAPTSLQTGLRSNWKMVQHILDGFWRRWIKEYL PVLARQSKWFETVREIEVGDIVLIVDGGARNQWKRGIVERVVSGADGRIRQAWVRTNTGTLRRPAAKLAL LEIRKGDK
|
|||
Derived: [1] (Consensus) |
|||
Download Sequence - Format: IG, EMBL, FASTA |
|||
References: |