| | | | | |
homo.
MLSVAARSGPFAPVLSATSRGVAGALRP------LVQATVPATPEQPVLDLKRPFLSRES
rattus.
------------------SRGVAGALRP------LLQSAVPATSEPPVLDVKRPFLCRES
bos.
MLSVAARSRHSRP----SYRPRPAGWRA------LRPWYSRSSRESPVLDLKRSVLCRES
Gallus MLSVAARSGPFAPYLSAAAHAVPGPLKA------LAPAALRA--EKVVLDLKRPLLCRES
Sacch.cere.
-------------------------------------------------------MLGIR
Ncrassa.
---------------------------------------------------MAPVSIVSR
Nicot.
-------WPVRSAAPSSSAFISANHFSS------DDDSSSPRS-ISPSLASVFLHHTRGF
Solan.
MLRVAGRRLSSSAARSSSTFFTRSSFTV------TDDSSPARS-PSPSLTSSFLDQIRGF
Zea.
MLRVAGRRLSSSLSWRPAAAVARGPLAGAGVPDRDDDSARGRSQPRFSIDSPFFVASRGF
Trypan.
----MFRRSCISAFQPTAFLRVSLVFKQ-------LEGSNPLTVKDRPVNSWSDEFLKPP
Rbcasulatus
0 10
20
30
| |
|
|
homo. LSGQAVRRPLVASVGLNVPASVCYSHTDIKVPDFSEYRRLEVLDST------KSSRESSEARK
rattus. LSGQAATRPLVATVGLNVPASVRYSHTDIKVPDFSDYRRAEVLDST------KSSKESSEARK
bos. LRGQAAA-ALVASVSLNVPASVRYSHTDIKVPDFSDYRRPEVLDST------KSSKESSEARK
Gallus MSGRSARRDLVAGISLNAPASVRYVHNDVTVPDFSAYRREDVMDAT------TSSQTSSEDRK
Sacch.cere. SSVKTCFKPMSLTSKRLISQSLLASKSTYRTPNFDDVLKEN---------------NDADKGR
Ncrassa. AAMRAAAAPARAVRALTTSTALQGSSSSTFESPFKGESKAAKVPDFG----KYMSKAPPSTNM
Nicot. SSN-SVSPAHDMGLVPDLPPTVAAIKNPTSKIVYDEHNHERYPP------------GDPSKRA
Solan. SSN-SVSPAHQLGLVSDLPATVAAIKNPSSKIVYDDSNHERYPP------------GDPSKRA
Zea. SSTETVVPRNQDAGLADLPATVAAVKNPNPKVVYDEYNHERYPP------------GDPSKRA
Trypan. VSKEMADKYGRYAKYSDPTLCSVDTSSEVVLNTYPEGSREGRIEATAGVALKDYDASMWDEEF
Rbcasul
MSHAEDNAGTRR
. . :
:
40
50 60
70
|
| |
|
homo. GFSYLVTGVT-------TVGVAYAAKNAVTQFVSSM------------SASADVLAL
rattus. GFSYLVTATT-------TVGVAYAAKNAVSQFVSSM------------SASADVLAM
bos. GFSYLVTATT-------TVGVAYAAKNVVSQFVSSM------------SASADVLAM
Gallus GFSYLVTATA-------CVATAYAAKNVVTQFISSL------------SASADVLAL
Sacch.cere. SYAYFMVGAM-------GLLSSAGAKSTVETFISSM------------TATADVLAM
Ncrassa. LFSYFMVGTM-------GAITAAGAKSTIQEFLKNM------------SASADVLAM
Nicot. -FAYFVLTGG-------RFVYASLMRLLILKFVLSM------------SASKDVLAL
Solan. -FAYFVLTGG-------RFVYASSVRLLILKFVLSM------------SASKDVLAL
Zea. -FAYFVLSGG-------RFIYASLLRLLVLKFVLSM------------SASKDVLAL
Trypan. FRKYILKPKLPNELEDRARVTDYALNSALLGFVILMVRYAVLPLWYVGQPAMSMVGQ
(RNA editing?)
Rbcasul DFLYHATAAT-------GVVVTGAA--VWP-LINQM------------NASADVKAM
Rb.spha. qmnpsadvq-a lasifvdvss vepgvqltvk : : . : *: : .: .::. SynechococcusPCC7002 -MTQLSGSSDVPDLGRRQFLNLLWVGTAAGTALGGLYPVIKYFIPPSSG-GAGGGVIAKD Phormidium-l ----ASGTPDVPDLGARQFMNLLTFGAATGTVLGMLYPVVRYFIPPSSG-GAGGGVTAKD Synechococcuselongatus -MAQVSGMSDVPDMGRRQFMNLLTFGTITGTALGALYPVVKYFIPPASG-GTGGGAVAKD Nostoc -MAQFSESADVPDMGRRQFMNLLTFGTVTGVALGALYPVVKYFIPPASG-GAGGGTTAKD Anabaena --MDNSIPIESPSLSRRQLLNFITGATVAVTAGAALYPAGKFLIAPAEKTGAGGAILAKD Chlorobium MAQTGNFKSPARMSSLGQGAAPASSGAVTGGKPREGGLKGVDFERRGFLHKIVGGVGAVV . . * .: : : . *. *80 90 100 110 120 130
.:* .: : . *:***:*: :*: ::. . :. *:.*: * *. .SynechococcusPCC7002 ALGNDIIVSDYLQTHTAGDRSLAQGLKGDPTYV--------------VVEGDNTISSYGI
140 150
160 170
180 190
300
*|* |
* |* |
| |
homo.
EWVILIGVCTHLGCVPIAN
AGDFGGYYCPCHGSHYDASGRIRLGPAPLNLEVPTYEFTSDDMVIVG--
rattus. EWVILIGVCTHLGCVPIAN
AGDFGGYYCPCHGSHYDASGRIRKGPAPLNLEVPTYEFTSGDVVVVG--
bos.
EWVILIGVCTHLGCVPIAN
AGDFGGYYCPCHGSHYDASGRIRKGPAPLNLEVPSYEFTSDDMVIVG--
Gallus EWVILVGVCTHLGCVPIAN
SGDFGGYYCPCHGSHYDASGRIRKGPAPYNLEVPTYQFVGDDLVVVG--
Sacch.cere. QWLIMLGICTHLGCVPIGE
AGDFGGWFCPCHGSHYDISGRIRKGPAPLNLEIPAYEFDG-DKVIVG---
Ncrassa. EWLVMLGVCTHLGCVPIGE
AGDYGGWFCPCHGSHYDISGRIRKGPAPLNLEIPLYEFPEEGKLVIG--
Nicot. EWLVVIGVCTHLGCIPLPN
AGDFGGWFCPCHGSHYDISGRIRKGPAPYNLEVPTYSFLEENKLLIG--
Solan. EWLVVVGVCTHLGCIPLPN
AGDFGGWFCPCHGSHYDISGRIRKGPAPYNLEVPTYSFLEENKLLIG--
Zea.
EWLVVIGVCTHLGCIPLPN
AGDFGGWFCPCHGSHYDISGRIRKGPAPFNLEVPTYSFLEENKLLVG--
Trypan. EMAVVIAICTHLGCIPIPN
EGLFGGFFCPCHGSHYDASGRIRQGPAPLNLEVPPYRWVDDKTIYLGKL
Rbcasul EWLVMLGVCTHLGCVPMGDKSGDFGGWFCPCHGSHYDSAGRIRKGPAPRNLDIPVAAFVDETTIKLG
Rb.spha. ewlvmwgvCTHLGCvpiggvsgdfggwfCPCHGShYDsagrirk-GPAPenl
: :::.:******:*: : * :**::********* ***** **** ***:* * : SynechococcusPCC7002 NAICTHLGCVVPWNT AENKFMCPCHGSQYDETGKVVRGPAPLSLALVHAEVTEDDKISFT Phormidium-l NAVCTHLGCVVPWNA SENKFICPCHGSQYDATGKVVRGPAPLSLALVHTDVTEDGKIAMT Synechococcuselongatus NAVCTHLGCVVPWNV SENKFICPCHGSQYDSTGKVVRGPAPLSLALVKATVTEDDKLVFT Nostoc NAICTHLGCVVPWNV AENKFKCPCHGSQYDETGKVVRGPAPLSLALAHAN-TVDDKIILS Anabaena VDNCTHLGCTFPWNP LDQQFQCPCHGSRYAPDGSVVRGPAPLPLKIVQVA-VIDNSILIS Chlorobium SAVCTHLGCLVNWVD ADNQYFCPCHGAKYKLTGEIISGPQPLPLKQYKARIEGDSIIISK ****** . * :::: *****::* *.:: ** **.* :. *. : .
Rieske ironsulfur sequences
@ g488299 Homo sapiens 274 aa Ref a P20788 rat Rattus norvegicus 256 aa Ref b P13272 beef Bos taurus 269 aa Ref pig Sus scrofa 85 aa Ref c A41607 corn Zea mays 273 aa Ref d B41607 tobacco Nicotiana tabacum 258 aa Ref e P37841 Potato Solanum tuberosum 265 aa Ref f TBU28866 cds1 Trypanosoma brucei 297 aa Ref g CFU28865 cds1 Crithidia fasciculata 30 aa fragment Ref h P08067 Sacch. cerevisiae 215 aa Ref i P07056 Neurospora crassa 231 aa Ref j A32382 Bradyrhizobium japonicum 176 aa Ref k P05417 Paracoccus denitrificans 190 aa Ref l Q02762 Rhodobacter sphaeroides 187 aa Ref m P08500 Rhodobacter capsulatus 191 aa Ref o JQ0345 Rhodopseudomonas viridis 179 aa Ref p P23136 Rhodospirillum rubrum 183 aa Ref q CLPET cds2 Chlorobium limicola 428 aa Ref r Sulfolobus acidocaldarius 250 aa Ref r Sulfolobus acidocaldarius 250 aa Ref s Bacillus subtilis 167 aa Ref t B. stearothermophillus 169 aa Ref u Chromatium vinosum 207 aa Ref Rieske: 1 mlasaggywp msaqgvnkmr rrvlvaatsv vgavgagyal vpfvasmnps araraagapv // 1632948 Presequence(not aligned): *** **** ** * **** *** *** * ******* * @ mlsvaarsgpfapvlsatsrgvagalrplvqatvpatpeqpvldlkrpflsreslsgqavrrplvasvglnvpasvcy a mlsvaarsgpfapvlsatsrgvagalrpllqsavpatseppvldvkrpflcreslsgqaatrplvatvglnvpasvry b mlsvaars-rhsrpsyrprpagwralrpwy--srs-srespvldlkrsvlcreslrgqaaa-alvasvslnvpasvry c mlrvagrrlssslswrpaaavargplagagvpdrdddsargrsqprfsidspffvasrgfsstetvvprnqdaglad d wpvrsaapsssafisanhfssdddsssprsispslasvflhhtrgfssnsvspahdmglvpd e mlrvagrrlsssaarssstfftrssftvtddssparspspsltssfldqirgfssnsvspahqlglvsd f mfrrscisafqptaflrvslvfkqlegsnpltvkdrpvnswsdeflkppvskemadkygryakysdptlcsvdtssevvlntypegsregrieatagvalkdydasmwd g mfrrtfttafqat h mlgirssvktcfkpmsltsk i mapvsivsraamraaaaparavralttstalqgsssstfespfkg j k l m o p Start of mature sequence: 10 20 30 40 50 60 @ 1 ----------shtdikvpdf seyrrlevld stkssresse arkgfsylvt gvttvgvaya aknavtqfvs 60 a 1 ----------shtdikvpdf sdyrraevld stksskesse arkgfsylvt atttvgvaya aknavsqfvs b 1 ----------shtdikvpdf sdyrrpevld stksskesse arkgfsylvt atttvgvaya aknvvsqfvs c 1 ----------lpatvaavkn pnpkvvydey nheryppgdp skrafayfvl sggrfiyasl lrllvlkfvl d 1 ----------lpptvaaikn ptskivydeh nheryppgdp skrafayfvl tggrfvyasl mrllilkfvl e 1 ----------lpatvaaikn psskivydds nheryppgdp skrafayfvl tggrfvyass vrllilkfvl u 1 mlasaggywp msaq gvnkmrrrvlvaatsvvgavg agyalvpfva f 1 ----------eeffrkyilk pklpneledr arvtdyalns allgfvilmv ryavlplwyv gqp------- g 1 ----------raarvsllvk qlegttp h 1 ----------rlisqsllas kstyrtpnfd dvlkenndad kgrsyayfmv gamgllssag akstvetfis i 1 ----------eskaakvpdf gky------- -mskappstn ml--fsyfmv gtmgaitaag akstiqeflk j 1 mtt----assadhp-trrdf lfv------- ---------- ---------- -atgaaaavg gaaalwpfis k 1 mshadeha--gdhgatrrdf lyy------- ---------- ---------- -atagagtva agaaawtlvn l 1 msnaedha------gtrrdf lyy------- ---------- ---------- -atagagava tgaavwplin m 1 mshaedna------gtrrdf lyh------- ---------- ---------- -ataatgvvv tgaavwplin 36 o 1 massdta------eatrrdf lyv------- ---------- ---------- -ataavgaag vaavawpfit p 1 maeaehtastpggessrrdf liy------- ---------- ---------- -gttavgavg valavwpfid u 1 mlasaggywp msaqgvnkmr rrvlvaatsv vgavgagyal vpfva 70 80 90 100 110 120 @ 61 smsasadvl-a lakieiklsd ipegknmafk wrgkplfvrh rtqkeieqea avelsqlrd------------p 120 a smsasadvl-a mskieiklsd ipegknmafk wrgkplfvrh rtkkeidqea avevsqlrd------------p b smsasadvl-a mskieiklsd ipegknmafk wrgkplfvrh rtkkeidqea avevsqlrd------------p vevsqlrd------------p c smsaskdvl-a laslevdlss iepgttvtvk wrgkpvfirr rteddiklan svdvaslrh------------p d smsaskdvl-a laslevdlss iepgttvtvk wrgkpvfirr rteddislan svdlgslrd------------p u 61 smnpsararaa gapveadisk lepgallrvk wrgmpvwvvh rssemlaals sndp klvdptsevpq e smsaskdvl-a laslevdlss iepgstvtvk wrgkpvfirr rtdddiklan svdlgtlrd------------p f amsm-vgqmni ea--evgele dreck--tvv wrgkpvfvyr rserqmndvl gtplsalkh------------p g ********************************* ********** ****** h smtatadvl-a makvevnlaa iplgknvvvk wqgkpvfirh rtpheiqean svdmsalkd------------p i nmsasadvl-a makvevdlna ipegknviik wrgkpvfirh rtpaeieean kvnvatlrd------------p j qmnpdastiaa gapievdlsp iaegqdikvf wrgkpiyish rtkkqidear avnvaslpd------------p k 41 qmnpsadvq-a lasiqvdvsg vetgtqltvk wlgkpvfirr rtedeiqagr evdlgqlidrsaqnsnkpd-ap l 37 qmnpsadvq-a lasifvdvss vepgvqltvk flgkpifirr rteadielgr svqlgqlvdtnarnanidagae m 37 qmnasadvk-a masifvdvsa vevgtqltvk wrgkpvfirr rdekdielar svplgalrdtsaenankp-gae 107 o qmnpdaatiaa gapididisp vtegqivrvf wrgkpifirh rtakeiqsee aadvgalid------------p p fmnp-aadtla lastevdvsa iaegqaitvt wrgkpvfvrh rtqkeivvar avdpaslrd------------p u 61 eadisk lepgallrvk wrgmpvwvvh rssemlaals sndpklvdptsevpq 130 140 150 160 170 180 | | | | @ 121 qhdldrvkkp -------ewviligvct HLGCvpia---------na gdfggyyCpc HGShYDasgr irl-gpaplnl 180 a qhdlervkkp -------ewviligvCT HLGCvpia---------na gdfggyyCPC HGShYDasgr irk-GPAPlnl b qhdlervkkp -------ewviligvCT HLGCvpia---------na gdfggyyCPC HGShYDasgr irk-gpaplnl qhdlervkkp -------ewviligvct hlgcvpia---------na gdfggyycpc hgshydasgr irk-gpaplnl c eqdaervknp -------ewlvvigvCT HLGCiplp---------na gdfggwfCPC HGShydisgr irk-gpapfnl d qqdaervknp -------ewlvvigvCT HLGCiplp---------na gdfggwfCPC HGShYDisgr irk-GPAPynl u qpd--ycknp trsikpeylvaigiCT HLGCsptyrpefgpddlgs gwkggfhcpc hgsrfdlaar vfknvpaptnl e qqdaervknp -------ewlvvvgvCT HLGCiplp---------na gdfggwfCPC HGShYDisgr irk-gpapynl f etdearfpdh ------remavviaiCT HLGCipip---------ne glfggffCPC HGShYDasgr irq-GPAPlnl g ********** ***************** *************** ******* h qtdadrvkdp -------qwlimlgiCT HLGCvpig---------ea gdfggwfCPC Hgshydisgr irk-gpaplnl i etdadrvkkp -------ewlvmlgvCT HLGCvpig---------ea gdyggwfCPC HGShYDisgr irk-GPAPlnl j qsdearvksg -----heqwlvvigiCT HLGCipia---------he gnydgffCPC HGSqYDssgr irq-GPAPanl k atdenrtmde -----agewlvmigvCT HLGCvpig--------dga gdfggwfCPC HGShYDtsgr irr-GPAPqnl l atdqnrtlde -----agewlvmwgvCT HLGCvpig--------gvs gdfggwfCPC HGShYDsagr irk-GPAPenl m 108 atdenrtlpa fdgtntgewlvmlgvCT HLGCvpmg--------dks gdfggwfCPC HGShYDsagr irk-GPAPrnl 175 o qpdsarvkpg -----kaewlvvyasCT HLGCiplg---------hq gdwggwfCPC HGSqYDasgr vrk-GPAPtnl p qtdearvqqa -------qwlvmvgvCT HLGCiplg----qkagdpk gdfdgwfCPC HGShYDsagr irk-GPAPlnl r yi hfyppnyvnsgqltase pdqltaaa----llaarqa nvpalihcdc hgstydpyhg asvltg s cpc hyglyekdgtnvpgtpplapldh t ffcpc hyglytkdgtnvpgtpptapldr 190 @ 181 evptyeftsd dmvivg 196 a evptyeftsg dvvvvg b evpsyeftsd dmvivg evptyeftsd dlvivg c evptysflee nkllvg d evptysflee nkllig e evptysflee nkllig f evppyrwvdd ktiylgkl h eipayefdgd -kvivg i eiplyefpee gklvig j pvppyqfvsd tkiqig k hipvaefldd ttiklg l piplakfide ttiqlg m 176 dipvaafvde ttiklg o pvppyefvdn tkirigagva p pvppyaftdd ttvlig u 181 vipkhvylnd ttiligedrgsa s yeqe vkdgflylgk akpkgeg t ye fevkdgklyl gkakprgea Detailed alignment of ligand region between rhodobacter and vertebrates: 130 140 150 160 170 180 | *| * | * | * | | b 121 qhdlervkkp -------ewviligvCT HLGCvpia-na gdfggyyCPC HGShYDasgr irkgpaplnl 180 m 108 atdenrtlpa fdgtntgewlvmlgvCT HLGCvpmgdks gdfggwfCPC HGShYDsagr irkGPAPrnl 175 \______S-S_______/ | | | | | | | 110 120 130 140 150 160 170