Rieske iron-sulfur sequences

                     10        20        30        40        50        60

                     |         |         |         |         |         |

homo.       MLSVAARSGPFAPVLSATSRGVAGALRP------LVQATVPATPEQPVLDLKRPFLSRES
rattus.     ------------------SRGVAGALRP------LLQSAVPATSEPPVLDVKRPFLCRES
bos.        MLSVAARSRHSRP----SYRPRPAGWRA------LRPWYSRSSRESPVLDLKRSVLCRES
Gallus      MLSVAARSGPFAPYLSAAAHAVPGPLKA------LAPAALRA--EKVVLDLKRPLLCRES
Sacch.cere. -------------------------------------------------------MLGIR
Ncrassa.    ---------------------------------------------------MAPVSIVSR
Nicot.      -------WPVRSAAPSSSAFISANHFSS------DDDSSSPRS-ISPSLASVFLHHTRGF
Solan.      MLRVAGRRLSSSAARSSSTFFTRSSFTV------TDDSSPARS-PSPSLTSSFLDQIRGF
Zea.        MLRVAGRRLSSSLSWRPAAAVARGPLAGAGVPDRDDDSARGRSQPRFSIDSPFFVASRGF
Trypan.     ----MFRRSCISAFQPTAFLRVSLVFKQ-------LEGSNPLTVKDRPVNSWSDEFLKPP
Rbcasulatus
                                   0        10        20              30
                                   |         |         |               |
homo.       LSGQAVRRPLVASVGLNVPASVCYSHTDIKVPDFSEYRRLEVLDST------KSSRESSEARK
rattus.     LSGQAATRPLVATVGLNVPASVRYSHTDIKVPDFSDYRRAEVLDST------KSSKESSEARK
bos.        LRGQAAA-ALVASVSLNVPASVRYSHTDIKVPDFSDYRRPEVLDST------KSSKESSEARK
Gallus      MSGRSARRDLVAGISLNAPASVRYVHNDVTVPDFSAYRREDVMDAT------TSSQTSSEDRK
Sacch.cere. SSVKTCFKPMSLTSKRLISQSLLASKSTYRTPNFDDVLKEN---------------NDADKGR
Ncrassa.    AAMRAAAAPARAVRALTTSTALQGSSSSTFESPFKGESKAAKVPDFG----KYMSKAPPSTNM
Nicot.      SSN-SVSPAHDMGLVPDLPPTVAAIKNPTSKIVYDEHNHERYPP------------GDPSKRA
Solan.      SSN-SVSPAHQLGLVSDLPATVAAIKNPSSKIVYDDSNHERYPP------------GDPSKRA
Zea.        SSTETVVPRNQDAGLADLPATVAAVKNPNPKVVYDEYNHERYPP------------GDPSKRA
Trypan.     VSKEMADKYGRYAKYSDPTLCSVDTSSEVVLNTYPEGSREGRIEATAGVALKDYDASMWDEEF
Rbcasul                                                        MSHAEDNAGTRR
                              .       .      :    :

                  40               50         60                    70
                   |                |         |                     |
homo.        GFSYLVTGVT-------TVGVAYAAKNAVTQFVSSM------------SASADVLAL
rattus.      GFSYLVTATT-------TVGVAYAAKNAVSQFVSSM------------SASADVLAM
bos.         GFSYLVTATT-------TVGVAYAAKNVVSQFVSSM------------SASADVLAM
Gallus       GFSYLVTATA-------CVATAYAAKNVVTQFISSL------------SASADVLAL
Sacch.cere.  SYAYFMVGAM-------GLLSSAGAKSTVETFISSM------------TATADVLAM
Ncrassa.     LFSYFMVGTM-------GAITAAGAKSTIQEFLKNM------------SASADVLAM
Nicot.       -FAYFVLTGG-------RFVYASLMRLLILKFVLSM------------SASKDVLAL
Solan.       -FAYFVLTGG-------RFVYASSVRLLILKFVLSM------------SASKDVLAL
Zea.         -FAYFVLSGG-------RFIYASLLRLLVLKFVLSM------------SASKDVLAL
Trypan.      FRKYILKPKLPNELEDRARVTDYALNSALLGFVILMVRYAVLPLWYVGQPAMSMVGQ (RNA editing?)
Rbcasul      DFLYHATAAT-------GVVVTGAA--VWP-LINQM------------NASADVKAM

Rb.spha.    qmnpsadvq-a lasifvdvss vepgvqltvk 
               :   :                    .  :  *:  :             .: .::. 

SynechococcusPCC7002   -MTQLSGSSDVPDLGRRQFLNLLWVGTAAGTALGGLYPVIKYFIPPSSG-GAGGGVIAKD
Phormidium-l           ----ASGTPDVPDLGARQFMNLLTFGAATGTVLGMLYPVVRYFIPPSSG-GAGGGVTAKD
Synechococcuselongatus -MAQVSGMSDVPDMGRRQFMNLLTFGTITGTALGALYPVVKYFIPPASG-GTGGGAVAKD
Nostoc                 -MAQFSESADVPDMGRRQFMNLLTFGTVTGVALGALYPVVKYFIPPASG-GAGGGTTAKD
Anabaena               --MDNSIPIESPSLSRRQLLNFITGATVAVTAGAALYPAGKFLIAPAEKTGAGGAILAKD
Chlorobium             MAQTGNFKSPARMSSLGQGAAPASSGAVTGGKPREGGLKGVDFERRGFLHKIVGGVGAVV
                            .        .  *       .: :             :   .      *.  *
                   80        90       100       110       120                         130
                    |         |         |         |         |                           |
homo.       AKIEIKLSDIPEGKNMAFKWRGKPLFVRHRTQKEIEQEAAVELSQLRDP------------QHDLDRV------KKP-
rattus.     SKIEIKLSDIPEGKNMAFKWRGKPLFVRHRTKKEIDQEAAVEVSQLRDP------------QHDLERV------KKP-
bos.        SKIEIKLSDIPEGKNMAFKWRGKPLFVRHRTKKEIDQEAAVEVSQLRDP------------QHDLERV------KKP-
Gallus      SKIEIKLSDIPEGKNVAFKWRGKPLFVRHRTQAEINQEAEVDVSKLRDP------------QHDLDRV------KKP-
Sacch.cere. AKVEVNLAAIPLGKNVVVKWQGKPVFIRHRTPHEIQEANSVDMSALKDP------------QTDADRV------KDP-
Ncrassa.    AKVEVDLNAIPEGKNVIIKWRGKPVFIRHRTPAEIEEANKVNVATLRDP------------ETDADRV------KKP-
Nicot.      ASLEVDLSSIEPGTTVTVKWRGKPVFIRRRTEDDISLANSVDLGSLRDP------------QQDAERV------KNP-
Solan.      ASLEVDLSSIEPGSTVTVKWRGKPVFIRRRTDDDIKLANSVDLGTLRDP------------QQDAERV------KNP-
Zea.        ASLEVDLSSIEPGTTVTVKWRGKPVFIRRRTEDDIKLANSVDVASLRHP------------EQDAERV------KNP-
Trypan.     MNIEAEVGELEDRECKTVVWRGKPVFVYRRSERQMNDVLGTPLSALKHP------------ETDEARF------PDHR
Rbcasul     ASIFVDVSAVEVGTQLTVKWRGKPVFIRRRDEKDIELARSVPLGALRDTSAENAN-KPGAEATDENRTLPAFDGTNTG
Rb.spha.                       flgkpifirrrteadielgrsvqlgqlvdtnarnanidagaeatdqnrtldeag
l
.:* .:  :       . *:***:*: :*:  ::.    . :. *:.*: *  *. .
SynechococcusPCC7002   ALGNDIIVSDYLQTHTAGDRSLAQGLKGDPTYV--------------VVEGDNTISSYGI
Phormidium-l           ALGNDVVLKDFLATHTPGERVLAEGLKGDPTYL--------------VVEDAGVDRSYGI
Synechococcuselongatus ALGNDIKVSEYLAKHLPGDRSLAQGIKGDPTYV--------------IVTEDHQIANYGL
Nostoc                 ELGNDVSLSKFLENRNAGDRALVQGLKGDPTYI--------------VVENKQAIKDYGI
Anabaena               ILGKQIPASQILA-EPPQTRALVAGLAGEPTYL--------------IVKEDHTLDRIGL
Chlorobium             AVSTLYPVVKYIIPPARKIKNVDELTVGKASEVPDGKSKIFQFNEDKVIVVNKGGALTAV
                        :..     . :       : :     *..: :              ::         .:
 
 

                       140       150       160       170       180       190       200
                        *|*        |       * |*        |         |         |
homo.           EWVILIGVCTHLGCVPIAN AGDFGGYYCPCHGSHYDASGRIRLGPAPLNLEVPTYEFTSDDMVIVG--
rattus.         EWVILIGVCTHLGCVPIAN AGDFGGYYCPCHGSHYDASGRIRKGPAPLNLEVPTYEFTSGDVVVVG--
bos.            EWVILIGVCTHLGCVPIAN AGDFGGYYCPCHGSHYDASGRIRKGPAPLNLEVPSYEFTSDDMVIVG--
Gallus          EWVILVGVCTHLGCVPIAN SGDFGGYYCPCHGSHYDASGRIRKGPAPYNLEVPTYQFVGDDLVVVG--
Sacch.cere.     QWLIMLGICTHLGCVPIGE AGDFGGWFCPCHGSHYDISGRIRKGPAPLNLEIPAYEFDG-DKVIVG---
Ncrassa.        EWLVMLGVCTHLGCVPIGE AGDYGGWFCPCHGSHYDISGRIRKGPAPLNLEIPLYEFPEEGKLVIG--
Nicot.          EWLVVIGVCTHLGCIPLPN AGDFGGWFCPCHGSHYDISGRIRKGPAPYNLEVPTYSFLEENKLLIG--
Solan.          EWLVVVGVCTHLGCIPLPN AGDFGGWFCPCHGSHYDISGRIRKGPAPYNLEVPTYSFLEENKLLIG--
Zea.            EWLVVIGVCTHLGCIPLPN AGDFGGWFCPCHGSHYDISGRIRKGPAPFNLEVPTYSFLEENKLLVG--
Trypan.         EMAVVIAICTHLGCIPIPN EGLFGGFFCPCHGSHYDASGRIRQGPAPLNLEVPPYRWVDDKTIYLGKL
Rbcasul         EWLVMLGVCTHLGCVPMGDKSGDFGGWFCPCHGSHYDSAGRIRKGPAPRNLDIPVAAFVDETTIKLG
Rb.spha.        ewlvmwgvCTHLGCvpiggvsgdfggwfCPCHGShYDsagrirk-GPAPenl

                  :  :::.:******:*: :  * :**::********* ***** **** ***:* * : 

SynechococcusPCC7002   NAICTHLGCVVPWNT  AENKFMCPCHGSQYDETGKVVRGPAPLSLALVHAEVTEDDKISFT
Phormidium-l           NAVCTHLGCVVPWNA  SENKFICPCHGSQYDATGKVVRGPAPLSLALVHTDVTEDGKIAMT
Synechococcuselongatus NAVCTHLGCVVPWNV  SENKFICPCHGSQYDSTGKVVRGPAPLSLALVKATVTEDDKLVFT
Nostoc                 NAICTHLGCVVPWNV  AENKFKCPCHGSQYDETGKVVRGPAPLSLALAHAN-TVDDKIILS
Anabaena               VDNCTHLGCTFPWNP  LDQQFQCPCHGSRYAPDGSVVRGPAPLPLKIVQVA-VIDNSILIS
Chlorobium             SAVCTHLGCLVNWVD  ADNQYFCPCHGAKYKLTGEIISGPQPLPLKQYKARIEGDSIIISK
                          ****** . *     :::: *****::*   *.:: ** **.*   :.    *. :  .
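
The two conserved cluster-binding boxes visible in both alignment blocks above (CTHLGC and CPCHGS in the bc1-type rows, CTHLGC and CPCHGS or CPCHGA in the cyanobacterial and Chlorobium rows) can be located programmatically. What follows is a minimal Python sketch and not part of the original data. The function names `ungap` and `find_rieske_boxes` and the two regular expressions are assumptions generalised from the rows shown here, not an established motif definition.

```python
import re

# Assumed patterns, generalised from the rows in this file:
# box 1 looks like CTHLGC, box 2 like CPCHG.
BOX1 = re.compile(r"C.H.GC")
BOX2 = re.compile(r"C.CHG")


def ungap(row):
    """Remove alignment gaps ('-') and spacing, and upper-case the residues."""
    return re.sub(r"[\s\-]", "", row).upper()


def find_rieske_boxes(row):
    """Return (box1_start, box2_start) as 0-based offsets in the ungapped row,
    or None if either box is missing."""
    seq = ungap(row)
    m1 = BOX1.search(seq)
    if m1 is None:
        return None
    # Look for box 2 only downstream of box 1.
    m2 = BOX2.search(seq, m1.end())
    if m2 is None:
        return None
    return m1.start(), m2.start()


# Rows copied from the alignment blocks above.
homo = "EWVILIGVCTHLGCVPIAN AGDFGGYYCPCHGSHYDASGRIRLGPAPLNLEVPTYEFTSDDMVIVG--"
chlorobium = "SAVCTHLGCLVNWVD  ADNQYFCPCHGAKYKLTGEIISGPQPLPLKQYKARIEGDSIIISK"
print(find_rieske_boxes(homo))        # (8, 27)
print(find_rieske_boxes(chlorobium))  # (3, 21)
```

The offsets are relative to the fragment passed in, not to the full-length protein, so they are mainly useful for comparing the spacing between the two boxes across rows.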


Rieske iron-sulfur sequences
@ g488299              Homo sapiens                274 aa   Ref

a P20788      rat      Rattus norvegicus           256 aa   Ref

b P13272      beef     Bos taurus                  269 aa   Ref

              pig      Sus scrofa                   85 aa   Ref

c A41607      corn     Zea mays                    273 aa   Ref

d B41607      tobacco  Nicotiana tabacum           258 aa   Ref

e P37841      potato   Solanum tuberosum           265 aa   Ref

f TBU28866 cds1        Trypanosoma brucei          297 aa   Ref

g CFU28865 cds1        Crithidia fasciculata        30 aa fragment Ref

h P08067               Sacch. cerevisiae           215 aa   Ref

i P07056               Neurospora crassa           231 aa   Ref

j A32382               Bradyrhizobium japonicum    176 aa   Ref

k P05417               Paracoccus denitrificans    190 aa   Ref

l Q02762               Rhodobacter sphaeroides     187 aa   Ref

m P08500               Rhodobacter capsulatus      191 aa   Ref

o JQ0345               Rhodopseudomonas viridis    179 aa   Ref

p P23136               Rhodospirillum rubrum       183 aa   Ref

q CLPET cds2           Chlorobium limicola         428 aa   Ref

r                      Sulfolobus acidocaldarius   250 aa   Ref


s                      Bacillus subtilis           167 aa   Ref

t                      B. stearothermophilus       169 aa   Ref

u                      Chromatium vinosum          207 aa   Ref
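
Because the alignments below refer to each sequence only by its one-character key, it can help to read the list above into a lookup table. The following is a minimal Python sketch under stated assumptions: the name `parse_key_line` and the regular expression are hypothetical, and the parser assumes that columns are separated by two or more spaces and that the species name is the last column before the length.

```python
import re

# Assumed layout: optional one-character key, free-form columns,
# then "<number> aa". Columns are taken to be separated by 2+ spaces.
KEY_LINE = re.compile(r"^(?P<key>\S)?\s+(?P<rest>.*?)\s+(?P<length>\d+)\s+aa\b")


def parse_key_line(line):
    """Parse one line of the key list into a dict, or return None
    if the line does not look like an entry."""
    m = KEY_LINE.match(line)
    if m is None:
        return None
    fields = re.split(r"\s{2,}", m.group("rest").strip())
    return {
        "key": m.group("key"),        # None when the line has no key (e.g. pig)
        "species": fields[-1],        # last column before the length
        "length": int(m.group("length")),
    }


entry = parse_key_line("a P20788      rat      Rattus norvegicus           256 aa   Ref")
print(entry)
```

Entries without a key character, such as the pig fragment, come back with `key` set to `None`.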



Rieske:      

        1 mlasaggywp msaqgvnkmr rrvlvaatsv vgavgagyal vpfvasmnps araraagapv

       

      

      

//

  1632948

Presequence (not aligned):   ***            **** **  * **** ***    *** * ******* *

 @  mlsvaarsgpfapvlsatsrgvagalrplvqatvpatpeqpvldlkrpflsreslsgqavrrplvasvglnvpasvcy

 a  mlsvaarsgpfapvlsatsrgvagalrpllqsavpatseppvldvkrpflcreslsgqaatrplvatvglnvpasvry

 b  mlsvaars-rhsrpsyrprpagwralrpwy--srs-srespvldlkrsvlcreslrgqaaa-alvasvslnvpasvry

 c  mlrvagrrlssslswrpaaavargplagagvpdrdddsargrsqprfsidspffvasrgfsstetvvprnqdaglad

 d  wpvrsaapsssafisanhfssdddsssprsispslasvflhhtrgfssnsvspahdmglvpd

 e  mlrvagrrlsssaarssstfftrssftvtddssparspspsltssfldqirgfssnsvspahqlglvsd

 f  mfrrscisafqptaflrvslvfkqlegsnpltvkdrpvnswsdeflkppvskemadkygryakysdptlcsvdtssevvlntypegsregrieatagvalkdydasmwd

 g  mfrrtfttafqat

 h  mlgirssvktcfkpmsltsk

 i  mapvsivsraamraaaaparavralttstalqgsssstfespfkg

 j        

 k         

 l       

 m   

 o   

 p  





Start of mature sequence:  10         20         30         40         50         60  

 @    1 ----------shtdikvpdf seyrrlevld stkssresse arkgfsylvt gvttvgvaya aknavtqfvs   60

 a    1 ----------shtdikvpdf sdyrraevld stksskesse arkgfsylvt atttvgvaya aknavsqfvs 

 b    1 ----------shtdikvpdf sdyrrpevld stksskesse arkgfsylvt atttvgvaya aknvvsqfvs 

 c    1 ----------lpatvaavkn pnpkvvydey nheryppgdp skrafayfvl sggrfiyasl lrllvlkfvl 

 d    1 ----------lpptvaaikn ptskivydeh nheryppgdp skrafayfvl tggrfvyasl mrllilkfvl 

 e    1 ----------lpatvaaikn psskivydds nheryppgdp skrafayfvl tggrfvyass vrllilkfvl 

 u    1                            mlasaggywp msaq gvnkmrrrvlvaatsvvgavg agyalvpfva

 f    1 ----------eeffrkyilk pklpneledr arvtdyalns allgfvilmv ryavlplwyv gqp------- 

 g    1 ----------raarvsllvk qlegttp 

 h    1 ----------rlisqsllas kstyrtpnfd dvlkenndad kgrsyayfmv gamgllssag akstvetfis 

 i    1 ----------eskaakvpdf gky------- -mskappstn ml--fsyfmv gtmgaitaag akstiqeflk 

 j    1 mtt----assadhp-trrdf lfv------- ---------- ---------- -atgaaaavg gaaalwpfis  

 k    1 mshadeha--gdhgatrrdf lyy------- ---------- ---------- -atagagtva agaaawtlvn 

 l    1 msnaedha------gtrrdf lyy------- ---------- ---------- -atagagava tgaavwplin 

 m    1 mshaedna------gtrrdf lyh------- ---------- ---------- -ataatgvvv tgaavwplin   36

 o    1 massdta------eatrrdf lyv------- ---------- ---------- -ataavgaag vaavawpfit 

 p    1 maeaehtastpggessrrdf liy------- ---------- ---------- -gttavgavg valavwpfid 

 u    1 mlasaggywp msaqgvnkmr rrvlvaatsv vgavgagyal vpfva





                 70         80         90        100        110                    120

 @   61 smsasadvl-a lakieiklsd ipegknmafk wrgkplfvrh rtqkeieqea avelsqlrd------------p 120                                                                                     

 a      smsasadvl-a mskieiklsd ipegknmafk wrgkplfvrh rtkkeidqea avevsqlrd------------p

 b      smsasadvl-a mskieiklsd ipegknmafk wrgkplfvrh rtkkeidqea avevsqlrd------------p

                                                                 vevsqlrd------------p

 c      smsaskdvl-a laslevdlss iepgttvtvk wrgkpvfirr rteddiklan svdvaslrh------------p

 d      smsaskdvl-a laslevdlss iepgttvtvk wrgkpvfirr rteddislan svdlgslrd------------p

 u   61 smnpsararaa gapveadisk lepgallrvk wrgmpvwvvh rssemlaals sndp klvdptsevpq

 e      smsaskdvl-a laslevdlss iepgstvtvk wrgkpvfirr rtdddiklan svdlgtlrd------------p

 f      amsm-vgqmni ea--evgele dreck--tvv wrgkpvfvyr rserqmndvl gtplsalkh------------p

 g      ********************************* ********** ******

 h      smtatadvl-a makvevnlaa iplgknvvvk wqgkpvfirh rtpheiqean svdmsalkd------------p

 i      nmsasadvl-a makvevdlna ipegknviik wrgkpvfirh rtpaeieean kvnvatlrd------------p

 j      qmnpdastiaa gapievdlsp iaegqdikvf wrgkpiyish rtkkqidear avnvaslpd------------p

 k   41 qmnpsadvq-a lasiqvdvsg vetgtqltvk wlgkpvfirr rtedeiqagr evdlgqlidrsaqnsnkpd-ap

 l   37 qmnpsadvq-a lasifvdvss vepgvqltvk flgkpifirr rteadielgr svqlgqlvdtnarnanidagae

 m   37 qmnasadvk-a masifvdvsa vevgtqltvk wrgkpvfirr rdekdielar svplgalrdtsaenankp-gae  107

 o      qmnpdaatiaa gapididisp vtegqivrvf wrgkpifirh rtakeiqsee aadvgalid------------p

 p      fmnp-aadtla lastevdvsa iaegqaitvt wrgkpvfvrh rtqkeivvar avdpaslrd------------p

 u   61                 eadisk lepgallrvk wrgmpvwvvh rssemlaals sndpklvdptsevpq



               130               140                150        160        170        180

                                  |  |                          |   |

 @  121 qhdldrvkkp -------ewviligvct HLGCvpia---------na gdfggyyCpc HGShYDasgr irl-gpaplnl 180

 a      qhdlervkkp -------ewviligvCT HLGCvpia---------na gdfggyyCPC HGShYDasgr irk-GPAPlnl

 b      qhdlervkkp -------ewviligvCT HLGCvpia---------na gdfggyyCPC HGShYDasgr irk-gpaplnl 

        qhdlervkkp -------ewviligvct hlgcvpia---------na gdfggyycpc hgshydasgr irk-gpaplnl

 c      eqdaervknp -------ewlvvigvCT HLGCiplp---------na gdfggwfCPC HGShydisgr irk-gpapfnl 

 d      qqdaervknp -------ewlvvigvCT HLGCiplp---------na gdfggwfCPC HGShYDisgr irk-GPAPynl 

 u      qpd--ycknp  trsikpeylvaigiCT HLGCsptyrpefgpddlgs gwkggfhcpc hgsrfdlaar vfknvpaptnl

 e      qqdaervknp -------ewlvvvgvCT HLGCiplp---------na gdfggwfCPC HGShYDisgr irk-gpapynl

 f      etdearfpdh ------remavviaiCT HLGCipip---------ne glfggffCPC HGShYDasgr irq-GPAPlnl

 g      ********** ***************** *************** *******

 h      qtdadrvkdp -------qwlimlgiCT HLGCvpig---------ea gdfggwfCPC Hgshydisgr irk-gpaplnl

 i      etdadrvkkp -------ewlvmlgvCT HLGCvpig---------ea gdyggwfCPC HGShYDisgr irk-GPAPlnl

 j      qsdearvksg -----heqwlvvigiCT HLGCipia---------he gnydgffCPC HGSqYDssgr irq-GPAPanl

 k      atdenrtmde -----agewlvmigvCT HLGCvpig--------dga gdfggwfCPC HGShYDtsgr irr-GPAPqnl

 l      atdqnrtlde -----agewlvmwgvCT HLGCvpig--------gvs gdfggwfCPC HGShYDsagr irk-GPAPenl 

 m  108 atdenrtlpa fdgtntgewlvmlgvCT HLGCvpmg--------dks gdfggwfCPC HGShYDsagr irk-GPAPrnl  175

 o      qpdsarvkpg -----kaewlvvyasCT HLGCiplg---------hq gdwggwfCPC HGSqYDasgr vrk-GPAPtnl

 p      qtdearvqqa -------qwlvmvgvCT HLGCiplg----qkagdpk gdfdgwfCPC HGShYDsagr irk-GPAPlnl 

 r              yi hfyppnyvnsgqltase pdqltaaa----llaarqa nvpalihcdc hgstydpyhg asvltg

 s                                                              cpc hyglyekdgtnvpgtpplapldh

 t                                                            ffcpc hyglytkdgtnvpgtpptapldr

               190        

 @  181 evptyeftsd dmvivg 196

 a      evptyeftsg dvvvvg

 b      evpsyeftsd dmvivg

        evptyeftsd dlvivg

 c      evptysflee nkllvg

 d      evptysflee nkllig

 e      evptysflee nkllig

 f      evppyrwvdd ktiylgkl      

 h      eipayefdgd -kvivg

 i      eiplyefpee gklvig

 j      pvppyqfvsd tkiqig

 k      hipvaefldd ttiklg

 l      piplakfide ttiqlg

 m  176 dipvaafvde ttiklg

 o      pvppyefvdn tkirigagva

 p      pvppyaftdd ttvlig

 u  181 vipkhvylnd ttiligedrgsa

 s     yeqe vkdgflylgk akpkgeg

 t     ye fevkdgklyl gkakprgea



Detailed alignment of the ligand region between Rhodobacter (m, R. capsulatus) and vertebrates (b, Bos taurus):

                130               140         150        160        170        180

                 |                *| *         |        * | *        |          |

 b  121 qhdlervkkp -------ewviligvCT HLGCvpia-na gdfggyyCPC HGShYDasgr irkgpaplnl  180

 m  108 atdenrtlpa fdgtntgewlvmlgvCT HLGCvpmgdks gdfggwfCPC HGShYDsagr irkGPAPrnl  175

                                         \______S-S_______/

          |          |         |          |          |          |          |     

         110        120       130        140        150        160        170
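
The pairwise comparison above can be quantified by counting identical residues column by column. The following is a minimal Python sketch; the function name `aligned_identity` is hypothetical, and the convention of skipping every column in which either row has a gap is an assumption, since the source does not state how identity should be scored.

```python
def aligned_identity(row_a, row_b, gap="-"):
    """Count identical residues between two aligned rows.

    Spaces (block separators) are dropped first. Case is ignored, on the
    assumption that upper case in this file only highlights conserved
    residues. Columns where either row has a gap are skipped.
    Returns (matches, compared_columns).
    """
    a = row_a.replace(" ", "").upper()
    b = row_b.replace(" ", "").upper()
    if len(a) != len(b):
        raise ValueError("rows must have the same aligned length")
    matches = compared = 0
    for x, y in zip(a, b):
        if x == gap or y == gap:
            continue
        compared += 1
        if x == y:
            matches += 1
    return matches, compared


# Rows b (Bos taurus) and m (Rhodobacter capsulatus) from the block above.
row_b = "qhdlervkkp -------ewviligvCT HLGCvpia-na gdfggyyCPC HGShYDasgr irkgpaplnl"
row_m = "atdenrtlpa fdgtntgewlvmlgvCT HLGCvpmgdks gdfggwfCPC HGShYDsagr irkGPAPrnl"
matches, compared = aligned_identity(row_b, row_m)
print(f"{matches}/{compared} identical ({100.0 * matches / compared:.1f}%)")
```

Other conventions (for example, counting gap columns in the denominator) give lower percentages, so the scoring rule should be reported alongside any figure.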