P450s that have appeared since the 1993 P450 nomenclature update.
      This is part E of the bibiographic P450 files.  
      This section contains bacterial sequences CYP101 to CYP174.  
      This includes references that were incomplete and duplications
      of sequences that were already in the update.  If a sequence 
      is assigned an accession number that was not in the old update
      it is included in this list.  48 new P450s were added July 27, 2000
      Four new sequences were added Jan. 9, 2001 CYP102C1, CYP172-174.
      Added CYP175A1 9/17/2001
      Compiled by David R. Nelson
      Last modified June 2, 2003 added 25 new sequences. 
      Last modified Nov. 5, 2003 There are now 501 bacterial P450s

51 Family

101 Family

102 Family

103 Family

104 Family

105A Subfamily

105B Subfamily

105C Subfamily

105D Subfamily

105E Subfamily

106 Family

107A Subfamily

107B Subfamily

107C Subfamily

107D Subfamily

107E Subfamily

107F Subfamily

107G Subfamily

107H Subfamily

107J Subfamily

108 Family

109 Family

110 Family

111 Family

112 Family

113A Subfamily

113B Subfamily

114 Family

115 Family

116 Family

117 Family

118 Family

119 Family

120 Family

121 Family

122 Family

123 Family

124 Family

125 Family

126 Family

127 Family

128 Family

129 Family

130 Family

131 Family

132 Family

133 Family


51 Family


CYP51      Mycobacterium tuberculosis
           GenEMBL Z80226 (34809bp) gi 1550642 Rv0764c
           complement (6140-7495)
           33.7% identical to CYP51 over 439AA overlap
           this is a bacterial CYP51

CYP51      Mycobacterium bovis subsp. bovis AF2122/97
           NC_002945 complete genome complement(858662..858868)
           CYP51 100% match
           locus_tag = Mb0786c

CYP51 Mycobacterium avium
TIGR contig:3273:m_avium Length = 5,475,738
79% to CYP51 M. tuberculosis
3021360 TSTVVPRVSGGEEEHGHLEEFRTDPIGLMQRVRDECGDVGWFQLVDKHVILLSGAQANEF 3021539
3021540 FFRSADEDLDQAEAYPFMTPIFGKGVVFDASPERRKEMLHNSALRGEQMKGHASTIEGEV 3021719
3021720 KKMIADWGDEGEIELLDFFAELTIYTSTACLIGLKFREQLDHRFAEYYHDLERGTDPLCY 3021899
3021900 VDPYLPIESFKRRDEARVKLVALVQEIMDQRLANPPKDKADRDMLDVLVSIKDEDGKPRF 3022079
3022080 SADEITGMFISLMFAGHHTSSGTSAWTLIELIRHPDVYAEVLAELEELYADGQEVSFHAL 3022259
3022260 RSIPKLDNVVKETLRLHPPLIILMRVAKGEFEVEGFPIHEGDYVAASPAISNRIPEDFPD 3022439
3022440 PDAFKPDRYNKPEQADIVNRWTWIPFGAGRHRCVGAAFAQMQIKAIFSVLLREYDFEMAQ 3022619
3022620 PADSYRNDHSKMVVQLARPAKVRYRKR 3022700

CYP51 Mycobacterium smegmatis
TIGR contig:3439:m_smegmatis Length = 6,989,783
80% to CYP51 M. tuberculosis
4858809 VPRVSGGEEEHGHLEEFRTDPIGLMKRVRSECGDVGWFQLADKQVVLLSGAEANEFFFRS 4858988
4858989 SDSELNQAEAYPFMTPIFGEGVVFDADPERRAEMLHNTALRGEQMKGHAATIENEVRRMV 4859168
4859169 ESWGDEGEIDLLEFFAELTIYTSTACLIGVKFRNQLDKRFADYYHLLERGTDPLCYVDPY 4859348
4859349 LPIESFRIRDEARANLVELVQEVMNGRIANPPKDKSDRDLLDVLVSIKDEDGTPRFSANE 4859528
4859529 VTGMFISLMFAGHHTSSGTASWTLIELLRHPEFYAKVQAELDDLYADGQEISFHALRQIP 4859708
4859709 NLDNALKETLRLHPPLIILMRVAQDEFEVAGRPIHKGQMVAASPAISNRIPEDFPDPDTF 4859888
4859889 DPDRYDKPRQEDLINRWTWIPFGAGKHRCVGAAFAQMQIKAIFSVLLRDFEFEMAQPSES 4860068
4860069 YRNDHSKMVVQLARPAKVRYRRR 4860137

CYP51 Methylococcus capsulatus
TIGR contig:221:m_capsulatus
49% to CYP51 M. tuberculosis
NOTE FUSION PROTEIN EXTENDS C-TERMINAL. 
SEE J. Biol. Chem., Vol. 277, Issue 49, 46959-46965, December 6, 2002
A Novel Sterol 14-Demethylase/Ferredoxin Fusion Protein (MCCYP51FX) from
Methylococcus capsulatus Represents a New Class of the Cytochrome P450
Superfamily 
Colin J. Jackson¤, David C. Lamb¤, Timothy H. Marczylo, Andrew G. S. Warrilow, Nigel J. Manning¦, David J. Lowe, Diane
E. Kelly, and Steven L. Kelly
908332 MSHPPSNTP
908305 PVKPGGLPLLGHILEFGKNPHAFLMALRHEFGDVAEFRMFHQRMVLLTGSQASEAFYRAP 908126
908125 DEVLDQGPAYRIMTPIFGRGVVFDARIERKNQQLQMLMPALRDKPMRTYSEIIVAEVEAM 907946
907945 LRDWKDAGTIDLLELTKELTIYTSSHCLLGAEFRHELNTEFAGIYRDLEMGIQPIAYVFP 907766
907765 NLPLPVFKRRDQARVRLQELVTQIMERRARSQERSTNVFQMLIDASYDDGSKLTPH 907598
907597 EITGMLIATIFAGHHTSSGTTAWVLIELLRRPEYLRRVRAEIDALFETHGRVTFESLRQM 907418
907417 PQLENVIKEVLRLHPPLILLMRKVMKDFEVQGMRIEAGKFVCAAPSVTHRIPELFPNPEL 907238
907237 FDPDRYTPERAEDKDLYGWQAFGGGRHKCSGNAFAMFQIKAIVCVLLRNYEFELAAAPE 907061
907060 SYRDDYRKMVVEPASPCLIRYRRRDAP 906980

101 Family


CYP101A1    Pseudomonas putida
            GenEMBL D00528 (1950bp)
            Koga,H., Yamaguchi,E., Matsunaga,K., Aramaki,H. and Horiuchi,T.
            Cloning and nucleotide sequences of NADH-putidaredoxin reductase
            gene(camA) and putidaredoxin gene(camB) involved in cytochrome
            P-450cam hydroxylase of Pseudomonas putida
            J. Biochem. 106, 831-836 (1989)
            Note: only the last 93 nucleotides of the cam gene was cloned along 
            with two downstream genes.

CYP101A1    Pseudomonas putida
            PIR C60886 (last 8 amino acids)
            Romeo, C., Moriwaki, N., Yasunobu, K.T., Gunsalus, I.C.,
            Koga, H.
            Identification of the coding region for the putidaredoxin
            reductase gene from the plasmid of Pseudomonas putida.
            J. Protein Chem. 6, 253-261 (1987)

CYP101B1    Novosphingobium aromaticivorans
            NZ_AAAV01000165.1 
            complement(29626..30870) gene = Saro2804
            43% to CYP101
MLPHDRGQNSTRRITAMEAPAHVPADRVVDIDIYMPPGLAEHGF
HKAWSDLSAGNPAVVWTPRNEGHWIALGGEALQEVQSDPERFSSRIIVLPKSVGEMHG
LIPTTIDPPEHRPYRQLLNAHLNPGAIRGLSESIRQTAVDLIEGFAAQGHCNFTAQYA
EQFPIRVFMALVGIEASEAPRIRHWAECMTRPGMDMTFDEAKAVFFDYVGPLVDARRE
TPGEDMISAMINADLGDGRRLTRDEALSVVTQVLIAGLDTVVNVLGFIMRELAGNPAL
RADLRQRGADILPVVHELFRRFGLVSIAREVRRDIEFHGVHLKAGDMIAIPTQVHGLD
PRVNPDPLAIDPSRKRARHSTFGSGPHMCPGQELARKEVAITLEEWLRRIPDFALGPN
SDLSPVPGIVGALRRVELVWNT

CYP101C1   Novosphingobium aromaticivorans
           NZ_AAAV01000133.1 
           complement(4199..5389) gene = Saro1574
           44% to CYP101A1
MIPAHVPADRVVDFDIFNPPGVEQDYFAAWKTLLDGPGLVWSTA
NGGHWIAARGDVVRELWGDAERLSSQCLAVTPGLGKVMQFIPLQQDGAEHKAFRTPVM
KGLASRFVVALEPKVQAVARKLMESLRPRGSCDFVSDFAEILPLNIFLTLIDVPLEDR
PRLRQLGVQLTRPDGSMTVEQLKQAADDYLWPFIEKRMAQPGDDLFSRILSEPVGGRP
WTVDEARRMCRNLLFGGLDTVAAMIGMVALHLARHPEDQRLLRERPDLIPAAADELMR
RYPTVAVSRNAVADVDADGVTIRKGDLVYLPSVLHNLDPASFEAPEEVRFDRGLAPIR
HTTMGVGAHRCVGAGLARMEVIVFLREWLGGMPEFALAPDKAVTMKGGNVGACTALPL
VWRA

CYP101D1   Novosphingobium aromaticivorans
           NZ_AAAV01000085.1 
           complement(6803..8068) gene = Saro0669
           44% to CYP101
MNAQTSTATQKHRVAPPPHVPGHLIREIDAYDLDGLEQGFHEAW
KRVQQPDTPPLVWTPFTGGHWIATRGTLIDEIYRSPERFSSRVIWVPREAGEAYDMVP
TKLDPPEHTPYRKAIDKGLNLAEIRKLEDQIRTIAVEIIEGFADRGHCEFGSEFSTVF
PVRVFLALAGLPVEDATKLGLLANEMTRPSGNTPEEQGRSLEAANKGFFEYVAPIIAA
RRGGSGTDLITRILNVEIDGKPMPDDRALGLVSLLLLGGLDTVVNFLGFMMIYLSRHP
ETVAEMRREPLKLQRGVEELFRRFAVVSDARYVVSDMEFHGTMLKEGDLILLPTALHG
LDDRHHDDPMTVDLSRRDVTHSTFAQGPHRCAGMHLARLEVTVMLQEWLARIPEFRLK
DRAVPIYHSGIVAAVENIPLEWEPQRVSA

CYP101D2   Novosphingobium aromaticivorans
           NZ_AAAV01000042
           complement(5601..6899) gene = Saro0208
           63% to 101D1
MGTTRMDTFNPQESRLATNFDEAVRAKVERPANVPEDRVYEIDM
YALNGIEDGYHEAWKKVQHPGIPDLIWTPFTGGHWIATNGDTVKEVYSDPTRFSSEVI
FLPKEAGEKYQMVPTKMDPPEHTPYRKALDKGLNLAKIRKVEDKVREVASSLIDSFAA
RGECDFAAEYAELFPVHVFMALADLPLEDIPVLSEYARQMTRPEGNTPEEMATDLEAG
NNGFYAYVDPIIRARVGGDGDDLITLMVNSEINGERIAHDKAQGLISLLLLGGLDTVV
NFLSFFMIHLARHPELVAELRSDPLKLMRGAEEMFRRFPVVSEARMVAKDQEYKGVFL
KRGDMILLPTALHGLDDAANPEPWKLDFSRRSISHSTFGGGPHRCAGMHLARMEVIVT
LEEWLKRIPEFSFKEGETPIYHSGIVAAVENVPLVWPIAR

102 Family

CYP102A1   Bacillus megaterium
           Ruettinger,R.T.,Wen, L.-P. and Fulco, A.J. 
           Coding Nucleotide, 5'-Regulatory, and Deduced Amino Acid Sequences of 
           P450BM-3, a Single Peptide Cytochrome P450:NADPH-P450 Reductase from 
           Bacillus megaterium. 
           J. Biol. Chem. 264, 10987-10995 (1989)

CYP102A1    Bacillus megaterium
            GenEMBL J04832 (4957bp)
            Ravichandran,K.G., Boddupalli, S.S., Hasemann,C.A.,
            Peterson,J.A. and Deisenhofer,J. 
            Crystal structure of hemoprotein domain of P450BM-3, a prototype
            for microsomal P450s.
            Science 261, 731-736 (1993)
            P450 is N-terminal

CYP102A2    Bacillus subtilis
            GenEMBL D87979
            Yamamoto, H., S. Uchiyama, F. A. Nugroho, and J. Sekiguchi.
            A 23.4 kb segment at the 69 degrees-70 degrees region of the 
            Bacillus subtilis genome. 
            Microbiology. 143, 1317-20 (1997)
            Gene name yfnJ    66.4% identical to CYP102A1 P450 part only
            also called YetO (fusion of P450 and reductase like CYP102A1, P450 part is 
            N-terminal)

CYP102A3    Bacillus subtilis
            GenEMBL U93874, Z99117
            Sorokin, A., A. Bolotin, B. Purnelle, H. Hilbert, J. Lauber, A. 
            Dusterhoft, and S. D. Ehrlich.  
            Sequence of the Bacillus subtilis genome region in the vicinity of 
            the lev operon reveals two new extracytoplasmic function RNA 
            polymerase sigma factors SigV and SigZ. 
            Microbiology. 143, 2939-43 (1997)
            Gene name yrhJ  most similar to CYP102A2
            (fusion of P450 and reductase like CYP102A1 P450 part is N-
            terminal)

CYP102A4    Bacillus anthracis str. Ames
            GenPept AAP27014                
            bifunctional P-450:NADPH-P450 reductase 1 
            79% to 102A2
   1 MDKKVSAIPQ PKTYGPLGNL PLIDKDKPTL SFIKLAEEYG PIFRMQTLSD TIIVVSGHEL
  61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETQEPNWQ KAHNILMPTF SQRAMKDYHA
 121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSFYR ETPHPFITSM
 181 TRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSE NQEENDLLSR
 241 MLNVQDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR
 301 VLTDSTPTYQ QVMKLKYIRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV
 361 LIPQLHRDKD AWGDNVEEFQ PERFEELDKV PHHAYKPFGN GQRACIGMQF ALHEATLVMG
 421 MLLQHFEFID YEEYQLDVKQ TLTLKPGDFK IRIVPRNQTI SHTTVLAPTE EKLKNHEIKQ
 481 QVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVAAL NDRIGSLPKE
 541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKG DELKGVQYAV FGCGDHNWAS TYQRIPRYID
 601 EQMAQKGATR FSTRGEADAS GDFEEQLEQW KQRMWSDAMK VFGLELNKNM EKERSTLSLQ
 661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSERSTRHIE ISLPEGATYK EGDHLGVLPI
 721 NSEKNVNRIL KRFGLNGKDQ VILSASGRSV NHIPLDSPVR LYDLLSYSVE VQEAATRAQI
 781 REMVTFTACP PHKKELESLL EDGVYQEQIL KKRISMLDLL EKYEACEIRF EPFLELLPAL
 841 KPRYYSISSS PLVAQDRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP
 901 QSNFQLPENP ETPIIMVGPG TGIAPFRGFL QARRVQKQKG MNVGEAHLYF GCRHPEKDYL
 961 YRTELENDER DGLISLHTAF SRLEGQAKTY VQHVIKEDRI HLISLLDNGA HLYICGDGSK
1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR LQEEGRYGKD VWAGI

CYP102A5    Bacillus cereus ATCC 14579
            GenPept AAP10153 
            NADPH-cytochrome P450 reductase/P450 fusion
            79% to 102A2 Bacillus subtilis
  1 MEKKVSAIPQ PKTYGPLGNL PLIDKDKPTL SFIKIAEEYG PIFQIQTLSD TIIVVSGHEL
  61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETHEPNWK KAHNILMPTF SQRAMKDYHA
 121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSFYR ETPHPFITSM
 181 TRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSG DQEENDLLSR
 241 MLNVPDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR
 301 VLTDPTPTYQ QVMKLKYMRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV
 361 LIPQLHRDKD AWGDNVEEFQ PERFEELDKV PHHAYKPFGN GQRACIGMQF ALHEATLVMG
 421 MLLQHFELID YQNYQLDVKQ TLTLKPGDFK IRILPRKQTI SHPTVLAPTE DKLKNDEIKQ
 481 HVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVVAL NDRIGSLPKE
 541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKP DELKGVQYAV FGCGDHNWAS TYQRIPRYID
 601 EQMAQKGATR FSKRGEADAS GDFEEQLEQW KQNMWSDAMK AFGLELNKNM EKERSTLSLQ
 661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSDRSTRHIE VSLPEGATYK EGDHLGVLPV
 721 NSEKNINRIL KRFGLNGKDQ VILSASGRSI NHIPLDSPVS LLALLSYSVE VQEAATRAQI
 781 REMVTFTACP PHKKELEALL EEGVYHEQIL KKRISMLDLL EKYEACEIRF ERFLELLPAL
 841 KPRYYSISSS PLVAHNRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP
 901 QSNFELPKDP ETPIIMVGPG TGIAPFRGFL QARRVQKQKG MNLGQAHLYF GCRHPEKDYL
 961 YRTELENDER DGLISLHTAF SRLEGHPKTY VQHLIKQDRI NLISLLDNGA HLYICGDGSK
1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR VQDEGRYGKD VWAGI

CYP102A6    Bradyrhizobium japonicum USDA 110
            GenPept BAC48147                
            NC_004463 complete genome 3173438..3176674
            NADPH-cytochrome P450 reductase/P450 fusion
            54% to 102A2
   1 MSSKNRLDPI PQPPTKPVVG NMLSLDSAAP VQHLTRLAKE LGPIFWLDMM GSPIVVVSGH
  61 DLVDELSDEK RFDKTVRGAL RRVRAVGGDG LFTADTREPN WSKAHNILLQ PFGNRAMQSY
 121 HPSMVDIAEQ LVQKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE
 181 SLVRSLETIM MTRGLPFEQI WMQKRRKTLA EDVAFMNKMV DEIIAERRKS AEGIDDKKDM
 241 LAAMMTGVDR STGEQLDDVN IRYQINTFLI AGHETTSGLL SYTLYALLKH PDILKKAYDE
 301 VDRVFGPDVN AKPTYQQVTQ LTYITQILKE ALRLWPPAPA YGISPLADET IGGGKYKLRK
 361 GTFITILVTA LHRDPSVWGP NPDAFDPENF SREAEAKRPI NAWKPFGNGQ RACIGRGFAM
 421 HEAALALGMI LQRFKLIDHQ RYQMHLKETL TMKPEGFKIK VRPRADRERG AYGGPVAAVS
 481 SAPRAPRQPT ARPGHNTPML VLYGSNLGTA EELATRMADL AEINGFAVHL GALDEYVGKL
 541 PQEGGVLIIC ASYNGAPPDN ATQFVKWLGS DLPKDAFANV RYAVFGCGNS DWAATYQSVP
 601 RFIDEQLSGH GARAVYPRGE GDARSDLDGQ FQKWFPAAAQ VATKEFGIDW NFTRTAEDDP
 661 LYAIEPVAVT AVNTIVAQGG AVAMKVLVND ELQNKSGSNP SERSTRHIEV QLPSNITYRV
 721 GDHLSVVPRN DPTLVDSVAR RFGFLPADQI RLQVAEGRRA QLPVGEAVSV GRLLSEFVEL
 781 QQVATRKQIQ IMAEHTRCPV TKPKLLAFVG EEAEPAERYR TEILAMRKSV YDLLLEYPAC
 841 ELPFHVYLEM LSLLAPRYYS ISSSPSVDPA RCSITVGVVE GPAASGRGVY KGICSNYLAN
 901 RRASDAIYAT VRETKAGFRL PDDSSVPIIM IGPGTGLAPF RGFLQERAAR KAKGASLGPA
 961 MLFFGCRHPD QDFLYADELK ALAASGVTEL FTAFSRADGP KTYVQHVLAA QKDKVWPLIE
1021 QGAIIYVCGD GGQMEPDVKA ALVAIRHEKS GSDTATAARW IEEMGATNRY VLDVWAGG

CYP102B1    Streptomyces coelicolor cosmid F43.
            GenEMBL AL136502 CDS 10570..12153 gene="SCF43.12"
            Highly similar to the N-terminal P450 domain of Bacillus
            megaterium 41.9% identity in 497 aa overlap. 
            45% to 102A1 over 433 amino acids
            cloned and expressed by David Lamb and Steve Kelly

CYP102B2   Streptomyces avermitilis
           GenEMBL AP005050
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV7426 
           78% to 102B1 from Streptomyces coelicolor

CYP102C1    Rhodococcus sp. X309 
            GenEMBL AF059700.1 complement(3619-4584) runs off end of sequence
            partial gene 48% to 102B1 

CYP102D1   Streptomyces avermitilis
           GenEMBL AP005023
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV575 47% to 102A3 
           40% to 102B1, 44% to 102C1 partial seq 

CYP102E1    Ralstonia metallidurans
            GenEMBL NZ_AAAI01000371 
            104500-107000 region
            51% to 102D1
MSTATPAAALEPIPRDPGWPIFGNLFQITPGEVGQHLLARSRHHDGIFELDFAGKRVPFVS
SVALASELCDATRFRKIIGPPLSYLRDMAGDGLFTAHSDEPNWGCAHRILMPAFSQRAM
KAYFDVMLRVANRLVDKWDRQGPDADIAVADDMTRLTLDTIALAGFGYDFASFASDELDP 
FVMAMVGALGEAMQKLTRLPIQDRFMGRAHRQAAEDIAYMRNLVDDVIRQRRVSPTSGMD 
LLNLMLEARDPETDRRLDDANIRNQVITFLIAGHETTSGLLTFALYELLRNPGVLAQAY 
AEVDTVLPGDALPVYADLARMPVLDRVLKETLRLWPTAPAFAVAPFDDVVLGGRYRLRKD 
RRISVVLTALHRDPKVWANPERFDIDRFLPENEAKLPAHAYMPFGQGERACIGRQFALTE 
AKLALALMLRNFAFQDPHDYQFRLKETLTIKPDQFVLRVRRRRPHERFV
TRQASQAVADAAQTDVRGHGQAMTVLCASSLGTARELAEQIHAGAIAAGFDAKLADLDDA
VGVLPTSGLVVVVAATYNGRAPDSARKFEAMLDADDASGYRANGMRLALLGCGNSQWATY
QAFPRRVFDFFITAGAVPLLPRGEADGNGDFDQAAERWLAQLWQALQADGAGTGGLGVDV
QVRSMAAIRAETLPAGTQAFTVLSNDELVGDPSGLWDFSIEAPRTSTRDIRLQLPPGITY
RTGDHIAVWPQNDAQLVSELCERLDLDPDAQATISAPHGMGRGLPIDQALPVRQLLTHFI
ELQDVVSRQTLRALAQATRCPFTKQSIEQLASDDAEHGYA

CYP102F1   Actinosynnema pretiosum subsp. auranticum
           AF453501 complement(6501..9518)
           maytansinoid antitumor agent ansamitocin biosynthetic gene cluster I
           49% to 102A3
           gene = asm30
MVATGTRIPGPKPLPLVGNLLDVLTSDLDTDVDFLDRCHREHGG
IVALTFAGQRQVFASSHELVARMCSDPSWGKAVHPALEQVRDFAGDGLFTARGDEPNW
GKAHRLLMPAFGPTAMRDHFPAMLDIAEQMLVRWRRFGPDHRIDVADDMTRLTLDTIA
LCAFGARFNSFYRDRAHPFVDAMVRSLVEAGERAERLPGVQPFLVGRNQRYRDDIATM
NRIADGIVAARAALPAGERPDDLLERMLTCADPVTGERLSARNVRYQLATFLIAGHET
TSGLLSFAVHRLLAHPEVLRKAKDAVDGVLGDRVPAFEDLARLDYLGQVLRETLRLHP
TAPAFALAPDEPAELGGHAIGAGEPVLVMLPTLHRDPAVWRDPDVFDPERFAPERMDE
IPACAWMPFGHGARACIGRPFALQEATLVLALVLQRFDLALADPDHRLTIKQTLTLKP
DSLVVRARPRADRPGATATVETVVPHQVPATHRHGTPLHVFYGSNGGSGEGLARTIAG
DGAARGWATSVAPLDDAVRALPASGPVVIVSSSYNGAPPDNAAHFVRWLTQDGPDLSG
VDYLVLGCGNLDWSATYQRVPTLIDEAMAAAGARRLRERGATDARADFFGDWERWYEP
LWPLLSAECGVEVGEIGPRFRVVESDAADGLGDLASAVVLENRELVRGPDAGSKRHLE
LRLPDGTSYRTGDYLSVLPQNHPDLVRRAVARLGTRAERVVTVESSAPTGLVPVGRAL
RVDELLTRCVDLSAPAGAGVVARLAERCPCPPERAELAATTGATLLELLERFPSCAVD
LALALELLPAPRTRLYSISSAAEEQRAEVALTVSVTGVTSGYLSRVRPGDRVAVGIAS
PPESFRPPADNTVPVVLIAAGTGIAPFRGFLRARAALGGEPGPALLLFGCRGPELDDL
YAEEFAALGDWLEVDRAYSRHPDGEVRHVQHRLWQRRDRVRELVDAGARVYLCGDATR
VGPAVEEVLGRIGPGAGWLDALRAGGRYATDVF

103 Family

CYP103A1     Agrobacterium tumefaciens
             GenEMBL M19352, AF242881 CDS 141158.142426 
             gene="virH1"

CYP103A2     Agrobacterium tumefaciens
             GenEMBL AF034769
             GenEMBL AB016260 CDS 124584..125759

CYP103A3   Agrobacterium tumefaciens plasmid pTiAB2/73 vir region
           GenEMBL AF329849 892..2148
           gene = virH
           61% TO 103A1
MNARGPEKVSQTSGPIISASLDPDNVSVSDLDRSGHAIFAEWRP
KRPFLRRQDGVYVLLRADDVLGLSSDPRTRQIETELMLNRGINEGAVFDFVRYSMLFS
NNEVHSRRRSPFTRTFAFRMIENLRPQVSQLTETLFQDLKELDSFNFVEEFASKLPAV
AIAGLLGLPPSDIPYFTQLVYRVARCLSPSWRDADLPDIEASAAEFKNYVQAVIDDRR
SNPRDDFLSSFIRATREAEDLSPDEGLAQLMLIVLAGTDTTKTGLTALTGQLLRHRHV
WEALLKDESLVPAAVEEGLRFEPPVGSYPRLALADIDLEGFILPKGSLLALCTMSALR
DEKHFAHPELFDIHRKQMHWHMVFGAGAHRCLGEALARLELQEGLATVLRYAPTLSIE
GEWPTVQGHGGVRRIAEMRVGFRRQI

104 Family

CYP104A1     Agrobacterium tumefaciens
             GenEMBL M19352, AF242881 CDS 142447..143670 
             gene="virH2"

CYP104A2    Agrobacterium tumefaciens
            GenEMBL AB016260 
            103A2 CDS 124584..125759 and 
            104A2 CDS 125919..127094 83% to 104A1

105A Subfamily

CYP105A1    Streptomyces griseolus
            GenEMBL M36480 (1629bp) Y18556 CDS 2447..3703
            Omer,C.A., Lenstra,R., Litle,P.J., Dean,C.R., Tepperman,J.M.,
            Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
            Genes for two herbicide-inducible cytochromes P-450 from
            Streptomyces griseolus
            J. Bacteriol. 172, 3335-3345 (1990)
            Gene suaC

CYP105A2    Amycolata autotrophica
            GenEMBL D26543 (1197bp)
            Kawauchi,H., Sasaki,J., Adachi,T., Hanada,K., Beppu,T. and 
            Horinouchi,S. 
            Cloning and nucleotide sequence of a bacterial cytochrome P-450 
            VD25
            gene encoding vitamin D-3 25-hydroxylase
            Biochim. Biophys. Acta 1219, 179-183 (1994)

CYP105A3    Streptomyces carbophilus
            GenEMBL D30815 PIR JC4287
            Watanabe,I., Nara,F. and Serizawa,N.
            Cloning, characterization and expression of the gene encoding
            cytochrome P-450sca-2 from Streptomyces carbophilus involved in
            production of pravastatin, a specific HMG-CoA reductase inhibitor
            Gene 163 (1), 81-85 (1995)

105B Subfamily

CYP105B1    Streptomyces griseolus
            GenEMBL M36481 (1688bp) M32239
            Omer,C.A., Lenstra,R., Litle,P.J., Dean,C.R., Tepperman,J.M.,
            Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
            Genes for two herbicide-inducible cytochromes P-450 from
            Streptomyces griseolus
            J. Bacteriol. 172, 3335-3345 (1990)
            Gene subC, SU-2

CYP105B2    Streptomyces tubercidicus strain R-922 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Cyp229
            78% to 105B1

105C Subfamily

CYP105C1    Streptomyces sp.
            GenEMBL M31939 PIR S19629 (381 amino acids)
            Horii, M., Ishizaki, T., Paik, S.Y., Manome, T. and Murooka, Y.
            An operon containing the genes for cholesterol oxidase and a
            cytochrome P-450-like protein from a Streptomyces sp.
            J. Bacteriol. 172, 3644-3653 (1990)
            Gene choP

105D Subfamily

CYP105D1    Streptomyces griseus
            GenEMBL S45823 X63601 (1700bp) PIR S24750 (412 amino acids)
            Trower,M.K., Lenstra,R., Omer.C., Buchholz,S.E., and 
            Sariaslani,F.S.
            Cloning, nucleotide sequence determination and expression
            of the genes encoding cytochrome P-450soy (soyC) and 
            ferredoxinsoy (soyB) from streptomyces griseus.
            Mol. Microbiol. 6, 2125-2134 (1992)
            PIR S35901 (412 amino acids)
            Erratum. Cloning, nucleotide sequence determination and
            expression of the genes encoding cytochrome P-450(soy)
            (soyC) and ferredoxin(soy) (soyB) from Streptomyces griseus.
            Mol. Microbiol. 7, 1024-1025 (1993)

CYP105D2    Streptomyces griseus
            GenEMBL AF071145
            84% identical to 105D1

CYP105D3    Streptomyces sclerotialus
            GenEMBL AF071149
            68% identical to 105D1

CYP105D4    Streptomyces lividans 
            GenEMBL AF072709 CDS complement(1593..2813)
            69% to 105D1 67% to 105D2 82% to 105D3 57% to 105A1 

CYP105D5    Streptomyces coelicolor 
            3StF60 [Full Sequence] Sanger cosmid 
            CDS comp(2106-3344) 98% identical to CYP105D4
            cloned and expressed by David Lamb and Steve Kelly

CYP105D6   Streptomyces avermitilis
           GenEMBL AB070949.1 69121-70371
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV412_pteD 55% to 105D1 from Streptomyces griseus,
           53% to 105D4, 54% to 105D5 (if first 17aa left off 105D5)
           Gene = pteD

CYP105D7   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV7469 73% to 105D4 from Streptomyces lividans

CYP105D8    Streptomyces tubercidicus strain I-1529 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Cyp233
            68% to 105D7

CYP105D9   Streptomyces sp. JP95
           GenEMBL AF509565 11774..13024
           griseorhodin biosynthesis gene cluster
           55% to 105D6
           gene = grhO3
MTDTLDEPQTLADGAEDAPAYPVKRTCPYRMPPGYEELREKGPI
SRVTLWNGRTAWLVTGNDLGRRLFPDARLSSDVLDPRFPLLAPRIEAQRQQAAAPPLV
GVDDPVHARQRRMVLPSFGIRQINALRPEIQKYADDLLDTMLAKGPGVTVDLLTEYAL
PMPSAVICMLLGVPYEDHHYFDERSRHVLSSSGEEQAAQAQQAFTEILAYLDDLIVRK
QAEPGDTLLDELIARQLEEGKVDRQELAMIATVLLVSGHETTSNMIALSTMALLADPD
QLAALRADESLMPRAVDELMRFSSIGDMLMRVAKEDIEIEGHLIRAGDGVILSTMLMN
RDPGAFERPDELDIRRPAGRHVAFGYGIHQCIGQNLARAEMEIALATLFRRVPTLKLA
VPAEQVPVNAPFVLQGVSELPVTW

105E Subfamily

CYP105E1    Rhodococcus fascians
            GenEMBL Z29635 (7139bp) PIR S42052 (399 amino acids)
            Crespi,M., Vereecke,D.M., Temmerman,W.G., Van Montagu,M.
            and Desomer,J.
            The fas operon of Rhodococcus fascians encodes new genes required 
            for efficient fasciation of host plants.
            J. Bact. 176, 2492-2501 (1994)
MAGTADLPLEMRRNGLNPTEELAQVRDRDGVIPVGELYGAPAFL
VCRYEDVRRIFADSNRFSNAHTPMFAIPSGGDVIEDELAAMRAGNLIGLDPPDHTRLR
HILAAEFSVHRLSRLQPRIAEIVDSALDGLEQAGQPADLMDRYALPVSLLVLCELLGV
PYADRDELRDRTARLLDLSASAEQRAVAQREDRRYMATLVTRAQEQPGDDLLGILARK
IGDNLSTDELISIISLIMLGGHETTASMIGLSVLALLHHPEQAAMMIEDPNCVNSGIE
ELLRWLSVAHSQPPRMAVTEVQIAGVTIPAGSFVIPSLLAANRDSNLTDRPDDLDITR
GVAGHLAFGHGVHFCLGHSLARMTLRTAVPAVLRRFPDLALSPSHDVRLRSASIVLGL
EELQLTW

CYP105F1    Streptomyces lavendulae 
            GenEMBL AF127374 CDS 2006..3229
            48% to 105C1 42% to 105B1 40% to 105D1 new subfamily in 105

CYP105F2   Streptomyces peucetis
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           85% to 105F1
           clone name SP8812

CYP105G1    Amycolatopsis mediterranei 
            GenEMBL AF040571 CDS complement(5011..6066)
            49% to 105C1, 105B1 new subfamily in 105 
            looks like an insertion in the seq from 80-120

CYP105H1    Streptomyces noursei ATCC 11455 nyst 
            GenEMBL AF263912 CDS comp (58637..59833)
            gene="nysN" 47% to 105B1 46% to 105A1 46% to 105D1 
            function="presumably involved in modification of the
            nystatin macrolactone ring"

CYP105H2    Streptomyces albus
            GenEMBL AF071143 
            77% to 105H1
LLIAGHETTANNIGLGVVTLLSHPQWAGDERAVEELLRLHSVAD
MVALRVAVDDVEIAGQVIRKGEGIVPLLAAANHDTEVFGCPHAFDPERSERRHVAFGY
GVHQCLGQNL

CYP105H3   Streptomyces natalensis 
           GenEMBL AJ278573 52789..53985
           pimaricin biosynthetic gene cluster.
           68% to 105H1
           gene = pimG
MTYTDPAAPETDPPAVDFPQRKPGVPFPPPDYADYRDRKGLVLS
QLSDGKRVWLVTRHEDVRAVLTSPSISSNPEHKGFPNVGNLGVPKQDQIPGWFVGMDS
PEHDRFRKALIPEFTVRRVRAMKPAIERTVDAQLDAMLAAGNTADLVADFALPIPSLV
ISALLGVPPADREFFESRTRVLVSLRSSTDDDRMAAAKDLLRYINRLVEIKQKWGGDD
LITRLLATGAIAPHEMSGVLMLLLIAGHETTANNIALGVVTLLANPQWIGDDRAVEET
LRFHSVADLVSLRVAVQDVEIAGQLIKAGEGIVPLVAAANHDENAFECPHAFDPSRSA
RHHVAFGYGVHQCLGQNLVRIEMEVAYRKLFERIPNLELAVPTDGLDIKYDGVLYGLN
ELPVRW

CYP105H4   Streptomyces nodosus 
           GenEMBL AF357202 complement(62051..63250)
           amphotericin biosynthetic gene cluster
           84% to 105H1
MTAETEMTTFAPGCPVAFPLRRPGRPFPPPEYADYRAGEGLVRS
ELPASGPVWLVTRHEDVRTVLTDPRISADPSRPGFPRARRTGGAPSQSEIPGWFVALD
PPEHDRFRKTLIPEFTVRKVRELRPAIQQIVDERIDALLAAGNSADLIADFALSVPSL
VISDLLGVPKADRDFFEAKTKVLVTLSSTDEQRDEASKALLRYLNRLIQIKGRRPGED
LISRLLQAGTMNRQELSGVSMLLLIAGHETTANNIGLGVVQLLTNPQWIGDDRIVEEM
LRYYSVADLVSFRVAVEDVEIGGQLIKAGEGIVPLIAAANHDGSVFDKPEEFNPERSA
RSHVAFGYGVHQCLGQNLVRVEMEIAYRTLFERIPTLELAVPVEELPLKYDGVLFGLH
ELPVTWS

CYP105H5   Streptomyces griseus 
           GenEMBL AJ300302 10678..11859
           Gene = canC
           72% to 105H3
MTTSPGPTVVDFPRRTPREPLPLSQYAEHRKQNGLVQTHLPNGR
PIWLVTRHEDVRAVLTHPRISANPDNEGFPNVGETMGVPKQEQIPGWFVGLDSPEHDR
FRKVLIPEFTVRRVRELRPAIERTVDERIDAMLAGGNTADLVNDFALPVPSLVISALL
GVPSADRDFFESRTRTLVAIRTSTDEERAEATRQLLRYINRLIVIKKKWRGEDLISRL
LSTGKLSDEELSGVLLLLLIAGHETTANNIGLGVVTLLSHREWIGDDRLVEELLRLHS
VADMVALRVAVDDVEIAGQTIRKGEGIVPLLASANHDTEAFGCPHAFNPERTERRHVA
FGYGVHQCLGQNLVRVEMEIAYRKLFERIPELRLAVPEDQLAYKYDGILFGLHELPVR
W

CYP105J1   Amycolatopsis mediterranei rifamycin 
           GenEMBL AF040570 CDS comp (67462..68673)
           52% to AF072709 105D4 50% to 105D1 new subfamily in 105

CYP105K1   Streptomyces tendae strain Tue901 
           GenEMBL Y18574 CDS 6325..7557
           45% to 105A3 46% to 105D1 43% to 105B1 new subfamily in 105
           gene="nikF"

CYP105K2   Streptomyces ansochromogenes
           GenEMBL AF469953  14..1246
           95% to 105K1
           note="involved in nikkomycin biosynthesis
MTEAFDHDIPSFPMARECPMHPPAEYRELRGQEPVSRVRMPDGQ
VAWLVLKHALARKLLADPRVSADRLHPAFPGRLTAEQRAATERVRRLTTRRSMIHLDG
DEHGAHRRILTGEFSLRRIAAQRPRVQEIVDRSIDEMLAAPQPADLVEHVSQAVPSLV
ICELLGVPHEQRRDFHEWAGMLVSRSVSIQERAAASDALNDFLEALVTEKERGEPADD
LIGRLIARNRQTPVMTHDEIVGTAVMLLVAGHQTTANMISLGVVALLENPEHKARIAA
DSSLLPPAIEEMLRYFSVVENAPARVATEDIAIGGVTIRKNEGIVVSGLAADWDDEVF
GHPDRLDFERGARHHVAFGYGVHQCLGQNLARVELEIVFETLLRRVPGLSLAVPAEEL
PYKDDAGIYGIYRVPVNC

CYP105L1    Streptomyces fradiae 
            GenEMBL AF055922 CDS comp (6507..7769)
              GenEMBL AF147703 complement(2565..3875)
            Fouces,R., Mellado,E., Diez,B. and Barredo,J.L.
            The left edge of the tylosin gene cluster from Streptomyces 
            fradiae
            Microbiology (1999) In press
            tylH1
            46% to 105A1 42% to 105D1 43% to 105B1 new subfamily in 105
MSSSGDARPSQKGILLPAARANDTDEAAGRRSIAWPVARTCPFS
PPEQYAALRAEEPIARAELWDGAPVWLISRQDHVRALLADPRVSIHPAKLPRLSPSDG
EAEASRSLLTLDPPDHGALRGHFIPEFGLRRVRDVRPSVEQIVTGLLDDLTARGDEAD
LLADFALPMATQVICRLLDIPYEDRDYFQERTEQATRPAAGEEALEALLELRDYLDRL
ISGKTGRESGDGMLGSMVAQARGGGLSHADVLDNAVLLLAAGHETTASMVTMSVLVLL
QHPTAWRELTVNPGLLPGAVDELLRYLSIADGLRRSATADIEIDGHTIRAGDGLVFLL
AAANRDEAVFSEPEAFDIHRSARRHVAFGYGPHQCLGQNLARMELEVALGAVLERLPA
LRPTTDVAGLRLKSDSAVFGVYELPVAW

CYP105L2   Micromonospora griseorubida
           GenEMBL AB089954 1490..2641
           gene cluster for the polyketide macrolide mycinamicin
           54% to 105L1
           gene = mycCI
MDRTCAWALPEQYAEFRQRATGWPAKVWDGSPTWLVSRYEHVRA
LLVDPRVTVDPTRQPRLSEADGDGDGFRSMLMLDPPEHTRLRRMFISAFSVRQVETMR
PEIEKIVDGILDRLLALEPPVDILTHLALPMSTQVICHLLGVPYEDREFFQERSELAS
RPNDDRSMPALIELVEYLDGLVRTKTAHPDTGLLGTAVTERLLKGEITHQELVNNAVL
LLAAGHETSANQVTLSVLTLLRHPETAAELREQPELMPNAVDELLRYHSIADGLRRAA
TADIVLGDHTIRAGDGLIILLSSANHDGNTFGAEATFDIHRPARHHVAFGYGPHQCLG
QNLARLEMEVTLGKLFRRVPALRLAQEPDALRVRQGSPIFGIDELLVEW

CYP105M1    Streptomyces clavuligerus clavulanic 
            GenEMBL AF200819 CDS 136..1359
            GenEMBL AY034175 CDS 200..1423
            GenEMBL U87786 CDS 13810..15036 
            function="involved in clavulanic acid biosynthesis"
            48% to 105B1 42% to 105A1 41% to 105D1 new subfamily in 105
MNEAAPQSDQVAPAYPMHRVCPVDPPPQLAGLRSQKAASRVTLW
DGSQVWLVTSHAGARAVLGDRRFTAVTSAPGFPMLTRTSQLVRANPESASFIRMDDPQ
HSRLRSMLTRDFLARRAEALRPAVRELLDEILGGLVKGERPVDLVAGLTIPVPSRVIT
LLFGAGDDRREFIEDRSAVLIDRGYTPEQVAKARDELDGYLRELVEERIENPGTDLIS
RLVIDQVRPGHLRVEEMVPMCRLLLVAGHGTTTSQASLSLLSLLTDPELAGRLTEDPA
LLPKAVEELLRFHSIVQNGLARAAVEDVQLDDVLIRAGEGVVLSLSAGNRDETVFPDP
DRVDVDRDARRHLAFGHGMHQCLGQWLARVELEEILAAVLRWMPGARLAVPFEELDFR
HEVSSYGLGALPVTW

CYP105N1    Streptomyces coelicolor 
            St4C2 [Full Sequence] Sanger cosmid 
            CDS 29986-31221 45% to 105A1 new subfamily in 105
            cloned and expressed by David Lamb and Steve Kelly

CYP105N2    Streptomyces glaucescens cytochrome P450
            GenEMBL AF071144
            95% to 105N1 only 5 aa diffs
            57% to AF071148 56% to AF071146 59% to 105D3 54% to 105A3 
LLIAGHETTTSMIALSTLLLLDRPELPAELRNDPDLMPAAVDEL
LRVLSVADSIPLRVAAEDIELSGRTVPADDGVIALLAGANHDPEQFDDPERVDFHRTD
NHHVAFGYGMHQCLGQNL

CYP107N3   Streptomyces peucetis
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           91% to 107N1
           clone name SP0881

CYP105P1   Streptomyces avermitilis
           GenEMBL AB070949.1 67376-68575
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV413_pteC low 40% range to 105 subfamilies 
           Gene = pteC

CYP105P2   Streptomyces peucetis
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           92% to 105P1
           clone name SP7863

CYP105Q1   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV1611 49% to 105B1 from Streptomyces griseolus 
           46% to 105D4 and D5

CYP105Q2   Streptomyces sp. 
           GenEMBL BD133549
           78% to CYP105Q1 
  3 LIAGHETTATMISLGTLALFEHPDQLAVLRDTEDPKVVAGAVDELLRYLSIVHSGLRRVA 182
183 KGDIEIDGRLIRKGDGLLFDLQTANWDPNAFPGAERLDLARPARQHNAFGFGTHQC 350

CYP105Q3   Streptomyces sp.
           GenEMBL BD133546 
           77% to 105Q1
 139 MADTLTDAAPDTDGRVPEYPMPRATGCPLAPSPAAAELRGDRPITRVRIWNGSTPWLITR 318
 319 HADQRTLLTDPRVSNDDHEPDFPHVNAHRAAIAPHTPKLITNTDAPEHTRLRRSVNAPFL 498
 499 VKRIEAMRPAVQKIVDDLIDDMLAGPSPADLLTALALPVPSLVIAELLGVPYEDHHFFQE 678
 679 NSNRVLDNSLTAEEAQESSRALGGYLDTLFRTKLEQPGEDVLSEMGSKVKAGEMTHQEAV 858
 859 SMGVAMLIAGHETTATMISLGTLALFEHPDQLAVLRDTEDPKVVAGAVDELLRYLSIVHS 1038
1039 GLRRVAKGDIEIDGRLIRKGDGLLFDLQTANWDPNAFPGAERLDLARPARQHNAFGYGPH 1218
1219 QCLGQNLARLELQVVYGTLYRRVLTLRPAVPVDQLAFNHTGTTYGVKCLPVTW 1377

CYP105R1   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV7186

CYP105S1    Streptomyces tubercidicus strain R-922
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Cyp230
            56% to CYP105S2

CYP105S2    Streptomyces tubercidicus strain I-1529
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Cyp234
            56% to CYP105S1

CYP105T1    Burkholderia fungorum
            GenEMBL NZ_AAAJ02000095 
            8366..9610 gene = Bcep2217
            44% to 105H1
MRKTMTSAINDVRPQTTSTFPFARTGSPLHPPAEYARYRDGQPV
TRVQMWDGRYAWIFTRMEDVKAVLSSPHFSVVPSKPGYPFLTPARAATVKSYQTFITM
DPPDHTRFRRMLTRDFTQKRMEELRPQIAAYVNRLIDEMLARGSPGDLVSALALKLPV
TVVSMLVGVPYEDHEDLVKWSGQRLDLEQNPTVSESAADNMLAYFDGLLQRKERDPGD
GADMLSRLVIEQIKPGHLSRLEAIHMVNLLYFAGHETTANQIALGTLSFLLDPRQRAL
LENNPGLLKNAIEEMLRFHTISHYNSCRVATADVEVGGTLIREGEGAYALIMAANRDP
AAFPAPDRFDIERPNSQEHVAFSYGLHMCLGQPLARLELQVCFEALFRRLPRLRLAVP
LEELPFKREMYVYGLHALPVTW

CYP105U1   Streptomyces hygroscopicus strain NRRL 3602
           AY179507 complement(63940..65133)
           Geldanamycin biosynthesis gene cluster
           50% to 105B1 52% to 105B2 not 105S
           gene = gdmP
MDEIRDYPESRAAACPFSPPLGYEELRERSAVTRVRMWDGSTPF
LVTGYHEARAALGDSRFSADGTHKAMPRFVKFEVPAEVFNLGRMDDPEHARIRRMLTA
NFTIRRTEAMRPMIQGIVDGLLDRLIAQGPPADLVADFAFPLPSQVIGVMLGVSDADF
AEFQQASQGVMDFTASAEEMGAALGVMVDYVARMCAAKRADPGDDLLSRLIVDQELTG
GLTQQQVVATALVLLLAGHETTANMIALSTVLLLSHPEQLARLRADAGLMGNAVDELL
RYITIVQEGTGRVATEDVEVGGVLIPGGEGVIINLPSANRDPHFADAHELDLSRPNAR
EHVAFGFGVHQCLGQTLARVELQIALETLLRRLPTLRLEVPFDDLAFLYESMNFGVAR
VPVAW

CYP105V1   Streptomyces sp. HK803
           GenEMBL AY354515 36297..37508
           Gene = plmT4 
           43% to CYP105Q1
MSQLSSELPAFPMSKAKGCPLDPPPEYAQLRSDRPVAKARLWDG
KEVWLITGYDEIRSIFTDPRISVDNTQPGYPWLSEQARTVVLTGGVKPVGRMDPPEHT
AMRRMLGQGFLVKKIQNMRGDVEALVNELIDDILAGPRPTDLVPSLAMPVPSTALGWV
LGVPPADKRLISLVPRLFDEDSGLEGAMEARAELFAYIDELITHRENQPGDDIISHLV
GYYQKGELSRVSVLTQSVTLIAAALDTTRSMITNGILALLQHPEQAAALIEDPDLVPA
AVEELLRYTVVTEFSSKRVAAADIEIAGETIKAGDGIICLISAGNRDEKVFTDPDTLD
VRRDAKQHLGFGAGIHTCIGKQLARMELEVVYGTLFRRIPELRLAVPFDQLVFRNTFD
VQGVRALPVTW

CYP105W1   Micromonospora echinospora
           GenEMBL AF497482 84045..85229
           Gene = calE10
           calicheamicin biosynthetic locus
           45% to CYP105K1 47% to 105D4
MPRRCPFGPPAEYARLRTERPVARLPMLGGNTAWVVSRYADVKR
VLSDPRMSADRRRAGFPRFAPTTESQRQASFANFRPPLNWMDPPEHTAARRQIVDEFA
ARRVRQLRPLVERVVDEHLDAMTAGRSSADLVPSFSYPVPSRVICEMLGVPYGEHAFF
ERRSTRMLSRGVPADERARCAREIREFLDGVVTDKERHPGDDVLSRLLAAQRAAGEPD
HEAVVSMAFVLLVAGHVTTSNMISLSVLALLTHPERLARLRAEPDRFPAAVEELLRYF
TIVEAATARTATADVTVGGVTIRAGEGVVALGQAANRDPAAFDRPDEFDPDRDARHHL
AFGYGRHICPGQHLARLELDVALSRLVRRLPGLRLTVDVDDLPLKEDGNIFGLHALPVAW

CYP105X1   Pseudonocardia autotrophica same as Amycolata autotrophica
           GenEMBL AF525299 2766..3974
           Gene = pauC
           P-450 gene cluster
           49% to 105A3
MAEDTLGQDFPMQRQCPFEPPKEYERLRAEQPISRVRMPDGTPA
WLVTLHEDVRTVLASPAFSSDLAHPGMPAVNPEIRTIARQQRPPFSRMDPPEHSFFRR
MLIPEFTVKRTKTLRAGIQSVVDGLIDDLLRKSPPVDLVDEFALPVPSLVICQLLGVP
YSRHEFFQQQARVILSRQSTREQVGAAFTALRAYLDTLVEEKLHTPGDDLTSRLATEH
LEPTGDVRRQDLVASCMLLLTAGHETTSHMISLGVTALLEHPDQLAALQNDLTLLPEA
VEELVRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDI
HRGNRRHACFGYGVHQCIGQHLARTELEVAFSTLFTRIPTLQIAAPSDELDYDHDGML
FGLHELPVTW

CYP105X2   Amycolata autotrophica same as Pseudonocardia autotrophica
           GenEMBL AF071148
           99% to 105X3 94% to 105X1 61% to 165B2
LLIAGHETTSHMISLGVTALLERPDQLAALQNDLTLLPEAVEEL
LRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDIHRGN
RRHVAFGYGVHQCLGQNL

CYP105X3   Micromonospora inyoensis 
           GenEMBL AF071146
           99% to 105X2 61% to 165B2 60% to 105A3
LLIAGHETTSHMISLGVTALLEHPDQLAALQNDLTLLPEAVEEL
LRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDIHRGN
RRHVAFGYGVHQCLGQNL

106 Family

CYP106A1    Bacillus megaterium 
            GenEMBL X16610
            Gene BM-1

CYP106A2    Bacillus megaterium
            GenEMBL Z21972 (4317bp) PIR S32216 (410 amino acids)
            PIR S39924 (410 amino acids) Swiss Q06069 (410 amino acids)
            Rauschenbach,R., Isernhagen,M., Noeske-Jungblut,C., Boidol,W.
            and Siewert,G.
            Cloning, sequencing and expression of the genes for cytochrome
            P450meg, the steroid-15beta-monooxygenase from Bacillus
            megaterium ATCC 13368.
            Molec. Gen. Genet. 241, 170-176 (1993)

CYP106B1    Bacillus anthracis str. Ames
            Genpept AAP26480                 
            47% to 106A2 47% to 109B1
  1 MASPENVILV HEISKLKTKE ELWNPYEWYQ FMRDNHPVHY DDEQDVWNVF LYDDVNRVLS
 61 DYSLFSSRRE RRQFAIPPLE TRININSTDP PEHRNVRSIV SKAFTPRSLE QWKPRIQSIA
121 NELVKDIENC SEVDIVEQFA APLPVTVISD LLGVPTTDRK KIKAWSDILF MPYSKEKFND
181 LDAEKGIALN EFKAYLLPIV QEKRYHLTDD IISDLIRAEY EGERLTDEEI VTFSLGLLAA
241 GNETTTNLII NSFYCFLVDS PATYKEVREK PKLISKAVEE VLRYRFPVTL ARRITEDTNI
301 FGPLMKKDQM VVAWVSAANL DEKKFSQASK FNIHRIGNEK HLTFGKGPHF CLGAPLARLE
361 AEIALTTFIN AFEKIALSPS FNIEQCILEN EQTLKFLPIR LKPQ

CYP106B2P  Bacillus cereus ATCC 14579
           GenPept AAP09572  GenEMBL AE017006
83% to 106B1 54% to CYP109B1 YjiB Z99110 Bacillus subtilis I -helix
1 MTSVITDGEI VTFSLGLLAA GNETTTNLII NSFYCFLVDS PGIYEELRKE PNLILKAIEE
61 VLRYRFPVTL TRRITALSER ESPSPLGMG

CYP106B3P  Bacillus cereus ATCC 14579
           GenPept AAP09575 GenEMBL AE017006
87% to 106B1 54% to 106A2 C-term fragment 
   LKEDTNIFGPF
 1 MKKNQMIVAW VSAANLDEKK FSQASQFNVH RTGNEKHLTF GKGPHFCLGA PLARLEAEIA
61 LTTFINAFEK IELFPSFCLE KCILENEQTL KYLPIRLKAT

107A Subfamily

CYP107A1    Saccharopolyspora erythraea
            GenEMBL X60379 Swiss Q00441 (406 amino acids)
            Haydock S.F., Dowson J.A., Dhillon N., Roberts G.A., Cortes J.,
            Leadlay P.F.
            Cloning and sequence analysis of genes involved in erythromycin 
            biosynthesis in Saccharopolyspora erythraea: sequence similarities 
            between eryG and a family of S-adenosylmethionine-dependent 
            methyltransferases.
            Mol. Gen. Genet. 230, 120-128 (1991).

            Weber J.M., Leung J.O., Swanson S.J., Idler K.B., Mcalpine J.B.
            An erythromycin derivative produced by targeted gene disruption in
            Saccharopolyspora erythraea.
            Science 252, 114-117 (1991)

CYP107A2   Streptomyces rochei plasmid pSLA2-L
           NC_004808 complement(44847..46067)
           64% to 107A1
           note="ORF26 (406 aa), lankamycin biosynthesis protein
           similar to M54983-1 Saccharopolyspora erythraea
           6-deoxyerythronolide B hydroxylase, EryF CYP107A1
MTTDAHTAVPSLDSDLFHIDQYEAYAALREREPVSKVSFIGREA
FLITRHAEAKAALGDLRLSNDFKKQPPGVELPTYHGIPEDVRPYFANNMGSNDPPAHT
RLRRLVSREFTARRVESMRTRVAQLAEHLLDGLAGERETDLVERFAYPLPITVISELL
GVEERYQGDFGRWSNEFLVIDADRVEQREHAARALVGFILELVDRRRADPGSDLLSAL
IHVHDEDEDRLSTDELASVVLILLIAGFETSVSLIAMATYLLLTHPGELAKVRADPSL
VPNAVDEVLRFLGPAEITTRGTLEPVEIGGVHIPAHSTVLIAGAAANRDPRRFPDPER
FDVTRDTGGHLSFGHGIHFCVGGPLARLEGEIALRALLNRFPGLDLAIPAEQVRWRRS
FLRGIESLPVRLGR

107B Subfamily

CYP107B1    Saccharopolyspora erythraea
            GenEMBL M83110 Swiss P33271 (405 amino acids) PIR B42606 (405 
            amino acids)
            Andersen J.F., Hutchinson C.R.
            Characterization of Saccharopolyspora erythraea cytochrome P-450 
            genes
            and enzymes, including 6-deoxyerythronolide B hydroxylase.
            J. Bacteriol. 174, 725-735 (1992)

CYP107B2   Streptomyces sp.
           GenEMBL BD133548 
           58% to 107B1
3   LIAGHETTTNFIGNAALALLRHPESLAHLRAEPQLLGGALDELLRYDSPVGIATFRFSTE 182
183 ALTLGGTEIPEGVPVLIAPGAANRDPDRFPDPDRLDLTRGATGHLAFGFGMHHC 344

107C Subfamily

CYP107C1    Streptomyces thermotolerans
            GenEMBL D30759 (3267bp complete sequence of CarA) 
            Arisawa,A., Kawamura,N., Takeda,K., Tsunekawa,N.,
            Okamura,K. and Okamoto,R.
            Cloning of a macrolide antibiotic biosynthesis gene acyA, which
            encodes 3-O-acyltransferase, from Streptomyces thermotolerans and 
            its use for direct fermentative production of a hybrid macrolide.
            Appl. Environ. Microbiol. 60, 2657-2660 (1994)

            Arisawa,A., Tsunekawa,N., Okamura,K. and Okamoto,R.
            Nucleotide sequence analysis of carbomycin biosynthetic genes
            including macrolide antibiotics 3-O-acyltransferase gene from
            Streptomyces thermotolerans.
            unpublished (1994)

CYP107C1    Streptomyces thermotolerans
            GenEMBL M80346 (2393bp C-terminal fragment of CarA)
            Schoner,B.E., Geistlich,M., Rosteck,P., Rao.R.N., Seno,E.,
            Reynolds,P., Cox,K., Burgett,S. and Hershberger,C.L. 
            Sequence similarity between macrolide resistance determinants and
            ATP binding transport proteins.
            Gene 115, 93-96 (1992)
            Note: P450 fragment called carX. is equivalent to C-terminal of CarA.

107D Subfamily

CYP107D1    Streptomyces antibioticus
            GenEMBL L37200 (1400bp)
            Rodriguez,A.M., Olano,C., Mendez,C., Hutchinson,C.R. and 
            Salas,J.A.
            A cytochrome P450-like gene possibly involved in oleandomycin 
            biosynthesis by Streptomycese antibioticus.
            unpublished (1994)

107E Subfamily

CYP107E1    Micromosospora griseorubida
            GenEMBL D16098 (2168bp)
            Inouye,M., Takada,Y., Muto,N., Horinouchi,S. and Beppu,T.
            Cloning and nucleotide sequences of a gene governing mycinamicinIV
            hydroxylation.
            unpublished (1993)

107F Subfamily

CYP107F1    Streptomyces griseus
            GenEMBL D45916 (2787bp) AB018074 CDS 341-1561
            Ueda,K. and Horinouchi,S.
            Cloning and Nucleotide Sequence of a Gene Involved in Redbrown
            Pigment Biosynthesis in S. griseus
            Unpublished (1995)

CYP107F2   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV1171 55% to 107F1 
           this subfamily is on the outskirts of CYP107

107G Subfamily

CYP107G1   Streptomyces hygroscopicus
           GenEMBL X86780 (107379bp)
           complement (91764-92978)
           rapN

107H Subfamily

CYP107H1  Bacillus subtilis
          GenEMBL U51868 (10153bp) Z99119, AF008220
          coding region 7164-8351
          pimelic acid biosynthesis
          gene name bioI

107J Subfamily

CYP107J1  Bacillus subtilis
          GenEMBL Y11043 U93876, Z99117
          Belitsky, B. R., M. C. Gustafsson, A. L. Sonenshein, and C. Von
          Wachenfeldt. 
          An lrp-like gene of Bacillus subtilis involved in
          branched-chain amino acid transport. J Bacteriol. 179, 5448-57 
          (1997).
          gene name cypA 42.6% identical to 107B1
          also called yrdE

CYP107J2  Bacillus anthracis str. Ames 
           GenPept AAP26475
           58% to 107J1 cypA of Bacillus subtilis
  1 MAMKNKVGIR IEDGINLASA QFKEDAYEIY KESRKVQPVL FVNKTELGAE WLITRYEDAL
 61 PLLKDNRLKK DPANVFSQDT LNVFLTVDNS DYLTTHMLNS DPPNHNRLRS LVQKVFTPKM
121 IAQLEGRIQD IADDLLNEVE RKGSLNLVDD YSFPLPIIVI SEMLGIPKED QAKFRIWSHA
181 VIAYPETPEE IKETEKQLSE FITYLQYLVD MKRKEPKEDL VSALILAESE GHKLSARELY
241 SMIMLLIVAG HETTVNLITN TVLALLENPN QLQLLKENPK LIDAAIEEGL RYYSPVEVTT
301 SRWADEPFQI HDQTIEKGDM VVIALAAANR DETVFENPEV FDITRENNRH IAFGHGSHFC
361 LGAPLARLEA KIAITTLFER MPELQIKGNR EDIKWQGNYL MRSLEELPLT F

CYP107J3   Bacillus cereus ATCC 14579
           GenPept AAP09568            
           59% to 107J1 cypA Y11043 Bacillus subtilis
  1 MKNKVGLSIE DGINLASAQF KEDAYEIYKE SRKKQPILFV NQVEIGKEWL ITRYEDALPL
 61 LKDNRLKKDW TNVFSQDIKN MYLSVDNSDH LTTHMLNSDP PNHSRLRSLV QKAFTPKMIA
121 QLDGRIQRIA DDLISDIERK GTLNLVDDYS FPLPIIVISE MLGIPKEDQA KFRIWSHAVI
181 ASPETPEEIK ETEKQLSEFI TYLQYLVDIK RKEPKEDLVS ALILAESEGH KLSARELYSM
241 IMLLIVAGHE TTVNLITNTV LALLENPNQL QLLKDNPKLI DSAIEEGLRY YSPVEVTTAR
301 WAAEPFQIHH QTIQKGDMVI IALASANRDE TVFENPEIFD ITRENNRHIA FGHGSHFCLG
361 APLARLEAKI AITTLFNRMP ELQIKGNREE IKWQGNYLMR SLEELPLTF

CYP107J4P  Bacillus cereus ATCC 14579
           GenPept AAP09593                 
           46% to CYP107J3 in same genomic region
           47% to CYP107Y1 SAV2377 AP005030 Streptomyces avermitilis
           50% to 107H1
  1 MKEPQLQQHL EKFIQYIEAL VNEKRLNPDA DLISELVQTK EQEDKLSNNE LLSTIWLLII
 61 AGHETTVNLI SNGLLALLQH PEQMNLIREN PSLIPSAVDE LLRHSGPVMF ISRLASEDMT
121 IHGKRIPKGD LVLLSLTAAN IDPQKFTYPE TLNISREENN HLAFGAGIHH CLGAPLARLE
181 GQIALGTLLQ RLPNLRLAIK PDQLNYNHSK IRSLVNLPVV F

CYP107K1   Bacillus subtilis
           GenEMBL AL009126 Z99113 comp(76702-77832)
           polyketide hydroxylase pksS
           just over 41% identical to CYP107J1

CYP107L1   Streptomyces venezuelae 
           GenEMBL AF087022
           GenEMBL AF079139 CDS 122..1372
           pikC gene
           function="catalyzes the hydroxylation of YC-17 into
           methymycin and neomethymycin and narbomycin into
           pikromycin"
           51% to 107B1 47% to 107A1 44% to AF254925 42% to 107J1 
           41% to AL049754 new CYP107 subfamily

CYP107L2   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV1987 60% to 107L1 from Streptomyces venezuelae

CYP107L3    Streptomyces tubercidicus strain I-1529
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name CypLA
            60% to CYP107L1 91% to 107L4

CYP107L4    Streptomyces tubercidicus strain R-922
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name CypLC
            61% to CYP107L1 91% to 107L3

CYP107L5   Streptomyces sp.
           GenEMBL BD133547 
           68% to 107L2 
3 LIAGHETTVNLITGAVHALLTHPGQLAQVRGDMSLVDAVVEETLRHEGPVENATFRFAAE 182
183 PLEIGGTVIPAGDPVLIGLAAADRDGARYPGPDRFDIHRDTRGHLAFGFGTHRC 344

CYP107L6   Streptomyces sp.
           GenEMBL BD133544 
           72% to 107L2
MGHEHVIDLGEYGPGFTENPHPVYAELRARGPVHRVRLPKHDAHHEAWLVVGYEEARAAL
ADPRLSKDGSTIGVTFLDEELIGKYLLIADPPQHTRLRGLIAREFTGRRVERLRPRVQEI
TDSLLDEMLPRGRADLVESFAYPLPLTVICELLGVPEIDRAAFRKLSTEAVAPTSGESEY
AAFVQLAAYLEELVEEKRCAPPADDLLSALIRTTDEDGDRLSPAELRGMAFILLIAGHET
TVNLITGAVHALLTHPGQLAQVRGDMSLVDAVVEETLRHEGPVENATFRFAAEPLEIGGT
VIPAGDPVLIGLAAADRDGARYPGPDRFDIHRDTRGHLAFGHGIHFCLGAPLARLEARVA
LRALLERCPGLTPDGAPGEWLPGMLIRGVRSLPVRW*

CYP107L7P   Streptomyces narbonensis
            GenEMBL AF521878  13901..14661
            desosamine biosynthetic gene cluster
            91% to 107L1
            gene= nbmL
            note= frameshift and deleltion generates premature 
            stop codon and truncated protein"
MSRTHQGTTASRPVLDLAALGQDFAADPYPTYARLRAEGPAHRV
RTPEGDEVWLVVGYDTARAVLADPRFSKDWRNSATPPTEAEAALSHNMLESDPRCGPT
(deletion)
ALRADLTLLDGAVEEMLRYGGPVESATYRFPVEPVDLDGTVLPAGETVLVVLAD
AHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCTGAPLARMEARIAVRALLERCPDLALD
VSPGELFWYPNPMIRGLESLPIRWRSGREAGRRVPVEPACRP*

CYP107L8   Streptomyces sp. HK803
           GenEMBL AY354515 complement(72672..73871)
           Gene = plmS2
           56% to CYP107L6
MVTVDLSAYGPGFFTDPYPYYARLREAGPVHEIVLADGDRFWLI
VGYDEARAALADPRLAKSLDPPSEDERHVLITDPPDHTRLRRLVSREFTARRVEAMRP
RVQEITDGLLDEMVAGRRRADLVPSLGSPLPITVLCELLGVPLADREDFRGWTERVLV
PAEPDTIAWWKSRGFAQAGMALTDYLKNMIEDKRRSTPTGDLISSLLRTTAEDNDRLS
AAELHSMVFILIVAGHETTANLITNGVRALLAHPEQLAALRTDPEGLIDQAVEEMLRY
DGPVETSTKRFTLEAVRYGATKIPPGETLLVSIAATGRDPAQFERPDTFDIHRGTTGT
RSGHVAFGHGIHFCLGAGLARMESRVAILTLLRRCPDLALDIDPAGLDWLPGIRVRGV
RSLPVRW

CYP107L9   Streptomyces peucetis
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           62% to 107L6 before frameshift at C-term
           clone name SP0854

CYP107M1   Actinomadura hibisca 
           GenEMBL D87924CDS complement(6299..7534)
           45% to AF127374 CDS 3226..4458 44% to AF254925 
           45% to 107D1 44% to 107G1, 107E1 new subfamily in 107

CYP107N1   Streptomyces lavendulae 
           GenEMBL AF127374 CDS 3226..4458
           50% to 107D1 52% to AF254925 47% to 107E1 new subfamily in 107

CYP107P1   Streptomyces coelicolor cosmid H10 
           GenEMBL AL049754 CDS complement(10413..11648)
           41% to AF087022 40% to 107B1 40% tp 107G1 
           40% to 107D1 new subfamily in 107
           cloned and expressed by David Lamb and Steve Kelly

CYP107P2   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV4539 86% to 107P1 from Streptomyces coelicolor

CYP107P3   Streptomyces peucetis
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           78% to 107P2 missing 156 aa at N-term
           C-term may be frameshifted
           clone name SP0887

CYP107Q1   Amycolatopsis mediterranei 
           GeEMBL AF040571 CDS complement(781..>2316)
           66% to AF040570 comp(68704..69969) 43% to 107C1 
           41% to 107B1 40% to 107A1 new subfamily in 107

CYP107Q2   Amycolatopsis mediterranei 
           GenEMBL AF040570 CDS comp (68704..69969)
           66% to AF040571 complement(781..>2316) new subfamily in 107

CYP107R1   Streptomyces maritimus 
           GenEMBL AF254925 CDS comp (18384..19589) 
           gene="encR" 
           53% to AF127374 CDS 3226..4458 49% to 107E1 new subfamily in 107
MTTHTQQLRDFPFAPPAELHMEPAFAQLREEEPISRVRLPYGGE
AWLVTRYQDIKTVLGDPRFSRAATQHAQAPRIQPDPAGEGVLMSLDPPDHTRLRKTVA
GVFTKRRVEDLRPATQRIAEELLEAMEASGAPADLVASYALPLPVTVICDLLGVPGDD
REQLRGWSDALLSTTACTPAESAAAAQAMADHFAALVSQRRRQPTDDLLGALVQTWDR
EEGLLRDEELVLLTRDLLIAGHETTASQIANCTYLLLQRPHDMDRLRTDPSAMASAVE
ELLRFIPLGSGSFRARVATEPVELCGVRIQPGDTVFAPTVAANWDPDVFAEPGRLDID
RSPNPHVAFGHGVHHCLGAQLARLELQVALGVLLRRLPRLRLAVDEAEIVWKTGMQVR
GPKTLPVKW

CYP107S1   Pseudomonas aeruginosa
           NZ_AABQ07000001
           NC_002516 3741011..3742267
           locus_tag = PA3331
           47% to 107B1

CYP107T1   Streptomyces coelicolor  
           StH63 [Full Sequence] Sanger cosmid 
           51% to CYP107L1 CDS 16028-17233
           cloned and expressed by David Lamb and Steve Kelly

CYP107U1   Streptomyces coelicolor 
           StE41 [Full Sequence] Sanger cosmid 
           comp(7438-8739) 44% to CYP107B1 
           cloned and expressed by David Lamb and Steve Kelly

CYP107U2   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV3536 85% to 107U1 from Streptomyces coelicolor

CYP107U3   Streptomyces peucetis
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           84% to 107U1 missing 90 aa at N-term
           clone name SP0819

CYP107V1   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV3519 low 40% range with some 107 subfamilies

CYP107W1   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV2894_olmB low 40% to 107 subfamilies

CYP107X1   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV6249 49% to 107L1 from Streptomyces venezuelae

CYP107Y1   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV2377 50% to 107L1 from Streptomyces venezuelae

CYP107Z1    Streptomyces rimosus ssp. paromyceticus strain R-2374 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema11
            96% to CYP107Z2v1

CYP107Z2v1  Streptomyces albofaciens strain C-0083
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema8
            96% to 107Z2v2 and CYP107Z1

CYP107Z2v2  Streptomyces rimosus ssp. paromyceticus strain BOEH-4355
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema3
            96% to CYP107Z2v1 95% to CYP107Z1

CYP107Z3    Streptomyces sp. strain IHS-0435
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema7
            76% to 107Z12

CYP107Z4    Streptomyces lydicus strain NRAB-0114 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema16
            82% to 107Z12

CYP107Z5V1  Streptomyces lydicus strain NRRL-2433 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema15
            97% to 107Z5v3

CYP107Z5v2  Streptomyces chattanoogensis DSM-40241 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema6
            1 aa diff to CYP107Z5v3

CYP107Z5v3  Streptomyces lydicus strain R-401
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema4
            100% to S. kasugaensis strain A/96

CYP107Z5v3  Streptomyces kasugaensis strain A/96
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema10
            100% to S. lydicus strain R-401

CYP107Z6    Streptomyces sp. strain I-1525 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema5
            85% to CYP107Z8

CYP107Z7    Streptomyces tubercidicus strain DSM-40261 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema17
            90% to CYP107Z8

CYP107Z8    Streptomyces platensis strain Tu-3077 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema13
            89% to CYP107Z9

CYP107Z9    Streptomyces tubercidicus strain NRAA-7027 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema12
            89% to CYP107Z8

CYP107Z10   Streptomyces tubercidicus strain I-1529 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema2
            90% to CYP107Z11

CYP107Z10   Streptomyces platensis strain I-1548
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema14
            100% to S. tubercidicus strain I-1529

CYP107Z11   Streptomyces platensis strain NRAA-7479 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema9
            92% to 107Z12

CYP107Z12   Streptomyces tubercidicus strain R-922 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema1
            92% to CYP107Z11

CYP107AA1   Bradyrhizobium japonicum USDA 110
            GenPept BAC51802                 
            NC_004463 complete genome complement(7193424..7194725)
            41% to 133B1v1 45% to 107L1
  1 MVTPGSGAAI GVFVSCGNRF EVTMNEQAQP AGGDPLFNPL SPDFIRNPYP HYDRLRAIDP
 61 IHVTPFGQFV ASRHADVSLV MRDKRFGKDF VERSKRRYSE KIMDEPVFRS MSHWMLQADP
121 PDHTRLRGLV VKAFTARRVE DMRPRIQEIV DEAIDAVIDR GHMDLIEDFA FRLPVTIICD
181 MLGIPEDHRE VFYKSSRDGG RLLDPVPLTP EEIAKGNAGN MMAQMYFQQL FELRRRNPAD
241 DLTTQLVQAE EDGNKLTNEE LTANIILLFG AGHETTVNLI GNGLLALHRN PDQLALLKAR
301 PELMVNAIEE FLRYDSSVQM TGRVTLEDID DLGGRKIPKG ETVLCLLGSA NRDPAVYPDR
361 PDRLDVTRPN VKPLSFGGGI HFCLGAQLAR IEAEIAIATL LRRLPDLRID DVENPEWRPT
421 FVLRGLKSLP ASW

CYP107AB1   Streptomyces rochei plasmid pSLA2-L
            NC_004808 Links 87725..88939
            49% to 107A1 
            note="ORF37 lankamycin biosynthesis protein
MNQPQLPEIPALNSELFHTDQYATYREILEQRPVTRVRFYDGSL
VWLVNRHEDVRAALTDPRLSNDPMKQSDIDLSAATGIPADLIEYFQRNMFRSDEPDHG
RLRKLVTREFTVRRINALRPRIRQIADDLLEKFAATGGGDLVEALARPLPLTVMCELL
GVPEEDRADFQTWSQHIVESSPEFAERNAVSYRSLFECVRSLIRRRRDEPGDDLLSAL
VDLRDVADRLSENELISTVFLLLVAGIETTVNVLGTGTFLLLTHPGELARLRADGALL
GPAVEEMLRYMAPIEITSRHTLEPVEIGGVSIDAQSTVLINLAAANRDPARFEDPQSF
RVDRNDGGHLTFGHGIHYCLGAALARAEAEVTFEALLERFPDLRLAASASDLTWRHAF
MRGPVELPVSWG

CYP107AC1   Streptomyces atroolivaceus
            GenEMBL AF484556 60948..62147
            leinamycin biosynthetic gene cluster
            48% to 107N1
            gene = LnmA
MSATRRVHIYPFEGEVDGLEIHPKFAELRETDPLARVRLPYGGE
GWMVTRYDDVRAANSDPRFSRAQIGEDTPRTTPLARRSDTILSLDPPEHTRLRRLLSK
AFTARRMGAMQSWLEELFAGLLDGVERTGHPADIVRDLAQPFTIAVICRLLGVPYEDR
GRFQHWSEVIMSTTAYSKEEAVSADASIRAYLADLVSARRAAPHDDLLGVLVSARDDD
DRLTEDELITFGVTLLVAGHETSAHQLGNMVYALLTHEDQLSLLREQPELLPRAVEEL
LRFVPLGNGVGNARIALEDVELSGGTVRAGEGVVAAAVNANRDPRAFDDPDRLDITRE
KNPHLAFGHGAHYCLGAQLARMELRVAIGGLLERFPGLRLAVPADQVEWKTGGLFRGP
QRLPIAW

CYP107AD1   Streptomyces hygroscopicus
            GenEMBL AF521896 4248..5489
            ansamycin biosynthesis gene cluster
            43% to 107X1
            gene = gdnH
MSGRHFEQGERGTAMADTPEEELRILDPQSVAQELRKHGPPRQI
TMHGTTAWLVSRYEEVRDCLGHPGMSPAAAYAASQGQTNPVSGLFEDTVAGTNPPQHT
RLRRLLAKAFTVRRVESLRPRVQEITDTLLDRIAVDGRADLVSALAIPLPMQVICELL
GVPIADRTEFHQWADLMLTPPLDPDTAARSQDASAKLWTYMEDLAEARRKAPEDDLIS
DLMSAHEDDRLSHREVVATARMMLIAGYELTGSFISNAVFSLLSQPDQMELLRKDPEL
AGRGLEELLRHAGPGILIVRFANEDVEIGSVSIRAGDQVLLDMDAAHSDPAHFTDGER
LDLTRDSAVHLQFGHGIHYCIGAPLARVEGQIALESLVRRFPGLRLSVPAAEISHSKN
PFIRSLTALPVEFEAQQPVAG

CYP107AE1   Streptomyces sp.
            GenEMBL BD133545 
            50% to 107X1
VILLKSLAANGLTASSCFTVSPLPIRSASPSIAFLTSSSERDSGVRNDRPSDAQPAIARF
RFPTPPHPRNPTQPHPTPPRPSPTDDPLQAPTFFADPYPTYARLRDTAPVLKVPTGSGGG
GRHSYVVTGYAEAREAFTDPRLSKDTASFFAGRPSQRDLHPAVSRNMLATDPPQHARLRA
LVTKAFTTGAVARLRPYISSLVDELLDTWPTHGTVDLIADLAVPLPVTVICELLGVPDSD
RASVRTWSSDLFAAGDPQRIDAASHAVGDYMTALVAAKRTAPGDSLLDDLIAVRDGQDHL
SEDELVSLAVLLLVAGHETTTNFIGNAALALLRHPESLAHLRAEPQLLGGALDELLRYDS
PVGIATFRFSTEALTLGGTEIPEGVPVLIAPGAANRDPDRFPDPDRLDLTRGATGHLAFG
HGIHRCLGAPLARAEAELALHAVITRYPQAALATPPETLPWRHTRLTRGLASLPITLRDH
PK*

CYP107AF1   Streptomyces collinus DSM2012
            GenEMBL AF293355 24259..25518
            Gene = rubU
            rubrinomycin gene cluster
            52% to 107B1
MARTDAPQAAPPADLFTPAFHQNPHEALAGLRRTAPAVPVMTPN
GLRTWLVTGHEHARALLADPRLSKDMRVGRDLIPRNFVDPDKQREFLAESGERSQFPH
VLSVHMLDSDPPDHTRLRRLVGRAFTARRVESLRPRITELTDELLDAMARHERLDLME
ALAFPVPFTVICWLLGVPPDDRAAFRRWSNLLVSGAGTDEVREASASMITYLTELIEA
KRNEPADDMLTDLVHARDAGDQLSSDELISMAFLLLVAGHETTVNLIGNGALALLTHP
EVREQLAADESLWPGAVEEFLRYDGPVTNATWRFTTEPVEVGSVTIPEGEFVTISIGA
AGRDPDRYPDPDRLDITRAHSGSVAFGHGIHHCLGAPLARLEGRIVLSRLFARLPGLR
LAADPDELSWRSSLMMRGLEELPVFTA

CYP107AG1   Streptomyces atroolivaceus
            GenEMBL AF484556 complement(120436..121638)
            Gene = LnmZ
            leinamycin biosynthetic gene cluster
            49% to 107E1
MSTEVETEKPAPVAYPFTGSEGLELSQSYAKLFEDGDPIRVQLP
FGEPAWLVTRYDDARFVLTDRRFSRHLATQRDEPRMTPRAVPESILTMDPPDHTRLRT
LVSKAFTPRRIESKRAWIGELAAGLVADMKAGGAPAELVGSYALAIPVTVICELLGVP
EDDRTRLRGWCDAALSTGELTDEECVQSFMDLQKYFEDLVKERRAEPRDDLTSALIEA
RDAHDRLAEPELIGLCISILIGGFETTASEISSFVHVLQQRRELWTRLCADPEAIPAA
VEELLRFVPFAANGISPRYALEDMTVGGVLVREGEPVIVDTSAVNRDGLVFDNADEVV
IDRADNRHMVFGHGAHHCLGAHLARVELQEALKALVEGMPGLRLSGDVEWKADMIIRA
PRVMHVEW

CYP107AH1  Streptomyces peucetis
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           50% to 107L6 missing about 42 aa at N-term
           clone name SP0749

CYP107AJ1  Streptomyces peucetis
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           52% to 107B1 frameshifted C-term
           clone name SP0908

108 Family

CYP108A1    Pseudomonas spp.
            Swiss P33006 (428 amino acids) PIR S27653 A42971 (428 amino acids)
            Also found a PIR cross-reference to EMBL S39894 but could not 
            retrieve it
            Peterson J.A., Lu J.-Y., Geisselsoder J., Graham-Lorence S.,
            Carmona C., Witney F., Lorence M.C.
            Cytochrome P-450 terp: Isolation and purification of the protein 
            and sequencing of its operon.
            J. Biol. Chem. 267, 14193-14203 (1992)

CYP108A1    Pseudomonas spp.
            GenEMBL M91440 (6620bp)
            Hasemann,C.A., Ravichandran,K.G., Peterson,J.A. and
            Deisenhofer,J.
            Crystal structure and refinement of cytochrome P450terp at 2.3A
            resolution.
            J. Molec. Biol. 236, 1169-1185 (1994)

CYP108B1    Caulobacter crescentus CB15
            GenEMBL AE005918 GenPept AAK24465
            NC_002696 complete genome 2703947..2705221
            Complete genome sequence of Caulobacter crescentus
            Proc. Natl. Acad. Sci. U.S.A. 98 (7), 4136-4141 (2001)
            47% to CYP108A1
  1 MTISTDIANT IIDPKAYADG DRIDQAFAHL RREAPLAVAQ PDGFDPFWVV TRHADILEVE
 61 RQNELFHNGD RATVVTTIEP DKKVREMMGG SPHLVRSLVQ MDNPDHFAYR KITQGALLPQ
121 NLRALEARIR EIARGFVDRM AEHGDRCDFA RDVAFLYPLH VIMEVLGVPE SDEPRMLKLT
181 QELFGNADPD LNRTGKSVTD VGEGVDSIQS VVMDFMMYFN AITEDRRANP RDDLATLIAN
241 GKINGEPMGH LEAMSYYIIA ATAGHDTTSS TTAGALWALA ENPDQFAKVK ADPSLIPGLI
301 EESIRWVTPV KHFMRTATAD AELGGQKIAK GDWIMLSYPS GNRDEAVFED PFTFRVDRTP
361 NKHVAFGYGA HICLGQHLAR MEMRVLWEEL FARLDHVELD GAPTRMVANF VCGPKSVPIR
421 FKMH

CYP108C1    Saccharopolyspora spinosa strain NRRL 18395
            No accession number
            Istvan Molnar
            Syngenta Biotechnology, Inc.
            47% to CYP108B1 43% to CYP108A1

CYP108D1    Novosphingobium aromaticivorans
            GenEMBL NZ_AAAV01000137
            16805..18166 gene = Saro1710
            47% to 108B1 39% to 108C1 
MTNTSRLTKRRRPRRSDGKREGFMDSIPMVPAEVGRAVIDPKSY
GTWEPLLDRFDALRAEAPVAKVVAPDDEHEPFWLVSSFDGVMKASKDNATFLNNPKST
VFTLRVGEMMAKAITGGSPHLVESLVQMDAPKHPKLRRLTQDWFMPKNLARLDGEIRK
IANEAIDRMLGAGEEGDFMALVAAPYPLHVVMQILGVPPEDEPKMLFLTQQMFGGQDE
DMNKSGLKDLPPEQISQIVAGAVAEFERYFAGLAAERRRNPTDDVATVIANAVVDGEP
MSDRDTAGYYIITASAGHDTTSASSAGAALALARDPDLFARVKADRNLLPGIVEEAIR
WTTPVQHFMRTAATDTELCGQKIAAGDWLMLNYVAANHDPAQFPEPRKFDPTRPANRH
LAFGAGSHQCLGLHLARLEMRVLLDVLLDRVDSLELAGEPKRVNSTFVGGFKSLPMRW
KAA

CYP108E1    Ralstonia metallidurans
            GenEMBL NZ_AAAI01000348
            46192..47481 gene = Reut4024
            41% to 108B1 39% to 108A1 48% to 108C1 
MTIASDFDTELASHEIYSDPERMHEMFETLRREDPVHWTTAPGH
PPFWAVTKQADVIEVGKHPDVFIASPKSFLMNDVEQRVRIEETAATGGKLVRTMIHMD
DPDHKKYRGLTQSYFMPANIKRLESVIQERARALVGRLIEKGTSEFCSEIAVWYPLQI
VMTLLDVPESEHPYLLKLTQQFLAPKDPTLRRDGPDERGKGAVAKEYFAYFGKMLAER
RAAPLKEDLGSLIAHATVDGEPLPLMEAVSYYVILATAGHDTTSSSMCSGLYYLLTQP
GELDRLRARPELMPSAIEEMFRHGSPVKHFVRTATRDFELRGKKIQAGDEVALMYHSA
SFDEEVFDEPRSFRIDRGPNKHVAFGFGIHACLGQNLARASMRTFFTELLARTESIEV
VGKAEFIASNQVGGMKTLNIRVTPSKQSTTDRIEVAA

109 Family

CYP109A1    Bacillus subtilis
            GenEMBL M24523 (3187bp)
            Lewis,P.J. and Wake,R.G.
            DNA and protein sequence conservation at the replication terminus
            in Bacillus subtilis 168 and W23
            J. Bacteriol. 171, 1402-1408 (1989)

            Ahn,K. and Wake,R.G.
            A unique open reading frame adjacent to the replication terminus 
            of the Bacillus subtilis W23 chromosome compared with Bacillus
            subtilis 168
            unpublished (1990)

            Ahn,K.S. and Wake,R.G.
            Variations and coding features of the sequence spanning the
            replication terminus of Bacillus subtilis 168 and W23 chromosomes
            Gene 98, 107-112 (1991)

CYP109B1    Bacillus subtilis
            GenEMBL AF015825 Z99110  
            YjiB
            also similar to CYP106A, both 106 and 109 are close 
            together on a tree

110 Family

CYP110A1    Anabaena sp. (a cyanobacterium)
            Swiss P29980 (354 amino acids) GenEMBL M38044 (5933bp)
            GenEMBL U38537, M13161
            Lammers,P.J., McLaughlin,S., Papin,S., Trujillo-Provencio,C. and
            Ryncarz,A.J.II.
            Developmental rearrangement of cyanobacterial nif genes: 
            Nucleotide sequence, open reading frames, and cytochrome p-450 
            homology of the Anabaena sp. strain PCC 7120 nifD element
            J. Bacteriol. 172, 6981-6990 (1990)
            This sequence was later revised to give a complete P450 sequence 
            of 448 amino acids.

CYP110A1    Nostoc sp. PCC 7120 same as Anabaena sp. PCC 7120
            GenPept BAB73407, C37842 (this entry missing N-term)
            NC_003272 complete genome 1708114..1709493
            1 aa diff to M38044
  1 MLTQLPNPIS VPSWWQLINW IADPIGFQKK YSKKYGNIFS MQLAGIGSFV ILGEPQALQE
 61 IFTQDSRFDV GRGNTLAEPL IGRTSLMLMD GDRHRRERKL LMPPFHGERL QAYAQQICLI
121 TNQIASEWQI GQPFVARSAM QKLSLEVIIQ IVFGLADGER YQQIKPLFTD WLNMTDSPLR
181 SSMLFLKSLQ KDWGTWTPWG QMKHKQRSIY DLLQAEIEEK RTKENEQRGD VLSLMMAARD
241 ENGQAMTDEE LKDELLTILF AGHETTATTI AWAFYQILKN VNVQEKLQQE LDRLGANPNP
301 MEIAQLPYLT AVSQETLRMY PVLPTLFPRI TKSSINIAGY QLEPDTTLMA SIYLIHYRED
361 LYPNPQQFRP ERFIERQYSP SEYIPFGGGS RRCLGYALAL LEIKLVIATV LSNYQLALAE
421 DKPVNVQRRG FTLAPDGGVR VIMTGKKSLK FEQSSKIFN

CYP110A2    Anabaena variabilis (a cyanobacterium)
            GenEMBL U38478 (1743bp)
            Lammers, P.J. and Duran, S.
            possible alkane/fatty acid hydroxylase

CYP110B1   Nostoc sp. PCC 7120 Same as Anabaena
           GenPept BAB75445, AC2274 
           NC_003272 complete genome complement(4523158..4524546)
           45% to CYP110A2 53% to 110E1 49% to 110D1 47% to 110C1
  1 MHLPKGPQTP VFVQVLRWVF SPMSFLEDCA KRYGDIFSVK LAKDVPAIVF LSNPKDIQQI
 61 LTNDNNQLDS PGDWNDLFEP LLGKRSVITL SGAEHQRQRQ LLMPPFHGER MRGYSQVITD
121 VTEKVISQHQ IGQPFQVRSV TQAITLRVIM QAVFGLYEGS RAEKLQHLLS DLLEKSSSPF
181 SVALLYFPSL RRDFGPIKFW GEQVQIQQQA DELIYQEIQE RRENPDPSRT DILSLLMDAR
241 DADGQPMTDV ELRDELMTLL VAGHETTATA LAWAMYWIHK LPPVKARLLE ELDSLGDNPD
301 STTIFKLPYL NAVYSETLRI YPVAMLTFAR RVIETMALGG YELPPGTPVL GSIYLTHHRE
361 DLYPEPKKFK PERFLERQFS PYEYLPFGGG TRRCLGLAFA QWEMKLALAK ILTSYELELV
421 NNSVEVRPKR RGLVTGPHRP IEMVIKSQRQ ITSRILETTT VS

CYP110B2   Nostoc punctiforme
           NZ_AAAY02000005 GenPept ZP_00111619.1
           complement(58895..60277) gene = Npun6097
           75% TO 110B1
MKLPKGPQSPAVLQMLRWITSPMSFMETCAKRYGDMFTIRLDSK
SPPLIFVSKPEVLEQILTNDIKGLEAPGDTNLVFESLLGKHSVITISGAEHQRQRQLL
LPPFHGERMRSYSQIISDITEKVISQYQIGQPFNIRSVTQAITLRVIMQAVFGLDEGP
RAEKLQHCLAEMLEKGSSVLSAALLYFPALQRDFGPINFWGKQMRRQQAADKLIYEEI
RERQEQPDPSRTDILSLLMAARDEAGQPMTDEKLRDELMTLLVAGHETTATALAWAFY
WIQKIPTVRQKLLKELDSLGDNPDPSTIFKLPYLNAVCSETLRIYPVAMLTFARVVRT
PLSLGGYELEPGIGVIGSIYLTHHREDLYPEPKQFKPERFLERQFSPYEYLPFGGGAR
RCIGLAFAQLEMKLALAKILSTRELELVDNSEVRPKRRGLVTGQDRPIQMVVTSQRQV
KFPILQTATV

CYP110C1   Nostoc sp. PCC 7120 Same as Anabaena
           GenPept BAB76385, AF2391         
           NC_003272 complete genome 5587079..5588485
           48% to CYP110A2 49% to 110E1 47% to 110B1
  1 MKYQIQRPNP LKTHPFLQKL QWIADPVEYM KKASLQHPDM FTAEVIGFGD TVVFVSHPQG
 61 IQTLFANDRK KLVAVGEANR ILYPLVGNNS MFLLEGVKHK QRRQLLMPSF HGERMREYGH
121 LIRNITENLF SQLQQDVTFS ALTAMREISM QVILQAVFGF YEGERCQQFK HLLPIFLSEL
181 FQSPLASSIL FFPSLQKDLG NLTPWGRFVR QREKIDKLLY AEIAERRQEI NSDRIDILSL
241 LISARDETGD SMSDKELRDE LITLMISGHE TTGTAMAWSL YWILQTPEVF QRLIQELDSL
301 GDSPDPMSIF RLPYLTAVCN ETLRINPVAM LTLPRVVKEP IELLGNRLET STTVVGCIYL
361 THHREDLYPE SKLFKPERFL KREFSQYEFM PFGGGVRGCI GQALAMFEMK IVLATVLSRY
421 QLALADRKPE RPQRQGFTLT PTNGVKMLIT GQHKRQNYSM AASTTFNA


CYP110C2   Nostoc punctiforme
           GenPept ZP_00108280.1
           GenEMBL NZ_AAAY02000070 complement(34550..35941)
           gene = Npun2703
           60% to 110C1
  1 MQLPNILKSP SLLQKLHWVS DPIGYMENAA QEYPDIFTGK IVGFGDTVVF VNHPQAIQEI
 61 LTNDRKKFTA VGELNGILKP LLGDNSVLML ESDRHKRQRQ LVTPSFHGER MQAYGQLICN
121 VSKKIFNQLP LNKPFVARNL TKEISLQVIL QSIFGFYEGE KIQKLRQLLP LLLELFESPL
181 SSSLFLFSFL QQDLGAWSPW GNFLRVREKI DQFLYTEIAE CQQQADPERI DILSLLISCR
241 DEAGQPMTDQ ELRDQLITLI LAGYDTTATA MAWGLYWIHK QPLVCEKLLQ ELDTLGDSPD
301 PMSISRLPYL TAVCNETLRI HPVTMFSFPR VVQEPLELLG HSLEPGTILL PSIYLTHHRE
361 NLYPQSKQFK PERFIERQFS PYEFLPFGGG VRRCMGEALA LFEIKLALAT IVSHYHLALV
421 DQRPEQPQRR GFNLAPGSGV KMVMTDQRAR KESLINMTTT PLS

CYP110D1   Nostoc sp. PCC 7120 Same as Anabaena
           GenPept BAB76465, AF2401
           NC_003272 complete genome 5678382..5679743
           48% to CYP110A1 53% to 110E1, 49% to 110B1
  1 MTVTQNLPNG PRIPRLLRLF KFITQPIQYV EDFAKVYGDN FTIWGSGESY FVYFSHPQAL
 61 EQIFTNVSCF ESSGGGSPLL ELLLGKNSLI LLEGDRHQRQ RQLLTPPFHG ERMRAYGQTI
121 REITQQVTQA WQMGKPFNIR ASMQEITMRV ILRVVFGVDE GELFQELRQL LTTLLDFMGS
181 PLMSSTFFFS FTQKDYGAWS PWGRMVRLIK KIDQLIYALI AQRRAEFGEN RQDILSLLIS
241 ARYDDGQPMS DVELRDELMT MLVAGHETTA SALTWAFYWI DSVPEVREKL FQELDTLNDD
301 SEPSIIAKLP YLTAVCQETL RFYPIVLNAF FRRTKNPMEI MGYKLPKATL VVPSIYLAHH
361 REEVYPQSKQ FRPERFLEKQ FSPYEYLPFG GGNRRCIGLA FAQYEMKIVL ATILSQFQVS
421 RLSKRPVQPV RRGLTLAAPG GMKMVANKRM RNS

CYP110D2   Nostoc punctiforme
           NZ_AAAY02000028 GenPept ZP_00109203.1
           52704..54170 gene = Npun3650
           68% to 110D1
MNIPLSVTLSNMKSRNNKIQKPSNLQTPMTATYNLPDGPQMPRW
LRTIKFISQPVKYVDDFAKTYGDTFTIRSSRSDNHIVYFSQPQALEEIFTADSRHFEV
GRGNTGLRFLLGDRSFMLVDGDRHQRQRQLLAPPFHGERMRAYGEDIRKITQQVSHEW
KIGKPFNIRESMQEITLRVILRVVFGLNEGELFEELRRSLSDLLDFISSPIMSSAFFF
RFIQKDFGAWSPWGRILLQRQKVDLLIYTLLRERRAQTDQNRQDILSLMMAARYDDGQ
GMSDEELHDELMTLLVAGHETTASALTWAFYWIDHLPEVREKLLQELNTIGVNPDLSS
VAKLPYLTAVCQETLRIYPIAMTAFVRIVKTPITIMGYELREGTAIVPSIYLAHHREE
VYPQSKQFKPERFLERQYSPYEYLPFGGGNRRCIGMAFAQYEMKIVLATVLSEFQVSL
VNKRPVHPVRRGLTVATPAGMRMVATPQVKRANTPALV

CYP110D3   Trichodesmium erythraeum
           GenPept ZP_00074554.1 GenEMBL NZ_AABK02000068
           complement(10019..11407) gene = Tery3870
           54% to 110D1
MTLPDGPSLSPLQRRLRTWKFIFSPLSAIEERYSEYGDIFRTNT
NSLYPFIYFCNPKAIQQIFTADPDTFTSGSINGILKYFVGLNSLLLQDGDRHKRQRKL
LMPPFHGDRMRKYGDLIYNITSNVISQWKIEQPFPIRKSTQEISLKVILAAVFGLDQE
GKSYEKLRVLMSDLLDSMSSPLSSTFLFFNFLRKDWGPWSPWGRFLRKKQELHELIIA
EIQTAKKEGNHRDDILSLLLEARDEAGNAMSDEEIKDELLTMLFAGHETTASALAWAL
YWIDMIPSVGEKLMAELATIPSNSDQVAITKLPYLSAICQETLRIYPIAMNAFPRVVQ
KPIEIMGYQLEPGMVAIVPIYLTHHREDIYPEPKKFKPERFLERQFSPYEYLPFGGGS
RRCIGSAFALFEMKLVLATILSQWELKLLPNQRISPVRRGLTMAPPANMRMVVKPKKS
WQKVSQPILTSG

CYP110E1    Nostoc sp. PCC 7120 Same as Anabaena
            GenPept BAB76532, AI2409
            NC_003272 complete genome 5753083..5754450
            50% to CYP110A2 53% to CYP110B1 53% to 110D1
  1 MKLPDSPKIP KFMQLVQWIY QPLQLMEASA KAHGDSFTLW LTNKRPIVFL SNPQAIQELF
 61 TTPLEQLDAR GTAQVLQPLL GENSLLLLSG ETHQRQRKLL TPPFHGDRMR AYGDIITNIT
121 KEVISNWQLG KPFSVRDSMQ EITLRVILQA VFGLREGERY TQLQKRLCDI LDLSGSALRS
181 TLSFLPALQI DLGRWSPWGH FLRQREAIDQ LLYAEIQDRR DHPDPSRTDI LSLMMAARDE
241 NGEAMTDVEL RDELMTLLVA GHETTASALT WALYWIHKLP QVREKLLAEL DNFGDNGDVN
301 EITRLPYLTA VCQETLRIYP IAMVTIPRIT KTNLEIGGHQ FAPGTMLVGC IYLMHRRPDL
361 YPQPQEFKPE RFLEKQYSLY EYLPFGGSNR RCVGMAFALY EMKLILATVL ANVDLALVDN
421 YPVKPTRRGV TLAPSGGKWL IATAQHQKIK NPVEV

CYP110E2   Nostoc punctiforme
           NZ_AAAY02000088 GenPept ZP_00107327.1
           complement(18173..19567) gene = Npun1723
           58% TO 110E1 55% TO 110B1
MSLLKLPNGPQTHPWIQMYQWLTNPLEYMEACTKRYGDIFTLKL
GQNFAHQVFISNPQAIQQIFTTDPKQLDSGESAGIKAPLLGQQSLLALDGKPHQRQRK
LLTPPFHGERMLAYGELIREITEQVSSQWQVGETFAVLPSMQAISFQVILKAVFGLED
GPRYKKLNELLIKILNPKIPLLRTVLLIFPSMRQDLGAWSPWGKYLRLRQQIDQLIYA
QIQERKAQPNLSGTDILSLMMAARDEAGEPMTDLELRDELMTLLVAGHETTATSLSWA
LYWIHHRPQVREKLLQELDNLGEKPDPNAIFRLPYLNAVCSETLRLYPVAMSALNRLV
KSPLQIGEYNFEPGTILIPSIYLTHHREDLYPESKQFKPERFLERQFSPYEYLPFGGG
NRRCIGMAFALFEMKLVLATVLSRWQMELADSKPVRPVRKGLLFSPAGGVQMVVKGKR
LQNQPILQTSSSSV

CYP110E3   Trichodesmium erythraeum
           GenPept ZP_00072591.1
           GenEMBL NZ_AABK02000017 complement(<3..1016)
           53% to 110E1 missing C-terminal 121 aa (runs off end of clone)
  1 MIKLPGPKSP ALTQILQWTA KPIKFMEKCA REYGDTFEVK LNYPIVFISH PKAIEEIFKA
 61 NPKKFDCGSS NKLAQPLLGD YSLLLLDDIP HQRQRKLLMP PFHGKRMQAY GELICNVAQE
121 VASKWEIGQV FSMREFTAEI SLKVILQAVF GLYEGERYSK LEKLLGSLLE SLSSPLKTSM
181 LFFQFLQIDL GPWSPWGNFI KNREEIYELL CAEISERRQK LDPERSDILT MLLLARDEEG
241 EGMSDIELRD ELMTLLIAGH ETTATSLSWA FYWIHHQPEI YQKLSRELET FGDDLNPMTV
301 INLPYMNAVC SETLRIYPVV IIVSPRKTKL PITIMGQT

CYP110E4   Gloeobacter violaceus PCC 7421
           GenEMBL AP006578 complement(257348..258724) 
           gene = gll3063
           NC_005125 complete genome complement(3256348..3257724)
           locus_tag = gll3063
           71% to 110E5 55% to 110E1
MSLPPGPSSPSPFQLMQWIGCPTDYLHTTAARYGDPFTMRVGVF
PPLVMFSDPRAIQQLFTAEAGTFDAGASNVALRPTLGANSLLLLDGERHQQQRRLLTP
PFHGERMRAYGELIRQVTEEVIVRWQPGKPFLVRNAMQRISLAVILQAVFGLHDGTRL
VRLRQALGSMLDAMSSPLSMAMLLMLPEDFGPWSPRARLQAHLGAIDELLYAEIRERR
EHFDAGAGDILGLLLAARDEAGAAMGDAELRDELMTLLVAGHETTATAMAWALYWIHY
LPQVRERLLAELDSLGSDPDPEAIARLPYLGAVCSETLRIYPVALIASPRVARHTVRI
LERDYEAGTRLAAGIYLAHHRPETYPEPERFRPERFLERTFSPYEFVPFGGGSRRCIG
MAFALYEMKLVIATVLLERDLRLVQPRLLRPVRRGVTLAPPEGLYLVPTGERSASRLL
SRTSTAGQ

CYP110E5   Gloeobacter violaceus PCC 7421
           GenEMBL AP006578 complement(258800..260176) gene = gll3064
           NC_005125 complete genome complement(3257800..3259176)
           locus_tag = gll3064
           71% to 110E4 55% to 110E2
MSLPAGPASPPPLQLLQWIGRPTDYLERTARRYGDPFTMRLGLH
SPVTGVFFSSPEAFQQLFNTEPGLFDSGGANASSTFNLLFGTNSLILLDGERHQQQRR
LLTPPFHGERMRSYGELIRTLAEQVTARWNLGTPFQARRSMQRISLGVILKAVFGLHD
GTRYLRVCRLLGNLIDASASPLLFGLRLIFPQDAGPMSPMGQLKAQIDAIDELLYAEI
RERRERPDPRADDILSLLMAARDEAGQGMGDVELRDELMTLLVAGHETTATAMAWALY
WIHRLPQVRERLLAELDSLGSDPDPEAIARLPYLGAVCSETLRIYPVAMVAFARVPRR
PVRILDREYPAGTFLIPNIYLAHRRPEAYPDPERFRPERFLERTFSPYEFVPFGGGSR
RCIGVAFALYEMKLVLATVLSRVELRLADPRPRLPVRRGLTLAPPEDLHLIPTALRSG
HRDLLPAC

CYP110F1   Nostoc punctiforme
           NZ_AAAY02000005 GenPept ZP_00111618.1
           complement(57031..58407) gene = Npun6096
           48% TO 110E1 48% TO 110D1
MKILDSLTTPSLLQTLQLIAKPTKTLENYATKYGDIFTMRVMGL
KSPPIVFFSHPQAISDCFAVPAHKLDFKKATHVFKPLFGENSIVFKEARSHQQQRQLL
LPAFHGDNLKSYGQAICQIAEELTQSWTSGTNICIHKLMSKITLEIILQVVFGITHGV
RYQQLKEQLSALLEDVTKPWYSSLFFFPSLQKDLGAWSPWGIFLKRREQIDKLIYAEI
SERRWQNDAMRTDILSLLMSAHDVNGQQMTDEELRDQLVSLLLLGYETTSGVLAWIFY
LIHSHPEVKHRLMQELSTLDNLTNPEAITQLPYLTAVCQETLRIHPIALICTPRMLKE
PVEIMGHKFTSETVLVPCIHLAHRRTDTYPEPEQFRPERFLNQKFSPYEYLPFGGGYR
GCIGAAFSMYELKLVTAIILSRFELSLTDKRPAYPVRRGITIVPSGGVKMVVTKKAKF
KRQTILST

CYP110G1   Trichodesmium erythraeum
           GenPept ZP_00074734.1
           GenEMBL NZ_AABK02000081 complement(2404..3738)
           42% to 110C1
  1 MKQVCALKTP LWLQRFNYIT NPVSYWQKAY SSYKDAFYAQ GINFGKPLMV FYTPSAAKQI
 61 IENCQGDLTT TSFDSELTAI FGDSSFFILE GTNHKKMRKL LIPALHGKHI KTYGELICNL
121 VNNLIENLPF NQSFSALEIA QEISMQVMIK LLFGNYQQER YQKIKQLMIN MVSLFAANVF
181 GFPLFFKFLQ QDLGLVSPWG NFLQQRRKIQ QLIYQEIAER RNHPNQERTD ILSLLMTAQD
241 EKGNFLNDEE LLGQLLSLLF TGNESTAASI AWSWYEVYRN SKIKEKLLEE INNLGDSPEP
301 LSLFNLPYLS AVCNETLRKY PVTMFMIPRI VKNTTEINGY QLDKGMLVTV GTYILHHRED
361 IYDQPEEFKP ERFIEHRFSS FEFLPFGRGM RGCIGADIAL YQMKLTLATI ISHHRLELTN
421 YGQIFPKRRN TILTPIKLRI IKAC

111 Family

CYP111A1    Pseudomonas incognita
            GenEMBL L23310 (2080bp)
            Ropp,J.D., Gunsalus,I.C. and Sligar,S.G.
            Cloning and expression of a member of a new cytochrome P-450
            family: cytochrome P450lin (CYP111) from Pseudomonas incognita.
            J. Bact. 175, 6028-6037 (1993)

CYP111A2    Novosphingobium aromaticivorans
            GenEMBL NZ_AAAV01000134
            complement(20145..21356) gene = Saro1618
            65% to CYP111A1
MLDLKNPDTYQGGVPYAALQDLRAEGPVHWNPESDGAGFWAVLG
HDEIVAVSRQPDLFSSAFENGGHRIFNENQVGLTGAGESAIGIPFISRDPPSHTQYRK
FVMPALSPARLQGIEERIAKRVERLFAQVPLGETVNILPLLTVPLPLLTLAELLGVPA
DLWPDLHRWTDAFVGEDDPDFRQSPEAMQAVLAEFMGFATALFEDRRANPGPDIASLL
ANTEIRGEPAPLRDFIANLILALVGGNETTRNSINHTMIALAENPGQWDILRADPSLM
TAAVKEMVRFASPVIHMRRTAMRDTQLGQQAICKGDKVVIFYPAGNRDPAVFENPDRF
EITRPVRQHLAFGSGAHVCVGSRLAEMQLRLAFAEMARHVRAFEVVGEPSRVRSNFIN
GFKRLEVRLLV

112 Family

CYP112A1    Bradyrhizobium japonicum
            GenEMBL L02323 L12971 U12678 (11,715bp)
            NC_004463 complete genome 2317922..2319127
            Tully,R.E. and Keister,D.L.
            Cloning and mutagenesis of a cytochrome P-450 locus from
            Bradyrhizobium japonicum that is expressed anaerobically and
            symbiotically
            Appl. Environ. Microbiol. 59, 4136-4142 (1993)
            Note: called BJ-1 see CYP114, CYP115P, CYP117

CYP112A2    Rhizobium sp. NGR234 plasmid pNGR234a
            GenEMBL   AE000083 
            NC_000914 complement(233666..234868)
            Gene = y4lD
            Freiberg,C., Fellay,R., Bairoch,A., Broughton,W.J., Rosenthal,A.
            and Perret,X.
            Molecular basis of symbiosis between Rhizobium and legumes
            Nature 387 (6631), 394-401 (1997)
            about 92% identical to 112A1
MPEQPLPTLPMWRVDHIEPSPTMLALRANGPIHNVRFPRGHEGW
WVTGYDEAKAVLSDAAFRPAGMPPAAFTPDCVILGSPGWLVSHEGGEHARLRTIVAPA
FSDRRVKLLAQQVEAIAAQLFETLAAQPQPADLRRHLSFPLPAMVISALMGVLYEDHA
FFAGLSDEVMTHQHESGPRSASRLAWEELRAYIRGKMRDKRQDPGDNLLTDLLAAVDR
GEATEEEAIGLAAGMLVAGHESTVAQIEFGLLAMLRHPQQRERLVGNPSLVDKAVEEI
LRMYPPGAGWDGIMRYPRTDVTIAGVHIPAESKVLVGLPATSFDPRHFEDPEIFDIGR
DAKPHLAFSYGPHYCIGMALARLELKVVFGSIFQRFPALRLAVAPEELKLRKEIITGG
FEEFPVLW

CYP112A3v1  Mesorhizobium loti
            GenPept NP_106888                
            95% to 112A2 Rhizobium sp. NGR234
  1 MSEQPLPTLP MWRVDHIEPS PEMLALRANG PIHHVRFPSG HEGWWVTGYD EAKAALSDAA
 61 FRPAGMPPAA FTPDSVILGS PGWLVSHEGG EHARLRTIVA PAFSNRRVKV LAQQVEAIAA
121 QLFETLAAQP QPADLRRHLS FPLPAMVISA LMGVLYEDHA FFAGLSDEVM THQHESGPRS
181 ASRLAWEELR AYIRGKMWDK RQDPGDNLLT DLLAAVEQGN ATEEEAIGLA AGMLVAGHES
241 TVAQIEFGLL AMFRHPQQRE RLVGDPSLVD KAVEEILRMY PPGAGWDGIM RYPRTDVTIA
301 GVHIPAESKV LVGLPATSFD PRHFDDPEIF DIGRDENPHL TFSHGPHYCI GMALARLELK
361 VVVGSIFQRF PALRLAVAPE ELKLRKEIIT GGFEEFPVLW

CYP112A3v2  Mesorhizobium loti
            GenEMBL AL672112 complement(85404..86606)
            Strain R7A symbiosis island
            Gene = msi071
            2 DIFFS with CYP112A3v1

CYP112A4   Rhizobium etli symbiotic plasmid p42d
           NC_004041 55365..56645
           89% to 112A3
           gene = cpxP2
MSEQSLPTLPMWRVDHIEPSPEMLALRAKGPIHRVRLPSGHECW
WVTGYDEAKAVLSDAAFLPAGMPPADFTPDSVILGSPGWLVSHEGDEHARLRTIVAPA
FSNSRVKLLTQQVEAITVQLFDTLAVQPQPADLRRHLSFPLPAKVISALMGVPFEEHA
FFAGLSDEVMTHQHESGPRSASGLAWEELRAYIHGKIRGKRQDPGDNLLTDLLAAVDQ
GKATEEEAIGLAAGVLVAGHESTVAQIEFGLLAMFRHPQQRERLVRDPSLVDKAVEEI
LRMYSPGAGWDGIMRYPRTDVTIAGVHIPAESKVLVGLPATSFDPCHFKDPEVFDIGR
DANPHLAFSYGQHNCIGAALARLELKAIFGSIFQRFPALRLAVAPEELKLRKEIITGG
FEEMPVLWCGRPPASQSSHLAAPGAHRSDQPLDR

113A Subfamily

CYP113A1    Saccharopolyspora erythraea
            GenEMBL L05776 (1320bp) S51613 U82823 PIR B40634 (412 amino acids)
            Stassi,D.L., Donadio,S., Staver,M.J. and Katz,L.
            Identification of a Saccharopolyspora erythraea gene 
            required for the final hydroxylation step in erythromycin
            biosynthesis.
            J. Bact. 175, 182-189 (1993)
            eryK erythromycin C-12 hydroxylase
            Note: two different database entries have different start 
            codons.  Neither is ATG.

113B Subfamily

CYP113B1    Streptomyces fradiae
            GenEMBL U08223 (7082bp)
            Merson-Davies,L.A. and Cundliffe,E.
            Analysis of five tylosin biosynthestic genes from the tylIBA 
            region of the Streptomyces fradiae genome.
            unpublished (1994)

CYP113B2    Streptomyces caelestis 
cytochrome P-450 hydroxylase homolog (nidi)
GenEMBL AF016585 CDS complement(1-396) N-term only, 60% to 113B1
MVDSVTGPMELSKDANAKELLDWFSHNRTHHPVFWDEGRQAWQV
FRYDDYLTVSNHPEFFSSDFTEVAPTPPELEMILGPGTIGALDPPAHGPMRKLVSQAF
TPRRMAGQEQRIRVIAEELLDRVRGQKTIA

CYP113C1   Streptomyces virginiae
           GenEMBL AB072568 4994..6202
           46% to 113A1
           gene = visD
MAQQTPPAPPSMADGGKAMLAWLRTMRDEHPVHEDQYGVFHVYR
HSDVLAVTSDPAVFSSDLSRLRPDSSALSEEILSVIDPPLHRKLRSLVSQAFTLRTVA
DLEPRVTELAGRLLEKVEGSEFDLVGDFAYPLPVIVIAELLGVPAEDRELFRGWSDRM
LSMQVDDPLEIQFGDEAGEDYERLVKEPLKEMHAYLQRHVDARRETPGDDLLSRLVTA
EIAGERLTDRQIVEFGALLLMAGHVSTSMLLGNTVLCLEENPETAAALRADRALISGV
IEEVLRMRPPITVAARVTTGEVVVGGVTIPKDRMVMASLLSANHDERHIQDPEVFDPR
RSPNPQLAFGHGIHYCLGGPLARLEGRVALEMLLDRFEDIRVTPGAPYDFHREGLFVP
ARSPLTVRRG

114 Family

CYP114A1    Bradyrhizobium japonicum
            GenEMBL L02323 L12971 U12678 (11,715bp)
            NC_004463 complete genome 2319222..2320511
            Tully,R.E. and Keister,D.L.
            Cloning and mutagenesis of a cytochrome P-450 locus from
            Bradyrhizobium japonicum that is expressed anaerobically and
            symbiotically
            Appl. Environ. Microbiol. 59, 4136-4142 (1993)
            Note: called BJ-3 see CYP112, CYP115P, CYP117

CYP114A2    Rhizobium sp. NGR234 plasmid pNGR234.
            GenEMBL AE000082 CDS comp (9861..11264) gene = y4lC 
            NC_000914 complement(232170..233573)
            cytochrome P450 BJ-3 homolog" 90% to CYP114A1
MDMQETTTACADAFAELASPACIDDPYPFMRWLREHDPVHRAAS
GLFLLSRHADICWALKATGDAFRGPAPGELARYFPRAATSLSLNLLASTLAMKEPPTH
TRLRRLISRDFTMREIDNLRPSIARFVAARLDGMAPALERGEAVDLHRQFALALPMLV
FAELFGMPQDDMFGLAAGIGAILEGLSPHASDPQLAAADAASARMKAYFGDLIQRKCI
DPRHDIVATLVGAHDDDADTLSDAELISMLWGMLLGGFATTAATIDHAVLAMLAYPDQ
RHWLQGDAAGVEAFVEEVLRCDAPAMFSSIPRIAQSDIELSGVVIPKNADVRVLIAAG
NRDPDAFADPDRFDPARFYGTSPGMSTDGKIMLSFGHGIHFCLGAQLARVQLAESLPR
IQARFPTLTVAEQPTREPSAFLRTFRALPVRLHAQGDSPRLTSAFLNGQRGVEGGASF
EHGDGERRSATDRRAQP

CYP114A3v1  Mesorhizobium loti
            GenPept NP_106889          
            92% to 114A2
  1 MDVQETTAAC RDAFAELASP ACIQDPYTFM RWLREHDPVH RAASGLFLLS RHADIYWALK
 61 ATGDVFRGPA PGELARYFPR AETSLSLNLL ASTLAMKEPP THTRLRRLIS RDFTIRQIDN
121 LRPSIARIVA ARLDGMAPAL ERGEAVDLHW EFALAVPILV FAELFGMPQD DMFGLAAGIG
181 AILEGLSPHA SDPQLAEADA ASARVQAYFG DLIQRKRTDP RNDIVSMVVG AHDDDADTLS
241 DAELISMLWG MLLGGFATTA ATIDHAVLAM LAYPEQRHWL QGDAVGVKAF VEEVLRCDAP
301 AMFSSIPRIA QRDIELGGVV IPKNADVRVL IAAGNRDPDA FSDPDRFDPA RFYGTTPGMS
361 TDGKIMLSFG HGIHFCLGAQ LARVQLAESL PRIEARFPTL ALAEQPTREP SAFLRTFRAL
421 PVRLHAQGG

CYP114A3v2  Mesorhizobium loti
            GenEMBL AL672112 complement(84020..85309)
            Strain R7A symbiosis island
            Gene = msi070
            10 DIFFS with CYP114A3v1

CYP114A4   Rhizobium etli symbiotic plasmid p42d
           NC_004041 56651..58252
           90% to 114A3
           gene = cpxP3
MDVQDTTAACHDAFAELASPACIQDPYPFMRWLREHDPVHRAAS
GLFLLSRHADIYWAFKATGDAFRGPAPSELARYFPRAASSLSLNLLASTLAMKEPPTH
TRLRRLISRDFTVGQIDNLRPSIARIVAARLDGMAPALERGEAVDLHREFALALPMLV
FAELFGMPQDDVFELSAIVSAILEGLSPHASDPQLAAADVASARVKAYFGDLILRKRA
DPRRDIVSTLVGAHTDDADTLSDAELISMLWGMLLGGFATTAATIDHAVLAMLAYPEE
RHWLQGDAAGVEAFVEEVLRCEAPAMFSSIPRIAQRDIELHGVVIPKDADVRVLIAAG
NRDPDAFADPDRFDPVRFYGTRPGMSSDGKIMLSFGHGIHFCLGAQLARVQLAESLPQ
IQARFPTLALAEQPTREPSAFLRTFRALPVRLHAQAAAEVRVVVDQDLCGTTGQCVLT
LPGTFRQREPDGVAEVCMATVPQALHAAVRLAASQCPVAAIRVIESEAGDDHCTNPGP
TPSPADAERHAAKDLRNPGEHDGTI

115 Family

CYP115A1P   Bradyrhizobium japonicum
            GenEMBL L02323 L12971 U12678 (11,715bp see 1351-1578)
            NC_004463 complete genome 2317600..2317905
            Tully,R.E. and Keister,D.L.
            Cloning and mutagenesis of a cytochrome P-450 locus from
            Bradyrhizobium japonicum that is expressed anaerobically and
            symbiotically
            Appl. Environ. Microbiol. 59, 4136-4142 (1993)
            Note: called BJ-2 see CYP112, CYP114, CYP117
            Note: This gene fragment has a perfectly good P450 sequence 
            of 76 amino
            acids that includes the C-terminal up to a stop codon.
            This may be a fragment of another intact P450 that was 
            broken up or
            rearranged during cloning.  A pseudogene would be expected 
            to have lost
            integrity slowly and the whole gene should fade at about 
            the same rate.
            This fragment is good but no upstream region continues it.
GDADRFDVTRRHNPHLSFGQGPHFCLGAALARLELGCAFPAL
FVRLEHLALTIAAEDVVYMPSYVIRCPQRLPVTFRPSIA

CYP115A2v1  Mesorhizobium loti 
            GenPept NP_106680 88% to CYP115A1P 39% to 154C1 41% to 154A1 
  1 MPAAPTQLDR LSSAILRQGG MARVSLPGDV VTWAAARHQT LRQMLSDQRF NKDWRQWRAL
 61 QDGEIPEDHP LIGICKVDNM TTAHGADHRR LRGLLSSSFA PSRIALLAPR VEQCVDRLLA
121 EMAQRGGSAD LMSEFAAPLP TNVIAELFGL PDEQREEIVA LTYSLASTSA TAEEVRQTRQ
181 RIPEFFRRLI ALKRGQLGDD LASALIVARD KGELVSDTEL IDMLFMVLSA GFVTTAGVIG
241 NGVLALLTHP QQLHLVRSGQ VPWSQAIEEI LRWGTSAANL PFRYATQDVE IDGCLVRRGD
301 AVLMAFHAAN RDEKAFGPGA NRFDVTRRHN PHLSFGEGPH SCLGAALARL ELRCAFPPLF
361 GRLEDLALTI AAEDVVYMPS YVIRCPQRLP VSFRPSVA

CYP115A2v2  Mesorhizobium loti
            GenEMBL AL672113 41375..42607
            Strain R7A symbiosis island
            Gene = msi159
            10 DIFFS with CYP115A2v1

CYP115A3P   Rhizobium etli symbiotic plasmid p42d
            NC_004041 54883..55296
            70% to 115A1P 70% to 115A2
            gene = cpxP1 pseudogene C-terminal
ANSYGRPTYGDTDMFDFNRLQNPHLPLGQGPHLCLGAALARLELGSVFPPPFVRPEDLALAIAAE

116 Family

CYP116A1    Rhodococcus erythropolis
            GenEMBL U17130 (6458bp)
            Nagy,I., Schoofs,G., Compernolle,F., Proost,P., Vanderleyden,J. 
            And De Mot,R.
            Degradation of the thiocarbamate herbicide EPTC (S-ethyl
            dipropylcarbamothioate) and biosafening by Rhodococcus sp. NI86/21
            involve an inducible cytochrome P-450 system and aldehyde
            dehydrogenase.
            unpublished

CYP116B1    Ralstonia metallidurans
            GenEMBL NZ_AAAI01000322 
            25751..28093 gene = Reut3205
            52% to CYP116A1 with C-term. Extension
            extension may contain a reductase and a ferredoxin component
MPQTNAPASSGSCPIDHSALRAPNGCPISHQAAAFDPFEDGYQQ
DPPEYVRWSRAQEPVFYSPKLGYWVVTRYDDIKAIFRDNITFSPSIALEKITPTGEAA
NAVLASYGYAMNRTLVNEDEPAHMPRRRALMEPFTPAELAHHEPMVRKLTREYVDRFI
DTGRADLVDEMLWEVPLTVALHFLGVPEEDMDLLRQYSIAHTVNTWGRPKPEEQVAVA
HAVGNFWQLAGRILDKMREDPSGPGWMQYGLRKQRELPEVVTDSYLHSMMMAGIVAAH
ETTANASANAIKLLLQHPDVWREICEDPALIPNAVEECLRHNGSVAAWRRLVTRDTEV
GGMSLAAGSKLLIVTSSANHDEHHFADADLFDIHRDNASDQLTFGYGSHQCMGKNLAR
MEMQIFLEELTSRLPHMRLAGQRFTYVPNTSFRGPEHLWVEWDPARNPERTDPTVLAP
RDAVRIGEPTGGTTGRTLIVERVETAAQGVSRIRLVSPDGRALPRWSPGSHIDIECGH
TGISRQYSLCGDPADTSAFEIAVLREPESRGGSAWIHASLRAGDKLKVRGPRNHFRLD
ETCRRAIFIAGGIGVTPVSAMARRAKELGVDYTFHYCGRSRASMAMIDELRALHGDRV
RIHAADEGQRADLAQVLGAPDANTQIYACGPARMIEALEALCATWPEDSLRVEHFSSK
LGTLDPSREQPFAVELKDSGLTLEVPPDQTLLATLRAANIDVQSDCEEGLCGSCEVRV
LAGEIDHRDVVLTRGERDANNRMMACCSRAAKGGKIVLGL

CYP116B2   Rhodococcus sp. NCIMB 9784
           GenEMBL AF459424 
           66% to 116B1 over full fusion protein length
           extension may contain a reductase and a ferredoxin component
MSASVPASAPACPVDHAALAGGCPVSANAAAFDPFGSAYQTDPA
ESLRWSRDEEPVFYSPELGYWVVTRYEDVKAVFRDNILFSPAIALEKITPVSAEATAT
LARYDYAMARTLVNEDEPAHMPRRRALMDPFTPKELAHHEAMVRRLTREYVDRFVESG
KADLVDEMLWEVPLTVALHFLGVPEEDMATMRKYSIAHTVNTWGRPAPEEQVAVAEAV
GRFWQYAGTVLEKMRQDPSGHGWMPYGIRKQREMPDVVTDSYLHSMMMAGIVAAHETT
ANASANAFKLLLENRAVWEEICADPSLIPNAVEECLRHSGSVAAWRRVATADTRIGDV
DIPAGAKLLVVNASANHDERHFERPDEFDIRRPNSSDHLTFGYGSHQCMGKNLARMEM
QIFLEELTTRLPHMELVPDQEFTYLPNTSFRGPDHVWVQWDPQANPERTDPAVLHRHQ
PVTIGEPAARAVSRTVTVERLDRIADDVLRLVLRDAGGKTLPTWTPGAHIDLDLGALS
RQYSLCGAPDAPSYEIAVHLDPESRGGSRYIHEQLEVGSPLRMRGPRNHFALDPGAEH
YVFVAGGIGITPVLAMADHARARGWSYELHYCGRNRSGMAYLERVAGHGDRAALHVSE
EGTRIDLAALLAEPAPGVQIYACGPGRLLAGLEDASRNWPDGALHVEHFTSSLAALDP
DVEHAFDLELRDSGLTVRVEPTQTVLDALRANNIDVPSDCEEGLCGSCEVAVLDGEVD
HRDTVLTKAERAANRQMMTCCSRACGDRLALRL

117 Family

CYP117A1    Bradyrhizobium japonicum
            GenEMBL L02323 L12971 U12678 (11,715bp)
            NC_004463 complete genome 2321653..2322996
            Tully,R.E. and Keister,D.L.
            Cloning and mutagenesis of a cytochrome P-450 locus from
            Bradyrhizobium japonicum that is expressed anaerobically and
            symbiotically
            Appl. Environ. Microbiol. 59, 4136-4142 (1993)
            Note: called BJ-4 see CYP112, CYP114, CYP115P

CYP117A2    Rhizobium sp. NGR234 plasmid pNGR234a
            GenEMBL   AE000082 complement(7357..8700) U00090
            NC_000914 complement(229666..231009) gene = y4kV
            Freiberg,C., Fellay,R., Bairoch,A., Broughton,W.J., Rosenthal,A.
            and Perret,X.
            Molecular basis of symbiosis between Rhizobium and legumes
            Nature 387 (6631), 394-401 (1997)
            about 90% identical to 117A1
MNVLLNPLNRRHRLRYDIPVMPGAFPLVGHLPAIVCDLPRLLRR
AERTLGSHFWLDFGPAGHLMTCVDPHAFALLRHKDVSSALIEEIAPELLGGTLVAQDG
GAHRQARDAIKAAFLPEGLTQAGIGDLFAPVIRARVQAWRDRGDVTILPETGDLMLKL
IFTLMGVPAQDLPGWHRKYRQLLQLIVAPSVDLPGLPLRRGRAARDWIDAQLRQFVRD
ARAHAARTGLINDMVSAFDRSDDALSDDLLVANIRLLLLAGHDTTASTMAWMVIELAR
QPMLWDALVEEAQRVGAVPTRHADLEQCPVAEALFRETLRVHPATTLLPRRALQELQL
GQRRIPAGTHLCIPLLHFSTSALLHEAPDQFRLARWLQRTEPIRPVDMLQFGTGPHVC
IGYHLVWLELVQFSIALALTMHKAGVRPLLLSGVEKGRRYYPTAHPSMTIRIGFS

CYP117A3    Mesorhizobium loti
            GenPept NP_106891
            NC_002678 complete genome 5191629..5192972
            locus_tag = mlr6367
            94% to 117A2
  1 MDMLLNPLDR RHRLRDDIPV VPGAFPLVGH LPAIVCDLPR LLRRAERTLG SHFWLDFGPA
 61 GHLMTCVDPD AFALLRHKDV SSALIEEIAP ELLGGTLVAQ DGGAHRQARD AIKAAFLPKG
121 LTQAGIGNLF APVIQARVQA WRDRGDVTIL RETGDLMLKL IFSLMGIPAQ DLPGWHRKYR
181 QLLQLIVAPP VDLPGLPLRR GRAARDWIDA QLRQFVRDAR AHAARTGLIN DMVSSFDRGD
241 DALSDDVLVA NIRLLLLAGH DTTASTMAWM VIELARQPGL WDALVEEAQR VGAVPTRHAD
301 LAQCPVAEAL FRETLRVHPA TTLLPRRALQ ELQLGQRRIP AGTPLCIPLL HFSTSALLHE
361 APDQFRLARW LQRTEPIRPV DMLQFGTGPH VCIGYHLVWL EMVQFCIALA LTMHKAGVRP
421 RLLSAVEKGR RYFPTAHPSM KIRIGFS

CYP117A3v2  Mesorhizobium loti
            GenEMBL AL672112 complement(81551..82888)
            Strain R7A symbiosis island
            Gene = msi068
            2 DIFFS with CYP117A3v1

CYP117A4   Rhizobium etli symbiotic plasmid p42d
           NC_004041 59081..60424
           85% to 117A2
           gene = cpxP4
MDMLLNPLNRWRRLRDDIPVMPGAFPLVGHLPAIVCDLPRLLRR
AERTLGSHFWLDFGPAGHLMTCLDPDALALLRHKEVSSALIEEMAPDILGGTLVTLDG
SAHRQARDGIKAAFLPRGLTEAGIGELFEPIIRAQVKAWRDRGEVAILPDTRNLMLKL
TFSLMGIPAQDLSEWHRKYRQLLQLMVAPPIDLPGMPLRRGRAARDWIDAQSRQFIRD
ARARAARTGLINDMVSAFDCSDGALSDDVLVANIRLLLLAGHETSASTIAWMVIELAQ
HPELWDALVEEAQRVGAVPTGHEDLAQCPVAEALFRETLRMHPASSLVPRRAMQELQL
GQRRIPSGTHLCIPLLHFSTSPLLHEAPDQFRLGRWLQRTEPIRPVDMLQFGAGPHVC
MGYHLVWLELVQFSIALALTMQEAGVRPRLMSGVEKGRRYYPTAHPSMTVRIGFS

118 Family

CYP118P1    Mycobacterium leprae
            GenEMBL L04666 (40,123bp)
            Smith,D.R.
            M. leprae cosmid dna sequence
            Unpublished (1992)
            Note 15,700 to 17,350 is the region of interest

CYP118P1    Mycobacterium leprae
            GenPept CAC31116                 
            NC_002677 547312..547788 locus_tag = ML0447
            NC_002677 complement(2562932..2563627) locus_tag = ML2159 
            (a duplication of the seq.)
            putative fatty oxidation complex alpha subunit 
            Sequence below is from TIGR primary nucleotide sequence for ML2159
            CYP118 exact match, 49% to 102C1
  4 TASQHDDILDIMLYSADPSTGEQLDTDNVVNQILTLLVSGSQTLANAIAFALHYLLSIHH 183
184 DIAAQTRREIYQNRSDRGIANVSY
258 FGDVVKLRCLRRVVDATLRLWS
    VPCYLRQARRD 360
361 TTLGNGTSLFHKGQWVIVLLTAPMPG
    WGPDANEFNPDRXXXXXXXXXXXXXXXX 470
520 FGTGLRTCIGRRFALHEMALELTMIVHQYILSRADPG 
    YCLSISEAFTLKTVGL 677

119 Family

CYP119A1   Sulfolobus solfataricus (an archaebacterium)
           GenEMBL U51337 (1254bp)
           Wright, R.L., Harris, K., Solow, B., White, R.H. and Kennelly, P.J.
           Cloning of a potential cytochrome P450 from the Archaeon Sulfolobus 
           solfataricus.
           FEBS. Lett. 384, 235-239 (1996)

CYP119A2   Sulfolobus tokodaii
           GenPept BAB66184                 
           64% to CYP119A1 U51337 Sulfolobus solfataricus
  1 MYDWFKQMRK ESPVYYDGKV WNLFKYEDCK MVLNDHKRFS SNLTGYNDKL EMLRSGKVFF
 61 DIPTRYTMLT SDPPLHDELR NLTADAFNPS NLPVDFVREV TVKLLSELDE EFDVIESFAI
121 PLPILVISKM LGINPDVKKV KDWSDLVALR LGRADEIFSI GRKYLELISF SKKELDSRKG
181 KEIVDLTGKI ANSNLSELEK EGYFILLMIA GNETTTNLIG NAIEDFTLYN SWDYVREKGA
241 LKAVEEALRF SPPVMRTIRV TKEKVKIRDQ VIDEGELVRV WIASANRDEE VFKDPDSFIP
301 DRTPNPHLSF GSGIHLCLGA PLARLEARIA LEEFAKKFRV KEIVKKEKID NEVLNGYRKL
361 VVRVERA

120 Family

CYP120A1    Synechocystis sp. (strain PCC6803) Cyanobacterium
            GenEMBL   D64003(113064bp)
            coding region 62160-63494
            Kaneko,T., Tanaka,A., Sato,S., Kotani,H., Sazuka,T., Miyajima,N.,
            Sugiura,M. and Tabata,S.
            Sequence analysis of the genome of the unicellular cyanobacterium
            Synechocystis sp. strain PCC6803. I. sequence features in the 1Mb
            region from map positions 64% to 92% of the genome
            DNA Res. 2,153-166 (1995)
            note: gene slr0574 (previously had incorrect gene identifier here)

NT01NS3472 Nostoc sp. PCC 7120 in TIGR not in Genbank
40% to CYP120 aa 399-443
MEMKIVAAHLLRRYHWEILPNQSLDSVLVPTNQPQDGLRVRFQPL

CYP120A2   Trichodesmium erythraeum
           NZ_AABK02000021 
           complement(1844..2800) gene = Tery2088
           318aa (short) 57% to 120A1 (missing N-term 127aa)
MTANYLEKWVEMGTLTWYPEIRNYTFDIASLLFMGSDESSQTKL
VSLFEEWVKGLFSIPLSLPWTRFGKSLRCRQKLLQHIEEIILQRQQQQNLGEDALGIL
LQAQDKEVNGLSLDELKDQILLLLFAGHETLTSAIASFCLLTSQHLDVLTRLRQEQKQ
FSAIEPLTLENLKRMTYLDMVLKEVLRLIPPVGGGFRQVTQDCEFCGYSIPKGWLVQY
QIAKTHQDETLYPDDKNFDPERFAPENAVDKQKVFGYVPFGGGMRECLGKEFARLEMK
IFAVMLLRGYEWELLPEQDLSVVAAPTPYPRDGLKVKFRKVE

CYP120B1   Nostoc punctiforme
           NZ_AAAY02000018.1 
           complement(62382..63695) gene = Npun4299
           43% TO CYP120A1
MKTNQIPPGSFGLPVLGETLSFVFDRDFAKKRYHQYGPIFKTHL
LGRPTVVMAGPEALEFVLSSHIENFSWREGWPDNFKTLLGESLFLQDGEEHRRNRRLM
MPALHGPALASYFSTMEDITRSYLQKWEKKQEFTWFQEFKQLTFDIASQLFLGTRPGP
ECVRLSQLFTTLTNGLLAINPLPLPFTTFGKAIAARNEILEHLTQVVRERQQNPTQDT
ISLLIKAKDEDGNSLSEKEIIAQAVLLLFAGHETTTSMLTWLCTELACHPEVLEKARV
EQLQLASQGDLDLEQLGKMPYLEQVLWEVERLHQPVGGGFRGVIKDFELNGYHVPTGW
QLYYSIGVTHQIEEIYSEPELFDPDRFSPQRQEHKKYPFSLVGFGGGPRICIGIAFAK
MEMKIVAAHLLRSYHWEILPNQSLEVVAVPTNRPKDGLRVRFQPR

CYP120C1   Nostoc punctiforme
           NZ_AAAY02000127.1 GenPept ZP_00106106.1
           8154-9512 gene = Npun477
           44% to 120B1 36% to CYP120A1
MQQLKSAEEIPGSYGLPILGETLEIFRDSELYLWRRFQQYGSVF
KTSVLGRKRAYLIGPSANRLVLVEQAENMSSRIGWYFLESTFGNNILLQDGEEHRLTR
RLMYPAFHGKAIATYFDTIQNIVQDFLKDWGERGTISLNSSFRQLTLMIATRLFLGSQ
NKSEVEQTSQWFTQLLDSSMAIFKWNVPFTLYGRGQNARGKLVAFLREAIAQRIEQGN
LEESKDVLGLLLAAVDEDGNKLSETQVINEALLLLFAGHETTASLLTWVIFELGNHPE
WRERLRQEQLAVVGNNPLSLSHLKQFPQLTNVLKEAERLYPPVYAYNRGVLKDIEYGG
YRIPAGWFVTISPMLTHRLPELYTEPDRFDPDRFAPPREEDKKHPLALMGFGYGSHSC
LGMEFAQMEMKIVLSTLLRHYDWTVKPDYSAIAPVRQPSKVKDILQAYIEPLLIKHPL
DS

121 Family

CYP121A1   Mycobacterium tuberculosis
           GenEMBL Z77163 (42861bp) gi 1449344 Rv2276
           complement (32358 to 33548)
           unpublished

CYP121A1   Mycobacterium bovis subsp. bovis AF2122/97, 
           NC_002945 complete genome 2526703..2527893
           Gene = cyp121 100% match
           locus_tag = Mb2299

122 Family

CYP122A1   Streptomyces sp.
           GenEMBL U65940( 2500bp)
           nearly identical to rapJ gene of St. hygroscopicus involved in 
           rapamycin biosynthesis

CYP122A2   Streptomyces hygroscopicus
           GenEMBL X86780 (107379bp)
           coding region 96465-97625
           rapJ

CYP122A3   Streptomyces hygroscopicus var. 
           GenEMBL AF235504 CDS 71460..72626 
           gene="fkbD"
           note="C9 hydroxylase" 89% to 122A1 77% to 122A2

123 Family

CYP123     Mycobacterium tuberculosis
           GenEMBL Z80226 (34809bp) gi 1550644 Rv0766c
           complement (8322-9530)

CYP123     Mycobacterium bovis subsp. bovis AF2122/97, 
           NC_002945 complete genome complement(861053..862261)
           Gene = cyp123 100% match
           locus_tag = Mb0789c

124 Family

CYP124A1   Mycobacterium tuberculosis
           GenEMBL Z77163 (42861bp) gi 1449354 Rv2266
           complement (39907-41193)

CYP124A1   Mycobacterium bovis subsp. bovis AF2122/97, 
           NC_002945 complete genome 2519058..2520344
           Gene = cyp124 100% match
           locus_tag = Mb2289

CYP124B1   Streptomyces cinnamonensis
           GenEMBL AF440781 93981..95273
           polyether antibiotic monensin biosynthesis gene cluster
           41% to CYP124
           gene = monD
MGLTVGPDNAKRGIVPITDSKPAATFPDLVDPSFWARPHAERVA
LFEEMRGLPRPAFIRQNMPGVPWTFGYHALVKYADIVEVSRRPQDFSSNGATTIIGLP
PELDEYYGSMINMDNPEHSRLRRIVSRSFGRNMIPEFEAVATRTARRIIDELIARGPG
DFIRPVAAEMPIAVLSDMMGIPAEDHDFLFDRSNTIVGPLDPDYVPDRADSERAVIEA
SRELGDYIAGLRAERLAAPGNDLITKLVQVQADGEQLTRQELVSFFILLVIAGMETTR
NAISHALVLLTEHPEQKQLLLSDFDTHAPNAVEEILRVSTPINWMRRVATRDCDMNGH
RFRRGDRIFLFYWSGNRDESVFPDPYRFDITRGTNAHVTFGAVGPHVCLGAHLARMEI
TVLYRELLAALPQIHAVGQPRRLDSSFIEGIKHLHCAF

CYP124B2   Streptomyces nanchangensis NS3226
           GenEMBL AF521085 complement(100196..101467)
           polyether ionophore nanchangmycin biosynthetic gene cluster
           41% to CYP124
           gene = nanP
MNRGVVSPTEATPASSAKATRPPDFMDPSFWLRPRDERAEVFEK
LRALPGPEFVPPRLPWGPLASGYYALSKHADICEVSRRPQDFSSEGATAILPPEMDEF
YGSMINMDNPEHSRLRRIVARSFGRGMAPKFDAMSRRVARRIVDELIERGPGDFIRPA
AEMPIAVLSTMMGIPGEDYEFLFERTNTIMGGADPELAADPEKMAAAVLGALRDLGDY
IGRLREDRLARPGPDVITKLVQVQEDGEQLTNQELVSFFILLINAGMETTRNVIAQAL
VLLTEHPDQRQLLLSDFELHAKGAVEEILRVGTPINWMRRTATGDCEMNGHRFRKGDE
IFLFYWSANHDEKVFEDAYRFDITRDPNPHLSFGAVGPHFCLGAHLARIEIIAMLREL
LASLPDIRVEGEPVRLASSFIEGFKELSCTF

125 Family

CYP125A1   Mycobacterium tuberculosis
           GenEMBL Z82098 (34154bp) gi 1666115 Rv3545c also AD000003
           coding region 8135-9436

CYP125A1   Mycobacterium bovis subsp. bovis AF2122/97, 
           NC_002945 complete genome complement(3927359..3928660)
           Gene = cyp125A1 100% match
           locus_tag =Mb3575c

CYP125A2   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV5841 57% to CYP125A1 from Mycobacterium tuberculosis

CYP125A3P  Mycobacterium leprae
           GenPept CAC30983
           NC_002677 2415021..2416227
           locus_tag="ML2024
           Sequence below is from TIGR primary nucleotide sequence for ML2024
           51% to CYP125A1 Rv3545c Z82098 Mycobacterium tuberculosis
   1 PGFDFPDPEIYTEQLSV*EPAEMCQAETI**NEQPIGRSGFYDDDY 138
     XXXXXXXXXXXXXX
 174 HSGTFSNLEKTALACYQEGMNDEQISRGKLVLLNIDASQYTRLHKIISPGFIP*AAEQLR 353
 354 DDLXXXXXXXXXXXXXXX 362 
 410 SGDFVEHVSCELSRQAAIAGLPSG 481
 480 VPQEDCKKLFHWSN 521
 522 QTVGAQDPKFATNDPMVTSVKLIM*AMQIAADRAKPLGQVIVTNLVEADIEGHKLSKDEFGSF 710 
 713 VIMLTAAGKENTRNCIMQSMMQFTNFPD*WELYK 814 
 816 KKAPGTTADKIIRQATLVMS 875
 876 FQRTVLK*YELSSVSIKKGQRVVVIYRSANFDEKVLTIRLPCSIMRNPT 1022
1022 PHAGFNDTNVHYCIGIN 1072
1073 LARMTIDRMFHAIAESMPNL*STGKPK*LRSGWLNGVKHWQVD 1201

CYP125A4   Streptomyces peucetis
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           75% to 125A2 before frameshifted region
           clone name SP0266

126 Family

CYP126A1   Mycobacterium tuberculosis
           GenEMBL Z80226 (34809bp) gi 1550656 Rv0778
           coding region 20888-22132

CYP126A1    Mycobacterium bovis subsp. bovis AF2122/97, 
            NC_002945 complete genome 873620..874864
            Gene = cyp126 100% match
            locus_tag = Mb0801

CYP126A2P  Mycobacterium leprae
           GenPept CAC31567       
           NC_002677 complement(1384839..1385327)
           locus_tag="ML1185
           Sequence below is from TIGR primary nucleotide sequence for ML1185
           37% to CYP126 C-terminal
184 DRSLIPSAIEEGSRSETPNWASVTRITIA*LAIGGKTILPNAGVDILMGSANRDGSRWTE 363
364 PNTFDIHWPRQAHTTLAGSHMCLGIGLAQLDTRVMLNNLFD 486

127 Family

CYP127A1   Rhizobium sp.
           GenEMBL Z68203(34010bp) 
           coding region 29431-30675
           also AE000101 Rhizobium sp. NGR234

CYP127A2   Rhizobium sp. BR816
           No accession number
           Ellen Luyten
           Submitted to nomenclature committee 4/12/2000
           73% identical to CYP127A1

CYP127A3v1 Mesorhizobium loti
           GenPept NP_106463
           NC_002678 complete genome 4745586..4746803
           79% to 127A2
  1 MAINPVPDHV PPEMVRDFSL FTSPGMPPTP NGDPHAAVAC AHDGPPIFYS PYNTQDGRGT
 61 WVITRAADQR KVLQDTETFS SHRSIFSSIL GETWPTIPLE LDPPAHGAFR SLLSPLLSPK
121 RVTALEPAVR ERAIALIDRI TASATSCDVM KDFAFPFTVS IFLRFLGLPD QGLDTFVGWA
181 KDLLHGDDVE RPVAARKIVA FIDELATNRR KDPVDDLMTF IVQAQIEGRR LTDGEIRGIG
241 VLVFVAGLDT VAAAIGFDLA YLARNLKDQE LLRSEPARIL LATEELLRAY PPIQLIRVAT
301 KDIDFEGAPI RKGDYVSCAT MIANRDPEEF ESPNTVDLAR DHNRHAAFGY GPHRCLGSHL
361 ARREIVIGLE EWLARIPTFR IKEGTAPITC GGHVFGIENL ILDWS

CYP127A3v2  Mesorhizobium loti
            GenEMBL AL672114 complement(100678..101895)
            Strain R7A symbiosis island
            Gene = msi332
            2 DIFFS with CYP127A3v1

CYP127A4   Rhizobium etli symbiotic plasmid p42d
           NC_004041 97484..98974
           81% to 127A2
           gene = cpxA5
MHLCSERIYRKRGTRENPMSTGRAGEASKKFRLRPTKQRGFRAA
RRSDRCIACHWRLALLRLEIWRSTILLAPSPRRIRSRRRGFDDRRKAVATIRVPEHVP
PEMVKDFSLFTSPGMERMPNGDPHAAVACLHNGPRIFYSPCNTRDGRGTWVIVRAQDQ
RKLLQDTGTFSSHRSLFASALGENWPLIPLELDPPAHSVFRSLLNPLLSPRRIMELEP
AVRDRAIALISKISASSTSCDILTDFAFPFAVSIFLRLLGLSDERLNTFVGWGKDLLH
GDGIRRTAAARTILAFIDELAAMRRKEPADDFMTFVVQAKVDGRLLRDQEIHGIGVLL
FVAGLDTVATAIGFDLAYLARNPTEQELLRSKPDRIVLAAEELLRAYSTVQMIRVATK
DINFEGAPIRKGDYISCATMIANRDPVEFENPNTIDLAREDNRHTAFAYGPHRCLGSH
LARREIIIGLEEWLSRIPDFRIKDGTAPITYGGHVFGMENLILDWS

128 Family

CYP128A1   Mycobacterium tuberculosis
           GenEMBL Z77163 (42861bp) gi 1449352 Rv2268c
           coding region 37021-38490

CYP128A1   Mycobacterium bovis subsp. bovis AF2122/97, 
           NC_002945 complete genome complement(2521761..2523230)
           Gene = cyp128 100% match
           locus_tag = Mb2291c

129 Family

CYP129A1       Steptomyces sp.
          GenEMBL U50973(3196bp)
          Dickens,M.L. and Strohl,W.R.
          Isolation and characterization of a gene from Streptomyces sp.
          strain C5 that confers the ability to convert daunomycin to
          doxorubicin on Streptomyces lividans TK24
          J. Bacteriol. 178, 3389-3395 (1996)
          gene name doxA

CYP129A2   Streptomyces peucetius 
           GenEMBL U77891 CDS comp (83..1330)
           gene="doxA"
           product="daunorubicin C-14 hydroxylase" 94% to 129A1 

130 Family

CYP130A1   Mycobacterium tuberculosis
           GenEMBL Z77137 (36096bp) gi 1480330 Rv1256c
           coding region 30691-31908 cy50.26

CYP130A1X  Mycobacterium bovis subsp. bovis AF2122/97, 
           NC_002945 complete genome
           CYP130 lies in a deletion in M. bovis

CYP130A2P   Mycobacterium leprae strain TN
            GenPept AL583920.1
            59% to CYP130A1
VMSHRFRFTTADIWPNPWSMYRTLRDHEAVHHVVPANQPEDDYYVLPRHADVWSMAMRS
HAKLSSAQRLTVNYSDMELIGLQDNPPMVMQDQPV*TKCRKLVSRRFTPRQTNVVEPKVR
HFVVEHIEQLRAKGSVDIVTELFKPLPPMVVAHYFGFPEKVRSQFDGW
TTAADGGGALFRFPRKSPITIRRLAPAIVAANTADAGGITNELDVAGYAVESMLAYFTR
IATGGNNTVTGMLGG*MPL
SHRRKQHRHWHARRLDAVKDTAEAD
LLRLTSSVRGLMRTTTRDVAIGHTTVSPGRRVLMRYGQAKRDER*YSAAAS*LDVTW*
PPNILIFSHGAH
YLGAKVTRMQRR
VRLTELLARYPDFEVDESSIAWAGGKLHTTP

131 Family

CYP131A1       Streptomyces peucetius
         GenEMBL L47164(3444bp)
         coding region 32-1348
         gene dnrQ  duanosamine biosynthesis
         possible sequence errors at C-terminal (no recognizable signature sequence 
         in the last 68 amino acids)

CYP131A2     Streptomyces sp.
         GenEMBL L35154 (4134bp)
         3838-4134 N-terminal fragment 94% identical to L47164
         gene dauQ daunomycin biosynthesis 

132 Family

CYP132A1  Mycobacterium tuberculosis
          GenEMBL Z80108 (40778bp) gi 1542902 Rv1394c
          complement (9842-11227)
          most often matches CYP4 family in blast search

CYP132A1   Mycobacterium bovis subsp. bovis AF2122/97, 
           NC_002945 complete genome complement(1566263..1567648)
           Gene = cyp132 1 aa diff
           locus_tag = Mb1429c

133 Family

CYP133A1     Erwinia herbicola
           Randy S. Fischer, Roy A. Jensen
           First P450 from an enteric bacteria (similar to E. coli)
           submitted to nomenclature committee

CYP133B1v1     Xylella fastidiosa, section 35 of 22. 
               AE003889 CDS complement(3751..4959)
               82% to AE003887 48% to CYP133A1

CYP133B1v1   Xylella fastidiosa 9a5c 
             GenPept AAF83187                 
             100% match
  1 MKLTDLSNPA FLENPYPLYE TLRAQAPFVS IGPNALMTGR YSLVDSLLHN RNMGKKYMES
 61 MRVRYGDSAA DMPLFQAFSR MFITINPPAH THLRGLVMQA FTGRESESMR PLAIDTAHQL
121 IDNFEQKPSV DLVAEFAFPF PMQIICKMMD VDIGDAVTLG IAVSKIAKVF DPSPMSADEL
181 VHASTAYEEL AQYFTKLIEL RRTHPGTDLI SMFLRAEEDG EKLTHDEIVS NVIMLLIAGY
241 ETTSNMIGNA LIALHRHPEQ LALLKSDLSL MPQAVSECLR YDGSVQFTMR AAMDDIEVEG
301 ELVPRGTVVF LMLGAANRDP AQFTHPDQLD ITRKQGRLQS FGAGIHHCLG YRLALIELEC
361 ALTTLFERLP HLRLAHLDAL NWNQRSNLRG VNTLIVDLHA KN

CYP133B1v2  Xylella fastidiosa Temecula1
            GenPept AAO29526                 
            6 diffs to CYP133B1v1
  1 MKLTDLSNPA FLENPYPLYE TLRAQAPFVS IGPNALMTGR YSLVDSLLHN RNMGKKYIES
 61 IRLRYGDTAA DMPLFQAFSR MFITINPPAH THLRGLVMQA FTGRESESMR PLAIDTAHQL
121 IDNFEQKPSV DLVAEFAFPF PMQIICKMMD VDIGDAVTLG MAVSKIAKVL DPSPMSADEL
181 VHASTAYEEL AQYFTKLIEL RRTHPGTDLI SMFLRAEEDG EKLTHDEIVS NVIMLLIAGY
241 ETTSNMIGNA LIALHRHPEQ LALLKSDLSL MPQAVSECLR YDGSVQFTMR AAMDDIEVEG
301 ELVPRGTVVF LMLGAANRDP AQFTHPDQLD ITRKQGRLQS FGAGIHHCLG YRLALIELEC
361 ALTTLFERLP HLRLAHLDAL NWNQRSNLRG VNTLIVDLHA KN

CYP133B1v3 Xylella fastidiosa Dixon
           NZ_AAAL01000071 complement(6849..8057)
           98% to 133B1v1 7 diffs 97% TO 133B1v2 11 diffs 
           gene = XfasA0474
MKLTDLSNPAILENPYPLYETLRAQAPFVSIGPNALMTGRYSLV
DSLLHNRNMGKNYMESMRVRYGDSAADMPLFQAFNRMFITINPPAHTHLRGLVMQAFT
GRESESMRPLVIDTAHQLIDNFEQKPSVDLVAEFAFPFPMQIICKMMDVDIGDAVTLG
MAVSKIAKVFDPSPMSADELVHASTAYEELAQYFTKLIELRRTHPGTDLISMFLRAEE
DGEKLTHDEIVSNVIMLLIAGYETTSNMIGNALIALHRHPEQLTLLKSDLSLMPQAVS
ECLRYDGSVQFTMRAAMDDIEVEGELVPRGTVVFLMLGAANRDPAQFTHPDQLDITRK
QGRLQSFGAGIHHCLGYRLALIELECALTALFERLPHLRLAHLDALNWNQRSNLRGVN
TLIVDLHAKN

CYP133B2v1  Xylella fastidiosa, section 33 of 22 
            AE003887 CDS 6723..7925
            82% to AE003889 48% to CYP133A1

CYP133B2v2  Xylella fastidiosa Ann-1
            NZ_AAAM01000051 complement(2764..3966)
            97% TO 133B2v1 8 diffs
            gene = XfasO1476
MKLADLSSPAFLENPYPLYETLRRQGPFVSIGPNALMTGRYSIV
DGLLHNRNMGKSYMESIRVRYGDDALDMPLFQGFNRMFLMLNPPVHTHLRGLVMQAFT
GRESESMRPLATDTAHRLIDDFEQKSSVDLVTEFSFPLPMRIICRMMDVDISDAISLS
VAVSNIAKVFDPAPMSPDELVHASAAYEELAHYFTRLIELRRAQPGTDLISMLLRAEE
EGQKLTHDEIVSNVILLLLSGYETASNMIGNALIALHRHPKQLARLKSDLSLMPQTVL
ECLRYDGSVQFTVRAAMDDVSIEGDVVPRGTIVFLMLGAANRDPAQFTDPDHLEITRK
QGRLQSFGAGVHHCLGYRLALVELECALTVLLERLPHLRLANLDTLSWNQRGNLRGVN
ALIADLHP

CYP133B2v3  Xylella fastidiosa Dixon
            NZ_AAAL01000066 complement(2275..3477)
            97% TO 133B2v1 9 diffs 10 diffs to CYP133B2v2
            gene = XfasA0420
MKLADLSSPAFLENPYPLYETLRRQGPFVSIGPNALMTGRYSIV
DGLLHNRNMGKSYMESIRVRYGDDALDMPLFQGFNRMFLMLNPPVHTHLRGLVMQAFT
GRESESMRPLAIDTAHRLIDDFEQKSSVDLVTEFSFPLPMRIICRMMHVDISDAISLS
VAVSNLAKVLDPAPMSPDELVHASAAYEELAHYFTRLIELRRAQPGTDLISMLLRAEE
EGQKLTHDEIVSNVILLLLGGYETTSNMIGNALIALHRHPKQLARLKSDLSLMPQAVL
ECLRYDGSVQFTIRAAIDDVSIEGDVVPRGTIVFLMLGAANRDPVQFTDPDHLEITRK
QGRLQSFGAGVHHCLGYRLALVELECALTVLLERLPHLRLANLDTLSWNQRGNLRGVN
ALIADLHP

CYP133B3   Xanthomonas axonopodis pv. citri str. 306
           GenPept AAM38014                 
           56% to 133B2
  1 MLLSDLATPQ FRHDPYPTYA RLREEGPLVQ VADGRLMSGR YAVVDRLLSD RRVGRDYLQS
 61 VRLRYGEAAV HLPLFQGMSR MFLLLNPPLH TQLRGLMTQA FGARQMESMR EVASDIAAGL
121 IDAFQANGHC DLLTEFAFPL PIAIICRMLD IAAADVTALS HATSALAKVF DPMMTAEELQ
181 ATSVAYDQLA TYFHGVIAQR RSAGGDDLIA RFIQAEDNGR RLSEEEIVSN VILLFFAGHE
241 TTSNMICNAL VALHRHPQQL RLLQETPGLL PNAVLECMRY DSSVQMATRT ALQDFEIEGV
301 AVPRGTMLYL MLGAANHDTL QFTDPQVLDI RRQQGRALSL GGGIHHCLGN RLALIEVEAA
361 LACLLARLPA LRLEQLDTLS WNDRANLRGV DALLASW

CYP133B4   Xanthomonas campestris pv. Campestris str. ATCC 33913
           GenPept AAM42318                
           BioI biotin synthesis
           52% to 133B2 65% to 133B3
  1 MQLSDFATPA FRQDPYPMYA RLRAAGPLVQ ISDNGWVSGH YTVVDALLSD RRVGRNYLDS
 61 IRVRYGANAA EMPLFQGMSR MFLLLNPPVH TQQRALMTKA FGARQLEALR EVAVDTADAL
121 LDQHEDRRSC DLLNDFAMPM TISLICRMLG LAVTDVAALG QASSALAKVF DPLMRPEDMA
181 QATAAYTTLE QYFRAIVLQR RDTQEDDLIA RLIAAEDHGQ RMPVDDIVSN VIMLFTAGHE
241 TTANMICNAL IALHRHPEQL QLLRDTPTLM PNAVLECMRY DSSVQVAMRS VLQPLQVEGT
301 TLPVGAILYL MLGSANHDAE QFTAPQQLDL RRQQGRALSF GGGVHHCLGN RLALIELETA
361 LERLLQRAPA LRLPELDNLS WNERANLRGI QALHATW

CYP133B5   Ralstonia solanacearum GMI1000 megaplasmid
           GenEMBL AL646080 77388..78584
           gene = RSp0709
           77% to CYP133B2v2
MKLADLSTPSFLENPYPLYETLRSQGPFVRIGPNALMTGHYSIV
DALLHNRQMGKSYMESIRLRYGDEGPNMPLFQGFSRMFLMLNPPMHTRLRGLMMQVFN
ARQIESMREVATATAHQLIDDFEQKPSADLVAEFAFPLPVRIICQMMDLDIDDAMALG
VGVSKLAKVFDPAPMSADALVETSAAYEELAQYFTKVIEARRAQPGTDLISMLMRAEE
NGETLTHDEIVSNVILLFIAGHETTSNMIGNALIALHRNPQQLDLLKREPSRMPNAVL
ECLRYDGSVQVTIRAALEDVEVEGEVLPRGTTVFLMLGAANRDPAQFTDPDQLDIGRQ
QGRLQTFGAGIHHCLGYRLALIELESALGALFERLPNLRLTNLDQLSWNQRGNLRGVN
ALMAAW

CYP134A1   Bacillus subtilis
           GenEMBL AF017113, Z99121, Z99122
           cypB also called cypX

CYP134A2P  Bacillus cereus ATCC 14579
           GenPept AAP10061   
           57% to CYP134A1 cypX Z99122 Bacillus subtilis C-term
 1 MIGATNCDSN VFERPDKFNV YRPDIDIKKA FSGTARHLAF GLSIYNCVGV AFAKLKIEID
61 STIKDNISRK KLRDIKDFVK KTSKMN

CYP134B1   Photorhabdus luminescens subsp. laumondii TTO1
           GenEMBL NC_005126 complete genome complement(313663..314886)
           locus_tag = plu0296
           46% to CYP134A1
MAKLSSFNIHDPKFIKNPYDFYDILHKQDLVYFEQSQNSYFIGK
YEDVDAILKSSIFNTKPLTALAEPVMGDRVLAQMEGEEHACKRKFIMQGLSRDYFNRY
YEPMIRKITEDLLQPYMEKGNIDIVNDFGRDYAVLVTLSILGLPSDNYRDIAEWHKGI
ASFITQFDQTELEKMHSLECSQKLIRLLKPIIDQRRRNPSKDIISIFCQDTAMSMSEI
TALCLNILLAATEPADKILAMMLNHLISNPSMLDVVLKDRSLVRDAFEETLRLTSPVQ
LIPREASEDVTISGIDIPKGAVVFCMIGAANRDPSVFHKPNEFDLYRRKNTTSPQKAN
RKRHLAFGAGTHACAAAAFSLSQLEVSSNIILDLLHNLRFADHYHYQETGVYTRGPSK
LLLSFDPIASSAIKE

CYP135A1    Mycobacterium tuberculosis
            GenEMBL Z96800
            Rv0327c

CYP135A1    Mycobacterium bovis subsp. bovis AF2122/97, 
            NC_002945 complete genome complement(393726..395075)
            Gene = cyp135A1 1 aa diff
            locus_tag = Mb0334c

CYP135B1    Mycobacterium tuberculosis
            GenEMBL AL021942
            Rv0568

CYP135B1    Mycobacterium bovis subsp. bovis AF2122/97, 
            NC_002945 complete genome 660693..662111
            Gene = cyp135B1 100% match
            locus_tag = Mb0583

CYP136A1    Mycobacterium tuberculosis
            GenEMBL Z83866
            coding region 23158-24636
            Rv3059

CYP136A1   Mycobacterium bovis subsp. bovis AF2122/97, 
           NC_002945 complete genome 3376038..3377516
           Gene = cyp136 1 aa diff
           locus_tag = Mb3085

CYP136B1    Mycobacterium abscessus
            GenPept AAN38721 
            46% to CYP136A1
  1 MDAVEAAQRP GGTMTNHLLA PAHHVKERLS SVIMVPAPHA VDDRWRRWSR DWPVRELAPA
 61 PAGSGLKAVR GDAGLPFVGH TLDYIRFGSD FSRERYDRLG SVSWMGAFGT KMVVIAGPDA
121 TREAFTSEAK AFSQDGWSFL IDAFFHRGLM LMSFDEHLMH RRIMQEAFTR PRLTGYVEQV
181 TPCVRSAVPA WPVGPSVRIY PLLKELTLDI ATDVFMGGRG KDESDAVNKA FVATVRAASS
241 LVRAPLPGTR FRAGVQGRRV LEDYFFRHLP AARAGETEDL FAALCQATTE DGERFSDEDV
301 VNHMIFLMMA AHDTSTITTT AVTYFLAKYP QWQEAAAAEA AAIGDGLPDI EALEKMTVID
361 RVIKEALRLL APVPLVMRKT VRDVAIDGYH IPSNTLCAIT PAVNHFDRTI WNDPERFDPS
421 RFDEPRREDQ HHRFAWVPFG GGAHKCIGMQ FGTLEVKAIL HRMLRSFTWK VPENYHVRWD
481 NTSLPIPVDG LPLEMKRR

CYP137A1    Mycobacterium tuberculosis
            GenEMBL AL022121
            Rv3685c

CYP137A1   Mycobacterium bovis subsp. bovis AF2122/97, 
           NC_002945 complete genome complement(4064642..4066072)
           Gene = cyp137 1 aa diff
           locus_tag = Mb3710c

CYP138A1    Mycobacterium tuberculosis
            GenEMBL Z92770
            Rv0136

CYP138A1    Mycobacterium bovis subsp. bovis AF2122/97, 
            NC_002945 complete genome 163556..164881
            Gene = cyp138 100% match
            locus_tag = Mb0141

CYP138A2P   Mycobacterium leprae
            GenPept CAC32181 GenEMBL AL583926
            NC_002677 complement(3167438..3168451)
            locus_tag = ML2648
            Sequence below is from TIGR primary nucleotide sequence for ML2648
            40% to CYP138
  1 NRVAREIVVEVIYGALFGAFEALSGLVPQDTVLGPMGRYSMAPSLIR
439 ITINVIMRAGFGSELDELRRLHPTAATL 522
    RWTVERQARCNHDIFMLDSRSTAERLRRRLHGTCMKNH 351
352 VRIFEAEPLWGLRTGLKASLLPHCRLINRITINVIMRAGFGSELDELRRLHPTAA 516
517 TLVGLF*LLSQHLGVLADPSSMGATMPGDDPAPALRQATIPG
638 LGVQWTRTVIDFAARRVYSSVYHLSEWAIPREDSILISIAQIYXXXXXXXXXXXX 766
795 DPRRYVEHKPSSFAWI 842
    PFSGGT 861
862 SRCVSICQDGDGMNVVLKMVLRYWIIDTTTAPGER*HLRGVVYTPRNGGR 1011

CYP139A1    Mycobacterium tuberculosis
            GenEMBL Z95617 GenPept AAK45973 (with 7 more aa at N-term)
            Rv1666c

CYP139A1    Mycobacterium bovis subsp. bovis AF2122/97, 
            NC_002945 complete genome complement(1877656..1878948)
            Gene = cyp139 start codon differs by 6 aa
            locus_tag = Mb1694c

CYP139A2P   Mycobacterium leprae 
            GenPept CAC31622 GenEMBL AL583921.1
            NC_002677 complement(1474970..1475188) and
            complement(1474991..1475161)
            locus_tags = ML1237 ML1238
            61% to CYP139A1
GAAVATTSMTVILARLASRTRLHLLAHYTHRVRARNFAALIP*LSLTVEVINSMPTQ

CYP140A1    Mycobacterium tuberculosis
            GenEMBL Z97193
            Rv1880c

CYP140A1    Mycobacterium bovis subsp. bovis AF2122/97, 
            NC_002945 complete genome complement(2120751..2122067)
            Gene = cyp140A1 100% match
            locus_tag = Mb1912c

CYP140A2    Mycobacterium ulcerans
            No accession number
            Pam Small
            Submitted to nomenclature committee 10/17/2003
            62% to CYP140A1

CYP140A3P  Mycobacterium leprae TN
           GenEMBL L01095.1
           48% to 140A1
VRQRLHWFAQYGFIRGIAATHH
RRSDPLARLDIALAIKANPVP
YCHKPRPRRPLVQSRISYLTANRAITHELLQSEDFHVFWLNVTLPAPSHWL
RRRTGYRTSSQYNL
LHPLLAIQ*AYHIHYRKTVSPLFAPKAVATLRDRIEQTTLALLDQLAHQHDVVDVVNRY
CSQLPVAVISDILGYP
VPDRDRSHILKFGELVAPSLDVELT*Q*YQQA*REVAGFNFWL
LKHLPQLQRTPGDNLVRHLSH*EDNKPTEISLSKSKLQAISG
GLVLATGGETTVNLLGRGI
LLLDTPEHMVMLQACPEPGHKRG*EILRLDSPIQMAARVARKDVDLAGSTIKRSQVVVLY
FGRSQPGPVRLCRSR*VQHRTPQCGKESRIFR*QEFCLENALTRAYNAVGLRAFFDHLP*
TRAAGTRSRLDTRVLRGWSTLPIALGPTRSMVS

CYP140A4   Mycobacterium avium subsp. paratuberculosis 
           GenEMBL AJ250018 complement(2795..>3145)
           59% to 140A1 runs off end
GAAARQSRPVGPRRSRRSCDRQPGPDDRAHAPPATSTSAPAMVG
LVPRRARNRDPKVFSDPTTFDVTRPNAREHLAFASGIHACLGAALARIEGATCARSFE
NFPDRSSRARNGGR

CYP141A1    Mycobacterium tuberculosis
            GenEMBL Z95150
            Rv3121
            cosmid cY164 from Sanger Centre
            coding region 29289-30488

CYP141A1P   Mycobacterium bovis subsp. bovis AF2122/97, 
            NC_002945 complete genome 3441289-3441483
            aa 337-400 (first part of gene is in a deletion)
IAFGYGPHACPASAYSRMCLTTFFTSLTQRFPQLQLARPFEDLERRGKGLHSVGIKELLVTWPT*

CYP142A1    Mycobacterium tuberculosis
            GenEMBL
            Rv3518c

CYP142A1aP  Mycobacterium bovis subsp. bovis AF2122/97, 
            NC_002945 complete genome complement(3898119..3898736)
            gene = CYP142A1a aa 1-197 100%
            locus_tag = Mb3548c
            In Mycobacterium bovis, a frameshift due to a single base
            deletion (c-*) splits cyp142 into 2 parts (pseudogene)

CYP142A1bP  Mycobacterium bovis subsp. bovis AF2122/97, 
            NC_002945 complete genome complement(3897541..3898122)
            gene = CYP142A1b aa 207-end 100%
            locus_tag = Mb3547c
            In Mycobacterium bovis, a frameshift due to a single base
            deletion (c-*) splits cyp142 into 2 parts (pseudogene)

CYP143A1    Mycobacterium tuberculosis
            GenEMBL AL022021
            Rv1785c

CYP143A1    Mycobacterium bovis subsp. bovis AF2122/97, 
            NC_002945 complete genome complement(2013905..2015086)
            Gene = cyp143A1 100% match
            locus_tag = Mb1813c

CYP143A2P   Mycobacterium leprae
            GenPept CAC30494                
            NC_002677 1861010..1862160
            locus_tag = ML1542
            P450 pseudogene
            Sequence below is from TIGR primary nucleotide sequence for ML1542
            55% to CYP143 Mycobacterium tuberculosis Rv1785c
   1 MSTSAKANPTHFTYCSLNYSALSMITDRGVIWKTLXX 105
 113 AKPVVFMNG*YYLNVSRKCILHTTSITKGFSSREAXXX 217
 225 PGNALPVLPXXXXXXXXXXXXXXXXXXX 251 
 278 SLNNLNKALPALRTYTVTMANAITSRGEW 364 
 366 EAMTDFANX 389
 391 LFPLQLFLVL*GLXX 429
 434 AQDRDHLIALLKDVVIGMSDKPFLSQADIADQGELCEYLVDTIAERKQNPA 586
 585 PDVLSQVLIGEDPLSEIKVLDLESL 659
 659 MLILAELDTVTATVGFSLLQPACRQQLRTMLRDKPKQIRILIED 790
 792 ILQLEPPAQITPYITTEFVNVDGMTLSPGSRVRLC 896
     XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 993 GSHLARLKLTLAVDEWLINI 1052 
     XXXXXXXXXXXXXXXXXX
1116 LFALKALALHW 1148

CYP144A1    Mycobacterium tuberculosis
            GenEMBL Z97345
            Rv1777

CYP144A1    Mycobacterium bovis subsp. bovis AF2122/97, 
            NC_002945 complete genome 2001114..2002418
            Gene = cyp144 1 aa diff
            locus_tag = Mb1806

CYP145      Nocardioides sp.
            GenEMBL AB000735
            gene for 2-carboxybenzal

CYP146      Amycolatopsis orientalis
            GenEMBL AJ223998
            cosmid PCZA361 (gene 2 of 2)

CYP147A1    Myxococcus xanthus Partial missing C-term 
            GenEMBL AF111947 CDS 1939..>2877
            42% to AF087022 partial  new family

CYP147B1    Streptomyces avermitilis
            No accession number
            Submitted by David Lamb and Haruo Ikeda 9/3/02
            Clone name SAV584 50% to 147A1 from Myxococcus xanthus
            (147A1 is missing C-term so could be higher % identity)

CYP147C1    Streptomyces tubercidicus strain I-1529
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name CypEA
            50% to 147A1

CYP147D1    Magnetospirillum magnetotacticum
            GenEMBL NZ_AAAP01002628 
            complement(814..1824) gene = Magn3224
            N-terminal is about 61 aa short
            51% to 147B1 44% to 147C1
MCAPGPGRDPQCGTGRSSSGDPPDHDRLRGQVMRCFTPQRVRGM
REKTRRITDDLIAKMAGKTRIDLVDDFSYPLPVTVICELLGVPPEDEAQFHGWATQLA
TALEPNQRGDEETQAKNEVCFNEIADYIQGLIKEKRKNPQEDILSDLATDTDGMNDFD
LIATAVLLLVAGHETTVNLITNGMLTLLRFPEHLERLRAEPETAPRLIEELLRYEPPV
HYRTRLALADIPVAGITIPKDAPVILLLAAANRDPLRFSDPDRFDPDRPDNRHLGFGG
GLHYCVGAPLARIEAEVALVSLVRRLKGLSLTENPPPYRPGASLRGPCHLRLALEEVA
EG

CYP147E1   Methanosarcina barkeri Archaea; Euryarchaeota
           GenEMBL NZ_AAAR01001943 4935..6305
           52% to 147D1 probable lateral transfer
           gene = Meth3340
MYRQGSGPNDRRQTMTQQSLYEQVLDYANRANPYPLYAKLRQTP
ITRQIDGSYVVSTYREIVSLLHDPRIGSDFRMRSA
HDRPSAGLSANQELASKNQAQDEGAETSSSNQGSETEVV
PSFIGLDPPEHDRLRRQATWPFGPPHTPGRVADMEPELILLA
NRQIDTIKGRTSIDIVEDFAYPIPVTMISELLGVPPEDQPRLHALSEAIIEDIDLDPR
QSPEEQKRRQEQSSQTFKELEQYMEVLIEHHRKQPGSDLLSGLITDHGSDGPMAQADL
VSTASLLLIAGHETTVNLITNGMLTLLRHPDVLERLRREPDLVIRLVEEFLRYEPPVQ
ILPNRVALSDITIAGTTIQKGSPVILLLASGSRDPARFHDPEKFDPDRRDNMHLGFGS
GIHYCYGAPLARLETQIALTELVQRLENPRLAHDPPPYRQSATLRGPRHLIVEIDGVK
DWEFHL

CYP147F1   Streptomyces peucetis
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           55% to 147B1 if 9 aa removed, 54% to 147E1
           clone name SP0549

CYP148A1    Deinococcus radiodurans R1 
            GenEMBL AE002083 CDS 1719..2948
            38% to AL049754 complement(10413..11648)

CYP148A1    Deinococcus radiodurans
            GenPept AAF12079 
            GenEMBL NC_001263 2539498..2540727
            Gene = DR2538
  1 MTASSGSSAP SSGPLLAAVQ GLWSGAALAD PHPIYEQIRG FANADGLVRL PEWNTAFAVG
 61 HAATSAVLRS PAARSGEWDH GPSDGGKLLQ HMMLFRNGIP HARLRGLVQK AFTPRVVEEQ
121 RDLVRSLLDE LLSDMARAGG PVDLVAGLSG PLPGRVIMRM LGLRGADEER FLGWSASVAE
181 LLGGADRSPA LLARIEADAR EMRGYFRDLA DELRVSPQPG LLSALAAVED GGERLSGDEL
241 LSNAVLLLAA GHETTSNLIP GGVLALSQQP GAWAALLNHP RHPGVADELL RHVSPVQLDG
301 RMLTEAQTVG ETPLPAGTPV QLLLAAANRD PQVFPDPERL DWDRPNASRH LAFAAGPHYC
361 LGASLARLEI AETFAALAER FPDLRVSAAP HYKANFVLRG PQELWVTLG

CYP149A1    Microcystis aeruginosa 
            GenEMBL AB036790 CDS complement(779..2254)
            gene="mapks"
            41% to 107H1 partial seq new family

CYP150A1     Mycobacterium species
             GenEMBL AF107046
             Pascal Poupin
             gene 1

CYP150A2     Mycobacterium smegmatis mc2155
             GenEMBL AF107047 1092..2405
             Pascal Poupin
             gene 2 
MTDSTATDPAATTPDFDTVDYFTDQSLVPDPHPYFDHLRSKCPV
VREPHYGVLAITSFEEATTVLKDTETFSSCIAVGGPFPPLPFTPEGDDITGQIEQHRT
QLPMFEHMVTMDPPEHTNARSLLNRLLTPKRLKENEDFMWRLADECLDDFIDDGSCEF
LKQYAKPFSLLVIADLLGVPEEDHDEFRHVLGAPRPGAIVGSLDGDQLAMNPLAWLDD
KFVRYLEDRRKEPRDDVLTALATAKYPDGSTPEVIDVVRSATFLFAAGQETTTKLLSA
SLRVLGDRPDIQQALREDRSRIPTFVEEALRMDAPVKSQFRLAKKTTQLGGVDVPAGT
TLMVCPGAVNRDPVRFEDPHTFSLDRKNVREHIAFGRGVHSCPGGPLARVEGRVSLER
ILDRMADIRIDEEHHGPADNRRYTYEPTYILRGLTDLHIKFEPVR

CYP151A1     Mycobacterium smegmatis
             GenEMBL AF102510
             Poupin P, Ducrocq V, Hallier-Soulier S, Truffaut N
             Cloning and Characterization of the Genes Encoding a Cytochrome
             P450 (PipA) Involved in Piperidine and Pyrrolidine Utilization and
             Its Regulatory Protein (PipR) in Mycobacterium smegmatis mc2155.
             J Bacteriol 181, 3419-3426 1999

CYP151A2     Mycobacterium sp. strain RP1
             GenEMBL AJ310142
             Pascal Poupin 
             Submitted to nomenclature committee March 22, 2001
             86% identity in 399 aa overlap with CYP151A1

CYP152A1    Bacillus subtilis
            GenEMBL AB006424
            ybdT gene
            this sequence is missing part of the heme signature sequence, but has 
            PERF and EXXR

CYP152A2    Clostridium acetobutylicum
            GenPept AAK81262                 
            YBDT B.subtilis ortholog
            59% to 152A1
  1 MLLKENTAKD KGIDSTLDLL KEGYLFIKNR ADHYQSDLFE TRLMGQRIIC MTGEEAARIF
 61 YDSDKFKRQG AAPKRVQETL LGENAIQTLD GESHLHRKKL FMLLTNQVQQ KRLAELTTEK
121 WEASASKWHT KSIVLFNEAN EILCQVACHW AGVPLMESDI KNRAEDFSSM IDSFGAVGPR
181 HWKGKKARNT IEAWIKEIIE NVRSGRIRAE EGSPLHEIAF YIDVNGQQMP AEMAAIELIN
241 ILRPIVAIST FITFSALALY EHSEYREKLQ SKDIRYLEMF TQEVRRYYPF APFVGARVRK
301 DFLWNNCEFK KEMLVLLDIY GTNHDSRIWQ KPYEFIPDRF RSYKGNLFDF IPQGGGDPSS
361 THRCPGEGIT LEIMKTSLDF LSTKIDFTVP DQDLSYSLSK IPTLPKSGFI IDNINLKL

CYP152B1    Sphingomonas paucimobilis
            GenEMBL AB006957
            Isamu Matsunaga
            this sequence is missing part of the heme signature sequence, but has 
            PERF and EXXR

CYP152B2    Azotobacter vinelandii
            NZ_AAAU02000007 102969-104183
            56% to