P450s that have appeared since the 1993 P450 nomenclature update.
This is part E of the bibiographic P450 files.
This section contains bacterial sequences CYP101 to CYP174.
This includes references that were incomplete and duplications
of sequences that were already in the update. If a sequence
is assigned an accession number that was not in the old update
it is included in this list. 48 new P450s were added July 27, 2000
Four new sequences were added Jan. 9, 2001 CYP102C1, CYP172-174.
Added CYP175A1 9/17/2001
Compiled by David R. Nelson
Last modified June 2, 2003 added 25 new sequences.
Last modified Nov. 5, 2003 There are now 501 bacterial P450s
51 Family
101 Family
102 Family
103 Family
104 Family
105A Subfamily
105B Subfamily
105C Subfamily
105D Subfamily
105E Subfamily
106 Family
107A Subfamily
107B Subfamily
107C Subfamily
107D Subfamily
107E Subfamily
107F Subfamily
107G Subfamily
107H Subfamily
107J Subfamily
108 Family
109 Family
110 Family
111 Family
112 Family
113A Subfamily
113B Subfamily
114 Family
115 Family
116 Family
117 Family
118 Family
119 Family
120 Family
121 Family
122 Family
123 Family
124 Family
125 Family
126 Family
127 Family
128 Family
129 Family
130 Family
131 Family
132 Family
133 Family
51 Family
CYP51 Mycobacterium tuberculosis
GenEMBL Z80226 (34809bp) gi 1550642 Rv0764c
complement (6140-7495)
33.7% identical to CYP51 over 439AA overlap
this is a bacterial CYP51
CYP51 Mycobacterium bovis subsp. bovis AF2122/97
NC_002945 complete genome complement(858662..858868)
CYP51 100% match
locus_tag = Mb0786c
CYP51 Mycobacterium avium
TIGR contig:3273:m_avium Length = 5,475,738
79% to CYP51 M. tuberculosis
3021360 TSTVVPRVSGGEEEHGHLEEFRTDPIGLMQRVRDECGDVGWFQLVDKHVILLSGAQANEF 3021539
3021540 FFRSADEDLDQAEAYPFMTPIFGKGVVFDASPERRKEMLHNSALRGEQMKGHASTIEGEV 3021719
3021720 KKMIADWGDEGEIELLDFFAELTIYTSTACLIGLKFREQLDHRFAEYYHDLERGTDPLCY 3021899
3021900 VDPYLPIESFKRRDEARVKLVALVQEIMDQRLANPPKDKADRDMLDVLVSIKDEDGKPRF 3022079
3022080 SADEITGMFISLMFAGHHTSSGTSAWTLIELIRHPDVYAEVLAELEELYADGQEVSFHAL 3022259
3022260 RSIPKLDNVVKETLRLHPPLIILMRVAKGEFEVEGFPIHEGDYVAASPAISNRIPEDFPD 3022439
3022440 PDAFKPDRYNKPEQADIVNRWTWIPFGAGRHRCVGAAFAQMQIKAIFSVLLREYDFEMAQ 3022619
3022620 PADSYRNDHSKMVVQLARPAKVRYRKR 3022700
CYP51 Mycobacterium smegmatis
TIGR contig:3439:m_smegmatis Length = 6,989,783
80% to CYP51 M. tuberculosis
4858809 VPRVSGGEEEHGHLEEFRTDPIGLMKRVRSECGDVGWFQLADKQVVLLSGAEANEFFFRS 4858988
4858989 SDSELNQAEAYPFMTPIFGEGVVFDADPERRAEMLHNTALRGEQMKGHAATIENEVRRMV 4859168
4859169 ESWGDEGEIDLLEFFAELTIYTSTACLIGVKFRNQLDKRFADYYHLLERGTDPLCYVDPY 4859348
4859349 LPIESFRIRDEARANLVELVQEVMNGRIANPPKDKSDRDLLDVLVSIKDEDGTPRFSANE 4859528
4859529 VTGMFISLMFAGHHTSSGTASWTLIELLRHPEFYAKVQAELDDLYADGQEISFHALRQIP 4859708
4859709 NLDNALKETLRLHPPLIILMRVAQDEFEVAGRPIHKGQMVAASPAISNRIPEDFPDPDTF 4859888
4859889 DPDRYDKPRQEDLINRWTWIPFGAGKHRCVGAAFAQMQIKAIFSVLLRDFEFEMAQPSES 4860068
4860069 YRNDHSKMVVQLARPAKVRYRRR 4860137
CYP51 Methylococcus capsulatus
TIGR contig:221:m_capsulatus
49% to CYP51 M. tuberculosis
NOTE FUSION PROTEIN EXTENDS C-TERMINAL.
SEE J. Biol. Chem., Vol. 277, Issue 49, 46959-46965, December 6, 2002
A Novel Sterol 14-Demethylase/Ferredoxin Fusion Protein (MCCYP51FX) from
Methylococcus capsulatus Represents a New Class of the Cytochrome P450
Superfamily
Colin J. Jackson¤, David C. Lamb¤, Timothy H. Marczylo, Andrew G. S. Warrilow, Nigel J. Manning¦, David J. Lowe, Diane
E. Kelly, and Steven L. Kelly
908332 MSHPPSNTP
908305 PVKPGGLPLLGHILEFGKNPHAFLMALRHEFGDVAEFRMFHQRMVLLTGSQASEAFYRAP 908126
908125 DEVLDQGPAYRIMTPIFGRGVVFDARIERKNQQLQMLMPALRDKPMRTYSEIIVAEVEAM 907946
907945 LRDWKDAGTIDLLELTKELTIYTSSHCLLGAEFRHELNTEFAGIYRDLEMGIQPIAYVFP 907766
907765 NLPLPVFKRRDQARVRLQELVTQIMERRARSQERSTNVFQMLIDASYDDGSKLTPH 907598
907597 EITGMLIATIFAGHHTSSGTTAWVLIELLRRPEYLRRVRAEIDALFETHGRVTFESLRQM 907418
907417 PQLENVIKEVLRLHPPLILLMRKVMKDFEVQGMRIEAGKFVCAAPSVTHRIPELFPNPEL 907238
907237 FDPDRYTPERAEDKDLYGWQAFGGGRHKCSGNAFAMFQIKAIVCVLLRNYEFELAAAPE 907061
907060 SYRDDYRKMVVEPASPCLIRYRRRDAP 906980
101 Family
CYP101A1 Pseudomonas putida
GenEMBL D00528 (1950bp)
Koga,H., Yamaguchi,E., Matsunaga,K., Aramaki,H. and Horiuchi,T.
Cloning and nucleotide sequences of NADH-putidaredoxin reductase
gene(camA) and putidaredoxin gene(camB) involved in cytochrome
P-450cam hydroxylase of Pseudomonas putida
J. Biochem. 106, 831-836 (1989)
Note: only the last 93 nucleotides of the cam gene was cloned along
with two downstream genes.
CYP101A1 Pseudomonas putida
PIR C60886 (last 8 amino acids)
Romeo, C., Moriwaki, N., Yasunobu, K.T., Gunsalus, I.C.,
Koga, H.
Identification of the coding region for the putidaredoxin
reductase gene from the plasmid of Pseudomonas putida.
J. Protein Chem. 6, 253-261 (1987)
CYP101B1 Novosphingobium aromaticivorans
NZ_AAAV01000165.1
complement(29626..30870) gene = Saro2804
43% to CYP101
MLPHDRGQNSTRRITAMEAPAHVPADRVVDIDIYMPPGLAEHGF
HKAWSDLSAGNPAVVWTPRNEGHWIALGGEALQEVQSDPERFSSRIIVLPKSVGEMHG
LIPTTIDPPEHRPYRQLLNAHLNPGAIRGLSESIRQTAVDLIEGFAAQGHCNFTAQYA
EQFPIRVFMALVGIEASEAPRIRHWAECMTRPGMDMTFDEAKAVFFDYVGPLVDARRE
TPGEDMISAMINADLGDGRRLTRDEALSVVTQVLIAGLDTVVNVLGFIMRELAGNPAL
RADLRQRGADILPVVHELFRRFGLVSIAREVRRDIEFHGVHLKAGDMIAIPTQVHGLD
PRVNPDPLAIDPSRKRARHSTFGSGPHMCPGQELARKEVAITLEEWLRRIPDFALGPN
SDLSPVPGIVGALRRVELVWNT
CYP101C1 Novosphingobium aromaticivorans
NZ_AAAV01000133.1
complement(4199..5389) gene = Saro1574
44% to CYP101A1
MIPAHVPADRVVDFDIFNPPGVEQDYFAAWKTLLDGPGLVWSTA
NGGHWIAARGDVVRELWGDAERLSSQCLAVTPGLGKVMQFIPLQQDGAEHKAFRTPVM
KGLASRFVVALEPKVQAVARKLMESLRPRGSCDFVSDFAEILPLNIFLTLIDVPLEDR
PRLRQLGVQLTRPDGSMTVEQLKQAADDYLWPFIEKRMAQPGDDLFSRILSEPVGGRP
WTVDEARRMCRNLLFGGLDTVAAMIGMVALHLARHPEDQRLLRERPDLIPAAADELMR
RYPTVAVSRNAVADVDADGVTIRKGDLVYLPSVLHNLDPASFEAPEEVRFDRGLAPIR
HTTMGVGAHRCVGAGLARMEVIVFLREWLGGMPEFALAPDKAVTMKGGNVGACTALPL
VWRA
CYP101D1 Novosphingobium aromaticivorans
NZ_AAAV01000085.1
complement(6803..8068) gene = Saro0669
44% to CYP101
MNAQTSTATQKHRVAPPPHVPGHLIREIDAYDLDGLEQGFHEAW
KRVQQPDTPPLVWTPFTGGHWIATRGTLIDEIYRSPERFSSRVIWVPREAGEAYDMVP
TKLDPPEHTPYRKAIDKGLNLAEIRKLEDQIRTIAVEIIEGFADRGHCEFGSEFSTVF
PVRVFLALAGLPVEDATKLGLLANEMTRPSGNTPEEQGRSLEAANKGFFEYVAPIIAA
RRGGSGTDLITRILNVEIDGKPMPDDRALGLVSLLLLGGLDTVVNFLGFMMIYLSRHP
ETVAEMRREPLKLQRGVEELFRRFAVVSDARYVVSDMEFHGTMLKEGDLILLPTALHG
LDDRHHDDPMTVDLSRRDVTHSTFAQGPHRCAGMHLARLEVTVMLQEWLARIPEFRLK
DRAVPIYHSGIVAAVENIPLEWEPQRVSA
CYP101D2 Novosphingobium aromaticivorans
NZ_AAAV01000042
complement(5601..6899) gene = Saro0208
63% to 101D1
MGTTRMDTFNPQESRLATNFDEAVRAKVERPANVPEDRVYEIDM
YALNGIEDGYHEAWKKVQHPGIPDLIWTPFTGGHWIATNGDTVKEVYSDPTRFSSEVI
FLPKEAGEKYQMVPTKMDPPEHTPYRKALDKGLNLAKIRKVEDKVREVASSLIDSFAA
RGECDFAAEYAELFPVHVFMALADLPLEDIPVLSEYARQMTRPEGNTPEEMATDLEAG
NNGFYAYVDPIIRARVGGDGDDLITLMVNSEINGERIAHDKAQGLISLLLLGGLDTVV
NFLSFFMIHLARHPELVAELRSDPLKLMRGAEEMFRRFPVVSEARMVAKDQEYKGVFL
KRGDMILLPTALHGLDDAANPEPWKLDFSRRSISHSTFGGGPHRCAGMHLARMEVIVT
LEEWLKRIPEFSFKEGETPIYHSGIVAAVENVPLVWPIAR
102 Family
CYP102A1 Bacillus megaterium
Ruettinger,R.T.,Wen, L.-P. and Fulco, A.J.
Coding Nucleotide, 5'-Regulatory, and Deduced Amino Acid Sequences of
P450BM-3, a Single Peptide Cytochrome P450:NADPH-P450 Reductase from
Bacillus megaterium.
J. Biol. Chem. 264, 10987-10995 (1989)
CYP102A1 Bacillus megaterium
GenEMBL J04832 (4957bp)
Ravichandran,K.G., Boddupalli, S.S., Hasemann,C.A.,
Peterson,J.A. and Deisenhofer,J.
Crystal structure of hemoprotein domain of P450BM-3, a prototype
for microsomal P450s.
Science 261, 731-736 (1993)
P450 is N-terminal
CYP102A2 Bacillus subtilis
GenEMBL D87979
Yamamoto, H., S. Uchiyama, F. A. Nugroho, and J. Sekiguchi.
A 23.4 kb segment at the 69 degrees-70 degrees region of the
Bacillus subtilis genome.
Microbiology. 143, 1317-20 (1997)
Gene name yfnJ 66.4% identical to CYP102A1 P450 part only
also called YetO (fusion of P450 and reductase like CYP102A1, P450 part is
N-terminal)
CYP102A3 Bacillus subtilis
GenEMBL U93874, Z99117
Sorokin, A., A. Bolotin, B. Purnelle, H. Hilbert, J. Lauber, A.
Dusterhoft, and S. D. Ehrlich.
Sequence of the Bacillus subtilis genome region in the vicinity of
the lev operon reveals two new extracytoplasmic function RNA
polymerase sigma factors SigV and SigZ.
Microbiology. 143, 2939-43 (1997)
Gene name yrhJ most similar to CYP102A2
(fusion of P450 and reductase like CYP102A1 P450 part is N-
terminal)
CYP102A4 Bacillus anthracis str. Ames
GenPept AAP27014
bifunctional P-450:NADPH-P450 reductase 1
79% to 102A2
1 MDKKVSAIPQ PKTYGPLGNL PLIDKDKPTL SFIKLAEEYG PIFRMQTLSD TIIVVSGHEL
61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETQEPNWQ KAHNILMPTF SQRAMKDYHA
121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSFYR ETPHPFITSM
181 TRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSE NQEENDLLSR
241 MLNVQDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR
301 VLTDSTPTYQ QVMKLKYIRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV
361 LIPQLHRDKD AWGDNVEEFQ PERFEELDKV PHHAYKPFGN GQRACIGMQF ALHEATLVMG
421 MLLQHFEFID YEEYQLDVKQ TLTLKPGDFK IRIVPRNQTI SHTTVLAPTE EKLKNHEIKQ
481 QVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVAAL NDRIGSLPKE
541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKG DELKGVQYAV FGCGDHNWAS TYQRIPRYID
601 EQMAQKGATR FSTRGEADAS GDFEEQLEQW KQRMWSDAMK VFGLELNKNM EKERSTLSLQ
661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSERSTRHIE ISLPEGATYK EGDHLGVLPI
721 NSEKNVNRIL KRFGLNGKDQ VILSASGRSV NHIPLDSPVR LYDLLSYSVE VQEAATRAQI
781 REMVTFTACP PHKKELESLL EDGVYQEQIL KKRISMLDLL EKYEACEIRF EPFLELLPAL
841 KPRYYSISSS PLVAQDRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP
901 QSNFQLPENP ETPIIMVGPG TGIAPFRGFL QARRVQKQKG MNVGEAHLYF GCRHPEKDYL
961 YRTELENDER DGLISLHTAF SRLEGQAKTY VQHVIKEDRI HLISLLDNGA HLYICGDGSK
1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR LQEEGRYGKD VWAGI
CYP102A5 Bacillus cereus ATCC 14579
GenPept AAP10153
NADPH-cytochrome P450 reductase/P450 fusion
79% to 102A2 Bacillus subtilis
1 MEKKVSAIPQ PKTYGPLGNL PLIDKDKPTL SFIKIAEEYG PIFQIQTLSD TIIVVSGHEL
61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETHEPNWK KAHNILMPTF SQRAMKDYHA
121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSFYR ETPHPFITSM
181 TRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSG DQEENDLLSR
241 MLNVPDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR
301 VLTDPTPTYQ QVMKLKYMRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV
361 LIPQLHRDKD AWGDNVEEFQ PERFEELDKV PHHAYKPFGN GQRACIGMQF ALHEATLVMG
421 MLLQHFELID YQNYQLDVKQ TLTLKPGDFK IRILPRKQTI SHPTVLAPTE DKLKNDEIKQ
481 HVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVVAL NDRIGSLPKE
541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKP DELKGVQYAV FGCGDHNWAS TYQRIPRYID
601 EQMAQKGATR FSKRGEADAS GDFEEQLEQW KQNMWSDAMK AFGLELNKNM EKERSTLSLQ
661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSDRSTRHIE VSLPEGATYK EGDHLGVLPV
721 NSEKNINRIL KRFGLNGKDQ VILSASGRSI NHIPLDSPVS LLALLSYSVE VQEAATRAQI
781 REMVTFTACP PHKKELEALL EEGVYHEQIL KKRISMLDLL EKYEACEIRF ERFLELLPAL
841 KPRYYSISSS PLVAHNRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP
901 QSNFELPKDP ETPIIMVGPG TGIAPFRGFL QARRVQKQKG MNLGQAHLYF GCRHPEKDYL
961 YRTELENDER DGLISLHTAF SRLEGHPKTY VQHLIKQDRI NLISLLDNGA HLYICGDGSK
1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR VQDEGRYGKD VWAGI
CYP102A6 Bradyrhizobium japonicum USDA 110
GenPept BAC48147
NC_004463 complete genome 3173438..3176674
NADPH-cytochrome P450 reductase/P450 fusion
54% to 102A2
1 MSSKNRLDPI PQPPTKPVVG NMLSLDSAAP VQHLTRLAKE LGPIFWLDMM GSPIVVVSGH
61 DLVDELSDEK RFDKTVRGAL RRVRAVGGDG LFTADTREPN WSKAHNILLQ PFGNRAMQSY
121 HPSMVDIAEQ LVQKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE
181 SLVRSLETIM MTRGLPFEQI WMQKRRKTLA EDVAFMNKMV DEIIAERRKS AEGIDDKKDM
241 LAAMMTGVDR STGEQLDDVN IRYQINTFLI AGHETTSGLL SYTLYALLKH PDILKKAYDE
301 VDRVFGPDVN AKPTYQQVTQ LTYITQILKE ALRLWPPAPA YGISPLADET IGGGKYKLRK
361 GTFITILVTA LHRDPSVWGP NPDAFDPENF SREAEAKRPI NAWKPFGNGQ RACIGRGFAM
421 HEAALALGMI LQRFKLIDHQ RYQMHLKETL TMKPEGFKIK VRPRADRERG AYGGPVAAVS
481 SAPRAPRQPT ARPGHNTPML VLYGSNLGTA EELATRMADL AEINGFAVHL GALDEYVGKL
541 PQEGGVLIIC ASYNGAPPDN ATQFVKWLGS DLPKDAFANV RYAVFGCGNS DWAATYQSVP
601 RFIDEQLSGH GARAVYPRGE GDARSDLDGQ FQKWFPAAAQ VATKEFGIDW NFTRTAEDDP
661 LYAIEPVAVT AVNTIVAQGG AVAMKVLVND ELQNKSGSNP SERSTRHIEV QLPSNITYRV
721 GDHLSVVPRN DPTLVDSVAR RFGFLPADQI RLQVAEGRRA QLPVGEAVSV GRLLSEFVEL
781 QQVATRKQIQ IMAEHTRCPV TKPKLLAFVG EEAEPAERYR TEILAMRKSV YDLLLEYPAC
841 ELPFHVYLEM LSLLAPRYYS ISSSPSVDPA RCSITVGVVE GPAASGRGVY KGICSNYLAN
901 RRASDAIYAT VRETKAGFRL PDDSSVPIIM IGPGTGLAPF RGFLQERAAR KAKGASLGPA
961 MLFFGCRHPD QDFLYADELK ALAASGVTEL FTAFSRADGP KTYVQHVLAA QKDKVWPLIE
1021 QGAIIYVCGD GGQMEPDVKA ALVAIRHEKS GSDTATAARW IEEMGATNRY VLDVWAGG
CYP102B1 Streptomyces coelicolor cosmid F43.
GenEMBL AL136502 CDS 10570..12153 gene="SCF43.12"
Highly similar to the N-terminal P450 domain of Bacillus
megaterium 41.9% identity in 497 aa overlap.
45% to 102A1 over 433 amino acids
cloned and expressed by David Lamb and Steve Kelly
CYP102B2 Streptomyces avermitilis
GenEMBL AP005050
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV7426
78% to 102B1 from Streptomyces coelicolor
CYP102C1 Rhodococcus sp. X309
GenEMBL AF059700.1 complement(3619-4584) runs off end of sequence
partial gene 48% to 102B1
CYP102D1 Streptomyces avermitilis
GenEMBL AP005023
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV575 47% to 102A3
40% to 102B1, 44% to 102C1 partial seq
CYP102E1 Ralstonia metallidurans
GenEMBL NZ_AAAI01000371
104500-107000 region
51% to 102D1
MSTATPAAALEPIPRDPGWPIFGNLFQITPGEVGQHLLARSRHHDGIFELDFAGKRVPFVS
SVALASELCDATRFRKIIGPPLSYLRDMAGDGLFTAHSDEPNWGCAHRILMPAFSQRAM
KAYFDVMLRVANRLVDKWDRQGPDADIAVADDMTRLTLDTIALAGFGYDFASFASDELDP
FVMAMVGALGEAMQKLTRLPIQDRFMGRAHRQAAEDIAYMRNLVDDVIRQRRVSPTSGMD
LLNLMLEARDPETDRRLDDANIRNQVITFLIAGHETTSGLLTFALYELLRNPGVLAQAY
AEVDTVLPGDALPVYADLARMPVLDRVLKETLRLWPTAPAFAVAPFDDVVLGGRYRLRKD
RRISVVLTALHRDPKVWANPERFDIDRFLPENEAKLPAHAYMPFGQGERACIGRQFALTE
AKLALALMLRNFAFQDPHDYQFRLKETLTIKPDQFVLRVRRRRPHERFV
TRQASQAVADAAQTDVRGHGQAMTVLCASSLGTARELAEQIHAGAIAAGFDAKLADLDDA
VGVLPTSGLVVVVAATYNGRAPDSARKFEAMLDADDASGYRANGMRLALLGCGNSQWATY
QAFPRRVFDFFITAGAVPLLPRGEADGNGDFDQAAERWLAQLWQALQADGAGTGGLGVDV
QVRSMAAIRAETLPAGTQAFTVLSNDELVGDPSGLWDFSIEAPRTSTRDIRLQLPPGITY
RTGDHIAVWPQNDAQLVSELCERLDLDPDAQATISAPHGMGRGLPIDQALPVRQLLTHFI
ELQDVVSRQTLRALAQATRCPFTKQSIEQLASDDAEHGYA
CYP102F1 Actinosynnema pretiosum subsp. auranticum
AF453501 complement(6501..9518)
maytansinoid antitumor agent ansamitocin biosynthetic gene cluster I
49% to 102A3
gene = asm30
MVATGTRIPGPKPLPLVGNLLDVLTSDLDTDVDFLDRCHREHGG
IVALTFAGQRQVFASSHELVARMCSDPSWGKAVHPALEQVRDFAGDGLFTARGDEPNW
GKAHRLLMPAFGPTAMRDHFPAMLDIAEQMLVRWRRFGPDHRIDVADDMTRLTLDTIA
LCAFGARFNSFYRDRAHPFVDAMVRSLVEAGERAERLPGVQPFLVGRNQRYRDDIATM
NRIADGIVAARAALPAGERPDDLLERMLTCADPVTGERLSARNVRYQLATFLIAGHET
TSGLLSFAVHRLLAHPEVLRKAKDAVDGVLGDRVPAFEDLARLDYLGQVLRETLRLHP
TAPAFALAPDEPAELGGHAIGAGEPVLVMLPTLHRDPAVWRDPDVFDPERFAPERMDE
IPACAWMPFGHGARACIGRPFALQEATLVLALVLQRFDLALADPDHRLTIKQTLTLKP
DSLVVRARPRADRPGATATVETVVPHQVPATHRHGTPLHVFYGSNGGSGEGLARTIAG
DGAARGWATSVAPLDDAVRALPASGPVVIVSSSYNGAPPDNAAHFVRWLTQDGPDLSG
VDYLVLGCGNLDWSATYQRVPTLIDEAMAAAGARRLRERGATDARADFFGDWERWYEP
LWPLLSAECGVEVGEIGPRFRVVESDAADGLGDLASAVVLENRELVRGPDAGSKRHLE
LRLPDGTSYRTGDYLSVLPQNHPDLVRRAVARLGTRAERVVTVESSAPTGLVPVGRAL
RVDELLTRCVDLSAPAGAGVVARLAERCPCPPERAELAATTGATLLELLERFPSCAVD
LALALELLPAPRTRLYSISSAAEEQRAEVALTVSVTGVTSGYLSRVRPGDRVAVGIAS
PPESFRPPADNTVPVVLIAAGTGIAPFRGFLRARAALGGEPGPALLLFGCRGPELDDL
YAEEFAALGDWLEVDRAYSRHPDGEVRHVQHRLWQRRDRVRELVDAGARVYLCGDATR
VGPAVEEVLGRIGPGAGWLDALRAGGRYATDVF
103 Family
CYP103A1 Agrobacterium tumefaciens
GenEMBL M19352, AF242881 CDS 141158.142426
gene="virH1"
CYP103A2 Agrobacterium tumefaciens
GenEMBL AF034769
GenEMBL AB016260 CDS 124584..125759
CYP103A3 Agrobacterium tumefaciens plasmid pTiAB2/73 vir region
GenEMBL AF329849 892..2148
gene = virH
61% TO 103A1
MNARGPEKVSQTSGPIISASLDPDNVSVSDLDRSGHAIFAEWRP
KRPFLRRQDGVYVLLRADDVLGLSSDPRTRQIETELMLNRGINEGAVFDFVRYSMLFS
NNEVHSRRRSPFTRTFAFRMIENLRPQVSQLTETLFQDLKELDSFNFVEEFASKLPAV
AIAGLLGLPPSDIPYFTQLVYRVARCLSPSWRDADLPDIEASAAEFKNYVQAVIDDRR
SNPRDDFLSSFIRATREAEDLSPDEGLAQLMLIVLAGTDTTKTGLTALTGQLLRHRHV
WEALLKDESLVPAAVEEGLRFEPPVGSYPRLALADIDLEGFILPKGSLLALCTMSALR
DEKHFAHPELFDIHRKQMHWHMVFGAGAHRCLGEALARLELQEGLATVLRYAPTLSIE
GEWPTVQGHGGVRRIAEMRVGFRRQI
104 Family
CYP104A1 Agrobacterium tumefaciens
GenEMBL M19352, AF242881 CDS 142447..143670
gene="virH2"
CYP104A2 Agrobacterium tumefaciens
GenEMBL AB016260
103A2 CDS 124584..125759 and
104A2 CDS 125919..127094 83% to 104A1
105A Subfamily
CYP105A1 Streptomyces griseolus
GenEMBL M36480 (1629bp) Y18556 CDS 2447..3703
Omer,C.A., Lenstra,R., Litle,P.J., Dean,C.R., Tepperman,J.M.,
Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
Genes for two herbicide-inducible cytochromes P-450 from
Streptomyces griseolus
J. Bacteriol. 172, 3335-3345 (1990)
Gene suaC
CYP105A2 Amycolata autotrophica
GenEMBL D26543 (1197bp)
Kawauchi,H., Sasaki,J., Adachi,T., Hanada,K., Beppu,T. and
Horinouchi,S.
Cloning and nucleotide sequence of a bacterial cytochrome P-450
VD25
gene encoding vitamin D-3 25-hydroxylase
Biochim. Biophys. Acta 1219, 179-183 (1994)
CYP105A3 Streptomyces carbophilus
GenEMBL D30815 PIR JC4287
Watanabe,I., Nara,F. and Serizawa,N.
Cloning, characterization and expression of the gene encoding
cytochrome P-450sca-2 from Streptomyces carbophilus involved in
production of pravastatin, a specific HMG-CoA reductase inhibitor
Gene 163 (1), 81-85 (1995)
105B Subfamily
CYP105B1 Streptomyces griseolus
GenEMBL M36481 (1688bp) M32239
Omer,C.A., Lenstra,R., Litle,P.J., Dean,C.R., Tepperman,J.M.,
Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
Genes for two herbicide-inducible cytochromes P-450 from
Streptomyces griseolus
J. Bacteriol. 172, 3335-3345 (1990)
Gene subC, SU-2
CYP105B2 Streptomyces tubercidicus strain R-922
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Cyp229
78% to 105B1
105C Subfamily
CYP105C1 Streptomyces sp.
GenEMBL M31939 PIR S19629 (381 amino acids)
Horii, M., Ishizaki, T., Paik, S.Y., Manome, T. and Murooka, Y.
An operon containing the genes for cholesterol oxidase and a
cytochrome P-450-like protein from a Streptomyces sp.
J. Bacteriol. 172, 3644-3653 (1990)
Gene choP
105D Subfamily
CYP105D1 Streptomyces griseus
GenEMBL S45823 X63601 (1700bp) PIR S24750 (412 amino acids)
Trower,M.K., Lenstra,R., Omer.C., Buchholz,S.E., and
Sariaslani,F.S.
Cloning, nucleotide sequence determination and expression
of the genes encoding cytochrome P-450soy (soyC) and
ferredoxinsoy (soyB) from streptomyces griseus.
Mol. Microbiol. 6, 2125-2134 (1992)
PIR S35901 (412 amino acids)
Erratum. Cloning, nucleotide sequence determination and
expression of the genes encoding cytochrome P-450(soy)
(soyC) and ferredoxin(soy) (soyB) from Streptomyces griseus.
Mol. Microbiol. 7, 1024-1025 (1993)
CYP105D2 Streptomyces griseus
GenEMBL AF071145
84% identical to 105D1
CYP105D3 Streptomyces sclerotialus
GenEMBL AF071149
68% identical to 105D1
CYP105D4 Streptomyces lividans
GenEMBL AF072709 CDS complement(1593..2813)
69% to 105D1 67% to 105D2 82% to 105D3 57% to 105A1
CYP105D5 Streptomyces coelicolor
3StF60 [Full Sequence] Sanger cosmid
CDS comp(2106-3344) 98% identical to CYP105D4
cloned and expressed by David Lamb and Steve Kelly
CYP105D6 Streptomyces avermitilis
GenEMBL AB070949.1 69121-70371
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV412_pteD 55% to 105D1 from Streptomyces griseus,
53% to 105D4, 54% to 105D5 (if first 17aa left off 105D5)
Gene = pteD
CYP105D7 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV7469 73% to 105D4 from Streptomyces lividans
CYP105D8 Streptomyces tubercidicus strain I-1529
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Cyp233
68% to 105D7
CYP105D9 Streptomyces sp. JP95
GenEMBL AF509565 11774..13024
griseorhodin biosynthesis gene cluster
55% to 105D6
gene = grhO3
MTDTLDEPQTLADGAEDAPAYPVKRTCPYRMPPGYEELREKGPI
SRVTLWNGRTAWLVTGNDLGRRLFPDARLSSDVLDPRFPLLAPRIEAQRQQAAAPPLV
GVDDPVHARQRRMVLPSFGIRQINALRPEIQKYADDLLDTMLAKGPGVTVDLLTEYAL
PMPSAVICMLLGVPYEDHHYFDERSRHVLSSSGEEQAAQAQQAFTEILAYLDDLIVRK
QAEPGDTLLDELIARQLEEGKVDRQELAMIATVLLVSGHETTSNMIALSTMALLADPD
QLAALRADESLMPRAVDELMRFSSIGDMLMRVAKEDIEIEGHLIRAGDGVILSTMLMN
RDPGAFERPDELDIRRPAGRHVAFGYGIHQCIGQNLARAEMEIALATLFRRVPTLKLA
VPAEQVPVNAPFVLQGVSELPVTW
105E Subfamily
CYP105E1 Rhodococcus fascians
GenEMBL Z29635 (7139bp) PIR S42052 (399 amino acids)
Crespi,M., Vereecke,D.M., Temmerman,W.G., Van Montagu,M.
and Desomer,J.
The fas operon of Rhodococcus fascians encodes new genes required
for efficient fasciation of host plants.
J. Bact. 176, 2492-2501 (1994)
MAGTADLPLEMRRNGLNPTEELAQVRDRDGVIPVGELYGAPAFL
VCRYEDVRRIFADSNRFSNAHTPMFAIPSGGDVIEDELAAMRAGNLIGLDPPDHTRLR
HILAAEFSVHRLSRLQPRIAEIVDSALDGLEQAGQPADLMDRYALPVSLLVLCELLGV
PYADRDELRDRTARLLDLSASAEQRAVAQREDRRYMATLVTRAQEQPGDDLLGILARK
IGDNLSTDELISIISLIMLGGHETTASMIGLSVLALLHHPEQAAMMIEDPNCVNSGIE
ELLRWLSVAHSQPPRMAVTEVQIAGVTIPAGSFVIPSLLAANRDSNLTDRPDDLDITR
GVAGHLAFGHGVHFCLGHSLARMTLRTAVPAVLRRFPDLALSPSHDVRLRSASIVLGL
EELQLTW
CYP105F1 Streptomyces lavendulae
GenEMBL AF127374 CDS 2006..3229
48% to 105C1 42% to 105B1 40% to 105D1 new subfamily in 105
CYP105F2 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
85% to 105F1
clone name SP8812
CYP105G1 Amycolatopsis mediterranei
GenEMBL AF040571 CDS complement(5011..6066)
49% to 105C1, 105B1 new subfamily in 105
looks like an insertion in the seq from 80-120
CYP105H1 Streptomyces noursei ATCC 11455 nyst
GenEMBL AF263912 CDS comp (58637..59833)
gene="nysN" 47% to 105B1 46% to 105A1 46% to 105D1
function="presumably involved in modification of the
nystatin macrolactone ring"
CYP105H2 Streptomyces albus
GenEMBL AF071143
77% to 105H1
LLIAGHETTANNIGLGVVTLLSHPQWAGDERAVEELLRLHSVAD
MVALRVAVDDVEIAGQVIRKGEGIVPLLAAANHDTEVFGCPHAFDPERSERRHVAFGY
GVHQCLGQNL
CYP105H3 Streptomyces natalensis
GenEMBL AJ278573 52789..53985
pimaricin biosynthetic gene cluster.
68% to 105H1
gene = pimG
MTYTDPAAPETDPPAVDFPQRKPGVPFPPPDYADYRDRKGLVLS
QLSDGKRVWLVTRHEDVRAVLTSPSISSNPEHKGFPNVGNLGVPKQDQIPGWFVGMDS
PEHDRFRKALIPEFTVRRVRAMKPAIERTVDAQLDAMLAAGNTADLVADFALPIPSLV
ISALLGVPPADREFFESRTRVLVSLRSSTDDDRMAAAKDLLRYINRLVEIKQKWGGDD
LITRLLATGAIAPHEMSGVLMLLLIAGHETTANNIALGVVTLLANPQWIGDDRAVEET
LRFHSVADLVSLRVAVQDVEIAGQLIKAGEGIVPLVAAANHDENAFECPHAFDPSRSA
RHHVAFGYGVHQCLGQNLVRIEMEVAYRKLFERIPNLELAVPTDGLDIKYDGVLYGLN
ELPVRW
CYP105H4 Streptomyces nodosus
GenEMBL AF357202 complement(62051..63250)
amphotericin biosynthetic gene cluster
84% to 105H1
MTAETEMTTFAPGCPVAFPLRRPGRPFPPPEYADYRAGEGLVRS
ELPASGPVWLVTRHEDVRTVLTDPRISADPSRPGFPRARRTGGAPSQSEIPGWFVALD
PPEHDRFRKTLIPEFTVRKVRELRPAIQQIVDERIDALLAAGNSADLIADFALSVPSL
VISDLLGVPKADRDFFEAKTKVLVTLSSTDEQRDEASKALLRYLNRLIQIKGRRPGED
LISRLLQAGTMNRQELSGVSMLLLIAGHETTANNIGLGVVQLLTNPQWIGDDRIVEEM
LRYYSVADLVSFRVAVEDVEIGGQLIKAGEGIVPLIAAANHDGSVFDKPEEFNPERSA
RSHVAFGYGVHQCLGQNLVRVEMEIAYRTLFERIPTLELAVPVEELPLKYDGVLFGLH
ELPVTWS
CYP105H5 Streptomyces griseus
GenEMBL AJ300302 10678..11859
Gene = canC
72% to 105H3
MTTSPGPTVVDFPRRTPREPLPLSQYAEHRKQNGLVQTHLPNGR
PIWLVTRHEDVRAVLTHPRISANPDNEGFPNVGETMGVPKQEQIPGWFVGLDSPEHDR
FRKVLIPEFTVRRVRELRPAIERTVDERIDAMLAGGNTADLVNDFALPVPSLVISALL
GVPSADRDFFESRTRTLVAIRTSTDEERAEATRQLLRYINRLIVIKKKWRGEDLISRL
LSTGKLSDEELSGVLLLLLIAGHETTANNIGLGVVTLLSHREWIGDDRLVEELLRLHS
VADMVALRVAVDDVEIAGQTIRKGEGIVPLLASANHDTEAFGCPHAFNPERTERRHVA
FGYGVHQCLGQNLVRVEMEIAYRKLFERIPELRLAVPEDQLAYKYDGILFGLHELPVR
W
CYP105J1 Amycolatopsis mediterranei rifamycin
GenEMBL AF040570 CDS comp (67462..68673)
52% to AF072709 105D4 50% to 105D1 new subfamily in 105
CYP105K1 Streptomyces tendae strain Tue901
GenEMBL Y18574 CDS 6325..7557
45% to 105A3 46% to 105D1 43% to 105B1 new subfamily in 105
gene="nikF"
CYP105K2 Streptomyces ansochromogenes
GenEMBL AF469953 14..1246
95% to 105K1
note="involved in nikkomycin biosynthesis
MTEAFDHDIPSFPMARECPMHPPAEYRELRGQEPVSRVRMPDGQ
VAWLVLKHALARKLLADPRVSADRLHPAFPGRLTAEQRAATERVRRLTTRRSMIHLDG
DEHGAHRRILTGEFSLRRIAAQRPRVQEIVDRSIDEMLAAPQPADLVEHVSQAVPSLV
ICELLGVPHEQRRDFHEWAGMLVSRSVSIQERAAASDALNDFLEALVTEKERGEPADD
LIGRLIARNRQTPVMTHDEIVGTAVMLLVAGHQTTANMISLGVVALLENPEHKARIAA
DSSLLPPAIEEMLRYFSVVENAPARVATEDIAIGGVTIRKNEGIVVSGLAADWDDEVF
GHPDRLDFERGARHHVAFGYGVHQCLGQNLARVELEIVFETLLRRVPGLSLAVPAEEL
PYKDDAGIYGIYRVPVNC
CYP105L1 Streptomyces fradiae
GenEMBL AF055922 CDS comp (6507..7769)
GenEMBL AF147703 complement(2565..3875)
Fouces,R., Mellado,E., Diez,B. and Barredo,J.L.
The left edge of the tylosin gene cluster from Streptomyces
fradiae
Microbiology (1999) In press
tylH1
46% to 105A1 42% to 105D1 43% to 105B1 new subfamily in 105
MSSSGDARPSQKGILLPAARANDTDEAAGRRSIAWPVARTCPFS
PPEQYAALRAEEPIARAELWDGAPVWLISRQDHVRALLADPRVSIHPAKLPRLSPSDG
EAEASRSLLTLDPPDHGALRGHFIPEFGLRRVRDVRPSVEQIVTGLLDDLTARGDEAD
LLADFALPMATQVICRLLDIPYEDRDYFQERTEQATRPAAGEEALEALLELRDYLDRL
ISGKTGRESGDGMLGSMVAQARGGGLSHADVLDNAVLLLAAGHETTASMVTMSVLVLL
QHPTAWRELTVNPGLLPGAVDELLRYLSIADGLRRSATADIEIDGHTIRAGDGLVFLL
AAANRDEAVFSEPEAFDIHRSARRHVAFGYGPHQCLGQNLARMELEVALGAVLERLPA
LRPTTDVAGLRLKSDSAVFGVYELPVAW
CYP105L2 Micromonospora griseorubida
GenEMBL AB089954 1490..2641
gene cluster for the polyketide macrolide mycinamicin
54% to 105L1
gene = mycCI
MDRTCAWALPEQYAEFRQRATGWPAKVWDGSPTWLVSRYEHVRA
LLVDPRVTVDPTRQPRLSEADGDGDGFRSMLMLDPPEHTRLRRMFISAFSVRQVETMR
PEIEKIVDGILDRLLALEPPVDILTHLALPMSTQVICHLLGVPYEDREFFQERSELAS
RPNDDRSMPALIELVEYLDGLVRTKTAHPDTGLLGTAVTERLLKGEITHQELVNNAVL
LLAAGHETSANQVTLSVLTLLRHPETAAELREQPELMPNAVDELLRYHSIADGLRRAA
TADIVLGDHTIRAGDGLIILLSSANHDGNTFGAEATFDIHRPARHHVAFGYGPHQCLG
QNLARLEMEVTLGKLFRRVPALRLAQEPDALRVRQGSPIFGIDELLVEW
CYP105M1 Streptomyces clavuligerus clavulanic
GenEMBL AF200819 CDS 136..1359
GenEMBL AY034175 CDS 200..1423
GenEMBL U87786 CDS 13810..15036
function="involved in clavulanic acid biosynthesis"
48% to 105B1 42% to 105A1 41% to 105D1 new subfamily in 105
MNEAAPQSDQVAPAYPMHRVCPVDPPPQLAGLRSQKAASRVTLW
DGSQVWLVTSHAGARAVLGDRRFTAVTSAPGFPMLTRTSQLVRANPESASFIRMDDPQ
HSRLRSMLTRDFLARRAEALRPAVRELLDEILGGLVKGERPVDLVAGLTIPVPSRVIT
LLFGAGDDRREFIEDRSAVLIDRGYTPEQVAKARDELDGYLRELVEERIENPGTDLIS
RLVIDQVRPGHLRVEEMVPMCRLLLVAGHGTTTSQASLSLLSLLTDPELAGRLTEDPA
LLPKAVEELLRFHSIVQNGLARAAVEDVQLDDVLIRAGEGVVLSLSAGNRDETVFPDP
DRVDVDRDARRHLAFGHGMHQCLGQWLARVELEEILAAVLRWMPGARLAVPFEELDFR
HEVSSYGLGALPVTW
CYP105N1 Streptomyces coelicolor
St4C2 [Full Sequence] Sanger cosmid
CDS 29986-31221 45% to 105A1 new subfamily in 105
cloned and expressed by David Lamb and Steve Kelly
CYP105N2 Streptomyces glaucescens cytochrome P450
GenEMBL AF071144
95% to 105N1 only 5 aa diffs
57% to AF071148 56% to AF071146 59% to 105D3 54% to 105A3
LLIAGHETTTSMIALSTLLLLDRPELPAELRNDPDLMPAAVDEL
LRVLSVADSIPLRVAAEDIELSGRTVPADDGVIALLAGANHDPEQFDDPERVDFHRTD
NHHVAFGYGMHQCLGQNL
CYP107N3 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
91% to 107N1
clone name SP0881
CYP105P1 Streptomyces avermitilis
GenEMBL AB070949.1 67376-68575
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV413_pteC low 40% range to 105 subfamilies
Gene = pteC
CYP105P2 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
92% to 105P1
clone name SP7863
CYP105Q1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV1611 49% to 105B1 from Streptomyces griseolus
46% to 105D4 and D5
CYP105Q2 Streptomyces sp.
GenEMBL BD133549
78% to CYP105Q1
3 LIAGHETTATMISLGTLALFEHPDQLAVLRDTEDPKVVAGAVDELLRYLSIVHSGLRRVA 182
183 KGDIEIDGRLIRKGDGLLFDLQTANWDPNAFPGAERLDLARPARQHNAFGFGTHQC 350
CYP105Q3 Streptomyces sp.
GenEMBL BD133546
77% to 105Q1
139 MADTLTDAAPDTDGRVPEYPMPRATGCPLAPSPAAAELRGDRPITRVRIWNGSTPWLITR 318
319 HADQRTLLTDPRVSNDDHEPDFPHVNAHRAAIAPHTPKLITNTDAPEHTRLRRSVNAPFL 498
499 VKRIEAMRPAVQKIVDDLIDDMLAGPSPADLLTALALPVPSLVIAELLGVPYEDHHFFQE 678
679 NSNRVLDNSLTAEEAQESSRALGGYLDTLFRTKLEQPGEDVLSEMGSKVKAGEMTHQEAV 858
859 SMGVAMLIAGHETTATMISLGTLALFEHPDQLAVLRDTEDPKVVAGAVDELLRYLSIVHS 1038
1039 GLRRVAKGDIEIDGRLIRKGDGLLFDLQTANWDPNAFPGAERLDLARPARQHNAFGYGPH 1218
1219 QCLGQNLARLELQVVYGTLYRRVLTLRPAVPVDQLAFNHTGTTYGVKCLPVTW 1377
CYP105R1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV7186
CYP105S1 Streptomyces tubercidicus strain R-922
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Cyp230
56% to CYP105S2
CYP105S2 Streptomyces tubercidicus strain I-1529
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Cyp234
56% to CYP105S1
CYP105T1 Burkholderia fungorum
GenEMBL NZ_AAAJ02000095
8366..9610 gene = Bcep2217
44% to 105H1
MRKTMTSAINDVRPQTTSTFPFARTGSPLHPPAEYARYRDGQPV
TRVQMWDGRYAWIFTRMEDVKAVLSSPHFSVVPSKPGYPFLTPARAATVKSYQTFITM
DPPDHTRFRRMLTRDFTQKRMEELRPQIAAYVNRLIDEMLARGSPGDLVSALALKLPV
TVVSMLVGVPYEDHEDLVKWSGQRLDLEQNPTVSESAADNMLAYFDGLLQRKERDPGD
GADMLSRLVIEQIKPGHLSRLEAIHMVNLLYFAGHETTANQIALGTLSFLLDPRQRAL
LENNPGLLKNAIEEMLRFHTISHYNSCRVATADVEVGGTLIREGEGAYALIMAANRDP
AAFPAPDRFDIERPNSQEHVAFSYGLHMCLGQPLARLELQVCFEALFRRLPRLRLAVP
LEELPFKREMYVYGLHALPVTW
CYP105U1 Streptomyces hygroscopicus strain NRRL 3602
AY179507 complement(63940..65133)
Geldanamycin biosynthesis gene cluster
50% to 105B1 52% to 105B2 not 105S
gene = gdmP
MDEIRDYPESRAAACPFSPPLGYEELRERSAVTRVRMWDGSTPF
LVTGYHEARAALGDSRFSADGTHKAMPRFVKFEVPAEVFNLGRMDDPEHARIRRMLTA
NFTIRRTEAMRPMIQGIVDGLLDRLIAQGPPADLVADFAFPLPSQVIGVMLGVSDADF
AEFQQASQGVMDFTASAEEMGAALGVMVDYVARMCAAKRADPGDDLLSRLIVDQELTG
GLTQQQVVATALVLLLAGHETTANMIALSTVLLLSHPEQLARLRADAGLMGNAVDELL
RYITIVQEGTGRVATEDVEVGGVLIPGGEGVIINLPSANRDPHFADAHELDLSRPNAR
EHVAFGFGVHQCLGQTLARVELQIALETLLRRLPTLRLEVPFDDLAFLYESMNFGVAR
VPVAW
CYP105V1 Streptomyces sp. HK803
GenEMBL AY354515 36297..37508
Gene = plmT4
43% to CYP105Q1
MSQLSSELPAFPMSKAKGCPLDPPPEYAQLRSDRPVAKARLWDG
KEVWLITGYDEIRSIFTDPRISVDNTQPGYPWLSEQARTVVLTGGVKPVGRMDPPEHT
AMRRMLGQGFLVKKIQNMRGDVEALVNELIDDILAGPRPTDLVPSLAMPVPSTALGWV
LGVPPADKRLISLVPRLFDEDSGLEGAMEARAELFAYIDELITHRENQPGDDIISHLV
GYYQKGELSRVSVLTQSVTLIAAALDTTRSMITNGILALLQHPEQAAALIEDPDLVPA
AVEELLRYTVVTEFSSKRVAAADIEIAGETIKAGDGIICLISAGNRDEKVFTDPDTLD
VRRDAKQHLGFGAGIHTCIGKQLARMELEVVYGTLFRRIPELRLAVPFDQLVFRNTFD
VQGVRALPVTW
CYP105W1 Micromonospora echinospora
GenEMBL AF497482 84045..85229
Gene = calE10
calicheamicin biosynthetic locus
45% to CYP105K1 47% to 105D4
MPRRCPFGPPAEYARLRTERPVARLPMLGGNTAWVVSRYADVKR
VLSDPRMSADRRRAGFPRFAPTTESQRQASFANFRPPLNWMDPPEHTAARRQIVDEFA
ARRVRQLRPLVERVVDEHLDAMTAGRSSADLVPSFSYPVPSRVICEMLGVPYGEHAFF
ERRSTRMLSRGVPADERARCAREIREFLDGVVTDKERHPGDDVLSRLLAAQRAAGEPD
HEAVVSMAFVLLVAGHVTTSNMISLSVLALLTHPERLARLRAEPDRFPAAVEELLRYF
TIVEAATARTATADVTVGGVTIRAGEGVVALGQAANRDPAAFDRPDEFDPDRDARHHL
AFGYGRHICPGQHLARLELDVALSRLVRRLPGLRLTVDVDDLPLKEDGNIFGLHALPVAW
CYP105X1 Pseudonocardia autotrophica same as Amycolata autotrophica
GenEMBL AF525299 2766..3974
Gene = pauC
P-450 gene cluster
49% to 105A3
MAEDTLGQDFPMQRQCPFEPPKEYERLRAEQPISRVRMPDGTPA
WLVTLHEDVRTVLASPAFSSDLAHPGMPAVNPEIRTIARQQRPPFSRMDPPEHSFFRR
MLIPEFTVKRTKTLRAGIQSVVDGLIDDLLRKSPPVDLVDEFALPVPSLVICQLLGVP
YSRHEFFQQQARVILSRQSTREQVGAAFTALRAYLDTLVEEKLHTPGDDLTSRLATEH
LEPTGDVRRQDLVASCMLLLTAGHETTSHMISLGVTALLEHPDQLAALQNDLTLLPEA
VEELVRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDI
HRGNRRHACFGYGVHQCIGQHLARTELEVAFSTLFTRIPTLQIAAPSDELDYDHDGML
FGLHELPVTW
CYP105X2 Amycolata autotrophica same as Pseudonocardia autotrophica
GenEMBL AF071148
99% to 105X3 94% to 105X1 61% to 165B2
LLIAGHETTSHMISLGVTALLERPDQLAALQNDLTLLPEAVEEL
LRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDIHRGN
RRHVAFGYGVHQCLGQNL
CYP105X3 Micromonospora inyoensis
GenEMBL AF071146
99% to 105X2 61% to 165B2 60% to 105A3
LLIAGHETTSHMISLGVTALLEHPDQLAALQNDLTLLPEAVEEL
LRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDIHRGN
RRHVAFGYGVHQCLGQNL
106 Family
CYP106A1 Bacillus megaterium
GenEMBL X16610
Gene BM-1
CYP106A2 Bacillus megaterium
GenEMBL Z21972 (4317bp) PIR S32216 (410 amino acids)
PIR S39924 (410 amino acids) Swiss Q06069 (410 amino acids)
Rauschenbach,R., Isernhagen,M., Noeske-Jungblut,C., Boidol,W.
and Siewert,G.
Cloning, sequencing and expression of the genes for cytochrome
P450meg, the steroid-15beta-monooxygenase from Bacillus
megaterium ATCC 13368.
Molec. Gen. Genet. 241, 170-176 (1993)
CYP106B1 Bacillus anthracis str. Ames
Genpept AAP26480
47% to 106A2 47% to 109B1
1 MASPENVILV HEISKLKTKE ELWNPYEWYQ FMRDNHPVHY DDEQDVWNVF LYDDVNRVLS
61 DYSLFSSRRE RRQFAIPPLE TRININSTDP PEHRNVRSIV SKAFTPRSLE QWKPRIQSIA
121 NELVKDIENC SEVDIVEQFA APLPVTVISD LLGVPTTDRK KIKAWSDILF MPYSKEKFND
181 LDAEKGIALN EFKAYLLPIV QEKRYHLTDD IISDLIRAEY EGERLTDEEI VTFSLGLLAA
241 GNETTTNLII NSFYCFLVDS PATYKEVREK PKLISKAVEE VLRYRFPVTL ARRITEDTNI
301 FGPLMKKDQM VVAWVSAANL DEKKFSQASK FNIHRIGNEK HLTFGKGPHF CLGAPLARLE
361 AEIALTTFIN AFEKIALSPS FNIEQCILEN EQTLKFLPIR LKPQ
CYP106B2P Bacillus cereus ATCC 14579
GenPept AAP09572 GenEMBL AE017006
83% to 106B1 54% to CYP109B1 YjiB Z99110 Bacillus subtilis I -helix
1 MTSVITDGEI VTFSLGLLAA GNETTTNLII NSFYCFLVDS PGIYEELRKE PNLILKAIEE
61 VLRYRFPVTL TRRITALSER ESPSPLGMG
CYP106B3P Bacillus cereus ATCC 14579
GenPept AAP09575 GenEMBL AE017006
87% to 106B1 54% to 106A2 C-term fragment
LKEDTNIFGPF
1 MKKNQMIVAW VSAANLDEKK FSQASQFNVH RTGNEKHLTF GKGPHFCLGA PLARLEAEIA
61 LTTFINAFEK IELFPSFCLE KCILENEQTL KYLPIRLKAT
107A Subfamily
CYP107A1 Saccharopolyspora erythraea
GenEMBL X60379 Swiss Q00441 (406 amino acids)
Haydock S.F., Dowson J.A., Dhillon N., Roberts G.A., Cortes J.,
Leadlay P.F.
Cloning and sequence analysis of genes involved in erythromycin
biosynthesis in Saccharopolyspora erythraea: sequence similarities
between eryG and a family of S-adenosylmethionine-dependent
methyltransferases.
Mol. Gen. Genet. 230, 120-128 (1991).
Weber J.M., Leung J.O., Swanson S.J., Idler K.B., Mcalpine J.B.
An erythromycin derivative produced by targeted gene disruption in
Saccharopolyspora erythraea.
Science 252, 114-117 (1991)
CYP107A2 Streptomyces rochei plasmid pSLA2-L
NC_004808 complement(44847..46067)
64% to 107A1
note="ORF26 (406 aa), lankamycin biosynthesis protein
similar to M54983-1 Saccharopolyspora erythraea
6-deoxyerythronolide B hydroxylase, EryF CYP107A1
MTTDAHTAVPSLDSDLFHIDQYEAYAALREREPVSKVSFIGREA
FLITRHAEAKAALGDLRLSNDFKKQPPGVELPTYHGIPEDVRPYFANNMGSNDPPAHT
RLRRLVSREFTARRVESMRTRVAQLAEHLLDGLAGERETDLVERFAYPLPITVISELL
GVEERYQGDFGRWSNEFLVIDADRVEQREHAARALVGFILELVDRRRADPGSDLLSAL
IHVHDEDEDRLSTDELASVVLILLIAGFETSVSLIAMATYLLLTHPGELAKVRADPSL
VPNAVDEVLRFLGPAEITTRGTLEPVEIGGVHIPAHSTVLIAGAAANRDPRRFPDPER
FDVTRDTGGHLSFGHGIHFCVGGPLARLEGEIALRALLNRFPGLDLAIPAEQVRWRRS
FLRGIESLPVRLGR
107B Subfamily
CYP107B1 Saccharopolyspora erythraea
GenEMBL M83110 Swiss P33271 (405 amino acids) PIR B42606 (405
amino acids)
Andersen J.F., Hutchinson C.R.
Characterization of Saccharopolyspora erythraea cytochrome P-450
genes
and enzymes, including 6-deoxyerythronolide B hydroxylase.
J. Bacteriol. 174, 725-735 (1992)
CYP107B2 Streptomyces sp.
GenEMBL BD133548
58% to 107B1
3 LIAGHETTTNFIGNAALALLRHPESLAHLRAEPQLLGGALDELLRYDSPVGIATFRFSTE 182
183 ALTLGGTEIPEGVPVLIAPGAANRDPDRFPDPDRLDLTRGATGHLAFGFGMHHC 344
107C Subfamily
CYP107C1 Streptomyces thermotolerans
GenEMBL D30759 (3267bp complete sequence of CarA)
Arisawa,A., Kawamura,N., Takeda,K., Tsunekawa,N.,
Okamura,K. and Okamoto,R.
Cloning of a macrolide antibiotic biosynthesis gene acyA, which
encodes 3-O-acyltransferase, from Streptomyces thermotolerans and
its use for direct fermentative production of a hybrid macrolide.
Appl. Environ. Microbiol. 60, 2657-2660 (1994)
Arisawa,A., Tsunekawa,N., Okamura,K. and Okamoto,R.
Nucleotide sequence analysis of carbomycin biosynthetic genes
including macrolide antibiotics 3-O-acyltransferase gene from
Streptomyces thermotolerans.
unpublished (1994)
CYP107C1 Streptomyces thermotolerans
GenEMBL M80346 (2393bp C-terminal fragment of CarA)
Schoner,B.E., Geistlich,M., Rosteck,P., Rao.R.N., Seno,E.,
Reynolds,P., Cox,K., Burgett,S. and Hershberger,C.L.
Sequence similarity between macrolide resistance determinants and
ATP binding transport proteins.
Gene 115, 93-96 (1992)
Note: P450 fragment called carX. is equivalent to C-terminal of CarA.
107D Subfamily
CYP107D1 Streptomyces antibioticus
GenEMBL L37200 (1400bp)
Rodriguez,A.M., Olano,C., Mendez,C., Hutchinson,C.R. and
Salas,J.A.
A cytochrome P450-like gene possibly involved in oleandomycin
biosynthesis by Streptomycese antibioticus.
unpublished (1994)
107E Subfamily
CYP107E1 Micromosospora griseorubida
GenEMBL D16098 (2168bp)
Inouye,M., Takada,Y., Muto,N., Horinouchi,S. and Beppu,T.
Cloning and nucleotide sequences of a gene governing mycinamicinIV
hydroxylation.
unpublished (1993)
107F Subfamily
CYP107F1 Streptomyces griseus
GenEMBL D45916 (2787bp) AB018074 CDS 341-1561
Ueda,K. and Horinouchi,S.
Cloning and Nucleotide Sequence of a Gene Involved in Redbrown
Pigment Biosynthesis in S. griseus
Unpublished (1995)
CYP107F2 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV1171 55% to 107F1
this subfamily is on the outskirts of CYP107
107G Subfamily
CYP107G1 Streptomyces hygroscopicus
GenEMBL X86780 (107379bp)
complement (91764-92978)
rapN
107H Subfamily
CYP107H1 Bacillus subtilis
GenEMBL U51868 (10153bp) Z99119, AF008220
coding region 7164-8351
pimelic acid biosynthesis
gene name bioI
107J Subfamily
CYP107J1 Bacillus subtilis
GenEMBL Y11043 U93876, Z99117
Belitsky, B. R., M. C. Gustafsson, A. L. Sonenshein, and C. Von
Wachenfeldt.
An lrp-like gene of Bacillus subtilis involved in
branched-chain amino acid transport. J Bacteriol. 179, 5448-57
(1997).
gene name cypA 42.6% identical to 107B1
also called yrdE
CYP107J2 Bacillus anthracis str. Ames
GenPept AAP26475
58% to 107J1 cypA of Bacillus subtilis
1 MAMKNKVGIR IEDGINLASA QFKEDAYEIY KESRKVQPVL FVNKTELGAE WLITRYEDAL
61 PLLKDNRLKK DPANVFSQDT LNVFLTVDNS DYLTTHMLNS DPPNHNRLRS LVQKVFTPKM
121 IAQLEGRIQD IADDLLNEVE RKGSLNLVDD YSFPLPIIVI SEMLGIPKED QAKFRIWSHA
181 VIAYPETPEE IKETEKQLSE FITYLQYLVD MKRKEPKEDL VSALILAESE GHKLSARELY
241 SMIMLLIVAG HETTVNLITN TVLALLENPN QLQLLKENPK LIDAAIEEGL RYYSPVEVTT
301 SRWADEPFQI HDQTIEKGDM VVIALAAANR DETVFENPEV FDITRENNRH IAFGHGSHFC
361 LGAPLARLEA KIAITTLFER MPELQIKGNR EDIKWQGNYL MRSLEELPLT F
CYP107J3 Bacillus cereus ATCC 14579
GenPept AAP09568
59% to 107J1 cypA Y11043 Bacillus subtilis
1 MKNKVGLSIE DGINLASAQF KEDAYEIYKE SRKKQPILFV NQVEIGKEWL ITRYEDALPL
61 LKDNRLKKDW TNVFSQDIKN MYLSVDNSDH LTTHMLNSDP PNHSRLRSLV QKAFTPKMIA
121 QLDGRIQRIA DDLISDIERK GTLNLVDDYS FPLPIIVISE MLGIPKEDQA KFRIWSHAVI
181 ASPETPEEIK ETEKQLSEFI TYLQYLVDIK RKEPKEDLVS ALILAESEGH KLSARELYSM
241 IMLLIVAGHE TTVNLITNTV LALLENPNQL QLLKDNPKLI DSAIEEGLRY YSPVEVTTAR
301 WAAEPFQIHH QTIQKGDMVI IALASANRDE TVFENPEIFD ITRENNRHIA FGHGSHFCLG
361 APLARLEAKI AITTLFNRMP ELQIKGNREE IKWQGNYLMR SLEELPLTF
CYP107J4P Bacillus cereus ATCC 14579
GenPept AAP09593
46% to CYP107J3 in same genomic region
47% to CYP107Y1 SAV2377 AP005030 Streptomyces avermitilis
50% to 107H1
1 MKEPQLQQHL EKFIQYIEAL VNEKRLNPDA DLISELVQTK EQEDKLSNNE LLSTIWLLII
61 AGHETTVNLI SNGLLALLQH PEQMNLIREN PSLIPSAVDE LLRHSGPVMF ISRLASEDMT
121 IHGKRIPKGD LVLLSLTAAN IDPQKFTYPE TLNISREENN HLAFGAGIHH CLGAPLARLE
181 GQIALGTLLQ RLPNLRLAIK PDQLNYNHSK IRSLVNLPVV F
CYP107K1 Bacillus subtilis
GenEMBL AL009126 Z99113 comp(76702-77832)
polyketide hydroxylase pksS
just over 41% identical to CYP107J1
CYP107L1 Streptomyces venezuelae
GenEMBL AF087022
GenEMBL AF079139 CDS 122..1372
pikC gene
function="catalyzes the hydroxylation of YC-17 into
methymycin and neomethymycin and narbomycin into
pikromycin"
51% to 107B1 47% to 107A1 44% to AF254925 42% to 107J1
41% to AL049754 new CYP107 subfamily
CYP107L2 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV1987 60% to 107L1 from Streptomyces venezuelae
CYP107L3 Streptomyces tubercidicus strain I-1529
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name CypLA
60% to CYP107L1 91% to 107L4
CYP107L4 Streptomyces tubercidicus strain R-922
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name CypLC
61% to CYP107L1 91% to 107L3
CYP107L5 Streptomyces sp.
GenEMBL BD133547
68% to 107L2
3 LIAGHETTVNLITGAVHALLTHPGQLAQVRGDMSLVDAVVEETLRHEGPVENATFRFAAE 182
183 PLEIGGTVIPAGDPVLIGLAAADRDGARYPGPDRFDIHRDTRGHLAFGFGTHRC 344
CYP107L6 Streptomyces sp.
GenEMBL BD133544
72% to 107L2
MGHEHVIDLGEYGPGFTENPHPVYAELRARGPVHRVRLPKHDAHHEAWLVVGYEEARAAL
ADPRLSKDGSTIGVTFLDEELIGKYLLIADPPQHTRLRGLIAREFTGRRVERLRPRVQEI
TDSLLDEMLPRGRADLVESFAYPLPLTVICELLGVPEIDRAAFRKLSTEAVAPTSGESEY
AAFVQLAAYLEELVEEKRCAPPADDLLSALIRTTDEDGDRLSPAELRGMAFILLIAGHET
TVNLITGAVHALLTHPGQLAQVRGDMSLVDAVVEETLRHEGPVENATFRFAAEPLEIGGT
VIPAGDPVLIGLAAADRDGARYPGPDRFDIHRDTRGHLAFGHGIHFCLGAPLARLEARVA
LRALLERCPGLTPDGAPGEWLPGMLIRGVRSLPVRW*
CYP107L7P Streptomyces narbonensis
GenEMBL AF521878 13901..14661
desosamine biosynthetic gene cluster
91% to 107L1
gene= nbmL
note= frameshift and deleltion generates premature
stop codon and truncated protein"
MSRTHQGTTASRPVLDLAALGQDFAADPYPTYARLRAEGPAHRV
RTPEGDEVWLVVGYDTARAVLADPRFSKDWRNSATPPTEAEAALSHNMLESDPRCGPT
(deletion)
ALRADLTLLDGAVEEMLRYGGPVESATYRFPVEPVDLDGTVLPAGETVLVVLAD
AHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCTGAPLARMEARIAVRALLERCPDLALD
VSPGELFWYPNPMIRGLESLPIRWRSGREAGRRVPVEPACRP*
CYP107L8 Streptomyces sp. HK803
GenEMBL AY354515 complement(72672..73871)
Gene = plmS2
56% to CYP107L6
MVTVDLSAYGPGFFTDPYPYYARLREAGPVHEIVLADGDRFWLI
VGYDEARAALADPRLAKSLDPPSEDERHVLITDPPDHTRLRRLVSREFTARRVEAMRP
RVQEITDGLLDEMVAGRRRADLVPSLGSPLPITVLCELLGVPLADREDFRGWTERVLV
PAEPDTIAWWKSRGFAQAGMALTDYLKNMIEDKRRSTPTGDLISSLLRTTAEDNDRLS
AAELHSMVFILIVAGHETTANLITNGVRALLAHPEQLAALRTDPEGLIDQAVEEMLRY
DGPVETSTKRFTLEAVRYGATKIPPGETLLVSIAATGRDPAQFERPDTFDIHRGTTGT
RSGHVAFGHGIHFCLGAGLARMESRVAILTLLRRCPDLALDIDPAGLDWLPGIRVRGV
RSLPVRW
CYP107L9 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
62% to 107L6 before frameshift at C-term
clone name SP0854
CYP107M1 Actinomadura hibisca
GenEMBL D87924CDS complement(6299..7534)
45% to AF127374 CDS 3226..4458 44% to AF254925
45% to 107D1 44% to 107G1, 107E1 new subfamily in 107
CYP107N1 Streptomyces lavendulae
GenEMBL AF127374 CDS 3226..4458
50% to 107D1 52% to AF254925 47% to 107E1 new subfamily in 107
CYP107P1 Streptomyces coelicolor cosmid H10
GenEMBL AL049754 CDS complement(10413..11648)
41% to AF087022 40% to 107B1 40% tp 107G1
40% to 107D1 new subfamily in 107
cloned and expressed by David Lamb and Steve Kelly
CYP107P2 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV4539 86% to 107P1 from Streptomyces coelicolor
CYP107P3 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
78% to 107P2 missing 156 aa at N-term
C-term may be frameshifted
clone name SP0887
CYP107Q1 Amycolatopsis mediterranei
GeEMBL AF040571 CDS complement(781..>2316)
66% to AF040570 comp(68704..69969) 43% to 107C1
41% to 107B1 40% to 107A1 new subfamily in 107
CYP107Q2 Amycolatopsis mediterranei
GenEMBL AF040570 CDS comp (68704..69969)
66% to AF040571 complement(781..>2316) new subfamily in 107
CYP107R1 Streptomyces maritimus
GenEMBL AF254925 CDS comp (18384..19589)
gene="encR"
53% to AF127374 CDS 3226..4458 49% to 107E1 new subfamily in 107
MTTHTQQLRDFPFAPPAELHMEPAFAQLREEEPISRVRLPYGGE
AWLVTRYQDIKTVLGDPRFSRAATQHAQAPRIQPDPAGEGVLMSLDPPDHTRLRKTVA
GVFTKRRVEDLRPATQRIAEELLEAMEASGAPADLVASYALPLPVTVICDLLGVPGDD
REQLRGWSDALLSTTACTPAESAAAAQAMADHFAALVSQRRRQPTDDLLGALVQTWDR
EEGLLRDEELVLLTRDLLIAGHETTASQIANCTYLLLQRPHDMDRLRTDPSAMASAVE
ELLRFIPLGSGSFRARVATEPVELCGVRIQPGDTVFAPTVAANWDPDVFAEPGRLDID
RSPNPHVAFGHGVHHCLGAQLARLELQVALGVLLRRLPRLRLAVDEAEIVWKTGMQVR
GPKTLPVKW
CYP107S1 Pseudomonas aeruginosa
NZ_AABQ07000001
NC_002516 3741011..3742267
locus_tag = PA3331
47% to 107B1
CYP107T1 Streptomyces coelicolor
StH63 [Full Sequence] Sanger cosmid
51% to CYP107L1 CDS 16028-17233
cloned and expressed by David Lamb and Steve Kelly
CYP107U1 Streptomyces coelicolor
StE41 [Full Sequence] Sanger cosmid
comp(7438-8739) 44% to CYP107B1
cloned and expressed by David Lamb and Steve Kelly
CYP107U2 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV3536 85% to 107U1 from Streptomyces coelicolor
CYP107U3 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
84% to 107U1 missing 90 aa at N-term
clone name SP0819
CYP107V1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV3519 low 40% range with some 107 subfamilies
CYP107W1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV2894_olmB low 40% to 107 subfamilies
CYP107X1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV6249 49% to 107L1 from Streptomyces venezuelae
CYP107Y1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV2377 50% to 107L1 from Streptomyces venezuelae
CYP107Z1 Streptomyces rimosus ssp. paromyceticus strain R-2374
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema11
96% to CYP107Z2v1
CYP107Z2v1 Streptomyces albofaciens strain C-0083
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema8
96% to 107Z2v2 and CYP107Z1
CYP107Z2v2 Streptomyces rimosus ssp. paromyceticus strain BOEH-4355
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema3
96% to CYP107Z2v1 95% to CYP107Z1
CYP107Z3 Streptomyces sp. strain IHS-0435
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema7
76% to 107Z12
CYP107Z4 Streptomyces lydicus strain NRAB-0114
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema16
82% to 107Z12
CYP107Z5V1 Streptomyces lydicus strain NRRL-2433
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema15
97% to 107Z5v3
CYP107Z5v2 Streptomyces chattanoogensis DSM-40241
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema6
1 aa diff to CYP107Z5v3
CYP107Z5v3 Streptomyces lydicus strain R-401
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema4
100% to S. kasugaensis strain A/96
CYP107Z5v3 Streptomyces kasugaensis strain A/96
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema10
100% to S. lydicus strain R-401
CYP107Z6 Streptomyces sp. strain I-1525
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema5
85% to CYP107Z8
CYP107Z7 Streptomyces tubercidicus strain DSM-40261
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema17
90% to CYP107Z8
CYP107Z8 Streptomyces platensis strain Tu-3077
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema13
89% to CYP107Z9
CYP107Z9 Streptomyces tubercidicus strain NRAA-7027
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema12
89% to CYP107Z8
CYP107Z10 Streptomyces tubercidicus strain I-1529
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema2
90% to CYP107Z11
CYP107Z10 Streptomyces platensis strain I-1548
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema14
100% to S. tubercidicus strain I-1529
CYP107Z11 Streptomyces platensis strain NRAA-7479
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema9
92% to 107Z12
CYP107Z12 Streptomyces tubercidicus strain R-922
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema1
92% to CYP107Z11
CYP107AA1 Bradyrhizobium japonicum USDA 110
GenPept BAC51802
NC_004463 complete genome complement(7193424..7194725)
41% to 133B1v1 45% to 107L1
1 MVTPGSGAAI GVFVSCGNRF EVTMNEQAQP AGGDPLFNPL SPDFIRNPYP HYDRLRAIDP
61 IHVTPFGQFV ASRHADVSLV MRDKRFGKDF VERSKRRYSE KIMDEPVFRS MSHWMLQADP
121 PDHTRLRGLV VKAFTARRVE DMRPRIQEIV DEAIDAVIDR GHMDLIEDFA FRLPVTIICD
181 MLGIPEDHRE VFYKSSRDGG RLLDPVPLTP EEIAKGNAGN MMAQMYFQQL FELRRRNPAD
241 DLTTQLVQAE EDGNKLTNEE LTANIILLFG AGHETTVNLI GNGLLALHRN PDQLALLKAR
301 PELMVNAIEE FLRYDSSVQM TGRVTLEDID DLGGRKIPKG ETVLCLLGSA NRDPAVYPDR
361 PDRLDVTRPN VKPLSFGGGI HFCLGAQLAR IEAEIAIATL LRRLPDLRID DVENPEWRPT
421 FVLRGLKSLP ASW
CYP107AB1 Streptomyces rochei plasmid pSLA2-L
NC_004808 Links 87725..88939
49% to 107A1
note="ORF37 lankamycin biosynthesis protein
MNQPQLPEIPALNSELFHTDQYATYREILEQRPVTRVRFYDGSL
VWLVNRHEDVRAALTDPRLSNDPMKQSDIDLSAATGIPADLIEYFQRNMFRSDEPDHG
RLRKLVTREFTVRRINALRPRIRQIADDLLEKFAATGGGDLVEALARPLPLTVMCELL
GVPEEDRADFQTWSQHIVESSPEFAERNAVSYRSLFECVRSLIRRRRDEPGDDLLSAL
VDLRDVADRLSENELISTVFLLLVAGIETTVNVLGTGTFLLLTHPGELARLRADGALL
GPAVEEMLRYMAPIEITSRHTLEPVEIGGVSIDAQSTVLINLAAANRDPARFEDPQSF
RVDRNDGGHLTFGHGIHYCLGAALARAEAEVTFEALLERFPDLRLAASASDLTWRHAF
MRGPVELPVSWG
CYP107AC1 Streptomyces atroolivaceus
GenEMBL AF484556 60948..62147
leinamycin biosynthetic gene cluster
48% to 107N1
gene = LnmA
MSATRRVHIYPFEGEVDGLEIHPKFAELRETDPLARVRLPYGGE
GWMVTRYDDVRAANSDPRFSRAQIGEDTPRTTPLARRSDTILSLDPPEHTRLRRLLSK
AFTARRMGAMQSWLEELFAGLLDGVERTGHPADIVRDLAQPFTIAVICRLLGVPYEDR
GRFQHWSEVIMSTTAYSKEEAVSADASIRAYLADLVSARRAAPHDDLLGVLVSARDDD
DRLTEDELITFGVTLLVAGHETSAHQLGNMVYALLTHEDQLSLLREQPELLPRAVEEL
LRFVPLGNGVGNARIALEDVELSGGTVRAGEGVVAAAVNANRDPRAFDDPDRLDITRE
KNPHLAFGHGAHYCLGAQLARMELRVAIGGLLERFPGLRLAVPADQVEWKTGGLFRGP
QRLPIAW
CYP107AD1 Streptomyces hygroscopicus
GenEMBL AF521896 4248..5489
ansamycin biosynthesis gene cluster
43% to 107X1
gene = gdnH
MSGRHFEQGERGTAMADTPEEELRILDPQSVAQELRKHGPPRQI
TMHGTTAWLVSRYEEVRDCLGHPGMSPAAAYAASQGQTNPVSGLFEDTVAGTNPPQHT
RLRRLLAKAFTVRRVESLRPRVQEITDTLLDRIAVDGRADLVSALAIPLPMQVICELL
GVPIADRTEFHQWADLMLTPPLDPDTAARSQDASAKLWTYMEDLAEARRKAPEDDLIS
DLMSAHEDDRLSHREVVATARMMLIAGYELTGSFISNAVFSLLSQPDQMELLRKDPEL
AGRGLEELLRHAGPGILIVRFANEDVEIGSVSIRAGDQVLLDMDAAHSDPAHFTDGER
LDLTRDSAVHLQFGHGIHYCIGAPLARVEGQIALESLVRRFPGLRLSVPAAEISHSKN
PFIRSLTALPVEFEAQQPVAG
CYP107AE1 Streptomyces sp.
GenEMBL BD133545
50% to 107X1
VILLKSLAANGLTASSCFTVSPLPIRSASPSIAFLTSSSERDSGVRNDRPSDAQPAIARF
RFPTPPHPRNPTQPHPTPPRPSPTDDPLQAPTFFADPYPTYARLRDTAPVLKVPTGSGGG
GRHSYVVTGYAEAREAFTDPRLSKDTASFFAGRPSQRDLHPAVSRNMLATDPPQHARLRA
LVTKAFTTGAVARLRPYISSLVDELLDTWPTHGTVDLIADLAVPLPVTVICELLGVPDSD
RASVRTWSSDLFAAGDPQRIDAASHAVGDYMTALVAAKRTAPGDSLLDDLIAVRDGQDHL
SEDELVSLAVLLLVAGHETTTNFIGNAALALLRHPESLAHLRAEPQLLGGALDELLRYDS
PVGIATFRFSTEALTLGGTEIPEGVPVLIAPGAANRDPDRFPDPDRLDLTRGATGHLAFG
HGIHRCLGAPLARAEAELALHAVITRYPQAALATPPETLPWRHTRLTRGLASLPITLRDH
PK*
CYP107AF1 Streptomyces collinus DSM2012
GenEMBL AF293355 24259..25518
Gene = rubU
rubrinomycin gene cluster
52% to 107B1
MARTDAPQAAPPADLFTPAFHQNPHEALAGLRRTAPAVPVMTPN
GLRTWLVTGHEHARALLADPRLSKDMRVGRDLIPRNFVDPDKQREFLAESGERSQFPH
VLSVHMLDSDPPDHTRLRRLVGRAFTARRVESLRPRITELTDELLDAMARHERLDLME
ALAFPVPFTVICWLLGVPPDDRAAFRRWSNLLVSGAGTDEVREASASMITYLTELIEA
KRNEPADDMLTDLVHARDAGDQLSSDELISMAFLLLVAGHETTVNLIGNGALALLTHP
EVREQLAADESLWPGAVEEFLRYDGPVTNATWRFTTEPVEVGSVTIPEGEFVTISIGA
AGRDPDRYPDPDRLDITRAHSGSVAFGHGIHHCLGAPLARLEGRIVLSRLFARLPGLR
LAADPDELSWRSSLMMRGLEELPVFTA
CYP107AG1 Streptomyces atroolivaceus
GenEMBL AF484556 complement(120436..121638)
Gene = LnmZ
leinamycin biosynthetic gene cluster
49% to 107E1
MSTEVETEKPAPVAYPFTGSEGLELSQSYAKLFEDGDPIRVQLP
FGEPAWLVTRYDDARFVLTDRRFSRHLATQRDEPRMTPRAVPESILTMDPPDHTRLRT
LVSKAFTPRRIESKRAWIGELAAGLVADMKAGGAPAELVGSYALAIPVTVICELLGVP
EDDRTRLRGWCDAALSTGELTDEECVQSFMDLQKYFEDLVKERRAEPRDDLTSALIEA
RDAHDRLAEPELIGLCISILIGGFETTASEISSFVHVLQQRRELWTRLCADPEAIPAA
VEELLRFVPFAANGISPRYALEDMTVGGVLVREGEPVIVDTSAVNRDGLVFDNADEVV
IDRADNRHMVFGHGAHHCLGAHLARVELQEALKALVEGMPGLRLSGDVEWKADMIIRA
PRVMHVEW
CYP107AH1 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
50% to 107L6 missing about 42 aa at N-term
clone name SP0749
CYP107AJ1 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
52% to 107B1 frameshifted C-term
clone name SP0908
108 Family
CYP108A1 Pseudomonas spp.
Swiss P33006 (428 amino acids) PIR S27653 A42971 (428 amino acids)
Also found a PIR cross-reference to EMBL S39894 but could not
retrieve it
Peterson J.A., Lu J.-Y., Geisselsoder J., Graham-Lorence S.,
Carmona C., Witney F., Lorence M.C.
Cytochrome P-450 terp: Isolation and purification of the protein
and sequencing of its operon.
J. Biol. Chem. 267, 14193-14203 (1992)
CYP108A1 Pseudomonas spp.
GenEMBL M91440 (6620bp)
Hasemann,C.A., Ravichandran,K.G., Peterson,J.A. and
Deisenhofer,J.
Crystal structure and refinement of cytochrome P450terp at 2.3A
resolution.
J. Molec. Biol. 236, 1169-1185 (1994)
CYP108B1 Caulobacter crescentus CB15
GenEMBL AE005918 GenPept AAK24465
NC_002696 complete genome 2703947..2705221
Complete genome sequence of Caulobacter crescentus
Proc. Natl. Acad. Sci. U.S.A. 98 (7), 4136-4141 (2001)
47% to CYP108A1
1 MTISTDIANT IIDPKAYADG DRIDQAFAHL RREAPLAVAQ PDGFDPFWVV TRHADILEVE
61 RQNELFHNGD RATVVTTIEP DKKVREMMGG SPHLVRSLVQ MDNPDHFAYR KITQGALLPQ
121 NLRALEARIR EIARGFVDRM AEHGDRCDFA RDVAFLYPLH VIMEVLGVPE SDEPRMLKLT
181 QELFGNADPD LNRTGKSVTD VGEGVDSIQS VVMDFMMYFN AITEDRRANP RDDLATLIAN
241 GKINGEPMGH LEAMSYYIIA ATAGHDTTSS TTAGALWALA ENPDQFAKVK ADPSLIPGLI
301 EESIRWVTPV KHFMRTATAD AELGGQKIAK GDWIMLSYPS GNRDEAVFED PFTFRVDRTP
361 NKHVAFGYGA HICLGQHLAR MEMRVLWEEL FARLDHVELD GAPTRMVANF VCGPKSVPIR
421 FKMH
CYP108C1 Saccharopolyspora spinosa strain NRRL 18395
No accession number
Istvan Molnar
Syngenta Biotechnology, Inc.
47% to CYP108B1 43% to CYP108A1
CYP108D1 Novosphingobium aromaticivorans
GenEMBL NZ_AAAV01000137
16805..18166 gene = Saro1710
47% to 108B1 39% to 108C1
MTNTSRLTKRRRPRRSDGKREGFMDSIPMVPAEVGRAVIDPKSY
GTWEPLLDRFDALRAEAPVAKVVAPDDEHEPFWLVSSFDGVMKASKDNATFLNNPKST
VFTLRVGEMMAKAITGGSPHLVESLVQMDAPKHPKLRRLTQDWFMPKNLARLDGEIRK
IANEAIDRMLGAGEEGDFMALVAAPYPLHVVMQILGVPPEDEPKMLFLTQQMFGGQDE
DMNKSGLKDLPPEQISQIVAGAVAEFERYFAGLAAERRRNPTDDVATVIANAVVDGEP
MSDRDTAGYYIITASAGHDTTSASSAGAALALARDPDLFARVKADRNLLPGIVEEAIR
WTTPVQHFMRTAATDTELCGQKIAAGDWLMLNYVAANHDPAQFPEPRKFDPTRPANRH
LAFGAGSHQCLGLHLARLEMRVLLDVLLDRVDSLELAGEPKRVNSTFVGGFKSLPMRW
KAA
CYP108E1 Ralstonia metallidurans
GenEMBL NZ_AAAI01000348
46192..47481 gene = Reut4024
41% to 108B1 39% to 108A1 48% to 108C1
MTIASDFDTELASHEIYSDPERMHEMFETLRREDPVHWTTAPGH
PPFWAVTKQADVIEVGKHPDVFIASPKSFLMNDVEQRVRIEETAATGGKLVRTMIHMD
DPDHKKYRGLTQSYFMPANIKRLESVIQERARALVGRLIEKGTSEFCSEIAVWYPLQI
VMTLLDVPESEHPYLLKLTQQFLAPKDPTLRRDGPDERGKGAVAKEYFAYFGKMLAER
RAAPLKEDLGSLIAHATVDGEPLPLMEAVSYYVILATAGHDTTSSSMCSGLYYLLTQP
GELDRLRARPELMPSAIEEMFRHGSPVKHFVRTATRDFELRGKKIQAGDEVALMYHSA
SFDEEVFDEPRSFRIDRGPNKHVAFGFGIHACLGQNLARASMRTFFTELLARTESIEV
VGKAEFIASNQVGGMKTLNIRVTPSKQSTTDRIEVAA
109 Family
CYP109A1 Bacillus subtilis
GenEMBL M24523 (3187bp)
Lewis,P.J. and Wake,R.G.
DNA and protein sequence conservation at the replication terminus
in Bacillus subtilis 168 and W23
J. Bacteriol. 171, 1402-1408 (1989)
Ahn,K. and Wake,R.G.
A unique open reading frame adjacent to the replication terminus
of the Bacillus subtilis W23 chromosome compared with Bacillus
subtilis 168
unpublished (1990)
Ahn,K.S. and Wake,R.G.
Variations and coding features of the sequence spanning the
replication terminus of Bacillus subtilis 168 and W23 chromosomes
Gene 98, 107-112 (1991)
CYP109B1 Bacillus subtilis
GenEMBL AF015825 Z99110
YjiB
also similar to CYP106A, both 106 and 109 are close
together on a tree
110 Family
CYP110A1 Anabaena sp. (a cyanobacterium)
Swiss P29980 (354 amino acids) GenEMBL M38044 (5933bp)
GenEMBL U38537, M13161
Lammers,P.J., McLaughlin,S., Papin,S., Trujillo-Provencio,C. and
Ryncarz,A.J.II.
Developmental rearrangement of cyanobacterial nif genes:
Nucleotide sequence, open reading frames, and cytochrome p-450
homology of the Anabaena sp. strain PCC 7120 nifD element
J. Bacteriol. 172, 6981-6990 (1990)
This sequence was later revised to give a complete P450 sequence
of 448 amino acids.
CYP110A1 Nostoc sp. PCC 7120 same as Anabaena sp. PCC 7120
GenPept BAB73407, C37842 (this entry missing N-term)
NC_003272 complete genome 1708114..1709493
1 aa diff to M38044
1 MLTQLPNPIS VPSWWQLINW IADPIGFQKK YSKKYGNIFS MQLAGIGSFV ILGEPQALQE
61 IFTQDSRFDV GRGNTLAEPL IGRTSLMLMD GDRHRRERKL LMPPFHGERL QAYAQQICLI
121 TNQIASEWQI GQPFVARSAM QKLSLEVIIQ IVFGLADGER YQQIKPLFTD WLNMTDSPLR
181 SSMLFLKSLQ KDWGTWTPWG QMKHKQRSIY DLLQAEIEEK RTKENEQRGD VLSLMMAARD
241 ENGQAMTDEE LKDELLTILF AGHETTATTI AWAFYQILKN VNVQEKLQQE LDRLGANPNP
301 MEIAQLPYLT AVSQETLRMY PVLPTLFPRI TKSSINIAGY QLEPDTTLMA SIYLIHYRED
361 LYPNPQQFRP ERFIERQYSP SEYIPFGGGS RRCLGYALAL LEIKLVIATV LSNYQLALAE
421 DKPVNVQRRG FTLAPDGGVR VIMTGKKSLK FEQSSKIFN
CYP110A2 Anabaena variabilis (a cyanobacterium)
GenEMBL U38478 (1743bp)
Lammers, P.J. and Duran, S.
possible alkane/fatty acid hydroxylase
CYP110B1 Nostoc sp. PCC 7120 Same as Anabaena
GenPept BAB75445, AC2274
NC_003272 complete genome complement(4523158..4524546)
45% to CYP110A2 53% to 110E1 49% to 110D1 47% to 110C1
1 MHLPKGPQTP VFVQVLRWVF SPMSFLEDCA KRYGDIFSVK LAKDVPAIVF LSNPKDIQQI
61 LTNDNNQLDS PGDWNDLFEP LLGKRSVITL SGAEHQRQRQ LLMPPFHGER MRGYSQVITD
121 VTEKVISQHQ IGQPFQVRSV TQAITLRVIM QAVFGLYEGS RAEKLQHLLS DLLEKSSSPF
181 SVALLYFPSL RRDFGPIKFW GEQVQIQQQA DELIYQEIQE RRENPDPSRT DILSLLMDAR
241 DADGQPMTDV ELRDELMTLL VAGHETTATA LAWAMYWIHK LPPVKARLLE ELDSLGDNPD
301 STTIFKLPYL NAVYSETLRI YPVAMLTFAR RVIETMALGG YELPPGTPVL GSIYLTHHRE
361 DLYPEPKKFK PERFLERQFS PYEYLPFGGG TRRCLGLAFA QWEMKLALAK ILTSYELELV
421 NNSVEVRPKR RGLVTGPHRP IEMVIKSQRQ ITSRILETTT VS
CYP110B2 Nostoc punctiforme
NZ_AAAY02000005 GenPept ZP_00111619.1
complement(58895..60277) gene = Npun6097
75% TO 110B1
MKLPKGPQSPAVLQMLRWITSPMSFMETCAKRYGDMFTIRLDSK
SPPLIFVSKPEVLEQILTNDIKGLEAPGDTNLVFESLLGKHSVITISGAEHQRQRQLL
LPPFHGERMRSYSQIISDITEKVISQYQIGQPFNIRSVTQAITLRVIMQAVFGLDEGP
RAEKLQHCLAEMLEKGSSVLSAALLYFPALQRDFGPINFWGKQMRRQQAADKLIYEEI
RERQEQPDPSRTDILSLLMAARDEAGQPMTDEKLRDELMTLLVAGHETTATALAWAFY
WIQKIPTVRQKLLKELDSLGDNPDPSTIFKLPYLNAVCSETLRIYPVAMLTFARVVRT
PLSLGGYELEPGIGVIGSIYLTHHREDLYPEPKQFKPERFLERQFSPYEYLPFGGGAR
RCIGLAFAQLEMKLALAKILSTRELELVDNSEVRPKRRGLVTGQDRPIQMVVTSQRQV
KFPILQTATV
CYP110C1 Nostoc sp. PCC 7120 Same as Anabaena
GenPept BAB76385, AF2391
NC_003272 complete genome 5587079..5588485
48% to CYP110A2 49% to 110E1 47% to 110B1
1 MKYQIQRPNP LKTHPFLQKL QWIADPVEYM KKASLQHPDM FTAEVIGFGD TVVFVSHPQG
61 IQTLFANDRK KLVAVGEANR ILYPLVGNNS MFLLEGVKHK QRRQLLMPSF HGERMREYGH
121 LIRNITENLF SQLQQDVTFS ALTAMREISM QVILQAVFGF YEGERCQQFK HLLPIFLSEL
181 FQSPLASSIL FFPSLQKDLG NLTPWGRFVR QREKIDKLLY AEIAERRQEI NSDRIDILSL
241 LISARDETGD SMSDKELRDE LITLMISGHE TTGTAMAWSL YWILQTPEVF QRLIQELDSL
301 GDSPDPMSIF RLPYLTAVCN ETLRINPVAM LTLPRVVKEP IELLGNRLET STTVVGCIYL
361 THHREDLYPE SKLFKPERFL KREFSQYEFM PFGGGVRGCI GQALAMFEMK IVLATVLSRY
421 QLALADRKPE RPQRQGFTLT PTNGVKMLIT GQHKRQNYSM AASTTFNA
CYP110C2 Nostoc punctiforme
GenPept ZP_00108280.1
GenEMBL NZ_AAAY02000070 complement(34550..35941)
gene = Npun2703
60% to 110C1
1 MQLPNILKSP SLLQKLHWVS DPIGYMENAA QEYPDIFTGK IVGFGDTVVF VNHPQAIQEI
61 LTNDRKKFTA VGELNGILKP LLGDNSVLML ESDRHKRQRQ LVTPSFHGER MQAYGQLICN
121 VSKKIFNQLP LNKPFVARNL TKEISLQVIL QSIFGFYEGE KIQKLRQLLP LLLELFESPL
181 SSSLFLFSFL QQDLGAWSPW GNFLRVREKI DQFLYTEIAE CQQQADPERI DILSLLISCR
241 DEAGQPMTDQ ELRDQLITLI LAGYDTTATA MAWGLYWIHK QPLVCEKLLQ ELDTLGDSPD
301 PMSISRLPYL TAVCNETLRI HPVTMFSFPR VVQEPLELLG HSLEPGTILL PSIYLTHHRE
361 NLYPQSKQFK PERFIERQFS PYEFLPFGGG VRRCMGEALA LFEIKLALAT IVSHYHLALV
421 DQRPEQPQRR GFNLAPGSGV KMVMTDQRAR KESLINMTTT PLS
CYP110D1 Nostoc sp. PCC 7120 Same as Anabaena
GenPept BAB76465, AF2401
NC_003272 complete genome 5678382..5679743
48% to CYP110A1 53% to 110E1, 49% to 110B1
1 MTVTQNLPNG PRIPRLLRLF KFITQPIQYV EDFAKVYGDN FTIWGSGESY FVYFSHPQAL
61 EQIFTNVSCF ESSGGGSPLL ELLLGKNSLI LLEGDRHQRQ RQLLTPPFHG ERMRAYGQTI
121 REITQQVTQA WQMGKPFNIR ASMQEITMRV ILRVVFGVDE GELFQELRQL LTTLLDFMGS
181 PLMSSTFFFS FTQKDYGAWS PWGRMVRLIK KIDQLIYALI AQRRAEFGEN RQDILSLLIS
241 ARYDDGQPMS DVELRDELMT MLVAGHETTA SALTWAFYWI DSVPEVREKL FQELDTLNDD
301 SEPSIIAKLP YLTAVCQETL RFYPIVLNAF FRRTKNPMEI MGYKLPKATL VVPSIYLAHH
361 REEVYPQSKQ FRPERFLEKQ FSPYEYLPFG GGNRRCIGLA FAQYEMKIVL ATILSQFQVS
421 RLSKRPVQPV RRGLTLAAPG GMKMVANKRM RNS
CYP110D2 Nostoc punctiforme
NZ_AAAY02000028 GenPept ZP_00109203.1
52704..54170 gene = Npun3650
68% to 110D1
MNIPLSVTLSNMKSRNNKIQKPSNLQTPMTATYNLPDGPQMPRW
LRTIKFISQPVKYVDDFAKTYGDTFTIRSSRSDNHIVYFSQPQALEEIFTADSRHFEV
GRGNTGLRFLLGDRSFMLVDGDRHQRQRQLLAPPFHGERMRAYGEDIRKITQQVSHEW
KIGKPFNIRESMQEITLRVILRVVFGLNEGELFEELRRSLSDLLDFISSPIMSSAFFF
RFIQKDFGAWSPWGRILLQRQKVDLLIYTLLRERRAQTDQNRQDILSLMMAARYDDGQ
GMSDEELHDELMTLLVAGHETTASALTWAFYWIDHLPEVREKLLQELNTIGVNPDLSS
VAKLPYLTAVCQETLRIYPIAMTAFVRIVKTPITIMGYELREGTAIVPSIYLAHHREE
VYPQSKQFKPERFLERQYSPYEYLPFGGGNRRCIGMAFAQYEMKIVLATVLSEFQVSL
VNKRPVHPVRRGLTVATPAGMRMVATPQVKRANTPALV
CYP110D3 Trichodesmium erythraeum
GenPept ZP_00074554.1 GenEMBL NZ_AABK02000068
complement(10019..11407) gene = Tery3870
54% to 110D1
MTLPDGPSLSPLQRRLRTWKFIFSPLSAIEERYSEYGDIFRTNT
NSLYPFIYFCNPKAIQQIFTADPDTFTSGSINGILKYFVGLNSLLLQDGDRHKRQRKL
LMPPFHGDRMRKYGDLIYNITSNVISQWKIEQPFPIRKSTQEISLKVILAAVFGLDQE
GKSYEKLRVLMSDLLDSMSSPLSSTFLFFNFLRKDWGPWSPWGRFLRKKQELHELIIA
EIQTAKKEGNHRDDILSLLLEARDEAGNAMSDEEIKDELLTMLFAGHETTASALAWAL
YWIDMIPSVGEKLMAELATIPSNSDQVAITKLPYLSAICQETLRIYPIAMNAFPRVVQ
KPIEIMGYQLEPGMVAIVPIYLTHHREDIYPEPKKFKPERFLERQFSPYEYLPFGGGS
RRCIGSAFALFEMKLVLATILSQWELKLLPNQRISPVRRGLTMAPPANMRMVVKPKKS
WQKVSQPILTSG
CYP110E1 Nostoc sp. PCC 7120 Same as Anabaena
GenPept BAB76532, AI2409
NC_003272 complete genome 5753083..5754450
50% to CYP110A2 53% to CYP110B1 53% to 110D1
1 MKLPDSPKIP KFMQLVQWIY QPLQLMEASA KAHGDSFTLW LTNKRPIVFL SNPQAIQELF
61 TTPLEQLDAR GTAQVLQPLL GENSLLLLSG ETHQRQRKLL TPPFHGDRMR AYGDIITNIT
121 KEVISNWQLG KPFSVRDSMQ EITLRVILQA VFGLREGERY TQLQKRLCDI LDLSGSALRS
181 TLSFLPALQI DLGRWSPWGH FLRQREAIDQ LLYAEIQDRR DHPDPSRTDI LSLMMAARDE
241 NGEAMTDVEL RDELMTLLVA GHETTASALT WALYWIHKLP QVREKLLAEL DNFGDNGDVN
301 EITRLPYLTA VCQETLRIYP IAMVTIPRIT KTNLEIGGHQ FAPGTMLVGC IYLMHRRPDL
361 YPQPQEFKPE RFLEKQYSLY EYLPFGGSNR RCVGMAFALY EMKLILATVL ANVDLALVDN
421 YPVKPTRRGV TLAPSGGKWL IATAQHQKIK NPVEV
CYP110E2 Nostoc punctiforme
NZ_AAAY02000088 GenPept ZP_00107327.1
complement(18173..19567) gene = Npun1723
58% TO 110E1 55% TO 110B1
MSLLKLPNGPQTHPWIQMYQWLTNPLEYMEACTKRYGDIFTLKL
GQNFAHQVFISNPQAIQQIFTTDPKQLDSGESAGIKAPLLGQQSLLALDGKPHQRQRK
LLTPPFHGERMLAYGELIREITEQVSSQWQVGETFAVLPSMQAISFQVILKAVFGLED
GPRYKKLNELLIKILNPKIPLLRTVLLIFPSMRQDLGAWSPWGKYLRLRQQIDQLIYA
QIQERKAQPNLSGTDILSLMMAARDEAGEPMTDLELRDELMTLLVAGHETTATSLSWA
LYWIHHRPQVREKLLQELDNLGEKPDPNAIFRLPYLNAVCSETLRLYPVAMSALNRLV
KSPLQIGEYNFEPGTILIPSIYLTHHREDLYPESKQFKPERFLERQFSPYEYLPFGGG
NRRCIGMAFALFEMKLVLATVLSRWQMELADSKPVRPVRKGLLFSPAGGVQMVVKGKR
LQNQPILQTSSSSV
CYP110E3 Trichodesmium erythraeum
GenPept ZP_00072591.1
GenEMBL NZ_AABK02000017 complement(<3..1016)
53% to 110E1 missing C-terminal 121 aa (runs off end of clone)
1 MIKLPGPKSP ALTQILQWTA KPIKFMEKCA REYGDTFEVK LNYPIVFISH PKAIEEIFKA
61 NPKKFDCGSS NKLAQPLLGD YSLLLLDDIP HQRQRKLLMP PFHGKRMQAY GELICNVAQE
121 VASKWEIGQV FSMREFTAEI SLKVILQAVF GLYEGERYSK LEKLLGSLLE SLSSPLKTSM
181 LFFQFLQIDL GPWSPWGNFI KNREEIYELL CAEISERRQK LDPERSDILT MLLLARDEEG
241 EGMSDIELRD ELMTLLIAGH ETTATSLSWA FYWIHHQPEI YQKLSRELET FGDDLNPMTV
301 INLPYMNAVC SETLRIYPVV IIVSPRKTKL PITIMGQT
CYP110E4 Gloeobacter violaceus PCC 7421
GenEMBL AP006578 complement(257348..258724)
gene = gll3063
NC_005125 complete genome complement(3256348..3257724)
locus_tag = gll3063
71% to 110E5 55% to 110E1
MSLPPGPSSPSPFQLMQWIGCPTDYLHTTAARYGDPFTMRVGVF
PPLVMFSDPRAIQQLFTAEAGTFDAGASNVALRPTLGANSLLLLDGERHQQQRRLLTP
PFHGERMRAYGELIRQVTEEVIVRWQPGKPFLVRNAMQRISLAVILQAVFGLHDGTRL
VRLRQALGSMLDAMSSPLSMAMLLMLPEDFGPWSPRARLQAHLGAIDELLYAEIRERR
EHFDAGAGDILGLLLAARDEAGAAMGDAELRDELMTLLVAGHETTATAMAWALYWIHY
LPQVRERLLAELDSLGSDPDPEAIARLPYLGAVCSETLRIYPVALIASPRVARHTVRI
LERDYEAGTRLAAGIYLAHHRPETYPEPERFRPERFLERTFSPYEFVPFGGGSRRCIG
MAFALYEMKLVIATVLLERDLRLVQPRLLRPVRRGVTLAPPEGLYLVPTGERSASRLL
SRTSTAGQ
CYP110E5 Gloeobacter violaceus PCC 7421
GenEMBL AP006578 complement(258800..260176) gene = gll3064
NC_005125 complete genome complement(3257800..3259176)
locus_tag = gll3064
71% to 110E4 55% to 110E2
MSLPAGPASPPPLQLLQWIGRPTDYLERTARRYGDPFTMRLGLH
SPVTGVFFSSPEAFQQLFNTEPGLFDSGGANASSTFNLLFGTNSLILLDGERHQQQRR
LLTPPFHGERMRSYGELIRTLAEQVTARWNLGTPFQARRSMQRISLGVILKAVFGLHD
GTRYLRVCRLLGNLIDASASPLLFGLRLIFPQDAGPMSPMGQLKAQIDAIDELLYAEI
RERRERPDPRADDILSLLMAARDEAGQGMGDVELRDELMTLLVAGHETTATAMAWALY
WIHRLPQVRERLLAELDSLGSDPDPEAIARLPYLGAVCSETLRIYPVAMVAFARVPRR
PVRILDREYPAGTFLIPNIYLAHRRPEAYPDPERFRPERFLERTFSPYEFVPFGGGSR
RCIGVAFALYEMKLVLATVLSRVELRLADPRPRLPVRRGLTLAPPEDLHLIPTALRSG
HRDLLPAC
CYP110F1 Nostoc punctiforme
NZ_AAAY02000005 GenPept ZP_00111618.1
complement(57031..58407) gene = Npun6096
48% TO 110E1 48% TO 110D1
MKILDSLTTPSLLQTLQLIAKPTKTLENYATKYGDIFTMRVMGL
KSPPIVFFSHPQAISDCFAVPAHKLDFKKATHVFKPLFGENSIVFKEARSHQQQRQLL
LPAFHGDNLKSYGQAICQIAEELTQSWTSGTNICIHKLMSKITLEIILQVVFGITHGV
RYQQLKEQLSALLEDVTKPWYSSLFFFPSLQKDLGAWSPWGIFLKRREQIDKLIYAEI
SERRWQNDAMRTDILSLLMSAHDVNGQQMTDEELRDQLVSLLLLGYETTSGVLAWIFY
LIHSHPEVKHRLMQELSTLDNLTNPEAITQLPYLTAVCQETLRIHPIALICTPRMLKE
PVEIMGHKFTSETVLVPCIHLAHRRTDTYPEPEQFRPERFLNQKFSPYEYLPFGGGYR
GCIGAAFSMYELKLVTAIILSRFELSLTDKRPAYPVRRGITIVPSGGVKMVVTKKAKF
KRQTILST
CYP110G1 Trichodesmium erythraeum
GenPept ZP_00074734.1
GenEMBL NZ_AABK02000081 complement(2404..3738)
42% to 110C1
1 MKQVCALKTP LWLQRFNYIT NPVSYWQKAY SSYKDAFYAQ GINFGKPLMV FYTPSAAKQI
61 IENCQGDLTT TSFDSELTAI FGDSSFFILE GTNHKKMRKL LIPALHGKHI KTYGELICNL
121 VNNLIENLPF NQSFSALEIA QEISMQVMIK LLFGNYQQER YQKIKQLMIN MVSLFAANVF
181 GFPLFFKFLQ QDLGLVSPWG NFLQQRRKIQ QLIYQEIAER RNHPNQERTD ILSLLMTAQD
241 EKGNFLNDEE LLGQLLSLLF TGNESTAASI AWSWYEVYRN SKIKEKLLEE INNLGDSPEP
301 LSLFNLPYLS AVCNETLRKY PVTMFMIPRI VKNTTEINGY QLDKGMLVTV GTYILHHRED
361 IYDQPEEFKP ERFIEHRFSS FEFLPFGRGM RGCIGADIAL YQMKLTLATI ISHHRLELTN
421 YGQIFPKRRN TILTPIKLRI IKAC
111 Family
CYP111A1 Pseudomonas incognita
GenEMBL L23310 (2080bp)
Ropp,J.D., Gunsalus,I.C. and Sligar,S.G.
Cloning and expression of a member of a new cytochrome P-450
family: cytochrome P450lin (CYP111) from Pseudomonas incognita.
J. Bact. 175, 6028-6037 (1993)
CYP111A2 Novosphingobium aromaticivorans
GenEMBL NZ_AAAV01000134
complement(20145..21356) gene = Saro1618
65% to CYP111A1
MLDLKNPDTYQGGVPYAALQDLRAEGPVHWNPESDGAGFWAVLG
HDEIVAVSRQPDLFSSAFENGGHRIFNENQVGLTGAGESAIGIPFISRDPPSHTQYRK
FVMPALSPARLQGIEERIAKRVERLFAQVPLGETVNILPLLTVPLPLLTLAELLGVPA
DLWPDLHRWTDAFVGEDDPDFRQSPEAMQAVLAEFMGFATALFEDRRANPGPDIASLL
ANTEIRGEPAPLRDFIANLILALVGGNETTRNSINHTMIALAENPGQWDILRADPSLM
TAAVKEMVRFASPVIHMRRTAMRDTQLGQQAICKGDKVVIFYPAGNRDPAVFENPDRF
EITRPVRQHLAFGSGAHVCVGSRLAEMQLRLAFAEMARHVRAFEVVGEPSRVRSNFIN
GFKRLEVRLLV
112 Family
CYP112A1 Bradyrhizobium japonicum
GenEMBL L02323 L12971 U12678 (11,715bp)
NC_004463 complete genome 2317922..2319127
Tully,R.E. and Keister,D.L.
Cloning and mutagenesis of a cytochrome P-450 locus from
Bradyrhizobium japonicum that is expressed anaerobically and
symbiotically
Appl. Environ. Microbiol. 59, 4136-4142 (1993)
Note: called BJ-1 see CYP114, CYP115P, CYP117
CYP112A2 Rhizobium sp. NGR234 plasmid pNGR234a
GenEMBL AE000083
NC_000914 complement(233666..234868)
Gene = y4lD
Freiberg,C., Fellay,R., Bairoch,A., Broughton,W.J., Rosenthal,A.
and Perret,X.
Molecular basis of symbiosis between Rhizobium and legumes
Nature 387 (6631), 394-401 (1997)
about 92% identical to 112A1
MPEQPLPTLPMWRVDHIEPSPTMLALRANGPIHNVRFPRGHEGW
WVTGYDEAKAVLSDAAFRPAGMPPAAFTPDCVILGSPGWLVSHEGGEHARLRTIVAPA
FSDRRVKLLAQQVEAIAAQLFETLAAQPQPADLRRHLSFPLPAMVISALMGVLYEDHA
FFAGLSDEVMTHQHESGPRSASRLAWEELRAYIRGKMRDKRQDPGDNLLTDLLAAVDR
GEATEEEAIGLAAGMLVAGHESTVAQIEFGLLAMLRHPQQRERLVGNPSLVDKAVEEI
LRMYPPGAGWDGIMRYPRTDVTIAGVHIPAESKVLVGLPATSFDPRHFEDPEIFDIGR
DAKPHLAFSYGPHYCIGMALARLELKVVFGSIFQRFPALRLAVAPEELKLRKEIITGG
FEEFPVLW
CYP112A3v1 Mesorhizobium loti
GenPept NP_106888
95% to 112A2 Rhizobium sp. NGR234
1 MSEQPLPTLP MWRVDHIEPS PEMLALRANG PIHHVRFPSG HEGWWVTGYD EAKAALSDAA
61 FRPAGMPPAA FTPDSVILGS PGWLVSHEGG EHARLRTIVA PAFSNRRVKV LAQQVEAIAA
121 QLFETLAAQP QPADLRRHLS FPLPAMVISA LMGVLYEDHA FFAGLSDEVM THQHESGPRS
181 ASRLAWEELR AYIRGKMWDK RQDPGDNLLT DLLAAVEQGN ATEEEAIGLA AGMLVAGHES
241 TVAQIEFGLL AMFRHPQQRE RLVGDPSLVD KAVEEILRMY PPGAGWDGIM RYPRTDVTIA
301 GVHIPAESKV LVGLPATSFD PRHFDDPEIF DIGRDENPHL TFSHGPHYCI GMALARLELK
361 VVVGSIFQRF PALRLAVAPE ELKLRKEIIT GGFEEFPVLW
CYP112A3v2 Mesorhizobium loti
GenEMBL AL672112 complement(85404..86606)
Strain R7A symbiosis island
Gene = msi071
2 DIFFS with CYP112A3v1
CYP112A4 Rhizobium etli symbiotic plasmid p42d
NC_004041 55365..56645
89% to 112A3
gene = cpxP2
MSEQSLPTLPMWRVDHIEPSPEMLALRAKGPIHRVRLPSGHECW
WVTGYDEAKAVLSDAAFLPAGMPPADFTPDSVILGSPGWLVSHEGDEHARLRTIVAPA
FSNSRVKLLTQQVEAITVQLFDTLAVQPQPADLRRHLSFPLPAKVISALMGVPFEEHA
FFAGLSDEVMTHQHESGPRSASGLAWEELRAYIHGKIRGKRQDPGDNLLTDLLAAVDQ
GKATEEEAIGLAAGVLVAGHESTVAQIEFGLLAMFRHPQQRERLVRDPSLVDKAVEEI
LRMYSPGAGWDGIMRYPRTDVTIAGVHIPAESKVLVGLPATSFDPCHFKDPEVFDIGR
DANPHLAFSYGQHNCIGAALARLELKAIFGSIFQRFPALRLAVAPEELKLRKEIITGG
FEEMPVLWCGRPPASQSSHLAAPGAHRSDQPLDR
113A Subfamily
CYP113A1 Saccharopolyspora erythraea
GenEMBL L05776 (1320bp) S51613 U82823 PIR B40634 (412 amino acids)
Stassi,D.L., Donadio,S., Staver,M.J. and Katz,L.
Identification of a Saccharopolyspora erythraea gene
required for the final hydroxylation step in erythromycin
biosynthesis.
J. Bact. 175, 182-189 (1993)
eryK erythromycin C-12 hydroxylase
Note: two different database entries have different start
codons. Neither is ATG.
113B Subfamily
CYP113B1 Streptomyces fradiae
GenEMBL U08223 (7082bp)
Merson-Davies,L.A. and Cundliffe,E.
Analysis of five tylosin biosynthestic genes from the tylIBA
region of the Streptomyces fradiae genome.
unpublished (1994)
CYP113B2 Streptomyces caelestis
cytochrome P-450 hydroxylase homolog (nidi)
GenEMBL AF016585 CDS complement(1-396) N-term only, 60% to 113B1
MVDSVTGPMELSKDANAKELLDWFSHNRTHHPVFWDEGRQAWQV
FRYDDYLTVSNHPEFFSSDFTEVAPTPPELEMILGPGTIGALDPPAHGPMRKLVSQAF
TPRRMAGQEQRIRVIAEELLDRVRGQKTIA
CYP113C1 Streptomyces virginiae
GenEMBL AB072568 4994..6202
46% to 113A1
gene = visD
MAQQTPPAPPSMADGGKAMLAWLRTMRDEHPVHEDQYGVFHVYR
HSDVLAVTSDPAVFSSDLSRLRPDSSALSEEILSVIDPPLHRKLRSLVSQAFTLRTVA
DLEPRVTELAGRLLEKVEGSEFDLVGDFAYPLPVIVIAELLGVPAEDRELFRGWSDRM
LSMQVDDPLEIQFGDEAGEDYERLVKEPLKEMHAYLQRHVDARRETPGDDLLSRLVTA
EIAGERLTDRQIVEFGALLLMAGHVSTSMLLGNTVLCLEENPETAAALRADRALISGV
IEEVLRMRPPITVAARVTTGEVVVGGVTIPKDRMVMASLLSANHDERHIQDPEVFDPR
RSPNPQLAFGHGIHYCLGGPLARLEGRVALEMLLDRFEDIRVTPGAPYDFHREGLFVP
ARSPLTVRRG
114 Family
CYP114A1 Bradyrhizobium japonicum
GenEMBL L02323 L12971 U12678 (11,715bp)
NC_004463 complete genome 2319222..2320511
Tully,R.E. and Keister,D.L.
Cloning and mutagenesis of a cytochrome P-450 locus from
Bradyrhizobium japonicum that is expressed anaerobically and
symbiotically
Appl. Environ. Microbiol. 59, 4136-4142 (1993)
Note: called BJ-3 see CYP112, CYP115P, CYP117
CYP114A2 Rhizobium sp. NGR234 plasmid pNGR234.
GenEMBL AE000082 CDS comp (9861..11264) gene = y4lC
NC_000914 complement(232170..233573)
cytochrome P450 BJ-3 homolog" 90% to CYP114A1
MDMQETTTACADAFAELASPACIDDPYPFMRWLREHDPVHRAAS
GLFLLSRHADICWALKATGDAFRGPAPGELARYFPRAATSLSLNLLASTLAMKEPPTH
TRLRRLISRDFTMREIDNLRPSIARFVAARLDGMAPALERGEAVDLHRQFALALPMLV
FAELFGMPQDDMFGLAAGIGAILEGLSPHASDPQLAAADAASARMKAYFGDLIQRKCI
DPRHDIVATLVGAHDDDADTLSDAELISMLWGMLLGGFATTAATIDHAVLAMLAYPDQ
RHWLQGDAAGVEAFVEEVLRCDAPAMFSSIPRIAQSDIELSGVVIPKNADVRVLIAAG
NRDPDAFADPDRFDPARFYGTSPGMSTDGKIMLSFGHGIHFCLGAQLARVQLAESLPR
IQARFPTLTVAEQPTREPSAFLRTFRALPVRLHAQGDSPRLTSAFLNGQRGVEGGASF
EHGDGERRSATDRRAQP
CYP114A3v1 Mesorhizobium loti
GenPept NP_106889
92% to 114A2
1 MDVQETTAAC RDAFAELASP ACIQDPYTFM RWLREHDPVH RAASGLFLLS RHADIYWALK
61 ATGDVFRGPA PGELARYFPR AETSLSLNLL ASTLAMKEPP THTRLRRLIS RDFTIRQIDN
121 LRPSIARIVA ARLDGMAPAL ERGEAVDLHW EFALAVPILV FAELFGMPQD DMFGLAAGIG
181 AILEGLSPHA SDPQLAEADA ASARVQAYFG DLIQRKRTDP RNDIVSMVVG AHDDDADTLS
241 DAELISMLWG MLLGGFATTA ATIDHAVLAM LAYPEQRHWL QGDAVGVKAF VEEVLRCDAP
301 AMFSSIPRIA QRDIELGGVV IPKNADVRVL IAAGNRDPDA FSDPDRFDPA RFYGTTPGMS
361 TDGKIMLSFG HGIHFCLGAQ LARVQLAESL PRIEARFPTL ALAEQPTREP SAFLRTFRAL
421 PVRLHAQGG
CYP114A3v2 Mesorhizobium loti
GenEMBL AL672112 complement(84020..85309)
Strain R7A symbiosis island
Gene = msi070
10 DIFFS with CYP114A3v1
CYP114A4 Rhizobium etli symbiotic plasmid p42d
NC_004041 56651..58252
90% to 114A3
gene = cpxP3
MDVQDTTAACHDAFAELASPACIQDPYPFMRWLREHDPVHRAAS
GLFLLSRHADIYWAFKATGDAFRGPAPSELARYFPRAASSLSLNLLASTLAMKEPPTH
TRLRRLISRDFTVGQIDNLRPSIARIVAARLDGMAPALERGEAVDLHREFALALPMLV
FAELFGMPQDDVFELSAIVSAILEGLSPHASDPQLAAADVASARVKAYFGDLILRKRA
DPRRDIVSTLVGAHTDDADTLSDAELISMLWGMLLGGFATTAATIDHAVLAMLAYPEE
RHWLQGDAAGVEAFVEEVLRCEAPAMFSSIPRIAQRDIELHGVVIPKDADVRVLIAAG
NRDPDAFADPDRFDPVRFYGTRPGMSSDGKIMLSFGHGIHFCLGAQLARVQLAESLPQ
IQARFPTLALAEQPTREPSAFLRTFRALPVRLHAQAAAEVRVVVDQDLCGTTGQCVLT
LPGTFRQREPDGVAEVCMATVPQALHAAVRLAASQCPVAAIRVIESEAGDDHCTNPGP
TPSPADAERHAAKDLRNPGEHDGTI
115 Family
CYP115A1P Bradyrhizobium japonicum
GenEMBL L02323 L12971 U12678 (11,715bp see 1351-1578)
NC_004463 complete genome 2317600..2317905
Tully,R.E. and Keister,D.L.
Cloning and mutagenesis of a cytochrome P-450 locus from
Bradyrhizobium japonicum that is expressed anaerobically and
symbiotically
Appl. Environ. Microbiol. 59, 4136-4142 (1993)
Note: called BJ-2 see CYP112, CYP114, CYP117
Note: This gene fragment has a perfectly good P450 sequence
of 76 amino
acids that includes the C-terminal up to a stop codon.
This may be a fragment of another intact P450 that was
broken up or
rearranged during cloning. A pseudogene would be expected
to have lost
integrity slowly and the whole gene should fade at about
the same rate.
This fragment is good but no upstream region continues it.
GDADRFDVTRRHNPHLSFGQGPHFCLGAALARLELGCAFPAL
FVRLEHLALTIAAEDVVYMPSYVIRCPQRLPVTFRPSIA
CYP115A2v1 Mesorhizobium loti
GenPept NP_106680 88% to CYP115A1P 39% to 154C1 41% to 154A1
1 MPAAPTQLDR LSSAILRQGG MARVSLPGDV VTWAAARHQT LRQMLSDQRF NKDWRQWRAL
61 QDGEIPEDHP LIGICKVDNM TTAHGADHRR LRGLLSSSFA PSRIALLAPR VEQCVDRLLA
121 EMAQRGGSAD LMSEFAAPLP TNVIAELFGL PDEQREEIVA LTYSLASTSA TAEEVRQTRQ
181 RIPEFFRRLI ALKRGQLGDD LASALIVARD KGELVSDTEL IDMLFMVLSA GFVTTAGVIG
241 NGVLALLTHP QQLHLVRSGQ VPWSQAIEEI LRWGTSAANL PFRYATQDVE IDGCLVRRGD
301 AVLMAFHAAN RDEKAFGPGA NRFDVTRRHN PHLSFGEGPH SCLGAALARL ELRCAFPPLF
361 GRLEDLALTI AAEDVVYMPS YVIRCPQRLP VSFRPSVA
CYP115A2v2 Mesorhizobium loti
GenEMBL AL672113 41375..42607
Strain R7A symbiosis island
Gene = msi159
10 DIFFS with CYP115A2v1
CYP115A3P Rhizobium etli symbiotic plasmid p42d
NC_004041 54883..55296
70% to 115A1P 70% to 115A2
gene = cpxP1 pseudogene C-terminal
ANSYGRPTYGDTDMFDFNRLQNPHLPLGQGPHLCLGAALARLELGSVFPPPFVRPEDLALAIAAE
116 Family
CYP116A1 Rhodococcus erythropolis
GenEMBL U17130 (6458bp)
Nagy,I., Schoofs,G., Compernolle,F., Proost,P., Vanderleyden,J.
And De Mot,R.
Degradation of the thiocarbamate herbicide EPTC (S-ethyl
dipropylcarbamothioate) and biosafening by Rhodococcus sp. NI86/21
involve an inducible cytochrome P-450 system and aldehyde
dehydrogenase.
unpublished
CYP116B1 Ralstonia metallidurans
GenEMBL NZ_AAAI01000322
25751..28093 gene = Reut3205
52% to CYP116A1 with C-term. Extension
extension may contain a reductase and a ferredoxin component
MPQTNAPASSGSCPIDHSALRAPNGCPISHQAAAFDPFEDGYQQ
DPPEYVRWSRAQEPVFYSPKLGYWVVTRYDDIKAIFRDNITFSPSIALEKITPTGEAA
NAVLASYGYAMNRTLVNEDEPAHMPRRRALMEPFTPAELAHHEPMVRKLTREYVDRFI
DTGRADLVDEMLWEVPLTVALHFLGVPEEDMDLLRQYSIAHTVNTWGRPKPEEQVAVA
HAVGNFWQLAGRILDKMREDPSGPGWMQYGLRKQRELPEVVTDSYLHSMMMAGIVAAH
ETTANASANAIKLLLQHPDVWREICEDPALIPNAVEECLRHNGSVAAWRRLVTRDTEV
GGMSLAAGSKLLIVTSSANHDEHHFADADLFDIHRDNASDQLTFGYGSHQCMGKNLAR
MEMQIFLEELTSRLPHMRLAGQRFTYVPNTSFRGPEHLWVEWDPARNPERTDPTVLAP
RDAVRIGEPTGGTTGRTLIVERVETAAQGVSRIRLVSPDGRALPRWSPGSHIDIECGH
TGISRQYSLCGDPADTSAFEIAVLREPESRGGSAWIHASLRAGDKLKVRGPRNHFRLD
ETCRRAIFIAGGIGVTPVSAMARRAKELGVDYTFHYCGRSRASMAMIDELRALHGDRV
RIHAADEGQRADLAQVLGAPDANTQIYACGPARMIEALEALCATWPEDSLRVEHFSSK
LGTLDPSREQPFAVELKDSGLTLEVPPDQTLLATLRAANIDVQSDCEEGLCGSCEVRV
LAGEIDHRDVVLTRGERDANNRMMACCSRAAKGGKIVLGL
CYP116B2 Rhodococcus sp. NCIMB 9784
GenEMBL AF459424
66% to 116B1 over full fusion protein length
extension may contain a reductase and a ferredoxin component
MSASVPASAPACPVDHAALAGGCPVSANAAAFDPFGSAYQTDPA
ESLRWSRDEEPVFYSPELGYWVVTRYEDVKAVFRDNILFSPAIALEKITPVSAEATAT
LARYDYAMARTLVNEDEPAHMPRRRALMDPFTPKELAHHEAMVRRLTREYVDRFVESG
KADLVDEMLWEVPLTVALHFLGVPEEDMATMRKYSIAHTVNTWGRPAPEEQVAVAEAV
GRFWQYAGTVLEKMRQDPSGHGWMPYGIRKQREMPDVVTDSYLHSMMMAGIVAAHETT
ANASANAFKLLLENRAVWEEICADPSLIPNAVEECLRHSGSVAAWRRVATADTRIGDV
DIPAGAKLLVVNASANHDERHFERPDEFDIRRPNSSDHLTFGYGSHQCMGKNLARMEM
QIFLEELTTRLPHMELVPDQEFTYLPNTSFRGPDHVWVQWDPQANPERTDPAVLHRHQ
PVTIGEPAARAVSRTVTVERLDRIADDVLRLVLRDAGGKTLPTWTPGAHIDLDLGALS
RQYSLCGAPDAPSYEIAVHLDPESRGGSRYIHEQLEVGSPLRMRGPRNHFALDPGAEH
YVFVAGGIGITPVLAMADHARARGWSYELHYCGRNRSGMAYLERVAGHGDRAALHVSE
EGTRIDLAALLAEPAPGVQIYACGPGRLLAGLEDASRNWPDGALHVEHFTSSLAALDP
DVEHAFDLELRDSGLTVRVEPTQTVLDALRANNIDVPSDCEEGLCGSCEVAVLDGEVD
HRDTVLTKAERAANRQMMTCCSRACGDRLALRL
117 Family
CYP117A1 Bradyrhizobium japonicum
GenEMBL L02323 L12971 U12678 (11,715bp)
NC_004463 complete genome 2321653..2322996
Tully,R.E. and Keister,D.L.
Cloning and mutagenesis of a cytochrome P-450 locus from
Bradyrhizobium japonicum that is expressed anaerobically and
symbiotically
Appl. Environ. Microbiol. 59, 4136-4142 (1993)
Note: called BJ-4 see CYP112, CYP114, CYP115P
CYP117A2 Rhizobium sp. NGR234 plasmid pNGR234a
GenEMBL AE000082 complement(7357..8700) U00090
NC_000914 complement(229666..231009) gene = y4kV
Freiberg,C., Fellay,R., Bairoch,A., Broughton,W.J., Rosenthal,A.
and Perret,X.
Molecular basis of symbiosis between Rhizobium and legumes
Nature 387 (6631), 394-401 (1997)
about 90% identical to 117A1
MNVLLNPLNRRHRLRYDIPVMPGAFPLVGHLPAIVCDLPRLLRR
AERTLGSHFWLDFGPAGHLMTCVDPHAFALLRHKDVSSALIEEIAPELLGGTLVAQDG
GAHRQARDAIKAAFLPEGLTQAGIGDLFAPVIRARVQAWRDRGDVTILPETGDLMLKL
IFTLMGVPAQDLPGWHRKYRQLLQLIVAPSVDLPGLPLRRGRAARDWIDAQLRQFVRD
ARAHAARTGLINDMVSAFDRSDDALSDDLLVANIRLLLLAGHDTTASTMAWMVIELAR
QPMLWDALVEEAQRVGAVPTRHADLEQCPVAEALFRETLRVHPATTLLPRRALQELQL
GQRRIPAGTHLCIPLLHFSTSALLHEAPDQFRLARWLQRTEPIRPVDMLQFGTGPHVC
IGYHLVWLELVQFSIALALTMHKAGVRPLLLSGVEKGRRYYPTAHPSMTIRIGFS
CYP117A3 Mesorhizobium loti
GenPept NP_106891
NC_002678 complete genome 5191629..5192972
locus_tag = mlr6367
94% to 117A2
1 MDMLLNPLDR RHRLRDDIPV VPGAFPLVGH LPAIVCDLPR LLRRAERTLG SHFWLDFGPA
61 GHLMTCVDPD AFALLRHKDV SSALIEEIAP ELLGGTLVAQ DGGAHRQARD AIKAAFLPKG
121 LTQAGIGNLF APVIQARVQA WRDRGDVTIL RETGDLMLKL IFSLMGIPAQ DLPGWHRKYR
181 QLLQLIVAPP VDLPGLPLRR GRAARDWIDA QLRQFVRDAR AHAARTGLIN DMVSSFDRGD
241 DALSDDVLVA NIRLLLLAGH DTTASTMAWM VIELARQPGL WDALVEEAQR VGAVPTRHAD
301 LAQCPVAEAL FRETLRVHPA TTLLPRRALQ ELQLGQRRIP AGTPLCIPLL HFSTSALLHE
361 APDQFRLARW LQRTEPIRPV DMLQFGTGPH VCIGYHLVWL EMVQFCIALA LTMHKAGVRP
421 RLLSAVEKGR RYFPTAHPSM KIRIGFS
CYP117A3v2 Mesorhizobium loti
GenEMBL AL672112 complement(81551..82888)
Strain R7A symbiosis island
Gene = msi068
2 DIFFS with CYP117A3v1
CYP117A4 Rhizobium etli symbiotic plasmid p42d
NC_004041 59081..60424
85% to 117A2
gene = cpxP4
MDMLLNPLNRWRRLRDDIPVMPGAFPLVGHLPAIVCDLPRLLRR
AERTLGSHFWLDFGPAGHLMTCLDPDALALLRHKEVSSALIEEMAPDILGGTLVTLDG
SAHRQARDGIKAAFLPRGLTEAGIGELFEPIIRAQVKAWRDRGEVAILPDTRNLMLKL
TFSLMGIPAQDLSEWHRKYRQLLQLMVAPPIDLPGMPLRRGRAARDWIDAQSRQFIRD
ARARAARTGLINDMVSAFDCSDGALSDDVLVANIRLLLLAGHETSASTIAWMVIELAQ
HPELWDALVEEAQRVGAVPTGHEDLAQCPVAEALFRETLRMHPASSLVPRRAMQELQL
GQRRIPSGTHLCIPLLHFSTSPLLHEAPDQFRLGRWLQRTEPIRPVDMLQFGAGPHVC
MGYHLVWLELVQFSIALALTMQEAGVRPRLMSGVEKGRRYYPTAHPSMTVRIGFS
118 Family
CYP118P1 Mycobacterium leprae
GenEMBL L04666 (40,123bp)
Smith,D.R.
M. leprae cosmid dna sequence
Unpublished (1992)
Note 15,700 to 17,350 is the region of interest
CYP118P1 Mycobacterium leprae
GenPept CAC31116
NC_002677 547312..547788 locus_tag = ML0447
NC_002677 complement(2562932..2563627) locus_tag = ML2159
(a duplication of the seq.)
putative fatty oxidation complex alpha subunit
Sequence below is from TIGR primary nucleotide sequence for ML2159
CYP118 exact match, 49% to 102C1
4 TASQHDDILDIMLYSADPSTGEQLDTDNVVNQILTLLVSGSQTLANAIAFALHYLLSIHH 183
184 DIAAQTRREIYQNRSDRGIANVSY
258 FGDVVKLRCLRRVVDATLRLWS
VPCYLRQARRD 360
361 TTLGNGTSLFHKGQWVIVLLTAPMPG
WGPDANEFNPDRXXXXXXXXXXXXXXXX 470
520 FGTGLRTCIGRRFALHEMALELTMIVHQYILSRADPG
YCLSISEAFTLKTVGL 677
119 Family
CYP119A1 Sulfolobus solfataricus (an archaebacterium)
GenEMBL U51337 (1254bp)
Wright, R.L., Harris, K., Solow, B., White, R.H. and Kennelly, P.J.
Cloning of a potential cytochrome P450 from the Archaeon Sulfolobus
solfataricus.
FEBS. Lett. 384, 235-239 (1996)
CYP119A2 Sulfolobus tokodaii
GenPept BAB66184
64% to CYP119A1 U51337 Sulfolobus solfataricus
1 MYDWFKQMRK ESPVYYDGKV WNLFKYEDCK MVLNDHKRFS SNLTGYNDKL EMLRSGKVFF
61 DIPTRYTMLT SDPPLHDELR NLTADAFNPS NLPVDFVREV TVKLLSELDE EFDVIESFAI
121 PLPILVISKM LGINPDVKKV KDWSDLVALR LGRADEIFSI GRKYLELISF SKKELDSRKG
181 KEIVDLTGKI ANSNLSELEK EGYFILLMIA GNETTTNLIG NAIEDFTLYN SWDYVREKGA
241 LKAVEEALRF SPPVMRTIRV TKEKVKIRDQ VIDEGELVRV WIASANRDEE VFKDPDSFIP
301 DRTPNPHLSF GSGIHLCLGA PLARLEARIA LEEFAKKFRV KEIVKKEKID NEVLNGYRKL
361 VVRVERA
120 Family
CYP120A1 Synechocystis sp. (strain PCC6803) Cyanobacterium
GenEMBL D64003(113064bp)
coding region 62160-63494
Kaneko,T., Tanaka,A., Sato,S., Kotani,H., Sazuka,T., Miyajima,N.,
Sugiura,M. and Tabata,S.
Sequence analysis of the genome of the unicellular cyanobacterium
Synechocystis sp. strain PCC6803. I. sequence features in the 1Mb
region from map positions 64% to 92% of the genome
DNA Res. 2,153-166 (1995)
note: gene slr0574 (previously had incorrect gene identifier here)
NT01NS3472 Nostoc sp. PCC 7120 in TIGR not in Genbank
40% to CYP120 aa 399-443
MEMKIVAAHLLRRYHWEILPNQSLDSVLVPTNQPQDGLRVRFQPL
CYP120A2 Trichodesmium erythraeum
NZ_AABK02000021
complement(1844..2800) gene = Tery2088
318aa (short) 57% to 120A1 (missing N-term 127aa)
MTANYLEKWVEMGTLTWYPEIRNYTFDIASLLFMGSDESSQTKL
VSLFEEWVKGLFSIPLSLPWTRFGKSLRCRQKLLQHIEEIILQRQQQQNLGEDALGIL
LQAQDKEVNGLSLDELKDQILLLLFAGHETLTSAIASFCLLTSQHLDVLTRLRQEQKQ
FSAIEPLTLENLKRMTYLDMVLKEVLRLIPPVGGGFRQVTQDCEFCGYSIPKGWLVQY
QIAKTHQDETLYPDDKNFDPERFAPENAVDKQKVFGYVPFGGGMRECLGKEFARLEMK
IFAVMLLRGYEWELLPEQDLSVVAAPTPYPRDGLKVKFRKVE
CYP120B1 Nostoc punctiforme
NZ_AAAY02000018.1
complement(62382..63695) gene = Npun4299
43% TO CYP120A1
MKTNQIPPGSFGLPVLGETLSFVFDRDFAKKRYHQYGPIFKTHL
LGRPTVVMAGPEALEFVLSSHIENFSWREGWPDNFKTLLGESLFLQDGEEHRRNRRLM
MPALHGPALASYFSTMEDITRSYLQKWEKKQEFTWFQEFKQLTFDIASQLFLGTRPGP
ECVRLSQLFTTLTNGLLAINPLPLPFTTFGKAIAARNEILEHLTQVVRERQQNPTQDT
ISLLIKAKDEDGNSLSEKEIIAQAVLLLFAGHETTTSMLTWLCTELACHPEVLEKARV
EQLQLASQGDLDLEQLGKMPYLEQVLWEVERLHQPVGGGFRGVIKDFELNGYHVPTGW
QLYYSIGVTHQIEEIYSEPELFDPDRFSPQRQEHKKYPFSLVGFGGGPRICIGIAFAK
MEMKIVAAHLLRSYHWEILPNQSLEVVAVPTNRPKDGLRVRFQPR
CYP120C1 Nostoc punctiforme
NZ_AAAY02000127.1 GenPept ZP_00106106.1
8154-9512 gene = Npun477
44% to 120B1 36% to CYP120A1
MQQLKSAEEIPGSYGLPILGETLEIFRDSELYLWRRFQQYGSVF
KTSVLGRKRAYLIGPSANRLVLVEQAENMSSRIGWYFLESTFGNNILLQDGEEHRLTR
RLMYPAFHGKAIATYFDTIQNIVQDFLKDWGERGTISLNSSFRQLTLMIATRLFLGSQ
NKSEVEQTSQWFTQLLDSSMAIFKWNVPFTLYGRGQNARGKLVAFLREAIAQRIEQGN
LEESKDVLGLLLAAVDEDGNKLSETQVINEALLLLFAGHETTASLLTWVIFELGNHPE
WRERLRQEQLAVVGNNPLSLSHLKQFPQLTNVLKEAERLYPPVYAYNRGVLKDIEYGG
YRIPAGWFVTISPMLTHRLPELYTEPDRFDPDRFAPPREEDKKHPLALMGFGYGSHSC
LGMEFAQMEMKIVLSTLLRHYDWTVKPDYSAIAPVRQPSKVKDILQAYIEPLLIKHPL
DS
121 Family
CYP121A1 Mycobacterium tuberculosis
GenEMBL Z77163 (42861bp) gi 1449344 Rv2276
complement (32358 to 33548)
unpublished
CYP121A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 2526703..2527893
Gene = cyp121 100% match
locus_tag = Mb2299
122 Family
CYP122A1 Streptomyces sp.
GenEMBL U65940( 2500bp)
nearly identical to rapJ gene of St. hygroscopicus involved in
rapamycin biosynthesis
CYP122A2 Streptomyces hygroscopicus
GenEMBL X86780 (107379bp)
coding region 96465-97625
rapJ
CYP122A3 Streptomyces hygroscopicus var.
GenEMBL AF235504 CDS 71460..72626
gene="fkbD"
note="C9 hydroxylase" 89% to 122A1 77% to 122A2
123 Family
CYP123 Mycobacterium tuberculosis
GenEMBL Z80226 (34809bp) gi 1550644 Rv0766c
complement (8322-9530)
CYP123 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(861053..862261)
Gene = cyp123 100% match
locus_tag = Mb0789c
124 Family
CYP124A1 Mycobacterium tuberculosis
GenEMBL Z77163 (42861bp) gi 1449354 Rv2266
complement (39907-41193)
CYP124A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 2519058..2520344
Gene = cyp124 100% match
locus_tag = Mb2289
CYP124B1 Streptomyces cinnamonensis
GenEMBL AF440781 93981..95273
polyether antibiotic monensin biosynthesis gene cluster
41% to CYP124
gene = monD
MGLTVGPDNAKRGIVPITDSKPAATFPDLVDPSFWARPHAERVA
LFEEMRGLPRPAFIRQNMPGVPWTFGYHALVKYADIVEVSRRPQDFSSNGATTIIGLP
PELDEYYGSMINMDNPEHSRLRRIVSRSFGRNMIPEFEAVATRTARRIIDELIARGPG
DFIRPVAAEMPIAVLSDMMGIPAEDHDFLFDRSNTIVGPLDPDYVPDRADSERAVIEA
SRELGDYIAGLRAERLAAPGNDLITKLVQVQADGEQLTRQELVSFFILLVIAGMETTR
NAISHALVLLTEHPEQKQLLLSDFDTHAPNAVEEILRVSTPINWMRRVATRDCDMNGH
RFRRGDRIFLFYWSGNRDESVFPDPYRFDITRGTNAHVTFGAVGPHVCLGAHLARMEI
TVLYRELLAALPQIHAVGQPRRLDSSFIEGIKHLHCAF
CYP124B2 Streptomyces nanchangensis NS3226
GenEMBL AF521085 complement(100196..101467)
polyether ionophore nanchangmycin biosynthetic gene cluster
41% to CYP124
gene = nanP
MNRGVVSPTEATPASSAKATRPPDFMDPSFWLRPRDERAEVFEK
LRALPGPEFVPPRLPWGPLASGYYALSKHADICEVSRRPQDFSSEGATAILPPEMDEF
YGSMINMDNPEHSRLRRIVARSFGRGMAPKFDAMSRRVARRIVDELIERGPGDFIRPA
AEMPIAVLSTMMGIPGEDYEFLFERTNTIMGGADPELAADPEKMAAAVLGALRDLGDY
IGRLREDRLARPGPDVITKLVQVQEDGEQLTNQELVSFFILLINAGMETTRNVIAQAL
VLLTEHPDQRQLLLSDFELHAKGAVEEILRVGTPINWMRRTATGDCEMNGHRFRKGDE
IFLFYWSANHDEKVFEDAYRFDITRDPNPHLSFGAVGPHFCLGAHLARIEIIAMLREL
LASLPDIRVEGEPVRLASSFIEGFKELSCTF
125 Family
CYP125A1 Mycobacterium tuberculosis
GenEMBL Z82098 (34154bp) gi 1666115 Rv3545c also AD000003
coding region 8135-9436
CYP125A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(3927359..3928660)
Gene = cyp125A1 100% match
locus_tag =Mb3575c
CYP125A2 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV5841 57% to CYP125A1 from Mycobacterium tuberculosis
CYP125A3P Mycobacterium leprae
GenPept CAC30983
NC_002677 2415021..2416227
locus_tag="ML2024
Sequence below is from TIGR primary nucleotide sequence for ML2024
51% to CYP125A1 Rv3545c Z82098 Mycobacterium tuberculosis
1 PGFDFPDPEIYTEQLSV*EPAEMCQAETI**NEQPIGRSGFYDDDY 138
XXXXXXXXXXXXXX
174 HSGTFSNLEKTALACYQEGMNDEQISRGKLVLLNIDASQYTRLHKIISPGFIP*AAEQLR 353
354 DDLXXXXXXXXXXXXXXX 362
410 SGDFVEHVSCELSRQAAIAGLPSG 481
480 VPQEDCKKLFHWSN 521
522 QTVGAQDPKFATNDPMVTSVKLIM*AMQIAADRAKPLGQVIVTNLVEADIEGHKLSKDEFGSF 710
713 VIMLTAAGKENTRNCIMQSMMQFTNFPD*WELYK 814
816 KKAPGTTADKIIRQATLVMS 875
876 FQRTVLK*YELSSVSIKKGQRVVVIYRSANFDEKVLTIRLPCSIMRNPT 1022
1022 PHAGFNDTNVHYCIGIN 1072
1073 LARMTIDRMFHAIAESMPNL*STGKPK*LRSGWLNGVKHWQVD 1201
CYP125A4 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
75% to 125A2 before frameshifted region
clone name SP0266
126 Family
CYP126A1 Mycobacterium tuberculosis
GenEMBL Z80226 (34809bp) gi 1550656 Rv0778
coding region 20888-22132
CYP126A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 873620..874864
Gene = cyp126 100% match
locus_tag = Mb0801
CYP126A2P Mycobacterium leprae
GenPept CAC31567
NC_002677 complement(1384839..1385327)
locus_tag="ML1185
Sequence below is from TIGR primary nucleotide sequence for ML1185
37% to CYP126 C-terminal
184 DRSLIPSAIEEGSRSETPNWASVTRITIA*LAIGGKTILPNAGVDILMGSANRDGSRWTE 363
364 PNTFDIHWPRQAHTTLAGSHMCLGIGLAQLDTRVMLNNLFD 486
127 Family
CYP127A1 Rhizobium sp.
GenEMBL Z68203(34010bp)
coding region 29431-30675
also AE000101 Rhizobium sp. NGR234
CYP127A2 Rhizobium sp. BR816
No accession number
Ellen Luyten
Submitted to nomenclature committee 4/12/2000
73% identical to CYP127A1
CYP127A3v1 Mesorhizobium loti
GenPept NP_106463
NC_002678 complete genome 4745586..4746803
79% to 127A2
1 MAINPVPDHV PPEMVRDFSL FTSPGMPPTP NGDPHAAVAC AHDGPPIFYS PYNTQDGRGT
61 WVITRAADQR KVLQDTETFS SHRSIFSSIL GETWPTIPLE LDPPAHGAFR SLLSPLLSPK
121 RVTALEPAVR ERAIALIDRI TASATSCDVM KDFAFPFTVS IFLRFLGLPD QGLDTFVGWA
181 KDLLHGDDVE RPVAARKIVA FIDELATNRR KDPVDDLMTF IVQAQIEGRR LTDGEIRGIG
241 VLVFVAGLDT VAAAIGFDLA YLARNLKDQE LLRSEPARIL LATEELLRAY PPIQLIRVAT
301 KDIDFEGAPI RKGDYVSCAT MIANRDPEEF ESPNTVDLAR DHNRHAAFGY GPHRCLGSHL
361 ARREIVIGLE EWLARIPTFR IKEGTAPITC GGHVFGIENL ILDWS
CYP127A3v2 Mesorhizobium loti
GenEMBL AL672114 complement(100678..101895)
Strain R7A symbiosis island
Gene = msi332
2 DIFFS with CYP127A3v1
CYP127A4 Rhizobium etli symbiotic plasmid p42d
NC_004041 97484..98974
81% to 127A2
gene = cpxA5
MHLCSERIYRKRGTRENPMSTGRAGEASKKFRLRPTKQRGFRAA
RRSDRCIACHWRLALLRLEIWRSTILLAPSPRRIRSRRRGFDDRRKAVATIRVPEHVP
PEMVKDFSLFTSPGMERMPNGDPHAAVACLHNGPRIFYSPCNTRDGRGTWVIVRAQDQ
RKLLQDTGTFSSHRSLFASALGENWPLIPLELDPPAHSVFRSLLNPLLSPRRIMELEP
AVRDRAIALISKISASSTSCDILTDFAFPFAVSIFLRLLGLSDERLNTFVGWGKDLLH
GDGIRRTAAARTILAFIDELAAMRRKEPADDFMTFVVQAKVDGRLLRDQEIHGIGVLL
FVAGLDTVATAIGFDLAYLARNPTEQELLRSKPDRIVLAAEELLRAYSTVQMIRVATK
DINFEGAPIRKGDYISCATMIANRDPVEFENPNTIDLAREDNRHTAFAYGPHRCLGSH
LARREIIIGLEEWLSRIPDFRIKDGTAPITYGGHVFGMENLILDWS
128 Family
CYP128A1 Mycobacterium tuberculosis
GenEMBL Z77163 (42861bp) gi 1449352 Rv2268c
coding region 37021-38490
CYP128A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(2521761..2523230)
Gene = cyp128 100% match
locus_tag = Mb2291c
129 Family
CYP129A1 Steptomyces sp.
GenEMBL U50973(3196bp)
Dickens,M.L. and Strohl,W.R.
Isolation and characterization of a gene from Streptomyces sp.
strain C5 that confers the ability to convert daunomycin to
doxorubicin on Streptomyces lividans TK24
J. Bacteriol. 178, 3389-3395 (1996)
gene name doxA
CYP129A2 Streptomyces peucetius
GenEMBL U77891 CDS comp (83..1330)
gene="doxA"
product="daunorubicin C-14 hydroxylase" 94% to 129A1
130 Family
CYP130A1 Mycobacterium tuberculosis
GenEMBL Z77137 (36096bp) gi 1480330 Rv1256c
coding region 30691-31908 cy50.26
CYP130A1X Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome
CYP130 lies in a deletion in M. bovis
CYP130A2P Mycobacterium leprae strain TN
GenPept AL583920.1
59% to CYP130A1
VMSHRFRFTTADIWPNPWSMYRTLRDHEAVHHVVPANQPEDDYYVLPRHADVWSMAMRS
HAKLSSAQRLTVNYSDMELIGLQDNPPMVMQDQPV*TKCRKLVSRRFTPRQTNVVEPKVR
HFVVEHIEQLRAKGSVDIVTELFKPLPPMVVAHYFGFPEKVRSQFDGW
TTAADGGGALFRFPRKSPITIRRLAPAIVAANTADAGGITNELDVAGYAVESMLAYFTR
IATGGNNTVTGMLGG*MPL
SHRRKQHRHWHARRLDAVKDTAEAD
LLRLTSSVRGLMRTTTRDVAIGHTTVSPGRRVLMRYGQAKRDER*YSAAAS*LDVTW*
PPNILIFSHGAH
YLGAKVTRMQRR
VRLTELLARYPDFEVDESSIAWAGGKLHTTP
131 Family
CYP131A1 Streptomyces peucetius
GenEMBL L47164(3444bp)
coding region 32-1348
gene dnrQ duanosamine biosynthesis
possible sequence errors at C-terminal (no recognizable signature sequence
in the last 68 amino acids)
CYP131A2 Streptomyces sp.
GenEMBL L35154 (4134bp)
3838-4134 N-terminal fragment 94% identical to L47164
gene dauQ daunomycin biosynthesis
132 Family
CYP132A1 Mycobacterium tuberculosis
GenEMBL Z80108 (40778bp) gi 1542902 Rv1394c
complement (9842-11227)
most often matches CYP4 family in blast search
CYP132A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(1566263..1567648)
Gene = cyp132 1 aa diff
locus_tag = Mb1429c
133 Family
CYP133A1 Erwinia herbicola
Randy S. Fischer, Roy A. Jensen
First P450 from an enteric bacteria (similar to E. coli)
submitted to nomenclature committee
CYP133B1v1 Xylella fastidiosa, section 35 of 22.
AE003889 CDS complement(3751..4959)
82% to AE003887 48% to CYP133A1
CYP133B1v1 Xylella fastidiosa 9a5c
GenPept AAF83187
100% match
1 MKLTDLSNPA FLENPYPLYE TLRAQAPFVS IGPNALMTGR YSLVDSLLHN RNMGKKYMES
61 MRVRYGDSAA DMPLFQAFSR MFITINPPAH THLRGLVMQA FTGRESESMR PLAIDTAHQL
121 IDNFEQKPSV DLVAEFAFPF PMQIICKMMD VDIGDAVTLG IAVSKIAKVF DPSPMSADEL
181 VHASTAYEEL AQYFTKLIEL RRTHPGTDLI SMFLRAEEDG EKLTHDEIVS NVIMLLIAGY
241 ETTSNMIGNA LIALHRHPEQ LALLKSDLSL MPQAVSECLR YDGSVQFTMR AAMDDIEVEG
301 ELVPRGTVVF LMLGAANRDP AQFTHPDQLD ITRKQGRLQS FGAGIHHCLG YRLALIELEC
361 ALTTLFERLP HLRLAHLDAL NWNQRSNLRG VNTLIVDLHA KN
CYP133B1v2 Xylella fastidiosa Temecula1
GenPept AAO29526
6 diffs to CYP133B1v1
1 MKLTDLSNPA FLENPYPLYE TLRAQAPFVS IGPNALMTGR YSLVDSLLHN RNMGKKYIES
61 IRLRYGDTAA DMPLFQAFSR MFITINPPAH THLRGLVMQA FTGRESESMR PLAIDTAHQL
121 IDNFEQKPSV DLVAEFAFPF PMQIICKMMD VDIGDAVTLG MAVSKIAKVL DPSPMSADEL
181 VHASTAYEEL AQYFTKLIEL RRTHPGTDLI SMFLRAEEDG EKLTHDEIVS NVIMLLIAGY
241 ETTSNMIGNA LIALHRHPEQ LALLKSDLSL MPQAVSECLR YDGSVQFTMR AAMDDIEVEG
301 ELVPRGTVVF LMLGAANRDP AQFTHPDQLD ITRKQGRLQS FGAGIHHCLG YRLALIELEC
361 ALTTLFERLP HLRLAHLDAL NWNQRSNLRG VNTLIVDLHA KN
CYP133B1v3 Xylella fastidiosa Dixon
NZ_AAAL01000071 complement(6849..8057)
98% to 133B1v1 7 diffs 97% TO 133B1v2 11 diffs
gene = XfasA0474
MKLTDLSNPAILENPYPLYETLRAQAPFVSIGPNALMTGRYSLV
DSLLHNRNMGKNYMESMRVRYGDSAADMPLFQAFNRMFITINPPAHTHLRGLVMQAFT
GRESESMRPLVIDTAHQLIDNFEQKPSVDLVAEFAFPFPMQIICKMMDVDIGDAVTLG
MAVSKIAKVFDPSPMSADELVHASTAYEELAQYFTKLIELRRTHPGTDLISMFLRAEE
DGEKLTHDEIVSNVIMLLIAGYETTSNMIGNALIALHRHPEQLTLLKSDLSLMPQAVS
ECLRYDGSVQFTMRAAMDDIEVEGELVPRGTVVFLMLGAANRDPAQFTHPDQLDITRK
QGRLQSFGAGIHHCLGYRLALIELECALTALFERLPHLRLAHLDALNWNQRSNLRGVN
TLIVDLHAKN
CYP133B2v1 Xylella fastidiosa, section 33 of 22
AE003887 CDS 6723..7925
82% to AE003889 48% to CYP133A1
CYP133B2v2 Xylella fastidiosa Ann-1
NZ_AAAM01000051 complement(2764..3966)
97% TO 133B2v1 8 diffs
gene = XfasO1476
MKLADLSSPAFLENPYPLYETLRRQGPFVSIGPNALMTGRYSIV
DGLLHNRNMGKSYMESIRVRYGDDALDMPLFQGFNRMFLMLNPPVHTHLRGLVMQAFT
GRESESMRPLATDTAHRLIDDFEQKSSVDLVTEFSFPLPMRIICRMMDVDISDAISLS
VAVSNIAKVFDPAPMSPDELVHASAAYEELAHYFTRLIELRRAQPGTDLISMLLRAEE
EGQKLTHDEIVSNVILLLLSGYETASNMIGNALIALHRHPKQLARLKSDLSLMPQTVL
ECLRYDGSVQFTVRAAMDDVSIEGDVVPRGTIVFLMLGAANRDPAQFTDPDHLEITRK
QGRLQSFGAGVHHCLGYRLALVELECALTVLLERLPHLRLANLDTLSWNQRGNLRGVN
ALIADLHP
CYP133B2v3 Xylella fastidiosa Dixon
NZ_AAAL01000066 complement(2275..3477)
97% TO 133B2v1 9 diffs 10 diffs to CYP133B2v2
gene = XfasA0420
MKLADLSSPAFLENPYPLYETLRRQGPFVSIGPNALMTGRYSIV
DGLLHNRNMGKSYMESIRVRYGDDALDMPLFQGFNRMFLMLNPPVHTHLRGLVMQAFT
GRESESMRPLAIDTAHRLIDDFEQKSSVDLVTEFSFPLPMRIICRMMHVDISDAISLS
VAVSNLAKVLDPAPMSPDELVHASAAYEELAHYFTRLIELRRAQPGTDLISMLLRAEE
EGQKLTHDEIVSNVILLLLGGYETTSNMIGNALIALHRHPKQLARLKSDLSLMPQAVL
ECLRYDGSVQFTIRAAIDDVSIEGDVVPRGTIVFLMLGAANRDPVQFTDPDHLEITRK
QGRLQSFGAGVHHCLGYRLALVELECALTVLLERLPHLRLANLDTLSWNQRGNLRGVN
ALIADLHP
CYP133B3 Xanthomonas axonopodis pv. citri str. 306
GenPept AAM38014
56% to 133B2
1 MLLSDLATPQ FRHDPYPTYA RLREEGPLVQ VADGRLMSGR YAVVDRLLSD RRVGRDYLQS
61 VRLRYGEAAV HLPLFQGMSR MFLLLNPPLH TQLRGLMTQA FGARQMESMR EVASDIAAGL
121 IDAFQANGHC DLLTEFAFPL PIAIICRMLD IAAADVTALS HATSALAKVF DPMMTAEELQ
181 ATSVAYDQLA TYFHGVIAQR RSAGGDDLIA RFIQAEDNGR RLSEEEIVSN VILLFFAGHE
241 TTSNMICNAL VALHRHPQQL RLLQETPGLL PNAVLECMRY DSSVQMATRT ALQDFEIEGV
301 AVPRGTMLYL MLGAANHDTL QFTDPQVLDI RRQQGRALSL GGGIHHCLGN RLALIEVEAA
361 LACLLARLPA LRLEQLDTLS WNDRANLRGV DALLASW
CYP133B4 Xanthomonas campestris pv. Campestris str. ATCC 33913
GenPept AAM42318
BioI biotin synthesis
52% to 133B2 65% to 133B3
1 MQLSDFATPA FRQDPYPMYA RLRAAGPLVQ ISDNGWVSGH YTVVDALLSD RRVGRNYLDS
61 IRVRYGANAA EMPLFQGMSR MFLLLNPPVH TQQRALMTKA FGARQLEALR EVAVDTADAL
121 LDQHEDRRSC DLLNDFAMPM TISLICRMLG LAVTDVAALG QASSALAKVF DPLMRPEDMA
181 QATAAYTTLE QYFRAIVLQR RDTQEDDLIA RLIAAEDHGQ RMPVDDIVSN VIMLFTAGHE
241 TTANMICNAL IALHRHPEQL QLLRDTPTLM PNAVLECMRY DSSVQVAMRS VLQPLQVEGT
301 TLPVGAILYL MLGSANHDAE QFTAPQQLDL RRQQGRALSF GGGVHHCLGN RLALIELETA
361 LERLLQRAPA LRLPELDNLS WNERANLRGI QALHATW
CYP133B5 Ralstonia solanacearum GMI1000 megaplasmid
GenEMBL AL646080 77388..78584
gene = RSp0709
77% to CYP133B2v2
MKLADLSTPSFLENPYPLYETLRSQGPFVRIGPNALMTGHYSIV
DALLHNRQMGKSYMESIRLRYGDEGPNMPLFQGFSRMFLMLNPPMHTRLRGLMMQVFN
ARQIESMREVATATAHQLIDDFEQKPSADLVAEFAFPLPVRIICQMMDLDIDDAMALG
VGVSKLAKVFDPAPMSADALVETSAAYEELAQYFTKVIEARRAQPGTDLISMLMRAEE
NGETLTHDEIVSNVILLFIAGHETTSNMIGNALIALHRNPQQLDLLKREPSRMPNAVL
ECLRYDGSVQVTIRAALEDVEVEGEVLPRGTTVFLMLGAANRDPAQFTDPDQLDIGRQ
QGRLQTFGAGIHHCLGYRLALIELESALGALFERLPNLRLTNLDQLSWNQRGNLRGVN
ALMAAW
CYP134A1 Bacillus subtilis
GenEMBL AF017113, Z99121, Z99122
cypB also called cypX
CYP134A2P Bacillus cereus ATCC 14579
GenPept AAP10061
57% to CYP134A1 cypX Z99122 Bacillus subtilis C-term
1 MIGATNCDSN VFERPDKFNV YRPDIDIKKA FSGTARHLAF GLSIYNCVGV AFAKLKIEID
61 STIKDNISRK KLRDIKDFVK KTSKMN
CYP134B1 Photorhabdus luminescens subsp. laumondii TTO1
GenEMBL NC_005126 complete genome complement(313663..314886)
locus_tag = plu0296
46% to CYP134A1
MAKLSSFNIHDPKFIKNPYDFYDILHKQDLVYFEQSQNSYFIGK
YEDVDAILKSSIFNTKPLTALAEPVMGDRVLAQMEGEEHACKRKFIMQGLSRDYFNRY
YEPMIRKITEDLLQPYMEKGNIDIVNDFGRDYAVLVTLSILGLPSDNYRDIAEWHKGI
ASFITQFDQTELEKMHSLECSQKLIRLLKPIIDQRRRNPSKDIISIFCQDTAMSMSEI
TALCLNILLAATEPADKILAMMLNHLISNPSMLDVVLKDRSLVRDAFEETLRLTSPVQ
LIPREASEDVTISGIDIPKGAVVFCMIGAANRDPSVFHKPNEFDLYRRKNTTSPQKAN
RKRHLAFGAGTHACAAAAFSLSQLEVSSNIILDLLHNLRFADHYHYQETGVYTRGPSK
LLLSFDPIASSAIKE
CYP135A1 Mycobacterium tuberculosis
GenEMBL Z96800
Rv0327c
CYP135A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(393726..395075)
Gene = cyp135A1 1 aa diff
locus_tag = Mb0334c
CYP135B1 Mycobacterium tuberculosis
GenEMBL AL021942
Rv0568
CYP135B1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 660693..662111
Gene = cyp135B1 100% match
locus_tag = Mb0583
CYP136A1 Mycobacterium tuberculosis
GenEMBL Z83866
coding region 23158-24636
Rv3059
CYP136A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 3376038..3377516
Gene = cyp136 1 aa diff
locus_tag = Mb3085
CYP136B1 Mycobacterium abscessus
GenPept AAN38721
46% to CYP136A1
1 MDAVEAAQRP GGTMTNHLLA PAHHVKERLS SVIMVPAPHA VDDRWRRWSR DWPVRELAPA
61 PAGSGLKAVR GDAGLPFVGH TLDYIRFGSD FSRERYDRLG SVSWMGAFGT KMVVIAGPDA
121 TREAFTSEAK AFSQDGWSFL IDAFFHRGLM LMSFDEHLMH RRIMQEAFTR PRLTGYVEQV
181 TPCVRSAVPA WPVGPSVRIY PLLKELTLDI ATDVFMGGRG KDESDAVNKA FVATVRAASS
241 LVRAPLPGTR FRAGVQGRRV LEDYFFRHLP AARAGETEDL FAALCQATTE DGERFSDEDV
301 VNHMIFLMMA AHDTSTITTT AVTYFLAKYP QWQEAAAAEA AAIGDGLPDI EALEKMTVID
361 RVIKEALRLL APVPLVMRKT VRDVAIDGYH IPSNTLCAIT PAVNHFDRTI WNDPERFDPS
421 RFDEPRREDQ HHRFAWVPFG GGAHKCIGMQ FGTLEVKAIL HRMLRSFTWK VPENYHVRWD
481 NTSLPIPVDG LPLEMKRR
CYP137A1 Mycobacterium tuberculosis
GenEMBL AL022121
Rv3685c
CYP137A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(4064642..4066072)
Gene = cyp137 1 aa diff
locus_tag = Mb3710c
CYP138A1 Mycobacterium tuberculosis
GenEMBL Z92770
Rv0136
CYP138A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 163556..164881
Gene = cyp138 100% match
locus_tag = Mb0141
CYP138A2P Mycobacterium leprae
GenPept CAC32181 GenEMBL AL583926
NC_002677 complement(3167438..3168451)
locus_tag = ML2648
Sequence below is from TIGR primary nucleotide sequence for ML2648
40% to CYP138
1 NRVAREIVVEVIYGALFGAFEALSGLVPQDTVLGPMGRYSMAPSLIR
439 ITINVIMRAGFGSELDELRRLHPTAATL 522
RWTVERQARCNHDIFMLDSRSTAERLRRRLHGTCMKNH 351
352 VRIFEAEPLWGLRTGLKASLLPHCRLINRITINVIMRAGFGSELDELRRLHPTAA 516
517 TLVGLF*LLSQHLGVLADPSSMGATMPGDDPAPALRQATIPG
638 LGVQWTRTVIDFAARRVYSSVYHLSEWAIPREDSILISIAQIYXXXXXXXXXXXX 766
795 DPRRYVEHKPSSFAWI 842
PFSGGT 861
862 SRCVSICQDGDGMNVVLKMVLRYWIIDTTTAPGER*HLRGVVYTPRNGGR 1011
CYP139A1 Mycobacterium tuberculosis
GenEMBL Z95617 GenPept AAK45973 (with 7 more aa at N-term)
Rv1666c
CYP139A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(1877656..1878948)
Gene = cyp139 start codon differs by 6 aa
locus_tag = Mb1694c
CYP139A2P Mycobacterium leprae
GenPept CAC31622 GenEMBL AL583921.1
NC_002677 complement(1474970..1475188) and
complement(1474991..1475161)
locus_tags = ML1237 ML1238
61% to CYP139A1
GAAVATTSMTVILARLASRTRLHLLAHYTHRVRARNFAALIP*LSLTVEVINSMPTQ
CYP140A1 Mycobacterium tuberculosis
GenEMBL Z97193
Rv1880c
CYP140A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(2120751..2122067)
Gene = cyp140A1 100% match
locus_tag = Mb1912c
CYP140A2 Mycobacterium ulcerans
No accession number
Pam Small
Submitted to nomenclature committee 10/17/2003
62% to CYP140A1
CYP140A3P Mycobacterium leprae TN
GenEMBL L01095.1
48% to 140A1
VRQRLHWFAQYGFIRGIAATHH
RRSDPLARLDIALAIKANPVP
YCHKPRPRRPLVQSRISYLTANRAITHELLQSEDFHVFWLNVTLPAPSHWL
RRRTGYRTSSQYNL
LHPLLAIQ*AYHIHYRKTVSPLFAPKAVATLRDRIEQTTLALLDQLAHQHDVVDVVNRY
CSQLPVAVISDILGYP
VPDRDRSHILKFGELVAPSLDVELT*Q*YQQA*REVAGFNFWL
LKHLPQLQRTPGDNLVRHLSH*EDNKPTEISLSKSKLQAISG
GLVLATGGETTVNLLGRGI
LLLDTPEHMVMLQACPEPGHKRG*EILRLDSPIQMAARVARKDVDLAGSTIKRSQVVVLY
FGRSQPGPVRLCRSR*VQHRTPQCGKESRIFR*QEFCLENALTRAYNAVGLRAFFDHLP*
TRAAGTRSRLDTRVLRGWSTLPIALGPTRSMVS
CYP140A4 Mycobacterium avium subsp. paratuberculosis
GenEMBL AJ250018 complement(2795..>3145)
59% to 140A1 runs off end
GAAARQSRPVGPRRSRRSCDRQPGPDDRAHAPPATSTSAPAMVG
LVPRRARNRDPKVFSDPTTFDVTRPNAREHLAFASGIHACLGAALARIEGATCARSFE
NFPDRSSRARNGGR
CYP141A1 Mycobacterium tuberculosis
GenEMBL Z95150
Rv3121
cosmid cY164 from Sanger Centre
coding region 29289-30488
CYP141A1P Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 3441289-3441483
aa 337-400 (first part of gene is in a deletion)
IAFGYGPHACPASAYSRMCLTTFFTSLTQRFPQLQLARPFEDLERRGKGLHSVGIKELLVTWPT*
CYP142A1 Mycobacterium tuberculosis
GenEMBL
Rv3518c
CYP142A1aP Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(3898119..3898736)
gene = CYP142A1a aa 1-197 100%
locus_tag = Mb3548c
In Mycobacterium bovis, a frameshift due to a single base
deletion (c-*) splits cyp142 into 2 parts (pseudogene)
CYP142A1bP Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(3897541..3898122)
gene = CYP142A1b aa 207-end 100%
locus_tag = Mb3547c
In Mycobacterium bovis, a frameshift due to a single base
deletion (c-*) splits cyp142 into 2 parts (pseudogene)
CYP143A1 Mycobacterium tuberculosis
GenEMBL AL022021
Rv1785c
CYP143A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(2013905..2015086)
Gene = cyp143A1 100% match
locus_tag = Mb1813c
CYP143A2P Mycobacterium leprae
GenPept CAC30494
NC_002677 1861010..1862160
locus_tag = ML1542
P450 pseudogene
Sequence below is from TIGR primary nucleotide sequence for ML1542
55% to CYP143 Mycobacterium tuberculosis Rv1785c
1 MSTSAKANPTHFTYCSLNYSALSMITDRGVIWKTLXX 105
113 AKPVVFMNG*YYLNVSRKCILHTTSITKGFSSREAXXX 217
225 PGNALPVLPXXXXXXXXXXXXXXXXXXX 251
278 SLNNLNKALPALRTYTVTMANAITSRGEW 364
366 EAMTDFANX 389
391 LFPLQLFLVL*GLXX 429
434 AQDRDHLIALLKDVVIGMSDKPFLSQADIADQGELCEYLVDTIAERKQNPA 586
585 PDVLSQVLIGEDPLSEIKVLDLESL 659
659 MLILAELDTVTATVGFSLLQPACRQQLRTMLRDKPKQIRILIED 790
792 ILQLEPPAQITPYITTEFVNVDGMTLSPGSRVRLC 896
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
993 GSHLARLKLTLAVDEWLINI 1052
XXXXXXXXXXXXXXXXXX
1116 LFALKALALHW 1148
CYP144A1 Mycobacterium tuberculosis
GenEMBL Z97345
Rv1777
CYP144A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 2001114..2002418
Gene = cyp144 1 aa diff
locus_tag = Mb1806
CYP145 Nocardioides sp.
GenEMBL AB000735
gene for 2-carboxybenzal
CYP146 Amycolatopsis orientalis
GenEMBL AJ223998
cosmid PCZA361 (gene 2 of 2)
CYP147A1 Myxococcus xanthus Partial missing C-term
GenEMBL AF111947 CDS 1939..>2877
42% to AF087022 partial new family
CYP147B1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV584 50% to 147A1 from Myxococcus xanthus
(147A1 is missing C-term so could be higher % identity)
CYP147C1 Streptomyces tubercidicus strain I-1529
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name CypEA
50% to 147A1
CYP147D1 Magnetospirillum magnetotacticum
GenEMBL NZ_AAAP01002628
complement(814..1824) gene = Magn3224
N-terminal is about 61 aa short
51% to 147B1 44% to 147C1
MCAPGPGRDPQCGTGRSSSGDPPDHDRLRGQVMRCFTPQRVRGM
REKTRRITDDLIAKMAGKTRIDLVDDFSYPLPVTVICELLGVPPEDEAQFHGWATQLA
TALEPNQRGDEETQAKNEVCFNEIADYIQGLIKEKRKNPQEDILSDLATDTDGMNDFD
LIATAVLLLVAGHETTVNLITNGMLTLLRFPEHLERLRAEPETAPRLIEELLRYEPPV
HYRTRLALADIPVAGITIPKDAPVILLLAAANRDPLRFSDPDRFDPDRPDNRHLGFGG
GLHYCVGAPLARIEAEVALVSLVRRLKGLSLTENPPPYRPGASLRGPCHLRLALEEVA
EG
CYP147E1 Methanosarcina barkeri Archaea; Euryarchaeota
GenEMBL NZ_AAAR01001943 4935..6305
52% to 147D1 probable lateral transfer
gene = Meth3340
MYRQGSGPNDRRQTMTQQSLYEQVLDYANRANPYPLYAKLRQTP
ITRQIDGSYVVSTYREIVSLLHDPRIGSDFRMRSA
HDRPSAGLSANQELASKNQAQDEGAETSSSNQGSETEVV
PSFIGLDPPEHDRLRRQATWPFGPPHTPGRVADMEPELILLA
NRQIDTIKGRTSIDIVEDFAYPIPVTMISELLGVPPEDQPRLHALSEAIIEDIDLDPR
QSPEEQKRRQEQSSQTFKELEQYMEVLIEHHRKQPGSDLLSGLITDHGSDGPMAQADL
VSTASLLLIAGHETTVNLITNGMLTLLRHPDVLERLRREPDLVIRLVEEFLRYEPPVQ
ILPNRVALSDITIAGTTIQKGSPVILLLASGSRDPARFHDPEKFDPDRRDNMHLGFGS
GIHYCYGAPLARLETQIALTELVQRLENPRLAHDPPPYRQSATLRGPRHLIVEIDGVK
DWEFHL
CYP147F1 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
55% to 147B1 if 9 aa removed, 54% to 147E1
clone name SP0549
CYP148A1 Deinococcus radiodurans R1
GenEMBL AE002083 CDS 1719..2948
38% to AL049754 complement(10413..11648)
CYP148A1 Deinococcus radiodurans
GenPept AAF12079
GenEMBL NC_001263 2539498..2540727
Gene = DR2538
1 MTASSGSSAP SSGPLLAAVQ GLWSGAALAD PHPIYEQIRG FANADGLVRL PEWNTAFAVG
61 HAATSAVLRS PAARSGEWDH GPSDGGKLLQ HMMLFRNGIP HARLRGLVQK AFTPRVVEEQ
121 RDLVRSLLDE LLSDMARAGG PVDLVAGLSG PLPGRVIMRM LGLRGADEER FLGWSASVAE
181 LLGGADRSPA LLARIEADAR EMRGYFRDLA DELRVSPQPG LLSALAAVED GGERLSGDEL
241 LSNAVLLLAA GHETTSNLIP GGVLALSQQP GAWAALLNHP RHPGVADELL RHVSPVQLDG
301 RMLTEAQTVG ETPLPAGTPV QLLLAAANRD PQVFPDPERL DWDRPNASRH LAFAAGPHYC
361 LGASLARLEI AETFAALAER FPDLRVSAAP HYKANFVLRG PQELWVTLG
CYP149A1 Microcystis aeruginosa
GenEMBL AB036790 CDS complement(779..2254)
gene="mapks"
41% to 107H1 partial seq new family
CYP150A1 Mycobacterium species
GenEMBL AF107046
Pascal Poupin
gene 1
CYP150A2 Mycobacterium smegmatis mc2155
GenEMBL AF107047 1092..2405
Pascal Poupin
gene 2
MTDSTATDPAATTPDFDTVDYFTDQSLVPDPHPYFDHLRSKCPV
VREPHYGVLAITSFEEATTVLKDTETFSSCIAVGGPFPPLPFTPEGDDITGQIEQHRT
QLPMFEHMVTMDPPEHTNARSLLNRLLTPKRLKENEDFMWRLADECLDDFIDDGSCEF
LKQYAKPFSLLVIADLLGVPEEDHDEFRHVLGAPRPGAIVGSLDGDQLAMNPLAWLDD
KFVRYLEDRRKEPRDDVLTALATAKYPDGSTPEVIDVVRSATFLFAAGQETTTKLLSA
SLRVLGDRPDIQQALREDRSRIPTFVEEALRMDAPVKSQFRLAKKTTQLGGVDVPAGT
TLMVCPGAVNRDPVRFEDPHTFSLDRKNVREHIAFGRGVHSCPGGPLARVEGRVSLER
ILDRMADIRIDEEHHGPADNRRYTYEPTYILRGLTDLHIKFEPVR
CYP151A1 Mycobacterium smegmatis
GenEMBL AF102510
Poupin P, Ducrocq V, Hallier-Soulier S, Truffaut N
Cloning and Characterization of the Genes Encoding a Cytochrome
P450 (PipA) Involved in Piperidine and Pyrrolidine Utilization and
Its Regulatory Protein (PipR) in Mycobacterium smegmatis mc2155.
J Bacteriol 181, 3419-3426 1999
CYP151A2 Mycobacterium sp. strain RP1
GenEMBL AJ310142
Pascal Poupin
Submitted to nomenclature committee March 22, 2001
86% identity in 399 aa overlap with CYP151A1
CYP152A1 Bacillus subtilis
GenEMBL AB006424
ybdT gene
this sequence is missing part of the heme signature sequence, but has
PERF and EXXR
CYP152A2 Clostridium acetobutylicum
GenPept AAK81262
YBDT B.subtilis ortholog
59% to 152A1
1 MLLKENTAKD KGIDSTLDLL KEGYLFIKNR ADHYQSDLFE TRLMGQRIIC MTGEEAARIF
61 YDSDKFKRQG AAPKRVQETL LGENAIQTLD GESHLHRKKL FMLLTNQVQQ KRLAELTTEK
121 WEASASKWHT KSIVLFNEAN EILCQVACHW AGVPLMESDI KNRAEDFSSM IDSFGAVGPR
181 HWKGKKARNT IEAWIKEIIE NVRSGRIRAE EGSPLHEIAF YIDVNGQQMP AEMAAIELIN
241 ILRPIVAIST FITFSALALY EHSEYREKLQ SKDIRYLEMF TQEVRRYYPF APFVGARVRK
301 DFLWNNCEFK KEMLVLLDIY GTNHDSRIWQ KPYEFIPDRF RSYKGNLFDF IPQGGGDPSS
361 THRCPGEGIT LEIMKTSLDF LSTKIDFTVP DQDLSYSLSK IPTLPKSGFI IDNINLKL
CYP152B1 Sphingomonas paucimobilis
GenEMBL AB006957
Isamu Matsunaga
this sequence is missing part of the heme signature sequence, but has
PERF and EXXR
CYP152B2 Azotobacter vinelandii
NZ_AAAU02000007 102969-104183
56% to