P450s that have appeared since the 1993 P450 nomenclature update.
Part C covering CYP10 to CYP69
Animal P450 names continue at CYP301 in this file
Lower eukaryote names continue at CYP501 in this file
ALL Drosophila melanogaster P450S ARE NOW INCLUDED
This includes list references that were incomplete and duplications of
sequences that were already in the update. If a sequence is assigned an
accession number that was not in the old update it is included in this list.
Some expressed sequence tags (ESTs) are also included from humans.
This list was last revised on Feb. 4, 2003.
Added all Neurospora P450s see 527A1-553A1
Added CYP51 Trypanosoma brucei
Added Fugu P450s, added CYP505A3 added CYP18 Spodoptera
Added all human genes and pseudogenes
Added 104 Anopheles genes
Revised May 13, 2003 named all remaining Dictyostelium P450 genes
Compiled by David R. Nelson
There are 80 C. elegans P450s listed here, surpassing the known mouse and human
complements. The C. elegans genome is now about 90% finished sequence and 9%
unfinished (one strand or one chemistry). About 1Mb is not cloned at three
telomeres and two internal sites. A special issue of Science appeared in Dec. 98 even though the genome is not completely done. The amount of sequence in the Blast searchable database at Washington Univ. is 140Mb, more than the 100Mb size of the genome. Therefore, we can guess that this set includes all the P450 genes in C. elegans, but the distrubution is not even. Most P450 genes (43 genes) are on chromosome V. see additional info on C. elegans P450s see this list. To see the actual sequences go to the C. elegans sequence file. So far we are missing CYP11A, CYP11B, CYP17, CYP19, CYP21, CYP24 and CYP27A and CYP27B. Does C. elegans make steroids? The present evidence would suggest not. Does C. elegans have
mitochondrial P450s? There is one probable mitochondrial P450 in C. elegans on
cosmid ZK177 named CYP44. It was thought to be incomplete but now a full length sequence
has been assembled from the genomic sequence.
10A Subfamily
11A Subfamily
11B Subfamily
12A Subfamily
12B Subfamily
13A Subfamily
13B Subfamily
14A Subfamily
16A Subfamily
17A Subfamily
18A Subfamily
19A Subfamily
21A Subfamily
22A Subfamily
23A Subfamily
24A Subfamily
25A Subfamily
26A Subfamily
27A Subfamily
27B Subfamily
28A Subfamily
29A Subfamily
30A Subfamily
31A Subfamily
32A Subfamily
33A Subfamily
33B Subfamily
33C Subfamily
33D Subfamily
33E Subfamily
34A Subfamily
35A Subfamily
35B Subfamily
35C Subfamily
35D Subfamily
36A Subfamily
37A Subfamily
37B Subfamily
40A Subfamily
41A Subfamily
42A Subfamily
43A Subfamily
44A Subfamily
45A Subfamily
46A Subfamily
51A Subfamily
52A Subfamily
52B Subfamily
52C Subfamily
52D Subfamily
52E Subfamily
52F Subfamily
53A Subfamily
53B Subfamily
54A Subfamily
55A Subfamily
56A Subfamily
57A Subfamily
58A Subfamily
59A Subfamily
60A Subfamily
60B Subfamily
61A Subfamily
62A Subfamily
63A Subfamily
64A Subfamily
65A Subfamily
66A Subfamily
67A Subfamily
10A Subfamily
CYP10 Lymnaea stagnalis (pond snail)
GenEMBL S46130 (1870bp) PIR JX0225 (545 amino acids)
Teunissen,Y., Geraerts,W.P., van Heerikhuizen,H., Planta,R.J.
and Joosse,J.
Molecular cloning of a cDNA encoding a member of a novel
cytochrome P450 family in the mollusc Lymnaea stagnalis.
J. Biochem. 112, 249-252 (1992)
11A Subfamily
CYP11A1 human
PIR A48733 (239 amino acids)
Matteson, K.J., Chung, B.C., Urdea, M.S. and Miller, W.L.
Study of cholesterol side-chain cleavage (20,22 desmolase)
deficiency causing congenital lipoid adrenal hyperplasia
using bovine-sequence P450scc oligodeoxyribonucleotide
probes.
Endocrinology 118, 1296-1305 (1986)
CYP11A1 rabbit
GenEMBL S59219 (1336bp) PIR A49189 (445 amino acids)
Yang,X., Iwamoto,K., Wang,M., Artwhol,J., Mason,J.I. and
Pang,S.
Inherited congenital adrenal hyperplasia in the rabbit is caused by
a deletion in the gene encoding cytochrome P450 cholesterol side
chain cleavage enzyme.
Endocrinol. 132, 1977-1982 (1993)
CYP11A1 bovine
PIR A42033 (18 amino acids)
Pikuleva, I.A., Lapko, A.G., Chashchin, V.L.
Functional reconstitution of cytochrome P-450-scc with hemin
activated with Woodward's reagent K. Formation of a
hemeprotein cross-link.
J. Biol. Chem. 267, 1438-1442 (1992)
CYP11A1 Sus scrofa (pig)
GenEMBL L34259 (2376bp)
Urban,R.J., Shupnik,M.A. and Bodenburg,Y.H.
Insulin-like growth factor-I increases expression of the porcine
P-450 cholesterol side chain cleavage gene through a GC-rich domain.
J. Biol. Chem. 269, 25761-25769 (1994)
CYP11A1 goat
GenEMBL D50058 (1825bp)
Okuyama,E., Okazaki,T., Furukawa,A., Wu,R.-F. and Ichikawa,Y.
Molecular cloning and nucleotide sequences of cDNA clones of sheep
and goat
adrenolcortical cytochrome P450scc.
unpublished (1995)
CYP11A1 sheep
GenEMBL D50057 (1825bp)
Okuyama,E., Okazaki,T., Furukawa,A., Wu,R.-F. and Ichikawa,Y.
Molecular cloning and nucleotide sequences of cDNA clones of sheep
and goat
adrenolcortical cytochrome P450scc.
unpublished (1995)
CYP11A1 bovine
PIR S29644 (21 amino acids)
Tsujita, M. and Ichikawa, Y.
Substrate-binding region of cytochrome P-450(scc) (P-450
XIA1). Identification and primary structure of the
cholesterol binding region in cytochrome P-450(scc).
Biochim. Biophys. Acta 1161, 124-130 (1993)
Cyp11a1 mouse
GenEMBL J05511
Rice,D.A., Kirkman,M.S., Aitkin,L.D., Mouw,A.R., Schimmer,B.P. and
Parker,K.L.
Analysis of the promoter region of the gene encoding mouse
cholesterol side-chain cleavage enzyme
J. Biol. Chem. 265, 11713-11720 (1990)
CYP11A1 Oncorhynchus mykiss (rainbow trout)
GenEMBL S57305 (1789bp) Swiss Q07217 (514 amino acids)
PIR S32197 (514 amino acids)
Takahashi,M., Tanaka,M., Sakai,N., Adachi,S., Miller,W.L.
and Nagahama,Y.
Rainbow trout ovarian cholesterol side-chain cleavage
cytochrome P450 (P450scc). cDNA cloning and mRNA expression
during oogenesis.
FEBS Lett. 319, 45-48 (1993)
CYP11A1 Fugu rubripes (pufferfish)
No accession number
Scaffold_1630
37515 MARWSVWRSPVVLPLSRMEVPMTGARHSSTMPVARQTYSDSSSFV 37381
37380 RSFNDIPGLWKNGVANLYNFWKLDGFRNLHHIMVQNFNTFGPIYR 37246 (2)
37065 EKIGYYESVNIINPEDAAILFKAEGHYPKRLKVEAWTSYRDYRNRKYGVLLK 36904 (2)
36822 NGEEWRCNRVLLNKEVISPKVLENFVPLLDEVGNDFVVRVHKKIARSGQNKWTTDLSQELFKYALES 36628 (1)
36506 VSSVLYGERLGLFLDYIDPEAQHFIDCISLMFKTTSPML
YIPPALLRKVGAKVWRDHVEAWDGIFNQ 36300 (1 expected) bad boundary
36208 ADRCIQNIYRRLRQETGPSKKYPGVLASLLLRDKLSIEDIKASITELMAGGVDT 36047 (0)
35975 TSITLLWTLYELARHPNLQEELRAEVAAARTESQGDMLEMLKRIPLVKGALKETLR 35808 (2)
35731 LHPVAVSLQRYIAEDIIIQNYHIPAG 35654 (0)
35568 TLVQLGLYAMGRDPKVFFRPEQYQPSRWLRSETHYFKSLGFGFGPRQCLGRRIAEAEMQLFLIH 35377 (0)
35297 MLENFRVEKQRHMEVQSTFELILLPDKPIILTLKPLSS* 35181
CYP11A1 Dasyatis americana (southern stingray)
GenEMBL U63299(4619bp)
Nunez,B.S. and Trant,J.M.
Isolation of the cDNA encoding the interrenal form of cholesterol
side chain cleavage cytochrome P450 of the southern stingray
(Dasyatis americana).
unpublished (1996)
11B Subfamily
CYP11B1 human
GenEMBL J05140 M32863 (1482bp) M32878 (2633bp) M32879 (1155bp)
Mornet,E., Dupont,J., Vitek,A. and White,P.C.
Characterization of two genes encoding human steroid 11-beta-
hydroxylase (P-450-11-beta).
J. Biol. Chem. 264, 20961-20967 (1989)
CYP11B1 human
GenEMBL D16155 (156bp)
Naiki,Y., Shizuta,Y., Kawamoto,T., Yasushiro,M., Miyahara,K.,
Toda,K., Tadao,O. and Imura,H.
A nonsense mutation (TGG(116Arg)-TAG(stop)) in CYP11B1
causes steroid 11beta-hydroxylase deficiency.
J. Clin. Endocrinol. Metab. 77, 1677-1682 (1993)
CYP11B1 human
GenEMBL D10169 D90428 (2085bp)
Kawamoto,T., Mitsuuchi,Y., Toda,K., Yokoyama,Y., Miyahara,K.,
Miura,S., Ohnishi,T., Ichikawa,Y., Nakao,K., Imura,H., Ulick,S.
and Shizuta,Y.
Role of steroid 11beta-hydroxylase and steroid 18-hydroxylase in the
biosynthesis of glucocorticoids and mineralocorticoids in humans.
Proc. Natl. Acad. Sci. USA 89, 1458-1462 (1992)
CYP11B1 human
PIR S29068 (30 amino acids)
Kawamoto, T., Mitsuuchi, Y., Toda, K., Miyahara, K.,
Yokoyama, Y., Nakao, K., Hosoda, K., Yamamoto, Y., Imura,
H. and Shizuta, Y.
Cloning of cDNA and genomic DNA for human cytochrome P-450
(11-beta).
FEBS Lett. 269, 345-349 (1990)
CYP11B1 Papio hamadryas ursinus (chacma baboon)
GenEMBL U52085(228bp)
Hampf,M., Swart,A. and Swart,P.
Expression of Papio ursinus steroid 11beta hydroxylase
unpublished (1996)
CYP11B1 rat
GenEMBL D10107 (1528bp) S58847
PIR B46040 (499 amino acids)
Matsukawa,N., Nonaka,Y., Higaki,J., Nagano,M., Mikami,H.,
Ogihara,T. and Okamoto,M.
Dahl's salt-resistant normotensive rat has mutations in
cytochrome P450 (11 beta), but the salt-sensitive hypertensive
rat does not.
J. Biol. Chem. 268, 9117-9121 (1993)
Note: only 1 amino acid difference with 11B1 at position 84
E (normal) changed to G (seen in 11B2 and 11B3)
D10107 has five differences with D11354
CYP11B1 rat
GenEMBL S58858 (330bp) PIR JX0251(499 amino acids)
Nomura,M., Morohashi, K.-i., Kirita, S., Nonaka, Y.,
Okamoto,M., Nawata,H. and Omura,T.
Three forms of rat CYP11B genes: 11 beta-hydroxylase gene,
aldosterone synthase gene, and a novel gene.
J. Biochem. 113, 144-152 (1993)
Note: Fig. 3 has 6 errors. 1,2) aldo-46 amino acids 2 and 3 are
incorrectly translated gctctc = AL not HS. 3) 11B3 codon at 559-561
incorrectly translated gac = D not N. 4,5,6) 11beta-62, 11B1
and 11B3 codon at 964-966 tcc = Ser not Pro.
CYP11B1 rat
GenEMBL S58849, D14086 to D14091 PIR A46039 (499 amino acids)
Mukai,K., Imai,M., Shimada,H. and Ishimura,Y.
Isolation and characterization of rat CYP11B genes involved
in late steps of mineralo- and glucocorticoid syntheses.
J. Biol. Chem. 268, 9130-9137 (1993)
CYP11B1 rat
GenEMBL S63899 (596bp)
Mukai,K., Imai,M., Shimada,H., Okada,Y., Ogishima,T. and
Ishimura,Y.
Structural differences in 5'-flanking regions of rat cytochrome
P-450aldo and P-450(11) beta genes
Biochem. Biophys. Res. Commun. 180, 1187-1193 (1991)
CYP11B1 rat
GenEMBL D11354 (1528bp) PIR A46040 (499 amino acids)
Matsukawa,N., Nonaka,Y., Higaki,J., Nagano,M., Mikami,H.,
Ogihara,T. and Okamoto,M.
Dahl's salt-resistant normotensive rat has mutations in
cytochrome P450 (11 beta), but the salt-sensitive hypertensive
rat does not.
J. Biol. Chem. 268, 9117-9121 (1993)
Note: This sequence is the wild type DS rat sequence. The mutant
DR rat has five amino acids differences accession number D10107
CYP11B1 Cavia porcellus (guinea pig)
GenEMBL Z69785(2028bp)
Bulow,H.E., Mobius,K., Bahr,V. and Bernhardt,R.
Molecular cloning and functional expression of the cytochrome P450
11B-hydroxylase of the guinea pig.
Biochem. Biophys. Res. Commun. 221, 304-312 (1996)
CYP11B1 bovine
PIR JX0151 (503 amino acids)
Kirita, S., Hashimoto, T., Kitajima, M., Honda, S.,
Morohashi, K. and Omura, T.
Structural analysis of multiple bovine P-450(11beta) genes
and their promoter activities.
J. Biochem. 108, 1030-1041 (1990)
CYP11B1 pig
GenEMBL D38590(1671bp)
Sun,T., Zhao,Y., Nonaka,Y. and Okamoto,M.
Cloning and expression of cytochrome P450(11 beta) of porcine
adrenal cortex.
J. Steroid Biochem. Mol. Biol. 52, 227-232 (1995)
CYP11B1 sheep
GenEMBL L34337 (1639bp)
Boon,W.C., Roche,P.J., Hammond,V.E., Jeyaseelan,K.,
Crawford,R.J. and Coghlan,J.P.
Cloning and expression analysis of a cytochrome P450-11-beta
cDNA in sheep.
unpublished (1994)
CYP11B1 Ovis aries (sheep)
GenEMBL L28716 (2300bp)
Anwar,A., Jeyaseelan,K. and Coghlan,J.P.
Molecular cloning and characterization of the ovine CYP11B1
promoter.
Biochem. Mol. Biol. Int. 33, 1169-1178 (1994)
CYP11B1 Ovis ammon (sheep)
GenEMBL L47569(9027bp)
Anwar,A., Jeyaseelan,K. and Coghlan,J.P.
Characterization of an ovine 11-beta hydroxylase (cyp11b) gene.
unpublished (1995)
Cyp11b1 mouse
PIR A41552 (500 amino acids)
Domalik, L.J., Chaplin, D.D., Kirkman, M.S., Wu, R.C., Liu,
W., Howard, T.A., Seldin, M.F., Parker, K.L.
Different isozymes of mouse 11beta-hydroxylase produce
mineralocorticoids and glucocorticoids.
Mol. Endocrinol. 5, 1853-1861 (1991)
Cyp11b1 mouse
PIR A32210 (42 amino acids)
Mouw, A.R., Rice, D.A., Meade, J.C., Chua, S.C., White, P.C.,
Schimmer, B.P. and Parker, K.L.
Structural and functional analysis of the promoter region of
the gene encoding mouse steroid 11beta-hydroxylase.
J. Biol. Chem. 264, 1305-1309 (1989)
CYP11B2 human
GenEMBL J05140 M32864 (1809bp) M32880 (3088bp) M32881 (1101bp)
Mornet,E., Dupont,J., Vitek,A. and White,P.C.
Characterization of two genes encoding human steroid 11-beta-
hydroxylase (P-450-11-beta).
J. Biol. Chem. 264, 20961-20967 (1989)
CYP11B2 human
GenEMBL D90429 D10170 (2114bp)
Kawamoto,T., Mitsuuchi,Y., Toda,K., Yokoyama,Y., Miyahara,K.,
Miura,S., Ohnishi,T., Ichikawa,Y., Nakao,K., Imura,H., Ulick,S.
and Shizuta,Y.
Role of steroid 11beta-hydroxylase and steroid 18-hydroxylase in the
biosynthesis of glucocorticoids and mineralocorticoids in humans.
Proc. Natl. Acad. Sci. USA 89, 1458-1462 (1992)
CYP11B2 human
GenEMBL D13752(6910bp)
Kawamoto,T., Mitsuuchi,Y., Ohnishi,T., Ichikawa,Y., Yokoyama,Y.,
Sumimoto,H., Toda,K., Miyahara,K., Kuribayashi,I., Nakao,K.,
Hosoda,K., Yamamoto,Y., Imura,H. and Shizuta,Y.
Cloning and expression of a cDNA for human cytochrome P-450aldo as
related to primary aldosteronism.
Biochem. Biophys. Res. Commun. 173 (1), 309-316 (1990)
CYP11B2 human
GenEMBL S77398 S77401 S77403 S77406 S77409 (genomic sequences)
Shizuta,Y., Kawamoto,T., Mitsuuchi,Y., Miyahara,K., Rosler,A.,
Ulick,S. and Imura,H.
Inborn errors of aldosterone biosynthesis in humans.
Steroids 60 (1), 15-21 (1995)
CYP11B2 rat
GenEMBL S58859 (353bp) PIR JX0252 (500 amino acids)
Nomura,M., Morohashi, K.-i., Kirita, S., Nonaka, Y.,
Okamoto,M., Nawata,H. and Omura,T.
Three forms of rat CYP11B genes: 11 beta-hydroxylase gene,
aldosterone synthase gene, and a novel gene.
J. Biochem. 113, 144-152 (1993)
CYP11B2 rat
GenEMBL S58850 D14092 to D14097
PIR B46039 (500 amino acids)
Mukai,K., Imai,M., Shimada,H. and Ishimura,Y.
Isolation and characterization of rat CYP11B genes involved
in late steps of mineralo- and glucocorticoid syntheses.
J. Biol. Chem. 268, 9130-9137 (1993)
CYP11B2 rat
GenEMBL S63898 (594bp)
Mukai,K., Imai,M., Shimada,H., Okada,Y., Ogishima,T. and Ishimura,Y.
Structural differences in 5'-flanking regions of rat cytochrome
P-450aldo and P-450(11) beta genes.
Biochem. Biophys. Res. Commun. 180, 1187-1193 (1991)
CYP11B2 rat
GenEMBL S64136 (3001bp) PIR JN0615 (506 amino acids)
GenEMBL U14908 (3000bp)
Zhou,M. and Gomez-Sanchez,C.E.
Cloning and expression of a rat cytochrome P-450 11
beta-hydroxylase/aldosterone synthase (CYP11B2) cDNA variant.
Biochem. Biophys. Res. Commun. 194, 112-117 (1993)
Erratum: Biochem. Biophys. Res. Commun. 196, 1018 (1993)
CYP11B2 rat
PIR A34281 (20 amino acids) Swiss P30099 (510 amino acids)
Ogishima, T., Mitani, F., Ishimura, Y.
Isolation of aldosterone synthase cytochrome P-450 from zona
glomerulosa mitochondria of rat adrenal cortex.
J. Biol. Chem. 264, 10935-10938 (1989)
Note: sequence is from N-terminal after signal sequence
CYP11B2 rat
Swiss P30099 (510 amino acids) GenEMBL D00567 (2824bp)
Swiss P30100 (500 amino acids) GenEMBL D00568 (2705bp)
Matsukawa,N., Nonaka,Y., Ying,Z., Higaki,J., Ogihara,T. and
Okamoto,M.
Molecular cloning and expression of cDNAs encoding rat aldosterone
synthase: variants of cytochrome P-450 11beta
Biochem. Biophys. Res. Commun. 169, 245-252 (1990)
CYP11B2 hamster
GenEMBL S73810 (1503bp)
LeHoux,J.G., Mason,J.I., Bernard,H., Ducharme,L., LeHoux,J.,
Veronneau,S. and Lefebvre,A.
The presence of two cytochrome P450 aldosterone synthase mRNAs in
the hamster adrenal.
J. Steroid Biochem. Mol. Biol. 49, 131-137 (1994)
CYP11B2 cavia porcellus (domestic guinea pig)
GenEMBL AF018569
Buelow,H.E. and Bernhardt,R.
Molecular Cloning and Functional Expression of the Guinea Pig
Aldosterone Synthase.
Unpublished
CYP11B2 Rana catesbeiana (bullfrog)
GenEMBL D10984 (1919bp)
Nonaka,Y., Takemori,H., Halder,S.K., Sun,T., Ohta,M., Hatano,O.,
Takakusa,A. and Okamoto,M.
Frog Cytochrome P-450 (11 beta, aldo), a single enzyme involved in
the final steps
of glucocorticoid and mineralocorticoid biosynthesis.
Eur. J. Biochem. 229, 249-256 (1995)
Cyp11b2 mouse
GenEMBL S85260 (2804bp)
Domalik,L.J., Chaplin,D.D., Kirkman,M.S., Wu,R.C., Liu,W.W.,
Howard,T.A., Seldin,M.F. and Parker,K.L.
Different isozymes of mouse 11 beta-hydroxylase produce
mineralocorticoids and glucocorticoids.
Mol. Endocrinol. 5, 1853-1861 (1991)
CYP11B3 rat
PIR JX0253 (498 amino acids)
Nomura,M., Morohashi, K.-i., Kirita, S., Nonaka, Y.,
Okamoto,M., Nawata,H. and Omura,T.
Three forms of rat CYP11B genes: 11 beta-hydroxylase gene,
aldosterone synthase gene, and a novel gene.
J. Biochem. 113, 144-152 (1993)
CYP11B3 rat
GenEMBL U14907 (1497bp)
Zhou,M., Gomez-Sanchez,E.P., Foecking,M. and Gomez-Sanchez,C.E.
Cloning of the cytochrome P-450 CYP11B3 complementary DNA in the
rat.
unpublished
CYP11B3 rat
GenEMBL S59144 D14098 to D14103
Mukai,K., Imai,M., Shimada,H. and Ishimura,Y.
Isolation and characterization of rat CYP11B genes involved
in late steps of mineralo- and glucocorticoid syntheses.
J. Biol. Chem. 268, 9130-9137 (1993)
Note: only one amino acid difference with Nomura's 11B3
CYP11B3 rat
GenEMBL U17082(1497bp)
Mellon,S.H., Bair,S.R. and Monis,H.
P450c11B3 mRNA, transcribed from a third P450c11 gene, is expressed
in a tissue-specific, developmentally, and hormonally regulated
fashion in the rodent adrenal and encodes a protein with both
11-hydroxylase and 18-hydroxylase activities.
J. Biol. Chem. 270, 1643-1649 (1995)
CYP11B8P rat pseudogene
GenEMBL D14104 to D14108
Mukai,K., Imai,M., Shimada,H. and Ishimura,Y.
Isolation and characterization of rat CYP11B genes involved
in late steps of mineralo- and glucocorticoid syntheses.
J. Biol. Chem. 268, 9130-9137 (1993)
Note: authors call this sequence 11B4
12A Subfamily
CYP12A1 Musca domestica (housefly)
GenEMBL U86618
Rene Feyereisen
MIKYKQYSRAIVALRQRGAQQYSTNVTNASQPDVKATTTTTISP
EWQEAKPFEEMPSMNSWPIIKNMLPWGKYGKMEPTQFLMALRDDMGPIVRTAAFMGRP
PTVITHNPHDFEMVFRNEGIWPIRPGGDAQMYHRTVLREDFFQGVTGLVSVNGEKWGN
FRSTVNPVLMQPKNVRLYLNKMAQVNDEFMARIRQIRDPETLEVPASFQEEMNRWTLE
SVSVVALDKQLGLITTNRDNPDLKKLIGLLNDFFELGQKIEFGLPFWKYIKTPTFKLF
MKTLDGLLEIGNKYVNEAIDRLEAERQSGVPEKPENEKSVLEKLIKIDRKIATVMAID
MILAGVDTTSTTFTALLLCLAKNPEKQEKLREEIRQILPRKDSQFEPSSLNHIPYTRA
CIKEALRMYPLTLGNARILANDTVLSGYRVPKGTLVSMISTGLLQDDNHYTKAKEYLP
ERWMRPTKEETEDSATCPHALKASSPFIYLPFGFGPRSCVGRRIVEMELELGIARLVR
NFRIEFNYPTENAFKFKLINVPNIPLKFKFTDVEN
CYP12A2 Musca domestica (housefly)
GenEMBL U94698
Rene Feyereisen
CYP12A3 Musca domestica (housefly)
GenEMBL U94699
Rene Feyereisen
cyp12a4 Drosophila melanogaster
GenEMBL AC006091 85973-87865 also AC015190 AC008141
chromosome 3 clone BACR48G05 (D475)
MLKVRSALSLIQSQKATLSLATQKHTEYFKILLYIYINKLYYQRWQTNVATAEAREDSEW
LQAKPFEQIPRLNMWALSMKMSMPGGKYKNMELMEMFEAMRQDYGDIFFMPGIMGNPPFL
STHNPQDFEVVFRNEGVWPNRPGNYTLLYHREEYRKDFYQGVMGVIPTQGKPWGDFRTVV
NPVLMQPKNVRLYYKKMSQVNQEFVQRILELRDPDTLEAPDDFIDTINRWTLESVSVVAL
DKQLGLLKNSNKESEALKLFHYLDEFFIVSIDLEMKPSPWRYIKTPKLKRLMRALDGIQE
VTLAYVDEAIERLDKEAKEGVVRPENEQSVLEKLLKVDRKVATVMAMDMLMAGVDTTSST
FTALLLCLAKNPEKQARLREEVMKVLPNKNPEFTEASMKNVPYLRACIKESQRLHPLIVG
NARVLARDAVLSGYRVPAGTYVNIVPLNALTRDEYFPQASEFLPERWLRSPKDSESKCPA
NELKSTNPFVFLPFGFGPRMCVGKRIVEMELELGTARLIRNFNVEFNYPTENAFRSALIN
LPNIPLKFKFIDLPN
cyp12a5 Drosophila melanogaster
GenEMBL AC006091 83449-85296 also AC015190 AC008141
chromosome 3 clone BACR48G05 (D475)
76% identical to other AC006091 SEQ. 58% TO 12A1, 12A2
MLKGRIALNILQSQKPIVFSASQQ*RWQTNVPTAEIRNDPEWLQAKPFEE
IPKANILSLFAKSALPGGKYKNLEMMEMIDALRQDYGNIIFLPGMMGRDG
LVMTHNPKDFEVVFRNEGVWPFRPGSDILRYHRTVYRKDFFDGVQGIIPS
QGKSWGDFRSIVNPVLMQPKNVRLYFKKMSQVNQEFIKEIRDASTQEVPG
NFLETINRWTLESVSVVALDKQLGLLRESGKNSEATKLFKYLDEFFLHSA
DLEMKPSLWRYFKTPLLKKMLRTMDSVQEVTLKYVDEAIERLEKEAKEGV
VRPEHEQSVLEKLLKVDKKVATVMAMDMLMAGVDTTSSTFTALLLCLAKN
PEKQARLREEVMKVLPNKDSEFTEASMKNVPYLRACIKESQRVYPLVIGN
ARGLTRDSVISGYRVPAGTIVSMIPINSLYSEEYFPKPTEFLPERWLRNA
SDSAGKCPANDLKTKNPFVFLPFGFGPRMCVGKRIVEMELELGTARLIRN
FNVEFNHSTKNAFRSALINLPNIPLKFKFKFTDVPN*
CYP12A6 Drosophila wassermani
no accession number
Tina Yee and Phil Danielson
59% to 12A2 56% to 12A1
N-terminal does not match known P450 sequences may be in error
12B Subfamily
CYP12B1 Drosophila acanthoptera
no accession number
Phil Danielson
Ac40
submitted to nomenclature committee
Cyp12b2 Drosophila melanogaster
GenEMBL AC018326 7227-9141 also AC004345 AC004657
77% identical to 12b1
Cyp12c1 Drosophila melanogaster
GenEMBL AC009385 comp(57935-59646) also AC012807
Cyp12d1 Drosophila melanogaster
GenEMBL AC008187 comp(84371-86114)
Cyp12e1 Drosophila melanogaster
GenEMBL AC018294 2070-3932
CYP12F1 Anopheles gambiae (malaria vector)
No accession number
Submitted by Rene Feyereisen July 5, 2002
Low 40% range with other CYP12s 60% to 12F2
CYP12F1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPl3r4
CYP12F2 Anopheles gambiae (malaria vector)
No accession number
Submitted by Rene Feyereisen July 5, 2002
Low 40% range with other CYP12s 60% to 12F1
CYP12F2 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPl3r3
CYP12F3 Anopheles gambiae (malaria vector)
No accession number
Submitted by Rene Feyereisen July 5, 2002
Low 40% range with other CYP12s 60% to 12F2
CYP12F3 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPl3r2
CYP12F4 Anopheles gambiae (malaria vector)
No accession number
Submitted by Rene Feyereisen July 5, 2002
Low 40% range with other CYP12s 55% to 12F2
CYP12F4 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPl3r1
13A Subfamily
CYP13A1 Caenorhabditis elegans (nematode worm)
GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
Wilson,R., Ainscough,R., Anderson,K., Baynes,C., Berks,M.,
Bonfield,J., Burton,J., Connell,M., Copsey,T., Cooper,J.,
Coulson,A., Craxton,M., Dear,S., Du,Z., Durbin,R., Favello,A.,
Fulton,L., Gardner,A., Green,P., Hawkins,T., Hillier,L., Jier,M.,
Johnston,L., Jones,M., Kershaw,J., Kirsten,J., Laister,N.,
Latreille,P., Lightning,J., Lloyd,C., McMurray,A., Mortimore,B.,
O'Callaghan,M., Parsons,J., Percy,C., Rifken,L., Roopra,A.,
Saunders,D., Shownkeen,R., Smaldon,N., Smith,A., Sonnhammer,E.,
Staden,R., Sulston,J., Thierry-Mieg,J., Thomas,K., Vaudin,M.,
Vaughan,K., Waterston,R., Watson,A., Weinstock,L.,
Wilkinson-Sproat,J. and Wohldman,P.
2.2 Mb of contiguous nucleotide sequence from chromosome III of C.
elegans.
Nature 368, 32-38 (1994)
Product T10B9.8
CYP13A2 Caenorhabditis elegans (nematode worm)
GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
see CYP13A1 for reference.
Product T10B9.7
CYP13A3 Caenorhabditis elegans (nematode worm)
GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
see CYP13A1 for reference.
Product T10B9.5
CYP13A4 Caenorhabditis elegans (nematode worm)
GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
see CYP13A1 for reference.
Product T10B9.1
CYP13A5 Caenorhabditis elegans (nematode worm)
GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
see CYP13A1 for reference.
Product T10B9.2
CYP13A6 Caenorhabditis elegans (nematode worm)
GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
see CYP13A1 for reference.
Product T10B9.3
CYP13A7 Caenorhabditis elegans (nematode worm)
GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
see CYP13A1 for reference.
Product T10B9.10
CYP13A8 Caenorhabditis elegans (nematode worm)
GenEMBL Z48717 (34881 bp) and Z83130 (ZK1325) and Z92859 (Y53C12)
see CYP13A1 for reference.
Product T10B9.4
CYP13A9P Caenorhabditis elegans (nematode worm)
GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
see CYP13A1 for reference.
Product T10B9.6 pseudogene
Note: closer inspection suggests this may not be a pseudogene.
A complete gene can be assembled, but perhaps the genefinder program
did not see
it.
CYP13A10 Caenorhabditis elegans (nematode worm)
GenEMBL Z46934 (35989 bp) and Z92859 (Y53C12)
see CYP13A1 for reference.
product ZK1320.4
CYP13A11 Caenorhabditis elegans (nematode worm)
GenEMBL Z81503 (F14F7) and Z92819 (Y37D8) and Z94158 (Y39E4)
F14F7.a contig 666 10-13k region
CYP13A12 Caenorhabditis elegans (nematode worm)
GenEMBL Z81503 (F14F7) and Z92819 (Y37D8) and Z94158 (Y39E4)
F14F7.b contig 666 13-18k region
13B Subfamily
CYP13B1 Caenorhabditis elegans (nematode worm)
GenEMBL Z54269 (19692bp) and Z92827 (C29F7)
F02C12.5 also C29F7 has the end of this sequence
Formerly CYP16A1 but renamed after full length sequence sorted with
CYP13
CYP13B2 Caenorhabditis elegans (nematode worm)
GenEMBL Z81565
K06G5
14A Subfamily
CYP14A1 Caenorhabditis elegans (nematode worm)
GenEMBL Z50742 (20856bp)
Wilson,R., Ainscough,R., Anderson,K., Baynes,C., Berks,M.,
Bonfield,J., Burton,J., Connell,M., Copsey,T., Cooper,J.,
Coulson,A., Craxton,M., Dear,S., Du,Z., Durbin,R., Favello,A.,
Fulton,L., Gardner,A., Green,P., Hawkins,T., Hillier,L., Jier,M.,
Johnston,L., Jones,M., Kershaw,J., Kirsten,J., Laister,N.,
Latreille,P., Lightning,J., Lloyd,C., McMurray,A., Mortimore,B.,
O'Callaghan,M., Parsons,J., Percy,C., Rifken,L., Roopra,A.,
Saunders,D., Shownkeen,R., Smaldon,N., Smith,A., Sonnhammer,E.,
Staden,R., Sulston,J., Thierry-Mieg,J., Thomas,K., Vaudin,M.,
Vaughan,K., Waterston,R., Watson,A., Weinstock,L.,
Wilkinson-Sproat,J. and Wohldman,P.
2.2 Mb of contiguous nucleotide sequence from chromosome III of C.
elegans.
Nature 368 (6466), 32-38 (1994)
K09A11.2
CYP14A2 Caenorhabditis elegans (nematode worm)
GenEMBL Z50742 (20856bp)
see CYP14A1 for reference.
K09A11.3
CYP14A3 Caenorhabditis elegans (nematode worm)
GenEMBL Z50742 (20856bp)
see CYP14A1 for reference.
K09A11.4
CYP14A4 Caenorhabditis elegans (nematode worm)
GenEMBL Z50742 (20856bp) Z70212(R04D3.1 continuation of Z50742)
see CYP14A1 for reference.
after K09A11.5
partial sequence at end of cosmid continues on cosmid R04D3
CYP14A5 Caenorhabditis elegans (nematode worm)
GenEMBL U64847
F08F3.7
CYP15A1 cockroach
No accession number
Rene Feyereisen
CYP15B1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPj2l3
47% identical to 15A1
16A Subfamily
CYP16A1X Caenorhabditis elegans (nematode worm)
GenEMBL Z54269
number retired
This sequence is now CYP13B1
17A Subfamily
CYP17 human
GenEMBL NM_000102
MWELVALLLLTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLP
RHGHMHNNFFKLQKKYGPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQMATL
DIASNNRKGIAFADSGAHWQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATH
NGQSIDISFPVFVAVTNVISLICFNTSYKNGDPELNVIQNYNEGIIDNLSKDSLVDLV
PWLKIFPNKTLEKLKSHVKIRNDLLNKILENYKEKFRSDSITNMLDTLMQAKMNSDNG
NAGPDQDSELLSDNHILTTIGDIFGAGVETTTSVVKWTLAFLLHNPQVKKKLYEEIDQ
NVGFSRTPTISDRNRLLLLEATIREVLRLRPVAPMLIPHKANVDSSIGEFAVDKGTEV
IINLWALHHNEKEWHQPDQFMPERFLNPAGTQLISPSVSYLPFGAGPRSCIGEILARQ
ELFLIMAWLLQRFDLEVPDDGQLPSLEGIPKVVFLIDSFKVKIKVRQAWREAQAEGST
CYP17 human EST
GenEMBL Z19875 (235bp)
UK-HGMP (United Kingdom human genome mapping project)
covers amino acids 270-348 when translating the complementary
strand. The fragment goes through at least 6 frame shifts.
sequence ID AAAAWEO
CYP17 human EST
GenEMBL Z20209 (248bp)
UK-HGMP (United Kingdom human genome mapping project)
covers amino acids 265-349 when translating the complementary
strand. The fragment goes through at least 8 frame shifts.
sequence ID AAABPSZ
CYP17 human
GenEMBL S85459 (556bp)
Biason,A., Mantero,F., Scaroni,C., Simpson,E.R. and Waterman,M.R.
Deletion within the CYP17 gene together with insertion of foreign
DNA is the cause of combined complete 17
alpha-hydroxylase/17,20-lyase deficiency in an Italian patient
Mol. Endocrinol. 5, 2037-2045 (1991)
CYP17 rat
PIR A31359 (507 amino acids)
Namiki, M., Kitamura, M., Buczko, E. and Dufau, M.L.
Rat testis P-450-17-alpha cDNA: the deduced amino acid
sequence, expression and secondary structural configuration.
Biochem. Biophys. Res. Commun. 157, 705-712 (1988)
CYP17 rat
PIR D41425 (16 amino acids)
Imaoka, S., Kamataki, T. and Funae, Y.
Purification and characterization of six cytochromes P-450
from hepatic microsomes of immature female rats.
J. Biochem. 102, 843-851 (1987)
CYP17 rat
GenEMBL S50146 Z11902 (3345bp)
PIR S20655 (97 amino acids)
Nason,T.F., Han,X.G. and Hall,P.F.
Cyclic AMP regulates expression of the rat gene for steroid
17 alpha-hydroxylase/C17-20 lyase P-450 (CYP17) in rat Leydig
cells.
Biochim. Biophys Acta 1171, 73-80 (1992)
Note: have not been able to download S50146 from GCG or NCBI.
CYP17 rat
GenEMBL X69816 (7556bp)
Givens,C.R., Zhang,P., Bair,S.R. and Mellon,S.H.
Transcriptional regulation of rat cytochrome P450c17 expression in
mouse Leydig MA-10 and adrenal Y-1 cells: identification of a
single protein that mediates both basal and cAMP-induced activities.
DNA Cell Biol. 13, 1087-1098 (1994)
CYP17 rat
PIR S24316 (97 amino acids)
Zhang, P., Nason, T.F., Han, X.G. and Hall, P.F.
Gene for 17-alpha-hydroxylase/C(17-20) lyase P-450: complete
nucleotide sequence of the porcine gene and 5' upstream
sequence of the rat gene.
Biochim. Biophys. Acta 1131, 345-348 (1992)
CYP17 hamster
no accession number
Cloutier,M., Fleury,A., Courtemanche,J., Ducharme,L.
Mason,J.I. and Lehoux,J.G.
Cloning and expression of hamster adrenal cytochrome P450c17 cDNA.
Ann. N.Y. Acad. Sci. 774, 294-296 (1995)
CYP17 Cavia (guinea pig)
GenEMBL S75277(1732bp)
Tremblay,Y., Fleury,A., Beaudoin,C., Vallee,M. and Belanger,A.
Molecular cloning and expression of guinea pig cytochrome P450c17
cDNA (steroid 17 alpha-hydroxylase/17,20 lyase): tissue
distribution, regulation, and substrate specificity of the
expressed enzyme.
DNA Cell Biol. 13 (12), 1199-1212 (1994)
CYP17 Cavia porcellus (guinea pig)
PIR S52756 (508 amino acids)
Huang,Y., Voigt,J.M. and Colby,H.D.
unpublished
CYP17 pig
GenEMBL Z11854 to Z11856 GenEMBL S40341 (1858bp)
PIR S24233 (501 amino acids) PIR S30074 (501 amino acids)
Zhang,P. Nason,T.F., Han,X.G. and Hall,P.F.
Gene for 17 alpha-hydroxylasae/C(17-20) lyase P-450: complete
nucleotide sequence of the porcine gene and 5' upstream sequence of
the rat gene.
Biochim. Biophys. Acta 1131, 345-348 (1992)
CYP17 Sus scrofa (pig)
GenEMBL U41519 to U41525
Conley,A.J., Graham-Lorence,S.E., Kagimoto,M., Lorence,M.C.,
Murry,B.A., Oka,K., Sanders,D. and Mason,J.I.
Nucleotide sequence of a cDNA encoding porcine testis 17
alpha-hydroxylase cytochrome P-450.
Biochim. Biophys. Acta 1130, 75-77 (1992)
CYP17 bovine
GenEMBL M64646 (1725bp)
Zuber,M.X., John,M.E., Okamura,T., Simpson,E.R. and Waterman,M.R.
Bovine adrenocortical cytochrome P-450-17-alpha: Regulation of gene
expression by ACTH and elucidation of primary sequence
J. Biol. Chem. 261, 2475-2482 (1986)
CYP17 Ovis aries (sheep)
GenEMBL L40335 (1728bp)
Murry,B.A., Swart,P. and Mason,J.I.
Cloning and expression of ovine cytochrome P-450 17-alpha
hydroxylase/c17-20 lyase.
Unpublished (1995)
CYP17 Equus caballus (horse)
GenEMBL D30688(1906bp) D13818
Hasegawa,T., Mukoyama,H., Yoshida,S. and Takahashi,M.
Molecular cloning and nucleotide sequence of equine testicular
cytochrome P-450 steroid 17alpha-hydroxylase/C17,20-lyase messenger
ribonucleic acid.
Biol. Reprod. Mono. 1, 615-622 (1995)
CYP17 Equus caballus (horse)
GenEMBL D88184(6217bp)
Hasegawa,T.
Exon/Intron structure of Equine P450c17.
unpublished (1996)
CYP17 Oncorhynchus mykiss (rainbow trout)
GenEMBL S50356
Sakai,N., Tanaka,M., Adachi,S., Miller,W.L. and Nagahama,Y.
Rainbow trout cytochrome P-450c17 (17 alpha-hydroxylase/17-20
lyase) cDNA cloning, enzymatic properties and temporal pattern of
ovarian P-450c17 mRNA expression during oogenesis.
FEBS Lett. 301, 60-64 (1992)
Identical to X65800
CYP17 Oncorhynchus mykiss (rainbow trout)
GenEMBL X65800 (2287bp) Swiss P30437 (514 amino acids)
Sakai,N., Tanaka,M., Adachi,S., Miller,W.L. and Nagahama,Y.
Rainbow trout cytochrome P-450c17 (17
alpha-hydroxylase/17,20-lyase). cDNA cloning, enzymatic properties
and temporal pattern of ovarian P-450c17 mRNA expression during
oogenesis
FEBS Lett. 301, 60-64 (1992)
Identical to S50356
CYP17 Oryzias latipes (medaka)
GenEMBL D87121(2421bp)
Kobayashi,D., Matsuyama,M., Tanaka,M., Fukada,S. and Nagahama,Y.
Structural analysis of medaka P-450c17 and expression in the
ovarian follicle.
unpublished (1996)
CYP17 Oryzias latipes (medaka)
GenEMBL D87122(2302bp)
Kobayashi,D., Tanaka,M., Fukada,S. and Nagahama,Y.
Presence of a Novel Cytochrome P-450c17 Transcripts in Medaka
Gonads
unpublished (1996)
note: this sequence is missing exon 6
otherwise identical to D87121
CYP17 Squalus acanthias (dogfish)
GenEMBL S77384 (1964bp)
Trant,J.
Isolation and characterization of the cDNA encoding the spiny
dogfish shark
(Squalus acanthias) form of cytochrome P450c17
J. Exp. Zool. 272, 25-33 (1995)
CYP17 Ictalurus punctatus (channel catfish)
GenEMBL AF063837
Trant,J.M., Berard,C., Byrne,B.J. and Wunder,J.
Isolation and heterologous expression of the cDNA encoding the
cytochrome P450 17-hydroxylase from the channel catfish (Ictalurus
punctatus)
Unpublished (1998)
CYP17A1 Fugu rubripes (pufferfish)
No accession number
Scaffold_4175
80% to Oryzias latipes 17, 79% to trout 17 73% to catfish
61% to dogfish 48% to other CYP17 from Fugu
MDWVLFVYAFSAVNLALLALHLKFRTPASGPRGPPRLPALPLIGSLLSLRSPHPPHVLFKE
LQGKYGQTYSLMMGSHRVIIVNHHAHAKEVLLKKGKIFAGRPRS 11667 (0)
11572 VTTDVLSRDGKDIAFGDYSATWRFHRKIVHGALCMFGEGSASIEKI 11435 (1)
10871 ICAEAASLCSILSEAWTAGLALDLSPELTRAVTNVICSLCFSSSYRRGDAEFEAMLHYSQ 10692
10691 GIVDTVAKDSLVDIFPCLQ 10635 (0)
IFPNADLRLLKRCVSVRDKLLQKEYDKHK (0)
AAYSDHVQRDLLDALLRAKCSAENNNTTGINAESVGLTDDHLLMT
VGDIFGAGVETTTTVMKWAITYLIHHPQ 9765 (0)
9637 IQSRIQEELDSRVGMDRSPQLSDRGSLPYLEATIREVLRIRPVAPLFIPHVALSDT 9473 (2)
9369 SIGDFAVKKGTRVVINLWSLHHDEKEWENPERFDP 9265 (1)
GRFLNSEGTGLVIPSSSYLPFGAGVRVCLGEALAK 8558
8557 MELFLFLSWILQRFTLTVPSGHSLPSLEGKFGVVLQPTKYKVNATPRPGWEGKCKACWN* 8378
CYP17A2 Fugu rubripes (pufferfish)
No accession number
Scaffold_8086
50% to Oryzias latipes 17, 48% to trout 17 50% to catfish
49% to dogfish 48% to other CYP17 from Fugu
1198 MVTVGSFLIFRRPVRGSEPGSEAGPPRVKVPCISWVPVLGSLPWLRGGRPLHLIFTQLSYR 1380 (2)
1600 YGPLFALYLGPHLTVVVNNHQHAREVLLLRGKDFAGRPRM 1719 (0)
1797 VTTDLLTRGGKDIAFSDYCPLWKSHRRLVQNSFTLFGEGTSRLQDM 1934 (1)
2061 VLAAVDSLCEELLSMEGRGFDPAPAVTRAVTNVVCMLVFSATYRHGDSELQEVLRYNDGI 2240
2241 VQTIAGGGLVDIYPWMK 2291 (0)
2372 VFPNKTLSKLKACIAVRDRLLTHKLEEHK 2458 (0)
2622 ATLTDNQPRDLLDALLMGQVGRGRRKGSGRVEEDIITEDHVLMTAAEAFGAGVETTSTTLLWILAYLLHHPQ 2837 (0)
2925 VQERVQKELDDHVGSERPVRVSDRARLTYLDCVINEGMRIRPVSPVLIPHTAMTDSR 3074 (2)
3870 IGGHHISRGTRVLVNMWSIHHDSAHWDKPDLFNP 3971 (1)
4519 DRFRDHQGQRVTPSCFLPFGAGPRVCVGESLARLELFLFLSSLLQRMSFRLPNGA 4692
4693 SPPDLQGRMGVVLQPVPYKVVVTPRVG* 4776
CYP17 fragment a Fugu rubripes (pufferfish)
No accession number
Fc:c028I22x2 LPC.10549.x2 Length = 1007 61% to 17A1 Fugu
GAPRATTDNHHAHAKEARPKKGKKAAGRPRK exon 1
ATTDGASRDGTDTAEGDHSATRRDQR exon 2
CYP17 fragment b Fugu rubripes (pufferfish)
No accession number
Fc:c028I22x1 LPC.10549.x1 Length = 894
78% to Fc:c028I22x2 65% to 17A1 Fugu
KEQQAEHGQTTSRMMGSHRGNTDNQHAHAKEARQKKGKKGAGRPRK exon 1
ATTDGRSRDGTDIAEGDHSATWRHHRKIVQRARRRNGEGSAPSEKI exon 2 C-helix
18A Subfamily
Cyp18 Drosophila melanogaster
GenEMBL S66112 (63bp)
Hurban,P. and Thummel,C.
Isolation and characterization of fifteen ecdysone-inducible
drosophila genes reveal unexpected complexities in ecdysone regulation.
Molec. Cell Biol. 13, 7101-7111 (1993)
Note: very short ecdysone inducible fragment in the heme binding
region about 2/3 of amino acids are identical to 2D sequences.
called Eig 17-1
dimethylnitrosamine demethylase
Cyp18 Drosophila melanogaster
GenEMBL U44753(2539bp)
Bassett,M.H, Waterman,M.R., McCarthy,J.L. and Sliter,T.J.
Cloning and characterization of CYP18 in Drosophila melanogaster:
identification of an insect member of a new cytochrome P450 family.
unpublished (1996)
complete sequence from the Eig 17-1 fragment
Cyp18 Drosophila melanogaster
GenEMBL AC012164 114600-117669 also AC015216
CYP18 Spodoptera littoralis (cotton leafworm)
No accession number
Lyndsay Davies
Submitted to nomenclature committee 2/22/02
61% identical to CYP18 from Drosophila
since there is only one CYP18 seq in Drosophila this new sequence
will be called CYP18 without a subfamily or number like CYP18A1.
If more CYP18s are found in a single species this may have to change.
19A Subfamily
CYP19 human
GenEMBL S52034 (142bp) S52789 (106bp) S52793 (149bp)
S52794 (125bp)
Harada,N.
A unique aromatase (P-450AROM) mRNA formed by alternative
use of tissue-specific exons 1 in human skin fibroblasts.
Biochem. Biophys. Res. Commun. 189, 1001-1007 (1992)
CYP19 human
GenEMBL D14473 (295bp) S59092 S59095 S59171
Toda,K and Shizuta,Y.
Molecular cloning of a cDNA showing alternative splicing of
the 5'-untranslated sequence of mRNA for human aromatase P-450.
Eur. J. Biochem. 213, 383-389 (1993)
CYP19 human
GenEMBL D13391 (2238bp)
Katsumi,T. and Shizuta,Y.
Identification and characterization of cis-acting regulatory
elements for
the expression of the human aromatase cytochrome P-450 gene.
J. Biol. Chem. 269, 8099-8107 (1994)
CYP19 human
GenEMBL D21240 (794bp) D21241 (3231bp)
Harada,N., Utsumi,T. and Takagi,Y.
Tissue-specific expression of the human aromatase cytochrome P-450
gene by alternative use of multiple exons 1 and promoters, and
switching of tissue-specific exons 1 in carcinogenesis.
Proc. Natl. Acad. Sci. U.S.A. 90 (23), 11312-11316 (1993)
CYP19 human
GenEMBL X55983 (669bp)
Toda,K., Miyahara,K., Kawamoto,T., Ikeda,H., Sagara,Y. and
Shizuta,Y.
Characterization of a cis-acting regulatory element involved in
human-
aromatase P-450 gene expression.
Eur, J. Biochem. 205, 303-309 (1992)
exon 1
CYP19 human
GenEMBL S71536 (792bp)
Toda,K., Simpson,E.R., Mendelson,C.R., Shizuta,Y. and Kilgore,M.W.
Expression of the gene encoding aromatase cytochrome P450 (CYP19)
in fetal tissues
Mol. Endocrinol. 8, 210-217 (1994)
CYP19 human
GenEMBL M32245 (840bp)
Harada,N., Yamada,K., Saito,K., Kibe,N., Dohmae,S. and Takagi,Y.
Structural characterization of the human estrogen synthetase
(aromatase) gene.
Biochem. Biophys. Res. Commun. 166, 365-372 (1990)
CYP19 human
GenEMBL D29757 (875bp) PIR PC2041 (45 amino acids)
Honda,S.-I., Harada,N. and Takagi,Y.
Novel exon 1 of the aromatase gene specific for aromatase
transcripts
in human brain.
Biochem. Biophys. Res. Commun. 198, 1153-1160 (1994)
CYP19 human
GenEMBL M22246 (2966bp)
Harada,N.
Cloning of a complete cDNA encoding human aromatase: immunochemical
identification and sequence analysis
Biochem. Biophys. Res. Commun. 156, 725-732 (1988)
CYP19 human
GenEMBL L21982 (1166bp)
Mahendroo,M.S., Mendelson,C.R. and Simpson,E.R.
Tissue-specific and hormonally-controlled alternative promoters
regulate aromatase cytochrome P450 gene expression in human adipose
tissue
J. Biol. Chem. 268, 19463-19470 (1993)
CYP19 human
GenEMBL S85356 (1384bp)
Means,G.D., Kilgore,M.W., Mahendroo,M.S., Mendelson,C.R. and
Simpson,E.R.
Tissue-specific promoters regulate aromatase cytochrome P450 gene
expression in human ovary and fetal tissues.
Mol. Endocrinol. 5, 2005-2013 (1991)
CYP19 human
GenEMBL S96437 (971bp)
Kilgore,M.W., Means,G.D., Mendelson,C.R. and Simpson,E.R.
Alternative promotion of aromatase P-450 expression in the human
placenta.
Mol. Cell. Endocrinol. 83, R9-R16 (1992)
CYP19 human
PIR A40542 (48 amino acids)
Mahendroo, M.S., Means, G.D., Mendelson, C.R. and Simpson, E.R.
Tissue-specific expression of human P-450-AROM. The promoter
responsible for expression in adipose tissue is different
from that utilized in placenta.
J. Biol. Chem. 266, 11276-11281 (1991)
CYP19 Macaca fuscata (Japanese macaque)
GenEMBL S79807(369bp)
Yamada-Mouri,N., Hirata,S., Hayashi,M. and Kato,J.
Analysis of the expression and the first exon of aromatase mRNA in
monkey brain.
J. Steroid Biochem. Mol. Biol. 55 (1), 17-23 (1995)
CYP19 rat
GenEMBL S59505 (639bp)
Fitzpatrick,S.L. and Richards,J.S.
cis-acting elements of the rat aromatase promoter required for
adenosine 3',5'-monophosphate induction in ovarian granulosa cells
and constitutive expression in R2C Leydig cells.
Molec. Endocrinol. 7, 341-354 (1993)
Note: promoter
CYP19 rat
GenEMBL Z11815 (590bp)
Hickey,G.J., Krasnow,J.S., Beattie,W.G. and Richards,J.S.
Aromatase cytochrome P450 in rat ovarian granulosa cells before and
after luteinization: Adenosine 3',5'-monophosphate-dependent and
independent regulation. Cloning and sequencing of rat aromatase
cDNA and 5' genomic DNA
Mol. Endocrinol. 4, 3-12 (1990)
CYP19 Oryctolagus cuniculus (rabbit)
GenEMBL Z68271(1783bp)
Delarue,B., Mittre,H., Feral,C., Benhaim,A. and Leymarie,P.
Rapid sequencing of rabbit aromatase cDNA using RACE PCR without
cloning.
C. R. Acad. Sci. III, Sci. Vie 319, 663-670 (1996)
CYP19 Oryctolagus cuniculus (rabbit)
GenEMBL Z70302(1455bp)
Delarue,B., Mittre,H. and Leymarie,P.
Expression des transcrits codant pour l'aromatase de lapin dans
differents tissus.
unpublished (1996)
Cyp19 mouse
Swiss P28649 (503 amino acids) GenEMBL D00659 (2420bp)
Terashima,M., Toda,K., Kawamoto,T., Kuribayashi,I., Ogawa,Y.,
Maeda,T. and Shizuta,Y.
Isolation of a full-length cDNA encoding mouse aromatase P450
Arch. Biochem. Biophys. 285, 231-237 (1991)
CYP19 pig
no accession number
Corbin, C.J., Khalil, M.W. and Conley, A.J.
Functional ovarian and placental isoforms od porcine aromatase.
Mol. Cell. Endocrinol. 113, 29-37 (1995)
CYP19 Sus scrofa (pig)
GenEMBL L15471 (454bp)
Ko,Y., Choi,I., Green,M.L., Simmen,F.A. and Simmen,R.C.
Transient expression of the cytochrome P450 aromatase gene in
elongating porcine blastocysts is correlated with uterine
insulin-like growth factor levels during peri-implantation
development.
Mol. Reprod. Dev. 37 (1), 1-11 (1994)
Note: This is only a fragment of 80 amino acids, including
helix K and the EXXR conserved sequence.
CYP19 Sus scrofa (pig)
GenEMBL U52141(1133bp)
Choi,I., Collante,W., Simmen,R.C.M. and Simmen,F.A.
Molecular cloning of multiple forms of cytochrome p450 aromatase
and their developmental expression in porcine blastocysts,
endometrium, and placenta.
Unpublished (1997)
CYP19 Sus scrofa (pig)
GenEMBL U52142(1584bp)
Choi,I., Collante,W., Simmen,R.C.M. and Simmen,F.A.
Molecular cloning of multiple forms of cytochrome p450 aromatase
and their developmental expression in porcine blastocysts,
endometrium, and placenta.
Unpublished (1997)
CYP19 Sus scrofa (pig)
GenEMBL U37309 (417bp)
Choi,I., Simmen,R.C. and Simmen,F.A.
Molecular cloning of cytochrome P450 aromatase complementary
deoxyribonucleic acid from periimplantation porcine and equine
blastocysts identifies multiple novel 5'-untranslated exons
expressed in embryos, endometrium, and placenta.
Endocrinology 137 (4), 1457-1467 (1996)
CYP19 Sus scrofa (pig)
GenEMBL U37311(2470bp)
Choi,I., Simmen,R.C. and Simmen,F.A.
Molecular cloning of cytochrome P450 aromatase complementary
deoxyribonucleic acid from periimplantation porcine and equine
blastocysts identifies multiple novel 5'-untranslated exons
expressed in embryos, endometrium, and placenta.
Endocrinology 137 (4), 1457-1467 (1996)
CYP19 Sus scrofa (pig)
GenEMBL U57510
Choi,I., Collante,W., Simmen,R.C.M., Troyer,D. and Simmen,F.A.
Molecular cloning and structural characterization of porcine
cytochrome p450 aromatase chromosomal genes: evidence for the
existence of multiple, closely related genes that encode
developmental and tissue-specific isoforms of aromatase.
unpublished (1997)
CYP19 Sus scrofa (pig)
GenEMBL U57517(287bp) U57518(287bp) U57519(358bp)
Choi,I., Collante,W., Simmen,R.C.M., Troyer,D. and Simmen,F.A.
Molecular cloning and structural characterization of porcine
cytochrome p450 aromatase chromosomal genes: evidence for the
existence of multiple, closely related genes that encode
developmental and tissue-specific isoforms of aromatase.
unpublished (1997)
CYP19 Sus scrofa (pig)
GenEMBL U57520(517bp) U57521(495bp)
Choi,I., Collante,W., Simmen,R.C.M., Troyer,D. and Simmen,F.A.
Molecular cloning and structural characterization of porcine
cytochrome p450 aromatase chromosomal genes: evidence for the
existence of multiple, closely related genes that encode
developmental and tissue-specific isoforms of aromatase.
unpublished (1997)
CYP19 bovine
GenEMBL S66248 (2104bp)
Hinshelwood,M.M., Corbin,C.J., Tsang,P.C. and Simpson,E.R.
Isolation and characterization of a cDNA insert encoding bovine
aromatase
cytochrome P450.
Endocrinology 133, 1971-1977 (1993)
Note two amino acid differences with Z32741
CYP19 bovine
GenEMBL M64646 (1725bp)
Zuber,M.X., John,M.E., Okamura,T., Simpson,E.R. and Waterman,M.R.
Bovine adrenocortical cytochrome P-450-17-alpha: Regulation of gene
expression by ACTH and elucidation of primary sequence.
J. Biol. Chem. 261, 2475-2482 (1986)
CYP19 bovine
GenEMBL Z32741 (4226bp) PIR S44210 (503 amino acids)
Vanselow,J. and Furbass,R.
Aromatase cytochrome P450 gene and pseudogene
unpublished (1994)
CYP19 bovine
GenEMBL Z69241 to Z69250 (genomic sequences)
Furbass,R. and Vanselow,J.
unpublished (1996)
CYP19P bovine
GenEMBL Z32813 (1006bp)
Vanselow,J. and Furbass,R.
Aromatase cytochrome P450 gene and pseudogene
unpublished (1994)
CYP19 Equus caballus (horse)
GenEMBL U37313 (458bp)
Choi,I., Simmen,R.C. and Simmen,F.A.
Molecular cloning of cytochrome P450 aromatase complementary
deoxyribonucleic acid from periimplantation porcine and equine
blastocysts identifies multiple novel 5'-untranslated exons
expressed in embryos, endometrium, and placenta.
Endocrinology 137 (4), 1457-1467 (1996)
CYP19 chicken (three different strains of chicken)
GenEMBL M73277 to M73285, M73286 to M73294
M73295 to M73303
PIR A41063 (495 amino acids)
Matsumine,H., Herbst,M., Ou,S.-H.I., Wilson,J.D. and
McPhaul,M.J.
Aromatase mRNA in the extragonadal tissues of chickens with
the henny-feathering trait is derived from a distinctive
promoter structure that contains a segment of a retroviral
long terminal repeat.
J. Biol. Chem. 266, 19900-19907 (1991)
CYP19 Coturnix coturnix japonica (Japanese quail)
GenEMBL S46949 (692bp) PIR A48977(230 amino acids)
Harada,N., Yamada,K., Foidart,A. and Balthazart,J.
Regulation of aromatase cytochrome P-450 (estrogen synthetase)
transcripts in the quail brain by testosterone.
Brain Res. Mol. Brain Res. 15, 19-26 (1992)
Note: only three amino acid differences with chicken.
CYP19 Coturnix coturnix (quail)
GenEMBL D50336(4351bp)
Kudo,T., Yamamoto,H., Sato,S. and Sutou,S.
Comparison of 5' upstream regions of chicken and quail aromatase
gene.
Unpublished (1995)
CYP19 Poephila guttata (zebra finch)
GenEMBL S75898(3188bp)
Shen,P., Campagnoni,C.W., Kampf,K., Schlinger,B.A., Arnold,A.P. and
Campagnoni,A.T.
Isolation and characterization of a zebra finch aromatase cDNA: in
situ hybridization reveals high aromatase expression in brain
Brain Res. Mol. Brain Res. 24 (1-4), 227-237 (1994)
CYP19 Ictalurus punctatus (channel catfish)
GenEMBL S75715(2102bp)
Trant,J.
Isolation and characterization of the cDNA encoding the channel
catfish (Ictalurus punctatus) form of cytochrome P450arom
Gen. Comp. Endocrinol. 95 (2), 155-168 (1994)
CYP19 Onchorynchus mykiss (rainbow trout)
Tanaka, M., Telecky, T.M., Fukada, S., Adachi,S., Chen, S. and
Nagahama, Y.
Cloning and sequence analysis of the cDNA encoding P-450 aromatase
(P450arom) from a rainbow trout (Onchorynchus mykiss) ovary,
relationship
between the amount of P450arom mRNA and the production of
oestradiol-17-beta
in the ovary.
J. Mol. Endocrin. 8, 53-61 (1992)
CYP19 Carassius auratus (goldfish)
GenEMBL U18974(2939bp)
Gelinas,D.M., Pitoc,G.A. and Callard,G.V.
Isolation of goldfish brain aromatase cDNA and analysis of
expression during the reproductive cycle and after steroid
treatment in vivo.
unpublished (1996)
CYP19 Oryzias latipes (medaka)
GenEMBL D82968(1851bp)
Tanaka,M., Fukada,S., Matsuyama,M. and Nagahama,Y.
Structure and promoter analysis of the cytochrome P-450 aromatase
gene of the teleost fish, medaka (Oryzias latipes)
J. Biochem. 117 (4), 719-725 (1995)
CYP19 Tilapia nilotica (Cichlid fish)
GenEMBL U72071(1804bp)
Chang,X.T., Kobayashi,T., Nakamura,M., Kajura,H. and Nagahama,Y.
Isolation and characterization of cDNA encoding the tilapia
(Oreochromis niloticus) cytochrome P450 aromatase (P450arom):
Changes in P450arom mRNA, protein and enzyme activity in ovarian
follicles during oogenesis.
J. Mol. Endocrinol. (1996) In press
CYP19 Haplochromis burtoni (cichlid fish)
GenEMBL AF114716
White,R.B.
Aromatase expression during social change in the cichlid fish,
Haplochromis burtoni.
Unpublished
CYP19 Paralichthys olivaceus (Japanese Flounder)
GenEMBL AB017182
Kitano,T., Takamune,K., Kobayashi,T., Nagahama,Y. and Abe,S.
Suppression of P450 Aromatase (P450arom) Gene Expression in
Sex-Reversed Males Produced by Rearing Genetically Female Larvae at
High Water Temperature during a period of Sex Differentiation in
Japanese Flounder (Paralichthys olivaceus)
Unpublished (1998)
CYP19A1 Fugu rubripes (pufferfish)
No accession number
Scaffold_7098
65% to AF183906 ovary form of CYP19 from Zebrafish
60% to AF183908 brain form of CYP19 from Zebrafish
61% to other Fugu CYP19
this is probably the ovary form
9466 MAAVGLDAEVLVSVSPNATEAESPGSSAGTRALIILTCLLLLVWSHTEKKSVP 9308 (1)
9242 SLLGPSFCLGFGPLLTYVRFIWTGIGTASNYYNKKYGDIVRVWVNGEETLVISR 9081 (2)
8985 ASAVHHVLKSRQYTSRFGSKQGLSCIGMNERGIIFNNNVTEWRKIRGYFTK 8830 (1)
8759 ALTGPAVQNTVEVCNSSTQAHLDRLEDLAQVDVLSLLRCTVVDISNRLFLDIPIN 8595 (1)
8499 EKELLLKIHKYFDTWQTVLIKPDIYFKFGWIHQKHKTAA 8392 (2)
8296 RELQEAIEGLVEQKRRDLEQADKLENINFTAELLFAQ 8186 (0)
8084 NHGELSAENVMQCVLEMVIAAPDTLSVSLFFMLLLLKQNPDVELQLLQEIDAVVGK (0 expected, bad boundary)
RQLQNGDLQKLRVLETFINECLRFHPV 7719
7718 VDFTMRRSLSDDVIEGYRVPKGTNIILNTGHMHRTEFFLRPTEFCLQNFEKN 7563 (0)
APRRYFQPFGSGPRACVGKHIAMVMMKSILVTLLSQYSVCPHEGLT 7327
7326 LDCLPQTNNLSQQPVEHQEEAQQLSMRFLPRQRGSWQTV* 7207
CYP19A1 Danio rerio (zebrafish)
GenEMBL AF183906
Chiang,E.F.L., Yan,Y.L., Guiguen,Y., Postlethwait,J. and Chung,B.C.
Two Cyp19 (P450 Aromatase) Genes on Duplicated Zebrafish
Chromosomes Are Expressed in Ovary or Brain
Mol. Biol. Evol. 18 (4), 542-550 (2001)
Called CYP19a expressed in ovary
CYP19A2 Fugu rubripes (pufferfish)
No accession number
Scaffold_4200
65% to both brain and ovary forms of CYP19 from Zebrafish
61% to other Fugu CYP19
this is probably the brain form
2918 MKPKETLNITASGPFTPLPLPLMMMMMMLLLMMMMLFLTWNRPQRQHVP (1)
3451 GPLFLAGLGPLLSYCRFMWTGIGTACNFYNNKYGSLVRVWINGEETLILSR (2)
3674 SSAVYHVLRSAHYTARFGSRAGLECIGMEGQGVIFNSDVQLWRRARVYFSK (1)
3960 ALTGPGLQRTVGVCVTSTAKHLDCLVDMTDASGHVDALNLLRAIVVDISNRLFLRVPLN 4136 (1)
EKDLLTKIHNYFETWQAVLIKPDIFFKIGWLFDKHRRAA 4319 (2)
QELQDTMAALLKVKRKLVHEAEKLDDVLDFATELILAQ (0)
EAGEFSADNVRQCVLEMVIAAPDTLSISLFFMLMLLKQHPDVELRIVEELSTvsrt (0)
egeENIDYQRLKVMESFINESMRFHPVVDFTMRKALEDDTIEGIRIRKGTNIILNIGLMHKTE 5039
5040 FFPKPREFSLTNFEQT (0)
VPSRFFQPFGCGPRSCVG 5219
5220 KHIAMVMMKAILATLLSRYTVCPRHGCTLTSIRQTNNLSQQPVEDEHSLAMRFIPRTIQSPS* 5408
CYP19A1 Danio rerio (zebrafish)
GenEMBL AF183908
Chiang,E.F.L., Yan,Y.L., Guiguen,Y., Postlethwait,J. and Chung,B.C.
Two Cyp19 (P450 Aromatase) Genes on Duplicated Zebrafish
Chromosomes Are Expressed in Ovary or Brain
Mol. Biol. Evol. 18 (4), 542-550 (2001)
Called CYP19b expressed in brain
CYP19 Trachemys scripta (red-eared slider turtle)
no accession number
Crews,D.
Temperature-dependent sex determination: the interplay of steroid
hormones
and temperature.
Zool. Sci. 13, 1-13 (1996)
CYP20 human
GenEMBL AC011737.8 chromosome 2 clone RP11-33N4,
AC011737.8 chr 2 (missing exons 12,13)
AC080075.2 (missing exons 1,7,8)
MLDFAIFAVTFLLALVGAVLYLYP (0)
ASRQAAGIPGITPTEEK (2)
DGNLPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTVDVLKQHINPNKTS (1)
DPFETMLKSLLRYQSGGGSVSENHMRKKLYENGVTDSLKSNFALLLK (0)
LSEELLDKWLSYPETQHVPLSQHMLGFAMKSVTQMVMGSTFEDDQEVIRFQKNHGT (0)
VWSEIGKGFLDGSLDKNMTRKKQYED (1)
ALMQLESVLRNIIKERKGRNFSQHIFIDSLVQGNLNDQQ (0)
ILEDSMIFSLASCIITAK (1)
LCTWAICFLTTSEEVQKKLYEEINQVFGNGPVTPEKIEQLR (2)
YCQHVLCETVRTAKLTPVSAQLQDIEGKIDRFIIPRE (0)
TLVLYALGVVLQDPNTWPSPHK (2) genomic seq stops here the rest is cDNA
FDPDRFDDELVMKTFSSLGFSGTQECPELR (2) intron site based on fish genomic DNA
FAYMVTTVLLSVLVKRLHLLSVEGQVIETKYELVTSSREEAWITVSKRY
CYP20 mouse
GenEMBL AK020848 adult retina cDNA plus ESTs for C-term
MLDFAIFAVTFLLALVGAVLYLYPASRQASGIPGLTPTEEKDGN
LPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTTDVLKQHFNPNKTSDPFET
MLKSLLGYQSGGGSAGEDHVRRKLYGDAVTASLHSNFPLLLQLSEELLDKWLSYPETQ
HIPLSQHMLGFALKFVTRMVLGSTFEDEQEVIRFQKIHG
TVWSEIGKGFLDGSLDKNTTRKKQYQEALMQLESTLKKIIKERKGGNFRQHT
FIDSLTQGKLNEQQILEDCVVFSLASCIITAR
LCTWTIHFLTTTGEVQKK
LCKEIDQVLGEGPITSEKIEQLSYCQQVLFETVRTAKLTPVSARLQDIEGKVGPFVIPKE 360
TLVLYALGVVLQDPSTWPLPHRFDPDRFADEPVMKVFSSLGFSGTWECPELXFAYMVTAV 540
LVSVLLEKLRLLAVDRQVVEMKYELVTSAREEAWITVSKRH*
CYP20 Bos taurus (cow)
MLDFAIFAVTFLLALVGAVLYLYPASRQAAGIPGITPTEEKDGNLPDIV
NSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTVDVLKQHINPNKTLDPFETMLKSLLR
YQSDSGNVSENHMRKKLYENGVTNCLRINFALLIKLSEELLDKWLSYPESQHVPLCQHML
GFAMKSVTQMVMGSTFEDEQEVIRFQKNHGTVWSEIGKGFLDGSLDKSTTRKKQYEDALM
QLESILKKIKERKGRNFSQHIFIDSLVQGNLNDQQILEDTMIFSLAS
CMITAKLCTWAVCFLTTYEEIQKKLYEEIDQVLGKGPITSEKIEELRYCRQVLCETVRTA
KLTPVSARLQDIEGKIDKFIIPRETLVLYALGVVLQEXGTWSSPYKFDPERFDDESVMKT
FSLLGFSGTRECPELRFAYMVTAVLLSVLLRRLHLLSVE
GQVIETKYELVTSSKEEAWITVSKRY 498
CYP20 Gallus gallus (chicken)
BU454844 603215388F1 CSEQRBN14 Gallus gallus cDNA clone ChEST201o21 5'.
BU111630 603125978F1 CSEQCHL13 Gallus gallus cDNA clone ChEST95k19 5'
BU356654 603474052F1 CSEQCHN70 Gallus gallus cDNA
MLDFAIFAVTFLLILVGAVLYLYP
ASRQASGIPGLAPTDDK
DGNLPDIIASRSLHEFLVNLHEKYGPLVSFWFGRRLVVSLGSIDLLKQHVNPNRSS
DPFEMMLKSFLRYQSSLNGDTGESHLRRKLYESGVSKSLQSNLALIQK
LSEELLAKWLSLPEAQHIPLCQHMLGFAMKSVTQTAMGSSFEDDQEVIRFRRHHDA
IWSEIGKGFLDGSLDKNATRKKLYED
ALKEMESTLRKVITGTPRQIIQQAFIDTLLQGNLSDQQ
ILEDTMIFSLAGCIITAN
LCTWAVYFLTTSEDVQQNLCKEVDHVLGKGPITHEKIEQLR
YCRQVLCETVRTAKLTPIAAQLQELEGRVDQHTVPKE
TLVLYALGVMLQDSSSWPSPYK
FDPERFSEDSAMTNFSLLGFSGSQECPELR
FAYMVATVLLSILVRKLYLHPVKGQVMETKYELVTSPKEEAWITVSKRS*
CYP20 Xenopus laevis (African clawed frog)
BJ037591.1 NIBB Mochii normalized Xenopus neurula library Xenopus laevis cDNA clone XL040p15 5'
BQ725586 BQ725586.1 AGENCOURT_8103453 NICHD XGC Emb2 Xenopus laevis cDNA clone
95% identical to tropicalis seq. 74% to chicken
BC044111.1 mRNA, complete cds Length = 2417
175 MLDFAIFAITFLLILVGAVLYLYPSSRQACGIPGLAPTEEKDGNLQDIVNSGS 333
334 LHEFLVNLHERFGPVASFWFGRRLVVSLGSLDLLKQHINPNKTSDPFQTMLKSLLGYQSG 513
514 VIGEAAESHVQKKLYENGITKALHSNFSVIIKLSEELLAKWGTYPQSQHVPLCQHMLGFA 693
694 MKSVTQTAMGSSFEDDQEVIHFRRNHDAIWSEIGKGFLDGSIERSPSRKKLYEDALMEME 873
874 TVLKKTIKERKGKNPGRHVFLDSLLQGNLSDKQVLEDSMIFSLAGCVITANLCTWAIYFL 1053
1054 TTSEEVQDKLYKEVNRVIGKGPITMDKLEQLSYCRQILCETVRTASLTPISARLQELEGR 1233
1234 VDQHIIPKETLVLYALGVVLQDNTAWPLAYRFDPDRFDDETAKQSLSLLGLSGSQECPEL 1413
1414 RFAYMVAMVLLCVLVRKLNLLPVKGQVMETKYELVTSPKEEAWITVSKRS 1563
CYP20 Xenopus tropicalis
From JGI blast server http://aluminum.jgi-psf.org/prod/bin/runBlast.pl?db=xenopus1&dump=1
And from Sanger http://www.sanger.ac.uk/Projects/X_tropicalis/blast_server.shtml
AL848968.1 Xenopus tropicalis EST, clone TEgg007o12 5'
AL870093 AL870093.1 Xenopus tropicalis EST, clone TEgg120l22 5'
60% to Fugu CYP20, 70% to human CYP20, 62% to zebrafish, 30% to ciona CYP20 like seq
72% to chicken
MLDFAIFAITFLLILVGAVLYLYP exon 1
SSRQACGIPGLAPTEEK exon 2
DGNLQDIVNSGSLHEFLVNLHERFGPVASFWFGRRLVVSLGSLDLLKQHINPNKTS exon 3
DPFQMMLKSLLGYQSGVIGEAAESHVQKKLFENGIIKALHSNFSVVIK exon 4
LSEDLLAKWLTYPQSQHVPLCQHMLGFAMKSVTQTAMGSSFDDDQEVIHFRRNHDA exon 5
IWSEIGKGFLDGSIERSPNRKKLYED exon 6
ALMEMETVLKKAIKERKVKNPGRHVFVDSLLQGNLSDKQ exon 7
VLEDSMIFSLAGcvitan exon 8
VCTWAIYFLTTSEEVQDKLFKEVTRVIGKGPITMDKLEQLS exon 9
YCRQILCETVRTASLTPISARLQELEGRVDQHIIPKE exon 10
tlvlyalgvvlqdntawplayr exon 11 X. laevis
FDPDRFNDETAKQSLTLLGFSGSQECPELR exon 12
FAYMVAMVLLSVLVRKLHLLPVKGQVMETKYELVTSPKEEAWITVSKRS exon 13
CYP20 Fugu rubripes (pufferfish)
No accession number
Scaffold_486
59% TO CYP20 human
MLDFAIFAVTFVIVLVGAVLYLYP (0)
SSRRASGIPGLNPTDEK (2)
11654 DGNLQDIVGRGSLHEFLVSLHQEFGPVASFWFGSRPVVSLGSLQQLRQHINPNHST 11487 (1)
DSFETMLKSLLGYHSGGGGASTDSIIRKKVYQGAIDTTLKNNFPLVLK (0)
LVDELVGKWKSFPEDQHTPLCAHQLVLAMKTITQLALGESFSEDARVIAFRKNHDV (0)
IWSEIGKGYMDGSLEKSTSRKGHYEK (1)
ALSEMESTLLSVVKERKSQRNKSVFVDSLIQSTLTERQ
IMEDCMVFMLAGCAITAN
VCIWALHFLSTSEEVQDRLYKEFEEVLGSSPVSLEKIPQLR
YCQQVLNETLRTAKLTPIAARLQEVEGKVDQHLIPKE
SLVIYALGVILQDSDTWNAPYR
FDPDRFEEESVKKSFHLLGFSGSQTCPELR
FAYTVATVLLSVLVRQLKLHRLKDTLMEVRSELVSTPRDETWITFNLRN*
CYP20 Tetraodon nigroviridis (freshwater pufferfish)
>FS_CONTIG_2529_2 Length = 13425 from http://fugu.hgmp.mrc.ac.uk/blast/
>FS_CONTIG_2529_1 Length = 1009 from http://fugu.hgmp.mrc.ac.uk/blast/
MLDFAIFAVTFVVILVGAVLYLYP
SSRRASGVPGLNPTDEK
XXXXXXXXXXXDIVARGSVQEFLVSLHQEFGPVASFWFGSRPVVSLGSLQQLQQHANPNRSS
DSFETMLKSLLGYHSGGGGASTENIIRKKVYQGAIDATLKNNFPLVLK
LVDELVGKWTSSPEDQHTPLCAHQLVLAMKSITQLALGESFSQDARVVSFRRNYDA
IWSEVGKGFMDGSLERSTSRKGRYEX
ALSEMEATLLSVVKDRKSQRKTSVLVDTLLQSTLTDRQ
IMEDCMVFTLAGCAITAN
VCIWALHFLSSYEDVQDRLHQELEEVLGSGSVSLEKIPQLR
YCQQVLNETVRTAKLTPVAAGLQEVEGKVDQHLIPKE
TLVIYALGVILQDSHTWDAPCR
FHPDRFEEESVRKSFRLLGFSGSQTCPELR
VAYTVATVLLSAVVRQLRLHRLEDTLVEVRSELVSTPREETWITFSRRN
CYP20 Danio rerio (zebrafish)
Assembled CYP20 from zebrafish ESTs
BQ259821 faa04d08.y1 zebrafish fin day3 regeneration
BM185037 fv16g05.x2 zebrafish adult brain
BG985721 2543 NICHD Zebrafish normalized I
BM070720 fu98e06.y1 zebrafish adult brain cDNA
CA472036 AGENCOURT_10739799 NCI_CGAP_ZKid1 cDNA (kidney?)
BI981894 fu52c12.y1 zebrafish adult brain Danio rerio cDNA
BM083017 fu28a11.y1 Campbell zebrafish ovary cDNA MLDFAIFAVTFVIILIGAVLYLYPSSRRASGVPGLNPTEEKDGNLQDIVNKGSLHEFLVG
LHDEFGSVASFWFGARPVVSLGAVNQLRQHINPNWTTDSFETMLKSLLGYQSGSGVGLTE
SMMRKKVYEGAINKTLENNFPLLLQQVEELVDKWASYPKSQHTPLCAHLLGLAMKAVTQL
AMGSRFRDDAEVIRFRKNHEAIWSQ
IGKGYLDGSLEKSSSRKAHYESALAEMESVLKSVAKQRPGQGSSQSFVNYLLQANLTER
QVMEDGMVFTLAGCVITANLCIWAVHFLSVSEAVQDRLYHELVEVLGDEPVSLEKIPQLR
YCQQVLNETVRTAKLTPVAARLQEVEGKVDQHIIPKETLVIYALGVVLQDADTWSLPYRF
NPDRFAEESVMKSFSLLGFSGSQACPELRFAYTVATVLL
STLVRRLRMHRVDGQVVEARYELVTTPKDDTWITVSKRN*
CYP20 Danio rerio (zebrafish)
>ctg10765 genomic contig CYP20 74% to fugu
9501 MLDFAIFAVTFVIILIGAVLYLYP (0) 9572
SSRRASGVPGLNPTEEK (2?)
DGNLQDIVNKGSLPEFLVGLH
DEFGSVASFWFGARPVVSLGAVNQLRQHINPNWT (1) 10024
12291 TDSFETMLKSLLGYQSGSGVGLTESMMRKKVYEGAINKTLENNFPLLLQ (0) 12439
12929 QVEELVDKWASYPKSQHTPLCAHFL 13003 frameshift
13003 GLAMKAVTQLAMGSRFRDDAEVIRFRKNHEA (0) 13095
15738 IWSEIGKGYLDGSLEKSSSRKAHYES (1?) 15815 AC boundary istead of AG
15897 ALAEMESVLKSVAKQRPGQGSSQSFVNYLLQANLTERQ (0) 16010
16583 VMEDGMVFTLAGCVITAN (1) 16636
17689 LCIWAVHFLSVSEAVQDRLYHELVEVLGDELVSLEKIPQLR (2) 17811
19293 YCQQVLNETVRTAKLTPVAARLQEVEGKVDQHVIPKE (0) 19403
21269 TLVIYALGVVLQDADTWSLPYR (2) 21334
21425 FNPDRFAEESVMKSFSLLGFSGSQACPELR (2) 21514
FAYTVATVLLSTLVRRLRMHRVDGQVVEARYELVTTPKDDTWITVSKRN* from wz10135.3
CYP20 Oryzias latipes (medaka fish)
BJ524824.1 MF01SSB cDNA Oryzias latipes cDNA
BJ003683.1 BJ003683 MF01SSA cDNA Oryzias latipes cDNA
MLDFAIFAVTFVVILVGAVLYLYPSSRRASGVPGLFPTDEKDGNLQDIVDRGSLHEFLV
GLHEQFGPVASFWFGRQPVVSLGSVDPLRQHINPNHTTDSFETMLKSLLGYQAGAGGGAN
ESVMRKKLYESAINNALKNSFPAVLKVAEELVDKWSSVPEDQHIPLCAHLLGLALKTV
TQLALGERFKDDAEVISFRKNHEAIWSEIGKGYMDGSLEKSSSRKRHYE SA
LSDMEATLLAV AKDRKAQRRQTA FVDALLQSGLTERQIMEDCMVFTLAGC
VITANLCIWALHFLSTAEDVQEKLCQEVEDLFGSDPVSLDRIPQLKYCQQVLNE
TVRTAKLTPVAARLXEVEGKVGQHVIPKETLVIYALG
VVLQDADTWSTPYRFDPDRFQDESARKSFCLLGFSGSQTCPELRFAYTVATVLLATLVRRLKLRPLK
About 26 aa missing at end
CYP20 Salmo salar (Atlantic salmon)
GenEMBL CA063128.1 ssalrgb509318 mixed_tissue Salmo salar cDNA.
GenEMBL CB516811.1 ssalrgb509318_rev mixed_tissue Salmo salar cDNA.
GenEMBL CB513409.1 ssalrgb531212_rev mixed_tissue Salmo salar cDNA.
BG935303.1 SL1-0624 Atlantic Salmon liver Salmo salar cDNA clone SL1-0624
MLDFAIFAVTFVIFLVGAVLYLYPSSRSASGIPGLNPTEEKDGNLQDIVNRGSLHEF
LASLHGQFGPVASFWFGGRPVVSLGSVDQLRQHINPNRTTDSFETMLKSLLGYQSGTGGG
ATEAVMRKKLYESAVNNTLEKNFPMLLKLVEELVGKWQSFPKDQHTPLCAHLLGLAMKAVTQ
18 amino acids missing here
RKNHEAIWSEIGKGYLDGSMEKSSIRKEHYESA
LAEMETVLMSVAKDRKGQRSQTAFVDTLLQSNLTERQVME
DSMVFTLAGCVITANLCIWAVHFLST
SEDVQEKLHQELEDVLGSEPVSLDKIPQLRYFQQVLNETVRTAKLTPIAARLQENEGKVD
QHIIPKETLVIYALGVVLQDADTWSCPYKFDPDRFTEDSARKSFSLLGFSGNQACPELRF
AYTVATVVLSTVVRQLKLYQVKGQVVEARSELVSTPKDDTWITVSRRS*
21A Subfamily
CYP21A1P human
GenEMBL M13935 (3206bp)
White,P.C., New,M.I. and Dupont,B.
Structure of human steroid 21-hydroxylase genes.
Proc. Natl. Acad. Sci. U.S.A. 83, 5111-5115 (1986)
97% to 21A2 NT_033167.1|Hs6_33343
329415 MLLLGLLLLLPLLAGARLLWNWWKLRSLHLLPLAPGFLHLLQPDLPIYLLGLTQKFGPIYRLHLGLQ 329215
329117 DVVVLNSKRTIEEAMVKKWADFAGRPEPLTCK 329022
LVSKNYPDLSL
328702 XXWSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCE 328592
RMRAQPGTPVAIEEEFSLLTCSINCYLTFGDKIK 328383
328294 EDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLR 328193
328091 FFPNPGLRRLKQAIEKRDHNEEKQLRQHK
327835 ESLVAGQWRDMMDYMLQGVAQPSMEEGSGQLLEGHLHMAAVDLLIGGTETTANTLSWAVV
327654 FLLHHPE 327634
IQQRL*EELDHELGPGASSSRVPYKDRARLPLLNATIAEVLRLWPVV
327292 PLALPHRTTRPS 327257
SISGYDIPEGTVIIPNLQGAHLDETVWERPHEFWP 327069
326971 DRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPSGDALPSLQPLPH
326791 CSVILKMQPFQVRLQPRGMGAHSPGQNQ 326708
CYP21A1P human with congenital adrenal hyperplasia
GenEMBL M26857 (4034bp) X05445
Rodrigues,N.R., Dunham,I., Yu,C.Y., Carroll,M.C.,
Porter,R.R. and Campbell,R.D.
Molecular characterization of the HLA-linked steroid
21-hydroxylase B gene from an individual with congenital
adrenal hyperplasia.
EMBO J. 6, 1653-1661 (1987)
CYP21A1P human
GenEMBL S60612 (426bp)
Collier,S., Tassabehji,M. and Strachan,T.
A de novo pathological point mutation at the 21-hydroxylase locus:
implications for gene conversion in the human genome.
Nature Genetics 3, 260-265 (1993)
CYP21A2 human
GenEMBL M21544 M21545 M21547 to M21550 M23224 M23225
M31022 M31023 (ten segments)
Globerman,H., Amor-Gueret,M., Parker,K.L., New,M.I. and White,P.C.
Nonsense mutation causing steroid 21-hydroxylase deficiency
J. Clin. Invest. 82, 139-144 (1988)
CYP21A2 human
GenEMBL M12792 M23280 (5141bp)
Higashi,Y., Tanae,A., Inoue,H., Hiromasa,T., and
Fujii-Kuriyama,Y.
Aberrant splicing and missense mutations cause steroid 21-
hydroxylase [P-450(C21)] deficiency in humans: possible gene
conversion products.
Proc. Natl. Acad.Sci USA 85, 7486-7490 (1988)
CYP21A2 human
GenEMBL X54940
Partanen,J. and Campbell,R.D.
Rapid characterisation of mutant P450c21 Genes by PCR
unpublished (1993)
CYP21A2 human
GenEMBL X58898 to X58908 PIR S26485 (97 amino acids)
PIR S26484 (371 amino acids) PIR S29670 (495 amino acids)
PIR S29671 (372 amino acids) PIR S26584 (371 amino acids)
PIR S29673 (372 amino acids)
Helmberg,A., Tabarelli,M., Dobler,G. and Kofler,R.
Identification of molecular defects causing congenital adrenal
hyperplasia by cloning of PCR-amplified 21-hydroxylase genes
unpublished (1992)
CYP21A2 human with congenital adrenal hyperplasia
GenEMBL M28548 (4044bp) X05449
Rodrigues,N.R., Dunham,I., Yu,C.Y., Carroll,M.C.,
Porter,R.R. and Campbell,R.D.
Molecular characterization of the HLA-linked steroid
21-hydroxylase B gene from an individual with congenital
adrenal hyperplasia.
EMBO J. 6, 1653-1661 (1987)
Note: mutant gene, 2 amino acid differences
CYP21A2 human with congenital adrenal hyperplasia
GenEMBL M26856 X05448 (4042bp)
Rodrigues,N.R., Dunham,I., Yu,C.Y., Carroll,M.C.,
Porter,R.R. and Campbell,R.D.
Molecular characterization of the HLA-linked steroid
21-hydroxylase B gene from an individual with congenital
adrenal hyperplasia.
EMBO J. 6, 1653-1661 (1987)
Note: normal gene
CYP21 rat
GenEMBL U56853(1964bp)
Zhou,M.Y., Vila,M.C., Gomez-Sanchez,E.P. and Gomez-Sanchez,C.E.
Cloning of two alternatively spliced 21-hydroxylase cDNAs in the
rat adrenal.
unpublished (1996)
MLLPGLLLLLLLLLLAGTRWLWGQWKLWKLRLPPLAPGFLHFLQPNLPVYLFGLAQKLGP
IYRIRLGLQDVVVLNSNKTIEEALIQKWVDFAGRPQILDGKMNFDLSMGDYSLTWKAHKK
LSRSALVLGMRDSMEPLVEQLTQEFCERMRAQAGASVAIHKEFSLLTCSIISCLTFGDKQ
DSTLLNATHSCVRDLLKAWNHWSVQILDIIPFLRFFPNPGLWKLKQFQESRDHIVMQELK
RHKDSLVAGQWKDMIDYMLQGVEKQRDARDPGQLHERHVHMSVVDLFVGGTETTAATLSW
AVAFLLHHPEIQKRLQEELDLKLAPSSQLLYKNRMQLPLLMATIAEVLRLRPVVPMALPH
RATKASSISGYDIPKDTIIIPNIQGANLDEMVWELPSKFWPDRFLESGKSPRIPTFGCGA
RVCLGEPLARLEFFVVLARLLQTFTLLPPPDGTLPSLQPLPYTGINLLIPPFQVRLQPRN
LAPQDQGQK
CYP21 rat
GenEMBL U56854
alternatively splice form
CYP21 pig
GenEMBL S53049 M83939 (4792bp) Swiss Q02390 (492 amino acids)
PIR S28169 (492 amino acids)
Burghelle-Mayeur,C., Geffrotin,C. and Vaiman,M.
Sequences of the swine 21-hydroxylase gene (Cyp21) and
a portion of the opposite-strand overlapping gene of unkown
function previously described in human.
Biochim. Biophys. Acta 1171, 153-161 (1992)
CYP21 sheep
PIR A43349 (497 amino acids) B43349 (479 amino acids)
Crawford, R.J., Hammond, V.E., Connell, J.M. and Coghlan, J.P.
The structure and activity of two cytochrome P450c21 proteins
encoded in the ovine adrenal cortex.
J. Biol. Chem. 267, 16212-16218 (1992)
B43349 is truncated early after alternate splicing
CYP21 Fugu rubripes (pufferfish)
No accession number
Scaffold_15
44% to CYP21 human over 454 amino acids
this is the first fish CYP21 sequence found
MCSFLFCSSFSAPPAVKRNLPLSLWQLPLRPSSPPIPGPPCRFLIGNMTE
28230 LMHDHLPIHLTNLAKRYGNIYRLKCGNTT 28144 (1)
28058 AMIVLNSSDIIREALVKKWSDFAGRAVSYT 27969 (1)
27888 ADIVSGGGRNISLGDYTEEWKALRRLVHGALQRCCKHSLHNVIERQALQLRK 27754 (0)
VLVDYRGGAVDLSEDFTVAASNVIITLVFGKE (0)
YDKSSSELQQLHRCLNEIVALWGSTWISALDTFPLLR (0)
KFPNPVFSRLLREVSRRDEIIRKHLNQFK (0)
CVLCCAQSEGHRRTDVITGSLLEG (0)
26896 VLTDMHVHMATVDLLIGGSETTAAWLNWTVAFLLHRPE (0)
FQTKVYEELCTVLEGRYPKYSDRQRLPILCSLIHEVLRLRPVAPLAVPHKAIRDS (2)
SIAGYFIPRNTIIIPNLFGAHHDPEVWSDPYSFKP (1)
26238 ERFLEGGGGSTRALIPFGGGARLCLGETVAKMELFLFTAYLLRDFCFVLPDSEAPLPDLR 26059
26058 GVASVVLKIKSFTVIARPRTGP* 25990
CYP21 Tetraodon nigroviridis (freshwater pufferfish)
GenEMBL AL281449.1 C0BG094DE12LP1 G Tetraodon nigroviridis
genomic clone 094J24 T7.Length = 895
AL233853.1 C0BG007BE04XD1 G Tetraodon nigroviridis
genomic clone 007I08 T7.Length = 1079
86% to Fugu CYP21
MGCIFFFFYLPFSAPPAVKRSLLQSLCGLLHRPSSPSIPGPPCRFLIGNMTE
LMQDHLPIHLTDLAKRYGNIYRLKCGNTS
AMVVLSSGDVIREALVKKWSDFAGRSVSYT
ADIVSGGGRTISLGDYTEEWKAHRRLVHSAL (frameshift) ERCXKQSLHDVIERQALQLRK
Missing exon 4 and part of exon 5
GSAWISALDTFPLLR
KFPNPVFSRLLREVTRRDEIIRKHLNQYK
CVLCCVQSQDNKSTDVITGSLLEG
VLTDVHVHMATVDLLIGGTETTAAWLNWTVAFLLHRPE
IQTKVYEELCTVLEGRYPKYSDRHRLPVLCSLVHEVLRLRPVAPLAVPHKAVRDS
SIAGYFIPKNTIIIPNLFGAHHDPXVWPDPYSFXX
Missing exon 11
22A Subfamily
CYP22A1 Caenorhabditis elegans (nematode worm)
GenEMBL U39648 (28581 bp) AF407572, NM_171699, NM_171698
cosmid T13C5.1
Jia,K., Albert,P.S. and Riddle,D.L.
DAF-9, a cytochrome P450 regulating C. elegans larval development
and adult longevity
Development 129 (1), 221-231 (2002)
daf-9 mutant This gene may be in the pathway to
synthesize a ligand for DAF-12, a nuclear receptor.
Daf-9 has a role in larval development.
DAF comes from abnormal DAuer Formation
23A Subfamily
CYP23A1 Caenorhabditis elegans (nematode worm)
GenEMBL U39472 (42282 bp)
see CYP14A1 for reference
cosmid B0304.3
24A Subfamily
CYP24 rat
GenEMBL L04608 to L04619, S52625 to S52636
Ohyama,Y., Noshiro,M., Eggertsen,G., Gotoh,O., Kato,Y.,
Bjorkhem,I. and Okuda,K.
Structural characterization of the gene encoding rat
25-hydroxyvitamin D3 24-hydroxylase.
Biochemistry 32, 76-82 (1993)
CYP24 rat
GenEMBL Z28351 (1419bp)
Hahn,C.N., Kerry,D.M., Omdahl,J.L. and May,B.K.
Identification of a vitamin D responsive element in the promoter of
the
gene for the rat 25-hydroxyvitamin D3 24-hydroxylase
Nuc. Acids Res. 22, 2410-2416 (1994)
CYP24 rat
GenEMBL X59506 (3209bp)
Ohyama,Y., Noshiro,M. and Okuda,K.
Cloning and expression of cDNA encoding 25-hydroxyvitamin D3 24-
hydroxylase.
FEBS Lett. 278, 195-198 (1991)
CYP24 rat
GenEMBL D17792 (?bp)
Ohyama, Y., Ozono,K., Uchida,M., Shinki,T., Kato,S., Suda,T.,
Yamamoto,O.,
Noshiro,M. and Kato,Y.
Identification of a vitamin D-responsive element in the 5'-flanking
region of the rat
25-hydroxyvitamin D3 24-hydroxylase gene.
J. Biol. Chem. 269, 10545-10550 (1994)
CYP24 rat
GenEMBL U03112 (?bp)
Zierold,C., Darwish,H.M. and DeLuca,H.F.
Identification of a vitamin D-response element in the rat calcidiol
(25-hydroxyvitamin D3) 24-hydroxylase gene.
Proc. Natl. Acad.Sci. USA 91, 900-902 (1994)
CYP24 human
GenEMBL S67623 (776bp)
Labuda,M., Lemieux,N., Tihy,F., Prinster,C. and Glorieux,F.H.
Human 25-hydroxyvitamin D 24-hydroxylase cytochrome P450
subunit maps to a different chromosomal location than that of
pseudovitamin D-deficient rickets.
J. Bone Miner. Res. 8, 1397-1406 (1993)
CYP24 human
GenEMBL L13286 (3254bp)
Chen,K.-S., Prahl,J.M. and DeLuca,H.F.
Isolation and expression of 1,25-dihydroxyvitamin D3
24-hydroxylase cDNA.
Proc. Natl. Acad. Sci. USA 90, 4543-4547 (1993)
CYP24 human
NM_000782
MSSPISKSRSLAAFLQQLRSPRQPPRLVTSTAYTSPQPREVPVC
PLTAGGETQNAAALPGPTSWPLLASLLQILWKGGLKKQHDTLVEYHKKYGKIFRMKLG
SFESVHLGSPCLLEALYRTESVPQRLEIKPWKAYRDYRKEGYGLLILEGEDWQRVRSA
FQKKLMKPGEVMKLDNKINEVLADFMGRIDELCDERGHVEDLYSELNKWSFESICLVL
YEKRFGLLQKNAGDEAVNFIMAIKTMMSTFGRMMVTPVELHKSLNTKVWQGHTLAWDT
IFKSVKACIDNRLEKYSQQPSADFLCDIYHQNRLSKKELYAAVTELQLAAVETTANSL
MWILYNLSRNPQVQQKLLKEIQSVLPENQRPREEDLRNMPYLKACLKESMRLTPGVPF
TTRTLDKATVLGEYALPKGTVLMLNTQVLGSSEDNFEDSSQFRPERWLQEKEKINPFA
HLPFGVGKRMCIGRRLAELQLHLALCWIVRKYDIQATDNEPVEMLHSGTLVPSRELPI
AFCQR
CYP24 Fugu rubripes (pufferfish)
No accession number
Scaffold_4128
MLWRLRGALTLPPELTVLDAIPGPTNWPLVGSLFELLRKGGLTRQHEAL
VDYHKKFGKIFRLKLGSFESVHIGAPCLLESLYRTEGSYPQRLEIKPWTAYRDMRDEAYGLLIL
EGKDWQRVRRAFQQKLMKPTEVVKLDRKINE
VLEDFVSRIGKTNIGGKIEDLYFELNKWSFES
ICLVLYDKRFGLLQDKVNEEAMNFITAVKT
MMSTFGLMMVTPVELHKSLNTKTWQDHTAAWDRIFST
AKVYIDKKLKRNSVIAPDDLIGDILHQSRLSKKELYAAITELQIGGVET
8608 TANSMLWAIFNLSRNPGAQRRLLEEIRTVVPPEQDPCGEHIKSMPYLKACLKESMR 8441 (2)
7830 ISPSVPFTSRTLDKDTVLGDYAIPKG 7753
TVLMINSHALGSSEDYFDDGKKFKPERWLREHGTINPFAH
VPFGIGKRMCIGRRLAELQMSLFLQLVRDFE
IVATDNEPLDVIHSGLLVPNRELPVAFIKR
25A Subfamily
CYP25A1 Caenorhabditis elegans (nematode worm)
GenEMBL Z66495 ( 40145bp)
see CYP14A1 for reference
cosmid C36A4.1
CYP25A2 Caenorhabditis elegans (nematode worm)
GenEMBL Z66495 ( 40145bp)
see CYP14A1 for reference
cosmid C36A4.2
CYP25A3 Caenorhabditis elegans (nematode worm)
GenEMBL Z66495 ( 40145bp)
see CYP14A1 for reference
cosmid C36A4.3
CYP25A4 Caenorhabditis elegans (nematode worm)
GenEMBL Z66495 ( 40145bp)
see CYP14A1 for reference
cosmid C36A4.6
CYP25A5 Caenorhabditis elegans (nematode worm)
GenEMBL AF038613
see CYP14A1 for reference
F42A6
CYP25A6P Caenorhabditis elegans (nematode worm)
GenEMBL U50072
K06B9.1
missing C-terminal
26A Subfamily
Cyp26a1 mouse
No accession number
Jim Ray
submitted to nomenclature committee
Note: new family in mammals, homolog to human ESTs R51129 and R21282
Cyp26a1 mouse
No accession number
Martin Petkovich
submitted to nomenclature committee
Note: new family in mammals, homolog to human ESTs R51129 and R21282
Cyp26a1 mouse
GenEMBL Y12657
Fujii, H., Sato, T., Kaneko, S., Gotoh, O., Fujii-Kiriyama, Y.,
Osawa, K.,
Kato, S. and Hamada, H.
Metabolic inactivation of retinoic acid by a novel P450
differentially expressed in
developing mouse embryos.
EMBO J. 16, 4163-4173 (1997)
Note: new family in mammals, homolog to human ESTs R51129 and R21282
CYP26A1 human
GenEMBL NM_000783
White,J.A., Beckett-Jones,B., Guo,Y.D., Dilworth,F.J., Bonasoro,J.,
Jones,G. and Petkovich,M.
cDNA cloning of human retinoic acid-metabolizing enzyme (hP450RAI)
identifies a novel family of cytochromes P450
J. Biol. Chem. 272 (30), 18538-18541 (1997)
Note: new family in mammals, equal to human ESTs R51129 and R21282
CYP26A1 human
NM_000783
MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPL
PPGTMGFPFFGETLQMVLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL
GDDRLVSVHWPASVRTILGSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEV
GSSLEQWLSCGERGLLVYPEVKRLMFRIAMRILLGCEPQLAGDGDSEQQLVEAFEEMT
RNLFSLPIDVPFSGLYRGMKARNLIHARIEQNIRAKICGLRASEAGQGCKDALQLLIE
HSWERGERLDMQALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREELKSKG
LLCKSNQDNKLDMEILEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELNGYQIPKGW
NVIYSICDTHDVAEIFTNKEEFNPDRFMLPHPEDASRFSFIPFGGGLRSCVGKEFAKI
LLKIFTVELARHCDWQLLNGPPTMKTSPTVYPVDNLPARFTHFHGEI
CYP26A1 zebra fish
GenEMBL U68234
White, J.A., Guo, Y.-D., Baetz, K., Beckett-Jones, B., Bonasoro, J.
Hsu, K.E., Dilworth, F.E., Jones, G. and Petkovich, M.
Identification of the retinoic acid-inducible all trans retinoic
acid 4-hydroxylase.
J. Biol. Chem. 271, 29922-29927 (1996)
Note: new family in vertebrates, homolog to human ESTs R51129 and
R21282
CYP26A1 Fugu rubripes (pufferfish)
No accession number
Scaffold_12575
67% to 26A1 human
missing C-term zebrafish seq shown in lower case
2144 MAVSALLATFLCTIVLPLLLFLVTVKLWEVY (frameshift)
VIRERDSACP (frameshift)
SPLPPG (frameshift)
TMGLPFIGETLQLILQ (0)
1689 RRKFLRMKRQKYGYIYRTHLFGNPTVRVTGANNVRHILLGEHRLVAV 1552
1551 QWPASVRTILGSDTLSNVHGAQHKTKKK 1465 (0)
1230 AIMQAFSREALEFYIPAMQHEVQAAVQEWLAKDSCVLVY
PEMKRLMFRIAMQILLGFQLE 1051 (frameshift)
1049 QIKTDEQKLVEAFEEMIKNLFSLPIDMPFSGLYR 948 (0)
784 GLKARNFIHAKIEENIKRKLRESNSDSKCRDALQQLIDSSKKSGQVLSMQ 635 (0)
548 VLKESATELLFGGHETTASTATSLIMFLGLNPEVLDKLRHELSDKVMHKGF 396 (1)
329 LDLRSLNLETLEQLKYTSCVIKETLRMNPPVPGGFRVALKTFELG 195 (0)
100 GYQIPKGWNVIYSICDTHDVAEIFP 26 (frameshift)
24 NKEDFQPE 1 end of Scaffold_12575
RFMMKNCGDSSRFQYIPFGGG
srmcvgkefakvllkiflveltqhcnwilsngpptmktgptiypvdnlptkftsyvrn
CYP26A1 Xenopus laevis
GenEMBL AF057566
Hollemann,T., Chen,Y., Grunz,H. and Pieler,T.
Regionalized metabolic activity establishes boundaries of retinoic
acid signalling.
EMBO J. 17, 7361-7372 (1998)
CYP26B1 human
GenEMBL AC007002
Nelson, D.R. A second CYP26 P450 in humans and zebrafish: CYP26B1
Archives of Biochemistry and Biophysics 371, 345-347 (1999)
MLFEGLDLVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKSCKLPIPKGSMGFPLIGETGHWLLQ
GSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILMGEHHLVSTEWPRSTRMLLGPNTVSNS
IGDIHRNKRKVFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQ
KLTFRMAIRVLLGFSIPEEDLGHLFEVYQQFVDNVFSLPVDLPFSGYRR
GIQARQILQKGLEKAIREKLQCTQGKDYLDALDLLIESSKEHGKEMTMQELKDGTLELIF
AAYATTASASTSLIMQLLKHPTVLEKLRDELRAHGILHSGGCPCEGTLRLDTLSGLRYLD
CVIKEVMRLFTPISGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDP
DRFSQARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVLAVELASTSRFELATRTFPRI
TLVPVLHPVDGLSVKFFGLDSNQNEILPETEAMLSATV
CYP26B1 Fugu rubripes (pufferfish)
No accession number
Scaffold_4267
74% to 26B1 human
8448 MLFDSFDLVSALATLAACLVSMALLLAVSQQLWQLRWTATRDRNCKLPMPKGSMGFPFIGETCHWLLQ 8651
17930 GSGFHASRRQKYGNVFKTHLLGRPLIRVTGAENIRKVLMGEHTLVTVDWPQSTSTLLGPNSLA 18118
18119 NSIGDIHRKKRK 18154
1487 VFAKVFSHEALESYLPKIQQVIQESLRVWSSNPEPINVYR 1606 (2)
1689 ESQRLSFTMAVRVLLGFRVSEEEMKHLFSTFQDFVDNLFSLPIDLPFSGYRK 1844 (0)
1921 GIRARDTLQKSIEKAIREKPLCSQGKDYSDALDVLMESAKENGSELTMQELK 2076 (0) exon 4
2851 ESTIELIFAAFATTASASTSLIMQLLRHPPVLERLREELRARGL exon 5,6 fused
LHNGCLCPEGELRLDTIVSLKYLDCVIKEVLRLFTPVSGAYRTAMQTFELD 3135 (0) exon 5,6 fused
3271 GVQIPKGWSVMYSIRDTHDTSTVFKDVDVFDPDRFSQERGEDKEGRFHYLPFGGGVRSCLGKQLA exon 7
TLFLRILAIELASTSRFELATRQFPRVITVPVVHPVDGLKVKFYGLDSNQNEIMAKSEELLGAAV* 3663 exon 7
CYP26C1 human
GenEMBL AL358613.11 May 2, 2001
522 amino acids, 6 exons, (0) = phase 0 intron
52% to 26B1 human, also 15 amino acid insertion in exon 5 vs. 26B1
MFPWGLSCLSVLGAAGTALLCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFGETLHWLVQ (0)
GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQSAHILLGSHTLLGAVGEPHRRRRK (0)
VLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVSVYDASKALTFRMAARILLGLRL
DEAQCATLARTFEQLVENLFSLPLDVPFSGLRK (0)
GIRARDQLHRHLEGAISEKLHEDKAAEPGDALDLIIHSARELGHEPSMQELK (0)
ESAVELLFAAFFTTASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGGSEGPPPD
CGCEPDLSLAALGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD (0)
GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGAAREDSRGASSRLHYIPFGGGARSCLG
QELAQAVLQLLAVELVRTARWELATPAFPAMQTVPIVHPVDGLRLFFHPLTPSVAGNGLCL*
CYP26C1 mouse
GenEMBL AC110212.1
84% to 26C1 human exon 5
ELAVELLFAAFFTTASASTSLILLLLQHPAAITKIQQELSAQGLGRACTCTPRASGSPPDCGCEPDLSLAMLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD
CYP26C1 Fugu rubripes (pufferfish)
No accession number
Scaffold_11741
Equally similar to 26B1 and 26C1 human
But C-terminal is 68% to 26C1 while 58% to 26B1
Lower case region very poor match may not be correct exon structure here.
6362 MLGLVSALATALTTLLLLLLLLALTRQLWSFRWSLTRDRRCELPLPKGSMGWPLVGETFQWLFQ 6171 (0)
5571 GSNFHISRRKRHGNVFKTHLLGKPLVRVTGAENIRKILLGEHSLVCTQWPQSTRIILGPN 5392
5391 ALVNSIGELHKRKRK 5347 (0)
4963 ILAKVFSRKALESYLPRLQEVIKCEIAKWCAEPGSVDVYAATRSLTFRIAIGVLLGLHL 4781
4780 EEERIDYLAQIFGQLMSNLFSLPIDAPFSGLRK (0)
3972 GIKARKILHANMEKIIEKKMERQQEEEEYRDAFDYMLSTSKEQGQQISIQELK 3814 (0)
3581 ETAVELIFAAHSTTASAATSLVLQLLHHPEVVERVRVELEAQKLcynslnlpsqa 3417 (1)
3377 ctfpqsqchasnLSLDKLNQLHYIDCVIKEVLRFLPPVSGGYRTALQTFELD 3222 (0)
2770 GYQIPKGWTVMYSIRDTHETAEIFQNPELFDPDRFVTAQVESRSSRFSYVPFGGGVR 2600
2599 SCVGKELAQIILKTLTIELIRTCKWTLATEKFPKMQTVPIVHPVNGLHVNFMYKNLHEIDH* 2414
27A Subfamily
CYP27A1 human
Swiss Q02318 (531 amino acids)
Cali J.J., Russell D.W.
Characterization of the human sterol-27-hydroxylase: A mitochondrial
P450
that catalyzes multiple oxidation reactions in bile acid
biosynthesis.
J. Biol. Chem. 266, 7774-7778 (1991)
CYP27A1 human
GenEMBL X59812 (2107bp)
Guo,Y., Strugnell,S., Back,D.W. and Jones,G.
Transfected human liver cytochrome P-450 hydroxylates vitamin D
analogs at different side-chain positions
Proc. Natl. Acad. Sci. U.S.A. 90, 8668-8672 (1993)
CYP27A1 human
NM_000784
MAALGCARLRWALRGAGRGLCPHGARAKAAIPAALPSDKATGAP
GAGPGVRRRQRSLEEIPRLGQLRFFFQLFVQGYALQLHQLQVLYKAKYGPMWMSYLGP
QMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDQHDLTYGPFTTEGHHWYQLRQA
LNQRLLKPAEAALYTDAFNEVIDDFMTRLDQLRAESASGNQVSDMAQLFYYFALEAIC
YILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPVLPFWKRYLDGWN
AIFSFGKKLIDEKLEDMEAQLQAAGPDGIQVSGYLHFLLASGQLSPREAMGSLPELLM
AGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHMPLLKAVLKE
TLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPTAFSEPESFQPHRWL
RNSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPETGEL
KSVARIVLVPNKKVGLQFLQRQC
CYP27A1 rat
GenEMBL M73231 (2300bp)
Shayiq,R.M. and Avadhani,N.G.
Sequence complementarity between the 5'-terminal regions of mRNAs
for
rat mitochondrial cytochrome P-450c27/25 and a growth hormone-
inducible
serine protease inhibitor: a possible gene overlap.
J. Biol. Chem. 267, 2421-2428 (1992)
CYP27A1 rat
GenEMBL U17363, U17369 to U17376 genomic sequence
Mullick, J., Addya,S., Sucharov,C. and Avadhani,N.G.
Localization of a transcription promoter within the second exon of
the cytochrome P-450c27/25 gene for the expression of the major
species of two-kilobase mRNA.
Biochemistry 34, 13729-13742 (1995)
CYP27A1 rabbit
PIR A90152 (21 amino acids)
Dahlbaeck, H.
Characterization of the liver mitochondrial cytochrome P-450
catalyzing the 26-hydroxylation of 5beta-cholestane-3alpha,
7alpha,12alpha-triol.
Biochem. Biophys. Res. Commun. 157, 30-36 (1988)
PIR A90155 (21 amino acids)
Dahlbaeck, H.
Biochem. Biophys. Res. Commun. (1989) 159:370
CYP27A1 pig
no accession number
Kjell Wikvall
77% identical to human
CYP27A1 Fugu rubripes (pufferfish)
No accession number
Scaffold_3437
46% to 27A1 missing N-terminal exon upstream of scaffold
136 (0) ILEKGRYGPIYRNGMNAVSVSTAKLLGEVLRNDDKFPNRGDMSIWKEYRDLRGYGYGPFTE 321 (2)
536 KDERWYNLRAVLNKRMLRPKDALQYGDTIGEVVTDFIRRIYFLRQRSPTGDVVTDLNNELYHFSLE 733 (1)
816 AIASILFETRLGCLEEEIPTGTQDFINAISQMFSNNFQVFLMPKWSRGVLPYWRRYVAGWDGIFSF 1013 (1)
1206 ATRLIDRKMEFIQQHLDNNQNVEGEYLTYLLSNTQMSIKDVYGSVSELLLAGVDT 1370 (0)
1465 TSNTLTWTLHLLSKYPQCQEILFKEVSTSVPADRAPSAEEVTRMPYLRAVVKESLR 1632 (2)
1756 MFPVIPMNGRILADKDVMIGGYQFSKN 1836 (0)
1949 TAFNFSHYAIGRDEDTFPEPATFMPERWLQDSHNRPNAFGAIAFGFGVRGCVGRRIAELEMYSFLCH 2149 (0)
2308 LMRHFEIKPDPKMGELKSVCRTVLIPDKPVSLRFLDRGSGHAA* 2439
CYP27A2 Fugu rubripes (pufferfish)
No accession number
Scaffold_697
53% to 27A1 mouse
29906 MASFTALRCAAIGARNSALRPATLPSRNLNLQATSEAANLKGIADLPGPNTYKILYWLFVKGYGERSHLLQ 30118 (0)
30735 GKLKNIYGPMWRWKLGPYDFVSVASPELIARVIQQEGRYPVRVQLPHWKEYRDLRGQAYGLHVE 30926 (2)
31027 TGPEWSRLRSALKPRMLKLREVVALSPDESR 31119 frameshift EVDGDLL
29 aa of this exon missing in region of poor seq and a small sequence gap
31334 GISAILFETRLGCLGEKVDPNVQRFISGVNDMLSLSDITYLFPRWTRSFVPVWKRFAQAWDDISDV 31531 (1)
31614 ASSLIDRRIAEIDARVANGQSVEGLYLTYLLSSDKMSRAEISTCITDLLLGGVDT 31778 (0)
32763 TSNTLSWALYHLAKDPVAQDRLYDEVNSVCPNHHQPTTDDLANMPFLKAVIKEVLR 32930 (2)
33004 LYPVVHQNARFISENDVILNDYWFPKK (0)
TQFHLCHYSVCHDETQFKHAERFLPERWLRHSAPLSGYYQHHPYSFIPFGVGVRACVGKRVAELEMXXXXXX 33348
this exon runs into a sequence gap
YFALTR (0) this is a Fugu seq for the end of the upper exon that is no
longer in the databases. The last exon is missing
CYP27A3 Fugu rubripes (pufferfish)
No accession number
Scaffold_6002 Length = 16767
= LGS139924.x1 57% to 27A1 I-helix
= LGS125183.x1 Cyp27a1
First 16 aa and 49-81 supported by EST from AU050037 Paralichthys olivaceus
aa 49 on also supported by AW343479 zebrafish EST
MFRNRLLTVGLRASVPHREGLHRTAVNYAGARRRHASSAATEITEHNVR
QKTMEDLGGPSFLTTLNWLFLKGYLPKTQQMQ (0)
4619 VEHSKIYGPLWKSKYGPMVVVNVASADLIEQVLRQEGRHPVRTDMPHWRRYRALRNQAHGPLTE 4428
exons 3 and 4 in a sequence gap
1891 AEQMVQKKMEEIQNKVDLHQDVEGAYLTHLLLSEKMTVTEILGSITELLLAGVDT 1715
1661 TSNTISWALYQLAQNPSIQDQLYHEVRSVCPGNKMPDSDDIAQMPYLKAVIRETLR
LYPVVPGNARVTVDKEIVVGGYLFPKQ (0)
754 TLFHLCHYCVSHDENIFPNSRVFQPERWLRGREEKSKQHPFGSVPFGFGVRACLGRRV 581
580 AELEMYLLLSRVRAVQTGLTAAGEGNQEVL*
>CYP27A fragment a Fugu rubripes (pufferfish)
No accession number
a may be exon 4 of 27A3
LPC42076.x1 39% to 27A1 202-276 79% to LPC42075.x1 gene duplication? Exon 4
GISSVLFESRMGCLNDEVPEETQKFIYSVGEMFRLSAVVVLFPQSVWPYLPLWKKFVAAWDYLFKV
>CYP27A fragment b Fugu rubripes (pufferfish)
No accession number
may be exon 4 of 27A3
LPC42075.x1 35% to 27A1 79% to LPC42076.x1 gene duplication? Exon 4
GISSVLFESRLGCLNDEVPEETRRVIYSVGEMCRLSAVVVLFPQSGWPYLPVWARLGAAGDYLLQF
27B Subfamily
CYP27B1 rat
GenEMBL AF000139
St-Arnaud,R., Messerlian,S., Moir,J.M., Omdahl,J.L. and
Glorieux,F.H.
The 25-hydroxyvitamin D 1-alpha-hydroxylase gene maps to the
pseudovitamin D-deficiency rickets (PDDR) disease locus
J. Bone Miner. Res. 12, 1552-1559 (1997)
previously named CYP40. This sequence has errors in it.
CYP27B1 rat
GenEMBL AB001992
Cloning and expression of rat 25-hydroxyvitamin D3-1-alpha-
hydroxylase cDNA .
Shinki, T., Shimada, H., Wakino, S., Anazawa, H., Hayashi, M.,
Saruta, T., DeLuca, H.F. and Suda, T.
Proc. Natl. Acad. Sci. USA 94, 12920-12925 (1997)
Note: this sequence has been called CYP27B1 in this paper. The name
CYP40 was
given in May 1997 based on the sequence from John Omdahl that was
not
completely accurate at the time of submission for a name. It was
necessary to
revise the name CYP40 to CYP27B1.
CYP27B1 mouse
GenEMBL AB006034
Takeyama, K-i., Kitanaka, S., Sato, T., Kobori, M., Yanagisawa, J. and
Kato, S.
25-hydroxyvitamin D3 1 alpha hydroxylase and vitamin D synthesis.
Science 227, 1827-1830 (1997)
CYP27B1 human
GenEMBL AB005989 cDNA sequence
Takeyama,K., Kitanaka,S., Sato,T., Kobori,M., Yanagisawa,J. and
Kato,S.
25-Hydroxyvitamin D3 1alpha-hydroxylase and vitamin D synthesis
Science 277, 1827-1830 (1997)
CYP27B1 human
GenEMBL AB005990 gene sequence
Murayama,A., Kitanaka,S., Takeyama,K. and Kato,S.
Human 25-hydroxyvitamin D3 1alpha-hydroxylase gene
Unpublished (1997)
CYP27B1 human
GenEMBL AB005038 cDNA sequence AB006987 gene sequence
Monkawa,T., Yoshida,T., Wakino,S., Shinki,T., Anazawa,H.,
Deluca,H.F., Suda,T., Hayashi,M. and Saruta,T.
Molecular cloning of cDNA and genomic DNA for human
25-hydroxyvitamin D3 1alpha-hydroxylase
Biochem. Biophys. Res. Commun. 239, 527-533 (1997)
CYP27B1 human
GenEMBL AF020192 cDNA sequence
Fu,G.K., Lin,D., Zhang,M.Y., Bikle,D.D., Shackleton,C.H.,
Miller,W.L. and Portale,A.A.
Cloning of human 25-hydroxyvitamin D-1alpha-hydroxylase and
mutations causing vitamin D-dependent rickets type 1
Mol. Endocrinol. 11, 1961-1970 (1997)
CYP27B1 human
GenEMBL AF0027152 gene sequence
Fu,G.K., Portale,A.A. and Miller,W.L.
Complete structure of the human gene for the vitamin D 1alpha-
hydroxylase,
P450c1alpha.
DNA Cell Biol. 16, 1499-1507 (1997)
CYP27B1 human
NM_000785
MTQTLKYASRVFHRVRWAPELGASLGYREYHSARRSLADIPGPS
TPSFLAELFCKGGLSRLHELQVQGAAHFGPVWLASFGTVRTVYVAAPALVEELLRQEG
PRPERCSFSPWTEHRRCRQRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAGTLNN
VVCDLVRRLRRQRGRGTGPPALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDT
ETFIRAVGSVFVSTLLTMAMPHWLRHLVPGPWGRLCRDWDQMFAFAQRHVERREAEAA
MRNGGQPEKDLESGAHLTHFLFREELPAQSILGNVTELLLAGVDTVSNTLSWALYELS
RHPEVQTALHSEITAALSPGSSAYPSATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPD
KDIHVGDYIIPKNTLVTLCHYATSRDPAQFPEPNSFRPARWLGEGPTPHPFASLPFGF
GKRSCMGRRLAELELQMALAQ
ILTHFEVQPEPGAAPVRPKTRTVLVPERSINLQFLDR
CYP27B1 Fugu rubripes (pufferfish)
No accession number
Scaffold_470
52% to 27B1 human
MLQQALRVSCRSASPLVKWMERWAECASARPQAVKPLGDMPGPSVASFAWDLFAKRGLSRLHELQ (0)
LEGVRRYGPMWKASFGPILTVHVADPALIEQVLRKEGQHPMRSDLSSWKDYRRLRGHHYGLLTS (2)
51430 EGEEWQSIRSLLGKHMLRPKAVEAYDQTLNSVVDDLITKLRLRRSSQGLVTDIASEFYRFGLE 51630 (1)
51726 GVSSVLFESRIGCLDKIVPEETERFIQCINTMFVMTLLTMAMPSWMHQLFPKPWNVFCQCWDYMFDF (1)
AKGHIDQRMAAEAEKIARGEEVEGRYLTYFLSRTSLPMKTVYSNVTELLLAGVDT (0)
52280 ISSTLSWSLYELSRHQAVQASLREEVLSVLGGRRVPTAADVAQMPLLKATIKEVLR 52444 (2)
52527 LYPVIPANARVITERDIQVGGYLIPKN 52610 (0)
52697 TLITLCHYATSRDPAVFPRPDEFLPQRWLNKEQSHHPYASVPFGVGKRSCIGRRIAELELYLAVAR 52894 (0)
53237 ILLEFDIKPDPEGISVKPMTRTLLVPENVINLQFTER* 53347
CYP27C1 human
GenEMBL AC027142
BM562765 BI459427 ESTs
43% identical to 27A1 assembled gene
intron starting with QIH ending in VDT is from Celera's data
CRA_Gene|hCG42613 /len=10487. This Celera sequence is still missing the C-terminal. Probable last exon is now found in AC027142. AG Intron boundary is in the same location as CYP26B1. Stop codon is one codon away from 26B1s stop codon. Length is preserved from cys to intron. (n) = intron phase, 9 exons
1 85452 MQTSAMALLARILRAGLRPAPERGGLLGGGAPRRPQPAGARLPAGARAEDKGAGRPGSPPG 85634 61
62 85635 GGRAEGPRSLAAMPGPRTLANLAEFFCRDGFSRIHEIQ (0) 85748 99
100 39574 QKHTREYGKIFKSHFGPQFVVSIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISA (2) 39371 163
164 43984 EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSME (1) 43787 229
230 41743 GVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFC 41564 290
291 41563 RSWDGLFKFS 41534 300 (1)
301 QIHVDNKLRDIQYQMDRGRRVSGGLLTYLFLSQALTLQEIYANVTEMLLAGVDT (0) 354 (Celera sequence)
355 110201 TSFTLSWTVYLLARHPEVQQTVYREIVKNLGERHVPTAADVPKVPLVRALLKETLR (2) 110034 410
411 108566 LFPVLPGNGRVTQEDLVIGGYLIPKG (0) 108489 436
437 108006 TQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAELEIHLVVIQ (0) 107794 504
505 102503 LLQHFEIKTSSQTNAVHAKTHGLLTPGGPIHVRFVNRK* 102619 542
CYP27C1 Fugu rubripes (pufferfish)
No accession number
Scaffold_1410
75% to 27C1 human
MSVMNKLTTTCWTNFYGDRRNKQMLFVLRCLHKSATSGTFGVAREEPLPERLITSTDATKKRLPKT
19233 LAEMPGPGTISNLFEFFWRDGFSRIHEIQ 19319 (0)
20076 IEHSKMYGKIFKSRFGPQLVVSVADRDLVAEVLRAE
GVAPQRANMESWHEYRDMRRRSTGLISA 20261 (2)
EGEDWLRMRSVLRQLIMRPRDVAVFSDDVSEVVDEVVDDLIKR
IVCLRSQSSDGTTICNINDLFFKYAME (1)
20741 GIAAILYECRLGCLSQKIPQETEDYIDALHLMFSSFKTTMYAGAIPKWLRPV 20911
20912 FPKPWEEFCDSWDGLFRF 20968 (1)
SVHVDKRLKQIESQLQRGEKVTGGLLTYMLVAKEMSVEEIYANVTEMLLAGvdt (0)
23004 TSFTLSWASYLLARHPDVQQQIHAEVMRVLGSEKVATAEDVQHLPFIRGLVKETLR 23183 (2)
LFPVLPGNGRITQDDMVLGGYFIPKG
23623 TQLALCHYSTSLDDENFPSSLEFRPDRWIRKHSSDRLDNFGSIPFGYGIRSCIGKRI 23793
23794 AELEMHLALIR (0)
23998 IIQKFHVCVSPLTTDVKAKTHGLLCPGAPINLQFIDREI* 24117
CYP27C1 Silurana tropicalis (frog)
GenEMBL BQ392731 AL629634 AL595312
74% to human 27C1 not counting the divergent N-terminal
87% to Xenopus 27C1
MAALGQLLRGSARLEGLARSFHRFPGAQAAGQALEHEQAEGVLGATVKGSPMVKN
LKEMPGPSTMANLVEFFWRDGFGRIQEIQQKHARQYGRIFKSHFGPQFVVSIADKDMVAQ
VLRAERDAPQRANMESWHEYRELRGRSTGLISAEGEKWLNMRSVLRQKILRPRDVAMYSG
GVNEVVEDLVKRIRKLRVQESDGLTVTNVNDLYFKYSMEAIATILYECRLGCLDDQIPQQ
TKEYIEALELMFSMFKTTMYAGAIPKWLRPLIPKPWREF
CRSWDGLFKFSQIHVDDRLRQIESQLEKGEEVQGGVLTHLLLSKELDLEEIYA
NMTEMLLAGVDTTSFTLSWATYLLAKNPGIQEAVYQQIVQNFGKDQVPTAEDVPKMPLVRAVVKETL
RLFPVLPGNGRVTQDDLVVGGYFIPKGTQLALCHYSTSYDAECFPAAEEFRPERWIRSGN
LERKENFGSIPFGYGIRSCIGRRVAELEMHLLLIQLLQ
NLEIKPSPQTTTVLPKTHGLLCPGGKINVRFVDRQ*
CYP27C1 Xenopus laevis (African clawed frog)
GenEMBL BJ086834 CA982419 BQ385474 BJ069739
87% to Silurana 27C1
MAVLGHLLKGSARLEGLARGFHQFPKIQAAGQALEQEQAEGEL
GARAKEAPMM KSLKDMPGPSTLANLVEFFWRDGFGRIHEIQQKHTRQYGRIFKSHFGPQF
VVSIADKDLVAQVIRAERDAPQRANMESWHEYRELRGRSTGLISAEGEKWLNMRSVLRQK
ILRPRDVAMYTGGVNEVIGDLVKKIHKLRAQESDGLTVTNVNDLYFKYSMEAIATVLYEC
RLGCLDDIIPQQTKEYIAALELMFSMFKTTMYAGGLPKWLRHHNFPNPWKEF
53aa gap see Silurana seq
HMTEMLLAGVDTTSFTLSWATYLLAKNPSIQESVYQQIVQNLGKDQVPTAEDVTKIPMVR
AVVKETLRLFPVLPGNGRVTQDDLVLDGYLIPKGTQLALCHYSTSYDDKCFPGAEEFKPE
RWIRSGYLERKENFGSIPFGNGIRSCIGRRVAELEIHLLLIQL
LQNFEIKLSPETMTVLPKTHGLLCPGEKINVRFLDRQ*
CYP27C1 Gallus gallus (chicken)
GenEMBL BU235126 BU209688
Partial seq missing N and C-terminals
78% to Silurana 27C1 82% to human 27C1
PGLSGSQSLEVRSGAENKAARPGELLEPSPQLGRV KSLHEMPGPNTLYNLYEFFWKDGFGRIHE
IQQKHTQEYGKIFKSHFGPQFVVSIADRDMVAQVLRSEGRAPQRANMESWQEYRDLRGR
ATGLISAEGEQWLKMRSVLRQKILKPKDVAVYSGGVNEVITDLIKRIYTLRSQEEDGETV
TNVNNLFFKYSMEGVATILYECRLGCLENNVPQQTVEYIEALELMFSMFKTTMYAGAIPR
WLRPFIPKPWREFCRSWDGLFKFSQIHVDNKLKSIQS
QLDQGEEVNGGLLTYLLVSKELTLEEIYA
NMTEMLLAGVDTTSFTLSWAIYMLAKHPEVQQRVYEEIINKLGKDQAPVNRDVPKLPLIR
AVLKETLR
CYP27C1 mouse and rat
Mouse is missing this sequence in blast searches. In humans it lies between
BIN1 and ERCC3 on chr 2. This region does not contain 27C1 in mouse (chr 18)
unless there is a sequence gap. Blast of the rat genome
(Rat Genome Sequencing Consortium Assembly Posted date: Dec 4, 2002)
also had no hits, so 27C1 may have been lost in rodent evolution.
It is present in human, Fugu, Xenopus, Silurana (another frog) and chicken.
The absence of 27C1 may be a useful synapomorphy for comparing rodents to other mammals.
28A Subfamily
CYP28A1 Drosophila mettleri
GenEMBL U89746 (1895bp)
Danielson,P.B., MacIntyre,R.J. and Fogleman,J.C.
Molecular cloning of a family of xenobiotic-inducible drosophilid
cytochrome p450s: evidence for involvement in host-plant
allelochemical resistance.
Proc. Natl. Acad. Sci. U.S.A. 94, 10797-10802 (1997)
note: 369mt, full length, one of 109 seqs submitted to Nomenclature
committee
CYP28A2 Drosophila mettleri
GenEMBL U89747 (1764bp)
Danielson,P.B., MacIntyre,R.J. and Fogleman,J.C.
Molecular cloning of a family of xenobiotic-inducible drosophilid
cytochrome p450s: evidence for involvement in host-plant
allelochemical resistance.
Proc. Natl. Acad. Sci. U.S.A. 94, 10797-10802 (1997)
note: 43mt, full length, one of 109 seqs submitted to Nomenclature
committee
CYP28A3 Drosophila nigrospiracula (a desert dwelling Drosophilid)
GenEMBL U91565 (451bp)
Danielson,P.B., MacIntyre,R.J. and Fogleman,J.C.
Molecular cloning of a family of xenobiotic-inducible drosophilid
cytochrome p450s: evidence for involvement in host-plant
allelochemical resistance.
Proc. Natl. Acad. Sci. U.S.A. 94, 10797-10802 (1997)
54 amino acids
CYP28A4 Drosophila hydei (a desert dwelling Drosophilid)
GenEMBL U91566 (366bp)
Danielson,P.B., MacIntyre,R.J. and Fogleman,J.C.
Molecular cloning of a family of xenobiotic-inducible drosophilid
cytochrome p450s: evidence for involvement in host-plant
allelochemical resistance.
Proc. Natl. Acad. Sci. U.S.A. 94, 10797-10802 (1997)
54 amino acids
Cyp28a5 Drosophila melanogaster
GenEMBL AC018242 6065-8082 also AC001660
CYP28B1 Musca domestica (housefly)
no accession number
Nannan Liu
44% identical to 28A1 over 498 amino acids
submitted to nomenclature committee 7/29/99
Cyp28c1 Drosophila melanogaster
GenEMBL AC014191 comp(8254-10002) also AL133495
Cyp28d1 Drosophila melanogaster
GenEMBL AC017780 comp(28385-30262) also AC009355 AC008324 AC008327
Cyp28d2 Drosophila melanogaster
GenEMBL AC017780 comp(31545-33456) AC008324
29A Subfamily
CYP29A1 Caenorhabditis elegans (nematode worm)
GenEMBL Z69787 and AL022276 (Y102F5)
C44C10.2 (with modifications)
note: GeneFinder translation is incorrect in some regions.
First exon probably ends at 11181 coding for 35 amino acids as does
the first exon
of CYP29A2.
There is probably a sequence error in the stop codon at 11155-11157.
2nd exon probably runs from 11227-11293.
11400-11518 is the third exon.
12465 to 12979 should be one exon.
There is probably a sequence error in the stop codon at 12879-12881.
some segments around I-helix missed by genefinder. N-terminal of
genefinder translation is probably a different gene. Matches
F23A7.3, but this gene
does not have a P450 sequence downstream.
CYP29A2 C. elegans
GenEMBL Z74043
T19B10.1
nearly the whole gene of wEST00713, CEMSH91R, missing 5' end
CYP29A2 C. elegans
GenEMBL Z74040
T19B10.1
5' end of gene from Z74043
CYP29A3 C. elegans
no accession number
Y108G3 contig 541
CYP29A4 C. elegans
GenEMBL Z99102
B0331.1
30A Subfamily
CYP30 Mercenaria mercenaria (northern quahog, a clam)
GenEMBL AF014795 (1628bp)
Brown, D., Clark,G.C. and Van Beneden,R.J.
A novel cytochrome P450 from the clam, Mercenaria mercenaria
Unpublished
31A Subfamily
CYP31A1P C. elegans
GenEMBL Z68213
C01F6.3
missing 25 amino acids at C-helix plus in frame stop codon at ERK*
CYP31A2 C. elegans
GenEMBL Z68336 and Z92789
F22B3 and H02I12
This gene is definitely different than CYP31A3, there are 8 amino
acids differences
and the introns are divergent.
CYP31A3 C. elegans
no accession number
T16C6 this sequence revised according to Y17G9.contig 61
cosmid T16C6 lies 3' of E04A4 and 5' of R11E3. Y17G9 contig 61 covers
this
region. E04A4 ends at 29981 of Y17G9 contig 61 R11E3 starts at 49037
of Y17G9
contig 61 this sequence lies between 41545-45037
CYP31A4P C. elegans
no accession number
Pseudogene related to CYP31A sequences 44849-45028 of Y17G9.contig61
C-TERMINAL exon fragment, This sequence is inside the last intron of
CYP31A3
32A Subfamily
CYP32 C. elegans
GenEMBL U53148
C26F1.2
33A Subfamily
CYP33A1 C. elegans
GenEMBL U55365
C12D5.7
33B Subfamily
CYP33B1 C. elegans
GenEMBL U50311
C25E10.2
33C Subfamily
CYP33C1 Caenorhabditis elegans (nematode worm)
GenEMBL AF039053
C45H4.a near 35k
CYP33C2 Caenorhabditis elegans (nematode worm)
GenEMBL AF039053
C45H4.b near 38k
also on Y10C10 contig 60. Probable end of 33C2 gene is on contig 57,
but the intron ends of these two contigs do not overlap yet.
CYP33C3 Caenorhabditis elegans (nematode worm)
GenEMBL AF016676
F41B5.b 5k to 7k
CYP33C3 Caenorhabditis elegans (nematode worm)
GenEMBL D35162
EST 5 prime read with heme signature
3 prime fragment of this clone yk18a10 = D32508 (not in coding region)
this is a portion of clone YK824
CYP33C4 Caenorhabditis elegans (nematode worm)
GenEMBL AF016438
F44C8 whole gene of EST CEL10E1
CYP33C4 Caenorhabditis elegans
GenEMBL M88882 (501bp)
EST CEL10E1
2nd frame contains PPGP and KKYG from N-terminal region of
many P450s.
CYP33C5 Caenorhabditis elegans (nematode worm)
GenEMBL AF016676
F41B5.c
CYP33C6 Caenorhabditis elegans (nematode worm)
GenEMBL AF016676
F41B5.e
CYP33C7 Caenorhabditis elegans (nematode worm)
GenEMBL AF016676
F41B5.d
CYP33C8 Caenorhabditis elegans (nematode worm)
GenEMBL AF003385
R08F11
CYP33C9 Caenorhabditis elegans (nematode worm)
GenEMBL AF016449
C50H11
CYP33C10P Caenorhabditis elegans (nematode worm)
GenEMBL AF016676
F41B5.a
contains first two exons split by 3700bp and no C-terminal sequence.
CYP33C11 Caenorhabditis elegans (nematode worm)
no accession number
Y49C4 contig 103
33D Subfamily
CYP33D1 Caenorhabditis elegans (nematode worm)
GenEMBL Z92804
K05D4.4 (previously found on F10A3 and F11A5 at early stages of
sequencing)
CYP33D2X Caenorhabditis elegans (nematode worm)
GenEMBL Z92830
F11A5
this sequence is really the same gene as CYP33D1
CYP33D3 Caenorhabditis elegans (nematode worm)
GenEMBL Z81487 (C54E10), AL021470 (Y17D7A.4) and Z98877 (Y69H2)
GenEMBL AL020988 (Y80D3)
33E Subfamily
CYP33E1 Caenorhabditis elegans (nematode worm)
GenEMBL U61945
C49C8.4
CYP33E2 Caenorhabditis elegans (nematode worm)
GenEMBL U61952
F42A9.5
CYP33E3P Caenorhabditis elegans (nematode worm)
GenEMBL U61952
F42A9.4
missing C-terminal 3' neighbor is C49C8 (with CYP33E1)
34A Subfamily
CYP34A1 Caenorhabditis elegans (nematode worm)
GenEMBL Z81119 and AL022301 (Y75B12)
T10H4.10
There are two P450s listed in this clone as one gene
This is the first one
CYP34A2 Caenorhabditis elegans (nematode worm)
GenEMBL Z81119 and AL022301 (Y75B12)
T10H4.11
There are two P450s listed in this clone as one gene
This is the second one
CYP34A3 Caenorhabditis elegans (nematode worm)
GenEMBL Z81047 and AL022301 (Y75B12)
C41G6.1
CYP34A4 Caenorhabditis elegans (nematode worm)
GenEMBL AF068712
T09H2.1
CYP34A5 Caenorhabditis elegans (nematode worm)
GenEMBL AF039050
B0213.10
CYP34A6 Caenorhabditis elegans (nematode worm)
GenEMBL AF039050
B0213.11
CYP34A7 Caenorhabditis elegans (nematode worm)
GenEMBL AF039050
B0213.12
CYP34A8 Caenorhabditis elegans (nematode worm)
GenEMBL AF039050
B0213.14
CYP34A9 Caenorhabditis elegans (nematode worm)
GenEMBL AF039050
B0213.15
CYP34A10 Caenorhabditis elegans (nematode worm)
GenEMBL AF039050
B0213.16
35A Subfamily
CYP35A1 Caenorhabditis elegans (nematode worm)
GenEMBL U97008
C03G6 near 19k
CYP35A2 Caenorhabditis elegans (nematode worm)
GenEMBL U97008
C03G6 near 22k
CYP35A3 Caenorhabditis elegans (nematode worm)
no accession number
K09D9
CYP35A4 Caenorhabditis elegans (nematode worm)
GenEMBL AF016418
C49G7.8
CYP35A5 Caenorhabditis elegans (nematode worm)
GenEMBL AF039049
K07C6.5 [and part of F40C5 contig 6 9-578 plus strand Note:
newest version of
F40C5 does not contain this P450. probably an error corrected by the
sequencers]
NOTE: K07C6 IS NOW IN GENBANK AND IT CONTAINS 4 P450S
It is part of a contig with T09H2 and B0213 that has 13 P450s in a
cluster
6 P450s on B0213 appear to have inserted themselves into a cluster
of
three olfactory receptor-like genes. K09D9 is the next clone 3' and
it has at least
one more P450. C49G7 is next with one more P450
Contig order K07C6 T09H2 B0213 K09D9 C49G7
35B Subfamily
CYP35B1 Caenorhabditis elegans (nematode worm)
GenEMBL AF039049
K07C6.4, [F40C5 contigs 5 and 14 Note: newest version of F40C5 does
not
contain this P450. probably an error corrected by the sequencers]
CYP35B2 Caenorhabditis elegans (nematode worm)
GenEMBL AF039049
K07C6.3 [and F40C5 contig 15 Note: newest version of F40C5 does not
contain
this P450. probably an error corrected by the sequencers]
CYP35B3 Caenorhabditis elegans (nematode worm)
GenEMBL AF039049
K07C6.2
35C Subfamily
CYP35C1 Caenorhabditis elegans (nematode worm)
GenEMBL Z77652
C06B3.3
35D Subfamily
CYP35D1 Caenorhabditis elegans (nematode worm)
GenEMBL Z83105
F14H3 near 20k
CYP35D2P Caenorhabditis elegans (nematode worm)
GenEMBL Z83105
F14H3 near 14k
only N-terminal is present
36A Subfamily
CYP36A1 Caenorhabditis elegans (nematode worm)
GenEMBL Z83220
C34B7.3
37A Subfamily
CYP37A1 Caenorhabditis elegans (nematode worm)
GenEMBL Z81493 (F01D5) and Z92851 (Y39G8)
37B Subfamily
CYP37B1 Caenorhabditis elegans (nematode worm)
GenEMBL Z93381 and Z93389
F28G4 and T13F3
CYP38A1 Suberite domuncula (sponge)
GenEMBL Y17816 (1789bp)
Mueller,W.E.G., Wiens,M., Batel,R., Steffen,R., Borojevic,R. and
Custodio,M.R.
Establishment of a primary cell culture from a sponge: Primmorphs
from Suberites domuncula
Mar. Ecol. Prog. Ser. In press
most similar to the CYP4 family
MLDFVIFAITAVAGLIGILLFFYFSRSTETKPVSSASPTSTIPR
WSAPPADIEKGDLDVMMKKHGSLHQFLLHLHDNGKTPVTSFWWGKTHVVSFCSPQAFK
ESAVFVNRPVELFVGFEPLITPFSIQYANDEDWVQRSKCLYHTLKGDDLKSYFHHFVQ
IAQEEESLWSSYTSDKEVSLTKEVFPMTIKGIARTCFGDIFKDENELSKMAESYHVCW
RTMEEGVPEAGSKRETEFLKHRRVLEDIIRRIIQERKEGEDLQELPFIDSMLQNYDSE
DKIIADAISFMVGGFHTSGYMFTWMLWYLSSHPESQDRLRTEIERETGGERGDRLKEY
SLRADTFLRQVQDETIRLSTLAPWAARYSDKKVTVCGYTIPAKTPMIHALGVGLKNKT
VWENTDSWDPDRFSPNGRRGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRFEIV
PVEGQTVIQVHGLVTEPKDDIKIYIRSRKED"
CYP39 human
GenEMBL EST R07010 R11279 and UNIGENE entry Hs.25121
covers the C-terminal part of a P450. The 2 ESTs with coding
regions
are not found in UNIGENE, but the opposite end of EST R11279 =
R11221 and it
is in UNIGENE with 13 EST sequences all from the 3 prime noncoding
region.
This sequence is most like CYP4A11, but the percent identity is only
39%. Since this is the most conserved region of P450s, the sequence
must be in a
new family. More sequence is known form the mouse homolog.
ENLLLIKWCVLETIRLKAPGVITRKVVKPVEILNYIIPSGDLLMLSPFWLHRNPKYFPEPELFKPERWEKGKFRRKHSFL
GTASWA
FGAGSSQCPGKVFALLEVQVC
CYP39A1 human
AC008104 AL035670 note heme region exon corrected 1/18/02
MELISPTVIIILGCLALFLLLQRKNLRRPPCIKGWIPWIGVGFEFGKAPLEFIEKARIK
YGPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVYRT
ASIPKNVFLALHEKLYIMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLVR
HLLYPVTVNMLFNKSLFSTNKKKIKEFHQYFQVYDEDFEYGSQLPECLLR
NWSKSKKWFLELFEKNIPDIKACKSAKDNSM
TLLQATLDIVETETSKENSPNYGLLLLWASLSNAVP
VAFWTLAYVLSHPDIHKAIMEGISSVFGKAG
KDKIKVSEDDLENLLLIKWCVLETIRLKAPGVITRKVVKPVEIL
NYIIPSGDLLMLSPFWLHRNPKYFPEPELFKPERW
KKANLEKHSFLDCFMAFGSGKFQCPARW
FALLEVQMCIILILYKYDCSLLDPLPKQ
SYLHLVGVPQPEGQCRIEYKQRI
CYP39 mouse
GenEMBL ESTs AA096922, AA606237
Note: this sequence has a poorly conserved I Helix and may not require
molecular
oxygen. The substrate may be a peroxide or some substance with its
own oxygen.
consensus
QFKTYDEGFEYGSQLPEWLLRNWSKSKRWLLALFEKNIGNIKAHGSAGHS
GTLLQAILEVVETETRQYSPNYGLVVLWAALANAPPIAFWTLGYILSHPDIHRTVL
ESISSVFGTAGKDKIKVSEDDLKKLLIIKWCILESVRLRAPGVITRKVVK
PVKILNHTVPSGDLLMLSPFWLHRNPKYFPEPESFKPERWKEANLDKYIF
LDYFMAFGGRKFQCPGKWFALLEIQLCIILVLYKYECSLLDPL
40A Subfamily
CYP40X rat, mouse, human (name changed to CYP27B1)
Note: This family was created based on the rat sequence AF000139 submitted to
the Nomenclature Committee in May 1997. This was the only sequence available at
the time and it was about 38% identical to rat CYP27. The sequence was
considered a new family and named CYP40. Since then, there have been additional
sequences determined for the rat mouse and human homologs. The rat sequence
AB001992 differs from the AF000139 sequence at 34 amino acids, mostly in two
regions that are probably frameshifts. The sequence AB001992 is 42.3% identical
to the rat CYP27A sequence. The mouse and human sequences are also more than 40%
identical to the CYP27A sequence. Therefore, they belong in the CYP27 family as
a new subfamily CYP27B1. Since there only appears to be one gene in each
species, all three species orthologs will be called CYP27B1. CYP40 has been
retired and this is indicated by the X in CYP40X.
41A Subfamily
CYP41 Boophilus microplus (southern cattle tick)
GenEMBL U92732 (414bp)
Crampton,A.L., Miller,C., Baxter,G.D. and Barker,S.C.
Expressed sequenced tags and new genes from the cattle tick,
Boophilus microplus.
Exp. Appl. Acarol. 22 (3), 177-186 (1998)
EST is from the middle of the P450
whole sequence known but still confidential.
42A Subfamily
CYP42 C. elegans
GenEMBL AL020988 (Y80D3), M89401 (EST cm08B12)
43A Subfamily
CYP43 C. elegans
GenEMBL AF026203
E03E2.1
44A Subfamily
CYP44 C. elegans
GenEMBL U21321
CELZK177
only mitochodrial-like P450 in C. elegans (missing part of I-helix
may be
pseudogene)
45A Subfamily
CYP45 Homarus americanus (American lobster)
GenEMBL AF065892 (1581bp)
Snyder,M.J.
Identification of a new cytochrome P450 family, CYP45, from the
lobster, Homarus americanus, and expression following hormone and
xenobiotic exposures
Arch. Biochem. Biophys. (1998) In press
46A Subfamily
CYP46 human
GenEMBL NM_006668
Lund EG, Guileyardo JM and Russell DW.
cDNA cloning of cholesterol 24-hydroxylase, a mediator of
cholesterol homeostasis in the brain.
Proc. Natl. Acad. Sci. U.S.A. 96, 7238-7243 (1999)
32% identity with Drosophila 4D2
ESTs H06539, H51951, R36281
mouse homolog EST AA096922
MSPGLLLLGSAVLLAFGLCCTFVHRARSRYEHIPGPPRPS
FLLGHLPCFWKKDEVGGRVLQDVFLDW
AKKYGPVVRVNVFHKTSVIVTSPESVK
KFLMSTKYNKDSKMYRALQTVFGER
LFGQGLVSECNYERWHKQRRVIDLAFSRSSLVSLMETFNEKAEQLVEILEAKADGQTPVSMQDMLTYTAMDILAK
AAFGMETSMLLGAQKPLSQAVKLMLEGITASRNTLAK
FLPGKRKQLREVRESIRFLRQVGRDWVQRRREALKRGEEVPADILTQILK
AEEGAQDDEGLLDNFVTFFIA
GHETSANHLAFTVMELSRQPEIVAR
LQAEVDEVIGSKRYLDFEDLGRLQYLSQ
VLKESLRLYPPAWGTFRLLEEETLIDGVRVPGNTPLL
FSTYVMGRMDTYFEDPLTFNPDRFGPGAPK
PRFTYFPFSLGHRSCIGQQFAQ
MEVKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPAPPPPPC
CYP46 Fugu rubripes (pufferfish)
No accession number
Scaffold_4537
60% to 46 human
MGVFNLIFGWISQASIFLLLLLFIALLGYCMYIKYTHMKYDHIPGPPRDS (2)
FFSGHSSKLLDIMKDDGVVHDMFLKW (2)
AETYGPVYKIYFLHHVMVFVSCPETTK (0)
EMLMSPKYTKDKFLHNRIGSLFGQR (2)
FLGNGLVTVRDHEKWYKQRRIMDPAFSSL (2)
YLRSLMGNFNETADKLMDKLSEIADNKTTANMLHLVNCVTMEVLAK (0)
VAFGVDLDLLRKSSPFPRAVELCLKGMVFSIRDTFFM (0)
LNPKNWSFIREVRGACRLLRQTGAQWIQQRKTAMRNGEVPKDILTQIIKSAGK (1)
EEIMTQEDEEFMLDNFLTFFIA (1)
GQETTANQLGFCIMELGRHPDILER (2)
VKKEVDEAIGMKQDISYDDLGHLGYLSQ (0)
VLKETLRLYPTAPGTSRDLKEDMVIGGVHVPGGVVCV (0)
FSSYGMGRMETFFKDPLKFDPDRFDPDAPK (2)
PYYCYFPFSLGPRSCLGQNFAQ (0)
MEAKVVMAKLIQRFDFTLLPGQSFDILDNGTLRPKSGVLCSLRHRDHKK*
CYP46A2P Fugu rubripes (pufferfish)
CYP46A3P Fugu rubripes (pufferfish)
CYP46A4P human
NT_004424.11|Hs1_4581 chromosome 1 CYP46 pseudogene fragment
2405597 DPLTFNPYRFGPGAPKPRFTYFPFSLGHHSCIGQQFAQMEVKVVMAKLLQRLEFQLVPGP 2405776
2405777 RFGLQ*QATLKPLDPELCTLRPRGWQPAAPPPRC 2405878
Cyp47a1 Drosophila melanogaster
GenEMBL AC005556, AC004516, AC004426 intron exon boundaries
approximate
renamed Cyp4aa1 following CYP4Z1
CYP48A1 Trichogramma cacœciae (a parasitoid wasp)
GenEMBL AF207950
BBRC 268, 677-682 (2000) Cloning and expression of cytochrome
P450 genes belonging to the CYP4 family and to a novel
family, CYP48, in two hymenopteran insects, Trichogramma
cacœciae and Apis mellifera by S. Tares, J.B. Bergé and M. Amichot.
123 amino acid sequence from I-helix to PERF region
45% identical to CYP47A1, but sure to drop into the
mid to low 30% range for a full length sequence
clone name 16Tc
CYP48A2v1 Trichogramma cacœciae (a parasitoid wasp)
GenEMBL AF207951
BBRC 268, 677-682 (2000) Cloning and expression of cytochrome
P450 genes belonging to the CYP4 family and to a novel
family, CYP48, in two hymenopteran insects, Trichogramma
cacœciae and Apis mellifera by S. Tares, J.B. Bergé and M. Amichot.
124 amino acid sequence from I-helix to PERF region
40% identical to CYP47A1, but sure to drop into the
mid to low 30% range for a full length sequence
clone name 15Tc
CYP48A2v2 Trichogramma cacœciae (a parasitoid wasp)
GenEMBL AF207952
BBRC 268, 677-682 (2000) Cloning and expression of cytochrome
P450 genes belonging to the CYP4 family and to a novel
family, CYP48, in two hymenopteran insects, Trichogramma
cacœciae and Apis mellifera by S. Tares, J.B. Bergé and M. Amichot.
124 amino acid sequence from I-helix to PERF region
40% identical to CYP47A1, but sure to drop into the
mid to low 30% range for a full length sequence.
58% identical to CYP48A1
clone name 21Tc ony two amino acid differences with CYP48A2v1
CYP49A1 Drosophila melanogaster
GenEMBL AC017930 48452-54739 AC007398 AC007418
ESTs AA941795 AI258819 AA697999
mitochondrial P450 fits in the mitochondrial clan
CYP49A1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs2l1
Note this is the last two digit number for animal P450s (except CYP51). The
animal P450s will continue at CYP301A1. This leaves 200 possible animal
families before reaching CYP501 and higher that are reserved for lower
eukayotes.
51A Subfamily
CYP51 human
GenEMBL U23942 (3172bp)
Stromstedt,M., Rozman,D. and Waterman,M.R.(1995)
The ubiquitously expressed human CYP51 encodes lanosterol 14 alpha
demethylase, a cyochrome P450 whose expression is regulated by
oxysterols.
Arch. Biochem. Biophys. 329, 73-81 (1996)
unpublished
CYP51 human
GenEMBL U51684 to U51692 (genomic sequence)
Rozman,D., Stromstedt,M., Tsui,L.-C., Scherer,S.W. and
Waterman,M.R.
Structure and mapping of the human lanosterol 14-alpha demethylase
gene (CYP51) encoding the cytochrome P450 involved in cholesterol
biosynthesis; comparison of exon/intron organization with other
mammalian and fungal CYP genes.
Genomics (1996) In press
CYP51 human
GenEMBL D55653 (3085bp)
Aoyama,Y., Funae,Y, Noshiro,M., Horiuchi,T. and Yoshida,Y.
Occurrence of a P450 showing high homology to yeast lanosterol 14-
demethylase (P450DM) in rat liver.
Biochem. Biophys. Res. Commun. 201, 1320-1326 (1994)
CYP51P1 human
no accession number
Rozman, D. Stromstedt, M. and Waterman, M.R. The three human
cytochrome P450 lanosterol 14a-demethylase (CYP51) genes reside on
chromosomes 3, 7, and 13: Structure of two retrotransposed
pseudogenes,
association with a line-1 element, and evolution of the human CYP51
family.
Archives of Biochemistry and Biophysics 333, 466-474 (1996).
processed pseudogene on chromosome 3
CYP51P1 human
processed pseudogene U36926 5 in frame stops
MAAAAGMMLLGLLQAGG*VLGQAMEEVAGGNLLSMLLIACAFTLSLVYLFRLAAGHLVQL
TAGAKSPPYIFSPVPFLGHAIAFGKSPTEFLENAYGNYGPVFSFIMVGKAFTYLLGSDAA
ALLFNSKNEVLNAEDVYSRLTTPVFG*GVAYDVPNPVFLEQKKTLKSGLNIAHFK*HVSI
IEKETKEYFESWGESGEKNVFAALSELIILTASHYLHGKEIRSQHNEKVAQLYADLDGGF
SHAAWLLPGWLPLPCFRRRDRAHQEIKDIFYKAIQKRRQSQEKIDDILQTLLDATYKDGR
PLTDDEVAGMLTGLLLAEQHTSSTSA*MGFFLARDKTLQEKCYLEQKTVCGENLPPLTY
DQLKDLNLLDRCIKETLRLRHPVMIMMRMARIPKTVAGYTIPPGHQVCVSPTVNQRLKDS
WVEHLDFNPDRYL*DNPASREKFAYVPFGAGHHGCTGENFAYVQIKTIWSTMLRLYEFDL
IDGYFPTVNYTTMIHTPENPVIHYK*RSK
CYP51P2 human
no accession number
Rozman, D. Stromstedt, M. and Waterman, M.R. The three human
cytochrome P450 lanosterol 14a-demethylase (CYP51) genes reside on
chromosomes 3, 7, and 13: Structure of two retrotransposed
pseudogenes,
association with a line-1 element, and evolution of the human CYP51
family.
Archives of Biochemistry and Biophysics, 333: 466-474 (1996)
processed pseudogene on chromosome 13
CYP51P2 human
processed pseudogene U40053
MAAAAGMMLLGLLQAG
GSVLGQAMEEVTGGNLLSMLLIACTFTLSLVYLFRLAAGHLVQLPAGAKSPPYVFSPVP
FPGHAIAFGKSPVEFLE
NAYEKYGPVFSFTMVGKTFTYL
LGSDAAALLFNSKNEDQNAEDVYSHLTTPVFGKGVAYDVPNPVFLEQKKMLKSGLNKAHF
KQHVSL
EKETKEYFQSWGESGEKNVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFS
HAAWLLPGWLPLPSFRCRDRAHWEIKDIFYKAIQKRRQSQEKIDDILQTLLDATYKDGRP
LTDDEVAGMLIGLLLAGQHSSSTTSAWMDFFLARDKTLQEKCYLEQKTVCGENLPPLTYD
QLKGLNLLDRCIKETLRLRPPIMIMMRMARTPQTVVGYTIPPGHQVCVSPTVNQRPKDSW
VERLDFNPDCYLQDNPASGEKFAYVPFGAGCHR*IGENFAYVQIKTIWSTMLRLYEFDLI
DGYFPIVNYTTMIHTPENPLIHYKRRSK
CYP51P3 human
NT_025741.8|Hs6_25897 chromosome 6 CYP51P3
20447678 MLGLVQMSRSALGQLVEWVAGESDSLLSMLLISCAFILSLVCFATIVTTWPSCQLVQNAH 20447857
20447858 HMFSLPLHSLGMPYIWEKLN*ISRKCI*EVWAYEKYGPVCSFSVVSKTFT 20448007
20447859 ICFLSHYIPWACHTFGKS*IEFLESAYEKYGHMRSMDLYVVFLW*ARHLLLERDETAL 20448032
20448347 LFNSKTKDAEDVYSHLRTPAFGKGVECDMPNPAFLGQEKMLKSSLKVAHFRQQVSI 20448514
20448515 TEKERNTFKLGRKQRKKLCEALSELII 20448595
FDS*PWFTWKGNQKSTQ*EVAQLCADVSGGFKPLA 20448698
20448699 WLRPRWLPLRGCRSSYKAH*EHNYCL*GNPETQKPEEKIQGILQTLLDTTDKGEHLLAD 20448875
20448876 EEVTGLLIRLFSGQHTSSTTGA*MSCFVARDETLHEKCYLKQKTVRGEDLPSLTYDLLK 20449052
20449426 VIGRTIPAGHQMGLSLTVNQGFQDTWVELVDPDQ*LQDISTGEK 20449557
20449566 GGGRHHIGKNFAHVQIKTVWSTLFHLYEFDFIDGYFPTVNYKTVVHHTPKNPVITYK*R 20449742
CYP51 rat
GenEMBL D29962 (1924bp) PIR JC2334 (430 amino acids)
GenEMBL D55681 (2268bp)
Aoyama,Y., Funae,Y, Noshiro,M., Horiuchi,T. and Yoshida,Y.
Occurrence of a P450 showing high homology to yeast lanosterol 14-
demethylase (P450DM) in rat liver.
Biochem. Biophys. Res. Commun. 201, 1320-1326 (1994)
Note: This is the first time a single P450 family is represented in
plants and animals. The N-terminal is more conserved than the C-
terminal.
CYP51 rat
GenEMBL U17697 (2383bp)
Sloane,D.L., So,O., Leung,R., Scarafia,L.E., Saldou,N., Jarnagin,K.
and Swinney,D.C.
Molecular Cloning and Functional Expression of Rat Lanosterol
14 alpha-Demethylase
Gene (1995) In press
CYP51 rat
Aoyama, Y. et al . Sterol 14-demethylase P450 (P45014DM) is one of
the most ancient and conseved P450 species.
The Journal of Biochemistry 119: 926-933 (1996)
includes a report on a processed pseudogene from CYP51 in rats
CYP51 Gallus gallus (chicken)
DKFZ426_22O10R1 from http://www.ri.bbsrc.ac.uk/cgi-bin/est-blast/blast.pl
Roslin Institute chicken EST project
603372085F1>603372085F1 cloneID='ChEST280i22' Chondrocytes from http://www.chick.umist.ac.uk/
603582971F1>603582971F1 cloneID='ChEST534n18' stage_36_trunks from http://www.chick.umist.ac.uk/
data from the UMIST/Nottingham/Dundee universities chicken EST sequencing project.
82% to human CYP51 81% to Fugu CYP51
MLSLLEVGGSLLERAAVGNPLSLLLAASAFALSLGYLFQLGYRRHVGADRTNH
PPHIPSSIPFLGHAIAFGKSPIEFLENAYDKYGPVFSFTMVGK
TFTYLLGSDAAALLFNSKNEDLNAEDVYSRLTTPVFGKGVAYDVPNVVFLEQKKMLKTGL
NIAQFKQHVTLIEEETKEYFKAWGESGERNLFEAFSELIILTASHCLHGKEIRSLLNEKV
AQLYADLDGGFTHAAWLLPAWLPLPSFRRRDRAHRAIKNIFYKVIQKRRSSEEKEDDMLQ
TLLDASYKDGRPLTD
DEIAGMLIGLLLAGQHTSSTTSAWLGFFIARDKAIQEQCYAEQKAVCGDDLPPLTYDQLK
DLSLLDRCLKETLRLRPPIMTIMRLAKTPQTVAGYSIPPGHQVCVSPTVNQRLKDSWKDA
LDFKPDRYLRDNPAAGEKFAYIPFGAGRHRCIGENFAYVQIKTIWSTLLRLYEFDLVDGY
FPSINYTTMIHTPNNPVIRYKRRSL*
CYP51 Xenopus laevis (African clawed frog)
No accession number
Hiroaki Ohi, Yoshiaki Fujita
78-80% identical to mammalian CYP51s
submitted to nomenclature committee July 26, 2000
CYP51 Fugu rubripes (pufferfish)
No accession number
Scaffold_437
78% to 51 mouse
67511 MSAHLYEMSSKLIGDTVGRVHD
67445 NLTTVVLAASFITLSLGYVSKLLLRQSFVTDA (0)
KHPPYI 67266
67265 PSCIPFLGHAISFGKSPIEFLENAYEK 67185
66989 YGPVFSFTMVGSTFTYLLGSDAAALLFNSKNEDLNAEDVYSRLTTPVFGKGVAYDVPNP 66813
IFLEQKKMLKTGLNIAHFKEHVKIIEAETRE 66633
66632 YFQRWGDSGER 66600
66525 DLFEALSELIILTASSCLHGKEIRSMLDERVAQLYADLDGGFTHAA 66388
WLLPGWLPLPSF
66277 RKRDRAHREIKKIFFKVIEKRRRSGENTDDILQTLVDATYK
DGRPLSDDEIGGMLIGLLLAGQHTSSTTSAWMGFFMARDRRLQ 65918
65917 ERCYAEQKAACGEDLPPLSFDQ 65852
LKDLSLLEGCLKETLRLRPPIMTMMRMARSPQ
TAAGYTIPVGHQVCVSPTVNHRLGDAWEQRLEFKPDRYLDDNPAAGEKFAYI 64964
64963 PFGAG
RHRCIGENFAYVQIKTIWSTLLRMYEF 64784
64783 DLVDGYFPTINYTTMIHTPHNPVIRYKRRHE* 64688
CYP51 Caenorhabditis elegans (nematode worm/ mammalian contamination)
no accession number
F54E5 contig 00150 this fragment's amino acid translation is identical
to human sequences U51688, AC000120 and rat sequence AB004091.
DVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAA
WLLPGWLPLPSFR
The human sequences U23942, U40053 and D55653 and the rat sequences
U17697, D55681, D87997, D78370 all have N instead of D at the first
amino acid. This may be a polymorphism in this gene. It is very
unlikely that this sequence is from C. elegans and it most likely
represents a contamination by a mammalian sequence.
CYP51 Triticum aestivum (wheat)
Francis Durst
CYP51BLE
submitted to nomenclature committee
see under plants for more 51 sequences
CYP51 Saccharomyces cerevisiae (yeast)
GenEMBL U10555 (24,431bp)
Johnston,M., Andrews,S., Brinkman,R., Cooper,J., Ding,H., Dover,J.,
Du,Z., Favello,A., Fulton,L., Gattung,S., Geisel,C., Kirsten,J.,
Kucaba,T., Hillier,L., Jier,M., Johnston,L., Keppler,D.,
Langston,Y., Latreille,P., Louis,E., Macri,C., Mardis,E.,
Mouser,L., Nhan,M., Rifken,L., Riles,L., St.Peter,H., Thornton,L.,
Trevaskis,E., Vaudin,M., Vaughan,K., Vignati,D., Wilcox,L.,
Willis,A., Wilson,R., Wohldman,P. and Waterston,R.
The complete sequence of Saccharomyces cerevisiae chromosome VIII
Science 265, 2077-2082 (1994)
YHR007c reverse complement of 20,961-22,553, ERG11 gene
CYP51 Saccharomyces cerevisiae (yeast)
GenEMBL T17568 (45bp)
Burns,N., Grimwade,B., Ross-Macdonald,P.B., Choi,E.-Y., Finberg,K.,
Roeder,G.S. and Snyder,M.
Large-Scale Analysis of Gene Expression, Protein Localization and
Gene Disruption in Saccharomyces cerevisiae
Genes Dev. 8, 1087-1105 (1994)
CYP51 Penicillium italicum
GenEMBL Z49750 (2053bp)
Nistelrooy,H.G.M., Van den Brink,H.M., Kan,J.A.L., Gorcom,R.F.M.
and Waard,M.A.
Isolation and molecular characterisation of the gene encoding
eburicol 14-alpha-demethylase (CYP51) from Penicillium italicum
Unpublished (1995)
CYP51 Uncinula necator (grape powdery mildew fungus)
GenBank U72657 (1960bp) U83840 (1756bp) U72658 (1575bp)
Delye,C., Laigret,F. and Corio-Costet,M.F.
Cloning and sequence analysis of the eburicol 14alpha-demethylase
gene of the obligate biotrophic grape powdery mildew fungus
Gene 195 (1), 29-33 (1997)
note: a mutation causing resistance to an inhibitor triadimenol is
Y136F
Delye,C., Laigret,F. and Corio-Costet,M.F.
A mutation in the 14 alpha-demethylase gene of Uncinula necator that
correlates
with resistance to a sterol biosynthesis inhibitor.
Applied and Environ. Microbiol. 63, 2966-2970 (1997)
CYP51 Schizosaccharomyces pombe
GenEMBL Z54096 (12168bp)
Hunt,S. Devlin,K. and Churcher,C.M.
unpublished (1995)
CYP51 Ustilago maydis
GenEMBL Z48164 (1874bp)
Hargreaves,J.A. and Keon,J.P.R.
The sterol 14 alpha demethylase (ERG11) gene from Ustilago maydis.
Unpublished (1994)
CYP51 Issatchenkia orientalis
GenEMBL S75391 (1242bp)
Burgener-Kairuz,P. Zuber,J.P., Buchman,T.G., Bille,J. and Rossier,M.
Rapid detection and identification of Candida albicans and
Torulopsis (Candida) glabrata in clinical specimens by species-specific
nested PCR amplification of a
cytochrome P-450 lanosterol-alpha-demethylase (L1A1) gene fragment.
J. Clin. Microbiol. 32, 1902-1907 (1994)
CYP51 Kluyveromyces marxianus
GenEMBL X97682 (338bp)
Morace,G., Posteraro,B., Sanguinetti,M., Lo Cascio,G. and Fadda,G.
Identification of various medically important Candida species in
clinical specimens by using PCR-REA.
Unpublished
CYP51 Erysiphe graminis f. sp. hordei eburicol
GenEMBL AF052515 (2167bp)
Delye,C., Bousset,L. and Corio-Costet,M.F.
PCR cloning and detection of point mutations in the eburicol
14alpha-demethylase (CYP51) gene from Erysiphe graminis f. sp.
hordei, a 'recalcitrant' fungus.
Curr. Genet. 34, 399-403 (1998)
CYP51 Candida tropicalis
GenEMBL M23673
Chen,C., Kalb,V.F., Turi,T.G. and Loper,J.C.
Primary structure of the cytochrome P450 lanosterol 14
alpha-demethylase gene from Candida tropicalis
DNA 7 (9), 617-626 (1988)
CYP51 Candida glabrata
GenEMBL S75389 (1242bp)
Burgener-Kairuz,P. Zuber,J.P., Buchman,T.G., Bille,J. and Rossier,M.
Rapid detection and identification of Candida albicans and
Torulopsis (Candida)
glabrata in clinical specimens by species-specific nested PCR
amplification of a
cytochrome P-450 lanosterol-alpha-demethylase (L1A1) gene fragment.
J. Clin. Microbiol. 32, 1902-1907 (1994)
CYP51 Candida parapsilosis
GenEMBL AF019902 (677bp)
Okhravi,N., Adamson,P., Mant,R., Matheson,M.M., Midgley,G.,
Towler,H.M. and Lightman,S.
Polymerase chain reaction and restriction fragment length
polymorphism mediated detection and speciation of Candida spp
causing intraocular infection.
Invest. Ophthalmol. Vis. Sci. 39 (6), 859-866 (1998)
CYP51 Candida albicans
GenEMBL X13296
Lai,M.H. and Kirsch,D.R.
Nucleotide sequence of cytochrome P450 L1A1 (lanosterol 14
alpha-demethylase) from Candida albicans
Nucleic Acids Res. 17, 804 (1989)
CYP51 Candida guilliermondii
GenEMBL X97680 (336bp)
Morace,G., Sanguinetti,M., Posteraro,B., Cascio,G.L. and Fadda,G.
Identification of various medically important Candida species in
clinical specimens by PCR-restriction enzyme analysis
J. Clin. Microbiol. 35 (3), 667-672 (1997)
CYP51 Candida dubliniensis P450L1A1, partial.
GenEMBL AJ012573
Morace,G., Posteraro,B., Sanguinetti,M. and Fadda,G.
Identification of various medically important yeast by Reverse
Cross Blot Hybridization
Unpublished
MTDQEIANLLIGILMGGQHTSASTSAWFLLHLGEKPHLQDAIYQ
EVVELLKEKGGDLNDLTYEDLQKLPSVTNTIKETLRMHMPLHSIFRKVKNPLRIPETNYVVPRGHYVLVSPG"
CYP51 Pichia anomala
GenEMBL AF019903
Okhravi,N., Adamson,P., Mant,R., Matheson,M.M., Midgley,G.,
Towler,H.M. and Lightman,S.
Polymerase chain reaction and restriction fragment length
polymorphism mediated detection and speciation of Candida spp
causing intraocular infection
Invest. Ophthalmol. Vis. Sci. 39 (6), 859-866 (1998)
CYP51 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.721 (supercontig 99)
Lower case = EST support
49163 MGILQVVAGPLSQQFSQLGTVSQIGVAIASFLFVAVVLNVLQQFLFKKPNEPPLVFHWFP 49342
49343 LIGSTITYGMDPPRFFKENREK (0)
YGDCFTFILLGKKT 49522
49523 TVYVGPKGNDFILNGKIRDVNAEEIYTVLTTPVFGKDVVYDCPNsklmeqkk 49678 (0)
49741 fmkialtteafrqyvpiisdevtsylkrtadfkgksgivdippkmaqitiftashalqg 49917
49918 keirdkfdetladlyhdldmgfspinfmlhwaplpwnnrrdyaqrtvakiymdtikerra 50097
50098 rgetgaqdimwhlmnstykgdvpvpdheiahmmiallmagqhsssstsswimlrlasr 50271
50272 pdimedlyneqvknlgadlppltyedlaklplhaaivkETLRLHAPIHSIMRAVKTPMPV 50451
50452 PGTKYVIPTDHVLLAAPGVSATDESYFPQPDLWEPHRWEKDSPLAPTIVRNVPSEEDDEK 50631
50632 IDYGYGLVSKGANSPYLPFGAGRHRCIGEQFANVQLQTILAIIVRNFKFRNVDGSDkvig 50811
50812 tdyaslfsrplepakiywerregcql* 50892
CYP51 Dictyostelium discoideum (cellular slime mold)
GenEMBL AU033519
47% identical to S. bicolor CYP51 all the best matches are to CYP51
MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIF
TLHLMGFKMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLR
FVKNGLVLSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSIN
KSLGQSGQLADLYPNWKKVSIQFHSFSQISHYHHSKRETLPVLKSRHFPSIIQERRRSTD
DSVDDVLYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEYFD
ETQKDINDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKGKT
IPAGHILAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFGIL
QIKTIWTVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK*
CYP51 Trypanosoma brucei (African sleeping sickness parasite)
GenEMBL AZ217433 AQ948529 AQ950690 AL492728 AL481001 AZ217732
AQ948526 AQ647531 AL480043 AQ660405 AL495885 AZ212935 AL481816
Assembled from overlapping GSS fragments
33% to mammalian CYP51s
MLLEVAIFLLTALALYSFYFVKSFNVTRPTDPPVYPVTVPILGHIIQFGKSPLGFMQECKRQL
KSGIFTINIVGKRVTIVGDPHEHSRFFLPRNEVLSPREVYSFMVPVFGEGVAYAAPYPRMREQ
LNFLAEELTIAKFQNFVPAIQHEVRKFMAANWDKDEGEINLLEDCSTMIINTACQCLFGEDLRKRLDARRFAQ
LLAKMESSLIPAAVFLPILLKLPLPQSARCHEARTELQKILSEIIIARKEEEVNKDSSTSDLLSGLLSAVYRDGTPMSLHEV
CGMIVAAMFAGQHTSSITTTWSMLHLMHPANVKHLEALRKEIEEFPAQLNYNNVM
DEMPFAERCARESIRRDPPLLMLMRKVMADVKVGSYV
VPKGDIIACSPLLSHHDEEAFPEPRRWDPERDEKVEGAFIGFGAGVHKCIGQKFGLLQVK
TILATAFRSYDFQLLRDEVPDPDYHTMVVGPTASQCRVKYIRRKAAAA*
CYP51 Trypanosoma brucei (African sleeping sickness parasite)
GenEMBL AF363026
Joubert,B.M., Nguyen,L.N., Matsuda,S.P.T. and Buckner,F.S.
Cloning and functional characterization of a Trypanosoma brucei
lanosterol 14 alpha-demethylase gene
Mol. Biochem. Parasitol. 117 (1), 115-117 (2001)
CYP51v1 Trypanosoma cruzi (Chagas disease parasite)
No accession number
Submitted by Sean M. Boyle for Fred Buckner Oct 25, 2001
83% to CYP51 of T. brucei
clone name CL13
CYP51v2 Trypanosoma cruzi (Chagas disease parasite)
No accession number
Submitted by Sean M. Boyle for Fred Buckner Oct 25, 2001
4 amino acid difference to CYP51v1
clone name WTB9
CYP51 Leishmania major (Leishmaniasis parasite)
GenEMBL AZ048391 LMAJFV1_lm78a01.x1 Leishmania major FV1 random genomic library
2 NKFFTPRNEILSPREVYSFMVPVFGEGVAYAAPYPRMREQLNFLAEELTVAKFQNFAPSI 181
182 QHEVRKFMKANWNKDEGEINILDDCSAMIINTACQCLFGEDLRKRLDARQFAQLLAKMES 361
362 CLIPAAVFLPWILKLPLPQSYRCRDARAELQDILSEIIIAREKEEAHKDSSTSDLLASLL 541
542 GAVYR*WTRMSQHEVCGMIVDAMFAGLPTFTITTT 646
CYP51 Phaeodactylum tricornutum (diatom)
GenEMBL BI307668.1
5 ETFADPDTYDPDRFAAPREEHKQPYAYMGFGGGLHSCMGQNFAFLQVKTIISVLLREYEL 184
185 ERVEPGMPDIGYDDMVVGPKGDCTVRYRKR 274
note: one of the few P450s from diatoms (stramenopiles)
another is CYP97E1 from Skeletonema costatum
CYP51 Mycobacterium tuberculosis
GenEMBL Z80226
This is probably in the 51 family even though it is in bacteria
It always clusters with the other CYP51 sequences.
The activity of this enzyme has been demonstrated to be a 14 alpha
demethylase with preference for obtusifoliol.
Mike Waterman presented the crystal structure of this
protein at a recent meeting in Moscow in July 2000.
52A Subfamily
CYP52A1 Candida tropicalis
GenEMBL M24894 M15945
Sanglard,D. and Loper,J.C.
Characterization of the alkane-inducible cytochrome P450 (P450alk)
gene from the yeast Candida tropicalis: identification of a new
P450 gene family.
Gene 76, 121-136 (1989)
CYP52A2 Candida tropicalis
Swiss P30607 (522 amino acids) GenEMBL M63258 (5745bp)
Seghezzi,W., Sanglard,D. and Fiechter,A.
Characterization of a second alkane-inducible cytochrome
P450p-encoding gene, CYP52A2, from Candida tropicalis
Gene 106, 51-60 (1991)
CYP52A3 Candida maltosa
GenEMBL D12475 D01168 (2047bp)
Ohkuma,M., Hijiki,T., Tanimoto,T., Schunck,W.H., Muller,H.G.,
Yano,K. and Takagi,M.
Evidence that more than one gene encodes n-alkane-inducible
cytochrome P-450s in Candida maltosa, found by two-step gene
disruption.
Agric. Biol. Chem. 55, 1757-1764 (1991)
Note: This is CYP52A3-b, one of two alleles in C. maltosa.
CYP52A3 Candida maltosa
GenEMBL S77461 (2047bp) Swiss P24458 (523 amino acids)
Ohkuma,M., Hikiji,T., Tanimoto,T., Schunck,W.H., Muller,H.G.,
Yano,K. and Takagi,M.
Evidence that more than one gene encodes n-alkane-inducible
cytochrome P-450s in Candida maltosa, found by two-step gene
disruption
Agric. Biol. Chem. 55, 1757-1764 (1991)
CYP52A4 Candida maltosa
GenEMBL X51932, S64322 (1787bp)
Schunck,W.H., Vogel,F., Gross,B., Kargel,E., Mauersberger,S.,
Kopke,K., Gengnagel,C. and Muller,H.G.
Comparison of two cytochromes P-450 from Candida maltosa:
primary structures, substrate specificities and effects of
their expression in Saccharomyces cerevisiae on the proliferation
of the endoplasmic reticulum.
Eur. J. Cell Biol. 55, 336-345 (1991)
CYP52A5 Candida maltosa
GenEMBL D12714
Ohkuma,M.
GenEMBL X55881
Ohkuma,M., Muraoka,S., Tanimoto,T., Fujii,M., Ohta,A. and Takagi,M.
CYP52 (cytochrome P450alk) multigene family in Candida maltosa:
identification and characterization of eight members.
DNA Cell Biol. 14, 163-173 (1995)
CYP52A6 Candida tropicalis
GenEMBL Z13010 (2501bp) Swiss P30608 (524 amino acids)
Seghezzi,W., Meili,C., Ruffiner,R., Kuenzi,R., Sanglard,D. and
Fiechter,A.
Identification and characterization of additional members of the
cytochrome P450 gene family CYP52 of Candida tropicalis.
DNA Cell Biol. 11, 767-780 (1992)
CYP52A7 Candida tropicalis
GenEMBL Z13011 (2087bp) Swiss P30609 (507 amino acids)
Seghezzi,W., Meili,C., Ruffiner,R., Kuenzi,R., Sanglard,D. and
Fiechter,A.
Identification and characterization of additional members of the
cytochrome P450 gene family CYP52 of Candida tropicalis.
DNA Cell Biol. 11, 767-780 (1992)
CYP52A8 Candida tropicalis
GenEMBL Z13012 (1929bp) Swiss P30610 (517 amino acids)
Seghezzi,W., Meili,C., Ruffiner,R., Kuenzi,R., Sanglard,D. and
Fiechter,A.
Identification and characterization of additional members of the
cytochrome P450 gene family CYP52 of Candida tropicalis.
DNA Cell Biol. 11, 767-780 (1992)
CYP52A9 Candida maltosa
GenEMBL D26160 (283bp)
Ohkuma,M.
unpublished
ALK5-B allele of CYP52A9 ALK5-A
CYP52A9 Candida maltosa
PIR JS0723
ALK5-A
CYP52A10 Candida maltosa
CYP52A11 Candida maltosa
GenEMBL D26159 (815bp)
Ohkuma,M.
unpublished
ALK8-B allele of CYP52A11 ALK8-A
Note on CYP52A12-CYP52A20 and CYP52D2 from Candida tropicalis.
These sequences seem to be orthologs of other Candida tropicalis sequences.
52A12 seems to be an ortholog of 52A6
52A13 and 52A14 seem to be orthologs of 52A2
52A15 and 52A16 seem to be orthologs of 52A1
52A17 and 52A18 seem top be orthologs of 52A8
52A19 and 52A20 seem to be orthologs of 52A7
the last 8 listed appear in what might be allelic pairs 94-96% identical to each other.
It seems probable that these actually are the orthologs of those other sequences
and this Candida tropicalis strain is different from the other.
CYP52D2 matches Candida maltosa 52D1 and was not observed in C. tropicalis before.
CYP52A12 Candida tropicalis
No accession number
David Craft Cognis Corp.
Submitted to nomenclature committee June 26, 2001
Clone name 1p
90% to CYP52A6
CYP52A13 Candida tropicalis
No accession number
David Craft Cognis Corp.
Submitted to nomenclature committee June 26, 2001
Clone name 2p might be an allele of CYP52A14 (96% identical)
86% to CYP52A2
CYP52A14 Candida tropicalis
No accession number
David Craft Cognis Corp.
Submitted to nomenclature committee June 26, 2001
Clone name 3p might be an allele of CYP52A13 (96% identical)
87% to CYP52A2
CYP52A15 Candida tropicalis
No accession number
David Craft Cognis Corp.
Submitted to nomenclature committee June 26, 2001
Clone name 4p might be an allele of CYP52A16 (95% identical)
85% to CYP52A1
CYP52A16 Candida tropicalis
No accession number
David Craft Cognis Corp.
Submitted to nomenclature committee June 26, 2001
Clone name 5p might be an allele of CYP52A15 (95% identical)
84% to CYP52A1
CYP52A17 Candida tropicalis
No accession number
David Craft Cognis Corp.
Submitted to nomenclature committee June 26, 2001
Clone name 7p might be an allele of CYP52A18 (94% identical)
78% to CYP52A8
CYP52A18 Candida tropicalis
No accession number
David Craft Cognis Corp.
Submitted to nomenclature committee June 26, 2001
Clone name 8p might be an allele of CYP52A17 (94% identical)
77% to CYP52A8
CYP52A19 Candida tropicalis
No accession number
David Craft Cognis Corp.
Submitted to nomenclature committee June 26, 2001
Clone name 9p might be an allele of CYP52A20 (95% identical)
75% to CYP52A7
CYP52A20 Candida tropicalis
No accession number
David Craft Cognis Corp.
Submitted to nomenclature committee June 26, 2001
Clone name 10p might be an allele of CYP52A19 (95% identical)
76% to CYP52A7
52B Subfamily
CYP52B1 Candida tropicalis
GenEMBL Z13013 (2442bp) Swiss P30611 (506 amino acids)
Seghezzi,W., Meili,C., Ruffiner,R., Kuenzi,R., Sanglard,D. and
Fiechter,A.
Identification and characterization of additional members of the
cytochrome P450 gene family CYP52 of Candida tropicalis.
DNA Cell Biol. 11, 767-780 (1992)
52C Subfamily
CYP52C1 Candida tropicalis
GenEMBL Z13014 (1622bp) Swiss P30612 (505 amino acids)
Seghezzi,W., Meili,C., Ruffiner,R., Kuenzi,R., Sanglard,D. and
Fiechter,A.
Identification and characterization of additional members of the
cytochrome P450 gene family CYP52 of Candida tropicalis.
DNA Cell Biol. 11, 767-780 (1992)
CYP52C2 Candida maltosa
52D Subfamily
CYP52D1 Candida maltosa
CYP52D2 Candida tropicalis
No accession number
David Craft Cognis Corp.
Submitted to nomenclature committee June 26, 2001
Clone name 6p
61% to CYP52D1
52E Subfamily
CYP52E1 Candida apicola
GenEMBL X76225 (1560bp) PIR S38894 (519 amino acids)
Lottermoser,K.
unpublished (1994)
CYP52E2 Candida apicola
GenEMBL X87640 (2203bp)
Lottermoser,K., Schunck,W.H. and Asperger,O.
Cytochromes P450 of the sophorose lipid-producing yeast Candida
apicola: heterogeneity and polymerase chain reaction-mediated
cloning of two genes
Yeast 12 (6), 565-575 (1996)
52F Subfamily
CYP52F1 Yarrowia lipolytica
GenEMBL AB010388 (3578bp)
Iida,T., Ohta,A. and Takagi,M.
Cloning and characterization of an n-alkane-inducible cytochrome
P450 gene essential for n-decane assimilation by Yarrowia
lipolytica.
Yeast 14 (15), 1387-1397 (1998)
gene for ALK1
CYP52F2 Yarrowia lipolytica gene for ALK2
GenEMBL AB010389
Iida,T., Ohta,A. and Takagi,M.
Yarrowia lipolytica cytochrome P450 gene, complete cds
unpublished
CYP52F3 Yarrowia lipolytica gene for ALK3
GenEMBL AB010390
Iida,T., Ohta,A. and Takagi,M.
Yarrowia lipolytica cytochrome P450 gene, complete cds
unpublished
CYP52F4 Yarrowia lipolytica gene for ALK4
GenEMBL AB010391
Iida,T., Ohta,A. and Takagi,M.
Yarrowia lipolytica cytochrome P450 gene, complete cds
unpublished
CYP52F5 Yarrowia lipolytica gene for ALK5
GenEMBL AB010392
Iida,T., Ohta,A. and Takagi,M.
Yarrowia lipolytica cytochrome P450 gene, complete cds
unpublished
CYP52F6 Yarrowia lipolytica gene for ALK6
GenEMBL AB010393
Iida,T., Ohta,A. and Takagi,M.
Yarrowia lipolytica cytochrome P450 gene, complete cds
unpublished
CYP52F7 Yarrowia lipolytica gene for ALK7
GenEMBL AB010394
Iida,T., Ohta,A. and Takagi,M.
Yarrowia lipolytica cytochrome P450 gene, complete cds
unpublished
CYP52F8 Yarrowia lipolytica gene for ALK8
GenEMBL AB010395
Iida,T., Ohta,A. and Takagi,M.
Yarrowia lipolytica cytochrome P450 gene, complete cds
unpublished
53A Subfamily
CYP53A1 Aspergillus niger
GenEMBL X52521
van Gorcom,R.F., Boschloo,J.G., Kuijvenhoven,A., Lange,J., van
Vark,A.J., Bos,C.J., van Balken,J.A., Pouwels,P.H. and van den
Hondel,C.A.
Isolation and molecular characterisation of the
benzoate-para-hydroxylase gene (bphA) of Aspergillus niger: a
member of a new gene family of the cytochrome P450 superfamily
Mol. Gen. Genet. 223, 192-197 (1990)
CYP53A2 Aspergillus parasiticus
GenEMBL AC005991
Lewis,J., Kupfer,D., Keller,N. and Roe,B.A.
Aspergillus parasiticus Cosmid Clone ap0
Unpublished
57% to CYP53A3
27896 MLLDLYLVKCLFCPL
27941 VPIVALGLLVAYYVAGYLKQWHLHDLPGPFIAGFSRIWLILQVRQGYRSLVVNDLHRRCG 28120
28121 KIVRLAPNHISVADESAIQAIYGHGNGFLK 28210 intron
28267 SNFYNAFLNVDWSIFTTRSRAEHTRKRKIVSHAFSARSLAQVEQYAHNTMELLVRQWQ 28440
28441 KMVDSQEGLDDPYAVIDARVWCNYLTFDIIGDLAFGAPFGMLERGNAIVSMRKALEDAAV 28620
28621 TLDAVEVLNHRGDVSAALGICPDLIPYAKWLPDLFFRQGAEAIANVADVAGAAVDRRLKM 28800
28801 NTSMTEKRGDLLAHLIDAEDQAGAKLGYRELTGEAVTLIAAGSDTSSSTLCALLYWVSTT 28980
28981 PRVLWKLQNVLDEVIPVDVEVPYLAMVKKI 29070 intron
29117 PSLQWVIWEALRIHSTFGQGLPREVPPERGPVEICSHTFYPGDVLSVPGYTMHHSVDIWG 29296
29297 IDVEDFVPERWDPCRLTQRQKDSFIPFSEGPRACIGRNLAEMEFFVGCATLFRLFEFRVE 29476
29477 GQGPLEVREGWLRKPVSLQVGIRRRYLDVRSS* 29575
CYP53A3 Aspergillus nidulans
no accession number
James Fraser
Submitted to nomenclature committee Oct. 3, 2000
Gene name bzuA
82% identical to CYP53A1
CYP53A4 Neurospora crassa
9a61.tfa_50cg@9A61
65% to 53A1 over 417 aa
note the automated assembly of the gene is missing the KYG
fragment seen in 53A1. See my assembly for corrected seq.
Neurospora crassa sequence contig 1.248 (supercontig 26)
14619 MAIISLLMSPWAPVVLLAGVAFYYLVPYFVTYSALRKIPSPFPAQFTDLWLL
SVCRRGNRYQRVDELHKKLGPVVRIQPNHVSICDDAAIPTIYGHGNGF
LNDFYDAFVSIRRGLFNTRDRAEHTRKRKIVSHTFSAKSVQQFEPYMHSNLELFVKQWDSMIKNSKNPDKAAHLD
CLEWFNYLAFDVIGDLSFGQPFGMLSSGADMAEIRSSPDAAPIYAPAIEILNRRGEVSATLGIHPALKPFAKYLP
DPFFTKGLAAVENLAGIAIACVKSRLDNPPPVTRKDLLQRLMEGRDEKGEPLGREELTAEALTQLIAGSDTTSNS
SCALLFHAVRTPGVMQKLQAELDANIPPEVDVPTYDMVKELPYLEAVINEVLRFHSTSGIGLPRQIPHDASQGVH
IQGYYLPPGTVLSVPTYSIHHSKEIWGPDADEFKPERWERLTARQKNAFIPFSHGPRSCVGRNVAEMEMKLIVAT
WARRYEVKLLQDYMDTREGFLRKPLGLKVGLKLRK 13029
53B Subfamily
CYP53B1 Rhodotorula minuta (a fungi)
GenEMBL D63703(2766bp)
Fukuda,H., Nakamura,K., Shibuya,K., Tanase,S., Gotoh,O., Ogawa,T.
and Fujii,T.
Nucleotide sequence of gene for cytochrome P450rm from Rhodotorula
minuta IFO 1102.
Unpublished (1995)
54A Subfamily
CYP54 Neurospora crassa
GenEMBL X15033
Neurospora crassa sequence contig 1.1450 (supercontig 261)
The translation below is more accurate than the one in Genbank
MALTQLAHSASALLPLVLPSIAAILAFAIFQRYFAPNPLSNLPIVGEEYHGYEKKRQAYLTKAKDLYLEGYTKFKHGLFRIVTPNR
NSVIVVSPRFLDELKKLPDDVVSFDAANDDSMHTKYTLIPTHEPILPHTIKTSLTPSLPRLNPQLSEEVQAFSQEIAPLMSSSPSD
WAPININSKLLRIVAKVSGRVFIGPELCHDERYLEAAVGYTVSVMEAQRAVERMNPWVRPIAAWRLKEVRKLAQMERDATAFLRPV
VEARREKQRKGEEKDNDMLQWLMDSADQGQGKQHGKWGEDTTTTRKLARLQLAISFAAIHTTTLVTTHAVYSLAADPKLQATLREE
IQSVLQEHKGVFNTSALQAMKKTDSFLKETMRFHPLGQTSFNRKVLRTFALSNGQVIPKGSTIEVPNYAVSRDPEAYPNPDVPDPL
RFYNLRNEAREKGEAEQGASGQFVSVNKEFLTFGYGRHACPGRFFAANEIKMILANLVMTYEMGLVEGETERYRDWDIAAGTIPDP
TKDVMFRKL*
55A Subfamily
CYP55A1 Fusarium oxysporum
GenEMBL D14517 (2598bp)
Tomura,D., Obika,K., Fukamizu,A. and Shoun,H.
Nitric oxide reductase cytochrome P450 (P450nor) gene,
CYP 55, of the fungus Fusarium oxysporum containing a
putative FNR site in its upstream flanking region.
unpublished (1993)
CYP55A1 Fusarium oxysporum
PIR A40401 (403 amino acids)
Kizawa, H., Tomura, D., Oda, M., Fukamizu, A., Hoshino, T.,
Gotoh, O., Yasui, T., Shoun, H.
Nucleotide sequence of the unique nitrate/nitrite-inducible
cytochrome P-450 cDNA from Fusarium oxysporum.
J. Biol. Chem. 266, 10632-10637 (1991)
Note: This sequence is probably a bacterial sequence related to the
105
family.
CYP55A1 Fusarium oxysporum
GenEMBL D14517 (2598bp)
Park, S.-Y., Shimizu, H., Adachi, S.-i., Nakagawa,A., Tanaka, I.,
Nakahara, K.,
Shoun, H., Obayashi, E., Nakamura, H., Iizuka, T. and Shiro, Y.
Crystal structure of nitric oxide reductase from dentrifying fungus
Fusarium
oxysporum.
Nature Structural Biology 4, 827-832 (1997)
CYP55A2 Cylindrocarpon tonkinense (a denitrifying fungus)
GenEMBL D78511
Shoun,H.
nor1
CYP55A3 Cylindrocarpon tonkinense (a denitrifying fungus)
GenEMBL D78512
Shoun,H.
nor2
CYP55A4 Trichosporon cutaneum (a basidiomycete fungus)
GenEMBL AB052733
Zhang,L., Takaya,N., Kitazume,T., Kondo,T. and Shoun,H.
Purification and cDNA cloning of nitric oxide reductase cytochrome
P450nor (CYP55A4) from Trichosporon cutaneum
Eur. J. Biochem. 268 (11), 3198-3204 (2001)
Submitted to nomenclature committee Oct. 11, 2000
P450nor
66% to CYP55A1
CYP55A5 Aspergillus oryzae
No accession number
Masahiko Kaya
Submitted to nomenclature committee Jan. 23, 2001
60% to 55A1
CYP55A6 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.183 (supercontig 22)
1c8.tfa_90cg@1C8
70% to CYP55A1
EST h8b02ne.f1 in lower case
mgaltenlefpfrrasgldpppeyawlrktdpvtqvrlfdgslawlvtkyhdvCQVATDERLSKERTRPGFPELA
AGGKEAAKNRATFVDMDPPKHMQYRGMVQPIFEMEHVKELEPYIQKTVDDLLERMKNMGCEGGPVDLVQNFALPVPSY (?)
IIYTILGVPVEDLDFLTQQNSIRTNGSSTAREASAANQ (?)
ELLNYLRRLAEQRLQEPKDDLISKLMTEQVKAGKLDTADAVQMAFLLLVAGNATMVNMIAL (0)
GIVTLFLNPSQLEELKRDPSVVPAFVEELSRFHTASAMAIRRVAKEDL
EIRGKTIKKGEGVIASNLSANRDEEVFKDADKFDMHREWPVKELGFGWGEHRCIAEFLAKAELKAVF (?)
ETIFKKLPDLKIAVPFDEIEYTPLDRDVGIVKLPVTW
CYP55A7 Trichosporon domesticus SBUG 752 (a basidiomycete fungus)
GenEMBL AY044231 and AY044232
Dirk Patzak
Submitted to nomenclature committee March 4, 2002
124 amino acid N-terminal fragment
77% to 55A4
56A Subfamily
CYP56 Saccharomyces cerevisiae
GenEMBL X55713(2265bp) PIR S13502 (489 amino acids)
Briza, P., Breitenbach, M., Ellinger, A. and Segall, J.
Isolation of two developmentally regulated genes involved in
spore wall maturation in Saccharomyces cerevisiae.
Genes Dev. 4, 1775-1789 (1990)
57A Subfamily
CYP57A1v1 Nectria haematococca MP VI
GenEMBL L20976 (3276bp) S70757 (1373bp) PIR S45583 (515 amino
acids)
Maloney,A.P. and VanEtten,H.D.
A gene from the fungal plant pathogen Nectria haematococca that
encodes the phytoalexin-detoxifying enzyme pisatin demethylase
defines a new cytochrome P450 family.
Mol. Gen. Genet. 243, 506-514 (1994)
PDAT9 pisatin demethylase
CYP57A1v2 Nectria haematococca
no accession number
Kevin McCkluskey and Hans Vanetten, Univ. Arizona
personal communication
pda4 pisatin demethylase
Note 98% identical to CYP57A1v1, probable allele
CYP57A2 Nectria haematococca
GenEMBL X73145 (2750)
Reimmann,C. and VanEtten,H.D.
Cloning and characterization of PDA6-1 a fungal gene encoding a
cytochrome P-450 which can detoxify the phytoalexin pisatin from
garden
pea.
Gene 146, 221-226 (1994)
high similarity to CYP57A1
58A Subfamily
CYP58A1 Fusarium sporotrichioides (filamentous fungus)
GeEMBL U22462 PIR S57337 (520 amino acids)
Hohn.T.M., Desjardins,A.E. and McCormick,S.P.
The Tri4 gene of Fusarium sporotrichioides encodes a cytochrome P450
monooxygenase involved in tricothecene biosynthesis.
Mol. Gen. Genet. 248, 95-102 (1995)
Note: also called TOX4
CYP58A2 Myrothecium roridum, complete cds.
GenEMBL AF009417 (4058bp)
Trapp,S.C., Hohn,T.M., McCormick,S. and Jarvis,B.B.
Characterization of the gene cluster for biosynthesis of
macrocyclic trichothecenes in Myrothecium roridum
Mol. Gen. Genet. 257 (4), 421-432 (1998)
(TRI4) gene
59A Subfamily
CYP59 Emericella nidulans
GenEMBL L27825 (4310bp)
Keller,N.P., Kantz,N.J. and Adams,T.H.
Aspergillus nidulans verA is reqired for production of the mycotoxin
sterigmatocystin
Appl. Environ. Microbiol. 60, 1444-1450 (1994)
gene stcS
note: C-terminal missing
Aspergillus nidulans (same as Emericella nidulans, Emericella is the
sexual phase of the fungal life cycle.
GenEMBL U34740 (505 amino acids)
Brown, D.W., Keller, N.P. and Adams, T.H.
complete sequence of CYP59
Proc. Natl. Acad. Sci. USA 93, 1418-1422 (1996)
60A Subfamily
CYP60A1 Aspergillus parasiticus
GenEMBL L40839 (1266bp) L41149 (1284bp)
Yu,J., Chang,P.-k., Cary,J.W., Wright,M., Bhatnagar,D.,
Cleveland,T.E.,
Payne,G.A. and Linz,J.E.
Comparative mapping of aflatoxin pathway gene clusters in
Aspergillus
parasiticus and aspergillus flavus.
Applied Environ. Microbiol. 61, 2365-2371 (1995)
gene ord1
missing N-terminal (approximately 93 amino acids missing)
CYP60A2 Aspergillus nidulans
GenEMBL U34740 (507 amino acids)
Brown, D.W., Keller, N.P. and Adam, T.H.
Proc. Natl. Acad. Sci. USA 93, 1418-1422 (1996)
gene stcF
CYP60A3 Aspergillus parasiticus
GenEMBL U62774
Yu,J., Chang,P.K., Cary,J.W., Bhatnagar,D. and Cleveland,T.E.
avnA, a gene encoding a cytochrome P-450 monooxygenase, is involved
in the conversion of averantin to averufin in aflatoxin
biosynthesis in Aspergillus parasiticus.
Appl. Environ. Microbiol. 63, 1349-1356 (1997)
CYP60A4v1 Aspergillus flavus strain SRRC 141
GenEMBL AF106960
Bhatnagar,D., Cary,J.W., Ehrlich,K.C., Cleveland,T.E. and
Payne,G.A.
Molecular characterization of an aflatoxin B2 producing mutant
strain of Aspergillus flavus
Unpublished
verB gene 83% identical to 60A2
MYFLSLPALAIVIPVGYVLFQLGYNLFFHPLRGYPGPLLWRASS
LPWKIALLRGTMHHDLMRYHQKYGDTVRIKPDEISYANAQAWRDIHAHVPGRPEFLKD
PVRLPLAPNGVMSILVSDTKNHARFRSLFGHAFSDKGLRTQESTIVQYADLLVEVLRE
VANTGRSAEMVYYFNMAIFDSIGALSFGESFDSLKSRQLHPWVDAIHKNLKSVAISHV
LRSMGIEFLTPYVLPKELRGKRQENYSYAVEKLNKRMKMEGDQGDFWDKVLVKSADDN
QRGDGMSAGEMLNNAAVMVVAGSETTASALSGAMYLLCLSGKIKKATAEIRKSFASPE
DIDLISVSHLPYLTAVIDETLRMYPAVPGQPPRVVPAGGATVCGRFVPEETRVGVSHL
GAYFADYNFTHADKFIPERHLQKTEEPYKYDNYGAYQPWSVGLRNCIGRNLAYAEVRL
TLAKLLWHFDFTLDVGKTGNFLDQKIWSIWAKRELYMFMKTRGTSSSSPQ
CYP60A4v2 Aspergillus flavus strain SRRC 1007
GenEMBL AF106959
Bhatnagar,D., Cary,J.W., Ehrlich,K.C., Cleveland,T.E. and
Payne,G.A.
Molecular characterization of an aflatoxin B2 producing mutant
strain of Aspergillus flavus
Unpublished
verB gene
99% identical to 60A4v1
CYP60A4v3 Aspergillus parasiticus strain RH1
GenEMBL AF106958
Bhatnagar,D., Cary,J.W., Ehrlich,K.C., Cleveland,T.E. and
Payne,G.A.
Molecular characterization of an aflatoxin B2 producing mutant
strain of Aspergillus flavus
Unpublished
verB gene
97% identical to 60A4v1
60B Subfamily
CYP60B1 Aspergillus nidulans
GenEMBL U34740 (501 amino acids)
Brown, D.W., Keller, N.P. and Adam, T.H.
Proc. Natl. Acad. Sci. USA 93, 1418-1422 (1996)
gene stcL
61A Subfamily
CYP61A1 Saccharomyces cerevisiae (yeast)
GenEMBL Z49211 (30,011bp) PIR S54015
Lye,G. and Churcher,C.M.
Unpublished 1995
Barrell,B., Rajandream,M.A. and Walsh,S.V.
S.cerevisiae chromosome XIII cosmid 9711 complement of 3155-4771.
CYP61A2 C.albicans
GenEMBL AL033396 comp(39932..41485)
gene="Ca35A5.10c" 68.9% identical to 61A1
CYP61A3 S. pombe
GenEMBL Z98974 join(10345..10419,10514..12001)
gene="SPAC19A8.04" 51% identical to CYP61A1
CYP61A4 Botrytis cinerea (a plant-pathogenic fungus infecting over 200
plant species)
GenEMBL AL111744 AL111746
Bitton,F., Levis,C., Fortini,D., Pradier,J.M. and Brygoo,Y.
58% to S. pombe 61A3
CYP61A5 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.384 (supercontig 48)
4488 MASNATSFTSPLASAKLGTVAVPPQLEY
4404 VIDTISQASGWTIVFTLLAVLVAWDQ (0)
VKYVLNKGTIA 4225
4224 GPSWKIPFIGPFLESLDPKFEEYYAKWLSGPLSCVSVFH (2) 4108
4037 KFVVIASTRDMARKVFNSPSYVKPCVVDVAHKLLGADNWVFLDGPAHVEFRKGLNGLFTR 3858
3857 RALEIYLPGQEEVYNRYFKSFLEITKNAGGKPVPWMPHFREVITAVSCRSFVGHYISDEA 3678
3677 VKKIADDYYLITAALELVNFPIIIPYTKTWYGKKAADMVLAEFAKCAAKSKVRMAAGGEA 3498
3497 TCIMDAWIKQMIDSKKWREAVASGNTEGLEKPTHLLRDFTDYEISQTIFTFLFASQDATS 3318
3317 SAATWLFQIMAQRPDVLDRVREENLKVRNGDIHAPINMEQLESMTYTRAVVRELLRYRPP 3138
3137 VLMVPYLVKKAFPITEDYTIPKGSMVVPTTYMALRDPEVYPNPDYFDPERYYSGDAEAKG 2958
2957 AKNYLVFGVGPHYCLGQHYAQLNLALFVGKASLLLDWKHHPTPLSEEIKVFATIFPKVSL 2778
DRSCCSACRDFFLLTLYRMTVPSLSRSASGKQRGFWVWTSI*
62A Subfamily
CYP62 Aspergillus nidulans
GenEMBL U34740 L39121 (9392bp)
Brown,D.W., Yu,J.H., Kelkar,H.S., Fernandes,M., Nesbitt,T.C.,
Keller,N.P., Adams,T.H. and Leonard,T.J.
Twenty-five coregulated transcripts define a sterigmatocystin gene
cluster in Aspergillus nidulans
Proc. Natl. Acad. Sci. U.S.A. 93, 1418-1422 (1996)
gene stcB
63A Subfamily
CYP63A1 Phanerochaete chrysosporium (white rot fungus)
GenEMBL AF005475 (208 amino acids)
Kullman,S.W. and Matsumura,F.
Identification of a novel cytochrome P-450 gene from the white rot
fungus Phanerochaete chrysosporium
Appl. Environ. Microbiol. 63, 2741-2746 (1997)
CYP63A1 Phanerochaete chrysosporium (white rot fungus)
No accession number
Jagjit Yadav
Submitted to nomenclature committee Dec. 15, 2000
Clone name Pc-1
Aside from intron editing, this sequence and AF005475 have only
10 aa differences clustered in one region and are probably
the same gene
CYP63A2 Phanerochaete chrysosporium (white rot fungus)
No accession number
Jagjit Yadav
Submitted to nomenclature committee Dec. 15, 2000
Clone name Pc-2
56% identical to 63A1 (one middle region may be revised later
this would increase the similarity)
64A Subfamily
CYP64 Aspergillus flavus
GenEMBL U81806(2760bp)
Prieto,R. and Woloshuk, C.P.
ord1, an oxidoreductase gene involved in the conversion of
O-methylsterigmatocystin to aflatoxin in Aspergillus flavus
unpublished
CYP64 Aspergillus parasiticus
GenEMBL AF017151
Yu,J., Chang,P.K., Ehrlich,K.C., Cary,J.W., Montalbano,B.,
Dyer,J.M., Bhatnagar,D. and Cleveland,T.E.
Characterization of the critical amino acids of an Aspergillus
parasiticus cytochrome P-450 monooxygenase encoded by ordA that is
involved in the biosynthesis of aflatoxins B1, G1, B2, and G2
Appl. Environ. Microbiol. 64 (12), 4834-4841 (1998)
97% identical to Aspergillus flavus CYP64
65A Subfamily
CYP65A1 Fusarium sporotrichioides
GenEMBL AF011355
Alexander, N., Hohn, T.M., and McCormick, S.P. (1998)
The TRI11 gene of Fusarium sporotrichioides encodes a cytochrome
P450 monooxygenase required for C-15 hydroxylation in trichothecene
biosynthesis.
Appl.Environ.Microbiol. 64(1), In Press
The TRI11 gene catalyzes the hydroxylation of isotrichodermin to 15-
decalonectrin
submitted to nomenclature committee
CYP65B1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.742 (supercontig 102)
42% to CYP65A1
MNLSHLLPLGIPLPSPTSLIGIFLIG (0)
TSILVSSILLSAIYNLTLHPLARHPGPFFHRASILPYLYRQITGTLPCSILDFHARYGPVVRISPNALSFADPQAWKDIY 26703
26702 GHRPHGEEEFAKLNLFYRIKGSPPSLLSETKEAHGTLRKLMAHGFSDRSMRAQEGIIGGYVK (1)
EEQDERVPLDMVSWYNWTTFDIIGDLAFGEPFGCLEKAEYDPWVDAVGKSVRFGCVMFA
LRLLGLEDWVCPLVRKLSGNARRFHRKRTMDKLQRRVKLTKERPDFLEGLLQKREEW (0)
GIDMDALAANASLLIVAGSETTATLLAGATYMLLRDPEAMKKLTEEVRSTFKSEEEITL 25711
25710 SSVGNLEYMLACLNEAMRLYPPVPIGMPRVVPKGGAKVAGTFVPEG (0)
TVVAVWHWATSHNEQHFVEPFEFHPERWMQDPRFANDRLD 25372
25371 AVQPFSVGPRNCIGR (2)
SLAIAEMRLILTKVVYNFDMKLANPEKDWLDQHIYTLWDKHPLPVYLTPIERNGV*
CYP65C1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.96 (supercontig 12)
45% to 65B1
MPVIDKLAGVTGLSGLHLLLGAVGVT (0)
ILLYIIGSAIYNVFFHPLASYPGPLLQRASPIPFSIRHALGVQPFHTQALHDKYG
PVVRIAPNHLSFTDVRAWRDIFGHIAGSRHGQEELAKFKS
FVRPIDAQPIHIISAGREKHGQLRRALANSFSDASMRGQGPLIGKYIDLLLHKLH
EQGQEGKVPLNATSWYNSVTFDVAGDLIFGMSFGALDQNGNHSWLQYILGSLSSLAPMAA
LSYAGLHWLVQLLWRAKGAEVFRKSMESVDLMLKERLKMPVDRNDLFEGLVQRQEKL (0)
GLSFEELASNAWLLVL (2)
AGADTTATTLSGTTYLLTQHPEVLKRVTQEVRSSFKSVEEIDISSVNRLTYMLAVLNEA 64919
64918 MRLYPPVTSLTARIVPRGGAHVLGEYLPEG (0)
64756 TIVDIQQWSMNHSKENWVDPWAFNPARFLDDKETAREKGNVLDALQTFSVGPRNCIGR (2) 64580
NLAYAEMRWILARILFDFDLRGAPTNQNWIERQKAYGLWDRIPLDVYFEPARERK*
66A Subfamily
CYP66 Agaricus bisporus (a mushroom)
GenEMBL Z82021 (932bp)
De Groot, P.W.J., Schaap, P.J., Van Griensven, L.J.L. and Visser, J.
Isolation of develpmentally regulated genes from the edible mushroom
Agaricus
bisporus.
unpublished
Note: most similar to CYP64
67A Subfamily
CYP67 Uromyces fabae (fungus)
GenEMBL U81793(1849bp)
Hahn,M. and Mendgen,K.
Characterization of in planta induced rust genes isolated from a
haustorium-specific
cDNA library.
unpublished (1997)
CYP68A1 Gibberella fujikuroi (a rice pathogenic fungus producing plant
hormones(gibberellins))
GenEMBL Y15277
Tudzynski,B. and Hoelter,K.
Isolation and characterization of cytochrome P450 monooxygenase
genes from Gibberella fujikuroi.
Unpublished
called P450I
CYP68B1 Gibberella fujikuroi (a rice pathogenic fungus producing plant
hormones(gibberellins))
GenEMBL Y15278
Tudzynski,B. and Hoelter,K.
Isolation and characterization of cytochrome P450 monooxygenase
genes from Gibberella fujikuroi.
Unpublished
called P450II 38% identical to CYP68A1
CYP68C1 Fusarium sporotrichioides (fungi)
GenEMBL AY040587 (not released yet)
Andrew Peplow Isaac Meek
Clone name TRI15
Submitted to nomenclature committee July 25, 2001
Low 40% range with 68A1 and 68B1, 44% to 68D1
CYP68D1 Neurospora crassa
GenEMBL AL355928.2
Neurospora crassa sequence contig 1.317 (supercontig 36)
sequence below not annotated in genbank entry
46% to 68C1
MANPSATPSSIPSWMERLDIKSITDPSATPFSYLVTAFLLAVVVYSLQGPRFPKNIKHLN
PKGPLEFSDTRPKKEFVYGSRQMLANWFKANPNKPCRVISDFGEAIVLPPRMANEIKNDD
RLSFTRWTYK (phase 0 intron)
AFHGHLPGFEGFGEASRESHIVQEVIMRDLTKYLN (phase 1 intron)
KVTEPLAQETSMAMEANLPKAANGEWSTINLRSKILPIVARISSRVFLGEELCRNEEWLK
VTQQYTIDGFGAAEDLRLWPAALRPIVHWFLPSCQRARADVRVARSILDPVLKKRRQEKA
ANGGKAEHDDAIEWFERTAKGKYYDPAVAQLVLSLVAIHTTSDLTCQVMTNLMQNPEFIA
PLREEMIQVLSEGGWKKTSLYNMKLLDSVIKESQRVKPTGVASMRRYAEKDVTLSDGTFI
PKGGFVAVSAHDMWNSEVYEQAEKWDGRRFLRMRETPGAGKENVAQLVSTAPEHLGFGHG
QHACPGRFFAANEIKIALVHLLLNYEWRLPEGSDPKIRTFGFSMGVDPSLKVEYKGRQPEIEL*
CYP69A1 Gibberella fujikuroi (a rice pathogenic fungus producing plant
hormones(gibberellins))
GenEMBL Y15279
Tudzynski,B. and Hoelter,K.
Isolation and characterization of cytochrome P450 monooxygenase
genes from Gibberella fujikuroi.
Unpublished
called P450III a distant match to CYP54
Note: two digit names for lower eukaryotes end here and continue as three digit
names at
CYP501. Animal P450 families cover CYP1-49 and CYP301-499 are reserved for new
animal P450 families.
Cyp301a1 Drosophila melanogaster
GenEMBL AC017275 comp(35594-38192) also AC007356
possible mitochondrial P450 fits in the mitochondrial clan.
CYP301A1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs2l3
CYP301A2 Drosophila arizonae
no accession number
from Tina Yee and Phil Danielson
submitted to nomenclature committee 6/29/99
84% identical to Cyp301a1
clone name DU36az
CYP302A1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs2l2
Note: in Drosophila Cyp302a1 is the disembodied gene (dib),
a C22 hydroxylase in the ecdysteroid biosynthetic pathway from cholesterol
Cyp302a1 Drosophila melanogaster
GenEMBL AC015396 31954-33705
Michael B. O'Connor
Submitted to nomenclature committee 11/17/99
30% to Cyp12b1 other best matches are 12 family members
the fly gene is called disembodied Dib
see <A HREF="http://flybase.bio.indiana.edu:82/.bin/fbidq.html?FBrf0101392">abstract</A>
Cyp303a1 Drosophila melanogaster
GenEMBL AC017306 12933-14444 no introns
CYP303A1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs3r4
Cyp304a1 Drosophila melanogaster
GenEMBL AC008359 comp(80887-82836) also AC014292 AC009393
CYP304B1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPg2r1
CYP304C1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPg2r2
Cyp305a1 Drosophila melanogaster
GenEMBL AC018013 comp(34574-37224)
CYP305A2 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPj2l1
CYP305A3 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPj2l2
CYP305A4 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPj2l4
CYP305B1 Bombyx mori (silkworm)
GenEMBL AB044900
Nobuyoshi Katagiri, Okitsugu Yamashita
39% identical to 305a1
Cyp306a1 Drosophila melanogaster
GenEMBL AC012373 154223-156412 also AC015216 AC012164 AC012376
CYP306A1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs2r6
Cyp307a1 Drosophila melanogaster
GenEMBL AC014810 comp(54284-52654) also AC007840
CYP307A1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs1x4
Cyp307a2p Drosophila melanogaster
GenEMBL AC019383 comp(3284-3706) AL078186
CYP307B1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs3r2
Cyp308a1 Drosophila melanogaster
GenEMBL AC015216 72624-74273 also AC012164
Cyp309a1 Drosophila melanogaster
GenEMBL AC019891 comp(47437-49308) also AC009909
Cyp309a2 Drosophila melanogaster
GenEMBL AC009909 45541-45982 also AC019977
Cyp310a1 Drosophila melanogaster
GenEMBL AC017690 comp(12877-14650)
Cyp311a1 Drosophila melanogaster
GenEMBL AC014186 2270-3910
Cyp312a1 Drosophila melanogaster
GenEMBL AC015424 57845-59744 also AC009369 AC007573 AC010003
Cyp313a1 Drosophila melanogaster
GenEMBL AC017740 comp(167857-169728) also AC007809 AC007928
Cyp313a2 Drosophila melanogaster
GenEMBL AC017371 4100-5929 AC007724 AL057969 AL067059
Cyp313a3 Drosophila melanogaster
GenEMBL AC017371 1890-3720 also AC007724
Cyp313a4 Drosophila melanogaster
GenEMBL AC017336 153877-155807 also AC007752
Cyp313a5 Drosophila melanogaster
GenEMBL AC017371 6352-8145 also AC007724
Cyp313b1 Drosophila melanogaster
GenEMBL AC007571 comp(104264-106046) also AC013225
Cyp314a1 Drosophila melanogaster
GenEMBL AC012699 22585-24723 also AC006496
CYP314A1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs2r3
Cyp315a1 Drosophila melanogaster
GenEMBL AC015141 229-2538 also AC008307 AC007725 AC007648
CYP315A1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs1x1
Note: in Drosophila Cyp315a1 is the shadow gene (sad), a C2 hydroxylase
in the ecdysteroid biosynthetic pathway from cholesterol
Cyp316a1 Drosophila melanogaster
GenEMBL AC017388 35538-37158 also AC010113 AC010557 AC010112
Cyp317a1 Drosophila melanogaster
GenEMBL AC020477 126550-128106 no introns also AC009844 AL109349
Cyp318a1 Drosophila melanogaster
GenEMBL AC014186 comp(968-207) AA803931 AA803220 AA821188 N-terminal
half
AC019897 comp(2937-4007) C-terminal half
CYP319A1 Boophilus microplus (cattle tick)
No accession number
Haiqi He
Submitted to nomenclature committee March 29, 2000
CYP320A1 snail
No accession number
Anne Lockyer
C-terminal is 36% to 304a1 35% to rabbit 2E1
Submitted to nomenclature committee Aug. 29, 2000
CYP321A1 Helicoverpa zea (earworm)
No accession number
Masataka Sasabe, Xianchun Li, May Berenbaum, and Mary Schuler
Submitted to nomenclature committee June 15, 2001
Clone name X14
33% to CYP6B2
CYP322A1 Perna canaliculus (New Zealand green lipped mussel)
No accession number
Karl Herron
Submitted to nomenclature committee Oct. 9, 2001
47% to 4F12 partial seq. from I-helix to heme
placement uncertain may be in the CYP4 family but
clusters outside the 4 family in some trees
CYP323A1 Saccostrea glomerata (Sydney rock oyster)
No accession number
Karl Herron
Submitted to nomenclature committee Oct. 9, 2001
41% to 4Y1 partial seq. from I-helix to heme
CYP324A1 Trichoplusia ni Family: Noctuidae (caterpillar)
No accession number
Ping Wang
Submitted to nomenclature committee May 20, 2002
32% to 6K1 33% to 6G1 (sequence 3)
CYP325A1 Anopheles gambiae (mosquito)
no accession number
Rene Feyereisen
submitted to nomenclature committee 7/5/02
low 30% range to all drosophila not CYP4, 6 or 9
94% to 325A2
CYP325A1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPd2r14
CYP325A2 Anopheles gambiae (mosquito)
no accession number
Rene Feyereisen
submitted to nomenclature committee 7/5/02
low 30% range to all drosophila not CYP4, 6 or 9
94% to 325A1
CYP325A2 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPd2r12
CYP325A3 Anopheles gambiae (mosquito)
no accession number
Rene Feyereisen
submitted to nomenclature committee 7/5/02
low 30% range to all drosophila not CYP4, 6 or 9
89% to 325A1
CYP325A3 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPd2r11
CYP325B1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPd2r13
CYP325C1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPd2r10
Formerly named CYP327A1
CYP325C2 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPd2r7
CYP325C3 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs3r3
CYP325D1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPd2r9
CYP325D2 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPd2r6
CYP325E1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPd2r5
CYP325F1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPd2r3
Formerly named CYP326A1
CYP325F2 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPd2r1
Formerly named CYP326A2
CYP325G1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPd2r2
CYP325H1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs2r2
CYP325J1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs2r1
CYP325K1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs2r4
CYP326A1X Anopheles gambiae (mosquito)
no accession number
Rene Feyereisen
submitted to nomenclature committee 7/5/02
low 30% range to all drosophila not CYP4, 6 or 9
80% to 326A2
retired number renamed CYP325F1
CYP326A2X Anopheles gambiae (mosquito)
no accession number
Rene Feyereisen
submitted to nomenclature committee 7/5/02
low 30% range to all drosophila not CYP4, 6 or 9
80% to 326A1
retired number renamed CYP325F2
CYP327A1X Anopheles gambiae (mosquito)
no accession number
Rene Feyereisen
submitted to nomenclature committee 7/5/02
low 30% range to all drosophila not CYP4, 6 or 9
retired number renamed CYP325C1
CYP328A1X Anopheles gambiae (mosquito)
no accession number
Rene Feyereisen
submitted to nomenclature committee 7/5/02
low 30% range to all drosophila not CYP4, 6 or 9
retired number renamed CYP325H1
CYP329A1 Anopheles gambiae (malaria mosquito)
GenEMBL
Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
See the paper: Evolution of supergene families associated with
insecticide resistance.
Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
Science 298, 179-181 (2002)
Anopheles map name CYPs2r5
CYP330A1 Carcinus maenas (shore crab)
GenENMBL AY328466
Kim Rewitz
submitted to nomenclature committee 6/18/2003
37% to 2J2
Note: three digit names for lower eukaryotes are reserved from CYP501-CYP699.
CYP501 Candida albicans
no accession number see What's New of Dec. 2 on the C. albicans
server CYP99
265137G06.x1.seq from the Candida albicans genome project
265137G06.x1.seq
3rd frame on complementary strand.
last 50 bases not reliable sequence. EXXR motif shows as DTLR with
the
D probably wrong here.
This sequence is incorrectly named as CYP99 which is a plant sequence.
CYP502 Coprinus cinereus(homobasidiomycete mushroom)
GenEMBL AB013443
Muraguchi,H., Takemaru,T. and Kamada,T.
The eln2 gene of the mushroom Coprinus cinereus, a dominant
mutation of which renders the fruit-body dumpy, encodes a
cytochrome P450 enzyme
Unpublished (1998)
gene name eln2
CYP503 Gibberella fujikuroi (rice fungal pathogen)
GenEMBL Y17243 (3090bp)
Tudzynski,B. and Hoelter,K.
Characterization of P450 monooxygenase genes from Gibberella
fujikuroi.
Unpublished
p450-4 gene
CYP504A1 Aspergillus nidulans (fungus)
GenEMBL AJ132442
Mingot JM, Penalva MA, Fernandez-Canon JM
Disruption of phacA, an aspergillus nidulans gene encoding a novel
cytochrome P450 monooxygenase catalyzing phenylacetate
2-hydroxylation, results in penicillin overproduction.
J Biol Chem 274, 14545-50 1999
gene name phacA
CYP504A2v1 Penicillium chrysogenum strain Wisconsin 54-1255
GenEMBL AF056978.1
phenylacetate hydroxylase (pahA) gene
Rodriguez-Saiz M, Barredo JL, Moreno MA, Fernandez-Canon JM, Penalva MA, Diez B.
Reduced function of a phenylacetate-oxidizing cytochrome p450 caused
strong genetic improvement in early phylogeny of penicillin-producing strains.
J Bacteriol. 2001 Oct;183(19):5465-71.
Rodriguez-Saiz,M., Barredo,J.L. and Diez,B.
The phenylacetate hydroxylase encoding gene from Penicillium
Chrysogenum
Unpublished 1999
CYP504A2v2 Penicillium chrysogenum strain NRRL1951
GenEMBL AF057558
Rodriguez-Saiz M, Barredo JL, Moreno MA, Fernandez-Canon JM, Penalva MA, Diez B.
Reduced function of a phenylacetate-oxidizing cytochrome p450 caused
strong genetic improvement in early phylogeny of penicillin-producing strains.
J Bacteriol. 2001 Oct;183(19):5465-71.
Rodriguez-Saiz,M., Barredo,J.L. and Diez,B.
The phenylacetate hydroxylase encoding gene from Penicillium
chrysogenum NRRL1951
Unpublished 2000
1 amino acid difference to CYP504A2v1
CYP504A2v3 Penicillium chrysogenum strain NRRL1951
GenEMBL AF057559
Rodriguez-Saiz M, Barredo JL, Moreno MA, Fernandez-Canon JM, Penalva MA, Diez B.
Reduced function of a phenylacetate-oxidizing cytochrome p450 caused
strong genetic improvement in early phylogeny of penicillin-producing strains.
J Bacteriol. 2001 Oct;183(19):5465-71.
Rodriguez-Saiz,M., Barredo,J.L. and Diez,B.
The phenylacetate hydroxylase encoding gene from Penicillium
notatum
Unpublished 2000
Note: the species in the title does not match the species in the Genbank entry
2 amino acid differences to CYP504A2v1
CYP504A3 Aspergillus terreus
GenEMBL AF141925.1
Kennedy,J., Auclair,K., Kendrew,S.G., Park,C., Vederas,J.C. and
Hutchinson,C.R.
Accessory Proteins Modulate Polyketide Synthase Activity During
Lovastatin Biosynthesis
Science 284, 1368-1372 1999
lovastatin biosynthesis gene cluster
72% identical to CYP504A2v1
CYP504B1 Aspergillus nidulans (fungus)
No accession number
Jose M. Fernandez-Canon
43% identical to CYP504A1, CYP504A2 and CYP504A3
submitted to nomenclature committee Feb. 24, 2003
CYP505A1 Fusarium oxysporum
GenEMBL AB030037
Naoki Takaya and H. Shoun
P450foxy, a fused reductase-P450 similar to CYP102
CYP505A2 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.388 (supercontig 50)
9a27.tfa_140cg@9A27 like P450foxy
58% identical to 505A1 over 505 aa N-term part
Neurospora crassa sequence contig 1.388 (supercontig 50) Length = 87086
MSSDETPQTIPIPGPPGLPLVGNSFDIDTEFPLGSMLNFADQYGEIFRLNFPGRNTVFVTSQALVHELCDEKRFQ
KTVNSALHEIRHGIHDGLFTARNDEPNWGIAHRILMPAFGPMAIQNMFPEMHEIASQLALKWARHGPNQSIKVTD
DFTRLTLDTIALCSMDYRFNSYYHDDMHPFIDAMASFLVESGNRSRRPALPAFMYSKVDRKFYDDIRVLRETAEG
VLKSRKEHPSERKDLLTAMLDGVDPKTGGKLSDDSIIDNLITFLIAGHETTSGLLSFAFVQLLKNPETYRKAQKE
VDDVCGKGPIKLEHMNKLHYIAAVLRETLRLCPTIPVIGVESKEDTVIGGKYEVSKGQPFALLFAKSHVDPAVYG
DTANDFDPERMLDENFERLNKEFPDCWKPFGNGMRACIGRPFAWQEALLVMAVCLQNFNFMPEDPNYTLQYKQTL
TTKPKGFYMRAMLRDGMSALDLERRLKGELVAPKPTAQGPVSGQPKKSGEGKPIS
Start of reductase part
IYYGSNTGTCETFAQRLASD
AEAHGFTATIIDSLDAANQNLPKDRPVVFITASYEGQPPDNAALFVGWLESLTGNELEGVQYAVFGCGHHDWAQT
FHRIPKLVDNTVSERGGDRICSLGLADAGKGEMFTEFEQWEDEVFWPAMEEKYEVSRKEDDNEALLQSGLTVNFS
KPRSSTLRQDVQEAVVVDAKTITAPGAPPKRHIEVQLSSDSGAYRSGDYLAVLPINPKETVNRVMRRFQLAWDTN
ITIEASRQTTILPTGVPMPVHDVLGAYVELSQPATKKNILALAEAADNAETKATLRQLAGPEYTEKITSRRVSIL
DLLEQFPSIPLPFSSFLSLLPPMRVRQYSISSSPLWNPSHVTLTYSLLESPSLSNPDKKHVGVATSYLASLEAGD
KLNVSIRPSHKAFHLPVDADKTPLIMIAAGSGLAPFRGFVQERAAQIAAGRSLAPAMLFYGCRHPEQDDLYRDEF
DKWESIGAVSVRRAFSRCPESQETKGCKYVGDRLWEDREEVTGLWDRGAKVYVCGSREVGESVKKVVVRIALERQ
KMIVEAREKGELDSLPEGIVEGLKLKGLTVEDVEVSEERALKWFEGIRNERYATDVFD
CYP505A3 Aspergillus oryzae
no accession number
Masahiko Kaya
Submitted to nomenclature committee Jan. 22, 2002
58% to CYP505A2
55% to CYP505A1
a fused reductase-P450 similar to CYP102
CYP505B1 Fusarium verticillioides
No accession number
Jeong-Ah Seo and Robert H. Proctor
Gene name FUM6 a second fused reductase-P450 similar to CYP102
CYP505B1? Gibberella moniliformis
GenEMBL AF155773
Proctor,R.H., Seo,J.-A. and Plattner,R.D.
Characterization of four clustered and coregulated genes associated
with fumonisin biosynthesis in Fusarium verticillioides
Unpublished
100% identical to 505B1 above after 1st exon, same authors, probably same gene
I am enquiring about this.
CYP506 Fusarium oxysporum
GenEMBL X82490
Mouyna,I. and Brygoo,Y.
Disruption of a Fusarium oxysporum f.sp. elaeidis cytochrome P450
gene by a repetitive sequence.
Unpublished
CYP507A1 Neurospora crassa
GenEMBL AA902084, AA902085, AA902086
Nelson, M. A., Kang, S., Braun, E. L., Crawford, M. E., Dolan, P. L.,
Leonard,
P. M., Mitchell, J., Armijo, A. M., Bean, L., Blueyes, E., Cushing,
T., Errett, A.,
Fleharty, M., Gorman, M., Judson, K., Miller, R., Ortega, J.,
Pavlova, I., Perea, J., Todisco, S., Trujillo, R., Valentine, J.,
Wells, A., Werner-
Washburne, M., Yazzie, S., and Natvig, D. O.
Expressed sequences from conidial, mycelial and sexual stages of
Neurospora crassa.
Fungal Genetics and Biology 21, 348-363 (1997).
Neurospora genome project
Sequence data
most like CYP57A1 (36% over 129 amino acids)
NM1B1-T7, NC4D11-T7 and NM1D12-T7
SPLHPRISAQVRDFHRHPHRLISSSSIITDQEARKLPYLQAVLYEAIRMM
PPLMNGFPRQVPAGGDTICGRFVPGGTDIFVNYVGMLRDKSVFGEDANVF
RPERYLEGDEERRNRMWKTTDLAFSHGRWRCLGQRLAWVQLQKVVVEFMR
NFEMQIVDPMHPCRLRCYCTPEMDDLMVKVTEAKWD*
CYP507A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.80 (supercontig 10)
MALTELLGALPLVSSGMSKLQVAAIYLAISFVSWYIISSFTDWYRLKAFPGPPTTGFSNLWAARAI
WTGKAHKIFPATQEKYGPITRIGPNALMVCDAATVVHINGVRGGYARSQDFYDCIRMDPW
DHTVLSESDSAMHNERKTKVYAGYHGKGEMDMEKDVDMVIAEAVELVRKKYMNSAATSGS
KPPLDYTRIARYIAVDSVTQTGFGKAWGDVREEKDHFGWLGFADMIVGYLHSLAYVPA
ISKIVISTPLMVLLGPKPTDKSGIGAFLG (2)
HAPQSRHRPLKHACKSSLQLFSHHPISLSINTCQDEWVKNGISQRTAVLEVSAQLPAGSDTTASAL
QGTMLYLLSTPSAYIRLKSEISTAIRTGLISSSSIITDQEARKLPYLQAVLYEAIRMMPPLMNGF
PRQVPAGGDTICGRFVPGGTDIFVNYVGMLRDKSVFGEDANVFRPERYLEGDEERRNRMWKTTDLA
FSHGRWRCLGQRLAWVQLQKVV (0)
FMRNFEMQIVDPMHPCRLRCYCTPEMDDLMVKVTEAKWD*
CYP508A1 Dictyostelium discoideum (cellular slime mold)
GenEMBL 16 ESTs C92049, C89925, C90052, C91122, C25600
C94043, C94448, AU034703, AU033852, AU037735, AU034252
AU037979, C92378, C24684 and clones SSC755, SLJ668 from
The Tsukuba dictyostelium genome project.
31% identical to fish 1A sequences and CYP73 sequences
MALFEIIISLF
VVYIIHNAISKYKKIHVNELCGPTPIPILGNLHQFGELPHRVLTKMTKKYGHILRVYMAD
MYTVVVSDPLLIREMYVDNSDIFTDRVKKPSVEHGTFYHGTVTSYGEHWKNNREIVGKAM
RKTNLKHIYELLDKQVDVLIRSMKSIETSGKTFDTRYYITKFTMSAMFKFLFNHDIPEDE
DINKGDTQKLMGPMSEVFQNAGRGSLFDVINITQPLYLLYLEMFDQSFKDIMKYHREKYN
EHLKTFDPDVERDLLDILIKEYGTDNDDKILSILATINDFFLAGVDTSSTALESMVLMLT
NYPEIQEKAFDEIKTVVNGRSKVNLSDRQSTPYLVAVIKETLRYKPMSPFGLPRSSSKDC
MIGGHFIPKNAQILINYQALGMNEEYYENPEQFDPSRFLKVESNVAFLPFSIGIRSCVGQ
SFAQDELYICISNILLNFKLKSIDGKKIDETEEYGLTLKTKNRFNVTLEKRII*
CYP508A2 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 22+45+69+86+88
CYP508A3 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 21+67+72+82+87
CYP508A4 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 66+70
CYP508B1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 15+77
CYP508B2P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 60
CYP508C1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 4+23+24+83
CYP508D1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 59+81
CYP508E1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 71+85
CYP509A1 Cunninghamella elegans (a fungus)
No accession number
Rongfu Wang
Submitted to nomenclature committee Feb. 29, 2000
CYP510A1 Lentinula edodes (a basidiomycete fungi)
GenEMBL AB045779 (still confidential on Aug 1 2000)
Susumu Kajiwara
Submitted to nomenclature committee July 17, 2000
CYP510A2P Lentinula edodes (a basidyomycete fungi)
GenEMBL AB015310
Partial sequence called SHP1 for small heme binding protein
This is clearly part of a P450. It appears to be
Interrupted by an insertion so it is a pseudogene
About 70% identical to 510A1
CYP510A3 Lentinula edodes (a basidiomycete fungi)
GenEMBL AB049963 (still confidential on Feb. 14 2001)
Susumu Kajiwara
Submitted to nomenclature committee Feb. 12, 2001
Clone name Le. CYP2 (499 aa)
67% to 510A2P 87 % identical to 510A1
100% identical to 510A1 from 286-483
CYP511A1 Botrytis cinerea (a plant-pathogenic fungus infecting over 200
plant species)
GenEMBL AL114463.1, AL113894.1, AL112730.1
These cDNAs overlap and cover most of the intact sequence
Bettina Tudzynski
Submitted to nomenclature committee Nov. 20, 2000
CYP512A1 Coriolus versicolor (a Basidiomycete fungus)
No accession number
Hiroyuki Wariishi
Submitted to nomenclature committee Feb. 14, 2001
Clone name P450Cv365
Most similar to AL355928.2|NCB1D4 Neurospora crassa unnamed P450
(35% over 364 amino acids) 16/17 identical amino acids at heme
signature region. This same region also conserved in
CYP54 from Neurospora crassa
Note, see lower eukaryote section of homepage for dictyostelium P450
Accession numbers, other identifiers and sequences. There are currently
25 complete sequence from Dictyostelium including CYP51 and CYP508 families.
CYP513A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 8+53.
CYP513A2P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 38.
CYP513A3 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 7+50.
CYP513B1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 49.
CYP513C1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 5+56.
CYP513D1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 54+55+78.
CYP513E1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq seq 51+52+90 ng3101.
CYP513E2P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 37a.
CYP513E3P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 37b upstream of 513E2P only 429 bp between
CYP513F1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 57.
CYP513G1P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 41.
CYP514A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 11
CYP514A2 Dictyostelium discoideum (cellular slime mold)
GenEMBL AB050504 C-term only
See lower eukaryote section for more accessions and sequences
Formerly seq 65
CYP514A3P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 89
CYP514A4 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
ng2792 only 7aa diffs to 514A1
CYP515A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 79b
CYP515A2P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 79a
CYP515B1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 6
CYP516A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 9+43
CYP516A2P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 64
CYP516B1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 10+42
CYP516B2P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 48
CYP517A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 3+61
CYP517A2 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 74
CYP517A3P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 36
CYP517A4 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
ng5440 most like 517A1, 13 aa diffs
CYP518A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 20
CYP518A2P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 25+27+29+30
CYP518B1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 26+28
CYP519A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 12
CYP519B1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 35+44+75
CYP519C1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 31+32+33
CYP519C2P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 46
CYP519D1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 14+34
CYP519E1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 17+84
CYP519F1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 47+68+80 renamed from CYP520A1
CYP519G1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 58 renamed from CYP520B1
CYP519H1P Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 39+73 renamed from CYP523 full length pseudogene
CYP520A1X Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 47+68+80 renamed CYP519F1
CYP520B1X Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 58 renamed CYP519G1
CYP521A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 13
CYP522A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 18
CYP523A1X Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 39+73 renamed CYP519H1P full length pseudogene
CYP524A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 91
Note: this seq is 34% identical to CYP710A1 and A2 from
Arabidopsis. That is the same identity between dicty CYP51 and
human CYP51. Therefore, the CYP710 seq may have shared a common
ancestor with CYP524A1 that was independent of CYP51.
CYP525A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 62
Note Didtyostelium names continue with CYP554A1 - CYP556A1
CYP526A1 Fusarium sporotrichioides
GenEMBL AF330109
Peplow,A.W., Tag,A.G., Garifullina,G.F. and Beremand,M.N.
Identification of genes under the control of TRI10
Unpublished
Clone name TRI13 (see TRI13 CYP65A1 above)
Not very similar to any other named P450
34% identical to rice blast fungus AQ361801 from
Magnaporthe grisea GSS sequence from BAC clone
CYP527A1 - CYP553A1 are Neurospora P450s
Note: several Neurospora P450s were named previously or fall into existing families
CYP51, 54, 53A4, 55A6, 61, 65B1, 65C1, 68D1, 505A2, 507A1
CYP527A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.73 (supercontig 9)
b2f20.tfa_120wg@B2F20 there are two P450 genes on this contig
MLQELLSLAQGPYALPILAASLVAYLLVSITITYRRLSHFPGPFLASFSPLFMLKVLFSGHHADGYSRLNQQYQS
PLVRIGTTDLITDDPAIIRHMNGARSAWGRSNWYRAMTLDPRGGSLFDEPNTKTHDLFKARLSFGYSGRENPGLE
GDIDEVIGMLIQYIKSRYISDDERGVLKRMDLAKVTQFFTLDVITRIAYGKEFGWLKTDKDQFEWTSTVIKAVPA
NSLIAELPLLQKIFLSKWFLQWFGPKHSDANGMG
162022 RVMGMAREMVAKRFGEKAEDRKDMLGSFVRHGIEQQACEVEVLFQILAGSDTTAVAI 162192
162193 RSTFFHLATSKKAYVRLLEEIDRAVKEGEVSSPIRAEEAKHLEYLQACVYEGLRMQAPFS 162372
162373 GLCMKSPPKGGDFINGMFIPEGTRVGHNFGGLIRRRDVFGEDADVFRPERWL 162528
162529 NAEPAKRLEMQQTTEMVFGYGRWACLGKPVALMELSKVFFE 162651
FFRRFDLQLVEPKNPWHEFNVNMCFQSDLWVKITERFPEGEKV
CYP527B1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.1433 (supercontig 257)
42% to CYP527A1 over 514 aa same family different subfamily
MASLGGFFDPSRLTTAS
PKAILITTFSLLIVYYITSSIIAWYRLRHFDGPWLGKFSYLWIFKIIHSGQMGEAAHAAQ
DKYGGVEPGYPSTVRIGPSDLITTELDLIRRMSGARSRYTRSDWYKLNRLDPYDDNMFS
TTSTTYHDKLKSKMAPGYAGRDNTGVEKDIAHVISKLVDKIRTKYAAPKGSTSPAGREM
LEFGQMAQYFTLDVICKTAFQVEFDHITKEEDVFGYIYMVHNLVYGNAMGATIPIIGKVL
SNPLFLKMFGPTTKDPRGMGLIMK (2)
VAREIVAKRFAPDAKDQQDML (0)
GSFIRRGLTQRQCETESIFQILAGSDTTATGIRSGLLYLCSNPRAYTRLQRE
IDERIASGLISSPCITNAESLAFPYLQAVIYESLRMRPPFDGLPFKIVPPEGDYTRDGKF
IPGGTKITATFGAMQRNKEVFGVDSDIWRPERWLEEEGCDAERRREMKSVVEMVFGYGRW
QCAGKMVAMLELNKIFCQ (0)
LMRNFDFQLANPTQPWKSRNYHIFFHDDFYLTVTDRHSKTEDEKSS*
CYP528A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.73 (supercontig 9)
b2f20.tfa_110wg@B2F20 there are two P450 genes on this contig
MSLPTLQDVLATSISLLKTTTPLLKRITSLLLKTTTSLLFIIVVTILAIAVYRLFFHPLAGIPGPRLAALSNIWH
AYHARNGRMYLLGKSLHKRYGPVVRVGPNELWFDSKEAFKTIYSAGSGFEKSDFYLSTLLTKPTLSLKFSPFKFP
FPNLQLPLQIHHPDTLDLLSERHLPRYRLQRRLIGPLYQLSSLKKFEPQIDAVLDRAIAQLRTLQGQEVDLKEWM
HIIAVECLGAVVLSWSPGYIRDKSDGGTSVQGYLGWKRKSVFGLFPGITILSFLEGGPFGLGERMGKGVARWWSN
TWGVTFATPKGFKPFFTAVYQKVSRRVSMAALLWMSDEVKVVKKPSSKKNKPAKGKMEEDKGKKDLLEDLIKLYQ
SKPEQFTETYLRRMAMTNFGAGHETLCSALTACMAMIGTHAAVQARVAEEVGSHLRGGGTVPFDTAAKQLPYTLA
AIKEAQRLHPVIAMSLSRTVPRDLDGGGVMLHGHWVPEGTTVGCNPVALHRNQEIFGEDADSEPEPTPNSEPAPV
PESVSGSPSEPDSDPKPEPNSAPKHQQPNLHPTPEQEEEEEEVERKRLREMLHLNLTYGGGARTCPGRHLAELIV
WKVIPRLVAEFRVEVTHMPRDEEKNMERYFMSMLTGVKVRFLERREDSGDMA
CYP529A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.73 (supercontig 9)
EST c1b06np.r1 (lower case)
METSNLTLLLLYLAIPSLLISAVQQILSWRKRTALVGSSGSKAKFPGPRQFPIVGRIHDLPRFSLWLK
FKEWADEFGPIYTTSVPDATFVIISDEKIAEELLVKRGHIYSGRPQIRSLIGHKEGVVYSALMDRH (1)
DTWKSQRKWAHAAMAAAYKHHFYGHVESemrrflvvllfdpdrfldhtreycgrvmsrlawddatqgrln
gesadktlhcmsvsgpitntmtplwslpasinpwykyevkreaelrqwwlglfrkakermrkgelpsdtwayryfeqmvqdtapsmpataRAPGPDVKNAERSPSTAAQEEAYEKKDALDQPEKDETFASCMIGFLNLVGVVTISGPLKFFQMAMALHPEWQHKAQAEIDEVCGDRMPTMKDFDKLPILRAC
LKETVRWRSGVPLGVPHQAERDDEFRGVKIKKGTIVLACEW (2)
TLNRNPQKYPDPETYNPARYLDPSYPTFQAPLTRYPNFREGSSMHSFGWGRRTCLGQNIVDDEMFVFGAALLWAF
EVTPKTCPVTGKEVPIDTHKTNSHVILEPVDFQLGYKVREGKKELILEGGKEFFGKV*
CYP530A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.85 (supercontig 11)
b10d6.tfa_160wg@B10D6
MAVITTYLPIVVGIALLIRLLMVGRRPKNYPPGPPTIPILGNLHLMPTKDVHLQFEKWAREYGPVYSLILGTKVM
VVLSSDRAVKDLLDKKSHMYSHRQEMYVGQTLCSGDMRILMLGYTAKWRMSRKLVHALLNVSSAKSYVPYQMLEN
KQMLYEMITKPDEFLYNIRRYSNALTTTMVYGWRTPTYNDEKMRQLFDGFSEFAEINQTGVAALIDFFPILRYLP
DFLLPARKKAKELHKKELGLYKGHWLKAKEVTLKGTITHCFCEDAVPVQKKEGFSDDEAAYIAGTLLEAGSDTTS
STLYGFVQAMLLYPEVQRKAQEEIDRVVGPDRLPTMDDEPHLQYCRAIVKESLRWMPTTIMGAVPHAVTQDDYYN
GYLIPAHAAVVNNAWAINHDPVRAPEPRKFKPERHLDDKLSLADSAANPDPTKRDQFTFGAGRRICPGIHVAERS
LFLGVARMLWGFSIAPKIGVDGKEIIPDQEKLTQGFVCMPEEYPATITPRDERRKGIMEEEWFAAERECLDPETK
QWKQGMNWVPKLSSTKV
CYP531A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.111 (supercontig 14)
MDSHAGLDGVLQPQVLKLTLRTLPSPFISFVVIPI
28695 LSILGWYIISWATNPLKKYPGPFLAG (1) 28775
FTNWWRLYQTRTGRYHEIVYDLHKKYGPIVRIGPNTLDLDYPELIKTIYSTDGKWLK (0) 29023
29083 TEFYHNNSAIVNGKITYHIFSTTSPTEHAKMKRPVAKYYSTSYVQALEPHVDVVLNDFCK 29262
29263 YVEERFINVPGGPKELDFGEWLGYRN (2) 29343
SVTFSRRFGYMEHGSDFDGTIEKAEAALRYFQTVGQIPILD 29555
29556 YFLDKNPIKRVGPPNLVHPARIA
29625 VQSFVARLQGKDENYDPKNPDYLQHFIDSKESHPDLVDDNQIISDVLVNLLAGADTTAIA 29804
29805 LRAVFYYMLKNRSVYNRVSEEIQAAGFDRSKPVPYSGARQLPYLEACVREALRIHPAA 29978
29979 AMLLERYVPADGITLPDGSFVPAGTAVGLNPWVVSRNKSIFGEDSDTFRPERW 30137
30138 LQQPGEDDEAFHVRMQKWNAADLTFGNGSRICTGRNFAMFELYKVCATLLHRFEIEL 30307
ADPTKEWKVWGSWFTIQKDVIAKMKVRG*
CYP532A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.145 (supercontig 19)
18532 MITDLIIKVLAQHWPAILCLVLVGWLVRNRYHNGLNKYPGPFLASLTDWWRFID (0)
KHGPIVRLGPNTLSFSDPEAIKTIYGLNKGFVK (0)
SDFYVVQQSTVKGHNLPSLFSTTDNEFHMQFRRCVNSAFAMSALVQYEPFVDNTTK 19077
19078 LFLEQTEKLYVEGGEKACDFTRWLQFYAFDVIGEITYSKRHGFIEHNKDIDGIVS 19242
YLTKLFNYVAP (0) this joint not well supported
IGQIPLLDRLFLKNPLYLKLSQWGLIDGTMPVAAFARARMAERLPELQTEKTAILPTSEKQAQLKIQTP
19548 DLLSKFLAARSARPNFMTDTLVQTMAVSMA
FAGSETTAISLSAVFYYLLRTPRALARLREEIDEAARQGKFSDYETG 19778
19779 LVTWAEAQTLPYLDACVKEAFRLHPAPGLPMERVVPPQGMEIVPGQFVKGGTIVGVSAWV 19958
19959 LHRDESIFGERVEEFRPERWFVDETVAKEDMEGKKREEEEKRIKTMNGHMLQFGMGSRTC 20138
20139 IGKNISLLEIYKLVPTLLRRFEVSLPPSLFFPPFLSYWIWRETID*
CYP533A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.246 (supercontig 26)
9a3.tfa_80cg@9A3 AL353820.2
N-terminal of this seq is probably incorrect since C-helix WGLHRK starts at aa79
MFRRMSPSGKYLEWSKKYNSDVLYFETFGTKWVVLNSYEAAHELLEKRGRKYSDRPRFVMFEEMGWAPTLTWLRW
GPQWGLHRKVLAPPFAKTGCVSFQPLQRKQAMIMCKNLIENPEEWMSAVTHFAVAIMFKIGYGIDVDTPTDEWVK
LAAEASEAVGKAGAPASSIMDRIPWTRHLPDWLPFMERLKYAHENKKVIQNITERPFNASMSDYHERDLDPTRKL
KECFVHRMYHLRGEDARRNRQNVYYEEDIKGGAATMLIAGNDTTASTLNLIILYLTKHPRVQRRAQAEVDYILLT
TGVPTEPVEDYRPSTPAPWASSSSNRAGIIRLPTLEDIPKFKYVNLILQEVYRINPLSPLGIPHASVKDSSDIDP
DGDDTYNGMRIPCGTIVYPNVWAMNRDEKRYREPERFFPERYLPKEEGGWGEPLPVGNFGFGRRVCVGQYLAENS
LLIAIAMMLATIEFDLPIGPDGNLKDFEPQFSHKGQSIVLPFKVSIKPRSPMVEELLDRHILVAKMAEKAAEDGK
KGGPTPSA
CYP534A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.285 (supercontig 30)
9a12.tfa_40cg@9A12 2 aa diffs with 9a22.tfa same gene missing first 326 aa
run of SSSSSSSS may be in an intron
MRLTPLVTLLGVFLAWATYKIYAGFRSNIAKARKTGYPVLITPIYPLSPLWLTTRHFLLPILTRLLPERLYASYV
FVMTPDWEYHPSPRSHFTRLGSDTFVVASWNGLACYTSDAAVISQVMSRREAFPKDTRQYGILEIFGSNVVTTEG
QVWRMHRKVTSASFNERNAGHTFREAVRQTRGMVGGWFSSSSSSSSSSSSSGSQEEGTGRREGDVEDMTGIITTL
EHDTMRWALNIIGYVGFGLKLLWPHESMPKDMDPKLAKYGSLQPPYGYSMTFADSLATVLERIVILLLIPSAVLK
RIPERWGSWGSKLHEAWEAKVNYSKYMNAFLAEKVQDIRAGVQEREGMNIMGQLVRSKYDPSSSEKNDGATMKLE
DSEIMGNAFIMTLAGHETTANVIHFAIVELAMHPEVQRRVQRDIDAIFGRDSDPETWDYEKSINALLASYIGAVI
NETLRLIPPVVVTPKRVADTDQWVQDGGQRHPMPAMMPVMPVISAVQRSRRYWPALHDKTTKKGDTDDLDEWHPD
RWYLPSMSQGQPSSDGEKGEGKQQQEENYGGYQGTDTSPTLFRPVRGSFIPFSDGARSCLGRRIAMVEMVAAIAV
LFQKYSVELAVDASDEEVQMMVPEQRRHVHAKAQEKAKETV
Probable break between P450 above and other seq that CONTAINS a MATCH TO
ELONGATION FACTOR 2 KINASE AND IS PROABBLY AN INNCORRECT ASSEMBLY FUSING THE P450
TO THIS GENE.
CYP535A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.294 (supercontig 32)
MGAATLIYEARWLLASVVLAIWAVLKTKQYYRLRHFKGPFGTGWFEIWHGLAYLSGHPHLKFKEATDKYG (1)
PIARIGPNELMTTSLELLAHINGARNRYTRTRWFYIAARFQPNSDNILSELDDNMHTKRRAQMAGG (0)
YSGKENRELESSVDIHVAELVQLIRTKYRSTEIKTNPFDLAQKAQYLALDVISHIGFG
EPFGMLLSDQDVNGYIKHADQGMNAITYLWGLGLTELVLDTPLMHLLGPSDTAKTGFGA
MTANTRKIISKRLEDDPGMTKGSDMLASWFRHGLDKEQLVTESILQLLAGAETAATSMRC
IMLYLVTNPRVYAKLQAEVDAAVKDGRAPPYPSVVSNATQRQLPYLWACCKEGMRMHPPV
TNSSQKKVPPEGDAVTVEGKTVYLPGGTNIGYCVWGLNRDQKIFGGDADCYRPERWLNEE
DPKKLAQMNLVHEMLFSYGKFQCLGKPIALMEIGKAIF (0)
LMRNFDWSVTNAAEPWKEMNAGALFLTHDFLVEATERSVS*
CYP536A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.402 (supercontig 52)
MGVVLKVKDGKVSPTRVLVGATATVLASYWLYILLFAIVIRAASKRYASPLRKYPGPFLASCSRLWKVL
STASGRTHLDHIELHRKYGPVVRIAPNEVSVASPEAARTLLSAGKRFFKTPFY
SVFPPAENPDIFTEIREDAHAQKKRVANVPYSMAAMQQLSPFIDDTIELLVRKIDEHI
AQHKGVFDLGDYLHYFAFDVLGEVAFSRSFGFLAQGRDVDNAIKTIDKSQTY
NGIVGQVPELDHLLRRNPLWRSIPWLSTKNALITRMALEEMGRRQPFNKD
TAVLQTGDTRQDLMASLILGHLKSPEKFGVGDVFAVAHGAI (2)
FAGSDSTASTMQSFFYHVLSSPSTYQSILYEIQSAVATGVIPATGNLTWNEAQTLPYLQAC
LKEAMRLRPAVGLNITRFVPPEGAELDGHFFPGGTSIAVNGWVLHRDKLTFGNDADEW
RPERWLENEEEARRMERYMFQ (0)
FGGGSHLCIGRNLALLEINKVVPRLLRDYRFELAHPGRELKATASFFVVQSGLEVFVRRA*
CYP537A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.443 (supercontig 59) C-term
Neurospora crassa sequence contig 1.444 (supercontig 59) N-term
MTLLLLLPIIIFLLHNLIYKPLTSPLSSLPGPLYTKFTSLVLKYHELRANRTRWIHALHVRYGPVVRIAPNEVSFASREGVKEIYCSGGSGFDKSGWYDLFKIYGRRW (0)
HAKRKRILADRYANSNVMRQAPLSGIIERADRVVKRCAESLDGSLDLYVSLLHNMTRRPWQLITDLLPRFVSFLLGSMMNTTK
sequence runs off end of contig
LTYFSRRKKKDRQLSYYSPTLHKAISKILYICGIKPRETPLADKFILDTAAKTDAAPFTLLDRLHSVRLG
46228 NSNGDGEHGGKDSKIAQIDQLDIAAELLDHMVAGIDTTGDALCFLMWELSQPRSRQIQ 46055
46054 TKLREELLSHPSSDINFDKLPYLDAVVMEGLRCFPAIPMSLPRIVPRGGKTID 45896
45895 GYFLPEGTTASCQAYSMHRIDMEAWGPDPDAFRPERWFEEKGDAARKKLFFAFANGARG 45719
45718 CIGK (2) 45704
HSLALAEMKILLREVYSKFTTVPDKTMTEEDMEMEDQLISTRPAGLKCLLKFEPIGGQ*
CYP538A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.498 (supercontig 67)
b2j23.tfa_120cg@B2J23
42% to Yarrowia CYP52F1 40% to 52A2 34-39% to other CYP52s
MLTPTTLLLLSPLLYLLTLLFRTALYRLRARSLGCSPVAVYPHKDPILGLDLFRESLRAMNTHALLPLWDARFKR
YGNTHYTLTLGKWVLMTNEPENVKVILGTKMAEWPIDGPRLHASVAVLGKKSIFTTNGAQW (?)
REARGMIRPSFVRD
QVADLHCFAKHVGNFLNAIPKDGSTFDMQELLLSMTMDSSTDFLLGCSTNSLLQPSPEAKQFLEDFEYTSREAAK
KSRLGTLLNWLPNGEFQATVTRVREYVRAYIRKSQAEKTDKKERDYVFLHEILKSGADEEHAIDQVLSVIIAGRD
TTAAAMTACFYYLARNPEAVKKLRKEIFDMEDEMPTWEQLKNMKYLNMIIKEG (?)
LRLFPPASTNSRAPIKDTVLPRGGGPDGKQPILVPKGQVVRWSLYSLHRRKDIWGEDAHEFRPERWDENLRVG (?)
WEYIPFSGGPRICLGQQFALTQIEYALFKFFRAFKSIEPRDDNGPLLLRTNLTVTFAKGCLVSATPDSN
CYP539A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.664 (supercontig 91)
seq in () may be introns most similar to CYP54A
low 40% range. Missing about 50aa at N-terminal
3707 HTNHSKGIKFIKTNVKMAAEHKNLQFWQQMFRSIGGYTGEVRLVGHRIIFTSEPENMKAI 3886
3887 LATQFEDYGKGEGFHQEWKDFLGDSIFTTDGDLWHASRQLIRPQFIKNRVSDLQCFENH 4063
4064 MQMLFRTIANG(GALNGEDQMVDMEAG)NGKPVDISDLFFRYTLDAATDFLLGKDIKSLRFV 4243
4278 LTENSTPVQPFADAFQEVQRVQIVIARAGPLNRFVPKKTFWEGLKVIDETINFYIDRALR 4457
4458 LDEEELASKSKGDEGYTFLHALAGFTKNRQVLHDQLMAVLLAGRDTTACTLSWAIYELAR 4637
4638 HPEAVAKLRAEILSVVGPDRAPTYDDLKSMKYLQNVMNETLRLYPVV(PFKQVPETSFSTRSLFP)NF 4835
4855 RMALKDTTLPRGGGPDGSQPIVILKDTPVGYSPLAMQRRPDLYPPVSEKFPDVEMFSPDR 5034
5035 WFHWQPKPWQYIPFNGGPRICIGQQFALTEMGYVLTRLFQRYDRVVSYMDEIDGGKPRMK 5214
5215 TDIVLMPGDGVKVA 5256
CYP540A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.764 (supercontig 106)
64591 FRKYGHVIKTTNMGRTNYLTDDPAVALVALAESAYFTKKITEEHPLFGIKDNSAIFIGD 64415
64414 TETENWRLAH
64384 KFLPPAMGPKAVRHYTPLMQECVRGSFKIFDELDSRGESWNVYQYMVKLASQTVGKFAMG 64205
YHHFDAVDTPVHSLVT (?)
YQQLPFGDPARL 64061
64060 REIRQITYGQLAETIEAAPKSGIENLPLNEAATKASCVADYLLNAVDEKGEHFPKGLILS 63881
63880 NMLVVTGAGYTTTSALLSWCIYCIVTYPKMQDRLLQELIDHGINSETDWDPDLAHS 63713
63712 LPFLDAFLKETQRLHNASFQPGRTTKTDVVLPGGFRLPENVIMIPALYAIHTNPEHWHD 63536
63535 PFRFDPDRWDTEEVKSRHRGAYMPFATGPRSCIGFNFALLE 63413
63412 IKVLLSELVYRYKFAREGFEAIEYDPEFQL 63323
63322 IRPLNLYVTAKRR 63284
CYP541A1 Neurospora crassa
No accession number
35% to CYP505A1 (P450 part only) 37% to CYP505A2 (P450 part only)
This contig does not have a match to the reductase part of 505A2
MSTPIPRPPGIPVLGNVFDITPSNTWWSLKALAEKYG (1)
EIFQVKILGKTIVFVASAALAEEICDEQRFQKYVGGPIVEIRATVHDSLFTAF
HHEQSWGIHHRIIAPLLTPQAVSGYTDDINLCATEVIQK
WASLGDSNVLEPLVDLNLLNLEATSLTLFSQKLNCIQAGGHPV
28005 IQAMEDAVSEAIMRPTRPGLVNWLLYSSKFKKATKTLRTWAADTVKY
27864 RQENPTDKHDMLWAFMNAKDPETGKGLSESEILDEIVTMPIGSATAPCAVTATIYYLLQ 27688
27687 NPEVVTKAREELDRVIGTGPLKDEHLSQLHYIEGITRETLRLSCAAPGFNIEPIPRQN 27514
27513 KADKSPLLLQGGKYQVAHDQKLIIVLAGVNRDPAVFEDSLAFKPERMM 27370
27369 GEEFDRLPKGVKKWYGNGKRECIGKEWANNFLKIVTARLIHEIDFEVAD 27223
EGYEFRQDGWFQIRPVAFKVRVKPRVRA*
CYP542A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.841 (supercontig 115)
66714 MFLLSLLTLYTIYIAYHRLYLSPLADIPGPRLAALTQWYEFYYEVILHGQYTFKIIELHK 66535
66534 QYGPIIRINPWEVHIADPDFHRELLPTNTNRRRHRTPFF 66418
gap in seq
65887 VEDPDFRAEITNGILTGSNYGKIFQHFPFLVPFLASIPPGMLAAISPFYRTFLHLRACIT 65708
65707 AQIGEIEKSLRSEEGKNAHLDIPHPTIFHSFVNTEALPPIEKSVPRIAQEGQV
65548 LVQAGTVTTSWALTISTFHLLDQHATALAKLRK (?)
LEQLPYLRAVIKESMRLSVGASGRITRVAPDETLRFK (?)
EWLLPPGTEVSMTSYQITTNPEIFPDPHAFVPERWLGKENEMRLDKYMT 65069
65068 VFGHGARVCLGMQLAYAEMYLMLSKMWRVWEGGPQVGGGEDEDG (?)
DAEMAEDWFIPVPYRGSKGVRVYFESY* 64766
CYP543A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.972 (supercontig 142)
b24b19.tfa_260cg@B24B19
MPCAFAITLLTVLVVLLVVQRWLERKRLRGGSRPLPGPTNLPFIGRVHDIPEKGSWLKFHEWSKVYGPIYQTKMF
GVTHVWISSEKIAHDLLSRRANIYSDRPQIPNLPDNRTSGDYLALLGRNDTWRRQRKLCNHLMHTSALASLHDYP
TLERNRFLYLLSQSPENYIEYIEQFTSRTIARLSWGSAHPAQILRHTTFGLLETISPSGALPNVISFLRHLPLAL
SPWQKKEKARHDLEDKQFRSNIGFVKRMMDVGRAEPSFIRTYLEEQQGSINSTSTNKKCDEQKGKEAAADEAMHV
VGLMAIAGALTIGSPIQSYILAMCHYPEWQARLQEEIDTELGGRCPMWEDREKLPLLRAVVKEVIRWRPPVPTGI
PHAVEKDDVYNGYFIPAGATIHALEWAITRDEQTYPQAETFNPSRFLVPSYPTYREPLTIYPNLSGFSQFGFGRR
TCQGVPIVEQDLFLTMGGMAWAFTISKKRDPVTGVEMPVHWNDYTPLLIAKPCRFPFDAVPRDEEKRKKMREMYE
EAVEGEEMERCESREERERTEGWMVEGGVRQKEEEIPWQSGVGLQRQVEGDVGAGVGAGGGGASSEDRRGQDHAQ
GPQDTSQEEPDDASDSHSTTSSRLDSTASGSGSVIGTPSASWGSEPSLSLRGSSVESSDCDEHEHDELRVVTRTT
PYKGEDGDDDGNEGMRMRSMGRMRGRRVRIKTMSNASSTIGMEALMTPMTPTTPEKEGLPFKEMGIMGIMGMGTM
KLGIATRRVMEVEDMEDLERVAVDDEMEVPGAWRW
CYP544A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.1020 (supercontig 147)
exon joints not well supported, just best guesses
MGIIHEKAHYVVAVALVAFLIRAFVLSQWNTIKRNGERLN (2)
LTILAHRKPPNTLPLVGNGLQFLQSRWKLFSWFDACQRKFGYETVAISVPTLPPGVLIHDPRNLDYVFKNEGIFTKGNFVKGRTWD
LFGNGIINAEGDFWKTQRKAGLSFLNTANLRVLTDVALPQYLSESISQLRSSTNGTV
VDLQHVFHEITTKLMGKMAYNASILG (1)
SFRLTHTQMAMHADDEFTVSFDYASGGTAERFQNPLWFVTEIFLGAELRKSIAVVKNFGRHIVTKAVQDRQEKEFGEEEGKLDQIS
GSLIQSLLDAIQDEQMVADAALTYLSAGRDTTGQALTWTFYLLMRHPRVVAKIREEATQLLKEKNVTLTPDQFDSSLFNPVTMPYS
MAVFYEVLRLYPPIPFEIRQCNEDVTLPDGTFLPKSSILVWCLWAMQRSKLTWGDDADEFRPERFLDGNKLISRSPSEFPVFYGGP
RTCLGRKMAEAIAAQVIPTMACLFDFVPTSDEERTSKTSLTLPMEGGLPVTVKTLTGEEREKLVRIPEIRFSEH* 42529
CYP545A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.1095 (supercontig 168)
long C-terminal
75354 MQLLTLVIVGLFMLIVAVVHFIKAFREVNDPNGIPGPTQIPYLGRVHDLPIQFMWLK 75524
75525 FKEWADKYGQQGFYRTMMLGAEFIVVTDEKVAEDLLVKRAKYNSDRPVIQSLFDSKSTHG 75704
75705 SMEYLPLM (1)
AYWARQRKLSHSYLTEATKAHYYG 75884
75885 VMYFEVQRWMARLLENPEDFQHSIEDMSSKVMCQLTWDDPSLSEYCTKSAWGLLTQMSPA 76064
76065 GPITNVLTPLWHLPTLINPWKRAERKRHDEQQAWWMERLLTCREKLARGELRPCWTRQFL 76244
76245 EKTSQKTSISGDYEASCVIGMLALVGIFTVVGPMSYWLVSMVHNPK 76382
76383 WQEAVQREVDEVCGNRMPRLEDAPRLPILRACIKETMRWKPNVPTGVAHETEADDHYQGY 76562
76563 FIPKGTRILPFDW (2)
SFLRNPVKYPDPENFRPERWLEPGWPT 76742
76743 YKEPLTQYPTIKGLTSFGWGQRQCLGMSLTQDELIVGCGALAWLFNLRHKRDPITGRELP 76922
76923 VPLDRSNSLLIIKPDPFQMEFEPRSKERKKEALRVWKESEAKDRARRERWL 77075
RNVKEGKPNVIKEPKVLQPTVKIPSPSPAAAVPAALVDGGEVRDELSKTVQVVKEKS
GGVNGHGDSLAEKAVMDVKKKADISITIARLDSTACVY*
CYP546A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.1099 (supercontig 168) C-term
Neurospora crassa sequence contig 1.1100 (supercontig 168) N-term
ORF with PIIGNT probable N-terminal fragment
MLSDKLLDLFALHKTQEALADVNLYKLLVLALALVLTAIIIDYGYMLWLRSKMPTGPFPWPIIGNT
FSLPENKPWIYFEELSKN (2)
ORF with WRIHRK probable C-helix
NPTIWINDAWCAHEILEKKAQIYASRPRMVVFGELGTGQKNLVTMRILNNNDRDR
WRIHRKLMHLGVGIQSVRGYREIQNNESKVVALDYLREPKEYVKHLERYATSVVSVIAFGRRVASYDDPIITEVIALMQLAADLNV
PGKRFPMLLETFPCEHSPLSISLSLHLSPP sequence runs off beginning of contig
22226 DASRQYSLPEAELSSLTGNLFGAGSDTSSSTLITFMLSCLAFPDAMHKAQAELDRVLG 22053
22052 GPAGGRSPHWDDAPNLPYINAYVKEVLRWRSVAIIGGQPHSNTSPDYYKGYFIPPHSWVQ 21873
21872 GNVWAIHHHEREFPDPDRFYPDRYLPGNDHHRPFPGEKGYMTFGWGRRVCSGQALAEQGT 21693
21692 WISVARLLWGFTIRKYRDPQTGKEEEVDIFA 21600
YTNGLNMRPQPFRCEIVPRSEEIREIIVREGEQALRDLKVLDGENRYRMSTFYQQKKREVAEMPEFDEKGNIRMVKVK*
CYP547A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.1351 (supercontig 233)
b2h3.tfa_30wg@B2H3
MPPTGNPFNFARITLLALVEVLIVKKTSYRHVLAISTTSLVLAILAINYVFFLAWKLYLYPVYFHPLSKFPAPKV
VDLWRVLARFRGKVPPGQLLLELAERTPNDGIIILQGGFGTSMLITKPAPLADILVHHPYDFVKHDAIRNFLRPI
LGDGLVIVEGEQHRFLRKNTQPAFKFGHIKELYSTMWTKAIEMNHVLKEELREKGNDTSVEINAWAGKVTLDIIG
IAAFGRDFHVLERPDHPLVKNYADLLEPGPAKFAYFFLALTLSRKFVDLFPWEISRRFNRTTSNIRRICAELVRE
RKAAIEKLGDDQFDILSLLIRSNNFSEAELADQLLTFLTAGHETTSSAFTWAVYKLAQDGEMQSTLRSELKRALP
DFPRFAPGQDIAIILEHLPYLNGVMNETLRLYPTVPMTVRTATCDTTVLGHPIKKGTEIMISPWLINRYSDNWPS
ASTEFVPERWIDQEGKHIAEDGTEQPTQKPNNTGGVTSNYAQMTFLHGPRSCIGQGFAKAELRCLLATFVLAFEW
KLGMDEKDVMPDGVITIKPNNGMHLILRSLDGHQKG
CYP548A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.1456 (supercontig 262)
ESTs b6f02ne.f1, b6g11ne.f1, b8g10ne.f1 (lower case = cDNA)
MVFGAAHLQLWLGALV (0)
56293 TSCLIFYRLFLHPLAKYPGPFLAKLTDGYMAYHAFKGDRHLEFWRMHEKYGKFVRFGPNSLSV 56472
56473 NSNTALKDIYGFKANVRKAEFYDAFVHPAPNTHNARDRDLHARKRRVLAHAFSDSAIKEV 56652
56653 ERYILSNIRTFCDAIGDLGRPSFAPLADKKGWSSPKNMSDWCSWLAMDILGDLCFGKAFH 56832
56833 MLERPDNRYAVDLVSVAAHRHLICGTMPMLDKLSLDRIFLRKIA 56964
56965 SGRAQYMAYSRQQLAERTALGDETDRRDFFYYLLKARDPETGQGFTTPELWGESNLLIIA 57144
57145 GSDTTSTAMAATLFYLVRNPTALARVTAEIRSRFASVEGIHQGPSLQSCTYLRACIDEAM 57324
57325 RLSPSVGGLLPREVLPGGITIDGEHVPQGTIIGVPHytihhnaayypdpwsytper 57492
57493 wlstrspsektsgaamqteeqiqlaqsafcpfsigprgcigkglayiemsitlarvlfly 57672
57673 dlrravgvddpaeggkkgaeygrhrvgemqlvdtftstkdgvmvefrpredlaa*
CYP549A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.1496 (supercontig 276)
9a41.tfa_150wg@9A41
MLIALLAPFSLLIAYVLHHVLSHQTLLLKQIFSQTIFNTIQLARLIGPLDPYDPLTVPDQVIPDSNSASEISEAQ
ARANQRIDDARSIIDFSYAGPKIELEDRLRLRALANSRLVAAFGINTSLTSSSVSVHKKFRKLASASINKSRADW
QKLYGVCMDFLQREKARYGRTGIGLAECVRCLCFVVVLVSQFGVDDKKVDKNLVKRITDGINAQWLKSKEDGEAV
KTSDSLNHDLGMLFGNLKTSVAEENGTGKDTSIDPSEALGLIMPQYETLWRVVLLTYVTAYYTQYDKRSLKKRVK
DIPSCLGNAATEKEALKLAT (0)
EGLRLYPSNNSIYRAATGPGPLKSADVQACHRDFNVWGRDALEFRPERFDNLTPL
QEKAYFPFSLGSHKCPAFGGFGNRMVTMLVVSMGRALSPETGKLNFEDAQLDNRVGIILPTGRDEMENWSWDM (0)
SWEF
CYP550A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.1526 (supercontig 290)
EST AI399597 NCSP6F8T7 (lower case)
MTVENSIVQHALTVYHDLSTATKATLAISIAFILYRLLFRHESTRRKELPAWGPIEMGLVMYLLdgarngivyrv (2)
YSAIRRYGGSFYGISSANQTLIGYEDVDRLFSSQMNHALSIEWHGYGLFLRFFGFPDTPTLKKKVESTFKPWHSP
IERVFNNDAGATAAFERGNLPKIVSEFVSFSQEKEKMKRWELRADIKVIKQGEAVEANLSALIMDFGACFTIPPI
FGHDLLDRNPHLIEDIWKFDHVVPLLFINMPTWAPFKVMKEGIEARKRLLAGVEAFSRRVTQFQRGEKVDDGADI
SDVSDVVLERNtalernewsyqeraasellvlfasntntqpvlfwlilyiystpglvntlrqevapfitlsdshh
Dkkgitaintsglsrecqllksiiletlrlafgavstrfvkrpitvqdgnhnhklypgtflsvphsffqrdpsly
PdphkfvpdrfleTDPSSGKLVAKYGKLRPWGAGASMCRGRVFAEKELMAIGAALVTLWDISPVGGGEWTVPEMV
GGPGPVKPKEEVRVVIKRRVFGGVSSSE*
CYP551A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.1539 (supercontig 295)
MAVLNWASSRAATLSPCPRTAALVVITTWLLYAIGLAVYRLYFHPLARFPGRKMAAVTTWYEFYYDFWYGGKYIF
EIEKMHKEFGPIVRINPHELSIHDPEYYNELYVGSSKRRTNFWPLFQGCTDDTDEKASFLESRIRECKGQGKAVR
LDRAFSAFSADCIERICTDDLEPGDGFLDQPDFGPEWYDGMLGLIRNAPILTKFPKALSSLRYVPPRLLLWLFPQ
GRIANKYDARTRTQIHKAVTKQHLSKDTTTNSEHTTLFRALAQSDNLSPADKAEERLVREAKLIFLGGTISTGRT
LSFVSYYILSRPDIKARLEDELRDVMATWPEVVPTWTELEKLPYLQAVIKEALRLSYGFMRRLPRVSPDVAIQYK
EYTIPPGTPVGMSAYLMHSNPEVYQDPDQFVPERWLGGDTRVMQRSYVPFTKGSRSCLGQNLSMAEISLVLAVLF
RPDGPRMELFETDESDVKHVHDFVVPLPKLDSLGVRVMVR
CYP552A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.1554 (supercontig 301)
sequence runs off beginning of contig missing about 33 aa at N-term
2 FFHPLRHIPGPKLAAASYIPEFYWDVLRGGMYQKRILLMHEKYG 133
229 GPVVRISPNEVHCNDWRFIDEIYASGSRKRDKPLHQ 336
1017 SSVFGALLTSDLPPSEKSL
1074 ERLTDEGFSLFAAGTETVSWALSVITYHLFANRELLEKVTKEVTQVIDAKSGQLPSWHTL 1253
1254 EKLPYLGGVIYEGLRLSYGVASRSSRIPVGEDLVYRG (0)
YVIPRGYAI 1433
1434 GMSPLSIHHDETVFPNSHSFIPERWIDDNDQHRKELDRALLSFSKGSRSCLG 1589
1654 SLAYCELYVLLSLLIVRVFPHMKLHETTEVDVTHDHDFFNPFPVSSSKGVRAIV
CYP553A1 Neurospora crassa
No accession number
Neurospora crassa sequence contig 1.1649 (supercontig 337)
N-terminal half
MIWVQSKTSPVQFLVGGVAVLLSYLLLTISPPHDPSEPPLIKPRFPVIGHLFGLLNHQVN
YFSELFARYHLPIATLPIGNQKLYVIFDAPLQQAALKAKDMDAQSFMVDFVPRIFGVKQG
TVDKLLGKDGVHPNIMGDMEQVFKSALSGDNLQKLASTTLATMADTLNDVDSKKGTKIPN
MFLWLQSLLSRSTSKALWGEKHNPFKDHQVIDAQWYVTFPLPSLSQS undefined joint
C-terminal half
11895 PPPKHHRDFESHLGPLVLGVLPSIVARRAYLARNKVQSALLSYFSARHYETDPSTSYFIR
ARTRLLKRYDLPDDELAKNEVAITLVATTNVLSTLFWCIAEIWTRPDLLQQIRAEALLAV
FGNSQKALSGEDMGTRTITIPATGLDAKCPLLASCFRESIRLASQIVTARRVLKDTLLTD
PVSGNSYLLKADTNVMMPAKVVHRNTSVWGADAEGFNPRRFMWEELGPKVERQRKAAFVP
FGGGKHMCPGRHFAFTENLALMAALAIGFEIEGLDRSKLKMGDSKRGETAKPLPGLEGGE
VVLKKRKGWEGVEWEFSC* 12851
Note: several Neurospora P450s were named previously or fall into existing families
51, 54, 53A4, 55A6, 61, 65B1, 65C1, 68D1, 505A2, 507A1
There are 12 fragments of Dictyostelium P450s that are not named because they
are too short or not similar enough to named P450s. These sequences are
numbered with temporary tags. P indicates a pseudogene.
Seqs 17, 37, 40, 41, 51, 52, 57P, 60P, 76, 84, 89P, 90
CYP554A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 40
CYP555A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 76
CYP556A1 Dictyostelium discoideum (cellular slime mold)
See lower eukaryote section for accessions and sequences
Formerly seq 92
CYP557A1 Rhodotorula sp. (a basidiomycete fungus)
No accession number
Martie Smit Department of Microbiology, Biochemistry and Food Science
University of the Free State P.O. Box 339 Bloemfontein 9300 South Africa
Submitted to nomenclature committee 11/19/2002
Unidentified fragments from birds, fish and mammals
rat
PIR S39762 (12 amino acids)
Ohishi, N., Imaoka, S., Suzuki, T. and Funae, Y.
Characterization of two P-450 isozymes placed in the rat
CYP2D subfamily.
Biochim. Biophys. Acta 1158, 227-236 (1993)
rat
PIR S39763 (10 amino acids)
Ohishi, N., Imaoka, S., Suzuki, T. and Funae, Y.
Characterization of two P-450 isozymes placed in the rat
CYP2D subfamily.
Biochim. Biophys. Acta 1158, 227-236 (1993)
rat
PIR A61597 (8 amino acids)
Shimeno, H., Toda, A., Ogata, S. and Nagamatsu, A.
Purification and aminopyrine monooxygenase activity of liver
microsomal cytochrome P-450 from alloxan-induced diabetic rats.
Drug Metab. Dispos. 19, 291-297 (1991)
rat
PIR B61597 (14 amino acids)
Shimeno, H., Toda, A., Ogata, S. and Nagamatsu, A.
Purification and aminopyrine monooxygenase activity of liver
microsomal cytochrome P-450 from alloxan-induced diabetic rats.
Drug Metab. Dispos. 19, 291-297 (1991)
rat
PIR E41425 (16 amino acids)
Imaoka, S., Kamataki, T. and Funae, Y.
Purification and characterization of six cytochromes P-450
from hepatic microsomes of immature female rats.
J. Biochem. 102, 843-851 (1987)
rat
PIR C60822 (20 amino acids)
Amelizad, Z., Narbonne, J.F., Wolf, C.R., Robertson, L.W. and
Oesch, F.
Effect of nutritional imbalances on cytochrome P-450 isozymes
in rat liver.
Biochem. Pharmacol. 37, 3245-3249 (1988)
rabbit
PIR A45103 (15 amino acids)
Ishida, H., Noshiro, M., Okuda, K. and Coon, M.J.
Purification and characterization of 7
alpha-hydroxy-4-cholesten-3-one 12 alpha-hydroxylase.
J. Biol. Chem. 267, 21319-21323 (1992)
This is a P450 involved in bile acid synthesis and it may be a new
mammalian family.
pig
PIR S15850 (9 amino acids)
Bergman, T. and Postlind, H.
Characterization of mitochondrial cytochromes P-450 from pig
kidney and liver catalysing 26-hydroxylation of
25-hydroxyvitamin D(3) and C(27) steroids.
Biochem. J. 276, 427-432 (1991)
pig
PIR S17048 (10 amino acids)
Sono, H., Sonoda, Y. and Sato, Y.
Purification and characterization of cytochrome P-450(14DM)
(lanosterol 14-alpha-demethylase) from pig liver
microsomes.
Biochim. Biophys. Acta 1078, 388-394 (1991)
May be N-terminal of pig CYP51 see rat CYP51 (missing N-terminal)
Chicken
PIR S29135 (28 amino acids) PIR S29136 (28 amino acids)
PIR S29137 (16 amnio acids) PIR S29138 (30 amino acids)
Gupta, R.P., Lapadula, D.M. and Abou-Donia, M.B.
Purification and characterization of cytochrome P450 isozymes
from beta-naphthoflavone-induced adult hen liver.
Arch. Biochem. Biophys. 282, 170-182 (1990)
chicken
PIR A44107 (25 amino acids)
Nakai, K., Ward, A.M., Gannon, M. and Rifkind, A.B.
Beta-naphthoflavone induction of a cytochrome P-450
arachidonic acid epoxygenase in chick embryo liver distinct
from the aryl hydrocarbon hydroxylase and from
phenobarbital-induced arachidonate epoxygenase.
J. Biol. Chem. 267, 19503-19512 (1992)
PIR A61056 (19 amino acids)
Gupta, R.P., Lapadula, D.M. and Abou-Donia, M.B.
Purification and characterization of cytochrome P-450
isozymes from phenobarbital-induced adult hen liver.
Comp. Biochem. Physiol. 96C, 163-176 (1990)
chicken
PIR B61056 (19 amino acids)
Gupta, R.P., Lapadula, D.M. and Abou-Donia, M.B.
Purification and characterization of cytochrome P-450
isozymes from phenobarbital-induced adult hen liver.
Comp. Biochem. Physiol. 96C, 163-176 (1990)
chicken
PIR B44107 (26 amino acids)
Nakai, K., Ward, A.M., Gannon, M. and Rifkind, A.B.
Beta-naphthoflavone induction of a cytochrome P-450
arachidonic acid epoxygenase in chick embryo liver distinct
from the aryl hydrocarbon hydroxylase and from
phenobarbital-induced arachidonate epoxygenase.
J. Biol. Chem. 267, 19503-19512 (1992)
chicken
PIR C44107 (26 amino acids)
Nakai, K., Ward, A.M., Gannon, M. and Rifkind, A.B.
Beta-naphthoflavone induction of a cytochrome P-450
arachidonic acid epoxygenase in chick embryo liver distinct
from the aryl hydrocarbon hydroxylase and from
phenobarbital-induced arachidonate epoxygenase.
J. Biol. Chem. 267, 19503-19512 (1992)
Oncorhynchus mykiss (trout)
GenEMBL T23107 (310bp)
Gong,Z.
unpublished (1994)
Note: 45.1% identical to 52A10, with similarity to other 52 family
members