P450s that have appeared since the 1993 P450 nomenclature update.
      Part C covering CYP10 to CYP69 
      Animal P450 names continue at CYP301 in this file
      Lower eukaryote names continue at CYP501 in this file

      ALL Drosophila melanogaster P450S ARE NOW INCLUDED

      This includes list references that were incomplete and duplications of 
      sequences that were already in the update.  If a sequence is assigned an 
      accession number that was not in the old update it is included in this list.  
      Some expressed sequence tags (ESTs) are also included from humans.

      This list was last revised on Feb. 4, 2003. 
      Added all Neurospora P450s see 527A1-553A1
      Added CYP51 Trypanosoma brucei
      Added Fugu P450s, added CYP505A3 added CYP18 Spodoptera
      Added all human genes and pseudogenes
      Added 104 Anopheles genes
      Revised May 13, 2003 named all remaining Dictyostelium P450 genes
      Compiled by David R. Nelson

There are 80 C. elegans P450s listed here, surpassing the known mouse and human 
complements.  The C. elegans genome is now about 90% finished sequence and 9% 
unfinished (one strand or one chemistry).  About 1Mb is not cloned at three 
telomeres and two internal sites.  A special issue of Science appeared in Dec. 98 even though the genome is not completely done.  The amount of sequence in the Blast searchable database at Washington Univ. is 140Mb, more than the 100Mb size of the genome.  Therefore, we can guess that this set includes all the P450 genes in C. elegans, but the distrubution is not even. Most P450 genes (43 genes) are on chromosome V. see additional info on C. elegans P450s see this list. To see the actual sequences go to the C. elegans sequence file.  So far we are missing CYP11A, CYP11B, CYP17, CYP19, CYP21, CYP24 and CYP27A and CYP27B.  Does C. elegans make steroids? The present evidence would suggest not. Does C. elegans have 
mitochondrial P450s?  There is one probable mitochondrial P450 in C. elegans on 
cosmid ZK177 named CYP44. It was thought to be incomplete but now a full length sequence
has been assembled from the genomic sequence.

10A Subfamily

11A Subfamily

11B Subfamily

12A Subfamily

12B Subfamily

13A Subfamily

13B Subfamily

14A Subfamily

16A Subfamily

17A Subfamily

18A Subfamily

19A Subfamily

21A Subfamily

22A Subfamily

23A Subfamily

24A Subfamily

25A Subfamily

26A Subfamily

27A Subfamily

27B Subfamily

28A Subfamily

29A Subfamily

30A Subfamily

31A Subfamily

32A Subfamily

33A Subfamily

33B Subfamily

33C Subfamily

33D Subfamily

33E Subfamily

34A Subfamily

35A Subfamily

35B Subfamily

35C Subfamily

35D Subfamily

36A Subfamily

37A Subfamily

37B Subfamily

40A Subfamily

41A Subfamily

42A Subfamily

43A Subfamily

44A Subfamily

45A Subfamily

46A Subfamily

51A Subfamily

52A Subfamily

52B Subfamily

52C Subfamily

52D Subfamily

52E Subfamily

52F Subfamily

53A Subfamily

53B Subfamily

54A Subfamily

55A Subfamily

56A Subfamily

57A Subfamily

58A Subfamily

59A Subfamily

60A Subfamily

60B Subfamily

61A Subfamily

62A Subfamily

63A Subfamily

64A Subfamily

65A Subfamily

66A Subfamily

67A Subfamily


10A Subfamily

CYP10       Lymnaea stagnalis (pond snail)
            GenEMBL S46130 (1870bp) PIR JX0225 (545 amino acids)
            Teunissen,Y., Geraerts,W.P., van Heerikhuizen,H., Planta,R.J.
            and Joosse,J. 
            Molecular cloning of a cDNA encoding a member of a novel
            cytochrome P450 family in the mollusc Lymnaea stagnalis.
            J. Biochem. 112, 249-252 (1992)

11A Subfamily

CYP11A1     human
            PIR A48733 (239 amino acids)
            Matteson, K.J., Chung, B.C., Urdea, M.S. and Miller, W.L.
            Study of cholesterol side-chain cleavage (20,22 desmolase)
            deficiency causing congenital lipoid adrenal hyperplasia
            using bovine-sequence P450scc oligodeoxyribonucleotide
            probes.
            Endocrinology 118, 1296-1305 (1986)

CYP11A1     rabbit
            GenEMBL S59219 (1336bp) PIR A49189 (445 amino acids)
            Yang,X., Iwamoto,K., Wang,M., Artwhol,J., Mason,J.I. and 
            Pang,S.
            Inherited congenital adrenal hyperplasia in the rabbit is caused by
            a deletion in the gene encoding cytochrome P450 cholesterol side
            chain cleavage enzyme.
            Endocrinol. 132, 1977-1982 (1993)

CYP11A1     bovine
            PIR A42033 (18 amino acids)
            Pikuleva, I.A., Lapko, A.G., Chashchin, V.L.
            Functional reconstitution of cytochrome P-450-scc with hemin
            activated with Woodward's reagent K. Formation of a
            hemeprotein cross-link.
            J. Biol. Chem. 267, 1438-1442 (1992)

CYP11A1     Sus scrofa (pig)
            GenEMBL L34259 (2376bp)
            Urban,R.J., Shupnik,M.A. and Bodenburg,Y.H.
            Insulin-like growth factor-I increases expression of the porcine
            P-450 cholesterol side chain cleavage gene through a GC-rich domain.
            J. Biol. Chem. 269, 25761-25769 (1994)

CYP11A1       goat
            GenEMBL D50058 (1825bp)
            Okuyama,E., Okazaki,T., Furukawa,A., Wu,R.-F. and Ichikawa,Y.
            Molecular cloning and nucleotide sequences of cDNA clones of sheep 
and goat 
            adrenolcortical cytochrome P450scc.
            unpublished (1995)

CYP11A1       sheep
            GenEMBL D50057 (1825bp)
            Okuyama,E., Okazaki,T., Furukawa,A., Wu,R.-F. and Ichikawa,Y.
            Molecular cloning and nucleotide sequences of cDNA clones of sheep 
and goat 
            adrenolcortical cytochrome P450scc.
            unpublished (1995)

CYP11A1     bovine
            PIR S29644 (21 amino acids)
            Tsujita, M. and Ichikawa, Y.
            Substrate-binding region of cytochrome P-450(scc) (P-450
            XIA1). Identification and primary structure of the
            cholesterol binding region in cytochrome P-450(scc).
            Biochim. Biophys. Acta 1161, 124-130 (1993)

Cyp11a1       mouse
             GenEMBL J05511
             Rice,D.A., Kirkman,M.S., Aitkin,L.D., Mouw,A.R., Schimmer,B.P. and
             Parker,K.L.
            Analysis of the promoter region of the gene encoding mouse
            cholesterol side-chain cleavage enzyme
            J. Biol. Chem. 265, 11713-11720 (1990)

CYP11A1     Oncorhynchus mykiss (rainbow trout)
            GenEMBL S57305 (1789bp) Swiss Q07217 (514 amino acids)
            PIR S32197 (514 amino acids)
            Takahashi,M., Tanaka,M., Sakai,N., Adachi,S., Miller,W.L. 
            and Nagahama,Y.
            Rainbow trout ovarian cholesterol side-chain cleavage
            cytochrome P450 (P450scc).  cDNA cloning and mRNA expression 
            during oogenesis.
            FEBS Lett. 319, 45-48 (1993)

CYP11A1     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_1630
37515 MARWSVWRSPVVLPLSRMEVPMTGARHSSTMPVARQTYSDSSSFV 37381
37380 RSFNDIPGLWKNGVANLYNFWKLDGFRNLHHIMVQNFNTFGPIYR 37246 (2)
37065 EKIGYYESVNIINPEDAAILFKAEGHYPKRLKVEAWTSYRDYRNRKYGVLLK 36904 (2)
36822 NGEEWRCNRVLLNKEVISPKVLENFVPLLDEVGNDFVVRVHKKIARSGQNKWTTDLSQELFKYALES 36628 (1)
36506 VSSVLYGERLGLFLDYIDPEAQHFIDCISLMFKTTSPML
      YIPPALLRKVGAKVWRDHVEAWDGIFNQ 36300 (1 expected) bad boundary
36208 ADRCIQNIYRRLRQETGPSKKYPGVLASLLLRDKLSIEDIKASITELMAGGVDT 36047 (0)
35975 TSITLLWTLYELARHPNLQEELRAEVAAARTESQGDMLEMLKRIPLVKGALKETLR 35808 (2)
35731 LHPVAVSLQRYIAEDIIIQNYHIPAG 35654 (0)
35568 TLVQLGLYAMGRDPKVFFRPEQYQPSRWLRSETHYFKSLGFGFGPRQCLGRRIAEAEMQLFLIH 35377 (0)
35297 MLENFRVEKQRHMEVQSTFELILLPDKPIILTLKPLSS* 35181

CYP11A1      Dasyatis americana (southern stingray)
            GenEMBL U63299(4619bp)
            Nunez,B.S. and Trant,J.M.
            Isolation of the cDNA encoding the interrenal form of cholesterol
            side chain cleavage cytochrome P450 of the southern stingray
            (Dasyatis americana).
            unpublished (1996)

11B Subfamily

CYP11B1     human
            GenEMBL J05140 M32863 (1482bp) M32878 (2633bp) M32879 (1155bp)
            Mornet,E., Dupont,J., Vitek,A. and White,P.C.
            Characterization of two genes encoding human steroid 11-beta-
            hydroxylase (P-450-11-beta).
            J. Biol. Chem. 264, 20961-20967 (1989)

CYP11B1     human
            GenEMBL D16155 (156bp)
            Naiki,Y., Shizuta,Y., Kawamoto,T., Yasushiro,M., Miyahara,K., 
            Toda,K., Tadao,O. and Imura,H.
            A nonsense mutation (TGG(116Arg)-TAG(stop)) in CYP11B1 
            causes steroid 11beta-hydroxylase deficiency.
            J. Clin. Endocrinol. Metab. 77, 1677-1682 (1993)

CYP11B1     human 
            GenEMBL D10169 D90428 (2085bp)
            Kawamoto,T., Mitsuuchi,Y., Toda,K., Yokoyama,Y., Miyahara,K., 
            Miura,S., Ohnishi,T., Ichikawa,Y., Nakao,K., Imura,H., Ulick,S. 
            and Shizuta,Y.
            Role of steroid 11beta-hydroxylase and steroid 18-hydroxylase in the 
            biosynthesis of glucocorticoids and mineralocorticoids in humans.
            Proc. Natl. Acad. Sci. USA 89, 1458-1462 (1992)

CYP11B1     human
            PIR S29068 (30 amino acids)
            Kawamoto, T., Mitsuuchi, Y., Toda, K., Miyahara, K.,
            Yokoyama, Y., Nakao, K., Hosoda, K., Yamamoto, Y., Imura,
            H. and Shizuta, Y.
            Cloning of cDNA and genomic DNA for human cytochrome P-450
            (11-beta).
            FEBS Lett. 269, 345-349 (1990)

CYP11B1      Papio hamadryas ursinus (chacma baboon)
            GenEMBL U52085(228bp)
            Hampf,M., Swart,A. and Swart,P.
            Expression of Papio ursinus steroid 11beta hydroxylase
            unpublished (1996)

CYP11B1     rat
            GenEMBL D10107 (1528bp) S58847
            PIR B46040 (499 amino acids)
            Matsukawa,N., Nonaka,Y., Higaki,J., Nagano,M., Mikami,H.,
            Ogihara,T. and Okamoto,M.
            Dahl's salt-resistant normotensive rat has mutations in 
            cytochrome P450 (11 beta), but the salt-sensitive hypertensive
            rat does not.
            J. Biol. Chem. 268, 9117-9121 (1993)
            Note: only 1 amino acid difference with 11B1 at position 84
            E (normal) changed to G (seen in 11B2 and 11B3)
            D10107 has five differences with D11354

CYP11B1     rat
            GenEMBL S58858 (330bp) PIR JX0251(499 amino acids)
            Nomura,M., Morohashi, K.-i., Kirita, S., Nonaka, Y.,
            Okamoto,M., Nawata,H. and Omura,T.
            Three forms of rat CYP11B genes: 11 beta-hydroxylase gene,
            aldosterone synthase gene, and a novel gene.
            J. Biochem. 113, 144-152 (1993)
            Note: Fig. 3 has 6 errors. 1,2) aldo-46 amino acids 2 and 3 are
            incorrectly translated gctctc = AL not HS. 3) 11B3 codon at 559-561
            incorrectly translated gac = D not N. 4,5,6) 11beta-62, 11B1 
            and 11B3 codon at 964-966 tcc = Ser not Pro.

CYP11B1     rat
            GenEMBL S58849, D14086 to D14091 PIR A46039 (499 amino acids)
            Mukai,K., Imai,M., Shimada,H. and Ishimura,Y.
            Isolation and characterization of rat CYP11B genes involved
            in late steps of mineralo- and glucocorticoid syntheses.
            J. Biol. Chem. 268, 9130-9137 (1993)

CYP11B1     rat
            GenEMBL S63899 (596bp)
            Mukai,K., Imai,M., Shimada,H., Okada,Y., Ogishima,T. and
            Ishimura,Y.
            Structural differences in 5'-flanking regions of rat cytochrome
            P-450aldo and P-450(11) beta genes
            Biochem. Biophys. Res. Commun. 180, 1187-1193 (1991)

CYP11B1     rat
            GenEMBL D11354 (1528bp) PIR A46040 (499 amino acids)
            Matsukawa,N., Nonaka,Y., Higaki,J., Nagano,M., Mikami,H.,
            Ogihara,T. and Okamoto,M.
            Dahl's salt-resistant normotensive rat has mutations in 
            cytochrome P450 (11 beta), but the salt-sensitive hypertensive
            rat does not.
            J. Biol. Chem. 268, 9117-9121 (1993)
            Note: This sequence is the wild type DS rat sequence.  The mutant 
            DR rat has five amino acids differences accession number D10107

CYP11B1     Cavia porcellus (guinea pig)
            GenEMBL Z69785(2028bp)
            Bulow,H.E., Mobius,K., Bahr,V. and Bernhardt,R.
            Molecular cloning and functional expression of the cytochrome P450
            11B-hydroxylase of the guinea pig.
            Biochem. Biophys. Res. Commun. 221, 304-312 (1996)

CYP11B1     bovine
            PIR JX0151 (503 amino acids)
            Kirita, S., Hashimoto, T., Kitajima, M., Honda, S.,
            Morohashi, K. and Omura, T.
            Structural analysis of multiple bovine P-450(11beta) genes
            and their promoter activities.
            J. Biochem. 108, 1030-1041 (1990)

CYP11B1     pig
            GenEMBL D38590(1671bp)
            Sun,T., Zhao,Y., Nonaka,Y. and Okamoto,M.
            Cloning and expression of cytochrome P450(11 beta) of porcine
            adrenal cortex.
            J. Steroid Biochem. Mol. Biol. 52, 227-232 (1995)

CYP11B1     sheep
            GenEMBL L34337 (1639bp)
            Boon,W.C., Roche,P.J., Hammond,V.E., Jeyaseelan,K.,
            Crawford,R.J. and Coghlan,J.P.
            Cloning and expression analysis of a cytochrome P450-11-beta
            cDNA in sheep.
            unpublished (1994)

CYP11B1     Ovis aries (sheep)
            GenEMBL L28716 (2300bp)
            Anwar,A., Jeyaseelan,K. and Coghlan,J.P.
            Molecular cloning and characterization of the ovine CYP11B1 
promoter.
            Biochem. Mol. Biol. Int. 33, 1169-1178 (1994)

CYP11B1     Ovis ammon (sheep)
            GenEMBL L47569(9027bp)
            Anwar,A., Jeyaseelan,K. and Coghlan,J.P.
            Characterization of an ovine 11-beta hydroxylase (cyp11b) gene.
            unpublished (1995)

Cyp11b1    mouse
            PIR A41552 (500 amino acids)
            Domalik, L.J., Chaplin, D.D., Kirkman, M.S., Wu, R.C., Liu,
            W., Howard, T.A., Seldin, M.F., Parker, K.L.
            Different isozymes of mouse 11beta-hydroxylase produce
            mineralocorticoids and glucocorticoids.
            Mol. Endocrinol.  5, 1853-1861 (1991)

Cyp11b1    mouse
            PIR A32210 (42 amino acids)
            Mouw, A.R., Rice, D.A., Meade, J.C., Chua, S.C., White, P.C.,
            Schimmer, B.P. and Parker, K.L.
            Structural and functional analysis of the promoter region of
            the gene encoding mouse steroid 11beta-hydroxylase.
            J. Biol. Chem. 264, 1305-1309 (1989)

CYP11B2     human
            GenEMBL J05140 M32864 (1809bp) M32880 (3088bp) M32881 (1101bp)
            Mornet,E., Dupont,J., Vitek,A. and White,P.C.
            Characterization of two genes encoding human steroid 11-beta-
            hydroxylase (P-450-11-beta).
            J. Biol. Chem. 264, 20961-20967 (1989)

CYP11B2     human
            GenEMBL D90429 D10170 (2114bp)
            Kawamoto,T., Mitsuuchi,Y., Toda,K., Yokoyama,Y., Miyahara,K., 
            Miura,S., Ohnishi,T., Ichikawa,Y., Nakao,K., Imura,H., Ulick,S. 
            and Shizuta,Y.
            Role of steroid 11beta-hydroxylase and steroid 18-hydroxylase in the 
            biosynthesis of glucocorticoids and mineralocorticoids in humans.
            Proc. Natl. Acad. Sci. USA 89, 1458-1462 (1992)

CYP11B2     human 
            GenEMBL D13752(6910bp)
            Kawamoto,T., Mitsuuchi,Y., Ohnishi,T., Ichikawa,Y., Yokoyama,Y.,
            Sumimoto,H., Toda,K., Miyahara,K., Kuribayashi,I., Nakao,K.,
            Hosoda,K., Yamamoto,Y., Imura,H. and Shizuta,Y.
            Cloning and expression of a cDNA for human cytochrome P-450aldo as
            related to primary aldosteronism.
            Biochem. Biophys. Res. Commun. 173 (1), 309-316 (1990)

CYP11B2      human
            GenEMBL S77398 S77401 S77403 S77406 S77409 (genomic sequences)
            Shizuta,Y., Kawamoto,T., Mitsuuchi,Y., Miyahara,K., Rosler,A.,
            Ulick,S. and Imura,H.
            Inborn errors of aldosterone biosynthesis in humans.
            Steroids 60 (1), 15-21 (1995)

CYP11B2     rat
            GenEMBL S58859 (353bp) PIR JX0252 (500 amino acids)
            Nomura,M., Morohashi, K.-i., Kirita, S., Nonaka, Y.,
            Okamoto,M., Nawata,H. and Omura,T.
            Three forms of rat CYP11B genes: 11 beta-hydroxylase gene,
            aldosterone synthase gene, and a novel gene.
            J. Biochem. 113, 144-152 (1993)

CYP11B2     rat
            GenEMBL S58850 D14092 to D14097
            PIR B46039 (500 amino acids)
            Mukai,K., Imai,M., Shimada,H. and Ishimura,Y.
            Isolation and characterization of rat CYP11B genes involved
            in late steps of mineralo- and glucocorticoid syntheses.
            J. Biol. Chem. 268, 9130-9137 (1993)

CYP11B2     rat
            GenEMBL S63898 (594bp)
            Mukai,K., Imai,M., Shimada,H., Okada,Y., Ogishima,T. and Ishimura,Y.
            Structural differences in 5'-flanking regions of rat cytochrome
            P-450aldo and P-450(11) beta genes.
            Biochem. Biophys. Res. Commun. 180, 1187-1193 (1991)

CYP11B2     rat
            GenEMBL S64136 (3001bp) PIR JN0615 (506 amino acids)
            GenEMBL U14908 (3000bp)
            Zhou,M. and Gomez-Sanchez,C.E.
            Cloning and expression of a rat cytochrome P-450 11
            beta-hydroxylase/aldosterone synthase (CYP11B2) cDNA variant.
            Biochem. Biophys. Res. Commun. 194, 112-117 (1993)
            Erratum: Biochem. Biophys. Res. Commun. 196, 1018 (1993)

CYP11B2     rat
            PIR A34281 (20 amino acids) Swiss P30099 (510 amino acids)
            Ogishima, T., Mitani, F., Ishimura, Y.
            Isolation of aldosterone synthase cytochrome P-450 from zona
            glomerulosa mitochondria of rat adrenal cortex.
            J. Biol. Chem.  264, 10935-10938 (1989)
            Note: sequence is from N-terminal after signal sequence

CYP11B2     rat
            Swiss P30099 (510 amino acids) GenEMBL D00567 (2824bp)
            Swiss P30100 (500 amino acids) GenEMBL D00568 (2705bp)
            Matsukawa,N., Nonaka,Y., Ying,Z., Higaki,J., Ogihara,T. and
            Okamoto,M.
            Molecular cloning and expression of cDNAs encoding rat aldosterone
            synthase: variants of cytochrome P-450 11beta
            Biochem. Biophys. Res. Commun. 169, 245-252 (1990)

CYP11B2     hamster
            GenEMBL S73810 (1503bp)
            LeHoux,J.G., Mason,J.I., Bernard,H., Ducharme,L., LeHoux,J.,
            Veronneau,S. and Lefebvre,A.
            The presence of two cytochrome P450 aldosterone synthase mRNAs in
            the hamster adrenal.
            J. Steroid Biochem. Mol. Biol. 49, 131-137 (1994)

CYP11B2      cavia porcellus (domestic guinea pig)
            GenEMBL   AF018569
            Buelow,H.E. and Bernhardt,R.
            Molecular Cloning and Functional Expression of the Guinea Pig
            Aldosterone Synthase.
            Unpublished

CYP11B2     Rana catesbeiana (bullfrog)
            GenEMBL D10984 (1919bp)
            Nonaka,Y., Takemori,H., Halder,S.K., Sun,T., Ohta,M., Hatano,O., 
            Takakusa,A. and Okamoto,M.
            Frog Cytochrome P-450 (11 beta, aldo), a single enzyme involved in 
the final steps 
            of glucocorticoid and mineralocorticoid biosynthesis.
            Eur. J. Biochem. 229, 249-256 (1995)

Cyp11b2    mouse
            GenEMBL S85260 (2804bp)
            Domalik,L.J., Chaplin,D.D., Kirkman,M.S., Wu,R.C., Liu,W.W.,
            Howard,T.A., Seldin,M.F. and Parker,K.L.
            Different isozymes of mouse 11 beta-hydroxylase produce
            mineralocorticoids and glucocorticoids.
            Mol. Endocrinol. 5, 1853-1861 (1991)

CYP11B3     rat
            PIR JX0253 (498 amino acids)
            Nomura,M., Morohashi, K.-i., Kirita, S., Nonaka, Y.,
            Okamoto,M., Nawata,H. and Omura,T.
            Three forms of rat CYP11B genes: 11 beta-hydroxylase gene,
            aldosterone synthase gene, and a novel gene.
            J. Biochem. 113, 144-152 (1993)

CYP11B3     rat
            GenEMBL U14907 (1497bp)
            Zhou,M., Gomez-Sanchez,E.P., Foecking,M. and Gomez-Sanchez,C.E.
            Cloning of the cytochrome P-450 CYP11B3 complementary DNA in the
            rat.
            unpublished

CYP11B3     rat
            GenEMBL S59144 D14098 to D14103
            Mukai,K., Imai,M., Shimada,H. and Ishimura,Y.
            Isolation and characterization of rat CYP11B genes involved
            in late steps of mineralo- and glucocorticoid syntheses.
            J. Biol. Chem. 268, 9130-9137 (1993)
            Note: only one amino acid difference with Nomura's 11B3

CYP11B3      rat
            GenEMBL U17082(1497bp)
            Mellon,S.H., Bair,S.R. and Monis,H.
            P450c11B3 mRNA, transcribed from a third P450c11 gene, is expressed
            in a tissue-specific, developmentally, and hormonally regulated
            fashion in the rodent adrenal and encodes a protein with both
            11-hydroxylase and 18-hydroxylase activities.
            J. Biol. Chem. 270, 1643-1649 (1995)

CYP11B8P    rat pseudogene
            GenEMBL D14104 to D14108
            Mukai,K., Imai,M., Shimada,H. and Ishimura,Y.
            Isolation and characterization of rat CYP11B genes involved
            in late steps of mineralo- and glucocorticoid syntheses.
            J. Biol. Chem. 268, 9130-9137 (1993)
            Note: authors call this sequence 11B4

12A Subfamily

CYP12A1       Musca domestica (housefly)
            GenEMBL U86618
            Rene Feyereisen
MIKYKQYSRAIVALRQRGAQQYSTNVTNASQPDVKATTTTTISP
EWQEAKPFEEMPSMNSWPIIKNMLPWGKYGKMEPTQFLMALRDDMGPIVRTAAFMGRP
PTVITHNPHDFEMVFRNEGIWPIRPGGDAQMYHRTVLREDFFQGVTGLVSVNGEKWGN
FRSTVNPVLMQPKNVRLYLNKMAQVNDEFMARIRQIRDPETLEVPASFQEEMNRWTLE
SVSVVALDKQLGLITTNRDNPDLKKLIGLLNDFFELGQKIEFGLPFWKYIKTPTFKLF
MKTLDGLLEIGNKYVNEAIDRLEAERQSGVPEKPENEKSVLEKLIKIDRKIATVMAID
MILAGVDTTSTTFTALLLCLAKNPEKQEKLREEIRQILPRKDSQFEPSSLNHIPYTRA
CIKEALRMYPLTLGNARILANDTVLSGYRVPKGTLVSMISTGLLQDDNHYTKAKEYLP
ERWMRPTKEETEDSATCPHALKASSPFIYLPFGFGPRSCVGRRIVEMELELGIARLVR
NFRIEFNYPTENAFKFKLINVPNIPLKFKFTDVEN

CYP12A2       Musca domestica (housefly)
            GenEMBL U94698
            Rene Feyereisen

CYP12A3       Musca domestica (housefly)
            GenEMBL U94699
            Rene Feyereisen

cyp12a4 Drosophila melanogaster
            GenEMBL AC006091 85973-87865 also AC015190 AC008141
            chromosome 3 clone BACR48G05 (D475)
MLKVRSALSLIQSQKATLSLATQKHTEYFKILLYIYINKLYYQRWQTNVATAEAREDSEW
LQAKPFEQIPRLNMWALSMKMSMPGGKYKNMELMEMFEAMRQDYGDIFFMPGIMGNPPFL
STHNPQDFEVVFRNEGVWPNRPGNYTLLYHREEYRKDFYQGVMGVIPTQGKPWGDFRTVV
NPVLMQPKNVRLYYKKMSQVNQEFVQRILELRDPDTLEAPDDFIDTINRWTLESVSVVAL
DKQLGLLKNSNKESEALKLFHYLDEFFIVSIDLEMKPSPWRYIKTPKLKRLMRALDGIQE
VTLAYVDEAIERLDKEAKEGVVRPENEQSVLEKLLKVDRKVATVMAMDMLMAGVDTTSST
FTALLLCLAKNPEKQARLREEVMKVLPNKNPEFTEASMKNVPYLRACIKESQRLHPLIVG
NARVLARDAVLSGYRVPAGTYVNIVPLNALTRDEYFPQASEFLPERWLRSPKDSESKCPA
NELKSTNPFVFLPFGFGPRMCVGKRIVEMELELGTARLIRNFNVEFNYPTENAFRSALIN
LPNIPLKFKFIDLPN

cyp12a5 Drosophila melanogaster
              GenEMBL AC006091 83449-85296 also AC015190 AC008141 
              chromosome 3 clone BACR48G05 (D475)
              76% identical to other AC006091 SEQ. 58% TO 12A1, 12A2
MLKGRIALNILQSQKPIVFSASQQ*RWQTNVPTAEIRNDPEWLQAKPFEE
IPKANILSLFAKSALPGGKYKNLEMMEMIDALRQDYGNIIFLPGMMGRDG
LVMTHNPKDFEVVFRNEGVWPFRPGSDILRYHRTVYRKDFFDGVQGIIPS
QGKSWGDFRSIVNPVLMQPKNVRLYFKKMSQVNQEFIKEIRDASTQEVPG
NFLETINRWTLESVSVVALDKQLGLLRESGKNSEATKLFKYLDEFFLHSA
DLEMKPSLWRYFKTPLLKKMLRTMDSVQEVTLKYVDEAIERLEKEAKEGV
VRPEHEQSVLEKLLKVDKKVATVMAMDMLMAGVDTTSSTFTALLLCLAKN
PEKQARLREEVMKVLPNKDSEFTEASMKNVPYLRACIKESQRVYPLVIGN
ARGLTRDSVISGYRVPAGTIVSMIPINSLYSEEYFPKPTEFLPERWLRNA
SDSAGKCPANDLKTKNPFVFLPFGFGPRMCVGKRIVEMELELGTARLIRN
FNVEFNHSTKNAFRSALINLPNIPLKFKFKFTDVPN*

CYP12A6       Drosophila wassermani
              no accession number
              Tina Yee and Phil Danielson
              59% to 12A2 56% to 12A1
              N-terminal does not match known P450 sequences may be in error

12B Subfamily

CYP12B1     Drosophila acanthoptera
            no accession number
            Phil Danielson
            Ac40
            submitted to nomenclature committee

Cyp12b2     Drosophila melanogaster
            GenEMBL AC018326 7227-9141 also AC004345 AC004657
            77% identical to 12b1

Cyp12c1     Drosophila melanogaster
            GenEMBL AC009385 comp(57935-59646) also AC012807

Cyp12d1     Drosophila melanogaster
            GenEMBL AC008187 comp(84371-86114)

Cyp12e1     Drosophila melanogaster
            GenEMBL AC018294 2070-3932

CYP12F1     Anopheles gambiae (malaria vector)
            No accession number
            Submitted by Rene Feyereisen July 5, 2002
            Low 40% range with other CYP12s 60% to 12F2

CYP12F1     Anopheles gambiae (malaria mosquito)
            GenEMBL 
            Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
            See the paper: Evolution of supergene families associated with 
            insecticide resistance.
            Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
            Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
            Science 298, 179-181 (2002)
            Anopheles map name CYPl3r4

CYP12F2     Anopheles gambiae (malaria vector)
            No accession number
            Submitted by Rene Feyereisen July 5, 2002
            Low 40% range with other CYP12s 60% to 12F1

CYP12F2     Anopheles gambiae (malaria mosquito)
            GenEMBL 
            Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
            See the paper: Evolution of supergene families associated with 
            insecticide resistance.
            Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
            Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
            Science 298, 179-181 (2002)
            Anopheles map name CYPl3r3

CYP12F3     Anopheles gambiae (malaria vector)
            No accession number
            Submitted by Rene Feyereisen July 5, 2002
            Low 40% range with other CYP12s 60% to 12F2

CYP12F3     Anopheles gambiae (malaria mosquito)
            GenEMBL 
            Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
            See the paper: Evolution of supergene families associated with 
            insecticide resistance.
            Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
            Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
            Science 298, 179-181 (2002)
            Anopheles map name CYPl3r2

CYP12F4     Anopheles gambiae (malaria vector)
            No accession number
            Submitted by Rene Feyereisen July 5, 2002
            Low 40% range with other CYP12s 55% to 12F2

CYP12F4     Anopheles gambiae (malaria mosquito)
            GenEMBL 
            Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
            See the paper: Evolution of supergene families associated with 
            insecticide resistance.
            Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
            Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
            Science 298, 179-181 (2002)
            Anopheles map name CYPl3r1

13A Subfamily

CYP13A1     Caenorhabditis elegans (nematode worm)
            GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
            Wilson,R., Ainscough,R., Anderson,K., Baynes,C., Berks,M.,
            Bonfield,J., Burton,J., Connell,M., Copsey,T., Cooper,J.,
            Coulson,A., Craxton,M., Dear,S., Du,Z., Durbin,R., Favello,A.,
            Fulton,L., Gardner,A., Green,P., Hawkins,T., Hillier,L., Jier,M.,
            Johnston,L., Jones,M., Kershaw,J., Kirsten,J., Laister,N.,
            Latreille,P., Lightning,J., Lloyd,C., McMurray,A., Mortimore,B.,
            O'Callaghan,M., Parsons,J., Percy,C., Rifken,L., Roopra,A.,
            Saunders,D., Shownkeen,R., Smaldon,N., Smith,A., Sonnhammer,E.,
            Staden,R., Sulston,J., Thierry-Mieg,J., Thomas,K., Vaudin,M.,
            Vaughan,K., Waterston,R., Watson,A., Weinstock,L.,
            Wilkinson-Sproat,J. and Wohldman,P.
            2.2 Mb of contiguous nucleotide sequence from chromosome III of C. 
elegans.
            Nature 368, 32-38 (1994)
             Product T10B9.8

CYP13A2     Caenorhabditis elegans (nematode worm)
            GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
            see CYP13A1 for reference.
             Product T10B9.7

CYP13A3     Caenorhabditis elegans (nematode worm)
            GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
            see CYP13A1 for reference.
             Product T10B9.5

CYP13A4     Caenorhabditis elegans (nematode worm)
            GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
            see CYP13A1 for reference.
             Product T10B9.1

CYP13A5     Caenorhabditis elegans (nematode worm)
            GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
            see CYP13A1 for reference.
             Product T10B9.2

CYP13A6     Caenorhabditis elegans (nematode worm)
            GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
            see CYP13A1 for reference.
             Product T10B9.3

CYP13A7     Caenorhabditis elegans (nematode worm)
            GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
            see CYP13A1 for reference.
             Product T10B9.10

CYP13A8     Caenorhabditis elegans (nematode worm)
            GenEMBL Z48717 (34881 bp) and Z83130 (ZK1325) and Z92859 (Y53C12)
            see CYP13A1 for reference.
             Product T10B9.4

CYP13A9P    Caenorhabditis elegans (nematode worm)
            GenEMBL Z48717 (34881 bp) and Z92859 (Y53C12)
            see CYP13A1 for reference.
            Product T10B9.6 pseudogene
            Note: closer inspection suggests this may not be a pseudogene.
            A complete gene can be assembled, but perhaps the genefinder program 
did not see 
            it.

CYP13A10    Caenorhabditis elegans (nematode worm)
            GenEMBL Z46934 (35989 bp) and Z92859 (Y53C12)
            see CYP13A1 for reference. 
            product ZK1320.4

CYP13A11    Caenorhabditis elegans (nematode worm)
            GenEMBL  Z81503 (F14F7) and Z92819 (Y37D8) and Z94158 (Y39E4)
            F14F7.a contig 666 10-13k region

CYP13A12    Caenorhabditis elegans (nematode worm)
            GenEMBL  Z81503 (F14F7) and Z92819 (Y37D8) and Z94158 (Y39E4)
            F14F7.b contig 666 13-18k region

13B Subfamily

CYP13B1    Caenorhabditis elegans (nematode worm)
            GenEMBL Z54269 (19692bp) and Z92827 (C29F7)
            F02C12.5 also C29F7 has the end of this sequence
            Formerly CYP16A1 but renamed after full length sequence sorted with 
CYP13

CYP13B2    Caenorhabditis elegans (nematode worm)
            GenEMBL  Z81565
            K06G5

14A Subfamily

CYP14A1    Caenorhabditis elegans (nematode worm)
            GenEMBL Z50742 (20856bp)
            Wilson,R., Ainscough,R., Anderson,K., Baynes,C., Berks,M.,
            Bonfield,J., Burton,J., Connell,M., Copsey,T., Cooper,J.,
            Coulson,A., Craxton,M., Dear,S., Du,Z., Durbin,R., Favello,A.,
            Fulton,L., Gardner,A., Green,P., Hawkins,T., Hillier,L., Jier,M.,
            Johnston,L., Jones,M., Kershaw,J., Kirsten,J., Laister,N.,
            Latreille,P., Lightning,J., Lloyd,C., McMurray,A., Mortimore,B.,
            O'Callaghan,M., Parsons,J., Percy,C., Rifken,L., Roopra,A.,
            Saunders,D., Shownkeen,R., Smaldon,N., Smith,A., Sonnhammer,E.,
            Staden,R., Sulston,J., Thierry-Mieg,J., Thomas,K., Vaudin,M.,
            Vaughan,K., Waterston,R., Watson,A., Weinstock,L.,
            Wilkinson-Sproat,J. and Wohldman,P.
            2.2 Mb of contiguous nucleotide sequence from chromosome III of C.
            elegans.
            Nature 368 (6466), 32-38 (1994)
            K09A11.2

CYP14A2    Caenorhabditis elegans (nematode worm)
            GenEMBL Z50742 (20856bp)
            see CYP14A1 for reference.
            K09A11.3

CYP14A3    Caenorhabditis elegans (nematode worm)
            GenEMBL Z50742 (20856bp)
            see CYP14A1 for reference.
            K09A11.4

CYP14A4    Caenorhabditis elegans (nematode worm)
            GenEMBL Z50742 (20856bp) Z70212(R04D3.1 continuation of Z50742)
            see CYP14A1 for reference.
            after K09A11.5
            partial sequence at end of cosmid continues on cosmid R04D3

CYP14A5     Caenorhabditis elegans (nematode worm)
            GenEMBL U64847
            F08F3.7

CYP15A1     cockroach
            No accession number
            Rene Feyereisen

CYP15B1     Anopheles gambiae (malaria mosquito)
            GenEMBL 
            Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen
            See the paper: Evolution of supergene families associated with 
            insecticide resistance.
            Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J.
            Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R.
            Science 298, 179-181 (2002)
            Anopheles map name CYPj2l3
            47% identical to 15A1

16A Subfamily

CYP16A1X    Caenorhabditis elegans (nematode worm)
            GenEMBL Z54269 
            number retired
            This sequence is now CYP13B1

17A Subfamily

CYP17       human
            GenEMBL NM_000102
MWELVALLLLTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLP
RHGHMHNNFFKLQKKYGPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQMATL
DIASNNRKGIAFADSGAHWQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATH
NGQSIDISFPVFVAVTNVISLICFNTSYKNGDPELNVIQNYNEGIIDNLSKDSLVDLV
PWLKIFPNKTLEKLKSHVKIRNDLLNKILENYKEKFRSDSITNMLDTLMQAKMNSDNG
NAGPDQDSELLSDNHILTTIGDIFGAGVETTTSVVKWTLAFLLHNPQVKKKLYEEIDQ
NVGFSRTPTISDRNRLLLLEATIREVLRLRPVAPMLIPHKANVDSSIGEFAVDKGTEV
IINLWALHHNEKEWHQPDQFMPERFLNPAGTQLISPSVSYLPFGAGPRSCIGEILARQ
ELFLIMAWLLQRFDLEVPDDGQLPSLEGIPKVVFLIDSFKVKIKVRQAWREAQAEGST

CYP17       human EST
            GenEMBL Z19875 (235bp)
            UK-HGMP (United Kingdom human genome mapping project)
            covers amino acids 270-348 when translating the complementary
            strand.  The fragment goes through at least 6 frame shifts.
            sequence ID AAAAWEO

CYP17       human EST
            GenEMBL Z20209 (248bp)
            UK-HGMP (United Kingdom human genome mapping project)
            covers amino acids 265-349 when translating the complementary
            strand.  The fragment goes through at least 8 frame shifts.
            sequence ID AAABPSZ

CYP17       human
            GenEMBL S85459 (556bp)
            Biason,A., Mantero,F., Scaroni,C., Simpson,E.R. and Waterman,M.R.
            Deletion within the CYP17 gene together with insertion of foreign
            DNA is the cause of combined complete 17
            alpha-hydroxylase/17,20-lyase deficiency in an Italian patient
            Mol. Endocrinol. 5, 2037-2045 (1991)

CYP17       rat
            PIR A31359 (507 amino acids)
            Namiki, M., Kitamura, M., Buczko, E. and Dufau, M.L.
            Rat testis P-450-17-alpha cDNA: the deduced amino acid
            sequence, expression and secondary structural configuration.
            Biochem. Biophys. Res. Commun. 157, 705-712 (1988)

CYP17       rat
            PIR D41425 (16 amino acids)
            Imaoka, S., Kamataki, T. and Funae, Y.
            Purification and characterization of six cytochromes P-450
            from hepatic microsomes of immature female rats.
            J. Biochem. 102, 843-851 (1987)

CYP17       rat
            GenEMBL S50146 Z11902 (3345bp)
            PIR S20655 (97 amino acids) 
            Nason,T.F., Han,X.G. and Hall,P.F.
            Cyclic AMP regulates expression of the rat gene for steroid
            17 alpha-hydroxylase/C17-20 lyase P-450 (CYP17) in rat Leydig
            cells.
            Biochim. Biophys Acta 1171, 73-80 (1992)
            Note: have not been able to download S50146 from GCG or NCBI.

CYP17       rat
            GenEMBL X69816 (7556bp)
            Givens,C.R., Zhang,P., Bair,S.R. and Mellon,S.H.
            Transcriptional regulation of rat cytochrome P450c17 expression in
            mouse Leydig MA-10 and adrenal Y-1 cells: identification of a
            single protein that mediates both basal and cAMP-induced activities.
            DNA Cell Biol. 13, 1087-1098 (1994)

CYP17       rat
            PIR S24316 (97 amino acids)
            Zhang, P., Nason, T.F., Han, X.G. and Hall, P.F.
            Gene for 17-alpha-hydroxylase/C(17-20) lyase P-450: complete
            nucleotide sequence of the porcine gene and 5' upstream
            sequence of the rat gene.
            Biochim. Biophys. Acta 1131, 345-348 (1992)

CYP17      hamster
           no accession number
           Cloutier,M., Fleury,A., Courtemanche,J., Ducharme,L.
           Mason,J.I. and Lehoux,J.G.
           Cloning and expression of hamster adrenal cytochrome P450c17 cDNA.
           Ann. N.Y. Acad. Sci. 774, 294-296 (1995)

CYP17       Cavia (guinea pig)
            GenEMBL S75277(1732bp)
            Tremblay,Y., Fleury,A., Beaudoin,C., Vallee,M. and Belanger,A.
            Molecular cloning and expression of guinea pig cytochrome P450c17
            cDNA (steroid 17 alpha-hydroxylase/17,20 lyase): tissue
            distribution, regulation, and substrate specificity of the
            expressed enzyme.
            DNA Cell Biol. 13 (12), 1199-1212 (1994)

CYP17       Cavia porcellus (guinea pig)
            PIR S52756 (508 amino acids)
            Huang,Y., Voigt,J.M. and Colby,H.D.
            unpublished

CYP17       pig
            GenEMBL Z11854 to Z11856 GenEMBL S40341 (1858bp)
            PIR S24233 (501 amino acids) PIR S30074 (501 amino acids)
            Zhang,P. Nason,T.F., Han,X.G. and Hall,P.F.
            Gene for 17 alpha-hydroxylasae/C(17-20) lyase P-450: complete
            nucleotide sequence of the porcine gene and 5' upstream sequence of
            the rat gene.
            Biochim. Biophys. Acta 1131, 345-348 (1992)

CYP17       Sus scrofa (pig)
            GenEMBL U41519 to U41525 
            Conley,A.J., Graham-Lorence,S.E., Kagimoto,M., Lorence,M.C.,
            Murry,B.A., Oka,K., Sanders,D. and Mason,J.I.
            Nucleotide sequence of a cDNA encoding porcine testis 17
            alpha-hydroxylase cytochrome P-450.
            Biochim. Biophys. Acta 1130, 75-77 (1992)

CYP17       bovine
            GenEMBL M64646 (1725bp)
            Zuber,M.X., John,M.E., Okamura,T., Simpson,E.R. and Waterman,M.R.
            Bovine adrenocortical cytochrome P-450-17-alpha: Regulation of gene
            expression by ACTH and elucidation of primary sequence
            J. Biol. Chem. 261, 2475-2482 (1986)

CYP17       Ovis aries (sheep)
            GenEMBL L40335 (1728bp)
            Murry,B.A., Swart,P. and Mason,J.I.
            Cloning and expression of ovine cytochrome P-450 17-alpha
            hydroxylase/c17-20 lyase.
            Unpublished (1995)

CYP17       Equus caballus (horse)
            GenEMBL D30688(1906bp) D13818 
            Hasegawa,T., Mukoyama,H., Yoshida,S. and Takahashi,M.
            Molecular cloning and nucleotide sequence of equine testicular
            cytochrome P-450 steroid 17alpha-hydroxylase/C17,20-lyase messenger
            ribonucleic acid.
            Biol. Reprod. Mono. 1, 615-622 (1995)

CYP17       Equus caballus (horse)
            GenEMBL D88184(6217bp)
            Hasegawa,T.
            Exon/Intron structure of Equine P450c17.
            unpublished (1996)

CYP17       Oncorhynchus mykiss (rainbow trout)
            GenEMBL S50356
            Sakai,N., Tanaka,M., Adachi,S., Miller,W.L. and Nagahama,Y.
            Rainbow trout cytochrome P-450c17 (17 alpha-hydroxylase/17-20
            lyase) cDNA cloning, enzymatic properties and temporal pattern of 
            ovarian P-450c17 mRNA expression during oogenesis.
            FEBS Lett. 301, 60-64 (1992)
            Identical to X65800

CYP17       Oncorhynchus mykiss (rainbow trout)
            GenEMBL X65800 (2287bp) Swiss P30437 (514 amino acids)
            Sakai,N., Tanaka,M., Adachi,S., Miller,W.L. and Nagahama,Y.
            Rainbow trout cytochrome P-450c17 (17
            alpha-hydroxylase/17,20-lyase). cDNA cloning, enzymatic properties
            and temporal pattern of ovarian P-450c17 mRNA expression during
            oogenesis
            FEBS Lett. 301, 60-64 (1992)
            Identical to S50356

CYP17       Oryzias latipes (medaka)
            GenEMBL D87121(2421bp)
            Kobayashi,D., Matsuyama,M., Tanaka,M., Fukada,S. and Nagahama,Y.
            Structural analysis of medaka P-450c17 and expression in the
            ovarian follicle.
            unpublished (1996)

CYP17       Oryzias latipes (medaka)
            GenEMBL D87122(2302bp)
            Kobayashi,D., Tanaka,M., Fukada,S. and Nagahama,Y.
            Presence of a Novel Cytochrome P-450c17 Transcripts in Medaka
            Gonads
            unpublished (1996)
            note: this sequence is missing exon 6
            otherwise identical to D87121

CYP17       Squalus acanthias (dogfish)
            GenEMBL S77384 (1964bp)
            Trant,J.
            Isolation and characterization of the cDNA encoding the spiny 
            dogfish shark
            (Squalus acanthias) form of cytochrome P450c17
             J. Exp. Zool. 272, 25-33 (1995)

CYP17       Ictalurus punctatus (channel catfish)
            GenEMBL AF063837
            Trant,J.M., Berard,C., Byrne,B.J. and Wunder,J.
            Isolation and heterologous expression of the cDNA encoding the
            cytochrome P450 17-hydroxylase from the channel catfish (Ictalurus
            punctatus)
            Unpublished (1998)

CYP17A1     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_4175
            80% to Oryzias latipes 17, 79% to trout 17 73% to catfish 
            61% to dogfish 48% to other CYP17 from Fugu
      MDWVLFVYAFSAVNLALLALHLKFRTPASGPRGPPRLPALPLIGSLLSLRSPHPPHVLFKE
      LQGKYGQTYSLMMGSHRVIIVNHHAHAKEVLLKKGKIFAGRPRS 11667 (0)
11572 VTTDVLSRDGKDIAFGDYSATWRFHRKIVHGALCMFGEGSASIEKI 11435 (1)
10871 ICAEAASLCSILSEAWTAGLALDLSPELTRAVTNVICSLCFSSSYRRGDAEFEAMLHYSQ 10692
10691 GIVDTVAKDSLVDIFPCLQ 10635 (0)
      IFPNADLRLLKRCVSVRDKLLQKEYDKHK (0)
      AAYSDHVQRDLLDALLRAKCSAENNNTTGINAESVGLTDDHLLMT
      VGDIFGAGVETTTTVMKWAITYLIHHPQ 9765 (0)
 9637 IQSRIQEELDSRVGMDRSPQLSDRGSLPYLEATIREVLRIRPVAPLFIPHVALSDT 9473 (2)
 9369 SIGDFAVKKGTRVVINLWSLHHDEKEWENPERFDP 9265 (1)
      GRFLNSEGTGLVIPSSSYLPFGAGVRVCLGEALAK 8558
 8557 MELFLFLSWILQRFTLTVPSGHSLPSLEGKFGVVLQPTKYKVNATPRPGWEGKCKACWN* 8378

CYP17A2     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_8086
            50% to Oryzias latipes 17, 48% to trout 17 50% to catfish 
            49% to dogfish 48% to other CYP17 from Fugu
1198 MVTVGSFLIFRRPVRGSEPGSEAGPPRVKVPCISWVPVLGSLPWLRGGRPLHLIFTQLSYR 1380 (2)
1600 YGPLFALYLGPHLTVVVNNHQHAREVLLLRGKDFAGRPRM 1719 (0)
1797 VTTDLLTRGGKDIAFSDYCPLWKSHRRLVQNSFTLFGEGTSRLQDM 1934 (1)
2061 VLAAVDSLCEELLSMEGRGFDPAPAVTRAVTNVVCMLVFSATYRHGDSELQEVLRYNDGI 2240
2241 VQTIAGGGLVDIYPWMK 2291 (0)
2372 VFPNKTLSKLKACIAVRDRLLTHKLEEHK 2458 (0)
2622 ATLTDNQPRDLLDALLMGQVGRGRRKGSGRVEEDIITEDHVLMTAAEAFGAGVETTSTTLLWILAYLLHHPQ 2837 (0)
2925 VQERVQKELDDHVGSERPVRVSDRARLTYLDCVINEGMRIRPVSPVLIPHTAMTDSR 3074 (2)
3870 IGGHHISRGTRVLVNMWSIHHDSAHWDKPDLFNP 3971 (1)
4519 DRFRDHQGQRVTPSCFLPFGAGPRVCVGESLARLELFLFLSSLLQRMSFRLPNGA 4692
4693 SPPDLQGRMGVVLQPVPYKVVVTPRVG* 4776

CYP17 fragment a Fugu rubripes (pufferfish)
                 No accession number
Fc:c028I22x2 LPC.10549.x2 Length = 1007 61% to 17A1 Fugu
GAPRATTDNHHAHAKEARPKKGKKAAGRPRK exon 1
ATTDGASRDGTDTAEGDHSATRRDQR exon 2

CYP17 fragment b Fugu rubripes (pufferfish)
                 No accession number
Fc:c028I22x1 LPC.10549.x1 Length = 894 
78% to Fc:c028I22x2 65% to 17A1 Fugu
KEQQAEHGQTTSRMMGSHRGNTDNQHAHAKEARQKKGKKGAGRPRK exon 1
ATTDGRSRDGTDIAEGDHSATWRHHRKIVQRARRRNGEGSAPSEKI exon 2 C-helix

18A Subfamily

Cyp18       Drosophila melanogaster
            GenEMBL S66112 (63bp)
            Hurban,P. and Thummel,C. 
            Isolation and characterization of fifteen ecdysone-inducible 
            drosophila genes reveal unexpected complexities in ecdysone regulation.
            Molec. Cell Biol. 13, 7101-7111 (1993)
            Note: very short ecdysone inducible fragment in the heme binding
            region about 2/3 of amino acids are identical to 2D sequences. 
            called Eig 17-1
            dimethylnitrosamine demethylase 

Cyp18       Drosophila melanogaster
            GenEMBL U44753(2539bp)
            Bassett,M.H, Waterman,M.R., McCarthy,J.L. and Sliter,T.J.
            Cloning and characterization of CYP18 in Drosophila melanogaster:
            identification of an insect member of a new cytochrome P450 family.
            unpublished (1996)
            complete sequence from the Eig 17-1 fragment

Cyp18       Drosophila melanogaster
            GenEMBL AC012164 114600-117669 also AC015216

CYP18       Spodoptera littoralis (cotton leafworm)
            No accession number 
            Lyndsay Davies
            Submitted to nomenclature committee 2/22/02
            61% identical to CYP18 from Drosophila
            since there is only one CYP18 seq in Drosophila this new sequence 
            will be called CYP18 without a subfamily or number like CYP18A1.
            If more CYP18s are found in a single species this may have to change.

19A Subfamily

CYP19       human
            GenEMBL S52034 (142bp) S52789 (106bp) S52793 (149bp)
            S52794 (125bp)
            Harada,N.
            A unique aromatase (P-450AROM) mRNA formed by alternative
            use of tissue-specific exons 1 in human skin fibroblasts.
            Biochem. Biophys. Res. Commun. 189, 1001-1007 (1992)

CYP19       human
            GenEMBL D14473 (295bp) S59092 S59095 S59171
            Toda,K and Shizuta,Y.
            Molecular cloning of a cDNA showing alternative splicing of
            the 5'-untranslated sequence of mRNA for human aromatase P-450.
            Eur. J. Biochem. 213, 383-389 (1993)

CYP19       human
            GenEMBL D13391 (2238bp)
            Katsumi,T. and Shizuta,Y.
            Identification and characterization of cis-acting regulatory 
elements for 
            the expression of the human aromatase cytochrome P-450 gene.
            J. Biol. Chem. 269, 8099-8107 (1994)

CYP19       human
            GenEMBL D21240 (794bp) D21241 (3231bp)
            Harada,N., Utsumi,T. and Takagi,Y.
            Tissue-specific expression of the human aromatase cytochrome P-450
            gene by alternative use of multiple exons 1 and promoters, and
            switching of tissue-specific exons 1 in carcinogenesis.
            Proc. Natl. Acad. Sci. U.S.A. 90 (23), 11312-11316 (1993)

CYP19       human
            GenEMBL X55983 (669bp)
            Toda,K., Miyahara,K., Kawamoto,T., Ikeda,H., Sagara,Y. and
            Shizuta,Y.
            Characterization of a cis-acting regulatory element involved in 
human-
            aromatase P-450 gene expression.
            Eur, J. Biochem. 205, 303-309 (1992)
            exon 1

CYP19       human
            GenEMBL S71536 (792bp)
            Toda,K., Simpson,E.R., Mendelson,C.R., Shizuta,Y. and Kilgore,M.W.
            Expression of the gene encoding aromatase cytochrome P450 (CYP19)
            in fetal tissues
            Mol. Endocrinol. 8, 210-217 (1994)

CYP19       human
            GenEMBL M32245 (840bp)
            Harada,N., Yamada,K., Saito,K., Kibe,N., Dohmae,S. and Takagi,Y.
            Structural characterization of the human estrogen synthetase
            (aromatase) gene.
            Biochem. Biophys. Res. Commun. 166, 365-372 (1990)

CYP19       human
            GenEMBL D29757 (875bp) PIR PC2041 (45 amino acids)
            Honda,S.-I., Harada,N. and Takagi,Y.
            Novel exon 1 of the aromatase gene specific for aromatase 
transcripts
            in human brain.
            Biochem. Biophys. Res. Commun. 198, 1153-1160 (1994)

CYP19       human
            GenEMBL M22246 (2966bp)
            Harada,N.
            Cloning of a complete cDNA encoding human aromatase: immunochemical
            identification and sequence analysis
            Biochem. Biophys. Res. Commun. 156, 725-732 (1988)

CYP19       human
            GenEMBL L21982 (1166bp)
            Mahendroo,M.S., Mendelson,C.R. and Simpson,E.R.
            Tissue-specific and hormonally-controlled alternative promoters
            regulate aromatase cytochrome P450 gene expression in human adipose
            tissue
            J. Biol. Chem. 268, 19463-19470 (1993)

CYP19       human
            GenEMBL S85356 (1384bp)
            Means,G.D., Kilgore,M.W., Mahendroo,M.S., Mendelson,C.R. and
            Simpson,E.R.
            Tissue-specific promoters regulate aromatase cytochrome P450 gene
            expression in human ovary and fetal tissues.
            Mol. Endocrinol. 5, 2005-2013 (1991)

CYP19       human
            GenEMBL S96437 (971bp)
            Kilgore,M.W., Means,G.D., Mendelson,C.R. and Simpson,E.R.
            Alternative promotion of aromatase P-450 expression in the human 
placenta.
            Mol. Cell. Endocrinol. 83, R9-R16 (1992)

CYP19       human
            PIR A40542 (48 amino acids)
            Mahendroo, M.S., Means, G.D., Mendelson, C.R. and Simpson, E.R.
            Tissue-specific expression of human P-450-AROM. The promoter
            responsible for expression in adipose tissue is different
            from that utilized in placenta.
            J. Biol. Chem. 266, 11276-11281 (1991)

CYP19        Macaca fuscata (Japanese macaque)
            GenEMBL S79807(369bp)
            Yamada-Mouri,N., Hirata,S., Hayashi,M. and Kato,J.
            Analysis of the expression and the first exon of aromatase mRNA in
            monkey brain.
            J. Steroid Biochem. Mol. Biol. 55 (1), 17-23 (1995)

CYP19       rat
            GenEMBL S59505 (639bp)
            Fitzpatrick,S.L. and Richards,J.S.
            cis-acting elements of the rat aromatase promoter required for
            adenosine 3',5'-monophosphate induction in ovarian granulosa cells
            and constitutive expression in R2C Leydig cells.
            Molec. Endocrinol. 7, 341-354 (1993)
            Note: promoter

CYP19       rat
            GenEMBL Z11815 (590bp)
            Hickey,G.J., Krasnow,J.S., Beattie,W.G. and Richards,J.S.
            Aromatase cytochrome P450 in rat ovarian granulosa cells before and
            after luteinization: Adenosine 3',5'-monophosphate-dependent and
            independent regulation. Cloning and sequencing of rat aromatase
            cDNA and 5' genomic DNA
            Mol. Endocrinol. 4, 3-12 (1990)

CYP19       Oryctolagus cuniculus (rabbit)
            GenEMBL Z68271(1783bp)
            Delarue,B., Mittre,H., Feral,C., Benhaim,A. and Leymarie,P.
            Rapid sequencing of rabbit aromatase cDNA using RACE PCR without
            cloning.
            C. R. Acad. Sci. III, Sci. Vie 319, 663-670 (1996)

CYP19       Oryctolagus cuniculus (rabbit)
            GenEMBL Z70302(1455bp)
            Delarue,B., Mittre,H. and Leymarie,P.
            Expression des transcrits codant pour l'aromatase de lapin dans
            differents tissus.
            unpublished (1996)

Cyp19       mouse
            Swiss P28649 (503 amino acids) GenEMBL D00659 (2420bp)
            Terashima,M., Toda,K., Kawamoto,T., Kuribayashi,I., Ogawa,Y.,
            Maeda,T. and Shizuta,Y.
            Isolation of a full-length cDNA encoding mouse aromatase P450
            Arch. Biochem. Biophys. 285, 231-237 (1991)

CYP19      pig
           no accession number
           Corbin, C.J., Khalil, M.W. and Conley, A.J.
           Functional ovarian and placental isoforms od porcine aromatase.
           Mol. Cell. Endocrinol. 113, 29-37 (1995)

CYP19       Sus scrofa (pig)
            GenEMBL L15471 (454bp)
            Ko,Y., Choi,I., Green,M.L., Simmen,F.A. and Simmen,R.C.
            Transient expression of the cytochrome P450 aromatase gene in
            elongating porcine blastocysts is correlated with uterine
            insulin-like growth factor levels during peri-implantation
            development.
            Mol. Reprod. Dev. 37 (1), 1-11 (1994)
            Note: This is only a fragment of 80 amino acids, including
            helix K and the EXXR conserved sequence.

CYP19      Sus scrofa (pig)
            GenEMBL U52141(1133bp)
            Choi,I., Collante,W., Simmen,R.C.M. and Simmen,F.A.
            Molecular cloning of multiple forms of cytochrome p450 aromatase
            and their developmental expression in porcine blastocysts,
            endometrium, and placenta.
            Unpublished (1997)

CYP19      Sus scrofa (pig)
            GenEMBL U52142(1584bp)
            Choi,I., Collante,W., Simmen,R.C.M. and Simmen,F.A.
            Molecular cloning of multiple forms of cytochrome p450 aromatase
            and their developmental expression in porcine blastocysts,
            endometrium, and placenta.
            Unpublished (1997)

CYP19      Sus scrofa (pig)
            GenEMBL U37309 (417bp)
            Choi,I., Simmen,R.C. and Simmen,F.A.
            Molecular cloning of cytochrome P450 aromatase complementary
            deoxyribonucleic acid from periimplantation porcine and equine
            blastocysts identifies multiple novel 5'-untranslated exons
            expressed in embryos, endometrium, and placenta.
            Endocrinology 137 (4), 1457-1467 (1996)

CYP19      Sus scrofa (pig)
            GenEMBL U37311(2470bp) 
            Choi,I., Simmen,R.C. and Simmen,F.A.
            Molecular cloning of cytochrome P450 aromatase complementary
            deoxyribonucleic acid from periimplantation porcine and equine
            blastocysts identifies multiple novel 5'-untranslated exons
            expressed in embryos, endometrium, and placenta.
            Endocrinology 137 (4), 1457-1467 (1996)

CYP19      Sus scrofa (pig)
            GenEMBL U57510
            Choi,I., Collante,W., Simmen,R.C.M., Troyer,D. and Simmen,F.A.
            Molecular cloning and structural characterization of porcine
            cytochrome p450 aromatase chromosomal genes: evidence for the
            existence of multiple, closely related genes that encode
            developmental and tissue-specific isoforms of aromatase.
            unpublished (1997)

CYP19      Sus scrofa (pig)
            GenEMBL U57517(287bp) U57518(287bp) U57519(358bp) 
            Choi,I., Collante,W., Simmen,R.C.M., Troyer,D. and Simmen,F.A.
            Molecular cloning and structural characterization of porcine
            cytochrome p450 aromatase chromosomal genes: evidence for the
            existence of multiple, closely related genes that encode
            developmental and tissue-specific isoforms of aromatase.
            unpublished (1997)

CYP19      Sus scrofa (pig)
            GenEMBL U57520(517bp) U57521(495bp)
            Choi,I., Collante,W., Simmen,R.C.M., Troyer,D. and Simmen,F.A.
            Molecular cloning and structural characterization of porcine
            cytochrome p450 aromatase chromosomal genes: evidence for the
            existence of multiple, closely related genes that encode
            developmental and tissue-specific isoforms of aromatase.
            unpublished (1997)

CYP19       bovine
            GenEMBL S66248 (2104bp)
            Hinshelwood,M.M., Corbin,C.J., Tsang,P.C. and Simpson,E.R.
            Isolation and characterization of a cDNA insert encoding bovine 
aromatase 
            cytochrome P450.
            Endocrinology 133, 1971-1977 (1993)
            Note two amino acid differences with Z32741

CYP19       bovine
            GenEMBL M64646 (1725bp)
            Zuber,M.X., John,M.E., Okamura,T., Simpson,E.R. and Waterman,M.R.
            Bovine adrenocortical cytochrome P-450-17-alpha: Regulation of gene
            expression by ACTH and elucidation of primary sequence.
            J. Biol. Chem. 261, 2475-2482 (1986)

CYP19       bovine
            GenEMBL Z32741 (4226bp) PIR S44210 (503 amino acids)
            Vanselow,J. and Furbass,R.
            Aromatase cytochrome P450 gene and pseudogene
            unpublished (1994)

CYP19       bovine
            GenEMBL Z69241 to Z69250 (genomic sequences)
            Furbass,R. and Vanselow,J.
            unpublished (1996)

CYP19P      bovine
            GenEMBL Z32813 (1006bp)
            Vanselow,J. and Furbass,R.
            Aromatase cytochrome P450 gene and pseudogene
            unpublished (1994)

CYP19        Equus caballus (horse)
            GenEMBL U37313 (458bp)
            Choi,I., Simmen,R.C. and Simmen,F.A.
            Molecular cloning of cytochrome P450 aromatase complementary
            deoxyribonucleic acid from periimplantation porcine and equine
            blastocysts identifies multiple novel 5'-untranslated exons
            expressed in embryos, endometrium, and placenta.
            Endocrinology 137 (4), 1457-1467 (1996)

CYP19       chicken (three different strains of chicken)
            GenEMBL M73277 to M73285, M73286 to M73294
            M73295 to M73303
            PIR A41063 (495 amino acids)
            Matsumine,H., Herbst,M., Ou,S.-H.I., Wilson,J.D. and 
            McPhaul,M.J.
            Aromatase mRNA in the extragonadal tissues of chickens with 
            the henny-feathering trait is derived from a distinctive
            promoter structure that contains a segment of a retroviral
            long terminal repeat.
            J. Biol. Chem. 266, 19900-19907 (1991)

CYP19       Coturnix coturnix japonica (Japanese quail)
            GenEMBL S46949 (692bp) PIR A48977(230 amino acids)
            Harada,N., Yamada,K., Foidart,A. and Balthazart,J.
            Regulation of aromatase cytochrome P-450 (estrogen synthetase)
            transcripts in the quail brain by testosterone.
            Brain Res. Mol. Brain Res. 15, 19-26 (1992)
            Note: only three amino acid differences with chicken.

CYP19      Coturnix coturnix (quail)
            GenEMBL D50336(4351bp)
            Kudo,T., Yamamoto,H., Sato,S. and Sutou,S.
            Comparison of 5' upstream regions of chicken and quail aromatase
            gene.
            Unpublished (1995)

CYP19       Poephila guttata (zebra finch)
            GenEMBL S75898(3188bp)
            Shen,P., Campagnoni,C.W., Kampf,K., Schlinger,B.A., Arnold,A.P. and
            Campagnoni,A.T.
            Isolation and characterization of a zebra finch aromatase cDNA: in
            situ hybridization reveals high aromatase expression in brain
            Brain Res. Mol. Brain Res. 24 (1-4), 227-237 (1994)

CYP19       Ictalurus punctatus (channel catfish)
            GenEMBL S75715(2102bp)
            Trant,J.
            Isolation and characterization of the cDNA encoding the channel
            catfish (Ictalurus punctatus) form of cytochrome P450arom
            Gen. Comp. Endocrinol. 95 (2), 155-168 (1994)

CYP19       Onchorynchus mykiss (rainbow trout)
            Tanaka, M., Telecky, T.M., Fukada, S., Adachi,S., Chen, S. and 
Nagahama, Y.
             Cloning and sequence analysis of the cDNA encoding P-450 aromatase 
             (P450arom) from a rainbow trout (Onchorynchus mykiss) ovary, 
relationship 
             between the amount of P450arom mRNA and the production of 
oestradiol-17-beta 
             in the ovary.
             J. Mol. Endocrin. 8, 53-61 (1992)

CYP19      Carassius auratus (goldfish)
            GenEMBL U18974(2939bp)
            Gelinas,D.M., Pitoc,G.A. and Callard,G.V.
            Isolation of goldfish brain aromatase cDNA and analysis of
            expression during the reproductive cycle and after steroid
            treatment in vivo.
            unpublished (1996)

CYP19       Oryzias latipes (medaka)
            GenEMBL D82968(1851bp)
            Tanaka,M., Fukada,S., Matsuyama,M. and Nagahama,Y.
            Structure and promoter analysis of the cytochrome P-450 aromatase
            gene of the teleost fish, medaka (Oryzias latipes)
            J. Biochem. 117 (4), 719-725 (1995)

CYP19      Tilapia nilotica (Cichlid fish)
            GenEMBL U72071(1804bp)
            Chang,X.T., Kobayashi,T., Nakamura,M., Kajura,H. and Nagahama,Y.
            Isolation and characterization of cDNA encoding the tilapia
            (Oreochromis niloticus) cytochrome P450 aromatase (P450arom):
            Changes in P450arom mRNA, protein and enzyme activity in ovarian
            follicles during oogenesis.
            J. Mol. Endocrinol. (1996) In press

CYP19     Haplochromis burtoni (cichlid fish)
            GenEMBL AF114716
            White,R.B.
            Aromatase expression during social change in the cichlid fish,
            Haplochromis burtoni.
            Unpublished

CYP19       Paralichthys olivaceus (Japanese Flounder)
            GenEMBL AB017182
            Kitano,T., Takamune,K., Kobayashi,T., Nagahama,Y. and Abe,S.
            Suppression of P450 Aromatase (P450arom) Gene Expression in
            Sex-Reversed Males Produced by Rearing Genetically Female Larvae at
            High Water Temperature during a period of Sex Differentiation in
            Japanese Flounder (Paralichthys olivaceus)
            Unpublished (1998)

CYP19A1     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_7098
            65% to AF183906 ovary form of CYP19 from Zebrafish
            60% to AF183908 brain form of CYP19 from Zebrafish
            61% to other Fugu CYP19
            this is probably the ovary form
9466 MAAVGLDAEVLVSVSPNATEAESPGSSAGTRALIILTCLLLLVWSHTEKKSVP 9308 (1)
9242 SLLGPSFCLGFGPLLTYVRFIWTGIGTASNYYNKKYGDIVRVWVNGEETLVISR 9081 (2)
8985 ASAVHHVLKSRQYTSRFGSKQGLSCIGMNERGIIFNNNVTEWRKIRGYFTK 8830 (1)
8759 ALTGPAVQNTVEVCNSSTQAHLDRLEDLAQVDVLSLLRCTVVDISNRLFLDIPIN 8595 (1)
8499 EKELLLKIHKYFDTWQTVLIKPDIYFKFGWIHQKHKTAA 8392 (2)
8296 RELQEAIEGLVEQKRRDLEQADKLENINFTAELLFAQ 8186 (0)
8084 NHGELSAENVMQCVLEMVIAAPDTLSVSLFFMLLLLKQNPDVELQLLQEIDAVVGK (0 expected, bad boundary)
     RQLQNGDLQKLRVLETFINECLRFHPV 7719
7718 VDFTMRRSLSDDVIEGYRVPKGTNIILNTGHMHRTEFFLRPTEFCLQNFEKN 7563 (0)
     APRRYFQPFGSGPRACVGKHIAMVMMKSILVTLLSQYSVCPHEGLT 7327
7326 LDCLPQTNNLSQQPVEHQEEAQQLSMRFLPRQRGSWQTV* 7207

CYP19A1     Danio rerio (zebrafish)
            GenEMBL AF183906
            Chiang,E.F.L., Yan,Y.L., Guiguen,Y., Postlethwait,J. and Chung,B.C.
            Two Cyp19 (P450 Aromatase) Genes on Duplicated Zebrafish
            Chromosomes Are Expressed in Ovary or Brain
            Mol. Biol. Evol. 18 (4), 542-550 (2001)
            Called CYP19a expressed in ovary

CYP19A2     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_4200
            65% to both brain and ovary forms of CYP19 from Zebrafish
            61% to other Fugu CYP19
            this is probably the brain form
2918 MKPKETLNITASGPFTPLPLPLMMMMMMLLLMMMMLFLTWNRPQRQHVP (1)
3451 GPLFLAGLGPLLSYCRFMWTGIGTACNFYNNKYGSLVRVWINGEETLILSR (2)
3674 SSAVYHVLRSAHYTARFGSRAGLECIGMEGQGVIFNSDVQLWRRARVYFSK (1)
3960 ALTGPGLQRTVGVCVTSTAKHLDCLVDMTDASGHVDALNLLRAIVVDISNRLFLRVPLN 4136 (1)
     EKDLLTKIHNYFETWQAVLIKPDIFFKIGWLFDKHRRAA 4319 (2)
     QELQDTMAALLKVKRKLVHEAEKLDDVLDFATELILAQ (0)
     EAGEFSADNVRQCVLEMVIAAPDTLSISLFFMLMLLKQHPDVELRIVEELSTvsrt (0)
     egeENIDYQRLKVMESFINESMRFHPVVDFTMRKALEDDTIEGIRIRKGTNIILNIGLMHKTE 5039
5040 FFPKPREFSLTNFEQT (0)
     VPSRFFQPFGCGPRSCVG 5219
5220 KHIAMVMMKAILATLLSRYTVCPRHGCTLTSIRQTNNLSQQPVEDEHSLAMRFIPRTIQSPS* 5408

CYP19A1     Danio rerio (zebrafish)
            GenEMBL AF183908
            Chiang,E.F.L., Yan,Y.L., Guiguen,Y., Postlethwait,J. and Chung,B.C.
            Two Cyp19 (P450 Aromatase) Genes on Duplicated Zebrafish
            Chromosomes Are Expressed in Ovary or Brain
            Mol. Biol. Evol. 18 (4), 542-550 (2001)
            Called CYP19b expressed in brain

CYP19       Trachemys scripta (red-eared slider turtle)
           no accession number
           Crews,D.
           Temperature-dependent sex determination: the interplay of steroid 
hormones 
           and temperature.
           Zool. Sci. 13, 1-13 (1996)

CYP20      human
           GenEMBL AC011737.8 chromosome 2 clone RP11-33N4, 
           AC011737.8 chr 2 (missing exons 12,13) 
           AC080075.2 (missing exons 1,7,8)
MLDFAIFAVTFLLALVGAVLYLYP (0)
ASRQAAGIPGITPTEEK (2)
DGNLPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTVDVLKQHINPNKTS (1)
DPFETMLKSLLRYQSGGGSVSENHMRKKLYENGVTDSLKSNFALLLK (0)
LSEELLDKWLSYPETQHVPLSQHMLGFAMKSVTQMVMGSTFEDDQEVIRFQKNHGT (0)
VWSEIGKGFLDGSLDKNMTRKKQYED (1)
ALMQLESVLRNIIKERKGRNFSQHIFIDSLVQGNLNDQQ (0)
ILEDSMIFSLASCIITAK (1)
LCTWAICFLTTSEEVQKKLYEEINQVFGNGPVTPEKIEQLR (2)
YCQHVLCETVRTAKLTPVSAQLQDIEGKIDRFIIPRE (0)
TLVLYALGVVLQDPNTWPSPHK (2) genomic seq stops here the rest is cDNA
FDPDRFDDELVMKTFSSLGFSGTQECPELR (2) intron site based on fish genomic DNA
FAYMVTTVLLSVLVKRLHLLSVEGQVIETKYELVTSSREEAWITVSKRY

CYP20      mouse
           GenEMBL AK020848 adult retina cDNA plus ESTs for C-term
MLDFAIFAVTFLLALVGAVLYLYPASRQASGIPGLTPTEEKDGN
LPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTTDVLKQHFNPNKTSDPFET
MLKSLLGYQSGGGSAGEDHVRRKLYGDAVTASLHSNFPLLLQLSEELLDKWLSYPETQ
HIPLSQHMLGFALKFVTRMVLGSTFEDEQEVIRFQKIHG 
TVWSEIGKGFLDGSLDKNTTRKKQYQEALMQLESTLKKIIKERKGGNFRQHT 
FIDSLTQGKLNEQQILEDCVVFSLASCIITAR 
LCTWTIHFLTTTGEVQKK
LCKEIDQVLGEGPITSEKIEQLSYCQQVLFETVRTAKLTPVSARLQDIEGKVGPFVIPKE 360
TLVLYALGVVLQDPSTWPLPHRFDPDRFADEPVMKVFSSLGFSGTWECPELXFAYMVTAV 540
LVSVLLEKLRLLAVDRQVVEMKYELVTSAREEAWITVSKRH*

CYP20       Bos taurus (cow)
MLDFAIFAVTFLLALVGAVLYLYPASRQAAGIPGITPTEEKDGNLPDIV
NSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTVDVLKQHINPNKTLDPFETMLKSLLR
YQSDSGNVSENHMRKKLYENGVTNCLRINFALLIKLSEELLDKWLSYPESQHVPLCQHML
GFAMKSVTQMVMGSTFEDEQEVIRFQKNHGTVWSEIGKGFLDGSLDKSTTRKKQYEDALM
QLESILKKIKERKGRNFSQHIFIDSLVQGNLNDQQILEDTMIFSLAS
CMITAKLCTWAVCFLTTYEEIQKKLYEEIDQVLGKGPITSEKIEELRYCRQVLCETVRTA
KLTPVSARLQDIEGKIDKFIIPRETLVLYALGVVLQEXGTWSSPYKFDPERFDDESVMKT
FSLLGFSGTRECPELRFAYMVTAVLLSVLLRRLHLLSVE
GQVIETKYELVTSSKEEAWITVSKRY 498

CYP20       Gallus gallus (chicken)
BU454844 603215388F1 CSEQRBN14 Gallus gallus cDNA clone ChEST201o21 5'.
BU111630 603125978F1 CSEQCHL13 Gallus gallus cDNA clone ChEST95k19 5'
BU356654 603474052F1 CSEQCHN70 Gallus gallus cDNA
MLDFAIFAVTFLLILVGAVLYLYP
ASRQASGIPGLAPTDDK
DGNLPDIIASRSLHEFLVNLHEKYGPLVSFWFGRRLVVSLGSIDLLKQHVNPNRSS
DPFEMMLKSFLRYQSSLNGDTGESHLRRKLYESGVSKSLQSNLALIQK
LSEELLAKWLSLPEAQHIPLCQHMLGFAMKSVTQTAMGSSFEDDQEVIRFRRHHDA
IWSEIGKGFLDGSLDKNATRKKLYED
ALKEMESTLRKVITGTPRQIIQQAFIDTLLQGNLSDQQ
ILEDTMIFSLAGCIITAN
LCTWAVYFLTTSEDVQQNLCKEVDHVLGKGPITHEKIEQLR
YCRQVLCETVRTAKLTPIAAQLQELEGRVDQHTVPKE
TLVLYALGVMLQDSSSWPSPYK
FDPERFSEDSAMTNFSLLGFSGSQECPELR
FAYMVATVLLSILVRKLYLHPVKGQVMETKYELVTSPKEEAWITVSKRS*

CYP20 Xenopus laevis (African clawed frog)
BJ037591.1 NIBB Mochii normalized Xenopus neurula library Xenopus laevis cDNA clone XL040p15 5'
BQ725586 BQ725586.1 AGENCOURT_8103453 NICHD XGC Emb2 Xenopus laevis cDNA clone
95% identical to tropicalis seq. 74% to chicken
BC044111.1 mRNA, complete cds Length = 2417
175  MLDFAIFAITFLLILVGAVLYLYPSSRQACGIPGLAPTEEKDGNLQDIVNSGS 333
334  LHEFLVNLHERFGPVASFWFGRRLVVSLGSLDLLKQHINPNKTSDPFQTMLKSLLGYQSG 513
514  VIGEAAESHVQKKLYENGITKALHSNFSVIIKLSEELLAKWGTYPQSQHVPLCQHMLGFA 693
694  MKSVTQTAMGSSFEDDQEVIHFRRNHDAIWSEIGKGFLDGSIERSPSRKKLYEDALMEME 873
874  TVLKKTIKERKGKNPGRHVFLDSLLQGNLSDKQVLEDSMIFSLAGCVITANLCTWAIYFL 1053
1054 TTSEEVQDKLYKEVNRVIGKGPITMDKLEQLSYCRQILCETVRTASLTPISARLQELEGR 1233
1234 VDQHIIPKETLVLYALGVVLQDNTAWPLAYRFDPDRFDDETAKQSLSLLGLSGSQECPEL 1413
1414 RFAYMVAMVLLCVLVRKLNLLPVKGQVMETKYELVTSPKEEAWITVSKRS 1563

CYP20 Xenopus tropicalis
From JGI blast server http://aluminum.jgi-psf.org/prod/bin/runBlast.pl?db=xenopus1&dump=1
And from Sanger http://www.sanger.ac.uk/Projects/X_tropicalis/blast_server.shtml
AL848968.1 Xenopus tropicalis EST, clone TEgg007o12 5'
AL870093 AL870093.1 Xenopus tropicalis EST, clone TEgg120l22 5'
60% to Fugu CYP20, 70% to human CYP20, 62% to zebrafish, 30% to ciona CYP20 like seq
72% to chicken
MLDFAIFAITFLLILVGAVLYLYP exon 1
SSRQACGIPGLAPTEEK exon 2
DGNLQDIVNSGSLHEFLVNLHERFGPVASFWFGRRLVVSLGSLDLLKQHINPNKTS exon 3
DPFQMMLKSLLGYQSGVIGEAAESHVQKKLFENGIIKALHSNFSVVIK exon 4
LSEDLLAKWLTYPQSQHVPLCQHMLGFAMKSVTQTAMGSSFDDDQEVIHFRRNHDA exon 5
IWSEIGKGFLDGSIERSPNRKKLYED exon 6
ALMEMETVLKKAIKERKVKNPGRHVFVDSLLQGNLSDKQ exon 7
VLEDSMIFSLAGcvitan exon 8
VCTWAIYFLTTSEEVQDKLFKEVTRVIGKGPITMDKLEQLS exon 9
YCRQILCETVRTASLTPISARLQELEGRVDQHIIPKE exon 10
tlvlyalgvvlqdntawplayr exon 11 X. laevis
FDPDRFNDETAKQSLTLLGFSGSQECPELR exon 12
FAYMVAMVLLSVLVRKLHLLPVKGQVMETKYELVTSPKEEAWITVSKRS exon 13

CYP20      Fugu rubripes (pufferfish)
           No accession number
           Scaffold_486
           59% TO CYP20 human
      MLDFAIFAVTFVIVLVGAVLYLYP (0)
      SSRRASGIPGLNPTDEK (2)
11654 DGNLQDIVGRGSLHEFLVSLHQEFGPVASFWFGSRPVVSLGSLQQLRQHINPNHST 11487 (1)
      DSFETMLKSLLGYHSGGGGASTDSIIRKKVYQGAIDTTLKNNFPLVLK (0)
      LVDELVGKWKSFPEDQHTPLCAHQLVLAMKTITQLALGESFSEDARVIAFRKNHDV (0)
      IWSEIGKGYMDGSLEKSTSRKGHYEK (1) 
      ALSEMESTLLSVVKERKSQRNKSVFVDSLIQSTLTERQ
      IMEDCMVFMLAGCAITAN
      VCIWALHFLSTSEEVQDRLYKEFEEVLGSSPVSLEKIPQLR
      YCQQVLNETLRTAKLTPIAARLQEVEGKVDQHLIPKE
      SLVIYALGVILQDSDTWNAPYR
      FDPDRFEEESVKKSFHLLGFSGSQTCPELR
      FAYTVATVLLSVLVRQLKLHRLKDTLMEVRSELVSTPRDETWITFNLRN*

CYP20       Tetraodon nigroviridis (freshwater pufferfish) 
>FS_CONTIG_2529_2 Length = 13425 from http://fugu.hgmp.mrc.ac.uk/blast/
>FS_CONTIG_2529_1 Length = 1009 from http://fugu.hgmp.mrc.ac.uk/blast/
MLDFAIFAVTFVVILVGAVLYLYP
SSRRASGVPGLNPTDEK
XXXXXXXXXXXDIVARGSVQEFLVSLHQEFGPVASFWFGSRPVVSLGSLQQLQQHANPNRSS
DSFETMLKSLLGYHSGGGGASTENIIRKKVYQGAIDATLKNNFPLVLK
LVDELVGKWTSSPEDQHTPLCAHQLVLAMKSITQLALGESFSQDARVVSFRRNYDA
IWSEVGKGFMDGSLERSTSRKGRYEX
ALSEMEATLLSVVKDRKSQRKTSVLVDTLLQSTLTDRQ
IMEDCMVFTLAGCAITAN
VCIWALHFLSSYEDVQDRLHQELEEVLGSGSVSLEKIPQLR
YCQQVLNETVRTAKLTPVAAGLQEVEGKVDQHLIPKE
TLVIYALGVILQDSHTWDAPCR
FHPDRFEEESVRKSFRLLGFSGSQTCPELR
VAYTVATVLLSAVVRQLRLHRLEDTLVEVRSELVSTPREETWITFSRRN

CYP20 Danio rerio (zebrafish)
      Assembled CYP20 from zebrafish ESTs 
      BQ259821 faa04d08.y1 zebrafish fin day3 regeneration
      BM185037 fv16g05.x2 zebrafish adult brain
      BG985721 2543 NICHD Zebrafish normalized I
      BM070720 fu98e06.y1 zebrafish adult brain cDNA
      CA472036 AGENCOURT_10739799 NCI_CGAP_ZKid1 cDNA (kidney?)
      BI981894 fu52c12.y1 zebrafish adult brain Danio rerio cDNA
      BM083017 fu28a11.y1 Campbell zebrafish ovary cDNA MLDFAIFAVTFVIILIGAVLYLYPSSRRASGVPGLNPTEEKDGNLQDIVNKGSLHEFLVG
LHDEFGSVASFWFGARPVVSLGAVNQLRQHINPNWTTDSFETMLKSLLGYQSGSGVGLTE
SMMRKKVYEGAINKTLENNFPLLLQQVEELVDKWASYPKSQHTPLCAHLLGLAMKAVTQL
AMGSRFRDDAEVIRFRKNHEAIWSQ 
IGKGYLDGSLEKSSSRKAHYESALAEMESVLKSVAKQRPGQGSSQSFVNYLLQANLTER 
QVMEDGMVFTLAGCVITANLCIWAVHFLSVSEAVQDRLYHELVEVLGDEPVSLEKIPQLR
YCQQVLNETVRTAKLTPVAARLQEVEGKVDQHIIPKETLVIYALGVVLQDADTWSLPYRF
NPDRFAEESVMKSFSLLGFSGSQACPELRFAYTVATVLL
STLVRRLRMHRVDGQVVEARYELVTTPKDDTWITVSKRN*

CYP20 Danio rerio (zebrafish)
>ctg10765  genomic contig CYP20 74% to fugu 
 9501 MLDFAIFAVTFVIILIGAVLYLYP (0) 9572
      SSRRASGVPGLNPTEEK (2?)
      DGNLQDIVNKGSLPEFLVGLH
      DEFGSVASFWFGARPVVSLGAVNQLRQHINPNWT (1) 10024
12291 TDSFETMLKSLLGYQSGSGVGLTESMMRKKVYEGAINKTLENNFPLLLQ (0) 12439
12929 QVEELVDKWASYPKSQHTPLCAHFL 13003 frameshift
13003 GLAMKAVTQLAMGSRFRDDAEVIRFRKNHEA (0) 13095
15738 IWSEIGKGYLDGSLEKSSSRKAHYES (1?) 15815 AC boundary istead of AG
15897 ALAEMESVLKSVAKQRPGQGSSQSFVNYLLQANLTERQ (0) 16010
16583 VMEDGMVFTLAGCVITAN (1) 16636
17689 LCIWAVHFLSVSEAVQDRLYHELVEVLGDELVSLEKIPQLR (2) 17811
19293 YCQQVLNETVRTAKLTPVAARLQEVEGKVDQHVIPKE (0) 19403
21269 TLVIYALGVVLQDADTWSLPYR (2) 21334
21425 FNPDRFAEESVMKSFSLLGFSGSQACPELR (2) 21514
      FAYTVATVLLSTLVRRLRMHRVDGQVVEARYELVTTPKDDTWITVSKRN* from wz10135.3

CYP20      Oryzias latipes (medaka fish)
BJ524824.1 MF01SSB cDNA Oryzias latipes cDNA
BJ003683.1 BJ003683 MF01SSA cDNA Oryzias latipes cDNA
MLDFAIFAVTFVVILVGAVLYLYPSSRRASGVPGLFPTDEKDGNLQDIVDRGSLHEFLV 
GLHEQFGPVASFWFGRQPVVSLGSVDPLRQHINPNHTTDSFETMLKSLLGYQAGAGGGAN 
ESVMRKKLYESAINNALKNSFPAVLKVAEELVDKWSSVPEDQHIPLCAHLLGLALKTV 
TQLALGERFKDDAEVISFRKNHEAIWSEIGKGYMDGSLEKSSSRKRHYE SA
LSDMEATLLAV AKDRKAQRRQTA FVDALLQSGLTERQIMEDCMVFTLAGC
VITANLCIWALHFLSTAEDVQEKLCQEVEDLFGSDPVSLDRIPQLKYCQQVLNE
TVRTAKLTPVAARLXEVEGKVGQHVIPKETLVIYALG
VVLQDADTWSTPYRFDPDRFQDESARKSFCLLGFSGSQTCPELRFAYTVATVLLATLVRRLKLRPLK
About 26 aa missing at end

CYP20      Salmo salar (Atlantic salmon)
           GenEMBL CA063128.1 ssalrgb509318 mixed_tissue Salmo salar cDNA.
           GenEMBL CB516811.1 ssalrgb509318_rev mixed_tissue Salmo salar cDNA.
           GenEMBL CB513409.1 ssalrgb531212_rev mixed_tissue Salmo salar cDNA.
           BG935303.1 SL1-0624 Atlantic Salmon liver Salmo salar cDNA clone SL1-0624
MLDFAIFAVTFVIFLVGAVLYLYPSSRSASGIPGLNPTEEKDGNLQDIVNRGSLHEF
LASLHGQFGPVASFWFGGRPVVSLGSVDQLRQHINPNRTTDSFETMLKSLLGYQSGTGGG
ATEAVMRKKLYESAVNNTLEKNFPMLLKLVEELVGKWQSFPKDQHTPLCAHLLGLAMKAVTQ
18 amino acids missing here
RKNHEAIWSEIGKGYLDGSMEKSSIRKEHYESA
LAEMETVLMSVAKDRKGQRSQTAFVDTLLQSNLTERQVME
DSMVFTLAGCVITANLCIWAVHFLST
SEDVQEKLHQELEDVLGSEPVSLDKIPQLRYFQQVLNETVRTAKLTPIAARLQENEGKVD
QHIIPKETLVIYALGVVLQDADTWSCPYKFDPDRFTEDSARKSFSLLGFSGNQACPELRF
AYTVATVVLSTVVRQLKLYQVKGQVVEARSELVSTPKDDTWITVSRRS*

21A Subfamily

CYP21A1P    human
            GenEMBL M13935 (3206bp)
            White,P.C., New,M.I. and Dupont,B.
            Structure of human steroid 21-hydroxylase genes.
            Proc. Natl. Acad. Sci. U.S.A. 83, 5111-5115 (1986)
            97% to 21A2 NT_033167.1|Hs6_33343
329415 MLLLGLLLLLPLLAGARLLWNWWKLRSLHLLPLAPGFLHLLQPDLPIYLLGLTQKFGPIYRLHLGLQ 329215
329117 DVVVLNSKRTIEEAMVKKWADFAGRPEPLTCK 329022
       LVSKNYPDLSL
328702 XXWSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCE 328592
       RMRAQPGTPVAIEEEFSLLTCSINCYLTFGDKIK 328383
328294 EDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLR 328193
328091 FFPNPGLRRLKQAIEKRDHNEEKQLRQHK
327835 ESLVAGQWRDMMDYMLQGVAQPSMEEGSGQLLEGHLHMAAVDLLIGGTETTANTLSWAVV
327654 FLLHHPE 327634
       IQQRL*EELDHELGPGASSSRVPYKDRARLPLLNATIAEVLRLWPVV
327292 PLALPHRTTRPS 327257
       SISGYDIPEGTVIIPNLQGAHLDETVWERPHEFWP 327069
326971 DRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPSGDALPSLQPLPH
326791 CSVILKMQPFQVRLQPRGMGAHSPGQNQ 326708

CYP21A1P    human with congenital adrenal hyperplasia
            GenEMBL M26857 (4034bp) X05445
            Rodrigues,N.R., Dunham,I., Yu,C.Y., Carroll,M.C.,
            Porter,R.R. and Campbell,R.D.
            Molecular characterization of the HLA-linked steroid
            21-hydroxylase B gene from an individual with congenital
            adrenal hyperplasia.
            EMBO J. 6, 1653-1661 (1987)

CYP21A1P    human
            GenEMBL S60612 (426bp)
            Collier,S., Tassabehji,M. and Strachan,T. 
            A de novo pathological point mutation at the 21-hydroxylase locus:
            implications for gene conversion in the human genome.
            Nature Genetics 3, 260-265 (1993)

CYP21A2     human
            GenEMBL M21544 M21545 M21547 to M21550 M23224 M23225
            M31022 M31023 (ten segments)
            Globerman,H., Amor-Gueret,M., Parker,K.L., New,M.I. and White,P.C.
            Nonsense mutation causing steroid 21-hydroxylase deficiency
            J. Clin. Invest. 82, 139-144 (1988)

CYP21A2     human
            GenEMBL M12792 M23280 (5141bp)
            Higashi,Y., Tanae,A., Inoue,H., Hiromasa,T., and 
            Fujii-Kuriyama,Y.
            Aberrant splicing and missense mutations cause steroid 21-
            hydroxylase [P-450(C21)] deficiency in humans: possible gene
            conversion products.
            Proc. Natl. Acad.Sci USA 85, 7486-7490 (1988)

CYP21A2     human
            GenEMBL X54940
            Partanen,J. and Campbell,R.D.
            Rapid characterisation of mutant P450c21 Genes by PCR
            unpublished (1993)

CYP21A2     human
            GenEMBL X58898 to X58908 PIR S26485 (97 amino acids)
            PIR S26484 (371 amino acids) PIR S29670 (495 amino acids)
            PIR S29671 (372 amino acids) PIR S26584 (371 amino acids)
            PIR S29673 (372 amino acids)
            Helmberg,A., Tabarelli,M., Dobler,G. and Kofler,R.
            Identification of molecular defects causing congenital adrenal
            hyperplasia by cloning of PCR-amplified 21-hydroxylase genes
            unpublished (1992)

CYP21A2     human with congenital adrenal hyperplasia
            GenEMBL M28548 (4044bp) X05449
            Rodrigues,N.R., Dunham,I., Yu,C.Y., Carroll,M.C.,
            Porter,R.R. and Campbell,R.D.
            Molecular characterization of the HLA-linked steroid
            21-hydroxylase B gene from an individual with congenital
            adrenal hyperplasia.
            EMBO J. 6, 1653-1661 (1987)
            Note: mutant gene, 2 amino acid differences

CYP21A2     human with congenital adrenal hyperplasia
            GenEMBL M26856 X05448 (4042bp)
            Rodrigues,N.R., Dunham,I., Yu,C.Y., Carroll,M.C.,
            Porter,R.R. and Campbell,R.D.
            Molecular characterization of the HLA-linked steroid
            21-hydroxylase B gene from an individual with congenital
            adrenal hyperplasia.
            EMBO J. 6, 1653-1661 (1987)
            Note: normal gene

CYP21     rat
            GenEMBL U56853(1964bp)
            Zhou,M.Y., Vila,M.C., Gomez-Sanchez,E.P. and Gomez-Sanchez,C.E.
            Cloning of two alternatively spliced 21-hydroxylase cDNAs in the
            rat adrenal.
            unpublished (1996)
MLLPGLLLLLLLLLLAGTRWLWGQWKLWKLRLPPLAPGFLHFLQPNLPVYLFGLAQKLGP
IYRIRLGLQDVVVLNSNKTIEEALIQKWVDFAGRPQILDGKMNFDLSMGDYSLTWKAHKK
LSRSALVLGMRDSMEPLVEQLTQEFCERMRAQAGASVAIHKEFSLLTCSIISCLTFGDKQ
DSTLLNATHSCVRDLLKAWNHWSVQILDIIPFLRFFPNPGLWKLKQFQESRDHIVMQELK
RHKDSLVAGQWKDMIDYMLQGVEKQRDARDPGQLHERHVHMSVVDLFVGGTETTAATLSW
AVAFLLHHPEIQKRLQEELDLKLAPSSQLLYKNRMQLPLLMATIAEVLRLRPVVPMALPH
RATKASSISGYDIPKDTIIIPNIQGANLDEMVWELPSKFWPDRFLESGKSPRIPTFGCGA
RVCLGEPLARLEFFVVLARLLQTFTLLPPPDGTLPSLQPLPYTGINLLIPPFQVRLQPRN
LAPQDQGQK

CYP21    rat
            GenEMBL U56854
            alternatively splice form

CYP21       pig
            GenEMBL S53049 M83939 (4792bp) Swiss Q02390 (492 amino acids)
            PIR S28169 (492 amino acids)
            Burghelle-Mayeur,C., Geffrotin,C. and Vaiman,M.
            Sequences of the swine 21-hydroxylase gene (Cyp21) and 
            a portion of the opposite-strand overlapping gene of unkown
            function previously described in human.
            Biochim. Biophys. Acta 1171, 153-161 (1992)

CYP21       sheep
            PIR A43349 (497 amino acids) B43349 (479 amino acids)
            Crawford, R.J., Hammond, V.E., Connell, J.M. and Coghlan, J.P.
            The structure and activity of two cytochrome P450c21 proteins
            encoded in the ovine adrenal cortex.
            J. Biol. Chem. 267, 16212-16218 (1992)
            B43349 is truncated early after alternate splicing

CYP21       Fugu rubripes (pufferfish)
            No accession number
            Scaffold_15
            44% to CYP21 human over 454 amino acids
            this is the first fish CYP21 sequence found
      MCSFLFCSSFSAPPAVKRNLPLSLWQLPLRPSSPPIPGPPCRFLIGNMTE
28230 LMHDHLPIHLTNLAKRYGNIYRLKCGNTT 28144 (1) 
28058 AMIVLNSSDIIREALVKKWSDFAGRAVSYT 27969 (1)
27888 ADIVSGGGRNISLGDYTEEWKALRRLVHGALQRCCKHSLHNVIERQALQLRK 27754 (0)
      VLVDYRGGAVDLSEDFTVAASNVIITLVFGKE (0)
      YDKSSSELQQLHRCLNEIVALWGSTWISALDTFPLLR (0)
      KFPNPVFSRLLREVSRRDEIIRKHLNQFK (0)
      CVLCCAQSEGHRRTDVITGSLLEG (0)
26896 VLTDMHVHMATVDLLIGGSETTAAWLNWTVAFLLHRPE (0) 
      FQTKVYEELCTVLEGRYPKYSDRQRLPILCSLIHEVLRLRPVAPLAVPHKAIRDS (2) 
      SIAGYFIPRNTIIIPNLFGAHHDPEVWSDPYSFKP (1)
26238 ERFLEGGGGSTRALIPFGGGARLCLGETVAKMELFLFTAYLLRDFCFVLPDSEAPLPDLR 26059
26058 GVASVVLKIKSFTVIARPRTGP* 25990

CYP21      Tetraodon nigroviridis (freshwater pufferfish)
           GenEMBL AL281449.1 C0BG094DE12LP1 G Tetraodon nigroviridis 
           genomic clone 094J24 T7.Length = 895
           AL233853.1 C0BG007BE04XD1 G Tetraodon nigroviridis 
           genomic clone 007I08 T7.Length = 1079
           86% to Fugu CYP21
MGCIFFFFYLPFSAPPAVKRSLLQSLCGLLHRPSSPSIPGPPCRFLIGNMTE
LMQDHLPIHLTDLAKRYGNIYRLKCGNTS
AMVVLSSGDVIREALVKKWSDFAGRSVSYT
ADIVSGGGRTISLGDYTEEWKAHRRLVHSAL (frameshift) ERCXKQSLHDVIERQALQLRK
Missing exon 4 and part of exon 5
                 GSAWISALDTFPLLR
KFPNPVFSRLLREVTRRDEIIRKHLNQYK
CVLCCVQSQDNKSTDVITGSLLEG
VLTDVHVHMATVDLLIGGTETTAAWLNWTVAFLLHRPE
IQTKVYEELCTVLEGRYPKYSDRHRLPVLCSLVHEVLRLRPVAPLAVPHKAVRDS
SIAGYFIPKNTIIIPNLFGAHHDPXVWPDPYSFXX
Missing exon 11

22A Subfamily

CYP22A1     Caenorhabditis elegans (nematode worm)
            GenEMBL U39648 (28581 bp) AF407572, NM_171699, NM_171698
            cosmid T13C5.1
            Jia,K., Albert,P.S. and Riddle,D.L.
            DAF-9, a cytochrome P450 regulating C. elegans larval development
            and adult longevity
            Development 129 (1), 221-231 (2002)
            daf-9 mutant This gene may be in the pathway to 
            synthesize a ligand for DAF-12, a nuclear receptor.  
            Daf-9 has a role in larval development.
            DAF comes from abnormal DAuer Formation

23A Subfamily

CYP23A1     Caenorhabditis elegans (nematode worm)
            GenEMBL U39472 (42282 bp)
            see CYP14A1 for reference
            cosmid B0304.3

24A Subfamily

CYP24       rat
            GenEMBL L04608 to L04619, S52625 to S52636
            Ohyama,Y., Noshiro,M., Eggertsen,G., Gotoh,O., Kato,Y.,
            Bjorkhem,I. and Okuda,K.
            Structural characterization of the gene encoding rat
            25-hydroxyvitamin D3 24-hydroxylase.
            Biochemistry 32, 76-82 (1993)

CYP24       rat
            GenEMBL Z28351 (1419bp)
            Hahn,C.N., Kerry,D.M., Omdahl,J.L. and May,B.K.
            Identification of a vitamin D responsive element in the promoter of 
the 
            gene for the rat 25-hydroxyvitamin D3 24-hydroxylase
            Nuc. Acids Res. 22, 2410-2416 (1994)

CYP24       rat
            GenEMBL X59506 (3209bp)
            Ohyama,Y., Noshiro,M. and Okuda,K.
            Cloning and expression of cDNA encoding 25-hydroxyvitamin D3 24- 
            hydroxylase.
            FEBS Lett. 278, 195-198 (1991)

CYP24       rat
            GenEMBL D17792 (?bp)
            Ohyama, Y., Ozono,K., Uchida,M., Shinki,T., Kato,S., Suda,T., 
Yamamoto,O., 
            Noshiro,M. and Kato,Y.
            Identification of a vitamin D-responsive element in the 5'-flanking 
region of the rat 
            25-hydroxyvitamin D3 24-hydroxylase gene.
            J. Biol. Chem. 269, 10545-10550 (1994)

CYP24        rat
            GenEMBL U03112 (?bp)
            Zierold,C., Darwish,H.M. and DeLuca,H.F.
            Identification of a vitamin D-response element in the rat calcidiol 
            (25-hydroxyvitamin D3) 24-hydroxylase gene.
            Proc. Natl. Acad.Sci. USA 91, 900-902 (1994)

CYP24       human 
            GenEMBL S67623 (776bp)
            Labuda,M., Lemieux,N., Tihy,F., Prinster,C. and Glorieux,F.H.
            Human 25-hydroxyvitamin D 24-hydroxylase cytochrome P450
            subunit maps to a different chromosomal location than that of
            pseudovitamin D-deficient rickets. 
            J. Bone Miner. Res. 8, 1397-1406 (1993)

CYP24       human
            GenEMBL L13286 (3254bp)
            Chen,K.-S., Prahl,J.M. and DeLuca,H.F.
            Isolation and expression of 1,25-dihydroxyvitamin D3
            24-hydroxylase cDNA.
            Proc. Natl. Acad. Sci. USA 90, 4543-4547 (1993)

CYP24       human
            NM_000782
MSSPISKSRSLAAFLQQLRSPRQPPRLVTSTAYTSPQPREVPVC
PLTAGGETQNAAALPGPTSWPLLASLLQILWKGGLKKQHDTLVEYHKKYGKIFRMKLG
SFESVHLGSPCLLEALYRTESVPQRLEIKPWKAYRDYRKEGYGLLILEGEDWQRVRSA
FQKKLMKPGEVMKLDNKINEVLADFMGRIDELCDERGHVEDLYSELNKWSFESICLVL
YEKRFGLLQKNAGDEAVNFIMAIKTMMSTFGRMMVTPVELHKSLNTKVWQGHTLAWDT
IFKSVKACIDNRLEKYSQQPSADFLCDIYHQNRLSKKELYAAVTELQLAAVETTANSL
MWILYNLSRNPQVQQKLLKEIQSVLPENQRPREEDLRNMPYLKACLKESMRLTPGVPF
TTRTLDKATVLGEYALPKGTVLMLNTQVLGSSEDNFEDSSQFRPERWLQEKEKINPFA
HLPFGVGKRMCIGRRLAELQLHLALCWIVRKYDIQATDNEPVEMLHSGTLVPSRELPI
AFCQR

CYP24       Fugu rubripes (pufferfish)
            No accession number
            Scaffold_4128
MLWRLRGALTLPPELTVLDAIPGPTNWPLVGSLFELLRKGGLTRQHEAL
VDYHKKFGKIFRLKLGSFESVHIGAPCLLESLYRTEGSYPQRLEIKPWTAYRDMRDEAYGLLIL
EGKDWQRVRRAFQQKLMKPTEVVKLDRKINE
VLEDFVSRIGKTNIGGKIEDLYFELNKWSFES
ICLVLYDKRFGLLQDKVNEEAMNFITAVKT
MMSTFGLMMVTPVELHKSLNTKTWQDHTAAWDRIFST
AKVYIDKKLKRNSVIAPDDLIGDILHQSRLSKKELYAAITELQIGGVET
8608 TANSMLWAIFNLSRNPGAQRRLLEEIRTVVPPEQDPCGEHIKSMPYLKACLKESMR 8441 (2)
7830 ISPSVPFTSRTLDKDTVLGDYAIPKG 7753
TVLMINSHALGSSEDYFDDGKKFKPERWLREHGTINPFAH
VPFGIGKRMCIGRRLAELQMSLFLQLVRDFE
IVATDNEPLDVIHSGLLVPNRELPVAFIKR

25A Subfamily

CYP25A1     Caenorhabditis elegans (nematode worm)
            GenEMBL Z66495 ( 40145bp)
            see CYP14A1 for reference
            cosmid C36A4.1

CYP25A2     Caenorhabditis elegans (nematode worm)
            GenEMBL Z66495 ( 40145bp)
            see CYP14A1 for reference
            cosmid C36A4.2

CYP25A3     Caenorhabditis elegans (nematode worm)
            GenEMBL Z66495 ( 40145bp)
            see CYP14A1 for reference
            cosmid C36A4.3

CYP25A4       Caenorhabditis elegans (nematode worm)
            GenEMBL Z66495 ( 40145bp)
            see CYP14A1 for reference
            cosmid C36A4.6

CYP25A5       Caenorhabditis elegans (nematode worm)
            GenEMBL AF038613
            see CYP14A1 for reference
            F42A6

CYP25A6P       Caenorhabditis elegans (nematode worm)
            GenEMBL U50072
            K06B9.1
            missing C-terminal

26A Subfamily

Cyp26a1       mouse
            No accession number
            Jim Ray
            submitted to nomenclature committee
            Note: new family in mammals, homolog to human ESTs R51129 and R21282

Cyp26a1      mouse
            No accession number
            Martin Petkovich
            submitted to nomenclature committee
            Note: new family in mammals, homolog to human ESTs R51129 and R21282

Cyp26a1     mouse
            GenEMBL Y12657
            Fujii, H., Sato, T., Kaneko, S., Gotoh, O., Fujii-Kiriyama, Y., 
Osawa, K., 
            Kato, S. and Hamada, H.
            Metabolic inactivation of retinoic acid by a novel P450 
differentially expressed in 
            developing mouse embryos.
            EMBO J. 16, 4163-4173 (1997)
            Note: new family in mammals, homolog to human ESTs R51129 and R21282

CYP26A1     human
            GenEMBL NM_000783
            White,J.A., Beckett-Jones,B., Guo,Y.D., Dilworth,F.J., Bonasoro,J.,
            Jones,G. and Petkovich,M.
            cDNA cloning of human retinoic acid-metabolizing enzyme (hP450RAI)
            identifies a novel family of cytochromes P450
            J. Biol. Chem. 272 (30), 18538-18541 (1997)
            Note: new family in mammals, equal to human ESTs R51129 and R21282

CYP26A1     human
            NM_000783 
MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPL
PPGTMGFPFFGETLQMVLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL
GDDRLVSVHWPASVRTILGSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEV
GSSLEQWLSCGERGLLVYPEVKRLMFRIAMRILLGCEPQLAGDGDSEQQLVEAFEEMT
RNLFSLPIDVPFSGLYRGMKARNLIHARIEQNIRAKICGLRASEAGQGCKDALQLLIE
HSWERGERLDMQALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREELKSKG
LLCKSNQDNKLDMEILEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELNGYQIPKGW
NVIYSICDTHDVAEIFTNKEEFNPDRFMLPHPEDASRFSFIPFGGGLRSCVGKEFAKI
LLKIFTVELARHCDWQLLNGPPTMKTSPTVYPVDNLPARFTHFHGEI

CYP26A1     zebra fish
            GenEMBL U68234 
            White, J.A., Guo, Y.-D., Baetz, K., Beckett-Jones, B., Bonasoro, J.
            Hsu, K.E., Dilworth, F.E., Jones, G. and Petkovich, M. 
            Identification of the retinoic acid-inducible all trans retinoic 
acid 4-hydroxylase.
            J. Biol. Chem. 271, 29922-29927 (1996) 
            Note: new family in vertebrates, homolog to human ESTs R51129 and 
R21282

CYP26A1     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_12575
            67% to 26A1 human
            missing C-term zebrafish seq shown in lower case 
2144 MAVSALLATFLCTIVLPLLLFLVTVKLWEVY (frameshift)
     VIRERDSACP (frameshift)
     SPLPPG (frameshift)
     TMGLPFIGETLQLILQ (0)
1689 RRKFLRMKRQKYGYIYRTHLFGNPTVRVTGANNVRHILLGEHRLVAV 1552
1551 QWPASVRTILGSDTLSNVHGAQHKTKKK 1465 (0)
1230 AIMQAFSREALEFYIPAMQHEVQAAVQEWLAKDSCVLVY
     PEMKRLMFRIAMQILLGFQLE 1051 (frameshift)
1049 QIKTDEQKLVEAFEEMIKNLFSLPIDMPFSGLYR 948 (0)
784  GLKARNFIHAKIEENIKRKLRESNSDSKCRDALQQLIDSSKKSGQVLSMQ 635 (0)
548  VLKESATELLFGGHETTASTATSLIMFLGLNPEVLDKLRHELSDKVMHKGF 396 (1)
329  LDLRSLNLETLEQLKYTSCVIKETLRMNPPVPGGFRVALKTFELG 195 (0)
100  GYQIPKGWNVIYSICDTHDVAEIFP 26 (frameshift)
 24  NKEDFQPE 1 end of Scaffold_12575
     RFMMKNCGDSSRFQYIPFGGG 
     srmcvgkefakvllkiflveltqhcnwilsngpptmktgptiypvdnlptkftsyvrn

CYP26A1    Xenopus laevis
           GenEMBL AF057566
           Hollemann,T., Chen,Y., Grunz,H. and Pieler,T.
           Regionalized metabolic activity establishes boundaries of retinoic
           acid signalling.
           EMBO J. 17, 7361-7372 (1998)

CYP26B1    human
           GenEMBL AC007002
           Nelson, D.R. A second CYP26 P450 in humans and zebrafish: CYP26B1
           Archives of Biochemistry and Biophysics 371, 345-347 (1999)
MLFEGLDLVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKSCKLPIPKGSMGFPLIGETGHWLLQ
GSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILMGEHHLVSTEWPRSTRMLLGPNTVSNS
IGDIHRNKRKVFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQ
KLTFRMAIRVLLGFSIPEEDLGHLFEVYQQFVDNVFSLPVDLPFSGYRR
GIQARQILQKGLEKAIREKLQCTQGKDYLDALDLLIESSKEHGKEMTMQELKDGTLELIF
AAYATTASASTSLIMQLLKHPTVLEKLRDELRAHGILHSGGCPCEGTLRLDTLSGLRYLD
CVIKEVMRLFTPISGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDP
DRFSQARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVLAVELASTSRFELATRTFPRI
TLVPVLHPVDGLSVKFFGLDSNQNEILPETEAMLSATV

CYP26B1     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_4267
            74% to 26B1 human
8448  MLFDSFDLVSALATLAACLVSMALLLAVSQQLWQLRWTATRDRNCKLPMPKGSMGFPFIGETCHWLLQ 8651
17930 GSGFHASRRQKYGNVFKTHLLGRPLIRVTGAENIRKVLMGEHTLVTVDWPQSTSTLLGPNSLA 18118
18119 NSIGDIHRKKRK 18154
1487  VFAKVFSHEALESYLPKIQQVIQESLRVWSSNPEPINVYR 1606 (2)
1689  ESQRLSFTMAVRVLLGFRVSEEEMKHLFSTFQDFVDNLFSLPIDLPFSGYRK 1844 (0)
1921  GIRARDTLQKSIEKAIREKPLCSQGKDYSDALDVLMESAKENGSELTMQELK 2076 (0) exon 4
2851  ESTIELIFAAFATTASASTSLIMQLLRHPPVLERLREELRARGL exon 5,6 fused
      LHNGCLCPEGELRLDTIVSLKYLDCVIKEVLRLFTPVSGAYRTAMQTFELD 3135 (0) exon 5,6 fused
3271  GVQIPKGWSVMYSIRDTHDTSTVFKDVDVFDPDRFSQERGEDKEGRFHYLPFGGGVRSCLGKQLA exon 7
      TLFLRILAIELASTSRFELATRQFPRVITVPVVHPVDGLKVKFYGLDSNQNEIMAKSEELLGAAV* 3663 exon 7

CYP26C1    human 
           GenEMBL AL358613.11 May 2, 2001
           522 amino acids, 6 exons, (0) = phase 0 intron
           52% to 26B1 human, also 15 amino acid insertion in exon 5 vs. 26B1
MFPWGLSCLSVLGAAGTALLCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFGETLHWLVQ (0)
GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQSAHILLGSHTLLGAVGEPHRRRRK (0)
VLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVSVYDASKALTFRMAARILLGLRL
DEAQCATLARTFEQLVENLFSLPLDVPFSGLRK (0)
GIRARDQLHRHLEGAISEKLHEDKAAEPGDALDLIIHSARELGHEPSMQELK (0)
ESAVELLFAAFFTTASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGGSEGPPPD
CGCEPDLSLAALGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD (0)
GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGAAREDSRGASSRLHYIPFGGGARSCLG
QELAQAVLQLLAVELVRTARWELATPAFPAMQTVPIVHPVDGLRLFFHPLTPSVAGNGLCL*

CYP26C1     mouse 
            GenEMBL AC110212.1
            84% to 26C1 human exon 5
ELAVELLFAAFFTTASASTSLILLLLQHPAAITKIQQELSAQGLGRACTCTPRASGSPPDCGCEPDLSLAMLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD

CYP26C1     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_11741
            Equally similar to 26B1 and 26C1 human
            But C-terminal is 68% to 26C1 while 58% to 26B1
Lower case region very poor match may not be correct exon structure here.
6362 MLGLVSALATALTTLLLLLLLLALTRQLWSFRWSLTRDRRCELPLPKGSMGWPLVGETFQWLFQ 6171 (0)
5571 GSNFHISRRKRHGNVFKTHLLGKPLVRVTGAENIRKILLGEHSLVCTQWPQSTRIILGPN 5392
5391 ALVNSIGELHKRKRK 5347 (0)
4963 ILAKVFSRKALESYLPRLQEVIKCEIAKWCAEPGSVDVYAATRSLTFRIAIGVLLGLHL 4781
4780 EEERIDYLAQIFGQLMSNLFSLPIDAPFSGLRK (0)
3972 GIKARKILHANMEKIIEKKMERQQEEEEYRDAFDYMLSTSKEQGQQISIQELK 3814 (0)
3581 ETAVELIFAAHSTTASAATSLVLQLLHHPEVVERVRVELEAQKLcynslnlpsqa 3417 (1)
3377 ctfpqsqchasnLSLDKLNQLHYIDCVIKEVLRFLPPVSGGYRTALQTFELD 3222 (0)
2770 GYQIPKGWTVMYSIRDTHETAEIFQNPELFDPDRFVTAQVESRSSRFSYVPFGGGVR 2600
2599 SCVGKELAQIILKTLTIELIRTCKWTLATEKFPKMQTVPIVHPVNGLHVNFMYKNLHEIDH* 2414 

27A Subfamily

CYP27A1       human
            Swiss Q02318 (531 amino acids)
            Cali J.J., Russell D.W.
            Characterization of the human sterol-27-hydroxylase: A mitochondrial 
P450
            that catalyzes multiple oxidation reactions in bile acid 
biosynthesis.
            J. Biol. Chem. 266, 7774-7778 (1991)

CYP27A1     human
            GenEMBL X59812 (2107bp)
            Guo,Y., Strugnell,S., Back,D.W. and Jones,G.
            Transfected human liver cytochrome P-450 hydroxylates vitamin D
            analogs at different side-chain positions
            Proc. Natl. Acad. Sci. U.S.A. 90, 8668-8672 (1993)

CYP27A1     human
            NM_000784
MAALGCARLRWALRGAGRGLCPHGARAKAAIPAALPSDKATGAP
GAGPGVRRRQRSLEEIPRLGQLRFFFQLFVQGYALQLHQLQVLYKAKYGPMWMSYLGP
QMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDQHDLTYGPFTTEGHHWYQLRQA
LNQRLLKPAEAALYTDAFNEVIDDFMTRLDQLRAESASGNQVSDMAQLFYYFALEAIC
YILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPVLPFWKRYLDGWN
AIFSFGKKLIDEKLEDMEAQLQAAGPDGIQVSGYLHFLLASGQLSPREAMGSLPELLM
AGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHMPLLKAVLKE
TLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPTAFSEPESFQPHRWL
RNSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPETGEL
KSVARIVLVPNKKVGLQFLQRQC

CYP27A1     rat
            GenEMBL M73231 (2300bp)
            Shayiq,R.M. and Avadhani,N.G.
            Sequence complementarity between the 5'-terminal regions of mRNAs 
for 
            rat mitochondrial cytochrome P-450c27/25 and a growth hormone-
inducible 
            serine protease inhibitor: a possible gene overlap.
            J. Biol. Chem. 267, 2421-2428 (1992)

CYP27A1    rat
           GenEMBL U17363, U17369 to U17376 genomic sequence
           Mullick, J., Addya,S., Sucharov,C. and Avadhani,N.G.
           Localization of a transcription promoter within the second exon of
           the cytochrome P-450c27/25 gene for the expression of the major
           species of two-kilobase mRNA.
           Biochemistry 34, 13729-13742 (1995)

CYP27A1     rabbit
            PIR A90152 (21 amino acids)
            Dahlbaeck, H.
            Characterization of the liver mitochondrial cytochrome P-450
            catalyzing the 26-hydroxylation of 5beta-cholestane-3alpha,
            7alpha,12alpha-triol.
            Biochem. Biophys. Res. Commun. 157, 30-36 (1988)

            PIR A90155 (21 amino acids)
            Dahlbaeck, H.
            Biochem. Biophys. Res. Commun. (1989) 159:370

CYP27A1     pig
            no accession number
            Kjell Wikvall
            77% identical to human

 CYP27A1    Fugu rubripes (pufferfish)
            No accession number
            Scaffold_3437
            46% to 27A1 missing N-terminal exon upstream of scaffold
136  (0) ILEKGRYGPIYRNGMNAVSVSTAKLLGEVLRNDDKFPNRGDMSIWKEYRDLRGYGYGPFTE 321 (2)
536  KDERWYNLRAVLNKRMLRPKDALQYGDTIGEVVTDFIRRIYFLRQRSPTGDVVTDLNNELYHFSLE 733 (1)
816  AIASILFETRLGCLEEEIPTGTQDFINAISQMFSNNFQVFLMPKWSRGVLPYWRRYVAGWDGIFSF 1013 (1)
1206 ATRLIDRKMEFIQQHLDNNQNVEGEYLTYLLSNTQMSIKDVYGSVSELLLAGVDT 1370 (0)
1465 TSNTLTWTLHLLSKYPQCQEILFKEVSTSVPADRAPSAEEVTRMPYLRAVVKESLR 1632 (2)
1756 MFPVIPMNGRILADKDVMIGGYQFSKN 1836 (0)
1949 TAFNFSHYAIGRDEDTFPEPATFMPERWLQDSHNRPNAFGAIAFGFGVRGCVGRRIAELEMYSFLCH 2149 (0)
2308 LMRHFEIKPDPKMGELKSVCRTVLIPDKPVSLRFLDRGSGHAA* 2439

CYP27A2     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_697
            53% to 27A1 mouse
29906 MASFTALRCAAIGARNSALRPATLPSRNLNLQATSEAANLKGIADLPGPNTYKILYWLFVKGYGERSHLLQ 30118 (0)
30735 GKLKNIYGPMWRWKLGPYDFVSVASPELIARVIQQEGRYPVRVQLPHWKEYRDLRGQAYGLHVE 30926 (2)
31027 TGPEWSRLRSALKPRMLKLREVVALSPDESR 31119 frameshift EVDGDLL
      29 aa of this exon missing in region of poor seq and a small sequence gap
31334 GISAILFETRLGCLGEKVDPNVQRFISGVNDMLSLSDITYLFPRWTRSFVPVWKRFAQAWDDISDV 31531 (1)
31614 ASSLIDRRIAEIDARVANGQSVEGLYLTYLLSSDKMSRAEISTCITDLLLGGVDT 31778 (0)
32763 TSNTLSWALYHLAKDPVAQDRLYDEVNSVCPNHHQPTTDDLANMPFLKAVIKEVLR 32930 (2)
33004 LYPVVHQNARFISENDVILNDYWFPKK (0)
      TQFHLCHYSVCHDETQFKHAERFLPERWLRHSAPLSGYYQHHPYSFIPFGVGVRACVGKRVAELEMXXXXXX 33348
      this exon runs into a sequence gap
      YFALTR (0) this is a Fugu seq for the end of the upper exon that is no 
      longer in the databases.  The last exon is missing

CYP27A3     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_6002 Length = 16767 
= LGS139924.x1 57% to 27A1 I-helix
= LGS125183.x1 Cyp27a1 
First 16 aa and 49-81 supported by EST from AU050037 Paralichthys olivaceus
aa 49 on also supported by AW343479 zebrafish EST
     MFRNRLLTVGLRASVPHREGLHRTAVNYAGARRRHASSAATEITEHNVR
     QKTMEDLGGPSFLTTLNWLFLKGYLPKTQQMQ (0)
4619 VEHSKIYGPLWKSKYGPMVVVNVASADLIEQVLRQEGRHPVRTDMPHWRRYRALRNQAHGPLTE 4428
     exons 3 and 4 in a sequence gap
1891 AEQMVQKKMEEIQNKVDLHQDVEGAYLTHLLLSEKMTVTEILGSITELLLAGVDT 1715
1661 TSNTISWALYQLAQNPSIQDQLYHEVRSVCPGNKMPDSDDIAQMPYLKAVIRETLR
     LYPVVPGNARVTVDKEIVVGGYLFPKQ (0)
754  TLFHLCHYCVSHDENIFPNSRVFQPERWLRGREEKSKQHPFGSVPFGFGVRACLGRRV 581
580  AELEMYLLLSRVRAVQTGLTAAGEGNQEVL* 

>CYP27A fragment a  Fugu rubripes (pufferfish)
                    No accession number
a may be exon 4 of 27A3 
LPC42076.x1 39% to 27A1 202-276 79% to LPC42075.x1 gene duplication? Exon 4
GISSVLFESRMGCLNDEVPEETQKFIYSVGEMFRLSAVVVLFPQSVWPYLPLWKKFVAAWDYLFKV

>CYP27A fragment b  Fugu rubripes (pufferfish)
                    No accession number 
may be exon 4 of 27A3 
LPC42075.x1 35% to 27A1 79% to LPC42076.x1 gene duplication? Exon 4
GISSVLFESRLGCLNDEVPEETRRVIYSVGEMCRLSAVVVLFPQSGWPYLPVWARLGAAGDYLLQF

27B Subfamily

CYP27B1 rat
            GenEMBL AF000139
            St-Arnaud,R., Messerlian,S., Moir,J.M., Omdahl,J.L. and
            Glorieux,F.H.
            The 25-hydroxyvitamin D 1-alpha-hydroxylase gene maps to the
            pseudovitamin D-deficiency rickets (PDDR) disease locus
            J. Bone Miner. Res. 12, 1552-1559 (1997)
            previously named CYP40. This sequence has errors in it.

CYP27B1 rat
            GenEMBL AB001992
            Cloning and expression of rat 25-hydroxyvitamin D3-1-alpha-
hydroxylase cDNA .
            Shinki, T., Shimada, H., Wakino, S., Anazawa, H., Hayashi, M.,
            Saruta, T., DeLuca, H.F. and Suda, T.
            Proc. Natl. Acad. Sci. USA  94, 12920-12925 (1997)
            Note: this sequence has been called CYP27B1 in this paper.  The name 
CYP40 was        
            given in May 1997 based on the sequence from John Omdahl that was 
not  
            completely accurate at the time of submission for a name.  It was 
necessary to 
            revise the name CYP40 to CYP27B1.

CYP27B1      mouse
          GenEMBL AB006034
          Takeyama, K-i., Kitanaka, S., Sato, T., Kobori, M., Yanagisawa, J. and 
Kato, S.
          25-hydroxyvitamin D3 1 alpha hydroxylase and vitamin D synthesis.
          Science 227, 1827-1830 (1997)

CYP27B1        human
           GenEMBL AB005989 cDNA sequence
           Takeyama,K., Kitanaka,S., Sato,T., Kobori,M., Yanagisawa,J. and 
Kato,S.
           25-Hydroxyvitamin D3 1alpha-hydroxylase and vitamin D synthesis
           Science 277, 1827-1830 (1997)

CYP27B1      human
            GenEMBL AB005990 gene sequence
            Murayama,A., Kitanaka,S., Takeyama,K. and Kato,S.
            Human 25-hydroxyvitamin D3 1alpha-hydroxylase gene
            Unpublished (1997)

CYP27B1     human
            GenEMBL AB005038 cDNA sequence AB006987 gene sequence
            Monkawa,T., Yoshida,T., Wakino,S., Shinki,T., Anazawa,H.,
            Deluca,H.F., Suda,T., Hayashi,M. and Saruta,T.
            Molecular cloning of cDNA and genomic DNA for human
            25-hydroxyvitamin D3 1alpha-hydroxylase
            Biochem. Biophys. Res. Commun. 239, 527-533 (1997)

CYP27B1     human
            GenEMBL AF020192 cDNA sequence
            Fu,G.K., Lin,D., Zhang,M.Y., Bikle,D.D., Shackleton,C.H.,
            Miller,W.L. and Portale,A.A.
            Cloning of human 25-hydroxyvitamin D-1alpha-hydroxylase and
            mutations causing vitamin D-dependent rickets type 1
            Mol. Endocrinol. 11, 1961-1970 (1997)

CYP27B1     human
            GenEMBL AF0027152 gene sequence
            Fu,G.K., Portale,A.A. and Miller,W.L.
            Complete structure of the human gene for the vitamin D 1alpha-
hydroxylase,  
            P450c1alpha.
            DNA Cell Biol. 16, 1499-1507 (1997)

CYP27B1     human
            NM_000785
MTQTLKYASRVFHRVRWAPELGASLGYREYHSARRSLADIPGPS
TPSFLAELFCKGGLSRLHELQVQGAAHFGPVWLASFGTVRTVYVAAPALVEELLRQEG
PRPERCSFSPWTEHRRCRQRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAGTLNN
VVCDLVRRLRRQRGRGTGPPALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDT
ETFIRAVGSVFVSTLLTMAMPHWLRHLVPGPWGRLCRDWDQMFAFAQRHVERREAEAA
MRNGGQPEKDLESGAHLTHFLFREELPAQSILGNVTELLLAGVDTVSNTLSWALYELS
RHPEVQTALHSEITAALSPGSSAYPSATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPD
KDIHVGDYIIPKNTLVTLCHYATSRDPAQFPEPNSFRPARWLGEGPTPHPFASLPFGF
GKRSCMGRRLAELELQMALAQ 
ILTHFEVQPEPGAAPVRPKTRTVLVPERSINLQFLDR

CYP27B1     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_470
            52% to 27B1 human
      MLQQALRVSCRSASPLVKWMERWAECASARPQAVKPLGDMPGPSVASFAWDLFAKRGLSRLHELQ (0)
      LEGVRRYGPMWKASFGPILTVHVADPALIEQVLRKEGQHPMRSDLSSWKDYRRLRGHHYGLLTS (2)
51430 EGEEWQSIRSLLGKHMLRPKAVEAYDQTLNSVVDDLITKLRLRRSSQGLVTDIASEFYRFGLE 51630 (1)
51726 GVSSVLFESRIGCLDKIVPEETERFIQCINTMFVMTLLTMAMPSWMHQLFPKPWNVFCQCWDYMFDF (1)
      AKGHIDQRMAAEAEKIARGEEVEGRYLTYFLSRTSLPMKTVYSNVTELLLAGVDT (0)
52280 ISSTLSWSLYELSRHQAVQASLREEVLSVLGGRRVPTAADVAQMPLLKATIKEVLR 52444 (2)
52527 LYPVIPANARVITERDIQVGGYLIPKN 52610 (0)
52697 TLITLCHYATSRDPAVFPRPDEFLPQRWLNKEQSHHPYASVPFGVGKRSCIGRRIAELELYLAVAR 52894 (0) 
53237 ILLEFDIKPDPEGISVKPMTRTLLVPENVINLQFTER* 53347

CYP27C1     human
            GenEMBL AC027142
            BM562765 BI459427 ESTs 
            43% identical to 27A1 assembled gene
intron starting with QIH ending in VDT is from Celera's data
CRA_Gene|hCG42613 /len=10487.  This Celera sequence is still missing the C-terminal. Probable last exon is now found in AC027142.  AG Intron boundary is in the same location as CYP26B1.  Stop codon is one codon away from 26B1s stop codon.  Length is preserved from cys to intron. (n) = intron phase, 9 exons

  1  85452 MQTSAMALLARILRAGLRPAPERGGLLGGGAPRRPQPAGARLPAGARAEDKGAGRPGSPPG 85634 61
 62  85635 GGRAEGPRSLAAMPGPRTLANLAEFFCRDGFSRIHEIQ (0) 85748 99
100  39574 QKHTREYGKIFKSHFGPQFVVSIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISA (2) 39371 163
164  43984 EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSME (1) 43787 229 
230  41743 GVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFC 41564 290
291  41563 RSWDGLFKFS 41534 300 (1)
301        QIHVDNKLRDIQYQMDRGRRVSGGLLTYLFLSQALTLQEIYANVTEMLLAGVDT (0) 354 (Celera sequence)
355 110201 TSFTLSWTVYLLARHPEVQQTVYREIVKNLGERHVPTAADVPKVPLVRALLKETLR (2) 110034 410
411 108566 LFPVLPGNGRVTQEDLVIGGYLIPKG (0) 108489 436
437 108006 TQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAELEIHLVVIQ (0) 107794 504
505 102503 LLQHFEIKTSSQTNAVHAKTHGLLTPGGPIHVRFVNRK* 102619 542

CYP27C1     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_1410
            75% to 27C1 human
      MSVMNKLTTTCWTNFYGDRRNKQMLFVLRCLHKSATSGTFGVAREEPLPERLITSTDATKKRLPKT
19233 LAEMPGPGTISNLFEFFWRDGFSRIHEIQ 19319 (0)
20076 IEHSKMYGKIFKSRFGPQLVVSVADRDLVAEVLRAE
      GVAPQRANMESWHEYRDMRRRSTGLISA 20261 (2)
      EGEDWLRMRSVLRQLIMRPRDVAVFSDDVSEVVDEVVDDLIKR
      IVCLRSQSSDGTTICNINDLFFKYAME (1)
20741 GIAAILYECRLGCLSQKIPQETEDYIDALHLMFSSFKTTMYAGAIPKWLRPV 20911
20912 FPKPWEEFCDSWDGLFRF 20968 (1)
      SVHVDKRLKQIESQLQRGEKVTGGLLTYMLVAKEMSVEEIYANVTEMLLAGvdt (0)
23004 TSFTLSWASYLLARHPDVQQQIHAEVMRVLGSEKVATAEDVQHLPFIRGLVKETLR 23183 (2)
      LFPVLPGNGRITQDDMVLGGYFIPKG
23623 TQLALCHYSTSLDDENFPSSLEFRPDRWIRKHSSDRLDNFGSIPFGYGIRSCIGKRI 23793
23794 AELEMHLALIR (0)
23998 IIQKFHVCVSPLTTDVKAKTHGLLCPGAPINLQFIDREI* 24117

CYP27C1   Silurana tropicalis (frog)
          GenEMBL BQ392731 AL629634 AL595312 
          74% to human 27C1 not counting the divergent N-terminal
          87% to Xenopus 27C1
MAALGQLLRGSARLEGLARSFHRFPGAQAAGQALEHEQAEGVLGATVKGSPMVKN
LKEMPGPSTMANLVEFFWRDGFGRIQEIQQKHARQYGRIFKSHFGPQFVVSIADKDMVAQ
VLRAERDAPQRANMESWHEYRELRGRSTGLISAEGEKWLNMRSVLRQKILRPRDVAMYSG
GVNEVVEDLVKRIRKLRVQESDGLTVTNVNDLYFKYSMEAIATILYECRLGCLDDQIPQQ
TKEYIEALELMFSMFKTTMYAGAIPKWLRPLIPKPWREF
CRSWDGLFKFSQIHVDDRLRQIESQLEKGEEVQGGVLTHLLLSKELDLEEIYA
NMTEMLLAGVDTTSFTLSWATYLLAKNPGIQEAVYQQIVQNFGKDQVPTAEDVPKMPLVRAVVKETL
RLFPVLPGNGRVTQDDLVVGGYFIPKGTQLALCHYSTSYDAECFPAAEEFRPERWIRSGN
LERKENFGSIPFGYGIRSCIGRRVAELEMHLLLIQLLQ
NLEIKPSPQTTTVLPKTHGLLCPGGKINVRFVDRQ*

CYP27C1    Xenopus laevis (African clawed frog)
           GenEMBL BJ086834 CA982419 BQ385474 BJ069739
           87% to Silurana 27C1
MAVLGHLLKGSARLEGLARGFHQFPKIQAAGQALEQEQAEGEL
GARAKEAPMM KSLKDMPGPSTLANLVEFFWRDGFGRIHEIQQKHTRQYGRIFKSHFGPQF
VVSIADKDLVAQVIRAERDAPQRANMESWHEYRELRGRSTGLISAEGEKWLNMRSVLRQK
ILRPRDVAMYTGGVNEVIGDLVKKIHKLRAQESDGLTVTNVNDLYFKYSMEAIATVLYEC
RLGCLDDIIPQQTKEYIAALELMFSMFKTTMYAGGLPKWLRHHNFPNPWKEF
53aa gap see Silurana seq
HMTEMLLAGVDTTSFTLSWATYLLAKNPSIQESVYQQIVQNLGKDQVPTAEDVTKIPMVR
AVVKETLRLFPVLPGNGRVTQDDLVLDGYLIPKGTQLALCHYSTSYDDKCFPGAEEFKPE
RWIRSGYLERKENFGSIPFGNGIRSCIGRRVAELEIHLLLIQL 
LQNFEIKLSPETMTVLPKTHGLLCPGEKINVRFLDRQ*

CYP27C1    Gallus gallus (chicken)
           GenEMBL BU235126 BU209688
           Partial seq missing N and C-terminals
           78% to Silurana 27C1 82% to human 27C1
PGLSGSQSLEVRSGAENKAARPGELLEPSPQLGRV KSLHEMPGPNTLYNLYEFFWKDGFGRIHE
IQQKHTQEYGKIFKSHFGPQFVVSIADRDMVAQVLRSEGRAPQRANMESWQEYRDLRGR
ATGLISAEGEQWLKMRSVLRQKILKPKDVAVYSGGVNEVITDLIKRIYTLRSQEEDGETV
TNVNNLFFKYSMEGVATILYECRLGCLENNVPQQTVEYIEALELMFSMFKTTMYAGAIPR
WLRPFIPKPWREFCRSWDGLFKFSQIHVDNKLKSIQS
QLDQGEEVNGGLLTYLLVSKELTLEEIYA
NMTEMLLAGVDTTSFTLSWAIYMLAKHPEVQQRVYEEIINKLGKDQAPVNRDVPKLPLIR
AVLKETLR

CYP27C1 mouse and rat
        Mouse is missing this sequence in blast searches.  In humans it lies between 
        BIN1 and ERCC3 on chr 2.  This region does not contain 27C1 in mouse (chr 18)
        unless there is a sequence gap.  Blast of the rat genome 
        (Rat Genome Sequencing Consortium Assembly Posted date:  Dec 4, 2002)
        also had no hits, so 27C1 may have been lost in rodent evolution.
        It is present in human, Fugu, Xenopus, Silurana (another frog) and chicken.
        The absence of 27C1 may be a useful synapomorphy for comparing rodents to other mammals.

28A Subfamily

CYP28A1    Drosophila mettleri
             GenEMBL U89746 (1895bp)
             Danielson,P.B., MacIntyre,R.J. and Fogleman,J.C.
             Molecular cloning of a family of xenobiotic-inducible drosophilid
             cytochrome p450s: evidence for involvement in host-plant
             allelochemical resistance.
             Proc. Natl. Acad. Sci. U.S.A. 94, 10797-10802 (1997)
             note: 369mt, full length, one of 109 seqs submitted to Nomenclature 
committee

CYP28A2    Drosophila mettleri
             GenEMBL U89747 (1764bp)
             Danielson,P.B., MacIntyre,R.J. and Fogleman,J.C.
             Molecular cloning of a family of xenobiotic-inducible drosophilid
             cytochrome p450s: evidence for involvement in host-plant
             allelochemical resistance.
             Proc. Natl. Acad. Sci. U.S.A. 94, 10797-10802 (1997)
             note: 43mt, full length, one of 109 seqs submitted to Nomenclature 
committee

CYP28A3    Drosophila nigrospiracula (a desert dwelling Drosophilid)
             GenEMBL U91565 (451bp)
             Danielson,P.B., MacIntyre,R.J. and Fogleman,J.C.
             Molecular cloning of a family of xenobiotic-inducible drosophilid
             cytochrome p450s: evidence for involvement in host-plant
             allelochemical resistance.
             Proc. Natl. Acad. Sci. U.S.A. 94, 10797-10802 (1997)
             54 amino acids

CYP28A4      Drosophila hydei (a desert dwelling Drosophilid)
             GenEMBL U91566 (366bp)
             Danielson,P.B., MacIntyre,R.J. and Fogleman,J.C.
             Molecular cloning of a family of xenobiotic-inducible drosophilid
             cytochrome p450s: evidence for involvement in host-plant
             allelochemical resistance.
             Proc. Natl. Acad. Sci. U.S.A. 94, 10797-10802 (1997)
             54 amino acids

Cyp28a5     Drosophila melanogaster
            GenEMBL AC018242 6065-8082 also AC001660

CYP28B1     Musca domestica (housefly)
            no accession number 
            Nannan Liu
            44% identical to 28A1 over 498 amino acids
            submitted to nomenclature committee 7/29/99

Cyp28c1     Drosophila melanogaster
            GenEMBL AC014191 comp(8254-10002) also AL133495

Cyp28d1     Drosophila melanogaster
            GenEMBL AC017780 comp(28385-30262) also AC009355 AC008324 AC008327

Cyp28d2     Drosophila melanogaster
            GenEMBL AC017780 comp(31545-33456) AC008324

29A Subfamily

CYP29A1     Caenorhabditis elegans (nematode worm)
            GenEMBL Z69787 and AL022276 (Y102F5)
            C44C10.2 (with modifications)
            note: GeneFinder translation is incorrect in some regions.
            First exon probably ends at 11181 coding for 35 amino acids as does 
the first exon 
            of CYP29A2.  
            There is probably a sequence error in the stop codon at 11155-11157.
            2nd exon probably runs from 11227-11293.
            11400-11518 is the third exon.
            12465 to 12979 should be one exon.
            There is probably a sequence error in the stop codon at 12879-12881.
            some segments around I-helix missed by genefinder.  N-terminal of 
            genefinder translation is probably a different gene. Matches 
F23A7.3, but this gene 
            does not have a P450 sequence downstream.

CYP29A2      C. elegans
          GenEMBL Z74043
          T19B10.1
          nearly the whole gene of wEST00713, CEMSH91R, missing 5' end

CYP29A2      C. elegans
          GenEMBL Z74040
          T19B10.1 
          5' end of gene from Z74043

CYP29A3     C. elegans
          no accession number
          Y108G3 contig 541

CYP29A4     C. elegans
          GenEMBL Z99102
          B0331.1

30A Subfamily

CYP30              Mercenaria mercenaria (northern quahog, a clam)
           GenEMBL AF014795 (1628bp)
           Brown, D., Clark,G.C. and Van Beneden,R.J.
           A novel cytochrome P450 from the clam, Mercenaria mercenaria
           Unpublished

31A Subfamily

CYP31A1P    C. elegans
          GenEMBL Z68213
          C01F6.3
          missing 25 amino acids at C-helix plus in frame stop codon at ERK*

CYP31A2    C. elegans
          GenEMBL Z68336 and Z92789
          F22B3 and H02I12
          This gene is definitely different than CYP31A3, there are 8 amino 
acids differences 
           and the introns are divergent.

CYP31A3    C. elegans
          no accession number
          T16C6 this sequence revised according to Y17G9.contig 61
          cosmid T16C6 lies 3' of E04A4 and 5' of R11E3.  Y17G9 contig 61 covers 
this     
          region.  E04A4 ends at 29981 of Y17G9 contig 61 R11E3 starts at 49037 
of Y17G9 
          contig 61 this sequence lies between 41545-45037

CYP31A4P     C. elegans
          no accession number
          Pseudogene related to CYP31A sequences 44849-45028 of Y17G9.contig61 
          C-TERMINAL exon  fragment,  This sequence is inside the last intron of 
CYP31A3

32A Subfamily

CYP32    C. elegans
          GenEMBL U53148
          C26F1.2

33A Subfamily

CYP33A1     C. elegans
          GenEMBL U55365
          C12D5.7

33B Subfamily

CYP33B1      C. elegans
          GenEMBL U50311
          C25E10.2

33C Subfamily

CYP33C1      Caenorhabditis elegans (nematode worm)
           GenEMBL AF039053
           C45H4.a near 35k

CYP33C2      Caenorhabditis elegans (nematode worm)
           GenEMBL AF039053
           C45H4.b near 38k
           also on Y10C10 contig 60.  Probable end of 33C2 gene is on contig 57, 
           but the intron ends of these two contigs do not overlap yet. 

CYP33C3      Caenorhabditis elegans (nematode worm)
           GenEMBL AF016676
           F41B5.b        5k to 7k

CYP33C3     Caenorhabditis elegans (nematode worm)
          GenEMBL D35162
          EST 5 prime read with heme signature
          3 prime fragment of this clone yk18a10 = D32508 (not in coding region)
          this is a portion of clone YK824

CYP33C4      Caenorhabditis elegans (nematode worm)
           GenEMBL AF016438
           F44C8 whole gene of EST CEL10E1

CYP33C4      Caenorhabditis elegans 
            GenEMBL M88882 (501bp)
            EST CEL10E1 
            2nd frame contains PPGP and KKYG from N-terminal region of
            many P450s.

CYP33C5      Caenorhabditis elegans (nematode worm)
           GenEMBL AF016676
           F41B5.c

CYP33C6      Caenorhabditis elegans (nematode worm)
           GenEMBL AF016676
           F41B5.e

CYP33C7      Caenorhabditis elegans (nematode worm)
           GenEMBL AF016676
           F41B5.d

CYP33C8      Caenorhabditis elegans (nematode worm)
           GenEMBL AF003385
           R08F11

CYP33C9      Caenorhabditis elegans (nematode worm)
           GenEMBL AF016449
           C50H11

CYP33C10P      Caenorhabditis elegans (nematode worm)
           GenEMBL AF016676
           F41B5.a 
           contains first two exons split by 3700bp and no C-terminal sequence.

CYP33C11      Caenorhabditis elegans (nematode worm)
           no accession number 
           Y49C4 contig 103

33D Subfamily

CYP33D1     Caenorhabditis elegans (nematode worm)
            GenEMBL Z92804
            K05D4.4 (previously found on F10A3 and F11A5 at early stages of 
sequencing)

CYP33D2X     Caenorhabditis elegans (nematode worm)
            GenEMBL Z92830
            F11A5
            this sequence is really the same gene as CYP33D1

CYP33D3     Caenorhabditis elegans (nematode worm)
            GenEMBL Z81487 (C54E10), AL021470 (Y17D7A.4) and Z98877 (Y69H2)
            GenEMBL AL020988 (Y80D3)

33E Subfamily

CYP33E1     Caenorhabditis elegans (nematode worm)
            GenEMBL U61945
            C49C8.4

CYP33E2     Caenorhabditis elegans (nematode worm)
            GenEMBL U61952
            F42A9.5

CYP33E3P     Caenorhabditis elegans (nematode worm)
            GenEMBL U61952
            F42A9.4
            missing C-terminal    3' neighbor is C49C8 (with CYP33E1)

34A Subfamily

CYP34A1     Caenorhabditis elegans (nematode worm)
            GenEMBL Z81119 and AL022301 (Y75B12)
            T10H4.10
            There are two P450s listed in this clone as one gene
            This is the first one

CYP34A2     Caenorhabditis elegans (nematode worm)
            GenEMBL Z81119 and AL022301 (Y75B12)
            T10H4.11
            There are two P450s listed in this clone as one gene
            This is the second one

CYP34A3     Caenorhabditis elegans (nematode worm)
            GenEMBL  Z81047 and AL022301 (Y75B12)
            C41G6.1

CYP34A4     Caenorhabditis elegans (nematode worm)
            GenEMBL AF068712
            T09H2.1

CYP34A5     Caenorhabditis elegans (nematode worm)
            GenEMBL AF039050
            B0213.10

CYP34A6     Caenorhabditis elegans (nematode worm)
            GenEMBL AF039050
            B0213.11

CYP34A7     Caenorhabditis elegans (nematode worm)
            GenEMBL AF039050
            B0213.12

CYP34A8     Caenorhabditis elegans (nematode worm)
            GenEMBL AF039050
            B0213.14

CYP34A9     Caenorhabditis elegans (nematode worm)
            GenEMBL AF039050
            B0213.15

CYP34A10     Caenorhabditis elegans (nematode worm)
            GenEMBL AF039050
            B0213.16

35A Subfamily

CYP35A1     Caenorhabditis elegans (nematode worm)
            GenEMBL U97008
            C03G6 near 19k

CYP35A2     Caenorhabditis elegans (nematode worm)
            GenEMBL U97008
            C03G6 near 22k

CYP35A3     Caenorhabditis elegans (nematode worm)
            no accession number
            K09D9

CYP35A4     Caenorhabditis elegans (nematode worm)
            GenEMBL AF016418
            C49G7.8

CYP35A5     Caenorhabditis elegans (nematode worm)
            GenEMBL AF039049
            K07C6.5 [and part of F40C5 contig 6    9-578 plus strand Note: 
newest version of   
            F40C5 does not contain this P450. probably an error corrected by the 
sequencers]
            NOTE: K07C6 IS NOW IN GENBANK AND IT CONTAINS 4 P450S
            It is part of a contig with T09H2 and B0213 that has 13 P450s in a 
cluster
            6 P450s on B0213 appear to have inserted themselves into a cluster 
of
            three olfactory receptor-like genes.  K09D9 is the next clone 3' and 
it has at least 
            one more P450.  C49G7 is next with one more P450 
            Contig order K07C6 T09H2 B0213 K09D9 C49G7

35B Subfamily

CYP35B1     Caenorhabditis elegans (nematode worm)
            GenEMBL AF039049
            K07C6.4, [F40C5 contigs 5 and 14 Note: newest version of F40C5 does 
not 
            contain this P450. probably an error corrected by the sequencers]

CYP35B2     Caenorhabditis elegans (nematode worm)
            GenEMBL AF039049
            K07C6.3 [and F40C5 contig 15 Note: newest version of F40C5 does not 
contain    
            this P450. probably an error corrected by the sequencers]

CYP35B3     Caenorhabditis elegans (nematode worm)
            GenEMBL AF039049
            K07C6.2

35C Subfamily

CYP35C1     Caenorhabditis elegans (nematode worm)
            GenEMBL Z77652
            C06B3.3

35D Subfamily

CYP35D1     Caenorhabditis elegans (nematode worm)
            GenEMBL Z83105
            F14H3 near 20k

CYP35D2P     Caenorhabditis elegans (nematode worm)
            GenEMBL Z83105
            F14H3 near 14k
            only N-terminal is present

36A Subfamily

CYP36A1     Caenorhabditis elegans (nematode worm)
            GenEMBL Z83220
            C34B7.3

37A Subfamily

CYP37A1     Caenorhabditis elegans (nematode worm)
            GenEMBL Z81493 (F01D5) and Z92851 (Y39G8)

37B Subfamily

CYP37B1     Caenorhabditis elegans (nematode worm)
            GenEMBL Z93381 and Z93389
            F28G4 and T13F3

CYP38A1      Suberite domuncula (sponge)
             GenEMBL Y17816 (1789bp)
             Mueller,W.E.G., Wiens,M., Batel,R., Steffen,R., Borojevic,R. and
             Custodio,M.R.
             Establishment of a primary cell culture from a sponge: Primmorphs
             from Suberites domuncula
             Mar. Ecol. Prog. Ser. In press
             most similar to the CYP4 family
MLDFVIFAITAVAGLIGILLFFYFSRSTETKPVSSASPTSTIPR
WSAPPADIEKGDLDVMMKKHGSLHQFLLHLHDNGKTPVTSFWWGKTHVVSFCSPQAFK
ESAVFVNRPVELFVGFEPLITPFSIQYANDEDWVQRSKCLYHTLKGDDLKSYFHHFVQ
IAQEEESLWSSYTSDKEVSLTKEVFPMTIKGIARTCFGDIFKDENELSKMAESYHVCW
RTMEEGVPEAGSKRETEFLKHRRVLEDIIRRIIQERKEGEDLQELPFIDSMLQNYDSE
DKIIADAISFMVGGFHTSGYMFTWMLWYLSSHPESQDRLRTEIERETGGERGDRLKEY
SLRADTFLRQVQDETIRLSTLAPWAARYSDKKVTVCGYTIPAKTPMIHALGVGLKNKT
VWENTDSWDPDRFSPNGRRGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRFEIV
PVEGQTVIQVHGLVTEPKDDIKIYIRSRKED"

CYP39    human
            GenEMBL EST R07010 R11279 and UNIGENE entry Hs.25121
            covers the C-terminal part of a P450.  The 2 ESTs with coding 
regions 
            are not found in UNIGENE, but the opposite end of EST R11279 = 
R11221 and it 
            is in UNIGENE with 13 EST sequences all from the 3 prime noncoding 
region. 
            This sequence is most like CYP4A11, but the percent identity is only 
            39%.  Since this is the most conserved region of P450s, the sequence 
must be in a 
            new family.  More sequence is known form the mouse homolog.
ENLLLIKWCVLETIRLKAPGVITRKVVKPVEILNYIIPSGDLLMLSPFWLHRNPKYFPEPELFKPERWEKGKFRRKHSFL
GTASWA
FGAGSSQCPGKVFALLEVQVC

CYP39A1    human
           AC008104 AL035670 note heme region exon corrected 1/18/02
MELISPTVIIILGCLALFLLLQRKNLRRPPCIKGWIPWIGVGFEFGKAPLEFIEKARIK
YGPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVYRT
ASIPKNVFLALHEKLYIMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLVR
HLLYPVTVNMLFNKSLFSTNKKKIKEFHQYFQVYDEDFEYGSQLPECLLR 
NWSKSKKWFLELFEKNIPDIKACKSAKDNSM 
TLLQATLDIVETETSKENSPNYGLLLLWASLSNAVP
VAFWTLAYVLSHPDIHKAIMEGISSVFGKAG
KDKIKVSEDDLENLLLIKWCVLETIRLKAPGVITRKVVKPVEIL
NYIIPSGDLLMLSPFWLHRNPKYFPEPELFKPERW
KKANLEKHSFLDCFMAFGSGKFQCPARW
FALLEVQMCIILILYKYDCSLLDPLPKQ
SYLHLVGVPQPEGQCRIEYKQRI

CYP39     mouse 
          GenEMBL ESTs AA096922, AA606237
          Note: this sequence has a poorly conserved I Helix and may not require 
molecular 
          oxygen.  The substrate may be a peroxide or some substance with its 
own oxygen.
consensus
QFKTYDEGFEYGSQLPEWLLRNWSKSKRWLLALFEKNIGNIKAHGSAGHS
GTLLQAILEVVETETRQYSPNYGLVVLWAALANAPPIAFWTLGYILSHPDIHRTVL
ESISSVFGTAGKDKIKVSEDDLKKLLIIKWCILESVRLRAPGVITRKVVK
PVKILNHTVPSGDLLMLSPFWLHRNPKYFPEPESFKPERWKEANLDKYIF
LDYFMAFGGRKFQCPGKWFALLEIQLCIILVLYKYECSLLDPL

40A Subfamily

CYP40X       rat, mouse, human  (name changed to CYP27B1)

Note: This family was created based on the rat sequence AF000139 submitted to 
the Nomenclature Committee in May 1997.  This was the only sequence available at 
the time and it was about 38% identical to rat CYP27.  The sequence was 
considered a new family and named CYP40.  Since then, there have been additional 
sequences determined for the rat mouse and human homologs.  The rat sequence 
AB001992 differs from the AF000139 sequence at 34 amino acids, mostly in two 
regions that are probably frameshifts.  The sequence AB001992 is 42.3% identical 
to the rat CYP27A sequence. The mouse and human sequences are also more than 40% 
identical to the CYP27A sequence.  Therefore, they belong in the CYP27 family as 
a new subfamily CYP27B1.  Since there only appears to be one gene in each 
species, all three species orthologs will be called CYP27B1.  CYP40 has been 
retired and this is indicated by the X in CYP40X.

41A Subfamily

CYP41      Boophilus microplus (southern cattle tick)
            GenEMBL U92732 (414bp)
            Crampton,A.L., Miller,C., Baxter,G.D. and Barker,S.C.
            Expressed sequenced tags and new genes from the cattle tick,
            Boophilus microplus.
            Exp. Appl. Acarol. 22 (3), 177-186 (1998)
            EST is from the middle of the P450 
            whole sequence known but still confidential.

42A Subfamily

CYP42        C. elegans
            GenEMBL AL020988 (Y80D3), M89401 (EST cm08B12)

43A Subfamily

CYP43        C. elegans
           GenEMBL AF026203
           E03E2.1

44A Subfamily

CYP44        C. elegans
            GenEMBL U21321
            CELZK177 
            only mitochodrial-like P450 in C. elegans (missing part of I-helix 
may be 
            pseudogene)

45A Subfamily

CYP45     Homarus americanus (American lobster)
            GenEMBL AF065892 (1581bp)
            Snyder,M.J.
            Identification of a new cytochrome P450 family, CYP45, from the
            lobster, Homarus americanus, and expression following hormone and
            xenobiotic exposures
            Arch. Biochem. Biophys. (1998) In press

46A Subfamily

CYP46        human
           GenEMBL NM_006668
           Lund EG, Guileyardo JM and Russell DW.
           cDNA cloning of cholesterol 24-hydroxylase, a mediator of
           cholesterol homeostasis in the brain.
           Proc. Natl. Acad. Sci. U.S.A. 96, 7238-7243 (1999)
           32% identity with Drosophila 4D2
           ESTs H06539, H51951, R36281
           mouse homolog EST AA096922
MSPGLLLLGSAVLLAFGLCCTFVHRARSRYEHIPGPPRPS 
FLLGHLPCFWKKDEVGGRVLQDVFLDW 
AKKYGPVVRVNVFHKTSVIVTSPESVK 
KFLMSTKYNKDSKMYRALQTVFGER 
LFGQGLVSECNYERWHKQRRVIDLAFSRSSLVSLMETFNEKAEQLVEILEAKADGQTPVSMQDMLTYTAMDILAK 
AAFGMETSMLLGAQKPLSQAVKLMLEGITASRNTLAK 
FLPGKRKQLREVRESIRFLRQVGRDWVQRRREALKRGEEVPADILTQILK 
AEEGAQDDEGLLDNFVTFFIA 
GHETSANHLAFTVMELSRQPEIVAR 
LQAEVDEVIGSKRYLDFEDLGRLQYLSQ 
VLKESLRLYPPAWGTFRLLEEETLIDGVRVPGNTPLL 
FSTYVMGRMDTYFEDPLTFNPDRFGPGAPK 
PRFTYFPFSLGHRSCIGQQFAQ 
MEVKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPAPPPPPC

CYP46       Fugu rubripes (pufferfish)
            No accession number
            Scaffold_4537
            60% to 46 human
MGVFNLIFGWISQASIFLLLLLFIALLGYCMYIKYTHMKYDHIPGPPRDS (2) 
FFSGHSSKLLDIMKDDGVVHDMFLKW (2) 
AETYGPVYKIYFLHHVMVFVSCPETTK (0)
EMLMSPKYTKDKFLHNRIGSLFGQR (2)
FLGNGLVTVRDHEKWYKQRRIMDPAFSSL (2)
YLRSLMGNFNETADKLMDKLSEIADNKTTANMLHLVNCVTMEV