|
P450 Families |
Named P450 Sequences |
Full Length Sequences |
Sequences in Genomic DNA |
Percent of genome completed |
Predicted P450s |
|
45 |
264 |
240 |
264 |
92.3% |
286 |
AL353865 CYP71B38 chromosome 3, BAC clone T10D17 79% to 71B5 29822-28106
AL163817 CYP724A1 62148-65132 translation given in Genbank is not correct
chromosome 5, BAC clone F18O22 gene = F18O22_190 in 85 clan
AC025417 CYP87A2 now complete 82132-84239
AC026480 CYP96A14P 9579-10061 Pseudogene fragment chr I clone F15E12
no more downstream seq in 80000 bp GSSs: B20346, B22314
9579 MALIGLIEAFIAFVCFLIFYYFLIKKPYSYILIKISQSGLWNWPVLGMSPGALMRLP
RIYDFSVDLLESNLTFHFKGPWFAGIDILATADSVNINHIYYRGPELREIFGPFGDGIINSD
SELWRNLKKATQVIFNHQKYQKFSTSTTRSKLKLGLVPLFN 10061
AL353995 CYP81K1 chr 5 clone F12B17 comp(9841-11343) EST AI997093
AL353995 CYP81K2 chr 5 clone F12B17 comp(11529-11633,11733-11837,12108-13583)
AF241643 CYP75B1 exon 1
AF241644 CYP75B1 exon 2
AF241645 CYP75B1 exon 3
AF241646 CYP75B1 exon 4
AF241647 CYP75B1 exon 1 mutant
AC025814 CYP721A1 64213-66048
AL163972 CYP94D2 89747-91303 chromosome 3 clone T5P19 88% to 94D1
AL162973 CYP96A13 69536-71045 70% to 96A12 my translation does not agree
With the translation of the Genbank entry. Small intron vs frameshift?
AL162972 CYP77A4 NOW COMPLETE 25962-27500 clone T1E3 also on AL162875 T32M21
AL162875 CYP77A9 87274-88803 T32M21_230 no GSSs no ESTs
AC061957 CYP86A4 comp(37593-39264) chromosome 1 clone F22L4
AC051629 CYP72C1 chromosome I clone F6I1 partial seq
AC027656 CYP78A5 comp(32530-34190) EST T45256 formerly confidential
AP001298 CYP71B33 comp(479-2126)
AP001298 CYP71B34 comp(2977-4644)
AP001298 CYP71B35 comp(4867-6557)
AP001298 CYP71B36 comp(8161-9842)
AP001298 CYP71B37 comp(10654-12314)
AP001304 CYP705A28 13048-14798 same as AL091537 T7 end of BAC T8C3 = B77649
AP001304 CYP705A29P 15786-16553
AP001304 CYP705A30 15786-16553 part of old 705A7X
AP001304 CYP705A31P 20651-20824
AP001304 CYP705A32 21561-23630 Identical to B74035 part of old 705A7X
AP001304 CYP705A33 24249-25900 2 amino acid diffs with B96118
AP001307 CYP90D1 comp(43183-45689) 51% to 90C ESTs = AI994541, R30379, R30380
AP001314 CYP702A8 comp(11921-14547)
AC025417 CYP87A2 67020-68647 75% to 87A1 partial seq
AC024226 CYP86D2P comp(6890-7005) numerous deletions and frameshifts
GSSs AL080732, AQ010792
AC024609 74483-77023, AC025808 CYP722A1 35% identical to CYP90A1
AC010795 CYP87A3P pseudogene fragment
AC023628 CYP703A2 comp(16084-17699) 75% to 703A1 EST AA720028
This sequence family was not known in Arabidopsis before Feb 16, 2000.
703A1 is from petunia
AC023628 CYP78A8 94000-96000 region
AC023279 comp(55362-56858) CYP94D1 F12K21 chr 1 formerly CYP86D1 March 6, 2000
The TIGR Arabidopsis site has sequenced several thousand reads from a second
Arabidopsis variety called Landsberg erecta looking for polymorphisms. A search of
all the partial arabidopsis P450 sequences in my collection (ESTs, GSSs and
incomplete mRNAs and genomic clones) found only one extension,
a C-terminal sequence was identified for CYP707A3 which had previuosly
been missing the C-terminal. The sequence was present in the AB020744 genomic
clone, but there were some errors that prevented recognizing it as the true end of
this gene. I have now searched exhaustively for new P450s in this data. The
entire set of 1944 contigs was downloaded and searched by TBLASTN using the
do-it-yourself BLAST server at
http://www.proweb.org/proweb-bin/WU-blastall.pl
This search was done using nine different P450s from the nine clans of Arabidopsis
P450s CYP51A2, 71B9, 72A8, 74A, 85A1, 96A1, 97C1, 710A1, and 711A1. These are
usually sufficient to find all known Arabidopsis P450s in a database. 11 contigs
were shown to contain P450s.
Contig # best P450 match % identity
281 707A3 C-terminal 360-470 this sequence used for correct end of 707A3
391 706A4 100% C-terminal 462-516
404 82C2 100% C-terminal 356-508
505 79C1 100% N-terminal 8-74
610 71B7 100% C-terminal 244-480 with intron
885 71B3 100% middle 252-342 with intron
1241 79C1 100% C-terminal 423-545
1510 79B2 99% C-terminal 343-541 one difference
1531 705A3 95% N-terminal 1-91 4 differences
1538 98A3 100% N-terminal 1-161
1783 51A1 99% N-terminal 21-142 one difference
On Jan. 28, 2000 TIGR deposited 14905 entries from the Landberg erecta data
These GSS accession numbers start with AQ. At least 48 of these sequences are P450s
And they match 25 different Arabidopsis P450s and two match a Deinococcus radiodurans
bacterial P450 at 44% identity. This clone may be a contamination of the genomic
library.
AQ961078 LERFJ05TR LERA Arabidopsis thaliana genomic clone LERFJ05
96% to 51A1
AQ961077 LERFJ05TF LERA Arabidopsis thaliana genomic clone LERFJ05
99% to 51A1
AQ966708 LERIL68TFB LERG Arabidopsis thaliana genomic clone LERIL68
100% to 71A19
AQ968500 LERJC87TF LERG Arabidopsis thaliana genomic clone LERJC87
97% to 71A24
AQ968501 LERJC87TR LERG Arabidopsis thaliana genomic clone LERJC87
96% to 71A24
AQ968537 LERJD18TR LERG Arabidopsis thaliana genomic clone LERJD18
97% identical to 71A26 N-terminal
AQ957646 LERAQ88TF LERA Arabidopsis thaliana genomic clone LERAQ88
99% to 71B3
AQ957647 LERAQ88TR LERA Arabidopsis thaliana genomic clone LERAQ88
100% to 71B3
AQ963043 LERGJ13TR LERG Arabidopsis thaliana genomic clone LERGJ13
98% to 71B4
AQ956837 LERAM36TF LERA Arabidopsis thaliana genomic clone LERAM36
100% to 71B7 0ver 225 amino acids
AQ956838 LERAM36TR LERA Arabidopsis thaliana genomic clone LERAM36
100% to 71B7
AQ968000 LERIW96TR LERG Arabidopsis thaliana genomic clone LERIW96
100% to 71B23
AQ967999 LERIW96TF LERG Arabidopsis thaliana genomic clone LERIW96
100% to 71B23
AQ962171 LERGD74TF LERG Arabidopsis thaliana genomic clone LERGD74
93% to 76C3 N-terminal
AQ962172 LERGD74TR LERG Arabidopsis thaliana genomic clone LERGD74
92% to 76C3
AQ968833 LERJF09TR LERG Arabidopsis thaliana genomic clone LERJF09
100% to 78A8
AQ968883 LERJF37TF LERG Arabidopsis thaliana genomic clone LERJF37
100% to 78A8
AQ968884 LERJF37TR LERG Arabidopsis thaliana genomic clone LERJF37
100% to 78A8
AQ960047 LERFB09TF LERA Arabidopsis thaliana genomic clone LERFB09
100% to 79B2
AQ960048 LERFB09TR LERA Arabidopsis thaliana genomic clone LERFB09
98% to 79B2
AQ956517 LERAK44TR LERA Arabidopsis thaliana genomic clone LERAK44
100% to 79C1
AQ956516 LERAK44TF LERA Arabidopsis thaliana genomic clone LERAK44
98% to 79C1
AQ958944 LEREG05TR LERE Arabidopsis thaliana genomic clone LEREG05
100% to 79C1
AQ958943 LEREG05TF LERE Arabidopsis thaliana genomic clone LEREG05
98% to 79C1
AQ969559 LERJO25TR LERG Arabidopsis thaliana genomic clone LERJO25
99% to 81D2
AQ969558 LERJO25TF LERG Arabidopsis thaliana genomic clone LERJO25
97% to 81D2
AQ959582 LEREL64TR LERE Arabidopsis thaliana genomic clone LEREL64
1 difference with 81G1 96%
AQ956256 LERAI92TF LERA Arabidopsis thaliana genomic clone LERAI92
99% to 82C2
AQ956257 LERAI92TR LERA Arabidopsis thaliana genomic clone LERAI92
100% to 82C2
AQ968189 LERIW66TF LERG Arabidopsis thaliana genomic clone LERIW66
100% to 83A2
AQ968190 LERIW66TR LERG Arabidopsis thaliana genomic clone LERIW66
97% to 83A2
AQ964979 LERGZ64TR LERG Arabidopsis thaliana genomic clone LERGZ64
100% to 96A11
AQ964978 LERGZ64TF LERG Arabidopsis thaliana genomic clone LERGZ64
100% to 96A11
AQ960246 LERFB51TF LERA Arabidopsis thaliana genomic clone LERFB51
100% to 98A3 N-terminal
AQ960247 LERFB51TR LERA Arabidopsis thaliana genomic clone LERFB51
100% to 98A3
AQ960228 LERFB40TR LERA Arabidopsis thaliana genomic clone LERFB40
99% to 705A3
AQ960227 LERFB40TF LERA Arabidopsis thaliana genomic clone LERFB40
95% to 705A3
AQ965033 LERGZ92TR LERG Arabidopsis thaliana genomic clone LERGZ92
100% to 705A4
AQ965032 LERGZ92TF LERG Arabidopsis thaliana genomic clone LERGZ92
100% to 705A4
AQ958087 LERAU82TR LERA Arabidopsis thaliana genomic clone LERAU82
95% to 705A22
AQ958546 LERAY24TR LERA Arabidopsis thaliana genomic clone LERAY24
100% to 705A22
AQ958987 LEREG36TR LERE Arabidopsis thaliana genomic clone LEREG36
100% to 706A4
AQ958986 LEREG36TF LERE Arabidopsis thaliana genomic clone LEREG36
100% to 706A4
AQ958546 LERAY24TR LERA Arabidopsis thaliana genomic clone LERAY24
98% to 706A6
AQ962106 LERGD38TR LERG Arabidopsis thaliana genomic clone LERGD38
100% to 707A1
AQ956035 LERAH08TF LERA Arabidopsis thaliana genomic clone LERAH08
98% to 707A3
AQ956036 LERAH08TR LERA Arabidopsis thaliana genomic clone LERAH08
98% to 707A3
AQ969679 LERJP26TR LERG Arabidopsis thaliana genomic clone LERJP26
44% to a Deinococcus radiodurans P450 sequence
this may be a bacterial contamination of the genomic library
it does not match any other plant sequence at better than 30%
AQ969679 DLIVSLTGPYVAGLDTVANTTAAITYTVLKHPDVLKRVHDEVDALFADGTIGEEDLLKRI 61
D+ ++ G ++AG+DT AN+ A + Y + H + L + E DALF DG E L R
AE002014 1317 DVRLAALGAFIAGMDTAANSLAFVLYRMHLHSEFLPALRAEADALFRDGPPTAE-ALGRS 1141
Query: 62 PSLQGAIMETMRLYPIAVAQMRSATKDFVFEGHQIYEGEMIYVGTSVPHFMHEYWPEPEN 121
P L +MET+R++PIA A R+ T+D F GH+I G + +GT+VPH + E +P+PE+
Sbjct: 1140 PLLHRFVMETLRVHPIAPALSRTLTEDVEFAGHRIPAGTPVIIGTTVPHGLPELFPDPEH 961
Query: 122 FDIDRYAKPRAEHMQPGVYSPYGRGPHTCLGKSLAEIQIALSMARLFYKLDLELESPNYK 181
FD R+A RAEH QPG Y+P+G G HTC G +AE I L A LDL LE P+Y
Sbjct: 960 FDPGRFAPGRAEHRQPGAYAPFGVGSHTCAGSGMAEGLIMLGAAAALRTLDLSLE-PDYV 784
Query: 182 LKTKTAPTP 190
L+ PTP
Sbjct: 783 LRQTAKPTP 757
AQ969672 LERJP23TF LERG Arabidopsis thaliana genomic clone LERJP23
Same seq as AQ969679
The EU Chromosme 3 sequencing project has been sequencing the bottom arm of
Chromosome 3. Many clones are available on their web site some have
been deposited in Genbank, others have not. See
http://www.genoscope.cns.fr/externe/English/Projets/Projet_S/S.html
AB036059 CYP78A9 also found on contig 31 of clone F21F14 from the EU chr 3 project
8955-10728 numbering will change as contigs are joined (formerly confidential)
clone T27I15 contig 5 from European Chr. 3 project contains CYP76C7 and CYP76C8P
AC010926 CYP87A3P chromosome I clone IGF-F28P22
This appears to be a pseudogene remnant with two small exons and a frame shift
64524 NFMAFGDGMRLCIGADFS
64585 KLQTAVFLHSSVTKYRWEETSGGNILRTPGLQ 64764
64765 FPNGYHVKLKKKEI* 64809
AL133421 AB005249 CYP75B1 chromosome 5, BAC clone F13G24.190
join(67348..67797,68521..68961,69040..69210,69291..69770)
AC016662 CYP78A10 chromosome I clone IGF-F2P9
64% to 78A2 comp(102654-104343)
MYLSFASRSGSSPFPSLELCLSIFLFISLFVFWLTPGGFAWALYKARFHTRPESKTGPAI
PGPSGLPIFGLLLAFVNNALTHRILANIADTCKAKALMAFSVGSTRFVITSEPETAKELL
NSSAFADRPVKESAYELLFDRAMGFAPFGDYWRELRRISSTHLFSPKRIFSSGESRRKIG
QNMVGEIKNAMECYGEVHIKKILHFGSLNNVMSSVFGKTYNFNEGIVYSKESNELEHLVS
EGYELLGIFNWSDHFPGMRWLDLQGVRRRCRSLVGRVNVFVGKIINDHKSKRSLRDNPEE
STYDDDFVDVLLGMHGNSKLSDSDMIAVLWEMIFRGTDTVAILLEWILARMVLHPDIQAK
AQAEIDCIVGDSGRQVTDSDLPKLPYVRAIVKETLRMHPPGPLLSWARLSIHDTQIGTHF
IPAGTTAMVNMWAITHDEKVWPEAHEYKPERFLGAQESNNFPIMGSDLRLAPFGAGRRVC
PGKSMGLATVELWLAQLLGSYKWVSCGEVDLSETLKLSLEMKNTLVCKAIPRG*
AL132979 contains (Chr III T3A5) 90B1
AC013428 contains (Chr III F18K10) 77A6, 77A7
AF184833 contains an inhibitor tagged site of 81D5
AC013288 14837-27604 contains CYP81D10 74% to 81D2 chromosome I clone T12I7
AC013258 63642-65480 CYP721 new family in the 72 clan most like CYP72B
AL132969 clone F8J2 from the EU Chr 3 project comp(64780-66639)
contains a new CYP76G1 seq.
AL133315 clone T8P19 comp(10683-12101) from the EU Chr 3 project
contains a new CYP94B3 seq.
AL132958 clone T04D02 from the EU Chr 3 project contains 71B5 and 3 new 71Bs (71B30P,
71B31, 71B32) and 97C1 (formerly a confidential sequence) may be AL132958?
71B30P, 71B31 and 71B32 are also found on AL132966
AL132966 clone F4P12 from the EU Chr 3 project overlaps T04D02 and contains two of the
three new 71Bs and part of the third
AC012561 contains CYP705A25, CYP705A26P and CYP705A27
AC012396 contains CYP720A1 a new family related to CYP90 in the 85 clan
AC011622 contains CYP86A7 formerly only known as a partial EST sequence
AC011765 contains CYP98A8 and CYP98A9
AC011624 contains CYP94B2 a duplicate of AB010797 and AC011664
AC011664 contains CYP94B2 a duplicate of AB010797 and AC011624
AF195115 contains CYP86A2
AP000601 provides the missing 38 internal amino acids for CYP86C2
AB010077 contains CYP77A8P pseudogene
51066-51110 (26-40), 56196-56252 (167-185), 56264-56377 (195-232), 75123-75146 (233-
240)
AC011560 contains CYP77A6 and CYP77A7
AB010797 AC011624 and AC011664 contain CYP94B2
AC010164 contains CYP76C5 and CYP76C6
AC010155 contains CYP705A24
Deposit of almost 8000 Arabidopsis ESTs identifies 9 new P450 sequences
and 11 ESTs match named P450s that previously did not have an EST in the
database. The new sequences are not named but fall in the following families
and subfamilies
71B, 72A(2 sequences), 81, 89A, 90, 94B, 96A, 97A
CYP78A9 is submitted (confidential)
AC009895 CYP89A9 (9/8/99) has 6 ESTs
AP000414 CYP77A5P (9/8/99)
AP000419 clone MVI11 has CYP707A4 (9/7/99)
10 new CYP705A sequences have been found from Kazusa clone MAL21 AP000383
8/3/99.
There has been no significant progress on Arabidopsis P450s since the July 2nd
deposit of AL080318 from the EU Arabidopsis sequencing project. The AGI
sequencing totals have not budged in several weeks. (A search done 7/29 found
AC008051 that has a new P450 CYP79C2 and three pseudogene fragments related to
it.)
CYP706A4, CYP706A5, CYP706A6 and CYP706A7 are on AL080318 and CYP83A2 is
on AL080283. 45 other GSS fragments with P450s have been deposited. Seven of
these are new fragments that do not match known sequences exactly. These are in
the 71B(2 seqs), 705A(4 seqs) and 98(1 seq) families or subfamilies.
96A9, 96A10, 96A11 and 96A12 are on AL078620.
Note: only three new Arabidopsis P450s sequences have been found since May 9
(96A7, 96A8, 72C1). There has been a slowdown in the discovery rate for these
genes. I noticed that the Kazusa site has not made a new deposit of chromosome
3 and 5 sequence data since May 7, and their webpage has not been updated since
May 7. I expect a large deposit of a months worth of sequence data will
probably appear soon, hopefully with several P450s. Examining the Kazusa FTP
site, There are 17 clones (1Mb) marked as being sequenced on the May 7 web page
that are now available by FTP (June 7) but they are not in Genbank yet. My
experience is that there are P450s about every 830 kb, so there ought to be one
clone in this set with P450 genes. A preliminary BLAST search of these 17
clones with a CYP71B sequence did not detect any P450s.
45 families of P450s have been named in Arabidopsis. (Note 713A has been merged with 71A, after addition of many new 71A sequences). 264 sequences have been assigned names. 240 of these sequences are full length (or missing only a short N-terminal or C-terminal piece). 24 sequences have been named based on incomplete sequences. 22 of these are pseudogenes that contain lone exons or incomplete sequences that are missing exons and these will never be completed. CYP71B30P, CYP72A12P, CYP76C8P, CYP77A5P, CYP77A8P, CYP79A4P, CYP79B4P, CYP79C3P, CYP79C4P, CYP79C5P, CYP86D2P, CYP87A3P, 96A14P, CYP702A4P, CYP702A7P, 705A10P, 705A11P, 705A14P, 705A17P, 705A26P, 705A29P and 705A31P. Two 71A17P and 96A6P are ordinary pseudogenes. There are at least 264 P450 genes based on comparing gene sequences with distinct overlapping ESTs and GSSs in each family. There are a few additional sequences and contigs that do not overlap, but some of these may be from the same genes. The CYP71 family seems to have the largest group of sequences with a minimum of 55 genes. The CYP81 family has at least 17 genes. The 705 family has at least 33 genes. The best estimate I can make about how many P450s Arabidopsis will have is based on the number of P450 genes sequenced only from genomic high throughput sequencing. The genome is currently (Feb. 9, 2000 data) 84.7% complete from the MOT tables . Another estimate is from the Arabidopsis Genome Initiative at 92.3% (May 30, 2000) AGI total. This is more accurate because it includes unfinished genomic sequences that are not included in the MOT tables. I have now switched to the AGI total for this calculation. I am assuming now that all of the Arabidopsis genome has been sequenced based on this effort and not from individual gene cloning projects. From counting the genes identified from the genomic data alone I find 264 P450 genes listed below. If 264 P450 genes have been found in 92.3% of the genome then I estimate there will be 286 P450s in Arabidopsis. This number assumes a random distribution of genes on the chromosomes and that the sequencers have not skimmed the highest gene density regions first. This number seems surprisingly high. There are approximately 22 more P450 genes that remain to be found, though some are represented by ESTs or GSS fragments. The FASTA file of Arabidopsis P450 sequences has 302 entries. 264 P450s found in Arabidopsis genomic sequencing 51A1, 51A2, 71A12, 71A13, 71A14, 71A15, 71A16, 71A17P, 71A18, 71A19 71A20 71A21, 71A22, 71A23, 71A24, 71A25, 71A26, 71A27 71A28 71B2, 71B3, 71B4, 71B5, 71B6, 71B7, 71B8, 71B9, 71B10, 71B11, 71B12, 71B13, 71B14, 71B15, 71B16, 71B17, 71B18, 71B19, 71B20, 71B21, 71B22, 71B23, 71B24, 71B25, 71B26, 71B27, 71B28, 71B29, 71B30P, 71B31, 71B32, 71B33, 71B34, 71B35, 71B36, 71B37, 71B38, 72A7, 72A8, 72A9, 72A10 72A11, 72A12P, 72A13, 72A14, 72A15, 72B1, 72C1, 73A5, 74A, 74B2, 75B1, 76C1, 76C2, 76C3, 76C4, 76C5, 76C6, 76C7, 76C8P, 76G1, 77A4, 77A5P, 77A6, 77A7, 77A8P, 77A9, 77B1, 78A5, 78A6, 78A7, 78A8, 78A9, 78A10, 79A2, 79A3P, 79A4P, 79B2, 79B3, 79B4P, 79C1, 79C2, 79C3P, 79C4P, 79C5P, 79F1, 79F2, 81D1, 81D2, 81D3, 81D4, 81D5, 81D6, 81D7, 81D8, 81D10, 81F1, 81F2, 81F3, 81F4, 81G1, 81H1, 81K1, 81K2, 82C2, 82C3, 82C4, 82F1, 82G1, 83A1, 83A2, 84A1, 85A1, 86A1, 86A2, 86A4, 86A7, 86A8, 86B1, 86C1, 86C2, 86C3, 86C4, 87A2, 87A3P, 88A3, 88A4, 89A2, 89A3, 89A4, 89A5, 89A6, 89A7, 89A9, 90A1, 90B1, 90C1, 90D1, 93D1, 94B1, 94B2, 94B3, 94C1, 94D1, 94D2, 94D3P, 96A1, 96A2, 96A3, 96A4, 96A5, 96A6P, 96A7, 96A8, 96A9, 96A10, 96A11, 96A12 96A13, 96A14P, 97B3, 97C1, 98A3, 98A8, 98A9, 701A3, 702A1, 702A2, 702A3, 702A4P, 702A5, 702A6, 702A7P, 702A8, 703A2, 704A1, 704A2, 705A1, 705A2, 705A3, 705A4, 705A5, 705A6, 705A8, 705A9, 705A10P, 705A11P, 705A12, 705A13, 705A14P, 705A15, 705A16, 705A17P, 705A18, 705A19, 705A20, 705A21, 705A22, 705A23, 705A24, 705A25, 705A26P, 705A27, 705A28, 705A29P, 705A30, 705A31P, 705A32, 705A33, 706A1, 706A2, 706A3, 706A4, 706A5, 706A6, 706A7, 707A1, 707A2, 707A3, 707A4, 708A1, 708A2, 708A3, 709A1, 709A2, 709B1, 709B2, 709B3, 710A1, 710A2, 710A3, 710A4, 711A1, 712A1, 712A2, 714A1, 714A2, 715A1, 716A1, 716A2, 718A1 720A1, 721A1, 722A1, 724A1 All 264 named P450 genes are found in genomic sequencing. There are no confidential P450s from Arabidopsis. The most frequent P450 Arabidopsis ESTs are from CYP90A1 (16ESTs), CYP83A2 (12ESTs), CYP76C2 (8 ESTs), CYP71B20 (7 ESTs), CYP73A5 (7 ESTs) CYP98A3 (7 ESTs), CYP86A1, (7 ESTs), CYP86A2 (6 ESTs), CYP708A2 (6 ESTs), CYP706A1 (5 ESTs and 7 more that could be either 706A1 or 706A2), CYP71B6 (5 ESTs), CYP71B7 (5 ESTs), CYP85A1 (5 ESTs) 124 named Arabidopsis P450 sequences have no ESTs present in the EST database This does not include the 24 pseudogenes (see named genes with P on end of name). This has not been double checked since the new Arab ESTs have been deposited with AW and AV accession numbers. CYP51A1, CYP71A14, CYP71A15, CYP71A18, CYP71A19, CYP71A20, CYP71A21, CYP71A23, CYP71A24, CYP71A27, CYP71A28, CYP71B8, CYP71B9, CYP71B10, CYP71B12, CYP71B13, CYP71B14, CYP71B15, CYP71B16, CYP71B17, CYP71B21, CYP71B22, CYP71B23, CYP71B24, CYP71B25, CYP71B26, CYP71B27, CYP71B29, CYP72A9, CYP72A10, CYP72C1, CYP75B1, CYP76C1, CYP76C3, CYP76C4, CYP76C7, CYP77A4, CYP77A6, CYP78A6, CYP78A7, CYP78A8, CYP78A10, CYP79A2, CYP79B3, CYP79C1, CYP79C2, CYP79F2, CYP81D1, CYP81D2, CYP81D3, CYP81D6, CYP81D7, CYP81D10, CYP81G1, CYP82C2, CYP82C3, CYP82C4, CYP82F1, CYP82G1, CYP86A4, CYP86A8, CYP86C1, CYP86C3, CYP86C4, CYP86D1, CYP88A4, CYP89A3, CYP89A4, CYP89A6, CYP89A7, CYP89A8, CYP90B1, CYP93D1, CYP94B1, CYP94B2, CYP96A1, CYP96A2, CYP96A3, CYP96A5, CYP96A7, CYP96A8, CYP96A9, CYP96A10, CYP96A11, CYP96A12, CYP98A8, CYP701A3, CYP702A1, CYP702A2, CYP702A3, CYP702A5, CYP702A6, CYP705A2, CYP705A3, CYP705A4, CYP705A6, CYP705A8, CYP705A9, CYP705A12,CYP705A25,CYP706A3, CYP706A4, CYP706A5, CYP706A7, CYP707A2, CYP707A4, CYP708A1, CYP709A1, CYP709A2 CYP709B1, CYP709B2, CYP710A2, CYP710A3, CYP710A4, CYP711A1, CYP712A1, CYP712A2, CYP714A1, CYP714A2, CYP715A1, CYP716A1, CYP716A2, CYP718, CYP720A1, CYP721A1 4 plant families are not yet represented in Arabidopsis (80, 92, 99, 719) CYP703 is the most recent family to be removed from this list Feb. 16, 2000. CYP717 was on this list, but it has been joined with CYP81. CYP80 and CYP719 are only seen in ranunculales and may be specific to this group. CYP92 is seen in both monocots and dicots so it is expected to be found in Arabidopsis. CYP701 was removed from this short list. CYP701A3 is the ent-kaurene oxidase gene GA3. CYP95 has been discontinued as a family since it contained a significant sequence error It was really a member of the CYP72A subfamily.
CYP51A1 Complete sequence AC002329 CYP51A2 Complete sequence AC007296 comp(52507-54167) ESTs: N65031, AA651059, AA720360 no GSSs CYP71A12 Complete sequence AC002340 T11J7.14 no ESTs CYP71A13 Complete sequence AC002340 T11J7.16 ESTs H76866, AA605540 CYP71A14 Complete sequence AF069716 10000-14000 region no ESTs CYP71A15 Complete sequence AF069716 16000-19000 region no ESTs GSS: AL093403 BAC T11H3 CYP71A16 Complete sequence AB022210 comp(26631-24211) ESTs AA395422, T43806 CYP71A17P pseudogene on AB017065 CYP71A18 Complete sequence AC007296 83847-85677 no ESTs no GSSs CYP71A19 Complete sequence AL049608 11443-13432 no ESTs no GSSs CYP71A20 Complete sequence AL049608 21221-23811 no ESTs no GSSs CYP71A21 Complete sequence AL049659 comp(61576..62187,62279..63139) also U31288 (fragment) GSS B25921, AL080661 BAC F1I22 CYP71A22 Complete sequence AL049659 comp (64631..65242,65328..66188) EST N38590 GSS: AL096411 BAC T16P17 CYP71A23 Complete sequence AL049659 comp (67262..67861,68005..68856) CYP71A24 Complete sequence AL049659 comp(70246..71784) CYP71A25 Complete sequence AL049659 comp (73216..73830,73929..74786) ESTs AA395671, T22148, AI100632 CYP71A26 Complete sequence AL049659 comp(76207..76818,76952..77809) CYP71A27 Complete sequence AL022224 F1C12.160a this sequence is really 2 genes represented as one gene. no ESTs CYP71A28 Complete sequence AL022224 F1C12.160b this sequence is really 2 genes represented as one gene. no ESTs CYP71B2 Complete sequence D78605 also AC007357 31915-33510 ESTs: T43466, Z18072, Z35218 CYP71B3 Complete sequence D78602 no ESTs CYP71B4 Complete sequence D78603 ESTs: T21833 CYP71B5 Complete sequence D78601 clone T04D02 EU Chr 3 project EST AI997548 CYP71B6 Complete sequence D78604 ESTs: F14179, F14199, F15479, N65349, F19855 GSS: AL095273 BAC T14L18 CYP71B7 Complete sequence X97864. also AC007357 39975-41560 ESTs: R65111, T04541, T04814, T44310, T44875 CYP71B8 Complete sequence AB011485 no ESTs GSS: AL089388 BAC T3M21 CYP71B9 Complete sequence AC004136 no ESTs 47% identical to 71B1 CYP71B10 Complete sequence AB019233 no ESTs or GSSs CYP71B11 Complete sequence AC006259 and AC005964 ESTs Z33963, Z33677 GSSs B28673, AQ011301, B78204, AL095836 BAC T15N16, AL087348 BAC F14D13 CYP71B12 Complete sequence AC006259 and AC005964 no ESTs or GSSs CYP71B13 Complete sequence AC006259 and AC005964 no ESTs GSS: AL089222 BAC T3H8, AL084167 BAC F8E24 CYP71B14 Complete sequence AC006259 no ESTs or GSSs CYP71B15 Complete sequence AB016889 no ESTs or GSSs CYP71B16 Complete sequence AB024038 comp(13279-14849) no ESTs no GSSs CYP71B17 Complete sequence AB024038 comp(15932-17496) no ESTs no GSSs CYP71B18 Complete sequence AB024038 comp(18089-19609) ESTs ESTs AA650762, T44224, AI099821 (2 diffs) T44855 (3 diffs) GSS B25335 CYP71B19 Complete sequence AB024038 comp(21194-22760) ESTs H76241 no GSSs CYP71B20 Complete sequence AB024038 comp(26044-27630) ESTs T04186, T45449, T21193, AA585920 (1 diff) T20987 (3 diffs) T04452, T04714 (these two not in GenBank anymore, but = AA585920) no GSSs CYP71B21 Complete sequence AB024038 comp(31112-32720) no ESTs GSS AL092575 BAC T10E10, AL092574 BAC T10E9 CYP71B22 Complete sequence AB024038 36984-38609 no ESTs no GSSs CYP71B23 Complete sequence AB024038 comp(40969-42839) no ESTs no GSSs CYP71B24 Complete sequence AB024038 comp(45837-47529) no ESTs GSSs AQ011201 (5 diffs/178 = 97%) B96063 4 diffs/126 = 97% these may be a related gene. CYP71B25 Complete sequence AB024038 comp(75254-76832) no ESTs no GSSs CYP71B26 Complete sequence AB024038 comp(80407-82076) no ESTs no GSSs CYP71B27 Complete sequence AC007357 28180-29804 GSSs B11396, B21019 CYP71B28 Complete sequence AC007357 34549-36103 ESTs T76255, T20906, T14112, N65665, AI100027 GSS: AL093252 BAC T11D21 CYP71B29 Complete sequence AC007357 36686-38241 CYP71B30P Partial sequence T04D02 from the EU Chr 3 project 82204-83671 missing N-term 80 AA CYP71B31 Complete sequence T04D02 from the EU Chr 3 project 84593-86302 GSSs AL092788 AL081333 No ESTs CYP71B32 Complete sequence T04D02 from the EU Chr 3 project 87497-89156 two in frame stop codons may be a pseudogene CYP71B33 AP001298 comp(479-2126) CYP71B34 AP001298 comp(2977-4644) ESTs Z33952, Z34037, Z34791, two diffs with Z34126 CYP71B35 AP001298 comp(4867-6557) CYP71B36 AP001298 comp(8161-9842) CYP71B37 AP001298 comp(10654-12314) EST Z27299 GSS B22554 CYP71B38 AL353865 complete sequence 29822-28106 chromosome 3, BAC clone T10D17 79% to 71B5 CYP71D3X Complete sequence = 71B37 EST Z27299 CYP72A7 Complete sequence AB023038 EST R90024 CYP72A8 Complete sequence AB023038 no ESTs CYP72A9 Complete sequence AB023038 no ESTs CYP72A10 Complete sequence AB023038 no ESTs CYP72A11 Complete sequence AB023038 ESTs H36129, T04134, T13009 CYP72A12P Complete sequence AB023038 no ESTs GSS B24203 CYP72A13 Complete sequence AB023038 ESTs T44202, N96036, N96740, T22603 CYP72A14 Complete sequence AB023038 ESTs H36956, Z46541, Z46540 CYP72A15 Complete sequence AB023038 ESTs T45915, T22449 CYP72B1 Complete sequence AC003105 42% identical to 72 from Cath. roseus ESTs T04442 CYP72C1 Complete sequence AC007651 45% identical to 72A9 also on AC051629 CYP73A5 Complete sequence U71080/U71081 and U93215. ESTs: F19837, N65601, T04086, T43776, T44874, AA395398, AA657317 CYP74A Complete sequence X92510 and AB007647. ESTs: N65720, AA394958(almost identical) CYP74B2 Complete sequence Z97339 ESTs T20864, AA395621 CYP75B1 AB005249 AL133421 chromosome 5, BAC clone F13G24.190 join(67348..67797,68521..68961,69040..69210,69291..69770) no ESTs no GSSs CYP76C1 Complete sequence AC003680 F17K2.9 no ESTs CYP76C2 Complete sequence AC003680 F17K2.10 ESTs H37250, T04013-T04017 Z33828, Z34697 GSS: AL084053 BAC F8B14 CYP76C3 Complete sequence AC003680 F17K2.11 no ESTs GSS: AL088340 BAC T1M7 CYP76C4 Complete sequence AC003680 F17K2.8 no ESTs CYP76C5 Complete sequence AC010164 F14M2 CYP76C6 Complete sequence AC010164 F14M2 CYP76C7 Complete sequence clone T27I15 contig 5 from European Chr. 3 project CYP76C8P nearly complete missing last exon and an internal deletion clone T27I15 contig 5 from European Chr. 3 project CYP76G1 Complete sequence Clone F08J02 from the EU Chr 3 project comp(64780-66639) CYP77A4 Complete sequence AL162972 25962-27500 clone T1E32 also on AL162875 T32M21 GSS B97241 from BAC T32M21TR no ESTs GSS: AL082681 BAC F5F6 CYP77A9 Complete sequence AL162875 87274-88803 T32M21_230 no GSSs no ESTs CYP77A5P C-terminal fragment AP000414 13981-14298 Matches EST H36337 but this is probably not from this pseudogene, but from the parent gene. CYP77A6 Complete sequence AC011560 71181-72725 chromosome III clone F13M14 No ESTs, GSS B30234 CYP77A7 Complete sequence AC011560 72998-74939 chromosome III clone F13M14 GSSs B25606, B24360, AL093147 ESTs AI995272 W43241 CYP77A8P pseudogene fragments AB010077 clone MYH19 51066-51110 (26-40), 56196-56252 (167-185), 56264-56377 (195-232), 75123-75146 (233- 240) CYP77B1 Complete sequence AC007296 comp(89516-91048) ESTs Z30775, Z30776 noGSSs CYP78A5 Complete sequence AC027656 comp(32530-34190) EST T45256 no GSSs chromosome 1 clone F21F23 CYP78A6 Complete sequence AC005819 no ESTs no GSSs CYP78A7 Complete sequence AB016893 no ESTs GSS AQ011443 CYP78A8 Complete sequence AC007323 comp(15232-17051) no ESTs no GSSs CYP78A9 Complete sequence EST AI995549 no GSS contig 31 of clone F21F14 from the EU chr 3 project 8955-10728 numbering will change as contigs are joined CYP78A10 Complete sequence AC016662 no ESTs no GSS chr I clone F2P9 64% to 78A2 comp(102654-104343) CYP79A2 Complete sequence AB010692 comp(11000-13200 region) I helix motif not strong no ESTs CYP79A3P Complete sequence AB025602 14755-16760 no ESTs no GSSs CYP79A4P Lone C-terminal exon AB025602 22090-22536 no ESTs GSS B61504 CYP79B2 Complete sequence AF069495, AL035708 T5J17.120 ESTs: T42902, AA395121 CYP79B3 Complete sequence AC006592 comp(103303-105685) BAC F14M13 85% identical to 79B2 noESTs noGSSs also AC007168 76873-79255 clone T26C19 CYP79B4P lone exons AC006592 comp(101836-101952) and comp(102489-102605) also AC007168 79953-80069 and 80606-80722 clone T26C19 CYP79C1 Complete sequence AC007202 comp(77381-79610) clone T8K14 AC002986 comp(75-1680) nearly complete sequence (missing C-terminal exons) no ESTs CYP79C2 Complete sequence AC008051 1899-4148 four frameshifts, might be a pseudogene CYP79C3P partial sequence AC008051 5709-6541 pseudogene fragments CYP79C4P partial sequence AC008051 7210-7435 pseudogene fragments CYP79C5P partial sequence AC008051 9049-9895 pseudogene fragments CYP79F1 Complete sequence AC006341 comp(71509-69310) ESTs F14189, F14190 GSS: AL093101 BAC T11A9 CYP79F2 Complete sequence AC006341 comp(75131-73147) no ESTs 98% identical to GSS B20058 CYP81D1 Complete sequence D78606 formerly CYP91A1 families joined no ESTs also on AB026661 comp(8484-10672) CYP81D2 Complete sequence AL035601 F6G17.10, Z99707 no ESTs GSS AQ250603 AL091699 BAC T8H13 CYP81D3 Complete sequence Z99707 seq. (b) nucleotides 1167-2731 no ESTs CYP81D4 Complete sequence Z99707 seq. (c) nucleotides 3301-5134 ESTs T41670, T42716, T43640, AA395149 CYP81D5 Complete sequence Z99707 seq. (d) nucleotides 6180-7938 no ESTs CYP81D6 Complete sequence AC002391 T20D16.15 no ESTs CYP81D7 Complete sequence AC002391 T20D16.18 no ESTs CYP81D8 Complete sequence ESTs H76015, T41596, N38687, AA586064 CYP81D10 Complete sequence AC013288 14837-27604 74% to 81D2 chromosome I clone T12I7 GSSs B22310, B21261 CYP81F1 Complete sequence D78607 formerly CYP91A2 families joined ESTs T42723, AA394371, AA597575, AI1000386 CYP81F2 Complete sequence EST T88150 no GSSs CYP81F3 Complete sequence AL035601 F6G17.50 EST AA712784 (formerly 81D9) CYP81F4 Complete sequence AL035601 F6G17.60 ESTs T46441, AI099973, AA598000 GSS: AL090945 BAC T6O13 CYP81G1 Complete sequence AB007645 comp(7177-8746) no ESTs CYP81H1 Complete sequence Z99707 seq. (e) nucleotides 9042-11495 ESTs: F13573, F13574 CYP81K1 Complete sequence AL353995 chr 5 clone F12B17 comp(9841-11343) EST AI997093 CYP81K2 Complete sequence AL353995 chr 5 clone F12B17 comp(11529-11633,11733-11837,12108-13583) CYP82C2 Complete sequence AL021636 (F10N7.200) also AL049607 no ESTs CYP82C3 Complete sequence AL021636 (F10N7.220) also AL049607 no ESTs CYP82C4 Complete sequence AL021636 (F10N7.230) also AL049607 no ESTs CYP82F1 Complete sequence AL031369 no ESTs GSSs B29081, B27454 CYP82G1 Complete sequence AB026647 comp(36473-38300) no ESTs no GSSs CYP83A1 Complete sequence: U18929, U69134, D78599 AL035528 ESTs: T88085, AA389789 GSS: AL095159 BAC T14I2 CYP83A2 Complete sequence D78598 formerly CYP83B1 also AL080283 clone F3L17 ESTs: N65459, R30228, R86798, T44634, T46080, T75944, T76397, T88645, Z33831, Z34720, AA404945, T20600 CYP84A1 Complete sequence U38416 also AL022141 EST AA598015 CYP85A1 Complete sequence AB009048 ESTs: N65267, T76590, AA394869, AA404883, AA713019 CYP86A1 Complete sequence: X90458, ESTs: R90073, T45669, Z26357, Z26358, Z35025, AA712696, AA605446 GSS: AL088478 BAC T2A13 CYP86A2 Complete sequence AF013293. ESTs: H76762, T04172, N95869, T46373 T46372, Z25705 CYP86A4 Complete sequence Y09581 11000-12678bp region AC061957 comp(37593-39264) chromosome 1 clone F22L4 no ESTs CYP86A7 Complete sequence AC011622 43171-44742 clone F24D7 EST F14165 CYP86A8 Complete sequence AC004665 77% identical to CYP86A2 no ESTs GSS: AL089168 BAC T3F17 CYP86B1 Complete sequence AB007648 (comp. 14676-16659) ESTs AA586187, T20562 CYP86C1 Complete sequence AC000103 F21J9.9 no ESTs CYP86C2 Complete sequence AB024038 1-1074 AP000601 ESTs Z26059 Z26058 CYP86C3 Complete sequence AC007357 comp(51369-52931) CYP86C4 Complete sequence AC007357 comp(54696-56285) CYP86D1X missing C-terminal AC023279 18465 - 19850 11-FEB-2000 no ESTs GSS B97734, AL084336 BAC end F8K5 renamed CYP94D1 CYP86D2PX AC024226 comp(6890-7005) numerous deletions and frameshifts GSSs AL080732, AQ010792 Renamed CYP94D3P CYP87A2 Complete sequence AC025417 82132-84239 64% identical to 87A1 ESTs H77048, N37614 and AI995175 CYP87A3P AC010926 chromosome I clone IGF-F28P22 This appears to be a pseudogene remnant with two small exons and a frame shift 64524-64809 CYP88A3 Complete sequence AC000098 YUP8H12.23 (85724-87912) ESTs: T43711 CYP88A4 Complete sequence AC005700 no ESTs GSS: AL088126 BAC end T1G2 AL088480 BAC end T2A14 CYP89A2 Complete sequence U61231. ESTs: H77059 probably identical to CYP89A8 with some sequence errors and three frame shifts. CYP89A3 Complete sequence AB010073 no ESTs CYP89A4 Complete sequence AC005897 no ESTs GSS: AL087677 BAC F14M11 also an exact match to 89A7 CYP89A5 Complete sequence AC006193 ESTs H77059, T42576 no GSSs CYP89A6 Complete sequence AC006193 no ESTs no GSSs CYP89A7 Complete sequence AC006193 no ESTs GSS: AL087677 BAC F14M11 also an exact match to 89A4 CYP89A8X Complete sequence AC006193 no ESTs no GSSs see CYP89A2 CYP89A9 Complete sequence AC009895 37255-38905 clone T21P5 54% to 89A4 ESTs T76749, T44110, T45663, T14186, T21361, AA394479 CYP90A1 Complete sequence: X87367, X87368 and AB005237. ESTs: N37221, N38445, N65494, N97134, T22324, T22325, T41675, T43151,T43286, Z17988, Z26124, Z29017, Z29018, AA394655, AA586079, AA720263 GSS: AL087931 CYP90B1 Complete sequence AF044216 clone T03A05 EU Chr 3 project DWF4 gene no ESTs CYP90C1 Complete sequence Z99708 ESTs: AA651564, N96214 CYP90D1 comp(43183-45689) 51% to 90C ESTs = AI994541, R30379, R30380 *CYP91A1 now CYP81D1 Complete sequence D78606 no ESTs *CYP91A2 now CYP81F1 Complete sequence D78607 ESTs: AA394371, T42723, AA597575 *CYP91A3 now CYP81D8 Complete sequence ESTs: H76015, T41596, N38687, AA586064 CYP93D1 Complete sequence AB010697 comp(17400-19000 region) no ESTs GSS: AL088834 BAC T2L17 CYP94B1 Complete sequence AB007649 (comp. 20688-22173) no ESTs GSS: Al081067 BAC end F2E21 CYP94B2 Complete sequence AB010797 AC011624 AC011664 no ESTs no GSSs CYP94B3 Clone T08P19 from the EU Chr 3 project comp(10683-12101) ESTs AI996574, H76099 CYP94C1 Complete sequence AC005824 no ESTs no GSS CYP94D1 missing C-terminal AC023279 18465 - 19850 11-FEB-2000 no ESTs GSS B97734, AL084336 BAC end F8K5 formerly CYP86D1 CYP94D2 AL163972 complete sequence 89747-91303 clone T5P19 CYP94D3P AC024226 comp(6890-7005) numerous deletions and frameshifts GSSs AL080732, AQ010792 formerly CYP86D2P CYP96A1 Complete sequence AC002391 T20D16.19 (comp. 74934-76484) no ESTs CYP96A2 Complete sequence AL021811 (F10M6.190) also on AL021636 (F10N7.20) no ESTs CYP96A3 Complete sequence AC004512 no ESTs CYP96A4 Complete sequence AB019226 no ESTs GSS B96393 CYP96A5 Complete sequence AC007019 F7D8.23 no ESTs no GSS CYP96A6P pseudogene 62% identical to 96A2 AB025623 missing N-terminal CYP96A7 complete sequence AC007519 no ESTs no GSSs CYP96A8 complete sequence AC007519 no ESTs no GSSs CYP96A9 complete sequence AL078620 54378-55928 F23K16.110 CYP96A10 complete sequence AL078620 57169-58608 F23K16.120 CYP96A11 complete sequence AL078620 comp(58770-60179) F23K16.130 CYP96A12 complete sequence AL078620 comp(60765-62291) F23K16.140 CYP96A13 complete sequence AL162973 69536-71045 clone F9G14 CYP96A14P Pseudogene fragment AC026480 9579-10061 chr I clone F15E12 GSSs: B20346, B22314 CYP97B3 Complete sequence Z97337. ESTs: AA040985 CYP97C1 Complete sequence clone T04D02 from the EU Chr 3 project 17165-19444 N-terminal similar to Glycine max EST AI736606 no ESTs CYP98A3 Complete sequence AC002409 T20B9.5 ESTs: H36469, T43253, Z35213, N37715, Z35017, AA728684, AA651162 CYP701A3 Complete sequence AF047719-AF047721 no ESTs GSS: AL087395 BAC F14E19 CYP702A1 Complete sequence AC001229 BAC F5I14.20 CYP702A2 Complete sequence Z97338 seq. (a) nucleotides (comp. 95383-97405) no ESTs GSS: AL087968 BAC T1A22, AL094730 BAC T13L21 CYP702A3 Complete sequence Z97338 seq. (b) nucleotides 101369-104695 no ESTs CYP702A4P lone exon sequence Z97338 seq. (d) nucleotides 126846-126941 no ESTs CYP702A5 Complete sequence Z97338 seq. (h) nucleotides 168806-171661 no ESTs CYP702A6 Complete sequence Z97338 seq. (i) nucleotides 172246-175058 no ESTs CYP702A7P lone exon sequence Z97338 seq. (j) nucleotides 176173-176280 no ESTs CYP702A8 AP001314 comp(11921-14547) CYP703A2 Complete sequence AC023628 comp(16084-17699) 75% to 703A1 EST AA720028 CYP704A1 Complete sequence AC002388 EST AA597388 CYP704A2 Complete sequence AC003680 F17K2.4 ESTs AA597388 GSS: AL087352 BAC end F14D15 CYP705A1 Complete sequence Z97388 seq. (c) nucleotides (comp. 116171-117782) ESTs T21786 CYP705A2 Complete sequence Z97388 seq. (e) nucleotides 127652-128588 no ESTs CYP705A3 Complete sequence Z97388 seq. (f) nucleotides 134833-136422 no ESTs Also on B97399 BAC end sequence T32J16TF CYP705A4 Complete sequence Z97388 seq. (g) nucleotides 153425-155024 genomic sequence X94366 is same gene with some frame shifts. no ESTs CYP705A5 Complete sequence AB017064 9827-11417 ESTs: T04613, T45789, T43499, N97058, T04417 CYP705A6 Complete sequence AC007018 BAC end B97581 no ESTs CYP705A7X chimeric sequence now = 705A30 and 705A32 B97148 and B97150 BAC end sequences T32F24TR and T32G12TR ESTs T42559, AA395098 CYP705A8 Complete sequence AC005623 no ESTs CYP705A9 Complete sequence AC005623 no ESTs CYP705A10P AB017064 comp(81801-81661) lone exon 1 no ESTs CYP705A11P AB017064 comp(12233-12054) lone exon no ESTs CYP705A12 Complete sequence AB022210 comp(18214-16643) no ESTs no GSSs CYP705A13 Complete sequence AC007197 comp(7079-8717) clone T22C12 ESTs T42861, T43276, AA395378 CYP705A14P partial sequence AP000383 amino acids 5-36 followed by a large gap then 414-550 27367-27973 CYP705A15 Complete sequence AP000383 29031-30681 GSS AL009622 CYP705A16 Complete sequence AP000383 32158-33813 CYP705A17P partial sequence AP000383 amino acids 14-271 (222 amino acid gap followed by) 494-519 34749-35590 CYP705A18 Complete sequence AP000383 36850-38505 CYP705A19 Complete sequence AP000383 39232-40867 ESTs F19802, F19864 CYP705A20 Complete sequence AP000383 41713-43331 GSS AL088383 no ESTs CYP705A21 Complete sequence AP000383 44386-46007 GSS AL095325 CYP705A22 Complete sequence AP000383 47200-48831 ESTs T21800, R30233, T88278, AI100646 CYP705A23 Complete sequence AP000383 49393-51005 CYP705A24 Complete sequence AC010155 47066-48677 clone F3M18 CYP705A25 AC012561 47417-49106 clone F11F12 CYP705A26P AC012561 49934-50863 pseudogene missing C-terminal clone F11F12 CYP705A27 AC012561 52311-53952 EST W43443 no GSSs clone F11F12 CYP705A28 AP001304 13048-14798 same as AL091537 T7 end of BAC T8C3 = B77649 CYP705A29P AP001304 15786-16553 CYP705A30 AP001304 15786-16553 part of old 705A7X CYP705A31P AP001304 20651-20824 CYP705A32 AP001304 21561-23630 Identical to B74035 part of old 705A7X CYP705A33 AP001304 24249-25900 2 amino acid diffs with B96118 CYP706A1 Complete sequence AL021635 T12H17.80 ESTs T46392, T44621, T46193, N65907, AA404797 These ESTs could be either 706A1 or 706A2 T21057, T76384, Z24511, AA395860, AA597583, AA597644, AI138147 CYP706A2 Complete sequence AL021635 T12H17.100 ESTs Z17966, These ESTs could be either 706A1 or 706A2 T21057, T76384, Z24511, AA395860, AA597583, AA597644, AI138147 CYP706A3 Complete sequence AB024024 comp(50063-51694) clone K15C23 no ESTs no GSSs CYP706A4 Complete sequence AL080318 clone T4C9 comp(42111-43787) T4C9.140 GSS AL086483 no ESTs CYP706A5 Complete sequence AL080318 clone T4C9 comp(44693-46617) T4C9.150 no ESTs no GSSs CYP706A6 Complete sequence AL080318 clone T4C9 comp(49036-50528) T4C9.160 no ESTs no GSSs CYP706A7 Complete sequence AL080318 clone T4C9 comp(51874-53756) T4C9.170 no ESTs no GSSs CYP707A1 Complete sequence AL021687 (T18B16.200) may be a little short EST T43171 GSS: AL090955 BAC end T6019 may be T6O19 CYP707A2 Complete sequence AC005315 no ESTS GSS B12307 CYP707A3 Complete sequence AB020744 last 22 amino acids are missing CYP707A4 Complete sequence AP000419 comp(68986-71501) clone MVI11 from Kazusa Sept. 3, 1999 GSSs B67654 AND B98253 no ESTs CYP708A1 Complete sequence AC002304 (33000-35500 region) 32% identical to CYP90 no ESTs CYP708A2 Complete sequence AB017064 comp(21958-19304) ESTs H76703, R90011, T13661, T45729, AA605413, AA713328 CYP708A3 Complete sequence AB007260 16229-18271 ESTs Z34056, Z34880 opposite end of Z34056 outside coding region, Z46573 CYP709A1 Complete sequence AB005248 comp(67000-70500) similar to CYP72 no ESTs CYP709A2 Complete sequence AC004146 no ESTs CYP709B1 Complete sequence AC004411 first P450 on cosmid no ESTs CYP709B2 Complete sequence AC004411 second P450 on cosmid no ESTs CYP709B3 Complete sequence AL035602 T29A15.200 EST N38616 CYP710A1 Complete sequence AC004077 phase 1 sequence 16000-18000 region EST T44171, AA395380 also in AC004481 CYP710A2 Complete sequence AC004077 phase 2 sequence comp(25221-26720) no ESTs Also in AC004481 CYP710A3 Complete sequence AC005727 no EST no GSS CYP710A4 Complete sequence AC005727 no EST no GSS CYP711A1 Complete sequence AC004484 most like mammalian CYP5 may be plant homolog of CYP3 clan. No ESTs GSS: AL089622 BAC end T4F8 CYP712A1 Complete sequence AC002561 T24P15.16 comp(38716-39345,39436-40350) no ESTs CYP712A2 Complete sequence AB010697 comp(15190-16802) no ESTs CYP713A1X and CYP713A2X discontinued see 71A27 and 71A28 CYP714A1 Complete sequence 43500-46500 region AF069716 no ESTs B96503 BAC end fragment F23A22TF CYP714A2 Complete sequence 47000-50000 region AF069716 no ESTs CYP715A1 Complete sequence AB019226 no ESTs GSS: AL019226 BAC end F6J7 CYP716A1 Complete sequence AB018112 no ESTs no GSSs CYP716A2 Complete sequence AB018112 no ESTs no GSSs CYP718 Complete sequence AC006931 no ESTs no GSSs CYP720 Complete sequence AC012396 comp(80036-82173) like CYP90 in the 85 clan 38% to 90A1 but less to 90B1 and 90C1 no ESTs no GSSs CYP721 complete sequence AC013258 63642-65480 in the 72 clan most like CYP72B GSSs B20052, B27750 CYP722A1 AC024609 74483-77023, AC025808 35% to 90A1 CYP724A1 AL163817 62148-65132 translation given in Genbank is not correct chromosome 5, BAC clone F18O22 gene = F18O22_190 in 85 clan
There are two CYP51 genes There appear to be at least 17 members in the CYP71A subfamily in Arabidopsis, based on distinct overlapping sequences. B21027 PERF to heme 71B subfamily B21125 Nterm to C-helix 71B subfamily AI993108 91% to 71B18 There appear to be at least 37 members in the CYP71B subfamily in Arabidopsis, based on distinct overlapping sequences. There are at least 54 members in the 71 family AI993723 91% to 72A13 AW004264 89% to 72A15 There appear to be at least 12 members in the CYP72 family in Arabidopsis, based on distinct overlapping sequences. There is 1 CYP73 family member There are 2 CYP74 family members There is at least 1 CYP75 family member There are at least 5 CYP76 family members There are at least 4 different CYP77 members in Arabidopsis There appears to be at least 5 CYP78 members in Arabidopsis. B19224 79F like There are at least 10 members of CYP79 in Arabidopsis No CYP80 members have been found yet H77232 same as F14118 probably in the CYP81D subfamily amino acids 207-338 T14177 probably in the CYP81D subfamily Z26103, Z29010, Z29011 probably in the CYP81 family, subfamily uncertain AI997093 56% to 81D1 subfamily uncertain There are at least 17 CYP81 family members in Arabidopsis There are at least 5 CYP82 family members in Arabidopsis There are at least 2 CYP83 family members in Arabidopsis There is one CYP84 family member in Arabidopsis There appears to be one CYP85 member in Arabidopsis. There appear to be at least 11 members in the CYP86 family in Arabidopsis, based on distinct overlapping sequences. B08375 87 family B10696 87 family differs from B08375 There appears to be at least 3 members in the CYP87 family in Arabidopsis, based on distinct overlapping sequences. B11817 CYP88 family B11581 CYP88 family There appears to be at least 3 CYP88 members in Arabidopsis AI992955 93% to 89A5 There are at least 8 members in the CYP89 family in Arabidopsis B96406, B97796 probably in CYP90 family B21696 probably in CYP90 family There are at least 5 members in the CYP90 family if the BAC ends above are in this family CYP91 was merged with CYP81 CYP92 has no members in Arabidopsis yet There appears to be at least 1 member of the CYP93 family in Arabidopsis AL080732 BAC end F1L6 = AQ010792 40% to 94B1 There appears to be at least 3 CYP94 members in Arabidopsis CYP95 is a discontinued number AI999275 75% to 96A3 CYP96 has at least 9 members in Arabidopsis B67502, B26976 probably in the 97 family AI995768 71% to 97A1 CYP97 has at least 3 members in Arabidopsis H36457 51% identical to CYP98A3 extreme C-terminal opposite end = AA651146 AL080882 BAC end F1017 = AA651146 CYP98 has at least 3 members CYP99 is not yet found in Arabidopsis CYP701 has one member in Arabidopsis CYP702 has seven members in Arabidopsis CYP703 has one member in Arabidopsis AA728483 704 family 54% identical to 704A2 CYP704 has at least 3 members in Arabidopsis There appears to be at least 33 CYP705 members in Arabidopsis CYP706 has 3 members in Arabidopsis CYP707 has at least 5 members in Arabidopsis CYP708 has at least 6 members in Arabidopsis CYP709 has 5 members CYP710 has 4 members CYP711 has 1 member CYP712 has 2 members CYP713 merged with 71A as 71A27 and 71A28 CYP714 has 2 members CYP715 has 1 member CYP716 has 2 members CYP717 has been joined with CYP81 = CYP81J1 CYP718 has 1 member CYP719 has not been seen in Arabidopsis yet CYP720 has 1 member CYP721 has 1 member CYP722 has 1 member