P450s that have appeared since the 1993 P450 nomenclature update.
This is part E of the bibiographic P450 files.
This section contains bacterial sequences CYP101 to CYP174.
This includes references that were incomplete and duplications
of sequences that were already in the update. If a sequence
is assigned an accession number that was not in the old update
it is included in this list. 48 new P450s were added July 27, 2000
Four new sequences were added Jan. 9, 2001 CYP102C1, CYP172-174.
Added CYP175A1 9/17/2001
Compiled by David R. Nelson
Last modified June 2, 2003 added 25 new sequences.
Last modified Nov. 5, 2003 There are now 501 bacterial P450s
51 Family
101 Family
102 Family
103 Family
104 Family
105A Subfamily
105B Subfamily
105C Subfamily
105D Subfamily
105E Subfamily
106 Family
107A Subfamily
107B Subfamily
107C Subfamily
107D Subfamily
107E Subfamily
107F Subfamily
107G Subfamily
107H Subfamily
107J Subfamily
108 Family
109 Family
110 Family
111 Family
112 Family
113A Subfamily
113B Subfamily
114 Family
115 Family
116 Family
117 Family
118 Family
119 Family
120 Family
121 Family
122 Family
123 Family
124 Family
125 Family
126 Family
127 Family
128 Family
129 Family
130 Family
131 Family
132 Family
133 Family
51 Family
CYP51 Mycobacterium tuberculosis
GenEMBL Z80226 (34809bp) gi 1550642 Rv0764c
complement (6140-7495)
33.7% identical to CYP51 over 439AA overlap
this is a bacterial CYP51
CYP51 Mycobacterium bovis subsp. bovis AF2122/97
NC_002945 complete genome complement(858662..858868)
CYP51 100% match
locus_tag = Mb0786c
CYP51 Mycobacterium avium
TIGR contig:3273:m_avium Length = 5,475,738
79% to CYP51 M. tuberculosis
3021360 TSTVVPRVSGGEEEHGHLEEFRTDPIGLMQRVRDECGDVGWFQLVDKHVILLSGAQANEF 3021539
3021540 FFRSADEDLDQAEAYPFMTPIFGKGVVFDASPERRKEMLHNSALRGEQMKGHASTIEGEV 3021719
3021720 KKMIADWGDEGEIELLDFFAELTIYTSTACLIGLKFREQLDHRFAEYYHDLERGTDPLCY 3021899
3021900 VDPYLPIESFKRRDEARVKLVALVQEIMDQRLANPPKDKADRDMLDVLVSIKDEDGKPRF 3022079
3022080 SADEITGMFISLMFAGHHTSSGTSAWTLIELIRHPDVYAEVLAELEELYADGQEVSFHAL 3022259
3022260 RSIPKLDNVVKETLRLHPPLIILMRVAKGEFEVEGFPIHEGDYVAASPAISNRIPEDFPD 3022439
3022440 PDAFKPDRYNKPEQADIVNRWTWIPFGAGRHRCVGAAFAQMQIKAIFSVLLREYDFEMAQ 3022619
3022620 PADSYRNDHSKMVVQLARPAKVRYRKR 3022700
CYP51 Mycobacterium smegmatis
TIGR contig:3439:m_smegmatis Length = 6,989,783
80% to CYP51 M. tuberculosis
4858809 VPRVSGGEEEHGHLEEFRTDPIGLMKRVRSECGDVGWFQLADKQVVLLSGAEANEFFFRS 4858988
4858989 SDSELNQAEAYPFMTPIFGEGVVFDADPERRAEMLHNTALRGEQMKGHAATIENEVRRMV 4859168
4859169 ESWGDEGEIDLLEFFAELTIYTSTACLIGVKFRNQLDKRFADYYHLLERGTDPLCYVDPY 4859348
4859349 LPIESFRIRDEARANLVELVQEVMNGRIANPPKDKSDRDLLDVLVSIKDEDGTPRFSANE 4859528
4859529 VTGMFISLMFAGHHTSSGTASWTLIELLRHPEFYAKVQAELDDLYADGQEISFHALRQIP 4859708
4859709 NLDNALKETLRLHPPLIILMRVAQDEFEVAGRPIHKGQMVAASPAISNRIPEDFPDPDTF 4859888
4859889 DPDRYDKPRQEDLINRWTWIPFGAGKHRCVGAAFAQMQIKAIFSVLLRDFEFEMAQPSES 4860068
4860069 YRNDHSKMVVQLARPAKVRYRRR 4860137
CYP51 Methylococcus capsulatus
TIGR contig:221:m_capsulatus
49% to CYP51 M. tuberculosis
NOTE FUSION PROTEIN EXTENDS C-TERMINAL.
SEE J. Biol. Chem., Vol. 277, Issue 49, 46959-46965, December 6, 2002
A Novel Sterol 14-Demethylase/Ferredoxin Fusion Protein (MCCYP51FX) from
Methylococcus capsulatus Represents a New Class of the Cytochrome P450
Superfamily
Colin J. Jackson¤, David C. Lamb¤, Timothy H. Marczylo, Andrew G. S. Warrilow, Nigel J. Manning¦, David J. Lowe, Diane
E. Kelly, and Steven L. Kelly
908332 MSHPPSNTP
908305 PVKPGGLPLLGHILEFGKNPHAFLMALRHEFGDVAEFRMFHQRMVLLTGSQASEAFYRAP 908126
908125 DEVLDQGPAYRIMTPIFGRGVVFDARIERKNQQLQMLMPALRDKPMRTYSEIIVAEVEAM 907946
907945 LRDWKDAGTIDLLELTKELTIYTSSHCLLGAEFRHELNTEFAGIYRDLEMGIQPIAYVFP 907766
907765 NLPLPVFKRRDQARVRLQELVTQIMERRARSQERSTNVFQMLIDASYDDGSKLTPH 907598
907597 EITGMLIATIFAGHHTSSGTTAWVLIELLRRPEYLRRVRAEIDALFETHGRVTFESLRQM 907418
907417 PQLENVIKEVLRLHPPLILLMRKVMKDFEVQGMRIEAGKFVCAAPSVTHRIPELFPNPEL 907238
907237 FDPDRYTPERAEDKDLYGWQAFGGGRHKCSGNAFAMFQIKAIVCVLLRNYEFELAAAPE 907061
907060 SYRDDYRKMVVEPASPCLIRYRRRDAP 906980
101 Family
CYP101A1 Pseudomonas putida
GenEMBL D00528 (1950bp)
Koga,H., Yamaguchi,E., Matsunaga,K., Aramaki,H. and Horiuchi,T.
Cloning and nucleotide sequences of NADH-putidaredoxin reductase
gene(camA) and putidaredoxin gene(camB) involved in cytochrome
P-450cam hydroxylase of Pseudomonas putida
J. Biochem. 106, 831-836 (1989)
Note: only the last 93 nucleotides of the cam gene was cloned along
with two downstream genes.
CYP101A1 Pseudomonas putida
PIR C60886 (last 8 amino acids)
Romeo, C., Moriwaki, N., Yasunobu, K.T., Gunsalus, I.C.,
Koga, H.
Identification of the coding region for the putidaredoxin
reductase gene from the plasmid of Pseudomonas putida.
J. Protein Chem. 6, 253-261 (1987)
CYP101B1 Novosphingobium aromaticivorans
NZ_AAAV01000165.1
complement(29626..30870) gene = Saro2804
43% to CYP101
MLPHDRGQNSTRRITAMEAPAHVPADRVVDIDIYMPPGLAEHGF
HKAWSDLSAGNPAVVWTPRNEGHWIALGGEALQEVQSDPERFSSRIIVLPKSVGEMHG
LIPTTIDPPEHRPYRQLLNAHLNPGAIRGLSESIRQTAVDLIEGFAAQGHCNFTAQYA
EQFPIRVFMALVGIEASEAPRIRHWAECMTRPGMDMTFDEAKAVFFDYVGPLVDARRE
TPGEDMISAMINADLGDGRRLTRDEALSVVTQVLIAGLDTVVNVLGFIMRELAGNPAL
RADLRQRGADILPVVHELFRRFGLVSIAREVRRDIEFHGVHLKAGDMIAIPTQVHGLD
PRVNPDPLAIDPSRKRARHSTFGSGPHMCPGQELARKEVAITLEEWLRRIPDFALGPN
SDLSPVPGIVGALRRVELVWNT
CYP101C1 Novosphingobium aromaticivorans
NZ_AAAV01000133.1
complement(4199..5389) gene = Saro1574
44% to CYP101A1
MIPAHVPADRVVDFDIFNPPGVEQDYFAAWKTLLDGPGLVWSTA
NGGHWIAARGDVVRELWGDAERLSSQCLAVTPGLGKVMQFIPLQQDGAEHKAFRTPVM
KGLASRFVVALEPKVQAVARKLMESLRPRGSCDFVSDFAEILPLNIFLTLIDVPLEDR
PRLRQLGVQLTRPDGSMTVEQLKQAADDYLWPFIEKRMAQPGDDLFSRILSEPVGGRP
WTVDEARRMCRNLLFGGLDTVAAMIGMVALHLARHPEDQRLLRERPDLIPAAADELMR
RYPTVAVSRNAVADVDADGVTIRKGDLVYLPSVLHNLDPASFEAPEEVRFDRGLAPIR
HTTMGVGAHRCVGAGLARMEVIVFLREWLGGMPEFALAPDKAVTMKGGNVGACTALPL
VWRA
CYP101D1 Novosphingobium aromaticivorans
NZ_AAAV01000085.1
complement(6803..8068) gene = Saro0669
44% to CYP101
MNAQTSTATQKHRVAPPPHVPGHLIREIDAYDLDGLEQGFHEAW
KRVQQPDTPPLVWTPFTGGHWIATRGTLIDEIYRSPERFSSRVIWVPREAGEAYDMVP
TKLDPPEHTPYRKAIDKGLNLAEIRKLEDQIRTIAVEIIEGFADRGHCEFGSEFSTVF
PVRVFLALAGLPVEDATKLGLLANEMTRPSGNTPEEQGRSLEAANKGFFEYVAPIIAA
RRGGSGTDLITRILNVEIDGKPMPDDRALGLVSLLLLGGLDTVVNFLGFMMIYLSRHP
ETVAEMRREPLKLQRGVEELFRRFAVVSDARYVVSDMEFHGTMLKEGDLILLPTALHG
LDDRHHDDPMTVDLSRRDVTHSTFAQGPHRCAGMHLARLEVTVMLQEWLARIPEFRLK
DRAVPIYHSGIVAAVENIPLEWEPQRVSA
CYP101D2 Novosphingobium aromaticivorans
NZ_AAAV01000042
complement(5601..6899) gene = Saro0208
63% to 101D1
MGTTRMDTFNPQESRLATNFDEAVRAKVERPANVPEDRVYEIDM
YALNGIEDGYHEAWKKVQHPGIPDLIWTPFTGGHWIATNGDTVKEVYSDPTRFSSEVI
FLPKEAGEKYQMVPTKMDPPEHTPYRKALDKGLNLAKIRKVEDKVREVASSLIDSFAA
RGECDFAAEYAELFPVHVFMALADLPLEDIPVLSEYARQMTRPEGNTPEEMATDLEAG
NNGFYAYVDPIIRARVGGDGDDLITLMVNSEINGERIAHDKAQGLISLLLLGGLDTVV
NFLSFFMIHLARHPELVAELRSDPLKLMRGAEEMFRRFPVVSEARMVAKDQEYKGVFL
KRGDMILLPTALHGLDDAANPEPWKLDFSRRSISHSTFGGGPHRCAGMHLARMEVIVT
LEEWLKRIPEFSFKEGETPIYHSGIVAAVENVPLVWPIAR
102 Family
CYP102A1 Bacillus megaterium
Ruettinger,R.T.,Wen, L.-P. and Fulco, A.J.
Coding Nucleotide, 5'-Regulatory, and Deduced Amino Acid Sequences of
P450BM-3, a Single Peptide Cytochrome P450:NADPH-P450 Reductase from
Bacillus megaterium.
J. Biol. Chem. 264, 10987-10995 (1989)
CYP102A1 Bacillus megaterium
GenEMBL J04832 (4957bp)
Ravichandran,K.G., Boddupalli, S.S., Hasemann,C.A.,
Peterson,J.A. and Deisenhofer,J.
Crystal structure of hemoprotein domain of P450BM-3, a prototype
for microsomal P450s.
Science 261, 731-736 (1993)
P450 is N-terminal
CYP102A2 Bacillus subtilis
GenEMBL D87979
Yamamoto, H., S. Uchiyama, F. A. Nugroho, and J. Sekiguchi.
A 23.4 kb segment at the 69 degrees-70 degrees region of the
Bacillus subtilis genome.
Microbiology. 143, 1317-20 (1997)
Gene name yfnJ 66.4% identical to CYP102A1 P450 part only
also called YetO (fusion of P450 and reductase like CYP102A1, P450 part is
N-terminal)
CYP102A3 Bacillus subtilis
GenEMBL U93874, Z99117
Sorokin, A., A. Bolotin, B. Purnelle, H. Hilbert, J. Lauber, A.
Dusterhoft, and S. D. Ehrlich.
Sequence of the Bacillus subtilis genome region in the vicinity of
the lev operon reveals two new extracytoplasmic function RNA
polymerase sigma factors SigV and SigZ.
Microbiology. 143, 2939-43 (1997)
Gene name yrhJ most similar to CYP102A2
(fusion of P450 and reductase like CYP102A1 P450 part is N-
terminal)
CYP102A4 Bacillus anthracis str. Ames
GenPept AAP27014
bifunctional P-450:NADPH-P450 reductase 1
79% to 102A2
1 MDKKVSAIPQ PKTYGPLGNL PLIDKDKPTL SFIKLAEEYG PIFRMQTLSD TIIVVSGHEL
61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETQEPNWQ KAHNILMPTF SQRAMKDYHA
121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSFYR ETPHPFITSM
181 TRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSE NQEENDLLSR
241 MLNVQDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR
301 VLTDSTPTYQ QVMKLKYIRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV
361 LIPQLHRDKD AWGDNVEEFQ PERFEELDKV PHHAYKPFGN GQRACIGMQF ALHEATLVMG
421 MLLQHFEFID YEEYQLDVKQ TLTLKPGDFK IRIVPRNQTI SHTTVLAPTE EKLKNHEIKQ
481 QVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVAAL NDRIGSLPKE
541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKG DELKGVQYAV FGCGDHNWAS TYQRIPRYID
601 EQMAQKGATR FSTRGEADAS GDFEEQLEQW KQRMWSDAMK VFGLELNKNM EKERSTLSLQ
661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSERSTRHIE ISLPEGATYK EGDHLGVLPI
721 NSEKNVNRIL KRFGLNGKDQ VILSASGRSV NHIPLDSPVR LYDLLSYSVE VQEAATRAQI
781 REMVTFTACP PHKKELESLL EDGVYQEQIL KKRISMLDLL EKYEACEIRF EPFLELLPAL
841 KPRYYSISSS PLVAQDRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP
901 QSNFQLPENP ETPIIMVGPG TGIAPFRGFL QARRVQKQKG MNVGEAHLYF GCRHPEKDYL
961 YRTELENDER DGLISLHTAF SRLEGQAKTY VQHVIKEDRI HLISLLDNGA HLYICGDGSK
1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR LQEEGRYGKD VWAGI
CYP102A5 Bacillus cereus ATCC 14579
GenPept AAP10153
NADPH-cytochrome P450 reductase/P450 fusion
79% to 102A2 Bacillus subtilis
1 MEKKVSAIPQ PKTYGPLGNL PLIDKDKPTL SFIKIAEEYG PIFQIQTLSD TIIVVSGHEL
61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETHEPNWK KAHNILMPTF SQRAMKDYHA
121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSFYR ETPHPFITSM
181 TRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSG DQEENDLLSR
241 MLNVPDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR
301 VLTDPTPTYQ QVMKLKYMRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV
361 LIPQLHRDKD AWGDNVEEFQ PERFEELDKV PHHAYKPFGN GQRACIGMQF ALHEATLVMG
421 MLLQHFELID YQNYQLDVKQ TLTLKPGDFK IRILPRKQTI SHPTVLAPTE DKLKNDEIKQ
481 HVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVVAL NDRIGSLPKE
541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKP DELKGVQYAV FGCGDHNWAS TYQRIPRYID
601 EQMAQKGATR FSKRGEADAS GDFEEQLEQW KQNMWSDAMK AFGLELNKNM EKERSTLSLQ
661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSDRSTRHIE VSLPEGATYK EGDHLGVLPV
721 NSEKNINRIL KRFGLNGKDQ VILSASGRSI NHIPLDSPVS LLALLSYSVE VQEAATRAQI
781 REMVTFTACP PHKKELEALL EEGVYHEQIL KKRISMLDLL EKYEACEIRF ERFLELLPAL
841 KPRYYSISSS PLVAHNRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP
901 QSNFELPKDP ETPIIMVGPG TGIAPFRGFL QARRVQKQKG MNLGQAHLYF GCRHPEKDYL
961 YRTELENDER DGLISLHTAF SRLEGHPKTY VQHLIKQDRI NLISLLDNGA HLYICGDGSK
1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR VQDEGRYGKD VWAGI
CYP102A6 Bradyrhizobium japonicum USDA 110
GenPept BAC48147
NC_004463 complete genome 3173438..3176674
NADPH-cytochrome P450 reductase/P450 fusion
54% to 102A2
1 MSSKNRLDPI PQPPTKPVVG NMLSLDSAAP VQHLTRLAKE LGPIFWLDMM GSPIVVVSGH
61 DLVDELSDEK RFDKTVRGAL RRVRAVGGDG LFTADTREPN WSKAHNILLQ PFGNRAMQSY
121 HPSMVDIAEQ LVQKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE
181 SLVRSLETIM MTRGLPFEQI WMQKRRKTLA EDVAFMNKMV DEIIAERRKS AEGIDDKKDM
241 LAAMMTGVDR STGEQLDDVN IRYQINTFLI AGHETTSGLL SYTLYALLKH PDILKKAYDE
301 VDRVFGPDVN AKPTYQQVTQ LTYITQILKE ALRLWPPAPA YGISPLADET IGGGKYKLRK
361 GTFITILVTA LHRDPSVWGP NPDAFDPENF SREAEAKRPI NAWKPFGNGQ RACIGRGFAM
421 HEAALALGMI LQRFKLIDHQ RYQMHLKETL TMKPEGFKIK VRPRADRERG AYGGPVAAVS
481 SAPRAPRQPT ARPGHNTPML VLYGSNLGTA EELATRMADL AEINGFAVHL GALDEYVGKL
541 PQEGGVLIIC ASYNGAPPDN ATQFVKWLGS DLPKDAFANV RYAVFGCGNS DWAATYQSVP
601 RFIDEQLSGH GARAVYPRGE GDARSDLDGQ FQKWFPAAAQ VATKEFGIDW NFTRTAEDDP
661 LYAIEPVAVT AVNTIVAQGG AVAMKVLVND ELQNKSGSNP SERSTRHIEV QLPSNITYRV
721 GDHLSVVPRN DPTLVDSVAR RFGFLPADQI RLQVAEGRRA QLPVGEAVSV GRLLSEFVEL
781 QQVATRKQIQ IMAEHTRCPV TKPKLLAFVG EEAEPAERYR TEILAMRKSV YDLLLEYPAC
841 ELPFHVYLEM LSLLAPRYYS ISSSPSVDPA RCSITVGVVE GPAASGRGVY KGICSNYLAN
901 RRASDAIYAT VRETKAGFRL PDDSSVPIIM IGPGTGLAPF RGFLQERAAR KAKGASLGPA
961 MLFFGCRHPD QDFLYADELK ALAASGVTEL FTAFSRADGP KTYVQHVLAA QKDKVWPLIE
1021 QGAIIYVCGD GGQMEPDVKA ALVAIRHEKS GSDTATAARW IEEMGATNRY VLDVWAGG
CYP102B1 Streptomyces coelicolor cosmid F43.
GenEMBL AL136502 CDS 10570..12153 gene="SCF43.12"
Highly similar to the N-terminal P450 domain of Bacillus
megaterium 41.9% identity in 497 aa overlap.
45% to 102A1 over 433 amino acids
cloned and expressed by David Lamb and Steve Kelly
CYP102B2 Streptomyces avermitilis
GenEMBL AP005050
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV7426
78% to 102B1 from Streptomyces coelicolor
CYP102C1 Rhodococcus sp. X309
GenEMBL AF059700.1 complement(3619-4584) runs off end of sequence
partial gene 48% to 102B1
CYP102D1 Streptomyces avermitilis
GenEMBL AP005023
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV575 47% to 102A3
40% to 102B1, 44% to 102C1 partial seq
CYP102E1 Ralstonia metallidurans
GenEMBL NZ_AAAI01000371
104500-107000 region
51% to 102D1
MSTATPAAALEPIPRDPGWPIFGNLFQITPGEVGQHLLARSRHHDGIFELDFAGKRVPFVS
SVALASELCDATRFRKIIGPPLSYLRDMAGDGLFTAHSDEPNWGCAHRILMPAFSQRAM
KAYFDVMLRVANRLVDKWDRQGPDADIAVADDMTRLTLDTIALAGFGYDFASFASDELDP
FVMAMVGALGEAMQKLTRLPIQDRFMGRAHRQAAEDIAYMRNLVDDVIRQRRVSPTSGMD
LLNLMLEARDPETDRRLDDANIRNQVITFLIAGHETTSGLLTFALYELLRNPGVLAQAY
AEVDTVLPGDALPVYADLARMPVLDRVLKETLRLWPTAPAFAVAPFDDVVLGGRYRLRKD
RRISVVLTALHRDPKVWANPERFDIDRFLPENEAKLPAHAYMPFGQGERACIGRQFALTE
AKLALALMLRNFAFQDPHDYQFRLKETLTIKPDQFVLRVRRRRPHERFV
TRQASQAVADAAQTDVRGHGQAMTVLCASSLGTARELAEQIHAGAIAAGFDAKLADLDDA
VGVLPTSGLVVVVAATYNGRAPDSARKFEAMLDADDASGYRANGMRLALLGCGNSQWATY
QAFPRRVFDFFITAGAVPLLPRGEADGNGDFDQAAERWLAQLWQALQADGAGTGGLGVDV
QVRSMAAIRAETLPAGTQAFTVLSNDELVGDPSGLWDFSIEAPRTSTRDIRLQLPPGITY
RTGDHIAVWPQNDAQLVSELCERLDLDPDAQATISAPHGMGRGLPIDQALPVRQLLTHFI
ELQDVVSRQTLRALAQATRCPFTKQSIEQLASDDAEHGYA
CYP102F1 Actinosynnema pretiosum subsp. auranticum
AF453501 complement(6501..9518)
maytansinoid antitumor agent ansamitocin biosynthetic gene cluster I
49% to 102A3
gene = asm30
MVATGTRIPGPKPLPLVGNLLDVLTSDLDTDVDFLDRCHREHGG
IVALTFAGQRQVFASSHELVARMCSDPSWGKAVHPALEQVRDFAGDGLFTARGDEPNW
GKAHRLLMPAFGPTAMRDHFPAMLDIAEQMLVRWRRFGPDHRIDVADDMTRLTLDTIA
LCAFGARFNSFYRDRAHPFVDAMVRSLVEAGERAERLPGVQPFLVGRNQRYRDDIATM
NRIADGIVAARAALPAGERPDDLLERMLTCADPVTGERLSARNVRYQLATFLIAGHET
TSGLLSFAVHRLLAHPEVLRKAKDAVDGVLGDRVPAFEDLARLDYLGQVLRETLRLHP
TAPAFALAPDEPAELGGHAIGAGEPVLVMLPTLHRDPAVWRDPDVFDPERFAPERMDE
IPACAWMPFGHGARACIGRPFALQEATLVLALVLQRFDLALADPDHRLTIKQTLTLKP
DSLVVRARPRADRPGATATVETVVPHQVPATHRHGTPLHVFYGSNGGSGEGLARTIAG
DGAARGWATSVAPLDDAVRALPASGPVVIVSSSYNGAPPDNAAHFVRWLTQDGPDLSG
VDYLVLGCGNLDWSATYQRVPTLIDEAMAAAGARRLRERGATDARADFFGDWERWYEP
LWPLLSAECGVEVGEIGPRFRVVESDAADGLGDLASAVVLENRELVRGPDAGSKRHLE
LRLPDGTSYRTGDYLSVLPQNHPDLVRRAVARLGTRAERVVTVESSAPTGLVPVGRAL
RVDELLTRCVDLSAPAGAGVVARLAERCPCPPERAELAATTGATLLELLERFPSCAVD
LALALELLPAPRTRLYSISSAAEEQRAEVALTVSVTGVTSGYLSRVRPGDRVAVGIAS
PPESFRPPADNTVPVVLIAAGTGIAPFRGFLRARAALGGEPGPALLLFGCRGPELDDL
YAEEFAALGDWLEVDRAYSRHPDGEVRHVQHRLWQRRDRVRELVDAGARVYLCGDATR
VGPAVEEVLGRIGPGAGWLDALRAGGRYATDVF
103 Family
CYP103A1 Agrobacterium tumefaciens
GenEMBL M19352, AF242881 CDS 141158.142426
gene="virH1"
CYP103A2 Agrobacterium tumefaciens
GenEMBL AF034769
GenEMBL AB016260 CDS 124584..125759
CYP103A3 Agrobacterium tumefaciens plasmid pTiAB2/73 vir region
GenEMBL AF329849 892..2148
gene = virH
61% TO 103A1
MNARGPEKVSQTSGPIISASLDPDNVSVSDLDRSGHAIFAEWRP
KRPFLRRQDGVYVLLRADDVLGLSSDPRTRQIETELMLNRGINEGAVFDFVRYSMLFS
NNEVHSRRRSPFTRTFAFRMIENLRPQVSQLTETLFQDLKELDSFNFVEEFASKLPAV
AIAGLLGLPPSDIPYFTQLVYRVARCLSPSWRDADLPDIEASAAEFKNYVQAVIDDRR
SNPRDDFLSSFIRATREAEDLSPDEGLAQLMLIVLAGTDTTKTGLTALTGQLLRHRHV
WEALLKDESLVPAAVEEGLRFEPPVGSYPRLALADIDLEGFILPKGSLLALCTMSALR
DEKHFAHPELFDIHRKQMHWHMVFGAGAHRCLGEALARLELQEGLATVLRYAPTLSIE
GEWPTVQGHGGVRRIAEMRVGFRRQI
104 Family
CYP104A1 Agrobacterium tumefaciens
GenEMBL M19352, AF242881 CDS 142447..143670
gene="virH2"
CYP104A2 Agrobacterium tumefaciens
GenEMBL AB016260
103A2 CDS 124584..125759 and
104A2 CDS 125919..127094 83% to 104A1
105A Subfamily
CYP105A1 Streptomyces griseolus
GenEMBL M36480 (1629bp) Y18556 CDS 2447..3703
Omer,C.A., Lenstra,R., Litle,P.J., Dean,C.R., Tepperman,J.M.,
Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
Genes for two herbicide-inducible cytochromes P-450 from
Streptomyces griseolus
J. Bacteriol. 172, 3335-3345 (1990)
Gene suaC
CYP105A2 Amycolata autotrophica
GenEMBL D26543 (1197bp)
Kawauchi,H., Sasaki,J., Adachi,T., Hanada,K., Beppu,T. and
Horinouchi,S.
Cloning and nucleotide sequence of a bacterial cytochrome P-450
VD25
gene encoding vitamin D-3 25-hydroxylase
Biochim. Biophys. Acta 1219, 179-183 (1994)
CYP105A3 Streptomyces carbophilus
GenEMBL D30815 PIR JC4287
Watanabe,I., Nara,F. and Serizawa,N.
Cloning, characterization and expression of the gene encoding
cytochrome P-450sca-2 from Streptomyces carbophilus involved in
production of pravastatin, a specific HMG-CoA reductase inhibitor
Gene 163 (1), 81-85 (1995)
105B Subfamily
CYP105B1 Streptomyces griseolus
GenEMBL M36481 (1688bp) M32239
Omer,C.A., Lenstra,R., Litle,P.J., Dean,C.R., Tepperman,J.M.,
Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
Genes for two herbicide-inducible cytochromes P-450 from
Streptomyces griseolus
J. Bacteriol. 172, 3335-3345 (1990)
Gene subC, SU-2
CYP105B2 Streptomyces tubercidicus strain R-922
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Cyp229
78% to 105B1
105C Subfamily
CYP105C1 Streptomyces sp.
GenEMBL M31939 PIR S19629 (381 amino acids)
Horii, M., Ishizaki, T., Paik, S.Y., Manome, T. and Murooka, Y.
An operon containing the genes for cholesterol oxidase and a
cytochrome P-450-like protein from a Streptomyces sp.
J. Bacteriol. 172, 3644-3653 (1990)
Gene choP
105D Subfamily
CYP105D1 Streptomyces griseus
GenEMBL S45823 X63601 (1700bp) PIR S24750 (412 amino acids)
Trower,M.K., Lenstra,R., Omer.C., Buchholz,S.E., and
Sariaslani,F.S.
Cloning, nucleotide sequence determination and expression
of the genes encoding cytochrome P-450soy (soyC) and
ferredoxinsoy (soyB) from streptomyces griseus.
Mol. Microbiol. 6, 2125-2134 (1992)
PIR S35901 (412 amino acids)
Erratum. Cloning, nucleotide sequence determination and
expression of the genes encoding cytochrome P-450(soy)
(soyC) and ferredoxin(soy) (soyB) from Streptomyces griseus.
Mol. Microbiol. 7, 1024-1025 (1993)
CYP105D2 Streptomyces griseus
GenEMBL AF071145
84% identical to 105D1
CYP105D3 Streptomyces sclerotialus
GenEMBL AF071149
68% identical to 105D1
CYP105D4 Streptomyces lividans
GenEMBL AF072709 CDS complement(1593..2813)
69% to 105D1 67% to 105D2 82% to 105D3 57% to 105A1
CYP105D5 Streptomyces coelicolor
3StF60 [Full Sequence] Sanger cosmid
CDS comp(2106-3344) 98% identical to CYP105D4
cloned and expressed by David Lamb and Steve Kelly
CYP105D6 Streptomyces avermitilis
GenEMBL AB070949.1 69121-70371
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV412_pteD 55% to 105D1 from Streptomyces griseus,
53% to 105D4, 54% to 105D5 (if first 17aa left off 105D5)
Gene = pteD
CYP105D7 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV7469 73% to 105D4 from Streptomyces lividans
CYP105D8 Streptomyces tubercidicus strain I-1529
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Cyp233
68% to 105D7
CYP105D9 Streptomyces sp. JP95
GenEMBL AF509565 11774..13024
griseorhodin biosynthesis gene cluster
55% to 105D6
gene = grhO3
MTDTLDEPQTLADGAEDAPAYPVKRTCPYRMPPGYEELREKGPI
SRVTLWNGRTAWLVTGNDLGRRLFPDARLSSDVLDPRFPLLAPRIEAQRQQAAAPPLV
GVDDPVHARQRRMVLPSFGIRQINALRPEIQKYADDLLDTMLAKGPGVTVDLLTEYAL
PMPSAVICMLLGVPYEDHHYFDERSRHVLSSSGEEQAAQAQQAFTEILAYLDDLIVRK
QAEPGDTLLDELIARQLEEGKVDRQELAMIATVLLVSGHETTSNMIALSTMALLADPD
QLAALRADESLMPRAVDELMRFSSIGDMLMRVAKEDIEIEGHLIRAGDGVILSTMLMN
RDPGAFERPDELDIRRPAGRHVAFGYGIHQCIGQNLARAEMEIALATLFRRVPTLKLA
VPAEQVPVNAPFVLQGVSELPVTW
105E Subfamily
CYP105E1 Rhodococcus fascians
GenEMBL Z29635 (7139bp) PIR S42052 (399 amino acids)
Crespi,M., Vereecke,D.M., Temmerman,W.G., Van Montagu,M.
and Desomer,J.
The fas operon of Rhodococcus fascians encodes new genes required
for efficient fasciation of host plants.
J. Bact. 176, 2492-2501 (1994)
MAGTADLPLEMRRNGLNPTEELAQVRDRDGVIPVGELYGAPAFL
VCRYEDVRRIFADSNRFSNAHTPMFAIPSGGDVIEDELAAMRAGNLIGLDPPDHTRLR
HILAAEFSVHRLSRLQPRIAEIVDSALDGLEQAGQPADLMDRYALPVSLLVLCELLGV
PYADRDELRDRTARLLDLSASAEQRAVAQREDRRYMATLVTRAQEQPGDDLLGILARK
IGDNLSTDELISIISLIMLGGHETTASMIGLSVLALLHHPEQAAMMIEDPNCVNSGIE
ELLRWLSVAHSQPPRMAVTEVQIAGVTIPAGSFVIPSLLAANRDSNLTDRPDDLDITR
GVAGHLAFGHGVHFCLGHSLARMTLRTAVPAVLRRFPDLALSPSHDVRLRSASIVLGL
EELQLTW
CYP105F1 Streptomyces lavendulae
GenEMBL AF127374 CDS 2006..3229
48% to 105C1 42% to 105B1 40% to 105D1 new subfamily in 105
CYP105F2 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
85% to 105F1
clone name SP8812
CYP105G1 Amycolatopsis mediterranei
GenEMBL AF040571 CDS complement(5011..6066)
49% to 105C1, 105B1 new subfamily in 105
looks like an insertion in the seq from 80-120
CYP105H1 Streptomyces noursei ATCC 11455 nyst
GenEMBL AF263912 CDS comp (58637..59833)
gene="nysN" 47% to 105B1 46% to 105A1 46% to 105D1
function="presumably involved in modification of the
nystatin macrolactone ring"
CYP105H2 Streptomyces albus
GenEMBL AF071143
77% to 105H1
LLIAGHETTANNIGLGVVTLLSHPQWAGDERAVEELLRLHSVAD
MVALRVAVDDVEIAGQVIRKGEGIVPLLAAANHDTEVFGCPHAFDPERSERRHVAFGY
GVHQCLGQNL
CYP105H3 Streptomyces natalensis
GenEMBL AJ278573 52789..53985
pimaricin biosynthetic gene cluster.
68% to 105H1
gene = pimG
MTYTDPAAPETDPPAVDFPQRKPGVPFPPPDYADYRDRKGLVLS
QLSDGKRVWLVTRHEDVRAVLTSPSISSNPEHKGFPNVGNLGVPKQDQIPGWFVGMDS
PEHDRFRKALIPEFTVRRVRAMKPAIERTVDAQLDAMLAAGNTADLVADFALPIPSLV
ISALLGVPPADREFFESRTRVLVSLRSSTDDDRMAAAKDLLRYINRLVEIKQKWGGDD
LITRLLATGAIAPHEMSGVLMLLLIAGHETTANNIALGVVTLLANPQWIGDDRAVEET
LRFHSVADLVSLRVAVQDVEIAGQLIKAGEGIVPLVAAANHDENAFECPHAFDPSRSA
RHHVAFGYGVHQCLGQNLVRIEMEVAYRKLFERIPNLELAVPTDGLDIKYDGVLYGLN
ELPVRW
CYP105H4 Streptomyces nodosus
GenEMBL AF357202 complement(62051..63250)
amphotericin biosynthetic gene cluster
84% to 105H1
MTAETEMTTFAPGCPVAFPLRRPGRPFPPPEYADYRAGEGLVRS
ELPASGPVWLVTRHEDVRTVLTDPRISADPSRPGFPRARRTGGAPSQSEIPGWFVALD
PPEHDRFRKTLIPEFTVRKVRELRPAIQQIVDERIDALLAAGNSADLIADFALSVPSL
VISDLLGVPKADRDFFEAKTKVLVTLSSTDEQRDEASKALLRYLNRLIQIKGRRPGED
LISRLLQAGTMNRQELSGVSMLLLIAGHETTANNIGLGVVQLLTNPQWIGDDRIVEEM
LRYYSVADLVSFRVAVEDVEIGGQLIKAGEGIVPLIAAANHDGSVFDKPEEFNPERSA
RSHVAFGYGVHQCLGQNLVRVEMEIAYRTLFERIPTLELAVPVEELPLKYDGVLFGLH
ELPVTWS
CYP105H5 Streptomyces griseus
GenEMBL AJ300302 10678..11859
Gene = canC
72% to 105H3
MTTSPGPTVVDFPRRTPREPLPLSQYAEHRKQNGLVQTHLPNGR
PIWLVTRHEDVRAVLTHPRISANPDNEGFPNVGETMGVPKQEQIPGWFVGLDSPEHDR
FRKVLIPEFTVRRVRELRPAIERTVDERIDAMLAGGNTADLVNDFALPVPSLVISALL
GVPSADRDFFESRTRTLVAIRTSTDEERAEATRQLLRYINRLIVIKKKWRGEDLISRL
LSTGKLSDEELSGVLLLLLIAGHETTANNIGLGVVTLLSHREWIGDDRLVEELLRLHS
VADMVALRVAVDDVEIAGQTIRKGEGIVPLLASANHDTEAFGCPHAFNPERTERRHVA
FGYGVHQCLGQNLVRVEMEIAYRKLFERIPELRLAVPEDQLAYKYDGILFGLHELPVR
W
CYP105J1 Amycolatopsis mediterranei rifamycin
GenEMBL AF040570 CDS comp (67462..68673)
52% to AF072709 105D4 50% to 105D1 new subfamily in 105
CYP105K1 Streptomyces tendae strain Tue901
GenEMBL Y18574 CDS 6325..7557
45% to 105A3 46% to 105D1 43% to 105B1 new subfamily in 105
gene="nikF"
CYP105K2 Streptomyces ansochromogenes
GenEMBL AF469953 14..1246
95% to 105K1
note="involved in nikkomycin biosynthesis
MTEAFDHDIPSFPMARECPMHPPAEYRELRGQEPVSRVRMPDGQ
VAWLVLKHALARKLLADPRVSADRLHPAFPGRLTAEQRAATERVRRLTTRRSMIHLDG
DEHGAHRRILTGEFSLRRIAAQRPRVQEIVDRSIDEMLAAPQPADLVEHVSQAVPSLV
ICELLGVPHEQRRDFHEWAGMLVSRSVSIQERAAASDALNDFLEALVTEKERGEPADD
LIGRLIARNRQTPVMTHDEIVGTAVMLLVAGHQTTANMISLGVVALLENPEHKARIAA
DSSLLPPAIEEMLRYFSVVENAPARVATEDIAIGGVTIRKNEGIVVSGLAADWDDEVF
GHPDRLDFERGARHHVAFGYGVHQCLGQNLARVELEIVFETLLRRVPGLSLAVPAEEL
PYKDDAGIYGIYRVPVNC
CYP105L1 Streptomyces fradiae
GenEMBL AF055922 CDS comp (6507..7769)
GenEMBL AF147703 complement(2565..3875)
Fouces,R., Mellado,E., Diez,B. and Barredo,J.L.
The left edge of the tylosin gene cluster from Streptomyces
fradiae
Microbiology (1999) In press
tylH1
46% to 105A1 42% to 105D1 43% to 105B1 new subfamily in 105
MSSSGDARPSQKGILLPAARANDTDEAAGRRSIAWPVARTCPFS
PPEQYAALRAEEPIARAELWDGAPVWLISRQDHVRALLADPRVSIHPAKLPRLSPSDG
EAEASRSLLTLDPPDHGALRGHFIPEFGLRRVRDVRPSVEQIVTGLLDDLTARGDEAD
LLADFALPMATQVICRLLDIPYEDRDYFQERTEQATRPAAGEEALEALLELRDYLDRL
ISGKTGRESGDGMLGSMVAQARGGGLSHADVLDNAVLLLAAGHETTASMVTMSVLVLL
QHPTAWRELTVNPGLLPGAVDELLRYLSIADGLRRSATADIEIDGHTIRAGDGLVFLL
AAANRDEAVFSEPEAFDIHRSARRHVAFGYGPHQCLGQNLARMELEVALGAVLERLPA
LRPTTDVAGLRLKSDSAVFGVYELPVAW
CYP105L2 Micromonospora griseorubida
GenEMBL AB089954 1490..2641
gene cluster for the polyketide macrolide mycinamicin
54% to 105L1
gene = mycCI
MDRTCAWALPEQYAEFRQRATGWPAKVWDGSPTWLVSRYEHVRA
LLVDPRVTVDPTRQPRLSEADGDGDGFRSMLMLDPPEHTRLRRMFISAFSVRQVETMR
PEIEKIVDGILDRLLALEPPVDILTHLALPMSTQVICHLLGVPYEDREFFQERSELAS
RPNDDRSMPALIELVEYLDGLVRTKTAHPDTGLLGTAVTERLLKGEITHQELVNNAVL
LLAAGHETSANQVTLSVLTLLRHPETAAELREQPELMPNAVDELLRYHSIADGLRRAA
TADIVLGDHTIRAGDGLIILLSSANHDGNTFGAEATFDIHRPARHHVAFGYGPHQCLG
QNLARLEMEVTLGKLFRRVPALRLAQEPDALRVRQGSPIFGIDELLVEW
CYP105M1 Streptomyces clavuligerus clavulanic
GenEMBL AF200819 CDS 136..1359
GenEMBL AY034175 CDS 200..1423
GenEMBL U87786 CDS 13810..15036
function="involved in clavulanic acid biosynthesis"
48% to 105B1 42% to 105A1 41% to 105D1 new subfamily in 105
MNEAAPQSDQVAPAYPMHRVCPVDPPPQLAGLRSQKAASRVTLW
DGSQVWLVTSHAGARAVLGDRRFTAVTSAPGFPMLTRTSQLVRANPESASFIRMDDPQ
HSRLRSMLTRDFLARRAEALRPAVRELLDEILGGLVKGERPVDLVAGLTIPVPSRVIT
LLFGAGDDRREFIEDRSAVLIDRGYTPEQVAKARDELDGYLRELVEERIENPGTDLIS
RLVIDQVRPGHLRVEEMVPMCRLLLVAGHGTTTSQASLSLLSLLTDPELAGRLTEDPA
LLPKAVEELLRFHSIVQNGLARAAVEDVQLDDVLIRAGEGVVLSLSAGNRDETVFPDP
DRVDVDRDARRHLAFGHGMHQCLGQWLARVELEEILAAVLRWMPGARLAVPFEELDFR
HEVSSYGLGALPVTW
CYP105N1 Streptomyces coelicolor
St4C2 [Full Sequence] Sanger cosmid
CDS 29986-31221 45% to 105A1 new subfamily in 105
cloned and expressed by David Lamb and Steve Kelly
CYP105N2 Streptomyces glaucescens cytochrome P450
GenEMBL AF071144
95% to 105N1 only 5 aa diffs
57% to AF071148 56% to AF071146 59% to 105D3 54% to 105A3
LLIAGHETTTSMIALSTLLLLDRPELPAELRNDPDLMPAAVDEL
LRVLSVADSIPLRVAAEDIELSGRTVPADDGVIALLAGANHDPEQFDDPERVDFHRTD
NHHVAFGYGMHQCLGQNL
CYP107N3 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
91% to 107N1
clone name SP0881
CYP105P1 Streptomyces avermitilis
GenEMBL AB070949.1 67376-68575
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV413_pteC low 40% range to 105 subfamilies
Gene = pteC
CYP105P2 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
92% to 105P1
clone name SP7863
CYP105Q1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV1611 49% to 105B1 from Streptomyces griseolus
46% to 105D4 and D5
CYP105Q2 Streptomyces sp.
GenEMBL BD133549
78% to CYP105Q1
3 LIAGHETTATMISLGTLALFEHPDQLAVLRDTEDPKVVAGAVDELLRYLSIVHSGLRRVA 182
183 KGDIEIDGRLIRKGDGLLFDLQTANWDPNAFPGAERLDLARPARQHNAFGFGTHQC 350
CYP105Q3 Streptomyces sp.
GenEMBL BD133546
77% to 105Q1
139 MADTLTDAAPDTDGRVPEYPMPRATGCPLAPSPAAAELRGDRPITRVRIWNGSTPWLITR 318
319 HADQRTLLTDPRVSNDDHEPDFPHVNAHRAAIAPHTPKLITNTDAPEHTRLRRSVNAPFL 498
499 VKRIEAMRPAVQKIVDDLIDDMLAGPSPADLLTALALPVPSLVIAELLGVPYEDHHFFQE 678
679 NSNRVLDNSLTAEEAQESSRALGGYLDTLFRTKLEQPGEDVLSEMGSKVKAGEMTHQEAV 858
859 SMGVAMLIAGHETTATMISLGTLALFEHPDQLAVLRDTEDPKVVAGAVDELLRYLSIVHS 1038
1039 GLRRVAKGDIEIDGRLIRKGDGLLFDLQTANWDPNAFPGAERLDLARPARQHNAFGYGPH 1218
1219 QCLGQNLARLELQVVYGTLYRRVLTLRPAVPVDQLAFNHTGTTYGVKCLPVTW 1377
CYP105R1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV7186
CYP105S1 Streptomyces tubercidicus strain R-922
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Cyp230
56% to CYP105S2
CYP105S2 Streptomyces tubercidicus strain I-1529
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Cyp234
56% to CYP105S1
CYP105T1 Burkholderia fungorum
GenEMBL NZ_AAAJ02000095
8366..9610 gene = Bcep2217
44% to 105H1
MRKTMTSAINDVRPQTTSTFPFARTGSPLHPPAEYARYRDGQPV
TRVQMWDGRYAWIFTRMEDVKAVLSSPHFSVVPSKPGYPFLTPARAATVKSYQTFITM
DPPDHTRFRRMLTRDFTQKRMEELRPQIAAYVNRLIDEMLARGSPGDLVSALALKLPV
TVVSMLVGVPYEDHEDLVKWSGQRLDLEQNPTVSESAADNMLAYFDGLLQRKERDPGD
GADMLSRLVIEQIKPGHLSRLEAIHMVNLLYFAGHETTANQIALGTLSFLLDPRQRAL
LENNPGLLKNAIEEMLRFHTISHYNSCRVATADVEVGGTLIREGEGAYALIMAANRDP
AAFPAPDRFDIERPNSQEHVAFSYGLHMCLGQPLARLELQVCFEALFRRLPRLRLAVP
LEELPFKREMYVYGLHALPVTW
CYP105U1 Streptomyces hygroscopicus strain NRRL 3602
AY179507 complement(63940..65133)
Geldanamycin biosynthesis gene cluster
50% to 105B1 52% to 105B2 not 105S
gene = gdmP
MDEIRDYPESRAAACPFSPPLGYEELRERSAVTRVRMWDGSTPF
LVTGYHEARAALGDSRFSADGTHKAMPRFVKFEVPAEVFNLGRMDDPEHARIRRMLTA
NFTIRRTEAMRPMIQGIVDGLLDRLIAQGPPADLVADFAFPLPSQVIGVMLGVSDADF
AEFQQASQGVMDFTASAEEMGAALGVMVDYVARMCAAKRADPGDDLLSRLIVDQELTG
GLTQQQVVATALVLLLAGHETTANMIALSTVLLLSHPEQLARLRADAGLMGNAVDELL
RYITIVQEGTGRVATEDVEVGGVLIPGGEGVIINLPSANRDPHFADAHELDLSRPNAR
EHVAFGFGVHQCLGQTLARVELQIALETLLRRLPTLRLEVPFDDLAFLYESMNFGVAR
VPVAW
CYP105V1 Streptomyces sp. HK803
GenEMBL AY354515 36297..37508
Gene = plmT4
43% to CYP105Q1
MSQLSSELPAFPMSKAKGCPLDPPPEYAQLRSDRPVAKARLWDG
KEVWLITGYDEIRSIFTDPRISVDNTQPGYPWLSEQARTVVLTGGVKPVGRMDPPEHT
AMRRMLGQGFLVKKIQNMRGDVEALVNELIDDILAGPRPTDLVPSLAMPVPSTALGWV
LGVPPADKRLISLVPRLFDEDSGLEGAMEARAELFAYIDELITHRENQPGDDIISHLV
GYYQKGELSRVSVLTQSVTLIAAALDTTRSMITNGILALLQHPEQAAALIEDPDLVPA
AVEELLRYTVVTEFSSKRVAAADIEIAGETIKAGDGIICLISAGNRDEKVFTDPDTLD
VRRDAKQHLGFGAGIHTCIGKQLARMELEVVYGTLFRRIPELRLAVPFDQLVFRNTFD
VQGVRALPVTW
CYP105W1 Micromonospora echinospora
GenEMBL AF497482 84045..85229
Gene = calE10
calicheamicin biosynthetic locus
45% to CYP105K1 47% to 105D4
MPRRCPFGPPAEYARLRTERPVARLPMLGGNTAWVVSRYADVKR
VLSDPRMSADRRRAGFPRFAPTTESQRQASFANFRPPLNWMDPPEHTAARRQIVDEFA
ARRVRQLRPLVERVVDEHLDAMTAGRSSADLVPSFSYPVPSRVICEMLGVPYGEHAFF
ERRSTRMLSRGVPADERARCAREIREFLDGVVTDKERHPGDDVLSRLLAAQRAAGEPD
HEAVVSMAFVLLVAGHVTTSNMISLSVLALLTHPERLARLRAEPDRFPAAVEELLRYF
TIVEAATARTATADVTVGGVTIRAGEGVVALGQAANRDPAAFDRPDEFDPDRDARHHL
AFGYGRHICPGQHLARLELDVALSRLVRRLPGLRLTVDVDDLPLKEDGNIFGLHALPVAW
CYP105X1 Pseudonocardia autotrophica same as Amycolata autotrophica
GenEMBL AF525299 2766..3974
Gene = pauC
P-450 gene cluster
49% to 105A3
MAEDTLGQDFPMQRQCPFEPPKEYERLRAEQPISRVRMPDGTPA
WLVTLHEDVRTVLASPAFSSDLAHPGMPAVNPEIRTIARQQRPPFSRMDPPEHSFFRR
MLIPEFTVKRTKTLRAGIQSVVDGLIDDLLRKSPPVDLVDEFALPVPSLVICQLLGVP
YSRHEFFQQQARVILSRQSTREQVGAAFTALRAYLDTLVEEKLHTPGDDLTSRLATEH
LEPTGDVRRQDLVASCMLLLTAGHETTSHMISLGVTALLEHPDQLAALQNDLTLLPEA
VEELVRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDI
HRGNRRHACFGYGVHQCIGQHLARTELEVAFSTLFTRIPTLQIAAPSDELDYDHDGML
FGLHELPVTW
CYP105X2 Amycolata autotrophica same as Pseudonocardia autotrophica
GenEMBL AF071148
99% to 105X3 94% to 105X1 61% to 165B2
LLIAGHETTSHMISLGVTALLERPDQLAALQNDLTLLPEAVEEL
LRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDIHRGN
RRHVAFGYGVHQCLGQNL
CYP105X3 Micromonospora inyoensis
GenEMBL AF071146
99% to 105X2 61% to 165B2 60% to 105A3
LLIAGHETTSHMISLGVTALLEHPDQLAALQNDLTLLPEAVEEL
LRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDIHRGN
RRHVAFGYGVHQCLGQNL
106 Family
CYP106A1 Bacillus megaterium
GenEMBL X16610
Gene BM-1
CYP106A2 Bacillus megaterium
GenEMBL Z21972 (4317bp) PIR S32216 (410 amino acids)
PIR S39924 (410 amino acids) Swiss Q06069 (410 amino acids)
Rauschenbach,R., Isernhagen,M., Noeske-Jungblut,C., Boidol,W.
and Siewert,G.
Cloning, sequencing and expression of the genes for cytochrome
P450meg, the steroid-15beta-monooxygenase from Bacillus
megaterium ATCC 13368.
Molec. Gen. Genet. 241, 170-176 (1993)
CYP106B1 Bacillus anthracis str. Ames
Genpept AAP26480
47% to 106A2 47% to 109B1
1 MASPENVILV HEISKLKTKE ELWNPYEWYQ FMRDNHPVHY DDEQDVWNVF LYDDVNRVLS
61 DYSLFSSRRE RRQFAIPPLE TRININSTDP PEHRNVRSIV SKAFTPRSLE QWKPRIQSIA
121 NELVKDIENC SEVDIVEQFA APLPVTVISD LLGVPTTDRK KIKAWSDILF MPYSKEKFND
181 LDAEKGIALN EFKAYLLPIV QEKRYHLTDD IISDLIRAEY EGERLTDEEI VTFSLGLLAA
241 GNETTTNLII NSFYCFLVDS PATYKEVREK PKLISKAVEE VLRYRFPVTL ARRITEDTNI
301 FGPLMKKDQM VVAWVSAANL DEKKFSQASK FNIHRIGNEK HLTFGKGPHF CLGAPLARLE
361 AEIALTTFIN AFEKIALSPS FNIEQCILEN EQTLKFLPIR LKPQ
CYP106B2P Bacillus cereus ATCC 14579
GenPept AAP09572 GenEMBL AE017006
83% to 106B1 54% to CYP109B1 YjiB Z99110 Bacillus subtilis I -helix
1 MTSVITDGEI VTFSLGLLAA GNETTTNLII NSFYCFLVDS PGIYEELRKE PNLILKAIEE
61 VLRYRFPVTL TRRITALSER ESPSPLGMG
CYP106B3P Bacillus cereus ATCC 14579
GenPept AAP09575 GenEMBL AE017006
87% to 106B1 54% to 106A2 C-term fragment
LKEDTNIFGPF
1 MKKNQMIVAW VSAANLDEKK FSQASQFNVH RTGNEKHLTF GKGPHFCLGA PLARLEAEIA
61 LTTFINAFEK IELFPSFCLE KCILENEQTL KYLPIRLKAT
107A Subfamily
CYP107A1 Saccharopolyspora erythraea
GenEMBL X60379 Swiss Q00441 (406 amino acids)
Haydock S.F., Dowson J.A., Dhillon N., Roberts G.A., Cortes J.,
Leadlay P.F.
Cloning and sequence analysis of genes involved in erythromycin
biosynthesis in Saccharopolyspora erythraea: sequence similarities
between eryG and a family of S-adenosylmethionine-dependent
methyltransferases.
Mol. Gen. Genet. 230, 120-128 (1991).
Weber J.M., Leung J.O., Swanson S.J., Idler K.B., Mcalpine J.B.
An erythromycin derivative produced by targeted gene disruption in
Saccharopolyspora erythraea.
Science 252, 114-117 (1991)
CYP107A2 Streptomyces rochei plasmid pSLA2-L
NC_004808 complement(44847..46067)
64% to 107A1
note="ORF26 (406 aa), lankamycin biosynthesis protein
similar to M54983-1 Saccharopolyspora erythraea
6-deoxyerythronolide B hydroxylase, EryF CYP107A1
MTTDAHTAVPSLDSDLFHIDQYEAYAALREREPVSKVSFIGREA
FLITRHAEAKAALGDLRLSNDFKKQPPGVELPTYHGIPEDVRPYFANNMGSNDPPAHT
RLRRLVSREFTARRVESMRTRVAQLAEHLLDGLAGERETDLVERFAYPLPITVISELL
GVEERYQGDFGRWSNEFLVIDADRVEQREHAARALVGFILELVDRRRADPGSDLLSAL
IHVHDEDEDRLSTDELASVVLILLIAGFETSVSLIAMATYLLLTHPGELAKVRADPSL
VPNAVDEVLRFLGPAEITTRGTLEPVEIGGVHIPAHSTVLIAGAAANRDPRRFPDPER
FDVTRDTGGHLSFGHGIHFCVGGPLARLEGEIALRALLNRFPGLDLAIPAEQVRWRRS
FLRGIESLPVRLGR
107B Subfamily
CYP107B1 Saccharopolyspora erythraea
GenEMBL M83110 Swiss P33271 (405 amino acids) PIR B42606 (405
amino acids)
Andersen J.F., Hutchinson C.R.
Characterization of Saccharopolyspora erythraea cytochrome P-450
genes
and enzymes, including 6-deoxyerythronolide B hydroxylase.
J. Bacteriol. 174, 725-735 (1992)
CYP107B2 Streptomyces sp.
GenEMBL BD133548
58% to 107B1
3 LIAGHETTTNFIGNAALALLRHPESLAHLRAEPQLLGGALDELLRYDSPVGIATFRFSTE 182
183 ALTLGGTEIPEGVPVLIAPGAANRDPDRFPDPDRLDLTRGATGHLAFGFGMHHC 344
107C Subfamily
CYP107C1 Streptomyces thermotolerans
GenEMBL D30759 (3267bp complete sequence of CarA)
Arisawa,A., Kawamura,N., Takeda,K., Tsunekawa,N.,
Okamura,K. and Okamoto,R.
Cloning of a macrolide antibiotic biosynthesis gene acyA, which
encodes 3-O-acyltransferase, from Streptomyces thermotolerans and
its use for direct fermentative production of a hybrid macrolide.
Appl. Environ. Microbiol. 60, 2657-2660 (1994)
Arisawa,A., Tsunekawa,N., Okamura,K. and Okamoto,R.
Nucleotide sequence analysis of carbomycin biosynthetic genes
including macrolide antibiotics 3-O-acyltransferase gene from
Streptomyces thermotolerans.
unpublished (1994)
CYP107C1 Streptomyces thermotolerans
GenEMBL M80346 (2393bp C-terminal fragment of CarA)
Schoner,B.E., Geistlich,M., Rosteck,P., Rao.R.N., Seno,E.,
Reynolds,P., Cox,K., Burgett,S. and Hershberger,C.L.
Sequence similarity between macrolide resistance determinants and
ATP binding transport proteins.
Gene 115, 93-96 (1992)
Note: P450 fragment called carX. is equivalent to C-terminal of CarA.
107D Subfamily
CYP107D1 Streptomyces antibioticus
GenEMBL L37200 (1400bp)
Rodriguez,A.M., Olano,C., Mendez,C., Hutchinson,C.R. and
Salas,J.A.
A cytochrome P450-like gene possibly involved in oleandomycin
biosynthesis by Streptomycese antibioticus.
unpublished (1994)
107E Subfamily
CYP107E1 Micromosospora griseorubida
GenEMBL D16098 (2168bp)
Inouye,M., Takada,Y., Muto,N., Horinouchi,S. and Beppu,T.
Cloning and nucleotide sequences of a gene governing mycinamicinIV
hydroxylation.
unpublished (1993)
107F Subfamily
CYP107F1 Streptomyces griseus
GenEMBL D45916 (2787bp) AB018074 CDS 341-1561
Ueda,K. and Horinouchi,S.
Cloning and Nucleotide Sequence of a Gene Involved in Redbrown
Pigment Biosynthesis in S. griseus
Unpublished (1995)
CYP107F2 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV1171 55% to 107F1
this subfamily is on the outskirts of CYP107
107G Subfamily
CYP107G1 Streptomyces hygroscopicus
GenEMBL X86780 (107379bp)
complement (91764-92978)
rapN
107H Subfamily
CYP107H1 Bacillus subtilis
GenEMBL U51868 (10153bp) Z99119, AF008220
coding region 7164-8351
pimelic acid biosynthesis
gene name bioI
107J Subfamily
CYP107J1 Bacillus subtilis
GenEMBL Y11043 U93876, Z99117
Belitsky, B. R., M. C. Gustafsson, A. L. Sonenshein, and C. Von
Wachenfeldt.
An lrp-like gene of Bacillus subtilis involved in
branched-chain amino acid transport. J Bacteriol. 179, 5448-57
(1997).
gene name cypA 42.6% identical to 107B1
also called yrdE
CYP107J2 Bacillus anthracis str. Ames
GenPept AAP26475
58% to 107J1 cypA of Bacillus subtilis
1 MAMKNKVGIR IEDGINLASA QFKEDAYEIY KESRKVQPVL FVNKTELGAE WLITRYEDAL
61 PLLKDNRLKK DPANVFSQDT LNVFLTVDNS DYLTTHMLNS DPPNHNRLRS LVQKVFTPKM
121 IAQLEGRIQD IADDLLNEVE RKGSLNLVDD YSFPLPIIVI SEMLGIPKED QAKFRIWSHA
181 VIAYPETPEE IKETEKQLSE FITYLQYLVD MKRKEPKEDL VSALILAESE GHKLSARELY
241 SMIMLLIVAG HETTVNLITN TVLALLENPN QLQLLKENPK LIDAAIEEGL RYYSPVEVTT
301 SRWADEPFQI HDQTIEKGDM VVIALAAANR DETVFENPEV FDITRENNRH IAFGHGSHFC
361 LGAPLARLEA KIAITTLFER MPELQIKGNR EDIKWQGNYL MRSLEELPLT F
CYP107J3 Bacillus cereus ATCC 14579
GenPept AAP09568
59% to 107J1 cypA Y11043 Bacillus subtilis
1 MKNKVGLSIE DGINLASAQF KEDAYEIYKE SRKKQPILFV NQVEIGKEWL ITRYEDALPL
61 LKDNRLKKDW TNVFSQDIKN MYLSVDNSDH LTTHMLNSDP PNHSRLRSLV QKAFTPKMIA
121 QLDGRIQRIA DDLISDIERK GTLNLVDDYS FPLPIIVISE MLGIPKEDQA KFRIWSHAVI
181 ASPETPEEIK ETEKQLSEFI TYLQYLVDIK RKEPKEDLVS ALILAESEGH KLSARELYSM
241 IMLLIVAGHE TTVNLITNTV LALLENPNQL QLLKDNPKLI DSAIEEGLRY YSPVEVTTAR
301 WAAEPFQIHH QTIQKGDMVI IALASANRDE TVFENPEIFD ITRENNRHIA FGHGSHFCLG
361 APLARLEAKI AITTLFNRMP ELQIKGNREE IKWQGNYLMR SLEELPLTF
CYP107J4P Bacillus cereus ATCC 14579
GenPept AAP09593
46% to CYP107J3 in same genomic region
47% to CYP107Y1 SAV2377 AP005030 Streptomyces avermitilis
50% to 107H1
1 MKEPQLQQHL EKFIQYIEAL VNEKRLNPDA DLISELVQTK EQEDKLSNNE LLSTIWLLII
61 AGHETTVNLI SNGLLALLQH PEQMNLIREN PSLIPSAVDE LLRHSGPVMF ISRLASEDMT
121 IHGKRIPKGD LVLLSLTAAN IDPQKFTYPE TLNISREENN HLAFGAGIHH CLGAPLARLE
181 GQIALGTLLQ RLPNLRLAIK PDQLNYNHSK IRSLVNLPVV F
CYP107K1 Bacillus subtilis
GenEMBL AL009126 Z99113 comp(76702-77832)
polyketide hydroxylase pksS
just over 41% identical to CYP107J1
CYP107L1 Streptomyces venezuelae
GenEMBL AF087022
GenEMBL AF079139 CDS 122..1372
pikC gene
function="catalyzes the hydroxylation of YC-17 into
methymycin and neomethymycin and narbomycin into
pikromycin"
51% to 107B1 47% to 107A1 44% to AF254925 42% to 107J1
41% to AL049754 new CYP107 subfamily
CYP107L2 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV1987 60% to 107L1 from Streptomyces venezuelae
CYP107L3 Streptomyces tubercidicus strain I-1529
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name CypLA
60% to CYP107L1 91% to 107L4
CYP107L4 Streptomyces tubercidicus strain R-922
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name CypLC
61% to CYP107L1 91% to 107L3
CYP107L5 Streptomyces sp.
GenEMBL BD133547
68% to 107L2
3 LIAGHETTVNLITGAVHALLTHPGQLAQVRGDMSLVDAVVEETLRHEGPVENATFRFAAE 182
183 PLEIGGTVIPAGDPVLIGLAAADRDGARYPGPDRFDIHRDTRGHLAFGFGTHRC 344
CYP107L6 Streptomyces sp.
GenEMBL BD133544
72% to 107L2
MGHEHVIDLGEYGPGFTENPHPVYAELRARGPVHRVRLPKHDAHHEAWLVVGYEEARAAL
ADPRLSKDGSTIGVTFLDEELIGKYLLIADPPQHTRLRGLIAREFTGRRVERLRPRVQEI
TDSLLDEMLPRGRADLVESFAYPLPLTVICELLGVPEIDRAAFRKLSTEAVAPTSGESEY
AAFVQLAAYLEELVEEKRCAPPADDLLSALIRTTDEDGDRLSPAELRGMAFILLIAGHET
TVNLITGAVHALLTHPGQLAQVRGDMSLVDAVVEETLRHEGPVENATFRFAAEPLEIGGT
VIPAGDPVLIGLAAADRDGARYPGPDRFDIHRDTRGHLAFGHGIHFCLGAPLARLEARVA
LRALLERCPGLTPDGAPGEWLPGMLIRGVRSLPVRW*
CYP107L7P Streptomyces narbonensis
GenEMBL AF521878 13901..14661
desosamine biosynthetic gene cluster
91% to 107L1
gene= nbmL
note= frameshift and deleltion generates premature
stop codon and truncated protein"
MSRTHQGTTASRPVLDLAALGQDFAADPYPTYARLRAEGPAHRV
RTPEGDEVWLVVGYDTARAVLADPRFSKDWRNSATPPTEAEAALSHNMLESDPRCGPT
(deletion)
ALRADLTLLDGAVEEMLRYGGPVESATYRFPVEPVDLDGTVLPAGETVLVVLAD
AHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCTGAPLARMEARIAVRALLERCPDLALD
VSPGELFWYPNPMIRGLESLPIRWRSGREAGRRVPVEPACRP*
CYP107L8 Streptomyces sp. HK803
GenEMBL AY354515 complement(72672..73871)
Gene = plmS2
56% to CYP107L6
MVTVDLSAYGPGFFTDPYPYYARLREAGPVHEIVLADGDRFWLI
VGYDEARAALADPRLAKSLDPPSEDERHVLITDPPDHTRLRRLVSREFTARRVEAMRP
RVQEITDGLLDEMVAGRRRADLVPSLGSPLPITVLCELLGVPLADREDFRGWTERVLV
PAEPDTIAWWKSRGFAQAGMALTDYLKNMIEDKRRSTPTGDLISSLLRTTAEDNDRLS
AAELHSMVFILIVAGHETTANLITNGVRALLAHPEQLAALRTDPEGLIDQAVEEMLRY
DGPVETSTKRFTLEAVRYGATKIPPGETLLVSIAATGRDPAQFERPDTFDIHRGTTGT
RSGHVAFGHGIHFCLGAGLARMESRVAILTLLRRCPDLALDIDPAGLDWLPGIRVRGV
RSLPVRW
CYP107L9 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
62% to 107L6 before frameshift at C-term
clone name SP0854
CYP107M1 Actinomadura hibisca
GenEMBL D87924CDS complement(6299..7534)
45% to AF127374 CDS 3226..4458 44% to AF254925
45% to 107D1 44% to 107G1, 107E1 new subfamily in 107
CYP107N1 Streptomyces lavendulae
GenEMBL AF127374 CDS 3226..4458
50% to 107D1 52% to AF254925 47% to 107E1 new subfamily in 107
CYP107P1 Streptomyces coelicolor cosmid H10
GenEMBL AL049754 CDS complement(10413..11648)
41% to AF087022 40% to 107B1 40% tp 107G1
40% to 107D1 new subfamily in 107
cloned and expressed by David Lamb and Steve Kelly
CYP107P2 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV4539 86% to 107P1 from Streptomyces coelicolor
CYP107P3 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
78% to 107P2 missing 156 aa at N-term
C-term may be frameshifted
clone name SP0887
CYP107Q1 Amycolatopsis mediterranei
GeEMBL AF040571 CDS complement(781..>2316)
66% to AF040570 comp(68704..69969) 43% to 107C1
41% to 107B1 40% to 107A1 new subfamily in 107
CYP107Q2 Amycolatopsis mediterranei
GenEMBL AF040570 CDS comp (68704..69969)
66% to AF040571 complement(781..>2316) new subfamily in 107
CYP107R1 Streptomyces maritimus
GenEMBL AF254925 CDS comp (18384..19589)
gene="encR"
53% to AF127374 CDS 3226..4458 49% to 107E1 new subfamily in 107
MTTHTQQLRDFPFAPPAELHMEPAFAQLREEEPISRVRLPYGGE
AWLVTRYQDIKTVLGDPRFSRAATQHAQAPRIQPDPAGEGVLMSLDPPDHTRLRKTVA
GVFTKRRVEDLRPATQRIAEELLEAMEASGAPADLVASYALPLPVTVICDLLGVPGDD
REQLRGWSDALLSTTACTPAESAAAAQAMADHFAALVSQRRRQPTDDLLGALVQTWDR
EEGLLRDEELVLLTRDLLIAGHETTASQIANCTYLLLQRPHDMDRLRTDPSAMASAVE
ELLRFIPLGSGSFRARVATEPVELCGVRIQPGDTVFAPTVAANWDPDVFAEPGRLDID
RSPNPHVAFGHGVHHCLGAQLARLELQVALGVLLRRLPRLRLAVDEAEIVWKTGMQVR
GPKTLPVKW
CYP107S1 Pseudomonas aeruginosa
NZ_AABQ07000001
NC_002516 3741011..3742267
locus_tag = PA3331
47% to 107B1
CYP107T1 Streptomyces coelicolor
StH63 [Full Sequence] Sanger cosmid
51% to CYP107L1 CDS 16028-17233
cloned and expressed by David Lamb and Steve Kelly
CYP107U1 Streptomyces coelicolor
StE41 [Full Sequence] Sanger cosmid
comp(7438-8739) 44% to CYP107B1
cloned and expressed by David Lamb and Steve Kelly
CYP107U2 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV3536 85% to 107U1 from Streptomyces coelicolor
CYP107U3 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
84% to 107U1 missing 90 aa at N-term
clone name SP0819
CYP107V1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV3519 low 40% range with some 107 subfamilies
CYP107W1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV2894_olmB low 40% to 107 subfamilies
CYP107X1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV6249 49% to 107L1 from Streptomyces venezuelae
CYP107Y1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV2377 50% to 107L1 from Streptomyces venezuelae
CYP107Z1 Streptomyces rimosus ssp. paromyceticus strain R-2374
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema11
96% to CYP107Z2v1
CYP107Z2v1 Streptomyces albofaciens strain C-0083
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema8
96% to 107Z2v2 and CYP107Z1
CYP107Z2v2 Streptomyces rimosus ssp. paromyceticus strain BOEH-4355
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema3
96% to CYP107Z2v1 95% to CYP107Z1
CYP107Z3 Streptomyces sp. strain IHS-0435
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema7
76% to 107Z12
CYP107Z4 Streptomyces lydicus strain NRAB-0114
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema16
82% to 107Z12
CYP107Z5V1 Streptomyces lydicus strain NRRL-2433
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema15
97% to 107Z5v3
CYP107Z5v2 Streptomyces chattanoogensis DSM-40241
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema6
1 aa diff to CYP107Z5v3
CYP107Z5v3 Streptomyces lydicus strain R-401
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema4
100% to S. kasugaensis strain A/96
CYP107Z5v3 Streptomyces kasugaensis strain A/96
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema10
100% to S. lydicus strain R-401
CYP107Z6 Streptomyces sp. strain I-1525
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema5
85% to CYP107Z8
CYP107Z7 Streptomyces tubercidicus strain DSM-40261
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema17
90% to CYP107Z8
CYP107Z8 Streptomyces platensis strain Tu-3077
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema13
89% to CYP107Z9
CYP107Z9 Streptomyces tubercidicus strain NRAA-7027
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema12
89% to CYP107Z8
CYP107Z10 Streptomyces tubercidicus strain I-1529
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema2
90% to CYP107Z11
CYP107Z10 Streptomyces platensis strain I-1548
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema14
100% to S. tubercidicus strain I-1529
CYP107Z11 Streptomyces platensis strain NRAA-7479
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema9
92% to 107Z12
CYP107Z12 Streptomyces tubercidicus strain R-922
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name Ema1
92% to CYP107Z11
CYP107AA1 Bradyrhizobium japonicum USDA 110
GenPept BAC51802
NC_004463 complete genome complement(7193424..7194725)
41% to 133B1v1 45% to 107L1
1 MVTPGSGAAI GVFVSCGNRF EVTMNEQAQP AGGDPLFNPL SPDFIRNPYP HYDRLRAIDP
61 IHVTPFGQFV ASRHADVSLV MRDKRFGKDF VERSKRRYSE KIMDEPVFRS MSHWMLQADP
121 PDHTRLRGLV VKAFTARRVE DMRPRIQEIV DEAIDAVIDR GHMDLIEDFA FRLPVTIICD
181 MLGIPEDHRE VFYKSSRDGG RLLDPVPLTP EEIAKGNAGN MMAQMYFQQL FELRRRNPAD
241 DLTTQLVQAE EDGNKLTNEE LTANIILLFG AGHETTVNLI GNGLLALHRN PDQLALLKAR
301 PELMVNAIEE FLRYDSSVQM TGRVTLEDID DLGGRKIPKG ETVLCLLGSA NRDPAVYPDR
361 PDRLDVTRPN VKPLSFGGGI HFCLGAQLAR IEAEIAIATL LRRLPDLRID DVENPEWRPT
421 FVLRGLKSLP ASW
CYP107AB1 Streptomyces rochei plasmid pSLA2-L
NC_004808 Links 87725..88939
49% to 107A1
note="ORF37 lankamycin biosynthesis protein
MNQPQLPEIPALNSELFHTDQYATYREILEQRPVTRVRFYDGSL
VWLVNRHEDVRAALTDPRLSNDPMKQSDIDLSAATGIPADLIEYFQRNMFRSDEPDHG
RLRKLVTREFTVRRINALRPRIRQIADDLLEKFAATGGGDLVEALARPLPLTVMCELL
GVPEEDRADFQTWSQHIVESSPEFAERNAVSYRSLFECVRSLIRRRRDEPGDDLLSAL
VDLRDVADRLSENELISTVFLLLVAGIETTVNVLGTGTFLLLTHPGELARLRADGALL
GPAVEEMLRYMAPIEITSRHTLEPVEIGGVSIDAQSTVLINLAAANRDPARFEDPQSF
RVDRNDGGHLTFGHGIHYCLGAALARAEAEVTFEALLERFPDLRLAASASDLTWRHAF
MRGPVELPVSWG
CYP107AC1 Streptomyces atroolivaceus
GenEMBL AF484556 60948..62147
leinamycin biosynthetic gene cluster
48% to 107N1
gene = LnmA
MSATRRVHIYPFEGEVDGLEIHPKFAELRETDPLARVRLPYGGE
GWMVTRYDDVRAANSDPRFSRAQIGEDTPRTTPLARRSDTILSLDPPEHTRLRRLLSK
AFTARRMGAMQSWLEELFAGLLDGVERTGHPADIVRDLAQPFTIAVICRLLGVPYEDR
GRFQHWSEVIMSTTAYSKEEAVSADASIRAYLADLVSARRAAPHDDLLGVLVSARDDD
DRLTEDELITFGVTLLVAGHETSAHQLGNMVYALLTHEDQLSLLREQPELLPRAVEEL
LRFVPLGNGVGNARIALEDVELSGGTVRAGEGVVAAAVNANRDPRAFDDPDRLDITRE
KNPHLAFGHGAHYCLGAQLARMELRVAIGGLLERFPGLRLAVPADQVEWKTGGLFRGP
QRLPIAW
CYP107AD1 Streptomyces hygroscopicus
GenEMBL AF521896 4248..5489
ansamycin biosynthesis gene cluster
43% to 107X1
gene = gdnH
MSGRHFEQGERGTAMADTPEEELRILDPQSVAQELRKHGPPRQI
TMHGTTAWLVSRYEEVRDCLGHPGMSPAAAYAASQGQTNPVSGLFEDTVAGTNPPQHT
RLRRLLAKAFTVRRVESLRPRVQEITDTLLDRIAVDGRADLVSALAIPLPMQVICELL
GVPIADRTEFHQWADLMLTPPLDPDTAARSQDASAKLWTYMEDLAEARRKAPEDDLIS
DLMSAHEDDRLSHREVVATARMMLIAGYELTGSFISNAVFSLLSQPDQMELLRKDPEL
AGRGLEELLRHAGPGILIVRFANEDVEIGSVSIRAGDQVLLDMDAAHSDPAHFTDGER
LDLTRDSAVHLQFGHGIHYCIGAPLARVEGQIALESLVRRFPGLRLSVPAAEISHSKN
PFIRSLTALPVEFEAQQPVAG
CYP107AE1 Streptomyces sp.
GenEMBL BD133545
50% to 107X1
VILLKSLAANGLTASSCFTVSPLPIRSASPSIAFLTSSSERDSGVRNDRPSDAQPAIARF
RFPTPPHPRNPTQPHPTPPRPSPTDDPLQAPTFFADPYPTYARLRDTAPVLKVPTGSGGG
GRHSYVVTGYAEAREAFTDPRLSKDTASFFAGRPSQRDLHPAVSRNMLATDPPQHARLRA
LVTKAFTTGAVARLRPYISSLVDELLDTWPTHGTVDLIADLAVPLPVTVICELLGVPDSD
RASVRTWSSDLFAAGDPQRIDAASHAVGDYMTALVAAKRTAPGDSLLDDLIAVRDGQDHL
SEDELVSLAVLLLVAGHETTTNFIGNAALALLRHPESLAHLRAEPQLLGGALDELLRYDS
PVGIATFRFSTEALTLGGTEIPEGVPVLIAPGAANRDPDRFPDPDRLDLTRGATGHLAFG
HGIHRCLGAPLARAEAELALHAVITRYPQAALATPPETLPWRHTRLTRGLASLPITLRDH
PK*
CYP107AF1 Streptomyces collinus DSM2012
GenEMBL AF293355 24259..25518
Gene = rubU
rubrinomycin gene cluster
52% to 107B1
MARTDAPQAAPPADLFTPAFHQNPHEALAGLRRTAPAVPVMTPN
GLRTWLVTGHEHARALLADPRLSKDMRVGRDLIPRNFVDPDKQREFLAESGERSQFPH
VLSVHMLDSDPPDHTRLRRLVGRAFTARRVESLRPRITELTDELLDAMARHERLDLME
ALAFPVPFTVICWLLGVPPDDRAAFRRWSNLLVSGAGTDEVREASASMITYLTELIEA
KRNEPADDMLTDLVHARDAGDQLSSDELISMAFLLLVAGHETTVNLIGNGALALLTHP
EVREQLAADESLWPGAVEEFLRYDGPVTNATWRFTTEPVEVGSVTIPEGEFVTISIGA
AGRDPDRYPDPDRLDITRAHSGSVAFGHGIHHCLGAPLARLEGRIVLSRLFARLPGLR
LAADPDELSWRSSLMMRGLEELPVFTA
CYP107AG1 Streptomyces atroolivaceus
GenEMBL AF484556 complement(120436..121638)
Gene = LnmZ
leinamycin biosynthetic gene cluster
49% to 107E1
MSTEVETEKPAPVAYPFTGSEGLELSQSYAKLFEDGDPIRVQLP
FGEPAWLVTRYDDARFVLTDRRFSRHLATQRDEPRMTPRAVPESILTMDPPDHTRLRT
LVSKAFTPRRIESKRAWIGELAAGLVADMKAGGAPAELVGSYALAIPVTVICELLGVP
EDDRTRLRGWCDAALSTGELTDEECVQSFMDLQKYFEDLVKERRAEPRDDLTSALIEA
RDAHDRLAEPELIGLCISILIGGFETTASEISSFVHVLQQRRELWTRLCADPEAIPAA
VEELLRFVPFAANGISPRYALEDMTVGGVLVREGEPVIVDTSAVNRDGLVFDNADEVV
IDRADNRHMVFGHGAHHCLGAHLARVELQEALKALVEGMPGLRLSGDVEWKADMIIRA
PRVMHVEW
CYP107AH1 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
50% to 107L6 missing about 42 aa at N-term
clone name SP0749
CYP107AJ1 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
52% to 107B1 frameshifted C-term
clone name SP0908
108 Family
CYP108A1 Pseudomonas spp.
Swiss P33006 (428 amino acids) PIR S27653 A42971 (428 amino acids)
Also found a PIR cross-reference to EMBL S39894 but could not
retrieve it
Peterson J.A., Lu J.-Y., Geisselsoder J., Graham-Lorence S.,
Carmona C., Witney F., Lorence M.C.
Cytochrome P-450 terp: Isolation and purification of the protein
and sequencing of its operon.
J. Biol. Chem. 267, 14193-14203 (1992)
CYP108A1 Pseudomonas spp.
GenEMBL M91440 (6620bp)
Hasemann,C.A., Ravichandran,K.G., Peterson,J.A. and
Deisenhofer,J.
Crystal structure and refinement of cytochrome P450terp at 2.3A
resolution.
J. Molec. Biol. 236, 1169-1185 (1994)
CYP108B1 Caulobacter crescentus CB15
GenEMBL AE005918 GenPept AAK24465
NC_002696 complete genome 2703947..2705221
Complete genome sequence of Caulobacter crescentus
Proc. Natl. Acad. Sci. U.S.A. 98 (7), 4136-4141 (2001)
47% to CYP108A1
1 MTISTDIANT IIDPKAYADG DRIDQAFAHL RREAPLAVAQ PDGFDPFWVV TRHADILEVE
61 RQNELFHNGD RATVVTTIEP DKKVREMMGG SPHLVRSLVQ MDNPDHFAYR KITQGALLPQ
121 NLRALEARIR EIARGFVDRM AEHGDRCDFA RDVAFLYPLH VIMEVLGVPE SDEPRMLKLT
181 QELFGNADPD LNRTGKSVTD VGEGVDSIQS VVMDFMMYFN AITEDRRANP RDDLATLIAN
241 GKINGEPMGH LEAMSYYIIA ATAGHDTTSS TTAGALWALA ENPDQFAKVK ADPSLIPGLI
301 EESIRWVTPV KHFMRTATAD AELGGQKIAK GDWIMLSYPS GNRDEAVFED PFTFRVDRTP
361 NKHVAFGYGA HICLGQHLAR MEMRVLWEEL FARLDHVELD GAPTRMVANF VCGPKSVPIR
421 FKMH
CYP108C1 Saccharopolyspora spinosa strain NRRL 18395
No accession number
Istvan Molnar
Syngenta Biotechnology, Inc.
47% to CYP108B1 43% to CYP108A1
CYP108D1 Novosphingobium aromaticivorans
GenEMBL NZ_AAAV01000137
16805..18166 gene = Saro1710
47% to 108B1 39% to 108C1
MTNTSRLTKRRRPRRSDGKREGFMDSIPMVPAEVGRAVIDPKSY
GTWEPLLDRFDALRAEAPVAKVVAPDDEHEPFWLVSSFDGVMKASKDNATFLNNPKST
VFTLRVGEMMAKAITGGSPHLVESLVQMDAPKHPKLRRLTQDWFMPKNLARLDGEIRK
IANEAIDRMLGAGEEGDFMALVAAPYPLHVVMQILGVPPEDEPKMLFLTQQMFGGQDE
DMNKSGLKDLPPEQISQIVAGAVAEFERYFAGLAAERRRNPTDDVATVIANAVVDGEP
MSDRDTAGYYIITASAGHDTTSASSAGAALALARDPDLFARVKADRNLLPGIVEEAIR
WTTPVQHFMRTAATDTELCGQKIAAGDWLMLNYVAANHDPAQFPEPRKFDPTRPANRH
LAFGAGSHQCLGLHLARLEMRVLLDVLLDRVDSLELAGEPKRVNSTFVGGFKSLPMRW
KAA
CYP108E1 Ralstonia metallidurans
GenEMBL NZ_AAAI01000348
46192..47481 gene = Reut4024
41% to 108B1 39% to 108A1 48% to 108C1
MTIASDFDTELASHEIYSDPERMHEMFETLRREDPVHWTTAPGH
PPFWAVTKQADVIEVGKHPDVFIASPKSFLMNDVEQRVRIEETAATGGKLVRTMIHMD
DPDHKKYRGLTQSYFMPANIKRLESVIQERARALVGRLIEKGTSEFCSEIAVWYPLQI
VMTLLDVPESEHPYLLKLTQQFLAPKDPTLRRDGPDERGKGAVAKEYFAYFGKMLAER
RAAPLKEDLGSLIAHATVDGEPLPLMEAVSYYVILATAGHDTTSSSMCSGLYYLLTQP
GELDRLRARPELMPSAIEEMFRHGSPVKHFVRTATRDFELRGKKIQAGDEVALMYHSA
SFDEEVFDEPRSFRIDRGPNKHVAFGFGIHACLGQNLARASMRTFFTELLARTESIEV
VGKAEFIASNQVGGMKTLNIRVTPSKQSTTDRIEVAA
109 Family
CYP109A1 Bacillus subtilis
GenEMBL M24523 (3187bp)
Lewis,P.J. and Wake,R.G.
DNA and protein sequence conservation at the replication terminus
in Bacillus subtilis 168 and W23
J. Bacteriol. 171, 1402-1408 (1989)
Ahn,K. and Wake,R.G.
A unique open reading frame adjacent to the replication terminus
of the Bacillus subtilis W23 chromosome compared with Bacillus
subtilis 168
unpublished (1990)
Ahn,K.S. and Wake,R.G.
Variations and coding features of the sequence spanning the
replication terminus of Bacillus subtilis 168 and W23 chromosomes
Gene 98, 107-112 (1991)
CYP109B1 Bacillus subtilis
GenEMBL AF015825 Z99110
YjiB
also similar to CYP106A, both 106 and 109 are close
together on a tree
110 Family
CYP110A1 Anabaena sp. (a cyanobacterium)
Swiss P29980 (354 amino acids) GenEMBL M38044 (5933bp)
GenEMBL U38537, M13161
Lammers,P.J., McLaughlin,S., Papin,S., Trujillo-Provencio,C. and
Ryncarz,A.J.II.
Developmental rearrangement of cyanobacterial nif genes:
Nucleotide sequence, open reading frames, and cytochrome p-450
homology of the Anabaena sp. strain PCC 7120 nifD element
J. Bacteriol. 172, 6981-6990 (1990)
This sequence was later revised to give a complete P450 sequence
of 448 amino acids.
CYP110A1 Nostoc sp. PCC 7120 same as Anabaena sp. PCC 7120
GenPept BAB73407, C37842 (this entry missing N-term)
NC_003272 complete genome 1708114..1709493
1 aa diff to M38044
1 MLTQLPNPIS VPSWWQLINW IADPIGFQKK YSKKYGNIFS MQLAGIGSFV ILGEPQALQE
61 IFTQDSRFDV GRGNTLAEPL IGRTSLMLMD GDRHRRERKL LMPPFHGERL QAYAQQICLI
121 TNQIASEWQI GQPFVARSAM QKLSLEVIIQ IVFGLADGER YQQIKPLFTD WLNMTDSPLR
181 SSMLFLKSLQ KDWGTWTPWG QMKHKQRSIY DLLQAEIEEK RTKENEQRGD VLSLMMAARD
241 ENGQAMTDEE LKDELLTILF AGHETTATTI AWAFYQILKN VNVQEKLQQE LDRLGANPNP
301 MEIAQLPYLT AVSQETLRMY PVLPTLFPRI TKSSINIAGY QLEPDTTLMA SIYLIHYRED
361 LYPNPQQFRP ERFIERQYSP SEYIPFGGGS RRCLGYALAL LEIKLVIATV LSNYQLALAE
421 DKPVNVQRRG FTLAPDGGVR VIMTGKKSLK FEQSSKIFN
CYP110A2 Anabaena variabilis (a cyanobacterium)
GenEMBL U38478 (1743bp)
Lammers, P.J. and Duran, S.
possible alkane/fatty acid hydroxylase
CYP110B1 Nostoc sp. PCC 7120 Same as Anabaena
GenPept BAB75445, AC2274
NC_003272 complete genome complement(4523158..4524546)
45% to CYP110A2 53% to 110E1 49% to 110D1 47% to 110C1
1 MHLPKGPQTP VFVQVLRWVF SPMSFLEDCA KRYGDIFSVK LAKDVPAIVF LSNPKDIQQI
61 LTNDNNQLDS PGDWNDLFEP LLGKRSVITL SGAEHQRQRQ LLMPPFHGER MRGYSQVITD
121 VTEKVISQHQ IGQPFQVRSV TQAITLRVIM QAVFGLYEGS RAEKLQHLLS DLLEKSSSPF
181 SVALLYFPSL RRDFGPIKFW GEQVQIQQQA DELIYQEIQE RRENPDPSRT DILSLLMDAR
241 DADGQPMTDV ELRDELMTLL VAGHETTATA LAWAMYWIHK LPPVKARLLE ELDSLGDNPD
301 STTIFKLPYL NAVYSETLRI YPVAMLTFAR RVIETMALGG YELPPGTPVL GSIYLTHHRE
361 DLYPEPKKFK PERFLERQFS PYEYLPFGGG TRRCLGLAFA QWEMKLALAK ILTSYELELV
421 NNSVEVRPKR RGLVTGPHRP IEMVIKSQRQ ITSRILETTT VS
CYP110B2 Nostoc punctiforme
NZ_AAAY02000005 GenPept ZP_00111619.1
complement(58895..60277) gene = Npun6097
75% TO 110B1
MKLPKGPQSPAVLQMLRWITSPMSFMETCAKRYGDMFTIRLDSK
SPPLIFVSKPEVLEQILTNDIKGLEAPGDTNLVFESLLGKHSVITISGAEHQRQRQLL
LPPFHGERMRSYSQIISDITEKVISQYQIGQPFNIRSVTQAITLRVIMQAVFGLDEGP
RAEKLQHCLAEMLEKGSSVLSAALLYFPALQRDFGPINFWGKQMRRQQAADKLIYEEI
RERQEQPDPSRTDILSLLMAARDEAGQPMTDEKLRDELMTLLVAGHETTATALAWAFY
WIQKIPTVRQKLLKELDSLGDNPDPSTIFKLPYLNAVCSETLRIYPVAMLTFARVVRT
PLSLGGYELEPGIGVIGSIYLTHHREDLYPEPKQFKPERFLERQFSPYEYLPFGGGAR
RCIGLAFAQLEMKLALAKILSTRELELVDNSEVRPKRRGLVTGQDRPIQMVVTSQRQV
KFPILQTATV
CYP110C1 Nostoc sp. PCC 7120 Same as Anabaena
GenPept BAB76385, AF2391
NC_003272 complete genome 5587079..5588485
48% to CYP110A2 49% to 110E1 47% to 110B1
1 MKYQIQRPNP LKTHPFLQKL QWIADPVEYM KKASLQHPDM FTAEVIGFGD TVVFVSHPQG
61 IQTLFANDRK KLVAVGEANR ILYPLVGNNS MFLLEGVKHK QRRQLLMPSF HGERMREYGH
121 LIRNITENLF SQLQQDVTFS ALTAMREISM QVILQAVFGF YEGERCQQFK HLLPIFLSEL
181 FQSPLASSIL FFPSLQKDLG NLTPWGRFVR QREKIDKLLY AEIAERRQEI NSDRIDILSL
241 LISARDETGD SMSDKELRDE LITLMISGHE TTGTAMAWSL YWILQTPEVF QRLIQELDSL
301 GDSPDPMSIF RLPYLTAVCN ETLRINPVAM LTLPRVVKEP IELLGNRLET STTVVGCIYL
361 THHREDLYPE SKLFKPERFL KREFSQYEFM PFGGGVRGCI GQALAMFEMK IVLATVLSRY
421 QLALADRKPE RPQRQGFTLT PTNGVKMLIT GQHKRQNYSM AASTTFNA
CYP110C2 Nostoc punctiforme
GenPept ZP_00108280.1
GenEMBL NZ_AAAY02000070 complement(34550..35941)
gene = Npun2703
60% to 110C1
1 MQLPNILKSP SLLQKLHWVS DPIGYMENAA QEYPDIFTGK IVGFGDTVVF VNHPQAIQEI
61 LTNDRKKFTA VGELNGILKP LLGDNSVLML ESDRHKRQRQ LVTPSFHGER MQAYGQLICN
121 VSKKIFNQLP LNKPFVARNL TKEISLQVIL QSIFGFYEGE KIQKLRQLLP LLLELFESPL
181 SSSLFLFSFL QQDLGAWSPW GNFLRVREKI DQFLYTEIAE CQQQADPERI DILSLLISCR
241 DEAGQPMTDQ ELRDQLITLI LAGYDTTATA MAWGLYWIHK QPLVCEKLLQ ELDTLGDSPD
301 PMSISRLPYL TAVCNETLRI HPVTMFSFPR VVQEPLELLG HSLEPGTILL PSIYLTHHRE
361 NLYPQSKQFK PERFIERQFS PYEFLPFGGG VRRCMGEALA LFEIKLALAT IVSHYHLALV
421 DQRPEQPQRR GFNLAPGSGV KMVMTDQRAR KESLINMTTT PLS
CYP110D1 Nostoc sp. PCC 7120 Same as Anabaena
GenPept BAB76465, AF2401
NC_003272 complete genome 5678382..5679743
48% to CYP110A1 53% to 110E1, 49% to 110B1
1 MTVTQNLPNG PRIPRLLRLF KFITQPIQYV EDFAKVYGDN FTIWGSGESY FVYFSHPQAL
61 EQIFTNVSCF ESSGGGSPLL ELLLGKNSLI LLEGDRHQRQ RQLLTPPFHG ERMRAYGQTI
121 REITQQVTQA WQMGKPFNIR ASMQEITMRV ILRVVFGVDE GELFQELRQL LTTLLDFMGS
181 PLMSSTFFFS FTQKDYGAWS PWGRMVRLIK KIDQLIYALI AQRRAEFGEN RQDILSLLIS
241 ARYDDGQPMS DVELRDELMT MLVAGHETTA SALTWAFYWI DSVPEVREKL FQELDTLNDD
301 SEPSIIAKLP YLTAVCQETL RFYPIVLNAF FRRTKNPMEI MGYKLPKATL VVPSIYLAHH
361 REEVYPQSKQ FRPERFLEKQ FSPYEYLPFG GGNRRCIGLA FAQYEMKIVL ATILSQFQVS
421 RLSKRPVQPV RRGLTLAAPG GMKMVANKRM RNS
CYP110D2 Nostoc punctiforme
NZ_AAAY02000028 GenPept ZP_00109203.1
52704..54170 gene = Npun3650
68% to 110D1
MNIPLSVTLSNMKSRNNKIQKPSNLQTPMTATYNLPDGPQMPRW
LRTIKFISQPVKYVDDFAKTYGDTFTIRSSRSDNHIVYFSQPQALEEIFTADSRHFEV
GRGNTGLRFLLGDRSFMLVDGDRHQRQRQLLAPPFHGERMRAYGEDIRKITQQVSHEW
KIGKPFNIRESMQEITLRVILRVVFGLNEGELFEELRRSLSDLLDFISSPIMSSAFFF
RFIQKDFGAWSPWGRILLQRQKVDLLIYTLLRERRAQTDQNRQDILSLMMAARYDDGQ
GMSDEELHDELMTLLVAGHETTASALTWAFYWIDHLPEVREKLLQELNTIGVNPDLSS
VAKLPYLTAVCQETLRIYPIAMTAFVRIVKTPITIMGYELREGTAIVPSIYLAHHREE
VYPQSKQFKPERFLERQYSPYEYLPFGGGNRRCIGMAFAQYEMKIVLATVLSEFQVSL
VNKRPVHPVRRGLTVATPAGMRMVATPQVKRANTPALV
CYP110D3 Trichodesmium erythraeum
GenPept ZP_00074554.1 GenEMBL NZ_AABK02000068
complement(10019..11407) gene = Tery3870
54% to 110D1
MTLPDGPSLSPLQRRLRTWKFIFSPLSAIEERYSEYGDIFRTNT
NSLYPFIYFCNPKAIQQIFTADPDTFTSGSINGILKYFVGLNSLLLQDGDRHKRQRKL
LMPPFHGDRMRKYGDLIYNITSNVISQWKIEQPFPIRKSTQEISLKVILAAVFGLDQE
GKSYEKLRVLMSDLLDSMSSPLSSTFLFFNFLRKDWGPWSPWGRFLRKKQELHELIIA
EIQTAKKEGNHRDDILSLLLEARDEAGNAMSDEEIKDELLTMLFAGHETTASALAWAL
YWIDMIPSVGEKLMAELATIPSNSDQVAITKLPYLSAICQETLRIYPIAMNAFPRVVQ
KPIEIMGYQLEPGMVAIVPIYLTHHREDIYPEPKKFKPERFLERQFSPYEYLPFGGGS
RRCIGSAFALFEMKLVLATILSQWELKLLPNQRISPVRRGLTMAPPANMRMVVKPKKS
WQKVSQPILTSG
CYP110E1 Nostoc sp. PCC 7120 Same as Anabaena
GenPept BAB76532, AI2409
NC_003272 complete genome 5753083..5754450
50% to CYP110A2 53% to CYP110B1 53% to 110D1
1 MKLPDSPKIP KFMQLVQWIY QPLQLMEASA KAHGDSFTLW LTNKRPIVFL SNPQAIQELF
61 TTPLEQLDAR GTAQVLQPLL GENSLLLLSG ETHQRQRKLL TPPFHGDRMR AYGDIITNIT
121 KEVISNWQLG KPFSVRDSMQ EITLRVILQA VFGLREGERY TQLQKRLCDI LDLSGSALRS
181 TLSFLPALQI DLGRWSPWGH FLRQREAIDQ LLYAEIQDRR DHPDPSRTDI LSLMMAARDE
241 NGEAMTDVEL RDELMTLLVA GHETTASALT WALYWIHKLP QVREKLLAEL DNFGDNGDVN
301 EITRLPYLTA VCQETLRIYP IAMVTIPRIT KTNLEIGGHQ FAPGTMLVGC IYLMHRRPDL
361 YPQPQEFKPE RFLEKQYSLY EYLPFGGSNR RCVGMAFALY EMKLILATVL ANVDLALVDN
421 YPVKPTRRGV TLAPSGGKWL IATAQHQKIK NPVEV
CYP110E2 Nostoc punctiforme
NZ_AAAY02000088 GenPept ZP_00107327.1
complement(18173..19567) gene = Npun1723
58% TO 110E1 55% TO 110B1
MSLLKLPNGPQTHPWIQMYQWLTNPLEYMEACTKRYGDIFTLKL
GQNFAHQVFISNPQAIQQIFTTDPKQLDSGESAGIKAPLLGQQSLLALDGKPHQRQRK
LLTPPFHGERMLAYGELIREITEQVSSQWQVGETFAVLPSMQAISFQVILKAVFGLED
GPRYKKLNELLIKILNPKIPLLRTVLLIFPSMRQDLGAWSPWGKYLRLRQQIDQLIYA
QIQERKAQPNLSGTDILSLMMAARDEAGEPMTDLELRDELMTLLVAGHETTATSLSWA
LYWIHHRPQVREKLLQELDNLGEKPDPNAIFRLPYLNAVCSETLRLYPVAMSALNRLV
KSPLQIGEYNFEPGTILIPSIYLTHHREDLYPESKQFKPERFLERQFSPYEYLPFGGG
NRRCIGMAFALFEMKLVLATVLSRWQMELADSKPVRPVRKGLLFSPAGGVQMVVKGKR
LQNQPILQTSSSSV
CYP110E3 Trichodesmium erythraeum
GenPept ZP_00072591.1
GenEMBL NZ_AABK02000017 complement(<3..1016)
53% to 110E1 missing C-terminal 121 aa (runs off end of clone)
1 MIKLPGPKSP ALTQILQWTA KPIKFMEKCA REYGDTFEVK LNYPIVFISH PKAIEEIFKA
61 NPKKFDCGSS NKLAQPLLGD YSLLLLDDIP HQRQRKLLMP PFHGKRMQAY GELICNVAQE
121 VASKWEIGQV FSMREFTAEI SLKVILQAVF GLYEGERYSK LEKLLGSLLE SLSSPLKTSM
181 LFFQFLQIDL GPWSPWGNFI KNREEIYELL CAEISERRQK LDPERSDILT MLLLARDEEG
241 EGMSDIELRD ELMTLLIAGH ETTATSLSWA FYWIHHQPEI YQKLSRELET FGDDLNPMTV
301 INLPYMNAVC SETLRIYPVV IIVSPRKTKL PITIMGQT
CYP110E4 Gloeobacter violaceus PCC 7421
GenEMBL AP006578 complement(257348..258724)
gene = gll3063
NC_005125 complete genome complement(3256348..3257724)
locus_tag = gll3063
71% to 110E5 55% to 110E1
MSLPPGPSSPSPFQLMQWIGCPTDYLHTTAARYGDPFTMRVGVF
PPLVMFSDPRAIQQLFTAEAGTFDAGASNVALRPTLGANSLLLLDGERHQQQRRLLTP
PFHGERMRAYGELIRQVTEEVIVRWQPGKPFLVRNAMQRISLAVILQAVFGLHDGTRL
VRLRQALGSMLDAMSSPLSMAMLLMLPEDFGPWSPRARLQAHLGAIDELLYAEIRERR
EHFDAGAGDILGLLLAARDEAGAAMGDAELRDELMTLLVAGHETTATAMAWALYWIHY
LPQVRERLLAELDSLGSDPDPEAIARLPYLGAVCSETLRIYPVALIASPRVARHTVRI
LERDYEAGTRLAAGIYLAHHRPETYPEPERFRPERFLERTFSPYEFVPFGGGSRRCIG
MAFALYEMKLVIATVLLERDLRLVQPRLLRPVRRGVTLAPPEGLYLVPTGERSASRLL
SRTSTAGQ
CYP110E5 Gloeobacter violaceus PCC 7421
GenEMBL AP006578 complement(258800..260176) gene = gll3064
NC_005125 complete genome complement(3257800..3259176)
locus_tag = gll3064
71% to 110E4 55% to 110E2
MSLPAGPASPPPLQLLQWIGRPTDYLERTARRYGDPFTMRLGLH
SPVTGVFFSSPEAFQQLFNTEPGLFDSGGANASSTFNLLFGTNSLILLDGERHQQQRR
LLTPPFHGERMRSYGELIRTLAEQVTARWNLGTPFQARRSMQRISLGVILKAVFGLHD
GTRYLRVCRLLGNLIDASASPLLFGLRLIFPQDAGPMSPMGQLKAQIDAIDELLYAEI
RERRERPDPRADDILSLLMAARDEAGQGMGDVELRDELMTLLVAGHETTATAMAWALY
WIHRLPQVRERLLAELDSLGSDPDPEAIARLPYLGAVCSETLRIYPVAMVAFARVPRR
PVRILDREYPAGTFLIPNIYLAHRRPEAYPDPERFRPERFLERTFSPYEFVPFGGGSR
RCIGVAFALYEMKLVLATVLSRVELRLADPRPRLPVRRGLTLAPPEDLHLIPTALRSG
HRDLLPAC
CYP110F1 Nostoc punctiforme
NZ_AAAY02000005 GenPept ZP_00111618.1
complement(57031..58407) gene = Npun6096
48% TO 110E1 48% TO 110D1
MKILDSLTTPSLLQTLQLIAKPTKTLENYATKYGDIFTMRVMGL
KSPPIVFFSHPQAISDCFAVPAHKLDFKKATHVFKPLFGENSIVFKEARSHQQQRQLL
LPAFHGDNLKSYGQAICQIAEELTQSWTSGTNICIHKLMSKITLEIILQVVFGITHGV
RYQQLKEQLSALLEDVTKPWYSSLFFFPSLQKDLGAWSPWGIFLKRREQIDKLIYAEI
SERRWQNDAMRTDILSLLMSAHDVNGQQMTDEELRDQLVSLLLLGYETTSGVLAWIFY
LIHSHPEVKHRLMQELSTLDNLTNPEAITQLPYLTAVCQETLRIHPIALICTPRMLKE
PVEIMGHKFTSETVLVPCIHLAHRRTDTYPEPEQFRPERFLNQKFSPYEYLPFGGGYR
GCIGAAFSMYELKLVTAIILSRFELSLTDKRPAYPVRRGITIVPSGGVKMVVTKKAKF
KRQTILST
CYP110G1 Trichodesmium erythraeum
GenPept ZP_00074734.1
GenEMBL NZ_AABK02000081 complement(2404..3738)
42% to 110C1
1 MKQVCALKTP LWLQRFNYIT NPVSYWQKAY SSYKDAFYAQ GINFGKPLMV FYTPSAAKQI
61 IENCQGDLTT TSFDSELTAI FGDSSFFILE GTNHKKMRKL LIPALHGKHI KTYGELICNL
121 VNNLIENLPF NQSFSALEIA QEISMQVMIK LLFGNYQQER YQKIKQLMIN MVSLFAANVF
181 GFPLFFKFLQ QDLGLVSPWG NFLQQRRKIQ QLIYQEIAER RNHPNQERTD ILSLLMTAQD
241 EKGNFLNDEE LLGQLLSLLF TGNESTAASI AWSWYEVYRN SKIKEKLLEE INNLGDSPEP
301 LSLFNLPYLS AVCNETLRKY PVTMFMIPRI VKNTTEINGY QLDKGMLVTV GTYILHHRED
361 IYDQPEEFKP ERFIEHRFSS FEFLPFGRGM RGCIGADIAL YQMKLTLATI ISHHRLELTN
421 YGQIFPKRRN TILTPIKLRI IKAC
111 Family
CYP111A1 Pseudomonas incognita
GenEMBL L23310 (2080bp)
Ropp,J.D., Gunsalus,I.C. and Sligar,S.G.
Cloning and expression of a member of a new cytochrome P-450
family: cytochrome P450lin (CYP111) from Pseudomonas incognita.
J. Bact. 175, 6028-6037 (1993)
CYP111A2 Novosphingobium aromaticivorans
GenEMBL NZ_AAAV01000134
complement(20145..21356) gene = Saro1618
65% to CYP111A1
MLDLKNPDTYQGGVPYAALQDLRAEGPVHWNPESDGAGFWAVLG
HDEIVAVSRQPDLFSSAFENGGHRIFNENQVGLTGAGESAIGIPFISRDPPSHTQYRK
FVMPALSPARLQGIEERIAKRVERLFAQVPLGETVNILPLLTVPLPLLTLAELLGVPA
DLWPDLHRWTDAFVGEDDPDFRQSPEAMQAVLAEFMGFATALFEDRRANPGPDIASLL
ANTEIRGEPAPLRDFIANLILALVGGNETTRNSINHTMIALAENPGQWDILRADPSLM
TAAVKEMVRFASPVIHMRRTAMRDTQLGQQAICKGDKVVIFYPAGNRDPAVFENPDRF
EITRPVRQHLAFGSGAHVCVGSRLAEMQLRLAFAEMARHVRAFEVVGEPSRVRSNFIN
GFKRLEVRLLV
112 Family
CYP112A1 Bradyrhizobium japonicum
GenEMBL L02323 L12971 U12678 (11,715bp)
NC_004463 complete genome 2317922..2319127
Tully,R.E. and Keister,D.L.
Cloning and mutagenesis of a cytochrome P-450 locus from
Bradyrhizobium japonicum that is expressed anaerobically and
symbiotically
Appl. Environ. Microbiol. 59, 4136-4142 (1993)
Note: called BJ-1 see CYP114, CYP115P, CYP117
CYP112A2 Rhizobium sp. NGR234 plasmid pNGR234a
GenEMBL AE000083
NC_000914 complement(233666..234868)
Gene = y4lD
Freiberg,C., Fellay,R., Bairoch,A., Broughton,W.J., Rosenthal,A.
and Perret,X.
Molecular basis of symbiosis between Rhizobium and legumes
Nature 387 (6631), 394-401 (1997)
about 92% identical to 112A1
MPEQPLPTLPMWRVDHIEPSPTMLALRANGPIHNVRFPRGHEGW
WVTGYDEAKAVLSDAAFRPAGMPPAAFTPDCVILGSPGWLVSHEGGEHARLRTIVAPA
FSDRRVKLLAQQVEAIAAQLFETLAAQPQPADLRRHLSFPLPAMVISALMGVLYEDHA
FFAGLSDEVMTHQHESGPRSASRLAWEELRAYIRGKMRDKRQDPGDNLLTDLLAAVDR
GEATEEEAIGLAAGMLVAGHESTVAQIEFGLLAMLRHPQQRERLVGNPSLVDKAVEEI
LRMYPPGAGWDGIMRYPRTDVTIAGVHIPAESKVLVGLPATSFDPRHFEDPEIFDIGR
DAKPHLAFSYGPHYCIGMALARLELKVVFGSIFQRFPALRLAVAPEELKLRKEIITGG
FEEFPVLW
CYP112A3v1 Mesorhizobium loti
GenPept NP_106888
95% to 112A2 Rhizobium sp. NGR234
1 MSEQPLPTLP MWRVDHIEPS PEMLALRANG PIHHVRFPSG HEGWWVTGYD EAKAALSDAA
61 FRPAGMPPAA FTPDSVILGS PGWLVSHEGG EHARLRTIVA PAFSNRRVKV LAQQVEAIAA
121 QLFETLAAQP QPADLRRHLS FPLPAMVISA LMGVLYEDHA FFAGLSDEVM THQHESGPRS
181 ASRLAWEELR AYIRGKMWDK RQDPGDNLLT DLLAAVEQGN ATEEEAIGLA AGMLVAGHES
241 TVAQIEFGLL AMFRHPQQRE RLVGDPSLVD KAVEEILRMY PPGAGWDGIM RYPRTDVTIA
301 GVHIPAESKV LVGLPATSFD PRHFDDPEIF DIGRDENPHL TFSHGPHYCI GMALARLELK
361 VVVGSIFQRF PALRLAVAPE ELKLRKEIIT GGFEEFPVLW
CYP112A3v2 Mesorhizobium loti
GenEMBL AL672112 complement(85404..86606)
Strain R7A symbiosis island
Gene = msi071
2 DIFFS with CYP112A3v1
CYP112A4 Rhizobium etli symbiotic plasmid p42d
NC_004041 55365..56645
89% to 112A3
gene = cpxP2
MSEQSLPTLPMWRVDHIEPSPEMLALRAKGPIHRVRLPSGHECW
WVTGYDEAKAVLSDAAFLPAGMPPADFTPDSVILGSPGWLVSHEGDEHARLRTIVAPA
FSNSRVKLLTQQVEAITVQLFDTLAVQPQPADLRRHLSFPLPAKVISALMGVPFEEHA
FFAGLSDEVMTHQHESGPRSASGLAWEELRAYIHGKIRGKRQDPGDNLLTDLLAAVDQ
GKATEEEAIGLAAGVLVAGHESTVAQIEFGLLAMFRHPQQRERLVRDPSLVDKAVEEI
LRMYSPGAGWDGIMRYPRTDVTIAGVHIPAESKVLVGLPATSFDPCHFKDPEVFDIGR
DANPHLAFSYGQHNCIGAALARLELKAIFGSIFQRFPALRLAVAPEELKLRKEIITGG
FEEMPVLWCGRPPASQSSHLAAPGAHRSDQPLDR
113A Subfamily
CYP113A1 Saccharopolyspora erythraea
GenEMBL L05776 (1320bp) S51613 U82823 PIR B40634 (412 amino acids)
Stassi,D.L., Donadio,S., Staver,M.J. and Katz,L.
Identification of a Saccharopolyspora erythraea gene
required for the final hydroxylation step in erythromycin
biosynthesis.
J. Bact. 175, 182-189 (1993)
eryK erythromycin C-12 hydroxylase
Note: two different database entries have different start
codons. Neither is ATG.
113B Subfamily
CYP113B1 Streptomyces fradiae
GenEMBL U08223 (7082bp)
Merson-Davies,L.A. and Cundliffe,E.
Analysis of five tylosin biosynthestic genes from the tylIBA
region of the Streptomyces fradiae genome.
unpublished (1994)
CYP113B2 Streptomyces caelestis
cytochrome P-450 hydroxylase homolog (nidi)
GenEMBL AF016585 CDS complement(1-396) N-term only, 60% to 113B1
MVDSVTGPMELSKDANAKELLDWFSHNRTHHPVFWDEGRQAWQV
FRYDDYLTVSNHPEFFSSDFTEVAPTPPELEMILGPGTIGALDPPAHGPMRKLVSQAF
TPRRMAGQEQRIRVIAEELLDRVRGQKTIA
CYP113C1 Streptomyces virginiae
GenEMBL AB072568 4994..6202
46% to 113A1
gene = visD
MAQQTPPAPPSMADGGKAMLAWLRTMRDEHPVHEDQYGVFHVYR
HSDVLAVTSDPAVFSSDLSRLRPDSSALSEEILSVIDPPLHRKLRSLVSQAFTLRTVA
DLEPRVTELAGRLLEKVEGSEFDLVGDFAYPLPVIVIAELLGVPAEDRELFRGWSDRM
LSMQVDDPLEIQFGDEAGEDYERLVKEPLKEMHAYLQRHVDARRETPGDDLLSRLVTA
EIAGERLTDRQIVEFGALLLMAGHVSTSMLLGNTVLCLEENPETAAALRADRALISGV
IEEVLRMRPPITVAARVTTGEVVVGGVTIPKDRMVMASLLSANHDERHIQDPEVFDPR
RSPNPQLAFGHGIHYCLGGPLARLEGRVALEMLLDRFEDIRVTPGAPYDFHREGLFVP
ARSPLTVRRG
114 Family
CYP114A1 Bradyrhizobium japonicum
GenEMBL L02323 L12971 U12678 (11,715bp)
NC_004463 complete genome 2319222..2320511
Tully,R.E. and Keister,D.L.
Cloning and mutagenesis of a cytochrome P-450 locus from
Bradyrhizobium japonicum that is expressed anaerobically and
symbiotically
Appl. Environ. Microbiol. 59, 4136-4142 (1993)
Note: called BJ-3 see CYP112, CYP115P, CYP117
CYP114A2 Rhizobium sp. NGR234 plasmid pNGR234.
GenEMBL AE000082 CDS comp (9861..11264) gene = y4lC
NC_000914 complement(232170..233573)
cytochrome P450 BJ-3 homolog" 90% to CYP114A1
MDMQETTTACADAFAELASPACIDDPYPFMRWLREHDPVHRAAS
GLFLLSRHADICWALKATGDAFRGPAPGELARYFPRAATSLSLNLLASTLAMKEPPTH
TRLRRLISRDFTMREIDNLRPSIARFVAARLDGMAPALERGEAVDLHRQFALALPMLV
FAELFGMPQDDMFGLAAGIGAILEGLSPHASDPQLAAADAASARMKAYFGDLIQRKCI
DPRHDIVATLVGAHDDDADTLSDAELISMLWGMLLGGFATTAATIDHAVLAMLAYPDQ
RHWLQGDAAGVEAFVEEVLRCDAPAMFSSIPRIAQSDIELSGVVIPKNADVRVLIAAG
NRDPDAFADPDRFDPARFYGTSPGMSTDGKIMLSFGHGIHFCLGAQLARVQLAESLPR
IQARFPTLTVAEQPTREPSAFLRTFRALPVRLHAQGDSPRLTSAFLNGQRGVEGGASF
EHGDGERRSATDRRAQP
CYP114A3v1 Mesorhizobium loti
GenPept NP_106889
92% to 114A2
1 MDVQETTAAC RDAFAELASP ACIQDPYTFM RWLREHDPVH RAASGLFLLS RHADIYWALK
61 ATGDVFRGPA PGELARYFPR AETSLSLNLL ASTLAMKEPP THTRLRRLIS RDFTIRQIDN
121 LRPSIARIVA ARLDGMAPAL ERGEAVDLHW EFALAVPILV FAELFGMPQD DMFGLAAGIG
181 AILEGLSPHA SDPQLAEADA ASARVQAYFG DLIQRKRTDP RNDIVSMVVG AHDDDADTLS
241 DAELISMLWG MLLGGFATTA ATIDHAVLAM LAYPEQRHWL QGDAVGVKAF VEEVLRCDAP
301 AMFSSIPRIA QRDIELGGVV IPKNADVRVL IAAGNRDPDA FSDPDRFDPA RFYGTTPGMS
361 TDGKIMLSFG HGIHFCLGAQ LARVQLAESL PRIEARFPTL ALAEQPTREP SAFLRTFRAL
421 PVRLHAQGG
CYP114A3v2 Mesorhizobium loti
GenEMBL AL672112 complement(84020..85309)
Strain R7A symbiosis island
Gene = msi070
10 DIFFS with CYP114A3v1
CYP114A4 Rhizobium etli symbiotic plasmid p42d
NC_004041 56651..58252
90% to 114A3
gene = cpxP3
MDVQDTTAACHDAFAELASPACIQDPYPFMRWLREHDPVHRAAS
GLFLLSRHADIYWAFKATGDAFRGPAPSELARYFPRAASSLSLNLLASTLAMKEPPTH
TRLRRLISRDFTVGQIDNLRPSIARIVAARLDGMAPALERGEAVDLHREFALALPMLV
FAELFGMPQDDVFELSAIVSAILEGLSPHASDPQLAAADVASARVKAYFGDLILRKRA
DPRRDIVSTLVGAHTDDADTLSDAELISMLWGMLLGGFATTAATIDHAVLAMLAYPEE
RHWLQGDAAGVEAFVEEVLRCEAPAMFSSIPRIAQRDIELHGVVIPKDADVRVLIAAG
NRDPDAFADPDRFDPVRFYGTRPGMSSDGKIMLSFGHGIHFCLGAQLARVQLAESLPQ
IQARFPTLALAEQPTREPSAFLRTFRALPVRLHAQAAAEVRVVVDQDLCGTTGQCVLT
LPGTFRQREPDGVAEVCMATVPQALHAAVRLAASQCPVAAIRVIESEAGDDHCTNPGP
TPSPADAERHAAKDLRNPGEHDGTI
115 Family
CYP115A1P Bradyrhizobium japonicum
GenEMBL L02323 L12971 U12678 (11,715bp see 1351-1578)
NC_004463 complete genome 2317600..2317905
Tully,R.E. and Keister,D.L.
Cloning and mutagenesis of a cytochrome P-450 locus from
Bradyrhizobium japonicum that is expressed anaerobically and
symbiotically
Appl. Environ. Microbiol. 59, 4136-4142 (1993)
Note: called BJ-2 see CYP112, CYP114, CYP117
Note: This gene fragment has a perfectly good P450 sequence
of 76 amino
acids that includes the C-terminal up to a stop codon.
This may be a fragment of another intact P450 that was
broken up or
rearranged during cloning. A pseudogene would be expected
to have lost
integrity slowly and the whole gene should fade at about
the same rate.
This fragment is good but no upstream region continues it.
GDADRFDVTRRHNPHLSFGQGPHFCLGAALARLELGCAFPAL
FVRLEHLALTIAAEDVVYMPSYVIRCPQRLPVTFRPSIA
CYP115A2v1 Mesorhizobium loti
GenPept NP_106680 88% to CYP115A1P 39% to 154C1 41% to 154A1
1 MPAAPTQLDR LSSAILRQGG MARVSLPGDV VTWAAARHQT LRQMLSDQRF NKDWRQWRAL
61 QDGEIPEDHP LIGICKVDNM TTAHGADHRR LRGLLSSSFA PSRIALLAPR VEQCVDRLLA
121 EMAQRGGSAD LMSEFAAPLP TNVIAELFGL PDEQREEIVA LTYSLASTSA TAEEVRQTRQ
181 RIPEFFRRLI ALKRGQLGDD LASALIVARD KGELVSDTEL IDMLFMVLSA GFVTTAGVIG
241 NGVLALLTHP QQLHLVRSGQ VPWSQAIEEI LRWGTSAANL PFRYATQDVE IDGCLVRRGD
301 AVLMAFHAAN RDEKAFGPGA NRFDVTRRHN PHLSFGEGPH SCLGAALARL ELRCAFPPLF
361 GRLEDLALTI AAEDVVYMPS YVIRCPQRLP VSFRPSVA
CYP115A2v2 Mesorhizobium loti
GenEMBL AL672113 41375..42607
Strain R7A symbiosis island
Gene = msi159
10 DIFFS with CYP115A2v1
CYP115A3P Rhizobium etli symbiotic plasmid p42d
NC_004041 54883..55296
70% to 115A1P 70% to 115A2
gene = cpxP1 pseudogene C-terminal
ANSYGRPTYGDTDMFDFNRLQNPHLPLGQGPHLCLGAALARLELGSVFPPPFVRPEDLALAIAAE
116 Family
CYP116A1 Rhodococcus erythropolis
GenEMBL U17130 (6458bp)
Nagy,I., Schoofs,G., Compernolle,F., Proost,P., Vanderleyden,J.
And De Mot,R.
Degradation of the thiocarbamate herbicide EPTC (S-ethyl
dipropylcarbamothioate) and biosafening by Rhodococcus sp. NI86/21
involve an inducible cytochrome P-450 system and aldehyde
dehydrogenase.
unpublished
CYP116B1 Ralstonia metallidurans
GenEMBL NZ_AAAI01000322
25751..28093 gene = Reut3205
52% to CYP116A1 with C-term. Extension
extension may contain a reductase and a ferredoxin component
MPQTNAPASSGSCPIDHSALRAPNGCPISHQAAAFDPFEDGYQQ
DPPEYVRWSRAQEPVFYSPKLGYWVVTRYDDIKAIFRDNITFSPSIALEKITPTGEAA
NAVLASYGYAMNRTLVNEDEPAHMPRRRALMEPFTPAELAHHEPMVRKLTREYVDRFI
DTGRADLVDEMLWEVPLTVALHFLGVPEEDMDLLRQYSIAHTVNTWGRPKPEEQVAVA
HAVGNFWQLAGRILDKMREDPSGPGWMQYGLRKQRELPEVVTDSYLHSMMMAGIVAAH
ETTANASANAIKLLLQHPDVWREICEDPALIPNAVEECLRHNGSVAAWRRLVTRDTEV
GGMSLAAGSKLLIVTSSANHDEHHFADADLFDIHRDNASDQLTFGYGSHQCMGKNLAR
MEMQIFLEELTSRLPHMRLAGQRFTYVPNTSFRGPEHLWVEWDPARNPERTDPTVLAP
RDAVRIGEPTGGTTGRTLIVERVETAAQGVSRIRLVSPDGRALPRWSPGSHIDIECGH
TGISRQYSLCGDPADTSAFEIAVLREPESRGGSAWIHASLRAGDKLKVRGPRNHFRLD
ETCRRAIFIAGGIGVTPVSAMARRAKELGVDYTFHYCGRSRASMAMIDELRALHGDRV
RIHAADEGQRADLAQVLGAPDANTQIYACGPARMIEALEALCATWPEDSLRVEHFSSK
LGTLDPSREQPFAVELKDSGLTLEVPPDQTLLATLRAANIDVQSDCEEGLCGSCEVRV
LAGEIDHRDVVLTRGERDANNRMMACCSRAAKGGKIVLGL
CYP116B2 Rhodococcus sp. NCIMB 9784
GenEMBL AF459424
66% to 116B1 over full fusion protein length
extension may contain a reductase and a ferredoxin component
MSASVPASAPACPVDHAALAGGCPVSANAAAFDPFGSAYQTDPA
ESLRWSRDEEPVFYSPELGYWVVTRYEDVKAVFRDNILFSPAIALEKITPVSAEATAT
LARYDYAMARTLVNEDEPAHMPRRRALMDPFTPKELAHHEAMVRRLTREYVDRFVESG
KADLVDEMLWEVPLTVALHFLGVPEEDMATMRKYSIAHTVNTWGRPAPEEQVAVAEAV
GRFWQYAGTVLEKMRQDPSGHGWMPYGIRKQREMPDVVTDSYLHSMMMAGIVAAHETT
ANASANAFKLLLENRAVWEEICADPSLIPNAVEECLRHSGSVAAWRRVATADTRIGDV
DIPAGAKLLVVNASANHDERHFERPDEFDIRRPNSSDHLTFGYGSHQCMGKNLARMEM
QIFLEELTTRLPHMELVPDQEFTYLPNTSFRGPDHVWVQWDPQANPERTDPAVLHRHQ
PVTIGEPAARAVSRTVTVERLDRIADDVLRLVLRDAGGKTLPTWTPGAHIDLDLGALS
RQYSLCGAPDAPSYEIAVHLDPESRGGSRYIHEQLEVGSPLRMRGPRNHFALDPGAEH
YVFVAGGIGITPVLAMADHARARGWSYELHYCGRNRSGMAYLERVAGHGDRAALHVSE
EGTRIDLAALLAEPAPGVQIYACGPGRLLAGLEDASRNWPDGALHVEHFTSSLAALDP
DVEHAFDLELRDSGLTVRVEPTQTVLDALRANNIDVPSDCEEGLCGSCEVAVLDGEVD
HRDTVLTKAERAANRQMMTCCSRACGDRLALRL
117 Family
CYP117A1 Bradyrhizobium japonicum
GenEMBL L02323 L12971 U12678 (11,715bp)
NC_004463 complete genome 2321653..2322996
Tully,R.E. and Keister,D.L.
Cloning and mutagenesis of a cytochrome P-450 locus from
Bradyrhizobium japonicum that is expressed anaerobically and
symbiotically
Appl. Environ. Microbiol. 59, 4136-4142 (1993)
Note: called BJ-4 see CYP112, CYP114, CYP115P
CYP117A2 Rhizobium sp. NGR234 plasmid pNGR234a
GenEMBL AE000082 complement(7357..8700) U00090
NC_000914 complement(229666..231009) gene = y4kV
Freiberg,C., Fellay,R., Bairoch,A., Broughton,W.J., Rosenthal,A.
and Perret,X.
Molecular basis of symbiosis between Rhizobium and legumes
Nature 387 (6631), 394-401 (1997)
about 90% identical to 117A1
MNVLLNPLNRRHRLRYDIPVMPGAFPLVGHLPAIVCDLPRLLRR
AERTLGSHFWLDFGPAGHLMTCVDPHAFALLRHKDVSSALIEEIAPELLGGTLVAQDG
GAHRQARDAIKAAFLPEGLTQAGIGDLFAPVIRARVQAWRDRGDVTILPETGDLMLKL
IFTLMGVPAQDLPGWHRKYRQLLQLIVAPSVDLPGLPLRRGRAARDWIDAQLRQFVRD
ARAHAARTGLINDMVSAFDRSDDALSDDLLVANIRLLLLAGHDTTASTMAWMVIELAR
QPMLWDALVEEAQRVGAVPTRHADLEQCPVAEALFRETLRVHPATTLLPRRALQELQL
GQRRIPAGTHLCIPLLHFSTSALLHEAPDQFRLARWLQRTEPIRPVDMLQFGTGPHVC
IGYHLVWLELVQFSIALALTMHKAGVRPLLLSGVEKGRRYYPTAHPSMTIRIGFS
CYP117A3 Mesorhizobium loti
GenPept NP_106891
NC_002678 complete genome 5191629..5192972
locus_tag = mlr6367
94% to 117A2
1 MDMLLNPLDR RHRLRDDIPV VPGAFPLVGH LPAIVCDLPR LLRRAERTLG SHFWLDFGPA
61 GHLMTCVDPD AFALLRHKDV SSALIEEIAP ELLGGTLVAQ DGGAHRQARD AIKAAFLPKG
121 LTQAGIGNLF APVIQARVQA WRDRGDVTIL RETGDLMLKL IFSLMGIPAQ DLPGWHRKYR
181 QLLQLIVAPP VDLPGLPLRR GRAARDWIDA QLRQFVRDAR AHAARTGLIN DMVSSFDRGD
241 DALSDDVLVA NIRLLLLAGH DTTASTMAWM VIELARQPGL WDALVEEAQR VGAVPTRHAD
301 LAQCPVAEAL FRETLRVHPA TTLLPRRALQ ELQLGQRRIP AGTPLCIPLL HFSTSALLHE
361 APDQFRLARW LQRTEPIRPV DMLQFGTGPH VCIGYHLVWL EMVQFCIALA LTMHKAGVRP
421 RLLSAVEKGR RYFPTAHPSM KIRIGFS
CYP117A3v2 Mesorhizobium loti
GenEMBL AL672112 complement(81551..82888)
Strain R7A symbiosis island
Gene = msi068
2 DIFFS with CYP117A3v1
CYP117A4 Rhizobium etli symbiotic plasmid p42d
NC_004041 59081..60424
85% to 117A2
gene = cpxP4
MDMLLNPLNRWRRLRDDIPVMPGAFPLVGHLPAIVCDLPRLLRR
AERTLGSHFWLDFGPAGHLMTCLDPDALALLRHKEVSSALIEEMAPDILGGTLVTLDG
SAHRQARDGIKAAFLPRGLTEAGIGELFEPIIRAQVKAWRDRGEVAILPDTRNLMLKL
TFSLMGIPAQDLSEWHRKYRQLLQLMVAPPIDLPGMPLRRGRAARDWIDAQSRQFIRD
ARARAARTGLINDMVSAFDCSDGALSDDVLVANIRLLLLAGHETSASTIAWMVIELAQ
HPELWDALVEEAQRVGAVPTGHEDLAQCPVAEALFRETLRMHPASSLVPRRAMQELQL
GQRRIPSGTHLCIPLLHFSTSPLLHEAPDQFRLGRWLQRTEPIRPVDMLQFGAGPHVC
MGYHLVWLELVQFSIALALTMQEAGVRPRLMSGVEKGRRYYPTAHPSMTVRIGFS
118 Family
CYP118P1 Mycobacterium leprae
GenEMBL L04666 (40,123bp)
Smith,D.R.
M. leprae cosmid dna sequence
Unpublished (1992)
Note 15,700 to 17,350 is the region of interest
CYP118P1 Mycobacterium leprae
GenPept CAC31116
NC_002677 547312..547788 locus_tag = ML0447
NC_002677 complement(2562932..2563627) locus_tag = ML2159
(a duplication of the seq.)
putative fatty oxidation complex alpha subunit
Sequence below is from TIGR primary nucleotide sequence for ML2159
CYP118 exact match, 49% to 102C1
4 TASQHDDILDIMLYSADPSTGEQLDTDNVVNQILTLLVSGSQTLANAIAFALHYLLSIHH 183
184 DIAAQTRREIYQNRSDRGIANVSY
258 FGDVVKLRCLRRVVDATLRLWS
VPCYLRQARRD 360
361 TTLGNGTSLFHKGQWVIVLLTAPMPG
WGPDANEFNPDRXXXXXXXXXXXXXXXX 470
520 FGTGLRTCIGRRFALHEMALELTMIVHQYILSRADPG
YCLSISEAFTLKTVGL 677
119 Family
CYP119A1 Sulfolobus solfataricus (an archaebacterium)
GenEMBL U51337 (1254bp)
Wright, R.L., Harris, K., Solow, B., White, R.H. and Kennelly, P.J.
Cloning of a potential cytochrome P450 from the Archaeon Sulfolobus
solfataricus.
FEBS. Lett. 384, 235-239 (1996)
CYP119A2 Sulfolobus tokodaii
GenPept BAB66184
64% to CYP119A1 U51337 Sulfolobus solfataricus
1 MYDWFKQMRK ESPVYYDGKV WNLFKYEDCK MVLNDHKRFS SNLTGYNDKL EMLRSGKVFF
61 DIPTRYTMLT SDPPLHDELR NLTADAFNPS NLPVDFVREV TVKLLSELDE EFDVIESFAI
121 PLPILVISKM LGINPDVKKV KDWSDLVALR LGRADEIFSI GRKYLELISF SKKELDSRKG
181 KEIVDLTGKI ANSNLSELEK EGYFILLMIA GNETTTNLIG NAIEDFTLYN SWDYVREKGA
241 LKAVEEALRF SPPVMRTIRV TKEKVKIRDQ VIDEGELVRV WIASANRDEE VFKDPDSFIP
301 DRTPNPHLSF GSGIHLCLGA PLARLEARIA LEEFAKKFRV KEIVKKEKID NEVLNGYRKL
361 VVRVERA
120 Family
CYP120A1 Synechocystis sp. (strain PCC6803) Cyanobacterium
GenEMBL D64003(113064bp)
coding region 62160-63494
Kaneko,T., Tanaka,A., Sato,S., Kotani,H., Sazuka,T., Miyajima,N.,
Sugiura,M. and Tabata,S.
Sequence analysis of the genome of the unicellular cyanobacterium
Synechocystis sp. strain PCC6803. I. sequence features in the 1Mb
region from map positions 64% to 92% of the genome
DNA Res. 2,153-166 (1995)
note: gene slr0574 (previously had incorrect gene identifier here)
NT01NS3472 Nostoc sp. PCC 7120 in TIGR not in Genbank
40% to CYP120 aa 399-443
MEMKIVAAHLLRRYHWEILPNQSLDSVLVPTNQPQDGLRVRFQPL
CYP120A2 Trichodesmium erythraeum
NZ_AABK02000021
complement(1844..2800) gene = Tery2088
318aa (short) 57% to 120A1 (missing N-term 127aa)
MTANYLEKWVEMGTLTWYPEIRNYTFDIASLLFMGSDESSQTKL
VSLFEEWVKGLFSIPLSLPWTRFGKSLRCRQKLLQHIEEIILQRQQQQNLGEDALGIL
LQAQDKEVNGLSLDELKDQILLLLFAGHETLTSAIASFCLLTSQHLDVLTRLRQEQKQ
FSAIEPLTLENLKRMTYLDMVLKEVLRLIPPVGGGFRQVTQDCEFCGYSIPKGWLVQY
QIAKTHQDETLYPDDKNFDPERFAPENAVDKQKVFGYVPFGGGMRECLGKEFARLEMK
IFAVMLLRGYEWELLPEQDLSVVAAPTPYPRDGLKVKFRKVE
CYP120B1 Nostoc punctiforme
NZ_AAAY02000018.1
complement(62382..63695) gene = Npun4299
43% TO CYP120A1
MKTNQIPPGSFGLPVLGETLSFVFDRDFAKKRYHQYGPIFKTHL
LGRPTVVMAGPEALEFVLSSHIENFSWREGWPDNFKTLLGESLFLQDGEEHRRNRRLM
MPALHGPALASYFSTMEDITRSYLQKWEKKQEFTWFQEFKQLTFDIASQLFLGTRPGP
ECVRLSQLFTTLTNGLLAINPLPLPFTTFGKAIAARNEILEHLTQVVRERQQNPTQDT
ISLLIKAKDEDGNSLSEKEIIAQAVLLLFAGHETTTSMLTWLCTELACHPEVLEKARV
EQLQLASQGDLDLEQLGKMPYLEQVLWEVERLHQPVGGGFRGVIKDFELNGYHVPTGW
QLYYSIGVTHQIEEIYSEPELFDPDRFSPQRQEHKKYPFSLVGFGGGPRICIGIAFAK
MEMKIVAAHLLRSYHWEILPNQSLEVVAVPTNRPKDGLRVRFQPR
CYP120C1 Nostoc punctiforme
NZ_AAAY02000127.1 GenPept ZP_00106106.1
8154-9512 gene = Npun477
44% to 120B1 36% to CYP120A1
MQQLKSAEEIPGSYGLPILGETLEIFRDSELYLWRRFQQYGSVF
KTSVLGRKRAYLIGPSANRLVLVEQAENMSSRIGWYFLESTFGNNILLQDGEEHRLTR
RLMYPAFHGKAIATYFDTIQNIVQDFLKDWGERGTISLNSSFRQLTLMIATRLFLGSQ
NKSEVEQTSQWFTQLLDSSMAIFKWNVPFTLYGRGQNARGKLVAFLREAIAQRIEQGN
LEESKDVLGLLLAAVDEDGNKLSETQVINEALLLLFAGHETTASLLTWVIFELGNHPE
WRERLRQEQLAVVGNNPLSLSHLKQFPQLTNVLKEAERLYPPVYAYNRGVLKDIEYGG
YRIPAGWFVTISPMLTHRLPELYTEPDRFDPDRFAPPREEDKKHPLALMGFGYGSHSC
LGMEFAQMEMKIVLSTLLRHYDWTVKPDYSAIAPVRQPSKVKDILQAYIEPLLIKHPL
DS
121 Family
CYP121A1 Mycobacterium tuberculosis
GenEMBL Z77163 (42861bp) gi 1449344 Rv2276
complement (32358 to 33548)
unpublished
CYP121A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 2526703..2527893
Gene = cyp121 100% match
locus_tag = Mb2299
122 Family
CYP122A1 Streptomyces sp.
GenEMBL U65940( 2500bp)
nearly identical to rapJ gene of St. hygroscopicus involved in
rapamycin biosynthesis
CYP122A2 Streptomyces hygroscopicus
GenEMBL X86780 (107379bp)
coding region 96465-97625
rapJ
CYP122A3 Streptomyces hygroscopicus var.
GenEMBL AF235504 CDS 71460..72626
gene="fkbD"
note="C9 hydroxylase" 89% to 122A1 77% to 122A2
123 Family
CYP123 Mycobacterium tuberculosis
GenEMBL Z80226 (34809bp) gi 1550644 Rv0766c
complement (8322-9530)
CYP123 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(861053..862261)
Gene = cyp123 100% match
locus_tag = Mb0789c
124 Family
CYP124A1 Mycobacterium tuberculosis
GenEMBL Z77163 (42861bp) gi 1449354 Rv2266
complement (39907-41193)
CYP124A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 2519058..2520344
Gene = cyp124 100% match
locus_tag = Mb2289
CYP124B1 Streptomyces cinnamonensis
GenEMBL AF440781 93981..95273
polyether antibiotic monensin biosynthesis gene cluster
41% to CYP124
gene = monD
MGLTVGPDNAKRGIVPITDSKPAATFPDLVDPSFWARPHAERVA
LFEEMRGLPRPAFIRQNMPGVPWTFGYHALVKYADIVEVSRRPQDFSSNGATTIIGLP
PELDEYYGSMINMDNPEHSRLRRIVSRSFGRNMIPEFEAVATRTARRIIDELIARGPG
DFIRPVAAEMPIAVLSDMMGIPAEDHDFLFDRSNTIVGPLDPDYVPDRADSERAVIEA
SRELGDYIAGLRAERLAAPGNDLITKLVQVQADGEQLTRQELVSFFILLVIAGMETTR
NAISHALVLLTEHPEQKQLLLSDFDTHAPNAVEEILRVSTPINWMRRVATRDCDMNGH
RFRRGDRIFLFYWSGNRDESVFPDPYRFDITRGTNAHVTFGAVGPHVCLGAHLARMEI
TVLYRELLAALPQIHAVGQPRRLDSSFIEGIKHLHCAF
CYP124B2 Streptomyces nanchangensis NS3226
GenEMBL AF521085 complement(100196..101467)
polyether ionophore nanchangmycin biosynthetic gene cluster
41% to CYP124
gene = nanP
MNRGVVSPTEATPASSAKATRPPDFMDPSFWLRPRDERAEVFEK
LRALPGPEFVPPRLPWGPLASGYYALSKHADICEVSRRPQDFSSEGATAILPPEMDEF
YGSMINMDNPEHSRLRRIVARSFGRGMAPKFDAMSRRVARRIVDELIERGPGDFIRPA
AEMPIAVLSTMMGIPGEDYEFLFERTNTIMGGADPELAADPEKMAAAVLGALRDLGDY
IGRLREDRLARPGPDVITKLVQVQEDGEQLTNQELVSFFILLINAGMETTRNVIAQAL
VLLTEHPDQRQLLLSDFELHAKGAVEEILRVGTPINWMRRTATGDCEMNGHRFRKGDE
IFLFYWSANHDEKVFEDAYRFDITRDPNPHLSFGAVGPHFCLGAHLARIEIIAMLREL
LASLPDIRVEGEPVRLASSFIEGFKELSCTF
125 Family
CYP125A1 Mycobacterium tuberculosis
GenEMBL Z82098 (34154bp) gi 1666115 Rv3545c also AD000003
coding region 8135-9436
CYP125A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(3927359..3928660)
Gene = cyp125A1 100% match
locus_tag =Mb3575c
CYP125A2 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV5841 57% to CYP125A1 from Mycobacterium tuberculosis
CYP125A3P Mycobacterium leprae
GenPept CAC30983
NC_002677 2415021..2416227
locus_tag="ML2024
Sequence below is from TIGR primary nucleotide sequence for ML2024
51% to CYP125A1 Rv3545c Z82098 Mycobacterium tuberculosis
1 PGFDFPDPEIYTEQLSV*EPAEMCQAETI**NEQPIGRSGFYDDDY 138
XXXXXXXXXXXXXX
174 HSGTFSNLEKTALACYQEGMNDEQISRGKLVLLNIDASQYTRLHKIISPGFIP*AAEQLR 353
354 DDLXXXXXXXXXXXXXXX 362
410 SGDFVEHVSCELSRQAAIAGLPSG 481
480 VPQEDCKKLFHWSN 521
522 QTVGAQDPKFATNDPMVTSVKLIM*AMQIAADRAKPLGQVIVTNLVEADIEGHKLSKDEFGSF 710
713 VIMLTAAGKENTRNCIMQSMMQFTNFPD*WELYK 814
816 KKAPGTTADKIIRQATLVMS 875
876 FQRTVLK*YELSSVSIKKGQRVVVIYRSANFDEKVLTIRLPCSIMRNPT 1022
1022 PHAGFNDTNVHYCIGIN 1072
1073 LARMTIDRMFHAIAESMPNL*STGKPK*LRSGWLNGVKHWQVD 1201
CYP125A4 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
75% to 125A2 before frameshifted region
clone name SP0266
126 Family
CYP126A1 Mycobacterium tuberculosis
GenEMBL Z80226 (34809bp) gi 1550656 Rv0778
coding region 20888-22132
CYP126A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 873620..874864
Gene = cyp126 100% match
locus_tag = Mb0801
CYP126A2P Mycobacterium leprae
GenPept CAC31567
NC_002677 complement(1384839..1385327)
locus_tag="ML1185
Sequence below is from TIGR primary nucleotide sequence for ML1185
37% to CYP126 C-terminal
184 DRSLIPSAIEEGSRSETPNWASVTRITIA*LAIGGKTILPNAGVDILMGSANRDGSRWTE 363
364 PNTFDIHWPRQAHTTLAGSHMCLGIGLAQLDTRVMLNNLFD 486
127 Family
CYP127A1 Rhizobium sp.
GenEMBL Z68203(34010bp)
coding region 29431-30675
also AE000101 Rhizobium sp. NGR234
CYP127A2 Rhizobium sp. BR816
No accession number
Ellen Luyten
Submitted to nomenclature committee 4/12/2000
73% identical to CYP127A1
CYP127A3v1 Mesorhizobium loti
GenPept NP_106463
NC_002678 complete genome 4745586..4746803
79% to 127A2
1 MAINPVPDHV PPEMVRDFSL FTSPGMPPTP NGDPHAAVAC AHDGPPIFYS PYNTQDGRGT
61 WVITRAADQR KVLQDTETFS SHRSIFSSIL GETWPTIPLE LDPPAHGAFR SLLSPLLSPK
121 RVTALEPAVR ERAIALIDRI TASATSCDVM KDFAFPFTVS IFLRFLGLPD QGLDTFVGWA
181 KDLLHGDDVE RPVAARKIVA FIDELATNRR KDPVDDLMTF IVQAQIEGRR LTDGEIRGIG
241 VLVFVAGLDT VAAAIGFDLA YLARNLKDQE LLRSEPARIL LATEELLRAY PPIQLIRVAT
301 KDIDFEGAPI RKGDYVSCAT MIANRDPEEF ESPNTVDLAR DHNRHAAFGY GPHRCLGSHL
361 ARREIVIGLE EWLARIPTFR IKEGTAPITC GGHVFGIENL ILDWS
CYP127A3v2 Mesorhizobium loti
GenEMBL AL672114 complement(100678..101895)
Strain R7A symbiosis island
Gene = msi332
2 DIFFS with CYP127A3v1
CYP127A4 Rhizobium etli symbiotic plasmid p42d
NC_004041 97484..98974
81% to 127A2
gene = cpxA5
MHLCSERIYRKRGTRENPMSTGRAGEASKKFRLRPTKQRGFRAA
RRSDRCIACHWRLALLRLEIWRSTILLAPSPRRIRSRRRGFDDRRKAVATIRVPEHVP
PEMVKDFSLFTSPGMERMPNGDPHAAVACLHNGPRIFYSPCNTRDGRGTWVIVRAQDQ
RKLLQDTGTFSSHRSLFASALGENWPLIPLELDPPAHSVFRSLLNPLLSPRRIMELEP
AVRDRAIALISKISASSTSCDILTDFAFPFAVSIFLRLLGLSDERLNTFVGWGKDLLH
GDGIRRTAAARTILAFIDELAAMRRKEPADDFMTFVVQAKVDGRLLRDQEIHGIGVLL
FVAGLDTVATAIGFDLAYLARNPTEQELLRSKPDRIVLAAEELLRAYSTVQMIRVATK
DINFEGAPIRKGDYISCATMIANRDPVEFENPNTIDLAREDNRHTAFAYGPHRCLGSH
LARREIIIGLEEWLSRIPDFRIKDGTAPITYGGHVFGMENLILDWS
128 Family
CYP128A1 Mycobacterium tuberculosis
GenEMBL Z77163 (42861bp) gi 1449352 Rv2268c
coding region 37021-38490
CYP128A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(2521761..2523230)
Gene = cyp128 100% match
locus_tag = Mb2291c
129 Family
CYP129A1 Steptomyces sp.
GenEMBL U50973(3196bp)
Dickens,M.L. and Strohl,W.R.
Isolation and characterization of a gene from Streptomyces sp.
strain C5 that confers the ability to convert daunomycin to
doxorubicin on Streptomyces lividans TK24
J. Bacteriol. 178, 3389-3395 (1996)
gene name doxA
CYP129A2 Streptomyces peucetius
GenEMBL U77891 CDS comp (83..1330)
gene="doxA"
product="daunorubicin C-14 hydroxylase" 94% to 129A1
130 Family
CYP130A1 Mycobacterium tuberculosis
GenEMBL Z77137 (36096bp) gi 1480330 Rv1256c
coding region 30691-31908 cy50.26
CYP130A1X Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome
CYP130 lies in a deletion in M. bovis
CYP130A2P Mycobacterium leprae strain TN
GenPept AL583920.1
59% to CYP130A1
VMSHRFRFTTADIWPNPWSMYRTLRDHEAVHHVVPANQPEDDYYVLPRHADVWSMAMRS
HAKLSSAQRLTVNYSDMELIGLQDNPPMVMQDQPV*TKCRKLVSRRFTPRQTNVVEPKVR
HFVVEHIEQLRAKGSVDIVTELFKPLPPMVVAHYFGFPEKVRSQFDGW
TTAADGGGALFRFPRKSPITIRRLAPAIVAANTADAGGITNELDVAGYAVESMLAYFTR
IATGGNNTVTGMLGG*MPL
SHRRKQHRHWHARRLDAVKDTAEAD
LLRLTSSVRGLMRTTTRDVAIGHTTVSPGRRVLMRYGQAKRDER*YSAAAS*LDVTW*
PPNILIFSHGAH
YLGAKVTRMQRR
VRLTELLARYPDFEVDESSIAWAGGKLHTTP
131 Family
CYP131A1 Streptomyces peucetius
GenEMBL L47164(3444bp)
coding region 32-1348
gene dnrQ duanosamine biosynthesis
possible sequence errors at C-terminal (no recognizable signature sequence
in the last 68 amino acids)
CYP131A2 Streptomyces sp.
GenEMBL L35154 (4134bp)
3838-4134 N-terminal fragment 94% identical to L47164
gene dauQ daunomycin biosynthesis
132 Family
CYP132A1 Mycobacterium tuberculosis
GenEMBL Z80108 (40778bp) gi 1542902 Rv1394c
complement (9842-11227)
most often matches CYP4 family in blast search
CYP132A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(1566263..1567648)
Gene = cyp132 1 aa diff
locus_tag = Mb1429c
133 Family
CYP133A1 Erwinia herbicola
Randy S. Fischer, Roy A. Jensen
First P450 from an enteric bacteria (similar to E. coli)
submitted to nomenclature committee
CYP133B1v1 Xylella fastidiosa, section 35 of 22.
AE003889 CDS complement(3751..4959)
82% to AE003887 48% to CYP133A1
CYP133B1v1 Xylella fastidiosa 9a5c
GenPept AAF83187
100% match
1 MKLTDLSNPA FLENPYPLYE TLRAQAPFVS IGPNALMTGR YSLVDSLLHN RNMGKKYMES
61 MRVRYGDSAA DMPLFQAFSR MFITINPPAH THLRGLVMQA FTGRESESMR PLAIDTAHQL
121 IDNFEQKPSV DLVAEFAFPF PMQIICKMMD VDIGDAVTLG IAVSKIAKVF DPSPMSADEL
181 VHASTAYEEL AQYFTKLIEL RRTHPGTDLI SMFLRAEEDG EKLTHDEIVS NVIMLLIAGY
241 ETTSNMIGNA LIALHRHPEQ LALLKSDLSL MPQAVSECLR YDGSVQFTMR AAMDDIEVEG
301 ELVPRGTVVF LMLGAANRDP AQFTHPDQLD ITRKQGRLQS FGAGIHHCLG YRLALIELEC
361 ALTTLFERLP HLRLAHLDAL NWNQRSNLRG VNTLIVDLHA KN
CYP133B1v2 Xylella fastidiosa Temecula1
GenPept AAO29526
6 diffs to CYP133B1v1
1 MKLTDLSNPA FLENPYPLYE TLRAQAPFVS IGPNALMTGR YSLVDSLLHN RNMGKKYIES
61 IRLRYGDTAA DMPLFQAFSR MFITINPPAH THLRGLVMQA FTGRESESMR PLAIDTAHQL
121 IDNFEQKPSV DLVAEFAFPF PMQIICKMMD VDIGDAVTLG MAVSKIAKVL DPSPMSADEL
181 VHASTAYEEL AQYFTKLIEL RRTHPGTDLI SMFLRAEEDG EKLTHDEIVS NVIMLLIAGY
241 ETTSNMIGNA LIALHRHPEQ LALLKSDLSL MPQAVSECLR YDGSVQFTMR AAMDDIEVEG
301 ELVPRGTVVF LMLGAANRDP AQFTHPDQLD ITRKQGRLQS FGAGIHHCLG YRLALIELEC
361 ALTTLFERLP HLRLAHLDAL NWNQRSNLRG VNTLIVDLHA KN
CYP133B1v3 Xylella fastidiosa Dixon
NZ_AAAL01000071 complement(6849..8057)
98% to 133B1v1 7 diffs 97% TO 133B1v2 11 diffs
gene = XfasA0474
MKLTDLSNPAILENPYPLYETLRAQAPFVSIGPNALMTGRYSLV
DSLLHNRNMGKNYMESMRVRYGDSAADMPLFQAFNRMFITINPPAHTHLRGLVMQAFT
GRESESMRPLVIDTAHQLIDNFEQKPSVDLVAEFAFPFPMQIICKMMDVDIGDAVTLG
MAVSKIAKVFDPSPMSADELVHASTAYEELAQYFTKLIELRRTHPGTDLISMFLRAEE
DGEKLTHDEIVSNVIMLLIAGYETTSNMIGNALIALHRHPEQLTLLKSDLSLMPQAVS
ECLRYDGSVQFTMRAAMDDIEVEGELVPRGTVVFLMLGAANRDPAQFTHPDQLDITRK
QGRLQSFGAGIHHCLGYRLALIELECALTALFERLPHLRLAHLDALNWNQRSNLRGVN
TLIVDLHAKN
CYP133B2v1 Xylella fastidiosa, section 33 of 22
AE003887 CDS 6723..7925
82% to AE003889 48% to CYP133A1
CYP133B2v2 Xylella fastidiosa Ann-1
NZ_AAAM01000051 complement(2764..3966)
97% TO 133B2v1 8 diffs
gene = XfasO1476
MKLADLSSPAFLENPYPLYETLRRQGPFVSIGPNALMTGRYSIV
DGLLHNRNMGKSYMESIRVRYGDDALDMPLFQGFNRMFLMLNPPVHTHLRGLVMQAFT
GRESESMRPLATDTAHRLIDDFEQKSSVDLVTEFSFPLPMRIICRMMDVDISDAISLS
VAVSNIAKVFDPAPMSPDELVHASAAYEELAHYFTRLIELRRAQPGTDLISMLLRAEE
EGQKLTHDEIVSNVILLLLSGYETASNMIGNALIALHRHPKQLARLKSDLSLMPQTVL
ECLRYDGSVQFTVRAAMDDVSIEGDVVPRGTIVFLMLGAANRDPAQFTDPDHLEITRK
QGRLQSFGAGVHHCLGYRLALVELECALTVLLERLPHLRLANLDTLSWNQRGNLRGVN
ALIADLHP
CYP133B2v3 Xylella fastidiosa Dixon
NZ_AAAL01000066 complement(2275..3477)
97% TO 133B2v1 9 diffs 10 diffs to CYP133B2v2
gene = XfasA0420
MKLADLSSPAFLENPYPLYETLRRQGPFVSIGPNALMTGRYSIV
DGLLHNRNMGKSYMESIRVRYGDDALDMPLFQGFNRMFLMLNPPVHTHLRGLVMQAFT
GRESESMRPLAIDTAHRLIDDFEQKSSVDLVTEFSFPLPMRIICRMMHVDISDAISLS
VAVSNLAKVLDPAPMSPDELVHASAAYEELAHYFTRLIELRRAQPGTDLISMLLRAEE
EGQKLTHDEIVSNVILLLLGGYETTSNMIGNALIALHRHPKQLARLKSDLSLMPQAVL
ECLRYDGSVQFTIRAAIDDVSIEGDVVPRGTIVFLMLGAANRDPVQFTDPDHLEITRK
QGRLQSFGAGVHHCLGYRLALVELECALTVLLERLPHLRLANLDTLSWNQRGNLRGVN
ALIADLHP
CYP133B3 Xanthomonas axonopodis pv. citri str. 306
GenPept AAM38014
56% to 133B2
1 MLLSDLATPQ FRHDPYPTYA RLREEGPLVQ VADGRLMSGR YAVVDRLLSD RRVGRDYLQS
61 VRLRYGEAAV HLPLFQGMSR MFLLLNPPLH TQLRGLMTQA FGARQMESMR EVASDIAAGL
121 IDAFQANGHC DLLTEFAFPL PIAIICRMLD IAAADVTALS HATSALAKVF DPMMTAEELQ
181 ATSVAYDQLA TYFHGVIAQR RSAGGDDLIA RFIQAEDNGR RLSEEEIVSN VILLFFAGHE
241 TTSNMICNAL VALHRHPQQL RLLQETPGLL PNAVLECMRY DSSVQMATRT ALQDFEIEGV
301 AVPRGTMLYL MLGAANHDTL QFTDPQVLDI RRQQGRALSL GGGIHHCLGN RLALIEVEAA
361 LACLLARLPA LRLEQLDTLS WNDRANLRGV DALLASW
CYP133B4 Xanthomonas campestris pv. Campestris str. ATCC 33913
GenPept AAM42318
BioI biotin synthesis
52% to 133B2 65% to 133B3
1 MQLSDFATPA FRQDPYPMYA RLRAAGPLVQ ISDNGWVSGH YTVVDALLSD RRVGRNYLDS
61 IRVRYGANAA EMPLFQGMSR MFLLLNPPVH TQQRALMTKA FGARQLEALR EVAVDTADAL
121 LDQHEDRRSC DLLNDFAMPM TISLICRMLG LAVTDVAALG QASSALAKVF DPLMRPEDMA
181 QATAAYTTLE QYFRAIVLQR RDTQEDDLIA RLIAAEDHGQ RMPVDDIVSN VIMLFTAGHE
241 TTANMICNAL IALHRHPEQL QLLRDTPTLM PNAVLECMRY DSSVQVAMRS VLQPLQVEGT
301 TLPVGAILYL MLGSANHDAE QFTAPQQLDL RRQQGRALSF GGGVHHCLGN RLALIELETA
361 LERLLQRAPA LRLPELDNLS WNERANLRGI QALHATW
CYP133B5 Ralstonia solanacearum GMI1000 megaplasmid
GenEMBL AL646080 77388..78584
gene = RSp0709
77% to CYP133B2v2
MKLADLSTPSFLENPYPLYETLRSQGPFVRIGPNALMTGHYSIV
DALLHNRQMGKSYMESIRLRYGDEGPNMPLFQGFSRMFLMLNPPMHTRLRGLMMQVFN
ARQIESMREVATATAHQLIDDFEQKPSADLVAEFAFPLPVRIICQMMDLDIDDAMALG
VGVSKLAKVFDPAPMSADALVETSAAYEELAQYFTKVIEARRAQPGTDLISMLMRAEE
NGETLTHDEIVSNVILLFIAGHETTSNMIGNALIALHRNPQQLDLLKREPSRMPNAVL
ECLRYDGSVQVTIRAALEDVEVEGEVLPRGTTVFLMLGAANRDPAQFTDPDQLDIGRQ
QGRLQTFGAGIHHCLGYRLALIELESALGALFERLPNLRLTNLDQLSWNQRGNLRGVN
ALMAAW
CYP134A1 Bacillus subtilis
GenEMBL AF017113, Z99121, Z99122
cypB also called cypX
CYP134A2P Bacillus cereus ATCC 14579
GenPept AAP10061
57% to CYP134A1 cypX Z99122 Bacillus subtilis C-term
1 MIGATNCDSN VFERPDKFNV YRPDIDIKKA FSGTARHLAF GLSIYNCVGV AFAKLKIEID
61 STIKDNISRK KLRDIKDFVK KTSKMN
CYP134B1 Photorhabdus luminescens subsp. laumondii TTO1
GenEMBL NC_005126 complete genome complement(313663..314886)
locus_tag = plu0296
46% to CYP134A1
MAKLSSFNIHDPKFIKNPYDFYDILHKQDLVYFEQSQNSYFIGK
YEDVDAILKSSIFNTKPLTALAEPVMGDRVLAQMEGEEHACKRKFIMQGLSRDYFNRY
YEPMIRKITEDLLQPYMEKGNIDIVNDFGRDYAVLVTLSILGLPSDNYRDIAEWHKGI
ASFITQFDQTELEKMHSLECSQKLIRLLKPIIDQRRRNPSKDIISIFCQDTAMSMSEI
TALCLNILLAATEPADKILAMMLNHLISNPSMLDVVLKDRSLVRDAFEETLRLTSPVQ
LIPREASEDVTISGIDIPKGAVVFCMIGAANRDPSVFHKPNEFDLYRRKNTTSPQKAN
RKRHLAFGAGTHACAAAAFSLSQLEVSSNIILDLLHNLRFADHYHYQETGVYTRGPSK
LLLSFDPIASSAIKE
CYP135A1 Mycobacterium tuberculosis
GenEMBL Z96800
Rv0327c
CYP135A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(393726..395075)
Gene = cyp135A1 1 aa diff
locus_tag = Mb0334c
CYP135B1 Mycobacterium tuberculosis
GenEMBL AL021942
Rv0568
CYP135B1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 660693..662111
Gene = cyp135B1 100% match
locus_tag = Mb0583
CYP136A1 Mycobacterium tuberculosis
GenEMBL Z83866
coding region 23158-24636
Rv3059
CYP136A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 3376038..3377516
Gene = cyp136 1 aa diff
locus_tag = Mb3085
CYP136B1 Mycobacterium abscessus
GenPept AAN38721
46% to CYP136A1
1 MDAVEAAQRP GGTMTNHLLA PAHHVKERLS SVIMVPAPHA VDDRWRRWSR DWPVRELAPA
61 PAGSGLKAVR GDAGLPFVGH TLDYIRFGSD FSRERYDRLG SVSWMGAFGT KMVVIAGPDA
121 TREAFTSEAK AFSQDGWSFL IDAFFHRGLM LMSFDEHLMH RRIMQEAFTR PRLTGYVEQV
181 TPCVRSAVPA WPVGPSVRIY PLLKELTLDI ATDVFMGGRG KDESDAVNKA FVATVRAASS
241 LVRAPLPGTR FRAGVQGRRV LEDYFFRHLP AARAGETEDL FAALCQATTE DGERFSDEDV
301 VNHMIFLMMA AHDTSTITTT AVTYFLAKYP QWQEAAAAEA AAIGDGLPDI EALEKMTVID
361 RVIKEALRLL APVPLVMRKT VRDVAIDGYH IPSNTLCAIT PAVNHFDRTI WNDPERFDPS
421 RFDEPRREDQ HHRFAWVPFG GGAHKCIGMQ FGTLEVKAIL HRMLRSFTWK VPENYHVRWD
481 NTSLPIPVDG LPLEMKRR
CYP137A1 Mycobacterium tuberculosis
GenEMBL AL022121
Rv3685c
CYP137A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(4064642..4066072)
Gene = cyp137 1 aa diff
locus_tag = Mb3710c
CYP138A1 Mycobacterium tuberculosis
GenEMBL Z92770
Rv0136
CYP138A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 163556..164881
Gene = cyp138 100% match
locus_tag = Mb0141
CYP138A2P Mycobacterium leprae
GenPept CAC32181 GenEMBL AL583926
NC_002677 complement(3167438..3168451)
locus_tag = ML2648
Sequence below is from TIGR primary nucleotide sequence for ML2648
40% to CYP138
1 NRVAREIVVEVIYGALFGAFEALSGLVPQDTVLGPMGRYSMAPSLIR
439 ITINVIMRAGFGSELDELRRLHPTAATL 522
RWTVERQARCNHDIFMLDSRSTAERLRRRLHGTCMKNH 351
352 VRIFEAEPLWGLRTGLKASLLPHCRLINRITINVIMRAGFGSELDELRRLHPTAA 516
517 TLVGLF*LLSQHLGVLADPSSMGATMPGDDPAPALRQATIPG
638 LGVQWTRTVIDFAARRVYSSVYHLSEWAIPREDSILISIAQIYXXXXXXXXXXXX 766
795 DPRRYVEHKPSSFAWI 842
PFSGGT 861
862 SRCVSICQDGDGMNVVLKMVLRYWIIDTTTAPGER*HLRGVVYTPRNGGR 1011
CYP139A1 Mycobacterium tuberculosis
GenEMBL Z95617 GenPept AAK45973 (with 7 more aa at N-term)
Rv1666c
CYP139A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(1877656..1878948)
Gene = cyp139 start codon differs by 6 aa
locus_tag = Mb1694c
CYP139A2P Mycobacterium leprae
GenPept CAC31622 GenEMBL AL583921.1
NC_002677 complement(1474970..1475188) and
complement(1474991..1475161)
locus_tags = ML1237 ML1238
61% to CYP139A1
GAAVATTSMTVILARLASRTRLHLLAHYTHRVRARNFAALIP*LSLTVEVINSMPTQ
CYP140A1 Mycobacterium tuberculosis
GenEMBL Z97193
Rv1880c
CYP140A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(2120751..2122067)
Gene = cyp140A1 100% match
locus_tag = Mb1912c
CYP140A2 Mycobacterium ulcerans
No accession number
Pam Small
Submitted to nomenclature committee 10/17/2003
62% to CYP140A1
CYP140A3P Mycobacterium leprae TN
GenEMBL L01095.1
48% to 140A1
VRQRLHWFAQYGFIRGIAATHH
RRSDPLARLDIALAIKANPVP
YCHKPRPRRPLVQSRISYLTANRAITHELLQSEDFHVFWLNVTLPAPSHWL
RRRTGYRTSSQYNL
LHPLLAIQ*AYHIHYRKTVSPLFAPKAVATLRDRIEQTTLALLDQLAHQHDVVDVVNRY
CSQLPVAVISDILGYP
VPDRDRSHILKFGELVAPSLDVELT*Q*YQQA*REVAGFNFWL
LKHLPQLQRTPGDNLVRHLSH*EDNKPTEISLSKSKLQAISG
GLVLATGGETTVNLLGRGI
LLLDTPEHMVMLQACPEPGHKRG*EILRLDSPIQMAARVARKDVDLAGSTIKRSQVVVLY
FGRSQPGPVRLCRSR*VQHRTPQCGKESRIFR*QEFCLENALTRAYNAVGLRAFFDHLP*
TRAAGTRSRLDTRVLRGWSTLPIALGPTRSMVS
CYP140A4 Mycobacterium avium subsp. paratuberculosis
GenEMBL AJ250018 complement(2795..>3145)
59% to 140A1 runs off end
GAAARQSRPVGPRRSRRSCDRQPGPDDRAHAPPATSTSAPAMVG
LVPRRARNRDPKVFSDPTTFDVTRPNAREHLAFASGIHACLGAALARIEGATCARSFE
NFPDRSSRARNGGR
CYP141A1 Mycobacterium tuberculosis
GenEMBL Z95150
Rv3121
cosmid cY164 from Sanger Centre
coding region 29289-30488
CYP141A1P Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 3441289-3441483
aa 337-400 (first part of gene is in a deletion)
IAFGYGPHACPASAYSRMCLTTFFTSLTQRFPQLQLARPFEDLERRGKGLHSVGIKELLVTWPT*
CYP142A1 Mycobacterium tuberculosis
GenEMBL
Rv3518c
CYP142A1aP Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(3898119..3898736)
gene = CYP142A1a aa 1-197 100%
locus_tag = Mb3548c
In Mycobacterium bovis, a frameshift due to a single base
deletion (c-*) splits cyp142 into 2 parts (pseudogene)
CYP142A1bP Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(3897541..3898122)
gene = CYP142A1b aa 207-end 100%
locus_tag = Mb3547c
In Mycobacterium bovis, a frameshift due to a single base
deletion (c-*) splits cyp142 into 2 parts (pseudogene)
CYP143A1 Mycobacterium tuberculosis
GenEMBL AL022021
Rv1785c
CYP143A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome complement(2013905..2015086)
Gene = cyp143A1 100% match
locus_tag = Mb1813c
CYP143A2P Mycobacterium leprae
GenPept CAC30494
NC_002677 1861010..1862160
locus_tag = ML1542
P450 pseudogene
Sequence below is from TIGR primary nucleotide sequence for ML1542
55% to CYP143 Mycobacterium tuberculosis Rv1785c
1 MSTSAKANPTHFTYCSLNYSALSMITDRGVIWKTLXX 105
113 AKPVVFMNG*YYLNVSRKCILHTTSITKGFSSREAXXX 217
225 PGNALPVLPXXXXXXXXXXXXXXXXXXX 251
278 SLNNLNKALPALRTYTVTMANAITSRGEW 364
366 EAMTDFANX 389
391 LFPLQLFLVL*GLXX 429
434 AQDRDHLIALLKDVVIGMSDKPFLSQADIADQGELCEYLVDTIAERKQNPA 586
585 PDVLSQVLIGEDPLSEIKVLDLESL 659
659 MLILAELDTVTATVGFSLLQPACRQQLRTMLRDKPKQIRILIED 790
792 ILQLEPPAQITPYITTEFVNVDGMTLSPGSRVRLC 896
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
993 GSHLARLKLTLAVDEWLINI 1052
XXXXXXXXXXXXXXXXXX
1116 LFALKALALHW 1148
CYP144A1 Mycobacterium tuberculosis
GenEMBL Z97345
Rv1777
CYP144A1 Mycobacterium bovis subsp. bovis AF2122/97,
NC_002945 complete genome 2001114..2002418
Gene = cyp144 1 aa diff
locus_tag = Mb1806
CYP145 Nocardioides sp.
GenEMBL AB000735
gene for 2-carboxybenzal
CYP146 Amycolatopsis orientalis
GenEMBL AJ223998
cosmid PCZA361 (gene 2 of 2)
CYP147A1 Myxococcus xanthus Partial missing C-term
GenEMBL AF111947 CDS 1939..>2877
42% to AF087022 partial new family
CYP147B1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV584 50% to 147A1 from Myxococcus xanthus
(147A1 is missing C-term so could be higher % identity)
CYP147C1 Streptomyces tubercidicus strain I-1529
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name CypEA
50% to 147A1
CYP147D1 Magnetospirillum magnetotacticum
GenEMBL NZ_AAAP01002628
complement(814..1824) gene = Magn3224
N-terminal is about 61 aa short
51% to 147B1 44% to 147C1
MCAPGPGRDPQCGTGRSSSGDPPDHDRLRGQVMRCFTPQRVRGM
REKTRRITDDLIAKMAGKTRIDLVDDFSYPLPVTVICELLGVPPEDEAQFHGWATQLA
TALEPNQRGDEETQAKNEVCFNEIADYIQGLIKEKRKNPQEDILSDLATDTDGMNDFD
LIATAVLLLVAGHETTVNLITNGMLTLLRFPEHLERLRAEPETAPRLIEELLRYEPPV
HYRTRLALADIPVAGITIPKDAPVILLLAAANRDPLRFSDPDRFDPDRPDNRHLGFGG
GLHYCVGAPLARIEAEVALVSLVRRLKGLSLTENPPPYRPGASLRGPCHLRLALEEVA
EG
CYP147E1 Methanosarcina barkeri Archaea; Euryarchaeota
GenEMBL NZ_AAAR01001943 4935..6305
52% to 147D1 probable lateral transfer
gene = Meth3340
MYRQGSGPNDRRQTMTQQSLYEQVLDYANRANPYPLYAKLRQTP
ITRQIDGSYVVSTYREIVSLLHDPRIGSDFRMRSA
HDRPSAGLSANQELASKNQAQDEGAETSSSNQGSETEVV
PSFIGLDPPEHDRLRRQATWPFGPPHTPGRVADMEPELILLA
NRQIDTIKGRTSIDIVEDFAYPIPVTMISELLGVPPEDQPRLHALSEAIIEDIDLDPR
QSPEEQKRRQEQSSQTFKELEQYMEVLIEHHRKQPGSDLLSGLITDHGSDGPMAQADL
VSTASLLLIAGHETTVNLITNGMLTLLRHPDVLERLRREPDLVIRLVEEFLRYEPPVQ
ILPNRVALSDITIAGTTIQKGSPVILLLASGSRDPARFHDPEKFDPDRRDNMHLGFGS
GIHYCYGAPLARLETQIALTELVQRLENPRLAHDPPPYRQSATLRGPRHLIVEIDGVK
DWEFHL
CYP147F1 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
55% to 147B1 if 9 aa removed, 54% to 147E1
clone name SP0549
CYP148A1 Deinococcus radiodurans R1
GenEMBL AE002083 CDS 1719..2948
38% to AL049754 complement(10413..11648)
CYP148A1 Deinococcus radiodurans
GenPept AAF12079
GenEMBL NC_001263 2539498..2540727
Gene = DR2538
1 MTASSGSSAP SSGPLLAAVQ GLWSGAALAD PHPIYEQIRG FANADGLVRL PEWNTAFAVG
61 HAATSAVLRS PAARSGEWDH GPSDGGKLLQ HMMLFRNGIP HARLRGLVQK AFTPRVVEEQ
121 RDLVRSLLDE LLSDMARAGG PVDLVAGLSG PLPGRVIMRM LGLRGADEER FLGWSASVAE
181 LLGGADRSPA LLARIEADAR EMRGYFRDLA DELRVSPQPG LLSALAAVED GGERLSGDEL
241 LSNAVLLLAA GHETTSNLIP GGVLALSQQP GAWAALLNHP RHPGVADELL RHVSPVQLDG
301 RMLTEAQTVG ETPLPAGTPV QLLLAAANRD PQVFPDPERL DWDRPNASRH LAFAAGPHYC
361 LGASLARLEI AETFAALAER FPDLRVSAAP HYKANFVLRG PQELWVTLG
CYP149A1 Microcystis aeruginosa
GenEMBL AB036790 CDS complement(779..2254)
gene="mapks"
41% to 107H1 partial seq new family
CYP150A1 Mycobacterium species
GenEMBL AF107046
Pascal Poupin
gene 1
CYP150A2 Mycobacterium smegmatis mc2155
GenEMBL AF107047 1092..2405
Pascal Poupin
gene 2
MTDSTATDPAATTPDFDTVDYFTDQSLVPDPHPYFDHLRSKCPV
VREPHYGVLAITSFEEATTVLKDTETFSSCIAVGGPFPPLPFTPEGDDITGQIEQHRT
QLPMFEHMVTMDPPEHTNARSLLNRLLTPKRLKENEDFMWRLADECLDDFIDDGSCEF
LKQYAKPFSLLVIADLLGVPEEDHDEFRHVLGAPRPGAIVGSLDGDQLAMNPLAWLDD
KFVRYLEDRRKEPRDDVLTALATAKYPDGSTPEVIDVVRSATFLFAAGQETTTKLLSA
SLRVLGDRPDIQQALREDRSRIPTFVEEALRMDAPVKSQFRLAKKTTQLGGVDVPAGT
TLMVCPGAVNRDPVRFEDPHTFSLDRKNVREHIAFGRGVHSCPGGPLARVEGRVSLER
ILDRMADIRIDEEHHGPADNRRYTYEPTYILRGLTDLHIKFEPVR
CYP151A1 Mycobacterium smegmatis
GenEMBL AF102510
Poupin P, Ducrocq V, Hallier-Soulier S, Truffaut N
Cloning and Characterization of the Genes Encoding a Cytochrome
P450 (PipA) Involved in Piperidine and Pyrrolidine Utilization and
Its Regulatory Protein (PipR) in Mycobacterium smegmatis mc2155.
J Bacteriol 181, 3419-3426 1999
CYP151A2 Mycobacterium sp. strain RP1
GenEMBL AJ310142
Pascal Poupin
Submitted to nomenclature committee March 22, 2001
86% identity in 399 aa overlap with CYP151A1
CYP152A1 Bacillus subtilis
GenEMBL AB006424
ybdT gene
this sequence is missing part of the heme signature sequence, but has
PERF and EXXR
CYP152A2 Clostridium acetobutylicum
GenPept AAK81262
YBDT B.subtilis ortholog
59% to 152A1
1 MLLKENTAKD KGIDSTLDLL KEGYLFIKNR ADHYQSDLFE TRLMGQRIIC MTGEEAARIF
61 YDSDKFKRQG AAPKRVQETL LGENAIQTLD GESHLHRKKL FMLLTNQVQQ KRLAELTTEK
121 WEASASKWHT KSIVLFNEAN EILCQVACHW AGVPLMESDI KNRAEDFSSM IDSFGAVGPR
181 HWKGKKARNT IEAWIKEIIE NVRSGRIRAE EGSPLHEIAF YIDVNGQQMP AEMAAIELIN
241 ILRPIVAIST FITFSALALY EHSEYREKLQ SKDIRYLEMF TQEVRRYYPF APFVGARVRK
301 DFLWNNCEFK KEMLVLLDIY GTNHDSRIWQ KPYEFIPDRF RSYKGNLFDF IPQGGGDPSS
361 THRCPGEGIT LEIMKTSLDF LSTKIDFTVP DQDLSYSLSK IPTLPKSGFI IDNINLKL
CYP152B1 Sphingomonas paucimobilis
GenEMBL AB006957
Isamu Matsunaga
this sequence is missing part of the heme signature sequence, but has
PERF and EXXR
CYP152B2 Azotobacter vinelandii
NZ_AAAU02000007 102969-104183
56% to CYP152B1 with two frameshifts
may have 10 aa deletion after aa 136
MHRIPRDKGLDSTLALLHDPYRFIARRCRLHGSNLFETRLLLRKTLCMSGAEAARLFYDP
ERFVRHGAMPPRLQKTLFGVGGVQGLDGEAHRHRKHMFVALLMDAERVAQLVEAVRGEWR
TCARRWERMEKVVLYD
CAWAGIPLAEEEAGPRAREIALLFDYAGSVGPKHWRSRLARRRSEAWMGALVESIRASRR
QPPAETAAQVISWHRGLDGNLLEAR
VAAVELLNVIRPVVAIAVYLTFVAHALHRYPHCRHGLRSGDAEYREWFVQEVRRFYPFFPA
VVARVRQDFEWRGYAFPAGRRVMLDLYGTDHDVRLWQAPETFRPERFGSREYGPCDFIPQ
GGGEHESGHRCPGERIVMKLGADVLARELSYAVPMQNLEIDFSRLPALPRSRFVMSDIHGAP*
CYP152C1 Rhodobacter sphaeroides
GenEMBL NZ_AAAE01000129 complement(38633..39955)
gene = Rsph2136
40% to 152A1, 41% to 152B1
MTTDEGRRPEEPGTPASLREMPRDPRIDASMALMSEGYRFVSNL
CDRMDSDAVATRLRLREVVCLRGSAAARLLYGAEGLTRVGAMPSTVLHLLQDKGSVQQ
LEGPAHRHRKALFLSICMDPARVEALVSEMRLAWRERLPAWEAEGRIVLQQEAARLLT
RAGCRWAGVAHQPEAQLADEIFDMIDKAGSVGPRNWLAQMRRAGTEKRLRTLVEEVRA
GEVVPEAATALHAIAFHREEDGTLLDPSVAAVELLNLLRPIVAVGRYITFAALALHRE
TTWRELFRSGNLELAGDFAEEVRRASPFFPFTAAVTTRPITWEGYDFPEGQWLLLDLY
GTTHDPRHFPEPTRFRAERMLSWTGQDEAFIPQGAGDVARTHRCPGEMITVELMKEAI
RLLCCEMDYEVPAQDLGVRLNRMPAQPRSGMILSAISRRAGTEASRNG
CYP153A1 Acinetobacter calcoaceticus
No accession number
MAIER; T; FOERSTER, H.H.; ASPERGER, O. AND HAHN, U.:
Cloning of an unusual 56 kDa-P450 from the n-alkane-assimilating
bacterium Acinetobacter calcoaceticus EB104.
Unpublished
32% identical to CYP111
trivial name P450EB104
CYP153A2 Caulobacter crescentus CB15
GenPept AAK22050
NC_002696 complete genome 61849..63153
56% to 153A1
1 MMSQNTDPRE DLMSDGSIDL KADARARAYS IPLEDYHVAD PALFQADAMW PYFERLRKEA
61 PVHYSKGDEE VGPYWSVTRY NDIMTVDTTH QVFSSDAHLG GITIRNFDED FVLPMFIAMD
121 QPKHDIQRKT VSPIVSPANL GRLEGIIRER VCGILDALPI NEPFDWVDKV SIELTTQMLA
181 TLFDFPWEER RKLTRWSDIA TASPESGLIE SEEARRAELL ECLAYFTNLW NERVNLTEPG
241 NDLISMLAHG EATRDMPPME YLGNVILLIV GGNDTTRNSL TGGLYALSKN PQEEAKLRAD
301 PGLIPNMVSE IIRWQTPLAH MRRTALEDYE LAGQTIKKGD KVVMWYVSGN RDDTVIENAD
361 QFIVDRPNAR RHLSFGFGIH RCVGNRLAEM QLKIVWEEIL KRFPKIEVLE EPKRVYSTFV
421 KGYERMMVRI PERI
CYP153B1 Bradyrhizobium japonicum USDA 110
GenPept BAC47118
NC_004463 complete genome 2007526..2008767
49% to 153A1
1 MNRRLEIHRA DDGYIIPLSE LDVSEGKRFQ DDSIWGCFER LRREDPVHYC QNSAHGPYWS
61 ITKYRDIVAV DTNHHAFSSQ QGVTIVEVPD KHWTPSFIKM GPPQHAEQRN TVSPIVGPES
121 LTRLETLIRS RVRMILDGLP RNEVFNWVTK VSIELTTQTL ATLFDFPFED RRLLTYWSDA
181 AVTTPKAGYA IDSWDKRSTI LSECLDYFTR LWNERINAEP RLDLISLMAH SPVTRHMEPT
241 EFLGNLILLI VGGNDTTRNS ITGGLLFMSQ YPSELRKLTD NPKLISSAVS EIIRYQTPIA
301 HMRRTAAIDS IVGGKPIRTG DKVVMWYISG NRDEEVIENA NSFVIDRKNV RQHLSFGFGI
361 HRCLGRHLAE LQLRVLWEEI LDGGLKIKVV GEPERIASNF VHGYSALPVR IEA
CYP153B2 Bradyrhizobium japonicum USDA 110
GenPept BAC52507
NC_004463 complete genome 7964879..7965991
51% to 153A1
1 MHYCKDSMFG PYWSVTRYND IMEIETNHSV FSSASALGGI TIRDIDPDLR RESFISMDPP
61 RHAAQRKTVA PMFTPTHLDN LALNIRARSA ECLDNLPRGE VFDWVDRVSI ELTTQMLAVL
121 FDFPWEDRRK LTRWSDIATT IPGPDGLVAT EDERQAELTE CAGYFARLWK ERIEQPPKSD
181 LLSMMAHGAA TRDMDAKNFL GNLVLLIVGG NDTTRNTMSG SIYALSQHPE QYRKLRENPA
241 LLDSFVPEVI RWQTPLAHMR RTALSDFEFR GKQIKKGDKV VMWYVSGNRD EEAIEKPYDF
301 IIDRARPRTH LSFGFGIHRC VGLRLAELQL KIIWEEILKR FDHIDVVGEP KRVYSSFVKG
361 LETLPVKIAA
CYP153B3 Bradyrhizobium japonicum USDA 110
GenPept BAC52508
NC_004463 complete genome 7966159..7967508
51% to 153A1
1 MDGRRRRPMP LPQAGEVRKT TGATTMNIQT PVKVDKAERM RRARGEAYAT PLAQFHPGAP
61 RLFQDDTLWP WFERLRKEEP VHYCTNAPIE PYWSVVKYND IMHVDTNHGI FSSDSTLGGI
121 SIRDVPEGYD YPSFIAMDQP RHSAQRKTVS PMFTPTHLDE LAKLIRQRSQ TVLDNLPRNE
181 TFNFVERVSI ELTTQMLATL FDFPWEERRK LTRWSDVSTA LPKSGIVASA EERRREMDEC
241 YAYMSKLWNE RVNSAPRNDL LSLMAHNDAT RFMDPDNLMG NIILLIVGGN DTTRNTMTGS
301 VLALNENPEQ YDKLRANPAL IDSMVPEVIR WQTPLAHMRR TALQDTEIGG KQIKKGDRVV
361 MWYVSGNRDE EAIDRPNEFI IDRPRPRTHL SFGFGIHRCV GMRLAELQLK IVWEEMLKRF
421 DRIEVVGEPK RIYSSFIKGY ESLPVRIPG
CYP153B4 Rhodopseudomonas palustris
NZ_AAAF01000001
complement(3283981..3285270) gene = Rpal2887
80% TO 153B3
MTWPGRTTMHGTIETGKAARLRAAREEAYATPLKDFHPGAPRHF
RDDTLWPWFERLRAEEPVHYCTNAPIEPYWSVTKYNDIMHVDTNHQIFSSDSTLGGIS
IRDAPVGYDWPSFIAMDEPRHSAQRKTVSPMFTPQHLDELAVLIRGRTQKVLDGLPRG
ETFNFVDRVSIELTTQMLATLFDFPFDERRKLTRWSDVATALPKSGVVDSEQQRRDEL
NECAAYFARMWNDRVNSEPRNDLLSMMAHHDATRTMDRDNLIGNILLLIVGGNDTTRN
TMSGSVLALNENPHEFEKLRANPKLIDTLVPEVIRWQTPLAHMRRTALQDAELGGKTI
RKGDRVVMWYVSGNRDDEVIERPEEFIIDRARARIHLSFGFGIHRCVGMRLAELQLRI
VWEEMLKRFERIEVVGEPKRVYSSFVKGYESLPVRVS
CYP153B5 Burkholderia fungorum
NZ_AAAJ02000161
complement(10797..12332) gene = Bcep0832
56% to 153B2
MRHVSCLRRSRLERSRGRARRPGGFHADLFTDPQGQLAALLPDR
GARGTRRTGPAPSRRPALTTLARHSNPVPDKALAPLTRNCHSAKVNCMNTLVVDSSHV
RLAPDALSQPVEDIDPSLPYRFQQQTHFAMFDRLRRESPVHYVKDSEYGPFWSVHRYN
DIIDVEIDHATFSSDVKYGGMLIKDLPENMRRTSFINADPPLHDHQRRVVSPIVAPGN
LNRLEHTIRREAADILDGLPRGETFDWVDNVSIELTGRVLCELMDFPRADRRLLTYWS
DIVNVDLEVGGEINTEEKRYVKLKECASYFGVLFKERMNSEPKDDLISMLAHSEYTKN
MPEQEFLGMIVLLMVGGNDTTRNSISGGLVALNQFPEQYAKLHNDPGLIPKLVPEILR
WVTPVTHMRRTATRDIEFRGKQIRQGDKVVVWYASGNRDSDVIKDPYQFIIDRANPRL
HLSFGFGIHRCLGNRLAELQLRVLWEEILKRQMLIEMMGEPVRKYANNITGVMALPVRIAA
CYP153C1 Novosphingobium aromaticivorans
GenEMBL NZ_AAAV01000116
complement(18298..19530) gene = Saro1194
45% to 153B4
MAATLAPDRAINPHDVSLNALYTEDRWREPFRWLRENMPVSYRA
ESPFGAYWSVVTHDLIQQVELDPGTYSSSWQRGNITIADSVNETEFPNFIAQDPPIHT
AQRKVIAPAFGPSQMVKLERLVRERTTQLLDGLPMGEEFDWVERVSIPLTLGMLLILF
DMPFDEWRDIKRWSDWASGVSEDSLNDAYRAEFVQQMGQMLMRFDRELEARRALPPSD
DLLSRMVHSDAMGHLTPPERIANIALLIVGGNDTTRNSMSGLIEALHRYPAELDKLRA
DPALSANAAQEIIRWQSPVTHMRRTLTRDAELGGQRLAEGDKIVMWYISGNRDENVFP
DAERFDVTRENARRHIGFGHGIHRCVGARLAEVQIAAVIEEIATRRLRITPQGAPTRL
ASPFLHGFTAMPVVMSRD
CYP153D1 Novosphingobium aromaticivorans
NZ_AAAV01000178
4242..5579 gene = Saro3794
47% to 153C1
MATQLAPEVPQFTYHSSPTATEAFAAWLKDNPQAIPAHSHPWDV
SRSDIYVEDRWQPIFAEMRAKAPVNRVPDSPYGAYWNVASHKAIMHVESLPELFSSSW
QYGGITIGDPPEDVDPQKLAERQLPMFIAMDRPDHTGQRRTVAPAFTPAKMVEMEAEI
RRRTASVLDSLPWGERFDWVDKVSIELTTGMLAILFGFPWADRRLLTFWSDWAGDVEL
TLARELADTRFGFLGEMAHYFQRLWGARMQAPPSGDLISMMIHSEAMNHMSPQEFMGN
LVLLIVGGNDTTRNTMSGIVHALDKFPDQRELLERDASLIPNAVQECIRYVTPLAHMR
RTATADTELFGNQIKAGEKVILWYISANRDETVFENPDKLMVDRPNARRHLSFGHGIH
RCVGARLAELQLRILLEEMHERRMRVRVAGEVERVRANFVHGFRKLEVELEKR
CYP154A1 Streptomyces coelicolor cosmid E6
GenEMBL AL353832 CDS 17561..18787
51% to AF145049 42% to AL158061 complement(14764..15987)
38% to 107A1, 107B1 39% to AF087022
cloned and expressed by David Lamb and Steve Kelly
CYP154A2 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV109 70% to 154A1 from Streptomyces coelicolor
CYP154A3 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
75% to 154A1
clone name SP0673
CYP154B1 Streptomyces fradiae tylosin-biosynt.
GenEMBL AF145049 CDS 3108..4409
51% to AL353832 CDS 17561..18787 44% to AL158061
complement(14764..15987)
39% to 107C1
CYP154B2 Streptomyces avermitilis
GenEMBL AP005036
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV3704 64% to 154B1 from Streptomyces fradiae
A tylosin-biosynthesis gene in fradiae
CYP154C1 Streptomyces coelicolor cosmid 6D11
GenEMBL AL158061 CDS complement(14764..15987)
44% to AF145049 42% to AL353832 CDS 17561..18787 39% to 107C1
cloned and expressed by David Lamb and Steve Kelly
CYP154C2 Streptomyces avermitilis
GenEMBL AP005036
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV3882 87% to 154C1 from Streptomyces coelicolor
CYP154D1 Streptomyces avermitilis
GenEMBL AP005026
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV1308 47% to 154A1 from Streptomyces coelicolor,
46% to 154B1, 45% to 154C1
CYP154E1 Thermobifida fusca
GenEMBL NZ_AAAQ01000039b
99628..100836 gene = Tfus2243
49% to NZ_AAAQ01000042a 154F1
47% to NZ_AAAQ01000029a 154G1
46% to 154D1
MGQSRRPHTVYLDPAKGVDIPAQRRELLDKGPVVRVAFPGNLEV
WALTHDAPLRNALADESVFVRGWRNWRALMAGEVDPTHPVANMLRVESMLARSGADHK
RMRGLVQAAFTRRRVEALRPRIEEITNELLDRMAESDGVVDLKAAYSFPLPIRVISEL
LGLNEEDHLTLQTLVTRTLSGTDPEANADAFTFVASLIEAKRKNLDDGLISAMIEARA
EDGDRLSETELIHNTLLLIIGGFETTMGMISNSVQLLLTHPDQLHLLRTGQASWENAI
EECLRFESAVVMLPFLYTTRDVEIDGITIPAGDAVLIGFGPANRDPQAYDDPDRFDIT
RPRPRHLAFGHGAHLCLGAALARLELLIALPALFERFPDITLVGEAPPTPTVFMNHPL
SRPVLLRPKP
CYP154F1 Thermobifida fusca
GenEMBL NZ_AAAQ01000042a
complement(347138..348745) gene = Tfus3014
49% to NZ_AAAQ01000039b 154E1
44% to 154C1 and 154C2
N-terminal extension may be incorrect
MAVSADHAAAGALPGARHPAGRRVRAGRRLLDHRLRSGTRRPGG
CPVHRPAQPVRDPRPPGVGDRAPRVPGAADLPADRGRRGECAAGPVAGRASRGSPRVA
GAAPFPVRPRTQGASCHLHPRRHHPTNGGT
MAAVPEPIVLVPGKSREQALQLREAGPL
ARVVVEGLEVWALTHDRELREALIDPRFRRNWRTWRALNEGEVATDHPVAAMVYLDNM
LTVDGEAHRRMRSPVAQAFTPRRVELLRPRVTEIVNALLDQLAERDGTVDFKTEFAYP
LSMRVFSALFGIPERDHGRMQQMVNTAFSPSSPEEVRAMREELDAFLDELIEDKRRSP
GEDLTSALVTATDEEHKLSDAELRDTLWLLVTAGFETTSSALANAVQTLLTHPDQLAH
LRSGSIAWEDAIEEVLRQSSSVATLPFLFAAEDVQIGDRTIRAGEPVLLAYLAANLDV
ERYGEDAAEFDATQSRPRHLAFGHGPHTCLGAALARLEMEVALTTLFTEFPEVSLAEG
EAPRLESVFIHAPAALPIRLGPRRTAA
CYP154G1 Thermobifida fusca
GenEMBL NZ_AAAQ01000029a
complement(12913..14079) gene = Tfus0891
47% to NZ_AAAQ01000039b 154E1
41% to 154C1
MLDTERGLTEADIHALAEHGPVVRLSVMGLDVWAVTGYEELRTL
MADPEVKRGVEHWTAVAQGKVPAEHPLVKLVSMGSMLSKNPPEHTRLRRLVQHAFTTR
RVEGLRPVVQELTRACLDRIDASQPFDINAALSHPVPVGVIGRLLGIPETDQPALDSL
VTRLLSGTDATVHEELYAYVAAMVAARREQPDDGLISALLHVHDDDGSTLSEEDLMWT
VVLLVDAGFETTVGQISNSVRLLLEHPDQLALVTSGEVPWERAVEECLRHTASVVMLP
FCFPTREKELGGYTIGAGEPVMMVYGAANRDSRVHAAPKVFDVTRSDSRHITFGHGPH
HCLGAPLARLELNVVLPELFARFPKLALAERDIPRVKSLFVNRPSELWVTAGMG
CYP154H1 Thermobifida fusca
GenEMBL NZ_AAAQ01000035b
93951..95183 gene = Tfus1569
51% to 154C2
MMASPTDNPIVLDPYVSDLEGERERLYEAGPIAWVELPGGVRTW
SVTHHQAARELLTDSRLSKNMAHWGAYNRGEISPTWPLLSVIPPTPTNLLGTDGAEHK
RLRTLTAQAFTPRRVEKLRPRIREITEELLDALEERANEPQDLKSEFSFKLPMRVIGE
LYGVEEAAHGQLRSLYDKFFSSVTPPEEFLATREALVQFYTELMERKKANPSDDLTTA
LLQANENGDRMTDEEVLGTLQIVVAAGHETTVNLLTNTVRALLRFPDQLELLRTGKAT
WEAAIEESLRWDPPTTNFIFRFATEDIEYGGVTIAKGDSVMISYGAIGRDRGQHGDNP
EVFDVTRKTSSRHISFGYGPHVCPGAPLARLEAQVALPMLFERYPDMKLAVDDSELVP
NPSVIVNSLKEFPVILRP
CYP154J1 Streptomyces carzinostaticus subsp. neocarzinostaticus
GenEMBL AY117439
complement(35390..36622)
47% to 154B1
MCPYRLDPEGADTHGETARLREQGPIARVELQDGVLAWSVHDYA
VAKQIMADERFSKNPRKNWPAYINGEISNGWPLITWVAMDTMATQDGADHARLRKLLL
KAFTERRVESMRPHIEKTVKELLDNMAAKADDEIVDIKEMFHAELPTRLMCDLFGVPE
ERRAEVLAGGHKNIDTRISSEAAEANLGQWQEAISDLVEYKRHHPGDDLTSALIEARD
EGSRLSDSELIGTLHLLLGAGSETLVNALAHSSLALLVDADLRKKVTSGEIPWVNVWE
ETLRVESPVAHLPFRYATEDFEIGGVKISKGDPLLVDFAGIGRDPAVHSDAPDEFDAL
RPDKTHLSFGHGVHYCLGARLAKHAWMIGIPALFERFPDMELAVRRDELKGQGSFVVN
GHASLPVHLKGRAAALAR
CYP154K1 Streptomyces rochei plasmid pSLA2-L
NC_004808 complement(144081..145325)
53% to 154B2
note = ORF84
MLRQEAPYVIDSAGRDLPGEAARLRERGPVVRVVLPGGVSAWAV
TDLDLIKQLLTDSRASKDAYRHWPAWAGGEVDESWQMSMWVSVRNMLTAYGEEHARLR
RLVAGAFTARRTADLRPRVERITARLLDGLAAVPPGAAVDVRNEFARPLSVLVMGETL
GLPEDLHADLQRMVDVLFKTTAEPEEARANQYELYALLTELVAARRSAPGTDLTSELI
AARDEDGGEGLSEKELVDTLLLLIGAGTETTVNLIDQAVHGLITHPAQLALVLGGEAT
WDSVIDETLRHQPVVANVPFRFAVEDIEVGGVTIPKGDPILLSLAAAARCPHRHGADA
DQFDVARPSRRDHVPFGYGVHHCVGRPLARLEVSIALESLFARYPRMAAAVPEAELAV
RESFISSGHVALPVVLVPGAAA
CYP155A1 Streptomyces coelicolor cosmid 6D11
GenEMBL AL158061 CDS complement(40542..>41807)
32% to AF127374 new family
cloned and expressed by David Lamb and Steve Kelly
CYP155B1 Deinococcus radiodurans
NC_001264 192795..193784
frameshifted C-term, frameshift not a seq error
43% to 155A1
MPSFLSFRSSAMTAHDAQPEPARCPFTGQAAPTETITRRHVPPQ
GDLAQPVETYARARDLLKSEQAQQAGFLADMVSRVPGSQHPPVLYLEGEEHTEMRRAT
AKYFTPTQVNTYQPDIARLADELIGKLARRGEAKLDDLSLELAVRVAAGVVGLTNSRL
PGMDRRIERFIPSGVDAEPGVKLEGASPLENARQAANMALFYALDVKPAIEARRKAPQ
DDLISYLLSRGYNDQDILTECVTYGTAGMITTREFISVAAWHLLKNPELRAAYVHGTE
KERHAVLHEILRLEPVVGTLYRRA
GTGDDCGRRSHPAGSVFALDIGQANLDPAVMGEGAEQLCPMRELPRGVQAQGLSFG
DGHHRCPGAFLAIKETDVFLRRLLIWNDLHVVSEPRVTYNEVIKGYELRGFRVRLGGARA*
CYP155B2P Deinococcus radiodurans
GenEMBL NC_000959 73..384
plasmid CP1
44% to 155B1
C-term only runs off end might be a full gene
SGSTDAEILPYVVEAEPSSPVVHLDISQ
ANRDESVFTQAQQFCPHRKNVRQHLSFGKGEHACLGQSLVYTICRVMAHALELLSAPAGQ
RTDQVSQ*
CYP156A1 Streptomyces coelicolor cosmid E6 gene
AL353832 16317..17549
34% to AL158061 complement(15984..17225)
cloned and expressed by David Lamb and Steve Kelly
CYP156B1 Streptomyces coelicolor cosmid IF3 gene
AL590982.1 10933-12276
41% to 156A1
cloned and expressed by David Lamb and Steve Kelly
CYP157A1 Streptomyces coelicolor cosmid 6D11
GenEMBL AL158061 CDS complement(15984..17225)
51% to AL132991 complement(7477..8739) 40% to AL132648
cloned and expressed by David Lamb and Steve Kelly
CYP157A2 Streptomyces avermitilis
GenEMBL AP005036
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV3881 86% to 157A1 from Streptomyces coelicolor
CYP157A3 Thermobifida fusca
GenEMBL NZ_AAAQ01000035a
92419..93954 gene = Tfus1568
55% to 157A1
MGPAVHRGARRSRPRRRPPTRRLLVLHRPAGGARHSLRRRPQQL
RVLLHPGPDPRGTRPGPLSPPPGMRRPGTRVRQTGAGHPGPTCDDDHRGGGRPMNAQR
GIPSSHQNAFRLYGPQFQNKPAELYRQMRTDYGPVAPVLLDGDIPAWLVIGYREVTHV
LNHPETFARSSRRWNAWDLVPENWPLYPMVTRTPNILYSEGEEHRRRATAISDALSGA
DQHEVRQYAVQAADRLIDGFCAASRADLRADYASRLPAIVLGRLYGLDQKHAEVLAEA
MTTMIDSGPDAVKAQQFLLQTMGTLVAERRKQPGPDVVSRLVHHPAKLRDEELIPDLV
VILGGGHQPTTEWLGNTLRLMLTDDRFAASLTGARSSVREALNEVLWEDTPTQIYLGR
YAAHDVELGGQLIRRGDLVLLGLAGANSDPQINPGPECRMSQGNQAYLSFSHGEHRCP
YPAPELAEIIVTAGIEVLLDRLPDVELAVPVDELRWRPSPWMRGLVALPVVFTPVPPI
GGQ
CYP157B1 Streptomyces coelicolor cosmid F55
GenEMBL AL132991 CDS complement(7477..8739)
51% to AL158061 (15984..17225) new family
cloned and expressed by David Lamb and Steve Kelly
CYP157B2 Streptomyces hygroscopicus subsp. yingchengensis
AY260760 complement(2270..4150)
P450 fusion protein
83% to 157B1 C-term part
gene = shy2
note= fusion to ATP/GTP binding protein at N-term
MGSATSELPSQRTPLTAAAETGLKIVVVGGFGVGKTTLVRSVSE
IRPLNTEELMTQAGQGIDETAGVERKTTTTVAFDFGRISLNDRMVLYLFGAPGQERFW
FLWDRLFAGTLGAVVLVDTRRMEDCWYAIDRLEHHGTPFVVAVNRFDGDEKRFSLDEV
RQALALGEHVPMIECDARVRASGKEVLIALVDHLYTRALAKESTACSDTTGFPSTDAP
PPGCPAHGSAVPLAGLEYQQTPSQLYRTLRREHGAVAPVLLDGGIPAWLVLGYPEVCY
VTAHDELFARDSRRWNQWEHIPPDWPLLPYVGYQPSVLFTEGAEHQRRAGVITQALEG
VDQFELARECQLIAARLISSFSGSGRAELMSMYAHALPARGVLWMCGMPAEDADTERL
VDDLRISLDAGEGDDPVAAYTRVGERIMRLVKEKRERPGPDVTSRMILHPAGLGDEEI
VQDLISVIAAAQQPTANWICNTLRLLLTDERFAVNVAGGRVSVGEALNEVLWLDTPTQ
NFIGRWAVRDTQLGGRHIREGDCLVLGLAAANTDPQIWPEPHAGSGNSAHLSFSNGEH
RCPYPAPLLADVMARTAVETLLEHLPDLVLAVEPEELTWRPSIWMRGLTSLPVEFTPA
MN
CYP157C1 Streptomyces coelicolor cosmid I41
AL132648 CDS complement(9396..10892)
40% to AL158061 (15984..17225) 39% to AL132991 complement(7477..8739)
cloned and expressed by David Lamb and Steve Kelly
CYP157C2 Streptomyces avermitilis
GenEMBL AP005047
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV6706 70% to 157C1 from Streptomyces coelicolor
CYP157C3 Streptomyces griseus
GenEMBL AB044803 3754..5328
61% to 157C2
gene = rarE
MTTPFHHEPGTVPPPQCPAHNLDIGPGGLRRLHGPEAENNPAGL
YDKLRAEHGTVAPILLHGDVPAWLVLGHSENLHVTRTPSQFSRDSRRWRALQDGSVAP
DHPLAPIFTWQPICVFADGPKHERQRGAVTDSMERIDTRGVRRHINRFSNRLVNDFCE
KGTADLVGQFAEHLPMMVVCAIFGMPEEYDERLVQAARDMTRGTETAVASNAHIVSVL
TRLVERRRAEPSPDLASWLVEHPATMTDTEVIEHLRLIMIAAYESTANLIANVLRMVL
IDPRFRARLSGGHMTVPEAVEQTLWDEPPFTAVFGRWAVGDTELGGQQIKAGDALLVG
IAPANTDPTVRPDLGADMGGNRAHLAFSGGPHECPGQDIGRAIADVGVDALLMRLPDL
ELGVGESELHWVGNIMSRHLVELPVKFAPGPQQKLDADPLTVMARLLAPPTPGRSPPR
PGRSPSPATRGPWRRAHAPGAAPTAEPDPAPAAPPAPEPAAAPEPAPVATIPQQRRPA
APARFWQAVTRWWSGY
CYP157C4 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
57% to 157C3
clone name SP0618
CYP158A1 Streptomyces coelicolor cosmid 8F11
AL353864 CDS complement(25687..26910)
40% to AF254925, 107E1, 107F1 34% to 107A1 probably a new family
cloned and expressed by David Lamb and Steve Kelly
CYP158A2 Streptomyces coelicolor 2StG58 [Full Sequence] Sanger cosmid
AL939108.1 CDS 76346-77560 61% to 158A1
Note this sequence shows greater than 40% identity to some CYP107
subfamilies
cloned and expressed by David Lamb and Steve Kelly
CYP158A3 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV7130 58% to 158A1 from Streptomyces coelicolor
CYP158B1 Saccharopolyspora erythraea
GenEMBL AY078067 4211..5416
gene = rppB
red pigment gene cluster
47% to 158A3
MTDRCPARMYDPADLPGMTFDPVFLELLRDEPVARIRMRYGEGE
AWLLTRYEDVKFVTSDPRFSRKIMGRPFPKMTKHHIPMDRAISFSDPPEHARVRRVVA
RDFSPGSIERLRPTGREIMHRYLDELVASGPPADLVRHVTSPFPMAVLGELMGIPESD
RQWLIDCSSQVLSMAPDQAAVDRINGIKAEVAEYFLALVESRRADPRDDVVSTVAAAR
ERGDLDDEEVGAMTVLLALNGWHAVRNNSTNMVYVLLTDPELRSRLKADLALVPTAVE
ELLRYIPHKRGIGQPRIATEDVDIRGVRISKGDVVYVSYIAANWDEEVYPEPDRVDLD
RPEVPHLAFGHGPHYCMGPMLARMESQVLLSSLLTRLPDLALAVPPEQVAWQPNALIR
GPVELPVTW
CYP159A1 Streptomyces coelicolor cosmid F55
GenEMBL AL132991 CDS 6235..7458
41% to 107K1 41% to 134 new family
cloned and expressed by David Lamb and Steve Kelly
CYP159A2 Streptomyces hygroscopicus subsp. yingchengensis
GenEMBL AY260760 983..2209
84% to 159A1
gene = shy1
MSAAHHLPDILSPEFAANPYPAYAVMREKEPLIWHEATQSYIIS
RYEDVERVFKDKKAEFTTDNYNWQLEPVHGKTILQLSGREHAVRRALVAPAFRGSDLE
QKFLPVIERNSRELIDAFRHTGSADIVNDYATRFPVNVIADMLGLDKADHARFHGWYT
AVIAFLGNLSGDPEVAAAGERTRVEFAEYMLPVIRERRANPGDDLLSTLCAAEVDGVR
MSDEDIKAFCSLLLAAGGETTDKAIAGILANLLSHPDQLAAVRADRSLIPAAFAETLR
YTPPVQMIMRQSATDVEVTGGTIPAGATVTCLIGAANRDERRYRDPDRFDIFRDDLAT
TSAFSAAAGHLAFALGRHFCVGALLAKAEVEVGLNQLLDAMPDLRLADGHDLVEQGVF
TRGPKTLPVRFTPVTA
CYP160A1 Streptomyces lavendulae LinA homolog.
GenEMBL AF127374 CDS complement(43595.44782)
gene="mmcN" 39% to D87924 new family
CYP161A1 Streptomyces noursei ATCC 11455 nyst.
GenEMBL AF263912 CDS 57095.58279 gene="nysL"
function="presumably involved in modification of the
nystatin macrolactone ring" 36% to AF127374 = new family
CYP161A2 Streptomyces natalensis
GenEMBL AJ278573 81418..82611
pimaricin biosynthetic gene cluster.
58% to 161A1
gene = pimD
MTAASHDLPCLNLEPPKMLKLSPLLRALQDRGPIHRVRTPAGDE
AWLVTRHAELKQLLHDERIGRTHPDPPSAAQYVRSPFLDLLISDADAESGRRQHAETR
RLLTPLFSARRVLEMQPKVEEAADTLLDAFIAQGPPGDLHGELTVPFALTVLCEVIGV
PPQRRAELTTLLAGIAKLDDREGAVRAQDDLFGYVAGLVEHKRAEPGPDIISRLNDGE
LTEDRVAHLAMGLLFAGLDSVASIMDNGVVLLAAHPDQRAAALADPDVMARAVEEVLR
TARAGGSVLPPRYASEDMEFGGVTIRAGDLVLFDLGLPNFDERAFTGPEEFDAARTPN
PHLTFGHGIWHCIGAPLARLELRTMFTKLFTRLPELRPELPVEQLRLKEGQLSGGFAELRVVW
CYP161A3 Streptomyces nodosus
GenEMBL AF357202 56829..58019
gene = amphL
amphotericin biosynthetic gene cluster
71% to 161A1
note = probably hydroxylates amphotericin precursor at C-8
MVNPTPPPSLEDAAPSVLRLSPLLRELQMRAPVTKIRTPAGDEG
WLVTRHAELKQLLHDERLARAHADPANAPRYVKSPLMDLLIMDDVEAARAAHAELRTL
LTPQFSARRVLNMMPMVEGIAEQILNGFAAQEQPADLRGNFSLPYSLTVLCALIGIPL
QEQGQLLAVLGEMATLNDAESVARSQAKLFGLLTDLAGRKRAEPGDDVISRLCETVPE
DERIGPIAASLLFAGLDSVATHVDLGVVLFTQYPDQLKEALADEKLMRSGVEEILRAA
KAGGSGAALPRYATDDIEIADVTIRTGDLVLLDFTLVNFDEAVFDDADLFDIRRSPNE
HLTFGHGMWHCIGAPLARMMLKTAYTQLFTRLPGLKLASSVEELQVTSGQLNGGLTEL
PVTW
CYP162A1 Streptomyces tendae nikkomycin
AJ250199 CDS 22..1212 gene="nikQ"
38% to AF170880 new family
CYP163A1 Streptomyces spheroides novobiocin
AF170880 CDS 9688..10911 gene="novI"
38% to AJ250199 new family
CYP163A2 Streptomyces roseochromogenes subsp. oscitans
AF329398 15196..16419
Clorobiocin biosynthetic gene cluster
90% to 163A1
gene = cloI
MSTRPTVSPDELEQIDLASPILHAEYELGEVFRYLRANRPMYWQ
QPRGEQPGFWVISRYADVNEVYKDKAHFTTEHGNALATLLTGGDSASGAMLAVTDGVR
HHQVRNLLSKGFSPQMLDLIANSLRETVDGLLLAALDRGECDAAQDIAANVPLGAICD
LLEIPQTDRKYLLGLTAHAWSTDYADETPEEGWVAKNEILLYFSKLLKERRGGDRDDM
VSLLANCRIDGHPLNAAEQVANCYGLMIGGDETGRHAITGTILALIENPDQWRALKNG
DVDLKTATEEALRWTVPSLHGGRKATGDVVINGQQIKAGDVVSVWISSANRDEAIFDA
ADEFKLARTPNKHFTFAYGSHYCLGHYLGRMEVYAVLDGLRRLVGDLEQIGEERWIYS
SILHGMSSLPIRITV
CYP163A3 Streptomyces antibioticus
GenEMBL AF322256 22228..23478
simocyclinone biosynthetic gene cluster
62% to 163A1
gene = simI
MNPRPMLSPDVLEDIDLNDRQLHADYDLSEVWRYLRAERPFYYQ
TARGSQPGFWVVTRHADCTAVYKDKTNFTAERGNVLPTLLAGGDSASRTMLALTDGDR
HTQVRNLLMKAFSPKMLSNIGQSLRTTVDGLLRDAIEKGECDFARDVSGKVPLVAICD
LLAVPQEDREYLLSLTAHALSADEADATAEDNWTAKNEILLYFADLAESRRSSGHNDV
VSLLATSSIEGEPLSDGELMANCYGLMIGGDETGRHAITGGLRALIHHPDQWRMLRNG
EADLQTATEEVLRWTVPSLHGARTATADVVVNGKQQIRAGEIVSVWFASANRDEEVFR
DADRFDLNRTPNKHLTFAFGPHFCLGHYLARMEVEAILDGLRRMVDDIQQTGPEKLIY
SSILQGISSFPALLKPDRRVPPQT
CYP163A3 Streptomyces antibioticus
GenEMBL AF324838 26497..27762
simocyclinone biosynthetic gene cluster
gene = simD1
note="involved in aminocoumarin formation"
same seq as AF322256 except this seq has MKGTM added
at the N-terminal (a disagreement on start codon)
CYP164A1 Mycobacterium leprae cosmid B1788
GenEMBL AL007924 CDS complement(38296-39228)
40% to AL049754 partial
CYP164A1 Mycobacterium leprae
GenPept CAC31043
NC_002677 complement(2481544..2482848)
locus_tag = ML2088
100% match but this is complete
1 MRTCVPTRTC VYAFIEYLSH NRPMGTNPPS LVEAQMLLLR LIDPGTRADP FPVYRALIDY
61 GPMQLPGMPL TVFSSFSDCD EALRHPLSAS DRLKATLAQQ AIAAGAEPRP FYASSFMFLD
121 PPDHTRLRKL VSKAFAPKVV QALEGDIAAL VDSLLDKGAA AGQFDVIADL AFPLAVAVIC
181 RLLGVPYEDA PEFGRVSALL VQSVDPFITI TGEPPEATEE RLRAGVWLRD YLEQLVKCRR
241 GTPGEDLISR LIELDESGDQ LTEEEIIATC GLLLVAGHET TVNLIANAVL AMLRNPSQWK
301 ALSSNPQRAP LVVEETLRYD PAIHLIGRVA AKDMTIGQTT LTEGDTMVLL LAAANRDPAV
361 YSRPDEFDPD RPSSRHLAFA VGSHFCLGAA LARLEATVTL SAISARFPQV QLAGELVYKP
421 NVAMRGMSAL PVQV
CYP164A2P Mycobacterium leprae cosmid B1450
GenEMBL AL035159.1
45% to 164A1 C-term region before heme binding seq
AVDALLCFLSPPGLAGPRFAVTDVEIGQHTVVAGQTVRLYLASANHDPQRFNCTDELEPTRPAPHTA
CYP164B1 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
45% to 164A1 missing about 112 aa at N-term
clone name SP0831
CYP165A1 Amycolatopsis mediterranei
Y16952 CDS 224..1399 gene="oxyA"
85% to AJ223999 CDS 31325..32500 42% to Y16952 CDS 1449..2645
41% to AJ223999 32520..33740
CYP165A2 Amycolatopsis orientalis cosmid PCZA363.
GenEMBL AJ223999 region 31325-32500
van Wageningen,A., Kirkpatrick,P., Williams,D., Harris,B.,
Kershaw,J., Lennard,N., Jones,M., Jones,S. and Solenberg,P.
Sequencing and analysis of genes involved in the
biosynthesis of a vancomycin group antibiotic
Unpublished
CYP165A3 Amycolatopsis orientalis
GenEMBL AF486630
Zerbe,K., Pylypenko,O., Vitali,F., Zhang,W., Rouset,S., Heck,M.,
Vrijbloed,J.W., Bischoff,D., Bister,B., Sussmuth,R.D., Pelzer,S.,
Wohlleben,W., Robinson,J.A. and Schlichting,I.
Crystal Structure of OxyB, a Cytochrome P450 Implicated in an
Oxidative Phenol Coupling Reaction during Vancomycin Biosynthesis
J. Biol. Chem. 277 (49), 47476-47485 (2002)
oxyA
84% to CYP165A2
MFEEKNALRGTEIHRRERFDPGPELRALMAEGRMSVMESEESPG
GRTGWLATGYEETRQVLGSDKFSAKLLFGGTAAGRIWPGFLNQYDPPEHTRLRRMVAS
AFTVRRMRDFRPRIEAVVKATLDDIEATGGPVDFVPRFAWPIATTVICDFLGIPRDDQ
AELSRVLHASRSERSGKRRVAAGNKYWTYMGQVAAKTRRDPGDDMFGAVVREHGDDIT
DAELLGVAAFVMGASGDQVARFLSAGAWLMVEHPEQFAVLRDDPDSVPDWLNEVARYL
TSDEKTTPRIALEDVRIGDQLVKKGDAVTCSLLASNRHRFPDPEDRFDITREKPSHVT
FGHGIHHCLGRPLAEMVFRTAIPALAHRFPTLRLAEPDREIKLGPPPFDVEALLLDW
CYP165A4 Streptomcyes toyocaensis strain NRRL 15009
GenEMBL U82965 AF039028 complement(11438..12613)
72% to CYP165A1
MFEEINVVRASQLHRRDRFDPVPELHSLMKEGGLTVLGTEDSTE
GRTAWLATGIDEVRQVLGSDKFSARLLYGGTAAGITWPGFLTQYDPPEHTRLRRMVVP
AFSHRRMQKFRPRVEQIVQDSLDTIESLGGPVDFVPHFGWAIATPATCDFLGIPRDDQ
ADLARILLASRTDRSDKRRTAAGNKFMTYMKQHVAQSRRGSGDDLFGIVGRENGDAIT
DAELTGVAAFVMGAAADQVARLLAAGAWLMVEQPAQFALLREKPETVPEWLDETMRYL
TTDEKTHPRVATQDVRIGNQLVKAGDTVTCSLLAANRPNYPSAEDEFDITREKAEHLA
FGHGIHHCLGRAMAELMFKVSIPALAHRFPTLRLADPQREITLGPPPFDVEALLLDW
CYP165A5 Actinomadura sp. ATCC 39727
GenEMBL AJ561198 complement(23387..24568)
gene = dbv14
gene cluster for biosynthesis of
glycopeptide antibiotic A40926
function="cross-linking of amino acids 2 and 4"
72% to 165A1
MEVFEELNVVLPGELHWRDRFDPVPQLRSFMAEGPMTELGAEEG
PGGRTAWLATGFDEVRQVLGSDKFSSRLLYGGTAAGIVFPGFITQYDPPEHTRLRRVV
SPAFTVRRMERFRPQVDQVVEDCLDAIESIGGPLDFVPHFGWSIATTATCDFLGIPRD
DQAELSRSLHASRSQRAASRRGAAGNKFMTYMGQVVARTRRDPGDDMLSVVVREHGDE
ITDAELTGLAAFVMGAGGDQVARFLAAGAWLMAEVPEQFALLRDKPDVVPDWLEEMVR
YLTIDEKLTPRIALEDVRIGDRIVKAGDTVTCSLLGANRRHFPGPDDQFDLTRDRAPN
VAFGHGIHHCLGRPLAELIFRSAIPALARRFPALRLAEPEQEIRLGPPPFDVKALLLDW
CYP165B1 Amycolatopsis mediterranei 9.9kB DNA
Y16952 CDS 1449..2645 gene="oxyB"
87% to AJ223999 32520..33740 47% to Y16952 CDS 2795..4015
47% to AJ223998 gene 1 53% to U84350 partial
CYP165B2 Amycolatopsis orientalis cosmid PCZA363.
GenEMBL AJ223999 region 32520-33740
van Wageningen,A., Kirkpatrick,P., Williams,D., Harris,B.,
Kershaw,J., Lennard,N., Jones,M., Jones,S. and Solenberg,P.
Sequencing and analysis of genes involved in the
biosynthesis of a vancomycin group antibiotic
Unpublished
CYP165B3 Amycolatopsis orientalis
GenEMBL AF486630; AAL90878.1
Zerbe,K., Pylypenko,O., Vitali,F., Zhang,W., Rouset,S., Heck,M.,
Vrijbloed,J.W., Bischoff,D., Bister,B., Sussmuth,R.D., Pelzer,S.,
Wohlleben,W., Robinson,J.A. and Schlichting,I.
Crystal Structure of OxyB, a Cytochrome P450 Implicated in an
Oxidative Phenol Coupling Reaction during Vancomycin Biosynthesis
J. Biol. Chem. 277 (49), 47476-47485 (2002)
Zhang W., Zerbe K., Vrijbloed J.W., Robinson J.A.
DNA sequence coding for P450 monooxygenases of vancomycin producer
87% to CYP165B1 and 85% to CYP165B2
MSEDDPRPLHIRRQGLDPADELLAAGALTRVTIGSGADAETHWM
ATAHAVVRQVMGDHQQFSTRRRWDPRDEIGGKGIFRPRELVGNLMDYDPPEHTRLRRK
LTPGFTLRKMQRMAPYIEQIVNDRLDEMERAGSPADLIAFVADKVPGAVLCELVGVPR
DDRDMFMKLCHGHLDASLSQKRRAALGDKFSRYLLAMIARERKEPGEGMIGAVVAEYG
DDATDEELRGFCVQVMLAGDDNISGMIGLGVLAMLRHPEQIDAFRGDEQSAQRAVDEL
IRYLTVPYSPTPRIAREDLTLAGQEIKKGDSVICSLPAANRDPALAPDVDRLDVTREP
IPHVAFGHGVHHCLGAALARLELRTVFTELWRRFPALRLADPAQDTEFRLTTPAYGLT
ELMVAW
CYP165B4 Streptomcyes toyocaensis strain NRRL 15009
GenEMBL U82965 AF039028 complement(9077..10273)
77% to CYP165B1
MSGDDRPPIHTLRQGFDPADELRAAGELTRVRLGSGADAEHTWL
ATGHDVVRQVLGDHTRFSTRRRFDRNDEIGGKGVFRPRELVGNLMDYDPPEHTRLRRL
LAPGFTHRKIRRMAPYIEQIVTERLDEMEREGSPADLIELFADEVPGPVLCELLGVPR
DDRAMFLQLCHRHLDASLSGRRRAAAGEAFSRYLVTMVARERKDPGDGLIGMVVAEHG
DTVTDEELRGVCVQMMLAGDDNISGMIGLGVLALLRNPEQIAALRGDVPAAERAVDEL
IRYLTVPYAPTPRTAIEDSTVGDQVIKAGETVLCSLPTANRDPALLPDADRLDVTREA
VPHVAFGHGVHHCLGAALARLELRIAYTALWRRFPDLRLADPDGATEFRLSTPAYGIS
RLMVTW
CYP165B5 Streptomyces lavendulae
GenEMBL AF386507 42222..43493
complestatin biosynthetic gene cluster
58% to 165B1
note = ORF11; comO2
MPQQAQRQAPQQQPRAQQAYPELLYTRRTRFDPADDLRAAPPLS
RYVIGPNESDEWVWLATGYTEVRRILGDHTNFSTRRRWGAEGPNWRPPELVGHLMDYD
PPEHTRLRQMLTPEFTVRRLRRLEPDITAIIEEHLDTVEATGPGADLMPLFAQPVPGE
VLCELIGVPRDDRPEFLRHCHRHLDFSRSRKVRAADGAAFSRYLVSMVARQRKDPDDG
FIGALVREHGDDFTDEEMRGVCVLLILAGIDNIEGMIGLGVLAMLENPDQLPLLLGER
DSTGGPGAGKGDGGRLASDRALDELIRYMSVANAPTPRTAVNDVRIGDQLIKAGETVI
CSLTMANRDPALTDGPDRLDLAREPVAHVAFGHGVHHCLGAALARTELRIAYKALWRR
FPELRLAVPVEEVRFYNRALAHGVHRLPVAW
CYP165B6 Actinomadura sp. ATCC 39727
GenEMBL AJ561198 complement(21025..22221)
gene = dbv12
gene cluster for biosynthesis of
glycopeptide antibiotic A40926
function="cross-linking of amino acids 4 and 6"
84% to 165B2
MSGDGARPLHTRRQDLDPADELRAAGTLTRITIGSGADAETTWL
ATGYTVVRQVLGDHRRFSTRRRWNERDEIGGRGNFRPRELVGNLMDYDPPEHTRLRQK
LTPGFTLRRIRRLKPYIEQIVTERLDALERAGPPADLVELVADEVPGAVLCELIGVPR
DDRAMFMQLCHGHLDASRSQKRRAAAGAAFSRYLLAMIARERKDPGEGLLGAVLAEYG
DTATDEELRGFCVQVMLAGDDNISGMIGLGVLALLRHPEQIAALQGDDQSADRAVDEL
IRYLTVPYAPTPRVAMEDVTIGGQVIKEGETVSCSLPMANRDPALLPDAGRLDVRREP
VPHVAFGHGVHHCLGAALARLELRTVYTALWRRFPTLRLADPDREPSFRLTTPAYGLT
SLMVAW
CYP165C1 Amycolatopsis mediterranei 9.9kB DNA
Y16952 CDS 2795..4015 gene="oxyC"
90% to AJ223998 gene 1 89% to U84350 partial
47% to AJ223999 32520..33740
47% to Y16952 CDS 1449..2645 40% to AJ223999 CDS 31325..32500
CYP165C2 Amycolatopsis orientalis hypothetical
U84350 CDS <1..935
92% to AJ223998 gene 1 new family
CYP165C3 Amycolatopsis orientalis cosmid PCZA363.
GenEMBL AJ223999 region 33791-34244 incomplete
This gene continues on AJ223998
van Wageningen,A., Kirkpatrick,P., Williams,D., Harris,B.,
Kershaw,J., Lennard,N., Jones,M., Jones,S. and Solenberg,P.
Sequencing and analysis of genes involved in the
biosynthesis of a vancomycin group antibiotic
Unpublished
CYP165C4 Amycolatopsis orientalis
GenEMBL AF486630
Zerbe,K., Pylypenko,O., Vitali,F., Zhang,W., Rouset,S., Heck,M.,
Vrijbloed,J.W., Bischoff,D., Bister,B., Sussmuth,R.D., Pelzer,S.,
Wohlleben,W., Robinson,J.A. and Schlichting,I.
Crystal Structure of OxyB, a Cytochrome P450 Implicated in an
Oxidative Phenol Coupling Reaction during Vancomycin Biosynthesis
J. Biol. Chem. 277 (49), 47476-47485 (2002)
OxyC
93% to CYP165C2 83% to CYP165C3
MGHDIDQVAPLLREPANFQLRTNCDPHEDNFGLRAHGPLVRIVG
ESSTQLGRDFVWQAHGYEVVRRILGDHEHFTTRPQFTQSKSGAHVEAQFVGQISTYDP
PEHTRLRKMLTPEFTVRRIRRMEPAIQSLIDDRLDLLEAEGPSADLQGLFADPVGAHA
LCELLGIPRDDQREFVRRIRRNADLSRGLKARAADSAAFNRYLDNLLARQRADPDDGL
LGMIVRDHGDNVTDEELKGLCTALILGGVETVAGMIGFGVLALLDNPGQIELLFESPE
KAERVVNELVRYLSPVQAPNPRLAIKDVVIDGQLIKAGDYVLCSILMANRDEALTPDP
DVLDANRAAVSDVGFGHGIHYCVGAALARSMLRMAYQTLWRRFPGLRLAVPIEEVKYR
SAFVDCPDQVPVTW
CYP165C5 Streptomcyes toyocaensis strain NRRL 15009
GenEMBL U82965 AF039028
complement(6312..7490)
70% to CYP165C3
MRRTLCDPHEDMFALRAHGPLIRIEGNASDQMSTDYVWQAMGYD
VVRKILGDHENFTTRLRLTDAQPLSGEGVSVPPELAGQISIYDPPEHTRLRRMLTPEF
TVRRIRRLEPAIEGIIEEHLDALEGAGPPADLQVLFADPVGGETLCELLGVPRDDRNE
FIRRVRQNVDLSRGYKARAADSAAFNRYLMTLITRQRKDPDEGFLGMLVREHGDRITD
EELKGVCTALILGGVESVAGMIGFGVLALLEHPDQRRLLFGSREEADRLVNELLRFLS
AVQQPTPRMAVRDVVVEGQLIKAGEYVLCSILMANRDEGLTSDSHLLDANREPLPHVA
FGHGIHHCIGAAVARAVLRITYQSLWRRFPRLSLAVPAGEVKFRNAFIDSPDRLPVTW
CYP165C6 Actinomadura sp. ATCC 39727
GenEMBL AJ561198 complement(19681..20943)
gene = dbv11
gene cluster for biosynthesis of
glycopeptide antibiotic A40926
function="cross-linking of amino acids 5 and 7"
76% to 165C5
MRIDSEWSFDPGMDDDIDAGAPVLQPTANYMMRTHCDPHEDMFA
LRAHGPLVRIGGDAATQLRVDYVWQALGYDVVRRILGDHENFTTRPRWSSAPSIAGEP
IPPNLVGQLSVYDPPEHTRLRGMLTPEFTARRIRRLEPAMQDLIDDRIDELEAAGPPA
DVQALFADPVGGGVLCELLGIPRDDRIEFIRRVRQNVDLSRGFKARAADSAAFNRYLN
GLIIRQRKDPDEGFIGMLVREHGDDVTDEELKGVLTALILGGVETVAGSIGFGVLALL
DHPDQRQSLFAGREEADRVVGELLRFLSPVQQPNPRLAVRDVVVDGQLIKAGDYVLCS
ILMANRDEALTPNANVLDVRRDCGSHVGFGHGIHYCIGAAIARTLLRMAYQSLWRRFP
GLRLAVSAEEVKFRNAFIDCPDELPVTW
CYP165D1 Streptomcyes toyocaensis strain NRRL 15009
GenEMBL U82965 AF039028 complement(10263..11417)
45% to CYP165A3
MALPLPHRRHRLDPVPEFHDLQNEGPLHEYDTEPGMDGRKQWLV
TGYHEVRDILADPERFSSMRPVDDEADRALLPGILQAYDPPDHTRLRRTVAPAYSARR
MERLRPRIEEIVEECLDDLEDVGSPVDFVRYAAWPIPALIACEFLDVPRDDRAELSRM
IRESRESRLPRQRTSSGMGVVNYTQKLAARKRLDPGEGMIGVIVREHGAEVSDEELAG
LAEGNLIMAAEQMAAQLAVAVLLLVTHPDQMALLREHPELVDGATEELLRHASIVEAP
APRVALEDVSVAGRDIRAGDVLTCSMMAVNRPQGEHFDITRENPKHMAFGYGIHHCLG
APLARLQLRVALPAVLRRFPSLRLAVPEEDLRFKPGRPAPFAVEELPVEW
CYP165D2 Actinomadura sp. ATCC 39727
GenEMBL AJ561198 complement(22211..23365)
Gene = dbv13
gene cluster for biosynthesis of
glycopeptide antibiotic A40926
function="cross-linking of amino acids 1 and 3"
75% to 165D1
MVVPLPHQRLRLDPVPALFDLQEDGPLHEYDTEPGLDGHKQWLV
TGYGEIREILADANRFSSMRPVEDEAERAWLPGILQSYDAPDHTRLRRTVTRANTARR
IESLRPVVEETVEDCLADLESMGSPVDFVRNAAWPIPALIACDFLGVPRDDQAELSRM
FRDSRESRVPRQRNVSGLGIVDYARKLAARERLDPGTGMIGGIVREHGGEVTDEELAG
LVEGIMIGAVEQMASQLAIAVLLLVTHPDQMALLRERPELADSAAEEVFRYASIVETP
SPRTALVDTRLAGRDIHAGDVLTCSILAGNRAREDRFDLTRGNPEHLAFGHGVHFCLG
APLARLQAQVALPALVRRFPSLRLAVPAEDLRFKPGKPAPFAVEELPVEW
CYP165E1 Streptomyces lavendulae
GenEMBL AF386507 41015..42208
complestatin biosynthetic gene cluster
45% to 165B1
note = ORF10; comO1
MASRDVPVYNRRDRLDPVPELVELRNRCPVLRTELHGGPSSQVV
GWLVTGIDESREVLSDQHRFTMLPPADTEAQSRRLQNIGNPLHYDPPEHTRLRKMLNP
EFTMRRLRRLQPRIDAVVEECLDAMEQAGAPADLMQHFAWQIPGHTACELLGVPRDDR
AELSRHLDITRDDGRGRARQMAAGRAYRAYFHQLTARQRRDPGDDLLGMLVREYGDEI
TDEELEGLAASLTSAGIENVASMLGLGTLVLLEHPDQLAELREKPELIDRAVEELLRH
VSVIPTLSPRTALEDVPLGGHVVPKGERVICSAFAANRIATPGDDLEDGFDITREPAP
HMAFGHGVHHCLGAPLARMQLRTAYQALWRRFPELRLAVPHEEIRFRMPSSRVYSVDA
LPVAW
CYP166A1 Amycolatopsis mediterranei rifamycin.
AF040570 CDS 2652..3842 identical to AJ223012
similar to Streptomyces Sp. ChoP"
probably a new family 41% to 105C1 but less to other 105s
CYP166B1 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
46% to 166A1
clone name SP0879
CYP167A1 Sorangium cellulosum = Polyangium cellulosum
AF210843 CDS 62369..63628 gene = epoF
35% to 107B1 new family
CYP167A1 Sorangium cellulosum = Polyangium cellulosum
GenEMBL AF217189 56757..58016
epothilone gene cluster
gene = epoK
note = P450 epoxidase
CYP168A1 Pseudomonas aeruginosa
NC_002516 complement(2792798..2794132)
locus_tag = PA2475
35% to 107P1
MDDAFSEEGSAQPRHDAQRPALAPRSDGFDIHTYHPDFVADPYP
LLRLIRSRAPVCRDQASIWWISRYADVSACLRDRRFSADPARLGAAGVRQGGASWFGH
QQLQPLARFYDNFMLFNDAPRHTRLRRLFAPAFGPDAVRRWEARIEVLVEELLDSLLE
RREPDLLRDFAEPLTIRVAAELFGFPREDTGQLLPWGRDLAAGLDLAASHGDAGQINR
SAVAFSDYLQRQARGWSDGSSRPPSGAAPSILDGAAMLEAGLGLEDLVAAYAMVFMAA
FETTISMVGNATLALLTHPDQLDLLRRCPELAANAVEELLRFDGAVRGGVRCTLEEVE
IGGQRIPPGEKVWLSFLAANRDPEMFAAPDRLQLQRANAKQHVAFAHGPHYCLGAYLA
RLELQCALRGLVRRRFALASEPTDLRWRRSSVFRTLERLPIVPEGDAQKTCE
CYP169A1 Pseudomonas aeruginosa
NC_002516 complement(4121113..4122390)
locus_tag = PA3679
MQQTIDCPIRRRLAHLPWANDGRAGVRHWLEMQRDPLAWLQKMH
VAQPDLAVARMGPQRLWCLFHPQAVQELMVDRRDDLQRWQPALCMLKQWNGRSFMMRE
GAPAQARRKEVRPHLAPPPASEVRRLAAEWGERVEEGREYDLDLEMAAFSVTLSGHAL
FDVDLQPSAYRIAKAVRLLSRVALLEMSTGLPLGHWFPSKLCPRKRWALGQLREAVGE
VAERSPRPLADLRDELCTLLMASHQSTGVTLTWSLLLLAQRPELLARLRAELAGVNWT
AIRSVADLRDCALLRAVLQECLRLYPPAYGLAPRQVTADIEVFGQRLKRGDVTMVSSW
ITQRDPRWFEAPLEFRPERFLEPARWPRGAYFPFGLGDRACPGTAMAMIDLAAALAYW
VEHWDIMHDGDLAPRGWFSLRPQRARVRFRRRA
CYP170A1 Streptomyces coelicolor cosmid 7E4 gene="SC7E4.20
GenEMBL AL359214 CDS 18278..19663
32% to Streptomyces avermitilis CYP171A1
cloned and expressed by David Lamb and Steve Kelly
CYP170A2 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV3031 79% to CYP170A1 from Streptomyces coelicolor
CYP171A1 Streptomyces avermitilis polyketide synthase gene cluster
GenEMBL AB032367 CDS complement(31914..33284)
gene="aveE"
note="probably catalyzes furan ring formation at C6 to C8
32% to CYP170A1
CYP171A1 Streptomyces avermitilis
GenEMBL AB032367 CDS complement(31914..33284)
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV941_aveE see above entry
CYP171A2 Streptomyces nanchangensis
GenEMBL AY129009
complement(2614..3993)
MeiE; involved in meilingmycin biosynthesis
62% to 171A1
MPAPLQVPDVPGSWPVVGHLPQLARRPLDFLSSLADHGDLVRIR
LGRKPVYVATHPDLVRSLLVTDAHAYTRGAGHAKALAFIGPILVATTGEPHRRQRRMM
QPCFHRQRLGSYVSAMCSAATETADSWSAQDVVDVVPVMTELATAMIAKSLFVSERAA
HAEAELRKTGNAILTVARMSAILPGIYRRLPTPGNRQLPPARTVIEETIAAYRAEGQD
HGDMLSTLLRTTDATGTGLTDEEIRDEVMGLAITGIGGPAAIASWIFYELGQNPDLER
RLHEELDTVLDGRPPSSQDLTRLVFTQRLVKEALRKYPGWVGARRTRESVRLGGHEIP
ADAEVMYSAYALQNDPRWYPDPERFDPDRWDPQQNATRVKKGAWVPFSGGVYKCIGDA
FTETETAVAVAVIASRWRLRPADGRSVRASHLATHVVPRPLRMVVEPRSRNKDEAERS
PHEAPQVSR
CYP172A1 Campylobacter jejuni
GenEMBL AL139078 comp(120285-121649)
Parkhill,J., Wren,B.W., Mungall,K., Ketley,J.M., Churcher,C.,
Basham,D., Chillingworth,T., Davies,R.M., Feltwell,T., Holroyd,S.,
Jagels,K., Karlyshev,A., Moule,S., Pallen,M.J., Penn,C.W.,
Quail,M., Rajandream,M.A., Rutherford,K.M., VanVliet,A.,
Whitehead,S. and Barrell,B.G.
The genome sequence of the food-borne pathogen Campylobacter jejuni
reveals hypervariable sequences
Nature 403 (6770), 665-668 (2000)
only 28% to 102A3
CYP172A1 Campylobacter jejuni subsp. jejuni NCTC 11168
GenPept CAB73835
100% match
1 MSECPFFPKP YKNKASTLLT FLLKRRSWLD GLYERSYKMQ TGYVKMPNFD LYVINDTKEV
61 KRMMVDEVRE FPKSAFLHEL LSPLLGESIF TTNGEVWKKQ RELLRPSFEM TRINKVFNLM
121 SEAVADMMDR FSKYPNHAVI EVDEAMTFIT ADVIFRTIMS SKLDEEKGKK ILNAFVTFQE
181 QSVHTAMRRM FRFPKWLSYV LGDCKRAKAG DVIRQVLSDI IKPRYDMADN AEFEDILGSL
241 LLVVDADTNK RFSFEEILDQ VAMLFLAGHE TTASSLTWTL YLLSLYPKEQ EKAYEEITQV
301 LQGGVIEISH LRQFKYLTNI FKESLRLYPP VGFFAREAKK DTQVRDKLIK KGSGVVIAPW
361 LIHRHEEFWT NPHGFNPSRF EGEYKKDAYL PFGVGERICI GQGFAMQEAI LILANILKTY
421 KLELEEGFVP DVVGRLTVRS ANGMRIKFSK RKL
CYP173A1 Mesorhizobium loti
GenEMBL AP003005.1 327049-328782
only 30% to 172A1
CYP173A1 Mesorhizobium loti
GenPept NP_105897
NC_002678 complete genome 4126331..4127698
100% match
1 MDTQPAPFVP PAPKPRTSPP STLEMIRIVY RNPLELWGEP TYNEPWISAN GVGGHLIVAN
61 DPGLIRHVLV DNAKNYKMAT VRQKILRPIL RDGLLTAEGE VWKRSRKAMA PVFTPRHIFG
121 FAQPMLKRTK EFVTRYEEGG ASDIAHDMTL LTYDILAETL FSGEIAGEPG SFANEIDRLF
181 ETMGRVDPLD LLRAPDWLPR LTRIRGRKTM AYFRKIVTDT VKMREEKFRR DPDAVPQDFL
241 TLLLKAEGPD GLTRSEVEDN IITFIGAGHE TTARALGWTL YCLAESPWER NRVEQEIDEV
301 LAREPDPTKW LDAMPLTRAA FDEALRLYPP APSINREPIE PEMWKDLYIP RHAAVLVMPW
361 VVHRHRKLWD RPDAFLPERF HPGNREKIDR FQYLPFGAGP RVCIGASFAM QEAIIALAIL
421 LSRFRFDTTA ETKPWPVQKL TTQPQGGLPM QVTPR
CYP173A2 Sinorhizobium meliloti 1021
Genpept CAC41447
NC_003047 65218..66618 = 173A2
locus_tag = SMc02579
62% to CYP173A1 Mesorhizobium loti
MDTRPEPFEPPAPVPRTGIPSRLEIIRTVLRNPLELWGEPSYTL
PWIETKFINQRTLIVNDPGLIRYILVENAANYEMSNVRRLILRPILRDGLLTAEGEVW
KRSRKAMAPVFTPRHAQGFAGQMLRVCEAFVDRYAGASSEPFVTNVAVDMTELTFEIL
AETLFSGEIAVEKQGFAANVEELLHRMGRVDPMDLLVAPSWVPRLTRIGGRKVLDRFR
GVVSETMSLRRRRTTEAPGDVPNDFLTLLLQLEGPDGLSTSEIEDNILTFIGAGHETT
ARALAWCFYCVANTPAYRETMEQEIDSVLASGADPVDWLGRMPHVLAAFEEALRLYPP
APSINRAAIEEDAWTSPEGERVPIRKGISVLVMPWTLHRHALYWQKPRAFMPERFLPE
NRDKINRFQYLPFGAGPRVCIGATFALQEAVIALAVLMHRFRFDLTDETHPWPVQRLT
TQPRGGLPMKVSARVK
CYP173B1 Magnetospirillum magnetotacticum
GenEMBL AAAP01002719
42% to 173A1 missing N-term 91 aa
1466 NGLLTAEGDEWRLQRRTLAPIFSARHVAGFVAQMDAAGARLGRRLARRDGATVDIALEMT 1287
1286 RATLDVLERTIVHAKGLPGPIPDALGRAITRLLESVGPIDP 1164
1185 SDVFGFPAFVPRLGRLRAGRHCVSSRRW*HLLDGRKQALARGEAPHDLM 1015
1014 TLLLAAQDPETGRGLSDIEVKANIVTFIAAGHETTANALTWALYCLSQDGAARARVEAEA 835
834 DAAAGPEGNLRLDRLPFTKAVMEETMRLFPPVPFLSRQALRDDRIGRVKIPRNSTVIV 661
660 APWVMQRHRKLWDEPDAFIPDRFFGSRRESIERYAYLPFGAGPRVCIGQSFSVQEATLVL 481
480 AHVARAVRFTLPDEHPPVTPLHRVTLRPKDGLRMVARRRM* 358
CYP174A1 Halobacterium sp. NRC-1
GenEMBL AE004998 comp(560-1777)
only 33% to 171A1
CYP175A1 Thermus thermophilus
GenEMBL AB001637 N-terminal 30 amino acids only 1587-1677
AX451783 full seq
Francesca Blasco francesca.blasco@po.uni-stuttgart.de
Full sequence submitted to nomenclature committee 9/17/2001
Crystal structure known
CYP176A1 Citrabacter braakii
GenEMBL AF456128
David Hawkes and James De Voss, University of Queensland, Australia
Full sequence submitted to nomenclature committee 11/15/2001
P450cin involved the hydroxylation of cineole
CYP177A1 Rhodococcus rhodochrous
No accession number
Helena Seth-Smith
submitted to nomenclature committee 7/30/2002
degrades an explosive to confer RDX degrading phenotype
alternative name XplA (for explosive gene A)
CYP178A1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV838
CYP179A1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV2061
CYP180A1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV2165
CYP181A1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV2385
CYP182A1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV2806
CYP183A1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV2999
CYP184A1 Streptomyces avermitilis
No accession number
Submitted by David Lamb and Haruo Ikeda 9/3/02
Clone name SAV5111
CYP185A1 Streptomyces tubercidicus strain I-1529
No accession number
Istvan Molnar, Syngenta Biotechnology, Inc.
Submitted to nomenclature committee June 2, 2003
Clone name CypLB
Low 30% range to other bacterial P450s
CYP186A1 Nostoc sp. PCC 7120 (cyanobacteria same as Anabaena)
GenPept BAB73318
NC_003272 complete genome complement(1615917..1617470)
24% to 172A1
1 MLQYITAQID NSSSFPYLVT VLSVTTIAGT FAWRWWKQKK KYKSLQSLPS PPKHWLLGNL
61 PQVLAAVKQK KLFQLFFDWS QQLGPMYVVW NGSSPVVILS KPKVIEDTIV NGMRDGSLIR
121 SARLRQAWND ISGPILIGET GNEWQWRRKV WNPEFSSSSL AKYLKIINQA CVQVIDTLKE
181 TALPKEVEVD PLFVELTMRV ISSLVLGIPV DRKITTNEGP PLEVLKVYEA MCVVGYRFLR
241 QATGEKIWMK YLPTKNSQDY WASRRYLEEF LTPRVDLALQ MREQSTDFPQ VSPLFRESML
301 VRIAAKEPKY NRQTLIAESV EFLIAGTDTT AHTLSFAVGE LSLNPRVFQK AREIVDQAWQ
361 GQDNINTESF KELAYISAIL KETLRLYSIA SGSTSLEAQR DTVIEGKVIP SGTRISWSML
421 AAGRDPEVYA NPEEFLPERW LDKSKETSSL PMIDFGSGPH RCLGEHLSML EGTMMLALLL
481 RHFDWELVNG RSSLEQLQQN LLIYPSDKMP VRFRLRN
CYP186A2 Nostoc punctiforme
GenEMBL NZ_AAAY02000016.1
complement(17366..18917) gene = Npun4356
77% to 186A1
MFQQIAAQITFSDSFPYLVTALGITSTAGIFGWRWWKQKNTYKS
LQSFPSPKRHWLLGNIPQVLAAVKEKKFFQLLFDWSQQLGPMYVYWTGFP
18635 VLVLSKPKVIEDTIVNGMRDGSLIRSQRASKAWNDIGGPILLGQNGSEWQYRRKAWNPEF 18456
18455 SSSGLSKYVEIINQACEQIIEKIQSVASPEVQVDPLFVELTMRVISCLVLGIPVDKN 18285
18284 IATNEGQPLDVLKVYEAMSIVGYRFLRVATGEKIWMKYLPTKNSRDYWAARRYLEEFIT 18108
18107 PRVDLALQMREQNQTDLTQVSPLFQESMLVKIAAKEPKYNRETLVAEVIELLIA 17946
17945 GTDTTAHTLSFAIGELALNPRVFHQAQAVVDQVWESQGTINGESLKELNYIRAILKETLR 17766
17765 LYSDDS 17748
XXXSLXAQR
DTVIEGTVIPRGTKIYWSMLAAGRDPEVYSHPDEFLPERWLEKGKEN 17579
17578 SQLPMIDFGSGSHRCLGEHLSMLEGTMMLALLVYYFDWELVNGRSSLEQLQQNLLIYPPD 17399
RMPVRFRLRK* 17366
CYP187A1 Deinococcus radiodurans
GenPept AAF11281
GenEMBL NC_001263 complement(1748793..1750130)
Gene = DR1723
25% to CYP120
1 MVPAPPFLGH AAEMGTIKLR PFLTRCYQAY GPVFQLTVPG QKITVLAGPE ANLFAMKEGH
61 RVLRSLEAWR DNDHEMGSDR SMISLDGAEH RAYRRVEGRA FARSFFAAGL RPALAVLAED
121 LAPFQPGDVL PVATWCKKTI TEQLARMAVG GTVRPYLPDL LHFIQTALQV TVNRQLPPAV
181 LRLPKYRRAK ARIFQMVDDL IEDHRQNPPE KSGRAPDLID DVLADQQVNP ERWEHPDVRL
241 AALGAFIAGM DTAANSLAFV LYRMHLHSEF LPALRAEADA LFRDGPPTAE ALGRSPLLHR
301 FVMETLRVHP IAPALSRTLT EDVEFAGHRI PAGTPVIIGT TVPHGLPELF PDPEHFDPGR
361 FAPGRAEHRQ PGAYAPFGVG SHTCAGSGMA EGLIMLGAAA ALRTLDLSLE PDYVLRQTAK
421 PTPSLDNKLQ LRVNAVRHNP VFLVH
CYP188A1 Deinococcus radiodurans
GenPept AAF12016, F75270
GenEMBL NC_001263 2469804..2470949
Gene = DR2473
30% to 175A1
1 MLSSLHDLPE PASRPGSGHL QDWAARPLAL IEEGATQALA AGQDLFRLRL GLPAVVGLSP
61 AWNRRVLTDL NTFVSAGSFS AVVPYLAGGV ILTDAPGHGA RRRALNPGFG KGSVQQLRER
121 MRQASSPVPT GRFNALAWAD ETVRRQLNAA YFASEFDDRL LAAFLAPLRR PFPVPALPRP
181 LLFRRVEQEI RRLAERRLRE GGDDLLSTLA PLPGGLLETR ISLAAAHDTT THALAYAVWE
241 LAKAPQDQTP HTHSAVLKEV LRLYPPGWMG SRRLSRAAEW QGTEIPRGTL ALYSPYLTGR
301 DPTLWERPLD FRPERWEKSP PAWAYLPFGG GERTCLGVHL AQTLILDVLA ELPPLQAHWG
361 NDEPHPGITL GPRGPLVVER R
CYP189A1 Corynebacterium efficiens YS-314
GenPept BAC17372
35% to CYP116 82% to CYP189A2 NP_599791
1 MIHNMETRCG SYHRATGILT GDDMTSDSTT ALSTAPETTS GGCPYGHGNP DATGTPGTSH
61 HGYEPFNMTN PFPAYEELRR EEPVMFDERI GYWVVTRYDD IKATFDDWET FSSENAQAPV
121 RKRGPQATKI MEEGGFTAYS GLSARIPPEH TRIRAIAQKA FTPRRYKALE PDIRANVVAR
181 LETMLKQGAP ADIVPALAYD IPTITILTLI GADVSMVDTY KRWSDSRAAM TWGDLSDEEQ
241 IPHAHNLVEY WQECQRMVAD AHANGGDNLT ADLVRAQESG QEITDHEIAS LLYSLLFAGH
301 ETTTTLISNC FRVLLAHRDQ WEALIEDPKK IPAAIDEVLR YSGSIVGWRR KALRDTEIGG
361 QPIKKGDGVL LLMGSANRDE ARFDDGETFD ITRPNAREHL SFGFGIHYCL GNMLAKLQAK
421 ICLEEATRLV PSLELADDQS VEFRENLSFR VPVSVPVTWS N
CYP189A2 Corynebacterium glutamicum ATCC 13032
GenPept NP_599791
35% to CYP116 82% to BAC17372
1 MTSQTSQQST STGGCPFGHT SESTSHHGYQ PFDMHNPFPA YKELRQEEPV MFDERIGYWV
61 VTKYDDIKTT FDDWETFSSE NAQAPVRKRG PQATQIMTDG GFTAYSGLSA RIPPEHTRIR
121 AIAQKAFTPR RYKALEPDIR AMVIDRVEKM LANDQHVGDM VSDLAYDIPT ITILTLIGAD
181 ISMVDTYKRW SDSRAAMTWG DLSDEEQIPH AHNLVEYWQE CQRMVADAHA HGGDNLTADL
241 VRAQQEGQEI TDHEIASLLY SLLFAGHETT TTLISNCFRV LLDHPEQWQA ILENPKLIPA
301 AVDEVLRYSG SIVGWRRKAL KDTEIGGVAI KEGDGVLLLM GSANRDEARF ENGEEFDISR
361 ANAREHLSFG FGIHYCLGNM LAKLQAKICL EEVTRLVPSL HLVADKAIGF RENLSFRVPT
421 SVPVTWNA
CYP190A1 Brucella melitensis 16M
GenPept AAL54121
PUTATIVE CYTOCHROME P450 YJIB
31% to CYP145
1 MRAGGIMTSP TERPKQDWDP RSEAVLSDQI GAYDAMRHQC PVAHSDYLGW SLFSYDDVVR
61 VLDDHETFSS VVSAHLSVPS GMDPPQHTAF RQLVERYFEP ERIKAFEPIC REISKKLVCE
121 LPRDAEIDLV TQFAQLYAVR IQCAFLGWPD SLQGPLLDWV HKNHAATLAR DTKAMAAIAL
181 EFDEYIRDLL DERRKLGADA PDDVTTRLLR DRIDRRNLTH EEIVSILRNW TVGELGTITA
241 CVGILCHYLA KSQQTQALMR GGPDLLPAAI DEILRLHAPL ISNRRVTTRA VVVGGREIPA
301 GEKITLMWAS ANRDEAVFDK PDELRLNRDP ALNLLYGRGI HVCPGAELAR AGLRILMEEL
361 LGQTRKLDLV PGSVPALAVY PASGFSRLPA RIS
CYP191A1 Caulobacter crescentus CB15
GenPept AAK22930
NC_002696 complete genome 1051211..1052545
34% to CYP180A1
1 MDLISQTVVD GKAGPGAPPT YPTLKDVDLA DIFRFTKGQP WADFARMRQE APVMWHPEPM
61 GGPGFWALTR YEDVHRVNGD PETFSSQRGG ILMSMGAPEK RHALLFRASM DTMINMDAPH
121 HLQLRREHMP YFTPSYLRGL TERVKGEVTR LLDEMEPLLA NGAEIDMVEH FSSVLPLFTL
181 CEILGVPPED RPKFLTWMHY LERAQDLAVK QANAPMQPTL ELMQFVMDFN NNVEEMFEYG
241 RTMLHKRRED PKEDLMTAIA RAQLDGAVLP DEYLDGSWLL IVFAGNDTTR NTLSGAMRLL
301 TEFPDQKQKL IADPSLLGGA VDEFIRMVSP VVYMRRTATR DVEVNGQLIR EGEKAIMYYG
361 AANRDPAMFE NPDQLDVTRA NAGKHIAFGY GPHTCIGKRV AQIQLEEAYR QILARFPDLN
421 WTGNIEIAPN NFVHAISKLG VKRG
CYP192A1 Caulobacter crescentus CB15
GenPept AAK24959, C87620
NC_002696 complete genome 3214816..3216192
35% to 173A1 40% to BAC45822
1 MDADVRSAPL IPPAPKVHPR QLGGSFVGEL RIALEMSRNL MGAWCEEDFD NLFTPYVFMG
61 QPGMVVSDPA AARRILSSPN YVRPVKAARS VRPIAGDGLL LSEGETWRRQ RKSLAPVFTP
121 MAVEGLLPHF VAAGASLAEA LSGHARADLS EAFHHATLDA VLSALFSRRA DAQGDQLAYM
181 VRRYMEGPAH FNLMDFVSRG ADDLTFLDVE RRRQGAAWFQ AVEHLIAQRQ AHPHAEARDL
241 LDRLLAARDE DGAPLSNQEI RDQCGTMLVA GFETTSRLLF WATYLLALDP ATQDRLRAEV
301 LAAPAAAVRT LDDLQAWPLM RSVLFETLRL YPTAPLLARE AIGPDTVMGH AVVPGQIITI
361 SPWLIHRHRK LWDAPTAFVP DRFIDQPHPW GIEAFLPFGA GPRVCIGASF ALAEAQIVLA
421 SLLERFEIGL VSDRPVIPIA SITLGPDHAP AFTLTPVS
CYP193A1 Bradyrhizobium japonicum USDA 110
GenPept BAC52277
NC_004463 complete genome complement(7721594..7723027)
33% to 104A2
1 MARGCASRCS RPVSDASPAP RAGTLSGARL ECRLCRFMAL DERCLCVCRC PLLWVMRTRY
61 RARTPCPNRW GQTAAPERAS LAGHSRETAL STAPRIDIDP AAFWADPYPM LANMRKEAPI
121 AFVPQLGSTL LTSRDDISIS EKQIDVFSSH QPAGLMNRLM GHNMMRKDGE AHQVERRAMF
181 PTVSPKTVKG YWTALFQAHA DRIIDAIEPG RIDFMRDFAL PFSGECLKSI TGLTNIGFAE
241 MDAWSQGMIE GIANYVGDPA VEARCHAATS GIDAAIDDML PVMRKNPDQS ILGVLLASGM
301 PMESVRANVK LAISGGQNEP RKAIAGTVWA LMTHPEQLDL VRRGEVTWLQ AFEEYARWIS
361 PIGMSPRRIA KPWSIRDVAF ELDERVFLMF GSANRDEKHF ERADQFDVRR DTSKSVAFGA
421 GPHFCAGAWA SRAMIADVAL PTLFARAGRI EIADDEPVRI GGWAFRGLQN LPARWLH
CYP194A1 Bradyrhizobium japonicum USDA 110
GenPept BAC48170
NC_004463 complete genome complement(3198120..3199340)
36% to 176A1
1 MSDVSEPVAH PPVTDWVNDF DHTDPQWTDD PFPIWDELRA ASPVVHTERF LGCYMPTTYE
61 AVREIANDTE HFSSRRIIVR DVRPEIARNA APPITSDPPV HKPAKQLLLP PFTPDAMKKL
121 EPRVRTICNE LIDGFISDGK VDAAARYSKY IPVRAIAHML GIPESDSDLF VNWIHMILEL
181 GIKDETMLLQ AVQEMSAYFR THIEERRSRP TDDLISYLMN AKDKEGQPLE ESHVLGSLRL
241 LLIAGIDTTW SAIGSSLWHL ARTPADRERL IAEPGLIPIA VEELLRAYSP VTMAREVVKE
301 TTISGCPVKA GNMVLLSFPA ANRDPKMFPD ADKVVIDRRE NRHAAFGLGI HRCVGSNLAR
361 MEMQVALEEW LKRIPDFRLD PAGTVTWSQG TVRGPRQLPF LLGKAM
CYP194A2 Rhodopseudomonas palustris
NZ_AAAF01000001
complement(3427752..3428951) gene = Rpal3007
74% TO 194A1
MSERAPVTDWVNDFDHTDPRWTENPYPIWDELRSAGPLVHTDRF
LGCYMPTTFAAVKEISYDTDHFSSRRVIVRNVRSESPPPAPPITSDPPEHKPAKRLLL
PPFTPDAVAKLEPRVRAICNELIDAFIEDEGCDAATAYTKHIPVKTICHMLGIPEDDS
DIFIRWIHEILELGINDDAILMKAVFEMSTYFQGHIAHRKQKPTDDLISTLMNARDDK
GQPLSDAHVLGSLRLLLIAGIDTTWSAIGAALWHLATHPADRERLLAEPELMPTAIEE
FLRAYSPVTMAREVMKETSIAGCPVKPGNMVLLSFPAANRDPSVFPEADRVMIDRKEN
PHVAFGLGIHRCVGSNLARMEMTVAIEEWLKRIASFRLDPSQKVRWSEGTVRGPRSLP
LLFGKPS
CYP195A1 Bradyrhizobium japonicum USDA 110
GenPept BAC48121
NC_004463 complete genome complement(3142651..3143883)
34% to 107P2 37% to BAC51802
1 MNADAKELAA SFDLEKLTPE FYDNPYPTYR ALRENEPVKR LPNGTVFLTR YDDLVTTYKN
61 TKSFSSDKKR EFAPKYGNTP LYEHHTTSLV FNDPPAHTRV RRLIMGALSP RAIAGMEADL
121 IKLVDGLLDA IAAKGSCELI EDFAASIPIE VIGNLLDVPH DERTPLRDWS LAILGALEPV
181 VSPEAAARGN KAVKDFLSYL ETLVARRRGK PGNPERDVLT RLIQGEGNGE ENGERLTEKE
241 LLHNCIFLLN AGHETTTNLI GNGLVALDRH PDQKQRLIDH PDMIKTAVEE MLRYESSNQL
301 GNRMTTERVE LGGVMLDAGT SVTLCIGAAN RDPAQFPDPE SFDIARTPNR HLAFATGAHQ
361 CAGMALARLE VAIAISRFLA RFPNYAVNGR PVRGGRVRFR GFLSVPCAIG
CYP195A2 Rhodopseudomonas palustris
NZ_AAAF01000001
305794..307173 gene = Rpal0264
74% TO 195A1
MSVEEAALAAGPGPPTGRMTCVAPVPAARPDPQISRPFRQRRKS
PIDVKNGNDMETAPAELAEAFDLARLTPDFYDNPYPTYHALRAHQPVKRLASGGYFLT
RYDDLVAVYKNTTLFSSDKKREFTPKYGDSLLFEHHTTSLVFNDPPSHTRVRRLIMGA
LTPRAIAGMEPDLIALVDRLLDAMAAKGRVDLIEDFASAIPIEVIGNLLGVPHDERGP
LRGWSLAILGALEPVIGPEAFALGNAAVAEFLGYLDTLIARRTAEPGDPERDVLTRLI
RGEAGGEKLTAKELLHNCIFLLNAGHETTTNLIGNGLVTLAANPDQKRRLIAEPALIK
TAVEEILRYESSNQLGNRITTAEVEIGGVSMPANTSLTLCIGAANRDPAQFADPDRFD
VSRSPNRHLAFASGPHQCAGMALARLEGAIALSRFLAHFPDYVLDGPPQRGGRVRFRG
YLGVPCRLG
CYP195A3 Burkholderia fungorum
GenEMBL NZ_AAAJ02000119
3110..4828 gene = Bcep1797
64% to 195A2
MGAAIDVEVGALGDACIGNPDRVGVDDAGVRTARQRNERFVMVG
GKGRGALNDEVHGRRRSDTVEGGRAARAANPAIVTVRRFASPRAIRAKRCCARHPAWR
RAILRRAVHTDESNDASQPPRTPTVACTPSRRAKSGTRTQRHPRRGAPLRRHRMTPAT
ASDASTLARDFDLRHLNPAFHADPYPVYHALRAHEPVKRMPDGSLFLTRFRDVQAVYR
DPKTFSSDKTVEFKPKYGDSPLYAHHTTSLVFNDPPRHTRVRKLIAGALTARAIAAME
PGLVRLVDGLLDAAAARGRIDLIDEFASAIPVEVIGNLLDVPHTERAPLRDWSLAILG
ALEPSLSEAQLERGNRAVSEFIDYLRDLVARRRREPGDPQHDVLTRLIQGEAGGEQLS
EAELLQNCIFILNAGHETTTNLIGNGLVTLTQWPEQRAALLHEPSLIESAVEECLRFE
SSNQLGNRMATVDTEIGGVAVARGTPVTLCIGAANRDPEQFADPDRFDIRRDPNRHLA
FGFGIHQCAGLSLARLEARIAIGRFVQRFPAYRVNGEPTRGGRVRFRGFAAVPVELEP
AGRRTA
CYP195A4 Stigmatella aurantiaca
GenEMBL AJ421825 complement(4978..6231)
66% to 195A2
note = ORF8
MLRSVSASRPPSPANPVAAFDLARLDDAFYADPFPLYRAMRERD
PVHRMPDGSLFLTRWADLDCIYRDTRTFSSDKRAEFGAKYGDAPLFEHHTTSLVFNDP
PLHTRVRRLIVGALTPRALSTMEPGLRTLVDRLLDGLAVKGAADLIEDFAAAIPIEVI
GNLLDIPTEERGPLRGWSLAILGALEPRLTAEQEACGNEAVTEFLDYLRILVAQRRAR
PGDPATDVLTRLIQGESDGERLTETELLHQCVFLLNAGHETTTNLIGNALELLARFPD
ERARLLRAPALIPTAVEEVLRYESSNQLGNRRVAEDTEIGGVAVPTGTFLTLCIGAAN
RDPARFEDPEHFDVGRQPNRHLAFAGGAHTCAGMNLARIEARIALAAFLARFPDYALT
APPVRARRARFRGFTAMPTRLGSLR
CYP196A1 Bradyrhizobium japonicum USDA 110
GenPept BAC46159
NC_004463 complete genome complement(972593..974053)
36% to CYP136A1
1 MLPRSAGAGR NKVAVARCPF GQPTGALVMS MQNVAAPALQ FTAPRRNELT HIPGDEGWPV
61 IGKTFQVLAD PKGHIEANGA KYGPVYRTHV FGETNVVLLG PEANELVMFD QQKLFSSTHG
121 WNKVLGLLFP RGLMLLDFDE HRLHRKALSV AFKSGPMKSY LSDLDRGISA RVAQWKAKPG
181 EMQLYPAMKQ LTLDLAAASF LGADIGPEVD EINRAFVDMV AAAVAPIRRP LPGTQMARGV
241 AGRKRIVAYF RQQIPLRRGN HGGDDLFSQL CRATHEDGAL LSEQDIIDHM SFLMMAAHDT
301 LTSSLTSFIG ELAANPDWQD RLRAEVLALG LAPGAPSSFD DLEKMPLSEM AFKEALRIKP
361 PVPSMPRRAM RDFTFKGFRI PAGTAVGVNP LYTHHMKDIW PEPDRFDPLR FTEEAQRNRH
421 RFAWVPFGGG AHMCLGLHFA YMQAKCFARH FLQNIEVSLA PGYKPDWQMW PIPKPRDGLK
481 VRVKAV
CYP196A2 Rhodopseudomonas palustris
NZ_AAAF01000001
complement(2360490..2361869) gene = Rpal2059
70% TO 196A1
MSIQVADSSLVARLSPPKPSALAHVPGDEGWPIIGRTLAVLADP
KGEVEKMARTYGPVYRSRVLGETSITLLGPEANELVLFDNTKLFSSTHGWGPILGRLF
PRGLMMLDFDEHRLHRRTLSVAFKAGPMQSYLAELNAGIAHRVAEWRARPGEMLCYPA
MKQLTLDLAATSFLGTAIGAETEEVNRAFIDMVAASVAPIRKPWPGTAMARGVKGRQR
IVAYFAEQIPIRRAKGGDDLFSQLCRATHDDGALLSNQAIIDHMSFLMMAAHDTLTSS
LTSFVAALAAHPEWQQKLREEIAGLGLKPGEPISFEQLDALPLTEMAFKEAMRLRPPV
PSLPRRATRAFSFKGYTIPAGTMVAVNPLFTHHMPEIWPNPDQFDPLRFTDEASRGRH
RFAWIPYGGGAHMCLGLNFAYMQAKCFAVHLLQHLDLSLPPNYQASWQMWPIPKPKDG
LRVNVAPLN
CYP196A3 Novosphingobium aromaticivorans
NZ_AAAV01000177
complement(59115..60509) gene = Saro3719
57% TO 196A1
MASIAPDSRTDLHTERANPHWVRLGGDHKLDHVPGEDGWPVLGT
TLMQLADPLGFQRRMVETHGPVFRTRSFGRRGVNLIGADANELVLFDRDRLFSNEQGW
GPVLNLLFPRGLMLMDFEAHRVDRRALSIAFKPEPMRAYCSVLNTGIAQAVQGWGGQM
RFYDAIKALTLDTAASSFLGLPLGPEADRLNKAFVDMVQASGGVVRRPLPFTRMGKGV
AGRRLMVEYFGRLVRERRADPGQDMFSQFALATREDGSLLPEDVVVDHMIFLMMAAHD
TITSSATVLFWQLARNPDWQDRLRAEARAVTGGDGLPLAYEDLGRMELTEMAFKEALR
FMPPVPNMPRRALRDFEFGGYRIPAGTPVGISPAAVHADPAHWPEPDRFDPLRFTPEN
VSGRHKYAWVPFGGGAHMCLGLHFAYMQVKLLVSHILTRYEVAMQPGPAPSWQAWPIP
KPRDGLRVEMRRIC
CYP197A1 Bacillus halodurans
GenPept BAB04298 GenPept NP_241445.1
34% to CYP174A1 Halobacterium sp. NRC-1
1 MPTNTMPTGP KGNPVLGNTI EFGKDPLQFI TRCSQEYGEI VRLRFERERD TFLLNDPKHI
61 QYVFMNKGGE FSKGYQQDPI MGLVFGNGLL TSEGSFWLRQ RRLSQPAFHP KRIADYADTM
121 VGYCERMLNT WMDNDTRDIN DEMMQLTMAI ATKTLFDLDL HKGDTQEASR SLDTVMTAFN
181 EQMTNVFRHV LHLIGLGKLV PPVSRELREA VESLDKMIYS IIEERRKHPG DRGDLLSMLI
241 STYDEDDGSY MTDRQLRDEI ITLFLAGHET TANTLSWAFY LLSQHPHVEE KLYQEVSQVL
301 GNRPATLEDM PKLSYAEHVI KETLRVQPTV WLISRRAEKD VTLGDYHISA GSEIMISQWG
361 MHRNPRYFND PLTFLPERWD NNDNKPSKYV YFPFGGGPRV CIGERFALME ATLIMATIVR
421 EFRMELVDEL PIKMEPSITL RPKHGVTMKL RKR
CYP197B1 Nostoc punctiforme
NZ_AAAY02000014 GenPept ZP_00110793.1
complement(44530..45885) gene = Npun5259
44% TO 197A1
MVADVFELPAPSVNSIVGHLFELGQDPLGFLTRCRDYGDIVPLQ
LGLTPSCLIINPEYIEEVLKNRNDFIKSRGLRALKSLLGEGLLSAEGESWFWQRRLAQ
PVFHQKRINGYSQTMVEYTNRMVQTWHDGETHDIHEDMMRLTLQIVMKCIFSDDIDAG
EAKVVADALDVAMQWFESKRRQNFLVWEWFPRPENIRYRDAIAQMDEAIYKLIQERRN
GGEKTNDLLTMLMEAKDEQTLQQMDDKLLRDEVATLMLAGHETTANTLSWTWMLLAQN
PGVREKLESELNQVLQGKLPTLEDLGQLVYTQQIIKESMRLYPPVPLMGREAAVDTQI
GDYEIPQGMAIMISQWVMHRHPKYFENPEAFQPERWTQEFEKQLPKGVYIPFGDGPRI
CIGKGFAQMEAALLLATIAQRFQIDLVPGYPIVPQPSITLRPENGLKVQLKQIALDTS
K
CYP198A1 Xanthomonas campestris pv. Campestris str. ATCC 33913
GenPept AAM42184
32% to 174A1
1 MRRPAFPASV MNSSGAVWQH KRRTLMPAFR AALVRESAMQ ASAATRSLLH ELGDSCATQD
61 MRTLMTGLCA QLGAGFLLGD SANAADLLRM LPMVDAISKQ TRRQSLAPTW WPSSGRRRLR
121 RLRADIDMAL DRILMQSTQR PPRAASVLAL LLAETARDDG DWCRDEAAAI LMSALEPMSA
181 ALTWTLLLLA QHPHIAQEVA QEASALDGAD VASGTSLLDR LPQSRACVKE SMRLYPPAWI
241 TARIAQRDAT LNGFHVPRGT QLLVSAWVVH RDGRHFPDPE IFLPARWLDD SATHSLTRYS
301 YFPFGGGPRS CIGCMLALTQ MTIVIATVLH ACSLHLAPDA RPSPFPALVL RPMDVRIALR
361 PRVIRSVVPS RAHASPVRLA SVTPND
CYP199A1 Bradyrhizobium japonicum USDA 110
GenPept BAC46313
NC_004463 complete genome 1159976..1161184
39% to CYP145
1 MSAPGSAASG VPHLDVDPFD MNFFADPYAA HELLREAGPV VYLDKWNVYG VARYAEVHAV
61 LNDPATFCSS RGVGLSDFKK ETPWRPPSLI LEADPPAHTR TRAVLSKVLS PTVMKQVRDR
121 FAAAAEERVD ALIEKRSFDA IADLAEAYPL SIFPDALGLK SEGREHLIPY ASVVFNAFGP
181 PNQLRQEAIA RSTPHQAYVA EQCQRENLAP GGFGACIHAQ VDEGAITASE APLLVRSLLS
241 AGLDTTVNGI GAAVYCLARF PEQWQRLRGD LSLARSAFEE AVRFESPVQT FFRTTTREVE
301 LSGATIGEGE KVLMFLAAAN RDPRRWDKPD SYDVTRRSSG HVGFGSGIHM CVGQLVARLE
361 GEVMLTALAR RIAKIEITGE PKRRFNNTLR GLDGLPVTIT PA
CYP199A2 Rhodopseudomonas palustris
NZ_AAAF01000001
3590168..3591556 gene = Rpal3146
78% TO 199A1
MRPSGLGAAAADRRDFGGRDAPVMGMSAANACADIRTCHGISQV
AIEEDNMTTAPSLVPVTTPSQHGAGVPHLGIDPFALDYFADPYPEQETLREAGPVVYL
DKWNVYGVARYAEVYAVLNDPLTFCSSRGVGLSDFKKEKPWRPPSLILEADPPAHTRT
RAVLSKVLSPATMKRLRDGFAAAADAKIDELLARGGNIDAIADLAEAYPLSVFPDAMG
LKQEGRENLLPYAGLVFNAFGPPNELRQSAIERSAPHQAYVAEQCQRPNLAPGGFGAC
IHAFSDTGEITPEEAPLLVRSLLSAGLDTTVNGIAAAVYCLARFPDEFARLRADPSLA
RNAFEEAVRFESPVQTFFRTTTRDVELAGATIGEGEKVLMFLGSANRDPRRWDDPDRY
DITRKTSGHVGFGSGVHMCVGQLVARLEGEVVLAALARKVAAIEIAGPLKRRFNNTLR
GLESLPIQLTPA
CYP200A1 Bradyrhizobium japonicum USDA 110
GenPept BAC49097
NC_004463 complete genome 4251077..4252333
39% to 107L2
1 MAPRLDFTSE AFFRDPPAAI AALRASGPVV ATRFPLVGDV WITTTHDATA EVLKDGTTFT
61 LRKEDGKVAG LRWWMPKLVT TIANNMLTMD EPDHTRLRSI VDEAFRRRAI VAMEPRIRAI
121 ADGLANDLFA DGSPADLVQC YARILPVSVI CELLGLPAAD RPRFIAWANK MSSLTNVVSF
181 FRLLFAFRKM RAYLERQLQI ARVRGGEGLI AELVQVELEG GQITPDEMVS MVFLLLAAGS
241 ETTTHLISGS VYELLRNPAL RDWLEEDWSR ISLAVEEFLR FVSPVQFSKP RYLRRDVELA
301 GVRLKKGDRV MVMLAAANMD PAVHDRPERL DLTRKPNRHM SFGTGIHFCL GHQLARIEAT
361 CALQALLARW PKLELAVDPA QIHWRKRPGM RAIARLPVVA GGNRRPSRGA AAEPLLAD
CYP201A1 Bradyrhizobium japonicum USDA 110
GenPept BAC45822, NP_767197.1
NC_004463 complete genome complement(596460..597827)
35% to 173A1 40% to AAK24959 CYP192A1
1 MNIASVRRPI VPPTPPRAPD DMSFLGRVAV IRQNMIATWG QRAYEEDVLE GRFFLHKSFI
61 LNRPDAIRHV LLSNYENYTR TPAGIRMLRP VLGEGLLIAE GHAWTFQRRT LAPAFTPRAT
121 ANLVPHMTAV LDETIAKLDA RSGETVDLRE TMQRMTLEIA GRTMFSFGMD RHGPTLRNFV
181 VEYGERLGRP YFLDMLLPVS WPSPMDFARA RFRKRWTEFV AMLIAERRAA GKKDGAPPRD
241 LFDLMDEARD PETGKGFSDE QLIDEVATMI LAGHETTATA LFWALYLLAL DPDTQEEVAS
301 ETRGEHLDSM ADIDRQKFTR AVIEETMRLY PPAFLIARAA RAKDNAAGIE IGRGDIIMIA
361 PWLLHRHEKL WDQPNAFVPK RFMSTEAPDR FAYLPFGAGP RVCVGAPFAQ AESVLALARL
421 IGAFRVELVD TVPVIPHGVV TTQPDRSPMF RITRR
CYP201A2 Rhodopseudomonas palustris
NZ_AAAF01000001 GenPept ZP_00010020.1
complement(1953073..1954536) gene = Rpal1705
65% TO 201A1
MGSAGAFSLPVSHTTFVEQQRGSGFEVSIAAIDDRPASRAPLIP
PTPPRAPENLSALGRLAAIRHNAIASWGDRAYQDDVVRGRFFAHSSYILNTPDAIRHV
LVDNTDNYRRTATGIRVLRPMLGEGLLLAEGRAWKHQRRTLAPAFTPRAVATLVPHMA
SATDEVVEGLRRKTGVPLDLRETMQHLALEIAGRTMFSFEMGTHGQALRGFVIDYGTR
LASPRFLDLLLPLGWPTPQDVSRALFRRRWTRFIGELIAARRAAGKAEGAPPRDLFEL
MLAARDPETGEAFSDAQLGDQVATMILAGHETTATALFWALYLLALDPDAQERLANEV
RRVGFGGTEIERLPFTRAVLDETLRLYPPAFLIVREAAGPDRVAGFAVRKHDVMLIAP
WLLHRHDKLWSDPNAFVPERFLPGVPSPDRFAYLPFGVGPRVCIGAHFALVEATLALA
KIVGTFRIELIDTEPVIPIGVVTTQPDRSPLFRLTPR
CYP201A3 Magnetospirillum magnetotacticum
GenEMBL NZ_AAAP01000877
60% to 201A2 runs off the end
675 LILAGHETTAV 643
640 PLLWACTLLALSPETQERVAAESGQADKPFTRAVIDETVRLYPPAFVLARR 488
487 AAGVDTLGGETVQPGDSVTISPWLLHRHRRLWRDPDAFDPGRFLPGASPVPRFAYLPF 314
313 GAGPRVCIGAAFALTEATLALSRIVGRFRLARADARPVLP 194
AAVVTTQPDHAPAFRLTLRT* 131
CYP202A1 Sinorhizobium meliloti
GenPept CAC45831
40% to 107P1 56% to 202A2 NP_108561
1 MSIAPGITID GPARRVSLDV RNPRFFRNPL PAYAALHAQC PAFFWEEPQQ WFFAGYEQVN
61 SLLRDRRFGR QILHVATREE LGMPEPKPHL KDFDALEAHS LLELEPPAHT RLRTLVNRAF
121 VSRQIEELRP EIEALSHAVI DGFEKDGETE LLKTYAETIP VTIIARMLGI PVEAAPRLLD
181 WSHRMVKMYV FNPSLETEFD ANNASAEFAD YLKGIIAEKR TNPADDLLTH MITSEKDGER
241 LSDAELISTT VLLLNAGHEA TVHQIGNAVR TILQSGLSPA ELFSDEKATE RTVEECLRFA
301 APLHIFQRYA LMDIELENGI ALRKGDKIGL MLGAANVDPR KFSSPDTFRP DRNEGANVSF
361 GAGLHFCIGA PLARLELQIS LPILFRRLPG MRLKNEPPVK DAFHFHGLER LDLVW
CYP202A2 Mesorhizobium loti
GenPept NP_108561
42% to 107P2 56% to 202A1
1 MTNSTLPYLA FDPATRRLRL DPHEPAFFLN PYEAYGFLHD VSNAFFWEEF GFWCFGGFDD
61 VNRLLRDRRF GRQNPAGIPD SRGVGQDRSH LRAFDGIEAN SMLELEPPVH TRLRTLVNRA
121 FVSRQVERLR PRVEALANEL IDRFDPTGPV DLLPAFASPL PITIIAEMLG VPVEMGPQLL
181 DWSHQMVAMY IHGRTRETEE TANRAASEFA DFLRGYVAER RRNPGDDLLS LLISAQEDGE
241 RLSEDEMVSS AILLLNAGHE ATVHQTGNAV RSILAQGGDP SRFFTSAEAT AATVEECLRF
301 DAPLHMFMRY AYQEIEIAPG IVVRPGQTIG LLLGMANHDP RAFAEPQAFR PDRADQKNVS
361 FGAGIHFCIG APLARLELQV SLKTLFERHP RLHLAEQSRF RDTYHFHGLE TLAVGF
CYP202A3 Agrobacterium tumefaciens strain C58
GenEMBL NC_003062 complement(1243027..1244274)
63% TO 202A1
locus_tag = AGR_C_2319
MTATFPFLKIDPATRRVSLNARDPAFYNDPNPVYAALHAQCPTF
YWEEQRQWFFTCYDHVSTLLRDRRFGRQILHVASREEIGLPEPLEHVKHFDLAEQHSL
LELEPPEHTRLRTLINRAFVSRHVDKMKPEIEELANRLIEAFEANGETELLSSYADII
PVTMIARMIGIPEEMGPQLLKWSHAYVGMYMFKRTPEDELLADKAAQEFSDYVRRVIA
ERRAEPKDDLLSHMIHTEHKGQYLTDDELVSTTIVLLNAGHEATVHQIGNSVRIILES
GLDPKTLFHDETATERTVEETLRICAPVHIFQRWVLEPVEIDGVQFKRGDKVSLILAA
ANLDPAKFSDPLAFQPDRNEGANVSFGAGIHFCIGAPLARLELNLALPLLFKRLPGLK
IAEPPKVKDVYHFHGLERLDLAW
CYP202B1 Rhodobacter sphaeroides
GenEMBL NZ_AAAE01000131
30501..31682 gene = Rsph2231
48% to 202A2
MQTLSQSPHDRRFLRNPYRFYREARAAGPFFHWEELGLVCTTSY
AAANAILRDRRFGREVPPGRASAVPDHLAPFAAVEAHSMLELEPPRHTRLRNLVLRAF
TSRRIGTMQPEVAALSESLVAAVPEGPFDLLPAFSQRLPITLIARLIGIPESLAPELL
RWSSAMVAMYQAGRTRKTEERAALAAADFSDFLRLHIEARRHAPADDLLTHLIAAEAD
GQQLSTDEIVSTCILILNAGHEAAVHAIGNAAAVLLRHRTPPEALAPPHLLGTVEELL
RFDPPLHLFRRMAYERVEIMGRTIEEGCEVALLLGAANRDPGPWERPDRFLWNRPEKT
HLAFGAGLHFCLGAPLARLELATALPILFGRLPNLQLVKPPSYGDSWHFRGLERLIVSA
CYP203A1 Rhodopseudomonas palustris
NZ_AAAF01000001
2600909..2602096 gene = Rpal2284
35% TO 113B1
MFSFDPYSPIVDADPFPLYKTLRDEYPVFWSEPAQMWILSRYLD
VAGAGSNWQVFSSAKGNLMTELPNRAGATLGTTDPPRHDRLRGLVQHAFMKRNLEALA
EPMREIARDAAEALRGRDQFDFISDFSSKFTVRVLFAALGLPMGDEQTVRDKAVLMVQ
SDPVTRAKGPEHLAAYAWMQDYASSVIAQRRAEPKNDLISHFSMAEIDGDRLDEREVL
LTTTTLIMAGIESLGGFMSMLALNLADFADARRAVVADPALLPDAVEESLRYNTSAQR
FKRCLQSDLTLHGVTMKAGDFVCLAYGSANRDERQFPNPDVYDVKRKPKGHLGFGGGV
HACLGSAIARMAIRIAFDEFHKVVPDYTRTEQQLNWMPSSTFRSPLRLDFAVEQAASR
SAA
CYP203A2 Novosphingobium aromaticivorans
GenEMBL NZ_AAAV01000114
complement(9289..10479) gene = Saro1156
62% to 203A1
MATVIERPQFRFDPYSPAIDADPFPAYKVLRDEYPCFWSEEAGK
WVLSRYDDVLAALQDWRTYSSAKGNLVDEFPGRAGSTLGSSDPPRHDRLRALIQSAVT
KRALEHIIAPARASAQAHLAALADKPVFDLVGDYTSKLTVDLLFYLFALPDEGAQQVR
ENAVLMVQTDPVTRQKSPEHLAAFHWMADYAEKLVASRKANPGDDLLSSFITAEIDGE
KLLDKEVQLTVTTLIMAGIESLSGFMAMFGLNLADYPEARSALVADPSLIPDAIEESL
RFNTSAQRFKRTLTRDVELHGQVMKAGDAVILAYGSANRDERMFENPDVYDITRKPRR
HLGFGGGVHACLGSMIGRLATQIAYEELLKAVPDFRRADAPLDWVPSSNFRSPKSLML
EKKA
CYP204A1 Novosphingobium aromaticivorans
NZ_AAAV01000167 GenPept ZP_00095854.1
16299-17720 gene = Saro2888
30% to CYP51A2 in Arabidopsis 27% to CYP51 M. tuberculosis
MARAATAAGNGLPLLDGGVPLLGHLAQFFRDPVSVLKRGYRSKG
RLFAMNFMGQRMNVMLGPEHNRFFFEETDKLLSIRESMPFFLKMFSPEFYSFAEMDEY
LRQRSIIMPRFKAASMKQYVPVMVEESLNLVERLGEEGEFDLIPTLGPVVMDIAAHSF
MGREFHEKLGHEFFELFRDFSGGMEFVLPLWLPTPKMVKSQRAKRKLHAILQSWIDKR
RAAPLDPPDFFQTMIETKYPDGRPVPDEIIRHLILLLVWAGHETTAGQVSWALADLLQ
NPDYQKVLRGEISSLLGGSDGRDLGWEQAVAMEKMDLALRETERLHPVAYMLSRKARA
DIERDGYVIRKGEFVLLAPSVSHRMEETFRNPDAYDPERFNPANPDAQIESNSLIGFG
GGVHRCAGVNFARMEMKVLVAILLQNFDMELMDEVRPIAGASTYWPAQPCRVRYRRRK
LDGSEAGADMAALARAAGCPAHT
CYP205A1 Chloroflexus aurantiacus
NZ_AAAH01000322 GenPept ZP_00019057.1
1835..3214 gene="Chlo2055"
41% to 197A1 and 197B1
MIPAIPVLIGDTAMIRFPSPVAIRNLQQLRREPLTLLEELAARG
DVVPFRVGPQMMVLVNHPDLIREVLVTQHRSFVKGRVLERAKRLLGEGLLTSEGELHL
RQRRLMQPAFHRQRIAAYGDAMVAVAEARSARWQDGLVLDVSREFMAITLQIVGITLF
SADTEADADEVFAAMHDLVAMFDLAVLPFADWLFALPLPPVRRFQAVKARLDAIIYRL
IAQRRANPVDRGDLLSMLLTAVDHEGDGYRMTDTQLRDELLTIFLAGHETTANALTWA
LYLLAQYPSLAAHLAAELDTVLGGRKPTVADLPKLTYTSWFFAEALRLYPPAWLIGRR
AIAPVTLGDVRIAPDTIVLLSPWLMHHDPRFFHEPYHCDPLRHTPEAQAQRPKFAFFP
FGGGPRTCIGEPFAWMEGILVLATLAQRWQFLPVADHPVVLQTGITLRPRYGMQLQLR
ERRTVLGAA
CYP206A1 Agrobacterium tumefaciens (strain C58, Cereon)
GenPept H97549
NC_003062 1555403..1556797
34% to CYP173A1
1 MTEIGFRTPS TDTTGAQPVS KLATARLALS LIRNPLKALP PEIFSEPAVF TRLGGVMRVH
61 LADPVLIHEA LVKNAALLGK GEDVRRALGP ALGQGLLTAD GDHWKWQRQS VAAAFRHEKL
121 LELLPVMIET ARRTQKRWRS SSTADIDIGH EMMRTTFDII VETMMSGGYG IDIARVEQSI
181 TDYLKPTGWT FALAMLGAPE WLPHPGRRKS RAAVDYLRAS LATVITGRRK NPTDRPDLVS
241 MLLEAKDPET GRMMSDEEII DNLLTFITAG HETTALGLAW TFHLLSQNPE TERKAVEEIE
301 AVTGGEPVAA EHIANLAYVR QVFSEAMRLY PPAPVITRTA LQDFRLGEHD IPAGTVLYVP
361 IYAVHRHTAL WDEPERFDPS RFEPEKVKAR HRYAYMPFGA GPRVCIGNAF AMMEAVAILA
421 VILQKNHLEN RTMASAEPLM RVTLRPQERL MMKITQRQNK SPAV
CYP207A1 Kitasatospora griseola
GenPept BAB39208
GenEMBL AB048795 6250..7698
30% to 183A1
1 MKGNLPAMST ATSSSVTGSV TTPRTATTPG RRPGLAPGGV PVLGHLPMIL RSRFEFIETV
61 RNAGPVTRVK LGPKTAYFVN DYELLAQILV SDADKFVRGI HFKKMRNMVG NGVVTTSGDL
121 HRRQRRIMLP SFSQRRLAMH LPVMRKIMSE FVASVPERRP YDLMGPVMGV GCDIVTSTML
181 GEKTPPEVLR LVREAVPVFV ENAAIQAVDV TGIYKHLPTK SNRDFERLLN AFNEYMYSVI
241 DDKFRNGAGE EAGLLDMLIN ATDPETGEKF DRTEVRDQAA TILLASTETT ANTISWACYE
301 LARHPRIFAE CRAEIDALVK DRDWLDIEIG RHDLPALKRV LFEALRMYPS SYLLSRQASV
361 DTTLGGYAIP KDAAILYSHY GQQRDERNFP HGDEFDPDRW LDKDGAEVTA SAFMPFGFGA
421 YRCLGESVAV LEATYCLAMM VHQWDFALSD YSEPKMNATI TLSPKDLEFL FTKRTESGAH
481 DE
CYP208A1 Streptomyces globisporus
GenPept AAL06697
38% to 184A1
1 MRIDPPGPPL RALPGLLRKL AVDRLGMMRD AAGLGDAVRV SMGPKKLYIF NRPDYAKHVL
61 ADNSDNYHKG IGLVQSRRVL GDGLLTSDGE TWREQRRIVQ PAFKPGRINQ QAAAVAEEAA
121 KLVALLRGHE GGGPVDVLQE VTGLTLGVLG RTLLDSNLTA HESLAHSFEE VQDQAMLEMV
181 SQGTVPAWLP LPPQARFRRA RRELYRVADL LVADRRSRMA DGGPGDDALS RIIVAADRRR
241 DDPARARNRL REELVTLLLA GHETTASTLG WTLHLLERHP EVRDRVRAEA RAALGDGVPG
301 PEDLHRLTYT TMVVQEAMRL FPPVWILPRV AQQRDVVGGY TVSAGSDVLV CPYIMHRHPG
361 LWEDPERFDP ERFEPRQTAD RPRYAYIPFG AGPRFCVGSN LGMMEAVFVT ALVTRDLDLR
421 TVAGHRAVAE PMLSLRMRGG LPMTVSTAR
CYP208A2 Streptomyces carzinostaticus subsp. neocarzinostaticus
GenEMBL AY117439
complement(50199..51551)
70% to 208A1
MNGRRPVSPPLRALPGLLRKLAVDRLGMMRDAAALDDAVLVSMG
PKKLFVFNRPDYAKHVLADNAANYRKGIGLIESRKMLGDGLLTSEGELWREQRRTVQP
AFRPARVAAQADAVAEETMNLRDLLMRRGADGPVDVLQEVTGFTLGVLGRTVLNTDLG
GYGGIAHAFEAVQDQAMFDMVTQNMVPTWAPLATQRRFRRARRELIRTVDELVADRSA
RMTDGEEADDAFSLMIAAARRQTDPRTGQGRLRDELVTLLLAGHETTASTLAWTLLLL
ARHPHMRDLVREEARGVLADGRAPDAGDLRKLTYTTQVVQEAMRLYPPVWILPRVARQ
SDEVGPYSVSAGADVLICPYTLHRHPDLWERPEQFDPGRFDPARVADRPRYAYIPFGA
GPRFCVGSNLGMMEAVFVTALLTRDLVLEVVPGDERTPEPMMSLRMRGGLPMTVRPVR
CYP208A3 Actinomadura madurae
GenEMBL AY271660 10415..>11426
maduropeptin biosynthesis gene cluster
partial sequence RUNS OFF END
69% TO 208A1
gene = madE7
MSIDELDARGGTPRAAGRVPPGPPRRATPNLLRMLATDRLGMMQ
AALRHGDAVRVGLGPKALYLFNRPEHAKHVLADNSGNYHKGIGLVQARRALGDGLLTS
EGDLWREQRRVVQPAFQHKRIAGLADAVVEEAGALVARLRARAGGPPVDVVGEMTALT
LGVLGRTLLDADLTAHTSLGRAFETVQDQAMFEMVSQGMVPMWLPLPGQLRFRRARRE
LDRIVRALVAERLREGGGAEDALSRLIESARREPDGRVGRRRLRDELVTLLLAGHETT
ASTLGWTFHLLDRHPLVRARVRAEARAVFGDGTPTLDDLSALSYTTMVVQEVMRMYPPVWI
CYP209A1 Myxococcus xanthus
GenPept CAB40542
38% to 110D2
1 MGTSEPVEPD HALSKPPPVA PVGAQALPRG PAMPGIAQLM MLFLRPTEFL DRCAARYGDT
61 FTLKIPGTPP FIQTSDPALI EVIFKGDPDL FLGGKANNGL KPVVGENSLL VLDGKRHRRD
121 RKLIMPTFLG ERMHAYGSVI RDIVNAALDR WPVGKPFAVH EETQQIMLEV ILRVIFGLED
181 ARTIAQFRHH VHQVLKLALF LFPNGEGKPA AEGFARAVGK AFPSLDVFAS LKAIDDIIYQ
241 EIQDRRSQDI SGRQDVLSLM MQSHYDDGSV MTPQELRDEL MTLLMAGHET SATIAAWCVY
301 HLCRHPDAMG KLREEIAAHT VDGVLPLAKI NELKFLDAVV KETMRITPVF SLVARVLKEP
361 QTIGGTTYPA NVVLSPNIYG THHRADLWGD PKVFRPERFL EERVNPFHYF PFGGGIRKCI
421 GTSFAYYEMK IFVSETVRRM RFDTRPGYHA KVVRRSNTLA PSQGVPIIVE SRLPS
CYP210A1 Polyangium cellulosum = Sorangium cellulosum
GenPept CAD43453
spiL biosynthesis of spirangiene
40% to 110E2
1 MISISKSKQK LLPPGPRSPM ALQTLQWLKN PVPFLEACGA RYGEMFTLKL PTQWPVVVVQ
61 HPEAVKEVFA LDSNAGHAGE ANNILKPFLG KYSLLVLDGE EHMRQRKMMM PAFHGERMEA
121 YGHAMIDAAH ASIDAWPVGS PFGVHAPMQA ITLQVILRTV VGMTDGPLLA ELEALYPQVI
181 DAASAPAMHF ELFRKDLGPW SPWGKFKRRS ARGKEIMIHE IRRAREKGTA GRTDVLAMII
241 DAKDENGELL TEDEIHGELM TLLVAGHETT ATALCWALRW LLRDAALTRR VAEEAAEVAD
301 DPVKIAKSEL LDRVVKEALR LQPIGPVVAR VLKQPLTIQG RELPADVMVA PCVQLLHHRP
361 SLYPEPTRFD PDRYATFTPK PWEFIPFGGG LRKCIGAAFS MYEMKMVLAT AFSRLSMELA
421 TDDIKIIRRG VTLAPSGGLP LVIRKKSPRA TKPIAA
CYP211A1 Streptomyces globisporus
GenEMBL AY048670
47223..48479 ORF29
39% to 107L1
MAGLVMSPVEALDALGTVQGRQDPYPFYEAIRAHGQAVPTKPGR
FVVVGHDACDRALREPALRVQDARSYDVVFPSWRSHSSVRGFTSSMLYSNPPDHGRLR
QVVSFAFTPPKVRRMHGVIEDMTDRLLDRMARLGSGGSPVDLIAEFAARLPVAVISEM
IGFPAKDQVWFRDMASRVAVATDGFTDPGALTGADAAMDEMSAYFDDLLDRRRRTPAD
DLVTLLAEAHDGSPGRLDHDELMGTMMVLLTAGFETTSFLIGHGAMIALEQRAHAARL
RAEPDFADGYVEEILRFEPPVHVTSRWAAEDLDLLGLSVPAGSKLVLILAAANRDPGR
YPEPGRFDPDRYAPRPGGPEATRPLSFGAGGHFCLGAPLARLEARIALPRLLRRFPDL
AVSEPPVYRDRWVVRGLETFPVTLGS
CYP212A1 Chromobacterium violaceum ATCC 12472
GenEMBL AE016919 complement(139513..140919)
NC_005085 complement(2867614..2869020)
locus_tag = CV2656
57% to 2A12A2 33% to 205A1
MKTPPQSSCPFHAVGRPPTPPRSSAGRWPPGPESGLTGWGLLKL
MSRDLMGTLAGWQREFGDLVHVRTWPEHQVIVSDPQLARELLVNQADALQRWERALTV
YRRVHGHSVLIAEGQVWREKRQALQPDFTRKSVQAFSPSIVEAARRAFEQWPARHAAW
PIESELTSVTMEVILRMMFSSGVGSEAQQAEEAVHTLMVASTEELWRPASLPDWVPWQ
RKRRRARLLMNGLIERHLQARLAMPQDAWPEDLLSRLLRLHLQQPQSWPLQAVRDECK
TAFLAGHETVATSLTWWAWCMASHPEIQERAREEALAALSGGGQADPAALQYVSQTLL
ETMRLYPAVPLLMSRRALKPVTLGDWTFPAKTVFMVPMQLMQHDERWFPEPRSYRPER
FGPDAARPQQGAYLPFGGGPRVCLGQHLAMAEMALVAAQLLLRYRLSAPEGAEPPRPV
FHVSQRPSQPLTLGIARI
CYP212A2 Ralstonia metallidurans
GenEMBL NZ_AAAI01000313
14554..16035 gene = Reut2995
57% to 2A12A1 32% to 197B1
MTVDVVSIQDSPRGSVWNLHMTTESQAQCPFQTAKSPPPAGSPL
PHPYGTWPPGPAAGLTGWHLLRRMSRDLLGTLGEWQQTYGDVVHLRMWPEHAVVVTDP
QLVRELLVTHHDSLIRWERGTRVFSRVHGHSVLTAEGDAWSRKRQALQPGFMPKAVHG
FVPGIVEIVDKGLATWPTRVADWPVESALTSLTMDVIVRMMFSDEIGEDARVAECAVR
AISEAANADFYWPASLPDWVPWKRARRRALHTLRDLIERHLQARLRMRTDTWPDDLLS
RLLCLHRDDATAWPLQAVRDECMTTFLAGHETTAATLTWWAWCMASNPSAQDAARAEV
THVLRGQAPTADSRQALRQVVQTITESMRLYPVAPVLISRRAVRPITLGPWRLPARTL
FMLPLQLMHHDPRLFPEPERFQPDRFSTGSPQAPRGAYMPFGTGPRVCLGQHLATAEM
TVIAAMLLQRYKLSVPEGAAHPRPLLNVTLRPDQPLWLAVTPI
CYP213A1 Synechococcus sp. WH 8102
GenEMBL AABA02000001 or GenEMBL BX569692 335184..336428
63% to 213A2 31% to 120A1
1370816 MTAAPLPSTGAVTGLGETLAFFRDPSFSQRRFSELGDVFETKLLAQSIV
FIRGERAIGDLLKQEDCLQGWWPDS
1381038 VRQLLGSKSLANRSGADHKARRRVVGQLFSSAALS
RYTPAIEALVNDLANELQQAEGPIPLAARMRRFAFSVIA
TTVLGLEAENRDALFADFEIWTRALFSIPLALP 1381358
1381359 GTPFARALAARQRLLARLKTVLQTNNNRQQGGLDLLSGGLDEAGLPLDDDDLVEQ 1381523
1381524 LLLLLFAGYETTASSLSCLFRALLLNPEVEQWLMQDLNNHERPSRLDATVL 1381676
1381677 EVMRMTPPVGGFFRQNTQSIELADVAIPQGRVIQVVLSSSSTTNQIDLETFRPQRHL 1381847
1381848 DGSFQQTLLPFGGGERVCLGKALAELEIRLMAMGLLQRVQLHL 1381976
EPDQDLNLQLIPSPTPRDGLLVRATAR* 1382160
CYP213A2 Prochlorococcus marinus str. MIT 9313
GenEMBL AAAZ02000001
63% to 213A1 35% to 120A1
1661568 MASSDDAKLRPLPNTAALSGVLEAFAFFRDPAFAQKRFERHGNVFETSLLGQPMVFIQGGQAIRDLLA 1661771
1661772 QPNAVEGWWPESVRQLLGSHSLANRNGASHRARRRVIGQLFSASALQRYSAGIISMVQ 1661945
1661946 DLADELQAAKTALPLAERMRRFAFSVIATTVLGLEGTDRDELFVDFEIWT 1662095
1662096 RALFSIPIALPGSSFAKALKARERLLRRLQKVLLKASNGNGGLDLLAGGLDE 1662251
1662252 AGIPLTDEDMVEQLLLLLFAGYETTASSLSCLMRELLLNPQVETWLREEINGVDWPPAPE 1662431
1662432 QATTAYDQVNAPKLDAVVSEVMRLTPAVGGFFRRTKCALVIDGVEVPKNRVVQVALA 1662602
1662603 ASNRHGAGDLEAFRPQRHLEDGCSATLLPFGGGERVCLGKPLAELEIRLMVVGLFHQLR 1662779
1662780 LHLIPDQDLTLQMLPSPTPRDGLLTKVL* 1662866
CYP214A1 Trichodesmium erythraeum IMS101
GenEMBL NZ_AABK02000024
74165..75511 gene = Tery2345
36% to 120A1
MNMNFKSSNPASSLALPPGDLGLPFIGQNKKIFKNPQNFIEEVY
QKYGPVYKTNFLGKNFIYFQGYEAIKFILTNENKYFTYSQILRNYQRIFGENDITVLA
GKEHRERQKILAKTIKSKNLNNYIDIIHDLSQSYFLKWIKSDYVDLYSEINNYTLDMI
LKLLLGIDYASKSEISNYLKDMSSGLNTIPVVFPWTKFGSALESKNKLFNQFEQIIVR
RKKENNFGSDILGILLTVQEQMNYELTPREIVGQMVNLLSLGKKELSSALSSFFILTS
EHLDVLKLLQIEQEKMDVSEPLSLDKYKKMVYLEQVIKEVLRLVPPVSGGLRKIIEDC
SFQGFRIPKGWHAYYYISSVLKDPEIYKQPEIFNPERFNPTNAEDKKKPLCYIPFGGG
ARECIGKEFAYLVIKIFISALLDNCSWKFKENQDLTINTFPVARPAHKIEVCFTPK
CYP215A1 Thermobifida fusca
GenEMBL AAAQ01000026
complement(53223-54434)
35% to 127A3
MPAPIRIQQCPVSHTDFSTDTEIYG
HYAMLDAEREASRFRFNDTTDRGFLMIQRYDDVVEGFQHHETWTTKVRSAINPDSRSGTA
LLPQDLNGEAHAKLRRVLNPFFSPAAVRRMEPMAVARCIELIEELQPKGSCDFVAEFAIR
YPTDLFLALLGLPVSDGEFFLPWSETVFAGFFGEDPAKVAEARKNIIDYFTETIRERRAN
PRDPKEDMVTRLLEARIDGTPIPEEDILTICLTLMLAGLDTTRSALGYIYAHLARNDADR
QAIIDNPDLVPRAVEEFLRMYPLVFQVGREVQEPTEFHGLDLQPGDVVWLGIAQANRDPR
KFPEPDRFNPDRKGVNQHIAFGAGPHRCLGMHLARLELAIVLREWHQRIPHYRIREGVTL
TERGGQLTLPTLPLEWEA*
CYP216A1 Thermobifida fusca
GenEMBL NZ_AAAQ01000039a
98411..99769 gene = Tfus2242
37% to NZ_AAAQ01000042 Thermobifida fusca
39% to 157B1
MTRSPSPPIRLYQAADPTALWPALLAEYGEIAPVELEPGINGWL
LLSWRLNREVMRDAETYARDPDRWADMANGRIGPRSPLRVLYGKYQSALYSDGADHKR
YRAALTAALDSLSLRQVTGHITAAADQLIDAFCLHGKADLVTQYTRQLTVTVLARLFG
LGPAETMRICLEMQAMWDEGPNAFPALMRIAEMLTELARKRRRTPGKDLASVLVAQGL
SDVEVRDNLLLIIAAADDPVTHLTGNTIRMLLTTPESRAALTTSPGLISEAVNTALWT
TPPLQTLVGRYPVRDVELAGVPIRAGECLVLGFAAASLDLAQHSDADTMSTNRAHLTW
GVGPHSCPLRGQDMALAIAETAVERLLRRLPDVALAVPPEQLRWRPSLAVRGLESLPV
RFTPTPVSNPGGTEWDSPADPTPSTSIRPKESTSPRSDANSSTKARWYAWLFPGTWKS
GL
CYP217A1 Thermobifida fusca
GenEMBL NZ_AAAQ01000029b
complement(46399..47577) gene = Tfus0917
39% to 145A1
MTTLAPSTDLDFYSEEVILNPYDTYRRLRDTAPAVWLERYQVWA
VTRYDDVYAALHDHRTFASGYGVALNDPLNEKMKGSALVSDPPYHDHVRGVMGGPLTP
RALRKHHDYFQERADALIDHLLELGRFDGVRDFAQIFPLSVVPDLLGWPAEGREHFLD
WASAGFQALGPMNERALRDLPTLKGMWEYMAEIVRPGRLAPGSWGAALVEAAEKGEVD
KQLLPALIGDYLVPSLDTTVSALSTMLWQLGENPDQWRMVREDPSLIPNVLNESIRYE
APIRALSRYVTEDTEIGGIPLARGSRALMVYASANRDERHRKDPDTFDVTRSNANTHV
GFGHGIHGCVGQGLARLEGHSLLNALVRKVARIEVGQPTWRVHNTIRAMTTLPVEFSA
CYP218A1 Thermobifida fusca
GenEMBL NZ_AAAQ01000042b
complement(348226..349737) gene = Tfus3015
35% to 156B1
MILNCEHDPVFHVPVVSTPGRWSVTHPLYPVGNSDTPHDRNAGS
SEGCPVSLDDREALHRLAVPVYGELDADLPDVLERLRKQFGRIAPVEVAPGVAAWILL
GYDVNRRVLQDSAEFARDPRRWREAREGRATPEKVPGPFWYFRNALASDDPDHSRYRP
VIVDALSGITADGTRLAVRRIATELLAPVALTGKADLVADFAFRVPLLVLNRYFGLSA
EEGLELVELMRQVWDGGEEAEKARLGLFAYAQSVTARRRENPGSDIVSRMVRHPNALD
DEEIAHQLILLISAAHDPLMNLIANTAHTLLTDEEVRYDLAGAHLRVEEVVDTVLWRS
PPITLLPGRYPVRDTLLEGAYVQEGDCLIIGYGPAHADPAVAPYIDPLSPSGIRGHLA
WGTGPHACPAQRISRQIAVDAVSALLDLLPDVRLAVPPESLERRRSLFAHGLKALPVT
FTPVDITPPTEAPWQQSQNPSSSSQENPASKPSSSVKRGRLPEWWSKVWKFGR
CYP219A1 Novosphingobium aromaticivorans
GenEMBL NZ_AAAV01000054
complement(4309..5553) gene = Saro0307
41% to 107AA1
MEAEAAIPPLDTSDPALIPDPWPTFTTLRERDPFHWSKYGYWVV
SRHEHVRDVLMNRKDFGTGDFAANLRLFYGPDFDVLANPAYRWLSEVFIMQDPPQHTR
IRNLVVGSLTAKRVRAMEPRIREIAQALTDGFKARGSADLITEFAYKFPVMVICDLMG
IDYEASEMADLIAAIPEAFTVFEARILSPEELALANRRILELEAFFQAQFENRLAHPR
GDLLTSLARTGQEPGGLSVHEAITVTIGLFGAGFETTANIIGNGLHALHANPEQWARL
VADPSGMASGACEEALRHQSSLIATYRTALADTSVCGHPVSAGQRVLTLIGAANRDPR
KFADPDRFDIARNDADHLTFGGGIHFCVGAELARIEARVAFEHLARELPQMQVDTGGA
CWRENFLFRGLTGLEARWPAQA
CYP220A1 Burkholderia fungorum
GenEMBL NZ_AAAJ02000117
18981..20198 gene = Bcep1617
35% to 107B1
MTFIYDPNDPDVRRDPHAVFRKLRETEPVHWSPKLSGWVVTSYE
LASEVLTTNGTYSAERFTAVQQHLSEEKRVTAAEVMRWFQHWMVFRDPPDHTRLRRHM
ANTLNIPVFDARRETVISVVNELLDRIPVGDAFDFFQAFSLWMPGIVVADLLGVERDR
LLEVKQWSDDMMTFIGSARGVPDKYERARRGANGMGTYFLDLIAKRRAEPREDALSRL
IASEVEGQRLSDDELVGCMMMVLNGGHETTANLLNNSMLALAAHPQTVQHLRQHPDEM
AAAVEEFLRYDSPVLSIGRIVTEDTELGDQEIAAGDRVFAMLVGANRDPEVFSDPDEL
RTSRNPNPHMAFGKGPHFCLGTPLARLEGQIALTAILERFSSIELCEPVESIPWLNSL
VTHGPTRLPLRLK
CYP221A1 Pseudomonas fluorescens PfO-1
GenEMBL NZ_AAAT02000002
complement(179015..181831) gene = Pflu5298
36% to 107B1
N-term is 38% to acyl CoA oxidase of Streptomyces coelicolor
MSDPLLKLLQKPLFDPEKRHRVSLREYMDLNIDRMRAIIGNGLM
TNAMWLSQPRQSEFRLMLERAALIGAVDYSLLACIVDHFIAGDAFFAHGSQHQIAQYH
QEICQLKAVYAFGCTEIASGSDVANLQTTINYDPHKHCLILNSPTPQSCKFWIGNALH
AAVVVMVLGRLIVKGVDEGLHWFRVRIREQENGPLLPGVRITTCDPKGGIHANQVAGI
RFCNMKLPLDALMQRYARFSAQGVFSSEIPPKERLKSAMQTFIQERLFLIAGARGAAS
MCVYLAYRFACHRLVKGNEGSQSLLTKALFRQRLYAEQLKVLALKLLEQAVLSRFEAC
WHQPARRKELHILAAVVKSVGTWLGLEVMSACRELCGSQGFHHHNRIVTLVMDHGIST
TFAGDNNILCCQVARDAINRPRFANENIAQRIESLIVDQCRRAGDFSHRQAVALTYAR
ALDLIINEGKHHPLVTSEIFEDIVHVFSPKLYEWELVASTKLEQNATEAQLILLNELL
KPPSELVRAPIDKKNYVKHF TKPLYDNKPDFSNRNTIRNPYRAYTWLRKHQPVYWCEH
LQAWFLTRYCDVIAAQADSRRFSSNRMQQLIDARVPENKRTHLNEFIKLASRWMYSQD
GDTHKASRHLLGNAFTPRSIEALRAIIQDITDRELSRLHGQTDLKTALFDRVPALILA
RLYGMKDDEALRLRRWTRDIVMFLGGSQDADQGPDQALEGIKEMYACFAELIEQRRRQ
PGDDLVSRVLESGQNSAASLDEVLAQIVFILVAGYTTSADQMCLGLLHLLKHPQQLEA
LLADPTLIGSFIEEMLRFDPAGSLSHRILMEDVTIDNITMKKGNLVYLIRASANRDPE
KFHAPNTFDIRRARNEHLTFGKGEHFCMGTSLFRLEAEIVFTSLLKRFPDLQLIARRP
AKWRNSNLQFRGLKTLPVDLGTGV
CYP222A1 Thermobifida fusca
GenEMBL NZ_AAAQ01000022.1
52729..54468 gene = Tfus0480
33% to CYP101
MSLRGLPGTAPRQRAGSPGPASFLKRRRDAAPRPTISLDGDGPL
RAGLGSIPGRRLQFGAHLFVEHVGVPVFFARRKHVGRGHVAHTVPLALLSVDSDAHVA
LLRDQRSLFSLAILTLMLLSITIITRTAIKITHHTRSRHRPYPHSSQRTSAHEPGQER
AHSMSNPTRCPVIHFDHHSPEHAKDPVGAYRALRQTHKRGWTEAHGGYWVLSDYQSVF
DAARDDDLFSSTREAAGTDGLAIVIPPTPMYHHIPIEVDPPEFRKYRKIVNQVTAPAA
VERMKEMVERYTVAFVDSVIEKGECDFTTIIGVPAIVTIDWLGLPVKDWKRYADAHRA
TLAEPQDSEAFRHAVEVELPALSQQMWDTIRARRQEPKDDVISFLVSQKVDDRPITDE
EVFAMVDLLVSGGTGTTASLVSQALVWLAKNPDVRQELIDDLSLLDRAVEEFLRVFSP
TQALARTVTRDVEFHGCSMKKGDRVLLSWASANRDEEQFENPDTIDIHRWPNRHVAFG
IGVHRCAGSHLGRFMAKRLLQEILTRMPDYTIDFDALVPFHDQGTNVGFRSIPAKFTP
GKRVLPLSEAVIG
CYP223A1 Novosphingobium aromaticivorans
NZ_AAAV01000162.1
complement(33798..34988) gene = Saro2671
31% to 194A1
MHRAMTTTVQDFDPEVPEDFDSPHAEYARLRRECPVAHTNGLGG
FWALTRYEDVKRAASDSTTFITSVQNVVPKVAFTGRRPPLHLDPPEHTPYRKALNPLL
SLERSEAFAGKARELTRKLLAPMVENGGGDICVELSSYLPVHVFGEWMRMPEEWLDTL
HDAGRAFILAVHSNTPERMKETSLRLYDMARGLIAVRRENPQDPALDPTSALLAARHE
GEPLPEELLVGTVRQVLVVGMVAPMVMIGNICVHLSRDKALQQQLRADPSLVPAAIEE
FLRLYTPYRGFARTAVCDVDMGGRTIPKDEAIALVYASANRDEDVFPDGDKFILNRPN
IAQHLAFGRGPHNCPGVHLGRMQLRVALEEILAATREFELSGPVSVSRWPEVGALSVP
LRFV
CYP224A1 Novosphingobium aromaticivorans
NZ_AAAV01000154
complement(32211..33443) gene = Saro2336
36% to 127A2
MTLLFQPSPPDHVPGERMVDFDMFHVPEGQDDPVEIWHDLVRRG
VPRIFYTPRNGGHWVFLDYADIVEAYRDHTVFSTYQTPVPPIEPFPVVQPQGVDPPAH
NVFRRLLAPMFTPTAVRGMIGELERRASELIDRFAARGECDFITEFAERFPTSTFLHL
FGLPEEQLDAFLALANVFFRSTDAETRARNIGEIYAVLDTLFREKERNPGNDIASAIV
AARDEEGRQHPWEDILNCGFLLFVAGLDTVTNTMAYIWRYLATTPAARRHFRERLDDP
DAFLRAIEELMRINAVSNLFRRVTHDCEYKGVQLRRNDRVVLPNTVANRDPRVFSDPQ
AIDLDREVNVHLTFGVGPHRCIGSVLAKREVMVSLQQWLRRIPEFELAPEQPAGSAFG
GSVMGFTALRLRWVRVEA
CYP225A1 Novosphingobium aromaticivorans
NZ_AAAV01000151
17550..18872 gene = Saro2207
34% to CYP124
MPSPPRLAGTSTASGLEDQMQFPFSRSTNPNVDLSSLDAFNEGA
PFATFDRMRREDPMAWSEMVNGDRGFWSVTRHADLLELNRQADLLSSAKGIRMEDQTE
EEYEARKTFQETDAPHHRGFRALVSKAFSKGTVAGFEDQIRKIVTDLLDVALAEGEFD
AVDRIARRLPMQMLAQIMGVPQEDGPWLVEKGDALISNSDPDYTDFVVDQVDTEAYRM
LPFRSPAAVELFDYANGLLDRMDAGEQIGVLNLVREPTSTGTRMSRDEFRNFFCLLVA
AGNDTTRYSISATIHALANNPHLLQALKDGDFTSWEAAADEMIRYASPTTHFRRTATR
DFTFHDRHVKAGDKVLLWFISGNRDETAILDPYTINLRRERNPFLSFGQGGPHICLGM
WLAKLEVAIVMQELAKRLSSIEQVAEHSYLRSNFIHGIKHLPVRIVAR
CYP226A1 Burkholderia fungorum
NZ_AAAJ02000018
25472..26761 gene = Bcep5906
61% to 226A2 30% to 109B1
METGMTSPAKSNALESAFESTASNYRGSDVDLNAIYRDMRRNSP
VIAQDFMASLGVPNIAKLDPNRPTFTLFKYKDVMSVLRDAANFTSGFIAEGLGSFFDG
LILTGLDGEAHRRARALLQPVFLPEVVNRWRESKMEPIVRNEFIGPMVPQRRADLMHF
GLHFPIRLIYSLIGFADDRPEQVEQYAAWALAILAGPQVDAQKAAIARKAAMEAAEAL
YAAIRSEVAVVRAKGAEGEDLISRLIRAEYEGRRLDDHDIATFVRSLLPAAGETTTRT
FGSLMTLLLERPALLERVKADRSLVSKAIDEAVRFEPVATFKVRQAAVDTGIGGVSIP
KGAMVQCIVSSANRDEEVFENSETFDIDRKPKPSFGFGFGPHMCIGQFIAKVELQVAV
NAILDLLPNLRLDPDRSPPKIVGAQLRGPDAVHVVWD
CYP226A2 Burkholderia fungorum
NZ_AAAJ02000018
complement(62819..64102) gene = Bcep5938
61% to 226A1 32% to 107N1
MSTTLENPMLDLEAAYHAVSDTYRGPDIDLQALCREMRHKNPVM
KGDFVATHLGIPTNAGASAAKCEVTLFRYQDVLAVMRDATTFTNGFIAEGLGGFFDGL
IILAMDGDAHRRARGLLQPVFMPETVNRWRPELDRVIREDFLAPLVPNRHADLMDFGL
YFPIREMYALMGFPTDDTAKFNQYATWALAMVAGNQIDPGKIRIFGPIAAAAVKHLYD
AVMEVVLQRRAAGADGNDLISRLMRAEYEGHKLDDHEVTTFVRSLLPAAGETTTRTFS
SVITLLLERPALVERVRNDRSLIPRLIDEAVRYEPVATFKVRQAARDIEIGGVKVRSG
GLVQCMVMSANRDEDVFENADTFDIDRKPKPSFGFGFGAHMCIGQFVAKIELQCAVNA
ILDLFPNVRLDPARPAPKIAGAQLRGAKSVPVIWD
CYP226A3 Pseudomonas diterpeniphila
GenEMBL AF274704 7700..8974
P450 monooxygenase (tdtD) gene
72% TO 226A1 32% to 159A1
note="abietic acid metabolism"
MSGPAHSNLEQVFANVASNYRGADVDLHAVYREMREKSPVLPEN
FMARLGVPSIAGLDPNRPTFTLFKYDDVMAVMRDATNFTSGFIAEGLGSFFDGLILTA
MDGEAHKNIRSLLQPVFMPETVNRWKETKIDRVIREEYLRPMVASKRADIMEFALYFP
IRVIYSLIGFPEDRPEEIEQYAAWALAILAGPQVDPEKAAAARGAAMEAAQALYDVVK
VVVAQRRAEGATGDDLICRLIRAEYEGRSLDDHEITTFVRSLLPAASETTTRTFGTLM
TLLLERPELLARIREDRSLVGKAIDEAVRYEPVATFKVRQAAKDVEIRGVAIPKGAMV
SCIVTSANRDEDAFENADTFDIDRRAKPSFGFGFGPHMCIGQFVAKTEINCALNAILD
LMPNIRLDPDKPAPEIIGAQLRGPHHVHVIWD
CYP227A1 Nostoc punctiforme
GenEMBL NZ_AAAY02000109 21775..23139
very poor similarity to P450s
gene = Npun0867
MTVNIMTLKDKVLKGQDRQLWLAPILAKVGYDTAIGKFLRLIFY
YTDASIILKAWRDFLYIRKDNIGDKYFAVDQAIMHSSHSQVRELMQTQPQLRGNDLGI
IRILAPSYLLDNPLSLGTNGNEHTGLRTVILQALPEPSQKIDFLGNLVEQSLLEAAKQ
GKLHIGNDLPKIILSILHQLVFQISLSEEEITASDSYIKGLALASLPNFINKYLLAIL
TAPKIRHRQYLTNRYKQSAKWASYFETGAQYQLNEHQIANTLFDMIHIAGTAGTSALL
GSVIGVLCLDNDLRNDVVSEVNAVWNGKKTLDPDALEQLTILNQVILETARLYPPVRF
VSQLTNEGGEVEIGEQKCPFQKGTRLLGSIFTANRDANRYQNPNDFDLTRNFSDILSW
NGEGHERACPGKSLSIGFIKIFCLHLFQNYQWDSITEVKWDFEKVTAVTPNNLVLQGF
AQRL
CYP228A1 Magnetospirillum magnetotacticum
GenEMBL NZ_AAAP01002937 216..>1496
Gene = Magn3794
34% to 107N1
MGPRPHARLVRRPPLPARARGHPVTGTALAPAPAGLAAAMRWEE
RVQRGAHPLVYPAIRALRHRGPVVRVPGIGVVVSDAATARAVLLDTEHFSKVGPGSPS
DLWTPVLGPSVLLNMEGADHARLRRALSGLFTPRAVRDLVAVSVPDVLAGLAPRLLAG
ERVDLVAETATMAGTVVCAMTGLPPTDSAVREAMTAAQSVVGLVRLHRRSLTPSQVRH
ARAVLARLSAPARDAYRAGDPATVPGRMRELGLSEDEALGAVGAFVLTGTETIQSFVP
RLVALTADTGWLDRLLAADPGAGPEAAALRGRVVEEALRVTAPTPAMLRSVRAATTVG
DVRVRAGDRVVIATISCCKDAGPFDPDAPVDPAVRHLWFGAGPHFCLGMPLATAQVDA
VLDALRPVAAAGRSLQVTDRAVARGVLIPAYRSLV
CYP229A1 Pseudomonas fluorescens PfO-1
GenEMBL NZ_AAAT02000063 complement(5763..6863)
Gene = Pflu1316
MDPIIAATHADPYPYYAELRAAGGLTFHHGLKLWVASSARAVCA
VLAHPDCRVRPVQEPVPKAIVDGMAGKVFGLLMRMNDGEAQRCPRSAIEPPLGLIDRE
EVGALVSARLITNDSDGLYKAMFRGPVCVVASLLGFTPAQARVISELTADFAACLSPL
SNDLQLAAAHRAAEQLRGYFIEMLADPNPFLADIRQRFVGNEEVLLANLIGLCSQTFE
ATAGLIGNALVALHRQPELRNASVDSLLAEVQRFDPSVQNTRRFMANSCEIDGVRLEA
GDVILVLLASANRDPALNENPDRFRVDRPNRRSFTFGSGRHQCPGQTLAMTIASATLT
EILARNIDPGRFTWHYRPSLNGRVPMFSEVQP
CYP230A1 Pseudomonas fluorescens NCIMB
GenEMBL AF318063 58574..59938
mupirocin biosynthetic gene cluster.
Gene = mupO
MTSWEREVSRGAGNRQLPVVKGWPLLGSALALIRNPLGFLQTTR
STYGDVYRVKAAHMNFVVLAGMEANRFVADKGKDCFVSSGFWEATLQEMQCPHSFIGV
DGDAHRFQRNLMKPLFSKSAFNERIPMLAQIFTDTLQARYGVDQKVSALFRHVLSQQI
GGSLQGYQPTPDEVEALMRYQNTAMNVCALKKWPRLALRLPGYRAAKKQVQALADRII
ESERTQEQTQGYFQTLKEKGQMVQPQWFTPGDMRNHAIISYLAGIDTVGATLSFMLLE
LFKQPHLHQALRDEVDACFSQGLPDADGLENMETLKNFIREVMRLYPTAYAVRRTRRK
DFEFQGYSIDKGQDIILFTTANHTDPAWFKNPQVFDITRYEEPRLEHRASGAWAPFGR
GPHTCIGAGLANILLSLNLALFLYHTDLRPACKLSDIKMDFSNPAAGLSERFAISFTPRNRP
CYP231A1 Ferroplasma acidarmanus Archaea; Euryarchaeota
GenEMBL NZ_AABC02000007 complement(32352..33461)
Gene = Faci1014
38% to 109A1
MEHDVFQYYRKMRKESPVHFNNDTGSWDVFDYKSVYFVLMNPDI
YSSDPSYAGNIPENRQGPGASFITMDNPDHKELRNVTTPYFLTSKITGYRDMIESTSK
RLMEGINKNSDFIRDYAVMLPVTVISELLGVPENDRSKFKEWSDYIIGNRSDAGFQDL
NRYMYSTMAEIFKTNTEDNIISTINKGLFHSEPLSINQKIGYVMLLVIGGNETTTNLI
GNMVKVLSEHPEIADKLRQEPELKKGFIEETLRYYSPIQFLPHRFAARDSVLNGQEIK
KGQRLSIWLGSANRDGAKFEDPDTFNMERQNNDHLAFGMGIHMCLGSPLARLEAEIAL
NDILNKFKHVKINAEKTSMLKNPMVYGFSTMQLDD
CYP232A1 Ferroplasma acidarmanus Archaea; Euryarchaeota
GenEMBL NZ_AABC02000015 12150..13295
42% to 109B1 40% to 119A1
gene = Faci0565
MEIPTYKEEPFEWYREMRKNSPVYREGNMIHIFKYNTISKILSD
HQNFSSQFRDLLGEEMAAMLNEKTTPSILLLDPPLHTTLRGLVGSAFTPRSIELFEPR
IREIARMLAHAIVEKENSDIVSDLSYQLPIRVISEMLGVPESDSEIFRDWSDKLATSL
GRGPDIETQYDMADYFYKKIDRNSKGNNLISRLSTVEMDGRKLSDKEIAGFAILLLVA
GNETTTNLITNAILSLYDHPEIYNEMRKTPSLIPGVVEETLRYRSPVQSTRRYSKIDT
EIEGEEILKNDILALYLGSANRDEEAFEDGESFNPYRKEKRHMAFGQGIHFCLGAPLA
RLEARIALEEFSKAVPGFEIEKPSPDDRIDSDIMYGFRKLNLKVNRS
CYP233A1 Gloeobacter violaceus
GenEMBL NC_005125 complete genome
complement(2058447..2059673)
40% to 107X1
MSALPPPRFNPFDSEFRQDPYRVYAHLRVAAPIHRSLGMWVLTR
YADVLAVLKDPHFSSSQIPLAVRQRSERPDQAQSHPLARLAAKSIVFTDEPDHTRLRH
LVVRAIKRRTPEQEQAHLTRIASALLERVGPKGRMDAVADYAERLPLQFMAESMALPP
DSWQTVRDWTHQLRYLLEPGLMGRGDFERVQAVLDEVIAFFEDMLAVRRQQPGDDLIS
ALDAAHREAQADRLSDEEIVYCCIMMFVAGHETTRSLIASGLLALLQHPEQLAYLRMH
PERMGAAVTEMLRYESPLQQTKRRATAAVAVGGRTIQPQEQVLLCLGAANRDPARFEQ
PDRFDITRTDNGHLAFGQGMHHCLGAALAQMEAQVALRVLLERFANLTLQDTPEWLEH
SFILRGLKTLPVQWDR
CYP234A1 Photorhabdus luminescens subsp. laumondii TTO1
GenEMBL NC_005126 complete genome
complement(4894114..4895217)
locus_tag = plu4183
very poor match
MMNVLINEYKKKMDSVRLGDPERKGFFYDAKQAIWHCYSYDICS
YFLNSDYVTKKKLSIPLEIFSASDQSRVARFILYLNNSLIFNDDKYNTDAVSFIRGKF
NEMNFEVIANDLLSPLKQCDLLTAKHLRGVNNLLAASLVGLKASAFFSAHALNVGMFF
DGSMSGRAHFVSIAESFIAIYQQVLRQITINGGAEDVIHIEKFVADLSVTFIAAHETT
MQLIIATFLYIKSHVITVTENNIKSIVTETYRLSSPVLAVNRVFKERLIYKNSCFNKG
DRVLFYTGLANFDATVFDHPYQFQLDREGCPLSFGVGVKKCIGMNIAIHFTCQLITKI
LSCYQLDDVEIHEVTVGSLAIGCSKFTLKISKK
CYP235A1 Streptomyces antibioticus
GenEMBL AJ002638 184..1389
35% to 131A1 no heme sign.
Gene = oleP1
MEDSELGRRLQMLRGMQWVFGANGDPYARLLCGMEDDPSPFYDA
IRTLGELHRSRTGAWVTADPGLGGRILADRKARCPEGSWPVRAKTDGLEQYVLPGHQA
FLRLEREEAERLREVAAPVLGAAAVDAWRPLIDEVCAGLAKGLPDTFDLVEEYAGLVP
VEVLARIWGVPEEDRARFGRDCRALAPALDSLLCPQQLALSKDMASALEDLRLLFDGL
DATPRLAGPADGDGTAVAMLTVLLCTEPVTTAIGNTVLGLLPGQWPVPCTGRVAAGQV
AGQALHRAVSYRIATRFAREDLELAGCEVKSGDEVVVLAGAIGRNGPSAAAPPAPPGP
AAPPAPSVFGAAAFENALAEPLVRAVTGAALQALAEGPPRLTAAGPVVRRRRSPVVGG
LHRAPVAAA
CYP236A1 Microscilla sp. PRE1 plasmid pSD15
GenEMBL NC_002806 40616..41776
32% TO 109A1 C-TERM
MKKDLIPDPFEKTREAAGYGEMNDQNDPVTMILRLKDVRKCAHN
FKTFQSGARPGRIVVPSEVSIRDTRQIPFEVDPPEHTDYRALVEPWFKRPLEAEYREK
LSQQIGYIVEERLARDAVEVVEEFSLPLQSRALTLLLNIPIEEAETWIKWGTHVFRSE
DSPVDGDKAKILYDYIDQQIDRALEKPGEDLYTVLLNSEINGKKLSREEVKGVMILTF
AGGRDTVINAVTNSVAYFAVHPESLELLRKEPEITGRAVEELIRYFAPLTHMGRVVTE
DTQVCEYAVKADSRISLVWASANRDSSVFEKPNEVVLDRKINPHVSFGFSHHNCLGAT
HARQIMHILLKTLAEKVGSIEIQEHEDNIETWGEFERKVGYDRLKVQFNPLQ
CYP237A1 Pirellula sp.
NC_005027 complete genome 6102783..6104327
34% TO 184A1 C-TERM
gene = cypX
locus_tag = RB11252
MRNGTSYLQNFFPGYRRLSRLPKQCLAINPFAGDCMPSRVRLLA
PRSDRPTQPFPHRWNYEDPVRILETYFWKADEEQGPGRHNRYLDVPGFAPVLVTRDPG
MIRAIATATGDREGQFDRDTLPSVGIARATGTDTLLYANGAEWKKQRKIAACPFGKTT
LFQPEQFCEFADTFRETVRGRIDVLRQHLTASGKKTVDIQLEPEVKVVMLEMLTNNFF
GADISYEELREKYVPALERVIDHIVKDTVKNRLGIPWRKFPSVSDRIVRAKADDATFE
ELTQRILVPRGEKKALWKQFKSDAPDAKLISNLKVFLAGALEATTSYATWAISHLARH
PDAQEKVFEEVKDIDVYTPEILAGAKYLRAVLDETLRLTPSLYFLPRRATADTWVTSA
DGRKMFIPWGTHLLLDVWHANRHEDHWGVQVSGYPANEFEPDRWRILAEWGRATKDTL
HFGFGHGPRVCPGKHLGELEVGLTVGALVKTFRFQSESPENLARAGVSTKPADGTRVC
MSLRLS
CYP238A1 Pseudomonas putida KT2440
NC_002947 2211866..2213101
26% to CYP101A1 30% to 107X1
locus_tag = PP1955
MEILDRPQAPSDFNPMSEQSFRDPASICQRAREETPVFFYAPLG
VWMVTRREDAERVLSEWETFSSLANSPNVPEEFRSRFAPSVMADSIVAIDPPRHTQAR
NVIQRGFMKPKIDPLEPIIEQRAHEIIDRFAGESGTEIMNNYCLELTTRTLMALYDLP
LEDRPMFERIRDVSIKVLASVYEPMQEPEKSRVWNEYVSGYEYFYQLVEQRRNSDARD
IISTMASQKDNQGNPALSTERIALHLVEIAFAGTDTTAQMMANAILFLDSHPEALAAA
KADKTLWSRVFEETVRRRPSAPFAGRITTTEVEIQGVKIPAGSPVWVSLAAANTDPRH
VGCPMNFDINREAPQDHLAFTKGRHTCPGAPLARLQGATGLRVLFERLPELKVVPDQP
LNFAPMALLPVRLSLQVIW
CYP239A1 Pseudomonas aeruginosa strain SG17M
GenEMBL AF440524 34312..35472
Integrated gene island PAGI-3(SG).
38% to 194A1 96% 14 diffs to CYP239A2 AJ311159
gene =ORF SG16"
MKDVNEVARNFDFHGEALDDIFDTYSTLRHGCPVGRSENYGGFW
FLTKSDDIFAAEQDPEAFSVYPSMMVPSVSEGIQLPPIDIDPPEHTAYRRILLPLFTP
QELKKLEQPIRDTARKLAEEFAKEGSGADASYHYSRPLPTIIFSRLAGYPEQDWPKFD
KWVDDIIYERVEKPEVANQASKDVFSYFENLLDNWKDDSESANLIDYLCRAKINGRPL
TRDELLRYCYLLFLAGLDTTAWSIRAGLWYLANNPADQQKLRDNPDLIPLACEEFLRT
LSPVQVMARTCLKDTVIRDQEIKAGERVMLVFGAGNRDEEVFPNPDKIDIERQENRHL
AFGGGIHRCLGSNLGRRELVVGIEEFLRAVPQFKPADPSEKWHGVGPLKLAF
CYP239A2 Pseudomonas sp. KIE171
GenEMBL AJ311159 4194..5354
isopropylamine degradation gene cluster (ipuABCDEFGH genes)
38% to 194A1 96% 14 diffs to CYP239A1 AF440524
gene = ipuD
MKDVNEVARNFDFHGEALDEIFDTYSTLRNGCPVGRSENYGGFW
FLTKSDDIFAAEQDPEAFSVYPSMMVPSVSEGIQLPPIDIDPPEHTAYRRILLPLFTP
QELKKLEQPIRDTARKLAEDFAKEGTGADASYHYSRPLPTIIFSRLAGYPEKDWPKFD
KWVDDIIYERVEKPEVANQASKDVFSYFENLLDNWKDNGESANLMDYLCRAKIDGRPL
TRDELLRYCYLLFLAGLDTTAWSIRAGLWYLANNPEDQQKLRDNPELIPLACEEFLRT
LSPVQVMARTCLKDTVIRGQDIKAGERVMLVFGAGNRDEEIFPNPDKIDIERQENRHL
AFGGGIHRCLGSNLGRRELVVGIEEFLRAVPQFKPADPSEKWHGVGPLKLAF
CYP240A1v1 Bordetella bronchiseptica
NC_002927 3943708..3944865
extremely poor match may not be a P450
locus_tag = BB3721
100% to B. parapertussis 98% to B. pertussis 7 diffs
MIADTARQHRGDIMQPADPLEAVAHPDPYPYYAALARERPFYHD
DRLGLWVAAGPQAIRAVLTCPAARVRPPGEPVPAALGAGPAAQMFGRFIRMNDGAVHE
RLKPMLTAYLTQRTAADLAEPAWPAIGNDPAQVDRYLYQAPVHAQACLMGLPDEVAAS
CAREIEAFMAACRPGADAAAVARADQAAQALQARMLAHLRAARGDAALGVLRRLALAG
GVEADALAANLAGLLLQSCEAGAGLLGNALVHAGRLSPAAAAAAPDLLHTCVEIVTHV
ARHDPPLHNTRRFLAAPATLLGQSVPAGAGILVVLAAAHALAEGAWPWTFGAERHACP
GRTPALLHAAQALAHALRHGVDAPALARRVRYRPLPNARVPRFHFPPGDTP
CYP240A1v1 Bordetella parapertussis
NC_002928 3527249..3528406
locus_tag = BPP3270
100% to B. bronchiseptica
CYP240A1v2 Bordetella pertussis
NC_002929 2545572..2546720
locus_tag = BP2405
98% to CYP240A1v1 7 aa diffs
MIADTARQHRGDSMQPADPLEAVAHPDPYPYYAALARERPFYHD
DRLGLWVAAGPQAIRAVLTCPAARVRPPGEPVPAALGAGPAAQMFGRFIRMNDGAVHE
RLKPMLTAYLTQRTAADLAEPAWPAIGNDPAQVDRYLYQAPVHAQACLMGLPDEVAAS
CAREIEAFMAACRPGADAAAVARADQAAQALQARMLAHLRAARGDAALGVLRRLALAG
GVEADALAANLAGLLLQSCEAGAGLLDNALVHAGRLSPAAAPDLLHTCVEIVTHVARH
DPPLHNTRRFLAAPATLLGQSAPAGAGILVVLAAAHALAEGAWPWTFGAERHACPGRT
PALLHAAQALAHALRHGVDAPALARCVRYRPLPNARVPRFHFPPGDTP
CYP241A1 Enterococcus faecium
NZ_AAAK01000185 complement(2963..4222)
Gene = Efae1119
36% to 152B1
MKEVPVVDIKITDLKKLYQKGYNMLEELRHEADAPVVKAKIFNK
EAITIYGSSAAKVFYDPRNFKRKGAMPKLVLKTLFGQGGVQTLDGAAHHHRKNIFMDL
MTPERMEDYHRILDKNLTQALEAQHGQFELFDLSKMVFFTSICEWAGINLSAISKDEV
EKLAEYQISMISGTFTSPIDHIKGVENRKKSEKWAQGLIEEARQNPVAGKENVALYAF
ANATDLDGQLLPLEVAAVELLNIIRPTVALTVWAALMGHALFSRPDLYQQLKNDFSTL
QDPFIQEMRRYYPFFPMLPAISLKEVEVDGYRIPEGSWVILDLYGTDHDERTVEAPDS
FMIKRYVGKAKDISYKEEYEMIAQGGGNFRQMHRCAGEWITLHSLRVFSDQLVNKFEF
SVPEQDWTIPFNQFPTYPNSRALLYKN
CYP242A1 Kitasatospora griseola
GenEMBL AB048795 2479..3729
36% to 107P1
MADLETKFPRYELISSGKYVDRIPELHELREKSPIAWVPVMDAA
FLTRHADIVRVLKDHRMAPANLTQGIRLLSPEQQEELEPLSSAVKKWMGHTVPADHQR
FIGLLKRYFTPAMIDRMRPRVRQLSHELLDAVEPAGRMDIVSDIAYPLPACVIAEMLG
VPMDNRAQLLAWSADIGAIAEIVSYDRLMECQRSLLAMQDFVLEVVKERRAEPKDDLI
SMFVAAEREGLVSEAEILSNCVMLLFSGHETTGGLITSGLVQLFDHPDQLELLKSDPD
LMPGAVEEMLRLAGPASVISRVSTEPVEVAGSSIRSGAAVPPGADGGKRDPRVFEDPD
RLDATRRPNDHLAFATGMFYCLGAALARMEADEFFRILLDRFPDVNPGYETPDWQPVL
LISRRLKTLPVNLRGVGGSGAGDE
CYP243A1 Mycobacterium avium
AF232829 complement(389..1609)
38% to 222A1
MTPTRSTSTNWPSTPNCGNDVRSRGTRTTAGSGSSAATTPSAKP
PQRRHFRPQSTSRTPQTAWTTRARWASRDPEGQPALGLGEVDGPYHQALRHALAPFFS
PGAVEKLNPFMEQSAHWFLDQQITTGQMDLVLDYASPVPAILTMKLMGLPYDNWRLYA
NLFHSVMAVSQDSDEYAAAIAKVPAMMHEVLDYAATRRAKPEEDLTSFLIRFEFDGHR
LTDEQLLNILWNLIGGGVDTTTSQTALTVLHLGTHPDLRQQLIDHPELYRTATDEFLR
YFSVNQTLSRTVTHDVVLAGQRLRKNDRVVISWLSANHDENEFDRPDEIILDRAPNRH
VAFGLGPHRCIGSHLARLMSEVMVRAVLXRIPDYQVDVENVHQYLGNPSMTGLGQLPV
TFAPGKSRKTLRPW
CYP244A1 Streptomyces sp. TP-A0274
GenEMBL AB088119 complement(5410..6594)
staurosporine biosynthetic gene cluster
31% to 107B1
gene = staN
MTDMPVDPGPFDCMPELLAAARVAPVVRIPYLEEHAWVVCDPEL
VRTALTHPKMAKDITLVPQFMRKPGLMVGSQPPPEYARAMIMSDGEDHARIRRVHQPV
LSPRNTQRWGERVGVKVGGFLDELEQSRASDSAEVDVVTDYTHRVPLAFISEMLGLPL
EAERRLRSITDVMLYSSDYPARQEAVGALFGAVESWVQNPAPLRDGVITGFLAAADGP
DKVTEGEVIVWTVGMIITGYETTGSLISASLYEALRRPPEERPGTDEEIKSWIEEALR
VHPPFPHPTWRFPTEDIELGGYLIPKGAPVQVSIAAANRQPGEGADSFEAARGGHGHL
SFGLGMHYCIGASLVRLEAQIAVREFLRRFPKARLSDGSAVQWESEWMIRRLSALPAVLN
CYP245A1 Streptomyces sp. TP-A0274
AB088119 13260..14513
staurosporine biosynthetic gene cluster
38% to 107P2 53% TO Lechevalieria (10860..12053)
gene = staP
MASATLPRFDLMGWDKKDIADPYPVYRRYREAAPVHRTASGPGK
PDTYYVFTYDDVVRVLSNRRLGRNARVASGDTDTAPVPIPTEHRALRTVVENWLVFLD
PPHHTELRSLLTTEFSPSIVTGLRPRIAELASALLDRLRAQRRPDLVEGFAAPLPILV
ISALLGIPEEDHTWLRANAVALQEASTTRARDGRGYARAEAASQEFTRYFRREVDRRG
GDDRDDLLTLLVRARDTGSPLSVDGIVGTCVHLLTAGHETTTNFLAKAVLTLRAHRDV
LDELRTTPESTPAAVEELMRYDPPVQAVTRWAYEDIRLGDHDIPRGSRVVALLGSANR
DPARFPDPDVLDVHRAAERQVGFGLGIHYCLGATLARAEAEIGLRALLDGIPALGRGA
HEVEYADDMVFHGPTRLLLDLPDAA
CYP245A2 Lechevalieria aerocolonigenes
GenEMBL AF534707 complement(10860..12053)
36% to 107P1
gene = rbmE
MKPFDLKAFTGADLADPYPVYREYLTGDPVHHNGEAWYVFGYDG
VAHVLTSRDYGRRGPGGRATPIPPSHDTLSRIVENWLVFLDPPRHTALRSLLAKEFSP
AVVTGLRERVRKIAGELLAGLGDAGEIDLVEDFAAPLPILVISELLGVPARLRSWFRR
CAVDLQEASTARATRNPGALARADGAASELVEFFGGELGTRKPDDEDLVALLVNAQRR
GEALTDEEIVSTCVHLLTAGHETTTNLISKSVLALLANPAAAAEPLAGLDVTPQVVEE
LNRFDTPVQMVTRWAHQDTALGGKPIRRGDKVVLVLGSANRDPAAFAEPDRLDLRRDS
RRHCGFGLGIHYCLGAALARTEAEIGLSVLFTNFPGLRLGGEPVRYADDLVFHGPARL
PMLTR
CYP245A2 Lechevalieria aerocolonigenes
GenEMBL AB090952 15478..16671
rebeccamycin biosynthetic gene cluster
gene = rebD
100% to AF534707 rbmE
100% to Saccharothrix aerocolonigenes rebP
CYP245A2 Saccharothrix aerocolonigenes
GenEMBL AJ414559 15358..16551
gene cluster for rebeccamycin biosynthesis.
gene = rebP
36% to 107P1, 100% to AB090952, AF534707
CYP246A1 Streptomyces acidiscabies
GenEMBL AF393159 537..1724
40% to 105M1
gene = txtC
MESPATQVDPANSPLEPYHIYPEAKSCPVAKVGLWNGTPAHVFS
GYEDVRTVLQDRRFSSDSRRPNFTELTPTLQSQAAAPPFVRTDNPDHRRLRGTIAREF
LPKHIELLRPAIREIVQGVLDGLAETAPPQDMLEAFAVPVASATVFRLLGIPAEDRAL
LTRCVKGVVSAVGSEDEGAEVFRTLGEYIGGLVQDPSELPEDSLIRRLVTGPYQEKQL
TFHETIGVILMLIVGGYDTTASTISLSLVSYALQPEKFSVVHEHPERIPLLVEELLRY
HTVSQLGLGRIATEDVEVGGVTVRAGQMVVAALPLANRDESVFPNPDELDFDRPSVPH
VGFGYGPHQCVGQALARVELQEAIPAVIRRLPGMRLACALEDLPFRHDMATYGIHELPMTW
CYP247A1 Actinomadura verrucosospora
GenEMBL AF411574 1..>954 runs off end
40% to 162A1
MRARAPLHHQVLPDGREFWSVTRYDDVCRVLGEHQRFTSERGTV
VTHLGVDDVAAGTLLTSTDPPRHTLVRRAIGARLTARAVAPWRERIPERDWDELVQLT
AMVTAPSDPHFRHGSEAATLAIAHHELVTYVKEWAARRRSAGGDDGSLLDHLMTVRVA
GAPLTDEEIALDGYSILLGANVTTPHTVSGTVLALIERPEQFGKVQADPSLVPNLVEE
GLRWTSAACNFMRYAVDDVRIAGGTIPARGAVVAWIGSANRDESQFADPHTFDVTRNA
SRQVAFGYGPHYCVGAPLARLTLRVFFKELLRRFGSLSSGGS
CYP248A1 Micromonospora echinospora
GenEMBL AF497482 72606..73799
gene = calO2
calicheamicin biosynthetic locus
34% to 107H1
MTAFDPTDADVRRDPYPSYHWLLRHDPVHRGAHRVWYVSRFADV
RAVLGDERFARTGIRRFWTDLVGPGLLAEIVGDIILFQDEPDHGRLRGVVGPAFSPSA
LRRLEPVIAGTVDDLLRPALARGAMDVVDELAYPLALRAVLGLLGLPAADWGAVGRWS
RDVGRTLDRGASAEDMRRGHAAIAEFADYVERALARRRREGGEDLLALMLDAHDRGLM
SRNEIVSTVVTFIFTGHETVASQVGNAVLSLLAHPDQLDLLRRRPDLLAQAVEECLRY
DPSVQSNTRQLDVDVELRGRRLRRDDVVVVLAGAANRDPRRYDRPDDFDIERDPVPSM
SFGAGMRYCLGSYLARTQLRAAVAALARLPGLRLGCASDALAYQPRTMFRGLASLPIA
FTPGG
CYP249A1 Rhodococcus ruber
GenEMBL AF333761 12042..13244
Chauvaux,S., Chevalier,F., Le Dantec,C., Fayolle,F., Miras,I.,
Kunst,F. and Beguin,P.
Cloning of a genetically unstable cytochrome P-450 gene cluster
involved in degradation of the pollutant ethyl tert-butyl ether by
Rhodococcus ruber
J. Bacteriol. 183 (22), 6551-6557 (2001)
gene = ethB
37% TO CYP217A1
MTLSLATAQERYATDADVFAHDTLVDPYDTYRSLRDIGRVSYMT
RYDTWALTRYDEVRHALGDWQTFSSAQGIGMSTALNEAWKDFAPCKDGADHLPMRKLM
MQDLGPKAAAAYKEKIQQAAVTLVEELLDRREFDAVLDFAQMMPMRVFMEVLGVEPDI
EQRRTMLHWGTDTYNCAAPDGLYDDTLPSMDKLYSWALENITPETAREGSVAASTWES
VERGDITDVQAVATLAAYVTAGLDTTAGTLGNTIAQFAANPDQWAIVRDDPKTIPGAI
LEGIRFDSVAQWFTRVTTRDVEYDDIVIPAGSRTYHSYAAANRDERHYRDPDSFDVLR
NPTDHVGFGYGPHMCVGKSVSNTEMIALWTELGRRVDRIEQIGPKKQHINNLIRSLDS
LPVRIYPK
CYP250A1 Arthrobacter aurescens
GenEMBL AF146701
32% TO 219A1 N-TERM ONLY
MKEPLDFADPTLYQNPVPAFNKMREEHPVFWSDSAGSWVVSRHA
DVVRVLNNLEDAQASLFKINDYAEQCPFGKGTAISRGIENALVTTDLPDHPRLRRHTA
PLLTRRSVERDYAETVEQTVIALLEGIEEDTRFDVLDSISVPLPLAVVTKLIGFDAED
CYP251A1 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
38% to 183A1 and 171A2
clone name SP0759
CYP252A1 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
39% to 197A1
clone name SP0812
CYP253A1 Streptomyces peucetis
No accession number
Niranjan Parajuli
Submitted to nomenclature committee Nov. 2, 2003
36% to 197B1
clone name SP0913
Unidentified fragments
Rhodobacter capsulatus 1st sequence from a genome project
Rhodobacter capsulatus 2nd sequence from a genome project
Rhodobacter capsulatus 3rd sequence from a genome project 32% id with U08223
>AF071147 Streptoalloteichus hindustanus cytochrome
61% to 105C1 59% to AF040570 CDS 2652..3842
LLIAGHETTANMLALGAFALLEHPEQLAELRANPDLMPGAVEEL
MRYLSIVHIGPVRTAVADVEIEGQLIRAGESVTVSVPAANWDPAKFPEPERLDLTRRT
SGHLAFGHGVHQCLRQNL
>CYP? Streptomyces noursei putative P450 hydroxylase gene, partial cds.
GenEMBL AF071516 CDS complement(85..>519)
function="may hydroxylate a macrolide antibiotic polyketide moiety
C-term only 55% to 107A1
WTTPTRWSCSAPSLICLPRHGRNTAHRRTSRSHHPGRARRHPDR
DTLIPARSTVFIAGAAANRDPQKFPNPDTFDITRNTQGHLAFGYGVHHCIGRPLAQME
GEVAITALLRRFPHLHLTTPSQNLTWRRSFLRGLTALPVTLN
>L76374 Mycobacterium avium paratuberculosis 39% identical to 107B1
483 LIFTDPPRHRQLRKLINSGFXXRRVSVLEPKIRKIVXXILDGIEXGAVHEFTEQITAPLP 304
303 TRMIAELIGAPPDDWEQFRAWSDAATGTADPEIELDPAVAAGQLYEYFQRLIAARRARPR 124
123 ADLLSVLAEAEIDEHRLTDEDLLNFAFLLLVAGNETTRNLI 1
Rhodococcus rhodochrous
Swiss P31718 (20 amino acids)
Eltis L.D., Karlson U., Timmis K.N.
Eur. J. Biochem. 213, 211-216 (1993)