Language selection

Search

Patent 2523786 Summary

Third-party information liability

Some of the information on this Web page has been provided by external sources. The Government of Canada is not responsible for the accuracy, reliability or currency of the information supplied by external sources. Users wishing to rely upon this information should consult directly with the source of the information. Content provided by external sources is not subject to official languages, privacy and accessibility requirements.

Claims and Abstract availability

Any discrepancies in the text and image of the Claims and Abstract are due to differing posting times. Text of the Claims and Abstract are posted:

  • At the time the application is open to public inspection;
  • At the time of issue of the patent (grant).
(12) Patent Application: (11) CA 2523786
(54) English Title: A .DELTA.-12 DESATURASE GENE SUITABLE FOR ALTERING LEVELS OF POLYUNSATURATED FATTY ACIDS IN OLEAGINOUS YEASTS
(54) French Title: GENE DE ?-12 DESATURASE APPROPRIE POUR DIFFERENTS NIVEAUX DE DEGRADATION D'ACIDES GRAS POLYINSATURES DANS DES LEVURES OLEAGINEUSES
Status: Dead
Bibliographic Data
(51) International Patent Classification (IPC):
  • C12N 15/53 (2006.01)
  • C12N 1/19 (2006.01)
  • C12N 9/02 (2006.01)
  • C12N 15/63 (2006.01)
  • C12P 7/40 (2006.01)
  • C12P 7/64 (2006.01)
  • C12Q 1/68 (2006.01)
  • C40B 30/04 (2006.01)
(72) Inventors :
  • YADAV, NARENDRA S. (United States of America)
  • ZHANG, HONGXIANG (United States of America)
(73) Owners :
  • E.I. DU PONT DE NEMOURS AND COMPANY (United States of America)
(71) Applicants :
  • E.I. DU PONT DE NEMOURS AND COMPANY (United States of America)
(74) Agent: BENNETT JONES LLP
(74) Associate agent:
(45) Issued:
(86) PCT Filing Date: 2004-05-07
(87) Open to Public Inspection: 2004-12-02
Availability of licence: N/A
(25) Language of filing: English

Patent Cooperation Treaty (PCT): Yes
(86) PCT Filing Number: PCT/US2004/014542
(87) International Publication Number: WO2004/104167
(85) National Entry: 2005-10-26

(30) Application Priority Data:
Application No. Country/Territory Date
60/468,677 United States of America 2003-05-07
60/484,209 United States of America 2003-06-30

Abstracts

English Abstract




The present invention relates to a .DELTA.12 fatty acid desaturase able to
catalyze the conversion of oleic acid to linoleic acid (LA; 18:2). Nucleic
acid sequences encoding the desaturase, nucleic acid sequences that hybridize
thereto, DNA constructs comprising the desaturase gene, and recombinant host
microorganisms expressing increased levels of the desaturase are described.
Methods of increasing production of specific .omega.-3 and/or .omega.-6 fatty
acids are described by overexpression of the .DELTA.12 fatty acid desaturase
or by disruption of the native gene.


French Abstract

La présente invention concerne une désaturase d'acide gras ?12 capable de catalyser la conversion de l'acide oléique en acide linoléique (LA; 18:2). L'invention concerne aussi des séquences d'acide nucléique qui codent pour la désaturase, des séquences d'acides nucléiques qui s'hybrident à celle-ci, des séquences d'ADN hybrides comprenant le gène de désaturase, et des micro-organismes hôtes recombinés exprimant des taux supérieurs de désaturase. L'invention a également pour objet des procédés pour augmenter la production d'acides gras ?-3 et/ou ?-6 spécifiques, grâce à la surexpression de la désaturase d'acide gras ?12 ou à la dislocation du gène natif.

Claims

Note: Claims are shown in the official language in which they were submitted.



CLAIMS


What is claimed is:

1. An isolated nucleic acid molecule encoding a Yarrowia .DELTA.12
desaturase enzyme, selected from the group consisting of:
(a) an isolated nucleic acid molecule encoding the amino acid
sequence as set forth in SEQ ID NO:24;
(b) an isolated nucleic acid molecule that hybridizes with (a)
under the following hybridization conditions: 0.1X SSC,
0.1% SDS, 65 °C and washed with 2X SSC, 0.1 % SDS
followed by 0.1X SSC, 0.1% SDS; or
an isolated nucleic acid molecule that is complementary to (a)
or (b).

2. The isolated nucleic acid molecule of Claim 1 as set forth in
SEQ ID NO:23.

3. A polypeptide encoded by the isolated nucleic acid molecule of
Claim 1.

4. The polypeptide of Claim 3 as set forth in SEQ ID NO:24.

5. An isolated nucleic acid molecule comprising a first nucleotide
sequence encoding a .DELTA.12 desaturase enzyme of at least 419 amino acids
that has at least 53% identity based on the Clustal method of alignment
when compared to a polypeptide having the sequence as set forth in SEQ
ID NO:24;
or a second nucleotide sequence comprising the complement
of the first nucleotide sequence.

6. A chimeric gene comprising the isolated nucleic acid molecule
of any of Claims 1-2 operably linked to suitable regulatory sequences.

7 A transformed host cell comprising the chimeric gene of
Claim 6.

8. A transformed host cell according to Claim 7 selected from the
group consisting of plants, algae, bacteria, yeast and fungi.

9. A transformed host cell according to Claim 8 wherein the yeast
is an oleaginous yeast.

10. A transformed host cell according to Claim 9 wherein the
oleaginous yeast is selected from the group consisting of Yarrowia,
Mortierella, Candida, Rhodotorula, Rhodosporidium, Cryptococcus,
Trichosporon and Lipomyces.

11. A transformed host cell according to Claim 10 wherein the
oleaginous yeast is Yarrowia sp.



79



12. A transformed host cell according to Claim 11 selected from
the group consisting of Yarrowia lipolytica ATCC #20362, Yarrowia
lipolytica ATCC #8862, Yarrowia lipolytica ATCC #18944, Yarrowia
lipolytica ATCC #76982 and Yarrowia lipolytica LGAM S(7)1.

13. A method for the production of linoleic acid comprising:
a) providing a microbial host cell comprising:
(i) the chimeric gene of Claim 6 encoding a .DELTA.12
desaturase polypeptide; and
(ii) a source of desaturase substrate consisting of oleic
acid;
b) growing the microbial host cell of step (a) under conditions
wherein the gene encoding a .DELTA.12 desaturase polypeptide
is expressed and the oleic acid is converted to linoleic
acid; and
c) optionally recovering the linoleic acid of step (b).

14. A method for producing .omega.-3 fatty acids comprising:
a) engineering a microbial host cell comprising the following
elements:
(i) a disrupted endogenous gene encoding a .DELTA.12
desaturase polypeptide; and
(ii) genes encoding enzymes of the .omega.-3 fatty acid
biosynthetic pathway; and
b) providing a source of desaturase substrate consisting of
.alpha.-linolenic acid;
c) growing the microbial host cell of step (a) under
conditions wherein the genes of the .omega.-3 fatty acid
biosynthetic pathway are expressed, producing .omega.-3 fatty
acids; and
d) optionally recovering the .omega.-3 fatty acids of step (c).

15. A method according to Claim 14, wherein the genes encoding
enzymes of the .omega.-3 fatty acid biosynthetic pathway are selected from the
group consisting of: a .DELTA.6 desaturase, an elongase, a .DELTA.5 desaturase
and a
.DELTA.4 desaturase.

16. A method according to Claim 14, wherein the .omega.-3 fatty acids
are selected from the group consisting of: steraidonic acid,
eicosatetraenoic acid, eicosapentaenoic acid, docosapentaenoic acid and
docosahexaenoic acid.


80



17. A method according to Claim 13 or 14 wherein the host cell is a
yeast cell.

18. A method according to Claim 17 wherein the yeast cell is an
oleaginous yeast selected from the group consisting of Yarrowia, Candida,
Rhodotorula, Rhodosporidium, Cryptococcus, Trichosporon and
Lipomyces.

19. A method according to Claim 18 wherein the oleaginous yeast
is Yarrowia sp.

20. A method according to Claim 19 selected from the group
consisting of Yarrowia lipolytica ATCC #20362, Yarrowia lipolytica ATCC
#8862, Yarrowia lipolytica ATCC #18944, Yarrowia lipolytica ATCC
#76982 and Yarrowia lipolytica LGAM S(7)1.

21. A method for modulating the biosynthesis of .omega.-3 or .omega.-6 fatty
acids in a microbial host cell comprising:
a) providing a microbial host cell comprising a functional .omega.-3
/ .omega.-6 fatty acid biosynthetic pathway;
b) over-expressing a .DELTA.12 desaturase gene in the host cell of
(a); whereby the biosynthesis of .omega.-3 or .omega.-6 fatty acids is
modulated.

22. A method according to Claims 21 wherein the host cell is an
oleaginous yeast.

23. A method according to Claim 22 wherein the host cell is
Yarrowia and the .DELTA.12 desaturase gene encodes the .DELTA.12 desaturase
enzyme as set forth in SEQ ID NO:24.

24. A method according to Claim 21 wherein the .DELTA.12 desaturase
gene is over-expressed on a multicopy plasmid.

25. A method according to Claim 21 wherein .DELTA.12 desaturase gene
is operably linked to an inducible or regulated promoter.

26. A microbial oil produced by the method of any one of
Claims 13-20.

27. A method of obtaining a nucleic acid molecule encoding a .DELTA.12
desaturase enzyme comprising:
(a) probing a genomic library with the nucleic acid molecule of
Claim 1;
(b) identifying a DNA clone that hybridizes with the nucleic
acid molecule of Claim 1; and


81



(c) sequencing the genomic fragment that comprises the
clone identified in step (b),
wherein the sequenced genomic fragment encodes a .DELTA.12 desaturase
enzyme.

28. A method of obtaining a nucleic acid molecule encoding a .DELTA.12
desaturase enzyme comprising:
(a) synthesizing at least one oligonucleotide primer
corresponding to a portion of the sequence as set forth in
SEQ ID NOs:23; and
(b) amplifying an insert present in a cloning vector using the
oligonucleotide primer of step (a);
wherein the amplified insert encodes a portion of an amino acid sequence
encoding a .DELTA.12 desaturase enzyme.



82

Description

Note: Descriptions are shown in the official language in which they were submitted.




CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
TITLE
A X12 DESATURASE GENE SUITABLE FOR ALTERING LEVELS OF
POLYUNSATURATED FATTY ACIDS IN OLEAGINOUS YEASTS
This application claims the benefit of U.S. Provisional Application
No. 60/468677, filed May 7, 2003, and U.S. Provisional Application
No. 60/484209, filed June 30, 2003.
FIELD ~OF THE INVENTION
This invention is in the field of biotechnology. More specifically, this
invention pertains to the identification of a nucleic acid fragment encoding
a 012 fatty acid desaturase enzyme useful for disrupting or enhancing the
production of polyunsaturated fatty acids (PUFAs) in oleaginous
microorganisms, such as oleaginous yeasts.
BACKGROUND OF THE INVENTION
It has long been recognized that certain polyunsaturated fatty acids,
or PUFAs, are important biological components of healthy cells. For
example, such PUFAs are recognized as:
~ "Essential" fatty acids that can not be synthesized de novo in
mammals and instead must be obtained either in the diet or derived
by further desaturation and elongation of linoleic acid (LA) or a-
linolenic acid (ALA);
~ Constituents of plasma membranes of cells, where they may be
found in such forms as phospholipids or triglycerides;
~ Necessary for proper development, particularly in the developing
infant brain, and for tissue formation and repair; and,
~ Precursors to several biologically active eicosanoids of importance
in mammals, including prostacyclins, eicosanoids, leukotrienes and
prostaglandins.
(n the 1970's, observations of Greenland Eskimos linked a low
incidence of heart disease and a high intake of long-chain w-3 PUFAs
(Dyerberg, J. et al., Amer. J. Clin Nutr. 28:958-966 (1975); Dyerberg, J. et
al., Lancet 2(8081 ):117-119 (July 15, 1978)). More recent studies have
confirmed the cardiovascular protective effects of e~-3 PUFAs
(Shimokawa, H., World Rev Nutr Dief, 88:100-108 (2001 ); von Schacky,
C., and Dyerberg, J., 1/Vorld Rev Nutr Diet, 88:90-99 (2001 )). Further, it
has been discovered that several disorders respond to treatment with cu-
3 fatty acids, such as the rate of restenosis after angioplasty, symptoms of
inflammation and rheumatoid arthritis, asthma, psoriasis and eczema. y-
linolenic acid (GLA, an w-6 PUFA) has been shown to reduce increases in
1



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
blood pressure associated with stress and to improve performance on
arithmetic tests. GLA and dihomo-y-linolenic acid (DGLA, another cu-6
PUFA) have been shown to inhibit platelet aggregation, cause
vasodilation, lower cholesterol levels and inhibit proliferation of vessel
wall
smooth muscle and fibrous tissue (Brenner et al., Adv. Exp. Med. Biol. 83:
85-101 (1976)). Administration of GLA or DGLA, alone or in combination
with eicosapentaenoic acid (EPA, an w-3 PUFA), has beeri shown to
reduce or prevent gastrointestinal bleeding and other side efFects caused
by non-steroidal anti-inflammatory drugs (U.S. 4,666,701 ). Further, GLA
and DGLA have been shown to prevent or treat endometriosis and
premenstrual syndrome (U.S. 4,758,592) and to treat myalgic
encephalomyelitis and chronic fatigue after viral infections (U.S.
5,116,871 ). Other evidence indicates that PUFAs may be involved in the
regulation of calcium metabolism, suggesting that they may be useful in
the treatment or prevention of osteoporosis and kidney or urinary tract
stones. Finally, PUFAs can be used in the treatment of cancer and
diabetes (U.S. 4,826,877; Horrobin et al., Am. J. Clin. Nutr. 57 (Suppl.):
732S-737S (1993)).
PUFAs are generally divided into two major classes (consisting of
the cu-6 and the e~-3 fatty acids) that are derived by desaturation and
elongation of the essential fatty acids, LA and ALA, respectively. Despite a
variety of commercial sources of PUFAs from natural .sources [e.g., seeds
of evening primrose, borage and black currants; filamentous fungi
(Mortierella), Porphyridium (red alga), fish oils and marine plankton
(Cyclotella, Nitzschia, Crypthecodinium)], there are several disadvantages
associated with these methods of production. First, natural sources such
as fish and plants tend to have highly heterogeneous oil compositions.
The oils obtained from these sources therefore can require extensive
purification to separate or enrich one or more of the desired PUFAs.
Natural sources are also subject to uncontrollable fluctuations in
availability (e.g., due to weather, disease, or over-fishing in the case of
fish
stocks); and, crops that produce PUFAs often are not competitive
economically with hybrid crops developed for food production. Large-
scale fermentation of some organisms that naturally produce PUFAs (e.g.,
Porphyridium, Morfierella) can also be expensive andlor difficult to
cultivate on a commercial scale.
2



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
As a result of the limitations described above, extensive work has
been conducted toward: 1.) the development of recombinant sources of
PUFAs that are easy to produce commercially; and 2.) modification of fatty
acid biosynthetic pathways, to enable production of desired PUFAs. For
example, advances in the isolation, cloning and manipulation of fatty acid
desaturase and elongase genes from various organisms have been made
over the last several years. Knowledge of these gene sequences offers
the prospect of producing a desired fatty acid and/or fatty acid composition
in novel host organisms that do not naturally produce PUFAs. The
literature reports a number of examples in Saccharomyces cerevisiae,
such as:
~ Domergue, F., et al. (Eur. J. Biochem. 269:4105-4113 (2002)),
wherein two desaturases from the marine diatom
Phaeodactylum fricornutum were cloned into S. cerevisiae,
leading to the production of EPA;
~ Beaudoin F., et al: (Proc. Natl. Acad. Sci. U.S.A. 97(12):6421-
6426 (2000)), wherein the w-3 and w-6 PUFA biosynthetic
pathways were reconstituted in S. cerevisiae, using genes from
Caenorhabditis elegans;
~ Dyer, J.M. et al. (Appl. Eniv. Microbiol., 59:224-230 (2002)),
wherein plant fatty acid desaturases (FAD2 and FAD3) were
expressed in S. cerevisiae, leading to the production of ALA;
and,
~ U.S. 6,136,574 (Knutzon et al., Abbott Laboratories), wherein
one desaturase from Brassica napus and two desaturases from
the fungus Mortierella alpina were cloned into S. cerevisiae,
leading to the production of LA, GLA, ALA and STA.
There remains a need, however, for an appropriate microbial system in
which these types of genes can be expressed to provide for economical
production of commercial quantities of one or more PUFAs. Additionally, a
need exists for oils enriched in specific PUFAs, notably EPA and DHA.
One class or microorganisms that has' not been previously
examined as a production platform for PUFAs are the oleaginous yeasts.
These organisms can accumulate oil up to ~80% of their dry cell weight.
The technology for growing oleaginous yeast with high oil content is well
developed (for example, see EP 0 005 27781; Ratledge, C., Prog. Ind.
Microbiol. 16:119-206 (1982)) and may offer a cost advantage compared
3



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
to commercial micro-algae fermentation for production of w-3 or cu-6
PUFAs. Whole yeast cells may also represent a convenient way of
encapsulating w-3 or w-6 PUFA-enriched oils for use in functional foods
and animal feed supplements.
Despite the advantages noted above, most oleaginous yeast are
naturally deficient in cu-6 and w-3 PUFAs, since naturally produced PUFAs
in these organisms are usually limited to 18:2 fatty acids (and less
commonly, 18:3 fatty acids). Thus, the problem to be solved is to develop
an oleaginous yeast that accumulates oils enriched in cu-3 and/or cu-6 fatty
acids. Toward this end, it is not only necessary to introduce the required
desaturases and elongases that allow for the synthesis and accumulation
of w-3 andlor w-6 fatty acids in oleaginous yeasts, but also to increase the
availability of the 18:2 substrate (i.e., LA). Generally, the availability of
this
substrate is controlled by the activity of 012 desaturases that catalyze the
conversion of oleic acid to LA.
There are a variety of known X12 desaturases disclosed in the
public literature, some of which originate from fungal sources (e.g.,
Morfierella alpina, Emericella nidulans, Mucorrouxi~~. These desaturases
are not known to- be effective for altering fatty acid composition in
oleaginous yeasts and are not preferred for use in oleaginous yeasts.
Thus, there is need for the identification and isolation of genes encoding
X12 desaturases that will be suitable for expression in these particular host
organisms for use in the production of PUFAs.
Applicants have solved the stated problem by isolating the gene
encoding a X12 desaturase from the oleaginous yeast, Yarrowia lipolyfica.
SUMMARY OF THE INVENTION
The invention relates to a gene encoding a 012 desaturase enzyme
isolated from Yarrowia useful for the manipulation of the biochemical
pathway for the production of ~-3 and/or cu-6 fatty acids. Accordingly, the
invention provides an isolated nucleic acid molecule encoding a Yarrovvia
012 desaturase enzyme, selected from the group consisting of:
(a) an isolated nucleic acid molecule encoding the amino acid
sequence as set forth in SEQ ID N0:24;
(b) an isolated nucleic acid molecule that hybridizes with (a)
under the following hybridization conditions: 0.1X SSC,
0.1 % SDS, 65 °C and washed with 2X SSC, 0.1 % SDS
followed by 0.1X SSC, 0.1% SDS; or
4



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
an isolated nucleic acid molecule that is complementary to (a)
or (b).
Additionally the invention provides transformed host cells
comprising the nucleic acid molecules of the invention, genetic chimera
and polypeptides encoded by the same.
In an alternate embodiment the invention provides a method for the
production of linoleic acid comprising:
a) providing a yeast comprising:
(i) a chimeric gene of the invention encoding a X12
desaturase polypeptide; and
(ii) a source of desaturase substrate consisting of oleic
acid;
b) growing the yeast of step (a) under conditions wherein the
gene encoding a X12 desaturase polypeptide is
expressed and the oleic acid is converted to linoleic acid;
and
c) optionally recovering the linoleic acid of step (b).
In another embodiment the invention provides a method for
producing w-3 fatty acids comprising:
a) engineering a microbial host cell comprising the following
elements:
(i) a disrupted endogenous gene encoding a X12
desaturase polypeptide; and
(ii) genes encoding enzymes of the w-3 fatty acid
biosynthetic pathway; and
b) providing a source of desaturase substrate consisting of
a-linolenic acid;
c) growing the yeast of step (a} under conditions wherein the
genes of the c~-3 fatty acid biosynthetic pathway are
expressed, producing w-3 fatty acids; and
d) optionally recovering the ~-3 fatty acids of step (c).
Similarly the invention provides a method for modulating the
biosynthesis of ~-3 or ~-6 fatty acids in a host cell comprising:
a) providing a host cell comprising a functional w-3 / cu-6 fatty
acid biosynthetic pathway;
5



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
b) over-expressing a X12 desaturase gene in the host cell of
(a); whereby the biosynthesis of ~-3 or ~-6 fatty acids is
modulated.
In another embodiment the invention provides a method of
obtaining a nucleic acid molecule encoding a 012 desaturase enzyme
comprising:
(a) probing a genomic library with the nucleic acid molecule of
the invention;
(b) identifying a DNA clone that hybridizes with the nucleic
acid molecule of the invention; and
(c) sequencing the genomic fragment that comprises the
clone identified in step (b),
wherein the sequenced genomic fragment encodes a X12
desaturase enzyme.
Similarly the invention provides a method of obtaining a nucleic acid
molecule encoding a X12 desaturase enzyme comprising:
(a) synthesizing at least one oligonucleotide primer
corresponding to a portion of the sequence. as set forth in
SEQ ID NOs:23; and
(b) amplifying an insert present in a cloning vector using the
oligonucleotide primer of step (a);
wherein the amplified insert encodes a portion of an amino acid sequence
encoding a X12 desaturase enzyme.
BRIEF DESCRIPTION OF THE DRAWINGS AND
SEQUENCE DESCRIPTIONS
Figure 1 shows a schematic illustration of the biochemical
mechanism for lipid accumulation in oleaginous yeast.
Figure 2 illustrates the w-3 and w-6 fatty acid biosynthetic
pathways.
Figure 3 illustrates the construction of the plasmid vector pY5 for
gene expression in Yarrov~ria lipolytica.
Figure 4 illustrates the construction of plasmid vectors pY5-13 and
pY5-4 for gene expression in Y. lipolytica.
Figure 5 shows a pairwise comparison (% Identity) between and
among different yeast and fungal X12 desaturase homologs using a
ClustalW analysis (Megalign program of DNASTAR sofware).
6



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Figure~6 is a schematic presentation of the construction of
intermediate vector pYZMSCHPPA.
Figure 7 shows a comparison between the DNA sequence of the
Saprolegnia diclina X17 desaturase gene and the synthetic gene codon
optimized for expression in Y. lipolytica.
Figure 8 illustrates the favored consensus sequences around the
translation initiation codon 'ATG' in Y. lipolytica.
Figure 9 illustrates the strategy for in vifro synthesis of the codon-
optimized 017 desaturase gene.
Figure 10 shows plasmids for expression of the synthetic codon-
optimized and wildtype 017 desaturase genes in Y. lipolyfica.
Figures 11A and 11 B show the results of gas chromatographic
analysis of fatty acids produced in Y, lipolytica transformed with the
wildtype and synthetic codon-optimized X17 desaturase genes,
respectively.
Figure 12 is a schematic presentation of the construction of
intermediate vector pY24-4.
Figure 13 is a schematic presentation of the construction of
intermediate vector pYZV16.
Figure 14 is a schematic presentation of the construction of
integration vector pYZM5EL6.
Figure 15 is a schematic presentation of the construction of
integration vectors pYZV5EL6 and pYZV5EL6/17.
The invention can be more fully understood from the following
detailed description and the accompanying sequence descriptions, which
form a part of this application.
The following sequences comply with 37 C.F.R. ~1.821-1.825
("Requirements for Patent Applications Containing Nucleotide Sequences
and/or Amino Acid Sequence Disclosures - the Sequence Rules") and are
consistent with World Intellectual Property Organization (WIPO) Standard
ST.25 (1998) and the sequence listing requirements of the EPO and PCT
(Rules 5.2 and 49.5(a-bis), and Section 208 and Annex C of the
Administrative Instructions). The symbols and format used for nucleotide
and amino acid sequence data comply with the rules set forth in
37 C.F.R. ~1.822.
SEQ ID NOs:1 and 2 correspond to primers TEFS' and TEF3',
respectively, used to isolate the TEF promoter.
7



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
SEQ ID NOs:3 and 4 correspond to primers XPRS' and XPR3',
respectively, used to isolate the XPR2 transcriptional terminator.
SEQ ID NOs:S-18 correspond to primers YL1, YL2, YL3, YL4,
YL23, YL24, YLS, YL6, YL9, YL10, YL7, YLB, YL61 and YL62,
respectively, used for plasmid construction.
SEQ ID NOs:19 and 21 are the degenerate primers identified as
P73 and P76, respectively, used for the isolation of a Yarrowia lioplytica
012 desaturase gene.
SEQ ID NOs:20 and 22 are the amino acid consensus sequences
that correspond to the degenerate primers P73 and P76, respectively.
SEQ ID N0:23 shows the DNA sequence of the Y. lipolytica X12
desaturase gene, while SEQ ID N0:24 shows the amino acid sequence of
the Y. lipolytica X12 desaturase.
SEQ ID NOs:25-28 correspond to primers P99, P100, P101 and
P102, respectively, used for targeted disruption of the Y. lipolytica d12
desaturase gene.
SEQ ID NOs:29-32 correspond to primers P119, P120, P121 and
P122, respectively, used to screen for targeted integration of the disrupted
Y. lipolytica 012 desaturase gene.
SEQ ID NOs:33 and 34 correspond to primers P147 and P148,
respectively, used to amplify the full-length Y. lipolyfica X12 desaturase
gene.
SEQ ID NO:35 shows the DNA sequence of the Saprolegnia diclina
X17 desaturase gene.
SEQ ID N0:36 shows the DNA sequence of the Morfierella alpina
d6 desaturase gene, while SEQ ID N0:37 shows the amino acid sequence
of the M. alpina D6 desaturase.
SEQ ID N0:38 shows the DNA sequence of the Morfierella alpina
a5 desaturase gene, while SEQ ID N0:39 shows the amino acid sequence
of the M. alpina ~5 desaturase.
SEQ ID NOs:40 and 41 correspond to primers YL11 and YL12,
respectively, used for amplifying the M. alpina ~5 desaturase.
SEQ ID NOs:42 and 43 correspond to primers YL21A and YL22,
respectively, used for amplifying the wild type S. diclina 017 desaturase.
SEQ ID N0:44 shows the DNA sequence of the Mortierella alpina
high affinity elongase gene, while SEQ ID N0:45 shows the amino acid
sequence of the M. alpina high affinity elongase.
8



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
SEQ ID N0:46 shows the DNA sequence of the synthetic X17
desaturase gene codon-optimized for expression in Yarrowia lipolytica,
while SEQ ID N0:47 shows the corresponding amino acid sequence of the
S. diclina X17 desaturase.
SEQ ID NOs:48-69 correspond to the 11 pairs of oligonucleotides
that together comprise the entire codon-optimized coding region of the S.
diclina X17 desaturase gene (e.g., D17-1A, D17-1B, D17-2A, D17-2B,
D17-3A, D17-3B, D17-4A, D17-4B, D17-5A, D17-5B, D17-6A, D17-6B,
D17-7A, D17-7B, D17-8A, D17-8B, D17-9A, D17-9B, D17-10A, D17-10B,
D17-11A and D17-11B, respectively).
SEQ ID NOs:70-75 correspond to primers D17-1, D17-4R, D17-5,
D17-8D, D17-8U and D17-11, respectively, used for PCR amplification
during synthesis of the codon-optimized X17 desaturase gene.
SEQ ID NOs:76 and 77 correspond to primers YL53 and YL54,
respectively, used for site-directed mutagenesis to generate pYSD17M.
SEQ ID NOs:78 and 79 correspond to primers KU5 and KU3,
respectively, used for amplifying a 1.7 kB DNA fragment (SEQ ID N0:80;
amino acid sequence provided as SEQ ID N0:81 ) containing the Yarrowia
URA3 gene.
', SEQ ID NOs:82 and 83 correspond to primers KI5 and KI3,
respectively, used for amplifying a 1.1 kB DNA fragment (SEQ ID N0:84;
amino acid sequence provided as SEQ ID N0:85) containing the
conjugase gene of Impatients balsams.
SEQ ID NOs:86 and 87 correspond to primers KTf5 and KT13,
respectively, used for amplifying a 1.7 kB DNA fragment (SEQ ID N0:88;
amino acid sequence provided as SEQ ID N0:89) containing a
TEF::conjugase::XPR chimeric gene.
SEQ ID NOs:90 and 91 correspond to primers KH5 and KH3,
respectively, used for amplifying a 1 kB DNA fragment (SEQ ID N0:92;
amino acid sequence provided as SEQ ID N0:93) containing the E, coli
hygromycin resistance gene.
SEQ ID NOs:94 and 95 correspond to primers KTH5 and KTH3,
respectively, used for amplifying a 1.6 kB DNA fragment (SEQ ID N0:96;
amino acid sequence provided as SEQ ID N0:97) containing the
TEF::HPT::XPR fusion gene.
SEQ ID NOs:98 and 99 correspond to the 401 by of 5'-sequence
and 568 by of 3'-sequence of the Yarrowia lipolyfica URA3 gene,
9



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
respectively, used to direct integration of expression cassettes into the Ura
loci of the Yarrowia genome. '
SEQ ID NOs:100-103 correspond to primers YL63, YL64, YL65 and
YL66, respectively, used for site-directed mutagenesis to generate pY24-
4.
SEQ ID NOs:104-107 correspond to primers YL81, YL82, YL83 and
YL84, respectively, used for site-directed mutagenesis to generate
pYZMSCH.
SEQ ID NOs:108 and 109 correspond to primers YL105 and YL106,
respectively, used for site-directed mutagenesis to generate pYZMSCHPP.
SEQ ID NOs:110 and 111 correspond to primers YL119 and YL120,
respectively, used for site-directed mutagenesis to generate
pYZMSCHPPA.
SEQ ID NOs:112 and 113 correspond to primers YL121 and YL122,
respectively, used for amplifying 440 by of 5'-non-coding DNA sequence
(SEQ ID NO:114) upstream from the Y. lipolytica URA3 gene.
SEQ ID NOs:115 and 116 correspond to primers YL114 and YL115,
respectively, used for site-directed mutagenesis to generate pYZV5 and
pYZVSP.
SEQ ID N0:117 corresponds to a 5.2 kB DNA fragment suitable for
integration and expression of the ~5 desaturase gene in the Y. lipolytica
genome.
SEQ ID NOs:118 and 119 correspond to primers YL69 and YL70,
respectively, used for site-directed mutagenesis to generate pY58BH.
SEQ ID NOs:120-123 correspond to primers YL77, YL78, YL79A
and YL80A, respectively, used for site-directed mutagenesis to generate
pY54PC.
SEQ ID N0:124 corresponds to a 8.9 kB DNA fragment suitable for
integration and coordinate expression of the ~6 desaturase, PUFA
elongase and o5 desaturase genes in the Y. lipolytica genome.
SEQ ID NOs:125-128 correspond to primers YL101, YL102, YL103
and YL104, respectively, used for site-directed mutagenesis to generate
pYSD17SPC.
SEQ ID N0:129 corresponds to a 10.3 kB DNA fragment suitable
for integration and coordinate expression of the ~6 desaturase, PUFA
elongase, O5 desaturase and 017 desaturase genes in the Y. lipolytica
genome.



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
SEQ ID N0:130 corresponds to the codon-optimized translation
initiation site for genes optimally expressed in Yarrowia sp.
DETAILED DESCRIPTION OF THE INVENTION
In accordance with the subject invention, Applicants have isolated
and confirmed the identity of a Yarrowia lipolytica gene encoding a 012
desaturase. Additionally, methods and compositions are provided which
permit modification of the long chain polyunsaturated fatty acid (PUFA)
content of oleaginous yeasts, such as Yarrowia lipolytica.
The invention relates to a new X12 desaturase enzyme and gene
encoding the same that may be used for the manipulation of biochemical
pathways for the production of healthful PUFAs. The subject invention
finds many applications. PUFAs, or derivatives thereof, made by the
methodology disclosed herein can be used as dietary substitutes, or
supplements, particularly infant formulas, for patients undergoing
intravenous feeding or for preventing or treating malnutrition. Alternatively,
the purified PUFAs (or derivatives thereof) may be incorporated into
cooking oils, fats or margarines formulated so that in normal use the
recipient would receive the desired amount for dietary supplementation.
The PUFAs may also be incorporated into infant formulas, nutritional
supplements or other food products and may find use as anti-inflammatory
or cholesterol lowering agents. Optionally, the compositions may be used
for pharmaceutical use (human or veterinary). In this case, the PUFAs are
generally administered orally but can be administered by any route by
which they may be successfully absorbed, e.g., parenterally (e.g.,
subcutaneously, intramuscularly or intravenously), rectally, vaginally or
topically (e.g., as a skin ointment or lotion).
Supplementation of humans or animals with PUFAs produced by
recombinant means can result in increased levels of the added PUFAs, as
well as their metabolic progeny. For example, treatment with arachidonic
acid (ARA) can result not only in increased levels of ARA, but also
downstream products of ARA such as prostaglandins. Complex regulatory
mechanisms can make it desirable to combine various PUFAs, or add
different conjugates of PUFAs, in order to prevent, control or overcome
such mechanisms to achieve the desired levels of specific PUFAs in an
individual.
11



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Definitions
In this disclosure, a number of terms and abbreviations are used.
The following definitions are provided.
"Open reading frame" is abbreviated ORF.
"Polymerase chain reaction" is abbreviated PCR.
"American Type Culture Collection" is abbreviated ATCC.
"Polyunsaturated fatty acid(s)" is abbreviated PUFA(s).
The term "fatty acids" refers to long chain aliphatic acids (alkanoic
acids) of varying chain length, from about C~2 to C22 (although both longer
and shorter chain-length acids are known). The predominant chain
lengths are between C~6 and C2~. The structure of a fatty acid is
represented by a simple notation system of "X:Y", where X is the total
number of carbon (C) atoms in the particular fatty acid and Y is the
number of double bonds.
Generally, fatty acids are classified as saturated or unsaturated.
The term "saturated fatty acids" refers to those fatty acids that have no
"double bonds" between their carbon backbone. In contrast, "unsaturated
fatty acids" have "double bonds" along their carbon backbones (which are
most commonly in the cis-configuration). "Monounsaturated fatty acids"
have only one "double bond" along the carbon backbone (e:g., usually
between the 9t" and 1 Ot" carbon atom as for palmitoleic acid (16:1 ) and
oleic acid (18:1 )), while "polyunsaturated fatty acids" (or "PUFAs") have at
least two double bonds along the carbon backbone (e.g., between the 9t"
and 10t", and 12t" and 13t" carbon atoms for linoleic acid (18:2); and
between the 9t" and 10t", 12t" and 13t", and 15t" and 16t" for a-linolenic
acid (18:3)).
"PUFAs" can be classified into two major families (depending on the
position (n) ofthe first double bond nearest the methyl end of the fatty acid
carbon chain). Thus, the "omega-6 fatty acids" (~-6 or n-6) have the first
unsaturated double bond six carbon atoms from the omega (methyl) end
of the molecule and additionally have a total of two or more double bonds,
with each subsequent unsaturation occuring 3 additional carbon atoms
toward the carboxyl end of the molecule. In contrast, the "omega-3 fatty
acids" (cu-3 or n-3) have the first unsaturated double bond three carbon
atoms away from the omega end of the molecule and additionally have a
total of three or more double bonds, with each subsequent unsaturation
12



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
occuring 3 additional carbon atoms toward the carboxyl end of the
molecule.
For the purposes of the present disclosure, the omega-reference
system will be used to indicate the number of carbons, the number of
double bonds and the position of the double bond closest to the omega
carbon, counting from the omega carbon (which is numbered 1 for this
purpose). This nomenclature is shown below in Table 1, in the column
titled "Shorthand Notation". The remainder of the Table summarizes the
common names of w-3 and cu-6 fatty acids, the abbreviations that will be
used throughout the specification and each compounds' chemical name.
Table 1
Nomenclature Of Polyunsaturated Fatty Acids
Common Name AbbreviationChemical Name Shorthand


Notation


Linoleic LA cis-9,12-octadecadienoic18:2 w-6


y-Linoleic GLA cis-6, 9, 12- 18:3 w-6


octadecatrienoic


Dihomo-y- DGLA 1 cis-8, 11, 14- 20:3 cu-6


Linoleic eicosatrienoic


Arachidonic ARA cis-5, 8, 11, 14- 20:4 w-6
,


eicosatetraenoic


a-Linolenic ALA cis-9, 12, 15- 18:3 w-3


octadecatrienoic


Stearidonic STA cis-6, 9, 12, 15- 18:4 w-3


octadecatetraenoic


Eicosa- ETA cis-8, 11, 14, 17- 20:4 w-3


tetraenoic eicosatetraenoic


Eicosa- EPA cis-5, 8, 11, 14, 20:5 w-3
17-


pentaenoic eicosapentaenoic


Docosa- DPA cis-7, 10, 13, 16, 22:5 cu-3
19-


pentaenoic docosapentaenoic


Docosa- DHA cis-4, 7, 10, 13, 22:6 w-3
16, 19-


hexaenoic docosahexaenoic


The term "essential fatty acid" refers to a particular PUFA that an
individual must ingest in order to survive, being unable to synthesize the
particular essential fatty acid de novo. Linoleic (18:2, w-6) and linolenic
(18:3, ~-3) fatty acids are "essential fatty acids", since humans cannot
synthesize them and have to obtain them in their diet.
13



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
The term "fat" refers to a lipid substance that is solid at 25 °C
and
usually saturated.
The term "oil" refers to a lipid substance that is liquid at 25 °C
and
usually polyunsaturated. PUFAs are found in the oils of some algae,
oleaginous yeasts and filamentous fungi. "Microbial oils" or "single cell
oils" are those oils naturally produced by microorganisms during their
lifespan. Such oils can contain long chain PUFAs.
The term "PUFA biosynthetic pathway enzyme" refers to any of the
following enzymes (and genes which encode said enzymes) associated
with the biosynthesis of a PUFA, including: a 04 desaturase, a ~5
desaturase, a O6 desaturase, a X12 desaturase, a 015 desaturase, a 017
desaturase, a O9 desaturase and/or an elongase.
The term "w-3/w-6 fatty acid biosynthetic pathway" refers to a set of
genes which, when expressed under the appropriate conditions encode
enzymes that catalyze the production of either or both w-3 and cu-6 fatty
acids. Typically the genes involved in the cu-3/c~-6 fatty acid biosynthetic
pathway encode some or all of the following enzymes:~12 desaturase, ~6
desaturase, elongase, 05 desaturase, 017 desaturase,015 desaturase, O9
desaturase and D4 desaturase. A representative pathway is illustrated in
Figure 2, providing for the conversion of oleic acid through various
intermediates to DHA, which demonstrates how both w-3 and cu-6 fatty
acids may be produced from a common source. The pathway is naturally
divided into two portions where one portion will generate w-3 fatty acids
and the other portion, only cu-6 fatty acids. That portion that only generates
~a-3 fatty acids will be referred to herein as the w-3 fatty acid biosynthetic
pathway whereas that portion that generates only cu-6 fatty acids will be
referred to herein as the cu-6 fatty acid biosynthetic pathway.
The term "functional" as used herein in context with the w-3/~-6
fatty acid biosynthetic pathway means that some or all of the genes in the
pathway express active enzymes. It should be understood that "c~-3/w-6
fatty acid biosynthetic pathway" or "functional ~-3/c~-6 fatty acid
biosynthetic pathway" does not imply that all the genes listed in this
paragraph are required as a number of fatty acid products will only require
the expression of a subset of the genes of this pathway.
The term "desaturase" refers to a polypeptide that can desaturate,
i.e., introduce a double bond, in one or more fatty acids to produce a
mono- or polyunsaturated fatty acid. Despite use of the omega-reference
14



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
system throughout the specification in reference to specific fatty acids, it
is
more convenient to indicate the activity of a desaturase by counting from
the carboxyl end of the substrate using the delta-system. Of particular
interest herein are 012 desaturases that desaturate a fatty acid between
the 12t" and 13t" carbon atoms numbered .from the carboxyl-terminal end
of the molecule and that catalyze the conversion of oleic acid to LA. Other
desaturases relevant to the present disclosure include: X15 desaturases
that catalyze the conversion of LA to ALA; X17 desaturases that
desaturate a fatty acid between the 17th and 18th carbon atom numbered
from the carboxyl-terminal end of the molecule and which, for example,
catalyze the conversion of ARA to EPA and/or DGLA to ETA; ~6
desaturases that catalyze the conversion of LA to GLA and/or ALA to STA;
~5 desaturases that catalyze the conversion of DGLA to ARA and/or ETA
to EPA; 04 desaturases that catalyze the conversion of DPA to DHA; and
09 desaturases that catalyze the conversion of palmitate to palmitoleic
acid (16:1 ) and/or stearate to oleic acid (18:1 ).
The term "elongase" refers to a polypeptide that can elongate a
.fatty acid carbon chain to produce an acid that is 2 carbons longer than
the fatty acid substrate that the elongase acts upon. This process of
elongation occurs in a multi-step mechanism in association with fatty acid
synthase, whereby CoA is the acyl carrier (Lassner et al., The Plant Cell
8:281-292 (1996)). Briefly, malonyl-CoA is condensed with a long-chain
acyl-CoA to yield C02 and a [i-ketoacyl-CoA (where the acyl moiety has
been elongated by two carbon atoms). Subsequent reactions include
reduction to [i-hydroxyacyl-CoA, dehydration to an enoyl-CoA and a
second reduction to yield the elongated acyl-CoA. Examples of reactions
catalyzed by elongases are the conversion of GLA to DGLA, STA to ETA
and EPA to DPA. Accordingly, elongases can have different specificities
(e.g., a C~6,~$ elongase will prefer a C~6 substrate, a C~8,2o elongase will
prefer a C~$ substrate and a C2o,22 elongase will prefer a C2o substrate).
The terms "conversion efficiency" and "percent substrate
conversion" refer to the efficiency by which a particular enzyme (e.g., a
desaturase or elongase) can convert substrate to product. The conversion
efficiency is measured according to the following formula:
([product]/[substrate+product])*100, where 'product' includes the
immediate product and all products in the pathway derived from it.



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
The term "oleaginous" refers to those organisms that tend to store
their energy source in the form of lipid (Vlleete, In: Fungal Lipid
Biochemistry, 2nd ed., Plenum, 1980). Generally, the cellular oil or
triacylglycerol content of oleaginous microorganisms follows a sigmoid
curve, wherein the concentration of lipid increases until it reaches a
maximum at the late logarithmic or early stationary growth phase and then
gradually decreases during the late stationary and death phases
(Yongmanitchai and Ward, Appl. Environ. Microbiol. 57:419-25 (1991 )).
The term "oleaginous yeast" refers to those microorganisms
classified as yeasts that can accumulate at least 25% of their dry cell
weight as oil. Examples of oleaginous yeast include, but are no means
limited to, the following genera: Yarrovvia, Candida, Rhodotorula,
Rhodosporidium, Crypfococcus, Trichosporon and Lipomyces.
'The term "fermentable carbon substrate" means a carbon source
that a microorganism will metabolize to derive energy. Typical carbon
substrates of the invention include, but are not limited to:
monosaccharides, oligosaccharides, polysaccharides; alkanes, fatty acids,
esters of fatty acids, monoglycerides, carbon dioxide, methanol,
formaldehyde, formate and carbon-containing amines.
The term "codon-optimized" as it refers to genes or coding regions
of nucleic acid molecules for transformation of various hosts, refers to the
alteration of codons in the gene or coding regions of the nucleic acid
molecules to reflect the typical codon usage of the host organism without
altering the polypeptide encoded by the DNA.
As used herein, an "isolated nucleic acid fragment" is a polymer of
RNA or DNA that is single- or double-stranded, optionally containing
synthetic, non-natural or altered nucleotide bases. An isolated nucleic
acid fragment in the form of a polymer of DNA may be comprised of one or
more segments of cDNA, genomic DNA or synthetic DNA.
A nucleic acid molecule is "hybridizable" to another nucleic acid
molecule, such as a cDNA, genomic DNA, or RNA molecule, when a
single-stranded form of the nucleic acid molecule can anneal to the other
nucleic acid molecule under the appropriate conditions of temperature and
solution ionic strength. Hybridization and washing conditions are well
known and exemplified in Sambrook, J., Fritsch, E. F. and Maniatis, T.
Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor
Laboratory: Cold Spring Harbor, NY (1989), particularly Chapter 11 and
16



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Table 11.1 therein (entirely incorporated herein by reference). The
conditions of temperature and ionic strength determine the "stringency" of
the hybridization. Stringency conditions can be adjusted to screen for
moderately similar fragments (such as homologous sequences from
distantly related organisms), to highly similar fragments (such as genes
that duplicate functional enzymes from closely related organisms).
Post-hybridization washes determine stringency conditions. One set of
preferred conditions uses a series of washes starting with 6X SSC, 0.5%
SDS at room temperature for 15 min, then repeated with 2X SSC, 0.5%
SDS at 45 °C for 30 min, and then repeated twice with 0.2X SSC,
0.5%
SDS. at 50 °C for 30 min. A more preferred set of stringent
conditions
uses higher temperatures in which the washes are identical to those above
except for the temperature of the final two 30 min washes in 0.2X SSC,
0.5% SDS was increased to 60 °C. Another preferred set of highly
stringent conditions uses two final washes in 0.1X SSC, 0.1 % SDS at 65
°C. An additional set of stringent conditions include hybridization at
0.1X
SSC, 0.1 % SDS, 65 °C and washed with 2X SSC, 0.1 % SDS followed
by
0.1X SSC, 0.1 % SDS, for example.
Hybridization requires that the two nucleic acids contain
complementary sequences, although depending on the stringency of the
hybridization, mismatches between bases are possible. The appropriate
stringency for hybridizing nucleic acids depends on the length of the
nucleic acids and the degree of complementation, variables well known in
the art. The greater the degree of similarity or homology between
two nucleotide sequences, the greater the value of Tm for hybrids of
nucleic acids having those sequences. The relative stability
(corresponding to higher Tm) of nucleic acid hybridizations decreases in
the following order: RNA:RNA, DNA:RNA, DNA:DNA. For hybrids of
greater than 100 nucleotides in length, equations for calculating Tm have
been derived (see Sambrook et al., supra, 9.50-9.51 ). For hybridizations
with shorter nucleic acids, i.e., oligonucleotides, the position of
mismatches becomes more important, and the length of the
oligonucleotide determines its specificity (see Sambrook et al., supra,
11.7-11.8). In one embodiment the length for a hybridizable nucleic acid is
at least about 10 nucleotides. Preferably a minimum length for a
hybridizable nucleic acid is at least about 15 nucleotides; more preferably
at least about 20 nucleotides; and most preferably the length is at least
17



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
about 30 nucleotides. Furthermore, the skilled artisan will recognize that
the temperature and wash solution salt concentration may be adjusted as
necessary according to factors such as length of the probe.
A "substantial portion" of an amino acid or nucleotide sequence is
that portion comprising enough of the amino acid sequence of a
polypeptide or the nucleotide sequence of a gene to putatively identify that
polypeptide or gene, either by manual evaluation of the sequence by one
skilled in the art, or by computer-automated sequence comparison and
identification using algorithms such as BLAST (Basic Local Alignment
Search Tool; Altschul, S. F., et al., J. Mol. Biol. 215:403-410 (1993). In
general, a sequence of ten or more contiguous amino acids or thirty or
more nucleotides is necessary in order to putatively identify a polypeptide
or nucleic acid sequence as homologous to a known protein or gene.
Moreover, with respect to nucleotide sequences, gene specific
oligonucleotide probes comprising 20-30 contiguous nucleotides may be
used in sequence-dependent methods of gene identification (e.g.,
Southern hybridization) and isolation (e.g., in situ hybridization of
bacterial
colonies or bacteriophage plaques). In addition, short oligonucleotides of
12-15 bases may be used as amplification primers in PCR in order to
obtain a particular nucleic acid fragment comprising the primers.
Accordingly, a "substantial portion" of a nucleotide sequence comprises
enough of the sequence to specifically identify and/or isolate a nucleic acid
fragment comprising the sequence. The instant specification teaches the
complete amino acid and nucleotide sequence encoding a particular yeast
protein. The skilled artisan, having the benefit of the sequences as
reported herein, may now use all or a substantial portion of the disclosed
sequences for purposes known to those skilled in this art. Accordingly, the
instant invention comprises the complete sequences as reported in the
accompanying Sequence Listing, as well as substantial portions of those
sequences as defined above.
The term "complementary" is used to describe the relationship
between nucleotide bases that are capable of hybridizing to one another.
For example, with respect to DNA, adenosine is complementary to
thymine and cytosine is complementary to guanine. Accordingly, the
instant invention also includes isolated nucleic acid fragments that are
complementary to the complete sequences as reported in the
18



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
accompanying Sequence Listing, as well as those substantially similar
nucleic acid sequences.
The term "percent identity", as known in the art, is a relationship
between two or more polypeptide sequences or two or more
polynucleotide sequences, as determined by comparing the sequences.
In the art, "identity" also means the degree of sequence relatedness
between polypeptide or polynucleotide sequences, as the case may be, as
determined by the match between strings of such sequences. "Identity"
and "similarity" can be readily calculated by known methods, including but
not limited to those described in: 1.) Computational Molecular Bioloay
(Lesk, A. M., Ed.) Oxford University: NY (1988); 2.) Biocomputing,_
Informatics and Genome Projects (Smith, D. W., Ed.) Academic: NY
(1993); 3.) Computer Analysis of Sequence Data, Part I (Griffin, A. M., and
Griffin, H. G., Eds.) Humans: NJ (1994); 4.) Seauence Analysis in
Molecular Bioloay (yon Heinje, G., Ed.) Academic (1987); and
5.) Seauence Analysis Primer (Gribskov, M. and Devereux, J., Eds.)
Stockton: NY (1991 ). Preferred methods to determine identity are
designed to give the best match between the sequences tested. Methods
to determine identity and similarity are codified in publicly available
computer programs. Sequence alignments and percent identity
calculations may be performed using the Megalign program of the
LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison,
WI). Multiple alignment of the sequences is performed using the Clustal
method of alignment (Higgins and Sharp, CA810S. 5:151-153 (1989)) with
default parameters (GAP,PENALTY=10, GAP LENGTH PENALTY=10).
Default parameters for pairwise alignments using the Clustal method are:
KTUPLE 1, GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5.
Suitable nucleic acid fragments (isolated polynucleotides of the
present invention) encode polypeptides that are at least about 70%
identical, preferably at least about 75% identical, and more preferably at
least about 80% identical to the amino acid sequence reported herein.
Preferred nucleic acid fragments encode amino acid sequences that are
about 85% identical to the amino acid sequence reported herein. More
preferred nucleic acid fragments encode amino acid sequences that are at
least about 90% identical to the amino acid sequence reported herein.
Most preferred are nucleic acid fragments that encode amino acid
sequences that are at least about 95% identical to the amino acid
19



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
sequence reported herein. Suitable nucleic acid fragments not only have
the above homologies but typically encode a polypeptide having at least
50 amino acids, preferably at least 100 amino acids, more preferably at
least 150 amino acids, still more preferably at least 200 amino acids, and
most preferably at least 250 amino acids.
"Codon degeneracy" refers to the nature in the genetic code
permitting variation of the nucleotide sequence without affecting the amino
acid sequence of an encoded polypeptide. The skilled artisan is well
aware of the "codon-bias" exhibited by a specific host cell in usage of
nucleotide codons to specify a given amino acid. Therefore, when
synthesizing a gene for improved expression in a host cell, it is desirable
to design the gene such that its frequency of codon usage approaches the
frequency of preferred codon usage of the host cell.
"Chemically synthesized", as related to a sequence of DNA, means
that the component nucleotides were assembled in vitro. Manual chemical
synthesis of DNA may be accomplished using well-established
procedures; or automated chemical synthesis can be performed using one
of a number of commercially available machines. "Synthetic genes" can
be assembled from oligonucleotide building blocks that are chemically
synthesized using procedures known to those skilled in the art. These
building blocks are ligated and annealed to form gene segments that are
then enzymatically assembled to construct the entire gene. Accordingly,
the genes can be tailored for optimal gene expression based on
optimization of nucleotide sequence to reflect the codon bias of the host
cell. The skilled artisan appreciates the likelihood of successful gene
expression if codon usage is biased towards those codons favored by the
host. Determination of preferred codons can be based on a survey of
genes derived from the host cell, where sequence information is available.
"Gene" refers to a nucleic acid fragment that expresses a specific
protein, including regulatory sequences preceding (5' non-coding
sequences) and following (3' non-coding sequences) the coding
sequence. "Native gene" refers to a gene as found in nature with its own
regulatory sequences. "Chimeric gene" refers to any gene that is not a
native gene, comprising regulatory and coding sequences that are not
found together in nature. Accordingly, a chimeric gene may comprise
regulatory sequences and coding sequences that are derived from
different sources, or regulatory sequences and coding sequences derived



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
from the same source, but arranged in a manner different than that found
in nature. "Endogenous gene" refers to a native gene in its natural
location in the genome of an organism. A "foreign" gene refers to a gene
not normally found in the host organism, but that is introduced into the
host organism by gene transfer. Foreign genes can comprise native
genes inserted into a non-native organism, or chimeric genes. A
"transgene" is a gene that has been introduced into the genome by a
transformation procedure. A "codon-optimized gene" is a gene having its
frequency of codon usage designed to mimic the frequency of preferred
codon usage of the host cell.
"Coding sequence" refers to a DNA sequence that codes for a
specific amino acid sequence. "Suitable regulatory sequences" refer to
nucleotide sequences located upstream (5' non-coding sequences), within
or downstream (3' non-coding sequences) of a coding sequence, and
which influence the transcription, RNA processing or stability, or
translation of the associated coding sequence. Regulatory 'sequences
may include promoters, translation leader sequences, introns,
polyadenylation recognition sequences, RNA processing sites, effector
binding sites and stem-loop structures.
"Promoter" refers to a DNA sequence capable of controlling the
expression of a coding sequence or functional RNA. In general, a coding
sequence is located 3' to a promoter sequence. Promoters may be
derived in their entirety from a native gene, or be composed of different
elements derived from different promoters found in nature, or even
comprise synthetic DNA segments. It is understood by those skilled in the
art that different promoters may direct the expression of a gene in different
tissues or cell types, or at different stages of development, or in response
to different environmental or physiological conditions. Promoters that
cause a gene to be expressed in most cell types at most times are
commonly referred to as "constitutive promoters". It is further recognized
that since in most cases the exact boundaries of regulatory sequences
have not been completely defined, DNA fragments of different lengths may
have identical promoter activity.
The term "3' non-coding sequences" or "transcription terminator"
refers to DNA sequences located downstream of a coding sequence. This
includes polyadenylation recognition sequences and other sequences
encoding regulatory signals capable of affecting mRNA processing or
21



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
gene expression. The polyadenylation signal is usually characterized by
affecting the addition of polyadenylic acid tracts to the 3' end of the mRNA
precursor. The 3' region can influence the transcription, RNA processing
or stability, or translation of the associated coding sequence.
"RNA transcript" refers to the product resulting from RNA
polymerase-catalyzed transcription of a DNA sequence. When the RNA
transcript is a perfect complementary copy of the DNA sequence, it is
referred to as the primary transcript or it may be a RNA sequence derived
from post-transcriptional processing of the primary transcript and is
referred to as the mature RNA. "Messenger RNA" or "mRNA" refers to the
RNA that is without introns and that can be translated into protein by the
cell. "cDNA" refers to a double-stranded DNA that is complementary to,
and derived from, mRNA. "Sense" RNA refers to RNA transcript that
includes the mRNA and so can be translated into protein by the cell.
"Antisense RNA" refers to a RNA transcript that is complementary to all or
part of a target primary transcript or mRNA and that blocks the expression
of a target gene (U.S. 5,107,065; WO 99/28508). The complementarity of
an antisense RNA may be with any part of the specific gene transcript, i.e.,
at the 5' non-coding sequence, 3' non-coding sequence, or the coding
sequence. "Functional RNA" refers to antisense RNA, ribozyme RNA, or
other RNA that is not translated and yet has an effect on cellular
processes.
The term "operably linked" refers to the association of nucleic acid
sequences on a single nucleic acid fragment so that the function of one is
affected by the other. For example, a promoter is operably linked with a
coding sequence when it is capable of affecting the expression of that
coding sequence (i.e., the coding sequence is under the transcriptional
control of the promoter). Coding sequences can be operably linked to
regulatory sequences in sense or antisense orientation.
The term "expression", as used herein, refers to the transcription
and stable accumulation of sense (mRNA) or antisense RNA derived from
the nucleic acid fragments) of the invention. Expression may also refer to
translation of mRNA into a polypeptide.
"Mature" protein refers to a post-translationally processed
polypeptide; i.e., one from which any pre- or propeptides present in the
primary translation product have been removed. "Precursor" protein refers
to the primary product of translation of mRNA; i.e., with pre- and
22



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
propeptides still present. Pre- and propeptides may be (but are not limited
to) intracellular localization signals.
"Transformation" refers to the transfer of a nucleic acid molecule
into a host organism, resulting in genetically stable inheritance. The
nucleic acid molecule may be a plasmid that replicates autonomously, for
example; or, it may integrate into the genome of the host organism. Host
organisms containing the transformed nucleic acid fragments are referred
to as "transgenic" or "recombinant" or "transformed" organisms.
The terms "plasmid", "vector" and "cassette" refer to an extra
chromosomal element often carrying genes that are not part of the central
metabolism of the cell, and usually in the form of circular double-stranded
DNA fragments. Such elements may be autonomously replicating
sequences, genome integrating sequences, phage or nucleotide
sequences, linear or circular, of a single- or double-stranded DNA or RNA,
derived from any source, in which a number of nucleotide sequences have
been joined or recombined into a unique construction which is capable of
introducing a promoter fragment and DNA sequence for a selected gene
product along with appropriate 3' untranslated sequence into a cell.
"Transformation cassette" refers to a specific vector containing a foreign
gene and having elements in addition to the foreign gene that facilitate
transformation of a particular host cell. "Expression cassette" refers to a
specific vector containing a foreign gene and having elements in addition
to the foreign gene that allow for enhanced expression of that gene in a
foreign host.
The term "altered biological activity" will refer to an activity,
associated with a protein encoded by a nucleotide sequence which can be
measured by an assay method, where that activity is either greater than or
less than the activity associated with the native sequence. "Enhanced
biological activity" refers to an altered activity that is greater than that
associated with the native sequence. "Diminished biological activity" is an
altered activity that is less than that associated with the native sequence.
The term "homologous recombination" refers to the exchange of
DNA fragments between two DNA molecules (during cross over). The
fragments that are exchanged are flanked by sites of identical nucleotide
sequences between the two DNA molecules (i.e., "regions of homology").
The term "regions of homology" refer to stretches of nucleotide sequence
on nucleic acid fragments that participate in homologous recombination
23



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
that have homology to each other. Effective homologous recombination
will take place where these regions of homology are at least about 10 by
in length where at least about 50 by in length is preferred. Typically
fragments that are intended for recombination contain at least two regions
of homology where targeted gene disruption or replacement is desired.
The term "sequence analysis software" refers to any computer
algorithm or software program that is useful for the analysis of nucleotide
or amino acid sequences. "Sequence analysis software" may be
commercially available or independently developed. Typical sequence
analysis software will include, but is not limited to: 1.) the GCG suite of
programs (Wisconsin Package Version 9.0, Genetics Computer Group
(GCG), Madison, WI); 2.) BLASTP, BLASTN, BLASTX (Altschul et al.,
J. Mol. Biol. 215:403-410 (1990)); 3.) DNASTAR (DNASTAR, Inc.
Madison, WI); 4.) Sequencher (Gene Codes Corporation, Ann Arbor,
MI); and 5.) the FASTA program incorporating the Smith-Waterman
algorithm (W. R. Pearson, Comput. Methods Genome Res., [Proc. Int.
Symp.] (1994), Meeting Date 1992, 111-20. Editor(s): Suhai, Sandor.
Plenum: New York, NY). Within the context of this application it will be
understood that where sequence analysis software is used for analysis,
that the results of the analysis will be based on the "default values" of the
program referenced, unless otherwise specified. As used herein "default
values" will mean any set of values or parameters that originally load with
the software when first initialized.
Standard recombinant DNA and molecular cloning techniques used
herein are well known in the art and are described by Sambrook, J.,
Fritsch, E. F. and Maniatis, T., Molecular Cloninq: A Laboratory Manual,
2nd ed., Cold Spring Harbor Laboratory: Cold Spring Harbor, NY (1989)
(hereinafter "Maniatis"); by Silhavy, T. J., Bennan, M. L. and Enquist, L.
W., Experiments with Gene Fusions, Cold Spring Harbor Laboratory: Cold
Spring Harbor, NY (1984); and by Ausubel, F. M. et al., Current Protocols
in Molecular Bioloay, published by Greene Publishing Assoc. and
Wiley-Interscience (1987).
Microbial Biosynthesis Of Fatty Acids
In general, lipid accumulation in oleaginous microorganisms is
triggered in response to the overall carbon to nitrogen ratio ,present in the
growth medium (Figure 1 ). When cells have exhausted available nitrogen
supplies (e.g., when the carbon to nitrogen ratio is greater than about 40),
24



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
the depletion of cellular adenosine monophosphate (AMP) leads to the
cessation of AMP-dependent isocitrate dehydrogenase activity in the
mitochondria and the accumulation of citrate, transport of citrate into the
cytosol and subsequent cleavage of the citrate by ATP-citrate lyase to
yield acetyl-CoA. Acetyl-CoA is the principle building block for de novo
biosynthesis of fatty acids. Although any compound that can effectively be
metabolized to produce acetyl-CoA can serve as a precursor of fatty acids,
glucose is the primary source of carbon in this type of reaction (Figure 1 ).
Glucose is converted to pyruvate via glycolysis and pyruvate is then
transported into the mitochondria where it can be converted to acetyl-CoA
by pyruvate dehydrogenase ("PD"). Since acetyl-CoA can not be
transported directly across the mitochondria) membrane into the
cytoplasm, the two carbons from acetyl-CoA condense with oxaloacetate
to yield citrate (catalyzed by citrate synthase). Citrate is transported
directly into the cytoplasm, where it is cleaved by ATP-citrate lyase to
regenerate acetyl-CoA and oxaloacetate. The oxaloacetate reenters the
tricarboxylic acid cycle, via conversion to malate.
The synthesis of malonyl-CoA is the first committed step of fatty
acid biosynthesis, which takes place in the cytoplasm. Malonyl-CoA is
produced via carboxylation of acetyl-CoA by acetyl-CoA carboxylase
("ACC"). Fatty acid synthesis is catalyzed by a multi-enzyme fatty acid
synthase complex ("FAS") and occurs by the condensation of eight two-
carbon fragments (acetyl groups from acetyl-CoA) to form a 16-carbon
saturated fatty acid, palmitate. More specifically, FAS catalyzes a series
of 7 reactions, which involve the following (Smith, S. FASEB J, 8(15):1248-
59 (1994)):
1. Acetyl-CoA and malonyl-CoA are transferred to the acyl carrier
peptide (ACP) of FAS. The acetyl group is then transferred to the
malonyl group, forming ~3-ketobutyryl-ACP and releasing C02.'
2. The ~i-ketobutyryl-ACP undergoes reduction (via (i-ketoacyl
reductase) and dehydration (via ~i-hydroxyacyl dehydratase) to form
a trans-monounsaturated fatty acyl group.
3. The double bond is reduced by NADPH, yielding a saturated fatty
acyl group two carbons longer than the initial one. The butyryl
group's ability to condense with a new malonyl group and repeat
the elongation process is then regenerated.



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
4. When the fatty acyl group becomes 16 carbons long, a thioesterase
activity hydrolyses it, releasing free palmitate.
Palmitate (16:0) is the precursor of longer chain saturated and
unsaturated fatty acids (e.g., stearic (18:0), palmitoleic (16:1 ) and oleic
(18:1 ) acids) through the action of elongases and desaturases present in
the endoplasmic reticulum membrane. Palmitate and stearate are
converted to their unsaturated derivatives, palmitoleic (16:1 ) and oleic
(18:1 ) acids, respectively, by the action of a 09 desaturase.
Triacylglycerols (the primary storage unit for fatty acids) are formed
by the esterification of two molecules of acyl-CoA to glycerol-3-phosphate
to yield 1,2-diacylglycerol phosphate (commonly identified as phosphatidic
acid) (Figure 1 ). The phosphate is then removed, by phosphatidic acid
phosphatase, to yield 1,2-diacylglycerol. Triacylglycerol is formed upon
the addition of a third fatty acid, for example, by the action of a
diacylglycerol-acyl transferase.
_Biosynthesis Of Omeaa Fatty Acids
Simplistically, the metabolic process that converts LA to GLA,
DGLA and ARA (the w-6 pathway) and ALA to STA, ETA, EPA, DPA and
DHA (the cu-3 pathway) involves elongation of the carbon chain through
the addition of two-carbon units and desaturation of the molecule through
the addition of double bonds (Figure 2). This requires a series of special
desaturation and elongation enzymes present in the endoplasmic
reticulum membrane.
cu-6 Fatty
Oleic acid is converted to LA (18:2), the first of the w-6 fatty acids,
by the action of a 012 desaturase. Subsequent w-6 fatty acids are
produced as follows: 1.) LA is converted to GLA by the activity of a D6
desaturase; 2.) GLA is converted to DGLA by the action of an elongase;
and 3.) DGLA is converted to ARA by the action of a ~5 desaturase.
~-3 Fatty
Linoleic acid (LA) is converted to ALA, the first of the ~-3 fatty
acids, by the action of a X15 desaturase. Subsequent w-3 fatty acids are
produced in a series of steps similar to that for the cu-6 fatty acids.
Specifically: 1.) ALA is converted to STA by the activity of a D6
desaturase; 2.) STA is converted to ETA by the activity of an elongase;
and 3.) ETA is converted to EPA by the activity of a 05 desaturase.
Alternatively, ETA and EPA can be produced from DGLA and ARA,
26



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
respectively, by the activity of a 017 desaturase. EPA can be further
converted to DHA by the activity of an elongase and a ~4 desaturase.
Genes Involved In Omeaa Fatty Acid Production
Many microorganisms, including algae, bacteria, molds and yeasts,
can synthesize PUFAs and omega fatty acids in the ordinary course of
cellular metabolism. Particularly well-studied are fungi including
Schizochytrium aggregatm, species of the genus Thraustochytrium and
Morteriella alpina. Additionally, many dinoflagellates (Dinophyceaae)
naturally produce high concentrations of PUFAs. As such, a variety of
genes involved in oil production have been identified through genetic
means and the DNA sequences of some of these genes are publicly
available (non-limiting examples are shown below in Table 2):
Table 2
Some Publicly Available Genes Involved In PUFA Production
Genbank Description
Accession
No.


AY131238 Argania spinosa ~6 desaturase


Y055118 Echium pitardii var. pitardii 06 desaturase


AY055117 Echium gentianoides 06 desaturase


AF296076 Mucor rouxii ~6 desaturase


AF007561 Borago officinalis ~6 desaturase


L11421 S nechoc stis s . 46 desaturase


NM 031344 Raitus norvegicus ~6 fatty acid desaturase_


AF465283, Mortierella alpina D6 fatty acid desaturase
AF465281,
AF110510


AF465282 Mortierella isabellina d6 fatt acid
desaturase


AF419296 Pythium irregulare d6 fatt acid desaturase


AB052086 Mucor circinelloides D6d mRNA for 06
fatt acid desaturase


AJ250735 Ceratodon purpureus mRNA for 06 fatt
acid desaturase


AF126799 Homo sapiens ~6 fatty acid desaturase


AF126798 Mus musculus O6 fatty acid desaturase


AF199596, Homo sapiens ~5 desaturase
AF226273


AF320509 Rattus norve icus liver ~5 desaturase


AB072976 Mus musculus D5D mRNA for O5 desaturase


AF489588 Thraustoch rium sp. ATCC21685 O5 fatt
acid desaturase


AJ510244 Phytophthora megasperma mRNA for D5
fatty acid
desaturase


AF419297 P hium irre ulare ~5 fatt acid desaturase


AF07879 Caenorhabditis elegans ~5 fatt acid
desaturase


27



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
AF067654 Mortierella alpina ~5 fatty acid desaturase


AB022097 Dictyostelium discoideum mRNA for o5
fatty acid
desaturase


AF489589.1 Thraustoch rium sp. ATCC21685 D4 fatty
acid desaturase


AX464731 Mortierella alpina elongase gene (also
WO 00/12720)


AAG36933 Emericella nidulans oleate X12 desaturase


AF110509 Mortierella al ina 012 fatt acid desaturase
mRNA


AB020033 Mortierella alpina mRNA for X12 fatty
acid desaturase


AAL13300 Mortierella alpina 012 fatty acid desaturase


AF417244 Mortierella alpina ATCC 16266 X12 fatty
acid desaturase
ene


AF161219 Mucor rouxii X12 desaturase mRNA


X86736 Spiruline platensis 012 desaturase


AF240777 Caenorhabditis elegans 012 desaturase


AB007640 Chlam domonas reinhardtii X12 desaturase


AB075526 Chlorella vulgaris 012 desaturase


AP002063 Arabido sis thaliana microsomal 412
desaturase


AY332747 Pavlova lutheri ~4 fatt acid desaturase
des1 mRNA


NP 441622, Synechocystis sp. PCC 6803 X15 desaturase
BAA18302,
BAA02924


AAL36934 Perilla frutescens X15 desaturase


AF338466 Acheta domesticus ~9 desaturase 3 mRNA


AF438199 Picea lauca desaturase 09 Des9. mRNA


E11368 Anabaena o9 desaturase


E11367 S nechoc stis O9 desaturase


D83185 Pichia an usta DNA for ~9 fatt acid
desaturase


U90417 Synechococcus vulcanus 09 acyl-lipid
fatty acid desaturase
desC ene


AF085500 Mortierella alpina ~9 desaturase mRNA


AY504633 Emericella nidulans D9 stearic acid
desaturase sdeB ene


NM_069854 Caenorhabditis elegans essential fatty
acid desaturase,
stearo I-CoA desaturase 39.1 kD fat-6
com lete mRNA


AF230693 Brassica oleracea cultivar Rapid Cycling
stearoyl-ACP
desaturase 49-BO-1 ene, exon se uence


AX464731 Mortierella alpina elongase ene (also
WO 02/08401)


NM 119617 Arabidopsis thaliana fatty acid elongase
1 (FAE1 )
At4 34520 m RNA


NM_134255 Mus musculus ELOVL family member 5,
elongation of long
chain fatt acids east Elovl5 , mRNA


NM 134383 Rattus norvegicus fatty acid elongase
2 (rEL02), mRNA


NM 134382 Rattus norvegicus fatty acid elon ase
1 (rEL01 ), mRNA


NM 068396, Caenorhabditis elegans fatty acid ELOngation
NM 068392, (elo-6), (elo-
N M_070713,5), (elo-2), (elo-3), and (elo-9) mRNA
N M_068746,
NM 064685


Additionally, the patent literature provides many additional DNA
sequences of genes (and/or details concerning several of the genes above
and their methods of isolation) involved in PUFA production. See, for
28



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
example: U.S. 5,968,809 (O6 desaturases); U.S. 5,972,664 and U.S.
6,075,183 (~5 desaturases); WO 91113972 and U.S. 5,057,419 (~9
desaturases); WO 93/11245 (~15 desaturases); U.S. 2003/0196217 A1
(~17 desaturases); WO 02/090493 (O4 desaturases); and WO 00/12720
and U.S. 2002/0139974A1 (elongases). Each of these patents and
applications are herein incorporated by reference in their entirety.
Of particular interest herein are 012 desaturases, and more
specifically, 012 desaturases that are suitable for expression in oleaginous
yeast (e.g., Yarrowia lipolytica). A variety of sequences encoding fungal
X12 fatty acid desaturases have been previously disclosed that could be
used for heterologous expression in oleaginous Yarrowia lipolytica (e.g.,
GenBank Accession No's AAG36933, AF110509, AAL13300, AF417244,
AF161219 (supra)). Additionally, for example, the X12 fatty acid
desaturases of Glycine max, Brassica napus, Arabidopsis thaliana,
Ricinus communis, Zea mays; Neurospora crassa and Botrytis cinerea are
disclosed in WO 94/11516, U.S. 5,443,974 and WO 03/099216.
Many factors affect the choice of a specific polypeptide having 012
desaturase activity that is to be expressed in a host cell for production of
PUFAs (optionally in combination with other desaturases and elongases).
Depending upon the host cell, the availability of substrate and the desired
end product(s), several polypeptides are of interest; however,
considerations for choosing a specific polypeptide having desaturase
activity include the substrate specificity of the polypeptide, whether the
polypeptide or a component thereof is a rate-limiting enzyme, whether the
desaturase is essential for synthesis of a desired polyunsaturated fatty
acid and/or co-factors required by the polypeptide. The expressed
polypeptide preferably has parameters compatible with the biochemical
environment of its location in the host cell. For example, the polypeptide
may have to compete for substrate with other enzymes in the host cell.
Analyses of the KM and specific activity of the polypeptide are therefore
considered in determining the suitability of a given polypeptide for
modifying PUFA production in a given host cell. The polypeptide used in a
particular host cell is one that can function under the biochemical
conditions present in the intended host cell, but otherwise can be any
polypeptide having 012 desaturase activity capable of modifying the
desired fatty acid (i.e., oleic acid). Thus, the sequences may be derived
from any source, e.g., isolated from a natural source (from bacteria, algae,
29



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
fungi, plants, animals, etc.), produced via a semi-synthetic route or
synthesized de novo.
Seguence Identification Of The Yarrovvia lipolytica 012 Desaturase
Despite public disclosure of a variety of sequences encoding fungal
X12 fatty acid desaturases (supra), expression of a native enzyme is
preferred over a heterologous (or "foreign") enzyme since: 1.) the native
enzyme is optimized for interaction with other enzymes and proteins within
the cell; and 2.) heterologous genes are unlikely to share the same codon
preference in the host organism. Additionally, advantages are incurred
when the sequence of the native gene is known, as it permits facile
disruption of the endogenous gene by targeted disruption.
Concerning disruption of a native 012 fatty acid desaturase gerie, it
may be useful for to engineer an oleaginous yeast that is not capable of
producing PUFAs in some embodiments. Commercial applications where
this lack of functionality would be desirable include the production of high
value cocoa butter substitutes, oxidatively stable oils and specialty fatty
acids derived from 18:1 (e.g., hydroxy- and epoxy-fatty acids).
Alternatively, oleaginous yeast lacking 012 fatty acid desaturase activity
could be utilized to produce "pure" ~-3 derivatives of ALA (e.g., STA, ETA,
EPA, DPA, DHA) by transforming the organism with the appropriate genes
(e.g., D6 desaturase, elongase, 05 desaturase, 04 desaturase) and
feeding the organism ALA as a substrate; cu-6 fatty acids would not be
synthesized under these conditions (see Figure 2).
Thus, the Applicants sought to isolate a X12 fatty acid desaturase
from Yarrowia lipolytica. Comparison of the 012 desaturase nucleotide
base and deduced amino acid sequences to public databases reveals that
the most similar known sequences are about 53% identical to the amino
acid sequence of X12 desaturase reported herein (SEQ ID N0:24) over a
length of 419 amino acids using a Clustal method of alignment (Thompson
et. al., Nucleic Acids Res. 22:4673-4680 (1994)). More preferred amino
acid fragments are at least about 70%-80% identical to the sequence
herein, where those sequences that are 85%-90% identical are particularly
suitable and those sequences that are about 95% identical are most
preferred. Similarly, preferred X12 desaturase encoding nucleic acid
sequences corresponding to the instant ORF are those encoding active
proteins and which are at least about 70%-80% identical to the nucleic
acid sequence of 012 desaturase reported herein, where those sequences



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
that are 85°l°-90% identical are particularly suitable and those
sequences
that are about 95% identical are most preferred.
_Isolation Of Homoloas
The o12 desaturase nucleic acid fragment of the instant invention
may be used to isolate genes encoding homologous proteins from the
same or other bacterial, algal, fungal or plant species. Isolation of
homologous genes using sequence-dependent protocols is well known in
the art. Examples of sequence-dependent protocols include, but are not
limited to: 1.) methods of nucleic acid hybridization; 2.) methods of DNA
and RNA amplification, as exemplified by various uses of nucleic acid
amplification technologies [e.g., polymerase chain reaction (PCR), Mullis
et al., U.S. Patent 4,683,202; ligase chain reaction (LCR), Tabor, S. et al.,
Proc. Acad. Sci. USA 82:1074 (1985); or strand displacement amplification
(SDA), Walker, et al., Proc. Natl. Acad. Sci. U.S.A., 89:392 (1992)]; and
3.) methods of library construction and screening by complementation.
For example, genes encoding similar proteins or polypeptides to the
desaturase described herein could be isolated directly by using all or a
portion of the instant nucleic acid fragments as DNA hybridization probes
to screen libraries from any desired yeast or fungus using methodology
well known to those skilled in the art (wherein those yeast or fungus
producing LA and/or LA-derivatives would be preferred). Specific
oligonucleotide probes based upon the instant nucleic acid sequences can
be designed) and synthesized by methods known in the art (Maniatis,
supra). Moreover, the entire sequences can be used directly to synthesize
DNA probes by methods known to the skilled artisan (e.g., random primers
DNA labeling, nick translation or end-labeling techniques) or RNA probes
using available in vitro transcription systems. In addition, specific primers
can be designed and used to amplify a part of (or full-length of) the instant
sequences. The resulting amplification products can be labeled directly
during amplification reactions or labeled after amplification reactions, and
used as probes to isolate full-length DNA fragments under conditions of
appropriate stringency.
Typically, in PCR-type amplification techniques, the primers have
different sequences and are not complementary to each other. Depending
on the desired test conditions, the sequences of the primers should be
designed to provide for both efficient and faithful replication of the target
nucleic acid. Methods~of PCR primer design are common and well known
31



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
in the art (Thein and Wallace, "The use of oligonucleotide as specific
hybridization probes in the Diagnosis of Genetic Disorders", in Human
Genetic Diseases: A Practical Approach, K. E. Davis Ed., (1986) pp 33-50,
IRL: Herndon, VA; and Rychlik, W., In Methods in Molecular Bioloay,
White, B. A. Ed., (1993) Vol. 15, pp 31-39, PCR Protocols: Current
Methods and Applications. Humania: Totowa, NJ).
Generally two short segments of the instant sequences may be
used in polymerase chain reaction protocols to amplify longer nucleic acid
fragments encoding homologous genes from DNA or RNA. The
polymerase chain reaction may also be performed on a library of cloned
nucleic acid fragments wherein the sequence of one primer is derived from
the instant nucleic acid fragments and the sequence of the other primer
takes advantage of the presence of the polyadenylic acid tracts to the
3' end of the mRNA precursor encoding microbial genes.
Alternatively, the second primer sequence may be based upon
sequences derived from the cloning vector. For example, the skilled
artisan can follow the RACE protocol (Frohman et al., PNAS USA 85:8998
(1988)) to generate cDNAs by using PCR to amplify copies of the region
between a single point in the transcript and the 3' or 5' end. Primers
oriented in the 3' and 5' directions can be designed from the instant
sequences. Using commercially available 3' RACE or 5' RACE systems
(BRL, Gaithersburg, MD), specific 3' or 5' cDNA fragments can be isolated
(Ohara et al., PNAS USA 86:5673 (1989); Loh et al., Science 243:217
(1989)).
In other embodiments, the instant desaturase sequences may be
employed as hybridization reagents for the identification of homologs. The
basic components of a nucleic acid hybridization test include a probe, a
sample suspected of containing the gene or gene fragment of interest and
a specific hybridization method. Probes of the present invention are
typically single-stranded nucleic acid sequences that are complementary
to the nucleic acid sequences to be detected. Probes are "hybridizable" to
the nucleic acid sequence to be detected. The probe length can vary from
5 bases to tens of thousands of bases, and will depend upon the specific
test to be done. Typically a probe length of about 15 bases to about
30 bases is suitable. Only part of the probe molecule need be
complementary to the nucleic acid sequence to be detected. In addition,
the complementarity between the probe and the target sequence need not
32



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
be perfect. Hybridization does occur between imperfectly complementary
molecules with the result that a certain fraction of the bases in the
hybridized region are not paired with the proper complementary base.
Hybridization methods are well defined. Typically the probe and
sample must be mixed under conditions that will permit nucleic acid
hybridization. This involves contacting the probe and sample in the
presence of an inorganic or organic salt under the proper concentration
and temperature conditions. The probe and sample nucleic acids must be
in contact for a long enough time that any possible hybridization between
the probe and sample nucleic acid may occur. The concentration of probe
or target in the mixture will determine the time necessary for hybridization
to occur. The higher the probe or target concentration, the shorter the
hybridization incubation time needed. Optionally, a chaotropic agent may
be added. The chaotropic agent stabilizes nucleic acids by inhibiting
nuclease activity. Furthermore, the chaotropic agent allows sensitive and
stringent hybridization of short oligonucleotide probes at room temperature
(Van Ness and Chen, Nucl. Acids Res. 19:5143-5151 (1991 )). Suitable
chaotropic agents include guanidinium chloride, guanidinium thiocyanate,
sodium thiocyanate, lithium tetrachloroacetate, sodium perchlorate,
rubidium tetrachloroacetate, potassium iodide and cesium trifluoroacetate,
among others. Typically, the chaotropic agent will be present at a final
concentration of about 3 M. If desired, one can add formamide to the
hybridization mixture, typically 30-50% (v/v~.
Various hybridization solutions can be employed. Typically, these
comprise from about 20 to 60% volume, preferably 30%, of a polar organic
solvent. A common hybridization solution employs about 30-50% v/v
formamide, about 0.15 to 1 M sodium chloride, about 0.05 to 0.1 M buffers
(e.g., sodium citrate, Tris-HCI, PIPES or HEPES (pH range about 6-9)),
about 0.05 to 0.2% detergent (e.g., sodium dodecylsulfate), or between
0.5-20 mM EDTA, FICOLL (Pharmacia Inc.) (about 300-500 kdal),
polyvinylpyrrolidone (about 250-500 kdal) and serum albumin. Also
included in the typical hybridization solution will be unlabeled carrier
nucleic acids from about 0.1 to 5 mg/mL, fragmented nucleic DNA (e.g.,
calf thymus or salmon sperm DNA, or yeast RNA), and optionally from
about 0.5 to 2% wt/vol glycine. Other additives may also be included,
such as volume exclusion agents that include a variety of polar water-
soluble or swellable agents (e.g., polyethylene glycol), anionic polymers
33



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
(e.g., polyacrylate or polymethylacrylate) and anionic saccharidic polymers
(e.g., dextran sulfate).
Nucleic acid hybridization is adaptable to a variety of assay formats.
One of the most suitable is the sandwich assay format. The sandwich
assay is particularly adaptable to hybridization under non-denaturing
conditions. A primary component of a sandwich-type assay is a solid
support. The solid support has adsorbed to it or covalently coupled to it
immobilized nucleic acid probe that is unlabeled and complementary to
one portion of the sequence.
Availability of the instant nucleotide and deduced amino acid
sequences facilitates immunological screening of DNA expression
libraries. Synthetic peptides representing portions of the instant amino
acid sequence may be synthesized. These peptides can be used to
immunize animals to produce polyclonal or monoclonal antibodies with
specificity for peptides or proteins comprising the amino acid sequences.
These antibodies can be then be used to screen DNA expression libraries
to isolate full-length DNA clones of interest (Lerner, R. A. Adv. Immunol.
36:1 (1934); Maniatis, supra).
Gene Optimization For Improved Heterologous Expression
A variety of techniques can be utilized to improve the expression of
the X12 desaturase in an alternate host. Two such techniques include
codon-optimization and mutagenesis of the gene.
Codon Optimization
In some embodiments, it may be desirable to modify a portion of the
codons encoding the 012 desaturase polypeptide, for example, to
enhance the expression of the gene encoding that polypeptide in an
alternate host (e.g., an oleaginous yeast other than Yarrowia lipolytica).
In general, host-preferred codons can be determined within a
particular host species of interest by examining codon usage in proteins
(preferably those proteins expressed in the largest amount) and
determining which codons are used with highest frequency. Then, the
coding sequence for the polypeptide of interest having desaturase activity
can be synthesized in whole or in part using the codons preferred in the
host species. All (or portions) of the DNA also can be synthesized to
remove any destabilizing sequences or regions of secondary structure that
would be present in the transcribed mRNA. All (or portions) of the DNA
34



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
also can be synthesized to alter the base composition to one more
preferable in the desired host cell.
Mutactenesis
Methods for synthesizing sequences and bringing sequences
together are well established in the literature. For example, in vitro
mutagenesis and selection, site-directed mutagenesis, error prone PCR
(Melnikov et al., Nucleic Acids Research, 27(4):1056-1062 (February 15,
1999)), "gene shuffling" (U.S. 5,605,793; U.S. 5,811,238; U.S. 5,830,721;
and U.S. 5,837,458) or other means can be employed to obtain mutations
of naturally occurring desaturase genes, such as the 012 desaturase
described herein. This would permit production of a polypeptide having
desaturase activity in vivo with more desirable physical and kinetic
parameters for function in the host cell (e.g., a longer half-life or a higher
rate of production of a desired PUFA).
If desired, the regions of a desaturase polypeptide important for
enzymatic activity can be determined through routine mutagenesis,
expression of the resulting mutant polypeptides and determination of their
activities. Mutants may include deletions, insertions and point mutations,
or combinations thereof. A typical functional analysis begins with deletion
mutagenesis to determine the N- and C-terminal limits of the protein
necessary for function, and then internal deletions, insertions or point
mutants are made to further determine regions necessary for function.
Other techniques such as cassette mutagenesis or total synthesis also can
be used. Deletion mutagenesis is accomplished, for example, by using
exonucleases to sequentially remove the 5' or 3' coding regions. Kits are
available for such techniques. After deletion, the coding region is
completed by ligating oligonucleotides containing start or'stop codons to
the deleted coding region after the 5' or 3' deletion, respectively.
Alternatively, oligonucleotides encoding start or stop codons are inserted
into the coding region by a variety of methods including site-directed
mutagenesis, mutagenic PCR or by ligation onto DNA digested at existing
restriction sites. Internal deletions can similarly be made through a variety
of methods including the use of existing restriction sites in the DNA, by use
of mutagenic primers via site-directed mutagenesis or mutagenic PCR.
Insertions are made through methods such as linker-scanning
mutagenesis, site-directed mutagenesis or mutagenic PCR. Point



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
mutations are made through techniques such as site-directed mutagenesis
or mutagenic PCR.
Chemical mutagenesis also can be used for identifying regions of a
desaturase polypeptide important for activity. A mutated construct is
expressed, and the ability of the resulting altered protein to function as a
desaturase is assayed. Such structure-function analysis can determine
which regions may be deleted, which regions tolerate insertions, and
which point mutations allow the mutant protein to function in substantially
the same way as the native desaturase. All such mutant proteins and
nucleotide sequences encoding them that are derived from the desaturase
described herein are within the scope of the present invention.
Thus, the present invention comprises the complete sequence of
the X12 desaturase as reported in the accompanying Sequence Listing,
the complement of that complete sequence, substantial portions of that
sequence, codon-optimized desaturases derived therefrom and those
sequences that are substantially homologous thereto.
_Microbial Production Of w-3 And/Or c~-6 Fatty Acids
'Microbial production of cu-3 and/or cu-6 fatty acids has several
advantages over purification from natural sources such as fish or plants.
For example: .
1.) Many microbes are known with greatly simplified oil compositions
compared with those of higher organisms, making purification of
desired components easier;
2.) Microbial production is not subject to fluctuations caused by
external variables, such as weather and food supply;
3.) Microbially produced oil is substantially free of contamination by
environmental pollutants;
4.) Microbes can provide PUFAs in particular forms which may have
specific uses; and
5.) Microbial oil production can be manipulated by controlling culture
conditions, notably by providing particular substrates for microbially
expressed enzymes, or by addition of compounds or genetic
engineering approaches to suppress undesired biochemical
pathways.
In addition to these advantages, production of cu-3 and/or cu-6 fatty acids
from recombinant microbes provides the ability to alter the naturally
occurring microbial fatty acid profile by providing new biosynthetic
36



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
pathways in the host or by suppressing undesired pathways, thereby
increasing levels of desired PUFAs (or conjugated forms thereof) and
decreasing levels of undesired PUFAs (see co-pending U.S. Provisional
Application 60/468677, herein incorporated entirely by reference).
Methods For Production Of Various cu-3 And/Or w-6 Fatty Acids
It is expected that introduction of chimeric genes encoding the 012
desaturase described herein, under the control of appropriate promoters
will result in increased production of LA. As such, the present invention
encompasses a method for the direct production of PUFAs comprising
exposing a fatty acid substrate (i.e., oleic acid) to the PUFA enzyme
described herein (i.e., the 012 desaturase), such that the substrate is
converted to the desired fatty acid product (i.e., LA).
Alternatively, the PUFA gene and its corresponding enzyme product
described herein can be used indirectly for the production of PUFAs.
Indirect production of PUFAs occurs wherein the fatty acid substrate is
converted indirectly into the desired fatty acid product, via means of an
intermediate steps) or pathway intermediate(s). Thus, it is contemplated
that the 012 desaturase described herein may be expressed in
conjunction with one or more genes that encode other enzymes, such that
a series of reactions occur to produce a desired product. In a preferred
embodiment, for example, a host organism may be co-transformed with a
vector comprising additional genes encoding enzymes of the PUFA
biosynthetic pathway to result in higher levels of production of w-3 and/or
cu-6 fatty acids (e.g., GLA, DGLA, ARA, ALA, STA, ETA, EPA, DPA and
DHA). Specifically, for example, it may be desirable to overexpress the
X12 desaturase described herein in host cells that are also expressing:
1.) a gene encoding a 46 desaturase for the overproduction of GLA; 2.) an
expression cassette comprising genes encoding a ~6 desaturase and a
high-affinity elongase for the overproduction of'DGLA; 3.) genes encoding
a ~6 desaturase, high-affinity elongase and 05~desaturase for the
overproduction of ARA; or 4.) genes encoding a ~6 desaturase, high-
affinity elongase, ~5 desaturase and X17 desaturase for the
overproduction of EPA. In alternate embodiments, it may be desirable to
overexpress the X12 desaturase as described herein in cells that are also
expressing: 1.) a gene encoding a d15 desaturase for the overproduction
of ALA; 2.) genes encoding a X15 desaturase and 06 desaturase for the
overproduction of STA; 3.) genes encoding a X15 desaturase, 06
37



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
desaturase and a high-affinity elongase for the overproduction of ETA; or
4.) genes encoding a 015 desaturase, ~6 desaturase, high-affinity
elongase and D5 desaturase for the overproduction of EPA. As is well
known to one skilled in the art, various other combinations of the following
enzymatic activities may be useful to express in a host in conjunction with
the desaturase herein: a 015 desaturase, a 04 desaturase, a ~5
desaturase, a 06 desaturase, a 417 desaturase, a 09 desaturase andlor
an elongase (see Figure 2). The particular genes included within a
particular expression cassette will depend on the host cell (and its PUFA
profile and/or desaturase profile), the availability of substrate and the
desired end product(s).
In alternate embodiments, it may be useful to disrupt a host
organism's native 012 desaturase, based on the complete sequences
described herein, the complement of those complete sequences,
substantial portions of those sequences, codon-optimized desaturases
derived therefrom and those sequences that are substantially homologous
thereto. For example, the targeted disruption of the X12 desaturase
described herein in Yarrovvia lipolytica produces a mutant strain that is
unable to synthesize LA. This mutant strain could be useful for:
1.) production of other specialty oils (e.g., high value cocoa butter
substitutes, oxidatively stable oils and fatty acids derived from 18:1 such
as hydroxy- and epoxy-fatty acids); or 2.) production of "pure" cu-3 fatty
acid derivatives of ALA, when the host cells are grown on e.g., ALA
(without co-synthesis of cu-6 fatty acids).
Expression Systems Cassettes And Vectors
The gene and gene product of the instant sequences described
herein may be produced in various microbial host cells, particularly in the
cells of oleaginous yeasts (e.g., Yarrowia lipolytica). Expression in
recombinant microbial hosts may be useful for the production of various
PUFA pathway intermediates, or for the modulation of PUFA pathways
already existing in the host for the synthesis of new products heretofore
not possible using the host.
Microbial expression systems and expression vectors containing
regulatory sequences that direct high level expression of foreign proteins
are well known to those skilled in the art. Any of these could be used to
construct chimeric genes for production of any of the gene products of the
instant sequences. These chimeric genes could then be introduced into
38



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
appropriate microorganisms via transformation to provide high-level
expression of the encoded enzymes.
Vectors or DNA cassettes useful for the transformation of suitable
host cells are well known in the art. The specific choice of sequences
present in the construct is dependent upon the desired expression
products (supra), the nature of the host cell and the proposed means of
separating transformed cells versus non-transformed cells. Typically,
however, the vector or cassette contains sequences directing transcription
and translation of the relevant gene(s), a selectable marker and
sequences allowing autonomous replication or chromosomal integration.
Suitable vectors comprise a region 5' of the gene that controls
transcriptional initiation and a region 3' of the DNA fragment that controls
transcriptional termination. It is most preferred when both control regions
are derived from genes from the transformed host cell, although it is to be
understood that such control regions need not be derived from the genes
native to the specific species chosen as a production host.
Initiation control regions or promoters which are useful to drive
expression of the instant ORF in the desired host cell are numerous and
familiar to those skilled in the art. Virtually any promoter capable of
directing expression of this gene in the selected host cell is suitable for
the
present invention. Expression in a host cell can be accomplished in a
transient or stable fashion. Transient expression can be accomplished by
inducing the activity of a regulatable promoter operably linked to the gene
of interest. Stable expression can be achieved by the use of a constitutive
promoter operably linked to the gene of interest. As an example, when the
host cell is yeast, transcriptional and translational regions functional in
yeast cells are provided, particularly from the host species. The
transcriptional initiation regulatory regions can be obtained, for example,
from: 1.) genes in the glycolytic pathway, such as alcohol dehydrogenase,
glyceraldehyde-3-phosphate-dehydrogenase (see U.S. Patent Application
Number 60/482263), phosphoglycerate mutase (see U.S. Patent
Application Number 60/482263), fructose-bisphosphate aldolase (see U.S.
Patent Application Number 60/519971 ), phosphoglucose-isomerase,
phosphoglycerate kinase, etc.; or, 2.) regulatable genes such as acid
phosphatase, lactase, metallothionein, glucoamylase, the translation
elongation factor EF1-a (TEF) protein (U.S. 6,265,185), ribosomal protein
S7 (U.S. 6,265,185), etc. Any one of a number of regulatory sequences
39



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
can be used, depending upon whether constitutive or induced transcription
is desired, the efficiency of the promoter in expressing the ORF of interest,
the ease of construction and the like.
Nucleotide sequences surrounding the translational initiation codon
'ATG' have been found to affect expression in yeast cells. If the instant
desaturase is poorly expressed in non-Yarrovvia lipolytica yeast, the
nucleotide sequences of exogenous genes can be modified to include an
efficient yeast translation initiation sequence to obtain optimal gene
expression. For expression in yeast, this can be done by site-directed
mutagenesis of an inefficiently expressed gene by fusing it in-frame to an
endogenous yeast gene, preferably a highly expressed gene.
Alternatively, one can determine the consensus translation initiation
sequence in the host and engineer this sequence into heterologous genes
for their optimal expression in the host of interest. ,
The termination region can be derived from the 3' region of the
gene from which the initiation region was obtained or from a different
gene. A large number of termination regions are known and function
satisfactorily in a variety of hosts (when utilized both in the same and
different genera and species from where they were derived). The
termination region usually is selected more as a matter of convenience
rather than because of any particular property. Preferably, the termination
region is derived from a yeast gene, particularly Saccharomyces,
Schizosaccharomyces, Candida, Yarrovvia or Kluyveromyces. The 3'-
regions of mammalian genes encoding y-interferon and a-2 interferon are
also known to function in yeast. Termination control regions may also be
derived from various genes native to the preferred hosts. Optionally, a
termination site may be unnecessary; however, it is most preferred if
included.
As one of skill in the art is aware, merely inserting a gene into a
cloning vector does not ensure that it will be successfully expressed at the
level needed. In response to the need for a high expression rate, many
specialized expression vectors have been created by manipulating a
number of different genetic elements that control aspects of transcription,
translation, protein stability, oxygen limitation, and secretion from the host
cell. More specifically, some of the molecular features that have been
manipulated to control gene expression include: 1.) the nature of the
relevant transcriptional promoter and terminator sequences; 2.) the



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
number of copies of the cloned gene and whether the gene is plasmid-
borne or integrated into the genome of the host cell; 3.) the final cellular
location of the synthesized foreign protein; 4.) the efficiency of translation
in the host organism; 5.) the intrinsic stability of the cloned gene protein
within the host cell; and 6.) the codon usage within the cloned gene, such
that its frequency approaches the frequency of preferred codon usage of
the host cell. Each of these types of modifications are encompassed in
the present invention, as means to further optimize expression of the X12
desaturase described herein.
Transformation Of Microbial Hosts
Once the DNA encoding a polypeptide suitable for expression in an
oleaginous yeast has been obtained, it is placed in a plasmid vector
capable of autonomous replication in a host cell, or it is directly integrated
into the genome of the host cell. Integration of expression cassettes can
occur randomly within the host genome or can be targeted through the use
of constructs containing regions of homology with the host genome
sufficient to target recombination within the host locus. Where constructs
are targeted to an endogenous locus, all or some of the transcriptional and
translational regulatory regions can be provided by the endogenous locus.
Where two or more genes are expressed from separate replicating
vectors, it is desirable that each vector has a different means of selection
and should lack homology to the other constructs) to maintain stable
expression and prevent reassortment of elements among constructs.
Judicious choice of regulatory regions, selection means and method of
propagation of the introduced constructs) can be experimentally
determined so that all introduced genes are expressed at the necessary
levels to provide for synthesis of the desired products.
Constructs comprising the gene of interest may be introduced into a
host cell by any standard technique. These techniques include
transformation (e.g., lithium acetate transformation [Methods in
Enzymology, 194:186-187 (1991 )]), protoplast fusion, biolistic impact,
electroporation, microinjection, or any other method that introduces the
gene of interest into the host cell. More specific teachings applicable for
oleaginous yeasts (i.e., Yarrowia lipolytica) include U.S. Patent Nos.
4,880,741 and 5,071,764 and Chen, D. C. et al. (Appl Microbiol
Biotechnol. 48(2):232-235 (1997)).
41



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
For convenience, a host cell that has been manipulated by any
method to take up a DNA sequence (e.g., an expression cassette) will be
referred to as "transformed" or "recombinant" herein. The transformed
host will have at least one copy of the expression construct and may have
two or more, depending upon whether the gene is integrated into the
genome, amplified, or is present on an extrachromosomal element having
multiple copy numbers. The transformed host cell can be identified by
selection for a marker contained on the introduced construct.
Alternatively, a separate marker construct may be co-transformed with the
desired construct, as many transformation techniques introduce many
DNA molecules into host cells. Typically, transformed hosts are selected
for their ability to grow on selective media. Selective media may
incorporate an antibiotic or lack a factor necessary for growth of the
untransformed host, such as a. nutrient or growth factor: An introduced
marker gene may confer antibiotic resistance, or encode an essential
growth factor or enzyme, thereby permitting growth on selective media
when expressed in the transformed host. Selection of a transformed host
can also occur when the expressed marker protein can be detected, either
directly or indirectly. The marker protein may be expressed alone or as a
fusion to another protein. The marker protein can be detected by: 1.) its
enzymatic activity (e.g., ~-galactosidase can convert the substrate X-gal
[5-bromo-4-chloro-3-indolyl-~3-D-galactopyranoside] to a colored product;
luciferase can convert luciferin to a light-emitting product); or 2.) its
light-
producing or modifying characteristics (e.g., the green fluorescent protein
of Aequorea victoria fluoresces when illuminated with blue light).
Alternatively, antibodies can be used to detect the marker protein or a
molecular tag on, for example, a protein of interest. Cells expressing the
marker protein or tag can be selected, for example, visually, or by
techniques such as FACS or panning using antibodies. For selection of
yeast transformants, any marker that functions in yeast may be used.
Desirably, resistance to kanamycin, hygromycin and the amino glycoside
6418 are of interest, as well as ability to grow on media lacking uracil or
leucine.
Following transformation, substrates suitable for the instant X12
desaturase (and optionally other PUFA enzymes that are co-expressed
within the host cell) may be produced by the host either naturally or
transgenically, or they may be provided exogenously.
42



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Metabolic% Engineering Of w-3 And/Or ~-6 Fatty Acid Biosynthesis In
Microbes
Knowledge of the sequence of the present 012 desaturase will be
useful for manipulating w-3 and/or cu-6 fatty acid biosynthesis in
oleaginous yeasts, and particularly, in Yarrowia lipolytica. This may
require metabolic engineering directly within the PUFA biosynthetic
pathway or additional manipulation of pathways that contribute carbon to
the PUFA biosynthetic pathway. Methods useful for manipulating
biochemical pathways are well known to those skilled in the art.
Technigues To Up-Regulate Desirable Biosynthetic Pathways
Additional copies of desaturase and elongase genes may be
introduced into the host to increase the output of the e~-3 and/or w-6 fatty
acid biosynthetic pathways, typically through the use of multicopy
plasmids. Expression of the desaturase or elongase genes also can be
increased at the transcriptional level through the use of a stronger
promoter (either regulated or constitutive) to cause increased expression,
by removing/deleting destabilizing sequences from either the mRNA or the
encoded protein, or by adding stabilizing sequences to the mRNA (U.S.
4,910,141 ). Yet another approach to increase expression of heterologous
desaturase or elongase genesis to increase the translational efficiency of
the encoded mRNAs by replacement of codons in the native gene with
those for optimal gene expression in the selected host microorganism.
Technigues To Down-Regulate Undesirable Biosynthetic Pathways
Conversely, biochemical pathways competing with the cu-3 and/or
w-6 fatty acid biosynthetic pathways for energy or carbon, or native PUFA
biosynthetic pathway enzymes that interfere with production of a particular
PUFA end-product, may be eliminated by gene disruption or down-
regulated by other means (e.g., antisense mRNA). For gene disruption, a
foreign DNA fragment (typically a selectable marker gene) is inserted into
the structural gene to be disrupted in order to interrupt its coding sequence
and thereby functionally inactivate the gene. Transformation of the
disruption cassette into the host cell results in replacement of the
functional native gene by homologous recombination with the non-
functional disrupted gene (see, for example: Hamilton et al. J. 8acteriol.
171:4617-4622 (1989); Balbas et al. Gene 136:211-213 (1993);
Gueldener et al. Nucleic Acids Res. 24:2519-2524 (1996); and Smith et al.
Methods Mol. Cell. Biol. 5:270-277(1996)).
43



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Antisense technology is another method of down-regulating genes
when the sequence of the target gene is known. To accomplish this, a
nucleic acid segment from the desired gene is cloned and operably linked
to a promoter such that the anti-sense strand of RNA will be transcribed.
This construct is then introduced into the host cell and the antisense strand
of RNA is produced. Antisense RNA inhibits gene expression by
preventing the accumulation of mRNA that encodes the protein of interest.
The person skilled in the art will know that special considerations are
associated with the use of antisense technologies in order to reduce
expression of particular genes. For example, the proper level of
expression of antisense genes may require the use of different chimeric
genes utilizing different regulatory elements known to the skilled artisan.
Although targeted gene disruption and antisense technology offer
effective means of down-regulating genes where the sequence is known,
other less specific methodologies have been developed that are not
sequence-based. For example, cells may be exposed to UV radiation and
then screened for the desired phenotype. Mutagenesis with chemical
agents is also effective for generating mutants and commonly used
substances include chemicals that affect nonreplicating DNA (e.g., HNO~
and NH20H), as well as agents that affect replicating DNA (e.g., acridine
dyes, notable for causing frameshift mutations). Specific methods for
creating mutants using radiation or chemical agents are well documented
in the art. See, for example: Thomas D. Brock in Biotechnology-A
Textbook of Industrial Microbiology, 2nd ed. (1989) Sinauer Associates:
Sunderland, MA; or Deshpande, Mukund V., Appl. Biochem. Biotechnol.,
36:227 (1992).
Another non-specific method of gene disruption is the use of
transposable elements or transposons. Transposons are genetic elements
that insert randomly into DNA but can be later retrieved on the basis of
sequence to determine where the insertion has occurred. Both in vivo and
in vitro transposition methods are known. Both methods involve the use of
a transposable element in combination with a transposase enzyme. When
the transposable element or transposon is contacted with a nucleic acid
fragment in the presence of the transposase, the transposable element will
randomly insert into the nucleic acid fragment. The technique is useful for
random mutagenesis and for gene isolation, since the disrupted gene may
be identified on the basis of the sequence of the transposable element.
44



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Kits for in vitro transposition are commercially available [see, for example:
1.) The Primer Island Transposition Kit, available from Perkin Elmer
Applied Biosystems, Branchburg, NJ, based upon the yeast Ty1 element;
2.) The Genome Priming System, available from New England Biolabs,
Beverly, MA, based upon the bacterial transposon Tn7; and 3.) the EZ::TN
Transposon Insertion Systems, available from Epicentre Technologies,
Madison, WI, based upon the Tn5 bacterial transposable element].
Within the context of the present invention, it may be useful to
modulate the expression of the fatty acid biosynthetic pathway by any one
of the methods described above. For example, the present invention
provides a gene (i.e., a 012 desaturase) encoding a key enzyme in the
biosynthetic pathway leading to the production of w-3 and/or w-6 fatty
acids. It will be particularly useful to express this gene in oleaginous
yeasts that produce insufficient amounts of 18:2 fatty acids and to
modulate the expression of this and other PUFA biosynthetic genes to
maximize production of preferred PUFA products using various means for
metabolic engineering of the host organism. Likewise, to maximize PUFA
production with this gene, it may be necessary to disrupt pathways that
compete for the carbon flux directed toward PUFA biosynthesis. In
alternate embodiments, it may be desirable to disrupt the 012 desaturase
herein, to promote synthesis of w-3 fatty acids while simultaneously
preventing co-synthesis of cu-6 fatty acids. In another alternate
embodiment it will be possible to regulate the production of e~-3/ e~-6 fatty
acids by placing the present 012 desaturase gene under the control of
inducible or regulated promoters.
Preferred Microbial Hosts For Recombinant Expression Of X12
Desaturase
Host cells for expression of the instant gene and nucleic acid
fragments may include microbial hosts that grow on a variety of
feedstocks, including simple or complex carbohydrates, organic acids and
alcohols, and/or hydrocarbons over a wide range of temperature and pH
values. Although 'the genes described in the instant invention have been
isolated for expression in oleaginous yeast, it is contemplated that
because transcription, translation and the protein biosynthetic apparatus is.
highly conserved, any bacteria, yeast, algae and/or filamentous fungus will
be a suitable host for expression of the present nucleic acid fragments.



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Preferred microbial hosts are oleaginous organisms, such as
oleaginous yeasts. These oleaginous organisms are naturally capable of
oil synthesis and accumulation, wherein the oil can comprise greater than
about 25% of the cellular dry weight, more preferably greater than about
30% of the cellular dry weight, and most preferably greater than about
40% of the cellular dry weight. Genera typically identified as oleaginous
yeast include, but are not limited to: Yarrovvia, Mortierella, Candida,
Rhodotorula, Rhodosporidium, Cryptococcus, Trichosporon and
Lipomyces. More specifically, illustrative oil-synthesizing yeasts include:
Rhodosporidium toruloides, Lipomyces starkeyii, L. lipoferus, Candida
revkaufi, C. pulcherrima, C. tropicalis, C: utilis, Trichosporon pullans,
T. cutaneum, Rhodotorula glutinus, R. graminis, Mortierella alpina and
Yarrowia lipolytica (formerly classified as Candida lipolytica).
Most preferred is the oleaginous yeast Yarrovvia lipolytiea; and, in a
further embodiment, most preferred are the Y. lipolytica strains designated
as ATCC #20362, ATCC #8862, ATCC #18944, ATCC #76982 and/or
LGAM S(7)1 (Papanikolaou S., and Aggelis G., Bioresour. Technol.
82(1 ):43-9 (2002)).
Fermentation Processes For PUFA Production
The transformed microbial host cell is grown under conditions that
optimize activity of fatty acid bio,'synthetic genes and produce the greatest
and the most economical yield of fatty acids (e.g., LA,~which can in turn
increase the production of various w-3 and/or ~-6 fatty acids). In general,
media conditions that may be optimized include the type and amount of
carbon source, the type and amount of nitrogen source, the carbon-to-
nitrogen ratio, the oxygen level, growth temperature, pH, length of the
biomass production phase, length of the oil accumulation phase and the
time of cell harvest. Microorganisms of interest, such as oleaginous yeast,
are grown in complex media (e.g., yeast extract-peptone-dextrose broth
(YPD)) or a defined minimal media that lacks a component necessary for
growth and thereby forces selection of the desired expression cassettes
(e.g., Yeast Nitrogen Base (DIFCO Laboratories, Detroit, MI)).
Fermentation media in the present invention must contain a suitable
carbon source. Suitable carbon sources may include, but are not limited
to: monosaccharides (e.g., glucose, fructose), disaccharides (e.g., lactose,
sucrose), oligosaccharides, polysaccharides (e.g., starch, cellulose or
mixtures thereof), sugar alcohols (e.g., glycerol) or mixtures from
46



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
renewable feedstocks (e.g., cheese whey permeate, cornsteep liquor,
sugar beet molasses, barley malt). Additionally, carbon sources may
include alkanes, fatty acids, esters of fatty acids, monoglycerides,
diglycerides, triglycerides, phospholipids and various commercial sources
of fatty acids including vegetable oils (e.g., soybean oil) and animal fats.
Additionally, the carbon substrate may include one-carbon substrates
(e.g., carbon dioxide or methanol) for which metabolic conversion into key
biochemical intermediates has been demonstrated. Hence it is
contemplated that the source of carbon utilized in the present invention
may encompass a wide variety of carbon-containing substrates and will
only be limited by the choice of the host organism. Although all of the
above mentioned carbon substrates and mixtures thereof are expected to
be suitable in the present invention, preferred carbon substrates are
sugars and/or fatty acids. Most preferred is glucose and/or fatty acids
containing between 10-22 carbons.
Nitrogen may be supplied from an inorganic (e.g., (NH4)2S04) or
organic source (e.g., urea or glutamate). In addition to appropriate carbon
and nitrogen sources, the fermentation media must also contain suitable
minerals, salts, cofactors, buffers, vitamins, and other components known
to those skilled in the art suitable for the growth of the microorganism and
promotion of the enzymatic pathways necessary for PUFA production.
Particular attention is given to several metal ions (e.g., Mn+2, Co+2, Zn+2,
Mg+2) that promote synthesis of lipids and PUFAs (Nakahara, T. et al., Ind.
Appl. Single Cell Oils, D. J. Kyle and R. Colin, eds. pp 61-97 (1992)).
Preferred growth media in the present invention are common
commercially prepared media, such as Yeast Nitrogen Base (DIFCO
Laboratories, Detroit, MI). Other defined or synthetic growth media may
also be used and the appropriate medium for growth of the particular
microorganism will be known by one skilled in the art of microbiology or
fermentation science. A suitable pH range for the fermentation is typically
between about pH 4.0 to pH ~.0, wherein pH 5.5 to pH 7.0 is preferred as
the range for the initial growth conditions. The fermentation may be
conducted under aerobic or anaerobic conditions, wherein microaerobic
conditions are preferred.
Typically, accumulation of high levels of PUFAs in oleaginous yeast
cells requires a two-stage process, since the metabolic state must be
"balanced" between growth and synthesis/storage of fats. Thus, most
47



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
preferably, a two-stage fermentation process is necessary for the
production of PUFAs in oleaginous yeast. In this approach, the first stage
of the fermentation is dedicated to the generation and accumulation of cell
mass and is characterized by rapid cell growth and cell division. In the
second stage of the fermentation, it is preferable to establish conditions of
nitrogen deprivation in the culture to promote high levels of lipid
accumulation. The effect of this nitrogen deprivation is to reduce the
effective concentration of AMP in the cells, thereby reducing the activity of
the NAD-dependent isocitrate dehydrogenase of mitochondria. When this
occurs, citric acid will accumulate, thus forming abundant pools of acetyl-
CoA in the cytoplasm and priming fatty acid synthesis. Thus, this phase is
characterized by the cessation of cell division followed by the synthesis of
fatty acids and accumulation of oil.
Although cells are typically grown at about 30 °C, some studies
have shown increased synthesis of unsaturated fatty acids at lower
temperatures (Yongmanitchai and Ward, Appl. Environ. Microbiol. 57:419-
(1991 )). Based on process economics, this temperature shift should
likely occur after the first phase of the two-stage fermentation, when the
bulk of the organisms' growth has occurred.
20 It is contemplated that a variety of fermentation process designs
may be applied, where commercial production of omega fatty acids using
the instant X12 desaturase is desired. For example, commercial
production of PUFAs from a recombinant microbial host may be produced
by a batch, fed-batch or continuous fermentation process.
25 A batch fermentation process is a closed system wherein the media
composition is set at the beginning of the process and not subject to
further additions beyond those required for maintenance of pH and oxygen
level during the process. Thus, at the beginning of the culturing process
the media is inoculated with the desired organism and growth or metabolic
activity is permitted to occur without adding additional substrates (i.e.,
carbon and nitrogen sources) to the medium. In batch processes the
metabolite and biomass compositions of the system change constantly up
to the time the culture is terminated. In a typical batch process, cells
proceed through a static lag phase to a high-growth log phase and finally
to a stationary phase, wherein the growth rate is diminished or halted. Left
untreated, cells in the stationary phase will eventually die. A variation of
the standard batch process is the fed-batch process, wherein the substrate
48



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
is continually added to the fermentor over the course of the fermentation
process. A fed-batch process is also suitable in the present invention.
Fed-batch processes are useful when catabolite repression is apt to inhibit
the metabolism of the cells or where it is desirable to have limited amounts
of substrate in the media at any one time. Measurement of the substrate
concentration in fed-batch systems is difficult and therefore may be
estimated on the basis of the changes of measurable factors such as pH,
dissolved oxygen and the partial pressure of waste gases (e.g., CO~).
Batch and fed-batch culturing methods are common and well known in the
art and examples may be found in Thomas D. Brock in Biotechnology: A
Textbook of Industrial Microbioloay, 2"d ed., (1989) Sinauer Associates:
Sunderland, MA; or Deshpande, Mukund V., Appl. Biochem. Biotechnol.,
36:227 (1992), herein incorporated by reference.
Commercial production of omega fatty acids using the instant 012
desaturase may also be accomplished by a continuous fermentation
process wherein a defined media is continuously added to a bioreactor
while an equal amount of culture volume is removed simultaneously for
product recovery. Continuous cultures generally maintain the cells in the
log phase of growth at a constant cell density. Continuous or semi-
continuous culture methods permit the modulation of one factor or any
number of factors that affect cell growth or end product concentration. For
example, one approach may limit the carbon source and allow all other
parameters to moderate metabolism. In other systems, a number of
factors affecting growth may be altered continuously while the cell
concentration, measured by media turbidity, is kept constant. Continuous
systems strive to maintain steady state growth and thus the cell growth
rate must be balanced against cell loss due to media being drawn off the
culture. Methods of modulating nutrients and growth factors for
continuous culture processes, as well as techniques for maximizing the
rate of product formation, are well known in the art of industrial
microbiology and a variety of methods are detailed by Brock, supra.
Purification Of PUFAs
The PUFAs may be found in the host microorganism as free fatty
acids or in esterified forms such as acylglycerols, phospholipids, sulfolipids
or glycolipids, and may be extracted from the host cell through a variety of
means well-known in the art. One review of extraction techniques, quality
analysis and acceptability standards for yeast lipids is that of Z. Jacobs
49



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
(Crifical Reviews in Biotechnology 12(5/6):463-491 (1992)). A brief review
of downstream processing is also available by A. Singh and O. Ward (Adv.
Appl. Microbiol. 45:271-312 (1997)).
In general, means for the purification of PUFAs may include
extraction with organic solvents, sonication, supercritical fluid extraction
(e.g., using carbon dioxide), saponification and physical means such as
presses, or combinations thereof. Of particular interest is extraction with
methanol and chloroform in the presence of water (E. G. Bligh & W. J.
Dyer, Can. J. Biochem. Physiol. 37:911-917 (1959)). Where desirable, the
aqueous layer can be acidified to protonate negatively-charged moieties
and thereby increase partitioning of desired products into the organic
layer. After extraction, the organic solvents can be removed by
evaporation under a stream of nitrogen. When isolated in conjugated
forms, the products may be enzymatically or chemically cleaved to release
the free fatty acid or a less complex conjugate of interest, and can then be
subject to further manipulations to produce a desired end product.
Desirably, conjugated forms of fatty acids are cleaved with potassium
hydroxide.
If further purification is necessary, standard methods can be
employed. Such methods may include extraction, treatment with urea,
fractional crystallization, HPLC, fractional distillation, silica gel
chromatography, high-speed centrifugation or distillation, or combinations
of these techniques. Protection of reactive groups, such as the acid or
alkenyl groups, may be done at any step through known techniques (e.g.,
alkylation, iodination). Methods used include methylation of the fatty acids
to produce methyl esters. Similarly, protecting groups may be removed at
any step. Desirably, purification of fractions containing GLA, STA, ARA,
DHA and EPA may be accomplished by treatment with urea and/or
fractional distillation.
DESCRIPTION OF PREFERRED EMBODIMENTS
The ultimate goal of the work described herein is the development
of an oleaginous yeast that accumulates oils enriched in cu-3 andlor cu-6
PUFAs. Toward this end, X12 desaturases must be identified that function
efficiently in oleaginous yeasts, to enable synthesis and high accumulation
of preferred PUFAs in these hosts. Identification of efficient 012
desaturases is also necessary to manipulate the ratio of cu-3 to w-6 PUFAs
produced in host cells.



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
In the present invention, Applicants have isolated and cloned the
only gene in Yarrowia lipolytica that encodes a 012 desaturase enzyme.
Confirmation of this gene's activity was provided based upon: 1.) the lack
of detectable LA in a strain wherein disruption of the native 012
desaturase by targeted gene replacement through homologous
recombination had occurred (Example 2); 2.) restoration of LA
biosynthesis (complementation) in the disrupted strain upon transformation
with the chimeric gene (Example 4); and 3.) the overproduction of LA in
wild type cells upon transformation with the chimeric gene (Example 4).
Thus, this 012 desaturase gene is useful for expression in various
microbial hosts, and particularly for overexpression in oleaginous yeasts
(e.g., the native host Yarrowia lipolytica). Additional benefits may result
since expression of the 012 desaturase can also be put under the control
of strong constitutive or regulated promoters that do not have the
regulatory constraints of the native gene.
Following the initial demonstration of functionality of the X12
desaturase in Yarrowia lipolytica, the Applicants then explored methods of
optimizing PUFA production within this model host organism. Specifically,
a 012 desaturase-disrupted host strain of Y. lipolytica was created and
transformed with an expression cassette comprising a heterologous ~6
desaturase, elongase, O5 desaturase and X17 desaturase. When fed
ALA as a substrate, the transformed host was able to produce STA without
co-synthesis of any w-6 fatty acid (Example S). Thus, this work
demonstrated that upon transformation with appropriate genes of the cu-3
biosynthetic pathway and feeding of ALA as a substrate, only w-3 fatty
acids (e.g., ETA, EPA, DPA, DHA) could be synthesized (i.e., without co-
synthesis of w-6 fatty acids) in Yarrowia strains lacking X12 desaturase
activity.
EXAMPLES
The present invention is further defined in the following Examples.
It should be understood that these Examples, while indicating preferred
embodiments of the invention, are given by way of illustration only. From
the above discussion and these Examples, one skilled in the art can
ascertain the essential characteristics of this invention, and without
departing from the spirit and scope thereof, can make various changes
and modifications of the invention to adapt it to various usages and
conditions.
51



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
GENERAL METHODS
Standard recombinant DNA and molecular cloning techniques used
in the Examples are well known in the art and are described by: 1.)
Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A
Laboratory Manual; Cold Spring Harbor Laboratory: Cold Spring Harbor,
NY (1989) (Maniatis); 2.) T. J. Silhavy, M. L. Bennan, and L. W. Enquist,
Experiments with Gene Fusions; Cold Spring Harbor Laboratory: Cold
Spring Harbor, NY (1984); and 3.) Ausubel, F. M. et al., Current Protocols
in Molecular Biology, published by Greene Publishing Assoc. and Wiley-
Interscience (1987).
Materials and methods suitable for the maintenance and growth of
microbial cultures are well known in the art. Techniques suitable for use in
the
following Examples may be found as set out in Manual of Methods for General
Bacteriology (Phillipp Gerhardt, R. G. E. Murray, Ralph N. Costilow, Eugene W.
Nester, Willis A. Wood, Noel R. Krieg and G. Briggs Phillips, Eds), American
Society for Microbiology: Washington, D.C. (1994)); or by Thomas D. Brock in
Biotechnology' A Textbook of Industrial Microbiology, 2r,a ed., Sinauer
Associates: Sunderland, MA (1989). All reagents, restriction enzymes and
materials used for the growth and maintenance of microbial cells were obtained
from Aldrich Chemicals (Milwaukee, WI), DIFCO Laboratories (Detroit, MI),
GIBCOIBRL (Gaithersburg, MD), or Sigma Chemical Company (St. Louis, MO),
unless otherwise specified.
E. coli TOP10 cells and E. coli Electromax DH10B cells were obtained
from Invitrogen (Carlsbad, CA). Max Efficiency competent cells of E. coli
DH5cr
were obtained from GIBCO/BRL (Gaithersburg, MD). E. coli (XL1-Blue)
competent cells were purchased from the Stratagene Company (San Diego, CA).
E. coli strains were typically grown at 37 °C on Luria Bertani (LB)
plates.
General molecular cloning was performed according to standard methods
(Sambrook et al., supra). Oligonucleotides were synthesized by Sigma-Genosys
(Spring, TX). PCR products were cloned into Promega's pGEM-T-easy vector
(Madison, WI).
DNA sequence was generated on an ABI Automatic sequencer
using dye terminator technology (U.S. 5,366,860; EP 272,007) using a
combination of vector and insert-specific primers. Sequence editing was
perFormed in Sequencher (Gene Codes Corporation, Ann Arbor, MI).
All sequences represent coverage at least two times'in both directions.
52



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Comparisons of genetic sequences were accomplished using DNASTAR
software (DNA Star, Inc.).
The meaning of abbreviations is as follows: "sec" means second(s),
"min" means minute(s), "h" means hour(s), "d" means day(s), "pL" means
microliter(s), "mL" means milliliter(s), "L" means liter(s), "pM" means
micromolar, "mM" means millimolar, "M" means molar, "mmol" means
millimole(s), "pmole" mean micromole(s), "g" means gram(s), "pg" means
microgram(s), "ng" means nanogram(s), "U" means unit(s), "bp" means
base pairs) and "kB" means kilobase(s).
Cultivation Of Yarrovvia lipolytica
Yarrovvia lipolytica strains ATCC #76982 and ATCC #90812 were
purchased from the American Type Culture Collection (Rockville, MD). Y.
lipolytica strains were usually grown at 28 °C on YPD agar (1 % yeast
extract, 2% bactopeptone, 2% glucose, 2% agar). For selection of
transformants, minimal medium (0.17% yeast nitrogen base (DIFCO
Laboratories, Detroit, MI) without ammonium sulfate or amino acids, 2%
glucose, 0.1 % proline, pH 6.1 ) was used. Supplements of adenine,
leucine, lysine and/or uracil were added as appropriate to a final
concentration of 0.01 %.
Fatty Acid Analysis Of Yarrowia lipolytica
For fatty acid analysis, cells were collected by centrifugation and
lipids were extracted as described in Bligh, E. G. & Dyer, W. J. (Can. J.
Biochem. Physiol. 37:911-917 (1959)). Fatty acid methyl esters were
prepared by transesterification of the lipid extract with sodium methoxide
(Roughan, G., and Nishida I. Arch Biochem Biophys. 276(1 ):38-46 (1990))
and subsequently analyzed with a Hewlett-Packard 6890 GC fitted with a
30-m X 0.25 mm (i.d.) HP-INNOWAX (Hewlett-Packard) column. The
oven temperature was from 170 °C (25 min hold) to 185 °C at 3.5
°C/min.
For direct base transesterification, Yarrovvia culture (3 mL) was
harvested, washed once in distilled water, and dried under vacuum in a
Speed-Vac for 5-10 min. Sodium methoxide (100 ~,I of 1 %) was added to
the sample, and then the sample was vortexed and rocked for 20 min.
After adding 3 drops of 1 M NaCI and 400 ~,I hexane, the sample was
vortexed and spun. The upper layer was removed and analyzed by GC as
described above.
53



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
EXAMPLE 1
Construction Of Plasmids Suitable For Gene Expression In Yarroviria
li of ica
The present Example describes the construction of plasmids pYS,
pY5-4, pY5-13 and pY5-20.
Construction Of Plasmid pY5
The plasmid pYS, a derivative of pINA532 (a gift from Dr. Claude
Gaillardin, Insitut National Agronomics, Centre de biotechnologie Agro-
Industrielle, laboratoire de Genetique Moleculaire et Cellularie INRA-
CNRS, F-78850 Thiverval-Grignon, France), was constructed for
expression of heterologous genes in Yarrov~ria lipolytica, as diagrammed in
Figure 3.
First, the partially-digested 3598 by EcoRl fragment containing the
ARS18 sequence and LEU2 gene of pINA532 was subcloned into the
EcoRl site of pBluescript (Strategene, San Diego, CA) to generate pY2.
The TEF promoter (Muller S., et al. Yeast, 14: 1267-1283 (1998)) was
amplified from Yarrovvia lipolytica genomic DNA by PCR using TEFS'
(SEQ ID N0:1 ) and TEF3' (SEQ ID NO:2) as primers. PCR amplification
was carried out in a 50 ~,I total volume containing: 100 ng Yarrowia
genomic DNA, PCR buffer containing 10 mM KCI, 10 mM (NH4)2S04,
20 mM Tris-HCI (pH 8.75), 2 mM MgSO4, 0.1 % Triton X-100, 1'00 ~,g/mL
BSA (final concentration), 200 ~.M each deoxyribonucleotide triphosphate,
10 pmole of each primer and 1 ~I of PfuTurbo DNA polymerase
(Stratagene). Amplification was carried out as follows: initial denaturation
at 95 ~C for 3 min, followed by 35 cycles of the following: 95 ~C for 1 min;
56 ~C for 30 sec, 72 ~C for 1 min. A final extension cycle of 72 ~C for
10 min was carried out, followed by reaction termination at 4 ~C. The
418 by PCR product was ligated into pCR-Blunt to generate pIP-tef. The
BamHllEcoRV fragment of pIP-tef was subcloned into the BamHllSmal
sites of pY2 to generate pY4.
The XPR2 transcriptional terminator was amplified by PCR using
pINA532 as template and XPRS' (SEQ ID N0:3) and XPR3' (SEQ ID
N0:4) as primers. The PCR amplification was carried out in a 50 ~,I total
volume, using the components and conditions described above. The
179 by PCR product was digested with Sacll and then ligated into the
Sacll site of pY4 to generate pYS. Thus, pY5 (shown in Figures 3 and 4)
is useful as a Yarrowia-E, coli shuttle plasmid containing:
54



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
1.) a Yarrovvia autonomous replication sequence (ARS18);
2.) a ColE1 plasmid origin of replication;
3.) an ampicillin-resistance gene (AmpR), for selection in E. coli;
4.) a Yarrovvia LEU2 gene (E.C. 1.1.1.85, encoding
isopropylmalate isomerase), for selection in Yarrovvia;
5.) the translation elongation promoter (TEF P), for expression
of heterologous genes in Yarrowia; and
6.) the extracellular protease gene terminator (XPR2) for
transcriptional termination of heterologous gene expression
in Yarrowia.
Construction Of Plasmids pY-4, pY5-13 And pY5-20
pY5-4 and pY5-13 (Figure 4) were constructed as derivatives of pY5
to faciliate subcloning and heterologous gene expression in Yarro~nria
lipolytica.
Specifically, pY5-4 was constructed by three rounds of site-directed
mutagenesis using pY5 as template. A Ncol site located inside the Leu2
reporter gene was eliminated from pY5 using oligonucleotides YL1 and
YL2 (SEQ ID NOs:5 and 6) to generate pY5-1. A Ncol site was introduced
into pY5-1 between the TEF promoter and XPR2 transcriptional terminator
by site-directed mutagenesis using oligonucleotides YL3 and YL4 (SEQ ID
NOs:7 and 8) to generate pY5-2: A Pacl site was then introduced into
pY5-2 between the TEF promoter and XPR2 transcriptional terminator
using oligonucleotides YL23 and YL24 (SEQ ID NOs:9 and 10) to
generate pY5-4.
pY5-13 was constructed by 6 rounds of site-directed mutagenesis
using pY5 as.template. Both Sall and Clal sites were eliminated from pY5
by site-directed mutagenesis using oligonucleotides YL5 and YL6 (SEQ ID
NOs:11 and 12) to generate pY5-5. A Sall site was introduced into pY5-5
between the Leu2 gene and the TEF promoter by site-directed
mutagenesis using oligonucleotides YL9 and YL10 (SEQ ID NOs:13 and
14) to generate pY5-6. A Pacl site was introduced into pY5-6 between the
LEU2 gene and ARS18 using oligonucleotides YL7 and YL8 (SEQ ID
NOs:15 and 16) to generate pY5-8. A Ncol site was introduced into pY5-8
around the translation start codon of the TEF promoter using
oligonucleotides YL3 and YL4 (SEQ ID NOs:7 and 8) to generate pY5-9.
The Ncol site inside the Leu2 gene of pY5-9 was eliminated using YL1
and YL2 oligonucleotides (SEQ ID NOs:5 and 6) to generate pY5-12.



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Finally, a BsillVl site was introduced into pY5-12 between the ColEl and
XPR2 region using oligonucleotides YL61 and YL62 (SEQ ID NOs:17 and
18) to generate pY5-13.
Plasmid pY20 is a derivative of pYS. It was constructed by insertinc
a Not I fragment containing a chimeric hygromycin resistance gene
(hygromycin-B phosphotransferase; GenBank Accession No. P00557) into
the Not I site of pYS. The chimeric gene had the hygromycin resistance
ORF under the control of a Yarrowia lipolytica TEF promoter.
EXAMPLE 2
Cloning Of The Partial Yarrovvia lipolytica 012 Desaturase And Disruption
Of The Endogenous 012 Desaturase Gene
Based on the fatty acid composition of wildtype Yarrowia lipolytiea
(ATCC #76982) which demonstrated that the organism could make LA
(18:2) but not ALA (18:3), it was assumed that Y. lipolytica would likely
contain genes) having X12 desaturase activity but not X15 desaturase
activity. Thus, the present Example describes the use of degenerate PCR
primers to isolate a partial coding sequence of the Y. lipolytica X12
desaturase and the use of the partial sequence to disrupt the native gene.
Cloning Of The Partial Putative X12 Desaturase Seguence From Y.
lipolytica By PCR Using Degenerate PCR Primers
Genomic DNA was isolated from Y. lipolytica (ATCC #76982) using
DNeasy Tissue Kit (Qiagen, Catalog #69504) and resuspended in kit
buffer AE at a DNA concentration of 0.5 ~,g/~,I. PCR amplifications were
performed using the genomic DNA as template and several sets of
degenerate primers made to amino acid sequences conserved between
different fungal 012 desaturases (i.e., Mortierella alpina, Mucorrouxii,
Emericella nidulans and Pichia augusta). The best results were obtained
with a set of upper and lower degenerate primers, P73 and P76,
respectively, as shown in the Table below.
56



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Table 3
Degenerate Primers Used For Amplification Of The Partial Putative 012
Desaturase
Primer DescriptionDegenerate Nucleotide Corresponding
Set Sequence Amino Acid
Sequence


P73 (32) 26- 5'- WVLGHECGH


mers TGGGTCCTGGGCCAYGART (SEQ ID N0:20)


GYGGNCA-3' SEQ ID N0:19


P76 (64) 30- 5'- (M/I)PFYHAEEAT


mers GGTGGCCTCCTCGGCGTGR (SEQ ID N0:22)


TARAANGGNAT-3' (SEQ ID


N0:21


[Note: Abbreviations are standard for nucleotides and proteins. The nucleic
acid
degeneracy code used is as follows: R= A/G; Y=C/T; and N=A/C/G/T.]
The PCR was carried out in an Eppendorf Mastercycler Gradient
thermocycler according to the manufacturer's recommendations.
Amplification was carried out as follows: initial denaturation at 95 ~C for
1 min, followed by 30 cycles of denaturation at 95 0C for 30 sec, annealing
at 58 ~C for 1 min, and elongation at 72 ~C for 1 min. A final elongation
cycle at 72 ~C for 10 min was carried out, followed by reaction termination
at 4 ~C.
The expected (ca. 740 bp) size PCR product was detected by
agarose gel electrophoresis, isolated, purified, cloned into a pTA vector
(Invitrogen) and sequenced. The resultant sequence (contained within
SEQ ID N0:23) had homology to known X12 desaturases, based on
BLAST program analysis (Basic Local Alignment Search Tool; Altschul, S.
F., et al., J. Mol. Biol. 215:403-410 (1993).
Taraeted Disruption Of TheYarrovtria lipolvtica X12 Desaturase Gene
Targeted disruption of the native 012 desaturase gene in Y. lipolytica
#76982 was carried out by homologous recombination-mediated
replacement of the 012 desaturase gene with a targeting cassette
designated as pY23D12. pY23D12 was derived from plasmid pY20
(Example 1 ). Specifically, pY23D12 was created by inserting a 642 by
Hind IIIlEco RI fragment into similarly linearized pY20. This 642 by
fragment consisted of (in 5' to 3' orientation): 3' homologous sequence
57



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
from position +718 to +1031 (of the coding sequence (ORF) in SEQ ID
N0:23), a Bgl II restriction site and 5' homologous sequence from position
+403 to +717 (of the coding sequence (ORF) in SEQ ID N0:23). The
fragment was prepared by PCR amplification of 3' and 5' sequences from
the 642 by PCR product using sets of PCR primers P99 and P100 (SEQ
ID NOs:25 and 26) and P101 and P102 (SEQ ID NOs:27 and 28),
respectively.
pY23D12 was linearized by Bgl II restriction digestion and
transformed into mid-log phase Y. lipolytica cells by the lithium acetate
method according to the method of Chen, D. C. et al. (Appl Microbiol
Biotechnol. 48(2):232-235 (1997)). Briefly, Y. lipolytica ATCC #76982 was
streaked onto a YPD plate and grown at 30 °C for approximately 18 hr.
Several large loopfuls of cells were scraped from the plate and
resuspended in 1 mL of transformation buffer containing:
~ 2.25 mL of 50% PEG, average MW 3350;
~ 0.125 mL of 2 M Li acetate, pH 6.0;
~ 0.125 mL of 2 M DTT; and,
~ 50 ~,g sheared salmon sperm DNA.
About 500 ng of plasmid DNA were incubated in 100 ~,I of
resuspended cells, and maintained at 39 °C for 1 hr with vortex mixing
at
15 min intervals. The cells were plated onto YPD hygromycin selection
plates and maintained at 30 °C for 2 to 3 days.
Four hygromycin-resistant colonies were isolated and screened for
targeted disruption by PCR. One set of PCR primers (P119 [SEQ ID
N0:29] and P120 [SEQ ID N0:30]) was designed to amplify a specific
junction fragment following homologous recombination. Another set of
PCR primers (P121 [SEQ ID N0:31] and P122, [SEQ ID NO:32]) was
designed to detect the native gene. Three of the four hygromycin-resistant
colonies were positive for the junction fragment and negative for the native
fragment, thus confirming targeted integration.
Determination Of Fatty Acid Profile In The 012 Desaturase-Disrupted
Strain
Disruption of the 012 desaturase gene was further confirmed by GC
analysis of the total lipids in one of the disrupted strains, designated as "Q
d12D". Single colonies of wild type (ATCC #76982) and Q-d12D Y.
lipolytica were each grown in 3 mL minimal media (formulation/L: 20 g
glucose, 1.7 g yeast nitrogen base, 1 g L-proline, 0.1 g L-adenine, 0.1 g L-
58



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
lysine, pH 6.1 ) at 30 °C to an ODsoO ~ 1Ø The cells were harvested,
washed in distilled water, speed vacuum dried and subjected to direct
trans-esterification and GC analysis (as described in the General
Methods).
The fatty acid profile of wildtype Yarrovvia and the transformant Q-
d12D comprising the disrupted X12 desaturase are shown below in Table
4. Fatty acids are identified as 16:0 (palmitate), 16:1 (palmitoleic acid),
18:0, 18:1 (oleic acid) and 18:2 (LA); and the composition of each is
presented as a % of the total fatty acids.
Table 4
Fatty Acid Composition (% Of Total Fatty Acids) In Wildtype And
Transformant Yarrovvia lipolytica
Strain 16:0 16:1 18:0 18:1 18:2


Wild t a 11 14 2 33 34


Q-d12D disrupted6 ~ 15 1 74 nd
I I I


*nd= not detectable
Results indicated that the native X12 desaturase gene in the Q-d12D
strain was inactivated. Thus, it was possible to conclude that there was
only one gene encoding a functional X12 desaturase in Yarrouvia lipolytica
ATCC #76982.
EXAMPLE 3
Cloninct Of The Full-Length Yarrowia lipolyfica X12 Desaturase Gene
The present Example describes the recovery of the genomic
sequences flanking the disrupted gene by plasmid rescue, using the
sequence in the rescued plasmid to PCR the intact open reading frame of
the native gene. The full-length gene and its deduced amino acid
sequence is compared to other fungal desaturases.
Plasmid Rescue Of The Yarrowia lipolytica X12 Desaturase Gene
Since the X12 desaturase gene was disrupted by the insertion of
the entire pY23D12 vector that also contained an E. coli ampicillin-
resistant gene and E. coli ori, it was possible to rescue the flanking
sequences in E. coli. For this, genomic DNA of Y. lipolytica strain Q-d12D
(carrying the disrupted X12 desaturase gene; Example 2) was isolated
using the DNeasy Tissue Kit. Then, 10 p.g of the genomic DNA was
digested with 50 ~,I of restriction enzymes Age 1, Avr II, Nhe I and Sph I in
59



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
a reaction volume of 200 ~,I. Digested DNA was extracted with
phenol:chloroform and resuspended in 40 ~I deionized water. The
digested DNA (10 p,l) was self ligated in-a 200 p,l ligation mixture
containing 3 U T4 DNA ligase. Ligation was carried out at 16 °C for 12
hrs. The ligated DNA was extracted with phenol:chloroform and
resuspended in 40 p,l deionized water. Finally, 1 p.l of the resuspended
ligated DNA was used to transform E. coli by electroporation and plated
onto LB plates containing ampicillin (Ap). Ap-resistant colonies were
isolated and analyzed for the presence of plasmids by miniprep. The
following insert sizes were found in the recovered or rescued plasmids~
(Table 5):
Table 5
Insert Sizes Of Recovered Plasmids, According To Restriction Enzyme
Enz me Plasmid Insert Size
kB


Agel 1.6


Avrll 2.5


Nhel 9.4


Sphl 6.6


Sequencing of the plasmids was initiated with sequencing primers P99
(SEQ ID N0:25) and P102 (SEQ ID N0:28).
Based on the sequencing results, a full-length gene encoding the
Yarrowia lipolytica D12 desaturase gene was assembled (1936 bp; SEQ
ID N0:23). Specifically, SEQ ID N0:23 encoded an open reading frame of~
1257 bases (nucleotides +283 to +1539), while the deduced amino acid
sequence was 419 residues in length (SEQ ID N0:24).
The Yarrowia lipolyfica X12 desaturase protein (SEQ ID N0:24)
was used as a query against available sequence databases of filamentous
fungi, including: 1.) public databases of Neurospora crassa, Magnaporthe
grisea, Aspergillus nidulans and Kluyveromuces lactis; and 2.) a DuPont
EST library of Fusarium moniliforme strain M-8114 (E.I. du Pont de
Nemours and Co., Inc., Wilmington, DE) (F, moniliforme strain M-8114
available from the Fusarium Research Center, University Park, PA; see
also Planf Disease 81 (2): 211-216. (1997)). These BLAST searches
identified the following homologs (Table 6).



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Table 6
Description of X12 Desaturase Homoloas
Source S mbol Or anism


Contig 1.122 (scaffold 9) in the An1 Aspergillus
A. nidulans


genome project (sponsored by the nidulans
Center


for Genome Research (CGR), Cambridge,


MA.


Contig 1.15 (scaffold 1 ) in the An2 Aspergillus
A. nidulans


enome project; AAG36933 nidulans


DuPont EST sequence database, U.S. Fm1( Fusarium


Provisional A lication No. 60/519191 moniliforme


DuPont EST sequence database, U.S. Fm2 Fusarium


Provisional Application No. 60/519191 moniliforme


Ctg4369-0000002-2.1 in the GenolevuresKI Kluyveromyces


project. lactis


Locus MG08474.1 in contig 2.1597 Mg1 Magnaporthe
in the M.


grisea genome project (sponsored grisea
by the


CGR and International Rice Blast


Genome Consortium. '


Locus MG01985.1 in contig 2.375 Mg2 Magnaporthe
in the M.


rises enome project rises


GenBank Accession No. AABX01000374 Nc1 Neurospora


crassa


GenBank Accession No. AABX01000577 Nc2 Neur~spora


crassa


All of the homologs were either unannotated or annotated as a fatty acid
desaturase. Furthermore, the nucleotide sequences from A. nidulans
were incomplete and/or genomic with putative intron sequences; the
Applicants made a tentative assembly of the deduced amino acids for
comparison with amino acid sequences from the other homologs.
A comparison of the deduced amino acid sequence of the Yarrowia
lipolytica X12 desaturase (SEQ ID N0:24) was made with the fungal
homologs shown above in Table 6 and other known X12 desaturases, as
described below in Table 7.
61



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Table 7
Known X12 Desaturases
Source S mbol Orc anism


GenBank Accession No. AAG36933En Emericella nidulans


GenBank Accession No. AF110509Ma Mortierella alpina


GenBank Accession No. AB020033MaB Mortierella alpina


GenBank Accession No. AAL13300;MaC Mortierella alpina
AF417244


GenBankAccession No. AF161219 Mr Mucorrouxii


Ctg1334- Pa Pichia augusta
0000001-1.1.
see Genolevures project.


Specifically, the analysis was performed using the ClustalW
alignment algorithm (Slow/Accurate, Gonnet option; Thompson et. al.,
Nucleic Acids Res. 22:4673-4680 (1994)) of the DNASTAR software
package (DNASTAR Inc., Madison, WI). This comparison revealed the
Pair Distances shown in Figure 5, wherein "YI" corresponds to the
Yarrovvia lipolytica 012 desaturase. Percent similarity and divergence are
shown in the upper and lower triangles, respectively. Thus, the Y.
lipolytica X12 desaturase was at least 53% identical to the other X12
desaturase homologs (having maximal identity to the A. nidulans
sequence (An2)).
EXAMPLE 4
i
Expression Of Yarrovvia lipolytica X12 Desaturase ORF Under The
Control Of A Heteroloaous Yarrovvia Promoter
The present Example describes the expression of the X12
desaturase ORF in a chimeric gene under the control of a heterologous
(non-012 desaturase) Yarrovvia promoter to complement the 012
desaturase-disrupted mutant and enable the overproduction of LA in the
wildtype strain.
Expression Of Y, lipolytica X12 Desaturase In Yarrowia lipolytica.
The ORF encoding the Y. lipolytica X12 desaturase was PCR
amplified using upper primer P147 (SEQ ID N0:33) and lower primer P148
(SEQ ID N0:34) from the genomic DNA of Y. lipolyfica ATCC #76982.
The correct sized (1260 bp) fragment was isolated, purified, digested with
Nco I and Not I and cloned into Ncol-Not I cut pY5-13 vector (Example 1 ),
62



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
such that the gene was under the control of the TEF promoter. Correct
transformants were confirmed by miniprep analysis and the resultant
plasmid was designated pY25-d12d.
Plasmids pY5-13 (the "control") and pY25-d12d were each
individually transformed into Y. lipolytica ATCC #76982 wild-type (WT) and
d12d-disrupted strains (Q-d12D, also referred to as "d12K0" in the Table
below) and selected on Bio101 DOB%CSM-Leu plates.
Single colonies of transformants were grown up and GC analyzed as
described in the General Methods. Results are shown in the Table below. Fatty
acids are identified as 16:0, 16:1, 18:0, 18:1 (oleic acid) and 18:2 (LA); and
the
composition of each is presented as a % of the total fatty acids. "D12d SC"
was
calculated according to the following formula: ([18:2]/[18:1+18:2])*100 and
represents percent substrate conversion.
Table 8
Fatty Acid Composition (% Of Total Fatty Acids)
Strain Piasmid % % % % % D12d
16:0 16:1 18:0 18:1 18:2 SC


D12K0 pY5-13 8 10 2 80 nd 0


D12K0 pY25-d12d 11 8 2 34 45 57


WT pY5-13 10 10 1 32 47 59


WT pY25-d 12d 12 7 2 21 59 74


*nd= not detectable
The results showed that the 012 desaturase promoter was
equivalent in strength to the TEF promoter (57% substrate conversion in
the d12K0 strain expressing the X12 desaturase under the control of the
TEF promoter, compared to 59% substrate conversion in the wild type
strain expressing the 012 desaturase under the control of the native 012
desaturase promoter). On this basis, it is expected that the X12
desaturase promoter can be used for heterologous expression of other
ORFs in Yarrovvia.
Additionally, the results demonstrated. that overexpression of the
X12 desaturase in wild type cells resulted in even higher levels of LA
production (18:2). Specifically, 74% substrate conversion was observed in
the wildtype strain overexpressing the X12 desaturase under the control of
the TEF promoter, as opposed to only 59% substrate conversion in the
63



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
wild type strain. On the basis of these results, it would be expected that
overexpression of the X12 desaturase, in combination of other genes for
PUFA biosynthesis (e.g., a Ll6~desaturase, elongase, 05 desaturase, 017
desaturase), would result in higher production of cu-3 and/or cu-6 PUFAs.
Additionally, it would be expected that disruption of the native 012
desaturase and expression of other genes for PUFA biosynthesis (e.g., a
D6 desaturase, elongase, D5 desaturase, X17 desaturase) would result in
production of "pure" cu-3 PUFAs, without co-synthesis of any w-6 PUFAs.
EXAMPLE 5
Selection Of O6 Desaturase ~5 Desaturase 017 Desaturase And Hiah Affinity
PUFA Elonaase Genes For Expression In Yarrotwia lipolytica
Prior to the introduction of specific genes encoding an w-3 andlor cu-6
biosynthetic pathway into Yarroiwia lipolytica containing a disrupted d12
desaturase (Example 8), it was necessary to confirm the functionality of
heterologous ~6 desaturase, elongase, ~5 desaturase and X17 desaturase
genes expressed in Yarrovvia. This was accomplished by measuring the
conversion efficiency of each wildtype protein in the alternate host.
Specifically,
a Mortierella alpina 05 desaturase, a M. alpina 06 desaturase, a Saprolegnia
diclina X17 desaturase and a M. alpina high affinity PUFA elongase were
separately expressed and screened for activity in substrate-feeding trials.
Construction Of Expression Plasmids
In general, wildtype desaturase or elongase genes were either isolated
by restriction digestion or amplified by PCR and inserted into appropriate
vectors
for expression. Each PCR amplification was carried out in a 50 pl total
volume,
comprising PCR buffer containing: 10 ng template, 10 mM KCI, 10 mM
(NHq,)~S04, 20 mM Tris-HCI (pH 8.75), 2 mM MgSOq., 0.1 % Triton X-100,
100 pg/mL BSA (final concentration), 200 pM each deoxyribonucleotide
triphosphate, 10 pmole of each primer and 1 pl of PfuTurbo DNA polymerase
(Stratagene, San Diego, CA). Amplification was carried out as follows (unless
otherwise specified): initial denaturation at 95 °C for 3 min, followed
by 35 cycles
of the following: 95 °C for 1 min, 56 °C for 30 sec, 72
°C for 1 min. A final
extension cycle of 72 °C for 10 min was carried out, followed by
reaction
termination at 4 °C.
Wild Ty~~e Morfierella alaina (Accession #AF465281 ) O6 Desaturase
The 1384 by NcollNotl fragment of pCGR5 (U.S. 5,968,809), which
contains the M. alpina D6 desaturase gene (SEQ ID N0:36), was inserted
into the NcollNotl sites of pY5-2 (Example 1 ) to generate pY54.
64



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Wild Type Mortierella alpina (Accession #AF067654) 05 Desaturase
The M. alpina ~5 desaturase gene (SEQ ID N0:38) was amplified
by PCR using oligonucleotides YL11 and YL12 (SEQ ID NOs:40 and 41)
as primers and plasmid pCGR-4 (U.S. 6,075,183) as template. PCR
amplification was carried out as described above, with the exception that
the elongation step was extended to 1.5 min (for cycles 1-35). The
1357 by PCR product was digested with NcollNotl and ligated to
NcollNotl-digested pY5-13 (described in Example 1 ) to generate
pYMASpb (Figure 6).
Wild Type Saproleania diclina (ATCC #56851 ) 017 Desaturase
' The wild type X17 desaturase gene of S. diclina was amplified from
plasmid pRSP19 (US 2003/0196217 A1 ) by PCR using oligonucleotides YL21A
(SEQ ID N0:42) and YL22 (SEQ ID N0:43) as primers. The PCR products were
digested with NcollPacl and then ligated to Ncol/Pacl-digested pY5-4 (Figure
4;
described in Example 1 ) to generate pYSD17.
Wild Type Mortierella alpina (Accession #AX464731 ) Hiah Affinity
Elonaase
The 973 by Notl fragment of pRPB2 (WO 00/12720), containing the
coding region of a M. alpina high affinity PUFA elongase gene (SEQ ID
NO:44), was inserted into the Notl site of pY5 (described in Example 1;
Figures 3 and 4) to generate pY58.
Transformation Of Yarrovvia lipolytica
The plasmids pY54, pYMASpb, pYSD17 and pY58 were
transformed separately into Y. lipolytica ATCC# 76982 according to the
method of Chen, D. C. et al. (Appl Microbiol Biotechnol. 48(2):232-235
(1997)), and as described in Example 2 (with the exception that a leucine
auxotroph of Yarrovvia was used for transformation and transformants
were selected on minimal media plates lacking leucine).
Determination Of Percent Substrate Conversion
Single colonies of transformant Y. lipolytica containing pY54,
pYMASpb, pYSD17 or pY58 were each grown in 3 mL minimal media
(20 g/L glucose, 1.7 g/L yeast nitrogen base without amino acids, 1 g/L L
proline, 0.1 g/L L-adenine, 0.1 g/L L-lysine, pH 6.1 ) at 30 ~C to an ODg00
1Ø For substrate feeding, 100 pl of cells were then subcultured in 3 mL
minimal media containing 10 pg of substrate for about 24 hr at 30 ~C.
Cells were subsequently collected by centrifugation and the lipids were
extracted as described in the General Methods. Fatty acid methyl esters



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
were prepared by transesterification of the lipid extract. Percent substrate ,
conversion was determined as: [product/(substrate + product)]*100.
_Percent Substrate Conversion By M. alpina ~6 Desaturase
The M. alpina 06 desaturase converts LA to GLA and/or ALA to
STA. Y. lipolytica strains containing pY54 were grown as described above
(no substrate feeding required) and lipids were analyzed. The results
showed that Yarrov~ria strains with pY54 converted about 30% LA to GLA.
Percent Substrate Conversion By M. alpina ~5 Desaturase
The ~5 desaturase from M. alpina converts DGLA to ARA and/or
ETA to EPA. Y. lipolytica containing pYMASpb was grown from a single
colony, subcultured in minimal media containing 10 pg of DGLA and then
subjected to lipid analysis as described above. Yarrovvia strains with
pYMASpb converted about 30% of intracellular DGLA to ARA.
Percent Substrate Conversion By S. diclina X17 Desaturase
The S. diclina 017 desaturase converts ARA to EPA and/or DGLA
to ETA. Y. lipolytica strains containing pYSD17 were grown from single
colonies, subcultured in minimal media containing 10 pg of ARA and
subjected to lipid analysis as described above. The results of the ARA
feeding experiments showed that Yarrowia strains with pYSD17 converted
about 23% of intracellular ARA to EPA.
Percent Substrate Conversion Of Wild Type M. alpina Hiah Affinity
Elonaase
The M. alpina high affinity PUFA elongase converts GLA to DGLA,
STA to ETA, and/or EPA to DPA. Y. lipolytica strains containing pY58
were grown from single colonies, subcultured in minimal media containing
10 pg of GLA and subjected to lipid analysis as described above. The
results of the GLA feeding experiments showed that Yarroviria strains with
pY58 converted about 30% of intracellular GLA to DGLA.
EXAMPLE 6
Synthesis And Expression Of A Codon-Optimized 017 Desaturase Gene
In Yarrowia lipolyfica
Based on the results of Example 5, genes encoding ~6 desaturase,
elongase and ~5 desaturase activies were available that each enabled
~30% substrate conversion inYarrowia lipolytica. The 017 desaturase
from S. diclina, however, had a maximum conversion efficiency of only
23%. Thus, a codon-optimized 017 desaturase gene was designed,
based on the Saprolegnia diclina DNA sequence (SEQ ID N0:35),
66



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
according to the Yarrov~ria codon usage pattern, the consensus sequence
around the 'ATG' translation initiation codon and the general rules of RNA
stability (Guhaniyogi, G, and J. Brewer, Gene 265(1-2):11-23 (2001 )).
In addition to modification to the translation initiation site, 127 by of
the 1077 by coding region, comprising 117 codons, were codon-optimized.
A comparison between this codon-optimized DNA sequence (SEQ ID
N0:46) and the S. diclina X17 desaturase gene DNA sequence (SEQ ID
N0:35) is shown in Figure 7, wherein nucleotides in bold text correspond
to nucleotides that were modified in the codon-optimized gene. None of
the modifications in the codon-optimized gene changed the amino acid
sequence of the encoded protein (SEQ ID N0:47).
The synthetic, codon-optimized X17 desaturase was suitable for
expression with other genes for PUFA biosynthesis, to test the hypothesis
of whether expression in a Yarrovvia lipolytica host having its native D12
desaturase disrupted would result in production of "pure" cu-3 PUFAs,
without co-synthesis of any cu-6 PUFAs (infra, Example 8).
_Determinina The Preferred Codon Usaae In Yarrowia lipolytica
Approximately 100 genes of Y. lipolytica were found in the National
Center for Biotechnology Information public database. The coding regions
of these genes, comprising 121,167 bp, were translated by the Editseq
program of DNAStar to the corresponding 40,389 amino acids and were
tabulated to determine the Y. lipolytica codon usage profile shown in
Table 9. The column titled "No." refers to the number of times a given
codon encodes a particular amino acid in the sample of 40,389 amino
acids. The column titled "%" refers to the frequency that a given codon
encodes a particular amino acid. Entries shown in bold text represent the
codons favored in Yarrov~ria lipolytica.
TABLE 9
Codon Usaae In Yarrowia lipolytica
Codon Amino No. % ~ ~ Codon Amino No.
Acid ~ ~ Acid
GCA Ala (A) 359 11.4 AAA
Lys
(K)
344
14.8


GCC Ala (A) 1523 48.1 AAG L s (K) 1987 85.2


GCG Ala (A) 256 8.1 AUG Met M 1002 100


GCU Ala (A) 1023 32.3 UUC Phe (F) 996 61.1


67



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Codon Amino No. % Codon Amino No.


Acid Acid


AGA Arg (R) 263 13.2 UUU Phe 621 38.9
F


AGG Arg (R) 91 4.6 CCA Pro 207 9.6
(P)


CGA Arg (R) 1133 56.8 CCC Pro 1125 52.0
(P)


CGC Arg (R) 108 5.4 CCG Pro 176 8.2
(P)


CGG Arg (R) 209 1.0 CCU Pro 655 30.2
P


CGU Ar R 189 9.5 AGC Ser 335 11.3
(S)


AAC Ans (N) 1336 84.0 AGU Ser 201 6.8
(S)


AAU Ans N 255 16.0 UCA Ser 221 7.5
(S)


GAC Asp (D) 1602 66.8 UCC Ser 930 31.5
(S)


GAU As D 795 33.2 UCG Ser 488 16.5
(S)


UGC Cys (C) 268 53.2 UCU Ser 779 26.4
(S)


UGU C s C) 236 46.8 UAA Term 38 46.9


CAA Gln Q 307 17.0 UAG Term 30 37.0


CAG Gln (Q) 1490 83.0 UGA Term 13 16.1


GAA Glu (E) 566 23.0 ACA Thr 306 12.7
(T)


GAG Glu (E) 1893 77.0 ACC Thr 1245 51.5
(T)


GGA Gly (G) 856 29.7 ACG Thr 269 11.1
(T)


GGC Gly (G) 986 34.2 ACU Thr 595 24.6
T


GGG Gly (G) 148 5.1 UGG Trp 488 100


GGU GI G 893 31.0 UAC Tyr 988 83.2
(Y)


CAC His (H) 618 65.5 UAU T r 200 16.8
Y


CAU His H 326 34.5 GUA Val 118 4.2
(V)


AUA Ile (I) 42 2.1 GUC Val 1052 37.3
(V)


AUC Ile (I) 1106 53.7 GUG Val 948 33.6
(V)


AUU Ile I 910 44.2 GUU Val 703 24.9
V


CUA Leu (L) 166 4.7


CUC Leu (L) 1029 29.1


CUG Leu (L) 1379 38.9


CUU Leu (L) 591 16.7


UUA Leu (L) 54 1.5


UUG Leu L 323 9.1


For further optimization of gene expression in Y. lipolytica, the
consensus sequence around the 'ATG' initiation codon of 79 genes was
examined. In Figure 8, the first 'A' of the underlined ATG translation
68



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
codon is considered to be +1. Seventy seven percent of the genes
analyzed had an 'A' in the -3 position, indicating a strong preference for
'A' at this position. There was also preference for 'A' or 'C' at the -4, -2
and -1 positions, an 'A', 'C' or 'T' at position +5, and a 'G' or 'C' at
position
+6. Thus, the preferred consensus sequence of the codon-optimized
translation initiation site for optimal expressiowof genes in Y. lipolytica is
'MAMMATGNHS' (SEQ ID N0:130), wherein the nucleic acid degeneracy
code used is as follows: M=A/C; S=C/G; H=A/C/T; and N=A/C/G/T.
In Vitro Synthesis Of A Codon-Optimized Gene
The method used to synthesize the codon-optimized X17
desaturase gene is illustrated in Figure 9. First, eleven pairs of
oligonucleotides were designed to extend the entire length of the codon-
optimized coding region of the S. diclina X17 desaturase gene (e.g., D17-
1 A, D 17-1 B, D 17-2A, D 17-2 B , D 17-3A, D 17-3 B , D 17-4A, D 17-4 B, D 17-
5A,
D17-5B, D17-6A, D17-6B, D17-7A, D17-7B, D17-8A, D17-8B, D17-9A,
D17-9B, D17-10A, D17-10B, D17-11A and D17-11B, corresponding to
SEQ ID NOs:48-69). Each pair of sense (A) and anti-sense (B)
oligonucleotides were complementary, with the exception of a 4 by
overhang at each 5'-end. Additionally, primers D17-1A, D17-4B, D17-5A,
D17-8A and D17-8B also introduced Neol, Bglll and Sall restriction sites
for subsequent subcloning, respectively.
100 ng of each oligonucleotide was phosphorylated at 37 ~C for 1
hr in a volume of 20 pl containing 50 mM Tris-HCI.(pH 7.5), 10 mM MgCl2,
10 mM DTT, 0.5 mM spermidine, 0.5 mM ATP and 10 U of T4
polynucleotide kinase. Each pair of sense and antisense oligonucleotides
was mixed and annealed in a thermocycler using the following parameters:
95 ~C (2 min), 85 ~C (2 min), 65 ~C (15 min), 37 ~C (15 min), 24 ~C
(15 min) and 4 ~C (15 min). Thus, D17-1A (SEQ ID N0:48) was annealed
to D17-1 B (SEQ ID N0:49) to produce the double-stranded product "D17
1AB". Similarly, D17-2A (SEQ ID N0:50) was annealed to D17-2B (SEQ
ID N0:51 ) to produce the double-stranded product "D17-2AB", etc.
Three separate pools of annealed, double-stranded
oligonucleotides were then ligated together, as shown below:
~ Pool 1: comprised D17-1AB, D17-2AB, D17-3AB and D17-4AB;
~ Pool 2: comprised D17-5AB, D17-6AB, D17-7AB and D17-8AB; and
~ Pool 3: comprised D17-9AB, D17-10AB and D17-11AB.
69



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Each pool of annealed oligonucleotides was mixed in a volume of 20 pl
with 10 U of T4 DNA ligase and the ligation reaction was incubated
overnight at 16 ~C.
The product of each ligation reaction was then amplified by PCR.
Specifically, using the ligated "Pool 1" mixture (i.e., D17-1AB, D17-2AB,
D17-3AB, and D17-4AB) as template, and oligonucleotides D17-1 (SEQ ID
N0:70) and D17-4R (SEQ ID N0:71 ) as primers, the first portion of the
codon- .optimized o17 desaturase gene was amplified by PCR. The PCR
amplification was carried out in a 50 pl total volume, comprising PCR
buffer containing 10 mM KCI, 10 mM (NHq.)~SOq., 20 mM Tris-HCI (pH
8.75),, 2 mM MgS04, 0.1 % Triton X-100, 100 pg/mL BSA (final
concentration), 200 pM each deoxyribonucleotide triphosphate, 10 pmole
of each primer and 1 pl of PfuTurbo DNA polymerase (Stratagene, San
Diego, CA). Amplification was carried out as follows: initial denaturation at
95 °C for 3 min, followed by 35 cycles of the following: 95 °C
for 1 min,
56 °C for 30 sec, 72 °C for 40 sec. A final extension cycle of
72 °C for
10 min was carried out, followed by reaction termination at 4 °C. The
430 by PCR fragment was subcloned into the pGEM-T easy vector
(Promega) to generate pT17(1-4).
Using the ligated "Pool 2" mixture (i.e., D17-5AB, D17-6AB, D17-
7AB and D17-8AB) as template, and oligonucleotides D17-5 (SEQ ID
N0:72) and D17-8D (SEQ ID N0:73) as primers, the second portion of the
codon-optimized o17 desaturase gene was amplified similarly by PCR and
cloned into pGEM-T-easy vector to generate pT17(5-8). Finally, using the
"Pool 3" ligation mixture (i.e., D17-9AB, D17-10AB and D17-11AB) as
template, and oligonucleotides D17-8U (SEQ ID N0:74) and D17-11 (SEQ
ID N0:75) as primers, the third portion of the codon-optimized o17
desaturase gene was amplified similarly by PCR and cloned into pGEM-T-
easy vector to generate pT17(9-11 ).
E. coli was transformed separately with pT17(1-4), pT17(5-8) and
pT17(9-11 ) and the plasmid DNA was isolated from ampicillin-resistant
transformants. Plasmid DNA was purified and digested with the
appropriate restriction endonucleases to liberate the 420 by NcollBglll
fragment of pT17(1-4), the 400 by BgllllSall fragment of pT17(5-8) and the
300 by SalllNotl fragment of pT17(9-11 ). These fragments were then
combined, ligated together and used as template for amplification of the
entire synthetic codon-optimized o17 desaturase gene using D17-1 (SEQ



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
ID NO:70) and D17-11 (SEQ ID N0:75) as primers. The PCR
amplification was carried out in a 50 pl total volume, using the conditions
described above for each portion of the X17 desaturase gene and the
thermocycling program as follows: initial denaturation at 95 °C for 3
min,
followed by 35 cycles of the following: 95 °C for 1 min, 56 °C
for 30 sec,
72 °C for 1.1 min. A final extension cycle of 72 °C for 10 min
was carried
out, followed by reaction termination at 4 °C. This generated a 1.1 kB
PCR
product.
Construction Of Plasmid pYSD17S Containing The Codon-Optimized 017
Desaturase -
The 1.1 kB PCR product comprising the entire synthetic o17-
desaturase was digested with NcollNotl and subcloned into NcollNotl -
digested pY5-13 (Example 1 ) to generate pYSD17S (Figure 10A).
As an additional "control", to compare the efficiency of the wild type
and synthetic genes in Yarrowia, the AT-rich Pacl site in pYSD17
(comprising the wild-type gene; described in Example 5) was eliminated by
site-directed mutagenesis using YL53 (SEQ ID N0:76) and YL54 (SEQ ID
NO:77) as primers to generate pYSD17M (Figure 10B).
Transformation Of Yarroviria licolytica With The Codon-Optimized 017
Desaturase Gene
Plasmids containing the wildtype and codon-optimized X17
desaturase were transformed separately into Y. lipolytica ATCC #76982
according to the methods described above in Example 5. Using this
technique, transformants were obtained that contained the following
plasmids:
Table 10
Summary Of Plasmids In Transformant Yarrowia
Plasmid Descri tion


YSD17 wildt a X17 desaturase


YSD17M wildtype 017 desaturase, minus AT-rich
Pacl site


pYSD17S codon-optimized X17 desaturase


71



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
Percent Substrate Conversion With The Codon-Optimized X17 Desaturase
Gene
X17 desaturase converts ARA to EPA (see Figure 2). The percent
substrate conversion ([product]/[substrate + product]*100) of the wildtype
and codon-optimized 017 desaturase genes was determined in Yarrovvia
lipolyfica containing each alternate plasmid construct, using the
methodology described in Example 5.
The results of the ARA feeding experiments showed that Yarrovvia
strains with control plasmids pYSD17 or pYSD17M converted about 23%
of intracellular ARA to EPA (Figure 11A) while those containing the codon
optimized X17 'desaturase gene within pYSD17S converted about 45% of
intracellular ARA to EPA (Figure 11 B). Thus, Yarrowia containing the
codon-optimized o17 desaturase converted about 2-fold more ARA than
the strains containing the wild type S. diclina gene.
EXAMPLE 7
Construction Of Plasmids Suitable For The Coordinate Expression Of
_Multiple Omeaa Fatty Acid Biosynthesis Genes In Yarrowia lipolytica
A variety of expression plasmids were constructed to produce a
construct comprising a D6 desaturase, elongase, 05 desaturase, and X17
desaturase that would be suitable to integrate into the Y. lipolytica
genome. Expression of this construct was necessary to test the
hypothesize that "pure" w-3 PUFAs, without co-synthesis of any ~-6
PUFAs, could be produced in a Y. lipolytiea host containing a disrupted
native X12 desaturase (infra, Example 8).
Construction Of Plasmid pY24
Plasmid pY24 (Figure 12) was a parent vector for construction of
expression cassettes suitable for integration into the genome of Yarrowia
lipolytica. pY24 was constructed as follows:
Using oligonucleotides KU5 and KU3 (SEQ ID NOs:78 and 79) as
primers and Yarrowia genomic DNA as template, a 1.7 kB DNA fragment
(SEQ ID N0:80) containing the Yarrovvia URA3 gene was PCR amplified.
The PCR amplification was carried out in a 50 pl total volume containing:
100 ng Yarroviria genomic DNA, PCR buffer containing 10 mM KCI, 10 mM
(NH4)2SOq., 20 mM Tris-HCI (pH 8.75), 2 mM MgSOq., 0.1 % Triton X-100,
100 pg/mL BSA (final concentration), 200 pM each deoxyribonucleotide
triphosphate, 10 pmole of each primer and 1 pl of PfuTurbo DNA
polymerase (Stratagene, San Diego, CA). Amplification was carried out as
72



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
follows: initial denaturation at 95 ~C for 3 min, followed by 35 cycles of the
following: 95 ~C for 1 min, 56 ~C for 30 sec, 72 ~C for 2 min. A final
extension cycle of 72 ~C for 10 min was carried out, followed by reaction
termination at 4 ~C. The PCR product was inserted into pGEM-T easy
vector (Promega, Madison, WI) to generate pGYUM.
Using oligonucleotides K15 and KI3~(SEQ ID NOs:82 and 83), a 1.1
kB DNA fragment (SEQ ID N0:84) containing the conjugase gene (or "imp
H8") of Impatients balsams (clone ids.pk0001.h8; E. I. du Pont de
Nemours and Company, Inc., Wilmington, DE) was PCR amplified. The
PCR amplification was carried out using the components described above,
with the exception that 10 ng plasmid DNA of ids.pk0001.h8 was used as
template. Amplification was carried out as follows: initial denaturation at
95 ~C for 3 min, followed by 35 cycles of the following: 95 ~C for 1.5 min,
56 ~C for 30 sec, 72 ~C for 1.2 min. A final extension cycle of 72 ~C for
10 min was carried out, followed by reaction termination at 4 oC. The PCR
products were digested with Notl, and then inserted into the Notl site of
pY5 (Figure 3) to generate pY9.
Using oligonucleotides KT15 and KT13 (SEQ ID NOs:86 and 87), a
1.7 kB DNA fragment (SEQ ID N0:88) containing the TEF::IMP HB::XPR
chimeric gene of pY9 was PCR amplified. The PCR amplification was
carried out as described above, with the exception that 10 ng plasmid DNA
of pGYUM was used as template. Amplification was carried out as
follows: initial denaturation at 95 ~C for 3 min, followed by 35 cycles of the
following: 95 ~C for 1 min, 56 ~C for 30 sec, 72 ~C for 2 min. A final
extension cycle of 72 ~C for 10 min was carried out, followed by reaction
termination at 4 ~C. The PCR products were inserted into PCR-Script
(Stratagene) to generate pY9R. The 1.7 kB XholEcoRV fragment of pY9R
was exchanged with the XhollEcoRV fragment of pGYUM to generate
pY2 f .
Using oligonucleotides KH5 and KH3 (SEQ ID NOs:90 and 91 )'as
primers and genomic DNA of KS65 as template, a 1 kB DNA fragment
(SEQ ID N0:92) containing the E, coli hygromycin resistance gene ("HPT";
Kaster, K.R., et al., Nucleic Acids Res. 11:6895-6911 (1983)) was PCR
amplified. The PCR amplification was carried out in a 50 pl total volume
using the components described above, with the exception that 10 ng
plasmid DNA of ids.pk0001.h8 was used as template. Amplification was
carried out as follows: initial denaturation at 95 ~C for 3 min, followed by
73



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
35 cycles of the following: 95 ~C for 1 min, 56 oC for 30 sec, 72 ~C for
1.2 min. A final extension cycle of 72 ~C for 10 min was carried out,
followed by reaction termination at 4 ~C. The PCR products were digested
with Notl and then inserted into the Notl site of pY5 (Figure 3) to generate
pTHPT-1.
Using oligonucleotides KTH5 and KTH3 (SEQ ID NOs:94 and 95)
as primers and pTHPT-1 plasmid DNA as template, a 1.6 kB DNA
fragment (SEQ ID N0:96) containing the TEF::HPT::XPR fusion gene was
amplified as described above. The PCR products were digested with Bglll
and then inserted into pY21 to generate pY24.
Construction Of pY24-4
Plasmid pY24 (Figure 12) was used for construction of expression
cassettes suitable for integration into the Y. lipolytica genome. The 401 by
of 5'-sequence (SEQ ID N0:98) and 568 by of 3'-sequence (SEQ ID
N0:99) from the Yarrovvia lipolytica URA3 gene in pY24 plasmid were
usedao direct integration of expression cassettes into the Ura loci of
theYarrowia genome. Two chimeric genes (TEF::HPT::XPR and TEF::IMP
HB::XPR) were first removed from pY24 by digestion with BamHl and self-
ligation to generate pY24-1. Pacl and BsiV1/1 sites were introduced into
pY24-1 by site-directed mutagenesis using YL63 and YL64 (SEQ ID
NOs:100 and 101 ) and YL65 and YL66 (SEQ ID NOs:102 and 103) primer
pairs, respectively, to generate pY24-4.
Construction Of An Integration Vector For Expression Of ~5 Desaturase
The 4261 by PacllBsillVl fragment of pYMASpb (comprising the M.
alpina 05 desaturase gene; described in Example 5) was ligated into the
PacllBsiVlll sites of pY24-4 (Figure 12) to generate pYZM5 (Figure 6).
Hindlll and Clal sites were introduced into pYZM5 by site-directed
mutagenesis using primer pairs YL81 and YL82 (SEQ ID NOs:104 and
105) and YL83 and YL84 (SEQ ID NOs:106 and 107), respectively, to
generate pYZMSCH (Figure 6). A Pmel site was introduced into
pYZMSCH by site-directed mutagenesis using YL105 and YL106 (SEQ ID
NOs:108 and 109) as primers to generate pYZMSCHPP. An Ascl site was
introduced into pYZMSCHPP by site-directed mutagenesis using YL119
and YL120 (SEQ ID NOs:110 and 111 ) as primers to generate
pYZMSCHPPA (Figure 6).
To optimize the integration vector, 440 by of 5'-non-coding DNA
sequence upstream from the Yarroviria lipolytica URA3 gene (SEQ ID
74



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
N0:114) was amplified by PCR using YL121 and YL122 (SEQ ID NOs:112
and 113) as primers. The PCR product was digested with Ascl and BsiV1/I
and then exchanged with the AscllBsiV1/I fragment of pYZMSCHPPA
(Figure 6 and 13) to generate pYZMSUPA (Figure 13). An Ascl site was
introduced into pYZMSUPA by site-directed mutagenesis using
oligonucleotides YL114 and YL115 (SEQ ID NOs:115 and 116) to
generate pYZV5: In order to reduce the size of the 3'-non-coding region of
the URA3 gene in pYZVS, a second. Pacl site was introduced into the
middle of this region by site-directed mutagenesis using oligonucleotides
YL114 and YL1~15 (described above) to generate pYZVSP. The Pacl
fragment of pYZVSP was excised by digestion with Pacl and religation to
generate pYZV16 (Figure 13). Digestion of pYZV16 with Ascl liberates a
5.2 kB DNA fragment (SEQ ID N0:117) suitable for integration and
expression of the O5 desaturase gene ("MAD5") in the Y. lipolytica
genome.
Construction Of An Integration Vector For Expression Of The Hiah Affinity
Elonaase And ~5 Desaturase
BsiVlll and Hindlll sites were introduced into pY58 (containing the
coding region of the M. alpina high affinity PUFA elongase; described in
Example 5) by site-directed mutagenesis using YL61 and YL62 (SEQ ID
NOs:17.and 18) and YL69 and YL70 (SEQ ID NOs:118 and 119) primer
pairs, respectively, to generate pY58BH (Figure 14; elongase gene
labeled as "EL"). The 1.7 kB BsiWllHindlll fragment of pY58BH, which
contains the TEF::EL::XPR chimeric gene, was ligated into the
BsihVllHindlll site of pYZMSCHPP (construction described in Figure,6) to
generate pYZMSEL (Figure 14). This plasmid is suitable for integration and
coordinate expression of the M. alpina 05 desaturase and high affinity
PUFA elongase genes in Y. lipolytica.
Construction Of An Integration Vector For Expression Of The 06
Desaturase Hiah Affinity Elongase And 05 Desaturase
Pacl and Clal sites were introduced into pY54 (containing the M.
alpina 06 desaturase; described in Example 5) by site-directed
mutagenesis using YL77 and YL78 (SEQ ID NOs:120 and 121) and
YL79A and YL80A (SEQ ID NOs:122 and 123) primer pairs, respectively,
to generate pY54PC (Figure 14; 06 desaturase gene labeled as "MADE").
The 2 kB ClallPacl DNA fragment of pY54PC, which contains the
TEF::MAD6::XPR chimeric gene, was ligated into the ClallPacl sites of



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
pYZMSEL to generate pYZM5EL6 (Figure 14). This plasmid is suitable for
integration and coordinate expression of the M, alpina D6 desaturase, O5
desaturase and high affinity PUFA elongase genes in the Y. lipolytica
genome.
Construction Of A DNA Fragment Suitable For Integration Into The
Yarrowia Genome, For Expression Of The ~6 Desaturase, PUFA
Elongase And ~5 Desaturase
The plasmid pYZV16 (construction described in Figure 13) was
used for construction of plasmids containing multiple expression cassettes.
First, the 3.5 kB BsiVllllPacl fragment of pYZV16 was ligated to the
7.9 kB BsiVllllPacl fragment of pYZM5EL6 (construction described in
Figure 14) to generate pYZV5EL6 (Figure 15). Digestion of pYZV5EL6
with Ascl liberates a 8.9 kB DNA fragment (SEQ ID N0:124) suitable for
integration and coordinate expression of the 06 desaturase, PUFA
elongase and ~5 desaturase genes in the Y. lipolytica genome.
Construction Of A DNA Fragment Suitable For Integration Into The
Yarrotlvia Genome, For Expression Of The ~6 Desaturase PUFA
Elongase, ~5 Desaturase And 017 Desaturase
A synthetic S. diclina 017 desaturase gene was inserted into the
NcollNotl sites of pY5-13 to generate pYSD17S (Figure 10A). Clal and
Pmel sites were introduced into pYSD17S by site-directed mutagenesis
using YL101 and YL102 (SEQ ID NOs:125 and 126) and YL103 and
YL104 (SEQ ID NOs:127 and 128) primer pairs, respectively, to generate
pYSD17SPC (Figure 15).
The 347 by ClallPmel fragment of pYZV5EL6 (Figure 15) was
exchanged with the 1760 by ClallPmel fragment from pYSD17SPC
containing the 017 desaturase expression cassette to generate
pYZV5E6/17. Digestion of pYZV5E6/17 with Ascl liberates a 10.3 kB DNA
fragment (SEQ ID N0:129) suitable for integration and coordinate
expression of the ~6 desaturase, PUFA elongase, 05 desaturase and 017
desaturase genes in the Y. lipolytica genome.
EXAMPLE 8
Use Of X12 Desaturase Disrupted Strains For The Production Of
Pure Omega-3 Fatty Acids By Substrate Feeding
The present Example describes the utility of a 012 desaturase-
disrupted Yarrowia lipolytica host strain containing appropriate
heterologous genes (e.g., a D6 desaturase, elongase, D5 desaturase, X17
76



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
desaturase, as described in Example 7) for the production of cu-3 PUFAs,
without co-synthesis of any ~-6 PUFAs. Feeding studies were performed
with ALA as the substrate. The results demonstrate that it is possible to
produce w-3 PUFAs in the absence of w-6 PUFAs.
Feeding Studies
Wildtype Yarrovvia lipolytica ATCC #76982 was transformed with an
integrating 10.3 kB DNA fragment (SEQ ID N0:129) containing a ~6
desaturase, PUFA elongase, ~5 desaturase and X17 desaturase (from
Example 7). This resulted in creation of strain "WT+4G". Then, the X12
desaturase was disrupted in strain WT+4G, as described in Example 2.
This resulted in creation of strain "D12KO+4G".
Cells from each of the four strains listed below in Table 11 (100 ~I)
were grown in 3 mL minimal media containing no substrate addition, 10 ~,g
of LA, 10 ug ALA, or 5 ug each of LA and ALA for about 24 hr at 30 °C.
Table 11
Description Of Strains Tested In The Feedinct Studies
Strain Desi nationStrain Descri tion Example


WT wild-type Yarrowia lipolytica ---
ATCC


#76982


WT + 4G wild-type Yarrovvia lipolytica, 8
containing a


~6 desaturase, PUFA elongase,
~5


desaturase


and X17-desaturase


D12K0 X12 desaturase-disrupted Yarrowia2


lipol ica


D12K0+4G X12 desaturase-disrupted YarroINia8


lipolytica, containing a ~6 desaturase,


PUFA elongase,


~5 desaturase and 017 desaturase


Fatty acid composition was determined by direct transesterification,
as described in the General Methods. The fatty acid profile of each of the
strains grown with no substrate addition, 10 ~,g of LA, 10 ug ALA, or 5 ug
each of LA and ALA are shown below in Table 12. Fatty acids are
identified as 16:0, 16:1, 18:0, 18:1 (oleic acid), 18:2 (LA), GLA, DGLA,
77



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
ALA, and STA. The composition of each is presented as a % of the total
fatty acids.
Table 12
Fatty Acid Composition (% Of Total Fatty Acids)
Strain FA 16 16 18 18 18 GLA DGLA ALA STA


:0 :1 :0 :1 :2


feed


WT None 10 6 8 50 21 nd nd nd nd


WT LA 11 3 6 30 47 nd nd nd nd


WT ALA 9 3 4 29 5 nd nd 48 nd


D12K0 None 10 7 8 68 nd nd nd nd nd


D12K0 LA 9 4 5 26 53 nd nd nd nd


D12K0 ALA 10 4 6 41 nd nd nd 35 nd


WT+4G None 11 6 7 57 6 5 0.9 nd nd


WT+4G LA ' 3 6 32 31 9 1.0 nd nd
11


WT+4G ALA 9 3 5 31 2 1 0.2 40 4


WT+4G LA+A 6 1 2 10 33 4 0.3 39 2


D12K0+4G None 9 6 8 69 nd nd nd nd nd


D12K0+4G LA 8 2 6 24 45 10 1.0 nd nd


D12K0+4G ALA 8 5 5 45 nd nd nd 27 4


D12K0+4G LA+A 7 2 4 14 26 4 0.3 37 3


*nd= not detectable
The results showed that feeding ALA to D12 KO cells resulted in
the production of only cu-3 fatty acids (i.e.,, ALA and STA), without
biosynthesis of any w-6 fatty acids (i.e., GLA or DGLA).
78



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.5T25
SEQUENCE LISTING
<110> E.I. du Pont de Nemours and Company, Inc.
<120> A DELTA-12 DESATURASE GENE SUITABLE FOR ALTERING LEVELS OF
POLYUNSATURATED FATTY ACIDS IN OLEAGINOUS YEASTS
<130> CL2301 PCT
<150> US 60/484209
<151> 2003-06-30
<150> US 60/468677
<151> 2003-05-07
<160> 130
<170> Patentln version 3.2
<Z10> 1
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer TEFS'
<400> 1
agagaccggg ttggcggcg 19
<210> 2
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer TEF3'
<400> Z
ttggatcctt tgaatgattc ttatactcag 30
<210> 3
<211> 29
<212> DNA
<213> Artificial sequence
<220>
<223> Primer XPRS'
<400> 3
tttccgcggc ccgagattcc ggcctcttc 29
<210> 4
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer XPR3'
<400> 4
tttccgcgga cacaatatct ggtcaaattt c 31
Page 1



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.ST25
<210> 5
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL1
<400> 5
cagtgccaaa agccaaggca ctgagctcgt 30
<210> 6
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL2
<400> 6
gacgagctca gtgccttggc ttttggcact g 31
<Z10> 7
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL3
<400> 7
gtataagaat cattcaccat ggatccacta gttcta 36
<210> 8
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL4
<400> 8
tagaactagt ggatccatgg tgaatgattc ttatac 36
<210> 9
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL23
<400> 9
atggatccac tagttaatta actagagcgg ccgcca 36
<210> 10
<211> 36
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL24
Page 2



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.5T25
<400> 10
tggcggccgc tctagttaat taactagtgg atccat 36
<210> 11
<211> 39
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL5
<400> 11
cccccctcga ggtcgatggt gtcgataagc ttgatatcg 3g
<210> 12
<211> 39
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL6
<400> 12
cgatatcaag cttatcgaca ccatcgacet cgagggggg 3g
<210> 13
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL9
<400> 13
tggtaaataa atgatgtcga ctcaggcgac gacgg 35
<210> 14
<211> 35
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL10
<400> 14
ccgtcgtcgc ctgagtcgac atcatttatt tacca 35
<210> 15
<211> 37
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL7
<400> 15
caaccgattt cgacagttaa ttaataattt gaatcga 37
<210> 16
<211> 37
<212> DNA
<213> Artificial Sequence
Page 3



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
~~2301-conversion.sT25
<220>
<223> Primer YL8
<400> 16
tcgattcaaa ttattaatta actgtcgaaa tcggttg 37
<210> 17
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL61
<400> 17
acaattccac acaacgtacg agccggaagc ata 33
<210> 18
<211> 33
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL62
<400> 18
tatgcttccg gctcgtacgt tgtgtggaat tgt 33
<210> 19
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<223> Degenerate primer P73
<220>
<221> misc_feature
<222> (24)..(24)
<223> n is a, c, g, or t
<400> 19
tgggtcctgg gccaygartg yggnca 26
<210> 20
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> Consensus sequence in deltal2 desaturases
<400> 20
irp val Leu Gly 5is Glu Cys Gly His
<210> 21
<211> 30
<212> DNA
<213> Artificial Sequence
Page 4



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.5T25
<220>
<223> Degenerate primer P76
<Z20>
<221> misc_feature
<222> (25)..(25)
<223> n is a, c, g, or t
<220>
<221> misc_feature
<222> (28)..(28)
<223> n is a, c, g, or t
<400> 21
ggtggcctcc tcggcgtgrt araanggnat 30
<210> 22
<211> 10
<212> PRT
<213> Artificial Sequence
<220>
<223> consensus sequence in deltal2 desaturases
<220>
<221> MISC_FEATURE
<222> (1)..(1)
<223> xaa = Met or Ile
<400> 22
xaa Pro Phe Tyr His Ala Glu Glu Ala Thr
1 5 10
<210> 23
<211> 1936
<212> DNA
<213> Yarrowia lipolytica
<220>
<221> CDS
<222> (283)..(1539)
<400> 23
cgtagttata tacaagaggt agatgcgtgc tggtgttaga ggggctctca ggattaggag 60
gaaaatttga cattggccct caacatataa cctcgggtgt gcctctgttt accctcagct~ 120
tttgcttgtc cccaagtcag tcacgccagg ccaaaaaggt tggtggattg acagggagaa 180
aaaaaaaagc ctagtgggtt taaactcgag gtaagacatt gaaatatata ccggtcggca 240
tcctgagtcc ctttctcgta ttccaacaga ccgaccatag as atg gat tcg acc 294
Met Asp Ser Thr
1
acg cag acc aac acc ggc acc ggc aag gtg gcc gtg cag ccc ccc acg 342
Thr Gln Thr Asn Thr G~Iy Thr G~Iy Lys Val Ala Val Gln Pro Pro Thr
10 15 20
gcc ttc att aag ccc att gag aag gtg tcc gag ccc gtc tac gac acc 390
Ala Phe Ile Lys Pro Ile Glu Lys Val Ser Glu Pro Val Tyr Asp Thr
Page 5



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cL2301-conversion.ST25
25 30 35
tttggcaac gagttcact cctccagac tactctatc aaggatatt ctg 438


PheGlyAsn GluPheThr ProProAsp TyrSerIle LysAspIle Leu


40 45 50


gatgccatt ccccaggag tgctacaag cggtcctac gttaagtcc tac 486


AspAlaIle ProGlnGlu CysTyrLys ArgSerTyr ValLysSer Tyr


55 60 65


tcgtacgtg gcccgagac tgcttcttt atcgccgtt tttgcctac atg 534


SerTyrVal AlaArgAsp CysPhePhe IleAlaVal PheAlaTyr Met


70 75 80


gcctacgcg tacctgcct cttattccc tcggettcc ggccgaget gtg 582


AlaTyrAla TyrLeuPro LeuIlePro SerAlaSer GlyArgAla Val


85 90 95 100


gcctgggcc atgtactcc attgtccag ggtctgttt ggcaccggt ctg 630


AlaTrpAla MetTyrSer IleValGln GlyLeuPhe GlyThrGly Leu


105 110 115


tgggttctt gcccacgag tgtggccac tctgetttc tccgactct aac 678


TrpValLeu AlaHisGlu CysGlyHis SerAlaPhe SerAspSer Asn


120 125 130


accgtcaac aacgtcacc ggatgggtt ctgcactcc tccatgctg gtc 726


ThrValAsn AsnValThr GlyTrpVal LeuHisSer SerMetLeu Val


135 140 145


ccttactac gcctggaag ctgacccac tccatgcac cacaagtcc act 774


ProTyrTyr AlaTrpLys LeuThrHis SerMetHis HisLysSer Thr


150' 155 160


ggtcacctc acccgtgat atggtgttt gtgcccaag gacc as
a 822


GlyHisLeu ThrArgAsp MetValPhe ValProLys AspAggLyg Glg


165 170 175 180


tttatggag aaccgaggc gcccatgac tggtctgag cttgetgag gac 870


PheMetGlu AsnArgGIy AlaHisAsp TrpSerGlu LeuAlaGlu Asp


185 190 195


getcccctc atgaccctc tacg9cctc atcacccag caggtgttt g 918
a


AlaProLeu MetThrLeu TyrGlyLeu IleThrGln GlnValPhe G~y


200 205 210


tggcctctg tatctgctg tcttacgtt accggacag aagtacccc aag 966


TrpProLeu TyrLeuLeu SerTyrVal ThrGlyGln LysTyrPro Lys


215 220 225


ctcaacaaa tgggetgtc aaccacttc aaccccaac gccccgctg ttt 1014


LeuAsnLys TrpAlaVal AsnHisPhe AsnProAsn AlaProLeu Phe


230 235 240


gagaagaag gactggttc aacatctgg atctctaac gtcggtatt ggt 1062


GluLysLys AspTrpPhe AsnIleTrp IleSerAsn ValGlyIle Gly


245 250 255 260


atcaccatg tccgtcatc gcatactcc atcaaccga tggggcctg get 1110


IleThrMet SerValIle AlaTyrSer IleAsnArg TrpGlyLeu Ala


265 270 275


tccgtcacc ctctactac ctgatcccc tacctgtgg gtcaaccac tgg 1158


SerValThr LeuTyrTyr LeuIlePro TyrLeuTrp ValAsnHis Trp


280 285 290


ctcgtggcc atcacctac ctgcagcac accgacccc actctgccc cac 1206


LeuValAla IleThrTyr LeuGlnHis ThrAspPro ThrLeuPro His


Page
6





CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.ST25
295 300 305
tac cac gac cag tgg aac acc ggagccgccgcc acc atc 1254
gcc ttc cga


Tyr His Asp Gln Trp Asn Thr GlyAlaAlaAla Thr Ile
Ala Phe Arg


310 315 320


gac cga ttt ggc ttc atc tcc tgcttccatgac atc atc 1302
gag ggc ttc


Asp Arg Phe Gly Phe Ile Ser CysPheHisAsp Ile Ile
Glu Gly Phe


325 330 335 340


gag acc gtt ctg cac cac gtg cgaattcccttc tac aac 1350
cac tac tct


Glu Thr Val Leu His His Val ArgIleProPhe Tyr Asn
His Tyr Ser


345 350 355


gcc cga gcc act gag aag aag gtcatgggcaag cac tac 1398
atc atc aag


Ala Arg Ala Thr Glu Lys Lys ValMetGlyLys His Tyr
Ile Ile Lys


360 365 370


cga cac gac acc aac ttc aag ctttacactgtc gcc cga 1446
gac atc tct


Arg His Asp Thr Asn Phe Lys LeuTyrThrVal Ala Arg
Asp Ile Ser


375 380 385


acc tgc ttt gtt gaa ggt gaa attcagatgttt aga aac 1494
cag aag ggc


Thr Cys Phe Val Glu Gly Glu IleGlnMetPhe Arg Asn
Gln Lys Gly


390 395 400


gtc aat gtc gga gtt get gac ctgccttctaaa aag 1539
gga cct ggc


Val Asn Val Gly Val Ala Asp LeuProSerLys Lys
Gly Pro G1y


405 410 415


tagagctagaaatgttattt taactgaaca gcaccgagcc 1599
gattgtgttt cgaggctaag


ccaagcgaagccgaggggtt tggacgtaacgagtaggcga 1659
gtgtagtcca tatcaccgca


ctcggcactgcgtgtctgcg aagtcacattacgctgacaa 1719
ttcatgggcg ccgttgtagt


ttccctttagtatcaatact ccggtctcgtactcgtactg 1779
gttacaagta atacgaatct


gtgggaagaagtcaccctta atactgatgtttcggatatc 1839
tcagaccttc aatagaactg


gcatagagccgttaaagaag tcactccaaccctcctactt 1899
tttcacttaa gtagattcaa


gcagatcgataagatggatt tgctagc 1936
tgatggtcag


<210>
24


<211>
419


<212>
PRT


<213> lipolytica
Yarrowia


<400> 24
Met Asp Ser Thr Thr Gln Thr Asn Thr Gly Thr Gly Lys Val Ala Val
1 5 10 15
Gln Pro Pro Thr Ala Phe Ile Lys Pro Ile Glu Lys Val Ser Glu Pro
20 25 30
Val Tyr Asp Thr Phe Gly Asn Glu Phe Thr Pro Pro Asp Tyr Ser Ile
35 40 45
Lys Asp Ile, Leu Asp Ala Ile Pro Gln Glu Cys Tyr Lys Arg Ser Tyr
50 55 60
Page 7



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.5T25
Val Lys Ser Tyr Ser Tyr Val Ala Arg Asp Cys Phe Phe Ile Ala Val
65 70 75 80
Phe Ala Tyr Met Ala Tyr Ala Tyr Leu Pro Leu Ile Pro Ser Ala Ser
85 ~ 90 95
Gly Arg Ala Val Ala Trp Ala Met Tyr Ser Ile Val Gln Gly Leu Phe
100 105 110
Gly Thr Gly Leu Trp Val Leu Ala His Glu Cys Gly His Ser Ala Phe
115 120 125
Ser Asp Ser Asn Thr Val Asn Asn Val Thr Gly Trp Val Leu His Ser
130 135 140
Ser Met Leu Val Pro Tyr Tyr Ala Trp Lys Leu Thr His Ser Met His
145 150 155 160
His Lys Ser Thr Gly His Leu Thr Arg Asp Met Val Phe Val Pro Lys
165 170 175
Asp Arg Lys Glu Phe Met Glu Asn Arg Gly Ala His Asp Trp Ser Glu
180 185 190
Leu Ala Glu Asp Ala Pro Leu Met Thr Leu Tyr Gly,Leu Ile Thr Gln
195 200 205
Gln Val Phe Gly Trp Pro Leu Tyr L'eu Leu Ser Tyr Val Thr Gly Gln
210 215 220
Lys Tyr Pro Lys Leu Asn Lys Trp Ala Val Asn His Phe Asn Pro Asn
225 230 235 240
Ala Pro Leu Phe Glu Lys Lys Asp Trp Phe Asn Ile Trp Ile Ser Asn
245 250 255
Val Gly Ile Gly Ile Thr Met Ser Val Ile Ala Tyr Ser Ile Asn Arg
260 265 270
Trp Gly Leu Ala Ser Val Thr Leu Tyr Tyr Leu Ile Pro Tyr Leu Trp
275 280 285
Val Asn His Trp Leu Val Ala Ile Thr Tyr Leu Gln His Thr Asp Pro
290 295 300
Thr Leu Pro His Tyr His Ala Asp Gln Trp Asn Phe Thr Arg Gly Ala
305 310 315 320
Ala Ala Thr Ile Asp Arg Glu Phe Gly Phe Ile Gly Ser Phe Cys Phe
325 330 335
Page 8



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
.~~z301-conversion.5T25
His Asp Ile Ile Glu Thr His Val Leu His His Tyr Val Ser Arg Ile
340 345 350
Pro Phe Tyr Asn Ala Arg Ile Ala Thr Glu Lys Ile Lys Lys Val Met
355 360 365
Gly Lys His Tyr Arg His Asp Asp Thr Asn Phe Ile Lys Ser Leu Tyr
370 375 380
Thr Val Ala Arg Thr Cys Gln Phe Val Glu Gly Lys Glu Gly Ile Gln
385 390 395 400
Met Phe Arg Asn Val Asn Gly Val Gly Val Ala Pro Asp Gly Leu Pro
405 410 415
Ser Lys Lys
<210> 25
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer P99
<400> 25
ggcaagctta acgccccgct gtttgagaa 29
<210> 26
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer P100
<400> 26
tgacgttgtt agatctacgt gggtctcgat gatgtc 36
<210> 27
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer P101
<400> 27
gacccacgta gatctaacaa cgtcaccgga tgggt 35
<210>28


<211>29


<212>DNA


<213>Artificial Sequence


<220>


<223>Primer P102


Page 9



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
c~2301-conversion.ST25
<400> 28
cgggaattcg gggttgaagt ggttgacag 29
<210> 29
<211> 16
<212> DNA
<213> Artificial sequence
<220>
<223> Primer P119
<400> 29
taataacgcc agggtt 16
<210> 30
<211> 22
<212> DNA
<213> Artificial sequence
<220>
<223> Primer P120
<400> 30
gtagaagggc attcgagaca cg 22
<210> 31
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer P121
<400> 31
tgtgcccaag gaccgaaagg ag 22
<210> 32
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer P122
<400> 32
tgcaggtagg tgatggccac gagttggg 28
<210> 33
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer P147
<400> 33
tcatgccatg gattcgacca cgcag Z5
<210> 34
<211> 26
<212> DNA
<213> Artificial Sequence
Page 10



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.sT25
<220>
<223> Primer P148
<400> 34
acatgcggcc gcctactttt tagaag 26
<210>
35


<211>
1077


<212>
DNA


<213> olegnia
sapr diclina


<400>
35


atgactgaggataagacgaaggtcgagttcccgacgctcacggagctcaagcactcgatc6b


ccgaacgcgtgctttgagtcgaacctcggcctctcgctctactacacggcccgcgcgatc120


ttcaacgcgtcggcctcggcggcgctgctctacgcggcgcgctcgacgccgttcattgcc180


gataacgttctgctccacgcgctcgtttgcgccacctacatctacgtgcagggcgtcatc240


ttctggggcttcttcacggtcggccacgactgcggccactcggccttctcgcgctaccac300


agcgtcaactttatcatcggctgcatcatgcactctgcgattttgacgccgttcgagagc360


tggcgcgtgacgcaccgccaccaccacaagaacacgggcaacattgataaggacgagatc420


ttttacccgcaccggtcggtcaaggacctc,caggacgtgcgccaatgggtctacacgctc480


ggcggtgcgtggtttgtctacttgaaggtcgggtatgccccgcgcacgatgagccacttt540


gacccgtgggacccgctcctccttcgccgcgcgtcggccgtcatcgtgtcgctcggcgtc600


tgggccgccttcttcgccgcgtacgcgtacctcacatactcgctcggctttgccgtcatg660


ggcctctactactatgcgccgctctttgtctttgcttcgttcctcgtcattacgaccttc720


ttgcaccacaacgacgaagcgacgccgtggtacggcgactcggagtggacgtacgtcaag780


ggcaacctctcgagcgtcgaccgctcgtacggcgcgttcgtggacaacctgagccaccac840


attggcacgcaccaggtccaccacttgttcccgatcattccgcactacaagctcaacgaa900


gccaccaagcactttgcggccgcgtacccgcacctcgtgcgcaggaacgacgagcccatc960


atcacggccttcttcaagaccgcgcacctctttgtcaactacggcgctgtgcccgagacg1020


gcgcagatcttcacgctcaaagagtcggccgcggccgccaaggccaagtcggactaa 1077


<210>
36


<211>
1374


<212>
DNA


<213> ierella
Mort alpina
AF465281


<400>
36


atggctgctgctcccagtgtgaggacgtttactcgggccgaggttttgaatgccgaggct60


ctgaatgagggcaagaaggatgccgaggcacccttcttgatgatcatcgacaacaaggtg120


tacgatgtccgcgagttcgtccctgatcatcccggtggaagtgtgattctcacgcacgtt180


ggcaaggacggcactgacgtctttgacacttttcaccccgaggctgcttgggagactctt240


gccaacttttacgttggtgatattgacgagagcgaccgcgatatcaagaatgatgacttt300


gcggccgaggtccgcaagctgcgtaccttgttccagtctcttggttactacgattcttcc360


Page 11





CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.sT25
aaggcatactacgccttcaaggtctcgttcaacctctgcatctggggtttgtcgacggtc420


attgtggccaagtggggccagacctcgaccctcgccaacgtgctctcggctgcgcttttg480


ggtctgttctggcagcagtgcggatggttggctcacgactttttgcatcaccaggtcttc540


caggaccgtttctggggtgatcttttcggcgccttcttgggaggtgtctgccagggcttc600


tcgtcctcgtggtggaaggacaagcacaacactcaccacgccgcccccaacgtccacggc660


gaggatcccgacattgacacccaccctctgttgacctggagtgagcatgcgttggagatg720


ttctcggatgtcccagatgaggagctgacccgcatgtggtcgcgtttcatggtcctgaac780


cagacctggttttacttccccattctctcgtttgcccgtctctcctggtgcctccagtcc840


attctctttgtgctgcctaacggtcaggcccacaagccctcgggcgcgcgtgtgcccatc900


tcgttggtcgagcagctgtcgcttgcgatgcactggacctggtacctcgccaccatgttc960


ctgttcatcaaggatcccgtcaacatgctggtgtactttttggtgtcgcaggcggtgtgc1020


ggaaacttgttggcgatcgtgttctcgctcaaccacaacggtatgcctgtgatctcgaag1080


gaggaggcggtcgatatggatttcttcacgaagcagatcatcacgggtcgtgatgtccac1140


ccgggtctatttgccaactggttcacgggtggattgaactatcagatcgagcaccacttg1200


ttcccttcgatgcctcgccacaacttttcaaagatccagcctgctgtcgagaccctgtgc1260


aaaaagtacaatgtccgataccacaccaccggtatgatcgagggaactgcagaggtcttt1320


agccgtctgaacgaggtctccaaggctacctccaagatgggtaaggcgcagtaa 1374


<210>
37


<211>
457


<212>
PRT


<213>
Mortierella
alpina
AF465281


<400> 37
Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu
1 5 10 15
Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe
20 25 30
Leu Met Ile Ile Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro
35 40 45
Asp His Pro Gly Gly Ser Val Ile Leu Thr His Val Gly Lys Asp Gly
50 55 60
Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu
65 70 75 80
Ala Asn Phe Tyr Val Gly Asp Ile Asp Glu Ser Asp Arg Asp Ile Lys
85 90 95
Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gln
Page 12



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
~~2301-conversion.ST25
100 105 110
Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val
115 120 125
Ser Phe Asn Leu Cys Ile Trp Gly Leu Ser Thr Val Ile Val Ala Lys
130 135 140
Trp Gly Gln Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu
145 150 155 160
Gly Leu Phe Trp Gln Gln Cys Gly Trp Leu Ala His Asp Phe Leu His
165 170 175
His Gln Val Phe Gln Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe
180 185 190
Leu Gly Gly Val Cys Gln Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys
195 200 205
His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp
210 215 220
Ile Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met
225 230 235 240
Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe
245 250 255
Met Val Leu Asn Gln Thr Trp Phe Tyr Phe Pro Ile Leu Ser Phe Ala
260 265 270
Arg Leu Ser Trp Cys Leu Gln Ser Ile Leu Phe Val Leu Pro Asn Gly
275 280 285
Gln Ala His Lys Pro Ser Gly Ala Arg Val Pro Ile Ser Leu Val Glu
290 295 300
Gln Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe
305 310 315 320
Leu Phe Ile Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser
325 330 335
Gln Ala Val Cys Gly Asn Leu Leu Ala Ile Val Phe Ser Leu Asn His
340 345 350
Asn Gly Met Pro Val Ile Ser Lys Glu Glu Ala Val Asp Met Asp Phe
355 360 365
Phe Thr Lys Gln Ile Ile Thr Gly Arg Asp Val His Pro Gly Leu Phe
Page 13



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.ST25
370 375 380
Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu
385 390 395 400
Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys Ile Gln Pro Ala Val
405 410 415
Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met
420 425 430
Ile Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys
435 440 445
Ala Thr Ser Lys Met Gly Lys Ala Gln
450 455
<210>
38


<211>
1341


<212>'
DNA


<213> ierella
Mort alpina
AF067654


<400>
,38


atgggaacggaccaaggaaaaaccttcacctgggaagagctggcggcccataacaccaag60


gacgacctactcttggccatccgcggcagggtgtacgatgtcacaaagttcttgagccgc120


catcctggtggagtggacactctcctgctcggagctggccgagatgttactccggtcttt180


gagatgtatcacgcgtttggggctgcagatgccattatgaagaagtactatgtcggtaca240


ctggtctcgaatgagctgcccatcttcccggagccaacggtgttccacaaaaccatcaag300


acgagagtcgagggctactttacggatcggaacattgatc.ccaagaatagaccagagatc360


tggggacgatacgctcttatctttggatccttgatcgcttcctactacgcgcagctcttt420


gtgcctttcgttgtcgaacgcacatggcttcaggtggtgtttgcaatcatcatgggattt480


gcgtgcgcacaagtcggactcaaccctcttcatgatgcgtctcacttttcagtgacccac540


aaccccactgtctggaagattctgggagccacgcacgactttttcaacggagcatcgtac600


ctggtgtggatgtaccaacatatgctcggccatcacccctacaccaacattgctggagca660


gatcccgacgtgtcgacgtctgagcccgatgttcgtcgtatcaagcccaaccaaaagtgg720


tttgtcaaccacatcaaccagcacatgtttgttcctttcctgtacggactgctggcgttc780


aaggtgcgcattcaggacatcaacattttgtactttgtcaagaccaatgacgctattcgt840


gtcaatcccatctcgacatggcacactgtgatgttctggggcggcaaggctttctttgtc900


tggtatcgcctgattgttcccctgcagtatctgcccctgggcaaggtgctgctcttgttc960


acggtcgcggacatggtgtcgtcttactggctggcgctgaccttccaggcgaaccacgtt1020


gttgaggaagttcagtggccgttgcctgacgagaacgggatcatccaaaaggactgggca1080


gctatgcaggtcgagactacgcaggattacgcacacgattcgcacctctggaccagcatc1140


actggcagcttgaactaccaggctgtgcaccatctgttccccaacgtgtcgcagcaccat1200


Page 14





CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cL2301-conversion.5T25
tatcccgata ttctggccat catcaagaac acctgcagcg agtacaaggt tc.catacctt 1260
gtcaaggata cgttttggca agcatttgct tcacatttgg agcacttgcg tgttcttgga 1320
ctccgtccca aggaagagta g 1341
<210> 39
<211> 446
<212> PRT
<213> Mortierella alpina AF067654
<400> 39
Met Gly Thr Asp Gln Gly Lys Thr Phe Thr Trp Glu Glu Leu Ala Ala
1 5 10 15
His Asn Thr Lys Asp Asp Leu Leu Leu Ala Ile Arg Gly Arg Val Tyr
20 25 30
Asp Val Thr Lys Phe Leu Ser Arg His Pro Gly Gly Val Asp Thr Leu
35 40 45
Leu Leu Gly Ala Gly Arg Asp Val Thr Pro Val Phe Glu Met Tyr His
50 55 60
Ala Phe Gly Ala Ala Asp Ala Ile Met Lys Lys Tyr Tyr Val Gly Thr
65 70 75 80
Leu Val Ser Asn Glu Leu Pro Ile Phe Pro Glu Pro Thr Val Phe His
85 90 95
Lys Thr Ile Lys Thr Arg Val Glu Gly Tyr Phe Thr Asp Arg Asn Ile
100 105 110
Asp Pro Lys Asn Arg Pro Glu Ile Trp Gly Arg Tyr Ala Leu Ile Phe
115 120 125
Gly Ser Leu Ile Ala Ser Tyr Tyr Ala Gln Leu Phe Val Pro Phe Val
130 135 140
Val Glu Arg Thr Trp Leu Gln Val Val Phe Ala Ile Ile Met Gly Phe
145 150 155 160
Ala Cys Ala Gln Val Gly Leu Asn Pro Leu His Asp Ala Ser His Phe
165 170 175
Ser Val Thr His Asn Pro Thr Val Trp Lys Ile Leu Gly Ala Thr His
180 185 190
Asp Phe Phe Asn Gly Ala Ser Tyr Leu Val Trp Met Tyr Gln His Met
195 200 205
Leu Gly His His Pro Tyr Thr Asn Ile Ala Gly Ala Asp Pro Asp Val
Page 15



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cL2301-conversion.ST25
210 215 220
Ser Thr Ser Glu Pro Asp Val Arg Arg Ile Lys Pro Asn Gln Lys Trp
225 230 235 240
Phe Val Asn His Ile Asn Gln His Met Phe Val Pro Phe Leu Tyr Gly
245 250 255
Leu Leu Ala Phe Lys Val Arg Ile Gln Asp Ile Asn Ile Leu Tyr Phe
260 265 270
Val Lys Thr Asn Asp Ala Ile Arg Val Asn Pro Ile Ser Thr Trp His
275 280 285
Thr Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu
290 295 300
Ile Val Pro Leu Gln Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe
305 310 315 320
Thr Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gln
325 330 335
Ala Asn His Val Val Glu Glu Val Gln Trp Pro Leu Pro Asp Glu Asn
340 345 350
Gly Ile Ile Gln Lys Asp Trp Ala Ala Met Gln Val Glu Thr Thr Gln
355 360 365
Asp Tyr Ala His Asp Ser His Leu Trp Thr Ser Ile Thr Gly Ser Leu
370 375 380
Asn Tyr Gln Ala Val His His Leu Phe Pro Asn Val Ser Gln His His
385 390 395 400
Tyr Pro Asp Ile Leu Ala Ile Ile Lys Asn Thr Cys Ser Glu Tyr Lys
405 410 415
Val Pro Tyr Leu Val Lys Asp Thr Phe Trp Gln Ala Phe Ala Ser His
420 425 430
Leu Glu His Leu Arg Val Leu Gly Leu Arg Pro Lys Glu Glu
435 440 445
<210> 40
<211> 30
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL11
<400> 40
Page 16



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.sT25
ttttccatgg gaacggacca aggaaaaacc 30
<210> 41
<211> 30
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL12
<400> 41
tttgcggccg cctactcttc cttgggacgg 30
<210> 42
<211> 33
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL21A
<400> 42
tttccatggc tgaggataag acgaaggtcg agt 33
<210> 43
<211> 36
<212> DNA
<213> Artificial sequence
<220>
<223> Primer' YL22
<400> 43
cccttaatta attagtccga cttggccttg gcggcc 36
<210>
44


<211>
957


<212>
DNA


<213>
Mortierella
alpina
Ax464731


<400>
44


atggagtcgattgcgccattcctcccatcaaagatgccgcaagatctgtttatggacctt60.


gccaccgctatcggtgtccgggccgcgccctatgtcgatcctctcgaggccgcgctggtg120


gcccaggccgagaagtacatccccacgattgtccatcacacgcgtgggttcctggtcgcg180


gtggagtcgcctttggcccgtgagctgccgttgatgaacccgttccacgtgctgttgatc240


gtgctcgcttatttggtcacggtctttgtgggcatgcagatcatgaagaactttgagcgg300


ttcgaggtcaagacgttttcgctcctgcacaacttttgtctggtctcgatcagcgcctac360


atgtgcggtgggatcctgtacgaggcttatcaggccaactatggactgtttgagaacgct420


gctgatcataccttcaagggtcttcctatggccaagatgatctggctcttctacttctcc480


aagatcatggagtttgtcgacaccatgatcatggtcctcaagaagaacaaccgccagatc540


tccttcttgcacgtttaccaccacagctccatcttcaccatctggtggttggtcaccttt600


gttgcacccaacggtgaagcctacttctctgctgcgttgaactcgttcatccatgtgatc660


atgtacggctactacttcttgtcggccttgggcttcaagcaggtgtcgttcatcaagttc720


Page 17





CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.5T25
tacatcacgc gctcgcagat gacacagttc tgcatgatgt cggtccagtc ttcctgggac 780
atgtacgcca tgaaggtcct tggccgcccc ggatacccct tcttcatcac ggctctgctt 840
tggttctaca tgtggaccat gctcggtctc ttctacaact tttacagaaa gaacgccaag 900
ttggccaagc aggccaaggc cgacgctgcc aaggagaagg caaggaagtt gcagtaa 957
<210> 45
<211> 318
<212> PRT
<213> Mortierella alpina Ax464731
<400> 45
Met Glu Ser Ile Ala Pro Phe Leu Pro Ser Lys Met Pro Gln Asp Leu
1 5 10 15
Phe Met Asp Leu Ala Thr Ala Ile Gly Val Arg Ala Ala Pro Tyr Val
20 25 30
Asp Pro Leu Glu Ala Ala Leu Val Ala Gln Ala Glu Lys Tyr Ile Pro
35 40 45
Thr Ile Val His His Thr Arg Gly Phe Leu Val Ala Val Glu Ser Pro
50 55 60
Leu Ala Arg Glu Leu Pro Leu Met Asn Pro Phe His Val Leu Leu Ile
65 70 75 80
Val Leu Ala Tyr Leu Val Thr Val Phe Val Gly Met Gln Ile Met Lys
85 90 95
Asn Phe Glu Arg Phe Glu Val Lys Thr Phe Ser Leu Leu His Asn Phe
100 105 110
Cys Leu Val Ser Ile Ser Ala Tyr Met Cys Gly Gly Ile Leu Tyr Glu
115 120 125
Ala Tyr Gln Ala Asn Tyr Gly Leu Phe Glu Asn Ala Ala Asp His Thr
130 135 140
Phe Lys Gly Leu Pro Met Ala Lys Met Ile Trp Leu Phe Tyr Phe Ser
145 150 155 160
Lys Ile Met Glu Phe Val Asp Thr Met Ile Met Val Leu Lys Lys Asn
165 170 175
Asn Arg Gln Ile Ser Phe Leu His Val Tyr His His Ser Ser Ile Phe
180 185 190
Thr Ile Trp Trp Leu Val Thr Phe Val Ala Pro Asn Gly Glu Ala Tyr
195 200 205
Page 18



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.5T25
Phe Ser Ala Ala Leu Asn Ser Phe Ile His Val Ile Met Tyr Gly Tyr
210 215 220
Tyr Phe Leu Ser Ala Leu Gly Phe Lys Gln Val Ser Phe Ile Lys Phe
225 230 235 240
Tyr Ile Thr Arg Ser Gln Met Thr Gln Phe Cys Met Met Ser Val Gln
245 250 255
Ser Ser Trp Asp Met Tyr Ala Met Lys Val Leu Gly Arg Pro Gly Tyr
260 265 270
Pro Phe Phe Ile Thr Ala Leu Leu Trp Phe Tyr Met Trp Thr Met Leu
275 280 285
Gly Leu Phe Tyr Asn Phe Tyr Arg Lys Asn Ala Lys Leu Ala Lys Gln
290 295 300
Ala Lys Ala Asp Ala Ala Lys Glu Lys Ala Arg Lys Leu Gln
305 310 315
<210>
46


<211>
1077


<212>
DNA


<213>
saprolegnia
declina


<400>
46


atggctgaggataagaccaaggtcgagttccctaccctgactgagctgaagcactctatc60


cctaacgcttgctttgagtccaacctcggactctcgctctactacactgcccgagcgatc120


ttcaacgcatctgcctctgctgctctgctctacgctgcccgatctactcccttcattgcc180


gataacgttctgctccacgctctggtttgcgccacctacatctacgtgcagggtgtcatc240


ttctggggtttctttaccgtcggtcacgactgtggtcactctgccttctcccgataccac300


tccgtcaacttcatcattggctgcatcatgcactctgccattctgactcccttcgagtcc360


tggcgagtgacccaccgacaccatcacaagaacactggcaacattgataaggacgagatc420


ttctaccctcatcggtccgtcaaggacctccaggacgtgcgacaatgggtctacaccctc480


ggaggtgcttggtttgtctacctgaaggtcggatatgctcctcgaaccatgtcccacttt540


gacccctgggaccctctcctgcttcgacgagcctccgctgtcatcgtgtccctcggagtc600


tgggctgccttcttcgctgcctacgcctacctcacatactcgctcggctttgccgtcatg660


ggcctctactactatgctcctctctttgtctttgcttcgttcctcgtcattactaccttc720


ttgcatcacaacgacgaagctactccctggtacggtgactcggagtggacctacgtcaag780


ggcaacctgagctccgtcgaccgatcgtacggagctttcgtggacaacctgtctcaccac840


attggcacccaccaggtccatcacttgttccctatcattccccactacaagctcaacgaa900


gccaccaagcactttgctgccgcttaccctcacctcgtgagacgtaacgacgagcccatc960


attactgccttcttcaagaccgctcacctctttgtcaactacggagctgtgcccgagact1020


Page 19





CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.ST25
gctcagattt tcaccctcaa agagtctgcc gctgcagcca aggccaagag cgactaa 1077
<210> 47
<211> 358
<212> PRT
<213> Saprolegnia declina
<400> 47
Met Ala Glu Asp Lys Thr Lys Val Glu Phe Pro Thr Leu Thr Glu Leu
1 5 10 15
Lys His Ser Ile Pro Asn Ala Cys Phe Glu Ser Asn Leu Gly Leu Ser
20 25 30
Leu Tyr Tyr Thr Ala Arg Ala,Ile Phe Asn Ala Ser Ala Ser Ala Ala
35 40 45
Leu Leu Tyr Ala Ala Arg Ser Thr Pro Phe Ile Ala Asp Asn Val Leu
50 55 60
Leu His Ala Leu Val Cys Ala Thr Tyr Ile Tyr Val Gln Gly Val Ile
65 70 75 80
Phe Trp Gly Phe Plhe Thr Val Gly His Asp Cys Gly His Ser Ala Phe
85 90 95
Ser Arg Tyr His Ser Val Asn Phe Ile Ile Gly Cys Ile Met His Ser
100 105 110
Ala Ile Leu Thr Pro Phe Glu Ser Trp Arg Val Thr His Arg His His
115 120 125
His Lys Asn Thr Gly Asn Ile Asp Lys Asp Glu Ile Phe Tyr Pro His
130 135 140
Arg Ser Val Lys Asp Leu Gln Asp Val Arg Gln Trp Val Tyr Thr Leu
145 150 155 160
Gly Gly Ala Trp Phe Val Tyr Leu Lys Val Gly Tyr Ala Pro Arg Thr
165 170 175
Met Ser His Phe Asp Pro Trp Asp Pro Leu Leu Leu Arg Arg Ala Ser
180 185 190
Ala Val Ile Val Ser Leu Gly Val Trp Ala Ala Phe Phe Ala Ala Tyr
195 200 205
Ala Tyr Leu Thr Tyr Ser Leu Gly Phe Ala Val Met Gly Leu Tyr Tyr
210 215 220
Tyr Ala Pro Leu Phe Val Phe Ala Ser Phe Leu Val Ile Thr Thr Phe
Page ZO



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.5T25
225 230 235 240
Leu His His Asn Asp Glu Ala Thr Pro Trp Tyr Gly Asp Ser Glu Trp
245 250 255
Thr Tyr Val Lys Gly Asn Leu Ser Ser Val Asp Arg Ser Tyr Gly Ala
260 265 270
Phe Val Asp Asn Leu Ser His His Ile Gly Thr His Gln Val His His
275 280 285
Leu Phe Pro Ile Ile Pro His Tyr Lys Leu Asn Glu Ala Thr Lys His
290 295 300
Phe Ala Ala Ala Tyr Pro His Leu Val Arg Arg Asn Asp Glu Pro Ile
305 310 315 320
Ile Thr Ala Phe Phe Lys Thr Ala His Leu Phe Val Asn Tyr Gly Ala
325 330 335
Val Pro Glu Thr Ala Gln Ile Phe Thr Leu Lys Glu Ser Ala Ala Ala
340 345 350
Ala Lys Ala Lys Ser Asp
355
<210> 48
<211> 105
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-1A
<400> 48
catggctgag gataagacca aggtcgagtt ccctaccctg actgagctga agcactctat 60
ccctaacgct tgctttgagt ccaacctcgg actctcgctc tacta 105
<210> 49
<211> 106
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-1B
<400> 49
cagtgtagta gagcgagagt ccgaggttgg actcaaagca agcgttaggg atagagtgct 60
tcagctcagt cagggtaggg aactcgacct tggtcttatc ctcagc 106
<210> 50
<211> 106
<212> DNA
<213> Artificial Sequence
Page 21



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.ST25
<220>
<223> Primer D17-2A
<400> 50
cactgcccga gcgatcttca acgcatctgc ctctgctgct ctgctctacg ctgcccgatc 60
tactcccttc attgccgata acgttctgct ccacgctctg gtttgc 106
<210> 51
<211> 106
<212> DNA
<213> Artificial sequence
<220>
<223> Primer D17-2B
<400> 51
gtggcgcaaa ccagagcgtg gagcagaacg ttatcggcaa tgaagggagt agatcgggca 60
gcgtagagca gagcagcaga ggcagatgcg ttgaagatcg ctcggg 106
<210> 52
<211> 105
<212> DNA
<213> Artificial sequence
<220>
<223> Primer D17-3A
<400> 52
gccacctaca tctacgtgca gggtgtcatc ttctggggtt tctttaccgt cggtcacgac 60
tgtggtcact ctgccttctc ccgataccac tccgtcaact tcatc 105
<210> 53
<211> 105
<212> DNA ,
<213> Artificial Sequence
<220>
<223> Primer D17-3g
<400> 53
ccaatgatga agttgacgga gtggtatcgg gagaaggcag agtgaccaca gtcgtgaccg 60
acggtaaaga aaccccagaa gatgacaccc tgcacgtaga tgtag 105
<210> 54
<211> 105
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-4A
<400> 54
attggctgca tcatgcactc tgccattctg actcccttcg agtcctggcg agtgacccac 60
cgacaccatc acaagaacac tggcaacatt gataaggacg agatc 105
<210> 55
<211> 105
<212> DNA
Page 22



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cL2301-conversion.5T25
<213> Artificial Sequence
<220>
<223> Primer D17-4B
<400> 55
tagaagatct cgtccttatc aatgttgcca gtgttcttgt gatggtgtcg gtgggtcact 60
cgccaggact cgaagggagt cagaatggca gagtgcatga tgcag 105
<210> 56
<211> 105
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-5A
<400> 56
acgagatctt ctaccctcat cggtccgtca aggacctcca ggacgtgcga caatgggtct 60
acaccctcgg aggtgcttgg tttgtctacc tgaaggtcgg atatg 105
<210> 57
<211> 107
<212> DNA
<213> Artificial sequence
<220>
<223> Primer D17-5B
<400> 57
aggagcatat ccgaccttca ggtagacaaa ccaagcacct ccgagggtgt agacccattg 60
tcgcacgtcc tggaggtcct tgacggaccg atgagggtag aagatct 107
<210> 58
<211> 105
<212> DNA
<213> Artificial Sequence
<Z20>
<223> Primer D17-6A
<400> 58
ctcctcgaac catgtcccac tttgacccct gggaccctct cctgcttcga cgagcctccg 60
ctgtcatcgt gtccctcgga gtctgggctg ccttcttcgc tgcct 105
<210> 59
<211> 106
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-6B
<400> 59
aggcgtaggc agcgaagaag gcagcccaga ctccgaggga cacgatgaca gcggaggctc 60
gtcgaagcag gagagggtcc caggggtcaa agtgggacat ggttcg 106
<210> 60
Page 23



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
02301-conversion.ST25
<211> 104
<212> DNA
<213> Artificial sequence
<220>
<223> Primer D17-7A
<400> 60
acgcctacct cacatactcg ctcggctttg ccgtcatggg cctctactac tatgctcctc 60
tctttgtctt tgcttcgttc ctcgtcatta ctaccttctt gcat 104
<210> 61
' <211> 103
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-7B
<400> 61
ttgtgatgca agaaggtagt aatgacgagg aacgaagcaa agacaaagag aggagcatag 60
tagtagaggc ccatgacggc aaagccgagc gagtatgtga ggt 103
<210> 62
<211> 106
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-8A
<400> 62
cacaacgacg aagctactcc ctggtacggt gactcggagt ggacctacgt caagggcaac 60
ctgagctccg tcgaccgatc gtacggagct ttcgtggaca acctgt 106
<210> 63
<211> 106
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-8B
<400> 63
gtgagacagg ttgtccacga aagctccgta cgatcggtcg acggagctca ggttgccctt 60
gacgtaggtc cactccgagt caccgtacca gggagtagct tcgtcg 106
<210> 64
<211> 102
<212> DNA
<213> Artificial Sequence '
<220>
<223> Primer D17-9A
<400> 64
ctcaccacat tggcacccac caggtccatc acttgttccc tatcattccc cactacaagc 60
tcaacgaagc caccaagcac tttgctgccg cttaccctca cc 102
Page 24



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.sT25
<210> 65
<211> 102
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-9B
<400> 65
cacgaggtga gggtaagcgg cagcaaagtg cttggtggct tcgttgagct tgtagtgggg 60
aatgataggg aacaagtgat ggacctggtg ggtgccaatg tg 102
<210> 66
<211> 76
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-l0A
<400> 66
tcgtgagacg taacgacgag cccatcatta ctgccttctt caagaccgct cacctctttg 60
tcaactacgg agctgt 76
<210> 67
<211> 76
<212> DNA
<213> Artificial sequence
<220>
<223> Primer D17-10B
<400> 67
cgggcacagc tccgtagttg acaaagaggt gagcggtctt gaagaaggca gtaatgatgg 60
gctcgtcgtt acgtct 76
<210> 68
<211> 67
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-11,4
<400> 68
gcccgagact gctcagattt tcaccctcaa agagtctgcc gctgcagcca aggccaagag 60
cgactaa 67
<210> 69
<211> 62
<212> DNA
<213> Artificial sequence
<220>
<223> Primer D17-11B
<400> 69
ttagtcgctc ttggccttgg ctgcagcggc agactctttg agggtgaaaa tctgagcagt 60
Page 25



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
c~2301-conversion.sT25
ct 62
<210> 70
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-1
<400> 70
tttccatggc tgaggataag accaaggtcg ag 32
<210> 71
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-4R
<400> 71
ccctagaaga tctcgtcctt atcaatgttg ccag 34
<210> 72
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer D17-5
<400> 72
cccacgagat cttctaccct catcggt 27
<210> 73
<211> 24
<212> DNA
<213> Artificial sequence
<220>
<223> Primer D17-8D
<400> 73
gaaagctccg tacgatcggt cgac 24
<210> 74
<211> 24
<212> DNA
<213> Artificial sequence
<220>
<223> Primer D17-8u
<400> 74
gtcgaccgat cgtacggagc tttc 24
<210> 75
<211> 34
<212> DNA
<213> Artificial Sequence
Page 26



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cL2301-conversion.sT25
<Z20>
<223> Primer D17-11
<400> 75
aaagcggccg cttagtcgct cttggccttg gctg 34
<210> 76
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL53
<400> 76
gccaagtcgg actaagctgc taactagagc ggccgc 36
<210> 77
<211> 36
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL54
<400> 77
gcggccgctc tagttagcag cttagtccga cttggc 36
<210> 78
<211> 35
<212> DNA
<213> Artificial sequence
<220>
<223> Primer KU5
<400> 78
tttgcccggg cgagtatctg tctgactcgt cattg 35
<210> 79
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer KU3
<400> 79
aaagcccggg caaaggcctg tttctcggtg tac 33
<210> 80
<211> 1710
<212> DNA
<213> Yarrowia lipolytica
<400> 80
gtcgacgagt atctgtctga ctcgtcattg ccgcctttgg agtacgactc caactatgag 60
tgtgcttgga tcactttgac gatacattct tcgttggagg ctgtgggtct gacagctgcg 120
ttttcggcgc ggttggccga caacaatatc agctgcaacg tcattgctgg ctttcatcat 180
gatcacattt ttgtcggcaa aggcgacgcc cagagagcca ttgacgttct ttctaatttg 240
Page 27



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cL2301-conversion.ST25
gaccgatagccgtatagtccagtctatctataagttcaactaactcgtaactattaccat300


aacatatacttcactgccccagataaggttccgataaaaagttctgcagactaaatttat360


ttcagtctcctcttcaccaccaaaatgccctcctacgaagctcgagctaacgtccacaag420


tccgcctttgccgctcgagtgctcaagctcgtggcagccaagaaaaccaacctgtgtgct480


tctctggatgttaccaccaccaaggagctcattgagcttgccgataaggtcggaccttat540


gtgtgcatgatcaagacccatatcgacatcattgacgacttcacctacgccggcactgtg600


ctccccctcaaggaacttgctcttaagcacggtttcttcctgttcgaggacagaaagttc660


gcagatattggcaacactgtcaagcaccagtacaagaacggtgtctaccgaatcgccgag720


tggtccgatatcaccaacgcccacggtgtacccggaaccggaatcattgctggcctgcga780


gctggtgccgaggaaactgtctctgaacagaagaaggaggacgtctctgactacgagaac840


tcccagtacaaggagttcctggtcccctctcccaacgagaagctggccagaggtctgctc,900


atgctggccgagctgtcttgcaagggctctctggccactggcgagtactccaagcagacc960


attgagcttgcccgatccgaccccgagtttgtggttggcttcattgcccagaaccgacct1020


aagggcgactctgaggactggcttattctgacccccggggtgggtcttgacgacaaggga1080


gacgctctcggacagcagtaccgaactgttgaggatgtcatgtctaccggaacggatatc1140


ataattgtcggccgaggtctgtacggccagaaccgagatcctattgaggaggccaagcga1200


taccagaaggctggctgggaggcttaccagaagattaactgttagaggttagactatgga1260


tatgtcatttaactgtgtatatagagagcgtgcaagtatggagcgcttgttcagcttgta1320


tgatggtcagacgacctgtctgatcgagtatgtatgatactgcacaacctgtgtatccgc1380


atgatctgtccaatggggcatgttgttgtgtttctcgatacggagatgctgggtacaagt1440


agctaatacgattgaactacttatacttatatgaggcttgaagaaagctgacttgtgtat1500


gacttattctcaactacatccccagtcacaataccaccactgcactaccactacaccaaa1560


accatgatcaaaccacccatggacttcctggaggcagaagaacttgttatggaaaagctc1620


aagagagagaagccaagatactatcaagacatgtgtcgcaacttcaaggaggaccaagct1680


ctgtacaccgagaaacaggcctttgtcgac 1710


<210>
81


<211>
286


<212>
PRT


<213>
Yarrowia
lipolytica


<400> 81
Met Ser Glu Arg Ala Val His Ser Phe
Pro Tyr Ala Asn Lys Ala Ala


1 5 10 15


Ala Val Lys Val Ala Lys Lys Asn cys
Arg Leu Leu Ala Thr Leu Ala


20 25 30


Ser Asp Thr Thr Lys Leu Ile Leu Asp
Leu Val Thr Glu Glu Ala Lys


Page 28





CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cL2301-conversion.ST25
35 40 45
Val Gly Pro Tyr Val Cys Met Ile Lys Thr His Ile Asp Ile Ile Asp
50 55 60
Asp Phe Thr Tyr Ala Gly Thr Val Leu Pro Leu Lys Glu Leu Ala Leu
65 70 75 80
Lys His Gly Phe Phe Leu Phe Glu Asp Arg Lys Phe Ala Asp Ile Gly
85 90 95
Asn Thr Val Lys His Gln Tyr Lys Asn Gly Val Tyr Arg Ile Ala Glu
100 105 110
Trp Ser Asp Ile Thr Asn Ala His Gly Val Pro Gly Thr Gly Ile Ile
115 120 125
Ala Gly Leu Arg Ala Gly Ala Glu Glu Thr Val Ser Glu Gln Lys Lys
130 135 140
Glu Asp Val Ser Asp Tyr Glu Asn Ser Gln Tyr Lys Glu Phe Leu Val
145 150 155 160
Pro Ser Pro Asn Glu f_ys Leu Ala Arg Gly Leu Leu Met Leu Ala Glu
165 170 175
Leu Ser Cys Lys Gly Ser Leu Ala Thr Gly Glu Tyr Ser Lys Gln Thr
180 185 190
Ile Glu Leu Ala Arg Ser Asp Pro Glu Phe Val Val Gly Phe Ile Ala
195 ~ 200 205
Gln Asn Arg Pro Lys Gly Asp Ser Glu Asp Trp Leu Ile Leu Thr Pro
210 215 220
Gly Val Gly Leu Asp Asp Lys Gly Asp Ala Leu Gly Gln Gln Tyr Arg
225 230 235 240
Thr Val Glu Asp Val Met Ser Thr Gly Thr Asp Ile Ile Ile Val Gly
245 250 ~ 255
Arg Gly Leu Tyr Gly Gln Asn Arg Asp Pro Ile Glu Glu Ala Lys Arg
260 265 270
Tyr Gln Lys Ala Gly Trp Glu Ala Tyr Gln Lys Ile Asn Cys
275 , 280 285
<210> 82
<211> 35
<212> DNA
<213> Artificial Sequence
Page 29



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cL2301-conversion.sT25
<220>
<223> Primer KI5
<400> 82
agagcggccg catgggagaa gtgggaccca caaac 35
<210> 83
<211> 38
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer KI3
<400> 83
gtggcggccg ctcaaatgtc gttattgtac caataaac 38
<210>
84


<211>
1152


<212>
DNA


<213> tients
Impa balsama


<400>
84


atgggagaagtgggacccacaaaccgaaccaaaaccaagttggacaagcaacaagaatcc60


gaaaacagggttcctcacgagccacctccattcacactaagtgaccttaagaaagccatc120


ccaccccattgcttcgagcgctccctcgtgaaatcattctaccacgtgattcacgacatt180


atcatcctgtcctttttctactatgtcgccgccaattacatccccatgctaccccaaaac240


ctccgttacgttgcatggccaatttattgggccatccaaggctgtgtccaacttggtata300


ttggtcttaggccatgaatgcggccaccacgccttcagcgactaccaatgggtagacgac360


atggtcgggttcgtcctccactcgtcccaattgattccctacttctcatggaaacatagc420


caccgtcgccaccactccaacacggcctccatcgagcgcgacgaggtctacccgcccgcg480


tacaaaaacgacctgccgtggttcgccaaatacctacgcaaccccgtcggtcgtttcctc540


atgattttcggggcgctactgttcggctggccgtcgtaccttctgttcaacgcgaacggc600


cgtctctacgaccgcttcgcttcccactacgacccgcaatccccgatcttcaacaaccgc660


gagaggctgcaagtgatcgcgtccgacgtcgggctcgtcttcgcgtactttgtcctgtac720


aagatcgcgctggccaagggatttgtgtggttaatttgtgtgtatggcgtcccgtacgtg780


atcctcaacgggcttatcgtcttgatcacgttcctacagcacacgcacccgaatctgccc840


cgttacgacctttccgagtgggactggcttaggggagccctgtcgactgtggaccgcgat900


tacgggatgttgaataaggtgttccataacgtgacggacacgcacttggtgcatcatttg960


ttcacgaccatgccacattatcgcgccaaggaggcgaccgaggtgattaaaccgatattg1020


ggagactactataagtttgacgacactccgtttctcaaagcgttgtggaaggacatggga1080


aagtgtatttatgtggagtcggacgtgcctggcaagaacaagggagtttattggtacaat1140


aacgacatttga 1152


<210> 85
<211> 383
Page 30



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
<212> PRT
<213> Impatients balsama
<400> 85
CL2301-conversion.ST25
Met Gly Glu Val Gly Pro Thr Asn Arg Thr Lys Thr Lys Leu Asp Lys
1 5 10 15
Gln Gln Glu Ser Glu Asn Arg Val Pro His Glu Pro Pro Pro Phe Thr
20 25 30
Leu Ser Asp Leu Lys Lys Ala Ile Pro Pro His Cys Phe Glu Arg Ser
35 40 45
Leu Val Lys Ser Phe Tyr His Val Ile His Asp Ile Ile Ile Leu Ser
50 55 60
Phe Phe Tyr Tyr Val Ala Ala Asn Tyr Ile Pro Met Leu Pro Gln Asn
65 70 75 80
Leu Arg Tyr Val Ala Trp Pro Ile Tyr Trp Ala Ile Gln Gly Cys Val
85 90 95
Gln Leu Gly Ile Leu Val Leu Gly His Glu Cys Gly His His Ala Phe
100 105 110
Ser Asp Tyr~Gln Trp Val Asp Asp Met Val Gly Phe Val Leu His Ser
115 120 125
Ser Gln Leu Ile Pro Tyr Phe Ser Trp Lys His Ser His Arg Arg His
130 135 140
His Ser Asn Thr Ala Ser Ile Glu Arg Asp Glu Val Tyr Pro Pro Ala
145 150 155 160
Tyr Lys Asn Asp Leu Pro Trp Phe Ala Lys Tyr Leu Arg Asn Pro Val
165 170 175
Gly Arg Phe Leu Met Ile Phe Gly Ala Leu Leu Phe Gly Trp Pro Ser
180 185 190
Tyr Leu Leu Phe Asn Ala Asn Gly Arg Leu Tyr Asp Arg Phe Ala Ser
195 200 205
His Tyr Asp Pro Gln Ser Pro Ile Phe Asn Asn Arg Glu Arg Leu Gln
210 215 220
Val Ile Ala Ser Asp Val Gly Leu Val Phe Ala Tyr Phe Val Leu Tyr
225 230 235 240
Lys Ile Ala Leu Ala Lys Gly Phe Val Trp Leu Ile Cys Val Tyr Gly
245 250 255
Page 31



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cL2301-conversion.sT25
Val Pro Tyr Val Ile Leu Asn Gly Leu Ile Val Leu Ile Thr Phe Leu
Z60 265 ;270
Gln His Thr His Pro Asn Leu Pro Arg Tyr Asp Leu ser Glu Trp Asp
275 280 285
Trp Leu Arg Gly Ala Leu ser Thr Val Asp Arg Asp Tyr Gly Met Leu
290 295 300
Asn Lys Val Phe His Asn Val Thr Asp Thr His Leu Val His His Leu
305 310 315 320
Phe Thr Thr Met Pro His Tyr Arg Ala Lys Glu Ala Thr Glu Val Ile
325 330 335
Lys Pro Ile Leu Gly Asp Tyr Tyr Lys Phe Asp Asp Thr Pro Phe Leu
340 345 350
Lys Ala Leu Trp Lys Asp Met Gly Lys Cys Ile Tyr Val Glu ser Asp
355 360 365
Val Pro Gly Lys Asn Lys Gly Val Tyr Trp Tyr Asn Asn Asp Ile
370 375 380
<210> 86
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer KTIS
<400> 86
aagctcgaga ccgggttggc ggcgtatttg tgtc 34
<210> 87
<211> 38
<212> DNA
<213> Artificial sequence
<220>
<223> Primer KTI3
<400> 87
ggtctcgaga tctccaccgc ggacacaata tctggtca 38
<210> 88
<211> 1756
<212> DNA
<213> Artificial sequence
<220>
<223> TEF/conjugase/XPR chimeric gene
<400> 88
gaccgggttg gcggcgtatt tgtgtcccaa aaaacagccc caattgcccc aattgacccc 60
Page 32



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
02301-conversion.5T25


aaattgacccagtagcgggcccaaccccggcgagagcccccttcaccccacatatcaaac120


ctcccccggttcccacacttgccgttaagggcgtagggtactgcagtctggaatctacgc180


ttgttcagactttgtactagtttctttgtctggccatccgggtaacccatgccggacgca240


aaatagactactgaaaatttttttgctttgtggttgggactttagccaagggtataaaag300


accaccgtccccgaattacctttcctcttcttttctctctctccttgtcaactcacaccc360


gaaatcgttaagcatttccttctgagtataagaatcattcaaaggatccactagttctag420


agcggccgcatgggagaagtgggacccacaaaccgaaccaaaaccaagttggacaagcaa480


caagaatccgaaaacagggttcctcacgagccacctccattcacactaagtgaccttaag540


aaagccatcccaccccattgcttcgagcgctccctcgtgaaatcattctaccacgtgatt600


cacgacattatcatcctgtcctttttctactatgtcgccgccaattacatccccatgcta660


ccccaaaacctccgttacgttgcatggccaatttattgggccatccaaggctgtgtccaa720


cttggtatattggtcttaggccatgaatgcggccaccacgccttcagcgactaccaatgg780


gtagacgacatggtcgggttcgtcctccactcgtcccaattgattccctacttctcatgg840


aaacatagccaccgtcgccaccactccaacacggcctccatcgagcgcgacgaggtctac900


ccgcccgcgtacaaaaacgacctgccgtggttcgccaaatacctacgcaaccccgtcggt960


cgtttcctcatgattttcggggcgctactgttcggctggccgtcgtaccttctgttcaac1020


gcgaacggccgtctctacgaccgcttcgcttcccactacgacccgcaatccccgatcttc1080


aacaaccgcgagaggctgcaagtgatcgcgtccgacgtcgggctcgtcttcgcgtacttt1140


gtcctgtacaagatcgcgctggccaagggatttgtgtggttaatttgtgtgtatggcgtc1200


ccgtacgtgatcctcaacgggcttatcgtcttgatcacgttcctacagcacacgcacccg1260


aatctgccccgttacgacctttccgagtgggactggcttaggggagccctgtcgactgtg1320


gaccgcgattacgggatgttgaataaggtgttccataacgtgacggacacgcacttggtg1380


catcatttgttcacgaccatgccacattatcgcgccaaggaggcgaccgaggtgattaaa1440


ccgatattgggagactactataagtttgacgacactccgtttctcaaagcgttgtggaag1500


gacatgggaaagtgtatttatgtggagtcggacgtgcctggcaagaacaagggagtttat1560


tggtacaataacgacatttgagcggccgccaccgcggcccgagattccggcctcttcggc1620


cgccaagcgacccgggtggacgtctagaggtacctagcaattaacagatagtttgccggt1680


gataattctcttaacctcccacactcctttgacataacgatttatgtaacgaaactgaaa1740


tttgaccagatattgt 1756


<210> 89
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> TEF/conjugase/xPR chimeric protein
<400> 89
Page 33



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.ST25
Met Gly Glu Val Gly Pro Thr Asn Arg Thr Lys Thr Lys Leu Asp Lys
1 5 10 15
G1C~ Gln Glu Ser Glu Asn Arg Val Pro His Glu Pro Pro Pro Phe Thr
20 25 30
Leu Ser Asp Leu Lys Lys Ala Ile Pro Pro His Cys Phe Glu Arg Ser
35 40 45
Leu Val Lys Ser Phe Tyr His Val Ile His Asp Ile Ile Ile Leu Ser
50 55 60
Phe Phe Tyr Tyr Val Ala Ala Asn Tyr Ile Pro Met Leu Pro Gln Asn
65 70 75 80
Leu Arg Tyr Val Ala Trp Pro Ile Tyr Trp Ala Ile Gln Gly Cys Val
85 90 95
Gln Leu Gly Ile Leu Val Leu Gly His Glu Cys Gly His His Ala Phe
100 105 110
Ser Asp Tyr Gln Trp Val Asp Asp Met Val Gly Phe Val Leu His Ser
115 120 125
Ser Gln Leu Ile Pro Tyr Phe Ser Trp Lys His Ser His Arg Arg His
130 135 140
His Ser Asn Thr Ala Ser Ile Glu Arg Asp Glu Val Tyr Pro Pro Ala
145 150 155 160
Tyr Lys Asn Asp Leu Pro Trp Phe Ala Lys Tyr Leu Arg Asn Pro Val
165 170 175
Gly Arg Phe Leu Met Ile Phe Gly Ala Leu Leu Phe Gly Trp Pro Ser
180 185 190
Tyr Leu Leu Phe Asn Ala Asn Gly Arg Leu Tyr~Asp Arg Phe Ala Ser
195 200 205
His Tyr Asp Pro Gln Ser Pro Ile Phe Asn Asn Arg Glu Arg Leu Gln
210 215 220
Val Ile Ala Ser Asp Val Gly Leu Val Phe Ala Tyr Phe Val Leu Tyr
225 230 235 240
Lys Ile Ala Leu Ala Lys Gly Phe Val Trp Leu Ile Cys Val Tyr Gly
245 250 255
Val Pro Tyr Val Ile Leu Asn Gly Leu Ile Val Leu Ile Thr Phe Leu
260 265 270
Page 34



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.ST25
Gln His Thr His Pro Asn Leu Pro Arg Tyr Asp Leu Ser Glu Trp Asp
275 280 285
Trp Leu Arg Gly Ala Leu Ser Thr Val Asp Arg Asp Tyr Gly Met Leu
290 295 300
Asn Lys Val Phe His Asn Val Thr Asp Thr His Leu Val His His Leu
305 310 315 320
Phe Thr Thr Met Pro His Tyr Arg Ala Lys Glu Ala Thr Glu Val Ile
325 330 335
Lys Pro Ile Leu Gly Asp Tyr Tyr Lys Phe Asp Asp Thr Pro Phe Leu
340 345 350
Lys Ala Leu Trp Lys Asp Met Gly Lys Cys Ile Tyr Val Glu Ser Asp
355 360 365
Val Pro Gly Lys Asn Lys Gly Val Tyr Trp Tyr Asn Asn Asp Ile
370 375 380
<210> 90
<211> 32
<212> DNA
<213> Artificial sequence
<220>
<223> Primer KH5
<400> 90
tagagcggcc gcttaaacca tgaaaaagcc tg 32
<210> 91
<211> 33.
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer KH3
<400> 91
gtggcggccg ctttaggtac ctcactattc ctt 33
<210> 92
<Z11> 1026
<212> DNA
<213> Escherichia coli
<400> 92
atgaaaaagc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga aaagttcgac 60
i
agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt cagcttcgat 120
gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt ctacaaagat 180
cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt gcttgacatt 240
ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg tgtcacgttg 300
Page 35



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.sT25
caagacctgcctgaaaccgaactgcccgctgttctgcagccggtcgcggaggccatggat360


gcgatcgctgcggccgatcttagccagacgagcgggttcggcccattcggaccgcaagga420


atcggtcaatacactacatggcgtgatttcatatgcgcgattgctgatccccatgtgtat480


cactggcaaactgtgatggacgacaccgtcagtgcgtccgtcgcgcaggctctcgatgag540


ctgatgctttgggccgaggactgccccgaagtccggcacctcgtgcacgcggatttcggc600


tccaacaatgtcctgacggacaatggccgcataacagcggtcattgactggagcgaggcg660


atgttcggggattcccaatacgaggtcgccaacatcttcttctggaggccgtggttggct720


tgtatggagcagcagacgcgctacttcgagcggaggcatccggagcttgcaggatcgccg780


cggctccgggcgtatatgctccgcattggtcttgaccaactctatcagagcttggttgac840


ggcaatttcgatgatgcagcttgggcgcagggtcgatgcgacgcaatcgtccgatccgga900


gccgggactgtcgggcgtacacaaatcgcccgcagaagcgcggccgtctggaccgatggc960


tgtgtagaagtactcgccgatagtggaaaccgacgccccagcactcgtccgagggcaaag1020


gaatag 1026


<210> 93
<211> 341
<212> PRT
<213> Escherichia coli
<400> 93
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 ~ 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
85 90 95
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
100 105 110
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
115 120 125
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
Page 36



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.ST25
130 135 140
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
145 150 155 160
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
165 170 175
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
180 185 190
His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn
195 200 205
Gly Arg Ile Thr Ala Val Ile Asp Trp Ser Glu Ala Met Phe Gly Asp
210 215 Z20
Ser Gln Tyr Glu Val Ala Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala
Z25 230 235 240
Cys Met Glu Gln Gln Thr Arg Tyr Phe Glu Arg Arg His Pro Glu Leu
245 250 , 255
Ala Gly Ser Pro Arg Leu Arg Ala Tyr Met Leu Arg Ile Gly Leu Asp
260 265 270
i
Gln Leu Tyr Gln Ser Leu Val Asp Gly Asn Phe Asp Asp Ala Ala Trp
275 280 285
Ala Gln Gly Arg Cys Asp Ala Ile Val Arg S~er Gly Ala Gly Thr Val
290 295 300
Gly Arg Thr Gln Ile Ala Arg Arg Ser Ala Ala Val Trp Thr Asp Gly
305 310 315 320
Cys Val Glu Val Leu Ala Asp Ser Gly Asn Arg Arg Pro Ser Thr Arg
325 330 335
Pro Arg Ala Lys Glu
340
<210> 94
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer KTHS
<400> 94
tttagatctc gagaccgggt tggcggcgta tttg 34
<210> 95
Page 37



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
c~2301-conversion.sT25
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer KTH3
<400> 95
tttagatctc caccgcggac acaatatctg g 31
<210> 96
<211> 1650
<212> DNA
<213> Artificial sequence
<220>


<223> TEF ::HPT::XPR
fusion


<400> 96


gaccgggttggcggcgtatttgtgtcccaaaaaacagccccaattgccccaattgacccc60


aaattgacccagtagcgggcccaaccccggcgagagcccccttcaccccacatatcaaac120


ctcccccggttcccacacttgccgttaagggcgtagggtactgcagtctggaatctacgc180


ttgttcagactttgtactagtttctttgtctggccatccgggtaacccatgccggacgca240


aaatagactactgaaaatttttttgctttgtggttgggactttagccaagggtataaaag300


accaccgtccccgaattacctttcctcttcttttctctctctccttgtcaactcacaccc360


gaaatcgttaagcatttccttctgagtataagaatcattcaaaggatccactagttctag420


agcggccgcttaaaccatgaaaaagcctgaactcaccgcgacgtctgtcgagaagtttct480


gatcgaaaagttcgacagcgtctccgacctgatgcagctctcggagggcgaagaatctcg540


tgctttcagcttcgatgtaggagggcgtggatatgtcctgcgggtaaatagctgcgccga600


tggtttctacaaagatcgttatgtttatcggcactttgcatcggccgcgctcccgattcc660


ggaagtgcttgacattggggaattcagcgagagcctgacctattgcatctcccgccgtgc720


acagggtgtcacgttgcaagacctgcctgaaaccgaactgcccgctgttctgcagccggt780


cgcggaggccatggatgcgatcgctgcggccgatcttagccagacgagcgggttcggccc840


attcggaccgcaaggaatcggtcaatacactacatggcgtgatttcatatgcgcgattgc900


tgatccccatgtgtatcactggcaaactgtgatggacgacaccgtcagtgcgtccgtcgc960


' gcaggctctcgatgagctgatgctttgggccgaggactgccccgaagtccggcacctcgt1020


gcacgcggatttcggctccaacaatgtcctgacggacaatggccgcataacagcggtcat1080


tgactggagcgaggcgatgttcggggattcccaatacgaggtcgccaacatcttcttctg1140


gaggccgtggttggcttgtatggagcagcagacgcgctacttcgagcggaggcatccgga1200


gcttgcaggatcgccgcggctccgggcgtatatgctccgcattggtcttgaccaactcta1260


tcagagcttggttgacggcaatttcgatgatgcagcttgggcgcagggtcgatgcgacgc1320


aatcgtccgatccggagccgggactgtcgggcgtacacaaatcgcccgcagaagcgcggc1380


cgtctggaccgatggctgtgtagaagtactcgccgatagtggaaaccgacgccccagcac1440


Page 38





CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.sT25
tcgtccgagg gcaaaggaat agtgaggtac ctaaagcggc cgccaccgcg gcccgagatt 1500
ccggcctctt cggccgccaa gcgacccggg tggacgtcta gaggtaccta gcaattaaca 1560
gatagtttgc cggtgataat tctcttaacc tcccacactc ctttgacata acgatttatg 1620
taacgaaact gaaatttgac cagatattgt 1650
<210> 97
<211> 341
<212> PRT
<213> Artificial sequence
<220>
<223> TEF::HPT::XPR fusion
<400> 97
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
85 90 95
Gly Val Thr Leu Gln Asp Leu Pro Glu Th'r Glu Leu Pro Ala Val Leu
100 105 110
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
115 120 125
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
130 135 140
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
145 150 155 160
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
165 170 175
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
180 185 190
His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn
Page 39



CA 02523786 2005-10-26
WO PCT/US2004/014542
2004/104167


cL2301-conversion. 5T25


195 200 205


GlyArg IleThrAla ValIleAsp TrpSerGlu AlaMet PheGlyAsp


210 215 220


SerGln TyrGluVal AlaAsnIle PhePheTrp ArgPro TrpLeuAla


225 230 235 240


CysMet GluGlnGln ThrArgTyr PheGluArg ArgHis ProGluLeu


245 250 255


AlaGly~SerProArg LeuArgAla TyrMetLeu ArgIle GlyLeuAsp


260 265 270


GlnLeu TyrGlnSer LeuValAsp GlyAsnPhe AspAsp AlaAlaTrp


275 280 285


AlaGln GlyArgCys AspAlaIle valArgser GlyAla GlyThrVal


290 295 300


GlyArg ThrGlnIle AlaArgArg SerAlaAla ValTrp ThrAspGly


305 310 315 320


CysVal GluValLeu AlaAspSer GlyAsnArg ArgPro SerThrArg


325 330 ~ 335


ProArg AlaLysGlu


340


<210>
98


<211>
401


<212>
DNA


<213>
Yarrowia
lipolytica


<400>
98


cgagtatctgtctgactcgtcattgccgcctttggagtacgactccaactatgagtgtgc60


ttggatcactttgacgatacattcttcgttggaggctgtgggtctgacagctgcgttttc120


ggcgcggttggccgacaacaatatcagctgcaacgtcattgctggctttcatcatgatca180


catttttgtcggcaaaggcgacgcccagagagccattgacgttctttctaatttggaccg240


atagccgtatagtccagtctatctataagttcaactaactcgtaactattaccataacat300


atacttcactgccccagataaggttccgataaaaagttctgcagactaaatttatttcag360


tctcctcttcaccaccaaaatgccctcctacgaagctcgag 401


<210>
99


<211>
568


<212>
DNA


<213>
Yarrowia
lipolytica


<400> 99
atcataattg tcggccgagg tctgtacggc cagaaccgag atcctattga ggaggccaag 60
Page 40



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
02301-conversion.ST25


cgataccagaaggctggctgggaggcttaccagaagattaactgttagaggttagactat120


ggatatgtcatttaactgtgtatatagagagcgtgcaagtatggagcgcttgttcagctt180


gtatgatggtcagacgacctgtctgatcgagtatgtatgatactgcacaacctgtgtatc240


cgcatgatctgtccaatggggcatgttgttgtgtttctcgatacggagatgctgggtaca300


agtagctaatacgattgaactacttatacttatatgaggcttgaagaaagctgacttgtg360


tatgacttattctcaactacatccccagtcacaataccaccactgcactaccactacacc420


aaaaccatgatcaaaccacccatggacttcctggaggcagaagaacttgttatggaaaag480


ctcaagagagagaagccaagatactatcaagacatgtgtcgcaacttcaaggaggaccaa540


gctctgtacaccgagaaacaggcctttg 568


<210> 100
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer Y~63
<400> 100
ttatgatatc gaattaatta acctgcagcc cggggg 36
<210> 101
<211> 36
<212> DNA
<213> Artificial sequence
<220>
<223> Primer Y~64
<400> 101
cccccgggct gcaggttaat taattcgata tcataa 36
<210> 102
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer Y~65
<400> 102
tacgccgcca acccgtacgt ctcgagcttc gta 33
<210> 103
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL66
<400> 103
tacgaagctc gagacgtacg ggttggcggc gta 33
<210> 104
Page 41



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.sT25
<211> 33
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL81
<400> 104
gttatccgct cacaagcttc cacacaacgt acg 33
<210> 105
<211> 33
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL82
<400> 105
cgtacgttgt gtggaagctt gtgagcggat aac 33
<210> 106
<211> 33
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL83
<400> 106
atttgaatcg aatcgatgag cctaaaatga acc 33
<210> 107
<211> 33
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL84
<400> 107
ggttcatttt aggctcatcg attcgattca aat 33
<210> 108
<211> 38
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL105
<400> 108
ccaagcacta acctaccgtt taaacaccac taaaaccc 38
<210> 109
<211> 38
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL106
<400> 109
Page 42



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
02301-conversion.sT25
gggttttagt ggtgtttaaa cggtaggtta gtgcttgg 38
<210> 110
<Z11> 36
<212> DNA
<213> Artificial sequence
<220>
<223> Primer Y~119
<400> 110
cgggaaacct gtcgtggcgc gccagctgca ttaatg 36
<210> 111
<211> 36
<212> DNA
<213> Artificial sequence
<220>
<223> Primer Y~120
<400> 111
cattaatgca gctggcgcgc cacgacaggt ttcccg 36
<210> 112
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer Y~121
<400> 112
tttggcgcgc ctatcacatc acgctctcat caag 34
<210> 113
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer Y~122
<400> 113
tttcgtacga accaccaccg tcagcccttc tgac 34
<210> 114
<211> 440
<212> DNA
<213> Yarrowia lipolytica
<400> 114
aaccaccacc gtcagccctt ctgactcacg tattgtagcc accgacacag gcaacagtcc 60
gtggatagca gaatatgtct tgtcggtcca tttctcacca actttaggcg tcaagtgaat 120
gttgcagaag aagtatgtgc cttcattgag aatcggtgtt gctgatttca ataaagtctt 180
gagatcagtt tggccagtca tgttgtgggg ggtaattgga ttgagttatc gcctacagtc 240
tgtacaggta tactcgctgc ccactttata ctttttgatt ccgctgcact tgaagcaatg 300
tcgtttacca aaagtgagaa tgctccacag aacacacccc agggtatggt tgagcaaaaa 360
Page 43



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cL2301-conversion.ST25
ataaacactc cgatacgggg aatcgaaccc cggtctccac ggttctcaag aagtattctt 420
gatgagagcg tgatgtgata 440
<210> 115
<211> 35
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL114
<400> 115
tgatagtatc ttggcgcgcc ttctctctct tgagc 35
<210> 116
<211> 35
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL115
<400> 116
gctcaagaga gagaaggcgc gccaagatac tatca 35
<210>
117


<211>
5218


<212>
DNA


<213> ficial ence
Arti Sequ


<220>


<223> by fragment for integration expressionof the -5
5218 and delta


desa turase -
gene


<400>
117


tatcacatca,cgctctcatcaagaatacttcttgagaaccgtggagaccggggttcgatt60.


ccccgtatcggagtgtttattttttgctcaaccataccctggggtgtgttctgtggagca120


ttctcacttttggtaaacgacattgcttcaagtgcagcggaatcaaaaagtataaagtgg180


gcagcgagtatacctgtacagactgtaggcgataactcaatccaattaccccccacaaca240


tgactggccaaactgatctcaagactttattgaaatcagcaacaccgattctcaatgaag300


gcacatacttcttctgcaacattcacttgacgcctaaagttggtgagaaatggaccgaca360


agacatattctgctatccacggactgttgcctgtgtcggtggctacaatacgtgagtcag420


aagggctgacggtggtggttcgtacgttgtgtggaagcttgtgagcggataacaatttca480


cacaggaaacagctatgaccatgattacgccaagctcgaaattaaccctcactaaaggga540


acaaaagctggagctccaccgcggacacaatatctggtcaaatttcagtttcgttacata600


aatcgttatgtcaaaggagtgtgggaggttaagagaattatcaccggcaaactatctgtt660


aattgctaggtacctctagacgtccacccgggtcgcttggcggccgaagaggccggaatc720


tcgggccgcggtggcggccgcctactcttccttgggacggagtccaagaacacgcaagtg780


ctccaaatgtgaagcaaatgcttgccaaaacgtatccttgacaaggtatggaaccttgta840


Page 44





CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.sT25


ctcgctgcaggtgttcttgatgatggccagaatatcgggataatggtgctgcgacacgtt900


ggggaacagatggtgcacagccggtagttcaagctgccagtgatgctggtccagaggtgc960


gaatcgtgtgcgtaatcctgcgtagtctcgacctgcatagctgcccagtccttttggatg1020


atcccgttctcgtcaggcaacggccactgaacttcctcaacaacgtggttcgcctggaag1080


gtcagcgccagccagtaagacgacaccatgtccgcgaccgtgaacaagagcagcaccttg1140


cccaggggcagatactgcaggggaacaatcaggcgataccagacaaagaaagccttgccg1200


ccccagaacatcacagtgtgccatgtcgagatgggattgacacgaatagcgtcattggtc1260


ttgacaaagtacaaaatgttgatgtcctgaatgcgcaccttgaacgccagcagtccgtac1320


aggaaaggaacaaacatgtgctggttgatgtggttgacaaaccacttttggttgggcttg1380


atacgacgaacatcgggctcagacgtcgacacgtcgggatctgctccagcaatgttggtg1440


taggggtgatggccgagcatatgttggtacatccacaccaggtacgatgctccgttgaaa1500


aagtcgtgcgtggctcccagaatcttccagacagtggggttgtgggtcactgaaaagtga1560


gacgcatcatgaagagggttgagtccgacttgtgcgcacgcaaatcccatgatgattgca1620


aacaccacctgaagccatgtgcgttcgacaacgaaaggcacaaagagctgcgcgtagtag1680


gaagcgatcaaggatccaaagataagagcgtatcgtccccagatctctggtctattcttg1740


ggatcaatgttccgatccgtaaagtagccctcgactctcgtcttgatggttttgtggaac1800


accgttggctccgggaagatgggcagctcattcgagaccagtgtaccgacatagtacttc1860


ttcataatggcatctgcagccccaaacgcgtgatacatctcaaagaccggagtaacatct1920


cggccagctccgagcaggagagtgtccactccaccaggatggcggctcaagaactttgtg1980


acatcgtacaccctgccgcggatggccaagagtaggtcgtccttggtgttatgggccgcc2040


agctcttcccaggtgaaggtttttccttggtccgttcccatggtgaatgattcttatact2100


cagaaggaaatgcttaacgatttcgggtgtgagttgacaaggagagagagaaaagaagag2160


gaaaggtaattcggggacggtggtcttttatacccttggctaaagtcccaaccacaaagc2220


aaaaaaattttcagtagtctattttgcgtccggcatgggttacccggatggccagacaaa2280


gaaactagtacaaagtctgaacaagcgtagattccagactgcagtaccctacgcccttaa2340


cggcaagtgtgggaaccgggggaggtttgatatgtggggtgaagggggctctcgccgggg2400


ttgggcccgctactgggtcaatttggggtcaattggggcaattggggctgttttttggga2460


cacaaatacgccgccaacccggtctctcctgaattctgcagatgggctgcaggaattccg2520


tcgtcgcctgagtcgacatcatttatttaccagttggccacaaacccttgacgatctcgt2580


atgtcccctccgacatactcccggccggctggggtacgttcgatagcgctatcggcatcg2640


acaaggtttgggtccctagccgataccgcactacctgagtcacaatcttcggaggtttag2700


tcttccacatagcacgggcaaaagtgcgtatatatacaagagcgtttgccagccacagat2760


tttcactccacacaccacatcacacatacaaccacacacatccacaatggaacccgaaac2820


taagaagaccaagactgactccaagaagattgttcttctcggcggcgacttctgtggccc2880


Page 45



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.sT25


cgaggtgattgccgaggccgtcaaggtgctcaagtctgttgctgaggcctccggcaccga2940


gtttgtgtttgaggaccgactcattggaggagctgccattgagaaggagggcgagcccat3000


caccgacgctactctcgacatctgccgaaaggctgactctattatgctcggtgctgtcgg3060


aggcgctgccaacaccgtatggaccactcccgacggacgaaccgacgtgcgacccgagca,3120


gggtctcctcaagctgcgaaaggacctgaacctgtacgccaacctgcgaccctgccagct3180


gctgtcgcccaagctcgccgatctctcccccatccgaaacgttgagggcaccgacttcat3240


cattgtccgagagctcgtcggaggtatctactttggagagcgaaaggaggatgacggatc3300


tggcgtcgcttccgacaccgagacctactccgttcctgaggttgagcgaattgcccgaat3360


ggccgccttcctggcccttcagcacaacccccctcttcccgtgtggtctcttgacaaggc3420


caacgtgctggcctcctctcgactttggcgaaagactgtcactcgagtcctcaaggacga3480


attcccccagctcgagctcaaccaccagctgatcgactcggccgccatgatcctcatcaa3540


gcagccctccaagatgaatggtatcatcatcaccaccaacatgtttggcgatatcatctc3600


cgacgaggcctccgtcatccccggttctctgggtctgctgccctccgcctctctggcttc3660


tctgcccgacaccaacgaggcgttcggtctgtacgagccctgtcacggatctgcccccga3720


tctcggcaagcagaaggtcaaccccattgccaccattctgtctgccgccatgatgctcaa3780


gttctctcttaacatgaagcccgccggtgacgctgttgaggctgccgtcaaggagtccgt3840


cgaggctggtatcactaccgccgatatcggaggctcttcctccacctccgaggtcggaga3900


cttgttgccaacaaggtcaaggagctgctcaagaaggagtaagtcgtttctacgacgcat3960


tgatggaaggagcaaactgacgcgcctgcgggttggtctaccggcagggtccgctagtgt4020


ataagactctataaaaagggccctgccctgctaatgaaatgatgatttataatttaccgg4080


tgtagcaaccttgactagaagaagcagattgggtgtgtttgtagtggaggacagtggtac4140


gttttggaaacagtcttcttgaaagtgtcttgtctacagtatattcactcataacctcaa4200


tagccaagggtgtagtcggtttattaaaggaagggagttgtggctgatgtggatagatat4260


ctttaagctggcgactgcacccaacgagtgtggtggtagcttgttactgtatattcggta4320


agatatattttgtggggttttagtggtgtttaaacggtaggttagtgcttggtatatgag4380


ttgtaggcatgacaatttggaaaggggtggactttgggaatattgtgggatttcaatacc4440


ttagtttgtacagggtaattgttacaaatgatacaaagaactgtatttcttttcatttgt4500


tttaattggttgtatatcaagtccgttagacgagctcagtgccttggcttttggcactgt4560


atttcatttttagaggtacactacattcagtgaggtatggtaaggttgagggcataatga4620


aggcaccttgtactgacagtcacagacctctcaccgagaattttatgagatatactcggg4680


ttcattttaggctcatcgattcgattcaaattaattaattcgatatcataattgtcggcc4740


gaggtctgtacggccagaaccgagatcctattgaggaggccaagcgataccagaaggctg4800


gctgggaggcttaccagaagattaactgttagaggttagactatggatatgtcatttaac4860


tgtgtatatagagagcgtgcaagtatggagcgcttgttcagcttgtatgatggtcagacg4920


Page 46



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cL2301-conversion.sT25


acctgtctgatcgagtatgtatgatactgcacaacctgtgtatccgcatgatctgtccaa4980


tggggcatgttgttgtgtttctcgatacggagatgctgggtacaagtagctaatacgatt5040


gaactacttatacttatatgaggcttgaagaaagctgacttgtgtatgacttattctcaa5100


ctacatccccagtcacaataccaccactgcactaccactacaccaaaaccatgatcaaac5160


cacccatggacttcctggaggcagaagaacttgttatggaaaagctcaagagagagaa 5218


<210> 118
<211> 33
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL69
<400> 118
agcccatctg cagaagcttc aggagagacc ggg 33
<210> 119
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL70
<400> 119
cccggtctct cctgaagctt ctgcagatgg get 33
<210> 120
<211> 33
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL77
<400> 120
tagtgagggt taattaatcg agcttggcgt aat 33
<210> 121
<211> 33
<212> DNA
<213> Artificial sequence
<220>
<223> Primer YL78
<400> 121
attacgccaa gctcgattaa ttaaccctca cta 33
<210> 122
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL79A
<400> 122
Page 47



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
CL2301-conversion.sT25
attcctgcag cccatcgatg cagaattcag gaga 34
<210> 123
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer YL80A
<400> 123
tctcctgaat tctgcatcga tgggctgcag gaat 34
<210>
124


<211>
8894


<212>
DNA


<213> ficial
Arti sequence


<220>


<223> by fragment expressionof the delta-6
8894 for integration
and


and delta-5 elongase
desaturase gene
genes
and the


<400>
124


tatcacatcacgctctcatcaagaatacttcttgagaaccgtggagaccggggttcgatt 60


ccccgtatcggagtgtttattttttgctcaaccataccctggggtgtgttctgtggagca 120


ttctcacttttggtaaacgacattgcttcaagtgcagcggaatcaaaaagtataaagtgg 180


gcagcgagtatacctgtacagactgtaggcgataactcaatccaattaccccccacaaca 240


tgactggccaaactgatctcaagactttattgaaatcagcaacaccgattctcaatgaag 300


gcacatacttcttctgcaacattcacttgacgcctaaagttggtgagaaatggaccgaca 360


agacatattctgctatccacggactgttgcctgtgtcggtggctacaatacgtgagtcag 420


aagggctgacggtggtggttcgtacgttgtgtggaattgtgagcggataacaatttcaca 480


caggaaacagctatgaccatgattacgccaagctcgaaattaaccctcactaaagggaac 540


aaaagctggagctccaccgcggacacaatatctggtcaaatttcagtttcgttacataaa 600


tcgttatgtcaaaggagtgtgggaggttaagagaattatcaccggcaaactatctgttaa 660


ttgctaggtacctctagacgtccacccgggtcgcttggcggccgaagaggccggaatctc 720


gggccgcggtggcggccgcttactgcaacttccttgccttctccttggcagcgtcggcct 780


tggcctgcttggccaacttggcgttctttctgtaaaagttgtagaagagaccgagcatgg 840


tccacatgtagaaccaaagcagagccgtgatgaagaaggggtatccggggcggccaagga 900


ccttcatggcgtacatgtcccaggaagactggaccgacatcatgcagaactgtgtcatct 960


gcgagcgcgtgatgtagaacttgatgaacgacacctgcttgaagcccaaggccgacaaga 1020


agtagtagccgtacatgatcacatggatgaacgagttcaacgcagcagagaagtaggctt 1080


caccgttgggtgcaacaaaggtgaccaaccaccagatggtgaagatggagctgtggtggt 1140


aaacgtgcaagaaggagatctggcggttgttcttcttgaggaccatgatcatggtgtcga 1200


caaactccatgatcttggagaagtagaagagccagatcatcttggccataggaagaccct 1260


tgaaggtatgatcagcagcgttctcaaacagtccatagttggcctgataagcctcgtaca 1320


Page 48





CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
c~2301-conversion.sT25
ggatcccacc gcacatgtag gcgctgatcg agaccagaca aaagttgtgc aggagcgaaa 1380
acgtcttgac ctcgaaccgc tcaaagttct tcatgatctg catgcccaca aagaccgtga 1440
ccaaataagc gagcacgatc aacagcacgt ggaacgggtt catcaacggc agctcacggg 1500
ccaaaggcga ctccaccgcg accaggaacc cacgcgtgtg atggacaatc gtggggatgt 1560
acttctcggc ctgggccacc agcgcggcct cgagaggatc gacatagggc gcggcccgga 1620
caccgatagc ggtggcaagg tccataaaca gatcttgcgg catctttgat gggaggaatg 1680
gcgcaatcga ctccatgcgg ccgctctaga actagtggat cctttgaatg attcttatac 1740
tcagaaggaa atgcttaacg atttcgggtg tgagttgaca aggagagaga gaaaagaaga 1800
ggaaaggtaa ttcggggacg gtggtctttt atacccttgg ctaaagtccc aaccacaaag 1860
caaaaaaatt ttcagtagtc tattttgcgt ccggcatggg ttacccggat ggccagacaa 1920
agaaactagt acaaagtctg aacaagcgta gattccagac tgcagtaccc tacgccctta 1980
acggcaagtg tgggaaccgg gggaggtttg atatgtgggg tgaagggggc tctcgccggg 2040
gttgggcccg ctactgggtc aatttggggt caattggggc aattggggct gttttttggg 2100
acacaaatac gccgccaacc cggtctctcc,tgaagcttgt gagcggataa caatttcaca 2160
caggaaacag ctatgaccat gattacgcca agctcgaaat taaccctcac taaagggaac 2220
aaaagctgga gctccaccgc ggacacaata tctggtcaaa tttcagtttc gttacataaa 2280
tcgttatgtc aaaggagtgt gggaggttaa gagaattatc accggcaaac tatctgttaa 2340
ttgctaggta cctctagacg 'tccacccggg tcgcttggcg gccgaagagg ccggaatctc 2400
gggccgcggt ggcggccgcc tactcttcct tgggacggag tccaagaaca cgcaagtgct 2460
ccaaatgtga agcaaatgct tgccaaaacg tatccttgac aaggtatgga accttgtact 2520
cgctgcaggt gttcttgatg atggccagaa tatcgggata atggtgctgc gacacgttgg 2580
ggaacagatg gtgcacagcc tggtagttca agctgccagt gatgctggtc cagaggtgcg 2640
aatcgtgtgc gtaatcctgc gtagtctcga cctgcatagc tgcccagtcc ttttggatga 2700
tcccgttctc gtcaggcaac ggccactgaa cttcctcaac aacgtggttc gcctggaagg 2760
tcagcgccag ccagtaagac gacaccatgt ccgcgaccgt gaacaagagc agcaccttgc 2820
ccaggggcag atactgcagg ggaacaatca ggcgatacca gacaaagaaa gccttgccgc 2880
cccagaacat cacagtgtgc catgtcgaga tgggattgac acgaatagcg tcattggtct 2940
tgacaaagta caaaatgttg atgtcctgaa tgcgcacctt gaacgccagc agtccgtaca 3000
ggaaaggaac aaacatgtgc tggttgatgt ggttgacaaa ccacttttgg ttgggcttga 3060
tacgacgaac atcgggctca gacgtcgaca cgtcgggatc tgctccagca atgttggtgt 3120
aggggtgatg gccgagcata tgttggtaca tccacaccag gtacgatgct ccgttgaaaa 3180
agtcgtgcgt ggctcccaga atcttccaga cagtggggtt gtgggtcact gaaaagtgag 3240
acgcatcatg aagagggttg agtccgactt gtgcgcacgc aaatcccatg atgattgcaa 3300
acaccacctg aagccatgtg cgttcgacaa cgaaaggcac aaagagctgc gcgtagtagg 3360
Page 49



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
c~2301iconversion.ST25
aagcgatcaa ggatccaaag ataagagcgt atcgtcccca gatctctggt ctattcttgg 3420
gatcaatgtt ccgatccgta aagtagccct cgactctcgt cttgatggtt ttgtggaaca 3480
ccgttggctc cgggaagatg ggcagctcat tcgagaccag tgtaccgaca tagtacttct 3540
tcataatggc atctgcagcc ccaaacgcgt gatacatctc aaagaccgga gtaacatctc 3600
ggccagctcc gagcaggaga gtgtccactc caccaggatg gcggctcaag aactttgtga 3660
catcgtacac cctgccgcgg atggccaaga gtaggtcgtc cttggtgtta tgggccgcca 3720
gctcttccca ggtgaaggtt tttccttggt ccgttcccat ggtgaatgat tcttatactc 3780
agaaggaaat gcttaacgat ttcgggtgtg agttgacaag gagagagaga aaagaagagg 3840
aaaggtaatt cggggacggt ggtcttttat acccttggct aaagtcccaa ccacaaagca 3900
aaaaaatttt cagtagtcta ttttgcgtcc ggcatgggtt acccggatgg ccagacaaag 3960
aaactagtac aaagtctgaa caagcgtaga ttccagactg cagtacccta cgcccttaac 4020
ggcaagtgtg ggaaccgggg gaggtttgat atgtggggtg aagggggctc tcgccggggt 4080
tgggcccgct actgggtcaa tttggggtca attggggcaa ttggggctgt tttttgggac 4140
acaaatacgc cgccaacccg gtctctcctg aattctgcag atgggctgca ggaattccgt 4200
cgtcgcctga gtcgacatca tttatttacc agttggccac aaacccttga cgatctcgta 4260
tgtcccctcc gacatactcc cggccggctg gggtacgttc gatagcgcta tcggcatcga 4320
caaggtttgg gtccctagcc gataccgcac tacctgagtc acaatcttcg gaggtttagt 4380
cttccacata gcacgggcaa aagtgcgtat atatacaaga gcgtttgcca gccacagatt 4440
ttcactccac acaccacatc acacatacaa ccacacacat ccacaatgga acccgaaact 4500
aagaagacca agactgactc caagaagatt gttcttctcg gcggcgactt ctgtggcccc 4560
gaggtgattg ccgaggccgt caaggtgctc aagtctgttg ctgaggcctc cggcaccgag 4620
tttgtgtttg aggaccgact cattggagga gctgccattg agaaggaggg cgagcccatc . 4680
accgacgcta ctctcgacat ctgccgaaag gctgactcta ttatgctcgg tgctgtcgga 4740
ggcgctgcca acaccgtatg gaccactccc gacggacgaa ccgacgtgcg acccgagcag 4800
ggtctcctca agctgcgaaa ggacctgaac ctgtacgcca acctgcgacc ctgccagctg 4860
ctgtcgccca agctcgccga tctctccccc atccgaaacg ttgagggcac cgacttcatc 4920
attgtccgag agctcgtcgg aggtatctac tttggagagc gaaaggagga tgacggatct 4980
ggcgtcgctt ccgacaccga gacctactcc gttcctgagg ttgagcgaat tgcccgaatg 5040
gccgccttcc tggcccttca gcacaacccc cctcttcccg tgtggtctct tgacaaggcc 5100
aacgtgctgg cctcctctcg actttggcga aagactgtca ctcgagtcct caaggacgaa 5160
ttcccccagc tcgagctcaa ccaccagctg atcgactcgg ccgccatgat cctcatcaag 5220
cagccctcca agatgaatgg tatcatcatc accaccaaca tgtttggcga tatcatctcc 5280
gacgaggcct ccgtcatccc cggttctctg ggtctgctgc cctccgcctc tctggcttct 5340
ctgcccgaca ccaacgaggc gttcggtctg tacgagccct gtcacggatc tgcccccgat 5400
Page 50



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
c~2301-conversion.5T25
ctcggcaagcagaaggtcaaccccattgccaccattctgtctgccgccatgatgctcaag5460


ttctctcttaacatgaagcccgccggtgacgctgttgaggctgccgtcaaggagtccgtc5520


gaggctggtatcactaccgccgatatcggaggctcttcctccacctccgaggtcggagac5580


ttgttgccaacaaggtcaaggagctgctcaagaaggagtaagtcgtttctacgacgcatt5640


gatggaaggagcaaactgacgcgcctgcgggttggtctaccggcagggtccgctagtgta5700


taagactctataaaaagggccctgccctgctaatgaaatgatgatttataatttaccggt5760


gtagcaaccttgactagaagaagcagattgggtgtgtttgtagtggaggacagtggtacg5820


ttttggaaacagtcttcttgaaagtgtcttgtctacagtatattcactcataacctcaat5880


agccaagggtgtagtcggtttattaaaggaagggagttgtggctgatgtggatagatatc5940


tttaagctggcgactgcacccaacgagtgtggtggtagcttgttactgtatattcggtaa6000


gatatattttgtggggttttagtggtgtttaaacggtaggttagtgcttggtatatgagt6060


tgtaggcatgacaatttggaaaggggtggactttgggaatattgtgggatttcaatacct6120


tagtttgtacagggtaattgttacaaatgatacaaagaactgtatttcttttcatttgtt6180


ttaattggttgtatatcaagtccgttagacgagctcagtgccttggcttttggcactgta6240


tttcatttttagaggtacactacattcagtgaggtatggtaaggttgagggcataatgaa6300


ggcaccttgtactgacagtcacagacctctcaccgagaattttatgagatatactcgggt6360


tcattttaggctcatcgatgcagaattcaggagagaccgggttggcggcgtatttgtgtc6420


ccaaaaaacagccccaattgccccaattgaccccaaattgacccagtagcgggcccaacc6480


ccggcgagagcccccttcaccccacatatcaaacctcccccggttcccacacttgccgtt6540


aagggcgtagggtactgcagtctggaatctacgcttgttcagactttgtactagtttctt6600


tgtctggccatccgggtaacccatgccggacgcaaaatagactactgaaaatttttttgc6660


tttgtggttgggactttagccaagggtataaaagaccaccgtccccgaattacctttcct6720


cttcttttctctctctccttgtcaactcacacccgaaatcgttaagcatttccttctgag6780


tataagaatcattcaccatggctgctgctcccagtgtgaggacgtttactcgggccgagg6840


ttttgaatgccgaggctctgaatgagggcaagaaggatgccgaggcacccttcttgatga6900


tcatcgacaacaaggtgtacgatgtccgcgagttcgtccctgatcatcccggtggaagtg6960


tgattctcacgcacgttggcaaggacggcactgacgtctttgacacttttcaccccgagg7020


ctgcttgggagactcttgccaacttttacgttggtgatattgacgagagcgaccgcgata7080


tcaagaatgatgactttgcggccgaggtccgcaagctgcgtaccttgttccagtctcttg7140


gttactacgattcttccaaggcatactacgccttcaaggtctcgttcaacctctgcatct7200


ggggtttgtcgacggtcattgtggccaagtggggccagacctcgaccctcgccaacgtgc7260


tctcggctgcgcttttgggtctgttctggcagcagtgcggatggttggctcacgactttt7320


tgcatcaccaggtcttccaggaccgtttctggggtgatcttttcggcgccttcttgggag7380


gtgtctgccagggcttctcgtcctcgtggtggaaggacaagcacaacactcaccacgccg7440


Page 51





CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cLZ301-conversion.ST25
cccccaacgtccacggcgaggatcccgacattgacacccaccctctgttgacctggagtg7500


agcatgcgttggagatgttctcggatgtcccagatgaggagctgacccgcatgtggtcgc7560


gtttcatggtcctgaaccagacctggttttacttccccattctctcgtttgcccgtctct7620
v


.
cctggtgcctccagtccattctctttgtgctgcctaacggtcaggcccacaagccctcgg7680


gcgcgcgtgtgcccatctcgttggtcgagcagctgtcgcttgcgatgcactggacctggt7740


acctcgccaccatgttcctgttcatcaaggatcccgtcaacatgctggtgtactttttgg7800


tgtcgcaggcggtgtgcggaaacttgttggccatcgtgttctcgctcaaccacaacggta7860


tgcctgtgatctcgaggaggaggcggtcgatatggatttcttcacgaagcagatcatcac7920


gggtcgtgatgtccacccgggtctatttgccaactggttcacgggtggattgaactatca7980


gatcgagcaccacttgttcccttcgatgcctcgccacaacttttcaaagatccagcctgc8040


tgtcgagaccctgtgcaaaaagtacaatgtccgataccacaccaccggtatgatcgaggg8100


aactgcagaggtctttagccgtctgaacgaggtctccaaggctacctccaagatgggtaa8160


ggcgcagtaagcggccgccaccgcggcccgagattccggcctcttcggccgccaagcgac8220


ccgggtggacgtctagaggtacctagcaattaacagatagtttgccggtgataattctct8280


taacctcccacactcctttgacataacgatttatgtaacgaaactgaaatttgaccagat8340


attgtgtccgcggtggagctccagcttttgttccctttagtgagggttaattaattcgat8400


atcataattgtcggccgaggtctgtacggccagaaccgagatcctattgaggaggccaag8460


cgataccagaaggctggctgggaggcttaccagaagattaactgttagaggttagactat8520


ggatatgtcatttaactgtgtatatagagagcgtgcaagtatggagcgcttgttcagctt8580


gtatgatggtcagacgacctgtctgatcgagtatgtatgatactgcacaacctgtgtatc8640


cgcatgatctgtccaatggggcatgttgttgtgtttctcgatacggagatgctgggtaca8700


agtagctaatacgattgaactacttatacttatatgaggcttgaagaaagctgacttgtg8760


tatgacttattctcaactacatccccagtcacaataccaccactgcactaccactacacc8820


aaaaccatgatcaaaccacccatggacttcctggaggcagaagaacttgttatggaaaag8880


ctcaagagagagaa 8894


<210> 125
<211> 31
<212> DNA
<213> Artificial Sepuence
<220>
<223> Primer YL101
<400> 125
gagcttggcg taatcgatgg tcatagctgt t 31
<210> 126
<211> 31
<212> DNA
<213> Artificial Sepuence
Page 52



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
c~2301-conversion.5T25
<220>
<223> Primer Y~102
<400> 126
aacagctatg accatcgatt acgccaagct c 31
<210> 127
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer Y~103
<400> 127
atgatgactc aggcgtttaa acgacggaat tcctgc 36
<210> 128
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer Y~104
<400> 128
gcaggaattc cgtcgtttaa acgcctgagt catcat 36
<210>
129


<211>
10328


<212>
DNA


<213>
Artificial
sequence


<220>


<223> n of the -6,
10328 delta
by fragment
for integration
and expressio


delta-5, elongase
and delta-17~.desaturase gene
genes
and the


<400>
129


tatcacatcacgctctcatcaagaatacttcttgagaaccgtggagaccggggttcgatt 60


ccccgtatcggagtgtttattttttgctcaaccataccctggggtgtgttctgtggagca 120


ttctcacttttggtaaacgacattgcttcaagtgcagcggaatcaaaaagtataaagtgg 180


gcagcgagtatacctgtacagactgtaggcgataactcaatccaattaccccccacaaca 240


tgactggccaaactgatctcaagactttattgaaatcagcaacaccgattctcaatgaag 300


gcacatacttcttctgcaacattcacttgacgcctaaagttggtgagaaatggaccgaca 360


agacatattctgctatccacggactgttgcctgtgtcggtggctacaatacgtgagtcag 420


aagggctgacggtggtggttcgtacgttgtgtggaattgtgagcggataacaatttcaca 480


caggaaacagctatgaccatgattacgccaagctcgaaattaaccctcactaaagggaac 540


aaaagctggagctccaccgcggacacaatatctggtcaaatttcagtttcgttacataaa 600


tcgttatgtcaaaggagtgtgggaggttaagagaattatcaccggcaaactatctgttaa 660


ttgctaggtacctctagacgtccacccgggtcgcttggcggccgaagaggccggaatctc 720


gggccgcggtggcggccgcttactgcaacttccttgccttctccttggcagcgtcggcct 780


Page 53



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
cL2301-conversion.sT25


tggcctgcttggccaacttggcgttctttctgtaaaagttgtagaagagaccgagcatgg840


tccacatgtagaaccaaagcagagccgtgatgaagaaggggtatccggggcggccaagga900


ccttcatggcgtacatgtcccaggaagactggaccgacatcatgcagaactgtgtcatct960


gcgagcgcgtgatgtagaacttgatgaacgacacctgcttgaagcccaaggccgacaaga1020


agtagtagccgtacatgatcacatggatgaacgagttcaacgcagcagagaagtaggctt1080


caccgttgggtgcaacaaaggtgaccaaccaccagatggtgaagatggagctgtggtggt1140


aaacgtgcaagaaggagatctggcggttgttcttcttgaggaccatgatcatggtgtcga1200


caaactccatgatcttggagaagtagaagagccagatcatcttggccataggaagaccct1260


tgaaggtatgatcagcagcgttctcaaacagtccatagttggcctgataagcctcgtaca1320


ggatcccaccgcacatgtaggcgctgatcgagaccagacaaaagttgtgcaggagcgaaa1380


acgtcttgacctcgaaccgctcaaagttcttcatgatctgcatgcccacaaagaccgtga1440


ccaaataagcgagcacgatcaacagcacgtggaacgggttcatcaacggcagctcacggg1500


ccaaaggcgactccaccgcgaccaggaacccacgcgtgtgatggacaatcgtggggatgt1560


acttctcggcctgggccaccagcgcggcctcgagaggatcgacatagggcgcggcccgga1620


caccgatagcggtggcaaggtccataaacagatcttgcggcatctttgatgggaggaatg1680


gcgcaatcgactccatgcggccgctctagaactagtggatcctttgaatgattcttatac1740


tcagaaggaaatgcttaacgatttcgggtgtgagttgacaaggagagagagaaaagaaga1800


ggaaaggtaattcggggacggtggtcttttatacccttggctaaagtcccaaccacaaag1860


caaaaaaattttcagtagtctattttgcgtccggcatgggttacccggatggccagacaa1920


agaaactagtacaaagtctgaacaagcgtagattccagactgcagtaccctacgccctta1980


acggcaagtgtgggaaccgggggaggtttgatatgtggggtgaagggggctctcgccggg2040


gttgggcccgctactgggtcaatttggggtcaattggggcaattggggctgttttttggg2100


acacaaatacgccgccaacccggtctctcctgaagcttgtgagcggataacaatttcaca2160


caggaaacagctatgaccatgattacgccaagctcgaaattaaccctcactaaagggaac2220


aaaagctggagctccaccgcggacacaatatctggtcaaatttcagtttcgttacataaa2280


tcgttatgtcaaaggagtgtgggaggttaagagaattatcaccggcaaactatctgttaa2340


ttgctaggtacctctagacgtccacccgggtcgcttggcggccgaagaggccggaatctc2400


gggccgcggtggcggccgcctactcttccttgggacggagtccaagaacacgcaagtgct2460


ccaaatgtgaagcaaatgcttgccaaaacgtatccttgacaaggtatggaaccttgtact2520


cgctgcaggtgttcttgatgatggccagaatatcgggataatggtgctgcgacacgttgg2580


ggaacagatggtgcacagcctggtagttcaagctgccagtgatgctggtccagaggtgcg2640


aatcgtgtgcgtaatcctgcgtagtctcgacctgcatagctgcccagtccttttggatga2700


tcccgttctcgtcaggcaacggccactgaacttcctcaacaacgtggttcgcctggaagg2760


tcagcgccagccagtaagacgacaccatgtccgcgaccgtgaacaagagcagcaccttgc2820


Page 54



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
c~2301-conversion.ST25


ccaggggcagatactgcaggggaacaatcaggcgataccagacaaagaaagccttgccgc2880


cccagaacatcacagtgtgccatgtcgagatgggattgacacgaatagcgtcattggtct2940


tgacaaagtacaaaatgttgatgtcctgaatgcgcaccttgaacgccagcagtccgtaca3000


ggaaaggaacaaacatgtgctggttgatgtggttgacaaaccacttttggttgggcttga3060


tacgacgaacatcggg.ctcagacgtcgacacgtcgggatctgctccagcaatgttggtgt3120


aggggtgatggccgagcatatgttggtacatccacaccaggtacgatgctccgttgaaaa3180


agtcgtgcgtggctcccagaatcttccagacagtggggttgtgggtcactgaaaagtgag3240


acgcatcatgaagagggttgagtccgacttgtgcgcacgcaaatcccatgatgattgcaa3300


acaccacctgaagccatgtgcgttcgacaacgaaaggcacaaagagctgcgcgtagtagg3360


aagcgatcaaggatccaaagataagagcgtatcgtccccagatctctggtctattcttgg3420


gatcaatgttccgatccgtaaagtagccctcgactctcgtcttgatggttttgtggaaca3480


ccgttggctccgggaagatgggcagctcattcgagaccagtgtaccgacatagtacttct3540


tcataatggcatctgcagccccaaacgcgtgatacatctcaaagaccggagtaacatctc3600


ggccagctccgagcaggagagtgtccactccaccaggatggcggctcaagaactttgtga3660


catcgtacaccctgccgcggatggccaagagtaggtcgtccttggtgttatgggccgcca3720


gctcttcccaggtgaaggtttttccttggtccgttcccatggtgaatgattcttatactc3780


agaaggaaatgcttaacgatttcgggtgtgagttgacaaggagagagagaaaagaagagg3840


aaaggtaattcggggacggtggtcttttatacccttggctaaagtcccaaccacaaagca3900


aaaaaattttcagtagtctattttgcgtccggcatgggttacccggatggccagacaaag3960


aaactagtacaaagtctgaacaagcgtagattccagactgcagtaccctacgcccttaac4020


ggcaagtgtgggaaccgggggaggtttgatatgtggggtgaagggggctctcgccggggt4080


tgggcccgctactgggtcaatttggggtcaattggggcaattggggctgttttttgggac4140


acaaatacgccgccaacccggtctctcctgaattctgcagatgggctgcaggaattccgt4200


cgtcgcctgagtcgacatcatttatttaccagttggccacaaacccttgacgatctcgta4260


tgtcccctccgacatactcccggccggctggggtacgttcgatagcgctatcggcatcga4320


caaggtttgggtccctagcc'gataccgcactacctgagtcacaatcttcggaggtttagt4380


cttccacatagcacgggcaaaagtgcgtatatatacaagagcgtttgccagccacagatt4440


ttcactccacacaccacatcacacatacaaccacacacatccacaatggaacccgaaact4500


aagaagaccaagactgactccaagaagattgttcttctcggcggcgacttctgtggcccc4560


gaggtgattgccgaggccgtcaaggtgctcaagtctgttgctgaggcctccggcaccgag4620


tttgtgtttgaggaccgactcattggaggagctgccattgagaaggagggcgagcccatc4680


accgacgctactctcgacatctgccgaaaggctgactctattatgctcggtgctgtcgga4740


ggcgctgccaacaccgtatggaccactcccgacggacgaaccgacgtgcgacccgagcag4800


ggtctcctcaagctgcgaaaggacctgaacctgtacgccaacctgcgaccctgccagctg4860


Page 55



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
02301-conversion.ST25


ctgtcgcccaagctcgccgatctctcccccatccgaaacgttgagggcaccgacttcatc4920


attgtccgagagctcgtcggaggtatctactttggagagcgaaaggaggatgacggatct4980


ggcgtcgcttccgacaccgagacctactccgttcctgaggttgagcgaattgcccgaatg5040


gccgccttcctggcccttcagcacaacccccctcttcccgtgtggtctcttgacaaggcc5100


aacgtgctggcctcctctcgactttggcgaaagactgtcactcgagtcctcaaggacgaa5160


ttcccccagctcgagctcaaccaccagctgatcgactcggccgccatgatcctcatcaag5220


cagccctccaagatgaatggtatcatcatcaccaccaacatgtttggcgatatcatctcc5280


gacgaggcctccgtcatccccggttctctgggtctgctgccctccgcctctctggcttct5340


ctgcccgacaccaacgaggcgttcggtctgtacgagccctgtcacggatctgcccccgat5400


ctcggcaagcagaaggtcaaccccattgccaccattctgtctgccgccatgatgctcaag5460


ttctctcttaacatgaagcccgccggtgacgctgttgagg,ctgccgtcaaggagtccgtc5520


gaggctggtatcactaccgccgatatcggaggctcttcctccacctccgaggtcggagac5580


ttgttgccaacaaggtcaaggagctgctcaagaaggagtaagtcgtttctacgacgcatt5640


gatggaaggagcaaactgacgcgcctgcgggttggtctaccggcagggtccgctagtgta5700


taagactctataaaaagggccctgccctgctaatgaaatgatgatttataatttaccggt5760


gtagcaaccttgactagaagaagcagattgggtgtgtttgtagtggaggacagtggtacg5820


ttttggaaacagtcttcttgaaagtgtcttgtctacagtatattcactcataacctcaat5880


agccaagggtgtagtcggtttattaaaggaagggagttgtggctgatgtggatagatatc5940


tttaagctggcgactgcacccaacgagtgtggtggtagcttgttactgtatattcggtaa6000


gatatattttgtggggttttagtggtgtttaaacgacggaattcctgcagcccatctgca6060


gaattcaggagagaccgggttggcggcgtatttgtgtcccaaaaaacagccccaattgcc6120


ccaattgaccccaaattgacccagtag,cgggcccaaccccggcgagagcccccttcaccc6180


cacatatcaaacctcccccggttcccacacttgccgttaagggcgtagggtactgcagtc6240


tggaatctacgcttgttcagactttgtactagtttctttgtctggccatccgggtaaccc6300


atgccggacgcaaaatagactactgaaaatttttttgctttgtggttgggactttagcca6360


agggtataaaagaccaccgtccccgaattacctttcctcttcttttctctctctccttgt6420


caactcacacccgaaatcgttaagcatttccttctgagtataagaatcattcaccatggc6480


tgaggataagaccaaggtcgagttccctaccctgactgagctgaagcactctatccctaa6540


cgcttgctttgagtccaacctcggactctcgctctactacactgcccgagcgatcttcaa6600


cgcatctgcctctgctgctctgctctacgctgcccgatctactcccttcattgccgataa6660


cgttctgctccacgctctggtttgcgccacctacatctacgtgcagggtgtcatcttctg6720


gggtttctttaccgtcggtcacgactgtggtcactctgccttctcccgataccactccgt6780


caacttcatcattggctgcatcatgcactctgccattctgactcccttcgagtcctggcg6840


agtgacccaccgacaccatcacaagaacactggcaacattgataaggacgagatcttcta6900


Page 5 6





CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
02301-conversion.ST25


ccctcatcggtccgtcaaggacctccaggacgtgcgacaatgggtctacaccctcggagg6960


tgcttggtttgtctacctgaaggtcggatatgctcctcgaaccatgtcccactttgaccc7020


ctgggaccctctcctgcttcgacgagcctccgctgtcatcgtgtccctcggagtctgggc7080


tgccttcttcgctgcctacgcctacctcacatactcgctcggctttgccgtcatgggcct7140


ctactactatgctcctctctttgtctttgcttcgttcctcgtcattactaccttcttgca7200


tcacaacgacgaagctactccctggtacggtgactcggagtggacctacgtcaagggcaa7260


cctgagctccgtcgaccgatcgtacggagctttcgtggacaacctgtctcaccacattgg7320


cacccaccaggtccatcacttgttccctatcattccccactacaagctcaacgaagccac7380


caagcactttgctgccgcttaccctcacctcgtgagacgtaacgacgagcccatcattac7440


tgccttcttcaagaccgctcacctctttgtcaactacggagctgtgcccgagactgctca7500


gattttcaccctcaaagagtctgccgctgcagccaaggccaagagcgaccaccaccatca7560


ccaccattaagcggccgccaccgcggcccgagattccggcctcttcggccgccaagcgac7620


ccgggtggacgtctagaggtacctagcaattaacagatagtttgccggtgataattctct7680


taacctcccacactcctttgacataacgatttatgtaacgaaactgaaatttgaccagat7740


attgtgtccgcggtggagctccagcttttgttccctttagtgagggttaatttcgagctt7800


ggcgtaatcgatgcagaattcaggagagaccgggttggcggcgtatttgtgtcccaaaaa7860


acagccccaattgccccaattgaccccaaattgacccagtagcgggcccaaccccggcga7920


gagcccccttcaccccacatatcaaacctcccccggttcccacacttgccgttaagggcg7980


tagggtactgcagtctggaatctacgcttgttcagactttgtactagtttctttgtctgg8040


ccatccgggtaacccatgccggacgcaaaatagactactgaaaatttttttgctttgtgg8100


ttgggactttagccaagggtataaaagaccaccgtccccgaattacctttcctcttcttt8160
'


tctctctctccttgtcaactcacacccgaaatcgttaagcatttccttctgagtataaga8220


atcattcaccatggctgctgctcccagtgtgaggacgtttactcgggccgaggttttgaa8280


tgccgaggctctgaatgagggcaagaaggatgccgaggcacccttcttgatgatcatcga8340


caacaaggtgtacgatgtccgcgagttcgtccctgatcatcccggtggaagtgtgattct8400


cacgcacgttggcaaggacggcactgacgtctttgacacttttcaccccgaggctgcttg8460


ggagactcttgccaacttttacgttggtgatattgacgagagcgaccgcgatatcaagaa8520


tgatgactttgcggccgaggtccgcaagctgcgtaccttgttccagtctcttggttacta8580


cgattcttccaaggcatactacgccttcaaggtctcgttcaacctctgcatctggggttt8640


gtcgacggtcattgtggccaagtggggccagacctcgaccctcgccaacgtgctctcggc8700


tgcgcttttgggtctgttctggcagcagtgcggatggttggctcacgactttttgcatca8760


ccaggtcttccaggaccgtttctggggtgatcttttcggcgccttcttgggaggtgtctg8820


ccagggcttctcgtcctcgtggtggaaggacaagcacaacactcaccacgccgcccccaa8880


cgtccacggcgaggatcccgacattgacacccaccctctgttgacctggagtgagcatgc8940


Page 57



CA 02523786 2005-10-26
WO 2004/104167 PCT/US2004/014542
c~2301-conversion.ST25
gttggagatg ttctcggatg tcccagatga ggagctgacc cgcatgtggt cgcgtttcat 9000
ggtcctgaac cagacctggt tttacttccc cattctctcg tttgcccgtc tctcctggtg 9060
cctccagtcc attctctttg tgctgcctaa cggtcaggcc cacaagccct cgggcgcgcg 9120
tgtgcccatc tcgttggtcg agcagctgtc gcttgcgatg cactggacct ggtacctcgc 9180
caccatgttc ctgttcatca aggatcccgt caacatgctg gtgtactttt tggtgtcgca 9240
ggcggtgtgc ggaaacttgt tggccatcgt gttctcgctc aaccacaacg gtatgcctgt 9300
gatctcgaag gaggaggcgg tcgatatgga tttcttcacg aagcagatca tcacgggtcg 9360
tgatgtccac ccgggtctat ttgccaactg gttcacgggt ggattgaact atcagatcga 9420
gcaccacttg ttcccttcga tgcctcgcca caacttttca aagatccagc ctgctgtcga 9480
gaccctgtgc aaaaagtaca atgtccgata ccacaccacc ggtatgatcg agggaactgc 9540
agaggtcttt agccgtctga acgaggtctc caaggctacc tccaagatgg gtaaggcgca 9600
gtaagcggcc gccaccgcgg cccgagattc cggcctcttc ggccgccaag cgacccgggt 9660
ggacgtctag aggtacctag caattaacag atagtttgcc ggtgataatt ctcttaacct 9720
cccacactcc tttgacataa cgatttatgt aacgaaactg 'aaatttgacc agatattgtg 9780
tccgcggtgg agctccagct tttgttccct ttagtgaggg ttaattaatt cgatatcata 9840
attgtcggcc gaggtctgta cggccagaac cgagatccta ttgaggaggc caagcgatac 9900
cagaaggctg gctgggaggc ttaccagaag attaactgtt agaggttaga ctatggatat 9960
gtcatttaac tgtgtatata gagagcgtgc aagtatggag cgcttgttca gcttgtatga 10020
tggtcagacg acctgtctga tcgagtatgt atgatactgc acaacctgtg tatccgcatg 10080
atctgtccaa tggggcatgt tgttgtgttt ctcgatacgg agatgctggg tacaagtagc 10140_
taatacgatt gaactactta tacttatatg aggcttgaag aaagctgact tgtgtatgac 10200 ,
ttattctcaa ctacatcccc agtcacaata ccaccactgc actaccacta caccaaaacc 10260
atgatcaaac cacccatgga cttcctggag gcagaagaac ttgttatgga aaagctcaag 10320
agagagaa 10328
<210> 130
<211> 10
<212> DNA
<213> Yarrowia lipolytica
<220>
<221> misc_feature
<222> (8)..(8)
<223> n is a, c, g, or t
<400> 130
mammatgnhs
Page 58

Representative Drawing
A single figure which represents the drawing illustrating the invention.
Administrative Status

For a clearer understanding of the status of the application/patent presented on this page, the site Disclaimer , as well as the definitions for Patent , Administrative Status , Maintenance Fee  and Payment History  should be consulted.

Administrative Status

Title Date
Forecasted Issue Date Unavailable
(86) PCT Filing Date 2004-05-07
(87) PCT Publication Date 2004-12-02
(85) National Entry 2005-10-26
Dead Application 2008-05-07

Abandonment History

Abandonment Date Reason Reinstatement Date
2007-05-07 FAILURE TO PAY APPLICATION MAINTENANCE FEE

Payment History

Fee Type Anniversary Year Due Date Amount Paid Paid Date
Application Fee $400.00 2005-10-26
Maintenance Fee - Application - New Act 2 2006-05-08 $100.00 2005-10-26
Registration of a document - section 124 $100.00 2006-02-15
Owners on Record

Note: Records showing the ownership history in alphabetical order.

Current Owners on Record
E.I. DU PONT DE NEMOURS AND COMPANY
Past Owners on Record
YADAV, NARENDRA S.
ZHANG, HONGXIANG
Past Owners that do not appear in the "Owners on Record" listing will appear in other documentation within the application.
Documents

To view selected files, please enter reCAPTCHA code :



To view images, click a link in the Document Description column. To download the documents, select one or more checkboxes in the first column and then click the "Download Selected in PDF format (Zip Archive)" or the "Download Selected as Single PDF" button.

List of published and non-published patent-specific documents on the CPD .

If you have any difficulty accessing content, you can call the Client Service Centre at 1-866-997-1936 or send them an e-mail at CIPO Client Service Centre.


Document
Description 
Date
(yyyy-mm-dd) 
Number of pages   Size of Image (KB) 
Cover Page 2006-02-08 1 37
Description 2005-10-26 136 6,727
Drawings 2005-10-26 16 532
Claims 2005-10-26 4 157
Abstract 2005-10-26 1 60
Prosecution-Amendment 2006-02-02 1 28
PCT 2005-10-27 6 257
PCT 2005-10-26 17 632
Assignment 2005-10-26 3 107
Prosecution-Amendment 2005-10-26 3 91
Correspondence 2006-01-05 1 28
Assignment 2006-02-15 5 194
Correspondence 2006-08-25 1 29
Prosecution-Amendment 2006-07-17 1 61
Prosecution-Amendment 2006-03-15 2 43

Biological Sequence Listings

Choose a BSL submission then click the "Download BSL" button to download the file.

If you have any difficulty accessing content, you can call the Client Service Centre at 1-866-997-1936 or send them an e-mail at CIPO Client Service Centre.

Please note that files with extensions .pep and .seq that were created by CIPO as working files might be incomplete and are not to be considered official communication.

BSL Files

To view selected files, please enter reCAPTCHA code :