Language selection

Search

Patent 2350775 Summary

Third-party information liability

Some of the information on this Web page has been provided by external sources. The Government of Canada is not responsible for the accuracy, reliability or currency of the information supplied by external sources. Users wishing to rely upon this information should consult directly with the source of the information. Content provided by external sources is not subject to official languages, privacy and accessibility requirements.

Claims and Abstract availability

Any discrepancies in the text and image of the Claims and Abstract are due to differing posting times. Text of the Claims and Abstract are posted:

  • At the time the application is open to public inspection;
  • At the time of issue of the patent (grant).
(12) Patent Application: (11) CA 2350775
(54) English Title: CHLAMYDIA PNEUMONIAE GENOME SEQUENCE
(54) French Title: SEQUENCE GENOMIQUE DE CHLAMYDIA PNEUMONIAE
Status: Dead
Bibliographic Data
(51) International Patent Classification (IPC):
  • C12N 15/31 (2006.01)
  • C07K 14/295 (2006.01)
  • C07K 16/12 (2006.01)
  • A61K 38/00 (2006.01)
  • A61K 39/00 (2006.01)
(72) Inventors :
  • STEPHENS, RICHARD (United States of America)
  • MITCHELL, WAYNE (United States of America)
  • KALMAN, SUE (United States of America)
  • DAVIS, RONALD (United States of America)
(73) Owners :
  • THE REGENTS OF THE UNIVERSITY OF CALIFORNIA (United States of America)
(71) Applicants :
  • THE REGENTS OF THE UNIVERSITY OF CALIFORNIA (United States of America)
(74) Agent: FETHERSTONHAUGH & CO.
(74) Associate agent:
(45) Issued:
(86) PCT Filing Date: 1999-11-12
(87) Open to Public Inspection: 2000-05-18
Availability of licence: N/A
(25) Language of filing: English

Patent Cooperation Treaty (PCT): Yes
(86) PCT Filing Number: PCT/US1999/026923
(87) International Publication Number: WO2000/027994
(85) National Entry: 2001-05-11

(30) Application Priority Data:
Application No. Country/Territory Date
60/108,279 United States of America 1998-11-12
60/128,606 United States of America 1999-04-08

Abstracts

English Abstract




<i>C. pneumoniae</i> genome sequence and analysis of the encoded polypeptides
and RNAs are provided. The <i>C. pneumoniae</i> gene nucleic acid compositions
find use in identifying homologous or related proteins and the DNA sequences
encoding such proteins; in producing compositions that modulate the expression
or function of the protein; and in studying associated physiological pathways.
In addition, modulation of the gene activity <i>in vivo</i> is used for
prophylactic and therapeutic purposes, such as identification of cell type
based on expression, and the like.


French Abstract

L'invention concerne la séquence génomique de <i>Chlamydia pneumoniae</i> et l'analyse des polypeptides et des ARN codés. Les compositions d'acide nucléique du gène de <i>Chlamydia pneumoniae</i> permettent d'identifier des protéines homologues ou associées ainsi que les séquences d'ADN codant pour ces protéines, de produire des compositions modulant l'expression ou la fonction de la protéine, et d'étudier les mécanismes physiologiques associés. En outre, la modulation de l'activité génique <i>in vivo</i> est utilisée à des fins prophylactiques et thérapeutiques, telles que l'identification du type cellulaire sur la base de l'expression et analogues.

Claims

Note: Claims are shown in the official language in which they were submitted.





What is Claimed is:
1. An isolated nucleic acid encoding a C. pneumoniae protein as set
forth in Table 3.
2. The isolated nucleic acid of Claim 1, wherein said nucleic acid has
a nucleotide sequence of an open reading frame in SEQ ID NO:1.
3. A probe comprising a hybridizing fragment of an isolated nucleic
acid according to Claim 2.
5. An isolated nucleic acid that hybridizes under stringent conditions
to the nucleic acid sequence of Claim 2.
6. An expression cassette comprising a transcriptional initiation
region functional in an expression host, a nucleic acid having a sequence of
the isolated
nucleic acid according to Claim 1 under the transcriptional regulation of said
transcriptional initiation region, and a transcriptional termination region
functional in said
expression host.
7. A cell comprising an expression cassette according to Claim 6 as
part of an extrachromosomal element or integrated into the genome of a host
cell as a
result of introduction of said expression cassette into said host cell, and
the cellular
progeny of said host cell.
comprising:
8. A method for producing a C. pneumoniae protein, said method
growing a cell according to Claim 7, whereby said C. pneumoniae protein
is expressed; and
isolating said C. pneumoniae protein free of other proteins.



124



9. A purified polypeptide composition comprising at least 50 weight
% of the protein present as a C. pneumoniae protein comprising an amino acid
sequence
of claim 1.
10. A monoclonal antibody binding specifically to the polypeptide of
Claim 9.



125

Description

Note: Descriptions are shown in the official language in which they were submitted.


CA 02350775 2001-05-11
DEMANDES OU BREVETS VOLUMINEUX
LA PRESENTS PART1E DE CETTE DEMANDS OU CE BREVET
COMPREND PLUS D'UN TOME.
CECI EST LE TOME _ ~'DE c1
NOTE. Pour les tomes additionels, veuillez contacter le Bureau canadien des
brevets
JUMBO APPLICAT10NS1PATENTS
THIS SECTION OF THE APPUCATION/PATENT CONTAINS MORE
THAN ONE VOLUME
THIS IS VOLUME -O>=
- . -
WOTE_ For additional volumes please contact'the Canadian Patent Offfice
:~. ..


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
CHLAMYDIA PNEUMONIAE GENOME SEQUENCE
CROSS-REFERENCES TO RELATED APPLICATIONS
The present application is related to 60/128,606, filed April 8, 1999 and
60/108,279, filed November 12, 1998, which are incorporated herein by
reference.
STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT
FIELD OF THE INVENTION
This invention relates to nucleic acids and polypeptides from Chlamydia
pneumoniae and to their use in the diagnosis, prevention and treatment of
diseases
associated with C. pneumoniae.
BACKGROUND OF THE INVENTION
Chlamydiaceae is a family of obligate intracellular parasite with a tropism
for epithelial cells lining the mucus membranes. The bacteria have two
morphologically
distinct forms, "elementary body" and "reticulate body". The elementary body
is the
infectious form, and has a rigid cell wall, primarily of crass-linked outer
membrane
proteins. The reticulate body is the intracellular, metabolically active form.
A unique
developmental cycle between these two forms characterizes Chlamydia growth.
C. pneumoniae is a human respiratory pathogen that causes acute
respiratory disease, and approximately 10% of community-acquired pneumonia.
Antibody prevalence studies have shown that virtually everyone is infected
with C.
pneumoniae at some time, and that reinfection is common. In addition to
respiratory
disease, studies have shown an association of this organism with coronary
artery disease.
It has been demonstrated in atherosclerotic lesions of the aorta and coronary
arteries by
immunocytochemistry and by polymerase chain reaction (Kuo et al. (1993) J
Infect Dis
167(4):841-849).
Recent reports have further demonstrated the presence of C. pneumoniae
in the walls of abdominal aortic aneurysms (Juvonen et al. (1997) J Vasc Sure
25(3):499-505). Abdominal aortic aneurysms are frequently associated with
atherosclerosis, and inflammation may be an important factor in aneurysmal
dilatation.


CA 02350775 2001-05-11
WO 00/27994 PCT/US99I26923
C. pneumoniae may play a role in maintaining an inflammation and triggering
the
development of aortic aneurysms.
Muhlestein et al. (1996) JACC 27:1555-61, reported a differential
incidence of Chlamydia species within the coronary artery wall of patients
with
~ atherosclerosis versus those with other forms of cardiovascular disease. The
extremely
high rate of possible infection in patients with symptomatic atherosclerotic
disease
compared to the very low rate in patients with normal coronary arteries or
coronary artery
disease from chronic transplant rejection provides evidence for a direct link
between the
atherosclerotic process and Chlamydia infection. Because a history of
chlamydial
infection is so prevalent in the population, the issue of causality remains.
On a
physiologic and pathologic level, abnormal interactions among endothelial
cells, platelets,
macrophages and lymphocytes may lead to a cascade of events resulting in acute
endothelial damage, thrombosis and repair, chronically leading to the
development of
atheroma in blood vessels.
C. pneumoniae is related to other Chlamydia species, but the level of
sequence similarity is relatively low. Very little is known about the biology
of this
organism, although it appears to be an important human pathogen. Allelic
diversity and
structural relationships between specific genes of Chlamydial species is
described in
Kaltenboeck et al. (1993) J Bacteriol 175(2):487-502; Gaydos et al. (1992)
Infect Immun
60{12):5319-5323; Everett et al. (1997) Int J Syst Bacteriol 47(2):461-473;
and
Pudjiatmoko et al. (1997) Int J Syst Bacteriol 47(2):425-431.
A number of studies have been published describing methods for detection
of C. pneumoniae, and for distinguishing between Chlamydial species. Such
methods
include PCR detection (Rasmussen et al. (1992) ~Vlol Cell Probes 6(5):389-394;
Holland
et al. (1990) J Infect Dis 162(4):984-987); a simplified polymerase chain
reaction-enzyme
immunoassay (Wilson et al. (1996) J Appl Bacteriol 80(4):431-438); sequence
determination and restriction endonuclease cleavage (Herrmann et al. (1996) J
lin
Micro io134(8):1897-1902).
Antigenic and molecular analyses of different C. pneumoniae strains is
described in 3antos et al. (1997) J Clin Microbiol 35(3):620-623. Some genes
of C.
pneumoniae have been isolated and sequenced. These include the Gro E operon
(Kikuta
et al. { i 99I ) Infect Immun 59( 12):4665-4669); the major outer membrane
protein Perez et
2


CA 02350775 2001-05-11
WO 00/2994 PCT/US99/26923
al. ( 1991 ) Infect Immun 59(6):2195-2199; the DnaK protein homolog (Kornak et
al.
(1991) Infect Immun 59(2):721-725); as well as a number of ribosomal and other
genes.
SUMMARY OF THE IIWENTION
This invention provides the genomic sequence of Chlamydia pneumoniae.
The sequence information is useful for a variety of diagnostic and analytical
methods.
The genomic sequence may be embodied in a variety of media, including computer
readable forms, or as a nucleic acid comprising a selected fragment of the
sequence.
Such fragments generally consist of an open reading frame, transcriptional or
translational
control elements, or fragments derived therefrom. Proteins encoded by the open
reading
frames are useful for diagnostic purposes, as well as for their enzymatic or
structural
activity.
DEFIhIITIONS
The term "amino acid" refers to naturally occurring and synthetic amino
acids, as well as amino acid analogs and amino acid mimetics that function in
a manner
similar to the naturally occurring amino acids. Naturally occurring amino
acids are those
encoded by the genetic code, as well as those amino acids that are later
modified, e.g.,
hydroxyproline, 'y-carboxyglutamate, and 0-phosphoserine. Amino acid analogs
refers to
compounds that have the same basic chemical structure as a naturally occurring
amino
acid, i.e., an a carbon that is bound to a hydrogen, a carboxyl group, an
amino group, and
an R group., e.g., homoserine, norleucine, methionine sulfoxide, methionine
methyl
sulfonium Such analogs have modified R groups (e.g., norleucine) or modified
peptide
backbones, but retain the same basic chemical structure as a naturally
occurring amino
acid. Amino acid mimetics refers to chemical compounds that have a structure
that is
different from the general chemical structure of an amino acid, but that
functions in a
manner similar to a naturally occurring amino acid.
Amino acids may be referred to herein by either their commonly known
three letter symbols or by the one-letter symbols recommended by the ILTPAC-
ILJB
Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to
by
their commonly accepted single-letter codes.
3


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
"Amplification" primers are oligonucleotides comprising either natural or
analoQUe nucleotides that can serve as the basis for the amplification of a
select nucleic
acid sequence. They include, e.g., polymerase chain reaction primers and
Iigase chain
reaction oligonucleotides.
"Antibody" refers to an immunoglobulin molecule able to bind to a
specific epitope on an antigen. Antibodies can be a polyclonal mixture or
monoclonal.
Antibodies can be intact immunoglobulins derived from natural sources or from
recombinant sources and can be immunoreactive portions of intact
immunoglobulins.
Antibodies may exist in a variety of forms including, for example, Fv, Fab,
and F(ab)Z, as
well as in single chains. Single-chain antibodies, in which genes for a heavy
chain and a
light chain are combined into a single coding sequence, may also be used.
An "antigen" is a molecule that is recognized and bound by an antibody,
e.g., peptides, carbohydrates, organic molecules, or more complex molecules
such as
glycolipids and glycoproteins. The part of the antigen that is the target of
antibody
binding is an antigenic determinant and a small functional group that
corresponds to a
single antigenic determinant is called a hapten.
"Biological sample" refers to any sample obtained from a living or dead
organism. Examples of biological samples include biological fluids and tissue
specimens.
Such biological samples can be prepared for analysis of the presence of C.
pneumoniae
nucleic acids, proteins, or antibodies specifically reactive with the
proteins.
The term "C. pneumoniae gene" shall be intended to mean the open
reading frame encoding specific C. pneumoniae polypeptides, as well as
adjacent 5' and
3' non-coding nucleotide sequences involved in the regulation of expression,
up to about
2 kb beyond the coding region, but possibly further in either direction. The
gene may be
introduced into an appropriate vector for extrachromosomal maintenance or for
integration into a host genome.
"Conservatively modified variants" applies to both amino acid and nucleic
acid sequences. With respect to particular nucleic acid sequences,
conservatively
modified variants refers to those nucleic acids which encode identical or
essentially
identical amino acid sequences, or where the nucleic acid does not encode an
amino acid
sequence, to essentially identical sequences. Specifically, degenerate codon
substitutions
may be achieved by generating sequences in which the third position of one or
more
selected (or all) codons is substituted with mixed-base and/or deoxyinosine
residues
4


CA 02350775 2001-05-11
WO OO1Z7994 PCT/US99/26923
(Batter et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol.
Chem. 260:2605-
2608 (1985); Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)). Because of
the
degeneracy of the genetic code, a large number of functionally identical
nucleic acids
encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all
encode the amino acid alanine. Thus, at every position where an alanine is
specified by a
codon, the codon can be altered to any of the corresponding codons described
without
altering the encoded polypeptide. Such nucleic acid variations are "silent
variations,"
which are one species of conservatively modified variations. Every nucleic
acid sequence
herein which encodes a polypeptide also describes every possible silent
variation of the
nucleic acid. One of skill will recognize that each codon in a nucleic acid
(except AUG,
which is ordinarily the «nly codon for methionine, and TGG, which is
ordinarily the only
codon for tryptophan) can be modified to yield a functionally identical
molecule.
Accordingly, each silen: variation of a nucleic acid which encodes a
polypeptide is
implicit in each describ :d sequence.
As to amino acid sequences, one of skill will recognize that individual
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide,
or protein
sequence which alters, adds or deletes a single amino acid or a small
percentage of amino
acids in the encoded sequence is a "conservatively modified variant" where the
alteration
results in the substitution of an amino acid with a chemically similar amino
acid.
Conservative substitution tables providing functionally similar amino acids
are well
known in the art. Such conservatively modified variants are in addition to and
do not
exclude polymorphic variants, interspecies homologs, and alleles of the
invention.
The following groups each contain amino acids that are conservative
substitutions for one another:
1 ) Alanine (A), Glycine (G);
2) Serine (S), Threonine (T);
3) Aspartic acid (D), Glutamic acid (E);
4) Asparagine (N), Glutamine (Q);
5) Cysteine (C), Methionine (M);
6) Arginine (R), Lysine (K), Histidine (H);
7) Isoleucine (I), Leucine (L), Valine (V); and
8) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).
see, e.g., Creighton, Proteins (1984)).
5


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
The terms "identical" or percent "identity," in the context of two or more
nucleic acids or polypeptide sequences, refer to two or more sequences or
subsequences
that are the same or have a specified percentage of amino acid residues or
nucleotides that
are the same, when compared and aligned for maximum correspondence over a
comparison window, as measured using one of the following sequence comparison
algorithms or by manual alignment and visual inspection. This definition also
refers to
the complement of a test sequence, which has a designated percent sequence or
subsequence complementarity when the test sequence has a designated or
substantial
identity to a reference sequence. For example, a designated amino acid percent
identity
of 95% refers to sequences or subsequences that have at least about 95% amino
acid
identity when aligned for maximum correspondence over a comparison window as
measured using one of the following sequence comparison algorithms or by
manual
alignment and visual inspection. Such sequences would then be said to have
substantial
identity, or to be substantially identical to each other. Preferably,
sequences have at least
about 70% identity, more preferably 80% identity, more preferably 90-95%
identity and
above. Preferably, the percent identity exists over a region of the sequence
that is at least
about 25 amino acids in length, more preferably over a region that is 50-100
amino acids
in length.
When percentage of sequence identity is used in reference to proteins or
peptides, it is recognized that residue positions that are not identical often
differ by
conservative amino acid substitutions, where amino acids residues are
substituted for
other amino acid residues with similar chemical properties (e.g., charge or
hydrophobicity) and therefore do not change the functional properties of the
molecule.
Where sequences differ in conservative substitutions, the percent sequence
identity may
be adjusted upwards to correct for the conservative nature of the
substitution. Means for
making this adjustment are well known to those of skill in the art. Typically
this involves
scoring a conservative substitution as a partial rather than a full mismatch,
thereby
increasing the percentage sequence identity. Thus, for example, where an
identical amino
acid is given a score of l and a non-conservative substitution is given a
score of zero, a
conservative substitution is given a score between zero and 1. The scoring of
conservative substitutions is calculated according to, e.g., the algorithm of
Meyers &
Miller, Computer Applic. Biol. Sci. 4:11-17 {1988) e.g., as implemented in the
program
PCIGENE (Intelligenetics, Mountain View, California, USA)..
6


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/Z6923
For sequence comparison, typically one sequence acts as a reference
sequence, to which test sequences are compared. When using a sequence
comparison
algorithm, test and reference sequences are entered into a computer,
subsequence
coordinates are designated, if necessary, and sequence algorithm program
parameters are
designated. Default program parameters can be used, or alternative parameters
can be
designated. The sequence comparison algorithm then calculates the percent
sequence
identity for the test sequences) relative to the reference sequence, based on
the
designated or default program parameters.
A comparison window includes reference to a segment of any one of the
number of contiguous positions selected from the group consisting of from 25
to 600,
usually about 50 to about 200, more usually about 100 to about 150 in which a
sequence
may be compared to a reference sequence of the same number of contiguous
positions
after the two sequences are optimally aligned. Methods of alignment of
sequences for
comparison are well-known in the art. Optimal alignment of sequences for
comparison
can be conducted, e.g., by the local homology algorithm of Smith & Watetman,
Adv.
Appl. Math. 2:482 ( 1981 ), by the homology alignment algorithm of Needleman &
Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method
ofPearson &
Lipman, Proc. Nat'l. Acad. Sci. USA 85:2444 (1988), by computerized
implementations
of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics
Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or
by
manual alignment and visual inspection (see, e.g., Ausubel et al., supra).
One example of a useful algorithm is PILEUP. PILEUP creates a multiple
sequence alignment from a group of related sequences using progressive,
patrwise
alignments to show relationship and percent sequence identity. It also plots a
tree or
dendogram showing the clustering relationships used to create the alignment.
PILEUP
uses a simplification of the progressive alignment method of Feng & Doolittle,
J. Mol.
Evol. 35:351-360 (1987). The method used is similar to the method described by
Higgins
& Sharp, CABIOS 5:151-153 (1989). The program can align up to 300 sequences;
each
of a maximum length of 5,000 nucleotides or amino acids. The multiple
alignment
procedure begins with the pairwise alignment of the two most similar
sequences,
producing a cluster of two aligned sequences. This cluster is then aligned to
the next
most related sequence or cluster of aligned sequences. Two clusters of
sequences are
aligned by a simple extension of the pairwise alignment of two individual
sequences. The
7


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
final alignment is achieved by a series of progressive, pairwise alignments.
The program
is run by designating specific sequences and their amino acid or nucleotide
coordinates
for regions of sequence comparison and by designating the program parameters.
Using
PILEUP, a reference sequence is compared to other test sequences to determine
the
percent sequence identity relationship using the following parameters: default
gap weight
(3.00), default gap length weight (0.10), and weighted end gaps. PILEUP can be
obtained
from the GCG sequence analysis software package, e.g, version 7.0 (Devereaux
et al.,
Nuc. Acids Res. 12:387-395 (1984).
Another example of algorithm that is suitable for determining percent
sequence identity (i.e., substantial similarity or identity) is the BLAST
algorithm, which
is described in Altschul et al., J. Mol. Biol. 215:403-410 (1990). Software
for performing
BLAST analyses is publicly available through the National Center for
Biotechnology
Information (http://www.ncbi.nlm.nih.govn. This algorithm involves first
identifying
high scoring sequence pairs (HSPs) by identifying short words of length W in
the query
sequence, which either match or satisfy some positive-valued threshold score T
when
aligned with a word of the same length in a database sequence. T is referred
to as the
neighborhood word score threshold (Altschul et al, supra). These initial
neighborhood
word hits act as seeds for initiating searches to find longer HSPs containing
them. The
word hits are then extended in both directions along each sequence for as far
as the
cumulative alignment score can be increased. Cumulative scores are calculated
using, for
nucleotide sequences, the parameters M (reward score for a pair of matching
residues;
always > 0) and N (penalty score for mismatching residues, always < 0). For
amino acid
sequences, a scoring matrix is used to calculate the cumulative score.
Extension of the
word hits in each direction are halted when: the cumulative alignment score
falls off by
the quantity X from its maximum achieved value; the cumulative score goes to
zero or
below, due to the accumulation of one or more negative-scoring residue
alignments; or
the end of either sequence is reached. The BLAST algorithm parameters W, T,
and X
determine the sensitivity and speed of the alignment. The BLASTN program (for
nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation
(E) of 10,
M=5, N=4, and a comparison of both strands. For amino acid sequences, the
BLASTP
program uses as default parameters a wordlength (W) of 3, an expectation (E)
of 10, and
the BLOSLTM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci.
USA
89:10915 ( 1989)).


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
The BLAST algorithm also performs a statistical analysis of the similarity
between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci.
USA
90:5873-5787 (1993)). One measure of similarity provided by the BLAST
algorithm is
the smallest sum probability (P(N)), which provides an indication of the
probability by
which a match between two nucleotide or amino acid sequences would occur by
chance.
For example, a nucleic acid is considered similar to a reference sequence if
the smallest
sum probability in a comparison of the test nucleic acid to the reference
nucleic acid is
less than about 0.1, more preferably less than about 0.01, and most preferably
less than
about 0.001.
L O An indication that two nucleic acid sequences or polypeptides are
substantially identical is that the polypeptide encoded by the first nucleic
acid is
immunologically cross ,-eactive with the antibodies raised against the
polypeptide
encoded by the second nucleic acid, as described below. Thus, a polypeptide is
typically
substantially identical to a second polypeptide, for example, where the two
peptides differ
I S only by conservative suostitutions. Another indication that two nucleic
acid sequences
are substantially identical is that the two molecules or their complements
hybridize to
each other under stringent conditions, as described below.
Another indication that polynucleotide sequences are substantially
identical is if two molecules hybridize to each other under stringent
conditions. Stringent
20 conditions are sequence dependent and will be different in different
circumstances.
Generally, stringent conditions are selected to be about 5°C lower than
the thermal
melting point (Tm) for the specific sequence at a defined ionic strength and
pH. The Tm
is the temperature (under defined ionic strength and pH) at which 50% of the
target
sequence hybridizes to a perfectly matched probe. Typically stringent
conditions for a
25 Southern blot protocol involve hybridizing in a buffer comprising Sx SSC,
1% SDS at
65°C or hybridizing in a buffer containing Sx SSC and 1% SDS at
42°C and washing at
65°C with a 0.2x SSC, 0.1% SDS wash.
A "label" is a composition detectable by spectroscopic, photochemical,
biochemical, immunochemical, or chemical means. For example, useful labels
include
30 3zP, Iluorescent dyes, electron-dense reagents, enzymes (e.g., as commonly
used in an
ELISA), biotin, dioxigenin, or haptens and proteins for which antisera or
monoclonal
antibodies are available.
9


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/269Z3
The term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides
and polymers thereof in either single- or double-stranded form. The term
encompasses
nucleic acids containing known nucleotide analogs or modif ed backbone
residues or
linkages, which are synthetic, naturally occurring, and non-naturally
occurring, which
have similar binding properties as the reference nucleic acid, and which are
metabolized
in a manner similar to the reference nucleotides. Examples of such analogs
include,
without limitation, phosphorothioates, phosphoramidates, methyl phosphonates,
chiral-methyl phosphonates, 2-O-methyl ribonucleotides, peptide-nucleic acids
(PNAs).
Unless otherwise indicated, a particular nucleic acid sequence also
implicitly encompasses conservatively modified variants thereof (e.g.,
degenerate codon
substitutions) and complementary sequences, as well as the sequence explicitly
indicated.
The term nucleic acid is used interchangeably with gene, cDNA, mRNA,
oligonucleotide,
and polynucleotide.
As used herein a "nucleic acid probe or oligonucleotide" is defined as a
nucleic acid capable of binding to a target nucleic acid of complementary
sequence
through one or more types of chemical bonds, usually through complementary
base
pairing, usually through hydrogen bond formation. As used herein, a probe may
include
natural (i.e., A, G, C, or T) or modified bases (7-deazaguanosine, inosine,
etc.). In
addition, the bases in a probe may be joined by a linkage other than a
phosphodiester
bond, so long as it does not interfere with hybridization. Thus, for example,
probes may
be peptide nucleic acids in which the constituent bases are joined by peptide
bonds rather
than phosphodiester linkages. It will be understood by one of skill in the art
that probes
may bind target sequences lacking complete complementarity with the probe
sequence
depending upon the stringency of the hybridization conditions. The probes are
preferably
directly labeled as with isotopes, chromophores, lumiphores, chromogens, or
indirectly
labeled such as with biotin to which a streptavidin complex may later bind. By
assaying
for the presence or absence of the probe, one can detect the presence or
absence of the
select sequence or subsequence.
A labeled nucleic acid probe or oligonucleotide is one that is bound, either
covalently, through a linker, or through ionic, van der Waals or hydrogen
bonds to a label
such that the presence of the probe may be detected by detecting the presence
of the label
bound to the probe.


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
"Pharmaceutically acceptable" means a material that is not biologically or
otherwise undesirable, i.e., the material can be administered to an individual
along with a
Chlamydia antigen without causing any undesirable biological effects or
interacting in a
deleterious manner with any of the other components of the pharmaceutical
composition.
The terms "polypeptide," "peptide" and "protein" are used interchangeably
herein to refer to a polymer of amino acid residues. The terms apply to amino
acid
polymers in which one or more amino acid residue is an analog or mimetic of a
corresponding naturally occurring amino acid, as well as to naturally
occurring amino
acid polymers.
The phrase "specifically or selectively hybridizing to," refers to
hybridization between a probe and a target sequence in which the probe binds
substantially only to the target sequence, forming a hybridization complex,
when the
target is in a heterogeneous mixture of polynucleotides and other compounds.
Such
hybridization is determinative of the presence of the target sequence.
Although the probe
may bind other unrelated sequences, at least 90%, preferably 95% or more of
the
hybridization complexes formed are with the target sequence.
The term "recombinant" when used with reference to a cell, or nucleic
acid, or vector, indicates that the cell, or nucleic acid, or vector, has been
modified by the
introduction of a heterologous nucleic acid or the alteration of a native
nucleic acid, or
that the cell is derived from a cell so modified. Thus, for example,
recombinant cells
express genes that are not found within the native (non-recombinant) form of
the cell or
express native genes that are otherwise abnormally expressed, under expressed
or not
expressed at all.
The phrase "specifically immunoreactive with", when referring to a protein
or peptide, refers to a binding reaction between the protein and an antibody
which is
determinative of the presence of the protein in the presence of a
heterogeneous population
of proteins and other compounds. Thus, under designated immunoassay
conditions, the
specified antibodies bind to a particular protein and do not bind in a
significant amount to
other proteins present in the sample. Specific binding to an antibody under
such
conditions may require an antibody that is selected for its specificity for a
particular
protein. A variety of immunoassay formats may be used to select antibodies
specifically
immunoreactive with a particular protein and are described in detail below.
11


CA 02350775 2001-05-11
WO X00/27994 PCT/US99I26923
The phrase "substantially pure" or "isolated" when referring to a
Chlamydia peptide or protein, means a chemical composition which is free of
other
subcellular components of the Chlamydia organism. Typically, a monomeric
protein is
substantially pure when at least about 85% or more of a sample exhibits a
single
polypeptide backbone. Minor variants or chemical modifications may typically
share the
same polypeptide sequence. Depending on the purification procedure, purities
of 85%,
and preferably over 95% pure are possible. Protein purity or homogeneity may
be
indicated by a number of means well known in the art, such as polyacrylamide
gel
electrophoresis of a protein sample, followed by visualizing a single
polypeptide band on
a polyacrylamide gel upon silver staining. For certain purposes high
resolution will be
needed and HPLC or a similar means for purification utilized.
DETAILED DESCRIPTION
The present invention provides the nucleotide sequence of the C.
pneumoniae genome SEQ ID NO: 1 or a representative fragment thereof, in a form
which
can be readily used, analyzed, and interpreted by a skilled artisan. As used
herein, a
"representative fragment" of the nucleotide sequence depicted in SEQ ID NO: 1
refers to
any portion which is not presently represented within a publicly available
database.
Preferred representative fragments of the present invention are open reading
frames,
expression modulating fragments, uptake modulating fragments, and fragments
which can
be used to diagnose the presence of G pneumoniae in sample. Using the
information
provided in the present application, together with routine cloning and
sequencing
methods, one of ordinary skill in the art will be able to clone and sequence
all
"representative fragments" of interest including open reading frames (ORFs)
encoding a
large variety of C. pneumoniae proteins. A non-limiting identification of such
preferred
representative fragments is provided in Tables 2 and 3.
Diasnostic use of C pneumoniae nucleic acids
Hybridization-based assays
Using the nucleic acids disclosed here, one of skill can design nucleic acid
hybridization-based assays for the detection of C. pneumoniae. Any of a number
of well
known techniques for the specific detection of target nucleic acids can be
used.
Exemplary hybridization-based assays include, but are not limited to,
traditional "direct
12


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
probe" methods such as Southern Blots, dot blots, in situ hybridization (e.g.,
FISH), PCR,
and the like. The methods can be used in a wide variety of formats including,
but not
limited to substrate- (e.g. membrane or glass) bound methods or array-based
approaches
as described below. As noted above, this invention also embraces methods for
detecting
the presence of Chlamydia DNA or RNA in biological samples. These sequences
can be
used to detect Chlamydia in biological samples from patients suspected of
being infected.
A variety of methods of specific DNA and RNA measurement using nucleic acid
hybridization techniques are known to those of skill in the art (see Sambrook
et al.,
supra).
In situ hybridization assays are well known (e.g., Angerer {1987) Meth.
Enrymol 152: 649). Generally, in situ hybridization comprises the following
major steps:
(1) fixation of tissue or l;~iological structure to analyzed; (2)
prehybridization treatment of
the biological structure t ~ increase accessibility of target DNA, and to
reduce nonspecific
binding; (3) hybridizatic n of the mixture of nucleic acids to the nucleic
acid in the
biological structure or tissue; (4) post-hybridization washes to remove
nucleic acid
fragments not bound in the hybridization and (5) detection of the hybridized
nucleic acid
fragments. The reagent used in each of these steps and the conditions for use
vary
depending on the particular application.
In a typical in situ hybridization assay, cells are fixed to a solid support,
typically a glass slide. If a nucleic acid is to be probed, the cells are
typically denatured
with heat or alkali. The cells are then contacted with a hybridization
solution at a
moderate temperature to permit annealing of labeled probes specific to the
nucleic acid
sequence encoding the protein. The targets (e.g., cells) are then typically
washed at a
predetermined stringency or at an increasing stringency until an appropriate
signal to
noise ratio is obtained.
The nucleic acids of this invention are particularly well suited to array-
based hybridization formats. Arrays are a multiplicity of different "probe" or
"target"
nucleic acids (or other compounds) attached to one or more surfaces (e.g.,
solid,
membrane, or gel). In a preferred embodiment, the multiplicity of nucleic
acids (or other
moieties) is attached to a single contiguous surface or to a multiplicity of
surfaces
juxtaposed to each other.
In an array format a large number of different hybridization reactions can
be run essentially "in parallel." This provides rapid, essentially
simultaneous, evaluation
13


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
of a number of hybridizations in a single "experiment". Methods of performing
hybridization reactions in array based formats are well known to those of
skill in the art
(see, e.g., Pastinen (1997) Genome Res. 7: 606-614; Jackson (1996) Nature
Biotechnology 14:1685; Chee (1995) Science 274: 610; WO 96/17958.
Arrays, particularly nucleic acid arrays can be produced according to a
wide variety of methods well known to those of skill in the art. For example,
in a simple
embodiment, "low density" arrays can simply be produced by spotting (e.g. by
hand using
a pipette) different nucleic acids at different locations on a solid support
(e.g. a glass
surface, a membrane, etc.).
This simple spotting, approach has been automated to produce high
density spotted arrays (see, e.g., U.S. Patent No: 5,807,522). This patent
describes the
use of an automated systems that taps a microcapillary against a surface to
deposit a small
volume of a biological sample. The process is repeated to generate high
density arrays.
Arrays can also be produced using oligonucleotide synthesis technology. Thus,
for
example, U.S. Patent No. 5,143,854 and PCT patent publication Nos. WO 90/15070
and
92/10092 teach the use of light-directed combinatorial synthesis of high
density
oligonucleotide arrays.
Many methods for immobilizing nucleic acids on a variety of solid
surfaces are known in the art. A wide variety of organic and inorganic
polymers, as well
as other materials, both natural and synthetic, can be employed as the
material for the
solid surface. Illustrative solid surfaces include, e.g., nitrocellulose,
nylon, glass, quartz,
diazotized membranes (paper or nylon), silicones, polyformaldehyde, cellulose,
and
cellulose acetate. In addition, plastics such as polyethylene, polypropylene,
polystyrene,
and the like can be used. Other materials which may be employed include paper,
ceramics, metals, metalloids, semiconductive materials, cermets or the like.
In addition,
substances that form gels can be used. Such materials include, e.g., proteins
(e.g.,
gelatins), lipopolysaccharides, silicates, agarose and polyacrylamides. Where
the solid
surface is porous, various pore sizes may be employed depending upon the
nature of the
system.
In preparing the surface, a plurality of different materials may be
employed, particularly as laminates, to obtain various properties. For
example, proteins
(e.g., bovine serum albumin) or mixtures of macromolecules (e.g., Denhardt's
solution)
can be employed to avoid non-specific binding, simplify covalent conjugation,
enhance
14


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
signal detection or the like. If covalent bonding between a compound and the
surface is
desired, the surface will usually be polyfunctional or be capable of being
polyfunctionalized. Functional groups which may be present on the surface and
used for
linking can include carboxylic acids, aidehydes, amino groups, cyano groups,
ethylenic
groups, hydroxyl groups, mercapto groups and the like. The manner of linking a
wide
variety of compounds to various surfaces is well known and is amply
illustrated in the
literature.
For example, methods for immobilizing nucleic acids by introduction of
various functional groups to the molecules is known (see, e.g., Bischoff
(1987) Anal.
Biochem., 164: 336-344; Kremsky {1987) Nucl. Acids Res. 15: 2891-2910).
Modified
nucleotides can be placed on the target using PCR primers containing the
modified
nucleotide, or by enzymatic end labeling with modified nucleotides. Use of
glass or
membrane supports (e.g., nitrocellulose, nylon, polypropylene) for the nucleic
acid arrays
of the invention is advantageous because of well developed technology
employing
manual and robotic methods of arraying targets at relatively high element
densities. Such
membranes are generally available and protocols and equipment for
hybridization to
membranes is well known.
Target elements of various sizes, ranging from 1 mm diameter down to 1
p,m can be used. Smaller target elements containing low amounts of
concentrated, fixed
probe DNA are used for high complexity comparative hybridizations since the
total
amount of sample available for binding to each target element will be limited.
Thus it is
advantageous to have small array target elements that contain a small amount
of
concentrated probe DNA so that the signal that is obtained is highly localized
and bright.
Such small array target elements are typically used in arrays with densities
greater than
104/cmz. Relatively simple approaches capable of quantitative fluorescent
imaging of 1
cmz areas have been described that permit acquisition of data from a large
number of
target elements in a single image (see, e.g., Wittrup (1994) Cytometry 16:206-
213).
If fluorescently labeled nucleic acid samples are used, arrays on solid
surface substrates with much lower fluorescence than membranes, such as glass,
quartz,
or small beads, can achieve much better sensitivity. Substrates such as glass
or fused
silica are advantageous in that they provide a very low fluorescence
substrate, and a
highly efficient hybridization environment. Covalent attachment of the target
nucleic
acids to glass or synthetic fused silica can be accomplished according to a
number of


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
known techniques (described above). Nucleic acids can be conveniently coupled
to glass
using commercially available reagents. For instance, materials for preparation
of
silanized glass with a number of functional groups are commercially available
or can be
prepared using standard techniques (see, e.g., Gait ( 1984) Oligonucleotide
Synthesis: A
~ Practical Approach, IRL Press, Wash., D.C.). Quartz cover slips, which have
at least 10-
fold lower autofluorescence than glass, can also be silanized.
Alternatively, probes can also be immobilized on commercially available
coated beads or other surfaces. For instance, biotin end-labeled nucleic acids
can be
bound to commercially available avidin-coated beads. Streptavidin or anti-
digoxigenin
antibody can also be attached to silanized glass slides by protein-mediated
coupling using
e.g., protein A following standard protocols (see, e.g., Smith (1992) Science
258: 1122-
1126). Biotin or digoxigenin end-labeled nucleic acids can be prepared
according to
standard techniques. Hybridization to nucleic acids attached to beads is
accomplished by
suspending them in the hybridization mix, and then depositing them on the
glass substrate
for analysis after washing. Alternatively, paramagnetic particles, such as
ferric oxide
particles, with or without avidin coating, can be used.
A variety of other nucleic acid hybridization formats are known to those
skilled in the art. For example, common formats include sandwich assays and
competition or displacement assays. Hybridization techniques are generally
described in
Hames and Higgins (1985) Nucleic Acid Hybridization, A Practical Approach, IRL
Press;
Gall and Pardue (1969) Proc. Natl. Acad. Sci. USA 63: 378-383; and John et al.
(1969)
Nature 223: 582-587.
Sandwich assays are commercially useful hybridization assays for
detecting or isolating nucleic acid sequences. Such assays utilize a "capture"
nucleic acid
covalently immobilized to a solid support and a labeled "signal" nucleic acid
in solution.
The sample will provide the target nucleic acid. The "capture" nucleic acid
and "signal"
nucleic acid probe hybridize with the target nucleic acid to form a "sandwich"
hybridization complex. To be most effective, the signal nucleic acid should
not hybridize
with the capture nucleic acid.
Detection of a hybridization complex may require the binding of a signal
generating complex to a duplex of target and probe polynucleotides or nucleic
acids.
Typically, such binding occurs through ligand and anti-ligand interactions as
between a
ligand-conjugated probe and an anti-ligand conjugated with a signal.
16


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
The sensitivity of the hybridization assays may be enhanced through use of
a nucleic acid amplification system that multiplies the target nucleic acid
being detected.
Examples of such systems include the polymerise chain reaction (PCR) system
and the
ligase chain reaction (LCR) system. Other methods recently described in the
art are the
nucleic acid sequence based amplification (NASBAO, Cangene, Mississauga,
Ontario)
and Q Beta Replicase systems.
Nucleic acid hybridization simply involves providing a denatured probe
and target nucleic acid under conditions where the probe and its complementary
target
can form stable hybrid duplexes through complementary base pairing. The
nucleic acids
that do not form hybrid duplexes are then washed away leaving the hybridized
nucleic
acids to be detected, tyl:ically through detection of an attached detectable
label. It is
generally recognized that nucleic acids are denatured by increasing the
temperature or
decreasing the salt concentration of the buffer containing the nucleic acids,
or in the
addition of chemical agents, or the raising of the pH. Under low stringency
conditions
(e.g., low temperature and/or high salt and/or high target concentration)
hybrid duplexes
{e.g., DNA:DNA, RNA:RNA, or RNA:DNA) will form even where the annealed
sequences are not perfectly complementary. Thus specificity of hybridization
is reduced
at lower stringency. Conversely, at higher stringency (e.g., higher
temperature or lower
salt) successful hybridization requires fewer mismatches.
One of skill in the art will appreciate that hybridization conditions may be
selected to provide any degree of stringency. In a preferred embodiment,
hybridization is
performed at low stringency to ensure hybridization and then subsequent washes
are
performed at higher stringency to eliminate mismatched hybrid duplexes.
Successive
washes may be performed at increasingly higher stringency (e.g., down to as
low as 0.25
X SSPE-T at 37°C to 70°C) until a desired level of hybridization
specificity is obtained.
Stringency can also be increased by addition of agents such as formamide.
Hybridization
specificity may be evaluated by comparison of hybridization to the test probes
with
hybridization to the various controls that can be present.
In general, there is a tradeoff between hybridization specificity
(stringency) and signal intensity. Thus, in a preferred embodiment, the wash
is performed
at the highest stringency that produces consistent results and that provides a
signal
intensity greater than approximately 10% of the background intensity. Thus, in
a
preferred embodiment, the hybridized array may be washed at successively
higher
17


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
stringency solutions and read between each wash. Analysis of the data sets
thus produced
will reveal a wash stringency above which the hybridization pattern is not
appreciably
altered and which provides adequate signal for the particular probes of
interest.
Methods of optimizing hybridization conditions are well known to those of
skill in the art (see, e.g., Tijssen (1993) Laboratory Techniques in
Biochemistry and
Molecular Biology, Vol. 24: Hybridization With Nucleic Acid Probes, Elsevier,
N.Y.).
_LabelinQ and detection of nucleic acids.
In a preferred embodiment, the hybridized nucleic acids are detected by
detecting one or more labels attached to the sample or probe nucleic acids.
The labels
may be incorporated by any of a number of means well known to those of skill
in the art.
Means of attaching labels to nucleic acids include, for example nick
translation or end-
labeling (e.g. with a labeled RNA) by kinasing of the nucleic acid and
subsequent
attachment (ligation) of a nucleic acid linker joining the sample nucleic acid
to a label
(e.g., a fluorophore). A wide variety of linkers for the attachment of labels
to nucleic
acids are also known. In addition, intercalating dyes and fluorescent
nucleotides can also
be used.
Detectable labels suitable for use in the present invention include any
composition detectable by spectroscopic, photochemical, biochemical,
immunochemical,
electrical, optical or chemical means. Useful labels in the present invention
include biotin
for staining with labeled streptavidin conjugate, magnetic beads (e.g.,
Dynabeads~),
fluorescent dyes (e.g., fluorescein, texas red, rhodamine, green fluorescent
protein, and
the like, see, e.g., Molecular Probes, Eugene, Oregon, USA), radiolabels
(e.g., 3H, lzsh
355,''~C, or 32P), enzymes (e.g., horse radish peroxidase, alkaline
phosphatase and others
commonly used in an ELISA), and colorimetric labels such as colloidal gold
(e.g., gold
particles in the 40 -80 nm diameter size range scatter green light with high
efficiency) or
colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.)
beads. Patents
teaching the use of such labels include U.S. Patent Nos. 3,817,837; 3,850,752;
3,939,350;
3,996,345; 4,277,437; 4,275,149; and 4,366,241.
A fluorescent label is preferred because it provides a very strong signal
with low background. It is also optically detectable at high resolution and
sensitivity
through a quick scanning procedure. The nucleic acid samples can all be
labeled with a
single label, e.g., a single fluorescent label. Alternatively, in another
embodiment,
different nucleic acid samples can be simultaneously hybridized where each
nucleic acid
18


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
sample has a different label. For instance, one target could have a green
fluorescent label
and a second target could have a red fluorescent label. The scanning step will
distinguish
cites of binding of the red label from those binding the green fluorescent
label. Each
nucleic acid sample (target nucleic acid) can be analyzed independently from
one another.
Suitable chromogens which can be employed include those molecules and
compounds which absorb light in a distinctive range of wavelengths so that a
color can be
observed or, alternatively, which emit light when irradiated with radiation of
a particular
'- wave length or wave length range, e.g., fluorescers.
Desirably, fluorescers should absorb light above about 300 nm, preferably
about 350 nm, and more preferably above about 400 nm, usually emitting at
wavelengths
greater than about 10 nm higher than the wavelength of the light absorbed. It
should be
noted that the absorption and emission characteristics of the bound dye can
differ from
the unbound dye. Therefore, when referring to the various wavelength ranges
and
characteristics of the dyes, it is intended to indicate the dyes as employed
and not the dye
which is unconjugated and characterized in an arbitrary solvent.
Fluorescers are generally preferred because by irradiating a fluorescer with
light, one can obtain a plurality of emissions. Thus, a single label can
provide for a
plurality of measurable events.
Detectable signal can also be provided by chemiluminescent and
bioluminescent sources. Chemiluminescent sources include a compound which
becomes
electronically excited by a chemical reaction and can then emit light which
serves as the
detectable signal or donates energy to a fluorescent acceptor. Alternatively,
luciferins can
be used in conjunction with luciferase or lucigenins to provide
bioluminescence.
Spin labels are provided by reporter molecules with an unpaired electron spin
which can
be detected by electron spin resonance (ESR) spectroscopy. Exemplary spin
labels
include organic free radicals, transitional metal complexes, particularly
vanadium,
copper, iron, and manganese, and the like. Exemplary spin labels include
nitroxide free
radicals.
The label may be added to the target (sample) nucleic acids) prior to, or
after the hybridization. So called "direct labels" are detectable labels that
are directly
attached to or incorporated into the target (sample) nucleic acid prior to
hybridization. In
contrast, so called "indirect labels" are joined to the hybrid duplex after
hybridization.
Often, the indirect label is attached to a binding moiety that has been
attached to the
19


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
target nucleic acid prior to the hybridization. Thus, for example. the target
nucleic acid
may be biotinylated before the hybridization. After hybridization, an avidin-
conjugated
fluorophore will bind the biotin bearing hybrid duplexes providing a label
that is easily
detected. For a detailed review of methods of labeling nucleic acids and
detecting labeled
hybridized nucleic acids see Laboratory Techniques in Biochemistry and
Molecular
Biology, Vol. 24: Hybridization With Nucleic Acid Probes, P. Tijssen, ed.
filsevier, N.Y.,
( 1993)).
Fluorescent labels are easily added during an in vitro transcription
reaction. Thus, for example, fluorescein labeled UTP and CTP can be
incorporated into
the RNA produced in an in vitro transcription.
The labels can be attached directly or through a linker moiety. In general,
the site of label or linker-label attachment is not limited to any specific
position. For
example, a label may be attached to a nucleoside, nucleotide, or analogue
thereof at any
position that does not interfere with detection or hybridization as desired.
For example,
certain Label-ON Reagents from Clontech (Palo Alto, CA) provide for labeling
interspersed throughout the phosphate backbone of an oligonucleotide and for
terminal
labeling at the 3' and 5' ends. As shown for example herein, labels can be
attached at
positions on the ribose ring or the ribose can be modified and even eliminated
as desired.
The base moieties of useful labeling reagents can include those that are
naturally
occurring or modified in a manner that does not interfere with the purpose to
which they
are put. Modified bases include but are not limited to 7-deaza A and G, 7-
deaza-8-aza A
and G, and other heterocyclic moieties.
It will be recognized that fluorescent labels are not to be limited to single
species organic molecules, but include inorganic molecules, mufti-molecular
mixtures of
organic and/or inorganic molecules, crystals, heteropolymers, and the like.
Thus, for
example, CdSe-CdS core-shell nanocrystals enclosed in a silica shell can be
easily
derivatized for coupling to a biological molecule (Bruchez et al. (1998)
Science, 281:
2013-2016). Similarly, highly fluorescent quantum dots (zinc sulfide-capped
cadmium
selenide) have been covalently coupled to biomolecules for use in
ultrasensitive
biological detection (Warren and Nie (1998) Science, 281: 2016-2018).
AmQ,ification-based assays.
In another embodiment, amplification-based assays can be used to detect
nucleic acids. In such amplification-based assays, the nucleic acid sequences
act as a


CA 02350775 2001-05-11
WO 00/Z7994 PCT/US99/26923
template in an amplification reaction (e.g. Polymerase Chain Reaction (PCR).
Detailed
protocols for quantitative PCR are provided in Innis et al. ( 1990) PCR
Protocols, A Guide
to Methods and Applications, Academic Press, Inc. N.Y.).
Other suitable amplification methods include, but are not limited to ligase
chain reaction (LCR) (see Wu and Wallace (1989) Genomics 4: 560, Landegren et
al.
(1988) Science 241: 1077, and Barringer et al. (1990) Gene 89: 117,
transcription
amplification (Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86: 1173), and
self
sustained sequence replication (Guatelli et al. ( 1990) Proc. Nat. Acad. Sci.
USA 87:
1874).
Detection;. of C. pneumoniae gene expression
The nucl:;ic acids of the invention can also be used to G pneumoniae
detect gene transcripts. Methods of detecting and/or quantifying gene
transcripts using
nucleic acid hybridization techniques are known to those of skill in the art
(see Sambrook
et al. supra). For example , a Northern transfer may be used for the detection
of the
desired mRNA directly. In brief, the mRNA is isolated from a given cell sample
using,
for example, an acid guanidinium-phenol-chloroform extraction method. The mRNA
is
then electrophoresed to separate the mRNA species and the mRNA is transferred
from the
gel to a nitrocellulose membrane. As with the Southern blots, labeled probes
are used to
identify and/or quantify the target mRNA.
In another preferred embodiment, the gene transcript can be measured
using amplification (e.g. PCR) based methods as described above for directly
assessing
copy number of the target sequences.
Expression of C. Dneumoniae proteins
The nucleic acids disclosed here can be used for recombinant expression
of the proteins. In these methods, the nucleic acids encoding the proteins of
interest are
introduced into suitable host cells, followed by induction of the cells to
produce large
amounts of the protein. The invention relies on routine techniques in the
field of
recombinant genetics, well known to those of ordinary skill in the art. A
basic text
disclosing the general methods of use in this invention is Sambrook et al.,
Molecular
Cloning, A Laboratory Manual (2nd ed. 1989).
Standard transfection methods are used to produce prokaryotic,
mammalian, yeast or insect cell lines which express large quantities of the
desired
21


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
polypeptide, which is then purified using standard techniques (see, e.g.,
Colley et al., J.
Biol. Chem. 264:17619-17622, 1989; Guide to Protein PuriJZCation, supra).
The nucleotide sequences used to transfect the host cells can be modified
to yield Chlamydia polypeptides with a variety of desired properties. For
example, the
polypeptides can vary from the naturally-occurring sequence at the primary
structure
level by amino acid, insertions, substitutions, deletions, and the like. These
modifications
can be used in a number of combinations to produce the final modified protein
chain.
The amino acid sequence variants can be prepared with various objectives
in mind, including facilitating purification and preparation of the
recombinant
polypeptide. The modified polypeptides are also useful for modifying plasma
half life,
improving therapeutic efficacy, and lessening the severity or occurrence of
side effects
during therapeutic use. The amino acid sequence variants are usually
predetermined
variants not found in nature but exhibit the same immunogenic activity as
naturally
occurring protein. In general, modifications of the sequences encoding the
polypeptides
may be readily accomplished by a variety of well-known techniques, such as
site-directed
mutagenesis (see Gillman & Smith, Gene 8:81-97 (1979); Roberts et al., Nature
328:731-
734 (1987)). One of ordinary skill will appreciate that the effect of many
mutations is
difficult to predict. Thus, most modifications are evaluated by routine
screening in a
suitable assay for the desired characteristic. For instance, the effect of
various
modifications on the ability of the polypeptide to elicit a protective immune
response can
be easily determined using in vitro assays. For instance, the polypeptides can
be tested
for their ability to induce lymphoproliferation, T cell cytotoxicity, or
cytokine production
using standard techniques.
The particular procedure used to introduce the genetic material into the
host cell for expression of the polypeptide is not particularly critical. Any
of the well
known procedures for introducing foreign nucleotide sequences into host cells
may be
used. These include the use of calcium phosphate transfection, spheroplasts,
electroporation, liposomes, microinjection, plasmid vectors, viral vectors and
any of the
other well known methods for introducing cloned genomic DNA, cDNA, synthetic
DNA
or other foreign genetic material into a host cell (see Sambrook et al.,
supra). It is only
necessary that the particular procedure utilized be capable of successfully
introducing at
least one gene into the host cell which is capable of expressing the gene.
22


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
Any of a number of well known cells and cell lines can be used to express
the polypeptides of the invention. For instance, prokaryotic cells such as E.
toll can be
used. Eukaryotic cells include, yeast, Chinese hamster ovary (CHO) cells, COS
cells, and
insect cells.
The particular vector used to transport the genetic information into the cell
is also not particularly critical. Any of the conventional vectors used for
expression of
recombinant proteins in prokaryotic and eukaryotic cells may be used.
Expression
- vectors for mammalian cells typically contain regulatory elements from
eukaryotic
viruses.
The expression vector typically contains a transcription unit or expression
cassette that contains all the elements required for the expression of the
polypeptide DNA
in the host cells. A typical expression cassette contains a promoter operably
linked to the
DNA sequence encoding a polypeptide and signals required for efficient
polyadenylation
of the transcript. The term "operably linked" as used herein refers to linkage
of a
promoter upstream from a DNA sequence such that the promoter mediates
transcription
of the DNA sequence. The promoter is preferably positioned about the same
distance
from the heterologous transcription start site as it is from the transcription
start site in its
natural setting. As is known in the art, however, some variation in this
distance can be
accommodated without loss of promoter function.
Following the growth of the recombinant cells and expression of the
polypeptide, the culture medium is harvested for purification of the secreted
protein. The
media are typically clarified by centrifugation or filtration to remove cells
and cell debris
and the proteins are concentrated by adsorption to any suitable resin or by
use of
ammonium sulfate fractionation, polyethylene glycol precipitation, or by
ultrafiltration.
Other routine means known in the art may be equally suitable. Further
purification of the
polypeptide can be accomplished by standard techniques, for example, affinity
chromatography, ion exchange chromatography, sizing chromatography, HkS6
tagging and
Ni-agarose chromatography (as described in Dobeli et al., Mol. and Biochem.
Parasit.
41:259-268 ( 1990)), or other protein purification techniques to obtain
homogeneity. The
purified proteins are then used to produce pharmaceutical compositions, as
described
below.
An alternative method of preparing recombinant polypeptides useful as
vaccines involves the use of recombinant viruses (e.g., vaccinia). Vaccinia
virus is grown
23


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
in suitable cultured mammalian cells such as the HeLa S3 spinner cells, as
described by
Mackett et al., in DNA cloning Vol. IL~ A practical approach, pp. 191-211
(Glover, ed.).
Antibod~Production
The proteins of the present invention can be used to produce antibodies
specifically reactive with C pneumoniae antigens. If isolated proteins are
used, they may
be recombinantly produced or isolated from Chlamydia cultures. Synthetic
peptides
made using the protein sequences may also be used.
Methods of production of polyclonal antibodies are known to those of skill
in the art. In brief, an immunogen, preferably a purified protein, is mixed
with an
adjuvant and animals are immunized. When appropriately high titers of antibody
to the
immunogen are obtained, blood is collected from the animal and antisera is
prepared.
Further fractionation of the antisera to enrich for antibodies reactive to
Chlamydia
proteins can be done if desired (see Harlow & Lane, Antibodies: A Laboratory
Manual
( 1988)).
Polyclonal antisera are used to identify and characterize Chlamydia in the
tissues of patients using, for instance, in situ techniques and
immunoperoxidase test
procedures described in Anderson et al. JA VMA 198:241 ( 1991 ) and Barr et
al. Vet.
Pathol. 28:110-116 (1991).
Monoclonal antibodies may be obtained by various techniques familiar to
those skilled in the art. Briefly, spleen cells from an animal immunized with
a desired
antigen are immortalized, commonly by fizsion with a myeloma cell (see Kohler
&
Milstein, Eur. J. Immunol. 6:511-519 (1976)). Alternative methods of
immortalization
include transformation with Epstein Barr Virus, oncogenes, or retroviruses, or
other
methods well known in the art. Colonies arising from single immortalized cells
are
screened for production of antibodies of the desired specificity and affinity
for the
antigen, and yield of the monoclonal antibodies produced by such cells may be
enhanced
by various techniques, including injection into the peritoneal cavity of a
vertebrate host.
Monoclonal antibodies produced in such a manner are used, for instance,
in ELISA diagnostic tests, immunoperoxidase tests, immunohistochemical tests,
for the in
vitro evaluation of spirochete invasion, to select candidate antigens for
vaccine
development, protein isolation, and for screening genomic and cDNA libraries
to select
appropriate gene sequences.
24


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
Immunodiagonostic detection of C. pneumoniae infections
The present invention also provides methods for detecting the presence or
absence of C. pneumoniae, or antibodies reactive with it, in a biological
sample. For
instance, antibodies specifically reactive with Chlamydia can be detected
using either
Chlamydia proteins or the isolates described here. The proteins and isolates
can also be
used to raise specific antibodies (either monoclonal or polyclonal) to detect
the antigen in
a sample. In addition, the nucleic acids disclosed and claimed here can be
used to detect
Chlamydia-specific sequences using standard hybridization techniques.
For a review of immunological and immunoassay procedures in general,
see Basic and Clinical.rmmunology (Stites & Terr ed., 7th ed. 1991)). The
immunoassays
of the present invention can be perfonmed in any of several configurations,
which are
reviewed extensively in Enzyme Immunoassay (Maggio, ed., 1980); Tijssen,
Laboratory
Techniques in Biochem.stry and Molecular Biology ( 1985)). For instance, the
proteins
and antibodies disclose 1 here are conveniently used in ELISA, immunobiot
analysis and
agglutination assays.
In brief, immunoassays to measure anti-Chlamydia antibodies or antigens
can be either competitive or noncompetitive binding assays. In competitive
binding
assays, the sample analyte (e.g., anti-Chlamydia antibodies) competes with a
labeled
analyte (e.g., anti-Chlamydia monoclonal antibody) for specific binding sites
on a capture
agent (e.g., isolated Chlamydia protein) bound to a solid surface. The
concentration of
labeled analyte bound to the capture agent is inversely proportional to the
amount of free
analyte present in the sample.
Noncompetitive assays are typically sandwich assays, in which the sample
analyze is bound between two analyte-specific binding reagents. One of the
binding
agents is used as a capture agent and is bound to a solid surface. The second
binding
agent is labelled and is used to measure or detect the resultant complex by
visual or
W strument means.
A number of combinations of capture agent and labelled binding agent can
be used. For instance, an isolated Chlamydia protein or culture can be used as
the
capture agent and labelled anti-human antibodies specific for the constant
region of
human antibodies can be used as the labelled binding agent. Goat, sheep and
other non-
l.uman antibodies specific for human immunoglobulin constant regions (e.g., y
or p.) are


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
well known in the art. Alternatively, the anti-human antibodies can be the
capture agent
and the antigen can be labelled.
Various components of the assay, including the antigen, anti-Chlamydia
antibody, or anti-human antibody, may be bound to a solid surface. Many
methods for
immobilizing biomolecules to a variety of solid surfaces are known in the art.
For
instance, the solid surface may be a membrane (e.g., nitrocellulose), a
microtiter dish
(e.g., PVC or polystyrene) or a bead. The desired component may be covalently
bound or
noncovalently attached through nonspecific bonding.
Alternatively, the immunoassay may be carried out in liquid phase and a
variety of separation methods may be employed to separate the bound labeled
component
from the unbound labelled components. These methods are known to those of
skill in the
art and include immunoprecipitation, column chromatography, adsorption,
addition of
magnetizable particles coated with a binding agent and other similar
procedures.
An immunoassay may also be carried out in liquid phase without a
separation procedure. Various homogeneous immunoassay methods are now being
applied to immunoassays for protein analytes. In these methods, the binding of
the
binding agent to the analyte causes a change in the signal emitted by the
label, so that
binding may be measured without separating the bound from the unbound labelled
component.
Western blot (immunoblot) analysis can also be used to detect the presence
of antibodies to Chlamydia in the sample. This technique is a reliable method
for
confirming the presence of antibodies against a particular protein in the
sample. The
technique generally comprises separating proteins by gel electrophoresis on
the basis of
molecular weight, transferring the separated proteins to a suitable solid
support, (such as a
nitrocellulose filter, a nylon filter, or derivatized nylon filter), and
incubating the sample
with the separated proteins. This causes specific target antibodies present in
the sample
to bind their respective proteins. Target antibodies are then detected using
labeled anti-
human antibodies.
The immunoassay formats described above employ labelled assay
components. The label may be coupled directly or indirectly to the desired
component of
the assay according to methods well known in the art. A wide variety of labels
may be
used. The component may be labelled by any one of several methods.
Traditionally a
radioactive label incorporating 3H,'ZSh ass, i4C, or 32P was used. Non-
radioactive labels
26


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
include ligands which bind to labelled antibodies, fluorophores,
chemiluminescent agents,
enzymes, and antibodies which can serve as specific binding pair members for a
labelled
ligand. The choice of label depends on sensitivity required, ease of
conjugation with the
compound, stability requirements, and available instrumentation.
$ Enzymes of interest as labels will primarily be hydrolases, particularly
phosphatases, esterases and glycosidases, or oxidoreductases, particularly
peroxidases.
Fluorescent compounds include fluorescein and its derivatives, rhodamine and
its
'- derivatives, dansyl, umbeliiferone, etc. Chemiluminescent compounds include
luciferin,
and 2,3-dihydrophthalazinediones, e.g., luminol. For a review of various
labelling or
signal producing systems which may be used, see U.S. Patent No. 4,391,904,
which is
incorporated herein by reference.
Non-radioactive labels are often attached by indirect means. Generally, a
ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand
then binds
to an anti-ligand (e.g., streptavidin) molecule which is either inherently
detectable or
covalently bound to a signal system, such as a detectable enzyme, a
fluorescent
compound, or a chemiluminescent compound. A number of ligands and anti-ligands
can
be used. Where a Iigand has a natural anti-ligand, for example, biotin,
thyroxine, and
cortisol, it can be used in conjunction with the labelled, naturally occurring
anti-ligands.
Alternatively, any haptenic or antigenic compound can be used in combination
with an
antibody.
Some assay formats do not require the use of labelled components. For
instance, agglutination assays can be used to detect the presence of the
target antibodies.
In this case, antigen-coated particles are agglutinated by samples comprising
the target
antibodies. In this format, none of the components need be labelled and the
presence of
the target antibody is detected by simple visual inspection.
Phazmaceutical Compositions
The peptides or antibodies (typically monoclonal antibodies) of the present
invention and pharmaceutical compositions thereof are useful for
administration to
mammals, particularly humans, to treat and/or prevent Chlamydia infections.
Suitable
formulations are found in Remington's Pharmaceutical Sciences, Mack Publishing
Company, Philadelphia, PA, 17th ed. (1985).
27


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
The immunogenic peptides or antibodies of the invention are administered
prophylactically or to an individual already suffering from the disease. The
peptide
compositions are administered to a patient in an amount sufficient to elicit
an effective
immune response to Chlamydia. An effective immune response is one that
inhibits
infection. An amount adequate to accomplish this is defined as
"therapeutically effective
dose" or "immunogenically effective dose." Amounts effective for this use will
depend
on, e.g., the peptide composition, the manner of administration, the stage and
severity of
the disease being treated, the weight and general state of health of the
patient, and the
judgment of the prescribing physician, but generally range for the initial
immunization
(that is for therapeutic or prophylactic administration) from about 0.1 mg to
about 1.0 mg
per 70 kilogram patient, more commonly from about 0.5 mg to about 0.75 mg per
70 kg
of body weight. Boosting dosages are typically from about 0.1 mg to about 0.5
mg of
peptide using a boosting regimen over weeks to months depending upon the
patient's
response and condition. A suitable protocol would include injection at time 0,
4, 2, 6, 10
and 14 weeks, followed by further booster injections at 24 and 28 weeks.
For therapeutic use, administration should begin at the first sign of
infection. This is followed by boosting doses until at least symptoms are
substantially
abated and for a period thereafter. In some circumstances, loading doses
followed by
boosting doses may be required. The resulting immune response helps to cure or
at least
partially arrest symptoms and/or complications. Vaccine compositions
containing the
peptides are administered prophylactically to a patient susceptible to or
otherwise at risk
of the infection.
The pharmaceutical compositions (containing either peptides or
antibodies) are intended for parenteral or oral administration. Preferably,
the
pharmaceutical compositions are administered parenterally, e.g.,
subcutaneously,
intradermally, or intramuscularly. Thus, the invention provides compositions
for
parenteral administration which comprise a solution of the immunogenic
polypeptides
dissolved or suspended in an acceptable carrier, preferably an aqueous
carrier. A variety
of aqueous carriers may be used, e.g., water, buffered water, 0.4% saline,
0.3% glycine,
hyaluronic acid and the like. These compositions may be sterilized by
conventional, well
known sterilization techniques, or may be sterile filtered. The resulting
aqueous solutions
may be packaged for use as is, or lyophilized, the lyophilized preparation
being combined
with a sterile solution prior to administration. The compositions may contain
28


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
pharmaceutically acceptable auxiliary substances as required to approximate
physiological conditions, such as buffering agents, tonicity adjusting agents,
wetting
agents and the like, for example, sodium acetate, sodium lactate, sodium
chloride,
potassium chloride, calcium chloride, sorbitan monolaurate, triethanolamine
oleate, etc.
The compositions may also comprise carriers to enhance the immune
response. Useful carriers are well known in the art, and include, e.g., KLH,
thyroglobulin, alburnins such as human serum albumin, tetanus toxoid,
poiyamino acids
such as poly(lysine:glutamic acid), influenza, hepatitis B virus core protein,
hepatitis B
virus recombinant vaccine and the like.
For solid compositions, conventional nontoxic solid carriers may be used
which include, for exarr.ple, pharmaceutical grades of mannitol, lactase,
starch,
magnesium stearate, soc!.ium saccharin, talcum, cellulose, glucose, sucrose,
magnesium
carbonate, and the like. For oral administration, a pharmaceutically
acceptable nontoxic
composition is formed Y y incorporating any of the normally employed
excipients, such as
1 ~ those carriers previously listed, and generally 10-95% of active
ingredient, that is, one or
more peptides of the invention, and more preferably at a concentration of 25%-
75%.
As noted above, the peptide compositions are intended to induce an
immune response to Chlamydia. Thus, compositions and methods of administration
suitable for maximizing the immune response are preferred. For instance,
peptides may
be introduced into a host, including humans, linked to a carrier or as a
homopoiymer or
heteropolymer of active peptide units from various Chlamydia proteins
disclosed here.
Alternatively, a "cocktail" of polypeptides can be used. A mixture of more
than one
polypeptide has the advantage of increased immunological reaction and, where
different
peptides are used to make up the polymer, the additional ability to induce
antibodies to a
number of epitopes.
The compositions also include an adjuvant. As used here, number of
adjuvants are well known to one skilled in the art. Suitable adjuvants include
incomplete
Freund's adjuvant, alum, aluminum phosphate, aluminum hydroxide,
N-acetyl-rnuramyl-L-threonyl-D-isoglutamine (thr-MDP),
N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-
MDP),
N-acetylinuramyl-Lalanyl-D-isoglutaminyl-L-alanine-2-{1'-2'-dipalmitoyl-sn-
g:ycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-
PE),
and RIBI, which contains three components extracted from bacteria,
monophosphoryl
29


CA 02350775 2001-05-11
WO 00!17994 PCT/US99/26923
lipid A, trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS) in a 2%
squalenelTween 80 emulsion. The effectiveness of an adjuvant may be determined
by
measuring the amount of antibodies directed against the immunogenic peptide.
The concentration of immunogenic peptides of the invention in the
S pharmaceutical formulations can vary widely, i.e. from less than about 0.1
%, usually at
or at least about 2% to as much as 20% to 50% or more by weight, and will be
selected
primarily by fluid volumes, viscosities, etc., in accordance with the
particular mode of
administration selected.
The peptides of the invention can also be expressed by attenuated viral
hosts, such as vaccinia or fowlpox. This approach involves the use of vaccinia
virus as a
vector to express nucleotide sequences that encode the peptides of the
invention. Upon
introduction into a host, the recombinant vaccinia virus expresses the
immunogenic
peptide, and thereby elicits an immune response. Vaccinia vectors and methods
useful in
immunization protocols are described in, e.g., U.S. Patent No. 4,722,848.
Another vector
is BCG (Bacille Calmette Guerin). BCG vectors are described in Stover et aI.
(Nature
351:456-460 (1991)). A wide variety of other vectors useful for therapeutic
administration or immunization of the peptides of the invention, e.g.,
Salmonella typhi
vectors and the like, will be apparent to those skilled in the art from the
description
herein.
The DNA encoding one or more of the peptides of the invention can also
be administered to the patient. This approach is described, for instance, in
Wolff et. al.,
Science 247: 1465-1468 (1990) as well as U.S. Patent Nos. 5,580,859 and
5,589,466.
In order to enhance serum half life, the peptides may also be encapsulated,
introduced into the lumen of liposomes, prepared as a colloid, or other
conventional
techniques may be employed which provide an extended serum half life of the
peptides.
A variety of methods are available for preparing liposomes, as described in,
e.g., Szoka et
al., Ann. Rev. Biophys. Bioeng. 9:467 (1980), U.S. Pat. Nos. 4, 235,871,
4,501,728 and
4,837,028.
EXAMPLES
The following examples are offered to illustrate, but no to limit the
claimed invention.
Examvle 1:


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
This example describes comparison of the C. pneumoniae genome
disclosed here and the, previously sequenced, C. trachomatis genome (Stephens,
et al.
Science 282:754-759 (1998)).
The apparent low level of DNA homology between C. trachomaris and C.
pneumoniae (Campbell, et al., J. Clin. Microbiol. 25:1911-1916 {1987)) yet
analogous
cell structures and developmental cycles, predicts that comparative analysis
of the two
genomes will significantly enhance the understanding of both pathogens.
Identification
of genes that are present in one species but not the other are of particular
importance for
the mutually exclusive biological, virulence and pathogenesis capabilities of
each.
Identification of genes shared between the two species strongly supports the
requirement
for these capabilities in a biological system that has, over its long-term
association with
mammalian host cells, evolved to reduce the metabolic capacities while
optimizing
survival, growth and transmission of these unique pathogens.
The previously sequenced G trachomatis genome contains 1,042,519
I S nucleotides and 875 likely protein-coding genes. Similarity searching
permitted the
inferred functional assignment of sequences 636 {60%) genes disclosed here and
251
(23%) are similar to hypothetical genes for other bacterial organisms
including those for
G trachomatis. The remaining 186 (17%) genes are not homologous to sequences
deposited in GenBank.. Seventy C. trachomatis genes are not represented in the
C.
pneumoniae genome. These are contained within blocks consisting of 2-17 genes
and 19
single genes. Of the 70 G trachomatis genes without homologs in C. pneumoniae,
60 are
classified as encoding hypothetical proteins. The remaining genes not
represented in C
pneumoniae consist of the tryptophan operon (trpA,B,R), trpC, two predicted
thiol
protease genes, and 4 genes assigned to the phospholipase-D superfamily.
It is evident that there is a high level of functional conservation between C.
pneumoniae and C. trachomatis as orthologs to C. trachomatis genes were
identified for
859 (80%) of the predicted coding sequences for G pneumoniae. The level of
similarity
for individual encoded proteins spans a wide spectrum (22-95% amino acid
identity) with
an average of 62% amino acid identity between orthologs from the two species.
The
percent amino acid identity between orthologous chlamydial proteins is similar
among
functional groups with the highest for proteins associated with translation
and the lowest
for proteins whose function in chlamydiae is uncharacterized and not related
to proteins
encoded by other organisms. The gene order of the homologous set of genes in
C.
31


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/2b923
pneumoniae shows reorganization relative to the genome of C. trachomatis;
however,
there is a high level of synteny for the gene organization of the two genomes.
We
identified thirty-nine blocks of 2 or more genes whose gene organization is
colinear with
homologs to C. trachomatis, although some of these are inverted. The
distribution of
genome reorganization is not evenly distributed on the chromosome as the
region
between G pneumoniae coding sequences 0130-0300 contains substantially more
reorganization than other areas of the genome. This region coincides with the
predicted
chromosome replication terminus.
We identified orthologs of enzymes characterized in other bacteria that
account for the essential requirements for DNA replication, repair,
transcription and
translation including two predicted DNA helicases of the Swi2/Snf2 family
found in C.
trachomatis. Similar to G trachomatis, alternative sigma subunits for RNA
polymerase,
X28 ~d ~54~ were identified in addition to anti-a~ regulatory system factors
RsbV, a
RsbW-like single-domain histidine kinase, and a RsbU-like protein phosphatase.
These
findings suggest that the fundamental mechanisms of transcriptional regulation
are
conserved among Chlamydia. The C. trachomatis proteins containing SET and SWIB
domains, and a SWiB domain fused to the C-terminus of the chlamydial
topoisomerase I,
not identified outside eukaryotes, are found in C. pneumoniae supporting their
possible
role in the chromatin condensation-decondensation characteristic of the
biologically
unique chlamydial developmental cycle.
The central metabolic pathways inferred from the G pneumoniae genome
sequence are the same as those identified for C. trachomatis G pneumoniae has
a
glycolytic pathway and a linked tricarboxylic acid cycle, although likely
functional, is
incomplete as genes for citrate synthase, aconitase, and isocitrate
dehydrogenase were not
identified. C. pneumoniae has a complete glycogen synthesis and degradation
system
supporting a role for glycogen synthesis and utilization of glucose-
derivatives in
chlamydial metabolism. Genes encoding essential functions in aerobic
respiration are
present and electron flux may be supported by pyruvate, succinate, glycerol-3-
phosphate,
and NADH dehydrogenases, NADH-ubiquinone oxidoreductase and cytochrome
oxidase.
C. pneumoniae also contains the V (vacuolar}-type ATPase operon and the two
ATP
translocases found in C trachomatis.
The type-III secretion virulence system required for invasion by several
pathogenic bacteria and found in the C. trachomatis genome in three
chromosomal
32


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
locationsis also present in the C. pneumoniae genome. Each of the components
is
conserved and their relative genomic contexts are conserved. Genes such as a
predicted
serine/threonine protein kinase and other genes physically linked to genes
encoding
structural components of the type-III secretion apparatus, but without
identified
homologs, are also highly similar between the two species suggesting the
functional roles
in modifying cellular biology are fundamentally conserved.
Chlamydia-encoded proteins that are not found in chlamydial organisms
but localized to the intracellular chlamydial inclusion membrane are likely
essential for
the unique intracellular biology and perhaps differences in inclusion
morphology
observed between species of Chlamydia. Several such proteins, termed incA,B&C,
have
been characterized for a !:. psittaci strain (Rockey, et al. Mol. Microbiol.
15:617-626
(1995); Rockey et al. Inf:~ct. Immun. 62:106-112 (1994)). C. pneumoniae and C.
trachomatis encode orthc~logs to C. psittaci Inca and IncC and C. trachomatis
also
contains an ortholog to LicA. C. pneumoniae contains two genes that encode
proteins
with similarity to IncA (CPn0186 and CPn0585), although the level of homology
is low
suggesting analogous but possibily altered functions.
The tryptophan biosynthesis operon (trpA, trpB, trpR) and trpC identified
in C. trachomatis is conspicuously missing in the C. pneumoniae genome. This
represents the entire repertoire of genes associated with tryptophan
biosynthesis identified
in C. trachomatis. Seventeen genes adjacent to the C. trachomatis tryptophan
operon also
were not found in the G pneumoniae genome. This region is the single largest
loss of a
contiguous genomic segment and includes 4 HKD superfamily encoding genes that
encompass a family of proteins related to endonuclease and phospholipase D.
These
findings may be important for the ability of Chlamydia to persist in their
hosts and cause
disease by eliciting potent, focal and persistent inflammatory responses
thought to be
essential for pathogenesis.
The C. pneumoniae genome contains 187,711 additional nucleotides
compared to the C. trachomatis genome, and the 214 coding sequences not found
in C.
trachomatis account for most of the increased genome size. Eighty-eight of
these genes
are found in blocks of >10 genes {11-30 genes/block), 41 are single genes, and
the
remainder are partnered with at least one other gene. Based upon the
observation that
~%U% of all the C. pneumoniae genes have an identifiable homolog in GenBank,
exclusive of C. trachomatis, it would be expected that over 150 of the 214
genes should
33


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
have a homolog in GenBank, many associated with a function. However, only 28
coding
sequences have similarity to genes from other organisms. Thus the majority of
the genes
that are mutually exclusive of C. trachomatis (186 of 214), and the 60 of 70 G
trachomatis genes that lacked an identifiable homolog in C. pneumoniae, do not
have
detectable homologs to genes from other organisms. We predict that most of the
unique
genes are essential for specific attributes that define the differential
biology, tropism and
pathogenesis of C. trachomatis and C. pneumoniae. Moreover, this suggests that
C.
pneumoniae has more unique biological (i.e., virulence) capacity than C.
trachomatis.
The ability of C. pneumoniae to be more invasive and survive in a broader
range of host
cell types than C. trachomatis is consistent with this hypothesis. Not all of
the
differences in biological capacity may be associated with mutually exclusive
genes. One
explanation for the significantly lower level of homology between protein
sequences
assigned as having G pneumoniae and C. trachomatis orthologs but no
identifiable
orthologs in other organisms is that this set of proteins is not only
associated with
biological requirements specific for Chlamydia but this polymorphism may
account for
differential biology between the two species. The determination of the genome
sequence
from a representative of the C. psittaci group will precisely delineate those
genes that are
mutually exclusive and specific for each species.
The major functionally identifiable addition to the C. pneumoniae genome
is a large expansion of genes encoding a new family of chlamydial polymorphic
membrane proteins (Pmp), alone representing 22% of the increased coding
capacity.
While the C. trachomatis genome has 9 pmp genes, remarkably the C. pneumoniae
genome contains 21 pmp genes. Most of these genes appear to be amplified in
two
regions of the genome with three stand-alone genes. Interestingly one of the
stand-alone
genes is most closely related to the C. trachomatis pmpD which is the only
stand-alone
pmp gene in the C. trachomatis genome and it is located with the same relative
genomic
context, suggesting an essential and conserved function for this paralog. Six
Pmp-coding
genes are presumably not functional as five contain predicted coding frame-
shifts and one
is truncated. The amplification of this gene family and the confidently
predicted frame-
shifts suggest a specific molecular mechanism to promote functional or
antigenic
diversity. The biological role of this protein family remains enigmatic,
although at least
one of the proteins in G psittaci related to this family is exposed on the
chlamydial
surface.
34


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/2b923
While a function could not be assigned for most of the unique G
pneumoniae genes, several have significant similarity to genes from other
organisms.
Functional assignments could be made for genes encoding GMP synthetase, IMP
dehydrogenase, (JMP synthase, uridine kinase, biotin svnthase pathway
proteins,
methylthioadenosine nucleosidase, a DNA glycosylase and aromatic amino acid
hydroxylase. Thus a complete pathway was identified for biotin biosynthesis.
The
additional purine and pyrimidine salvage pathway genes presumably reflect
metabolic
' limitations in one of the cell types that G pneumoniae infects or
differences in the ability
of C. pneumoniae to transport precursor nucleosides or nucleotides.
The addition of aromatic amino acid hydroxylase in G pneumoniae is
intriguing especially in light of the loss of tryptophan biosynthetic genes
and the inability
to synthesize other amino acids including phenylalanine. Aromatic amino acid
hyroxlyases include three distinct enzymes that function to receptively
oxidize
phenylalanine to tyrosine, tyrosine to Dopa, and tryptophan to 5-
hydroxytryptophan and
serotonin. Although the chlamydial protein is similar to proteins of this
family and
incrementally more closely related to tryptophan hydroxyiase, its specific
function could
not be confidently predicted. We hypothesize that it may be involved in C.
pneumoniae
virulence. Tryptophan hydroxylase has not been previously identified in
bacteria and the
origin of the chlamydial gene appears to be from eukaryotes. The functional
role of an
aromatic amino acid hydroxyiase for C. pneumoniae is linked to the unique
intracellular
biology of this organism and may represent a key contribution to C. pneumoniae
persistence and pathogenesis.
It is understood that the examples and embodiments described herein are
for illustrative purposes only and that various modifications or changes in
light thereof
will be suggested to persons skilled in the art and are to be included within
the spirit and
purview of this application and scope of the appended claims. All
publications, patents,
and patent applications cited herein are hereby incorporated by reference in
their entirety
for all purposes.
Table 1 provides functional assignments of C. pneumoniae nonprotein-
encoding genomic sequences. Table 2 provides functional assignments of protein
coding
sequences. Table 3 provides the amino acid sequences of the proteins
corresponding to
the coding sequences.


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
TABLE 1
type SEQ iD N0:1 SEQ tD N0:1 Gene


start position end position


Ori 841664 841396 (R) Putative Origin of Replica


tmRNA 138493 138074 (R) tmRNA


pRNA 607342 607649 Ribonuclease
P
RNA


rRNA 1000564 1002115 165 rRNA


rRNA 1002415 1005278 235 rRNA


rRNA 1005393 1005509 5S rRNA


tRNA 269070 269142 Ala tRNA_1


tRNA 164318 164389 Asn tRNA


tRNA 296224 296151 (R) Asp tRNA


tRNA 836191 836119 (R) Ala tRNA_2


tRNA 1030533 1030603 Cys tRNA


tRNA 784896 784822 (R> Glu tRNA


tRNA 781680 781610 (R) Gly tRNA'1


tRNA 961536 961607 Gly tRN~2


tRNA 999949 1000023 His tRNA


tRNA 268992 269065 Ile tRNA


tRNA 672236 672318 Leu tRNA 1


tRNA 680178 680257 Leu tRNA'2


tRNA 715889 715971 Leu tRNF~3


tRNA 739403 739486 Leu tRNPr_4


tRNA 1175863 1175944 Leu tRNA'5


tRNA 784994 784922 (R) Lys tRNA


tRNA 843926, 843999 Pro tRNA_2


tRNA 409922' 409848 (R> Pro tRNA_1


tRNA 631373 631445 Phe tRNA


tRNA 677337 677264 (R) Arg tRNA~,2


tRNA 807413 807341 (R) Arg tRNA_3


tRNA 877473 877400 (R) Arg tRNA_4


tRNA 462141 462214 Arg tRNA_1


tRNA 1085605 . 10.85676 Gln tRNA


tRNA 786780 786708 (R) Thr tRNA_3


tRNA 89728 89657 (R) Thr tRNA_I


tRNA 293477 293405 (R) Thr tRNA'2


tRNA 87522 87450 (R) Met tRNP~l


tRNA 199301 199229 (R) Met tRNA_2


tRNA 199390 199317 (R) Met tRNA_3


tRNA 626904 626987 Ser tRNA_1


tRNA 708359 708440 Ser tRNA_2


tRNA 1112034 1142117 Ser tRNA_3


tRNA 1230028 1229945 (R) Ser tRNA_4


tRNA 91070 90999 (R) Trp tRNA


tRNA 293399 293317 (R) Tyr tRNA


tRNA 296147 296075 (R) Val tRNA_1


tRNA 1137389 1137462 Va1 tRNA_2


36


gacggatttgcactgccggtagaactccgcgaggtcgtccagcctcaggcagcagctgaa2520


ccaactcgcgaggggatcgagcccggggtgggcgaagaactccagcatgagatccccgcg2580


ctggagg


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
TABLE 2
~ana ~rem "2 ctraeW:eee a~~n~'fee ~~ rerheeierfl erheiee iartateaGheaeel
1


Clft0001111 4 R CT001 hypothetical protein


CPa0003577 175 t QatC-Glu-CRNA Gla luaidotransterasa tC subunit)-
1CT0031


CPn0007195 X770 T aat~-Glu eRNl1 Gln Aaridotransterae-1GT003!


ma)un.t:!ih :~:! t- ,(.~rN.cPe~ll=t ~:In rnrt:, vln Ammto~san.-.:~rrrsr
t1 ::uWtniW -t~:T9n.n


vPn~0U5ils7 edJ1 F pmp_1-7olymorphic ~uc.rt !(emDrin~ Procmn
G Famsly


CPn0005~=93 7111 R


CPn00077805 10196 F


CPn000810975 11615 F


CPn000911115 13119 t


cPnooloa a 13x6 r
s


CPa00101379 13746 t frame-shift with 0010


CPn00I11519= 16114 !


CPn001316it4 11x12 !


CPn001311511 =1106 F ymp_1-Polynnrphie Outer Membrane Protein C
lamily


CPa0014II392 219x3 r ymD_3-Polyanrphic outer Membrane Protein G
lastly


CPn001537.135x174 t' pmp-"3-PNP_3 lirasre-shift with 0011)


CPn001614416 26118 t' pmp_t-Polyteosphie Outer Membrane Protein
G lastly


CPn001726094 x7170 F' pop_t-PMP_< Iiras~-shift with 00161


CPn0018375x2 29007 t pmp_5-Polymorphic outer Membrane Procain G
Pamily


cIPn001929007 30356 t pap_S-PMP_S Iirame-shift with 0011)


CPa00x031617 30603 P. Predieted OHP (leader (14) pepCide= outer
membrane)-ICT7511


CPn00x131410 3x707 R Predicted OHP (Leader 119) prptide)-1CT350)


CPn00xx1191 34395 F maL-1CT319)


CPa00x336607 3301 F y~ilc/alr-ABC TraasDOrter Protein ATPeee-ICT348)


CPn00x137596 36661 F xerC-InteQrsse/reeombinaae-lCT3t7)


CPa00x51860 37614 A elaC/atsa-Sulphohydrolase/Glycosuliataae-(
Crlt6l


CPa00x639625 3176= R GT3t5 hypoe3tetical protein-ICTltS)


CPn00x7txx3t 39778 R lon-Lon ATP-dependent Protease-lGT3tl


CPn00x813325 txSt3 R


CPa00x93755 43310 R


CPn003013191 415x9 f Qep_1-O-SialoQlycaDroeaia Eadopeptidase_1-ICT343)


CPa003114711 44111 ! rs=1-SZ1 Ribosaetal Ptrouin-fCT3t=I


Cln003x4913 46098 T daaJ-Meat Shoek Proteia J-ICT7ti)


CPn003346138 8171 P pdhArrB/odbAiodbH-lpyruvatel Oxoisovalerate
DehYdroOsasse Alpba i seta


Pusioa-ICT3t0)


CPa003449457 41=10 R


CPis0035510x9 49569 R CT339 hypothetical protein


CPts00365100? 51796 t CT338 hypothetical Droeeia


CPn003751792 5x115 F ptsH-PTS Phosphocarrisr Protein Hpr-It-T337)


CPn00385x119 63831 F DtsI-PTS PtP Phosphotransierase-ICT336)


ePn003954x50 53163 R ybal-ICT335)


CPn00405563 St318 R dnaX_1-DN11 Pol III Gamma and Tau_1-fCT3341


CPn004156996 5733 T


CPa004257103 5113 !


CPn00t35847 60372 !


CPn004460419 60771 !


CPn00t561069 6=790 t


CPa004661790 61x63 t


CPn004761155 63151 ~
T


CPn00486311? 85101. ! yqiP-Is conserved hypothetical IM proesin


CPn001966x96 651)7 R


CPn0050B613 66199 R


CPn00s166173 67111 t


CPn005261005 6730 R hemC-Porphobilinot:en Oeaaunass-ICTZ991


CPn005369744 67916 A sms-Sau Protein-ICTZ91)


CPn0054700:3 69713 R rnc-Ribonucluse III-(CTx97)


CPn0055701x9 70590 F CT296 hypothetical proteia '


Cpn00s670953 72746 t .n~rsA-PhosDhoernnomucasr ICTt95)


CPn00577=971 73551 F sodM-SuDeroxide Dismucase lMnlIC'T'l94)


CPn00587)839 7156= F aec0-AeCoA Casboxylase/Transierase eca-tCT293)


ePa0059'1618 71050 F duc-dtlTP Nueleocidohydrolase-ICTZ92)


CPn006075055 755x8 F pesN_t-PTS IIA Protein-IC'C:9lf


~Pn005175514 76:08 F DtaN_Z-PTS IIA Protein ttTN DNJ1-lin3inQ
Domain-lGTx90)


~Pn~05274)04 77490 F CT-~9 hypocnecieAl proceln


~:Pn~~b77811: 74':67 F


~Pn00b17N146 78S7b F


~:Pnn9551N9:4 40651 F C't'=99 hyDOther:i,:nl protein


~Pn0055409:5 d=655 F


37


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
CPn006782953 8053 F


CPn006884903 81331 R CT360 hypothetical pzocein


CPn006985236 87086 F


CPn007087378 87208 R


CPn007188045 87599 R CT325 hypothetical protein


CPn007289061 88057 R CT324 hypothetical protein


CPn007389356 89574 F infA-Initiation Factar IF-1-ICT323)


CPn007d89774 90955 F cufA-Elongation Factor Tu-1CT322)


CPn007591102 91350 F secE-preprotein cranslocaseICT321)


CPn007691358 91903 F nusG-Transcriptional Anciterminacion-(CT320)


CPn007792013 92435 F zlll-L11 Ribosomal Protein-(CT3191


CPn007892465 93160 F zll-L1 Ribosomal Protein-(CT318)


CPn007993179 93688 F r110-L10 Ribosomal Protein-(CT317)


CPn008093735 91131 F r17-L7/L12 Ribosomal Protein-(CT316>


CPn008194261 98016 F rpoH-RNA Polymerase Heca-(CT315)


CPn008298043 102221 F rpoC-RNA Polymeraae Heta~ -ICT314)


CPn0083102332 103312 F tal-Transaldolase-(CT313)


CPn0084103362 103751 F predicted ferredoxin-ICT312)


CPn0085104506 103755 R CT311 hypothetical protein


CPn0086104904 105527 F atpE-ATP Synchase Subuait E-(CT310)


CPn0087105579 105376 F CT309 hypothetical protein


CPn0088106373 108145 F atpA-ATP Syachase Subuait A-(CT308)


CPn0089108153 10966 F atpH-ATP Synthase Subunit e-(CT307)


~Pn0090109454 110080 F atpD-ATP Synthase Subunit D-(CT306)


CPn009110074 112053 F 1
atpI-ATP Synthase Subunit I-ICT3051


CPn0092112151 112573 F atpK-ATP Synthase Subunit K-(CT304)


CPn0093112509 113015 F CT303 hypothetical protein


CPn0094113152 115971 F valS-Valyl tRNA Synthetase-ICT302)


CPn0095116037 118790 F pfai0-5/T Protein Kinsse-(CT301)


CPn0096124314 118837 R uvrA-Excinuclease AeC Subunit A-(CT333)


CPn0097124555 126006 F pyk-Pyruvate Kinase-(CT332)


CPn0098127491 126091 R htrH-ACyltransferase-ICT010)


CPn0099127593 127865 F


CPn0100129141 127882 R CT011 hypothetical protein


CPn0101129932 129141 R ybbP family hypothetical protein-ICT012)


CPn0102130123 131466 F cydA-Cytochrome Oxidase Subunit I-(CT013)


CPn0103131480 132511 F cycle-Cytochrome Oxidase Subunit II-(CT014)


CPnOlOd133875 132676 R~ CT017 hypothetical protein


CPn0105134847 134029 R CT016 hypothetical protein


CPn0106135091 136374 F phoH-ATPase-(CT015)


CPn0107137162 136392 R CT058 hypothetical pzotein_1


CPn0108137857 137303 R CT018


CPn0109138655 141783 F ileS-Isoleucyl-tRNA Synthecase-1CT019)


CPn01101373 141827 R lepe-Signal Peptidase I-ICT020)


CPn011114686 143934 R CT021 hypothetical protein


CPn0112144767 145093 F r131-L31 Ribosomal Protein-(CT022)


CPn0113145335 146405 F pfrA-Peptide Chain Releasing Factor
(RF-1)-(CT0231


CPnOlld146398 147261 F hemK-A/G specific methylase-(CT024)


CPa0115147279 148622 F ffh-Signal Recognition Particle GTPase-(CT025)


CPn0116148616 148972 F rsl6-516 Ribosomal Protein-(CT026)


CPn0117148989 150071 F tzmD-tRNA (guanine N-1)-Methylttansferase-(CT027)


CPn0118150102 150464 ~ s119-L19 Ribosomal Protsin-(CT'028)
F


CPn0119150523 151164 F rnhe_1-Ribonuclease HII_1-ICT029)


CPn0120151164 151778 F gmk-GMP Kinase-(CT030)


CPn0121151778 152068 F CT031 hypothetical protein


CPn0122152071 153723 F mete-Methionyl-tRNA Synehetase-(CT032)


CPa0123155969 153774 R recD_1-Exodeoxyribonuclease V (Alpha
Subunit)_1-(CT033)


CPn0124156614 158068 F


CPn0125158096 158605 F


CPn0126158809 161085 F


CPn0127162143 161130 R ytfF-Cationic Amino Acid Transporter-(CT034)


CPn0128162277 163053 F bpll-Hiotin Protein Lipase-(CT035)


CPn0129163717 16306 R similarity to CT036


CPn013016425 163751 R


CPn0131164519 165580 F


CPn0132165587 166561 F


CPn013316733 16656 R CHLPS hypothetical protein-(CT109)


CPn0134169098 167467 R groEL_1-HSP-60_1-tCT1101


CPn0135169448 16913 R groES-lOKDa Chaperonin-(CT1111


CPn0136171401 169569 R pepF-Oligopepcidnse-ICT112)


CPn0137172254 171502 A ybgI-ACR family-tCT1081


CPn013817019 172700 R hem:..-Glucatrace-1-semialdehyde-2.1-aminomutase-
ICT2101


38


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
CPn013917465617093 R ypq=-(~j10)


CPaoltO175110171173 R yqdi-tCTSlI


CPnoltl175103175110 R splA-Ribose-5-P Isasrrase A-tCTZ131


CP1l01t2176091175116 R


CPn01t317T33s176114 R 'yxjC_Ds_1 ttypothecical Proceia


CPn0114177963180560 F elpl-Clp Protease ATPass-tCT1I31


cPaoltsI8o777I1=369 F CTllt hypochetiul protein


cPnoltsIaI1131e3o9s r


cPetolt7Ia3tI5113171 F


CPn0ltB18316 183702 F plasl-S/T Protein Kinase-tCT115)


CPaolt918371517700 F dalJ-DNA LiQase-tCTi46)


CPno1501171311911 F CTlt7 hypothetical protein


CPn01511911 19=635 R mhpJ~-tioaooxypeeuse-(CT1181


ClnolSl19!=6s19718 R CT119 hypotbetiul prot~ia


~Paol5319533 197113 F leul-Leueyl tRNA 8yaehttass-fCTt09!


ClnolSt197892199301 F pseA-1CD0 Traaslsrase-ICTI08f


CPnolSS191691191118 R


CPnol561001171!1770 R


CPao157100713100=98 A


CPtl015820130 100191 R


CPnoi59=01772101167 R


CPno160303791303137 R pfkJ~i-Fructose-b-P Phospdocraasferase_1-lCTI07)


CPao161101612303798 R psedietad aeylcraasferase lamily-tGTI06)


CPn016220511810803 R


CPa0163308016=06391 !


CPnolbt208198106!98 !


CPnoi65306198207583 P


CPetoi66z07830207963 !


CP~WI67201306107977 R


CPnolBB20161 201417 R


CPno169109501101710 R


CPao170111016110015 R


Clnol7lIi=13621119 R '~faA-Clip 9ynthaas


Clnol7l11317721=110 R QuaD/lapD-laosiae 5'-moaophosphass dehydro0anase
IC00R-tesa~iaal savior.


only)


Clno173113987213715 R


cPnol7tIlass7Iu7It F


CPnol75214198215175 F'


Claol76213=86z16318 F CTi53 hypoehatieal protein


CPno17721759 116608 R


CPn0178211052317789 R


CPn0179211103218056 R


CPuo180111851218356 R


CPao181219175111777 R


CPn0182110596219331 R aceC-Biocia Carboxylass-tCTiIt)


CPn0183111195330695 R ace!-Diocia Carboxyl Carrier Protein-tCTlI3)


CPa018t211775231331 R s!p_1-EloaQacion Factor P_I-tCTl3I)


CPUo185113151231765 R spe/araD-Ribulose-P Epimsrus-tCT121)


CPn0186111199111068 F stmilaricy to Cps IaeJL1-tCT1191


CPnQ117111118213015 F predicted metdylass-fCTi331


CPno188116111111100 F CTI3I hypoebecical prouia


CPn0189116100211815 F CSI31 homolo0-(Possible Transmembraas Prouin)
~


Clao19013!!19131271 F


Cla019133199133131 R QlaQ-1180 Amigo Acid Trsasportes ATPase-1CT130)


CPao192I3=631131981 R Qln!-A8C Amigo Acid Tsaasporeer Pesmsass-ICT1I9I


CPa0193233126231686 R arQR-ArQiaias Re~tssor


CPa019t13311023111 F Qep_I-O-Sialo0lyeoprotsia trrdopsptidase_I-tC?197)


CPaoi95231190135786 F oppA_1-Olfpopeptide BindiaQ Procai~l


CPa0196336939137519 F app!'.I-Olipopepeids DindiaQ Protais~l-(CT1981


CPnol972375'!8331183 F oppJ~3-Olipopeptlds Diadtap Protsitt-3


Clnol98=79169It07tb F opplL,t-OliQOpsptide Dindta? Psocsl~l


CPnoi99ItlOtz31983 F oppD_1-Olipopeptide Pesmsase_1-ICT1l9)


CPn0I00111017147868 F opp~i-Olipopeptide Pesmease_1-(CTt00)


Cln0I01111161zt371s F oppD-oli0opepclde Transport ATPass-tCTI01)


CPnO=02zt1715111500 F oppF-Olipopepcids Trtasporc ATPass-tCTtOI


ClelO=03-'25008 I510z F


craolotztsel7ztlooz F


csoozosztu3 It13z7 F


Clet0I0611610927161 F CTI03 hypothetical psoteia


CPt10I07zt7I08111617 F ybhI/sodiT!-OxoQluearacs/Halace Translocatos-tGT20t1


CPa0I08111953z50602 F pi)cJ~.Z-Fructose-b-P Ptsosphotraaatesass_1-tCTI051


CPe0I09251036:51172 F


39


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
CPn0210252384 251140 R


CPn0211252756 252463 R


CPn0212254066 252888 A


CPn0213254342 254190 R


CPn0I14255657 254146 R


CPn0215257015 255759 R


CPn0216257608 257174 R


CPn0217257896 258579 F ypdP-(CT140)


CPn0218259058 258582 R


CPn0219259357 260472 F tgt-pueuine tANA Ribosyl Transferase-(CT193)


CPn0220260696 261238 F


CPn0221261657 262064 F


CPn0222262504 262842 F wak similarity to Hacteriophage CHP1 (Orl4>


CPn0223262956 263333 F


CPn0224263435 263674 !
.


Cpn0225263873 264541 !


CPn0226264566 261967 F


CPn0227265116 265009 R dsb8-Disulfide bond Oxidoreductase-(CT176)


CPn0228266110 265412 R dsbG-Disulfide Bond Chaperone-(CT177)


CPn0229266328 267560 F CT178 hypothetical protein


CPn0230268253 267576 R CT179 hypothetical protein


CPn0231268957 268253 R tauH-AHC Transport ATPase (Nitrate/Fe)-(CT180)


CPa0232270122 269232 R similarity to 5~-Methylthioadenosine / S-
Adenosylhosaeysteine


Nucleosidase


CPa0233270424 270218 R


CPn0234271240 270548 R CT181 hypothetical protein


CPa0235271416 272177 F kdaH-deoxyoetulonosic Acid Syathetase-(CT182)


CPn0236272156 273766 F pyre-CTP Synthecase-(CT1831


CPn0237273762 274214 F yggF Family-(CT184) '


CPn0238274303 27$838 F zwf-Glucose-6-P Dehyrogenase-(CT185)


CPn0239275899 276672 F devB-Glucose-6-P Dehyrogenase (DevH family)-(CT186)


CPa02d0277861 276698 R


CPn0241279354 278203 R


CPn02d2279918 279487 R


CPa02d3280555 280133 R


CPn0244280918 281556 F adk-Adenylate Kinase-(CT128)


CPn0215281645 282499 F ydh0-Polysaccharide tiydrolase-Invasin Repeat
Family-(CT127)


CPn02d6282952 282551 R~ rs9-S9 Ribososial Protein-(CT126)


CPn0247283615 282969 R r113-L13 Ribosomal Protein-(CT125)


CPa02d8284327 283650 R ycfV/ybbA-AHC Transporter ATPase-(CT152)


CPn02d9285841 28333 R CT151 hypothetical protein


CPn0250286057 285902 R r133-L33 Ribosomal Protein-(CT1501


CPn0251286060 287559 F eonserved hypothetical protein


CPa0252288112 287576 R CT144 hypothetical protein (frame-shift
with 0253?)


CPn0I5328856 287950 R CT144 hypothetical protein_1


CPn0254289262 288159 R CT143 hypothetical protein'1


CPn0255290165 289329 R CT142 hypothetical protein_1


CPn0256291264 290398 A CTl4d hypothetical protein_2


CPn0257292127 291267 R CTld3 hypothetical proteln,",2


CPn0258292531 292133 R CT142 hypothetical protein (frame-shift
with 02591)


CPa0259292986 292441 R CTld2 hypothetical protei~2


CPn0260294045 29358 R sec~l-Protein Translocase Subunit_1-(CTidl)


CPn0261294302 295033 F ydn0-PP-Loop Superfnmily ATPase-(CT217)


CPn0262295091 295933 F surf-Surf-like Aeid Phosphatase-(CT218)


CPn0263296249 297136 F yQfU hypothetical protein-ICT221)


CPn0264297730 297155 A ubiD-Phenylacrylate Decarboxylase-(CT2201


CPn0265298620 297730 R ubiA-Benzoate Oetaphenyltransferase-(CT219)


CPn0266299184 299876 F


CPn0267300122 300910 F


CPn0268300935 301318 F


CPn0269302150 301476 R Dipeptidase-(CT138)


CPn0270303325 302468 R ywlC-SuAS Superfamily-related Protein-(CT137)


CPn0271303634 301362 F Lysophoepholipase esterase-(CT1361


CPn0272305233 304340 R dnaX_2-DNA Pol III Gamma and Tau_2-(CT187)


CPn0273305844 305227 R tdk-Thymidylace Kinase-(CT1881


CPn0274308353 305852 R gyrA_1-DNA Gyrase Subunit A_1-1CT189)


CPn0275310786 308372 R gyr8_1-DNA Gyrase Subunit H_1-(CT190)


CPn0276311137 310793 R CT191 hypothetical protein


CPn0277311910 311104 A


CPn0278312875 312060 R conserved outer membrane lipoprotein protein


CPn0279313537 312875 A Posaibls ABC Transporter Pe:mease Protein


CPa0280314572 313550 A dppF-Oipeptide Transporter ATPaee-(CT689)




CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
CPn0281315057 316103 F dhnA-Predicted 1.6-Fructose Hiphosph..-i Aldolase
Idehydrin family)-


(CT215)


CPn0282316126 317529 F xasA/gadC-Amino Acid Transporter-(CT216)


CPn028331897 317532 R


CPn0284319045 318551 R


CPn0285320595 319051 R


CPn018632=059 320650 R mgtE-Mq Transporter ICHS Domain)-(CT194)


CPn0287321221 322089 R '


CPn0288325716 321571 R CT195 hypothetical protein


CPn0289325812 326996 F aaaT-Neutral Amino Acid lGlutamate) Traruporter-
(CTZ30I


CPn0290327042 328523 F Na-dependent Transporter-ICT231)


CPn0291321667 3=9191 F incH-Inclusion Membrane Protein H-ICT232)


CPn0292329118 329836 F incC-Iaclusioa Membrane Protein C-ICT233)


CPn0293329919 332723 F CT234 hypoehecieal proteia


CPn0291333092 333502 F eAMP-Dependent Protein Kiaase Regulatory Subuait-
fCT=35)


CPn0295333863 333627 R aepP-ACyl Carrier Protein-ICT236)


CPn0296331765 331022 R labG-Oxoacyl lCarrier Procaia) Reductase-ICT237)


CPn0297335697 334771 a fabD-Malonyl Acyl Carrier Tr~sacyclase-fCT238)


CPn0298336721 335717 1t fabN-Oxoacyl Carrier Protein Synthase ZZZ-fCT239)


CPn0299336816 337115 ) reeR-Recombination Protein-fCTZ40)


CPn0300337783 340152 I yaeT-Omp85 Analog-fCT141I


CPn0301340250 340762 I' fCmpH-Like outer Membrane Protein)-fCT242)


CPn0302340787 311866 I' lpxD-UDP Glueosamine N-Aryltransferase-fCT243)


CPn0303342958 341921 F' CT211 hypothetical protein


CPn0304343133 344158 F pdhA/odpA-Pyruvate Dehydrogenase Alpha-!0235)


CPn0305341154 345137 I pdhe/odp8-Pyruvace Dehydrogenase Beta-(022461


CPn0306345145 346431 1 pdhC-Dihydrolipoamide Aeetyltra~leraae-102247)


CPn0307348986 346515 1: glgP-Glycogen Phosphorylase-!02248)


CPn0308349231 349596 F' simflarity to CT249


CPn0309350974 349595 R dnaA_1-Replication Initiation Protein_1-!02250)


CPn0310353433 351049 R 60IM-60kDa Inner Membrane Proteia-!02251)


CPn0311354438 353575 R lgt-Prolipoprocein Diacylglycerol Transferase-!0225=I


CPn0312354524 354976 F CT101 hypothetical protein


CPa0313354990 355355 F acpS-Aeyl-carrier Pzotein Synchase-102100)


CPa0314356285 355353 R trxe-Thioredoxin Reduccase-1020991


CPa0315356977 358716 F rsl-51 Ribosomal Protein-102098)


CPa0316358820 360121 F nusA-N Utilisation Protein A-(02097)


CPn0317360081 362750 F~ infH-Initiation Faecor-2-!02096)


CPn0318363767 363126 F rbfA-Ribosome Binding Factor A-102095)


CPn0319363175 363879 F truth-tRNA Pseudouridine Synthase-!02091)


CPn0320363860 364783 F ribF-FAD Syntluse-(CTD93)


CPn0321365858 364767 R ychF-GTP Binding Protein-!02092)


CPn0322366219 367328 F yscU-YopS Translocation Protein U -!02091)


CPn0323367331 369460 F lcrD- Low Calcium Response D-(02090)


CPn032d369492 3.70688F lcrE- Low Calcium Response E-(02089)


CPn0325370708 371148 F sycE-Secretion Chaperone-(02088)


CPn0326371148 372725 F malQ-Glueanotransferase-102087)


CPn0327372915 373211 F r128-L38 Ribosomal Protein-!02086)


CPn0328373241 371992 F GT085 hypothetical protein


CPn0329375088 376146 F Phopholipase D SuDerf~lY (leader f33) peptide)-!02084)


CPn0330376675 376202 R CT083 hypothetical protein


CPa0331378437 376701 R CT082 hypothetical protein
~


CPn0332378655 378536 R CNLTR T2 Protein-!02081)


CPa0333379090 378800 R ltue-102080)


CPn0334379311 379823 F CT079 similarity


CPa0335379817 380671 F folD-Methylene Tetrahydrofolate Dehydrogenase-(02078)


CPn0336380650 381591 F yojL-1020771


C>?n0337382027 381575 R smp8- Small Protein 8-102076)


CPn0338383278 383375 F dnaN-DNA Pol III (beta chain)-iGT075)


CPn0339383420 384030 F reeF-ABC superfamily ATPase-!02074)


CPn0340383802 '384156F (frame-shift with 0339)


CPn0341384160 384195 F (frame-shift with 0340)


CPn0342384622 385062 F predicted OMP ;leader 119) peptide)-tCT073)


CPn0313:84999 385595 F (frame-shift with 0342?)


CPn0341387420 385558 R yaeL-Metalloprocease-(02072)


CPn0315388572 387136 R yaeM-IGT071)


CPn0346389675 388704 R cro0/ycgD-Integral Membrane Protein-!02070)


CPn0317391021 389678 R croC/ytgC-Integral Membrane Protein-102069)


CPn0348391803 391027 R troe/ytqH-ABC transporter ATPase-(020681


CPn0349392770 391790 R t:oA/ycgA-Solute Protein Binding Family-(0':067)


CPn035J393181 39368 F CT066 ty~ochecscal Drotein


CPn0351397888 395132 F adt_1-ADP/ATP Transloease_1-!02065)


41


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
CPn0352395574 396830 F


CPn0353396893 397135 F


CPn0354397167 398507 F


CPn0355399889 398591 R


CPn0356400459 400109 R


CPn0357401317 400469 R


CPn0358401751 401578 R


CPn0359402012 403817 F lepA-GTPase-ICT064)


CPn0360405358 403922 R gnd-6-Phosphogluconace Dehydrogenase-tCT063)


CPn0361406647 405382 R tyrS-tyrosyl tRNA Synthecase-ICT062)


CPn0362407825 407055 R fliA/rpsD-Sigma-28/WhiG Family-(CT061)


CPn0363409688 407943 R flhA-Flagellar Secretion Protein-(CT060)


CPn0361409966 410238 F ferd-Ferredoxin IV-(CT059)


CPn0365410528 411544 F


CPn0366411976 412440 F


CPn0367413102 413836 F


CPn0368413790 114107 F


CPn0369414351 415562 F CT058 hypothetical protein_2


CPn0370415800 416912 F CT058 hypothetical procein_3


CPn0371417147 417503 F


CPn0372417687 418001 F


CPn0373418380 420218 F gcpE-ICT057)


CPn0374420218 420961 F CT056 hypothetical protein


CPn0375421121 411615 F


CPn0376421854 422294 F


CPn0377423438 422347 R suc8_1-Dihydrolipoamide Succiayltransferase_1-ICT055)


CPn0378426168 423445 R aucA-Oxoglutarate Dehydrogsnase-ICT054)


CPn0379426322 426765 F CT053 hypothetical protein


CPn03H0426758 427876 F hemN_1-Coproporphyrinoqen III Oxidase_1-ICT052)


CPn0381429809 428037 R CT326 similarity


CPn0382430719 470036 R yabC/yraL-SAM-Dependent Methytransferase-(CT048)


CPn0383431693 430749 R CT047 hypothetical protein


CPn0384432377 431862 R hcte-Histone-like Protein 2-(CT016)


CPn0385434018 432522 R pepA-Leuryl Aminopeptidase A-fCTOdS)


CPn0386434525 434046 R ssb-SS DNA Binding Protein-ICTOd4)


CPn0387435196 431699 R CT043 hypothetical protein


CPn0388435329 437320 F qlgX-Glycogen Hydrolase Idebranchiag)-ICTOd2)


CPn0389438134 437319 R CTOdl hypothetical protein


CPn0390439144 438134 R ruvH-HOlliday Junction Helicase-(CTOdO)


CPn0391439692 439510 R


CPn0392439811 440383 F dcd-dCTP Deaminase-fCT039)


CPn0393440379 440723 F CT038 hypothetical protein


CPn0394440736 441968 F tlyC_1-CBS Domain protein (Hemolysin
Homolog)_1-fCT256)


CPn0395441964 443175 F CT257 hypothetical protein


CPn0395444353 443241 R yhf0-NifS-related protein-ICT258)


CPn0397445115 444381 R PP2C phosphatase family-tCT259)


CPn0398445533 445700 F


CPn0399445879 446523 F CT253 hypothetical protein


CPnOd00446536 447306 F CT254 hypothetical protein


CPnOd01117881 417195 R CT255 hypothetical protein


CPnOd02448994 447888 R mutt-Adenine Glycosylase-fCT1071


CPnOd03449015 419710 F yceC-predicted pseudouridine synthetase
~ family-(CT106)


CPnOd04450887 419871 R


CPa0d05451739 450966 R CT105 hypothetical protein


CPn0406451969 452865 F fabI-Enoyl-ACyl-Carrier Protein Reductase-fCT104)


CPnOd07453742 452858 R HAD superfamily hydrolase/phosphatase(CT103)


CPnOd08454105 454581 F CT102 hypothetical protein


CPn0109154645 455127 F CT260 hypothetical protein


CPn0410455123 455833 F dna~l-DNA Pol III Epsilon Chain_1-ICT261)


CPnOdll455833 456609 F CT262 hypothetical protein


CPn0412456590 457246 F CT263 hypothetical protein


CPn0413459203 457227 R msbA-Transport ATP Binding Protein-ICT264)


CPn0414460113 459172 R accA-ACCOA Carboxylase/Transferase
Alpha-fCT265)


CPn0115461498 160221 R CT266 hypothetical protein


CPn0416461856 461557 R himD/ihfA-Integration Host Factor
Alphn-tCT267)


CPn0117463035 462244 R nmiA-N-Acetylmuramoyl Alanine Amidase-fCT268)


CPn0118464401 462953 A murE-N-ACetylmuramoylalanylglutamyl
DAP Lipase-tCT269)


CPn0419466834 464876 R pbp3- transqlyeolase/transpeptidase-tCT270f


CPnOd20467108 466824 R CT271 hypothetical protein


CPn0121467998 467108 R yabC-PHP2H Family.methylcransferase-ICT'272)


CPn0122db8242 46.8784F CT273 hypochatical protein


CPn0423468791 469216 F CT271 hypothetical protein


42


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
CPn0t2d169612470961 F dnaA_2-Replication Initiation Factor_c-ICT1751


CPn0425470980!71564 F CT276 hypothetical proteins


CPn0426472111471536 R CT277 similarity


CPn0427472207473715 F nqrZ-NJ1DH fUbiquinonel DehydroQenase-;CTZ781


CPnOt2847372247681 F nqr3-NJ1DH Itlbiquinonel Oxidoreductass,
Gamma-tCT2791


CPn0129471681475319 F nqrl-NADH It7biquinonel Reduetase 1-fCT1801


CPn0130475326476093 F nqr5-N1~DH ttlbiquiaonel Reductase 5-ICT281)


CPn0131476183176151 R


CPn0t32176816476514 R


CPn0133477273476929 R QesH-Glycine Clsavape System H Protein-ICT2821


CPn0134179462477276 R CT2B3 hypothetical Drotein


Cln0t3548090247975 R Phospholipase D superfamily (uncleavable
leader peptide)-(CT38t


CPn0t36481618180902 R lpl~-LiDoau Protein LiQase-Like Protein-(CT2851


CPn0137481816184350 F clpC-ClpC Protease-ICT1861


CPn0138185116181334 R yebF-PP-loop superfamily aTPase-ICT287)


CPn0139485553486077 F


CPn0ta0486105486710 F


CPn04t1486891187838 F CT007 hypothetical protein


CPn0t42188013188528 F Ct006 hypothetical protein


CPn043!88729189979 F CT005 hypochecieal protein


CPn0114190187191507 F mnp_6-POlymorphic Outer liembrasse Protein
G/I Family


CPn0115194772197579 F pn~_7-Polymorphic outer ltembraae Protein
C Family


CPn0446197626500115 F pmD_8-Polymosphic Outer Hembrane Protein
G Family


CPn0147500568503351 F ps~ 9-Poiyaarphic Outer Membrane Protein
G/i Family


CPn01t8501810503698 R yxjC~s_2 Hypothetical Protein


CPn01t9507131505330 R pmp_10-P!!P_10 tlrame-shift with 0151)


CPn0150508112507180 R pmp_10-POlyaasphic Outer Membrane Proteia
G Family


CPn0t51508275511058 F ymp_11-Polyaasphic Outer !lembrane Protein
C Family


CPa0152511319512860 F pmp_12-POlymorphie Outer Hembrans Protein
11/I Famfly ltruncated)


CPn0453513234516152 F pmp_13 -POlymorphic Outer Hembrane Protein
C Family


CPn015d516182519115 F pmp_14-POlymorphic Outer Membrane Protein
H Family


CPn0155520348519458 R


CPa0t56521532520337 A


CPn015751386552=120 R


CPn0458526310521136 R


CPn0t59517005526619 R


CPn0460527840526992 R


CPn0461528638527811 R


CPa0t6Z531052519037 R


CPn0463532357531191 R


CPn0t64531842532366 R


CPn0465533212532871 R


CPn0466533724536537 F pa~_15-Polymosphic outer Membrane Protein
E Family


CPn04b7536633539434 F pop_16-Poiymorphic Outer M~bsane Protein
E Family


CPn0168539632540132 F pmp_17-Polymorphic Outer Membrane Proteia
E Family


CPn0t69540399511160 F pmp_17-POlymorphic Outer Membrane Protein
(Frame-shift with 01691


CPn0t705!1357512532 P pmp_17-Polymorphic Outer Membrane Proteia
(Frame-shift with 01701


CPn0t715!2564515401 F pn~_18-Polymorphic outer Membrane Protela
EIF lamily


CPn0t72517905515581 R


CPn0473519593548070 R


CPn0171551573519807 R CT365 hypothetical protein


CPnOt755538!4551685 ~ Q198-Gluean Hranchir~ Lnzyme-ICT8661
R


CPa0176551844553858 R CT865 hypothetical proteia


CPn0t77556106551814 R yqsV_8s Hypothetical Protein


CPn0478557615556210 R hilX-GTP 8indiaQ Protein-tCT3791


CPn0179558125557616 R phnP-Metal Dependent Nydrolase-ICT3801


CPa0t80559301558650 R CT383 hypothetical protein


CPn0t81560946559339 R


CPa0482561737560961 R artJ-7lrQinina Periplasmic 8indinQ Protein-tCT3811


CPn0t8356183656961 F


CPn0484564970565824 F aroC-Deoxyhepconats Aldolsse-ICT3B21


CPn0t85566038566129 F CT382.1 hypothetical protein


CPa0t86567781566105 R hypothetical proline permease


CPn0487569740568112 R CT384 hypothetical protein


CPa0t88570096569767 R hitA-HIT Family Nydrolase-ICT3851


CPnOt89570965570096 R CT386 hypothetical protein


CPn0490571279573333 F CT387 hypothetical pzotein


CPn0191571352577336 R CT389 hypothetical Drotein


CPa019Z571652571804 F


CPn0193575004571855 R


CPnOtS1575364575146 R


CPn0495575607576793 F aspC-l~spartate Aminotran:ferase-ICT3901


43


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
CPn0196576793 57712 F CT391 hypothetical protein


CPn0197571069 5771=0R CT388 hyposhscical protein


CPn0198579035 5705 R


CPa0199580359 579=05R


CPn0500580559 581363F pros-Prolyl tRNA Synchetass-ICT393)


CPn0501SA=57 563550F hreA-HTH Transcrzpcional Repressor-ICT39t1


CPa0502563550 SA1=01F qrpt-HSP-70 Colactor-ICT395)


CPa05035613 55113 F dasK-HSP-70-tCT396)


CPa050t56587 56151 F vacD-riboaueluse family-ICT397)


CPn0505586519 SA9105F 3-aeehyladsnins DNA qlycosylass


CPn0506569172 56940 E CT4=1 hypothetical protein


CPn0507589961 590112F CT121.1 hypothetical protein


CPa050A59012 590300F CTt=1.= hypoctsetical protein


CPn0509590335 590108F IDredietsd Metallosazyme)-ICTt33)


cPn0510590113 591973F ClyC_3-C8S Damaias tHamolysin homoloq)_2-ICTt33I


CPn051159111 59118 F rsbV_1-Siqsa Rspulatory Factor_1-ICZIII)


Cpn051259=553 59113 F CT115 hypothetical pzoteiss


CPn0513591517 593753F Fs-8 oxidorsduetass_1-ICT=6)


CPn051d5957=9 596!=0F Ct117 hypothetical protein


CPn0515595192 597111F obit-Ubiquiaone Mschyltraa:fsrase-ICT138)


CPa0516598111 597255R


CPa0517599531 59795 R


CPa0518600103 59933 A CTt29 hypothetical protein


CPa051960167 60090 R dap!-Diaminopioulace tpimerass-ICTt30)
~


CPa0520601=18 601616R elpP-CLP Protuss-lCTt31)


CPn0511603797 60331 R qlyA-Ssrine Hydsoxymsehylcraasferass-tCTt37)


CPa0522503987 601655F CTt33 hypothetical protein


CPa0523604733 505052F


CPa051t605103 606179F


CPn0525505532 607=83F CT398 hypothetical protein


CPn05Z6601696 607710R yrbH-GutO/lCpsd Tamily Suqar-P Isomerase-ICT399)


Cln0527609l0a 607=6 R sucs_Z-Dihydrolit~oasids Succinyltrsnsferase_2-tCTt00)


CPa0538611162 509931R qltT-tilutaaate Sympore-tCT101)


CPUQ5I961==59 511165R yeah-ATPass-IGTtOt)


CPn0530613=51 61160 R spotJ_1rRNA Hstlsylass_1-ICT103)


CPn0531511069 613315R S1v!! dependent msthyltransfsrus-1CT101)


CPa0532611674 61075 R ribC/risA-Riboflavin Syutbaas-ICT105)


CPn0533611930 61335 F~ CTt05 hypothetical protein


CPa053t515113 51578 F dksA-Dnalc Suppressor-lCTt07)


CPn0535615793 616395F lspA-Lipoprotaia Sisal Peptidase-tCT108)


CPn0535616315 617591F daQA_1-D-Ala/Gly Psrmsase_1-tCTt09)


CPa0537617633 611169F CTtll.l hypothetical protein


Cln0538618212 51511 F C?d1t hypothetical protein


CPn0539616705 611515F pmp_19-polysoorphic outer membrane protein
A family -ICT112)


CPn0510521590 626862F pmp_20-polymorphic outer membrane protein
a Tamily-ICT113)


CPa05t1617170 6=003 F Solute binding protein I-ysbL-Synschoeyscis
Adheein Haeoloq)-tGTtlS)


CPa05t2526003 6=737 F JIaC Transporter ATPass-1CT116)


CPa0513531735 619603F IMStal Traosporc Protein)-iCTtl7)


CPn051a630529 629525A yhbL-GtP binding protein-tCTtlA)


CPa05t5630tea 630533R r117-L=7 ribosomal protein-tCTtl9)


CPn0516631=Z9 630911R rlll-LZl Ribosaul Protein-IGTt=0)


CPn05t7631661 631188~ yqbs family-ICT131)
F


CPn0518533=31 631191R eysJ-Sulfite RsductaseICT435)


Cln0519633669 ' 53355R rsl0-SIO Ribosomal Protein-ICTt35)


CPn0550635561 633560R lusA-tloaqation Factor G-tCTt371


CPn0551638166 635596R rs7-S7 Ribosomal Protein-tCTt381


Cltt0552635587 535=19R rsll-512 Ribosomal Protein-ICT1391


CPn0553537717 53812 R


CPa0551637651 636111F C?tt0 hypoehstieal protein


CPn0555531=9B 50211 F tsp-Tail-SDseific Protusr ICTtl1)


CPa05566t091~ 610325A cspA-lSkDa Cysteins-Rich Protein-ICTt1=)


CPSf055761161 611191R omcD-60kDa Cysceins-Rieh Outer Membrane Complex
Protein-tLTtl3)


CPn0558613300 613031A omcA-9kDa-Cyscsine-Rich outer Membrane Complex
Lipoprotein-ICTttt)


CPn0559613712 53927 F CTt41.1 hypothetical prouin


CPn0560515612 611098R qlGX-Clutamyl-cRNA Synchetass-ICTtlS)


CPn056i6510 6571 R euo-CHLPS too Protein-ICTtt51


CPn056268036 615918R CHLPS t3 k0a prouin honwloq_1


CPn0563650056 611=97A recJ-ssDNA txonucleaas-tCT117)


CPn0561651350 650115R seeDisseF-Protein Export Proteins SeeD/SeeF
Itusionl-ICTItB)


CPn0565655530 65533 R CTIt9 hypothetical Drocein


CPn056665511 656890F yaeS family-tCTt50)


CPn0567655191 657817F cdsa-PhospMCidacs Cytidylytransferaes-lC:t51)


44


CA 02350775 2001-05-11
WO 00127994 PCT/US99/26923
CPn0568657817 658161 F cdsA-Priosphacidaee cytidylytransierast-lCTt52)


Cln05696516 659099 F plat-Glycerol-3-P Aeylcranslesue-ICT153)


CPa0570659107 660789 F arg8-Argsnyl tJtNA Transierase-ICT451)


CPn0571662122 660719 R musA-tJDP-N-Aeetylglucosamine Transierase-ICT1551


CPn0572662352 661616 F CT156 hypothetical protein


CPn0573665101 661191 R yebG lamily-ICT157)


CPn057t665915 665391 R


CPa057566619 665182 R YhhY-Amino Group Acetyl Transisrast-(CT58)


CPn0576667513 666191 R pri8-Peptide Chain Release Faecor 2 tnacural
tTGA irawe-shift )-(CT155


Cla0576657598 667530 R pri8-Inatural UGA trams-shift 1


CPe105776b7195 561155 F SWIG tYH7t) coaoplex protein-ICT601


CPa0578668106 689365 F yaeI-phosphohydrolase-(CTt61)


CPn0579bbl361 669993 F ygbP/yaeH-Sugar Nucleotide Phosphorylase-fCTt6Z)


CPn0580669993 670793 F truA-Pseudouridylate Syntbase I-ICTt63)


CPn0581b7113t 670715 R Phosphoglycolace Phosphatase-(CTt6t)


CPa058Z671503 672177 F CT165 hypothetical Drotsln


CPn0583671100 671717 F CT166 hypoehetieal protein


CPn0584671707 673798 )' aco8/atr8-Z-Component 8ansor-ICTt67)


GPa0585675817 673855 F: similarity co Cps laeA_Z


CPa0586676026 677183 F' atoC/ntrC-Z-Component Regulator-fCTtbB)


CPn0587677ta1 671121 F yvyD~s conserved hypothetical protein


CPa0588678081 6786=6 F' CTt69 hyposbetieal protein


CPn0589671610 679795 F CT470 hypothetical proctin


CPn0590680112 679516 F CTt71 hypothetical protein


CPa0591680373 681010 F yagE family-tCTt7Z)


CPn059Z681153 611161 F yidD family-(CTt73)


CPn0593682176 681391 F. CTt7t hypothetical protein


CPn059468=583 681958 F pheT-phenylalaayl tRNA Synthetase Beta-(CTt751


CPn0595611958 615926 F CT176 hypothetical protsin


CPe10596615939 61bt57 F ada-mecbyltraasierase-(CTt77)


CPn0597681215 685179 R oppC~-Oligopeptide Psrmeast_Z-(CTt78)


Cla0598619697 611=19 R opp8_Z-Oligopepcide Ptsmease_Z-tCTt79)


CPn0599691802 681882 R oppl~5-oligopeptide 8indiag Lipoprotein-,5-(CT110)


Cln0100693117 691137 R


CP80i01693053 69=736 R CTt83 hypochetieal protein


GPn0i02691105 693101 R CTtet hypotheeical protein


CPa0603691305 695115 F hmZ-Fsrroehecalase-(CT415)


Cla060t695115 615196 A~ iliY-Glucamiae 8lading Procsin-(CTttb)


Cla0605691707 696150 R yhbd-Ilethylase -iCT187)


CPa0606617111 691707 R CTtlB hypothetical protsin


CPn0607698195 697573 A glpC-Olueose-1-P Adenyltransierase-1CT119)


CPn0608691615 699016 R -pyre-tJrid3ne 5'-HOnophosplsate Syntbass
It)a~ Sy~sthase)-truaeatad7


CPn0609699705 699916 F CTt90 hypochseical protein


GPn06i0T01tZ0 700029 R rho-Traascripcioa Terraisucion Factor-fCTt91)


CPa0611702025 701120 R yacE-predicisd phosphatase/kinase-(CTt9Z)


CPn0612701631 701022 R polA-DNA Polys~srise I-ICTt93)


CPa0613705656 701651 R soh8-Proteasr 1CT194)


GPa0611707102 705713 R adt~-ADP/ATP Transloease_Z-FCT195)


CPn0615701137 707634 R pgsA_1-Glycerol-3-P Phosphatidyltransisrase_1-fCTt96)


Cla0516708791 710137 F dnaD-Replieatlw DNA Heliease-ICTtl7)


CPn051771081 732316 F gidA-FADdependtac oxidoseduetase-ICTt98)


CPn061B711306 713010 F lplA-Lipoace-Protein Lipase A-ICTtl9)


Cln0619713114 713013 R ndk-Nucleoside-Z-P Ftinase-(CT300)


CPa0620711139 717519 R ruvA-Holliday Junction Heliease-1CT301)


CPa06Z1711617 711111 R ruvC-Grosswer Junction Endonuclease-fCT502)


CPn05Z2715752 711793 R CT503 hypocheeieal protein


CPn0633716993 7161b3 R CTSOt hypothetical protein


CPn0631711015 717011 R gapA-Olyeeraldthyds-3-P DehyroQetfase-)C'i"305)


CPn06ZS711115 711060 R r117-L17 Ribosomal Procsin-ICT506)


CPn0iZ6711616 718495 R rpoA-RNA Polymerase Alpha-(CT507I


CPa06Z7720018 719610 R rsll-S11 Ribosomal Protein-ICT508>


CPit06Z8720128 720063 R rsl3-513 Ribosomal Protein-ICT509)


CPa06Z9721157 720117 R seeY-Translocase-ICT510) '


Cla0630:22316 721815 R r115-L15 Ribosomal Protein-(GT511I


C1n0631722106 722312 R rs5-S5 Ribosasul Protein-ICTS1Z)


CPn0632723195 721127 R r111-L18 Ribosomal Protein-fCT513)


CPn0633723757 733209 A rib-L6 Ribosomal Protein-lCT511)


CPn063172115 7=3717 R rs1-S8 Ribosomal Protein-ICT515)


CPn0635721715 721206 R rl5-LS Ribosomal Protein-ICTS16)


C?n0536725012 721750 A rlZt-L21 Ribosomal Protein-(Cl'S171


CPn0637725161 ~Z1099 R r111-L11 Ribosomal Protein-(C:5-8)


CPn053B~Z57t7 725190 R rsl7-817 Ribosomal Procein(C519)




CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
CPn0539725958 725743 R r129-L29 Ribosomal Protein-fCT520)


CDn0640725377 725961 R r116-L16 Ribosomal Protein-fCT521)


CPn0541727077 725109 R rs3S3 Ribosomal Protein-fCT522)


CPn0542727428 727096 R r122-L22 Ribosomal Protein-fCT523>


CPn0643727713 727450 R rsl4-519 Ribosomal Protein-fCT521I


CPn05t4728573 727722 R r12-L2 Ribosomal Protein-(CT525)


CPn0545728930 728598 R r123-L23 Ribosomal Protein-fCT526>


CPn0546729621 728950 R r14-LI Ribosomal Protein-fCT527)


CPn0647730331 729657 R r13-L3 Ribosomal Protein-fCT328)


CPn0518731603 730605 R CT529 hypothetical protein


CPn0649732572 731710 R fmc-Nechioryl eRNA Fornylcransferase-fCT530)


CPn0650733501 731665 R lpx~1-Aey1-Carrier tIDP-GlcNAe -fCT531)


CPn0651733975 733317 R fabt-!lyriseoyl-hcyl Carrier Dehydratase-fCT532)


CPn0652731835 733990 R lpxC-Myriscoyl GlcNae Deacttylase-fCTS33)


Cla0653736490 731868 R eucE-Apolipoprotein N-l~cetyleransferase-fCT534)


CPn065t735957 735503 R vdlD/yciA-aeyl-CoA Thiossterase-ICTS35)


CPn0655737847 737101 R dnaQ_2-DNA Pol III Lpsilon Chain_2-fCT536)


CPn0656737872 738048 F


CPn0657738473 738051 R yjeE (I~TPase or Kinase)-fCT537)


CPn065A739168 738455 R CT538 hypothetical proton


CPn0559739533 739838 F trxh-Thioredoxin-ICTS39)


CPn0660710327 739860 R spoD_2-rRNa Ntthylass_2-fCT540)


CPn0661741100 740327 R mip-FKeP-type pepcidyl-prolyl cis-crane
isomerase-;CTStl)


CPn0662742923 741172 R asps-l~spartyl tRNA Synthetase-fCT5t2)


CPn0563744190 742901 R hiss-Hiscidyl tRNR Synthetase-fCT5t3)


CPn0664744757 744557 R


CPa0665745001 716365 F uhpC-Hexosphosphate Transport -fCT541)


CPn0666746388 750107 F dnaE-DNA Pol III Jllpha-fCT515)


CPn0567751058 750177 R predicted 0lIP (leadar f17)-fCT516)


CPn0558751209 752162 F CT547 hypothetical protein


CPn0559752179 752775 F CT548 hypothetical protein


CPn0670732765 753196 F rsbN-sigma regulatory factor-hiscidine
kiaase-fCT519)


CPn0571753530 753205 R CT550 hypothetical protein


CPn0672753741 755018 F dacF(pbp5)-D-hla-D-Ala Caroxypeptidase-fCT551)


CPn0673755287 755163 F CT552 hypothetical protein


CPn0574755568 755577 R fmu-RN1~ Hechyltransfezase-fCT553)


CPn0675757919 756768 R CT69b hypothetical protein


CPn0676759217 758051 R~ homologous to CT695


CPn0677750401 759256 R


CPn0678751320 760582 R


CPn0679762930 761725 R pqk-Phosphoglyesrate Kinase-fCT693)


CPn0580764248 762971 R yqo4-Phosphate Permeast-ICT692)


CPn0681764929 764258 R CT691 hypothetical protein


CPn0582761984 765955 F dppD-A8C ATPaee Dipeptide Transport-fCT690)


CPn0583765948 766919 F dppF-A8C ATPase Dipeptide Transport-ICT6891


CPn0684768038 767181 R spoJ/par8-Chromosome Partitioning Protein-fCT588)


CPn0585768068 768217 F


CPn0686758361 768176 R


CPn0687758564 769214 F CT482 hypothetical protein


CPn0688769382 770137 F CT481 hypoehacieal protein


CPn0689771104 770187 R yfh0_1-NilS-related Jlminotransferast_1-ICT687)


CPn0590772580 771136 R AeC Transporcsr tiembrane Protein-fCT685)
~


CPn0691773452 772685 R abcX-R8C Transporter llTPase-fCT685)


CPn0592774912 773161 R J18C Transporter-fCT6Bt)


CPn0593776256 775240 R TPR Repeats to-Linked G1CNJIC Tzansferase
similarity!-fCT683)


CPn0594779599 776330 R pbp2-P8P2-cransqlycolase/cranspepcidase-fCT582)


CPn0695780216 781382 F ompA-Major Outer Nambrane Protein-fCT681)


CPn0696781769 782599 F rs2-S2 Ribosomal Protein-ICT5801


CPn0697782602 783447 F csf-Elongation Factor TS-ICT679)


CPn0698783458 784201 F pyres-UHP Kinase-fCTB79)


CPn0599784182 784721 F rrf-Ribosome Releasing Factor-ICT677)


CPn0700785097 785609 F CT676 hypothetical protein


CPn0701785599 786672 F karG-Arqinine Kinase-fCT675)


CPn0702789685 786929 R yscC/qapD-YOp C/Gen Secretion Protein
D-fCT67d)


CPn0703791190 789685 R pkn5-S/T Protein Kinase-fCT677)


CPn0704792321 791209 R fllN- Flaqellar Motor Snitch Domain/YSeQ
family-fCT672)


CPn0705793173 792334 R CT671 hypothetical protein


CPn0706793683 793180 R CT670 hypothetical protein


CPn0707795029 793704 R yscN-Yop N lFlaqellar-Type ATPase)-fGT569)


CPn0708795705 795034 R CT668 hypochecicnl protein


CPn0709796188 795742 R CT667 hypothetical protein


CPn0710796461 796210 R CT666 hypott:ecical protein


46


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
CPn0711796771 796186 R CT665 hypochecicai protein


CPn0712799315 796781 R FICA domain: homology to adenylace eyelase)-fCT664)


CPn0713799721 799332 R CTb63 hypothetical protein


CPn0714801107 800091 R haM-Glutamyl tRNA Reductase-1CT662)


CPn0715801657 803462 F gyre_2-triJA Gyrase Subunic 8_2-1CT661)


CPn0716803469 801902 F gyrA_2-DNA Gyrase Subunit A_2-fGT6601


CPn0717805010 805306 F CT656 hypothetical protein


CPn0718805309 805626 F CT657 hypothetical protein


CPn0719805916 806890 F sth8-lPseudouridine Synthase)-1CT658)


CPn0720807003 807236 F CT659 hypothetical protein


CPa0721807683 808489 F kdsA-KDO Synthetase-1CT6551


CPa0722808489 808974 F CT654 hypothetical protein


CPn0723808984 809703 F yhbG-AHC Transporter ATPase-(CT653)


_CPn0724810527 809706 A


CPn0725810811 810387 R C?652.1 hypothetical procsin


CPn0726813372 810880 R CT620 hypothetical protein


CPn0727813577 816192 F CT619 hypothetical protein


CPn0728818477 816525 R CIiLPN 76k0a HomoloQ_1 (CT6221


CPn0T29819857 818592 A CHLPN 76kOa somolog_2 tCT623)


CPn0730821603 818963 R mviN-Integral !lembrane Protein-(CT624)


CPn0731821587 821760 F


CPn0732822098 822976 F ato-Endonuclease IV-(CT625)


CPn0733823727 823101 R rs4-S4 Ribosomal Protein-fCT626)


CPa073d823914 824915 F yceA-ICT627)


CPn0735825668 825003 R pyrH/udk-Uridine Kinase fUridine lionophosphokinase)
(Pyrimidine


Ribonucleoside Kfnasel.


CPn0736827686 825992 R ygeD-Lttlux Protein-(CT641)


CPn0737827685 830756 F recC-Exodeoxyriboauclease v, Gamma-fCT640)


CPn0738830746 833895 F race-Exodeoxyribonucluse V, Heta-(CT639)


CPn0739834871 833861 R CT638 hypothetical protein


CPn0740836018 031861 R tyr8-Aromatic 871 Aminotransterase-(CTb37)


CPt10741838350 836185 R greA-Transcription Elongation Factor-(CT636)


CPn0742838463 838888 F CT635 hypothetical protein


CPn0743838962 840762 F aqzA-Vbiquinone Oxidoraduccase. Alpha-(CT631)


CPa0714841384 840389 R heutB-POZphobilinogen Synchase-(CT633I


CPn0T45841903 841742 R


CPn0T46841975 843567 F CT632 hypothetical protsin


CPn074783675 843740 F~ CT631 hypothetical protein


CPn0747843725 843910 F CT671 hypothetical protein (frame-ahitt)


CPn0748844987 844121 A ispA-Geraryl Transtransterase-(CT628)


CPn0719845629 845006 R glsW-VDP-GlcNAC Pyrophosphorylase-ICT629)


CPa0750846411 845707 R tctD/epxR-fiTH Transeriptional Regulatory
Protein Receiver Doman-


ICT6301


CPn0751846606 848434 F CT651 hypothetical protein


CPn0752848601 850082 F reeD_2-fxodeoxyribonuelease V, Alpha_2-(CT6521


CPa0753851006 850161 R


CPn075d851336 851040 R rs20-S20 Aibososul Protein-(CT617)


CPn0755851597 852799 F CT616 hypothetical protein


CPn0756852961 854676 F rpoD-RNA POlymersss Sigma-66 -(CT615)


CPn0757854733 855134 F tolX-Dihydroneopterin Aldoiase-(CT614)


CPn0758855110 856459 F tolP/dhpS-Dihydropteroate Synthase-ICT613)


CPn0759856488 856997 F tolls-Dihydrotolace Reduecase-(CT6121
-


CPn0760856957 857694 F CT611 hypothetical protein


CPn0761857704 858375 F CT610 hypothetical protein


CPn0762859597 858539 R recA-ReG reeos~bination protein-(CT650)


CPn0763860511 859972 R ygtA-FOrmyltetrahydrotolace Cycloligase-fCT649)


CPn0764861807 860524 R CT648 hypochscical protein


CPn0765862382 861801 R CT647 hypothetical protein


CPn0766863782 862394 R CT646 hypothetical protein


CPn0767863881 864177 F CT645 hypothetical protein


CPn0768864159 865163 F yohI/nir3-predicted oxidoreduccase -(CT644)


CPn0769867733 865121 R topA-DNA Topoisomerase I-Fused to SWI Domnin-fCT643)


CPn0770868340 869131 F CT642 hypothetical protein


CPnOT71870163 869144 R rpoN-RNA Polymerase Sigma-54-(CT609)


CPn0772872385 870469 R uvrD-DNA Nelicase-fCT608)


CPn0T73872188 873195 F ung-Vracil DNA Glyeosylase-fCT607)


CPn0774873195 873425 F CT606.1 hypothetical protein


CPn0775871031 873414 R yggV family-ICT606)


CPn0776874246 875487 F CT605 hypothetical protein


CPn0T77875601 877178 F groEL_2-heat shock protein-60 -fCT604)


CPn0778877505 878092 F tsa/ahpC-Thio-specific Anuoxidanc (TSA) Peroxidase-
(CT6031


CPn0779878481 878095 R CT602 hypothetical protein


47


CA 02350775 2001-05-11
WO 00/27994 PCT/US99I26923
CPn07A0179205 871591 R papQ/amie-N-ACetylmuramoyl-L-111a Amidaae-CT601)


CPn0781879773 179191 A pal-PeDtidoqlycan-Associated Lipoprotein-ICT6001


CPn0782181065 879773 R tolH-polysaccharide transporter-ICTS991


CPn07AJ881115 881100 R CT59A hypothetical protein


CPn07B1812296 881892 R exbD-8iopolymer Transport Protein-ICT5971


CPh0785812991 881296 A exb8/tolQ-polysaccharide transporter-GT5961


CPa0786883185 815293 F dsbD/xprA-Thio:disulfide Interchange Protein-CT595)


CPa07A7885619 116401 F yabD/ycl:!-PHP superlamily luruse/pyrimidinasel
hydrolase-ICT5911


CPa07A8816542 887432 F sdhC-Succinace Dehydroqenase-fCT593!


CPa0789887139 889316 F sdhA-Succinate Dehydroqenase-ICT592f


CPn0790889330 890103 F sdhe-Succinace Dehydrogenase-ICT5911


CPn0791893050 190111 R CT590 hypothetical proceia


CPn0792894919 893108 R CT5A9 hypothetical protein


CPn0793196123 894919 R rbsU-sigma regulaeory family protein-PP2C
phosphatase IRSbW


ancaqoniscl-ICT5881


CPa0791897171 898001 F


Cla0795891128 899195 F


CPn0796899301 901310 F


CPn0797901600 902694 !


CPa0791902116 903156 F


CPa079990916 903910 R


CPn0800906532 905249 R eno-ISSOlase-ICT587)


CPa0801908697 906727 R uvrn-Fxiauclease AeC Subunit H-ICT5861


CPn0102909740 908709 R CrpS-Trypeophanyl CRNA Synthetaae-(CT5151


CPn0A03910303 909752 R CT58d hypothetical protein


CPa010d911059 910310 R qp6D-CitLTR Plasmid Paraloq-ICT583)


CPn0105911831 911067 R miaD-chromosome partitioning ATPase-CHLTR
plasmi.d protein GPSD-ICT5ti2)


CPn0106913771 911867 R thrS-Threo~l tRNA Syachecaae-ICT5811


CPn0A07913971 91879 F CTSAO hypothetical proeein


CPn010A916287 914956 R CT579 hypothetical protein


CPn0A09917785 916307 R CT578 hypothetical protein


CPn0110918111 917825 R GT577 hypothetical protein


CPn0111918900 918308 R lesti_1-Low Ca Response Proeein H_1-ICT5761


CPa0812919123 910162 F mucL-DNA tdiamstch Repair-ICT5751


CPa.0A13920870 921934 F pepP-Aminopeptidase P-ICTS7df


CPn011d922107 933357 F CT573 hypothetical protein


CPn0815923361 9=5622 F gspD/pilQ-Gen. Secretion Protsfn D-ICT5721


CPa0A16925615 927102 F~ gspE-Gen. Secretion Protein E-fCT571)


CPa0A17927115 928287 F gspF-Gea. Secretion Protein F-ICT570I


CPa081B928314 92868? F predicted OtiP (leader 1161 peptide)-CT5691


CPn0119928619 929132 F CT56A hypothetical protein


CPa0820929120 929659 F CT567 hypothetical protein


CPn0821929667 930668 F CT566 hypothetical protein


CPn0122930756 931229 F CT565 hypothetical protein


CPa0823932367 931501 R yscT/spaR-YopT Tranlocation T-ICT5641


CPa0121932662 932378 R yscS/IliQ-YOpS/IliQ Transloeation Protein-fCT563)


CPn0A25933594 932677 R yscR-YOp Transloeation R-ICTSB2)


CPn0826934310 933612 R yscL-YOp Ti:anslocation L-ICT5611


CPn0127935264 934434 R CT560 hypothetical protein


CPa0828936771 935267 R yacJ-Yop Traaslocation J-ICTS59)


CPn08299367da 937298 F


CPn0830937441 937959 F


Cln0831938267 938434 F


CPn0132939747 938827 R lipA-Lipoace Synchetase-ICT55A1


CPn0A33941129 939747 R lpdA-Lipoamide Dehydrogenase-ICT557)


CPa0A31941553 942014 F CT556 hypoehecieal protein


CPn0835915689 962015 R motl_1-SWI/SNF family helicase_1-ICTS551


CPa0A3696879 95722 R brnQ-Amino Acid (Branched) Transport-iCT55d1


CPn0837917771 917115 R nth-F.nodnueluse III-ICT697)


CPa0838949106 97781 A thdF-Thiophene/Puran Oxidation Protein-1CT698)


CPn0839949257 950159 F psdD-Phosphatidylserine Oecsrboxylase-ICT699)


CPa0Ad0950222 951541 F CT700 hypochetlcal protein


CPn08d1951771 95640 F secA_2-Translocase SecA_2-ICT701)


CPa01d2954883 954710 R CT702 hypothetical prosain Ilrame-spilt with
0843)


CPn0813955191 951991 R CT702 hypothetical protein


CPn08dd956730 955270 R yphC-CTPase/CTP-binding protein-ICT703)


CPn0A15951079 956150 R pene_1-Poly A Polymerase_1-fC:70d1


CPn0816959371 958112 R clp%-CLP Protease ATPase-ICT7051


CPn0817959995 959387 R clpP-CLP Protease subunit-ICT7061


CPa0811961502 960177 R tig/murI-Triqqar factor-pepcidyl-prolyl isomerase-
ICT707)


CPn0819961781 965285 F ~tl_2-SWI/SNF family heliease_2-ICT7011


CPnC85996529) 966390 F m:eB-Rod Shape Proceirt-sugar %inase-1GT7091


48


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
CPn0A51 966396 96A195 F pckA-Phosphoenolpyruvate Carboxykinese-ICT710)
CPn0A5Z 968316 970613 F CT711 hypothetical protein
CPn0853 970637 971A03 F CT712 hypothetical protein
CPn0A54 972837 971806 R ompB-Outez Membrane Protein B-ICT713)
CPn0855 973995 972994 R gpdA-Glycerol-3-P Dehydrogenase-fCT711)
CPn0856 975377 973995 R Apx-1 Homolog-VDP-Glucose Pyrophoaphorylase-tCT715)
CPn0857 975757 975392 R CT716 hypothetical protein
CPn0858 977055 975757 R tliI-Flagellum-apeeitic ATP Synthase-(CT717)
CPn0A59 977588 977055 R CT7I8 hypothetical protein
CPn0A50 978630 977608 R tliF-Flagellar M-Ring Protein-ICT719)
CPn0851 979722 97A925 R nitV-NitV-related protein-IC:"720)
CPn0862 980873 979722 R yth0_2-Nits-relaeed protain_2-tCT721)
CPn0A63 981514 980831 R pgmA-Phosphoglyeerate Mutase-ICT722)
CPn0A5d 981670 982374 F yjbC-predicted pseudouridine synthase-1CT7231
CPn0A55 98241A 982912 F CT724 hypothetical protein
CPn0866 9A3491 982916 R birA-Biotin Synthetase-ICT725)
CPn0867 983t23 984667 F rodA-Rod Shape Protein-1CT726)
CPn0868 986613 981670 P. zntA/cadA-Metal Transport P-type ATPase-ICT727)
CPn0869 987401 986658 F. CT728 hypothetical protein
CPn0870 988728 987!48 F. serS-Seryl cRNA Synthecase_2-ICT7291
CPn0871 988772 989899 F' ribD-Riboflavin Deaminase-ICT730)
CPn0872 989963 991216 F' ribA4ribe-('TP Cyclohydratase i DHHP Synthase -
ICT731)
CPn0873 991233 991694 F ribF:-Ribicyllumazine Synthase-ICT732)
CPn0871 993107 991719 F CT733 hypothetical protein
CPn0A75 993372 994022 F CT734 hypothetical protein
CPn0876 99!144 995517 F dagA_2-D-Alanine/Glycine Permease_2-ICT735)
CPn0877 995533 995982 F ybcL family-ICT7361
CPn0878 996654 995992 F SET Domain protein-ICT737)
CPn0A79 997439 996645 R yycJ-metal dependent hydrolase-ICT73A)
CPn08B0 999A61 9971!! R ttsK-Cell Division Protein FtsK-fCT739)
CPn08A1 1005667 1006209 F
CPn0A82 1006268 1007~04 F
CPn0A83 1008865 1007573 R dmpP/nqr6-Phenolhydrolase/NADH ubiquinone
oxidoreduetase-(027!0)
CPn0A8t 1009359 1009009 R CT7t1 hypothetical protein
CPn0885 1010635 1009433 R ygcA-rRNA Methyltransterse-IGT742)
CPn08Bb 1011276 1010908 R hetA-Histone-Like Developmental Protein-fCT7t3Y
CPn08A7 1011692 101!157 F CHLTR possible phosphoprotein-ICT7lt)
CPa0A88 1015423 1011119 R- hemG-protoporphyrinogen Oxidase-ICT?15)
CPn08B9 1016835 I015t62 R hemN_2-Coproporphyrinogen III Oxidase_2-ICT746)
CPn0890 1017805 1016819 R hemE-Uroporphyrinogen Decarboxylase-ICT747)
CPn0891 1021073 1017A19 R mtd-Transcription-Repair Coupling-ICT71A)
CPn0892 1023661 1021016 R alas-Alanyl CRNA Synchecase-ICT719)
CPn0893 1023894 1025A88 F cktH-Transkecolase-IGT750)
CPn0894 1026766 10258AA R anus-AMP Nucleosidase-fCT751)
CPn0A95 1026988 1027557 F efp_2->=longation Factor P_2-fCT752)
CPn0896 1027595 1027822 F CT753 hypothetical protein
CPn0A97 1028737 1027853 R (possible phosphohydrolasel-ICT75t)
CPn0898 1030~60 1028904 R Mitochondrial HSP60 Chaperonin Homolog-ICT7551
CPn0899 1030875 1032215 F murF-MUramoyl-DAP Lipase-fCT756)
CPn0900 1032235 1033281 F mraY-MUramoyl-Pentapeptfde Transterase-ICT757I
CPa0901 1033287 1031537 F murD-Muramoylalanine-Glutamate Lipase-ICT7581
CPa0902 1034513 1035211 ~ F nlpD-Muramidase finvasin repeat family>-It:T759)
CPn0903 1035263 1036417 F ttsw-Cell Division Protein Ftsw-fCT760)
CPn090d 1035326 I037396 F murG-Pepcidoglycan Transterase-ICT761)
CPn0905 1037109 1039835 F murCiddlA-Huramace-Ala Lipase 4 D-AJ.a-D-Alum Ligass-
fCT762)
CPn0906 1040310 1039915 R CT763 hypothetical protein
CPn0907 I0407B0 1010!45 R ~cutA Periplasmic Divalsnt Cation Tolerance Protein
CutA IC-Type
Cytoehrome Biogenesis Procainf
CPn0908 1041589 1040780 R CT761 hypothetical protein
CPn0909 10!1537 1041966 F rsbV_2-Sigma Factor Regulator_2-fCT765)
CPn0910 1041979 1043004 F 'miaA-tRNA Pyrophosphate Transterase-ICT766)
CPn0911 10!1043 1012985 R Fe-S cluster oxidoreduetase_2-ICT767)
CPn0912 1014129 10<5760 F GT76B hypothetical protein
CPn0913 :045760 1015945 F
CPn0914 1045999 1016397 F
CPn0915 1015461 1016817 F ybeH-iojap supertamily ortholog-tCT769)
CPn0916 1016837 1018084 F tabF-Acyl Carrier Protein Synthase-ICT7701

CPn0917 10!8090 1018539 F hydzolaseiahosphacase homolog-tCT771I
CPn0918 1049223 1048579 R ppa-Inorganic Pyrophosphatase-tCT773)
CPn0919 10!9378 1050430 F ldh-Leuciae Dehydrogenase-tCT777)
CPn0920 1051405 1050431 R eys0-Sul:::e Synchesis/biphosphate phosphatase-
ICT774)
CPn0921 1051535 1052293 F snGlycezoi-3-P Acylczans:erase-fCT775)
49


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
CPn092210523141053927F ass-ACylplycerophosphoechanolamine Acycransferass-
ICT776)


CPn092310539841055093F bioF_1-Oxononanoaca Synthase_1-ICT777)


CPn092410572741055028R priA-Primosomal Protein N' -fGT7781


CPn092510579001057226R G?779 hypothetical protein


CPn092610580601058557F Thioredoxin Disulfide Isomerase-ICT7801


CPa092710598091058670R CItLPS 43 kDa protein homoloQ_2


CPn092810610081059884R CHLPS 43 kDa protein homoloQ_3


CPn092910622921061186A CHLPS 43 kDa protein homoloy_4


CPn093010628571063330F


CPn093110641381065718F lysS-Lysyl tRNA Synthetase-(CT7811


CPn093210671421065721R cysS-Cysteinyl cRNA Synchetase-ICT7821


CPn093310675351068578F predicted disulfide bond isomerase-ICT783)


CPn093410689421068526R rnpA-Ribonuclease P Protein Componeat-fCT78d)


CPn093510690911068957R rl3d-L34 Ribosomal Proeein-ICT'785)


CPn093610693361069470F r136-L36 Ribosomal Proesin-ICT786)


CPn0937.10694961069798F raid-514 Ribosomal Protein-ICT787)


CPn093810703221069849R CT788 hypothetical protein -(leader 160) peptide-
periplasa~fe~


CPn093910707281071195F CT790 hypothetical protein


CPn09d010730121071204R uvrC-Excinueluse ABC. Subunft C-fCT791)


CPn09d110755011073018R stutS-DNA Mismatch Repair-ICT792)


CPn09d210759851077754F dnaC/prf!!-DNA Primsse-(CT7941


CPn094310779781078238F CT794.1 hypothetical protein


CPn094d10785121078997F


CPn094510790701079660F C'f795 hypothetical protein


CPn09d610827861079745R QlyQ-Glycyl tRNA Synthetase-ICT796)


CPn094710834421084059F pQsA_2-Glycerol-3-P-Phosphacydylcransfarase_2-ICT797)


CPn09d810854741084047R Q1QA-Glycogen Synthase-(CT798)


CPn09d910859291086483F etc-General Stress Protein-ICT799)


CPn095010864881087027F pth-Pepcidyl CRNA ttydrolase-ICT8001


CPn095110871221087157F rs6-S6 Ribosomal Protein-ICT8011


CPn095210874781087723F rsl8-518 Ribosomal Protein-fCT802)


CPn095310877421088218F r19-L9 Ribososial Protein-ICT8031


CPn095410882861088708P yehe-Predicted Kinase-ICTBOdI


CPn095510886121089175F Iframs-shift with 0951)


CPn095610895601090909F CT805 hypothetical proeein


CPn095710937881090963R ide/ptr-Insulinase family/Prouase ZII-fCT806)


CPn095810947851093793R pls8-Glycerol-3-P Acylcransferase-ICT8071


CPn095910963431094799R~ cafE-Axial Filament Protein-ICT80B)


CPn096010967641097102F CT809 hypothetical protein


CPn096110971181097297F r132-L32 Ribosomal Procsin-ICT810)


CPn096210973161098I75F plsX-FA/Phospholipid Synthesis Protein-ICT811)


CPn096310983981103221F pnq~_21Polymorphie Outer Membrane Protein D
Family-(CT812)


CPn096d11047581103301R


CPn096511067361104925R lpxe-Lipid A Disaccharide Synthase-(CT411)


CPa096611080371106718R pcnH_2-PolyA Polymerase_2-ICT4101


CPn096711085121109885F mrsA/pgm-PhosphoQlueomutase-ICT815)


CPn096811098951111721F QlmS-Glucosamine-Fructose-6-P Aminocransferase-ICT816)


CPn096911118121112999F 0969-CyrP_1-Tyrosine Transport_1-ICt817) tyrP_1-
Tyrosine
Transport 1-


ICT8I7)


CPn097011134611114648! 0970-CyrP_2-Tyrosine Transport_2-ICt818) tyrP_2-
Tyrosine
Tzansport_2-


Irreie)


CPn097111147021115115F yeeA-Transport Permease-(CT819)
'


CPn097211162991115430A ltsY-Cell Division Protein TtsY-ICT8201


CPn097311163701117527F sucC-Succinyl-CoA Synchacase. Beta-ICT821)


CPa097411175411118432F sucD-Succiuyl-CoA Synthecase. Alpha-ICT822)


CPn097511191041119637f


CPn097611200821121185F .


CPn097711213711122402F


CPn097811226651123693F


CPn097911239801125413F htrA-DO Serine Protease-ICT8231


CPn098011269821125501A similarity to Saccharomyees sersvisiae hypothetical
52.9KD protein


CPa098111270311129952F tint Metalloprotease linsulinase family)-ICT814)


cPn0982113119a1129962R yipN family-IGT825)


CPn098311320001131206R pssA-Glycerol-Serine Phosphacidyltransferase-ICT8261


CPn098411323791135510F nrdA-Ribonucleoside Reduccase. Large Chain-ICT827)


CPa098511355341136571F nrd8-Ribonueleoside Reduetase. Small Chain-ICT828)


CPn098611367241:37395F Y00H-Dredieted rRNA tdethylase-ICT8291


CPn098711375161138115F ytQB-like predicted rRNA methylase-ICT830)


CPn09881138986113805 R murB-UDP-N-AeecylsnolpyruvoylQlucosamine Rsduccase-
ICT8311


CPa098911391951139016R CT832 hypothetical protein


CPn099011398831140440F iafC-Initiation Factor ~-(CT8331


CPn09911140421111061?F r135-L35 Ribosomal Protein-IC':8341




CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
CPn099:11406341110996F r120-L20 Ribosomal Protein-ICT8351


CPn099311410141112030F pheS-Phenyialanyl tRNA Synthecaee. Alpha-ICTB361


CPn099d11423981141410F CT837 hypothetical protein


CPn099511455121111415R CT838 hypotheticnl protein


CPn099611165891145519R CT839 hypothetical protein


CPn099711467081147664F mssJ-PP-loop superfamily ATPase-ICT8401


CPn099811478551150584F ftsH-ATP-dependent zinc protease-ICT8411


CPn099911538471150766R pnp-Polyribonueleocide Nucieotidyltrnnsferase-fCT8421


CPn100011531571152891R rsl5-S15 Ribosomal Protein-tCT8431


CPn100111534051153869F yfhC-cytosine deaminase-ICT8441


CPn10021153862115089 F CT845 hypothetical protein


CPn100311517961154092R CT846 hypothetical protein


CPa100d1155397115879 R CT8d7 hypothetical protein


CEn100511559331155115R CT818 hypothetical protein


CPa100611564721155990R CT819 hypothetical protein


CPa100711566891156907F GT819.1 hypothetical protein


CPn100811569281158223! CT850 hypothetical protein


CPn100911590581158186R map-Hschionine Aminopeptidase-fCT8511


CPn101011596721159067R CT852 hypothetical protein


CPn101111603061159902R CT853 hypothetical protein


CPn101211621931160421R yzs8-AHC transporter permease-ICT8541


CPn10131162245. 1163624F fuaaC-Fumarats Hydraease-fCT8551


CPn101411654261163732R yehM-Sulfate Transporter-fCT8561


CPn101511656341166893F CT857 hypoctsecical protein !possible I1i
proteia)


CPn101611670421168898F CT858 hypothetical protein


CPn101711690061169935T lytB-Metalloproteass-ICT8591


CPn101811698981170629F


CPn101911721281170638R CT860 hypocheciesl protein


CPn102011736791172150R CT861 hypothetical protein


CPnI02111742131173698R lcrH_2-Low Calcium Response_2-ICT8621


CPn102211756'731174216R CT863 hypothetical protein


CPn102311760351176331F


CPn102411772361176334R xerD-InteQrase/ree~binase-fCT86d1


CPa102511773021178879F pgi-Glucose-6-P Isomsrase-ICT3781


CPa102611789971.179137F ltuA-ICT3771


CPn102711791751180755F


CPn102B11810161181999F s~dhC-palate Dehyropenase-ICT3761


CPn102911820081182844F


CPa103011838861182843R predicted D-amino acid dehyrogenaae-ICT3751


CPn103111855521184098R areD-Arginine/arnithine Antiporter-ICT374)


CPn103211861501185566R CT373 hypothetical proesin


CPn103311875001186187R CT372 hypothetical protein


CPn103411885171187732R Predicted OItP_1 ICT371I (leader f18) peptide]


CPn103511900001188570R AroE-Shikisnace 5-DehyroQenase-(CT3701


CPn103611911351189984R AroB-Dehyroquinate Synthase-IGT3691


CPn103711921991191123R AroC-Chorissiats Synchase-ICT3681


CPa103811927261192199R aroL-Shikimats Xinase II-fCT3671


CPn10391193999119=665R aroA-Phosphoshikimats Vinyltransfsrase-ICT3661


CPn101011947411194073R


CPn104111959941194726R bioA-Adsnosylmtthionine-8-Amino-7-Oxononanoats
Aminotrutsferase


CPn1042X1965901195934R bioD-dechiobiotin synehecass


CPa10431197717119657?R bioF_l-Oxononanoats Synchass_2
~


CPn104411986911197699R bioH-Biotin Synthase


CPn104511995901198901R conserved hypothetical bacterial membrane
protein


CPn104612006751199590R TSyptophan Hyroxylase


CPn104712005521201343F dap8-DihydrodiDicolinace Reduetasa-ICT364f


CPn104B12016061202604F asd-ASpartate DehydroQenase-ICT3631


CPn101912025951203914F lysC-ASpartokinass III-fCT3621


CPn105012039261104798F dapA-Dihydrodipieolinace Synthase-ICT3611


CPn105112049621205270F


CPn105212054171206169F


CPn10531=061531206701F


CPn105d12070341209466F


CPn105512096941210521F


CPn105612105271211228F


CPn105712111971213596F CT156 hypothetical protein


CPn105812137481214836F CT355 hypothetical protein


CPn105912148481215678F kpsA-Dimethyladenosine Transferase-fCT3541


CPn106011176581215727R dxs/tkt-Transketolase-ICT3311


CPn106112179201217666A CT330 hypothetical protein


CPn106212198201218159R xseA-Exodoxyribonucluse VII-ICT3I91


CPn106712199511220712F cpiS-Triosephosphate Isomerase-(CT3281


S1


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
CPn106112=07191=20895F


CPsa105512210951=20928R


CPa106611311351221!88F


CPn1067122173512=2292F def-Polypepcida Deformylase-ICT353)


CPn106B12232581222365R rnh8_2-Ribonucleue HII_2-ICT008)


CPn106912235131123941F yfp~-HTH Tranacripcional ReQulacor-fCT0091


CPn10701225511122114 R


CPn107112273241225885R


CPn107212279691228835f


CPn107312290111229832F Predicted 0!!P_2 -ICT371)


52


CA 02350775 2001-05-11
WO 00127994 PCT/US99/26923
Table 2 (Supplemental Data) Functional Assignrxnts of C. pneumonine Coding
Sequences. C. trncltomatis genes arc shown in
parrntheses.
Amino Acid Blosynthcsis


.Iromatic
Familv


1039 (CT366)aroAPhosphoshikimate Vinyltransferase


1036 (CT369)aroBDehyroquinau Synthase


1037 (CT368)aroCChorismate Synthase


1 1035 (CT370)aroEShikimate i-Dehyrogenase
~


0486 (CT382)aroGDeoxyheptonate Aldolue


1038 (CT367)aroLShikimate Kinase II


0740 (CT637)tyrBAromatic AA Aminatransfense


AsparrateFomily
!lysine)


1 1048 (CT363)asd Asp:~ute Dehydrogenase



1050 (CT361dapADihy<bodipicolinate
) Synthasc


1047 (CT364)dapBDihydrodipicolinate
Reductasc


0519 (CT430)dapFDian inopirnelate Epimerue


1049 (CT362)IysCAspa zokinase llI


2~ Serint
Family


0433 (CT282)gcsfiGiyc ne Cleavage System
H Protein


0521 (CT432)glyASerine tiydroxymethyltransfense


Base
&
Nuclmtidt
Metabolism


0171 guaAGMP Synfhase


25 0172 guaBInosine 5'-Monophosphase
Dehydrogenase


0608 Utidine S'-Monophosphate
Synthase


0735 Uridine Kinase


0244 (CT128)adk Adenylate Kinase


0894 (CT751atnnAMP Nucieosidase
)


3~ 0568 (CT452)cmk CMP Kituue


0392 (CT039)dcd dCTP Deaminue


0059 (CT292)dut dUTP Nucleotidohydrolase


OI20 (CT030)gmk GMP Kinase


0619 (CT500)ndk Nucleoside-2-P Kinase


3 0984 (CT827)nrdARibonucleoside Reductase.
5 Large Chain


0985 (CT828)nrdBRibonucieoside Reductase,
Small Chain


0236 (CT183)pytGCTP Synthetase


0698 (CT678)pyresUMP Kittase


0271 (CT188)tdk Thymidylate Kinase


0659 (CT539)mtA Thioredoxin


0314 (CT099)trx8Thiorcdoxin Reductase


I (CT844)yfhCCytosine Deaminasc
OOI


45 Biotin. Lipoate dr Ubiquinone
Biosynthesis of Cotacton
1041 bioAAdenosylmethionine-8-Amino-7-Oxononanoate
Aminottatufetasc


1044 bioBBiotin Synthase


1042 bioDDethiobiotin Synthetase


0923 (CT777)bioF_IOxononanoate Synthase-1


1043 (C'C777)bioFOxononanoate Synthase-2
2


0866 (CT725)birABiotin Synthetase


0748 (CT628)ispAGmnyl Tnnsaatuferasc


0832 (CT558)IipALipoate Synthetase


53


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
0265 (CT219)ubiA Benzoau Ocnphenyhransiense


0264 (CT220)ubiD Phenylaerylate Decarboxylue


OSIS (CT428)ubiE UbiquinoneMethyltransfense


Folic
Acid


0759 (CT612)folA DihydrofolateReducuse


0335 (CT078)folD Methylene Tcaahydrofolate
Dehydrogenase


0758 (CT613)folP Dihydropteroate Synthuc


0757 (CT614)folX Dihydroneopmrin Aldolue


0763 (CT649)ygfA FortnyltetrahydrofolateCycloligase


1 Porphyrin
~


0714 (CT662)hertWGlutamyl tRNA Reducnse


0744 (CT633)hemB Porphobilinogen Synthue


OOS2 (CT299)hemC Porphobilinogen Deaminue


0890 (CT747)hemE Uroporphyrinogen
Decarboxylase


I 0888 (CT74$)hemG protoporphyrinogen
S Oxidise


0138 (CT210)hems.Glutamate-1-Semialdehyde-2.1-Aminomutue


0380 (CT052)hemN_ICoproporphyrinogen
Ilt Oxidase_I


0889 (CT746)hemlVCoproporphynnogen
2 111 Oxidise 2
_


0603 (CT485)hemZ Ferrochentue


Riboflavin


0872 (CT731nbA&rib8
) GTP
Cyclohydranse
&
DHBP
Synthase


0532 (CT40S)ribC Riboflavin Synthase


0871 (CT730)ribD Riboflavin Deaminue


0877 (CT7J2)ribE Ribiryllumazine Synthue


25 0320 (CT09))ribF FAD Synthase


Cell
Envelope


Forty
Acid
&
Phospho(ipid
Merabolisrn


0161 (CT206) (predicted uyltnnsferase
family)


0922 (CT776)au Acylg(yeerophosphoethanolamine
Acyhnnsferase


0414 (CT265)accA AcCoA CarboxylasrrTransferrse
Alpha


0183 (CT123)accB Biotin Carboxyl Carrier
Protein


0182 (CT124)accC BiotinCarboxylase


0058 (CT29J)accD AeCoA Carboxylaseffranafense
Ben


35 0295 (CT2Jb)acpP Acy1 Cartier Protein


0313 (CTI00)acpS Acyl-rartier Protein
Synthue


0567 (CT451)cdsA Phosphatidate Cytidylytransferasc


0297 (CT238)fabD Malonyl Acyl Cartier
Transeyclase


0916 (CT770)fabF Acyl Carrier Protein
Synthasc


0296 (CT237)fabG Oxoacyl (Carrier
Protein) Reductue


.0298(CT239)fabH Oxoacyl Carrier Protein
Synthue III


0406 (CTlOa)fabl Enoyl-Acyl-Cartier
Protein Reducnsc


0651 (CT532)fabZ Myristoyl-Aeyl Carrier
Dehydranse


0098 (CTOIO)hcB Acyltransferue


45 0271 (CTIJ6) LysophoapholipueEsterue


0615 (CT496)pgsA-1Glycerol-3-P Phosphatidyltratufense-I


0947 (CT797)pgsA Glycerol-J-P Phospharydyltransfensse_2
2


0958 (CT807)plsB Glycerol-3-P Acylcansferase


0569 (CT453)plsC Glycerol3-P Acylaansferau


50 0962 (CT811plsX FA/Phospholipid Synthesis
) Protein


0839 (CT699)psdD Phosphatidylserirte
Deearboxylue


0983 (CT826)pssA Glycerol-Serine Phosphatidyltransfecue


0921 (CT775) sttGlyeerol-J-P Acyltraruferase


0654 (CTS35)yciA Acyl-CoA Thioestcrasc


S 0877 (C1'736)ybcL CT1J6 Hypothetical
Protein


LPS
54


CA 02350775 2001-05-11
WO 00127994 PCT/US99/Zb923
0154 (CT208)gseAKDO Tnnsfense


0721 (CT655)kdsAKDO Synthetue


0235 (CT182)kds8Deoxyoctutotrosic Aeid
Synthetue


0650 (CT531IpxAAcyl-Carrier UDPGIcIvAc
) O-Acyltnnsfensc


0965 (CT411IpxBLipid A Disucharide
) Synthase


0652 (CT533)IpxCMyristoyl GIcNac Deaeetyiau


0302 (CT243)lpxDUDP Glueosamine N-Acyltransferase


Membrant
Proteins.
Lipoproteins
&
Porins


0310 (CT25160IM60kDa lacer Membrane
) Protein


0556 (CT442)crpAISkDa Cysnine-Rich
Protein


0653 (CT534)cutEApolipoprotein N-ACetyltnttsferue


031 (CT252)Igt Prolipoprotein Diacylglyeerol
I Tnnsfense


0558 (CT444)omcA9kDa-Cysteine-Rich
Lipoprotein


0557 (CT443)omcB60kDa Cysteine-Rich
OMP


0695 (CT681ompAMajor Outer Membrane
) Protein


0854 (CT713)ompBOuter Memebnne Protein
8


0781 (CT600)pat Pepddoglyean-Associated
Lipoprotein


0300 (CT241yaeTOmp85 Hotnolog
)


Peptidoglye:an


0417 (CT268)amiAN-Acetylmuramoyl Alanine
Amidue


0780 (CT601amiBN-Acetylmunmoyl-L-Ala
) Amidue


0672 (CT55duF D-Ala-D-Ala Caroxypeptidase
t
)


0968 (CT816)glmSGlueoumine-Fructose-6-P
Aminotnnsfense


0749 (CT629)glmUUDP-GIcNAc Pyrophosphorylue


0900 (CT757)mnY MunmoylPennpeptide
Tnnxferue


0571 (CT455)murAUDP-N-Acetylg(ucosamine
Tnmferue


0988 (CT831)murBUDPN-At:etylenolpyruvoylglucosamineReductue


0905 (CT762)murCdcddlA
Mutartutc-Ala
Liguc
&
D-AlaD-Alam
Ligue


0901 (CT758)murDMunmoylalanine-Glunmate
LiBase


0418 (CT269)murEN-Aeetylmunmoylalanyl8lunmyl
DAP Ligue


0899 (CT756)murFMuramoylDAP Ligau


0904 (CT761murGPeptidoglyean Tnnsferue
)


0902 (CT759)nlpDMunmidue (invuin repeat
family)


0694 (CT682)pbp2PBP2-Tnnsglycoluelfnnspeptidue


0419 (CT270)pbp3Tntesglycoluelfnnsprptidase


0421 (CT272)yabCPBP2B Family Methyltntufensc


Cellular Prueeases
Ctil
Division


0959 (CT808)catEAxial Filament Protein


0880 (CT739)ftsKCell Division Protein
FaK


0903 (CT760)fhW Cell Division Protein
FtsW


0972 (CT820)ftsYCell Division Pronin
FnY


0617 (CT498)gidAFAD-dependerttOxidorcducttue


0805 (CT582)minDChromosatx Partitioning
ATPase


0850 (CT'109)mteBRod Shape: ProteinSugar
Kinue


0867 (CT726)rodARad Shape Protein


0684 (CT688)parBChromosome Partitioning
Protein


Deroztijcatioa


5~ 0057 (CT294)sodMSupe:roxideDismunsefMn)


0778 (CT603)ahpCThio-spucifie Antioxidant
(TSA) Peroxidase


Signal
Transduetioa


0148 (CT145) S!T Protein Kinue


0584 (CT467)uoS Two-Component Sensor


0294 (CT235) cAMP-Dependrnt Protein
Kinase Regulatory
Subunit


0712 (CT664) (FHA domain)




CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
0478 (CT379)h!!XGTP Binding Protein


0703 (CT673) S!C Protein Kinase


0095 (CT301 S!f Protein Kinau
)


0397 (CT259) PP2C Phosphatax
Family


0037 (CT337)puH PTS Phosphoeartier
Protein Hpr


0038 (CT336)ptslPTS PEP Phosphotnnsferase


0060 (CT29prsN_1PTS IIA Protein_t
f
)


0061 (CT290)ptsNPTS IIA Protein
2 r HTH DYA-Binding
Dorttain


0262 (CT218)surfSurf-like Acid Phosphatase


0838 (CT698)thdFThiophenelFuran
Oxidation Protein


0693 (CT683) TPR Repeats-CT683
Hypothetical Protein


0321 (CT092)ychFGTP Binding Protein


0544 (CT4 yhbZGTP binding protein
t
8)


0844 (CT703)yphCGTPaseiGTP-binding
protein


Smedard
Protein
Secretion


01 (CT025)fIh Signal Recognition
I Particle GTPax
S


03b3 (CT060)tlhAFtagellar Secretion
Protein


0858 (CT717)ffiIFlagellum-specific
ATP Synthax


0704 (CT672)fl(NFlagellu Motor Switch
DomainIYseQ family


0815 (CT572)gspDGen. Secrcdon Protein
D


0816 (CT571gspEGen. Secretion Protein
) E


0817 (CT570)gspFGen. Secretion Protein
F


0359 (CT064)IepAGTPase


0110 (CT020)lepBSignal Peptidue
I


0535 (CT408)IspALipoprotein Signal
Peptidax


0260 (CT141xeA_IProtein Translocax
) Subunit-1


0841 (CT701secA_2Transloerue SecA-2
)


0564 (CT448)secD&secF
Protein
Export
Proteins
SecDiSecF
(fusion)


0075 (CT321secEPrcprorcin Transloeax
)


3v 0629 (CT510)xcY Tnnslocase


0848 (CT707)rig Trigger Factor-Peptidyl-prolyl
lsomersse


Tronsporr-Related
Proteins


0486 Hypothetical Praline
Permease


0289 (CT230)aaaTNeutral Amino Acid
(Glutamate) Tranaponer


3 0691 (CTb85)abcXABC Transporter
5 ATPax


1031 (CT374)arcDArginine/Omithine
Antiporter


0482 (CT381artlArginine Periplasmic
) Binding Protein


0836 (CT554)bmQ Amino Acid (Benched)
Transpon


0536 (CT409)dagA_ID-Ala/Gly Permcax
I


0876 (CT735)dagAD-AlaninelGlycine
2 Permease 2


0682 (CTb90)dppDABC ATPase Dipeptide
Transpon


Ob83 (CT689)dppFABC ATPase Dipeptide
Transport


0280 (CT689)dppFDipeptide Transporter
ATPase


0785 (CT596)exbBMuromolecule Transporter


45 0784 (CT597)exbDBiopolymerTansporiProtein


0404 (CT486)OiY Glutatnine Binding
Protein


0192 (CT129)glnPABC Amino Acid Trmsporter
Permease


0191 (CT130)ginQABC Amino Acid Transporter
ATPase


0528 (CT401)gltTGlutamateSymport


028b (CT194)mgtEMg'+Transportt:r(CHS
Domain)


0413 (CT264)msbATransport ATP Binding
Protein


0290 (CT231) Na;-dependentTnnsporier


0195 (CT198)oppA_IOligopeptide Binding
Protein_1


0196 (CT198)oppA_2Oligopeptide Binding
Protein 2
_


5 0197 (CT139)oppAOligopeptide Binding
3 Protein 3
5


0198 (CT175)oppAOligopeptide Binding
4 Protein .t


56


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
0599 ICT480oppA Oligopeptide Binding
5 Lipoproretn i
)


0199 (CTI99)opp8-1Oligopeptide Pemtesse-:


0598 (CT479)oppB Oligopeptide Permease_2
2


0200 (CT200)oppC_1Oligopeptide Permeue_1


0597 (CT478)oppC_2Oligopeptide Pemuase_'_


0201 (CT201oppD Oligopeptide Tnnspon
) ?~TPase


0202 (CT202)oppF Oligopeptide Transport
ATPue


0231 (CT180)tauB ABC Tnnspon ATPue
f~itntaFe)


0782 (CT599)tolB Macromolecule Transporter


0969 (CT8I7)tyrP_ITyrosine Tnnsport_1


0970 (CTalB)tyrP_2Tyrosine Transport
2
_


0665 (CT544)uhpC He~osphosphate
Transport


0282 (CT216)xuA Amine Acid Transporter


0207 (CT204)ybhl dicarboxylate Tnnslocator


1 0971 (CT819)yccA Tnnspon Permease
S


0248 (CT152)ycCV ABC TnnsporterATPase


lOt4 (CT856)ychM Sulfa a Tnnsponer


0736 (CT641ygeD fllu:. Protein
)


0680 (CT692)ygo4 Phosp gate Pennease


0723 (CT653)yhbG ABC Tnnsponer ATPue


0023 (CT348)yjjK ABC Transporter
Protein ATPue


0127 (CTD34)ytfF Catioi.ie Amino
Acid Transporter


0349 (CT067)ytgA Solute Protein
Binding Family


0348 (CT068)ytgB ABC ransporter
ATPue


0347 (CTD69)ytgC Integrsl Membrane
Protein


0346 (CT070)ytg0 Integral Membrane
Protein


1012 (CT854)yze8 AHC Tnnsponer Permease


0868 (CT727)znlA Metal Tnnspon P-type
.4TPase


0279 Possible ABC Tnnsportcr
Pertneue Protein


0543 (CT417) (Metal Tnnspon
Protein)


0692 (CT684) ABC Transponer


0542 (CT416) ABC Transporter
ATPase


0690 (CT686) ABC Transporter
Membrane Protein


0541 (CT415) solute binding
protein


3 7yve-msttetro~
5


0323 (CT090)IcrD Low Caleium Response
D


0324 (CT089)IcrE Low Calcium Response
E


D8 (CT576)IcrH_tLow Ca Response
c Protein H-1
I


1021 (CT862)IcrH Low Calcium Response
2 2
_


0325 (CT088)sycE Seerction Chaperone


0702 (CT674)yscC Yop GGen Secretion
Protein D


0828 (CT559)yscJ Yop Tnnslocation
J


0826 (CT561yscL Yop Tnnsloeation
) L


0707 (CT669)yscN Yop N (Flagellar-Type
ATPase)


45 0825 (CT562)yscR Yop Tnnslocadon
R


0824 yscS YopS Tnnslocation
(CT563) Protein


0823 yscT YopT Tnnloeation
(CT564) T


0322 yscU Yop Translocation
(CT091 Protein U
)


5o Central Intermediary Metabolism
Glycogen
Merobofism


0856 (CT715) UDP-Glueose
Pyrophosphorylue


0948 (CT798)glgAGlycogen Synthase
,


0475 (CT866)glgBGlucan Benching
Enzyme


JS 0607(CT489)glgCGlucoseI-P Adenyltransferase


0307 (C7-248)glgPGlycogen Phosphorylase


0388 (CT042)glBXGlycogen Hydrolase
(debnnching)


57


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
0326 (CT087)malQGlucanoesnsfense


0851 (CT710)pckAPhosphoenolpyrovate
Carboxykinase


Phosphorous
Qc
Suljur


0548 (CT435)cystSulfite Reductase


S 0920 (CT774)cysQSulfite SynthesivBiphosphace
Phosphatau


0025 (CT346)actASulphohydrolue


0918 (CT77?)ppaInorganic Pyrophosphacase


DNA Replication. Madlfication. Repair & Recombination
1 O DNA Mismareh Repair
0505 3-Methyladenine
DNA Glycosylue


0812 (CTS75)mutt DNA Mismatch Repair


0941 (CT792)mutS DNA Mismatch Repair


0402 (CT107)mutt Adenine Glycosyiase


S 0732 (CT625)nfo Endonuclease IV


0837 (CT697)nth Enodnucleue 111


DNA
Modification


0596 (CT477)ada Methylmnsferau


Ol (CT024)hemK AJG-specific Methylue
14


ZO 0891 (CT748)mfd Tnnscnprion-Repair
Coupling


0620 (CT501ruvA Holliday Junction
) Helicue


0390 (CT040)rov8 Holliday Junction
Helicue


0621 (CT502)rovC Crossover Junction
Endonucleue


0053 (CT298)sms Strn Protein


2S 0771 (CT607)un8 Uncil DNA Glycosylue


1062 (CT329)xseA Exodoxyribonucleue
VII


DNA
Recombination


0762 (CT650)recA RecA Recombination
Protein


0738 (CT639)recB Exodeoxyribonuclease
V. Beu


3O 0737 (CT640)recC Exodeoxyribonucleue
V, Gamma


0123 (CT033)rccD_IExodeoxyrtbonuclease
V (Alpha Subunit)_I


0752 (CT6S2)reeD Exodeoxyribonuclease
2 V. Alpha 2
_


0339 (CT074)recF ABC Superfamdy
ATPuc


0340 (CT074) (frame-shift with
0339)


3S 0563 (CT.t47)recJ ssDNA Exonucleue


0299 (CT240)rceR Recombination Protein


DNA
Replication


0309 (CT2S0)dnaA_IReplication Initiation
Protein_I


0424 (CT275)dnaA Replication Initiation
2 Faetor_2


4O 0616 (CT497)dnaB Replicative DNA
Helicue


0666 (CT545)dmE DNA Pol tI1 Alpha


0942 (CT794)druG DNA Primax


0338 (CT075)dnaN DNA Pol III (Beta)


0410 (CT261dnaQ_1DNA Pol III Epsilon
) Chain_1


4S 0655 (CT536)dnsQ DNA Pol III Epsilon
2 Chain_2


0040 (CT334)dnaX_1DNA Pol III Gamma
and Tau_l


0272 (CTI87)dnaX DNA Pol III Gamma
2 and Tau_2
_


0149 (CT146)dnU DNA Ligue


0274 (CT189)ByrA_IDNA Gyrue Subunit
A_I


SO 0716 (CT660)gyrA DNA Gyrase Subunit
2 A 2
_


0275 (CTI90)gyrB_IDNA Gynse Subunit
8_I


0715 (CT661gyrB DNA Gynse Subunit
) 2 B_2


0416 (CT267)himD lntegntion Host
Factor Alpha


0612 (CT493)polA DNA Polymerise
I


S 0924 (CT778)priA Primosomal Protein
S N


0386 (CT044)ssb SS DNA Binding
Protein


S8


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
0835 (CT555) SWUSNF family helieue_t


0849 (~7pg) SWUSNF family helicue
2
_


0769 (CT643)topADNA Topoisomense
t-Fused to SWI
Domain


0024 (CT347)xerCIntegruvrecombinue


1024 (CT864)xerDIntegrudrccombinue


Eukaryotic-Typt
Chromatin
Factors


0886 (CT743)hctAHiswne-Like Developmental
Protein


0384 (CT046)hct8Histone-like Protein
2


0878 (CT737) SET Domain protein


0577 (CT460) SWIB (YM74) Complex
Protein


UVR
Exinutlease
Repair
System


0096 (CT33;)uvrAExcinueletux ABC
Subunit A


0801 (CT586)uvr8Exinucleue ABC Subunit
B


p9a0 (CT791uvrCExcinucleue ABC.
) Subunit C


I 0772 (CT608)uvrDDNA Hclicue
S


Energy Metabolism
Aerobic


0855 (CT714)gpdA Glycerol-3-P Dehydrogenase


0743 (CT634)nqrA Ubiquinone Oxidorcductue.
Alpha


0427 (CT278)nqr2 NADH (Ubiquinone) Dehydrogenase


0428 (C'C279)nqr3 NADH (Ubiquinone) Oxidorcductase.
Gamma


0429 (CT280)nqr4 NADH (Ubiquinone) Reductue
4


0430 (CT281nqr5 NADH (Ubiquinone) Redueuse
) 5


25 0883 (CT740)nqr6 PhenolhydrolasdNADH
(Ubiquinone) Oxidoreductase
6


.1 TP
Biogenesis
and mttabolistn


0351 (CT065)adt_IADPIATP Traralaeast_1


0614 (CT495)adt ADP/ATP Tnnslocase_2
2


0088 (CT308)atpA ATP Synthue Subunit
A


0089 (CTJ07)atpB ATP SYn~uc Subunit B


0090 (CT306)atpD ATP Synthue Submit D


0086 (CT3I0)atpE ATP Synthue Subunit
E


0091 (CT305)atpl ATP Synthase Subunit
1


0092 (CT304)atpK ATP Synthue Subunit
K


35 0860 (CT119)fliF FIageIlar M-Ring Protein


Electron
Transport
Chain


0102 (CT013)cydA Cytochrorne Oxidue Subunit
1


0103 (CT014)cydB Cytochrome Oxidise Subunit
11


0364 (CT059) Fertedoxin


L~~ 0084 (CT312) Predicted Ferrcdoxin


Glyrnlysis
& Gluconeogtnesis


0281 (CT215)dhnA Predicted 1.6-Fructose
Biphosphate Aldolase


OB00 (CT587)erro Enolue


0624 (CT505)gapA Glyceraldehyde-3-P Dehyrogenue


45 0056 (CT295)mrsA PhosDhornannomutue


0967 (CT8I5)pgm Phosphoglucomutue


0160 (C'T207)plkA_IFructose-6-P Phosphotransfense_I


0208 (CT205)ptkA Fructose-6-P Phosphoasnsferue_2
2 -


1025 (CT378)pgi Glucose-6-P Isomerase


0679 (CT693)pgk Phoaphoglyeerate Kituse


0863 (CT722)pgrrtAPhosphoglyetnte Mutant


0097 (CT332)pyk Pyruvate Kinase


1063 (CT328)tpiS Triosephosphate Isomertue


Ptntose
Phosphate
Pothway


55 0239 (CTI86)devB Glucose-bP Dehyrogtnast
(DevB family)


1060 (CT331)dxs Tnruketolue


59


CA 02350775 2001-05-11
WO 00/Z7994 PCT/US99/26923
0360 ICT063)gnd 6-Phosphogluconate
Dehydrogenase


0185 ICTI21)rpe Ribulose-P Epimense


0141 (CT213)tpiARibose-5-P Isomerase
A


0083 (Cf313)tal Transaldolue


S 0893 IC1-750)UttBTnnsketolue


0238 (CT185)zwf Glucose-6-P Dehyrogenue


Pyruvalt
Dehydrogenase


0833 (CT557)IDdALipoamitle Dehydrogenue


0436 ICT285)IpIA_ILipoate Protein
Ligue-Like Protein


0618 (CT499)IpIALipoam-Protein
? Ligase A


0033 (CT340)ptJhA&BOxoisovalente Dehydrogenase
a/(i Fusion


0304 (CTZ45)pdhAPyruvate Dehydrogenase
Alpha


0305 (CT246)pdhBPyruvate Dehydrogenue
Beta


0306 (CT247)pdhCDihydrolipoamide
Acetyltransferase


1 TCA
S Cycle


0495 (CT390)aspCAspartam Aminotnnsferase


1013 (CT855)fumCFumarate Hydratue


1028 (CT376)mdhCMalate Dehyrogenase


0789 (CT592)sdhASuccinam Dehydrogenue


0790 (CT591)sdhBSuccinate Dehydrogrnase


0788 (CT593)sdhCSuccinate Dehydrogenue


0378 (CT054)sueAOaoglutarate Dehydrogenue


0377 (CTO55)sucB_1Dihytleolipoamide
Succinylnansfetase_I


0527 (CT400)sucBDihydrolipwmide
2 Succinyltratuferue
?
-


2 0973 (CT821sucCSuccinyl-CoA Synthetue.
S ) Ben


0974 (CT822)sucDSuecinyl-CoA Synthetase,
Alpha


Protein Folding, Assembly & Modification
Chaperonu


30 0949 (CT799)ctc General Stress Protein


0534 (CT407)dksA DnaK Suppressor


0032 (CT34IdnaJ Heat Shock Protein
) J


0503 (CT396)dnaK Hsp-70


0134 (CT110)groEL_1Hsp-60_1


3 0777 (CT604)groELHsp-60 2
S 2


0898 (CT755)groELHsp-60 3
3


0135 (CTI groESlOKDa Chaperonin
11
)


0502 (CT395)grpE HSP-70 Cofutor


0661 (CT541mip FKBP-type Peptidyl-prolyl
) CisTrans lsotnerue


Prattasts


OI44 (CTI clpB CIp Proteue ATPue
13)


0437 (CTt86)clpC CIpC Proteue


0520 (CT431clpP CLP Protease
) 1


0847 (CT706)clpP CLP Protease Subunit
2


4S 0846 (CT705)dpX CLP Proteue ATPue


0269 (CC138) Dipeptit3ue


0998 (CT841)fliesATPdepmdent Zinc
Proteue


0030 (CT343)gcp_1O-Sialoglytoprotein
Endopeptidue_I


0194 (CT197)gcp_2OSialoglycoprotein
Endopeptidast_2


S~ 0979 (CT823)htrA DO Senne Proteue


0957 (CT806)ide Insulinue family/Proteaae
II1


0027 (CT344)ion Lon ATP-dependent
Protect


1017 (CT859)IytB Metalloproceue


1009 (CT85t)trap MethionmeAminopeptitlase


S 0185 (CT045)pepA Leucyl Aminopeptidau
S A


OI36 (CT113)DepF Oligopeptidase




CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
0813 (CT574)pepPAminopeptidase
P


0613 fCT494)soh8Protease


0555 (CT441tsp Tail-Specific
) Protease


0344 (CT072)yaeL~tetal!optoteue


0981 (CT824) Zinc ~tetalloprotease
(insu)intue family)


Proteinomtrasts
ls


0227 (CT176)dsb8Disulfide bond
Oxidorcductue


0786 (CT595)dsbDThio:disulfide
Interchan8e Protein


0228 fCT177)dsbGDisulfide Bond
Chaperone


0933(CT783) Prcdieted Disulfide
Bond lsotnerase


0926 (CT780) Thiorcdoxin Disulfide
Isomerase


61


CA 02350775 2001-05-11
WO 00/27994 PGT/US99/26923
Transcription
RNA
Degradation


0999 (CT842)pnp Polyribonucleotide ~ueleotidylmnsfense


0054 (CT297)me Ribonuelease III


0119 (CT029)mhB_1Ribonuelesse HII_1


1068 (CT008)mhB Ribonucleax Hlf
2


0934 (CT784)mpA Ribonueleue P Protein Component


0504 (CT397)vac8Ribonucleue Family


1 RNA
~ Elongation
Qe
Termination
Faetors


0741 (CT636)greATranscription Elongation Factor


0316 (CT097)nuSAN Utilization Protein A


0076 (CT320)nusGTnrueriptional Antitermination


0845 (CT704)pcnB-1Poly A Polymenx_I


I 0966 (CT410)pcnBPolyA Polymerise ?
2
S


0610 (CT491rho Transcription Termination Factor
)


RNA
Merhyiases


0674 (CTSS3)fmu RNA Methyloansferue


1059 (CT3S4)kgsADimethy(adenosine Tnnsfense


2~ 0187 (CT133) PttdictedMethylue


0530 (CT403)spoU_1rRNA Methylue_1


0660 (CTS40)spoUrRNA Methylue_2
2


0117 (CT027)trmDtRNA (Gtunine N-I )-Methylttansfense


0885 (CT742)ygcArRNA Methyltransferse


25 0986 (CT829)yggHPredicted rRNA Methylue


0987 (CT830)ytg8Predicted rRNA Methylase


RNA
Modification


0649 (CTS30)fmt Methionyl tRNA Formyhnnsferase


0910 (CT766)miaAtRNA Pyrophosphate Tnnsferise


30 07t9 (CT658)sthBPredicted Pxudouridine Synthue


0219 (CT193)tgt Queuine tRNA Ribosyl Tnnsfense


OS80 (CT463)truAPseudouridylate Synthue I


0319 (CT094)tru8tRNA Pseudouridine Synthue


0401 (CTt06)yceCPredicted Pseudouridine Synthetue
Family


3 0864 (CT723)yjbCPredicted Pseudouridine Synthue



RNA
Po(ymerote
Qc
Trantcriprion
Rtgulators


OS86 (CT4b8)atoCTwo-Component Regulator


0362 (CTObIrpsDSigma-28/WhiG Family
)


OS01 (CT394)hrcAHTH Tnnxriptional Repressor


40 0793 (CTS88)rbsUSigma Regulatory Family Protein-PP2C
Phosphanse (RsbW Anngonist)


062b (CTS07)tpoARNA Polymerise Alpha


0081 (CT31rpoBRNA Polymerise Ben
S)


0082 (CT314)rpoCRNA Polymerise Ben'


075b (CT61rpoDRNA Polymenx Sigma-66
S)


45 0771 (CT609)rpoNRNA Polyrrrcnse Sigma-S4


OSI1 (CT424)rsbV_ISigtnaRegulatoryFaetor_I


0909 (CT76S)rsbVSigma Factor Regulator 2
2


0670 (CTS49)rsbWSigma Regulatory Factor-Histidine
tCitux


0750 (CT630)tctDHTH Tranuriptional Regulatory
Protein Receiver Doman


1069 (CT009)yfgAHTH Tnnscripoonal Regulator


Amino Aeyl tRNA Synthesis
0892 (CT749) alas Alanyl tRNA Synthetue
55 0570 (CT454) argS Arginyl tRNA Tnnsfenx
0662 (CT542) asps Aspartyl tRNA Synthense
Translation
62


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
0932 (CT782)cysSCysuinyl tfL'IA
Synthetue


0003 (CT003)gatAGlu tRNA Gln Amidotnmfertue
(A subunit)


0004 (CT0o4)gatesGlu tRNA GIn Amidotnnsfmse
(B Subunit)


0002 (CT002)gatCGlu tRNA Gln Amidotnnsfetase
(C subunit)


0560 (CT445)gltXGlutamyl-tRNA Synthetue


0946 (CT796)glyQGlycyl tRNA Synthetax


0663 (CT543)hissHisadyl tRNA Syntherase


0109 (CT019)ileSIsoleucyl-tRNA
Synthetax


0153 (CT209)IeuSLeucyltRNA Synthetue


1 0931 (CT781IysSLysyl tRNA Synthetase
~ )


OI22 (CT032)tttetGMeth'ronyl-tRNA
Synthenx


0993 (CT836)pheSPhenylalanyl tRNA
Synthetase, Alpha


0594 (CT475)pheTPhenyla)anyl tRNA
Synthetax Beta


0500 (CT393)prosProlyl tRNA Synthetax


I 0870 (CT729)xrS Seryl cRNA Syntherase
S 2


0806 (CT581)thrSThrconyltRNA Synthense


0802 (CT585)apS TryptophanyItRNA
Synthetase


0361 (CT062)tyrSTyrosyl tRNA Synthetase


0094 (CT302)vaiSValyl tRNA Synthetue


Pepridc
Chain
Initiation.
Elongation
&
Termination


1067 (CT333)def Polypeptide Dcformylase


0184 (CT122)eCp_IElongation Futor
P-1


0895 (CT752)efp Elongation Futor
2 P 2


0550 (CT437)CusAElongation Facror
G


25 0073 (CT323)inCAInitiation Factor
IF-I


0317 (CT096)inf8Initiation Factor-2


0990 (CT$33)infCInitiation Futon
3


01 (CT023)plrAPeptide Chain Releasing
I3 Futon 1


0576 (CT459)prt8Peptide Chain Release
Factor 2


3~ 0950 (CT800)pth Peptidyl tRNA Hydrolax


0318 (CT095)rbfARibosome Binding
Futon A


0699 (CT677)rrf Ribosome Releasing
Factor


0697 (CT679)tsC Elongation Factor
TS


t>074(CT322)tufAElongation Factor
Tu


35 Ribosomal
Prortins


0078 (CT318)rll LI Ribosomal Protein


0644 (CT525)r12 L2 Ribosomal Protein


0647 (CT528)r13 L3 Ribosomal Protein


0646 (CT527)rl4 L4 Ribosomal Protein


0635 (CT516)r15 LS Ribosomal Prouin


0633 (CT514)rl6 L6 Ribosomal Protein


0080 (CT316)r17 L7/LI2 Ribosomal
Prouin


0953 (CT803)rl9 L9 Ribosomal Protein


0079 (CT317)r110L10 Ribosomal Protein


45 0077 (CT319)rl L1 f Ribosomal
I Prouin
1


0247 (CT125)r113Ll3 W'b~onui Prouin


0637 (GT518)r114L14 Ribosomal Prouin


0630 (CT511)r115LIS Ribosomal Prouin


0640 (CT521)r116L16 Ribosomal Prouin


0625 (CT506)r117Ll7 Ribosomal Protein


0632 (CT513)rll8Lt8 Ribosomal Prouin


01 (CT028)r119Ll9 Ribosomal Protein
l8


0992 (CT835)r120L20 Ribowmal Protein


0546 (CT420)r121L21 Ribosomal Protein


55 0642 (CT5I3)r122L22 Ribosomal Prouin


0643 (CT526)r123L23 Ribosomal Protein


63


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
0636(CT517)r124L24 Ribosomal
Prouin


0545(CT419)r127427 ribosomal
protein


0327(CT086)r128L28 Ribosomal
Prouin


0639(CT520)r129L29 Ribosomal
Protein


0112(CT022)r131L31 Rbosomal
Protein


0961(CT810)r132L32 Ribosomal
Prouin


0250(CT150)r133L33 Ribosomal
Prouin


0935(CT785)r134L34 Ribosomal
Prouin


0991(CT834)r135L35 Ribosomal
Prouin


1 0936(CT786)r136L36 Ribosomal
~ Prouin


0315(CT098)rst SI Ribosomal
Protein


0696(CT680)rs2 S2 Ribosomal
Prouin


0641(CT522)rs3 S3 Ribosomal
Prouin


0733(CT626)rs4 S4 Ribosomal
Prouin


15 0631(CT512)rs5 S5 Ribosomal
Prouin


0951(CT801rs6 S6 Ribosomal
) Prouin


0551(CT438)rs7 S7 Ribosomal
Prouin


0634(CT515)rs8 S8 Ribosomal
Protein


0246(CT126)rs9 S9 Ribosomal
Prouin


0549(CT436)rs10S10 Ribosomal
Prouin


0627(CTSOB)rsl1511 Ribosortul
Protein


0552(CT439)rsl2SI2 Ribosomal
Prouin


0628(CT509)rs13SI3 Ribosomal
Prouin


0937(CT787)rs14514 Ribosomal
Prouin


25 1000(CT843)rsl5S15 Riboaomal
Protein


0116(C'f026)rs16SI6 Ribosomal
Protein


0638(CT519)rsl7517 Ribosomal
Protein


0952(CT802)rs18SI8 Ribosomal
Protein


0643(CT524)rsl9519 Ribosomal
Prouin


0754(CT617)rs20S20 Ribosomal
Prouin


0031(CT342)rs21521 Ribosomal
Protein


35 Other Catc'orica


Ch(cmydiaSpccific
Proteins


0561 (CT446)Euo CHLPS Euo Prouin


0804 (CT583)Gp6D CHLTR Plasmid Paralog


0186 (CTt SimiLriey to IncA_t
t9)


0291 (CT232)ineB Inelmion Membrane
Protein B


0292 (CT233)incC Inclusion Membrane
Protein C


1026 (CT377) LtuA Prouin


0333 (CTO80) LtuB Protein


0005 (CT871pmp_IPolymorphic Ouur
) Membrane Protein
G Family


45 0013 (CT871pmp_2Polymorphie Ouur
) Membrane Prouin
G Family


0014 (CT871pmp Polymorphic Ouur
) ~ Membrane Prouin
G Family


0015 (CT871pmp_3PMP 3 (frame-shit!
) with 0014)


0016 (CT874)pmp Polymorphic Ouur
4 Membrane Prouin
G Family


OOi7 (CT871)pmp_4PMP 4(fttune-shiftwith0016)


0018 (CT874)pmp Polymorphic Outer
5 Membrane Protein
G Family


0019 (CT87IPmp_5PMP 5 (frame-shift
) with 0018)


0444 (CT871pmp Polymorphie Ouur
) 6 Membrane Prouin
G/I Family


0445 (CT871pmp_7Polymorphic Outer
) Membrane Protein
G Family


0446 (CT871pmp Polymorphic Outer
) 8 Membrane Protein
G Family


55 0447 (CT871pmp Polymorphic Ouur
) 9 Membrane Prouin
G/I Family


0450 (CT871pmp_IPolymorphic Ouur
) O Membrane Protein
G Family


0449 (CT871DmP_10PMP_l0 (Frame-shift
) with 0450)


64


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
0451 (CT87t ) pmp_I I Polymorphic Outer Membrane Protein G Family
0452 (CT874) Potymorphic Outer Membrane
pmp_12 Protein (truncated)
AEI Family


0453 (CT871) Polyrnorphie Outer Membrane
pmp_I3 Protein G Family


0454 fCT872) Polymorphic Outer Membrane
pmp_t4 Protein H Family


0466 (CT869)pmp_I5Polymorphic Outer Membrane
Protein Family


0467 (CT869)pmp_16Polymorphic Outer Membrane
Protein E Family


0468 fCT869)pmp_17Polymorphic Outer Membnnc
Protein E Family


0469 (CT869)ptnp_17PMP_t7 (Fame-shift with
0468)


0470 fCT869)prnp_I7PMP_17 (Fame-shill with
0469)


0471 (CT870)pmp_18Polymorphic Outer Membrane
Protein FrF Family


0579 fCT412)prrtp_19Polymorphic Membrane
Protein A Family


0540 (CT413)pmp Polytrrorphic Membrane
30 Protein B Family


0967 (CT8t2)pmp_21Polymocphic Membrane
Protein D Family


0562 CHLPS 47 kDa Protein
Hotnolog_I


1 0927 CHLPS 47 kDa Protein
S Homolog_2


0928 CHL?S -43 kDa Protein
Homolog 3


0929 CHL.'S _43 kDa Protein
Homolog 4


0728 (CT622) CHL.'N 76kDa Homolog_I
(CT622)


07.9 (CT623) CHLPN 76kDa Homolog_3
(CT623)


0137 (CTI09) CHLI'S Hypothetical Protein


0332 (CTO81 CHL"'R T2 Protein
)


Mistellonmur
Err-rymu~Conservtd
Prote
irtf


0193 argR Possi de Arginine Repressor


106 Arort atie Amino Aeid
Hydroxyiase


25 0232 Similarity ro 5'-Methylthioadentnine
Nucleosidase


0128 (CT035) Biotin Protein Ligue


0513 (CT426) Fe-S Oxidoreducuse_I


I (CT767) Fe-S Oxidorcductue 2
091 -


0373 (CT057)gepE GcpE Protein


30 0407 (CT103)' HAD Superfamily HydrolauJPhosphatue


0917 (CT771) HydrolasdPhosphatue Homolog


0488 (CT385)ycfF HIT Family Hydrolase


070! (CT675)karG Arginine Kinase


0526 (CT399)kpsF GutQ/KpsF Family Sugar-P
Isomense


35 0919 (CT773)Idh Leucine Dehydrogenase


0022 (CT349)maC Mafprotein


0997 (CT840)mes! PP-loop superfamily ATPase


OISI (CT148)mhpA Monooxygrnase


0730 (CT624)mviN Integral Membrane Protein


0861 (CT720) NiN-Related Protein


0479 (CT380)phnP Metal Dependent Hydrolase


0106 (CT015)phoH ATPase


0729 (CT084) Phophotipue D Sttperfamily


0435 (CTI84) Phospholipase D Superfamily


45 0581 (CT464) Phosphoglycolate Phosphanse


0897 (CT754) Predicted Phosphohydrolue


0509 (CT422) Predicud Metalloen:yme


1030 (CT375) Pmdicted D-Amino Acid
Dehyrogenase


0531 (CT404) SAM Deprndent Methyltramferue


50 0337 (CT076)smp8 Srnatl Protein B


0394 (CT256)t(yC_ICBS Domain Protein (Hemolysin
Homolog)_t


0510 (CT423)ttyC_2CHS Domains (Hemolysin
Homolog)_2


0382 (CT048)yabC SAM-Dependent Methyarnaferase


0787 (CT594)yabD PHP Superfamity (Urcase/Pyrimidinuc)
Hydrolau


55 0611 (CT492)yacE Predicted PhoaphatuelKinue


0579 (CT462)yachtSugar Nucleotide: Phosphorytue


OS78 (CT461)yael Phosphohydrolase _


65




CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
0145(CT071yaeM CT071 Hypothetical Ptotem
)


0566(CT450)yaeS YaeS family Hypothetical
Protein


0591(CT472)yagE YagE family


0039(CT335)ybaB YbaH family Hypothetical
Protein


OI01(CTOl2)ybbP YbbP family Hypothetical
Protein


0915(CT769)ybeB iojap Superfamily Ortholog


0137(CTf08)ybgl ACR family


0529(CT402)ycaH ATPau


0438(CT287)ycbF PP-loop Superfamily ATPase


1 0734(CT627)yceA YceA Hypothetical Protein
~


0954(CT804)ychH Predicted Kinase


0261(CT217)yda0 PPLoop Superfamily ATPase


0245(CT127)ydh0 Polysaccharide Hydrolue-tnvasin
Repeat Family


0573(CT457)yebC YebC Family Hypothetical
Protein


IS 0689(CT687)yfh0_I Nif$-rclatedAminotransfenae_I


0862(CT721yfh0 2 Nits-related Aminomnsfetau-2
)


0547(CT43t)ygbB YgbB Family Hypothetical
Protein


0237(CT184)yggF YggF Ftunily Hypothetical
Protein


0775(CT606)yggV YggV Family Hypothetical
Promin


0396(CTZ58)yh10,3 NifS-related AminotnnsCense
3


0605(CT487)yhhf Predicted Methylase


0575(CT458)yhhY Amino Group Acetyl Tnnsfense


0592(CT473)yidD YidD Family


0982(CT825)yigN YigN Family Hypothetical
Protein


25 0657(CT537)yjeE YjeE Hypothetical Protein


0768(CT644)yohl Yoht Predicted Oxidoteductue


0336(CT077)yajL YojL Hypothetical Protein


0217(CT140)ypdP YpdP Hypothetical Protein


0140(CT212)yqdE YqdE Hypothetical Protein


0263(CT221yqfiJ YqfU Hypothetical Protein
)


0139(CT211yqgE YqgE Hypothetical Protein
)


0270(CT137)ywlC SuAS Superfamilyrelated
Protein


0879(CT738)yyc! Menl Dependent Hydrolase


35 Homologs to CHLTR Hypothetical
Caling Genes


0001(CT001CTOOI Hypothetical Protein
)


0020(CT351CT351 Nypothetieal Protein
)


0021(CT350)CT350 Hypothetical Protein


0026(CT345)CT345 Hypothetical Protein


0035(CT339)CT339 Hypothetical Protein


0036(CT338)CT338 Hypothetical Protein


0055(CT296)CT296 Hypothetical Protein


0062(CT289)CT289 Hypothetical Proxin


0065(CTZ88)CT288 Hypothetical Protein


45 0068(CT360)CT360 Hypothetical Protein


0071(CT325)CT325 Hypothetical Protein


0072(CT324)CT324 Hypothetical Protein


0085(CT31CT711 Hypothetical Protein
l
)


0087(CT309)CT309 Hypothetical Protein


0093(CT303)CT303 Hypoehstieal Protein


0100(CT011CT011 Hypothetiesl Protein
)


0104(CT017)CT017 Hypothetical Protein


0105(CT016)CT016 Hypothetical Protein


0107(CT058)CT058 Hypothetical Protein_I


55 otoetcrnlg)crolg similarity


011 (CT021CT021 Hypothetical Protein
I )


0121(CT031CT031 Hypothetical Protein
)


66


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
0129(CT036tCT036 Similarity


0145(CTt CT114 Hypothetical
14) Protein


Ot50(CTI47)CT147 Hypothetical
Protein


0152(CTt49)CT149 Hypothetical
Protein


0176(CTI53)CT153 Hypothetical
Protein


0188(CT132)CT132 Hypothetical
Protein


0189(CT131CTl3l Hypothetical
) Protein


0206(CT203)CT203 Hypothetit:al
Protein


0229(CT178)CT178 Hypothetical
Protein


0230(CT179)CT179 Hypothetical
Protein


0234(CT18ICT181 Hypothetical
) Protein


0249(CTI51CTlS t Hypothetical
) Protein


- 0253(CT144)CT144 Hypothetical
Protein_1


0254(CT143)CT143 HypoUtetical
Protein-1


I S 0255(CT142)CT142 Hypothetical
Protein_I


0256(CTtaa)CT144 Hypothetical
Protein 2


0257(CT143)CT143 Hypothetical
Protein 2


0259(CT142)CT142 Hypothetical
Protein 2


0276(CT191CT191 Hypothetiesl
) Protein


0288(CT195)CT195 Hypothetical
Protein


0293(CT234)CT234 Hypothetical
Protein


0301(CT242)CT368 Hypothetical
Protein


0303(CT244)CT244 Hypothetical
Protein


0308(CT249)CT249 Similuity


25 0312(CT101)CT101 HypothetiealProtein


0328(CTO85)CT085 Hypothetical
Proosin


0330(CT083)CT083 Hypothetical
Protein


0331(CT082)CT082 Hypothetical
Protein


0374(CT079)CT079 Similarity


0342(CT073)CTOT3 Hypothetical
Protein


0343(CT073)(hams-ahiR
with 0342?)


0350(CT066)CT066 Hypothetical
Protein


0369(CT058)CTO58 Hypothetical
Protein 2


0370(CTO58)CT058 Hypothetical
Protein 3


35 0374(CT056)CT056 Hypothetical
Protein


0379(00053)CT053 Hypothetical
Protein


0381(CT326)CT326 Similarity


0383(CT047)CT047 Hypothetical
Protein


0387(CT043)CT043 Hypothetical
Protein


0389(CT041CT04 t Hypotitetieal
) Protein


0393(CT038)01'038 Hypothetical
Protein


0395(t.'C257)CT257 Hypothetical
Protein


0399(CT253)CT253 Hypothetical
Protein


0400(CT254)CT254 Hypothetical
Protein


45 0401(CT255)CT255 Hypothetical
Protein


0405(CT10S)CTI05 Hypothetical
Protein


0408(CT102)CT102 Hypothetical
Protein


0409(CT260)Cf260 Hypotheatal
Protein


0411(CT262)CT262 Hypothetical
Prooein


0412(CT263)CT'263 Hypothetical
Protein


0415(t:T266)CT266 Hypothetiea!
Protein


0420(CT271CT271 Hypothetical
) Protein


0422(CT273)CT273 Hypothetical
Protein


0423(CT274)CT274 Hypothetical
Protein


55 0425(CT276)CT276 Hypothetical
Proteins


0426(CT277)CT277 Similarity


0434(CT283)CT283 Hypothetical
Protein


67


CA 02350775 2001-05-11
WO 00/17994 PCTNS99/Z69Z3
0441ICT007)CT007 Hypothetical
Protein


0442(CT006)CT006 Hypothetical
Protein


0443(CT003)CT003 Hypothetical
Protein


0474(CT363)CT363 Hypothetical
Protein


0476(CT863)CT863 Hypothetical
Protein


0480(C7383)CT383 Hypothetical
Protein


0485(CT382)CT382.1 Hypothetical
Protein


0487(CT384)CT384 Hypothetical
Protein


0489(CT386)CT386 Hypothetieat
Protein


1 0490(CT387)CT387 Hypothetical
~ Proxin


0491(CT389)CT389 Hypothetical
Protein


0496(CT791CT391 Hypothetical
) Protein


0497(CT388)CT388 Hypothetical
Protein


0506(CT421CT421 Hypothetical
) Protein


1 0507(CT421CT421.1 Hypothetical
S ) Protein


0508(CT421CT421.2 Hypothetical
) Protein


Osl2(CT423)CT423 Hypothetical
Protein


0314(CT427)CT427 Hypothetical
Protein


0518(CT429)CT429 Hypothetical
Protein


2~ Os22(CT433)CT433 Hypothetical
Protein


0525(CT398)CT398 Hypothetical
Protein


0533(CT406)CT406 Hypothetical
Protein


0537(CT814)CT814.1 Hypothetical
Protein


0538(CT814)CT814 Hypothetical
Protein


25 oss4(CT440)CT440 Hypothetical
Prouin


OSS9(CT441)CT441.1 Hypothetical
Protein


0363(G?449)CT449 Hypothetical
Protein


0372(CT436)CT436 Hypothetical
Protein


0382(CT463)CT463 HypotlKtieal
Protein


30 0383(CT466)CT466 Hypothetical
Protein


0388(CT469)CT469 ~iypothetieal
Protein


0589(CT470)CT470 Hypothetical
Protein


0390(CT471)CT471 Hypothetical
ProOein


0393(CT474)CT474 Hypothetical
Protein


35 0393(CT476)CT476 Hypothetical
Protein


0601(CT483)CT483 Hypothetical
Protein


0602(CT484)CT484 Hypothetical
Protein


0606(CT488)CT488 Hypothetical
Protein


0609(CT490)CT490 Hypothetical
Protein


4U 0622(CT303)CT303 Hypothetical
Protein


0623(CTS04)CT304 Hypothetical
Protein


0648(CTS29)CTS29 Hypothetical
Protein


0658(CTS38)CT338 Hypothetical
Protein


0667(CT346)CT346 Hypothetical
Protein


45 0668(CTS47)CT347 Hypothetical
Protein


0669(CTS48)CT348 Hypothetical
Protein


0671(CTS30)CT350 Hypothetical
Protein


0673(CT332)CT332 Hypothetical
Protein


0673(CT696)CT696 Hypothedeal
Protein


0676(CT695)CT693 Similarity


0681(CT691CT691 Hypothetical
) Prooein


0687(CT482)CT482 Hypothetical
Protein


0688(CT481CT481 Hypothetical
) Protein


0700(CT676)CT676 Hypothetical
Protein


55 0703(CT671)CT671 Hypothetical
Protein


0706(CT670)CT670 Hypothetical
Protein


0708(CT668)CT668 Hypothetical
Protein


68


CA 02350775 2001-05-11
WO 00/Z7994 PCT/US99/26923
0709 vCT667)CT6b7 Hypothetical
Prouin


0710 ~CT666)CTb6b Hypothetical
Protein


0711 lCTbbS)CT665 Hypothetical
Protein


0713 (CTb63)CT663 Hypothetical
Prouin


0717 (CT6Sb)CTbSb Hypothetical
Prouin


0718 (CT6S7)CT637 Hypothetical
Prouin


0720 (CT659)CT659 Hypothetical
Prouin


0722 (CTbS4)CTbS4 Hypothetical
Prouin


0725 (CTbS2)CT652.1 Hypothetical
Prouin


1 0726 i CT620 Hypothetical
~ CT620)Prouin


0727 (CT619)CT619 Hypothetical
Ptouin


0739 fCTb38)CT368 Hypothetical
Prouin


0742 (CT63S)CT635 Hypothetical
Prouin


0746 (CTb32)CT632 Hypothetical
Prouin


I 0747 (CTb31CT631 Hypothetical
S ) Prouin


0751 (CTbSCT65I Hypotheti:at
1 Protein
)


0755 (CT616)CT616 Hypotheti:al
Prouin


0760 (CTbII)CT611 Hypotheti:alProuin


07b1 (CT610)CT610 Hypotheti:al
Prouin


0764 (CT648)CT648 Hypotheti:al
Prouin


0765 (C1'647)CT647 Hypotheti:al
Prouin


076b (CT646)CT64b Hypothetic
al Prouin


07b7 (CT64S)CT64S Hypothed
al Prouin


0770 (CT642)CT642 Hypotheti
;al Protein


25 0774 (CT606)CT60b.1 Hypothetical
Prouin


077b (CT605)CT60S Hypothetical
Protein


0779 (CT602)CT602 Hypothetical
Protein


0783 (CTS98)CTS98 Hypothetical
Protein


0791 (CTS90)CT590 Hypothetical
Protein


0792 (CTS89)CT589 Hypothet'rcal
Protein


0803 (CTS84)CTS84 Hypothetical
Prouin


0807 (CTS80)CTS80 Hypothetical
Protein


0808 (CTS79)CT579 Hypothetical
Prouin


0809 (CTS78)CTS78 Hypothetical
Protein


3 0810 (CTS77)CT577 Hypothetical
> Protein


0814 (CT573)CTS73 Hypothetical
Protein


0818 (CT569)CTS69 Hypothetical
Prouin


0819 (CTS68)CTS68 Hypothetical
Prouin


0820 (CTSb7)CTSb7 Hypothetical
Protein


0821 (CTS66)CTSbb Hypothetical
Protein


OB22 (CTSbS)CTS65 Hypothetical
Protein


0827 (CTS60)CTS60 Hypothetical
Prouin


0834 (CTSSb)CTSSb Hypothetical
Prouin


0840 (CT700)CT700 Hypothetical
Protein


45 0842 (CT702)CT702 Hypothetical
Protein


0843 (CT702)CT702 Hypothetical
Prouin


0852 (CT711CT71 ! Hypothetical
) Protein


0851 (CT712)CT712 Hypothetical
Prouin


0857 (CT716)CT7Ib Hypothetical
Prouin


OSS9 (CT718)CT718 Hypothetical
Prouin


0865 (CT724)CT724 Hypothetical
Prouin


0869 (CT728)CT728 Hypothetical
Prouin


0874 (Ct'773)CT733 Hypothetical
Protein


0875 (CT734)CT734 Hypothetical
Protein


55 0884 (CT741)CT741 HypotheticalProuin


0887 (CT744)CHLTR Possible
Phosphoprouin


0896 tCT753)CT751 Hypothetical
Prouin


69


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
0906 (CT7631CT763 Hypothetical Protein


0908 (CT764)CT764 Hypothetical Protein


0912 (CT768)CT768 Hypothetical Protein


0925 (CT779)CT779 Hypothetical Prouin


0938 (CT788)CT78B Hypothetical Protein


0939 (CT790)CT790 Hypothetical Prouin


0943 (CT794)CT794.1 Hypothetical Prouin


0945 (C'f795)CT795 Hypothetical Prouin


0956 (CT805)CTSOS Hypothetical Prouin


1 0960 (CT809)CT809 Hypothetical Prouin
~


0989 (CT832)CT832 Hypothetical Protein


0994 (CT837)CT837 Hypothetical Prouin


0995 (CT838)CT838 Hypothetical Prouin


0996 (CT839)CT839 Hypothetical Prouin


I 1002 (CTB45)CT845 Hypothetical Protein
S


1003 (CT846)CT846 Hypothetical Protein


1004 (CT847)CT847 Hypothetical Prouin


1005 (CT848)CT848 Hypothetical Prouin


1006 (CT849)CT849 Hypothetical Prouin


1001 (CT849)CT849.1 Hypothetical Protein


1008 (CT850)CT850 Hypothetical Prouin


1010 (CT852)CT852 Hypothetical Prouin


1011 (CT853)CT853 Hypothetical Prouin


1015 (CT857)CT857 Hypothetical Prouin


25 1016 (CT858)CT858 Hypothetical Prouin


IOl9 (CT860)CT860 Hypothetical Prooein


1020 (CT861CT861 Hypothetical Prouin
)


1022 {CT863)CT863 Hypothetical Prouin


1032 (CT373)CT373 Hypothetical Prouin


30 IOl3 (CT372)CT372 Hypothetical Prouin


1034 (ty CT371 Hypothetical Protein
f37I
)


1057 (CT356)CT356 Hypothetical Prouin


1058 (CT355)CT355 Hypothetical Prouin


1061 (CT330)CT330 Hypothetical Prouin


35 1077 (CT371CT77I Hypothetical Prouin
)


Coding Genes Vot in C. trachomaris


0486 Hypothetical Praline Permeau


0279 Possible ABC Transporter Petmease
Prouin


0505 3-Methyladenine DNA Glycosylue


0193 argR Similarity to Arginine
Reprcswr


1041 bioA Adenosylmethionine-8-Amitto-7-Oxononanoate
Aminouatuferue


1044 bioB Biotin Synthase


1042 bioD Dethiobiotin synthetue


45 0585 Similarity to Cps tneA 2


0562 CHIPS 43 kDa Prouin Homolog_I


0927 CHLPS 43 kDa Prouin Homolog_2


0928 CHLPS 43 kDa Prouin Homolog_3


0929 CHLPS 43 kDa Prouin Hornolog
4


1045 Conxrved Hypothetical Metttbrana
fhouin


0251 Conxrved Hypothetical Prouin


0278 Comerved Ouur Membrane Lipoprotein
Protein


0907 CutA-like Periplumic Divalent
Cation Tolerance Protein


0171 guaA GMP Synthase


55 0172 guaB lnosine 5'-Motwphosphue
Dehydrogenase


0608 Uridine 5'-Monophosphate Synthase


0735 Uridine Kinase




CA 02350775 2001-05-11
WO 00/2994 PCT/US99/26923
pgg0 Similar w Sacchnromyces
ctrevisiat 52.9KDa
Protein


0232 Similarity to 5'Wtethyhhioadanosine
Nucleosidue


1046 Tryptophan Hydroxylase


0477 yqeV Conserved Hypothetical
Bs Protein


0048 yqfF-Bs Conserved Hypothetical
1\A Protein


0587 yvyD_Bs Conxrved Hypothetical
Protein


0143 yxjG Conxrved Hypothetical
Bs_l Protein


0448 yxjG
Bs_2
Conserved
Hypothetical
Protein


0006 OI80 0440 0977


0007 oral o4ss o97s


' 0008 0190 0456 1018


0009 0203 0457 1023


- 0010 0204 0458 t027


OOII 0205 0459 1029


1 5 0012 0209 0460 1040


0028 0210 0467 1051


0029 0211 0462 1052


0034 0212 0463 1053


0041 0213 0464 1054


0042 ozl4 o46s loss


0047 0215 0472 1056


0044 0216 0473 1064


0045 0218 0481 1065


0046 0220 0483 1066


0047 0221 0492 1070


0049 0222 0493 1071


0050 0223 0494 1072


0051 0224 0498


0063 0225 0499


0064 0226 0516


0066 0233 0517


0067 0240 0523


0069 0241 0524


0070 0242 0553


3 5 0099 0243 0574


0124 0266 0600


0125 0267 0656


0126 0268 0664


0130 0277 0677


0131 0283 0678


0172 0284 0685


0142 0285 0686


0146 0287 0724


0147 0352 0731


olss o3s3 o74s


0156 0354 0753


0157 0355 0794


0158 0356 0795


Ols9 0357 0796


0162 o3s8 0797


0163 0365 0798


0164 0366 0799


0165 0367 0829


0166 0368 0830


5 5 0167 0371 0831


0168 0372 0881


0169 0375 0882


~1


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
OI70 0376 0913


0173 0391 0914


0174 0398 0930


0175 0404 0944


0177 0431 0964


0178 0432 0975


0179 0439 0976


72


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
r.
CKYFYLR..~YPPPP~rISIA:U. ~'K:.RVL1ITF:::Frlt:.Lf.l.uwl.F:.TL.:LFCiSMLS


Clalssdrdla t~~~nslt 0tnar Beaodw FCLG tCi.~.Ai.CCViJII9GLL :LLVKREI
.roulos P1YRPEEI Pf~V,,'LAPSEEPAiAAACK':
LACL


PKELOpLLTtDLOEVJ:BSI:R~tbSitYBltliilLNDAw(~IVFDEY'.~.~VV


CPn_OOOt )30 4
AOB~IDWFLINCGRSih!!!'AESLSLDLFNVSKRLCTLPSCDVAC~C'klGStlK!'tllJl~1


~.'l'001 hyp~~hnr ieal Protein
SLHCEIHKYAVAFORNSYAhAEKAFAKALuALEESVYRSL?QSYRDKFLESERAKIPNNG
TSLRRKANUGKIIRGLSSLIVLLCAW~GLICITHNKWILAKI~'OCVS'IPPR~RNLCKQSFf'
'


KRWCDEIKI '
CCYIIi iIACVICLLS!'CPFC3KK'aRHSHCD5C3SLuCHSHHSOKtIIWLRDDAK.iGCAEKKILi
AT HI


Q CPn OOtO.: 11768 15715
q75


~Pn nn0? 570 :.rm.. ,.:"
~ r


nt,r. y
... :,n.vi.i... ~..,., rr~... ...._ ; y'~.h,;vt~:: ~.~r..
'~rl.r': r:\c:x: :~:.r ~:,r--
. :: ~\F~:,I:I:I::.r. -w::;. : w
\:':" ;
'
w
::
T
::
v
'
'


.. .. FLKAWRKCAWtT'l'FEK1CF-
iKKNWAVEEANARRLKYVROWYDfiEFQKnY:6RLEKWAL
.
.
n
:.. _
.
:
.
,..I:.:I L;.F.c.::.:.:.::i
.:,.i :. "~.
:,.


HHWNVEDLREDSVTSDttJREEFLRHVPE.iIJGGLVKVPAVIKYP6YSVSIRDiKIQETRSNLEKAYGIEENYRCCVR
OpEIfYiIKEEEKKEAEFK>T~BCIL


Spear ~cer ~pp~',I,pIFSp(ytya',HILKL.OICGTAEVC41CIL$OIIBSRLEIVfm'V


CPn_0003 889 2370
KDIPCRIEEiEKTIJWAG.PLLPTKKAFEKACSOYNSCADILEKVKPYCXCSIaYYISKE


qatA-Glu tRNA Gin NsidotransEerae
RLVSLDEDLRRAYFfl~AFOCDSGLESEVRACRlCLRERIQEFL:pGLDL.VDfSLLCVS
~
IDK
KEOALEOAEf


.
SRW~II'Z~DCVSCIfKKGPPGKKFYAOYYDEIYRVRVOSIBtIflIISERLK1~VOAC~01LK
KINYRYSALEWtAViIGSLTA'ICVfafFFNRIEEA30VCAPISLC
G


KRSRGEPLGKi.ACVPUCIKLNItM'CLKTCCASRVL>T1YOPPFDATVVERIKI<F~CIILYKEIRKNKEKRLVGTKI
VA'l'QQRIQCFQPSOIVESSNOIVSLIt
ACLBEEDKVLKE6EYWL


KI1~EFAMGSTTLYS11FNF17BiP41DLSRVFOGSSCGSAAAVSARFCPVALCSD1GGSIA.
NRI~ftS


QPMFCCWGFKPSYCAVSRYCLVAFJ1SSLDDICPL.AN1'VmVALJB~VfSGADPKD11TSOKIIItFLF


REFFRDSFNSKLS?EVPIfVIGVPRTFLECLRDDIRCNFFSSL1IFDCl~rTHLVDVCLDIL
C' 0011 15877 Iddl1
' CPf1


'KEVIBIKILIr .
' OatB-IPet1121 Glu tRM1 Gln AmidotransEerase
3HAVSIYYILASAFJ1ATM.IIRFDCVRYGYRSPQ11fI1'ISOLYDISROQ3f1t8 Subunit)
'


LYL
P14YSIlfCAAPAIWVSP'IPPEfTItBYIPKDSKSRJ1LGITLLVL'GIIwV1fiG71IVtBGVIS
NYVLSAERQNVYYKKATAVRAKIVKAFRTAFEKCEILAMPVCSSPAFEIGEILDPIr!ELKELQ41'I
QDIYTVM4dLAYLPAIAVPSCFSKF7CLPLGL0IIG00C~DQQVCQVCYSFGEHAOIKOLFK


QIAITFJID
GLSALIVCCLCISTISL ..NVLfVIGLILLLRXRELTLEpIEA


SKRY)1K~S 09TDtSLEKIFiiSRYSDQCf
WR711'OKILDLESSLSSITSEFRDLRQLFDEEKIELLBGI


3833
RLLEFIAANLFKOCRDVYIi4GGHLADIRAYIIOPNtMNIWVIEKAKAWHEFIVLT'LlUIR


CPn_0004 233 f~fP
VacB-IPeelllt Glu CRNII Gln AnliriotransEeras
1B Subllnttl
'


G
LICQIOCCSRASINSAVYADWESVIGLEVHVGL.NI'ASIa.!'SSAIlJAFGDLPNlZ4IS~C10011 16596
1831?
CPi!


LPGSLPVGNOSAVEKAVLFGCAVECLISLLSRFZMKS7fFYPDgpRHIOLrppplplIl~i'
11 QatB-IPstllZl Clu tRNA Cln Asidotransferass
IB Subunitl


RTKAIVQGEERYFELAQTHIEDDiiIGNLKHFGEFACVDYIiRAGVPLILIV5KPQ0CPl~GIRVFFLI(NICYCLW(~
4Y00'~A~:RLLYNSVOKSYADRLFSYflITKMMDTPLIPNBE


VAYATSLVS:.LDYIGISDQ~B1EEGSIRFDVNVSVRPIa:SPELRNKVEIKN6~1SFA1NWAK00CAfJ10tAE'LEG
OKILLDYGKSIFWLNENDEINIl4DPWSWCWIIfKTRICVIpEVDDS


LFUfWRQIDEYLNOPt4KDPIG.VIPMTYRWDPEIGIKIYIJ9tLKESAt~YKYFPEPtE.PTD
13~1f~r r rrreSK~~KL(SDLVDRLEDU1K19fFlWKQI~VCIR
IIKVLIG1G


LQLTESYIERIRIiTLPF.LPYDKYfOtYIOEYGLSmIASILISDIOQIATFFEV11CKDGC~1F.
~
VKDLKAKYCGTVDPKQL1TF~11QGtVLT.E71.SLETFLDSIESELVOCLEDQDIYfiIt~DVI~L


RSLSlIWIIIYEFGGRCKTLGV10:.PSSGiFPEGVACLVNAIt7pCVIIGKIAKEIA0U11ESPM'1'O~EEODI~~I
"~'WK~'~~IITIZ?BC
VDY10~KTKAiGFLVCOtIBCtT


GKNPEEILIC>XPELI.PNSDE7GELQKIIAEIfVLANPE.STDNKARLKILIfEDITSVLPEIDEIL'TCISLLEi.P
LL.TTRELLTKSYLICFKICSETLiafl'S


AGIUPPKRVNELLLLLDKG
VF~lII7NOEYEVOLONLCFR4CISQKTGKKQDOFAId.EDOVAL4KKRLKEL'1'~'iFCIQ


GFNFIOC~FItIIAAKDLYIRST


AKI~.C$LpI.DBKFi.LOICEIIGCCEIRQI(ltpQRNADRSRfITI'YQKLIIIAEG.ALEL.1UDCI


18109 21106
Outer Nse6sane~Protein
AVDfi'S~P. OFPOEIfTPFVKVQAVttARODSFVBLCAISRDFSDSHL.YM.AIPIaIU.IGDtF


t'IdDYTEIdGItGSIECRPNARHmIHCCSKlRPAOpYYHWJWYS CSNLARQAGIYQA9GFRSLGAAA6


CPn_0006 7299 7111 LFrIIICF!'SfRCBSRSYNVDAGSKIKF


No robust homoloQ Dcesent in Gentbenk/EI~L
as of 11/7/98 CRL0011 11365 21922


KQLQEPLRSALLERLSEWLVLtGITSPETTRSTPEKpMpLPKDSRNItTLESLp11~3-POlymorphic Ouur
Nalsbsane
Protein


0007 7498 10196
IONOSIYPI't0C8SFPKFVFSTFAIFPLSNIATETYLDSSASFDCNKNf~IFBVIIESQEDA~D
CPn '
'
'


_ YDAG
No robust houolop Dresent in Cewbank/E'!'Z8LIYAGAAVHSS
as of 11/7/98 KCDLTP1~ISLLF(71
TTYLfKf~VTLENLPGTCL'AITIfSCFNtII
"


KSFRYNLSLIFSFLWIPLTDSTTSSLSI'SLLOECNPOSt9IKLRILAIVLIALSIILIAGIIGSLSLTIWSVCSSAKT
ICRIIIAVLS
WDK.STTFIC!'SSLSFI1LSPCSSITTCKGAVSCS


GWLLTVAIPGLSSVISSPACNGACALGCV!C.AIGIDVLLXXAEVPIVIrISV'lITPG'IIGSPOfU.FH


PRSGISISCADSTIRSLP1'YLLDmQiPQSNRKtRILAIVLIVFSIILIASC<fVLL'M1IP


GLSSVISSP7~tIGACALCCIIHLrILGItNLi.IfJIRIVPIVLASYIT?PCi'GSPRSGLSISGACPn,-0015
21875 71171


DSl'IRSLPTYPGDEGHPOSNRItLRILAIVLIVFSIILIASGWLLTVAIPGLSSIISSPApmD_3-PNP_7
IEramt-shift with 0011
'


FMG71CALGCI.rLAiGIOVLLKKREVPIWPAPIPEEWIDDIDEESIItLQQEAtAALARLSSKKOGAIplSDALTIT
LEFdQiVSLLPSKNFSTDNGC11ITAKTLSLTCTTMSALFSEIJ1


PEE?tSAFECYIKWFSNLI~cSLPYDCHGLEEtcTKtIQIRWRSSLKANVPEFLDIRRIFGNOGEVSFSDN'I'SSDSG
AiIIITEASVTISNNAKVSPIDNKVIGASa~SITCDNStxIIIGY
'
'


EEEFF?LSaRKRLIDIATTLVERKILTEQLEW9iLRJtAESYLYQDSIPIOtIItINFEIWAPRGGA
l
KTS'fOTIML1'CNOIG.LFSDBI'l'S1TAGGAIYVKKLCLASGCLTLFSRHSVI~G
'


WKtTIILSKSICRPTIIFENHEtKIIAKSLLHIQ4AVLLEKIIIYRSLOKSYRDIGNSSAl00CISAKM'ALRSAACMI
YFYD
IAIEDSCELSLSADSCDLVFLGNTYfSTI'PCTNRSSIDLC'1
'


LHCNPFFSLEDNIaTINKtNAEl4<.ESLSSYRKVFLALSOENVVD'1'PSDPKIbrDISCIpCRfEAADSKMTSKLLQ
PVrLS
PITICSSTZYi'DVLKVNCI'PADSJ1LOY1'rNIIFTCICiSE


wILSEISRDEQNOKKAHLKNOESLYTQARDRL'1'DOSSKENOKELEIUWEYISSNERVKK~aTLSGKNGVTLQ'SOAP
'1~OADSRLCHDN'T1'LCPAt7lSTINNLVINISSIDCAKKAKIE


F=IERVQERIIGIOKLYPNILEREEEi'IGOETVTPfVQCZTASSDLTDILGRIEYSSREDTKJ1TSKNLTLSC?ITLL
DPTGTFYCrtIISLRNPQSYDIt.ELNASGT<?STAVTPDPIIbEK


:JQNQESCVKVLRSIffiVEflSirE\IKQEYGPKKKEPQOOMGSLERFFTEHIEELEVL4KDYSKFHYGYQG'1WGPI
VNCEGASTTATFNN'tKTGIfIPNPERIGSLVPNSLNNAPIDISSLNYtJI


HLSYFKKVtINKKEVQYAKFRLKVLESDLfCILAOTESAESLLTQEELPILATItGALCKAVE1'ANEGLQGDRAfWCA
GLSNFFHKDSTKTRRCFRHLSGGYVIGGNLIIECSDKIISAAFCQ
'
'


FKGSLCCALASKJ1KPYFEEDPRFQDSLri'QLIGLTLRL.OEAKASLEEEIKRFSNLENDIAEI
LFGRWIDYFVAKtfQCTVYGn'LrfOHI4CTYISLPCKLRPCSLSYYPTEIPVLFSCM.BY


ERRLLKFSKQ1'FERAGL,GVLRELAVESTYDLRSLTN7ylECCPESEKVYFSNYLNYYNEEKH?ONDLKTKYITYPI'
VKCSW~IOSFALEPxRAPICLDESALfEQYNPFIOfLpNYANDE
'


RRAKTRLVMCORYRDFKM1ILEAl~FNEE71LLDEEL.~aIQAPSEL
riFKEOCTF~1REFCSSRLVNLdLFICIRFDKE80CQ0ATYNL?LGY7If~.VRSNIOCTlI


RISGCFCTNLARQAL\tRACNHFCFNSNFEAFSOFSFELRCSSRHYNVLKGAKYQF


!:Pn_0009 10780 11685
7 CPn 0016 21383 251B8
98


/ 1~lymocphic Outar Nelntcant Pcottin
No robusr homoloq present in Genebank/EHBLmp
as oL 11/
S
~


A f
VLLLTLGIPCLTdGISFCACLGF _
':KYSYLLNYPPPPRRSLGVSCSKLRSLSITL.LVLu'sL7IFFLLMSVSADAADLTLCSRDSYNCDI'~a'fTEl1'P
K
' RSDFALKRCCHNRSSFSLLLIs3
'


1KICTLDRLPKE .
PSEEPALEKAQKEPF
MTSDASG1TYILDGDVSI:'r'AI:KQTGLTT.~.CFSNTACNLTFIGNu'FSWFONLIBSTVA
CGG'JL'JISGi.LFLLVRREVPIVRSEEIPRCVSVI
(.DOLCI'YIQEVFACLERLKDPKYfDRCLLTEAKEKLRVFDWEKOMHSEFLDIQRVWEE


AYWEHCODFLENIAYEIFSSQELitDYYCAGYCCYLPSCt)ARADRLKRa'VKEVI~ItFNRV~NTMOCiTIfp3CFST
'..RNiJVIPRTTr:KKGAIKITOf:LVFE.sICtJt~LJDrAB&OJC


TWkWEASVMLOHSYCVARELFKIUVGVLEESVYKILFKSYRI)J1FYDCEKAKIQRDGRFKr.AINTKTGSL'LC.STR
fYAFG:NB3.i0Qa:AIYASCO.."VI~ENACILSFGNNSATTSOGiII
'
'


At7~
~nABDNLVTSNNQNtPFDCCK.aTTtIX:AtUNKMIANPDPILTI:ii.NESWFWM
'


1'ST
~.AIYTKKLVLSS-GROCVLF:NNYAMIATPF.GC:AIALLDSCEICLSADtL:NI
IF~tfl


<.'Pn
'P:GPA.art'RNAL~15NAKFWLPJiTR:IIY.'ItFYDPITS:X:ATfRIL.~,LttKAOIIL'w70tfiYE
DOU') i1e89 11117


_
~:YtVF!X:EKL.vEEELKKFONLK.:TF'h~lv/KLNi7ALVLKDf~J717t:1Nr!'ft~iSKVYIID
tNr mMr.~r honnlorr prrrsa:nt in
r:aneb.<nk/P11BL na of I1/7/nR


tri':;AIIAF/JRYRDINfxWEDLKQ'l'IFWVf:EIIOCTDLC111RN::CHWLDRYaDKFILREKEEKrY:ITFF.
n\:.AhT:V'Ml4c.l.AINID:.LDt:TtIYAf
IKATAA:.KDVAL:x:1'fNI.VLYIOfY'IYYPJIH


Nf.RHELFIIATtIVRKA::f:IIAYAKAKMFEKER::Nt~RY.VKDVEK4iLSIG:U1EFRNpFSRRtlL:~wJVt'f
t.ft:1~:A0rlrM!'~rLtltY1'I:If!'ITNIIYr:YQtiWltaJt.'IA:NIn'!k'KtNtICCIIW


AftERLkt3A,"l'LYPEV.SI/F:ERVLERQRTKKVNLFNL'fAD
L EIfKYfOK.'VREOEH'MIfEVFI7K


I ALYRt?%:LKVI::AEEII:a:LLQftLEGt:
Ll:'ISi::KKLTKAE~.'VFE?IKFM\TEKIltIKVLED~
'


'./'PNRLEIIa:EDAEFI~ItPRIEEIENTLIbAI/ELt'LL.FItKNI'FEKA.SL.vYN.'.t:KENf.AKVEPO.
;~II
':In ~ml'l ..",~
lu


:Y.F~PTYR::::("RNLERLN~~fH.~rPn\YTtr.'QERL(t:F::GLE.~.KVRfCRDIILRFJJNKIIFEVOC:f
~uf~1'IMI' I Ilr.nrur-,:hit m.f.
nul~.l


~HFfNRI:W.WW:Af:Lt'IVARLDLVATVPYREPYt.~IYIItItKREKVR::~rMIAKTERYREIROtW!t'fNt:l
K~:n:ItVWVf>r\'rAYTKNATI:IWI'K'1':YY.fiJPFHrJ:I'LVtfc:lYl::FVDVR:,


Al:r~r;V11KR1'LI.AP:ITtIf*RF.Iri'WIdJ7pOWLl.RDERKNfwRRI.(t'NKIIAMJJhVKf:FIV::II
IIrR::r::::l::::L"1711.YA':i:INit'Idll:IIJKr:NVk::INII::.~.AtiYAll%%SFFTI~:~FFN



r'At'' LL.tt :Yl7KIt111.VAKtIrfIIV'/IJ:AM::YhI
ll a :1~:Y'CI JvK 11.:'r ~I::Ik:l.f'PVtTt1111FAYr:


.:Irr
IrrllINbrl'rKY'h:I::INKr:a.A:HLAtt:IF;Y:AI1NVA:%:NR::4N1rlrrlFlN1.P11IYN1()
rlnill I f 1'.'.1 11 f:'S


_ rltrt'Kt?%Tt7t:lt:a'V::l'.IA.t'NI
IY, tr.lr,::r lumn.lm pn'.rrnr
\'/1~r:IYt'YN.Y::frY:TIIII::IAYI'I'INfRNIrHITITLM
in r:,-rrlr.rnk/EMDI, .r.: .r
I1/7/'IH


73


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
fYAIOLGGRfCF PSIIITTVDStTlISDSPILG
:'VEKLLDE:EE,~,rEJps't'ERLLpSIG,FILPt.MDtPFF
fEVELRGSSR '
'
'


. l iCLVLTKKCNADILKVSFIpLNKIOVAJIRILA
.L.iRpALLVILU:NHHAFA~NPE. . PClIMP ILI ESCPYIfEVL.KY411t::::OK
.
v~~.,'W.~.tr.'~


LNPICCCSAOVLBSZiEIITAILT~~~
~~


=Pn_OOIN :751! 7'3003 BNALI
LLKLNPLFKBapIFGCH.~.DF'fEPGII1"G~~
ALTT'71


I>eP_5-POlymnrPhu outer Membrane
LLKKELDISALOSSINOKIEATITKSOKEPFLKEOLKTIKKEiCLEJICOAJIIDTfKIBER
Protein '
GA33IVLtIMTTPIStPEDGFICDL'~INtFSPKSTCDJ1AG1'lYS


:.YNMKTSVSMLLALLCS SSAEY71ICANYLDWLTLTPWCIQSKCr107LKlGE
rTCF'!TIVGDLTFL~I~tfLKFLSVDAI:ANIAVAHVpCSKNLSLRKRtNPDYAIIiIIIODEIEK:.C'l'LE~.
IAIGG3I' KG~
IIC
CY
C
"RSTAKYLIIAKF
'l
P
'
'


. a
L: ~EVLYIDP ..SKGL
CAYOLODIMWL?SNASVEOCCIITKCNSCLIO L
iPKSIIYRCKGSLVSL G
FTDFL P
iLVITE CI
:
SIv
iVIi'llfOtIYGLDEIKORILE:.I,iVGK
SY11CDPA
i0
tPVIMIDLYOKIG~
KMVDat
S
A
0


. .
. p
. l
!'fAN ~
:LTiElAltf~fLKFNENKIIVTSGCALDLGAASTI
7f 0
t' FRF3VC~fADFJIEIKGHRRTYICAMPG
~ ~
'


_ . . ...lt:\. F
SA ~.. in!~.,. is..
~~ !
.IKtI.;AIFf:OIIt:RKKt . . .... ..
_ rv:.v ICL'.Y
. .. .. . . ::::T~f. ~r ::. '.:;~...: :~..v. WC
''.\:"'9:.'.:\"::1NE.1.:1~T.,....).KfIYLR
_ '
':dt'.'t.': s
' F r
"'

~

~
,
~



. ..
~ .
I:i.:
UCf'J,~WEKPKSKKLTFKI:::iKNWtY:w:KflF:i.iURFYESTP4GVA:IGLiiWfi
~L "f~TL
~ .".: ."IL\TTU
'J:i.:
::.1:.:N..~..:' :!
i~:..'~ 7~Y
:!tl
AIYFYDPITTND1GASON
'


TAKYIfCLAASOON
YIESVOVSSLKTDlOILTCOAGEVNKESSCIIWIIYLNSALHRYAPGYTFFPI~pNHINIPE
r:GDLVF~QVTN:APNATCKM1VIHLE.o AASA
LSCSIVFSGERLSI'AFJIIAt7~ILTSRINOPV'I'LVEGSLVL.KOGV'fLSTOG'
RINEVS


L SLL.iLLLETPWFNLQ~ETTLTGRVLGVOGIR6KLIAA
J1NOK GJ1TPKDCPSACITfM


fa"OEPESTLL:.Otar~TSL
Ii~tILIFPF~ImRDYEELPA'ILIrCGLItINFYSHY00VLKV11FPKLK


CPr>_0019 19007 30756
0028 43318 12513
CPn


pmp_5-PMP_5 Itrtlmesht,Et with 0019)_
O NO robust hanoloA Present in Genebank/EI~OL
' as of 11/7/91


CitaLVNJIOGAtYit IO'IFLOFFHPIVFSOQSLSFLPYIGKSBGI
ASTEDIVITNLSINADTIYGKNPINIV11SA7lNItNITLIIEKCSNIVEHYLHICCDTSVITTGVSCATFL
CHWNIIpVIPCICfQPSQAl6.it'RI
Y
Y


G
8VON71LPISKSEXITKIISYILILPLIIaLfIKIVLRIILFFKYAGLILDVKKEDi.IaTL
O
DYSFVlCLSPGRGCt IITOtUISOKPL6VAPSAPH
RTCYLPNPEROGSLVPNSLHGSFVDORAIOEINVNSSOILCOEAOVWGi1GI11NPI3IRDICI


NEItCYRHSCVCYLVGVGTHAFSOATIN11AFC0LFSRDXO'YWSK1~CCSYSGVVFLEDTLTPOOENLSLPLPSPTTL
KIIINrILIfILVRSGIOfYNELIOECFSFTKITIMr00AP811~DIC


EFRSPQCfYTDSSSE7ICCNpW"I>xIDLSYSHtIldJDtlXTKYTTYPFJ10G&fANDVFGLEFFSYNSLLPNIYFHS
LVSVPNISCEERAIlIYItKEQQEENAVKLKTIpACSFVfASLIIi.PSh


GAT'fYYYPNSTFLFDYYSPFLRI4CTYAHOEDFKiI'OOEVRtIFI'SG18.FFE~VPTCVKFEOTKDKKAGFGLL.T
FFPWKIYPL


RFSOCKRGSYEL: JIYVPOVIRKDPKBTATLASGJI'1WSTFI~R~SROCLOLAL~ICLIN
CPlf~0029 13839 13390


PGIE91FSF1GAIL3.ACSSPNYHINLCCKYRF No robust homoloQ present in
Genebank/k9'18L
as of 11/7/9L


SNID~tpINHdTYCFNLFRYIRFFR71INIAM4DGLRFCYSYILLRPIdWSSLT.R1~00ELL


CPn_0020 32717 30603
KK10IKLRTZ'STIISSLISLR00LGKAE71TOSDILYGTSRFQYGNSFEIEDPRIPPlNilitQ
Predicted OMP !leader 111) peptide:
outer membrane!


KLWSFIPNt.RIIOCRCFLFLABFVI~fGSSADALTH0EAV10IKNSYLSHFKSVSGIVTIEDCVLOEIIIiSRSVNFL
KIKFYVYLNSERNKTKP


uIIHIaILATCAenIVnrEarlvccss.IavAtK~avMVrnR~LKTLVCDYLEYYEVrI)scLLTNc


RFANYPWFLOGSNITLTPETIVIRKClISTSEGPKIfI>LCLSODYLEYSSSLLSIGKTfLCPtL0030 13A10
14529


RVCRTPILFLPPFSiMPMEIPKpPTNFRGGfGGFLGSYLCIISYSPISA1WFSSfFlLDSF0cp-
OSialoglycoprocein F.~Opepcidase


FKtICVCMGFNLHC~KQVPQ11>fIBOCSYYAt~LJIItkIJIEAHbRYRLNGDFCFTItKHVNfSLKNCTtTIfSLFF
YIKNRANYFYKYVIIDISGYYPFL71CV1R~pQVLFJMSLPIR7PDI.GIVLE


GEYItLSDSWCfVHDIFPNNFML10'tl'CPTRVDCl51tR7lIYFECYLTSSVKVNSFONiWOa.PFLFKSIDd.6F0
0VAVALGPGNFSATAIGISFA0GL71NJIKNVPIdGYSSLITYLLSKDODt


YLTLRpYPISIYl~Ti'f.VYLFNIVECG)fLNPIIFSDHIVGFNFSSLRLiWtPKLHKTVPLPIGALIILPLCKPGGY
LTLSSEIPEECLNEKRRGSIGf~LLSYLE71SDYCVAI~Y~NI8PNP0


'."LSS'17.GSSLIYYSDVPEISSRN50LSAKLOLDYRFLLHKSYIORAIIITEPFVTFITLTRLFASSFSDKTTVEE
VAPSVEpIARNVTSOFlISVEIIDItOLSPDYRSYSCIf


PLIIQIEOHYIFSIODAFNSLNLLKAGTCfSVLSXIt4PAFPRIN)11Q.F11ZNIL9~TfESKPT


FPKTACELSLPFGKKtIfVSLD~tIWIQOiCWDlB4~tTRNEifIIA'IDNAK1LESTJDtSKYBLCPeL0031
11708 44A84


tKCDRENFILINSRPIDQLLDSPLSDNRNLI rs21-521 Ribosomal Protein
DTPN


YLEYpNILGT)CTFF2hIQLYGVYERRFaDSAFFFETJC.DKPIDIPPFCMPSVKVRVGEPVDRALAIL10CPCIDKEG
ILKAAKSHRtYOKPSVKKIWf8101ilAKYRBA


0021 34470 32707 CPtL0032 14A81 16098
~Pn


_ dnW-Meat Shook Protein J
Predicted OMP (leader 119) pepeidel


CSRSPYPNIETL71AGVF3IRSMGLFNLTLFGLLLCSLPISLV1LKFPESOCHKILYISTOSTSLIGQIVItFVCSVSQ
IDYYSIIGISKTAS71EEIKItAYR7G.71VKYMPDKNPGDR1171iKA1'KE


QOALATYLPaLDJIYGDHDFFVi.AICIC>~YZJCOSZNS80PQ'fRKSTIIG7lGiJ4GSSFaiaVVSEJ1YLYLSDP
OKFDSYDRPGImCPF
'


LSQ11!!!1'ADPLOpLLVIS7IVSGltGGKTSDDLLFKALJ1SPYPVTRLE7UlYRLillRi0if1'INIr
FFDGLfOCLGE71F'0llRSDPAGARQGABKKVHTNLTFEEAAlIDVAIiLW80YKBCIft~


DHLNSFINKLPEEIf7CLSAAIFi.Rf.E2EESDJ1YIRDLL71J1I0I871IRSATALOIGIYODKROG71VNPGCIK
SCERCKCSGpNVOSRGFFSN7LS11CPECGG8GAII1'DPC8~11000A~II


FLPTLRNLLTSASPQOpEAILYAIGKL1~GQSYYNIKXQLQKPDVDIn'LAAAOALIAIGKRSVNVMIPAGVD
YVFIDVESNlVFp1RG00f.ILdPI


EEDiILPVIK100ALEERPRALYALRfrLPSEIGIPIALPIPLKT1Q'ISFaICLNNALhLL>tLGCCFVDA7Ii~lOO
CEIP1T.LKTDGSCRLTVPEOIOSGTTLKVANOGFPMIIiIOKl311fiDti.VItIS


17T1>XLLEYITERLVOPNYNETLALSFSI~RTLONWKRVNIIVPQDPOEAERLLBTlRGLEVCfP(~1LS880XELLR
TF11STLKAGiFPIO(RSPLDKIKCFFSDF1Y


EpILTFLFRLPKE11YLPCIYKLTr~QKTOLATT11ISFL8lffSHQIALDIi.FQJWIID.PGEP'


IIRAyJIDUITyMItICppEtO(RSLHDyA>LIO~fLLF~T~IORpIip~fpyLAYQVTPESCPn_0033 16129
43171


RTRlG.DILETL71TSK8SEDIRLLIQIXf80DAl01FPVLAGLLIKIVEpdhAiB/odbJliodb8-lpyruvacef
Oxoisovalerace
Dehydsopeaase Alpha


i Beta Fusion


CPn
ERSIk:VtroFIOVISSIRDVLKLVwELR!'AEtIK,C.LLSAOSGSOGTIbLSCIlOIt>Q.i~fL)1G
0022 35042 34395


_
KSLIPGKDiiSFPYYRDOCFPIGIGCDLSEIFASFWAZTPNIISSAAIIIPYIIYrBIC
,Nyt


TILQVIStICCiNSNTRSFYSMSLPLViGSSSPRRKTILEKFRVPFTVIPBNPDESKVSYSC085V9CI10!'LOAAGR
7WIAYKfISSADEYVYVBGCOGJITSpGEFNmIIJO!VJItiKaLtLITII


GDPIAYTQEIrI~OKAYAVSELHSPCDCTILTGD1'IVSYDGAItTKPODKADAIQtC.KTLRI~WAISVPfEDOCGiI
DIJISLGRCIIOGLAVYEVDOGTIYTSLTETFSHJIVDOwIbNSVP


NQ1'HITItIiSIAVLNKCKLLiGSET50ISLTNIPDIIAIESYID1YCT'IMiCGAYDUCHGGLALILTDVVRLSSNS
NSDNQEKYRSt_~r rr
e(~DPLILLEKE7IIMIfCiSPFIIEEIK11


ZLIDfNtGCVYNVpCLPIOTLKYLLEELHIDLWDYSItIIOEEVRXSCETAEALPFPSKGSTSHEVFSPYTGTLIDYEN
&ESCPKVl4tMI


SE11LVEEMTR054yIVfCEDVACDIIGf.IF4YtRNLTEKFGPpACFNSPLJ1GTITGTAIC


CPrL0023 36657 35011
MALDGTHKP1NEIOFADYIWPGINDLFSEASSIYYRSACiWEVPLVTA71P~T000PY


Y5)K/alr-A8C Transporter Protein
HSOSIEGFLAHCPCIKVAYPSNA11D7dULLWl7IIiIDPNPWFLENKJ1T.Y0A1(IFS7IGlVF
ATPase


ENRAKLLYSKOHFVt4.S7WSIVLDKIGKBLGTRILFDDVSWFNPGNCYGLTGPNGACICSSNOYVLPFGKAAIVtiPG
KDLTTVSGKRfPLVLSLEYApELASACISTMDLRilNFCDFA


TLLKIINQIIEPTACSISLPKKVGILRONIDBFNDf'M.DCVIFI~TfRIF1t71LQRRONLYTVLKSLEKTGRLLVIH
EASEFCGFGSELVATMSEpCYAYLDAPIRRLOCLNJIPVp)ISKVL


C,QEPTDAIGMELCEIEEIICEFaICYRA0S8AEELLTGIGIPNENFDK101A1fIPIDI4FRVtNEIILPHKESILOA
AKSLAEF


LLCOALFGNPEALLLDEPTrMLDLYSINWfGNFLImYEGTVIWSHDRHFWTITTHIAD


IDYDTIIIYPGNYDOMhI4KTASRDQEK71DIKSKEKKISOLKEtIIAKfC7lGSRASOVOSRCPn_0034 19196
48210


LREIKKL0P0ELKKSNIORPYIRFPLSDKBSGKWL.SLEAITKOYGOHQVIHPFSLETYOCT315 hypothetical
Protein


CDKLGI1CNNGLGKTTIaOCLLACLVFAPSBGSIKLGHOATCSYFPONFISDVLADCGOE'fLFVNFLLP1'fCRGI4M
EISTPSLPOSSIVSOKTPPVPDPDSSPOHIPTIP3'pAPFKKP~Dt


EWLRNRKTGINDOEIASVLGKNLFGGDDAfKOIpALSCCETAALLMAGMILENtOJVLILDETPSSIVNJIIAFAIG7t
FLSCIL~GVP'AICLGCSLEITMPLFILTAVFIAFTLLYFINYLEK


EAIdIHLDLESVSALSWATNDYKGTAIFVSHDRCLIODCATKLLIFDKDItITFPOG?MYDYPKIPCPLPTPPPSPfLM
PTLTPIPAPAPGIPLPP'fLPINDRTKLTCNPDIIIYPB'f)IDP


TAGNKQLL
KACFSLLKQLFSLDPETRPEDRKYSNKLASTLLRSKEKSGFRFHCFKCIIPBNOKILNKKS


G11WISSHSSMDFSTTIrGMPAVITCi,QRSCwEKIKNNIPTPEIWLPIG~fSCPNDVEE


CPn_0024 37605 3b661
GAQLYTSHLIVINPPTLETLIKEKMRRAITLIIDFSNKEAFTNLVWYIACFDTCIGI~iLE


xerGIncegrase/recombinase
SVOLEVFGLNNLSADOEEFTTWESCCHLAtd.ESVRILLASKEIYALSNVSVN8I8pVPLQ


REVMIAStYSFLDYLKMNtSASPNTLRNYCLDWGLKIFLEERCNLAPSSPLOLATEItRKTACMLFLN


vSELPF3LtTKEHVRHYIAKLtENGKAKRTIKRCLSSIKSFAHYCVIOKILLFIJPAETIH


CPRLPKELPSPMT'fAOVEVLMATPDtSKYHCLR0RCt14ELFYSSGLRISEIVAVNKODfD~'.Pn_0035 51115
99569


LSTHLIRIAGKGKKEAIIPVTSNAtOWIQIYLNHPDRKRLEKOPOAtFt.NRfGRRISTRSCT33:(
nyporhecic~l protein


IDRSFOEILRASCfsCHITPHTIRHTIATHWLESCMDLKTIOALLGHSSLETTIIIY'fOVSARTTLEF~AGSSLKPLP
IITFPCATALYITHRAERKSEHOMWNRCQVFSSFFFRYPISSfiL.


VKLKKtytH0E7WPHA
IRLRASCECFOORHPIFLCGLYWLAGITSItCHPECSALILIPIGNPLPRNPKONLPLASA


WtISIlfLTPAPFLHDCPISGTFVTHHAOCpCCYYGEA(CTOTPCGKRAHNLSCpILSESR


CPn_0025 38610 37681
LELKKV'IELECTLNHTCOIVFKSNACYKEIPRSRFYIMKEKCRCiSCHFL1RIRPPSSEVC


slat:latsA-
9ulptushydrolase/GlyeosulfacasePFAa'SLLLCTPLPONLRDLfROKGi.SHLFAISGWHFBLCATTtiML
CALLPLKIKKILSF


ILMSSRELLIIl.CSSOpPTRTRNpCAYt.FRWNGHGLLFOPGF.CTQROFIFANIAP1"IVNRIVLTSI.ACiFPMSL
:.1IWRSWISVTLLCFSWCF"..CSC~ItIRIGAOFtLCGIFF8PF8PTF


IFVSHFHr,DHCLGLCBMLHRWLDKVSHPIHCYYPASGK1(YFDRLIIYCtIYNETIOWENVLSFLATL(:ILLFFPKI
FSFLYTPW~pFLSPtyR.YPIR'ILANTLAISL3AOLFIVLPT110


PI3EECIVEDFC3FRIEAORLQFIpV01'LL19RLTEPDTIKFLPKELESRGTRCLIIODLIRYffLSLPLE7DLL'lI
iLIVPFTILPIIVFLIATITt.PCCCFTTFJ1LI0CFGSNPIi<JIFIPNILK


DpEISIrI:~TNYGi0V5YVRKGD:aIAtIADTLPCOMIDLAKN.iCMMLCE.3T'lLEOHRNL'rL$FAFVPPWl1LT
41::LILFFTGILRTINSPYIISISAT.iIRPTlTL


AE~IIFHMTAKQAATLJ1KRM't'QKLILTI1F.~'.ARY
WLDDFYKEA;AVFPMMVApEYRSYP


FfIfNPLCtIK nPn ou f4 SOr.i') 5179':


IT'I!H hylx~rh~r i.'.O prr)tsrn


I:PIr o02r, sr.:17 )s7o2
AK::I~IG::GftKKMYKI'ONIR:'rfD'/RSFFFFDVLCIEOLFY.El1~~YIF.W:.AKtFRLPpIX(EL


rT(4'. irlpxll.:r.iral prot~tn
Mr:L:rKRGRLtIFr:IDIWv:::VG:IEtIKE.:F:ICRFFGLLETIEVYI'IRLEKEP'fQLKIIFYVF


.:NF.1M::IILIt::List)::VT::YfIIKI'r~PIKt)AAFt:K::IIeI~Ir:NtAYLtII(.'yl.V'Pf/LW:
AMLItOraaia~".iRI'fIJ.~I'tWIIIRLPNIf7DRilYEY.FF.~.iNtY:Fr7KWEDf7:IF1'NP.::I
VfIr.~K


'MfII'::Wa19r:1.::::LALLVI.I.:':IF'NfCLINWft:M:K'rKKIAFKIM:C~:UITYaA::RKC~~.~nl
'LV'/MNKN~At:I:Nt.'Y:;11:IF1PYr:IERPFA'l0.~.FFFDPRtRRGLI::Iff/t.LNBE::LE


I! L::1'11111A Ih:l'KlIF I It'fULRK<SVNyN'rNKFK:xI IIt::I.FT I L:L1J
II::Y.::YYI'::f
:FE:: FI I I :UE1111:3fRO::KN.~.::EL:IiLKNY1J1:3EGYlNE
I F i::Df:


.:::I'l:llINYAKu:Y.A)119'ATtKl::K'r:.'ftf~:::KKKK%TKII
:1lIRTf~.:: f )IKR.~.APKfMI/P.~.K


KitKINI.I,Y.K')VILII(aH.(:IIp:::x:NF:aU:.;::PPfNqpKAILf4IFy'Kpl'fGF::IyUll:'1
~1'1(li 'Wll:


la:N 1t:: 1INa:l4e...rc: ~.:r Fr,Irin
~ Nlrr


~'In U()::'1 q.'.~':.'. tv~1'IH KLI~k's:l':LHAI
ILI.:::lal.IItT1'IIVIM.'IMNKfTRTfLE.iEKD1'QOUIFFt:~J191.'1'/KN
2~~T


Lu. L.r. ff'I' 1:(w:rrl.nr FW
t..nr.:AN:IINNf/uNIVlll.t'Ir:I:IY'fiJIIFT'IN:IITTNAN.~.INiLIJIUtAWJt7I3KIl:I1'I
R::VFJI
'


74


CA 02350775 2001-05-11
WO 00/27994 PCTNS99I26923
CPGPCFDIItFIDtLKtANFE~ 'E?I~CYK:Ef~.GKRCIEKLTf:TPILEKYQRIDDRD


HAILC*h~DAFS'~'~:EL AK tLKOLRAOLLr'~LF3CR..tY~ICA IPV
VLLILL.WfIYCALKAL.: Pfl4:.KSP?IlffGYIA
Y~1~S"
'


.
ILTL.iLWCRGTCIE'~'~A'~IMfl~tL3YP8L1!~TAFa.LPIJIi
rIKLbCIIAIri~' I~~RE
VFiOi
G
'


~:fhytOtH 'W1::' S)dJl A
9IS
L7C3WRIOVStJtRViA
OL1~1118WFLSINL


~"1 to hypochec ica: pr~rttrt
AL.W10GIESPVYSLITAIa~WALLPVFFJ18F~GASI't~tFBLLTYLSPC~ALLKRLFKt7IPCI
SGl'r?IELMRIACT::Y:f.7IALGKVFFLGT~PLMIRELTLPpEEVEHEINRYYKAtl1~t~~ICADSLYCLVAAHY
NOIGKLINiGFFS~ICIL005t~8L8PL
HD'P Y~~t


O
K3DIW1Lt70EVL(7~CLOEVSStLOANLEINKDPLLTFLVVFfttRlCDR10U1YVFSSVECAIMIFIRttIP04vF2
.Att01(GLPESFICVLEEHtIGT5VIR5AYY5NlIV~7PSTGSFDE4.


F:;
FttYSGNKPSSIIETT<'tINIADSFEAASRSLIQIASLPOLQRLID0II0GIti.00COFSCSPIT
LTIJR(TIP.~.'J'IDRVQOIHDh7IRVIGHLCCCNKSSLGE.iDONLIIFSEELTPS
H.KIEE
~


. :,~,hVlr/rr -, ~ .\~. ..p~..,..
. t fr~w
AYIRI:FV.'.f.X:AA?:Irt'AtIISRAKSIPYLANISEEii410II1KRYNCKLVLII7GY
.
NAAN
~


r . . . ..
.
. : :~a::
.... ... , ;11...:..,1,,:: ;..;..,.,.I;:,~.:
.. ~~ ,..1,L; , -.
::.:.:.. :'1.f:l.l'f.iv::.Yi'.'I':.:Yfll!F'.
~RY1
:
'
"1
:'
"'


. .
. .:U>_0041 n.
:I No robust homolo0 Pr~~c m Genebank/FltBI,
. as of '1!7/91
.
iA
H~t'.~RWLLDIf;tlILEDUWALAKA.itnOCSIKVLIECVSOVSEIIkIIKKKWETIRTRFPItGH


KV;14CMIEFPSAVWNIEEiLPECDFLSIGTNDLVOYTLGISRISALPKHL1JVTLPPAViLKIl~ItVYLLVIIOEIf
WL'lt4.HOPYYtutIL~Ni'IYIPGHTNKDSNKLEQK


RtiIHtNIAAANONQVPVSICCEAACOL.iLTPLFIGLCVGt3.SVANPVtNRLRNNIALLELVDFJtPFSLDCFSINF
LIFVSLVPIJ4:.LVRAYOIKKSLDRTIIIQIGYSPSTiCmiI~tEA


N:CLEITEALLQAKTCSEVEELLNRF44KITS
FVNCYCLICISIIl~LCILVPILlLW'LSLLLLGIL31LFSLnYFSIIDtItISIOtICI~ISN
.


CPt~0079 5II56 57967 AT


~T)79 hypochetiul protein
KKKKEAICIMEOOFLf~tEASLLEItRY 'UGOA~GLVSWLH~LI~PT00050 66819 66199
resent In Genebank/DlBL as of 11/719A
lo
h


ISNGSGYA Q p
~ t ~~ ~F atno
No robust
VSWFPILCtFLAIOIYAKt~aiF~Fi~IVKANLGYLPSTNCKNALCRNSSIILTSSIKlIIGIL


GGCGILLPIP'LLLt~iItsISVLFQLL~G.!!'Rt.CCF71IRpSVSSDIIrINt<LLLLHNZLA


CPn_0040 55677 54318


dnaK-DNA Pol III Caeea and Tau
IPYQASSRKYRPQTFREIt.CQSSWAVLIDJALVirNRAAttAYL!'SCIRCPn_0051 66797 67111
cL c in GenebankJENBL as of 11/7/98


p No robust homoloq 0resen
AFnHStGYTCf
CFAYLIARNIPRtIGMiETYINPGVLPSSNAQDVSRS'IIIYPSRSFiIOIPNtJ~BIFNRVPS
TGKT1'IaRILIKU.yCVHLSEOGEPCNOCFSCKEIASGSSLOVLtIDGASHRGIEDIRO


ItJTVLpTPVKAKFKIYIIDEVIOQ.TKEAFNALLKTLEEPPOFNKTF!'1i''TEINKIPCfIKSSEQiJ4XiNRIPL
IFFCID41PTISILNVNRPSWLSIFYNCERGF
KIWLORIPC1C1'ILEKISLMAGDDILIEASOIX.APIARAAOGSLRDAESLYDYVIS
I
"
RC


a
a 0052 68008 6730d
p CPn
LFPKSLSPD'IVAQALCFASQGSLRTLON11ILORDY11TALGIYt'DFLtISGVAPVTFLtIDLr


LFYRNLIyTHStTSKFS50YKTE0LLEIIDFLGESAtINtpN1'IFEQTFLETVIINIIRIY.-
hvmC-POSphobilinopen Oeemlnase


aRPVLSELISSI1ISRGFfGLRNIKEP1'LTQQVSAPQPOZ"~EOSPAAOCKIIKt4.SVrYSDPCLSDFCOQIRPLRI
ASRNSIiIJIICAQVNDCISLLRSInPKLI~ifQLSTi'CL'1'


SVEVKSSASIKSAAVOTL:.QFAWEFSCItRQ
G~IPLHLVENSIfFFTOCVDALVH1~VCDLAIHSAKDLPE'1'PSLPVSIAITRCLNPAD


LLVYADNYVNEPLPLSPRtRSSSLRRSAVLICOLFP~00QILDIAGTIFfJtL00Lt0tDttYDA


00d1 SS8A8 57342
IVWIAJLSLRLJILttItAYSILPPPYNALOGSL1ITAKDNAGKwKpLtTPII~NSS
CPn


_
No robust homoloQ present in Genebank/TC48L50 67986
as of 11/7/98
HSVCSISSRYKLRVLAITFLVLItr'VLLLISGALFLTLGIPGL?AGySF
P


CKYLYNMSYPP CFeL0057 697
Q
CCVLWSG1.L.~:.VfWEVBXVCPEIPAWPEtTPEDVPVTPFIUPAt~A
GLGIGLSAt


. ~'~ Protein
QKEpKTOKILDOLPGELDOLORYIOEAFACLGPLKDLKYtDGGTLO~~~KIRNATICIK'1'Ot~IDCGATAPt6iI~p
CPGCt04i'INSLVEEYVPQARSCfSSRSS'1'SAW.S


DMIAEFVEi.G0ILC0ECRLLEFVINQTRYIGRIS.FKRt'~SLYKWEyI(l.YLPSGOVRCERSIa.H4ESRIFIW(A
GiDRIiL:OLWRGSLTLIIOGDPCICKSTLLIIOTAERIJLSWLYRVL
LrYREA


LK%SAAEWDRFMRT'tQ4IRItIN4TFDPNVYSVAKTATEICAFGitLETCVYESNRYV~SSTOrSLPAKRLitISSPL
IYLFPITNi.DMIKQOIATLEPDiLIIDSIQIIINPI'
FIWVKeDQtIEIDDRIGNSQDISE


?CEYEKAIQ.LCDEEKSAtIAEOAFpDIKNRWBDNImit~PG~T~I~~IICttYlK9GEIAGPRVLaILVDlVLYFICN



RYEflIRITRARWYKVAENGLFNAIT0tV1(DSLRgOJEARV11FEKERSKIT40R~IKKJ~RSNANYRNIRSVRiRFG
PrNt<.LILSNHADGL1LEVSNPSGLFLQBJfTGP'i'iGSIIIIPIItii


:.ROLItEGHDOF3.PRAGERLRELOALYPEIAVSYVFJ1RREYASDLEKAtfESIDKHYOSCVRSGALLIBIAALVSS
SPF11NPVRKT11GFDPNRFSLLLJ1VLEKRAQVKZ.F1?mVPLSI'~f.


~~Y
KIICP11110LGALLAVASSLYNRLLPNHSIVICEVGLGGEIRNVAtQ.FRRIK~IItJGFEG


AILPECQISSIPICCIRENFW.GGVKTIImAIRLLL


CPn_004Z 573d6 58112
No robust haaloloQ present in GeneWnk/EI4BL70089 69313
as of 11/7/98


EECEf00EAEFRENGTKIRSNEEYSEYLOQVI~IQLESCSKAL'1'!00"fFlli.CVItLtaKEEIECPti".0054
rttc-Ribotttieleattt III


SII4SDVVNRlE~ILCROIEDFILSRVEEIERIB.RNACLPLLPIKEiItTKAFWttt4&CK>ZIG.TLSFFPPIKIPN
SKFKDGAIiSNNPPIDITAIWILNF!!"IOPKLLEI11LTRPS~IO~iJ1
OSIJaOTIQRAYIIGSQKVSGLESEVRACREGLKDOVROFL~S'ZIaGIG
PYFKESPAYLTSSFRL '
V


. LLFP9IO~TtSTARASLYttAKAGCRYZ
TK VGIEDSERLEFIIrMViCLZVTDG1'
C
DYLLIGKG6KIOSERGRLBAYANLFESILGAVYS.DGGLSPARKLTVPLLPPREEILPL18
pGVSLIKEEILt11TS1'FRTKFSYHSFRIJIVPCIOti.YLEYYODID~LERTRAAWNAFIStItYR


t~p~tlQplIJ<J~LVEAQAL~TEYWLYRZER1ISK>Ofitt7N110a.LpOF'IIQKOFRVLPVY05TAVT0AGCNVS
YQIQVLVNQ6ItYtG~I~Y158KKE710CI
....- nn., cem cn177


cPn..ooss 7oo9s 7os9o
CT396 hypotMtical protein
CIwICYLIRIRIOISALNLOHLRNFIWHaSILFE3JLLTIKDGFLLETKt.ONPIAKASRTID
TVIIMtF~TIFRSNPt:IYTWRKRRLOFFAAF1.VNRPKi.SLVRDLWV!'PCEEILEGEiOCTIL
PLLLSGDRAGSGIFFTGPYP&DLYELEIGCI'1'CLLLAFSSVCIPVI
CPtL..0056 70917 72746


sItCSA_Phosphoaanetosucaee


EFLIQ.SiitRISLIIIEYEORIRSLYD11VTA~IICRWLStJDCIaQDNt'fIILWLD'tDPAOLE


pLfGA?LTFG1GCLRSIlIGIGTNRINLtTIRRTZ'OGLVOVLPANLPNPGOPNRWIIOCDT


Rt045IEFA08'fAlNiilt~lCICiVPLFOYPEPL1LVSF7YRYBRAIOCVNITJ18t87PPNYNC
YKVYNASGGpVLPPL00EIVAACSAVNEILSVPSIDtIPNINLIGKEYFaLYRDI'LIIOI4L


YPF)1NRISGRSLSISYSPLIECTCISLVPIIVLIIDWGFLSYNL.VOOD71IWGDFP'I'VOLPNP


CPn_0014 6078 60778
EDPEALTLCI0~4.ANDDDLFIATDPOADRVCVVCI.EOGOPYRFNC~MI~SLLADNILGJ1
f 11/7!98


No robust homolop present 1n
Genabtulk/t?18LWSKTRHLGEN~(f.VILSLVTTEFtISAIAKtIYth~.INVG7tGFKYIGEKIFSWANS'1'NK
FVF
as o '
IAKSDCRVWIRiJiSAYKESGKVSSLETEACTYREYLREOWOFETOGVSLIKEELL!'LSS


TLKSKLSYDPLiANIPCFIKtYYCYYDDIOKARAOSRWLEKSERYRNAKRRE'OEIVKI~LFYCYFANKTES
GAEESYCCLYr;fItVF.DKDAIIASALIAFAAt.00Ki.OCKTLCDALLSLYC1


KEAIfPLIDIEEYRLT-
0EERSNILEKRLIYMOfAVARpRVGEFESMEIPEWFSAKTDEQEIRKKLSNLEEISSM1FFSGKYOVEKPENYKGGIGF
NLiSItDSYALTLPK


.
TSIC.CYYFSOOGRVIIRPSCfEPKIKFYF~fSTHYPERVTDKEIOKQRFaFSFOH<.ODfI


CPn_0015 60961 62790


CT)45 hypothetical protein
CKYTYHPPOLPPDHSVGATSWpPKLRILTITlLYLC'JLLLISGALFLTLGVPCLAAGL5FCPt~0057 7.91)
73554


CLGIGLSALOGVLW9CLLFFLIRRGVSKVRPEEIPV1'PSHEAGKIL~_QLPOELDOLDTSsodH-Superoxide
Disnwcase lHn1


IDEWSCIGICLKDLKYEDpCLLTEVOLIQ.RVFDFVRKDtIV't'EFLELOOWAOEQOFLDYLILKRYWNSFVPYSLPE
LPYDYDALEPVISSEINILHHOKNNOIYINNtJrMLKRLOAAE


INQVOSISHKLFVPOVNIGAHLAEIGGYLPSGDVRVERLKRSAROWDRF?IRVTCDTRXV'PQpNtllEtIUPU'RfNC
OGHtNHSLFWETL11PLDQCGGOPPKHEIlSLIERF~JCI?ION
'


AMAFDENJVCCVAKNAFDKAFCALEECVYKSLTESYREAFYEYEKAKILRNEDVE~R.OOKNGKLPLLL1IDIMfItAY
Y
fLKKLIEVAACVOGSCWAWIGFCPAKOELVLOATANpDPLEPL1


KSARAEORFR6VKWlWEDLKETVFiJVKENGCIDLE'JL?A40L11Pt)ItCPENLIPBIUIRNINfpYtINVRNTnLK
AFPCtItM~ICHItIJNFSEIISSK


NSHKLWFJ1THRFtKGAECTYSV.1RVAFEKDGSR)WQKY.F~EKTKEWLRCLKDLHDOECNRA
0058 77eZ7 74562
CPn


RERLAELEALYPEVSVSWETERETKFKLCtAYCNLEERYGStMCDQEDYWKEE4?IKEAE_
' IceOACCOA Carloxytasv/Transferase
Beta


tAEFEYTILSDAAN
IRWLVRLI''SYDKPKIKVpKIKADCFSCWLKCNtICNENINANEIGOHYNCCPKCSYNYRiT
FREKGTKVRSPEEWEYLDiLFIJGaEDCSKQLTIAE'~IVhGIIELEA


RLKVtGEDTEDILPRVEEIEINLRIAELPFLPIKOAPI'KAFLpYNSCKDRLAt(VEPYCQEAIERVKLLADKDSWRPL
YTDLKSQDPLEFIDI'DTYANRLEKARKNtI'ESELVIVCICTIC


SVDYKarFRV
IJIPVAGAVNDFNFNAGa'NGAWC89G.TP.LIEFJRCfRLPVI
IVSASGGARNOFSVF8L110


.~Pn 004e e2775 6)261
tNKTSAAi.AKLNFJVGLPYISVLTNPT~VI'ASFMLGOIIIAEPKALICFAGPRVIIAtW
'
'
'


N" robust hnlmlv4t presanc Cn CenebanY./F11BLOCKSKAPRDLSKR
.~s of 11!7/98 l
Il~TLLDYFLApfY1
ICEDLPB6AOKSEFLLEHGNIDKIVERKELKT


ERf'Q.;LNpOI4NVY0COKATGLF~~~EVSAYRDHLREOITEFEZ'OfiLDVIKEELLFVBSTLLKEIFLLTDOSE


iC$KL..':lDf LiAOIPCNKFYEYYDCIDKARVQ$RWLEY~aERYRKAKKOFQA4LICECLFIfE
4'7 7.1562 75050
'


t>pALY.KALYRt.LREKRFNKF.KLLtCNKIEM(7t,~RVtrEF:P.:Dw:Pn_0
'fut-,111TP Nu,:le,'r t,lohy,irolacr:


t.l ;7 0 )bS~J IKHIfI'A::l.'NIkII
IC'NAIIXIVFCELD:fX3ELPfITI'PI:AN:AOLRANIEEPIALt.i't~RA
1N4'1
'I


,
LIPTr:IxJIEtPEr:YEWVRMt:a7:(.ALKINaTVItI:Pt:I'ID.r.DYRi:EiRVILINPC:D'1'FI
.
n ~
JEMBL ns; ~C ll/7/9H
:C tmn.,l"u
nw
:,!nr tn ~
tunr
en
w
tm


y
IEPKNFIAOWL:.I'~1i'A1'FWKQE.:LhTARG..IY:Ft:lrl'r:ll;
.
.
.
,
m~
:
I:tIF'ItIJ!VTISra~F'ItlVL.t''.J:ILTHYIIFQKIRFfI'LT'I'rJ:F/LNK~WtKhYEL.WFYYt::;C
l'EC


KVY.IIJ!'::::IIF:WI.
~.Pn_Ot,SO 751104 'l5 53H


.'tn 11114a .. ",an ,:SpOI GcaN-VI's I IA Pr.e.rin


V'ItF' u:: ,:rncuem~~t
hytxrttuti,:.tlFKLPEC/EVLVILEI'AKMI::YtxrFtOqLF::LF::LL:PRLVNFt.GKIICRDCIWUI.Tn
t.VDA
IM t.r.rrtin '
'


HKFJkItI::YNttALIIKI.::IKJWVNYFt.YTFlI.'X:::F'IVAIFTFAWf.KVL1'1.'I'EIKN:EISRISl

Dt:L139111IDc:
ACIIi.EGY.OAFFDAL~RRf7itlCTR%IC%M'."JAIMK:KLECC:aIFFIAIC:III


I:PAf?41)F::I:.W::NIKFYKHtNIt::E11Ft1KWIII:rL::f~:::l.I::KF7:tlADEKrOYYIPKKJ1A0
ALVRL'JFLIC:GFG4AQAEYLKL4'.TLTL:L.RftECRRqrJt.LUVNTtELIMNVFVt71


1~Lt..'ftIFVU::::I'iK,'LKDLt' IY FPLL.tIKBKKTLE l1 i I: I::NlIr:HV
LAS\:FC:IILK IFLIOE7J


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
u:flv IIW.I 75iu1 762DN EATYAtORKANKKPtCJtE
:r$'ttFlYTt:!rCK,~,L.W1LIEINRNNC'L'PATJINLGAS
pcsN~PT:: IIA Protein ~ NTII tsNl-BamJnq Oamain UtWUIOPItPYCF:xIPOCC:W
.:.:YL7t11t0'ta'CD:LARADCCLHTt.;'C'I'L9pIKKLPORt
It::HOCtt.".:DVKNGLKLDEVA:iLLOVCQRVLOWLK~AIPSY5IlIlICfRF3RECI~ft.L t '~
IItpALHLOERt:EOKEALKDGiLKYSLYKAiNROLYt.CDVWNSKZ~J~4YASKYiAQKFQ
LDE..~VLFEHL:'rIRENLN.."fCLCFZIALPHAKDFLIHAYYDIWPNFt.AEPIEYG7It.OCKP CPn 0073
87153 9757.1
6'GILFFLFAL'ODK~HLNLVNKIVHIGHSLN71RSFFKNt~OL'L'AwKintA-IntCiatlon Factor IF-
SNAKKEDTLVLDGKYEELLPGfIHfRV .LENChIPIrfAHLCCKHRHSNIRLi.IC~VT~15
.:Pn_Q057 76251 77690 A~~~R sf4iir~'~'
~:;.,...,:; ,:;-w~l.~,..,:.~:rt~°:.I:;~~.::::: °:--
n::r:r:n:::::..:::..:~.r:~.'.. .,r .:... : :. ..,;.._
fWKKPQKt:;iERiIOAKKEPRARKB'fLVPSSRTL'alRat7KMItNSSRIINEISANST CutA-tlunSiecaun
~dctor 'tu
PRSVKLRRHKRAEOKMKOGESAPSN<~TLKS~KL'PS1LOKTSIHEREKA?SRtVNfSCL
EDFEHSKET/QRNrIPNINIGTIGHVt#IGKT:'LTAAITAnLSCOGIJISFRDYSSItI~PLE
SSARKRYCTPSSMP;LFLETEIVMWERTKC1QDNEIHIPWQWfNPKLQNI'KZTKQ
MRC1TIN118HVEYEfPNKNYJWVOCPCNADYV14~8ti1G11J101mGilILWSJfI'DGRIIPOT
LASOASIOQSEGTEOSLREI.iICCASLPVLVPSNPEYSYORCKEC3.KCL'VAERIlOCtOIKS
KENILL71R'04GVPYZWE'LN~ISQEDJIF3.a'DLVpIELSELL66~'YI~CPIIIIGi7IL
VROALFJIRSLTKIfVARGGSVTSTLRYDPWU1EIKSRIfNCKVSPtCAREOIDfSSCKRpIIM
K7lLt~DANItIGIVRELHOAVDDNIP?PEREIDKPFLNPIEDVFSI$GwGTWTGRI01GI
NCKOOKTfPSEDASOEEGOTCJ1GLVRKTPKSQVJISIUONFYIWSKH'tNIDSYLTANOIfSC
VKVSDKVOLETIVZCVEtQRKELPEGRJ1GB~RrCLLLRGICKNtaril~lRNCOP
SSEE'fDWPCSSCVSKRRTIOrSISVCTHwt~lIl4lZVCALIIIIWITESdiTSDPTPPIPTP
NBVKPNTKTKSAVYVLOKC~rGRHKPFFSCYRPQFFFRTTLNfCV41'LPt~TfJI~IPCDN
V~V<SLICTVALEEGIWFAIREGCR'tIG7IGTISKiNA
CPn_0063 78109 78267
No robust homoloq presanc in Genebank/!?0L ss of 11/7/98 CPn_0075 91087 91350
pHYANCKrWCLCLYDFSRHRSPPCLPLTFTPPYSFTI~IFLGRCLSTSNIVLL suet-PreDSOCSin
cranslocase
gRgWpIptapNNRKAtSRKIGTVKKW1KFAGSFLDEIKKIEWVSKHDUOIYIKWLISIFG
CPn_0064 78310 78576 FGFAIYFVDLVLRXSITCLDCITiFLFG
No robust homolog presort in Gensbrnk/EI~L as ot~ 11/7198
LVM'KIOCSApYYRSRPAERAOTPPQPFLARDRRADFiiEAItPRFSJVC~VtJ.t~VU'~L'AL~ CPeL0076
91371 91903
LFLFVIdLPLAAGSYLLAF nueG-Transeriptional Antltesmlnati0a
pplCg11K7~I1frMYWOVFTJIpEKKVKKALEDFKESSCIffDFIOEIILPIFJ~MlIVIOK:EH
KyyltJlyIWpGyLLVl40!<.TDESWLYVKSTAGIVEFLOOGVPVAISEDCVRSILTDI~1(
S(fWCIfHOFfVGSRVKINDCVFVNFIOtVSEVFtIDKGRLSVMISIFGREl'RYDDLeFWCY
BEVAPGOESE
CPe1_0077 91956 92135
r111-L11 Rlbosowl Protein
FP'ygypLFygygQCKVRFSHSVbMXIIKI4IPODKANPAPPIGPJ1LGIYACVNIIGICIc
EFTtMLLPVVITVYADKTFTFiTKQPPVSSLIKKTLNLESDSKIPNWiKYCKL
'tpApNEAIAEDKIIXI~IVLf.ESAttANY00TARSlCIDVE
CPr1-0078 92157 93160


rll-L1 Ribosomal Protein
SCRIlITKNGKRIRGILKHYDFSKSYSLREAIDILKQCPPVR!'DOTVWSIKLCI1IPIOtBD


GOtRGAVPLPNCiGKTLKILVFASGftKVKE7IViJYGADFMCSI~LVEKiKSQiLEFWAVA


CPn_0066 80916 82655
TPZIIHEVGKGC11VLGPRNLNPfPRTCrVI'tDVIIKAISELRKCKIEF1010R11GVOiWOG
No robust homolosi present in
Genebank/Er~LKLSPESSDIKENIPJ1LSS71LIlUlfPPAAICGQYLVSFTI5S1'MGPGISIDfItQJNS
as of 11/7/98


CVYHANR'fQSRPPSPEISICELELOELM:SSNZ'LTISNI'PPPSCMTAEEVSLFILOGRR


NSEDEECP~EVYDVVCITNOGDPESIRDll6VRVN1(INGSCRTaHECILDAIBIiCDZ.PG
a1DYIYLt'8GN 0079 93170 93688
CPrI


EPVRFINNS4'YGLRSGFLCIRNRIPPRIxJVISDAIQA~FFIFA11~111-
Yt'GOWITLLiSIRGA?AVlfl~ri' r110-L10 Ribosamel Protein


CCLYLQVAGOILSIYSrtItILCVGIGSSYYIOCiIYAVtOiYR~08t0E~LIi.OEY~SAA~'FILLRYLRITMYSRE
IPNSUID11SIICFM.1001IF
LPYADSAEGLFLPSVttCPSYQWALACGEpCLIHtIf~pQVOFRPODSSStIALVVtVLDFNKpHKDBLVFU1DKN~LI
SLS
VSMIO


STWIRLIEWIDRGDSOAVLEfiiPGPSt~RDIJ1LTALYAT'tRISSLiID~L~L~~Vp
' FMLE~~~~DP
GALYEAYAKLPSLKG.RCOWGLFMPHSQWGINNSVLSGVIBCYDQKJIGIQI


RR
FV'1'IrAIVIIGYSIM1'LRYFILLLTNRPOCRRHFRVLRi.MLGL.OStGFLTVLLDttZM~
'


V!1
VNRRPPLISVIFCTASFATGSFIYVDLTRNIfTSLRSRI.OL1WRRL11GRGLPLNAV!CPe10080 93720
91121
FHPLIIGFINOLVIQVPAVVIRPN1TAVY~OTSOE~
LITFttO~d
r


. r17-L7/L12 Ribotamal Protein
HLDSLRF~
I


FFFVPSVRIfHI.IDrRPIa
VRtIfKVITLBLETLV~ILBNLTVLELSOLKKLLEFJtiIDVTASAPWAVMOOODV1PVM
GDVtAIGO,hIH!'Ii.QIILLVIN1


BPPEFAV'tLmVPJI~DCICVLKWRL~ICLIILJCFJUtE4lt'ODLPIfNKEKTSKSt111m~11I1C


CP1L0067 87910 81053 KI~IIGNIASFI(GL
No robwc haeolop Dresent in Genebsnk/ENBL
as of 11/7/98


~YSYPDPPNAVEGRVNSSOALNpt7C0Nt~G8IGGLLRCRILSIWAVITFIJILIrV
t CP1L4081 91=19 98016
W


FW rpoe-~ ~lY~rase 8eca
LIALTIJLSILTSYPYL7~I,GVFLLIVTIGCiIFAI~C.SEItIKRVPPfPIStd~EIIA
0VLLf'1mN
RSH


0
FREILBt~ItSRRTRl4JfCPERVSVIGCKEDIPDLPNLIEI01K$YItQFLOIQUJI~tI
KNIDNEKEKEDPEtIFGRTATDIPNRSALOQFNHSCNItIHEBPALTBTY
L


CPYTLPOYfSEEEVLIRSVVGSYLLiEIICVPKYSH<.iDEUNKLK81S6RCCLTIDXKTCLtEYFREIFPIKSYNGTV
LBYLSYtAfGYPKYSPEECIRRGITYSVTLKYRFRLTDQCIG
NDFFTPCHS
Ui
R


p
IKEEIVYNGTIPtJtfDKr.TFIING71ERVWSOVNR$PGINF6pEKNE100NILFSPRIIPY
~
ORIUSFLlI'OKDLATFFLAYTRVNOGNWPFRJIGAItWILItfYVRLR
TfiDttCDGFLE


CYYARLAFNDTQRLYHOLFNVEKLRSIYAPImKDPLCNPWAPIPIYDLLItpG>RiLEiIIFDINDLIYINIOWtKRRA
KILJ1ITFIRALGYSSDADIIEtCFFTfGttSt~SE


pQE~tEYPSRMQDQFWG
KDFALLVCRILADNIIDEIISSLVYri%71G1~'t'AI4.IDb1U711GI1LSVKiAYD11081a11I
I


81331
104.AXaP'fD'nFJIALtDFYRRLR~EPATWNARSTIIDtLPFDPIOtYtit.CRYGRYKi,~K


CPn_0068 81909
LCFSIDDEALSQtIfLRKEOVIGALKYLIRLIWDDEK7~~CVDDI1111L~RRVR84CELICNQ


CT360 hypothetical Drotein
CR9Q.iIRNEKIVRERHNLFDFSSD?LTPCKW5J110LSL.11&VLKDFFGRSDLSOfIDCTNPV
SF11IKKFFIYSLIFSCSFSAPLNGICNEDVSSpSAIIEDPEVLITOLNELILTPIEOGKEI


0AISDGOICSSEEIEESCGTSDSEGLSEKTOKESSNEYVLDFFDSNWRLEGISKHAELTNKRRtSAiGPGGLNRERJ1C
PEVROVtDISNYGRICPIETPECPNICLITSLSSPJ11C
IRNt3 YCTfVIIYJIGE
'
'
'


. fEP
COSCQVAGIIDCFNREFDIRNRELELIDIRELEWrrnt.c~SIU7NMKONSRELAFQRADVEEECVIAOII.SASLDEY
ISV
fDEIEYItI
NEPGFIETPYRIVROGIV
AVPLLKT1'JIIIISIC
R


Q
AFEACfSTVTIrIONSPKpLVSTtrZCLIPFLEHDDANRAUIGStJIIp


EQDIKQTI11LLKK
1'GLflCMAKDSCAIWAEEDGWDPVDGIfKWVMKfDiPTIKATYNUfKFt.RSNSCtCIN


OQPLCAITKCOVI710GPATDRGEL.ALCKNVfNAFNPWYGYNP~JlIiI88KLIRCD
~


CPn_0069 85191 87086 .
AYTSIYIEEFELTARD1'KtGKEEITRDIPNVSDEVLrINLCEDGIIRIGAKVKPGDILVCK
No robust homoloq present in Genebank/EMHLKSDtle
as of 11/7/98 F
RK
R


LNFLYVYLLIFNtGIHTTPPPSRSSSPPPYDWILpDLCMT>NtJSSRATPPPPEIIGCELPS
D
Ia
ITPKSk'tEUIPEERU.RJIIiGEKAADVKDJ1SLTYPPGT~bWImVKV
LWEKAPMIINRRTACIWIImGLID
A


PYFSJISNiyVIERGAPSLPSPQpLLSLPEYSROPPP.r.YFDETJISITSRTSE~CfLYSTL
LVEFJ1VHLKt%~DKGYKHQVATLKTEYREKIIG
WU~IPNCDt'IGVLtCGL4SDYETAf.~IL6IN1rKT6VlJIIR~OIIIDLDItt
OETTERIEGEIX


LCCPANSERDWEDNEVNCIYIAS1'SD'tOLEAVp(xMtIITtLAGEPVRVLYt:TGNLYAFAR,
'
GVIAOPKWVASKRKtAVCD101AGRHGNK411VSKIVPRIIONPItt~NCISIVONIIIIPIL~VP


QDI
SRNMAGVLETHLDYAAK'fAGIYVIffPVFEDFPEQRIWDINIOpCLPBDDKSFLYDCIITG
ENTCNSRLEVSH7YRAKrYPYIDRFFSPN4MViCRRFLVFYpCIKiCAYVOAALDSSNN1


'IVLGLSPTVYIRCNIrNVQHYRVRDFWPSCLDSLMGItM'SVLPYCtSSDGIFYPSLFSNERFCNKWICYI1MLKLSH
LIADKIHARSICPYSLV1COPLCCKApIIODORFC0~R1AL


TFDMAIRYCERCLLVCSECMGNLPETCpOTSPLTSLEOGfIEtrALVUIPQpNPEALSLASRQEIL'fVIt5DON5CRT
RIYESIVIIrENLLRSCTPEBFNVLiKAlIxLCLDVR
EAYGVAWIL


I11HEERCCRLESNY?IPGRSSNPFM'fSNYVLVrtINfLIIQIYLHSPYYSFQSNDIVCLIFIS.


.~.MVETV~YLFLTVTDSTCCRRYLRVPRLVCTCLRNLALPiTLLCLLILSYPRSIrLCVPFPNWDA


tJVrtFIIG'fHf.'ITRWFFAWNLILIIWPFIICLRIIGIpLFVNRSI
IfSITtGARITDLTLASHR 0082 97992 102221
I CPrt


YAIVFPSIVC~LLTAIJWANiNIt.ALDPYRLIESGDLRRPAPNODEM00~rPWDJIYS_
rpoC-RM Polymarase Bate'


(:LVINTCtYMLILFANLiFINYSVRRYNRSRR
CSSYGRRRLKNDVLEKINFCENSRDtCVISKECLFDKLEICIASDITIRDKW&CGKIKKP
.


~Pn_Utl'la 87399 8720A
ETINYRTfICP~OCLFCEKIFCPTKDYIECCCCK'fKKIKHKCtIACDRCGVMLSKVIIRER
NAHICLAVPIVHtwFFKTTPSRtCNVLCN'tA.~.DLERVTYYECYVIIttM~GKT04T100~iJJ


th robust ntsnoloq present in
Cenebank/EHBLDAQYREWEKM'.KtHIPVAKMOCPJ1IY0LLK.iEDLQSLL1C0LKERLAKTKSCOMP4ttJlKR
as of 11/7/99
'


LLISFRDTCLKR LKL
IOGFYSSSMIFCWNVLKNtPVVFPDLRPL'/PLOODRFATSDINDLYRRVtNMBIRLK
'fKyrLFNLKNONFFSNOSRTYEORFPKVSPHPESILP.tQSVGFSSOG1


riLYI AIGRtXTPEYIVRHOtRMLOEAVDALPDNDRH
:HPVN17AGNRPLKSLSOXJODKIKIRFRQ


t1U71 N8U~6 8759?
NLiCKRVDY!aCRSIIiVCPELKFNCC~.PKEMALELFEPPIIKRLKOOCiiVYTLRSAKIM
:Fn


_
IWKiAPEVWONLEEIIKf;HPVI.WPAM'LF1RIJ:IpAPF.PVLt0t71fAIRINPLVCMFNAD
-f 125 1'y(xxhcc ical protein '
'


IK::LR::ILEPIf FL4IIARt:LKKDNKIIEELFPEPFUYDNLYLKfIIENS:iSRL1/1FOKKRNLf1
F
FGtDl7lIAVIIVPL.aVEA0LFJ1KVUIMf'I~IrFI.P:.'xKPVALP.'.7falfllSLYYIIIApP


'V::IH.YLYEVYQDGILFFFTYTKAtJt.~.fIA.~.LFTI:W.~uCE1'P.STIL1'CKPIFPEFJKY:KTKtFKDE
tEVUtAUJNrr:FIJ717VPt:LPRDL~Ic'.tlc:flllIlP.KIKVRIDr7pIIEIT
YF
NrVVtx


. Pt7RVLFNR t VPKEta:F(XJY.~,l11
. ..rP t..EL t t//:YKKVr:1.E11TV1tPLfMfLhDUSP
FYJI<LTh ll_:('~:RIJ4(x:E::LYNRNKO tQATKA
f AVpYLXPP(,T '
'


JAIVKYt,YDDtlttTECERIt:KTI~IWflY:f00luD
ft~Y
At ~Nr:LIrOllrttf'DIK::11ILKDA


'In~ inn/ Ntl5l Na1157
ALriEt::KC~SKNNPLFt?rID:Y:AW7NK:aII.Y~JIt:AI.RIaJIANPNr:AIIF~.PIT.'OJFRE


T:~A hyrrmlirric.~l prntwin
~'LTVLE'l.~.taaK:ARKCLADTAt.KTAD:Y:YI:fIRLVDVApDVItTF.KIX\iTLNIItEI511I
'


'P::YIKEKYI.ILI'fCLLFYFFIfYRIt.TPL.::rf:LCI'L'DDWPOEI,FCDRL%3SRI~rf~:Dlt::f1.IJ
W:a:DVIH:;1/VAF.AIDI'.ltaKTIKLR3
iw:Yh::l'Y tXlt::%EEI.LI'LKpRIY':RTVAK!I
'


. f:~
::IE:YIX:NA:Y:IXa'IV::Pr'I::ALVALTDLKLVPYNGtI.iFIihl1'fRLKNAVEKII,LFIpNI~IK'fLT
i:K.':r'Rr7Vl:AKI'Yv:GNLAN:hI.I~:hI~:P:AII~::
tllAtk:lt:EPlfittl.TNRTF111lx:IM


IIItIYALTLTIWLIT/17ILIIGV'/F~IPTATCLDKENKHRNVNSWNL:.Tr'EIITN:;tXSlLYI7ttKJlWIIY
~E~:rINLVInYY~JIIJIWr:DNf:RTWKTKKUl7fK.iIE
IILLfII '
AWALtI
VI'


. KW
.
::LKVrPVC4:VKtl.VAIY.TPV:aJYrhI~:FYI:1.rltrtl'Itr:MtfnFIKYKIH.VrJ:I:.TB
.
talc:lUJId~FII:fRUILLA'rNIASI::ALLYAVP57A'/r:LViI:FSIfxIQI:,INfVYCARU.:D


76


CA 02350775 2001-05-11
WO 00/27994 PCTNS99i26923
NKHIS:LVELIWONRGfLtIMIAIYDOADL.iEL HIIrMEVI::LG:!'
JATPSGAII'llEEf:QRVDPClD.LA


RLPPGA IKTYD ITOCLPRVAELVEMKPEDMDLAK
IDCWOttI(G IOKNKRTLWCDEFff CPn_00t.1 .:~ ~ r~trr~k~~'~"v
CCVRELOKYLVNEt/OCVYR


~71EECNLIPLTKNLtVpRCOS'VIKCOOLTDGLVVPNEILET...
IIVIIpNLQKVRL1'DPfiDTI'LLPGEDVHKKLFYCFNRRTCEDGGKPAOAv.\1S-ValYl C1U~IA
fiYnllflC.1
..
SROIIKFIiLRIt~f'fEDFPIUlNF(;L'fEPL'i'IFWEKNr,7lFKAEA.iaOKPPrSVIN
~fFt


f.pCVDINDKHI Yl
VPVLLGI'fKA:uLGTESFTrAASF'OD~~'TDAAf'CSt~t~t''
'I~FK~I~I~ PPPNYI'CVLHNGHALVNTLODVGVPYKRN.'Y:FEt:.'a
f Ir:TL'Itlk% IATQAWIRNLOASEG
'


?ETHKRIKOYLEKEODLVFDf VSETECVC :
KRR'IDYSREDFLKNINAWIIEKSEKV':Lat:LAC:G:
~~.JWdIKRIr'1'IIEPLANRAVKfIAFK


FC~ICYtYRCfILVNrDPVWIAt~DE1'EfEEI(LVA'ILYYIR~RMVC"~E.iIWATTRPE
t


~pn X091 102:96 103312 .
t
.r,..... . .-cnY,..:~f.".~ ~~'
.. . . ....,......'.v.~Vt!
..
. .,.
....
.y.
.
~


.....~... -. F
',
:..r.w ,
.s :. ,.
'~,.. .. ' r'K PKSr
'.Y~F'f. :.i
~:' .,.... ::1:
' . ;e:~.
1 ;~~"--rnr, ~
~
4'IEPYL:iKHWFVe'v.::.rm;nu:ia:F.:aL:K:FiK4YVKV'i'u,iw'47iNLRi.ii
"


.
. Vs:V::YRsur
.
CISROLNMGNAIWWYNIO~CDERYL.'.'C'OI:ErIPEEVACDPDSWYODPDVLD'I~IFSBGWP
.
..
: :1F .
'ELWEAWWGIAONGODLCTLSFLLOKTOVNFALElIKNIPCRISLELDARL1FNVEAM


VOMVFL50LFEAMOGOKKRLLVKIPfsIWECIRAVEFLEAKGI~ftLIFNLVOAIAAALTCLGNPDENSFDLK1IFYPT
ALLY'CCNDILFFWVt'PNVLI.CSSM&GEKPFSMIJIOi.IP
'


OIMAASFIt
l'OISYKRYHflEGEWSYISGKEKLAYOMCFrILPDL11VAKNCKLSKBKONVIDtLBIIIATYtt:
KAK11TLISPFIGRIYDwMIMYGDF7CYSIDADP41lA5VSNIYAYYKKFGIF1


TKEpVLALAGCDLLTISPK1.LDELKKSONPVKXZLDPAEAKKLOVCPIEL?ESFFRFIl~TIDAWL?I~~IDt'D~~~
F~~tIFGNISDirOGKDLLaGIDCD


EDANATKLAEGIRIFJIGCtOILETAITEFIXOIAAEGASLGfmFYILOGFNOLIHOLEEAYATYAFDKVATL1YEFFR
NDLCSTYICIIKPTLKiKQ
'


103356 103751 GNRLR
~~0~ I~t~t ITESLFLR ODI'hGAi'P~DCDAFI
PYTPODLRESFTLJIOItLVYTIRNIRGEMQLDPRLHL1UMICS


CPn_OOB1 MLPSRAC1IG
predicted fersedoxin
0'1'TCIOSCIPIIpALIGGLESIOLLDIIEPEKGLYSFCYVD1'IRwIFVPEfJILLKCmRGE


SEMKMO'lalfKSOLVFSCPCCCK(TIVCFSVFNLOVIL1'CNVCSSTY'1'FDSVIHNEIROFYAKEAVRLERAVEZi
LCRL.LGDEStCOKAHPNLWAKOEALKMJitIELw7GILmG~ISFA


:.CKRIHDANSI'w"NATVSVSVEDN011DIPFOLLFSRFPVVIlILSLDOK1CIAIRFLFD11LN


TSII~sQESDLIS CPn_0095 115956 118790


0085 104512 103'166 pknD-S/T Protein Kinase
CPr
ACIVCLDRCOORSLERYDTVRIICIC~1CEVY'.aYDPtt~.SRKVALKKIRCP~NPLLKR


_
RFLREARIAADLINP'GWPVY'I'IYSEKDPVYYI2lPYIDGYTLtCrLLKSVNOKESLBKELA
c~311 hypothetical protein .
EKTSVCAFLSIFHKICCTIIYVNSRGIWRDLKPDFIILLCLFSCAVIL.D~.AaVACGCEC
FSMKPFILFILIVAOFPAFSAOPATOVSASHSKpAKARRTSRIRSSMTNASVSRYKTRA


AARKKIGKFflOIPSLSPVOWVRYSCKNYSICfP5L4FOCIDL:K1'QLPEKLDVLLICKGKGNDLLDIDVSIIEEVLS
SRM'IPGRIVC1'PDYMAPERLIGNPJ1SKSTDIYALGVVLYOhLTLS


LTP'fINIAOEITSKSSKEYIEEILAYNKJWEMTLESGIFTOIOSPSCCFTIIK?ESNFPYItR%!~%KIVLDOQRIPS
POEYAFYRCIPP
CRVfCL0A1'lYImiTAYIF'"STATLDDYALSFI'FLKWSSFOIRGGKGTSCDAILEKJ1WNRIQ.AVDPOCItYSSV
TC.1~DI
ESNLI~SPKIiI'L'ITALPPKKSSSWKtirE?ILLS'RIi.~,VSPASIiY$LAISNIESFSil1


LEAt.ONf?1K RLSYTLSKKCWEOFGILLP'ISENAIA7GDFYQGYCiFNW
IKERTLSVSLVIDiSLEZORCs


8 lOS5Z7
ODLFSDKLTFLIAL~00ISLSLIYOGTi~'ILIlOI~tYLPSRSGAIIAIiVRDI~DILEDZCI
SFPCRItOGYCARFRAGITVLCKAS
R
A


CPn_0086 10489 I
E
FESSGSLRVSCLAVPDAFLAmtLYDRALVLYR
LIQY50NPE


acPE-ATP Synthese Subunic E
TDNN~~T11IJ1ZECPSIOi~GSfAIIPLEYLGKALVYORLOEYHEEI1CSLZi.A
NINANLNADG%LKOICDALALDTLKPAEDSAAALfJiNAICEOAKRTI0EA0Et~IRKIT'fTAHESFYIfRDRiJIWF
lLILVLEIAFOAITPCOEEKILVWLKDKSRATLFC
DNVVYRT


EEWItpKIKOGEVAT.S0AG1(PALE71LICOAVFNICIFAESLVEWLEEIV'i'1'DPEYST1G.IOAL.
~ IFRIX
LLDPNLCLIISSIQ~L.FLSYWSCYIPfR3iSLFHRAWDOSDVMLIEIFYVACDIJODAFL


'IOALFaOGVSGN1LTAYIfD0IV5PRAVNELIGKAV'1'1'IC.RIGfSWOGSFVOGYOLKVESSCIDIFKESLEflO
KATE6IVEFSFBWf:AIT.FAIOSITNKCDACIIIFVSNDOLBPILLV


wVt.D<SSSALLEIFTAYL.OKDP'Ra4IlOGS
YIFDLFANRALLE.ROGEAI!'QAL.DLIRSKVPF3IFYItDYLRNHEIRANWCRNFJIALSTIF


ENYT~OL>tDEOY~t'~IOCAF~1AI(OHFDVCRF~RIFPASLLARiIYNAIrGLP


CPn_0087 105510 106376
KDALSYOERRLLLRQKFLYFHCLGt~OtDERDLCQTHYNLLTEEFOL


CT309 hypothetical protein
SHCKIFSIFXVWKIOYYFLSSFLPTOLPESVPLFSISDLDDLLYWLSENDLCNYGLLKCPrL0096 134347
118837


AFFDFENFAFFwAGKPIPFSFGEV1'pENVFIl4SSQOi~ISOI~fFKD~IKS50DCT'396 hypocheeieal
Protein


RWNFSDLFREFLSYttOTNSSKFLODYfRFO00LRWLAGTRARVL~SY~~SCfFLSILRCTfMCSLPVYVSCIKVRNLK
IIVSIHPNSEEtVLLTGVSQSGKSSIAFDTLYA
TLNRALiILYOFHKLECFCSDSYF '
'


DPWLM~IOKDSPNYELPEEFSDIOCVL~YGLLPH GJSN
NCSTI
.nwrsvrcrt.srrlllTITfLPNPKVECItIGLSPTIAIK~7IIFSHYSNA


DL'NyI;,ARCATYMFAIRNSL.ASVCKGACIINHIC%AIlDr1


CPn_0088 106351 10A14S
CT288 hypocMtieal Drocein
SYRIAiIOIDtIVSDGTAOGtIVIEAYCRiII.RVRFDCYVROGIYAYVMfDNTNLKAF:YICVAD
OEYINiDYFC>yIpCACRGALVT!'SGHLLF~IF~GPGLLOGIFDGLOFIRL111FU1EDSSfIARG
KNVNAISDNNLNMTPVASVGDI'LRRGDL.IGIY?~RITIIKINVPFSCFOEVTT~~TSE
RWPIKpAFIflGflCIPAHKIMDIit".LrtILDIbIPVL
KOGTFCTPGPFGaGIITVLOHHLSKYAAVDIVIICACGEPAGEVVCVLQEIPHLIDPH1GK
SLNHSI'CIICNTSSlIPVAARESSIYiaVTIAEYYROIr..LDILLLADSTSRWAQA1RCISG
RL6EIpCEGfPAYLRSAIAAFY~QAIT1'I~GSEGSLTICGAVSPA~?1FECPVTOST
L71Vi1CAFCGLSKARADARRYPSIDPLISWSKYL~KNGOTLEEKVSC~GAVI0fAA0tLEII
CSEICKPMCWGEEGVSMEOMEIYLRACLYDFC'1ii00NAPDPVDCYC~ItI.FSLIS
RIFOAKINFDSPDI1ARSFFLELpSKIKTLlICLIIFLSBEYttESKEVIVRLLEKTMV~'IA
CPn_0089 1D8111 109466


CT289 hypothetical protein


LDL'WIOLON1IKWRKI7MOTIYTKITDIKGNLITVEAEG71RLGEL1TTTRSDCRSSYASVLAF


DLKIM'LQVPGCrTSGLSTGDHVTFIGRPMEYI'!'GSSLLCARLNCIGKPIDNEGECFGEPI


EIA?PTtIJPVCRIVPRSNVR'1'NIPNIDVFNCLVKSOKIPIFSSSGENNNJILLIOtIAAQTD


ADIWIO~LTF1IOYSFFVECSKKiRFADI~MFIIiKIIVDAPVECVLVPDMALACAEKF


AVEEKKNVLVLLTOMI'AFADALttEISITMDOIPANRGYFGSLYSDLALRYEKAVEIADGC


STTLITVTTMPSDDITHPVPOH1GYITDGOlYLRDEJRIDPFCSLSRLKOLVICKVTREDH


~DLANALIRLYADSRKATCAMAMG!'KLSNNDKKLL71FSELFETRWSLEVNIPLEEALDT


~IIfILAOSPCSCEVCIItAOLINKYWPKACLSK
FKKQAELLIAKGTI1'F'SDLWIDSHPIASSORSDISTYFUiAPSUIAeTwsL7Vwawu~a


SSMFSIT~'KOGOCSDCOGLGYOt'1TDRAFYALEKAFCPTCSGFRIOPLAOEVLY~IOtFG


CPn_009t1 109439 110080
ELLNTPTETV11LRFPFIKKIOKPLKALLDIGLGYLPIGOKLSSLSVSEKTAt.KT'AYFLYO


acPD-ATP Synthase Subunic D
TPETPTLFLIDELFSSLDPIKKOHLPEKLRSLTNSGHSVIYIDNDVKLLKSADYLICIGP
P?LKLK1SALLOAEVQNaV%ZMAECDKDYV
FRLE
1
TYL
'
'


IO~IS GSGKOGGKLLFSCSPKDIYASKDSLLKIfYICNEELDS
.Q
.
l
KOKLAR
VLAKSMSYpVKL
OAYERIYAFAELFSIPIGTDCVEKSFEIOSIDNDPENTACVt~IPIVREVTLFPASYSLiG


TPtWLDTMLSASKELWKKVNAEVSKCRLKILEEELMVSIRVNLFEKXLIPEZTKILKK
~ 12459 126006


tAVFLSDRSITCriICQVItMAKKKIELRKARGDECV
CPn_009
PYk-Pyruvace Kinale


DSMITRTKIICTIGPATNSPF?ILAKLLDACFl~fVAAWFSNCSHETNGOAICFLK6LRE0K


CPn_0091 110071 112053
RVPf.AINLDTKGPELRLCNIPOPISV~Cp%LRLVSSDItX7:aA0DGV&LYPKCIPPPVPC


acpl-ATP Synchase Subunit I
CADVLIDDGYINAVWSSEADSLCLEFMNSGLLK".allKSt.:allKS'VOVALPPNFEKDIADLK


'JRWIHKYLFtGRHKADFFSASRELGWEFISKKCFITTEOGHAFVECLKVPDHLEAEYSFCVEONNDWAaSFVRYGEDI
ETMNKC1.ADLCWPKMPILAKIENRLCVENFSKIAKLaOG


:.EALEFVKDESVSVEDIVSEVLTLNKEIKCLLETVKALRKEIVRVKPLGAFSSSEIAELSIMIARGOLCIELSWEYPN
IQKIOUIKV.iRETGHPCVTATQMLESIIIRNVLpIMCYSDIA


RKTGLiLRFf'YRTHKDNEDLEEDSPNVFYLSTAYNFDYYLVLGVVI1LPRDRYTEIEAPASNAIYOGSSAVhILSGET
ASGANPVAAVY.IMRSVTLETEINIL.iHDSft.KLD~NFnAI4VSPY


VNELOYDWdLOREIRNRSDRLCDLYAYRREVLAGLCNYONEORLH<1AKF:CCEDLFDGKVLSAICLAGIOTAERADAK
ALIVYTEw.~3PMFLJKYRPKFPLIAVTPSTSVYYRLAtiiiG


FAVAfiNLVDRIKELOSLCNRYQIYMERVPVDPDETIPTYLENKDVGMICEDLVOIYDTPVYPMLTOCSDRAVWRHQAC
IYGtEOGIL.:NYDRTLYLSR(:ACMEC'ttaILTLTLVNDILTG


AYSDKDPS'IWVFFAFVLFFSMIVNDwGIfCLLFWSSLLFSNKFRRKNKFSKNLSRMLKMT


AIII~f~ICWCTRTSFFGMSFSKTSVFREYSMTMVIJIi.KKAEYYLOfatPKAYKELINEYSEFPE!


PSLKAIRDPKAFLLATEICSACIESRYWYDKFIDNIWELALFICWHL.iLGMLRYLRY
17194 !huU


RYA:TGWILFMISAYLYVPIYL.CtVSLIHYLFHVPYEt.CCOICYYGMFCCIGWWLAMIrpn 0099
sc hrnrolnq pt'asenc m, r:..n.:rarlnk/t118L
v:: m tt/7J9R
r


OA.~.WR~VEELI:VtOVP30VL5YLRIYALGLACA!?fCATFNQMGaItLPMLIGSIVILLGHNa ro
m
IK:KKFHOtKRTILFJvPLYYLV:7faIlLr:Rlrl'I'R::FI:fi:U:Kt':Ff:FhAFYII::DYRKTAL
t


.~JtltIL3tMrJSVIIK:LRWFIEWYHYSFDCCGRPLRPLRKIVC3EDAfALCIHLtINNStV.
TNLALAFPEATFOERYKI ARO::L01IL: ITLLEI.LAIlXIVA':NtINILITIVT::.~.RIIpIOCFS


::FF.V t.~.tfEDt.EETFKNtAEKIY:L
t LF! :1 NoV hlHt.PFI:I ITKNY I',':
t AFAKA I KNORl::K


:1 ayrp r1 l .'.121 l t=57 3
NIFALIEVFIG:KtVI'PKN:;~IFII:Yr:KI:P:tth:f~AtlJl:":Y'1'YM.P't:::PAFT1'FS


.'r.f.N-ATf. ::ynrhasP ::rtbunit
fALLr1'IY'1't:PtVIAVNV:Ut~AK~FFYf..::AK1.YANK::1.1.MYF:::VAtIJMIIlMttFLEKCTA
K
~.A
:HAVrA::AfDFI7Ir:KLtf?1
C
A
'txlAf:V\
'


.
::UIRrrWIMIHKRNIIIRKI::NVIKKK'lP'Y.:1111.VI"/I~N::::IIF~.:YI~YAIJII:I.'P:x:ITWL
AL
:P
:
LVtGt.AMI(:::AI0.
l1.Y.f:AHt.Y::MIDM::W
I
"J:'t:fOAYA
:1'M~Y:K(
r
dLLV::
X
~
'
t


. r :NAII ILF.fLA?t~FPE'l..L It~I.ItNDpL
. I:.AI.I TN "/f ;\ 11~ lnl.Tf INI
: AJI II r YKIIFHY'1'rl'X AVY
MIKNGTLiPV(xIAU:L::
:A
lt:l
ItlILIJII
Mf:::X3::I


b:::::a'lr:Kt:Vn\A ( : f Vf'.::F::LFAW::KHYLFY:iLDllt'OAPf.KN::IJt
IFY::YIILY.ItKF:IrYNFKVY::Yr:LItl:lYh
FALLL.L .



':fv 44rt 11:440 lliUlS
r:rn on'r'1 t:n'.~n :vrr'. I~.~.gLS


r-r:nt hyl.al,.:rtr:.r1 protein Flr. e.dm::r Iw~lnrJ l'trr.:.ml
IIAI~V n . n:'rr'Lmk/h'HItI. .n:: ..I
I ll/'//'rN
:KLVFtiLTVI
RYYY
'


.
YYwA;:YIf.KI::HFMKIIAf'F:IttMIlJ.::'tl'Tf~rl'9'M't'LLJLKVIfFJth::TfN(JIItMI~IK
.
:
:AI:FKf.ML.DI.NItYMf:::VMDRLGL.IILFIICLLLF1.
:YA::W.
~.RMRKCt.f'VT
vHDE'JYt:WVWISd::LP
:
TEYVr:t'EYaAAA
'
:
:
L
:IEO
'


.
'1'IAVFAY.::f'AD'IVAI'FALU::1:1.::17Jlil'VLI:.A::NIIIYrJ::IK111KF1Y.TKF'
. F
:
O
:
:
It:f.UX:::F:1
:
IL
F
TV:iARl1
'
'


.7:NOE .
AVALC
t:lliN'PI~:FX:KVEKf:I'YL:VNQ::At:IAVYf:LY.f:LEYYEL~:r::f::\


77


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
LK'C!'FJGLIAEELMnINV: v!'::F:'."~!'.'KF4FKNI:.:KKV':::K!IKEN:KAL:iELPNN
:


.
ALOKLIOEEIIrVLTIDOPE:n..iJ:00t::'.:PIr'.'DP:Y:\R:::.4.F~J::.DCCLREPLIVE


~.Pn_OlDt1 l3'Jl9S LI788Z
~IARELVNKIN'llIRRNOG:~tQORtALRI,I~EAWIR.~,F.,LOY,I,X.k~,~~rbl


'T'J1l Isypornecac.il DracHm SDFOCQ'81DINQtRIOiLC:.~.'r:rtDO
I'vfILIJrI(75VTITRTLItIVP~
FAf'
'
'


a
JVULG '
vLFIRNIiPRK
a'fpKKTFILG~.LEINIKFLr
SWIXIID
EWISAAMrIF
"


. CPn 0110 :4:'55 14192
JpOLAPSM.
:'JDLHPDpI'VL.c:LOK:.CfGNKKVSLTT1CllIO~II
VNPK
'


KPICSPPIICIfEYID LepB-S(9na1 Pepc(daa t
KHlILVSVOHEINIRKDIHSVDANDI!'VRLTOY'JTEOTLLTI:


YIiJpKVSGPKEYINALKEOGLELTFNWKLSFEELENNRIAOGSHOEIIFPTPKWIUIILSYPSIIfOtOHYSLJ4K:a
PH:LR~I"fKLLK3KKLAHSPADKKCI"ELLEOLEGIFtNDOE
L


fPFFNrFlIDfIJDFOADFLRt.LFLKAEI:TPINLNf.PVFLFFPVtFI0tf8IPLEYSLDPVPP.
ry' ;e...., s zw..'.FAIS'\'\FI:rl~F'rt~S:.:'~Je'!~:~IRP'K
-...
\.


...,...'.r . ..~J._.;- --r-!.i.I: .
_ . . .. r -: .
' . . .. :.v.~.TKSFr::.tn
::
- :.-
~


' : -.1 "
':Ir: :: ;::
' ::
., .
' , I ' '.I:F....F:i:T'.afl'
~..;~ 'NF\iLF:.~ 1..~...: W.:l,:i:i.:~I'P....
.\''1. ::ai r .~
:.:Ir ::':........ ;'. Nil. . ...t:.l~KHYiKRI:ML ~ '' "'
.


TKTKETTKLYKKEW
GQKTIIDPKOFNOSYGAL:=iy:'ISIIYCOFFOHKFSMQDEPNKLKDPHLSPVSYA<>L!'~Ki


NYAlNRILTEHQARTS fG.rplKVYt'EICIiTANLSYpKPLLRNY1~L5PAI0!!8t


0101 129996 117141 TLLPLRKEJILHLIRM4G.'""f'
.FIVAOGCAYKYlIOPItIHfSGIAKAYAILLPKVII>4CYClf
Cpn


_
$KGGYOIGFGEIRYKLILi.:MPLTOLNDKOVIELFNCCINF$SIYNPVV1t11Pts011PLi~YA
ybbP tamilY hYPothecacal Protein
'


f~TM~t'I'
FTIICIiNL.YI1~SPVFItOtGP'fL.OKI~I'S.~Of%SSE'IOPYIAlYOKGLPPCt>EKT~VE
?S'I'IG!(fgOY'ITOCPSKTNPFDITYYTtPLLEIILIWVMNYt.LK't
'
'


fOE
FINHFGIOVPKGtfVLVLv:rITfPNSADSREItGPVPNENL.GSPLCTfS~IPIGWC~.'DCVSA
FID
pFLFLf~IIJIDKLHLPIIRRII2iMllIIAAIWFTIFOPEIRLIILSRIRPtIGKICF


OFVOpLAA5IY0ISER0ICALWLENKDSFDEYLSFSSVKINAT!'SEELLETIFEPSSPLPCTLSGYLVSCIAIJ1TGL
SLICYVYYOKRRRLFPKKEEKNItKK


HDC7IVIWtG0IIJ1YARWLPWiDTfOLSRSttCfPNMAtJGA501IS0ALIITVSEDiCSV


SLSRDGLLTRGVKIDRFKAVLRSIISPKEHKIIIIPLFSWIYBLR
CPn_0111 114761 113934


Ct031 hypothetical Protein


~Pn_Di02 130099 131166 Ot-0NRYPTNPNDSSTYFER-
L~OKYLiKK00KTLF..FLFLSfLFSTAFSC:LFASQ!'S$LRT


cydA-Cycochrome Oxidase Subunic IO~I~'S~~~'~P~IEI THFPCIAHKERP$LEOAS~IT
I
51


FYIOFNKFHDJ1LILSRIOFGLFITFHYLFVPGSNGLSh?Il.VINECLYLV?I~OTYKON7VIIIOLESPSOVFW$LS
SEGSOFFSLIffRTKSLEPVGKSTTVPAFLOIFDLPLSPAPANV


iWJCIFALTFVW'WfCIMOIFSFGSNNANFSEY1GNIF'CCLL35DGVFAFFLESGFLGII1CTIDQIENKPWSPKVSF
EGAPLTSISVNAWOGLWPKDRCPL.S>:fGI:J4Y!'fOPDISVFIL


:.LFGIUiKVSKKNHFFSTCMVAf
GAF91SAF~1IICANSWN01'PSGYEMVIOiKOKLIPALTSFNVSIETPKGTSIVR1WDIGHCATSPYVYSLPDSK'It
Q
'
'


fICAVIVL
WG1VFSP1'fIDRFINAVI:CtWLSGVFLVISVSAYYLWIOtAIOIETAKOIaOtIG


:'L0I1tS71Wt'ARCVAKNOPAKLAAF'ECTFitTEEYTPIWAFGYVOMEKERVIGLPIPGU.S
0112 111743 115093
CPe1


FLVRItIIIKTPVTGLDOTPRDEWPNVOAVFOLYHL3It4.WCVNVALTLISNSAYIfGk1Rw11i.~
VI 9atH-IPetllZl Glu cRNA Cln Amidotransferase
t8 Subunitl


KPPFLVILTFSVLLPEIC74ECGNCAAMGROPWV~CLLKT'KZ~V$P~SLDSDIGVVIHK>0~71PEYRQVLFtfDSST
CYIfFIICGSTYOSEKTVPEGKEYPV~GYVSVS$S


FSLVFIALL'ILFI'M,CKKIKHGPEEENDLTEFEVKSHPfPTGSKK1VDAEGRVDKFLKRYSINROPAOOPOPEEDAL
PAA10CIOLKWTKRKIt


CPn_0103 131465 132511
0117 115329 16105
CPti


cydB-Cytochr~le Oxidase Subuaic .
II PtrA-Peptide Chain ReleasieW factor
KAKEDROtRILLNSIG IRF-11


NACIf7IELSLTSLLPLAWYVTLLIIAVFAYSFGDGFDLCLCAVYLGP'MD'DIVAEYIiJRLAEVEIKISNPEIFSNS
KEYSALSKENSYL.t.EL10iJ1YDKILIIftkYi.
FSV$


PVWaZiEVWLVIIVGGLFAGFPACYATLLSIFYMPIWILVLLYIFt~SLEFRSKSADOR011LAIEKDPElNVl4.EEG
INENKVCLEItIliKILESLLVPPDPODDtiNI!>:LRAGT


'rJICIFWDIIFICSGTAISFFLGTIVIirILILCI.PLSPtYfSYASLSyIILFFIIPYAALCG11WAIU1ALFVGOL
~IRNYHLYAS$10(rldIfYT:YISASESDLXGYKEYYl4ISG1'f~IIKRLLAYFa


"oAFAItiGSCFALI9fTSt>railiARIAOQFPYILSSFLVFM.FIJGASLISIPICtFOAFPfYPGGS
C'fHRVORVpETET00RVSTSAtTIAVLPCPSEECfELLINEKDLCII7TFR71SG71040fIVt0


..:.ILLIALTSCCCVAAKTSVSKKRYGYAFIYSTIiiLLSLILSAATLTPPNTLLSTVDPOYVT06AVRiTNLPtGVW
fCODERSOfiKNKDKJIIOtILItJIRIRDADIQKRIDiFJISAIWEiIQV


3Y1'IYNSAVZ:TKTLl(S1:LIIVLTGLPFIITY1'CYIYRVFRGKTNPPSIYGSf~SEAIRTYNESONRVfONRICL
TLYNLDKVNOGDLDPITTANV$NAYNOLLIaGi


CPS~O1D4 133884 132676
CPlL0114 146371 117261


CTD17 hypothetical protean hamK-AfG sPKitit methylase
EIC5~0IStR.ti.At.CfAINSPAIYAJ1DSOSVSFPEQLPSSt7CEIKGMNRl4rtLAPNTVMPTTSYSlR2IKKAI
O1T'AYLDYYOVPLSOCEAT.YII~07LtE1ISSRA10.FI1LVOISlT


OGTIIREFSKGDLYAVIGFS%DYYVISAPPCITGYVFRStVL.ONWOGEQVNVRLEPSTSYRICRIJILIiGORCPTAY
i1JG71VSFIGLRLRVDSRVLIPRTLTELIJ1EYIIbYLLiNB
lu
a


APVLVRLSRG1'QIOPASOEPtIGKWLWLPSOCVFYVAKlilVANl06PIELYTOR~CI.
.
t
EIOTFYDICCCSGCLGLrII>aCSCPINEWLSDVCPOAVAVANBtIJIKS1(OLWKILIG~S


AIAOLINSAWFAHIELEK5Il9EIDLE71IYXKINLVOSEEF!(aVPCIOGLIOKALEEIODAAPYTRPADAFIR:NPP
YLSFNEIINIDPEVRCYEPWKALVOG51CLCFYQAIApdfltlV!


YLSXSLPSONf$IAS$OCSTPIIVSSSIVTTSLLSRNIAKCrAIJffAPLTQCREIG.EYSLFS'fGVIRiLEICSSOG
ESIKNIFSKtIQIYCRLIIQI>L9GRDRIFFLfI~GRDWS>i~Y$


RIWASLtQOGI~HSE11LT0EAFYRAE0K10(OVL71GVLEVYPtIVV10~8(PGDYLLIWOENTIA'


FLYCISINLDOW<GKRVTVECLPRPIRMFAFPAYYWGI1IFJ1SCPiL0115 117779 118632


CPiL0105 1318$3 134039 Cfh-Signal RecoOnicitm Particle
OTPase
IMNVKDFISRVImCIL
V
ll


CT016 hypoehetical protein iW
ALJILit
MINSLSOKLSSIFSPLVSSRRINEC'ITSESIRE
A0%11TJ1i11aJ1


YVPFRItFSNpNPl2LIYCKIO~ItiiQwPOTAKIRFTPKIAIOMCTNDOLICIPpFISIUtwGELIf001VSP000FI
RCLRIi~.VAFLSt7CREEFTIOKTPSIILf~CGL0II
K6
'


SOI71FIESOEGatKDOGTLALfILIOCKIISIPNLDOSIIDIAFOENLLYt~fSO~SAAVOOLKILVAQTKAEFYOSO
ENKPIMIWK71T.~YJI
DYVI10~1K1tAKKVI'WPCtILIOtP
iLD


Rt7DDKLGVGYII4~iVL00ITKGNDIOVLPKM.TSPLFS17TNPIFJ1ILON1'PCNKdlPDAP?NGN1NVILD?AGR
WIt7NELItdELTAI0KVS0ANERLrV!lHAIICQDVLIITV0111001
R$FDPO&IAiRIIGIGO'1'I
~fDGIIARAGAVFSIKHVfGKPIKFDCCCERIO~


tMlJOIADVIRVLSG4NIltLLPRPEPIICOICRVM4EEDTLAVSD~t.TFRIWDIN.
LT4IlIL>i
V'fAAf'fYEI7YYKQNKAFMfII3PIJIKLL0IlPOI~AKP
GIOC
EDAEt


QSCDKLYIVTNPWPSOOFSVYLCPPIGC'fCGEPNCEHIKJ1VLYT.
.
NIVK~tREYISEE
pITLCDVNOPItIOpIIS
~~ ~


CPn_0106 135073 136371 g
~


phoH-ATPaee
EIIVRTOIOIKI2NIGCSVFIYDPEALFSFENTRIIIPFPVIEELF~1FGKFRDESAIOJASRACPn~0116
11!592 148971


eyAKTKVTpGyyLP5GS1LLRIEVApLSNDDRRGKLLTLELLXIIAIaIEPNVFrsl6-S16 Ribosomal
Protein .
LSNIRLr~


.
EICJ11RR1C$VALKIRLROOGRRMiVhtALVLADVESPADCKYIELLGWYDPHSSINYOLKS
VTKSLGRRVRAFJILQIESRDYESKRFSFRSLYRGFRELQVSOEDIm~IFYlOdCYLI%.PLDV'


VSSPNEYFFIISJIGENHFJ1LGRYYVSECKIIJILKAfmKSVWCIKPGNT00RCALDLLLRDDOREE
EAIFYWLERGAOLSSKAF~LVKOOAfGVYSALISKQEARKLVYR!(KRRIIYItORR$1


VKLtIfLIGOAGSGKTTL71L.AAAIfiIINFDKE1YHKVLVSRPTVPMORDIGFLPGLIIEDKIa!AAIIDiITK


HwIpPTYDNMEYLFSIt~IpNL-ItSSFJILOALNDAKKL6NGLTYIRCRSLPKAFII
IDEAON 0117 18983 150071
' CPn


fGNNFNTA _
LTPHEIK'fIISAAGKGTKIVLTGDPt'OtDSLYFDENSNCLTYWGKFttHLJILcrmD-tRiVJ1 I9isanane N-
11-Nechylcraneferase


TERSEIJVAAAATIL
'IiGMfIDILSLFPGYfOCPiw.'"ISIIGMIKORLLDVOLTNLRDFGLGKWIfQVlM7fPP$OCG


0107 137321 136392
NLIt4AEPYTSAIRSVRIIFSISKYIYLSPOCA1.LTAEK&RELAAASHLILLOf~IYOCIDAIA
CPn


_
IESEVDEEISIGDYVLTNGGTAALVLIDAVSRFIPGVLGNQESAERDSLENCLL~POYT
1
~f05B hypothetical procein


_
RPREFECKEVPEVLLOCOHIUISOWRLEOSERRTYERRPDLYLNYLYKASIDHIIPDE81T
KKSPPPVTPKEIPfQPKPPIPORPEVSPTPTDHIVPGSIEASPILCKKPSPDSlIVSPLSL


FHKMLLENWTPVEEPFPWPPAEKNOKIFAWALNOSKLIFVSTSCTIIAOPRLVTDSNSIIMITNRDHFKCDKISSNLEV
NKLKRA10IFYCKVFCLDAtISCENKFCLPItEOKTTIwLR6V0AE


VNAANRTNSRDCAC'INOVLSAAVSVDSWGLSORPLNPEROGTPLNOCECPAGMWPNAOGSKKNIVTLSLSLOCACEEC
Fs:YLLARWELFCGKLLiKQADIaIiAVWALAQDLOCNAWIFSWH


NHTGKQf;KPNYLA44LGPKAVDHNNKSpAAFDRC10JAYLNCFSLAQTIGV'IFLOIPLISSRIIK


.'.IYAPPf3dRKKPNSEENKVRMRWIHAVKCALVAAIpEICNEPf.M'DRRNLI'JLTDLKTPA
150075 150461


ITOPKKIL7HL CPn_0118
r119-Lt9 Ribosaalal Protein


010R 177857 137303 KKEH!'RNYIMMLLKELF~
EsQCRNDLPEFHVCCIIRLATKISEOCKERIIOtIFOCfVWIRR
~Pn


_
~ENSLNRVAYGECNEKSFLI~1SPRTVSIEIVKRGKVARARLYYLRCIfI~KAAKVK
Ct'018


KNLFNYIG1ILNSIFNEEVFIISHRHTPIGQTSTALRIfIPLVNPLFIRTNLOAIASYIPIFSEFVCPRSSKK


TFIGIKTLKGIS3LOYSNVLIrfCNFSSVCKTLPCPEIYEELP1NRKEANLEIfGIKALIY
1511si4
9 15~52D


LVL,~VIKIIKLIVRYLCPCCRPPEPREPONPLTPTPLDNGOOIDAIFS1'PTSPT6FKDPF.
CPn_Oll


LDDLLOEDKKKAPNL rrihe-Aibofsuclease HII
IMNf.itSEIORPLS?ttAFEKELVSEDFSWAGIDEACIIGPLAGPWASACILPRCKV!'PG


~:~ 0109 138e46 (11783
VNDSKKLSPKQRAQVItDAIdIOOPEVCFI:IGVISVERIL>QVNILEATKGNIQJ1ISSLPTS


ilaS-Isoleucyl-cRNA Syncnecaer
PDILLVDCLYLPHDIPCKKIIO~ONISASIAAASILAKEHRDDLNLOLHRLYPEYGFDRH
'


RQKIffADEVf:Y113PAKKEEOVGKFWKDNOtFEK.iIANROCKTLYSFYOCPPFATGLPHYGHAKSP.iPIKONCAI
V
KGYriT.~.GNEAIRRY(:p: A


IILLA.:TIKDWr:RYATNOCYYVPRRFfWr
DCHGVPVEYEVEK.iLSLTAPGAIED!'GIASFN 1'1125 111779
%


EECRKtVPRYVHFIdEYYINRLGAWVDFS:"ISrK171DJ1S6?IE.fVIIWVFOSLYN'~LVYl7C'fKn ill3D
CI


'NFF.'TAIJ;'fFLONFEJia1\NYKEVDDPCLWRNpLONLL:ASLLtMM'TPkffLP.."lBIAfAVqmk-
J.llt' Kin.sur
~ LKLFTIwAPN:Vr:KTTLVRMLEQEPSSAP
' EI:LF:IIKNfJI:YL'fWKtU'~::PF.~.pOIN)ICCI'


fEFPFTPf .
DS EFORI.I.DROALLfIIVFLG~f:pCYGT.iMLEIERIW
:I:TLY'l'JRtQIW.K:X:EOWtLSQcxI/OAWF::NPEEFVTLE;:F~KDLVr:RTFRECEVtt:KCYHlV::HF
~ 'CI::V'ITAY
A!


.FTEEIP .
Y.rtEE~\FNVtNL:FVF.E.:OLTCVV111MPAf~CEC:DFLVCKENH1/PLVCPVDA1N:..
' .
'.~EELERRLA
SRt::EFr~.ORKERLFJISL
IIAVAVtDIC~Iw\LPIF::RNP.~.V::IFIAPP
::II:Y


JAVEKTK .
y'ftfYJlIt:HADKEIIKFLKKECRIFYIfCNKIIAYPFt3rATUfCLIYKAVN~iJF.
.
' '


~ N(HLIsNJ: B: IIfYNpElI fUHa :Rff:K4lLlI:AAISYIAAYRVLK:: t f t Al:f7lAtt I f.
I::RNAYWt.TP t p fYiK::AU:E ILWr'w~I filJlAAlK7F VN I INDDIIw
I


ItRLEEL'D:l! ~ t'tU I IIHI IP I
OL'LN I VK0.:XPFIIR I PYVFDCWFD:x:ANPYAr~t
iHYPFENOK


I:'fEEAFPADFtAfi:LOCt'rR(~IFYTGTVI::AIC.FORPAFIINAIVfK:ItLAFII:IIKNSKRLN1.'I~r_
Ulal i'i7n:n I'.11.'if


NYI".:l'Y.'NLIIfYr:AIMt.RLYt.WC:WIfKAEDLitF.~uGKC:IFYNLKQILt.PLTIIJL:iFFNTYt.'l
tlil hYt~rlurt u:.nl srnr..in
'
f7lINlKKIN<F'ITII:KLNKI.t".':'.f'F::LVtYIAIKVAKIY
' fAKf:INIt::::NVAIl.'fL.VLLDRErt


JTFIDDL .
fN
fTf:I:IV'JTA::ITIJf:Ai:H::l7rrN::lYKftl'::AY'IW::D'/N
Al:I.Y:FDfK:7JDIl:PAYTEtLtWIIL::Nf.Y::VVt:KVHF-':M:aJYIILNNAVEPFI'Y
' t


.KVIApFVPFtAI:DIYOY.LKt.EKEPFS .
wYtl.W'NHItlWI:AEDTCMiI4VIF::TLYCVLTVFiJ


VIIIfhf'I! NFrir~K l LPIILEKRMIIDI
AI:IVCtt:II::LRKEIIKI*VAOPI.ANf"f~JJ:3KDAL:i


7$


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
ALLKNPOf:I~ I KGLKrjFL/: _ ~:.~
_


"f'n; l 1.:.: ' 5!0:7 1 : J7.'.
f


mntr,Nt)fntmrYl-CRNA ::YntnrtC.tao....
::ALPYANGPIJIFrfItAGVfLPADYIARFRRLLGODVL'lI~~3DEFCIAit.'Pn 01?~ . ..1.i.61S84
tt nLsG56l
, p~8.~
.Dr~rsMtn..lVi..l7MediWflitIBL arG.
IY~ .S
ac ttowo'IuIa


':K1MPOKVL:T .
TLNADRBCU:yOEYVONYHKWKOTFEKLGFALDF!'SRTTNPPHAELVODFYSOLKASGLNo rotm
NIEFAPVPHTSYTADR:EDRNACRIpIKLSTLAITSLCYL.ISS~fCIN:O:;.:ISCIVGTY


tFl~KtISEDLYfQEORFtJIDRYVFLRCPRCGFDNARCDCCOSCGADYEAIDLIGPKSKISS
ALVACVfFLYFFYFSSEEPKGASSOEFRFLFIPAWSJ1LRSYEYISODA
FWfitIFSVL


'VELVKKETEILiYFLLDRNKDALLSFIOr'rLYLPDNVRKFWDYTOfVR.iRAITRDLSWGI.
A
INDVIKL9fNOt.~':ILiSLLDPEAPFLEPPYFNSLIVNNSNKEADRLSRGIFLI:.1GEI'ISiK


PVPDPPCKVF'fVWFDAPIJYLi~'NEWAAiOCNPDE~IIfRFNLEDGVLfWQFTrKONLPFHDCETKILPWLKDPNTT
PDGfVfKLLKDNFDLKDFKKRIA'IWIRKJ1YPEIRLPKKNCLDKS:


~NFP1INCLl:OYLD.fKK'JDALYV.'.EFYLLDCROESKSflGN'Nd~KFI-SSYSLDKLRWL. ... "~, fe
,FFPV?11T: a.~...rJf"r"r....P1P1P?~.'!.'!1~R:~F
. -~Y.:....t....


o ,
... . E . , . ,.
W.\EY.!1115~~ .. D..:~. . . .
.., ~ 4h:rl!NI ..
. .
'
M


. : . .
. 157349 166561
..y y .!
" .i:RY~\I
' .
r,~~r . ~:'; I?K'': Al4J~:..'
'/ALWFl4I::FK5LE?ICNLI7fhfYMGL.iIKEEtLDVINEE
'


lPI IPEiA CPn~OIJ3
:.F~:AI:yCpKLLrli.i2


FNLKSPRLLF'TfYE C11LP5 ttYDOCl~lcal protein
NRIJflOV1'VCRVSIRTSCIKIRN
if


L5
NSSAYNPKLLIOrLfLIrPCCIVGYfWNRitC5IVE0


0123 155775 15377.1
ICItMP4ISERFPYAACIEYADVRtSSISNLLTKOLEISfT.IIiICJINPTIFPYDSNC!1KT
CPn


_
NWSLVW!~POK~'P~IDRAPVLIRRCLFLNfRLYGLRANNKDIPNLSVPSLFJNS
recD-EJCOdeoxyribonuelease v tAlpha
Subunicl


NSNEKICfiYLEOILVEt4KDSGDZTAYIKIPNKT'fPILIKCKLPOPLELGSPIOIYGVWSNNTSSAKE<'P!(LSFJ
II'PSLL'fGAL6ESLYNLNLPCDIIKPLS00ANKNFYSSYPQFODRW


WLDITP DINTPC1'P'fEEIICFIRCLPFH
.iPSM'KYFOIHSYDSPLLYEYRGVFNYL?SKLIKGIGPKI11BKIIEKFOE1(Ti.


"NLSCVSGISE:RCVSTCKOLCEOKILR1CTLL!'LDEYNIPINYQVRIFKIfYOEKSIEKIC
r


. . _ _. -. . .
EDPfLLARENECIGFIfPADFIAIOfLGSfPRNSESRLCAGIONSLEELO~YPI1Q.LI


a/VAKLtlJQwFD'tPITLEEID'fpILP810KRKLWI0DI5~1'LHViFfRYWLAEIITIVSD


JIRZLPSSRRIRSIDGEKJ1TAWVEF?1LSIDLAEOORGIKACFSEKLT..I:GCPGTCKST


:'.'QAILKIFEQVrIfIII:LAAP1'GKAAKRNTEITOKHSVTIHAiS.05fOFKTKSFRKNNONP


tDCDLIIVDESGHNDTHGLHNFLKALPDY1?LVFICDINOLPSVCPGNILKDLITSNKMT


'JLRWKIPROVNDSCIVfNJINRVNt7GELPILYSETCRI~tDFLFFOLmDOtF~IiJI!IINLVT


KFVPOKYN1YPODIOVLAPNKKCTfGIYNWKALIUiALNPKKANLNCRfOSYAVGDKVNQ


IRNNYNKEVENCDICYVS:INFEDKAVWRNEG1QIVGYSFSELDDLVLAYATSVNKYOGS
'


CL1E
ESPCIIIPINTSHFNMLYANLLYTAITRGKKLVILVCI'KKAlAIATPfB'tRV0NAC1


VLKELDT1CKHYADL


=?r>_0124 156575 158068
Genebenk/E!>8L as of 11/7/98
i


n
No robust tJOmolog present 169N8 169143
IRSKORTVAITLLVLGILLIASGIIFWVAIPGLSSAVALGLGCGMI'AILTVLLTIGLVL


LIRSEKLALEOVEIKOAR'fR~LDOLSOYVFYTEIiVLDNt.KFW&YR~~FVR~OE',;:
EI CPtL0135
proES-LORDS Glapesonin


TNLEODIEEIFL:~..RDIRNALDNEEPFNTNAKOCLAOVGFSLP'ODASIDEFIMJWLShIS00ATfLRIKPLCDRIL
VKREEEE71TAR~IILPDTAKKKODMEIILVfGlGKItTODCf


ROHLDINDPRWSMITKICVIICIINRIIYVSTNYKQIK9JPDISDfGQLR~4.L~!1ITIEELLPFEVWQOIliiIDII
YACOEITIDDEEYVILOSSEINAVLK


VLYOSFOKGYNRAALLSFJ<TAIINTSSLLIBaEKDEDIDILNIRfIfCASRL.aIFRxPRTLFL
P


GLSEFFnVVIDFTDASG4DCSKLPAKEVPLI)GGKKKLNFKRTFADCQVGDWDRTTSLCPA-0136 171119
169569
OEEDPLDRLImQVEOFATSVLKDODRYWKEIETSFaKFRSLPREOS0IDSIlIItDL


DDHLSVW11NOLSAAEDALIEV:'DVOBH~tRF~iL104I00GLELIEDAVKATLPRVDFIOELpepP-
OliQOpepcidase
!!i'f>~%TF~PKFICIiDTIflfIfANREEWKKDPDLCSSI~PSPItIPEF
KCVPSI


LEKEELPLVAARMSLENS ,
SPSIfYQII7NPESLLEIi.BKItFSVOIKLDDLYIYANLINDODITNP00ESDY0SIVYGYTI.


FSOEISWIOPALIALSEEKYAAI3SSSVC~PYRPnF~IIEALSPEfI~'A~KIL~FA


CPn_0125 158072 158605
ALNVSNKAFSSLBDAEIPPGIAKt~NCEatPLSNALASLYhOSPDO'AY
f 11/7/98


No r~usc tromolog present in Genebank/EtIaLYDYR?IfPANLfi'
rif'INOAHLFEAKARNYPSCLFJ15LFONNIPrNIfINLINCll00lTSLIN
as o VDLVCICSLLPIr171YV6IL1~L
KISSCAEINSEYKPLFLI~fDSFDIJ1TORFQIIL.It~G.QEOAEIYNEYEI0C111RWNEIKEO~


KDPVIQtCIEDPPARGL417L><rT'f-''TrRDFttD%AKALTSl2IECPCIGtYIfSIN0E1nOR0.
tE RYPNLKIIIaIld.KtPHFYDVYAPISQ1TSIWYBYEE
D
ISlAIWVDRYLf'8IN1WSGIlYSSCCY0SJ1PYILIiiYltFfLYDVSVIJWIJION9O15rFSAP~I


O OPyNpFlpypypyAEIA81'FNDSd~IEAtSRSDO8KED1IIVI
ROERL.QKNAHtYRDCKOVLEAVQVEQKDNISSRWVDDSYtEEAfEEOKVDNRIOITKTLDI'IFATLFRQ'!!TA


AFCYEINSAAmGTPLTEEFGSATYGM4>fEPYOCVVTSDSL.SALEiOUIIPHFYYNFWY


CPtt~0126 158806 161085
pYATCIIAALSFAEKILTDEFGALELYLKFLKSGRSDFPI~IILKKSGLDMI'fSAPLatAF
7


/98 AFITK1CIDLLSSLLS~
No robust hoalolog prssenc in Gsnebank/EHBL
ae of 11/
~LLLpKIOPK
V
LLLL


L
. 0137 17=263 171502
LLVPSYYCNCLFFFSGAISSCCLLVSLGVGIGLS1LCCPt1
APDLLDLEDASERLRVKASASLJLSL.PKEI~LGRYIRSAANDIaTfIK'LDiPNKDORLVZTV


SRKLERLiIAA0N1MISELCEISEILEEEEIOILILAQESL6I,lIGKSLFSTFLDIIESFWLS~
il


NLSEVRPYLAVNDPRLLEITEESWEVtfSHFINVfSAFIOtAQILPXNNHtSPl90~LEb~fOY
YbDI-11CR tas
L4&NLETLLSSKIFODYOPNGL-0VGDPO'fPVKIfIAVAVTADLETIK011VJ1I16


ELLFTFIYKSLKRSYRELCCLSEIQBIIINDNPLFPWV000pKYANAIDIEFGEIARCLEEFVC31CNAD
ANVLIVNNOIINKfiItPYPi1'CNIMIRIOLLIEtCfIOLIAYNLPI.DIINPTIGNlBiRIfALDI.


EKTFFNLDEECAISYMOCWDFLNFSIONKXSRV~t01fIS3'ACIAL.K17RARTIfAKVLLEEtiIIWImLKPPGSSL
PYLGVOGSFSPIDIDSFIDLLSOYYQAPLKCSAROGP91l1fiSAfILISO


PTCGG1CIIE.OOIIQRAFEROSOEFYTLENTLTIfVRLCALGOCFSOGREATNVRpVRfTNSECAYREZSSAATSO'1
IDCFI1'G~iFDEPAWSTALESNINFtaf'CIFfATEKVCPKSLAfiILKEE


NANDLICESFEKIDKERVRYOKEORLYWETIL>RNEOELREEICESLRWNRRKCYRA01!DA


GRLKGLLROWKKNLADVEANLEDAThIDFENEVSKSEGCSVRARLEVLEEEWGlLSPKVADFPIS1TFIDTANPF


IEELCSYEERCILPIRENLERIIYLpYNKCSEILSKA1(FFPPEDEOLLVSEANLREVCAQL
171091 172700


KQVpGKCOERAQKFAIFEKHI0E0KSLIKLOVRSFDLAGVGFLKSELLSIACNLYINJ1WCPn_0138
-Glucaleate-1-seaialdehyde-2.1-awinoniucase-
'Itemt


KESIPVDVPCNOLYYSYYEDNF~IVVRNRLLt~IfERYt7NFKRSLNSIOFNDDVLLRDPWO.
TNSRLFLAIImOLLOI~WKLTKRNl;~ICSNOKF~'VTFEEACOVFPOGVNSPVRIICRSVC


PEGNETALKERELOEZTLSCKKLKVAQDNLSELESRLSRRVTPPIVSS71QCDI!'L~fIfCREFIDFCCDWCALIHGN
SHPKIVKAIOKTALKGTSYCLTSE


EEILFATNLLSSLKLKEHKIRPVSSCTFJ11M'AVRLAf~ITNRSI
I IKPICCY11CIIADTL


CPt>_0127 162152 161130
LCGI57TECTIDNLTSLINtPSPNSLLISLPYNNSQILHHVNEJ~IGPOVIYCIIFEPICAN


ycfF-Cationic Amino Acid
TransportertIDIVLPKAEFLDDIIELCKRFCSLSINDEYVfGFRVAFQGAQDIFNLSPDITlYCKILDC
ESFNPPSANOESRTRNVPLGIFtiGLVACLYWGTVPVIPNFLGSFGDLDIVLTRYTIFCIF'


SLIACAIKNPSVIIDI'1'PLYIWRKSLLWTLLINPVYYFCITLGIRYVGSAITWIASLAPTIFOACMSGNFWNATGHA
AIOLCOSDDPYDMSOLFJ1
CLPJW1LVONRSILDNI?IPF~'I
'


AVLYNSNT'KOKELPYSLLFAISSVIITCVILTHLSAWLPTAASPLYSIILVTAVILSIStNFDEAIOJStrifEICfQ
TfYSEVPONG
LFYSPIEEEIRSOGFPVSLVt~GTIIFSLFFTESAP


LWVIYVIRNQSLLElDfPNLTPD1WSYLICISALITCLPNIIILDLCCITHVTNNLISHTPVYLSPSPLEANFTSSAHT
EENLTYAONIIIDSLIKIFDSSAORFF


GSERLLFLLLCSIINGIFSSAKALIAWNKASLNLSPALLGA1LIFEPIFGLVLTYLYSOSL
174686 174093


PSLOECICIFTIIi.CGSLLCLVLFGRIfVOKSLENSOVSSSNECPn_0139


SPA' KNItLRDIMCIPYARLEKCSLLVASPDINpCVFARSIfIt.LCEHSI14CSFCLIWKTL.C


CPn_0129 16226: 163057
FEISDDLPtFEIfVSNHNLRPCNCCPLOANpt4dLLHSCSEIPEO'CLEICPSWL.~DLPPL


bpll-Bioctn Protein L)gase
QEIASSESCPEINLCFGYSG1'lpAGpLEKEFLSNDWFLJIPfBJICL)YVPYSEPEDWALVLKO
EDRCRNLRNt7Vf.WCSECVSPYYLRHTIRFLKWSTODCAFDTIRVDCNFLIIaJPFWEET


TRLLVFPGGADRPYIfRVLHGLCTARTFOYVSECfR,IPLGIC1GAYFCS1WIYFYEPECAPLLCGKYA:aLa'-
TVPONLLW


:GARDLCFFPGTAKCFAYRGNFSYVSPSGVRVSPOLFSDFCLGYANFNGCCFFEOSECYP
L75110 174673


.~.JMIE.iRYDDLFGKPASIVSRIVSKGWVLSGPHIEfLPHYCRMVKENVOKTREFLORERC~ 0140


TTLDRYCOt4LVQRLRQPAFSKAfIC ~E
PRSNOOKIFCNSLEKELLETPLVLLNF.IKLVSFCNIACNILGTEEKKFAIYGHVSIICOJI


012J 163747 1ti3064
FOCAOTE.HSPORPFAHDLWFVFSCFDIOVLR'NLNDYKDNVFYTRLFLEOKDREFLYV
vfm


_ VWDARPSDSTPLALTHKIPILCVICSVFDAWPYEE
similarity co CT036


DEQYILSHIIMDPRIFVrSEPLOKTYOKLQEKHVNNLGIASQVSLTDLONKTQYtTfIJLIE
175817 175110


TTMEITYYFPWIMPDILRSEWDPISNOLYLIFKKFFIHYHNLPSTALfRiJQTLLIDSLCPn 0141


NTG~SNPTARONELL1FLCVFEOLDYNEDEYTIEPRGYFNRFVYKNSOTAPOIOSFCLLHrpiA-Rtte!-S-P
Iicmerafe A
' HSSSAVEYDW W EKKCL1HEAATOVT:
CNILCIG3CSTAKEPIFA
W iR IOfESLAVHA


LTfIS IVLCSP ILYOL ITEFClTKIHADDFOCLII41~.rONa~fALAKOLJ1I PLW
PEKPSSLDLTVDCADEVDPOLRN
I IKCCOGA I FREKILLRA
o f CF?L~YASNN L RN1.


Pn 01)tl 164251 167751
ANRSII:.'JDESKLVFV4:RFRVFLEI3RF~IRa~AIIEEIRNLGYEGSiRWDI'COLFITDS
:'


.
::NYIYLLF3PNSYPNPEKDLLKLIOIHr:/IEWSP/LONEtIW:SN.4pGLI::KKYSV
Nu rotxmt hrmwloa pr~senc in uenehfnk/f7tAL
au .~t IL/7/~8


.:::MVKf::::: I I I IENKKP.k't.LFESKF11
I'PKLSL1I L~LFLG IANI: I L I ALSf:LLI'NCLLl 175914
I 4% l:dl'


Ir\L:LI::It'Jf::'it:ILf.P:TQt'.::K.~.VQKDEOKPK.:IFPKBfP.:LDPWLWM.KNKIQa:;s
CPnIIl
71OR
u: ur 11
~nk/f:NflL
rrranc m :wnet
mlwl
t
t


FTLLLDff::INLKNtT.i'FN:FEfIiKKIFLKGPDFLIY::ALAN41KILE.
.
,
v
.,r.:
c
No rn
'
'
'
~
~
'


.H V. iIIDNA 101( I R: f1
i
LKP LAEN
KN.~. L LIfPr.
:HFE t.
1
.
!:H::Y::Y::Y':'LLEKFIIFK I LvLL..


mCynltl I~.44.1L lu,'.':RO
ftlNfl':FYDLKiDYfKCh:KRFRFLY.~.I':PIfLIIYLWF:IF'IT


t7.. rAar:r Innw.l..t I,r..r,,au
in .aenrhmk/FlAlll. .n:. ,r 11/')lnk.fn!)l4: 1.'::47 1'/r.14
' '
'


. .
:WIIJfT:IV/AQ t IWf>,nttJCt:.il Pr.m.tn
.::;:LYKh:Rh::l::1 \IhI.IPFF'l::AYVFP::If:FLt'LFHI7JAH::::::hVYNaB-
' "Yxlti
' '
'
'


/FV1.TIALIMI \I::LVLFLLIRSV _
f::VMt.FLU:1A _
LI
tR'fftftf..LKRf'LIf:01FUl1TL~.FLRI'EIILYKTFE:LY.Of:::I::LWLNUIhxJIALUDLIKK
r7/YY
Y.IH:Kt:IC'VLL:.
AFIFCt:\
:I
llEDP:DpM
HALWP
:
'
'
~


~
'


n r
Mt:L::hI'PU:EhItRA'IwIIYLFHW:F'IY:H:uItFATEtNFFtf.EHANfIJLfIYLTDKf:Y..
YIKV JK
:
~
.:F:K.:\:\IJWIIhIAa
LINKIS
rrL
fi.Ll
::l
IC:::FNUkLw.
1
:Ah'fLDDWI:O
PI
"
'
'
"


.:
IIIII'hVLIIFY.FYKALEUEFTT\YyTLPAINrFLYrhIIFMIhIfEVTRKFYf~'ELfEDIVA
JI~YY:::HIF:RV::1.I.V
':1.1.1::IWfYI":::VI~VF:ALLI
.:I
~'Ehat:K1
lLf
:h'uV::UAMF:IWRY
:M
:

L
Y'fr:Vl'
UI
r
'
'


.
':YIIKVfIGI:fDKX'ItYl.~tl.ltft:TV't:l:llnFlr'/r:.'Wfr:IDEY.r:II~ULIyOYLLIMJLVIA
D
:
:
:
,:VHIf1411NI:F:I
LYh
(NKI
v\HF:IIv:IVUfKIIEfN:
E
L
LIe:AId:l'C::ha.la'I'Itl.l:a'1'Yt:INVIHU
\::Ft?!I'KG1/1U::UF'LW 1'KNVI:Wt:KFI:iAI:I:K


79


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
OKYMDILGDAPVSLLYUAFA 1F:.:lHt~:.taAit?IL1AEEEAKRYVEEK(~CSPIT:'


Pf'(>OL'PrttiJIVr:Rf~IYH.~.KFFA:x7.:lDFtAKPLF.WEYJ11t0ALFJ1LAAELDOIltNOL'TL:.EC
E:PLANLK:iiFSDLNGRLKVSVEKAiILEEEiO
:'DCfYLEFDNEK iGDF3PLTFI


::f)EK'CIP:II:L'r:SKTPTLENKOEYfARIfIpAAOYLPLERt~iLiPO~FA.iCEtGMLTEEGIOEOY~ImRED
Lt?lt~i"LO~AIDfwI/DKQki~LG[rhCiEE
:


EOWAKVAt:~ItEL iEEVtdK .


~pn ~1t44 17794?. 190560 CPn_0151 1?4179 :?2i25


:100-t:Ip pt~cu.)ae ATPase
DAVSFrILEKAFELAK.iSK)fCl'lTi?1HLLL\:S.EMP=:.FYLVIADINCndtpA-lbnoo>NCfanase
~
CY~LPKYPiLVICI1NP'K:L:L1NMLLGHCLSVKVIDNRASPEDPSF'..DCRKLP
M/LGVNFMEKF
'


. ..)n~...-.r-- ..,_\rhn... -..,rA.vf,e-..ryr-
rt~..._r.,.pr~L..n'.~'Y:
. v
REP'N'/t:EIfDPKPSFr:LCTLLRLIAKOEAKTLCD~IISCOH1.(.I.At
~
ftAVKDAL
A:Lf


. . . ,.~r.:r. : 'N : :.\.._:rl:Y
. ,.P.:: h ",: . . .. .._.F'::':
N ?:.'
d '
.. ..". .~ :.i"."'- \:r\f'.' ~ F
.::... . . : 'Ill
i:w :,:rr-.. f
~'1 '~


.. . .
. .
. ..f. .
'::Irl:.N:.: .
._.,,... ~./:-:I:;,... ,...;v.: .
t111 '~
:.'i~~::..:.:'re.'sJ,ii:H:.:.i::KaFI.IFV
P1.
"n:4hE:it'
NLDLROLVK:.'r:.~in


KQLYVLOMCALIAGAKYRGEFEERLKSVLKInIE.:Gt>:EH11FIDEVHTLYGaCATOCAMDLCLPOGTNSISPKLKS
.KT'I'CfYNLVIa'DENFHIKT3HHAFPPEI~NVLFLGSLSNtLLLS


AANIiJfPAWtGTLt~ICATTLNEYOKYIEKDAALERRFOPIFVTEPSLtDJIVFILRGLRYI14DINtttINAAFHtr
IWKLLP~'KK~KNLVITXDGCOf.TIILPYISPT'ItBtAAt~.PFS


EKYEIFNGYRITEGALNMVLLSYRYIPDRFLPDItAIDLIDGASLIRNOIGSLPLPIDERFYTPAI~tYYFLK0.'AAF
HtTCEEYYYPPHQAIJfYASSDIIAMSP00AEIHGPGPG101AI
GWD~.I EEYGLrWIEICNVKEPRIIld.YNJINP
LREELASIJti PDLXEAL


. .Q
KCAELAALIVKDFJIIKRflOSPSYOEEADAMOKSIDADARL~SFLLDPLKSSKNLLIFFKDI
E~DE


KpIgLEa~'ptf'gEgtaIERVADYNRVAELRYSLIPOLCECIKDDEASLNOAI:t'iRLLONSIJIIRPDRYIGYR'1
'Ifl'FKLNELISYLLRIFASEATg


RLIAOWAt~t'GIPVOKNLEGF~1EIG.LILEESL6CAWGOPFAV811VSDSiRAARVQ1'tD


PQRPt:CVFLFLGP'1GVGKTELAKALADLLFNKIEAMVRFDNSCYNGDfBIS%LIGSSPGY
015I 195274 191318
CPn


'rGYEEGGSLSEALRRAPYSVVLFDEIEKADKEYIHILLOVPDOCILTDG~.
..
CT119 hYDOthectcal Drotein


FIM'SNICSPELADYCSKK~SCL'l7tFJIILSWSPVLKRYLSP6l1l4RIDEILPPVPLTitELIK~~VS~'~AICAS
1NIPVIIVPGFPOIPEDLYOIXTtI7~CPA~ICLAtIfII~D


DLVKIVOIQMtRIAORLKARRINLSWDOSVILFLSEOGYDSAFGARPLttRLI00KVV~DtDtt..IGVI2Ii.PN1'P
TPtOGP'V1WL.F'NGFRGTI~C..'iLAYRKIGRAFAAVGIA2LRYDM


K~1LLKGDIAPD'ISLELTMAKEVLVFKKVETPS AG~DSEf'V,
AEEYPIErYLRDAQl'ILF1'VQEHPDIlIAYRL;sISGFSLGCHIAFR.iIKIYN


PRDIXIXALSVWAPIAOOCILLKELYR,tFSKHGECDIISIrCKI~GIGPPPIItIC8CD4I~.


CPn_0145 180717 182369
LIRIWfNTA~LPTKPYILHQOCIODTLVSRTpOTLFKNfAPL~tT!'ISYPM'OttI1t.11T


CT114 hypocnecical Dracein APDLOttILOClIVSNPOATL
tKTYDK1U15


NCAASFIWLNKSSNRNLRSPMFKSFIVRYMIYOGLVSFLLPIPtH.F.CAtil4V
WNLDPYKLESLCAYOYLSSKRIAFlFPQIOKD~PIPATM195130 197892


VISRDLIG.OEDCOKF CPtL0153
:LTLCKVDRGFSPEEISLIOt(LSYPGLSLASLRCS1'EIDPNTtHJ1R11LWSEFSCDI.A~1C4leu8-LaucYl
tRNA BYnthetsse


RADYYSNCLDILALRIHAERORYLDDSPCVPC1'SEFHKATItAINI'ILFYtFAYRYPSKKt~tYDPNLI1D00a0QF
iiKEHR5F0ItNEDEDINKYYVLDNFPYPSCAGLNVGMLIGY?ATD


EHFSDEFSFLSSVTDRKFGVCL.GVSSLYFSLSORLDLPLEAVTPPCNIYLRYOOGBVNIEIVARYKRAAGPSVLHPMO
WDSFOLPAtOYAIRTGTNPKVTTQ10JIANFKKOLBAI~PSYD


TTAOGRHLPTASYCDCL.DLE'LOVATPEEMIGLT!lINOaSFALOKKIfYKFAtGY~D~EOGREPATBDPGrYNWtOi
(LFLPLYOOCLAYMA~IA~PtLCfVt.BNEEVE~.FSImG


YIr'u'DEEWELL.GtVOIt.GGKKKLGASLIGXSPPASORCSVAYDYLIIGAINI?TLALLPSYYPVAtIOQ.RONIL
KITAYA~.LECL0AL0YfPESIVKQL0f0'MIGICS~ALVTPl0.1'~S


PCSNIfEEIASYEEELKKANKSgMPCCDGOARLaSVAFNLGATAFJ1YJILt.EItL~IFDIPNDLLEAIT'I'ALDrL
IGVBPLVIAPENPDa.DSIVSEDOROEV'1'AYVOEBLAXSERDRI8SVA1'K


SLtILRt.CAILCDRN1:YZ7fALKYFIIAERLNEDOCFLKImI~tSFJILtYEVXKII8KVJ1POK'PfyPZCNYJ11
01PIi~.LPVNISDYVVi~YCI'L"VVf4CVPAND~REPA00''SLPINEVI


ANTLLLllESR
IppCI,NGL9G0E11107YVINYLEI4lSIGRAk'tMYRLR~R.FSRORYi~IP


IPIIRPEDG?IOfPLmDE<.PLLPP1'tIDDYRPf7CPCOGPLAKA00WVNIY06ElCRi'DCRI:


0i46 182595 183095 TY'1~ONA0'A~~~K~~~'YIGCUNAYLILLYtRII~Dt
CPn


_ VPYD~GLYSTPEPFA7CLIN~
W.W15SYAIPGKGYVSIEpNAEENGIWISI'CGEIVt~tO
No robust horaolog present in Cenebank/EM84
as of 11/7/98


IIVGISILSSgEWPOTVtIGLGFCCL55KSVVPFKKBLSDAPRVVCSILVLTLGIGALVCGdK~~POVI'IEEIfGiID
ALRItYAMPSGPLD104K1w5N8GVGOCRRPt181tYDLV


IAI'KWCVPGVIIlIGGICAIVLGAIgLALSLFWLWGt.PSNCCGSRRVLPGEGLLADRT.LDTSBEVODIFDRDGLVL
AIaLVFAITtHIE1048LN1'IPSSFMEFLNDFSALWYBaIALSIt


GGF&RAAPSQCLPCDGSPRAS'i'PSCLEEWAEIOAV1'W1IDOMSDDt>DAAWPOIDESYLVAQIYtIVVO'VNCKLx
AL6V
AVRVLEPIAPHISEEt1114VILGNPPGI


_


CPtL0147 183213 183671
No robust hostoloQ Dresent >,n Genabank/EMBL0151 197174 199202
as of 11/7/98 CPtf


HCGPMAVOSIKFrIVTSAATgVCCIrtJCSRLAIPAFITZEPRATSIARSVIAAIIAWAISLI',.,
t 9seA-KDO ?raaslerase


GLGLVVLAGCCPiGMAAf:AI1?IZL~fALLAWAILITLRLtNIPKAEIPSPQB,REP~TSEPCPMNLRGVNItIFACT
YWVLVC1WIALPKLLYKlILVYGKYKKSU1VRPGLIOtPIN


SA1'PPLEGGSfAGEAGRGGGSPLTOLDLNSGAGSpGtIGPLVWftIGilgyt;CVRLLLPVLFJtFCEEF1~WRCLY1
'SCTELGYOVABWPIPI~'!V
SILPLaFSIIIABWAKLNPSLWF514CDt.'YtLNFIEFJIItRICAlTLYINGRI8ID88AltF


CPrt0118 183822 185702
APWtt~IYPSPVDGPLLODEYpKOAFLSLGIPF3iRT~tIICCYIfARpTALNL~l1'


pknl-S/T Protein Kinase
WAI7RLRLPTDSKL.VIIG5141R8DAGf0,ILPVWKLIKt7GVSVLWVPRINP.LTLtDVEiN01
tJItVSSltEBEIfDICAA1IGDYRILYRKGQSIFtSrnrr.acuRp'IAItAYLIRIil.PDtOS000TF~KtO~iiP
LOCE
GK1


! LNIPItCLWSRGJ1NFSYVWVVVDEICLLKQLYVAGDL.AF
SOP/fffJlF'NIriNVKLi11G11tfPGILSIENVSFSEGRCFLVTODCDIPILSL'1GYZ.1t8I?RKVPLITGPNI
T80SFZa0ALLL8GACLCLDEIEPIIIriYSPLLt~iQElfuJIYVGI~IOf!\tK


LTILEIVDIVSQiASLt.DYV11g~10EEWtd.DSVYIHILNGVPItVILPDIGFASLIXERAEIABPDRZIfRALItS
YIPLY10'1S


ILDGFISDEINRLSKII(FRVLLNTS~GAED1YA1'GAIlYYLi.FGPLpOGIFPNP81N


FS~IYOND!'LISSCLSCTftEERAKfiGFPLIRIUCTt&EEI4NVVTNCIEBtiLRCVPDPLE
VGEI7NSti00KESAENLEFVLYF~ICSIDEAl~1'AIESt888GVEP.80YSCPeL0155 199697 199488
7
98


SSONLPOAVLA /
RYVEAEKEEPKPOPILTEMVLISRGSVEGQADELPVNKVILNo robust tmmolop Dresent in
Cewbank/EMBL
V as of 11/
r


LALQSLLVREPV IfBDLtGYEDL
S
NSLSFCVPFLEKLKISLIPIEEMRNELFfIKTNNSSSNGFSNOEIOGIRTYI
NSP'FLDVHWZTIP.QFIAYLECCCSEOTHfYYNELIALRDSAIOARSGItLVIEPGYA1D1PW


CVTWYCASGYAEWIGKRLPTEAE~TEIAASGGYAALRYPCGEIEASAANFPTADII'~fMSYFLIAtINPN


YPPNPYGLYIA~1VY&ICOI7WYGYDFYEISAQEPESPOGPAOtittYRVL~~LKaD
CPtL0156 200147 199770


LRCAtpWRM4PGAVNSTYGPRCAtOIIN No robust homolop present in Genebank/EMBL
as of 11/7/9A


IG%QKLLARt~I~AP~TAPPP~PIAQOGVCIPSTICHLITIWYC


CPn_0119 185706 187700
FYIYRAATPQSIriIPDGCCFILLERLKELGAGFFYCDIJtESNTTGFTLFPGGSNKGVLIQIN


dnlJ-DNA Liqase L!'IADE
ERFIOtft?1SOJWYL71LGRLEDHDYSYYVLNRPRISDYEYDNXLRKLLEIERSNPEWRVL


WSPSTRLGDRPSGTFSVVSfIKEPIQ.SIANSYSKEELSEFFBRVFxSLGTSPAY1VELKID
CPfL0157 - 200753 200298


GIAVAIRYE~IVLVOALSRCNCKOCEDITSNIRTIRSLPLRLPEDAPEPIEYRGtVPPSYNo robust homolo0
Present in Genebank/EMBL
ICLL5P0EVAKRKLEISIYNLIAPGDNDgltYE as of 11/7/98
STFQIINEKOQQLEKTIFANPRNAACGTL L
atOfE


. .
NWRCLlS4GFPV~KPRLCSTPEEVISVLKTIETERASLPNEIDG)1VIKVDSLASDRVLGFSFYI'YKEAL'liIY0F5
PGJ1SPNWQAStatAQLNSYFCLGGETVTRIISLAPSGLI
IIA


ATCKNYRWALAYKYAPEEAETLLEDILVOVGR1'GVt.TPVI1KLTPVLLSGSLVSRASLYNEKAWSTAEKILKILSFI
LFPLVLIALJ1IRYLLYNKFNKDLDR11VFFIPTEITKACEL
'


DEINAKDIRIGD'1YCVAKGGEYIP!(WItVCRCARPEG5E1IWNMPEPCPVGHSNVVRftDRtP
KttPIILYKEAAL'IYSPLFYSLP1G(YOLI9CVf


VSVRCVNPECVACAIEKIRFFVGRGALNIDHhGVttVITKLFEIGLVNl'CADLFOL'n'EDL
0158 201163 200894
CPn


HQIPGIRERSARNLLESIEOAKNVDLDRFLVALGIPLIGIGVATVL7IGIfFETLDRVISAT_
No robust tlomolo0 Dresent in Genebank/EMBL
as of 11/7/98


FEELISLEGICEKVAHAIAEYFSDSTHLNEIAKMODLGVCISPYNKSGSTCFGAAtVITGPPNLTLSINLDLLLEDLOT
DSLPWPKL'1LSEDFDFAYYPfSKAIID'IYAKLtIaHPGCCP


TrDCMSRLDAETAIRNCOGKVC55VSKOTDYWMGNNPCSKLffKARKIGVSILDOEAITNCLtSKKItJIRYLLEOLFK
LETGtl'IFPTSTIDGCRESFLIEFSNE1'KKPTIMAFIYFYYYN


LIHLE
SNGPKLEKDPKOAGCEVHNRLLM.GLKfRPOAGAONDGRNCGPYGPICFLIVWEENYGSV


~Pn_0150 187759 192141 LKONGFLKON


~f117 hypothetical protein
CIYYKFFYSYNCPYFISFFVLLGYNMASSSNNSTKODGIPSWVNPMIOWNRASOVGDOEACPn,-0159 01811
201467
f 11/7/98


MSLTPEAp!fSR.S'WFSDRKHFLEhWgLEEMENNDLKKYSRYKTIILIATLVTVAIi'CIVNo robust
hanoloq present )n Cunehsnk/EMBL
as o
CCP!OCE1'ATRIF'aMPSGFSLATEK/OVSTAEKVIKILALIFFPIILIAIJ1IRYFfJOtK


PISNVFGIPMWVPCLILFi.JIGLSSAFLSHRWSKCKEIHLRYRAYOtYROOLLgOYPDLRFDRIOCFVLPCD'fPKEL
ELIW1NPOL'JENAAREVHPGFFALPTKYOSMYIO'tSKG


K3TLYKYSITIiVKPKKCFVGKLVENLRPDLNANKD00GAAADSRLDFAGYCVKHYOlDAL


L .V~.lfttSVIYpRLASLIMSVKNOttlIDNCSREPIDFAORSALWSC~DtGGEIOP~I
L CPn 0160 203794 20.127
D


DLSRDILAICCYCMNtlGVE7U(KAiDOYKKWYLNSSTFIAWNPOLPAIAOSYLLE00ANLpfkA-Fructose-ti-
P Phosphotransiarase
~ALTTAHG1GOALEDLDSLLCYYDOLIESKCVGEKILASItIpKHLDt.AMOD'
'i
KIF
DL


u RIPE
IA
TV6LLSWKSYPEI(NtLRYRPEILTLLETIRSKNtOE't'SSPPSPPPEWKNIPNit
O
::cTiOENLKKWSNLYNVFSITiKEFTECKLEONEWSRIORLRGALEKSKCSILCNCItTNA


ElITK3EKKLADYLWIGDREPFLTGMHKAIATCKAIQGKVECSIISONPEIfOIMILPCSIVSLYTEOETSSKPLKICV
LL.iCCOAPr7.HMIVICL.FDALRVFNPKTRLfCFIKGPG~.TR
DL'VIYDYYMACItFDMLSSGREKIKTEEDKKNTtIJtVKOLKLOCLLIIOf~&ft
.L'fKtR


Ef!LEL7ALRRE04Jf:AIWKNEDEVL.ALK"TMFaQWf;FItDLVCTiItGKYOEFKKNKi.SINL.
'
TDTN1LAEYFIr\HNCKTSVICVPKTICI:OLKNCWIETSIf;FN7".iCRTYgeIICNL1KI1AL


tILFQVTPECLBLL (rTLPNIALt.~nELIATRKISLKOLSODLAI1CLVRRY
tfDFTK::Y;Nt.LNRLEVLHAErrT'DDLVLtIVDRMSEDLKKTIEEIIII.iAKKIHIIFtRLNCOQA::YTTLEf:
r:t


ANt:lQcat'TIELL'LIVOEDNRLOEAt.~.~.f'"..VSQGLMLLIt.~.LLt7RDEKtNKNtEiSRKI4LVA.
.
KrCIQf!~fVLLPEGLIEHtFD?RKLILELN\'LI.HHiCD...~.IEK1L:K4iPETLKTFNLFPK


AY')AH::f1\ItNtL.:(r:L\PLIORNR/13L(Ntiti~faflLFtK7SIRNIHALDTETLVATSSNMDIANOLLIA
RD.~.IKxIVItV::KIATEEL.IJWMI'KKEfEKIKMIMEFIC:V::IIFFf:'IFJWAGFP
?tHIiLHHO
DV
~
'


.
:iNFri'.N'l(aaLCIt::\I.FLVRGK'Ir:IMfTINN(JW::YTEW01~4\TPLYKIIIHLPIIRv't".CE
.
fNt.t.DVLfIOSKPAPAfMENPLH.P:ALPf6VODAVAE
t'::AMIITFD4INlY
:LLILSC17C
'
\LFI
X'tIIL
r
II
'
.


.
'PfNtYTD.'1't'PK~PAVQ)If.t.OO::D.':rt:/I:~LVNFf'C:f'IlYIFt:KERLtUONPLTL(J!IDpT
..
L
v
AUf.KIMt::QWK:iINKY
iIAKAIV<L:fVA
ILFrI
VI.YNfa
LIfATY:LPwEFNNKDLNRWY.WDNLNLE
'I'LIFURI::K::KEFEYOVLETAO
~
f
6'~


. IIChPr~Ai.'f::hY:KR::I.
.
.
LL::
0
.a':f~IWAHNIV::ULE~:ff'TKf:K:a.iCDL'fKEFRRD.~.Yf1I11KRIKRRFKMCf.I:OFJIPWRPT


lI JI/tH~~\tYPAt:LIIRLI.I:IIWKOKEEI::IRCOAL'/'rEPMCLt:LEK;:KYDNP.KNIAAAMT
ILIASN iO Ut'm
'
t)161
,:fK


KK'Ir:KL~M)IUHI.t'KNNLTYVRIOHFFRTLIQEKLGt.tI%rVpEtIriIVKEAKELfIELAAIIYG.
.
S


Nf:X:N::~.K(tIIAKKOt'Kf7firUlIAC:KfiQLEL.LG\'iL:flCA~p:IA:NfK.~MOAwPRERLLLNPIfn
f!rliCtwt .ncylCC.uc;tnee.r:m
Lun, lyl
IIR:::a':RKQENLLI'n:WLt:KFIt7JN'fMt::I:1'I.IlJNFTTFt:ldJtfl'LIIYNPfYtIVILtJIG


It:AKIPa;\t:HTlr\::RKt?1Wt't'LCI':YLTPFVRFs::fF::1'O:Y:YMJII.INREOt.FDIEORLt.IL
:IAtTt:SKR::HVRLIWELTRII:IMLNVIH.It:IK:Dt.D':F:lliUt'::LtNYK~tIINEtIEYT
CLV
:KV7TLMRDIJ1AVF
rP
'
'
:
'


.
Il::ldlItOL)ERLAIFt::::aLX:I'1.AllJP:.LFFNKtKAir\VWAITf:aa:lWlAKMrYNAPEYI
:
~/NKIIE~.LIV:,9
Yv:LI:A
ILY:~
:
,rlVrt:IV::IItINMV!)AALAA
tTY:VLt'1'RLN(c:l~'I).ItRVtI::VLIISIILRGCD.~.:;tar:IIDWKKLFEt.LNNNI:IWPNOPEC




CA 02350775 2001-05-11
WO 00/27994 PCT/US99I26923
'I'N::OK~ALTYACNTIIJPDFYTpFLIIIDIYKELHFLTDSNSPELLSEVKFaLK. '" '
NLPPtLYNOGE00LLVSINHRTL


FTFJ1FANCDKP ITILTYPOV1H1AFPFAE.iSALSDL'f'QNLKRELTSCE
CPtf~017!i .. w.,.l.ir927r r :~~~Sld
,


cr153 Iffooensr:i~ar p~onwoa "..~
..


:~ ~rf~: 2nse7o 2DIeD3
NDDOPI~SDDEFJ1SKDSAfSASFSyEfYKSSTRGKt~fl1''"11TASRTLYILRpOCdYDP


,b rrfousc nowoloq present in
CenWank/ENHLRALKVDDEPIIYfiVEKRLDAKNPOSLNAFHKEVG111YVAs'VrYGCTCFpVLRIISYL4VCEL
as of 11/7/98


tI/YTLYN:OSPFRtNKLYSIS50VCl'PWIFOLNSKVDSYLFIGCNRIIfWSIVhpEPNLIEKEKISISVAAASSLLK
SKT~IATEK~SSYQSESSAGIVFt.O~'VL.POLOOIHtLDFKDN


tCKVfIJVRI3TIVKILKTLSPLIFPLLLIALALInFLHAKYANNLLVSKIt.ER11P0YVPLiLPNEPIPLAIr~SIT
CI:IIPELFPSEDJI0VGi0KKSALAxVILNYLLSNKPKE~SP
SE


irR:>r.L.h'A.SHIKLTTLVPV.';01fM4AlICSNPLEVFJULR'I'I'KPSFINPAKYROITISSH.
:...t.,..... ;..Y:1~~ '-''T"r'.!~Yf~"'~~.,......,ir..w....kn~
FYLRF
. _
.


'=:.y~mr'; t.vr:'.r:W '.".~'/ .,.;~~y,.
~ .
.. " ':.:,lm!~-:1:1.:~.'uta~ . ',-.-;r.,... " ..: r rcn:_w'. :.
' ~ :~I 'Ai ..;..;y~KDpc;,~:
' ;n::nrr.....
ri~


, :
:. i
tm.Fr,. Lk:Nll~ll'1CDLF lR.ir~LEIR;.iUlNi:i~:iCV
:~;";;.~:..; ":a' Lii:'..dl.::I i i w:.: ~ : i r':;iU.:v
:~:~.n!~'.alr F I IT'J
':t;,. .: . . .... L,yl.: 'rrr:x:rl.~.':.:W


PLDEDRC'uCFEILEOLOELCVRFPICPSOCPDNPNFOCFOCIRtYWEDSYDPNKPV


0177 317517 216608
CPn


CPtf_016) 205931 206191 _
No robust hateoloq present in Genebenk/fllBL
as of 11/7/98


No robust tfomoloq present in
Genebank/EMHLDKRIaTI'KSIIFIFLISCESIOfOPNSLIFSSVCLt~GLCSLbSd~IOKP>WMIiHrI'STSEEF
F
as oC 11/7/98


?EI(AIVYCIKCKOIIKC'SIItITP'1'PATPILTE('aEIFPGPVDSAIQ~fDLERLLTyIDfRPDNOLPMIPSAFR
TTQIFSEEfHiDPYWAKTDEESRIfINR6IN1~1LICIIfGSYIPI


IIRIYLR~IGOSLV'I'IYPKDGORLRSPEDLRVGDDLVOSYPNHLNAIELDCWIP~LIGASTIfGSLJ41PKSAALTL
KTYRPNPIWINCYERSFNIDTCKYLKEGSRRRT$NDGP10111RVL


TYIITFADFSTYILSLRSYOANSPSD011~lGIWPGSIDDPVOAVISFLKt#IGFALPSTLWM.IKSSGRRGHAICL~f
I'EEDFYIJUIRRCGVYSLYWlVCSYPQI?IPFVIAYAIiIA0is11


DPLt.CrNlt
CSKLVLPVKCYYSLVi~f~'iVSSSDSLirAFCDSF71~YGRSTFLANCl'SILCVIItSYKRVPP


0161 206141 206998 OP . .
CPn


_
No robust tawoloq present in Genebsnk/EHBLCP(L0178 218052 217789
as oC 11/7/98
V


I .
LCFKCIY:KIIFSFLKDLNTRSTIESSDSLCSRSFSOKLSVpTt.IOiICESRiJOCITSLenc in
Genebsnk/E1'oiL as of 11/7/91
LTLIVOCALIALAGGOVLSFPLGLII~GSVLVLFSSIYLVSCCKFFlLKaIIXCCSVICS>amoio0 Dres
frobusx !
No
KICLG
'
'


AF _
DLFGEEBCRNOCNRSARNOLFJIILHETDGIILKRYtsOGAK_
_
!
ECQLFIIIIVGKTEPCNC
ESIMICI
~


'KLNIWFEKpPNICDIEKALENP ~~ G~


NYF1IL


CPtL0165 206983 207582 CPr~0179 218550 218056
No robust homoloq present in Genebenk/EHHL
as of 1117/98


No robust honaloq presaft in
Gerfebank/EI~LPKLWDI'NFETRIGTSVPKFNRRLPKSFHKSGRSSRPSKAL1IANFPN1'TipJIGRSCIIPG
as of 11/7/99


NVLLFNNhfVPKTIDifiIDPESEIDIRKWSCY>Q.IKECQPLFRSLISFLLCVIRCOLRI1.KKIfAILLaiVNDAKT
PNYSCItLSIGFPNEpDLEAQtBJpQAALVRKILICWPNNfLKGLIJ1K


RSKYOmARTVSDEDAPLFCLTRSYYQDGYLTPUWGPRDLINNYIRLRRRENPlOIFFSPLKKDRlQ2LSSLIFiKLSYA
LDLSAPISILEGKPMSYEE1ILD
I


KNPCYYARLAFNESVCYYRZ<.FDIERLTKMYVECDYSKEOEKNt4AILSlyK'1'Lt>DGImFLP
IS


LIEHKDTDLIGACFlDVFCT
CPIL,0180 218963 218355


~ No Irobust haeolo0 Dresetft in GMebank/ENBL
as of 11/7/91


CPn_0166 207591 207962
TSLIHIILOCKYRPYFION'1~ASETYPSOILIU10REVRDiIYFNOADCNPARANOtLGIDtI
No robust tfamotoq present in Genebsnk/ElmI.'IWINL
as of 11/7/98


NCLROYM(SDSD1SESINRSIHLEASTPF!'IKLllnCESRLVItI?SLVISLtaLVGAGVTCLLDVYf~NYS~T~DI'
~R'RFTFVSSKNDIENNGLS?IPLONVLViAMVRR1
'
LAAdCIRNIEWRWCLDLRSOILIS1U.FZKOPOFOSLTEDFVNNS'1'IIOEGRVIpNtNL


D S
LWLF1/AGILPLLPVLILEIILITVLVLLFCLVLEPYLIFxPSKIKELPKVDELSVVETR
'


STL S
OEKK
LISLIIZCItCiAVLESE


CPn_0167 208309 207977 CPn_o181 219175 218777
No robust tfomoloq present in Genebenk/O~L
as of 11/7/98


.lo robust homoloq present in
Cenebenk/EHBLFYIHnSLNSHNLIXPSSLFJIAVpALDSYIYWOGDITDVL71A
as oC 11/7/98 ELFKIOCVYIfFFIDIFNKL
V


NLwSHFPRGFFNLPFCPTILWCPFIaISENYGLEAL71J1TVD5YF1%.GOSOIYFL.
SKODDD H
DDISREIYCVPRLYIRFWIVSISOSLSRIPWRLKRILLRYCfLRGKYVNPILIKRIJ1ILL


ITVELSaI(1%tKFKP~GSIIiCI'LYTEDPILPAIC'tSFSNCSDIOHRTPISPIHCLIRFSRLRNSNY


CPtfr0168 208715 IOAI17
No robust ffomoloQ preserve in CPn_01s2 220701 219331
Genebank/0~L as oC 11/7/98


SyINLRRREZIpENFlNpGIIpCYYARLtUTIE,gVRIYR1G.PM'AQJIONYGAGDYEOt~aedHlotin
Carboocylase


LKSILSFVQILDEKDGF11DFLATlIKDr1'FIGROG71DITCSRCZIHDNLIAMtDtIAVRIIMGfIDLCL3TVAVYS
L1DOGLNVLL71DFJ1ICICHPQMR


sYLKISNU.A~ICEnGAa~fYHPCYCFLSENtwFASrcESC~.TFIaPSSSSr~IL~cIa


0169 209537 208710
ANSLi110CI1ICPVIFOS1~iIIEDGS~IAE1IIG!'PIVIKAVJ1000CROIRItI~FY
CPn '
'


_ IIGNYVIaGLIDCTIGRIUIpNL
No robuse homoloq present in Genebenk/~i.ItZfPRNLEICVI0D1
as of 11/7/98 RAFSA7IRAl:AGGF1~811~NV1fIEK!


SFNIEFTICENNIBe~NCSECSOPLVIdEtM'OPLRNLCESRLVIfII'SFYI~VGGLTLIEETPSPIL.NiI6IRVKV
OLVA


TJ1L8G71GILSfLPWLVL.GIVLVVLCAL.FLLFSYIU'CPINaGVI/Yl~ti'DSDIHQNFDRpRNZTELtrI'CID
LV1CC0INVAlGEDfLPWKa00IEPSGNIIOCRIN11EDPTpiFBPHIORLDFII
LPPAGPSIRVOGACYSCYAIPPYYDS!lIAIfVIAI0G10DlEGIAIisWALIItlNZ~1108'1'


K IPFHOFHLDNPKFLFSIiYDINYIDNt.L7IQCNSPFhEP
TNDOVDPVSEDSIRTVISCYIQ.IKACKPEFRSLISELLRAIIQSGIGLLSRCSRYQEMKT


VStIIUSIPLFCPTIISYYRDGYLTPLRAGPRYIINRAI
CPef.0183 231207 220695


0170 311098 210025 aces-Hiocin Grboxyl Grrier Protein
CPn


_
RRtL~LI00IEKL11IANORHDHfRPAIKAL~.GdLERDTAECSiRpEPVIYDSRLFBGFS
No robust hanloloq Dresent in Genebenk/EMHL
as of 11/7/98


NVRIQetURGE!(YNTCTVIAPVLSMSYInLFKNLLKEDSVHKICNEIFALWRtlTrIACTOERPIPTDPKKD?IKEIT
rENSE't'STCTSSCDFISSPLVIfTFYGSPAPOSPSFVKPODIV


E71IIKNLPKADIHVHLPCTI'rPOLiIWII~GV1044PLKWSYNS~IrNtIRLLSPKNPNKOYSNISED'fIVCIVEJ
U~IKVIHtlYK7l~lSCRVLEVLITNGDPVOFCSKLFRIAI~J1S


FRNFimICKLfOPDLSVIQYttIIIQYDFNSPD1IVNATVOGHRPPPOCIDNEF~LLLIFNNY
221221


LOpCLDprIYYTEVODNIRLANVLYPSLPEKHARl9cFY0ILYRASQTFSIDiGITLRFIlVCCPIL_0184
221811


FNKTFAPOINIDEPAOGtrQWt.OEVDSTFpGLE11GI0SACSESAP011CPKRL71SGYRNkYe!pElonQacion
Fattor P


DSGFGCEANAGEGIETRTIFSSAKVNPEGLIEITRVTFSSLKRKOPSSLPIRV'ICpLGOWKIKFt7CCEE1IINVLSS
OLSVCI~IFISrKDCLYKVTSVSKVJ1GPKGLRFIIIVAt.pAADSD


WIDWFKATOEVKGOFCfRTLEYLYLEDESYLfLDiGNYEKLFIPOEIMKI8JJf4FLIU


0171 212111 211119
GY1YSANVYDNWFSVELPHFLELNVSKTDFPCDSLSLSOCVILIU1LLCCQILVNVPPPVE
CPff


_ ICDVIKIOfRTCEYIORV'
~OuaA-ONP Synchase


IIKLOSJIRtIHLNTIFILDFCSOYZYVLAKOVRKLFVYCEVLP4MISVCCIJCERAPLGIIL


SCCPNSVYENKAPHLDPEIYKLOIPILJ1ICYGNOLtIARDFGG'IYSPG11GEFCYTPIHLY?CPfi_0185
222157 221765


CELFKHIYDCESLD?EIRNSHRI~fVTTIPEDFNVIASTSQCSISGIEIPrICORLYGLOFHPcpe/araD-
Ribulose-P Epialerase


EVSDSTPl'QJK:L6~.'FVOEICSAPTLWNPLYIODDLVSKIOD'IYIEVFDIYAOSLDVONI.AEVKKQESVWGPSI
NGADLTCLNEAKKLEOAGSDFIHIDIMOOtfFI/PNLTP!CPCIIM


AOLTIYSDVIESSRSGHASEVIKSHHNVGCLPKNLKLKLVEPLRYLPKDEVRILOFa(.CLINRSTDLFLEYNJWIYNP
FEFILSFVRSGADRIIVHFGSEDIKELLSIfIRICGGVpAGLA


SSYLLDRHPFPGPGLTIRVICEILPEYL71ILRMDLIFIEELRKAKLYDKISOAFALfLPFSPDTSTEFLPSPLPFCWV
VLNSVYPCIIGDSFLPN2'IEKIAFARHJIIICriGLKDBCLI


tKSVSVKODCRSYfuY'EIJ1LRAVESTDFNTGRWAYLPCDVLSSCSSRIINEIPEVSRWYDEVOGOIDDOSAPLCRDa
GADILVTASYLFEADSIJWEDKILLLRCHrYWIC


ISDNPPATIEtrIE
CPn_0186 :_'3878 221069


0172 213237 312110 'sinilartcY co Cps IncA
CFn


_
PIKDKILItSSPVNNI'PSAPNIPIPAP'ITPGIPT1'KPRSSFIEKVIIVAKYILFAIM'1'SO
-fmpD-Inosine 5'-monophosphese
detwdroqenase ICOOH-terminal


rsqton oniyl
ALCrILOLSCALTPOICIALLVIFFVSNVLIGLILKDSLSOGEERRLRECVSRPT80pR


APIGAAICIOPLCISRAHHLVGGANVLVIDTAHANSKGVFt3'lIILELKSOFPOLSLWCNLTVITTTLETEVKDLKAA
KDOLTLEIGFRNENCMLKTTAEIILEEpVSKLSFQLGLiRI


LVTAEAJ1VSL1EICVDAVKVOICPGSICI'1'RIVSGVCYPOITAITNVAKALKNSAVTVTANOLIpANAG0A0EISS
ELKKLt~1DSKWEDINTSIpAiJfVL(GOEiAPQG017IVID1NQ


D,RIRYSCDWKALAAG10CVHLCSLLIGTDGPCDIVSIDEKLFKRYRCMDSLCIWKOCEOIOALOAEIIGIWNDSTAWK
SVFNLLVODQI1LTRWriELLE.iC'DLLS011Cb'ALRQEIE


.~.ADRYFVtOGOKKLVPOCVEGLVAYKCSVHDVLYOILCCIRSGHaYVCAAETLXDLKTNASKLAOHETSLOORIDAN
LAOEONLAEQVTALEKNKOEJIpKAESEFTACVRDRTlORRETPP


F'IRITESGPAESHIHNTYKVOPTItIY P'r'l'PWOCDE~EED~CI'PPVSQPSSPVDRATCDCO


017: 211041 211715 CPn_0197 331218 ..5015
~.Pn


_ Dradiccad methylasa
nn rotf.:.~.c tfoaaloq pcesenc
fn Genebenk/EMBL as of 11/7/98


TIFDLIYKIDSYKHQQCFMDFSVFPDRFVESTSPSFIEDIDAI(rLVSNCCNYCSRCLFLFVPLTYTRTLPMNSKFI~~
sRRKKN..iHKEET.~.WDr.'LAS.~>yHKfIIODKrHYYIIRETILPOLLP


I::LL SI I
Ir_F::Vlr;l'SCETASLVFCIL.~rLIVLVLLLSLTLOSKSSVLDICCCOt:FLERALPKECRYLI:IDL::.~.FL
IALAY.Y.NIL:VNSIIDPIfVADLS
IECRNRECCRRIS


KRLEFVEPTLFSHAVATL.iGNMEFPt:F.A
1 RNTATLLEPiMFF I VLMIPt'.PR I
PRASSN


:Nn Ot'lA 314215 3L1721
IIYDEtIKIUISRHILIItYL.~.FHIIIPIHAIIt't:OND.':P::TL.:FIIFPL::IWFKELI:.~.IK'.Fi.V
DDL


th. rMoar lu>tnolrul present in
EF7dl'::.~.ifT:.'r.KRAKAfN4'RKEFPLFIIII:;rtKtK
~enab.mk/ENOL .i:: or 11/7/99
A'lT I PAOCRRS W
~
'X'mT::L
~
Y I F INF'/RK I V I Lahl Ihrrr
L
NSP::PALNPEL::LI FFM'L'J


.. ~.'.~..fuo
. W'fsOlu>t ~
.
.
.
'I'Lfi(ILLIF'IIILl:W1'II:.1'FTVIFFLNC:LNLL:."CC::IIG.~.::1'.LIIVGLLFLINCLYFH


::::LO(~:L'Jt:LL~Y.EL::OAEEREEEYIOEIEALR(7AFRAE:a'Tf:::P~IwL~'I'if2
tWP~cIa'.ru:.W pmcr:tn


A'rIlrX:INFRKLFPP::KKK'I'a(JY.ORLkNM:LV~I11
tV::IYVLIJItINA::KFAI:VL::YY1ILI.


'Frsnl'/: ':11>I'n. .'.Li275
'15'VIIGVFFLRL::~~IILFTNLNWYtWLtIKF'1IiIY.Kf'I'/AIVFJ1A'IIIATr::~IIr:LVl.l2'w"F



th. r.ds~.-.r tuNnfIW prssnnr m
WtY.SrN:ILMLI.::I.FJx:LNKIFPT'..'Wrf'I::LYILV::YI'/iYLV::I'MfYtIVtY::bIIYITO
.;v.nrtamk/r?IfsL .n:: nl IL/')/ny


LLLACFYiFt.I.HItPIMEUI'Nr:VLOD~'1'fVLYALN:;FL(RI::WCIfRLGICp::PLEAFNAtA:E!HI'Ir
~'IAKLF::L;.11:1f1'AL1FI::RFVI~YLLI:It.AI.Fir:YA1L.F!YAIf,IK'r::AI.I::I'LII<:


t'rFl:Lr/ITC:FI'LEbVAfI'ILPr:YfIPKFYLSFIDRDUf:I/HYEVLDt.VFLK1YAACLIN:i::VWI'/FW
tAFF::I,~~11::IFM'::FTIt:At.VALI".:FLLt.I.lt'l'rNI'II.FYk:Ai.TFftI~NRIIf.'I'


81


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
AQFIAK03I(VPIGEVSOC:.DVL E:tFtYYL."YD1IL'fODF:FJ. .'..~.AVDLK'tTYF:
t


:L .,. ~ ' .
FLFtGDKILf.~.CYLOLIT.:"."!ILALTTPOi7~tECFn ~)t7N ".L.h,1145. ; .~.9t1,'!1n
PVPNPSELTtKDIADKLLHREIfKKfNPOLGTTFtENSFONTfNOA


EKHCfLFPYNFK:YO opM-Q.i t'td~t "0 t~ldmt>s: ~'rf,e
IWICfNLTL::EIARRIK ) n (,~~ ,;~
IEYYINKMIRUtPt'LK:L.PNLLPLLLTL3.:CSKGKCEFLGK~~fI1111SHDI~I.~RN


225371 227925 Q
AY4ER~4~'Y~'~~f~~~~P~~rAY


~.Pn OlN7
DFEKSIKQLYfEEFSPSIH'.'~'VIKNSSAIHNAGK.~nLE:wICAtfDCL:v.VITLGOPfP
ols Transnlaalbrane Prxelnl
P


ossl
YFLTLIARPVf3PVHHTLPE..'YKKCfPPSTYTSNGPFVLKKHEItQHY:.::.CKNPHYYOHE
~:131 aalwtrxt-t .
NpIB(RRs'WLKIiGL~.LI:S.'aL'JIGfLIFL?QLISZ'ESRKnVFSLIHKE3uLiCa'eIEELK-.,
EPAS "
' '!
;
~e
.
--rFT'
,F~~r-
:
r
-
c
~
~


A ~
.GIdSLKIN .
EAKDEVF:>AEKFELDG~LLRLLIYKKPKGITL..
. ,
~.- y
....",.. ;-
M'ARKIKLTr _
.r .a
~lF~ .;
t .A
.
'j
_
~,t
w.
...,,...t.r....,~-._.,
-


, . :'. t' 'v. .
,
. ..1:
, t,lr wYf
. .
. .... .. ..
...~H......n~~r.~ ' ~ v:"_:1,:::..::.. ::1:'-:.~
n . ~..111.:~i : :'~!
'I '~\F_ tL::
,. 1 '
' ..t. r.
'f'!'dfstl:~
\
:"
w
'
~


. ...
.\Ft li
~ ilnFr
. htlfiirUv\
YHCfLI(KRROGDFFIAT~)IIAEYVSPVAfLSILCNPRDLTQWRNSDYEKTLEKI'YLpHA
r
! : I.
;'.:17: ":
T.FYIVECsSSFIELKPELASALCNGItPLS'fP
IONKLOGHIHK
N
'
'
'


.
'IKH~.KRAFMIIEEETPIIFL'fHGKYIYAIHPKI0N1'FCSILSJITDCICiIDILS
ASGDG
O
NOCKT:rtfi
1
ITSKOIHAtYSYAKIPLDITI6dKItIEIT9QAGLpEyAINPKDPM.At.OLI~E
VI?N


DIAYSSSIVIFCASPSHG1JGL.ISIONKKtILTKFRLJ?OIIOLp~I'RAIFPOPF
KF~
K~E


a CPn_0199 211019 :11983
PLOVAYYSLNIF~1"IKNAHLtJIMILONPLS.KISCSM.SG1~N~'FKfNl-011~pe0ctde Psnnease
TD1?JLFFPKFSGKITJIRENtLLIEIAKIGSp~pIKP6ITSI


:IaptIPSYAFJJtF~KJI0I opp
LItIGOFCSLPLSLVSNHLApFHLKKLTfSFIIfDGGKFVTKGNL4ALIENPDYPOLMJfRIKCLICLSLVF&YILO1R
ILF:ILi.SLIaIVLTLTFLVIOITIPGDPFNC~NLSEEVLOTLK
fV IG
it'I


LDCSSTSPSSKDLKIOGSGEIFSLPLDSITKTYaItOVRLSpYfGSSGDIiJ.
SRYGGDKpLYGCYT'QYLHSIAKLDfCNSLVYKDRKV7NLST.11PISAIiwiCSL
'
'


IPDGLLS !It
KLTLLSNFKSFaLIGEL%LVlIaFSNIfLSSOICD'tIAWLVSPERYASFFIOiAGGIAI~'IAALE~RRYILG7LSIt
.OISIPAFIFA?LL7YVFAVKIPLLPIAL1GP


VNYNPKDON
TILPTLAIJ1YPPN~IIOLTIfSSVSAAt3IKDYVLLAYAK~.SpLKWIKHILPYAIfIII
'CSPItLLHRTANVALDISKISCPEETKGLSCLTLLA71GGLEGSLGTpLIFYDINSKET1
r
l'


a
SY~'T~V~AIFlIIFCIPGIGKWFICSIKQRDYPVAIhiLSVFYGTLfNt~SIZ.
llOC SIIDFQIRYAtiGKF.%%R!I
FIINDfKCSLRAMiLDAKIEYDL.KCSCLApAGDSKT4AE~~SPESRtI
LKAQISSLAGPRINVSItOQAFRTGEGPVDT~S~
PMSPr'
'
'


.n OL
aI Q
uu
ANYIIHIPSSFIA
Rf?1LTAHLSIfLEDVHKAFL.OEFNpt.L~YSGYPVTLEIIrifONFYLp
AEKSILi


.
IPLIill 0200 241996 212968
tRPYSFEEFRIQSATLOte;KISIAtnGTMYALfOfLDITOQKOfVE~TpIfF5V01~SCPn


IICKRLDALIDRRIRIJILIfGKTDTAHDRLF!!t'LGIDPLVIKKYFIiTSLKTIWffLIKIR..
fSStlrDSTPPPTYHPFPWDCSNFD oDPC-Olipopepcide Pe>:~ase
LADKt LTT IILI4If'.J1LLLPWFYQ
VL:
IKSIIpN104
RSI'


. .
CSISSPEVDWSSAYAAIALL4SYSLGHPFSS .
r
C~OO~I?ISrILSSAPS
ILVSPCSRFPFCTDTLGRCIIFARTLAGLRLStd'IATIATLIDIILIIt'.LWATYAISOGKKI


SIEHK
DPLJ!!R'ITEILFSLPRIPIfILLLVIftdB~.f.PLIfJg!'1'I'KAIIPISRIIYCQFLLLIDiK


0190 229901 231271
PFVGfALIAI~(A.4TFNILKZIIS1'LIFtIPNAIYTCAIISFIGLGIGPPOAS
CPn '


y .S1IG
170 :obusc homolog present in
Genebank/tf~LLG'ILVK~INJ1IDYYtsILFFFPSLII4IAf.SISfNLIGEG1KTLCLE~
as of 11/7/99
STSTKKF)1VSKAIQKIIKINCITDPSIlIVETPNAEIGSILQEIKEI


Lf.GIKLNRK1WSFO
KOKLSKQAEDLGLLLILYCSQETLSM.fldINASLKLSIGSVZEt~SLKOLVEESIEtShGCPeI-'0101 212110
713715
IVIOCLLIKfiNPEKSEAASgGIIVOTLL t ATPase


pQDpLIGSVLIEISDXFLSSIGEILSLNLOIf' oppD-0119opeDCide Transpor
~SVA~tERCHIt7M~CYRVL~iGEpI~EOiIV
SKADLttDNYLLNIKDL?ITSTNPKRTLI~1LSLGLIt!lIAI~LVG~
e


LG ASISSAPGLIDIW
~yS~IAS,,
Ci~II7fAILGFLPF3JCLIK1GSILFEDIDITIG.SPKELIKIRGI~CIATIIL
RYVSSGLTIDKVEDKPITKFIRaGKLLYSGGTSt?~ESMP4GL4TSGI9fPlWK


SKt7YLE
TPSCItIGNOIIE?LROHHKtI~tKEEAYWOl110LLTDVCIPNPKYSPJpYPFS.SritilIlORV
SASKSNDGSFPFSALRHKFTFSt7TDCPGITS'1'1'LSGNOAGfY191SLSLKVLVPSIP9IEK
'


NDT1XI
VIAIAL71SOPKLILIIDEPITALOSNSOAQVLRILRNI00QICW1TILLV114a.9LVKtt~l1
PEVOLSLVYSYEONLPIDNIFIOfSOPRTIPL71LI~Tt4.xDKYDILEL.AAHC'1


SPNCSRFSLOIxOTNOfENS>MIfYtVNAAHSf
OICIIKDG1G.IE'1'CI'VEFIfLSPKHPYTLXLIN7NSKIPIAtCtSSPILR~OtiiJI~CG


CPtL0191 231079 271314


gln0-AIiC Jlmt,no Aeid Transporter
ATPase CP1L0202 213692 211500
CYDKREGVMI'IRVRNLJ1YSVNlGDIILDGVTFSLERQIITLF9GICSGSGK11IIP-OlipopaPCide
?ramporc ATPase
OHHFPVFL o


. pp
LRALJIGLVDP1GGDTt~tIEGF~IpALVFGQPHLFSfNiIVLGNGTHpDIHIKGRSTEF~AtKAVPT9NEYAAWAlfI
'LLSIK~.SLTIRCKKILNHINLNLIKGSYLTIVGP>Kt~%Si.iLLT


FEf
yHLr~IEE51AKNYPDOLSCCDKORVAIVRSLGIDKIITLLFDEPL'SJ1LDPFATASFAH%FTisCi'ITfI~PKIPR
JWNGVIWGDIDSSLNPC?1SIKCZISEPIliIIGTYfKA
ILDLT


LLZTLRDQELTVGL217~tIpFVHSCLORIYLIDpGTVAGVYIGtDGp4~YIHS.
1G
yNyyDI,yNLpKbYWLKInKLSGGGKpRIAIAKALVSKPELLICOB>tL1LDTL
flt;LDLIQTIXKEYGtn'LLFITHIxISAAYYIAOTIAVlmOC81.V0~CitSTPKH
N


. p
'I~DLLDAIPIF6LISTOIZpStCYBLOVASK


CPn_0192 232617 271991


glnP-ABC )uaino Aeid Transporter
pernaaae CP1L0303 211966 215802
CVSGIGIICGSIIGLLIGTV'ISLYFPSIG.TKLLIINSYVhomolog Dresene in Genebsnk/t?~L
RGCGYZT as o! 11/7/11
GVpttNWIARLtt
E


. No sobusc
.
IVPLPpIO'S~TFSPTt4KSFSLFLLEKLDSYFPFOClRI9ILVI?I'L11IALA
V
TVIRG?PLFIOILIIYFGLPEVLPIEPTPLV11GITALSisB'ISAAYLI~NIRGGTNSLSIGD


vIESIWVLCYKKYQIFVYTIYPQVFlf7ILPSLTNEPVSLIKESSILMVVGVpELTKVTiInIAW010CKVSTIEKIIK
ILSFILLPLVIIAFILRYfLHiDtFDKpPLCIPKVIt~I.iG


VSREIJiI~S4YLICAGLYfLMI'SFSCISAISBCRRSYDNSRFQAVEK71VAEISP11FFSIPRKYQLIAIDTPK17D
AP8ILFPIGIEIII.I~CI~I~a
'


r
NLTLIOtEI07TLGNPEEKaLFDSICSIEK00~1N8LESKKLLI'1'HILIDfWSGIIOWIf


CPeL0193 233111 232696 FNp~'IGRGYFSEISTAKIHFHGI~YCPIRSSCPIIOttI


acpR-AS'Qinine Repressor
KLtILIlPl00CKVTIDf.V.KEILRLEG7UITOEc'rtavr.f~,FATTOSSVSRWLRKIQAVIN0201 215691
216002
CP11


AGOIGARYSLPSSTEKi'I'1'RHLVISIRtOtASLIVIRlYPGSASWIAALLDOGLImEILGT~
No sobuec hoaalog present in Genebank/~tBL
as o! 11/7/99


LaGODTIFVTPIDEGRLPLLNVSIAti4LDVFLDpAaAtNFfNNKYSImPFSSARBIWANPFIbTItHEGNIKIKClIC
IfQIFTRLKt<GItf88


YNSINFNPYFFDEDCIVYwtESOIKSAIADHGILGKCILTFYPNT


CPn,-019 273162 231211


qcp-O-Sialoglycoprocein Entiopeptidase
EVPHTIKfi~M'FSNFFIQ.TLGLESSCDLTACAIVNEt>KQI1.ANIIASQDIHASYGGWPECPIL0205 216077
216327
No robust tloeOloQ pree'uIC in GenebenkJF1'~L
as o! 11/7/98


U1SRAHLHIfpQttINItAL.OCANLLIEDImLIAVTGTPGLIGSLSVCVHfCKGIAIGAKKSICDSIKGYGSASIIFt
WPpCIi.LKFFLVCEELCILTVATHRALLETPL7ILSFlXG.ATKYV


LIGV1MVEANLYAAYHAAQNWFPJ1LGLWSGAHfAAFfIFlIPI'SYIG.IGKTRI~AIGETYRAKDIIALM'11f10C
pTILh?SPLCS


FD~FIGLPYPAGPLIEKLiILEGSEDSYPFSPAKVt,IfYDFSFSGLKTAVLYAIIttORIS


SPRSfAPEISLEKORDIAASFOKAAC1TIAQKLPTITKEFSCRSILICGGVAINlYtRSA
CP1L0206 216316 217161


IQTACNLPVYFPPAKT.CSDNAAMIAGIGGBtFQIQ7SSIPEIRICAttYGWESVSPFSL71SPCT207
hypothetical D~cein


IVDAASPACYDSINSDAIGVSLtJ~ISHILF~UIYDt7GILPREJ1IB~11AIVKGNQITpYLL


CPn_0195 231172 :75785
HILNDAInRVPEIVNDGSYOGHLYANYLLAOFRESAALPLTIKLPAPE~TPHAIAGWL


oppA-Oligopepcide BmdinQ Protein
TEDLPRILASYC1IDDSLIKELILTPXINPYVIWN1I9GLVTLVCJIGKIPRDKVIRYtAEL
'lSCNSYNRKISW
TCITTLLSLSVVLOCCKSSHSSTSRGELJ1INIRDEPRSLDPROVRLLNYRLEKOPSFAWONLIAu'ICTLYPGELFYP
ISKAFDGGLVDTSFISNEDVCNIINtrfTl1
'


fAED ESCIHTLCSSTELINDTLEEHEKWLEDfPIEP
LSEISLVKHIYEGLVOENNLSGNIEPALAEDYSLSSOGLTYTPKLKSAPNSNGDPL
'


fLESPTSH
FTESIdIfGVATGLNSCIYAFAINPIfO~IVRKIQEGHLSII>EtFGVNSPNESTLW


FLNLIaLPVfFPVHKSORTLOSKSLPIASGAFYP1WIKOKQWIIfI3KNPHYYNDSOVEtX
0207 -17209 218617
CPn


"ITIHPIPOANCAAKLFTaOGtct.NwpGPPwGERIPpETt.SNtASKGHLHSFOtIACTSNLTf_
ybnl/sodiTl-Oxoglucarace/Nelate
Translxacor


NTNKFPLNNNKLRFaLIsALDKFr\LVSTTFLGPAKTADHLLp'INTHSYPEHOKQGUWROVNKKKRFLSLLFLTAVL:
riTWFSPNPASINStJA4.lOLFAIFTfI'INGIIFQPVPNG11IAII


AYAKKLfKEALEEtAITAKDt.EHIliLTFPVSSSASSLLVQLIR&QIiKFSIGFAIpIVGKEGISTLLLTOTLTLEQG
L:.:fHNpIAWLVfLSFStJIIIGIIKIGi.GIRIAYPFVSAIGKBPL


FALLQADISSCNFSLATCCWFADFADPMAFLTIFAYPSGVPPYAINNKDFLEILQNIEQEGLaI.LVITDFFtrIPAIF
aTARAGGILYPW1'SLSOSPGSSAEIFCCODLICSFLIINAY


ODHpKItSELVSQASLYLETFHIIEPIYHDAFGFAMVKKLSNLGVSP'ICVVDFRYAKFNOSStItTSJWFLTANJ1G1
IFLV 1ALAGHVrIISLS4MWAKAAI
IPCLPSLFfJIpIILYKLYP


PKITrCEEALRSAKLRLKt~tDpLKKEEKTTLtIPFLLV'JWl'P~.LCISA'ITAALIGLS


CPn_0196 235906 237519
LLILTNILOw~COVTANTTANETPTWft:ALIeatASPIIK?LGFIPLVGOSAAALVSG49MC


appA-OW gopepclde Binding Protein
ICFPLLFLIYfYSHYLFA.'NI'ANICAIfIPIFWVSISIIUTNPTPAALTLAFASHLFCCLT
KLKSYSKERSFNLRFFAVfISTLWLITS'GCSPSOSSKGIFwHHKEHpRSLDPGKTRLIApAPLYFGSHLVT1'~EWWI
!SGfALifVNIVIWWtCSLMiKA(J~'LI


DO'LUIRHLYEGLVfEHSQNCEIKPALAESYTISEDGTRYTfKTKNILWSNCDPLTAQDFVHYr~


:>SWKEILKDaSSVYLYAFLPIiQJARAIfDOTESPENIOVMLDKIpILEIOLETPCAHFL
IFFPVHETLRNYSTSFEE71PITCCAFRPVSLEICCLRLHLEKNPNYHNKSRVKLHCPr~-0208 :x9935
250(102
E


HFLTLP ttrase
KIIVpFIStIAHI'AAILFKHKKLDWOOPPWCEPIPPEISASLHOOOOLfSLpGASI'IWti.fptkA-Fructose-
n-P Fho:.photrane
SVAVIL19IPLYYDLDTIL''3Y'opPLPKEPOEAA.SLtA'/PDT.SHSKPWPCVKTLFPO~!'1H


NIQKKW1NNAKLRKAL.iLAIDKOMLTKt~YYOCIJ1EPCDHILHPRLYPGTYPERKRONERILpYLKFVQtTEMMITf
LKi'':VNF'.X7f:PAPtX.71NVI0liLFNSLKDFHPDSSLVGFYN11r7DG


Lti\OOLFEEALDELCM'fREDLEKETL'tFSTfSFSY.RICpIILREOWKKVLKFTTPIVGOE~.IDITEEFI::KFR
N:::X:FNI:IrTORKKIYI'PEAY.EAt.'LKTAEAGDLODLVIIODD
TtQIK
I


FFTIOrtIFt.G:NY::LTVN~%'fl'AI1FIDCN::YI11IFANPt3GISPYI1LOD::HFDfLLIKITOE.
.
I::aITATAILAEYFaIiARf~T::I'/rVPYTIDr:UL011TFLDLTPGFDTATKFYS3IISNISR


IIKKHLf!tK)LItFALDYLClIf.HtLEpL.'l1I'NLRI\ClIKNTtWFNLFVRR't~DFRFIEKLtlIII:X:KAt
IYtIFtKtatt:K: :\:at
tALD:ALVI'tN'rllAL Ir:EEIAP.IWLM.KTI
IHKIC:iVIA


I>itAIWEKYY.NILtfF,.a:SStfETItIt.fTfIF.~..L::E'lt~ItI::RL:a~7pRLLKSFPAPI


rays n1.7 W751'1 :'aHR=
IEUfINDRONY.:N1:YSI:~R::."/f.YLI.IIII:I::NIII!)r)YFMNPFNAI::HfIGYt.A'K


..Nlu\ "It.p,lId 1.1. Ititul)n.t
1'I,ftl'Y~:!::IITI<:.4:ILt'I:\il.'.'Y:ff:."PtI:::IJU.'fYftYYIrI.RAIfNVKMFTVK(~A
~M.Q
1'r.,t.ein
:KIIKO::LIIPtHDDPV
'
Y:
'
'
'
'


fta.
1KIYK'ILVDIf:::Pl\F'RI:.'s:.~lYIwAt.HIr.:YPF'll:ltlrtF:rl1'fTlU:l)NFP~LTLLWHN
:
::::
LfLLFI::L
1
IX:KVIKVItKNF::RWI
rllYlthr.:LDI.KF
:EDFt:::Y'fFFTK0..AL
V
'
'


: Fl,IrrPINJ:t:cetrmn'r
IT
rIIRk:~NOLRLAfA:at
nF::l'Ir~AKILIMUI::IdVI.LFCL:LTHE.
.TP::aNAfPIIILD::PNPDFPKLIJIFp
NF'
TII'h:F:DIRfIAWI:YAVENSPIIISIfOt:I
:
f


.
. ~:IW!l.'.IIV :'l04'l ..'.l'.'.'l
W:
D
.
Ah'A ( fKPFNfKLF:?a'1"rLVEYf P1:11NiFILKKNpIffYDYIK'V::IN.i
IKLLi t PDIYTAIH


Id.NIK:YVIMVt~I5n1~1:fIWRt.11K0::y1'ItYYTYfNI)t:AfyIIL:I.NMCSpHIJ)DLQNRIIRLAm:v
:mt m .:.,n.t.mY/h?Htl. .ts:
.,t Ili'!"IN
~.t tu>Ha,l.m
tmtrtt
tY


'ri'fI.KP:aII:F'.At.cl:ryH'Af.'I'L:ah:Al'~,pMOYKltrll(TL'ff174:1ILVLTYP::OtLRCO
RfA~
KLWNF .
.
tE:::IItIIHI.KNE1'Y:'.1':"1'!~...yYtR.'a.IIMI:KIdd:Y1'.:h'I:YI!'1'INIAI'l'1'N':
LALAYEGNI


IaLrFY.HIRM4:IId.(t.l:!LfYlI1.l'VNKRKVQDYAfAT)TtNAYYfY:ANLI::fED


82


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
rxv rodt.r cuxmUxr G. i. t.~. ';~n~ottu.
EF18L Js ~: 1::~I98-


IIL:.7t~IK::.Y'IL3FPRSLLR'.T.'.LWYRF:TNLIf:RY :..~.DDCPTEATKNtr't
iK:..:F1.'RDN4Er:.TNPISEIVSET35SI1IDSYGRSL


IF
lA..$
FtI~::IIf.CAR4l:~L:.TDIOdLDCC0C14iWlrLLRLirt,
LG
"/ .h[u~st~hllE
~
~


~:Fns021v! ".215 251147 .
noraolog Prea~tnc tn rJenebank/ENHL~
as Uf !1/7(99 ., .
H ..
NLVICFCK


o rodt3c E3674
IEfIEREIFKTIREKEHATISITrLVELE71L.1tREFJWLKDQICPTSDOETTSLYQCLDH
'I
KL


r r.pn_O::S 2b7402 _
O No robes: nomaloQ Presrnc :n GmtbanK/EMBL
:.EFYLLGL::rDKFLKATEDED'JLFESOKALD~AP8L1L.LTKARDYIrGi.GDI~wIIYOTIEFLiras of
L1~7/99
'


.NDCVEIAKAKL
YTF10~IPKKNKKMKFNSIIFLENTKHYPOIFRECEYRDRNGUtEASL%JL:.STZTIZRSIL
A'fGSKYNRRAFCiT:iEIHF;.KTAIRDLNAYYLLDPRWPLCKIEEFVWI
'


LPS . '~!...:
QEEYOKD -'
~!'FtDtETKEGt7E.:LLREEHANEKCSIODLORKL.."D::IELHDVSLF':FSKrf
' ..
'


_ .
.: .
... . .. :.a:~.. . ... ..... ... .
: F:r ~ . . ..
L~~


.'."'r..KI!:~',:.'.~.r'...:_...:
it:; '.MI'r:l:?:.:.. .. =y-.rr: U..'. Loldic :oiaii
:Vn


_
No coDusc rJamolog present in Gaubmk/t>,8L
as of 11:7(98


CPn_0211 252765 252167
NSRIKFLtXtIDrAINSO'l'1'I'POPNL:'DAEPIASRAQCKSIAYIISLIWQIL.LLGL::I
No robust homolop presort in Gmebmk/DlBLSE
as of 11/7/98


ECVFISYPDISNVQASSiOSALLNKTSDOIOOKRCPKOSTFVTLAVSLYIIGSLFLLAGVALISIPIPGLAAOVALCLG
IVSLILGI1L1NIG1LCLL:.RCKOVPOKPDCLPSESSKOP
K


AGGVGLL'JLF1KSLL
CSTPI'ALPWOAGEFLEKVOVSATPILLPKNKDEELSAKVIOCEGAFItASSTKOAYLCB?E
SLVFCVLGIYLCLLLi
LTVP


.
LIDivRKOEESRREiIRKKIVAEEAUURXRI~OOMAaOpErILRKNKELYJ1KRK
I
SHGVL


=Pn_0213 254081 252888
No roausc hrxeolog presort in Genebank/EMHLCPC~032b 261515 264967
as of 11/7/98


ELSYWWSIYSETLSFSELTSC%NSLJPFGPIETASIRINNVPNVtIIVCLI:LCTLFVCNo robust hdsolop
presene in Gmebank/ENBL
as of 1117/98
'


LGNVPLGVFSTYLIGNSSMTILLLLISIGIltLLKFKERYCL.EPKELFiYEOGFDK>IG.PSElifK
AIfNRRRNPYYANfLEFIOGTOSLCPLfKYCFVRFIHYIIGOLEIEDASIIDiIDfLEPPB
'


'II~DQTADLARELDLEOKKD'ZLIRD!'SARLIM~SKTEKKOILKIGVPRN(SBIOERLCAVLCIIIGLiIVALILIt
I
RTLLAAIPILGSVIGLGRS.FSIWSIREPODSOEYKSIfWtTI


AOEONSILEOCKFJ1LLFRRKS110EIFKKLYDRK)1AFWRSYREDLWCYSEINVSKXALSNLLATFIMANPCLKRYAT
FLFYS


YICDVFEC'I'APFFFIIIEaIYAMCRTJU04L11HYVINCIfEDNRYNEEIWiAKOLSVSELLCCCT
DRVRAR 0227 265167 265009
CPn


EIyTDLFiETtiLfTSDSEDVLEEYOIFICIRV1TIWALWAIYNDEWSItKPIDTL.-
~ d
d
id


OMCYVIff><:'LELEIJ1QLYYDZ.:~.F, Ox
MAVEDCZE"Fg< orv
uccase
dsbB-Disulfide bon


..
KEPPNIFVSCKLIJ(EIf!lINFIRSYALYFAWAISCAG?LISIFYSYIIIiVEPCZilYYOR


0213 251315 251190
ICLFPL'iYILCZSaYREDSSIKLYILPQAVLGIGISIYpvFLpEIPGMOI~IC~CST
CPn


_ KIFLFSYVfIPMASWAlGAZVCLLVLTKKYRC
No robust homolog presort in Gatvbenk/EM8L
as of 11/7(98


ILWFSRVIFSYfNQIGIPRLELILPLWKXENDPFCFLFSRVECtF'IIWIK


CPeL0228 266242 26512


0211 255768 25146 dabG-DlsulEide Bond Chaperone
CPn '


_ ZNSSL.RCPL111DCILVLCTANPFIY
No robust honalog Dresmc in Gmebank/EtlRLVKDBADTtM.%gKFSCSILKItENAFEFYVFGSIKOL
as of 11/7/98


PLGLIIEDYERPTYCIIPPAPHPQRVDSKGCIAStIVS?<N1JVALEILGIFFLSGSLAPLVtfCFGFLI1lK10fIIL
PPKANIPTNA1WFP?ICNPYAPINTTVfEEPSC571CJ1EF1TNFPLL


TSCCVLIaAALPILCIC'dYL:.JWALIVFLCNKHkI'RODLOIfYDODLDSLVTHIOCEIPNDIKIQIYIDIGEZSFT
LIPVCFIRGSKP11A0ALdICIYIBiDPRQADIDAYIICIfFNRTLTYPI~E


SELRVTFEKLONLFQFHTImFSDLSOELOC1CFINCNERWLTLFDEVIIfFLIVRDIIfLETRCSHWI1TPEYLTXI~I
ECLILINSGRSVNPKGL.EQCIASCQYNDDTKKNNL7fGSOVLOGOLLIT


RNPTI'fGEQVKGI05NIFDLIiEEKSSLYLELYRLtAfDIAVLLt~IFFLLPPGIL1CVDYOLIEPTAWCDYLIEDPT
FHEIEAAIONIROLOJ1YDGDN~


AIKGLFIRLTSRLDIG.DVKAQERIOIFINF715REP'IfLVEKAFDIVDRATKIO:J~RAKKESP


ARLINGRTESLLt?OLIaIEtAL.ID~K)GLDPF1ILSIiFET.FSPYOQLLZLltYLNSIVLHfIYEFCPnL0229
266163 267560


LISCTVTSCLTLEECCRMRAASIIGLNALLVRlG4FR~IKSAYFEKLTEZEKELRSLODCT178 hypothetical
P~tein


'lIKSLELE;LIHKIKDIVTLET
NS1D1!'SFLRIEOENFSFK~OfSIILSi'IYNI'ANLTKSTFTFILLLLLRiDIDipCLRt11D8*T


LEMYRHFRYRFLLGltILPAl7.cLLLRCSPNTLNY'1'pVDVIFSDRLCSCLLIFL1IABLT


0215 257039 255759
KRSLLWLGIIPLGIWVCLF11CVAGASP'I'TFANDTLIGF71ILAWCISPTIlP6AZ.6SICPTLP
CPn


_
ECPSYNPSA~RRAAYLFLSLLGWL.FARYLTASSLGITSSOSSNFLLLYSSIttEVYSLLV
No robust hotnolo0 Dresmt in Genebrnk/ENHL
as of 11/7/98


LTSSIOCOVNSSAIARDCFPSPSPQPSSTLGVFtPPKYKSLILSVSLfVLGVLLL.CVCFELLI.VLSIaGSERRWHTR
PKIVIIITAIUTGIIIILTLLPIIGHpLRYOCWICIGLTIEPAEJIW
'


VNAIFSFSVL'IYGIGCAGVFIGSLLLILGLIFPVSYNRKL8EJ1TRSLLf<.Q4KTLLEYQPWJ1DFGSEYYKIfZLS
IEER'1'Vi.PWKAY>oQiIP~TS
FAYD!<LRATLRYISOFI&DKRALTNAS!


LRKEWEVOWSNFLLDEWEDTKEWAOHKSOFATFECDLLLFGREVCKYI"lIWILELDGRFPINOLWILVA'LVF'V1NN
SSNCLP1'TPRNFWICCIifIIVLFIW11BSLRNLRY1WLI


DVJ1LLTELIDOIWCPLEFLRIfKCDRiOCEIOEQ.RKZ'.lffBMiKSGLXT.ACELTXFKSALImVFSAAILFSPVL
PNIPVESPNFLPTIV1'GLILIILSIGKRRRTIOIKI.


KIEpfxYRDXRKVIIQ.EVFPOGYRRELL.EVLKTRLSVOCEIOLFEEW511!'LEICJISLNA
'CVFSEEELOEAL~tAKAELLDIOVRKSWEDLSCEP'1'LIpYHIiJtL.YE170CRIVtOFLTOCPtL0230
268277 267576


TFSSEpEKVLEEYFU.KARIRKTLiNKLDOVR71NVAFVAS1TDLLSFSESLt%~16VFEDCT179
hypothetical protein


p
RPIOTALIYMSSOPLYITSSSLSRYWLTGEEKVACYKItAPNHIWN011PAIIL71ML1JIPC


IFCPVLCSILiGAPLEGASILYDVILPWLLPSILVFYLLVLPWIYAYSNfDOpVLJItJIER


CPn_0216 257623 25717
Z1'08I~KEIYDHCEKEKRTPNKKALSLYIESOVLVPEYSKR1SSNTIGKTL1CIIPID~SP


No sobust hoarolog Dresmc in
Genebank/Dt9LLSL~DELIOKALiR7IKENZYIB~JDRtKRDERFJUtRGxNIVSK'1NPLW8LiiG't
as of 11/7/98


NKJ1RTNNPVTFDRIQVDFIPFDTSLRINSYIVAOGLLIt.CWLSIISYICLDIGLVGLSA


GAAl1'LGLGCLIFALFLFSFSLILLL9QEKRVPDVLSLYLEKEVPOYE'LPLYKEDLEBERCPI~0271 268996
268253


DMSAiSERLGTTEEKLRIAOCFRYSDSVfIf cauB-A8C Transport ATPSSe INierate/Fel


POAFVSIOD10GFSMLOAHRLCYSCDt~VILJIDASFOASPCTIT:ILGSSGVGkITLFRLL


CPIt_02I7 257881 258579
l1G!'LPLOEGLLWHGSPWR1~VAYNpOK>JtLLPWRTALKNNTLS'fEIGINTSNE~l7IL~iE


yip
RLEEIIMIFDLCQLLDRYPDELSCOORORIALAAQCLSLKPILLLDEPPSSLWLLIC~L


PKCGKLKGFLSVNELIFCFOTFSVWIGV!'FASRCKAWL1GWLSLLSSIHNVFVWCpIHLYQOIVAWtKENICTVLLVT
HDFHDVSCLCDVLYVIKNKTLTPVPLDPSMRPLii4f3LCFIK


WCFEVTSADVYVICLLTCLNYARFJIytKNDINDVIIQ.CSWVISIAFLVLTOLNLFLIPSPNDLIDWLYT


DSSOEHFL1LFSSTPRTWASLVTLIFVOIVDIKLFTFLpRVFSKKYFA!!RS'LISLLFSO


LIDTZIFSFIL3IYGLVSNLCDVHIFAtIt.VKGTVITLATPTL.TVTKAVL~tRSSCPn_0232 270171
269232


siailaricy to 5'-Neehylchioadmosine/S-Adenosylhaaoeyseeine


CPn_0218 259061 258582 Nueleosidsse


No robust hdtlolog presort in
GmeDank/f?1BLKKP'I1BtRFLFLILSSLPLVAFSADNFTILEEKOSPLSRVSIIFALPGYtPVSFDCNCPIP
as of 11/7/98


IFLSKIIVFFESYDFANV115SWPKSLRALVOGRYFVDSELKtTPYRINDFKXTPINHRLYWFSHSKIITLECORIYYS
GDSFGKYFWS11LWPNKVSSAWACtMILKNRVDLZLIIGSCY


RSLPIISTIGCIIRLIEAliSGPIHPRDKNIfYRFEVLQAVIEILCLCYL:LVFDITCCFLASRSODSRfCSVLVSKCY
INYDAONRPFFERFEIPDIKKSVFATSEVHREAILRGCEEFIS


FLVAIILSLLLYCNSTFTCVONLSPTERFII.EGTGEAVNFLATNKOEIEELLKTHCYLKS1TKTEtTI'IJIEGLVAT
GESFANSPNYFLSLOKLYPEIiIGIDSV


SGAYSQVCYEYSIfCLGVNILLPHPLESASNEOWKHLQSE115KIYNDTLLKSVLKtICSS


IF'ti_0219 259319 260172 H


cgc-Oueu:,ne cRNA Ribosyl TransEerase


CSSL1LKFHLIHOSKIISOARVGOIETSHGVIDTR1F/PIfATFtGALKGVIDNSDIPLLFCNCPt>_0273
270179 270218


TYHLLLNPGPEAVAItt.~LHOFMCROAPIZTDSGGFOIFSIJIYCSVlrEEIKSCGKNRCMSNo robust
homolog present in Genebank/F71BL
as of 11(7(98


SLVKITDECAWFKSYRDCRKLFtSPELSVOAONOt.GADIIIPLDELLPFHTDOEYFLTSCEKARt?tFiGIIVLLFLL
RISRRSYVOEtGIFFHLETPDLKIVt.CAPYSTFLWIIIWSLKN


3RTYVWEKRSLEYHRKDPRHOSMYCVIHCCLOPEQRRIGVRFVEDEPFDGSAfOGSLGRNKGQS


Lpf?ISGWI(I7~SFLSKERPVNLLGICOLPSIYANVN~FGZDSFDSSYFT%AARHGLILSK


aCPIKICOQKYSODSSTLDPSCSCLTCLSCISRAYLPJtLFWREPNM:WASIHNLHHNpCCn_0234 :71216
270518


QVNKEIREAILKDEI
CTIH1 hyPOenecical protein


FIML03CKXALLSIWSILAFHPIPCW)VEJ1KSGFLGKVKGWPSKKEIOEEARTLPVKDS


CPn_U220 260660 261236
LSWKRYDYfSs's'GFSVEFPGEPDF15GOIVEVPOSEITIRYDTYVTCLHPONIVYWSVWE


rln robust homolog presene in
CeneNnk/EMBL'IPEKVDISRPEWLOEGFSCl410ALPESOVLFNOMOIQGHNALEFWIVCEDVYfRGHt.I
as of 11/7/98


F'fSFGKKKCIFYMSKESIRSYSEISTP1'PZFRETPSKLCVAYKI4LRSPAKOCILRNRVSSVNHTLYQVFtAVYKNK
NPQALpKfYEAFSOSFKITKIREPRTIPSSVIfKKVSL


LKCALLR:iIPFYCf.FLCAICRIHSAWSNCDAPC1'fRVINYLVCCLELLGLG'VVVIaCKVLA


'fALKFLFSKASSKIKOFIKWREKARNLANtDN~S;KEFCSVDLTSCFTRCFRLRNRWEE~Pn_0235 271195
272177


t:A,iENp'NREIIV kda8-deoxyoecutonosic Acld ..".ynchecase


VFVfIYLU4KPE;IECf.C I(;y(,PARWN.~..:RYPGKPWCIFIGK.iLIORTYENASOSSLLDItI


t:f n_U221 2ti 1621 262051 WAT~OHI
IOHItTDF~AVMT.iPTr:.:fICTERTCEVARK'IFPKAEI
IVNIOCDEPCWS


Flrr toc.ru:c lwnlolog present
E1IVDALVOKL.tL:SPEAELVTT~/ALTTGfEEILTEKKVKCVFD.iECRALYF.iRL:PIPFILX
in anrt'.tnk/EHOL a~ oc lt/7f~9


Tn111RYK'fEZJiOMVNRYK~3AEFF,ADNYYDDfILI'RMr:fKRNLRC:L11'b'ENEVCLFEE?MLKATMrfLHI
t:VYAFKREALFR'IIpH~:."TPL.:DAEDLEOLRFLEIfCA:KINVCIVDAKSPSV


Gl::>1I1N::11'IM:.:LIGtI'.HLII~IIW.':fODFKDSKIIIFIITALGLLLTL.:IC,IIVLLLKITDYFl:
4IAKVFX~YI'h:I::WIYF


t'f ILLILFTrt'.LU.'YPMY.~.MYSDFIIPI


r'r,r nz :o e7a t Iw : mn,;


nlr~ U:I2:.:1474 ~0_:14~ . IHr% 'Z'F ynrhut.r:u


wwk ::iortl.ir ity trr OdcihriOpn.al::Il'rfYIMOfKi
IFL'TI'l:W.~.::L;KCLTI.A::I,ALLLERORLFNJ1MLKLDrYLJJVDi'(Y114!P
'llfI 1'tt11


.tF:KFI.KIWEKIJ2flJJJIFEt.'IY)PEC'fRNRYN4Ift:.Y:RF'.Tt'f011AKVW:'.YRrVHEASLYEF'
F:IK:EfYVfDCtNEfDLDLC'JI'IIIRF.:::AAL.:hIL::.AT7VJIYARVIKREREf:DYLC.,~NO


Fat'FLTI:f1'oflKIlLrr)YI:::LVKIJIWLFLKF.LRKMI::PHKIRYFfiI:A1'f.TK(.ORPHYHLVII~I
IfTFIEIIt.WILDMKCII.~.rTNI.I'fF:Ir7~fI:DIE.:LFFLFJvIRUFRY011,~.EDCWIH


1 J:: MT'NF'f WMVEVK::K r'fVI C: /rJfL1
: i'; I 1 ftM L IJ:R::fYFL'TOEVI(~K
L::LPr:NVFNR


AVFN1/ InVKHT IYE74Tf J1LA(/PJf
LAtlf': :F:Kt.YLATVPEtI4DOWKVf.YN(N.uVIX.PKVK
I


'far ~I'.1t .'.r.1rSU ~~, r f n:W:K'IVOHRI1AYK::fFEALT11M4Vf/:IIMI:I
v IC IDAEGF1ILTNEII.fjt:LMt:LVPt7ItFG


83


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
IVY LEVMKFEFSVALKYLIPGRGI
AIV'r'LF'rI..LV'.'4tL.ii'.'F:::VIHGLC.~NIEJL
POVDL'YDPE4DYLLPE:
EKIA:
~:'


YR(.~IF~:YIArbVtf~CRCOGiI'YfrICII~VLWEn.
.~,NLOOANaLFJ4DPNTPItP ,n
LKPCDKAHKAlNCS5LI0ERHRHRYE'ItIPDYIOSLED30LHSPITILPSD1'YY~~rtYOT~NSSL:~T1Y:T
' PLKOCDI~O~'u'~~w'Y '


fPCL F
'MFX'AOf'LYAT'SCFMRLt.A TNFL'fYPSKLSYE~YGPtfD~T~
IIr;LAtyt:fr:I'FOCLCEItEV~DNPirltt.VOFHPEIYSKLISPHPLFIAFIEAALVYSKDA,y~ySip~Kltp
YTYIrII~,FYNpGLSPLGfOIZYFIDPDIJIRSiR50


.iHV ~D~,yE
Ta~DYOYFOP


7 ~
4 ~EGId ~I~i~~~.


421 PLVKAImLO~ETfMAFPGONLPNSVHPOAIYFIGLG'i.
.F~_023' 2'3741 2 'fiFAIITLKNtA


Y'fAf FnmtiY .
'.rK,vYtrI~KRI''.LAYAAEPLLLTLP:~NIEJ1GKNLKLiA)tAIJFKACGWIG
..,L~t,IC,IW;Kph .:.\t.' ~ :Pa, W'!~1!'.-r......-
PM
'
'
-


.
.:rAE a7rJ~
.. '
. . .. .. ... , ..::.iii.:.F
.... .;,~., 1.. :.::.:: .. . .
.
.:,....,
-,
::
~
~


. luu: -
:. .. . yOtlJ c1
. ~~:r..,'..:.._-;..:,f::
,~..a.FYY


ri37-L33 Ribosas~ai Procsin
L!'K6M
'


274110 275839 L~RL~~DRKLRRIN
_pn_0239 KDSSNRS10IREIIKLKSSESSONriTfIOtKRK


wf-Giueoss-6-P Dahyra9snse
NFLLFVIFASAGtID!EIIOfMVVOETI~I1~ISPRTCPPCILVIFGAT0151 286036 287559
' CPt


~CMIOKLRDFNPR 4-
L'IKECRLSDDFVC'.IGFARRFJCSIt~OENK~VI0FSP5F3.DIKVconseswd hYPothetieai Dsotsin


GDLTI1RKLLPALYH
SPDICLPYMSPFIOCTVIiRLICY15FOKE8A?LPT::REPRiTKSLGStNSVIS101KINF
RLFYHRSEFDNLi'ICYTSLKDSLDLDK1IAL~POYFSRII~.NID
F


IEp
ISI,fiCSRNLtI~6Vt4iGILLKIVGY11'~STNEIItDADYL.ILN1CAFUtSARDGKTIYI~NL
OQ ~SNINYLiGSGfIVI18ILSAIESRiS~ItIStIKSIfI
KHKLFYKNDODGKPIiSRVITEKP~IDLDSAKOL00CTNflVIHIDfIYLiGK>:NO
KSf:NLRDINONL1110LLC


NIL'I'fRFANt'IFESCWI'tSOYIDHV0I5LSETICICSR~IPFEVTI~~
PYITFW1DEIRKEKIKILQRISPISEGSSIVf10QY0~'VO~~~~~N~A~'~'
IIPSIKIDtLRSKPL0p1L1~ERTLYf~BV


:.LTNE KEITLIAODLGDYG~LSTDRE'~OLEE~NQ''~MLYLYP~It~
KDSRVETYVALKTVINNPRWLCVPFYLRACKRLAKKSTDISIIFIOISPYIILtAA~SI~GFPGE


PLtIJGLL::AIOPOEGVALKFNGKVPCTNNIVRWI~tRYDSYIOrT'rPERYERLLCDCSNPKLLPYVDIPLOtII11
D1lILK0!'BIIfIT'SRCQIIGFLEKLMKVPQVYIRSSVIV
EEiiDODSSPSFPNYPJIGSSGPKE~DALIERDQRSWiIDF'TGEOWIDNLGTFLYSO>FN?fPAt\ELPDOIPEKVKP
SRLILILSOIOKRNV
VNASWKLFTPVL TOBEIOEL


' .
IICDRTLITGGDE
pKlOIDKLIGOLIEAVICtIYtIPE114fid.TJIRFYGpAPEVDPCI
IVNEAIa.VSHIGCI~FIE


RPL I~~WB~~


~Pn_0239 275863 276672 . ' .
se-6-P Dehyro0enase IDsvB familyt
Gi 01ST 288112 281576
CPtl


ueo -
devB- CT111 hypocMCieai Ptacein lframle-shift
KaISt'toiIGITNATLINFND'PNKLLLTKOPSLFIDI~SKDNIASANOATKDLwith 0257?t


SGGKTPLETYKDIVINKDKLTDPSKIFL1WODERIJ1PITSStSNYGOANSILR~I~tIPDEATSTVCAWTZrDnOSlB
7DARSCSFRRACRFriRYWLGGVIRIPNNKFt~tdl'STDSIVINSAI


aIFR!!!:1'EIPDGAtOtYOELIFSBtIPDASFDNI1'G~GLCED.~.YIO8S0''O~~IPRLFRTSIlXIKtIGDNI
DNCfGGELLLVAYW10NPLFPDIR
SLFSNI'SATEBtl~LW N


FNSVPtIhlTF7Df1'LTFPMOOGKNVWYVOGEMOCPILKSVFFSEt.'RECKZ.YPI1~V~DIEiaIIS'fCSGTSYY
RARPIIGNLCSTIYA~~'~'~SFRVpSPBwtIATLPFV


ASPLlWI ISPESYDIMiONISSTYlOmIL


0210 277861 276698 CPtL0253 288171 287950
CPn Cf111 hypothetical Drocsin Ifratae-shift
with Ot53?I
F


_
FC"f3CRT!'ISSSIPTCOIfITISIPTFVRFNIESINLTDEQKKTALTTGONIATEtIIWi~GN
No robust homoioq present: in Gsnabenk/F1~L
as of 11/7/98


LVYPNVFSPSSESWKaNSWRSN~~VSPSESTEY!!>fSETM00RVPDIESLfDVDADODLICONtttE~N~P~SGRVNL
SNSPFSYQOSIGMtRQDYI4'tIt~fl


RP'fD!!8'ffGFItAAQM.GNLFNSFGILI4lCFSQCKSCOTPGC>El'SATVLCJ1TLLF1<WALIEQPOQYVPY~O
a'TN~RAALSIi~t~SGDI414GE.5lIYLGTSSIKI~I~VO


t,GpTI~,AL,VYCAY1NY'1'LCKtIIYSIliKAItAKVLRHP110ERIFNRARGVATIRSSiEGVK'~
~
I


CPIL-.0251 289368 28859.
itLYKSANIGSLWSLI11SL71LIALTAGIVLVLFFVAPGRAPVITAAM10CCA7100GAI
tDLFLTDCISH
t


fItATX CT143 hypothetical Protein
SLtGWIAIVNKALDIOLTN~~AVSERLLHDPSNFOATLSVIaNVRi~BJLETRDLKVLLPfTTSPCEFIVIfONIISAt
xS
RPSOHYOGSSDYQHRRGINtONFI'GSHFOGOOGFAGSH


YGNLFSNEEVAOLVOGGAPGGGS IPHKTLt3tIlmONLFIDO
' SAAI>ALTFSYYRI(TOCORANLYTYYPGN


fIYC ~T~p~ntSKTDVSOTP4CNNTSDPO
:..AGYPTAPfNPSAPPPFPPPAYD
CYYVAPNL'ttZTHVAATTiKSV&RNRTPDFBAYADIEPWKLfCOVCIYf7Y11'I<u.TRYIBCQ


0241 279372 278203
IATLTINFVSQiIOITLLC1'SD'~GYSSDRTSVAVTAIFSVTILVSSPIYDrPWI
CPt>


_ It~JtSLS~I~PFPSNlV6VD
No robucc hosalop Dresenc in Gsnebank/E!!8L
as of 11/7/98


:FLVKFMSA!lISLSSSHGSTASEfI'pVRDVLVSL~EfYIDREfEILPTKVFLRR01Z.SS


TAIIDDIJIDVVETBIGBHIIFOVYSNTSLR4IYORFFEKIFOICCCFLLLVTDBNIfl'DPOGA
0255 290183 289329
CPel


L;TCIIF1111Vh!'1'VCAIVFCPTi.CTLCYSAY1CTY0LTKKISSLSRIIi?Zi~FTNSVOKSDPFI...
AAAAS05TIKACKStFROSTGTFFVt.GLIITISLAALIVGLVFALtTLDPGAPACT12 hypothsciul
Procsin
V
KNNINIiFBCYFtILDSTVDCDtSaANLKTFtI~AOGISS1'CIFSIIOQItiTPKDO
i


A TLLKVIN
HRSG
V8A1GLTSGTI~~ONFTEEOISIDFKfBIRLSNCALPK6DCDPVPANYVRBPY!!CS
I
VIffAANIOCCAAGG'fGILLSVIGFLtabvYSWKSODGVHIQOn'ALLRCIVSNI'IION~Y


LPITPCiI7UfVLTOSIRRYDOFFSDDEYRDIESEVPLNRQZTPPPSYE1'LFHEECSOaSSNKPT.IGDTtuNSO>eS
~T~ETTI'~NVNSTfRTIGWKQSTRIL'NC~I'AZ~T.RA


VIPRCSPPAYS1'IDSSNSPFPSSSPPPYYA I~ELypK7lNptBJNGfIOGRIYINIiDLOCVGC
1STIYSOGCYATICrLCrtTYRASVD


VAPNPNDPNRSDNYNAGI~~IGNYSFSLLYYP~C


CPt~02i2 279975 279487
No robust ttowolt>Q Dreesnc in 0256 291=82 290398
Gsnsbartk/Hlst as of 11/7/98 CP


KSLKYCSLYOFSOKPTVILN71CSIFF1(MSt]f'D,n.
YZmEPLSKKTACLWD?f4.YPVIAWCA CT111 hY9othsclcal Procsin
'NSWLLILKVLFLLLSFPFtQ.CSASSALPCERVSLGSHFKCLYGCCLPYLLitCItIVPVFCGGRIJISERA'PKTKI
SIprIVRFNIOSTNLTF~OKKT'1'FrVCCKSt
l'fQ'lIVVR~LrCT


.GTAIt~FI
ISHRTSEDARLSSAIVII~APILOL71015GLIKPDachTCOSt~r:aKD:rITRErsTNSEIVeDCRLN~.sNSPLma
~ISacODTTDraaESSaKP


OEYVPIGYYKRTOIEIIR~ORARN890YVOOGSVPSCSYVPwNKFDOTS'fQICISCfEIYTDP


CPet_0243 280609 280133
E1D~TK<.VE'EVNNKVPKLFET~I~~TLLRANEY00000RINYfDLRN
No robust homolog prn~t in
GensGank/EL~LBRGSSYYE'fRPI4YVCVTYYAQ~CYETFOEaRAGGCLRVSFPSwNIVIILPYVL
as of 11/7/98


iNYNIfLVFLLKFVKGRIINACSIGYItLCNANEPDRF5111SINALV11DILLYPFNAVIGWTT


FAVLltWK<.LFL71TKFLVNfCIAACKSRPLPSCKENFOCLFGPK~(PGPSDWLGCLVLIP
CPtL,0257 292136 291267


IIGTLIYSTIITYOSDZi~RLRYFIISPAYInICSTAIINWCT143 hypothetical Drocein


_
GVVIBtRRM.OKTGPHASTPSINttAfNtG ~0
~ ~~T~'N


CPn_0244 280906 281556
ADTTTSPCEFIVODCB.SAESSOFKATTLSKCLBTTSEDOODAVPKPIfN&DPQSPR011LT
'


adk-Adettyiate K>,nase
GAPLVTKCSVFIIMGPPGSGKC'EDSOYLANRIGLPHISTGDLLRAIIRELTPNBLKAKAYPYI
YNY1IRMLiCOAtNL~SSSQPL'NGKPIETVC~IPNPE'fYRISASAKIYDAVIII!
IIIRESr#ISGLDNPNbYWI3tIGI34KTLTGU~DTRCY~RtRTSIAV


LDKGAFItPSDFI1WEILKEKLDSOACSKGCIIDGFPRTLDQAHLLDSI~i~VNSIVYI'VIFLOFE~GIYOVrIO
TGTFTLTEIVATPPHDIfPNLFLE1TIGIDIKSMSTCVIWFPFOANFJILVD


ISFDEILKRVCSRfLGPSCSRIYM'SOGHTECPDCNVPLIRRSDDTPEIIKERLTKYOE


R'fAPVItIYYDSLGKLCRVSSENKEDLVFEDILKCIYIt
CP1L0258 292531 292133


CT112 hypothetical protein /frame-shift
with 0259?t


=Pn_0215 281627 282199
CFSFCRLCSKFEKITLOCKCAIOLLAAGTYILTPTICKRN~WERiL~3GSIRLFBt:KYTGD


ydh0-Polysaccharide Hydrolase-
InvasinQMIGGSTV1STI~TAVYRDHSDIDPDPNNPSDKYEB'MFLfYRNCOHSAVIG
Repeat Family


TCOKEIMCNIfL.iFSPSADFFSKOCAIETOVLfGERVI:JKGSTCYAYSOLFHNELLWKPYPNYSITLLYFAG~fV


r:115FR5TLVPCTPEFHIHPNVSWSVDAFLDPWOIPLPFGTLLtNNSQNNIFPKDIIJ4it


!4'ff IWGSGTPOCDPRHLRRLNYNFFAELLIKDADf3
.IttFPYVWOGRS1MESLEKPCVOCS CPn 0259 293031 292141


CFINILYOAOCtNVPRNAADOYADCHwISSPENLPSCCLIFLYPK6EKRISHVMLKODSSCT142 hypothetieel
Drocein /frame-shift
vith 0259?I


TLIHASCCGKKVEYFILEpOGKFLDS1YLFFRNEbRORAFfGIPRKRKAFLI
YFYFKRKTYtNFIEM1'I'INNODMIECYFKLDSTVDCDLLASNIOTFOttOAKCISSTETF


0245 282955 392551
3tI00NATFKEIfVSATCLTSASTYKLNATGPAPBSITIDNKNNRtSNWILPKNPCDPVPAN
~P
GTME'DDSSRYLPI1GDCSNYTLYOSSKAGDVPRPVDWOONSKKL


. YVRSPOYFFCAKPIE
n_ HLGLiT4PYNPLtAEPTS
rs9-S9 Ribosomal Protein
VvAKSTIQESVATCRRKOAVSSVRLRPCSCKIDVNGYSFEDYFPLEIOATTILSPLKKIT


EDOSQYDLIIRV.iGCCIQGOVIATRLGLARALLKENEENRODLKSCCFLTADPRKKERKK.
0260 291090 233518
CPn


YGHKKARKSFOFSKR _
secA-Procetn Trensloease &ubunic


AYLDFSKRSCVEEDNVSKKINRE7tK~CPCOSNKKYKOCCLKKEEOTARY7Tl~%PKPSAEV


tp =0247 283130 293969
LSASEOGEIIGONC'MtLtORISOSLTSEOKMVGKFNOITKItKEIMSKKALJIKAQAKE6KL


r11:-L1) Rihooal.~l Protein
VTEKLQOfINFEIWICENWPPEIFS1'ATLNOCrNFVUEDFIPTOEDFRISENSOKPPVEE
D::YIINKILRKOTK7TiVK.iSETTKSWYVSIDMGKTtl'ALS:uEVAKILRGKHKVTYTPHVA


McaXNIVINAEKVRLTt'AKKGOKIYRYY7~CYifWF.EIPPE1d94ARKPNYTIENAIKCMMD


I ~HfIetL:KKUIJt::LR I VKGDS IETFE;eKP
ILLDI CFn 0:.'t:l 1'1A27" 3~StW t


~'fw n4>t :x4151 ~dO:Sf! ytl.~-FF-Gx.f. .utlltrt.lmilY A'rPner.
~.FIRIPFIVFN::'1'i.LIIdPMStN:KRLE:LVRKALYTIITtILANIINKIWAIJ#X:KDSLTL
Y


;'~tV/Vt4N1 .\tu' 'Tt.mcpwt.r ATF.~r:.-.
L::K?IOOQN LtJtL.KAI.
.TRC.ftWLDI.ItAVNt~Y:KY~tr.AEVNKPYLTFIt:DUt.CIPFRTIP~WAPETP


te::1Iat.I':ItVIt'/A'rP.:It::FR.~.PA<:KK::RKtL\t:tdtlIFY::RIJ1M::LLIEAKNEr:YPt:
.~.UARRRLLFOAAKER~A::AIAEY:IIIIRDULtIOTALLJit.I.IIKAKFNitLI'VIOINIIF
A
DLKNODL
:f'~ LRFFGY
'
'
'


. ?dl'fUiFLIETPEEWIRKFAKtah:FARVTt'tt
, IM/::(.Ii::YAKU::IJtt.lB~VFPL7WIWIA
M.LtILtF:fLDVP:~ K
iY:A::t:M:K1 Y
I::I I
:'tJl::If:l'IN::1.::IJINa'I
.nIttKAIV:F'VIy/tll~tLl.l:DVrJLIfNVltfti\LLARY.tIIwYti::PVYTRALELLGLVNLEDKV


1't,l.:::Y.I:r::l?Yn.NtVAIAItALINBI'AtLIrIDRI':aatILEET::EUttINLLLEUA::ALCGIL.
:Q
LAfQEtK::::


!V'1711/KItI\::1':::HI77lI::N:KLFI'IIN::nanVl2i.~ ='t.U'.'. :'n':w t
~


~IH!1 ~1' : NN'it1 :''1111 inllB::4rB~ t ikne AI:1.1 ItN7f:~tl.It.ll:,.
I.IFNINKEItK\t'tL'IIJ~MKALKI IL'1'NUU:I'FAKtxt;;t:t.V.':ALLFNIIt:UtYIMfVAE7U::


'I'1'.I tnYt.n 1.t m.vl tt..t.ein


84


CA 02350775 2001-05-11
WO 00/27994 PCT/US99126923
GiLRTLFE3V3PDLVISrINCC FTEPOAV\:LE:.RL'IiLT'::.
.iQK6YF.ELLNKiAYYKCVw"L'ECw:YC:IRNG4C:.
I "
:SP" '
'
'
~


.. t ~iDOYVKRFIPVKYFKEQRRtIDHCIF:
X fNE:.IfiIT.
1CAS LKNNIIVARRT".tEFDAGPIROrEOII
PYAYFQPVKGWAV
..N(JV
:K.74\L..
LtSOPFP


tlNt':KN\WV:x.TICMKQALYt7raW'NALSOtNMISFFQQDKAPEZUtALVIYP iN t
P YW4'IOIWOT.Pt~u
.FObD(CACFLKAY'd311K'Illd(Lb


':LTt:IlJINFPT::Pf7CSSwlOL7INLVPPvIDEFFYCEPQYLGSVNKNtIYYVCKISG1I1LICAt~
?~~
PCEELAAlWIK11FI7114GFta~'LA SLt7~'~PIIK


EELACNLENfII:,IIGPIF.ipFt;aPICLM'LCEFOKTQINPFHL4LLSSEL1TKIFHIVSDEOfVMLf'MtGMAVR
FPHfxVRPhCRTARGYRGVSGIQfECOKVVSCQIVCD~SV


LIVCOpCft:KRSLVWfRETFWOGVGVASILINERNCNVIJG11IPYZDHOSILWISSQCQA


:9517 297136
IRIIIIQDVRIMCR..~l~7r:'IRLJHLKEGCALVSFtCKL~SNCJDCeILSCSEEF7CSC:YSLR


Y7i17 hytbchetlr;.tt Protein ~> .
aPRKLRVRPP::LAKYAFRGFRNSHCPRPTKFSFPLYFSKtLSWFIIGGFiJIAC~uV0- ~Jr,01'~4
~
'


.1 ,
\L. .
.. v:::7.:7iYA:. . ':L~!':F~~:~...-:- ! '_' .. .. ..
.'~I'.'-'FI
.
'


. ~.r
.. .. .
."(f.Y~:.._;,L~.:
::P1NY::.'.~Cr;;'E:f::fNDPKEKNf;.:~AtTi:.c'v:wn'fRlfRi~tSIYIi:l7lto'C,isiitlLti
6llYLtNiDEANir.lv.i
:H:::
.u.Yil;


:IINKKKCYT11GOLILE~INFFIfAiiGIVY%NWHTAFYSFLTYCIATKYNDNVIInCLECCRIDVRILCCCCIVIV>
xIGRwIPIEVNERLSAKOCRlYSALCVVLTVUIAODKTOImSYKV


KSVTI:TSSPRKLGHIIlIEfLGIGLTYIHAIIiCYSCEPRNLLWWEItLCLSOWtEIVNRSOGWICVGV~CVNaLSEI
a.VA'lYP'I(DKKCYONLISKGIPITfPiQYV$VSDROOTiIVFYP


EOPSAFIAIFIfLIIEVINCRRT
DPKI8TC1'P~ISILIOIRLRGJIFLNRGZTZVFEDDAOVfSFDKVTIPYCOGIOSIyfYIH


OFB~L.!'SEPIYICCfRVRDODEIEFW1I4WNSGYBELVYSYJMtIIPI'Rp00TNL'1'GPS


CPn_OZ6< 297770 297155
TALTILVIFT1'YZKAtOHWO'a'tKL7ILTG~IRi~i.TAVISVKVPNPQI~O!'IOQI~NBDVS


ubiD-Phertylaerylau
Deeasbo7cYlaseSVAQQV1AC&ILTIFFCZHPQIARNIVDIIVFVAAQARB3IAItKARB.TLRK8ALDSARLIGK
WCZSCASGYIL71VKLIKELVNAKHQVCVIISPSGRKiLYYELGCOSFDALPStEN
M
R


K
LIDCLEKOPEItCQ4IfIV~,DSAGCSJIIIpORORRFQAILPZRGKILNVCKJIRI4KIlONQE
Y ~C
LEYIHTNSIQAIFSSLASCSCPVEIiTIZIPCSllITVAAISIGL710NLLRRVADVALKaR


LMSIHL>DiLLXLSK5Gl1TIIPPNPNWYFIIPO~~L~~~IIOTIZAAIGG:ZGADIfIt4.SKZ.RYMIIINl01101
fDCSNIRTLLLTP/YIUIftALI:


PLILVPREIP
VYZAQPPLYINSKIOtDfRYILSEKDfaSYLT2lLGTNESSILFKSTCRCJICPaLCiTINV


PSDLTKOwSNPE
ILDVCSFIIITLEKKAIPPSE!'LElIYItO'..Iu"YPLYYL71P7f1~f00GRYLYSDtAtCCAL110


EC1?IKP1CIIELYIfVAYFVDI~LOLKKYCLDISSYLIPOKNEIVILRf~SP5CNY8CYTLE


=Pn_0265 298672 297730
EVINYLXNLGRKGZEIDRYKGLGL~Ii100LWD'1'l9NPOQRTLZMVSIJfaA~TaDNZPTIQ.


ubiA-Benzoate OctaphettyltransferaseNGIlYPPRREFIFSHAL4IRIFOILDI
:!IIIVRLYYFI1JLVNTlCYSIFSILFLSAS1Yf11LSINEZSpNLSFICEGFKISVFG71IAFV


FARTTGIWFIQCiDAFTDKIQffRTSKRVLPANLVSLNFAWVLSLICSFLFLFLCKZLRIF
7
3


.'4IVYPYMKRVTFFGIRJGIIGLVY1Y11ILlNlCAFAFSCLSIIRLCFLULiIGG9
aLGIASLa CPh~0276 311110 310


. CT191 hypothetical Drotem
SVCNVIAaNDIIYAIEDTEFDREDGLRSVPAHYGEK101VEIAKVNLWVSYtJIYIFSGTVGI7NP'LKRKKRDGSQVO
NKRTASPIIDWWYLFOfYLQEI.QKZNiIANPN011IDAWNOVf'ItDKY


SLDKEFYFfAIIPLWILKWRMYSNYSKKDOEGLSKPFLANIAIALSFLVSNTLTWSLSKGMSpAIGFRDHILLVKVYNS
SLYALLI~'1'PONDLINSLYQVASNVpIREIQFLI~r


R


C?n_0266 299181 299876 CPIL0277 312003 311104
No robust hosolop Dres~t in GensWnk/ElmL
as of 11/7/98


No robust halsoloQ Drsssnt in
Genebehk/ENHLIKHLPPLIFYGYILNZIHVRAtAtGITSVQQPSTN!'OAAIPIL
as of 11/7/98 NISIFYPKYFIEGKCVL


IMALDEINNOF87PSppI115STSOTSKINODR1(TFACTVTLLWATL!(ILSDIVLLtTIGS.
NIVIOCSRZSSTYAEDIEEVAQEIILFJfSTNSKSSTSVM.WJWRVRCIfVEILCOCIVILAL


IGLSVPLSCILCTFAVTVCAVLFZ1CLTILVRKSLGIEQ10~DI3ifT.KIKTPTPPARPLIf
VVVVV
F~
t
GEC<
C9


SKFSVrCSTTSIVLGNALLIG71WSVFFL'iGYLpLCLCACLVCLG'1'ALTVAGLiIRNSPRS.
C
I
YLCP
G
EITALSILpVIZKLIItCLIDVLCVCLFGLGVCWAIIG71IA


WDOCCSGSADSQSNIVGICEPKAAQDOKWY)0lAIMIG>mGZPTAIIILTPEKPIIVKTi.ISPDKPYPfVVYV


=PtL0267 300122 300910 CPtL0278 712881 312060


No robust homolog Dresent in Genshsnk/EMBLcaausrved oueer eualbrane
llpopsoesin
as of 11/7/98


SINSWd(TN71LLNQPEPAVCLNAWDPKYINQDRKTFACTVtLLVZATLMILT1GVIVLLR08FBfKICLSLLVCLIi~
fLSSCF(KCWIpMCIRIVJ1SPTPNAELLESL01<aAItDIGIKLKIL


VS:aGTSVITLGTJ1LFIICLVKL210f5L7~WI0YpKYFOE<fVICOKYEPFSPI7CCYRIPNRLLLDKpVDANYPWI
011FLDDECE<tIfDGIOCELWIA1MILCPOAZYSKKNS
AF1GSPGLSV1


.
SLaILKSQKKL.TZAIPVOIITILIQRALNLLF~CGLIVCKCPAMitIffAKDVC~KCiRSZNI
-PKNONVttKLTSG.PSPLDIESPSPEASTPVSIQ.RIACSGYAiVILIVTLLIGAWS~IFFC


ptaLllIGFACLGT71LFVGGLirGLRTNSLIAQGINYLYLTYYLSSALEERNEITIImQLEVSJ1PLLVGSLPDVDAA
VIPGNF'AIMNLSPKKDSLCLEDLSVSK7fIi~ILWIRSCWGS
:~GYL


, P101IKt.QKLFQSPSVQHlFDTKYFK~TIILTIEI~FIC
RNEINTYLTEfl(~tQ01dtL1DILLE


0268 30091 701318 CPn_0279 313516 312875
CPn


_ Pwsibls A8C Transporter Peceease
No robust homolop Dr~~t in Csnsbank/F~LProtein
es of 11/7/98


xawOltSLNSQCOSSSTS"110EWNKStVPFK'R~1PTPPLSPIPSLDEFIL7IYEPtI2PKSDPEKKD~SDLIQIL.L
KETVNI'LYIIVSTAFF1SCAIGCNLGLGLf'C1'SPItBLNPIDfSLYATIS


NAQIIFtPPCI'STPNVFNCIDDtlIPLLGpPNEOFE1JIFBtPGTSCSNPTSLPAPtE~'EtNSNZLSE'LTAIPFAI
LZVILFPITRIdIVGTS1.GP'1'ASIVPLTZGAIPFWTIWD311RNiAL


QECZaGSCN~LIG
NYLC$J1VALCIPKRNILtGILLPESYPQLIFSLKSLWNLISCETL7IGlVOOOOI~pIi.I.


QYCYYRPZ<3iSVTtSVLVITLVLIESVRILfIDIWGRRVLKIfROIL


CPn_0269 302168 301176
0280 311593 313550
CPIL


DipeDtidase -,
VATRCVIffII7F0LCIM.LSHPNFGRImPAVRCSPEQLLSOCVRpQVCAIFVPHSRGEPNCDItdppl'CiDSDtids
Transporter ATPase


pFtSLifSLPNQYPDIGLLSYEEEElIGSSS010CSLSLIRSIEN1l5J1LCODTAPf.C'ILI31KLIKCGWLVSEpH
SPIISVQt7VSKKLGDILILISIfVSPSVYF'CEVFGIVCH9I'~GK1TLLRC


IHLTKOGPIJ1YLGIVWIOGDNRpGOCfF.APIDtLBNDGKVLLDINYELCVPIDLSHCSCKL.ALDPLDNPTSGSISV
AGTDNSLPTQKFSR11NFSKI(VAYISONYGLFSSKl'VFCFIIAYILItI


EDIGDYT11DKLPF7LiIVIII.f'
NSTIPRSVGDHRANLVDAHAKiZVRRIOCVIGLNGVRSYtICDSHHSCISKSEYCEOVYI7I'I14FLNLYNRNDJ1YP
GNL.SGGOKQKVAZAR71IVCQPLYVf~GD<I


IGDLEKNVLtIAENiGILSSIVLGSOFPYANFaENIFFFtF.CSSAffJWPVW~OLIHRIFSItGT511LDPKSTENII
ERLLQI11QERGITLVLVSHEID'WfOCZCSHVLVl4fpCAVCELGTIEE


KAESILSSRAOSFLKQ11IVEQVNPKITDVKf.
LFIi4SFiISITNEL!'HEDZNIAJII3SCYFAEDREEYf.RilIPSKELAIQCI
ISKVIQ'fla.VS


INIL~FIINLPAKSPFFGFLI IVLOCEYD~tKKAKELLIE1.G17VIKTPIf


CPtL0270 303313 302168
0281 315033 316103
CPn


yvlC-SuAS SupatEamily-related Protein_
SIFGVIVPDKIUQITFSLPEVMSAINQGKNALPTD1'VYGFVLSLY11SE71EERLYALKDR-dhnA-Predicted
i.6-fructose BiphosDhate
Aldolase Idlhydsin


EPSIGFALYVNSIEDIENISGYPLSPTAKKLAOLFPGAITLWKfiPNPRFPKLTLiLFAIVEamily7'


DNS~ft7REIVINtCGTLZGTSANLSEFPSJ1LTAQEIFADFADHDLCIFDCPCSHGLESTVIIAISLRRHTWtrIIHD
ILGNDt>E2tLL5YQCKNITtmKLTLPSNDFYDKVFCLSDRFB1RVLRS


SDPLYIYREGLISRSVIENI71G'fEIUCIFHRTSHAFSKHIKIYTSIIO~OEQLVSFLSGSLDFLOTNFSI~ftLANS
GYLSILPVDpCIEHSAGASFAINPIYTDPF11IVKWIESOCSAVAST


KCWCENPKPIOIFYTRLREALKKKTPSIVPIYDINt'SDYP>r3.fPFLSPYYIEYCTLSLLSRKYAHKIfFNLKLNHN
ELISYPPKYHpIFFT0VE7N1YSNCAVAVCATWIGS


ETSNEEIVAVSNAFAKAPSLCL71TVLWCYLRNPAFVAFICIfDYfffAADLTGOADHLWTLG


CPn_0271 303628 301362
ADIVKQKLPTCQOCFKAIFtFGICIDIItVYSELSSNHPZDLCRYOVINSYCGKVCLZNSGCP


LysophosDtsolipasa esterase
SGKNDFTfAARTAVINKRACQIGLILGRKAFORPLSFGIGLLNLVQDIYLDPNITIA


KLIifDYSFFRRKICNIPJ1IECPCNPQDPIIILCF~YGSL110NLTFFPSICSFSKLRP'lWI


FPNGILPLENDFRGSRACFPLNVLLLpELSRLYAF1GVI07LQEKYDELFDVDLETPKFALECPnr0282 3160A1
317529


ELILNf.HRPYNEIZICGFSOGAIIrITHLVLTSQNPYAGALIFAGARLFNQt2rlEOGLKQCAxasA/gadC-Mino
Acid Transporesr


QVPP'LQSHCYEDEILPYHLG1W
LNDLLLTKCI4CpFVSf'H~HEIPSWFQIO~VTVPNWIILILQSLNFSKKVETMSHSKP1'KPLCTFT~IiLSLJIW
I:at.RNLPLTAK11DLSTLPPYCL


DPARG
AVICPFIIPYALISAELASFKPpCIYIWlIRDALCKWWGFFAIWNOWFiUlflyIYPAV4AFIA


STIVYKINPELAHNKIYIATVIIJIDFWILTFFNFLGITSs~ALfSSINLIC~LIPCVIW


CPn_0272 305272 301340 SLA4lWIFSGNPIAISLa~FtLLPNFSMISSL
DNVNPRK


dnaK-I711A Pol tII Gases and Tau
NYPKAVFICAIA'tLT:L'JLv'SLSIAIVIPKEEISLVSGL'(K'EiTLf'IDKYNL9WIfEGIW


FNRQSI7AT'IATyNMHLEEENQGWFrILLRKVYHQEVPPAILLHGPTLPVLQDKAEOLASEIVMTIAGSIGEit~AWM
FAGTKGLFISTONDCLPRLFKIMt3KNVP'INIJa.PQGIWTIPTL


LLS.iSPCSEHKVSQKIHPDIYQFFPEGKGRLHSjDLPRCIKKQIYISPFPJvNYKIYIIHELFLCLDSADLVYWILT.
1LSVOHYW1NYICLFLAGPILRIKEPRApRLYSVPCKFIGICEF1


ADRNTIJWtSAFLKVFEEPPKHAVIILTTAKVpRLpKTIISRSLSIFIERCEKILCSKETSIU'..ILSCAFALWVaFL
PPRELAQISFX'SKICY7TFLLLAFSLNCLIPFU'IYFTNKRLSK


FSYLPRYAQCEIPVTEVSQIIKESSEl'DKQVLRDKVQRFNEVLLELYRDRY'fW9tGLIUSKS


.\L.NYPEMIKEILQLPLLPLOKVLLIVESACRSIaBJSSSAASVLEWVAIQLVSi7pYKEKEL


vsVSP~COCL.SF7 cPn_0293 318581 317532


Nn cobuac homolaa Dresenc in Genebank/ENBL
es of 11(7/?R


'.Pn_U27f 305853 305227
c:RRL:fFODLIKNAV1KIISFRKSPPNPVItLLIKFAKKGLFlJSSIAPLYEVLLEILF31PG


rdk-Thylsiriyluca Kinase
EEILEVLFSLDPNwWtBNLDPKKHSTtGIEIS~aETAETIESCSIGLISINLLLSGLCLRS


.'.aVFt'/IDOCEG:xK:SLAKALGDOLVAQDRIfVLLTREPrY:CLICERLRDLILEPPHLE.~>NDRrQAVKIIQO
Fti'QFS.'EEVQNFVEQRNILTPFWIHLFECDEVALLWQW:LRLOLIV


L~F.CCELFLFIG~RIIpHIQEVIIPALROCYIVICERFHD?fI'IIQCIAEGLGALIFIIADLCPNALYPEPDC:;CW
<,kaNSEItAKt711E00QEDFlIKTKFA(:Y.Ef:LKKLVLPAL~ITSIPQLL


.:Y.VVI:PTFFLPNFVLLLDIPADIGi.QRKHRQKVFDKFEKKPLYNNRIRFf..Ft.iLASADPRARRFf!Q!'w\E
IW~L\IMtKKNKQNPFIFLEALLE::EEF::t.'.'X:KYWIW~1I'IIIL.WOKLWIA


at'(LVLGAPE:U1::L IDK1INLNt'OIw.LCTI Y U:YF71V L ICps :r I ETF~'RRIWIJdPEAFQM
IQOr:f L4:FLFfKNLLD


:an:R;7A 7UN1FR 305952 t.'Pn t72i,4 IInU'i1 IIHSSI


IyrA fAIA ::yr..::.. ::utfunlC N., rr.t.:::a (HNNILW I'r'::unr
n :n ::.,nvGW k/F:YGL .u: ..1 ll/'1/rR


:::I'11*tTIKDEIIVFKNLEEFJIKE:u'YLRY:.M.iVII:PALPDIRIX:I.Kf::QRRVLYANKQLfLIMFIIf
A('\WfV114?ft1'NNf::::'(r:IL:LK::::LitfIT'I:.IWIUUA'fLdI::VLYFnt:II::


:I~:IV:AI!IIRXti\YIY:DT.'.:I;DYHF1K:E..'VIYPTLVPNAQF1WANRYPLVDCxXINFCSIDIiDV:."
PI'/f(:MLIfL::Vu'::1'la't:lYl~F"(:yJ':::IFKTFVF::IT.':I::VFI'::I1EIU.N(d.L:REF~
::


I'fnANR'ffEAkL'rIC:nNYLMEDLDKDTVDIVPNYDETKHEPVVFPSKFPNLLf:NGSSf:IAV::AfOELI.KNF
PAf71'fItRPItHI.fI::fIFLOFJJ.ftfiIRf:FEED~H'f::Y.lt.


'r:NA'ft7I1141NU:KLIEA'fLLLWNPQASVDEIWVNNIPDFt~IY:CIf~.0 Lpl
ICCCEGIRSAYTT


:Rr:KIK'/PAkWIVRFNEDK11R&:IIiTF?1PYNVNK::PLIEQIANLVNRKTIJICI3I>1IROEW rsniH'.
'..m.ul slm'.I


::lrYff:IfNLEIKKr:Fw::EIIINRLYKPTDVQV'fPY:AtMWLINtNLPRTN32HRMI:iAWINm rmtmrr
lu~nul.vl fa..,:~m in
ym;t.mk/Falf'1. .n:: .n Il/'I/:N


ItIIHYEVIPPf!TRYF.tI7KAETRAHVLECYLKAL:xf.DALVXTfPE~J.TJKEH/ULERIIESFCK4LFT7LFF'
P'fJUJK6fT1::111?LIYIvtY,::F::I::f~ITIV:LIAI::V1.1.1.la:VVF'ALVt:'IiVI.




CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
MPIGLL'JW~AASVCS741AIVStJICLYKOGKPVeAttPOependwnr Prac~ W n~xy Reaul.yrL'T\'
SNEEKIDPT%DLEIKOPESLKPV .:nIL%1.~.1:
tRNFIIQJLIDMFLLKK'.'IITVSLDNDL:.L'::ADKt~::FKI'C~tVF::::.GIriFSfYI:


tNt~QSLPKERKTI3lItAIfIPSIVWDntPYVIQSrFYHGNKVYSKPIAEpIpSLEIfEITVECYITI3KCKL4PLNR
.IIR~iDCFf'!EKP~iYJn4l::w\~flplW~li~Yy~firf~t
EEYA


TLtYDFPRALEE.~SaKS".rGSLLRCVI3EIKNLFLPRFL.iRKVKYSL:ACLRRLGS1ViFLELYAItt~IIIFIIE
W .....: , s
KC ECP3VAL


SSA.LILLLTKPEPIJ~tM'OOLIJWLNSLKTEKIUILTPIMDKLVISINFNFY41~ISLiE.


tEKIVAYDPM.LTDELIWiLFJICittIVOFLLS!'OSSO10REFRALFP~OELPSAKDUSrf
777866 ?37627


YVPAINSSEYNIfDPKDI-SVLIIl4,"LSERLU'CEKIPSPSSNttPTSSVASHYImFSLL!'fFF'CPIt~0295
'yl ~'~rmer Protein
ecDP-At


SNppSVILONPFLLtELWENPKGOTFGKCLLEKANPNSNWAAL.FKPNLNCNtISCIJ1NK.
ANSLEDDVtAI~JEvL.."'VOPKEVNFJISSFIEDLNAD~LCLTE:.IM:LEEKFAFEISE~J1


KELICITAEHW. PFKETTpJIIASCKILDLLLONLPDFy.. r7..-..,. . ...y., rrT.~.
. ..,.


... .~,~,~~ .~: t~ , . :". .~; 3a~ :
O mi:'W W':--


mgtE-Ng Tcansporcer tCBS Lk7lCiihl CPn_U
AFTCLSIDZHSH av
CT296 hypocMCrea: Drocein


SCRFSKGKINVGEpNIUtECKLOiAFSSGItJ~SRTSNt~DELSFKLFXKIPIIGIfICNDITLVCNRYIVTCGSROIG
Il..IVKLFLFIJfiADVEINCWEtR00AVIL5I.
DLSICIVIEYNPI~J1YAVSCLPSESRAILYIU~ILSCITAKVAFIINTDSASRWAIFRRLSD


SEVCALIE(xiPPDPJ1VWVLDDZPDRAYRRILELIDSIOtALICZR~.010K'rRNI'kRLNTNETOLOGCVSFJ1RV
CVSNNOCVKDCVOKFLDIUWKIDILVNNACITRpPLLIaIIdC~QSV
V
V
"
I
'


FFAf(XETrVKDV$11CIRSNPGIDLTRLVFVLDFKCELOIW'IDRSLIINPPO(SL10QIHASI
AKIwSA
.7pl~lYAAARAGIIAFTRStJ
SSVIRlOIIKARSCSIIN
ISTeR.TSLYYK
'
'
'


NpIOOLVLPDIATRECWDLYERYKIJ1J1LPWDCaIFLIGAM'Y~Z~ZJ1DE1'IARSVINONLKAt3~ILKSIPIirR
AGTPm
V71RVALfLiIfOL
tt7tfl
ItEIfAAPNIRVNCLAPCFIE


tiACITfDUCYQTClIWOR!'LhRAPWLLYILFA~ChZSASYNAYFOKISPAtl.ALIIFFIFLSSYIft110TL.WD~
:LTY


INGItBCItN4'VpCSTILVRBNATCCLSFGRPRETIFK~1SIGLLICVSII&IIICGLWYIJIr
1D CPIL0297 775721 371771


FLCiZiIISGOGIOLGVTIIAI'CVIG71SL?ATl'LCYLSPFFFAKI.L1IDPALA90PIYI71t1fabD-
Halortyl Acyl Grrier Transeyclase


IMSltIIFFLIACCINFLPFN
SHSIt~Nt?OUtRYAELFP00GS0YVCHUpDLYNEYPEVRELFDPANN<<RIRLCFSLTSZI~E


0287 321210 722089
GPEaiJ~IETVHSOLAIYLNSNAWKVL50RSSI0PSLVSGLSLOLYTIIi.VIIS~IISVLDG
CPn


_
LELVR>~OLl9~tEACNpSPGAHAALLGLPSEVIEENITSLGOCIWIAHYNAPItOLWAGI
No iooust homolog Dtesenc in Gatebank/E!!BL
as of 11/7/98


RACIIIRSPLPFISSKFAtidQCLODEFSCPEDWDFLFSEIELLA90DEPSt~YiJILSRSAEkYDOAIELFRMACKRA
VRt.IIVSGATMTPLNQY1100GLAPDIYJIt.QIIU7SSLPWSI1V


LLMIlI'tIMiPKVIfKRVIFYGVSYCLIWESMSIFIDVLTYIDFLFEKLGISASDRi.SLCSARVGKSLVNTE~IECL
APC~SPTtWYQSCYHIESEVDEFLELCPGKVLAGINRSIUISItP


TCINFILYSpIGD9ffLSEWDNFRLIE0LL1U01P0L1UaRi~fOIFRZGAIWEEVSLVASITSIGTF710IEKFLSEV



ASVYpAVCRSFIELYtIIWLEISDLAL'GIOtCtJIt.ALDLSPttIAItIHADYAIOGLVYIGTRQG
L CPIL0298 JJ671Z 775717


KSLLIERGIIENFSIfAZFiSFSRDGTrTL11Y0NYRYliYALA'9VKLFDLTY10CEIIFOQANIIfabN-
OxoacYl Carrier Protein Synchase
IIZ


YQTVpAFPNLSCttMVWGELLIRSGWWSNMCYIEVOLEKLASI4KKTNDPIA4SCLI'ATYTSFFLYIMtfSVNIO'pI
KMIWATOSYLPEKVLSN71DLF1Q4VDTSDEWIVTRZCZK>CRRIA
f'KDSRHRLISANRTFPGtISJILVHAI~IVQLCSJILYIttEDSHPASAI
tU'M


.
GPQEYTSZJ1GAIAAIEINiAGLSEDOIDCIIFSTAAPDYIFPSSGAirIQIINLGItOVPT
GIAILCLYL '
SCFQSCLESiDLDA~~WU.FDAYFSwCIIUUtSNtLd.RIfAVDVASRLCSiJtPPJIILFWSD'
'


RGLALKCLAFJ1TIDGAYKEIFLSLSLLJnORANDLSGRLEILELWGOSHYLLAQ-0OSLPGDOQARCV
WIIFI
C11G!
FDOOAAC11CYLYCLSVARAY~S~fONLLIAADIUSSFVDK1
IUESRPUSLEINRLSLGADUIdUELiSLPJYa'sSRCPA~SKLTipSCIUI1IAMEGRtYFRHA


HYDEAYTLLTKVDLTLSSSRVKLILAAVLLG1(GRLL.pIri'DFAEFJWEZLCFLY6YYLEDEVRRI~TAARHSIALA
GIOEEDIDWFVPtIQANERIIOALaKRFEIDESRV!'KSVNKY17N!'A


TSLGCPEAYYTIGKFYAVIImNNIUWG ASSVGZALOFl.VHTESIHI
DDYLLLVAFt~IGULSWG11WL1(QV
<N


HVIRSAQYGVRITEAiIWWDPYLJ1NLREIHAFRL1NFN010GRL1iiGNKTm90


0288 325785 724571 CPn_0299 716726 777115
CPn


_ recR-Recaabination Psoteih
CT288 hypoducieal Drotsin
tt100<Z.VYYSESLY$MM.UPRPECIUiICIHITNTRYPDYLSiILIFFLR>Q.PGIGFKTAAC.A
ISITIREFLFFCFECRAKFYNVIMSCFNLTSTHFSLRPISPKASFPIODaiOSYlRSALRK


HRSOTLSVSYCKVNKYDANLFVRLTVIALAVVGVLILFSItd.ASIQGTLVZTSWPLVTAAFELISWDSEOLKILLi'I
APHIIVASEpSttCPLCFTLKESKEADCHFCItEptlfipSLCIVASP


ILIPIZLLTOGMfILfRhGEXVDVISGVCZPPFSRAGWVPISSSIftLDCFDEKIiIfSACSYKDVFFLQISKVFKGRY
HYIGSLLSPI1CKIIIENERLSILKSRIETLCPKEIIL1ID11TLE


LDISTL.SAOUSCZJU1VYQCPPLLFR11FPCFGIPCANPFVALLPNIYNLZRFLWPPYIIFGDATALIIJtOEL~!'S
VNISRL1IGLPIGLSFDYVDS4TLARAf9GRH8Y


RNIYEHFFCIUD:P>~DRFIYItDVARtcIGRSLJIAFLtIAPFYJI6aC'IIQJIFYSLLDPLiICRV


tiIGSVERDtitIOl~iVZLARS1ISLAtIF~WSLFRFEOGOGR10GIGQHAFYLHLCCpPOSVfLFD


KGEIVSGAttPSIOLPERRCLDTSCRYPHZSVIPDS~iD&AIUIFIV


CP(>_0289 725797 726996
C-1'2A9 Hypothetical Drot:ein
NFtdltl'BfKpRSHYKKNNLLLLLSILVGLGLGSVOSPNIVYSAECIANl'FLKFI~Li.SIPL
VFCA1GSTITSIOFtFNflNTLGIUtILYYTLLTTVI11J1SIGLLLFFLLRPONI'1'~ALAT'1'
TKCNPLCYLtNLSDTLPtNIFRPFipGNVZSAACL.IwVt.LCSASLFI.Q~~FtfIS
rFSIFUa.~ocLxLLFIAfa,crsYItFxFS.IIDOSNtTIaAaxtscvloun.AOCFIYLP
ILLKINKVSPLKVJ11UNSPALVTAFFSKSSA11TLPLTMELAtODLKIN%NLSRFSFPLCS
VINtIIGCMFILITVLFVATSNI?IIISPIJISI4WIFIATLIIAItRIAGVPlIGCYFLTLSLL
T5181VPLSILGLILPFYTVIIMZlTSLHVWSDCCWSLAN
029D 327027 328523
trtTAEUVPVSERFFt.CCETIVRCIfKSFIICPKYSATFPQOGLSSLLISEEIpYILIBpPl1
CPn


_
ISAFYiLDSGFVCLOEYItISLKDLRSSAGFCLRFDVL~OJMfPVHtGFGWPIItPT~ILI~K
Na-dapeldenc Transporter


RSALTHNKKHASFSSRLCFIFSNIGIAVGAGtiIWRFPRVM~JOGCAFLILWICFLFLWSIDNSORltFALOQip


IPLIIIEISIGKLTKKAPIGJVLIXTAGiUCFAWAGCFZTLVTTCILAYYSTIVCtIICLSYTY


YAVSGKIHIL~DFAXLWTSHYOSSIPLWAHLTSLGLAYLVIRKGIVtIGIEK(ZIKILIPAFCPlL0J01 710167
310762


FLCTIaLLLRAYTLPCAVOGIKOLFSCLnCSCISNYKVWIEJILTp~111WDTf'JV~GLLLYYAfOapIhLike
Oucsr Haebrane Proteihl


GFASKK1'L1VSNCALTAIGNNLVSLINGIZIFSTCASLDItGI'rpLODDAGI1SSIGITPIIKtX.SKEIF11VFRI
IGFWYPFSIPIU.VQVIt9(%LLFS2FLLVLGS'fSAAHANI.CYVt~It~lC


YLPELFTRLPtxIYLTTLFSSIFFL11FSMJ1ALSSNISHLFLLSpTLAEFGIKPYISEfLALEESDLCKKETEELEAH
KCOFV1UIAEEZZEELTSIYNKt.pDEDYMESLSDSAStG.RIUCF


TI:AFVLGIPSALSLTFFSNpDIVwCVJILIVNGL.IFIYJU1LVYCFPKLKIfEVINAAPGDLEDLSCEYHJ1YOSOY
YOSIt~SNVIfRIQKLIQEVKIAAESVRSK8KLF31IWEGVGAIAP


AWIGFDYIIXYLLPIEGILLL.CiWYFYDCLFPFJ~>GQwWtIPISLYSLCSLVLQWSLCLIILCl'DKTTEZIAII1J
ESFItItON


wxFNKOLYLAFSRYNIiEIL
CPn_0J02 710766 311866


CPn lpxD-UDP Clueosamsne N-Aeylcransterase
0291 328658 329191


_
SKFI~FSNSFJtPVYTLKOLAELLQ~00NIETPISGVEDIS0A0PHNI11FGDNEKYSSF
ine8-Inclusion Hembtane Protein
B


EKHMSAPIPTPQELSDOITCLNVipYCQYSELARENKCDIECLKTLTAALTADAGIOPSADLKNTKAGAIILSRSQAII
QHAHLKIOJFI.ITNFSPSLTFOKCZELFIEPVT90FPUIHPTAV


EIYSLQ'fJIAALILSASEKPCSCPSGSTECSVTVQSPC%FKKVIJ1WLT:IALIAIAVLIAIHPl7IRIElUVV'!'I
EPYWISpNJItIIGSOTYIGAGSVIGAHSVIGANCLINPKWIRERVL


CIIAACGGFPLLLSaLNLYTICACVSLPII11S'1'SVALICLLTFV1WSLIKPVITVRTfRtGNItWVpPUAYIGSCC
FGYITNAlCNNKPLKHLGYVIVCDDVEIGAM'1'IDRCRFI4PlV


LN>~1'KIDNOtfOVAHNVEICKHSIZVAOAGIACSTKICEHVI
IC1C01'UITCHISIADNVI


CPn_0292 329201 729836
MIA4TCVTKSITSPCIYCWPARPYpE,TIIRLIAKIRNLPKTEERLSKLtIOQVItDLSTPSL


ind-Inclusion Membrane Protean AEIPSEI
C


VKNfl07SDFM1'SPIPPQSSCDASFtJIEOPQQLPSTSESQLVTOLLTMMKHTGALSTVLQ


fJORDRLPTASIILOVCCAP't'OCACJ1PFOPGPADDHHNPIPPPWPApIETEITTIRSELOCPn_0307 )12982
311921


tMRSTLCQSTKGAR10VLWTAILMTISLLAIIIIILAVLGFIGVL.PQVALt.NOCETNLICT303
tWpochaticel Drotein


wANVSCSIICFIALIC'tt.CLILTNIUrI'PLPASREOKCLHHNDVSRKINRtITOFYVDSIDCVIKNFDHKPSEDIt

sRDtiEELEEKLLTITKRIY


pSApEFQNRItTDSKNYYLKKTOWLPFKNEELEOTKELFANLTStIDIfKIAOLFFYSPOCSS


CPn
DWVEFTEVICNLNOSICLGGVLIxCCLFE00CEHVVTVNKKLDLPLLLICTtVVNSLRYYL
0297 729910 772723


_
TYRNISLLNCO~HSELOKELCDVLKQHCVAFTLIFKEIVDIDLLNYVKLIOGLKRSGNIO
CT271 hypochecicel protein


VWSNpNVLRLLFNLHHGEEKRAFLFFLIGLVWCICCYCI'LSLAECLFIEKLCSAELPKIYARIYONDVP1'LPSVSSS
PIALRYSLAM'IRCLAt.NVOFSSLKFISPSIL~fENTAKALN


LCGSLILCVLSSLILYNLFKKHISATJ1LF'LIPVSLSILCNFYLtLSSIFAIOPPRSPLFF:,f',CECFIFSNLDEF
Nt~IIKIVtIpLLR'IICKLSPEIWKNIMKILNIKRRVRSLYI


YRiVIWSLTILSYTSFWGf'VDpFPNLODGKRHFCIFNAIIFLCDnIICSv'IIASLVN7IGI


OCILILFTAALVLTFPIVFriSKSLKSISDDHDLFIVK:HPPPLS%ALKLCFYDKYTfYL~Pn 0)04 1.11091
)AI15%1


G:F'tFLtpLLAIATEFNYLKIFEIOFASKEEFELVAHICKCSLWISIGtQICFJ1LFAYSRIpdtlA/t~pA-
PYruvar r Wthydrory!n.m.,
AlDh.t


VKRLGYtJNIILFAFLwFLSLFLFWTFK'l'l'LSIAVIJItNVREGV'I'YALDDNNLOLLIYCVPDQKPLPKRLF'Y
%KVMD.~.SAPYNIA::yaEK.~TVFRtLDLYCPA:x'.IKFLKONVGIREFEA


NKIRIIrJIRIWESFIEf'IC:NLVWGLICFL:iSpQWFCLtISLtATILVVLVR~fYAKAILRCEEAYLECLVCI7FY
1L:YAWEAVATMIAN'I'~:LDPWVF.~.;YRt~lfAtltLWIPLDCIM


KN4:ApALpLTRSNC~C~WIK:.K1VKQKRQVELFLLAfILKHPSERHt?TFAFQHLWWSRSVRLII:KE't''l'.At
s:RtY:::NHtkic:PtIFHN:FtaVtIYJIPLAA(:AAFT(KY(~KNRV3LCFIC


LP::LLAIN9JKL.iLFN%LKTIF?NIf3SLWAKDFLTLELLKRWTSIFPHFAIIuAIHLYFAEIY7AVA:~:vF'IIt
TWFV::L11QL1IatLtIFlJNdd::W?.~,LNRAVAKQIyIAE:.'~I:.~...1'DIRAV


IIDLIJIITIIIAEOLYDT~rt:DRLLMILTVRRUEAYf:PYRDLADKRLKELLNSC/~PEDIVNC'fVN:F'fO.F'N
:'Id.:FHhAYRYM/UfF-':IVt.Vtx.'Ia:::NFRC:It::l::Dl'tll.YN::%RYlIUCLfKIf


LTLI.Y.LEKNPONFPiLLDFLNTKNEDtLIV'I't'.KAUIT:SVRANIIKt'YCf'ELLKRLROCSHNUI'tVI.AY
1MLIRLFIfI.1'EF.t:FqFIIRQIY:KTAV1.FJ1F::NAKL::::D1'::YITf.EFI:VYA


f>F.A:X)'f l.I.KT I:: I AL01::F'VKOLf~'I':NI.KNT::R%YAF:AM
A:t:LOKEVaFAFLOVLTDE


:rIiNRChILMML~KI!lNWLLKKIIAYKiVKC%A:;KALFY~YIk:IIYIQKK'ftTINL.:LW~:Pnyn'.
LL11A2 iA~.I :'I


tlflJ7:.T!'YfAEVNFIII::LItat.GSMEHSC1/LIRAL'1";:KNDKIKk~ALFSLEKtF:DSHLF3LtIhli/
rlNIt.lYmv.W OuhylnrpHay::.,
pr..


L!1F~/tKJt~iII:Y:.EKYYFKC(.1/IPLTLKELWlWI~7:P::::W%LTAQI?WCEEU'YCDFDFNKt"'...Mh
YIIK'PIk:INE\LRF\ILRE7A::NGI'NV"IfI:EF:Jt:D'IIY:A1'KVT'Yt:LLIIKYXiPKRV


U::VFRI'fWQKHEDYR'PEE::L'fLl:a'L.1ItUAPI:a:MF'::i:k:tt:AAl.:a:l.l~t'I
IF:YM::YrtIY::F1A11Nl t::IIAAKNIItfflYt:KP::VPI


V F'MTffI:MA1?Y:a:(%I::Ik'VF_':L
lMt t tc A. t 1 1 AI':all'YMYr:I.LK::A
1 HNNNINLPl.EN


ilyn ~l'L'~4 1 f 11177 s 1702 .
F:I.t.YHLYt:EVI'1'EI?YLVPIy:KAIIftYfjFJ:NIU:fI
1'I"/::IMV::I'PKFUIy'::LAKKRWf:LaIEI


86


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
"'VRffC:iRCIVIEEC:NYFn oEiIALITEINF05LDAPPLJIVCK'CPI'CKRVIL:KIVKLLP1T'N
(EEGiDC:LIHI:IGISI~tVIDdIVDP.~.IYJNKC:OIVEAIV
LDLRTIKf'LUI~fIL
i


. IKNLTNYC:AFVELLPGI6G
.n L1N
V HVNAE
(IEEKYPICL
tlpwOF
ICKDOCKISLGLKOTER
t-:


MP _
~KETPIrPY:IKtLEOATLPNVNRILCfIEK _
_
_
_
_
KL


115136 316131
P'II ?~C181r~11AV~L~I
VI".1GV1ITK TATQATL.~GiLIIVSALSL1K


CPn_03ne KKVSLSVKEYf.IDNAYDODSKTE4DFK0~CPKERKKKCK
ptltlC-DiMdcol ~poare>,de Aeetyltcansferase


.~.KFVI9LLKNPKLSPCNEV ,~T.VKAOIKK.iN00VSlGOVIVEISTDKAiLEIfI'ANEDf~VIR
A ~Pn 0316 359794 3e0121


EILRHECEKtVt4RPIAVL:.'TEANEPFNLEELLPKTEPSN<.F31SPKCSsLLVSPATTPOin A
P


;ATFTAVTFKPEPPL;::PLVFKlIIICT1'!I(ILaPLAROLAKEKNiLIVSSIOCSCPOatIVKroea
A nusA-N Ueilixaevon
r
~
_...
...
.r-
~
r
.
M

'
'
~
~
'


, "
...,.;..,;-.r,r-i . I..- ;.:\.\Ff!.Iv'.Y.\
...,It,,-;~,m.v../'.' Y,
.Y
a ,
: : :
HF .y.w:
I21 _ ,
~ ~
~ v .
~ :\LF:fAA!
YT.
?.
.
.L
:
,v\
.


..
. ". N..F:YF..., ...1~"f;F;.
.... ;~Y.\REYI;i-W':l:r:YMDVPF'.':a:Nf'f7R:\>:i
. !:C'4":f':
. w
_:
r.,-
r.
.,.:... ..~
.
rt,
a.;
w
ra:..L,..,t
:r.r:::r~.f.
;~:-.. '
v:rl
~
r
~ri
r


L
AAfWttvNYwR141EitLVI'CLtsHtIxVNt:1.w.'YYKHF'NLwiNLILDLGK'lFrli:.PTRFiF
'
,
,
..
..~:.
.
.
.
.I.
:z:,w
:
7lI:iAEIKSW.KaP7N:iLCDTEYK(M:F~JSNLCMTCIT
.rAIPDGIITPtIRCAURKNtl


. GAEVIuRSNAEFVKOLFIaEVPELEECSVLIVAIA
EFTAIVNPPCAAILAVCSVTF.OIILVLOCLITICSICNLTL5VD11RVIDCYPAANFNKItLQKTEKHKIGDKIYALL
YE1NESENL.
'


RffAGIIRTKiJWRSSD%ITDPVCAFVC~CSRVKNI
IRELNDEKIDIVNIfSPVSI
LLL~IL


KILLAPAVLIlX
LYPILIOKIAILLDDKVIAItNN001DYATYICIO)UiINARLISHILDYLLCVpRNitYtIIL


CPtL0307 31199A 316515
LEIOAi.CLAEFDSPNLOCPLEi4Df3IS1(LVICNGEHACY~i'IARVLLASJINphASVICISL


41QP-Glycogen PhosprorYlase ELAYKILLQVSKYCESXVDLICPLIED


NGCIVBflFBSFDKNKVSVDSH~AILDRLYLSWQSPLSA6PRDIFTAVIUCIVI~tLIUIG
360015 36750


wLKTQNGYytO'Ipv><RVYYLSML.,PttGRSLKSNLti~GILt#.yRKJILlf1'LNYDFONLVA(ECPr1_0317


St7AGLC~CiGRLAACYLDSNATU1VPAYCYCIRYDYGIFD~tIHrCYp~IPI>EWGAYGinfl-Initiation
Faecor-2
'
'


NPWLICRGEYLYPVRFYCRVINY?DSRCKOVADLVD'1'OtViJIIIAYDIPIPCYGNQ1YNSL1
lOJLKLKIKNACLTK7iAGLDKLKQKLApAGiSFaKSSSLIIPS
SLLIASLSKSANMCIfVIQ.


RLW(MpSPRCFEF5YFN1K.TIYICAILDL11LILNISRVLYPNDSITLGOLLRLKOtYFiNSAKDfSVKVAi.i~ATS
TPTASAE0A5PLSTSRRIRAK)IRSSFSSSEEESSAIITPVDfSLPAP


ATTUDIIRRYTKTHICLONLADKVWOLNO'1'NPALGLAFJIItILVDRL&LPWDKAWFJIrI'VSIJ1DPEPLLEVVD
EVCDLSPEVIIPVAtVLpEQPVLPETPPCEKELLP1IP11ItPALIAIVV


VIFNYIFRITILPEALERWP4DLFSKLLPRNLLIIYLINSRirG.LKVCSAYPKIFD0101RSLSNIItSKfCPaGIOf
INIR.LAKTPK11PAKC0NVAGSK$lxPVAS~CPGKPC'1'SLUGWIIitL


.IVEOGYipKRINIfANLAWGSAKVIGVSSFHSCLIKLriT.FKtfYEFIPGffIINTNGVZ'PRKQFNPANItSPASG
PKRDI1GKKNLTDFRDRSN7CSDES


RWIALCIIPRLSKLIXETICDRYII.SLIRSFA~SCFRL11ROLGIrKLttIOC~LTSRIRVYILPKKtiYDGSIORPI
HIKISLPITVItDLAALl9CLKASEVIOKLFIIK?fi'1WNDILO


YNEYCEIVDPNSLTDCHIKRINEYKROLM1ILRVIYVYNOLKLNPNOtHVp'ZVIFBGKASETAVpFICLSFCCTIDID
YSEpDIfLCLSNDTVRDEIpSTDPSKLVIRSPIVAfIOINdI


APCY4NNLLIIKLINSVADVVNODSRVNDKLKVLPLPNYRVSNAEHIIPGTDLSLCISTACK1TLIDSLRKSNVAATE7
lGAI'i'QIDICAFCCSTPVGDITILDTPCIIFJ1F811lIMAOAM


GrtEASGIGt~IIKFAIlrGALTIC:'I~DANILNALNIGKPi~tf'IFCLLmOIVOLRREIfCPOTDIWLWIIGDnGI
KmlLFaIENAKAADIAIWAINKCDKpNFNSETIYRQLItLIfi.PC
'


ICDKNPKIROVLDLLEQCFFNSNDKDLFKpIVNRLLNFIiDPFFVIJ1DLESYIlUIlILNVNK.LSFLL~Q.ALW1EV
LLLKADPSARARaLVILSfiJO~LOPVA
AHCCS'CVTVNTSJIIITC1L


LPKEPDSWfKISIYNfAfiKsFFSSDRAIQDYARDIWNVPTKSCSGmIITVLIONGSLKLCGLVFIiDGYGKVKTNHNE
IIrIaJICLAOPSIPVLI1GGSDIPAI~DPFF


W104BKTARDI IIJUtSAGOQRFAiAQIGDiPNF0.RM.ONKkTLKLLIKADVOCSI6AWtS


0308 349213 319596
ISKIA30tYDVEILTNSVCEISESDIRLAAASKJIVLIGFH1ICI~lIALPLIItiiaVAV6L
CPn


_
FTVIYHAIDAILLIMfSLLDPIAEBItDDGSJ1EIKLIPRSSQVCSiYCCIV'1'DIfANNK
No robust holeolog Present in Genebenk/H~I.
a at 11/7/98


FFlbHffe'(ATVAQTPQTTOPOPSVSHKATHRYCSWVFPICPILVSrr_r.-
.rVRVLPNKLILN!(GTLSSLKRVKEDVKEVR10GLLLS7ILLEGYppACIGWI4CY8VIYNPQ
rer.~,LVIA


sGVrrLSICxGTVLAIQIVLaGTaLVLAFNHIROFKOARTALLNSIOOtuAPAAATVOKCKKL


LEr7RrssK
CPtL0318 36270 363176


0309 350977 319595 rbfA-Ribosome ,iA4iaQ Faecor A
CPn


_
VIISYNVIRa.SIItOOrIYId.IfYQFI'EiRAIKRVNJILLpEAI111NILImVKIIpKISNNITRt
CT309 hypocMtical parocein


FNRAWEEFLLLpEKEIGTNTYOKWLRSLKVi.CFDACNLYI,FaQI~FQITIiFELIIIRIDIVKRV$L$ImLNSARVY
VSV11PNENTICEEALBaLINSAGFZJWRASKNWLKYIPGJIFYLDD


SGLVtiTl1ICPIAVNVTSVDKAAPFYAEIO(xlppLKTAYITIIfYCiSVNPQlI'FSNFLVTPLIJIFSPOOYI)~L
IJIOIO


DLPFRVLQ6F17CSPDLt~K~YrFNPIYLtGF~BGIITI~SAISVLRP80CKILY1BSDI.


FTEIG.VSAIRSfiElJC7~RSFYRNiOALFILDIEV!'SCKSATCtArIIRFNSIJIS>rlr~.IVCP1L0319
363133 363179


VSSSYAPVDLVAVtDRLISRFLNCVAIPINPLVp)Z~.RSFUtNQVCRLSIRIC~l'ALOFLcxul-tRNA
PseOdouridine Synthase


IYAfS
LLYF.DWRTIdJLDPLEAtIGtVALTPLKItRTIllr~fINI'IKtxllflaLAV~.KIxILLVDKPpCRTSFSLIRAL
TKLIGVIOtIalilOnDP


NVAQY7fCVS0E5ILCR5pSRLYVLPRCVANYFCRQKLBLSYVIIICDVFBRDIISTVISSIRFATBVILVItt.IGRK
FTRLSDILLFEU1C


LIDpKIE<ISNDIHMAIODISKNLNSUDLSLLFFPSLLItIILSAACYFOCLIQQLPPNFSAKKIrQCKIILYEYARKG
LSILRRNBTVQV11IQITAYLYPLLN


PV11SCSKCrYIRSIAHPZC'1'IGGCCAYLEpLRItLRSGRFSIDLCIL7CNLLCIIPLII'DIfPY


CPt~0310 353173 351019 uL


60IN-60kDa Inner namosane Prxein
K 03Z0 367121 161713
AKISL CPII
TLA


Q _
O rib!'-FAD Synthase
YFOLLSLIFRVY014~IKIlTLi.FVSLIGIAF1ICC0IFFGYDIEFRSCKNLAC
A AVAVCDILLFLLtMGEAAQSVIfSSGLSNSFVONKOC


FDNINLI1LYRCOGSSFNP1'N'fCKVFLpTNIICCLPVLtNEFRHN1C6PLVFLCLYAGCRISN7TPISIFLPTY~IP
NLIAYSLTSSPSVDSV1VCFFDCCHLt'JiSNGLSILTSYBCSiCIIIT


KDSTIFGT11LVP9iRSGSDYIPIGLYDSRLE1C.VSLDLPITMVIT~00DSAKSSDTANHFDENPOTVLSZ11<17~I
NtIQERLOLIATFPII7Wi.CYLTfT%MANpSAE6lLTLLIOINL


YVLlNDYNpINSL85CSILCINLPP11S'1'NMfSIVNEIGFDROLILSflISPEaIit'FGLSSKKCIOtLIIGYDSC
ICK00pSM'LALDTIGKPLGIIYILIPPYAlIDNIW88LAIRp!(iAG


LPL7COQAIDISIGCYYPLLRRGLLSDSKKLLPLEYIIALTNV~RELATRIALRYRVLSYTPNLLCIWAFLCIIPYAIS
CKIT~SGIQGSLGFATINLPREFSLIPL~VYAC6IAYCITI'Cp


HSIpLESLDRSVCKVY1C.PLNPLLICpYVFEI'AITLTKE'1'EDVNVtSGVPLVLINSNA811PGVlIILCTAP?FG
RESLYAE11LIIFSPAENLYCKEtISIIPRKFLREEKKFQSKCILIMIIIL


TIKYRVI>nO~GGSLOKVKt.PIfVItEPLAIRROVYPOWILNSNGYFGIILTPLSLIASCYCSDILDApDNPAKGSFN
Y~TA


LYISGSTAPTRLSAISPKNOLYPVSKYPCYESLLPLPKI9A61'NRFLVYACPLAFPTLKVL


I7KTITCEKCP~IPLYLDSISFPGVFAFITAPFAJ1LLFIINKIFIQ.VI~1CISIILLTVFLCPr1..03Z1
36f900 361767


KLLLYPLNAWSIRSIItPNpILSPYIQpICCKYIOrEPKRAONEIMGLYKTNKVNPITOCLPYahr'wGTP
Binding Protein


LLIQLPFLIAIffttt~fSSFLLAGILRFIP'G<'tIDNLTAPI1VLFSWpI'SINFICNLFNLLPILYSK1QNIIFIF
RCLNSNTLOGIVGLPNVCKSCLFNAL'SCAQVASCNYPFCTIDPINOIVP


IGIVlffL001NTSLNKKGPVTOQpXCQOVrCFiIIIJIILtTANFYNFPSGLNIYNLSSNILGVIL7ERLEALJ1KIS
NSQKIIYADbCFVDIAGLVKCASL>CACIGNRFLSNIACTIIAIANVVR


W'OG4IITN1(ILDSKHLKNEWIIdnCKHR
CFIL7PDVTHVSGKVNPVF~IEVINLELIFSDFSSAKNIHSKLFJtLAKGKAL~C~LLPIlD


TIIANLEKGLPLATLELTPLOIVALKPYPFLTNKPNFYIANVDLSSLPOI~KIYVMVRL


CPn_0311 351153 353575
vAAIt~ISKWpICVRIELLIVSLPIELRLEFLHStGLEKSGLHRLVIWIYDTt~OLISYiT


CT711 hypothetical protein
TGPCLSRAWNVACSSAWEAAGEIHTDIQKCFIRAEVITFFit'IIECpGRAAAREtaKLHI


OFlMIHAVIYWDRSKiVWSFEPWSLNLTWYGVFFTVCIFLJICISMYL71LSYYCLtIDHLSE:CRDYTVCDGLIfIC.
FLNN


FSKSpLRVALFtiFFIYSZLFIVPG7UtLAYVIFYGWSPYi.QNPLLTIOIWfiOGLSSt~OVL


GFLWAAIFSWIYKKKISKLTFLFLT~CGSVFGIAAFFIRLCNFIiNOEIVCTP'fSLPNGCPn_0312 366231
367328


wFSDPMpGVQGVPVIiPVOLYECISYtNVSCILYFLSYKRYIRLCKCYV'ISIACISVAFI' YscU-YOpS
Translocacion Protein
U


RFFAEYVIfSHQGIM.AEDCLLTIGQILSIPLFLlG1111LL.IICSLKARRHRSHIs'NI~i4SMGEICfEIUITPKR
LRDARJ<IOCOVAKSODFPSAVTFNSMF'YAFSLSTFPPKIIIGC


FLVSM1.SQAPTRHDPVTTLFYWO~CtJ4<.ILTASLpLIGAVAWCVIVCFLIVCPTFfTN


CPr>_0312 351518 351976
FKPDIKKFNPIF?llltpKFKIKTLIELIKSILKIFGAALILYITIJfiINSLIILTa01IS1I


CT101 hypothetical protein
ITACIPKEIFYKAVTSICIFFLIVAILDLVYCRHNFAKEL1WEKPEVKQEFKDIIOtIPLI


CTNARNIKYFLIIFPGILWISACNOILLLKATAIALDPLSSFFTYCLLSMVS1rGLILSLIGIRKCRRRQIJ1G~EIAY
EI1SSSOVKNASTWSNPKDIAVAIGYNPEKYKAPNIIANDIMJIAKR


'CLLSKTIRKGL:LSSEFFSpKITWIAYIKOTFISRRFLIININIAFSLVLRRYLSNPOALILDEAEKYGIPIMRNVPL
AHOLLDECKELKFIPESTYFJ1IGEILLYITSWIICNPNNKM'


FVIRATVG1(ALIKTAIAYFSKLQNAIXENpEGtiNOPDHI.


CPn_0113 354957 355355 CPn_0323 36731? 369160


acpS-ACyI-carrier Protein SynthaselcrD- Lov Calcium Response D


wKILKEISANSNEIIHIGTDIIEISRIREAIATt~NRLWRIFTF~1ECKYCLEKTDPIPS'SFIMNKLWFVSRTtGf~T
TAWMINKSSDLIWt.wl9~CtMIIIIIPLPPPLVDIJ1ITINL


FACRFACKEAVAKALC'IGICSWAWKDIEVFINStIGPEVLLPSHVYAKICISKVILSISHSISVPLWVALYIPSALOL
SVFPSLLLITTNFRLCINISSSROILLKAYACNVIpAPCDP


CKEYATATAIALA WOGtiYWCFI IFLI ITI
IQFIWTKCAERVAEVAARFRLDIWpGKQNAIDADtJIA~IID


ATCARDKMGLCKESELYCANDPAIIKFTKCDVIACIVISLLNI4CCLTIGV711iKlmIJIO


~:Pn_O31A 156185 355353
AAHVYTLL.SICOGLVSGtPSLLIALTACiIV'1'l'RVSSDKtdINLCKEISTOLVKLPMLIi.It


erxl-Thiors~xin Reduetase
CAATUiVCPFKGFPLWSPSitJILIFVALCILLLTItKSJIAGKK00GSCJ~tiTNCAAGODM


MINSRLIIIC.SfiP~Y1'MIYASRALLHpLLFECFF:l~I'WOLMI'l1'VENF'PGFPECI'IIfC'.DNPDDYSLT
LPVILEICKDL.iKLI011KTK'.,~CSFVDONIPKNROALYCDICIRYICI


IGPKt)ltifMKEQAVRFC'I'KTLApOtiSVDFSVRPFILKSKEETYSCDACIIATGASJIKttLHVR'I'~>PSLEC
YDYMLLWE1IPYVRCKIPPHHVLTNEVEON4SRYNLPPI'l'YKNAACLPS


BI('CN:rK)EFHOKCVTACAVCI1CA.;pIFKNKDLYVItY7CD.iJILEEALYLTRYC:.~.M/YWIi\WV.~.EDA
KAILEKAAIKYWI'FLE'IIILNL.:YFFIIK.~.SC~EFI!',,IpEVRSNIEFNLRSFPOL


RRDKLRA:IKAMEAPApFAdEKITFLWNGEIVIIISC:OIVR.:VDIKNI/~YfCEITTREAACVFVKE1II'RLIFLQ
KLTEIFKRLVDE9ISIKDt.RTILE::I-.EWJWTEKL1NLLTC(VR3SLKL


r'AiCgIKI?fl'DPIdxDLTLDE;CYLVTEKC;TSIIT~'VFr:VPAACOV~WKYYR0111'1':.ACSCCYI::FI!
F.','QI:Q:~1I3V1'LLDI'EIEENIRC;AIKrT::N::S'llJlfpPD!:VFILILKrhRNI'ITP1'


f MLOARRFI.I: PAtTiqFPVLLTA IDVRRWRKLtETEFFDIAV
t:: CpEIL.PEIR IOPLGRiQIF


'Iwy f I'. t5t:977 15H71r: n't'ti rl Sa4 t~tlbfl f'llh:ff!


r::l :a ItiLrANtt.li Prutuin ~"r1:11 hypocMrriurl protein


MI*VAF.Y15J~:::KKIIiXJIEC.'LTEDVAEFKDLL1'TNIPIT.~.::EEE:IWEIJFC:ALLII4TWYWNIRRI
eIAA.~r(XTl'GCIL'A.'l'Or:/tILMVhAIJ~AKAOME'NA.~.GEI'(EFNNIOp':.O~.T


nIHKUF'VWIIw:LK::pt,;VIFM::EPtDS.~.EI:LVLuAEJIWLOpAEDEF.:KYIL.:REKATRNI'AMTRTKK
KEEKF~~fLE::RKY.~FJ1C:KAF:YY:a_:fEEYt'f11'DLADKYAa(Ii!;EIv~


,7ItS7Ylh:! I
LAIN'F:h72:IYKt.'(IITAKVKCI:LIVDG7FIRAFLPt::.UIf7FIKKItta.YlaAIt:UI)/I:PEDILAL1
/VEYIKGIAlrr::
IKNLLDYVGKVC: fJ Id7YLVrT'I'PF:7Ir:KLKFr\LiGARNI'rIT


r:F'KILKtrNFRRNtW::RNELLEAPRL~.KKAELIFlrL~.It:LYRKvri'VKNITDFCiVFLDLDFktFr:M'AI
tJIKNILFA:~EYAiYrIlN:a".:ra.f::LYLEVTn7I1P1ITI:ppLL:TILOpRYTYC30


r:IfX:UJII'fI>lrl5dKRIItIIP::PlIVEINOELFNIIL::/CAIRKC:RV\L.:(..I:v'KFJIrIhYIEOt
EKMAIV~::FItIKQIATELKItG:Fr/F::Ny,7Wtrrl-rf.rnrrAYL'r::YnyFF.;RVFILLD..rLK


87


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
AtYIIr~'fCa)IlIFVKVAE.TIHKIIt)DKFPTJL:KV.CPn 013' % I Ja1575
dNLIC00V0~'f:VWLPFSuR snpB- Snwll ProcW n t


QTSSRLF:~ADKROOt!)IWIANALDAVNTNNCDYPKASDFPKPYPw3IEEIFPQIOL'*RLLILUL.RRKtICFLLYW
Fi3PIllclidi
KEt
~A


CPrt03:5 J7tJb9A :71119 PlI~'1.'~Y~hI
EAGLVLTCCEIKBLRdIIaQB;IGDJIYILl~tI~EC8iI3180
WRYFLRKLExKI110KQf:LIPIGNF4SRGYVKVRL.xCRGKKAYDKRRTIIERFJIEREV


CT3:5 hypochectcal ororwn MAIIfRRNN
KRLAIpNQYECLLE.iLAPLLNiT'.J1PDYJJNSCLIRFSDTfIVPWIEI9l'~NSGOLAVSTL:.


?LP04VFRER: FKAALO"VNC.:FQSS IK.'
ILGYCEYTOQLYLSDIL~fYIl4GEKLFEYL 033P 393373 387375
' CPn
'
~


fVA _
:NLPDLNVLfa% 11U ...~ty.ry..-. - vil~
LRT
KLFSLIAKIWMFw


. ~ ;~.tl' . ... . . rrINKF:-:.:aNr:;. ;r ~.:~v: ...:.,
., ' ,, r.~~T :..,-.,.:.,;L...;h; ,.::.vvrv-
.ct.r:~::
Z:Y
'
'
'


f.E:::EANLtt~.:.:.v.:a;l4>.;i
eHLL:iIIEIImfPll:.
LX:.i.~:
AK1.YEK::ai,;IP::KRFtL::/


real0-Glucanocranaarase
PDION71LRFSLPAEOLKTi4f~RTSFAVSREESRYYL'1CYLLIIAMiVATII~TJX'nKRIaK
7iLLRRVNVIJfY'tIfHSP5J1NAWNLICfSPKfKiIYLPLFSIHTKNSC~uI~vEFLDLIP'
PSClL


r RLLBG
LI~OIfOCFSVIOLLPLND1'CtL?fSPYNSISSVAWPLPLSLSSLRtID?IPNAIDQ.QIDAMLDKSFSGEYIIPIIL
1VEEIIKHCSDEGBr1IIFLL)pOKIAVOCt>MLLIl
'


CSTPSVSYIbVKDIKYIAFLREYY01ICCKSSLO(ZISNFSEFLESERYYILYPYCfFRNF$SH$VRFSTLpC$L?LTJ
BiCIRV
OII~ PVIST6SNVKLDL)IREF1.ITLLKOVALII
CPfPDFS


. _
AIKMIUiCEPIHNWPKSLTDOENFPDLZKKEHDEVGiFSYL~OFLC1f00LCEYlIAYAO(RIH~~F~EIAFNPfFFLD
ILKHSKDCLVSIGISDSYNiGIITDSI196WI


VLLi(~LPILISKDSCDVWYfROYFSSSRSVCAPPDLYNS1DLPIYNPSOLJ1I~DYhP~LNt~


LYNBlF3tLRYAONFYSV1IRLOHI IGFFW.YIIitDS>iCRCRFIPONPKDIfIKOOTt;ILS
t~.G 193105 384034


ASSIQ.PICEDLCIIPOWKTTLTNLOIC~'L'RIPRWLRNNG4DSAFIPLXDrNPLSV1S2.SCPn_0339
StIFS7ISIlRINLFI~Y LT339 hypocMcical prauin
KTLT'L'ITOIDILIC
PKEAK
fAKFIJtLPt


. _ _
O T
Q
THDSDTFAQwWGNS
LAICPDLVSKNLORERINfPCI'ISKKNNSYRVIIPSLEEWIRKKfNGIfIF)IILTGL


~~~~P ~


OlZ7 371937 373311
KaRIl.I5G71PADRALELNLL~OCDNNYTI"~LSYYNRAt~ANU'I'KSKQTStVASml4S
CPn


_ ~P~~~~'r' '
r138-L38 Ribosomal Ptocein


RIHRKNNSRKGPLTZiKRPRRCYSYT1.RGIAXX100GIGLKVZGKTKRRFFP4llL'HUtLWST
CPf~0310 383843 381156


E~iAFLKLKiSASALRHIDKLGLE%YLERAKSIO~tF1lrarltehitt wieh 0339)


013B 373320 374993
PLYPLLIVLSSRSSACICCSLKKOAM1JAGLWDEOLVKHGTYLSIQRfLCSOKLSDLS~L.
CPn


_
wSNIGJCEOLALKFKSSLIItNSDISCtAVAEEFHKOLSISLPRDLE
cT085 hypocheeical protein


LIfYRCIFNSPLRRNISLFRSQKpLIOVFAPVSPNLEU1EIHRRVILDpCPJILLF10JVIGS


SFPVL,tNLPO'fRNRV00LFSpAPD6ILIJ1RV11NLISSTPKLSSZiIXSRDLLIQtIgSi.CLKItCPI).0311
381160 381195


ARFpRIpFVSNSSVM.ImiLPLLTSWP(FLTLPLVYTGRPTLTTPNI.~IYRWRFNOtfraas-shift with
0310)
S1fFL9CNPFLTLSAIAPLPWVSLLLfIITFL
CSZ'SK3PRRtDPLLTNNONPV50FSSG.OKHSLL71ILRL7IflCLYLKpSHNVSPLVCLODI


NI?>DLtlFOI01030(3'OILYC71Ep1~l4LWHAGI~iCRWOLLDPAPTLCOTLITSTHNNGFi.PKTSLYLSIFN7
10VSD0II
OG7110.LYKKTHDHPHPLLYDAEFILVCFSPAOKRRPbiGPFODNIGYIISLONDFPCIIIQDC


IYNRImAIYPATVVG1IP'YOIaFYICtiKt.OLYLSPIFPLV1IPGVRRL1ISYDESGFRALTM
CPfL0313 381619 385067


VVK)CRYfdItESLTTALRILGDGpLSLTKFiJIV'tDOEVPGDRTSYVLETILOiLOPORDLIIDtbit:ced ONP
(leader 119) pvptidel
fSETIINDTLDYTCPSW10GSKCIFNOIGKAIRLK.Pt~YpOCKIHCVODIAPfC~CLVL$


TSLEDRCIXSLLHNPDIJt541PLIILiIUiLRETIOSEK01WR1'!'1'RCAPAt~LII~JtSNFIBBfKFLLTILFL
)lVri~IPLFSETSVIQTLPSGIOGLICITSKOKEbVVCVIUFLRSYTSLKP


ATNRPNYHFPFVTDAtI9CPSYPKEYIYDPSTKOIfVSEANHAYFPNItEfFYIIARVLEKLtIYWEI4MYlatRKE'I
LEKHAFJILNRLLK1CI11ELKPGVPINfYI7tSI0f.IfIVR
VAWIPDCPEFJ110C>l7D.tStALLRTODW


CPn_0339 375085 376146
Phtapholipasa D Supertanily (leadesCPIf_0313 184999 385595
133) peptide)


KNNKROK~CICVIISTLILVGIFAMPRGDTFICffLKSEt)IIIYSNpCIIEOIOtKILCDItratleshltt with
0313?1


AIF11ADEEIFLRIYNLSEPRIQOSLTROAO~RfVTIYY01D'KIPOILXOAbN9TLYE0PLPRRBQKRKAILI471PP
N71CSTLJI7tRYRCVKfVOFVft3GKIGRQLLTYCPTK~NGKLPS


PAGRKIl0I0KALSIOID0371WLGSANYTNLSLRLtadJLIL08lSSG.CDLIITNlSDWSISLDVLIiS~181IFLP
FRLPIfCi00KVCTIETKLD'fPNKAYVINTSN1YII'i~KSLYL


KDQ1GKYPYLPODIIIIIAIOAVLflCI0T11QKTI0VAlDbILTNSbIIQALRO~QIeGIINDI>B~'LICt~ILTPI
IGIVPEMLt~'tIMEDKQKNSRLJIPYPNODIYVINCfG8RPIlNLYGP


IIDRSIIbID:.TFKOLR0I14INImFVSIMAPClLt00CFAYIDNKTLIJIGSZtiiISKGRTSLNPIDMSLi7pKNS
INPEK~R


DESLIIL121LTKOONOKLPID:Ii10~l0iSDIPrV~>I~LLII>''~fSi.PVEGOGA


CPt>'0330 376930 376303
CT083 hypochstical Drouin
FISII~Id.SiIOLFSVLPSRIpDLHVYRfKF'SLIQirOFlnTtf100EIWVLiICIKEF~LRA
RIfhPVAKRRIICIYLRIFRVLSRFDVIOtIIWDPYGALSJ~pSIA~6R1711SPLVF3CISE~I
ATNGIRLi(LLAIGDRDQ
00Bl~EIQRWKRSI)M'K1DDOCISLCM~OiIIMMYJ1WVIARNICGVLBTASTLFYOKD
t7J1
CPlL0331 37153 376701


CT083 hypocMCit:ai protein


IOAIIN)IVSGf70GVpPSSDPCl011tPALOCD(?AECPSPLKLSIfSETKQASSMIWESLVR
'


SGbICNYATESOINKAKYRKAODRSSTSPKSKLKf.TFSK)01ASVOGfl16GF05RASRVSAPiC
LVPPTFLLLIFPIPLTFODLFR!


KIIASOSGAGTSLLP'l'CIDAIALKKQRIISPDIOCFFLD118GNCCbSSDISOLSLGLKBSA


PSCiUtSLSLSSSESSSVJISFGSFOKAICPlISEDIVNAwfVARLOCtE?IVSSLLDPHVLT55CPI>.,0345
388587 387436


LVRRANATOIIEGMIDLSDLCQLEVSTAK1'SPRAVEGKVKVSSSDSPGNPTGIPNSNTLECT715
hypothetical 0rocsin


RAEKF?.EKOFSRDpLSEDQN14.ARAH71GLLT~AlIP0EVL5NSV1ISCP51~IFPPPKFSC1'LLKVaCIJWLAVI
~f'wRTOSICRQTL6IVRRYPSEFKIISN71SYGNNLRLffQOLICFAPLM


DKSKHKSPCIF~CSTNttTNFSPLREGTVK511VKSLPHPESMIRfPKDSIVSREEAVYNECVYNF~ICQRPPH110FF
LCQDGL71QLCIND'IYTTWAASSOIPJILPAILA&000CK


PEAWIIFSTAFKNPINSSONfLPIAVESVFPRESCI1~'rAllf'aSDAVS55YHFLAORGVSLLALALJ1NKEILVCA
GELYSKTAKPNOIKVLPIDSEHNALYOCLE1GR'l'It~IIDa.ILTJ190G


APLPI41TDDYKEKLEANIICPGGPPDPLIY9YRNIfAVEPPIVLRSPOPFSGSSRISVOGKPPLLNKSLCELSCVTKO
WWRPIN8~14CSXIIrVI>SSTLVNI(CLIIEJIYN4lCGC~NLIL)1
'


EAASVIiDOGCGGNSGGFSGOpRRCSSCOKASROty00CKKLSTDIF
VIHPOSLIHfBNCELDCSVISIIBIPPDMLFPIOYJILTIIPERfASPRDOIDFSKIOpTL.P~


PVDCeIfPSIRLAppVLEKOCSSCSFfNRANEVLVRAFLCEISWCDILIIIILTTU18CNK


CPn_0332 378676 )79536 VYACHSLF~ILE51DGEAR7(frlOEI


CHLTR T2 Protean : '
' 0316 389690 388701
CPn


ILLHLLAVLCPPISFFTOGVSPCVFfCFLDf _
YLDSRIRVIPU1RQRC 070-eroDlytQD-Intpral lhalbtane
Protein


CPn
KKOSW7sLRPSPYYCVSFfOFfSVfFSRLfSOSLPTCSLYIDDIQIIVfIJIISC9CAFlIG
0333 779117 379800


_
TFLVLRKNAhYANAVSH11'LFCLVCVCLP'CNQLTTLSIGTi.TL.71ANATANLiGFLIY1IR
itu8


VDFFVFVFFMCKPKKSRTDRALAOEIOKKSTEVLKKP11RIKAKNRRKFLIAKEOKTLKHRNTfXVSEESSTALVFSLL
FSISLYLLVPMT10'1JWIGTELVIGNADSLTKEDIFPVTIVIL


ApEIIDDL'IR~LLDSOKKLITDKVLIFNYENCFVFTDICDNFSKYSIRLANAVITIFAFRSWCSSFDSVFASSLCIPI
RLVDYLIIFOLSACLVGAFKAVGVt?UtaF


LI IPSLIIUfVIAKSIRSWAWSLVFSICI'AF4APASSMILSAYDLCLSfSCISWPLTN


CPn_D3)4 379309 379837 N'IIWKFISYFRGYFSKNfEKISERSSOY


CTD79 imilaricy
TMSVHITPRKCFItCILs7IFTLPTLFpKAHLILFSPYIVIGFYCFSKDKCLVt.At,CCGVLCPn_0347 )?t078
189679


.~.DIJ1LGSRCVFLLLYPLTALITH)fANLIFSKESKAALVIVNNIFYCVPLLLTIPICCALFC069-traC/yt9C-
Inepral ll~abtane
Protein


HEVRWrIDVLHIPLKCSFLDNLIFTSVIYILPCALNSCLHKMISFFRRLVCYTf?ItIPGLSRK1'IWIVLINLSC11F
SDTLFLaSFLIVTLICN1TALWG1'ILLLSIOOPLLS


ESLSNAS'IPCLLV0AIJIlrY'lvFStQAr~IFWIVLFOCAASVfaYOI
IVFttIKYCKLNKD8J1



379p08 )80671
LCfVLWFFAIGVILASYt'KESSPTLYNRINAYLYQOAATLCfLFJITW1IVICASLIAL
~.Pn
0.35


_
wWWlfR0hM1'fDKDfAVTC'LKTVLYFJ1LSLIPISLVIVSCVRSVCIVLISAMVAPSL
tolD-Nechylene Tecrahydrotolace
Oehydrovenaae


EICNLLP':LPMEKILJRLKEEISOSPTSPCL.AVVLICNDPASEVYVCMKVKKATEICILGAROISDRL.a"l'ILI4
SAFFwI~.ALC.~>YISVAFTCRJ1IICOQAVPVTLPiGPLWICAG


.~.KJWKLPCDSTLSrVLKLtERLN00PSINCILVOLPLPKHLDSEYILQAISPOKDVDCLHLLAGLCLLFSPKr'.~W
VIRfI'RRKNFSFSKDOEHLLKVFWHISHNRLFNISVRDFVCSYKY


PVN4CKLLtA:NFOGLLFiTPACIIELLNYYEIPLRGRNMIVCRSNIVCKPLMLJaIQKHQE'IfCPKPfPRWRV'OIL
EH11'w"YSIKKEODYYRLTKKCRSEALRLYMHRLWE.9YLVNSLDF


I~ytTX.'"'ffVWt:OSFlILtEILKT.~DLIIAAtGAPLFIKE7?NAPHAVIVDUr7TrRVPADN:~YE..~VNELJ
IEEfEHVLTEELD11TLTEIWDP~!DPIIPr3IIPNKKKEV


AKr;pTLII:DVDFNNVI.TKt.'.IAITfVfCtIVf:PM'VIWLN.3MWRf.'YONFS


CPn_0349 )x1915 l9lD'!.7


"fKS u:sr. irlt)S~a tw1591 Db9-troBlyeWB-ALK: erannpnrewr
A'fPase:


.!m 11, tll~ILIIJVYDETtW
:VHNI.SIN'IEtIAAVLY11I::F.~.IGI!r:CLTAIII:PNI)M:KI:TLI*A."LI:


vt'IXCMI:apl::::;:WftHW::IIIa:RY1111XJYNaNLPKFFLYLG:LCIt:::I::X1KTTTIOGEpMfILIY
P.~.Sr.TrIFFNOKPKK1ROPIA'MPQRA.:ILYIAFPNTVLDIJUl4rXVYCYKI71~RL~.S


;"lNIVII.T::I::AKi:1'vl::I,:rf~IpBt:FHKID::LYNNWNPYS6L::rfNRAPADVPITL::VELINnP.
RFJ1FIIILfRVGL.E:3VACRrJIr.'OL;:tXaXyJPAFLAHAU10KADLYU4DELF::ALONAS


:aat.lY1'Ifll'LYKL::17GRFhITtICPLKTWf.IJILK::OTLPPKDVWF.ONYK0W~1~IQHLEFO::FT'TS
V1:'IWELRDOCKTW't'VNHDL::IIVRpC.FFdIWLWKRLIr:Ia:I'TDfX:LttLDfLfQT


trrKTt.IYKN1~IIVrJfDL.I:WKriY\VIX:WEIt:NTFr:PNNYVI:WIX:EIKT::CH11P:~RPWRYra:EIE
LLF70TLKL.~.IK'*VF;:C~:


Ir~:FlV:fILlilIr111AtA'Y:7~IIIIWIYAYf7r:KIYT1IILClfR'R:KII.ELaSYPI0:1V.3WN'T'-
,


;.,I
It:r'A'lAhIllA'fVIXfFIC:KIFJIICyWAEEI111IL't'YTt)fl7A.~.'.ir y
':f'n O14'l
y
'


y
I-rr.A! rQA-::slur.. Prnr.
vtr Ismv)tro7 t'.vnily
n.


88


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
n


WILKHA::RFIAWIDIGYIFKVMttWTFrFVACC:TkALItOPEGIPVTr'::...'Wr:~
~KI~fAIDAW:i::'C:..::.:::.A~..ARF~.:::4MEIRDCA
iIANSRPCILS'FTIRNIHDC
LFF71PNDP'VFIDDVFHALYJ1~aKIISItAGGt~ILL::FA.'iKCftFR'.LOhGEIA


'JERIM:NHIr\TAVLIKGSGDPFIAY~IVIttIDKDKIAGSAVIFCNGLGLOtCLaLRKtILEM'lARIKPGTPL
LFMI10CCIIOSAFirniltH AAilPf~faLFO~fIIAIiI~i.R
R LP


I'FI;:VKLV.ERLIARCAFVPLEEOCICDPHIY81DL.~stYlKfaVIEITLIILIEKFPCSiSAEFKAIPCLAAAIT
FIf~:7PR'IABLiNOGGRO~IfCAFCdFERitDR


a;EEL12Fa1SILD~IAKOCL~IIPOiUIYLV~FffiAFSYPfRRYLATPEEVASCAWPSRC


ISPECL.iPEAOI ~VRDINAWDYINENOVSWFPED'I'I1~DALKKIVSSLKKSNLVRLAOK


KPLYaDNVDON'lF.:TFKHNVf:LITELC~IALECOR106b50 40578.
Cpn_0361


SYt .P~ Synchee~se
Yr
tYrS


~tm o75n 797167 J~7b84 ; ~
T
- ,. .~ ~ .....,... . ,
......: .P \ r-:1~\..LT':IIW:-

~
,


... I.-: ...... ,
. :!~r':F'.-~ ~ ',K:'.: ' ny:.ll,~,.
..,....~ .,.,y..14F111i'!': -';'tw: .-:?..:.:..:.':':".',::.
y.. '...
.... YILb\4:
:w. " '
KlU!Y ..
~
,
,


., .
,
'CL:JWIAOWWEI.:~...i:w:'w;iKIIthaw:wML':K~:.i::HVH.i::E;;IrYiEF::YL:LOiYD
, DFIRRICGIG
. AYG:
... . :YPLLiNJ
. GIGCiG%TiSG
! '
...!,;;
.;.,.... ;,
:Y:.:':YA . ..tlCG....
ILFSETPRTINPKPW1PR(iSKKRRDFINFTtITDICRYLEIJIROVt70KDLi.
'


F:I" O
LLKNLKTF .
ARFSPKKPLTSLKRELIRSIRNGIVSVELWNAYVFaVRAVSSPNLEV1'SPFVDG~'tiTSCI
IO
FYHLFKNYCTILr.Ci..S
PKIARTL2LLSNEEIODI~tRVpTDPVAVKWA


'MiLDSOLTSPFELYOY;.LRLPDDTT


ODILS11IIK:DIGLEtaLSS~TRS~tPt~SGS%~Flt~.!'AG~ASLDKSLVIGIWIiLD


CPn_0351 791861 395133
4FLVIGLCKSKGIIRRLIEOKCVYINNVPiANEHbI7CEE0DIGYCdIYVLLrIQCKIOt%t.VL


adc-ADP/ATP Translouse YW
ROTImTL
TVL


.
KIKVfQRVI~BTfKTEDCPFGKLRSFWPIHTHCLKKVLPNFIJIFFCITIt~iY7
StIIt.SKGALFYAVG'i'PFLIFFALFPT
IYAm
IVCApGSGAEAIPFIKFWLWPCAI If?G


. 055
. CPt4.,036Z 107113 10
AAIYVLA~S~t~~'F
W


fF CliA/rpsD-SiOea-1B/WhIG FaailY
VIYPLROVLHPfEFIIDRLQAILPPGLIGLVAILANSINIITE
KRFYALFGIGANISLLASGRAIVNJ1SKLRUYSDCV~WGISLItIi.WVIIi'
ANEITK
H


I
LDKJOIIVIITOQ'I'ONIIE1R4NFYWCIOEIEYRDSLIEFYLPLVIISWNRLi&ONI
EA VN
IV9GLVUtASYWWINIfNVLTDPRIYNPEIHOKG100GA1CPIOB4lAmSFLYf.ARSPYiLLLAK7WKL5G
'


LVIAYGICINLIEIrIyIKSOLKIQYPN4JDYSEFNCNFSII~IfGWSVLIt4.FVOf:NViRKO
t LI1GAIIDDLRKOOIiVPRS
DLYASGVWLVRAVERYNP6R$RAFEDYAV!
SD~i10101I
VSWtMPS
VMRPAL


. .
FGWL'iGALVTpVNVGLTCIVFF11LVLFRNQASCLVANFG1TPIJG.AWYGIIIGJIW''1ISTI
O
ASLROSLGKE~WI~~~
EFS~IOELEKERKVMALYYYEELYLI~IGKVtaVS


KYALFDSTIIB'IAYIPLDDEOKV1IGKMIOWAARt~CKSQGALI00GLLVICGSICAK1'PYEERIPDE>1A~
SI
FR


LAVILLFIIAIWLVSATKLN%LfLAOSALKEOEVApEDSAPASSt
, , .
ESRVSOINSKALLKLPAAL


0357 395178 396130 CPn_0363 109700 107913 '
CPn


_ flhA-FlaQellar Secretion Protein
No robust homolog present in Genebartk/~I.KIRlfT1118S1t
as of 11/7/98 !


WVGIFFINSHFfNSYAFFNOtfVIITVRIfSCf.Tl9CCSPLTLVPNiTLID4DCECHRSCSLKIGLCISFAfSLLT
GVIYSGKKDGVROIIPVPLSILVLIFLPLPOILLD!
~Ri~W~TRWIVSSGTASSLIVSLGSFFSLGSWAATFACL4LF
CLlPPFFLYLC'


RTTARLILGLVLALVSJ1LSFVFW1PISYAICGTLAtaAIVTLIITLWALLA1ISKVLPI5J11

1VNFLJIVSKGSDtIAEIIRSRFFLEALPAKQNU.DSDLVSGW1SYR11V11%OICiALI~DF


PNELQKIIYNRYPKEVFYFVIC"HSLTVNELKIFINCWKSCfDLP~LJ~AEAPT3iDILKFSAI~CVFRFVIIGDAiIS
CILLLVNWSVTCLYYTSGYIILOpNMPIVfGpIIt.VSQVPALL
IDIGYTT
'


QKVYGCi.GPWI
TSCA71ATLISKI~LSLI31YLFEYYItQLRQHFRVVSLLIFSLCCiPSlPI~PIVLLiISL
SIDLTLFPEFEEILLONCPLYWGSNFIDKTESVAGEIGLMCl
IFHSYTRPLLTLISESpYKFLYSKASIC~IQWDSPSVIOCl'CLEIF'~lpEIStIFRI~IIOGIS


OFLFLFFSHGI'MEQAQNIOLINPDNWIC4LC0FDKAGGI~TFOCFIM'L79'~'DPVSL.wLRYRJ~EPIIS~SCIFR
11FSYVOGACPKEOESQFYQVYRAASIEVF~VRLM'tS
tAEVV
KYLCfSERVICIAVED
RNIJI
A
'


SNYEPTVNFKIwKE4KVLLEKVKESPMiPASALVOKICttN~tIOIDNLLO~OFVRM'SSOPFJ1VLPFL
Hd
II
O
LRIL~tPWLRVFT70NVYLD~1
IVPACISLSSLVVLSRLLVRERVSLRLFPxILiJIVAVYONSGDSLEILitBI(IIRKSfaYWI


art'SSLPOYAFHAOTYKLEKKIESSLPIRSSL
GRSIJtOG100TLEVITIDPNVP~LINSSYSRSNPVlIpt3iVIRRVDSLLBRSVFKDFRAIV'I


CPn_0353 396893 397135 SCLTRF~9flG4.DPHr~CViS~LP~IPISFLGIVSDEVLVP


No robust haooloq present in G~tabsttk/H~L
as of 11/7/98 109951 110378


LRFRNIKKSLIPIKRIAYSOSGKEOKOARPtfIGtSITSSLVILtS.IAIFNH~iFSSII0fC~1CPI1.,0361


FFKa4FIWI0~tTSINRIFVKFT1 Eer1-Ferredotdn IV
KaISNAKLVITSDDL00EFF3.EDNSEiAEPCES?ICI
PFACTF.CVCCTCVIIIIL~RAif.S


CPn",0351 397062 398507 ~W'~~F


No robust hoslolop Dresent in Genebsetk/C~I.Cp~0765 410198 411511
as of 11/7/98
l015T
~TYYF~


. No robuac hotsoioQ Prvsestc in Gafebank/D~L
YKTISIKILKIXTFLLIGF4LNLRYNTOIDEPRRG45NITSPVIOlC4as of 1117/91
'T-'r'-~-rr.IALIGVILGII~ITPNISSItE
'"IHIVISAILLCGALIAFLCIIAAPVSYI!~


.
FKQtQVNSL~TISPISLTVOHPLVCrKIGLRCSNF~CI05RILLITAiIAVWtIOlLI.
pVPPQELVNRIPAIIYPKPVSDFVSGKPN<.I~LISFIDLLNOLNSLYGSS1NYNVSE~.O


OKIDTFEGIARLi04EVRTASLKRLESMSSRPLFPSLPXIIQINPPFPWLGBFISJIGS%VIGLI1111PVIYFL'i'r
ISFIAWLSNFILYIIRATTiZICPRACGI~ttilalaV~SS


VEyNRVID(IOGSLF~DLSDYIKP~.PTYWLIPL~'RPTNSSIWLHTLViaRVLTRDVFISIAINRSKPI~LPAPSALL
TDNPYEIWIDIIWSLFSLVSLLPO~dLI
fiKTLLIF1TSGNAFISSYVDTi'PSPKSLLN6JIIOfTRVEINI'tLIAWIO~LY
SAB~


ONLICYAAI14G684iiWISDLNMKOQLFAKYNAAY0SY10i1.60PSLOmEFYNLLLCIFIOf.
WOPDPI%iRVFLPOIPtTPFJIIYOYYYALYV'I'YIOTAIITINt'pIIOIPLYSLJtOQ.YfRC.


RYSWK~ISLIKTVP)1DWEJL.CCLTLDIti'GRPODI~FASLIGTLYTOCLiNKFS%AFLSSPPp9RN00SIJ1NITA
VKY11AELHP6YPLTIACVERSLAQt.POESI~L.B


LTLLSLOQFKTIRROSTNIAIffLBJLA't'ID4STtRSLPPITVNPLKRSVFSOPE~Si'L


IG
CPeLD366 4119'76 113140


CPt~03S5 399955 398591 No robust haooloq Present in Genebank/FJ~L
as o! 11/7/91
'


No robust holeoloq prnenc in Cenebank/EI~LGNOKLIELKGKOOAESSPRTi?SVILEVti.VmC
as of 1117/98 NGYLPVSATDULtISPAAPLINSAt~T1
'
'


IRDPYLHIIY?AFNRSISKELA!lSKfIVPHJIf.P1041tCECNSTFPLSSRTIVRIAIASLFCYKY
ILLVAVIILF'CFLMVPFMIOfI
CLIVLSLL71IRFALOF?LCfGNPIUIIAVLJIVSCI


ICAtaAIGCLiIPPVSYIVCSVLtIFIAFYILSLVIIJILIFTiCIGG.PP'1'PRIIPDRITNVIDVKTVODIfASTI
IISHGQTPTL~'IFSGIVYAE&QAGL


e'rIYGLSISAFVREOQVTLAEFttOFSTALf.CNISPEEKIXOLPSELRSKViSFGISRLAGD
0367 113078 113136
CPn


t.FIG4NGIPIF~LLSQTCPLYWLOKFISAGDPOVCRDLCVPREI:YCYYWt.GPt.GIfSTAID1T_
EIfwDTDEVKAIYERIY7TYTAROTLxi'F31GG1.TNo robust howolop present in
OetfebanklFiBL
LTKEwLLLxNKAL as of 11/7/98
L


IFCKETHNI
SFPWRYfKfKiTSIPOVHFi~IDSHLSVDERLISFSPVLTKXEVIAKIIKLTALILiII~IIA
DG
O
KETISKELLLLSLHGYSFDOLQLITOLPRD71WDWLCFVDNSTAYM.OICALVGALSSGNL


LDESSIDP'WNLCLYVIODLKFrIVpAFSASDLPIGCtLGKFWtFDSSVSIGtLSSVLROGLVGTAWAGVLtiIPL4t7
tI11TGAALtaAWLSCLLLRRREPSKPTEELLGPOKNVPKDIAJIQ
'


HRIALEt~NARARVYOVNPYTCMINRKTSIlP'ImEGDIJ1II
WPSVPi.0Y0KLLRN6Wl'LVDtfLSEINISWCLOOPNORYYVWEtpGAPITLVXI


PRLICtSCRVNiVNAaNSNIOSCCACIWU1ISMTNPTCWNDII'RTSGGKII~1I!GIOGLSVGDC


CPn."0356 400165 100109


No robust homoloq present in Genebank/Et~L
as of 11/7/98 113766 414107


KOVQLFOYtQIESQJDWt.CDFDSOCDGFQLSRLVGLLHS5WJ1LYPJ1KE0FYLPEVSLLISlECPt~0368
No robust homoioq pressnt in Genebank/F~tBL
as of 11/7/98


FyIDpLIsSKpCIWGyAKDLCNVFEKHIpRFRQYLGSLDLI,10RFHJTFLNYpKyNLpRETLAKDIfLWVN71A0HPC
SIETCRINDTNPGFJUiFLAOLLCPKYDCLXANPEKLSN1II%KA


0357 401311 100169
YLNCFDF~IIId~tOA.IJVOVPLISSSIYSPCGKLELEPVNQ'fKPNSSAYKLYHIRT
CPn


_
uo robust homoloq present in Genebenk/D~HL
as of 1117/98


YSSF81CASMVNIOPVYRNfQVNYSOATOFSVCOPALSLIIVS17VMVIJIIVIILVCSOSLLCPn_0369 111115
415563


~IELGTALVLVSLtLFASAMFNIYIMROEPKELLIPKKINELIOENYPSIWDFIRDGEVCR'058 hypochecieai
Drocein_3


3LYEIHHLISIWKTNVFDKAPVYI.OEKLLOFGIEKFKDVHPSKLPNPEEILL4HCPIJIWNIKCDSNPLPSYT~'1SL
YRTPAKHSYPIRLPLNRTDRIEKILKIVtLTLALaCJILGFSIA


LCRLYIPMVSOVTPfiCYCYYWCCPLCLYENAPSLFERRSLLLLKKISlGEFALLEDGLKKAGILJWPIFSAVLYTTL1
L11VSLYSLLKKPKLYEILPOIEPESEOSSLSPSPQIP~OD


MWSSSELVOTRONLFTRYYADKEEVDEAELNADYEOFDSLLHLIFSHKLSLPLOIDPLPDPES:1EVSL1DLTTPPEEL
TAITVTPGYCALLEONYIa.LPSLAAVDPSFT
'


TMtAL.S
TETPOOPCFLWKLKDSKLIFISTSCOIAVPRIKTpCRVMIVNJUWI71I8RmOG


D758 101757 101578
LATSLOG1INASRL?RAHSRSGSOLOPGECRSAKWFatSDHTSNDN11PGKJ1NFL14~hGPEA
CPn


_
AKCtiJDPKOAFE'YSKKAPHM.FOFJIEIICVDVIOLPLIGCNLFAPSRf.WLCKTRAWIE
Flo robust homolog Oresenc in Genebank/E11BL
as of 11/7/98


EEVLSV.SMKLIPTQDSIERE'fDSKROKKIFTIYICSSKVL74GHFFSHLDKHNKIHSTGVAIKLALITSLODFv'nI
E00NlEEDKIIILTOKDOPPIIPPRFOLTTP
~


401991 403117 CPn 0770 415755 416913
~Prt
0359


_ CTO58 hypothetical procein_3
~e~~.pasa


ITLpYILItEYKIFlITRHFSIIAHIDHGKSTIADRLL.a~TSTVEERFytREOLLDStiDGEREKRIFFKLFVFYLKS
FMSTfEPNLTNVNLTNLL3SESMPMIJ15NKLKGLDLVAPILIiGI


R,ITIIVWPYITffYLYECEVYOWLIOTPCHVDFSYEYSRSLSACECALLIVDMQGVOAAVSSG't'MIIIGIPLLFIL
T.1WVLAF.iILLYFLLREPKSPISYMIQPTPTTKDTDLPW


t35tJ1NV'1L\LERDLEIIPVWKtDLPMDPVRIA00IEDYICLDTTNtIACSAK1COCIPPPLALTPVPTEAILEEPP
LFSPRTHOTLLOEIB~JDIttPDLOANTOMPFIMDNO'fOYAYML


atLKAIIDLVPPPKAPAETELKALVFDSHYDPYVCINVYVRTISCELIfI(ODRITFMAAKGKNSNLTLISTIGFIEKP
R1'A"COCTVNLVNAATPF81AMJVK(TSLALAIfAT3VPGIiDISKK


::SFEJII:LCAFLPKATFtECSLRFCQVCFFIANLKKtIKDVKICDTV'fKTKNPAKTPLECFSPOPLRSKOPLC4:E
C:RSAAI~tE'NtlxT1'NAJ:KI1CLPDFLCCLIGPIUSDYNYNPNDAFTF


KEINPV'~FACIYPIDS3DFDTLKDALGRLQWDSALTIEpESSNSLGFCFRCGFLCLWLCROAYLNCWFJIKRRKT'M'
CLPLL:.~.fIFW:.~.FKDEETfSLRLOWIOCNKIALIDAIptF


F:IIFERIIREFDI~IiATAt'.~.VIYKWLKNCKVLDIGNPSCIfPOPAIIftIVEEPWVHVNIGf.FJIENONOPWV
T:=TTLViHPLITP


ITfOEl4;N IlMLt'.LDKRC tt.'VKTE34LOpHRLVL1
YELP WEIVSDFHDKLICSVIItGYCS


Yfi'IRf/iG'lI!KC~IIKLCVLtNEEPIDAF3CLVHRDYAESRGR~ICEKLVDVIP00LPKIPCPn 0771
~lhl4t 417:4:


tJMItIYKVIAREfIRII::KNVTAttCYOf:DfTRKRY.LWEKOKItt3KKRNKEFCKVSIPM'ANo ntGUSC
tuvn..l.xl Vt'.eahnt m
W rn:rmnk/F1~IDL .t:: mt tl!'I/'tN


!: ip-Jtyl,p
K'MPV3:APLPT~IIRPS:x:NU:WF.C'fC:KAI.YAY11~~DY~'fY.TTKLLVKTLVAILVtEVII:


f IMPFIPrI'PPI::.I IIt;:LILTTIV
VI.Lt:IfNL.If.VIIKTtLTfAF7~~aTKRKI:1'SII3i


wttt o:!o au.tt.I 4ut'mz >


'Ts:.u tr/tt.tt.rt.:tt innr.in
'JAfJiftltr:1.f:WVFK:KNLVINNIDtICFSV.'WNRTFEKTRGFLKEYf'NtIRELVI:F~SLEoPt~ Ut7.
'tl'oa 41wn.1


Id~'IFC:LF:HI~ItKIItUIIVN:KfVDp::IIIALLPFLEhifiJIIffIN::YFKDSERFCKELOEKFlo
rrtt~u.~.t hom.Uxt 4'tarertu.
W t:.yaa.mk/F:NItI. .t:: nl Il/~//'tN


:It.PIJ:P:I:;t%:EI~I:ARIN:P::IMFt7f:NPRAWfLVAFCFOaIMKVIY:RPrC::'YAT7YX:ACNYRACI
IRHIt:HHII~':PWh:C:.::A::F'/FXrPYI::YFI.F:kI:Y::X:HUIKIAFM:.TALLLW
'


!!'I'/YA'/tIN:IF.'ft:AtULU:P.A'h:ILROYLKL:iA'PAVATILKLWM'LELESYLIRLA.~sEVL1'CSV
tHII'FT/Ir:f4.F'LI:::.ILL.AIHt.I::MYKITtit'NVl'1I:N
'CF'V7DIVAIAMIF\


89


CA 02350775 2001-05-11
WO 00/2994 PCT/US99/26923
".Pn_0383 1. ~)07is


~'pn 017r IlRlSri 120218 CT017 hypochectsal procetn
'


.
V
VODITPLTLPMOIWt.?3iD0w"uWYAEIWBAIA:.dt'eaa4ED01CQ1.
' i
'
OGIJIPATLM~ICtPALIOB(t~'hG~CZNY1KPPLA'91(O~iRYfKIfH~P~1
'
'
B


1
EC1RQ.SMLPSAL,.'LSGFGEiiPADItOICRIIRt.:.LpMERIp:I:rGSC.."LA:ILFLMI~ST
ITLTTDIDST
EpIY
LITPAINSSRRKTNTVRIGNLYIGSDNSIK
OSITS
N
IPEIFN!
ALAFJfNCDIVRVTVOCIKEJ10ACEKIKERLIAIGLNIPLYIIDINPPPOAAl4,VADPADKV


AINPCNYIOKHNMPICGTKIYTEASYAO.r~LLRLEEKFAPLVflCCKRLCKAIOtICVN~SSLSSLPOILSEPOIILL
CSJf~KTSLONSDIKELYVKKEKIaLHKPRDSLIJtRDPV~1100IJIF


ERIMOKYCCtCIIVA.iAIE'IIAVCIOG.NYRDWFSMKSSNPKZl4V1'AYROW(OLOAACLLEDGEDPLG'.ITFLR
l'OCLYCLIISIEEGSKEMIHP1!F'ntYGKERLHOALN~aLIYM:L.:


wLYPLHLCJTEAtA4CV0taIK5AVGICTLLAEGLCD1'IRCSL'iGCPITEIPYCDSGt.RNTIO~BNpOPIVAVta'
LVIRMVNL


..... . . :";ICrr/,a--:..r.-': ..~:
Ff.r;.~ : 'r:l' __~",.~..
.;.'
_
.
'
,~
~


. , _ .:, .'I w
"
....
,.
;
,,.y.,,~l...~;,:~:~ItFr:.Flnlln_YO::,v~I:,FEr~r~:~rr


JHt;APtvHFHA50PF1H1'S'il0FFt3cOGNOt:KPTKI:JFSROFDNNEEN:I:.Ia:EFGALL:.ncCU-
Htrconw-i:Ke 1rUt1il11 ~


OCiGEJIWLDLPHLPLOWLXIAFCTLOtIANRLVKTEYISCItlICCRTLFDLEEVlTRIRVITCLItIGIKMIGAOKK
OSGIOITA.iMVRKPAKK'JMKR'1S'KKATVItKTAVKKPAVItRTII


KRTpNLPGLKIAIMGCZVNGPGPHAOADFGPVCSK1'Cf4ZDLYVKNTCVKIJtIPIff~AEEEAKKTVAKK1TAXRTV
RKTVJUOIPAYKKVMKRVVKK'.SfAKKT'fAKRAVRK1YA10tpVARK


LIRLLOEf~VWKDPELrTIC.TV
TTVJ110GSPKMMCaLICNKNNKNTS~KRVCSSTATRKNGSKSRVR'1'AtII~IRNtX.I>0!!f


SR


CPrt_0374 120109 130961


CT056 hypothetical protein CPrL0385 431011 43252=


VDSlICLSFNTHPL~NYWL'fI~FDGLPIRHCVFSKOKDAEGTYPAAICtPEIASALOSPKYCDpepA-LeucYl
Aeinopepcidaee A


LNORNGTSYRMPTSPrYQPIIOGIL'TOSPLLSIJIIRNSDCOAAIFYDREIOD1IANVHSGFLVIIOGt~VH.FItAQ
i~RNRV1U1DJ1IVLPI~IiHPKDA10JAA5FFJ1EFEP5YLPAL~OG


wRGGtGNIYAVTVGTMKIfLFHI'KPQDLtYAIGPSIGPDYAIYPDYATLFPRSFLPF4III~tPKK7GCIELLYSSPM
KWtIVLLCI1GKNEELTSDVVP01'YATLTRYLIIKAKCSTVNIIGPT


PIHfDLRAIARKOLTNLGZSIfDRIFISDLLTYTENDAPFSSRYLI1NNPDPNLtGQH5I0J10JISELItL511E6FL
VCLSSGILSGNYDYPRYMNDHNLETPLSKVTVIGIYP101ApAIfRlIE


WI'AVLLLPRD
MII~iYYLTRDLVNAHADEITPKKLIVRVAI14L~CKEFPSIDTKYI,~DAZAttEIOGLi.Li1


VS10MC1fDPNFIWRYOGRPK$KZ~'VLIGIK'S..
'fFDS~LDLIIPGKStQ.'IlOfJC~IilOC7lT


Cpn_0375 421111 411615
VLGILSALiIYLCLPIHVICIZPATENAZOCILSYlOIGtNYVCtIS"LSVEICSTOi~LIL


Ho robust homolop presort in
Genebsak/EMBLAD11ZTYALKYCKPTRIZDF11TLTC11MWSLGEEYAGFPSNNDYLAEDLLE7LSAlTteEPLt4
as of 11/7/98


RLSMKLGASTNHKVHEPVKPKKApLAEIEAtBCTOJITEClLRSKSL71WZARJ1VLYILPMRLPLV)tIfYDKTLN5D
IA0t0QiLCSMUGJ1ITA1LT1.QRFLEESSVAfiAllf,0laC1'AYNZIC
'


:lILAJYCITFVTFL71L.CFPLIQAYSIACIITLVGIaICLVLLILSLLPKEDe~1~rr.e~gEDRIIPKYASGFGVR
SILYYLENSLSK


LLPLTIIVIEOOPZTPKPEIPYSYLTKLU.L.TSLfLTLRRSSSORIfIN


CPn_0386 131543 131016


CPn_0376 121680 411191 sab-85 DNA 8lndinq Protein


No robust hamoloq present in Genebank/EMNL.KSIE:YL18Q'CNFAGYLCAD ETVNCKCNIt~Y
as of 11/7198


FKV1I1'AKIIPNLTEIROIGARWSLPLLSPLTSMaIOGtXxIIfSAPLIIQL00LiGEEOMfJIMK~M.PIfLbIGSG
YZYAGOISVEBYMSKDGSPOSSLVISYDSLWSPPGRNtsI7SRSPSLED


TKMNSRKIfAGpNAIFNSPTPCVSSTLVWPI'PWGYYDKWODILLR1I9PNSSSL9EKDSKNNp00G7fESVSVGPtI:
PJILMBJ1IKDKaNYACYGQEppYVCEDVPt


EFLI04LFVDLLELJGfTSVIfINAEEAFTPLDNTGKPHPKRONVYLPC>Qi.GiILtIPJIAVOAN


vSaDTpFTLFL,TpDECNPPttDltlOiC CPI>r03A7 435229 431699


CT013 hypothetical proceln


CI~0377 123111 122317
M~Id.OGDSLMSRONALtZILIQO'AKIffr.RLPDVAFDOMJIICILFVDGEPSLNLTYElQi6D


sue8-Dihydrolipoamida
SueciflyltransferaseRLYYYAPLLDGLPON1'OWGJILY6KLL1~SMLCCpINCGGVCV11TKEOLILMNCVLI
lOLY


iM'1TEVRIPNIATSISEVN7LSLLYi'EGALIOB~IpGLLEIESDKVNOLZYAPVSCRI~WEAETIaiJWAOLPZETW
KWNTVCADZCJIGREpSVD'fIIPQMPOGOLiQAPPP1GIM


VSEGDUVPYOL1IVGKIEPAGEGEEtaDSOSKETIEAEIICFPOSGVRQSPPt~C1'tZPLR


DQ!IDOCSpGLSaGORGETRERt!!'SIRKTISRALLSALiiESlltIZ.TTFtIEYYN1'PLFtILAEECPfL038B
131313 4)7320


KOEEPLSRYGVKIGFMSPtYKIIVLEALKAYPAVN11YIDCEEIVYRIIYYDI5I11VGI0MGLqlqX-GlYCOqen
Nydsolase Idebranehinq)


VVPVIRDCDIG.SrGEIOpKLADLALRARECLIJ1IAELEGf~FTITIK7L11YGSLLSTPI~NSZTIGLVSSYPSVPL
PLGASKISPNRYRP1ILYAliOATEVIL71L1T~18EVIEVPLYPDIMR


pPOUGZLCi9fKZIGtPVVLONEIVIAOtIatYVALSYDNRLIDGKTAVGFLVKVICi7GGENPAiGAIWNIEIEGISD
p88YaFRVIIOPR>O~MpYSFIfLYLRDPYAKFIINSPOS!'GSRKRDOD


SLLDL
YAIrYLItEEPPPt~OpPLtQ.PI~BMIIY1'lBIVRSP1'OSSSSIMIAp~QrPIGIIEKIONL


NKLGINAVCLLPIPt~6171NPf1~KPPYi.CtiYWOYAPIi~IPPSPCRRY11YASDPGPSR


CPn_0378 126195 123115
EPRTLVKTLt~ECILVILDWPNIIiCLOGTICSLPItZDrPSYYILMOGNITNlfS00CET1'


suU-Oxoqlucarace DahydroqHaae
LIfIIMAP1TOWILDILRIMII>IJQOiVOGPRPtIt.ASVESRGPSCSPLQPAPVLOIIfI~LL


IVPICFNYFIlIDSSEFVCOVIISSIk~WIESMYQRFMNNETLDPSWKYPPI4CYpLGQMSPSEASTKIIA6PWWGGLI
lQ9CYPff1'LSPRfiSEWIiGPYRONV101!'LI~OQNI,IG?FA51lI8Gs


ASTKISQLEl'IAFIZ.QCQKSOPLCTIYRYYGYLQSOISTLAP1TDSRPIOEKI71KIDLDDpQDIYPIIOStrITIS
IMfVSCNDGPTLCD'I1RYNIIKiWEANCEONRDGT011NYltYNIOT~IC1'


VPS71GLLP1IAQVStfltELIEALKI(CYCCSLTLETLTCTPLLOEFVtIiLIOIKPAEpLI~PGILEYR>OtOLPNP
PLTLMVSOGIPMI0SG0EYAN'1'AF~BEiRMALDSNNfYfIi~/pL


LRSYIIDLCItA?PFEEFLOIIQ'I'GpKRFSLGCGETLVPtC.EI6N11YGSJILGISNYVGD01HTJUIPItJOIPL
CDLZAPRIGtYKTLFNRGFLStRCEISSiVWFMtPM'lliRpGNPLAPICIItiPK


AGRLNVLTNVLCEPrRYVPMePmDP
ANV:vArNVG~oDOLILTLPMSm~fLProIVASSOOCrvPONVATPSw:LOeFn~l'tsus


NASIG.ESVOPIVEGV11AAIQI~tAGKEQ&SIJ1ILVHGDAAF90pCVHfCI'LOLSRVPGYMl~.lrl'


STEGTLHIWtNYIGPTAVPRESRSTPYtTDZAIO~.GIPVPRVNSCDWACItJIIEYJ1L0


VRFJtf'SGDVSIDLCCYRKYiWBtE8t7DPSV'1'APLLYDOIXRKIfSIREL!'AQYLL~IOFIIDICPIIr0389
138751 137319


SEETL71SIEKEZOESLNREFOvLR;i'DPEPPPK1IECHHCDRLN4GELILNOCDNSt.ONt:TCT011
hypothet:ieal protein


LFNNSSRICGfPONFlIPHPKI
11SLLIDGYMLRLTVPNPKRPYOKZ750RQta71'ICLRPPKXTCKELIEPRRRTVKLLKlNLIGLFISNSIBGF


~rODSIAGTFSORNLVFISt7NICD1'IfSPLYHi.SAEpGSVGIYNSPLSEYAIL6!'EYGYAOSEVRVSDTPVKpDT
WEPKIRVLL6N!'.bl'1'ALIPrIKGPYRIYGONVLL,DTJ1I000RL11VN


QALKTLVLWE1IOPGDPANGIVQIIFDQYISSGISDIVGLLPtICYECOGPPJISSSR7lLYmCiZRWGEFYPCIrQCL
KZEPVD02ASLPPNGI0Y0CSLY1MRKDtIICIMVSti6YPIED


IIxlYLQLAANWNFOWLPSTPVOYFRILRENAKRDLSLPLVIPTPKLIiRYPQCVSSIEEYLK&VLSI1IYLEELDREJ
1LSACIILRTALYEKLLiIRNPONFWIMtAEEITiYJIGIIGYtICQ


FTEPCGFRAILEN1DPNYDASILVLCSGILIYYDYAJ''lE.p~tRKDPSCLRIESLYPLALEFYGVEEAIDWCARLWD
SPOGLIIOApMS~pSNVDRIJ1IEGFNARQILEKFYKDVOPVIII


DLVSLIDKYSNLKNFIrt4tpEESI0e1G11Y0YtIPMAL.(lOILPEKLLYICRPRSSSTASCSAKE.S~IIEELDGE
IR


LSRQ!S.V1CMETLFSLR


CPn_0390 139171 438134


CPn_0379 416168 126765 ruv8-NOlliday Junction Nellcase


C'I053 hypothetical procetn
RKSZI~EGSYMINOVAVL~DKIIFDVSLRPIOGLEI~'IfCQHHLKERLDLPLCMLpRGRVPC


KNKKMLC?CSRIODGNPWMKSeer Yvr
seEW)n,~,LVp((,KEISRIIOEEIRILEHHCLFtGPPCLGKTSW1IVAY1'VCKrLVLASGPQLIKPSOLrGLLTSL
Q~ONfPIDEIN


KIYEEKERLOLLKF11GEIEYVfPRRSPAX1'VYPDGPSMSDIEFIrEPTLTEIDZDPCETVRMGKVAEEYLYSAMEDP
KVDITIDSGPGARSVRVDLAPFTLVGATTRSOM.SEpLRARPA


ELF3.T~ECREDCAVEVDYSNEDDEDPFSDRNRWRRGGIIOPDANEttFSARLSYYSOpOLKEILVPSSHLLGIE71DS
SALLEI111fRSRGTPRLAWILLRWVRDPAQI


REGNCINCDVAEKAIat2LII~NCWEIDI1ILLTTZI0YY0GGPHGIKTLSVAVCEDIKT


=Pn_0380 416671 127876
LEDVYEPFLILKCFIKK1'PRGRINVTpIJIYDIiLKRHAIDR.tSLC6CQ


hanN-COproporphyrtnoqen IZI Oxidase


KSTIPTICftIKTLSAIAiIIGDIIWSLIPtfLM~CMPL71LYIHIPt'CT1IXCRYCSFYTIPyIICPn_0391
139701 439510


SESVSLYCNAVIQCLRKLAPIQETHFIETVPt~O~TPSLVSPLDLKRILKEL1PNAREINo robust hamoLOq
Dresenc in Genebank/EJ~L
as of 11/7/98


TLE71NPFM.TVSYLRQLQE1'pINRISVC1IQTPDDSILOLtGRTNSSSMITALOECpNHGKDOLYKOEKPIPKATIL
SRNLEVtO.DtIPKCKRQTLFLGRTSGRSALY5Y5RRILVLIJIAT


FSNLSIDLIYCLPfpSLEIFLSDWOALTLPITHISLYNLTIDPHTStY%HRKILVPTIANRCP


OEEILJ1F~ISLLJ1ENLLLSOGFORYELASYJ1KPDYPAKHNLYYWfDRPF'LGLGNSASOYLN


CEASKNYSHISHYLRJ1VRKNLPTQtTSEILPKKERIKEALALRLRLLWIDIJVEP'PSl'LTCPn_0392 139914
140383


.~.MLTODVKLpM.FSYfbpGLAWRQGRLPHDTIAEEIMCYSFdcd-dCTP Deaelnase


MSIKEDIWIREMAItIrIDMIHPFVNGQVNVNEETGEKLI3YCL.~aSYCYOLRLSREFKVF'M


"Pn 0781 12N836 418037
VYNSWDPKCP'fEDIFISITDONCIVPPNSFALARSVEYFRIPRNVLTMCIGKSTYARCG


aT326 similarity
I I VNVI'PPEPEWEONV?IELSN7TPLPAK
IY11HOGIApVLEFFS51TCCV51fA0RKGK7f0


aLPNKFAAWfAPTESRSSPPTLLEETEPLSPNPIPADIOIPRITISPPSLDVSIYASSAKOQGItYPCV


EDI~VFIACCPRSSSSASVASOWELVCLCCGDEDPEPPDSEVRTLYVNGSWOTNQPJ1V0


ELLYIaEVRCFJ1VRLL'INOCSCNSPWPISPCRTLPTLDHPLC(~ALLTVWCpPPSAPEI~NCPn_0173 110129
410721


AEFLVIFYCDIUPYI00ALTQSRHSPRLWVCISPTVPIOf;DFRVfINYRVSGDPPSSLOC<~f03R
hypttheCtc.U protein


FGTPAFIICTtLPYS9CLECVFLPSIRCPSFIWAVRPr;EOCLVAF1RCE0VEDRtJCLSppAEKFLTLRNCORXFTII
dct:LpR.,~Y;,LSL'/FPARFtJIOTEKESIKSNM:SPYLVSNVSVRKKN


ASGLPfI:ERDt.AWTDL'TDPSsNSRLVEWWOCSt'SSOMEINPYPORePOVAtSALYAISwCPRLLEEVNIIfSWWV
IF.~,ILII,T',FV'IDRAIOELRTEELHLpSKVSSIC00IVSAQEKOR


~'J::.LSVEWILr4IVHE(a.DWIC'l3LIIl4HTTFAVRYFFLLFTNYt~SRERFRTARIYAQOLOLHLOIfWQD.~
.MI61ALLORII:LtPY..YKY.t.CVSPKpQ.~,f~OID


:r.YI.P.: f LVLVPUCr~JVLRK1,WMPpEILRAIF
tSA::TISGS f VFVt:GTRtM:fa:LRNRVp


..F'w/WVICrt:Ll'Vff:fVRASYROR1K:FIICFLOTVH(y:LYLPV.~.IMILt~IAIOVPRILVI'Pn
ilf'~4 4A0717 A41u.q


vlaItfCAV'IDUINK.~,aEENW:I::CDVWVI~TWFIta:APVLFVNLWFFVKSVLRH3RRRRRtly:~:R:'
ltwnut pfotetnlHr:mvlY::m
Iw>arprrll


KI:'TMI VftIt.NPPI tt'.FTII'..Jf:FI
:L::y IALF::1.1f:'.f.t::IIYKR::K::KK(><H1VATLLIJIPH


..'I,yt~t 170752 4lnnlo HLLITLLF(.'DIC:WLAfONCPAILFr:LM:l.Mr11"n
:l.l'LAITf.tIa:EII.(HCAVALpfNfQ


y.rm:Jyt.U.-::nMfwL.mHnt
MrrrttycranatnraaelA::~1IAPLI1...'VTKIFNpLWWtaV~':fsrvw~tltt::Kr.JfOItVIVELKFII
W:x:KOIf:V
'


I'VTL
VtIpEE?nLLYt7YL,~aLvDCS:IMERtIC~Ftr~4ll.l"II)IIYPff.ENLY1.LF::K(Hk::NVPIf:Hf7N
ILI.fTITft:fAAVETLI':CVICELVIIRLpf:LIVE.iDI&7CAAFLSLNKIPEVIIKPPWI


I::Y.IIAFL1KANDFYLEfIVKHCEN4~,LI:DAfiLM:IJ1DPCA::LVARAItAIIUiPVMF.SCPIA~NWI:Lt_
't'AN:a.LlJI0KPl4.~.::DfiLi.lLLYYIyYMI1~.'PI::AKMAI.I'NIIMND6TIJ*ti


".:fTI.AItAL;:(:LI'::~E:F'PF4:YLPSY.:PKERVK:iIKKMT.3KEV:uTS'Vt'fET.~.IRtAIYTFEt
>!'Y(i::Itl~.LtTUEDLFEIVN:EIVff,NII1K11.'l'Pf:a:Al,VtIA:7~:pL1:LftRF::F:IYDINL


:LIJ7fl.f'::'IAI:Ia'VA::I)L:.uP:iELVLTAQVYI$yp'~EDLC..~VK7vtTKVPI'IFLFHIPNffNNH
IATGkIrJt.tEyIf:fIPTfrl4Yl
:'Wt7flIJ.l'~VIJtAAIYINIftIIVYIIIKI.YIn




CA 02350775 2001-05-11
WO 00127994 PCT/US99/26923
l,


"Pn_If'I'~ 111955 141175 NYrJ7AMEKLLtTDfVT:"' nLDKK:'IERLYA:.FpAW
1(LFFLT.:RYYK1'AAP:.FSD
'
.


CT257 hypotnecical 0roceln yEr.ATALFSIESCiIPri
.DNY
FOAPYLLCC~ICAfVW::-:::.nLLYSKSLP.:DL:.:::;.x.
'
'


CNC?fMSALFI~ttCVNIICIVLOCPYSllfOIACISFNRVRLOYYLTKDF(KKARYINFLZRR:
R:iLKDp,YAEP i7G IAI
YRFSPfPIAODLIfCYVOPRSFPNAIfERELLFE
~LI:FLTIIG1L~ 1&MA
RWiIfDPRx
'
l


PYRL!'~7JIIGYMIALAVCSESSRNCIfRAIGITPDYAPFTOIFtWIFAf3.LPLTISRIfIr
~falI
f
OKELEAOCALTSVJI
SAPE00NH,1DFLAPPJIDIfHCIwAWEALIfItYY00LMSL
DLIEACDFKIVM:


PEKLALWCI1PILY'fSHYIFYPLIOLICSLT.F~.I:IYLIJ'IIRKEKfltSTLSRDEFOK71LCTHr
SCDDIWDL


HEED'!11'IATNIF.iw~11TC1100VCQPLEQVTIQ.PSSANVKDFCRTIfO'JfDINFIPVYNK
151090 451591


ARIWVLCIAHPKDFVNKALDEPLINNWSPwFITAKSKLIRILKEFRDNR55VAWWASCPn_0108


~EPLCIL:iLNAIFKILFNITHIIWIJIPK?ISVIERTFFGFISRIImLOKLLDIOFPOYPVECT10-
hYpotheci~al pratetn



~-
~
. -"
'
"


".v~n
..~." \i.-t .'."... ........... ,
...... ... -:h:..":Y1Il,:-: up: ~.vet
~,r,
t,
,
.:.:lr-
~
.:a':IJ!!'i F, ..... , .
.: _
.
:
:

'


_
~
. .. ... . .. . ~
.
.711~'v.
~':VWAY!w..


r_Pn_0396 111J1y 447741
EFPPDTUINHi.Wt;tulxv::.:.:.:a':.:~t':..wY:,::~::W(.VI7x.L


yhf0-Nits-related protein
LVLEASIRIIfWCVL CPn~0109 151615 1551:7
'


PPERCLLEFLOk?FLIEC:YJ1NPSSVNOLGKILSROCT3(i0 hypotheercal protein
YSMIYLCtMRIfl
WPE~AC
V


SY YINI
SFpCRVLYTSCATESLNLAIASLPKDSHVITSCSE11PAILEPLKIiSSLSAN
C
'
'


VLTIEOZERAVTPKTSAIILC,IVNSE'FGAKADIAAIANFAOEROLOFIVDATANVa%RILT
p~f.PLFFVIASa
fiJONNLTKfLKSSDEEPFLERFS
iEfLALICY
O
N'Curt


VLPSfrYf!liUlfSCMCf'H71L~IGALLVSFGVKLIiPOLWCOGQOCGLA7IGTFi~tLYI7IASLLLPY~JtESNK
ASTARLLHLLNRDIDIPGFf?IDEEOCLIfYRLVLPCLIRittIICIIi.RIYI


YIFKYLDLHOERISOEILTIIRNGPEKAIIWIIPOVNZIfCADOPAIIMiVSAIJ1FPPLl~EVOn'IffL.VCDSFSH
AICLIS9fA11~ILDCLRA0AL0E00EKRNE


LOIALDIECIJICCYGSACSSCATAPFKSLVSNC1IDEELTLATtRFSPSHLLt.OEWt~IAV


CIIEKVVCRL1045 CPn_0410 155087 155833


dnaQ-OIIA Pol III Epsilaf Chain


CPtL0397 145124 441)81
tIVRLFKSWKKMfIISSQI1~VLIFYDTCrCC:OIERDRIIEIMYNSVTDESPLTYV11PEI


PPZC Dhosphacase family
PIPDGSKItICIITDAVLSAPKFPGYDCFRKfCGEDSILVAlOS4DCFDFPLLGK1ICRRN


EHPVDfDYFCLSDIGRVRAANEDFWOVNWSQWAIAOGVCCRfl;CDIA50E11VTSLNELSLEPLTNRTIDS4KWAOKY
RPDLPKNNLOYL.RpVYCFAEt4pAHMLDDt.'VIUOIVFTSLI


IDEOQSKLNGYGDDpYKETL10CILLEYNCWYEEK%4EEHf.Oh~'1'fLSFIOFRI~tAWLCDLPPQOVLDLLOOSYN
PKVFxNPFCKYKCOPLVDIPKSYFENLEF~CJ1LOKPEIdIDZKJI


FHVCOSRIYRIROGELRRLTEDHSLf140LKNRYGLPKOSDKVYSYRHILTNVIGSAiYVNAIALLMOPT


PDIANLPCEKEDLYCLCSDGLTMIVPDZDIRDIIafOPATLEER~I71LISLaNI'RIiG'Od~fA


TWLVRIO CPt~0411 155794 156609


CTZ67 hypothetical protein


CPn_0398 115518 15700
RHQSRYSSITSTDNILTAAFSPCPNDIFLFRSFLfmPOFRPLLNOVTIADILTI1F1'LU.O


No robust homolog pnstmc in
Casebank/ENaI.RRLSLNKFISMLFPLVSDYYNLN09CM'LCYNSCPIVLSLDPECSLDrtaTPCtOfi'1'JWA
as of 11/7/98


IEELPFtQIENSSILFAEWFOCWPiFSVISAPWFLPCxTLIPKEKVTIIVPSOWStSLSOLCKLYYPKJUCLIPMPY~f
ILSIIILOCINOCGALINEERFSYDLOLTLRADIGI


p
FPLPL~CWIAKYVPM711vDJ1LTAALRKSLZCSLKDPITaGAKAVEYSKNIQAIIVIfUt!'I


CfYIIM~I>!'OLSIITDKKJILIHQ.WlI7INyC(:pY1'


CPtt
0399 115759 416573


_ CPet_0112 156515 457216
Cf253 hypothetical Drocein


YKLGIViIiGKSLNCFSIDLItSKNFPIU1RIFCKISNLA'NIMtKNLVLLASLGLLSpTLSSCT363
hypotheCiul protein


rTHLC71SGS7MpKLYTSS~S ZSKATYAS
EPZS'lIIKPfNYLKfGKKi.YICSCRI~HIVNfPKKZLCNADY%ISPLI~TpIN~t


EKVFLIKfMASP~FYAPIANRLPETIfEOFLPAEPIVATB.LEOK'IGKF~IGYDSVI'LYSYAC1'DYHLDLYIVIIV
IICSTAVWaLpSYCpAYTDYDWINPGF11CRCSPEIII~OCIf


ASVRVRVIDIRHNKZALIYQEIIF7CSOPLTTLVNDYfCtYfIJFISKNFDSTPIGtJbiSRLFRTIDCIANLTfD!'P
PVLSFaPPYIFDALPDSLPKSSLV'tSPVLYftYalfOfIFKII~YA


EVV71RVEGYVCJ1NYS
IASOA71ENNIPCSFLKITSDYTYPGDCPFSRLEEVS01(LTQT~.YE<i.PIGJIfhIJIIPItKLL


LPCP '


CPtL0400 416537 417306
C1'251 hypothetical protein
SKS~ISKPILLt.SIGVMtaSKNFFIWPAPSC1(TPL1QRQVLFGG11LLVFSSLVALSVSSO
TABLLS1111CISLAFAFLFYLLFLPKDZTRAILFSCERWXTSWR7IfGSJIIRIMIIIIPV
'1\7LICINIISKFL?LVLPTOEZH1'QEYl'pEVpNSLPI'~NYISNILNtL:VLTPF'CFiIIFFR
GILQl'FIJOJKNTAZMYtCSSIIFSFIHZENSLCSWVFVWLFVFSGSACFLYEI~fIIL
SPIALIGLFNLTSLZ.FLCIK
CPtIr0101 147881' 447195
GT~SS hypothecif:al protein
NRDHAlSIQ.I4'IVRANVVECRCPWSiQOSLVSNVEHILCECOEFHEJ1VG.OGKTVOEVaSE
AO~IGTLVLILCFLLEAI7CVi.ivSED17J1HEAFlEfILRRAAPYIFAEDYKPVSIEERDRIJfEL
AIGOtBI~ES'f
CPJL0403 449012 447888 T


mutt-lldenine Glyeosylase


NPIDtFCNTKI11FSEKAIOJFI?VEAL<CKWFEIOVIfASLPWRDNhfPYSVWVSEYFILpOTRJLEVCP(L-
.011! 160103 159172


VIDYFI~t~RFPTIESL71MKEF~IfIKLwF7CUYYSRARHLL7lRMIl~EFIIDKIPDDaeU-AeCOA
CarboJtylase/TransEerase
Alpha


aISLRQI1~VCPYIIIHAILAFAFKRMMVDCNVLAVLSRIFLZ>:fSIDt.ESl'R'IyRIBRILCLRIVCIID'IILF
IRGENIWELLPN4CQVVEYEKJ1IAEFKEIQIIDLNSLL658LIOID.~I


AQAt.LpNKSPEVIAEALIEIGACI
FVLPVRNAAKKVRLWfiJCEKIYSDLTPWltAII0IC1WPSRPRTVNYIpGlICCEFVELCGOR1'FRDOpAWCDF


IFLNRLVAIVLYt7GSLWEIDtRPKB~41AGLYEFPYZEVEPEDCLQDIDDF'fI~IBLSLESVKZOOORI~.IGOBKG
CDTJ15RWRNlCNLCP~FRKALRLCKLAEKf~CLPVYILVDTPG


'PLEFLGFR.KRHAf'CIMKVHLCPIIFKATSLPOFGEWLLSDZDHU1FSSCNKICIKDJ1LAYPGLTAEERCOfWAIA
iDILFCLSRL11TPVIIWICEGCSGCAtGNAVG0SYA14.lNEYY


LIYtGOVRSRESIGV
SVIBPGGGASIWKDP100'1SF~1ASM..%MICENLKOFCIIDTVIKEPIOCAHHDPALVYSN


VRIFZIQEWLRLKDLIIItELLEKRYEKFRSIGLYEtTSESGPEJ1


CPrL010) 119009 419710


yeeC-predicted pseudouridine syhchecaseCPt>_0115 161522 460221
family


NFNpL&NOKRMi.OYFME4F'SWLiLTpVSRLSSFLRSOLPNISKOEILtISIRONRCRVNCFCT266
hypocheclcal protein


IERFP.SYKVOPCDRVSLSLIPST100pPSILWEDDYSIIYEIIPPHLTTEQNAHNTRF!'CVFtSOIGFL.PCLTLIF
YIIIVWCNAFLIKLCVINCLOSRLQHCIEVSONSNfOSOVKOFIYAC


RLWOGTSCCLWGKSKOMTELF~LFKQRKINKOYIAFVFCNPKKKFGTVKSYTAPVYAAODKTLROSVLKZFRYNPLLKI
HDIARAVYLLMALEEGEDLGLSFLtiVOpYPSCI1VELFSG


C~vAVIFCAAGPSOGEPtKSAYIfWDCI,IVILLSE4f5TfDLIOJSLPRSSAL55lIL.TPGCFPWIfCLPYPAEHAE
FGLLLLQIAEFYEESOAYVSIOISHFQpAL!'DNOGSVFPSW90E


NSRLLKEKTTLSOSFLFOLCIIOIHPE'ISLEDPALCFWlpRTRSSSANM9CGpS8IG11Y


CPn_0104 150967 449871
SSC09GVIAYCPCSCDISDCYYFCCCCIAKEtyCpKSHpITEISFLTSTCKPHPMPDC15


No robust homolo9 Dresehe in
Cenebenk/EMBLYLROSYVHLPIRCKITISDKOYRVHMLAFrITSAMfPSIFCKCNNCQWDDPRLASCSLD
as of 11/7/98


ELEALCOKYCKAVLLIALSELCID'MSLLSCNALEGFPPIAEVNAACDRCSMDFCEILKSSY10CPCNDINILGENDAI
NIVSISPYMEIF7lLpCKEKFWNADFLINIPYK6OGVlILIFEK


QSMDWADMSCVDCLIADPFWSTAIASGIAKSSLQETEP'ECESKVN1.~SSWCEQGAQVCKVTSEXCRFFTKIW


SPFNLERICMSFPSLKVFSLK10JGCENMGIOLariSCWJLWSIFFVATNGCS1'PIWTTKE


NIJIALVIILVLSHYOCYFVPA1'CDPORCNIIIOJPEI1NAILAAGNCNRVDLERKRCCESSSSCPn_0116
Iu1871 161557


RYLELWtCFENSLTKTSLISDAFaAfpERDKCLLONSTSLI~sfrACWWRPPVPTPSGVThim0/lhfA-
Ihce<trotion Hosc Faecor
Alpha


AfiPOPOPOPVVfSOPSGLGaRERSPVSSRCRFPt'VLPLSVISPRSHPCAVERRDLEDEEEFJ1LSNNATMTKKKLtS
TISODHKIHPt81VR1VI0NFLDK.YTDALVKCDRLEIRDfGVf.QV


EVI~ VERKPKVCRNPKNMVPIH
IPARRAVKFTPGKPNKRLIETPlIKHS


CPn_0105 151814 450960 CPn_0417 463017 4ti~~1


CT105 trypochetl.cal protein amiA-N-Acatylmuramoyl Alantne Amidasa


Ntf,TfSHSRVLLI(KFSKEF'fIRTYRSLCFTDYLCu~LTNPLCKFPSPONPOWTIApSSITREKCJIKLTKYLNTKO
LRSNISRLFVRY.iLFNSKOLSFFAL~VIGSNPIFAOTPNPPpRVR


PpAVS~uuAWGFLO'fOGAASSTATTTTASCASAi.rL5p0pVOALLTNLLNYGOP$VOQPSTR:uEVIFIDFCFKxK0
0CTAS%ELHYEEKSLTI:
IJ1LTNQ.>ILKPMfiYKPpLTR330VYV0


ACl"'aCA :.~.S.SA~IQQpLLOLI
LDKTTCSCCSSVSSEOLOOLLSLVSOItT?SOCCSOCfOII:KRVAL~,NRCQc:OVF IS
IHCNHSSNAAJIIrCTEVYFYN:I~ICSPTRNRMSEVLGK(iILAA


aCpMSVLL.NLLSATCSAAANPI.CfAAsLAQIIYMVTSPCAK1ITSEfCYNYCCE1'CpGNMEKNCtLKSRCLKTJWF
WIRDTSMPAVLVETf:FISNSFERAAWDARYRMHVAKCIAEf:


~'C(:P1'~CPDCOCCCCCFCRFFCCVWttNCCCLCEC:.OEPAIPLVHNFt-:r;PFOKPKONtAK fRKPOIOild
.


<'f9~ 010,, A519b0 4529~s '. Fn-IIIIu .In4111I 16.51


t.rbl&uJyl-ACYI-t:arr(er 1rorein murk rf M:,'rylnnm.umfylnl.~nYL)lutamyl
R.ylu~cose DAF Liau:r


tX:FfILKIDL'fr:KVANAc:ICD0f7CYl:WCIfAKLLr\L;V7J1TIIVCI54VFIYKIFSO.S~iELCKMIJIJ(
ht..IJK:V~/(KIYt:KVRFLEVRNLTROSRCV3VCDiFTNIKrIJPYDt:NIA:AVIL~1.ANC


F'NP':RKf::Nf~fLLEIAKIYCNtM.:FD::FFDVPECIAENKRYKf:ITt:FTi?E'IAfQVKKDFAf\tA::::I
.YNI'Ff::l7VpIfTf'Nf.F.ELFJ,ELSAKYYEYP.~.3FLHTfr:IfCTNiF7~I~fCLI


:IItDLLVIL:LAN::1'Rf::K::Lt.ET.~.RKCYIriAL;.L:::T::N::LL::I1F.:::l!?IRtX:iTI::L
TKALI.U:Y!':KI':a:1.laa'(IvJII4;FJf7VtFfX:ITrtTPALV~FYLATM/It4NRIM'PMGW:.iI


YlJI::MNAVI'tTIW
1X.I::::AKAALE.:DTKTfr\WEh:RFW,tRVNTf:'.w:F4l::RAC7KAICFf':IJI:a:I'.VA'i'ITJI'I
rpAVI:ITIITIJJ111.DFlKTFT'NMItAYLF::LV4f':iteMVIffrpSPYA


F:HFIVDY'ItJFSIAFII'FrVINAI~tV(:AVMFLAiFLI::AITf:ETI.WDIiUANVFK:I(:PF?1FPK::y't
F':AKAI'Vf'Plv:ll'_'..\.1V'IPATDfuL:'.:::a.TKYTL'llr:p(jKIA':::::::FIr:KYNVYNL


!>:: LAA I:. I'VIfA;:1 Ja ~I,LF:IH.1.1:K
Ir:IJ.'r,Ff'It :RLDPVLFIfiN:F'll
IU'IAIrfFItAIJaiVl.'117L


I IF:1 J ~19~Y 7:It4I W F: ; s Y
.I ntl,k::Y.kYLMA~ WERYCFA'/'lf::ONII::H
FhF:L l'/HP tt:Or:


~'Im 11111'! .1.l~li'f .IV2H5H
I"/::NFI'/F'I6I11<KyAI'I'YAI::111::UFLIVI.IAC:KCHIiAYrJlFKIIrjP/AF'INnYJI~AFYLA



IIAU ::"Wrt.lmfly hV'It'nl.ru./Idw.:ph.ft.u:~::/V


91


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
.:KHGPS I::KARTf.NtR.GL


CPn U4L'1 466997 464A~6
r ~
1 ~en
~ Aii~
IIa
Z81 ~7~
~9La
~
~w~
~
e
t


SYRK~IVtGIffALYIILi.VLRYYKIOICEDtMtWAAE.
3VPYPN eswnc
FPIIINIwtItPOKKV l"r-
QL e
c
f
p
.
/
fo
No cbuse ho


.ILGOiIEFCVROPFRRGTFFAIA'IVRKCDKDLQOPFAVDITKFNtGADPLAIpECHRI~IIKtf4ll'LFKYVPRSR
~IPDTLTFLKRYS.TJLLHSEN:LSYRIPAKYInIIiw?SIJ1VAPAt?


r>.ILpPIDGG"'Y'..~t~LKL~fKSIIYCKLIPLLDVSVIIDRLSLWWKGYATKNRLPZN71LFFLFSCE'L'.iJL
RLCAL'fIGIAL1ICVL:.TIVVYCIA::KIA".'A::KKPPSISRIEIV


ITOYQRSYPF.'KL1JDOVLItTLREIKDCKTGKAFP'fGQttiAYFIB(IL6GOVCERKLLJISPL
4749\7 47
;514


NRt.D4TIRVIKLPKOt:.'.OiYLTiNPVIt7t'IAI~ELtiRGVL.FaKA~00GRLILINSCICEIG1
epn_Oil_
. .. -
..
..,


!':a~:: ... ..,rr::,_.::~7f!'i.'7::I"::'1
~.-=i a:.::YYf .':," :.:.\:IEEw:;.Y..'.:,
. ..:: m, ~ . . .. . , . .
~


W .C . , . ..... _ ..i.~...:, ~y:W-.
..Ir : :: ,'-GS~f:.vR:: ~~.
.I'tvrr:rt'i: :'' ;:Ii':::::.%:.::1t:..:~a174\t.:
.. . G' :: ., 1.
'
'
'


VAWYQQKLLAL:IPCRlCTCIfLpSFJISGLVPSPNRl7IIt~.SLESrSLS'TPYSLANGYNIU1:a:dvWh
::.,:ML
i F 1.;N:aLr.H I wi'AHla
.;LFLIi:aAPLvi..: Lit trlA:?.:


GIpMVOAYAILANCCIIAVRPTLVKKIVSASCEEYHLPTKIDfIRL.FSEEITAEWRAHRFI'


TLpOCSGPRASP%HNSSACKl'Ci'fC101IHf'KaCPtA_0433 4773.7 476929
YDKRRHIASFIGfTPVESSP~NFPPLVIG. '


ygIODpEYCLRAOCIKNYNOCRCAAPIFSRyADRTGLyLOII.ppKKLpNCp=p~,AAt~DlLystem N Protein
OesN-Glycine Cleavage a


YEEAtJRSPKpOGTR
RTFRILYGTLIR'lCSitKV1811YSDYHVWILPVNERWRt~:LTEKL4pKNLCAILNVDL1SVG


SL.CK>'~.EVLVILESSKSAIEVLSPVSCEV:DINLDLVDNPQKINEAPEGEtfIiLilWRt.DQ


CPci_0120 167120 166124 ~P~~~


CT271 hypOChatical protein
KSFPtdJNSRFLRLCCCLCFCGSLFYFIfINKONSLTKLRLEIPCLSVRLROLEQOIIISLRFCPeL0134 179471
477276


LIDKIEAP1EiAALPEYQYLEYPSEESISLLSYELPCT213 hypothetical Drocein


RPMfRIYOpDLPCRLCRDPAWFFSLLSFTLRFYCLGRGWTLLSFIYtOpKKFICIVIAW


CPn_0121 46A007 167108
CIfSGICVWCRFSRKCSAE'~TSRRI~fPT:ASGIWYVEKDFNAIBUtFPIItIGYPfI'~iPRA


yabC-P8P2B Family
mechylcransferaseWNfINIIGLL?DYFLZTRVGOG.FLKVYNPGFJCFSKEKAYpPYRitPOIIpPISS6iVNIt
SS


EILNSERAHIPVLVEECi.Ai.FAQRPPt7fFRJ7VTLCAOGNAYAFLF~iYPSLTtYDGSDRDLAPpLLEILKVlOOI
ENPISKFTiFLARAKLFLLERRFPHYVLROfIZ.IYRRQMF11LPPDiAL


QAIJ1IAFJfRLtTFQDRVSFSNIISFEDLANOPLPItLYDGYtaDIGNSStpLDI'LSRGPSFQSRQ~LRLFCY01'I
OOWICOJIYLSAAVSLLIRFIDEpKKVLPRPSKOFaRDDIYdtaKNA


GEKEELDHRI~O'fOELSASIriR.NSLKEffFi.GRIFRE1IGEEPpWKSAAKAVHIFRRIfOCILYT1IISK?MEPS
IJGFCEISTfSYFQFLEISESEFFt'ItYRDILLCKMLti.IQOGVEFDtOPL


SIODVI(F~ILLGVFPIiYRFNRKINPLTLIFOALRVYVNGtDROLKSLLTSAISWi.APOGRL1TFFVGGKDSIQVEF
PRLPKE1ISFKTKQELKAFCVYLIQ,VSLpKSDBt~VPNEILPIRTI


yIISFCSSEDRPVKWPFKEAEASGIGKVITIDCVIOPTYQEVRRNPRSRSAKZ.RCEEK1LS0KAK6PRLVCRRFSIDY
KRVIILODW1TVPNVEVLNYppNSENFQEILOp!'PDVCfCOSYK


p!'pICJCPALRDKISLITRKEILMRPERIL4SL.QpVPK09pEVLLSAGIDdSALPCISOCQ


CPn_0122 161233 1617A1
OLIIKYLLANiYLDLYSODACrYY'CIIVNSSFCKEEVLPYREVWtDIJISOLLTSN01Q.VD


CT273 hypothetical protein 1~RTRY~~~~'~~FSWSL~LKTIER~.


GLANVEIFNYSTSIYEQH715Ht4RIVSt~'RICEIO!!~'.ISIRDVAIDSApILtIOIPKPSALTSPORDRIFSIIt
VCOYSSVINSPNDGPCYYOCLSttLLYDRPASV~CL.FIaKSOLDEiLiGS


L:.pTNpKSNWACFSPPNNFYKQRFSTPYLAPSLGSPDOpDEDIDCISSFLIiVLTRG1~'SYYIaRFIEQCWR


RSOITPFLSYKDKEtEF~EDPEi~DPRVOQGKVLLGLDL.fvKSTNVMIDYVISRIFO


gypG CPtL0135 110908 479175


Plfospttolipase D superfamily (uncleavable
leader peptitlel


CPn_0423 1617A8 169216
GYtI~'RLRFRLMLGIFFILLVPNSVSJNt'1'IVIISIIItOtVCVLVYDNSVp~IiOpILDCIDII


CT271 hypothetical Protein
ANPYVCLCPClIIGGRTLKllNDIft.EANI'QfLVPBICSYIIIQPTFTpiIEttLLKAL1~RN


CHLDNEWKAIL~WGDOELEELRISG1ISP'LitQCHYSKAILPFEALVILDPLSIYDNpTLGCPNRPPYV!'ICCPPST
SIL71PNVIEIAIIKLSIIDGKYCI


LYLQIGENSOALiWt.DQ7ILRNQCONLPTLISJIITKALPCLCRIEE71TAIATYLSSCPIPAINPRLIVSCVRRpLi
i!'RDODIIG.RSTAfGt.OLRECID!<GOPAIWD~YYA101AiPItAG


ANDAFJILIJISYSKATiDD~tIUILVR
ACPPLTLZ'3aAEETVfPGFDIOtEDLVLVDSSKIRIVLCGPItDICpPNPVYOEYLKLICGiIRS


SVIC.iIlOIYFIPKDELi1071LVDVSlI9d~rVIiLSLITNCCIIELSPAITOPYAtIQIItDnPALL


CPJ>_0421 469528 170961
YGI~IfP4WlUd11C61t41CPY~tVSIYEFAIWC1'QIJIKI~NIIDatIPYIGSYtII~miIF


dnaA-Replication Initiation
IaeeorCYL4IWIESPIttfAAItAIDfVR'iKDIGLSIPVSNGDIFbwYFHSVNNTLCNL~.TrMPA


SRCHEIFSPSLIK111VDCIWLSFIMCE901LTCNFLNYVKTRCSKTAFHrWISP


IOVLElTQEKIRLEVPH1 ALCf'WAGDDCPSAPVCPIlr0131 I1i33 110902


ASI IEGPSNOpVK871J1VGLAGKPGRSYNPL 1p1J1-Lipoace Proeein LSQUe-Like
Protein


FIIiOGVCIGKI7fL(JIJIV~WiYVRt:NNNKNLRIIK:IITilIFIN~VYNLItSKSVDKNI0~1FYRSpYVC1llI
KVRIVDfQKSSAASNIWtDRDi.LESLQOGELILHLYt04D1PCSLTYQWl411mt


Lt)i.LLVDDI0FL01110NFEEEFCN1'FCfLTt~.~OIVITSIrifPPSOIiC.SiGtIIARI~IIGFLL.S4YAOI
.GLDA11VRP1COGIVPIDOODYAFSVtIISATHPSYSSSVLA~tylflVIISPVAKV


LVAHVCIPDL1C1'RVAILOHKAEpIGLLIPN~IAFIfIADItIYCNVRQL~GAIl'iIG.TIIYCRt.LEIVFRIGQf
ZaPI;~iSSSRDSGIiPCNIUITSKYDVLFGDIDCIGr3AAQRKVOpGPIJpGS


FCKSLTE1911RETLKELFRSPTIC~ISVEI'ILKS1fA111FONlCLNDt.KGNSRSKDLVWtQLFL90SSSETYORF
LKP6YLEIIn0I0IHAFFPLCLEA71DEVL,pFJIRQOVKiA/II~IC


IAMYLiINTLITDSLVAIG71APGItTNSTVLY71CKTILt0~l4DlcfLKRpVM.CKNNIVCC~.t.


CPn_0125 170965 171561 CPU,-0437 41110 11350


CT271 hypothetical Dreceins elpC-ClpC Protease


FRGCPtffRRTCIIGPFEDVOTLYEEETSSPSSYSPYSRSERPETPPSLFdJPKASE7IRpLNlfplinKPTNRAKQVI
KWDt~IQRLNNNYLGTFJtILLCLLK1.00GVAVNV1~IL4I0PDT


HNLTJ'.R-
SSLPpWSSTPRTESLLPLEEPETTLGEDVTFKCEWIIILRI3.RIDCI'FflGILVSK1IROEVICRLIGYGpEIQVYG
OPAL1CRVKlCSFESANBEASLLEIOJYVCTOILLLGILNI~D


GKIIIGPKGSMUDIOLOEAIIEGW6CNITVSCI(VELRGGAIIKGDIOANTLCVDDGVRSVALOVL~ILIfI~RLVRKC
ILKELETFNLOLPPSSSSSSSSSRSNPSSSK6Pi~liStGS


ILGYIrIIACI'1'DItSERZGtDL
DKIIGC.SAIJUIYGYDL?EiNRISKLDPVICRSSEVERLILILCRRRKNNPVLIQ6AGYCK


TAIVdriL710KIILti1VP011LRKlWLITLDL71f1tIAGTKYRCQFEERIKAV10E11RK1A.11I


CPn_0126 172111 171536
LLFIDELNTIVC'.iIWAOGAIMSNILKPAL71RCEI0CIGATTIDEYRbIIEKOMLtiRRF


CT277 similarity
QKIVVJtPPSVDLTIBILRGLKKKYECNNNVFITEEALKAAATLSIlQYV11G1lFLPOKAIDL


NVLFSLLFPKLCYGCOAPGAYFCSNCLEKLLVEDREGRCLNCFRYLCSSETRLCSOCSPSLDIfaGARVRVNlIDQPTD
IJOa.EAEIENTKtJIICEQAIGTOEYPXA71GLR~EKKiRERLQ


SQLQAP'SLYLPSOTALSVYARACEQCRPALOFFSKSIAFtG.ASt.DI:TPSCIAYITSTISRSHK~?llKl:<EIIQ
VPVt>EGVAOVVSGOTCZPS71RLTEABSFJG,hKLfp'ILRpKYIOpII


KIWEVAKLEKLLRIPLWPWLPKKRQIEKLPKGEGICFL511YpL~KWMQTIVGGSASPLtlilVT8ICRAIItRSRIGI
KDPNIIPICSFLFtI3PL~\/GKSL~LIIQQIIIIEHF0GA~11LI0~


VSISLFLSQNDQ
SEYI~IfFIUITKM~aSPPGYV~tL~HLTEOVRRAPYCWLFDEIE1WIPDIJ4r0IL


OQGRLTDSFGRKVDTRHAI II4li'SNLGADLIRKSGEIGFGLKSHH9YKVIOEIfIAtAIOtK


CPn_0127 472157 173715 .
HLKPEPINRLD6SVIFRPLEKBSLSEIIHLEINKLDSRW041fpNAtJtIP06VISFLVT10C


nqr2-NAI7H (Ubl
quinonel Dehydros:alase NSP~4DMPLRRVIEOYLEDPL
'~YR~'OFJIRKLRATLVFNRVAFEREEEOpEiUIL


aVCYVFERVEASTFLSITHLKKFINSLWKLCpQ0KY0Rf'TPIVDAIDCFCYEPIETPSKPPSMiLPS


PFIRDSVOVKRWIM(.WIALFPATPVAIWNBGLpSIVYSSCNWIJIEOFLtII~3FGSYLS


tvYKEIHIVPILWEGLKIFIPLLTISYVVOLTCI:YLf'AVVRCNKIAGOLLV'l'GILYPLTLCPn_0139
185155 181731


PPTIPYWNAAIGIrIFGIWSKELFCGTGMNIWpALSGRAFLFFTFPAIOt~DVWVCSNPyebF-PF-loop
supetEamily ATPasa


GVIKI>,SLt41001SSTCKVLIDGFSQSTCLOTLNSTPPSVKRLHVDAIAAI~021tIPHVPirODNLTLPNPP_OVR
EINOOlYIVANSOCVDSSWAYLPKKFTNY1CVIGLFIODafEEDSDOCLC


'JIHSQPSIIrII'ETHPGWVLDNLTLTOLOTFVTAPVAI':OGLGLLPTQFDSAYAITDVIYCIGSSTKDY1CLNERV
CLpLDIPYY1VSFAK6YRERV!'ARFLKEYSLCYTPNPDIi.CNRCIKFD


KFSACNLFwK:NIIGSLGETSTFACLLGAIFLIV1GIASWRTNAAPCICaFLTGWLFKFISLIAKKV:ELCGDYLATGH
YCRLIfI'ELOE'IQLLRGCDPQKDOSIFLSCTPKSALfONLFPL


'ILIVCQNG/1WAPARFFIPAYROLFLGGLrIt~GLV!'NATDPVSSPTItIILCKWIYCFFICFHTCE191KR'F1RJ
IIAApAALPTAEKKOSTGICFIGKRPFKEFLEKFLPNKIGtNIDND~I'KEIV


IVIRLINPAYPBGVMLAILL.GNVFAPLIDYFAVRKYRKtiGV~HOGAN'f'fTICORRCLDLGGSdIPCYVKiINfIE
ENSIYIVRCEOHPpLYLRELTARtIli


WFTPPKTCNCSAKVRYiISPDEACTIDYSSCDEVIfVRFSOPVKAVTPOOTIAFY0C0DCL


~Pn_0129 173719 .171481 ~~.lIILVpNIPSEC


-nqr3-NAat (Ubiquinonel OI(idoraduecase.
Cenma-


NMSicC;SXHiIIRINQTWYIVSFILCLSLFAGVLLSTI(Yl7t.SPIOEQAATFDRNKpNLLACPn_04:>
IA5523 I8ti077


ARIL.7FKGRFOIOEKKEWVPATFDKKTQLLEYATKKVSEVSYPELELYAERF~IRPLLTDJ1tM rotma
hamolog present in Genebank/NBL
as of 11!7/98


QGIfVFSFEEKNWPIEpFEKYQESppCCOSPLP!'YYILEN'I'SRTEHNSCADV11KDLStIrQIiSSNttf.'ILFV
SSTLNCVFPSSLPEESADLFITNKEIVAiGEXCNVFLTHSIPlOIL~UIIT


ALIFPI~t~GLYJGPIHGYLGVKNOCD'IIJLfTAWYGO,'ETPCLCANITNPEWDEQFYGKKILLVIVALA:IAIICL
GCYSCSILLIAVCIVLTLLTLLCipALVGFIKFLROLP00t.IftTf


FLpDS.~CT'INFATTDIGLWKCSVRTTLCp$PKrILSAIDCISGATLTCNCVTFJ1YVOSLQFIREK.IRPEs"SLQL
YTNAVRKTTQDTLKLYEELCDL.iOKEFKLQSTLYQKRFEL$INOfC


Av.YROLLINF~tIt.THEKKTCE ' K'fNON


::Pn 942'1 4711:611 .1'571 CFn_0119 INfi9Ht 491:74D


mlr4NADH IllhiquilKmel Rullwtase tM rot'::c llaaalol tr.tctnf in
J rayusAink/EMLL nr. of II:7/7R


KRNPFMfwKK::YK::YFFDI'LW.~.WJpILIr\ILCIC;.ALAVT'tTWPAIThK7IA'J.~.IV'~':C::WTIM:
IKMAT::VAP;:fNPF_:::PL:IIATEVIlILf?iAIITQMiPIPMW1ETPR5KLS1'IIN


::FFV::I.LRKFTPtr.'VRtIITQLIIt:LFYIVIDQFLKAFppDL::KTt.VFIK:LtITM'.tVN'fl.c.'FA:
:::.:.LTIt7GTt.'..1~7YY:YTf~IWIIty:Ir:II:fIVLTt.ILALLLAiM.KNKQTIl'KL


~~Ita:LARI1VTPIPAFt.DC:FIISCLt:'riaiVLLVtt:JIFELFY:FI'ftJ4c:FRtIFQFVYA::t:fIUEi
:a~:.I:SIC:x:FV(IRYI:U4F.~.t'tY.aVIILtELT1'r~EKTRIUIEtEAKK&:IONLEL


11f'fl:IVil(.::It4Vl
~P!:AFFL4:INIWLI'NIRD:.IfYPYRYfTFI:/i::YIJ1~~K0PKRK:::'.('t:.~.FMP:'.IKIIt::Y.N
F'JILFf%:


.'I~m_nA 111 .I-/512! .t~ln.1)m r:lal 114 4: I>:na7n .IH'/H SN
1


IHIf: t1At111 (lll.i.llllll<IIIQI T1111'1 :.%tYlfINef 1.:.11 t'fOCbln
IkIIL:C.1!:1, 5


t'MwIt:ArIW(lNFr:ILWAAFtONILLWFtt7tiewYt.M:::TRV::fAN:taat::VALVLTVT1!WktU.::4F
KW.t'111Mt'lu:llVl_Tf11'IVyldUy:IIlt:liArJ:HD1'1hIF::A~llnyfl.KVllO


:::(IIWt'VIL\t't'PaKAI:IW(::P:L.A:iVNLt:FLELIiFIWtMFI'Qtt.IiLLLEKV::RNLYAKt'KKL
:fYrl'tI:YRVYt7ITFL('.TLII'rt:II:Y:Lt.Y::Tr:Ytt:AI~IVWKt:::Ltl:a''1'pptOLt:


t::ll:It't.lLtAVIVt:AIItY:VLF(:ITR::YPFIf'M4IF::Il:/VtXV,l~ll~Il.AfYif.ATIKEKLA
1'WATPVL'.'~FYNYVLI_:Lt:AYTL::LKtIWI.4ra
If:7a.VIl1KtItF74t:Yt:LYU:YL:x:KYOAT


'tltlYttly;Nra::Fl't'ha.f.WAFN::L'tt7(DI::Kf.':AKIVItAit.t.-
fEl/VtllM'N('LKC:~t:Itt::A:F":/TNETt:L110RKTYlII:NSV::YKA'flirt:rtdY:lYl~/tIF::
IGYR:.T::VI:Ntax.AY


92


CA 02350775 2001-05-11
WO 00/Z7994 PCTNS99/26923
:ltKf:F'frXI::hHDISLADDN r:FYAISPNLI~T..~it.,.~ttE
t:R.~YNAD:..':7ltF~F
ft'
JVY
'


.
.
l:hFICAt
NL'PRFRKKLtIYIIHLt~::PI;IFF:f


NNNKT:iIITFKT.':AFFTYI::AVIdIF
0llg -~ ~[' 9MA7t X a5(~136M1 p
.
Pn


_ _
:
~
y7t7Cy9>t.: 'lt0teet Pf01lM w.,l...__


'('~_OId: 4N77h1 1RH5ZN FLOPSRREIHEWK
r...GS.SLRII~LSPlOOPEOCHFDVVC:.FLIIPEuLTMRSMI~R


;.TUUe nvo~chemc.ll Pcol9ln
IVYCONRWEDAAIP1JLIKKp'I'IJICLIlf'fDGElIIKYSwDlDIMiCIFICVDIIRA~PE
ARKLKNCAKSYPRTAL1IEVLVSSVt&AL 'IPSPSOltHlJttlAPHLKNTRKFY
tIf:KQr'~L'fLtI.IFPERLL ItKTlEKCNAKAKC


. .
NILFOTt7MClKt tCVYLKDKISVSKHPFIENlEF
LHILTIA'J:ICLVFSLVFI LDLRAPSWf:JDSHdILOEI:
HLASYAIMIft W
SC


. .
O CRL
KvtLtFCA..~TfMLTLPLAALFtIAIKTK PTNOELIDDIVFYfPOVICGLYAAGCRNLOLDDCA
'1T::/'fLFriMKNLFPPYEPPP:iRPHTPPPl710E'NPLISESYFD.._ ...,y_...
.".,.. fi
y
,__.....
n ,
_.-.....
.
.


PPPWFtSG:Li.M. ,
.
.
,
;r
.
~
....
..a.~l.;t..
. ' . :.. ; F w\"" ....-.::',::.......;
;;.:.:.:a:.' .' . . . ,. ...,.
. :.-a_;
... ..;i~:


1~'; i .,; . I;. . WF~ECIN1RK:E'at:''~'r:AFlihE:AKtI',Y,;
cai P=oter,n . .
c
':OOSCnacJ


.
. 507231 505330
VDsIISOPPINPLCOPOVPM~PSrOpsIVKRLKTSSiCLFKRFITIPDKYPKJOtYVYDT


GIIALAAIAILSILLTA,iGNSWt:IALAPAi.AI&ALCVTLLISDILDSPKAKKICIJIITACPIL.0449
LO tlrana-shift vitA 01511
10-PMP
palp


LWPIIW1IAAGLIAGAFVJ1SSG114LVlANPMPVMOLL1VCLYtNSLNKLTLDYfRREN_
_
EI\Y'IGFR~ODISFSNNIVQLTI'X~O~IS1LAAGECSLSAEACDITlI~i7IIVAZTPQ


LLRMEKKTOETAEPILVTPSAI>DAItKIAVEKKKDLSASARt~~(TJL9DAQD~AAILL~IL~TTKIWSIDICSI'AI
!:11~.AAISCHSIPIYDPITJ1NI'
NPEHRRSFGSLSRIKTKPSD71ASTRPJ15ISPPfI~DIIDPYNlI~LRS$SFVLKI1GVTLDTKGFIQTAGSSVIIDi
IO
TAGfa
T
TLK
PVIL
'


AQGSIFYSSR .
GSGASSAFTPIMPASSRSPNlSiCfVIJtPEPVYPKGGKEPSIPRVSSSSRRSPR~KOS
JADfIL
O

IVFSC6ItLSEDE71K
TTLK7f$TCEYPLT'-'r'SIPVDSLCiflGKKVVIAASAASKNV~PIGLLDIIOGN71YAI~L


OOOONODEEO1CQQSKKK~KSNOSLKTPPPOGKSTANLSPSNPFSOCYOERCKRKHR10ACCKTODlSIVOLSALGCAT
TTDVPAVPNATPTltYCIIOOTWQfIwVODI'JLS't'PXTKTAtt.A


9llMCYLPNlLItOGPLVPNSIiiCSF8DI0AI0GYItRSALTLCSIJRGfwAAGVANIf~


CPn_0144 90365 191507
IOfGZKRKYAMK$GClxI0GAi101'CSCM.ISFAPCQLPCSdfDFLVAIWITd?'1tJ10RlYIO


pmp_6-POlymorphlc Outer Membrane
HITLCSGFI~KLPGSWSHKPLVLCGOWYSNVSNDLxfKYTAYPiVKClN
Proce>,n PNL4LPI0Y
SLPWLLTSSALVFSLHPLMMtiTDLSSSi#.tYENDSSGSAAFTAKETSDJ1SY
R
D
F
K
I
OIt
fDOSNL


KAFPORHFO(Y S
CTTYTLTSDVSITNVSAITPADKa~CP'TM'CCALSFVGADNSLVIpI'IALTHOG71J1IIM'N.
I
SE
C
E
S
O
M!<GAS11H8YPE1LI~~~IXLM'T
CItRAG6N
RNDPKCTT
ILVISCA
d~ILiW
OtIt


LSlSGlSSLLIDSAPATCTSGGKGAICVTNTDGGTATF'fDNASVII~KNCSDfDGAAV.
r
T
i
KFEKISDCZIDISYDL?LSYVPDLI
NVDLOGKF
!
'


TA ~OFJlIYRGSSRIY
SAYSIDLAK'1'1TMLLDCfITSTIO'1GCAL.CSTAN'1'NOCNSCi'V'flSSNfATDKGGGZYSKO
YAPSPNFP.VIL


EKDSTLttilt~lfGVV1'lKSNTAKTCCAWSSDONLALTGNTQVLF0~1KT1GSAA9AN'LPECC
SOAll1 5071A0


GGAIGCYLATATDICfCLAISQNOE?ISF1SNITTANGGItIYAT10C1'LOCMTLTIDGf~IITCPeLOSO
TGP 10PolymorFhic Outes Meabrane Procaia
' PAD


1~NTNLL.FSGNKA _
ACCGCAIYTETEDFSLKGSTCNCISTNTIUtTC~LYSI~SSLSGIMtSQISWLVLSSTLACFTSCSTYF1111TAENIG
PSDSIDDSTNIrrTYTPIC'f!'iTGIDY


SNS&ANOEGtGGAILAFIDSGSVSOK1GLSIAtJ~tOEVSLTSNAATV80GAIYATKCTLTGONLCDSMLTISCI'SDT
IESLSlAGI4r~tSLSPLliIKSfAEGA7ILSVlTOXHi.
' TL1GDITi


fN .
NGSLTFDCiPI'AGTSGOAIYTETEDtTLTGSTGTS?ISTl~T1'111ITOOU.YSIt~INSGS~SLIGlSSLTFWtPS
SVI1TPSGKGAYIOCGCOLTFDNNC1'ILFICODIfCEEp10071ISTIC~Ii.


LLFSGNIfATGPSNSSANpF.CCCGAZLSFLESASVSTKKCLWIEQIAJIfSL6GNTATVSOGSL10181IGSISIaiO
tSSATGKXi0G11ICATGTVDIT'tIKTAPrLlSllIIJ1EA710GAIt1$lClt
'
'
'


71G1tIxT
Nl'SLVISDtNfATAGNGG71LSGDiIDVTISGNOSV'1'lSGNQAVANOGJ1IY7110Q.T
IITlSTNS CTI7C
J11~
AIYATKCALHGNfIT.TFDGM'AETACG71IYTETED!'rLTCS't'Oi


NGNfSPTKNIGLVFSGNSATATATTfTDOIIL~ISESDIATKSLTLT1~SLSF>


INM'J1KRSGGGIYAPKCVISGSESINIDGNTAf:SCCAIY51GCSITIWGWSP1!$1SCCLRSOOOOVSPFL?I


KGGtIlYIADSGELSLEiIIDGDITFSCNRJITt~TSI'PNSI1QGAGi~ttITKLAAAPGNTIYt'
50A158 51I05A


YDPITNIJIPASGGTIEELVINWVKAIVPPPQP~'.PIAsITPVVWJIpANPNIGTIVISgCPJL0451
11FMDJ1C1'TLt:T 10 IFralne-shift: with 01511
10-PMP
pm0L


GKt.PSOW15IPAN1T1'ILNOKINLaGf~NVLK~ITLpVIfSFIGOP031_
BiEGSPYDNPGL .
lt1'ORV!<IKILDSCIVItNi.IYLICIYIDANSSIJOiKSITM1C1'SIPWV4VSSVIJIlBCIR.Q


TTlt~TfDOSII8.101LSVFK.DALDGKRMIT1AVNSTSOGLKISGDLKIf?SL71NEELLSPDDSF7i~tIDSGTIT
PKTS)1TTYSL1'GOVFFYEPGITal'PLSDRC!'KdtTDN
LVAWO
'


LVPKVGIIGGKVT
LTFLCiiGNSLTFGFIDAClt1A0J1MS'1'TAN101LTfSClSLLS!'DSSISTIYrT00t3'1'LSS
KAtdin.PFLDLSSTSGTVNLDDFNPIPSSNAAPDYGYOGSWLS'1'IAOOtIIJI'1TJIOA
WITLVPNSLWNAYVNI1t5i00EIATANSDAPSNPDIWIGOIGNtIlIi00KQN
AtGYTPKPEt


. Si
KtTtAGFRLISRGYIVGGSM:TPOEYTIAVAFSOLFCILSKDYWSDIKSOVYAGSIC710SSAOGVNLlliIRKLVII~
ISTADOGAIKC71SPLLTCTSGDA4T5l
7AICM'KA1DSP
i
C


YVIPLHSSLRRHVLSKVLPELPGEIPLVLt(GQVSYGRNNtOd~fl'tIQ.A!>Nl'QGKStRfDBHSJ111RTP
4
RTJIt~tI;DYVRF1.SNIA&TSOGiIIDOf7GTSILSNNKFLYFEti
ELIISNNKTL1FASNVAC1'SOOAIN7FKKLJILSSOGF?EFLANNVSSATII~WISIIfASC


FAVEVGCSLPVDLNYRYLTSYSPYVKIaWSVNOKGEOEVAADPRIIDASIQ.VNVSIPhaELSLSJ1E1'GfIITFVRI
fCLTI'ICS1'D'1'P1QN71INIGSWCBCITCLII~IITIlIYDTTI'SE


LTFKHESAKPPSALLLT'w~YAVDAYR~iPHCLTSLTNGTSWS'fFATNLSRQAlIAEASGHGTSSOVLKIM'~SAL'J
I(~tPY00TILF9GLTLTItOQJIVA~i.K88l1pPViL100KIi.LO
'


RYSF
ImV't'LESTSPS01JIGSLtGItDSCrlLSTTAGSTI'ITNf.GINVDSI~L.ICQPV8LTA1~A8N
LKLLIiCLOCIASGSCELRSSSRSYNANCG'1


191739 197579
KVIVSGKLNLIDI~tIYESI$tPBImOLlSI3*ITVOIIDVD'lIiVDISSLIPVPII~IIISE


CPn,-015
YG!'ODO'~L~T~ATA15r1'ILTGIVPSPERKSALVC?11'LwCVITDIRSLOOLV


pmp_7-polyarorphie Outer Ma4brane
EIG711~tNOGIWfCS$!ItNILNKTGDH4P1GCFAIiTSOCYVIGGSiUftPKDI7GlTPAICN
Protein tIBESAIEKIPREIPGiILD
FNlLVSKIICLOMKSSVSWLFPSSIPLFSSLSIVAAEVTLDSSNNSYDGSNCiT!'lvlr~n't


DMAG?TYSLLS11V5lQNAGALG1PL71&GCFLEAGGDLTPOaIQHAL%FAFINA05871GTW
' LIRRD~CFIALO'~AT~'l~SNTLOPONYLRLG
VOVSlSHSI~CIItYTSLPES' ix.SI110fiIGL~.PIYLSNPNPLIRITIl011I1tC


ONISSD
MnVSOftSFIESSSDGRGlSIGRLLHISIWG71KPVQGDIGDS7fI'YD~OlIVaWYRI$1
VASTSAAf>a(ItLLFNDFSRLSIISCPSLLLSPfGOCALKSVGNL&L1GNSOIIF1tJD
NGGVIHTIOJlLLSC1S01ASFSRNOILF1G1~GVVYATv~TITIINSPGIVSPSONLiIKGS


GGALYSTINCSI?DNlQVIFDCNSAWIN~QAOGGaICC:TfDKTtIfLT~ILSITN~APOSTItTLVNSPDSTiKIROG
tdSROAlLLRGSNNYVYIISNCGJGHYAI~
'


LT7~CilISCLKVS1SAGGPTLFOSNISGSSAGOCicGG7IINIASAGEIaLSATSCDITIrHI'ttKI.R!
VG


NQVITiCSTSTRNAINI IDTAKVTSI1W1TGOSIYFYDPITNPG'17N1S'1'OrWLrtLADANS
PGS 053 511101 512A60
' CPt>


rlKDLTDS _
EIEYCGAIVpSCEIGS~1IAANVTSTIROPAVL.7IWGpLYt~tDt:YNDmP 12-POlymorJhic Oueer
Mrnbrane
ID'1'l9DS Protein Itruneatadl
TFaADtDIISLSGTIU


.
f'NEEI'KrILPNlLTCSALlLU.PAMQVVYLNESDQYNG11INNKSGEPRITCYP~'ISYI
RILNDGGTTLSAKEIINLSLNGLAVNLSSLOCTNKAALK
FYF1JIOJLKSAS?YPLLELTTAGANCrITLGALSTLTL0EPE1'HYG7t0DNNOLS>ttANIITSS


KIGSINWfNTGYIPSPERKSNLPIiJSt.WGNFIDIRSINOLI>~l'ICSSCEPPERELWLSGIAfLDDVRISNVIWD0
8DJIGVF I$~ICIyIZT.T
'
AFTSJ1PLLP00DGaIYStGSVMICJSEIYI'FCGNYSSIiSGSi171IY1'PYLfaSKJ1
LSNf3Yi


GKNNGDTYG .
NFtfYRDSMPTAHGFRH15GG1fALCITATTPAEDOLTIAFCOLPARDRNH11SCCIRYL~FRONVISOG7fCCA1ST1
0iLTLITAGPSClC~NtAYNDIItiSl10CJ1IAI
' 6RPSVM


SYLHTDFAIMICCYYTDtJ .
ASLYFNtfIEGLlDIANFL4JGK11TRAPWVLSEISOlIPLSlDAIC!APOGSISISVKSGDLI!'tKiNl'ASOI7t~
f1'ItIZISIHIQ~"IIQlIDJ4RJIVfE6DVYlYDPISH
NAEGR11FN


3IIKGsSNRNDJ1FCJIDi.GASLPIVtSVPYLLItEVEPEYXWYIYAtWODIYFJISEIifIKITDLVINAPECKETY
tJG'rISFSCLCLDDtIEYCAENLTSTILQOV'1'L7100TLSLSD


KSELINVEIPIGVTICRDSKSI(CfYDLTLMYILDAYRtINPKCOTSLIASDANWMAYGTNGVTIALNSFItOGSSTLT
MSPGT1'LLCSCDIIRVpNLttILIED'I'DN!'VPVRIRAmKWILV


tJvRpGFSVRAUitflpVNPIiMEIFGOIAFEVRS6SRNYNfNLOSKFCFSLCKLKVAFFJIYwSVYDFPOIKEAFfIP
LLELLGPSFDSLLIGEI'fl.tRl~V1'1'EIIDAVR


CPn_0116 497602 500415 GFWSISWtEEYPPSLDKaRRITPI'IOn'HFI'TwNPEITSTP


pmp_8-Polymorphie Outer Membrane
Protein CPn_0153 513156 516152
LIEPIOtLSHKIPLHKLLISSTLVTPILLSIATYGADASLSPTDSFDGAGCSTF'fPKSTAD


ANCTNYVLSGNVYINDAGKCTALTG:CFTE'i'1'CDLTFfGbGYSFSIIffVOAGSNAGAAASpnp_13 -
POlymorphic Oucer Membrane
Protein
'


YASGKSTLSSACAIiJLTt%IGTILPSOtJVSN6ANNNGfLAPCFASTAl7YEVIMPSENF
TTADI(ALTFTGISNLSFIMPGT NCVLLYLFFYSLSLI:RIIWFHLYVOht(TSIRKFLISi


.
DDSSGKIFPYTfLSDPRGTLGI1SGDLYlANLONAISRTSSSCISNRAGAWILG100GVF
CGAlYSSAAASISGN1'GOLVIMINKCi~.'tr'iGGAL
SISGNI'SSITFTSNSAIOC1
~
A1TTK:'L


.
SFWIRSSAOGMIsSVITONPELCPLSFSGPSOMIlDNCESL?SDTSIIir~JVIPHASAIY
. '
.
.
CFFJ1SSSITa'ISSLFFSCN1'ATDaAGIttxAIYCEKTCETPTLTISCNKSLTPADiSSVI'0


CGAICAHGLDLSAACPTLFSNNRCCNTAAGKCGJ1IAIADSCSLSLSANOGDITFL.GKtLTGSM
ATTPMLFINNDSILFOYNRSdOFCMIRGTSITI>~ffKKSLLINfirIGSI~ICGAL1
'


STSAP'CSTRNAIYLGSSAKITNLRAAQGpSIYFYDPtASNIIGASOVLTINOPDSNSPLDf
INLINNSAPVIlSTt411'GIYOGAIYLTGGSMLTSCNLSCYLFVNNSSItS06AIYANONV
'
'
'


KOPLAWSCTLALKCNVELDNNGF?OTEGSTLL PPATPPP1GVSLTIS
'fSGTI'/PSCt7tLSADEAtWIDNFISIt dIf
C1
FSNNSOLTlOFIlfnlaPONSLPAPTPPPTPPAVTPLLCYCf
'


. JtCKGCJ1IAIPESCELSLS71NQG
MOP~fKLKAOTEA1SLTKLWDLSALEGNKSVSIETAGANK1'ITLTSPLVFODSSCNFYECENSVTPLENIAS,OGALY
GKKISIDSNKSTIFII(~fl


SHTINpAFfOPLWF?MTAASDIYIDALLTSPVpTPEPNYCYQGHWEATWADTSTAKSCDILFNKHISITSC'.'i""IW
SIHFCKDAKFA?IiGAtpCYTLYFYDPITSDDLSAASAMTW
"
'


iONtSIY00RGLtAASGTANPF L1LM~ATLJiIMN
TNTYflITCYNPNPERRACWPDSLWASFTDIRTL00IFlft
VNPKA.~sADGAYSC:.'_VFSGETLTATEMTPANATSTII~OKLEL~L
'


. fND7ILT
NKt)IISGTh'QAFRNKSItCYIVGGSAEDFSENIFSVAFCOLFGRDKDLFIVENTSNNYLASLFTODEKSWINDA.."
.TLATTNDAtPII'DGAITLNKLVINLDSLDCTKAAVVNIIpS
'


'fLOHAAFtJ'GLPMPSFCSLTLN.ILKDIPLLLNAOL.~>Y~fTKItDND'fRYTSYPEIIOGSWfNNNCYOpSIYCI
fO
I~GTGGLVNNSOIt:7HHGMlNADWpVPILELKATSNTVZTfDISLGI
NP
~


SLALYLPKEAPFF~.'1'FPFLKFOAVYSRQONFKESC:AEattAFDOCDLVNCSIA5AADCED
~
CTWEFTII7lTffftl':GtJNKKTGYLPHPERLAPLIPNSLWJ1NVIDLRAY60
CALELCt' '


. ILINTYTRITPDAALSIGPCQLITIfSKDYL
. CKOLSI1GITNF!!!?.NtfCCDMSYANMOCG
.
PVCIRLEKI~EDEKNNFEISLAY:ODVYRKNPRSRT3LMV:.f:ASWCSLCKNLAROAF'WS


SGEA>YELRG=AHIYNVDCGLRY3F
VGHrHSNVYFAR~.:NITKSLFC.iSRFFSCCfSRVTYSRSNEKVKTSIRKLPKDRCSWSN
~>IHiVEL '
IC
iHLTL


. PECRIFGHGHLLNVA
.
~NL~ELECNLPI'.LSSRILNLKOIIPFVIU1EVAYATWiGIOt:M
.
.


rn
VPUrVRFGKN3HNR?DFYTITVA'fAPDVYRNNPDCDTfLPINGJITiiIfSICNNLTRSTfi.V
r
447 SU05.11 503351


~ OA.iSHTa~VNDVLE:FGHCGCDIAttTSROYTLDIG3KLRF
,
tyap_r-Ftlymorpnu outer Membrane
Procein


F'JKPP IAL'fMKSS W IWFL IESSCaLPLSLNFSAFMWEt:III3I'fNSFSG(~TY'f
PPAOT CFn 0154 510179 51911'.
T


TtJAIJ:ff'fNLTf:DV:itTNAi7St'TALTASf:FKETTtaIL:F~NUYOFLLOtJIDAGANCTFl4
W7lymc:.hlr ~utsr MeaICrane
~ PtOteln
' m


sNIM~CAW P_
YFf~NF P
NT.WIYt.L::F::f:F:.'YL:'Ltv"l'fNATlC'K:AIIL."ff'.Af:SIOCtIY~.Lt:uA:CAFAETRLIX't
IP/PPITIY)GEEILLT::DFVI'.:xJFLf'JL;F
' CMiL:FK::~::F't:L.l


.".:t:::.:a.tIPNLTFAKNKATUKA:dL'f.~.ff:taTttAITt.N::A.:f:tSNTAAMICCIIIYTEAS.
'LNFIfJRAIT:.fX3At
.':.~.Fltf::al4:L:":f:::LSLTFTCCOAfrfN.~.tr/ALLSMETLTFKNF::::INFT(xJO.~.7ytL
:
:
CNta
C
'
:
T
1T
'
A'f
:
A
'
C~/LTL
~
~


.
r:CLf'f'fYDIVt'9::IC.~LIFTtNAVAY3PA.TIfTATPAITII/Ttr:A:.At.~~ITIk:I.TVEtJI
.
.
:
.
.
FIMC
V
:
,
,
AIY
:~:T.
t
Y.
::Ft.
.:alY:,I:
a':ALJ.x:DITFEX',ttIWXCAS
l'KNN::AIDTAAPLJ:f'.AIAtAL
'.~::L
::::IXa"rt
'/t'fitJL'/L


.
:a)::IYFF(:NIJVJF.::AI::::.FTAWKPiNNTATFI.iF3HtIFTSSGf;I:VIYW
. :::::I.LlIJHC
.
;I:TYALt:::tY:AICIPnI'FELKtd'IOI:IG'PP::YNt:TPNN1G
. :L
':::nJl1TlIJ::INI~.TIfNAKIIILR~::;)r'dlI'tYFYOFI'i'f.~.ITML::DAIJJIlY;PDL.ALTIP
Att
':i
VTf'
;
'


nf:f:'fF:w:F*L:a'J1F414~\ONLF"PlrxltLTI.NYJAt:LY.:F:VT't.VAK:F~O::fr:.~.TLLr%f.

y
.
v:
:
:
IIFTI1N:
LLD::NfMRH(X:AI~:AY'fLNIV:Ix7PtEF::RNRAHKtX:AIFt~:C
'NIVraI~\:~:
AI'fAETr


:41rN71-rLFTAIta'fINNLVINV~:'.LKI.-
fKNA'fl*.1'frJA::r/~PPI.::::L::LVDP:~Y7NVYED.
'/::WMII~'JY::LL'rI:rAIN~I'ANlllt'IDLrIAUPLF:KtIFIIIW:IrJ:NWAL::WOECffATK.~.KM.

::Vr:f~PAK(rr:.Tt.T::.rI::FY_a'l\FTf~IMLNfYi~:IRNAIT/EMIIIEIV.'II~SA(JtXi:RL'fF
Y
"


'fL'tWrwl;tNl'FlIHPf!r7rt.VANTI.Wt:::FVIriR::I(/~L%ATY'/Rn:I~RTR(aYX:Ef:L:iNFFf
ELLLPAtrITfILItTVKtA:7.E
DfITII.~.LifiT.~.1sKK3lTfN.VJ~:A:77::WFT.~.Y~:L.S
'
'


TYItIK.:F'ItIII::f4:YVVr:ATPrI
I::DtJLI'1:1AF'r:fjLh:Y.UI1UIIFINKNRA.iAYAA.iLf:AFMVDFTV;KIr\FOM
IIKIx :.ILYRD
l*ITIAIAWfrJII:F.A'~f:::ywLTII::xYTPII:LATP1
'
'


.. fK
(JIUA'rl:::a~::I.1.uYl.n::a:::E:~~PVLI'nAVt::yl'f.':YNTMKTY1"f~LtPKI:E:::I~fYNO
t:CIt:YIfl:E
f11
F'V:;A:~/IW:fYfJW::.r:AL1'LUt:111MPIN.YfB/t.W~I~/AIPIAVFKn:ATV
'
'
'
'


. fLVR:1
tnrl:lL:aUr:LF'HAYFI'FIKVF:A::YtilrrL::FKF:Ptri-CL.VK::FD.rI:DLINV~rVPt
nl It.LPE
F:ll~:::l TW::RtLI.IFAPUX:FFT:F.:P..AtffLYAVNN::h
IA'ft..ll'lt:Y~f:KW:a
'


. :7:k
.
HYr:(a'/::N::LWI::F':.aY~AF::UIV)INLLIIAlI~:4itTAICAWAWF7rfM!~~:11FI:F
In:t'fF'I:II'::IeNP:VA::Ylu1'fVIS'V:\INYftKPll~l>t~I'ff.t.t.Itlttl":WKTit.'rtJL
.:R(MLiIfsAA


93


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26913
Yrt:YL'AAL::NtfITDHTTII:L::f'V'OLICKTNAHF ....~LI.3FF~;0FPLYIiOKSEA ::Pn (IIA4
S v ',i-i~~~
..I:1YKAAW:'f::KNHLNT'I'ILTIPDKAPK.Spt~WNNNSYYVLLiAEHPPLMrCLLTRPLAOA No roDusc
nomoloa Dr:aemt :w:'.rw~dmK EMII;. n: ..
WDL;X:FI3At7~'LOWO~KfTIT'IDLQRSPSRGK.YNVSLPLCC35pWITPF'K1GPSTLTI
HRL~RFTE~ILtRI.IS1M9L~~MILR ~l~f~CKINN1~~
KLAYKPD I YRVNPfIN f',ff VI/:pJDEST3 LiGANLRRHCLFVOINOWDLTECIOAFIi'IYTF
UI:KNCPTNIIItV.'.'".' :LK.: v F
IRCGPF3EDAVPE.iEPFDL.i:'lVIfCDRSI:PGPTKKRS::,i~",C."yE;,PESIYPOSEP'.LM
RPRMLS
'I'n_0455 520 )63 517158
Na mrntsc rtnm"loV Dresant tn GlIteWnk/ENBL5~=all
as of 11/7/98 5 -Z~a
=~~ 015
.
'ar
n


.
..;_n...,., ...I,..;;..r;.-.Lc--v:'~:. .
--E.:;~rL:r.~.' .
. ._.. .
~ .
...
'
"
;
'


~ , .. r. . :.;r: .1;:
.:_. . :::u':':~ ;...,
;~.~ ~-:. P,.6EAACA': .~. _ ::u:t'
.;LF::F~:... ::.l~r:~nl~ .R.rl.ylJ..:.
:.?NFAL :
:
~
'
'
'
'


~"'KWAFUDEHLPWVi:iHIAYAEEIREKOEQ'IFIIXI'.ILTEEOIVA:.i.CMYSTE1WWF.
.''\ :li
::m
.,;;i:.ilk.:kVEDILKRQR'...iLEi
Ul,~t
:.~.i:iv'v:.'!':';:.i.'wi.:.,ii.:i:i:l'.fi~:il.


. RDCCKVHCDLPSAPFF
aLaAVIKOSVNRFRNPDLFAYERCALfJISVTDALVSYVSNLDIfIPYTSSOGIVI~SSIV


RTSOEHTLIVNCMFDKLASOIEFLCPSDVLPISGKDPLISDDEDEEWPKVSSAA08KD
573718 536537


K.., CPn_0466


pmp,15-POlymorphic Outtr Mlnrorane
Proelin


0456 521568 520327
TSIOtFPCf'CIGl.PfTPIrt.ANEGL4LPL.E!'YITLSPEYO~AAPpVCF!'IO~pDtJIIV~IW
;:Pn


_
DFILDYKYYRSNGCALTCI~.LISF21ICNVFFEIOIVCPNSGGAiYAApNCTI$XtrpWllAF
No roousc t>omolo0 Drtttnt in CentWnk/E7tBL"'
as of 11/7/98 '


IPCTFES%RKFIaffHCLIK:WFSWRHNFVOAFNFSRPLYSRITHPAt.CVIKAIPIVGHLV30CCFV1u1LJ1iIiKt

w,ALYTETtQiIl~1
TTNLVSDNPTATJYCSLiGGALFAINCSI?NNG


HGVDNLISHCF6RGVSNPGFPSDIJ1PILKVEKIAGRDtiISRiA'IDLKSLRKTIEtIEDLDKKGPIIIIfQNMLi4S
DSLGGS.LYSO~iSil'IIFGYSG71IOT:SNSIfSTQZLT=SSN


VEICOYpENPYA0M715SEYLKLDK,NiIVSEL.ciKAFSRVRNRITRSYSYAPTPOLDSIaIVGKKLIEIStc)SAFA
NNYGSNFHPGOOCLTTTtTTILMiRtGVLFN~BpSQSI~GtIINBKSI
'


tLLVSPEEOENLVRLANEYIOLYPKSKTTLYLLIDFDRaWVGDISSO~OLRSLGLHSELATRGGAL:liLS7YCSGNCS
FILSAONGDIITl4~IVfASIDiAWIPYRN
LIKIIBiWYFLNN
'
'


'IiCCL.SyLEPpGADGEDTKHFDt.IfVOCYGKDSYLREGKILOQAiiGTSLG1YPWDlJPMNTLPAIH$TPNlOJI4
IGARPCYRVLFYDPIFitELPSSFPILFNFE
fGtnG
NLFSGAMR~IIPt
'


SRYRSRLSLPINTEIfDICfELYKEISRTHNOLHTIJCMCLGAODSGLLLDRORLW1PL$QGSDI?MFPSYLRNISELR
OGVLaVEDGI1GLACY1CFFOROC.
LliirpGaYITTA4TIP11SST


i.FSAtVDIIKNISKKEL.REVSINFANDTSVECGCAFYFPTZITaSTITI2iHIAIDLPSI'SFQAOAPKIWIYPCK7
GSTYTEDSNPTITISCILTLRNS
HCHSYLADLTHELKI:


.
IBIEDPYDSLDLSNSLEKVPLLYIVDNAAQKINSSQLDLSTS~I9GOlYGYOCIWSfYWVE?


0457 527886 52212D
TTITNPTSLL.GWfKHKLLYANWSPLGYRPNPCRRGEFITNAIiASAY:ALiIGLJiSL6SW
=Pn


_
DEEKOIiAASLOGIGiI'YHOK~OGFKCFRSIOnGItSA'I'fGTSSOSPNPSG:FAQFISKA
No roousc t>omaloQ present in Gentbsnk/Ct9L
as of 11/7/98


VFLPSRVMASCLSAwFSIVREHFYRAEDFSL?FC11RITEFVLGVIKGIPVVGHIIVGIEwKEHZ~NSTSSNHYFSOMC
IIZICLFKEWIRLSVSLAYMFfSENTIFIMYOCLLE>a~CSF


LVSRYLESPVfXPTFVSDWSLLKTEKVACRDIiIARWETL1CR0RVAVAPIftIF~KVIKiICIHNlft'L71GALSCV
FLPQPIIOfSLQIYPFITA)JIIRf~ILAAFQESOWtAREFSLiRtPL1'0V5


PVHPFOGIO~EVLTLYPEVODATf.GWFSKIRNRVRQAYL.OAFRPItLOKIYIICN~IPLP9CIRASW10~RIHRVPL
VWLTEISYRS1'i.YRODPELHSKLLI50C11~'fQIITWnINIL&


FEVDDFLNLARL.QJtTORLYPDATISLYLTASGGRNAMD)00RICi.SDCELNPKIACLDFIKVlaPl1'IOVFPKVT
L.SLDYS11DISS5TLSHYLNVASRIOtF


tIOCOWKQ11TCDCWHVYNGHDpCfLNOIOEELEILSGECfPNIHVCOKPLSOSLWDTSPP


SSLEMKCDKEItALCYSELtKEOLYSRLVYVGRSSVLSLCIGDSR~ILID)PIDIVNAPLS.CP(~0167 536528
539131


~HYCHSYL11DLENPGL4K1'IL7U1FI14PKELSS1'ILOPISLNLIWSKTYLRQH!'CFFER


MSRSDR1JVVVWCDSWWCfDWKI3:PSFOHFINLLDGRCYSNFNIFAFRSN6MM.ARIL


NFSSOEKAP'fElIFCEDSVSOGDIRCLHL715f:GMLCOXECYAVWY'fSCCANF1?~tVLTL


ERFSNLWNRKHGLWI(AEVRKpK0EA71LDODESEIYVCNpL?AOpNfACS


CPtI_0468 539608 50132


CPI~0459 527062 526619 prnp_17FOlYlno~hic Ouctr Mtmbrant
Proclin


No robust hdsolog pnaanc in
Gtneberlk/D!lBLLIYKLLDNKLMIFYDKLYFHIMnMFMttPICLSILSTALCCSLSONEVPItL71SC01IS1110
7I
as of 11/7/9A


STKIQMHPGLRNWRTSTNKLREE7CSVSFRtYFMYlICDKIVApICifLFTLDAVIKQAIhIRSAFNTSPSPRiaN1'P
EFLVSSFRP9NLL14GFOHDITODITITGNSI118VIDYlMIYfD00


SOEKU1LFYVESt3ALGREIKVSLEEYIOSMVICILGSOATKKSFKPSVDFTPLEOALQERCILiICKNLt'IS~Rt~I
LSFWJSSfISBGCALYSVRC~'IISi~NYSFISW1J1SLJ1Tl1'L'SO


SSDDI)EDATATSTA:..71TASPTIfIOtI~E
FOGivIHAL~DSYITNNL.CECQFLON11SKNRODAIYVGVSLSITDNLirPIVIKKNO'I'L.tDSS


FOOGIFCRAVNIERNYpNI0IN0NASGOGVVYFLP


CHy0460 527810 526992


No robust ttomolo0 present in fllntbar1k/003LCPIL0169 510399 541160
as of 11/7/98


VIQNLLNFALEETPSISVOYQCOEKLSPCDNSPEIGK>OCRWNKLESFSTYCSLFNSV107Hpmp_17-
Polymorphic Ouilr Mlmbran
Protein IFrm!-shift with


YKIl~IGIONSLSGWLLDPYRVCAPLSSPYSCPSYLLDL~1KELARSLLSTFLDPIOiLTS60169)


TFRSVSIHFGEISSFCORWSEELSRVLHDEKEItttVAVIfiElD7IKLL.EECGSPEALSLLCEDLCFA1'~iIFSAL
GVIISSNKEIIEISNNSASSINTASGKLYPOIxCDfCTSLVIt?ItiPIOG


RESGYSYIl'IILSVSPELIISIfV0ER0ILRRDLOGRSF'IIIMITDLPLGSEDIRSIAL71&~tILIFMIIfTAIIL
SGCAIHTRSFIFQEBiGPfAFINNSATSGGALINLSOIOSTPON!'ILBADY


LVSSSLDAADACASGCIfVLVYENPNASWJ10ELF3JFYKQVEAARCDILFNNfIfITSSSPOPCYRNALY11J1PGIN
LKLGAROGYKILFYDPIdIDOIZ'fDPIVFN


YEPtHiLCTYLFSCINVDSffATNPLNFLSKFSNSSRLERGVLAIEDRAAISCRTL.SQlGDI


,0161 528617 527811
LRLGNAALIRTKCPCSSINFNAIAINLPSILOSFJ1SAPKtWIYPI'LTDSTYSBD1'SSTIT
CPn


_
LSGPLTFLNDFJJE7JPYDSLDLSEPRRDIPPPLPPRCDCKIOdNYPESHCRSHELR
No roousc hanolov prestnc in Genlbank/F318L
as of 11/7/98


ISIVACPSISSWt'NVRpHFVNAFDFTHPVCSRITNFALGIIKJ1IPVLCHIVlxIEWLIS


wIPRNTVRHCRIFTSt7VSSAIKVEOTRGHNCLAPLEAYL59LRVPISOEDLCKVfIGRTPEDCPn_0170 51357
512532


?FJDITP1'EIVOLLPDEEL57VDFJIIAGVRSRLTYAYRSVEKPMIODLALVCFCLRD.SADpmp_17-
POlymorphic Outer Meiebrant
Protein IFrelne-shift vlch


LINtVRIJUJGVONHYPHTKVItLYIJIKNLAOVWDCEISEEEKCOLRALGLDPKIESISL'fS04701


ACLPSVPEVATVDFMITCYCKDQEVpDP
ISLHLERZSPLLYLLDVTAKKIDTSNLIVEN9JLDEHYCYOCIWSPYABIET'1'1'1'fSSTVP


F.pTTrlNHROLWDWfPVGYRPNPERHGEFIANTLWOSAYNALIGIRILPp~iLK0i0L'G


~Pn 0462 531121 529037
SGOCLGLLINOHNR)~GRKGFRNtII'IGYMTTSAKTAARHSFSLCFApMFBK'lRER08PST


tlo roousc homoloq present in
Gentbank/EMBLTSSHNYFAGLRFDSLLFROFISTCLSLCYSYCDtIHMt.CNYTEILKGSSKAFPNNHTLVAS
as of 11/7/98


LIFYLFLNLYLACVRFHFCCZIFDPNACYISIWISTVICQNFtRAFDFTRPG'rSRITNFALCt.DCTFLPARITRTLE
L.OPFISAIALRCS0ASF0ErC0EtLRKFI1PKHPLTDISSPICFRSE


VIYJ.IPItGCVSIICVSWLVSTCSARRFCKPAFTSDVASIVKIEKTRL'YNPLAWVE0YLR0WKTSHNIPNLWCfEIS
YVPTLYRKNPt3~IFTTLLISNCTWRQATPVSYNS11AAKIKNT50


LRVRLPECDLCKIHCKVSRDYVCDRTPOENLNM/PHOYLGEL.GRAFYC1RNRVTKAYORVLFSRVTLSLDYSAOVSSS
TV~YLKJ1FSNC1'F


TPLEYPCLTLVGFDILDPEDpVNFVRLANGIpTQYPOTQIKLYLISIOKIYAi0COC1'I50


EKEQpLRSLCLDAKIKCVSAPAt.LLpKYLpSENLPSCDLLINYYCKOpSVROVDSIKSLLCPn_0171 5J2561
545401


NL.iSEHIPAtSVTYRPDDPFYSYYFFPGSpOGTAPDORiPWSlOEHLQI'Y1TT.SNPRCDRpmp_11)-
POlymorphr,c L'ucer Membrane
Protein


'IAVHLGMEDFASC'JFLDPLRVSAPLSCEYSCPSYLLOLKSEELRCFLLSAFIDPNNSGOCTVONNRS4iKSSFFVtG
ALI4:KTTILLNATP~DYFDNpANOLTTLFPLIDTLTNfIPIfS


NPRPMSINFCNSPLCORWSEFLSRVLNDfTEIfHVAVNCMJPOLIKKSFPSHSLSLLHtELNRATLPCVRDCCNODIVL
DH~YJSIESWf'CNFSpOCGA4SCKS41ITNTKNOILFLNSFAI


EEJ;ISYIJ1IVSV~.,pERTCVKERRILSSDPSGRSFTVILTDLPEGSSDIRNL.OLASDRILKRACANYVNCNFDIS
ENHCSIIFSCNLSFPNASNFADTC'~GAVLCSKNVTISKNOCfAY


.".~AIJ7AAL1ACASECKILEYEDPEpEWA00YASFYRNIDRAGDLOROCIPCEPLCVSASTFINNKAKSSCCAIC'A
A)INIKDNTCPCLFFNNAACCTACCIILFAHACRI~HSOPIYFIN


RVJLEKD)VFNLNAVIOI'AMWKFKKRDLPAVESQJILCtIOMARALEGYICSSLLVDCTiOPNOSGI~.AIRVtIC'F
it'.ILTKIrCC:.JIFNEfJFAMEADISANNSSCCAtYCISCSIKDNPCIA


~~/u:NVNV.~.FATLDEAVCrIACDSAQOAPSEENM'DDAFDNNTAARDOC)AID'T0.~.LTIOD~':PWPTNNO?I
110CAIMLRODf'.ACTLPAOOCDIIFY


NNRHFIf flfFe:N11V~17JCTRM::LTN~.A.~.fXat.~.ATFYDP
L LQRYTIONR IOKFNPNPOILC


't'n X14..) x)24911 Slll?1
TILFS:TYIFDT:TVRDDFI;aIFRN111.LYNl:fL.ALEGtAES~IKWKFDOFpC1'L.RLA33M


le> cnlnra. hair'luit t,l:tsenc
VP:TI't7l:F~':::::::a'.:::VINIh'NIAINLf.~.(U:NRVAPKLWIRPfI:,:::APY:80lNPIINL
in t:Pnadank/ENDL ai; ut L1/7/98


.::a~YEi!TP.til.l.l:fPNCRTPRVNI:;RK:IPIDETCtIAFV~MMK(X:VCTODAKELYTFL~.R:f:ffta.
t.pDE7JLDfYD'fADLrk;f(AEVPL.LYI.LDVTAK11INTONFYPPIfILM'1V11Yf(YQC


tlt7flnl~'.LWF::f.l:EE4:FLFDEKMLCAFf::EDH'l~tI~YLVDLVDt711LKDLLt.SIIFLDPOVw::fY
WIF'PITT:'.M'.':F.M~MIJIR'JLt::frll'Vft:YYVNf011cf.DlAL::AFYIaiPHNLP


rll::rva:f.IJCV::Itrin:U;:F::PLtjOKDFL:?IVLRDE'f~:KNWWFKI:VLGLPATOVCKLVEE:1'1'L
PYtYtt't>t::]LAI'fl:7:F.f"fet.F"/INJN::NNNAY.r:FIMFJrIY:Y.~.IJ'IT::HTA:aIII:IPf
IM


Irc:YD'C:'/LNIF:X:IY:Lw:::hQf.LPRKELECT~:R7FRVhAL'ILt:DTOMRSWf.JIi6RINF:XJLF::N
I.YF::'.IL'.i'N::VA::If~~fIAUJttIIIIWI.()IxIY:.'C.:A.sI.AY::Y.''.NIIIIIKICII:Y:
XrK


'J.~.I!Pk'LLVfd\YAAh~:Kt.LKIDHTNWRP:I'F::RFIADFADAVDV::M:fNSREFKLI'fpAElpCIVfNC
:KC'.Y::'1'fG:.IrIL:Y:::4:IJ,WR::P.ILIIFffFIOAIAVRaNVfAf(7f::X:GeARKF:1111


I1.l:Ja:f.hLt~.:Y.TIWFX:ft.lF!.'DRVTVTRIIFILtILGAAIK0AVH1'fIKtIP::LIDKOCFJ1LDKI
'LYNL'MII'I~:IS'::1WF_':Kfl:f.f'1"IYrtIIKIrIYVtVf.Y~JJFIPEIIIV::LE_::Xl::arld:l
.T1


LY'I'~J.'!.l:aV::'il.l:lV'1'N::IIExCT:'.KCfFtpl(EIIAff:.~.PLKCALFIGSDEDVPL'l'sE
DP.'if.AfHAIAfK,:uH~'It'IFIKL::Vfl.liY:J:::V::::::'(TfIlY1J1A4PfPKl'


I~IAIP.~.I~IJ:U::


: a'i, o1'I L '..1 ' ' 19 '.1''',n
1


94
C)~_0458 526314 521236
No robust homoloo oresenc in GentDSnk/t?BL as of 11/7/98


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
Nn r.arcn trlwoltxt pr~sant In r:enetCrl
/fl48L es o: SI;W 99


FYFflA:xilr7::::n.Ln. CLPPKtR.;PSPKIiELCSHCISLPPpENCEEGASCSSHIHS


)3.~.FLPEDU~:OSS~aSAAS.~sPCfP;iRVRSCYCPALKSF~.~.AE$'f"..OARE3'RGAPVRLCpn 01A l
.. L....S~' 1. ~. 5'~(L,1
" W . ,
pMSentN
No robust noACl~o4
s~f,
f'wtneb
inltL ~
fC;:~~5~;


Y~BNPSOCVPGTSSGPEPORLP$LPSVKKO .
.iKTITAOERROVDSSSAAATPJ1RVAEDAo ..
:iCTJr'RLVOTVRDRtVLPSGAPPTDSEpLSLYELNLRLS5LR0EI~iDIOSNDQLTPECKAE,
.
t
JCLRIECILMAT~VP4'CSJ:r:GEANSSNERFTERT::RMYYMLVL..'.A~.L:FIAIIIV:


:.TVfIQOLIOITEFOCCYMEATOSSVSLrIfrIRFKCVITSDEINSL.C3Irt.TDPELOCLlSOFPQVCWAWCrFAL
:,C:.:.:.:.LAtVPAV~.,GLVLI;KTLEPSREATPPEIVAVKE
.~.. CCL


':D.~.t4NLL0ETADDLFJ1ALSTft'RLSFSLDDNPTPIDNNPTLISOEEPIYEEIG~MOPORLGNEYWRSELIS:.
F:.R~~LH~:.:S~SIIDR.~.LC:CC::wZFI:;.KLEP:..r:....i.w.KKDMi


TRFNWSTRLWNpIRE\L1I3:,;...~tIL.iILGSILHRLRIARHAAACAVGRCC'"CRGEELTSSINI:LHLVROWN
L:~.~IrPE'JTAHAEEL:.LFLiEE?YY3FCTLK:.:RYCMLC~A?SPI?I


'::!1 .~' . . .. '~!!,~'.'f:rl.l.... ~~...:. ... . .. . .-. . ,. ......
...M:W :::'eJl..tty:!':!.ITY!'.-:'~, . . : ~!ir"'~nr c...,.. .
. .:
~
'


il': 4' .
ct ' .....~.. :.".111:x . . ...
.tr LV!.~::Hrrr:.-:,:c :~.s.-
:,.
F. . 1..:::a:r :::fPw.'rnr-L.......
:;~. .. ':'.~I::de!:-


:GDY6YpIT;iA.FP:iKDKNi'MCPRLATPALYDL6'wRFI:S.xSSR:iFSSLRVR9S:iPNRRG'lE:irlN~'NL
.iiN:.h'i:v~:.:':r:u~KH;,e';.i;'i.:.::~i:~inNy:Ni:y:LaKSE:LfaIE.iuFi,ia


VPLPPVPSPAMSEECSIY$I7MSGA,SGACESDYECMSRSPSPRGDLDEPIYnWI'PEDNPFTLIEYPLSYL:GWA~..
I.'CVi?;fEi3LEC0ADY'IS:.'.QCLCS14I$OFASRi.t7SGQKt,'IatPR


ORNIDRILOERSGCJ1SASPVEPIYDEIPWItICRPPATLPRPEM'LTNVSLRV$PCFGPNDVLSEOMVMLVHGLMt7C
V3FOCLKALHIfLTAVPORMWL~uAi,PLfESfPVFNRIBfFfT.G


MALLSfSVSAVNVEAESIVPP'tEPCOGESEYLEPLOGLVATTKILGp100WPPGG9NAfSLCD


CPn_073 519602 518070 CPn_0482 561764 560961


No robust: hoteolog present in Oenebsnk/00LattJ-Asnsne Peripiasntse Bsnding
as of 11/7/9B Protesn


~$IMAV~OCSRSPSPIPpNRRH56DGKVSPKDM&fJiTVS$$DSSLASOGPTIEUKANtrIYWICI'MIKOIGRfFRAF
IfIMPISLTSCESKICRNRIWit.'t'"N.
ATYPPFfYVOIIOC


OI~GTWCIPLPSVKEPGDSOTSORSGVLQRIWIO(11KEYVGFDID4.AKAISEKLCKOLEVRLFAFDALIIiIWOWRI
DAILAQISITpS110tICe'..~...r
rfYfKiCI'pOARPCVSSPRLPSIiVOH


GORLpCLOCfRDRIOKRSENPEADLGKH><RSYSDGDLDRVOtiDSNEDBTEDSRSEOCEPSPYYCOEVOEI14WSKRS
LE':PJLPLTOYSSVAV
.p"~r~.FQEHYLLSOpCICVRSFL>BTi.LSt


SKSSSPLSGVItCAVSKVHCJ1LGDIKDKFQRSASE~L'1'I'OOEDS11GDTVKaIR$EGEASIME11RYGKSPVAVL
EPSVGRtIVLKDfpNLVATRLELPPECWt7I~CGL,I11AKDRPECIG?:


SKSSSFLSGVRC71TSTV'pGJlL;.011KEKV$AFGEOMGAIRSAPGNIRTRIQRSSSDWLSOOAITDLKSEGVIQSL
TKkftFILSEVAYE


"NNKAAKIiLRKJILfNLEINApEQVSPEVJ1SRVOSLLARNE0LTN0EPP1YEDLITFVESN


VOSDSVEYASIVPOOGSOAPAET1(FJ1PETGCVLGSAJ1QGJ1WKATJIDfWSIfQAVASffRCPeL,0183
561830 564964


aIJISRLS$ARRESAVDDLASESNTQWFVEpEGVSNPSAAPSLSFAEEIARRAAOiSNRNANo mbusc hanolog
present in GeAebank/EFIBL
as of 11:7'98


OSLEKLF~d1V't'DPVIQOCLGLrIitSFAPECOIILIKICRAIfaIIFPIPPPNCPPNNID'NFYHLTfDTIGDPLL
LRILRTIGYVLTJfIIT.GL


C~ 0171 551600 519807


CT365 hypothetical protein


LKIIISISFHSTSPISNpPRYLSLSNATEKTSLLtINSRSLSPVPNSLVPSNPEDTCLRKS


IFTHSVTLFAGLWLLVAVSVWV71LTVLAPOVPOAILLGIAISCVOIOGFSIl0f5LVYN


VADYMSPRMQ1SSRIXS71LJ1VG1GFTVMGLVI9IVCANPVPOCYOGLVGSL4SSAYSROSO


TTL7LSFSIIYIYTKFFRSEKVAKG61Q.TEAETIKE71KKLHYISLSIATIGVCLAViGILIJ1


IAG112i.CG1IPATTAI IL1PPLISICLT1'VLQTILHSSIGKWRAPLLTOEIOIOLFVD'!SL


KDIRLEKZ.PPSEVEESEI'SOSVIEVPDSECIAE1'RIS7IEffIDTRLSLTTRQKYIFALATL


r r~~eIAAfIVrCFOGL7YMpVLLVASVG$AVAS\I1T.PIIVSSCfSYVAY0LItARiI4ISKL


RWKE7UfWOCRVROFLILxGVIASt$IEFNOIIWK'IYYIOCQIOKTDAAIREEVRNFLOmGLVN


SALVCGILL.CVC:R3IIQ3.ALVPAFApIVPGILALCCSTLCIAGSILT~BtTCCtVtiWLYDELVK


LYERRRIiRRELLYGpESKIGtSIATDLWEALA7LSt~HLIDLDGfVDFIDVDVDIDGMEKNOIOFLRJ1TFPNYQLIT
PJIILLDCEIESTPRNGyBIVfLTRI.NVCS1CGSP8$PT)1tS


DOf$KSFLIFGFtl-BIYPKLL4KKTPLJ1ARLDAfOREASHRFTOVKDIa.LLSLKYGFPL11T


CPn_0175 553850 551685
ATINOIfSRAROQLICNLL1DTIV'PAS17GFCRSG1ROSLIGYLHSLBSNELODILOmVlm071


glgB-Gluean Branching En~rme
EANDVAAK1TVPLOPFAVCLINSDRD1YSEEFtIENPVJN4iCFt11CISPERDRRIFLIRPP


PSHVDKLIHPWDGDLLVSCRQKDPfIKLLtcILASEDSSDHIVITRPCAHIYAIrrrrnHMIYOCLLpRHPRTCOI~IS
KPDSSNP


aVAYRSCLFfLSVPKGICHCDYRVYlIQNCLL,7IHDpYIIPPPLWGEIDSFLFHR01'hIYRIYB


RNG(IIPNEYOCISGVLFVLWAPHAORVSWGDFIVIWI~LVNPLRKISDOGIIdCLFVpGLGCPn_0184 561931
565821


BGIRYKWEIVT09raiVIV!<TpPYGKSFDPPPQt:1'ARVADSISYSWSOHRT~RRS70pS19Garo0-
Deaxyhepconate Aldolase


PVTIYMR.CSWOW00GRPLSYS
ITOIPLT4&SlJfrYpVTRSILKTOOLKSLVLHIVLILTF'1'YPLPRTLKOHPDI:'VFfrVpISPMS1G8ID8PILI
AGPC


G1I1D1Pf$RYGTLQEfpYFVDYLHKiNIGIILGWVPCHfWOiIFALASFDOEPLYEYTGHS.TLISYEH'IYSSALTV
IIFaGAQVP'RGSIRKPRTSPFSPOfR4CKECVIiiliK8J108IHOLPII


QALNpNWJI'FTFDYSRHEVTNfLIGSALFWLDKhBCIDCLRVOAVAS6Q.YRDYGREDCWITEVLOVADVEITAf7iV
DILRIGrIiO~IIHM'PI~d,OEVSKSHAPIIL~tSPMTLFJiWLt'.AIIC


PNIYGGKFNLESIEFLKNLNSVIHKBfSGVLTFA&fiSTAPPGVTImVDpOGLCPDIfKiBILYItaS8PSCPCNILCE
RGIRTfFJI51'RY"fLDLNfSIALLKEI$BLPVIVDPBNAilOItRiLV


Ci1ll~fFHllfIBLDpmRKYHOKDLTFSLWYAFOTSFILPLSHDEV111KTKGSLVfBC.PCDTLPL71511GLSVGA
DOiJIIIYHANPEKAt.GDAKpOIT?EELHLFAIDDIFCP$E8R71HAIB


WtRPAOIDtVLLSYQICLPGKKLLF?GGEFCQYGIWSPOItPLOWELI2tdBlYIOCfLRNCVSIt


LN71LYIN0PYtidIpCRSQECFHWVDFHDI~7E1VIAYYRTAOSNRSSAiJ.CVIiHfSAS'1'FPCPnL0185
565993 5662=9


$WLRCF7GhWCELLLNfDDESFGGSGKCNRAwVCQDOGVAWCLDIB.PPLATVIYLVTCi381.1 hypothetical
Drocein


OPIORTPiRVfLWRFHIKOACKFYLLpCLLCALYWLLKYCRKLi.IOGTLIH$IITL,Ypui,


$SLIDLLYOLICQLPAP1NE


CPn,-0476 551877 553858


CT865 hypochscical procesn CPeI-0186 56'799 566105


GRGRRADWCDCNIDIIIOHFRPYTMVPGpKLpIPGSLLYAOVFPTLWRLFSSKHEILNF7pThypotheeieal
Droline paraease


IAVOGPLIOtPAVFQDLHRGGI)1VT$EttYKYYLLPSGDClOSI100KLPSA710AGPLLSL1'ViAOIIRSLLKGNI
FHLGCGVLYE?U~1!'SLFLFPLIAIOGICLYVCRRGSKKVEORfSIffLRGR


HKHADWQNVRCRRDLKEILPLWFRFMM71PKCSYRDLETTaICSLVKTAfIORVLHRE1TESLKIFPLI~f'1'FIATO
IOOGVLLGAAEEAFCYCYGGILYPLGVALGLIfi.011CP010lWEG


IAPALLSIJILRGfSGCFLPRSYDEEFpGILPODCDPEGGVPFELLSYSPGMIODIFLRHpSLTTYVSIF111IfYGSK
KLRKIAFLLSAGSLFFILVAQVIALDRLf55fPFCKYV1'VJIAiI


COLVEILPALPPEfPCGRLIHVALPNILTLSIVWrKIITIRpVELHAEYSGEVFGKFCSSLVLASYtSTGGfRCWRTDV
I01GFLLIAVLVCGVSIhILSVPKSLSVLDPfOSLPC71KLSN


CSARLREWSERALSGSKRLSLGETLEIKAI:TTYLWDCFHKWIINPhLFNLVE0CMV0RCVAdSSpKRLAWMVGAGLVL
LLfNFIPLfLCStGAKACLIG


GCPLIDTIAYFCNPSLAAVNMAICVAILSTADSIl'TtAV50LIAEEIfpTWIPYYRYLVL


CPn_0477 556112 551844
GLJ1VMPLVAIGfTNIVDVLILSYSLSYCCLSVP~CfYLLAPKGRRVSGAAAfiJIGVLYCA


~yQeV_Bs Hypochecscal protein
tGYGwVOIVSIJ~ELLAWVCSLVAFSFVGFIEITWIg4KVKi0T


RYMIVAEVKCTFKLVCLGCRVNpYEVpAYRDOL?ILCYQEVLDSEIPADLCIINfCAVIA


SAES$GRHAVROt.CRQt~IPTAHIVVTCCI.GESDKEPF)15LDROCTLV~11C~CSRLIEKIFSCPn_087
569833 568112


YD2TFPEFKIHSFEGKSRAfIKV00GCNSFCSYCIIPYt.I~ItSVSRPAEKILAEIAOWDCT381
hypothetical protein


OGYR>;1IVIAGINVGDYCOGERSLASLIEOVDRIPGIERIRISSIDPDDITEDU1MITS5RR1CGISLTYSSFRWASF
RCYSLIFFCfCGSLFCSESLtY0LLI0DfAKVSEECIGLLES


R11'CCpSSHLVLOSGSN$ILKR~TIRKYSRGDFLDCVEI(FRASDPRYAPTI'DVIVOfPGESDKEYSLLOA1ILVLR
AL.apNSSFDI1Y1FRSFKKCOISYPELAHDROVLEEFCIWLR~IQ~IP


ODfEDTLRIIEDVGFIKVtiSFPFSJ1RRRT1UYTFDNQIPNOVIYERKICILAEVAKRV00KSVTVRAVSVIJ1ICLV
TDFRLVPLLLOSCNDDSAIVRSLAt.QVAVNYGSESLKKALVQ.AR


F1BIKRLGE1TEVLVEKV1GQVATGHSPYFENVSFPWCTVAINtLVSVRLDRVEEIxLIGNDDSINVRITAY0W1LLOI
EELLPFLRERAENIILYDSVERREAWKACLELS$0FL6TCV


Eri
AKODIDOALFTCEVLANGMLPETTEIFTELLSVEHPEVpESLt.T.SALI1WSHOLQIpIKEFL


SKVRHVNCTSPFAINRFOA 11LLHLHCDPLGRDSLVDCLRSpOpLVCEAABMLC$IGIN


CPn_0179 557640 556210
GVpLNfEHLESL.iSRKMNIL.,'ILLLVSREDIERA~CDVIARYLSNPE?ICWAIEYFGWDiIQ


hfl%-CTP Binding Protein
wNLACDTFPLYSDMINREIuKKLIRLLJ(VARYSQNtAVTATfLSGppA0Q4SfF9fiffllE


WHOGPLt)TIDTPCDpOSOSPY,7,tSLCARFDLPRKEGDP$pALAVASYQNKTDSQV'JEEHLDECDVKT3EDLVTDA
CF.IAKt cns: scuOKKDOASLORVSOLYNDSRWODKLAILESV11F


ELISLADSCGISVLETRSWILKTpSASTYINVCKLEEtEEILKEfPSICTLIIDEEI?PSSENLD11VPFLLDCCHHEA
P:LRSAAAGALFSIFK


OORNLtKRG,CLWLaRTELILEZFSSRALTAGNIQVOLApARYLLPRLKRLHK:HLSRQK


SCCCSCGFVKGEGEKOIELDRRHVRERIHKL.SApLKAVIKORAERRKVKSRRGIPTFALICPn_0499 57,1147
569767


~YTNSGK.a~TLLNLLTAADTYVEDKLFATLDPKTRKC11LPGGRNVLLTCNGFIRKLPHTLttitA-HIT Famlty
Hydcatase


VMFKSTLEMFIitpVLLHWDASIIpI.ALEHVpI'1'IfDLFQEL.KIEKPRIITVLNKVpRLPRKLPTCFAVNVTRSR
DHMR'FKOIIDGLIDCEI~IFENENFIAIKDRFPOAPVNLLIIPKIt


rx;SIPMNt.RLLSPLPVLISAXTCEGIDNLLSLMTEIIOEKSLtM'LNFPYTEYCNFTELCPIPRF7DIPCDFJ4IIl
IAFJr,:KTVOELAAEFt:fADGYRWtNNGJI~QpAVFNLHIHT.LGC


DACiWASSRYOEDFLWEAYLPKEIQKKFRpFISIYFPEDCGUDEGRGPVLFSSFGDRPIGALA


~.'Fn_0477 559431 55761ti CPn D49v 5'.IJl7 i')tlllr..


I>ttnP-Mtaal Drlperldenc Ilydrolaas~Z'79') hylxtthrrt t.:.rl prJthln


AIC~IVHDtOSESICKLVFLCTC:NPE7CCPVPFCSCR\Y'ONT<;IHRLRSSVLIO'IQtIKTLVIRIVFAIFIdYF.
~.LV'fltEtAJt.FftI'/~aM)fFR::I':"IIDCCFIIN)EVTAf.'N.LItft%.VDI:?llfT


nN.pDFR1'pM,VAGV~ELDrNFLTHPHYDNICIaDLLRN'h'IV1'OR~LPLVf..:A..~TYRFLIR::RDPWL.~.
Kt:F.51\'(~n;lYaEIIKRFDIIIInY::YDt:::W::::NxIILIIYLKRtt:YNOCEE
'


NKAKfYLFATPMIFa
YIIF'IMffLVINYDE~hFI:RFF:'.Y.Er:F'r.'::F::D(fY.fYNPHEERETFCaJN)!X',ALtIPTIDF
~:LPAVLEFTTI.NEDCl70EEFU:IPYTW~YYQK~CfMr:FRFY7NL
'


A
IJ:HLHYYFt)YDHth'IUaVHF:WETFx'Mt:LYF'LPItJIWNENFFFtJX:RKIIT'MF11P'.fP..~f
lL'rDL(:::IDAKIF.':YLDNVKTLIL.:AI:F::ECrtFFIf:lili.'..~.IiL'IVFFJUUF)WHACIKN'D


Lf tTtlt::IK:'LFaEHD(tllrtIVTFAY0L71E1IL4ffL.
:~Wtl.Hr:IriNLINtIMIUY1':r~'~FJMN:LfE:KF:L::KV'X:IM.AVFf.'ItKnI.FId:IIWM1RF~.C


Vf4\L.kLTI.LUN.r: l 1


~.'f~s114rr'1 .5'(375 Sr.H.:SU


~.'I'SHt ttyl>ttrMrri.:.)l t:Ptt 114'ltl :.').,
hrr)LRIn '.'/ t 1 f
,1


::Uh~tl:W:l:~:IIII;RFY::KKt9ppt:NtJ:::Lrt:Yl\:XaIIEEYKNRYF1'IOLCAfWiPYWn.-
1'W'1 Ir/Imr(nn r,~.y
1'r.m.:m


1'V i WDVrJ:Arrn:Il.lyVL IH:KGHKF(rIMYNLW INIINlAA::I'1 t :b Lf::I
:L.T'VFH:P IT:a.W ALEpK:M:AIVL&~AMYEICILYYlG:f11 I'IIY:LYL IF71
tfAYFtl:f111.1!
: ~.~' I(,tVNLK:i


'JVHHFY)tr..:IV.:WVF!.f.i~.iPMLIVf:VMI/ttAPLIVa.:AYJVIiR'.tltl:Vl7AILCLFAIL::LW
U:VF'IvVIJ111t.F:LNYA12KF7.FIJl'/Lt'M:9rl.ltA'l'/WLELLI:IY:::FVv:KLYMItIWNI.
'


HA VIC:I~.YtNHMF-
Mrl'InIC(r::'.t'I:.hFr:Y.Yi.F:IIF'l'I'IJ:TItIGHLWFLI'If.lv71'U"II:F.'fI'Ir:
/In:1:In'lHhi'ILJIIt:IIEYfTOf.'Flt'.ROIONIii(WY.:/ITEYrA'tY'\t~:VFITKLPNGSRR




CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
FLPIIt::K::I:PRrIILKIRKFLPL'IGNVTORPPVPE.W n'70- ''
~ IKTRPLNIRTVFAR71VODLL ~rDC-N:iP-7O ~:ot.le.~c


i'QIILpIITMIOILEP11'OFD'17DI'IEFY;..~.'3EPICR:PLEFFTLtPYKEH3FFfYR~Q~E.DVIICDTPP
tI0EI9V ~ .a)~' EKNDKti
9 81,1.


'LGipOyFRVFE.iIPE0E00AANFLaK.SELLEt.~r~ItKPRI3P3DERNARLIOKNOKEROELhOYAL>C~1'".I
JZDR'WPI' AT'CIIn
E


IEDQPCFP!'LKANECOHITSO.f'ttLF.TRYFPSA.iLKCNPLSNYSRYYIQNfYFOIPSPT9GItEYSSIGOKtNP
ft.JIEAVQTEFfSEVPFl..'fLEEFAKCYKICERPIRVAKVKVJ1KJ1P
' K~


t v
EFFSIRIOR.iFLLDLYFA(:I~OLEvKRLLOYIKRNNKOVCNFVP101QAEDPAGSY!TPKBHIIE


:LI~aSCLI:rCDYQEFLRELLT~.IORL.iODFt'IPEFPPOTPLAIL'fCOCSGAN~JItIRVAT
r r er


.
r.iILSCCNL:::'....~'1'CNAYVEaF~SYAIPOLLERQAD~LA~I~~58425 53n31J
tK:SEiIIMNCL!'CLSSAXAGIA CPcy0501


.' ~:.Klr;KMWPVFLIGPVDYWItSKITALYN3NHAVLTI..


'.::If.". : ..cl,...TFir:,~I.. ..ay~....% ..
1~
y~~~NF
FF
~ IP


':Pn_J4'J1 571595 ~.'lJJJ4 CITA
~YI~
~V?EAVI:VPAY
PN~OPASTKDAGRIAGLDVKR
1CAOILlDWKFI


~1389 hypocnee>.cal 0soceJn
AAt.AYCIOKVGD~IAVFDLOr'GG1'FDLSILEIt~GYFEVLSTNCOTf.hCGIDIDEIIIIKW
NSCYSWt'~tLFSFLVLFVCCI110CIPLCPOCKYETKSYIJtSDOLPRLKDAAEKAKILL9C1ISSTEINpPPTM011
DCPKHLRLT
AI


ILSSLY7YF11QITAF :p
~RTOWIEIDiRAYLFSLPVDSSLSEAITNIVRDLNIE~EGIDLSKOtlI
' YKELICK
DVLLVG
V
I
A


IGYVOSLL~01FL p
DNGNNYENDCYL KCIO
IPIfCGKDCHiLP~TfTLFSPLfADP CNSNIIPA
I
LTRJIpFEK4AASLIJ<R?KSPCIKALSDAKIS


PFICAYEICERPYCECITRSSAERPLLPKEKTCOEPN10CVNP0EWAIGAAIOOOVIGCI:YKDVLLLZNIPLSIGIET
IG~LVINIITIP
~~rILLRLfDVSRF1NDCDPGI00GVF>i~t!'~.DttiVfIDID


ROVINSAGIRFNEKtIVQiIIVOATIlr7
'110KK0IFSTAAONOPAVfIWLOGERPNAKDNKLIGRFDLTDIPPAPR~IPQI
HPNFPRPNLSD~VDLF


PESdIVIiSDFPVAGWSG11Id0t8FRFRW1~.S5H~EFILTANCIFHVSAI~IIASGKEOKIRIEJ1SSCI4EDEIQR
NVRDJ1EINKEEOKRRRF3ISIWlN6A
RLYGCCCYIVSRIILTFPERPFYCEWGAE4RPrIIL.RNOE~AQPIFAIfPETLVKEIEERIENVRHALRDDAPTLlII
KEVTmLSIOt~II


iSFRYTPOI DSNIFRAEMIKDYKEO
WEOQIIFGLDOSYILCNWAIVOEfGRKIMVLEYNQOFSK~OFIRtPCNYYGFRLTYGFce~lo~sAtAAASSAANAIOC
GPNINTEDLKIwsrsTKPPSNHCSSr~HItGCVCIIDeI


CPn_0192 571617 571801 ~ .
t homoioQ present >.n Genebank/FlIBL
as of 11/1/98 586118 588511
l
N


xls CPn_0501
o ro vac8-ribonueluse fraily
LFSLIFPICEERNSQQTYItHLNVESACFLLFSPLKINWSSPYCFPPPYRROLKL


ATOPTS>"1'IGF:NOCPKL'~9LL~LPKRKPGRR1'YGKSLILIFIPCTLFVNARIOGFOFV


=Pn_0197 57514? 5718s5
SPONPEEYPFDIFVPAR~.RGALOGD1NIVSVLPYPRDCOKLRCTISEYWtCKTI'LVGT
nt in Genebank/F~Olt. as of 11/1/98EDRSPALON
' LSTPPWVDKP
l C
t
~


op prese O
No robust homo Iv
SKTEGSHSKTSKGFUCRFVGWIRTf'IGRGSKKRSPSSFSP1'HPYIRGRTYTRSPR09aVE~
IL
ITSLVSFTBALAYTSIISCSOSLIPVBLLPGRTYI
LEFIQITf~IAKAOf0AI0AC1fNL11ELFPPEVIEFaSLFSOKHITOVIJISRKDI~DLLCF':


RKpEOAETSFIETP10GIL10CQ~DPKGKIiVIRJK1MI
HLDKEAAKRCNSIYFf~It
IDSgPARDF00AISLTYOIRirNYII .L'NfIJIDVSIiYVTPNB


_
_
5~~~ ~WI~~I


CPn_0191 575370 515116
~GOtr3fIPLSKItJtF7at'II~KK1'SDIREERCCIRFVLPSVZfLSL~PVALIItIbTFS
~c in Galebenk/ENHL as of 11/7/98 IDiKi!'DIT!'1'PfOE
l L


No robust homo A
o0 Pr~
HKi.II6FNLKAIIEWAYNI8N0GVSLPFItSHEPPNOENLLaFOE
CSASVGVTS~1~LA


YINIRVNPYGSYRC~RNPSPEDGKKDVPLSCNSRLNRPOOIAR~PDYOnS.fTLTS71GIIPLDOVLHSQFVitSNK1'
ASYSTF1~80GN1IGL%IJ7YYTNlTSPIIULYID


SLEKRVlOCISLANIrIC LIVIHtLWNPL3IDQT1Q.EI
IVRAGSTKERVSAKAF1i5F1~il0LTRFIMVLCWP~


NAYIITANHBGLStIIVTEFCN<tGFIAAATi.PICtYSLKKriALPESIPDKIatIGJISIRSrtID


CPn_095 575507 576793 SVNLLTQKIVWSIA1~ICPl~IK>ITPSK1IKDTKK


aspC-ASparcace AlninocransEerase
KPOYIISEISIOG.iIGFRK
EMt~tIWPRFSIi


. CPtL0505 588471 589106 ,
ttRLK>Q1QIWAIOKAGAFLRCLPSESRPYL l
ETIPEISVIDLSIGDrtOPLCRST?OAIKEFCVSOZILQp~''t'~


YENRISPEEIFISDOAKPDIFRLfSFICSEKfiGL~ODPVYPAYRDIJ1NITGIRDIIPIJICase .
)-alechYladenine ONA 9lYcosY
KRIiRK'KEPIOCCPRNVLO>I~rLSEwTTLl100LLOfiKLITTHOGLITSC1LIVLT6J1YR


RKETdFIPELPt~IQOSLDILCIA:YPNNP3LZVL'1TGOI4ALVNYANOtKTCVLIFDJ171YSAFR
IPEAKYCAIEINSFSKSLGFTGNItL7lG8tVIPKLLTIDIdiEPNINDNKRCPOOIU1CHAYN1IRK7CMIMlKLI(O
OBAYLYRCYGMNNLLNVV1'GPEDIPNAVLIIU1ILP
ISKOIISQI'LT
PAL


VSDPSLPKSIFE Y
OLFPTPPAISLYLTIIAOKWISL><1'MFfV110CDHAPYL'DOGK>ILNIORROtIRDKpIHLLTNGPOKVCOALOIS
LENNRORLNI
FATTPNGASLLNQCAOYYGZ KVLS
: '


. LLSPI9SG
A?11RIGIDY1HOEYRDIiPNRI
WVELPEGISDELAFDFFLttQYNIAVTPGHGPGSCCQGFVRrSALTOPpNIALaCDRTL'1'A


SLKITNVIJv
cPtt'OSOS st9ass ss9tlo


CPeI"0196 576751 577811 CT131 hypothetical Dsocein
CPNEISPIPRRICKSFILNM.>a.YSKETN7WFLISCRRfNKRYFITiit.VILLPLiIf?IAI


CT391 hypoeMSieal Drocein
VlIIIItIFLTOPIITZaStFF>DCFSFY17UIMLIJO)VLOfILLP'GLrPIITVLtGFLTIItIII
SCMfILRIUSOYLFFt'SLICSFIYVATCGSOFOSVSSPICIAIFLSFPNVNIIPPPNNiYOCfGWA
D '
'
'


PPLYRITKRN IFCSKECSFKQV
O 1
PLLEDCSKSCIE1'LKDf84LPEIWLJiiIf~SIVKARKTARSLFtfOACtWAIVTLGTI11TK1
FKSIiiIYDIIILHRfPIIKTVYKM00VN
GDAPNCC'TO~PLVTVFIPTCPNPTSGFLTLFRKSDIV!'LCNIII6DJ11KYII80DV


VIISNTETORPVIY71l1VPDILESLTLPKNtI4tIYfs'VI81LLDIt7I71ICFAI01WATN710fIVYLPSSPLPD
EiJfOD00S


KPSBPFPSDLOKEIVKKLMSGIEYIEISITSSTFlCfItIR0AI010tPS11IFIPLSPLStOCLSTPNAC


ECfAFLOEILIfIdCIPISTDDfSLISDGKCI11CSVD1fRK8GKQT71KTV~LYN~~BL
0s07 SH198 590122
CPet


RKII71QRLSPTi'1'PNEDIIKYLGIKLiDCTDINOpLSE'KSAVS-
Cf431.1 hYDOChecical Drocein


S'fPYPQFPLSCEIKI~'NIELFlIfRNSKOARRRAKSPKKRKPRYAIVHPAPAPItIVYaJCf


CPeL0197 578107 5178/0 NALSTSDSIFIPKIG


CT388 hypothetical protein
iPQRWL~SWILCVKVTPKAKtNICIVGFOOQALKVRVTEPPT~OGXANDJ1VISLLtUCAGS590133 590300


l,pKRINTLIAGETSRRKKFLLPNRWDIIFSLNIDV~CPt4..0508
Cl'131.Z hypothetical protein


SRINSRNRSYGKSVIICVTKPIIVLIDiFERVEVLR)aGRWNOSTA%KVICLPRTPILK


CPn_0198 579D62 578085
No robust hanoloq present in Genebank/EMBL590808
as of 11/7/98


YCRLRRAPFIBaRRKARWVVALFAffrALISVOGCPWSQAKSRCSI~fYIPWNRt'I'EVCCLCPn_0509 590299


PEAENVEDLIESSSAWVLTPEERFSGELVSICOVKDEIIJ1FYNDLSLLIttICAVPSYSATYIPr~iceed
Netalloenzymel
NKFVFLYGNFIRV1'QEKIKItIVSNEOTCIPIHLVSVEKLVLTLLF71LKVTlIIEIFIYILE


DCAWPGGPLPALRORLDFLVRENORCVRFKICIVFL.CGERGRYOSIEEDfJiFFDSRYNPFDMtaELHDKVFADPSLT
D'lITLPIDAPCDPAYPtM.CEAFISP0AA4RFLCJISPNDm


PC~6IESGNRVTPSSEEEIAKFVWMONLLPMWRDSTSG11RY1'PLLAKPEF~1RWANRKIYEEISRYLVHSILIWLGY
DDTSSEEKRIaIRVKIaIDIIGNLRKlOIALLTA


VI'LLLFRSYQEAFPGRVLFVSSOPFIGLDACRVGQFFKGESYDIJIGPG1AOCVLKYNNAP


RICLIITL7~'ILICETt~CLNISEGCFG
CPn_0510 590801 591971


CPn_0499 580104 579705 clyC-C85 Daalains INenolysin homolopl
1
OLNNLHILWfFCILLFLrItGLTOPSCHGSSKFLIrfItJpRFFKDKGREYPPFPSAPTILA
8


/9
TLLCILYGALfiTKLYTLLPPK'tJWKDLLIWPLYSLSALIAYOFLPPNISTINPK~iI'IAHL
No robust homoloo present in Genebank/ClaL
as of 11/


LaVYLLIFYF~.FX;STMSSVNOSSGTPNPEEV'fSPESTEFl'IKNWSSDEiI0ATN11VALPIVRFLASVFOLCLFP
LOLLFYRRRPNQpVRSSI'SFOSOLSEALSAF01'R.IVRIVNIPKVDIF


".'OLSLPDGVGTSSEETASNPRVDEIVAEVSSSMVADQISSLVfltVGELLODLt(C710SLFOEALVLVSEEGYSRV
PVYKKNLDNITrILLVKDW.LLYTSSHDLSOPtBSVA
ALPEEITL


TSFOSEL1CJCLPAWKSSTRRL>:T'AGRGONADIARLELERSDIfAVLGNANOFfIGKAHLIL.
SKLTDVNHKLOCLSREDLSIrIFONNDRVLEHt.GSLGI.aVpilECiK'ISLSCERGIPRLVLTAKPPPYAPEIKKAS
SLLOEFROKNRHLAIIVNEYCFTECIATMEDIIEEIICBIAD~W
~aINN
V


DSNLVOIKKVNLPTVEELRTL.OCITESSSDPRVEEStSCCERLLNELRRIbIANIVOFISSFHKVOAVPW
ENTPYKKICSSNIVOGPINISDAEEYFftt*IDNENSYDTIiGGFI


~YONIVEYfJBIIVRRINLLPCLGCLPFIGJPDASOEDORSS~ERSTRRERLSRRSDLSEECNFDIEIITCTERHVCKt
JIITPRKRKt?lIS


FlItVMECESINPESPHCDCRNpPSttCDKODSDSEEETEL
CPn_o511 5:3111 592488


,am_0500 59064? 58236: csbV-9iqma Ratulatoly Facto:
NSDfOKEEHCSTTIFHLNGKLDGISSFLnIOENL~>OSLaAGSKNIILaCAHLDYNSSJYDIR


pcoS-Prolyl cRNA Synchetose
Vt.IQ..'YtIpVCOHSGKiVLTTVPKTIEOTLYVTCFLSYFKIFNTVDEAIOTLMfD00
~PNSNKTSOLF'tKTSKNANKSArIVLSNELLEKdGYLPKVSKGVY'CY'CPLWRWSK?!8'LI


IREEttrAICGOELLLPLLNNAELW~GRWEAFTSEGLLYTLKDRECKSI~LdPrNEbVI
571111
2 5x3538
5


CSFVAQWLSSKROLPLNLYOIA?KFRDEIRPRFGLIRa~RCLL1IED9YTPSDSPEO~'~OY.
CFn 0
1


EKLRSAYSKIFDRLCLAWIVTADCCKIGKGKSEEFOVLCStGEDTIGVSOSYCANIEAACT125 hypothetical
Droeein
GIIPTOWLAPAT


::;iPF011AYDREFLFVEEVATPGITTIEAUWPFSIPLNKILKTLWKLSYSNCEKFIAISLPLTNRRSVCYVNP9IAR
rY:OISTWKFLYSLATPLPAGTKCKFDLAGS
14A8PNNP
V
~
'


xIRGDROVNLVKVAiKWADDIaLrISDEEIERVLGTEKGFICPLNCPIDPFA0E1TSPM'Q
1
ILTf
.sEIIEATAIPVKDHPVPOFEFTLPYCLOVGG
DLSOTRNVIYAENPIrJ
WVIOrK


.
VLrDACWSAOt.FAORRKPFYLYIDP9CE~.NYOEPOVFBI~IR~IVLKKIEIPTPS
St-X:ALINAKDK11WIJVNWORDLLPPQYGDFLWEEGD'1'C'PF1IPGHPYRLYOGIEVAHIFN


:.:TP.'rt'tY.:FEVFIFQDEH~TCQc.IrNCTYCIGVGRT(.AACVEOt.A~RGtVWPKAWPFStRFDf7YRFE0E
FGNL'fLIF:t'EETRtEL~IEItLRE:><.NWpLFIPETGPYILPNLYfNI;PCI
FI::APIKCFADSAFtItJetf:LLHGESERVDSED1ICICIRtYFRDORAL
'
'
'


'."tAFtIJ:DTV;.QEIrIE"LYHEW:OCYEFLLODROERLCFKLKD:DLIGIPYF.LILGKSYIRIOLKNL.S
1
OET
NFYA::.~..~.FEtIQENL::1'DIWKLIN01Y.~.LFNEEDPFTTL'~'I:)
JY3GEPNLmVRNIGNIKE


~e::Y:IFEIE:R:.GCKYTV.iPEnlFt"IWC.~tRlLJ1'PKSII::KMKf.YKH I FI 1KLTK:'fVfNIf7N
I:: I i'::FTA.9KEfYiFDFC'IFIfPEFERVVEIYNAN


':R:~1:~1 s~7n50
~:$::FTfMIIMPFPtCX:KI':x01'Rc.9'VIFl:LKKIILRIY:FVIVSt',LppRCIYKDYPDSPQW
'


fn USUI
'l::It:f:fAIItaIKYTItF_'ab't'\Lt'ANllv:'IA'I'It:PPIVL:iFtIITOAPIIf:SELSTOSKPOU1
V
: '
U Rupla~.anr
mrctiPtuer
~:l~IfPl1 Tr
I


. KAPP
. NRIII:Y:11'/M.TALL~1'VKt
flttY:I:VLIPfFFI'D!:IBR.DYEYGONVPLt:VTGKOPN,t
rc '
'a:::F'PIHL:-.~T(VLVC:W~JMIt::KVSKR(Y:KILfSILFATTELYLh"IIOIrI~P'7SKTt.K
ILLTF


. lYNfiIN
t.ay:::h4a'1'fIHNYF'AliI.PJlI:u:F1*KNIIT:XX:RI(?DWLRIIWIfItyI:FI:PFJ1EI.:APtVFY
YLItVi'c~AIILWW::.:I


NI.K I:YJId ::E::I<N l t KISt hKA'f
EL4:EILD4PTFF.~.::ARPFNO::VTN ~'tny'.1 f ~.t'.1'. '.v'.7'::
f Q I1'G'/MIORAYt'
'


~1:7LYNEV rvlr.r..Amn.n::.
:L.:CEFP~It'Pfff'C.WL!'l~~t:UCI.::IKRIEKFLONYtPYLP'1'NEEL::KKEEHL.. :: ~
' n


.ELLtllC711iKC .
WLYITRriYII':;I:EDt.YrJi~:N::KI.I*YC,1FKDPEVLAL<~I::LFENRR~WV..
LFAN
VI'IrYA.P::I*~:VKNUU4\KKYIIYtKr~::PVia.)IIi.E:RIVIi(6'T/lplfl'l'CLl'(WPKT.'uPLY
::

'
'
'


KFJU.1 I
F'F:KIJiM/Ft'I::DRUAIJII.I.tJ:IIJKI:I/,NTikRli'ADV1IRY.qPVCO'l'VYY::.:Tt.YLYP'M
F
LLK
1
INI.t
NA'fAFI':Kt:lV.hfla7P:TlN:C::VITII'YITIIR:iPL(:ALI:IL(a
"
'


PI, n'I ~F: a .'K!'r.'::F'lAKft il n'M.WI.
l:: aKL.LP::KR Y::1'I Nal.l W~ 1 vl l I Y.Ti'
w ItIF:Piav)::F'IKt'KI:a'NNFH;i:ak:KU;NFPITEVH IVIY:I:FP."(. '.NldtlY iDLF
ILRTE


96


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
'fKIKE-fl7lJtIIIKALTAfE'fAYL.iDLGIJLaIRG'.t'.f'n t15~5 X' _ ~'t"397
..KDIIGLDSIP~GAEILYDKCRN CT77H hypnchecvcal 0rocafn


FLAPKRL:.:::DFLNIHYMAlIOLGtHSNtTMLC'YHKECPCDLV1'l0IVKVPOLODETOGFKNGt CFMHDALL
iZWIIOaL~L'~IR41111VHKEHCK~AINO~L~
~~


FCLLKFACEMLVLCKRLRK.',~fiALPLiLiIIIAVARtFLDNfSPaI4KALWNYLGtF~IALDLLI~
ICTOIRDCCkl4RI0Ehr'~O'INKL~OOME~LRL'~


Lx'CANDL,~>.~TIR4CEK'IFOMA,i.iKEP
KOAGC.~.IVSLKE3LASTflrSSSVIEKEIFE:RKKCNECGKALLCpRTELKNAIttPELl.
CK11DAECFIAALITOOCRTPGLTN35HV


iIYERLLi~1'IKKDRWVPtF?1RVCSCCHri'LTPONENLVRKKDR1.IFCHCSRILYWOCSQ


CPn_05Li 575690 595530 VNApCJSTAKRRRRMAV


~T4-7 hypotnecscat prota>.n . .;~' .. . ._ .
~.NfT,PHN".TR~JAM.~.tFit4YiiIllCY.iftN.iFPLSWLIKRNDIRCJLAPPADLINLL.Iw v
-


R:; ~ ,:~s..s.m : :-.t.s. ..
'..~:1\C"FF.':.'. ..~\T:.-
... " . :.':. ~ I:F'.'."'iDL
VFNL1EK~F.%(:'(IJ:LK:::i::KFTrHM'r.iiMl,i:DW:QGawiKUKvIV9FFF0AFJPKW1
.'.FIILAW .-"~.;;, .~.. ~~.~..-LLt.:D.V.:.;;::'::.
'


C1.PP5
OCJ1EKILGMS<.WVFFSGItLiK3fICYARKLVATLOSLSERIIf.FFSP11DLIJ(Cr~L.VSPGDI
nV'GWYDLTKLPFVFALi.:.fiSC'.:WiCEHFLPNLWEEALaQFESSPEEVLKEAHONl


LLOEYYI1LCOYRLGEEH'tEiFEKFREY'tCTLY00ARLVCLFSKSGETOELI~fVPHLKSRRAILVAITSIIPYSMr
IALSOLWILP>;VAiLDPffIC.I


~~I~~~~~


CPn_0515 596450 597181
HLG~NSFSLEVFSJ1YCCCC'JCIVDPOFRtl'CIFTDCDLRRSLASYOGEVLiLBLEKVIfi'


ubiE-Ubiquinone Meehyltransferasa ANPRCITEDSDIAIALOIJIFSSSPVAVL
EKNI'fKALKNSGNINEPSTNKPDCKKIFDSIASKYDR1'NtILSLCfINHFWNRSLIOIIf~S


GYSLLDLCAGl'GM/AKRYIlUW PQASVTLVDFSSH4.DIAKDHLPOGSCSFINSDINpLP
' 0527 609910 608726
CPn


YSAHKLYL _
LENNSYPLaAMAYGLRNLSDPHKJsLOEISRVtI~tPSGKLCILELTPPKKTHP1sueH-Dihydrolipoalaidt
Succiaylcranstvcsse


RAWPWICKSVSKDPDAYSYLSICSIOQLP(tDt(Dt.I~LP.SRSCFYIApOC%tFLGAATIyft,RY!lItCFRFPKI
GLTSSOGSIVRfiLKNLGDNVARDCPLIEVSTmCIA'ISZ.PSPItA~tLVR


LEKQ
FCVN(93D6yA9COViGLIFi.CISEADDESTSCPPTBCCTKSPJVGSSSS&ttl'fPSPAVLSL


J10R8GIGLDF4.OKIAG1GKOGRVTRODLEAYISESOOVSIPEIF004VNRIPIISPLRRJ1I


CPn_0516 598909 597255 ~
ASSLSKSSDEVPitASLWWD1(T~1LISCZs~ORFI'CfNCVR..ITSFIVOCL~TLEO
f 11/7/98


No robust (wmoloQ present in
Genabrnk/tJ03LFM.LI4GSLDGITIVIeIKSIIINCVAVMNKF~VVVPVIHNCODRGLVSIAKAIdIDtiSIGR
as o VVVRDDDSLIIIRK
IILnVLGRAIAKAYYVCMVARGLCDFPTLVPNERLPIGPPEVPOHTS


RISISFRVSWFVK LNKLDPS~~~'~"I~IIRYPEVAILGICIIOKR
WDliLSVKSDYEEJVGPAICIRSLEPO
A
'


I4 MVYM'LTFDHRVLDCTYGSEPLTSLIO~tRLESVfMG
SIISGLDDILt(LCILORRPF
w110CKEFAKRNF
~RLKFPKSIGSKDAVIVDSF3~lVPVN
!'


F
VSOISPAHCRLCSTLVOWAPILGSEEOLVWLEE
T1'D1TSGVSFJ1AAAE7lAVDSTPGTEE
'


ITIPAJISC CPfL0528 611165 609921
ANPfQCIPMSETVESSPVAPCNTTD t
PSPSLRYALWOFRfPYPEPPKEPEVMFTDEEICSLILE71TRARRMQ.DLYNCIILiIDItEL.'K


DEIOKIIVPDLPENWRtNWRwSERLYKFPFKTKKEGLEEI!'LMCELGFHIt.ARGZ.RATOSQ:
t?1cT-Glutamate Syspor
Ll~OItJ~CIPIGLPSIG1RCGLVLEGIAItFKPICDIFLNLLSMWYPLVPCSIlV11GI1LRIS


AitIKVFNSLYAkOi.QSFNV~GttSCTIIKPLPTSKLDLFKSEFrSKPIOatILTEFLVASDEEIDMOO.GRICIKSt
IGLYIG1TALAIVICLCFAt4IFSPCNGCDFA0AQ8~SAVTVIf
IRT


LFKGLRVLEPGIEL7fYDHPt>DAGEIRSVLEGLVO11GRISCYtitNOPPGRFYLRGVG~AAY!'CSIIAQVFPSNPV
RSFAEQiILOIIIFAIFLGIALRLSGERC1RWPJtfIDD~N
lCQRFKSCVR'1'IO.VG&F11DES GCLVAPG
FFESSDEEGAFIIDNfPSKTAt4
GEIlt


.
LIIMItItIMSFAPYGItit181tAhtISCWIGt~Vti~IGKFII1IYYLiICLl7IATLVI
p
ELVt$i.ESLV11S


LPItiGRFTILV ca,ISrsxFLSS~IISUIVSrASSSATL


GTAIF0~4AAWI~If~~~'~'t'I'~ATFSAVCliMYP000lITtGSf1Jt81RL


CPr~0517 599637 598795
PIOCIAII~1GIDRLRDIVGTPIQtILGOAWATYVJ1SG~.SPYESI1C0E8VE1T
f 11/7/98


No robust hattolog present in Ganabank/1?18I.
as o 65
FIMSSLLSCGRIEPTRV1'CSLKTYLEDTSONOLSTRLVRASVIFLCALLIILVCVAtSSL


IPSIMALATSFTVMGLILFVMSLT.GtrifAIISYLTYSTVi'SYR~(1DIAFEIHKP~SVYYECPn..05Z9
611398 6111


CVR)811DLCRSSLGCGEIPIVRTLFSPFONIIGLNHAL.71AICIPLEftFJIFSPGPPFIEPt.VDwAYc_aH-
ATPas_e
~PSTLFLFYRRVTIAISLEGILGCyOGSLLSIIVPAPLVAL111F
F


~LIRDrRPHVSSLCFVIKQ(iSSLRTKDCNI'ICEAFRSI)YOhHFAMVDCYRLZHSKLIIERFSWS1PYRARSTVI


KNGLIfNL7I IPSVMVREDYPSRPGF~YRt:GLLIUnt'sGKG7II.
KLTWDSIIVlIBA&YVCDEPtiIIAJDILP6SVWVNKaIIRISAARJUIEKIGILI.ItOGLOYR


KLHI~IfEIAWN00DPtGGRAFFPItGRLRDFPLRLKlYDAIIVN00GKEJIG'1'VVIaV8f17t


CPn_0518 600806 59983
POIFVKPTIASVVWIIB~RIPKF~4RaRVCVFCCLGFPOGFiHrt.REIHILDKYLL


CT119 hypothetical Drocein I~SVIG.PRLSGEVSLLPIAKVCIItI.iVNOD
Flail'IfPVPONfLLLRILRI~~iiFSRSDDEIiDFYLDRVflGFILYIDL~fDOf'~Ci~RCIYOEL


EiztIIERYCLIPKLTFYEVKKZlICfFINEKI'YDIDTK>Q(FLEILOSFLEFIYDHEt71'LSLtiK4IE0IHKNRO
N


AELIDOFpOfYVERSRIRII LKIHLFDAK\ICIICITQ
ARQLLSNKUCIYYSNEALiJPRPIOtGRPPKOSAKVEI'>:1TISSDIYTKVPQAARRFLFLPECPeL0530
613323 611160 ,


ITSPSSITFSEKFD'IEEEFLANLRGSTRVEOpIHLTNLSERFASLKF.iSAKtGYDSGSTGspoU-r~
Nachylasa
SVVL~1CKFLWARCCSLiIPWEFCSl4OCIGK10QPLVl(F31UIL.KRSRCWISStiPL1f19311REI0


DFFGDDDEKVVTKTKGSKiIGRKKSS IK~iPVAVI
t.DSTLtIQLdF
KALRTr,YLCOHVrCSTtILSItXiXEFLYf3.KmiSTICILYC


_
_
QKRWIMOmFI'IQPFYLIIOV~CPOFNCiIZLR
ADGaGVDDV LOIPIV06Y1tP


CPe~0519 601707 600901
MMtSSLGAVFSLPILSISRE6GKELFKOEDwLYISrTSPPALTMYFBKNYLGPIALVIGS


dapF-Diamit7opimalace Epimersse
EKDGL?E~1FSEOlSEIALPMIGCSDSLM.ATSYAAVAYEWRQR1NN
OPT1G.RILVYWMAFYSPSI'ISKYFIYSGMIiRP'L.LGETLPEVEDVRFLCOECRVDGFLYL


KPSSCADAQLIIFNSDGSRPTMCf~iGLRCI1IAFQ.i1S010CKSDISVSTDSGLYSCYFIfSYtD
CPIL0531 611198 613315


RVLVDKI'LADWRJVVHRLESAPDPLPK1LIMIfffLlfPNIIWILPEISIT.DLSILGPFLRYSJ11!
depasdant methYlcransttcane


HOTFSPDCVNVFtPWICGHCOLRVRTYERGVFGETAACGiGAtaSALWSNSY~IKE.SIODS&ImDFRKEKC'Rr
RK501I~R~K~RHSKTYFSLIRERLVMDYKLGDSGtICiJKLt~F


IH'tSit7GELMTVSONRGRVYLQGSSIfRDL
GPVI'LIRPSSYAVWPKSRPEI3FSQAAt4YVRDGERGAiiIO~IFKRLPLBNEVAPStNRCLLK


05I0 601233 fi01616
RTPFCNLCYFPEIBGENPALKOJIIEKfOCERWIi~t.FAYIGAGSIPMKOGARY111V0iAS0
CPf IYF


L
MVAtIAQW4V~I~tAPPEIIRIFYfVIEDVISFLKItEIRPNKKIfOVILL~PSYGRGPOG


elDP-CLP Protease
KIDImLFPLLSt.CSKLLRDDiI.SYFLLTSHTFGNTPEFLRAIARRSVPlLVSPAiiSC'GESF
ERHYFlIADCEVHK1.RDIIEKELLFJvRRVFFSEPVTEKSASDAI1GG.WYLE1JIDPGICPIVF


YINSPGGSVLVIGFAVWDpIKMLTSPVI9WfGf.IILSMGSVLSICAAPGRRFATPNSRIMICGiGiIGJILP&GStV0
11IA


HOPSICGPI1GOATDLDIHAREILKTKARI
IpNYVEJ1TNOPRDIIEK7IIt>ROMWltTAl4FaCPrt0572 611716 614075


KDFCLLDGILPSfNDL ribC/risA-Riboflavin Synchase


ESFCCKDSVVIMOGIffSGII0E1GlbIlCFFEAOCa4CLu'LCIKS'fPLFVTPLVTCDSVAVOG


CPn~0521 607807 601211
VGLTLTSCNFSKIFrDVIPCI'LACrTLGEIOtCSt7pVNLhAi.KMGDSIGGNLLSGItVlCT


QlyA-Sarina
HYdroxymethyleranstaraseAEIFLIKPJJRYYFRGSKELSOYLFEKGFIAIDGISLTLVSVDSD'fF'SSICLI
PE7TQRT'fi.
NFEKFKKFAIVEIFTIIVtIAWSLLNKFLF3~ASGKKGOSLJ~STAYLAALWILLNAF


KSLLi GKKROGERVNIEIt)IISfICIQVDTVKRILASSCKD
PS ICERIIDELKSORSHLIGIIASENYSSLSVQIrIMGMd.TDKYCEGSPFKRFYSCCBNVD


AIf.WOCVETAKELFAAOCACVQPHSGADANLLAVMAILTHKVOCPAVSKfGYK'IYNELTE
CPtL0533 614918 615385


EEYTLLKAfl4SSCVCLCPSIIiSGGHLTHCNVRLNVMSKLIOtCFPYDVNPDTCFDYAEISCT106
hypothetical pcotein


R:.AKEYKPKVLIAGYSSYSRRLNFAVLKQIAEDCGSVLWVDMANFACLVAGGVFVDEENPEYAPHOCPFCNHGELKVI
DSRNAPEJ1NAIKRRAECLKCSORFTTFE1YELTT.OVLICRDCR


LFYADIVTT:Tt(KTLRGPRCGLVLATREYES'CS.FM.ICPLfOAGCPLPNVIAAKTVALKFIvLYCNFOESKLIItC
IaLIASSHTRIGODOVHAIJ1SNVKSELLCKQNREtSTKEICELVNKYLK


S'JDFKKYAHQWFBJARRi.7IERFtSItGLRLLTOLTDNHMMVIDCGSLGISGKIAEDILSSVKADHIAYIRFACWRR
FKDYCEIJIEIfL.LSATPOF1EK


CIAVHANSLPSDAIGKWOCSGIRLLTPALTTL.CMCIDEMEIfADItVKVLRNIRLSCNVE


CSSKKNKCELPEAIAQF1ROIIVRNLLLRFPLYPEILLEALV
CFn_0574 615389 515784


=Pn_05~2 607835 601655 dksA-OnaK ~uppressor
WFTRS(tWPta'DOEIEOFKKRLLFI41WILSHTLEGNAQEVKKPNEATGYSO(pADOGTD


':'C433 hYpotheci.-~: protein
TFDRTISLEV'ITKEYELLRQINRJ1LEKINESSI.ICDVSGEEIPLARLIAIPYATMVKA
REPLSPEKTSL1FKVKNVNpRMIKKNQGKKKNYFQYIPLKVpKLROPSFYPKRLMTLYLC'


ItIOKTARKYOAHYLPILTLFPYAKSTPONKRALQFLPOATHVILTSPSSTNLFLSRMTSLCN
OEOFEKCLLs


LSYJITLKTK1'YLCIGEs'TKERLLSFLf'.OVKYWaTGEIAECIFPLT4aLPS5ARILYPHS
0575 615763 516296
CPn


::LIRPVtREFLYNRFTFFSYPHYTYKPRKLKKNILSIC1KKIIFTSPS7VRAFAKIFPRFP_
lspA-Lipoprotein St4na1 Peptidase


cYT'fWCO.RMTLOEFCKFSSOKOVSLLET1JGKSRTSFKRTPCWKLSSMJ1TRFRSILLVITLP'VLIDNV1'KLWLG
D'IKDLOILT(tPTLYTH~CRFS


052i ~047~0 d050S3
F.iIAPVFHF7CAARGLFSNYKYFLFLLRIFVILGL1J1YLFFKKKSIO'"I'CCI('ALVL(.CACA
'_tm '
'


_ ILISCGTLLLWKFYFPTKpFEKKR
tk robutc hollwlcxc pcesene in fFNVAD
CenebanK/ENBL ass .'i 11/7~aA hNVCDttFYCNIVDFt~FNYKf,MAFP


s~Nfv~::ATC~FDGTAF:sLFPFITRPRYNFKLALFVT1AIALVWIALtA".'TIAICLCIHPLC
~Fn O:s~. eI6700 X175'71


.:F'IFLTAICLYFISRYIC.HYARNVYIai.DWfDII.':YLODNR~HSFIF~DRbrqA-f Al.rl:ly
Pacslsaaxe


'tn n'.4 0:15070 W 1v1'1s
'fR::I4EY1.R1'FFK1\fINRLLsLt.."VFDGFFWSY'/AFILII'ILGV:F:111I::RFFY)F'fIIPSQ


rmtnt::r IN411(tl,.ul Ns':::rnc
F'~KLFF'I'f::\MICQERETKL1:VIIPLKVFFASHYXILCICIPl.ICIVTMta~:::Ct:ALPYRhiI
in Urnatkmk/F?1RL .u: ut 11/'7"tN ~
IN '
'


. .VIVAtIdJaYIVFI
'.y':'.VL::S'1.'FRDKCIADKYyFTtAKf:TL\ILA::Ir\LC:ALVAUtPLIKJ1FKTPW.
:RT!'. fIPM
'r'fE~fYKFNN::f:C~ N:(Fr:::t'JKY::F:V1'L:IKFRKLDRDfNf~
'
'
'


. ILPFFMLLYt
. .af~SLORIUKLCa
:::CIVII:NI'VYI~\Ll.f'I'rAI.Y::VYfFLV'fIKtFrC::Y'/::::MJtJY.'JLG~sNFKPG:14\WVEK
NAL:L
'/riF::V(T0.~.L\I IIWNI.FKI.'YFM~LLFL'/FYAI


'/!.':'f:alF7M.:F"fHNIILN1'KFKt:\LyfDW:QPF'QFTFL".':LRVIF:1!MQ::TC:IfFNf'Vf:PfN
L'Iff.VKRFI1TLFIiLIt:IYF:.::AYYrp,iAL~FArYZYAITINOCISRMY:iaiIa~7Fiki1
:LIJIVLAr.3W5IGLENA:W'VFd(TLL:YF
D
'f
'
TL
'
'


IsarfA~ftlt.::'Cll;f::Tt.KI*::VWF/tt.'Kl'HEIr%PAK':Ff.FF::f'fFNRWKLPNEALOQTFNLP
:
~41~
I::IWAICMLI
:
..:AK
:
IJ::F
VI':YYr:AKFL'1X11'CAY.IYTLYr:(d.lL1'LF"'FI::()Nf
(TF'F'YVn:Y'ITII::YFt
If4VKFFt


Itl.:::AF:YY::II.I'fI~L:IS\rtta'K::l'ELFNW6'YYItVALIJ1'fF?X.'LK.1\(E::ItM\tVALC
LF.
DFFNTtE .
' ALI.IN.'9/:7
i\1.t.1.t'1'Nl.4SF(LIKE'fIFPAFAA:LTET::L.iTE
'
'


/It~liiLLVII.O
S
WIW:,rIVAFY:KIAL(.DAIiIfPAi.t
'r::'/'IF:VIIFit11.1Kfi:fF


7,; :I::F:F:
:H' ~t'.sY nlIH1'. '.Itlltr'.


97


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
;:Pn 051.: ~i ~luW 3


~THI~I.: I,ynncnRtt':.t: n.,r..t:, rL:I-L2: P:bnsal!.x: Pr~cetn
ROLNKILKWTHR:~
~:LVPG1KI
:I'atr~'IIVNTI~FC
1:
~
'


.
faKORLTL~IER&fBKRIYNBP~YANiQ'0('~.I~~W~YQVBc~~DY'D1.'EW.CR.til.'~
.
SHI'LJiNAO~fAEYL56N~WA161KY61I1KI~fYiI~l~Ilil
.;.: FVPD01'KASIl'
:::::::LLF~
.
:.IFLLFNDN
LAptII4:NNL4:,WLKKRKNM'..:L:CD1GELLDEKKOR:.'WKKNLDGGIKhCAALVLIWKV


F f NN0 r.
EILI


'Fn 1539 .t8129 5:A511
051'.' :115Rn u32198
CPn


::T914 nyprYCn,c ir:.i: Prott:Yn _
'F'IiKTKTLRDI'nIPRNNHKPNKTKCKRFRWLRyGbB famil/
.VLF,r'CFIATLL ..-..
~Ft:Kr -
TKEI .. . -
tK:AQHw .... .,.
~ -.
:
r-

~

'
~
-
'


. _
, _
. ,,
, t
.. .. ...._...,s
;...........,.,,.:1Y:.::.Ya:.':!':.;CFi:fi
. ~F
... ,....: r
; :::~ w"~;:::: w:.... .........,,..I
r
:
4:
.t
~
.~.
....
...
.: _
..-.........
'..:"
.
:
; '_
;:
:
;
~
'
'


. :.
, ..
. .~
, .:.
_
,.
,
. .
.,
m .
,
...;
..
... ._..
..
! ..... . . .!. ....... ........
..1.. ... ......v....-


.. ..
t;,>f,~LKPNGY:.:HVnI:T:w.iitPKFV:KL;iALRt,~:.l~;Vr:M7L'i
a":::~L': l:.iu~fivLi.'n


Cpn_0539 518678 621545 FCCGOGVOCF~PT'JNCiCD


Pm0_I7-polymotphie malorans proeeln
.~.fHLyCLR10~0(~OILWGFLFLSSFGQVSILRANDVLLPLSGINSGBDLELP1'S.RSSSPfKCPtc~0548
633231 a32191


'rJ'YSLRRDFIVCDFAGNSIHKPQAAFLi4LItGDLFFINSTPLAALTP104INLGAPG11GLFSeysJ-Sulfite
ReducGass
'
'
'
'
'


;NVTFI~WtSLVLENNESWGGVLTTSGDLSFItBfI'SVLC~l4ISYGPGtCALLLOGRKSK:
E IKVGDALGVLPElIS
DSND
1
IS
KHYLpEKPKrWOVPLVLRELLSCSDSINDSDPIYRNVF


.
KEVSINVLOLLr"YSPTfLYNVKKTSEKVSAQKFIQGYVCL00fIPAKLtISFPPOKOPKI?L
ALFPRONRCTiLFLKNKAYF)pDESNPGIfCG7IVSSISPGSPITF'AONQEILFQENEGELGG'


AIYNDOCJ1ITFPNNFQTTSPPSNKASFGCAVY.~aAYCNLYSOWf.~TLFTID~IAAAINCGAIHY
YDAIQEYRPQLPIELFACSVFPLLPAlYSI7ISSFDLMPKSIELLVKtRISYPGKYOKRFG
'
'


ADYVIIIRDCIOGS'aVFEENSATAGCAIAVNJWCDINIWCPVREIFCJ&AI,GII~GiIIYYAATCfECKPLVIfICA
G
PCIAPYKAFLEERLFlBID
CSS11CSELONNDSAYIPVQPTKIIPTLS1


:SILRLHANOGDIEFCCMfVRSpFNSNINSTSNITFINAITIOGiIPREPSL4ANEDHAICFPGNNLLFPGERKEKVN
SRERDpKVYVOCL:.RI~DCVR
'


'fDPIISaTENYNSLYINHORLLF~1CGAVIFSCAALSPEHNXENKCtK'1'SIINOWRLCSGYLASLRKENRYWDVY
KAYEEGCPFF'JCGRINC~GZEVK11ALEEILGImI


.CAILAVRSPYQD~LLiItGPGSKLT?OCICQSD~ICIVZTM.CFNLO~IL.aSSDPAE
LSIFf


. CRL0519 677662 633255
IRATEKASLEI:a~.VPRVYGHTESFYENNEYASKPY1TSZIIS11K1aNTAPSRPEKDIQNL


LLAESEYMCYGYQGSWEFSWgPNacKEKKTIIAS111'P7~CEPSLDPKR~SFIPTTLWSTFrsl0-510
Ribosaslal Procvin


S.'.LNIASNI'ltdd4YLNNSEVIPLQNLCVP!OGWYpIN~IPKOSSNNLLViONAfit04VGARPOD90NOPWNDNS
LLAFLKKFK1CRLLRSKCCIIIOppKQK-RIAIJOCFDpGOLDRSTADIVE


iPFSTNlILSAALTOLPSSSSOQNVApKSNAQILIGIVSLNKSWQALSLiISSFSYTEDSQTAKRIGARWGPIPLPTIU
tEVYIYLRSPHVD1(KSREQFEIATFIKRLVDI:.DPTGIITIDiIL


'INKRVFPYKGTSAGSWHIiYGWSGSVGNSYAYPKCIRYLIQlI'PPVDLOYTKLVQNPPVEPGKllL7ILPAGVDIKI
IWI


'fDPRYPSSSFJSft4LSLPIGIALC~DtFICSRSSLFLQVSTSYIKDLRRVNPQSS11SLVI14N


Y'IWDIpGVPL:KAUiITLNSfIIIYIfIVCAIMGISSTORtr.SNLSANANAGLSLSFCPtL0550 ti35dB8
633580


tusA-Blongacion Paetor C


0540 621631 676862
tlJYG~'DRflf15N0EPDL9AIRNIGIFD1HIWIGKTrlTEHILPYAGRTHICIGEVH~GATFf
CPn '


_ dIKINIIDTPQHVDFTI>:Y>DtSLRVLDGiIVAVIDAVS
20-polymorphie mtatnbsane Protein OWRl7IQCOEItG:TITSAATrVFwLL
'
PmD


_
GVCPOSEIVWROADRYGVPRIAIVNKfE1R11CAD1fPMVESMCPxLCANAPPVNCPIOSCS
FIHLIYSSLIEFVNISDRFSSHXyK.PATAVFAAVLPALTAFCDPASVEI9TStnGSCDPT


SDAALTGF'Jtp55TETDLTTYTIVGDITFSTITNIWPVVTPD~AF1DSSSNSSKOGSSSSG7101''VONDLISQKAL
YFLDDTiGAIIWEEIfEISEDLKERCAFLRAMl.FZLiITIDEStIGM


~.
SLIRSSNL.NSDFDPTKDSVGDLYNLFPPSASNTLNPALLSSSSSOGSSSSSSSSSSGSJ1l9fVLEDPDSITEDEINp
VIfRIOGVIEIiIIINPVLCGTAFKNKGVQpLLNVIVl0iLP8PLORG


SAWAADPIOOG71AFYSNTANfitL?F1TD5(.TIPGSLTLQNLKlIIGDGIU1IYSKGPLVITCLNIROINLKTDQEI
SLEPRROGPWII~1PIIIFIJ'DPYVGRITFIRIYSG'ILK100SAIIJIS?K


10ILTPfCNESQKSGGAAYTEGiILTTQAIVEAVT~1'SAGOGGAIYVKEATLPNAL.DSLDKKERISRLLEIWAFIER
TDRDEPTVGDIGACVGLKFSYfCDTICODt~pCIYLERI!lflOP
'


KFEKNTSGOAGGCIYTFSTLTISNITKSZCPISNK)tSWAPAPEPTSPAPSSLINSiTIDI'IISCFOifJG.DIi.Rf
RWI
Vlpl4iIILPK$KGDRBKLBDAI.SSLSLmPTPHNVSTHEE'1'CQ


STLQTRAASATPAYAPVAAV1'Pl'PISTQE'IAGNGGAIYAKOGISISTFImL.TFKSNSASREPKVEAMtGKPQVSY
KETI'lV9CNSCI'KYVKQSOCI~pYANVCLEIEPNEPGIt~IIWS


VDATLTVDSSTIGESGGAIFRADSIQIQpCPCZTLFSGNfANItSOGGIYIIVCQV1'LEDIAKIVOGYIPKEYIPAVI
IIGIEEGI.FtCCYiJYCYGLVDVKVSIVf'CSYHtVDSSF~111PKICGS


?JLKMlItiTCKGErw.AI'fTKKALTINNGAIL1TFSGZJTSTDNGG11IFASRiGITLSDLVEVANAVIGIiICRKA
KPVILEPIYDfVAVITPEDNLCiDVIGDI~IItRAGKII~0ES811014141MAEY


FSKNKIGtiYSAPITKA1~NTAPW5SS1TAA.SPAVPJWN1APVTNAApGGU.YSTECLTVPLS>QQ'GYTTSLASLTS
GMTSTI~PAFFAKVPOKIQEEIVIUI


S~vZTSILSFFIddECQIIQOGCAYYfKTPQCSDSNRLQFTSNIWIDEGGGLYCGi70lITLTNL
.


r'IfTLFOENSSEKI~GGLSLASGKSLTNTSLESPCLN7INl'AKFai00G11NVPE?tIVLTPI'Y636174
ti35d98
CPt1.0551


TPTPNEPAPVQQPVYGEALVT!GNIASKSCCGIYSIaIAAFSNLSSVTPOQNtSSH~EiGALLrs7-S7
Ribosooal Protein


?QKAADKTDCSF1YITNIINITNNI'A'1'~IAOGIfANFDRIDNLTVOSNDJ1R1000GYYLI1YNSRRHSAEIGtDZ
PGDPIYGSVILEXFINIM401GKKSVJIRIfIVYSAL~tPOKIC.ti.QY


E'DALZLDNITGSVSONIATESOGGIYAKDIOI4ALPGSFTITDMNETSLTPSII~.YGGVL>!T!!GEALHiAKPILE
VRSRRVGGiITYQVPIIEVJI3iRANCL.RIpWIIXNARBKPGIt~E


GIYSSGAVTLTNISCTFCITGNSVINTATSQDJ1DIQGGGIYATTSLStZtOCHI'PILd~B1VGLI1TELIDCTMtOG
ATZKKREDTtDtNAPaNKAPANYKW


SAATKKTSIZfOQIAGGAIFSAAVTIENNSQPI
IFLNNSAKSEATTJIIITAf~IDS00CAIA


ANbYI'LTNNPEITFKGHYAETGGAIGCZOLTNGSPPRRVSIADtxiSVIJQ4~1SALN11~CPIL0552 636698
636219 .


IYGCTZDISRTGJ1TPIGNSSKHOGSAICCST71LTLAPNSQLIFC~RM'L"1'1'ATfKASINrsl2-512
Ribososal Proetein


NLCAAIYGMJETSDVtISLSAFHGSIFFKFM.G1'ATNKYCSIAGIiVKFTAIP~1SAGK11I5IQaOYVPSSSFJ~DC
PLPnGtALI.YZSNLWVItLKREEYFIPTINQLIRKRRK8SWH0IiPA


FYDJ1VNVSi'KCl'NAQELKLNEIUITSTG1'ILFStigLiiCd~lfIPOKVTPAlIfi4t.ILGK~LELQKCPQKRG
VCI4VKTKTPKKPNS11LRKVAWVRISNGQEVIAYIGGECIDJLOENSIVLIQ


LS11VSFTQSPGTTITNGPGSVLSFRfSKFrIGCI11I1ZiVIIDFSEIVP'J'ImNA'NAPPTL%1.aGRVImLIaGV
RYHIVRCILDCAAV10iR1(OSRSRYGAKRPK


VSRTNRDSKDKIDITGTVTLLDPNGNLYQNSYIGEDPDI?LFNIDNSA~iIYI'ATNVTLQ


GNIGAKKGYLG'iWtd.DPNSSCSKIIL1(WfFOKYLRWPYIPROILtFYINSI9>GA~SLVTVCPeL-0553
637753 636812


KQGILGNI4ttaWtfEDPAFNNPWASAICSFLRKEVSRNSDSFTYHGAGYTMVDiUCPItQENo sobust
t~moloq prwnt in GewbenklENBL
as of 11/7/98


FILGMFSpVPGHAESEYHLONYKNKGSGHSTQASLYAG14IFYFPAIRSRPILFQGVATYGCl6rRWLRFLIIFIt.CR
AYFPLRASffSPSWETSTCLT4'LGIPFIDIIL'1'lliIDFVAOCG


CYNptIDTTZYYPSIEEKNMAIiWDSIAWLFDLRFSVDLKEPOPHSTARLTFYTEAEYTRIALQIGTISSTNNAKIKEI
FLIYKEKPPE71SISTKRKEIWLSQSNLSDfGII~MU~!!YA


OEKFTf:.DYDPRSPSACSYGNLAIPTGFSVDGALAWREIILYNKV511J1YLPVILRF4iP1U1EGNIIFDCflVGPA
LKOPKDLRLVLRCPNQPDTLLYSpIFJIEtOCIETNTCLCNOGIfTIi.OCQL


TYEVLSTKEKGNVVNVLPTRNAARAEVSSQIYLGSYWfLYCTYTIDASl9TlLVpNIIFICtiIILYGDSIEKFLKETK
RIO~D4HTLVDLCDSOVVT1'FLGRFWSLI3iYVpYLFLSEDSAKILAG


RPVF
IPOL110J1TQLLSRTVPLLFIYTNDSIRIIEQGKESSFTYHpOLTEPILGILIGYZN~t


EYCPNCAOSSLGET


Cpn_051 617137 628003


Solute binding protean t-yebL-SynechocyscisCPn_0554 637506 638111
Adhesin Hasologl


NNRSSYQTAFVNHICVIVFIFLTLYSLKSYCNDVIDKPIiVLVSIAPYKFLVDDIAEE7rFVCT440
hypothetical protein


YAIYTNHYDPtlTYIrS.PPQQIKELRQGOLSiFRIGFAFEKTCERNLTCOQVDf.SONVSLIQGVFSYLLLCIILVYY
RFlfIfEGKSRMASPTPGQLHLpQIfVESIQ1YDYSRSLANIATALLFfI


KPCCTiQIfITNYD'tffIWLSPKNLKVpVETlVTI'LSKKYPQHATLYQSNGEKrVALILSCLSLLPOVFLPFSCAYF
IIGSFL71PI71LGILLINCVCDLKOYLTSS
r . .r nQyf,IE


E:LTITSKAKQRHILVSNGAFCYFCRDYNPSQliTIEKSSHVEPSPKDVARVFRDZEOYKI


SVILLEYSCRRSSANL1DRFNMHTVNLDPYAFNVLVNL,KTIATTPSSLCPn_0555 638298 640241


cap-Tail-Specific Protease


~Pn 0542 628000 618737
NFVIRfICLVALCWLLSL.LPNVLPSSDLLREDCIKKhI~I(LIEYNVDAGEVSTDILSRSLS


ABC Transporter ATPaee
SYIQSPDPHKSY LSNQEVAVFLQSPI~UtRLLIQJYKACNFAIYRNINQLINESILRAROW


FYTIRILAEGLAFRYC5ICGPNIIHDVSFSVYDGDFICIIGPNOGCKSTLTM.ILCLLTPTRNdVKNPKELVLFaSSYQ
ISKQPIpYISKSLDEYKORQRALt.LSYLSLYIt.IIOASSSRYEG


FCSLKTFPSHSAGKpTHSMIGWVPQNFSYDPCFPISVKDWLSGRLSOLSWHGKYKKKDFKEEOLAALCLRQIENHFNVY
CL;INDNGVAN~iDEFJIYOFHIRWKALANSLDANTAYPBK


EAVDHALDLIr'CL:aDHHHHCFAHLSGCQIQRVLWRALASYPEILILDEP1TNIDPDNQQRDEALAYIRIOLEKCNCG
IGVYLKEDIOGVVVREIIPGGPAiIN&CDIQiGDIIYRVDOImIE


ILSILKKINRTCTI(IIVTHDLHHTTNYFNKVFYMNKTLTSLADTSTLTDQFCCNPYKNDEHLSFRGVLDC:.RGGtIC
STYVLDINRGESDNTIALRREKILLE~IRVDVSYEPYCOGVIGK


F,~.CSPH
VTLNSFYEC~~VSSEVDLRRAIQCLKE1WLLCLVLDIRfN!'OGFLSCrAIKVSGGlInI~IC


VVWSRYADCTNKCYRTVSPKNFYDGPIJVILVSKSSASAAEIVAQTLCDYC1f11LVK~p


t:Pn_0511 6JR7f0 e~?603
TYCKC1'IOHCTLT'CD.1SOOOCFKV7IAiICYYSPSfiKSTQC.QGVKSDILIPSLYAEDR(A3R


INeca1 Transport Proceanl
FLENPLPADCCDNVLHDPLTDLOtOTRPWFQKYYLPNIQKQETLYIRQILpQL'l'IQIB~Itt.


K~IF?IIu.SLLRDSFPLLILLPTFLAAIw:A'.iVACCVFIC'I'YIWKRIVSISCSISHAILCCSENSNFQAFL.;v
IKSSfKTDLSYCSNDIQLEf
iINILKDMILL.QQCRK


LJLT W IQYKLHL.iFFPMYCAIVGAI FL11.CICKIHLKYpEREDSL
IANIWS'J~IAIG I I


FISRLPTFNCELINFLFGNIt.WVI'PSDLYSLCIFDLLVLGIWLCHTRPLALCFDERYTA~Pn~0555 e40921
540325


LNHC.~VQLWYELLLVLTAITIVNLLYVIY..'TLLNLSMLVLPVAIJ1CRFSYKtn'RIItFISVLcrpA-ISkD.t
wyafkane-Ractf prOCein


ll1t11:.~.F':rC:ICIAYCLDFPVf:PTISLLHCLGYTASLCVKKRYNPSTPSPVSPEIHTNVENGNSSNLHFI:C
fCf(1AMPESVLNIVEEIM:CSVTACLQ/1IT.~.sl'CI1VNLLLCWAKT


N F iOP I RE.~sIfLFQ.~.RM'Q ITLLVIrILLWACLACMF
I FHSQLCANAYIiLI I PMIGLIK


~'tm o.44 eln5.tH .;_9525
LLVTSLCFDE,i:T.~.EKLNVFQKWAGSfLED0LD0.TLNN~NKIFr;rNKTEC2dI'.~.RA'1'1'pVL


ylurL-t:TH tricuJinu pruc.:tn ND(:RGTpVL'I'LV::KIMV


K.:;:VFY': x; t K >::FFt:LNKDKNV
t NFVI~' LTLELRAGKrX:Ia,'WAWRKfKYLPI!xPIfOGN


.:.:WY:::'lltf\TT::VP.:FG\'iRNIFF1.I;APfl:67::~vATtMRTf:R::CKDLIV3VP'l~l'LLRDr'
.Pt~ IIS~~ ..d.'.H'!9 n.Atln4


:\F:ha:IlJIDI'IVIya:IlLLV::aa!:t:KrX:Kt:M'FFKT.'IRIMPTKATF'r:KPI:EIRGVELELKLryI
ICH~.!.ifei .y.:l.tn.:ktrh nMt


I:\nua:r:FlNU:K::ft.l'N'CI
IIYfEVh't'..AYPFT'TLAP3LGLVLc:KURLYOKIMIIADIPEIt'M::Y.LIRItIITJI~t:P::YW
:r:FA:aX:IFJ1AVAF:LITKIVA::At:TKI'AfNItfPAKKVR


::th7:AIY-
MKI:U:Id)F'LItIIII:Is'TLLI.i.FVIW::KREP1YSPEEDLLTLIItELIL:HOPDFEKLVHRNKrJfNF.~K
.aa:\FCnKEFYIt:EFf:Rt:VINF.AfjQE.:I:Yr:RLYSYNVNIA1'NVRV:Q:;


!.:I*LVAIJIF;IDIH.1.I'1'F.uI:L~'Lvcil'QNRF'p.':'f'PFVLI::ri:fY;t):VIa:LYRFFTr)R
WV'/Pb'YAY'1N:::C'ff Ilal
1U:KKV.'/fAIVI'lYp)LPr:EAEI1I::.~.UhETTI'1'::I1:KLVWF.IGtI.


r:N:DK<:Y ITLWI.'KILK FI a t'F"PMTVr:M'.
PFLR::Y'f Kl'.f:pPA IC IKytl:ffY.'N.'LHt:fM'


n.4'. .. ulInN n au.:f a
'IKIEYVtfh':::iIANM.'11.7Wf'Vfil:'I::IIA:X:QHVI::FNLC.DNRfr:DKKVFTVF.FY:PQIrIN:

~


n iJl'fIJVA9"Il'1'.1;I:IIM.':'..WL-PI-
I/IIF.f?HJVtII:Y.IIOW::YVCKF'V6'Y::f::V::NI'tlUf:/Lll
f.'/ L.:f n nl.c:uwwt Im.w.:ttr


'I'I?r\HI:FHVYtnIIYYrxa::V:l<FY:RD::Y;:KKL::%KVt:N:rJYV::T:::If.VIa~N'fFYItIPAp
NIK:OWIIH/TLl'::1.-IYLYr\IW
:I:It?:IIY'IIWHIYFY4t:W:FTLpFNLWKAi,N17ltF'1'YY~VAV


!n a<IAII'LI~ ALVI t :I \'4TINN'1'tllt'1'Y'P::E:~N't: t'r."T:t: AI.TPfIRnHb
I::V\'ft7UL .I.AA'fllW'1L171'tlDt' f CV~!7!?7fV'!R
111.?NMH:AFl'lltM:


1.11 *Y::YEL<,rF I A:::::f"rN. PP
I: r xrrrn~ f~Al.lY.I l::fKE::VEF
:VTI Jtt: I Als xtAM:FA 1


98


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
vLNSNIfrKStrn~.AY~;Df.Fr.P'. .suz'rFL.LL:.r~::LFF4-::F
A:.:F:-A.:..wc-Y


~.:::a-r:r: ;w::nTENnr~f EYSBlIJIKAKNHYPLa"CF.SAf
:3FLFLrIGiFG.'.IRW*.'.Ltt:FFDA:.Pi.::.:.~
JWIIIIWStF


RVIUCS'!'ICALOL~1.SIT.'YVLIRIIIII~ffVLI.~EPYC~~ill~~.
'~"I~.A


Nn 11.'.N w4 f t4w h13031
DIICYFf'GKAT:NKKIAf0I5PNKTVOCFYAGCGCATLISFTFFtfOSPTRFAS1'lfIIMI


uIM:A-'lk0.m'Ync:me-Rf:h
LiPOProtetnLIPLCLALG:~iFFGOIIE:a,:FKRDANLKNSNKLKAVOCNLDTLDiLI.:.STPIAYLFL:.:
~
RIVDCCFEOPCAPSSCNPCEVIRKKERSC~TIACCSY
~
'
FCCV'!
:
'
:
C


. TOSKEFIG
.
.
.
.
.
.
.
KitIKKAVLIA.W
'


~N:~ SPpVKr.,CTSPOf:RCKO
'IPSC:,Np:f::1


r~ nr,.;.t : ~ !7qn f.41't?7 F'Pn ~S6q w5'1905 riSRlA1


:.'.\ ;'1. .. : . 1..' .. , s\.
. i... ; ,1 '
r :
~ -
'


I
, .
. ,,."...,.. .,..; Sr~,Iw.-4~4'KF:.
. . . I'.591M:..- , ~.,~:\:.v.;s~.Vr-r:
Si:l ,. ... .
....,. . . ,i':-r: :'.?IF::\11.::.::
rh
:~:
\
:-:'
~
a:

''.
::


. EEPPFSfTFA'1'r','JPLE.iFF:7GHL:.TS:..'
,. :':GTEVANA/LStL~tiLPEVRAFNDDIQRRYAOL
..........,...........:.
.
.
.
.e:u:e:r~
.
:r,:;
:


FFLARSVFlIrCYNTNL CML111CGRDlGSKVFPNADLlC:
TLT3SPE1IMORRLKDLPtExr
rCGSPOpLQAELVKRDRAD


CPn_0560 b15666 611098 AQRAHDPLVIPLCtCIVIDSSOLTIROVLEKILIt.LFRNEL


4ltX-Glutatnyl-(RNA Synthetase
RNSRtQfX9CSI~ISKDKRiNMrFJJVRVRVAPSPIGDPtIW'1'AYMALFNLIFAKR1KG10fILCPtI..'0569
6519)98 659099


RItDTDRTRSRODYEaIIFSALRWCGIWDEGPDVCGPYGPY1~SERT7CIY0GYVf1'LLKPIsC-Glycerol-3-P
A..-Ylcransferase


'fDCAYICCFA?P0EIJ1El0tAVASTLC7fROGYdIRYRYLSPECVASREAk~QPYTIRLltVPLLFGI~IKTSSG1H
FSFfISKRANIFRICKFl7wVAFSLFYKLKVYCHDa1tI10GPAIIAV


SCE<1ZEOYSKGRWFPWADYDDOVLVKSDGtPTYNtANIIIDDIILJICITNVGPGEEWISSNNNSFLDPIALlOICVN
ECIHLARASLFfIIPWWICQ41CCFPVRQDI~tSAAFKIIISRi.FN


TPKNLLLYEAF~TIIEPPVFLIDIPLLii~POCTKLSKRIWPTSIFYYRDSGYVKEAtYNtLTLKRIOC.VIYPDGAPS
POCQLOPGi(VGICI~tAAKSRWIIPVYIRCTPEAFMINQKIPNVWK


HCY~EEVYSLERIIETNPRRIGKSCAVFDIOKLOYI~iKtiYII~NEGSP~Li.ICa.OTITCZtFGTPMf!'DDIION
PEIKNKkTYQIITNpI?IIKIAELKAWYtSDCI~rDVP


GWLWDEFFLKILPLCOSRIZ?GIILFINLTSFF!&GLLEYRVCELLPQAISPOGAILLY
. 659011 6607e9
7


SYVKYLEKTDOWI'KEf~S'LGSRWLAOAFNVNNKKAIIPLLYVAITGKXpCLPLFD6ItILCPn_05
0


:~KPRARAALVYALKLt~CGVPKKIJ1J11YDKFNDR~'CGT1DLsrQS-ArQinyl CRNA TransEerase


'
TKLPSSKfIGt4JRGAlftTSPKLMSTLLSIGSVICSQAIAKAFPNLF~WAPEri'PSTKnIIG


0561 616107 61SA71
NYQQIDAIOtWtVLIUtAPRAIAtAIY7IE:.POEPFSLIEIAGAGIrifFTFSPVFtI~.EH
CPn


_
PKW1LKIGtpV80PKKIIIDFSSPNIAItDIONGNLRSTIICDSLAItITSYVGNWLRiJttt
euo-CHLPS tuo Protein


CH7ICtQttDCGYELEtREEIEDIKDSDtKWV5IT0AAXLWVfRQAIYVAIKOKKLKASKEIGONOTATC14.ITYLOE
NPCDYSDLEDLTSLYKKAYVCFINDEEtKKRS00MIVAI4AK0


TRWEIDIKDGEYKIOiRYSRXKSLYOGELVFONrKDCYSINQVAOIIGIPVOKVYYATRTPpIIIAIW1JCIClTSEKA
FOKIYDILDIWOQIGP$FYNPFLPEIIEDLOfIIfiLLTVS~A


GTIRGERKG7IAwVINVSEItR7fKNEYLSKOAAKKLKGAEPKFJ~APNfEPPTEIFPLSNKCVPNEAPSIPFNVOKSO
GGYNYATTDLAAIQtYRIEEDNAIXCIIIVC~GOSLiitnLGG


TAIAPGYLOPGItSNVGFGLVLDPOGKKLKTRSGEMI1G.RFi.LCfAIEKAretr
ssItRPE


0562 618051 646918
LTDEAIQERAPVIGINAItfYSOLSSNRTSDYVFSFCIflS.RFEGNtAMFLLYAYVRIOGIK
CPn


_
RRIITISOLSL>a:PPEIOtPAEELLRLTLLRTPL1L6STIKELCPHtLTDYLYNLTNKFND
CHLPS 13 kDa Protein homoloq_1


NYKVINSI11IJ1RLD7fAAILDtOtPKPSIANFSSEOARTSNE1GWANPYLYRLLEIIWGYVKFIRDSNIOOSPYJII
tSRLFLCAI~1E0VLATGNHLLCLKTLOtL


FLLGLIFFIPLGLFWVL.QKICONFILIG~.
TIFRPICRDSNii.RtNIYAARLFSAStOWt


VSSVRRVCLOYDEYPIDC'h.ELRLPNAKPDRWNLI~BDCLEYRTVI4GA~fItRIAECPtL0571 661179
660719


ESQSIJILIFNYPGVMttSpGNITRIaNVKSYQACVRYLRDEPACP0AR0IVJ1YGY5L~ASffslu:A-ODP-N-
Aeet:ylplueos~nine Transterase


QAFaISKEIA0G5D5VRWFVV1CDRGARSICAVAKOFIGSIGVWL11NLTNMNINSEKASImTIDtVNVSFSDFOiIKC
ERRNQI11QVFCCGRLNCEVKV,9CAIDIMTKIi.YJ~LLROpKLTL


LNCPELFI7CGtmSOGNLIGOCLFKICEZ'CFAAPFLDPKNLEECSG1DCIPVAQ1CL.RNDttILRNVPDICDVSLTV
Et.CKSLGtIiIVSwOKETEVLEIY1'PEIQLTRVPPTPSNVtIRIPILLIG


SDDVIKEY11GNIORH1DN
ALiGiICPIa'.rVYVPNODW1IGFRTIi1tt11tGLKOIGIfDISSDSSGYYAKAPRGLItQNIfIN


LPYP81ICi1TEtd.ILIIAINAIiGRTVIKNVALFJ1EII~LVLFIrDXAGAOITTDNDIIlIDIfC


CPn
1'OGtCwSVOfITILIDKIEiIASPGIIAAWla~GGRVPYRNAKpELLIPFLRQ.RSIC001LVSE
0563 650117 618293


_
9DIEFtQERPLVGLWLLTOVNPGFL'IOWOQPPAVLLSOApGSSVINElVN~ILOYLIIG
reW-ssONA Dwnuelease


OYKNLWDFSPKCPCGIKFNTNSCNASAAGLLW71NPKEDPAFILalIIIKFHLPPIYAOIFiAIOGiIECOLFHOCLS1
'KACRYAIGNFPNSAVIHGATPLWASNLVIPOLRIIGtAYVIIML


ISACFOTIOEIHKFLYSHLSSLYDPGLFLCIISKIIYFRLLLARDRIttNVItIYCDSOV0~f1'IAmODSIIENTHLL
DRGYTIKIVDKLRSLGrIICIQIP'DlIEpEELITSPKSLALRMiL


GVALLVEFLRDIDVHVSYFFLGAILRQHCITSTLIAIG.KLEJCITLLI'fVDOCITAWCNS


OITPQCIDVZITDIOMPTG1CIPHCV11TLNPKLRDtfIYPNRILTWCItlU9Q.ARGVti~tItbICPt7-0571
662719 661616


SRNLVPKSOCSLKICIi.DLVTLCfITDSRrIfLiGEtittVNVAYGIKEIARGiIRPGLDMt.CALCCTIS6
hypothetical 0rotein


CVCKSEVTSTDTVLKIAPKLNSLCALOOPA10GVELL.LTOODCRVD11LL~TfINRERORIMAAPINO~ITO'f~Cl~
'~SLGEHSVT!'fGSCAAAprI'~11'V'iL.IAdIDpE


IGEVFODVOtII~tSNPEILtOMIVLSSTAWNARVIPIISARLattTYtiKWVIIAIQRGIAS~GSAVSPSACNSfSTL
PPETGSLGATJYpSApSAGLISLSGRTORaObEIfSfi0D8


IGKGSARTICSFPLLrGYLKKCSSLLLSYGGNDtAM.llilatiDIM>mtICKKFVHLVNfSLKSISRT8SNASSf'.E
l'SRA68SPDtrcDLDSLSGSERAEWEGPtDP'GGLPLSIIPNYDII'01f


)CGDTLPtQ.EIDAYADFDAIOYDr ~n
tEPtGIOf~EItPIFYSINROVRYPICVLP'~iNLASIIJ1PLIOIPAVOOR~O1'K~iHIVYVDE'J1R&SFIIIIRN
GOWSTAFSIXYBNitkTK14QT1C


KLYLSQKERNLEGYAFGLGRNADALKA.SWNYPLEIAYTPRLSOTSCSCVItiLLVRDIRISPADLDICIAKFCVCYET
INSOifI'GRVKPTI~ERSG711~IYtptIJG.SNI~lt1'AWYORIIA


SEPRPSD
KESS>iGYTPSAWR110A1fYCICPIWImVCGLXGIIXiKITPAPDFSFINLTP00GRNliblfl'


CPQlWGATWPNVNIIIrtGGIKVDI~iIHt.CGITTMrTI'F.~DDD'fNITSI7tST81001NS


CPO,
ISS1GEOSTIEED'!'IOtDDPGOGFDDNAIPCTNCPPPPf'P'
0561 651759 650115 PPNLSSSRLZTI~N~1I


_
t.~iVlYOtdXtAYDSNG~SISDLNQOLCQVtrIGtStNDVNPPIVILPttiTGD1'DPb00AtGG
seeDiseeP-Protein Export Proteins
SeeD/SecF (fusion)


SGAMWKVKRNFAIIICVPAIALYYVLPTCLYYAKPLDRKIDGNBAEHIIKSFTIWAppVVTEOOGHIIINIIORNTOSI
GOSLGATPTPOPTIJUCIVTSLPI(ANVSSSSVLPQPQVATII


R!(OVIPRVSAILSSIJILRGNI00HPAIPDIVSVR1KRGEDAEDFICNLVIIDEPNVPIKSATPOARTAST51TSIG'
ICI'EStS'ITSTGTC'LICSVSTOSI~ICfPT'1'1'fRSCCrSATIITSS


RLNVYCYSREHDDNVIOVASSINISLVESDFSFVSYSSIQ~t~fll7ISSILORVYSACTiPKAS'IOTPQAPLPSCfR
HVATISLVRNAAGRSIVIQpGGRSQSPPIPPSCCC10t~11Gi1QtJlA


OKOCSCSYPSIWETAPKLOL:QYAIDiLSSGFEVFSSRLSAlCOOSFSSNQORtJIFLSRLSMbOVASIL.GQVVNQ~l
ATJ1G50PSSRRSSPTSPRRK


SLSNDA71IDVEDOKLLKSVYLTLSpTIICIRSLOCPYIEGLRLDCSE$SL11SSIIYCPKE


RKIFLTLHSDLLAORTSISKEORLDFD.SRLAVEKptLSKNLTWVEDYlrc1C181pW1~CPn'DS77 665117
661691


TQCKIILOGERLLOCIAENLTALTLHRP71AESCDLIPEN1PVFCAQPRESiAFr3CYIFSPYe6C fauilY


NTOCKHFSKGSVYILGKGLRSIVAKYOpCCGKB.OSFGONLYNCFSHTFJII~EVEdIACfISKWAttI'KHRKERADH
KKGKIFSRIIKELISAV1G.OCADPKSNARLIWVIpKAK


OpvLEIRHPLpQFLDVwGECFVICXLOCAFLEVKDIODRLIiTVNOItKNROSDLVRNNLQCiNIPNG~tIER(iZlvt
A?SALOKNFE6VPYELYGftaGVCIIVFJIIffONIQiIlTASOIGIIAIN


YRHAKCSMDLQERLSAPIPYONLFLLNNKI.fA~fRKISI~HiILRLGIDFVOGROLLLSFKDIOIOGSLVEPCSVLYN
PARKGACTV11KSSIDEEYIFSYAIEAGAtDLCI'EDEEIJFLVICAP


HOCKOLTDKEDILKVSDES.CARLNKLGVSEILPRDGDYIHLSVPGSSTISSSEILGTSKSLL6SVICLKLISpGATCS
EDRLIYLPLRLVDCDEKDGtAMALIDWLEOIEDVDOVYtC~IN


tiSItIVVNERPSSYS716RYEVDAFLDYiJVlt1'SDApGKTBPttIN1'111SALFNEEVDVPPSVS


HEAITKLKSEGt.u'SPSGCETPSTDLD1TFSNIAIGKOALOKANPLVIVFRNYALDGASL


KDLRPEFAAGOGYS2.NFSV%DTSPKKttAEKLSPttStIfllvfSAYCOi7GISCfANGOYS1WCPn_0571.
665979 665794


aGWRNAWIDCYNVSSPILNVPLKNttASVSGKFTNREVSKt.ASDLKSCANSFVPEVLSEENo robust
holnoloQ Dresenc in Genebank/ElOIL
as of 11/7198


TISSDLGKKpCI'OCIISACCCLAMLIVI19SVYYR1CGVIASCAVLWLLLIWAAirpYLDASAGGIRNPIVNVCIYLN
NFORYLSKYLYRVFRPPCRKKTFLSSHRVLARPSFPVDYCPG


PLTLSGL1GIVLANGNAWANVLVFERIAEEFLISOSLKKSVEKGY'fKJIFGAIFDSNL'1'1'KIYDLQETYEELiI1
14LFOGALRLOICWFCRKJ1TRKGKSVVLGLFHENflDLIRINRSI~RQ


'JLASALLPPLDTGPIKCFALTLILGIFSSNPTALETfIICFFFMLWl9rK'IOHTOLNNNMKFVEIPRPfNEYLVYHf
HVNSVVPREYSLSCRSIFI~KItFKEYEORFPLYWU1VAWEfDINAYL


:IKHDFLRGCKKWAVSCSVFLL.CC:fALGFCA4MSYtGNDt'!(GGYAPfINPKEHCISDVALRCYXIfRVOCCYCRA



OMRCKVVHKLQEAGLSSRDFRIOTFCSSEKIKIYFSDKALSYTKADTSLSPKINDIInr:..r


AVGLLSE1'GLDFSI'ETLNE1'ONFWSKVSSKL&KIWFYOATIGLLGALAIILLYVSLRFEWCPn_0575 oooSZ4
56598?


'fAFSAVCALIHDLLATCAVLFIAHPFLKKIOIDLpAIGALJfIYIOYSLNNfLIIPDRIRYhhY-Nnino Jroup
Acetyl Transferase


FDROANLFTPMFfVLVNOALOKTFSATVM"fATTLSVLIlILLFIGGSSVFNFAFItftIGILSIFGRVWRSFHTatIC
ONTGILGLEIRYTLPSDATYMLKWfIJDPKILACFPIOTEALIRCT


U.TISSLYIAPPLLLFMIRKENRSK .
VNPYNCPYRYHSSLTAV1'WuNVA4IfATLVWFYVKVSNNALISIIVGEEFRNKGIGTJ1LI.


NNLIHLAK1'RFKLEVLYLEVYLGNPALHLYORFGFVEVGRONRFYKDEICYt.AK'1'ITtEKD


CPn_0565 655741 ti51531 L


r."r94.r hypOthetiCal protein


NKLFCFLIFC;FVNISAILFDSSFLLKIKRHSKRM.RSttKFPRISISDLIPfQMVIWw~GCPn_0576 X67513
5b6491


~NVNYVtrNAOMLPKKILGGVLACFCLALLCCMFAAGVCOTIFPCiCt~IILGLVLLGFAY.prtB-PCPCida
t'.hain Release Factor
2 Inacural UGA tranle-shift 1


t.QYSKn.
iliRPERPLFRETKVFEKPINWIL:CLSLLQSWKKIRPGCYYMPGCPOVEICDGSOMpCB.DKRLE.1LRTEISLWRSL



EIVfKtFOKK~DRtPfSIFLt0EMD0IALROCIEKSF.LSRKTFALDPSWSSLLStIOREE


rJ~YLGPKVI~k:SEDOA:iDRTHPK,iAIYVNISDJ1A(tEPQCRCYIDAYTKAFF'CVLDOIGDCPn 057ri.I
ne7S!:1


IMIVKKIrrtYVLTPILGVPDALPKELOENLKLGSOAAFLYSAEOVAKRNREEKODSIRIKPrtBIn.tcur.ll
UCA f:.ameshitt:
1


F I FTDi'T:: fT::L'f F:ii'IOt: :".'rI'H.M'PI:iLSCFVGEOE,SYTFAMUEHt.DKRLF 14RTE
I:iLlIR:::-


.'ftl 4S.h W hntr r:Sf.9911 CM IIS'I7 ~ ~..:Hrn, 1,l.Nl'.S


~.1.:: t.,mi 1V :.WIO 1'fM7.11 .axnplrx Pcrrn.in


I:IrIYCfAl.Itlld'VDI::11'rNNAE:Y.FP::LORLPNHVAIINDCNRRWYKKIIREECGHTHT:E:a'N.'iV
KtIKN::AFiOIrVMi:."lGIl.VLV':Y.t:I~MrRTEIVYKVWF:1'IKKIItK:OrQKNKRNIL


alYYr:AY.Vt.fYItINAVtA)f.GIKVLTLYTP.:TENfT:f.PKEEIOEIFNIFYTOLDKOLPYLMfGANIaKVFY
L:'Df'IDMF~YdfYLL:;RIIIYK


r7lK U: LI!r' I::Irt.::Kl.rKl :IQTK
IMIV::RMTA:iF:iRLELVLAVNYrX:KDELVfiAFKKLIiVO~:r
' mh~
~157A '(agy,..H
Ct


II.tIKY.I::::nlri::F .
_~t.l::::YI.Or:r:LTOPDLLIfeTr7:EfIRV::NFLf.WOtAYTELIITDTLW. .
I'IN-f1,rld.fl:ItHVYIrhIt::RIIIi:K ,
n
y.mr1 ptln:a.htnylcttl.t;:.


TTtk:YIVLt:;I:aATLPIL\F'.:4tA:a~
II:fTMI.frPfAIfWRI.PKK11A1tUk:LHIAttI::WJI


"Ityr.r./ ..'.nN14 e.S7Nl~l
VIIKRVPPXFWKV::K:II:NP::fIU.fVFr~:fN.lf:IrARt.EUKERLA'rFtlffL.filrItYFAII.


..,p:A 1In.::l.ll.ll i.l.ll . 1'yt
r.YOIIYY:I::YI::RN'fK(:F:rft'IIf:F:Y::I!I'tulAI
t.lylyl r.ln:a,.r.,;,. IA
VhY~t:LF:::Sr::1'ItYOttR.T1<al:(YIMII.


99


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
LKLLKNCT'LTLi.NNfiHVIPNTWIVGIIGDLP.1R1.._dEOAFKNYDPSLPr'.LLrLSHNPDf'':.FLYriDiR
I~FACP'ILFFFi. .3:SFi.tli;L:-~Vltld:..;E:u?.'Ff".'IIRf".:F:KSYP~i
'


:TRitwYf'r:DFyL"~'HSHGPOVTLWIPK/MKFFERLSGLC~IPYLARCIFJTXII7GKDLYV.rCIOIC!!
NHEIDAORKKRYEFIIL1GEFPKLTW'IYtt::iFrfILRAK.~.Rt:VVISLYAWFiCS
'


NRC<LX;LKR IRFCSPPEICYLl'C3Y0 C,P~. I
DFRaMNKGSTLT'"rGKLRI~C~II .[dtr
' ' "


YfiNDLVCFSEVIL'SFHV9~E(7GTLTFS


CPn_057'r ri69110 569993
CPc: 0591 5901ed 6:tt030


yqDP)yteNiudar NuclaDtide Phosphocylase


KEPJI:1PLLK'w\T:rNVPHIKS3LiLL.iOCOCTRFr'~SKlPKDYLPtIJCI'FL'fLHSLK-aLSSYatiE
family
'


.'.POfAEV~FI~.DP.~.YOETPpEYPVSPAIPGERR0tI5VFSCLCOV.~YPN'/SIHDGARPFIY~:.IHRCTAIC
TVA'rtIIJNLfIILLKPRYFTtL:iREri
UHi.D,?."DASNDLAIFPPFGYAV
..
.


ia'1 :r :'.~:If:'!'." y ..,(yeT:.:'nLYlir\lil:..
_ ._:!r~n::.:. A
I 4 :.':d.~...F,rrt,r,.:~ .. . -..~...~ya:
. :
..
i


-
:.....v ...'.:.i:::~:F'.1 .;,v: . w
hTiKilC::: :.:'i i C.'.:.:'::~.....-. y. .~.
.:.'i~:i:.:. .. !. ... , .,. . .. .,. ,.....\...v;,
..
'


cAR 1NV L L':
HSDILDEPDFFiiWIPCfQAIYRL'W.iL..:
I


CPI>_0580 669936 e70793
059Z 66113= dd1161
CPn


ccuA-Pseudouridylate Synchase ! ."
ASSiI~IPLPRRSNDCFSPPKtKVALLIAYDCtAY~W000PN~SIQ1YIE8SLI0fITKTYip taailY
'


RTPLIASvRTDALIfNA:lGOV7111FRAPOMPL!'JWANL.TKKALtIAILPKDIVlRDVALFDDNLYSKNFSIISFK
RFLpOIPVItICi.:..
IYLYpWLISPLIrCSCCRFFPSCSIiYAE0ALK8NGF
'


FNARYWIAXEYRYSLSRLIIKPLPNORiIp'L'YTP1WP1'STLt1~100~6LIGTNDFABFANUIOfIrLSIKRIGKC
,r.PNHPGCIDHVPK
"ALQLYLEFYQEID~DSSHFSE


iICRDYNSTVRTIYTLDIVD~.SI ICRGNGFLYKNVKNLVC7It.LDNG10CRYPP~G.LD


ILDOIOJRRtxpSAApAYGLSLHHVCYggPYlIIFGCEpCgVSTSNECCPn_0593 fi8119i 581391


CT171 hypothetical protein


CPn_0581 671533 670715
VLC'dtKCNAFKRKTRNL.t~QVLIL5VCL4l4.FLLLFYSAlFRImIYKLHLFSCPLIAKBSItIt


PMsphoqlycolace Phosphacase
VYGSISOASLODLISLPKDEItYMYGRPIKL41ALSSfAlASNHIDITPVL~fPLTY


EDLRNRSVKSFLROL10IYSI~GNSDEFDLCLRSCHYLEDYDVFFFDLDGLLVDTEPCFYRJ1tELKCSSVPWLLIrtI
IDLKDFFVILDYLRCNIfYPYTSIpGLFLLIKHYpE~IiVDEpCLYNF


FLpACAEFSLEV1MDFSTYYSHTnG'lEIFSKKFIFpYPWIQEYNAEIFAIUILpIYYKSLCSTPEFGYLRTLLVC7l0
90A5SVASLARNVIRCCSERFFNFCNEESRTSNISA1'O~KYL


r:NAGPAL'IPCVEAFIELVLSWKTFCVVTNSPRDATfITLATllYPIIiIKFLFNVTRWYARKSYI~CEESLAALLLi
.VNDSGYVLttEFCDEDLEKVlRLNPQSpYSONP'1SRL~ISPRIIE


PKPYGDSYDYAYRTFAREGMIVIGFLDSVI(GLRALSKIPATLVCIN9171EI?FLDYPELKLAOISCQRVGPRVpEDO
DEEWVODGDSLWLIr110iFGIPHDKIIGKNGWiNRLFPQIV


GKE!'!'SYPSiDVLTEfi<517QKLL LKLPAKQS


CPn, CPe1,.0591 682517 681958
0582 671305 673177


_ phtYl'-phalylalalfyl tRNA Synthetase
CT165 hypothetical protein Beta


KNPNALLKKlONRLV100iDKMfVLYLOAMiIiJQKRIIR10iPINI'YHSSNi'1'ETRRLPTYYKNTCNY1'CVIVK
SLVKTSLRLSSNRIPITI.LOTYPSEPLSTICEILP)1CDNIGIGEIITfit


SNIVLIO.IILRIS'IVSLLTSCSFSKNSATCPIrfPERITSOKDCPVLLNPK'.FITISPPLYDWLYSFASVITAKIL
NTIFlIPN7IDKLAVATLTDaGIF)~BHIKCApNCEAGLIVAtuF'GI1KL


ISPNREVITAYSFYCRGpGNSIITPECVLYDCDGLIN8ITKLEFRYINPRLIB:VVRLLGQPDBP~AYTI
I~RALLELPGTPILiEDLiITVLC


OHPKVSIIGFCCPKHFHFLE71SGISLSDtJit.OCtA71TF71LDFPLPNE%I.LiiTIKKLYIDINtSLEISLTPNL
01~71SPLGLiIRBICtM'QANLVIPKtFSFENLFTfAt.p~DPDICFF


NSDPSLSNEIVTGTLTNPELRLTGOGSHTEITVCILDhZGOtBIEALSSAFSYWITCIS110PSPIKLOt$LOALKOKP
INJIIVDlTNYIH~LSLGQPLH71YDASNVAt.DS


LItVOt~'pESLTLIJKiElVLLPSGVWVRDONS


~PI>_0583 67239 672717
AYFLPEALRA9t7KLLPIPSESAYRFTRCIDP~WPALpJIdiIfYlLEIFPGTISPIY88


CTt66 hypothetical protein
CEICRBLIfEVAtJtPKTLORILGRSF'SIEILSOKLOSiGFSTTFpCtSLLVKVPBYRItaIN


IVLSFFIGK'1'KV'1'pRFiIOJERTLLLLWKIQOGLFLAILDLTQTFSSLT3PELEKYLKOKKEEIDGVEEICRT85
1:MIE'IDNWSCYTPIYKLKR1TADPLAN71GL.QEFFTPDLLDP61YA


IFLSCIDRVDLOIREPWNAFSSELPpDIGFELEEIADIfIIRILOTDK11NYA0KKXEFGIYLTRI~KEtISLOG810t
rtVLRSSLLPCLL.ItS7NITNt.NRQAPSVpAFEItS1WA10~8Q'!0


ERP
!1'OTG71ILLTEDCEBRSNLPKPSLSFYSLKDiiVAILLYNNNLSIDALTL6SS11ICEfllllf


QOCVLRIIDCOSFATLDOVNPEL7UOtAQIKHPVPFAELNLDIi.CKM,KK1TK<.YKB'YAIYP


CPn_0584 677659 673798
SSTR~.TLTVPEDIPANLLI~IfLLHECSKSiLCSITIISIYQDKSLETRNIOiVSIJILV/p0


acoS/DCrB-Z-Caaponenc Sensor YERTL&NDDIEEEYCRLVALWLLLTDf!(<.TINS


IRINJITlOIRKKRNLVFTPIV?OSKNLtIPPAYFZLEIKARI1'OSYKDISAILTAIPDGILLL


SITONFLIGNSOARLILGIDFiJi.EIGMtSPI'Wi.pD'ICiJGFSI0GL6SLINPRTLIIiSLCPIL.0595
681917 685926


CKESKFKEYELFIRIfNL;SGYLFIpIRDRBDYlI0t.~4RTERYIDtIJId3GKtt1'A2LlltltTRCT176
hypothecleal psottin


NPLSGIVGFASILIOtEISSPRHQPIIZ.SSIISCfRSLt~LVSSlILEYTRSOPIiiLKIIt~>z.QROYpIIBCOLL
FCVCYFANSCSAYASPRRODPSVIOQTFRNNYGIIV9001~KIKTBDG'tI


DFFSSLIPLLSVSFPNCKlYRE(i7lpPLfRSIDPDR!1.RVVWhR.VIOAAVE'ICNSFI1'LTLNTKVLKNGJ1TWE
YY9GGLLIIGtITLTFPIrI'L'ALDWOIYDpGfILVSRRTFFHGLPB~E


TSGDISViNPCTIPSEIlIaRLPTPFFTi'KREONDi4taF)IpKIIRV10GDI0LKT8DSAYLPNF~CIFVLTRNPOl
~BIDSD'i'I11CPYFIE'lTIIQCNVIEGSYTSPNCK7fSS8IN1~8DYR8


SFFIIIPELLAALPKF31AAS VF$SltlI PESZTHYpItCpPHGLItLTYLpOCIPNfIEE


1RIYGi~ODf.TTIVl10a3CKTSEIAYV10fr1IKEGLELRYNGOEIVAECVSNItl87FU8iE111fIY


CPi1_0i85 67518D 673865 AGDIOKNDiYYRORSVBGI~FJtiXi7UlG


siailaricy co Cps Iucl~,Z


ISLRRKILRPIBtPSlGDCS&M71TPADKSFT!'ppPSFVREIGSIiBiFVFSPLTLLEIEGD(ACPe1_059ti
685930 886157


IARVpDO~IliItTIVRVSLIILiILLTIIGGCLLVCLLPAVINFICDCLIAtGAVIF11LALIada-
ssthyltransttrase


LC1.YDSpCLPEELPPVPEPppIQIEDGRNETREVLEC1'LLEVLLKDRDAKDPAVPWWDFAtMiIDCfLIPKIJ00I8
4S0ACSECLLIAKYPPLAVIVHTDNNLWIC1'NLSVAPV18CLE


CEKRIGlE.DRKLRREEEILYRST
VADRLEITtRASYflIFVIGPIWiKANpEIWdCSRYAGMEtIPPFSSHFAKDLIPSQYLEIiI4CVAtIPPCEpQTYAE



DGINTVPSE6GEKEISALADLISLpppTVpOt.RSRID~OKRCwtAi.~IIHOSpKaIORAIAKICI'D'1'IIFRTVG
rIaCKG4IPfLLFFPCHRVHf;SHGEI1NYVI~rPVINEILLK!'D~LSY


N~tP~ISORACEG'1'EI4DCAEAGOLEKDLRAOLKSIIOESiItI~G1'INOOEKAWRRQItI~KLER


LOED4RLTGIAFDEOSLFYREYXE&YLSDK4DND1fIL0EVN718KSGNCLESLYHDYEKQCPeL0597 681215
686179


LEQKDMIWKAAAVNEEELGKpppB~fEpTpEIRRLSTTILEYODSLRGFJMJtDFQELoppC-OliOOpapcide
Pecmease


pQAYSRLpfEKDVKEIflLEESNIBIFAIM.FEKAQKENNAY1WDJ1DL.EGiWIP'CEIGB~DNQKHPSFYORFL571
YYKtd.LABLSWKFFISVJILlCIYAFLFASSKFLWTf4IICEIFFPLL


WVt.TDSASLSOKKIRELVEENpELLKAIaFKSNEISpLVADAVG&KEISKLREHIEEDKRYLFFPCYYTICPV~.FFN
ViJlIrl'FPFTILSFKLTRGWLRRWLLCiLCII80CNIFANAYBC


DGLRALDIMNAQAIKDCGAQRKCCDLESLtSPVREDKiIWP'>G.EffEt.ORLOEENApLRAINQDPALABNLKKMIA
FJfIfPINiSKIMSEt4IMLLPKC1'R15lp1ERRYNSTYLpiGILIG


EVERLEQEDFDG
KYRKKOGSVKKYOVAFEEf~QSPNPTLRIILiMOrDGICLKRLOQRVOKIpItPYEtIRpGJI


iINWITONYRPFWALTRIEHF3.NLIDYDiJWOQpEDLCIAYANVEKKAEPYKKBLLEIRpV


CPft_0586 675993 677193
LEDY11KLRSAISFIQDKRLWICKESEDLRILINPPFSSFiIWEDOWGGSRE?84KYVPIiWpL


atoC/ntrC-2-Component Regulator
SRVTRItDLtaAt.VFCIRIALWACit3ITIALAIGINIGLVSGYFGGTVOItII~RFTEIirtE


KEKINPSRGENHAIKNlLWDDEPLLRDFLSELLTSQCFIPDTAENLRN71T.ONIRSItDYDTNPVLFiLHLVISlTGQ
KSLLtIJIYLLCCFSWICFSRYVRIEVLKpRDRGYVLAATMGY


LVlSDMSMPDGSCLDLIKIIKDSSPNTPVLWTAYCSIENJ1VEAT810GiiFNYLTKPPSSEStiYYINVHOILPNrII
VPViSLVFPAIOIANISCGGLTFLGLGEESSASWQitJOtOCVIGF


ALFAFISKJ1ECLI0JLVNENLFLHSdtTFDSHPLIAESKAd~OfDLt.AfAKRA~SSS11NIFINPAESAVLWPPAII
L'lldLLIAIALIGDCVRDALDPR1.QOS


GE90CCKEVLSFPINNNSPRANNPIfIKVNCAAIPETLLESELFCHEKl8IFTGATTKKAGR


FELAHKGTLLI~EITCVPUNLOAKLhRAIpEKEIEHt~OGTKTLSVDVRILATSNRKLKGCPn_0598 68971?
699719


tODKSFRpDLYYRWVIPLHLPPLRDRpDDILPtrINYFLMtFCI~KtPLKTLSPKADELopp8-Oli9opeptide
Peszaease


LLNYPWPGNIRELSNVLERWILEH1'SLLTEDMIJ1WEEOCSVLKYILJfRL\'LlPLTLFAIVSINFVIWAAPCDVLE
EKSRDAIGEAGKSDKNRSY


KGPDRYLQFRENYGLTLPIFFNTRPKITHKKIOTALOELANANIII'1'PSAKNAAKSLVYWC


CPn_0587 677779 678111
DCAKFiMPALLFE~DASRDDK'IRHIIIADLFIRCGVLpGFVCPNLSPI4pMQNICEIAESN


yvyD_es conaecvad hypothetical
proteinAFWROWEEDLDTKVEALKCYII~DNGCTEVFCYSSKDFYiKTFFLETRFARYNSRVLIILD


SYCELFILSTLLKHHVTLGDKNRPHRKfIVSSKSL71L!(pSAS'l'HVEITTK.1P'RLSNPLKDLFCTLRHDJ1HKT
VISEVIKRLRCSLVLSILPNIVCFVI.CQIFCNINALKRNRNIDHSLNFI


ILEKSDHLPPNETIRWLTSNKWfLCTEVHVVASHGKEILQTKVNNANPYTAVINAFKKIFLILFSIL>lfAfAVFNILD
NNIlRd'IPFTTIPHPYSCLRSPPEVPNEL.S'TIJCRIFDLVSH


RTNANKHSNI(RK~tTKfIDt.Ct.AAKEERIAIQEEOmRLSNEWLPVEGLD~AWDSLIfTLGYVGFLPFCAVSYGAt.
IM7SRLSRSIFLEVLSpDFICMNARGLRWFDILYKNVCNHAAVBIV


PASAKKICISKKKMSIRlQ.SGDGIRDLESAAFtiFLIfLNEOEHKIQCIYKKNOCNYVLIETSLASSLG1'LI.GCat
.YVETLFIIIDCFQiffYOAIt.NRDIiNVIILFSVLVGSAL.iLIICYLLG


PSLKFGFCI ~ DICYVLLDPRVDLECRRI


CPn_OS9R 679033 67866 CPn_0599 691927 .89682


CTa64 flypothecieel protein oppA~IiqopepttJe Hindinq Lipoprotein


TSKSIKSNAPIIWfI'ATHSLLNLPSSQDSA.iEDSTSDSpIFDPIRNRELVSTPEEKVRpRKRRES'tlfiMYKRCIf
LCKILKCt'I:.:uLILLYWSBDLLERDI1L:IKl?IVRDtQ!EDLREtSRV


LI::FWHKLNYPKKLIIIEKELKTLFPLLHRKCfLIPKRRPDILIITPFTY1'GW(,MtTNNVKD(JVI:rpAIPAAt\
:VMLAPKL'/ItDEaFALLFI:OPSYPNLI:
LDPYKppTLPELIGTNFH


LCDPKPLLLIECKAIJ1VNQNALKpLLCYNIfCIt,,aTCtAMACKHSQVSALFNPKTO'LLDFYPfK:ILRTAIIVt:
K('ENL~PFNCFG'IVIIGF'IOLv:IP.~.IJI::PfIVr,KYEEP-~.PDLAVKIEENLV


(Y:LPE'ISQLLNYFL~.WL RDC:.'X:DKEFHIYLRFNVFSdRP
IuPKALFY.HV~1LDL11PDRfHIfM'AIIDIKFFYOAVIOiPW


ATNRAVALR:%'.YEI'IY::V;:VFJIGIJGLV'/PWKAfIT/TNEtT:KEERKVLY~.AFaNI'I:,IQPL


t'1'1 ~(ISHn r:79671 i79175
f'fiFV'ltjlFANf:EKIIEDEZIIt7TYPTN::I~IAr~NF711HY1J11NJ'f
IV!%Y:AYYFH:MpDBKLVF


rTA'/4 hypntlutu:I1 Dmcein
::RNI'Lf"IDPUALII'KRFV1'FYE.'TI!:'.LFMDFY.T:KIDf::It.PIMDRONFY~FIBCifiAYN


::::Hr)Ir~J'h:WLR::RPta:KNIITLTPLFTPDCLFTFFAKpW'fWr:DYRt:'LVGI::LCKYTK~VAYI:AVR
FTV::ADIiAYTI'I':rfilr:F::LFP.c:fr)VIH:AItrMIIIIHtFJtfIh~r:L0t~t7YT


IJIIIN':.~.ftl.1'Kt:PIK:DCWAFfJIIKy'!"IALLffA:Ii:KNIQJILI.A:X1NKEId':IIKLF:iLFW
FI::tah'A:::::P::YNKv'IECWlIY:atW\I1.Ll:Ia7.WID1'tl:ff:IHItKVIfIiVIVPFRFRL1,'.


IJIHIfC:::.NI'EFFAJ1IF'VLKLLpYD:ILDLTPAC::LCKII::f.PY4:Y1tY~1:11KL.CKKIIpiIKYYV
Y.:~ITAII'ffAM1'A'1'At'KEL':IIx.'::Ll!:IJIMDL:YjAI'fA:Ytlh'M1WJ1%e'.II:IPPED


~L\C:If.KEEEUILt7AIINAKaF"ELLALAEFPfAfAEKIFYLt'D::WEEKK::ERN.~..~.F.DFPRAILaI:F
)t:ANfiM:'..WWv:F11t1EFAUY!IfH~I:;/h:'IDI.Kh:ffIRLYIIRFIIh:fIIIRFJ11'YA


'II IF.ILFI::K WItIW h'L1 ::I!IIC:iLLYKih1'KN Ih'VTTIIITI
IL f t hi4.'irl~.fYtNfrWII.ItKKhl7it'I_:1~.:


~'1'n 11~'UI r.H1111r: ti7'n;lri n:ltyt~1111 r.~; t~,.. .n1:1'.'./
'


.-t'A N.. rr.tnr::l. Ir.wrl.rrl Im.:..r~r
II Irylru lure i.vl Prnrrin rr. :..rn.r-rnk!I:MnL .u: ..1 11
' I!'IH


I00


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923



KK:FJYSIICOAKRFONttLPNIIFDIx:LOF. "RPPNLK:iPY,~iLSDLLlfIEL
1LDKAK~fPAEI'LGIi.R.IEIIf:::JLLi.AFR'.":CKL.LS
..'JLKODRLAYGELIILL.~aKY00KT R IHPGFDCIYIAt~it:RIiv:RDFNL4;N
L f : PCEIIG I:.LRKAFAL.iEK
SIf


Nf:YHKI tO
F3SLLKEETr.'.~.LNPAKOHLL'IK t LRDFtrfMDFILR.iLGL1JG111CETY11KALPKOVO
IPHSPCL FL9YFL3ADYSOIGJt11L9U~pR;8.~8~E~'EC1NF
A:70!(~ . A
~~1~


R~IR~T
K111NICTVYr70WLF~AKVItKPStCENCEC'..A'tFSRiAHA"E


':Pn_OSOt ~.?7073 ';727)5
HIGRFRIIOS~INEFPCRFAVN1'RIOT:SAAELIKLAIFLDISOAIKOQa0t5AfQ:.


CT493 hypochotical protein
OIHDELLFE11PEEEILFMI:RLVREKMESAKf:..iJPIWN:LKtBGIEC
?(DEITPN'fPL4RODSLWtiR'IRVSWRADL.S11SSRYEIASAIAIL:LLY
'
FPRINADDLIN


.
a c~ nr; l 'IOS~d2 ~;t.t:5lt
O
AFCASAAVS I IFTANPi.AQIIF I DrCLJIIfiLL2I
PLY IvLLI IG I IVL:.YGIYLFPOORE


~D~. ~''~tt~ : ~il~.~ w:'ti :' ~.':~:~ :. .-...,. .. .
,. . ;..: :,.:: ,...._:,....
:c~~;...:.._ I..\_,;..-,Y,;,.
'VNh'f:
'


-_ .
': vy;,ll:.::::... :.:r..~:. .
KTAPLIAVItHKDVI?u:KM'AK?tvN::.t:'Jt'KnIYLKDHYKvIV10NO1:PCuiVFEIDR


D.iGFIiKPIGFOENLEALCNKTStIOLLKYLLKGILfVC'GASLLIALEFSFPLYFFLFSGKTRFWILCRItGFPI'f
~IVNCi.CAS~YY'J:CAi1?KIYA?SSSLICSICVASGPFIlNK
LYSIQ


VIPAPCLACFFLTLFVCLVTRLYLLSCIIGDFFE~.ASEYLOGAVPPtOCRSOttIVEiQSHL.
OCLNRYGYESDLL:Ar.%KDIGPl4JPYTPWfSHDREEROATLDFLYGOFItDIYIO~LPii.TK


AAAI11'KISINLONOEYSLLSEIFKFLPKHDLIRKFSCFCFWILDYFGFRECLLOKAIiJLYIEKLVIIfIICIIRIF
SPEKAKOELYIITI\"GATKEQVL.COIVaYCKIC~IYAVICSOIa~~RfR
'


F1LT71R VASAMSSPLVfCIIIKHDILPLSHDAAYIPPYIJ1L
KWpAIpVDLSAHVSLAOAYVALSGLYADPRKYPEFDANYWIPSGRYS7lEI0Gtt
'


'IL
RAIECTOIINEYAPCNAI~MJ10LAYSYHDLOhIPHEEIOEYEIVLKLKPftWl1115KiL


YlppOt4AKGIRIYLEIKKRDYKKSOKLIKFYf.IItYIfYCPeL.0611 707175 705793


CPn_060) 691136 695185 adc-ADPJATP Translot:ase
PIYKSEFSKPItPLFtiJIFFItCFNYCLLKNOID
LAAYLC
VFIAHK1R3KtP1p58EYKPPSA


hweZFerroeheealase _
71NFOGPRHAKDI4EFLISLLT~tDVICTF _
TPAYLL M
y
~
YS
I


. YYVHSB~S VPL.IffG.
WICIMtLIVt3IpCLVSLFLAKKYM I
t.PRVLNRHLFfFtA~(RVPKYLPpYOSLQtiWSPIYFO?ETL.71KTLSEILPAPVIPfIdtYLELLPOGLRGFIVMI
IIYWS
I
PYC!~SLIILNSL11DKL.Q
CLiiNOITTI'!FaGRFYALINTGLMSSICAGEISYWIIfIKOTFVAYSFACD~IitSVIIIJILT


PSTIIEKTLLALRTLHTRHYICIPLFPHP1'YSVTGSIVRFFT00MEIPISWIPOFCSDSKNLITCSGLIHIWI'YARI
HHLTIDTSIPPSAAW1EDC'dATANLKLOUIPKAKARHLPLJLL


FVSLITCHIRDFLOKLCILEKECCFLFSVf~LPVRYI50GDPYSKQCYESFS1lI11TlFKQIOSRYLi~GL7IIIYLS
YIfLYIRLFMIWKD0YS0IYSSiIVEFNCYNSA.?TLIGV1ISVL117L
'


YLPL .
VLL1COCIRId~CALYTP1.YHLVSGLLFFGTIFAA1WDISIFGGYi.~ffPL.71L~W1'
S~IFLCFOSKFGPGKwiSPSTAOLCQNIDTOKPNVIWPFCFISDNLITLYEIERD


LRSRGYRALRIPAIYSSPLWVSfLVDIVIfEN51'WAEELIRSGI0011GIROtllbNyLgRV"TKFfFFDQI'K84AP
IPLSPEDKNIIGKAAIL1GVVSRICKSOaLlYOGLiN


IFSSVAASiIWIJILVGLIIMIVWLAWAYIGKEYYSRAAOAVATLKOPKCPSSSIVREAO


CPn_0601 695981 695196


EIiY-Glutanline Bindia0 Procsm
NSEil0I14VKIKFSW1IVNFLICLIJ1VGLIFFCCSRYKREVLVGRDffIWP'P1LOFGIY9 707631
CK101 7
8


0 0
TSaLNAPLNDLVSEINYitENLHINIVNODWVHLFFNLDORICIOGIIFTSVLPlLH~.~tYQ14
IVPIA CPIL0615
pQsA-Glycerol-3-P Phosphatadylcransterase


FSppILL'IGpVLWAODSPYOSIEDLKGRLIGVYAFDSSVLWIOI~tIPDAViSLYOfLAKIIOtQFCNIZSLSRWLAL
Y!'CQEI~HIRLLRIVGAI4.SDIFLDCYi.iIARYIUfISRLGS
6DFN


r_5...'TTSNCYWLiaPVTLYPJ1LIET11YKGRLKIISKPIlIAOCLRLiIILKOTRGDLLiLDPITDINIVPVCIT
OLYIIECSIStIWLFFICARDLFLIItV~CYGSLVIOf,T~11~Y0YGSL


ACLVK1'RRSGKYDAIKOAYRLP
lWCitIF'rVVOFIILLLYfAOCEIPW1~GLVPLVAIrcrFLYFLERIlIflYIO~LA


CPn.,0605 696777 696150
CPeI_0616 708701 710137


yhhF-Nethylue dna8-Aepllcative ON71 Hslieaie
LRKLCSSRGOVRILrGKYKGKSt.K'fFSNPHIRPTSCLtnCFaFFSiCAEDIDGAAFLOLFATGVHYLMJU1NOLYCE
DFYYLEH
CIQ
'


~IIIGFE7lLSRGAJ1SWFVDISIAAIOLIHTNSALIGEOLPWIFRODAOSAIQRLIKO.
GVPLPSPPHSKESEHIVI.C
TLTNYESSLIJO)KS1
iIBFJIG'fA7IYLiEYVDZ
KRIDiOL'fVIGGPSYLITi
IDVNi
RGEtL


KRSFDLIYIIfPPYELCt~ICYV1'ttOKIVSGNILNPEGTLFLF3JASDEEIACEGLTLRRRR.
.
.
KIIFRVLODAFKOIRCP
IRS1CRILRRHISTAKEIEKAALEOPKNVJLEIILDEJVONSFFKISt75TSYSQYTLVAtSCi~


KLGK1'YLAEYIVP
LTTTfDKPYLVOIQEROELFL~OtLIpGDNIISFfTGIPTHFIDLDOLI11CFSP9NWILMR


PAIKtKl'ALA~IIAlI4UCFOHALPIGIFSLQlfVDQLIHRMICSRSIIFDSK1LISTOOLBDH


CPn_0606 69749? 696707
DFORIVSVIlIEt40EtlLLLIDOOPCLKVSDLRARUtRl9tESYDIOFLIIDYLOLi.Sri80'fI'


CT188 hypothetical Drouin
RATFSROTEISEISRM.%TL7IREItIIPIICLSOLSRKVEDMIWRPlIIiDLRESG8IR10D
1IGVPEKTNEVFGDPWIGYNOKICSEWOAWHP
tOiIYCLADLIL


!
SDLVM!'LLRREY7IDPNDKPGTAELIIAlOiNICSIGSVPLVFEKEGIAPRNYBJIF~IS
SSYSRItOLRFYLGSLO
EDIVLLPGDISWAIB~iLSEANKDFAFICDLPOtKYHIRGaRiOYWSSASTSItITAALPPSLY


YLNp~'71LLTPHL71WGVRLWDSPTICVKKJQJFLTPSTOEOSYTEQDEKIFLRELGRLKR


AFAALPXEVTEVIVKrNYPPISSDGTPGPISEFLGDGRVSLCLtGHIHKVORPIDGIGII,


IAGIHYILVAADYVNFVPQEVN


CPII_0607 698910 697577
010C-Glucolrl-P lldwyltrantEstafe
NRAIOtIIflIMPEASNFFSSHPYRDlLVCVIILCGCEGIfRLSPLTItCRCKPfVSFGGRItIa.
IDIPISIGISaGFSItIFVICQYLTYTL00HLFK1'YFYf90VL.ODOIHLLAPEAR0000I41Y
QGTADiIIAIDC.t.YF~DTEIEYFLILSGDOLY68mFASIVOTAIATHV~IVL.VAOPIPEKD
AYpIGVLDIDS~R.IDFY»PQCKLVLIIRFOLSSEDRRIIIKL?~oSGDFLC~~ICIYLFR
RDSLFSLLREEEGNDI~sKtiLI01lp10CROQVO'fLLYNGriIADIG'1'IESYYEIW IALTOKPH
ACIOIGLNC7fDDtxIHIYSKNHHLPGAIITDSNISSSLLCEGLItINCSHV8R5VIGIRSKIG
ERSWDOSIIIlGN7IIlYGSPStiPSLGIGKDCEI1DIAIIDF34CCICSiGVKt.~ILKGYIKYOS
PDKKLFVRONIIIVPOGTNIPDNYIF
0608 699690 699016 CPeL0618 71Z)00 713010
CPn


_ lplA-Lipoace-Protein Lipase A
Oridine 5'-NOnophospifate
SynthaseKNHPfCNCIFLDLPGIISILHOLOIEFJ1LLRVANONFCIINSGJ11(DSIVLCISAIA?10WH
itlmp Synchssel-truncated?
'


PLYVOMtLV
ISRJIOADItIPIIRRYSOGGIVFIDSM'IJFVSWIt4JSSE71SA0P0ELL.AWrYGIYSPLLPN
VSPLYFVIDtGRRLWPll49YEDAKLRGQAV11ILYQICaIKFGIWIL7l5GEE3


ISSPEVI41VATLIWRLRPSFNSSLLGGVPYT111.'fL7vTSISLKYNIPNVLRRKfit~tiVOPTFSIRErmYVIGH
K1II0CNAQYIORHRWVHH'NfFLWOIDLDItiSYYLPIP000PTYRNOR


SDAIKVEGLFTPCQIrLVLIJO14VSSGKSIIETAVALEENGLWRFJILVFLORRItEiICOPLSNEEFLTTLRPWFPS
1LDDFLFRIKASGSLLFTWEEFLDftELEEILAOPHRK11TTVW


GPQCIKVSSVFTVPTLIKAGIAYCKLSSGOLTLANKISEILEIES


0609' 699672 699986 CPtL0619 713162 713013
CPn


_ ndk-Nucleoside-I-P Kinese
CT190 hypothetical Drotein
RRYVYThtEOTLSIIKPDSVSKAHICEILSIFE05CLRIAAMKM0iL50TFJ1ECFYFVNRE


ONTKNSLIRFMILIRLFLGISLPKCFPLYLEPPLVLATFOCTOFVGTYSEATNPLYIDNLRPFFOELVDt7tVSOPWVL
VLEGANAVSRNREtI'1GATNPAEJ1ASGTLPAKFGGSIGVtMV


NLNYNYTOELLYKAVPCNYKSIYREIPLIIFPEVLIGSTPTOSTEHGSOZ'LFiJAAVEIAYFFSKIEVVNASKPLV


CPn_0610 '01150 7000:9
0620 711115 717519
CPn


rho-Transcription termination Factor_
RLFLrtFKGSIHKCERSSEILPRVKETKKHAYVSMOEKSCVGECAWASESEEAESVTVTKruvA-HOlliday
Junction Nslicaee


IAKLORNCIEELNIIJ1RCYCVNNIGSLTKSaWFEIVKAttSERPDELLICECVLEYLPDCDKMYDYIRGTLTWHTGaI
VIECOCIC'MLAITERWAIECIRALNpDFLVETIIVIFRCIE


eCFLRSP1'YNYLBSAEDIYVSPAOIRRFDLKKGtn'IIG1'LRSPKEKEKYFALGKVDKINCHL.LYCFHSREERECP
RILISFSCICPKLALAIWALPLKVLCSWRSEDIRALASVSCIG


:iTPdWfERVLFENLTPLYPNQRIVl484CKDHIr\ERVLDLTAPIGKGORGLIVAPPRSOKKKTAEKLtiVELKOKLP
DLLFLDSRVITSOTKITSSCLEEGIOALrLILGYSKIAAENIiAE


'!YILOSIAHAIAVNNPDIVLIVLLIDERPEEVTDNIROVRGEWASTFDEOPERHIOVAEAIKDLPEGSSLTDILPIAL
KKNFSCVNKD


KMRLVEHCNDNVLLLDSITRLARAYNCVOPNSCKILTGGVDASALHKPKRFFCJW
MVIF


. CPn_0621 71x707 7111.14
RNIECCGSLTILATlILIDTCSRHDEVIFEEFKC'ICNMELVLORRLSDRRTYPAIDLIKSG


LYNPSELERVYLFROAIJ1DL'1'fiDAIWLLLGRLKKTNSNAEFLLSLKEruvr:-Crossover Junction
Endonuclaase
TRKEEL


.
L:iRWSSFKDNKFKYF0E31VSELIIGVDPC'fIVACYALIAVEQRYOLRPYSYGAIRLSS


701
t7NPLPNRYKTLFEOtSCVLDDTOPNAIFVLE'K~FVNKNPOSfMKWDIRGIVLtJIAIIpRDI
173 7011:0
'
'


.
LIFE'tAPNVAKKAWC.KGtLi:iKROIpVMVSKILFNPE~/LNPSNEDIADAFALAICNTNV71
_t
n_0611


yacE-predicted phosphatass/kinass R~aPtr<:CYR
V


F
RtfNRRDAKTSEREOGISYDFIRSYSCEYLNWICKLGN4Ll(LLKVSITCDLSSGKTE71CO


.AYWwADEISHSFLIPHTRIGRRVIDLLGSDVWOCAFDAQAIMKVFYNSVLLOC
JEII' '


. 1':1
LEAttJIPEVCRILEfQYHOSIODCNYPLFVAEVPLLYEIHYAKWFOSVLLVHANfDIRRECfm 01.72 '15761
714


~.EDFOORuRFU~VEEKt.AQADVWENNGTKKELHOKIEEYFYALKCALv:T'.Ui hyp.~cnACi.:.O
pcotein
RFHYKTCR3 "


. .PwKDADINP110O LCNIfSCV
NY:: JR t.t.:: i LKLHLF::I.H: aS..4a'lIY
YH: d.'::R:.'F1LIILId


:fn II.lJ ~vl.l6RH ~JV_U_=
:.':FH:Y:Y.1:J::IIJY.E~fC\~a'II~~EHERIIIIJ,IYRF.~~L.::ALEEEIRRREEA10i00L.EKL.OQ
QPf
'
'


ImIA-UNA ll.lyaa':.Iz;u I YFJIKIKOLE~LORYVS
fEI:E.RPtt.
wtlJllf!1lF:KhrJtRlk.'::~II~EIY.KELUJ::VSH


H.:IIff::LIL~fVERFRREfNBCKLFVLUA::.:FIFIL1'fFALPENKMIpGOATrJAVFGFitI~>LnIR:AI".
:(t:LEEfAI:::N\.\1'\I:IFIPLKK::LIDL.yEY.DIYIKTY11::FIAKLHEKL.OROICAO
'


NKLIKEF.~.F1=fNI::VFDv:hNHK0::R0AIYADYK3Nk!JKKFEDIPWIALVKt:IC:SLICIr\HANP.~.Flf
KLDHWI
'f::::h'/t.'::fEKLifF:VLYfDI\I:KY.YAI\t.tlJUfI:UJ'fWJl.i!Gf.IIKE:Kt


YLE7!E::VFAUInIIA:;IAKKWEFtJYKVIIU'1'ADKOLWLVNDHWAWNFWAIY~~WC'aI::E~:LIJ:YI:IF_
:FI;tNV1::Ii;:K:aii::


V 1 Eh'n: l Pfr xi I fDYf.ALVt:D::.~.DN
I FGLKX:~PKKAAAIdJIOF'f:.~>ltEt%LLENLIN1VKGL'


.:l!fNl-:EROt~fLKL::KH11L1.D::NIPIPI'FIESLTFPQFIPVDEEKLIIIFYI40t:FKTGVPI ~nl l
II..I..~
'.'fhry.: s


::KrJfUATVt7VrJIINUA1::'.L'fNiLNLV~\:::DI1FAVAYTr:NIIW..~.LKLEGIlvLTOC:.(:VF~T'
.n4 INlr,rtr.t i.:.n! L~.....,n
'


t:EE7ta'KILIILKLWILHCDI:fFYufNLKRDCIIALLJ~W:I'JIREI.':YGLAL.AEHLTNFFRNI'D
FIII
IY!rftfl'rlF"MeUINI~IYt'I~:vF;YY.l:7/I:FIrKIIN::rJnfFt4V1:1W
:VI::IJ111r


.
IC:Yf'Yi:Jt'l4:fflF!MI'.l'RIfAtltt.Y.A'/!:LINa:VKIIVY:I!FJvLIKUfY..'."f1'LINIOE
KPLrI
t::iYJ::IJ.'lNII:FTFTAIIRFAKEIa:N:x:LfLGt!LPFJJt'F.OYFY:EFVA'ILPIIKOAIL'
XY:Y


. .h::RKISKKF::\ItFh::l
Yr:F:Y.WKNKKYI..':1't!FIIIIKf.IAFYIYtA:YJKILtffVK
r:l: ttINYFH1111 t L::U 1 f?fl'LF:KVLF::HEH4::1KhY:I'F:f f:
l.:VfLOVEfIJI I I.I S\I.FETEWVLTEEIYOt.:


101


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
FEWEE::pFIIEIVEOKKF:LLPPPAKLI:rEYINC.r~l't.:I'7 Ribasrx4rl h stn
JPWTSJIDWfrLOALVRESSDL HKKEKVK".sMA'3EP1
t.RKVKI;WVSAKNEKTWVNVERIF:iHP~YLKV'lR3.iKKYYA!(:'


WALL::AGUAtHFPETEEEPT.3Jl.r'FE&i.SANFFPETSSATEEEELKVSEGDxvKIG4'i'll~ItI:KAIINV,
~CVt/Sf;.;


CPn_Ob24 7I8D19 717011
Ob33 725979 725743
CPn


~ap~-.IY~er~idehYdefP Dehyroqenase_
AMKWTNCFGRt:RLVLROIGIRNSSV~LAINDLVPGDJ1LTYLFKFOSTHGRPPEOVACr1.?-L2? RaDOSOmaI
Protein
ASGKGIMIAAKKI:L:.TOLRCFaDDDL~w\YVHENKKALFALRAENL~.(~IJKWKVIMFSI11K


EAI7HLIW':KRKIGFI-
iERNVONLPWKDLCVDLVIFaCTCiLFTKKEDAALhIQAGAKRYLISKNIARALTIKOEPYr%KYH~
'


EGtifITUHA
nrvllr:rtl"f!~'.frPINHYTFtIPP:ICDI~IL~ItA.Sf.'rl'4t.'trIPIAKVLLIBIF'::".


"'Y:4l:N7a-=: ::i~\:::'::.:./.\':.':..::.Fi:LF':Y;.'."~CAFRVC:::... . ,
v ' . ... . , .
'r


...~L~:::.:.~'.~L'E:'.~.'..'.:~I':'::,.u_:.:FC:.:.:.:.-
. .::._:,Z:.L.''.;.IY:.L:aYvtrY~ rllo-Lte RtDOSOMaI Protean


IAtI~7DRFFKLVAWYONEZCYATRIVDLLEYVEKNSKI
IIINIPKATKFRKOGKGQFRGLSKGaTFIIDFGI:YANOTL.EP(~IVI'SROIE71CRVAIIIIYL


KAI!<SKVWIAIFPDKS':KKPAETAMCKCKCAPDMWVAYVRPGRILF1YANVSK~t.


CPn_0625 718188 718060 AAAAAIG1CIKTAPIKAVER


r117-L17 Ribosomal Protein
AAHAOp4ITW~S


vtpNARKKPAVCRTSSIWRC71WJl4.KSLIIIYERILTfLPKAKEL727D92 726409
IIAFVERI(~1~' CPCL0611


LAARRIAICNiHVRYIKQ.TSKEARQAItGCDI'SVYNVDRLWNKL.FDILGrs3-S3 Ribosomal Protein


RILKIQNRIGONAOKCIIEFL71S
KGRRIIICOt(QCPICFR'IGVTIUtWRSLWItGNKDEFGKFLIEDVAIAOFLAIOCPSCOCANCP


WPAILSGKZEY1'IG'IJIAPOLYIGKIOCIIF.YDLLKBLLAALiGKEI7IiLEIJIEI1WGJ41KL


CPn_0626 719670 718495
VJUaJIAAOIERAYSFRIW4KXANOSVlmAG71VCV1II0VSGRLi1G71CIARSdYf~AVPL


rpoA-RNA Polytsereee Alpha
HTLAADII1YATACJ1E'!'!'YCIIGIKVWII~GiSSSITPt'84PAAPSAAA
WLGKEKCaISDNAIO~iLLYDKFELPEAV1QQ.WlxLPIDKHAAFIAEPLER


wLPAKK%AQS
CNGHTL)GNALPAALLIGLFJ1PAIIS!'AM'GVLHEYNAIEGVI~ILHLKGAL.LIIKY72711D 727096


PNQO&SLGAT'CQVLfUISISIDJISOt.AAANCQKM'LDaLi.ODCOFCAVNPDOVIF'M'OPCPn.-0642
r122-L22 Ribosomal Protein


IOLh1)vLAIAFGRCYTPSEAIYLEDIaCVICEIVLOAAFSPVTLVNYFVCDTRVGODTDFDRAAHSIVKATI1AYIRV
OPRKARLAAGLIOtNLSYOEAEEpLGFSOLKAGACLKXViliSAYIW


LVLiVffDCAVTPKEIVLA!'S'IQILTKHPSIFFi~I~EKKIVFEFJ1ISIEKC4KDDILNKLISVTEVAVDAGPVYK
RSKSKSRGfiRSPILKICTSHLTVIY00fm
. A~1IAREM


4:INEIELSVRSTNCLSN7INIlTIt;FININPEPRLLOFRNFGKKSLCEIKNKLKElDQ.EL.


G~.TOFCVCLONVICEKt9IWYAEKIM10r1'IOGCPCI,.0643 727725 727450


CPn_0627 720059 719640 rsl9-519 Ribosomal Protein
EIRDICRSLRKGPFVDHNLLRKVRAIB'IIEEKKTpIIrIWSRASNITPOtIGIn'FM~IDI


rail-511 Ribosomal Protein ItLTVPVSEITNGICKIGEFSPTRIFKSNPVI~
AQAKIISVIIRICOLIC~tIPSOWICVKATFIB'TfIVSITDPACNVI9WASAGK
VLVIOJ
A


FLI
SR CPt7_0641 728594 727722
O
VCYSGSAKSSAP1U1TVAAOOAAKTJIIO~ISGLKFVE11CLWGTCAGRtSIIYRALI&71GLWSY


IRDETPVPtBiCCRPAKRARV rl2-L2 Aibt>aaaal Protein


CPr~0628 720461 720063
FIREIN&QR(!'KIrV'i'POI'RDLYLPwiDEL'1TRGELRG?'RSKRSLRPMtKLBFTIOtSSOG
RiII~IISCRHROOGAIOOLYRWDF1W'BIDGITAKWrVEYDPMISAYIALL8Y8D~R


csl3-513 Ribosomal Protein
YILAP>OGIOAGOVYVSGl7GSPFKPC7CGK1'LKSIPCGLSVfOIIENRPSSOGKf.VR8A0LM
IltY1'ILREAQRNPRIICIDIPAKK1G.KISLTYIYGIGSJ1RSDEIIIOQJILOPEIiRASELT'


EEEVGRLNSLLOSIYIYOGI)LRRRVGSDIKALIAIHSYRGQAiIRLSLPVRCQRTKTNSRTiT.KItPSGEFRt4.ti
DGCRATIG
OVIAIt$PGYV


RKGIDtKTVAGKKX TJ148~IPVDNP110GCE(ZAH14Ci1fIPICT


0629 721881 720487 CPn_0645 728933 728598
CPIt


_ r137-L21 Ribosaeal Proteia
sect-Transloease
OM~IfOYIKRHYVTCKAKl4.EHLSA~1'Cflfr~fl~CS!'CILDPKTVFIV51~11?I~LIaOAL
KIRLL~'RPYKI'1'LROFFLITELRQKLFYTFALLTACAVGVfIPVPGINGELAVAYfICQLLC'
~'


SCONLFOLilDIF5t~71FA0NTVIJIILiWPYIS11SIIVOLFLVIfIPALOAty'OItSSDOGKRf1~f0018VG
IOfAIV
EAIYVDKNVKVKSVNfTrA7KPOPAPMFAGRPt~ATSGI


RIGRLTALFTVALiaVIOSLLFAt~ALAINLTIPGIVLPTLLSSKLFGVPwIFIfI'1TVV1M
0646 72906 728950
CPl1


TfGTLLL36(IGEpISpIOGIpJGISLIIAI,GILSSFpSVLCSIVIIIGiaCSODSSO~.IS-
r11-tA Ribosanal Psotsin


ILILALVPVFVLITTILIIECVRKIPVOYARRVIGRRbVPGGGSYLPl.KW1(~P~FyAiDIJNLLSK1DFSCNKIGEV
EVADSLPAD~OCLOLIKDYIVAIAN11010ti8AC'!fit
ASSLLNFPATICOFIAS&SYl9tRIAALLAPGSLVYSICYVLLIIFF'lYlwi'ATOFHPEOIALL
'


IASEI~fOQtAFIPCIROGKPTOIIYLEY'llAfIRYCL1LGALFLAAIAILPSLd.CCLLRVDStJVRGGGIVFGPII
PKFM7NVRINAKEAIIM
SEYBNSTAKPPKOIOGTGIIAROGCLiISPO!
A
SJILIIFIJIDCNVOCRSILFIDNLO11V0~a
LTAPkT
'


Lt?OtRYDSVLiITOATIOCIW .
GWLDT p
4 Li101II0
INKLTPVD~~DR


1
ISLANLTAVIOCFVYCININCYDLASAIaIIVISpfAL.OELYERLVfiTIID
CWOJIP
SYFLOGTAIT<.IW


CPei_0630 722316 721885
0617 730190 729657
CPt1


r115-L15 Ribosomal Prouin -
RRFGYE0I1GVPLYR r13-L3 Ribosa~l Protein


NIKLESLFDISERIWAKIQ.LGRGPSStaiGKTSC~IKGOGSRSGYKYLEYPSYCIC4L.PPLITCPFIFLA~FLFFLt
?1SISKILSRFVSLTf.OBEBIfSLIi310Kf11
RVPTRGFSHKRFDKCIfEEITTCRLAELF0E7GGITLOALKAKKAIAAOAVRVKVILIIGOL'


XESOOYPSLOIGAEOIIAP
RSHISVIGKK>DQ4IHIFDKOCSLVACSYIRVEPNVYf0IR1


EKTIVNOCiAWiSGCVONLLGIT ~ZTK~


CI810GICGFOGtMKKFG1~GPGSHGSG!'1(RNAGBIGIBtSTPGRCPPGSKAPS1o83i1~M'


CPn_0631 722812 722712 VIO'E.EVIKVtM.tKKVLLVKGAIPGAItGSIVIVKfISSRT


ray-SS Ribosomal Protein


~15GSKNSHKEOOLEEIfVLWNRCSRRFSFSALILVGDCKGAI~SYGPAKANEL
L 0618 731636 770605
CPn


TDAIAKOCEAAxtWtI9CIEALEOGSIPHEVLVHHOGAOt.LLKPAKPGIt3IYAGSRIRLI_
CTS29 hypothetical D~tein '


eHAGiKDIVAKSFGSNNPI4NQV1W1FKALTt~.SPRImLLARGAAINDFFFIGIPCXEVIOtATNJIIASAGBAASi0
0.LPVAXEPMVSSFJ1QKGIYCI00!lTliP'GNXL


0672 727354 722827
AK!'00J1TKSL00(CFKLSKAVSDCWCSLEF011LTSAMIApOIa.KiTAEWAW~1V
CPn


_
ItIGtIVPSfVNSIbRCY0YTA0AFEUSKTKERKTPCEYSR~.LTRODYLWvBAGCtA
r118-L18 Ribosomal Protein '


KCLISSWLVNLLOVFAPNVLLNLIKVREFVMCMaISWKLVKLRIf0Al0iRSRVMESSLCKItif11J1GVAGAVOGIA
L
G71TTYSATFGVLRPLG.INKLTAKPFLOKATVGIIFGTAVAGIItI


KSL40(RRAALRVRKVLKGSP'fKPRLSWKTNKHIYVOLIDDSIG%TLASVSI'LSKLtJICSOEOKLFKJWCESLYNE
RCALCJOOSOL9GDVILSAERALRKEtIVATLKAHVLTi.L>GGt.EI.


CLTKKNOEVAKVLGl'OIAEIGHIrt.OLDAWFDRGPPKYNCIVSMIADGAAEDGLOFWDG11KLIPLPITVACSAiII
SGaLTAASAGIGLYSIWOKTKSGK


CPn_0633 727760 723209 CPrr_0649 772672 731710
!mc-Nethionyl cRNA Poa'myicransferase


rl6-Lb Ribosomal Protein
IJOLKVVYFCTPtFrIITVL00LLHHKIOITAW1'RVDKPOKA8AOLIPSPVKTIALTIIGLP
SHSRKAREPILLPOGVtVSIGODKIIVKGP1CCSLTOKSVKEVEITLKDNSIFVHAAPNVV


ORPSCHOCLYWALISNMVpCVHLGFEKRLFI4ICVGFAASVQGAFLDLSIGVSHPTKIPIPLLOPSKASOPOFIEELRA
FNADVPIWAYGAILROIVLDIPRYGCYNLHAGLi.PXT~GM
'


STLQVSVEKNTLISVKGLDKGLVOEF)1ASIAAKRPPEPYKGKGIRYeHEYVRA1UVGKAAKSGEU1W1L11SpGiIIV
LIK
PIOAGINEGATESCNIVIAL'~11Gf4TI'GONANITRVPICPOIT!
TLQOIESGOLOLVSODMf.ATIAPKLSKELf~VPWD1IPAKFJ1YANIAC11TPAPOAKILFS


tGKK
FSEKAPKRllCIRKdSLLAEAGRYGJ1PGTVVYI'DROELAIACSEGAICLHEYOV~KGSTN


CPn_0~74 724215 723787 SILiPIlJGYPI1KKLICIVf'CLNN


rsH-aP Ribosomal Protein
E3SIKRKAIYMCKCSDSTAOLLTRIANAI~IAENLYVDVEHSKNREAIVKILKHKOFVAHYCPrr_0650 777517
7326b5


LVKEEt?iRKAANAVPLOYSDDRKPVIHQLicRVSKPSRAV'NSAAKIPYVFCM4CISVLSTSlpxA-ACyI-
Carrier UOP-~lcNAc 0-ACylcransEerase


rX:VtIECSLARSIDIiCGELLCLVW SRRN4ASIHPTAI IEPGAKIGKOW IEPYW
IKATVTLCI7NV WKSYAYIIXNIITIQOC!


TIWPSANICNIfPGOLICYOCEKTYVCIClTICEIAEFAI
ITSSTFECI'1YSIC~IiCLINPWA


D675 724763 724206
HVAtINCI'ICIRiVVLSNNAQLACHVQVCDYAILOGIIVGVNOFVRIGAHAN~CALSGIMW
CPn


_
PPY'I'IGSGNPYGI~.aGtNKVOLORROVPFATRLALIKAPKKIYRADGCFFESLEITLCEYC
rl5-LS Ribosomal Protein


CERKANNSRLKKFYTEEIRKSLFEKFGYANKIpIPVLKKIVLSHCIrIEAAICD10JLF0AHLOIPMMFLEFCC3PSKR
CIERSIOKGJ1LEEESAWfEL~ILIES


ECLTNISCQKPLVTKARNSIACFKLRF.CpCIGAKVTLRCIRNYDIMDRFCNIVSPRIRDF
ttb5! 733975 737517
CPn


R4F~NKCOCRCCYSVCLDDQQIFPEIIILDRVKRTOCLNIIWIJ'l'fApTDDlxTfLLEWCL_
C.aGZ.Nyrlstoyl\cyl Cacrier Oehy3racase


kFKKJIp
MJUPrIIIKLAELLCLLPtIRYPFLLVDKVLSYDIEAR5ITl4pKNV'fiNEPpFNfAIFPNAPI


<:Pn brit.: 775tOR 724750
Nf~f:VLLLEALAtsArh:VLiCLVLEIIDRNKRIALFI~IOKAF.FROAVRiCDVLTi.OMFSLt
'
~


rt2AL~1 Ribrartm,sl Prntr.in !'t:QI.VTFrIEL.iFALVDKFw
t
:~Kr7r:IfAWAGAR1


FY, t:KEIMKKVN I RVC~KVF I L.ACNOKtiKECKVG.LTEDKW
VEC:VNVR t KtJ I KR'~(FK ~Pn:N:S~ 7t.lNqy 7)799D


:YkI;:IFIvt'ItII:,NffRL?fAf:EPAKt.:DVKVTEGGREWORRPU:TSVLYRLVRCKKG


Ipxr: Ptysrryt r:lcN.W nre.metyla5e


:In or: s7 ~_'u47 t 7'.'.5D'1s KRN::I
t'ft;O::L:;:l'1'NC.F.R'lltR'CLItREI/RYA:W:IHLl3K.~.STIJCLOPAQ'lNl~':I11FGR0.~.


t4 Hilrc:rrr..sl Irnr
A:x:lffEtiVPAId.IWVY'lT':R:'I'fL;Ar:::AVIA7Yt31LNAALRSNNtDIILIIOr::7t:EEtPI
.W
r114.1


.
c:Or:.:?1VFYIJ.ICtsAt:(t:ELsE00Y.V:.IARLTPP/YYOHQOIFLAAFP
. I~L.KL:YTW1YPQ
tt:ttrtWt.:VLKVACAdIt:AKKVNr'FKVta7r:.~.RRRYA'l~/f:lriltlft.':a'Rf7VEI?C:::LKKC
:DV


VIIDOKC:NINr.T1!ll'r:WARt:IirOkr:FIKL".:iL::::CIr.TVYK::LVINEE;:FRVl:IAf~.'RTFA
L'fIIELCFIlIEKGLI~:LIMfAWFKOtt:II
IYAV1VR'Mtltlll'PNKOt.:TI.KFIri'H:a'


. ::IrrYlIHFAPEf'VRItK I
LGLI.:OC.;:I.W:RPFVAIIVUIW::.r:NESTItAFCKKIi.EALhL
AthV t


wtn ~u. rH ~: ': f'7 t 'f~bA'ro rtn 7r.5 s f s...t..'r 7 t4H4sr


102


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
':


.-urE-Apnlvpopmtem NAeatYleranst.-".rseANFCVSLFEIOCLtCMLVAC-
....DKISIICxtR:PMNVLF~L:L:.FAItGNWF.'.RSNtKrMV
'
'
'


':EPVGRIFr'FVLiyIt:LLAFAOPOL~PVStLCMCGYG!'FSJYSLEPLKKPSLPLRTI.FVSYPL~T.DI'
ttl4PAYF:ATFA..
IA.~.T
OCILLFVICFFLYOPQIM LAAaEL'((((eAAG
f
'I~~R
I~ f ' '
,
~


CFFTIIP'PIEv~INF~rWIIL.i00YICKLIYLVWLTLITILSYLFSCFSCLLYAIVROKRTAFLI
LP
:

WIRVIIGFFIALtJILR6f
.. v... .."i. .~ , i


WSLPCVWVAICiLItFYGIF~"!. fSFDYLdIPMTJ1SAYGROFGGFIGrtAG05FAVIAVI~IISF


YCLLLKKpNAKMLWVLTL:.LPYTFGAIHYCYLKHAF00DKRALRVAWQP)WPPIRPRIJfCPn..0666 71677n
750107


SPfvVWEpLLpLV::PIOOPIDLLIFPCVWPFGKNRpVYPYESCAHLLSSFAPLPIOGItAT'dnsE-ONA Pol
:II Alpne
'


~.N3DCATAL~HFOCPVLI:LERWVKKENVLYWYNSANISHKGISUGYOKRILVPOGE0K
L
GFFL711IPGHGNSOYSYLCAHSSIKDFVAKGOEFGIPA~I:.vOHQILYGAWFYKEL~


.
~TpPttrrE_~IlAW:~PPDYKKEKRSRAAHHLILLCKNECC:YPHL:LLTS1JIFTlI:FYYF
'fY:KF~:":.I~P~t.FnY'/AfIX'KRLPr:RR.'7:'!'l':'VRGLPRIr:LT:.~lEGZ'FCYRLOSYK.
..
'
'
'


n~ tl .,..; .f Y: I'll iw?tf"".?!Ar...,..~,.Vl:i:::ii~R:LKLO
:,:i,:..:::1:::'tlf .nr .. .'.:~8':Ilr.:
J ML . ,
. :OL'r'-'_ .a,~..;~IJiK
,.v.WF
. . :.:
:'It:CKe:.~'.'(.'.'~: : .. . , ~
~
'
'
'
,
.


....~r..Ag._.\7::i:::~::a'L,:YRf.irlKF:~
... ~i UPlET14\I .: : /i.cT.::.v :.:.1.
Ll't1i KT:.Yf: i ,~ :L . : 1. :.:SIT
a. a :\
~:l::f~:a.~'JU'IQAII
:
!t':vF'l:l~'FY.ewGi:.i:'.'
'


KEIR CILIJr'VOSCLf VItIAKQIP: H I
PNPKRKVYRSREYYFKSPApNAELFKDIPEV
I$NILLYA


KRCDlTFDFSKKfIYPIYVPESLKTWSYTEEDRYOASAVFLK~IIEALPIOIrSSIVIaN


CPrL0651 777051 776507 IAIOfFPNRDPIDIVIfEPlmNL<4AI L I
PKCwlICDYLLIVWDI INNJ1KATL;IPIICPGRGBCiIG


vdlD/yciA-scyl-COA Thioescvraav
SVLLFLLGITEIEPIRFDLFFERFINPERLSYPDIDIDIt~IA~GAERVIMfAILIItB3RWV
'


KKIIDF45VtNlYYRNOEYPIKIGSVESTML10IKPV5FSCIDCNIYIfIFPDR7L!(Al'BnVIGfLalAt.SKVNNI
AKHIPDLNITL$KALCIpPDL1101.YDlD
AOIITFCITOtAKMAVKD9CR
'


CLLIISLLORLALWACRNTE,SVCIrfAFVOJILrtFYAPAYItDENLICKAAVNRTWRTSLEVGMIIIPICI$KCSII
tIT'IpY9BtlVCS
AESAQVIDMALCLOGSIPNICViIAAGVIICGOpL


VIfVWAEI~tIYKOERRHITSAYF'l'FVAVNEDNOPIPVHOIVPE'1'PEDCRRYNFADARROARLVOIGJNDLLGLK
TLTSINTANSAIEKKIGpSGAMATLP3.ODATrFShLN0IRl11CI1~IC


SIaipELaIOILRPDLFEEIIANGALYRPGPIIDIIIPSFINRKIiGKEIIEYDHPLJQSILRI


TrGnwYOCOVMOIACALASxsLCECwLRRArnKKaFOOM~a~cxlcKRACOBIaIDPc


0655 777B56 737101
W'IYIFDIMOtFAAYOFNKSHAAAYOLITYTTAYLKANIfPKIi~ILIALLTCDSDDI
CPn


_
LIRI~QSlIGIPiLPPIIlNVSSNHFVATDEGIRFAMGAIKGL.R(H.IFSIVLERDIINDPYB
dnap-DNA Pol III Epsilon Chain


KEIMSLLIfDTVITCLDCEH1CLWK>fDItIIEIMVRFTFDSVISSIEFLINPERWSAESSIRDFIORSDGKKVSKIIS
IESLIDACCFDCFDSNRDIid.ASVEPLYIJIIAKDIDI6AAffiV


ORVNHISNAMLRDOPKIAEVFPOIKAFFKDGDYIVCHSVGFDtpVfaOFlIERIGCfFLSKM'FITLCAMDRIO(EVPI
CLPKDIPTRSKKELL.~IFKELLGIYLTEHPI~'1YRDNGfRLSV


Y'tIIDTLRWCEYGDSPM'ISLESLJ1VHFNVPYOG~MRAHKZNEININIFKHLCKRFRTLEVLJ1GEFlNLPNGSWRT
VFIIDKVL'1'ICISSKAQf~FAVLRVSL10ID&YQ.PIIiPDNY6OQ


OLKQVLAKPIKMKYMPLGKHKGRCFSEIPIAYLOWASxIIDFDSDLLFSIRHEIKHRQKIiTOELLLLDRLIYAILVLD
xRSDSLRISCAWMNDLSIVNCtIIYtI:D0AF01lIKHQV0101SF


GFSpVNNPFMEL
TMSI'SGKETKAKGNKPNENCHTOALIIPVTLSLDLHB.LRIl5HLCILKKIVQKHPG~'1'LVL


VF'IIQONFRVASIISPDD11YFVCEDIEELAQELVTJ1DLPVRVITV


CPn_0656 737842 738018
No robust hanoloQ prasane in Gerstbsnk/EMBLCPn_0667 751097 750177
as of 11/7/98


THNFLLLPLSLFDILLTVEGFLCL':LYFASVORMPCEQKAVP(~1LYYYYIAAHSSLCLSVNo sobusc
homoloa prestnt in Gent6snk/EitBI.
as of 11/7/98


~gtKp
NISi.LCICIOKRYFHKKLILYFAAPYASLfCGYFLGIDRVPCAOKIMRLMDNSSEVFSKSC


RlIRtKISGFSF1.01IFLRHVSPEOALALFPEYRDDKSIVELAFIPNTLtOiVRPSKEEPIIIOC


0657 738476 738051
HII80DDiIWSLVt'IOOIVIJtI~IWrCSRaFRECtS.tJIAGK001DIVI0TLATt~'1TSRE
CPn


_
SLApAt.At.IWIRAERVIK!%:OKIDCLIFASGNOIGTHFOQFQPIRtICTITWNNPWILpIIP
Y7aE tATPasa or Kinaael


PMGRYRRVSNSSpETLLL~.'TELGpVLVPGAVLLLFCDYGAGXTEFVRGIVSGYLCDTIAERNAAVFPAOYSLORVRI
ILVIfIIIFGONFLIVRSSMVYVpVYKISLVSADNSVRVEYILBIVt


EVA3PSFSII.ifiIYGt~R:PKRLCHYDLYRIDOKNOEYIFODAEEDDVLCIEWADRLPKPW~CGKSIpDL


In'INIYITHpI'N.IBREIIIEbt
CPer_0668 751176 751162


CPr>_0658 739180 778155 CT547 hypocMCical Drottin


CT578 hypocMCical Protein
WRFVWSPRLIHIIFLLYVPLLLVLVSTOCMKPVSFEPFSGKLSIbRPEPONSAFiYISQ


KRVCi~ISGAVKQW.t.QFIGXQKKPELLATYLFYLDpALSLRPVVFVRDKIIFKTPEDAVGOEPLKIIfRIFRI(ALI
CFGIITHIIPPRDILRNOApYLIGVLYF'fQDIIPDt.iIDRAIASYtQL


RiL~CIwRETEIOISSEKPpVN~N1'KRIYICPF1GKVFACWVYANPODfIIYDwLSSCPDAiYSEFi.FOISIYAIAO
RPAOCKRKRICRLOCFPKIaIiADCpiILItIYDEILTAFPfI~.


PQNIODIQCCVRIKRFLVSEDPDVIKEYAVPPKEPIIK'fVFASAI1CKL!'HSLPPLLEDFIGAOAi.YSKAALLIVt
OJ~.'1'F~ITxTI~ItILTLOFPLifILSSEAFVRLSEIYLQQAIOICPM7L


SSYLRPIITLEEVONOTKFOLESSFLSLLOWLV1D>KIAAFIESLA~I'APHYYISpWVDTQYLtIFJUQlJEP~xQHP
NNPLNEWSANVGiIMAFJIYARGLYATGRFYEKKKRAWI~tIY


YRTAITNIfpdTLLVA10G01IRLDRISKNTB


CPn_0659 77948? 779838 CPe1.0669 751110 752775


CrxA-Thloradoxin CTSIB hypothetical protein


LOENNRDSNSIFREGKLHVICIISSENFDSFIASGLVLVDFFAE1VCGPCRIC.TPILHdaAIEYGSILPKICINMRLF
SI4TIYLFFSLUSSCCCIfSIWSPYNLSSLGKSti4IIFIA


ELPNVTIGKZCIIDOJSKPAE'1'YEVSSIP3LILFKDGNEVARVVCL.KDKEFLTNLINKHAPIKEDPHOOLCSALTY
ELSKRSFAISCR&SCACYTLKVELtIICIDI01I01r1'PAPI~ICDK


'FNNtFIVSNEGRLSLSAKWLIMID1~0EVLIDOCVARESVDlDFEPOGLTANANCF71GOQ


CPr!_0660 710737 739860 FDB18L1IKSARRILSIRLAr<1IA00VYYDLF


apo0-rRNA Mathylasa


MRWIJICPDIPOMGttiCRTCVAt)OAE<.ILVRPLGFSLADItlYIfRAt?lD7fWDKLOLTWDCPn_0670
751778 757196


SIEGtXOVPEDOZFCLSTKGSASYTEFSLPSSGTYVFGSESKCLPItEILIOCYYItrtCLBIrsbW-sigma
rs9ulaeory Laetor-hiscidina
kinasa


PMQQDIRSLNWTSVGIVLYEVVRQKTV)1LQKNPTVPRRLt~1RY171I'FFLCETV!'PAVLSB.tISMLDLIKIIJU
OKOSKCP08KLLJILEtJICEELLVN


IISYAYpGENSPCC'IAISCISH1IGOLtVYIKDHGPSFNPLAV5INI0EDLPLEORKLOGL


CPn_0661 711179 740717 GIFLAXSSYDEFLYARED1K~1IYNLiOl:1'IGpHS


miD-~P-type paptidyl-prolyl eis-Crens
isanarasa


tiSRCLKIKDRRRKMNRrtIJNLJLATVALALSVASCDVRSKDKDKDOGSLYEYKDlRtDINDICPei0671
753660 753205


ELSDNQKLSRTPGHLLARQLRKSCDSff'FDIAEVAKGLQAELVCXSAPLTETEYEEKWIEVC?550
hypothetical protein


QKLVFEKKSKENLSt.AfKP'LNENSKNALWLNpPSKLQYKIIlfGIIGKJ1ISGKPSALLHYRITIN0RKY1'MSLDF
FEEFYHOSIIM~CI'SFPtCYLNIAEILSYPHCI'DANI'DFLC$OSD


KCSFINCQ'VFSSSL~'4~IEPILLPLCQ'l'IPGFAIGMpCIGIDOETRVLYIHPDWYC'1'AGOLNDFIIAEbKDIf
LTLFNADFAIWLVPLLVOGOAVTRCYIAVSQGDGNYCPET01FGSOpYN


PPNSLLIFEINLIOASADEVJN1VPQECi'IQCEOSSLILFJ1LQLYLKDIKDI'ENALR&PRF'tNDN


CPt>'066Z 742938 7411'7 CPeL0672 757TZ3 755018


asps-Aaparcyl tRNA Synehacase dacPlpbpS)-D-Ala-D-Ala Caroxypepcidasa


SKCtCYlIkYRTNRCNELTSNHIGENWta(iWVMRYRNiIGGWFIDLRDRPGITOIVCREDETLKSPMIKRPFFTYLCI
IFYCSCASLSLNAGLSFPCVRGATMWHIIDSGKVFY0KDI0A


OPELtIORLDAVRSEWVLSVRGKVCPRLAGMENPNLATGNIEVEVJISFEyLSKSONLPFSIVIYPASM?KIATALFIL
KHYPrVLDTLIKVKODAIASITPOAKKOSGYRSPPIMLCIDCS


ADDHINYNEELRLEYRYL.Dl9tRCDIIEKLLCRNOVMtJLCRIiFMWIpGFTEIVTWLGILr~I'TLOWLREEFHALL
VCSANDMt~NLiINACCCSVEKFMDKLNFF
EEIOCTtn'N


PECARDYLVPSRIYPCKFYJ1LPQSPOLFKOLIZIVGCLDRYFQIATCFRDEDLRADROPEFf'NNPIIGLNtiPNNYI
7TRDLISIMRCALKEPPFRGVISTTSYKIGiITMJIGRPtNKL


AQIDIEMSFGOTODLLPIIEQLVATLFATQGIEIPLPIaIO'!l'YpEAKDSYCTWCPDLI1FDL.LPGSTYNYPPALD
GK'1CTTKTACKNLINAAEKNNRLLVTIATGYSCPVSDLYODVIAL.C


LKt.KOCRDYAKRSSFSIFLDQLaHOGI'IKCFCVPOCATMSRKOLOGYTEFVI(RYGAtGLVETVFNEPLLRXELVPP
SDCLOLEIANLCKLSCPLPECLYYDFYASEDREPLSVSPI7UiAD


WLKNQOGINASNIAKFMDEEVFHELPAYFDAKDODILLLIAAPESVANOSLDHLRRLIAKAFPIEOCDLLCHWVIYWEG
KKISSOPFYAPCRFERTIKPWKLYMKRVPTSYRTYNSITM


ERELYSIXipYNFVWtTDFPLFSLEOGKIVAEHHPfTAPLEEDIPLLE~DPIdVRSSSYOLL.tIiYFRIRKHRKYKNL
K11YSKI


VI.NCSfEIASGSORIHNPDLOSOIETILKISPESIOEKfGFFIKAISFGTPPHtGIALGLD


RLVM/LTAAESIREVIAFPK1'OKASDIJ~BWAPSEINSSOWfELSIKVAFCPn_0677 755217 755167


CTSSZ hypothatieat protein


r'.Pn_0667 711270 712901
3KS1'LGKAYHCFLKOVSLAWREE11W~IPHHWFILt1pF00FSGEQDRFCSFLFrITIROR


his::-Hiscadyi tRNA Synchatase VSFLVLpEKIATLK


Y.SNNFEARHIM11TLPKCVFDIFPYLADAKOLJdUITSWNSVEKAIHTVCMLYGFCEIRT


PIFFJLSEVFLNVCEESDWKKEVYSFLDRIOGRS!!1'LRPEGl'MWRSFLEFICASNRSONKCPn_0671
75669's 755577


Pt'lILPMFRYER00IVGRYROHHOFCVEAIfNRHPLRDAEVLaLi.WDFYSRVGi~161QI0LImu-RNA
Mettfylcranstatase


NFLOCSETRFRYDKVLRAYLKCSNCELSALSQQRFS'fNVLRIt.DSKEPE00EIIR0APPIRGILYYI'NVPPRQNHA
YOLLKOLHTSAISEADRVSYYPKONR3LGSK~IOWLONIIFNIL


LGYV3DEDLKYFNEILOALRVLEIPYAINPRLVRGLDYYSDLVFFJ1'I'f'LFOEVSYALdGGRHRRLLETLILOSGf
Otrl'PEALVAKVNOCVLENLDSYSALPWPVRYSISODWiFLVt~Y


r:RYDGLI':AFGGA:iLPACCFCVGLEMIOTLLAOKRIEPOFPHKLRLIPMEPDADpFCLEGEEpAEEIAKLWLTEAP
ITIRVtflDKI3I/KELOEKLEYPSSPCELPEALNFSKR11PLQST


W::QIILRRIiaPTEVDMSHKKVKCAL.KAASTEOVaPIr:LICERCLISG'OLVIKNMSLRKEFEAFRIK',FFEIOD
EHSORL'C1'IwLTDKOIVLDFCACAOGK3LIFA0KAKHWINDSRK711


FY1'KEEVEQRLLYEIONTFL L.p'f'AKHRLLRACARNFSIrILVL.RIfi$F.~.W
IVDAFC~FRRNPEHKWQFSKKLLLNY


YR WjKG ILIfI7ASAYVGPRCRLVY tlY:::L.LKEENEANVA'rMN.SIaaIKEVHRKTLPL4~VGKG


~a,W i.r..l ;.(1775 741557 OAFFT.~afIFL'Kt


:>,. irdnl::r nan,Uwt Plarent in
tanabank/ENBL .t:; ut 11/7/')8


I.WFJIfIAMKKLIALI,:tYLVPIKI;NTNKEHIIAHATVLYJU1RAKYNLFtYODVFPVFIEVtEP~.Fr_IIi.75
:S'h.sl n5r.7..d


f::l~'1?:LVIIYEIbIV rTr.'l~ hypOthlittCal I'rr.t,lh


'/PL:dIILOFUFS II:YYLHVf.EL:,tI!U:1't!
f t.\'. IdtKK).LL.fiAWl'Vllld'L1't'NYIiI'::V.'7
f' f


I~r il.r.. n.lqry ')4'.'If,S
R(r/IHELF:M::At;;Y::l::::NIdJ,LfFLf'LII~t:Y.hJI'WdI:YHt.FFI'::FriIIKKAIVDKLL7A


~Jq~: Ia'xrr.:ldwa:ph,rr.e
'rr.rn::PnrtFK.~.LILFL:RRPVDKIVI'AAN1'.'/f.~.Yr:Y::Nh'::::WhUITIItrtSI::Iln~l'f
f
4:fVlWIRLM


YMrNWI'KI'Y\'I'I'KIIIKRIEDIIEWKKKYK'IWIiIRIF't::Mf'fv:YIbYYFTRKGFTPAMPTLDA.:LVN
tIULTfLLFIaITAYL:~L::f.ftf.f.l~'f'Ira:KAylLKTL:'.f:K::'NLLRI.LIhLF.iL


f.\Idr?fIKn4W
:Itv::.TL1'F::Yr;I:KFV:rIMSOq.~.tIPRYf.TfAf!:Wtl'f:LTNIFFI:I~S::AEDPhTtIM..LL'.
D::f.ab'L'LiVt:Lt'aYl1'1:1'f~IY:KTA/r:l.WhF:PAf.A::1'I:U::KLALL:FL


::IYLYAIYIYI;:I.MWF'(XfiA.WI'h:AItLLTIIWfAK::EAr:75'aI::VW:.T::iINICG11LIPILTGF
AEVLRKVIVEKKLtN::K::IMn'1'FEEW:Iff'I::11!f/~Ml'AI.WDKNCtMf.IJlitYlId.tH'LhtDIr:



I IUY:XWIa:AMYVh:(Lifv:NGLVLINRLRDTFQ::II:f.PffAfJ!KMlYYNPHP.~.FWRhYt.tLilQtFf
fEKYKRDPFIHAHfIEL'KSl4iE


' : I'I.:R I I :IU :L:."1'Rl: f
(.F'M'V f:ITX~W1.WFLAAA::FF!'!
f VRMAVNDW :ALFL t E1'KIIYMVK


103


CA 02350775 2001-05-11
WO 00/27994 PC"TNS99/26923



75?:I39 758051 ..~LNLIV4IFRpVFF.~.NSRS4.
.JCNYLRL:.K:NFA::':.1'KER~..'KTt.::.~1:.'.'fCFASF:r
l
~


ri
FYTNTFPFLEEOYTPAVUr:VA.:RYtI',~.NNIvDL111'SHRLK:::E".':J1F':DE:F.TIYIIPFCC
,Pn_On


tloalnlogou:: to CT695 ONELI0MK3PYI~3GFA'IRNt~Illl~fLTTEC~II>~~K
DRM'ISDPLEESAAEf7CD5DLEDRVSESATOVIETIADTGIPEATPSDG
'
'
'


:, LQPI'TNRKG~IYRLG K 'I~CIST. E
EK SDPISRKLAAOHYPYSFC
I
, piSfi~OC


T1~pLM.iOLVDRVEYEARCSLLT114.ARIRKAVSOIW~IVKTKRNPKEO~IRSIGOIPCD.


LLMTRLPKETAEPPYIYAGITALASCR.iFFINVFLRLITLLRRONPEAPLDLCCI'OPISKSKOLYLKKOLPKR


PfAAVAPALILRSCCKWVATDAVOECLPLEVIEEACNYNJ1FSLEATTTVEEVSKRLSELL
o X71107 770147


Y.~.DKRIOCLANVRCITKIITSPYLCACOCVSWOM.KTYDLGRNY'EOVLACASOIDEFAD~Pn_069
7NiE:-r:liceet Hwnnrranzflroaa
ft


Y~.FNFALVNIfDtLYI;JM3DR.~.YfI/:DFtarteISEEHASEIrtJYDWtaILEVNLPILEEDYRr
u
r.rrv
;::1~-
r



'EI Y:' .
~;:n'_1Y- i:e:Ifi':: ""i'.'~'' . E
.. ~.::ivY~r~s-:~ :. :.i.":. : ).
. Lr-=u.'.': :y, i=


77 760119 7593511
HNANVtw"WEIKRRRCSLVKKIRVHOSt:LICLDDLEKLLNEGAt)FV3IPINSN41GC11pP


CPn_OD
LOOVAELVNRYOAYLAYDGAOCAPNLPIDVOLWOVDFYVFSSNKIYOP'TOIGYLYI~DL
No robust homolop Dresent in t:enel>Mk/ENBL
as of 11/7/98


RIAEGINPSGNRSPDDVWVpGAOCOSSSTOCfGiITNSEEGIWEM'1'STSQPQV1~4KAKQLLOOLPPVOC~DNVAIY
O~tPEYLPAPEIKFEACTPNIAGVLCLGAALDYI~GLSAKFIY
'


WQIVRCFFLCKICSPDSSOCASGPAIIOSPSOPIIRITRPAPPPPI'IC071NJ1KRPATIIC~RIOGANPLOIGFLLD
IJECI71VIC:
DKEIALTTYtJiKELLEIPCVEIIGPSIEEPRCALICEtI


APOPPTAGSSSOSEOPTANSSEVAKLVSELKDAVNSIIAEB~CVL10NSOELOTKiII'QiC'1CHOCADPJWERWNVC
NVLRVSLCIYNDF~DIDOFILVLCDSLOKIRR


NRCPDYLWCYRVI11RAT.OpTYTLOS14.IELTSSTCPVPQAVTYAKDAVTO'lvRG11I1QiL
RVSDpGGWSOIDYTSDIARL 0690 772701 771176
CPn


ENPKPCNDPONL~IpWISLGIOCPTLDPGESIONPLLT_
AGKN11'1'RDVNOIANESSRL ASC TransPOrter Nelnbraru Protein


GSALDRVRENNPNENPRIWIALARCIGAJ1VHSNATSVRIANGSV4AGDlM.VSICfFSSIASOSPVOKAAEACYTQYS
KOPSSKL17LS5FS1iI0EtSGfPO


~,~~I~y~
RYNUITi'J1SELIKOIRSL71FBCILINGKYEPS4SOLPEWIVCCIDW1CSLSSF


0678 761329 760682
NOCFOVN1WPLAfWAVCSEDRCWLYIPEID~'1'SDPIFVRNISFPTVSOHDVIFfTRIV
CPn '


_ ELFVCECADLTVIfNPCYSEtSDfLSWS
No robust homolop present in Centebank/ENBLVILCQRASAOIQISNDVDLENUCSSKTIVNGYt
as of 11/7/95 '
'


KiINSVNPSG7JSKND4WITCANDOHPDVKLSCVISANL~SNRVTASCGROGLLARIKGVi
SYIVCKKGNAESLVLVOSPRIL~IKJLSN
TIA'M1~11ICMffGIJLL.ESCOGPGWPDN


7CPFSRMSFFRSGAPRGSQQPSAPSACIVRSPLPOGDARAT<x3AGRNLIK10GY0PGlDM'011~IYSRQHIKSILY9
GNPLFF~CI'ISISSOGCLSDANQKHDTLLLSSLAAVSTIPRLEI


IPWPf7fIGAORSSGS1TLKP'TItPAPPPpKTOGTNAKRPATNCttGPAPOPPKTOCfEIAKM~6YKASNC~ATVOPL
DPOpIFYl9LSiKitl'EAFaQEKLIHGFL1~LVSDTFtaSST~.Et


ATI~KCPAPOPPKCILKOPOOSGfSGKIOtVSWSDED


0679 763936 761735 CPn.0691 773167 7736C!
CPn


_ CT691 hypothetical protein
pyk-PhosphoQlyrerace Kinase "


CY!>nIfLTWDLSPEtICKVLVRVDFNVPMpDGKIL~IRIRS11NP'1'INYLLKKEW1VIWSfCaKILXiaCSVLV
RGLGSNLKIKfiLHASCE10VKILDOFNWIOPCTMNIIt;PNDA0K5
'


HLCRPKCpOFOEEYSLOPVVDVLECYLWtNVPLI1PDC11CEVAR0AYA0LSPGRVLLLfl:IL.DI
SSOEIALOEONLLSNLP~LRSMGWiCFONPPEIPTI~tKMFLRDAYNAIIRRN~10~


RFNIGEEtIPEKDPfPAAELSSYGDFYVNDAIGT
SIt~FNZ'LLSTVLI.TIfEYNII'1TDLFL~INIMOGFSGGERKRNEICONLVLEPEfIIVVLf~EP


r'f.
.EFLGRNLLTSPKRPFTAII~GAICISSKIGVIDSOLDVDALRLICRVLEKYRELIIPtSSLCIVTHIQPKLLi7LIRP
OWIG.LLDGIIVALIfiW
'


It>cISLVEKS11LDLAREtVLKIAKSRNH'I'IVLPSDVKAAEM4SI~YSVISIDOOIPPIQ4GKRVAWR
SIlBIELfJIKSY0E1r1


FDIGPRTTEEFIRIINOSATVFWNGPVGVYVPPPDSGSIAIANAf~?SiPSAYMI00GD


AAAWALAGCSTKVSNYS1GCCASLEFLEQGFtpCIEVLSpSKSCPeL0692 771915 773161


11AC Transporter


CPIL0680 761351 763971
IOE!'G11GGKYEIOESVKVPLEEREDYPYC:IYI'PIESOGLTRCGSEE?IEEIAAL1~POP


ygol-Phosphate Pesmease
IIDPRL011YRYwIODiJtEPANARI3tYGPIAY~IVYFSSPKOKKPLGRtiOIIDPIILDTFK


YSNLPLIIFVIi.CGFYTSWNIt.ANWANAVCPSVCSCVLTLRQAWIAAIFE!!'GALLirGKIGIPLDDOKRLLI~N~
iIfAV~.VFDSVSICTTFKE71LEKAG11IFCSLGGIIOmEFEILyKIt


7RVAGTIESSIVSVTNPNI715GDYMIf~IfAALW'fOVWC.OL71SFFOWWS1TNSIVGAVIYLGSWSHRfIfFFAAW
AAVFS00Sf11YVPKCVKCPIO)ISTYFRINNKF~YOQFOITLIVY


GFGLViGKGTIIYWNSVCIILISWILSPfEGCCVAYLIFSFIRRNIFYICiDPVWNVRYAEDOCIfASYLOCCTAPAYS
SNt,~rNMWELVANBHAVIRYSTVONYfYAGDKKTGI~IYNF
'


PFLAALVZM1LG11MISGCVILKVSSTPWAVSCVLVCCLLSYIITFYIMrI'lCNCSYISOTK
VT1~LCAOYRSKISNSpVl110AAITWKYPSCILKGDESVCTcPYSIIJ1LTSGKIQAD1CI


PKRGSLTYRLKFJIO~iYCRKYLWERIFAYLOIIVACPNAFANOtiNDVANAIAPVAGVLRNLN9CKRTTSIVISIOGI
SSDESKNT!'RSLVSLCKIUIOfSSNYTOCDSIQ.IOKASGiII<TDP


pAYPASYTSYTLIRi~tA!'OGIGLVICL71IWGWRVICNCC1CITG.TPS1~!'SVGIIG&11LTKIWS'1STSSIEN
GTlSICLREDQLLYLRSRCLSPLI'AVSLVINGICRLIIEDLILVAO


IALASILCLPIS'1'1'INWGAVLCICLARGIMl'HIItIIKDIVGSWFITLPAGAI3SILFFEASKLLLIKLC9S11G



FALRALPN
CPeL0693 776393 773310


CPn TPR Repescs ID-Linked 0lt:NAC Traneterase
0681 765001 764358 hanolt~pl


_ LRSTEItM.GEISNEGiUWIJIXElTrCSGI
C1'691 hypocMCieal protein AAL11YCY


NGIR~KSFTRSFRQVIIAXKAII~lp1'IJLALFGOSPFAPLOIIIQ.~IWSCVEriI<.PIfTALGIIALL1CRVSE~
WCSKGLASEPGDSYLRYCYGVJ1LDRONpYWIIEO~iIYVAWP


LRDORYECL.LQEAIQ.VSDKEYpA0CI10~i0laNtiLPA0LTIlPISRAGILEIISIODSIADTDOVECyIFSLGSV
YEGtLKRLQGLDCFDKILALDWlIIPOSLYNKAVILSEfWEAIiIRLL
'
'


AF~DVAILLTIRRWTYPSIIfL.FFRFLtiOr?.iAFELI7t!LWEp'NOLLESSFOCRKADKAL0K
EVAVAIO~IPLYfrKAWhLLGFLLSRSKRWOKATEAYDtYWLRPDGSD011Yl~iCYL
!


Pr~r-
SK;RVAKSENESDVLOREtHQIFFSI~FIIPEKEFYLWLOVIRRTJ1GISDS58adWRTRLALKAFOFALFTJtAEDA0
11lIFYVCLJIIILDLKOIOIGYEAINSALSDSL.C101ARnIa


INNTLEEK
YLH1MQGETDIUITKELLFLpIfKDS'IPAPLIQKTWSDPSSNOFCRRIDTIS


CPc>_0682 761913 765955 CPrt..0691 779135 776330


dppD-ABC ATPase Dipepcide Transportpbp3-PBP3-transplyeolase/transpepcitiase


TSKCWKNSLFPHIR1LPKRSCKRLNASNPILOIEDLSITLitK0R00YPIVOSLSFTINDGFSOESEAIOrINSNKRPI
Uf!'PIYESIAOkTNItLISCIVIAFAVIALRWYtJlWOIWKLE


OTLAIIGESGSGKSVSANAILRLLPCPPFSVS12QVNFQGHNLLT11SRSIOKKIICTEISMEAYKPOIRVLPQYVERA
TIC~tIGKTIJ1VNOWYDVSVAYCAIRDLPfRAWIIVDEiIOIKO


IPONPpJLSWPVfTIEQOFREIItLMLILTAEVAXEKIC.YALEEItGfNDPRLCWLYPNOLIPVRKNYINCLSELLSQ
F1JILDREAIF~AIHAKASVLGSVPYLVAANfIS6RTYLKLKMf.


1501?Q.pRICIAMALt.CSPl4LIADEPI'IALDVSVpYQILOLt.KTLOKICfOISLLIITICISXIJNPOLINGVV
RRNYPOESVASDILCYIICPISLOEYKRVTOEG80LRECYRAYE~


NGWAETADOVLVLYAGPNVECAPAVOMFMQPSNPY'1'RDLIaSRPSLQPOpLCSFNPIPCPKLPL9GLASIOOVMLLE
SVESNJ1YSWALVCKEKiVFJ4CWDSKL~ItICIOIPILV0N101i


OPPHYTAFPSOCRYNPRCSKILNRCSAfAPEIYPVRGGNKV1~WLYDDFIOE~OAVPEAPatKIpL.TLSAEI4AYADA
LLLEYEKT6TFRS~IIKREKi.PPLPPW


IKIXiAI IJ1LDPNNGEILANASSPRYRDRJDFVNAKVAEDSKAVRSSIYRWIrWKIOIIAEIY


CPn_0683 765936 766919
DRKVPLIRERRNPLTCLCItEEILPLTFOCFLOFLFPENSVIKLOLKIWSFVO0AIt110NL


9ppF-AHC ATPase Dipepcide
TransportVTRLLSLFPYEECfCPCSI1IFDAVFPNEEaHILIOEYISL0E0KWINECWDNKADItL


CVCCt~f!"1'NFPOPLIQATSLT>IYYKRSFWFpCKTIASRPVDDVSFSLYSRMVCLICESCKEJILOpVFN6LPANY
DXILY'TDILRLIVDPEItFSPVLPSEVNRLSLSEF1'6LOGRYWIR


SCKSfWLAtxLLPLTSGFLTFNG1'PIKLHSKIK;RtWLRSQVRLVFONPOASLNPRKTISAFSTILEDAFIEVHFKSW
RKSEFLOYLAAKROEEALRKORYP'TPYVOYLEKEKTRQAfKII


:.DSLGHSLLYHKLVPKEKVLATVREYLELVGLSEEYFYRYPHOLSGCOO~tVSIARALf&FCOEHLD'T!'U1YLFSR
TPYIfDGLEPY7fDILDLWINELONGAtIRALBWNEIIYLFLKiRVSN


VPOLIICDEIVSALDLSIOAQIIlMLAELQKKLSLTYLFISNDU1WRSFCTEVFINYKCLSENLPALFSTFREFNCLAR
PLLOKYPISIVRNKROTEODLAASFYPVYCYOYWIPEUIYG


OIVEKCNTICRIFSDPOHPYTRffLWAOLPCTPDORDS1(PI!'pEYEIKt~EESC51GCYPYNOMTLGSIFKLVSAYS
VLSORILWGHNECPANPLVIIDKNSP'CYRSSKPHVCFFKflO'!PI


RCPpKQEACKSEIIPDpG1HH'lYRCIH
PTFFROCSLPCNL~ffICRCFIDLVSALOISSNPYFSLLVGECLOOPEDWDAASLICPGEK


TGLGLPGEYAORVPHDLAYNRSCLYATAICOHTLWTPLOTAVMLASLVIA)OWYVPKLL


~Pn_0684 768056 767181
LCEWEGEHVSYLaSKKKRTIFNPDAWEYLKTCNRNVIWGOYGTAMiCSOFPPOLLBRI


spoJIparB-Chromosome Particioninp
tCKT51'AESIMRVGLDREYG'MKNICDiWFMVCFSDODLSLPTIWIVYLRfGEIORGA
Protein


ERSCDIVPEISKDTIIEVAIODIRVSPFOPRRVFSNEEWELIASIKIIVCLIHPWVItEPMAVKNIDMNEKI~~ORLSF
LRG


IGTGDRVLYYELIAGERRWRJIEIpLACATTIPVILKNVIAOC'l'JIAFATLIFlIIORVNWPI


ENAEAFKRLINVFCLTpOKVAYWCKKRe"LYANYLItLLALSKTIOESLLOGQITIGHAKVCPn.-Ofi95
7Rt1301 78178:


ILTLEDPILREKtlJEIIIOEHLAVRLAELI11KOLISEECSSIEL1CPTPLDNAESSKOHEEhonalagous to
CT595


LOORLSDLCCYKVOIKTRCSKATVSFHWM~ODLOKLGWL.iSEK:I'LSESIa"SLEYSNKKLLKSaL4SMFIk:SVCS
LOALPWC.NPSDPSLLIDCt'IWECJ1AGDP'CDPCATW


CDAISLRAGFY'vC1'VFORILKVDAPKTFSMCAKPI~C.:AAANYTTAVDRPNPAYNKIILND71


~:Ptl",0685 76800 768317
EWPTNAGFIAWIWDRFDVFGTLCASNf;IfIRCNSTAFNLVCLFt:SIKGTTVNANiLPNVSL


No robust haslolog Dresenc in
Genebank/EMBLsNCWELYfDTsF3WSVr,Am;ALy~CATGCAEF0YA0SKPKVtEWVTCNVSQPfVNK
as of 11/7/99


FPOSOYLLIFPNRILDWAFEILWQCML?DQRKHIOMLNKHHSIEIFLSNNWEYKLFFPKCYKCVAFPLPTDAG11ATA1
CTKSATINYHEWpVCA3LSYRW.~.LVPYtCVpWSMTPD


KTLK AOEtTR
IAOPKLCTAVLNLT11WNPSLG3NATALSTI'D3FSDFMO
IVSt.'pINK/KSRKACGV


'IVr.ATLVDADKWSLTAFJ1RLINERAAIIVr',.CQFRF


.:rn_0586 758373 76817c


Nr, roD,)sc homolog present in ~,enebank/ENBLCPn 0646 7RI707 7R~5'n)
as or 11/7/98


:\KD.SMIPtX:RLFRVIOELFFFSa'LYVCEORRPRKL7P~WHWFPIEKPRFLLKCFKKELt.'f:'le
hypnrhcwr:,l Iw"chm


EIFYERhI N:Y:FYVPNI'LLTY :71FE I EV~):a.F~.p:
~ .'KI: P I KDL11::N:AI IE':I
lOrl'RIadNf'KNKLY IPEE


NtJtILY I I N(.AK'T'L.~~1WIIAt.111
I HKV IOfkIKTVLt1N TKK4AKt.'V
I RY,MIEM.EFPIAE


:'Pn r)b87 759501 '1W)3l1
RWU%71LTNM1'1'IRN::II!T1111CrF.KfH.::HfIVAYf:PKKEJ1ALLAY.ItIK~KLLNNLEITtRYEIK


'T4Nt IrYp~thutic,ll pruratl:
KAf~:LLVWOC::YEKIA'IAI;IKYI/:fIVI.ALVUTEIC:DIt'IIINIV11~'NUIr:I.K::fRLItN


ItKINYdtLNIIAYRF~I'PM.'fc:FN~KLVItNIWY.KfY~F.SAIAICIVL.\::FL::LKTV::N7YKVIYFNII
FJlKIIKI.:It:t'I::IVY.:a.EYIl4a:AII:::::UI~IdvarEarINE:ItI4.IJ1KKYIi:YJW


II::OAI!ffl:: t LLLTRMbYAV.'~1:FLPSK:
AL.i:iLEYAYNIIX:E'.a'EtKPYIU:FIJI::I:FY
IIIN


I:1LM:AYYN:LAYfQI:.VAWLDHPIqKLLKETSWJA00L'fDVALSK:fYOLImAN.~.:aCtYyrfi.r7 'r4
'a'.'1 nt t~.1'1
C


'il~rl::Fl.TLt.tNIELKELL1YJDV:.'y~DFMLKSSPLFHpFFRN'fl:OCEWTL:KRFY'*Kf:r::l-
Klcxn7acv,m F'.n:rmt f::


WNr:YI JA::fiF::NHT1.K'TLI\trl':
Ir:f:l'Yr y, I:AL1JN t Y:Nf.E:IJ1WYLItKI~:LA::rItSKKPJ1R


'hyln.NN Ir.'117~ 171)1)7
ETYIfi:IIMKTINH.TALiIV'rPII~.'I'Id~/ANEIAVI'kIsEY::IILtlII)ILKYKVIII'Vli\I:%!Ad



."pqal trrl!Ir..r:.:.rl 4rntriu
::::rJUP::I.:'/DELHAYftYn'Vt:EJIII!I::VVAYI1KAH'fr'I"/r7f'/::IY:M:K'I'VAI:rNI:Y
:::::


104


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
SWIDIAFOIWAAOPOFL3xF-.~VPAFJtIAK! 4're68 nypocnec:~.at t. .m
'~tOCxPOEJIEKIV'fCKLN': AFK'lVKRffCM IDP'JlCf PtILDI:DAEACv'
TAD :"'=N:a: FG~:ELKKC:.'.?FALC'.::YAAPKD


. TTLVOCFKPNPNIOI~DOMIChLI9ft~.~~tt.E;~INN
fFOEAt:LLECPf IKNADLi
IO~L:OOf.~.K':'.SI:a~VAIAOfAD~IIIDAVOViIDOiN>I;GCNIINIOIfOODARKCDI~
EOF ILWK1CA B1C9 E1HIJ4R
'
"
'


':Pn 06os 79)44) 754201 t
CL;.
.
ALLYPSDItDONRF~LANFL>r!'~YAVORA''ORAELFA:~
ri'L : SV33SIKTIlI


pyrN-tMP Ktnase
EPNKNNAKOTRRVLFK:'~,GL.iKLS"~1RIDEMRISRLVSELRAVPNNDIEInILVIGQCN715207 ~~574~
0707
CPn


tLIiGIJtEGKELa7INRV.~.AttOMCMLATLINCMAVADUJIAE:.IPCL:.':~_
LSCPOIaDLI' 'f~e7 hYPOCne.:~.~t Gr~t~tn
.
.
'


PCK.~>tEILDOr:KIttr'~':A(t:PYL'ITDT".AALPJICELtNWLtYJITMNVOf:VYDKDPRL.
..
' :
' ...
...",: : . . .:1,
'. :~ ;.
.
~
,
'


: .... ~yr~;
. .
....,..:.i.FYi .
:rt.: ~:'F'C'.:=.i:r..'.':..:/~i.;.......:.~TIL::II:~fPn..iv.niv'i.~
' ~ .. .... .
: . . . ..
'..:i~?'.:':.:' :.::.~: .. . ....,
... ... :.


.._ ~:lll : '.': I I' OV IANCKIESTPdtifJS: LIaMLri' WAK.:AV:PtG:


CPn_0699 781179 794721
CPrL0710 794492 796210


rrf-Ribosome Relusin0 factor Cf66e hypochscaal Protein
ODTEKKMAAALDffNKEVKSFR1GK1WPALVE'IVVVDVYCT~SDIASISVADRC>CKSMAT~TAfDfNIMLOCVCTYV
ItGV00YLTELZZTSTQGTVDtL:CMfNLOFRM
TMSIrt


. RS
LRQLVISPYOGNNASAIAKCIIAAM.M4PEVP1GSIIRIKVPEPTADYROEMIKOLRRKCOILSpYIIESVSNILTAVN
TCIZT>4ARAVKCS


EFJ1KINVPNIRRE71NDKLKKDSALTEWVKON~IOELTDKPCKOLDELTKOKf~EIAS


CPn_0711 795791 796484


0700 795094 785609 .~
CP
.ZdFINYLyLCItYpSMfFM~IfItKEEKt4SOPL:.DLE00M00t1oR.10ELKJLSVODK


n_ ~
CT676 hypochee ical prou:n
VHKLUtLLRF~SDKFSfCOOCSLL1GYVAtw~KVLCRIt4R10tI
PATICYTEII>K~LVIRSYVCATCPCPSHYYNNEHLSLSKGVCVLT'


tJIVNSPTIIOCYFICQO .
LECfrI4CKTVWttSKODD Lf.GCFlQCYTNFKNOITSKLKSERWSSSTMEKCOGSLttIGR
OTLEREDYEOAAVIRDOINHLKT104~DPS CPn_0712 799)15 796781
At nylacs yelase)
~ co eds
l


.D _
E ogy
APOEASNLFtpLLKLI FHA dolesin: haao
I
~~
~ IP~


CPn_0701 795594 796672 EE~D
VYDFD
FISDEFD
~,pprIpIyyNGVAIOfT'fQLKNE~ILS


karG-ArQln:ne Kinass
gOpLSpSNEpGKDLLPROTS6fIHISPKL.TKDOGSSDPTTSCDQFlJID~JIfIJISAKAE
MTLPFIflLLEfLVI(RKESPOANKVWPVTTFSLARMLSVSKFLPCLSI~Ofa.EIbOF
I
IAKVA>ULO~S~~PKEONiUtDSPKGEBRTNKPpNiIINEDNOASPRODPOPK


KPK IO~PI
p
SAEPSWOJfARDCfPLIIB4KPVElxAFHfKATPDSPEKKDOPEDGS1~OGSKIFJ1TPLDSQ
ITSMfIAIIEGFGEFIVLPLKOTPLWOKIIPLLEtIFLLPYDLV~'iP~'.EtILW'fRSfDFIs~A


INfpDHLVLt~IDFOGNVEKTLDOLVOLDSYLHSKLSFAFSSEPOFUfTNPKNCCTGLIISK~~ETDSAADAFiDAtAS
DHTAEOtHtCfPItKV84lt%SA


JCFLHIPALLYSRTTNLIDEEVEIITSSLLLGVfGFPGNI~iRCSLGLTI'~i.L~SVLSPfHVODt.FRfDO'fIFPA
EIt~IAKl04I&VDL'PpPSRFLLIMaGAtIICII6FHLDBG%


fAITASKLSVAEVAAKKRLSEfl~WLIOdLILRSIGLLTHSCQLELKITLDALS~OIf'IIDKTSILg
HDOGILIEDL~KNDVIYFX:RIt
SNpIWCITVC
YIIGTDP!"fCDI
V
T


DLCLIKV'IENHPLhR4PLFW0IRRAHL71LOKQAt~SRDLOKm'ISHLRASVLKFLTI(GLHP_
_
_
_
_


!SF ' ~pV00LAILPGTdTASLtHI'K
OELM.AOVINOfPIYR9'ffN


~L~I~~'~~~~ ~ ~~Iii.SIOIt~ILO


CPt~0702 789700 786929 ISIOiSPEPGKFII1GYVKTEEOAACLVDYINIH


yscCJqspO-YOD C/Gen Secretion Procsin~F~~~~L '~~~~~~IPCVRLV104fAVLLPAtRCGIID
D
LNL.RYPNRYRM~fgRYOETSIMIWNCRILTRGDVTDCKM'SIOPMIIFLfJCD3LXYK
I.IQWPVKIVIINICRItILOGIKIUOt>utICILSGLFfLDLVLLCVSSORP2'EI'SANV~04L
'PAKpt


RDEKLAACPKNSJU1SLSAXKSIftlOCT?PGSIPSKVFSKFD11T01~fFOKTSGSAFIIIYHII


TLXELEERKKPRPERRT'fADVKRSPRFLP'PDEVEPVPAASKDOLDSIQ~~


AVNAINLSIKKOLEED'fSTV'fEKD110PKT011TPHASKIQ4VASPSTSHPGIDfAATTVAVP
YSKISCfDIIYfDSNDLO CPe~071J 799!17 799132


.,.._..m........".,.rree.,.-r.,m~r~rtarrmtsILELIaFCT66J hypochscieal P>rocsin
LDLItBEKAGPPNtft(SIPOGTKl'fTAAL>aNfSMLEICLIKNFATYMGITSTLELDiIDGAYV


LPISEVVIGVMQOFDIOFitIVLSAShGAL.PPSADTA1ILYL0l4~tTfrliLPCRL1COSAf.OLDS


EGNVVMVRRFSGt~EYRtIVL8Il~fSEISII.SDLCLGKO


X0714 901125 A00091


MwA-Glucawyi cRt4A Rsdoccase
mUaAt*BRERJvIOYLOSFEKNLFLA~ORPIG10~ATTPLL'f~tA


NYRIVLMVIL1IVCT
g.YIITSESPLTAGAULSf'~.TSaGIRPYRHRCLSCIIItLfOVTBCTDSLifOElEI000V


KMYIJIGSKZRCLffDWTLPQKJ1LKZI01ICYRSRICTPDI~V1'IESWOCILLSYDKiTIt


'i'I4LF4C3YSDIZiRILVIUIYLYpt~YHRITFCSROQYfAPYR?LfRLTt.SfI~PYDVTF1LC5


SESASQFSDL~l3LASIPKRIVFDINVPRTfLWKETPTGIVYLDTDfISHCWI01T/0Cf


K9lCVIHWILLLTCAAKKOWCIYtINCSSHITORQISSPRIPSVISY


CPn,.070J 791205 789695


pkn5-S/T PrOCSlh Klnass
RSRWHOLNP6TRItSTVIKVfSPSPSF CPt~0715 901436 80)162


RKIGFMOCROGIPLPEPOVIGCYMVXKILSKKL pyrH-ON71 GYrass &ubunic H
TSRSVYNFLKE710SLHOITHPNIVKFHRYGIIWOt7CLYIAMEIfIt7CISLREYILAOFISLPKFIIKISHMAAYTE
ASILSLASLDtiIRLAAGNYICRi.Q7CSOKEDOIYTLIKBWONCIDE


QAIDIIFDIAOALEHLHSRNILHKDIKPENILITPOGKIKLIDFGLADWtrfEIORAHPSVfIMfRIGKSLKISASOKQ
ISIQDOGPGIPL.~LIt7CVSKINTCAKY1'QDVFHFSVCIN~r
C


IGTPYYNSPEQROGESHSPASDIYAt.Gi.LivYELILI~ISIGRVFLSLVPRISKILA%AL~~EIFSVRSVRKKKYFI
LtffFMRGVIAESXOGST1IDPDGTfYgPTPDPSIP~1'f


OPSPN4RYSSTREFIODIHHYRFtSGDMOEDLRIKDHTVALYE0I4ZQR~POfNNDFLKDKI~~TSHNDLIIDLFDAEI
TtPPLYSPL<"fONEZM.T


FISCVLYHQCYPLYPNAYDTL.LOt7VtNGWGGYSPISt47tTIALSVVKSLVC00DLDRPLLFI!'SHLF%3~TfERY
fSfVNCOTGD~T~'TAFKEAIVKG1MEFFZiKTItSISNDIRI~IVCC


DRVCEINECLIRIOCIPIDEtGISILCLEISKENICLSWIJ1CCKTtfWIKROCRVWDFESIAIKLISPIfESOTKNKI
GNitOIRSSLTKDVKEAIVOALRKDKVAPIIKffiEKlR


FSPCIGKITSLOIRETKVAWEIGDFJ1VVCTLELEESVIISLiClLSIaEI4DRROKAIFCPIKNIOFIKODIJCSIfO
KKVIIYKIPKLADCIIFHYN~tSLYGEA55IfLTG'JCSASASIL7ISRti


ESIHGCIOSRl,7HG5NSPSTi.ISLKRIR
PL'lOAVFSLRG1IPFTNfSLEI~.TIOtYICtDELFYi~TAIGI?ONEIOHLRYItKIfILiITDADV


I>GKIIRNLLITFPLKTLLPLVE~iLFILETPLFKVRNKT1TLYYYSEO
1'DK


CPCt_0704 792))0 791209
KDSSLEITRFKGIGEISPKEFAAFIGPEIRLTPVTITSLESISSIt.QtYNC~XOf


fliN- Plapellar Motor Sw:ceh t)anar,n/YscOIlIDId.ITDf
family
RYfM11V1U1DSSAS1ILKSRNNFLSSLOKTEEpVMPCFPKEItOHKIREKFPLEDVOVSIK


FRGSITAVEATKEFGVHLLIOPMNOPWEVENLL.fLTSEFaEQEI?NAVFDDASI~SYFY
CPn 0716 80366 804902


EKDKLt.GFHYYFVAE71CKLFELOWVPgLSAKVOCDAIFTATSLOGSFOWDISLRLDGK9YrA-DNA Gyrass
Submit A
C


NVRCRLLLPfDTFQSC0KFF5GLHDCSDLHNIDGtOQISLSIlE9CY50LT0EEWilOVVPFMRWSELFRTHfMNYASY
VILERAIPHILDCLKPVORRLLWFL.FI11DDOKMHKVIWIAC


SFIIa.DSCLYDPETEESGaLt.:'VOxHOfiroGRFLTPSSCEFKITSYPNLTHEDPPLPENPRTMALIiPHGDV1PI
YEaLWLINKGYLItri'OCNFCNPLTCDPNAAARYIfJvRLSPLARICfL


QASAAPLPCYSRLWEVARYSLAVSEFIKLNLCSILSIGNHPAYCVDIILDGAKVCRCEIFNTpLIAFHDSYDGREKCPD
ILPAKLPVLL4NGVDGIAVQftZ'KIFPHNfAELLKAOIAI


.~~I~ INIHCI(F7VFPDFPSwINDP8EYO0IfIGSITLRASIDI
- INDKTLWKOICPOS'C1'E1Z.IR


SItZIAAKRCTIKTDTIODfSTDVPNIEIKLPKCSMKE?1LPLLFEHTECOVILYSKPIVI


CPn_0705 79)176 792734
YENKWECSISEILKLIriTAt.OGYt.EKELLLL0E0LTt.OH'IHIfTLEYIFIKMKLYDBVRE


CT671 hypothetical procsin
VLAINKKISAI~Uh'AVLH.1LEPWLHELATPVTKOOT'".~OLASLTIKKTLCFNEGCTKEL
'


fSSKCMfRTES
tAIEKKOMIOKDL:RIKE\TVKYLKf:LLERHCHLGEP.K1'QITNFKYAKTSILIOOOTLI
FMELKKTAESLYSAKTf7HHZYYONSPEPRDSRDVKVFSLECKO'fRCEKT


RKFADEEKRVt~ELAEVGSKEEE0ES0EFCLAENAFAGMSLIDL~1AGSAEAWEYAPtA


VSSIDTQWIENIIt~~TVESMVISEINGEOLVELVLDASSSVPFrIfYGANLTLVOSGODLS
' CPn 0717 H0~~69 HOSJOti
'


t CT556 hypochec a:a1 prccstn
VKFSSFVDATONAE11ADLVTNNPSOLSSLVSALKGHOLTLKFStt6NLLVOLPKIEEVO


PLHNIA.r~TIRNREEKOORDONOKOKODIHfEODSYKIEEARLIRIKFIDTLTIWRMEPRHIYIRKPETPKAPDVEKP
I1IPEYMTMANTITFa:PVKTLOOL


RRALTEOROAEEDv'KFIYDNFIOSILISfFCLVHKt7ttDPJWKJ1SKRMRfVYKEQ


C~ 07Gb 79)689 79)180


CT570 hypothetical proca:n
YJtVAK'JPLEPVLAIKKDRVDRAEKWi(EKRRLLEIEOEKLfIEKEAERDiIVIOJHYNOIII00C~ tt7tlt
ptt5700 905626
~


S.RDLLDELTfCDAYLOIKSYIKtIVAVQLSEEEE1NNKOKEWLA.VKEL.EKAEVNLAKRRol procetn
CT5'7 hYfMfhect.
~.fTYFf.ALF\'MtINOEItFLC~IHCRWAPF
I N~PLYLTLIADHDTfY LrIKNLDKfPLP
RA'/M


KEEEKTRLHKEEiMKFtL.KEEARAEEKEODEI1COLLFOLROxKKRESCGS.
VEr~WFJCI~Jf.iTl.~..~.LLK I FL:.~.DLiSLItLL.IU.TKFE
I LT1JIDLYt.AON I


':Pn 07n7 'I'1501~ 7aJ7114
:'.Fn rfllr NP~477 HOewtn


y:a:NYtttt N IFIJUnllatTyMt ATPJtel:ant: II:,rt.Irml I,linr ::ynrh.naI
'JNIIDULTTDFtfftNSOI~.DVNL'I'TVVCRfTE.YVCMLIKAVJPNVRV::6"ICLVKRN1~1EPLVKKVIKt::
FIKTYf::ffVC:Y.>?t:J:Rt.UK'fLTE'IltfYY,':RAYYV171IL:a:LVOINPQ
FGfYt


'IfEVW;t'fll::FAFt_:PG:EL::CI':;FSSEVIPTi:Lt'I.HIRACtK:Lt.:I:VLFY:GCEPIDVET.
C*S
Itffl'/A'fItIJX.'rat!\'.'IDtv'F:KF.EL1.ELLI'FJlfl't.l~Y'~IF.G:MIt.VINII1'RDMMIP
AWi


Yr:fIJJtPJlt~rfln'IF'ItAFi'DCIAIRwKLR(?IL7T~:VRf:tCC:MLTV\I;.:ylt(CIFM:ACVV11A1
JJ11:1.:EItIJCKh:Ff'EF:1WHIv:IVIIftI.LYfIf:X:LIITAKTIt~tAK.YyftIELF~
IIF'IY:f1


.:LUY11AHNABF7Il)VMJIALA:ERtiREVREFIF1:OL:EfX:MYn::\'ItV:.'f::D(J:::iOIJtLN.
'L:KI'k::l'flIr191I::It11VN1fHKFMI"r::ll:YlJ1VTIIr't.WIJItIa:K4~.fV
::Y1
A'/n
'fYhI
YY


AIIYWrfntAl:'tFl.llr:KTwt?iMD:;YfI:FARALtitr:L.AM~Er1'\HV.:YTr:vf~aTLMtL.
' .
.
.
AL::I'hlt:lrftY~l.lt\'IIMKlllatft
Ill:ltl'l/lft:lP:attl.~..':Yr:l.fiYtXNJIA'i::Vifr'1111'RTROF


(IIYfALOVLA 1I:IuMIt:'LLtYi:FtiNl:1'fIINKNLI.I:::ILYFi
f.ER:Y:A:a*t:fl'fAFYTt'LVr4:f1h19'IEPVADL1't:.'.ILff:IIfVI::N,IIrWA'::1
YM:1


.a::f<L.LTAIVf1:19JRftIIv:KARfVLAiCYK.WI?1LIP.It:I:'fIId:::ItKfIttFAIDIiLOKINR.

.
(
~94


FI ~K(~L I tll:KTtI7FGIkV LHA I ttnn: sr.
FR ~'I~t, _ l.ai '!~
'


':Iw_t~l'W '/"n,ytft '/r5G14 In.m.,n
u'1".'.', I,yl..tlvI n.~..l


105


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
,1


..:.:.VAKPDI~:Y:: ~EGATIKAIR (i:FRTVPFLPY.:IL.7t::.::.c:.,
LR ITMKEF Irt'f : tKNL'JDRPEcJR . f HFFLRIIC:iLI:RnIIFPFRRF.:RL:Y
IKEIQCfttT:. ~..~: LF.~LLIE
~
'
'


. ..
MINf:A:.LH::ENKP~I:LAPVWNII
.If(7411
A'/IJrVVLOY'/EEC:"fLM::.:.TM:LLPt


TLL;f~VA::PNfIVRIIGLELMEEK W f PfYIMRHSDf~R
C'I!:~.tSfAG!AI~P~'.x>:WLT.'.iAitGII~IKFLwf'1
~~LL


'.LTOIlt'..'...-.DI"IJ1RYVTIEIw~FLYL?tY.iLKIYOL'iT~~
APt~"i7GIL:5 PA


':Pn 07.',: 90771 999199 .
t oRCYOREDMERGf.KLMKFtfhTLTISVNL
LY'ACLL:.:rILPC:VRVf.YEHCLPpQSAVYlI:


KdsA-KDt7 :;vrsc netns~
VRVLRCY'SASIIPItALAPLV:SI:.FYAQROYAVPLF:~:',T.'A:rIN:V:.S:,VLCRWVLitDIfS
.1PYSDRIOWFFY.:.>~IDKANRSSL.v
LEIIIGKLOS1:
:AG?C'JIEG'
C:'
KRM1
KML:
MFPM


.
';YAT.iITAWJOL'lf:.ilYY~SKRLPM'SKLWE.iIRRSIKVIC'.'".'MIUICItITUiIJIILT
. '
/ '
c '
. '
I
LRI:r\KVKETF':Vf::LT~VHTPODIIYAAAE1.~\IG','I?AFLCR~':~:.:.VA

:FR!ax:LT
W


. ApAIAF:..
. F.
. CIFI 1FLFT,FAKLLRVEDLINLaS!
'.',tE~"'.AtVNt.YY~F:..~.PNfIIt!:PTtIY'f!.:'f~.tlflY:;a.TF.P~'..~~r~,Wlf~;r:pMp
SIpVL!YW
:T"'~YVIFLNFLTPf~ADt?:r.
~i~.
..

.


., .. ..:. . ...~.. I..-...: .I.. .:.,::
..:..~~::~:~ .... ......,;:''~:.\~:~yv
. .-.....-..y.,t~- . _..


m. ........._. .r._..:': N V. :.:'.
. . ....:~ .'. F7 : . ~:. -
. ~ n
.~


a08971 :eneoenk:~BL as oC 11;7/98
0722 roWtst r7omotov present ln
909177 Nc
CPn '


_
IPDNILKIRAKETSLSFLLIKPPSPPPLKf:DYLFDISpYTSSE
CT651 rlypochee7.ca1 protein VAIAISANIF/IRL,CK
"


YCLSM'KFLYCLFYSL;.LL~JIFGTNVAIIOVDOICDVSCIOaDIPOGPPFIJIIKI(VNVTLRLRSISIIS

LTICCSYFKLNKASLGS


SKQICSpEERF!'NCKIDKSCMELNPPOSSYSCKEYLTRISIRIIL?tINFtKQMpIRGNSGL

922092 822976
7


LNYODCSLHVYDCRFOVDPVP~YGSPDKEDSSSGGFGCTLYLSLPRN)2-
CPn,0


nEoBhdonueluse tV


072) 908979 809703
NPl9IVL.PPPSIPLLCAHTSTACGL10JAIYECRDIWSTVpIPTANORpWDRRALJtEEVIE
CPn


_
DPKA11LKETCLSYIIiSHIIGYLIHPC11PDPVILEKSRICIYQEiLOCITLr:STVNItIPGA
yhbC-ABC Tranaporte: ATPase


ASNPILSVCNLVIOfYNKIIPVTNDVSFOINPGEIVGLLGPNGJIGKTfAFYLTVGLIRPDSGALKSSKEDClB~IKIV
SSFSOSAPLFDSSPPLWLLEZTAGQGTI,TGSNPEEiGYLVt~t.Ri


KIIFKMIDVCKK!l~ItRARLGIGYLAOEPTIFKELTVODNLICILEIIYKARKpOSHLLNOIPIGVCVDfGHIPMGYO
TTSPOGWEDVLIJEID


TLWDLOL:.'S~~1.HKKACfLS~uGERRRLEIACVLUl4P5VLLLDEPPANVDPLVIQNVKYLHAPLDEGYIGKESFK
FW"DCRTRKIPKYLETPGCpENWOKEIGELI2IFS1QiR0S


IKZLAGRGIGiL:TDNNJUIE~IADRCYLIIDGKIFPEGSSSONISNPNVKOHYLGDSF



CPn_0777 927779 827101


rat-81 Ribosomal PrOteln
.


0724 810602 909706
CLKYMAAYCCPKNRVARRFCANIEGRSRNPWOCPNPPGp!lGMQRKKKSDYCLCLCiKOK
CPn


_
LKACYGMINCItOLVKAFKEVIHKOGNVAqIIET.ZDtPECRLGBNYRI~ICPAK':IFAIIQpi.VA
No robust homoloQ present in Genebsnk/EMBL
as oC 11!7!98


RTSTRLOYRSGCII.SKILPFPFLWIQILLGFLCDCpCASWOCMVAIK~IDSVFMSRpEHKPtK:HILVNGRRVDRRSF
FLfIt~IDISLKEKSKRLOSVKDiu.ESKDESSLpSYISLDKTCPK


NIPYITKJ1TRRGLRNKTLAYLASLKDARQLAYD!'LKDPGSIaRWtALIAPKE71L.0Et4ILGELLVSPEODOIEAp
LPLPINISVVCEPLSNRT


FFYGCSNIEDILEEIBIRPNRILLJGFSYCOKPKIw.pDGRfNDACRYDPSHPIrASCSTGT



MMRIlIARRYTZYIIpTFIDIAKHLHTLIOtRYPGYOILFAV'fACELSLKMFGDYAS11l8rLKCPf1~0771
82786) 821915


GVGIRL1GRICNfPKAFKLAERGVKpGVt'ILEEDCF~1LARTLTEYSSAPFPRDFCEINYeM



QNI'IIDtFSSNf~iF'LOCNYPODYVRVFI!!>DQfY7CALAYYYI'IRVDHPHEETALIiKIVLmL


0725 810829 810587
OVSCRIYISEpGINGOPSCYEpHAELYMOart.KERPNPSKIKFKTNHIKOiTPPRT1YKYR
CPn '


_
KELiIILLGCt:VDLSKQAaIISPOEWIiEtG4FNRCLILDVRHHYEWRiCii!
CT552.: hypothetical protein ~TLPDIOlF


SCGWGMFFAPLLYESLRRGL?BiPTSNMpOQLARLEFINDOL'1~.'ELEHVNF3.tLSLCFPEREPPEYAEKLi40EC
DPCITWl4IYC'1'OCIRCELYSPVLLEIOGP'KIYYpL00DVIRY~OIf


CLTTIlUIAEEVLSDDEPLLD
C'fCKWL.GKLFVFD~IPIDESDPDYAPIAECC!!<l~l'pSDAYYNCANIOQtfILPGIxDE


CItIQf~CCGEECSOSPRVRXFDSSRQJICPFRRANLCEISEN&ES~LI


CPn_0726 917381 810880
' 0735 825680 825007
CPn


~: _
620 hypothetical Drocsin 'Utidine Kinase IUridine
ADIOrlIYSrSISTFYKKLSLVSSMHSFAORNRESLEHI11N1fEKTfA~tDTLKFtLTEVLDQItonoptlosphoki
nasel IPyriddine


RASERYRSAVEKLlIKYEVERATVAKSIPVAAItff7IPLSSTHASVO~Yt'ASTpMTGSGVCJ1Riboffuclsosad
e KmasW


YYN71VK0KWAQDLIVELN'fVtff'1'INASVNSKNPANKDIRt>XLNZ'8LOALVAAG~.TEZNGEK!l2MlCJ9II
IGITOGSGACK'ITLTONIKEIFGEOVSVICODNYYKDRSIfl?PC10171N


YOTLYNFPEEIITAIQRACI'f?Gt;l9(TD!'iNOLAGKYG10ATLTCI'F1IDGRVEGPKDILTLIWWIPOAFONDL
LISDIKALIfCMEIVOAPVFOFVLCa'NRSKI'EIETIYPSKVILVOGILV


AVQCVLTPEOtI'IPAEIATELOAL71DN,~IPDEAGLORILDI1CEIG.itAVTNSSDLTIIFiDKPENpEL.RDt~T
RIFVDTDADERILRRMVRDV0E0CDSVDCItISRYL9lRIKPIBIBKpIEP


INFCOHITDLYSDpVAAIGSFD111LDIl4TYVNON00TllF8NLS5FYGSLTGTPJ1PIDLRSTRKYADITVIKafYR
pNVYINTLSOKIICIHLENALESOET1M11MSK


SOGDISSAAL1GALAT71ACLNSRFNELTAEOQKLINECIKSLVTPKCCENLG11IWAYFTA



STWALNP'fATMDHVKAAILEEAKELDN8SF0L~1SSIKS11111'SIVNSSGSFSVTVNSS?L



QY1'IYSEKNGKVEINOILLNYGS1GPLPEITII1.A><CNAESTARSYFRPKALAAVESaiVO



NKIZtDt.OS0L00FTN9CTELFDGOLLSQASELRALPLPSAVASYL.IDRYMPlCE110YIHET




YKKLYYSNLCSSIGNSIIDAISOYVNGATYFNP'ASYVCOQPAVCAGGANAPPCSOESAOA
KLOQERKQaALYLOE'fRGALTVIEEORARVLKDDKIII~lE0RS1'ILDSLRNYEDNINSISG
SLVLLQNYLOPLSIAGGSVA4TPLV1CECOE9WOARLQILEEALVSCLVGF81ZNOGTIEPLO
STIQSDOQSFADMGONFOLDLQLOtI.TSMpQEWI'WATSL.QLI1J~QYLSLARSLTG
CPn_0727 81)559 816192


CT619 hypothetical protein


KYYLFSMSTFSIQNRLRTISGESTRI TKLGOfYSCFDPRSVPJ1INLEELNSCIYALRilI1f


NALOSENTNVMLIlIpNNfTFp1'TSWTCfIfIWSRPOJISSpRAPSSOTP?DIVSAAARALVL


VIDGGIJIEi.VASVTEIDLGUSTISTVROLJ~IASYLCL:TLTAEQEKWfSSSYVPSEIOJL


LEHVKpENAAEIOAKQEEI1WVLEJ1KGVSTEEIEAILKEYPDIYM~'PKCFIEEPLHTYCPef-.0777 A27ti69
8)0756


RAKVCAPIOEITIENAIOLLPTPPAITPDNVNEVNGIQPI'LSTILOAIDDAIKOAPALOCDOreeC-
Cxodeouyribonuclsase V. Ganm


EIITILOTLVPLVDK1TPTKAEPDLIYTATOLpIfI'ASLKLYLTDROIAEYRCKTTXVYQNKRSAKLPASGASKAKGR
AKKKLTDERIFAPSVRVLPbNRIOrAKRNLYKLSFITYRKCV1IP


SIONLSETIIRVVENNRSIa.L'fpLSMFOQMnICFV'IWISOMIAI1~IIAITNKYISAVLTtSMSAiNDFPLTGIVI
RfATKNCRASPSNSpIWLLiWLAEDLTSTIIpKPPTIDBiILVJWRT10H


EMYOGLLCLSYMYERLIIDDEKAIFDKSVNEyLPIHIV1AGGSWVNatIAKHAAYQELAEYSWIID~IOLVHVLSDHIF
MGSTIFTASDSIVKHLPLGSGCSOPNIPDYLTLPL.LINNILiEIS


CG1'AVTSOOOLKAYCpTRGNEFKATRNPFHNICDQMY0P71NE'NFCNCLTI'ANG11I0PDLKASKPENGREPLSPP
TYEZTItiCIJtAAFKOPNTFSORPt10r7frSNY0EL1'pILESNPS1YEE


GGFiREAIfrNVCTVEADYVSNAORILNEFNCMTAHVLOL0L0IAELOKKADDLDPGKASMFTfILTBJIt?pEEDCSL
HIFCYJWLPKHIJ1EFPINLS1'YPPVYFYCFSPCIIEYIGDIJSD


F'fENRIIFAVMWITSESLGDALISMII1JSOLPKOEJIFLICPLIECINPFB~ttJIANALNSt:LORAIDFPWNOLP
DSPI%NAWF71YVLSDRQALIJ1M.IWKSOSS~FFLDREIOYOQ?LPiK


:TNEFSTTSVYYSLSSYLVOSK'LGpNLFAGDYYETLLAAAREREYIYRDTARCKOAINLVHDSSLGVIONSILDLKP1
'SPODFSOTKCtICIYRALNIPREVOVPCKV'L'ELIJIRDVfPE


NGLLOKINSLPGATSAOKOEMLNATTYYpYSLSVTLNOLTVLESLLAGLKMfipf:'SNNKEIFILSSNIESYKVNLNA
IFNPHVPIYFTDEVDPRAEDLPIdKKIGLL&SIt.~i'QDDGNYIL


YDKSVFKIESFDDWIPTL1ALESFLTSGFPNISATGGI7GPLP11?VOSI>QOTYTSOCQ1G0OLLTHPOLOOPIDQNK
VPYLIKJfLSSEWGKISSKDRASGpQMKAL&DLILECYPPIIOBCx


LNLIAlpMT1'I00EWfLVSTShIQVLNGIISOL.AGAIYSNRVSOVEVWKiTVPLIYFIQERINLYLSSSOHSYEDLF
~1VPSCLEKIFVLSPCII'SPI't't'
~


LRNSLPPTPl~$SCSLLFFTDFCLDFLLHFHKPSPLYDKpGPYICSLSSLSLIP1OCYIf!'I


CPn_0728 818187 816525
Lu'ANK1TSSDIFDLIJJRTT'fIIEELAFSSTEDEtTrFHPLQILVSTKttEWISYISSMQPN


HLPN 76kDa Homoloq tGT6221
LPSPfGHNIKE'ILDLPVE1'LPTOPYLSAFFKNKACLHTSOEYNYSLANJIlY8KKALLP8L


VFMVNPICPCPIDETERTPPADLSAQCLEASAANKSAEJIpRIAGAEAKPKSKTDSVEAWFIPTVKOyNLpOHCSLNEI
IKCIFSPLDLFLKTNYNLRISYPENLKK00KLPt~1'IDpIED


ILRSAVNAIJtSLAGChCL.Ia'SNSSSSTSRSADVDSTTATAPTPPPPTFDDYKTOAQTAYt1MECPVDKEHDLLF.'
ISPHAEELFTYYREKTILLRNCLDKDPKtISPYTVrPS8&I166R


DTIfT,~.TCtrIDIpAALVSLVDAVTNIKD1'MTDEETAIMEWITKNaDAVKVCAOITELAPYNE..~YLFPPISLSF
~\:NPVOIHGTLHCVCNFl'.nLYLCSIDPRDSLKK'!'fRTIGSLPLTSS


KYI1SDNOALLDSLGKLTGFDLL.pAALt.OSVANIMI(AAELLKEf~ONPWPCKTPAIAOSLEOKOLLERYVAL.AVL
~'M>roHL:SDSALIKLTSFtrIKOMHPPPSOP~YLRKVLlIIYNLN


VI7t?TDJ1TA?CIEKDCNAIRDAYPAGONASGAVENAKSNNSISNID9AKMIATAKTOIAESSOPIPLLSPL.CWKTL
L'DEEKFHOAVL..iAISEF.AIfHPSLPIFWOPHNRNIECILiIVCAS


AOKKFPDSPILOEAEOMVIQAEKDLKNIKPAL7GSDVpNpGITVCCSKOpCSSICSIRVSMERLKILaLFRGPCE.\t'



LLDDAENEfAS ILJds'GFR(.'N I HNPNTENPDSOAAQGELAAOARMI(AAGDDSAAAALJ1DA


QKALFJ\AtirK.IGOOOGILNAL.GQLAs'AAWSAGVPPAMSSTCSSVKOLYKT3KS1'GSDYCPn 0778
910719 97)B95


KTOISAGYDAYKSiNDAYC1URNDATRDVt68~IVSTPJ1LTRSVPRJ1RTEAt6CPEKTDpALArecB-
Exodsoxvrlt,'onuclense V. Beta'


KVt:.CNSRTCGDVYSOVSALOStM0II0SNP0ANNEEIROKLTGAVTKPPOFt,YPYVOt.SKFYLFCEYM~CPFNIF
DSNSSIOr'IIFF4EASJV~CCKTF'fIEpIVLRALI~:SL111Vls1AL


MU,STCKFTAKLESLFAECSRTAAEIKALSFTNSLFIOOVLVNLCSLYSCYI.OAITf127ASTNELKVRtKDNLAOTL
RELKAVLNrOpASL?"l"ILOINCNVKOIYNpVR1i11LA


TL.DOM.iLFTIHGFCNFS'LEpYPPffTRLLI1KNPALTNSOL'ILHNI'MYLKODLaIKNVL/QE


:an_u72w 81.1905 x19591
OFNLIJ1VRYNIT::KIf."::3LVDKLLA.TtTOPICCIF.':GRVfRLEOIw(JrMOQIYNSL.LdIP


i:NLPN 7tk0.7 fk~mnltm ICTI:':17
KpVFLDOL'~WI:.CFIIKOPF::ILGGLHfIFVDL.LYTSETIf.'LFSFFKIAETPNPKIIRWI


f'AWi.~.V:a'f.NIDTKDCMKKtWY0WCr11:WL.LALTL::rYAELtL:iPwKVKSH'I"!'f1'LDEVKYNff.'
MFfIfLENlI:W1'ERTLISF'CfILGftIt'NTLL%DL.VEYL.IIQNYTMnR.iiPDESVPALiKL


D'f L::KRf:fYETItKt)DCVLR IACIAIRARWLYFRED1.: :.~.FJI4PVl'(? ALJIF.~ Ys?f.VL
l CtIP:a)KDI!'fNPLFV NR'llt.~.EFYL'fI DEf rflrDK(/;~1.~.I F:itILF
I81'KF7~fLICDPKQ9IY
I


IffnAEPNWf..~.::KNNWfAIMX:ENT',W:VDINRAFLC'fRFYKtIFt'fKTDFFMEff:R::uLCDf~IR::AD
LI'f1'1.TAK:;::F::EWIyL'rL'rteNR."~fYIJIEAIIIpIFGIIL::pFLEIPtIYLPILY


a.iE::EVVfV:.NF'Li:LJItYWTREL:KD1'1'YrJVIVII(T:PFV111JMTKK111'A4A7VFJ:ILNRLPKII
AtNI~)::::LTFENf'TII\t'tIIFFF'frfIKOrJAI.'rIIF.~.EALPWIf0DKt1'LW11VVLVGDSH


art'tlKt.'::WLVM'PhYt~t'f:TfT:KAA'fNAMK'lK'i.:'lY7trWt.l/r:YIC;~VtNtNYJi(KPLILY
nJAFELI::'fATtMI::F::KNK:aFIt:.TE'fllIl.TT:.t.LEAIWIPENYEKILIKLIi::.~.LFf;<..:L



:AFIJ411I
\KA'PKTPf.Ir:KI~Nt.AWI'll%'PIrI:LRCM:IM::A'I"/It1'!:1'VF:~1L:VPEIDV:I:17t'VI'fK
K(77FTfYF'v::iJC:'ft.:IIIY:LIrYfF"/4'IMP!(Y:IffLF::::MH7M..IF'QDIEKLCUY


7r:Rt;NLI.1:FWF'A~yIIMM'Pf'KFrINr:FTN'fKt:F::AL'!M'h:fTf.::l.::rWYi:A'I::KPANnK
c.l!Pl::::'ft'YN~~LWII.I:NF~.:Rh:rilh:::l:LAt::::'/::I:()LETLY,LTTfIL:::IGa.EYD
IVA:P.


Ir:::dFfP'7KFl7It:1 f::Ai' I fK::KKNK.~.::::ELlJit7IYVM-PP
AYYVI.YLh I:: LOPf::IJJR::::AL9'NYVKLP%'.La:BIIYD


LA I I IIJIUFJ IPDLP 91'::l.l'K
UIK :IIL'rt"f I Jil.lLLkTFAt.Y'lTPPK'f
t YaY:::.TKPt.LDlIIK


~.'Im oW n t:t:w ::l'.r'n.r
IR:U::It>Y:.KLt'1:'Ky\~LF'~:I:KT:fLfIIYII.h::a01'::IJIf/rEYtM:
'CINHPIKHTIILOP


nrviN Irrr.Ir.rl MIYNI7t.rIN:
1I.r..7rrVEPTILKU~:A"fFF::t'LTF::::(~fF.':1.:?ryLIiIYIFttET.':FLFLFaI)h:IJ~IIr:
VIDLPPEHE


.:n:P'Kmr:I~Jtr:IM::RKIdIFI'::I
r:KYYIIf7WK'P::FU:IVfN::DY::K::111.::IYIKrrF:YI.D'frl:l1'NKAVItKFWL?FKIIKICNEt.

\It::fFNII:Y:fI~w::hf'h:ll'Itf:I
VLt'1'Ytr:ADfIVMfW


106


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923



:VIF(Yt;;::T'JIa4GFFAL::::EDIPNFNPKd:'J!.LCCDICpIIR-HTH Tr.(c: :on.tl
Rl~uaco~r
!H e':~:ean . RecelVer


Daman


':~ n~fv al19I2 af79rii KITDFILRTNSYIIGFCTNh~DK;TS.p4~'P,.DL,5L8~
IrtSQI~~;'~/I F
tI
~PEEDI:
PDTF
'
'


xfrnrrca~ E
a: Dr~cein SII
'T Ir;w t E
QKVLJIpGR
lCYCL
ESVAIPCEYG4LPE0lF6P


ryf
OAVIRAFLRQNEt~tENSIPD~ffTfG011TFRVLNLVIESPEGS~~I:.TPSE71GILICKi.LINR
. ~
':KVLFKLIL"f:LRNKI!TK:,~..'t:IIALCII-iFRSLFQFIYOKIRS3FVSLNVKFFPKIKO


AP.~.:dlt.rWLEi.ENLI~Y.ER'!.V~LCEKLKLYEV3NN1'PPLFPEiLTPYFHKLVEGKWYRDaLRNKLGPYGS
KIWI14.'IfCYLFSDDCSIP
GHIGLRKNLLAEIKCKfKE:IARNVDVHRi


'fTlB.ISSSCwr/NV.KTIi~IKKN.iPVL,SCNVLIf::L'IDYVCEItOSRIRLITDVCMKPSWAtIRIANtiONT
ANPNEE


~.D I7.':W41IKI1.".LREL I PQ11EO
IaHAY TLEKDN': K I SOLOELD3LI~EGFNQALLRCIL. ..
.
~~
~


.. N,:;~~: . . ... :.::IcTr,:.l:: , .
..:aF-f;:...,.,. r-=:'.,';,as=. .;,,-
~ . . . ~ y
3
1
....
.
.


:':;.::.::. ,.,
.;: ..::::a::.:::.:..:..::.trc.lzw;....,
,.
-..:: ...;,....".,
MFRCILFG1FL:.T:Fsa.;~:::. Y YLft::HOf:i:GPKEKaRSVW
I EEEKsFTDfVLNIILPSO


0740 97b054 874861
HONLHILCFOCFLTrOK,70KFS01EKIFSK4Y0E7vODCpfLFKEEiLtSRLINSFFLKTD
Pn


_
VlIETILCLLNORCPNSPYYHLFNALVCYKOKLYA>:lfTE0ir1Y4pEEKTRALAPLtI4ISIE
cyrB-Aranacae M Nnanocranstsrase


SYNSFFNNIPTFSPOiIILGLCNVPfADKRPEIfVNLVICVYENPQKRY0GL5CIA1UQTVIOLLTDFLLDYISANSLI
EOKNFPOGRVILNRNINRLIJWECEWNAKTYDRIAILLSIIfYf


EEEQ14XSYLPISGa'.QIFLDCIRCLVFCiJIVDPSAIVGFQSt~'l'G71IJII.GARLLSV11KGSLELV8SK511
DIYFOYYENVLfYLKKIYIGEpCPYAELLPLEELVSLINENVfILPI~C.Y


GKVYVPEpIWSHtIIRIFSOECLEVIRYPYYSKDQKpLLPEPGIAfLKEYE1C45VILLHGCPLIQLLtiIYlQKHYVN
PNSSLWOILYDRfSTNNFGJ1IRFCE71LVSFSGLEG.IKaQIIZTF


CHNPICVDFfE~IWKEIJ1ILHK8RF3.IPFFDI'AYQGFAHGIELORKPIEIFISDCNTVLVfEii.&NIfVGOIICI
EEAKOC'/A:.LHILDPS:SISEKL.rILSSD':LQNIVSCDO00lfrKLTINY


AASSSKNFALYOLRVCiFAVHSTF1'DELVICIHSFLEfltIRCEYSSPORWCVEIVSTILSNLDLWF~1IOSYDI~tC
OLVHNLVYGAKDLwKKGrh'DEKrI.N'..;.;L\:.RFTSYDIDClSWF


PYLKEE110SELNFIRESLGKNATRNW14RKVACNTFDFLLSOIIGFFAYPCfSDKQVLFLLFIKOAYKOALSSH11IA
RLLKLKFISEANIPSIVISFaEKANFL1D11CYLF11NlIDYOKC


REpHAVYI'IAGGRJBJLNCITEKMIONVVOSFIQAYELYLYSHWLTKVAPSPQSYRLAGLC
LMENKRYDEALEFLCNLSPNDSIh'DYIITOKAiaPICQK


NQSKDRAAS


CPn_0741 838387 976185
0752 818595 850082
CPr>


greA-Transcription Elongation factor_
ItIFRLK'1GI::.fCYLEKIpb~IEEGOSANFLSLWECYCFNOWI4GRELVEILEKVKSSSL'recD-
bcodeoxyribonuelease V, alpha-


ASLFGRIVD'IWPLwETCIPEGKDKDRVLQLILDLCfSNSONFFDIATEYVNKKYSGEENF4WALlffEFAPFLEDLVN
QQVISPLDIAFASKNISSDFEESFVFLWSSAIiiRYORpfiSL


NEALRWCLRDGRDFQFSLSRFDPi~ODBIKGNlVFlIQOGWGVG)l9lCtfSFLQOKVLIEFEEPI~IRIRPSLGCIS!
'IDLYRGPIOJLPKtARDKLFVWSCRLYLRSLYTIRS1ILLDKLBLLC


GINSAKDISFE1'AFKSLTPLSGDHFLSRRFGOPDGFE7IfAKENPItINILLROLCPKTASATPNYfPPSIDSSILSE
EDNFIFMCITOCCFSIVSCfiPGIGKTFLAAQLILSLVKQpPK


KEIKDELVDLVIPEAOWNRwNOSAIITKIKKGTRIISPONPKEPYVLSDiII'sCStPIGOLERKLRIAIVSPTG101T
SNIRQItJOfYNIFDDNVIlK)'fVNHFLQEYAYRRYNSIDVLLVD~1


G:LSLNSAEKISLIYHFIRDLJiSI?.IG~tIEIRIISLVKALODLDN60(rTdCSLILORELLLSEVTPOLLYSLVpT
irpCYEKDK1Q.YTSSLIILGDTtIpLPPIGICVCNPLQDLIGYFNFXfFF


YL.GIKDASI~CEYITSLSEDDTSRLLEI~BIPIV71LQKSFLSLVRKYSSFWQQVFlI7ILLYTLKTSNRAKTCVVOO
LTOSVGRGOlISFSPLPSISSAIEVLIQJRFVKSLROSGttICVt.TP


TSPII9tDFVYKTIlQ4DPSSVEVLKKRLLDSAHQPIttIFPELFVwFFLKIGrBtEDCLFDPEDMRIICPNfVLM~rI
ltINORWiSDPDLRIPIMVTSRYETWGLFNCDZCLi.CLKTQNLIIfPO


KEVLRLfLESJILNfNYOVASI'PNKELGKKLHHYLVCORYLi(VAOllIOGIISLPPLKELLLLNEPIDSitALSQYV
nIYVI4SVNItSQCSEYDEVIVIIPKGSEVFCVSILYTAITMKIfRVSV


STKCPOFSSSDLNVLpSL7IEVWPTL10WKSNVEEFiiVLwSTSESFSRIOIAKT4SLVGKEwGDpEfLNKItKtISNf



NVONJ1KEIEDJ1RSIGDLRENSFIKFALEKRARLpEEIRVLSEBINRARIL?IIDLVF'fINN


GVCCKVTLKGD~AGEVVEYTILGPwDADPDSCIISLOSKLILQr8QGK10.tiDWItQCKEY1CCPaL0753
851099 850161


IgRIQSIWEEl~1 No robust homolag prssenc in Gensbalak/ENBL
as of 11/Yf98


IM71TAHLf.RQALLNLRSfr1'PAIR7~1LFRQQSNSLI8~8iVLF7IGDIVCAIKNSTAISRIU1


CPIt_0712 938442 878888
LGSSHYANAALQKTDGFLCMOGVNI'J1VJ10ANLWI'.OLItJGS!(IfETDEE'DGCLRRC~J1D


CT635 hypothetical protein
AE(x?lfOu.TITGINARIdSKi'IGTATFLNEi4tiWSLGrWJWItIQC>M'SCLNI.


TKNMVIVI4VSIISAQKIIDSIKGILTIYNIDFDPSIV~SSLSSDSDADYEYLITKTOEKIQEVATOCSLTESSISLYA
ILSTRPITISDpENPNKPSAEFAARSt(AIWiIIPIAwt.GOWOLV


LDKRApEIL:~SlSXIFAMiPDNFSPEEWL71LEKVRSSCDEYRKETENLINEITLCDAI4TLSLFLPAIT~VLIMAI1
GLISCVINFVIfDYJIKIG


DLHpTKESKRPlCptaSSTKKNKIfKNWIPL


CPrL0754 851781 851040


CPn_0743 838956 840761 rs20-520 Ribosomal Protein


~nqrA-Obiquin~fe Oxidorsductase.
OFILId.XVLVLSCDIIUIPKRPH1001VI0RRPSAEItRILTJ10KRELINNSFICBKVKTIVIOt
Alpha-


IFMKITVNRGLDLSLQGSPKESGFYNKIDPEFVSIDLRPFOPLSLKLKVO<XsDAVCSG71PFFJ1SLKLDD'1'QATL
SNi.OSVYSWDKAVKRCIfKZxtKAARIKSKATLXYN~MB


IAEYKIitPNrYITSNVSLW1'AIRRGt80t5LLDYIIKKTPGPTSTEYIYDIaTLRRSOLS


EIFKtNGLFALIKQRPFDIPAIP2'OfPROVFINL~PPTPSPOIHLALFSSRHEGFYVCPrL0755 851579
852799


P1IVCVRJ1IANLFGLRPHIVFRDRLTLPTpELKTIAHLNTVSGPFPSGSPSININSVAPITCT618
hypoehscieal Drocsin


NEKtWFTLSFQDVLTIGNLFLRGRILttEQVTALaGTALiLSSLRRYVITTXGASFSSLINYKDLPFIa.LLVRKWGN1
'CfKYWIYFLPWi'LLLPLV'CYPFLSISOKIYC1IFVFITIif~1


LNDISD27D'fLISGDPLiGRIG%KEPFLGFRDHSISVLHNPTKRELFSFIRICFNKPTFfFALMRC~IOLIITMVGLL
QTKIRKLTENNDGLRQIRESL1CEI~Q$SJIQIQIB


ETRPIII1TDIYDKVNPIBIIPVVPLIID1VIT10~ffDT.ALFIQ.OGLLVKT1C0OOKLETLLIJtRTL~IRCLIDI
pVpSLIOECGEKTCEVpliSBtIQ.ALT


NEt~GFLEVCCEDFALP?LIDPSKTE~Il.TIVKESLIEY111tESGILTPNQDLAY001lIl4DEYQA1'FSDORNIQ
.DKROIYIGKLENKVQDLMIEIRNLLQLCSDSAIUIIBQ


CSN7IYLGtiISLQLSSELKItIAFKAtZtIEAASSLTJ1SRYLHTDTSVHNYSLEICROLFDBLR


CPn_0744 811387 840389
EEI~.FVYAROSORAVFANALFKTKn:YCaEDFLKFGSOIVISGGKQwIICOt~lBll1E


ham8-POrphobilinogen Synchase
CSGRLVIKTKSRGNLPFRYCLNALIDfCPLCYIM4vLYPLHKEVLOS


ENSSLTLSRRPARNR1(T1N1IRDLLaETHLSPKDLL1PFFVKIICYBJIKEEIPSLPGVFRWS


LT7LLLKEIERLCTYGLMVNLFPIIPYGSYSSNPIOdILCIISIHEIKNAFPNLCLCPI1~0756 855889
951676


ISDIALDPY177K;HDGIFLNGEVWDESVRIFCNI11TLHA~GADIVApSONI~mGRIGYIrpoD-RN71
Polymerise 8igmw66


RSKLDOSGYSKTSIMSYSVRYASCLYSPFRDALSSNVI'SCDKKQYQt~IPIQM.EALLfSSISYLPLTKLSSKARNPL
VLFWRIQ.FIQlf(SISQJ1TEYSSEEESOKKLEELVALiIKEpGFI


LDEIDGADIIlIVXPIYGLYLOVIYRIRONTCLPLdriYOV5GEY11NILSAfQOGwLDKETLFTYEFINEILPNSFGC
PEpIDOVLIFLT at'IIDIOVI14QIDVERQKEKKKF~1KELEGL.ARRTE


NESLIAIKRAGADHI
ISYSAPFILET.LIiOGFEFCTPDDPVtINYLKE~iTVPt.LTREEEVEISKRIEIGOVQIERI
ILRFRYSAR671ISIANYL


ISCKFRFDKIISEKEVFDK1'HFLKLLPKLITLLKEEDTYLCJLLLa'LKOpDLSKOBRJfiG.


CPn_0745 941903 841742
NDSLEKCAIRTQAYLRCFHCR1WVTEDFCEWFKAYDSFLHLEQpINDLKVR718RNKFA71


No robust h~colog present in
Genebenk/Et~LAKLMAKRKLYKREVAACRTLEEFKKOVRMLpRWNDKSOEAKItEMVESNLRLVISIAKKY
as of 1117198


VDSCFDI%4RJ1.SSLOGS i":
YtTfIIYDPKHTLaYGFCNOVSVIfItFHLKPPIISOEKFL72tROLSFLDLIO~'BIGTJIKJ1VEKFEYRRCYICPS
FYIITWwIROAVTRJ1IAD0ARTIRIPV


HNIETINKVLPGAKKLtMETCICEPTPEEL1EELGLTPORVREIYKIaQIipISLOAIVCEG


CPn_0746 ' 841979 813567
SFSSFGDFLEDTAVESpAFrITGYSNLIIDKt4KEVLK'TLTORERFVLIHRFCLLDGKPkTLE


~'632 hypochecacai protein
EVCSAENVTRERIRQIEAKaLRKIWHPIRSKQLRAFLDLLEEEKZGTSKVKSLKSK


FSGRCPFSFEVFMLGKEe'EF':CKQKOCLSHFVTNLTSDVFALKNLPEWI(GALFSKYSRS


VL,:LRALGLKEFLSNEEDCDVCDFr~YDFETOVQK.IADFYQRVLDNFGDDSVCECDGAtILACPn_0757
851709 855131


MENVSILAAKVLEDARICGSPLEKSTRYVYFDpKVROEYLYYRDPILMTSAFKDMfLCtCfolK-
Dihydroneopceran AldOlase


DFLFDTYSALIPOVRJ1YFEKLYPKDSKTPASAYdTSLRNfVLDCIRGLLPAATLTNLGFFPCIKNIALVIAIERYOLI
IaKFRIIWLFIGCSVEERHFlIOPVLISVfFSYNEVPSACLSDK


~TIGRFwQNLIHKLOGHNL1ELRRTGDESLTELlIKVIPSFVSRAEPHHHHHQAMtOYRMLLSDACCYLEVTSLIEEIN
yTKPYJ1LIENLANELFDSLVISFGDKASKIOLEVEKERpPVP


KEOLKGLAEpATFSEE~1SSSPSVOLVYGDPDGIYINJUVGFLFPYSNRSLTOL:DYCK%NPNLLNPIKFTISKELGPS
PVLSA


HEDLVQILES3VSARENRRNKSPRGLECVEFCFDI:.ADFCAYRDLORHRTLTOERQLLST


IiNCYNFPVELLDTPMEKSYREAMERAMETYNEri'0FPEEAOYINPMAYNIRwFFHVNARCPn_0758 555101
95645a


ALOWICELRSQPQCHOM'RTIATGLVREWKFNPKtELFFKFVDYSOIDLGRt.NOENRKEtolPfdhpS-
Dihydropcaroace Synifuse


PIT
RANSEPRFVCLSLC3NIfiNRFKNLOIARTLIGEOAVLGLRSSVILETE,IItLPGSPPaiD


LPYFNSVLVCETfL:LRELLVTIKOTENWCRAEESPPwSPRTIDVDILLYCDFSPCCDN


~Pn_074'i R17o49 841057
TEITIPLSNLLSRPFLIAGIASLCPYRRFCfOCSPYHNFTFGEIrLIHLPSPPCMIRRSLS


:T6J1 hypothetical procain
PDMLNL:WNVTNDCMSOCGMFLDPEKAVAOAEKLFTECMVLDFGApATHPKVIOpFLSV


RTCMCCKCAEVOILSSRSLSCMKILSGSLFYKKFCDptDiERLEPVLRLLKETWSNRKOYPIISLDTFYPEIILR141D
IYPiQWINWSOCS08NA


EVARDCEL.iLVNIaIS.i.~nLPS'DPItNILSF3VPIGEOLLSWC:EKOUWFSDVGLNANDI/iFD


:Pn_p74R R44 Pa6 944121
PGIGtGKGMQSLATLYEIAKFKRLGCPILIGHSRKSFL3LFf;IJIIDPKDROWEIVCLSIL


tripA-.ar~nyt Transcranafsrase t.OpOCVDYLRVHNVAAHQK,1LSVAACFJICAPt


r:TLfIJfALCI"f RP~ L ESA I EKALECFCP
ICNP IRSP'/EYALQL~CKRLRfCLVCNNAQCL


I:LNiIDVMOS.1LAVEFVIIT~'TL:aDDLPCNONDDERPr:RP7IMKAFDFATALLNran X1759 aSriA i4
:156~n
:YALIPA


AY!:IILRLNAICKLKEQGCDFREIDIAYNIICDITDKIIICCGGtIi.CCOYDDMFfrNRGOEHVfolA-
DihydrofrrLacr R,xlr,.;carh


_yIMIKK'1t:SLFELW.'I::(:GILFf.~,;Dl'OFAPTIT..'.F.~.NNFr:LLFOIKDDF3DLQKD.SQOILLV
KPVIIPGNFF-NfU:VGI~KIIW:VPr;I'!M.'DI'ttf;JIr:LEGKLf'WIiYIEDLOFF:ETIOK


:INlALL.fC;EIG\ALCLI.aR~~MJI-
.LELLDRLSA.i.:LYt7S.~.EFETIIC.~.IGFFPIVNGP.KTWETLFPKYFI'LIRA'/'.NF.~.IIRKRI~:VH
r:EIWVT::LfEFLU.JSIL
:PTFLIf7G


r:EL'I::LFLENOiVPGFFI::It!YYElA':VfFFt
tt:LLCTWTKTVI.TtL~f~W ITTt.-YYEMIIIR


'Ir _117A" I~.r.IN Ntr.0(Ir. VM'Klli::L


rllmll NUf-.;Ir:lIM: lrrruly~rntu~rYLil;r


Vc:YM-
lfA::::IF::1'F:Dff:fl~t:II::KAlIY'fWOIt.hLtItIJI4LF.NiiVF::c:llIt7TVF_'x.YfLKNr
:(r:47.:n ... . .,,')r.r.f


(EKIEfAEI'AYVi::'r:AS'1\'r:ft'.IG:::V'rEVRllt,1'!Li!:NVI'Ivl::R('W(:HCPELKNSYLG
t.'Pr.ll hYfN'rnr,r.m.RVrna..y


HIrrKAAllli\'ilr:G::Vt::::1:VNG:At:VR(.'ANFRLLY:PtII'Nft:'r::UK.~.KKIO?r:P.RKLG
AFRFK:PKLCLEIPKP::r~IVTHRIT'I"IKTIYfYf'YUDLI:aLG:::Lf'KLNRf!::IWI'f::KIV~


~ r :Kr:VA IrWNVtI f Hiv;~'I( I n:4T:AWELEKV::YLELtKylilLA'f V I-/F:Kf
Lf'IITR I Rh.'UV I ; I'lL'fY.K4a:IL I f~.: A. a
lut:::llVlY:YYVLY


INUFLL.':VNTII:fAIIJiNFYlii.f:IN.%:f
I I::Ir.:lft'ffLHRt;TNf;G:I~'WtJr:l'!'i'I:INYH:Kf


~'Iw- /'.n n:.tr..lr.: H.tn.'In"
D'.'lriRALKM'Y::NLf.UI:I.:AAA'/t.t'W:1:;fU:trPlIAIIEFJIFKITFIC:::fTrL.~NIM:'fLA



f Af:IYCDL'fr:Pt.LO::MAWETI'AI't".:


1~~


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
NU1KRE9TLWVHE:LLPK: rLCKLPAPYP4,:IKCfAECL::F11E.:.:FPAIE1KAVA


3DR.TAKCIP1'AR
APLDIFPLKHLFPR.fl10D53HSKt7~IVLOWIR(,'IifATEC7TPL,'.C:rJI


~Pn p7wl ss57.tH 858775 ,
RTVAKYMOLKILtAIIKPKItDtI,:I~i~t~tPROA


:TnlO mn.'.'rnrattc.)1 procmn n . ~ . ~~ ~I


.Ilrf:ldfELLDKOIEDOHHLKHEFYORWS~'.lfLEl(COIQAYAKDYYUtIKAFPCYLSALH972100 d'016J


ARCDDLOIRROILt39LH0EFJwCHPNHIDLWRpFALSLGVSEEEiJINHCPSOA~ATFCPn-0772


RRI.CDNPULA:::LGALYTfEIOIPOVCVEKIRGLKEYFGItSAIIGYAYl7IIHOGDIKNASuvr0-ONA
Haficasa
NLGLLIfI'CISELtJfa:RKAtffAPI?1PVLVLV;.1~K'iAVt
: fRILHLIPJpCIAPREIIrI


''eEKDILpT;.~.~RL~1IPDAVL.QG,~,QE~fLDfLLdiFLaSFINSfEPCSCKVf!'ITpfAARELtfEP:'.'N
pr:Jl..'-'~NEFDVPHVCTFHSL..'VFTLRRSINLWR~'INPTIYDOS


..., '::Jf~..'- .- -.;-~.~KK:.
. "
~


. ,.. ~ :I:...':.\;.:'I".~'; ~:'c..; , TIIIA'::".~
: a":.:-:K~y~
v


.r .
, ;... c ...... ,.., ;:r.. . .;:
NVFASA:DPCOSLY::wnrJAMLHtlttlfFGYDYIMAtCVL:LEFNYf4iYCNItl'IAANAI,IIWNA
:. t:r "
~.iw'~IIETKRSIYl4'ILPDRKK1LF~1AVAYIEKO~GS~(~((S
~ELLYFSRFSEKWJY


. fIIKUtDICIPYfCINSQS
ALGIHGVPKGRVIEIFCPESSGKTL'LATHIVAWWIO~fiVAAYSRLOfEIRSVKGPGEKIRLFI&SfDREEaIDtYAA
EILOGiRVv
ATHEISTIK'IGALSLDL '


. GGLSFYKRKEIOOILIFIJtIFISKSDIVAF0R11MLPKRGIGS
LDAF3GLDPSYASLICVNIDDLIIISOPDCGEDALSIAELLaRSCIIVWIViDSVAALVPKRTFCDA4LRRRIPYE.a


3ELECDIGCVMVGWARI91SOALRKLTATLSRSOTCAVFINDIRERIGVS!'GNPIT~fCG't'fIFJILlCYAIAOGL
PILKACOOALDTKDVKLSKKOOEGi.OEYU~I.FPOImHIYtfILSLR
'


RAUCFYSSIRLDIRRIGSIIICSDNSDIGNRIKVKVAIDOCLIWPFRIAEFDILFNOGISSAfNLFlfFLDDIXXu
DPIG4VVRI1GYLEiLKEDAOTFKDRKSNLEELYHKALESECONPK
'


CILDLAYEYNIIEKKGS1JFNYQEKKLGOCRE!'IREELKRNRIa.FEEI00tIYDVIAANKIJOSGfOCLEFR'rSF
VCLEEDLLPHANSIGGTYENIEEEIIIILCYV
SOODIiQLT'!1


CITRAODLLYLTAAQVRSLWCTVRlBIKPSRFLKEIPKDYNIQVR


TPSVNANlTPOEVPApIYEA


0763 860520 859972 CPeL0773 872185 871195
CPn


_ unQ-Uracil DN11 ~lycosylasl
yyfA-FOrsyleecrahydrofolace GyeloliQUe


NFPKfDPKIEKSALRKLPISIRRDLSEERID1EASS11VASFVRSFSKtSWL.SPYSFI~BtCItlIONATIDDLWS51
CECLPLC1~IREpLKEEWSKPYMpQLLIFLKOEYKEHTVYPEC~xVFS


ONOFaNRILIOKGTLALPKIDQ~Ii.YPVLIPSIDDLISYVHPImPFSKOTPISSDEITI(VALRSZPFDQVRWILGDD
PYPGRGQAi~LSFSVPECpRLPPSLINIFRELKTDGGIiIVHIi
'


:.VPGU1FDQOGYRLCYGHGPYDRWLAQHPYPSIRTIGIC1~QKIDRLPOESHDIPL$QIJ1AA
GCLOS1;IANQGILLLNIIIi.TVRJtGEPFSHAfiKGWELF?DAIV'ffa.IDFItTHIItViJ~li


RKKCELLFNSKHOHAVLSSPHPSPLAAItRGFFGGSHFSKINYLilIlCI1'IKKPHIM~i.P



0761 A61819 860521 CPn_0771 871183 873125 .
CPn


_ Cf606.1 hypochecual protein
CT618 hypothetical Drotein '


GYKShmIKKLFCLFLCSSLIANSPiYGICICDYEIQ.TLTCINIIDRNCLSEIICSKE1U.KXITOOLPSAECMPSVAN
LFJ1DFLAAFaLL
LFJ1P101DCIHSVCFQKTPRLTAKSWSME<Qi.


YTKVDFLiIpQPYOKVHRNV1011tRGDNVSCLTAYR1NOQIKOYLECLI~BrBfAYGRYRt~IIiIMADIREIACCLE
OSLATLVPSE


CNIKIpAEVIGGIADLHPSAESGWLFDOTTFA'sILFJIAIVYE14GLLOGSSVYYIfIN
01~5IRYSED$EEWLIIAtEEY(( 0775 871010 873111
~CF CPq
tiY
'


Q -
fYTSSGKLI yODY family
GNIWICECPYHKGVPQGKFL
00
FGRLL%AEYLDPOTNEIYATINEGNGIOAIYCKYAVILTRATYRGEPYCKYfRFaISCI'0


IVQTYNLtAGAKI1GEEFFFYP1'fiKPI~LII~WII~ILNDIVKIWYPCXaTLESQi~.VlllKER11BIIYIASSHC
YKIRE1'RTFLNRLGDPDIFSLSDFPDY1CLPQEOGDSITANitL'II~IIi


KSGLLTZYYPflCQINATELYDNDLLIKGEYFNPCDRHPYSKI~CCIAVFFSSAG1'ITICAAMQ.GCWVIADOL>Q.R
WJ1LNGLPGPLSANFACVGJIYDImItRKXf.LD3J(SSLGRLVDRS


KIPYQDCKPLLN
AYFEGCYVLVSPNGEIFK1'YOICDGYISNOF3fGSSGP~CYDPIFVKYDYKpTFAFi.B~


. NDVSHRAKAWKLi1P14.OSLFGOILLTRD


CPt>_,0765 862415 861801
CPtL0776 871180 875187


CT617 hypothetical protein ,
TfIYIKLLGRLIItfi'tISILILSFLSLtSILPVLAITSNHVKISORWSDWSOILTLKVIRCT605
hypothetical prosein


DHELDVI%fOIARISImRNNLSIF.~~LI3ASCKDLRPISRFRDtI.l~OfiliSNSLL.71QSI~VWERFIFVL1DIP
YDCLIJ~'FOFLSf7fBOfIFYSP1ZLSCIFPYVCCA0Nt1i0LDRIFS~EY1R


tAALEKSNHOLVWNCE0tfi01DFAFVIit.E0AT0~'fEDIESLFSLFNPt7IPVAPLVFFLCWCIOf~CIALI&HBA
AINSI~ODJ1LSVFYSRK(tDCfVEILCTLF31CYYCiITPfiTVWIDPS
'


IQ11'KplTPl.GNEVWLTHAEAISRWI YM~IpIVKA80tY~LI
RYRE.RSySLYCVKEVPffEVAINCDVFVYDVpDIGVRSYSP
'
'


YAPN11LNWIPIB(G
i~11'PGELALFFIOII
VLDRPNPIGGRIVOCPLPNPI'fSCSLIIPYCYC


0766 863785 862391
~B~fl'P'DLIGLItrhPTSPOhPDPOSPFFriNITGILCAL81fJ1SIG1f6YTLPPKVIGAP111
CPn


_
OGONADCl~l4DCIPNLFLPFFYEPFPCKYttI~~'1CSCVLLVLODPKIFYWETOCZI91C
CT616 hypothetical protein


AMIFKLPVYNICLTKAFldJI'IKIAILQKTCIO.~ftIPDGRtI'tSLP)QiYFA71PT1'FVLKALYPKQVEOTLKS
IERIPARIISSICHGPGGDEFLSISHKERYIVIIPWtLCKESRES


SLpCSDILVKSSSSSL101R10iILKVALTHLF~ISLiILPWESLIVOPOIGKPIDRaITPLTLFHOLRS~LLSEY~


WIAqQI'fLKKELSFLS0110IFPDfQ.SCRAADIFFL7100SPLK5LPAYLLIYf7GSEEYCCI


Fv10W01IAVlIRSFStaISTIdfSCmIHATWYIQETPPOlYi.PAINVAOISPM.~(ILEOKCP1L0777 875586
877178


LSLPLWCOS!!1'YGVEDEDWEIYGDfIAAAW~J15RRPLTfPYDATSVSPAA910Rit~tSQroEC~2-Mat
aback Drocein-60


SLLIGKYALJ~171TVWSIGSVLKLKSLSSSASNHFAF71GPEP1CVL.PRSLKAALKTVKAIC%TS
TKAVfPAICPROYNWIKIO~iAPIVLT1~RI


IOiSABNYPLLPTIPTSEp'1'LKFLUILGIiSSPSIRFSYF8YIf11'SYPSI~I1PSLPYSALVEAKEIIf.ODAI'
6SLDVKGKFaLLRWE01GDCS1TALWIDtILITpGL10GI1tADt.DPQEI


VIOCOOQP~IPQFLKICISSNPIa.QIiVSFSLED0R5f~L0!'ftSSKAGILLSVDNYQOLOROAIQ.OS
ATVISD71D(JODIIPS


SIm9GISRTROI~IfKSGYLSDYFVTRPLTImVVWEEALVLIL4IIBl.VSLfE~.IRYL6


0767 863878 861177
LIB'111lPLVIIAEDFD~'1VLATLIIHKLRNCLPVCAVKAPGSRE4110VVL.BOLAIL'1G
CPrr


'
ATLICQEBB~ICEIPVSLDVLCRVIWVMITtILTPTFLECGCDAEIIQAR~.CIJ1IARST
CTblS hypothetical Drotein


NIfC.SYLi.RTAINVYSFLIL71YIFASWVPDCOSARWYQLVSXCVDPFIIiFFRRE1IPRIGFSESiCOELLGILAI
FIGSIPOVDITADlOTEpROIQFOLPSALMTKA71l8mCIVi'~OV


IDPSPFVGLLCLGILPFVILRVLRFIILiIIFNSPWLLQYLAF4RAANItIEVPANiSSCrtITPGFEPLL011VRTPL
KVLaQNCGRSSEEVIHTILSHC7PRF


OYIKiIn'DZ'FEDLVDAGICDPLIV1TSSLKCAVSVSCLLLTSSFFISSRTKT


CPn_0768 861114 865161
0778 877100 878092
CPtL


yohI/nir3-predicted oxidoreduetase,
YFSFSHAAPIFI10fILLRSSIVYAPLJ1GFSDYPYRCHSALYOPGI~1FCFJMCVECILYAPtsa/ahpC-Thio-
specific Mcioxidanc
fTSA1 Peroxidase


ERTSICLLDYI~f7~PIGAQLCGSNPETSCFJ1AXILEGiGFDLIMti00CPlWCITKDCSGAPVApSORVPGYEPGCO
RFESSLVRfB~IXRVEEEVPNILSLVGKFJ1POPVAOIUMrGCICT


SGLLKTPEi.IGRILDKIINSVSIPV1'VKIRStZiOHEHItAtE~'VRIIRDAGASAVFVFICRYSLImYLCKYVVLF
f~fPKDFTYVCPTFLTIAPODAIL'aEFH'fRGAEYIGCSV~IJITIIOOWL


TRAQGYHGPSKOEYISRANAAACKEFPVfaiGDIFSPEAAOAIQ.TTGCOCVLVMGTIGJ1ATIO~IECITYPLLSD~1
CVISRSYHVLKPEEELSFRGVFLIDKDCIIRHLtMrDLP


PWICKOIDDYL'1"fGSYEKIPFIKRKAAFLENtQtLVEDYYOSCfIfFLSSfRKL.OGNYLISALGRSIEEa.RTLDA
LIFFEfNCLVCPANWIIEGLRANAPNEECLQ~P4TID


AKVRFLRSSLAKATSYpEYYOLVNDYEFJ1DDSSLEIF~tKG
CPn_0779 8'!8502 878095


=Pn_0769 867763 865121 . CT602 hypothetical protein


_opA,DNA Topoisomerase I-Fused
RFDLIPOIOCPNALFGEiEKGSYDTAYFCRSLVDLHNYLCDVSSPCI'IL71IKTLLSDYNV
to S1'II Domain


SIOGPIfJIIRIJOfKSLIIVESPAKIKTWKLIGSEFVFASSICHIVDLPAKEFGIDVDHDFVYIRVREDGYCVDSYFF
GLHF'LNiQZ'rLKNIIAICLPCVGtIpHIIFJ1SRSLCOKWrSLLL


EPQYQVLPDKpEVINHIRKLAAKCEKVYLSPDPpREGGIAWHIANOLPDSPLIORVSFNFFDlIDLYDLLTFNOPF


AITIWA'JTEALfHPRTIDMALVNAOQMRLLDRIVCYKISPILSRKLODRSGISAGRVOS


'JaLXLWDREKAIDAFVPVEYWNLRVI?IQDPK'l'!K'I~IAHLYAVOGKkWaCEIPECKTENCPfL0780
879211 978591


DVLLINSEEIWtHYAELLEKSSY1'ITRVEAKAKRRFAPPPFITSTLpOGSRtIFRFSWpap0/ami8-N-
ACecYlmuramoyl-L-Ala
SR Ilmidase


TN3IAQTLYECVDLDSEDS'hCLITYMRTDSVRVDPEALTTVREYIOCTFGKEYLPEKANIIHGNKIAVOSLRFMiAKL
SFFILLSLLFSGIDCSP.LtIAAGRSPSLOCYtaEIEDISAKUS


YTTKIOffpDJWEAIRPTDINLTPDKLKNKISDOQFKVYNLIWKRFVASOITPAIYD1'LAVHEVtIVHLSERLDEODS
KCOKWTAAKPEfIJvOKIRELESGOKAWKTLJ1VI9TSVKDtpi


OITTCYEIDLRASCSLLKF1(GFLAWEEKODDF3'IDpEEDIiPLPPGHApWILIKEb11S0E0NWSKWEIOKDHRALW
OLRLVRRSLLJ1LVDS=SPGAYADFSDPVPD7IYIVRGGDSLS


:,FTI!PLPRFTEASLVKELEKSCICRPSTYATIMJKIOSREYITKt?JORLRP'l'ELGKII50KIAKKYKLSV1'EL
KKINKLDSDAIYAGpRLCWPNKQ


r LETNFPR INDIGFTAIIIEDELELIADNKKPWKLLLOEF1VL'fFLPWITAEKFJ1VI
PRI L


TNIECSKCHKCKLVKIWSKNSYFYGCSEYPECDYRTSEEELAFNKEDYAED'fPWDSPCPLCPn_0781 879851
x79199


'.,t:VMCVRtICRYGTFLGCEKYPECRCTISINKKGEEIEpEEPIPCPAIGCNCKIFKKRSApat-
Pepcidoplycan-Associated Lipoprotein


YNY.IFYSCSE1PECSVIGNSIDAVITKYSGTtXIPYKKKTPrIO(KSSAK1TKMRTPSKKQNCYRSRRKTVPLLG:FP
SATDIfFlIT!?IIHSLWY3.C'fLLALLALPACBLSPNYOWEDSCN


r,Y.AKSSVKKSSEKKTGPLFLPSPDLJ1KMICNEPVSRGPJ1TKKIwDYLKEHOLQAPTI4KKtTCHirtRRKKPSSF
CFVPLYTEEDfNPNITFGEYDSKEE!!QYKSSOVAAFRNITFATDSYT


LYFOM~tLAT I ICPNPIMFOL,iKHLSOHLTIfVSNDFSSASSI KCEBNIJ'I
LTNLVHYNKIWPKATLYIECHTDEW:.AAS'ItILALOARRANAI
KE71 WIKQC I


W RLo~TISYCKEHFWSCNNEWW00lIRRTEFY
IHAR
.


;Fn 077p 868722 ar;9lll


T.42 hypothetical protein l;Pr:07R2 PPL077 87977?


KFRTRtIVEKLEFVTCL.~.SPDDDLITFNKOGLL.k7PEEEKVAFLVRSN1WLD:CPETPASFmlb~f~'tty;:.tc
clt.trida transporter


rF..:IJiEUFDIFPEYVEVLY;:NECLDVWFrICC."111ILt1ttElffIOLRKHHRKASRWiL:HYSRDr:l)tr:
MLROI.ef'VVFFFSFA.:LWAEELeIWR~EItITLFIEV::c'.OTDTKDI'KfOKYL::.~.L


t:/trrVtFJIVIIAVRHKFtfEPVFE~r'VWYUTSRWf:yiRRFFr:PLFR~Pt:ESYLLLFFTILCLGITRIfY:Y.
DIAfa:Df'.lw'I'rAA::KC:aSSFLAISLRLNVPOL3'P/LWa;:KTPU'fLC::aTI:~II


::1.'rfIPA(:ILINLVLIfIYFIAARLCMAOSYL'fRAHKKIFYl'IfGVPPLWVLLRLTDKEIKNFAL::VDIrJY
fIIIIMPf1911'AL'llf:fPCL~.N:KIVFALSSLCYI~KLKUr7RW1'fM'b:KNI.AP


vt:l f PVLEIfIMKRKLF.NVHWKp IYU.~.YI:
fT(Y'.':I.:ITI'KWlt,1\:::NPI'YLWC'lY.'If~/PY
FV IFLGFLFJffEf:ICKVLPIJU ~IJIITP;


1'I<KYLLAI'VAttl-h;Nt
l'1.l'Ivsl'I:a.T.'7.PMl:RFPRLLNEtI~,fIP::FNI'fl::IJI.VFf.~,


Yr~ ~I'171 ~~ItsSll u.n11.1 NKW
:NtbLY111::1.I'I't:11.\I'IiI.LTKKYFN::;~FAW~PDr:YF.IAPt:::VIKtiVH~strf'IDI::


rt.'.tl IetIA fatlym.r.r::.
::i,lm.t'id::I:l:lilrJl:rf::ITNKF:a'::WAfIY:AIIf.VF::J4:NAE~iELYLIiLVTKKTNK
IAIs:Vra:KItF


II'Yr:H;:YPI.%U::::ALUNF1'1.tKrJKL:LKYLPC:LIiIIQ(/:WNW.~.PLTEL:iSYY/OEILONPP:lf
:At'fa?PIKIYrI.


of 41 :;:Lt:EI:I1Y::(r -IFi"rN::TF.~,YLtIpTPt:Is~'~:L%1'P.LLPOI
EFJ1F:.'fAEERFIIWO i


v:Itl.::Ut)a.nl.l'NIt:DFll~'FLELPLEKIIIKVWnTtrJtIL::PEfaII:PSWSYY1MKLLRNSSs'Or~
rr/HI IIHIItNII :IHllon


sN~,A%::1'/ffY.'Yt'LlrlTk.'EFAPINKKFwL;:L::ELRtIILKKAfI:.~.IPWf:PAAACfIIKPMVScT
'.'sss I,yl,.rtu'r s,~.nl 1'snr..rtm


't'I~LIIU'If.F'I::.'7::.WYf111:.'iRt:1.t':iIKLtIY.FfFfIF%FJILPKEEUKNL:i00If~aAK
WLIKINNK%LI'YfAI'rN'IIL::ILLLVI~A::fLi'KKHViPKAF4EYLVTIUf'KPIYITI'::VWIA'




CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
AKTtRP.':V.1'~QPQKQAKCCPPOt?tVQKALGKPTPI.fNEPNEILaILFW.'tL.'vLL'~.ll;.
..1I':DYII:x:EKI..iYc:..:::..::::.:.'.1:.:..:FCfP:'fPKti
:."EPPKPSPAPTVAKK'1'fATEKP Wf'/:Ot'w:""'_.,.....~.F..........lA_.
pp':LFIr1YJ16SD
L~IFN'
R
"


PP.~.ITKKNTO4iK1'QLQTL.iEVAGAL.~sLtIVDKTERSETSLKNT~IP3TAQLTNHSCLKJ1T.
..QHIJIR
O
TF:
.::
~L:
a~.lK S
K'4'IFlxIFJ\WAWRIIIY\'EI1~:1~
D"'
F
~


'3EDEIl:ELFRT11LALPSKriYVRiKLVL,iPNGEfOECSFIw~EVSAADKOLLTORIOALPPO.
~ ,
' .
r
.
.
V:iLOKISKDTA
ALPLEIfIQALOP~ItLt'~ :'T.EDIfKYPSCLF:EE:.:.KCFL:IFttpC
'lfl~1


L ETENSADCfLTIL:aF~
.N.:
KFLEKYKV.~,KNt.:FHIKL


CPn 0791 x92359 991972 ~ or.;lr)9 9.r4~lv
U .7
~.Pn


A%bD-BtOpOlymer Transport PrOCein _
ORAD.~.Tr""'!'!N'~tl''QYY.tI4KYPrI'EETCEPVMLTPL:Dt'/FVTLJMFTVAVPLIKrbsn-slom\
rrrriac~ry f.rmilv nr~rwm-PF~C
cnospnacase IRSEi~t
-
~


: : : . .... o ...,:.:IY.1'i ~ : :.. _
.. ~:A...\i~.. . .... .. i~:.Ial n.
:.. ... ~ rr.
'::Ii1:" . .. . . . , ...,.....
. , .'S'~VYSI''b:A


: i~: :i-! I::W rJ:.:.:::
NTLTOIVPLNVDVL:LFSLVL:ILDA.i:Ft:fPNL':.L...'VE11L~KVFx'.:YNELiLIKVFPNGD
..~.yy :,::_:::i: ..
-::'IY:IS


33 992296
KIWASSIPENLGtJf(NHKIDIPILYfPFLAAi.KOSP101pEVfSJIpIINVFpAKCpELOGI


CPn_0795 9970
LYTfFSAGLLJCCtl.It>IOOSYLTVKTAILSKYGV:LItASDPAiJILNTVYPDIIt'RIfIIIC~QV


exbB/col0-polysaccharide
transporterFU4~PCPIDSELGPLT.SPLDIGFNFYSFKIKDTEIWGCIETNPSIDIAVLSYAIGIEES
ONLYFETLS4NKDt'YSMMFSNNPIIQAY'fFJIDFFGKSIFf''LLILSVrISIM.HOItiJII


OKNFLKAGItSLKOFLIKNRNAPLSLDIHPELSPFJ1DLYF'fIKRCCLELLDtOIROSAPDRGfAPL.WRMRM'FAYf
PCILLGSLIAFIVARRLSLPIRIC.ATANIESRIOdOJCLYTDDiLG
'


PILSSEDIQSLETLLCAINP1LYKALLH10ISFIPATTISLAPFLGLLGTVWGILVAII'ttISLPSYPNIE
FEIIGII~IfItNAIIIIEt1L11L.AKTNFP.IQfIlG\OF4l:.Hi:.EpAQpRLLPN1


SCSS~tSAINEGWTALCTTIICLFVAIPSLIAPNYt.ItANSSELI5EIE0'1'AYLLIaISIEWIAYIPAITVfxDPF
fHFVVCECSXARLFLIVADJISGKGVNACGYSLfLIQIIZ.RlfLSR
"
'
'


I
MYYSCNPPACYLDPDCETS
SA
SSSi.Q0AI0L"1'SRLfYFPttKNSCMFVTL.~IYCYN~


WLINpGMALGFLPEVJWITSKLFNPKPCSLPYLYSDGITE~111t~P7~I0lffCCERI4AAI0G


CPn_0786 881137 995293
LTGKSAAOAVNRIJG.SHCI'FtICNStIpIiDDITLLILKVLES


dsbD/xprA-Thio:disultide tneerchaelQe
Protein CPrL0791 197123 891001
IPG
'


L No robust holuolo0 Dresenc in Genebank/ET~L
fOOVHIIPGAEGLSESSY as of 11'/98
NHGVILNKFRTYLOTALIAPFFSFPALSCSFSSIpAeEI
'
'


pKVfEEEGTTFF
KSSKNRSFLLKKSOQiQV5LY0lfWWFISOLKKSLCYSTVAdL:FNIPSOESFADSLIDLNL
EHNP1
QTPRIGIK:TASKGSHI'lWlOIPGEIGSPLKISWOLPIE
EWtJICGDSCLPGNVDLKLTLPY~!(iPSLY
.


CY1;9SALIVAINMPEGYTPGQEVELRAOV
GLDPSVECLSGDGAfSVGYFTtUGSTPVEIfpPFKYDV5K1IT!TT'..SVCTANOSGYAYGIS
PtriIIAEFTKTLHJIQt~ftVLFIJDHSVQVAOGKCNEIILNISKItINIITNAWE1ISEKAt>IQ.FAY'


AE'tSYSGCTCCAWRLKViQitSGV01Q4EKLHCILLLrIDIIIGRPVESLTINSSAVIbVIOCFSYDA
YDCI'I11IC1'CSLJG71G1CYNCAKiI$ADCTLTPLTGITC.3FStfCFaRAISKC
'


AGLSQYITILIMAFIGLIILtI4IMPCVLPLVTLKVYGLIKSAGENRSSVIANGWFTLL1IVAVkWVN
SCpPKAVOffASGAT'fYCOLADISGGSRSSYAYAISDDGT::VCSNESTITR
'
'


CCPYIGt.7~GVAFIt.KVLCtOJIGWGFOL01ATLIIVfFLFALSStGLFBdG'tDffANLG.IYIVCAANFATVTNC
NpESNAtMYKDNOIIfD
~' GLYISGt%
NVPTYLCfLDI


GKIQSSF20CSSNNKAVGAFtIJGILATLYtTPC'fCPFLGSVLGLVNSLSfIAOLLIFTAIG


L~L7LSpYLVFSVfPKMLS1ILPKPOGWFISTfKOLTCfIQ.LVTV1WLVWIFGSETS'iTSWVCP1L0795
198008 899195


LL.OGWL1.:LGAWILGRWGTWSPK1LORVCASLLTFAFLOGAISItSGt~SNYFABPOQTVNo robust
Maaolop present in Genebank/GwBL
as of 11/7198


SVNEDSLWpPFSLEKLAOLRAQGR15VF1MFTAKWCLTCpITBCPVLYCOAVOIC~?LTfIGIVGTLOGANSSA1GVSS
DCSVIVCpAQTADKSVHAFpYYNGEtIKDLCTLGGTSSTA1LTVSPD


TLEAWfRKDPGITEEIJ1RLCAASVPSYVYYPGDN&APVVLPEKITOM.LEDWSRFVRGKVLORSOIADGSWFIAP14C
NTDFSSNNVLFpLil~il'YKTInENGRQWSIFNLONBdOR


ASDrt!"lTftIAi.Gt~GLYVMILONLPStI~AQYfGIAYKIRPKYRLGVfLDF81F8Sil


CPrL0717 185604 186101
WlrIINVSHIIRWIGAFII~IpDSDAt.G55VKVSfGYCKOKATITRDpL.C~fFIJIL''SGaNf
'


yabD/yctH-PHP supertamilY lurease/pyrimidlnaselCVNfL
tydrolue
ECVA7l0I>I~RYCKSLGdMtWPFLGLOFVNITRKEYTENAVOPPVNYDPIDySI


TRROPVDIrIDJUItMLSDDAFEEDINSVLOMODSCVSLWiMTfEKETI4RSFAYJIEitFPGICSNI11LVDSGiVCI
'NI~ONFAANTDRFSGSIASIGNRVFENLDYCIIfRAFA~tIM
'


KIRFCNVCGTPPQDVDpDIEF~YRtIPNAAANSIQfLAAIGbIIGLDYGFATE>03IARI~YLSSDLRYIILGF
YELPYLQSLNLILRVNOQPI4CV!!G!


QAYLiILSLECFS.PLWHCRGJvF4iDFFRM.DOYY1Q7DPRSRPGMJICfIG?L.EAOELISR
199280 901710


(;<JPISISvIVFFKNAODLRDLWELPLflILLI>:fDJIPILAPVPYAG~04EPA1Nl~TNA
CPt1_0796


VuqV~~~y~,~G NO robust haabloq present in Cenebenk/<i~L
as of 11/7/98


SELYSSYLOP~IVPNSIILPLPCLSRSETFKXVRS108(TlBM.TPIfIYRRDWYfAF


CI~0781 186521 887132
LLTAIPGSFJIfnT.VDIJ1GEPRHAA0A1GVSGOGKIVICNIfVPODPFAI1VOFQ7CItlONLQ


sdhC-Sucranace Dehydrogenase
PLL1VRPQCSVYPNDITPOG'CVIVCZt~IIfAIGIICSVAVKWHJCKVSELpM.IDTLDdVJISA


SLVKSLRNSRIiEICPEVSHK1IGKYYSTFIFRCIHSLAGIAtTFFI~DiLF1?A4Jt.RSYFSVBAOORVIIIt>LGi
ISVAVK4f~OVITOLPSLPDAlIUICVIICISSOCSIIV011RIDV


QGKCFVANVNGTNKIPGLKIIEVAGLVLPFLCHJ1IIGIVYLFOGKSNCIfSGDGSRPNLRYSWRNfAVQWICDQLSVI
GThOGTI'SVASAISTOCCVIVGGS817ADSOTRAYAYIQ~MSD


1U0~1YSYTS~IpRW'tAWILLFGIAFtIVVfILRFIRYPViIVDIHC'1TYYAVDIOPSRYDVIVIIG'1'IGTIACI
YSWtAVSSDGSVTVCVBTNSENRYMAFQYAOGONVDtI~TIGGPE5IlAQf~YSG


IIGFLTLNLPNI'~ISSItYSRHDLGGADAALLSFJINSYLLTPSADTAFLYWRt111LGSLFIDGkVIVGPAQNPSOW
ILAFLCPP~SPAPVNOGSTWI'SONPRCINDINAt'YB~.II~


ALLYTILVIAAAFt~FNGLWI'PCCR<AGVWSLRMGGVi.RIVCYL71NIWTFIK:VSAVWiLOOL4RLLION51UNES
VSSGAPSFTSYIOGAISROSPAVpFIDVpKGTILSYRBOSIIOMION


YSVA
COLLTCAPMDWKiASAPRCGfKVALNYGSOMLVERAALPYTEpOLG8SVL80lODQOOG


RYDFMGETVVLQPFIIQIOWtLSREGYS610i11AFPVSYDSVAY8AAT8lIIfiJUIVrJIfLf


CPtL0789 187136 889316 P101SfAATINERDL1~ISNI PFASLAIIYYWRQ00LV


sdtlA-Suceinau DehydroQenase TL.h'TNle'IpQPLTCfLSLVSp88YNLSf


0t40ltJiRKVIVVC~CIJIGiSAAI~LANLGI
IVELVSL'1'JCVIfRSNSVCAOGCINAALtfLKPE


EDSPYVNAYDTIKGCDFL7IDOppVLt7~lCLAAPRIIKI4<,tItPOCpPFIPBpSGNLDNRRFGCPtLC797
901552 901694


GTLYHRIyFCGAS.I'~Oil'IYTLDEQVRRRpUIGRVIKR>SMtEFVRLVT~GRACGIIhNNo robust
homoloQ presafc in Genebenk/Et~t.
as of 11/7 H1


NLFNNRLEILRGD11VIIATCGPGVIlRtS'fFISTFG1GAJINGRLFLOCKAYANPEFIOIHPVLIL1WINVGTKIG'
LNNSKKIKVf.GHLTi.CTLFWCVLCAAALSNIGYASTS0E8lrORSI


TAiPGRDIQ.RLISESVRGEGGRVtafPGDSSKRIVIPDGSDtPCGETGAPWYfLCMYPAYVSIaiGSRIVGASGaGaC
SbTAVIWC5NL11W(.G'1'h0


CNLVSRWGAW1ILRVCEAGLCIDGAl4P~lYLDVTHLPERTRHKLEVVLDIYIGCFIGEDPNGGSSA~ISKDGEYW~IS
DTREfiY'1'WIFVfDCROIOfDLCILGJ1TYSVARDVl3t~II


TVRBtIFPAVHYSMfiGAWVDWPAADOPDRDSRFROFltNIPGCFNtxESDFOYHCxNRLCAVCVS11TIUIG)~lllt
~OVIGVIIWEKCKIKQGKLLPQCLWSPJ1NAISEt>CI11ITIT'~10EI818lItI


NSLtSCLFAGLVSGDE71SRFIEAFGASOATSSDFORAt.00EKEEN71RLLSASG1I~1IFVLVAVKNNKNAVYSLC'
1'LOGSVASAFaISANGINIVtiGiSTINNOETNAF181KDE11lfDfI7lL


NEEIAKINVRfIV'MCRDBfRDLQE'll~KLKEFRfRLfffVSVLDSSPfANKBfItFVRONGPftOCGPSYATGVSAO
GPAIVGPSAVK1GEIHAFYYAEGET1EDLTTLGCEFJIRVFDISE~ID


t:EL.AL7IITKCALLRLiEl7~SHYKPEFPERDDEIiWLILTTVAVYAPEEPEISYLPVDTRHVAIIGSIK1'DI~GJ
IERAYLF1IIHK


PTLRDYTKSSTCKI1:LTNIPDNIRLPI
CPn_0791 902810 907856


=t'n_0790 119279 990103 No robust homoloq present in Genebank/P~OL
as of 11/7/98


sdh8-Sueel.nace Dehydroqenase
WFEIIFWRVPMtNTCCONYRSiCWFSWLFVLT'i'Q'fLFACHFIDICTSGLYSWAPGv


:1SRIPLIISVYPYRKAFItItZ7LETFILKIYRGVPGKOYWESFELPLHPGENVISAUdEIESGDGAVWCYE~NAfKY
VDGEKFLLEGLVPRSE11LVFKASYDGSVIIGISDODPSCRAV


KRPVNILGEINNPWWEpCCLEEVCGSCSILVNGVPROACTALIOEYIDATOSREIViJIPKWVNGALVDLGIFSOGIIp
SFAEGVSSDGKTIVCCLYSDDTE'fNFAVIMDETGFNVLPNLP


:.TKFPLIADLIVDASIMFI>1JLERIQGWVAADIt7CETFGPQYf0E00ELLYALSOCMTCCCEDRNSCAWDASEDGS
VIVGD71MCSEEIAKJ1VYWKDCDDIit.LSNIPGAKRSSAW1V8KDGS


.1'FJICPQIDl7KSDFICPa.1i50ARYFNTYPGDKRSRKRWRAfJIGItGGIEGCC0Al04L11RVFIVGEFISEDJ
EVNAFVYHNGVIIIDICTLCCDYSVATGSISRDGKVIVCHSTRTDC6YRJ1F


CPKKLPLTESISAVGREISKFSLRSLPSJ1LPKKKXicYVDGRNIDLCI'LCGSASPAFGVSDDGKTIVCKFETELGEC
HAFIYLDD


CPn_0791 893101 890111 CPn_0799 905001 903910


CT590 hypothetical protein No robust htlmoloq Dresenc In
Cenebank/t7i8L
as of 11/7/90


T_LRSSRKIWEDISDRNNYSCYSKGISHNYLLHPFISRLDIFVFDSLIrINQt7pNLLEEIFNREWIMIKOILRSMLSO
SSLWMVLFSLYSL~Y~VITDKPEDOFNSSSAVIMD181CK


CSEDTVLFKAYATTALQSPL1AIQJLNIARKV1WYILADNCEIOTVICLVFJ1IHHLSOCTYPTTLSALSNKKASAKAV
SV."tCrITTVCFIKD7WSPTYAVRWNYWCfKELPfSSWVKKSKATG


iGPHRHNEM~REEIIt.L101LKAf.Kf11PKLILESIRTLFVPSYSIIQFILIRHTL~11LFIPQTILSISSDG.iII
AGIVt3iELSOSfAV'IWIQMllIYLLP""fwAVrSKAS'GISSDCSVIVOS11KDA11


TIHVRQAALTALFTYLRQIrIGSCFATAPAILIHOEYPERFLKDLNDLISSGKLSRIVNORSRTFAVKWTGHEAQVLWG
WAVKSVANSVSANGSIIVCSVODA~LLYAVKWEON1'I1'HL


'cIAVPINL :GC IGELf
KPLRILDLYPDPLVKLSSSH:LIIKAFSAANLIETLGDSFJIpI00.1'LIX'..IS.I IAKAVSNNGKV
IVGRSETYYGEVHAf~'HKN
0'MSDUG'tLCGSYSAAKGVSAT


LLSHOYI1AOKIQNVHETLTaNDIINSTLLHYY0L0ES~IPPKECLFSKEQVAFSTQH.KVtV~iSTTJINGKLH1FKY
1.:CGRNIOIJ;EYSWKEACAtIAVSIDGEIISnGVOSE


PAELSEIQRVYNYLHAYEEAKSAFIHDTQNPLLKAWEYTLATLADASOPTISNltIRLAtG


WKwEDPHSLVSLVMIFVEEEVENIRILYOQCEpTYNEJINSpLLYIECRNANPLNNpDSpICPn 49(10 90ti550
905249


LTNGMRfR0El3JKALYEWDSAQENAKKFLHLPEFLLSFYTKOIPLYFRS8YD11FI0EFAeno-Enol.aae


NL'lANAPACFRILFftICRTHPFrtWSPIYSINEFIRFLSEFfTSTESELLCKHAVINLEKERKEIKINFEAVIADIC
AAEILOSRGYPTLIfiIK'rtT~TG.~'/:EARVPSf:ASTCKKEALCPR


~.~ALVNHITAHLHTDVFQEAL.L.TRILFJtYOLPVPPSIIliHLOpL;~011'PWVYVSCGTVDTLLTL'tIvPRYQ
CKGVLQ.1VKNVILEILFFLVKGCSV'lE9SLIL::(J~iNDSDIL:FNKITLGANAIL


:.:.G'IFE:CEPLTLTEKHPENPHEWAFYA0J1LKDLPTGIKSYLEECSHSt.LSSSPTHVFStV:a.ATNIAAAATL
RRPLYRYLOCCFACSLPCfMNLItrYxlilAOFIriLEFpEFfIIRPICA


I:AGSPLFRFJ(WDNDWY~'YTWLRDVWVKQNODPLQC/1'ILPpLSIYAFIENFCNKYALOHVSSIKEAVNHtv\011
FHTtKKLLHERCLw~~lr~Vf7G~Y:FAFt1(J1SNEFJ1LELLLLAIPIUIGff


'IIIDFHDFr:CDHSLTLPELYDKCSRFLSSLFTKDK'IVALIYTRRLLYIlNREVPYVSEOOLFt:KUI::LILDt:A
A:::'FYWKTCT'IIY:RIIYEL7GIAIL.~.NL':DRYFIU::IELI:WUDYDGW


iE'/LONV.~..~YLI:I:i::RLTYEKFRSLIEETIPKITI'LL:>:ML*HIYKi:LU1()SYOKIYTEEALLTE/Ia
:EKI/Qt6'~:I'f~LFYfNI'ELILECLS1Y:IJW::VL:Y.1'N~It;rI.TFTWAtKLAQN


U"'ILRLTTAMAlIIINIr\YFaFLLFAD1:NWPSI'/FCFILtIfC'FfEtl>l.WKFNYACLOtiQPLOA<:Y'ITI
t'IIR:x:f1TI"tTlADirIVAFFU(CQIY.T:.:L::I'::F.f?V\NYtIHI~IEIEEELG.4FAI


:1I9ELFA'T::HiwrfLYANF IDYI:NPPPP(:YR::RLPKEfFfTU::LfJt~.:YI:D::1:f:


':In 117n: w~4:r55 N:rIIUH 'a'n_rxnl rUN'fU'1 nUf.7i7


Wt'.stn M/Iinhrr.u:.~l pc~cein avrH r:xm.1..n::' Alr' ::ufslnrl
It


''rHHt.ItlIYtaRtHKIrfFTKRVLFFFFLVfPIPLLLILlAM:FF::P::AANANLWVLtITRAIIF'Mfft~:Ll
1\I1':llvta>L,t'FU\/APL::AiNRrYiM~QV'.:/a-nly:Y'rFTINNVAFMII.


7YIL:'.IF.FF:YY.L'fIIIKLFLDRIJINC(./1LKSYA'.'sP:iAEF'fAQAYNE?MA(.::NTDF~LCLLDPI
'fl.Vl.AIiNKTL.AAA'L\'~'h:F'IiF:YFfiRIAVEYFL':Y'fDY%yifillY
f AI!::IIr1'tI:K::IJ.INDf.


F'f/:::VRTYtIIV:Ut'FIRYLNOtIPE711tKNL.HMV(:KAFt.LTII~:KPL1.111'LILVELWAa~WDSILN
tI*L::ACH::II.t'Jifil~fl.1\::::V.~.t:I'ftal:::4~JYT:.'NALVI.fU':YI"tl'ItNII:FnW
LVK


'ITf:X:LL'/::1'YPll::1I.QKDLFOSt.IItTKCNLCLVNCY'a7VLFt:IIQU::E:::FVFSLDLPNLNIIY
yA::PItVN::APRtaata'llrlF'1A'If::HIl.I*LEFIlIUfL'f
al:'C:PIh:M111*E::VP


inFqAR.:P::AI EIEKA:a.Ita:GFIJLITVS::A77.Yrr:::llW 1 t't:\ t HF>La\
INKKRYLJ:LVIrIY. I t'!\t:rYTt.::LVPVSDLI fe'r ItJFI:L1:F.NNAFfDDRt I EKOH
I 1 Flllrl'Mlfrl F7iIKE V:


J::ALKVPtIIICFFWI.AFLiMWWIF:iKINTKLNKPVit:LTF'~.'MIlldWRn:NIINVRFEWjPYPY'Kr:llv
'fY::HIIF'n:Alw:.\I1'n:1.td:fl'I'It.OFt.I.IIDE=:INJfLI~lIIAfIYHt:IIJ::RKQ::1.


ia9


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
yE'C:FRLf".:AFOIVRPLTYCFJWKYFRIrJtYVwAT.m.,tLDNA Mtsawetn Hey
.'EVOESSCitt'Jr"ptIRPTGIPDP TRAP LQLL,DFL~. IHQ 2.11CE l: EV.
~ ::1.'If EL I Et'w':.DACADE:
'
LCWtIf~ILTXAPM
. i:


NPE IRFAT: ;tJVDDLwEEIRLRL:iOKHEK .
I LV I :ITKRLAEOMAGFL iELEI PMYLNSG,
I .
'JRfNq'~IC~'I~EF;tCI~
SI
EIETL3GOOGALi'IRONCr~9FRAEJai;l~
~~'


ETAERTCtL:DLR.7aftOVLICVNLLRECLDLPE1ISLVAILOADItEGFLRSTSSLIOFCG,
aAApHtlr:KVIFIADOKTRSLEElLRETERRROLCLDYNKEHNIwPKPTIKALFANPILOr
" ~
A:,::.IOtEIQSSIECDOCVR'lYtflOW'IVxEPCAtIpLGrtYtVIISL
P~pR&t


:ORFLiKEDLEEOIKKYEAL1~1QPJIAItEFRFNfJIAKYRDAM~CKEOLLYL0.?DRLGIRKLIEHRILST1WIGWS
~1ISECHHEIQIAK06~:P'OERVAYVMCONfMQOALTI
' '
'SKDSESPKE ~


. .DAYAL:.LPLNRY/VF
. .LF: :KKLT
OKL1NGYRIVCYL:.~PSFtIRPTROCOKIFaIDRPIE.


F
VLK;,YLPSS>JCDFMMPOKI6ARICKEEL4t'.DCiXEAIVE'.'LrICPPf::'..:R'"HOEIEESO


~n~751 90R7n~
::VPLPMFRMLE'.'r0'I:EEESVEFOCNLFAY35EOV~::LEKCEYT.~.R~PKSCNDYIIYSSWR
~P
ORO


n_ . r'.":4\:'..,:. ..,.., ...,~...
: .


. . :: r:r ~ : .. . y..r ;,..r .
r-,,.. . .
' . _.~ ... ..
_. ,_. ... .;,It : .. E,r-
J .:v:i. ~:.'.~: 'Y:i
"/:'.... ; :
:TF


,. N~ICECLTCATF3KH0f1~'FDVSWLK:.i%d-
%":KPkKIa'UirlitlRHLLLIlSGfMF~i
....
.
Y :,
iAKEEVLt>uDMIIYEVLADW4iJGIDPIKSIZYLOSAIPEIYELHLLF.itfLLSINRVNGI


PSI~tDMARNASIEEGSLSYGLICYPILOBADILLAXAQFVP1ICKDfIIJdtIr~.TRDIAPNF
R CPIL0813 920813 9:193)
'


DPN P
NRLYGOVFPEPEVL.OCELTSL'dGI000G10tSKSAtNAIYLSOSDATITkVRX11Y1


IRATTPGRVEGNPLFIYHOIFNPHKDIVEEFK)1RYROCCIXDIEVRARLAEELIHFLIPIPepP'Anr>.nopepc>.
dase
'


KERRSEFLSKPLALQNVLCOGTHIOUIEVAKS~IEEVNL*ICFSHXWRSLLKEfL.iEOLAYFLHWJ1IJ1CILLIOGO
EYIIF
TLILyIKOtAitISNDRILNA4RALSEHNLDALLt


FVYPMDKDLYSHIORVPLTFL'tODWADLSLYVOKQRYCKIGFDSASTVYIfKFAQ~LP


0803 910306 909752
CLWtPLOCFTEItIRSIKSEEEIRRMOEAAAUGSA<iYOYVLTLLR~"ITCXEVVRpGRAIIi
CPn '


_ iDRPLKXCCIV~IDIGIWG7fCSOKlInf171LG
CT581 hypochecacal Prace>,n AEAGAEGPSFPPIIAFCENSAFPHSIP


FMMKTKTLELEONVfLLL>''~JLIfRIFATPIGYITPREFQtiVVFNCANCQOEIANFFPEMTPH
I0~1L1VRVLRENHLDTYIINCICIIIRICR


LINGKLTQELAPOQKOAAHSLIAEFlOIPIRV71IIDINERGEFINFITSOMLTOOFRCIFLNHIHEYPCSPRGSQVIC
.f.~fl'ITVEPGVYFPGICGIRIEDTLCIt~l0~IF5LT11RPVISE


RLARVDCQEFLLMIOVDNTCHLIRNLLaRLLEAQtOdPNCEIOdLQEIQEEITSIJOVtiFDELL


TKA40
CPn 0811 911996 923357


0804 911071 910310 CT911.1 hypothetical protein
CPn


_
FfLFFKLSYNtIFNLPLTMYOLLSICYSFVSFIALLWNLCYSPNYVTDLYRISLSAEESL
qp6D~CHLTR Plasnid Paralog


EIFSSMGNLKTLLESRFKKNTPTIMEALARKRMEGDPSPLILVRLSNPfLSSKEKEOLRHLGGIRAFPOAESLLCCACA
LNFPDLEERLPDLRKELLFLGSNDRPDAOGCRFSIALiISSKE


LQNYNFREQIEEPDLTQLCT'..SAEVItOIHIiQSVLLHGERITINRDLLXSYREGAFSSWLLCYIAALKFRVYLIiV
'1'NSSItGPVYSFSP10GVP1'EWIECFSVSVDCRVE111fVRLOGLIaEL


LTYGtrRpTPYNFLVYYELtTLLPEPLKIlD'IEIDIPRQAVYTLASROGPOEIOCECIIRNYAGISKPRDCETLFLNP
PJ1NKLDCWEIACFRVOASFPVIIQXIRRIGVDKFLIJOIOGAEIfADXA


ERXSELLDAIRKEFPLVETDCRICTSPVKQAt.At4.TXGSQILTXC1'SLSSDEQIILEIG.IKTXERVDFVSSDEEt
iIISRYLAVCtM.LWDCNC~IpTCGEFpCASSRAPLFEIfIaI00KVMIA


XyNyFpm.XV
DLWNIbO'1'ORQTISLVXGVPSPIEINEYIREIEFTCMRSWSKPIVLVOCrpRt.ILSPOpN


LRTAIOf3iiEICLSRAD0IQ0YV1GKV1CPLLVFERLEXDLRGFVLRGNI~'t7~RTLVC1'ISL


CPrI_0805 911816 911067 ~ PLItaCPtPAVASpEVSSN1'ItSAAANPGIL'L19ROG5


minD-ehranosame partitioning ATPasrCHLTR
Dlesmid protein GPSD


GYJIRR!!K1'IAVpISF>(CCTAKLSTTLffLGAAi.AOyfIQARVLLIDFDApANLTSGfGLDPDCCPfL0815
923361 925622


YDSLAVVLpCEKEIQEVIRPIOD'L'OLDLIPAD'l~f.ERIEVIiCttWADRYBHERIJfYVLGSgspD/OilQ-
Gen. Secretion Protein
D


VQDKYDYVIIDTPPSLCWLTESALIAAI7AlALICATPEFYSVKGLERL1GFIOCISARHPLMVPfPNS4LNLVAL&~G
.CCSS4YALTIAEIQIASLEHSGRGAODYEIiIASPNANOtEYSL


TILGtJALSFWNCRCIO~tISAFAELItitCTffGKTthtl'KIRRDTIVSEAAItt~VF'ATSPSAOLSKLYEFJ1RX
LRASG'1'~EALWICDLIRRIGEVRCYLREIEELWAAEIRIEi~.EDIfAL


RASCOYFNLTKEL.LILLRDI
WIQIPCC1'IYNLVTDYCTEDSIYLIPOEICAIXIA4'LSKTWPKESFEDCT.TQILSRfGIC


VRQVNSWIXELYl091K~CSVAGVFSSRKtE.EALPIrI'AYICFVLNSNVDAtlTN011VLDIF


0806 913816 911867
1NPLTlfNDVIAGRVWIPGS7VGENGELLXIYNFVpSESIROEYRHIPLTEI~IISIL
=Pn


_
NMFREDLZX1HSEESLGLRYVPLOYOGRSLFLSCTAALVOOUZI'IRELtEDI:MPIDK
LhrS-Thraorlyl cANA Synthecase


NANNESPPti!$J1WN104IOV1r00RiYEVLEGTTMEWCOLfOQSf~FIGVLINERPRDISTVFWYItVKNSDPOrr'
~rre~DyfSGEtOtASVGAADGCG80LJ~L18I0IDfIYSEfARD


THI1JE1GDTLVFLTSEDPDGREIfLNTSAHLLAQAVLRL41PDAIPTIGPVIDNGTYY~'ANGSVKYGNFIADSkItG
TLIMVVEKEVLPRIpIC.LXKLWPKIOIVRIEYLLF1Jt10.IWt~IIB


LSISFSDFPLIEDTVIIQIVDEK1J1ISRF1'YCDI(QOAL.ApFPQNPFKTG.IRELPCiEEISGLNtI.RLCEEVCX
XGCSPSV~111 .LXT~
GILEFLFtOGSTGSSIVPGYDLAYQFLJ111CEWRI


AYSOCEFFDLCRGPHLPSTAlIVKAFKVLRTSAAYWRCDPSRESLVRIYCTSFPISKELRANASPSWI70ip1'PIIRI
AW~tSIAVSSDKDKApYNRApYGIMIIOIZWINVGE~tSY


HLEQIEEAKXpDIiRVi.GAII1.DLFSOQESSPGMPFFHPROMIVWOALIRYWKQLff1'A71GYXITLtTDTi'FL7
1'I'GXNHD~tPDVTRRNITN1IYRIAOCETVIIGC1RCIO011SD8107GI1lLC


EILTPQIlaIRpLNEYSGNWDNYXAtItY'1'LQIODmYAIXPIB~ICiGClI:.YYKTfILHSYXEPDIPGIGKZfGM
SSTSDSLTEIPVPITPKILENPVEQQmrrrsrre~~pp(


Pt.AVAEVCNV1IR0 TPEOVIdlTillILOLVSTLYCfFASFJIAAWWIKXLEMFPAbCVSLSpV0t0EYDGC


GLE7MLELSTRP~TIGDDSLWEL71TMI1~IALVOSG'1'PFIVRPGEOAFYGPKIDIHVII


t7AI0R1WOCG1'IQLtMFLPERFELEYITApGTXSVPVlfLfPALFCSIERFLCILIF31FKCCPn_0816
9ZS600 927102 -


RFPIIiLSPE01JRIITVAl7RtIIPRAKELEE7WKRLCLVVTLDDSSESVSK%IRNAONIpVNgspE-Gen.
Secretion Protein E


YMITLCDHEINENVLAVRTRONRVINDVSVZfiFINI'ILEE109SLSLTALLRGIOfellMSILSOELL.DILPY'tF
GIOIiICLLPIEEBSLLITIANATATSVI110DEVIG.LIX


KPVRFVLXEESCIt.ORL00LY8NRl~tI80!$.LTIDtICDCITISEEED4LkTl0SIPWR


CPI>r0807 913950 914879
LtifliILKFaIIJ~tASDINfEPCE~IRYRIOGVLHDRNSPPSNLRSALT1'RL.1tV61001


GT580 hypothetical protein
DIAEMRLP~ODGRIXIHIt7GQEVOMRVSTVRIIYGERWLRIL01WNVILDIACLJ111103'L'


TLQI~LtMSLFLVFLTAFIWSSSFALSIQ.VtBIASAPIFAZGARMtfIAGAILILaAwIPOGEILTKDTITAPECILL
V'IICPIGSGKT1TLY5VL.QEWOGPLTNIM1'IEDPP6YIO.IOIJIQI


FVGISKXIPLYIVIS.ALTv'FYLTNIFEFIGLOSLSSSKTCFIYCLSPIHSALFSYIOLKEAVKPKIGLTFARGUtHL
LJtQDPDIi?IVCBIRDOETAEIdIQAA4TGlR.WSTLJf11D11IS


KYft.ICKVLGLSLGLVSYICYLTFGGGGDDSpPWISapICLPELLIL~GMSLASFLW1'LLRQAIPRLLOMGILSYLL
SATLVGWAQRLVRTICPYCKVAYTPENDEKSFLiIBtL~I'~L


IEKOSfLSVTAINAYAMLIAGM<SIMHSAWEPWRPLPVQDISOFLYATLALWISNLICYROQCMICPRSCYIfGRQCIY
EFLRPNTLFRSkSrASCIRPYHILREfAEpIGFLPIL.EtIDI


YNLYAKLLRK1CSSTFLSFCNLVMPLYSCFYG<JILL~GEKCVSt.GLVt.AVAPMVAGCRLIYHAL.71VSGETTL11
EVLRVTIOLCD


EEFROGYri75


CPn_0817 927106 928187


CPn_0808 916398 911956 gapF-Gen. Secretion Proclin F


C'"579 hypothetical protein
GGRMPRYRY1'YLDPKERRXAGYL.EaL.HIOEAREKLAQEtIIWi.DIREVALRRNSIKSTEL


IXKLPSWALKSLKRMPQSAEPSLAHIKPIIFKGaCIAtl1'SGVSGSSSODPTLAAQLAOSSIyFTKptrrr.r.sa:L
PLYESLVSLRD0YNE0XMCLLLTSFMETLASGCSLSOAMAAHPNI


OKAGHAOSGHDI'KNVTKQCAQAEVMOGFEDLIQDASAQSTGKKFATSSTTKSSKGEItSEFDH!'YCSGV1N1GESYC
NLOGC1.~IITWLEERAOITKKMKiALSYP'CVLLVFSFAVIQ.FP


KSGKSKSSTSVASASETATApAVpGPKGLRONNYDSPSLPTPEAQTINCIVLKKGhCCtJ1Lt.GIfIPSLKETFENNE
VKCLTItIVIGVSDCLSAYRYLFLCPASALI15ACIIl9U0tIPWICK


LLCL'JNTtsIANJIAGESWKASFOSONOAIRSQVESAPIf,IGFJIIfDtOANIWASATFaQAIWS.
ILEKLLF11LPGTKKFWKVAVNRFCSVASAILXGOGTLIEGLDLGCDiIIPYDRLRTDIOtD


LISCIVNIVGFTVSVGAGIFSAAKGJ1TSALKSASFAKETCASAAOGAASKALTSJLSSSVOIVQAV1GCCSLSOSLrI
QRSWVPItLAIGMIALGEESGDLADYLCYVAHIYNmItpKTLASI


QTMASfAIt)1ATTMSSAGSrIITKAAANLTDOMAAAASKMJVSOGASKASGGLFGIYLI~KPNTSWCOPVILIFLGGL
IGVIMIJ1ILIPLT~IIQTL


wSEICVSRGMNWKTOCARVASFAfRJALSSSMOMSOLMHGLTMVEGISAGCfGIFJANNQ


RLAGQAFAQAEVLKQMSSVYCQQAGpAC:OLQEQAMQSFNTALQTLOIdIADSQ1'0'1TSAI-FCPn_0818
92B15B 928682


N predt,ecad OMP (lesdec 11b1 pepcidel


CYTKM7GF~JVWSTRDSDFSWWPDRCpNV~IIDPt'HXOYPNIIKCVLRG909fROKRXO


CPn_0809 91'791 916307
SITLIIhafVVI1'LICIIOCALAFtMRCSIHKCKVFOSEQNCAKVYDIiJMEYATGCSB'L1I


CT578 hypoehetical protein
EIIAHKETWEEAs~CKEGRKLt.KDAWGEDLIVQWDKCODLVIFSKRVOS~ROt


dfMISISSSSCPONOKNINSOVLTSTPQCVPQQDKLSCNE'1'ICOIOOTROGKNTEfIESDAT


IACASCKDK'C'STTKTETAFL'pGVAAGKESSESQKACAD1GVSGAAATTASNTATKIAIIOCfn_0919 929117
928956


TSIEEASK.iM~TLESLOSLsAApMKEVEAWVAALSCKSSGSAKLETPELPKPGVTPRSCT5e9 hypochecWal
protein


EVIEIGLAt.AKIIICTLGEATIISAISNYAST~ADQtNKLGLEKOAIKLDXEREEYOEMKA9LY:'ICLFLIWEKFHN
NIGKANFHLKIITTDFLTDLYIVTIRDPIAYPLTGIC


AAEpXSKOLEC2MDI'VNTVMIAVSVAITVISIVMIPTCCACLAGLA1GAAVGAAAAGCA


ACAMATTVATQITVQAWQAVKOAVITAVRQAITMIKAAVKSCLKAFI!(TLVKAlAKACPn_0930 729012
329657


ISKG:3KVFAKCTCNIAIWFPKL,~>KViSSLTSICWVNr:VGWVMPAG:KGTMOIpLSENCT567 hypothec
i.al protein


t7QNVAQFUKEVGKLOAAADMISNFTOFWQpASKIdSKOTGESNEMTOKATKt.CApILKAYOEBLPCRCL'CGTFFRr
iET~SIRTEMPMCNSIAMKKOKRCFVLMEt.tJISF'FLIALLLC1'LC


.1AISCJ1I AGAHKTNNF FWYRK IYTVOKQKER IYNF'lt
EFSRAYKOLRTLP.'.TI$Li3'.iYEEPGBLFSLI
PORGVYRD


PKLAGAVR.1SLIIHCTKDURLEWtLCNIKI7QSYFETQRLL
:HVTHVVL.~>FOIWPDPEKLPE


s:Fn DAIO ?18193 17925 TtILTITREPKAYPFRTLTYOFAV.K


r.'T577 hyprXMICIC~I VCOteln


t:EIWIKKtKKTKKA\b>KMFVKRVPEE:iOEMIIQQLEL\V~DLYKELFLAUTFJ15LTDKefYt_01121
nvH~l7 'llOni.:1


N(IItL:fIML::r;'t'LE.:LJILEELTQGLFF:.'.AQEDAI,IFAKEL::.7lNfK:LKNLTTIVNKQMVKrTSi
.:, nytt~cn..t r,:.si Innr.nrn


l:At: IYfNhkLA:NKfYM)I'F I
FTI.LI:L'f::I.V::(.%AFOAANAHKRCM:AOTf
Ela<:F.NFYI:IKRSACA


F f EYrrF:K::RII4: A I LR I::KI*r:l!VTfY.p
LAKVATKKKCsRYRLWVI'F::RFItIN.~.RYNLYA


CPr_tIHII ~s13.s).f v1920~
t.L:'.EffEI'\':'f7TA:lA'\IFIRLLRhA'l'JOTYxP/Ff~:.~.f.'IAIANALI::NKUELLERGAQLG


Is:rllli.v ~.n k.r:ynlu:.
Oror...rnI'1'\'IF.'fl:I'LI'I:f:IsAE(F'IKMtJ!r:::::N::4::LlItYG('IEEK:a.C:IK.'Kf
.Nf.IFfIDfLLLEAVL
H


llsfAIFII):IIr.:M::KI~:fkNWI;WJKP:il::1'tIKKTR:SP.LAtL,nVWKK:\K.\DLLFY~1IIIIHPT
l4il'IN'lRl'1'::LLkII:IWHAVKhrJF7IAVII:IV70lJvALELFYTHTDFftI.El.Ht*M(rt.LL:iR'f



F:FEfYY:.t):fill'F1:1::NI:LL~,.~w~LLA:L:D7LLEEI7TVAYTF'1::~~:KYNF.A'J:LFYJLIJ1A
I~LL1'.IlIKKMFDY1'f.:::Y:InYLF'LVIdIJMfAI::Pn:Ia'f'.:K::fKI.


yrt~Jlr/Y'MI ~ :1.::: ~:YI IULIILYNEJIAtI:FFLAF
DAQPONP I ffYY 1 Af'::I.LKL.pL\P nl'tr IW :.: ~3x~ml.w ml :'l
EE :N
NFIDVI'NIsIm;fINl'F:FNfl.f;lRc:VIMKQ:aEKVWY:ETKKAITYKI'r4:K::YTT17JKY.::(:K


Yk n~I'.r.. Iry1tn11Nf r.~.nl In.rirr


fYLII'JLI::rIYNiIt.elflMf*.~I'1YF7C:::KI:::::::QFD!:LYRKVKDLII::NI'KW:KWKKFL


~'Iw_nNl.' vlmHl '~~Ilyn_
::HHN:F'.Atia'LVLL',:11ALYI::WN7:1.1'IN":~/I/:FIIVF.IRKMI:xIL~::Y::IMK:PIY


110


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923



NAILf.'~:LIIkFVLNIPSFAV.';FIYLCVILaFI'.~.::ITMn'~CAEFJIKVNfTt :F
.(KDROiHPKTrtIc:::VEWAKTHGY:TGPKAIALPIYA
_.iTCSKDHCDIfHpDTSNKPS


tpLt,ADKFK00LLiLG:YD~sLEYALRYDIRt.LROJ1SFSFSAYL\TPrx.'LONGSLIYPNYC


OR2S '15191A 7lLSO/
Y.iP!'I(CIJfOVVCITI~RROAIIiYIC.r'LNERPIILCOEPGf'~iHi'~E.'t~RIL'
r.'Pn '


~ 1~SITtIpCFFt.EKKNDLPIQ~t.rVEPQDIFUfVIOa
yscTJapaHYr,nT TranloCation T EOGIrtlFNYOVGDPST('EIRF'3lWl


RYAIQVRFSN':'::INr.TIKELMCICLPELFSNLCSAYLDYIFONPPAYVWSVFLLiS:r'POAALKRLPNFFSSPI
rFfLKOLLIEVtIROSRGIK',LDLKPILVCIG6SRCId.IGVEL.YRmIC


CFAVAPFLCAICLFPSPIKIC:~L~"WLAIIFPKYL1DT'pll2tYM0l44Lf'fVLLVKfZtZIG:fSLIPI'PLOGL
CFLPRVLPPtatVPQFLTQYIIpHERILFPNPpTILPPESYELVIQSINRPH


VTCFVL1FPFYlU0SAC3F:INQQCIOGLEGAT~LISIEOTSPNGIL'lH'lF'ITIIFwLVCPASPWLOLELKTNIG3
5rPTCIAIw7CWCSKHTFLPfOACFLDLIfONLFQFLKOfL$TOKC


~HRTVT.iLLI4TLFVTPIHaFFPAf?MSLu~APIYIITNIKMCOLCLVM?'LOLSAPAAIJWLVIAEN'IYTJ1NITO
VFKLDAIrIPL.3VTCi'TIJWPL:DLOFFSQLKAACLPpIPOM.F$$OFIC


-"d.r?.:: ::!M\Lw''':. ...:.:.r Li.. .'-:L:.i?!WF:vI11111'...' .
FT!:L:.:'......::vWFI:~Uii.-''.:.i:dF~F.':I~IM:..:.a""tiv.\-\:.:.::':F',.'.
.':.iiF
~.'~i
:.
:.
'
,:


....:::!: ~, ... .. .,
....:.: .
.
. .,.:.:'!~.t!':'.:'.~,..,. ..I~'..~,y-.i:A~...
:;;f::..:!iii.' .~ rl:...;..


VFDELNMAKNK.i:nOiHKLLCR:lJI4ihSK:n,iL'iv:
PtENNLi,EFKI;(j,Dllt,pNS(ypSpilLF


Cprt_O8Z1 972677 932779
KKL("tKRCSSEELFIIPSOCLLLKL?RPFI''rRRTJtKLVLPELPDKYESIIACtLSPDOE


yse5/IliO-YOpS/tli0 Translocation
KLYIIATLOROISHIOKLE1'PEEPATNFLNIFALWHLKOIC~Ip7IVF!'1<DpDpYK~ESG
Protein


IRTRAVLAFFATSFKSVLFYSYOSLLLILIVSAPPIILASIVCINYAIFOMTOIOEOTKwNAIVKLLKFSLNACYKVWF
SOYIHMIRI:1'LYLEEICIKYILSIOG7ISIlI~ILTF


FAFAVITLWItGTtIIIStXIiL.SNNILRFACOIFONFYKWK'ITDPNCQVFVCSLLaAGTGINLTA~IV4INYDRwI
MPAKENQALDRVIOtIGQIDJMIYR


LIT1DTLEERIHYLIEKKIRLLDKVIASODSNII3MCNREDLLTILSYKDDICISDSCtS


CPt>r.,0875 933618 977677 PVDAPVEDCIIGVLPPEDS


ysCR-Yop Transloeation R


ERIKVfTItARSIFRFSLCFFfLSVSCClADASLYC4SCPSRCOPTPPPSNSNPIliWOQPCPeL_0836 916960
915732


VAASSVPSYNPPLNADOVLPRDNLSDGSFSDTYPDITTOAIILIFLAtSPt~'LYNLLTSYLbrn0-Amino Aeid
t8raneMdS Transpost


KIIiTLVLLANALGV00'IPPSQVLNGIJ1LILSIYVNFPI'LYAMYKDARKEIFJINIIPOSLIMKI~ASNSLSIWSI
GCSIPIIIItFCAGNIVFPLALGY1IYNAtIpwS7lYlGlIG.TA


:TAEGJ1L1YFVALtIKSKEPLRSFLIRNTPKA0I05FYKI50KTFPSCIRAHLTASOFViIVCVPLLGLVSMLFYSGI
IYOKFFFSIGAIPCMIFITAIILL:GPFGGIPRAIAYSNATLIS.


IPAFIMGDIKNJLFEIGVLIYLPFFYIDLVTANVLYAMOlI:IIL.SPLSISLPLKLLLIVMVDLSENKSAFIPSLPIF
SAICCVLIYIFSCKLSALIQWLGSVFFPIIG.VTLtilVZIRSIIIIP


'GWtLLLOaltISFK
THPMVpEFIPNAROAwIaGFIEG1~T1?~LLAAFFFCSIVLISLRQLVAEEID(Pf6IEIPL
'


SfOCI8K1C41aSLiILrGFFLAAILLGM'YIRFVLSMRIIAGLLVNVSKptILGRISAlAIG


CPc~0826 931382 933611
PNSILAGVSVFIACLTTEIALVCZVADfLARVIISFKR14YAS11VICTLIPt'YLISIWFE


yseL-YOp Tranaloutlon L
TISNLLLPLIALS1IPALIVLACGHIAYKLWNFAYSPVLFYLTLSLTIVLK<.VN


HDNKRSGVFSSL1IFIDPORYYAIVIQBCFFSLIFKD~VSPNKKVLSPFJLPSAFLDAICpT~T


KTKADSFAYVAETEQKCAQIRQFaImpCFKECSESWS1IQIA!'LEECTIDrLRIRVREALVPCPrL0877 917777
947115


LAIASVRKIIt~tELELt(PEfIVSIISQALKLT~ICNIIISVNPKDLPLVLKSRPELID'tIhch-
fnodnucleasv III


VEYADSLILT11KPDV1'PGGCIIETEAGIINAOLOVGLD7iLEtU1F51'ILKA1CIPVDEPSETLTNKO!'ILRTWA
LFPNPKPSLEGNSSPFQLLIAILiSQiSTDKAVNBVTPQLPAKAp011


SSSTDSS&LSNDODKXE
OSILDLPP'G1C.YOLIAPCGLCERKSAYIYQLSOILVRDFNGEPPNONU.LTOLPG1~P1IT


ASVFTGIAYC1IPTFPVDTNILRL74QRWICISElIKSPSAAEKDLARYP!GNENTPIY


CPr>_0827 935773 934131 YAAQYCPALNNKIDNCPICSYLiIKJEiINSTRT


CT560 hypothetical protein


CCLVTANfFCILDILMKNSKEDDLSRFLP10JLLVESPNPEEIPLxSLSf'RISWLPTINPBCPeL0838 919196
917781


wITIAMKFFPPEI0G0LLAWwPEPLVOCILPLLEGISIAPHRt:APFCAFYLLLIC.SIOCIRthdF-
Thiophafe/Puren tbcidation Protein


?CGITEEIFLPASSANAILYYTGPVICIALINCIGLYSIAKB.KftILDKWIERYIDiALSPISINIIfPNSFIQ.FNL
KLGILSESSFNP'SIFMLIQ(DttIMIATPpGECSIAWRISQpWII


TEKLFLTYCOSNPMOtLET'1N!'LSSW1TDALROFVNKOGLfPIGRJILTKENJLSFLwYFLVIADRIF9CSVASFAS
HTIHLCpVIFEEM.IDOALIl.LI9tSPRSF'iGK'iC!'F


RRLDVCRAYIVEQTLKTWYDHPYVDYFKSRLEOCMKVLVKACSOILDALIAI&ARPALPGEFSOMFIlIGKIOLVpAFJ
IIONLIVAENIDAFRIAQT!!P0


GNPSIDCIOEINTLIIF~i.IIFLEVWIDFPEEEOPDLLVp0EKI0Ni1L1lIVmFI88f0lDO


CPr>_08~8 936292 935267
RLAOGTSLILAGKPNVC%SSLIJ4ALWLaIMIVTHIPCTtROILEEOiditOCIUtIRLLDT


ysCJ-YOp Transloeation J
AGORT~IDCDGI&PALS11MEF~1DCILWVIDATOPL6DLPKILtZI~BILtJ11U1DLT


IKRriIWIMVRRSISFCLFFLKfLLCCTSCNSRSLIVHCLPGREANEIWLLVSKGIIJNIOKPPPFLOTSLPOFAISAI
tiGECL'lpVKQALIQSAMOKOEAGKTSRVFLVS87UOAlIiALVAR


LPQAAAATAGMTLGOIA4VDIAVPSJVpITEAIJLILNOAGLPPIOfGTSLLDLp'AirpCLVPSELCLI~IpQNLYLO
PPEIIJ1LELREUNSIGMLSCKIV'IESILGIfSI~'C1GK


OEKIRYOEGLSEOMASTIRIOIDCWD71SVOISFTTENJS~lLPLTASVYIKI~rVLDNPNS


IMVSKIKRLIASAVPGLVPENVSWS>RIAJLYSDITINGDNOLTLtIDYVSVNCIIWCRSCPeS'0139 9N~30
950159


LTKFRLIFYVLILILPVISCGLLWViWKTHTLINtMOCl'10;FFNPTPYT1(N71LE71KKAEGpsdD-
Phosphatidylavsine Decarboxylase


AJN1DKEIOIEDiIvDSOGESIQiALTSDKDSSDIfD7LP0GSNEIE:11PLfIVBRt~.VQXPOYIDRITIDtRVIEP
IFYEKTMLFLYNSKLGKXt.SVPLSINPI18RIY


frWI.OKCbyl1'RRIQIRPFl84RYKISEKELTKPVADF'1'SFNDFITRJ(LKPWIPIV~KLVFI


CPeL0829 936729 937198
TPVDOAYLVYPNVSCPDKlIMfSKJLPSLPR3.LW~LTKLYANGSIV1MLIPfDKIIIIFN


No robust hwsolop Dresent in
Genebenk/E!~LFPCDCLPQKTACV51G11LFSVIIPLAVKDNFILFCENKRTVTVLCC6pIrGKVLYLLVCR111V
as of 11/7/98


iCYICFVpTLAKSfYINIRDSRFYSWL.CFI
GSIVpTISPNOTYAKDDEKGFFAItOGSTVILLFLPNAIRFONDLLID'ISRIBPCfRCIJbQ
IYKT)YCE


FFLANAKWPLVPACYRRVRGImfYiSPLVDLVILFPWlr1'KD6RYSPCSMTII'CICRSIVESIlaRiDfIELI


CIPWSTLFGIGRFCAVWCVGFSCSTFDKIYNTIVAVLGILGLGILTFILRIIPSVLHt.


pVwPLFKCYS CPtI,",0810 950111 951541


CT700 hypothetical protein


CPn_0830 937339 977959
ISaRNtJCILKTFIGIAKRDKSOILwNIMwLVIWAt.AASL71IALVA1~YYRlYYlIIItYAV


No robust homoloQ present in
Genebank/ti'>8LOVIRHVRL3NELKLWALAEOQLLPILKIOtSYRROCLFItYlQIILRIDpRtE681JQ.LAlAI
C
as of 11/7/98


DSCSFLLPCTEYEAQTFPOVFSKVWYKYXSSRI:.LIALLYNITLVIGLIFINKKYLCOKKLG~PYFFLCIAYKAYRFG
AFIfECAOAFASVpQOGf'EEEDAAKYASALVIILG0L0ARC


GRVILKIY(~EEFFMTERFPSIGAGYLRVRNIWSVLFPFEDLtC.VCPSVPKDFPLSAFSLI6PWISPLSNOETFVrIO
ttIYITSKRYKDAI
n


KYL17CLIYWSYLESIPVVGAFFPSIGRLFAMWCiEDFPGSIFSRIYNTIVCVLCILGLGISSYAKAGKLI"RIILLSN
PVYKLEALFNIGLCEOKLGRFGKALLIYOSSDGWBRCDAiiJIKY


IMFILRI IFfLLTLPFWLISCLKSSM
AAMAAMDORDYVLAEPCWCL.IILRCSI'FAKDYKCCIGYCFSLCRLRKYCpIIENVYCQ.ION


FPDCLTACKAIAWLCGVCYATLLDSEEGIXYAIDTAVELtkiSCETLELLSACEARCCHFDA


CPn_0831 938219 938174
AYEIOSFLSSPDTSLOEKORRSOILR1LRKKLPI1~HNIVEVDALLAA


No robust homolog present in Cenebenk/CM7fL
as of 11/7/98


NKRKN1TVLIAKSESEGAFFEATpNYPTIQpGYQLVRIREHNLSVRAHFDLSLSLDASVNPCPr>_0811 951719
951610


M . secA-TraneloCase SecA


IKRHIG.CF(.IfRFFGSSOERILKKPOKLVDtIVNIYD~.TPLSDDfLRNKTAELKOItYpNG


CPn_0832 979750 .938827
ESLDSNLPEAYCW101VCRRLAGTPVEVSGYNORWpMtIPYDVOILGAIAl0t1(GFIT~Idt


lipA-Lipoate Synthecase
CEGKTT?AVtiPLYLNAL1GXPVHLVTVNDYLAORDCLWVGSVLRNLGLTTGVLV&G?LGE


VMItCRpTLNTDQPRVRKKLPERFPKwI)QRPLPOGSAFNATDATIKRSGIIPCVGEEALCPNKRKKIYOCDWYC'fAS
EFGFDYLRONSIATRLEEpVGRCYYFAIIDCVDSILIDfJIRI'PL


RACWSRKTATYLAIGIriICTRSCSFCNiGNSKTPPALDPTEPERIaLSAKEi.GLKNWITIISGPGEKNNPVYFELKF
JfVASWYLOKELCSRIALCARRGLDSFGpVDILPKOKKVLEC


MVARDDLEDCCAQCLVDIIOKLREELppJITTE4~.ISEFCRSLWLVSKGMPI1JRVLRRVREHPDLRANIDKWDVYYN
AEpNKCCSLERLSBLYII
\SDFOGNVSALHTLLOSCITIYMOiV


ETVARLSPWRHKATYARSMFYLEOAANYLPDLKIK.iCINVGLGF11DGEVKQTLODLASIVDEHNNDFELTDKGMOOW
VEYAGGSTEEFVlIIDMCNEYALIENDETLSPADKINKKIAI3r


GVRIVTI~,.OYLRPSRKHt.QVKSYVIPETFDYYRRVGEAMGLFVYJ1GPFYR55FNADMILAEEDI'LL!FaPAIiC
LRpLLRAOLIldERLriIDYIVRDDOIVIIDENIGRpOPGRRFSECL11pJ1I


SVQIHIASA
EAKI'NVI'IRKCSO'L'LATVTLDNFFRLYEKIJ1GM'CCI'AITESREFKEIYNLYVLQNpIFKP


CLRIDNtR7EFYITfEREKYHAIVNEIATItK:KCNPILVGTG~VEVSEKLSRILRpNRIEHT


CPn_0933 !11171 979717
VLNAKtRfAQFJIEIIACACIfLGAVTVATll21J1GACTDIKLDtdEAVIVGGGiVIGTTR1108RR


lpdA-Lipoamide oehydrogenase
IDROLRGACARLGDPGAANPFLSFEDRVIRLFASPRLN'ILIRNFRPP6DGYISDPMFNRL


RCVLFEILITVSEISA1'pEFDCWIGAGPSCYVrIIITAAOSKLATALIEEDQACGi'CLNRGIETADKRVODRNY'tI
RKFnLEIfDDVIWKOR0AI1N1PRNDVLtIll6$VFDLAKEIICHVSLM


CIPSKALIAGANVt/SHIKHAEOFCINVOGYTIDYPAMAKRKtnVVOCIRf?CLECLIRSNKVASL'MSDROFKLWL'L
PNLEEWITSSFPIAtNIEELROLKDTDSIAEKIAAELLOEFOVR


ITVLKCT,3LVSSTEV!(VIGOLiITIIKJWHIILLT'C3EPRPFPGVPFSSRILSSTCILELFDHMVE.LSKAGCEEL
OASAICRI1WRSVMVMHIDEOWRIHLVDMDLLRSlYGLRTIIDQK


EVLPKKWIICCGVICCEFASLFHTLGVEITVIEALGHILAVNNKEVSOTYTNKFTKQGIDPLLEFKHESFt.LFESLIR
DIRITIARHLFRLELTVEPNPRVNNVIPTVATSFt0a411NfIC


RILTKA3ISAIEES~OVRITVNOpVEEFDYVLtII:ROPNTJ15IGLOC1ALVIRDDRCVIPLELT'llrDSEDOD


PVDETlIATNVPNIYAIGDITGKWLIJUIVAStIQCS'IARKNISGHNEVMDYSAIPSVIFTHP


EIAMVG4:LOFAEQ(NiLPAKLTKFPFKAICKAVaG:115DGFMIVSHEITOOILGAWIGI:Pn_OS142 vR5015
75A710


PIIASSLt.EMTIrIIRNELTLt'CIYETVIIAHPTL_EV'~ALLATNHPLHFPFK3~.T702 nyptttnHtw,O
prucmn /frame-ahitt
with OBI31


KYYTFFTI.~.A:IPW::NL ALICfI::EPEYIY:NQLLKTQ.iL(.TTtNDTLLNAPKDFPlISKIIDKN


''fr~!IHtA nAISAA '1IGOta
ILFI:f/dQfrL.::ll%AQFLIN?IRRKFWIF'PINOOVW.~.EWLPFI


't'. :r. Ilyh.tr It.:t ir:a l Iltrtr.:
tn


t:IS,ADFANETF110RTCWKt'.lX:::V::MIIVtr;!:FY('.'\FVrDPf'VA:XX:FS::r.'HIt:Pn 4YA
: $H ~ wSS~ fD 't.A'rtA
:FPECA.iK


N.FAFGLF'AV:.::EIAtIf:AVV:Iy?NP1'OFTNKOVIphW.~iR.0:1dPL1ALF47hLLAFAFLILr1'7n:
r.ylxtrhu i.:.U prntam trr.,mr...~.hitr
wir.h O9A11


Lt':'ftr.7:LVL'IWIKNAAYIO',:I U: tIKtIKL: :
:'S~IrtFKVICJYItVYS>PI'(IPDIt11Jt1EI)(.(.DN.~.FJ1A::LDKYr~CIr;V'IVEFJJrOQG
V1VAYRCYAK:FL


GLLI'/L/:F':VFKlf1'tllVftCt


.:1't. IIH :'. 'IAL4'IH 'tA~LIAv J
m,! 1 ::WI/::NI I.unilY heli.:ar:a 'Ftt 4RAA ~N~ wv.'P.I ..'hl
,


tlNtIIVLFIUrtIFRt~7AM011f.LJIHRKETWTiFY'Et::~?titll9Uit'u\I'EC:YWL~TLKWDIDm
%ptw:r.TF.c:.:m:Tt'-ItualuI Irr..n.


Nla'F.lt:.':a:(IX:f7'L'LIIIXiS\YFAVYpAU:IJIfWILYFHII:aWIAVF::11FFLD::IPWA~YNHIF
TTHLKIALIGRIMV1:K::::l.l'DII1!.'YI::IAIVfI::I~%.TfRI*LYt:FIJIAPV:VFAVV


:1:HV'ITt.E:.:T'SIITLTIFR.'(.::I:EVFQOWLRTIIL\.~EfaT'JFTN!"PF'LK::AL'lR'fAKKFFF
LIfM'7:/L11N::ED'IFqKlli'IN~~AlaywlYFJviH/i.Lt:Jti)ITr'3tTEF.DN11.AKLLG'LILKPL


NF:FT:Af?S:flt:l?1::47ra'f~.:IIF::LOY1'jiLVFKAFIt.:FtTf.f:DIFIKf.FalIIT;:LFIJI/
::ltptLLVAtIYADI:H~EELt~IHETYK1~:11LI'P!I:."PAllf)KII(Uft.tI~RIKLVMILPEPREEEEE


111


CA 02350775 2001-05-11
WO 00/27994 PGT/US99/26923
i:


:LEEV:VDfIIF.E~EAALP.'aft'fPOf:LVITflC:F:.LNt.~IYROLTENNLPMf:'.
'EILWIRSFQNC:VNC'."f:A.~,I~:::,f".'!!tRLr""...:.::
r~f.'TLPESPCCAPICTLKIAL.IGRP


tM:K:'.'.:IfNr:LI.NEERCIIONCP4T!'RDftLOILY:NK>XtOYLFIDTAGLRKlIKSVKNSIEfY1'YOCCA
TiftaISYCT.TI'7AKCNYLDALNOEKSYWOAR~F:L
"DQ'fDOFATNI~v5


YIIiw:RTEKA1::RADIt:LL;ttOATOKL:w(EKRI::'LI:iKRKKPHIILINKtiDLLEIyRtKGTZYRGLDLFK
IfIKIRICI!>pIFL~I/RLRI
PI~, :illY~,~'L.'p 'w"I


EHY''.KOLRATOPYLfbAFJILCt.iJITTKRNLKKIF:IIIDtLHIIWSNKVPt'PtVFfKTL~ISAt
fir'rtQIATFO'tpKH9CLP5LI:KYPIyI'NK711FIK
PLONf~S fOkTt.'PTOONV


LHRMIPOV ICrRRLR IYfA
ICKT"TPGOFLLFIASIGI~IIYS<iK7IAKYIltELIKEITTFOSADLYYSL.iIYLKCIItR.pAVAOP
ItIAKSLLTKHYEYYLKIfCLKSSFNLYCI LGKAVG1:.


PfDLEfICEKMCPtIN
NDLKTRANADITRCNIIIKAAIDKtILVEIKAOiIELSK.~aCTRtLI."_~.'_TNfKS(;SOIw.lANL


SCL(~'FL;iCLTLKAVNDFNATYFJ1F:AEIF~tPfNM~IItRCLATFLiFVtOIxC~CITPGC


''Fn nrl.t5 n5ql il ~5~a5r1
OOOLLOANE.S!'.QCOF3'?F.~rNQOftILfLESa~ANQQESIfGVSAAL~LLNpNVSKIJIRIIIKS


.. .,:,.., , . HaO,,~.~-
,



::.li.~:'.:' . t,.r~,.. ,.:Ih:YrJA'IF'.":::":Pl7Wl.Mt:n., ..~., , -
3


RPLEDLDIATNAs'PTIVSTtPPtriII::LCJAFGIIIJKpOCRLfEVATFRSOGtYKDCRHPiT7l~
nypotnecvca: Protean


ORIIFSSI9tGDALRRDFIVttCNYYDPfEDKVFDfYCIRDILKl07IRAIGNPRLRFSEDKNINNPKIAUWSLPLTAJ
APVFEESYffPa'Va:~.11DYVOAT'lGSPIILTVLKDVIKGwIR.D


LRILRAIRFSSSLu'FCLDP'ITLRAILXLAPALVNSYSPERINOGJIKIQ.IOfOPI'GALSLLIGKiIFL.T~OCFI
N'ILTLU1IIQA.iIrIDpSSRFSRKKEtKIIItQFIILtIUIAfOMTklSG


LKLKVLZFIFPCLRDIPYS4LRTTZLFARKfHPTIIPPILFLLPLfnGVStWITVJVCRVpPI7IDPVADKItPLOSAF
AYVLLOKYIPAOttALYALCRELHLSGYApIILFSPLLISIIKS


LRISNKEt.KLIESNYEALPNfQNpSGNRVPWANFL~ISPfIIPLFLELfS11L4KDPSR00HFINSAPINYNIGSYIS
OTS<.TANFAYCY!?IILSRYt~IILVSpCRLDIAB'1VK111GItWIIiA


ISRVptLESRLEOFILRIKTSSPWSAPDLZ.1KGISPGRLtGt7f.LRRJItILSZENCLDKSVKJWVSL'tDROKKCI
ECIIASYTKSLOVINTOLTDVITI'FiI,ASITFVPGL~I7fDISYRIV


EKZLLLL.OLKGtsec O~.SI
IAL4NDL1M.VDGKVDITTAVNOGLLNFFT1YL?DOpNYCpt~tpTpptIlLtx.E


LIWpppWSLVSASL%LU7CNY7TVI9GF10~t


CPeL0846 9597!7 95A11~


clpX-CLP Protease ATPase CPn_0851 97119 971106


RENHMtKXNLT:CSFCGRSLKIriItKLIJIGPSVIfICDYCIKLCSCILDKKPSSTISSAPVSEatopB-Ourer
l4smbrarte Protein 8


TPSCPSDLRVLTPKEIKKNIDEYVICOERAJUCtIAVAVYNNYKRIPALLF01KOVSYGKSNCPTDINSKNLKNLRLAT
LSfSMFfCZVSSPAVYALGAtZJPMPVLPC1R4PE0'tWICA!'OL


VLLLGP'ICSGKTLZAKTLAKILOVPFTIAMITLTtAGYVCEDVO~tIVLRLf~AADYWACNSYOLFMI.IVGiLKtGt
r' rlf'DYVfSLSANITNVPtIITSVTfSG.~L"t'fPfZTST'1'IfNb'DFD


RAEpGIIYIDEIDKICRTlJINVSITRDVSGmCVCOALLKIVifXiITANVPDKGGRKHPNpEIIOiSSISSSMATIAL
OtTSPAAIPLt~IAPfLKOYYRLPWiIYRDITfIIPGtfA


YIRVFTtENILFIVCCJ1PYNLDKIIAKAIGK'f1"aCFSD00ADL.1~KTRDHLLJtKVF.TtDLIE.SLYIDCLI~I
C15DYCIVAIGLSLOKVLiiKDNSFVCVSADYRNCSSPIMYIIVYNKJWPE.


AFGNIPEfVGRFNCIVNCEELSLDELVAILTtP'L'N71IVKOYNLLFAt&NVKLVFIOCFALYIYFD11TDCNLSYKt
SISIISIGISTYIaIDYVLPYASYSIGNTSRKAPSOSPTELH~FlMFIC


AIAKKAIfQAKIGAPALQlILFtiLfttDII9PEIP5DYNGINI0E0TIAt2DtAPZIIRRTPFKIRKITNFDRVNFCF
LZTC:ZSMiFYYSV~RWCYORAINITSGLpF


FaIA
CPtL0855 971001 972991


CPtL0847 960019 959787 ppdA-Glycerol-3-P Dehydro0enase


elPP-CLP Protease Subutfic
GGBBIpNIGYLQ83IWCPCLASLIJ1NKGYPWANSRNPpLIKOLQtERRNPLAPNWISPN


KLFDEEfOHTL.VPYVVEDTGRGERIIFmIYSRLLImAIVNIGQEITtPLitHIYIApLLFLItISP1TDM0J1INNAE
NIV~11TSAGIRPVALpLKOZTDLSVPFVITSI~ItpMSiLIi$E


SEDPIQ(OICIFINSPCGYITAGLAIYDTIRFLGCDVNrYCIGOAASIkiiILLLSA01'l~lINLLViGDSV'PPYIG
IfL&GPSIJIKlYiI4GSPCSWtISAYOSOTLIIpINl~IISLPIANYP


NALPNSRMttIHOPSGGIICTSADICLOAAtZLTLIOUILANILSECTGQPVEKIIF~SDtDHTDIIOGAALG~LIONI
AIAGGIA~LRfCaO~tAKAGLVTRGLHtMiKLJUItI~CKPiTW


FFNGAEE71ZSYGLIIIKWfSAKETNKDfSST
GLiKiGDL.CVSCFSPSSPNf.ItFCNLLJ1QGLTFmAKAKIClNVI9GAYTJILSJ1YQVAIOWK


ILIIQITDGZYRVLYEM.Da.KtGIALLtARNtKCEFL


CPn_,0818 961556 960177


cig/muri-Tripper Factor-pepcidyl-ProlylCPeL0856 975110 977995
isoaerasa


VOASSPAFPFKSNJOCGCLVPRSLSNEOfSVOLiFSPGCIVSAWKVSP~TKLFa071LIfA0X-1 Homolo0-WP-
Glucose Pyrophosphorylase


KIKKEITLPCPRKGXAPDINIASRYPfINRIQLGC.VTOQAYfUILSZYCDNRPLSPKAVRGSRLIwNVRLTVIffESV
YSPSAIqfVNSL7IDIfLKAINOEHILDINPSLSPKQppRLf00LTS


SNSITQFt)L0LG7UNEFSYCAFpAISDLPWSiLSLPOHE7N1.SEZSDSDIEKGLTHIOI~FVDZt~fAlOppOLLSS
PTAIIJmFNPITSF1ISSGtDPGtANAGTTLLKtKINAL11VLR~Q


ATKTPVERPSODGDFtSISLtNSKSNDtNIISSMIP:NKYFKLSIIiA~'LWKLINLCISG4RtxCDOPK~fPVSPIK1
0IPLfOLVALKVRAASKL7tCOPLPLAlIftBPiXfROTRSFF


TCHRWtTITSPEIQSFLRGDTLT!'fVN7IVINSIPEIDD6KARpIQAtSLDDLtUIKLRILSIISnFtG.DPNO~V~F
COPWPLL?LSGDLPLtf7l~'1'i.AIGPt~NCCZ11TLLYfif;NYAC


CLEKC~11~CCLOKRFSEAEOALAIIt.VDFCLPI'SLLLERISLITREKLLtiARLIOYCSDttNIOfRGItHVBVIP
I~PLiILPFWELOCFHANSt~B~tEVTIKAJILRpfIIILDIICILYKSNDS


LLIfRKSELIKEAEtD~ATKALICLLFLTFDCIFSDCU.TISRtGIAYI~SRLRfOpOPPKCKfSVILYSLIPONEAF1
1L1~DGKLK7fCL7WZGLYCL~FIRIWIYOpLPLYKVNKIWt


DIS~1'LQELVNSARDRLTYSKAIEIM.RKASLL.ASTPSJ1QL.GI~'SLaI610~OBiKICCFIFDLfRYSDHCQfL
VYPROLCPAPLIOa.~NIISPDt111EQRLS
IXtAIpLFHKVlGKI(LSPNTTPLLFJIDFYYPSTSTSLNWFaIK


AFFCEPFfGB
CPn_0819 961752 965~A5 ,


mocl/snt-SwF/SNF family hsliease CPtL,0157 975108 975792


ADYIINSYSRCF~~LMWt~RDFSANILODCKKLFtOGJIVItfAICZL>iE~E'1'VCISAQVRCT716
hypoehecieal protein


CLYCNIYECEIE111>RStBO'ISIDSNCpCSYHYDCONIVALLfYLlOIfFN~4VIlAYJIR>iRDI.IJiiJIpYIK
TARGISRI141DRL.G6LSLZLKVKIHKYLDTLIOJpKRLALTVSRNI0f1'pa0!


ETDNCINLLVItKiLIfETFYAAATKtEERKDRAtpKtIAOtdQ.GIYFlIFISRONIKNYDILLEYLIITLOSSLYXQ
OSLSLRFLEINNOOI.O~.I~tR


EKDBIILLiIVLTYSVNEDTfAPIINDPIEPOLVLRLPCRSKPFYISNIRTFLCGVLYOtPIVKIIACIKNNKYSKDOL
IOT


i~IGRRfFfTIQStNABt>AKI,II7LLZ ALOVILiIIDPtN


CLIIDNGf70SI~ttSFSGLPCCNL$EPIL~ISLTPCPn,pSSB 977115 975757
VD


ODi110PlroTNLLESL1APGZIHHFVYNWFSpOIKRIWLRSFSRtJIDLIIPEALIGSIROiAtlii-Flagellum-
spscifie ATP Synchase


LPV!~pIYIIEIANVHLLNSFVTLPYVDEYRJ1ICI7NSYLDGLLEAIfLIIPLYGSLRVPAASLiIIDISt'fRNQRR
TRPSTFCFDSIB~INLNKLKLtIINNWQPYRACCLLSKVSfrTILILYDGLSiICL


LOYOOVRAFISDLGILARNLVLERKNLL6VFSGPIYDCROGAPRVKSLKKIY6l7ffETIPCCLQ(ISSCImPNLLIEY
:v'FNNNTTL.taISLSPLHSVAL.CI'EVLPLRRPPSWLSDNii.G


ANQIIRITFNCPLNLSCpFIYDETIFB.SFRtxiSDRVLDJ1PCNPI.PKfItRKPt.LSLPPSPl0~0lpPIDpIFP'
tCIK7IIDtIFLTL~RI
I


SAKKRFLLLPKAGOQSNGTRRGKVNSGKLPCILVLpL~tIAPWOIfNtIGfKVLDLX.VQGVISCPOSGKS&LLS71IA
LGSKSTINVIALIGAtCRtVREYIflOfSNALKp~tTZIIAAP


KCPLNSLTCISLDOFEJ1LPVNPSNSERLIEICKOIRGLIEFDfQDVPOCIOATLRSYpTCAtItTAPTKVZAGRAAIf
I'IMYFRLOCNEVLFIHaSLSRWIAALpI.IIALARGLTLSNpYA


GVfBILERLRKFOiLNGILADDtI~IGKTLOAIZAVTDSKLEKGSCCSLIVCPI'SLVYNNI~EASVFHFNSE!'1'LM
GI~IJOOfGSITJ1LYAILYYPKNPDIFTDYLKSLLDfNFFLTSOGLALLI


fRKfNPEPRTLVIDGVPSORRXCLTAtaDRqVAITSYHt.LpKOVB.YItSFRFDYWLDtASPPIDILSSLSRSApALA
LPHNYIIAAERL.RSLLKVYNtALDIIHLCJ1Y1'PGDDEII.OKAV


HHIKNRTTRNAKSVIQiIOSDNRLILTCTPItPISLtEWSLFDFIJIPGLLSSYORfVGKYIKLLPSIKAPLAQPLSSY
CYLCBfI'LIfOLtALAOS


RTC,11YNGNKAONFNALXIONSPFILRPF9fEtM.KDLPPVSEILYHCHLTBSOKELYOSYA


ASAKOELSRLVKOEGFERINIIM.ATL?ALKQICCNP11IPAKOApEpGD.SA1CY0M.1~LL
97597 977055
CPt~0A59


SSWDSGHNIYVFSOYTKNLCII1CKDLESRGIpFVYLOCSTKNRLDLVNOFNEDPSLLVFCT718 hypothatieal
Prot:ain


LISLKAGCTCLNL4CAD1VIHYL7tiMNPAVENQATDRVNRZGOSRSVSSYKLVTIiJ'1'IEEVfLYtt'POSPGSLS
pSHLPNPHDPWDTtP'tSLPEDPNOKASCELNSLVNt.FRK<SINLLS


KILTLQNRKKSLVKKVINSDDlWSKLTWEEVLELLpIEVLK!lVppLKPDIrz'!m.-
ZCEKFLYKXLENPOELALLLSTAIARHTTLRSLTPIKVFLN


PLIL.KTLTOWISTHELPNIKHAEFPPOTSCARSGFKIETPNCILRQLISELLONLLbvLT


CPn_0A50 96575 ?66790 . A


mreB-Rod Shape Protein-Sugar Kinase


LCKKYwNCCRYDFNSPNRNLFKLKNFSNRLYNRALGRFOKVFNFfSCNVCIOt.CCANfLVCPn_0860 x'8679
977608


YVRGRGIVLSEPSVVAVDAOTHAVLAVGHKJUtAM.GKTPRKINAVRPMtDCVIADFEIAEIliF-flagellar N-
Ri:fg Protein


CML.KALIKRVTPSRSVFRPRILIAVPSGITGVEKRJ1VEOSJ1LNAGAOEYILIEEPN71AAIRTLVFfONLAKKLTA
LCISiLCCLLIG31IVSCAILfGRSSNPSt.APTQVKTEKT9CNnK.K


..rvpLPVHEPAASNIIDICGC?EIAIISLCCIVESRSLItIACDEfDECIINYNRRTYNLNLTONLTtPKLIESLTKK
ECLEKDLTSFNPIASAINAIALSTEDOUNSPI1ILSVILTLRKe6


ICPRTAEEIKITIG.iAYPhiOpELENEVRGRDp4ACLPITKRINSVEIADCLAEPIQCIISLTPSL.LFSITDYLC8.
3L1LKRLNISLSt7NLQILYIPFw~ITVNSLPtN'ILIDIYtGKIFP


ECVRLTLEKCPPELSAOLVERGNVtJ~CALIKGLDKALSKNTGLSVtTAPHPLLAVCLCKEMPALAYNAKJIOCPTt.C
LTt~ItNYIIWLTKEtStKIVAHTKHYLYONYUDSYDtVIETL


TcKALEHLDOFKKRKCNLV
PFARt.QNItKSFPAKVLIC:HILVISLMI'/ALASFYLARHAYERVSPEPRKIKRCINISKL


LEIIOKESPLKIALLi.S1 L: PKlfAPaLLNRLPEOLIWCVGIfYKL


CPn 0951 ?66778 068195


PckA-Phospnoenolpyruvace Carboxykinas~CPt1_0961 ~-975Z 979925


REP.~IMVWSTNLKHECtJfSWIDtVAKLTTPKDIRLCDCSOrtEYDELLTthESTLTMIRLnitU-NitU-
related p:'oceln


NPEFNFNCFLVRSSADDYARVEQfTFICTSTfAEaGPTNNWRDPOFIOtRELHOLFRCCNOASYPFTWKPLJftLPLEF
NIFWSSLSAK'MKKFLTPHCACTFSEEDAFJ1KLNILYI9IIpGN


r;RTLlIVPFCMGPLDSPF.~.IVCVELTDSPYV~IC9~ItIlftRFGDDVLRSI.CI'~IfIFLKCLHRLNaKZfIFr
iK.VDtIKNCt'LLDAKFQYF.HPYLIPWPJ11R:NLVCGKSYSLAYIOfILODI


::VGKPL.:PCEADII~IPCNPKSNRIVHFODDSSVMSP''SSCYGGNALi.CICKCVALRL71SYW1OKSLRVHJWCP
ALPED::I: LYitPVIDALDTAVEOCLEIPLEDCSLpf4711~uPNNL~CMN


K::pcA.IWEHNLtIGITNPECKKKYFSASFPSACCKTttLANt~IPKLPGWKLECtrppIAWIPYSOSDWEALTHEOK
t.YALR.tTLAEKT:PYtANCIfCEViYESLENFTVTLAYSQiC90CP


HH:RIYiPLYAVtIPtYCFFCVAFOTrERTNPNlIIrITCRSNSIFTNVALTADCDVYM80LTE3SLG3'lIlJSICOL
LRAY IfELpVKVDE3~L.NL:HP


OPPEPLTIKPWItPCiC;iPIAHPtI.SRFTAPLRCCPSLDPE4MSPGCVPLDAIIFGGRRS


tTltL'flFAt.:~lF9KKY1'Ii.~CllS'.':ITTAAIW7CL.aCLPHDPFNILPFI.'GYHMAYYfpNWL~Pft_0
9f2 ,~Jn24 >7'l7.~..


.:IIFtIR.~.LKLCNLFt:VHNFRNFBJpI:EFLWW:FCENLI'ILEWIfQRTDCLEDIAERTPICYyfttwttit5-
rFt.m..l tc.~chin


t.l?IfQYFtIIIX:I.HLDL.~rrVQELF.~VOAtY'1J(J1EVEIdt':EYLKLA:~Ot.'L'~~OITDELLRLK'i
PtiTIFRLTf7CKT::r.'f:'NEKIQtIRKAFPIFWLtRIQVAtIPSERVKE:9'AIJI::OIPdLPpG


::1:1 Kh:Y. :.AIJf IhJiKTCE.~.I RUL>.t:LY(fi:t1
L FRF'/PtIFt'IFMI I VLAALVt?IL:WPtt:RNll
t ILPAH


U)pLLItL~LCRHOt:tt:IT':'tJIIlV181F.':P
IVEF'.r~LtETL::PR::f.LF::It:AAHC:LT1:VIQP


~'irn nHS:: .wrt~7A ,n~a.:f!
lLPL4:LCKDRRtLI~II.CI;UItJVL\FLTI'EIIiIADIITF::.~.AAIJ,Xi~It:::It7CIFIRKGL


":'W l hytxmtx:r ruU Protoiu t7tVf:..'IiFPPFfPSA::I:
F:'.:\t'MNp'1'N:F.ERI::ALI'f.FTFI1T:.Nl~'KKLIUELO::VLr::I
~


iY.lattU'flYt.rt7rV1'14~i'::YINFTPNVITAf.::.~ltlflP::AfEt:a'.::1LFFOELODKIIOCiL
.AF::EVONRLPNIWAAfPDttAEa~:FiILllr/J:fYI'.'aJ:YERF6~F4WVi.tlNwtiI::PP


LKIIAII:LVVF:L::AF:ALN('A~1VOT::I::YLPTEE:i'.'.RCS:L~.N:LIDR'1'tlPt'h'1'ODfVKAI
LQll:H::AU1F.~.LTER::KUI.t::::KLAH.WIILLIKIIIa'ILLt::~:::


Nt,FIFFT::K t FVFI :LINfVFK::1't.~:lTPt'Ff:
IDP::NFE::A ( I LNY ITLIJJNLhPKFAACST


I'rnAU'InALIAt.t:UFVKRIFJ1LKMIMP1W::Itb'iIAFWUF:IFfPt'INMIQV4:lPVTDYW':Pry
nH67 t~YISjOWl,y ~.ylxtt ~Y~3
I


VyINd.::INITAAytIK~WI.KNPf.':ILK(llLtlf\AJt'fftJA'MIYPADAEYNARMCNIOSLIspM
(hoePtKxFIYm't.y. N.H.t...


112


CA 02350775 2001-05-11
WO 00/27994 PCT/US99l26923



FHMALLILLPiIG0:rIMMEKNLf.iG4JVDLPL...~pOCHLLIOt-SROSfMCL:ptLP::.
XfFKCF':I'AIIKJL:.FF:..~.'tK.'.F~.::.It:LESAL1:.
.fSACAAIONLPIDCIlT3RVR f;'...1RL
iLRR
'
7FEOOEIf:w
Ih'LRTFI'E:
PIAKA:
'
'
'


.;IXtALLWnINNSKKIPYIVF1EDPIW(ENSRI'I:aAEEtTN~IPLY053AWFRNYGELO..
.
..
.
.
.
rTD'~.FOS
t
Ft!P1
RHLCC3AKAf:
Llt
~
~'

'
'~T
~


KNKKOTAEOF'iEERVKLYIItR,iIKTAPPOGESLYD'CKORTLPYFEKNILPOtANGIWVFtf~
' .
' .
r~t~YD.WTL
/%'
'
'
OONATLCFJV':nPIF'.'T1ERNRLDFOC't'3R'~d~ftLVRCATC?l9L.i
~110~'PSD


EELYL iLELPCCKWVYQ~D~KIEKNPCF~
EAAiLVNS!"t't'IQCgItPLTIRGLP.iLVIGL..~VATFICo:
.~ANCNSLR.iLtNDLEKL 13P0~JRLRC,LYSTMLSLLVKS


LRSMREMWKOLLPOLTVLDFSEr~..SSCC:LDVFAEGIAVRiNtJJCAVSIN:.


ePn_ORFI 99!559 993371


y7bt:-Dradrcced Pteudouridine sYnituse
Yf;IIIJVt'KVRIIIKFLA.irYllAiPRKGDEILFSGSIrtVNGRVAECPFVL'IDPEDKVOVt~1'SCPn
11875 ~ ~77h7 't't44I2
-~..


.;l:u.m..,.. y,y.;YYf~l~:l'F:: -'vHLiYF":~':6L.'.i....
..,.,,....w
' -v....


_ _
...w=..-rr , vc r--.,nr, .. ,
.,.y.... ...,f.:_:A fi::r.-. . .f'~:F'IP:F:~!.i':!~th'!~'::
!:FVIY ? III:-.:.' :I . .f....F":.
SRRLFAP.KW':.iKL'fL.:VyANh'h:iAEK~:~t'.;,LEdCL.aYIaSAArw~.i::i4AL.i
K RLsRR
SD


tWSEGKKNEIRLFADAAf:FPLLELNAIRIGSLVL.GGLRYCEYRELTWELGTYN%L_
~ITOLSkI'FSOAit80
M
~
Y


,. ER11PELI
Ptt_0865 9811 II 987942 VOG
AAFASCL;.LDSCIY
CVIS6ffDf


~1'865 hypothetical protein
SPNGYVIYVIJIGSIFIGISLGJIIfCOLYYSVKSVLfS'hIYLL.'iYYIILEKRNALU1LSOLVGECPn"-0876
991113 995517


EDApSpKEIDFLSOCDKtSWMFLIG~tSYEIIPTFK~LLSFAVOCFLESIETI1~RdaOA-D-Alanine/Glycine
Pensease
'


AILCIEtifyiASIOJGFDFEI11AYEFJIVFJfYLKLRQMPi~ti~ISKLFRFLOVPSIRFSSSIR'fCLITG.Y1IE
0I1~IKLSTSFCVFPNILLIGCFLlfIKLRGLOf
IIOLIO.CFNiJ4.CbLLD


DSSSKANEVSSYGVAGILAf~7JlGIIGNIAGNAVAI~1C~PGALVWVWIaALi.CAIVpYJYG


0866 987191 982916
SYLGSKYRKP~'CEf'IOGPIJUCLAt~RItKIIJYGFF11LPTIhtLAFCACNCVptISCIVP
CPn


_
LCAGClPGKLLVGILLALWIPVI'7lGGt~BIRILRFSARVIPFIAGFYCI'SC~IILfONABA
birA-Biotin Synehatase "


hs~fKVIYYEIEEIPSTIILZtAKSYNIIIIrIDPYALTVIS1'KCOTAG't'GKFGKS~rIKSSKGDLLN1'ILPAIK
LICSSAPCIKACLIIGIGGYTLSOVIS1GINN1VNATDC
..SCIIVSILOANlKSI01


FCFFITDLfIIDVSRLFRLGTEJIWALCKDLGITEAICIIfWPNDVLVHGEKt.CGVLPC1'LPVPVVOCLVTLVPpVI
tMVIICSTT14.VLIVSGAY8SCA0GlLNVNSAFIO~tSLGSLGSVIVIL


ECLIGWtGIGLNI~I'1'KOALKDVCOPATSLOEILCNPIDLETtRELLIRNLLGVL4~ILANALPGY1'1'IL'itiF
ACAEKSIpYMIPGRRAM.WI.IfALYVLIIPLGCVIOIOtNIWILSD1'0


PDSLATKSNRGNL .
fS(RIVIL1JCI11LIALLKDVLSINRWALLttRECSVADPVRNLD1


0867 983105 981667 CPeL0877 995521 995982
CPn


_ ybeL family
roM-Rod Shape Protein
RRRDIOLLSPAF11YGAPIPRfYTCOCJ4GISPPLTFVDVPC11AQSL1LIVEDPDVPKEIRS
KYFRYVNSWVFLW


CIRIPa9liICFCHL11A0fiNFFYNINNFNILEIYSLINSNIIMIYHOCLWINNIVYNLSTLITNLAEGAEIFAVOGI
JIT~IKPVYDDPCpPWCQtIRYFFTLFALDV
LTI14.LSWVISShmPTANLIrI'SSKGLL11JKSINOLRIIFAt~IiVVFFIGYFDYNLfIOtW


AWVLYPtTlIGILVGLFFYpSVpNVNRNYRIPFItOISVOPSE~IGI~VIVIlQ.67fIL~RKAVLPEC~1'RDQLYF~
EFNIIEpAEIJ~1GTYEKS


DITSK'iTAFLiICLWALPPFLILKEPDtIJTALVLCPVTLTIFYLSNVNSLLVIfFrTWAT


IGIICSLLIFSCIVSNOKVKPYALKVIKEYQYERLSPSNIOtQRASLISIGIGGIRGPGiICCPtL0878 996660
995991


TGEFAGRGWLPYGYTDSVFSAhGttPCLt.GLLF1'LOLFYCLICIOCItTVAVATD~GIC.LSET Doolain
P~isin


AAGITVYLAIDM.INIS~'IOGLLpIl'LIfPLILISYOCSBVISTNASLLIILOSIYSHRFAKGCNStVS'fEPCSSI
NISL~D10MIDSOPYSLDR~BELLiIFRFLPSLV1'81WK11COOIClLC


Y
NItSnIRRLISPL7110iLGKLNKODLLCPPAPPVSVCWINANMGYGVFARDtGPYII'YIGEY


TGILPlIROAIf~fDCmIC!'RYPNPLF1'I.RYFI'IDSDKOta:M'RFINNSLDRiAIJIIGVFS


CPn_OB68 986733 981670
EGLFNVIIR'IYJIPIY11G0EICYHYOPLriRDIRKKREEFIPEF~


CPfL0879 997163 996615
yyaT-metal depefdenc hydsolase
YRIIfiKVBNOGFFpW&GSKCaNSAYIG1'DSCKILIDLGVSKOWI'RELi.SINIDptDIOA
IPIrllItHSONISGIKSFVKAYNTPIYCHLCI'AMt.CHLLDSfIPEFKItBZOSS!'aODLE
VOTFNVPttDAVOPVAFIPHYRE~CFC1'Dt~4~'SWITRELYDCDYLLII~PB.VR
OSORPDVYKIDtVLSRGHISNOCCGQLLOKIITPIa.KIC.YWtL8T0d0'171=LAISIYSE
SI115ITSIAPEGWIOGITSPIYFSRLCVJlCIII
CPtL0B80 999861 997111


. ttsK-Cell Division Protein FtsK


PtfIR~O(SRRPRLYfLPtJUIItASLYLFFIVCFSCLSLWSFNRDOPC1'Q~RIIGIi~QiifBS


IIPWLAVILHDGS'MNGIIrALRLL1CS
FLLYlFCiIAAFFIpLYFWLBFLYFRRTPRPLFtYKJWIFISLPECSAILLSIR.iPll~l't.


PALi.D1'JtLpKFIL~1PPVSYVGGIPFYLFYCCpSFCLKiiLIGSVG1'ALIIOfVM.IiVL


CPt~0869 987179 986658
YLODGI11LLIOOtTFODOHIKAFCSFFpI'CFIOfGKKLINRANYLPKPSVPFV8101P11Ca'K


CT778 hypothstieal protein
SOpSPRRVSETIILDCSISPLPOEEIPCSKKESTFLTPNpCKRFLTKIVtIpaUlx~lt


OfRiRFFFPICtS~'11'SDCPQt~ILAKIKtQDPNOHFICSRTPEDHIIOfVRDfDtRVCKCEPNTTIALS$1'PlyV
R6S1a3KSRAALPIC.KSLJ1VPCIDLPQYHLLBIDiRtJlRpfiLOAtLtAIUI


T110CPF7fNW~NALS1G1IFIFFIATLFFLIpi'NRALQVKSLISLCVGWI'FYftGCLKARKALIL1~'FLTSIOID
ADL.Q1IC9CPTLAAPatLPNSCVKVpKIKSLn~IDIAIitL0A8iIRII


w7lYl~.SHRSM.EGD'IEIElNt'D00CIlLRILFlI'AGFfmPLLOnIVEYVCSDbTLLLDTAPIPGKAA1K1IEIP
fPFPOAVNFRDLLEDYQKTNRKI4IPLLILIKKANDpiGIAOW'lflP


MIRES.YIRKmLPFIPLI00CSRIL~LCGWIIFLpLVLCISYTLALViSALNVLVL.SFLHLIIJ(DT1CSGKSVCINf
IVNSNINTTLPSEIKLVIIDp%KVELl'CYEOLPIIa.BPVITI


NAKIL70JDKISFJtVWVLCIFITSASI
ISBiaIKLLSREVYNALVWLVK~IBSRYEILRYLOLRNIOAFNSRTRNKTIFASYDItCIRt7?8/MIGI


IOEtSDLLLSSSODIETPIIRLAONAMVGIHLIIJITORpSREVITQLIKANPPlIII=FK


CPf>_0979 988A81 987118
VSNKVhISOIIIDEPG11H~1LIGN00lQ.VLLPSVPG1'IRAODAYICDEDINKVIOdCSIIPR


serS-Seryl tRNA Synihetase-2
't'OYVIpSFNAFDDSDSDNSGEKDPLPAQJ1KTLILQ1'GNAS1'1'FLARKLKICYARAASLID


TI'fNPI'QGFGGAVILPFSPISIJIRItIRKSCCSEKSSIYSIiFCTLLLtC~1E1'SlR.DIKIIRKOLtEARIIGP
S~CAKPAQILIONPLEG


TPEDCEIRLRIDCDPKISLEPVLSLDKEVROLKTDSt1T.0110RRLLSODIRKAK1'pCVDAT


NLIpEIII'LAADLEKIEOHLD010~IAQLNELLSNLPNYpJI~IPVBEDKAGtfOVIKSVDDLCPeL0881
1005616 1006309


PIFSFPPKHHLELNQELDILDfOAAAItTrC9L~rIPAYIOJRCVL.L6WALLTYNLQKpAANGFNo robust
homolog present in Genebank/l3lBL
as of 11/7/98


OLiJI.PPLLVIOfEILPGSGpIPKFOGpYYRVEDGppIfLYLIPTAEIIVIatGFRSODILTEKENKKFAVIMPVPID
NSSRNLQhIfPFSLEDLEQNAEFSP1'HOSAESSSLOLSIrISSAISSIiV


LPLYYAAGTPCFRAfaCrWGAOEItGLYRVNOFNKVH9FAFITPNODDIAYEIDILSIVCE7iEQLSSLVL~ISDPSSL
RDVPIFSAIYESSTFrI'PVPTPLVGVGYINDSOSOYYCtORES


LTLIU.PYRLSLLSTGDNSFTASKTIDAE1MLPG0KAlYbI7SSISOCTDFOSAASCTRYKWiLSOLLGSRRVEWYNOG
NFfIFASLIiJLCPRRPRRDPSPISLALLEtyIFIVFFLBNPPGS


DSQCKLQF171fCLNDSGIJ1TPRLLVAILF~BLpQADCSWIPEVLRPYhCCLEILi.PKDOTP7JPIFFW


CPn 0871 988766 989899 CPn_0882 1006169 1007101


ribD-Riboflavin Deaminase No robust homoloq present in
Genebank/t't~I.
as of 11/7/98


EYNE:DFSEQOLFtTIRRAIEIGEKGRITAPPNPiVVCCW1IQFNRIICDGFNAYJVGGPNAEEM'POVALLIOYFFCN
GAPYVREALRLTPHA~IIVWGICPSLYPENPRSLYYRVSr,DIGS


LAL~JASNPISGSDVYVSLEPCSHFCSCPPCANLLIaiKVSRVFVALVDPDPKVIIGpCIaRFDORGFVNSL1IETLPY
SSGSFGIEWISII'DPTPNFAIVNIFNRTAGINEVSRPNl0~1'E


Nt.ROAGIQVYVCiCESEAOASLOPYLYORTIWfPWI'ILKSAAS~FDGOVaDSpGK90NITCTSLIDIRDL3F~C6V~
irtDSLEOEFSLJiGIVCH71JCCVSIftVTSSPNIPYIIIpTLi.GGPE


PE~WIDteGKLRAESOAILVGSR71250DPNLTAROPO(stLYPKOPLRtJVLOSRGSVPPTST4AEAEFi~IPtFPNS
I'IDSLAEIlQBIWRISDAVSIIWIFPIVDTTYNGVWJNCIGPICI


KVFDK'tSPTLYVTTERCPFNYIKVLDSLDVPVLLTES1'PSGVDLHKV1'EYIJIpKKILpVLNCICSTFLtLTNPRS
RRORWRNLRIMVLCYRSLGSGIO~tLFDL51~NRNAMRiM'SCIYA


VEO~'fLHTSLLKERf'VNSLVLYSGPNILCDOKRPLVI'VIGtILLESAaPLTLKSSOILGNLYANV1'LttCWtVaI
00J1tt0YCFPSVRDAPYRYCLRNRYCLTpRNEDSi.Q1'IIDTR/pY!'lt1'


3LKW WELSPpVPEPIRN HLf00pNVAS I tldl.~'VFGLFFGF1IGLMlI'PCGLEIS
PSCRNDAAIMRT11C IF


CPn_0872~ 989903 991216 CPt~0983 . 1008901 1007577


ribAiribB-GCP Cyclohydratase i DH8Pdntpp/ogre-Phenolhydrolase/NADH
Synchase ubiquinone oxidoreliuctase


KERIFRVACLASESVNARESNIETREEVCSAHFVSLEMIEDLRAGKFVIWDEASREDELYELFIKSCIFII?IML:C:L
YFLCIASLLFCAIf3VIL.ACVILt~:RKLFIKVNPCKLKIND


CDLIIJ1GEKI'IYEIDfI'FLLOHTIGWCAALSQERLLSLDLPPNVKDNRCRFKTPtTVSIIDNEELTKTVSGy?TLL
Vf.LLziSCIPIPSPCCCKATCKOCKVRVVKNiIDBPLfTDR9TFSKR


MFYISYt'IGVSAADRTKWOLL10PKSKPEDFISPGNFFPLASSPCGVL1(RAGNTESTVDLOLttGWRt.sL'CCKVQ
tIDNSLEIEERYi11A3S1IlLTVLSNONVATPIKELWAVD~KPIP


Mff.ACLpPCITILAELVNEDYS!!'8tLP0ILEFARKNNIAVIPVTSIIANRNLSDRLVSKISFKPODYLOL'fVPa'
YKTNS3DWKp7IMPE'fYSDWEHFHLFOpVtDNSbLPADSANKAYSLA


3APLPTIYGDPfLHVYESLL.E~IQHLALVKCNVADK.3N'JLVRVH.iEC~'fGDLLCSKACDC~YPAELPTtKFNIR
L1TPPFtNCICPN.~.EIPWf:IK3SWF.~.LKPGDKITVSGPY~INKD


.EOL,i3AMSYtAF.KCTCVLVYLACOGGRGIGt.GHKVPJ1'1AL00NGYLTUDIINlJINCFPVDDDRPLiFLIi.w
k'r.::Ft,R:;HILDLLUIYHSKREIDLWYr:ARSLKtNIYOEYEfI4EAQPP


SREIGICAOILVDLKLTTLKLITHNPOKYFr:LOGF,LITERVPLPVRISF~NEpYLRTKNFFIYNL'JL::EFLCEOL
W~JpItODM'YTNFLh'MFNLC~I-:RLONPEDYLYY'/r~'PPWN


4fFN,HWLDLF't:CNNRVO .'.::ILKLLCD\'.;l:Ff ::a ILI?DFt;,:


:F'n ~)a7S n!IIINN v'11511 ~'.t91 pNN.I lit.1rsr.N Irlll'Nlrl')


ra.ERibityllum.Wine ::ynth.\n.f 1'.T7ll hy(tttth.tit:.ll ftfl.tain


f:aIhJTtr:ltB'IFFEYMARI.Kt:ItL:IAKNLpfAIlflC~ftFyAtIADALV::aQETFLKFl7G:iE~:fif:
ML::RIV9l:Fl.h'LL:::a.CLPAEEEALr~::Ktffi-/r)I'AVMIr\IAfI.I~YYFtI~WNtE~NRR


IJaXCIR'Jh:AFEftt'.TIKKW.:::a9IKFDAtVAf'.(T/LIrJCET(kIYW~tIVNtJVn1117It'..\L::K
AMKKKKNDtr\Kua?KV'1'AtA:fItTfVi7f.IPFJft'VtIlltA::r:Y.'/IVIJCIaII::EIt.Y('fI~NK
::


LEt'r:L1ITL::IV V1f':'.e\EtAWt,)R:xaK(:RHV:VS~fAIEMA'fLITy


r'.fn OHNS InIOn~~.U I~NI'r~ 1:


~:Ir._tIN%A '11116A n91'74n yycArHNA M.tlrVlrl.lrca..r::..


'1"7':': hypntlN,r i.:.tl Prnr.irt
A:al:l'M:7IT~Wt.'1'llht:Vma:::l'1J:al'I::Ix:LYYYE1:L1.INJl.h'AILVI~.:hhllAlLltt
.'::I


f.f::WJf.KILTKpRINtEF\::MWIlLKIKVLVFPLALW~1'Y_tl::Ir:yAC:I~\:::WTN:b'fKVK::LRr:
RtIKMEF'::FFVfYFIa:K::lt:1't::::l'YIKK)afVTh'LLIIIIJ~rfM)tLKI:fNhiIMnfll


fr:::F.'/W(HpKLRUYPh:I.L.W1.T&~O1:A1'LL'f.~.TDIOItIIY:EKLFNKKVPALDIAtfc:MIHLFEI
IIA'fFPI'KNK.:::1:.'f1:1'L'It'Iv::a
,r.llYM/It.T!':f.Tl'f:%hVNI'~1f'ltltS.Iltt:IlJ.~.::::I.


113


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
a~


NtA:aYWlEKVAARCLiCYYETKLLYI-dPaWY EFLAFCU..L:!5C~FYiEGL
:!ItRIF:LL::::..:K:AGA:K~::iE.'atQKA.iSCfIPGL
~50CN.iAaF"LRPR~FFOPOLTQ


AAKItETAVEFINPEC~:.:.:LY~-
CACt':GINL:PYVKNVIGVEIIPOAVASApENI%APLCCAEW1YLYC'IVLRONPRDt'HWINRCRF'Jt.;:A;.fI
C;.1'.:.Y~::.NLICfVVrLE'JL.CE


'VEYILCOAKAFCKRHENCKAPDVILtGFPRCCIWSK'Jt.KIILRtG.iPKIVYISCFROLNaiR'l'PrINPIYEt'
t'1/0'~lTiQt'0:1~."!~tJ:NANAIIfWGNItFR''~'S.
NNKEDI F~GK'.
~
~~


. Pf7lYG
NPKTOFQECADLta~OCYRIKtO~IDPIDQFPYSTHLENICLLfREIOPYCt.~00CPNh<xVVSHEV~GSFWDaIJ4I
J'etiANi'ftRINI~i.UC'
Y6~lRTi
'
'
'


LViAHTIICHGSPK .~~.NKAtIQSPLCV~I
N
WDVYEIDCYOFTNINfTFSSIKRGOERf'(


0996 10:1299 l01 X709
ETKQII~MLPEEItFfVPPAVIOJFFAHK:vEDRK110EQWLDEVRVWSK.7FPEIJtEEEYAL:S
: Pn


_
HKLPIQ4LE3t::CSVE."tPDSIAGRAA;iNKLIOVLSIOHIPYLI~L,iSSDt~.WIANt~'
nccA-Htscone-Like Developmental '
F:~tean


ItTLFWILKDTAKNMKDi.Iw.StiHDLIKAEKtTIItAAAORVR1'DSIKLdtVAKLYRKESiKAililYDFSCRNTK
YUVPEfCtMTtNNf:LAYS
OVFRPFGLTFLVFSDYNRNAIRLAALiKLP


... . ...,.,~ ... r.!:y_ ,;... . w_F~'!C..._. . . -:'Z~lfaA.':'-;
. ~.,...... ":.~F'.''T:4;s\':F'Y~:'<'t :.t?f'-~.


.. . :..:R.~ALw~. . . . . ~ , .:i'.F':
!"!'": ' "\:. ":.1K:.Ef


:,ppWRVS':;FH.iIELFtiIuL':'ruXv.::%.:.:::L.:1RV.;:'cr4ianliwwiXY
iv:.iEtitrlitlMD


:Pn_0897 1011692 1014157 RFOYSCA:DOVSfCCFTi .1~a,l:.~R::.oO


CNLTR possible phosphoprotetn
NKKLYHPTLFLRPLIRLSLIFALSLTLISQ4FPQQKSFGHCCAONNSALISCKNCCCtACPn_0191 10268:3
10.5988


DPIERVLADRtTLTJINOwG'ty'JVLVREYLLKCIRKGDCDYCVItILOKIS.ALRLPKIIiIRItDemn-ANP
Nuclwsldase


LpILwIIRtNPfpAPLRDWOQLFTIGGNLSI4DHLLFCLYIIrTt~ISCYENRKOtxIQiJIKtPRI4D10tA104LRR
KHYKCERVSKHTSESRIt10tk4.1:RY~SSVICOFCPYLLLTNF8YYI01'


OGDYKKAIEWCF1.VMLiKOSCSPHPEIVOIEKTFt.OKTI3.At4IKYAQtAOESCD11LI.FAKtiICVPVFf7CSN
FSAAN11PNLKTSILDFKLOSPOAALTZOLCSFLPDLKJWJILOCO


TPYCLSEiAYTEANDAWLRIARGIVSRTNEVDSVLLSNAI4NLPFAREXJ1IPELEVLIDCLRSFfYOVCDY~~IRCOG
TSDAYFPPEVPAt.7INFVVpKATTCVLCDItKAFiYHIGIT


NCfIYLESTLLYYAYFSLLBLYtIpNImFIISLERLLEKGOiIVLrVPENPYIPEYOFFLOAYFYtlrlFtIRIWEPNK
1CP1UIKLYlIKAOSA~IEGATLFAAGYRIINLPIrALti,ISDLPLIfI~Z


AKGKYLSJ1GIMLOIIDPAVKLCATFARAYLYIGCZAYVGNNY~fAEEYFLMY%SNCREKTKSfONP'IfN~Y'fF9KI
LTGOCVIEFG.EKVFtLKMJvSDNIOIDppIfRGLPNNEVCt71IX1t


ESGIGLF'IJvYAVQKKXTACEDI4LYNPKFSIIYRHLLDSi.CSLSYPt~SNIOGSSJ1I0RVNRNASGSE?SDSDY


AVPEtSEIYSRCIYMIKYRNVTYTtIPIIEt.AYNOV11NLEKRNLLEICRD11QDPCYDKAL


AFHGALQSCASVPRSLIESStNDE7UtiTIRCYEALYFlrI4PDAIJIFIZ.POAFSEEQ4SWOTACPeL0995
1026973 1027557


LRLViPI'LVRPKGAPNNAKYWDHLVLRPHGDSLYFFCYDLOEYLIGKEDIILKNLSVFAELFefp-Elongation
Factor P


PKSSLLSLVYYLOGYSESSAiJttIVCiJFVKALEEF'.EISNSGENNK1WAYIYYNVItLDt.iIDEIDCFNVRVSTS
EFRVGLRIEiDGOPY'..IIIJDPVKPGKv~7A!'NRIKVID4t'LIGRVfERT


TYISGCNFSOAVNILEEVK~WpIIASNPKLIIFLI(CEDLYLWELRWVECLi(YAYFOLHETYKSGESVFL'ADIVERS
IIRLLY'."DOEGATFft~tIFE01;11VA4EKLFNIRONLLEDTIYTL
'


AHLSt0ILLENVEKNLISPRSYRDYYCESLQRTLGLCpRFLCVVLYNfr~VItAVEPPIPFiELSIAETAPCVRCDfAS
GRVLKPAVZ'N:CAKIIIVPIFIDEGELV


KVD'1'R1GSYESIIVSK


~PILOCAB 1015141 1014119


henG-procoporphyrlnogen OFCidase CPn_0A96 1027574 1027822


AERRFCVKRAIIIGJIGISC:.SX~IwLNKKFPOAEILVLDKFJ1YA0GFVltTESP~OGiSIDLCT753
hypochet:iul Drocein


GPKGFLTRGDGEYTLKLIHELGLOt4SLIFSDRAAR~1RF11YYROKANKIST1'11'LtJIKCti.PBKYFI'FlVIU
d'DJ' lltItIKELSKGOLLIQfLREKSRVLDEIOJKRItANVAIq,VAIIPESIREIE


SLIKDFRAPCYTpDSSVODPLKANBSONITSYZLDPLIT11IRJ~ON&SILSTIDIfIFPS.iIKKCEKVLTPQLFQAI
AEKILE~V


REASSCSLLRSYLIQJRSPKKSKTDRYL71SLSPSIGI'LITTIOEK<.PATWKFSTSVTNIDC


SPKFaCVL'1'PSETPFADNVIYTCPLQpLPVLi.PHIICIENLSKAVLPWfG.SSISILrIRltAF1CPet_0A97
102A794 1027A53


FSLPIOGYfiG.PADELPLLGIVwNS0IFP0ATPCItTVLSLLIEGKI4RESEfvNAPAIAAISEIphospholtydro
lasel


YLNINOICPDAP'AIFSSQDfI'IPOHAVCFLERKERILPNLPGtd.KIVCpNIACPCLiatCIASNPSLDSttI'VDO
ICdfSt4PRPM0EKPRlMIRIINISDVNFNVLPVNPVNCFNKRLI~LLKKV


AYHAICDIJITEETL71QFOSSL
FCLVNFQJITTICORFPKtIVR51a71DSVCITGDFSLTANDCEPLWtNIYLTLJ4KNSSVYL


LPOMIDVYTIUSLiIpQ1'FYTIIlpNDQLOpNKVSPNKI?DFMMLILLt7CSQJia41S11lLY


CPn_0899 1016941 1015462
VNLtIOISAIFfFLLBLSPEEN11IIANF(YPLLSSONPSNDLINNfNiptiVLXKrPKVRi.YL


hemN-Coproporphyritwpen III
OxidaseAAVYNCAD1'SPSYIUi9GSISLPTNSRPNVI~.YPEKYQVIrMILOa.LDIDAP


FIJtFNVNFNfLECii(pPAPRY':SYPTALaWEPSDAAPALt.At'ORIRFNPOPLSGY!'IfIPFLEIANEATNOCp
KL


C0574CLYCGCSWLNRREDIYEaYINTLI0Et0a.Wt:TIGFRPOVSRINIOGGTPSRLSR


ELFTLLFDtiINKLFDLSHAEEIJIIbIIDPRSLRt~lIEKAD!'P~7VCFNRVSIGYOD'IOADVCPt1~0898
1070511 1028901


OEAVRARpSNEESLKAYEKFKELAFOSINIDLiYGLPKOTKLSPSItTI0DIL71N1fPORLANitochondrial
NSP60 Uaperonrn Nomolog


L!'SFASVPNIKPFpKAMtAStINPSIIEOtFAIYSOSRtG.L'!'KA4~0AIQ~IIFSLPNDPGTTKKRi.OSVKIIi
ttiGVC~ISEOCKLStMiADKKLFSGIDXt.FOIVfOCSYOPKQiLSPTiFF


IJtFIORITLIRNfOCYSLPPEEDLIGIrCaTtI'S1'SFIRCIYt.OI~NtKTLEEYI0T1'VLRGTfATVK~'
'YAISOTELSFtSIfC4i.CVDFNU11NNKINKENBDCATTGLILtiDIILOiSIfAIILEK


KSKILTE<~ItIRtIWIIINKLIC'ZF'"It4l~EFFNLFL'IfEFD2'YFIFSRDRLI~E'IlGLIt04CISTHKI.I
ASLKLOGEKI-0SALpppSSiPIKDAi.KVRNIIFSSIJ~P'1'I11di1YWIfSWC


SPCSLKYtPiGFS.FVRVIATAFDHYFLNKVSK!(tCFSASIPEGLISITKERG~IL1'SI~VFOCFKIPJIGYASTYF
VSDTASRLTRIANPLILITDRKI~E


INSLLPti.OEIS~NDNLIIFCtDIDPtfVLJITLWNKIQGLLpV'IWl'IPpGiITNOt<.i1


CPtIr0890 10I7A29 1016519 EDIJ1LF'LO'1'ILICPCpI7ISItVt.iIPENV1't-
0SCLSIEISESQ1'Ti.IOCLJtiILYLTLILTWf.


hsmE-uroposphyrinogen
DecarboxylaseABEIRTCSCLC1'RIIIILIKSTNRt.QSSVAILPTDEDtVEPLYTLiII~tINf~ALI:RCfIVP
00G


STIJA4WDSFtSJIFFDLLKSOTASHPPIWLLRQVGRYFtPPYOES.IOCSQSLKTFFNtifCAIVCVAL/YASLTIGT
PKDDADENSIAISLLOKACCAPLKLi~1TH11DL.DCD11VIAKLSSLC1T8


ATGt.CPSii.NVDA71ILF71DILSILDCFAV1'1f~'APGPRIOPSPEQPFTFTSDPOTIFSYLIlSISVFSREIED
LIAGOILDS<J1TTSTIIJIQALD1'AILVLSSKILIl.DpYCI!!L


LD11IRTLJt9Kt.PVPLIVFAASPPfi.ACIfLIDGG7tSIIDPSKTILSFLYVYPEKFDQLISTI
I


EGTAIYLK1'pFmJIOAAAVOLFESSSLALPSALFTRYV't'EPHRRLIA%tJtLOi(IPVSLtCRCPt1~0A99
1030A48 1037215


CFELt4FYTL0AT0A0n'LNPDYNVDLttAIOKNiiCSLpr.FfLDPAIFLLPOEKLLNYVEAFLntttF-
Nutasoyl-DAP Lipae


VPLRTYPNPIFNSIitGILPETPLFliVpLWSYVQROLNNACxAONYfQtAFIt.L'EDNVSI~i.SOVSCPKCDKtCI
TGFAIDS00V0P~LFFALPON7lTD


GNOFLKf4AATAG11VAAWSIIDYpf~SFCLELIRVDCIK5AL0EACSNOCNf.IpO'ILVCIT


CPn 0891 11121079 1017819
CSVOK1T17CEFSKTTLSSIYKTNASPKSYNSOLTVPISLtXAOCDEDtIMIL~GVfiP04


mEd-Transcription-Repair Coupling
Hpt#TRIVOPEIAVITNINOQNAWiFPp0I0EILKEKSYILOKSKLOLLPKDSPYYLDf.R


NFNIItIDFNPVNLDFSISKEFKEhTLPLLLFIdIHPCATJ1!'L.71A104E'NDCItASVIFIITIPARSCSPTAEK
fSFSFNDPL7IDPnfKAISCOSWZOTPE~4YCt.PIAFSYIIpAYTtdi.IAWIL


LDDLFEId,ATFLt7p11PVEFPSSEIDLSPKLVNIDAt7GIIRDNLLYStiJpHRAPITCYtTLKSY~IILBVPEECV
IRSLPELKLPPNRFENSMRNGNpIIINDAYNACPF~WIAALOALPLP800


ALLEKTRSPOATSOOHLDLAVCDYLpPEATTEt.CICSLGY50VlQ.TSEKCEFSCAGCIVDIGKIILILCHNI~LCRY
SESGW1LYAEKAA$RCDIIIPPICEKWIPV05VLKSYSCEVSFFS


FPLSSPEPFRIEPS.IGEKIISIRSYNPSDOLSTGKVSKISISPAYTET~ISGGItYSNSLLDYSAODVKDILKOVARI
fCDIfILLIfGSRALU.t;.SLf.ACF


FSTPPLYt.FDNLEILCDDFADISCfLSSLPDRFFSIGTLYDRISTSNQVYFSETPIPNVK


NLKINRVIIE1F'NRFMEASROAiPILYPE0Ii0NDEHPLLAFLONLOEYNPPI~KPtJa.ACPn_0900 1032208
1073281


=YSTKTKSLKEAAAL11ETVARGDVEIYEKTGNLTSSFALVNEAFIN1ISLSEFASTKVLRRmraY-NUrawoyl-
Pencapapttde TransEerase


OKORTHFSV7TEEVFVPIPGCnMiINNGtGKFG;IEKRPMiWIETOYLVLEYADKARLLVFFfILaASNIPLIPNPL10
SLFPSIJ1LT1?fl'1'LVLTVAhCVWNM4LItplC4YRDrINKt


YVPStpAYLISRYVGTSDKAADi3iNINSSKWKRSRDLTEKSLriYAEKLi.QLFJIpRSITPYCEKi.BMLNKDKAEY
P1COGVLLFISLIASLLVWLPWCKFSrWFFIILLTCYAGL41VYD0


AFVYPPNCESVIKFAETFPYE!'IPI70LKTIDOIYNDlILSPKtIIDRLICGDACFCKTEVINRIKIKRKQGNGLIWt
NKPNVpIAIMFTLIALPYIYGSTEPWCLKIPFN~FIfiLPE'WL


RMVKIIVCOCHRpVIVMVPTf::.aTONYE'l'FKERM71GLPIEIaVLSRFSOJ1KVOKLICEOCKVFCLCL1LVAII
CrSNAVNLTDC:L0GLiIAGINSFAAIGPIFVALRSS1'IPIAQDVAYV


'JASGQIDIIICTNKLINKSLEFKNPGLLIIDEECRFGVKVKDNt.ICERYPMIDCLTVSATPLAALVCACIGFL3IYN
GFPAQLFtIGDICSLLIGGLLOSCAVNLAAECILWICf.IffVAGG


TPRTtJtNSLSGAADLSVIAMPPLDRLPVSTFVNEHN~EfLTAALRIiFI.LR0GOJ1YVTHNRSVItAVISCRWIKKR
LFLCSPLNHHYEYqCLPETKIVMRfWIFSFVCACLCIMWtR


tESIYTLAETIRNLIPEAftIGVAHGpNGAEDLSNIF'IKPKNQKTDILVATALICt4GIDIP


NANTILtOHADKF1~WDLYONKCRVGRWNKKAYCYFLVPHLDRLSCPAAKRLMIl4K0EYCPn_0901 1033279
1031517


:::(~ICIAt.HDLEiRGACNItw.DpSCNICTIGFNLYCKLtJUtAVSALNKHTSPLLTNDDVmurD-
NUramoylalanine-Glutamau Liqase


KfEFPYNSRIPDTYIETC.~.MRIEFYQKICNAESSEELTAIOEENRDRFGPLPOEICWLFAFCFIRRSRYSuCiJtEI
dICpRILILCTCTTCKSVARFLYQOCHYLICAONSLBSLISVDML


LAEIRLFALQHGISSIKCTANALY~/OKCLSKSEOTKKTLPYALSPTPELLVIIEYIESIERHDRLIJlGA3EFPO4ID
LVTR3F~CIKP'fNPWVEpAVSLKIPWTDIOVALKTP6lnAYpSF


~FLtNAS
CtT~JCKTlTILFLTHLLKILCIPAIJ1M(.TIICLPtLDHltt7pP.VRWEISSP~JITOEE


N t PAISCSVFtJ4FSRNHLDY11RNLDJ(YFDAIILItIOKCLROD1ITFWVWEECSL.CIISYOIYS


CPn_0992 1027673 1 (13101 d EEI EEI LDKCDAWtP IYLNINtONYCM
YAIJINEVIIJVSPECFLItJ(tATFEKPANALIYi.G


alas-Alsnyl (:RNA Synthetase
KKOGVHYINDSKATIYTAVEKALJIAVPKDVTVtLCGKDKCCDFPAt.AS'lft.SOTIIttIVIAN


EFFFNLSNIIRSNFLKFYANRHNTILPSSPVFPHNDPSILFTNAGNNOFKOIFI.NKEIIVSCECI~TIADALSEKI~P
LTLSKDLOEAVSIAQTIAQECCCVLL3FY-,CA'iFOQ!'QSFK611GA


YSMTTSOKCIRACGKNNDLDMtCHTSRHLTFFE?tt.CNFSFGDYF%AFJ11AFAWEYSLSV'IFKLLIRF11C~AVR


FNfNPEGTYATVHEKDDFJIFJ1LNEAYLPTDRIFRLTDNONFWSNANTCPCCIf~SELLFDR


~PSFrJ4ASSPLODTDCERFLEYWNLVFNEFNRT3ECSLLALPNKHVDNDN3LERLVSLIACPn_tt'W _
lA3Sfl7 10)5311


r:'fIIT/FEADYLRELlAKTEOtSCICV'!NPODS(:adFRViAt%IVRSIsFAIADCLLPGNfERnlpD-
Nur,lmm4te:r. linv.vsln rep'tat
t.tmlly


':YVLRKII.ItR.',VHYCRRLCFRNPFLAEIVP;iIrICnVICEAYPELIWt'aL.~.OfOK'/LTLEEESAVpOkt
LV.:::EVtaaIRRONVITA'J'/VNAILLYALFVT~Kkl!:VY.DYD.F~n,FPHFASSKVTOA


F'FKTLDRt%:NLWt)VLII:i':':~.
$Ct::jEDAFKLKIfI'YQ1PIDEI:iLLAKDYD'ISVDNDi'F'N:',EEKVIEKCWAEVP::RPtAKETL~FtE.:K
PViVTTPP'JP1/V'.'.ETDE1/iTIIAVPPp


IIKLEDF.AKER:;1!KNW(R;Qf~I'.~.G:i'INELIILT:'EFL(:YDH4~.t'.DTFIFJ1IILY.DNIVSF.LF
VRE'IYKEFX~APYA'('VWKK~:OFLERIAPANIpITVAKLt4r)IHGUI'I'IVLKIfDpYIKVt'TS


:~EKUF7:AIYLKV::PFYAEKtI~VGG.~(;gtt'f:.~.F~.'fFIVTIITP::PKAc:LLV111N:RISOCSLI~D
V:aIF:K'Cf'>.1'(''t'.WIPlIY1'tW~Er:D.~.PN'1'~ALRNtIIRLDDLLKNNDLLFYKARRLKPfID
'


IYFM'/TAtWNRYRRKRU1NMITII:HLtJtAALEITLf:DIIIR(L4:;:W0tITKIRLDFT11P0VLRtP
~03'i~'r
AI::PELL4: tLTLVNR::IItENEf'VfiIREILYwL'ifW:..~.EIKQFFY:DII7.~.DV'/P'~.Y;H~S


III:II!Y?I'IL\F_1'IYA)O:FF'RITKFJIi:%AFI:IkRIErWTr:EKAIGV'fVH(,~J.~.EVLEEtA'rLL
.Ot.'Fvsnwm IW '.::1'. Inlv..l1'/


Vfklli(V::Rt:hA'PI.DERKQt)UKRI~fELFSt::LyfKLDKLIIINt'llhl!y:IT!:L'/NHIrIEHEtt::
W n..l l I'mt::m.n Inrn..ln
t'n.;W


NIIIIIIUV'IA~x'IJIIJI!11't:KLI::LSPI'TEKtt:KYII'L:-
.RV!:f1111.19'(r:VIIAODLLYAVLTPm.':Kr~:Il:NKNFVI:a'Lfl:ll'::Ia:I.INVFVY:::AIVI~
P.~.LFT:TIIKALIINJVrYLIIl~:I/


:kYI7r:KtS':
~l~x:;:ADAL1'A'ftVIHETLbi(tYltA::I.LYNNF1VRUFLKI::INI.La:iJI.ALIr:'/F'ff~:IL:Ll
:fttJ:AkkYII~:F7Yif:PIVI".IKINK
:TyLi


'(LV I' I VALY 1'I: rI'::.~.L'lJlit
LKMYLY.I:1'A I l.l' I I' I LL IA
L EP Gtr::'.,yl'/ l::A::l. t INF
IN


"1'n_'IH'm In.:IN-2 tn~'iNRr;
't'::VkI.ItYWl.l.l'LLr:VLINi:AIh'/fMlY'/nYIILNVYL11PELDIKr:IU9Y~f'ICIAKfAH::X:


I Y.t n 'fLmr:knfl nrl.t::. KL1I:I!r:11N1::LUKLTYLW:1~411.YfAA
i'fAt:F:F'r:Ff!?ILVLLLLYM.'h'PNY:YAtAIKA::


114


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/269Z3
::LECM4AMttTLII'K011FKlltIW:~LLF.~.Y':. .FF:OfX:3CLIANFICCIffLLLKV
VbIFB~ful.lp autr~tt.lmv. c~. nolaa
'f DESK::.°~L X7tRFRRPNCP".niUSKGiFFS FHLKKfaTL:ld I I f: f
PKIVGFf40:iF"-F.~.(.LKVAAKA I OCKK:.?JNL::~w"1JR'.":3EP'.":YFY
YV~~VNVHVKAl9ltlfiø9E1'.1<AIKOKV,'~~'I,I~NEi. L'I~Y~t/:DY~I aV.~l
':Pn_0904 1(175720 177396 RLEEtJ4ItOCPTYPDKLIa:: ~ . ....
muc.-PeotiJtwlYCan Transterase
RYINICKIRKVALAVOtiu~CGHIVPALSVKEAFSRECIDVLLiGK.f'aL100lPSLOOGISYREI vPn 091e
I046R1) :049094
P!xLPNWPIKIF1.'.RTL;LCCCYLKARKELKIFDPCLVICF~~..SYIISLPVLwIGL'aNKIP fabF-ACV:
Carrvsr Pr~cefn ~Yntnase
:.FIJIEONL'JtY:(h?ra.F.~RYARf:L~F.":PVTYI!FRCPAEEVFf.PKR.iF.iIA,SPMIKRCT
LLHrIfRV'Ma'KKRV'l.'~:P'':1':SC:uNl.1'CTFYDNLLACVSGVRPTTSFPCEDYATRTJIw
...,.:... . .."... Il.;.:.lt~'.~ . ,~..;~La,.~... ...,.... ..,"..i.h..:;- .
".,'Tl.:
..t..,..,1.: ~nY:.~.':= . .Ji::::I'K..L...F.: . ... .-..y-v ~w::: .
" . ,Al.. .. ... . ~.~ u:;~;
. . ,... ....1. ,. :.,,~lYr': :~ . . ~Hr. . .'~ 'A."'..'C
../~~,~~CCw~i:'1:.:.nvlLLi:.:i ~::.'f.':?':: . nt't:.~.~, . ...
DVLEGG'MILEKELTEKLLVEICV1'FALDSHNREKORNSLAAYSQQRSTK':FHAFICECL
0~\AYGHLVS4Rn\OK i~w3Ti:AAVrNA:.:Leu:FtlFilidi.9ERNGAFuWIiRWIDPDROGfV
4;1~AGILVLEi'LE.iA_IRRDAPIFAF14LCSYVTCDAFNITAPRDOCF~ITACVU'ailllISA
CPn_0905 1037400 1079875 ..IPKERVNYVliAHCTo.
Pt,CrLSEIrt.AVKKAF'GSHVRNIJIITISrKSLIGtICLCAAOGYEA
murCiddlA-KUraauce-Ala Lipase i G-Ala-0-Alam Lipase
WAIpAILTCKLHPTINLDNPIAEIEDFOWANKAQDWDIDVAKSNSPGF~O~~STTLFS
.____.»._.~-~-~~-~..........-nawvw~a.aMVIeVTTPCf.Ylllf'JRC/~.IiD RYVP
cPl~o917 loleosa lo1as39
hydsolase/phosphacase namolog
lNDI I4EVCTLVFll0C1'ItYEYSFGVTPIKFFGTPDIO~tfUUCFICFiTRCKMiCFPIGRtaEa
KEDPOEAACRELVEETCLSWNFFPKVLTEpYSFNNEEOVlVRKEVTYFLAS1IRGDIW1D
pKETt:pSOWLSLOECLRLLSFPELIIDLTV~ADKFINNYLESS
YLRNYIRIHDVCVSI~GAC~fix'cf:GiJlLlcfarnrfusa.saw.nvw.ww.ai.w~...~..,r...-.,
CVOCLFPVLtICP 0918 1019272 1019579
A CPn


7 _
YISPtFYDVSYFIINRpGLWRTGKDFPHLTEETOCDSPLSSEiASALDW-Inotpanlc PYroDt>osphatase
FCEDGTICCFFEILCIIPYAGPSLSt.IATAt'mttLLTKRiASAVGVPVVPYQPLNLCFr(K~'t'


PELCIOI1LI>:I'FSFPKIVKTAHIGSSIGIFLVRDKFEG'7EKISEAFLYDTOVFVEESRLG~tIIESLCCYIEITP
YDSVKFEGDNATCLLKV<EtPQItFS
ELLNSKKPLYYAHPWHSPTLT
RP


SREIEVSCICNSSSWYCMAGPNERCC11SCFIDYOEKYGFDGIDCAKT5FOI4LSQESLDCNfCPCLYCLLPQTY~AS~
YS~'tfICGDKDPLDVCVLTEIDiINHDNTLLOA
'


VRELAEAVYPA>'K7G)cGSARIOFFLDE~IYWt.SEIMPIPCFfI'AASP!'i.QAfYHAL~'1'QEQVLDKIOHYFL
TYIUITPIWt.IKC
IGGLRTIDSCEADDKIIAVLEDDLVFAETEDISDCPCI
'


IVWIFITDALHKFDICQQTIEQAFTKECDLVKR tALVN
SPAKIEIVGIYCKKEAOKVIOLANCDYLSYICD


0906 1010514 1079915 CPef.-0919 1019375 1050170
C?n


_ ltlh-Ltuca.ne Dehydroganase
CT767 hypothetical protein
FKRYSIlIFICEIKIDOYERVILVfCSIfVRLfIAIIAIHOTAVCPALGGYRASLYSSICQACT
'


NE
DAGRL71P011'ItKAIISN'fC~'1'ODCtISVITLP~APBLTED10;.RAFu~AVNAi.DGTYICAD
KWGSEVLELV1~SQLSREASAFRLDIDFFIINIYPFFRNF104IELCFFLSISOFNLDf


EFVAYIVIQJLVTNPFJIVEIRSIEtBSIKLEIRVAAEDTGKIIGRRGNI'IHAIRTILRfGYaINDISIVAEE1'PYV
CGIADVSCDPSIYTANDCFLCIKTAKYIiICSSSL1~IAI


RVCSRLIQDfVpIDLVQPFixTWIADQOYICDND55NSTFqIifGESZITCCSCHCH1IDEDLCCICSVCRALLQSLFF
EGAEZ.YVADVLERIIVpDAARLYGATIVPTEETNALECDTfSPCA


NQEEpERWNSCffCSNHH
RCNVIRKDIiLADtI4CKATVGVAN4pLEDS511GK12ItERCILYGPDYLVNAOGLIMtAAAI


0907 1010916 1040415
tr:RVYAPKEVi.LJIVEELPTYLS1CLYNOSK'~11C1IDLVALSDSFVEDItfi.ilYTS
CPn


_
~cutA Periplaflmic Divalent Cation 1051423 1050471
Tolerance Protein CutA IC-


Type Cyeoehrome Biogenesis ProteinlCPn_0920
GTSTYLWEGKL cys0->sullice Synthesis/biphosphece
t#osphacase


FAFSKFLIIKSSIffAVLILTSFPSESARSLARHI.ZTERtJISCVHVFPKILCBiElOISELPNY~1TVCSWTEITTQ
LS3:YRSDIRLYPFirEKSDGSFITJIADIIO&OYIf
CESEEHNIOIKSIDIRFSETC.J1IOEFSGYEyPEYLLFPIENCDPRYLNWLTILSYPEItP'


RLLTSSVSRD~.ISTLVPPIlPTS
LKOOL71KAFPNTPFIGEETLYPDQONDCIPETLKFI


LFVLVDpII7GTACFIRNRA~AVATSLIYLYRPILSVMACPAYNOTFKLYSAA10GI10LSIY


0908 1011607 I0407a0
HSQNLDRRFYYA'fKQFCE11SLAALi~i00NNA?RKLSLGLPNTPSPRRVISQYKY11LY
CP


n_
AEGAVDFFIRYPFTDSPARAWDNVPGAFLVEtAGGRVTDALCAPLEYRKESLVL~INVI
:.T761 hypothetical protein


ILaILFHIIIKNNEI1~1TRRFFKTLTPPCPQYSL:CY11SILIVISSLYCVPTFCWLFLPELSLAS~O!'IHE'tTf.
AAL~IGLtIWPTDKLIJ1L


LSKFNPSPIPNLfLVSSTLSKVPP'CAIJIFIiLRL511DAPTYLNEFSIKD1FSSI3IAI.GIFS
FFS 0921 1051516 1052793
E CPI1


Q ,
SLVIEKSPOIAODITTFYTI.QTPIAYVCtOtaNTI~TILfI;SCFL~CpPYFPSIiIi.PQI.
PfKTLLKELAKESPKIIDLSLSDAYPCEIIVTTSSGSLLRLPIKTLDsnOlycerol-3-P Acyleransfvrase
D
KLPI(ElOlt


LIDIO
CEI1G.IKLWAATYfI;ZM'1'FLVCRLLKLRYRNpVEfND'MNINPKpI,~LFLIIIMVAIZVaI
.
RALDLYK1090CSPVIE$EKOYVYDLRFPNFLLLKAL


IL6YLfWSRFHVRPIBIYEYLFItSRWQNFLNSVRSIPTPQLVPGKiBILRSLGDIBIC>(iE


CPeL0909 1011592 1041966
ASRAWRGESLLLYPSGRLSRTGKFETVNOYSAYYLIJtRVf~CfIWLVRVSCti~f7lffR


rsbV-Sigma Faeeor Rapulaeor YKpN51'PKLGPAFIIFJ~ALLR1~TFFHPKRFVIt


IISLI1TRTLLRLtl'OlLal~7lGDiIVIYIAC5LD11VSVPSVpLYLEOFIpKKNLKI11LWFNOCDONLPIEVPYA



N!1'DVSYISSAGIRLLLSNFKLVCSROG101CLCCVICESPTEVKAI71GLOQLILLCOSEQE
CPn_0922 1052266 1053927


au-ACylglyeerophoaphoechanolamine
Acyltransterase


0910 1011970 1041004
QFJWRSSLRITRKLAR10100RNRCHNUaO1LRLRPCSTLLEAFLIL:SGIEOCI11GFDDIL
CPn


_
GSLSYRELRNAZTAVAIKVSKFSFIHtVG1IK14P11SIGAFIAYFGILL7ILiKTPPIINWaOGL
miaA-tRNA Pyrophosphate Transteraae


FLYf4.PFEFEFNTTSSPECDib'C..CPpKLFVKLFKRTIVLLSCP'PGSCKTDVSLAL11PNIDRELRACTKTVEVR
RVLTSQQFTKHLTEVOG
IGLYS


CEIVSVDSKQVYOGKDICTAINSLRARpEIPHNLIDTRNVOEPFHWDFYYEAIOAC~JIKCSVPWLLRIFCVSGVESDD
TAVILFTSGTEKLPKAVPLTNKMI~IFiJOFJICL1IFF0PNT0


LSRNKVPILVGGSGFYFHAFLSGPPKGPAADPGIREQLFaIAEENGVSALYEDLLLImPEDYIQ.AFLPPPHAYGFNSC
GLFPLIfIGVNVI1FASNPLNPKKLVEFIl7DIUfVfFfGSTIVIF


AQTI?KHDKHKTTACLEIIOLTCKXVSDtII~SIDIVPKASREYCGRAWILSPETEFLKNNIDYILICI'AKKQNSCLE
SLALWIGGDALKDTLYECTXKi.OPOIALYOGYGATLCSPVISIT


t7FIRCEAMLOEGLLEEVRGLLNOGIRENPSAFKAICYAEWIEFLDNGFJILEEYE6TKRKFVTICESPR%SEGVCNPI
a?mVLIISKETHIPVSSGECGLIVVAfI'fSVFSGYi~diHENOSFIt


~NSWNYTKXOKTNF1IRYSIFRELPTLGI3SDAIAOKTAKDYLLYSSLGGDQWYL1GOLGHIGPSCDLFLEGRLSRFVK
ICCEMVSLEALESILNFJIFTENONmA


CSLWCCIPGDKVRLCLFT'tLITTIHEVPTDILKSAETSSIYKISYVIIDVlSIPIIGICIIP


CPn_0911 1011079 1042985 DYVSLNALAVSLFG


Fe-S cluster OxldOreducCafe
EVTYVLDAN C~ 0927 1057966 1055093


SLLLJ1IFNVNYFNNLCKAISFEEGLfLFVSSPIRLOEAADATRKERYPSNbioF_1-Oxononanoace
Synchase_1
PNYTNICKIDCTFCAPYRKPKSPDAYLLSFDEVIlSLLORYVSSGVK1YLL.OGGVHPCIGI'


DYLEELVRITVOEFPSINPNFFSaVEIEHACRVSCISIEpGLORLWDACQRTIf'r~GAEIVCKESFL7TSDVIDt~.'
IT1DFLCFARSPI'IYCEVSKRFOIHCQOFPHEKLGIRGSRL1NGP


LSERVRKIISPKKFICPr~IfINLHKLAHI~1GFRTTATHtIPGIMCIPEDILIHLO'ILRDAQDSSVTDDLESKIASY
NGAPNAFIVNSGYNAMiCLCNNVSRSTDVL4WDCEVHKSWIIaLSA


SCPGFYSFIPWSYKPGNTALRRNVPQQASIETYYRILAIGRIFLDNFDHVMSWP'GECKSISGOHHTFHHNNLEHLESL
LOCYRISSKGRIFIFVSSVYSPRG'fWPLCOIIAISRICYNA


LGAKALHYGADDFC~uVILDESVHKAT4WSICSSEEEIt3'tIIRSEGFIPVERNI'FYOHISCHLIVDEAHAKCIFCC
OGKGLCFIALCYENFYAVLVFYGKALCfKGASLLTSSCVKYDLfIpFI


TVSSL
SPPLRYSTSLSPNTLTSICTAYDfLASDCEIARKOVFKLKEHFHDCFDSHAPOC11QPIFL


PHTCLEEAISVLETTCIHVCVYAFAKHPFLAVNLNAYNfVDEVNLLAOVKKPYLBKSSHR


CPn_0~12 1044120 10157u0 'MINHEFHLWRELCC'H


CT768 hypothetical protein
tNINDNSONSFHTLETEOGSFLNDEWVEEVASTESTEISDATLCFAEKKVAFILNIWRE~Pn_0924 1057301
1055029


ALTCSSOGiDLRLFidDLRKQCLPLFNEIEDTAKRAOHWRCYIELTKECRHLKCWDEECSpriAPrimosanal
Ptocsan H'


FVVGQIDLAITCLEKOTLK!'QECTEDKIFKDREDNFLESpALDKHOAFYKONHTSLLWLSY.RFTAKTKSNGYIESa'
TPRLYAEVIVCSNINNVLDYCVPENLEHITKC'fAVThLRODKK
'


SFSSKIIDLRKELIHVCNRNRLKSKFFORLSNIwiFIQVFPKRKELIEKVSGTFAEDVOAFVfLKLILPAIS
'AiVIYQIKITtOCKKILPi4;Iw~DSEIVLPQDLLDLLFWISOYYFAPCGK


AKYFiCSDKETLKKTVFFLRKEIKNLAHAAKRLF'lS3HVFAETRLKLSKCWDOLKGKEKE3FNIOPKOHYRWLKVSKA
KTKEILAKLEVLHPSGGAVLKILLOHASPPGLSSIJ~3'A1IV


tROEOGRLRWSM4SKEVROKirIEVSSLLIECNDL31NRKDLECISKKINALDLTHDDVOSPIH3L&KIGILDIVWIAt
7LELGEDLLTFFPPAPKDLHPEOpSJIIDKIFSSLKTIOFN


I:LKKF140pLF0pLREK00AAEHSY0E01.AK0KC'VI(Y.EAARSLAERI1TFSICTCS<f~1ITTHLLF'uITv~S
GKTEIYLIiATSEALKOCK:n"CTLLVPEIALT/Q'fVSLFKARFGKDVGVWN


:iFaAEEWQTLKELL:KHSFLPPPEKISLDNpLNLrILC/CIVNFFEEGLLSSPDSRCKLVFMKLiOSG!SRTWROA:E
GSLRILIGPRSALf'CPKKNLCLIIVDCEHDPAYKGTE3PPCIfIIA


RCVLKORRERRQELKDKLEODKKLLCSSCLDFDR~Y3ALVEEDKRALEELDASILELKFDVAVKPCKIJvNA'I1S'L(
:!:ATP.~.LE.a~Y1NALSI:KYV(SRLS3RAAAANPAIITSLININLC


W f00LL hEK~KTY.ILF~OPVLKK IAERLE'JrEOVL
IFFNRRCYHTtNSC?11CKHTUtCPNCOMILT


FHKY Vf/LLCIIIf.N.';:PKDLt'U.~.CPKCt.CTHT(l)YRC3GTEKIEKIIf~IFPpIRTI/LID


I:Pn Oll f 1U4570'1 tU4S74g
"D'ITKFYf:;fIETLLRVFA'h:K\GVI.t~fQFILAKE.FNFSAVTU1YII1Jf:0.,~CLYIPDFIUIS


NII fl.tNlt;r tr111Nlhxa Vresenc
EpVFOLt'h)VN:R::I:R::IILIt:EILIQ::FLPDIII?1:NSAI4PUGY'AF'L':QEIT4RELCEYP
in I:linrlvnk/ElIBL >t: at tl/71'IH


Hl.x'K'fYRIFJITD::dIIWRRNCf'fAFDLDt'.'fLLK~:If:::xt~FYC'R:LG.t:LF::IK'PLPE'1:IF
FIHLtt~IFMCY.ufKG'IWt:F\I(I1VIIHILKEULEL"tNPLt4t~/TPt'GHFKtKD'fPRYQFLI


-i ltl'FNFKFF!'f:I FIIPS I 1R Y :AW I FVNKKLIINAL.KIr\KU:1Y.VKF'FI
IINUIMR'FF


~'IW ~lnl1 111.IS'HI'1 lU4inf'Iw ':I'n lfaL'i 10'./w!'~ !'n'.:Wt


NII ftHNl:iL INNMILNI hfla:l'flt TI'1'1 llyt111CIN!f I '.11 Inl..l
111 n:.IIHI\IhY/EHDL .!G Jl I l/7/'IHt.lll
'


VFFWGLP::I'Y'/:al.Tftl.t::::Vf'CDDLYCVAINFI':'..'t.'If:::DFYAINLEKLEErIFADI'IT:
Ctf::fF.t:ll'ft \::I.tIJVTLIftrAI::A::Y.::a1'EKAY611PN
YIMLPMF11S(NIFIIB'rl.t:QLI.DH


VILE.':'.::IrIFIVtIIIAC~Ia:I:::.WIA:x.'YRU):vItIJTIYKKCLTi:DKKAVIL::Y(KKIFICIAt
fyl'(TPPPITtIt.::~'t:K'rKl~.:LWKWVILIiCUL::9NAILKEKYPALYG:::W'Ai*IPC~I


AH::IITF::ImI
ILW.I'YL.HLa:EEKTWR('Ct:HLKKH:1YY'fYWNIVF'rvF:FIUIEEVI.FFNRIVKn:f
'JI.f~I'rY::rI.IIIAKTNtF'/tIIIPNFFI.AIAI'ltP/IRYKIP
'


f::, ;.t:'ft".Kf.::IiI.KItItLWAILlIftLPFAYTI'Y::::
'ITDYIYJ::LTOtF'J:IFt.t'L


'11 11'1 l'~. 1t1.vi.lq l 11141,H
1"


115


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
CPn 097'1 lr1'. . 10711'7s


.:~ p'I,p., LOr.ADDn 10'SNSS7 Cf790 MOoctlectcol Procem


Tnroratnlrtn OioulIiJrf lenmtriee IIINRW1'IRL'.rf' .Ttla'
"~'n.'l'PPSB'E~F~'~"
CLLLTLPCCAARRRASGCIL00tRPIAAANL NM;~LIPfK .~8,~~
i0HKP1IL01XAF~ D91HIi1~IKVR
FKD n
'


. lSO
. VIOVIILKLAKTI~V~irG00NGIDahfrlRDiERI~GIY
IIHTOpTY;.TRF
VSIPERTEEIOCCIVSEISEYTCLHVAAVHVIIKCLTQPKatIDEEIEELV3YCOLPSPE
<.
OWP3YAEALENSKOtIHKPI~LFF'fC50WCMhICIRND00ItpSSEFKHFAGVNWIVEVDF
PgD~RIOPEEGROKNOELKAQYKYCOFPELVFIOAtZKOWt~FEP00CAAW5RVRSAL


KLA
OFLt.~SEG


' ~Pn neln W i0\n tD?t_aA


?. ;os~nllf tnsaa7n ' r..t:f:om:. alr:::
- qy .


_ ~ ..
,. ~..rr: .,.""....,_, .", ... .;.\..... .. 1f w.:?
h':\ur~:.=~~.' -:.at. r. :::INP."Ff ....~".
:Yt:..: :wsr:l:
:Nr:
::
'
'



.
ERIPFLIU(K'IASIETIW.iNC:EALLLENNL:KCNNPKYNVLLKDDKTFFCIrIIsWISW
. '
.
:.: r:.
: ::: :::.::: ::.:-
:a:: .a
:
WVLKKTCOFFILPSSIISOSHSKTAVAIRl4I1'FLSNIIfOGLSLKEI


FLISLILFLPt.ALL CILY
IDRGDSSLNDt.AKJtTG
PKVdJIIRTKAITSSORCLItGPYVSAI~CfITLLEVISOWFPLRTCSDREtALRKR1
SGLh~


.
dRlIbCLAPCVCYLTPECYO~'LDKAILFLIOORIEEWKDLLKVIOKASDNLEIWIIAIfYY
SAAORVIITOYDDLWDSLAIKIPtIALPNRWILYSpGNRTILTLLTVItSCKLIGARNFBFFl~7lQ
YPGINSSKGSAKRENLVIISYQACVRYLRDECiGPKANOIIAPGYSLGTStIOMUt


SNLLVTN O
ALDRMOCo~DLTSwIWKORGPRSLADVANDICItPIASAII1Q.VG~IHIDSYKPSiaLRCRTLSLIKOAI'U1KOQVE
XFHIONIDAI~LYRt
~ODLL88FILQYYVSQP1'IPKEILTPLPLEFPtLSYVWA6SPPRLRSP~GY~~tELLD


PETFIYNSNNDQELISOGLFERfNGVATPFLELPLYRTSCTRIPIPE11DLLHIlIPLSPHVLAYRNAKAYAATTLPSS
1'LPYODFONILRIISO'YPYRILGY~I7111lOGUlATGVYIVF(~IJ


VDRLMVISNYLDSENPKSOQPD
GFDP(fOYRTFSI08ERTl~I~.ALi.IBVLd.RRFNSLTTJILPOIIWDOCR'I11YNRTKRIIO


'l'tJ'ILTGIOVYI'IAKEKlIMSRCLIiIWCIfGITFPIO<CFSLPPTSMi.QFFOILRDWdtFA


CPtt_0928 1061075 1039881
ISKNRRRIIGRALFEOERIPCICEVRIIKRLt4RFKSMOpVIILSSQisEltaIPGLTIOtDIAV


CHLPS I7 kps P~eeln hoalolo~3 ~tRRD


RRKDFAFTLIl~SDILSGIFSNPNPV5YFS8TRAKOLSDFSKKHPILTICIYI'IIVICILGROt


FKLLICLIIPPLGIYWLCOLVCSLALFPRSSMLYSVLIIfCFRKYRLWBIODYWIDfL.DP
SFIfDPAVSESKRITI00DHLTIInLAINFSTARPKItwLLISLGSCOFLEDMIGLRDBLFL~PeI..0911
10755DA 1077018


SWKELAKLLGANILIYNYPGVIt551G%tliLtNLATAIBiLL:Alat.0~I0GPGANEIITYGrq-~ Nimcch
Reoas!


YSt.00WQSAALOKNPFTNSETSWVAVKDRAPNSLPAAAtt~FFGPIGKLIAViJ1RW10~A


EKNSR&LPGPEILVYSADAFRPSEII~DTALLPEITLAWIIKRTPFMSKKFIGEVNfi.H


3SPLKNPTIOKLAEAILESL3RKN


CPIf_0929 1062701 1061186
~CMLPS 13 kDa Drocein homolop_1
EKllIiIPIHGSNAFVEDILNSNPSPOATYFSSTRAQKLNEFI~WPVLTRZ1~SVIIRIFRV
LIGLIILPLGIYWL.cOTLICTNSILPSIWLLKIFKItOPNTRTL1LTNYLHU4DY88tD'1HVA
SNARVPILQDNVLIt7fLEICLSOAPTNRWIG.ISfGSDCSLE~IAC%LIFDSfIORFAK:.IG
ANILVYNYPGVNS51GSSSLKDLASAIWICTRYL%DRmGPGAIOCIITYI~'SLGCL.ZDAE
ALRDOKIVAD87DT1tiIAVKDRCPLFISPEGF1CSCRRIGKLVMLPGWC1'RAVOeSODLPC
LEIFLYPIDSLRRSTVRpNKIi.APELTi.ANAIICrSPINONREFIEVRLSSDIDPIDSRTR
VAL71TPILKldS
CPn_0970 1062851 1067370
No robust honoloq Present in Oenebetlk/ElOILLCP1L0912 I07S955 107775
as of 11/7/98


NlOISELAPCSTGLOMVPNTOVHtiALDTRRVILTIAACLSLI11GIVLVGIaAAAILPSLFGdnaG/PSil1-ONA
Psiaase
R'IAHYTEESLZ>cIWtSIDIIIDVLREHINLBRSGATYKACCPFIRCCi'PSFIVN
HCSI17Q


VIG(r?tILILFSSIALIYLYXK?REIIOpIALEPLPEMISKDOSIIDIVETRDYASLEKRAT.
YRCFGCGRNGDAIGFi~IOItLGYSFTEAILVLSIOCFwOLVI.OPK08GYTlP00NC
A
A


FAYTIfrNYYOCSMV!'IfREIPRFr~CSYLiIL.RKDlIaROALE'P
G
N
~IZTJStABTFFRYCLYNLPEARNAIAYLYMRGPSPD1'I~tFNLGIRiP~08GFi4ilwE


~PtL0931 1061078 1065718
DlRId060LM'AGFFG180IFLFARRIIFPVIIDAiOttTIGPSARIQC.ENDp00RYVM'PET
MIE


PIFIaSRILtof.NIBRRRIAi~IVILVOD0110CL0lIIDSGPNLZYAIIpQrAPTJA


lyeS-LYSy7 eRNA Synehecaee


IDFRVLQ'1KSDIYTNILCEPM'APAEYLDNEDFLY~UGQGSE<GVVLYPYETPGVFS
IALLLOS(JDYL'1'li.It001$$YPKPGPRERALLVEGIROIwbaSPILVYEIG.I~Ir~SL


CEDI10t1'PASOEIGN8EAANSRS'1'PRVRFI~ILWRAIDKNiIFG0ILa0~fOTI0~1RIIRtPEDNVLBLAHPOP
'1'AEPpHIPIROKVPKINPNIVhEiDILRf7D.ICGfail'ICILY'1'J10


ElTSVNGLSEDItEITPIKFIEEKLDLCDIIGIDCYi.FFTNSGELTVLVLTVTIl.CESIi.6FYIVPEDIIINPI.I
AFMISYYEKIfRKNVPFDEACOVL8DS0ILpLLI'IOtRIXIfALD
aAHCiFLCVE1'PILO
I


f
TIfIRfL0t0lADRRfdIEOCRPL$LHONIOD~EILEDYWLRRDRTIITLL0P68ELIP
LPiROfAGLSI7KEVRYRKRWLDLISSREVSDIFVIOtSYIIIC.IRNY
i'I~ALNSENFLRISLEIALIGfILVOGAPRIYELGRVFRNt~ItR~'!Q1
T


NIYOGiIFJIKPI
T CPeL0913 1077972 1078238
PCTMIGYMYIfsYIfEVHVFVfM.VFJILVR7IVl~ffSLVYSY18f11DPOBVDIKAIWIR
I


LI 1 hypoclucieal pmcein
tfll8~r8IA1YAGIQVDVfIBDOKLEEILKKKT'lFpg'1'AIrATASR~.IAALfDELVSfRC1T94
N


APNNITDtIWEITPLCxTLR5GD,1AFVE<eFESTCLGKELCNAYSI'3i~PIROR1~LE00.
'
PPIBfSPRFLLPFLSVILCaia.LSSPR5RJ1ISVTESIGtISAVKTLVC.BBIDIREfita'.1GY


fNMSIRDVLYFPIIl9iR GVGASSILItONQfOOWLCIESLLAQNlVM
TIOIL.LPDSECNPIDEEFLE/\LCOC?IPPAOGFGICVDRLVItIL
FDAGfIN


CPtf_0932 1067160 1065721 CPeL_0941 1078503 1078997
7
ban
~


V~KSS5D~11IMAF;NIyECLYFYNCAS rhQ>O~F9PNtFt'PVRLYTCGPTVYOIfAHIGNFATYVFED~
FEAAYIipA
VRAELOP$ ~
IKT111QYFIPLLALLIFSPSL


VFfGYSVTfIVIQIITDVEDRTIAG11SKRNIPLOEYTOPY'fFAFFEDt.CI'LHIARAI~PGILVFTSGIITPBFII
DLTNGSPSLSTPIAKCFfNWLCPOLISPLDI11110DPV
ILKR:T


.
ILYIGS!'LQ~PEVF~tVSGPRLCYILIDL00CAQC0AVLPLLTIGt
DFYPNATtIYIPQNIOAITKLLEOGIAYICODASVYFSLNRFPHYGKLSICt.~.SSLROCSR.


ISADEYOKDiPSDIVLWNAYNPERDGVIYSiCSPPfRCGRPGP81LDC8INAMELIGDSLDIN
E 091S 1079001 1079660
CPef


ACCVtxtIFPNNENEIAOSERLSGKPFARYWIJtSPitLLIDGRG1SKSLONPiRr
i


FICptVAYIRiASNYRTOLN!'TECALL1CRNALRRLItDFVSRLEGVDLPGESPLPRTLDSn
CTY95 hypocheeical Drorv


SSOFIEAFSRALANDLNVSTGFASLFDFVNEINTLIDOGNFSKADSLYILLTLKKVD'HII.8IFKNRILPSYFCHHFD
DLRPHYINtIALSLLSLi~IIFPIFCEESRPGSEDCNSMfQLIIIC


GVLPLTfSVCIPETVNQLVaEAEEARKTKNWANAI7l'LRDEILAAGFLVEDSKSGPRVKPL50171'00CLY1~1(RI
EGKPLVTWILNSCDOCQACfIGLSETCEIYLSVLBGSI


FSELifNIWLVP9GVNPLIYPPI~PILAEIVKFKELFKDESFPfGGSI
IWCVTP1~PC


CPn_0973 1067532 1068578 . DIIEVSPVSLTV6EEETLPSEQTTEVESTSEtQSEDPAIA


predicted disulfide bond isoaerase '
K CPri0916 1082816 1079715
AEL


C t)ly0-GlYCYI cRHA Synchecase
PVILI4NIKRCSLKOLKVLATLLLSLSLPTLEJIA~IRDSOSIVWHLOYOEAL.OKS
GJ
L
'


t
GECOKICKCYTLESFVSEHPLTLOSNIATILRFWSEOGCVIHpCYDLEVCACTFNPATFLR
JEVEYLKHRPQVCiIRO~
PLLVIFSCSOWNGPGMKIRKEVGt=SPEFIKRVOCKFtIC
L


KSKPKINELPCNILL3NEEREIYRiGSFCNETCSM.CDSLCNIVEBDSLLRRAFPPDfISALDPEPYKAAYVEPSRRPO
DDRYCVItPNRIAHYi~LOVILKPVPQIFLSLYTESGRAIGL


SLSELORYYRL11EELSHKEFLKtIALEIGVRSDDYFFL:aEKFRLLVEVCKl07SEECORIKKDLRDNOIRFINl7DN
CIPTICAWGLGWt1MI11GNEITOLTYFOAIGSKPLDTISGBIZYGI


RLLNKDPKNEK01'HFTVALIEF9ELAKltSPAOVAQDASOVIAPLESYISOFG000KD1~R.WERIJ1NYIAKKISIY
DV WID1'LTYGOITOASEKAWSEYNFDYANfAIIFKNPF~IACGL


R'J~tIAOFY LDSDQWtIMAi.0lf
AEVAFEAAPNEVRSNISRSLEYIRNOSRTLIDIGLSVPAYDfYIKASNAPNILDARCTISV1'ERTRYIMIRpLTRLV1
1DSWEwRAS


0971 l Oo8918 1068526
lt~lYPLtw"LSSTSEPICETSfSWPMISSTEDLLLEICSEELPATPVPIGIOpLESLiIRO'Vt.
CPn


_
TDtMIW(TJrLEVIGSPRRLALLVKNVAPEWOKAFEKKGPNLTSLFBPOCDVBPpCpOFF
:npA-Ribonuclease P Protein Canponenc


'IFVNPLTLPKpSRVLKRKOFLYITRSGFCCACSOATFf'/VPSRHPCTGRMGI'CVSK!(lCKASOGVDISRYQDL8R
HJ1ST.AIR7YNCSEYLFLLNPEIRLRTADIIlIpEt.PLLI0RT8IFPK


AkEANSFttttWItLYFRHVRNOLPNCQIWFPKCHKORFVFSKLLODFItJOIPOGGHRLGKYII~WONSOVEYARPIR
WLVALYGEHILPITtGTIIASRNSF~11RDLDPRKI$ISSP00YY


TKATTOCIx:TPItSEKC\TAPR
CTLROACVVVSOK6RRNIIEOGLRANSSOTISAIPLPRLIGTPLSENPFVSCOOP$Op


PCALPK6LLIAENVNNOKYFP'fNETSSGJ1ISNFFIWCDNSPNOtfIIEC~tALTPRLTD


rPn
:EFLFKODWIPLTtFIEKLKSVTYFEALGSLYDKVERLKANORVFS1TS5LAASEDC.DI
0735 1Dc9100 1068957 '


_ ~OKLSTIGT
r171-L7A Ribofloafal Protein
AIOYCKADLVSAWNEFPELOGINCEYILKHANLPTASAVAVtIEHLRNI
'


EtIIVKRTYOPSKRKRRNSVGFRTRNATRNGRKLLNRFFRfIGRNSLVDLGLDRLiIDH
LL~rLLDRLDNLLJUCFIIGLfCFTSSNDPYALRROSLEVLTLV
:ASRLPIDLAu'


FPSTtEEKVWDKSKTINEtLEFIWGRLKTFMGSLEFRKDEIAAVLIDSATKNPtEILt)'fA


!f! Jr. l i)r: )310 1069170 EALOLLKEENTEKLAV iTfTNNRLKKI
L.i.~.LKL'oM1':~SP
r:lr: I EVLCDRESNFKI,WLDAP'PGF
'


_ IidE
r:7t.-L:'. Rrtxfurnul Froceln
PKET.~J1HAFLEYFL.:LApL::NDiODFLfRVIIIMIDDGAIRNLR15LLLTANDKFSIt~


'11J1KV.~.:;::VKAGP:aM:DKLVRRKGRLYVLNKKDPNPY.~.VAV
iRO~PARKK


u':: I IJ.:'t.lH7 IJr:'ti'rK ~Rr_tlnA7 II)K IA f f li)NAO'~'I
am


_ pttsA r:lYVarrcl f 1'
PM>trptf.ttyh/ltc.m:ltKC.ncr.
r::lA::lA ItrW rt.vml f'cocwi"


~/KRNAYY.::::V\kF
V:RRRLVE.WFKKR::DLRXIVKC:..:'!.~.EEEIII7JARt::LNKIIKROTSP:.:AN.LMJYCTF::RLFITt'
IhffIL'ILYr:YWFra'PCVVI.I'Y'It.LAtd
\L::EI.'fDJII0.A'VA


'f'fIJItW''LIa'a<I'1a:1'td<KIAI::RCt:Pft~MA:2'1t)Efia:JIKAaIRKF.~/JIt'~:KLLC11'
HAIxIt1'ItC::CYL'PF1~~C'INNLC'LLL'/FlF(
IRIX:VI::Tt.RTVt:AF


1'r:fMIMNAat:KL.KA t 11r :Y::F'YI.I
1.1:/M 1 1'11:.1 t :l..l: X/tY:LL:I
h1\::YN:: f ! AVY::IA::


wfn U.::, 111..111'.. ltlo'rrll., iIEIfFWMIKNh'I'RtNAKTKI\:hY.tDll'_:YU


..~~/i1R t.ylo.rlrI i.~.:l 1'r..r.im
-IL:.uMr rr.'fl ir~rn i.l.' 1'rriPl.f!'JIiCI'


rIN1:971'LIt'IT)1.1'1::ILLIvYVIIJ7f:C::AYIADKKYFtf/It~.IFF'M:Ah'F:FIr:LWLLLL':h
r_If'nAk I111c'.As!t IW AIVA
/


I~.:ItktlAl.h:KfAlt.('YfMI::I)1.FDDfJCK::LY?IDEIP:::~:LWEI411YFF.1.'WFYflIKDRFN
V'IUIA r:lYt'rrNar ::Yldtl.n:.
'
'


' :I' I ::Fh:h:l.l'l'LLYr :K'fYf'h:l't'L I ::KI ~.
WNWKKI 7MKUW,tk VKL'IGC:IIJt~ItL.Kh'.A::K.:~.~ I::R
':F'71N. I'ArIAVEhTI' 1 VKVI S
:1 t :IX YA::L::K 1:1 J1 KVNIIV
h.'lLf.('l11' 1


F::FYYEFLCKWA::AL:Y::1'1t:1:1'l:t!
1'fId'KaJII:LF:.TC:IY::h7JlIVVkN::AYAAAAAII


116


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/Z6923
JEADIAUIYIILNIIIOM:LLJII:LLKHPIJ~IPVH_CPn 0?~0 I' '. l:l.'.':'::
"INIIPCYIlGYC3L'OLLAASOIp
'!:


, tT909 hYpocttetscal pw.r~sn
.~;few'IIYOLPRDI'OT3VLJIKCALYCGDYI'I1'VrL:.iOEttNDY.iDIfALNDAILARNSVF:iL::Ltl3Y
LrNPraKAI.W'w~F L tt.
OISSDKFPLI E
3EP0VLFTKK6CIRAVLYFxL
'


. EELWSPLEVGR'.fGA~i'v:.'oOWL
:NtaDEDVWNPKTOPALAVOYDA:iLL
I
'


~NEVLtNCFtALt.ODCLASSPNIRL DCFFCOCSL~:PERKNtLKfLEflRKKNCG~PFCYL
-Vt3RIYEERCPEFNKEIILMAHGISYAFLLICi VCSGVFDCRPLIROEw::E:
'
'


.F .
~
LDFNDPWILTYAAAGHICfPSNILEAG~GLTCLIAMtYClVPLVRKTOGLADTVI
.PT


TFFDLNIIFNEFRAHL.iNAVTYYADEPDVWI1~LIESCHLIIJIiGLDAHAKNYVNi.YOSLLS
'IT
~-


~
Cpn_09~t 109710.; IO


.t.n n.~.tv lUR5Aa7 lOnF181 rl t:-Li2 Ratna~ln.~: Pmresh ...
.
.
._. .


. .
,

. -.KAt",r
:: ;
.. . , , . ..
.. . . .. . ... ...:.;
'
'
'


. :..,.... ... ;.s.,:.,slt .,
~;; 'A.:L.. 'dl:T'.....i..: .;:
:LalIt11H1:r:~-::n~!~:r~':vr:;~ :.,.
".': -:I1:,. .... .._


KKFI::NLFSvALSSIYFa'LSYEGR L IKALVI(DIOYO
ITPYOV INLDfCCLVEDRPIXtI'fI 1D9730t 1099:75


PIIKINAVDCICVIC~SLROVIRAVRVMCKPKDIVPFLELDNRSVGLSOTRKLSDIKIFCPI1~0962
. plsK-PA/PhosphollPid SyntMats Protein


~yA
IL3L1111C1r0ICIDIlCWHSPLWWaVLVDVLKSO$SffPFAl1'LFAiClIRI4~tOCAf
~I~


0950 106170 1017037 80LP00CFPKIfSAEtiIVANm8PIw1A1RKK38
CPn V
~


_
TLARAKIPLFPAVSRPALLVLyPTIEUGIU1VILDYCANISVKPiDNOFAIAOGAYROL
pch-PePtadyl CRNA Mydrolase LC


PSLCOlahAKLIVAICNPRNCYANTRIINNGFLLADRLVE~.Of;PPFKPLSKCNAiJfl'LVCSDSKIPTIGLtIfIG
SC~IKGTC7WRCtfIWLRE?liG6AFLQ(ItSGAVPDDiIADIWTt~F


SSGPLVFIRIT'FVNLSGIU1WLAKKYFNVALStIILVLA~SFGKiJIt.CFNOQ>IOGIITCItIFLRTAICVTCPLO
RILGDKL6ADIORRLDYTFYPDSVIIL~GLAKLVIKCIIGKA~fS


NtiLKSfTASIGSNEYWOLRFGVra'RPLCL7I:VELSNFVLCKF8ICZ~.OLGSIFVF~ISTLf'tLFIGILGSINta
QARt.CKRILS~.I


EadCSItF
CPn_0963 109371 1103231


0951 107113 107157 pm~ZlPutatlw Oucet Msebrane Protein
CPt>


_
TPLRFKVAHW1KKTVRSYRSSFSHSVNAfTSACIAFCiINSLNSSF.LeLGVTNI0FS~5
rs6-S6 Ribosaeai Protein


.EFI1~ICKKENQLYEGAYVPSVTLSEGRRKALDKVISGITNYCGEIHKIHDQCRIOfLAYTIANVBCAQTSVLKGSDP
VNPSQKESEIfVLYIpVPL


I~AREGYYYFIYFSVSPOAI JISLPE
TlVPCIDQKLVNStxOTIINFSOP11QEPDTSNAVSPJTISSCEKDipKpLiTCDPCKCIGLK


EvSSDLPKSPETAVAAISEDLEISENISARDPLOGt.AFFYLO?fSSQSISEKDSSF17QIIr


ct~o9si 1oe711;9 loe7n3
sasaANSCLCFarsrIAVKSOAAVrsaRDIVr~avKCLSFISCESt,ma:s~l~nrrH


rslA-S18 Ribosanrl Prouin ~T~'fG'u'~~~~T~~'~I~A~


CENTNKPVHl6LEttRRKRFNKIICPFVSAGWK1'IDYIfDVCfLfitFITEPGKVLPRRI1'L11SS


RFQLYLSQJ1IKRARNIGLLPFVaED


CPtl_0953 10~7717 10BA3~8
r19-L9 Ribosaml Protein
1TARFGYVRIIYLIPIDDUIVIAG714TLRLOAIa.KW
RLIOAAADItADSERIApALI~IVLEFQVRVDPt7~i~DIYC.RVTIHDIIAF~IAiDDIIFLVRIC~t
FPHANY71IKNLGKKNIPL IQ.KCCVTATLLVEV'tS~IE'YYMa0GK0'tCENOEC
CPI~0954 108~359 10~8708
ychB-Prttdleted Kinase
GRKVCY%DLNpYfSPAKLM.P'LKIWGRRPI7NF1IG.TTLYpAfDFf'sD1'tSLSLSS
NVNELLSPSNLIWKSLEIFRRETQINOPVSWNLHKSIPL09GLOOGSSNAATALYAt~iH
F01'HIPITTLQt.WAREfCSDVPFFFL0E0H
CPIt'0955 lOAAtilZ 10A9175
I Erawe-shitc Wich 09561
LK06IFAICITI110RNELtH'8TNVi~SCLGVV1~CQNICCFtDlKIIItLlGYJiLCLtYtOWE
RAFPliP7ISYlMIATLGSRNRIfRCSFFFSSCfALGHD~i~4.fSIKLQi'111K7DCYVLYLDIIQ
Y11GILAGPWLIKCiIfVYQit~T1'D
GfptEfTAY05LLPQDYSiGIHC~IACFYGCNDLdKSVFAIRTLL10~IKElaC.CRNWBPFCS11V
YOTLOIS7GSWT~.PIACt8IDYRIfIHJPRRFIfIIIVS117VPlVtAiYHIIE~PtIiCO
tlISGSGJ1TLFVCYLEELEODSIIVSS0IIL9LIKO~fpCIPVSRLYAEPNNYSLKOS'fYIOiBP
GKEVRTFQRTRIENVAIPFatALFJIAYSRDSRAEHiSVpLAYVFDV><IIIA7PVCLITL1~J1
LOCFpPpI AYSIIKCYCVDI IlIYDII~IIIIIF
CPtI_0956 1oA9515 1090909 CPn_0961 110~~11 1103301
CTA05 hypothaeical psouin No robust homolop present in Oenebank(CIOI. as of
11/7/9!
WWP$NILPPYSYSLKIGAAVLFPCSILNtFLTP11LY1u0SYODtKLVFPl7CWIDlYJIRt.
OSILCSIIKYIYLIIOiSKNMLBNPISLFSPAELIAICYIrt.IPKISPIYIRIIIiLIIl.1~11
SELTRILSRVSIVFFLWAVPL!'fWFLYTCGYRISL01YFNSPNYOTAVFI11VILILT~~P
CQTRL'fNVAOVGNPSSLI~SIIXIIJ~CSGGPLCWYfEIOILAFITTiVtlIIfi.IHC.
IVYFACLVLSSIAIQGKTSPKSWWWfLlIIAPPLLSCLL1TC1GANIIG11?LLJaWIYV!'8P
fVAOLRLFIIPLPPKKIVmt.SCPTTECIIiEVfOPFIF11L0ALLFC~LiIfF!<IV~fIIC
SRRFAYATNGt3.FSNISICCLTSYVSSRALTLIPPALI6iENSFFLSNFAMfAIVATLIST
KApI.PIIIFGHRi.VAISPOCSOFJII/tRIPtLKKVLISLaVLTPAi1K116nY1~i
TIYYFIFRKEFKKFPDIPSD>mPSVCKVPWWIfCVNIIlVGSIILSPSTPLFlICAfdi.FY
OItIANl000lTFPILIIQ.LIGL1CKSSLPICfPSfKtKI~AALFIAS/!IA>WiT
iGPnXFTIEYQOPINLSKVCYVCLrYIIGLWFGOI.pDIWVt3It3100LSDFGYIfIYSIrfLS
R$~RLYSIANDf~LLIW90CFtDCREf~ISIODG~A6EYRFAA0p11DA1tY11G1ICaVi.
IFLDNALVNYLVtOJISVATDCYNYLWXK'NIUILiGLTLVSNIPNIVGYLILRSAFPSSTi
RNCSMKIAWNVINT1IKPTf(OK'taCLVTENLQDTIG11LTLRQTNiI'114mtCW10LlML
HHONLFLCALGPSI IShfIVPSiLLIQ.1VPEFLYCFFR
PLIIKYLNS~.VNSVFKSHOIUIDPCTItALIREPALDILY11SLRLPpTSJIlEfM5Tti31
DPLZYCPNKACI71YLLYVLICIICL
SEEFLKRIFOtLPAV
CP1~096e 110055 L10d719


pcnA_:-POIyA Polyeerase


LLITtIINCENNILr<:RSiLELLKKKSNITLTFTIYSVSNHNfKLKDFSPNALSVIK?LRK


AGYIAYIVCC.~IRDLLLMl'PKDFDI :'l'SAKPECIKAIFIO~ILVCIIRFRf.ANIR~Opt


CPn_095A 109803 1093793 '
IEVSTFRSCSTOEOVLITKDNLW>1'PEEtIVLRRDPTINGL!'YDPENCCIIDYTr7D11lKKJt


pls8-.lycerol-3-P AcyltransEerase
NRYLRTICDPFTRFKODFVRNLRLLKILSRSPFTVE1'OT'OEALIJICIt06LIK850ARVFE


LYRAIYHOFSRYLRYAFONpYLPEPLY0KFS11FliQNYIDAATKKAAADOAEVLCLOWhfVELIKHW.SCRAIO~PPO
LLIl7JfILLEILFPYNDKAPRWPALCCpTATYLKALDDICILKItE


TIDLYtIPFIFPPYHKKIRAPIDLFRLSIDFFSLVIDDIfNSALtIrLItRLKEIBEYIARCDAEYDRIWUMIFLPPLV
NPNVRYKHOKHPYLSLT~VF?(IKNFLCQFFAOSFTSCSKKtIf'


IIWLUWHOTEGDPOT1IYYAiGK'CI1PCLJIENHIFVACDRViSDPWIPPS11CCOLL.CIYSILTI1LILQNDYRLT
PLIPIKKALPFNKKLLHHTRFLPJIL:LLCIRSLVYPKLOKVYVJWl


KRNIATPPELREEKLLHNpKSMOILIITLLNEGCKPIYYAPAOGRtHIKNAEGRLYPSEFSPRHHOTLKCKKD3NSOK


ESLEVFRLLAKA.,~IIQITHFYPFALKTYDILPPPPKIEIIAICEORAIFFAPVFFNFGACLF


FDALC.~.KEELtFK:DKHAORTLRAEKVFSIVKNLYCELCPn-0967 IIOM171 L1U~8>r:


mrnA/pps-Plx~yhnt l uc-cwwt.>,.~..~


''Ur_U''.n 1117AJ7e lU.s179~
PTAYKFAFIC.k'R::EKIRRTCtDFRRHHp~h:VkYLlY:fGIYRCR/W!'EPlfIYE1'fVLLCK


.:.stE-IUi.tI Fshnrnc 1>rocein
AVARVLRl7:Ra:KHNV11',:KUTRI:X:YHFFNALIAP,ItJ::M;ICEtVt(:PLITFGVAIITR


A(X:Wa.TRKVNENEILIi~IIE:iKEIRYAIILKNCf~LFLLTtERKKVROt.KCNLYRGRVTHTAYRADR::IHI:
i4:11NM'RI?ltal!TF::LEI:FKI::fi/Gl~P1E711V:IPJIDPtIPLPCOINVOK


LIlNfO::AFtNLDERfaN:FIHL~.DILF3~I;:KKFfONFOHDVDALPEEA.iFJIPt.L.iSEEAPIENKRVIOII
IK:N\'VLF(NK\TFI'Ke:NTIJII:I.IIIVLI/:NK:A::YIIVAI~.T/FERLI1AEVI~OCE


F!'1.1;LL::PVt.VWVKEILU::KVARLTSDII::Lfc:RYL'dLLPIIf.PIIRGV.~.RKIEDPIMRCDLP'l'C
tNINEIk\:ALFIth'L'KAVIL7KlI11t4:111.U:DWPITMVDF7Ni1L110s7111ILaICA


YyLIN::!'Fl~tXnIY:LIt.'R'PA:TCTA:,TFr\LINPJWDLLLTWII'I'ILEKFYSTEGPI:LLY~ET;:DLK
KR::ALMINItWI'fllrlN!'rtJIJI'(Lf7t:L:IIrVFT.':1'P'.YJRIIVIJMHLYJIEVTIl14E0


Inll.l!Yl'/I.1't:IIH!ILYKNLLIDDYATYQIG:KIMLKKY:PDA::IKIB'fYRDSIPNFERPNLE:il:IOA
IFLDYM"Aa:I:II::TIlY'/LRINIF-':I_:HI::W.TAtIVK::I'pl't.INIIAVRFJIIM.6T


YliILYATIeNKIWL:::~h:1'LFFnI(TEJ1l61TII7VNS~1R:STQL.C'~VEETLVptIILEMCEIAIPLIERT
IJtt~lltY:Al~:1':::1<fl.LInY:y71'FJJInI'MVPINIrKlltnlll'IrtVM.AWIMF3.I:


L:HFI*1M't:f.VILDFIDHK::RKNDRRVLERLKE11N!!YDAARCTLG:HSEF.LVtMRpR'Ir'"lRE
I
~~


NHF::! t4JftJ"tU: Mtt::a:NA I I vl O
KTPFwIIV/ t E T CRDl.YIfV IMIK6113HLCLWHPEIASYN(IC


Kyl7lLrillh?IINWKOLK~VCLQINT::D:iVill.lRIYOFFr'LITGE.:fOLCI91 ILIbN IIILnHNY
11IW1


.IIIIC:.:14n:.b..MW t.,.;:t",r.,.:.
.. 1 ~IUlrt.yr.W :ittr.t::.


lI7
CPtL0957 1093~1? 10909E3
ide/ptr-Insulinase Eaeily/Protease III
KIYTRNCKNfWKLt.CPILICTSLSITSCEOQFkWPNOCPIQVSTPAAAOpICfEKI ICSN
GLPLLIISDPNLPTSGAALLVK'1~INADPEEYPGNAHFTEHCVFLGNGfYPEVSGFPCFL


CA 02350775 2001-05-11
WO 00/27994 PCTNS99l26923
Ie
~


VEOCLFIRKTVCRVOQ.SNL.F
RMDGIF!:YW'!J~(,YSfVLLGLAKLCYRGYD,iNW4J
D CPn !)'179 11:1:71 l' ~
a


/ '
1 httA-DD :ieriM/~d~'ltlYWllai' p
AWIOtGfIflIFKCLRRii.TAOCI ~ G':... !I.
-rERitITA:IVICItfIIiIATIICVP'IiINANPNVDEGR
ES


iP~~DIO.r~EIIVQLFSLYYOF.SQOLYFSFCQTIJiGLRCiSVACALtHKDNPIYfILCA80caOIIITKOLRS1~I
DI1VL~'LLRLPr~Af'SKKESRYStCP~00b"'.
VR .SC~fS~SkV~~fK


O .
PLIIGIL.7cEE'fFIASDSMFFKYTRHSpALASCiFAIVSOGKEPEVYIE.ILKKIHKDATPAWYIE.iFPKSOAVTH
PSpGRIK.PYCNPFOYFNCEFfNIIFFGLPSpRDIPOSKMVIt
SEDASOK3GYCYYNLKEIYOOP6YLECLIOKIW~falIILSECLiDVPIKSFKiITf
IT~.


.
~LVSPOCIIVTNNIIWED1GKIHVTLHOGOKYPATIIw.DPIITDIrIYIKIKS~.PY
VACf"'w~YHAGYLAKYIIF$LVSTPVIIICVASGFRYRRPYIGKCTLTrILf.iQSGCfAO?LI1


ALKiLRRRNIAYLLCICNVPGSAIAL.CVDNCLFGFaGVEIGVATTKAITSQLLLLVFIGLLSIr?ISOHLKVCDWAIA
IGNPFI:LOAT.:.ICYISJ1KCRNQLHIAD!'EDFIC:~AAINIGN


Kf.IWVIK7ALTHAE0t:3f~f:LQ.SLPptGpKLLANE;LHSWAOPYSYEDKfLFLCRRI~IYP:,rI:PLWLDrfjt
!tGVHfAiV:;rStrlfICIf:FAiP.ftaIAltRtIDDLIRD!:pVIRCFt~Y':.


..I; . . ..I, ...,:" . Y' ::.F"~Cln!.'!:YM:..:iiLrLS'."!Y i.i:f~.' ,y,. ..
L ~tif. .. .y:. v. :: ~\'f.'i. :..'MFRIiif:.:
...F. ..I Y.Li: ~i:.n . ..d 1i
.. hl h 1:.:.:.' -
~
i""
'


' 1;.~~
...:Nf:tA::l.l:.':!:v::v:l:n .. . .... .. - .. . . . .
. ........n ...... "".;t?IA..,.. '.!
..\.47AE!~J' ~:Sf ~!'SYF:.:tII..
. :
at:::'."PI':!.%L':PI'a':'


PRNLAKSVTVE TKGILL ZSVEPCSVnIASS:aIAPGOLLLAVNRQINS~
IECt.NRTLIt05Np1ENItilIfl9OCD


VfRFfALKPEE


CPn_0969 1111101 1111999
0980 1136981 1135501
CPn


cyrP_1-Tyrosine Transpore_1 _
VYVMSNKVLGCSLLIAGSI1IGAGVLAVPVLTAKGGFFPATFLYIVSWLPSMASGLCLLEV'si~ilsricy co
Saecharamyees serevssiae
hypocnecsul 53.9KD


MfstlitESKNWNa.SIIAFSILCNVGKISICLVYLfLFYSLLIAYFCfxIGHILCRVIMONPtocvin


LGIrtiIRHLGPIGFAfI~CPIINIIOfKVIDYQ~1R!lMIGLTV111GIlCAtGFLItIOPSFLFVMJItIAKKNAKP
WLIFFSTKDKt3YCDIIf'NNCSGKPt4lt.DSKHFDINSANFLi~JNt


vRS~ILTTf401FPVFFLAIGfpSIIPTLYYYlK7RKYGDVKKAILIG?LIPLVLYVLtiiWFISFPSISADSDHI40C
1I~ICAHFLVDHVNKfFDVPGWITPGHPPIIYASYKSCDPLSPfL


YiGIIVSLPILS0711CICCYTAVtALKOJWllIWAFYIAGiLIGlFIIt.VSBlVGVALGV!loFLMLY11IY090PA
OLSOCIi~I~!'ILREC~NLY


AOGt.KWNKXSNPFSIFFLTFIIPL11WAVCYPtIVLTCLKYIIGCFG71AVIICVFPTLIVWKPLNIIWLICGEBISG
SLiILPIWLGUtKEALW1DYLLIVDGuFL40tNPYVSIGitRGIVfM


CRYCKQNlIR00pLVFGGItFJIL!'I10TLLfVIHWSfYHELKI8LEG~IKD018f'.YttX'.IAYNMRALSiILSS
LHHPONSIAIEGFYDDLiIi.PSDSORPD


LPKSDTLREC~FRPOGYE7l?YSPEESALR%YEINGISCicrYTGPGFXTVIPYMTA


CPn_0970 1117153 1111618
Yt.9CRLVPNpDP~tAAHOVIIIIILROQVPBSLKFSYEILPGGSIMiRSWILpfVKVIt~I


cyrP_3-Tyrosine Transport_3
YSDLYNBiCLRLVMPATIPIGPLLGtAAOTSPfICGTSYLSDDIHAAaNFEIIDOLItIOCF


VYVMSNKVLOGSLLZAGS71IGi1CViJIVfHLTAKOGFFPATFLYIVSiiLFSMASCICLLIYLSICOLLDKLPKIKE
,


M'Iwl9LiSKNPVII~.&IHIESII~It~ICISICLVYLTLIYSLLIAYIC~NILCRVP!>CQN


IGISWIRNLGPLGFAILIGpIIMACTKVIOYCNRFPMItGLTVAIGIFCALGFLKIOPSFLCPn_0981 1137019
1139953


VRSyWL.ITINiIFpVPtGFGFQSIIpThYYYImKKVCDVIOtAILIG?LIpLVLYVLWiWZinc
Mecalloprocese tinsulansse
Eanilyt


VLG71VSLPIL~AICIfiL~fTAVFALKQJWR~IAfIffAGC.FGFFJ1WSSFNGVAI41!tm!'LVTLSMtAGDTYRN
FIIKSCJtDL.PEICSKLGFJIiNKPIGASIIOIIVNNDEIfIVIT~tICFIffC


ADCLKWlIKKSNPFSIFFLTFIIpWiAVCYPLIVLTCLKYlIOGlGA7IVI
IGVFpTLIVWK


CRYGKONHREKpLVP'C~71LPLMFLLIVIMWSIYNEI.


CPItr.0971 11f1697 1115~15
yecA-Transpore Penlease
DGSNGLYDRDYIQDSRVOGTtASRVYfR!IMTAGLIVTSCVALGLYPSGLYRSLFSPIiYMMC
PATLGVSFPINSKIOTIS1!SAVCGLFLLY
ALYhGLAAVYGJ1F'fKSI7LTKISKIIfl~FALIGLLLVTLVFAWSIOVSNPLIYLLICYf~GL
VIFYCLTAiIGADAIRRISSTICta~BJfLSYKISt*ffJlLIaIYCNVIINfIiYLLQIFSSSaNR
D
CPIr_0973 1116)77 11154)0


ttsY-Cell Division Procein FCSY


RCIIIHSLLFPSYLVSFLti.OLTLLWIFKPPRIDG.OSLFK81YISLDLICDALSLFYfJIDF


GTELTEELCAALRRTKKADiLSTIRDLITVLWSLOGLPSOA:90SSQTRPIVSLI.L?fNG


PGGfliL7lAI LR~t!LTLDI~FPIVPAI
SCKTn'1
YKGtSiSVIQ VA
WI
Wn


_
_
_
_
Q


t~01I CPItv0913 1111315 1139963
E0VRVFNDWpLSGLIF?RYDGSAIIGGI'LPOIAKRt3CIPi7fFIGYG
~iP


~.DLFLatKLFPLVDCI YipN taaily .


KIIS.ASVhBILPVSLiYCIi.ISGCVFFI.
sNfYS85LYlWOCRAPLEKIOK<.Ol~Ipti.O'!SL


CPIt_0973 1116116 1117537
NLiRIp&pi.Ii~DFSNRW.SSHKLIKDM1t66AQN1flrro'fSKSFQSILSPIQTri.Tl11031


'suoC-Succinyl-CoA Synchecase.
Beca'LiTF~fRtt~IORLKCOISOLLAV6KKLEIIC17M1hDIL10Ip0iI0LMifiT.


IPPYWVVSSCELCELLITKSGLDSAWKWVtWG
AWG.iYGDYDSpTT8A0GItFRADIIIRLP00RCLIID11KAPISD~iIIFSVMl01m0i.V0K


GRGI0K9GVIVJ11tS8XiILQIIVAKLht'InI0IPT5NQTADGFLPVEKVLISPWAIQRaYYVAVIK~IIKTLKSKS
YWGIPHOSPiYVILFLPCKSLFND11IRIJIp~MIQAiINVIfiSpLT


IImRJOIRCPVIJQSKAGQIDIT.EVJWSSPWILTLPLTSYGHIYSYpL110ATK1101WiGiLLALLKTIAYNiIIQC
IL4KQIQ6YSt.LGKCWRRIQW!'l'HPQKIOIIfILJp'tVOlYlal'


Vt4pNOLIIOG~KCFYAD7VSLLCINPLVLTLCGCLLVLDSKITID1S111LY1WPNG6VLSSInYRVL.PTLRKFiGL
CfSSSI~ILEP?PIESL1TSFPNTCDIDTtIUIYf


YDP8Q6lNRDVi.NfOIGL&YIALSC~ItIGCIVNGil0i.7WSTi.DILIOiKKitiIlAtiFLDNGGG


ASpKQIQEAVSLVLSDESVKVLFINIFGGtI~CSWASGLVAVNCCADOVIIP'fVIRLI%:TCPn..0981
1173015 1171306


NvEtGKiIVpQSGIPCQtYSSMF.OfiIIPRAVELSNpssA-Glycerol-Serine
Phosphacidyltransferase


KNPf.CY0QK1aJ10ID1171fx.DLIJ1I1GKRRVYtPNAfTAIChCCGLFI
IFKSVLRTSSBViL


CPn_0971 1117537 1118133
FHRt.OGLSLLLISJWIJ1~SOGAIARiIBLAFSAPCJIpFDSLSDAYfIGIAPPLIAIRfLD


'sucD-Suecinyl-CoA Synchetase.
AlphaGfYVG~IFFSSW.LI?SIIYSLCGVLRLVRYNLFSOKTVDVSKpYCFIGLPIPAAAASiVS


VCRFRRYMFNSLSKNfPIIInCITGKAGSFHTLpCL~YGTNFVCGVTPCKGGTiJit.DLPVULILiISDIFPOLPJYO
LRVGLLSFALLFIOGLMISPWKFP4InWFRINVSSFLLWTICL


YDSVLF~IKpJIT3GRATMIFVPPPYAAEJ1ILF~!'.1GIELIVCIT~IPVRDMLNARVMDAACLFFSGLVDHFVIVY
FFLVSWLYTLYGFPIFSIIYRKKS


NST50LIGPNCPCIIKPGECKIGIIIPGIffHLPGNIGWSRSGTGTYSAVWOLTQLItIGDS


ICVCIGGDP111C1'SFIDVLpALfJf~PYTELILMIGEIGGS71i6FJVAlIWIQANCTKPWAFCPIL0981
1173170 1115510


IaGYI'APIIGKRMGHAGAIISGNSCDAKSKfWLRItSCYTWESP1WIGK'NDAVLMItEL'nrtlA-
Ribonucleoside Reduecase.
Large Chain'


GKVMVIYEtKNYTIVKRNGNFVpFNODRIFOALEAAFRDTRSLETSSPLPKI7t.EtsIJIQI


CRt_0975 1119075 1119677
TNKWKEVLAKISDGQWZ'VERfODI.VESOLYISCLpDVARDYIVYR00RKJlER011SSSI


No ralttlsc homolo0 Prnenc in
Genabank/D~LIAIIRRD(>GSAKFNPItKISAALE1WRJ1TLQINQMTPPATLSEINDLTLRIVCDVGSLHC
as of 11/7/98


3IEEQVALSIAIKfLKIItJILILFPLVLLJ1WVIRYQIJiANFHCSWPFPGPSVNpJIYKCSEEAIM.iEIQDIVCKO
LINAGYYDVA10JYILYR


EAKIEEM.DLLDLITLEWSSRCLRQI>M'FANRLEZELIOELRVSETE6LISLOGKRNLVRQKEDf:TTYLL1IKTDLE
KRF&WAGKRFPKZTDSOLL71DMJ1Fl8dLYSCIKEDtYI'fACLIIIA


:.LLTHFPNPPKRSRVESVGHEWFPVFDRLKREEEIICOGPITRSNEELWALLDHCTARGRAMIEREPDYAPIAAiLL?
SSLYEE3'LCCSSODPNLSEINKKHFKEYILNGEIYRLNpL


IHKTLWfSIFFKYLTQIELF
KDYDLOAi3EVLDLSRDOpFSYMCV~R.YDRYINIJtEGRRLETAQIPIi~OIVSMGLatJ~G


EQ10011tAITFYNLLB'IFRYTpATPtLFNSCMRHSpLSSCYLSTVKDDLSHIYJfVIIDNAL


CPn_0976 1130079 1131185
LSKWAOCfGNLfNfOVPATGAVIKG7TICKSQGVIPFIINAIIDTAIAVHpGGKRKC:AIICVYL


No robusc ttoll~loq pnsenc in
Genebsnk/DIBLfrIWNLDYEDTLELRKNTGDERRRTHDINTASWIPDLPFKP.LEItKplPfLFSP00VPGWi
as of 11/'!/98


II*1LVYCFDPS1IPTSPFJiR(lfAALpRWFFt.OCHW1RILTLECt511!MFOENMSISTVi!(IAYCLEFEKLYEE
YERKVFSCEfRLYKKVEAEtILWRKMLSMLYE1CHPWITPKDPBNIRiN


LKLISYLLIPIVLIALLIRCFLIISRPKCNWKCDSISOWIVPHDVQPFNDFOLFNNOERLNQDI~CSMLZCILtlICSF
a~ETAVCN4:SINLVENIPDIDKLDE&KLKCTISIAIRIL


IWKHRRWSCItrirt*NPVDYLRSQFPCFKEIPEAIRCENYVSDGQFS~SYLRAMLTONVIDWFYPTPEAKQIINLTHR
AVGICJMGFOtA/LYEWISIASOEAVEFSDG:SiIIAY


DIVCIfILSLDETYWfNVILKIRMICITFESFPCKEADPN1ISPRVTHHYFDESWKAGARNVYAft.A.iSLLAKBRCT
YASYSGSKWDNCILPLDTIEL4KETRCEHNVLVDTSSKIIdffPVII


LGOQiIIMRL.OEALfRTEKPGKOGECITKOFLKDYCKKHLEVNSCPDFIESLVDEKIREFDTIQKYriARNSOVMAIA
PTATISNtfCVTOSLEPM1!KHLf'JKSNLSGEFTIPNTYLIKKL


RCPSIWSAVCDVIDRKCOEHLLKAIINEANRRLFCl50JSSPTMI~NQVLFY'fIFSPPKLKEIGtJVDAEMLODLIfY
f'OGGLLEIERfFNNLKKLPLTAFEIEPEWtIE~fSRRpI01IDE1G


PPM$SVYF .
VSI*6:lLAEPDGKKLSNMYLTAWKKGLK11YYLR..~QAATa~~/EKSFIDINKRGIOPRi150i


KSAST.i T WERKITPtrt.'~MEEGCESCO


CPn_0977 1131339 113310:


No~toousc ttomoloq prnenc in u~enebank/EMBLCPn_09s15 1115173 11)!:571
ss of 11/7/95


LYIlIQFANtLKSSFt*fEVYSFSPSVRTSFOIiRVNAALONWFFLCCRALKWSLDSCNSCQ'nfd8-
Ritmnucleonide HrNiucca.~.N.
:amLl !:h.tin-


aCELrNPIDT'ffKVLKItSYLLIPIVtIALLfRYLLJlSNFTAKV3QKPWLKTLQLGIDIKI3VNKY~'..RKKNNPR
LFN!:RRLRIL:iITEK7ktAKMEADiLI7:KLKRVFV::KKCLVNCNpV


::FtLPCiIiVllMO:fATLFKAIRLECKPVDViYHRLHS01IWFYIPAQKLPDDLRLTMdLWt~LYFIKYKWIIWPiI
YLNI7CAWAdLPTEVI'MAF.DIELWY.~.LEI::EDERRVILW4iFPfs


PEKE'IRKTEYVRNMWiVMCIfLT~.aQtiKERIQQWv~DSRa3TrIJCABKVLQYRFIONPQSQ'fAE:II.V':NNI
VLAIPKIIITNfEAf~YLWWAFEFJIVIfMITFL'(It:F':4:LDEI:EVFNAYN


':.EFURLWF*IITTK'Cw~nEDKIYVQ~DLFOIUIFOCIrWPQFI;7VI0.'~.PTFSEELVHENSOKLOPAA!:IRA
KDOff.Mff(.TV(WI.MtIF.~.VV.~.::FLaa7~YIKNhN:%YIINI1:IFPY!X:PVIIIL.i


LOr:tYPEDDEFEDKFt.NfL.LKAVtJIMt:FECC~S!A~YIFLICPIkILIWIPFLJOIpKFtIRpIWtTfta:EUY
VY11.RDI?IIILNI:IDLIN:IKEfIII'~IWITl:I4tEEIVALIEKAVi


f.EIEYAYD!:LPR!a4:IJC::?tFIIWVPIIIAINthI.I:H
I<:LYf'1'/tC:HNi'YfWIL:P~11M14K


!.'hue U 17N t 133ti5 ! L l.'. )h'tRKNPFETRVTFY!Ifl4VII::W
;


w. tanu:a tNrrnrrlrxl uw~aene 1n
tamrl.tnk/!?IOL .,:: c ll/7/'tN


KYFFlIEVY::1'IIf'AVR'P!:FVIIRVIIAALpAWFF4w:lIRU!W::Lp::!:N:X'.WAYOELY.iI"'M'nIY
y'rW. I1 u./l'.'. 1I of!..


I:IM.1!LI~:ILLVI'IVIIALLfRCLGi:NFRIDVEKERWf.KIREIl:IDIF~KLP:SW!>pY.ut114r'W'r..
l rNtM 11~rlWl.e:.


V:::;FIWFY.KnK::KItPRIDVDYIITLH:vIfWINFPtt'FQKII'KT:RF:iYWF"..QK!'fRKROYV1'LLFN
YtVOI~:PI''YIYIKt7!NIwInpn:Vl.%41'NIfII'I:IIVIIY:.T::71YJE:YF~NIM'::Ilk'YiX:


NlwI.IMIVU:YLT::ITXaStIWYL::K'P:~IQ::1T::LDFERVLQIV'.LTDIIDEWGEVORLLNEE:a:NC:LW
'N/KtAyKIHVII1.WIAVPx.let'IA'fNKIW::YNININ~fr~IILI!Ith7:fAY.'fFHJYWP


::ATY.:::X;OKLVf.I::I1VSDIIC(k~IItFKFLE'VIV;:FAFIF.ELVEEII::f:KLliLDFIGLEKIWllt
YINI!LWNFPDfUII'KMNIINKIINId~rl<:1S'.l:I::It:aIL::AVNALATINaffYlJ*::II:A


'I'!.IS~RLRN'.:LLNAWIIIH : aY:VD I yfIIIJIPRMFTI"IY I KlfPnPi al::wl'f7
IKI!Vf:M:L t I1TF.A t~~f.~~ I PF::R::n rllrfY aJN l F'ITl:F I KKIk t
1


118


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
1,
Pt70PRNLYLCKTPtt:WEPSpu LFTPLFLILYLLfI'1~VP::RCiANSPiSCS
CFn U~tN7 1117491 llJN115
PARNLLIICONKVfFAGVACIEk.ucEELLEIVDPLKNPNIti"SLJCRIPK4yLt.iGPIGlG
YtVe-Like Pr~iccee rRtlA metetyiase
KTLIAKAV"''QG~t911~I1~3DPV~1~'"t~"~ll''EpAI~
LQ~CTFAIGPITtPAYRTLLT)INVNOVrHEIPKTL1NPCDTVIMTCCNCNDSLFLAALL.O RNRCACIOOGND
6LVENDOPOTRECVIIIhAIrINRP
CCGRL11VYOM1(G4aNALLLFE'INLSEOERSVIEtIKEDSNENILEKDVKLIHYM.GYLP
VlIR.PDIKGRFEII11VNAKAIKLDPTVDLIIAVARST
KCMtEITTLMTTEISLi.YJItJ~tIVRPDCLLWVCYPGNPEGEKETNSVE7IA0RLNPKEW
TAVTAVDVAGUtDKVLYGKDtRSLENDAEERKTIAYHESCHAWCLL11~NGDPVOKVfII
'YS.iFYYANRCRAPRLFLlORQC,i ,F'.aSIIDKC
PRCL6LCJ1TNFLPEKtIlail'AttKELYOpLAYLNCCRAAEEIFLCDIaS('rIpODISOATIG.
VR~~NVCGrr:NSPOLGNY:YDCR50CL;GYOCYNEKSYSEETA1C'IDTELRNLLON1YDRA
::::i:.l:::~:..K.":.".~ .. .... . ..::'KE:"t.'fv;y; "::i:'.. :. ..:'::r
:!LP47Li.i.w:
.. ~y'.H. ;r.~. .;.,~j.tn.,. rrTtn.~ N...p,... :~I~t~r~I~f;F~...m .. ., .
KPFINLINLDOCILKNKfAAPNtIPPPPVRRSVWIJtRYSTFRIGCPANYFKJItHTIEEARE
VIRfLNSINYPFLIIGKCSNCLfDDRCfDCIYLYNAIYGKDPLEDMIKAYStx.SFA71LIG
KATAYNCIfSGLEFAAGIPCSVCGAIttIQAC'ft~ILSDISSVVPNVITINSEGG.CS~fSVEEL
~LSYRSSRFNRQpECIL.iIITPI0LS1~QVSADHSIISIt~IRLKfOPYI'QPSACCIIRNPEG
TSAGKLIDAACLJ~.11ZGWOISPLNANFIIN1G%J1TSDEVICOLIAIIOSTLKTQCIDLE
HEIRIIPYQPKINSPVSEK
CPn_0989 1179552 1179016
CTS77 hypothetical Deocein
LRTSLaVNCVLLTIFNLLVNJtTLSPEKP'SGSPISISKEFPCOitllHtEIILQ'S.YAL.DNAPS
AEDSLVPLIJISQTAVSOKHVLVAIhIpTKSILEKSOELDLIIGNALIfIIKSPDSLDLV110rV
LRLTLFEHPYSPPINKAILIALAIRLVKXPSYSE71CPFIpAII3iDIfTDSSIJ4fNSLSI
Cpn_0990 1179A80 1140~10
intC-Initiation Factor 3 ~ QRIENISOVVK~DIIOVKLLSINfltGGLKLSHKATLE ,
SVAINFKINROIRJ1PKVRLIGSAGEpLCILAiKDALDLARE11CLDLVEVASNSEPPVCKI
I~IfOKYRYCLTl4CAmSKKAQNQVRIKEVKLKPNIDENDFSTKtl00ARTFVEKCiIKVKIT CPtt,_1000
1157197 1152091
CMFRGItEGYPENCFKwOKNSpGLEDIGPVGEPKLN3RSLICWAPCMfI'ttttKQFJtS rsl5-815
Ribosanlal Protein
SAFAI1IILRRNPNSLDKCTI~EITKKFQLHEKD'fCSADVpIAILTENIAELItENLIOtSPK
Op~RLALLKLVGQRRKGLIYLNSTDTERYIOiLITRiJR.RK
CPtt_0991 1110391 1110611
1001 1157769 1157A69
CPL>


r175-L75 Ribosomal Ptrocein _
KORKNRKSLIPKK~K1NKSVSMFXLTTRPCKRttIQYthC-tYtosins deminase
SKKSSGEKiINLSItOPLVD


KCpVODfKRIIC.V
YYLEIrGCEKLIt~KDIFPI~pOAPKEAAKAYDQDIVPtiIsCVIVImDICIIAIWlI4V~CJt


DATAWIEILCIGSAAODL.0NNiILLDTVLYCTLEPCI1lC1tCltlt?LNtIPRIVW11APD11RLD


CPtl.-0997 1110612 1110996
AGGSWVNIP'fEENPFtPIIISC1CCVCSEFJIBIIIIJ00IffPVPJ(RRENSEK


r120-L20 Rlboeamal Proclln


GIfhVNVRItT(iSyAgRRRRKRILKpAKGpyIGt>RKCHIROSR.,SVIOiANItFIIYtOiqtmRlOGDCPIL1002
1157111 115089


FRSWIARU~IVASRINSLSYSRLIDK:LKCANISt~iRpG.SEiAINNPOGFAEIAtaQAptACTfIS
hypothetical protein


LEATV
KSAERIMCIKIVtLLDOLYEDOCSRt.QKLCEElVPNLTPEDLIQPNDFPOL1!>u~WAPRfE


DGVLSGICEVPAAILAALSpEN


CPn_0993 1110975 1141070


-pheS-Phettylalattyl cRNl1 Synchecase.CPtt_1007 1151862 1151091
Alpha-


KSttCSttSLGIRIS1~9t8EIEAVKQpFNSBGDOVNSSOALiIDLICVRYL~GIFRSFSEKCTll6
hypothetical protein


LKpLTDKAKI~.SLINDFKTYVEOLLDEKSLVLi.ILSEQAEA!'SKEKIDSSLPGDSQP90GRTBNkTINPLIJ~GPD
ROIAGRASIrnVIFPDKtSINPPNLSKii.KKLPSVILYI'SCIAPIISY


HILKSILDDNVDIFVHtGFCVREAPNIESSJIiHIFI'LWITCDIIPAII~ItDI'FYil9ATTVLIINIDrIGIFGLL
EItJtL9NtGI0KNNIwQFLTYPLITJ1DSLSIi~SFEI1'pRLti.Rlnl


RTtffSNWiIRELKKGOPlIKVVAF'Gi.CFAN~ILDFTLFYKJIIpHLIRKUGAFSVLVNISVroAi.IIGJ1VIJ10
FMAt.INSSQHFIOPESZD~Ii
TA


ILSAFYNSFfORKTID.RFRNSYFPPVEPGILVD11SCECCGKCC71LCKH'1GMLEVAG7101I.LTVOIFLDPEKRI
TICPTPLSIfSItdiGFLFVLCFYCCILIPSCJ1PLLLIJ1811LAIVi~IIL


NPWLRNfZiVDPEIYSCYAVtiICIERLaNLtfYGVSDIRLFSENDLRFLQpFSI~CIpbPYTTSLRF


CPI!_0991 1112771 1111110 CPn",1004 1155115 1151A79


Ci177 hypothetical protein CTB4T hypotheciul Dlrouin


L!>illIIRDGRI~IfitSRRMEpALENi.EIQ.KEISL71TSe1DSIlLINPARFI~RKO't~SSVNlD9CNLSIEEZ
t4SIQPVSNTTPKADKIfIPDSTKVISDSITINKQSAFYPCISNQ.RtifilTlY


EJ1LKNVaJYLLEISCVSXStitIDKAtJVSDPLIAGV~MSFL8A08~.YKSLLDEYSEV'tOKSIL71VL1xN1'IVC
OQRVKELItC.PLLKVPDI4KKDCSDDEYIQJpNCI0I1Y06~QIS


IG PECFLNNL
ANRQMIQQELSSAt70RrW11NOKSVNS1TIESNQILpJITSSIH.STLKELTIKJ1NLTWPID


NDJILVpIIYI(QIQfLNtNNDGDPLT1ITL.IJ~dd8E8VIDIIASSLVN1iG11PLItLFY~tALSN


LDIIaWKVtINAVNIILPfSRYEAZlIVIKSPKKNNisiYFNDFLLPLRWIItDI1a~11fIDSDCPn_1005
1155957 1155115


ERKqfKLt,ISALSLCIFGSKLVPEEASRYLYFNIQTKLENtINOKKPLSPGQYLTDAYEELCTAIA
hypochetiul Drotein


tiRLISKYPNGPL!'KAN<RIVLtNLRRPYOPNILGILPSLCC1'LKiJGK$IDIIRSPSPV'1'QNRRPVRLfMWIID
PLSAKICPI4AAINVPGTPITt70PN1'ATADDIIAKPSKDSNPLNNY


SSILYFLGFL.NaIGNRSEVfLVLNIONRISRKERARSRVIEEALEOEBNAPYVNYVYOSVLVApONLSIIAQEGQANS
SAOTYfJJIJOFJ1LYQWSIPKNKIJtDItBSSYLptIp$


AFS!'PCPEELL.pNLESItIGDZE1TADPFSILpEtFHKPLG&SFPL?KELKEFVCS1LKEONOAIGASROAIONDIS
SLCNIU1QVISSNLNPt~NII0pSI4VC0ALI0TlSCIVSLIAN


KLTALKDIFFAKIUCILTAN~C.LLLHLLSYLIVPKLIERTNPNSIWVSKDGLDYVSVPII


AGPJ1FPSRGfWDGfSLKLLLTNVLSP?LVARDRLVFVSNIELLSKFVNCLKKNRQGFSS


LKSpFKDDIECK'IEPTCYLNELTEYStp00JI. CPtL1C06 1156197 1155990


CTS19 hypoehecical Drocein


CPt>_0995 1115515 1114115
TKVNFPINSITTL.GTLPIVNfINSSRPPLEPildfPKIGI1VL!'SIYELIiAAIEIRDdM.


CT87A hypothetical protein
TCSQOLNDNTNIpOQLNOLTNDIKYAIVSAGAKEDEITRVONQNpNYSAQRSNIO~LV'f


RNLIwKRNtvLTRFNFALTSLLVLALIFYASINHStJiTLKCASTMSt7ASVKLSILYYL71QTRaNGQIIL5HA51NI
NIIQpQSSQDSSPIKT?'NSICS1'VtJQLNKPLC


LSLKAEPLIPOLVAVATTSTLFANQNIOtEIILIrQASGLSLKStJOIPLLLSCAVIIQRILYA


NFQWLHPICEKISITKEMiDRGTfDKEpGKIPALYLKD01YLLYSSIEPKTI.TLiaIVIwICPn_1007 1156689
1156907


KDPttTIYTNEKLAF1TLSLPIGLJVrt'OfFANDSENLELKEFFl7l9tEFPEICPNFYPi~tPFS07519.1
hypothetical Drotein


KLFSJIGIJlO'1RLSEPPKItIPWNJ1T0LGLSTpVPpRILSLi.AQFYYVLISPLACNAAIILSALwYKSLNDEEKD
VSCNECNDYPEVFKDDVSAYVLVTCCQNSSECKIQVt?IflfDPAYIS


YIJ:LRFSRTPM'WYLIPLCI11NIFFVFLKACiVLASSSVLPTLPVNAFPLIVLPLLTNYLLTKARDSLDES


YAYAIQ.Q


CPn_t008 115d901 1158227


CPn_0996 1116592 1115519 CT81A50 hypothetical pcocein


CT879 hypothetical Drotein
VLNYSFII;MLKPNYVLSKRLYRWVNpLIKLCDLVI0JSR3F5VEWVPISALLLIf'CCIQCA


ANpLLWICVLIFRYLKTAAFCTLSLICISIISSWEIVAYIAKOVPYDTVLRLNAYDIPYLSWKVSLVPFLLLFSFLIfF
LILCPRCKCYALLtrGSfFVTLYVAKYV1IDETLYVSINGSGL


LPPILPCSCPVSAFSLFRKLSDNNfOffFLRASCASQSIIMPPVIJIVSCAICCLNFY1'CSE:VSPLLAPCLFLt%7V
Wt 1QEEEtiVKGKEpLRLSEDLDApRSAYEDLLLTKSQOCEFLOAR


L.ASICRYQTCKEIAIeI~M'SPAL.tJ.p'tLpKKENNRIFIAVDHCAKSKFDNVIVALKGIJtIEApCLDRELTf7C
QEt.LKA?.YCKOEYLTIDLKILADpKN~ILEDYAELNNKYIELVSK~DV


ISNVCIiRSIIPD1'IICDIVKAKtriNPISKLPDSLTESSSPSSpRPYIETLDELL1PKITSVFPWVAEPSVCCSOC$
ERVDVSRWVSAt-0EKEESLLRLPNEILVEKpIICSDYOIRCptiL.G


TLFIIGKSYLKTRTDYLPWKOLVKQSLIWSHLpETLNRVAIGFLCITLTYACNILGINKPRLLLDNFTALERRCEELVW
LWpKITpINELi~LVCK3EElfVSVEPSJINAEtSCVEEKDYK


FRKSIALYFIFPILDLILLIVCKNTKNLpLAIrLFVFppLVSMNPAARAYRFSRCYACLY:QWEpFLEK:,E'1L3LYR
KKLFAVDFKYLTLKKKEELTKpDISPDDISMICDLLERI


EILEEELiHLEELVSRSt.SL


l:pn_0997 111d699 111Tbd1


mesf-PP-Loop supertamily ATPase CPn_InOU 1159095 t1581Rd


AYIttM.GSDLLRODKOLDLPFASLpVKKRYLLAGs"(iC3DSLFLFYLLKERCVSFTAVHIDmaptNfthioninN
.leitutpnptadaae


Htlllt'o'CiIWEIIKELEELCAREGVPEVLYTLTAE00CDKDLENDARKKRYAFLYESYRpLD'fRLLHR'/I41KN
NDR'Nt.'C:ivRNWKOrHYPpPPNNSpFJILKOHYASQYNILLKTPCOKAK


.lcxl:FLAHHANDOAETVLKRLLE,~.JWLTNLKAM7IER.iNEDVLLLRPLLNIPKS.iLKEALf
YNNC.YJITARILDELt.'KA:i~'KCYITNELDEL.~.pELNKI!'fDAIMPPIfYI:SPPPPKTICtS


OAR(:I::IWUp::NEDERYLRARNRKKLPPWLEEVPCP.ttITFPLLTLCEE.sAEL::EYLEKpIJtEVtt:IY:IP
NDtf'LKL1:DIMJIDV::I'.IVDf:YYCV_'CPMMIr:EVpEiKKKICOAAL.~L


..WPFF.~.M'191VD:~tt:ES.PCPDt.'LI()()Aff.CKWVNKKFFHNAC:LAV::RHFLCMV'IDHL::R~tIL
:aAILYfr3tfIt:EI.:F.IIt:rIRApT'lt:F~WDpF'h.IF:W:IEFIIENPYVPHYIUiRyMIP


a:ATtJilIRNK t V t t KItJVW t D 1 J11~?I t I-P I Ef'M 1 tllh
7KKta:1.17DPKN1AJEAR'fCDtK~G::A(~WEIII'
I1 tTETt:YEiLTLIJ1D


':Itr ~19.')H Il17Ni1 IISOSNd ar:_Ir:l': 11'.tr:'!5 115NH.'7


tt::l1 A'f1'-rls'paui.nt =1110 Wtrtt.t.tGreLr'.:~ n~tn tn:r i..rl Ivrt.rrt


LI:a:NKtTI::KUKKNKI'EPKKNFI'IVFFFLLFr:WPr:P/At'(INt'1.11t:KK.VtVn:FatIQLEI1'MLI
LIJII:a.l.l"tVI.FP::W ;:fl"/tVII.WCN1'::RKY'/lPVILRtI'1.4'AIf:ALILFVITfIR


I.VNLJILIVI'Efk:IIKIALNDNLV
>F(7LRFRLRIOTQEf'fiIJtYIIYI.I:LIt)t7t:HRLDLDLOETU.':FFQPLfiR:l.sit':rl
t.'t:Fl.1.t'N::IKMI1J11'NPEPAYDUf.~.KTF.PIFFPIJ1F('VI'If:PA


N:a:M'fI:KEVTtJ:atldF:.At:li::PIPS.YAt:;YN:E-
I:I.::VIa'HPLVV'1'f:PIICP'QLINL'!t'fAtJ::lNt:l:ta'l:.I:RIIt'f:VIttAYIAF.~.t.tTt
.tl:::::FFIXeLtY:NFY:L1~LERLPISIAL


IC:1 .~t:P
YI1't.:R::1'F'ALRTW:::DLYELICKYt::I'VII:Ir::aTLKREIJCttLYpQVEVSLTO1JJA::VtNf4IJ
:.:I::1 AI'N I, a1'
I.:


tTtJfFJIAV'f 1.W .'QVt:: f WR
LS:i.~.LW::EOfJERF:aJLtr::VRLYREPJe1KY11KLVPrIRDtl:


VI1VLEIttJa:Rl::(j111WYPNNOEt~::RSLEKODPEVftNIWFJItLIKEtWfAFKFNII::L::FKA'an_14
11 1 n.n um t I'.~rNt:!


111YY~~-


119


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
1:1


DaLppELLC3RRCfRSEfYA uEKKLETKVpIKD4.'KCLF,'~QDQDSNGFOKKSPL<C'C


'TN'il Irytnchetieal proton CrSRKNRIAKAAOAVPVIPPP:n.
YF'a3YLLTKOGIG..ZDP.:.4YCCNKDSVfSI'ORELDA
4V~tY;n'VKORNL'ILLRESPFArI Y
:JL7QiLITtILAL '
C'IL
'


..... .V3l~iLGLSfX~LA~iI
. WOLRLETLKV~K:;ItRC1G10S11:~1~fIJ18k
. ~IVI'~
FItRLKNYPHipf.irFLPC ~
xtJ<VIXfWGLEIM7CIAYLLIILVRrYLRLt:KEEptTP'fKfNMSPSYS

lt


AMfALYt:IJ IL
a LIiW IKCLaDKFNDNLENICPLK>rCEGIIRKI
tTL LO


Pf. T.'.P fALPW P,:P3CI ..~sP I
L~,.'71IKGLOPAIESCNAALRCAILFSQAEIYKLKGKL1'K
lOLOIG.KSFOROOIIYE~R


s'OF.LL~SIESSFEALSRLIlIYIRtELDOVYLH:.LRG


~Pn_1012 1 L5Z3Z0 1160121


y:.:0-AOC c ranoporcer penlleass
"FF'"PPP.i:IA.:.~.fI4SLPLLL t.'Pn_IOCI 1171270 L171b9H
YMKYKFIfYFVTVF
~
LLFIS~tCffSRMPPTF
~


. ' . '.. w - '
. ..
. ..
AIP. .
.LIT; ..
'' . ...~L\iL::::ri r
"lF r
v:
"'.,i '
~ :
: /
~
'


.. ,F
.. rT
. rc
.. l
.dlrrir. .c
: err.
. ~t:t:.
_" ~~.'. ~-p.,; 1::: r,.',:.
;;y,RIfJI.\LP il::
.:. .~..;.......Ia.i?'
:y.'-.: .!.'111
r ' ~ ' -' "" "
~;.';..
i
r~


.
TfGKIfKIa~EGL.EK:r"fKtvIMAYLGKDYAK.iITVFkWLYFFNPfVSKFWP'aiir13IJ01a
......
..
:r.: ~.. .
.
LGILHLLSRPNYQ1ELAFAGLATLSILT 'L.~LF
~
'


aAGFaALAGiiNASOSt:
ECYSOAWAYufTAVLRDXDPYPHYYAYICYTL.TNENLEAEKALDfAWVRiIt)HIIILYNR.
:IIFa
GL.KIAIC
KSV111LKALSVLALIPINLIPWKDNSKSPPtaUD~R.TSL
SLK


N KEEILDIRKHK
7ITLLICKLIrSLrRYKRN
TLLL~WI'PNPNNIPLYAGVAKCYPKpMGLDLOtQIClIDSSSAVPNVLft0Y0llALYHAL.G



INKT SIKQIPIOIVCRLZDSSLODFLYRSGDPIYKFmLNDKVL.GFCiJ~t~RTILfRIi.ET

' 1022 1175709 1171:16
CPn


IGHPVIOCrLBaICOLP .-
WpNCVIIPSEVIaIVSBDLISt>NLtJsRtIDFLYCAFYNI1931IKL01CTt63 hypotheraeal Drotsin



TGPQLIVrfKttGTKASEPEIVGinKAt.OESIIFSKDNP~rKLYAKLTKSIPItNLYOErSTFIIfALKLGIlIIfPV
PSAVPSANITLKEDSS1YST115fiILKTA1~EYLIISCfAL~SS


YI:pWEElfPLLIpSODPLSKDLVDKLLE'tIIKRYPELASEVAXrST.NDLYNPSLPEBD~fTtlu.ISLAtGOIILA
TQQELLLOSINVHOLLrLPPI:VVELBICWDLLVCLOIAtTITS


6PpCICTOSRSEQrLP0088S1I05ALSPRSLCPEISDSKpp(~AIQTPKD$AVPl01SGP8


CPI>_1013 1162209 1163621
PEl~pIIMSLSQASSSSQRSLPP0E8APaTLLtOGKASSriPLSOrSABItpItGLTISKS


lulK-ruearste Nydracass
NELY1~POODROGRECHDRGDOE~KKKIOOIRGLCVGVAEEl0~.0IMLIrSD
RENSWNRGNIDNROEKDSLCIVEYPfDICLYGApTNRSRNFrSWGPBL?tPYEYIMLVNI



KKCAAQATIpDLCfLDSKNCDNIVAMDEILOGCFECHrPLKWIp'PCSGTQSNIIiVNCVIAONRPPAEETSK>CEl'I
FIUtKLPSPHSVISRFIPSKNPLSVCSSINGPIQTPKV~MIiVl1


NLAIRNItOGVLGSKDPLMPNDNVNK~SSNflVIPTA1HIAAYISLIC11LLLPALDtIIIItVI.KLJfARIf.ODAG
EANELYHRVKORTDDVDTLTVLISKIIRI~ILRfS6DICD9tALT.I~DtA


DAKVEBERHIHKIGRiHL?IDAVPKILGQEFSDYSSD4RIICLESIArSLiWLYBZiIIGATAKEICVTI
ITQIIEKIIBlQRHLQEISOCNOARSN


Vr.TCWVPEGPVafIIHYLRICETDLPrIPASNYrSALSCHDALVDAfIGSGTLAGLTKIVGKLLKELlOTIFIYHLRP

~


ATt7LSFf.CSGPPG'CLGI~S.ffPFNEPCBSINPGXVNP1'CCGT.O~CAQ~~IOTVII~. .

' CPt~1023 1176005 1176331
Y1A


RCNpE~IMCpVIIyNFLpSVpyiS>57,01AFSEfFVKOLKVNKARi~fINNSLIG.Ho robust hoKbloq
prestnc in
4swbank/E~I. as o! 11/7/95


LAPVLGYDKCSKAALKArHESISLKEACLALGYLSI~ETDRLYYP8t~5~Na.DFLLIFINKKWrLSIIrFATYCASIL
4AVTWAVPLSEAPCKIQVItPVVIiL.QPOEEQ


CPeL1011 1165156 1163732
GSVIYSFfIfPYDYGYYYPL"LYCYTRlI~OESRLCY'IRrEDCI'IIYLCD


yehH-Sullace Transporter
AyAgTLCYGIVKVpW1r101rIpTtLYTSIKD3Y5FPL1'FKKI~pAGITIIfiZLIrPFAIAIJ1ICPt1_1021
1171317 1176334


CVGVSPIOGLLASIIGGLLASAHDCSNVLISCPSSAFISILYCLSAIDfGAItALlllrlLLitxerD-
InceOrast/rscombinase
'


CVILIAFGLTGL.CTP'IKYNPYPVtIrGLTTCL71IIITSSQIKDFLGtpGIINIPADrLPIdftILCDFSIJLSVDI
GICQQSIAA
IfIFPNISI:CSLKIAPLPILKLN8IJ1SHTNPSTQFIII


IAYWDIILWIWDSKSFAVGGLTLLINIYFRNYKPRYPGYNIAIYIA'1'fLVYIL3.EIDIPTIGYRQDISSFLTISAI
SSP~ISpNSVYIFABELYRRItLAITILIIRRLIALKVIrLrLKDOG


SRYCfLPTAIPLPKIPOLSITIfILOIJtPDALTI71VL9CLLTT.LSAWA17G9flGNNK)iSICLLPYPPIIEHPKI
NIOtLPSVLTPOE<tDI1LL71VPLONERIP11HIJ1FRDTAILHTLYiIGYR


QLVApGVANZCTSLfSGIPYfGSLSRTAASIXSGTTPIAGIVNSIFICFILLL.LAPLTVVSSIGDLRLGHYSDDCIRV
fGKDSKTItLVPLGSMRGIDAYLCPrRDOYOIDDSIN~IL


KIPLTCLAAVLILIAWWhl9fSEIMiFIHLITAPKKDIWLLTVFILTVIfI'IITAAVpYOIII.FLSTRGHKLLRSCV
WRRIInfYAKOVTSKlVSPHSLRNAFATHLLLSfICADL1WI0~
'


AAFLtTIKOMSI7LSDVISfAKYFl~.4DFLSKAIYP~tfEIYEINGPFFIGIIILStLIDGi.NWDfPRNL . .

RIASTEYYTfNAADSLIIOf!


DIEKPPKIFIIIGKfRVPTII7J15
1025 11Tt266 1175579
CPn


ELICVDNIpSNIKSALi.rACJILTtiLt=RKTSfRiQ.Y,.
ppi-dlucoss-6-P Isalasr~ss


1015 1165550 1166593
GillOatSSYRCI!lIDERKRFIDGOSTICILOELALNPLDLTAPOYLS1IERItKFSLLt~FTF
CPn '
'
'


_ A
CT857 hypothetical protein tpossibieIRJSM
IH proceinl
SrATDtI~DAILMLISL1CERGLHESIBJINOOOQWNYItsttPS~PAtJn


KNIMD~IFSrITSVRVRSKVDNIILEVitCJtLOi.CALrLFGYL1IVPEHIVRVNKSAI71LDSSIZGCACDIAVPSR
VGORLI~TLTKYR80FZTZVOICIOIDSdIJ3PKJ1LYRALRAYCP


ANOfLHWLVCTSNIPNADHttILVEEIAdISOVIFFLF&ANAIVCLIDAIDCGFSYIVItICR'10001V1trISNIDP
0~.A6YLDTIDU1KAT.WWSKSCTIIETAVNWrA0F1'AIO~rLSF


IQSRTLLL.WALiGLSrFLSAALCNLTSIIIIISISxRLVItARRRRLLT.G71ICVI11VNK7CKDHFIJ1V1CECSP
NOD'fQIttLMIQaiESIGORrSSTBNVOCWTGIAYfiI6YfL.OLLpO
QPNARC~1L.RGSALISIfsIRNFLJCIfPTGVIPYSS0T.IlFPJIIGQOfGI!ICS
ASAImOIAT


AM'PLGWT'l~fiItdlIIITSWGIIPALIVPSLVCVLVJIfIFCbprTLRKRGSTLLVmVL.

~IISFfOCLNOGTDIIPVeIICFIfDIS
ARVOrBTSPVIWDEPDTNC


LpSAPPKSWIIfIGLGSLLIVPVWKACLCLPPFNCi7ILLGLCLVtd.TSDWIHBY,

NDKSIAGDG


HLRVPNILTKIDISSITFrICILL71VN11LSFANLLTL7FSIi5~CIFSRHVVAIIfICLLSSFt3Q1'fSSOKLl7W
fIA0AIALiICGSEN1NPNKNrDGNRPSSVLVSipIIIPYfTGdLBYY


VLONVPLVAA1llOfYTLPL17DTLWlQ.IAYAJIGTCCSILIIGSAA6VAFlx.LEKVDrISIYFINKIVFODL.C1V
GINSfD01"~1SI~KKAL~VI'>rfL'DGADASNrPfJIASLT.TLPI~IFR


KRISI~tIALASYlOGt?sYFVLESIIIFrI
CPtI'1026 1175961 1179177


CPtt_1016 1167027 1165595 1cW
CSfGFCKICI~IFIAVRSRDFIJ'IHCIL71AR70GfQVVKSThGUtVFYSLVS


Ct551 hypothetical prouin


KREVE10DODKLG7IINrGLLFTSSVAGFSKIX.'flmllAYDOIlIffIOILISLII7fAPLPNKIi.L

7 1179172 1180735


lGi~.SpQ'IOpARLpI,YLECI~TINYCpKVLSNYVRgLNDYNAGLTrYRTCSAYIPYVLKCPn..102



LSEDGHVF1IVDVQTSOCDIYtGDEILEVDCI~IRGIESLRFGRCSATDYSAAVRSLTSA.NO robust honolo0
present
1n Genelank/t!>aL as o! 11/7/95


SAAfCDAVPSGLA!!<r.KLRRPSDLIRS'fPVRWRYTPatfIGOFSLVAPLIPDIKPQLPIbSCNNIOSVSSPPLSPf
IIVrtrtDIVPSS~S~.IQPMVLKZSILIrIILVTILGIVLWLSiAIG


VLFRSCVNSDSSSSSLFSSYNVPYTWCELRVpNKpRfDS~RDiIGSRNGPLPI'FGPILiICOALPSWLTYSIiCIAIA
VGLIGLGILVTRLILSTIRKVDitICYDAAVICEEpYLSItIRQJS


DKGPYRSYIFKAIf~'aNPNRIGFLRISSYVNTDLECLCLDiIKDSIhIEL.PCEIIDNLCKSDIR6IRdWMV~0'WIL
SEE~SfIMJtDPEYLlK7!lIBRLIAELEIEfi011LVAQiILLKON
'


TDALIIDpiIDIPGfiSVrYLYSLtSM.TDHPLL7lrKHRHIP1'pDEVSSALtR9pDLLEW1TORVLYPIt

N118LSRL1FRJ1YKOKFPTGALCPYRIEDIJUCI1~QILFLICP6CIAMVKSLPGLZ


DEOAVAVLGI:'1llCliYCf104D1AV11SLQNFSQSVLSSWVSGDINL.SKPNPLLGFAQVRP1IPKCFOSLVHRFA
PRSRITQTPKYEYNSRN~1EDDKVMVCARLIIKEFl811ViGiICSY~Xi


HOY1'KPGF!B.IDEDDfSCGI)L1PAIWfDNGMTLICKPIAGAOGFVIpVTFPMISDIKGLICEMVALKITLPLPGVY
DfLVOLrPNLLTACSWKDICI~fSYPIfLRPYL.SVDIICKRLI


SLTGSLAVWtDGEfIENL.cVAPHIDLG1TSRDLQTSRPTDYVGVKTIVLTSISCIAtOfSYQLrCEICLKLFTICSPL
DpAWRLISYYRNHIPAVLtIBTCLPPPE'IbGSVFVti.PRT~Elt


EEtn'SPO~fPEVIRVSYPTTTSAS
LLW$QIEVLATRYLKD?FVRNS6WlGSFBI~IftSYNEHCKEISIxItIltrAmYCI'IIHSLEP


PPSPL6EECEFLPPCSEEEYSVLPAPDLOVDSHWVWNPPVPIOCPL


CPn_1017 1168997 1169975
CPt~1028 1150995 1151999


lyre-Necalloprouaae indhC-Nalats Dehyrogsnase
VIINR!(LILCNPItGFC9GWMI0vVCVaLI~PIYVKIiEIVIWRNWNALMKG11IF



'JEELVDVPEGERVIYSIWGIPPSVRAGKARICLIDIt7JITCGLVTIfVtISAAKLYASKGYKIFFLKOVRNAfKLWR
VAVi'C6>CGpIAYNFLFALAHGIriIFGVDRGVDLRIYD11PC?CRJILS


tLICNKKNVEViCIVGEVPEHLTWEILYaWEALPFSSOTPLFYITQ'ITLSLDDVOEISSGVRH6LDDGAYPLLNRLRV
TTSWDAFDGIDAAFLIGAVPRCPGNERGOLLII'QIpOIFSL


ALt.KRYPSILTLPSSSICYATTNRpIUILRSVLSRVNWYWGDVNSSNSNRLREVALRRGGGMLXIMKRDAKLFWCNPV
HfNCWIAMKIIAPRLNRKNFfHIMLKLO0N~M8M.IWM


VPIIDLINNPEDLI1T'NIVNNSGDIAMfACASTPEtiWQJICIRKISSLIPGLpVl3iDLrAVEEVPLCCVSRWINON
HSAKOVPDtTQJIRISCKPMEVICINtONLFalILVNSVGMICSJ1VT


DWF~pLPKELRCS
GRCKSSAASASRAIJ1EMRSIFCPKSDEWFSSCVCSDHNPYGIPEIM.IFGrPCNIGPSC


DYETIPGLPWEPFINNKIpISLDEIAQEKASVSSL


CPn_L019 11698~5 117062


No rotwac honwloq present tn Gsntloank/EMBLCPn_lOZ9 1151987 118:511
as of 11/7/98


RMSYENYOKNSWLRS:.CLL\KFFSRLLYRVPF3FR0DIYLfSSLYLKYPRLfFYDLGKYNo robust troeoloq
present
in Genebenk/EMSL ae of 11~7f98


VYSLRHCPYAKLCRLIx3A.f:.LKECRJVYCETPWS4'L1KICQAFDITSCDILYDt.CCCLGKVRVFVTSTMLWGVS
NRpSfDBLSONJIrKIIIrNKORFCFIrCSLCCFGFVFALrLKLGSRLA


~.FWFSNWRCOVIGIDNDPHFIRFSSNMRKLSSGFALFDTEEPKNWLSOASYVYFYGSPEISLSTLGtIiAffGIrSVI
CASAIIVpFLtJIKCSOCETSKt.CCAIKN'lliiSSLJftSLLVS


SFSRRLWEILLKISEMAPC~IVTSISFPLDSFSRGXECFFTfNSCSVRfPWGKTIAYKNMPFrTANVAWTVAMLSSFLG
SLPYMrKLrHTVLIFIPYLSAT11LILLFLaTSfSGLFFCI


tRKCS
PVWQIOE.iIDYRNLLCFRf3JILRpfTIVVIALVDLAICfWLALDSPYIIIrHLVELADIH


T4ISfLApIIFVLIVPIALILTPAVSFFFNfSFSFYLAKpEECKALVK


~:Fn_101~ 117:116 117Dti)~


,:THi.n hypncnacLCal protein CPe~t07D 1151901 1192913


tHRPNtMTVrYOShTPPPOCEFDIFVDCNATEEAY/MEVQVALPACFa?YAL1LMTSELprsvllcteA D-nstino
ecid dehyr,Jqeneae


'.FGtL'f9.~.ECAL'LVALPPKEKPIQEEpFLVKNDIWF3f3LPNLKPfI(>>CQ'!SL?SHRNPFKVNFNRIAVI/
:.1G:YAC44V74MLLLH.ipCfATLDLFDPIPLGIfL'rA.4~1S3i.LIJfAITGK


LAOQ:7f::::N.,~1't:Kn\7fET't$SeFPFfSCK.\PECD$.~.'IDKTFTV~PKTQEr'rOGSAuOKAL.IfPP
t.ADCI:INATHALITFJ1..''K1LLNVPIVT:ar~ILRPAIDEDO.WLITERVEEFPKE1/


::()AOF71VR::Y::::rTIKEtI:.AKEKV:~bTtt.~.AE717KH'frlrK::DATL::PIISLY~TLMKEVPQFI
rWEKAfd:FIIPSNVTPPNLCALFIK.7:YrIdRLDLYItf:LAOAfNKU:71.'fYDELIEDL


.\L::::PX:~>UKttFFJIHDLRQcIDCYEC'fOECEE'tKILKTfMK'7YF..:L.(xJT::::tYl'/'fESITPI
ADIEEF'fDtItIVTIt:WA:aLPELKDNfVNKVYt:qLLEt.'.WPY.DI.AML;iF3TNANKY1NA


IPDPI'/FFAL::E::QI::%t~4:ItRVTNLDVLRtr'fEWYII1LK::RANOITffRLEEREIJIERENP~iKITh:
it.H\TFEIINUPEFTPDPAtAYOtINPP'/L::LFNILK(MQVIJh'1'.kilR
r&IL~


AIIEt.AA::I::RyAKY.INWU:1\'PITtGTL:AfAMI.r;EL::C:D::III:FVC'RI:d:PFKDATAKItI.rV
I::PLNRILt.W6'Iva:IL:::K4LLY1K:IT,IDIILAt3AVLRY.:.TAYIAKEFLtTI


'PFFY.t: (.:YVF'P::l.::pl::'1;.\A::KVIIF:L.:iE::
A't R A~':\E'fRK r.YFHMItVU6l.'TRT
t EEYKDNW


N::MIJ!IFt.I N I llJ'fIJIWAH::LY'; 'frr_ l4 t 1 I I N.S~i7 I l
HID4$


.ir.:DAt'Irnrnr..ICnirtinr
ArN.vfmtt.rt


.'I'n_10u Illt....t ll.l~l~.'
IKF71>lffl:R'rK::.~.KNt/.TIALNiIVV.~..~.IIrYY:IFa.fI~INIAAT14w4:AVlhbtlt.'It:Ft
i


.TN.1 hYlnlv.m,:nl Wwr.m
NFFIMITFRII~:TLRrULJtl3tl'IMY::f.EC:ft:l"fI':FTIt~:/Wl.t.'~tLF':NiY:YAYfTNDA


n'r:N::fM:::1ll:~tA:.F.YLtJkHJ1'1'ft'I/APR:J;I:::::\'IYI::Y::ITVAtritllv'K::LPK
PF1'OKIHYF'YIl!Fl\T:NP4YALUY:::ILIYI1If71FIVr.Y':III(IA::TtIIVIuTtI'KIIf1.fffti
L


fV::nJ'K::1?I'ffMIIIK'rFtIATPRERI1.RF'1:::.~.FF:xJUINr:~JAGlr::::IWNLF::91IN:IT
EA'PAFFFY.LAYYK'rDFI.IfSIIAYPKAOP::Lf:::l:::yl.Y,TfML'JTLWAYh:IFY:A\1'1S:7.YIAI
WP


:'.KAIIAnJIa:rMf*::1'h:KT::ItKAI.UKN1.::::KVF::A.'KIIFIYrIJIIyIILKLFn.TID.~.LY:
:Q~LI::W7tJA'r/4:FLiY'.I:fI'/IL.F::II.IF~:::t.l'uINJINJtItIP.'.TH:VIJ)II.V:KWr:
EVUTIV



I2~


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
':LITAVL::.'.'ItIL:.1IfII'IAEIPF;AAKN.TFPEIfRFFLBRGVLLRPL.~'LN:
fOEEDLRIIY3tIL,;dIL::;
. .EKSPSV3LYIT53VNOLJWIL f'~3


WF.~"N.WIfITiL.SIT~.VNVf.PAYLA:3AAFLFKt.,~,h.:., ...._ _.. ... .._ .....
CYPKKCISIKAPLANITCIL.L1IVY CPn 104 ;r 96i3II C~57)4I~. ,
(:IPFYIDAGKKKKNAKTFFAKKEI11GNC!'IGLL.At.TAI~
NALVLLAf ~
~


. th~
::tWLIYA~LKYLF ~B
b
'bioD-dethiobid


FLFL'IYHtIKt NRSPlTYFRANFrfIpRI
IIVOIL7IGYGRTIV~,,AtLARALNAEYWKPIQAL~.CtSDSNIV


N8L4GAYCitPGYALJIKPLSPHKAAO Lt>NVStEESHICAPK~fSN:.I
ILTSCGFLSPCTS


Pn_ 1072 l 19515 7 11955b5
KRL4CDVFSSWSCSN1LVS0lIYLC.RINHICLTVPJWRSRNWItGNVVlK3YPEDEEHNLT


CTJ7) hypotnecical protein
OEIKLPIIC'LAKEILEITKTIIS~'YAEarItEVWI'SNf10CI0t:VSfYfPSLNLM
ItIA'n'rrYr'':.AFIr~%a~F..~.DO~tPOqPf'RTFr:fD.:ALI4AKIFNPtITVPYTSVt.PKEL


:':I.....:':~ir:iL1':i.an:~.'ii.::v:..
:iW:Y'LI':.:i::.: ridln ~
. ! . : -' : ':~~'.
. .
..
...


r ,
.y ,
~ i., OtoF_:d-Oecononanoaca iynthase_-
-~!n='.'fn:..'P'. :.-IIF::.:.L,T."
. ::.'!:r.: elq:(fr...
..r i:::::


Atf.Ftr)PENAEPAKVN
pNLOQQFLIE7tLARRKSKHTYRS1SLNSHLIDFT~IDYL.GFASSPELRKIYITKLHAIES


1196187
LGAl'G$Ri.LTOHSrit~lIEiI7LAAYlWFESCLIINiGY'fAM.CLiyJILJI?OODRILJ~L
7


55
YtIGfIYDCIRLBK710SFPlliNND~tLEItRLASSHLCRTIVNESVYStJIDiVAPLOAt
CPn_1037 118


CT372 hypothetical psouin
SLLtaYSAYLIVOGNAVCV1GDOCi0;LV5AG.~hODINL~ITVIITF~.KJIfpTIKiMIIIGS
NNKKKDYSCE!'LTTtTIVDSIAFLPSEENFCYIKTILFFRVRIKHYA!'FYCEPfIISFRFLL


'ISSYAETPKCrBCHYNAYKJ1RI0KKNPESIKlSAP8ETPNIBISLISPV1NIFSILKDrLINKRPFIYTTAOPPHAL
T11IELJIYEIOiQRAPNORENLiALIIINFREKA~IiG
LSGLCAL APN


u
LOLJItDNI'TTPIOSIGVSCSadtAROAAL.0I0N9GYDVRPIVSP1Y1LQREELLRICLN
I5L0FSILPOWFYPNK71IGOT011L.EIPSWOIYlSP
T
'
t


N T104LIDViLGHTLEQIFiCNVSSL
t
p
CSHPMCOCISVSNLLTSVEKA
NGVDI1ZKIAAGTASSINDIfifR1L41NLilOLTFB
a
'
'


f
ACOGIVDFSYTLIHY 1191700 1197699
511WTLYDSP1
O
OTFPGDPLTLJ1IGOYSLYAIDGTLYDNDOYSG'FISYALlON11S7lT1fBlaSTaAYLOITPN


SEIKVOLGFpDSYNIDCTNFSIYNLTESKYNPYGY~PKPSCCDCQ7f8VLLYSTRttVPCPn,1044
t~i 'bioB-Dioein 9ynthase
G


.
AKLBOIEETVSWSLEDZREIYHTPVFCLIHKANAILRSNFIJISEtC':CYx.ISIR1GOC11lD
EONSQVTarSLNAAOHIHEKLYLFCRINCATCTALPINRSYVLCLVSENPIJ~IWA
ICFATNKVNAKAISNVNKLRRYESVtiGEATICP'DPYISLTPDFaLYIHPJILLtPEltlfl'S0


CAYCJ1QSSRYIrCHV?PCMOfIVDWEMKRAVELICATRVCI:.71AWRNAKZ7DRYP0RVL


VYGLPANLSL
IMtSITDIaAEVCCUGfC.SEEOAKKLYW1CLYAYNlB1L06SPEFY)rl'IITTRSY6DRW


118773?
IZ.WVNK$CtSl'CCOCIVDICESEEDRIKTi.NVfaTItoNtPESVPVNLL.9TPIDCTPLODO


CPh_1034 L188599
PPISliiESILRTIATARWFPRSlRIRL.JN1GRAFLTVEOOTLCFLAGANSIfYCDKii.TVEN
Predicted OMP fCT77I1 (leader (181
pePCide1


KTSWOKYKKYLSYSIWOKI1IRYVlOCIWLFFTILFSCSSFYASCRYAIVRSINEYACOILNDIOB~CIIIQi.G4IPR
PSfGIERGNPCYJWNS


YDEC4fWLILOt.DCILLOCGEaLSHBItrKSKAIOGL.OKOCTP~F~IfiF3IWPFWIEIOEH
APL 104s 1199603 119A90i
CPeI


rTWPiESAIFLLIEKIQKOCKTTTVYTERPKT111~.TLKOLHtIIINSLtDTAPOPO.-
LY,ISY;ILFSGDYNKGPCLDLFLE1CL.PLPAItIIyIDNQKDrVL.RIf~t.COKYCIAyeoni:ernd
hypothetical bacterial
Pla4t membrane protlib
'


. LLLVLSM.VLSSKLIPI'LTFNFIIPOCiLILYPLTFLI
ALLLRfIQIDiE GTLPNNI'SNRKTLVFSYLSSTI"1


FGITyKApELHPPIYFItcIIAQVQYNYSKIfLLSNHJIASDWNGIPCP10LARVNIFSAFIJINLf.7lSSIVOIIMF
fPVASPEfIp'1'J~.!'17LSPLIIFL


CPeL1075 1190081 1188570
ASLL.71FZVSOOLDTVI'YTF'F~TFNSSWLRS1E''&71iIS0IPDTP'IVO'1'GILY!'OIGIS
'


aroE-Shikimce S-t>.hyro0enase IRKFLQIPSTKI11N1YpLI!)pP
FPO'1TJLIIStYSYtYttITFCVt.TrPL.FYL11VM


WQLPIJIVPIVfiLQIWRFSNIYYGVBV!!t
CATVSOPSFCEAK00ILItSLLQ.VOIIELRLD 1016 1100675 1199590
ITA CPet


LINELDDOELHTLtTT110NPILTFRONLt~ISTU.iIIWa.YSt.AIG.EIIOBDIDVSLPI_


LOTIRKSNPKIKLILSYIiTDIOVEDLD11IYNHCaTPMZY1CIVLSPaISSEIItNYIKIGR'TtypCOPhan
NroxYlase
'


LLPKPSTVi.CNf.'I'fICLPSRVLSPLISNAIBtYJIJIGISAPQVAPCQPKLEELLSYNIfSIC.SEFONSOSLQR
AYSTPYSYYRIIL.OKENKfJDOIILA
VHYCERTLDPKIfILRIALKL.IIpSLSL!


RLSHLSiff4FLZSKLGiJi7ITYIKFPVTICEW1'FPSAIRDLPF~L.r~RH~IS
VVS1'PFPNRNWYRLLSSRFdiIIMS
KSHIYGLIGDPV~I


.
YCPRFFLDYLE11TGLLS>uLDl~7lVIKFPELETHFSYYPVBCFYJ1PNQ'IfLSI34DRYFPI
VTNPLKTAIFDHVOxLDASI1QLCESINTLVFRNOKILGYN1'DD~.IfAIC.iJ4QKNISVfIIK


HIA11GAGCiAAKAIAATi.fIMOGAfdJiITNRTLSSAAALJITLCKfRfAYPLGSL1~S1FRTIDIASVICtI'LDK
IxIFSLTPDLIHM.iLifNPWLLtIPSPSBFFItQGItLFTRVItIIVOALPB10C0


'INCLPPEVTFPWRFPPIVlIDINfKPNPSPYLERAQKNGSLIINCYGP'IEDALt4PU.WRt0!'i45NLIAIVRC1~
TVESr'L'IE~IBDRIUYQAVL'ISSPpC~rIUFIZSiVRVLPI'G'


FPDFLTPE~DSFRNYVIDiIMAKV
DOIIALPFNTSTPOETLFSIRHFDEt.VG.TSKL&MLODGLLESIPLYNOLIIYf.I3GFEVL.


CO


CPn_1036 1191190 1189954
cetLlo47 lioosr 1iD13u


arw-Dallyrowinau synehase dew-DShydroaipieounat. R.a~case
cYDescRSCIILrNwrfmsel'uTTPHVVtmISNrFQLaa.FSSISTAYPLVIrravs


vaoTaicPlt.otLlxiac.YwlvtTFPPaEPfa~ls~te:rraIaYOLVLroNISncssIICICDrasta~nssfenn
svlccscxreKVIVSALEOSSEYTIaxrsRSSALTLe~wIllL~olrrv


GTVLiIfIGFLdIATYCRCLPLYLIP1TITANVDTSICGLD~IGt1'~GII~Ri~01'FYLPKM'LOIStIPLLTXEWA
HLLISPKPLIIGTi'CiIfGtOCKSAHDSLEELTNIVWVYfrINRiLGAY


NCP~QFLSTLPREEyfIIHGiAEAIKtiGFIA~1YLWEFLNSHSKILL.FSSSOILNiFIKAI~QIIHKIIWIB.L.90
L.C~IPOFDIRIRITIBIRYWfDSLSGTAQDLi.DTIOpVI~BV00
TRISL~IIK
RDSSKKTIEVOSSR1R~I1QGIMlTIBS~OnvRNTVfERHVtCRCILSIt:<RdJCI'LX
I


S VL~CLLKK!'fDl!
KAAIVA~PYDRSLRKILNtCftsIAHAIElLAKGrVMKiOAVSVGlIIPOLYSIGDTLBL
P


TPOLIDOLDtLLKRFNLPSTLKDLpSIVPEHLLQiSLYSPENIIYT1QYDKKNL~tEfJOf.
O


INIEHI.t'rRA7IPFNGTYCASPNNEILYDILNSEOLVIRIHC
CPeLlO1 1201518 1201601


CPtI aad-ASparcate D.llydropenau
1077 1197786 1191123


,-
LIDERKC~IAVLGVDGLVGQKFVUiiIKWYRDiIIVIAEYVASNSKYCOSYOGCIt~pGIG
aroc-Chorismate smehase '


LHFSRGSRRSFLEELLATSYSRStiYLVKV~ISFGSLFSf'L'II~rESI~PSIGWIDGCPASNiIiTYRlIIIMII
PIIPJQNNRDLPIAKIEE110SDIWSFLPSSAESNEAYCLSQDIfVVR


.~.LELNESDFVPAfOtARRPCRiPGI'SSRXA4DIVOILSGV1f10GKTTt'.'l'PLSLOILNIWDSSIPEVNSOHF
OLLGOPYPGEIITSPNCCVSGITLALAPLRKFSLONVNIVTLOSAS~GY
'


PYENSERLYRPGHSQYTYEKKFGIVDPNGOGRSSMETAiCRVAAGVVAEKlWIONIITLiiLN
PCVPSLDLLANTVSHIVCSdEKIL.RBfVICtLCSSKOPLPCKLSV'i'lIIIRVwJIYOf!!


AYLSSLCSLTLPHYLKISPELIHKIHTSPFYSPLPNEKIQEILTSLtitJOSDSIGCVISFIVTFfKDVDLDEiLYSYO
EKNIfEPPNTYQLYDNPNSPOARKIQ.Sl07t>t111V11LOPITYO~


TSPIHDFLCEPLFGKVHALLASAiIISIPAA%GFEIOKGFASAOIDIGSOYTDPFVIO~tIRTIKIIiVLIHNLVRGII
AIZ'LtJISNSiYFIDYLKRENCLR


TLKSNNCOGtLGCITICVPIEGRIAFKPTSSIKRPCR'nIrKTILlL111YRTPQ1GRHDPCV
1049 1101s86 1203911
CPft


AIRAVPWFaMINLVLADLVLY0RC5KL _
lyaC-ASpas:eokinase III


1038 1192750 1192199
EOfNSKIVriCFOCI'SiJITAlNICLVCDIICKDKPSPVVVSAIIIGVTDLLV~'CSSSLJtER
CPn


_
EtYLRLtIEGKNEBIVImRJIIPFwSIIi'fSRLLPYLQNLEISDLDFARILSL.CEDISASLV
aroL-Shikiwee Kinase II


WKLELRNVM'ltt.~LPTSGKSSLGKALAKFLNLPFYDLDDLIVSNYSSALYSSSAEIYKRA11CSTROWDLGFLF~1R
SVILT~SYRRASPNL~IKAIiWtWL6LiJQPSYIIOGFIGS~


AYGDOKFSECEARILETLPPEDALISLt;Of.'rLNYEASYRAIOTRGAW!'LSVELPLIYERLCETVLLGRCGSOYSA
TLIAELARATEVRIY1'WNGIY1?IDPKVISI7JIQRIPEL8FEIlIp


t.EKRGLPERIJIEAMCTKPLSEILTERIDRMCLIADYIPPVD11VDI1SSKSS4E0ASODLITNi.ASFCA1NL.YPP
t2FPCMPAGtPtFVTSTFDFEI~TWVYAVDKSVBYEPRIKALB<.SD


LT,>t$
YOSFCSVDYTVLCCDGLEEILGILESHCIDPELNIAOMJVStOT'VL4DODI
ISOEi1QE11LVD


VLSLSSVTRLNHSVALTCFIICONLSSPKWSTITEKLRGTOGPVFClCQSSIIALSf
WJ1S


CPrt_1079 1194011 1191665 ELAEGZIEELlS4DY11KpKAIVAT


aroA-Phosphoshikimaee Vinyltrenaterase
TE7WICAC 1050 1:0)981 1201798
P CPn


O _
VCP!lILTYKVSPSSVYGNAFIPSSKSHTLRA1LWASVAEGKSIIYNYLDSdaDA-Dihydrodipicolinate
Synthase
KpHDAStKKFPOILEIVCNPLAIFPKYTLIDACNSGTVLRfM'ALACVFSKBI~1TCSS0


LORRPNAPL:.OALRNFCASFHFSSDKSVLPFTNSGPLRSAYSDVODSDSOFASJ1LJ1VACSGCKTKSYSRNVGRINH
LLTATVTPPPPNCTIDFASLERLLSFODJ1VONDWLLCSICIdiL


LAf7GPC.iF':fIEPKERPWFDLSLWWLEKLHLPYSCSOTI'YSFPGSSHPQ~'SY~'~FSSLTKKEKOALICFJH:D
LOtJIVPLFVCTSGTLLLEVLDWIHFCNOLPLSCFtIfl'l'PLYIIfP


S.\AFIAAAALW''K.it4PIRLRNLDILDtOGDKIFFSLII~fL.GASIOYtSIEEILVFPSSFSKLCGOIWFEiIVL
NAAKNPAILYNIPSRMTPLYLD'IVKAtaHHPpFtGIKDSOGS1IBBF


tX:StOMDCCLDALPILTVt.CCFADSPSNLYNARSNCDKESDRILAITBEt.QKMCACIOPTOSYKSLAPHIOLYCCD
DVFW.'EMAACCANGLtsV4SNAYIPEEAREYVLNP00pDYRSWf


HDCLLVNP!;fLYr:AVLDSHDDtiRIAMALTR1.1LYASCDSRIHNTACVRKTFPNPVQ1'WLETCRW11Y1Z'fNPI
CIK.1IL.AYKKAITHAOLRLPt
;IEDFDLENVSPAVBSNLAfrPKf.RTS


NEARIEECHONY:a'NWSTNKRKVFARBSPC VFSYS


:Pn_1040 1191876 llJ4p7) CPn_LO51 120495b L205270


Nn rabu:lc htxnoloo present in \:anebank/FJABLNo robust fwmoloq present rn
t;anelank/EfIBL
as of 1!/7/78 as of 11/7/99


RP::OSLFLRTWGPSSSFREHTVG1APLLYPRRRSPDYLFSPTGCPMST'CMtHPIHTASRFFM'PKSIOOLHLtIITt
DPVRKISPV7TKKSSFFROSLLRFLELIWNPLYCIRSTRP11CV


wFPVLKEtVd::NYW11AQWINTLSFt.ENSCaICKISASBNPTEVKEEVLKHAAEEFRHCtIYLHLATFICRCLILFL
TTLFLs7fICILHFITLPWICKEDPRILRKNK


KTQt.ikI: E?SLPDYTSKHLLGCLLTKYYLJILLDt~iTCRVLBNEYSLSCQTLK'1'AAYILV


'PlALELRA::ELYFLYHDILKF.1QSNITVK:iIILEE0CHL0EHERELKDLPtiCEELLCYACrPn_1051
1~O51D2 lZOnl6n


:PECEU'.L~:fYERLFx'WIFDPS~'TFTKF No rottusc homolal Praswnc in
r:enebank/ENOL
.rc of ll/7/9R


FF IQKMKYNSREK IK::ALR Iv.:..~.YC
ITVFRNNF :L.~.CYONI F'l::La,'YVFfaIPNS
ICR<R


In.l1 II',ei3pt IIId72" ,
:'.Fr.'PFIN:KKTEVETXEVKCKQETPP::L.FI:HMNKVAE.:FPYRkMLE:a'~'.~~Q~~IL:NLCA
':IY,


_
t:NFLD:7~NL::ftNF:.:KEllli::"fIf'fR::K::nY410t:::EPFR'ITACC.1'I
I,trM.NLa>"t:yl:a.rlnv.min..H.Nntno-I-t7x.non.\naae.rVa.R::KI.A~aYEL


Nu s tu.c t . Hn: t.: r . c:. tTITACtGCII:RLKDV::D::I IR'fRAT::::
t L:VIK:::MUTRt'L::CTYY IVt:K.W
Pt.lFFFRLTSD


Id'HRLt'II'LISIKy'r:::laYI~FLVHFt.OP::EF~tSRK'fu.~.IILF1JCR:~1V.TY.F9YLRYl'x.KV
RROLKKKFRLF.f't.'KD


YN::fa:f
IAL.W::Htl~'\:.INflY~t:LviKPNIIKdJIIKLLCfTMk7i::f~lf1):i:.Cfl.'7lxLWfIP!'f


~::,\Lp.':rl'IFIViH:fi:.\YL'iAEC:TRYGt:\t",.4M1'.NLIK:IK:IIPYITKKLCBOA~LLBIIV'L
t._UI'.n IW!r.ItH IIH.%111


tl'WFTIII;~:V.1:LV::KI.APt.1.t'Lt7LEtlFFP~DfX::.T::IF.tANY.IAVQWYNQtKIAKSHFVtY.
rmt~t::r ttrww,lnrl yr.:v>r.r
in :.at..l.mklf7lfsl. .v:: .,t
tl/7/,N


:L::NA'tlY:lI'LY:MC:t.V:P::ITtVPFIIDLFLP:::.T(MPY'fr:Kf:RLAfA011KTVFSB3NIKK:Y:I
illlf'AKIIAINIILYLTt'l~:IdI:VN:LIV.'17x.'IdWY::I.:FA::11611.W::KPNt:L:X:EP


:~\I' I'lf:l 1.1.x. ~u M :NIIffNM:t.:kLAI'IJJD'fYY::1
V::LIN:LH'/f.'~f::I:Y:NIIINiN
:LKR f LKL1KIIYr Ntl: IADRI LT:ft 1 DMIIVAfW. I:.\VItRY.TIIFIIKBIPrT
xl'fCPLFA::Ef'fD I


InItlf
1::I4:f:Iyi:YI.I'LAf:M'fKBLIII?AFV:.(1l*NYAL.LIIr:IffFTt:NPflX:~aMLrLiL::.':KDY
APF.~.I.TARE:IJIIaE7tiJlDffFYf/::LVl.~tx~r::~yrwrrKTNIxPprRlu:.7lKIr:F'


in:fl::1.aml.~:'h.Nlf:la'IN:F.FtMAl8:::1.tJ('ROfYiI:I"JIJU.ffIPAFd'IY:YF30YROII
LNEL:NK::AF


izi


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
SO


ipt3-Triosv0noleDnat~ Grass


III'nd 1307O1U 12091br; IsCRE~71RIKFReiICEJIKNTR:'.
iLa.RiliIQIKTLCL1ICE11~..n1.:;.:.~CEF:.:~'I:IA
:Pn


_
5P!'f..~LMINEVIM'AI'JR!'Bwltifl~~l~lLStilll'~?~w:.Pl4rt~"'
Nn rnoucc hoslolop pressnc m umebmk/DtBLR1ERR
as of 11/7/98


::RwIOiRFtNOVLLSPOLPPPPOHSVCSIS3P5KLRVIJIITFLYPCNttt.I~JILFLTiGIHCP~AP'IASRVKlY
ApAC4YPV11E4mlSLlYRiGKAII~nIR
SE


Pt:L:aMISPGIGIGISAIICGVIJtI".tCLLCLLVKRELPIYRPEEIPELYSLAPSECPJIiQAPLIAYEPVWAICI
CKVAEAiDVODIHNPCREWAERPSF~ITAEEISI::ICwi~KYDNAQR


WKTLApLPKEL00LD'tDZOEVFACLRKLKDSKYESRSFLNDAIfK6LRVPDt~lllLp'fLSEHOC.it7NDCLi.VC
~SLEGOSF!'EVAKNFNV


IFELROIYAOtCNDIJtFLILI;GRSLl4t'tAFSESLDGFINSKRLCYLP9CDVRG~i.KKSA


!ri!t'/IIPr~IJN:LrtHIIVAYAPDRN.iYIIIMEKAFAKALIiALEE3VYNSL'MSYRDKFLGSECpn_tOK1
i22071n 1~309a5
' '
~


.:al:i.:.::LLvIX.:::16'FY.LiI:nn~7.YIVFIirh':t1.h'I:EGT:F:::7NL4:Did~ST. .":.
. .,.,;: : :.."..,.,,:v.:
.
..
:
r
:


..n:~ ~,., :..."r.,."i,. . ..-..v..yrp,:F.Y.Nt.:.c.'EF.~HAppLY, . , ,.
...\I;.". .
. h......tt...= ... ...rntaNrw:v:.;:'.,,....~,_,,...
.., -.;,y: t"::
........~~!r--...~,;.._.~y..,...


IVROKYOQEF~CRLCIfiiiALYPCVSVSIR~IKIOETRSNL6KAYFJIItatIfRCCVRE


~p,~~pl~,ptZLS ZLKS,t~ CPt1~1065 1221110 :::0928


TAEVtI~RCILS011ESRLtIVFIOVKiMPCRIECIEKT(JWA6LPLLPTKKAtEKACSOYNSNo robust
homoloq Present m Getlebsnk;C~BL
as of 11/7J98


GC4.EKVKPYGKESLAYVTSKERLVSLD6aLRRAYTECQKRPOC~.ESEVRACREOLIWRIL:RNRRTSDPCTLfIfFS
IPEFSLPPDSCRL~IOItPKNEIILPSILtxKPIIOYLKZTSI


RCRIOEFClOGLOL ~Y Y~


y YIIEERlGIKEKIILYGTfIIVAT


OORVAAfFSIEVpEIPGPtFICPSLLDKARSLPTREOHTCPell086 1221132 1:211


No robust t>omoloQ Dreamt in GenebanklE!~L
as of 11;7198


1055 1209583 1210521
SNSWCEIGItI'VLIYAFLFIFLILCYiLCCLILVOESKSIGLCSSFCVDSCDSVI~GIISTP
CPO


,-
DILIGM'SiuCAYAFCZGCLL'SFSTNLI~KIILDAKEFLLPAAECSDTpASSISVGOES
No robust homolop presort in Gsnsbmk/00L
as of 11/7/9


CKYLYHIiSYPPPPDNSNGAf'FCLSKFRVWITFLVI.CrIILFLISG71LFLTIGI9CLSAAIS


FCLfJIGLSALGGVLWSCLtGi.LAtOtEVPCVRpEEZpixVBVApSEEPALQATOKTIaOLCP1~1067 1221675
1222292


PKEt~OL~tYIOEVYSCLGttLItDLRCEDt7GLLItORKFJG.OYIrDAMItDOf1'EIVG.OOIHdeE-
Polypspcide OatosRylase


OpELyIYLKCLIOEtOtDIGSTLFHSQVSLFKWFWIrG7fLPSGDURGERWISAR,6VIlORFIIIQVLWRDFPTEL~0
711IVQ1ItIRRLEYYCSPILRKKSSPIAEITDEIRNi.VSt>IICDZItEA


RRICDTRIfVAM'FDRN71YGVAXT11P
EYIGITKILIlDBNRGIA7W1P0YCiQiVSLFVNCVORFIEDCELIFSESPRVFINPVLSDPSETPII~ItGCL


E7CILRICYLEIRR SIPCLRCEVFRP~fIl'VTAl07Lt~KIITCNLDGE'fARI
INHLTpNLNGYLYIDLNEiPKD


pKKFKI4RLEIIIIOtJtYNI'NLZ;Xl~.VS


CPttr1056 1210182 121122


No robust hosolop presort in Gmvbank/D~LCPr>'1065 1223267 1222365
as of 11/7/9


CEDIKDNtSRVEEI~l4.RVIELPLLPIKC~Ai.EKAIYpYNSYKAKLTKVCPCFRESPAYIrnh1-
Ribonuelvase NII


TSEERtASLOp'1'LERJ1YKEYpKRFQEPSRLFSI~PPPFVKLT?SAOtdILRDOLKEKtiFIF50P0It1YFQARSN
MCTLYPSCKLYIOG


IFVSWLFRKHVSCLVSTVNVP
IYSKVAKAFPS4.KGSEEPIEFFLEPEZLNiPTlIARVDpDLRPNLGVDESCKCOFFGPLCIAAVYASNABILK


KETLEIG1KJ1PREEtYWLILEERKSKFJ(RLI1NKIEA71QORVKt%.CPPPIKE1'dfpKRKKEKLYCtKVODSK41
IJ071'KIASIdRIIRSLCVCDItIILYPiKYNELYCKIOt~KM'LLi1W11HA


YSFFIALKS
TVI~.APKPAGwP'AISDOFJN1SEYTLLIL1I.~Ga"rDITI,IOKPRAEODVWAAASIL71


RDAFVOSIOKLEEOYOVOLP10GIIfiINVKJU1GREIAKpRGKELLAKISKT1IFKTFf%ICSG


CPn_1057 1211167 1213596 K


CT356 hypocheeical Dsocein


IINFYFFNFANPEPLY1T0Q.ITnLSPYLLLYAiITPVNWYPWCAFJ1!'NIMIENKPVFISICPtI,-1069
1223507 1223911


GCKNSRNCpVlGpESYTNPE AIC.YGDt.At5ILAVSGdfQYt9A-HTN Trmseripcional Rpulaeor


ETVSIiPLNVILTPOLVPFFSVNYLONEGKLGf.PSPPOZIDKLiFl6IE011EEREALVD2ANVIIQGtINKt<.LN~
rEIFRSSRESOSLSLIG7VG71TSIRYSCLFJIIEpOCLCKLISPVYA


KVLEIASFLEGCVRKEILDESSLIGtTVAALYODIDPtMDGVKAFPKRLPGLLLOFILRYSOGFIKKYJ1TYLGLDGDS
IL00IPYVMIIFXEFSDt0~91EfILLDLESIGG'RNSPERAINSItS


IGGGVYSYTI<7DIOd.IPAFa~RLIDIIiItJ1Ai31YIdIJRi7YGLIIZOCI1MIWLL'.6GFSIF


LFAWICIGKaYRGICKpILSYILSELYSP1VCAFYSSE011DRIZ11G00ER''Y911SVEEZS
.


NAiGtDAEIPCDYYDISRECFPNGRItILNIPVNREIEELS1DCY11RSItAIEDIVDRSRDI1225523 1221114
CPr>'1070


LKGIMOR~SmSKDtlLSLTtMifBSIIYTFAYAGRLLGEVEYIEICKI~GtPVIINSLYIQIHNo /robust
homolop pswmt 1n Gmebmk/ENBL
a of 11/7/!1


YESOCGSFWLSFAEQJIpEWLBPRSEEOC RPfIJlIFPCtiJaCYYRETPPPNPOG~IPLO
ZSL


FYSVOGRDSTLLIKGSPLSOGiTIS~01LI~.LSLHLITDWDILTYJIt4IL0IA0ACPiiCREIICiCFL06tI9K~D
CACCL


Atrl7(KFS51GLLIAS(S7YPSR10iVKVLIAiGDQE~tSPVLKCLSGLFLPYLSLI>~sf10fl~1ETVODPDNPSA
OFLQOLIOOYGPZCVGNtF00GPlICI'OICIEOGEPLG~1~ESI~iOCKL


OEIfLCfVLPCYEE1CLIPKGDCTAITI7fVi.LYDpCKRFKDLELFRR7fLISLHRELLKAAOPIIL7lCESL
VSE9AL5FYPStaIIPtC


WIIQPEppPCPPTPTDELpLOCAVOGAPAPppIGWP


CPtl..1058 1217742 1211536
LSLESGYIt3PLG0ANI0IVOLIKKSLKRLVASDLATfIGPGICLSLT~pVIMNLICLL


CT355 hypoeMCieal procsin
SKGYLPLDPLNPEO'M.DPAl100PNORILRKVLV'1T111GZ<llIwRqI00GtR0itPIPIDP


EVIeQ.YpTLPGIVLVS7CCIFiL.75K;GYAAEVPVTSSGY82tLLESKEpOPSCIJ1INDRILWODD6IERDGlVDO
GGPGIPCQCLRfSiRKLPTEKItPNAWL


FKVDBENVYtALOVINKLNLLFYNSYPHLIDSFPAR80YYT11l1iPV11LiSVIt~Ft3NAD


AIUIIOtIATDPTAVNGEIECtQCR~.SPLYANFENSPNDIFNVIDR?L?AOIIVIKSSNIISKCPfIL1071
1227336 12255


vhS.KYfPGKIREYYWtLEWSRKVIwKYRVGTIKANrE5Li1S0I11DIMWtLNWI191DNo robust harolo0
prnmc in Gmtbmk/OIBL
as of 11/7/9


KDRLTALVISOGGOLYCSEEFSR>2ISELS05HKOEL~.IGYPKCt~CGLP7GWKSCYIG.YIIKC'11'IN~CPNILS
YtPRlCCNFfICEANI:ViTI'EGTTRQSASDISEE11L'wRSOGAfIPITTO/1'KI


LGDKTSCSIEPLDVNESKIKQNLFALEAE5IILKpYKDRLRIOIYGYDASNIAKIiSEGPPTlfVO0tV0PNTApGDCS
I'IISIIpF.~.VDSILSHRRZ'pCCtEYCYD81LA'i~C~ROGSP


LFSLf.
CRLICGTYKACCLDRLDNpIIAGLVItECEpTIIGPIAYAL11AK1fGLNLIIELVIKNtILStE


QI~AQtICSFaKI'OLYQINQSLSONFFLEGVNSIRERGLDDSLVOAVLffIl1?RSii~fT


CPtI,-1059 1211118 1215678
IESPlJ15G1'SSAWi9TRIPACYZ1lX11'SPLTfSRLSCGSROJIRIIPSSVCAiPOYVAKKYND


kysA-Dialachyladmosine
TranstsraseNDiiWOLGIIIW'Il~it.KTGDPSAiGPFCLLIV10~ISFLLSASOSTSSZLKH1'GGEICYTC


VTRSSPAOLSRFLSEIONKP7GLSLSQNFLVDQNIVKKIVATSEVIPOWVL6IGPCFGRI.PNFRDIWLLt4.AIGYCP
AM'DLTSWDIIMIDDPIhII'IFYRLOYSYR!'OKTSASFIJOGf


TET:LIAIIGApVIAIEKDPNFAPSLCELPIRLEIZDACILYPLDOLOEYKTLGKGRWJ1NLPPSLVROffSLDCPTPA
ESVPLNSSLEEEDE~DDEDCNIJ1YQ0RILEGSCNL.pTLFLGIK


YHITfPLLTKLFLE7IPDFiiRTIfTVNVQDEVARRZV110PfxRDYGSLTIFLOFFADIHYAFIMO


IfVSASCFYPKPOVOSAVTNNKVIIETLPLSDEEIPVFII'LTRTAt'OORRKVLAHfIJIGLYP


KEOVEpALKELGLLWYRPEVLSWDYLALFNKNOAGCPn_1072 1227921 12235


No robust homolop Dresmt in Gmvbank/E!~L
as of 11/7/98


CPn_ID60 1217691 1215727
KKDYILIIANWCCWKONLKIOKKRNCVSWITYCJ1IVCFFNSADAApKKIDCIPIOILYSFT


~cs/tkt-Transkecolase
KYSSYIJG~ICDASTIFC11DVORGLt.OtIRYLCSPCWOETRRRQLFKSLCdOSYGNO1LCEET


YXRILYIHITKVIfI'SSSCPLLOLILSPADLItKLSISOLPCLAEEIRYRIISVLSCIOCNLLAIDIFNNKDCLC&EI
PZONE71ILJWSSALVLGISSFCITGIPATLHSLLRt~M.SFpKRS


SSMGIVELTIALNYVFSSPKDKFIFDI~pTYPHKLLTGRNNBDFDNIRNDNGLSGF'1'NIASESFLLKIOSAPSDASV
FYKGVLFRCE'1'AIVDALSpLFAOLDLSPIGCIIFL..'EDPIW


PTESDNDLFFSGNIVLTALSW.GIUIQITPLESATItVIPII~GDAAFSCGLTLEAIl~NISTDOAVCSACIGWCi9Q(
FIGLVYYPJ10ESLPSYVNPYSTATELOEAOGLQVISDLYAOLTIiJAL


LSKFWILN01~4~RISISKNWGANSRIFSRNLIIHPA'fNIG.TKOVFJWLAKIPRYC06LNilISPKNN


RRISpCVKNLPCP'fPLFEGFGLAYVCPIDGHNVKIfLIPiLOSVRNLPFPILVIIVCI'tIOCK


~LDpwOMdPAKYtIGYRANFNKRfSIUtHLpAIKPKPSFPDIFGOTtGELCEVSSRLNWTPCPn_107) t=9011
1329832


1NSIGSRLECPKQKFPERFFDVGIAEGNAVTFSAGIAIfMR~IPVICSIYSTFIJiRALDNVFPredicted OMP
IC'I'37l


HDUC7pDLPVtFAIDRACWYGDCRSNtIGIYDNSFLMIIPQNIICOPRSQWFa0LLY5SMRRYLIMVGAL(:LYRAAPL
F~1WIKITDJWJ1VLKFAREKTLVCFNIED'1WFPKONNCOS


LNYISSP3AIRYPNIPAPNCDPLTGDPNFLRSPCHAETLSt3CEDVLIIALCTLCFTAGSIKAWLYNRELDLKTTISEE
pAREpAfLEWNGISFLVDYELV:ANLRNJLTCLSLKRSWVLCI


H0tL1YCI3ATlh'DPIFIKPFONDLFSLLLIiSNSKVITIEEIiSIRCCLIS'EFNNFVATFNSORPVIILIKM'LRI
LRSFNIOFTSCPAICEDCwtSNPTKDTfFDpAMAtFJWILPVGSLK


FKVDIWFAIPDTFLSHCSKE11LTKSIGLDES9!lINRILTHFNFRSKKQ111GDVItVNCOPNDAALEYLLSGIaSPP
SOIIYV000AERLRSIGAF~_KKANIYFICNLtf'fPAKpRVf


:.'YNPKLTAIpWSOIRKNLSDEYYESLLSYVKSK


CPn_IOGI 1217932 1217666


C"330 hypothetical Drocein ' '


FI:SIINEIHNKDPSLKKLFAi.ppSLFfL.NSLSDIVATYEAMFSLIYECLNKALRKDQt.CY'


LIa'1ltdSK.fLLKSPSCDPIVQTFPINPNN ' RNA SECTION


;:Pn_lUi.2 12191135 1.1815) . . . . . . . . . . . . . . . .
. . . .


><aw1-fxaJoxYriOOnuclease VII


Ix:FPM::.~.PCOIIVA:iLTERIKTLLESNtL'OILVK~EL::NV~LOP:X:HL1FCIKDSOAFWcmlsHA I
\N.t'r, 1 ~ND7A


:AFFtIFK::KYYDf:KPKDf:OAV I I14:KLAVYAPRI:QYq
I VN IALVYArExDLL4KFEETKR


Id.TAFxYFXrl7:KKPLPFAPQCICViT:a"n;AVIpDILRVLRilxam.~l...v::. f NN:\ n.n71.11
:RRARNYKILYIPVIIroCN ,4'L.4o


::AAIIEI::KAIEVtfUIfNLIDVLI T.1R(3CC;:IEDLWAFNEEILVKAItIA:'rIPiVSA11G71E


'rDYTG.'hvA::WfNP'rP::AAAEtVt'.KC:EEf~)VFFY:'ILRHLL::II:;ROLLTCKK0f3LLPW1~..:
rICNA lUrun.r.d Itlll_ll.:


I!I! vLDfIAEFYTTIIt~(~LOa IE LA10Kl3V(~CK
I IIE.,Kr,INYDN I::RWLtX:DLYwPMlCRLOS


LKKNL:rJAL:YIKAI::WVRr:IIQLKK::LT1PR0It~11.:OKL::ISi~LDTLt~RRLIIYOKE.:s:: rNNA
Ino:4l'. Iml'..:'!Ir


!:YF11KIIT1LKl W IN1II.EWLIt::IIVQKf.ELLCRNL::MX.'EIIJt4>NVK
IA1:WYKETLATI L


h:NNYllI::/ARY::ALKEWJI::WPKNVLKRUyANLFDFtif:Pt::AHL::VO::I.QeWIVRi9L0..: tNNA
114~'.t: Inn..ur


I r:Ell LTIrt'H f R Ir:KL IKI:
~'Ar_t4a; t~l'~'N1U 1~~U71:
122


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/Zb923
' tlldAi
. . . . . . . . . ~ . . . . . . . . .
CMUI 1 6aqln Erxt Type Codon
t 99657 89728 Thr CCl'
2 90o9N 91070 TrD t:~:A
w:c
~~~~~- ~M~r
2ri075 294117 Val TI1C
6. 296151 296111 Asp GTC
7 109818 109921 Pro T0G
8 167111 162211 ArQ CCr
9 671=36 67231A Lw GJI
677161 677337 7tp TtC
11 739103 739186 Leu G1G
12 781610 781110 Gly TCC
13 781~7? 781196 Glu T'1C
11 781912 781991 Lys T.T
836119 836191 Ala OOC
16 813926 813999 Pro ODG
17 877400 877473 Acq 11CC
18 10~3605 1085676 Cln T'1C
19 1112031 1112118 Ser TCA
1175163 1175911 Iwu TJ10
21 1230028 1229912 Ser C'aA
22 113?162 1137389 Val G11C
23 1030603 1D30533 Cys OC11
21 1000072 999919 Mls GTa
961607 961536 Gly GCC
26 A07113 807311 Arp TCT
27 7es7eo 7es7oa Thr car
se 716971 71se99 Leu T1N
29 70AN1 708351 Bar OLT
68D~59 680178 Leu 6710
31 671115 631373 Phe G7N1
32 626987 626901 Her OGiI
33 293177 293105 Thr 'rC1'
34 293399 293317 Tyr CrA
269112 269070 Ala TGC
36 269065 268992 Ile C11T
37 161389 161318 Asn GTl'
38 87522 87150 llet GT
51
123


CA 02350775 2001-05-11
WO 00/27994 PCTNS99126923
Contig463
Length: 273254..
1 ATTGTTCCTG TAAGAACACT TCCAAAGCGC ATTTAATCAT TTTTAGTAAA
51 AAATAAAAAT ATACTTTTAA ATGTTGAGAA AATTTTTAGC TAAACTTTAT
101 AAAGGGTTGT TGGTGAAACC TTTGGGTTAC TCCTCAGAAC GACTTTGTGA
151 TTCTATAGTA TTAAAAGGAT CTTGGAGTAT AACAAGTAAA GATCTTTGAG
201 GATAGCGTAG GGCCGTATTT TGAATAGCGT CCAATAAAGC GCGTTTGCAA
251 AACGCTTGAG TTTGGTTGTC CCAATAGAAA GTGCCTTCTT TAGGAAGAAT
301 CTCTTCTGGA GGCACTTCAT AGACCGAAGT AAAGAGAGGA AGAGCAACGA
351 TTGCTGCATG ACTTTCTATA GCTGCTTTAA GGCAGTTCTC GTACGCTAGT
401 AAAGCTTGGC GATAATATTC TTGCTGATTT GGTAACTCTT CAGATTTAGG
451 GCCGCATACG TGGCCTAAAA AGGTCGGAAG AATACTTTTC TTTTCTGCAG
501 AGCTTAAATT TAGATTAAAC GTTTGATCTA GAGCTTCGTT TGGAAGTTTT
551 ACTACTCTCA CTTCGGTAGG GGAAAAGGGG TCTTCTCCTT TTGCGGGACC
601 CCCTTCGCGT TGCTTGCATG TATCCCACAC GCTTTTATCT TTTAGGGTGG
651 AGTAAAGGAT AGTAGAGAGG TTCGTTGCAG TGTTGTCGAT CAGATTCGTT
701 GGGCCTACGG GATTAAAGAT GATCCCTGTG GATTGATTTT TTTCGATCAC
751 TCTAAGTCCA GTTAAGAAAG TAGGCTGAAA TGGTTGAGAC GCATCTGTTT
8C1 GTATCGCTAC CTTGAACTTA GGGTTCAGGT GATTATTGTA AAATTGCATC
85i TCGTTTGAGT AGCAGTCTAC GTTTTTTTCT TGCCACGCTT TTCCCAAAGG
901 CTTGAAGTTT TGCTCTAGAA CTTTCTGCCA GTTAGAAGAT ACCTTTGAGG
951 TCATTTGGTG GTAGACTAAG AAGGTTACAA CTGAGAAGAG GGCCGTGGTA
1001 ATGAGAAGAG CCAAAAATAC AGGGTTCCCT AATACTATCG TTAAAGAGAT
1051 TCCAGCCACC AAAGCTCCTA AAGCTAAAGA AGCTAGGATT GCA.T~GAGTGG
1101 ATATTTTTGC TATGGTAAAC TGTTTTTTAG GAGCAATTTC TTTATCCCGA
1151 GGCACATAGG ATAGTACAGA AACTTGAGAG CTCTCAGTAC GTGAGGGTCC
1201 TGACATAACA TTTTTTTTGT AAAATACTTT CTATAATTTT AACATATTTG
1251 TGTTTATCGA TCCGAGAAAA TTGGAGAGTG AGAGCGCATG TCTTGCAATT

CA 02350775 2001-05-11
WO 00/27994 PGTNS99/26923
1301 TAGAATGATC GGGGACGACA TCTAGAGCTA TGTAGACATT GCGTGCGTAG
1351 TGGGAGCAAA TATAGCGAGA TATAAAGTAT AAGGGAATTG CTGTTAGGAA
1401 GATAAAGGAG CACAAAGGGT GGATACATAG CCCAATAGCT ATGGTGGTAG
1451 CAATCAGAGC TATCCAGACG AGTGCAATCG CAATAGTAAC GAAGAGGGCA
1501 AGCTTGAAAT TATAGCGAGG ACGAGTAGCT GGGGGAAATA GAGAGGGAGC
1551 CGTTCCATCA AAACCGGGAG TAGCTGAAGA AGCCATAAAC TATTAAAAAT
1601 TAAGTTTTTT TCGGAGCATA AAGCATTTTA AAGTAGTGGG GTCTTTTTTG
1651 TCACGGAGAT GTCCTGGACT TCCCAAGCGT TTCTAACAAA GATACCTGCT
1701 TTTGAGAGGA GAACTTTTGA AACTCCTGCA AGGTCATCCT TCCTTGGCAC
1751 CAGTAGGTTT TTTCAGGAAA TCGCGGAAAG ATTTTGGCGA AAGCTCTTAC
1801 AGTTGAAGGG CTTGTGAAGA TAATTTTTTT GTATTTAGAT AAAATATTTT
1851 TTTTAAGTTT TCGCGGCTTC ACTGTGTAGT GAGGGTAAGA GAAAAAAGTA
1901 AATCGATTGT AAAGAAATTC TCTGATCACA GGTCTTGCGA GGGAGGAGTG
1951 GGGGTAGAGA ATGCGGGCTG AAGAGGGCAG TGCCTGTAGC AATGGGAAGA
2001 TGCCTTCAGC GATTTCTTGA GTTGCTACTA CGTACTTCAC TTGTCCAAGG
2051 AAAGAGAGAA GTCTTTCTTT GGTGGACTCT CCTATACAGA GGTAGGTCTT
2101 TGTTTTTAGA GTGGCCTTAG AAAGAAGAGA AGTCATTCTG GAAAGGAATA
2151 GGTGAGTGGA TGAGGGACTT GTGAGAATCA CATGGGTTGC TTGTGGAAGG
2201 AATTGAAGAG CACGCTTATT TTGTGGAGTG CTTTTTGCAT AGGGGAAGAG
2251 AGTTAGAATA GGCAAATAAT GAGCTTGGTA TTTACGAGCG GTTTTTTGAT
2301 TCAATCCTAA GTAGAGGGTC ATGAGTCTTT TCGGGTAAAA GGAAGGCTGC
2351 CTAAGTTTTT GTACCTTCAA AGGGATATAT TGAAAATAAT TTTTCTTTTT
2401 CCCTTGGTTC TTCTTGATCA TGCGTTGATT GACATTTTTC ACTTTGAAGG
2451 CTAGGCTGGT TTTTTCTGGA CTTAGAGGTT CTCTCTATTA AGGCTTCGTC
2501 TTTAGAAGTC CTrt'GCTAAAA GTTTTTGAGA AATTTAAGAA ATTCGCAATA
2551 GTGGAAATAT TTACAAAGGT GGTTGCGGTG GTTTCGTTGT TGCATAAGTT
2601 TTTAGAAAAT GCTTCGGGGA AAAAGGGACA AAGTTTAGCT TCGACAGCGT
2


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/Z6923
2651 ATTTAGCAGC TCTTGACCAT CTCTTAAATG CGTTTCCTTC CATTGGGGAG
2701 AGAATCATTG ATGAGTTGAA GAGCCAGCGT TCCCATTTAA AGATGATTGC
2751 TTCTGAAAAC TATTCTTCAC TTTCAGTGCA GTTGGCTATG GGGAACTTGC
2801 TCACAGATAA GTATTGTGAA GGAAGTCCCT TTAAGCGTTT CTATTCCTGT
2851 TGTGAAAATG TAGATGCTAT TGAGTGGGAG TGTGTAGAGA CAGCGAAAGA
2901 ACTTTTTGCT GCGGATTGCG CTTGTGTTCA GCCTCATTCT GGGGCTGATG
2951 CTAATTTACT GGCAGTAATG GCCATTCTCA CGCACAAAGT CCAAGGCCCA
3001 GCTGTCAGTA AGTTAGGTTA TAAAACTGTA AACGAATTAA CAGAAGAAGA
3051 ATACACTCTA CTTAAGGCTG AAATGTCTTC TTGTGTTTGC TTAGGACCTT
3101 CATTAAATTC TGGAGGCCAT TTGACCCATG GGAACGTACG TTTAAATGTG
3151 ATGTCTAAGC TTATGCGTTG CTTCCCCTAT GATGTCAATC CGGATACGGA
3201 GTGTTTTGAT TATGCAGAGA TCTCCCGGTT AGCTAAGGAG TATAAACCTA
3251 AGGTACTGAT CGCAGGATAT TCTTCCTATT CTCGAAGATT AAACTTTGCA
3301 GTTTTAAAAC AGATTGCAGA GGATTGTGGA TCTGTCTTGT GGGTAGATAT
3351 GGCGCATTTT GCAGGCCTAG TTGCTGGGGG AGTGTTTGTT GATGAAGAAA
3401 ATCCTATTCC TTATGCAGAT ATAGTGACAA CAACAACGCA TAAGACATTA
3451 CGCGGTCCTC GCGGGGGATT AGTTTTGGCA ACTCGAGAGT ATGAAAGCAC
3501 TCTCAATAAG GCGTGTCCTT TGATGATGGG AGGTCCTCTA CCTCACGTGA
3551 TAGCTGCTAA AACAGTGGCT TTGAAGGAAG CTCTCTCTGT GGATTTCAAG
3601 AAATACGCTC ATCAGGTTGT AAATAATGCT CGTCGATTAG CAGAGAGATT
3651 TTTAAGTCAT GGGCTACGTC TTTTGACGGG AGGAACAGAC AACCACATGA
3701 TGGTGATTGA TTTAGGTTCT TTGGGCATTT CTGGAAAAAT TGCTGAAGAT
3751 ATCTTGAGTT 'CCGTAGGAAT TGCTGTGAAT CGGAATTCAT TACCTTCAGA
3801 TGCTATTGGT AAGTGGGACA CTTCAGGTAT ACGTTTAGGA ACCCCTGCAC
3851 TAACGACTTT GGGTATGGGT ATCGATGAAA TGGAAGAAGT TGCAGATATT
3901 ATTGTGAAAG TATTGCGAAA TATTCGTTTA AGTTGCCATG TTGAAGGGAG
3951 TTCTAAGAAA AATAAAGGGG AACTTCCTGA AGCCATAGCG CAGGAAGCTA
3


CA 02350775 2001-05-11
WO OO/Z7994 PCTNS99/26923
4001 GAGATCGTGT TCGCAACTTG TTGCTGCGTT TCCCGCTCTA CCCTGAAATT
4051 GATTTAGAAG CTTTAGTTTA GTTAGGAGAG ACATTATTTT ATGGCAGACG
4101 GGGAAGTTCA TAAATTACGT GATATTATAG AAAAAGAGTT ATTGGAAGCG
4151 CGCAGAGTAT TTTTCTCAGA GCCTGTAACA GAGAAAAGTG CTTCCGATGC
4201 AATTAAAAAG CTTTGGTATT TGGAATTAAA AGATCCTGGA AAGCCTATAG
4251 TTTTTGTGAT CAATAGTCCT GGGGGATCTG TGGACGCAGG TTTTGCTGTT
4301 TGGGATCAAA TTAAAATGTT AACCTCACCC GTCACTACTG TTGTGACAGG
4351 GTTGGCAGCT TCTATGGGCT CGGTATTGAG TTTATGTGCA GCTCCTGGAA
4401 GGAGATTTGC AACTCCTCAT TCTAGAATTA TGATTCATCA ACCTTCAATA
4451 GGTGGACCGA TTACCGGTCA GGCAACCGAT TTAGACATTC ATGCGAGAGA
4501 GATTTTAAAA ACAAAAGCTC GCATTATAGA TGTCTATGTA GAGGCGACAA
4551 ATCAACCTCG AGATATCATA GAAAAGGCTA TCGATAGAGA TATGTGGATG
4601 ACAGCCAACG AAGCTAAGGA TTTTGGTTTA TTGGATGGCA TTTTATTCTC
4651 CTTCAACGAT CTCTAAATAT TTTATCTATT CTGGAGCAGG AAATCGTTTC
4701 CTTCTTGGTG AAACACTTCC TGAGGTTGAA GATGTTCGGT TCTTATGCCA
4751 AGAGACGAGG GTTGATGGTT TTTTATATTT AAAGCCCTCT TCTTGTGCTG
4801 ATGCGCAACT CATTATTTTT AATTCCGATG GATCACGTCC AACGATGTGT
4851 GGTAACGGCT TGCGTTGTGC GATTGCTCAC TTAGCTTCTC AGAAGGGAAA
4901 ATCGGACATC TCTGTATCTA CGGATAGTGG TCTATATTCA GGATATTTTT
4951 ATTCTTGGGA TCGTGTGCTT GTAGATATGA CTCTCGCAGA TTGGAGAGCT
5001 TCTGTTCATC GATTGGAGTC GCGTCCTGAT CCTCTTCCCA AAGAGGTCGT
5051 TTGTATCCAT ACGGGAGTGC CTCATGCTGT CGTAATTCTT CCTGAGATTT
5101 CTACTTTAGA TCTTTCTATC TTAGGTCCTT TTCTTCGCTA TCATCAGACC
5151 TTCTCTCCAG ATGGGGTGAA TGTCAATTTT GTTCAGATAC TGGGACATTG
5201 CCAGTTGCGC GTTCGTACTT ACGAACGTGG AGTCGAAGGG GAAACTGCAG
5251 CTTGTGGAAC AGGGGCTCTA GCTTCTGCTC TTGTTGTGTC AAACTCCTAT
5301 GGATGGAAGG AGTCGATCCA AATCCATACT TGGGGTGGAG AGCTTATGAC
4

CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
5351 TGTGAGTCAA AATAGGGGAC GGGTATATCT TCAGGGCTCT GTAACTAGAG
5401 ATTTATAATT AGATGTGATT TTTGATTTTG TCATGCAAGG ATTTTAAAAT
5451 CTTGTTTAGG GATAGATCTT GCTCTCTAAC TGGGATTTTT CTATAATCGT
5501 AATTTATGAT GACGTATCCT GTACCACAAA ACCCACTTCT TTTAAGAATC
5551 CTTCGTCTTA TGGATGCATT CTCTAAGTCT GACGATGAGA GGGACTTTTA
5601 TTTAGATCGT GTTGAAGGGT TTATTCTCTA CATAGATTTA GATAAAGACC
5651 AAGAGGATCT AAATAAGATT TACCAAGAAT TAGAAGAGAA TGCCGAGCGG
5701 TATTGTTTGA TTCCGAAGTT GACGTTTTAT GAAGTAAAAA AAATCATGGA
5751 AACGTTTATC AATGAAAAGA TTTATGATAT CGATACCAAA GAAAAGTTCC
5801 TTGAGATTTT GCAATCCAAG AATGCCCGTG AGCAGTTTTT AGAGTTTATT
5851 TATGATCACG AGGCAGAGTT AGAAAAGTGG CAGCAATTTT ATGTAGAGCG
5901 TTCTCGAATT CGAATTATAG AATGGCTTCG CAATAATAAG TTCCATTTTG
5951 TCTTTGAAGA AGATCTAGAT TTCACAAAGA ATGTTTTGGA ACAGTTGAAA
6001 ATACATTTGT TTGATGCCAA GGTGGGGAAA GAAATCACTC AAGCGCGTCA
6051 GTTGTTGTCG AACAAAGCTA AGATTTACTA TTCCAATGAA GCATTAAACC
6101 CTCGTCCGAA ACGAGGCCGT CCTCCGAAGC AATCTGCTAA GGTAGAAACA
6151 GAAACAACAA TTTCGAGTGA TATTTATACA AAAGTCCCTC AGGCTGCTCG
6201 TCGTTTCCTT TTCTTACCCG AGATTACTTC ACCCTCTTCA ATTACTTTCT
6251 CAGAAAAATT TGATACGGAA GAAGAATTTC TTGCTAACTT GCGCGGTTCG
6301 ACTCGTGTTG AAGACCAGCT GAATCTTACC AATCTTTCAG AGAGGTTTGC
6351 TTCTCTTAAA GAGCTTTCGG CTAAGCTTGG TTACGACTCT CTTTCTACTG
6401 GAGATTTCTT TGGTGATGAT GATGAGAAAG TGGTCACTAA GACGAAGGGG
6451 AGCAAGCGAG GCCGCAAAAA ATCTTCTTAA TCTTCTATTT TGTGAAGTAG
6501 TTTATTTTTA GACGCTGTTC TTATTGCTTC TTTACATGAT CTTATTACAA
6551 ATCTTTCTTA TTTCTATTTA TTGTTTTGTT AAAATTTTAA CAATAGCTAT
6601 TTATTATTAG TCATTTTTTT AATTAAAAAA CTGTTAAAAT TTTTAAAGCT
6651 AATTTAAGAA ACAGTGAATA GTTCATCATG TCATCACTAC TGAGCTGCGG
5


CA 02350775 2001-05-11
WO 00/Z7994 PCTNS99/26923
6701 AAGAATAGAG CCGACTCGGG TTACCTGTAG CTTAAAGACG TATCTTGAGG
6751 ATACGAGTCA GAATCAGTTG AGCACACGTC TAGTTCGGGC AAGTGTCATC
6801 TTTTTATGCG CATTGTTGAT CATTTTGGTT TGTGTGGCCC TCTCTAGTTT
6851 GATTCCAAGC ATTATGGCCT TGGCGACCTC TTTTACGGTA ATGGGGTTAA
6901 TTCTTTTTGT GATGTCACTT CTTGGTGACG TTGCAATTAT AAGTTATCTT
6951 ACTTATAGCA CTGTTACGAG TTACCGGCAA AATAAGAGAG CTTTTGAGAT
7001 TCACAAGCCC GCTCGCTCCG TTTACTACGA GGGGGTCCGC CATTGGGATT
7051 TAGGACGATC ATCTTTAGGC ACAGGCGAGA TTCCTATAGT AAGGACGTTA
7101 TTCTCTCCAT TTCAGAACCA TGGTCTTAAC CATGCCTTAG CTGCTAAAAT
7151 TTTCCTATTT ATGGAGCATT TCAGCCCTGA GCCACCGAAC GAGCCTTTGG
7201 TGGATTGGGC CTGTTTGATT CGGGATTTTA GGCCTCACGT CAGTTCTTTG
7251 TGCTTTGTTA TTGAAAAACA AGGGTCATCG CTGAGGACTA AGGAAGGCAA
7301 TACGATTTGT GAGGCTTTCC GCTCTGATTA CGACGCCCAT TTTGCTATGG
7351 TAGATTGCTA CCGGTTGATC CACTCTAAGT TGATTATAGA GAAAATGGGA
7401 TTGAAGAATA TCGATATCAT TCCGAGTGTC ATGGTTCGTG AAGATTATCC
7451 TAGCCGTCCT GGGGAGGGCT ATCGCGAAGG CCTATTACGT ATGTATGGTG
7501 GCAAGGGGGC TCTGTGACTT CCCTACTTTA GTTCCTAATG AGCGCTTGCC
7551 CATAGGGCCT TTCTTTGTCC CGCAGCACAC TTCCGGTGCG AAGGGTAAGG
7601 AGTTTGCTAA AAGGAATTTT TCTATAATTT CGGGATTGGA TGACATATTA
7651 AAATTATGTA TTCTTCAAAG GCGTCCTTTT GCTTTGCAGT GGGATAACCT
7701 CTCTGTGAAA AGTGATTATG AGGAGGCTGG GCCCGCTATT GGGATACGTT
7751 CTCTTGAGCC ACAAGTTTCT CAAATTTCTC CAGCCCACGG CCGGCTATGT
7801 AGTACTTTGG TCCAGTGGGC CCCTATCCTT GGTTCTGAGG AGCAGCTAGT
7851 TTGGTTAGAA GAAACAATGA AGCGCCTAAA GTTTCCTAAA AGTTTAGGTA
7901 GTAAGGACGC TGTTATTGTG GATTCGGAAA TGGTTCCTGT GAACGCCAAT
7951 CCTACTCAAG AGATACCTGC AGCTTCCGAG ACTGTAGAGT CTTCACCTGT
8001 AGCTCCAGGG AATACAACAG ATACCATGCC TGCAGCTTCG GGAACTACAG
6

CA 02350775 2001-05-11
WO 00/27994 PCTNS99/2b923
8051 ACACCACATC TGGGGTTTCA GAGGCTGCGG CGGCTGAGGC TGCCGTGGAT
8101 TCTACACCAG GGACAGAGGA GGAGCCGAGT TTTTCTCTGA GGTATGCGCT
8151 TGTAGTTCAA AATGTTCCCT ATCCAGAGCC GCCTAAAGAA CCTGAGGTGA
8201 TGTTTACAGA TGAAGAAAAA AGTCTGATTT TAGAAGCTAC TCGTGCGCGT
8251 CGTATGGAGT TGGACTTGTA TAATGGCTAT TTAGCTGATT ATGAACTTTC
8301 TAAGGATGAA ATACAGAAAC ACGTTCCTGA TTTACCTGAG AATTGGCGTA
8351 CGAATTGGCG TTGGTCGGAG AGGCTCTATA AATTTTTCTT TAAAACAAAG
8401 AAAGAAGGAT TAGAAGAAAT TTTCTTAAAC AAAGAGTTAG GGAATATGAT
8451 TCTTGCCCGA GGGCTGGCGG CAACTCAGTC ACAAGCACGT ATTAAAGTAT
8501 TCAATTCTTT AGTGGCATGG CTCTTGCAAA GCTTTAACGT AGGGAGGAGC
8551 TGTACAGCTA AACCTCTTCC TACGTCAAAA CTAGACCTCT TTAAATCGGA
8601 ATTCGAGTCT AAGCCTAAAA ATAACATCTT AACGGAATTT TTGGTGGCCT
8651 CTGATGAGGA GATTCTCTTT AAGGGGCTAC GGGTCCTAGA GCCTGGAATC
8701 GAAGGTTGGT ATGACCATCC TGATCAAGCT GGAGAGATTC GGTCGGTACT
8751 CGAGGGTCTG GTGCAGGCTG GACGTATTTC TGGATATTGG GAGAATCAGC
8801 CGTTTGGGAG ATTTGTCCTT AGAGGAGTTG GTGAAAGACG TACCGAGCTT
8851 GTAGAGCTTT TGGAGAGTTT AGTTGCTTCT GGTGAGATTA TGCAGTTCTT
8901 TGAGTCTTCG GATGAAGAGG GTGCTTTTAT TATCGATAAC GAACCTAGCA
8951 AGACTGCTAT GCTAAAACAG CGATTTAAGA GTTGTGTCAG GACGAAGCTT
9001 GTCGGGAGTT TTGCTGATGA GAGTCTTCCC AGAGGTAGGT TTACCATTTT
9051 AGTTTAGCGT GGGGTAGAGC ACTCCACGAA TCTTAGGGAG CTCCTTGCGA
9101 CCAAGCTTGG AGATCCTCCA TGTTTTATTG TTTCTCTAGT AGCCAAATCG
9151 TAGCCGCTCC TAGGAACAAT TTTTTCTTTT TCGCAATATA AAATCCTGAT
9201 TTAGAGAATA GGTCTTCAAG ATCGTGGTCC TTTGGAAGTT GCTGGATACT
9251 TTTGCTGAGA TAGCTATAGG CGTCGGGATC TTTAGAAACA GACTTTCCAA
9301 TCCAGGGGAC GACAGCACGC AAATAGAGCT TATGGGCACT ATAGGTAGGG
9351 TGTGTTTTTT TTGGAGGTGT GAGCTCTAGA ATGCCCAGTT TTCCAGAAGG
7


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
9401 .CATAAGCACT CGGGAGATTT CTTGTAGGGC TTTATGTGGA TCCGAGAGGT
9451 TCCTGAGGCC ATAGGCCATC GCTGCTAGGG GATAAGAATG ATTCTCCAAG
9501 GGCAGTTGAT TAATATCGCT ATGAATAAAA GAGCAAGAGC CCTGGGGAAG
9551 GTGTTGTTTT GCAATGTCGA GCATTGCTGA GGAAAAGTCG ACGAGAGTTA
9601 CTGATGCTTG AGGGTGTGCG GCAATATAAC GCTTCGCGAC TTTTCCTGTT
9651 CCTGCGCAGA GATCCAGGAG AGAGTATCCC GACCCTAGGA TCTGGATCAA
9701 AGAGCGATTC CAGAAATGGT GCATTCCTAA AGAGAGTATT GTATTTGTGC
9751 GATCATACTT ACTCGCTATG GAATCGAAGA TCTTTTTACA GTCGGGCTTG
9801 TTGGTAGAGG GTTCCATAAT ATTCCCGGAA TTTTTCAAAG CTTTCGTAGT
9851 GTTCTTCTCC TAGACGGTAC TGGCATAGGG CATAGTATTC TTGAAGAAGA
9901 GAAGGGGGCA GACCTGTATG TTGATGAGCT TCTTTAAGGA CTTCTTCGGG
9951 TGAAGATTCG AACTGTTGGA GGGCTTCTTC CATCGCAAGG TTGGGTAGGG
10001 GATGTTCTTT CCAAGAGGTG CTGTGTAGAA GAAGAGCAAA TACAAAAGGT
10051 AGCTTTGTAA GATCATACCA CCCCGAGGCA AGGTCATAGG TTACAAATCC
10101 AGGAAGTACA GGATGTTGTA GCGCTGCATC TCCGATTAGG AGGAGGCCAT
10151 CATAATTTTC AGGGGTTTGT CTGAGTACTT TTGTAGTTAT GAATCTTAGG
10201 ATATGAGGAG TTGGGATGCG CCAGAGATGA CGACAAAGCA CTTTTAAGAG
10251 TCCTATAGAG GAGCGACTTT CTAAAGTTGC GGCAATCCGA GGTTGCGGTG
10301 AGTTAAAGAA AGTGGGAGCT GCATAGAGGT TTACACTGAG GATACGTTGG
10351 TTTGCTGCAA TTCCAAAGCC GGGGACATAC CCCAAGTTAT GAGAGATAGC
10401 TCCTAGGGAT GAGGTCAAAG CAACATCGAG TTTCCCTTCG ATTAGCAAGT
10451 TGAGGAGGTC TGCAGGGGGA GCAAGAACAC AGCGAATATC GTTTCTTTTT
10501 ATGAGTTGTA GGGACAGCGG AAAGGAATTA ATATAACTTA CGCAGCCTAA
10551 GCTTATACAT GGCTGGAGTT GGTTAGACAT GGCGTTCTCC CTTGTTGTGT
10601 GATGAGGGCC GCCATTCCCT CAGCGTCCAT TTTAATAGGT TCTTTAGATG
10651 AGGCCATCTG GAAAACCTTT TCCCCCATAT GTGTTGAAGA AAGGTCATTA
10701 GCACCACAGG AAAGGAGGTC TAGAGCTGCC TCAATACCTA GGTAATTCCA
8


CA 02350775 2001-05-11
WO OOI27994 PCTNS99/26923
10751 TAAGGCTTTC ATATTGGAAA AGTTGTCTAA GAAGATTCGG GCTACTGCCA
10801 TTAAAGATTT TAGAGGGATG GCATGACCCT GGCCTGATTT TCTTAATCTT
10851 TTTCCTAGGA CATTATTTTC TTGGGCGAAT TTTAGAAGTA TGAAGTTTTT
10901 AAAGCCCTGA GTTTCGTCTT GTAAGTCGCG GACTTTTACC ATGTGGGTGA
10951 CGAGGTCTTC AGGTCCTTCT TTATGATAGC AGAGCATGGT TATATTGCTA
11001 TGGATTCCCA GTTGATGAGC CATCTTATGG ATGTTGAGAA AATCAGAAGA
11051 AGAAAGGCGT TTGGGAGCTA AGAAATTACG TATTTTGTCG ACGAGGATTT
11101 CAGCTCCTCC TCCGGGGATG GAATCAAGAC CCGCATCTTT TAATGTGAGA
11151 AGAACATCGC GAATAGAAAG GTTATCAAGA TCTGAGAGAT AGGCATATTC
11201 AATGGCAGTA AGAGCTTTGA TATGGATCTG AGGATCGTAC TCTTTGATTT
11251 TAGTAAATAG ATCGGAATAG TATTGCAGAT TGCAGGAGGG GAAACAGCCT
11301 CCCACGATAT GTACTTCTGT AATTGGAGTT TTTATATTTT GGATTTGCTG
11351 TAGAAGATCA TCTGGGGAGT AGAGCCATCC TTTAGGGTCT CCAGGTTTTG
11401 CATAGAAAGA GCAAAATTTG CAGCTGAAGT CACAGAAATT TGTAGGATAG
11451 AGGTACAAGG TTGAGGAGTA GTATACAGTG TCGCCAACCC GTTGTTTGCG
11501 AACTTGGTCT GCAAAATTCC AGAGTGTGCG TTGATCTTCT TTATTCGTGA
11551 GGAGGAGGAG ATGAAGAGCG TCTTCACTGC TTAATCGTTC TTGGGCATCC
11601 AGTTTTTCGA ATATGGAGTA GAGGGGGGAA GTTTTAGGGG GCTGTGGGAG
11651 GCACGTCGTC ATTTGATGAA CACTTTGATG TACTATTCTC TCGAGATTTT
11761 GTAGCACAGT GCTCTGTTTT GTCACATGTT TTTTTTTGGC AGCAATCTGG
11751 TTTTTGACAC CCTTTAGAGA GGGGCCTGCC AAGCAAATGG GAACCTACAA
11801 GTAGAATACC CATCCCTAGA CCTAACAATA CTGTGGCACA GCAGATTACG
11851 AGAA.AAAGTG'TCATCATAAG AAATCCTTAG ATAGGATAGT TTCTTAATTT
11901 AAATCCACCC AGATTGGGGA ACTCCAGGCC ATAGCATTGT CTGCCTGAGT
11951 GACCCTGAGA TAGTAGAATA CAAAAGGTGC TTTACCGTTT GGATCTTTTA
12001 GGGTCACTGA ACTTAGGGGT ACCATA'hCAT CGTATTCATA GTCCAGGTTA
12051 TTGCTATCGG GGAAGAAGGT ATGGAGAACT TCGCCATTGC GGATGATTTC
9


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
12101 TACAGTCTTG AGTAGGGCAG TGCCTGCCAC ATGACCAGAG ATGTGACGGT
12151 TGACGTTGAG TCCAGGTTTC GACCCTGTGG AGAGTTCGGA GCCCATAGGG
12201 GCTGAAGTGA TGTTGAAGCT TAAGACGATC CTAGGTCCTG TTGTAGCGTA
12251 GCAATGACGT GCGAATAAAG CTTCAACAAG AGACTCTCGG GTATATTTAT
12301 TACAAATGAT AGCCGTCAAC CCTGGGGAAT ATTGCACTTG CGGAGAGTCA
12351 AAGTAGTCTT TATAAATTCC TCGATCGTCG AGACCCCCAG CAACAAATCC
12401 GAAGCGGAGA TTCTTCTTTA ATCCTTCAAT TACTGTACCT CGAGGATCTT
12451 CGCTATCTTT ACCTTGGATA GGGAAGGGGT TGTTTAGAGC GGCTGTGGTT
12501 TCTGAAGATC CCCAGGCATT ATAAATTTCT ACAACTCTTT CGAACTCGGG
12551 GTAGAAATTC TCAAAGTCAA AACCATGTTC TTTAGAAGCT GTGAACGAAG
12601 GAATAGAAAT CATGTCGTGG TTGACAGTGC TTTTATAGAG CTTGGCGAGG
12651 GGAATATGTT TGTATTCTTT GTGTTTCGAG TGGGACTTTG TTTCCTTGGT
12701 ATGAAGGATG TGACGCACTC CCTCGAGATG AGGTTCTCCG CTATATTGGA
12751 ATCCGGATAG TGTGATGAAG CGATCTTCTT CATTAAAGTC GGAGACAGTT
12801 TGATTGATGA GCTTCCAAAT ATCTGGAGAG AGGTTCTCTT GATTTTCGAA
12$51 TGATGAAGAA GCATAGAAAT TCAGAGCGCG GTCATCTCGG AAATAACGCA
12901 TACAAGTTTC AATATTTTCT TCAGAGTCGA CGCGTTCGGA TTCGCCGTGG
12951 AGGAGACCCC ACATAAGATT CGGGGCGGAG TCAGCGAAAC ATTTGATAGG
13001 GGCAGAGATG AAAATTTCTT GTGTAGAGAG GTTTTTCAAT TGGATGCGAT
13051 AAATTCCAGG CTCATTGAAA TAGAGATTAG GAAGAATAAC AAAGCCTGTT
13101 TCTGGGATGA AGAGCTGCCA ATTTAAATTT TCTCTAAGAT GCTCGTAGGA
13151 AAGCTCGATT CGGGTCTCTT CAGGAGAGAA GTTGGTGAGG TTCCCGAATT
13201 CGTCTTCAAA TCGCACGGTG ATATCGAAGC GTTTGTTTTT AACGACATAG
13251 GAGGGAGTAA AGATCTCTAT TTTTTTTAGG ACGTTTCCGC GGATATCCAT
13301 AGAGAAGACA TCGGGTTCAT CATAGTTTCC TTCTCCTGTA GGATCGATGT
13351 AGAGGTAAAA GGGTTTGCGA CGTTGTGCGA AAAGTTGGGC TCCGTTCCCA
13401 GCATCATCGA CTTGAGGATG GTTTGGAGAG GCTCCCATGA CAATAGTGAG
10


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
13451 GGTTTCTCCT ACTTGAAGTT CGTAGGGGAG AGTAAACTCG AATTGTGGAA
13501 CGGGATTGTC TTTTACAGGA ATGGCGGTTG CTTCGATGAT TTCGCCTTCT
13551 GGCATTTCTG CGTAGATTAC GTTTCTAGTT TGGGAGAGAT CTGTCGCGGG
13601 GGCTTCCCAA TCTGTGGGTT TCCCACTTCC TGCTAAGTCA AATTTACATT
13651 TGGTTCCAGC TGGTAGTGGT GTGGCAAGGG AATAAAGAAA TTTCCAAGTA
13701 GAAATTTGCC CTGCTCGAGC TATCGAAGGG TTAACGTAAC AAACAGATCG
13751 TCGCATAGTA AGAGGGAGGC TTTATATGAC TTAAAAGCGC CATCATATAC
13801 TAACAGTGAG GTTTTTCTCA ATCCCCGTCT TTGTTTAGTG TTTGTATCGC
13851 TTCATCCACA GTATTGAATA TTTTAAAGTA AGAAAGGAAT CCTGTAACAT
13901 AGAGAGTTTG TTCTATGGTT TTTGGGACTG TAGTCAGGAC AATTTTCCCA
13951 GAATGTTGTC CTACTTGATG GTAGCTTTGC AGTAGGACTC GGATACCTGC
14001 ACTGGACATG TAATCGAGGT GAGCACAGTC GAGAATGATA TTTTTGGATC
14051 CAGCTGCTAG GGATTGGGAA ATATTTTCTT GTACTTCTGG AGAAGA~ATT
14101 CCATCAAGTT TTCCGTGGAG ATGAAAGATT GTTGTTGAGC CGTGTTCTTC
14151 TTTTTGGATA TCACTCATCT AGATAGTTCT CCTAACTATA CGGGAGCTTA
14201 AGTTTTCACT CTGATAAATC TTTAGCTTTT TTGCAAAGAG ATTTTTATTG
14251 GTGATGTTTG AGAATTTCGA TTGGGGGGAG GCAGGATGGG ATCGTGGAGT
14301 GCAAGGAAAT CGGTCCTAGG ATGTTTACAT TCTTAGGAGA TATTGAATTT
14351 ACGTTTTCTT GGTGTGATTT TTAGTTTTCC GACATTTCGT TCTGTGCAGG
14401 TAATGATTTC TATATCGAAG TTTTCGTGAT GGATACGCAT TCCTTTTTGG
14451 GGAACAGCAC CCACTTTATG GAAGACATGT CCTCCTAGTG TATCGTAGCT
14501 ATTTTCATGA TCGATTTTCA AATTGAAGTA CTCTTCAGCG TCGGAGATAT
14551 TCATTCTTCC ATCTACAATC CAAGAGCTTC CGATTTTCTT ATAAGGAGTA
14601 TTTTCTTGTA CGTCGTGCTC GTCTGCGATC TCTCCTATAA TTTCTTCGAT
14651 AATATCTTCC ATGGTAGCGA TGCCTTCTGT GAATCCGTAT TCATTGACTA
14701 TGATGGCTAG ATGGCGATGT TTTTGTCGGA ACTCTTGGAG AAGAGAGGAG
14751 GCTTTTTTTA TTTCTGGGGC ATAGAATGGG GGTTTGCTAC TGAGGATATG
11

CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
14801 GGTTGGCTGA GGTCGTGGCT GCTTGTATAG AGCAGTAAGA GATCTTTAAC
14851 AAGAAGGATT CCTGTGATGT TGTCTAAGTT TTTTTTATAA ACGGGAACGC
14901 GACTGTAGCC TTCTTCGCTT ACGAGAACCA GAGCTTCTTG TAGTGTAGTT
14951 TCTTCGGGAA GTGCGAAAAT ATCTACTTTT GGGATCATGA CTTCACGGAC
15001 AATGAGGTTA TCAAAAGCGG AGAGGGCTTC GGAGAGCTGG CTTTGAAATG
15051 ATGTTGAAGA TCGTACTTGT TGGTTAGGGC GGCGTCTGTA AAAGAGCAGT
15101 TGCAGTGGGA AGAGACCGAG TTGGAATACC GAAGCTAGAA AACGGAGGTG
15151 GGCGGTGGTT TCTTTAGGGA CTTTTGTAGA GATCCATGGG GGGAGGAATC
15201 CGTAAGCTAT CAGGGCGCTT AGAGAGTATA GGGGCCAGAA TAGGAGATCT
15251 TTGTGAGCTG TTTTTGGAGG GAGGAGGGTA TAGAGTTTTG TCCCGAGAGC
15301 TCCATAGAGG ATGCAGAGCA GCGTGGCGAG AATTGTAGGA GCACTGGGGA
15351 AGGGGGGATA CTCTCTTCCT TTATCTTTGA AGAAGCGTTG GTTTAGGGTT
15401 TTTAGGAATT TTGAGGATCC GTGACAGGAC GGTTGCGTAA GCCCGAAGGC
15451 TAGGAATAGA AGAATACAGA ATATGGCTAA AAGAATATGG AGCATGTTAA
15501 GCTGTTAGCA AAGCATGTTT TTTTCTTAAC ATACACAGGA TTTGATTTTC
15551 TTTAACTCTC ATTTTTCTCT TTTCTTCTGA TGAGGTGTCG TCGTATCCGA
15601 GCATATGGAG AATAGAGTGG ACGAGGTATC TCGAGATTTC TTCGTAGATA
15651 TCCTCTTGGT TTGGGGATGT GTTCTCTAAA AACCTAAGAG CGGCCTGTGG
15701 GCTAATGAAT GCTTCTCCTA AAACATGAGG ATAAGCGGGA TCTCCGGGAG
15751 CATCAATAGG CAGAGTGATC GTATCTGTTA GAGAAGGATC AGCAAATACC
15801 TTATCATGGA GTTCTGCAAG AGCTTTATCT TCTAGGAAGT AGATAAAAAT
15851 TTCATTAGTT GTTACTTTTA AGTGCTCTAA GAGCGTAAGA ACCAGCTTCT
15901 CTACAGAAAC CAAATGAATA GGAATACATG TTTGCTCATT GGAAACATGT
15951 ATTTTGATCT TTTCTTGCGT CACGCGAATG AAATTCCCAT AGAGAAAAAC
16001 AAACTTATTT TAAAATAGGG GTCTTAGGTA AACCTGTGAC TTTTTTCGCT
16051 GTACTATCAT TCCAACGGCC CAACTTACGC AAGACTTCTA CTCGCTCAAA
16101 ACGCTTCAAA ACATTTCTTT TGGTAACCCC TTTGACAGAT TTACCATAAC
12


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
16151 TACGATGTCG AGACATAATC CTGCTCTAGA AATAAACCTA TTTTCGGGAT
16201 AAAAATGCTA TCACTGGTGC TCAATGCATT CGTATGCAAT TTATATACAA
16251 TTCTTGGAGC TGGCGCTGGA TGCACAATGG CATACTTAGG TTTACGTTTT
16301 TTAGGACTTT TCGCTCTGCG CCGAGCTTGT TTACTCATTC GTGTCATAAA
16351 TAACTCGATA TTGAATTTTT TAATTTCTCC AGACAACGGA AATTGAGGAT
16401 ACGGAGTACT TTATAAGAAA AAGGATAGTA AAAGAAGAGT TTTTTTTCAA
16951 GAAGATGACG TCTTTTAGCT GCCTTGATCT TGGTGTAGCT CGTCAGGGAG
16501 GGGAGACGAG GGGCATGCCA TGGGGGTTGA GAGGACTCCA CAGGAGATAA
16551 TATATTTGAA AGCATCTTCG ATTTTCATAT CTAGGAATAC GATATCAGAT
16601 TTTCTAAATA GGGTAAGAAA CCCTGAGGTG GGGTTGGGTG TTGTTGGGAT
16651 GAAGACCGTG ACGAGGGGGT CGTCTTCCTT TTCTCCTGTG CAGCATACTG
16701 TGGGTGCGTC TCCAGCGACG AGACCGATGC ATTGAACATT TGCGTTAGGG
16751 AAAGGAACCA TAACTACTTG TTTGAAGGAT CCTGATTTTG ATCCAAATAT
16801 GGTAGTCATG ACTTGTTGCG CAGCTTTATA CACTGTTTTA ATGATGGGAA
16851 TTCGGTGTAA GATTTTGTCG TAGATAGAGA GTAGGGATTT AAAAATCATA
16901 ATTCTCGTGA GGAAACCTAG GAGCACTGTG GCGAAAAAGA GACCGAAGAG
16951 TAAAATGATT TGCAATACGA ATTTTAGAAG AGCTCTATGT TTAGTATAAA
17001 AGCTAAATTT CTCAAAGAAT TCCGAAGCCA AGCCTACGAA GGGTTGGGTT
17051 AGGAAGTTCA TGATCATAGT AACAATAGCA ATAGTAATTG CTAGAGGAAG
17101 GAGAATAACA AGTCCTGTAA TAAAGTATTT TTTCATGATT CTCCTGCAAG
17151 ATATGAGGAA ATGGGCATTT GTTTCTTTAC TATACAGCTT AAGATTATTT
17201 AAGATAAAAC TTTTCCCGAA TCTTCTGGGG ATAGGAGAAA TCTCCATGGG
17251 ACATCACGAT ACTCTTGAGC ATAATCGATG CCGATCCGGG CAGTTGCTGT
17301 TAGAGTCCCA GAGATTTTTT CTTTGCTGAT ATAGAGAGCT GGGGTATTTA
17351 GGCGTTGCCT ATTGTTTTCC AAAGAGATTC CTAGAGCTTG GCACACTTTT
17901 CCGGGTCCAT TGGTGAGAAG GTGTGGGGGT TTATCTCTCC ATTGGCGGCG
17451 TTGGATCATA AGTTCTTTGC CTTGATCAGG AAGGATGGCC CGGATCAGGA
13


CA 02350775 2001-05-11
WO 00/27994 PCT1US99/26923
17501 CGGCATGGGG AATGTCCTCA GGTCCAGTGA CAACATTCAA TAGGTGATGC
17551 ATGCCATAGC AACGGTAGAG GTAAGCAGAG CCTCCTTTCA GGTACATCGC
17601 TCTGTTCCTC TGAGTTTTTC TGTAGTTGTA GGCGTGGCAT GCTTTGTCAT
17651 CAGGGCCACG ATACGCTTCG GTTTCTACAA TGTAACCTGA AGTTATCAGA
17701 CCCTCATGTG TTGTGATGAG TTTATGTCCT AAAAGCTGTT GCGCTAGTGT
17751 AATTACATCT TCCGATAGAA AAAAATGTTC TTGTAGCACG TTACGAGGCT
17801 CTTTTTTTCG TTCCTTTTTT CTTAGAAGGC GTTTTCTTTA TTTTCTTAGG
17851 TTTATCTTCT GTGGTTGTCG CTATAGACCA GACGATTTTT TGCGTAAGGA
17901 GATTCACGGA ATCAATAGTG ACTTTTATAG AAGCTCCAGG TTTCATTTTA
17951 TCTGGGATAG ATTCTGGAAG AGCGTTTTTC TTTAGGGAAT ATTCTTTAGG
18001 GAGTTCTGCT GCTGCAATGA ACCCTTCATG GCAGAATTCG GTCACTACAA
18051 ATGAGAGTCC TTCATGATTT GCAGTGATGA TATACGCATG GTATGTAGTT
18101 TTAGGTTGCT CTTGCAAAAA TTTATTTATG AACCGAGTTT TTTTGAGGTT
18151 TTCGAAAGAA TTTTCTGCTT TTGCGGATAC TCGTTCTTTT GTAGAGCATG
18201 CTCTTACGAT AATTTCGAGG TGCGTTTGGT CTATAGATAG GGGGTTGAAG
18251 AGAAGCCTGT GAACAATAAG ATCGATATAT CTACGTATGG GACTCGTAAA
18301 GTGGGTGTAG TAGTCGAGCT TAAGTCCGTA ATGACCTTTA TTTTCTGTAG
18351 AGTAGGAGGC TGTTTTCATA CTTCGGACAA ACTGCGAGTG TAGAACTTGC
18401 TCTAGGGGAT GTCCTGCTGA CGTAGTTTGC AAAAGGTATT GGTAATCAGG
18451 TTCTTGTGTG GGAGTGAACG TGATATCAAA GCCCATGTTT TTTGCCAATT
18501 CTTGGAAGGC GAGTAGGTTT TCATCATTGG GAGGTTCGTG ACTACGAAAA
18551 GGTAGAGAAA CGCCTTGATG GGAGATATGA TAGGCGACCA CTTCGTTTGC
18601 TTTAAGCATA AACTCTTCGA TGAGTTTATG GGAGAAGGTC TGGTGGTTTT
18551 CTATCAGAGC TACGGGTTCT TGAAGATTAT CCAAGGACAT AGTGACTGAG
18701 GGGAGGACAA AGCGAATGCA ACCACGTTCT TCACGGATAT CGGAAAACTT
18751 TTTACTTAGA GTGGCCATCT CATTGAGGAT TTTTGAGAGG GGGTGGGAGT
18801 GTTTCTTTTC AATGATGTTA TCGACTTCAT CGTAGGTCAT ACGATATTTG
14


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
18851 CTTCGAATGA CGCTACGGAA AATCTGGTAA TCTGAAAGAT GACCTGATTT
18901 TGTAAACGTC ATAAATACGG ATACAGCGAG TCTATCAACG TTTGGTTTTA
18951 AGCTGCAGAG ATTATCAGAG AGTGCTGATG GCAACATGGG AATGACTTTC
19001 CCTGGGAAAT ATGTAGAGTT ACAGCGTTTA GCAGCTTCTT TGTCTAGGTG
19051 AGAATGTGGG GTAACGTAGT GGGAGACGTC TGCGATGTGT ACACCAAGAA
19101 TGTAATTGTT ATTATGATCG TAGGTGAGGG AGATGGCATC GTCGAAGTCT
19151 CTGGCTGTGG AAGAGTCTAT GGTGAAACAG AGGAGATCAC GGAGATCTTT
19201 GCGAGAGTGG AGAACTTGGG TAATGTGTTT TTGAGAGAAA AGGCTTGCTT
19251 CTTCAATGAC CTCTGGGGGG AATTCTTCGG CAAGGTTATA TTCGGCTTGA
19301 ATTGCCTGAA AGTCCGCTTT AGCGTTGGTG ATGTGGCCA.A TAAATTCGAG
19351 CATTTGTAAG GCTGGAGAGG CTCCTTCTTG GGGTTTATCT ACCCAGGGAG
19401 GAGTGCTCAG AAGAATGCGA TCGCCGATTT TGTAAGTGCG TCCGGGAAGG
19451 AGTTCTACTG GAATTAAAGA TTGGGATCCC GACATGCTTG TGTAGGCAAG
19501 TGCTGATGTG GGACTGACTA GTGAGGTGAT CGTTCCTACG AGTGTTGTTT
19551 TTCCTCTTGC GAGTACTTCG CTGATAGTGC CTTTGAGTTT TTGTCCGTCT
19601 CTTGGATAGG GAAGCACGGA GACAATCACG TGGTCACCAT CTAGAGCCCC
19651 GCGTAAATCT CGGGCGGGAA CAAAAATATC AAATGGGTAT TCTTCGGGGT
19701 TGTCGGGAGA AACAAAACCG AAACCTTTTC TAGCATGAAC AAATAGGGTT
19751 CCTGGAATAA AAATCTTCAA GGATTTACCG TATGTTCTTC TCCCTGGTTT
19801 TCTTTTTGGT TTTTTCAACA ATTGGGCTCC GCCTGTAAGT TTAGGACATT
19851 GAACGAGAAA TCCCGTAGTT TCACTCGTGA ACTGAGTGGC TCAACAAAAT
19901 TTTCCCTTTT AAGTGGGTTT TGTGATTATG AAAGGAGCAG TCTCAAATTC
19951 AAAGCCAATT'GTACTAAAGA AGCTCTGTTA TTGCAACTCC TTGATCAGAG
20001 AATAAGAGAA TGGAACTGTT TTCTGTTTAT AAGGAAGTTT CCCATCCTCT
20051 TATGAGGATG GGAATAGAGA AACTTAAATT GAAAATTTTG ATTACTTATC
20101 GTCGTTATCA ATAATTTCTA CATCAG~TTC TTCGATATGG TCTTCTGAAG
20151 AACCGTTATT TGAAGGAGGC TTCGTACTGA AACTATGTTT TTTCAAATCT

CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
20201 TCTGTATTGA TGTTAGGTCC ACCTTTAGCA TTGGCTGCCG ATGATGCTGC
20251 TGCTGATGCA GACTGCGATT GCATAGACTC TCCAATTTTT TGCATATGCT
20301 TGCTTAGGTC TTCAGTAACC TCTTTAATTT TTTCAATAGG AGCGTCATCT
20351 TTGAGTGCGT TGCGCACGTT TTCGATTCGC TCTTCGATTT CTTTAACTAA
20401 AGTTTCAGGA ATTTGCTCCT TATAATCTTT AATAGCTTTT TCGGCTCTGA
20451 AGATCATGCT ATCGGCTTCA TTTTTAGCAT CTGAAGCTTC ACGACGTTTT
20501 TTATCTTCTT CCTTATTAAT TTCGGCATCT CGAACCATTC TTTGGATTTC
20551 ATCTTCTTGA AGTCCTGAGC TTGCTTCGAT ACGAATTTTC TGTTCTTTAC
20601 CGCTGGCAAC ATCTTTAGCT GAGACATGGA AAATTCCGTT TGCATCGATA
20651 TCGAAGGAGA CTTCGATTTG~AGGATGGCCT CGAGGAGCCG GAGGGATATC
20701 TGTAAGATCG AATCTTCCGA TTTCCTTGTT ATCTTTGGCC ATGGGACGCT
20751 CTCCTTGGAG AACTACGATG GTAACCGCAG CTGGTTATCA GCAGCTGTGG
20801 AGAAGATTTG TTTTTTCTGT GTAGGGATTG TAGTATTTCT CTCTACCAGA
20851 GTCGTCATGA CGCCTCCTAG AGTTTCGATA CCCAGAGATA GGGGGATAAC
20901 GTCTAGAAGT AGAACATCCT TAACTTCTCC GCCAAGAACA CCACCTTGAA
20951 TTGCGGCTCC AATAGCAACA ACTTCGTCGG GGTTGACTCC TTTATTAGGC
21001 TCTTTGCCGA AGAGTTCTTT TACAGTTTCT TGCACTGCGG GCATTCTTGA
21051 CATACCTCCA ACTAAGAGAA CATCATCGAT ATCCTTAGCG GAAAGTTTTG
21101 CGTCACTGAG TGCTTTGATG CATGGAGATT TTGTTCTTTC GATTAGAGAG
21151 GCTGCGAGTT TCTCGAATTG CGCACGTGTG AGTGTCAATG CAAGGTGTTT
21201 AGGTCCTTGT GCATCCATTG TGATGAATGG CTGATTGATT TCTGTGGAAG
21251 AGACTCCTGA AAGTTCTATT TTTGCTTTCT CAGCAGCATC TTTAAGTCTT
21301 TGTAAGGCCA TATTATCTTT GCTAAGATCA ATGCCTTCTT GTTTTTTGAA
21351 TTCTTCGATC ATCCATTTGA TAATGACTTC ATCAAAGTCG TCTCCACCGA
21401 GGAGAGTATC TCCATTTGTA GATAGAACTT CGAAGACGCC ATCACCGATT
21451 TCTAGGATGG AGATATCAAA AGTTCCTCCA CCAAGGTCGA AGACAGCGAT
21501 TTTTTTATCA CCGACTTTAT CGATTCCGTA GGCAAGAGCT GCTGCGGTAG
16


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
21551 GTTCTGGAAT GATACGTTTT ACATCTAGAC CTGCAATGCG TCCAGCATCT
21601 TTTGTGGATG CTCGTTGAGA ATCATTGAAG TATGCGGGGA CGGTGATCAC
21551 TGCTTCTGTG ACAGTTTCGC CTAGATAAGC ATCAGCTGTC TCTTTCATTT
21701 TCATTAAGAT TTGTGCGCCA ATTTCTTCTG GAGTGTATTG TTTGCCATCA
21751 ACTTCGAAAA CGGCATCACC TTTAGATCCG GAGGTGACTG TATAAGGAAC
21801 GGTTTGGATT TCCGAAGCTA CTTCAGAGTA CTTACGGCCA ATAAAGCGTT
21851 TTGTAGAGCC GAGAGTTTTT TCTGGATTTG TCACTGCTTG ACGTTTTGCT
21901 GGAATCCCCA CTAATTTCTC ATTACCTTTG AAGGCAACGA TCGATGGCGT
21951 GGTTCTTGTT CCTTCGGATG ATGTAATTAC TTTAGCTTGT CCTCCTTCCA
22001 TAACAGATAC GCAGGAGTTT GTTGTGCCTA AGTCTATACC TATAATTTTG
22051 CTTGATTTTT TGTGTTCACT CATGTTTGGT ACCTAATCTC TAGGGGTTAT
22101 TTCTATTCTT TATTTTCTTT GGGAGTAGGA GCTTTAGCGA CTTTAACTTT
22151 AGCTACCCGA ATCGGGCGTT CTCCTATTTT ATATCCCTTT GCAAACTCTT
22201 CTAAAATCGT CCCCTCAGGA ACTTCAGAAG TCTCTTCTGT TTGCACCGCT
22251 TCGTGTAGGA AGGGGTTAAA CTTTTGGCCT ATTGAAGAAT ATTCAATAAT
22301 ACCTTTTTCC TCGAAGATTT GTTTGAATTG GTTGAGAATC ATGTTGAATC
22351 CGAGGGCCCA ATTTTTTACA TCGTCGGACA TTTGTGTAGC AAATCCGAGG
22401 GCTTTCTCCA TGCTTTCTAT GGGATTGAGA AAGTCTATTA AAGTATTTTC
22451 TAAAGCATAC TGCATAAGTT CTTGGCGTTC TTTTTGTAAG CGTTTTCTAG
22501 AATTCTCAGA TTCTGCTAGA GCCATGAGAT ACTTATCGTT TTTTTCTTTT
22551 AATTCGGTTT TTAGGGTGAC GATTTCCTGT TGCAAATGTT CAACTTCATT
22601 TTCGTTTTGA ACATTGCTTT CGTGTTGTTC CTCATTTTCA GGTGGGGTAT
22651 CTGTCATAAC GTCTCCTTAG AGGGTAATAG TTTTATAGAA GAGTACTCCG
22701 TTCTTAAAAT AGGCTCATTC GAAAGCTTAC AGTTAGAGGT GAGTGGTCTT
22751 CTGAAGGATA GTTTAAATTT GTAGAAACTT TG~'GTCAGGG TTTCATTTAT
22801 TTTATTCGCA AATAGTTTGA GCAAAGGAAG AGCTTCCTTA TAAGGAAGAT
22851 TGATCGGGCC TAGGATACCT AAAGCTCCGA GTGGAGAGCG ATTCATATAA
17


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
22901 TAGGGAATAG TAATTACAGA ACATCCTGGA TTCGAGGTCC CTAAAATATC
22951 AGAAAGCTCC TTCCCTATGA ACGCTGTAGC TCTTCCTTTA TGCATTCCTA
23001 TATTTAGAAG CTCACACATT TGTCTGCGAT TTTCAAAAAG AGAGAGTCCT
23051 AGAGCTAGAA CTTCAGGATC TTTAAACGCT TCGTATTTCA GTAGTTTCGA
23101 CATTCCTGTT TGATAGAGAT CTTCTTCACT AAAGTTGCAG TAGCGTGTTA
23151 GATAGCGGAC AACCACCTCA TTATAGAGGG ACATGCTCAG GTGTTCTTCT
23201 TTTTTCGAAA GTTCCTCATT TGTGGGGAGC TTTCGGATGT AGTTCTGCAG
23251 GAATTTTTCT ATACGTTTGA TAGAAAGAGT ATCGCAAGCT TCAGGCAGCC
23301 ATAGGGTGTC TGTGAAGATC TGACCAAACT CCGTAGAGAG GATGGTGACA
23351 GCTCTTTGCT TATCGACCTG TGTAATTTGA ATATTGGTTA CGGAATCATT
23401 TTCAAAGCGT GGGGAAGAAA AAAACGTAGG CAGGTCTAGG ATTTCTCCAA
23451 GAAGTTCCGT AGCTTTTTGT AGATCCTTGA TAATATTGCG ACTTTCGCTA
23501 GGAAGCTGAC TGATCTTATC AAAAATGGGG GCAGAAATCT CAGCTTCTGG
23551 GCATTCTTCT TGGTGATCTA CATAGTGACG TAATGCTAGG TCTGTAGGGA
23601 TTCTTCCTCC GGAAGTATGA TTTTTTTTTA AGAATCCTTC AGCTTCAAGT
23651 TCTGCAAAGT AATTTCTTAT AGTTGCCGTA CTCAAATCAG AGCAAAAACT
23701 TTCCTTTAAA GTTTTAGACC CTACAGGCTG CCCTGTTTTT AGGTACAACT
23751 CTGTTGTAGC AAACAGGATA TCAAGGATTT TTGAATCTCG CTTTGAGACT
23801 TTGGATCTAG CCATCTCGAG TCCTACTAGA ACAATCGTAA CTGAGAGCAT
23851 TATAGGAAAA GAACCCGAGA AGGTCAAGAG AATTTAGCAC TCGAATTGAA
23901 TGAATGCTAA CTTTTTTACG AGGAGGGCAG CGATCAAAGA GCTAGGCTAA
23951 GTGATTCTGA CACCAAGTAG GGAAGGCCTC CGGGGAGACT GTATACTTTT
24001 CTCCAGATCG GGATTCAATT TCGAATATTC CCGAAGATTG GTAGGACTTT
24051 CCTAAAATAA GCTTATAAGG AATGCCGATA AGGTCACTGT CTTTAAGTTT
24101 AAATCCGAGT CTTTCATCTC GATCATCAAG AAGGGGCTCA TAGCCTTGAC
24151 TTTGTAGCTC ATGATAAATA GTTTCCGCAA GCTCTTGAGA TACAGTGTCT
24201 CCTCCGTTAA AGGCGATAGT GATAGAGAAG GGAGCGAGTG CTTTTGGCCA
18


CA 02350775 2001-05-11
WO 00/27994 PGT/US99126923
24251 AACAATACCA CGGTCGTCGG CAAGCTGTTC TACACAAGCG GCTAATGTTC
24301 TTCCGACTCC AATGCCGTAG GTCCCCATCC AGCACTGCTG GGTTTGCCCG
24351 TGTTCATCTT GGAAGTTTAC CTCAAAACTA TCGGTATAGC GTGTCCCGAG
29401 ATTGAAAATA TGAGCAACTT CTATGCCTTG ATAAATGCGG TAAGGATGGC
24451 CAGGATTTTC AGGACATGTG TCTCCCTCTT CAGCGAGTAG AAAGTCACCG
24501 TATTGGGGGG GGAGGAGGTC GCGATCCCAG TTTACATTTA CGTAGTGCTT
24551 ATCTTTAGCA TTGCCCGCAC AAACAAAGTT CGTCATTGGG GACGTTGTTT
24601 CGTCTGCGAA AAAGTCTATG GGACAGTTTA GGGGACCGAT GAATCCTTTT
24651 TCTGTGCCTA GAACGCGTTC GATTTCTTCA TCAGAAGCTA GAGCAATATC
24701 ATCGGCATTC AGTTTGGAAG CGACCTTCAC TAGGTTGACT TGCCGATCTC
24751 CTCTCATTCC AATGGCAATG AATTTTTCTT CATTTGAGTA GGAGAGTTTT
24801 ACGACAAGGG TTTTTAAAAT TTTATGTAAG GGGATAGAGA AGAAGTTTGC
29851 TAGAGCTTCT ATTGTTGTAA TCCCAGGGGT GGCCACTTCT TCGACGGGAA
24901 GAAACTCGCG ATCGTAGGCA TGCTGTGGAG GAATGGAGAC AGCAGCCTCA
24951 ATATTAGCTC CATAGGAACC GCTGACGCAG ATCGTGTCCT CGCCTAGAGA
25001 GCAAAGGACC TGAAATTCCT CAGACTTTCC TTTGCCGATT TTCCCTCCAT
25051 CAGCTGTAAC GATGACATAG GCAAGACCGA GACGATCAAA GATCTTACTA
25101 TACGCAGAGC GGAGTTTTTC ATATTGCTCG TTCATTTGTT CGGGAGAGTC
25151 TGAGAAGGTA TAGCTGTCTT CCATAAGGAG CTCTCGAGAG CGAATGAGAC
25201 CGAATCGAGG GCGAATCTCG TCTCGGAATT TTGTAGCAAT TTGGTAAAGG
25251 TGGAGAGGAA GTTGTCTTTT TGAGGAGAGC CATTGTGCAA CAAAAGAGCA
25301 GATGACCTCT TCATGTGTAG GAGCTAGGCA ATGAGATTTT CCTTCGCGGT
25351 CTTTGAGAGT GTAGAGCAGT CCTTCCGAAG TAAATGCCTC CCATCTCCCT
25401 GTATGTTGCC AAAGTTCAGC ATTGTGGAGA AGTGGGAGTA GAAGTTCTTG
25451 ACCTCCAATC GCATTAAGTT CCTCTCTAAT GATGTTCATC ATCTTGGAGA
25501 CCACGCGCCA TAACAGGGGT GTATAGGTAT AGACTCCTTT ACTTACTTTA
25551 AATAGGTATC CTGCCTTTTC TAGGAGCTCG TTTGAGAGCA CAGCAGCGCT
19


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
25601 TTTATTTGCA TTTTTTGAAG TCTTATAAAA GAGTTGAGAC GTTTTCATAG
25651 AGTGGGGCTG TTATCTTCAG TGATTGATCG TCAAGAGTCT ATAGAAAAAA
25701 TTACAGGCTA TTGCGATTTT ATATCGAGAG TCTTTTAAGC TAGTAGATTT
25751 AGTTTAGCTA TGAAGGAGGT ATCTTAACAC AGGGATTTTT CTAGATCAAT
25801 GTTTTGCGTT GGGTTCTTAA CTTAACAATT GGGGTCAGGA TTATGTATTA
25851 TTTTAATTAT ATTTTTGTTT TTATTTAGAA ATTTAATAAT CTCAACTTAT
25901 AATTAAGTGT ATACTTATTA ATCTTTTATT TTTGTAATTG TAGTACTATG
25951 TCGTCAGTAA ATCAAAGCTC TGGAACCCCG AATCCAGAAG AGGTAACTTC
26001 TCCTGAATCT ACGGAAGAAA ACAAAAATGT TGTTTCTTCA GATGAGGCGC
26051 AAGCCACGCA TGCTGTGGCT CTTCCTATAG TCACTCAACT TTCTCTTCCT
26101 GAAGGTGTGG GGACCTCATC TGAAGAAACG GCGAGTAATC CGAGGGTAGA
26151 CGAGATTGTA GCTGAAGTTT CTTCGAGTCG GGCGGTTGCT GATCAGATCT
26201 CATCACTTGT AGAGCGTGTT GGAGAGCTTT TAGACGACCT TAAGGGTGCC
26251 CAGTCCCTTT TCACTAGCTT TCAGTCAGAG TTGAAAAACT GTCTTCCGGC
26301 ATGGAAATCT TCAACGAGAA GACTCGAAAC TCGAGGTGCT GGGGATAATG
26351 CGGATATAGC GAGGCTGGAA TTATTTCGTA GCGATTACGA GGCTGTCTTA
26401 GGCCATGCGA ACCAGTTTCA TGGGAAGGCT CATCTCATTT TAAGTAAGTT
26451 AACAGATGTA CATCACAAGC TACAGGGACT CAGTCGTGAA GATCTTTCCC
26501 TGGCGTTTGA CAATAATGAT AGGGTTCTTG AGCATCTGGG TTCGTTAGGG
26551 CTTGATGTAG ACGCTGAAGG TAATTGGTCT CTTTCTTGTG AGAGGGGGAT
26601 TCCGCGACTG GTGCTTACTG CTGACAGTAT GCTTGTCCAG ATCAAGAAAG
26651 TGAATCTACC TACTGTAGAA GAATTGCGGA CTCTTCAGGG AACAACGGAA
26701 TCTTCGTCTG ATCCTAGGGT TGAAGAAAGT TTGTCTTGCT GTGAAAGATT
26751 GCTCAATGAA TTACGTCGTC TTTGGGCGAA TTTTGTAGGT TTTATTTCGA
26801 GTTGCTATGA CPtACATCGTG TTTGTTTTGA TGTGGATAGT GAGACGGATT
26851 AACCTTTTGC CTGGGCTGGG GTGTTTGCCT TTCCATAATC CCGATGCTTC
26901 TCAAGAAGAC CAGAGGTCTT CTTCCGGAGA GCGTTCTACA AGGAGAGAAC
20


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
26951 GCCTTTCTCG GCGATCTGAC TTATCTGAGG AAGAGATGAT TGTGAGAGCT
27001 GAGGGAGAGT CTATACATCC TGAATCTCCC CATGGAGATG GCCGTAACCA
27051 ACCTAGTCGA GGTGATAAGC AAGACTCTGA TAGTGAGGAA GAGACGGAGT
27101 TATAATAAAG AATGATCTCT TCTGAGTAGA AACTATCGAA TCAGAATTGC
27151 CAGCTTTTAC TCTCACCCAG TCCTATTTTT ATAGTTCAAA TTGCGGTGGT
27201 TTGTTTATTG TTTTTTTGAA TTTTTGTTAT ATGGTGGCGT CTAATATTGC
27251 CGATTAAGGA GGGCGCCCTT TATGAATAGA AGAAAAGCAA GATGGGTAGT
27301 GGCATTGTTC GCAATGACGG CGCTCATTTC TGTTGGGTGT TGTCCTTGGT
27351 CACAAGCGAA ATCAAGATGT TCTATTGATA AGTATATTCC TGTAGTCAAT
27401 CGTTTACTAG AAGTTTGTGG ACTTCCTGAA GCTGAGAATG TTGAGGATTT
27451 AATCGAGTCC TCGTCTGCTT GGGTACTGAC TCCTGAAGAA CGTTTTTCTG
27501 GAGAGTTAGT CTCTATCTGT CAGGTTAAAG ATGAGCATGC TTTCTATAAC
27551 GATTTGTCTT TATTACATAT GACTCAGGCT GTGCCTTCGT ATTCTGCAAC
27601 GTATGATTGT GCTGTAGTTT TTGGCGGGCC TTTGCCAGCG CTACGTCAGC
27651 GCTTAGATTT TTTGGTGCGA GAGTGGCAGC GTGGCGTGCG CTTTAAGAAA
27701 ATCGTTTTTC TATGTGGAGA GCGAGGGCGC TATCAGTCTA TTGAAGAACA
27751 AGAGCATTTC TTTGATTCTC GGTACAATCC TTTCCCTACT GAAGAGAACT
27801 GGGAATCTGG TAACCGAGTT ACTCCCTCTT CTGAAGAAGA GATTGCCAAA
27851 TTTGTTTGGA TGCAAATGCT TTTACCTAGA GCATGGCGAG ATAGTACTTC
27901 AGGAGTCAGA GTGACATTTC TTCTAGCAAA GCCAGAGGAA AATCGTGTGG
27951 TTGCGAATCG TAAGGACACC TTACTTTTAT TCCGTTCTTA TCAAGAAGCG
28001 TTTCCGGGAC GCGTGTTATT TGTAAGTAGT CAACCCTTTA TCGGTTTAGA
28051 TGCTTGCAGG'GTCGGGCAGT TTTTCAAAGG GGAAAGCTAT GATCTTGCTG
28101 GACCTGGATT TGCTCAAGGA GTCTTGAAGT ATCATTGGGC TCCAAGGATT
28151 TGTCTACATA CTTTAGCGGA ATGGTTAAAG GAAACGAACG GCTGCTTAAA
28201 TATTTCAGAG GGTTGTTTTG GATGAT'IrCAT GGATCTTAGA GGTTAAAGTC
28251 ACTCCAAAAG CCAAAGAGAA CAAAATTGTA GGCTTTGATG GACAAGCTTT
21


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
28301 GAAGGTCCGT GTTACCGAAC CCCCAGAAAA GGGTAAGGCC AATGATGCTG
28351 TAATTTCTTT ATTAGCAAAA GCTTTATCCT TACCGAAGCG TGATGTCACT
28401 TTAATTGCAG GAGAAACTTC TCGAAAGAAA AAGTTTCTTC TTCCTAACAG
28451 AGTTCAAGAC ATTATTTTTT CTTTGCATAT AGACGTATAG CCTAACTCAC
28501 AGCTACGTTT TTCCCTTTCT CAGATTTTTC CAATTTTTTG CTTTTAAAAG
28551 ATAAGAACTG GTTGCGTTCT GTTTTATGAA GCTTGATTCC TAAGTACTTG
28601 ATGATATCTT CATTAAAGGT GGTTGTAGGT GACAGGCGTT GAGCAATGAT
28651 TTTACGCAGG CTGTCCACAT CGTGATTGTT ATAGAGTAGG TGGTGCACAA
28701 TTTTTGCGAT TTGTTTTCCT GATTTTTTGT AATCCACGCT ACAGGCAATG
28751 CAGGCTCCTT CGGAAATTAA GGAGGTATCG TCGGTAATGA TAGGGATTTT
26801 CTCTTTGAGG ATTTCCTGAA GGAATGCGGT GCCTTCTTTA TGAGAAAGTG
28851 GGGAGAGGGG AATGAAGATA GCTGAGGGGC GCTTGTCGAT AGCCTGGCGT
28901 ATCCGGGTTT TGAATGTACT GCTTGTAATA GAGATCTCAA TGACCTCAAT
28951 TCCTGAAGCA TGGAGTTTCT TAACAATTTC TTTTTGGAGA TCTGAGGGGA
29001 AAGGTTCGGA GGGTTTTAAA TACACGATAG ATTGTGCATT GGTAGCTACG
29051 GCTTGTATAG CAAAGCAGTA TTGATTGATG TCTAGAGTGT CATTCACTCC
29101 GTAGATATTC ATTGTGTTTT TAGGAAGGGT TAGGCTTTCG CGATCAGGAA
29151 CAGCGGCATA GATCACAGGT TTCTGTGTTT CAATGTGGCT CATGACCTTC
29201 GTAGCAATAG TTCCTAAGGT GACAATCGCC ACGACATTTT TATCGGTATG
29251 TAAGGAGCGA GCAATTTTCC TAGCCTTTAC GATACTGTCT TCAGCATTTA
29301 GGACAACAAT TTCAGGAAGG TTCTCAAAAT CTTTCAAGGT TTCTATACAG
29351 CTTTTACTGC AATCTTCTAA TAGGGGATGG GGAAAGGATA AGAAAATTGC
29401 GATTTTAGGA GAGGAGACGC TATCTGGTTG AGAACCACAA GTGGCTACAT
29451 AGATGAAAGA GCAAAACAGA GAAAAGAAAA ATAAGTACTG AGAGAGTTTA
29501 CGTAGAATTG TCATGCAAGA ACCATCGTTT CTTTTAGTGA AGCGGTACAG
29551 AGGCGGTCAC AGGCTAAAGC GATATTTTGT GGTTGTGTCA GAGCGGAGAA
29601 ACGAACAAAT CCTTGTCCAC AGGAACCAAA ACCGTGGCCG GGAGTCACTG
22


CA 02350775 2001-05-11
WO OO/Z7994 PCTNS99/269Z3
G'J071 I.HHIH1Vt11t1 l.1Vt11V1MV MVlltat~t~.ean saV,.V~.rm.aal. t~rv.ramava>rr
29701 CCTTCAGGGA GTTCTACCCA AAGGTAAGGG GCATGATCGC CACCATGAAC
29751 TGAGAATCCT GCAGTTTCTA AGCTTTTTTT AAGTTTCTGA GCATTGGTTA
29801 GATATAAAGA GATGGCGGGA GGTGTCGGAA ATAAATCTAG GCCGTAATAC
29851 CCTGCTTCTT GCATGAGGAG AGATGCTCCG TTAAATGTAG TCGCAAAGAG
29901 CCGTTTCCAA TCGTTGATCA TAGGTTCGTT ATTGTCATAG GTGAGTTCTT
29951 TAGGGATCAC GTTCCAGGCA AGGCGCATGC CAGTAAAGCC TAATGATTTA
30001 GAGAAAGAGT TGATTTCTAT AGCACAATAT TTTGCTTCAG GGATTTCGAA
30051 GATGCTTTTA GGTAGGCTAG GATCTGAGAC AAAGGCGCTA TAGGCCGCAT
30101 CAAAAATAAG AACGGTTCCG TGCTGATTCG CGTAGTTCAC AAGTGCTTGG
30151 AGTTGTTGAA AGGTTAGAAC TGTTCCTGTG GGGTTGTTAG GATAGCATAG
30201 ACAAAGAATG TCTAGGGATT GTTGGTTCGG AAGTTCTGGA ATAAACCCAG
30251 TTTCTTTTCT GCATGCTAGG GGGATAATGT CGCGGATTCC TGTAATGTGG
30301 GCAATGTCTC TATAAGCTGG ATAGACAGGA TCCTGTAGAC CTAGAGTCTT
30351 TTCTGAGCCA AAAAAAGAAA AGAGACGGAA GATATCAGGT TTGGCACCAT
30401 CCGAAATAAA AATCTCTTCA GGGGAGATTC TATTTTCATA GACTTCAGAG
30951 GCAATTTTTG TGCGTAATTT TTCTAATCCG GTTTCTGGGC CGTACCCACG
30501 ATAGGTCTCT TGTTTCTCTT GAGAAACGCA GAACTCTTTG ATTGCCTGAG
30551 TAATAGAGCG GCAGAGAGGT TGTGTCGTAT CTCCGATAGA AAGATCTATG
30603 ACAGAGATTT CTGGATTCTC CTTGCGAAAC TGAGCAAGCT TTTTACTAAT
30651 TTCAGAAAAT AGATACTGAG GCTTGAGAAG AGAAAAGTGG GGATTTCTAC
30701 GCATAGCGTG CTCCAGGTAA GGGCGTGATT CACTTGGAAG ACATCTTAAG
30751 AAAGCCCCAG CTTTTTGGAT AGCCATTTTC TGATTTTTCT TTAACCGCCT
30801 CTATAAAGGC AAGGAGTGCT CTGTAGTTTC GGTTAATTAT TAAGATATAT
30851 TTATAATTCT GTTTATTAAA AGTTTTTTAA ATCTTTTCTA ATTTGCTCAC
30901 TATAATTAAA GGATAAGATT TGAAAAAATT TTTTAGGTAA TTATGATAAG
30951 GGTCAATCCT TATGGAAGTT ATAGGGGTAG GAATCCTTCT CCAGAAGATG
23


CA 02350775 2001-05-11
WO 00/27994 PCTlUS99/26923
31001 GGAAAAAGGA TGTACCCCTT TCAGGGAACT CTCGCTTGCA TCGTCGTGGT
31051 GGGATTCGTA GAAAGCATAA GAGTGCTTCA GTTGGGGTGA CCTCGGGTTC
31101 TAAGACGGGG AAAGCTTCTT TAGAGAAGAA GGTCAAAGGC ATTTCAGAAG
31151 CCCATTTCAA ATAATCCAAG ACAGAAGGTT CTCATTCGAA GACAAGCAAA
31201 GGATTCGTTG GCAGATTTGT TCAATGGATT AGAACGTTTA CAGGACGTGG
31251 AAGCAAGAAG CGTTCTCCCT CAAGTTTTTC TCCAACGCAC CCTTACATAC
31301 GTTTGCGAAC TTACACACGC AGTCCAAAAC AGAGTGGTGT AGAGAGAAAA
31351 CAAGAAGATG CTGAGACCTC ATTTATAGAG ACACCCAAAG GGATCTTGAA
31401 AAAGCCTGGA AACAAAGACC CCAAAGGCAA GCACGTCCAT TGGAAAGACA
31451 GCTAATCCGG ATCAGAGCTG AGATCCCTCC TTCTCTTCAT TAAGAGGGTT
31501 CAGAGTTTTA GGTCTCTGCG GTAGAAGGGA GGGAATCCAT AAGGAGAAGA
31551 CCAGTGGATT TTCAACGGAG ATTCTAGAAG GAAACAAGCT GATTCTACAT
31601 GGAGATGTTT GTAGGTCTGT TGTGAATTCC TCTCTTCACA GATCGGAAAG
31651 ATCAATGAAA AGAGTTAGGG TCTCTTAGTT TTAAGAATTG CTATACAATA
31701 TTGCGAAATA AATACTTTCT TCTCTATACA CTGTCTTTAC GATGAAGACA
31751 GCTTTTCACT CTTGCTATTC TTGGTTTTGT TGGCTCTTTA GCTTCTTGGT
31801 ACTCTTTGTG GGTGGCATCG CTGGGGGAGA GCCTTTGTGC CCCGATTGCA
31851 AATACGAAAC TAAGTCTGTT TTACGTTCGG ATCAGCTGCC GGATCATCTC
31901 TGGAACTATG AAAACGACTG TTATCTTACA GGTTATGTGC AGTCTCTTTT
31951 GGACATGCAT TTTTTAGATA GCCGTACGCA AGTTGTTATT GAGAAGAATA
32001 GAGCGTATCT TTTCTCTTTG CCTGTAGATT CGAGTTTATC AGAAGCCATT
32051 ACCAACTTTG TTAGGGATCT TCCCTTCATA TGTGCTGTGG AGATTTGCGA
32101 GCGTCCTTAT GGTGAATGCA TAACGAGATC TTCTGCGGAG CGTCCCTTAC
32151 TCCCTAAAGA GAAAACTTTA GGAATGCCAA TTTTCTGCGG CAAAGAAGGG
32201 GTATGGTTAC CTCAAAATAC CATTTTGTTT TCTCCTTTGA TTGCAGATCC
32251 TCGTCAGGTT ACCAACAGTG CTGGCATTCG TTTTAATGAG AAGGTCGTGG
32301 GGAATCGGGT AGGTGCTACC ATCTTTGGGG GAGATTTTAT TCTCCTGCGT
2A


CA 02350775 2001-05-11
WO 00/Z7994 PCT/US99/26923
32351 CTTTTTGATG TTTCTCGATT CCATGTAGAT TGTGATTTCG GAATTCAAGG
32401 AGGAGTCTTC TCAGTTTTTG ATTTAGATCA TCCTGAATCG TGCATGGTAA
32451 ATTCAGATTT CTTTGTTGCC GGACTCTGGT CAGGGGCTAT AGATAAATGG
32501 AGTTTTAGGT TTCGATTGTG GCACCTCTCG TCCCATTTAG GAGATGAGTT
32551 TATTCTTACG CATCCAAATT TCCCAAGATT TAATTTGAGT GATGAGGGCG
32601 TCGATCTCTT CATTTCGTTT CGTTACACAC CACAGATCCG CTTGTATGGC
32651 GGCTGCGGTT ATATTGTAAG TAGGGATCTT ACTTTTCCTG AGCGGCCGTT
32701 TTACTGTGAA TGGGGTGCGG AACTCAGACC TTTTGGTCTG AGAGAAGGAA
32751 ATCTCCACGC ACAACCGATT TTCGCGATGC ATTTCCGTTG TTGGGAAGAA
32801 CAGAAATTTG GCTTGGATCA AAGCTATATT TTAGGCATGG AGTGGGCCAA
32851 ATTTCAAGAA ATCGGAAGGA AAATCCGTGC TGTTTTAGAA TATCATCAGG
32901 GATTTTCTAA AGAAGGCCAA TTCATTCGTG AACCGTGTAA TTACTACGGT
32951 TTCCGTCTTA CCTATGGATT CTAAACGATA ACAAAACCAT CTTCAGGGAC
33001 AGGGTGTTCA GGTCCTATGG GCAGCGTATG ATTGAGATAT CTGCGGAAGA
33051 TTGCAATGCC TGCCTTTGCT GAGGATAGGC AGAATAGGCA GTTGTGTACC
33101 CATTCAGAAC CTCGAATGGT TCCTACAGCA TGATTGGAAT TATACAAAGC
33151 TGTGATCTTG GATTTCCAAT AGTCTACAGG TCCGATTAGG AAGACGGGAA
33201 CAAGAGCTTT TTTCCCTGTT TTGAGACTAA TAAGCTCCAG AAGGAGTTCG
33251 AAATCGGTTC CCATGCCTCC GATAACAAAT ACAGCAAGGT CGACATGGAA
33301 GTCGGCCTGA CGTTCTAAAA GATCAGGAAT AGCATAGCTC ATTTTAGCTT
33351 CTACATAGGC ATTCGTGGTA TCCAAGCTAA TTAGATTCCC ACAAGAGAGT
33401 ATGGAGAGTT CTGTAGCTAC ACGATTCGCG AGTTCCATAG CTCCAGAACC
33451 CCCTCCTGTA AGGATTGCTA ACGGTGTCTG TGGTGGAAAT TCTGGGATCG
33501 TGAATTGCTG AGAAAGAGTA TGCATTCCTG TCAGGAGCTC ACGGAGAAAC
33551 TCATCATAAT CCCCAGCGAT TAGGCAGGAA CCATGAATTC CTATAAAGTA
33601 GGATTGAGCA AACTGTTCAG CTTGATGTTT AGGGACAAAC ATGCCCACAT
33651 CTTTATTTCT GCGTTTGATG TATTGTAAGA GTCGTTTCGA TTCTAAGTCT
25

CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
33701 GCCCAAAATA CAGAAATTCC TGCAAAATAT AGATCGAGAA GGAAAGAGCG
33751 ATCTCGATTC GAGAAAAACT CTCCAGAAGT GGGAGAGGGA ATCTGAAAAT
33801 AGATATGTTG CAGGTAATAG CGAGAGTAGT TAGAGAGGAA CATGCCCTTC
33851 AGCGATGCTG AAGGGAAGTA GCGGGAAAAT AAAACTCCTT GGCTTGTGAT
33901 ATGATCTGTT TCCATGGCTT TTAAAAAAGG GAAACAAGGT TGGTCTTCAA '
33951 TGTGCTTTTG AATTTCCCTA GCATGTCTTT CATCTGATGG GGAGATTCGA
34001 GGTTTGATGA TCCAAGAGTC TTGGGAGAGC TCAAGCAGCT CACTACCTTT
34051 GGAGATAAAC ATCGCAGCTT GATCTTCGCC TTCCGGTATG GATTCAAAAA
34101 CACGAAATAC CTCTTGAGGA GATTCTAAGG TTTCCTGGAG CATATCTCTA
34151 TAGAAGAAAA ACGAATGCTC TTTGTAAGGC TCAAGAGTAA AAAATTCTAA
34201 AGGTATTCTC TCAATAGGTT CTGAAGTGCT GCCGTAGAAT TCATAAATAT
39251 CTCCAGATTC TTGTGTGGTA GGTTCGAGAA TATCCGCTGC GGTGTGACGA
34301 AGCCCTTGGG GGAGTAAGTC CTGAACGACT CTTGCAAATA CGGTTCGGAT
34351 GTGCAGAGGC TCTGTCTTTA TGAGAAGAAT TTTATGATCT TCGGGAACGG
34401 GAGGACGATC TGTTACCATT TGATACAAAG GAAGAAACTT ACGTATTTTT
34451 AAATGGGGAC GCGTGAGTGA TTTGCTCATT AAGGGAAGGA ACCCATAAAT
34501 TGTCTCTTCG TAACAGATTG TTCCTGGAAG GATCGGAAGG AAGACAACAA
39551 GCCGATCATT AATGATCTCT AGAGTGATGA AGTGCTCAAG TTTTTTCCCA
34601 AAGCGTAGGA GCGGAGATCC TGTACGGTCT GTGTGCGTAA ACATCCTGTT
34651 GAGATAACAA GGCGAACGTA CGAGTCGGCG ATCATCAGCA GCAAAGAGCT
34701 TGCAGACAAA ACTTCCAGGC TCTAGGAGCT CCAACATAGC AGTGGCTATA
34751 GGATCTTGGC TCATGAAGAG AACGTGTAGA CGAGCTTCTT TTCGGGCTTT
34801 ATTTAGCTCC AAGTGGTTTA AAACGGCTTC GACACCTAGT TGGGCTAAGG
34851 AACTTTTTAA ATTTACTTGT ATACACTGTT GAGGCAGATG AAATCCAAGA
39901 AAGTACGCAG GAATATTCTC AATGAGGACC TCTCCTTCGT AAATGTGGGG
34951 CGAGAGTTTT TTCAAATGGG AAACGAGTCG TCCGTCTGGG GAGGCTGCAT
35001 CATGATGCGC GTGGAGTAGG TTATACATAA TTCATCGTAT TTTAAAGTTA
26


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/2b923
35051 AAAGCAATAG GTAAGGTCCC TCTTTGCGCT TCTAAGACTA GCGTTTCCAG
35101 AAGAATTTCC TACAAGGACT TCATTCTGCT TGTTGTTATA ATTTTGTGCA
35151 ATTTGTCAAG GAGGGGGTCT TGGAACGAGA AAATTGTGTT GTGGAAGGGA
35201 ATTTTAGGAA AAAAGTTTCT TAGAAAATAG GGCAGAAACT TTCTCTTGGA
35251 GAGATTGAGT AAAATGTAAA GAATAGTCTT TGCAATTGAG AATTATTTCT
35301 CTCTGGTTAC AATGGAGGAT TGGCTAAGGA GGATAGTAGG TATGCAGATT
35351 CCAAGAAGCA TTGGTACTCA CGATGGTTCT TTCCATGCGG ATGAGGTCAC
35901 AGCGTGTGCT CTCCTTATTA TTTTCGATCT TGTGGATGAA AATAAAATTA
35951 TACGCTCTCG AGATCCTGTC GTATTATCGA AATGTGAATA TGTTTGTGAT
35501 GTCGGTGGTG TTTATTCTAT AGAAAACAAG CGTTTTGATC ATCATCAAGT
35551 CTCTTATGAT GGATCTTGGA GTAGTGCAGG TATGATTCTG CATTATCTTA
35601 AAGAGTTTGG TTATATGGAT TGTGAAGAAT ATCATTTCCT TAACAACACT
35651 TTGGTACATG GTGTGGATGA ACAAGATAAT GGCAGATTCT TCTCTAAGGA
35701 GGGATTTTGT TCGTTTTCTG ATATTATTAA AATTTATAAT CCTCGCGAGG
35751 AAGAAGAAAC TAATTCGGAT GCGGATTTTT CTTGTGCTTT GCATTTTACC
35801 ATCGACTTTT TGTGTCGGCT AAGGAAGAAG TTTCAGTATG ATCGAGTTTG
35851 TAGGGGGATT GTCAGAGAAG CCATGGAAAC CGAGGATATG TGTTTATATT
35901 TTGATCGTCC TTTAGCATGG CAAGAAAATT TCTTTTTTTT AGGGGGAGAG
35951 AAGCACCCTG CAGCTTTTGT TTGTTTTCCT TCCTGCGATC AATGGATTTT
36001 ACGAGGGATT CCTCCGAATT TAGATCGCCG TATGGACGTT CGTGTTCCTT
36051 TCCCTGAGAA TTGGGCAGGT TTGTTAGGTA AAGAGTTGTC CAAAGTATCA
36101 GGGATTCCTG GGGCTGTGTT CTGCCATAAA GGTCTTTTCC TTTCTGTATG
36151 GACAAATAGA'GAAAGTTGCC AACGTGCTTT GCGGTTAACG TTACAAGATC
36201 GAGGGATCAT ATGACAGTAT TCAAACAAAT TATCGATGGA TTGATAGATT
36251 GTGAAAAGGT ATTTGAAAAC GAAAATTTCA TAGCTATAAA AGATCGTTTT
36301 CCTCAAGCTC CTGTTCATCT TCTTAT~ATT CCTAAAAAAC CTATACCACG
36351 ATTTCAGGAT ATCCCAGGGG ATGAGATGAT TTTAATGGCA GAGGCTGGAA
27


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
36401 AGATCGTGCA AGAGCTTGCT GCAGAATTTG GAATTGCCGA TGGGTATCGT
36451 GTGGTTATCA ACAACGGTGC TGAAGGAGGA CAGGCGGTAT TTCACTTACA
36501 TATTCATCTT TTAGGTGGGC GTCCTTTAGG TGCTATAGCC TAATTTTCTT
36551 TTGTTTCTGT GGATCCTTGT TCGGCTCGGA GTCCCTCCGT TATCAATTGT
36601 TGATCCAAGA TTTTGCAAAA GTTTCAGAAG AGGGCATAGG CCTTTTGGAG
36651 TCTAAAGAGT ATTCTTTACT TCAGGCTAAG CTAGTTTTAA GGGCTCTGGC
36701 TCAAAATTCT TCTTTTGATG ATTGGTTTAG AAGTTTTAAG AAGTGTCAGA
36751 TTTCCTATCC AGAGTTAGCT CATGATCGCG ATGTCTTAGA AGAATTTGGG
36801 ATTCAAGTTC TGCGTGAGGG AATCGAAAAT CCTTCCGTGA CCGTTCGTGC
36851 TGTGAGTGTC CTTGCTATTG GGCTTGCTAG AGATTTTCGC TTGGTCCCTC
36901 TCCTGCTCCA AAGTTGTAAT GATGACAGTG CTATTGTTCG ATCTTTGGCT
36951 CTTCAGGTTG CTGTGAACTA TGGCTCTGAA AGTTTAAAAA AGGCCATTGT
37001 AGAGCTTGCC CGTAATGATG ATTCTATTCA TGTTCGGATT ACAGCATATC
37051 AGGTGGTCGC TCTTTTACAG ATAGAGGAGC TATTGCCATT TTTAAGAGAG
37101 CGTGCTGAGA ACAAACTTGT AGATAGTGTA GAACGTCGAG AGGCGTGGAA
37151 GGCTTGCTTG GAACTCTCTT CTCAATTTCT AGAGACGGGT GTAGCTAAGG
37201 ACGATATTGA TCAAGCGTTG TTCACTTGTG AAGTGTTGCG TAACGGTATG
37251 TTGCCAGAGA CTACTGAGAT TTTTACAGAA CTCTTATCTG TAGAGCATCC
37301 TGAAGTGCAG GAGTCTCTCT TACTTTCTGC TTTAGCTTGG AGTCATCAGC
37351 TACAGAATCA CAAAGAGTTT CTTAGTAAAG TGCGCCATGT GATGTGCACT
37401 TCTCCATTTG CAAAAGTACG TTTTCAAGCT GCTGCACTTC TCCATCTGCA
37451 TGGAGACCCT TTGGGCAGAG ACTCTCTGGT TGAGGGCTTG CGCTCTCCTC
37501 AACCTCTTGT GTGTGAGGCA GCTTCGGCGG CTCTCTGCTC TTTAGGAATC
37551 CATGGAGTCC CTTTGGCAAA GGAGCATTTG GAGAGCCTTT CTTCTCGAAA
37601 GGCTGCTGCG AACCTCTCCA TTTTGCTTCT TGTGAGCCGT GAAGATATTG
37651 AAAGAGCTGG AGATGTGATT GCTCGCTACC TCTCCAATCC TGAAATGTGC
37701 TGGGCTATAG AGTATTTCTT ATGGGATGCA CAATGGAATT TACGTGGTGA
28


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
37751 TACCTTCCCT CTATATTCGG ATATGATTAA ACGTGAGATT GGTAGGAAGC
37801 TCATTCGCCT TTTGGCAGTA GCTCGCTATA GCCAAGCCAA GGCTGTAACA
37851 GCAACGTTCC TTTCAGGACA GCAAGCTCAG GGATGGAGCT TTTTTTCTGG
37901 AATGTTCTGG GAAGAGGGAG ATGTGAAAAC TTCTGAGGAT TTGGTTACAG
37951 ATGCTTGCTT TGCAGCAAAG TTGGAAGGAG CGTTAGCCTC GCTATGTCAG
38001 AAAAAAGATC AAGCTTCCCT ACAGAGGGTC TCTCAACTTT ATAATGACAG
38051 CCGTTGGCAA GATAAATTAG CAATCTTAGA GAGCGTTGCT TTTTCTGAGA
38101 ATCTTGATGC TGTGCCTTTT CTTCTAGACT GCTGCCATCA CGAAGCTCCT
38151 TCGCTGCGAA GTGCAGCAGC GGGTGCTCTT TTCTCTATTT TCAAATAAAT
38201 ATTAATAAAA TTATTCAAGA TATAGAAGAA AAACCACGCA GTTGTAATTT
38251 CTATTTCTTA AAAATAATTT TTCAGACTGA CTTTATTCTT TCATTTTTAA
38301 GTCTTTGGAG ATAGAAAACT TTGTTATAGA TTTTTATCTG GTAGCTTTTA
38351 TAATTTATGA AGAGCGTAAG CTCAGAGCCT GTATGTCATG CACAACCTGT
38401 ATATAAAATC AGATTTGTTT TTTGAATCTC TATTCTCGTT AAGATTTCGT
38451 TATTCTGGAC GTTATCTCCA TCACCACTCC TAATTTTCCT AGCATTTCTA
38501 TCTTTAAGCT CAGCACCGTA GCTTGCTTAA AGGAAATATT TTTCATTTAG
38551 GTTGTGGAGT TCTTTATTTT ATGAATTTTT CATTATTTTT ATTTTTCCTG
38601 ATAGCTATTC AGGGAATCTG CTTGTACGTG GGACGTCGTG GTAGCAAAAA
38651 GGTAGAAGAT CGCGAGAGCT ATTTTCTTGC AGGAAGGAGT TTAAAAATCT
38701 TTCCTTTGAT GATGACATTC ATTGCCACCC AAATCGGTGG CGGTGTACTT
38751 CTTGGGGCTG CTGAAGAGGC CTTCTGTTAT GGTTATGGGG GGATTCTTTA
38801 TCCTTTAGGA GTCGCTTTAG GGTTGATTTT CTTAGGAATG GGGCCCGGGA
38851 AGCGGTTGGC AGAGGGATCG TTAACGACCG TAGTCTCTAT CTTTGAAGTG
38901 TTTTATGGTT CTAAAAAGCT CCGTAAGATC GCATTTTTAT TATCCGCAGG
38951 TTCCTTATTT TTCATCCTGG TCGCTCAGGT GATTGCTTTA GATCGGTTGT
39001 TTAGCAGCTT CCCTTTTGGC AAGTACGTAA CCGTAGCATT TTGGATTGTC
39051 TTAGCATCCT ATACCTCAAC AGGAGGGTTT CGCGGGGTCG TACGTACTGA
29


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
39101 TGTGATCCAA GCAGGATTTC TTCTTATTGC GGTGCTCGTC TGTGGTGTTT
39151 CTGTATGGCT CTCTGTCCCT AAATCCTTGT CTGTGTTGGA TCCTTTCCAA
39201 TCACTTCCTT GTGCGAAGCT TTCCAATTGG ATATTCATGC CTATGCTCTT
39251 TATGCTTGTT GAGCAGGATA TGGTGCAAAG GTGTGTGGCT GCCTCCTCTC
39301 CAAAACGCTT GCAATGGGCG GCTGTAGGCG CAGGCCTTGT TCTTCTTCTT
39351 TTTAACTTTA TCCCTTTATT TTTAGGTTCT TTAGGAGCTA AAGCAGGCCT
39901 TAAAGCAGGA TGCCCTCTGA TTGATACCAT TGCATATTTT TGCAATCCCT
39451 CACTAGCAGC TGTGATGGCT GCTGCCATCG GCGTTGCGAT TCTCTCTACC
39501 GCGGACTCTC TTATGAATGC TGTAAGCCAG CTAATCGCTG AAGAATACCC
39551 TACGTTGAAA GCCCCTTATT ATCGTTATTT AGTATTGGGT TTGGCGGTTG
39601 CAGCTCCTCT TGTTGCTATT GGTTTTACAA ACATCGTAGA TGTCTTGATT
39651 TTAAGCTATA GCCTGTCAGT GTGTTGTCTT TCAGTCCCTG TGGGTTTCTA
39701 TCTTCTAGCT CCTAAAGGTC GCCGTGTGAG CGGAGCTGCT GCTTGGGCAG
39751 GAGTGCTCGT TGGTGCTCTG GGCTATGGAT GGGTTCAGAT AGTCTCTTTG
39801 GGGATGTTTG GGGAGCTATT GGCTTGGGTA GGTTCTCTAG TCGCCTTTTC
39851 CTTTGTAGGA TTTATTGAGA TCACTTGGAA AAACAAAGTC AAAACGCAAA
39901 CTTAGATAAC CACTGCATGA GAAGATATAA CTAAAATAGA TCCTGAGTTG
39951 TTTAGGTTTC TCTTAGATCT GATAGGTTGC GCTTAGTAAG AGATCGTCAG
40001 TTTTTTAAGT TGTGTTTAGA ATCTGATACC TCTCCTTCTT TTCCAAGAAG
40051 AAGAGGGGTT CGTTTTATTT TTTATTATTC ATTCGTAGGG GCGGGAAGCT
40101 GTTTTAACTG ATAGAGCAGG TCGATAAGAG AGGAGAGCAG GGCTTGATAG
40151 AGCGTCTCTT CAGAATGGTG AAGAGTGCCC TTAAGAAGCT TTCTGCAATA
40201 CTTTAATAGC CAATACAGAG CGCAAAGTAA ACAC.TGTAAA AGGTAAAATT
40251 TACACGCTTG TTTTATCATA AACCTCCAAA GAAAGACGCG TGTTGGCGTT
40301 CTTCCAATTG GCTGTTAGAT AGTGAAAATA GTATTTACTA TTCAATAAAA
40351 ATATTTATAT TAAATATAAT AAAACAAAAT TTCTAATAAA CTTTTTAAAA
40401 GTATTTGGCT AAGTTTAATT TAAGAGTTCT CAAATAAAAG ATTTTTTAGT
30


CA 02350775 2001-05-11
WO 00/Z7994 PCT/US99/26923
40451 CTCTTTTATT TGAGAATCAT CAACCGATTT CAAGAAATCG CATGAGCCCG
40501 TGATTCTGAT GGGCAGAAGT GCTTTTTAGC AAATAGGTGA AGCTCCTCGG
40551 GCGTGATCTG TTGCTTCGCG TCACAAAGAG CCTTTTCAGG GTGTGCATGC
40601 ACTTCGATCA TCAGACCGTC GGCACCTACC GAGAGACCAG CAGAGGCGAG
40651 AGGAAGAACT AGAGAACGCT TCCCCGCTGC GTGGGAAGGA TCTACAATTA
40701 CAGGGAGAGA AGAGATCTCT TTAAGGAGAG CCACGGTATT GAGATCTAGC
40751 GTGTAGCGCG TAGAGTGCTC AAAGGTACGA ATTCCTCGTT CACAAAGGAT
40801 TACCCCAGGA CAGGAGGGAG AAGAAGCAAG GATGTACTCC GCTGCGCATA
40851 GCCACTCTTC AAGAGTAGCT GCTGGACTGC GTTTTAGGAT AATCGGACGA
40901 TGTGATTTGC TGACCTCTTG TAAAAGAGGG GTGTTATGCA TGTTTTTGGC
40951 TCCGATACGG AGGATATCCA CATGTTCGGC AGTAATTTCA ACATCTCGGA
41001 CATCTAAAAC TTCGGTTTCT GTAGGGAGAC CATGGATGCT CTGTGCTTCC
41051 TTATGCCAAA GCACACACTC TTTCTCCCAT CCTTGAAACG AAAATGGGCT
41101 TGTCCGTGGT TTTCTGATTG ATCCTCGGAA TACCTGAGCT CCTGCTTCTT
41151 TAACTGTAAG AGCTGAAGAG ACTGTATGCT CGTAACTTTC TAAGGTGCAG
41201 GGGCCTGCGA TCAGTATTGG CGATCCTTCT CCAAACGATA GATTTGGAGA
41251 AATAGGAACG GTATGGACCT CGTCAGGATG CTGTTTGAGG GTGCGCGGTA
41301 GGGGATAGGT AAACGTAAGA ATAAGTACCT CATGCAAAAC TAGGGATTTG
41351 AGCTGTCCGG TTTTGAGTTC TGATCTTCAG TTCTAGGGTT TCTAGGCAAG
41401 AGACAACCGT AGTGGTTGGG GAAGCGGATT AAGAAGATAC GAGCGTCTCT
91451 TTCCGGAGAA ATTGTATTTA AAAATCCATG CATCGCAACA AAGTTTTCAA
91501 TATTTTCTTC TGAGACAGTA TCTCGATCAG ACATGATCAG ACAAACAGCA
41551 AACGGCTGCA AAGGTACAGT AGTCATAGCA GCGACGTCGT TAGCCTCAGC
41601 TTGCTCTTTG ACGTCATCCA AGATATCACC GAGTTCATTA GAACTTAGGG
41651 AGTGGAGGTA GCCTATCAGT GATTGTCTAA AACCAGAGCG ACAGAAACCA
91701 TCAGATGCTG TGACCGTGTT TTTTAAGAGA TTACAAATCA ACTGATCTCG
41751 AGCTCTAGAG TATTGATTTA TAGTCGCTGT AGCTAGAGGG AAACCGTATT
31


CA 02350775 2001-05-11
WO 00/Z7994 PC'f/US99/26923
41801 TTAACGAAAG TAAAAGCTTA TCTTTTACTT GTGTAAATCT ATGAGAAGCC
41851 TCTCTTTGGA AAGCGTCTAA TCGAGCAGCT AGCGGAGTCT TTTTTTGTAG
41901 AAGTTTAGGG TAGTTGTTAA GGAAACAAAA GATTAAAAAA GATTTCGAAA
41951 ACTGATCACT TAAAGCTGTG GGAGAAGAAG GCGAACCCAG TTTTGATCCG
42002 AAGAGATTTA ACCGTGTTAA AAAAACGTTC CACCCATTCC TAAAGGTACT
42051 TTCGATTTCT TTTTCTAAGA GTATGGCCTC GGTTTCCAGT TGATAATTTG
42101 GGAAAGTAGC TCGTAGAAAC TGCATACAGT TCTTTTCAAG TATGGGATCT
42151 AAAACCATCA CTTCAGGATG TCGTGACCAG AACTTGCAGA ACGTCTCTTT
42201 ATCTTTATTA GAAGGGGTTG AAGGATAGCG TGCTGGAGGC TTGAGTACTA
42251 GAGACCCTAG AATGGAACGT AGAAGTTTTT GGAGTTTCTT TTCGTTTTCT
42301 ACTTTTTTAA AGGCATATTG TAGAAAAGGA AGTAGAAGCA TTTTTAACTC
42351 TTTAACGGCA TCTTTAGTTT CTGGACGATT ATTCACTTGA GGATGCTGTA
42401 AAATAAAGAA TAAGAGCTGT ACTGCCTGTG TATAGAGGAG GTCCTGCTCT
42451 TGTGAGAGAC CACTGCCTTC ATCACGCGTT AAACGTAAAC TTCTTGCCGA
42501 GCAGACAGTA TTGAAGTGCT CTTTTAATTC AAGCTTACTA TGAATTGCGA
42551 CTAATAGCTT GTTTACAAGA TTTGAAAGTA AATCCGGCTG ATTGCTTAAA
42601 GGAGGGGTTT CAAAGGATGT GGATATTACA TTCTTTAATA GGTTTGCTAT
42651 GCGTTTATGT AGTGTTTCGA GTTGAGGCAT AGAGCTCTCT AACGTCAAAC
92701 TCTCATGAGT CGTAAGGATA GTCATGATGT TAGAAGAAAT CACCTCTCGT
42751 TGCCAAGAAG AATCTAAATC TTCATTAAGG TAGCTGAGGG AGGCGTTACT
42801 TATTTGTAAG ATCTCTTGTA TAGAGCGTTG GGGGGGCGCG GACGCTATAA
42851 TGATATCCTC TAATGATTTG TAGAAAGTAG CGCAGAGTTT GTGAAATCTT
42901 ACAGAGGGGC AGGTACATAG AAAAGCAAAG AAAGAAAGAG CTGGAGACCA
42951 AGGGAGATCC CCAGATCGTC TGTTTTCTAT AGTTGCAAGA AAGCGAGAGT
43001 ATTGACATCG GATTGCTTCA GGGAGGTGAT CTTTGACTAT AGCATTGAAT
43051 CTCTCAGTAT CCCACCCTGA ATCAGCAATG CGTCGACTCA GCTTAGCAAA
43101 AGCCTCTCGT ATTTCTCTTT CATATTGCTT TCGAAGTTGT TGATCTTCTG
32

CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
43151 GAGACATCTT TTGAAGCTCT GCATCAGTAA GAAAGAGAGC GGGCAGATGT
43201 TCCAAGAGAA GAAAGAGTTT GTCTCTAGAT TGCATGTAAG GAAGTGCGGA
43251 ATGATCAATG AGATTTTGGA ACAATTCCGA ATAGGGACGA TTTGCAGCAA
43301 GAAATTCCAG AACACTGGGG AGGAGATCGT CCTGCATATC AGAAATAAAT
43351 AAGGCTCTCG CTGTTTCTTC ATTACTTGAA GCTGCGATTT GTTGGCGAAT
43401 CGCATATGCA GAGAGTTTTC TTAGAAAGGC TATCAGAGTT GCAGTGTGTT
43451 TCTGAGAAAG AATCACCCCG TCATAGAGGT CTATGAAGGA GCAATAAGTA
43501 CTGCATAACT GAAGGAGTTC AGCCATTTCT GCACAAAGAT TCGCATTTGC
93551 CGGAGAAGAG GAGGCGAAAG GCAGGTCAAG GAGACGTGTG GCTTCCTGCT
43601 CAAAGACAAT ATCATTTCTG CTGCTCTCTT CGTAGAGAGC AGATAGCCAT
43651 CCTACAGCAT AGGCACGATA AAAGCAGTTC CCATCTCCCG GTACATTCAC
43701 AAGGTAGTAA TTGTCATTTA GATAGAGAGC CTGTTCAAGA GAGAGTTGCG
43751 CAAGTCGCCG GTGTTGTTGA GGAAGATCCG GATTTTGTGC GATTTTCTTG
43801 AACTGTTTTA TTTGGAAATA CATCGGTTCG TTATCAATCC GATTGGGATA
43851 GGAAGCTACG AAATGAGGGT TAAAGTCGCC AAGTATTGGA TCAACTTGCA
43901 TGGCAGGAAG AGGCAGAGGC GCTCGTCTTA GTACCATGCT CACCATGCGA
43951 TTTAAGCATT GCCAATAGCC TTGACTAGAT GGGCGGAGAG GCATGGGGGT
44001 AGCCACCCGT ACGGGAGGAG GAGGGGCCTC TGGTGGTGGA GTCGGCTTTT
44051 TATCAGCAGG TTGTTTAGGG ACTTTTGGGC TCGCTGGTGA AGGAGCTTTG
44101 GGGGGAGGCG GGGGTGTGTC CTCTGGGGGC GGCGTGCCCG GCTTGGGAAC
49151 ATCGGGTTTT TTGTCTTCAC CATCCTTAGG CGGTTGTTTG GCAATTTCTA
44201 TAGTTTTTGG CTCTGGTCCT TTGGGAAGAG TGGGAGGCGT TGGCAAGCCT
44251 TCTTTTCTGA.CAACCCGATG ATGCTTGTAG TAGTGAATCA GAAGAAGCAA
4430'1 ACCAAGAGTA ATGATATGGA GCAGAACGTA TCCTATGGTA CGTAGAATTC
44351 TAAGTAACAG AGGGTCTTTA GTATCAGTCG TTAAGTGGTA AAAATTATTC
44901 TTGTTATTGG GCGGGCAATG TGGTGGGGAA TTGGATACAT ACGTTCAAAA
44451 ATTGCTCGTT TTTTAATCAA AATTATTCAA AGTTAAAACT TTTTCGAGTT
33


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/2b923
44501 TGATGTATTT ATTTTTTTAT GTAAACTTTA ACACATGATA GAATTTAGCG
44551 TATAGAGCGC AAACTTTCAT GATAAAACAA ATAGGCCGTT TTTTTAGAGC
49601 ATTTATTTTT ATAATGCCTT TATCTTTAAC AAGTTGTGAG TCTAAAATCG
44651 ATCGAAATCG CATCTGGATT GTAGGTACGA ATGCTACATA TCCTCCTTTT
44701 GAGTATGTGG ATGCTCAGGG GGAAGTTGTA GGTTTCGATA TAGATTTGGC
44751 AAAGGCAATT AGTGAAAAAC TTGGCAAGCA ATTGGAAGTT AGAGAATTCG
44801 CTTTCGATGC TTTAATTTTA AATTTAAAAA AACATCGTAT CGATGCAATT
44851 TTAGCAGGAA TGTCCATTAC TCCTTCGCGT CAGAAGGAAA TCGCCCTGCT
49901 TCCCTATTAT GGCGATGAGG TTCAAGAGCT GATGGTGGTT TCTAAGCGGT
44951 CTTTAGAGAC CCCTGTGCTT CCCCTAACAC AGTATTCTTC TGTTGCTGTT
45001 CAGACAGGAA CGTTTCAGGA GCATTATCTT TTATCTCAGC CCGGAATTTG
4505'1 TGTCCGTTCT TTTGATAGCA CCTTGGAGGT GATTATGGAA GTTCGTTATG
45101 GGAAATCTCC GGTTGCCGTT CTAGAACCCT CGGTAGGACG TGTCGTTCTT
45151 AAAGACTTCC CTAATCTTGT TGCAACAAGA TTAGAGCTCC CTCCTGAATG
45201 TTGGGTGTTG GGCTGTGGTC TCGGCGTAGC TAAAGATCGT CCTGAAGAAA
45251 TACAAACGAT TCAACAAGCG ATTACAGATT TAAAGAGCGA AGGGGTGATT
45301 CAATCTTTAA CCAAGAAATG GCAACTTTCT GAAGTTGCTT ACGAATAGAG
45351 GGTATTCTTA TGGCAACCTC TGTTCCTGTA ACTTCATCTA CTTCTGTAGG
45401 AGAGGCTAAC TCCTCCAACG AAAGATTTAC TGAACGAACA TCGCGAATGT
45451 ATTACGCAGC TTTAGTCCTA GGGGCTTTGA GCTGTTTAAT TTTTATTGCT
45501 ATGATTGTCA TTTTCCCACA GGTCGGATTG TGGGCTG'~GG TCCTCGGGTT
45551 TGCTCTTGGA TGTTTACTTT TAAGCTTAGC TATCGTTTTT GCTGTCTCCG
45601 GTCTCGTTTT AGGCAAGACT TTAGAACC'FA GTCGAGAAGC GACTCCTCCA
45651 GAAATTGTTG CGCAAAAGGA GTGGACTACA CAACAAGATG TCTTAGGGAA
45701 TGAGTATTGG CGTTCCGAGT TGATTTCCTT GTTCTTACGA GGGGATCTCC
45751 ACGAATCTCT GATTGTTGAT TCTAAGGATC GATCTTTAGA TATTGATCAG
45801 AGTTTACAAA ATATATTGAA ACTTGAGCCC CTATCTACGA CACTTTCGCT
34

CA 02350775 2001-05-11
WO 00/Z7994 PCT/U599/Z6923
45851 GTTAAAGAAA GATTGTGTCC ACATCAATAT CATTTTACAT TTAGTGAGAC
45901 AGTGGAACTT ACTGGGAGTG GATCTTAGTC CTGAAGTCAC TGCGCACGCC
45951 GAGGAACTTC TACTCTTTTT GATAGAAGAG CAGTATTACT CTCCTGATAT
46001 TTTGAAATTG ATTCGCTACG GAGATGCTTT ACAAGCAACG TCTCCTTTGA
46051 TGGATTGGGC AGATTCAGGT TCCTTTAGTG TAGACGCAGA CGGGGTATTT
46101 AGCTGTCGCA GAGAAGAATG TTCTCCTGAG GATGCTTTGG CGCAATTCGA
46151 TCTTCTTTTG GCGTTGGAAA ATCCCGACAG ACGCTTCTTA AAGGATTCTT
46201 TTCTTACCTA CATTTGGTCG TCTTCATTTT TTGAGAAGTT TTTACATCGC
46251 CATCTAGAGA GCTTGCAAAG AAAGCTCCCA GAGACAGCGA TCGATGTCGC
46301 CCGCTATGAA GCACAAATAC AAACATTTCT CTCTCGCTAT TTTCAGAAGC
96351 TCGATTTGAT AAACGCAATG TCCTTAGATT GGGGATATAA CTGTGCTGAG
96401 GGAGAAAAAT GTTATGAGAG CGCAAATCAA AGATTAGACA ACCTATTTAT
46451 TGCTTTTTCT TCTTCTGTTC CTGCTATGAA GCGGCTCTTT GACAAATATG
46501 GTTCTGTGGT ACGGGTAGAT CGTAGGCAGA TTCGTGAGCA GATTCTTTCG
96551 AACACTGAAA TCTTAGAAAA TGAGTCAGGG TTCCTCTGCA GTTTGTATGA
46601 ATATCCTTTA TCCTATTTGA TAGATTGGGC TGTTTTGCTA GACTGTGTTC
46651 GCGGTACCGA AATCTCTCTA GAAGATCAGG CCGATTACAC CGTTTGTTTG
46701 CAAGGCTTGG ATTCTATGTT ATCTCAATTT GCGAGTCGTT TACAGTCTGG
46751 ACAAAAAGTA TTGAATCCTA GAGATGTTTT AAGTGAACAG GCTGCGGTTA
46801 TGCTTGTTCA TGGCTTGGCA GCACAGGGCG TGTCGTTTCA AGGATTGAAA
46851 GCTTTGATGT ATTTGACAGC CGTTCCCCAA AGAATGTGGT TAGGAGCATT
46901 GCCTTTATTT GAATCTTTTC CTGTCTTTAA TCGGATGAAA GAATTTCTTG
46951 GGGAATCTCT GGGAGACTAG GTGAATTTGT ATCAAAGAAG GAACAAGATT
47001 GCATGTTAGG TTCTTTGCCA TGTTATCCTG GTGCTGGCAA TATTGAAGAA
47051 TACAAAAATA GGTATTTCTA TTGTCAGTTA TGTGCTGAGG TCGTTAGTCC
47101 CTATGTTGTT CCTGTTATTG TAGTTGATGT GCAAGGGGCT CCTCCTACAG
97151 GTATCTTGCA GGTCTTGCGT TGTAAGCAAC ATAAATTTCA AGGCCTACCC


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
4?201 GTACATGGCC CCATTACTTC TTTATGGGCT TTGGAGCCCG TGGGTAAGGG
47251 AGCTCCGCAG CTGGAGTCTG CAATGTACGA GCTCTGTTCT CAAGTAAGGA
47301 ATTTTGACAT CTGCTCTATT GTGAGTTGGG TCTTTGGTGG GTTGTGTATT
47351 TTTGCAGGTC TGATTGTCGG GGTAATGGTT GAAGCCCCTT TGATTGCGGG
47401 ATTAAGTGCT TGGGTGATTC CCTGTATCAT TGGAGGGGTT GGTGCCATTT
47451 TATGCTTGTT TGCGATCTTG ATGGCGTACT TGGGAAGAGG GAGAGTCCGT
47501 GAGTGGCTCA ATCTTTCACA CGAATATATA ACGCAATGTC ATTGTCGTCA
47551 GATACAGGCA CATTCTCAAA ACTATTCTGT GATCACAGAG TATCCTGCAA
47601 CCTGTGCATT ATCTCAACCG ATTACAAAGT TACCTAATGG ATCACGCAGA
47651 GATAACTAAG CGTGTTCGTC AGTTATTTCT CACATTTTCT CATGAATCTT
47701 TTACTGCGCT GCACGAGATC CCTCTCGAAA ATTTTTAAGG ATAGATACTT
47751 GGAAACTATG GTTTAAAAAG CTATAGAGGA TTCTAAATTG GGGTTCTAGC
97801 AACTTCTTGA CTTTAAGATC CAAAGTTAAG AGACTGACTA ATTATTTTTG
47851 TTTGCTTGTG TTTCCAGATG AGCAATTGGT ATGGTAAGAG ATATTCAGAG
47901 TGAATCTATA GGGAAATTAG TATTTTTAGG CACAGGAAAT CCCGAAGGAA
47951 TTCCCGTGCC GTTTTGCTCA TGTAGAGTGT GTCAAAACAC AGGGATTCAT
48001 CGTTTACGAT CTTCGGTACT CATTCAATAT CAAAACAAGA CTCTAGTGAT
48051 TGACGCAGGC CCTGATTTTC GTACGCAGAT GTTAGTTGCA GGGGTTTCCG
48101 AGCTCGATGG GGTATTTCTG ACCCATCCCC ACTACGATCA TATCGGTGGT
48151 ATTGATGATT TACGTGCGTG GTACATAGTC ACGCAGCGTT CGTTGCCTTT
48201 GGTCCTTTCT GCAAGCACCT ATAGATTTTT AAACAAGGCT AAAGAGTATC
48251 TCTTCGCCAC TCCGAATGTA GAGTCTTCAC TTCCCGCAGT TTTAGAGTTT
48301 ACAATCTTGA ATGAGGACTG TGGGCAGGAG GAATTTCAGG GCATTCCCTA
48351 TACTTATGTT TCCTATTATC AAAAGTCGTG CCATGTAACG GGTTTTCGTT
48401 TTGGAAATCT TGCTTATCTT ACAGATCTCT GTAGCTATGA TGCAAAAATT
48451 TTCAGTTACT TAGATAATGT AGAGACATTG ATCTTGTCTG CGGGTCCATC
48501 GGAAACTCCT ATTCCTTTTC AGGGACACAA ATCTTCGCAT CTTACTGTAG
36

CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
48551 AAGAAGCCAA AGCTTTTGCG AATCATGCAG GGATAAAGAA TTTAATTATT
48601 ACACATATCA GCCACTGTTT AGAAGCAGAG CGTGACCAGC ATCCAGAGGT
48651 CACATTTGCT TATGATGGCA TGGAGGTCCT TTGGACACTA TAGATACGCC
48701 CGGGGAACAG GGTTCTCAAT CTTTCGGAAA TTCGTTAGGG GCCAGGTTCG
48751 ACTTGCCTCG TAAGGAACAG GATCCCTCTC AAGCTTTAGC TGTGGCTTCC
48801 TATCAAAATA AGACAGATTC TCAGGTCGTT GAAGAACATT TAGACGAGTT
48851 GATCTCACTT GCGGATTCCT GTGGTATTTC TGTTTTAGAG ACCCGTTCTT
48901 GGATTTTAAA AACACCCTCA GCTTCCACCT ATATCAATGT GGGGAAGTTG
48951 GAGGAGATCG AAGAAATCTT GAAAGAGTTT CCCTCTATAG GGACTTTGAT
49001 CATAGATGAG GAGATCACTC CATCCCAACA ACGGAATTTA GAGAAACGCC
49051 TTGGCCTTGT CGTTTTGGAT AGGACGGAGT TAATTTTGGA AATCTTTTCC
49101 AGCCGTGCCC TTACTGCAGA GGCAAATATC CAAGTCCAAC TTGCACAAGC
49151 ACGTTATCTC CTTCCTCGTC TTAAGAGACT TTGGGGGCAC CTATCTCGGC
49201 AAAAATCTGG GGGAGGTAGC GGAGGCTTTG TTAAGGGGGA AGGAGAAAAA
49251 CAGATCGAGC .TAGACCGTAG AATGGTCCGT GAGCGTATCC ATAAGCTGTC
49301 AGCACAGCTG AAAGCTGTGA TCAAACAGCG TGCGGAACGC CGTAAAGTAA
49351 AATCTCGACG AGGAATTCCT ACCTTTGCTT TGATAGGGTA TACAAATTCA
99401 GGGAAGAGCA CCCTATTAAA TTTGCTGACG GCTGCTGATA CGTATGTTGA
49451 AGACAAGCTA TTTGCAACTT TAGATCCCAA AACGCGCAAA TGCGTACTTC
49501 CAGGAGGCCG TCATGTCCTT CTTACTGATA CTGTAGGCTT CATTCGAAAA
49551 CTTCCTCATA CTTTGGTAGC AGCATTTAAA AGTACTTTAG AAGCAGCTTT
49601 CCATGAAGAT GTTCTTCTGC ATGTTGTCGA TGCTTCGCAT CCTTTAGCTT
49651 TAGAGCATGT ACAGACGACC TACGATCTCT TTCAAGAGTT GAAGATTGAA
49701 AAGCCTAGGA TCATTACTGT GTTGAATAAG GTAGATCGGC TTCCTCAAGG
99751 AAGTATCCCT .ATGAAATTAC GTTTGCTCTC TCCTCTTCCT GTATTGATTT
49801 CAGCAAAAAC TGGGGAGGGG ATCCAGAATC TTCTTAGTCT TATGACGGAA
49851 ATCATTCAGG AGAAAAGTTT GCATGTGACT TTGAATTTTC CTTATACAGA
37


CA 02350775 2001-05-11
WO OO/Z7994 PCTNS99/26923
49901 ATATGGAAAA TTTACGGAAC TTTGCGATGC CGGGGTTGTG GCCTCGTCAA
49951 GGTATCAAGA AGATTTTTTA GTTGTTGAAG CGTATCTTCC TAAGGAGCTG
50001 CAAAAGAAAT TTCGTCCTTT TATTTCTTAT GTTTTCCCTG AAGATTGTGG
50051 AGATGACGAG GGTAGAGGGC CCGTCTTGGA GAGTTCTTTC GGGGATTAGG
50101 TAGTTTTCTT CTAGGACATC GAATCTTTGT TAGTGAGAAA AAGAGTGATA
50151 TTTTAAAATA GCCACTCATC GCTAAATCTA TTGAAGTCTC TAGAGGTATA
50201 TGACGGTTGC GGAAGTCAAA GGAACATTTA AGCTGGTCTG TTTAGGCTGT
50251 CGGGTGAATC AGTATGAGGT CCAAGCATAT CGCGACCAGT TGACTATCTT
50301 AGGTTACCAA GAGGTCCTGG ATTCTGAAAT CCCTGCAGAT TTATGCATAA
50351 TCAATACGTG TGCTGTCACA GCTTCTGCTG AGAGTTCGGG TCGTCATGCT
50401 GTGCGTCAGT TATGTCGTCA GAACCCTACA GCACATATTG TTGTCACAGG
50451 TTGTTTGGGG GAATCTGACA AAGAGTTTTT TGCTTCTTTG GATCGGCAAT
50501 GCACACTTGT TTCCAATAAA GAAAAATCCC GACTTATAGA AAAAATTTTT
50551 TCCTATGATA CGACCTTCCC TGAGTTCAAG ATCCATAGTT TTGAGGGAAA
50601 GTCTCGAGCT TTTATTAAAG TTCAAGATGG CTGTA_~.TTCT TTTTGCTCGT
50651 ACTGCATTAT TCCTTATTTG CGGGGGCGTT CGGTTTCTCG TCCTGCTGAG
50701 AAGATTTTAG CTGAAATCGC AGGGGTTGTA GACCAAGGAT ATCGCGAAGT
50751 TGTAATTGCA GGAATTAATG TTGGAGATTA TTGCGATGGA GAGCGTTCAT
50801 TAGCCTCTTT GATTGAACAG GTGGACCGGA TTCCTGGAAT TGAGAGGATT
50851 CGAATTTCCT CTATAGATCC TGATGATATC ACTGAAGATC TGCACCGTGC
50901 CATCACCTCA TCGCGTCACA CTTGTCCTTC GTCACACCTT GTTCTTCAAT
50951 CGGGGTCGAA TTCAATTTTA AAGAGAATGA ACCGGAAGTA TTCTCGCGGA
51001 GATTTTTTAG ATTGTGTAGA G,AAGTTCCGT GCTTCTGATC CTCGCTATGC
51051 CTTTACTACA GATGTGATTG TCGGATTTCC TGGAGAGAGT GATCAAGATT
51101 TTGAAGATAC TTTGAGAATT ATTGAAGATG TAGGCTTTAT TAAAGTGCAT
51151 AGTTTCCCTT TCAGTGCTCG TCGTCGTACT AAGGCATATA CTTTTGATAA
51201 TCAGATTCCC AATCAGGTGA TCTATGAGAG GAAGAAGTAT CTTGCTGAGG
38

CA 02350775 2001-05-11
WO 00/27994 PCT/US99126923
51251 TTGCTAAGAG GGTAGGCCAG AAAGAGATGA TGAAGCGTTT AGGAGAGACT
51301 ACAGAGGTGC TTGTTGAGAA AGTAACGGGG CAGGTTGCTA CGGGTCACTC
51351 TCCTTATTTT GAAAAGGTTT CTTTCCCTGT TGTAGGAACG GTAGCTATCA
51401 ACACTCTAGT TTCTGTGCGT CTTGATAGGG TAGAGGAAGA AGGGCTGATT
51451 GGGGAGATTG TATGATAGAT ATAATGCAAC ATTTTAAGCC CTATACTATG
51501 GTCCCAGGAC AAAAACTCCC TATTCCTGGA TCTTTGTTAT ATGCTCAGGT
51551 ATTTCCTACC CTGTGGCGTC TATTTTCTTC GAAACACGAA ATCTTAAATG
51601 AGCAGACCTT ACAGGTGCAA GGGCCTTTAA AACGCTTTGC TGTTTTCCAA
51651 GATTTACATC GTGGGGGGCT TGCAGTGACT TCTGAGCGCT ACAAGTATTA
51701 TCTCCTTCCC TCGGGAGAGT GCACACAATC TATCAAAGGG AAACTGCCTT
51751 CGGCAGCGCA AGCAGGGCCC CTGTTATCTC TTGGGGTGCA TAAGCATGCA
51801 GATTGGCAAA AGGTCCGTTG TCGTCGTGAT CTTAAAGAAA TTCTTCCCCT
51851 ATGGTTCCGT TTCGCCGCTA TGGCTCCTAA GGGATCCTAT CGGGATCTAG
51901 AGACGACGGC TATCGGTAGC TTGGTAAAGA CTGCCCATCA AAGAGTTTTA
51951 CATAGGGAAA CTACAGAGAT TGCTCCTGCG TTACTCTCCA TAGCCCTTGC
52001 GGGATTTTCA GAGTGCTTTC TTCCTAGGAG CTATGATGAA GAGTTCCAAG
52051 GAATCCTCCC CCAAGATGGA GATCCAGAGG GGGGAGTTCC TTTTGAGCTT
52101 CTCTCGTATA GCTTTGGTAT GATCCAAGAT ATTTTTCTGA GACACCAGGG
52151 ACAGCTAGTA GAGATCCTTC CTGCATTACC TCCTGAATTT CCTTGTGGCC
52201 GCTTGATTCA TGTTGCCCTT CCTAATCTTG GGACTTTGTC TATCGTCTGG
52251 ACTAAGAAAA CTATCCGTCA GGTCGAGCTC CATGCAGAAT ATAGTGGCGA
52301 GGTATTTTTA AAGTTTTGTT CTTCACTATG CAGTGCGCGC CTTCGGGAAT
52351 GGTCGGAGCG~ACGTCTCTCT GGATCTAAGA GACTTTCTTT AGGAGAAACT
52401 CTGGAGATAA AAGCAGGAAC CACATATTTA TGGGATTGTT TTCATAAATA
52951 GATAGCCTTC CATGGTTGAT AAACTGATCC ATCCTTGGGA TCTTGATCTG
52501 CTCGTCTCAG GACGACAGAA AGATCCCCAT AAACTCTTAG GGATCCTTGC
52551 TTCTGAAGAT TCTTCAGATC ATATTGTTAT TTTTCGTCCA GGGGCGCATA
39


CA 02350775 2001-05-11
wo oom~4 PCT/US99/26923
52601 CGGTTGCTAT TGAACTTCTA GGAGAGCTTC ACCACGCTGT AGCTTATCGT
52651 TCGGGGCTCT TTTTCTTATC CGTTCCCAAA GGAATCGGAC ACGGGGATTA
52701 CCGTGTGTAT CATCAGAATG GACTTCTCGC TCATGATCCC TATGCGTTTC
52751 CTCCTCTGTG GGGAGAAATT GATTCTTTTT TATTCCATAG AGGAACGCAT
52801 TACCGCATTT ATGAACGCAT GGGGGCAATC CCTATGGAAG TTCAAGGAAT
52851 CTCAGGGGTG CTCTTTGTTC TTTGGGCTCC CCATGCGCAG AGAGTCTCTG
52901 TAGTCGGAGA TTTTAATTTT TGGCATGGCC TTGTCAATCC TCTACGTAAA
52951 ATTTCCGATC AGGGGATCTG GGAGCTTTTC GTCCCAGGCT TGGGAGAGGG
53001 AATACGGTAT AAGTGGGAAA TCGTTACCCA ATCGGGGAAT GTGATTGTAA
53051 AAACAGATCC TTATGGGAAG AGCTTTGATC CTCCACCCCA GGGTACAGCT
53101 CGTGTTGCGG ATTCTGAGAG CTACTCTTGG AGTGATCATC GTTGGATGGA
53151 GAGGCGCTCG AAGCAGAGTG AAGGGCCCGT CACGATCTAT GAAGTGCACT
53201 TAGGCTCTTG GCAATGGCAG GAGGGAAGGC CCTTAAGCTA CAGCGAAATG
53251 GCGCATCGCC TTGCTAGCTA TTGCAAGGAA ATGCACTACA CTCATGTGGA
53301 GCTTCTTCCC ATTACGGAGC ATCCCCTGAA TGAATCTTGG GGCTATCAAG
53351 TGACGGGATA TTATGCTCCA ACATCAAGAT ACGGGACTCT CCAGGAGTTT
53901 CAGTATTTTG TAGACTATCT ACATAAAGAA AATATTGGTA TTATTTTAGA
53451 TTGGGTGCCG GGACATTTTC CCGTAGATGC GTTTGCTCTT GCCTCTTTTG
53502 ATGGGGAGCC TCTCTACGAG TACACGGGGC ATAGTCAGGC TCTTCATCCC
53551 CACTGGAATA CGTTTACCTT TGACTACAGT CGTCATGAAG TGACCAACTT
53601 TTTACTAGGG AGTGCTTTAT TTTGGCTCGA TAAGATGCAT ATTGATGGCT
53651 TACGTGTGGA TGCTGTGGCC TCTATGCTGT ATCGTGATTA TGGCCGTGAA
53701 GATGGAGAAT GGACGCCTAA CATCTATGGA GGTAAGGAGA ACTTAGAGTC
53751 TATAGAATTT TTGAAACACT TAAATTCTGT AATTCATAAG GAGTTCTCTG
53801 GAGTGCTCAC CTTTGCAGAG GAATCCACAG CGTTTCCAGG AGTCACTAAG
53851 GACGTAGATC AGGGAGGTCT GGGGTTTGAT TACAAATGGA ACTTAGGTTG
53901 GATGCACGAT ACCTTTCATT ACTTTATGAA GGATCCCATG TATCGTAAAT
40


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
53951 ACCATCAGAA AGATCTGACA TTTAGCCTTT GGTATGCCTT CCAAGAGTCT
54001 TTTATTCTTC CTCTCTCGCA TGACGAGGTG GTCCACGGTA AGGGCAGCTT
54051 AGTGAATAAG CTTCCCGGGG ATACCTGGAC CCGATTTGCT CAAATGAGAG
54101 TGCTCTTGAG CTACCAGATC TGTTTGCCTG GGAAAAAGTT ACTGTTCATG
54151 GGTGGGGAAT TCGGACAATA CGGCGAGTGG TCTCCTGATC GTCCCTTAGA
54201 TTGGGAGCTT TTGAATCATC ACTACCACAA AACTTTGCGA AACTGTGTCT
54251 CTGCATTGAA TGCGTTGTAT ATTCACCAAC CCTATTTATG GATGCAAGAG
54301 AGCTCTCAAG AGTGCTTCCA TTGGGTAGAC TTCCATGATA TAGAAAACAA
54351 TGTCATTGCC TATTATAGAT TTGCAGGCAG CAATCGTTCT TCGGCGCTTC
54901 TCTGTGTCCA TCATTTCAGT GCGAGTACTT TTCCTTCCTA TGTTTTAAGG
54451 TGTGAAGGTG TAAAGCATTG TGAACTCCTT CTCAACACTG ATGATGAGTC
54501 TTTTGGAGGC TCAGGGAAGG GAAATCGGGC TCCTGTGGTC TGTCAAGACC
59551 AAGGGGTCGC TTGGGGTTTG GATATAGAGC TCCCTCCTTT AGCTACTGTG
54601 ATCTATTTAG TTACTTTTTT CTAAAAATTT AAATACTTTA TTTGTAAATT
54651 GTTGTGGGAT TGTTCTATTT TGTGGTGTAG TTGATATTAA TAATTTATTT
54701 TATAATTAAA AATAATTATT AGTATTTCTT TTATGTCTAC ATCACCAATT
54751 AGCAACGATC CCCGATATTT GTCTTTGTCT AATGCAACTG AGAAAACTTC
54801 TCTTCTTGCA AATAGCCGGA GTCTCTCGCC AGTACCAAAT TCCCTAGTTC
54851 CTAGCAATCC TGAAGATACA GGATTGCGAA AAAGTATTTT CACCCATTCC
54901 GTGACTTTAT TTGCTGGCCT GGTTGTTTTG CTGGTAGCGG TTTCTGTTGT
59951 TGTTGTCGCT TTGACCGTCT TAGCTCCCGG AGTTCCTCAG GCTATTCTTC
55001 TTGGAATCGC CATTTCAGGC GTGGGTATTG GTGGATTTTC TATAATGAAG
55051 AGCTTGGTTT ATATGGTCCG AGACTATATG TCCCCCAGGA TGCAGGAGTC
55101 GAGCAGAATC AAAAGTGCTT TAGCTGTAGG GACTGGATTT ACTGTCATGG
55151 GTTTGGTCAT GAAGGTGGGG GCGAATTTTG TTGCTGGAGG GTATGGGGGT
55201 CTCGTGGGTA GCTTGGGATC CAGTGCGTAT TCCCGGGGAA GCCAAACCAC
55251 ATTAGCAAGC TTCAGTCATT ATATTTATAC TAAGTTTTTC CGTTCTGAAA
41

CA 02350775 2001-05-11
WO 00/27994 PG"T/US99126923
55301 AAGTTGCTAA AGGGGAGAAG CTTACAGAAG CAGAAACTAT AAAAGAGGCG
55351 AAAAAATTAC ACTATATCAC GTTGTCAATT GCCACTATTG GCGTTGGTCT
55401 TGCGGTTTTG GGGATTCTCC TTGCCATTGC AGGAACGGTA TTGCTAGGAG
55451 GCGCTCCCGC AACGATTGCT ATTATTTTAG CTCCCCCTTT AATTTCTATA
55501 GGGCTTACGA CGGTTTTGCA AACGATACTC CATAGTAGTA TCGGAAAGTG
55551 GAGAGCCTTT CTGCTTACTC AAGAP.AAAAA AGATCTTTTT GTAGACACCT
55601 CCCTGAAAGA CATTCGCTTA GAAAAATTGC CCCCCAGTGA GGTGGAAGAG
55651 AGTGAAACTT CCCAATCTGT GATAGAAGTT CCAGATTCAG AGGGGATTGC
55701 AGAGACGAGG ATCTCTGCGG AAGAAATCGA TACGAGGCTT TCCCTGACGA
55751 CAAGACAGAA GGTCATCTTT GCTCTTGCGA CACTCTTGCT CTTAGCAAGT
55801 ATTGCTGCCT TCATAGTCAC GGGATTTGGT GGATTGACAG TCATGCAAGT
55851 TCTCCTTGTT GCTTCTGTAG GATCGGCGGT TGCTTCTGTA ACACTCCCTA
55901 TGGTTTCCTC AGGATTTTCC TACGTCGCCT ACCAACTGAA AGCAAGATTG
55951 AATATCAGTA AATTACGTTG GAAAGAAGCA AAAAATAAAA AGCGGGTGCG
56001 CCAGTTCTTA ATTGAGTCTG GAGTGATTGC CTCGGATCGA GAATTTAACC
56051 AAATGTGGAA GACAGTCTAC AAAAAACAGA TTCAGAAGAC TGACGCTGCA
56101 ATTCGTGAAG AGGTTCGCAA TTTTGAGAAG GGTGGGGAAG TGAACAGCGC
56151 CCTTGTTGGT GGAATCTTAC TTGGTGTAGG AACTGGGATC ATGCTTCTTG
56201 CCCTGGTCCC TGCATTTGCT CCTATCGTTC CTGGTATTCT TGCTCTTGGA
56251 GGATCGACGT TAGGAATCGC GGGATCGATT TTAATGAGGA AGTTTGTCAA
56301 CTGGCTCTAT GATGAGCTTG TGAAGCTCTA TGAGCGTCGA CGTAATCGCC
56351 GTGAGCTTCT CTATGGTCCT GAAAGTAAAA TGCGCTCCAT TGCTACGGAT
56401 TTAGTTGTTG AGGCTCTTGC TGCTAGCCAC GATCATCTAT TTGATCTTGA
56451 TGGTCCCGTA GATTTTATTG ATGTGGATGT AGATATAGAT GGAGCTGCTT
56501 AGGCCAGGTC CTTGAATGTA AGATCCTCGA GCTTTGGGAG CTTGTCTGCC
56551 TTCTCTAATT TTATTTTCTC TTTTACATCT AGTAGTTTAC TTAATTATAT
56601 AATCAGCATT CTTTTTGTTT ATTTTAATTT ATATTTTGTT TTTAAAATAT
42


CA 02350775 2001-05-11
WO 00/Z7994 PCT/US99/26923
56651 TTTTTATTTT AACATTTGTT TAATAGTTTT TATTAAATAA TTTATCATTT
56701 TAAGGTTCAA TTATGGCAGT TGGTGGCGTA GGCGGCTCAA GATCTCCTTC
56751 CCCCATTCCT CCTAATAGAA GGAATAGTGA GGATGGAAAA GTAAGTCCTA
56801 AAGACAACTT AGGGGAACAT ACAGTTAGCA GTAGTGACAG TAGTCTTGCA
56851 AGTCAGGGCC CTACAATAGA AGAGAGAAAA GCCCAGTTAG GCGGGACTGA
56901 TAAAATTCCT TTGCCATCTG TCAAAGAACC CGGAGATTCT CAAACTTCAG
56951 GACGTTCTGG GGTACTTCAG AGAATTTGGA AAGGCGTTAA AGGGGTCTTT
57001 AAAAAAACCC CTCAAGCGCG TCCTGAAGTT TCTAGTCCAC GTCTTCCATC
57051 CCATGTGCAA CATGGCCAAC GTCTTCCTGG ACTCGAGGGC TTTAGAGATC
57101 GTATCCAGAA AAGATCTGAA AATCCAGAGG CAGATTTAGG GAAGATGAAA
57151 CGTTCCTATT CTGATGGTGA CCTTGATCGA GTAGGACACG ATTCTAATGA
57201 AGATTCTACA GAGGATAGCC GTTCTGAAGG AGGAGAGCCT TCTTCAAAGA
57251 GTTCTTCCTT CTTATCAGGA GTTCGAGGAG CGGTGTCTAA AGTTCATGGT
57301 GCCCTAGGTG ATATTAAAGG AAAGTTCCAG CGTTCTGCTT CCGAAGATGA
57351 TTTAACAACT CAGGGCGAAG ATTCTGCCGG CGATACTGTA AAAGAAAGGC
57401 GTTCCGAAGA AGCAGAGGCT TCTTCGAAGA GTTCTTCTTT TTTATCAGGA
57451 GTTCGAGGAG CGACGTCTAC AGTTCAGGGA GCCTTAGGTG ACGCTAAAGA
57501 GAAGGTTTCG GCGTTCGGAG AGCAGGCTGC AGGTGCAATC AGATCAGCAC
57551 CAGGGAATAT CAGAACTAGA TTCCAACGTT CTTCATCGGA AGGTGATCTT
57601 TCTAATGTGA ATAAAGCAGC AAAACATCTG CGTAAGGCTT TAGAAAATTT
57651 GGAAAAAGTA GCTCCAGAAC AAGTGTCACC AGAGGTGGCT TCTAGGGTGC
57701 AATCTCTTCT TGCACGCATG GAGCAATTGA CTCATCAGGA ACCTCCTACT
57751 GTGGAGGATC TTATTACTTT CGTAGAATCC AATGTAGGTA GTGATTCTGT
57801 GGAGTATGCA TCCATCGTAC CTCAAGATGG ATCGCAAGCC CCAGCAGAGA
57851 CTGCGGAAGC TCCCGAAACA GGTGGGGTAG AGGGATCTGC AGCGCAGGGA
57901 GCATGGAAAG CGTTACGGGA TTTTGTAGTT AGCATATTCC AAGCGGTAGC
57951 GAGCTTCTTT AGGGCAATTG CTTCAAGATT AAGTTCAGCA CGACGTGAAT
43


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/269Z3
58001 CAGCTGTAGA TGATCTTGCA TCAGAAAGTA ATACACAATG GTTTGTGGAG
58051 CAAGAGGGCG TTTCAAATCC ATCGGCTGCA CCTAGCTTAT CTTTTGCGGA
58101 AGAGATCGCT CGTAGAGCTG CAGAAATGAG TAACAGAAAT GCCCAGAGTC
58151 TTGAAAAATT GGAATCAGGC AATGTGACTG ATCCTGTCAT TCAACAAGGC
58201 TTAGGATTAG CTAGATCATT TGCTCCAGAG GGACAGTAGT CGTTATCTCA
58251 CTGTTTCTCT ATGCGCAAGG GAAACTTGAA GAGTTTTAAT TAAAACTCTT
58301 CAATATGTTG ATTATTTTAA TATATTTAAA AGCATTTTTG TTGTTTTTTA
58351 ATAAAATTAA ATTGTTTCAG AAAAAAGATT ATTCTTTTTA GGAAGTGTTT
58401 ATGGCATCAG GAATCGGAGG ATCTAGTGGA TTAGGAAAGA TTCCACCTAA
58451 AGATAATGGG GATAGAAGTC GATCGCCCTC TCCTAAGGGA GAACTTGGCA
58501 GCCACGAGAT TTCCCTGCCT CCTCAAGAAC ATGGAGAGGA AGGAGCTTCA
58551 GGATCTTCGC ATATACATAG CAGTTCCTCT TTTCTACCAG AAGATCAGGA
58601 GTCTCAGAGC TCTTCTTCGG CAGCTTCTAG CCCGGGATTT TTTTCTCGCG
58651 TACGTTCTGG GGTAGACAGG GCCTTAAAAT CATTTGGCAA CTTTTTTTCC
58701 GCAGAGTCTA CGAGTCAAGC GCGTGAAACG CGACAAGCTT TTGTTAGATT
58751 ATCAAAAACC ATCACCGCGG ATGAGAGACG GGATGTCGAT TCATCAAGTG
58801 CTGCTGCTAC AGAAGCCCGA GTGGCAGAGG ACGCGAGTGT TTCAGGCGAA
58851 AATCCTTCTC AGGGGGTTCC AGAAACCTCT TCTGGACCAG AACCTCAGCG
58901 TTTATTTTCT CTTCCTTCAG TAAAAAAACA GAGCGGTTTG GGTCGGTTGG
58951 TACAGACAGT TCGCGATCGC ATAGTACTTC CTAGTGGGGC TCCACCTACA
59001 GACAGCGAGC CTTTAAGTCT CTACGAGCTA AACCTCCGTT TGAGTAGTTT
59051 ACGTCAGGAG CTCTCTGACA TACAAAGTAA TGATCAGTTG ACTCCAGAGG
59101 AAAAAGCAGA AGCCACAGTT ACCATACAAC AGCTGATCCA AATTACAGAA
59151 TTCCAATGCG GCTATATGGA GGCAACACAA TCTTCGGTAT CTCTAGCAGA
59201 AGCTCGTTTT AAGGGGGTAG AAACTAGTGA TGAGATCAAT TCCCTCTGTT
59251 CAGAACTGAC AGATCCTGAG CTTCAAGAAC TCATGAGTGA TGGAGACTCT
59301 CTTCAAAACC TATTAGATGA GACTGCCGAC GATTTAGAAG CTGCTTTGTC
44


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
59351 CCATACTCGA TTGAGTTTTT CTTTAGACGA TAATCCAACT CCGATAGACA
59401 ATAATCCAAC TCTGATTTCT CAAGAAGAGC CTATTTATGA GGAAATCGGA
59451 GGAGCTGCAG ATCCTCAAAG AACTCGGGAA AACTGGTCTA CAAGATTATG
59501 GAATCAGATT CGCGAGGCTC TGGTTTCTCT TTTAGGAATG ATTTTAAGCA
59551 TTCTAGGGTC CATCTTGCAC AGGTTGCGTA TTGCTCGTCA TGCAGCTGCT
59601 GAAGCAGTGG GTCGTTGTTG CACGTGCCGA GGAGAAGAGT GTACTTCTTC
59651 TGAAGAGGAC TCGATGTCGG TGGGGTCTCC TTCAGAAATT GATGAAACTG
59701 AAAGAACGGG CTCTCCGCAT GACGTTCCAC GCAGAAATGG AAGTCCACGT
59751 GAAGATTCTC CATTGATGAA TGCCTTAGTA GGATGGGCAC ATAAGCACGG
59801 TGCTAAAACC AAGGAGAGTT CAGAATCAAG TACCCCGGAA ATTTCGATTT
59851 CTGCTCCCAT AGTGAGAGGT TGGAGTCAAG ACAGTTCCGT CAGTTTTATT
59901 GTTATGGAAG ATGATCATAT TTTCTATGAT GTTCCTCGTA GAAAAGATGG
59951 AATCTATGAC GTTCCTAGTT CCCCTAGATG GAGTCCTGCG CGAGAGTTGG
60001 AAGAGGATGT TTTTGGAGAT TATGAAGTTC CTATAACCTC TGCTGAACCA
60051 TCTAAAGACA AGAACATCTA CATGACACCT AGATTAGCAA CTCCTGCTAT
60101 CTATGATCTT CCTTCACGTC CAGGATCGTC TGGAAGCTCA CGTTCTCCGT
60151 CTTCAGATCG CGTACGAAGC AGCTCACCAA ATAGACGGGG TGTGCCTCTT
60201 CCTCCAGTTC CTTCACCTGC TATGAGTGAG GAGGGGAGCA TTTATGAGGA
60251 TATGAGCGGT GCTTCAGGTG CAGGTGAAAG TGATTATGAA GATATGAGCC
60301 GTTCCCCCTC TCCTAGAGGC GACTTGGATG AACCCATATA TGCTAATACT
60351 CCTGAAGATA ATCCATTTAC TCAGAGAAAT ATAGATAGAA TTTTACAGGA
60401 GAGGTCAGGC GGTGCTTCCG CTTCTCCTGT AGAGCCTATT TATGATGAGA
60451 TCCCATGGAT TCATGGCAGG CCCCCTGCTA CACTTCCAAG ACCCGAGAAT
60501 ACATTGACTA ATGTTTCGCT TAGAGTGAGC CCAGGGTTTG GACCAGAAGT
60551 AAGAGCCGCT TTGCTTAGCG AGAGCGTGAG TGCTGTTATG GTCGAAGCAG
60601 AGAGTATTGT TCCTCCAACA GAGCCGGGGG ACGGAGAATC AGAATATCTA
60651 GAGCCCTTAG GGGGACTTGT AGCTACAACG AAAATCTTAC TACAAAAAGG
45


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
60701 ATGGCCTCGT GGAGAGTCGA ATGCTTAGGA TTTAAGTAGT TCTTTCGAAT
60751 CTCTAGTGAG GATGTATCGG GTTCTTAATT TTTATGGGGG AAACGTATCT
60801 GTGGATTTCC CTTAGCTTCT CCCATAAGAT TCATGATGGT AGAGAGTGTT
60851 CACTACTCTC TATTTGGTCT TTACAGGTTG CATTGTCTAT ATAACATGCT
60901 TTTAAAACTT AAAGGTCGTT CCTGCGTGAA GGTAATGTGT CGTCGTTGAT
60951 GAGGATACCG AGCCTTGATA GTCTAAGAAC ACCGAAAGTT TAGGGAAGAT
61001 AAAAATTTGG TTTCTTCCTT TAAAAGCAAT GGCATTGCGA GCAAGGGTGG
61051 TTCCTGATAG GAGCCATGAC GATCCACTAG ATTCTAGACT CACGTTGATC
61101 TCAGGATTTT GTTGGTAGAG GACAGGCTGA TAAGCAAGCT CTATGTTCCA
61151 ATAGGTAGGA AGACGGAACT TGGATTCCCA AGCGCTCTGA ATTCCCAGAG
61201 GGACTGTCAG GTTATATAAG GGTTTATGAA CAGAAAATTT TCTAGCTTTA
61251 TCTCCACTTT CTTGAAACGC AGTTTGATTA GAACGAACGG CAATTGCTTG
61301 GATAAAAGGA GTGAAGTGGA GAGGTCGTGA TCGCCATTGT AGAGATAGAG
61351 AGCAAGAGAG AGCCGCCCCT AATGTCGTAC TATAACATTT GCCTTCCGTT
61401 TGTATTTTTC CAGAATATCC AGATGCTTTG ATATGGTGGT TGCTGTAGCT
61451 GTAGGCTAGA GATGCAGATG TAGAGAATCT CTCTTGCAGC CAAGGATTAT
61501 TGATCTGGAG CGCTACAGTT GTCGTATGCG AAGCCACGGA ATTGTCGGAG
61551 TGGCTCTCGT AGAGATTACT GAAAAGTTGG GAGAAGTTTA CACCAAAGCT
61601 ATGATTAGAA GCAGTGTTTG AGGTTGTTCC CAAAGAATAA CCCGTAGCTT
61651 CCATATGGAA TCCTTTCGCA TCATTGTTGC TATTTTGATG CACGAAGAGT
61701 CGAGTAGCTT CTCCAGAAGC TGTAGGTGCT ATTTGGCCTT GCTGTGTTTG
61751 ATAACGTAGT GTCGCAAATA AGTTATGGAA AGATTGCCAG AAGGCAGATA
61801 GGGCAATGTC TCCTTTGTTT TCTGGGTTTA CCTTATATCC TGTAGGTGTC
61851 CAATCACCAT AAAGCTGGCG ATGTAAAGTA TTCACAGTAT CTTCAGAAGA
61901 GGTATCAGAA GTTGTGATTG TTTCGATCCA GTAAGGGGAC CAAACGCCTT
61951 GGTAGCCGTA GTGTTGAGTT GTATTTAGAC CCTCAGGGTA GAAATTATCC
62001 GTATTAATAT GTTTAGCTGT GACGTCTAAG AGATACAGAA GAGGAACTTC
46


CA 02350775 2001-05-11
WO OOI27994 PCT/US99/Zb923
62051 TGCGATAGGT TGGGCAAGGT CTGCAGTATC ATAGGGATCT AGGTTCTCGT
62101 CATCCAGTAG GCTCAAAGGT CCTGAGAGAT TGATTATAGG GTTATTATCT
62151 TCGCTATAGG GTGCTGATGA ACCTGTGGGG CGAATCCATA GCTTGGGAGC
62201 AACTCTGTTG CCTAAGATAG AGGGAAGGTT AATTGCAAGA TTATTGATGT
62251 TAATTACAGA ACCCACACTA CTGCTACTTT GTTCTTCGTC TGTTGTAGAA
62301 AACACAGCTC TACTGCCTAA CCGTAGAGTC CCACCAAATT GATCAAATTT
62351 ATAGACTTTC CACTCTGCTC GATCTTCAAG AGCGAGTGTG CCGTTGTACA
62401 GTCCAATGTG GTTTCTGAAA TGTGAAATGA AGTCATCACG AGAAGTCGAT
62451 GTATCCGGAA TATATGTTGA GGAGAACAAG ATAGTTCCGA GGTGTTCTGG
62501 ATTAGGATTA AATTTTTGGA TAGAGTTTTG TATAGTATAT CTTTGTAGTA
62551 TGGGATCATA GAAGGTAGCA GAATGACCTT GACTTGCTCC AACTGTTAAT
62601 GAGACATTAC GCGTGCAGTT TACAGAAACA TGATTGCTGA AAGTATCTTT
62651 GAAGTGTCTA TTATTATAAA AAATAATATC TCCCTGATCA GCAAATAAAG
62701 TGCATGCACC ATCTTGACGG AGCATGATAG CGCCGCCCCA AGTTCCCTGA
62751 TTGTTTGTGA AATAGACGGG ACCACTGTCT TGTATAGTTA GAGATTGTGT
62801 ACAGATAGCA CCTCCATCTC GTGCTGCAGT ATTATTATCG AAGGCTGCAA
62851 TTCCTGGGTT GTCTTTTATA GAACAACTAA TGCAATAGAT AGCCCCTCCA
62901 GAGGAATGGT TAGCAGAGAT GTCCGCTTCC ATGGCAAAAT TATTGTTGAA
62951 GATCACAGAA CCGGTATTCT TTGTAAGAAT GCACTCTTGA TGTACTCTTA
63001 TTGCACCACC CAGACCTGAT TGGTTATTCA AAAAATAGAT AGGCTGAGAA
63051 TTATTCTCAA TTCTACAAGC ATTAGCGAAC AACGCGCCCC CCGCTGTTCC
63101 GCCTGCAGCA TTATTAAAAA ACAGGCAAGG GCCAGTGTTG TCCTTAATGT
63151 TTATGATTGC'AGCTTGGATT GCTCCTCCTG AAGATTTTGC CTTGTTGTTA
63201 ATGAAGTATG CGGTTCCTTG ATTTTTTGAG ATTGTAACAT TTTTCGAACA
63251 TAAAACAGCT CCCCCTGTAC AAGTATCAGC GAAATTACTT GCATTAGGAA
63301 AGCTTAAATT CCCAGAGAAA ATGATGGAAC CATGATTCTC AGAAAGATCG
63351 AAATTACCAT TCACATACAT CGCACCAGCT CTTTTAATAG CAAAGCTATT
47


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
63401 TAGGAAAAGA ATTTGGTTTT TTGTATTCGT TATGGCAAGT GATTTGCAAG
63451 AGAGAGCACC GCCGTCTTGA GAGAAGTTTT CGAACCAGCT TTCTATGGAA
63501 TTCTGGTGAT CGAGGACAAT GTCTTGGTTA GTGTCATCCC TAACTCCAAA
63551 AAGTGTTGCT CTATGAGAGT AGGGAGTCAT GTTAGTAAGA GTATCAATTA
63601 GAGGGAAGAG TGTTGTGAGT TGATTTGCTT GATTATCAAA ATAGTCAGAC
63651 AACGGAGTCG CATTAAGGAG TATTGTAGTT TTACCTAAAA TTAAGGCTCC
63701 AACAAAGAAG GAAGATTTAC TAAGGGATCT GTTATTTTGC ACTGTTTATT
63751 CCTTGATGTT TTCTTTGTGG TTAAAATGTG CAATGACTCT CAGCTTTAAG
63801 GTATTGACCT ACAGTTGACG AGGAGACTTG AGCTGAATAA TCTAAGGATA
63851 AGGTTACTCT TGAGAAAAGT TGGGAAGTAT TTTTTATTTT TGCAGCTACG
63901 GAATTATAGG AGACGGGAGT TGCTTGTGTT GTCCATGTTC CATTGCTGAT
63951 GAGTAGTGTC GTGAACATTT CTGGATTTTT TCTGTATAGG GTAGGTACGT
64001 AGGATATTTC CGTAGTCCAT AGCATGGGGA TATGATGTGA AGTTTTCCAT
64051 TCAGAACGGA AGCCTATGGG AGAGGAAAGA TCTGTAAGGG GATGTTTTGG
64101 ATGGAATTTT CTTATATGGT CTCCAGTTTC TTGGAACGAG GCCTGGGAAC
64151 AGCGCAGAGC AATGGCACTG ATAAAGGGCT GGAGTTCGAG AGTGCGGGTG
64201 ATTCTAGCTG GTAAGAATGT GCAGTCTAGA GAGGCTACCA AAGTGTGGTT
64251 ATTAAAGAAG GCTTTGGACG ACCCTTTTAA GATTTCTGTA TAGTGGCAAA
64301 GCATATGGTG ATCTCCGTAG CTATAACCTA GGGATAGCCC TGTAGAGATG
64351 AAGTCCCTGA AGAGGAGACT GTCGAAGCGG AGTCCTGCAA AGTAGTTGTG
64901 GGAGGAAGTC GTACTTGGAG ATTGACGTTC TCTAGTTTTG GAGAACATTT
64451 GTGCGAATCC TAAAGAGAAA CTATGTCGTG CTGCAGTTTT TGCTGAGGTT
64501 GTTGCTGCAT AGCCCGTAGT ATGGTTTCGG AAGCCTTTGC GTCCCTCGCG
64551 ATTATGTTGG TTAATTAGAA GCCCGAGTCC TTGCAGAGAG GCTTCAAGGT
64601 CATGCTCTTT GAGGTTTTGT GGAGGTAAGA TGCGGATTCC TAACAGAGCG
64651 TTATAGGCAG ACTGCCATAA GGTATTAGCA ATAAATTCTC CGTGACGTTC
64701 CGGGTTAGGG CGGTATCCTA CAGGAGTCCA GTCTACGTAG AGCTGCCTGT
48


CA 02350775 2001-05-11
WO 00/Z7994 PCT/US99/26923
64751 GGTTTGTATT GGTCTGTTCC GGTACTGTAG AGCTTGTTGT AGTCGTAGTT
64801 TCCATCCAAT AGGGAGACCA GATTCCCTGA TATCCATAGT GCTCATCTAA
64851 GTTCATGGCT TCTACAATGA GATTCGAAGT ATCGATTTTT TTTGCAGTCA
64901 CATCGAGGAG GTAGAGGAGG GGGGATATCC TTTCGAGGTT CAGAGAGATC
64951 TAAGCTATCA TAGGGGTTTT CATTTTCATC GTTTAGAAAA GTCAAGGGTC
65001 CTGAGAGAGT GATAGTAGAA GAAGTGTCTT CAGAATAGGT GGATCCTGTT
65051 AATGTAGGAT AAATCCAGAA CTTTGGAGCT GAGGCTTCTG ATTGTAAAAT
65101 AGAAGGAAGA TTGATCGCGA TTGCATTAAA ATTTATGGAG CTTCCCGGGC
65151 CTTTCGTCCT GATTAATGCT GCGTTTCCTA AACGTAGAAT GCCCCCAGTT
65201 TGCGATAGGG TTTTGCAAGA AATAGCAGCC CGATCTTCAA TAGCGAGCAC
65251 ACCCCTTTCA AGTCGTGAAG AGTTAGAAAA TTTTGATAGG AAGTTCAATG
65301 GATTTGTTGC GTTAGAATCT ACATTGATTC CGGAAAACAA CACGGTGCCA
65351 AGGTGATGGG GTTCATAATT AAATACTATA GGATCTGTTG TCGTCTGATC
65401 GTGATCTATA GGATCATAAA AGAGAATTTT ATAACCCTGT CTTGCTCCTA
65451 GTTTTAAGTT AATCCCCGGA GCAGCATAGA GTGCATTTCT ATATCCGGGT
65501 TGAGGAGAAG AAGATGTGAT TGTATTATTG TTAAATAGAA TATCGCCGTA
65551 GTCTGCAGAG AGGAAGAAAT TTTGAGGAGT ACTTCCTATA CCAGAAAGAT
65601 TGATGAGAGC CCCTCCTGAA GTCGCAGAGT TATTAATAAA TGCTGTCGGA
65651 CCGTTATTTT GGAAGATGAA AGATCTCGTG TGTATAGCTC CGCCGCTAAG
65701 TGCTGCCGTT TTATTGTTAA AGATAAGACC TTTGGGATTG TTCTCAATGA
65751 CTAAGGAGGT ACACATGATA CCGCCACCAC CGGGATATAG TTTTCCTGAT
65801 GCTGTGTTAA TTGAGGATGC GGAGTGATTG CTGATCTCTA TAATTTCTTT
65851 ATTTGAAGAG ATAATGACTC CTAGGGCAGA AAATATACCA CCCCTTGTCC
65901 TGAAGCATTA TCATTGATTT GGATGTTTTG ATAATTCCTT TCTATATTTA
65951 CGGCTCTGCA GAAGATGCCT CCTCCAAAGC TGGAATCTTC TAATGTTTGA
66001 TTTTTCTTGA TAACGATAGG ACCTAAGTTG TCTGTGATTG ATAAACTCAC
66051 CCCAACATAG ATAGCTCCTC CTCTATTTTT AGAGACATTA TCTAAGAATT
49


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
66101 GTCCTTCTCC TAAGTTATTT GTAATATAGC TATCTAGTGC ATGTATAGCC
66151 CCACCAAATC CTGAAGTTGT AGTAGTAGCT AAGGAAGCCG CATTTGAAAT
66201 AAACGAGTAG TTCTGATTCT TAGAAATCCA GCATTCCCGA ACACTGTAGA
66251 GAGCCCCTCC AGAACTGTGG GAGCTATTCC TTTCAAAACT TAAGTTTCCT
66301 TTATTTTCAG AAATGAACAA ATTTTTACAT GCAAGAATGC CTCCATCCTC
66351 GTAGTGGTAG TTATAATCTA TAACAGAATT GATAGAGTTT CCTGTAATTG
66401 TGATGTCCTG GGTTATATCG TGACCGAATC CATTAAGAAG ATTAGAGGGA
66451 CGAAAGGAGG AAACCAAGGG CTCTGGAGTT ACATTCAGAC GGAAGCTTGG
66501 AGACGTGTGG AAAGCAGAGA TGTCTTTTCT AGACATCTGA CAAGAGGCGA
66551 GGTTAGGGAC TTCATTTCCT GATAAGGAAC AACATAGCGC AGTGGATAAA
66601 ATGCTGAGAC AAATGGGTCG CATAAACATC CAAACTTTGA TATGAAAATA
66651 GAGTTTGTCA TAAAAAATCA TCAATTTGTT GTCTAGAAGT TTATAAATTT
66701 ATATTTAAAT ATGCTTATTA ACATGTTGAA TATTACAGAG TTAGTAAACT
66751 CATATATTTG AATGATTTTC TATATTTAAA TTTTGAGGGA GTCCTCTACC
66801 GATTGGGCTA ACGTAGCTCA TCGGATAGTT GTTAATTCTA AAGATTTCAA
66851 TTTTATCGTT CTTTTATTTT AGAATTTTAA GGTACTTCCT GCTTGGAGAT
66901 GGTGCGTAGA TGTCGAGGAG GAGACCGATC CTTGGTAATC CAAGAATAGA
66951 TCGAGAGAAC GGAAGAGCGC AGTTTGATTG TGGACTTTGT ACCCTAAAGC
67001 ATTGCGAACA TAGTTATGGC CTAGGATATC CCAGGAACCT CCGCTCGCAA
67051 GTAGCGTGAC ACCGATTTGG GGATTTTGTT GATAGAGTAC CGGTTGGTAA
67101 GAAAGTTCTA GAGTCCATTC TGTAGGTACG TGGAATTTTG ACTGCCATTT
67151 TCCTTGGATT CCTAGAGGTA AGGTCAGATT ATAGAAAGGC TTTTGAGAGA
67201 CAAACTTTCG GGGATTGTCA CCAATCTCTT CGAACGCTGT TTGGTGAGAA
67251 CGTATTGCAA TTGCCTGAAC GAACGGGCTG AGGTGAAGAT AGGATTTCTG
67301 TTGCCAAGGG AAAGAACAGC CGATAGCTGC TGCTAATGTA TGGCTATAAC
67351 ACGTCCCTTC TGCCTGTTCT TGATGTGAGG GATGTAGGCT GTGGAGGTGA
67401 TGGTCCCCAT AGCCATACGC TAACACTGTG GATGTTGCAA AGGCCTCTTG
50


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
67451 GAACCACGGA AGCTCAACAT AAAGTGAAGA GACTGTATTG TGAGCCGAGA
67501 CGTTGTTGCT TGATCCGATT TCTTTAGTGC GGGTGAAGAA CTGTGCAAAA
67551 CCTAAGGAGA TTTTCTGATG TAAAGAAGTT TCGGAGGATG CTTGTAAGGA
67601 ATACCCTGTA GATTGGATAC GGAATCCTGG AGCCCCGGGG ATGCTATTTT
67651 GATGAACAAA GAGGCCGTCG GCAATCCCTT GAATTTCTAA GAAAGGCCTC
67701 TCGATATCAG AATCACCAGT TCGATTATAA CTTCTTAATA GAGAGAACAT
67751 AGTATGAAAG GATTGCCATA GGGGAGTCGT AGCAAGATCT CCTTGGTATT
67801 CAGGATTGAC CTTATATCCT AAGGGAGTCC AATTGGCATA CAGAGCTCTG
67851 TAGAGGGTGT TTGCCGTCTC TATAGAAGCG TTATTTGTTG TTGTTATCGT
67901 CTCTACCCAA TAAGGAGACC AGATGCCTTG ATAACCGTAA TGCTCAGTCG
67951 CATTTAAGCT TTCAGGATGA AAGTTATCGG TATTGATATG ACGTGCTGTT
68001 ACATCCGATA AAGAAAGAAG ATGAATGTTT TGTAAAGGCT CAGAGAGATC
68051 TATACTGTCG TAGGGATCGC GGTTTTCCTC ATTTAAGAGT GTCAGAGGAC
68101 CTGATAAAGT AATTGTAGGG TTATTGTCCT CTGTGAAAGG AGCACTAGAT
68151 TGTAGAGGAC GGATCCACAA GGTAGGAGCT TTTCCTTTTG CTAAGATCGA
68201 GGGGAGGTTA ATCGCAAGGT TATTAATGAT GACCTGGGAG CCTACACTAG
68251 TTGATGGAGT CTCAGAGTTG GCAGTTGTTG CAATACTCGC CGCATGCCCT
68301 AATTTAAGGA TACCTCCTTT TTGAGTGAAC TTATAGAATT GCCATCCCGC
68351 ACGATCCTCG ATAGAGAGGA CACCATTGCG AAGTTCAGAG GTATTTTTCG
68401 AGCTGCTAAT GAAATTATTT TCGTAGTCAG AAGCTTCTGG GATATAGGCT
68451 GAAGAAAATA AGATCGTTCC CTGATGGTTC GCATTGGGAT TAAAGATTAG
68501 AGGATTTGTA GTTGGATGTT GGTGTTCTAT AGGATCAAAA AAAGCAGTCG
68551 TATACCCCTT ATTAGCTCCA AGTTGTAAGT TGCTATTTGG TGTACAATGT
68601 ATGGCGTTGT ATCTACCAAA TGTGGTGAGG AAAACCTCAT TATTTTGAAA
68651 TGCGATATTT CCTTGTTCCG CGAAGAGTAG GCAGGTGCTG TCCTGTAGGA
68?O1 GCATAAGAGC ACCTCCCCAG TTTCCTTGAT TGTTGGTGAA ATATACGTGG
68751 CCACTATTTT TGATTGTCAA AAATTGTGTA CAGATAGCTC CGCCATCGCG
51


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
68801 AATGCAGTAG TTATTATTGA AAAGAATAGT TCCAGGGTTA TCGTCTATGG
68851 ATAGGTTTGT TGTATAAATC GCCCCTCCTG AACCATTTCC TGAATTTATC
68901 GAACCAGATA ACGCTGTGTT GTTATTGAAA ATCACCGACC CGGAGTTATT
68951 TTTTATCGCA ACAGTAACGC TTGTTTGAAT GGCCCCGCCA TTGTTCCCAC
69001 AGTTGTTCTT AAAATAAATA GGACGCGTGT TATCAGAGAT CGTTGTATTT
69051 TCACTACGAA GCGCACCCCC TCCACTAGGG GCTGTATTGT TAAAAAAGAG
69101 TAGAGGTGCC CTGTTGCTTT GGATGCGGCA GTGTCCATTG GTGGAGAGGG
69151 CTCCTCCCCA GTTGTTGACG GAATTGTTGA CAAAGTAGAA AGTCCCTTGA
69201 TTTTGAGAAA TCGTGAAGTC TCCATTACAG GCAATCGCAC CCCCACGAGT
69251 TTCTCCTCCT GTACTCGCAT TGTTAAGACC TCGATTGCTG AAAAAAATAA
69301 GGGGTCCTCT ATTCTTCGTG ATTGTGCAGG CTCCCTGGCA AGCAATCGCG
69351 CCTCCAGTCC CAATCGCGAG ATTTTTACTG AAGAAGGCAT GGTCTTCAAC
69401 ATTTGATAAT AAGAAATTAT TACAGGACAC AGCTCCCCCA GCCGATGTCC
69451 AAAGAAGAAG GATGTTATCA ATAGACTTGT AGTTAGAAAG TACAATGTCT
69501 TGAGAGGAAT TATGTCTATT TCCAACAAAC GTAGTTATTG GAGAAAATCC
69551 TGTAAGAGTG GAGAGAGAGT CTAAGAGAGG AAAGCTCGTA CGAAACTCTT
69601 CATCCCTCTC TAAAGCAAAC TTTTCAAGGG AGTCCGTTTG TAAACTATAC
69651 ACTGCAGGAG TCATCCCGAA CATGCAGGCT GTGAAATTCC CGAGATAGAA
69701 TAAAAACTTA GGAGGAGTCT TTGACACTAG AGGTTCCTTA ATCTTTCTGT
69751 TTTGGTCACT TATTGTTAAA ATCTCATTCT ACTCGCCACG TTTAAGTAGT
69801 GACTCAGCGT GGAGGAAGAA ATATCCGCAG AGTAATCTAA GGAGAGAGTG
69851 ACTTTAGGAA ACACCTGCAT GGTATTTTTC ACTTTGATCC CTAAAGCATT
69901 GTAGGTCACA GGAGTGGCCT GCGTCGTCCA CGTACCTTGG CTAATCAGTA
69951 ATTTCGAGTG GAGTTCAGGA TCTTGCCTAT AGAGAGTAGA GCGATAGGAA
70001 ATTTCTGTGA GC,CAGACTAG GGGAACTCGG TGGTGGTTCT TCCAAGAAGC
70051 GCGGATTCCT ACAGGGAGGG AGACGTCCGT TAGGGGGCGG TGTAGGGAAA
70101 ATTCCCGAGC ATGGTCTCCA GATTCTTGAA ACGCAGCAAG ATTTCCTCGG
52


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
70151 ATGGCTAAGG CAGTAATAAA GGGATAGATC TGCAGGGACT CGCCGTGAGG
70201 TTGAGGTAAG AAAACACAGG AGAGAGCCCC TGCTAAGGTA TGGTTGTGGA
70251 AAGATCCCTG AGAGTTCCCT TCCAGGAGAC CCTGATACAT TGTATGGGTA
70301 TGTTCCGAGG TAAACATATA AGCAAGAGAC ACAGATAGAC GTATCCACTC
70351 TTTGAAGAGA GTATTTTCTA TGCACATTCC AGAGAAATAG TGGTGAGAGG
70401 ACGTGCTATT TTGAGATTCA TGTTCTTTAG CTTTGGAGAA GAACTGAGCA
70451 AATCCTAAAG AGAAATTCGG ACTTTGAGAA GAGGTTGCTT CGGTGGTAGC
70501 ACTATAACCT GTCATATGAC TACGAAATCC CTTAAAACCG TTTTTGTCTT
70551 TTTGATGAAC CAGAAGACCA ATGCCTTGTA GGGAAGCTGC ATGACCCTTC
70601 TCTTCATCCC AGGAGGAGAG GGAGTGGAGT CCTGCAAGAG CCGTATATGC
70651 CGATTGCCAC AAGGCATTCG TAATGAATTC TCCTCGACGT TCGGGATGAG
70701 GACGGTAGCC TAGAGGAGAC CAGTTTGCAT AGAGCAGCTT GTGTTTTGTA
70751 TTCGCGCCTA GTAGAGATGT AGGGTTCGTG ATTGTTGTAG TTTCTACCCA
70801 ATAGGTCGAC CAGATGCCTT GATACCCATA GTGTTCGCCA GAATTTAATG
70851 TGGATAGATC CAGTTGCGAA GAGTTAATTT TTTGTGCAGC GACATCGACA
70901 ATATAAAGAA GGGGAACTTT CTCAAGAGAG TGCGAGAGAT CCAGACTATC
70951 GTAGGGATCT TCGTTGTTGC TGTTGCGTAA GGTGAGAGTT CCTGAGATTG
71001 TGATTGTCGG GTTGGAATCT TCAGTATAGG TAGATCCTGT TTTTGTGGGG
71051 TAAATCCAAA TTTTTGGAGC CTGAGCTTGA AAAGAAAGAA TAGAAGGAAG
71101 GTCAATGGCA ATGTGATTTA AAGTTATAGT ACTTCCTACT GTCGTTGGTG
71151 TTGAGGATGG TGTGGGAATC GTTCCTGCTG TCGTGATCAC CGCACCTTGA
71201 CCTAGAAGTA GAGTGCCTCC TCGTTGGAAG AACTTATAGC AGGCCAGCCC
71251 CGCACCATCT~TCAACAGCAA GGACTCCTTG ACGTAGTTCC GAAGTGTTCC
71301 TTAAATAGGA AAAGAAATTC ATTTCATCGG TAAAGTTCTG GTGTACATGT
71351 TCCCCTGAAA ATAAAACTGT ACCTGTATGA CCGGTTTCGA AATTAAAGAG
71401 TATGGGGAAG GAGGAAGGGA GCTCATGTTC TATGGGATCA TAGAACAGCA
71451 CTCGATAGCC GGGACGGGCT CCTATTTGCA GATTCATATT AGGAGTCGAG
53


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
71501 TGAATGGCGT TTCTGTATGG AGGATTGAGG GCATGCTTGG AGGCCGTATT
71551 ATTGTTAAAG ATAATATCTC CATTATCTGC AGATAAGATG AAGCTTCCGT
71601 TTCCAGAACC TGCTGATAAG TTGAGGAGAG CCCCTCCCCG AGTTGCAGTG
71651 TTATTTAAAA AGTATACAGG ACCATTTTCT TTGATAATGA TAGATTTCGC
71701 ATGAATGGCT CCACCGTTGC TCTGGCTTTG GTTATTGTTA AAGAGTACCC
71751 CTTCTCGGTT GTTCAATATC GTGCAAAAGG TGGTAGTAAG ACCTCCTCCT
71801 CCTGGATTGA AGTTCGATCC ATAGTTATTT GCGAACGCGG AATTTTCACT
71851 GATTTCTATG AGTTTTTTAT TCGAGGAGAT CGTGAGTGTT TGGGTAGAAA
71901 ATATGCCTCC CCCAGATCCT GAAGAGTTGC TTGTGATCTG TATAGCTCCA
71951 GAATTTCCCT CTATATTTAG. AGAGTTCCCA CTATAAATCC CTCCTCCTAA
72001 ACTGTCCGAA TTTAGTGCCC GATTCTGCTT GATTATGATC GGGCCTTTAT
72051 TGTCTTTAAT AGATAAGTTC GTCTCAGTAT AGAGGGCACC CCCCTTATTT
72101 AAAGCGAGAT TGTCAACGAA AGTTCCCTGT CCTAGGTTAT TAGTAATAGA
72151 GCAATTTATG GCAAAGAGAG CTCCACCCAA TAGTGATCCC GCAGTGGCTG
72201 TAGGATTGTC AGAGACCAAG TTTGTAGTAA ATGCATAGTT CTGATTCTTG
72251 GAGATCGTGC AATTTTGAGC AGCATAAATT GCCCCGCCAG AATTGGGACA
72301 GACATTCTTC TCAAAGAAGA CATTCCCTAT ATTTTCAGAG ATCAGAAGAT
72351 TCTTACAGGT AAGAGCACCT CCATTCGACC GATAGTACTT ATAGTCCAAG
72401 ATGAAATCAT TGTGATTCCC GACAATTGCG AGATCTTGAT TTTGGTTATG
72451 AGTAAACCCT ACTTGAGGGG CTGCTTGATA TTCAGGACTT AATGTAATAT
72501 AGGTCTCCAA AGGAAGTTGG AGACCTTCAT TAGCCAATAC AAAAGTAAAA
72551 GGAAGCAACA TTCCGAAGCA AAAAAAGCGC ATACTCGTTC ACAAATAGAA
72601 AGAAATATTC TGAATAAATC AATGCTAGAT TTTTTGTCTA TAATTTGTTT
72651 TATAGTGTCT AATTATGAGT AGTTTAAAGT TATCTTATTT AAGACAAGAC
72701 GATTTTAATT TCCTTTTTTC ACAATAAAAT GCGAGTCCAA GAGTTTTAGT
72751 AAACGTACAT CATACTTTGG AAAATGGCGG CAAGTAGTCA GAGGATCAAG
72801 CTCTTTGCAT ACTTTGCTTA GAGCAGTTTT TGGTTCTCAG GGTCGTTTGC
54


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
72851 CTTTAGGGAG CTTGTGATGT TTGGGATCGG AGAGCCTCCT TATATCCCTA
72901 GGGGATGCCC TAGGAACTCT TCCGAAACAC CGAGGGTCTG TTAGAGATAA
72951 AAACAAAGGA CCATCGGGGA GACTTGTACT CATAAGAGCC ACTTATCTCT
73001 ATTCCTATAG ATTTTTGCTG ATTTTGATTA TTAGAAAATA ATAGATTTTT
73051 TCTGTTTTTT AAAGTAAAGT ATTTTTTAAA AGACTCATTT TTAATGAGTT
73101 ATTACTTTTC TCTTTGGTAT CTGAAGGTGC AACAGCACTT TCAAGCAGCA
73151 TTTGATTTTA CTCGCTCCCT GTGTTCACGA ATTTCTAATT TTGCTTTGGG
73201 AGTGATTGCA TTGCTTCCTA TTATTGGGCA GTTGTATGTA GGGCTGGACT
73251 GGCTCCTCTC TAGGATAAAA AAGCCAGAAT TTCCTTCCGA TGTGGATCAG
73301 ATCGTGCGAG TAGAACACGT CGTGGGTCAC GACCATAGAA GTCGAGTTGA
73351 AGATATTCTA AAGAGACAAA GGCTCTCATT AGAGCCTAGA GACGAGGGGA
73401 AGGTTCACGG AGATCTGCCT TCAGCTCCTT TTTTTTGATA TCCAAAGTCT
73451 CAAGTTCCTA CAGTTGTTCT CTGAGGGGAC AGCTCTAAAT TTATTTCGTA
73501 TATTTGCTCC ACTACGCAAC CGTGTGACTA CAGAATACAG TCGTGCTAGG
73551 CAACCCGACC TACATAGAAT TGCCATCGTC TATATAGGAG TTCTCGATTC
73601 AGAAAGTTCC AAGATCCTAG AGCGGCTAAT CTCTTATATG AGTTGTATCT
73651 ATTCTGAATC GCAAATGTAT TTAAGATTCT TTATGGGCAA GAATGTAAAT
73701 CAAAGTGCTG TACTCTCAAA ATTACATGTA GAAAATCTGC ACATCCGTTG
73751 TGGGTTTTTC AGCGAGGATG CTGTTCCAGA GAGTGAGCCC TTCGATCTCT
73801 CCATCTACGT GCACACAGAT CGTAGCTGTC CTCTCCCTAC GAAAAAACGG
73851 AGCAGCTCCT GGGAACTCCA AACTGTAGAA CTCCCAGAGT CAATATATCC
73901 ACAGTCGGAA TTCCTATTGA TGAGACCTCG AATGCTTTCG TAGACTCTAT
73951 GATGAAACAA GGAGTCGGGC AGGATGCTAA AGAGCTATAC ACATTTCTAT
74001 CTCGTGGGAA TGAGCATTAC CAACCGTGTC TATGGTTCAG TCTCGAAGAG
74051 GAACTCGGAT TCCTTTTCGA TGAAAAAATG CTGTGCGCCC CTCTATCTGA
79101 GGATCACTAT TGCCACTCGT ATCTTGTAGA TCTAGTGGAT CAACATTTAA
74151 AGGATTTAAT ATTATCGATG TTTTTAGATC CTCAGAATAT CTCAGCAGGA
55


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
74201 GAACTCCTCA AGGTCTCTAT AAACGTTGGA GATTCTTTTT CTCCTCTACA
74251 ACAGAAAGAT TTCCTCTCGA TGGTCTTACG TGATGAAACG GGAAAAAACG
74301 TCGTCGTGGT TTTTAAAGGA GTTCTCTCCT TACCCGCAAC CCAAGTCTGC
74351 AAATTAGTAG AGGAATTGAA CTCTAAGGAC TACTCCTACC TCAATATATT
74401 TTCTTGTCAC GGAGATAGTA GTCCTCAGCT TTTATTCCGT AAGGAATTAG
74951 AGGGAACTTC AGGGCGTTAT TTTACAGTGA TTTGCGCTTT ATATCTAGGG
74501 GATACAGACA TGCGTAGTTT ACAACTTGCT TCTGAAAGGA TCATGGTCTC
74551 TAGAGAGTTT GATCTTGTAG ATGCCTATGC TGCAAGATGC AAGCTCTTGA
74601 AAATCGATCA TACAAATTGG AGACCTGGAA CTTTCAGTCG CCACGCCGAT
74651 TTCGCAGATG CTGTAGACGT ATCAGCAGGA TTTAACTCAA GAGAATTTAA
74701 ACTGATTACG CAGGCGAATC AAGGGATCCT AGAGTCTGGA GAACTCCCGC
74751 TCCCTTCAAA AACCTTCTGG GAAGGATTCT TAGCATTCTG TGATCGAGTG
74801 ACTGTCACGA GACACTTCAT TCCAATGTTA GACGCCGCTA TAAAGCAAGC
74851 GGTATGGACT CATAAACATC CCAGCTTGAT AGATAAAGAG TGTGAAGCCC
74901 TAGACTTGAA AACACAGTGC TTGCCATCTA TCGTATCGTA CCTTGAATAT
74951 GTCACAAACT CTCACGAAAA AACATCGAAA GGCCCGTTCA TACAAAAAGA
75001 GATTATCGCA GACTGTTCTC CTCTTAAAGA GGCGCTCTTC CCAGGTTCTG
75051 ATGAAGATGT TCCCTCTACC TCTGAGGATC CTTCAGATGA TCATCCTTCG
75101 GATCTTGAAG ACTCTTAATT AGTTGCGATA GAATTCAATT TTTTATATAA
75151 AAACTATCGT GTTGTTCTTA TTAAAAGATA GTTAATTTTC TATCTTTTTT
75201 TAAATCTTTA TATAGCCTGC GTACGCTTTC ATTTTCAATG TTGGTTTGAT
75251 CCTATGGCAT GCTATATTTC TATTTGGATA TCTACAGTTA AGCAGCATTT
75301 TATTAGGGCT TTTGATTTTA CACGTCCTCT TGGTTCTCGG ATTACAAATT
75351 TTGCTTTGGG GGTCATCAAG GCTATTCCCA TTTTAGGATG CGTTGTTATA
75401 GGGGTAAGTT GGCTAGTTTC CACATGTTCT GCACGAAGGT TTGGGAAACC
75451 GGCATTTACT TCTGACGTTG CTAGTATCGT GAAAATAGAA AAAACTCGAG
75501 GTTATAATCC CCTTGCTTGG GTGGAACAGT ACTTGAGACA GCTTAGGGTT
56


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
75551 CGACTTCCTG AAGGAGATTT AGGAAAAATC CATGGGAAGG TCTCCAGAGA
75601 TTATGTTTGC GACAGGACTC CCCAAGAAAA TCTGAATATG GTTCCTCATC
75651 AATATCTGGG AGAGCTAGGT CGCGCGTTTT ATGGAATCCG CAACCGAGTA
75701 ACCAAGGCGT ATCAACGAGT CACTCCTCTG GAAGTCCCTT GTCTTACGCT
75751 CGTCGGTTTT GACATTTTAG ATCCCGAAGA TCAGGTGAAT TTCGTTCGTC
75801 TGGCTAACGG CATACAAACT CAGTACCCCC AAACTCAAAT AAAACTTTAT
75851 TTAATCTCTA TCCAAAAGAT ATGGAATCAG TGTGACGGTA CGATTTCTCA
75901 AGAAAAAGAA CAGCAACTCC GCTCTCTAGG TTTGGATGCT AAAATCAAAT
75951 GTGTGTCGGC CCCCGCTCTC CTGCTCCAGA AATATCTTCA ATCCGAGAAC
76001 TTGCCTTCCT GTGATCTTCT CATTAATTAT TACGGGAAAC AACAGTCCGT
76051 CAGAGACGTG GACTCTATAA AGAGTCTACT CAATCTTTCT TCCGAACATA
76101 TCCCTGCGAT TTCTGTAACC TATAGACCTG ACGATCCTTT TTATAGCTAC
76151 TATTTCTTTC CTGGTTCTCA AGGAGGAACG GCACCCGATC AGAGGATCCC
76201 TTGGAGTGAG CAGGAGCATC TTCAAACGTA TACCACCCTG TCTAACCCTA
76251 GATGTGATAG ATATGCTGTT CACTTGGGAA TGGAAGATTT TGCCTCTGGA
76301 GTATTTTTAG ATCCTCTTAG GGTTTCGGCT CCTTTATCTG GAGAGTATTC
76351 CTGCCCCTCA TACCTCTTAG ATTTAAAAAG TGAAGAGCTT CGTTGTTTCT
76401 TGTTATCCGC TTTTATAGAT CCCAACAATT CTGGTCAGGG AAATCCGCGT
76451 CCTATGTCCA TAAACTTTGG AAACTCTCCT TTGGGTCAGA GGTGGTCTGA
76501 GTTTCTATCT CGTGTTCTAC ATGATGAAAC AGAAAAGCAT GTGGCTGTAG
76551 TCTGCAATAA TCCACAACTT ATAAAAAAGA GTTTTCCCTC ACATTCTTTA
76601 TCTCTATTAG AGAACGAACT GGAAGAGTCA GGTTATTCTT ATTTGAATAT
76651 CGTTTCAGTG AGTCAGGAAC GCACGTGTGT TAAGGAACGT AGAATTTTAA
76701 GTTCTGATCC TTCGGGGAGG TCATTCACTG TAATCCTCAC TGATCTTCCT
76751 GAAGGGAGTT CGGATATCCG CAACTTGCAG CTAGCGTCAG ATAGGATCTT
76801 AGTTTCTAGT GCTCTCGATG CTGCTGATGC CTGTGCTTCT GAATGTAAGA
76851 TCTTAGAATA TGAGGATCCC GAGCAAGAGT GGGCGCAACA GTATGCGTCG
57


CA 02350775 2001-05-11
WO 00/27994 PCTNS99126923
76901 TTCTATAGAA ACATCGACAG GGCAGGCGAT CTTCAACGTC AGGGGATTCC
76951 AGGAGAGCCT TTAGGGGTCT CAGCATCTAC GAGAGTAGTT TTAGAAAAGG
77001 ACATCGTATT CAATCTCAAT GCGGTAATCC AACAGGCCAT GTGGAAGTTT
77051 AAAAAACGGG ATCTTTTTGC TGTAGAAAGT CAGGCTTTAG GAGATGACAT
77101 GCGACGTGCT TTAGAAGGTT ATATCGGCAG CAGTCTCTTA GTTGAGGGGA
77151 CTATACAGCC TCAAGTCGCA TGTAATGTCA ATGTGAGTTT TGCTACGTTA
77201 GACGAGGCTG TGTGTGCAGC TTGTGACTCA GCTCAAGATG CACCTTCTGA
77251 GGAGAACAAT ACAGATGACT AAAGATCGCA ATCTTGTGAA CGAAATCGCA
77301 GATTGATGGG AACTAATTAG ACACACCTTT CTAAGGTGTT TGTTTTGATG
77351 AACCTTTTTA TTAGTCCAGC AGAGCTCTTT TTTGAAGATT CTTCTTTTTT
77401 CTTAGGTCAT TCTGGGTTTT TTGAAGGTAT CGAGGGTTCT TATTGTCTAG
77451 TTGTCTATAG AGGGTATCGA GGTTTTTTCT CTTAGGTATC CCACGATTCT
77501 TTTGTATAGA AAAATTTTAT GAAAGCTTGA ACTCTTTACA CTGACTTTTT
77551 ATTTTTCAAA TAAAAACGTT TTTAAAAATA TTATTATCAT AATTAGATAC
77601 TTATTTGTTT TAATGTCTTA TTTGATTAAA ATAACTTTGT TAAAATTTTT
77651 ATACATAAAT TTCTATTGTG GCTTGTCCAA GTATTTCTTC TTGGTTTACT
77701 GTCGTTCGAC AGCATTTTGT AAACGCCTTT GATTTCACCC ATCCCGTTTG
77751 TTCTCGGATT ACAAATTTTG CTTTGGGGAT CATTAAGGCA ATTCCCGTAT
77801 TAGGACACAT TGTCATGGGA ATCGAGTGGT TGATTTCCTG GATTCCCAGA
77851 CACACCGTTC GTCATGGAAT GTTTACTTCT GATGTCTCTA GTGCTATTAA
77901 AGTAGAACAA ACACGGGGTC ATAATTGTTT AGCTCCCCTA GAAGCCTATT
77951 TAAGTAGCTT GAGAGTCCCC ATTTCCCAAG AAGATCTAGG CAAAGTACAC
78001 GGGAGAACCC CAGAAGATCC C.TTCGTAGAT ATCACACCCA CAGAAATTGT
78051 CCAACTTCTC CCTGATGAAG AACTCTCTAC TGTAGATGAG GCACTGCAAG
78101 GCGTTCGTAG TAGGTTAACC TATGCCTATA GGTCCGTAGA GAAACCTATG
78151 ATTCAAGATC TTGCTCTTGT GGGTTTTGGT CTCCGAGATT CTGCGGACCT
78201 CATAAATTTC GTGCGTCTTG CTAATGGCGT GCAGAATCAC TATCCCCATA
58


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
78251 CTAAAGTGAA GCTCTATTTA GCGAAGAACT TGGCAGATGT CTGGGACTGT
78301 GAAATTTCTG AAGAGGAAAA AGGGCAACTC CGAGCTCTAG GTTTAGACCC
78351 TAAAATAGAG AGTATATCCC TTACGAGTGC AGGTCTTCCT TCAGTGCCAG
78401 AAGTCGCTAC TGTCGATTTT ATGATTACCT GTTACGGGAA AGATCAGGAA
78451 GTCCAAGATC CCTAGGTGAT ACAACATCTT CTAAACTTTG CTCTAGAAGA
78501 GACCCCTTCC ATTTCCGTGC AATACCAAGA ACAAGAGAAG CTCTCTCCGT
78551 GCGATCATTC CCCAGAAATA GGTAAAAAGA AAAGATGGAA TAAGCTGGAA
78601 TCCTTCTCCA CGTATTGTTC TCTGTTTATG TCTGTTAAGG ATCATTATAA
78651 GCTGAATCTA GGAATTCAGA ATTCCCTGTC AGGGTGGCTT CTGGATCCCT
78701 ATAGGGTTTG CGCGCCTTTA TCTTCACCGT ACTCGTGTCC TTCCTATCTT
78751 TTAGATTTGC AAAACAAAGA GCTACGTCGT TCCCTTCTGT CAACGTTTCT
78801 AGACCCTAAA AATCTCACTA GCGAAACATT CCGTTCTGTC TCTATAAACT
78851 TTGGCAACTC TTCGTTTGGA CAGAGATGGT CAGAGTTTCT ATCTCGTGTT
78901 CTGCACGACG AGAAAGAAAA GCACGTAGCT GTTGTTTGTA ATGATGCAAA
78951 ACTTCTGGAA GAAGGATTGT CCCCAGAGGC ATTGTCTCTA TTAGAAGAAG
79001 ACTTAAGAGA ATCAGGGTAT TCGTATCTAA ACATTCTCTC GGTGAGCCCC
79051 GAAGGAGTCT CCAAGGTTCA GGAACGTCAG ATTCTAAGGC GAGATCTCCA
79101 AGGACGGTCC TTTACTGTCA TGATTACAGA TCTTCCTTTA GGTAGCGAAG
79151 ATATCCGTAG TTTACAATTA GCCTCGGATA GGATTTTAGT CTCCAGTTCT
79201 CTTGATGCCG CGGATGCATG TGCTTCGGGA TGTAAAGTCT TAGTCTACGA
79251 AAATCCAAAT GCATCCTGGG CTCAGGAATT GGAGAACTTC TACAAACAAG
79301 TTGAGAGAAG AAGGTAGTGT TTCTTTCAGA GAATATTTCA GAGCCTATAT
79351 GTGTGATAAA'ATCGTGGCAC AGAAGAACTT CTTATTTACT TTAGACGCTG
79901 TAATTAAACA GGCCGGTTGG AGATCACAAG AGAAACTCAA TTTATTTTAT
79451 GTTGAAAGTC AGGCTTTAGG AAGAGAAATC AAAGTCAGCT TAGAGGAATA
79501 TATTCAGAGT ATGGTCGGGA TTTTGGG'ATC TCAGAGAACC AAGAAAAGCT
79551 TTAAGTTTTC TGTCGACTTT ACCCCTTTAG AGCAGGCTCT ACAAGAAAGA
59


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
79601 TGCTCTTCTG ATGATGACGA AGATGCAACA'GCAACTTCGA CCGCTACAGG
79651 GGCAACAGCA TCTCCGACTG ACATGCACGA AGATGAGTAA CGTTTGTCTG
79701 ATACCTTAAA AGTTCCTTGC AAAGGGCTCC CTGAAAACTA AATTCCCTCA
79751 GAATCTCGAA TTCTCCTGAC TCTGAAACAA TCTTAGGTTT TCCTGAATAG
79801 AATCTGACTG AAATTTCTGC TCGAATCTAA GGGCTGTTTC TTATTTTACC
79851 CCTAGATGAG GATATTAAAT CCAAGCTAGG ACTTCAAAAG TAGTTGGTTA
79901 TTAGTTTATT AAAGAAAATA ATACTAAAAA TATTTAAAAG CTGTTTATTC
79951 AATTTAATTG ATATTTTCTA TGTTGTTATT TAAAATTGTT TGTTTCTAAT
80001 TTTATTTTTT TTGTTGTTAT GCCAATTCCC TATATTTCTT CTTGGATTTC
80051 TACCGTTCGA CAGCATTTTG TTAAGGCGTT TGATTTCTCT CGTCCCTTT~'
80101 GTTCTAGGGT TACGAATTTT GCTTTAGGGG TCATCAAGGC CATCCCTATT
80151 GTAGGACATA TTGTCATGGG GATGGAGTGG TTAGTTTCTT CCTGTGTTGC
80201 CGGGATTATT ACTAGGTCCT CCTTTACCTC AGATGTCGTT CAGATTGTAA
80251 AGACTGAGAA GGCGTTAGGT CGAGATCATA TATCTCGAGT GGCGGAGATA
80301 TTGCAAAGAG AAAGGGGGAC CATAACTCCT GAGAATCAAG ATAAGGTGCA
80351 TGGGAAGTTT CCTGTCTGTC CTTTTGGTCG TTTAAAATCC GAGGAAACTT
80401 TAAAACTTAA GCCGGGAGAA AGAGAGGGAA CTTTAGATAC TGTATTTTCT
80451 CCGATTCGCA CGCGCGTGAC TCGTGCGTAC TTACAGGCCC CCCGACCCGA
80501 AATACGTACG ATTTCTATTG TGGGTTCGAA ACTTAAAACT CCTCAAGATT
80551 TCTCGCAATT TGTGAGTCTC GCGAATGAAA CGCAGAGACT GCATCCTGAA
80601 GCGTTAGTTT GTCTGTATTT GACAGGCTTG AATCGCGAAT CTCAGATGTG
80651 CGATACAACT ACTGCAGAGA AGAAGCAGTA CCTACATAAC TCAGGTCTCG
80701 ACTCTAGAAT CCAGTGCAAA GACAGTAAAG AAGACGACGC TGGCTCTCCT
80751 GAAAATCCCG AACTTTGGAT TGGCTATTAT TCACGAGAGC AACAGCATAA
80801 TATAGACGGG CAGTATATTC AGCAGTGTCT AGGGAAGAGT GCAGATCCAA
80851 TTCCTTGGAT TCATGTTACT GAAGACACAA AGGATTTTTA TTACCCACCA
80901 AACTTTACTT CATACTCACA TACAAGACAA TCTACAGACC CAACATCGCC
60


CA 02350775 2001-05-11
WO 00127994 PCT/US99/26923
80951 ACCAAGACTC CCTGAAAGTG AGGGGGATAA GGATTCCTTG TACGGACAAC
81001 TGAGTCGATC GTATCACCAT GAGTATATGC TTGGTTTGGG ATTAAAACCA
81051 GAGGATGCAG GACTCCTGAT GGACCCGGAT AGAATCTATG CTCCTCTATC
81101 CCAAGGGCAT TATTGTCATT CCTACCTTGC GGATATAGAA AATGAGGATC
81151 TACGAACTTT AGTCCTTTCG CCTTTCCTAG ATCCTGGCAA TCTTAGTAGC
81201 GAGGATCTTC GTCCTGTAGC ATTCAATATC GCTAGATTGC CATTAGAATT
81251 GGACTCGTTA TTTTTCCGCC TTGTTGCGGG TCAGCAAGAA GGGAGAAACA
81301 TAGTTACCCT TGCCCACGGA ACTCCTCGTC CAGAAGATCT TGATCCTGAC
81351 TCAATGAACA TTCTGACCAG AAGATTACAA ATGTCTGGAT ATAGCTATTT
81401 GAACATTTTC TCCTATAAAT CACGGAAAAT GATTGTAAAA GAACGTCAGT
81451 TCTTTGGAGA TCGTTCTGAA GGGAAGTCTT TCACATTGAT CTTATTTGAG
81501 GATCCCATTA GTGCAGCAGA TTTCCGTTGT TTGCAGCTAG CTGCAGAAGG
81551 TATGGTTGCT AAGGATCTCC CCAGCGTAGC AGATATTTGT GCCTCTGGAT
81601 GTTCCTGCAT TCAGTTTTCT GAGATGCAGA GTCCTCAGGC TATTGAATAT
81651 AGACAATGGG AGGCACGTGT CGAAGATGAA GCAGGAGAAG AAGCCAGAGA
81701 ACCAGTAATT TATTCTCAGG ATCAATTGAG CAGCATGCTC ACTACACAAC
81751 AGAATTTTGT ATTTTCTCTA GATGCTGTGG TAAAACAGGC GATCTGGAGA
81801 TTCCGTTCGA AAGGTCTTCT TACTATGGAA AGAAAGGCAC TAGGCGAGGA
81851 GTTCTTAACT GCGATATTTT CCTATTTAGG GAGTCAGGAG CGTAATGAGA
81901 ATATGGGGAA AAGAACTACC GAAGAACATG AGGTCGTTAT CAGCTTCGAA
81951 GAGCTAGATC GCATGGTGCA AGTCCTCCCA GCCGAAGTCC CTGCAGATTC
82001 AGGCAATGAT CCTACGCGTC CCGTTCCTAA TCCAGATAGT AACCCTGATT
82051 CCTCGCAAAA TGAAGGCAGT TAGAAAGTAA AAATACTAGA GAAATTTCTT
82101 ATCTCCAGAT GGAATCCGTG GTCCATGTAC CTAGGATTCC AGGAAGGGTT
82151 TGCCTAGGAA TATCTTAATT TCACAAACCC CAC~GAGTGCA CAAACTCCTT
82201 AGAGGACTCT CTCTATATTT GTTCTTTCTA CACTAGTATT CCGTAGATTT
82251 CTGATTTCCA GGATAGGATT CTAAATAGTT GTATCTAAGC GCTTTTTACA
61


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
82301 ACCTTTCTCA GGTCTCCTCT TTCTAATTTT AAAAATAGCG AATTTCTTGT
82351 GGCTATAGGA GTCGCTAAGT ATCTTAGAGA TTGCTTTTTT CAAGTTTTTT
82401 TTAAATACTT CTTTCCTAAG TATTTCTTCC TAGTCGTGTT ATGGCTTCTT
82451 GTTTATCTGC CTGGTTTTCT ATAGTTCGTG AGCACTTTTA TCGAGCCTTT
82501 GATTTTTCTT TGCCGTTTTG TGCTCGTATT ACGGAATTTG TATTAGGGGT
82551 CATCAAGGGG ATCCCTGTTG TGGGTCACAT TATTGTTGGG ATAGAGTGGC
82601 TCGTTTCTAG GTATTTAGAG AGTTTCGTGA CCAAGCCGAC ATTTGTCTCT
82651 GATGTGGTGA GTCTTCTGAA AACAGAGAAA GTTGCTGGTC GCGATCACAT
82701 TGCTCGTGTA GTGGAGACTT TGAAGAGGCA GAGAGTCGCT GTGGCTCCTG
82751 AAGATGAGGA TAAGGTCCAT GGGAAGATTC CTGTGCATCC TTTCGGGGGA
82801 ATCCAACCTG TAGAAGTTCT CACTCTCTAT CCCGAAGTTC AAGATGCAAC
82851 GTTAGGGCTT GCCTTCTCTA AAATTCGTAA TCGTGTAAGA CAGGCGTATT
82901 TGCAAGCTCC ACGGCCAAAA CTGCAGAAGA TTTACATCAT AGGAAACGAT
82951 ATGAATCCTT TTGAAGTTGA CGACTTCTTG CATCTAGCCC GTCTCTGTAA
83001 TGAAACTCAA AGACTCTATC CTGACGCTAC GATTTCTCTA TATCTAACAG
83051 CTTCTGGTGG TCGCAATGCT ATGGACAAAA AGAATCGGAA GTTACTTAGT
83101 GATTGCGAAC TAAACCCCAA GATTGCTTGT TTGGACTTTA ATCAGGGTGA
83151 TGTAGTCAAA CAAGCAACTT GTGACTGTTG GATGGTGTAT CATGGGGAGA
83201 ATGATCAAGG TACGTTGAAT CAGATTCAGG AAGAGTTAGA AAAGTCAGGG
83251 GAGGAAACCC CTTGGATTCA TGTGGGGCAA AAGCCTCTTT CACAATCCTT
83301 GTGGGATTTC TCTCCATTTT CATCTTTGGA GATGAAGGGA GATAAAGAGA
83351 AAGCTCTAGA GTACTCTGAA TTAGAAAAAG AACAGCTATA TTCTCGATTG
83401 GTATACGTAG GAGAGCGCTC TTCGGTTCTT AGTTTGGGGT TTGGAGATAG
83451 TCGGTCAGGG ATCTTGATGG ACCCAAAACG GGTGCATGCT CCCTTATCTG
83501 AAGGGCATTA TTGTCATTCC TACCTTGCAG ACTTAGAAAA TCCCGGGTTA
83551 CAAAAAACAA TTTTAGCGGC ATTTCTGAAT CCTAAGGAGT TGAGCAGTAC
83601 CATACTGCAA CCTATATCTC TAAATCTTAT CTTRAATAGC AAAACTTACT
62


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
83651 TAAGGCAGCA CTTTGGCTTT TTTGAGAGGA TGAGCAGAAG TGATCGCAAT
83701 GTGGTTGTCG TTGTATGTGA TTCTTGGTGG GGTACCGACT GGAAGGAGGA
83751 GCCAAGCTTC CAACACTTTA TTATGGAGCT AGAGTGTCGA GGGTATTCGC
83801 ACTTCAATAT TTTTGCCTTT AGATCTAATA GCATGTGTGT AGAAGAACGT
83851 AGGATCTTAA ATGAAAGTTC TCAAGAGAAA GCCTTTACCA TGATTTTCTG
83901 TGAGGATTCA GTATCTCAAG GAGATATCCG CTGTTTGCAT TTGGCGTCTG
83951 AAGGAATGCT TTGTGGTAAA GAGTGCTATG CTGTCGATGT CTATACGTCA
84001 GGATGCGCGA ACTTTATGAT GGAAGAAGTC TTAACTTTGG AGCGAGAATC
84051 TAATCTGTGG AATAGAAAGC ATGGTCTTTG GAAAAGAGAA GTTAGAAAAC
84101 AGAAACAAGA AGCTGCTTTG GATCAAGACG AGAGCGAGAT TTACGTTTGT
84151 AATCAGCTGA CGGCGCAACA GAACTTCGCT TGTTCTTGAG ATGCTGCAAT
84201 CCGCCAGTCT ATATGGAGAT CCCGTATGCC AGAACTTCTC TCTATTGAGA
84251 GACGGGCGTT AGGGGAACAA CTCTTTACTA CTGTACATCA CTACCTAACA
84301 ACGCAAAAAA AGATCCTCAG GGGAATCTAG AAACGCAGCA ATCCGCGCAA
84351 TTGTCTATAG ATTTCACAGC ATTAGATGAA GCTGTTGAAT CTCTAGGATC
89401 GACTCTTAGC AGAGCTCCTT CAGAAATATC TCCAATTCCA GAGGAGGAAG
84451 CTCACTTAGG AGCCAACAAA TAGAGACAAA GAAAATTCGA CGGTTTGAGG
84501 ATAACGATAC GCAATGTCTA AGCTTTGAAT CAGGATACTC TGCTTTACAG
84551 GCGTGTCTTT GTGCCTATGG TCTCTCTCTC ATAACAGAGT CTCTCAAATC
84601 TAATTGTCAG AACCTATTTC CCCTAAGAAT CGATAACGTA TTTGTTAGGA
84651 AAACGTGCTC AACAAGAGCG TTTTATTTTC AGTGTACTTG ATGTCTAATA
84701 CAAATTGTTT ATTTTGTATT TTTGGCACAT CATTTAAATT CCTTGTACTT
89751 TTGAATCTAA ACGTAAATTT CTTATGACTC ATTGCTTACA TGGTTGGTTT
84801 TCTGTAGTTC GTCATCACTT TGTGCAGGCG TTTAATTTCT CACGTCCTTT
84851 ATATTCTCGA ATTACCCACT TCGCTTTAGG GGTGATTAAG GCCATCCCCA
84901 TTGTAGGGCA TCTTGTTATG GGAGTCGATT GGTTGATCTC TCATTGCTTC
84951 GAGAGGGGAG TCTCACACCC TGGGTTCCCT TCAGATATTG CTCCTATACT
63

CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
85001 GAAAGTAGAA AAGATCGCGG GCCGAGATCA TATTTCTAGA ATCGAAAATC
85051 AGCTAAAGAG CCTTAGGAAA ACTATCGAGG TTGAAGATCT AGATAAAGTC
85101 CACGGGCAAT ATCAAGAGAA TCCTTATGCA GATATGGCCT CTAGTGAGGT
85151 TCTTAAACTC GATAAGGGAG TTCATGTTAG CGAGCTTGGC AAAGCCTTTT
85201 CTAGAGTTCG CAATCGCATC ACCAGATCCT ATAGTTATGC CCCTACTCCT
85251 CAGTTGGACT CTATAGCTAT TGTTGGTATA GATCTCGTCA GTCCTGAAGA
85301 ACAAGAGAAT TTAGTACGCT TGGCGAATGA GGTCATTCAA CTCTATCCCA
85351 AATCAAAGAC AACTCTATAT CTTCTTATCG ATTTTAATAA GGAGTGGGTA
85401 GGGGATATCT CCTCTGATAA GGAAAAACAG CTCCGTTCTC TAGGTCTACA
85451 TTCTGAAGTT CAGTGTCTTT CCGTCTTGGA ACCTCAGGGT GCCGAGGGCG
85501 AAGATACGAA ACACTTTGAC CTTATGGTCG GCTGTTATGG GAAGGATTCT
85551 TACTTAAGGG AGGGTAAAAT TTTACAGCAG GCCCTAGGGA CTTCGTTAGG
85601 TACTGTTCCC TGGGTGAATG TTATGCACAC ATTGCCATCT AGGTATAGAT
85651 CTCGGCTTTC CTTACCTATA AATACCGAAA AGGATAAGAC AGAGCTTTAT
85701 AAAGAGATTT CTCGTACACA CCATCAGTTG CATACTTTGG GAATGGGACT
85752 TGGAGCCCAG GATTCAGGAT TGCTCTTAGA CCGGCAACGA CTCCATGCTC
85801 CTTTATCTCA AGGGTCTCAC TGCCATTCCT ATCTTGCAGA TCTCACCCAT
85851 GAAGAGCTGA AAATTTTGTT ATTTTCAGCA TTTGTGGATG CTAAGAACAT
85901 AAGTAAGAAA GAGCTTCGTG AGGTATCTCT AAATTTTGCT AACGATACTT
85951 CCGTAGAGTG TGGCTGCGCT TTTTACTTTT AGTGTCCTAT GATGAGAAGG
86001 AGAAAGACGT AGTTGTCGTT TGTAATCATT CTGAACCTAA TATCCTCGGC
86051 CTGCCTCCTG AAGCAGTCTC TCAGCTTATT GAAGAGCTTA GCGATGAAGG
86101 CTATAGCTAT CTGAATGTAG TGCGTTGTGA TCTCTCCGGG GAGACTACGG
86151 TTCAACAACG TCTGCTATTG AATGCCGATG AAGGGAGATC TATGACGGTG
86201 GTGATCTCAG AC;CTTCCTGA AGGGCACCCC GATATTCGGA ATTTGCAGTT
86251 GGCATCCGAA AGAATTTTTG TTTCTCGTGA AAAAGAAGCT GCTGATGCCT
86301 ATGCTTCAGG ATGTAAAGTG GTCGCTTTCG ATGATGAGCA TCTCCCTTGG
64


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
86351 GTCTCCAGTC ATATTGCCTA CGCGGAGGAG ATCAGAGAGA AACAAGAACA
86401 AACAATGCAA GGGTCTTTAA CTGAAGAGCA GTTAGGAGCA CTCCTCTGCA
86451 ACACAGTCTC CACAGAGAAA AATCTAGCCT TTGCTCTAGA CGCCGTGATA
86501 AAACAGTCTG TGTGGAGATT CCGCAATCCG GATCTTTTTG CTTATGAGAG
86551 AGAAGCTCTA GAGGCTTCAG TAACAGATGC TTTAGTATCT TACGTTTCAA
86601 ATTTAGACAT GATACCGTAC ACAAGTTCTC AGGGCATAGT CATAGAAGAT
86651 AGTAGTATCG TCCGTACCTC TCAAGAGCAT ACACTCATTG TGAACTGTGC
86701 AGCATTCGAT AAGTTAGCGA GCCAAATAGA GTTCTTATGC CCCAGTGACG
86751 TGTTGCCCAT TTCTGGTAAA GACCCTTTGA TTTCTGATGA TGAGGATGAG
86801 GAACTGAATC CTAAAGTTTC ATCTGCTGCA GACTCTAAAG ATAAAACCTA
$6851 GGGAGTGAAT TCTACACGAG AATCGAGAGG AGAGCGAGTC TTTCAAGGAT
86901 TCATAATCCT TGTTAACGTA TGCATAAACA AGTGAAGCCA TTGCAACGTG
86951 AAGTAATCGC ATTGATGAAA GATGCTTTCC CTAGGGATGC AAAGCAGATC
87001 GTATTCCCTT CTTTCCAAAA CAGACTATAG ATTCAAAAGA TATTCTTTCT
87051 TTTCAAATAG ACTTGAGAGA GGGGGGGGGG TTCTCATAGA GTGAGAATCT
87101 TGGCCTTCAT TGCTAAGTTC TTCGATGATG GATAGAGGAT TCTAAGACGA
87151 CCGGGGCTAC AGAAAACTCT AAGCAGAGCT TAGAGTTTTA AAATGTGGAT
87201 TTTAGTCCTG TAGACACTCG GTGGTTTGTA AATCCATTTT TCCCGTCAAA
87251 GGTATAGTTT AGAAAGGCCT GAGTGTCCTC GGTGAGATCT ACTACATCAT
87301 GGATTTGTAC AAACAAACCG TGGCGGCGTA GATTTGCTCC TGAGATCGAA
87351 GTGCTCTCTT GGTTTGAGAC GACAGTCACA ATATTGTGAG GGTTGACACG
87901 ATAGATATCA GGCTTGTAGG CAAGTTTGAT GGTCAGTGTA GAAGGAGCCT
87451 TCTTAAATGG'TGTGAACCAT TGAGAAGAAC ATCCTATCGG TAGGGAAACA
87501 TTGTACCCTT TACCTCTACT AAAGCTACGT TGCAGATCTC CAGTTTCTGT
87551 GAACTTACTT TGCCAACCAC CTAGGAATTC TGCGGAAATA AAACCTGAAA
87601 GATCCCAAGC TTGAGCCAGA GGTCTTGTAA GAAGACACCA GTTTAGGAAA
87651 GGATGTTCTG CAGAAATAAG AACATAGTAA CTATTGTTAT GCCATTGCCC
65


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
87701 TTGAGATTTT GGAGCTTTGT CAGGTCTGAG GTAGGTGGTA TTTAGGTGAT
87751 TTTTGGAATA ACCATAAGCT GCTTTCCAGG AAATTAAGGC CTCGCTCTTT
87801 TGAGTCACGA TAGGGAATTG ACCAAAGAAC GAGAGTAAAT ACATTTGTTC
87851 TGAGCAACGT GAATCGTAGG GGTTGGCGTT AGTTTTTCCA TAAAGCTGCC
87901 CGAAAGAAAG TCCTAACGTA GTGTGGTCCG TGTAGTTCAT AGATAGCGCA
87951 GCTTGGTAGC CTCCATAGCG ACCTGAAAAG CCCTCATGTC CTTGTCTTGG
88001 TGTGTGTTCG ACATAGGCTC CTAAAGCTTT CGCGGTTATG GACAACCCGG
88051 GATGATCTAT CAAAAGAACA TCTTGGAGAA TATCAGAGAA TGCCTGATTT
88101 CCTAAGAAGG AAATCCATAA GCTGTTGCTG ACAATTTCTC CGTAACGCTC
88151 GGGATCTAAG ATATAGGTAG~AACGCACGAG AGTGTCTGAA TTCCATACAG
88201 CATAGAGAGT ATTTGCGCTA GGAGAGGGAC CTCCAGGAAA TCCTCCATCA
88251 GGAGCTGGAA TTAACAGGGG ACGGGACCAT GTGTAGGACC ACTTTCCTTG
88301 GTAGCCGTAG TGGCTTGGAG TCGCAATCTC CCCATCAGGA AATCCTGTCT
88351 TAGTAACGGT TGCTCCTTTG AAAACAGCGA TAGGAATTGC TACTGGAGTT
88401 TGTAATGACA CCATATCATA AAGATCTGTA ACGTCATGTT CATCAAGAAC
88451 CAGAGCTCCT GTTAAAGTGA CGTTTTTTGT GCCTGCATTT ACTGATGCTG
88501 AAACAAAATC TCTTTTTAGG AAGGAAAAAG GATCGAATGC TAACTTTCCA
88551 ATCGTAAAGT CTACAGCGGC AGGTGCTCCC GTGGGTGTTG CCAGCCCTAA
88601 GGTTCCTCCA GAGCCCAGGG TAAGCTGACC TGAGCCCTGA GTAGCGAAGC
88651 CAAGAACATT GACAACCGCA TTGTCAGTAA TCTTCAGTTC TCCACTAGCG
88701 ATCTTGACTG TTCCTAGAAG TATAGTTGTC GTGTTGGCAG GCAACAGGAG
88751 TTCTGTAGAG GAGAGTCCCT TACTTGTAAA GACTACAGAT CCTGAAGCGC
88801 CATTAGCGTT GATTGTAATG TCTTTATTAG ACGGACTTGT GGTTGGGAGG
88851 CTATGTGTAA TGGGATCATA AAATACAAGA CGTGAGCCTC CTTGTGCAGA
88901 TAGAGACACA ATCTCTCCCC CTGCTTCTAC AGTGATGGCA TTGCGGATTC
88951 CAGGTTTTGT ATTGAGCATG TTTCCTTGGA ACGCAATATC ACCTTCGGAA
89001 GCCAAAATCG TAAGTGTCGA TGTTTGCTTC GCAGGGTCTC CAACAGAGGG
66

CA 02350775 2001-05-11
WO 00127994 PCT/US99/26923
89051 GCCTATGAAA ATAGCTCCAC CCTTCTCCGC GCGGTTTCTA GAGAATTCAA
89101 TAGGACCGCG TCCTTGAATA TTGAGCACTT TAGCACAGAT GGCTCCGCCA
89151 TTTCTCGCTG CAGTGTTGCT ATCTAGGAGC AAGGCACCCT GGTTCCCTAC
89201 GATGTTGCAG GTTTCGGCGT AGATCGCACC CGCATCATTT GGTGTACCAT
89251 TATAAGAGAA GGTGCACTTC CCCTGATTGT TTTTTAATTC GAAAGTTCCC
89301 GTAGGGATGC AGATGGCTCC GCCACTTCCT AAAGCATAGG TTCCTGATGA
89351 AGGGGTGACG CCTTTTAAGC TGTTCACACA GGAGTTGGCG GTGAAGATGA
89401 TGCATCCAGA ATTGTTTTCA AAAAGGAGAG AGCTTCCTCC ATAAATCACG
89951 CCGCCTCCTG ACGAAGTAAA GTTATGGGAG AAGCTCATGG TAGCGGTGTT
89501 ATTGATGAAT TTAACGACTG CCGTGGGAGA ACTGCTAATT GCAGAGCCGA
89551 AGTTGGCAAG GTTCCCAAAA AACTTGATCG ATTGGGATAT GTTTTCGACA
89601 GTGAGTGAGT CTGTAGGTTG GAGAGCAGAG GCTCCTGTAG TTACTGTAGT
89651 GATTGCGGGA GTTGCCGACG TAGTTACAGA TGCTGGAGAA TAGGCAACAC
89701 GGTTCGTAGT GAAGATCAAA TCTTTGATAG ATTGGAAAAC AATATCTTTT
89751 CCGTAGATGA GGCCGCCAAG TCCTGTCGAT TGGTTCCCTG TAAAGTTTAT
89801 AGAAGAAAAA TTCTTGAAGG TCAGAGTCTC TGCGGCAGAA AGTAGCGCAT
89851 AGTTACTATT TGTAGGAGCT TGACAAGAGG TAAACGTTAA GGAAAGGCCC
89901 TTCCCTAATA AGGAGAGATT GCTGGAACTA TTGATAAAGG AACTTGAAAA
89951 ACTCGCCCCC AAGAAGTTTG AACAAACAAA ATCTGAAGTG AGTAAGATCT
90001 CTTCACCCTG ATTCGTAATT GGAGGAACAA AGTTCCCTCC GAGTCTAGTC
90051 TCAGCAAACG CGCAACTTGC ACTACATAAA CAGGCAAGTA GACAAAAAGA
90101 TGAAGATTTG AAAGAAAGAG GCATGCCTCA ACCCTGTCGT TAAATAAGGT
90151 TTAAAATCGT AATTTGCTTC CTATATCTAG AGTATATTGA CGGGAGGTTC
90201 TGCGAATATC ACATCCACAG TGCCCGAAGA TCTCTAGAAC ATCATTTACT
90251 GAAGTATGGC .TGGATGCTTG TACTAGCAAA GTACTTCTGG TTAGATTATT
90301 CCCTATAGAG GTCCACGTAG CTCCATTAAT AGGTAATGTC GTATCGCAAT
90351 CAGGATTGTG ACGATAGACA TCAGGAGCAT AGGCTACGAT TATAGTGTAA
67

CA 02350775 2001-05-11
WO 00/27994 PC'f/US99/26923
90401 AAATCTGGTC GATTATGAGA ATTTTTACCA AAGCGGACGC CTACGGGAAC
90451 TGCAACGTTG AGTAGATGAC CGTGTCCAAA AATCCTCCCC TCGGGGGTAT
90501 TTTCTTGGAT GCCCCCATGA GTCGCGTAAG CAACTTCAGC TTTTACAAAG
90551 GGAATGATCT GCTTGAGGTT TAAGATGCGA GAAGAGAGAG TGATGGGAAG
90601 GTTCCCTTCG AGTTCTCCTA ACCAGCAATT GTTACTCCAA GAGCAGCGCC
90651 CTTTAGGCAA TTTTGTATAT GAAGTCTTTA CTTTCTCATT GCTACGGCTA
90701 TAGGTAACTC GAGAAGTGCC TCCTGAGAAG AATCTCGATG ATCCAAACAG
90751 AGACTTGGTG ATGTTAGAGT ATACTGTAGC GAAATAAACG TTAGAATGAC
90801 CGTGACCTAC GAGGTAATCC TTAGATTTTG TAAACAGCTG TCCAAAACCT
90851 AGACTTAACG CAGCATCTGG AGTGATGCGT GTGTAGGTAT TGATGAGGTA
90901 GCCTCCACCC ATATGGCGGT AGCTGCGTGC ATCACCGGTA TGATTCGCAT
90951 GGAAGAAATT TGTAATTCCT GTGATGCTCA GTTGCTTCCC AGGGACATCT
91001 TCGCCATCAG CTGCTGACGC TTGACTTACA GCTCGTAAAT CTATGACGTT
91051 TGCCCATAGG CTATTAGGAA TGAGGGGAGC AAGACGCTCC GGATGAGGAA
91101 GATAACCGGT TTTTTTCCAA TTTCCTGTGA CCGTATGGGT TGTCGTGTCT
91151 ATGGTAAACT CCCAAGTTCC TTGATACCCA TAGGGAGATT GCTGATAGCC '
91201 GTTTGTGCCG AGACTGAAGT CCGTAGTGGT TACAGTATTT GAAGTCGCTT
91251 TGAGTTCTAA AATCGGAACT TGCTGTAAAT CTTTATTAAA CATCCCGTGG
91301 TTGTCACAGC AATCTTGAGA GTTTTTCACA AGTCCTAAAG TTCCGGATAT
91351 AGTGAGAGCT CCATTGGTAC TCTGCACATT AACGACAGCC GCTTTAGTGC
91401 CATCCAAAGA ATCCAGATTG ATTACAAGCT TGTTTAAGGT GATAGCACCG
91451 TCAGTATTAT TAGCTCCATT TGTAGTTGCT AATGTGGTCC CTGCATCCAT
91501 GATGACGACG GACTTTTCAT CTTGCGTGAA GTTATGAACA TTTAAGGTAG
91551 CACCGTTTCT TAAAGCGAGA GTACCGCCTT CAAGTTCTAG CTTTTGGTTT
91601 AATGTAGATG TAGCATTTGC AGGGGTTGCT GCTTCGGTAG CAGTGAGGGT
91651 TTCTCCTGAA AAGACAATAG TCCCTGAATA CGCACCATCT GCACTGGCTT
91701 TGGGATTGAC GACCACAGTA GCGGCTGCGG ATGCAGCAGA TAAATCATCA
68


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
91751 GATGTAATCG GATCATAGAA GTATAGGGTA TAGCCTTGCG TAGCTCCTAG
91801 AGTGGCAAAC TTGGCATCTT TTCCGAAGTG AATACTATTG CGAGTAGGTG
91851 TCCCACTAGT GATGCTGAGG TTCTTGTTAA AGAGGATATC ACCTTGATTT
91901 GCGGATAGAG AGAGCTCCCC AGATTCGGGA ATAGCAATAG CGCCTCCTTT
91951 TCCAGCTGTA TTTCCAAGAA ATATTGTAGA TTTATTAGAA TCTATAGAGA
92001 TCTTTTTGCC ATAGAGGGCT CCTCCTTGTT CGGAGGCAAT GTTTTCTAGG
92051 AATGTAACGC TGTTTTCTCC AGATATAGTC AGGCTAACAC CTGTTGGTGG
92101 GGGGGTAGCT GGAGGAGTAC AGAAGATGGC GCCTCCATAT CCTAACAAAG
92151 GAGTGACTGC TGGTGGTGTA GGTGGAGGTG TAGGTGCAGG TAAGGAGTTT
92201 TGTGGAGATG CTGTATTGTT TTGGAAAGTC AGGTCGCTGT TATTAGAAAA
92251 TGTGACATTT CCGTTAGCAT AGATAGCGCC TCCTGAGCGC GAGCTATTAT
92301 TAACGAACAA GACTCCTGAG AGGTTCCCAG AGGTGAGCAT AGATCCTCCG
92351 GTAAGGTAAA TAGCCCCACC ATAGATCCCT GTAGCATTCG TTGAGAAAAT
92401 CACAGGAGCG CTATTGTTGA TGAGGTTGAT CGCTGCAGAT CCCGTGAGGG
92451 CCCCTCCATT AGAGATGGAT CCATTACCAT TAAAGAGAAG GCTCTTTTTC
92501 GTATTTTCTA TTGTGATGCT TGTGCCTCGA ATGGCAGCTC CAAATCCTGC
92551 AGAACGGTTG TATTGGAATA GTATGGAGTC ATTGTTTGTA AAGAGCATGG
92601 GCGTTGTAGC GTAAATCGCC GATGCGTGAG GTATGACATT ACTCGCTGAG
92651 GTATCTGAAG TCAAAGATTC ACAGTTATCG AAGATCATCT GACTAAATCC
92701 TGAAAAACTC AAGGGACATA GTTCAGGATT TTGGGTGATT ACACTACTAA
92751 TCGCGGCTCC GTCAGCTGAA GAACGGATAT TTAAGAAGGA GAAAACCCCA
92801 CCTTTTCCTA AGATTTGTAG TGCTCCCGCC CTATTGCTAA AGCAACTGGA
92851 AGAGGTTCTG GATATGGCAT TATCAAGATT CGCAATGTAG AGATCCCCTG
92901 AAAAAATACA GAGTGTCCCT CTAGGATCAG AAAGTGTTGT GTAAGGAAAA
92951 ATCTTCCCAC TCGATCCATC AAAGTTCTCG GAAGGCATGA TAACTTCTAC
93001 AGTAAACGCT GTTGAAGCAA AACATGGCGC CAGTGTGGTA GAAATTAAGA
93051 ACTTACGAAT AGACGTTTTC ATTTGCACGT AGAGATGAAA CCAGATTATC
69


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
93101 CTACAAATAA GGGAAAGGCT GTAAAAAAAC AAGTACAATA AGACACAGTT
93151 TTAATCTCTT AATTTTGACA GCTTTAAGAT TACAGGATAT TTTAAAGGGC
93201 ATTTTCCCAT TTCTTACATT GCTTTCTTAG AAGAATACTT GATAGAAAAT
93251 GGCGATTCTA TTTTTGAAAA ATCTCAAGAA ATTCTCCCAA ACGAAGATGT
93301 TTTAGAGAAC CTGTAAAGTA GAAAATGGCG CTACGCCCGA GACTCGTGAA
93351 CATGTGTGCA TAGAGGGACC TATCTCTGAA TACAGATAGG TCCCAAAATC
93901 TTCTTAAAGG.GGATTCCCTT AATTATAGTG TAGACTTAGA GATTATGGCG
93451 TAGAAGTGAT CTCAGGATTC CAAGTGAGGA AAACAGTTTT CTTAGTTGGT
93501 GTGATCCTTC TGTCTTTATC CAGAGAAGGG GGGTACTCTT CCCAGCTTAG
93551 GGACCAGAAA CCTCGAACGG CGTCATTCTC TGTTGTGACT TGGGTTCTCT
93601 CCAAAGTGGT CTCCCCTAGG AGAAGACTGT CAAAAGAAGG CCCTAGAAGT
93651 TCAAGAAGAG GAATCGTAAA GGCTTCCTTA AATTGAGGAA AGTCATAGAC
93701 GGACCAATAA GCCTCAAAGG CAACTTTAAG TTTTTCTAAT GAGACAAGAG
93751 CATCCTTGTC CTCGGCGCGA ATCCTTACAG GAACAAAGTT GTCGGTATCT
93801 TCAATCAGGA TGTGCAGATT CTGAACCCGA GCATCTCCTG AGCAGAGCAG
93851 AGTGGTTCCT GGAGACATAG TAAGCGTAGA GCTTGCTTCC TGCTTAAAAG
93901 AATGCAGTTG CAAGGTAACC CCATCCGATA GAGAGAGAGT TCCTCCTGCT
93951 AATGTGACAT CTTGTAGGAT TGTGGAAGTA AGATTTTCCG CACAAACTTC
94001 ATGATCATCC AGGCATAGTC CTGAGAAGCT AATTGTTCCT TCATAAGTTT
94051 CCTTTCCTTC AGGAGCATTG ATTACAAGAT CTGTAATTTT ATGCGACTCG
94101 CTATGGCTTA TAGGATCATA GAAATAAACT CCGGATTCTG AAACAGCACG
94151 TAGGTTCTTA AACTGTGCTC CAGATTGCAG ATGGATGGAG TTGTGTATTG
94201 TATTTCCGTC TTGTGATGCT GTATTTCCTT TGAAGATGAG ATCTCCGCTT
94251 TTCACGGATA TAGAGATCGA TCCTCCAGGA GCAATGGCAA TGGCTCCTCC
94301 ATTACTATTC AGGTCATGAT AAGCATGATT ATTTTCAAAA CACGAAGGTC
94351 CTCGAGTCGT GAGTGTGAGA TTGTGGGTAG ATATGGCGCC GCCATAACCT
94401 TGGCTCACAT TGTCTCTAAA CACCAGGTAG CGGTTCCCGC TGAGATTTAC
70

CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
94451 TGAAGGACGA CTCGCCTTAG AACCTAAAAG GTAGGGAGTA TAAATCGCAG
94501 CTCCACTCCA CGAAGAGTAG TTCCCACAGA AAGTCACTTC CTCACTATTT
94551 TCGATCATCA CGGAACCAAG ACTATAAATC GCTCCTTGTC CTTGAGGTAG
94601 TAGAGGTGCT GAGGTGAACG CTAAGTAAGA AAAATTAGAG AGAGTGAGAG
94651 TGGTGTCTCC AACGCGGTTC GAAATGGCAG CGCCAAAACC CTCGGTCATA
94701 AGGTTGTGAA AAGTGAAGTT GCAACGGTTG CCCATGAAAA AAAGATTCCC
94751 AGATCGATTT ATAAAAACCC CAGCATCTTC TTGATCATGC TTAACGTTGG
94801 A.AATCCTCAC GTCATCTAGA AAGATGTAAG AAGTTCCTTC TGGATAACAG
94851 GTAATTTTAG GTTCTAAGCT TTTATTATTG ATAGCACCGT TATAACCATC
94901 ACTTTCATGA AGATATACAA CTTGTGCTGC TGCAGGGAGA GCGAGGAATA
94951 AAGCCGAGCA GGTAAGAAAA TTTCGAAGTA TGGTCATGGT TTCCTCGTTA
95001 AATCAATAAG GTTGAAGCAA CTTTAATAAA CAAGAAAAAA AGAAGTCAAT
95051 AAGAATAGAT TATTGTCTAT TAATTATTTA ACTGTTTTTA AAATAAAATT
95101 ATAACTAGAA ATTATTAAAA GAAATCTTTT TTGAAGAGGG ACAAATGTTA
95151 TTTTTTACAG TTTGCAAGGA AAGCATTCCC TATAGCAAAT ATTTCCCTAA
95201 AAGTATGAGA AAACTCCCTA GAAGAACTAG GGAGTTTTAG CAATCTAGAA
95251 TCGGAGTTTG GTACCAACAT CTACATTGTA GTTCCTTGAA GATCCACGGA
95301 GTTCCATAGC GTAATGTCCG AAGAGCTCAC AATTGGAGTT GTAGACGTAG
95351 TTGTTGCTAC CCCTCAGTAA AAATGCCTGT CTTGAAAGAT TGCCACCGCG
95401 AATTTTCCAA GAGTCTGGGC TCATCACAAG AGTCGCTGTA GATTGGGGAT
95451 TGTTACGATA GACATCGGAA ACAAAGAATC CTGAGAGATC ATAGGTGTAG
95501 GAATCTCCGA TATCCCCCTG CACGAATTTC GCACCCACAG GAATCGAGAG
95551 GTTAAGCAGC CTTCCAATAC TAAAACCACG GCCATCACTA GAGCTTTCGA
95601 AGAAGCTATT TTGTGATACA TAAACCATTT CGACTTTCAT CTGTGGAATG
95651 AAGGTCTTGA AAAGAGGATG TGGGTTGGAA AGAACAAAAG GAAGGTCTAG
95701 GCCGATACCA CCAGCTATAC ACTCGTTGCT CCAAGAACCT TCGGATTCTG
95751 GCAATGAGGT ATAGTGCGTT TCCATACGGT TGTCTGAATG GCTGAACGAA
71

CA 02350775 2001-05-11
WO 00/27994 PCTNS99/Z6923
95801 ACTTGGACAT CCAAGGCTAG GGGAATTTCC CTAGGGAATT TTTCTATAGC
95851 TGATTCAGAA AACTTTGCTC TTCCTAATCT CAAATAGTTT TGGGGTTGTA
95901 GGGTATGAGA GTGCTTGAAG AATAAAGTTC CACCGTAGGT TCTAGAGTTG
95951 TTGTGAGCGA TAAAACAATC TTTGTCTCTA GCAAAGAGAT GGCAGAACGC~
96001 AAAGGTAAAT AGGTCGTCTT TAGGAGTGTG AGCACTTCCA CCGATGACGT
96051 AGCCTCCAGA GGTATGACGG AAGCCTTTGC GATTTTCATC TCCAGTCTTA
96101 TGCAGGAAGT TCGTCATGGA GGAAACCCAG AAACCTTGTT TGTGTTCCAT
96151 ACCAGTTGCG CCGATCTCTA CAAGCTGTTG CAGAGAGCGA ATGTCAGTAA
96201 AGACTCCCCA TAGGGTATTG CATACTAACG CAGATTTTCT TTCGGGGCTG
96251 GGAACAAATC CTGTTTTGGT CCAAGTTGCC GTGGCCTCTT TTGTATTTGT
96301 AGCTGTATCC GTAGTCCAAT TAACATTCCA TTGTCCTTGG AATCCGTATT
96351 CTGAATTAGG ATCCTCAGCA GGAACAGGGA TAAGGCTGCT GATGTCAACG
96401 TTAGTATCAA CATCAGCATC AACCGTGATT TTTAATAGAG AGAAGAGCTG
96451 GTCATGGCTG AACATATGAC TTTCATAAAT GTTCCCTTCA ATATCAATCA
96501 GGTTGAGCTT CCCAGATACG ATCACTTTAT TTGAAGCACC TTTTGCTGTT
96551 AGGCTGACGG GCTGCTTAAG ACCTAAGGAG TCAACATTGA TTCCTAGGTT
96601 CGTGATTGTA ATACTCCCAG CTGTAGTTGA TAATGTCGTT CCTGAATCCA
96651 TGCCGAGGAG AGAACCGGCC TCTTGAGAGA AGCTCGTGCT CTCTAAAGTG
96701 ACTCCCTTTT GTAGCAATAA CTTTCCTCCG GATAGGGAGA CTGGCTGCGT
96751 GAATGAAGAT TTTAAATTGT CAGCAACTTT AAGTTCATCT GCTGTTAGGG
96801 TTTCTCCAGA AAATAGAATC GTTCCTTGAT ATGGATTGAG AGCTCCCGCA
96851 GAGCCGTTAT TTATCTTCAA TACGTCTGAT GAGGTTCCTT CTGAAGTGAT
96901 GGGATCATAG AAGAAAATTG TATGATTT~'T AGCAGCCCGT AATTCCGTGA
96951 ATTTCCCGTT ACTTCCTATG TTGATCGCAT TACGTTTAGG AGTATCGGTA
97001 CTTCCGGTTG TTGTAAGGGT ATTTCTTACA AAGGTAATGT TTCCTGTCTC
97051 TGCAGAAAGA CTGAGCTCTC CTGAGGCATC GATGCTGATA GCACCCCCCT
97101 TAGGAGTTGC TGATGAGACA TTATTTCGTA GAAACTCTGT AAAGCCTCCA
72

CA 02350775 2001-05-11
WO 00/27994 PCTNS99/2b923
97151 GAGGAAAGGG CTAGCTTTTT AGCATGGATG GCGCCACCGC TTGTTTCTGC
97201 TACGTTTGAA GCAAAGATCA GAGTCTTATT GTTAGAGATT ATCAGTTCAG
97251 GAGATCCACT CGCCTTGGTG TTGCAGATCG CACCGCCAGT AGTTTTCGCT
97301 GCATTCCCTT CAAAATATAG AAATTTGTTG TTCGATAGTA TCGACGTGCC
97351 TTCATCATCG ATAGCGCCTC CTGACGTAGA CGCTATGTTA GATAGGAATC
97401 TAACATAACC TGTGTTATTT GCTATGCGAG CGCCTGCTGT AGTAGCAATT
97451 GCTCCTCCCT TTGTTGATGA AGAGTTGTTA CTAAAAAGAG CATCTCCAGA
97501 AGTGCCAGTT AAAAGGAAAG ACGCTCCTTT GATAGCTCCA CCATCTGCAG
97551 TAGAAAAATT CCCAGCAACT ACAAGTTTAC GAATATTTTC TAAATTTACG
97601 CCTCCTGCTG AGGAAAGCGT TCCCTGACCT GTAGTAACCG TTGTGCTAGG
97651 AGAGGAATCA AAACTCAGTA AGGAAAACCC TGAGAAGGTA AGATTCTTAT
97701 TTGCTGTTGT AGATGCAGCA GCACCTGCAT GAGTGCCAGC ATCTATAAAG
97751 CCAAACGTTA AGCTATGACC GTTCCCCAAG AAGGTAAGAT TGTCCGTGGT
97801 TTGCTTAAAA CAACTGTCAG ATAAGGGAGT GCCTTTTCCA GGCTCGTAAA
97851 AGAAGACATC TCCTGTTAGA GAATATGTTG TGGCTGAAGT TTTTGGAGTA
97901 AACGTTCCTG AATCGATATT TCCATTAAAG CTATCATCAG GTGATAAAAG
97951 TTCCTCGTTA GCTAGTGACT GTAGGTGACA TGAGAAAGCT AACACGGAGG
98001 AAACTAAAAC CCAAGGAATC GAAGTCTTCA TGGTAATGCT TTTGTTTTTT
98051 AGAGAACTAT TCGCATCAAT ATAGAAACAA AATAAGTAAA TCAAGTTAAA
98101 GATGACAAAA CAGCTGTCAA GAATTTTTAT CTTGACTCTC TGAGTTTTCT
98151 ATTTTATATG ACGCAAGTAA GAATTTAATA ATAAAGTGGG TTTATGAAAT
98201 CGCAATTTTC CTGGTTAGTG CTCTCTTCGA CATTGGCATG TTTTACTAGT
98251 TGTTCCACTG TTTTTGCTGC AACTGCTGAA AATATAGGCC CCTCTGATAG
98301 CTTTGACGGA AGTACTAACA CAGGCACCTA TACTCCTAAA AATACGACTA
98351 CTGGAATAGA CTATACTCTG ACAGGAGATA TA.T~CTCTGCA AAACCTTGGG
98401 GATTCGGCAG CTTTAACGAA GGGTTGTTTT TCTGACACTA CGGAATCTTT
98451 AAGCTTTGCC GGTAAGGGGT ACTCACTTTC TTTTTTAAAT ATTAAGTCTA
73


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
98501 GTGCTGAAGG CGCAGCACTT TCTGTTACAA CTGA'1'AAAAA '1'C'1'c~'1'C:c;c.'1'A
98551 ACAGGATTTT CGAGTCTTAC TTTCTTAGCG GCCCCATCAT CGGTAATCAC
98601 AACCCCCTCA GGAAAAGGTG CAGTTAAATG TGGAGGGGAT CTTACATTTG
98651 ATAACAATGG AACTATTTTA TTTAAACAAG ATTACTGTGA GGAAAATGGC
98701 GGAGCCATTT CTACCAAGAA TCTTTCTTTG AAAAACAGCA CGGGATCGAT
98751 TTCTTTTGAA GGGAATAAAT CGAGCGCAAC AGGGAAAAAA GGTGGGGCTA
98801 TTTGTGCTAC TGGTACTGTA GATATTACAA ATAATACGGC TCCTACCCTC
98851 TTCTCGAACA ATATTGCTGA AGCTGCAGGT GGAGCTATAA ATAGCACAGG
98901 AAACTGTACA ATTACAGGGA ATACGTCTCT TGTATTTTCT GAAAATAGTG
98951 TGACAGCGAC CGCAGGAAAT GGAGGAGCTC TTTCTGGAGA TGCCGATGTT
99001 ACCATATCTG GGAATCAGAG TGTAACTTTC TCAGGAAACC AAGCTGTAGC
99051 TAATGGCGGA GCCATTTATG CTAAGAAGCT TACACTGGCT TCCGGGGGGG
99101 GGGGGGTATC TCCTTTTCTA ACAATATAGT CCAAGGTACC ACTGCAGGTA
99151 ATGGTGGAGC CATTTCTATA CTGGCAGCTG GAGAGTGTAG TCTTTCAGCA
99201 GAAGCAGGGG ACATTACCTT CAATGGGAAT GCCATTGTTG CAACTACACC
99251 ACAAACTACA AAAAGAAATT CTATTGACAT AGGATCTACT GCAAAGATCA
99301 CGAATTTACG TGCAATATCT GGGCATAGCA TCTTTTTCTA CGATCCGATT
99351 ACTGCTAATA CGGCTGCGGA TTCTACAGAT ACTTTAAATC TCAATAAGGC
99401 TGATGCAGGT AATAGTACAG ATTATAGTGG GTCGATTGTT TTTTCTGGTG
99451 AAAAGCTCTC TGAAGATGAA GCAAAAGTTG CAGACAACCT CACTTCTACG
99501 CTGAAGCAGC CTGTAACTCT AACTGCAGGA AATTTAGTAC TTAAACGTGG
99551 TGTCACTCTC GATACGAAAG GCTTTACTCA GACCGCGGGT TCCTCTGTTA
99601 TTATGGATGC GGGCACAACG TTAAAAGCAA GTACAGAGGA GGTCACTTTA
99651 ACAGGTCTTT CCATTCCTGT AGACTCTTTA GGCGAGGGTA AGAAAGTTGT
99701 AATTGCTGCT TCTGCAGCAA GTAAAAATGT AGCCCTTAGT GGTCCGATTC
99751 TTCTTTTGGA TAACCAAGGG AATGCTTATG AAAATCACGA CTTAGGAAAA
99801 ACTCAAGACT TTTCATTTGT GCAGCTCTCT GCTCTGGGTA CTGCAACAAC
74


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/2b9Z3
9 9 B 51 TACAGATV'1"1' (:(:A(iC:(iCi'1"1'C: l:'1'Hl:Hli'1'Hht_.
HHI,:'1'l.l.'1'HL V l.rW.1 r~ ~ vvv i
99901 ATCAAGGTAC TTGGGGAATG ACTTGGGTTG ATGATACCGC AAGCACTCCA
99951 AAGACTAAGA CAGCGACATT AGCTTGGACC AATACAGGCT ACCTTCCGAA
100001 TCCTGAGCGT CAAGGACCTT TAGTTCCTAA TAGCCTTTGG GGATCTTTTT
100051 CAGACATCCA AGCGATTCAA GGTGTCATAG AGAGAAGTGC TTTGACTCTT
100101 TGTTCAGATC GAGGCTTCTG GGCTGCGGGA GTCGCCAATT TCTTAGATAA
100151 AGATAAGAAA GGGGAAAAAC GCAAATACCG TCATAAATCT GGTGGATATG
100201 CTATCGGAGG TGCAGCGCAA ACTTGTTCTG AAAACTTAAT TAGCTTTGCC
100251 TTTTGCCAAC TCTTTGGTAG CGATAAAGAT TTCTTAGTCG CTAAAAATCA
100301 TACTGATACC TATGCAGGAG CCTTCTATAT CCAACACATT ACAGAATGTA
100351 GTGGGTTCAT AGGTTGTCTC TTAGATAAAC TTCCTGGCTC TTGGAGTCAT
100401 AAACCCCTCG TTTTAGAAGG GCAGCTCGCT TATAGCCACG TCAGTAATGA
100451 TCTGAAGACA AAGTATACTG CGTATCCTGA GGTGAAAGGT TCTTGGGGGA
100501 ATAATGCTTT TAACATGATG TTGGGAGCTT CTTCTCATTC TTATCCTGAA
100551 TACCTGCATT GTTTTGATAC CTATGCTCCA TACATCAAAC TGAATCTGAC
100601 CTATATACGT CAGGACAGCT TCTCGGAGAA AGGTACAGAA GGAAGATCTT
100651 TTGATGACAG CAACCTCTTC AATTTATCTT TGCCTATAGG GGTGAAGTTT
100701 GAGAAGTTCT CTGATTGTAA TGACTTTTCT TATGATCTGA CTTTATCCTA
100751 TGTTCCTGAT CTTATCCGCA ATGATCCCAA ATGCACTACA GCACTTGTAA
100801 TCAGCGGAGC CTCTTGGGAA ACTTATGCCA ATAACTTAGC ACGACAGGCC
100851 TTGCAAGTGC GTGCAGGCAG TCACTACGCC TTCTCTCCTA TGTTTGAAGT
100901 GCTCGGCCAG TTTGTCTTTG AAGTTCGTGG ATCCTCACGG ATTTATAATG
100951 TAGATCTTGG GGGTAAGTTC CAATTCTAGG AGCGTCTCTC ATGTCTCAGA
101001 AATTCTGAGA GAGATCGCAT TTAGGATTTT CTTAAACACG ACTCACCTTG
101051 TTTTTGAACC AGGAGAGATC GGGGATTAAA AAGGCAAGAG GGCAGAGTTC
101101 GTGAGGTCAC GTACTCTGCC TTTCTTGTTA CAAACACGTT TTAAAATTAA
101151 GGAAATTTTT TAATAGAAAC CCGTTCTTTA AAATACGTTT CTTTAATTCT
75

CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
101201 TATTGAATAA GATAATTCAC TATTTTTAGA TCCTAAATTT TAAGTGGTTT
101251 TTGTTATGCT TCTTATAGAG AATAGCTGCA AAGATTAGAG TTGCAGAGAC
101301 GGTACGTCTC TTTCTTTTTT AAGGGAAGGG GTGTTGTTAC ACCCATCCTA
101351 AGATTTGTGA GATTCCCCTC AGGCAGTAAC TTTTACAATC GTACTTTATG
101401 TTTTGATCTA GCTGTTTTCT TGTCTTTAAT TTATTCAACC ATCGAGAAGA
101451 GAGATCCATG AGTGGAAATG TATTTTATTA GGATCATCTC TAAGGATGGA
101501 AATGATGAGC CCATTCCAAC AACCTGAGCA ATGTCATTTT GATGTTGTGG
101551 GAAGTTTCTT ACGTCCTGAA AGTCTTACAC GAGCACGCTC TGATTTTGAA
101601 GAAGGAAGAA TTGTCTATGA GCAGATGCGA GTTGTCGAAG ATGCTGCTAT
101651 TCGTAATCTC ATAAAP~AAGC AAACAGAAGC AGGTCTTATC TTTTTTACTG
101701 ATGGGGAATT CCGTAGGTAT AGTTGGGATT TCGACTTTAT GTGGGGATTC
101751 CATGGCGTGG ATCGTCGCAG GGACTCTAAT GACCCTGAAA TTGGAGTGTA
101801 TCTTAAAGAT AAAATCTCCG TATCAAAACA TCCGTTTATA GAACATTTCG
101$51 AGTTTGTCAA AACTTTTGAG AAGGGAAATG CAAAAGCAAA ACAAACGATT
101901 CCTTCTCCAT CACAATTTTT CCATGAGATG ATTTTTGCTC CTAATCTGAA
101951 AAATACTCGG AAGTTTTATC CTACGAATCA AGAGCTAATT GATGATATTG
102001 TCTTTTATTA TCGCCAAGTC ATCCAAGATC TTTATGCTGC AGGTTGTCGT
102051 AATTTGCAGT TGGACGATTG TGCTTGGTGT CGCCTCTTGG ATATACGAGC
102101 GCCTTCTTGG TATGGTGTTG ATTCTCATGA CAGGTTGCAG GAAATTTTAG
102151 AACAGTTTTT ATGGATCCAT AATTTAGTGA TGAAGGATAG ACCCGAGGAT
102201 CTTTTTGTAA GTCTGCATGT CTGTCGTGGT GATTATCAGG CCGAGTTTTT
102251 CTCTAGACGA GCTTATGATT CTATAGAGGA GCCTTTATTT GCTAAGACCG
102301 ATGTGGATAG TTATCACTAT TATTGGGCTC TTGATGATAA GTATTCAGGA
102351 GGTGCTGAGC CTTTAGCTTA CGTCTCTGGA GAGAAACACG TCTGCTTGGG
102401 ATTGATCTCC AGCAACCATT CTTGTATTGA AGATCGAGAT GCTGTGGTTT
102451 CTCGTATTTA TGAAGCTGCG AGCTACATTC CCTTAGAGAG ACTTTCTTTG
102501 AGCCCGCAAT GTGGGTTTGC TTCTTGTGAG GGAGACCATA GAATGACTGA
76


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
102551 AGAAGAACAG TGGAAGAAGA TCGCCTTTGT GAAAGAGATT GCTAAAGAGA
102601 TCTGGGGATA AAGAATCCGG AGTTTTTATC GACTCTAAGA GTTTTCGGAT
102651 CATAGAAAAC ATTTAAATAT TCAAGAGTCT TTGGCTATTG GATCATAGAC
102701 AGTCTTAGTA TACTAAAAAG TCTTTGGATT CTAAGACGGG CAGAGTTCGT
102751 GAGATCACGT ACTCTGCCCA TTCTTTCTTG TGATCTAGCG ACTTCTTTGA
102801 ATCTTCGACC TCTTGTAATC TGGGATTTTT TCTAGTTCTT AGATTCCTCT
102851 GATCTTTCGA CTTCTCCTCG TCTAAACAAG GCGCATTGTC TTTGAGAAGT
102901 CCCTAGATAC ACTCAGGATC TCTTAGAATT TCTAAGGGAT CAGGAACGCT
102951 TTTAGAACTG GAACTTACCT CCAAGATCTG CATTGTAGCT GCGTGAAGAT
103001 CCACGAATTT CCATAGATAG GTTACTTGTG ACCTCAAGAT TTGGAGAGAA
103052 GGCATAAAAG ATCCCTGCTC TTCCGATACC AGCTTGTCTT GAGAGATTCG
103101 TTCCTGTAGT TTTCCACGAG GTATTGTTGA TTAGGAGAGC TGTCGTGCAG
103151 TCAGGATTCT TACGATAGAC ATCGGCAACG TAGATGACAG TAGCTTCGTA
103201 AGACGCACGC TCGTTTCTCG AGAATCTCTC GAAGGTAATT CCAATAGGCA
103251 CAGAGACGTT AATTAAATCA CCGCTATCGA AAGATCGTAC CAAGGTAGTA
103301 TTACGTTCTT TGAAGCTATC TTGGTGTATG TACGAAGCTT CTACTTTGAT
103351 GAAAGGAAAA TACGCGTGGA AGAGACCCTC ATGGCTTAAA GCAGTGTGTG
103901 GTAGGGAGCT CGCAAGTTCC AGAGCGCAAC CGTCATTATA CCACGAGCTC
103451 TCTCCCTTTG GTGCTTGGGT GTAATAGGTT TTCATAGTAT TTTTACTATA
103501 GATATAGCTG ATCTGAGCAT CAAAGAGGAC AGGCTGCTCA CTTTCAGATC
103551 CAGGAAGGTA GCGTAACAAG CTTGGAGAAG ACAAGGTCGC TAGATGCTGG
103601 AGATGGAGAG AAGCTGCATA GGCAGAAGCT CTATTTTTAT TTATAAAGTG
103651 ATCTCTATCT TTCCCGAATA ATTGGCAGAA GGCTGCAGTG ATAAGATTAT
103701 CAGAAGCTAA TGTTGTAGTC GCTCCTACAA CATAACCTGC ACTTATGTGG
103751 CGAAAACCTT TATTTATCTT CGTGCTATCT TTATGGAAGA AGTTCGAGAT
103801 CCCTTCACAC CAGATGCCGC GAGTTTCTTG AGATTGGCGT ACTTTAGTGG
103851 CTACAAGCTG TTGTATGGAG CGCACATCAA CAAAGGATCC CCATAGCGTG
77


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/Zb923
103901 TTAGCAACTA AGGTTCCACG ACGCTCAGGA TTCGGATTGT ATCCTGTTTT
103951 TGTCCAGGTA AGAGTCGCTG CTTTGGATTT AGTCGCAGTA TCCTCTTGCC
109001 AAGATAATGC CCAATTCCCT TGGTATCCCC AATGGATAGG ATTTTTTTCT
104051 AGGGGATCAG CAGCTAAGTC TGTGATGTGA ATATTCGCGG GGTCGTCAGC
104101 AGTAAGAGTG AGACAAGAAA AGACTTGAGG GTTATTCCAA GAGACATCTT
104151 CGTAGACATT TCCAGAAGGA TCTACAAGAG AGAGCGATCC AGATAAAGTG
104201 ACTGTCTGAC TTGCTTGTGT TGCTTTTAGC GTAGCCTTCT TGGTCTCTTT
104251 TAAGGAATCT ACATTGAGAA CAAGATTATT GATAGTGATC CCATCAGCGG
104301 TTTCTAATGT GGTCCCTGCA TCCATGAGGA GGGTAGAGCC CGGAGATTGC
109351 GAAAAGGACT TAGCAACTAG AGTGACTCCT GATTTAAGAG AGAGTTGCCC
104401 TCCCGCAAGA GTTAGAGGTT GCTGAATTGT AGATTTGAGA TTATCAGCTT
104451 CTGCAGCTTC TGCTTCCGAG AGCTTCTCTC CAGAAAATAC GATGGTTCCT
104501 TGATATGCAG GATTCCCTGC AAGGTCAGGA CCATTTAAGT TTAGAGCATC
104551 TGAGAGAGCT GCAGTGATGC TAGTTGTTAT AGGATCATAG AAGTAGATAG
104601 TATTGCCTTG AGAGGCTCGC AGCTGTACAA TCTTAGCATT GGTGTTTCCG
104651 ATGTTAATAG AATTTCTGGT AGTGGTCTGA CTCGAAGAAG CTCCTTTGAC
109701 TACTGTGTTT CCTTCAAAAG TGATGTCTCC ACCAAGAGCC GAAAGACTCA
104751 AAGATCCAGA GTCAGCAATC GCAATTGCTC CTCCTAAGGG AGCTGCAGTA
109801 TCTATAGCAG AGTTGTTTTT AAAAAGCGTA GGTCCTCCAG AAGAAAGAAC
104851 TAGATTGTCA GTATAAATCG CCCCACCACT AGTAATTGCT GTATTTCCTA
104901 TAAAGTTCAG TTCCCCGTTG TCTGATAGAG TTAAGACTGG TTTGGGGGCT
104951 GATGTACTAC TACAGTAAAT GGCTCCCCCT GTAGCTGAGG TTGCGGTCAC
105001 ACTATTGTTT ATAAAGCTAA TTGCTTTGTT GCTGCTAATA AAACTGCTAG
105051 CTTCCGTGTA AATGGCTCCG CCATTGTTCG CCGCGGTATT TTCAGAAAAT
105101 GATGCTGAGT TTAACGTATT GTTAATTGTA ATCCCTCCCG TGGAATAGAG
105151 GGCACCCCCT TTTTGCGTTG CTTTGTTTTT GGCAAACGTT AGGTTGGGGT
105201 TTAGCGATAG ACTGATAGAG CTGCCTTGGA GGGCGCCTCC ATTGTCATTA
78

CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
105251 GAAAAGTTTT GGCCAAAGTA GCAACTATAG TTCGACTGAA TAGAACAAGC
105301 TCCTGTGGAC TfiGATGGCTC CTGTTCCTGT GGTAGCATTC GTGGTTTGTA
105351 TTAGTGACAA ATAGGAGAAT CCTGAAAAGG AGAGAAGCTT ATTTGCAGCT
105401 GTATTGGTAA AGGTACAGTT CGCTCCCGCA TCGATATTTT GTAGGAGAAA
105451 TTGGTAGCCG TGGCCTTGGA AAGAAAGATT CCCAGTAGTT TCTTTAAAGC
105501 AGGAAGCGGT TAGAGCTGTC GGAGATCCTG CATTGGTGAT TGAGACATCC
105551 CCTGTTAGAT TATAGATAGT TCCATCTGCA TTTGTTGTTT GGGCTGGAGG
105601 AGTGTAGGTT CCTGGTCCAG AGAAGCTATT GGTAGGTCCT AGATTGATTT
105651 CAACAACAGC AGCAAACGCA GAGAAATTTA GTGACAAGGG AAGTGCTAAA
105701 GATGACGAGA TTAAAAACCA ATGAAGAGAG GATTTCATGT AGAGGGCTAT
105751 AGGTGGTTTA ACAAATTATT TCACCACATA CTGCAATAAA TTAAAGAAAG
105801 CAAGAGGAAA GGAGAGACTA GTAAGTTAAG AATCTACAGG GTTTTTATAA
105851 GAATTCCTCC CTAAAAGTTT AGGGAGGAAA GTAGGAACTA GAATGAGTAT
105901 CTTAGCCCAC AATCTACATT GTAGATGTGT GCTGAGCCAC GAAGCTCATA
105951 AGCAGCTTCC CCAGAGAGTT CTACATGAGG GGAGAGAGTC AGATGGCTTC
106001 CAGCACTTGC TAAGAAGGCT TGTCGTGCGA GGTTTTTACA TAGCGAAGTC
106051 CAAGAGGCTC CACTGACCAT TAGAGAAGTA CGCGAACGGG GATTTTTACG
106101 ATACACATCA CCAATGTAGG CTAGAGAAAT CTCGAAATTA TTTTTTTCAT
106151 CTTCGGAGAT TTTTTCTAAC CGAATGCCGA CAGGGATAGA GCAGTTCACT
106201 AGGTCTCCAT CATCAAAAGC ACGGGCTTCA GCGCCACTCT CTTTAAAGTT
106251 TTGTTGGCGG CTGTAGACTG CCTGGAACTT TAAGAAGGGG AAATATCCCT
106301 GGAAGAACGG TGCTTCTTTA GGGAGATATA GAGCCAGAGA TCCTCCGAGC
106351 TCTAGAGCCC CAGAGTTATT GGTCCAAGAG CCTTGAGCTT CAGGATAGGA
106401 AGTATAGCGA GTATCCATAT CATTTTTAGT GTAGCTGTAG CTTAGCTGGG
106451 CATTCAAAAT GAGAGGAATA TCTTTCAGCA TGTCGGTGAT ACTTCCAAAT
106501 GAGGGCATGG GAAGTCCTCC TAGGAATGCT CGATGTTGCA GGTATAGCGA
106551 CGCTAAATAG TTATGAGAGG TATTTTCAAC TATAAACAGG TCTTTATCTT
79

CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
106601 TACCGAAGAG CTGGCAGAAA GCTACACTGA AGATATTTTC AGAAAAATCT
106651 TCAGCACTTC CTCCAACAAT ATAGCCGTAG CTTTTATGTC GGAATGCTTG
106701 GTTAGTTCCT GATTTATCCT TATGGAAGAA ATTCGCAGTT CCTGATGCCC
106751 AGAGTCCTCG TTGCTGATAG ATACTATTCG CTTGAGATGT CATGATCTGC
106801 TGTAGAGTGC GAATGTCAGT AAAGGATGCC CATAATGAAT CGGGAACTAC
106851 GGAAGCTCTA CGCTCAGGAT TAGGGTTGTA GCCCGTAGTT ACCCAAGTCA
106901 TAGTTCCTGA TTTTGCAGTT GATGTGTCTG CCCAAGTGGC TTCCCAATGT
106951 CCCTGATACC CGTAATGAGG TTCTGGAGTT TGTACTGGAG AAGTGAGAAG
107001 CGCATCGATA TAAATATCGC TAGCAGCAGT AGCAGCAGTG AATACCACCA
107051 AAGGCTGCGT GAAGGCTTGG TTTATCGTAT GGCTTTCATA AAAATTGCCG
107101 CTACTATCTT GGAAAACAAG AGGAGAGGTT AGAGTTATAG TTTTGTTGGC
107151 TCCTGCTGTT TCAATGGACA CACTCTTATT TCCCTCTAAG GCAGAAAGAT
107201 CAACGACAAG TTTGGTAAGA CTGATAGCTT CAGTATCTGC TTTGAGCTTT
107251 GTTCCTGGTT GCATGAGGAG TGTAGAGCCT TCAGTCTGTG TGAAACCATT
107301 GACATCTAAC TCGACATTTC CTTTGAGTGC TAAGGTTCCA GAGGCTAGAG
107351 CCAATGGTTG CTTTAATATA GATGTGAAGT TATCAGCAGC TTTCGCTTCA
107401 TCTGCAGAGA GCTTTTCCCC AGAAAATACA ATCGTTCCTG AATAATCTAA
107451 AGGCGAGTTG CTATCCGGTT GGTTGATGGT CAGAACGTCT GAAGCTCCTG
107501 TGGTGTTAGA TGCAATCGGA TCATAGAAAT AGATAGATTG GCCTTGGGCT
107551 GCCCTTAAGT TCGTAATTTT TGCTGACGAT CCCAGGTAGA TAGCATTCCG
107601 TGTCGATGTT GGCGCGGAGG TTGAGGTTAG AGTGTTGCCA AGGAACGTGA
107651 TGTCTCCTTG ATTTGCAGAG AGACTTAAAG ATCCAGAGTC GGCAATTGCA
107701 ATAGCGCCGC CCTTGCCTGC AGCTGTGTTC CCGCATCTAT TATTTGAAAA
107751 TAGGGTAGGG CCAGCAGCGG AAAGATCTAG ACCATGGGCA CAGATTGCTC
107801 CGCCTTGAGT TACTGAAGAG TTCTCGGCGA AGGTCAGACT TTTATTTCCA
107851 GAGATAGTAA GAGTAGGAGT CTCTCCTGTT TTTTCACAAT AAATGGCCCC
107901 GCCCTTGCCT GCAGCATCTG TTGCAGTGTT TCCAGAGAAG AAAAGGGAGC
80


CA 02350775 2001-05-11
WO 00/27994 PCT/US99I26923
107951 TATTTTGAGT AATCGAGGAG CTGGCTTCAA AGCCCAGAGC CCCACCCCCA
108001 GTTTCTCCTT TATTATTCAT AAAGACTAAC TGGCCGGTGT TTCCTGAAAT
108051 ACTTGCAGCC GCAGAGCTAT AGATCGCTCC ACCTAATTTT TTTGCGCTAT
108101 TACTAGTGAA GGTTATAGAA GAGGTATTCC CAGAAATAGA AAGAGTTTTT
108151 GTGGTGATCG CTCCGCCATT GTTATTAGCT TCATTGGAGA CGTTTTGGCT
108201 AAAGAGAATC GTTCCATTAT CGGTAAGATT TAAGGCTCCT GCAGAACTTA
108251 AAGTACTTTT TCCTGAAGCA ACTGTAGTTC CAGGAGCTGC AATGAAGGAA
108301 AGGTTAGAAA ATCCTGTGAA TGTTAGGGCT TTATCAGCAG TTGTGCTTGC
108351 CGCAGCTCCT GCATTCGAAC CCGCATCTAC CGTGTTGAAT GAAAATGAGT
108401 ATCCCTTTCC AGTAAATGTC AGATCACCCG TAGTTTCTGT AAAGCAGCAG
108451 CCTGTTAATG CTGTGCCTTT CCCAGCATCG TTTATATAGA CATTTCCTGA
108501 TAAGACATAG TTCGTTCCAT TGGCATCTGC TGTAGATTTT GGAGTAAATG
108551 TAGAGCCGCC CGCTCCATCA AAGCTATCTG TAGGGGATAA AGAAGCATCT
108601 GCTCCGTAAG TTGCAATGCT CAATAGAATG GGAGTGACAA GAGTCGAAGA
108651 GATCAGGAGT TTGTGCAAGG GTATTTTCAT AGAAAGATGC TTGGGTTCAA
108701 TTAATTAACA CGTTTTCGAT AATCTAGAAA CAAAACTTAG AGCCTAGGTT
108751 TGTATTATAA TTTCGTGAAG AACTTCGTAC TTCAAAAGCG AATTGACCGA
108801 AGATTTCCAT GTGGGGGTTC ACTTGGAAAT GGTTCGCAGC ACGAACAGAA
108851 AAACCTTGTC GTGCGAGGTT GGTACCATAG GCCATCCAGT TAGCATCGCT
108901 AGCTATTAGG GAAGTTTGAC ATTTAGGATT GCGTCGGTAA GCATCGAGTA
108951 TATACATAAG AGTAAGATCG TAAGTTCCCT TTTCTGATTT TGAGTCTCTT
109001 TCGAAGGTGA CGCCTATAGG AATCTCTACG TTGATAAGCT CGCTTTTATT
109051 GAAAGCGCGT CCTTCAGCAT GACGCTCGTA GAAGTCTTGC TGATGCGCAT
109101 AGATATACTG TACTTTGACA AAAGGTTCGA CTTCTTTCAG AAGATACGGA
109151 ACGGAAATAA CAAAAGGCAG GCTAGCTCCA AGATCTGCAC AGAAGGCATC
109201 GTTTCTCCAA GAACCCTTGA TGATAGAGTT ATCGGTATAA TATGTCTTCA
109251 TGTGGTTGTC TGTATGGAGA TAACTGAATT TAGCATCGAA CGATAAAGGA
81


CA 02350775 2001-05-11
WO 00/Z7994 PCT/US99/26923
109301 ATGATCTGGG AGATCTCAGA GAGCACCCAG GGAGCTCGGG TTGCTTTTCC
109351 CCAGAGGAAA TTGGCGATGT CGAAGAGCCC TTCTGTATGG TGGAAATACA
109401 AAGAGGCACC GTAAGTATCT CCGTGGTTCT TACCTGTAAT ATGATTGCGA
109951 TCTCTAGCAA AGAGCTGGCA GAAGGCAAAA GTAAGCTGAT CCTCGGCAGG
109501 AGTTGTTGCT GTGATCCCTA GTGCATAACC CCCGCTGATA TGGCGGAAAC
109551 CATGGCGGGT GGGCATAGAA TCTCTATAGA AGAAATTCGC AATTCCTGAA
109601 AGCCATAGCT CACGCTCAAA AGGCTCCCCA CTGGACTTGG TTTCTATAAG
109651 CTGATTGATC GAGCGTATAT CTATAAAGTT TCCCCATAAG CTATTTAGAG
109701 GGAGATTACT TTTTCTCTCA GGACTAGGAA TGTATCCTGT ACGGGTCCAG
109751 TTGATGCTTC CTATTTTTGA GGATGTTGCA TTTGCCCAAG ACAACTGCCA
109801 GTTTCCTTGA TACCCGTAGT GGGTTTCAGG TTCTTGAAGA GTCAGGGTAG
109851 AAAGAGCTCC CAGAGTAATC GTTCCGTTGG CTCCTGCGGT GGTAAGTTCA
109901 AGAAGAGGAT AGGTACTAGC ACTTTTTAAG TTATGATTCT CATAGAATGA
109951 CCCTTCCGTG TCAATAAGCG CAATCGTTCC CGATAGGCTG ATATTTTTAT
110001 CTGCAGCTTC TGTTTTTAAA GCTGCCTTGT TGGTTCCATC TAAAGAGGAG
110051 AGATTTACTG CTAAGCCATT AAGCGAAAGA TTTGCCTCTT TAGCACTAAG
110101 TGTAGTCCCC CCATCCATTA AGATGCGGGA TCCTGGACTT TGAGTCAGAT
110151 CCTTGAAAGT TACGGTGACT CCATCACGAA GTACAAGATC TCCCCGCGCT
110201 AATACTGCAG GTTGTCGGAT AGTAGAGGTG ACGTTTGCAG CGATTGCTTT
110251 TTCTGTAGGG GAAAGCTTTT CTCCAGAAAA GACAATCGCA CCCCCATACT
110301 CGATCTCACT GTTCGCATCT GCTAAGTTTA AGTTCAATG't' GTCGGTAGAA
110351 GCTGCGGTTC CTGGATTTGT GATGGGATCA TAGAAATAGA TAGATTGCCC
110401 CGTAGCAGCT CGTATCGATG TGACTTTAGC GGTATCAATG ATATTTATTG
110451 CGTTTCTTGT ACTTGTGCTT CCGTTGGTGA CTTGGTTGTT ATTGAAGGTA
110501 ATATCTCCAG AAGTAGCAGA GAGAGCGAGT TCCCCAGCAG ATGCTATATT
110551 GATCGCTCCT CCTCCTCCCT GACCGGCGCT ACTTCCTGAG ATATTACTTT
110601 GAAATAGAGT AGGACCTCCA GCGGAAATAC TGACCTTGAG TCCAGAGATG
82


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
110651 GCTCCGCCAT ATGTCAATGC TGTATTATTT GTGAAAGAGA GGTTTTTGTT
110701 CCCAGTAAGA GTCACTGTTT TATCTGTCGT AGTGCAACAA ATAGCCCCGC
110751 CCTGAGCTTG AGCGGCTTCC CAAGCACTAT TGCCGTCAAA GATCACTTGA
110801 AAGTTATCTG TAATCGAACA GTTGTCAGTG CTGTACAGAG CACCGCCAGA
110851 TCCTTTCGCT AGGTTTTGAG AGAAGGAAAC TATCCCAGGG CTGTTCTCGA
110901 TAGTTATAGT TCCTGTAGCG TAAACTACAC CGCCTTGCTT CCCTGTGAAG
110951 GCTTGGTTTC TCGAAAAGCT CGCAAACTGA GATGTCCCTG ATAATAAGAA
111001 GTTTTTCGTA TTGATAACAC CGCCGTTATC TGACGAGAAG TTCTGAGTAA
111051 ATATAATTTG GGAATTGCCA GTTAGAGATA GATTCCCCAC AGATTTTAAA
111101 GCACATTGTC CAGTAGGAGA GAGAAGAAGA GAGGGACAAG AGATAATAGA
111151 GAGTCTAGAA AAATCATTAA AGAGAAGATT CTTATCTGCT GCTGAGGTAC
111201 TGGCTACAGT TCCAGCGCTA GAGCCCGCAT TGATAAATGC AAACTTCAGT
111251 GCATGTTGAT TTCCTTGGAA AGTAAGATCG CCGCCCGCTT CTAGGAAGCA
111301 TCCTGAGGCT AAGGGAATTC CTAAAGCCCC TGCATTTTGA AAGGATACGT
111351 CGGAAAGTAA GGAATAGGTA GTTCCTGCAG CAGCGTCCGT AGTGGAAAAG
111401 ACCGTGAAGG TAGTTCCGTT AGATCCATCA TAGCTATTAT TGCTGCTATC
111451 TAAGGTCACC TCTGCCGCGA CTATAGAGAG CGATGAAAAG AGCGGGATTG
111501 AAGAAAAGAA CAACCAAGAG ACAGAGGACT TCATTTGTAA GCACTTTTTT
111551 GAAACAAGGA AATTAAATTA GCAAATACTG TAAAGAAAAA AAGAAATCAA
111601 GGGAAACGCA AGGAATTGAT TGATGCGGAG AATCAGAACC CCAAGGATGG
111651 CGGATCTTTT ACTTCTCTTC ATACGGATCC TAAGAATCTC TTTGATGAAG
111701 AGGGGATGCC CTCCCCCTCT GATACCCTAC AGTGCGATCT CAATAACGTA
111751 TTCATCTTTA~TAAAAAGTAT GTTTTTCTAA GATTCTCGGA GAATCTTAGA
11180'1 AAGAATAACG AGTTCCACAG TTTGCATTAT AGCTTCTTGA GGAGCTGCGC
111851 AGTTCACAAC TTCCAGAAGC GAAGCAGTCA AGACCATGAA GTAACTTCAG
111901 ATGTCCAGAA GCCTCAGCAA AGAAAGCTTG TCGTGATAAG TTTGTAGCAA
111951 ACGTAGACCA CGAGGTGCCA TTTGTTAAGG AGGTCAGGCA GTGAGGGTGA
83


CA 02350775 2001-05-11
WO 00/Z7994 PCTNS99/26923
112001 TCCCGGTAAG CATCTACAGC GTAACCTAAA GTAAGAAGCA AAGCACTGGG
112051 GGGCTTTGCT GATTCGTGTT TGAAGGTGAG TCCCATAGGG ATAGACACGT
112101 TGACCAGATG GCTAGCGTCA AAGATACGTG GATCAGCAGC AACCTCTTGG
112151 AATCCTTTTT GATTTACACT CACAACTTGG AGTTTCACAT AGGGAGAGTA
112201 GCTGGTAAGG TATCTGTAGT TTAGATCTAC AGGAAGAGAA CCACCGACTT
112251 CAACAGCGAA GCTATGGCTG TCCCAGTCTG ATTTCCCTTG TGTGTTGTTC
112301 GCAAGCTTTG TCGTCATATT ATGGTGGTTT CTTCCATAGG AAACTTGACC
112351 ATGGAGAACA AGGGGAGTTT CTCCTGGGAG CTCTGGAAGG ACCTTAGAGA
112401 GGACGTGGCG ACGTAATGAG CTATGCAGGG GAATGACATA AGAGCTCTGA
112451 GCACAGAGAG ATCCTGCATA GACTTGAGAT TTAATATCCG AGACTACGTA
112501 ATCCTTAGAT TTGCCAAAGA GTTGGCTGAA TGCAACAGCA AAGGTATATT
112551 CTTGAGGGGT GGTCATGCTG CCACCAACAA TATAACCTCT GGAAATCAAA
112601 CGGAATCCTG CATTTTCCTT TTGCTTGTCT TGATGGAAGG CGTTGCCAAT
112651 ACCTCCAATC CAAATCCCTG GATGTGAGGG AGCGTCCGAC ATCGCAGTGG
112701 CGATCTCCTG CTGTATAGAA TGGATGTTTA CATAAGCATT CCAAAGGCTA
112751 TTAGGAACTA AAGTCGCACG AAGCTCTGGT TTAGGAGTGT ATCCTAACGC
112801 TTGCCATTCC GCGACCAAAG TCACCTTCCC TCCAGCTCCT ACTTTAGGAA
112851 CCAGAGTCCA ACTCCCTTGA TACCCATAAT CCGGAGCAGC CATGCTAGAA
112901 GGAATCGGAT TGAAGTCGTC TAAATTTACA GTTCCTGAAG TAGAAGAAAG
112951 ATCTAAGAAA GGAAGATTTA AGTTTGCTTT CAACCCAGGA TTGTCATAGA
113001 AACTTCCTTC ATTGTTATGG AATTTCAGAT CCCCTGAGAT TTTTAATCCC
113051 CCACTTGTGC TGTTTACGGC AATCGTTATC ATACGCTTGC CATCTAAAGC
113101 ATCCAGATTT ACAGAGAGAT TCTTTAGATC GATGCTGCCA TCTGTATTGT
113151 TAGTTGTCGT GGTCTCTAAG GTCGTTCCTG CATCCATGAA TACTGTAGAA
113201 TCAGGCTGCT GTGTGAAGGA ATATACTTGT AGGGTGGCTC CTTCTTTTAA
113251 AACGACATTT CCTCCTGCTA AGTTGATCTT CTGGTTCAGT ATGGTGGTAG
113301 TATTTGCAGG AATCGAGGCA TCTTGACTGG GGAGTTTTCC AGAAGAAAAT
84

CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
113351 ACTATAGTTC CCGTGTTTGG GTTTGCAGGT GCTACAGGGA CTACAGGCAC
113401 TGAAGCTATA GGACCATTTT TTGGTTGGGG AGGAGGAACA ATAGCTTTGA
113451 CAACAGGATT GATGACTAAC TCCTCTATTG TTCCTCCAGA TGCAGGAGCT
113501 TCCATCGTAA TAGGATCATA AAAATAAATC GTATGACCAG GAGCTGCTGC
113551 AAGCTTAGTG ATCTTAGCCC CTGCACCTAA ATGGATCGAG TTGGGAGTTG
113601 AAGTTCCCTC AGTCGCTCGG TTCCCTGAGA AAGTAATATC CCCATCAATA
113651 GCCTCTAAGG AAAGTTCTCC GCTATCGGCT ATATAAATGG CGCCTCCCTT
113701 GCCTCCAGAA TTATTGGTAA AGGAGACAGG ACCGTTAGCT GTAATCGAAA
113751 GGTTTTTCGA ATAAATCGCT CCTCCCGAAG TTTCAGCAGT ATTGCCATCA
113801 AAGTTTATGG ATTCACTGCC TGAGATTACA CACTTAGGAG CATAAATACC
113851 ACCACCACTT CTTTTTGCCG TATTGTTAAT GAAACTTAAA CTCTCATTTT
113901 CAGTAAGAGT TAAGCTTTTT GTAGCTATGT CAGACTCTGA GATATTACAG
113951 AGGATCGCTC CACCACAACC TTCTTGATCT GTAGTTGTTG TTGCTGTTGC
114001 TGTTGCTGAA TTTCCAGAAA ATACAAGAGC CTTATTTTTG GTAAAGGAAG
114051 TATTTCCTTT AGTATGTAGA GCCCCTGCTG TCTTTGCTGT ATTTGTGCTG
119101 AAGGTCACGG TTCCCGTACT TCCCGTAAGA GTAAAATCTT CGGTTTCTGT
114151 ATAGATCGCT CCTCCTGCAG TTTCGGCAGT ATTGCCATCA AAGGTAAGAG
114201 TCGTGTTTCC ATGCAGAGCA CACTTGGTCG CATAGATCGC ACCGCCACTT
114251 ACTGTTGCAG TATTACCAGA GAGACTCACG TTTTCGTTAT CTTCAATCCA
114301 GAGTCCTTTT TTAGTACTTA CAGATGCTGA CTCAAGAAAC GATAGGATTG
114351 CCCCACCGCA ACCCTCTTGA TTTGCTGAAG AATTACTCGG GCCCGTAGCT
114401 TTGTTCCCTG AAAAGAGCAG GTTGGTATTA CCAGACAGAG AGTTGTTGCC
114451 TTTAGAATAT AAGGCGCCGC CTGTCTTTGC TGTATTTGTG CTGAAGGTCA
114501 CGGTTCCTGT ACTTCCTGTA AGAGTAAAAT CTTCAGTTTC TGTATAGATC
114551 GCCCCTCCTG AAGTTCCAGC AGTATTGCCG TCAAAGGTCA GGGAGCCGTT
114601 TCCAGTTAGA GTACATTTGG TAGCATAGAT CGCACCACCA CTTACTGTTG
114651 CAGCATTACT AGTGAGGCTG ACTTCTTGGT TGTTTGCAAT CGATAGTCCT
85


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
124701 GTTTTATCGC TTACGGATCC TGAATCAATA AAGGCTAGGA TTGCCCCACC
114751 GCAACCCTCT TGATTTGCTG AAGAATTACT CGGGCCCGTA GCTTTGTTCC
119801 CTGAAAAGAG CAGGTTGGTA TTTCCAGTCA GCGAGCTGTT TCCTTTAGAA
114851 TATAAGGCGC CGCCTGTCTT TGCTGTATTT GTGCTGAAGG TCACGGTTCC
114901 CGTACTTCCC TTAAGAGAAA AATCTTCAGT TTCTGTATAG ATAGCTCCGC
114951 CACATCCTGC TGTCGCAGTA TTCTGATCGA AGGTAAGAGT TGTGTTTCCA
115001 TCCAGAGTAC ATTTAGTAGC GTAGATCGCT CCACCATTCG CAGTTGTTGT
115051 ATTACTAGTG AAGCTCATTT CTTGATTCTG AGAAATGGCT AATCCAGTTT
115101 TGTCTGTTGC TGTAGCAAGA TAACAACAGA TTGCCCCACC ACAACCTTCC
115151 GGGTTATTTG CCTGTGCTGC TGAGCCGGTT GTTTTATTTT CCTGAAAAAG
115201 TACTTGAGTG TTGCCGGTAA GAGCAAGATT GTCATCAGAG CTCCAAGCAC
115251 CCCCCGTCTT TGCAGTATTA GATTTGAAGG TAACGACTCC TGTATTGGCA
115301 TCTAGCGTGC TATCCTTTTC TTTTGAGTAG ATCCCCCCAC CTTTATCTGT
115351 AGCAGTATTT GAGGAGAAGG TCACCGTTCC TGAGTTTCCT TGGACTGTAG
115901 TGTTTGCTGT ACTACAGAGG GCCCCGCCAT TTTTTGTGCT AGTATTTTGA
115451 TCTAAGAGAG CTGCTGTCGT AGTCTTAGCA AGATCGATGC TGTAGGCAGA
115501 AACTGCAGCT CCATCTTTTT CTGAAGTATT TTTTTGGAGG GTGACACTGG
115551 CATTGTCAGT AAAAGTCGCA GTACCTCCCT CTGTATTTGT CACACAAATA
115601 GCACCCTTGC CGCCCGAAGT TCCTGTTGCT GGAGCTGAGT CGATTAAGAG
115651 TGACGAGAAT CCTGAGAAAG AAAGAGCTGT GTTGGTATTG TTAATTGCAG
115701 CACCATCATG CGTAAGCGCT ATGGTTTGCA GAACCAATGA GTGATCAGCT
115751 CCAACAAAAC TCAATGCTCC TCCTGTGTTT GTAAAACAGC TTTTATCTGC
115801 AGGAGTAATT~GCAGATACAT TCGTAATAGA AACATCGCTA GTGAGAGTGT
115851 AGGTAGTTCC TGAAGCATCC GAAGTTTCCT TGGCAGTGAA TGCTGCGCTA
115901 CCACTACTAC CATTTTCATA GTTATCGGAT GATGAGAGAT CCGTGTTAGC
115951 AGCCATTAGT GGATGTAGGG AGAAAACTAA AGCCGAAGAG GTAAGTAGCC
116001 AAGGTAAAGA ATATTTCATG TGTCTTTGGG GAAAAGCTTT TTATCAAAAA
86


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/Z6923
116051 TACTCCCATA GCATGTGGCT TTAGGAGCAT GGTGCACCAA TAGAGAATAC
116101 AGTTAAATAA ATCAAGTAAA TGCTCTGGAG AAGACTCTCA GTTATAGAAG
116151 TTTCAATCTT GGGAGAGAAG CATTTAAGGT ATTTTTCTAT ATTTAAGAGT
116201 CCCTTAAAAC ATAAGGGAAA TGCTTAAGGG TAGGGGAGAA GGTGTACAAG
116251 CGGTTTTGCT TTTAGACCTT CTTGAATTTT AGAAGGAGAG AGTAAGGAAG
116301 ATGGTTATCT AAACCACGGA TCCTTATTTG TTCTTTCTGT GTTTCCGTTT
116351 TTCTCTTTCG TCATAACCGT CAGAGAATGG ATTGGAGGGG CTGAGGTTAG
116401 CCGTGCTTTT TCCGTCTGGA GGCGGAGTTT TAAGAGATTG ATTCGATTTC
116451 CCGCTTTTCT TCTTAGATTG CTGTTTCTGT TCTTCATCTT GATTTTGCTG
116501 TTGCTGCTGT TTATCTTGGC GATCACGAGG GGAACGGCGG GAAGATGAAG
116551 AAACTCGAGG AATTGAGGGT TCTTTTCCTC CCTTAGGGTA GACCGGCTCA
116601 GGGTGTAGAA CCGTCCCCGT GGAGAAATTA GGAGAGCGGG AACTTGCAGG
116651 CATTATGGGT GTAAACGCAC TGCTCGCTCC ACTACCGAAT GAACTGCTTC
116701 TTAAATCTTT GAAGTGATAA GGCTGAAAAT CGTCCTTAAA TGGAGGACTT
116751 ATAGATGCGG GTCGTGTAGA AGCCGCATCT GAGGGTTTTG TTTTAAAACG
116801 TGAGAGGCTG CCGAAGGAGC GTCTATGCTC AGGATTTCGT GACGAATAGA
116851 AGAAAGATCC TTGAGCCTCC CGACCGATCC TACGATGACG GGCATCTTGG
116901 CGTTGTGAAG CTTCGTGTTC CTCCATGCGG GCAGATGCAG AAAGATCTTT
116951 TTTCTTTTCC ACTGCGATTT TTTTTGCATC GTCGGCGGAT GGAGTCACTA
117001 GAATAGGCTC CGCGGTCTCT TGGGTTTTCT TTTCCATCCT CAAGAGGTGT
117051 TCCCTACGGA AATAATCTAA GGTGAGCTTA TTCAAGGACA TGAAGTATAG
117101 CCCCACCGTA ATCAATCCCA TGACAAACAT AGGGTTGGCA AAGACTAACA
117151 TCGTCCCACT AGAGGCAACG AAAGCCCCTG CAATAAGACC CGCAGCAATC
117201 GCTAATACAA TGATAGGAAC GACGATAGCA GTGATTGCCT CACCGATTTT
117251 CTTGGCCTTC GGACTGTCCA GAATATCAGA AATAAGTAGA GTAACTCCCA
117301 AAGCTCCCAG GGCAAGTGCC GGAGCGAGAG CATAAAGCAT AAGGCTGTTT
117351 CCTGAAGCAG TCAGGAGAAT CGAAAGGATC GCAATTGCCG CAAGGGCAAT
87


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
117401 AATGCCTGTG TCATAGACAT AGCGCATTTT AGGATATTTA TCAGGAATAG
117451 TAATAAATCT TTTGAATAAC CCTGTGGATG ACGTTTTTAA ACGTTTTACC
117501 ACGCTTGGCT GCCCTGATGG GGATGCTGCT GCAGGAACTT GAGGTTGACC
117551 TAAAGGGTTT ATAGGGGGTT GGCTCATACT ATCAACTTAC TGTAATTATC
11?601 ATTAGGCCCA TGAATTTTCA TTCATAGGAT ATATTTCATA CTATTATAAG
117651 ATTTAATAGG ATTTAGTTAG TTCTCTTTTC TTCTGAGTCT TAACTTTTTT
117701 ATTAAATAAA GTTTATTTGT TAAAATCTTA ACAGATTTTT AACTAAAACT
117751 TTAAGTTATT TTTATTTGGA ACTTTTAGTC GAAATAAGAC TCGCTTATGA
117801 GAGGGACATA CTCATCAGCA AATGGAGGGG GCGTGTGAGG TCGTGAGGGT
117851 GGAGGCTCAT ACGGGGGGAA AAGATTTTTA TGCACTTGGA AGAGGGTAAC
117901 GCTAGTAGTT ACAGACATGA GAAGCCCCAA GGAGATAAAG ACAACTGGAG
117951 GGGGGATAAA GACCAGACTA AAGACCAGAC CGATAATCAC AGCAATCGTA
118001 AGAATGTGGA GGAGCCAAGC CATAGCATAC GAAGCAAGAT GTTGGCAGCT
118051 TTTTGTTTTT ATAGCGTTAA AGAGAGCCCG TAGGGGTAGG GTCAAGGCAG
118101 CATATGTAGA AGCGCAAGGG ATCAGGATAA CCTTAAGAGC TCCTAAGACC
118151 GAGGATACCA GTACTTCTAT GGTAAGAGCA GTTCTGGGAT AGCTCTTCGC
118201 ACAGTTTTTT AATTTTCGAG CTAGTATTCG TTCCGGAAAA ATGCCATTCA
118251 GGTATAGCTG AGAGCCTTGT TTGCAGATAT TTTTGAATCC CATATCCGTT
118301 TGAAAGAGAA TATTTTATGA AAAATTATGT AAAAATTCTA AGAGGATAGT
118351 GGTTTTTAGA CAATCGAAAT TCCTGAAAAG GCAGGAAAAT GAGAGACACA
118401 AGTAGACAAA ATCTCCTGAA GTTTTTGTAT GGGCCTGTAA AAAAATCTTT
118451 CTGGAAACTG GAAATTAGAA GTTCATTACA GCGGAACCAC CGAAAAATGC
118501 GGAGGTTTTA AAAGTATGGG ACGTTTTATT ATTGTTGTGA TCATCAGCTA
118551 TCGAGATATC ATTCCCAATA GACCAACCGT AAAATCCCTT AATAAAACTT
118601 CCCGGCCAAG GGGTGAGCTT CACGTTAGCT TCGATTTCAC GTCCTTGATA
118651 TTCAAAGATG CCGCGAGAAG AAATTAGGTG ATTTTTGTGA AGTTTTTTTC
118701 GGAATCTTGT AAGGCGGTAA GCAAGCCCTA AGTTACAGAC AGATGTCGAG
88


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
118751 CGGTAATCAA TAGAAAAATT CACAGGATAG ATGCAATTGA GAGTTAGTTG
118801 GTCGGTAGCC TTGTAACTAA CACCTACTAA AGGCCAAGCC TTCTCTTGAT
118851 GGAGGCCTGT TTCATTAATG ACGCCAAAAA TAGCAGAAAG CTTCTCAGTG
118901 GCCTGGTATT TTCCAGAAAG AACTCCTTGA TAGAGTCCAT AACCCATCTC
118951 AATATTTTTA GGATCCACAA GCCCAGAAAG AATGATAGAC CACTGCCAAT
119001 TTTTTAAGGA GAGTGTATAA GCTCCTAAAG AGAGGAGAAC ATAGTTATAA
119051 AAAGAAGTAT CTTGGAAAGT CGCCCACCCA AGTCCATTAG GATCTGTCTC
119101 CGAAATAGGA AGTGAGCTTT TCCATTGAAT ATCCGCACCT ATATAGCCAG
119151 TAGAAAACAG TAGCCCAGAA TGCTCTGTAA TCGGAAGTGT GCAGAGAAAC
119201 GTTCCATCGT ATTGACGATA GCCTATAGTT TGATGAGGCA GCTTTTTAAA
119251 TTTAGCATCG TTCACCTTTA GGTATTGTAC CTGAGCAGAG AAAGGACGTG
119301 GAGGAGGATT TTTACATGCT TCTTCATCAA TTCCACAAGC ATCTTGAACA
119351 ATAAAAATAG GAGTCGAGAG TACGTGTCCC GCAAATGCAG CGATGTGGAA
119901 GAGCAGTTTG AACATGTTCT GTAAGATTCT CCAACGTTAC TAGAGATTGA
119951 AATGGAATAT ACGTAATTTT TAAATTACTA TCTAATAATT TTCCTACTCA
119501 GGAGGGACTT TCAAGAAGAT TTCCCTAGAT TTAGGGGGAT GAAAGGACTA
119551 GAATTTTTCT AATCAGCAAT CGAACAAAAT TACAGGGTTC TTGGAGAGGT
119601 GCTTAATACT ACGCATAAAT GATTTCTGAG ATGATTTCCG TTTCGGCTGT
119651 TTCTGTGCTA ACTTGTTTTG GCACTCAGTA ATTTTAAGCT CAAGATTTTG
119701 GATCGATTCC TTTTTCGCTT CAATTTCATT TAAAATTCTT GTTTTTTCTT
119751 GATTTTGTGT TGTCAGCTCT GGAAGATGCA CGCTTTTAAT TGTAGAGAAC
119801 ATCAACCCGT ATCTCTGAAC AAATCCTGAT CCTATAGAGG AAATGTCTTG
119851 AGATATCTCA TCAATCAGTT TTGTTCCTGT CTGCTTATTT TTAAGAGGGA
119901 TTGCTAGAAG AAGAGCAAGA ATCAGTGTGA GTACGATAAT TCCCAAGCCG
119951 ATGCCACAGA TGATCCAGTT TCCAGTATAT CCTGCGTAAC CTGCTGAGAT
120001 CGTTCCCCCT AAGGTTAACA GTAGTGTTAA GGCAAAACAG AGCGTATGCT
120051 TTGTGGAAAG TTTGGAGCGA AAGGTCTCCC AAGGAGCCGC TGGAATCGGA
89


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
120101 TGAGGCTGCG TAATATAAGC ATTAGGAAGA TTGAGAACTT CTGTAGCATG
120151 AGAGAGAGGG CTGCTCTCGG GGACTGGTGA TGGGGCTACG GAAGTTGCCA
120201 TGTTGTTTCC TCGAATCGTT GCTAACTAGT TTTGATTTGT CTTTTCATTC
120251 TTGTGAGAAA GCTCAAAACG TTTTTGATAA AGAGTTGATT GCAGTTTGAA
120301 CTCTTTTTGT GAGAGGTCGC AGAGTTCTTC GTATAACTTT AGCGTATCTT
120351 GAGTGGTTTT TCTCTGTGCA TTGGTTACAA GCTGTAGAGA GGATTCAGGT
120401 CGAATCTTCT CCCTGATAAA TTGTACTGTC GTATGGAGCT GCTGAGGGAG
120451 CTGCCGGATG AATTTAATAA ATCCTACCAA GGCTTGTAGG CAGAGAAGAG
120501 TCAAAATAGT AAGAACAATG CCAACGGCAA TCAACAGAAT GCTTTGGCTA
120551 TAGCAACCCA AACAGATAAT AGCGATTCCA GCAAGAGCTA CAATCACTAA
120601 GATCGTAATC GCAGCAATAT GCATAGGAAT GGAGTGGGTG AGAAAAACAT
120651 TGCCCTTCTC CCCCAAAGCT ACGATCTCCT TATTCGTAAT GAATAAATCA
120701 GCAGACTCTT CCGGAAGGGA TGAGGGAAAT ACCCCGTTTA AAGTACTAGA
120751 CACAAAGAGA ACTCTATTAT TTGAGGAAAT AATTTAAGAA AAATGGTATT
120801 TTTAGTCAAT TAGTAAGCGA GTCATGCCTC TTAGTTATTC AAATTTTTAA
120851 AACCTTACCC TTCCTATGAG GAGACAAGTA AGAGAAATTA TGCAACAAAC
120901 TGTAATTGTA GCAATGTCAG GAGGCGTGGA TTCTTCTGTC GTTGCCTATT
120951 TATTCAAAAA ATTTACCAAT TATAAGGTTA TTGGCATCTT CATGAAGAAT
121001 TGGGAAGAGG ATCGCGACGG CGGTCTCAGC TCGACTACTA AAGATTATGA
121051 TGATGTCGAG AGGGTCTGTC TTCAGCTCGA TATACAGTAT TACACCGTAT
121101 CTTTTGCTAA AGAATATAGA GAAAGAGTGT TCGCTCGTTT CCTCAAGGAA
121151 TACTCTTTAG GCTACACTCC TAACCCCGAC ATTCTTTGTA ACCGAGAAAT
121201 CAAATTTGAC CTTCTACAAA AGAAAGTCCA GGAACTTGGC GGAGATTACC
121251 TCGCTACAGG GCACTACTGC CGATTAAATA CCGAGCTCCA AGAAACCCAA
121301 CTCCTTAGAG GTTGCGATCC TCAAAAAGAT CAGAGCTA'~'T TTTTATCAGG
121351 AACTCCTAAA AGTGCTCTTC ACAATGTGCT CTTTCCTCTT GGGGAAATGA
121401 ATAAGACTGA AGTTCGTGCG ATTGCAGCTC AAGCAGCTCT TCCCACAGCA
90


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
121451 GAA,AAAAAAG ATAGTACAGG CATTTGCTTT ATAGGGAAGC GCCCTTTTAA
121501 AGAGTTCCTA GAGAAGTTTC TTCCCAATAA AACAGGCAAC GTTATCGATT
121551 GGGATACCAA GGAAATTGTA GGGCAACATC AGGGAGCTCA CTATTATACT
121601 ATAGGGCAGC GGCGAGGACT TGATCTTGGA GGATCCGAGA AACCCTGTTA
121651 TGTTGTGGGA AAAAATATAG AGGAAAATAG CATTTATATT GTGAGGGGGG
121701 AAGACCATCC CCAGCTCTAC CTACGGGAAT TAACAGCTAG AGAGCTCAAT
121751 TGGTTTACCC CTCCTAAATC CGGATGTCAC TGTAGCGCTA AAGTCCGCTA
121801 CCGTTCTCCT GATGAAGCTT GCACGATAGA TTATAGCTCA GGTGACGAGG
121851 TCAAGGTGCG ATTTTCACAA CCCGTCAAGG CGGTAACTCC AGGACAAACA
121901 ATAGCGTTTT ATCAAGGAGA TACCTGCCTT GGTAGTGGAG TTATCGACGT
121951 TCCTATGATT CCAAGTGAGG GCTAGGGAGA GCAGCTTCCT GCTCCTCTTC
122001 TTCCCTTTCA AAGGCAACGC GATTTTCAAC CAAGGTTGCT CGTAGCTTGC
122051 GAGCTTCTTG ACGGCAGGAC TCTTTAAGCA AGAGCTCCGC TAGAGGATCT
122101 TCAAGGTACT GCTCAATGAC ACGGCGTAGA GGACGTGCTC CCATTTCTGG
122151 AGAATGCCCC TTCGTTACTA GGAAGGAAAT CACAGAGTCT GGGATGTTCA
122201 AAGCCATTTG GTAGTTTTTC AGTCTCGAGT CCAGTTTGTT GATCTCTAAA
122251 TGGATGATCT CCGATAGAGA TTCTTTCTCG AGGGGACGGA AAATCACACT
122301 TTCATCCAAA CGGTTAATGA ACTCAGGCTT TAAGTGTTTC TTCATAGCAT
122351 GTTCGATTTT CTCTTGGATG ACCTTATAGT CCATATGGGA CTTCAAGCCA
122401 AAACCAATTT CTCCGCTTTT ACGAATGAGA TCAGCTCCCA AATTGGAGGT
122951 CATGATAATA ATGGCATGAC GGAAATCCAC TTTGCGACCA AAAGAATCAG
122501 TAAGACGTCC TTGCTCTAAA ATTTGCAACA TCAGGTCCAT AATGTCTGGG
122551 TGTGCCTTTT CTATCTCATC AAAGAGAACA ACGCAGTAAG GACGGCGACG
122601 TACCTGTTCC GTAAGGTGGC CCCCTTCTTC ATGACCTACA TATCCTGGAG
122651 GTGATCCCAT CATCTTGGTA GCAGCAAATT TCTCCATGTA CTCTGACATG
122701 TCTACCTGAA TCAGAGCGTC TTCACCACCG AACATCTCTA TAGCAATTTG
122751 TTGGGCGAGC AGGCTTTTCC CTACACCGGT AGGCCCAAGG AATAGGAAGG
91


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
122801 AGCCCGTAGG TCGGTTAGGA TCTTTGATCC CTGTTCGAGA ACGTCGGATG
122851 GCACGGCAAA TGCTGGTAAC GGCATCATTT TGACCAATGA CTTTTCTTCT
122901 TAACGTGTCT TCTAACTTCA GAAGCTTCTC ACTTTCAGCT TCTGTGAGCC
122951 TTGCTGAGGG AATTCCTGTT TGTAGAGAAA CTACCTGAGC GACTGCTTCT
123001 TCATCTACAG GAACTTGGTG CTCTTCTTTA TGATTTTCCC ATTCCTGTTT
123051 CATACTTTGC AGACGTTCGC GAAGTTTTTT CTCTTCATCA CGTAAACCTG
123101 CAGCTTTTTC GTATTCTTGA GTTCCAATGG CCTGCTCTTT GGCCAATTTT
123151 GTATTTTCGA TTTCAGCCTC TAGCTTCATT AAATCTGTAG GCTGACCCAT
123201 TGTATTCACA CGGACACGAG CCCCAGCTTC ATCTAAAAGA TCTATTGCTT
123251 TATCAGGGAG GAAACGTCCA TGAACATATT GATCAGAAAG AGTCGCAGCT
123301 GCTTTTAAAG CTTCTTCAGT AATGAAGACA TTGTGATGTT CTTCATACTT
123351 TTTCTTGAGG CCACGTAAAA TCTCAATAGT CTCATCTACA CTAGGAGGGT
123401 GAACCACGAT TTTTTGGAAA CGACGTTCTA AAGCTGCGTC TTTTTCTATG
123451 TGCTTGCGAT ACTCATCTAT CGTAGTTGCT CCAATACACT GAATTTCACC
123501 TCGCGCTAAC GCAGGTTTTA AAATGTTTGA AGCATCGATA GCACCTTCAG
123551 CTGCTCCTGC TCCTACAATC GTGTGGAGCT CGTCAATGAA GAGCAAGATG
123601 TTTCCATGCT TGCGAACTTC ATCCATGACA GCTTTGATCC GTTCCTCAAA
123651 TTGCCCTCGA TATTTTGTTC CAGCAATCAT TAATGCTAGA TCTAGAGTAA
123701 TCAGTCGCTT TTTCCGTAAG GCATCAGGAA CCTCATTCAG AATGATTTTT
123751 TGAGCCAGAC CCTCAACAAT TGCAGTCTTA CCAACTCCAG CTTCTCCAAT
123801 AAGTACAGGA TTGTTTTTTC TTCTTCGGCA AAGAATCAAA ATCAACCGTT
123851 CGACTTCTGA AGAACGACCA ATGACAGGAT CGAGCTTAGA CTCTCGGACC
123901 ATCTCCGTTA AATCATAACC ATATGCTTTC AGAGCAGAAA GCTTTTCGTT
123951 TTTGTCAGAA CCTAAGCTAT GACCTAAAGG AGATTTTGAA GATGAAGGGT
129001 TGCTTCGAGA GGATGAGGAA GAAGACGACG ACGAAGGAGG AAGTTGTAGA
124051 TTGAAGGTCT CTAATTCTCT AAGAATTTCC TTACGAACCT CTCTTGGATC
124101 GATATGTAAG TTTTCTAATA CCTGAAGAGC GACACTATCT GATTGATGTA
92


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
124151 GGATCCCTAA GAGTAAATGC TCCGTCCCGA CATAATTGTG CTCTAAAAGG
124201 CTGGCCTCTT CATTTGCTGA TTCAAAAGAT TTTTTTACTC TTCCTGTAAG
124251 GGCAGGGTCT CCGTAGACTT GAATTTCTGG ACCATAACCA ATCAGGCGTT
124301 CCACCTCTTG CCGTGCCGTA TCAAAATCTA TACCGAGGTT GCGTAATACA
124351 TTAACAGCTA CCCCTTGACC AAGTTTGAGA AGACCAAGCA GGATGTGCTC
124401 AGTACCCAGG TAGTTATGAT TTAAACGCTG AGCCTCCTTT TTCGCCAGTT
124451 TAATGACTTG TTTTGCTCTA TTAGTGAACT TCTCAAACAT AAAAACCTAA
124501 AAGACAGGGG TAGAACTTTC CTTAAGCATA TACGAAATTT AAAATAATGA
124551 TGCAACTCTT CGCTCTAAAC CAGCAAATTT GGTAAAATTC CTCTGAGTTT
124601 AAGGGAAAGT TATGCACAAA'CCTTTTGTAT ATGATACAAT AGTTCAGCTT
124651 CTTTTGAAAC AGTCTTAATT AGTTTTATGT TTGTTATATG AAAGTTCGTA
124701 TCGTAGATTC AGGAAAATCT TCAGCGGCCT CCCACATGGC TAAGGACAGA
124751 GATTTATTAG AATCTCTGCA AGATGGGGAG CTCATTTTAC ACCTTTATGA
124801 GTGGGAGAAT CCTTGTTCTC TGACGTACGG TCACTTTATG CGTCCAGAAA
124851 AATTTTTACT TTCCAACTAT GCGGATCTAG GATTGGACGC CGCAGTGCGG
124901 CCTACGGGAG GGGGATTTGT CTTCCATAAG GGAGATTATG CTTTTTCTGT
124951 TCTTATGTCT GCGACACATC CTTCCTATTC TTCTTCGGTA CTTGAGAACT
125001 ACCATACTGT AAACTCTTTT GTAGCGAAGG TTCTAGAGAA AGTATTTCGG
125051 ATCCAGGGAA TGTTAGCTCC AGAAGACGAA AACTCTTCTT CCAGAGATTC
125101 AGGAAATTTT TGTATGGCAA AAACTTCGAA GTATGACGTT CTTTTTGGGG
125151 ACAAGAAGAT AGGGGGCGCT GCCCAACGCA AGGTGCAACA GGGATTTTTA
125201 CATCAAGGAT CCTTATTCTT ATCGGGAAGT TCTTCTGAGT TTTACCAGAG
125251 ATTTTTAAAA CCCGAGGTTC TTGAAGAAAT TATTGAACAA ATCCAGATTC
125301 ACGCGTTTTT CCCTTTAGGT TTGGAAGCTG CTGATGAAGT GCTGCAGGAG
125351 GCGCGTCAGC AAGTCAAAGA GGCGTTTATT AAATTGTTTT GTGGTGAGGG
125401 GTTATGATGA GTCGGTTGCG TTTTCGCTTG GCAGCTCTTG GAATATTTTT
125451 TATTTTGCTG GTTCCTAATT CTGTTTCAGC AAAGACAATC GTAGCTTCAG
93


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
125501 ACAAGGAGAA GGTTGGAGTT CTTGTTTATG ACAATAGTGT AGAGGCCTTT
125551 CAACAGATAT TGGATTGCAT AGATCATGCA AATTTTTATG TAGAACTGTG
125601 TCCCTGCATG ACAGGAGGCC GAACGCTTAA AGAGATGGTA GATCACCTCG
125651 AGGCTCGTAT GGATCTGGTT CCAGAGCTCT GTAGCTATAT CATTATCCAA
125701 CCCACGTTTA CCGATGCTGA AGACCAAAAA TTACTCAAAG CTCTCAAAGA
125751 ACGTCATCCC AACCGGTTTT TCTACGTTTT TACAGGGTGC CCACCCTCAA
125801 CAAGCATCCT CGCTCCTAAT GTCATTGAAA TGCATATCAA ACTTTCTATC
125$51 ATCGATGGGA AATATTGTAT TTTAGGTGGT ACCAATTTTG AAGAGTTTAT
125901 GTGCACTCCA GGGGATGAGG TTCCTGAGAA AGTGGATAAC CCACGTTTAT
125951 TTGTCAGTGG AGTGCGTCGG CCCCTAGCAT TTCGTGATCA GGATATCATG
126001 TTGCGTTCTA CAGCATTCGG TTTGCAGCTC AGAGAAGAAT ATCATAAGCA
126051 ATTTGCTATG TGGGACTACT ATGCACATCA TATGTGGTTC ATTGATAATC
126101 CTGAACAGTT TGCAGGCGCC TGTCCTCCAC TGACTTTAGA ACAAGCCGAG
126151 GAGACAGTAT TTCCTGGATT TGACAAACAT GAAGATCTTG TTCTTGTCGA
126201 CTCTTCCAAG ATCAGGATAG TTTTAGGTGG TCCCCACGAT AAGCAACCCA
126251 ATCCTGTGAC TCAAGAATAT TTGAAACTTA TCCAGGGAGC TAGATCTTCT
126301 GTGAAGCTTG CTCACATGTA TTTCATCCCT AAGGACGAGC TTTTAAATGC
126351 TCTTGTCGAC GTTTCTCATA ATCACGGTGT TCATCTGAGT TTAATTACGA
126401 ACGGCTGTCA TGAATTAAGT CCTGCAATTA CAGGACCCTA TGCTTGGGGA
126451 AACCGTATTA ACTATTTCGC CTTGCTCTAT GGGAAACGGT ATCCTCTTTG
126501 GAAAAAATGG TTTTGCGAAA AGCTAAAACC TTATGAGCGG GTTTCTATTT
126551 ATGAGTTTGC TATTTGGGAA ACGCAGTTGC ACAAGAAGTG TATGATTATC
126601 GATGATGAAA TTTTTGTGAT CGGAAGTTAT AATTTTGGAA AGAAAAGTGA
126651 TGCCTTTGAT TACGAAAGTA TTGTAGTTAT CGAATCTCCA GAAGTCGCTG
126701 CAAAAGCTAA CAAAGTCTTC AATAAAGATA TCGGATTGTC GATTCCTGTA
126751 AGTCATGGCG ACATTTTCTC TTGGTATTTC CATTCCGTAC ACCACACTTT
126801 GGGACATTTG CAGCTGACCT ATATGCCAGC CTAGCGTCCC TGGGTGCGAA
94

CA 02350775 2001-05-11
WO 00/27994 PCT/US99/Z6923
126851 TCTACCAACA GGATCTCTTC TGCAGGCTCT GCAGGGATCC TGCCTGGTTT
126901 TTTTCTCTGC TATCGTTTAC ACTACGCTTT TATTGTTTGG GTAGAGGGTG
126951 GACCTTGTTA TCGTTCTTCT ATAAGCATCA AAAAAAATTT ATCGGCATTG
127001 TCATTGCTGT AGTTTGTGTT TCTTGGTATT GGAGTGGGTT GGGGACGATT
127051 CTCTAGAAAA GGTTCTGCAG AGTCCACCTC ACGTCGGACT GTTTTTACTA
127101 CCGCTTCAGG GAAGCGGTAT GTAGAGAAAG ATTTCATGGC TATGAAGAAG
127151 TTCTTTGCTC ACGAAGCGTA TCCATTTACA GGGAACCCTA GAGCTTGGAA
127201 TTTTATCAAT GAGGGGCTAC TTACTGATTA TTTTCTAACG ACAAGGGTGG
127251 GAGAAAAACT CTTTTTAAAA GTGTACCATC CGGGAGAGAA AATTTTTAGT
127301 AAGGAGAAAG CTTACCAGCC GTATCGTCGT TTTGACGCTC CTTTTATTTC
127351 CTCTGAAGAA GTTTGGAAAT CTTCAGCTCC CCAGCTTTTA GAGATCCTGA
127401 AGGTCTTTCA ACAAATCGAG AACCCCATAT CAAAAGAAGG ATTTCTTGCT
127451 AGAGCCAAGC TCTTTTTAGA AGAGAGAAGG TTCCCTCATT ATGTGCTTCG
127501 ACAAATGTTG GAGTACCGCA GGCAAATGTT TGCTCTTCCC CCAGATGAAG
127551 CCTTATCTCG CGGGAAAGAC TTGCGGTTAT TTGGCTACCA GACGATTCAA
127601 GACTGGTTTG GGGATGCCTA CCTTTCTGCT GCTGTTGAGC TCTTGATCCG
127651 CTTTATTGAC GAGCAGAAAA AAGTACTTCC CAGGCCCTCA AAACAAGAAG
127701 CTCGTGACGA CTTTTATGAT AAGGCGAAGC ATGCCTATAC TAAGATCAGT
127751 AAGAATAAGG AATTTTCCTT AGGATTTGAA GAATTTGTAA ACTCGTATTT
127801 TCAGTTTTTA GAGATCTCTG AGTCCGAATT TTTCAATATG TATCGAGACA
127851 TATTGTTGTG CAAAAGAGCT CTTCTCCTAT TGCAGGGAGG CGTTTCTTTT
127901 GACTTCCAAC CTCTAACTAC ATTTTTCGTT CAAGGAAAAG ATTCCATACA
127951 AGTAGAGTTC TTTAGACTCC CTAAGGAGTA TAGCTTTAAA ACAAAACAAG
128001 AGTTAAAAGC TTTCGAAGTC TATTTAAAGT TAGTGAGTTT ACCTAAATCG
128051 GATAGTTTGG ATGTTCCTAA TGAGATCCTT CCTATAGCGA CCATAAAAGC
128101 TAAAGAGCCT CGGTTAGTAG GCAGACGGTT TTCTATAGAC TATAAGAGAG
128151 TCGCTTTGCA AGACTTAGCA GCTACTGTAC CTATGGTTGA AGTGCTGCAC
95


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
128201 TGGCAACAAA ATTCTGAGCA CTTCCAGGAG ATTCTCCAGC AGTTTCCTGA
128251 CGTTGAGACG TGTCAGTCGT ATAAAGACTT CCAACATCTT AAGCCTGCGC
128301 TGCGAGATAA AATTTCTCTT TTCACACGCA AGGAAATCTT AAGGGCCCGC
128351 CCTGAGAGAA TTCTGCAATC GCTACAGCAA GTTCCTAAGC AGAGCCAAGA
128401 AGTTCTCTTA TCTGCAGGGA AGAATAGTGC TCTACCAGGA ATATCCGACG
128451 GTCAGCAATT AGCCAAAGTG TTGCTTGAAA ACGAGGTTTT AGATTTATAT
128501 AGCCAGGATG CAGAGACCTA TTATACTATT ATTGTTAATA GTTCTTTTGA
128551 AAAAGAAGAA GTGCTTCCTT ATCGTGAGGT TTTAAAGAGA GATTTGGCCT
128601 CACAGTTACT TACTTCTCAT GGTCATCTTG TTGACATGGA GCGTCTAGAA
128651 TCTGCGTTGC GTACACGGTA TCCAGGAGAA GAAGGCGCTA GCCTATGGCA
128701 ACGACGTCTT TGGAAGGTAG TGGAAAACCA CAGATTGGGA AGGCATCTCG
128751 AGGGGTCTTT CTCTTGGAGC TTAGATCGCT CATTGAAGAC TTTTTCCCGA
128801 GGAGACAAGG AGCTGCCCCA AGAGTTTGAT AGGATTTTCT CTATGAAGGT
128851 AGGAGACTAT TCTTCTGTAT TCATGAGTCC TAACGAAGGG CCCTGTTATT
128901 ATCAATGCCT CTCTCATTTA CTGTATGATC GTCCTGCTAG CGTGGATAAA
128951 CTATTTTTAG CTAAAAGTCA GCTAGATGAA GAACTTTTAG GATCCTATAT
129001 GGAACGCTTT ATAGAACAGG GAGTCGTAAG GTGATGTGGT ATTCTGATTA
129051 TCATGTTTGG ATTTTGCCCG TCCATGAGAG GGTGGTGCGC CTCGGGTTAA
129101 CAGAAAAAAT GCAGAAAAAT TTAGGAGCCA TTCTCCATGT GGATTTACCT
129151 TCAGTAGGGA GTCTATGTAA AGAAGGTGAG GTTTTAGTCA TTCTGGAATC
129201 TTCTAAATCT GCTATAGAGG TGTTAAGTCC TGTATCAGGA GAGGTTATCG
129251 ATATCAACCT TGATTTAGTG GATAATCCTC AGAAGATTAA CGAAGCTCCA
129301 GAAGGTGAGG GATGGTTGGC TGTAGTCCGA CTAGACCAGG ACTGGGATCC
129351 TTCTAATCTT TCTTTGATGG ATGAAGAGTA AATTTTTTAT TAGATATACT
129901 CATTTTTTTC AGAAGATAAG AGGTATTTTT TTAAGGCTAA AACATTTAAA
129451 ATTTATGTCT AAGGTTTAAA AAATACATCA GAATTATTCT ATGGATCCAG
129501 CTAGTCCGGT AGCCCCTCAT GTCCTACAAG ATCATGTGCA ACTATCTTCT
96


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/269Z3
129551 GAAGAATTGT CCGCATTATC TTCCGGGGTA TCTCGTGTGA AGAAGCTTAC
129601 TATAGCCATC ATGGTCCTTT CATTGATAGC GATTTCTTTG GTAGCCTGTG
129651 GCCTATTTTT AACGGGATCG GCACCTCTAC AGCTCTCGAT CTGGATTGCT
129701 GCGAGTTGCA TTACCTTATC TATGTTAGTT TGTGCGTGTT GGCGTTATAA
129751 GATTTCCAAT GCCTTAGAAA AAACTAAGGT AGCGCATGAA AGCTGAGTTG
129801 GACATTTTAT TGATTGAAAA ATCATGACTA CATTACCTAA GTACGTTCCC
129851 CGTTCTCGAC AAAATCCCGA TACTCTGACC TTCCTAAAAC GGTATTCTAG
129901 TGTCCTTCTC CATTCGGAGA ATTCTTTATC TTATCGGATT TTTGCGAAAG
129951 TGCTTGCTAT TCTCCTCACT TCGTTAGCTG TAGCTTTCGC CGTGACTTTG
130001 TTTTCTTGTG AAGGTTCTCA ACTGAGACTC TGCGCTCTCT ATATAGGTAT
130051 AGCTCTTGCT ATTTGTGTTT TACTGACGAT CGTTGTTTAT TGTATCGCAA
130101 GTAAAATCGC CACAGCTTGC AAAAAGCCGC CTTCCATATC TCGAATTGAA
130151 ATTGTTTAGA AGCATCTCTG TGTACAAAAG TTCACTAGAA ACTCGACTCT
130201 AGGAAAGTTC CTAGAGAGAG CGACGCTGCG TTCGTGCTTT AGAAATACTT
130251 GGCTGGTGTT TGGACGAAGA TTCTTTTAGT GGATTGGTCG TGTTTTCAAC
130301 AACTTCAGTC TCTAGAGGAG CTCTTTGAAT CTTTGCTGAA GGTTTAGAAA
130351 TATCAATACC TGTTAAGCTC ATAAAAGCCA TAGCAATGAG GCCTGTTGTA
130401 AfiGAAGGAGA TCCCCATTCC CTGGAGGTTT TTGGGAATAT CAGAGTAGGC
130451 GAGTTTTTCT TTGATAGTGG CTAAAATAAC AATAGCGAGC CACCACCCAC
130501 ATCCCGCTCC TAAAGAGAAG ATCATCATAG GAATAAAAGG ATAACTACGT
130551 GTGATTCCGA AGAGCACACC CCCTAGGATC GCGCAGTTCA CAGCAATCAA
130601 GGGAAGGAAG ATCCCTAAGG AGAGATATAG ATTCCTGGAG ACCTTTTCTA
130651 AAAGAAGCTC TAAGATTTGC GfiGAATGCCG CAATCACCAC GATGAAAATA
130701 ATCAGCTCCA GAAAACCTAG GTTTACAGAA GCTAAAGATG GAGAGATCCA
130751 AGTTAGAGCT TTAGGGCCCG TGATGAAAGC ATGGACAAAC CAGTTGATGC
130801 TCCCTGTTAC AGTGAGAACA AGGGCTACGG ACATCCCCAA GCCATTGGCT
130851 GTAGAAACCC TAGTAGAGCA AGCAAGGTAA CTACACATCC CCAAGAAATT
97


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
130901 CGCAAGAAGG ATATTCTGAA TAAAGGCTGC TTGTAGAAGA ATACCAAAGA
130951 CATTAAGCCA AGTATACGCA CCTAACCACA TAAACTACCT TTTTCTCTTT
131001 TTAGAGTCTC GAATGTTAAC AAGCCAAATC ATAATACCAA GTAGGAAAAA
131051 AGCCGACGGT GCTAGCACCA TAAGACTTAA ATTTTGGTAT CCATCGGGGT
131101 GGGTTTCGGA AGCATAAACA AATTGAGGGA TGATGCGAAA CCCCATAAGA
131151 GTTCCAAAAC CAAAGAGTTC TCTGATGACT CCAATGACAA GTAAGACCCA
131201 GCCGTATCCT AAGCCAGAGG CAAACCCATC TAAGAACGCT GGAATAGGAG
131251 TCACATGCCT AGCTAGACTT TCAGACCTTC CCATCACGAT GCAATTGGTG
131301 ATGATAAGAC CCACAAAAAC AGAAAGTGTT TTGGAAATAT CAAAGAAAAA
131351 AGCTTTTAAA AACTGGTCGA TAACAATCAC AAACAAGCTA ATGATAATTA
131401 GCTGAGTAAT CATTCTCACA CTGTCAGGAG TGAACTTACG TAATAAGGAA
131451 ACAAAGAAAG ACGAGCATCC TGTAACAATG CTGACAGCAA TTCCCATAGT
131501 AATTGCCGTT TGTACTGTTG TTGTCACTGC CAGAGCCGAG CAAATCCCCA
131551 AAATCGCAAT GAGAATTTGG TTGTTGCTCC ATAGAGGATC AAAGAAATAG
131601 CTTTTATAGG ACTTTTTACT TGTCATTCGC CTGTTTTCTT TTCATGGGTT
131651 AAATTAGAAA AATTTATAAG GAGCTGACGA TAGCAAGCCA GAGATTGTAC
131701 ATAAGCTTCA GTGACACCGT TGCATGTTAA GGTGGCTCCA GAAATCCCAT
131751 CAATAGCAGA AAGAGCTTTT GGAGAATCTC CCAAAGTAGT ACGCACGGAA
131801 CCTTTAACTA CCTCAAGCCC TAGGTCTGTT GTTGCAAAAT TTGTAGTTCC
131851 AGAAGAATCT TGTAGGAAGA TTTTCTTCCC ATAGAATTGC TCTTGCCATT
131901 CGGGATTTGT AATATTTGCT CCTAAACCTG GAGTTTCTCC TTGTTGGTAC
131951 CATGCGGTTC CCAATACAGT GTCACCGTCG TTTTTCACTC CTAGATAGCC
132001 ATGGATGGGG~CCCCAAAGGC CGAATCCTGA TATAGGGAAG ATCAAAGCTT
132051 GAACTGTAGA AAGGTCTTTC GCAACGTCGG CTCCTGACAT ATTTTCTGTG
132101 CGAGAGGTAT TCTCTAAAAT GACATAAAAG GGGAGGGGGG ATTGCTGACA
132151 CGGAGGGCTT TCTTGATATT TCTCAAF~AAA TTCAATGGGA TTCAGATTTT
132201 TTTCTTCAAA AGAAAATACC TTGCCTTGGG CATCTGTAAG TAGAGGACGG
98


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
132251 ACAAAGCGCT CGGCATACAG CTCTAATTCA GGATAGGAAA CCTCAGAGAC
132301 TTTTTTTGTA GCAACTTCAA GAAGTTGTGT TTTTTTATCG AAAGTCGCAG
132351 GCACCCACTC TTTTTTTTCC TGAATTTGAA ATCTTCCTTT AAAATCTAAA
132401 ATATGAGCAG CTAAAAGCAT TTGCTTATTG CGATCGAAAG TAGCAGCTTG
132451 TTCCTGTATT GGGGAGAGCA CATAGTAGAT TGTGGATAAC AGCACTCCTG
132501 CAAATAAGCT GAGGCCCAGG ATAAAGGAAA CGATGTACCA GGTTTGGTTT
132551 ATGCGGACGG TATGTTTTGA AGAGCCTTTA GACATATTCT AGACTCCCCT
132601 TTTTCTATAC TTTCTAACAG CAAAATAGTC GATAAGAGGG GCAAATACAT
132651 TGCCCAGAAG GATCGCTAAC ATCACTCCCT CAGGATACGC AGGATTGATA
132701 AGACGAATCA CAATAGTCAT AAATCCTATA AAGAATCCGT AAATCCATTT
132751 CCCTAATTTC ATAGTCGGCG ATGATACGGG ATCCGTAGCC ATAAAGACTA
132801 AACCAAAAGC AAGTCCTCCG AGGAAAAGCT GCCGATAGGC GGGAATGAAG
132851 AATCGAGCAG GTGCCCAAGC TCCGTTTTGT CCCACGATGA GTACGCTGAT
132901 AAACTTAAAG AGCCAGCCTG TGAGAAAGGC TCCTATCCCA AAGGCTGCCA
132951 TGGTTCTCCA AGAGGCAATG CCTGTAACAA TAAGGAATAT TGCACCCAAC
133001 AGACAGGCGA AAGTGGAGGT CTCCCCCAGA GAACCTATAA TGTTTCCCCA
133051 AAAGAGATTC CCAGCTGAGA ACTTCCCAAT CCCATAGATC ACATCGGTAA
133101 TAGCATAGGC AGAATCGAAC TGTGTGGGAA GCAGCCCCAA TCCTCCCTCA
133151 GCAACAGGAG CTGTAACAAA CGTTTGAAGT TGTGTAAGAG TGAGATTATC
133201 TAAAACCCAA CCAGGATGCG TCTCTGTCCA AAGAGAAAAT TGTGAGTGAA
133251 TGACATCTTG AGTAGGGACG TGAGGAATGT GAAGCATATT TGCAGCAATC
133301 GCATCGACAT GCAGACGCTT TACAGAGGGA GGTGTCGAAT TTAGAGTTTG
133351 TAGGCAGGTA GACTGTGAAA ATCCATCAAT GAGTACTTTT CCTGTCGAGG
133401 AGTTCATCTT CATGAGGCTA TCTTTAATCA CTCCGGGGTT GCTTCCTACC
133451 CAAACGTCAC CACTCATCTT TGCTGGAAAC GTAAAAAATA AGAATGCCCT
133501 TCCTGATAGA GCAGGATTGA GGATGTTCAT CCCTGTGCCT CCGAAGAGCT
133551 CTTTACTGAC AACAATACCA AAGGCGATCC CTAAGGCTGC CATCCAGTAA
99


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
133601 GGAATTGTCG GAGGGAGAGT AAGGGGATAG AGGATTCCGG TTACTAGCAG
133651 TCCTTCTGCG ATTTTATGCC CACGAACTAC AGCAAATAGG ACCTCACAAG
133701 TACCCCCGAC AACATAGCTA ATCGTAAGTA GAGGAATAAA GATCTTAAGT
133751 CCTTCCCAAA GGATAGGAAC TATATGGATC TCTTTGTAAA CAAAGGATAA
133801 ATAACTACCA AATCCAGAAA TATGTAAGAA TTGCTCCATC AGCACAGGAT
133851 TGCCTGAGCT ATAAACGATA GATTGAAGTC CTGAATTCCA GATCGCAACA
133901 AAGGTCGCGG GAAACAAAGC GATAACAACA AGCATCATCC AACGCTTAAC
133951 ATCTACAGAA TCGCGGATGA AAGGAGGCTT GGAAGGGGTT TCAATAGGTT
134001 CGTAACAAAA TGTATCTATC GCATCGACAA TGGGAGTAAA GCGCTGATAC
134051 TTGTCTTGTT GACATAGTTT CCAAAGAGAA TTTATGAATT TTTTGAGCAT
134101 TGTGATTGAG AGAAAAGTTG AAGCTTCTAC ACGTTCAAAA ACGTAGCAAA
134151 CTGCTTAAAA TTTTAGGAAT AAAAATTTTC ATGATTCAAA AATAGATGGT
134201 ACTTTTTTCG TTGCTGTTTC CAAAGTTATG TTATGGCTGT CAAGCTCCAG
134251 GAGCCTACTT TTGTTCCAAC TGCTTGGAAA AACTTCTCGT AGAAGATAGA
134301 GAAGGGCGTT GTCTACATTG TTTTCGTTAT CTTGGTTCTT CCGAAACACG
134351 TCTATGTAGC CAGTGTTCAC CCTCTTCACA ACTTCAAGCT TTCAGCTTGT
134401 ACCTTCCTTC GCAAACGGCC CTCTCGGTAT ATGCTCGTGC TTGTGAAGGT
134451 AAGCGACCCG CTCTGCAGTT TTTTTCTAAG AGTATCGCCT TTGAGCTAGC
134501 TTCACTGGAT GAGACTCCGA GTTGTATTGC CTATATAACA TCGACAATTT
134551 CTAGGAAAAT CGTAGTAGAA GTTGCTAAAC TAGAAAAGCT TTTACGCATT
134601 CCCTTGTGGC CGTGGCTTCC TAAGAAAAGA CAAATAGAAA AACTTCCTAA
134651 AGGGGAAGGT ATCTGCTTTT TGTCGGCCTA TCCTTTATCA CAAAAATGGA
134701 TGCAAACTAT CGTTGGAGGG AGTGCATCAC CTCTAGTATC TATAAGTCTC
134751 TTTCTCTCTC AGAATGATCA GTAATTCCTG CAATTGCAAG GTAACCAAGA
134801 ATACGTACGC CCTCATCAAC ACACAACGTG TTCGCTTGGA TGTCTCCTTT
134851 AATGATTGCG CCTCCACGGA GTTCGACTTT TCCAGATACT GTGATATTTC
134901 CTTCTACAAC CCCTTCAATA ATGGCTTCTT GTAGCTGAAT ATCTGCCTTT
100

CA 02350775 2001-05-11
WO OOI27994 PCTNS99/26923
134951 ACCACTCCTT TAGGACCGAT AATAATTTTT CCTTTTGAGA CTAAAATGCC
135001 TTCAAAAGTT CCGTCAATAC GTAGGAGACG TTCAAAAGCA AGTTCTCCTT
135051 TAAAGGTGAC GCCTTCTCCT AAGGTAGTTT CAGGTTCTTC AAGAGGGAGT
135101 AGAGATTCTG TTCTTGGAGT TGAGGACCAT TGAGGAAGAG AAGATTCTTC
135151 AGTTAAATTG TGATTCAAAG GGCGAGCTTC CGAAGCTTTA GGGTTGTCAA
135201 AAAGACTTGG AGGGGTCTCT GGGCGCTCGG ATCTTGAATA TGGCGAGTAG
135251 CTGGAAGGTG AAGAAGTTTC TTCTTCGTAA AGTGTTTGCA CATCTTCAAA
135301 AGGACCTTTT CCTGTTCTAC GGAACATGGG ACACCCCCTA AATTAACCAA
135351 CAATATGATT TTTACAGAGA TTTACTTGAC GCTTAAGAGT TTCGTCATTT
135401 TGTAATTTAT GTTCTATAGT TTTACAGGCA TAAAGTACTG TCGAATGAGT
135451 TTTACCAAAA GCAGCTCCTA TTGCAACTAA AGAATCTGTA ATAAGAGTTT
135501 TTGCTAAATA CATAGCAATT TGCCGAGCTA ACACAAGATC TTTAGAGCGT
135551 GAGTTTCCCT TAAGATCATT CAGCTTTACT TGGAATACTG TAGCAACACT
135601 TTTTAAGATC GTTTCTACAG AAATTTTTTG TTTTGTTGGA GAACGGAAGA
135651 GCTCTTTTAG AGTTTCTCGG ACTGTAGTTT CTGTAAGAGA CTTGCCGAAA
135701 AGACGACAAT AGGCAGTCAG CTTGTTGATA GCTCCTTCCA ATTGACGGAC
135751 ATTGCCATAG ATGTGATCCG CAATATAAAA TGCCATTTCA TTAGGAATGA
135801 GCAATCCTTT TTGCTCCGCC TTGTGCTGTA AAATCGCAAC CCGAGTTTCT
135851 AAATCAGGGA TGCCGACGTG AGCAACCAGT CCCCATTCCA TTCTAGCAAT
135901 GATACGCTCG GAAAGTTTGA GCTGACTTGG AGGTTTATCA CTGGTAATTA
135951 CAATTTGCTT ACTCAGGTTG ATCAAAGTCT CAAAGGTATT GCAAAACTCT
136001 TCTTCAAAAT TTTGGCGATT CTGTAAAAAT TGAATATCAT CAACAAGAAG
136051 TAAATCTAGG GAACGATAAA AATTTTTCAT TTTATCAACA GACTTGGATT
136101 TGAGATGGTA GACAAGATCG TTGATAAACG CTTCTGTAGT GATGCAATGG
136151 ATGCGTAGAT TTTTATGATG TTCTCTTACG TAGTGACCTA CGGCATGAAG
136201 TAAATGCGTT TTGCCTAATC CCACACCCCC ATGGATGAAT AAAGGGTTGT
136251 AGGAGCGGCC AGGTTTCCCA GCAATACCTA CAGCTGCAGA CTTCACAAAT
101


CA 02350775 2001-05-11
WO 00/27994 PCTNS99126923
136301 TGATTTGAGG GACCTTCAAT GAAATTATCA AAGCGATAGG AGAGATTCAG
136351 CTTTAATTCA AAATCTTTAG TTTCTTCAAA GACCTCAGAA ATTCCTTCGT
136401 TTGATTCTTT TTGAGAAGCC ACGGGGGCTG AAGGTTTCTT GTGTTCTGCA
136451 ACTACAAATT CTAAAGCAGG CTCTCCATGA ACATCTAAGG GGACAAAAGA
136501 ACAGAGGTCT CTTTTGTAGT TATCAAGAAG ATAATTTTGT ACAAAAATGT
136551 TGGGGACTTC TAAGCGAATT TTCTCTTGAG TTTCTTCAAG AACTTGAATA
136601 GGAGAAATCC AATTTTCAAA AGCCGTTTTC GAGCAACGTG TCTTAACATA
136651 ATTTAAAAAC TGTTCCCAAG TAGTGCACTC GTTACAGGTT AACATGCCGC
136701 TCTCTTTATT TATAAAGCTT TCCCAAATAC AATCGACCCA TCCCATGAGT
136751 GATGGCGAGA AAATCTCATT GCATCTTGAC TATTCATTGC ACTGAGTGCA
136801 ATGACTCTTC GCGTTCAGAT TGGATCCTGC ATTCCCTGGG TGAAGTTCCT
136851 GTTAGGAATG GATCCTATAC ACCCTTCCTC CGCAGGTAAA CGCGGTACGC
136901 TCTGCCGATA AAATAACTCT ACCGATTACT AGGTTTTAAG GCAAATTGGA
136951 TCGTTGGTTT CGTTAGGCAA TAAGGAACCA CAAATTCAGG AAAAAATAAT
137001 TATGAAATTT TGTAATAAAA ATGGAAAAAG AACTAAAGAA ATCCGGAGTT
137051 CTTCAATCAC GAAATACGTT TTCTATAGGA GAAAAAATTA ACGAACTAAC
137101 GCAGCATTTT TTTTGGTTGC TTTACTATAA CTCATCAATA GAGCTTCAGC
137151 ATCATTAGCA ATTGCTGGTA TGGGACAGGA TGAAAGGTAG GTGGCAATGG
137201 CAGTAGCTTC TTCAATTCGT CCCAAACAGA AGAGAGCTTT TGTTTTATTT
137251 AAGAGTGTAG GCAGATGATC TCCTTGCATG CGGAGTGCCT GATCTAAAAC
137301 AGCAAGCGCC TGACTATTTT CACCAATTTG GAGATAAAGA CCTCCAAGAG
137351 TTTGATGATC ATAGATACTT AAAGGATCTA AGATCACTAG AGCTTCAAAA
137901 AAAAGAATCG CTTTTGAATA ATGCCCTTGG CGTAGAAAAG AATATCCTGA
137951 GATTCTGAGT TCTTCTAACT CATCATCTCC CCAGCCTAAG ATTGCTTTCC
13?501 ATTCATTATC CAACATACAT TATCCTTGAA CAAATTGAAA GATACGAGAG
137551 ATCACATAGT CTATCATCAC ATTTGTAGAC TTGACTCCAA GATCTAGAGC
137601 TTTTAAGAGC ACTTTCCCTT GTTGTACTCT AGGATCGTCA TCGTCTTCTT
102


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
137651 CAGGATCTTC GTCTTCCTCT TCTTCTTTAT CTTTGTAAGA CAAAAAGGGA
137701 GTAATTTGAC TGCGATAGGA AAACTTCCCT CGAGTGAGAA CTTTTAAAAA
137751 TGAGGAGATT TTTTCTATGT CTTCATCTTG TTGGTCTGGA GATCCTAAAG
137801 AAGGTGCCAG GTAGGGTGTG GAAAAACGCT GTTTGTAAAA ATTATTTGGA
137851 GGGGAAAAAC ATGCCCAGTG CGACTTTTGA TTTGTCTGTA AAAGAGACGT
137901 CAAAGCCGAA GGCTTGGGGT TCATATCCAA AATTTGCGCA TGCTTGGCAA
137951 CATCACGAAT GGAGATGCCT TCCATCTGGA TTTCTTTGCG AAAGTCGCTG
138001 ACTATCCTAT TATTGGAAGC ATGTTGCTCA TATATAGACG TGCTATAATT
138051 AAAAATTTCT ACCATGGCGA GGCCTTAAAA AACCGTCTCT ATCTCTAGAT
138101 GATAGTGCGG TCTAGGAAAA AGTCAATAGT CTTGATCAGA AGCCTGAACC
138151 TTTTCTCCTT ATCGAAGGCA CTTAAGAAAA AAGCTCACCC CTATCAAAAA
138201 ATTTAGAGTG GCGAACTAGC GAGAATTTAA GATAAGGGAA GGCTTTCTTC
138251 TTTCTATGGA ATCCTGTATC TTTGTCCTTA TTTGAAGCCA AGAGCTTAGG
138301 AAATTCTTAT GTCCGAACGT GCGCATATTC CCGTATTAGT TGAAGAATGT
138351 TTAGCTTTAT TTGCTCAACG TCCTCCACAG ACTTTTCGAG ATGTCACCTT
138401 AGGAGCTGGA GGACATGCGT ATGCTTTTCT TGAGGCGTAT CCCTCTCTAA
138451 CTTGTTATGA TGGCTCCGAT CGAGATCTTC AGGCTTTGGC AATTGCAGAA
138501 AAACGTTTGG AGACCTTTCA AGATAGAGTC TCCTTTTCCC ACGCCTCTTT
138551 TGAAGATCTT GCGAACCAAC CCACTCCACG TCTTTATGAC GGAGTTCTTG
138601 CAGATTTAGG AGTCTCTTCT ATGCAGCTGG ATACTCTATC CCGAGGGTTT
138651 AGCTTTCAAG GGGAAAAAGA AGAGTTGGAT ATGCGTATGG ATCAAACGCA
138701 AGAGCTTTCC GCTAGCGATG TCCTGAACTC CCTAAAAGAA GAAGAACTAG
138751 GGAGAATTTT TCGTGAATAT GGAGAGGAAC CACAATGGAA ATCTGCAGCT
138801 AAAGCTGTTG TCCATTTTCG TAAGCATAAA AAAATTCTTT CGATCCAGGA
138851 TGTAAAAGAA GCTCTTCTTG GCGTTTTCCC TCACTATCGT TTTCATAGAA
138901 AAATACATCC ACTCACCTTG ATTTTTCAAG CTCTACGTGT TTATGTGAAT
138951 GGAGAGGATA GACAATTGAA AAGTTTACTA ACATCTGCTA TATCTTGGCT
103


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
139001 GGCTCCTCAG GGACGGCTTG TCATTATTTC TTTTTGTAGC TCTGAGGATC
139051 GTCCTGTGAA GTGGTTTTTT AAAGAGGCGG AAGCTTCTGG CCTGGGGAAG
139101 GTAATCACAA AGAAAGTGAT CCAACCTACC TACCAAGAAG TACGAAGAAA
139151 TCCTAGATCG AGATCAGCAA AACTACGGTG TTTTGAAAAA GCTTCCCAAT
139201 GAACAAAAGT CGTTTTTTAC GTTTATGCTG CTGTCTATGC TTTTGTGGAA
139251 GTCTCTTTTA TTTCTATATT AATAAGCAGA ACTCGCTGAC GAAATTACGC
139301 CTCGAAATTC CTTGTTTATC TGTACGCTTG CGTCAGCTTG AGCAGCAAAA
139351 TATTTCTTTA CGTTTTTTAA TTGATAAAAT AGAAAGACCT GATCATTTGA
139401 TGGAAATAGC AGCTCTTCCC GAATACCAAT ATTTGGAATA TCCCTCAGAA
139451 GAAAGTATCA GTCTTTTATC CTATGAGCTA CCGTAAACGT TCGACTCTAA
139502 TTGTTCTAGG AGTGTTTGCT CTTTATGCTC TTCTAGTATT GCGTTATTAT
139551 AAAATTCAAA TTTGTGAAGG AGACCACTGG GCCGCAGAAG CTCTCGGGCA
139601 ACACGAATTT TGTGTCCGTG ATCCTTTTCG AAGGGGCACC TTTTTTGCTA
139651 ACACGACAGT ACGTAAGGGA GACAAAGACC TTCAGCAGCC TTTCGCTGTC
139701 GATATTACAA AATTTCACCT TTGTGCAGAT CCTTTAGCTA TTCCCGAATG
139751 TCATCGTGAT GAGATCATCC AAGGGATTCT CCAATTTATT GAGGGGCAGA
139801 CCTACGACGA CCTCTCCCTA AAGTTAGATA AGAAATCTCG GTATTGTAAG
139851 CTGTATCCTT TATTAGATGT TTCTGTCCAT GACCGGCTAT CCCTTTGGTG
139901 GAAAGGATAT GCAACAAAGC ATCGCTTACC AACAAACGCC CTATTTTTTA
139951 TTACGGACTA CCAACGCTCG TATCCTTTTG GGAAGCTCCT TGGACAAGTT
140001 CTCCATACCT TAAGAGAAAT TAAGGATGAG AAAACAGGAA AAGCCTTTCC
140051 CACAGGCGGG ATGGAGGCGT ACTTTAATCA TATTCTGGAA GGGGACGTTG
140101 GAGAGAGAAA~GCTGTTGCGT TCTCCTTTGA ACCGTTTAGA TACGAATCGT
140151 GTTATCAAAC TGCCTAAAGA TGGCTCTGAT ATCTACCTTA CGATCAATCC
140201 TGTGATCCAG ACCATTGCAG AGGAAGAACT CGAACGGGGC GTGCTAGAAG
190251 CTAAAGCCCA GGGGGGTAGG CTCATTCTAA TGAACTCCCA AACAGGAGAG
140301 ATTCTTGCAC TGGCTCAATA TCCGTTTTTC GATCCCACAA ATTATAAGGA
104


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
140351 ATACTTCAAT AACAAAGAGC GCATCGAACA TACGAAGGTA TCTTTTGTGA
140401 GCGATGTTTT TGAACCCGGG TCGATCATGA AACCTTTGAC TGTGGCGATT
140951 GCTTTACAAG CTAACGAAGA GGCTAGCTTA AAATCGCAGA AAAAGATTTT
190501 TGATCCTGAA GAACCTATCG ATGTGACCAG GACACTCTTC CCTGGACGAA
140551 AAGGATCTCC GCTTAAGGAT ATTTCTAGAA ACTCTCAATT GAATATGTAC
140601 ATGGCTATCC AGAAATCTTC GAATGTCTAT GTAGCTCAGC TGGCTGACCG
140651 CATCATACAA TCTTTAGGAG TGGCCTGGTA CCAACAGAAG TTGCTAGCTC
140701 TGGGATTTGG AAGAAAAACA GGGATCGAGC TTCCCAGTGA GGCCTCTGGT
140751 TTGGTGCCTT CTCCCCATCG TTTCCATATT AATGGTTCCC TGGAATGGTC
140801 CTTATCTACT CCATATTCTT~TGGCTATGGG ATATAATATT TTGGCAACAG
140851 GGATACAAAT GGTTCAAGCC TACGCTATCC TTGCAAACGG AGGTTATGCC
140901 GTCCGGCCCA CTTTAGTAAA AAAGATCGTC TCTGCTTCAG GAGAGGAATA
140951 TCATCTTCCT ACTAAAGAGA AGACACGACT CTTTTCAGAA GAAATTACTA
191001 GAGAAGTTGT TCGTGCCATG CGTTTTACAA CGTTACCCGG AGGTTCGGGA
141051 TTTCGAGCCT CTCCTAAGCA TCACTCTAGT GCTGGGAAAA CAGGAACTAC
141101 AGAAAAGATG ATTCATGGAA AATATGATAA ACGCCGTCAT ATTGCTTCTT
141151 TTATAGGTTT TACTCCCGTA GAGAGCTCGG AGGGAAATTT CCCACCTTTA
141201 GTGATGCTCG TCTCCATAGA TGATCCTGAA TATGGTTTGC GAGCCGACGG
141251 CACGAAAAAT TATATGGGGG GGCGTTGTGC GGCACCCATT TTTTCTAGGG
141301 TTGCTGACCG CACACTCCTC TATTTAGGGA TTCTTCCAGA CAAGAAGCTA
141351 AGAAATTGCG ACGAAGAAGC TGCTGCATTA AAGCGTCTCT ATGAAGAATG
141401 GAATCGTTCT CCGAAACAAG GGGGAACGAG GTGAGGATCT CTATTTCCAT
141451 CTTGCTATAG ACTTTTACCG TTGAGCAAAG ACTCTCTATC AGAGAGCCCG
141501 TCTCCTCTTT ATCCTCTATG AGTAGTTTAT GTTATGGCTA GGGTAGGTCC
141551 TAAACTATAG AAATAACTTT AGCTTTCTTC CCCTAAATAA GAGACCAAAG
141601 TCTTGATGAG ACGGTCTATT GAAGTTTATG GAAGGGGGAG GTAAGGCTGT
141651 GTGTTTGGGG ATTTAGATTT GGGATAAAGG AGGCTTCTGT TCGTAGAAAC
105


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
141701 AGGAGAGCGA AATTTTATAT TTCAGAGAAG AGTAAGAACT TTATGGACAG
141751 TTTTTTGTGA TTGCTTGTAT ACTATCTTGA TTGAATTTTT TGTCGACCTA
141801 CGAGTAAAGA AATCCTTTAA GCATTTTTTA AAAATCAGAG TGAGAGCATG
141851 CCCCTAGAGG GCTTTTTATG AAAAAAGTTG TTTTTCAATA GTCCCTGGAG
141901 CGTAAATGGA TTTAAAAGAG TTACTCCATG GGGTTCAAGC TAAAATCTAT
141951 GGGAAAGTTC GCCCTCTTGA AGTGCGCAAC TTGACACGTG ATTCCCGTTG
142001 TGTGAGTGTT GGCGACATTT TTATAGCCCA TAAGGGACAG CGCTACGACG
142051 GAAATGATTT TGCTGTCGAT GCTTTAGCTA ATGGAGCAAT TGCCATTGCT
142101 TCTTCACTAT ACAATCCGTT TCTTTCCGTT GTTCAGATCA TCACTCCTAA
142151 TCTCGAAGAA TTAGAGGCTG AGCTTTCTGC AAAGTATTAC GAATACCCTT
142201 CAAGTAAGCT CCATACCATT GGGGTGACTG GAACCAATGG GAAAACTACA
142251 GTTACATGTT TGATTAAAGC TTTATTGGAT AGCTATCAAA AACCTTCAGG
142301 GCTTTTAGGA ACCATAGAGC ATATCTTAGG AGAGGGGGTG ATTAAAGATG
142351 GGTTTACTAC ACCTACACCC GCTCTTTTAC AGAAGTATTT AGCCACTATG
142401 GTACGTCAAA ATAGAGACGC TGTTGTTATG GAAGTCTCTT CTATAGGACT
142451 TGCCTCTGGA AGAGTAGCCT ATACCAATTT TGATACAGCA GTTCTGACTA
142501 ATATTACCTT AGATCATCTC GATTTTCATG GCACATTTGA AACCTATGTT
142551 GCGGCGAAAG CCAAGCTTTT CTCTCTCGTG CCCCCTTCGG GAATGGTTGT
142601 TATCAACACA GACTCTCCCT ACGCTTCTCA GTGTATTGAG AGTGCAAAGG
142651 CACCGGTCAT CACTTATGGT ATAGAGAGTG CTGCTGACTA CCGAGCCACC
142701 GATATCCAAC TTTCTTCCTC GGGAACAAAG TATACCTTGG TGTACGGGGA
142751 CCAAAAAATT GCGTGCTCTT CCTCATTTAT TGGAAAGTAC AACGTCTATA
142801 ACCTACTTGC TGCGATCTCT ACAGTACATG CAAGTTTGCG TTGCGATCTT
142851 GAAGATTTGC TAGAAAAGAT AGGCTTGTGT CAACCTCCTC CAGGTCGTTT
142901 GGATCCTGTA CTTATGGGTC CCTGCCCTGT ATATATTGAT TATGCACACA
142951 CCCCCGATGC TTTAGACAAT GTCTTAACAG GATTGCATGA GTTACTTCCT
193001 GAGGGGGGAA GACTGATTGT TGTTTTTGGT TGCGGTGGAG ATAGAGATCG
106


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
143051 CAGTAAACGG AAGTTGATGG CCCAGGTGGT AGAGCGTTAT GGTTTTGCTG
143101 TTGTAACTTC AGATAACCCT AGGAGCGAGC CTCCTGAAGA TATTGTGAAT
143151 GAAATTTGTG ATGGGTTTTA TTCAAAAAAC TATTTCATCG AAATCGACAG
143201 AAAACAAGCA ATTACATATG CTCTGTCTAT TGCCTCAGAT AGAGATATAG
143251 TGTTAATAGC GGGAAAAGGG CATGAAGCTT ACCAAATATT TAAACACCAA
143301 ACAGTTGCGT TCGATGATAA GCAGACTGTT TGTGAGGTAC TCGCTTCCTA
143351 TGTCTAAGCA ACTGTCGTTT TTTGCTTTAT GTGTGTTAGG AAGTCACCCG
143401 ATTTTTGCTC AAACACCGAA TCCTCCTCAG CGTGTACGAC GCAGTGAGGT
143451 TATATTTATA GATCCTGGAC ACGGGGGAAA AGATCAAGGC ACGGCAAGTA
143501 AGGAACTTCA TTATGAAGAG AAGTCCCTGA CCCTGTCTCT TGCTTTGACG
143551 GTTCAAAGTT ACTTAAAGCG GATGGGTTAT AAACCTCAGC TAACCCGATC
143601 TTCTGATGTA TACGTTGACT TAGGGAAACG CGTTGCTTTG TCGAACCGTG
143651 GGCAGGGGGA TGTCTTTATC AGCATCCACT GTAATCATTC TTCAAACGCA
143701 GCAGCCTTTG GCACCGAAGT ATATTTTTAT AATGGTAAGG TCGGATCTCC
143751 GACTAGGAAT CGCATGTCAG AAGTACTGGG P.AAAAACATT TTAGCTGCTA
143801 TGGAAAAAAA TGGCATTTTG AAGTCTCGAG GTTTGAAAAC TGCGAACTTT
143851 GTTGTGATTA GAGATACTTC TATGCCTGCA GTTTTGGTGG AAACCGGGTT
143901 TTTATCCAAT AGTCGTGAAC GTGCGGCCCT GCAAGATGCT CGCTATCGTA
143951 TGCATGTAGC GAAAGGCATC GCCGAGGGAG TTCATAATTT TCTTTCTGGA
144001 CCTAGTTTTC AGAAACCAAA ACAGAATATC GCTAAAATAC GTAAACCACA
144051 GATACAAGCA AATTAGTACT TTAGGAGTTA AAGGCAAAAA ATCGTCCTCG
144101 ATGCGATTCG AACGCATGGC CTGCTGCTTA GGAGGCAACC GCTCTATCCT
144151 GCTGAGCTAC GAGGACGCAA AGACCAGCAC TTTACCAAGT TAGATTAAAG
144201 AAGTCACGTG TTAGATGCAC GCTAGCAATT TAGGGGAAGT TTTTCTCAAG
144251 ATGTGGGAAT GATTTTTCTA GGTTCTAGAA ATATAGTTAT TCGCATTAAT
144301 CGATATGGTT TATAGTGATT GCGCATTTTT TAAAAAATGT CTTGAATCCA
144351 AAGGATGAAT AGATATGATG AGCTTAGACT TCAAAAGTTT ATTATTTACG
107


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
144401 GTTACTCAAC GAGCTACTTT AGGGCACTTT AATAGGAGGC ATTGTCTAAT
144451 ATGGCTACCA TGACAAAGAA GAAACTAATC AGCACGATCT CACAAGATCA
144501 CAAAATTCAT CCTAATCACG TACGTACCGT GATTCAGAAT TTTCTAGATA
144551 AAATGACCGA CGCCTTGGTT AAAGGTGACA GGCTTGAGTT TAGAGATTTT
144601 GGTGTGTTGC AAGTAGTAGA AAGAAAACCA AAGGTAGGAC GTAATCCTAA
144651 GAATGCAGCA GTCCCCATTC ATATTCCTGC TAGACGCGCT GTAAAGTTTA
144701 CTCCAGGGAA AAGAATGAAG CGCTTGATAG AAACTCCGAA TAAGCATTCT
149751 TAATTCTTGT AGTCTTCTTT GTCTCAGTTG TTAGAGTCAG ACCGGTTTTT
144801 TACCGGGCTT GACTCTAATT TTTGTTATTA TTATCGTTTG GTGCAATGCT
144851 TTTCTGATCA AATTGTGCGT GATAATGGGG CTGCAATCCA GGTTACAACA
144901 TTGTATAGAA GTGTCCCAGA ATTCGAACTT TGATTCACAA GTAAAACAGT
144951 TTATCTATGC GTGCCAAGAT AAGACATTAA GGCAGTCTGT ACTCAAGATT
145001 TTCCGCTACC ATCCTTTACT AAAAATTCAT GATATTGCTC GGGCCGTCTA
145051 TCTTTTGATG GCCTTAGAAG AAGGCGAGGA TTTAGGCTTA AGCTTTTTAA
145101 ATGTACAGCA GTACCCTTCA GGTGCTGTAG AACTGTTTTC TTGTGGGGGA
145151 TTTCCTTGGA AAGGATTACC TTATCCTGCA GAACATGCGG AATTTGGCCT
145201 ACTCCTGTTA CAGATCGCAG AGTTTTATGA AGAGAGTCAG GCATACGTCT
145251 CTAAAATGAG TCATTTTCAA CAGGCACTCT TTGATCACCA AGGGAGCGTC
145301 TTTCCCTCTC TCTGGAGCCA GGAGAACTCT CGACTCCTAA AAGAAAAGAC
145351 AACTCTTAGC CAATCGTTTC TCTTCCAATT AGGAATGCAA ATTCACCCAG
145401 AATACAGTCT TGAGGATCCT GCACTAGGGT TCTGGATGCA AAGAACGCGT
145451 TCTTCATCCG CTTTTGTAGC CGCTTCAGGA TGTCAAAGTA GCTTGGGAGC
145501 GTATTCCTCA GGGGATGTCG GTGTTATCGC TTATGGACCT TGCTCTGGAG
145551 ACATTAGTGA TTGTTATTAT TTTGGATGTT GTGGAATCGC TAAAGAGTTC
145601 GTGTGCCAAA AATCTCACCA AACTACAGAG ATTTCTTTTC TCACCTCTAC
145651 AGGAAAGCCT CATCCCAGAA ATACGGGATT TTCCTACCTT CGAGATTCCT
145701 ATGTACATCT GCCGATCCGC TGTAAGATCA CTATTTCCGA CAAGCAATAT
108

CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
145751 CGCGTGCACG CTGCGTTGGC TGAGGCCACC TCTGCCATGA CGTTTTCTAT
145801 TTTCTGTAAG GGGAAGAATT GTCAGGTTGT TGACGGCCCT CGCTTGCGCT
145851 CCTGTTCCCT AGATTCTTAT AAAGGTCCCG GAAACGACAT TATGATTCTT
145901 GGGGAAAATG ACGCAATCAA CATTGTTTCT GCAAGTCCCT ATATGGAAAT
145951 TTTTGCTTTG CAAGGCAAAG AAAAATTTTG GAATGCAGAC TTTTTGATTA
146001 ATATTCCTTA CAAAGAAGAG GGCGTCATGT TAATTTTTGA AAAAAAAGTG
146051 ACCTCTGAGA AAGGAAGATT CTTTACGAAG ATGAATTAAT TTTGGGTCTG
146101 TAATTGTGTT TAAGAATTGT TTGTATTAAA ATGATTCTTT TTATACGAGG
146151 AGAGCACATT CTAATGGAAC TTCTTCCACA CGAAAAACAA GTAGTTGAAT
146201 ATGAAAAGGC TATAGCCGAA TTTAAAGAAA AAAATAAGAA AAATTCTCTC
146251 TTATCTTCTT CAGAGATTCA GAAATTGGAA AAGCGTTTAG ATAAATTAAA
146301 AGAAAAGATC TATTCGGATT TGACTCCTTG GGAGCGTGTA CAAATATGTC
146351 GCCACCCTTC GCGTCCCCGT ACTGTCAACT ATATTGAAGG GATGTGTGAG
146401 GAGTTTGTCG AGCTTTGTGG AGATCGCACC TTCCGAGATG ATCCCGCAGT
146951 TGTTGGTGGC TTTGTAAAAA TCCAGGGTCA GCGTTTTGTC CTTATTGGCC
146501 AAGAAAAGGG ATGCGATACA GCGTCACGCC TTCATAGGAA CTTCGGTATG
146551 TTATGTCCCG AGGGTTTCAG AAAAGCCCTT CGCTTAGGAA AACTCGCTGA
146601 AAAGTTTGGC TTGCCTGTGG TCTTTCTTGT CGATACCCCA GGAGCATATC
146651 CTGGATTGAC TGCTGAAGAG AGAGGACAAG GATGGGCAAT TGCCAAAAAT
146701 CTTTTTGAGC TCTCAAGACT TGCCACTCCC GTGATTATTG TCGTTATCGG
146751 TGAGGGATGT TCAGGTGGAG CTTTGGGCAT GGCTGTAGG'T GATTCTGTAG
146801 CTATGTTAGA GCATTCCTAT TATTCTGTAA TTTCCCCAGA AGGATGCGCC
146851 TCCATTCTTT GGAAAGATCC TAAGAAAAAT AGCGAAGCAG CTTCCATGTT
146901 GAAAATGCAT GGAGAAAACT TAAAACAATT TGGCATTATC GATACTGTTA
146951 TCAAAGAGCC CATTGGGGGA GCTCACCACG ATCCTGCATT GGTATATAGC
147001 AATGTTCGAG AGTTTATCAT CCAAGAGTGG TTACGATTAA AAGATCTAGC
147051 TATAGAAGAG CTGTTGGAGA AACGGTACGA AAAATTTCGC TCTATAGGTC
109


CA 02350775 2001-05-11
WO 00/Z7994 PCT/US99/26923
147101 TTTATGAAAC TACTTCTGAA AGCGGTCCTG AGGCATAAAA ATCATCTCGT
147151 TATATTAGGC TGTTCTCTAC TCGCAATTTT AGGACTTACC TTTTCATCTC
147201 AGATGGAGAT TTTTTCTTTA GGGATGATTG CTAAAACAGG CCCCGACGCC
147251 TTTTTACTTT TTGGACGTAA GGAATCTGGA AAACTTGTAA AGGTTTCAGA
147301 ACTAAGTCAG AAAGATATTT TAGAGAATTG GCAGGCAATT AGTAAGGATT
147351 CAGAGACACT TACAGTCTCT GATGCCACGA CATACATCGC CGAACATGGG
147401 AAAAGCACAG CCTCTCTGAC GAGCAAGCTC TCTAAGTTTG TCCGTAACTA
147451 CATCGATGTG AGCCGCTTTC GAGGACTGGC AATCTTCTTA ATCTGCGTTG
147501 CTATTTTTAA AGCAGTCACC TTATTTTTCC AACGTTTCCT TGGGCAAGTC
147551 GTTGCTATAC GGGTAAGCCG AGACTTACGT CAGGACTACT TTAAGGCCCT
147601 ACAACAACTC CCCATGACCT TCTTCCATGA TCATGATATC GGTAATTTAA
147651 GTAATCGTGT CATGACAGAT TCTGCAAGCA TTGCCTTAGC AGTAAACTCT
147701 TTAATGATTA ACTACATTCA AGCCCCAATT ACCTTCATAT TGACATTGGG
147751 AGTCTGTCTG TCGATTTCAT GGAAGTTTTC AATTCTTATT TGTGTTGCCT
147801 TTCCTATCTT TATCCTTCCC ATTGTCGTGA TCGCTAGAAA GATCAAAAAT
147851 TTAGCAAAAC GTATTCAAAA GAGTCAGGAT TCATTTTCCT CCGTTCTTTA
147901 TGATTTTCTT GCTGGGGTTA TGACAGTAAA AGTCTTTCGT ACAGAAAAAT
147951 TTGCCTTCAC AAAATATTGT GAGCATAACA ATAAGATTTC TGCTTTAGAG
148001 GAGAAAAGTG CTGCTTACGG TTTGCTTCCA CGACCCCTCC TGCATACCAT
148051 AGCTTCTTTA TTTTTTGCTT TTGTCGTCGT TATCGGAATT TATAAATTTG
148101 CTATTCCTCC CGAAGAACTT ATCGTATTTT GTGGTTTGCT CTACCTAATC
148151 TACGACCCTA TTAAGAAGTT CGGGGATGAA AATACCTCCA TCATGAGGGG
148201 ATGTGCTGCT~GCGGAGAGAT TTTATGAAGT CTTGAATCAC CCCGATCTTC
148251 ATAGTCAAAA AGAAAGAGAA ATCGAGTTCC TTGGACTTTC TAATACAATC
148301 ACATTCGAGA ATGTTTCCTT CGGCTATCAG GAAGATAAGC ACATCCTCAA
148351 AAATCTAAGC TTTACCTTAC ATAAAGGCGA AGCTCTAGGC ATTGTAGGAC
148401 CTACAGGATC TGGAAAAACA ACACTTGTTA AATTACTTCC TAGGCTCTAC
110

CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
148451 GAAGTCTCCC AAGGAAAGAT TCTTATCGAC TCTCTTCCTA TTACGGAATA
148501 TAACAAAGGG TCCTTAAGGA ATCACATCGC CTGTGTATTA CAGAATCCTT
148551 TCTTATTCTA TGATACTGTA TGGAATAACC TTACCTGTGG TAAGGATATG
148601 GAGGAGGAGG CTGTTTTAGA AGCTCTAAAA CGTGCCTACG CTGATGAGTT
148651 TATTTTAAAG CTCCCTAAAG GAGTCCATAG CGTGCTCGAA GAATCTGGGA
148701 AGAATCTCTC AGGAGGACAG CAGCAACGTT TGGCAATAGC ACGTGCTCTG
148751 TTGAAAAACG CCTCCATCTT AATTTTAGAT GAGGCAACGT CAGCTCTAGA
148801 TGCCATTAGT GAAAATTACA TTAAGAATAT CATTGGAGAG CTTAAAGGAC
148851 AGTGCACACA AATCATTATT GCCCACAAGC TGACCACTCT TGAACATGTA
148901 GATCGCGTGC TCTACATAGA AAATGGTCAA AAAATTGCCG AAGGCACAAA
148951 AGAAGAACTC TTACAGACGT GTCCTGAATT TTTAAAAATG TGGGAGCTCT
149001 CAGGGACTAA AGAATATAAC AGGGTCTTTG TTCCTGATCA CAAATTAGTC
149051 GCAAATCCTA CGGACATGGC AATAACAACT TAGGTGGGAT CGCTCTCTCC
149101 ATGAGCTCAG GCAACAACTC TACAAGTGTC TGAGTTAGCT TTTGTGATAC
149151 CTCCTCCAAT CTGCTGAAGG GACAGTCTCC TGGAACAGTA TAATCAGAAG
149201 TGATCTTGAG AAAAGAACAG GGGATGTGAT GTTCTGCTGC TTGTGAGGCT
149251 ATAGCATAGC CTTCCATATC TAGAAGTTTA AACGTCTTAT GAAACCCATA
149301 ATGGTACAAT ACTGGAGAGG TAACCAGAGA GCTTTTAGGT AGAGAATCCG
149351 GTAGAGCGTC AAAGATATAA GGGGGATCTT CAGAGAGAAC AGGAGGTGTA
149401 TCCGTAGTGA GGTTTGCAAT TTTCTCAATA GTGTAACATT GACCTAAAGG
149451 AATCTCGGGA GAACATGCCC CCACAAAACC TGGATTGATC CACAGATCGT
149501 AATCTGTATA TGCTTGGCAA TAGCTTTGAA GAGCATTTAA AACGGCTGTA
149551 CTTCCCCAAA CATGGACAAT ATAGAGAT~T AGATGGTAGT CAGTACAACG
149601 ATAACTATAG AGATGCTCGT TGATCTGTGT AAAATCAAGT TGTTCAATTA
149651 GAGGAGAAAT TTCTCTATAG TCTGCAACAA TGCAAAGGAT TTTTTTAGGT
149701 GTATTGACAG CATTCATTGG CCTTCCAGAG CATATGTAAA GCTTTTTTCC
149751 CAGTTTTAGA TAGTTGAAAG GTTTCTTTGT TGATATAGGT TCCTATGAAT
111


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
149801 CTATGAATCA CGGTCACGTT TTTATTTTTA GAGTATTCTA CTGCTTTTGC
149851 TCCCGCAGTT ATAGGATCTT TCAGGGAGCA AATTAAAGAC TTTCTTAATG
149901 CTGCTGTTAG AGCATCCACT GTAGCCATAG GAACATATTT CGCAATGGCT
149951 AAACATCCTA AAGGAAGGGG AAAGATGGTC TTACGGCGCC ATAGCTCTCC
150001 AAAGTCTGCC CGCAATGTCA ATTGGAGATC GTAGCTGAAG CGCTCTTCAT
150051 GAATCAGAGC GCCTCCATCG ACTTTCCCTT GCAGTATCGC GGATAGAATT
150101 TTGTCATAAG GCATGGGAAT GAGTTTTGCC TTGGGATAGT AAAGTTTACA
150151 GAGAGCATGA GCGGTTGTCA TCTCTCCAGG AGTTGCCAAG GTATCTAGAG
150201 AACATTCAGG ATCTAAGGAG AGGACGATAG GACCGCTGTT GTATCCTAAG
150251 GTATTTCCTA CGTCCATAAG ATTATAATAA TCAGAAACTA GAGGGAAGAG
150301 CGCTGCTGAC ATTTTCATTA GGGAGAGCCG TCGCTGCAGA GCTAGGGTAT
150351 TCAAAGTTTC AATATCCGCA ATTGTTACCT GGTTAAGAAG AGGCCTGAAT
150401 TGGGGGTCTT TTAAGAAAGA ACGAAAAAGG AAAATATCAT TCGGGCAAGG
150451 AGAAAAGGCA GCAGTCAGTA TCATGTCGGT TGATGTAATA GAGCTATAGC
150501 GGCTTTGATG TCTTTATTTT CAGGCTTATC CAAAGCTCCT TGGTTTTCCA
150551 GCCATTCGAA GTAAGACTTA GGAATATCCA CAAGAGGCTG CCCTTTGTAT
150601 TTGCCAAAAG GCATTTTGAA GACTTTCGGG TGATAGCTCT GTTGCAGCAA
150651 GTCGAGGACT TGCTGGGGCG GTAAATCACC GATTAAAGAA GTAAATACCT
150701 TGTGCAATAT CACTACGTCA TCTAGAGCTC GGTGTGCTTG ATTTTCAGCA
150751 AAACCGTAAA CTTGTCTTAG GTATTGTAAA TTATGTTTTG GTAGATCGGG
150801 GCGATATTTT TGTGCCCATT TTAGAGAGTC TATTGTACGG TTTGTCAGAG
150851 GCTCTAAGGA ATGTCTGCGA CATTCCTTAC CGAGTAGGGG GAAATCAAAA
150901 CCGTCATTAT TATGAGCCAC TAAGATGCTG TCCTCTCCGC AAAATTTCCT
150951 AAATCCCTCG TAGGCTTCAG GAAATTTGGG AGCAGAAAGT ACCGCATCCG
151001 TAGTGATTCC ATGAATTTTG GATGCCTCAT CAGGAATGGG AATTTCCGGA
151051 TTCACATAAG TAAGAAAGGA CTCATCTGTG ACACTATTGT AGGCAGCAAT
151101 TTCTATAATG CGATCTCTTT CTATTTGTGT TCCTGTGGTC TCCGTATCAT
112

CA 02350775 2001-05-11
WO 00/27994 PCT/US99/Z6923
151151 AGAAAATAAG AACATCCATA GTTTGACTAC TCATTACGTT TTTCTTGTTG
151201 CTCTTGAAGA GCCTGACGTC TTAGTTCATC CAAATTCATA TTCCCAGAAG
151251 AGATCAACCC AATAGCATGA GAAAAACTAT CACAGACTAG CTTTATTGTA
151301 TCGATATATA TCCGTAATAG TGTGTCATGA ATTTCTCCGT TTAGGCAGGG
151351 CAACACAAGC CGATAAAATA TCAATCCCTG TTCTTCATCC ATGCCAAAGC
151401 CGGGAATATC AATGTCCCTA TTTAAGAGAT GGAGTAAACG AGCTGTTGAT
151451 GCCTTATGAG ATTCATGCAA TTGGTAGGGA AGGTAACAAA TCAACTGCAG
151501 TATTTCTCCC TCACTGCGGA TTACAAAAAA TAAAGGGAGT TCATTGCCAT
151551 TAGCTTGAAT GTTAATGTAA GTAAGACCGC TTTCTCTTTC TAAGAAAGGT
151601 TCTTCATCCG AACTTTTAAG AAATTTTGTG AGATTATTTT GATTTAATGT
151651 CCATGTCGTC ATTTAGGAAA TACTCCAAGT TGTTCCTAGA GCCTGCATCA
151701 TTGCTGGCTG ATAATACTAG ATCTAATCTT GATTCGTCTG TTGTTTTTTT
151751 TGTATCTTTT GTATTGCTTT ACTGAGGAAA TCAGAGGTTG CAGAGGAAGA
151801 AGACTGCTTA TTATTTTCTT GAAGTAGATG ATTGATATCA GTGTCGGGAG
151851 GAAACTCTGT CAGTAGCGTA ATTTTTTCTG GTGTTTCTGC AATCACAAGA
151901 GTTTTATTCA CAACTCGAAT GAGGTAAATA GAAGTTTTCG GCGTTAGGGA
151951 ACGTCGTTCT AGGATTTTGA TTTGAGACGA GCCTCCAAAA CCGTGACTTC
152001 TTGATCTCAC AAACTTTTTA AACGCCCAAA CTCCAAAGCC AAAAATTGTT
152051 AAAAGTAGAA TCAAAGATCC TAGCATTTTA AACATTTCTA ATTTCATGCT
152101 TCCTGGGAAC ATTTCATGTA CAGAAATGGG CTCTTGGATC GTTTCTGCAA
152151 GAGCAAGCTC ATCAGAAAGC TTAAAAACTA AAGAAAAAAG ATTAAAAAAC
152201 ATGTGTAAGA CCCGCGATCA TCTCTATAAA ATTATAGTGG TAGCCCGATT
152251 TGTATCCAAC TACATACAAG TAATAATGAA GTATAGTTTT AGTCGATGCT
152301 ATATAAATTA TAGTACAATG ATTTCCAAGT ACAGATTAAA CCGTAATCAT
152351 GTATATTCCT GCAAGTACCG TCTAGAGAGC TCCCCTAGAT GATTTGGGTA
152401 ATTCACAGAC TCCTTTATAC CCTTCTAGGG TGTGCTCGTT CCACAGAGCC
152451 CAAGCTCTTG TCTTTCATAG ACAAAACGAC AGCAGTCTGT CGGTGGATGC
113


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
152501 AAAGAATTTT TTCAAGTTGC CTGAGAAATT CCTTGGCAGC GCTTAATAAA
152551 ACTATGGTGA TGCTATGGAA AAGTTACTAG TGACTGATAT TGACGGTACA
152601 ATTACCCATC AATCTCATCA TTTAGATAAA AAGGTGTATG AGCGGCTCTA
152651 TGCGCTGCAC CAAGCTGGTT GGAAGTTGTT TTTCTTGACG GGAAGGTATT
152701 ATAAATATGC TGCACGCTTG TTTTCTGATT TTGATGCTCC ATATTTATTA
152751 GGATGCCAAA ACGGCGCTTC TGTATGGTCT TCAACATCAT CAAATCTTCT
152801 CTATTCTAAA AGTTTACCCT CAGATTTATT ATGTATTTTA CAAGATTGTA
152851 TGGAGGGGGC AACGGCTCTT TTTTCCGTGG AATCAGGAGC TCCTTACGGG
152901 GATCACTACT ATCGCTTTTC ACCGACTCCT ATAGCTCAAG ATTTACACGA
152951 ATATGTAGAT CCTAGGTACT TTCCTAATGC TAAGGAAAGA GAGATCCTAT
153001 TTGAAACGCG CTCTTTAAAA GACGACTATG CTTTTCCTAG TTTTGCTGCA
153051 GCAAAAGTCT TTGGACTGCG AGATGAGGTC ATCAGAATTC AAAAGGAGCT
153101 GGAACGCCAA GAAGCACTGA CTTCAGTCGC GACGATGACG TTAATGCGCT
153151 GGCCCTTTGA CTTTCGCTAT GCCATCTTGT TTTTAACAGA TAAAAGCGTC
153201 TCTAAAGGCA AAGCCTTAGA TCGTGTTGTC AATATACTTT ATGATGGAAA
153251 GAAACCCTTT GTCATGGCTT CAGGAGATGA TGCTAATGAT CTCGATCTTA
153301 TTGAGAGAGG AGATTTTAAA ATTGTGATGA GTTCCGCACC TGAAGAGATG
153351 CACGTTCATG CGGACTTTCT AGCTCCCCCA GCAGATAAGA ATGGCATTCT
153401 TTCAGCTTGG GAAGCTGGTG TCCGCTATTA TGACGACCTT ATGAGTCTTT
153451 AGGGAACATC TCAGGACCAA TTCCCATCAC ATTGGCTCCG TGATCTACGT
153501 ATAAGGTCTC ACCAGTAATT GCTGAAGCTA GAGGTGATGC TAAGAAAGCT
153551 GCAACGGCAC CCACCTGCTC GGCATTCATA GCCTCGGGAA TAGGCGCCCA
153601 CTCTTGGTAA TAGTCTACCA TTCTTTCAAT AAAACCAATT GCTTTTCCAG
153651 CTCGGCTTGC TAAAGGTCCT GCAGAGATGG TATTGACACG TATGCCCCAA
153701 CGGCGTCCCG CTTCCCAAGC AAGAGTTTTG GTGTCACTTT CCAAAGCTGC
153751 TTTTGCCGAA CTCATGCCCC CTCCGTATCC AGGAACAGCG CGCATAGAAG
153801 CCAAATAGGT GAGCGATATT GTCGATCCAC CACGGTTCAT GATACTTCCA
114


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
153851 AAGTGAGAGA GAAGGCTAAC AAAAGAATAA CTAGAGGCAC TGAGAGCCGC
153901 TAAGTAACCT TTTCTTGATG TTTCTAATAG AGACTTAGAA ATTTCAGGAC
153951 TATTTGCCAG CGAGTGGACA AGAATGTCAA TATGACCAAA ATCTTTTTTT
154001 ACCTGTTCTG CGACTTCTGA TATCGTGAAT CCCGTAATGC CCTTGTAACG
154051 TTTATTTTCA GCAATATCTT CAGGAACATC TTCAGGGCTA TCAAAACTTG
154101 CGTCCATGGG ATAGATCTTA GCAATCTCTA AGAGAGTGCC ATTCGATAAT
154151 TTTCTAGATT CATTGAATTT TCCTAATTCC CAAGACTGAG AGAAAATTTT
154201 GTAAATCGGT ACCCATGTTC CTACAATAAT CGTAGCTCCT GCTTCTGCAA
154251 GAAGTTTAGC AATACCCCAG CCATATCCTT GGTCATCACC AATGCCCGCA
154301 ACAAATGCTA CCTTTCCTGT TAGATCAATC TTTAGCATGA ATCCGCCTTA
154351 TACTTTTGAA GCTTATTGGA AGGAGAGTAA CAAATCTTTC GATTATTAAG
154401 AAAACCTTTT GGTGCCTCAA CAGGGGAGAT CCTGCCTCCA ATGTAAATAG
154451 AAACGTAAAT TCTTTAAATT TTTTTCTTTA CATATTTTAT AGAATATCCA
154501 AACTTCTCAC TCCCGCGTAC TGCTAAAAAA ATTTTCAAAA GAATTTACGA
154551 TCCGAACTTA TCGTAGTTTG GGTTTCACTG ATTACTTAGG AGGTTGTTTG
154601 ACGAATCCTT TAGGGAAATT CCCCTCACCA CAGAATCCAC AGGTTGTTAC
154651 GATAGCGCCT TCTTCCACAA CACCACAAGC AGTCTCATCT GCAGTTCAAG
159701 GTTTTCTTCA AACTGGAGGA GCTGCCTCCT CTACAGCGAC AACTACTACC
154751 GCATCCGGAG CCTCTGCATT AGGACTTTCA CCTGATCAAG TGCAAGCGTT
154801 GCTTACTAAT TTATTAAATG TGGGACAACC ATCAGTGGGA CAACCATCAA
154851 CTTCAGCAGG AACTTCGGGA GCCTCCTCTT CCAGTGCAAG TATGCAGCAA
154901 CAGCTTTTGC AACTTATCTT AGACAAGACA ACAGGAAGTG GCGGATCGTC
154951 CGTGAGTTCA GAGCAATTAC AGCAACTCCT TAGCTTGGTG AGCCAGATGA
155001 CTACGTCTCA AGGAGGAAGT GGTGGAACTC AGGCAGGACA GGCCGCTTCG
155051 GTACTGTTGA ATTTGTTATC GGCAACAGGA TCTGCAGCAG CAAATCCTTT
155101 AGGGACAGCT GCATCGTTGG CACAGATCAT TTATGCAGCA GTAACAAGTC
155151 CTGGAGCAAA GAAAACTAGC GAATTTTGTT ATAATTATTG TGGAGAGACC
IIS


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
155201 TGCCAAGGCA ACTGCGGTTG TCCTACCTGT GGCTGTCCAG ACGGACAGTG
155251 CGGTTGTGGA GGATTTGGCC GTTTTTTCTG TGGTGTATGG AAAAATTGTT
155301 GCGGGATAGG AGAGGGATCC CAAGAACCCG CAATCCCTTT ATAAGAACTC
155351 GAGGCTTTAG AACAGAAATA TGGCAAGGCT GTTCTTTTAA TTGCGTTAAG
155401 TGAGCTTGGC ATTGATACCA TGAGCTTATT ATCAGGACAT CGACTCGAGG
155451 GATTTCCTCC AATCGCGGAG GTCATGGCTG CATGTGACCG GTGTTCTATG
155501 GACTTTTGTG AGATCTTGAA GTCTCAAAGC ATGGATCTGT GGGCGGATGC
155551 GGCGAGTTGT GTGGATGGTT TATTACAGGA TCCTTTTTGG AGTACAGCAA
155601 TTGCCTCAGG GATTGCTAAG TCTTCTCTTC AGGAAACGGA ATTCGAGTGT
155651 GAAAGCAAAG TGATGGTTCT TTCTTCATGG GGAGAGCAAG GAGCACAGGT
155701 TTGTAGCCCT TTTAACCTAG AGAGGATATG TATGTCTTTC CCATCACTTA
155751 AGGTCTTCTC CCTTAAAAAG AACGGGTGCG AGAACATGGG AATCCAGTTG
155801 TCTGCATCCT GCATGAATCT ATTAATGTCT ATTTTCTTTG TAGCTACCAA
155851 TGGAGGAAGC ACTCCGATTT GGATCACCAA AGAAAATCTG ATGGCGTTAG
155901 TTGCTTTGGT TTTATCTCAC TATCAATGTT ATTTTGTCCC AGCCACAGGA
155951 GATCCCCAAC GTGGCAACAT TTTAGGTAAT CCAGAAGTCA ATGCTATTTT
156001 GGCTCGGGGG ATGGGCATGC GTGTCGATCT GGAAAGGAAG CGAGGGGGAG
156051 AATCTTCCTC GTCACGCTAT TTAGAATTAG CTGCACGATG TTTTGAGAAT
156101 TCTCTTACGA AAACAAGTTT GTTAAGCGAT GCTAACAATG TTCAAGAAAG
156151 AGATAAGTGC CTACTACAGA TGTCAACTTC ATTGATGCAT ACGGCGGGAC
156201 TAAATTTACA ACGCCCCCCT GTACCCACAC CTTCTGGAGT CACGGCACAT
156251 CCGCAACCTC AACCAGATCC TGTGGTTACG TCTCAACCTT CTTTATTAGG
156301 TGCTAGAGAG CGTTCCCCTG TGTCTTCTAG AGGGCGTTTT CCTGTAGTTT
156351 TACCTTTAAG TGTGATTTCT CCTAGGTCGC ACCCCGGAAG GGTAGAAAGG
156401 CGGGATTTAG AAGATGAAGA AGAGGAGGTT ATGTTTTGAA GCAGTGTAAA
156451 CGACTCCAAT TACAGTTTTA TGAATCTCTA ATTGTAAAGT TCTAGGGGTT
156501 TTTCTTGAAG TAAGTGCCGA GCACATTCTC TAGGATCTTC GGTTGATGAC
116


CA 02350775 2001-05-11
WO 00/27994 PGT/US99/Z6923
156551 GCACAAATTT TTAGGGGAAG ATTTGTGAAT GGGGAGATAA ATTCTAGGGA
156601 GTGAGCATGG AGGAGAGGGC GGAAGATCTG GGGAGGCTGT TCTTTAGGTC
156651 CGTAGTCGAC ATCTCCGACA ATAGGATGAC CCAGCAATCC CATTTGTAAG
156701 CGGATTTGAT GGGTTCTCCC TGTGATGGGC CTGCGGCTCC AAAAATCACA
156751 GCTCCACACC TCCGGTATAC GGGGGCCGTA TAAGATTTTA CGGTTCCAAA
156801 TTTTTTTTTA GGATGACCAA AAACGAAAGC TATGTATTGT TTATGGATTT
156851 TTCTTTGCTT GAACAATTTC ATGAGCTCAG TAGCCGCTTG TTTAGACTTT
156901 CCCATGAGAA GACACCCAGA GGTGCCTTTG TCTAACCTAT GCACAGTAAA
156951 AAACCGTGTC ATGTGTGCCA TTTGTTCAGT AGTAAGATGG GGAGGTTTTT
157001 CGTAGATAAT GCTATAGTCA TCCTCCCAGA GGATGCTAGG TTGTTGTTTT
157051 GTTGAGGGGA TCAGAGATAG GGAAACACGG TCGCCAGGTT GTACCTTGTA
157101 GGATTCAAAT CTTTCTATGA ACCCGTTCAC TCGACATCGA TGTTGGCGAA
157151 TAGACGCCAA GATTTCTTGC TTGCTATGAT TAGGCAGTTG AGATCTAAGA
157201 AAAGAAGATA ATCTTGAGAC TTGTGTGGCA AGCCAGGAAA AATTTTCCAT
157251 AAAATATTGT AAAGCAGCCC TTTTATCATT TGATAATTGC ATAAAATTTT
157301 AAGAGATTTT GTATGACAAA GATAGCTTTT TCTGAAAAGG CAAAGAATTT
157351 TCCTGTAGAG GCATTAAAAA AATGGTTTGA AAAAAATAAA CGATCTCTTC
157401 CTTGGAGAGA TAACCCGACT CCCTATAGTG TGTGGGTTTC CGAAGTTATG
157451 CTACAGCAAA CGCGAGCTGA AGTTGTTATA GATTATTTTA ATCAGTGGAT
157501 GGAGAGATTT CCTACCATAG AGTCTTTAGC TGCAGCAAAA GAAGAAGATG
157551 TCATTAAGTT ATGGGAGGGA TTGGGTTATT ATTCTCGAGC GCGCCATCTT
157601 TTAGAGGGAG CTCGCATGGT TATGGAGGAG TTTCATGGAA AGATCCCTGA
157651 TGATGCCATT TCCTTAGCTC AAATTCGTGG AGTTGGTCCT TATACGGTTC
157701 ATGCTATTCT AGCCTTTGCT TTTAAGAGGC GTGCTGCTGC TGTGGATGGC
157751 AATGTCTTGC GTGTTCTTAG CCGGATATTT TTGATAGAAA CTTCTATAGA
157801 CTTAGAATCA ACTCGTACTT GGGTTTCTAG GATTGCTCAA GCGCTTCTTC
157851 CTCATAAGAG TCCCGAGGTT ATAGCTGAGG CTCTGATAGA GTTGGGAGCT
117


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
157901 TGTATCTGTA AAAAAGTTCC TCAATGTCAT CGTTGTCCTG TCCGTCAAGC
157951 ATGTGGAGCT TGGAGGGAGA ACAAACAGTT CGTATTGCCG GTACGTCATG
158001 CCAGAAAAAA GGTCATCTTT TTGCATCGTT TGGTAGCGAT TGTATTGTAC
158051 GATGGCTCTT TGGTTGTCGA GAAGAGACGT CCTAAAGAAA TGATGGCAGG
158101 CTTATATGAA TTTCCTTATA TTGAAGTTGA ACCAGAGGAA GGTCTTCAAG
158151 ATATAGAAGG ATTTACTAAG AAGATGGAGC TTTCTTTAGA AAGCCCTTTG
158201 GAATTCTTAG GTAACCTTAA AGAACAGCGG CATGCGTTTA CTAATCATAA
158251 GGTTCATTTG TGTCCTATAA TTTTTAAAGC CACTTCTCTG CCTCAGTTCG
158301 GGGAATTGCA TCTTTTGAGT GATATAGATC ACTTAGCTTT TTCTTCAGGA
158351 CACAAAAAGA TTAAAGATGC TTTGCTAATC TACCTCGGGG ATGTCAGGTC
158401 TAGAGAATCA ATAGGAGTAT AGATGCGAGA TCACGCTTTT TCTAAATTGA
158451 TAGGGACTGT CCGTGCCATG GTAGTTGAAG GACGTTGTCC TTGGTCACTT
158501 CAGCAATCCC TAGTCTCTAT GGTAGAGCAT ATTCTTGGAG AGTGTCAGGA
158551 ATTTCACGAG GCCGTCTTAC AAGGTAAGAC GGTACAAGAG GTTGGTTCCG
158601 AAGCCGGGGA TGTCTTAACT TTAGTTCTAA TTTTATGTTT TCTGTTAGAA
158651 CGAGAGGGCG TACTTGCTTC CGAAGACGTT GCCAATGAGG CTATGGAAAA
158701 ATTGCGTCGC CGTGCTCCTT ATATATTCGC TGAAGATTAC AAGCCGGTCT
158751 CGATTGAAGA GGCCGATCGC CTTTGGGAGC TTGCTAAGCA CCGAGAGAAA
158801 AATGAATCTA CATAGTTGAA GTTTTGGTCT ATTTTTAAGC ATATGGTGCT
15$851 TTTGAAAAAA CAGAATATAT GCTATCAAAG AAGGGTAAGT TGGGGGCCTT
158901 TTAAGAGAAG GAACCTGCGA ATCGGGTCAG GACTGGAAGG TAGCAGCCCT
158951 AAGGAGAGTT TTCTTTTGCT AAAAGAATGT TCTCCAACTT ACTCTTTTTA
159001 CTTTATTCCC AAAAATAGCA ATGAGGTGAG GTTAAACAAC CCGTGCAGTG
159051 CAATGGGAGA AAGAATGTGC CGATCTTTTT CATATAGAAA CCCTGCAGAT
159101 AAGGAAAAAA CAAAGAGCAC GGGGACAAAG ACCCAACTTC CTAAAGAGTG
159151 TTCAATGTGA ATGAAAGAGA AAATAATAGA AGAGCATAGT ACCGCAGCTA
159201 TGCGCGTCAT TTTGTTTTTC AAGAATGTCT GTAGAATTCC TCTAAAAAAT
118


CA 02350775 2001-05-11
WO 00/27994 PCTNS99/26923
159251 ACCTCTTCTC CAAATGGAGT GAGGACGCCT AAATTTAGAA TCATGCTAAT
159301 GTAGTGTCCT GTTATAGGCA GAGAGTTCTG AACTTCTTGA GTGACTTCTT
159351 GTGTGTGAAT CTCTTGCGTA GGAAGAACCA AAGTTAAAAA TTTACTCATC
159401 ATAATCCCAA TCAGTTGTGT TACTGGGATG ATGATGATCC ACATTCTGAT
159451 GGCAGATCCT AGAGCACGCC ATGAAGTTTT AACCGGTCTT TCTCCAGAGA
159501 AAAGTATAGC ACGTGTGATA TCCTTGGGGA GAAAAAGCAG GTAGAACAGA
159551 AATGCAAAGG CAAGGCTAAT TCCTGTCATG GTGGAAAGTA ATTCCGCAGT
159601 TTGTGAGCTC ACGCTAAGAG CTACAAGGGA AGAAAAAACA AGAAGAGCAC
159651 CACCAAATAA AACTTGGCGG AGCTTTAAAG GTGTTTTCCC AGAGGGTGCT
159701 GGCCAGATAA AGAAGTTTTT GGAAGCTAGA GCAGCGACGC CAAGGGACAA
159751 GAGAAGAATA AACTTGGACA TTTCCTTAGA CTACGAGTAG TTAGCACAAA
159801 CATAGCCCTC AACTCTGGCA ACAACTTCGC GGAAAAGACG GCTATGCATT
159851 AAGCCCATGG GCGTTGAATC AAAATGTTTT GAGTTCCAGC CATAGCGATG
159901 ATAATCATTG ACTAGGGTAG TTAAAGGCTG GCTGCATTCG ATAATCTCTT
159951 GATAAATGAG AGCTATTTTA TGATGACGGA TATCAAAAAC GCGAACACGT
160001 ACAGACGCTG TTACAGAATC GACACCTGCT TCTTTCCCTG TCTTTTGTTC
160051 TAACAGTTCT GTAGCAACAA TGAATTCTGC AGGAAGAAAT TGCTCAATAA
160101 TTGTTTCGGG TAGACGATTC GCAATCGGAG CATAGAACTG AGAGACTGTC
160151 TGAGGTGAAG CATTGTGCTT GATCAGGAAG ACCTTTTCCG AAGCATAAAA
160201 CCTTTTGCTG ATCTCTTCAG TAAATTCTCC TTGGAGGTTC CAAGGTAAAG
160251 GTTCAAGACT CTTTCCTGGG CGATGAAATA CAGGAAGCAT CGCAATCACA
160301 CCTTTAGTTT TGCTCCCTGA AGTGTATAGC TTAGGATGAT AACTTCCTGA
160351 AGAGCCTAAG TGAGTGCAGC TGGATAGGGT TGGGGATAGA AGTCCTAAAG
160401 ATGCCAATAA TACCAACATT TTTCGCATAG TCACTGTCCT TAAATTGCTT
160451 ATTTTGCAAA AGATTCTAGC CCTGGGAAAG TTTTTACTTT TAAGATCAAT
160501 ACTTTCGCAA TTGAGAGATT TTCCATTTAA AACTCTCATT AGCTTATATC
160551 AAAGAAAAAA ATAAAAACAA GCAAAGAGAC CGTCTCAGTT TTAGTTTAGA
119

CA 02350775 2001-05-11
WO 00127994 PCT/US99I26923
160601 AACTCAAGGT TGAGAAAGGG ATTCTGACCA AAGTTGTGAG GGAACTTTGG
160651 TAACTTTTTC TTTAGGAATC AATGTGCACC CTGGGAGGAA TACAACTGGA
160701 GCTGAGATCA CAGAAAAAAT AAACCACTTC ATGACTACCT CTGCAAAAAG
160751 AATACTACTA TTTTCTATTT GCATAGGCAG TTCTTCGATT TATAGCAATT
160801 TTTACTTTAT CATCATAAAA ACTATGATGA AAAGGTTCTT AGTGAACTTC
160851 TAAGGAACAC CATGAGTTTG TGATTAAAAC TCATGGTGCG TAGTTGAACC
160901 CTTATTGAGG AGGGACGCAA CCACTAAGAG CTAACAATAC TAAACAAGAT
160951 AATAGTAAAG AGAATGCTTT TTTCATTATT CATCCTTAGG TTAATTGACC
161001 TTTCCAGCCT AGCTCTAGGG CGATTATTTA TCAAATTTTT CTTTGTAATT
161051 AATGATCATG CGACCATTAA TTTAGCGATA AATTATGATT TCGTCAGGAA
161101 AATTCAATTC TTTATAATAA TGATATGAAA TTAGAGAATG TCTATAGGGG
161151 CGGACTCTAT TTGTGATCCA GGATCTCTTT AGGAGCACTT TGTGGATTTT
161201 GATTATTTTG GTCTGAGTGA TATTGGTAGG GTGCGCGCTA GAAATGAAGA
161251 TTTTTGGCAG GTAAACCTCA TGTCTCAAGT GGTTGCTATT GCTGACGGTG
161301 TTGGGGGGCG TCTTGGTGGA GACATTGCTT CTCAAGAGGC AGTGACTAGC
161351 CTTATGGAGC TGATTGATGA GCAACAGTCA AAATTGATGG GGTATGGGGA
161901 TGACCAGTAT AAGGAGACTT TAAAAAAGAT CCTTTTAGAG GTCAATGGTG
161451 TGGTCTATGA ACACGGCCAA ATGGAAGAGC ATCTCCAGGG TATGGGAACC
161501 ACTCTTAGCT TCATCCAATT CCGGAAGGAT AGGGCATGGC TATTTCATGT
161551 GGGAGATAGT CGAATTTATC GTATTCGTGA GGGAGAACTG CGCCGCCTTA
161601 CCGAAGACCA TTCTTTAGAA AATCAATTAA AAAATCGTTA TGGGCTTCCT
161651 AAACAATCAG ATAAGGTGTA TTCTTATCGC CATATTCTGA CTAATGTTTT
161701 GGGAAGTCGT CCCTATGTCA TGCCTGACAT TCGGAATCTT CCTTGTGAAA
161751 AGGAAGATTT GTACTGCCTC TGTTCGGATG GATTGACAAA CATGGTTCCA
161801 GATATCGATA TTCGTGATAT CTTGAACCAG CCCGCCACCC TAGAAGAACG
161851 GGGGAATGCA TTAATTTCTC TAGCCAATAC TCGTGGAGGC GATGACAACG
161901 CTACTGTCGT ATTAGTCCGA ATACAATAGT TCCTTTGCTA AGGATAGTAT
120


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
161951 TCCATGATCT ATTTGGATAA CAATGCGATG ACACCCCCAG AGAGGGGACT
162001 TTTGGAATTT CTCCAAAAAA CCTTCCTTAT AGAAGGGACG TACGCGAATC
162051 CTTCGAGCGT CCATCAATTA GGTAAAAAAT CTCGTCAACT GGTTCTAGAA
162101 GCTTCACACT GGATGCAAAA GGTCCTTTCG TTTCAGGGCC GTGTCCTCTA
162151 TACCTCAGGG GCTACTGAGA GTTTAAATTT AGCAATAGCA AGCCTCCCTA
162201 AAGACAGTCA TGTTATCACC TCAGGTAGCG AACACCCCGC CATCTTAGAG
162251 CCTTTAAAAC ATTCCTCGCT TTCCGTTTCT TATTTAAATC CCGAAGAAGG
162301 GAGATGTGTT CTTACTATAG AGCAGATTGA AAGAGCTGTG ACTCCTAAAA
162351 CTTCAGCAAT CATCTTAGGT TGGGTCAATA GTGAGACTGG TGCCAAAGCT
162401 GATATAGCTG CTATAGCCCA CTTCGCGCAA GAACGACAAT TGCAATTTAT
162451 TGTGGATGCG ACTGCAAATG TAGGTAAGGA GAGGATAGTT CTTCCCTCTG
162501 GTGTCACTAT GGCAGCATTC AGTGGACATA AATTTCATGC ACTCTCTGGA
162551 ATCGGAGCTC TTCTGGTCTC TCCAGGAGTC AAACTACATC CTCAGCTGTG
162601 GGGAGGAGGT CAGCAAGGAG GGCTGCGCGC AGGCACAGAA AATCTTTGGG
162651 GAATCGCCTC TCTGCTTTAT ATTTTCAAAT ACCTAGATCT TCATCAAGAG
162701 CGTATCTCTC AGGAAATTCT TACCCATAGA AATGGTTTTG AAAAGGCAAT
162751 CAAAGCACGC ATTCCTGATG TCCATATTCA TTGTGCGGAT CAACCACGGG
162801 CAAACAACGT CTCAGCAATT GCTTTCCCTC CGTTGGAAGG TGAGGTATTG
162851 CAAATCGCCT TAGATATAGA AGGAGTGGCT TGTGGTTATG GATCCGCATG
162901 CTCTTCAGGT GCTACCGCAC CCTTTAAATC TCTTGTCAGC ATGGGTGTTG
162951 ATGAAGAGTT GACCCTGGCA ACACTCAGGT TTTCTTTTAG CCATCTTCTC
163001 TTGCAAGAAG ATGTTGAAAG AGCCGTTGGA ATTATAGAAA AAGTCGTAGA
163051 ACGTTTGAAA AATTCCTAAG TCTTAAAAGA GAACATGTTT CTAAGCTGAA
163101 AGAACACTCC TGACTCTTAT TGCAGAATCT ATGAGAGTAA GTTTTTAATC
163151 GATACGGTTT TTATCCCAGA TAAAGACATC TCTTTAACTT CTAAAAGCAA
163201 GTTGTTGATA ATTACAGAAG TTCCTACTTC TGCAGGACTG TCTAGCAGTT
163251 GCAATACCAA TTGGGCTAGG GTTTCTACAG GATATTGCGG AAATTGAATA
121

CA 02350775 2001-05-11
WO 00/27994 PCT/US99/Z6923
163301 TCGAGTTCTT TTTGCAGATC TTTTATGCGA GAGTTGCCAG GAAACGTTCT
163351 TTCAATAACA GAGATGGTCT TGGGTTTTAA ATGAGCAATG TTTGTAGTGT
163401 TGAATAAGAT TTTGAAAATT GCATTTAAAC TAAGAATACC TATAGGTTCA
163451 CCAGAAGCAT TGAGGACAAC AGCAACACTC GAACGGTTGT CTCGAAACTC
163501 TTTGAGGATA CGAATAAGTT TTGATTTTGC AGTGATAAAC CAAGGCGAGT
163551 GTAGATTATT GATTAGGGGT TCATCAAGAG CTTTATTGAC AAAGTCTTTA
163601 GGATGGGCAA TCCCAATAAC GTTTTTTCGG GCCTTGTGAT AGACAGGAAT
163651 AAAGTTGATA TCTGTATTTT TTATAGTCCG GCAAAAATCT TTAACATTTG
163701 CAGAAGAAGG AAGCATGGTA ACCTGTTCTA AAGGTTGGCA TACCTGATCT
163751 GCACAAGTCG CACTTAAAGA GAAAATATTT GTAGCAATTG TATTGAAATC
163801 TTGTTCTTCA TGGTGAGTCT CTAAAGCTTT TTGGAACTCG TCTCTACTTA
163851 ATGTAGAGTT CAATTTTTCT TTCCTAATAT TTAGAAGATA GTAAAGACCC
163901 TCAGTGAGAC TTCCTATGAG CTGAATCAGA GGATAGAAAA TATAGTGGGA
163951 ATAATAGAGA ATCGGTGCTC CCCAAAGTGC TAATTTTTCA GGAATCTTCC
164001 GTGATATTGT TAGAGGTAGA AGTTCTGCAA AAATCACAAC TATAAAAATT
164051 TGAGTGAAAG GAGCGTAATC TGGAGTGATT CCTAAAGCTC GATAGCAATT
164101 TCTTGAGGAC TCAGACCCGA CTTGTAGAGC GATATTCACT CCTAACATCA
164151 CCGTTCCAAA TAAACGATAG GGGCGGCGAA TCAGGAAATT AATGTAGCGA
164201 GCTTTCTTAT GATCTTTAGT CAGATAGTAT TGCAATCGTA CACGGTTAAA
164251 TGACACGCAG GCCATTTCCA TCATCGAATA GAATCCTTGT AAGACAATAC
164301 AGATAATGTT GACTCCTATC CAAAAGAGAG CAGAATTAGT CATACAATTT
164351 CCTTATATAC ACACGGCGAA TGCGATTCGG AGCAGCGTCT AATACCTGGA
164401 AAAGCAAGTT~ATTCCAAGAG AGTTTCATTC CTGTTGTCGG AATCGTTCCG
164451 ATTTGCTCTA TTAACCAGCC TCCTATAGTC GCAATATTAT TGTTCGTCGG
164501 TAGGTTGATA TCGAAGATCT CACTAAACTC ACGGAGTTCT AAAGTTCCTG
164551 AGGCAATAAT AACATCAGCT CCTGAGGTGG TATAGAGTAT TTTATTATCT
164601 CTCTGGTCTA CAATTTCTCC AGCAACAATT TCAAAGAGGT CTTCTTGAGT
122


CA 02350775 2001-05-11
WO 00/27994 PCTNS9912b923
164651 GATCAATCCT TCAATAGATC CGTATTCATC AATGATCATC CCTAGGGTTT
164701 CGTCTTCAGC TGCCATCTGA CATAAAGCCA TTTTTGCAGA GATGGTTTCT
164751 GGCATATAAT ACGGTTTTTT CAGCAAGGGG AGGAGATCAT CCGAAGATTG
164801 CAGTGGCTTG TCATGTAAAA GAAGAGAGCG CGCTGTGCAA ATGCCCAGAA
164851 GGTTTTGGAG GTTATCGTTA CATATAGGAA CTCGTGAGCA ATGCTGTTTA
164901 GAAAATAAAA GATAGAGGTT CTCTAAAGGG GTTTGGATAT CATAAAATAA
164951 AATATCCTGG CGTGGCTGCA TACGCTCTTT AACACTACAA TCACTAAGAG
165001 AAAGATAACC ATAGAGTAAA CGGCTTTCTT CTTGATTGAC TACGCCGAAA
165051 TCCTTACAAC TTTGCAATAC TTCCTTCAGC TCTTGGGGTT GGATGATATC
165101 AATCTGTTGC TTCGATAAAA TCCATTGGAC CACATAATTA ATTCCTACGA
165151 TACCCCAGTG GAGTAGGGGT TTGAAGATTT TAGTAACACA AAGAATAAGA
165201 GGGGCTACGG AACTAGCAAT CTGTGTATTA AAAGGAAGAG CTACTGCTTT
165251 AGGGAGAATC TCACCTAAGA TCAAAGTAAT TGCTAAAGGA AGACCTACAG
165301 TAAACCACCA CGAAGCTGCA TCTCCAAATA GAATGGCAAA ACAGTTTTGA
165351 ATAGCAATAT TCAGTCCGAT ATCACAAAAA ATTAAGGTGA TGAGCAGGTG
165401 GTGGGGATGT AGAAGAAGGG TAGCTACTCG CTGCTGTTTC TTAGATTTAG
165451 AGCGCTTATA GTGCGAGATC AAACTCGTAG GCAAAGAAAA CAAAGCAATT
165501 TGAGATAACG AAATGAATCC CGAGCATAAA GTAAAACAGA TAATGAAGAA
165551 CATTAACATG GTAGGAATCA TGGTCTCTTT TCAGTCCTTA TTTTCTGATT
165601 GTTGCTTTGG GGAGACACAG AGTTTCTTAT AGCCTTTAGG AATGAGACCC
165651 AGACGCTGGA TTAAAGCAGC TTCTATAGCA GCGGAGTCTT GCCAGTGTTG
165701 CAGATGTAAT TGGAGTTGAC GCTGCTTTTC TTGAGCAGAA AGAATGTCTT
165751 GGCATAAAGA AGAGACCTTG CTTTGTAAC~C GTAGCTCTTC TGTACGTAAC
165801 TCCTGGATAG CACGATCATA AACAAAGCCT CCAATTAAGA TGCTAAAGAT
165851 CACCCACCAG GATTTGATCA TCACTTCTTC TAGTAATCTA AAACCCCAGT
165901 TTTTTTTTCT TACTGAAACT TTAGACACAA GGTACGGTGA TGCCTTGTTG
165951 CTTTTGATAC TTTCCTTTTC TGTCTGCATA AGAAACCTCG CAGGTCGTAC
123


CA 02350775 2001-05-11
WO 00/27994 PCT/US99126923
166001 TAGACTCAAA GAATAAGACC TGGGCAATCC CTTCATTAGC GTAAATTTTC
166051 GCTGGCAATG GCGTAGTGTT AGAAATTTCT ATAGTCACAT GCCCTTCCCA
166101 TTCAGGCTCA AAAGGTGTGA CATTTACGAT AATTCCACAG CGTGCATATG
166151 TAGACTTTCC TATACACATT GTTAAGACAT TTCTAGGAAT TCGGAAATAC
266201 TCAACGCTAC GAGCTAGAGC AAAAGAATTT GGAGGAACAA TACAGACGTC
166251 ATCAGTAATA GAGATGAAGA TATCCTCAGT AAAGCATTTT GGATCAACAA
166301 CAGAGTTATA GACATTGGTG AACACTTTGA ATTCTCGAGA TAGGCGGAGG
166351 TCGTAACCAT AACTCGATAG GCCGTAACTT ATAAGTTTTT CGCCTGTCTC
166401 CTCATTTACG TTCACTTGGC CATTAACAAA GGGATGGATC ATATCGGCAT
166451 TTAGGGCCAT CTCTCGTATC CACTTATCTT CTTTTATGCT CATTTAGAAA
166501 CCTTAACAGT TTGAAATTGC TTTCTTAATG ATATTCTGTT TTTCAATTTA
166551 CTGGTTTTTG GGGGGAACTT TTCTAAGTAT AAGATAGACT TTGATTATCT
166601 CTTGAAAAGA CCAGTTGTAT AAACAAGAAA AGCCTATCCC AAAGGCTACA
166651 ATTTTATCAC GAAACCTAGA GGTAATGTTA GATAATCCTA AGGGAAAAAG
166701 GCAAACCTTA TTTTTAGGGA GAACTTCAGG TAGGTCTGCT CTTTACTCTT
166751 ATAGTAGAAG AATCTTGGTT CTCTTGAATG CATTCATGCG AGGACCTTGA
166801 TAAGAACTTC TTGGATTCAT AAAAAGATTA ACATCTCCTT ATTGATAAGC
166851 TAGAGAATTT TTACTACCAA CTTCTCAGTG GAAAATGTTT TTAAAAATAG
166901 TTCGCCATCT TTAATTTATC TGTTTTAAGA CAAAAGAAAT CTAGATCACC
166951 ACAGGAAGTT TAAATCATAA AATGAAAATG ATGGAGAGGT TCTAGTGCTC
167001 GTACTTTGGC CCTGCTCTCC TTGATAGAAA GAAGAGGTCC ATAGTGTACT
16?051 TCTATATAGT ATCTCGTGTA CTATGCCGAG TATAACCGAT CGGCGTTATC
167101 GATGAGAGTT TCAAAAAAAT ATAAAATCCA CCTAAAGAAA AAGCGATAGA
167151 GAAGGTTCGT ACATGACGCA TCAAGTAGCT GTCTTGCATC AGGATAAAAA
167201 ATTTGATGTT TCGTTAAGAC CTAAAGGGTT AGAAGAATTT TATGGACAGC
167251 ATCATTTAAA AGAACGCCTA GATCTATTTC TTTGCGCAGC ATTGCAACGA
167301 GGAGAAGTTC CAGGACATTG CTTGTTTTTT GGACCCCCAG GCTTAGGGAA
124


CA 02350775 2001-05-11
WO 00/27994 PCT/US99/26923
167351 AACCTCACTT GCTCACATCG TTGCCTACAC CGTGGGGAAA GGGCTGGTCT
167401 TGGCATCAGG GCCTCAGTTA ATCAAACCCT CGGACCTGTT AGGACTTTTA
167951 ACTAGTTTGC AAGAAGGGGA CGTGTTTTTC ATCGATGAGA TCCATCGTAT
167501 GGGGAAAGTT GCTGAGGAAT ACCTGTATTC TGCAATGGAA GATTTCAAAG
167551 TCGATATTAC TATAGATTCA GGACCCGGAG CTCGCTCGGT CCGTGTCGAT
167601 CTTGCTCCTT TCACTTTAGT GGGGGCAACG ACTCGATCAG GAATGCTAAG
167651 CGAACCTTTA AGAGCACGCT TTGCTTTTAG TGCGAGACTT TCCTATTACT
167701 CGGATCAAGA TCTAAAAGAG ATTTTAGTCC GCTCCTCACA TTTACTCGGA
167751 ATCGAAGCTG ACAGCTCCGC ATTACTAGAA ATTGCTAAGA GATCCCGAGG
167801 GACGCCACGA CTGGCAAATC ATCTTCTACG TTGGGTCAGA GATTTTGCTC
167851 AGATCCGAGA AGGAAACTGT ATCAATGGGG ACGTAGCAGA AAAAGCTTTG
167901 GCTATGCTAT TAATAGATGA TTGGGGATTG AATGAAATTG ATATCAAACT
167951 TCTCACTACA ATCATCGACT ACTACCAAGG TGGTCCCGTT GGAATTAAAA
168001 CCTTATCGGT AGCTGTGGGA GAAGATATCA AAACTCTTGA AGATGTTTAT
168051 GAACCGTTTT TAATTTTAAA AGGTTTTATC AAAAAGACTC CCAGAGGCAG
168101 AATGGTAACA CAACTTGCTT ACGACCATTT AAAAAGACAT GCAAAGAACT
168151 TATTGAGTTT AGGAGAAGGA CAGTGAAACT ATTGAAAAAC GTACTTTTAG
168201 GTCTTTTCTT CAGTATGAGT ATCTCAGGAT TCTCAGAAGT AAAGGTATCC
168251 GATACTTTTG TGAAGCAGGA TACTGTCGTT GAACCTAAAA TTCGTGTCCT
168301 TTTATCTAAT GAAAGCACCA CAGCTCTCAT AGAAGCCAAA GGTCCTTATC
168351 GCATTTATGG AGATAATGTC TTATTAGACA CAGCGATTCA AGGCCAGCGT
168401 TGCGTGGTCC ACGCTCTATA CGAAGGGATC CGTTGGGGAG AATTTTATCC
168451 CGGACTCCAG TGTTTAAAGA TCGAGCCTGT AGATGACACT GCTTCTCTTT
168501 TTTTTAACGG GATTCAGTAT CAAGGTTCCC TATACGTTCA TCGTAAAGAC
168551 AACCATTGCA TCATGGTTTC TAACGAAGTT ACAATCGAAG ATTATCTGAA
168601 ATCTGTACTT TCTATAAAGT ACCTTGAAGA GCTAGATAAA GAAGCTCTAT
168651 CTGCTTGCAT CATTCTAGAA AGAACCGCTC TATACGAAAA GCTCCTTGCA
125

CA 02350775 2001-05-11
DEMANDES OU BREVETS VOLUMINEUX
LA PRESENTS PARTIE DE CETTE DEMANDS OU CE BREVET
COMPREND PLUS D'UN TOME. - .
CECI EST LE TOME _ ~"DE c~
NOTE: Pout les tomes additionels, veuillez contacter le Bureau canadien -des
brevets
:;,:
JUMBO APPLICATIONS/PATENTS
THiS SECTION OF THE APPLlCATIONIPATENT CONTAINS MORE
THAN ONE VOLUME
THIS IS VOLUME ~ '-OF _ .
WOTE: For additiona'1 volumes please contact'the Canadian Patent Off~cE ~ -
_ .. . . . . ~ , ..'. , . _ .~. ,. 'w

Representative Drawing

Sorry, the representative drawing for patent document number 2350775 was not found.

Administrative Status

For a clearer understanding of the status of the application/patent presented on this page, the site Disclaimer , as well as the definitions for Patent , Administrative Status , Maintenance Fee  and Payment History  should be consulted.

Administrative Status

Title Date
Forecasted Issue Date Unavailable
(86) PCT Filing Date 1999-11-12
(87) PCT Publication Date 2000-05-18
(85) National Entry 2001-05-11
Dead Application 2005-11-14

Abandonment History

Abandonment Date Reason Reinstatement Date
2004-11-12 FAILURE TO PAY APPLICATION MAINTENANCE FEE
2004-11-12 FAILURE TO REQUEST EXAMINATION

Payment History

Fee Type Anniversary Year Due Date Amount Paid Paid Date
Application Fee $300.00 2001-05-11
Maintenance Fee - Application - New Act 2 2001-11-13 $100.00 2001-10-25
Registration of a document - section 124 $100.00 2002-08-02
Registration of a document - section 124 $100.00 2002-08-02
Registration of a document - section 124 $100.00 2002-08-02
Registration of a document - section 124 $100.00 2002-08-02
Maintenance Fee - Application - New Act 3 2002-11-12 $100.00 2002-10-18
Maintenance Fee - Application - New Act 4 2003-11-12 $100.00 2003-10-22
Owners on Record

Note: Records showing the ownership history in alphabetical order.

Current Owners on Record
THE REGENTS OF THE UNIVERSITY OF CALIFORNIA
Past Owners on Record
DAVIS, RONALD
KALMAN, SUE
MITCHELL, WAYNE
STEPHENS, RICHARD
Past Owners that do not appear in the "Owners on Record" listing will appear in other documentation within the application.
Documents

To view selected files, please enter reCAPTCHA code :



To view images, click a link in the Document Description column. To download the documents, select one or more checkboxes in the first column and then click the "Download Selected in PDF format (Zip Archive)" or the "Download Selected as Single PDF" button.

List of published and non-published patent-specific documents on the CPD .

If you have any difficulty accessing content, you can call the Client Service Centre at 1-866-997-1936 or send them an e-mail at CIPO Client Service Centre.


Document
Description 
Date
(yyyy-mm-dd) 
Number of pages   Size of Image (KB) 
Description 2001-05-11 250 17,131
Description 2001-05-11 80 3,621
Description 2001-11-09 300 12,402
Description 2001-11-09 300 12,332
Description 2001-11-09 300 18,388
Description 2001-11-09 250 26,555
Description 2001-11-09 120 4,937
Abstract 2001-05-11 1 49
Claims 2001-05-11 2 41
Cover Page 2001-09-10 1 32
Correspondence 2001-07-20 2 44
Assignment 2001-05-11 4 118
PCT 2001-05-11 6 275
Prosecution-Amendment 2001-05-11 1 22
Prosecution-Amendment 2001-07-19 1 46
PCT 2001-05-11 6 219
Assignment 2002-08-02 9 477
Correspondence 2001-11-09 3 76

Biological Sequence Listings

Choose a BSL submission then click the "Download BSL" button to download the file.

If you have any difficulty accessing content, you can call the Client Service Centre at 1-866-997-1936 or send them an e-mail at CIPO Client Service Centre.

Please note that files with extensions .pep and .seq that were created by CIPO as working files might be incomplete and are not to be considered official communication.

No BSL files available.