Language selection

Search

Patent 3165291 Summary

Third-party information liability

Some of the information on this Web page has been provided by external sources. The Government of Canada is not responsible for the accuracy, reliability or currency of the information supplied by external sources. Users wishing to rely upon this information should consult directly with the source of the information. Content provided by external sources is not subject to official languages, privacy and accessibility requirements.

Claims and Abstract availability

Any discrepancies in the text and image of the Claims and Abstract are due to differing posting times. Text of the Claims and Abstract are posted:

  • At the time the application is open to public inspection;
  • At the time of issue of the patent (grant).
(12) Patent Application: (11) CA 3165291
(54) English Title: SUPPRESSION OF SHADE AVOIDANCE RESPONSE IN PLANTS
(54) French Title: SUPPRESSION DE LA REPONSE D'EVITEMENT D'OMBRE DANS DES PLANTES
Status: Examination Requested
Bibliographic Data
(51) International Patent Classification (IPC):
  • C12N 15/29 (2006.01)
  • C12N 15/113 (2010.01)
  • A01H 6/46 (2018.01)
  • A01H 1/00 (2006.01)
  • A01H 5/00 (2018.01)
  • C07K 14/415 (2006.01)
  • C12N 5/04 (2006.01)
  • C12N 5/10 (2006.01)
  • C12N 15/10 (2006.01)
  • C12N 15/82 (2006.01)
(72) Inventors :
  • BATE, NICHOLAS (United States of America)
  • KIM, YONGJOO (United States of America)
  • GRAHAM, NATHANIEL (United States of America)
  • MOJICA, JULIUS (United States of America)
(73) Owners :
  • PAIRWISE PLANTS SERVICES, INC. (United States of America)
(71) Applicants :
  • PAIRWISE PLANTS SERVICES, INC. (United States of America)
(74) Agent: AIRD & MCBURNEY LP
(74) Associate agent:
(45) Issued:
(86) PCT Filing Date: 2021-01-29
(87) Open to Public Inspection: 2021-08-05
Examination requested: 2022-06-17
Availability of licence: N/A
(25) Language of filing: English

Patent Cooperation Treaty (PCT): Yes
(86) PCT Filing Number: PCT/US2021/015609
(87) International Publication Number: WO2021/155084
(85) National Entry: 2022-06-17

(30) Application Priority Data:
Application No. Country/Territory Date
62/968,596 United States of America 2020-01-31

Abstracts

English Abstract

This invention relates to compositions and methods for modifying Homeodomain- leucine zipper (HD-Zip) transcription factors in plants to suppress shade avoidance response in plants. The invention further relates to plants produced using the methods and compositions of the invention.


French Abstract

La présente invention concerne des compositions et des procédés permettant de modifier des facteurs de transcription de fermeture à glissière homéodomaine-leucine (HD-Zip) dans des plantes pour supprimer une réponse d'évitement d'ombre dans des plantes. L'invention concerne également des plantes produites à l'aide des procédés et compositions selon l'invention.

Claims

Note: Claims are shown in the official language in which they were submitted.


THAT WHICH IS CLAIMED IS:
1. A plant or part thereof comprising at least one non-natural mutation in
an endogenous
Homeodomain-leucine zipper (HD-Zip) transcription factor, wherein the mutation
disrupts
the binding of the HD-Zip transcription factor to DNA.
2. The plant or part thereof of claim 1, wherein the HD-Zip transcription
factor is a HD-
Zip Type II (HD-Zip II) transcription factor.
3. The plant or part thereof of claim 2, wherein the HD-Zip II
transcription factor is
capable of regulating response to illumination in the plant (e.g., shade
avoidance response
(SAR); e.g., is an ortholog of AtHB2, HB53, and/or HB78.
4. The plant or part thereof of claim 3, wherein the HD-Zip II
transcription factor is
HOMEOBOX PROTEIN 53 (HB53) or HOMEOBOX PROTEIN 78 (HB78).
5. The plant or part thereof of claims 1-3, wherein the plant is a monocot.
6. The plant or part thereof of claim 1-3, wherein the plant is a dicot.
7. The plant or part thereof of any of the preceding claims, wherein the
plant is corn,
soy, canola, wheat, rice, cotton, sugarcane, sugar beet, barley, oats,
alfalfa, sunflower,
safflower, oil palm, sesame, coconut, tobacco, potato, sweet potato, cassava,
coffee, apple,
plum, apricot, peach, cherry, pear, fig, banana, citrus, cocoa, avocado,
olive, almond, walnut,
strawberry, watermelon, pepper, grape, tomato, cucumber, blackberry,
raspberry, black
raspberry or a Brassica spp.
8. The plant or part thereof of claims 1-5, wherein the plant is corn,
optionally wherein
the corn plant comprises a short stature/semi-dwarf phenotype.
9. The plant or part thereof of claims 1-5, wherein the plant is wheat,
optionally wherein
the at least one non-natural mutation in a HD-Zip transcription factor is in
an endogenous
gene that encodes the HD-Zip transcription factor, the endogenous gene being
in the A
genome, the B genome, the D genome or in any combination thereof.
96

10. The plant or plant part thereof of any one of the preceding claims,
wherein the HD-
Zip transcription factor comprises:
(a) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83;
(b) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of:
RKKLRLSKDQSAVLEDSFREHPTLNPRQKAALA QQLGLRPRQVEVWFQNRRA
RTKLKQTEVDCEYLKRCCETLTEENRRLQKEVQELRALKLV SPHLYMHMSPPTTLTMC
PSCERV (SEQ ID NO:1) (Zea mays HB53) or
RKKLRLSKDQAAVLEESFKEHNTLNPKQKAALAKQLNLKPRQVEVWFQNRRA
RTKLKQTEVDCEFLKRCCETLTEENRRLQREVAELRVLKLV APHHY ARMPPPTTLTMC
PSCERL SEQ ID NO:2) (Zea mays HB78);
(c) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of
LAKQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTEENRRLQREV (SEQ
ID NO:3);
(d) a polypeptide comprising a sequence having at least 95% sequence identity
to the
nucleotide sequence of RQVEVWFQNRRARTKLKQTEVDCE (SEQ ID NO:4);
(e) a polypeptide comprising a sequence having the amino acid sequence of
RQVEVWFQNRRARTKXKQTEVDCE (SEQ ID NO:5), wherein X is L or S;
(f) a polypeptide comprising:
(i) sequence having the amino acid sequence of RKKLRLX1KX2QX3(SEQ
ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S or A;
(ii) a sequence having the amino acid sequence of PXiX2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2is T or A, X3is V or M
and X4 is Q, S or N;
(iii) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ
ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and
(iv) a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9); and/or
(g) a polypeptide comprising a sequence having the amino acid sequence of
VWFQNRRA
(SEQ ID NO:9).
97

11. The plant or plant part thereof of any one of the preceding claims,
wherein the HD-
Zip transcription factor comprises a DNA binding domain having the amino acid
sequence of
VWFQNRRA (SEQ ID NO:9).
12. The plant or plant part thereof of any one of the preceding claims,
wherein the at least
one non-natural mutation is a substitution, a deletion and/or an insertion.
13 The plant or plant part thereof of any one of the preceding claims,
wherein the at least
one non-natural mutation comprises a base substitution to an A, a T, a G, or a
C, which
results in an amino acid substitution and disrupts the binding of the HD-Zip
transcription
factor to DNA.
14. The plant or plant part thereof of any one of claims 1 to 12, wherein
at least one non-
natural mutation in an endogenous gene encoding an HD-Zip transcription factor
comprises a
deletion.
15. The plant or plant part there of claim 14, wherein the deletion
comprises a deletion of
all or a portion of the DNA binding domain (e.g., a deletion of at least 1, 2,
3, 4, 5, 6, 7, or 8
amino acid residues of SEQ ID NO:9 (VWFQNRRA) of the HD-Zip transcription
factor.
16. The plant or plant part thereof of claim 14, wherein the deletion
comprises a deletion
of about 1 amino acid residue to about 120 amino acid residues from the HD-Zip

transcription factor.
17. The plant or plant part thereof of claim 16, wherein the deletion
comprises a
truncation of about 96 amino acid residues to about 125 amino acid residues
from the C-
terminus of the HD-Zip transcription factor.
18. The plant or plant part thereof of any of the preceding claims, wherein
the at least one
non-natural mutation in an endogenous gene encoding a HD-Zip transcription
factor is a
dominant recessive mutation.
19. A plant cell comprising an editing system comprising:
(a) a CRISPR-associated effector protein; and
98

(c) a guide nucleic acid (gRNA, gDNA, crRNA, crDNA) having a spacer
sequence with complementarity to an endogenous target gene encoding a wild
type
HD-Zip transcription factor.
20. The plant cell of claim 19, wherein the wild type HD-Zip transcription
factor is a wild
type HD-Zip II transcription factor, optionally a HB53 transcription factor or
a HB78
transcription factor.
21. The plant cell of claim 19 or claim 20 wherein the HD-Zip transcription
factor
comprises:
(a) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83;
(b) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of: SEQ ID NO:1 or SEQ ID NO:2;
(c) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:3;
(d) a polypeptide comprising a sequence having at least 95% sequence identity
to the
nucleotide sequence of SEQ ID NO:4;
(e) a polypeptide comprising a sequence having the amino acid sequence of SEQ
ID
NO:5, wherein X is L or S;
(f) a polypeptide comprising:
(i) sequence having the amino acid sequence of RKKLRLX1KX2QX3(SEQ
ID NO:6), wherein Xiis S or T, X2 is D or E and X3 1S S or A;
(ii) a sequence having the amino acid sequence of PX1X2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2is T or A, X3is V or M
and X4 is Q, S or N;
(iii) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ
ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 1S V or L; and
(iv) a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9); and/or
(g) a polypeptide comprising a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9).
99

22. The plant cell of any one of claims 19 to 21, wherein the guide nucleic
acid comprises
a nucleotide sequence of any one of SEQ ID NOs:175-182.
23. A plant regenerated from the plant part of any one of claims 1 to 18 or
the plant cell
of any one of claims 19 to 22.
24. A plant cell comprising a non-natural mutation to a DNA binding site in
an
endogenous HD-Zip transcription factor gene that prevents or reduces binding
of the encoded
HD-Zip transcription factor to DNA,
wherein the non-natural mutation is a substitution, insertion and/or a
deletion that is
introduced using an editing system that comprises a nucleic acid binding
domain that binds to
a target site in the HD-Zip transcription factor gene,
wherein the HD-Zip transcription factor gene encodes:
(a) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83;
(b) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of: SEQ ID NO:1 or SEQ ID NO:2;
(c) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:3;
(d) a polypeptide comprising a sequence having at least 95% sequence identity
to the
nucleotide sequence of SEQ ID NO:4;
(e) a polypeptide comprising a sequence having the amino acid sequence of SEQ
ID
NO:5, wherein X is L or S;
(f) a polypeptide comprising:
(i) sequence having the amino acid sequence of RKKLRLX1KX2QX3(SEQ
ID NO:6), wherein Xiis S or T, X2 is D or E and X3 1S S or A;
(ii) a sequence having the amino acid sequence of PXiX2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2is T or A, X3is V or M
and X4 is Q, S or N;
(iii) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ
ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 1S V or L; and
(iv) a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9); and/or
100

(g) a polypeptide comprising a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9).
25. The plant cell of claim 24, wherein the nucleic acid binding domain of
the editing
system is from a polynucleotide-guided endonuclease, a CRESPR-Cas endonuclease
(e.g.,
CR,ISPR-Cas effector protein), a zinc finger nuclease, a transcription
activator-like effector
nuclease (TALEN) and/or an Argonaute protein.
26. A plant or part thereof comprising a mutation in an endogenous HD-Zip
transcription
factor, which mutation reduces DNA binding by the endogenous HD-Zip
transcription factor,
wherein the endogenous HD-Zip transcription factor comprises a polypeptide
comprising a
sequence having at least 80% sequence identity to the amino acid sequence of
SEQ ID NO:1
or SEQ ID NO:2 (Zea mays HB78); wherein the mutation is a deletion,
substitution, and/or
insertion of at least one amino acid residue of amino acid residues 45-52
(VWFQNRRA
(SEQ ID NO:9)) of the amino acid sequence of SEQ ID NO:1 or SEQ ID NO:2.
27. The plant or part thereof of claim 26, wherein the mutation of the at
least one amino
acid residue of amino acid residues 45-52 of the amino acid sequence of SEQ ID
NO:1 or
SEQ ID NO:2 is made following cleavage by a nuclease comprising a DNA-binding
domain
that binds to a target site within a target nucleic acid encoding a sequence
having at least 80%
sequence identity to the amino acid sequence of SEQ ID NO:1) (Zea mays HB53)
or SEQ
ID NO:2) (Zea mays HB78).
28. The plant or part thereof of any one of claims 26 or 27, wherein
mutation is an
insertion and/or a deletion.
29. The plant or part thereof of any one of claims 28, wherein the mutation
comprises a
deletion of all or a portion of a DNA binding domain of the endogenous HD-Zip
transcription
factor.
30. The plant or part thereof of any one of claims 25 to 29, wherein the
plant is corn, soy,
canola, wheat, rice, cotton, sugarcane, sugar beet, barley, oats, alfalfa,
sunflower, safflower,
oil palm , sesame, coconut, tobacco, potato, sweet potato, cassava, coffee,
apple, plum,
apricot, peach, cherry, pear, fig, banana, citrus, cocoa, avocado, olive,
almond, walnut,
101

strawberry, watermelon, pepper, grape, tomato, cucumber, blackberry,
raspberry, black
raspberry or a Brassica spp.
31. The plant or part thereof of any one of claims 1 to 18 or 26 to 30,
wherein the plant
part is a cell.
32. The plant or part thereof of any one of claims 1 to 18, 23, or 26 to
30, wherein the
plant comprises an attenuated Shade Avoidance Response.
33. The plant or part thereof of claim 26 to 32, wherein the plant is corn
and optionally
wherein the corn plant comprises a short stature/semi-dwarf phenotype.
34. A plant or plant part thereof comprising an HD-Zip transcription factor
gene that
encodes an amino acid sequence of SEQ ID NO:201.
35. A plant or part thereof comprising an HD-Zip transcription factor gene
that comprises
the nucleotide sequence of SEQ ID NO:202.
36. The plant or plant part of claim 35, wherein the plant is corn.
37. A method of providing a plurality of plants having increased yield when
each plant of
the plurality of plants is planted in close proximity to one another, the
method comprising
planting two or more plants of any one of claims 1-18, 23, or 26 to 36 in
close
proximity to one another, thereby providing a plurality of plants having
increased yield as
compared to a plurality of control plants planted in close proximity to one
another.
38. A method of producing/breeding a transgene-free genome-edited (e.g.,
base-edited)
plant, comprising:
(a) crossing the plant of any one of claims 1-18, 23, or 26 to 36 with a
transgene free
plant, thereby introducing the mutation into the plant that is transgene-free;
and
(b) selecting a progeny plant that comprises the mutation but is transgene-
free,
thereby producing a transgene free genome-edited (e.g., base-edited) plant.
102

39. A method for editing a specific site in the genome of a plant cell, the
method
comprising: cleaving, in a site-specific manner, a target site within an
endogenous HD-Zip
transcription factor gene in the plant cell, the endogenous HD-Zip
transcription factor gene
encoding:
(a) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83;
(b) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of: SEQ ID NO:1 or SEQ ID NO:2;
(c) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:3;
(d) a polypeptide comprising a sequence having at least 95% sequence identity
to the
nucleotide sequence of SEQ ID NO:4;
(e) a polypeptide comprising a sequence having the amino acid sequence of SEQ
ID
NO:5, wherein X is L or S;
(f) a polypeptide comprising:
(i) sequence having the amino acid sequence of RKKLRLX1KX2QX3(SEQ
ID NO:6), wherein Xiis S or T, X2 is D or E and X3 1S S or A;
(ii) a sequence having the amino acid sequence of PXiX2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2is T or A, X3is V or M
and X4 is Q, S or N;
(iii) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ
ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 1S V or L; and
(iv) a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9); and/or
(g) a polypeptide comprising a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9), thereby generating an edit in the endogenous HD-Zip
transcription factor gene of the plant cell.
40. The method of claim 39, further comprising regenerating a plant from
the plant cell
comprising the edit in the endogenous HD-Zip transcription factor gene to
produce a plant
comprising the edit in its endogenous HD-Zip transcription factor gene.
103

41. The method of claim 40, wherein the plant comprising the edit in its
endogenous HD-
Zip transcription factor gene has an attenuated Shade Avoidance Response
compared to a
control plant that does not comprise the edit.
42. The method of any one of claims 39-41, wherein the edit results in a
non-naturally
occurring mutation in the endogenous HD-Zip transcription factor gene that
produces an HD-
Zip transcription factor with reduced DNA binding.
43. A method for making a plant, comprising:
(a) contacting a population of plant cells that comprising a wild-type
endogenous gene
encoding an HD-Zip transcription factor with a nuclease targeted to the wild-
type
endogenous gene, wherein the nuclease is linked to a DNA binding domain that
binds to a
nucleic acid sequence encoding:
(i) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83;
(ii) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of: SEQ ID NO:1 or SEQ ID NO:2;
(iii) a polypeptide comprising a sequence having at least 80% sequence
identity to the
amino acid sequence of SEQ ID NO:3;
(iv) a polypeptide comprising a sequence having at least 95% sequence identity
to the
nucleotide sequence of SEQ ID NO:4;
(v) a polypeptide comprising a sequence having the amino acid sequence of SEQ
ID
NO:5, wherein X is L or S;
(vi) a polypeptide comprising:
(1) a sequence having the amino acid sequence of RKKLRLX1KX2QX3(SEQ
ID NO:6), wherein Xiis S or T, X2 is D or E and X3 1S S or A;
(2) a sequence having the amino acid sequence of PX1X2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2is T or A, X3is V
or M and X4 is Q, S or N;
(3) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ ID
NO:7), wherein Xi is Q or H, X2 is R or K and X3 1S V or L; and
(4) a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9); and/or
104

(vii) a polypeptide comprising a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9);
(b) selecting a plant cell from said population comprising a mutation in the
wild-type
endogenous gene encoding an HD-Zip transcription factor, wherein the mutation
is a
substitution and/or a deletion of at least one amino acid residue in the
polypeptide of any one
of (i)-(v), wherein the mutation reduces or eliminates the ability of the HD-
Zip transcription
factor to bind DNA; and
(c) growing the selected plant cell into a plant.
44. A method for reducing a Shade Avoidance Response in a plant, comprising
(a) contacting a plant cell comprising a wild-type endogenous gene encoding an
HD-
Zip transcription factor with a nuclease targeted to the wild-type endogenous
gene, wherein
the nuclease is linked to a DNA binding domain that binds to a target site in
the wild type
endogenous gene, the wild type endogenous gene encoding:
(i) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83;
(ii) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of: SEQ ID NO:1 or SEQ ID NO:2;
(iii) a polypeptide comprising a sequence having at least 80% sequence
identity to the
amino acid sequence of SEQ ID NO:3;
(iv) a polypeptide comprising a sequence having at least 95% sequence identity
to the
nucleotide sequence of SEQ ID NO:4;
(v) a polypeptide comprising a sequence having the amino acid sequence of SEQ
ID
NO:5, wherein X is L or S;
(vi) a polypeptide comprising:
(1) a sequence having the amino acid sequence of RKKLRLX1KX2QX3(SEQ
ID NO:6), wherein Xiis S or T, X2 is D or E and X3 1S S or A;
(2) a sequence having the amino acid sequence of PXiX2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2is T or A, X3is V
or M and X4 is Q, S or N;
(3) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ ID
NO:7), wherein Xi is Q or H, X2 is R or K and X3 1S V or L; and
(4) a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9); and/or
105

(vii) a polypeptide comprising a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9), thereby producing a plant cell comprising a mutation
in the
wild-type endogenous gene encoding an HD-Zip transcription factor; and
(b) growing the plant cell into a plant, thereby reducing the Shade Avoidance
Response in the plant.
45. A method for producing a plant or part thereof comprising at least one
cell having a
mutation in an endogenous HD-Zip transcription factor gene, the method
comprising
contacting a target site in an endogenous HD-Zip transcription factor gene in
the plant
or plant part with a nuclease comprising a cleavage domain and a DNA-binding
domain,
wherein the DNA binding domain binds to a target site in the endogenous HD-Zip

transcription factor gene,
wherein the endogenous HD-Zip transcription factor gene encodes:
(a) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83;
(b) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of: SEQ ID NO:1 or SEQ ID NO:2;
(c) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:3;
(d) a polypeptide comprising a sequence having at least 95% sequence identity
to the
nucleotide sequence of SEQ ID NO:4;
(e) a polypeptide comprising a sequence having the amino acid sequence of SEQ
ID
NO:5, wherein X is L or S;
(f) a polypeptide comprising:
(i) sequence having the amino acid sequence of RKKLRLX1KX2QX3(SEQ
ID NO:6), wherein Xiis S or T, X2 is D or E and X3 1S S or A;
(ii) a sequence having the amino acid sequence of PXiX2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2is T or A, X3is V or M
and X4 is Q, S or N;
(iii) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ
ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 1S V or L; and
(iv) a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9); and/or
106

(g) a polypeptide comprising a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9), thereby producing a plant or part thereof comprising
at least
one cell having a mutation in the endogenous HD-Zip transcription factor gene.
46. The method of claim 45, wherein the mutated endogenous HD-Zip
transcription factor
gene produces an HD-Zip transcription factor having reduced binding of DNA.
47. A method of producing a plant or part thereof comprising a mutated
endogenous HD-
Zip transcription factor having reduced DNA binding, the method comprising
contacting a
target site in an endogenous HD-Zip transcription factor gene in the plant or
plant part with a
nuclease comprising a cleavage domain and a DNA-binding domain, wherein the
DNA
binding domain binds to a target site in the HD-Zip transcription factor gene,
wherein the HD-Zip transcription factor gene encodes:
(a) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83;
(b) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of: SEQ ID NO:1 or SEQ ID NO:2;
(c) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:3;
(d) a polypeptide comprising a sequence having at least 95% sequence identity
to the
nucleotide sequence of SEQ ID NO:4;
(e) a polypeptide comprising a sequence having the amino acid sequence of SEQ
ID
NO:5, wherein X is L or S;
(f) a polypeptide comprising:
(i) sequence having the amino acid sequence of RKKLRLX1KX2QX3(SEQ
ID NO:6), wherein Xiis S or T, X2 is D or E and X3 1S S or A;
(ii) a sequence having the amino acid sequence of PXiX2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2is T or A, X3is V or M
and X4 is Q, S or N;
(iii) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ
ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 1S V or L; and
(iv) a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9); and/or
107

(g) a polypeptide comprising a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9), thereby producing a plant or part thereof having a
mutated
endogenous HD-Zip transcription factor having reduced DNA binding.
48. The method of any one of claims 45 to 47, wherein the amino acid
sequence of any
one of SEQ ID NO: 38, SEQ ID NO:83 comprises the amino acid sequence of
VWFQNRRA (SEQ ID NO:9), and the mutated endogenous HD-Zip transcription factor

comprises a mutation in the amino acid sequence of VWFQNRRA (SEQ ID NO:9).
49. The method of any one of claims 45 to 48, wherein the plant exhibits a
reduced Shade
Avoidance Response as compared to a control plant.
50. The method of any one of claims 45 to 49, wherein the plant comprising
a reduced
Shade Avoidance Response comprises at least one of the following phenotypes of
increased
yield, decreased height, decreased shoot:root ratio, decreased leaf length;
increased
mechanical strength of stems; reduced lodging rate; delayed senescence;
increased
photosynthesis efficiency and grain filling; and/or enhanced defense responses
against
pathogens and herbivores when planted in close proximity with one or more
plants
comprising a reduced Shade Avoidance Response as compared to a plant that does
not
comprise a reduced Shade Avoidance Response that is planted in close proximity
with one or
more plants that do not comprise a reduced Shade Avoidance Response.
51. The method of any one of claims 45 to 50, wherein the nuclease cleaves
the
endogenous HD-Zip transcription factor gene and a mutation is introduced into
the DNA
binding site of the endogenous HD-Zip transcription factor encoded by the
endogenous HD-
Zip transcription factor gene.
52. The method of any one of claims 45 to 51, wherein the mutation is a non-
natural
mutation.
53. The method of any one of claims 45 to 52, wherein the mutation is a
substitution, an
insertion and/or a deletion.
108

54. The method of any one of claims 45 to 53, wherein the nuclease is a zinc
finger nuclease,
transcription activator-like effector nucleases (TALEN), endonuclease (e.g.,
Fokl) or a
CRISPR-Cas effector protein.
55. The method of any one of claims 45 to 54, wherein the HD-Zip
transcription factor is
a HD-Zip Type II (HD-Zip II) transcription factor.
56. The method of claim 55, wherein the HD-Zip II transcription factor is
capable of
regulating response to illumination in the plant (e.g., shade avoidance
response (SAR); e.g., is
an ortholog of AtHB2, HB53, and/or HB78.
57. The method of claim 55, wherein the HD-Zip II transcription factor is
HOMEOBOX
PROTEIN 53 (HB53) or HOMEOBOX PROTEIN 78 (HB 78) .
58. A guide nucleic acid that binds to a target site in a HD-Zip
transcription factor gene,
the target site comprising a nucleotide sequence encoding:
(a) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:1 or SEQ ID NO:2;
(b) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:3;
(c) a polypeptide comprising a sequence having at least 95% sequence identity
to the
nucleotide sequence of SEQ ID NO:4;
(d) a polypeptide comprising a sequence having the amino acid sequence of SEQ
ID
NO:5, wherein X is L or S;
(e) a polypeptide comprising:
(i) a sequence having the amino acid sequence of RKKLRLX1KX2QX3 (SEQ
ID NO:6), wherein Xiis S or T, X2 is D or E and X3 1S S or A;
(ii) a sequence having the amino acid sequence of PXiX2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V or M
and X4 is Q, S or N;
(iii) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ
ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 1S V or L; and
a sequence having the amino acid sequence of VWFQNRRA (SEQ ID NO:9) and/or
109

(f) a polypeptide comprising a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9).
59. The guide nucleic acid of claim 58, wherein the guide nucleic acid
comprises a spacer
having the nucleotide sequence of any one of SEQ ID NOs:175-182.
60. The guide nucleic acid of claim 58 of claim 59, wherein the HD-Zip
transcription
factor is a HD-Zip Type II (HD-Zip II) transcription factor, optionally
wherein the HD-Zip II
transcription factor is HB53 or HB78.
61. A system comprising the guide nucleic acid of any one of claims 58 to
60 and a
CRISPR-Cas effector protein that associates with the guide nucleic acid.
62. The system of claim 61, further comprising a tracr nucleic acid that
associates with
the guide nucleic acid and a CRISPR-Cas effector protein, optionally wherein
the tracr
nucleic acid and the guide nucleic acid are covalently linked.
63. A gene editing system comprising a CRISPR-Cas effector protein in
association with
a guide nucleic acid, wherein the guide nucleic acid comprises a spacer
sequence that binds to
a HD-Zip transcription factor gene.
64. The gene editing system of claim 63, wherein the HD-Zip transcription
factor gene
encodes:
(a) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:1 or SEQ ID NO:2;
(b) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:3;
(c) a polypeptide comprising a sequence having at least 95% sequence identity
to the
nucleotide sequence of SEQ ID NO:4;
(d) a polypeptide comprising a sequence having the amino acid sequence of SEQ
ID
NO:5, wherein X is L or S;
(e) a polypeptide comprising:
(i) a sequence having the amino acid sequence of RKKLRLX1KX2QX3(SEQ
ID NO:6), wherein Xlis S or T, X2 is D or E and X3 is S or A;
110

(ii) a sequence having the amino acid sequence of PX1X2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2is T or A, X3is V or M
and X4 1S Q, S or N;
(iii) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ
ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 1S V or L; and
a sequence having the amino acid sequence of VWFQNRRA (SEQ ID NO:9) and/or
(f) a polypeptide comprising a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9).
65. The gene editing system of claim 63 or claim 64, wherein the guide
nucleic acid
comprises a spacer sequence having a nucleotide sequence of any one of SEQ ID
NOs:175-
182.
66. The gene editing system of any one of claims 63 to 65, further
comprising a tracr
nucleic acid that associates with the guide nucleic acid and a CRISPR-Cas
effector protein,
optionally wherein the tracr nucleic acid and the guide nucleic acid are
covalently linked.
67. The gene editing system of any one of claims 63 to 66, wherein the HD-
Zip
transcription factor is a HD-Zip Type II (HD-Zip II) transcription factor,
optionally wherein
the HD-Zip II transcription factor is HB53 or HB78.
68. A complex comprising a CRISPR-Cas effector protein comprising a
cleavage domain
and a guide nucleic acid, wherein the guide nucleic acid binds to a target
site in a HD-Zip
transcription factor gene encoding
(a) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:1 or SEQ ID NO:2;
(b) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:3;
(c) a polypeptide comprising a sequence having at least 95% sequence identity
to the
nucleotide sequence of SEQ ID NO:4;
(d) a polypeptide comprising a sequence having the amino acid sequence of SEQ
ID
NO:5, wherein X is L or S;
(e) a polypeptide comprising:
111

(i) a sequence having the amino acid sequence of RKKLRLX1KX2QX3 (SEQ
ID NO:6), wherein Xiis S or T, X2 is D or E and X3 1S S or A;
(ii) a sequence having the amino acid sequence of PXiX2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2is T or A, X3is V or M
and X4 is Q, S or N;
(iii) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ
ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 1S V or L; and
a sequence having the amino acid sequence of VWFQNRRA (SEQ ID NO:9) and/or
(f) a polypeptide comprising a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9), wherein the cleavage domain cleaves a target strand in
the
HD-Zip transcription factor gene.
69. An expression cassette comprising a (a) polynucleotide encoding CRISPR-
Cas
effector protein comprising a cleavage domain and (b) a guide nucleic acid
that binds to a
target site in a HD-Zip transcription factor gene, wherein the guide nucleic
acid comprises a
spacer sequence that is complementary to and binds to a nucleotide sequence
encoding:
(a) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:1 or SEQ ID NO:2;
(b) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:3;
(c) a polypeptide comprising a sequence having at least 95% sequence identity
to the
nucleotide sequence of SEQ ID NO:4;
(d) a polypeptide comprising a sequence having the amino acid sequence of SEQ
ID
NO:5, wherein X is L or S;
(e) a polypeptide comprising:
(i) a sequence having the amino acid sequence of RKKLRLX1KX2QX3 (SEQ
ID NO:6), wherein Xi is S or T, X2 is D or E and X3 1S S or A;
(ii) a sequence having the amino acid sequence of PX1X2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2is T or A, X3is V or M
and X4 is Q, S or N;
(iii) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ
ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 1S V or L; and
a sequence having the amino acid sequence of VWFQNRRA (SEQ ID NO:9) and/or
112

(f) a polypeptide comprising a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9).
70. The complex of claim 68 or the expression cassette of claim 69, wherein
the HD-Zip
transcription factor is a HD-Zip Type II (HD-Zip II) transcription factor,
optionally wherein
the HD-Zip II transcription factor is HB53 or HB78.
71. A nucleic acid encoding HD-Zip transcription factor having a mutated
DNA binding
site, wherein the mutated DNA binding site comprises a mutation that disrupts
DNA binding.
72. The nucleic acid of claim 71, wherein the mutation eliminates the
binding of the HD-
Zip transcription factor to DNA.
73. The nucleic acid of claim 71 or claim 72, wherein the mutation reduces
the ability of
the HD-Zip transcription factor to bind to DNA by at least 75%.
74. The nucleic acid of any one of claims 71 to 73, wherein the HD-Zip
transcription
factor is a HD-Zip Type II (HD-Zip II) transcription factor, optionally
wherein the HD-Zip II
transcription factor is HB53 or HB78.
75. A plant or part thereof comprising the nucleic acid of any one of
claims 71 to 74.
76. A corn plant or part thereof comprising the nucleic acid of any one of
claims 71 to 74,
optionally wherein the corn plant comprises a short stature/semi-dwarf
phenotype.
77. A wheat plant or part thereof comprising the nucleic acid of any one of
claims 71 to
74.
78. The wheat plant or part thereof of claim 77, wherein the nucleic acid
is comprised in
the A genome, the B genome, the D genome or in any combination thereof
79. The plant of claim 75, the corn plant of claim 76, or the wheat plant
of claim 77 or
claim 78, comprising increased yield, decreased height, decreased shoot:root
ratio, decreased
leaf length; increased mechanical strength of stems; reduced lodging rate;
delayed
113

senescence; increased photosynthesis efficiency and grain filling; and/or
enhanced defense
responses against pathogens and herbivores when planted in close proximity
with one or
more plants, corn plants, and/or wheat plants comprising a reduced Shade
Avoidance
Response as compared to a plant that does not comprise a reduced Shade
Avoidance
Response that is planted in close proximity with one or more plants that do
not comprise a
reduced Shade Avoidance Response.
114

Description

Note: Descriptions are shown in the official language in which they were submitted.


CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
SUPPRESSION OF SHADE AVOIDANCE RESPONSE IN PLANTS
STATEMENT REGARDING ELECTRONIC FILING OF A SEQUENCE LISTING
A Sequence Listing in ASCII text format, submitted under 37 C.F.R. 1.821,
entitled
1499.17.WO 5T25.txt, 465,868 bytes in size, generated on January 28, 2021 and
filed via
EFS-Web, is provided in lieu of a paper copy. This Sequence Listing is hereby
incorporated
herein by reference into the specification for its disclosures.
STATEMENT OF PRIORITY
This application claims the benefit, under 35 U.S.C. 119 (e), of U.S.
Provisional
Application No. 62/968,596 filed on January 31, 2020, the entire contents of
which is
incorporated by reference herein.
FIELD OF THE INVENTION
This invention relates to compositions and methods for modifying Homeodomain-
leucine zipper (HD-Zip) transcription factors to suppress shade avoidance
response in plants.
The invention further relates to plants produced using the methods and
compositions of the
invention.
BACKGROUND OF THE INVENTION
Shade avoidance response (SAR) is a response to a decrease in the quality or
quantity
of available light (Kebrom and Brutnell, J Exp Bot 58:3079-3089 (2007)) in
which a plant
attempts to outcompete neighboring plants by growing toward resources
(primarily light).
Overcrowding of plants can lead to shade avoidance syndrome (SAS) where plants
lack vigor
and decreased yield. Shade avoidance relates to the relative proportion of red
light to far-red
light that is present in a plant's environment (Ballare et al. Science,
247:329-332 (1990)).
Plants absorb most of the red light available to them, but reflect far-red
light, including
reflecting this light on nearby plants. When a plant detects consistent far-
red light in its
environment, it will undergo a morphological and physiological response. These
responses
.. can include reduced branching, increased plant height, decreased leaf blade
area,
redistribution of auxin, enhanced ethylene production and acceleration of
flowering. SAS is
characterized by increased root/shoot ratio, increased plant height, and
reduced individual
plant yield. In a typical monoculture crop setting, interplant competition
through shade
avoidance is considered a wasteful survival mechanism.
1

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
SUMMARY OF THE INVENTION
One aspect of the invention provides a plant or plant part thereof comprising
at least
one non-natural mutation in an endogenous Homeodomain-leucine zipper (HD-Zip)
transcription factor, wherein the mutation disrupts the binding of the HD-Zip
transcription
factor to DNA.
Another aspect of the invention provides a plant cell comprising an editing
system,
the editing system comprising: (a) a CRISPR-associated effector protein; and
(b) a guide
nucleic acid (gRNA, gDNA, crRNA, crDNA) having a spacer sequence with
.. complementarity to an endogenous target gene encoding an HD-Zip
transcription factor.
A further aspect of the invention provides a plant cell comprising a non-
natural
mutation within a DNA binding site of an HD-Zip transcription factor gene that
prevents or
reduces binding of the HD-Zip transcription factor to DNA, wherein the
mutation is a
substitution, insertion and/or a deletion that is introduced using an editing
system that
comprises a nucleic acid binding domain that binds to a target site in the HD-
Zip
transcription factor gene, wherein the HD-Zip transcription factor gene
encodes: (a) a
polypeptide comprising a sequence having at least 80% sequence identity to the
amino acid
sequence of SEQ ID NO:38 or SEQ ID NO:83; (b) a polypeptide comprising a
sequence
having at least 80% sequence identity to the amino acid sequence of:
RKKLRLSKDQSAVLEDSFREHPTLNPRQKAALA QQLGLRPRQVEVWFQNRRARTKLK
QTEVDCEYLKRCCETLTEENRRLQKEVQELRALKLV SPHLYMHMSPPTTLTMCP SCER
V (SEQ ID NO:!) (Zea mays HB53) or
RKKLRLSKDQAAVLEESFKEHNTLNPKQKAALAKQLNLKPRQVEVWFQNRRARTKL
KQTEVDCEFLKRCCETLTEENRRLQREVAELRVLKLV APHRY ARMPPPTTLTMCP SCE
RL (SEQ ID NO:2) (Zea mays HB78); (c) a polypeptide comprising a sequence
having at
least 80% sequence identity to the amino acid sequence of
LAKQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTEENRRLQREV (SEQ
ID NO:3); (d) a polypeptide comprising a sequence having at least 95% sequence
identity to
the nucleotide sequence of RQVEVWFQNRRARTKLKQTEVDCE (SEQ ID NO:4); (e) a
polypeptide comprising a sequence having the amino acid sequence of
RQVEVWFQNRRARTKXKQTEVDCE (SEQ ID NO:5), wherein X is L or S; (f) a
polypeptide comprising: (i) sequence having the amino acid sequence of
KKLRLX1KX2QX3
(SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S or A; (ii) a
sequence having
the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P
or A,
2

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a sequence having the
amino acid
sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H, X2 is R or K
and X3 is
V or L; and (iv) a sequence having the amino acid sequence of VWFQNRRA (SEQ ID

NO:9); and/or (g) a polypeptide comprising a sequence having the amino acid
sequence of
VWFQNRRA (SEQ ID NO:9).
Another aspect of the invention provides a plant or part thereof comprising a
mutation
(e.g., at least one mutation) in an endogenous HD-Zip transcription factor,
which mutation
reduces DNA binding by the endogenous HD-Zip transcription factor, wherein the

endogenous HD-Zip transcription factor comprises a polypeptide comprising a
sequence
having at least 80% sequence identity to the amino acid sequence of SEQ ID
NO:! or SEQ
ID NO:2; wherein the mutation is a deletion, substitution, and/or insertion of
at least one
amino acid residue of amino acid residues 45-52 (VWFQNRRA (SEQ ID NO:9)) of
the
amino acid sequence of SEQ ID NO:! or SEQ ID NO:2, optionally wherein the
mutation is
a non-natural mutation.
Another aspect of the invention provides a plant or part thereof comprising an
HD-Zip
transcription factor gene that encodes an amino acid sequence of SEQ ID
NO:201.
A further aspect of the invention provides a corn plant comprising an HD-Zip
transcription factor gene that comprises an amino acid sequence of SEQ ID
NO:201.
Another aspect of the invention provides a plant or part thereof comprising an
HD-Zip
transcription factor gene that comprises the nucleotide sequence of SEQ ID
NO:202.
A further aspect of the invention provides a corn plant comprising an HD-Zip
transcription factor gene that comprises the nucleotide sequence of SEQ ID
NO:202.
The invention further provides a method of producing/breeding a transgene-free
genome-edited plant, comprising: (a) crossing a plant of the invention with a
transgene free
plant, thereby introducing the mutation present in the plant of the invention
into the plant that
is transgene-free; and (b) selecting a progeny plant that comprises the
mutation but is
transgene-free, thereby producing a transgene free genome-edited plant.
Another aspect of the invention provides a method for editing a specific site
in the
genome of a plant cell, the method comprising: cleaving, in a site specific
manner, a target
site within an endogenous HD-Zip transcription factor gene in the plant cell,
the endogenous
HD-Zip transcription factor gene encoding: (a) a polypeptide comprising a
sequence having
at least 80% sequence identity to the amino acid sequence of SEQ ID NO:38 or
SEQ ID
NO:83; (b) a polypeptide comprising a sequence having at least 80% sequence
identity to the
amino acid sequence of: SEQ ID NO:! or SEQ ID NO:2; (c) a polypeptide
comprising a
3

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
sequence having at least 80% sequence identity to the amino acid sequence of
SEQ ID
NO:3; (d) a polypeptide comprising a sequence having at least 95% sequence
identity to the
nucleotide sequence of SEQ ID NO:4; (e) a polypeptide comprising a sequence
having the
amino acid sequence of SEQ ID NO:5, wherein X is L or S; (f) a polypeptide
comprising: (i)
sequence having the amino acid sequence of RKKLRLX1KX2QX3 (SEQ ID NO:6),
wherein
Xi is S or T, X2 is D or E and X3 is S or A; (ii) a sequence having the amino
acid sequence of
PX1X2 X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V
or M
and X4is Q, S or N; (iii) a sequence having the amino acid sequence of
ENRRLX1X2EX3,
(SEQ ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and (iv) a
sequence
having the amino acid sequence of VWFQNRRA (SEQ ID NO:9); and/or (g) a
polypeptide
comprising a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9),
thereby generating an edit in the endogenous HD-Zip transcription factor gene
of the plant
cell.
An additional aspect of the invention provides a method for making a plant,
comprising: (a) contacting a population of plant cells that comprising a wild-
type endogenous
gene encoding an HD-Zip transcription factor with a nuclease targeted to the
wild-type
endogenous gene, wherein the nuclease is linked to a DNA binding domain that
binds to a
nucleic acid sequence encoding: (i) a polypeptide comprising a sequence having
at least 80%
sequence identity to the amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83;
(ii) a
polypeptide comprising a sequence having at least 80% sequence identity to the
amino acid
sequence of: SEQ ID NO:! or SEQ ID NO:2; (iii) a polypeptide comprising a
sequence
having at least 80% sequence identity to the amino acid sequence of SEQ ID
NO:3; (iv) a
polypeptide comprising a sequence having at least 95% sequence identity to the
nucleotide
sequence of SEQ ID NO:4; (v) a polypeptide comprising a sequence having the
amino acid
sequence of SEQ ID NO:5, wherein X is L or S; (vi) a polypeptide comprising:
(1) a
sequence having the amino acid sequence of RKKLRLX1KX2QX3(SEQ ID NO:6),
wherein
Xi is S or T, X2 is D or E and X3 is S or A; (2) a sequence having the amino
acid sequence of
PX1X2 X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V
or M
and X4is Q, S or N; (3) a sequence having the amino acid sequence of
ENRRLX1X2EX3,
(SEQ ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and (4) a
sequence
having the amino acid sequence of VWFQNRRA (SEQ ID NO:9); and/or (vii) a
polypeptide
comprising a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9);
(b) selecting a plant cell from said population comprising a mutation in the
wild-type
endogenous gene encoding an HD-Zip transcription factor, wherein the mutation
is a
4

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
substitution and/or a deletion of at least one amino acid residue in the
polypeptide of any one
of (i)-(v), wherein the mutation reduces or eliminates the ability of the HD-
Zip transcription
factor to bind DNA; and (c) growing the selected plant cell into a plant.
In some aspects, the invention provides a method for reducing a Shade
Avoidance
Response in a plant, comprising (a) contacting a plant cell comprising a wild-
type
endogenous gene encoding an HD-Zip transcription factor with a nuclease
targeted to the
wild-type endogenous gene, wherein the nuclease is linked to a DNA binding
domain that
binds to a target site in the wild type endogenous gene, the wild type
endogenous gene
encoding: (i) a polypeptide comprising a sequence having at least 80% sequence
identity to
the amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83; (ii) a polypeptide
comprising
a sequence having at least 80% sequence identity to the amino acid sequence
of: SEQ ID
NO:! or SEQ ID NO:2; (iii) a polypeptide comprising a sequence having at least
80%
sequence identity to the amino acid sequence of SEQ ID NO:3; (iv) a
polypeptide
comprising a sequence having at least 95% sequence identity to the nucleotide
sequence of
SEQ ID NO:4; (v) a polypeptide comprising a sequence having the amino acid
sequence of
SEQ ID NO:5, wherein X is L or S; (vi) a polypeptide comprising: (1) a
sequence having the
amino acid sequence of RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S or T, X2
is D
or E and X3 is S or A; (2) a sequence having the amino acid sequence of PX1X2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V or M
and X4 is
Q, S or N; (3) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ
ID
NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and (4) a sequence
having the
amino acid sequence of VWFQNRRA (SEQ ID NO:9); and/or (vii) a polypeptide
comprising a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9),
thereby producing a plant cell comprising a mutation in the wild-type
endogenous gene
encoding an HD-Zip transcription factor; and (b) growing the plant cell into a
plant, thereby
reducing the Shade Avoidance Response.
In another aspect, a method for producing a plant or part thereof comprising
at least
one cell having a mutated endogenous HD-Zip transcription factor gene is
provided, the
method comprising contacting a target site in the HD-Zip transcription factor
gene in the
plant or plant part with a nuclease comprising a cleavage domain and a DNA-
binding
domain, wherein the DNA binding domain binds to a target site in the HD-Zip
transcription
factor gene, wherein the HD-Zip transcription factor gene encodes: (a) a
polypeptide
comprising a sequence having at least 80% sequence identity to the amino acid
sequence of
SEQ ID NO:38 or SEQ ID NO:83; (b) a polypeptide comprising a sequence having
at least
5

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
80% sequence identity to the amino acid sequence of: SEQ ID NO:! or SEQ ID
NO:2; (c) a
polypeptide comprising a sequence having at least 80% sequence identity to the
amino acid
sequence of SEQ ID NO:3; (d) a polypeptide comprising a sequence having at
least 95%
sequence identity to the nucleotide sequence of SEQ ID NO:4; (e) a polypeptide
comprising
a sequence having the amino acid sequence of SEQ ID NO:5, wherein X is L or S;
(f) a
polypeptide comprising: (i) a sequence having the amino acid sequence of
RKKLRLX1KX2QX3 (SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S
or A;
(ii) a sequence having the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID
NO:8),
wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a
sequence having
the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H,
X2 is
R or K and X3 is V or L; and (iv) a sequence having the amino acid sequence of

VWFQNRRA (SEQ ID NO:9); and/or (g) a polypeptide comprising a sequence having
the
amino acid sequence of VWFQNRRA (SEQ ID NO:9), thereby producing a plant or
part
thereof comprising at least one cell having a mutation in the endogenous HD-
Zip
transcription factor gene.
In a further aspect, a method of producing a plant or part thereof comprising
a
mutated endogenous HD-Zip transcription factor having reduced DNA binding is
provided,
the method comprising contacting a target site in an endogenous HD-Zip
transcription factor
gene in the plant or plant part with a nuclease comprising a cleavage domain
and a DNA-
binding domain, wherein the DNA binding domain binds to a target site in the
HD-Zip
transcription factor gene wherein the HD-Zip transcription factor gene
encodes: (a) a
polypeptide comprising a sequence having at least 80% sequence identity to the
amino acid
sequence of SEQ ID NO:38 or SEQ ID NO:83; (b) a polypeptide comprising a
sequence
having at least 80% sequence identity to the amino acid sequence of: SEQ ID
NO:! or SEQ
ID NO:2; (c) a polypeptide comprising a sequence having at least 80% sequence
identity to
the amino acid sequence of SEQ ID NO:3; (d) a polypeptide comprising a
sequence having
at least 95% sequence identity to the nucleotide sequence of SEQ ID NO:4; (e)
a polypeptide
comprising a sequence having the amino acid sequence of SEQ ID NO:5, wherein X
is L or
S; (f) a polypeptide comprising: (i) sequence having the amino acid sequence
of
RKKLRLX1KX2QX3 (SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S
or A;
(ii) a sequence having the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID
NO:8),
wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a
sequence having
the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H,
X2 is
R or K and X3 is V or L; and (iv) a sequence having the amino acid sequence of
6

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
VWFQNRRA (SEQ ID NO:9); and/or (g) a polypeptide comprising a sequence having
the
amino acid sequence of VWFQNRRA (SEQ ID NO:9), thereby producing a plant or
part
thereof having a mutated endogenous HD-Zip transcription factor having reduced
DNA
binding.
An additional aspect of the invention provides a guide nucleic acid that binds
to a
target site in a HD-Zip transcription factor gene, the target site comprising
a nucleotide
sequence encoding: (a) a polypeptide comprising a sequence having at least 80%
sequence
identity to the amino acid sequence of SEQ ID NO:! or SEQ ID NO:2; (b) a
polypeptide
comprising a sequence having at least 80% sequence identity to the amino acid
sequence of
SEQ ID NO:3; (c) a polypeptide comprising a sequence having at least 95%
sequence
identity to the nucleotide sequence of SEQ ID NO:4; (d) a polypeptide
comprising a
sequence having the amino acid sequence of SEQ ID NO:5, wherein X is L or S;
(e) a
polypeptide comprising: (i) a sequence having the amino acid sequence of
RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S or
A;
(ii) a sequence having the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID
NO:8),
wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a
sequence having
the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H,
X2 is
R or K and X3 is V or L; and a sequence having the amino acid sequence of
VWFQNRRA
(SEQ ID NO:9) and/or (f) a polypeptide comprising a sequence having the amino
acid
sequence of VWFQNRRA (SEQ ID NO:9).
A further aspect of the invention provides a system comprising the guide
nucleic acid
of the invention and a CRISPR-Cas effector protein that associates with the
guide nucleic
acid.
Another aspect of the invention provides gene editing system comprising a
CRISPR-
.. Cas effector protein in association with a guide nucleic acid, wherein the
guide nucleic acid
comprises a spacer sequence that binds to a HD-Zip transcription factor gene.
An additional aspect of the invention provides a complex comprising a CRISPR-
Cas
effector protein comprising a cleavage domain and a guide nucleic acid,
wherein the guide
nucleic acid binds to a target site in a HD-Zip transcription factor gene
encoding (a) a
polypeptide comprising a sequence having at least 80% sequence identity to the
amino acid
sequence of SEQ ID NO:! or SEQ ID NO:2; (b) a polypeptide comprising a
sequence
having at least 80% sequence identity to the amino acid sequence of SEQ ID
NO:3; (c) a
polypeptide comprising a sequence having at least 95% sequence identity to the
nucleotide
sequence of SEQ ID NO:4; (d) a polypeptide comprising a sequence having the
amino acid
7

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
sequence of SEQ ID NO:5, wherein X is L or S; (e) a polypeptide comprising:
(i) a sequence
having the amino acid sequence of RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S
or
T, X2 is D or E and X3 is S or A; (ii) a sequence having the amino acid
sequence of PX1X2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V or M
and X4 is
Q, S or N; (iii) a sequence having the amino acid sequence of ENRRLX1X2EX3,
(SEQ ID
NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and a sequence
having the amino
acid sequence of VWFQNRRA (SEQ ID NO:9) and/or (f) a polypeptide comprising a
sequence having the amino acid sequence of VWFQNRRA (SEQ ID NO:9), wherein the

cleavage domain cleaves a target strand in the HD-Zip transcription factor
gene
A further aspect of the invention provides a nucleic acid encoding HD-Zip
transcription factor having a mutated DNA binding site, wherein the mutated
DNA binding
site comprises a mutation that disrupts DNA binding.
Further provided are plants comprising in their genome one or more mutated HD-
Zip
transcription factors that have reduced ability to bind to DNA that are
produced by the
methods of the invention as well as polypeptides, polynucleotides, nucleic
acid constructs,
expression cassettes and vectors for making a plant of this invention.
These and other aspects of the invention are set forth in more detail in the
description
of the invention below.
BRIEF DESCRIPTION OF THE SEQUENCES
SEQ ID NO:1 shows a portion of the Zea mays HB 53 transcription factor (amino
acid residues 173-288 of SEQ ID NO:38).
SEQ ID NO:2 shows a portion of the Zea mays HB 78 transcription factor (amino
acid residues 76-191 of SEQ ID NO:83).
SEQ ID NOs:3-9 are partial sequences of HD-zip transcription factors.
SEQ ID NOs:10-54 are examples of HB53 transcription factors from a variety of
different plant species.
SEQ ID NOs:55-98 and 303 are examples of HB78 transcription factors from a
variety of different plant species.
SEQ ID NOs:99-134 are cDNA sequences of HB53 transcription factors from a
variety of different plant species.
SEQ ID NOs:135-174 are cDNA sequences of HB78 transcription factors from a
variety of different plant species.
8

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
SEQ ID NOs:175-182 show example spacer sequences for guide nucleic acids for
targeting HB78 and HB53 transcription factors.
SEQ ID NOs:183-194 are example cytosine deaminase amino acid sequences useful
with this invention.
SEQ ID NO:195 is an exemplary uracil-DNA glycosylase inhibitor (UGI) useful
with
this invention.
SEQ ID NO:196-197 are exemplary regulatory sequences encoding a promoter and
intron.
SEQ ID NOs:198-200 provide examples of a protospacer adjacent motif position
for
a Type V CRISPR-Cas12a nuclease.
SEQ ID NOs: 201 provides an example HB78 mutated amino acid sequence
produced using the methods and compositions of this invention.
SEQ ID NOs: 202 provides an example HB78 mutated nucleotide sequence produced
using the methods and compositions of this invention. SEQ ID NO:202 encodes
the amino
acid sequence of SEQ ID NO:201.
SEQ ID NOs: 203-246 and 302 are the portions of HB78 amino acid sequences for
different plant species shown in the alignment in Figs. 1A-1B.
SEQ ID NOs:247-291 are the portions of HB53 amino acid sequences for different

plant species shown in the alignment in Figs. 2A-2D.
SEQ ID NOs:292-294 are the HB53 sequences shown in Fig. 6.
SEQ ID NOs:295-297 are the HB78 sequences shown in Fig. 8.
SEQ ID NOs:298-301 are the HB78 sequences shown in Fig. 9.
SEQ ID NO:304 and SEQ ID NO:305 provide a wild type HB78 genomic sequence
and cDNA (corresponding to the WT HB78 amino acid sequence of SEQ ID NO:83 and
the
WT HB78 coding sequence of SEQ ID NO:171), respectively, from Z. mays.
SEQ ID NOs:306-309 provide the protein sequence, genomic sequence, coding
sequence and cDNA, respectively, of an edited HB78 having a 17 base pair
deletion.
SEQ ID NO:310 and SEQ ID NO:311 provide a wild type HB53 genomic sequence
and cDNA (corresponding to the WT HB53 amino acid sequence of SEQ ID NO:38 and
the
WT HB53 coding sequence of SEQ ID NO:132), respectively, from Z. mays.
SEQ ID NOs:312-315 provide the protein sequence, genomic sequence, coding
sequence and cDNA, respectively, of an edited HB53 having an 11 base pair
deletion.
SEQ ID NOs:316-319 provide the protein sequence, genomic sequence, coding
sequence and cDNA, respectively, of an edited HB53 having an 8 base pair
deletion.
9

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
SEQ ID NOs:320-329 are the sequences shown in Figs. 10-12.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 provides an alignment of HD-Zip (HB78) amino acid sequences from 44
different plant species. The sequences are a portion of consecutive amino
acids (53 amino acid
residues) from the full length HB78 sequences.
Fig. 2 provides an alignment of HD-Zip (HB53) amino acid sequences from 45
different plant species. The sequences are a portion of consecutive amino acid
residues (e.g.,
about 116 amino acid residues) from the full length HB53 sequences.
Fig. 3 provides an example illustrating the relationship between planting
density and
yield in corn. Increasing planting density increases plant yield up to an
inflection point
(arrows ¨ Optimum economic seeding rate). Variants with attenuated shade
avoidance would
have inflection points at a higher planting density.
Fig. 4 provides an example illustrating a dominant negative strategy. By
removing
the DNA binding capability of a bifunctional protein, dimerized complex will
not activate
gene expression.
Fig. 5 shows the relationship between HDLZ Class II proteins.
Fig. 6 provides an example of editing of the DNA binding domain of HB53 in Z.
mays and shows target example amino acid residues for modification in the
boxes. Coding
(SEQ ID NO:292) and non-coding (SEQ ID NO:293) strands and the HB53 amino acid
sequence (SEQ ID NO:294) are shown.
Fig. 7 provides an annotated HB78 gene with example guide nucleic acids.
Fig. 8 provides an example of a deletion in an HB78 gene (SEQ ID NO:295)
showing that a deletion (SEQ ID NO:297) in, for example, Exon 2 results in a
truncation that
deletes Exon 3, Exon 4 and the DNA binding domain. The protein sequence (SEQ
ID
NO:296) at the site of the deletion is also shown.
Fig. 9 provides a representative genomic sequence (coding strand (SEQ ID
NO:298)
and non-coding strand (SEQ ID NO:299)) of an edited plant showing a premature
stop
upstream of the HB78 DNA binding domain. The protein sequence (SEQ ID NO:300
and
SEQ ID NO:301) resulting from this premature stop is shown.
Fig. 10 provides an alignment showing the location of the guide PWsp227 and
resulting edits in Zea mays HB53 (consensus SEQ ID NO:320; Z. mays SEQ ID
NO:321;
CE28392, CE28403, CE28409, CE28382, CE28390 SEQ ID NO:322; CE28492, CE28505,

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
CE28514, CE28517, CE28522, CE28534, CE28544, CE28547 SEQ ID NO:320); CE28456,
CE28477, CE28483 SEQ ID NO:323).
Fig. 11 provides an alignment showing the location of the guide PWsp227 and
PWsp225 and resulting edits in Z. mays HB53 (consensus SEQ ID NO:324; Z. mays
SEQ ID
NO:325; CE28330... 8D SEQ ID NO:326; CE28350... 11D SEQ ID NO:327.
Fig. 12 provides an alignment showing the location of the guide PWsp230 and
PWsp232 and resulting edits in Z. mays HB78 (consensus SEQ ID NO:328; Z. mays
SEQ ID
NO:329; CE28330 SEQ ID NO:328; CE28350 SEQ ID NO:328).
Fig. 13 provides examples for targeting of HB53 (upper schematic) and HB78
(lower
schematic). Plasmids pWISE443 and pWISE444 are shown with corresponding
spacers.
Plasmid pWISE445 contains all 4 spacers shown for HB53. Plasmids pWISE446 and
pWISE447 are shown with corresponding spacers. Plasmid pWISE448 contains all 4
spacers
shown for HB78. Plasmid pWISE451 contains all 8 spacers (4 for HB53 and 4 for
HB78, as
shown in Fig. 13).
Fig. 14 provides the results of an example E2 shade avoidance assay of an HB53
knock-out and an HB53/HB78 knock-out showing no stem elongation in the edited
lines
when grown in the shade.
Fig. 15 provides an off-type analysis showing no evidence of morphological off-
types
or developmental delays in the edited plants.
DETAILED DESCRIPTION
The present invention now will be described hereinafter with reference to the
accompanying drawings and examples, in which embodiments of the invention are
shown.
This description is not intended to be a detailed catalog of all the different
ways in which the
invention may be implemented, or all the features that may be added to the
instant invention.
For example, features illustrated with respect to one embodiment may be
incorporated into
other embodiments, and features illustrated with respect to a particular
embodiment may be
deleted from that embodiment. Thus, the invention contemplates that in some
embodiments
of the invention, any feature or combination of features set forth herein can
be excluded or
omitted. In addition, numerous variations and additions to the various
embodiments
suggested herein will be apparent to those skilled in the art in light of the
instant disclosure,
which do not depart from the instant invention. Hence, the following
descriptions are
intended to illustrate some particular embodiments of the invention, and not
to exhaustively
specify all permutations, combinations, and variations thereof.
11

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
Unless otherwise defined, all technical and scientific terms used herein have
the
same meaning as commonly understood by one of ordinary skill in the art to
which this
invention belongs. The terminology used in the description of the invention
herein is for the
purpose of describing particular embodiments only and is not intended to be
limiting of the
.. invention.
All publications, patent applications, patents and other references cited
herein are
incorporated by reference in their entireties for the teachings relevant to
the sentence and/or
paragraph in which the reference is presented.
Unless the context indicates otherwise, it is specifically intended that the
various
features of the invention described herein can be used in any combination.
Moreover, the
present invention also contemplates that in some embodiments of the invention,
any feature
or combination of features set forth herein can be excluded or omitted. To
illustrate, if the
specification states that a composition comprises components A, B and C, it is
specifically
intended that any of A, B or C, or a combination thereof, can be omitted and
disclaimed
singularly or in any combination.
As used in the description of the invention and the appended claims, the
singular
forms "a," "an" and "the" are intended to include the plural forms as well,
unless the context
clearly indicates otherwise.
Also as used herein, "and/or" refers to and encompasses any and all possible
combinations of one or more of the associated listed items, as well as the
lack of
combinations when interpreted in the alternative ("or").
The term "about," as used herein when referring to a measurable value such as
an
amount or concentration and the like, is meant to encompass variations of
10%, 5%, 1%,
0.5%, or even 0.1% of the specified value as well as the specified value.
For example,
"about X" where X is the measurable value, is meant to include X as well as
variations of
10%, 5%, 1%, 0.5%, or even 0.1% of X. A range provided herein for a
measurable
value may include any other range and/or individual value therein.
As used herein, phrases such as "between X and Y" and "between about X and Y"
should be interpreted to include X and Y. As used herein, phrases such as
"between about X
and Y" mean "between about X and about Y" and phrases such as "from about X to
Y" mean
"from about X to about Y."
Recitation of ranges of values herein are merely intended to serve as a
shorthand
method of referring individually to each separate value falling within the
range, unless
otherwise indicated herein, and each separate value is incorporated into the
specification as if
12

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
it were individually recited herein. For example, if the range 10 to15 is
disclosed, then 11,
12, 13, and 14 are also disclosed.
The term "comprise," "comprises" and "comprising" as used herein, specify the
presence of the stated features, integers, steps, operations, elements, and/or
components, but
do not preclude the presence or addition of one or more other features,
integers, steps,
operations, elements, components, and/or groups thereof
As used herein, the transitional phrase "consisting essentially of' means that
the scope
of a claim is to be interpreted to encompass the specified materials or steps
recited in the
claim and those that do not materially affect the basic and novel
characteristic(s) of the
claimed invention. Thus, the term "consisting essentially of' when used in a
claim of this
invention is not intended to be interpreted to be equivalent to "comprising."
As used herein, the terms "increase," "increasing," "increased," "enhance,"
"enhanced," "enhancing," and "enhancement" (and grammatical variations
thereof) describe
an elevation of at least about 15%, 20%, 25%, 50%, 75%, 100%, 150%, 200%,
300%, 400%,
500% or more as compared to a control.
As used herein, the terms "reduce," "reduced," "reducing," "reduction,"
"diminish,"
and "decrease" (and grammatical variations thereof), describe, for example, a
decrease of at
least about 5%, 10%, 15%, 20%, 25%, 35%, 50%, 75%, 80%, 85%, 90%, 95%, 96%,
97%,
98%, 99%, 99.5%, 99.6%, 99.7%, 99.8%, 99.9%, or 100% as compared to a control.
In
particular embodiments, the reduction can result in no or essentially no
(i.e., an insignificant
amount, e.g., less than about 10% or even 5%) detectable activity or amount.
As used herein, the terms "express," "expresses," "expressed" or "expression,"
and the
like, with respect to a nucleic acid molecule and/or a nucleotide sequence
(e.g., RNA or DNA)
indicates that the nucleic acid molecule and/or a nucleotide sequence is
transcribed and,
optionally, translated. Thus, a nucleic acid molecule and/or a nucleotide
sequence may express
a polypeptide of interest or, for example, a functional untranslated RNA.
A "heterologous" or a "recombinant" nucleotide sequence is a nucleotide
sequence not
naturally associated with a host cell into which it is introduced, including
non- naturally
occurring multiple copies of a naturally occurring nucleotide sequence.
A "native" or "wild type" nucleic acid, nucleotide sequence, polypeptide or
amino
acid sequence refers to a naturally occurring or endogenous nucleic acid,
nucleotide
sequence, polypeptide, or amino acid sequence. Thus, for example, a "wild type
mRNA" is
an mRNA that is naturally occurring in or endogenous to the reference
organism.
13

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
As used herein, the term "heterozygous" refers to a genetic status wherein
different
alleles reside at corresponding loci on homologous chromosomes.
As used herein, the term "homozygous" refers to a genetic status wherein
identical
alleles reside at corresponding loci on homologous chromosomes.
As used herein, the term "allele" refers to one of two or more different
nucleotides or
nucleotide sequences that occur at a specific locus.
A "locus" is a position on a chromosome where a gene or marker or allele is
located.
In some embodiments, a locus may encompass one or more nucleotides.
As used herein, the terms "desired allele," "target allele" and/or "allele of
interest" are
used interchangeably to refer to an allele associated with a desired trait. In
some
embodiments, a desired allele may be associated with either an increase or a
decrease
(relative to a control) of or in a given trait, depending on the nature of the
desired phenotype.
In some embodiments of this invention, the phrase "desired allele," "target
allele" or "allele of
interest" refers to an allele(s) that is associated with increased yield under
non-water stress
conditions in a plant relative to a control plant not having the target allele
or alleles.
A marker is "associated with" a trait when said trait is linked to it and when
the
presence of the marker is an indicator of whether and/or to what extent the
desired trait or
trait form will occur in a plant/germplasm comprising the marker. Similarly, a
marker is
"associated with" an allele or chromosome interval when it is linked to it and
when the
presence of the marker is an indicator of whether the allele or chromosome
interval is present
in a plant/germplasm comprising the marker.
As used herein, the terms "backcross" and "backcrossing" refer to the process
whereby a progeny plant is crossed back to one of its parents one or more
times (e.g., 1, 2, 3,
4, 5, 6, 7, 8, etc.). In a backcrossing scheme, the "donor" parent refers to
the parental plant
with the desired gene or locus to be introgressed. The "recipient" parent
(used one or more
times) or "recurrent" parent (used two or more times) refers to the parental
plant into which
the gene or locus is being introgressed. For example, see Ragot, M. et al.
Marker-assisted
Backcrossing: A Practical Example, in TECHNIQUES ET UTILISATIONS DES MARQUEURS

MOLECULAIRES LES COLLOQUES, Vol. 72, pp. 45-56 (1995); and Openshaw et al.,
Marker-
assisted Selection in Backcross Breeding, in PROCEEDINGS OF THE SYMPOSIUM
"ANALYSIS OF
MOLECULAR MARKER DATA," pp. 41-43 (1994). The initial cross gives rise to the
Fl
generation. The term "BC1" refers to the second use of the recurrent parent,
"BC2" refers to
the third use of the recurrent parent, and so on.
14

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
As used herein, the terms "cross" or "crossed" refer to the fusion of gametes
via
pollination to produce progeny (e.g., cells, seeds or plants). The term
encompasses both
sexual crosses (the pollination of one plant by another) and selfing (self-
pollination, e.g.,
when the pollen and ovule are from the same plant). The term "crossing" refers
to the act of
fusing gametes via pollination to produce progeny.
As used herein, the terms "introgression," "introgressing" and "introgressed"
refer to
both the natural and artificial transmission of a desired allele or
combination of desired alleles
of a genetic locus or genetic loci from one genetic background to another. For
example, a
desired allele at a specified locus can be transmitted to at least one progeny
via a sexual cross
.. between two parents of the same species, where at least one of the parents
has the desired
allele in its genome. Alternatively, for example, transmission of an allele
can occur by
recombination between two donor genomes, e.g., in a fused protoplast, where at
least one of
the donor protoplasts has the desired allele in its genome. The desired allele
may be a
selected allele of a marker, a QTL, a transgene, or the like. Offspring
comprising the desired
allele can be backcrossed one or more times (e.g., 1, 2, 3, 4, or more times)
to a line having a
desired genetic background, selecting for the desired allele, with the result
being that the
desired allele becomes fixed in the desired genetic background. For example, a
marker
associated with increased yield under non-water stress conditions may be
introgressed from a
donor into a recurrent parent that does not comprise the marker and does not
exhibit
increased yield under non-water stress conditions. The resulting offspring
could then be
backcrossed one or more times and selected until the progeny possess the
genetic marker(s)
associated with increased yield under non-water stress conditions in the
recurrent parent
background.
A "genetic map" is a description of genetic linkage relationships among loci
on one or
more chromosomes within a given species, generally depicted in a diagrammatic
or tabular
form. For each genetic map, distances between loci are measured by the
recombination
frequencies between them. Recombination between loci can be detected using a
variety of
markers. A genetic map is a product of the mapping population, types of
markers used, and
the polymorphic potential of each marker between different populations. The
order and
genetic distances between loci can differ from one genetic map to another.
As used herein, the term "genotype" refers to the genetic constitution of an
individual
(or group of individuals) at one or more genetic loci, as contrasted with the
observable and/or
detectable and/or manifested trait (the phenotype). Genotype is defined by the
allele(s) of
one or more known loci that the individual has inherited from its parents. The
term genotype

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
can be used to refer to an individual's genetic constitution at a single
locus, at multiple loci, or
more generally, the term genotype can be used to refer to an individual's
genetic make-up for
all the genes in its genome. Genotypes can be indirectly characterized, e.g.,
using markers
and/or directly characterized by nucleic acid sequencing.
As used herein, the term "germplasm" refers to genetic material of or from an
individual (e.g., a plant), a group of individuals (e.g., a plant line,
variety or family), or a
clone derived from a line, variety, species, or culture. The germplasm can be
part of an
organism or cell, or can be separate from the organism or cell. In general,
germplasm
provides genetic material with a specific genetic makeup that provides a
foundation for some
or all of the hereditary qualities of an organism or cell culture. As used
herein, germplasm
includes cells, seed or tissues from which new plants may be grown, as well as
plant parts
that can be cultured into a whole plant (e.g., leaves, stems, buds, roots,
pollen, cells, etc.).
As used herein, the terms "cultivar" and "variety" refer to a group of similar
plants
that by structural or genetic features and/or performance can be distinguished
from other
varieties within the same species.
As used herein, the terms "exotic," "exotic line" and "exotic germplasm" refer
to any
plant, line or germplasm that is not elite. In general, exotic
plants/germplasms are not
derived from any known elite plant or germplasm, but rather are selected to
introduce one or
more desired genetic elements into a breeding program (e.g., to introduce
novel alleles into a
breeding program).
As used herein, the term "hybrid" in the context of plant breeding refers to a
plant that
is the offspring of genetically dissimilar parents produced by crossing plants
of different lines
or breeds or species, including but not limited to the cross between two
inbred lines.
As used herein, the term "inbred" refers to a substantially homozygous plant
or
variety. The term may refer to a plant or plant variety that is substantially
homozygous
throughout the entire genome or that is substantially homozygous with respect
to a portion of
the genome that is of particular interest.
A "haplotype" is the genotype of an individual at a plurality of genetic loci,
i.e., a
combination of alleles. Typically, the genetic loci that define a haplotype
are physically and
genetically linked, i.e., on the same chromosome segment. The term "haplotype"
can refer to
polymorphisms at a particular locus, such as a single marker locus, or
polymorphisms at
multiple loci along a chromosomal segment.
16

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
As used herein, the term "heterologous" refers to a nucleotide/polypeptide
that
originates from a foreign species, or, if from the same species, is
substantially modified from
its native form in composition and/or genomic locus by deliberate human
intervention.
As used herein, "shade avoidance response" is defined as the growth of a plant
in
response to a low red:far-red (R:FR) light ratio. Suppression of a shade
avoidance refers to
the suppression of the growth changes in response to a low R:FR light ratio.
In one aspect,
suppression of a shade avoidance response can be shown by measuring the height
of a plant
comprising the trait of the invention (e.g., mutation of HD-Zip as described
herein) and an
isogenic plant without the trait in a controlled environment with a low R:FR
light ratio. When
grown under identical conditions in the presence of a R:FR ratio of 0.16, a
plant comprising
the trait of the invention will be at least 5% shorter (e.g., height measured
at coleoptile, V1
sheath or V2 sheath) than an isogenic plant not comprising the trait (e.g.,
about 5, 6, 7, 8, 9,
10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28,
29, 30, 31, 32, 33, 34,
35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53,
54, 55, 56, 57, 58, 59,
60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78,
79, 80, 81, 82, 83, 84,
85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 110, 120,
130, 140, 150%, or
more shorter, or any range or value therein; e.g., about 5, 6, 7, 8, 9, 10,
11, 12, 13, 14, 15, 16,
17, 18, 19, 20, 21, 22, 23, 24, 25% shorter to about 26, 27, 28, 29, 30, 31,
32, 33, 34, 35, 36,
37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55,
56, 57, 58, 59, 60, 61,
62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80,
81, 82, 83, 84, 85, 86,
87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 110, 120, 130, 140,
150%, or more
shorter) (e.g., about 5% to about 10% shorter, about 5% to about 15% shorter,
about 5% to
about 20% shorter, about 5% to about 25% shorter, about 5% to about 30%
shorter, about 5%
to about 40% shorter, about 5% to about 50% shorter, about 10% to about 20%
shorter, about
10% to about 30% shorter, about 10% to about 50% shorter, 10% to about 70%
shorter, about
15% to about 20% shorter, about 15% to about 30% shorter, 15% to about 50%
shorter, about
20% to about 30% shorter, about 20% to about 50% shorter, about 20% to about
70% shorter,
about 40% to about 50% shorter, about 40% to about 60% shorter, about 40% to
about 80%
shorter, about 40% to about 100% shorter, about 50% to about 70% shorter,
about 50% to
about 100% shorter, about 50% to about 125% shorter, about 75% to about 100%
shorter,
about 75% to about 120% shorter, about 75% to about 140% shorter, and the
like).
Plants exhibiting SAR show exaggerated elongation of hypocotyl and internodes,

longer leaf, impaired root growth, early flowering and reduced seed set, low
photosynthesis
17

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
efficiency, enhanced green snap, high lodging rate, hastened senescence,
reduced grain
filling; and active suppression of disease and herbivory response mechanisms.
A plant in which SAR is reduced as described herein may have increased yield
as
compared to a plant that does not comprise the reduction in SAR. As used
herein, "increased
yield" refers to any plant trait associated with growth, for example, biomass,
yield, nitrogen
use efficiency (NUE), inflorescence size/weight, fruit yield, fruit quality,
fruit size, seed size,
seed number, foliar tissue weight, nodulation number, nodulation mass,
nodulation activity,
number of seed heads, number of tillers, number of flowers, number of tubers,
tuber mass,
bulb mass, number of seeds, total seed mass, rate of leaf emergence, rate of
tiller emergence,
rate of seedling emergence, length of roots, number of roots, size and/or
weight of root mass,
or any combination thereof. Thus, in some aspects, "increased yield" may
include, but is not
limited to, increased inflorescence production, increased fruit production
(e.g., increased
number, weight and/or size of fruit; e.g., increase number, weight, and/or
size of ears for,
e.g., maize), increased fruit quality, increased number, size and/or weight of
roots, increased
.. meristem size, increased seed size, increased biomass, increased leaf size,
increased nitrogen
use efficiency, increased height and/or increased internode length as compared
to a control
plant or part thereof (e.g., a plant that does not comprise a mutated
endogenous nucleic acid
encoding an HD-Zip transcription factor as described herein grown in an
environment with a
low R:FR light ratio (e.g., a shaded environment; e.g., a R:FR ratio of about
0.16) including
when grown in close proximity with other plants). In some aspects, increased
yield can be
expressed as quantity of grain produced per area of land (e.g. bushels per
acre of land).
"Seed weight" is jointly determined by grain morphology traits such as seed
length,
seed width and seed thickness as well as grain filling and these traits are
all governed by
quantitative genetics.
As used herein "decreased height" means repression of stem elongation in
response to
enriched far-red light.
As used herein, "decreased shoot:root ratio" means reduction of the proportion
of
above ground biomass relative to below ground biomass.
As used herein, the terms "nucleic acid," "nucleic acid molecule," "nucleotide
sequence" and "polynucleotide" refer to RNA or DNA that is linear or branched,
single or
double stranded, or a hybrid thereof The term also encompasses RNA/DNA
hybrids. When
dsRNA is produced synthetically, less common bases, such as inosine, 5-
methylcytosine, 6-
methyladenine, hypoxanthine and others can also be used for antisense, dsRNA,
and
ribozyme pairing. For example, polynucleotides that contain C-5 propyne
analogues of
18

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
uridine and cytidine have been shown to bind RNA with high affinity and to be
potent
antisense inhibitors of gene expression. Other modifications, such as
modification to the
phosphodiester backbone, or the 2'-hydroxy in the ribose sugar group of the
RNA can also be
made.
As used herein, the term "nucleotide sequence" refers to a heteropolymer of
nucleotides or the sequence of these nucleotides from the 5' to 3' end of a
nucleic acid
molecule and includes DNA or RNA molecules, including cDNA, a DNA fragment or
portion, genomic DNA, synthetic (e.g., chemically synthesized) DNA, plasmid
DNA,
mRNA, and anti-sense RNA, any of which can be single stranded or double
stranded. The
terms "nucleotide sequence" "nucleic acid," "nucleic acid molecule," "nucleic
acid construct,"
"oligonucleotide" and "polynucleotide" are also used interchangeably herein to
refer to a
heteropolymer of nucleotides. Nucleic acid molecules and/or nucleotide
sequences provided
herein are presented herein in the 5' to 3' direction, from left to right and
are represented
using the standard code for representing the nucleotide characters as set
forth in the U.S.
sequence rules, 37 CFR 1.821 - 1.825 and the World Intellectual Property
Organization
(WIPO) Standard ST.25. A "5' region" as used herein can mean the region of a
polynucleotide that is nearest the 5' end of the polynucleotide. Thus, for
example, an element
in the 5' region of a polynucleotide can be located anywhere from the first
nucleotide located
at the 5' end of the polynucleotide to the nucleotide located halfway through
the
polynucleotide. A "3' region" as used herein can mean the region of a
polynucleotide that is
nearest the 3' end of the polynucleotide. Thus, for example, an element in the
3' region of a
polynucleotide can be located anywhere from the first nucleotide located at
the 3' end of the
polynucleotide to the nucleotide located halfway through the polynucleotide.
As used herein with respect to nucleic acids, the term "fragment" or "portion"
refers to
a nucleic acid that is reduced in length relative (e.g., reduced by 1, 2, 3,
4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, 20 or more nucleotides) to a reference nucleic
acid and that
comprises, consists essentially of and/or consists of a nucleotide sequence of
contiguous
nucleotides identical or almost identical (e.g., 70%, 71%, 72%, 73%, 74%, 75%,
76%, 77%,
78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%,
93%,
94%, 95%, 96%, 97%, 98%, 99% identical) to a corresponding portion of the
reference
nucleic acid. Such a nucleic acid fragment may be, where appropriate, included
in a larger
polynucleotide of which it is a constituent. As an example, a repeat sequence
of guide
nucleic acid of this invention may comprise a portion of a wild type CRISPR-
Cas repeat
sequence (e.g., a wild Type CRISR-Cas repeat; e.g., a repeat from the CRISPR
Cas system
19

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
of, for example, a Cas9, Cas12a (Cpfl), Cas12b, Cas12c (C2c3), Cas12d (CasY),
Cas12e
(CasX), Cas12g, Cas12h, Cas12i, C2c4, C2c5, C2c8, C2c9, C2c10, Cas14a, Cas14b,
and/or a
Cas14c, and the like). In some embodiments, a nucleic acid fragment may
comprise, consist
essentially of or consist of about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13,
14, 15, 20, 25, 30, 35,
40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 125, 150, 175, 200, 225,
250, 300, 350,
400, 450, 500, 550, 600, 660, 700, 750, 800, 850, 900, 950, 1000, 1050, 1100,
1150, 1200,
1250, 1300, 1350, 1400, 1450, 1500, 1550, 1600, 1650, 1700, 1750, 1800, 1850,
1900, 1950,
2000 or more consecutive nucleotides of a nucleotide sequence encoding an HD-
Zip
transcription factor for which a reduction in activity, e.g., a reduction in
DNA binding, can
result in a reduced shade avoidance response in a plant.
In some embodiments, a fragment or portion may be a fragment or portion of an
HD-
Zip transcription factor. In some embodiments, a fragment or portion of a
nucleic acid may
be a fragment or portion of nucleic acid encoding an amino acid sequence of
any one of (a) a
polypeptide comprising a sequence having at least 80% sequence identity to the
amino acid
sequence of SEQ ID NO:38 or SEQ ID NO:83; (b) a polypeptide comprising a
sequence
having at least 80% sequence identity to the amino acid sequence of: SEQ ID
NO:! or SEQ
ID NO:2; (c) a polypeptide comprising a sequence having at least 80% sequence
identity to
the amino acid sequence of SEQ ID NO:3; (d) a polypeptide comprising a
sequence having
at least 95% sequence identity to the nucleotide sequence of SEQ ID NO:4; (e)
a polypeptide
comprising a sequence having the amino acid sequence of SEQ ID NO:5, wherein X
is L or
S; (f) a polypeptide comprising: (i) sequence having the amino acid sequence
of
RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S or
A;
(ii) a sequence having the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID
NO:8),
wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a
sequence having
the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H,
X2 is
R or K and X3 is V or L; and (iv) a sequence having the amino acid sequence of

VWFQNRRA (SEQ ID NO:9); and/or (g) a polypeptide comprising a sequence having
the
amino acid sequence of VWFQNRRA (SEQ ID NO:9), wherein the fragment or portion

comprises about 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,
20, 21, 22, 23, 24, 25,
26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44,
45, 46, 47, 48, 49, 50,
51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69,
70, 71, 72, 73, 74, 75,
76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94,
95, 96, 97, 98, 99,
100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114,
115, 116, 117,
118, 119, 120, 130, 140, 150, 175, 200, 225, 250, 300, 350 or more consecutive
nucleotides,

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
or any range or value therein of a nucleic acid encoding any one of the
polypeptides of (a)-(g)
as described above. In some embodiments, a "portion" may be related to the
number of
amino acids deleted from a polypeptide. Thus, for example, a deletion of a
portion of an HD-
Zip transcription factor may comprise the deletion of at least two consecutive
nucleotides
(e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
of a nucleotide
sequence encoding an HD-Zip transcription factor polypeptide that comprises
the amino acid
sequence of SEQ ID NO:9 (VWFQNRRA). In some embodiments, a deletion may
comprise
a portion of the HD-Zip transcription factor that comprises exon 3 and exon 4,
wherein exon
3 encodes the HD-Zip DNA binding region. In some embodiments, a deletion may
comprise
a portion of the HD-Zip transcription factor, wherein the portion comprises
all of exon 3 and
exon 4 and optionally, a portion of exon 2. In some embodiments, a deletion
may comprise
the portion of the HD-Zip polynucleotide encoding about the last 96 to 125
consecutive
amino acid residues at the C-terminal portion of the HD-Zip polypeptide.
In some embodiments, a "sequence-specific DNA binding domain" may bind to one
or more fragments or portions of nucleotide sequences encoding HD-Zip
transcription factors
as described herein.
As used herein with respect to polypeptides, the term "fragment" or "portion"
may
refer to a polypeptide that is reduced in length relative to a reference
polypeptide and that
comprises, consists essentially of and/or consists of an amino acid sequence
of contiguous
amino acids identical or almost identical (e.g., 90%, 91%, 92%, 93%, 94%, 95%,
96%, 97%,
98%, 99% identical) to a corresponding portion of the reference polypeptide.
Such a
polypeptide fragment may be, where appropriate, included in a larger
polypeptide of which it
is a constituent. In some embodiments, the polypeptide fragment comprises,
consists
essentially of or consists of at least about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 20, 25,
30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 125, 150, 175,
200, 225, 250, 300,
350, 400 or more consecutive amino acids of a reference polypeptide. In some
embodiments,
a polypeptide fragment may comprise, consist essentially of or consist of
about 5, 6, 7, 8, 9,
10, 11, 12, 13, 14, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80,
85, 90, 95, 100, 125,
150, 175, 200, 225, 250, 300, 350, 400, 450, 500, 550, 600, 660, 700 or more
consecutive
amino acid residues of a HD-Zip transcription factor (e.g., (a) a polypeptide
comprising a
sequence having at least 80% sequence identity to the amino acid sequence of
SEQ ID
NO:38 or SEQ ID NO:83; (b) a polypeptide comprising a sequence having at least
80%
sequence identity to the amino acid sequence of: SEQ ID NO:! or SEQ ID NO:2;
(c) a
polypeptide comprising a sequence having at least 80% sequence identity to the
amino acid
21

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
sequence of SEQ ID NO:3; (d) a polypeptide comprising a sequence having at
least 95%
sequence identity to the nucleotide sequence of SEQ ID NO:4; (e) a polypeptide
comprising
a sequence having the amino acid sequence of SEQ ID NO:5, wherein X is L or S;
(f) a
polypeptide comprising: (i) sequence having the amino acid sequence of
RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S or
A;
(ii) a sequence having the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID
NO:8),
wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a
sequence having
the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H,
X2 is
R or K and X3 is V or L; and (iv) a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9); and/or (g) a polypeptide comprising a sequence having
the
amino acid sequence of VWFQNRRA (SEQ ID NO:9).
In some embodiments, a fragment or portion may be a fragment or portion of an
HD-
Zip transcription factor. In some embodiments, a fragment or portion may be a
fragment or
portion of an amino acid sequence of any one of (a) a polypeptide comprising a
sequence
having at least 80% sequence identity to the amino acid sequence of (a) to (g)
described
above, wherein the fragment or portion comprises about 2, 3, 4, 5, 6, 7, 8, 9,
10, 11, 12, 13,
14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32,
33, 34, 35, 36, 37, 38,
39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57,
58, 59, 60, 61, 62, 63,
64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82,
83, 84, 85, 86, 87, 88,
89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106,
107, 108, 109,
110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124,
125, 126, 127,
128, 129, 130, 140, 150, 175, 200, 225, 250, 300, 350 or more consecutive
amino acids, or
any range or value therein of any one of (a) a polypeptide comprising a
sequence having at
least 80% sequence identity to the amino acid sequence of (a) to (g) described
above. In
some embodiments, a "portion" may be related to the number of amino acids that
are deleted
from a polypeptide. Thus, for example, a deleted "portion" of an HD-Zip
transcription factor
may comprise at least one amino acid residue (e.g., at least 1, 2, 3, 4, 5, 6,
7, 8 amino acid
residues), and/or at least two (e.g., at least 2, 3, 4, 5, 6, 7, 8 amino acid
residues) amino acid
residues of the amino acid sequence of SEQ ID NO:9 (VWFQNRRA) of any HD-Zip
transcription factor described herein. In some embodiments, a deletion of a
portion of an
HD-Zip transcription factor may comprise a portion of consecutive amino acid
residues of
SEQ ID NO:9 (e.g., at least 2, 3, 4, 5, 6, 7, or 8 consecutive amino acid
residues). In some
embodiments, the deletion includes at least a portion of consecutive amino
acid residues of
SEQ ID NO:9 (e.g., at least 1, 2, 3, 4, 5, 6, 7, or 8 consecutive amino acid
residues), wherein
22

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
the deletion may be at least 1 amino acid residue to about 120 amino acid
residues in length
(e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19, 20, 21, 22, 23, 24,
25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43,
44, 45, 46, 47, 48, 49,
50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68,
69, 70, 71, 72, 73, 74,
.. 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93,
94, 95, 96, 97, 98, 99,
100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114,
115, 116, 117,
118, 119, 120, or more consecutive amino acid residues up to the full length
of the HD-Zip
transcription factor, and any range or value therein) and wherein at least 1
amino acid residue
that is deleted is from the DNA binding region of the HD-Zip transcription
factor. In some
embodiments, the deletion may be a truncation that includes at least a portion
of consecutive
amino acid residues of SEQ ID NO:9 (e.g., at least 1, 2, 3, 4, 5, 6, 7, or 8
consecutive amino
acid residues). In some embodiments, the truncation may be a C-terminal
truncation and
comprise a length of at least 96 consecutive amino acid residues. In some
embodiments, the
truncation may be a C-terminal truncation and comprise a length of about 96
amino acid
residues to about 125 amino acid residues (e.g., at least 96, 97, 98, 99, 100,
101, 102, 103,
104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118,
119, 120, 121,
122, 123, 124, 125, or more consecutive amino acid residues up to the full
length of the HD-
Zip transcription factor, and any range or value therein) and wherein at least
1 amino acid
residue that is truncated is from the DNA binding region of the HD-Zip
transcription factor.
.. In some embodiments, a deletion may include a deletion of exon 3, which
comprises the
DNA binding region of the HD-Zip transcription factor (e.g., the portion of
the HD-Zip
transcription factor that is deleted may be about 27 amino acid residues in
length). In some
embodiments, the truncation may be a result of a deletion in exon 2 (e.g, a
portion of exon 2)
that results in a truncation of a portion of the amino acid residues encoded
by exon2 and all
remaining amino acids following the deletion, thereby, for example, truncating
all of the
amino acids encoded by exon 3 and exon 4. Thus, in some embodiments, a
deletion may
result in a truncation of the C-terminal region of the HD-Zip transcription
factor polypeptide
that includes the DNA binding region or at least a portion of the DNA binding
region. In
some embodiments, a deletion may cause a frameshift mutation that results a
stop codon and
a truncation of the C-terminus of the HD-Zip transcription factor polypeptide.
In some
embodiments, a C-terminal truncation may result in a polypeptide that
comprises 207 amino
acids (e.g., the deleted or truncated portion includes all amino acid residues
after amino acid
residue 207; see, e.g., maize HD-Zip edited polypeptide SEQ ID NO:201).
In some embodiments, a "sequence-specific DNA binding domain" may bind to one
23

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
or more fragments or portions of nucleotide sequences encoding HD-Zip
transcription factors
as described herein.
As used herein with respect to nucleic acids, the term "functional fragment"
refers to
nucleic acid that encodes a functional fragment of a polypeptide.
The term "gene," as used herein, refers to a nucleic acid molecule capable of
being
used to produce mRNA, antisense RNA, miRNA, anti-microRNA antisense
oligodeoxyribonucleotide (AMO) and the like. Genes may or may not be capable
of being
used to produce a functional protein or gene product. Genes can include both
coding and
non-coding regions (e.g., introns, regulatory elements, promoters, enhancers,
termination
sequences and/or 5' and 3' untranslated regions). A gene may be "isolated" by
which is meant
a nucleic acid that is substantially or essentially free from components
normally found in
association with the nucleic acid in its natural state. Such components
include other cellular
material, culture medium from recombinant production, and/or various chemicals
used in
chemically synthesizing the nucleic acid.
The term "mutation" refers to point mutations (e.g., missense, or nonsense, or
insertions or deletions of single base pairs that result in frame shifts),
insertions, deletions,
and/or truncations. When the mutation is a substitution of a residue within an
amino acid
sequence with another residue, or a deletion or insertion of one or more
residues within a
sequence, the mutations are typically described by identifying the original
residue followed
__ by the position of the residue within the sequence and by the identity of
the newly substituted
residue. In some embodiments, a deletion may result in a frameshift mutation
that generates
a premature stop codon, thereby truncating the protein.
The terms "complementary" or "complementarity," as used herein, refer to the
natural
binding of polynucleotides under permissive salt and temperature conditions by
base-pairing.
For example, the sequence "A-G-T" (5' to 3') binds to the complementary
sequence "T-C-A"
(3' to 5'). Complementarity between two single-stranded molecules may be
"partial," in
which only some of the nucleotides bind, or it may be complete when total
complementarity
exists between the single stranded molecules. The degree of complementarity
between
nucleic acid strands has significant effects on the efficiency and strength of
hybridization
between nucleic acid strands.
"Complement," as used herein, can mean 100% complementarity with the
comparator
nucleotide sequence or it can mean less than 100% complementarity (e.g., about
70%, 71%,
72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%,
87%,
24

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, and the like,
complementarity) to the comparator nucleotide sequence.
Different nucleic acids or proteins having homology are referred to herein as
"homologues." The term homologue includes homologous sequences from the same
and
__ from other species and orthologous sequences from the same and other
species. "Homology"
refers to the level of similarity between two or more nucleic acid and/or
amino acid
sequences in terms of percent of positional identity (i.e., sequence
similarity or identity).
Homology also refers to the concept of similar functional properties among
different nucleic
acids or proteins. Thus, the compositions and methods of the invention further
comprise
__ homologues to the nucleotide sequences and polypeptide sequences of this
invention.
"Orthologous," as used herein, refers to homologous nucleotide sequences and/
or amino acid
sequences in different species that arose from a common ancestral gene during
speciation. A
homologue of a nucleotide sequence of this invention has a substantial
sequence identity
(e.g., at least about 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%,
81%,
82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%,
97%,
98%, 99%, 99.5% or 100%) to said nucleotide sequence of the invention.
As used herein "sequence identity" refers to the extent to which two optimally
aligned
polynucleotide or polypeptide sequences are invariant throughout a window of
alignment of
components, e.g., nucleotides or amino acids. "Identity" can be readily
calculated by known
methods including, but not limited to, those described in: Computational
Molecular Biology
(Lesk, A. M., ed.) Oxford University Press, New York (1988); Biocomputing:
Informatics
and Genome Projects (Smith, D. W., ed.) Academic Press, New York (1993);
Computer
Analysis of Sequence Data, Part I (Griffin, A. M., and Griffin, H. G., eds.)
Humana Press,
New Jersey (1994); Sequence Analysis in Molecular Biology (von Heinje, G.,
ed.) Academic
__ Press (1987); and Sequence Analysis Primer (Gribskov, M. and Devereux, J.,
eds.) Stockton
Press, New York (1991).
As used herein, the term "percent sequence identity" or "percent identity"
refers to the
percentage of identical nucleotides in a linear polynucleotide sequence of a
reference
("query") polynucleotide molecule (or its complementary strand) as compared to
a test
("subject") polynucleotide molecule (or its complementary strand) when the two
sequences
are optimally aligned. In some embodiments, "percent identity" can refer to
the percentage of
identical amino acids in an amino acid sequence as compared to a reference
polypeptide.
As used herein, the phrase "substantially identical," or "substantial
identity" in the
context of two nucleic acid molecules, nucleotide sequences or polypeptide
sequences, refers

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
to two or more sequences or subsequences that have at least about 70%, 71%,
72%, 73%,
74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%,
89%,
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or 100% nucleotide or
amino acid residue identity, when compared and aligned for maximum
correspondence, as
.. measured using one of the following sequence comparison algorithms or by
visual inspection.
In some embodiments of the invention, the substantial identity exists over a
region of
consecutive nucleotides of a nucleotide sequence of the invention that is
about 10 nucleotides
to about 20 nucleotides, about 10 nucleotides to about 25 nucleotides, about
10 nucleotides to
about 30 nucleotides, about 15 nucleotides to about 25 nucleotides, about 30
nucleotides to
.. about 40 nucleotides, about 50 nucleotides to about 60 nucleotides, about
70 nucleotides to
about 80 nucleotides, about 90 nucleotides to about 100 nucleotides, or more
nucleotides in
length, and any range therein, up to the full length of the sequence. In some
embodiments,
nucleotide sequences can be substantially identical over at least about 20
nucleotides (e.g.,
about 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37,
38, 39, 40
nucleotides). In some embodiments of the invention, the substantial identity
exists over a
region of consecutive amino acid residues of a polypeptide of the invention
that is about 3
amino acid residues to about 20 amino acid residues, about 5 amino acid
residues to about 25
amino acid residues, about 7 amino acid residues to about 30 amino acid
residues, about 10
amino acid residues to about 25 amino acid residues, about 15 amino acid
residues to about
30 amino acid residues, about 20 amino acid residues to about 40 amino acid
residues, about
amino acid residues to about 40 amino acid residues, about 25 amino acid
residues to
about 50 amino acid residues, about 30 amino acid residues to about 50 amino
acid residues,
about 40 amino acid residues to about 50 amino acid residues, about 40 amino
acid residues
to about 70 amino acid residues, about 50 amino acid residues to about 70
amino acid
25 residues, about 60 amino acid residues to about 80 amino acid residues,
about 70 amino acid
residues to about 80 amino acid residues, about 90 amino acid residues to
about 100 amino
acid residues, or more amino acid residues in length, and any range therein,
up to the full
length of the sequence. In some embodiments, polypeptide sequences can be
substantially
identical to one another over at least about 8 consecutive amino acid residues
(e.g., about 8,
9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28,
29, 30, 31, 32, 33,
34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52,
53, 54, 55, 56, 57, 58,
59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77,
78, 79, 80, 81, 82, 83,
84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102,
103, 104, 105,
106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120,
130, 140, 150,
26

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
175, 200, 225, 250, 275, 300, 325, 350 or more amino acids in length or more
consecutive
amino acid residues). In some embodiments, two or more HD-Zip transcription
factors may
be substantially identical to one another over at least about 8 consecutive
amino acid residues
(e.g., SEQ ID NO:9), over at least about 9 consecutive amino acid residues
(e.g., SEQ ID
NO:7), over at least about 11 consecutive amino acid residues (e.g., SEQ ID
NO:6), at least
about 13 consecutive amino acid residues (e.g., SEQ ID NO:8), over at least
about 24
consecutive amino acid residues (e.g., SEQ ID NOs:4-5), over at least about 53
consecutive
amino acid residues (e.g., SEQ ID NO:3), over at least about 116 consecutive
amino acid
residues (e.g., SEQ ID NOs:1-2), and the like, or an combination thereof In
some
embodiments, a substantially identical nucleotide or protein sequence performs
substantially
the same function as the nucleotide (or encoded protein sequence) to which it
is substantially
identical.
For sequence comparison, typically one sequence acts as a reference sequence
to
which test sequences are compared. When using a sequence comparison algorithm,
test and
reference sequences are entered into a computer, subsequence coordinates are
designated if
necessary, and sequence algorithm program parameters are designated. The
sequence
comparison algorithm then calculates the percent sequence identity for the
test sequence(s)
relative to the reference sequence, based on the designated program
parameters.
Optimal alignment of sequences for aligning a comparison window are well known
to
those skilled in the art and may be conducted by tools such as the local
homology algorithm
of Smith and Waterman, the homology alignment algorithm of Needleman and
Wunsch, the
search for similarity method of Pearson and Lipman, and optionally by
computerized
implementations of these algorithms such as GAP, BESTFIT, FASTA, and TFASTA
available as part of the GCG Wisconsin Package (Accelrys Inc., San Diego,
CA). An
"identity fraction" for aligned segments of a test sequence and a reference
sequence is the
number of identical components which are shared by the two aligned sequences
divided by
the total number of components in the reference sequence segment, e.g., the
entire reference
sequence or a smaller defined part of the reference sequence. Percent sequence
identity is
represented as the identity fraction multiplied by 100. The comparison of one
or more
.. polynucleotide sequences may be to a full-length polynucleotide sequence or
a portion
thereof, or to a longer polynucleotide sequence. For purposes of this
invention "percent
identity" may also be determined using BLASTX version 2.0 for translated
nucleotide
sequences and BLASTN version 2.0 for polynucleotide sequences.
27

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
Two nucleotide sequences may also be considered substantially complementary
when
the two sequences hybridize to each other under stringent conditions. In some
embodiments,
two nucleotide sequences considered to be substantially complementary
hybridize to each
other under highly stringent conditions.
"Stringent hybridization conditions" and "stringent hybridization wash
conditions" in
the context of nucleic acid hybridization experiments such as Southern and
Northern
hybridizations are sequence dependent, and are different under different
environmental
parameters. An extensive guide to the hybridization of nucleic acids is found
in Tijssen
Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with
Nucleic
Acid Probes part I chapter 2 "Overview of principles of hybridization and the
strategy of
nucleic acid probe assays" Elsevier, New York (1993). Generally, highly
stringent
hybridization and wash conditions are selected to be about 5 C lower than the
thermal
melting point (Tm) for the specific sequence at a defined ionic strength and
pH.
The Tm is the temperature (under defined ionic strength and pH) at which 50%
of the
target sequence hybridizes to a perfectly matched probe. Very stringent
conditions are
selected to be equal to the Tm for a particular probe. An example of stringent
hybridization
conditions for hybridization of complementary nucleotide sequences which have
more than
100 complementary residues on a filter in a Southern or northern blot is 50%
formamide with
1 mg of heparin at 42 C, with the hybridization being carried out overnight.
An example of
highly stringent wash conditions is 0.1 5M NaCl at 72 C for about 15 minutes.
An example
of stringent wash conditions is a 0.2x SSC wash at 65 C for 15 minutes (see,
Sambrook,
infra, for a description of SSC buffer). Often, a high stringency wash is
preceded by a low
stringency wash to remove background probe signal. An example of a medium
stringency
wash for a duplex of, e.g., more than 100 nucleotides, is lx SSC at 45 C for
15 minutes. An
example of a low stringency wash for a duplex of, e.g., more than 100
nucleotides, is 4-6x
SSC at 40 C for 15 minutes. For short probes (e.g., about 10 to 50
nucleotides), stringent
conditions typically involve salt concentrations of less than about 1.0 M Na
ion, typically
about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3,
and the temperature
is typically at least about 30 C. Stringent conditions can also be achieved
with the addition of
destabilizing agents such as formamide. In general, a signal to noise ratio of
2x (or higher)
than that observed for an unrelated probe in the particular hybridization
assay indicates
detection of a specific hybridization. Nucleotide sequences that do not
hybridize to each
other under stringent conditions are still substantially identical if the
proteins that they encode
28

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
are substantially identical. This can occur, for example, when a copy of a
nucleotide
sequence is created using the maximum codon degeneracy permitted by the
genetic code.
A polynucleotide and/or recombinant nucleic acid construct of this invention
(e.g.,
expression cassettes and/or vectors) may be codon optimized for expression. In
some
embodiments, the polynucleotides, nucleic acid constructs, expression
cassettes, and/or
vectors of the editing systems of the invention (e.g., comprising/encoding a
sequence-specific
DNA binding domain (e.g., a sequence-specific DNA binding domain from a
polynucleoti de-
guided endonuclease, a zinc finger nuclease, a transcription activator-like
effector nuclease
(TALEN), an Argonaute protein, and/or a CRISPR-Cas endonuclease (e.g., CRISPR-
Cas
effector protein) (e.g., a Type I CRISPR-Cas effector protein, a Type II
CRISPR-Cas effector
protein, a Type III CRISPR-Cas effector protein, a Type IV CRISPR-Cas effector
protein, a
Type V CRISPR-Cas effector protein or a Type VI CRISPR-Cas effector protein)),
a
nuclease (e.g., an endonuclease (e.g., Fokl), a polynucleotide-guided
endonuclease, a
CRISPR-Cas endoltuci ease (e.g., CRISPR-Cas effector protein), a zinc finger
nuclease,
and/or a transcription activator-like effector nuclease (TALEN)), deaminase
proteins/domains
(e.g., adenine deaminase, cytosine deaminase), a polynucleotide encoding a
reverse
transcriptase protein or domain, a polynucleotide encoding a 5'-3' exonuclease
polypeptide,
and/or affinity polypeptides, peptide tags, etc.) may be codon optimized for
expression in a
plant. In some embodiments, the codon optimized nucleic acids,
polynucleotides, expression
cassettes, and/or vectors of the invention have about 70% to about 99.9%
(e.g., 70%, 71%,
72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%,
87%,
88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%. 99.9% or
100%) identity or more to the reference nucleic acids, polynucleotides,
expression cassettes,
and/or vectors that have not been codon optimized.
In any of the embodiments described herein, a polynucleotide or nucleic acid
construct of the invention may be operatively associated with a variety of
promoters and/or
other regulatory elements for expression in a plant and/or a cell of a plant.
Thus, in some
embodiments, a polynucleotide or nucleic acid construct of this invention may
further
comprise one or more promoters, introns, enhancers, and/or terminators
operably linked to
one or more nucleotide sequences. In some embodiments, a promoter may be
operably
associated with an intron (e.g., Ubil promoter and intron). In some
embodiments, a promoter
associated with an intron maybe referred to as a "promoter region" (e.g., Ubil
promoter and
intron).
By "operably linked" or "operably associated" as used herein in reference to
29

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
polynucleotides, it is meant that the indicated elements are functionally
related to each other,
and are also generally physically related. Thus, the term "operably linked" or
"operably
associated" as used herein, refers to nucleotide sequences on a single nucleic
acid molecule
that are functionally associated. Thus, a first nucleotide sequence that is
operably linked to a
second nucleotide sequence means a situation when the first nucleotide
sequence is placed in
a functional relationship with the second nucleotide sequence. For instance, a
promoter is
operably associated with a nucleotide sequence if the promoter effects the
transcription or
expression of said nucleotide sequence. Those skilled in the art will
appreciate that the
control sequences (e.g., promoter) need not be contiguous with the nucleotide
sequence to
which it is operably associated, as long as the control sequences function to
direct the
expression thereof Thus, for example, intervening untranslated, yet
transcribed, nucleic acid
sequences can be present between a promoter and the nucleotide sequence, and
the promoter
can still be considered "operably linked" to the nucleotide sequence.
As used herein, the term "linked," in reference to polypeptides, refers to the
attachment of one polypeptide to another. A polypeptide may be linked to
another
polypeptide (at the N-terminus or the C-terminus) directly (e.g., via a
peptide bond) or
through a linker.
The term "linker" is art-recognized and refers to a chemical group, or a
molecule
linking two molecules or moieties, e.g., two domains of a fusion protein, such
as, for
example, a DNA binding polypeptide or domain and peptide tag and/or a reverse
transcriptase and an affinity polypeptide that binds to the peptide tag; or a
DNA endonuclease
polypeptide or domain and peptide tag and/or a reverse transcriptase and an
affinity
polypeptide that binds to the peptide tag. A linker may be comprised of a
single linking
molecule or may comprise more than one linking molecule. In some embodiments,
the linker
can be an organic molecule, group, polymer, or chemical moiety such as a
bivalent organic
moiety. In some embodiments, the linker may be an amino acid or it may be a
peptide. In
some embodiments, the linker is a peptide.
In some embodiments, a peptide linker useful with this invention may be about
2 to
about 100 or more amino acids in length, for example, about 2, 3, 4, 5, 6, 7,
8, 9, 10, 11, 12,
13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31,
32, 33, 34, 35, 36, 37,
38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56,
57, 58, 59, 60, 61, 62,
63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81,
82, 83, 84, 85, 86, 87,
88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 or more amino acids in
length (e.g., about 2
to about 40, about 2 to about 50, about 2 to about 60, about 4 to about 40,
about 4 to about

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
50, about 4 to about 60, about 5 to about 40, about 5 to about 50, about 5 to
about 60, about 9
to about 40, about 9 to about 50, about 9 to about 60, about 10 to about 40,
about 10 to about
50, about 10 to about 60, or about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14,
15, 16, 17, 18, 19,
20, 21, 22, 23, 24, 25 amino acids to about 26, 27, 28, 29, 30, 31, 32, 33,
34, 35, 36, 37, 38,
39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57,
58, 59, 60, 61, 62, 63,
64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82,
83, 84, 85, 86, 87, 88,
89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 or more amino acids in length
(e.g., about 105,
110, 115, 120, 130, 140 150 or more amino acids in length). In some
embodiments, a peptide
linker may be a GS linker.
As used herein, the term "linked," or "fused" in reference to polynucleotides,
refers
to the attachment of one polynucleotide to another. In some embodiments, two
or more
polynucleotide molecules may be linked by a linker that can be an organic
molecule, group,
polymer, or chemical moiety such as a bivalent organic moiety. A
polynucleotide may be
linked or fused to another polynucleotide (at the 5' end or the 3' end) via a
covalent or non-
covenant linkage or binding, including e.g., Watson-Crick base-pairing, or
through one or
more linking nucleotides. In some embodiments, a polynucleotide motif of a
certain structure
may be inserted within another polynucleotide sequence (e.g. extension of the
hairpin
structure in the guide RNA). In some embodiments, the linking nucleotides may
be naturally
occurring nucleotides. In some embodiments, the linking nucleotides may be non-
naturally
occurring nucleotides.
A "promoter" is a nucleotide sequence that controls or regulates the
transcription of a
nucleotide sequence (e.g., a coding sequence) that is operably associated with
the promoter.
The coding sequence controlled or regulated by a promoter may encode a
polypeptide and/or
a functional RNA. Typically, a "promoter" refers to a nucleotide sequence that
contains a
binding site for RNA polymerase II and directs the initiation of
transcription. In general,
promoters are found 5', or upstream, relative to the start of the coding
region of the
corresponding coding sequence. A promoter may comprise other elements that act
as
regulators of gene expression; e.g., a promoter region. These include a TATA
box consensus
sequence, and often a CAAT box consensus sequence (Breathnach and Chambon,
(1981)
Annu. Rev. Biochem. 50:349). In plants, the CAAT box may be substituted by the
AGGA
box (Messing et al., (1983) in Genetic Engineering of Plants, T. Kosuge, C.
Meredith and A.
Hollaender (eds.), Plenum Press, pp. 211-227).
Promoters useful with this invention can include, for example, constitutive,
inducible,
temporally regulated, developmentally regulated, chemically regulated, tissue-
preferred
31

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
and/or tissue-specific promoters for use in the preparation of recombinant
nucleic acid
molecules, e.g., "synthetic nucleic acid constructs" or "protein-RNA complex."
These various
types of promoters are known in the art.
The choice of promoter may vary depending on the temporal and spatial
requirements
for expression, and also may vary based on the host cell to be transformed.
Promoters for
many different organisms are well known in the art. Based on the extensive
knowledge
present in the art, the appropriate promoter can be selected for the
particular host organism of
interest. Thus, for example, much is known about promoters upstream of highly
constitutively expressed genes in model organisms and such knowledge can be
readily
accessed and implemented in other systems as appropriate.
In some embodiments, a promoter functional in a plant may be used with the
constructs of this invention. Non-limiting examples of a promoter useful for
driving
expression in a plant include the promoter of the RubisCo small subunit gene 1
(PrbcS1), the
promoter of the actin gene (Pactin), the promoter of the nitrate reductase
gene (Pnr) and the
promoter of duplicated carbonic anhydrase gene 1 (Pdca1) (See, Walker et al.
Plant Cell Rep.
23:727-735 (2005); Li et al. Gene 403:132-142 (2007); Li et al. Mot Biol. Rep.
37:1143-1154
(2010)). PrbcS1 and Pactin are constitutive promoters and Pnr and Pdcal are
inducible
promoters. Pnr is induced by nitrate and repressed by ammonium (Li et al. Gene
403:132-
142 (2007)) and Pdcal is induced by salt (Li et al. Mot Biol. Rep. 37:1143-
1154 (2010)). In
some embodiments, a promoter useful with this invention is RNA polymerase II
(P0111)
promoter. In some embodiments, a U6 promoter or a 75L promoter from Zea mays
may be
useful with constructs of this invention. In some embodiments, the U6c
promoter and/or 75L
promoter from Zea mays may be useful for driving expression of a guide nucleic
acid. In
some embodiments, a U6c promoter, U6i promoter and/or 75L promoter from
Glycine max
may be useful with constructs of this invention. In some embodiments, the U6c
promoter,
U6i promoter and/or 75L promoter from Glycine max may be useful for driving
expression
of a guide nucleic acid.
Examples of constitutive promoters useful for plants include, but are not
limited to,
cestrum virus promoter (cmp) (U.S. Patent No. 7,166,770), the rice actin 1
promoter (Wang
et al. (1992) Mol. Cell. Biol. 12:3399-3406; as well as US Patent No.
5,641,876), CaMV 35S
promoter (Odell et al. (1985) Nature 313:810-812), CaMV 19S promoter (Lawton
et al.
(1987) Plant Mol. Biol. 9:315-324), nos promoter (Ebert et al. (1987) Proc.
Natl. Acad. Sci
USA 84:5745-5749), Adh promoter (Walker et al. (1987) Proc. Natl. Acad. Sci.
USA
84:6624-6629), sucrose synthase promoter (Yang & Russell (1990) Proc. Natl.
Acad. Sci.
32

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
USA 87:4144-4148), and the ubiquitin promoter. The constitutive promoter
derived from
ubiquitin accumulates in many cell types. Ubiquitin promoters have been cloned
from
several plant species for use in transgenic plants, for example, sunflower
(Binet et al., 1991.
Plant Science 79: 87-94), maize (Christensen et al., 1989. Plant Molec. Biol.
12: 619-632),
and arabidopsis (Norris et al. 1993. Plant Molec. Biol. 21:895-906). The maize
ubiquitin
promoter (UbiP) has been developed in transgenic monocot systems and its
sequence and
vectors constructed for monocot transformation are disclosed in the patent
publication EP 0
342 926. The ubiquitin promoter is suitable for the expression of the
nucleotide sequences of
the invention in transgenic plants, especially monocotyledons. Further, the
promoter
expression cassettes described by McElroy et al. (Mol. Gen. Genet. 231: 150-
160 (1991)) can
be easily modified for the expression of the nucleotide sequences of the
invention and are
particularly suitable for use in monocotyledonous hosts.
In some embodiments, tissue specific/tissue preferred promoters can be used
for
expression of a heterologous polynucleotide in a plant cell. Tissue specific
or preferred
expression patterns include, but are not limited to, green tissue specific or
preferred, root
specific or preferred, stem specific or preferred, flower specific or
preferred or pollen specific
or preferred. Promoters suitable for expression in green tissue include many
that regulate
genes involved in photosynthesis and many of these have been cloned from both
monocotyledons and dicotyledons. In one embodiment, a promoter useful with the
invention
is the maize PEPC promoter from the phosphoenol carboxylase gene (Hudspeth &
Grula,
Plant Molec. Biol. 12:579-589 (1989)). Non-limiting examples of tissue-
specific promoters
include those associated with genes encoding the seed storage proteins (such
as 0-
conglycinin, cruciferin, napin and phaseolin), zein or oil body proteins (such
as oleosin), or
proteins involved in fatty acid biosynthesis (including acyl carrier protein,
stearoyl-ACP
desaturase and fatty acid desaturases (fad 2-1)), and other nucleic acids
expressed during
embryo development (such as Bce4, see, e.g., Kridl et al. (1991) Seed Sci.
Res. 1:209-219; as
well as EP Patent No. 255378). Tissue-specific or tissue-preferential
promoters useful for the
expression of the nucleotide sequences of the invention in plants,
particularly maize, include
but are not limited to those that direct expression in root, pith, leaf or
pollen. Such promoters
are disclosed, for example, in WO 93/07278, herein incorporated by reference
in its entirety.
Other non-limiting examples of tissue specific or tissue preferred promoters
useful with the
invention the cotton rubisco promoter disclosed in US Patent 6,040,504; the
rice sucrose
synthase promoter disclosed in US Patent 5,604,121; the root specific promoter
described by
de Framond (FEBS 290:103-106 (1991); EP 0 452 269 to Ciba- Geigy); the stem
specific
33

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
promoter described in U.S. Patent 5,625,136 (to Ciba-Geigy) and which drives
expression of
the maize trpA gene; the cestrum yellow leaf curling virus promoter disclosed
in WO
01/73087; and pollen specific or preferred promoters including, but not
limited to,
ProOsLPS10 and ProOsLPS11 from rice (Nguyen et al. Plant Biotechnol. Reports
9(5):297-
306 (2015)), ZmSTK2 USP from maize (Wang et al. Genome 60(6):485-495 (2017)),
LAT52 and LAT59 from tomato (Twell et al. Development 109(3):705-713 (1990)),
Zm13
(U.S. Patent No. 10,421,972), PLA2-6 promoter from arabidopsis (U.S. Patent
No.
7,141,424), and/or the ZmC5 promoter from maize (International PCT Publication
No.
W01999/042587.
Additional examples of plant tissue-specific/tissue preferred promoters
include, but
are not limited to, the root hair¨specific cis-elements (RHEs) (Kim et al. The
Plant Cell
18:2958-2970 (2006)), the root-specific promoters RCc3 (Jeong et al. Plant
Physiol. 153:185-
197 (2010)) and RB7 (U.S. Patent No. 5459252), the lectin promoter (Lindstrom
et al. (1990)
Der. Genet. 11:160-167; and Vodkin (1983) Prog. Clin. Biol. Res. 138:87-98),
corn alcohol
dehydrogenase 1 promoter (Dennis et al. (1984) Nucleic Acids Res. 12:3983-
4000), S-
adenos,,,,,I-L-methionine symlietase (SAMS) (Vander Mijnsbrugge et al. (1996)
Plant and Cell
Physiology, 37(8):1108-1115), corn light harvesting complex promoter (Bansal
et al. (1992)
Proc. Natl. Acad. Sci. USA 89:3654-3658), corn heat shock protein promoter
(O'Dell et al.
(1985) EMBO 1 5:451-458; and Rochester et al. (1986) EMBO 5:451-458), pea
small
subunit RuBP carboxylase promoter (Cashmore, "Nuclear genes encoding the small
subunit
of ribulose-1,5-bisphosphate carboxylase" pp. 29-39 In: Genetic Engineering of
Plants
(Hollaender ed., Plenum Press 1983; and Poulsen et al. (1986) Mol. Gen. Genet.
205:193-
200), Ti plasmid mannopine synthase promoter (Langridge et al. (1989) Proc.
Natl. Acad.
Sci. USA 86:3219-3223), Ti plasmid nopaline synthase promoter (Langridge et
al. (1989),
supra), petunia chalcone isomerase promoter (van Tunen et al. (1988) EA/B0
7:1257-
1263), bean glycine rich protein 1 promoter (Keller et al. (1989) Genes Dev.
3:1639-1646),
truncated CaMV 35S promoter (O'Dell et al. (1985) Nature 313:810-812), potato
patatin
promoter (Wenzler et al. (1989) Plant Mol. Biol. 13:347-354), root cell
promoter (Yamamoto
et al. (1990) Nucleic Acids Res. 18:7449), maize zein promoter (Kriz et al.
(1987) Mol. Gen.
Genet. 207:90-98; Langridge et al. (1983) Cell 34:1015-1022; Reina et al.
(1990) Nucleic
Acids Res. 18:6425; Reina et al. (1990) Nucleic Acids Res. 18:7449; and
Wandelt et al.
(1989) Nucleic Acids Res. 17:2354), globulin-1 promoter (Belanger et al.
(1991) Genetics
129:863-872), a-tubulin cab promoter (Sullivan et al. (1989) Mol. Gen. Genet.
215:431-440),
PEPCase promoter (Hudspeth & Grula (1989) Plant Mol. Biol. 12:579-589), R gene
34

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
complex-associated promoters (Chandler et al. (1989) Plant Cell 1:1175-1183),
and chalcone
synthase promoters (Franken et al. (1991) EMBO J. 10:2605-2612).
Useful for seed-specific expression is the pea vicilin promoter (Czako et al.
(1992)
Mol. Gen. Genet. 235:33-40; as well as the seed-specific promoters disclosed
in U.S. Patent
No. 5,625,136. Useful promoters for expression in mature leaves are those that
are switched
at the onset of senescence, such as the SAG promoter from Arabidopsis (Gan et
al. (1995)
Science 270:1986-1988).
In addition, promoters functional in chloroplasts can be used. Non-limiting
examples
of such promoters include the bacteriophage T3 gene 9 5' UTR and other
promoters disclosed
in U.S. Patent No. 7,579,516. Other promoters useful with the invention
include but are not
limited to the S-E9 small subunit RuBP carboxylase promoter and the Kunitz
trypsin
inhibitor gene promoter (Kti3).
Additional regulatory elements useful with this invention include, but are not
limited
to, introns, enhancers, termination sequences and/or 5' and 3' untranslated
regions.
An intron useful with this invention can be an intron identified in and
isolated from a
plant and then inserted into an expression cassette to be used in
transformation of a plant. As
would be understood by those of skill in the art, introns can comprise the
sequences required
for self-excision and are incorporated into nucleic acid constructs/expression
cassettes in
frame. An intron can be used either as a spacer to separate multiple protein-
coding sequences
in one nucleic acid construct, or an intron can be used inside one protein-
coding sequence to,
for example, stabilize the mRNA. If they are used within a protein-coding
sequence, they are
inserted "in-frame" with the excision sites included. Introns may also be
associated with
promoters to improve or modify expression. As an example, a promoter/intron
combination
useful with this invention includes but is not limited to that of the maize
Ubil promoter and
intron (see, e.g, SEQ ID NO:196 and SEQ ID NO:197).
Non-limiting examples of introns useful with the present invention include
introns
from the ADHI gene (e.g., Adhl-S introns 1, 2 and 6), the ubiquitin gene
(Ubil), the
RuBisCO small subunit (rbcS) gene, the RuBisCO large subunit (rbcL) gene, the
actin gene
(e.g., actin-1 intron), the pyruvate dehydrogenase kinase gene (pdk), the
nitrate reductase
gene (nr), the duplicated carbonic anhydrase gene 1 (Tdcal), the psbA gene,
the atpA gene, or
any combination thereof.
In some embodiments, a polynucleotide and/or a nucleic acid construct of the
invention can be an "expression cassette" or can be comprised within an
expression cassette.
As used herein, "expression cassette" means a recombinant nucleic acid
molecule comprising,

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
for example, a one or more polynucleotides of the invention (e.g., a
polynucleotide encoding
a sequence-specific DNA binding domain, a polynucleotide encoding a deaminase
protein or
domain, a polynucleotide encoding a reverse transcriptase protein or domain, a

polynucleotide encoding a 5'-3' exonuclease polypeptide or domain, a guide
nucleic acid
and/or reverse transcriptase (RT) template), wherein polynucleotide(s) is/are
operably
associated with one or more control sequences (e.g., a promoter, terminator
and the like).
Thus, in some embodiments, one or more expression cassettes may be provided,
which are
designed to express, for example, a nucleic acid construct of the invention
(e.g., a
polynucleotide encoding a sequence-specific DNA binding domain, a
polynucleotide
encoding a nuclease polypeptide/domain, a polynucleotide encoding a deaminase
protein/domain, a polynucleotide encoding a reverse transcriptase
protein/domain, a
polynucleotide encoding a 5'-3' exonuclease polypeptide/domain, a
polynucleotide encoding
a peptide tag, and/or a polynucleotide encoding an affinity polypeptide, and
the like, or
comprising a guide nucleic acid, an extended guide nucleic acid, and/or RT
template, and the
like). When an expression cassette of the present invention comprises more
than one
polynucleotide, the polynucleotides may be operably linked to a single
promoter that drives
expression of all of the polynucleotides or the polynucleotides may be
operably linked to one
or more separate promoters (e.g., three polynucleotides may be driven by one,
two or three
promoters in any combination). When two or more separate promoters are used,
the
__ promoters may be the same promoter or they may be different promoters.
Thus, a
polynucleotide encoding a sequence specific DNA binding domain, a
polynucleotide
encoding a nuclease protein/domain, a polynucleotide encoding a CRISPR-Cas
effector
protein/domain, a polynucleotide encoding an deaminase protein/domain, a
polynucleotide
encoding a reverse transcriptase polypeptide/domain (e.g., RNA-dependent DNA
polymerase), and/or a polynucleotide encoding a 5'-3' exonuclease
polypeptide/domain, a
guide nucleic acid, an extended guide nucleic acid and/or RT template when
comprised in a
single expression cassette may each be operably linked to a single promoter,
or separate
promoters in any combination.
An expression cassette comprising a nucleic acid construct of the invention
may be
chimeric, meaning that at least one of its components is heterologous with
respect to at least
one of its other components (e.g., a promoter from the host organism operably
linked to a
polynucleotide of interest to be expressed in the host organism, wherein the
polynucleotide of
interest is from a different organism than the host or is not normally found
in association with
36

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
that promoter). An expression cassette may also be one that is naturally
occurring but has
been obtained in a recombinant form useful for heterologous expression.
An expression cassette can optionally include a transcriptional and/or
translational
termination region (i.e., termination region) and/or an enhancer region that
is functional in the
selected host cell. A variety of transcriptional terminators and enhancers are
known in the art
and are available for use in expression cassettes. Transcriptional terminators
are responsible
for the termination of transcription and correct mRNA polyadenylation. A
termination region
and/or the enhancer region may be native to the transcriptional initiation
region, may be
native to, for example, a gene encoding a sequence-specific DNA binding
protein, a gene
encoding a nuclease, a gene encoding a reverse transcriptase, a gene encoding
a deaminase,
and the like, or may be native to a host cell, or may be native to another
source (e.g., foreign
or heterologous to, for example, to a promoter, to a gene encoding a sequence-
specific DNA
binding protein, a gene encoding a nuclease, a gene encoding a reverse
transcriptase, a gene
encoding a deaminase, and the like, or to the host cell, or any combination
thereof).
An expression cassette of the invention also can include a polynucleotide
encoding a
selectable marker, which can be used to select a transformed host cell. As
used herein,
"selectable marker" means a polynucleotide sequence that when expressed
imparts a distinct
phenotype to the host cell expressing the marker and thus allows such
transformed cells to be
distinguished from those that do not have the marker. Such a polynucleotide
sequence may
encode either a selectable or screenable marker, depending on whether the
marker confers a
trait that can be selected for by chemical means, such as by using a selective
agent (e.g., an
antibiotic and the like), or on whether the marker is simply a trait that one
can identify
through observation or testing, such as by screening (e.g., fluorescence).
Many examples of
suitable selectable markers are known in the art and can be used in the
expression cassettes
described herein.
In addition to expression cassettes, the nucleic acid molecules/constructs and

polynucleotide sequences described herein can be used in connection with
vectors. The term
"vector" refers to a composition for transferring, delivering or introducing a
nucleic acid (or
nucleic acids) into a cell. A vector comprises a nucleic acid construct (e.g.
expression
cassette(s)) comprising the nucleotide sequence(s) to be transferred,
delivered or introduced.
Vectors for use in transformation of host organisms are well known in the art.
Non-limiting
examples of general classes of vectors include viral vectors, plasmid vectors,
phage vectors,
phagemid vectors, cosmid vectors, fosmid vectors, bacteriophages, artificial
chromosomes,
minicircles, or Agrobacterium binary vectors in double or single stranded
linear or circular
37

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
form which may or may not be self-transmissible or mobilizable. In some
embodiments, a
viral vector can include, but is not limited, to a retroviral, lentiviral,
adenoviral, adeno-
associated, or herpes simplex viral vector. A vector as defined herein can
transform a
prokaryotic or eukaryotic host either by integration into the cellular genome
or exist
extrachromosomally (e.g. autonomous replicating plasmid with an origin of
replication).
Additionally included are shuttle vectors by which is meant a DNA vehicle
capable, naturally
or by design, of replication in two different host organisms, which may be
selected from
actinomycetes and related species, bacteria and eukaryotic (e.g. higher plant,
mammalian,
yeast or fungal cells). In some embodiments, the nucleic acid in the vector is
under the
control of, and operably linked to, an appropriate promoter or other
regulatory elements for
transcription in a host cell. The vector may be a bi-functional expression
vector which
functions in multiple hosts. In the case of genomic DNA, this may contain its
own promoter
and/or other regulatory elements and in the case of cDNA this may be under the
control of an
appropriate promoter and/or other regulatory elements for expression in the
host cell.
Accordingly, a nucleic acid or polynucleotide of this invention and/or
expression cassettes
comprising the same may be comprised in vectors as described herein and as
known in the
art.
As used herein, "contact," "contacting," "contacted," and grammatical
variations
thereof, refer to placing the components of a desired reaction together under
conditions
.. suitable for carrying out the desired reaction (e.g., transformation,
transcriptional control,
genome editing, nicking, and/or cleavage). As an example, a target nucleic
acid may be
contacted with a sequence-specific DNA binding protein (e.g., polynucleotide-
guided
eitdon uclease, a CR ISPR-Cas endonucl ease (e.g., CR ISPR-Cas effector
protein), a zinc finger
nuclease, a transcription activator-like effector nuclease (TALEN) and/or an
Argonaute
.. protein)) and a deaminase or a nucleic acid construct encoding the same,
under conditions
whereby the sequence-specific DNA binding protein, the reverse transcriptase
and the
deaminase are expressed and the sequence-specific DNA binding protein binds to
the target
nucleic acid, and the reverse transcriptase and/or deaminase may be fused to
either the
sequence-specific DNA binding protein or recruited to the sequence-specific
DNA binding
protein (via, for example, a peptide tag fused to the sequence-specific DNA
binding protein
and an affinity tag fused to the reverse transcriptase and/or deaminase) and
thus, the
deaminase and/or reverse transcriptase is positioned in the vicinity of the
target nucleic acid,
thereby modifying the target nucleic acid. Other methods for recruiting
reverse transcriptase
and/or deaminase may be used that take advantage of other protein-protein
interactions, and
38

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
also RNA-protein interactions and chemical interactions may be used for
protein-protein and
protein-nucleic acid recruitment.
As used herein, "modifying," "modification," "mutating" or "mutation" (which
terms
may be used interchangeably herein) in reference to a target nucleic acid
includes editing
(e.g., mutating), covalent modification, exchanging/substituting nucleic
acids/nucleotide
bases, deleting, cleaving, nicking, and/or altering transcriptional control of
a target nucleic
acid. In some embodiments, a modification may include one or more single base
changes
(SNPs) of any type.
"Introducing," "introduce," "introduced" (and grammatical variations thereof)
in the
context of a polynucleotide of interest means presenting a nucleotide sequence
of interest
(e.g., polynucleotide, RT template, a nucleic acid construct, and/or a guide
nucleic acid) to a
plant, plant part thereof, or cell thereof, in such a manner that the
nucleotide sequence gains
access to the interior of a cell.
The terms "transformation" or transfection" may be used interchangeably and as
used
herein refer to the introduction of a heterologous nucleic acid into a cell.
Transformation of a
cell may be stable or transient. Thus, in some embodiments, a host cell or
host organism
(e.g., a plant) may be stably transformed with a polynucleotide/nucleic acid
molecule of the
invention. In some embodiments, a host cell or host organism may be
transiently transformed
with a polynucleotide/nucleic acid molecule of the invention.
"Transient transformation" in the context of a polynucleotide means that a
polynucleotide is introduced into the cell and does not integrate into the
genome of the cell.
By "stably introducing" or "stably introduced" in the context of a
polynucleotide
introduced into a cell is intended that the introduced polynucleotide is
stably incorporated
into the genome of the cell, and thus the cell is stably transformed with the
polynucleotide.
"Stable transformation" or "stably transformed" as used herein means that a
nucleic
acid molecule is introduced into a cell and integrates into the genome of the
cell. As such,
the integrated nucleic acid molecule is capable of being inherited by the
progeny thereof,
more particularly, by the progeny of multiple successive generations. "Genome"
as used
herein includes the nuclear and the plastid genome, and therefore includes
integration of the
nucleic acid into, for example, the chloroplast or mitochondrial genome.
Stable
transformation as used herein can also refer to a transgene that is maintained

extrachromasomally, for example, as a minichromosome or a plasmid.
Transient transformation may be detected by, for example, an enzyme-linked
immunosorbent assay (ELISA) or Western blot, which can detect the presence of
a peptide or
39

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
polypeptide encoded by one or more transgene introduced into an organism.
Stable
transformation of a cell can be detected by, for example, a Southern blot
hybridization assay
of genomic DNA of the cell with nucleic acid sequences which specifically
hybridize with a
nucleotide sequence of a transgene introduced into an organism (e.g., a
plant). Stable
.. transformation of a cell can be detected by, for example, a Northern blot
hybridization assay
of RNA of the cell with nucleic acid sequences which specifically hybridize
with a nucleotide
sequence of a transgene introduced into a host organism. Stable transformation
of a cell can
also be detected by, e.g., a polymerase chain reaction (PCR) or other
amplification reactions
as are well known in the art, employing specific primer sequences that
hybridize with target
.. sequence(s) of a transgene, resulting in amplification of the transgene
sequence, which can be
detected according to standard methods Transformation can also be detected by
direct
sequencing and/or hybridization protocols well known in the art.
Accordingly, in some embodiments, nucleotide sequences, polynucleotides,
nucleic
acid constructs, and/or expression cassettes of the invention may be expressed
transiently
and/or they can be stably incorporated into the genome of the host organism.
Thus, in some
embodiments, a nucleic acid construct of the invention (e.g., one or more
expression cassettes
comprising polynucleotides for editing as described herein) may be transiently
introduced
into a cell with a guide nucleic acid and as such, no DNA is maintained in the
cell.
A nucleic acid construct of the invention may be introduced into a plant cell
by any
method known to those of skill in the art. Non-limiting examples of
transformation methods
include transformation via bacterial-mediated nucleic acid delivery (e.g., via
Agrobacteria),
viral-mediated nucleic acid delivery, silicon carbide or nucleic acid whisker-
mediated nucleic
acid delivery, liposome mediated nucleic acid delivery, microinjection,
microparticle
bombardment, calcium-phosphate-mediated transformation, cyclodextrin-mediated
.. transformation, electroporation, nanoparticle-mediated transformation,
sonication,
infiltration, PEG-mediated nucleic acid uptake, as well as any other
electrical, chemical,
physical (mechanical) and/or biological mechanism that results in the
introduction of nucleic
acid into the plant cell, including any combination thereof. Procedures for
transforming both
eukaryotic and prokaryotic organisms are well known and routine in the art and
are described
.. throughout the literature (See, for example, Jiang et al. 2013. Nat.
Biotechnol. 31:233-239;
Ran et al. Nature Protocols 8:2281-2308 (2013 ) ). General guides to various
plant
transformation methods known in the art include Miki et al. ("Procedures for
Introducing
Foreign DNA into Plants" in Methods in Plant Molecular Biology and
Biotechnology, Glick,

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
B. R. and Thompson, J. E., Eds. (CRC Press, Inc., Boca Raton, 1993), pages 67-
88) and
Rakowoczy-Trojanowska (Cell. Mol. Biol. Lett. 7:849-858 (2002)).
In some embodiments of the invention, transformation of a cell may comprise
nuclear
transformation. In other embodiments, transformation of a cell may comprise
plastid
__ transformation (e.g., chloroplast transformation). In still further
embodiments, nucleic acids
of the invention may be introduced into a cell via conventional breeding
techniques. In some
embodiments, one or more of the polynucleotides, expression cassettes and/or
vectors may be
introduced into a plant cell via Agrobacterium transformation.
A polynucleotide therefore can be introduced into a plant, plant part, plant
cell in any
number of ways that are well known in the art. The methods of the invention do
not depend
on a particular method for introducing one or more nucleotide sequences into a
plant, only
that they gain access to the interior the cell. Where more than polynucleotide
is to be
introduced, they can be assembled as part of a single nucleic acid construct,
or as separate
nucleic acid constructs, and can be located on the same or different nucleic
acid constructs.
Accordingly, the polynucleotide can be introduced into the cell of interest in
a single
transformation event, or in separate transformation events, or, alternatively,
a polynucleotide
can be incorporated into a plant as part of a breeding protocol.
Yield (bushels/acre) in, for example, corn has increased steadily through high

intensity breeding. However, incremental increases in yield have recently
started to plateau
and require large investments in field evaluation and breeding to clearly
demonstrate genetic
gain. New approaches to genetic modification are required to deliver
significant
improvements to yield that are not possible through traditional methods. Crop
yields may be
improved in two fundamentally distinct ways: 1) yield improvements per se,
where an
engineered plant has gained an advantage such as improved photosynthesis or
optimized
carbohydrate partitioning or 2) the removal of vestigial survival mechanisms
that are not
consistent with high production agriculture. Shade avoidance response (SAR) or
shade
avoidance syndrome (SAS) is such a survival mechanism. SAS/SAR is
characterized by
increased root/shoot ratio, increased plant height, and reduced individual
plant yield and in a
typical monoculture crop setting, this response to competition a wasteful
survival mechanism.
Thus, the present invention addresses problems associated with increased
planting
density tolerance (see, Fig. 3) and reduction in yield loss (on an acre basis)
due to planting
variability. The present invention describes the use of gene editing to modify
key regulatory
factors that trigger shade avoidance in crops (e.g., dominant negative
mutants) (see, e.g., Fig.
4). Plants with such edited genomes will have reduced shade avoidance
capability. An
41

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
example of a mutation useful for addressing (e.g., reducing/attenuating)
SAR/SAS, may be
one that removes the DNA binding function of transcription factors that
dimerize to be
functional. In in some cases such a mutation may be a dominant negative
mutation (see, e.g.,
Fig. 4).
The HD (homeodomain)-LZ (leucine zipper) (HD-Zip) class of transcription
factors
have multiple functions within the plant. The Type II class have been
associated with light
perception and shade avoidance. One specific HD-LZ Class II member (HB53) has
been
shown to be induced by shade treatment. In maize, ZmHB53 is the closest maize
homolog of
ATHB2, an HDLZ closely associated with the Shade Avoidance Response in
Arabidopsis
(Carabelli et al., 1996; Steindler et al., 1999) (see, e.g., Fig. 5). A
closely related HDLZ
protein, ZmHB78, has been identified as a further target for attenuating shade
avoidance.
One approach for attenuating the DNA binding capability of a transcription
factor useful with
this invention (e.g., HB53, HB78) can include modifying individual amino acids
(deletions,
insertions or substitutions) or removing all or a portion of the DNA binding
domain through
an in-frame deletion.
An alignment of HD-Zip (HB78) amino acid sequences from 44 different plant
species is provided in Fig. 1. The sequences shown in Fig. 1 are a portion of
consecutive
amino acids (53 amino acid residues) from the full length HB78 sequences. Also
provided is
an alignment of HD-Zip (HB53) amino acid sequences from 45 different plant
species (Fig.
2). The sequences shown in Fig. 2 are a portion of consecutive amino acid
residues (e.g.,
about 116 amino acid residues) from the full length HB53 sequences. These
alignments
show that there is substantial conservation in the targeted region of these
two genes and
demonstrate that the present invention of targeting an endogenous Homeodomain-
leucine
zipper (HD-Zip) transcription factor, wherein the mutation disrupts the
binding of the HD-
Zip transcription factor to DNA, would be predicted to work across diverse
plant species to
produce a plant having an attenuated Shade Avoidance Response.
Examples of possible editing of these genes are provided in Figs. 6, 7, 8 and
9. Fig. 6
provides an example of editing of the DNA binding domain of HB53 in Z. mays
and shows
target example amino acid residues for modification in the boxes. Fig. 7
provides an
annotated HB78 gene with example guide nucleic acids and Fig. 8 provides an
example of a
deletion in an HB78 gene (SEQ ID NO:295) showing that a deletion (SEQ ID
NO:297) in,
for example, Exon 2 results in a truncation that deletes Exon 3, Exon 4 and
the DNA binding
domain resulting in a deletion in the protein sequence (SEQ ID NO:296). Fig. 9
provides a
representative genomic sequence (coding strand (SEQ ID NO:298) and non-coding
strand
42

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
(SEQ ID NO:299)) of an edited plant showing a premature stop upstream of the
HB78 DNA
binding domain. Fig. 13 provides a schematic of exemplary targeting of HB53
(upper
schematic) and HB78 (lower schematic) using plasmids pWISE443, pWISE444,
pWISE446
and pWISE447 with their corresponding spacers. Plasmid pWISE448 contains all
four
.. spacers shown for HB78 and plasmid pWISE445 contains all four spacers shown
for HB53,
while plasmid pWISE451 contains all eight spacers (four for HB53 and four for
HB78, as
shown in Fig. 13).
In some embodiments, the present invention provides a plant or plant part
thereof
comprising at least one non-natural mutation in an endogenous Homeodomain-
leucine zipper
(HD-Zip) transcription factor, wherein the mutation disrupts the binding of
the HD-Zip
transcription factor to DNA. In some embodiments, an HD-Zip transcription
factor may be
an HD-Zip Type II (HD-Zip II) transcription factor, wherein the HD-Zip II
transcription
factor is capable of regulating response to illumination in the plant (e.g.,
regulating the shade
avoidance response (SAR)). In some embodiments, an HD-Zip II transcription
factor useful
.. with this invention may include, but is not limited to, an ortholog of
AtHB2, HB53, and/or
HB78. In some embodiments, an HD-Zip II transcription factor useful with this
invention
may be HOMEOBOX PROTEIN 53 (HB53) or HOMEOBOX PROTEIN 78 (HB78). An
HD-Zip transcription factor useful with this invention is an HD-Zip
transcription factor that
comprises: (a) a polypeptide comprising a sequence having at least 80%
sequence identity
(e.g., at least about 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93,
94, 95, 96, 97, 99 or
100% sequence identity) to the amino acid sequence of SEQ ID NO:38 or SEQ ID
NO:83;
(b) a polypeptide comprising a sequence having at least 80% sequence identity
(e.g., at least
about 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97,
99 or 100%
sequence identity) to the amino acid sequence of:
.. RKKLRLSKDQSAVLEDSFREHPTLNPRQKAALA QQLGLRPRQVEVWF QNRRARTKLK
QTEVDCEYLKRCCETLTEENRRLQKEVQELRALKLV SPHLYMHMSPPTTLTMCP SCER
V (SEQ ID NO:!) (Zea mays HB53) or
RKKLRLSKDQAAVLEESFKEHNTLNPKQKAALAKQLNLKPRQ VEVWFQNRRARTKL
KQTEVDCEFLKRCCETLTEENRRLQREVAELRVLKLV APHRY ARMPPPTTLTMCP SCE
RL SEQ ID NO:2) (Zea mays HB78); (c) a polypeptide comprising a sequence
having at
least 80% sequence identity (e.g., at least about 80, 81, 82, 83, 84, 85, 86,
87, 88, 89, 90, 91,
92, 93, 94, 95, 96, 97, 99 or 100% sequence identity) to the amino acid
sequence of
LAKQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTEENRRLQREV (SEQ
ID NO:3); (d) a polypeptide comprising a sequence having at least 95% sequence
identity
43

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
(e.g., at least about 95, 96, 97, 99, 99.5, or 100% sequence identity) to the
nucleotide
sequence of RQVEVWFQNRRARTKLKQTEVDCE (SEQ ID NO:4); (e) a polypeptide
comprising a sequence having the amino acid sequence of
RQVEVWFQNRRARTKXKQTEVDCE (SEQ ID NO:5), wherein X is L or S; (f) a
polypeptide comprising: (i) sequence having the amino acid sequence of
RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S or
A;
(ii) a sequence having the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID
NO:8),
wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a
sequence having
the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H,
X2 is
R or K and X3 is V or L; and (iv) a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9); and/or (g) a polypeptide comprising a sequence having
the
amino acid sequence of VWFQNRRA (SEQ ID NO:9)..
In some embodiments, a plant or plant part of the invention comprises an HD-
Zip
transcription factor that comprises: (a) a polypeptide comprising a sequence
having at least
80% sequence identity (e.g., at least about 80, 81, 82, 83, 84, 85, 86, 87,
88, 89, 90, 91, 92,
93, 94, 95, 96, 97, 99 or 100% sequence identity) to the amino acid sequence
of SEQ ID
NO:38 or SEQ ID NO:83; (b) a polypeptide comprising a sequence having at least
80%
sequence identity (e.g., at least about 80, 81, 82, 83, 84, 85, 86, 87, 88,
89, 90, 91, 92, 93, 94,
95, 96, 97, 99 or 100% sequence identity) to the amino acid sequence of:
RKKLRLSKDQSAVLEDSFREHPTLNPRQKAALA QQLGLRPRQVEVWFQNRRARTKLK
QTEVDCEYLKRCCETLTEENRRLQKEVQELRALKLV SPHLYMHMSPPTTLTMCP SCER
V (SEQ ID NO:!) (Zea mays HB53) or
RKKLRLSKDQAAVLEESFKEHNTLNPKQKAALAKQLNLKPRQ VEVWFQNRRARTKL
KQTEVDCEFLKRCCETLTEENRRLQREV AELRVLKLV APHRY ARMPPPTTLTMCP SCE
RL SEQ ID NO:2) (Zea mays HB78); (c) a polypeptide comprising a sequence
having at
least 80% sequence identity (e.g., at least about 80, 81, 82, 83, 84, 85, 86,
87, 88, 89, 90, 91,
92, 93, 94, 95, 96, 97, 99 or 100% sequence identity) to the amino acid
sequence of
LAKQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTEENRRLQREV (SEQ
ID NO:3); (d) a polypeptide comprising a sequence having at least 95% sequence
identity
(e.g., at least about 95, 96, 97, 99, 99.5, or 100% sequence identity) to the
nucleotide
sequence of RQVEVWFQNRRARTKLKQTEVDCE (SEQ ID NO:4); (e) a polypeptide
comprising a sequence having the amino acid sequence of
RQVEVWFQNRRARTKXKQTEVDCE (SEQ ID NO:5), wherein X is L or S; (f) a
polypeptide comprising: (i) sequence having the amino acid sequence of
44

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S or
A;
(ii) a sequence having the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID
NO:8),
wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a
sequence having
the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H,
X2 is
R or K and X3 is V or L; and (iv) a sequence having the amino acid sequence of
VWFQNRRA (SEQ ID NO:9); and/or (g) a polypeptide comprising a sequence having
the
amino acid sequence of VWFQNRRA (SEQ ID NO:9). Accordingly, an HD-Zip
transcription factor useful with this invention may comprise the amino acid
sequence of any
one of SEQ ID NOs: 1-8 or 10-98, which comprises a DNA binding domain
comprising the
amino acid sequence of VWFQNRRA (SEQ ID NO:9). As an example, residues 173-288
of
the HD-Zip transcription factor comprising the amino acid sequence of SEQ ID
NO:38
comprise the amino acid sequence of SEQ ID NO:9. As a further example,
residues 76-191of
the HD-Zip transcription factor comprising the amino acid sequence of SEQ ID
NO:83
comprises the amino acid sequence of SEQ ID NO:9. In addition, to SEQ ID NO:9,
other
polypeptide domains identified in HD-Zip transcription factors useful with
this invention
include polypeptides comprising at least 80% sequence identity to the amino
acid sequence of
SEQ ID NO:! and/or SEQ ID NO:2, polypeptides comprising a sequence having at
least
80% sequence identity to the amino acid sequence of SEQ ID NO:3), polypeptides

comprising a sequence having at least 95% sequence identity to the nucleotide
sequence of
SEQ ID NO:4, polypeptides comprising a sequence having the amino acid sequence
of SEQ
ID NO:5, wherein X is L or S, polypeptides comprising a sequence having the
amino acid
sequence of SEQ ID NO:6, wherein Xi is S or T, X2 is D or E and X3 is S or A,
polypeptides
comprising a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ ID
NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L, and/or
polypeptides comprising
.. a sequence having the amino acid sequence of SEQ ID NO:8, wherein Xi is P
or A, X2 is T
or A, X3 1S V or M and X4is Q, S or N.
In some embodiments, the at least one non-natural mutation in an endogenous
Homeodomain-leucine zipper (HD-Zip) transcription factor in a plant may be a
substitution, a
deletion and/or an insertion that disrupts the binding of the HD-Zip
transcription factor to
DNA. For example, the mutation may be a substitution, a deletion and/or an
insertion of one
or more amino acid residues of the transcription factor. The at least one non-
natural mutation
may comprise a base substitution to an A, a T, a G, or a C, which results in
an amino acid
substitution, thereby disrupting the binding of the HD-Zip transcription
factor to DNA. In
some embodiments, the at least one non-natural mutation in an endogenous gene
encoding an

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
HD-Zip transcription factor may comprise a deletion. Such a deletion may
comprise, for
example, a deletion of all or a portion of the DNA binding domain (e.g., a
deletion of at least
1, 2, 3, 4, 5, 6, 7, or 8 amino acid residues of SEQ ID NO:9 (VWFQNRRA) of the
HD-Zip
transcription factor. In some embodiments, the deletion may be a truncation
that includes the
portion of consecutive amino acid residues of SEQ ID NO:9 (e.g., at least 2,
3,4, 5, 6, 7, or 8
consecutive amino acid residues). In some embodiments, the deletion may be a
truncation
that includes at least a portion of consecutive amino acid residues of SEQ ID
NO:9 (e.g., at
least 2, 3, 4, 5, 6, 7, or 8 consecutive amino acid residues). Thus, a
deletion may be about 1
amino acid residue to about 120 amino acid residues, or more in length (e.g.,
at least 1, 2, 3,
4, 5, 6, 7, 8,9, 10, 11, 12, 13, 14, 15, 20, 21, 22, 23, 24, 25, 26, 27, 28,
29, 30, 31, 32, 33, 34,
35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53,
54, 55, 56, 57, 58, 59,
60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78,
79, 80, 81, 82, 83, 84,
85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102,
103, 104, 105, 106,
107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, or more
consecutive
amino acid residues) up to the full length of the HD-Zip polypeptide, wherein
the deletion
includes at least a portion of consecutive amino acid residues of SEQ ID NO:9.
In some
embodiments, a deletion produces a truncated HD-Zip transcription factor that
includes a
deletion of at least a portion of consecutive amino acid residues of SEQ ID
NO:9. In some
embodiments, a deletion in an HD-Zip transcription factor polynucleotide may
result in a
premature stop codon that generates a truncated HD-Zip transcription factor,
optionally a
truncation at the C-terminus of the HD-Zip transcription factor, wherein at
least a portion of
consecutive amino acid residues of SEQ ID NO:9 are deleted.
A non-natural mutation in an endogenous gene encoding a HD-Zip transcription
factor mutation useful with this invention may be a dominant recessive
mutation. A
dominant negative may remove the DNA binding function of the transcription
factor that
dimerizes to be functional. The transcription factor can still dimerize but
will have lost the
ability to bind to the regulatory regions of downstream genes and therefore
will be non-
functional. Thus, by removing the DNA binding capability of a bifunctional
protein,
dimerized complex will not activate gene expression (see, e.g., Fig. 5).
In some embodiments, a plant cell comprising an editing system is provided,
the
editing system comprising: (a) a CRISPR-associated effector protein; and (c) a
guide nucleic
acid (gRNA, gDNA, crRNA, crDNA) having a spacer sequence with complementarity
to an
endogenous target gene encoding a wild type HD-Zip transcription factor. The
wild type
HD-Zip transcription factor may be any HD-Zip transcription factor involved in
the shade
46

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
avoidance response. In some embodiments, the HD-Zip transcription factor may
be a type
HD-Zip II transcription factor, optionally a HB53 transcription factor or a
HB78 transcription
factor. In some embodiments, the HD-Zip transcription factor gene to which the
spacer
sequence of the guide nucleic acid shares complementarity may encode (a) a
polypeptide
comprising a sequence having at least 80% sequence identity to the amino acid
sequence of
SEQ ID NO:38 or SEQ ID NO:83; (b) a polypeptide comprising a sequence having
at least
80% sequence identity to the amino acid sequence of: SEQ ID NO:! or SEQ ID
NO:2; (c) a
polypeptide comprising a sequence having at least 80% sequence identity to the
amino acid
sequence of SEQ ID NO:3; (d) a polypeptide comprising a sequence having at
least 95%
sequence identity to the nucleotide sequence of SEQ ID NO:4; (e) a polypeptide
comprising
a sequence having the amino acid sequence of SEQ ID NO:5, wherein X is L or S;
(f) a
polypeptide comprising: (i) sequence having the amino acid sequence of
RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S or
A;
(ii) a sequence having the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID
NO:8),
wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a
sequence having
the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H,
X2 is
R or K and X3 is V or L; and (iv) a sequence having the amino acid sequence of

VWFQNRRA (SEQ ID NO:9); and/or (g) a polypeptide comprising a sequence having
the
amino acid sequence of VWFQNRRA (SEQ ID NO:9). In some embodiments, a spacer
sequence of a guide nucleic acid of an editing system of this invention may
comprise a
nucleotide sequence of any one of SEQ ID NOs:175 to 182. In some embodiments,
the
nucleic acid binding domain of an editing system useful with this invention
may be from a
polynucleotide-guided endonuclease, a CRT SPR-Cas endonuelease (e.g., CRT SPR-
Cas
effector protein), a zinc finger nuclease, a transcription activator-like
effector nuclease
(TALEN) and/or an Argonaute protein. In some embodiments, a plant cell edited
as
described herein may be regenerated into a plant, thereby providing a plant
with a mutation in
an HD-Zip transcription factor that is involved in the shade avoidance
response and having an
attenuated shade avoidance response.
In some embodiments, the invention provides a plant cell comprising at least
one non-
naturally occurring mutation (e.g., 1, 2, 3, 4, 5, or more mutation) to a DNA
binding site of an
HD-Zip transcription factor gene that prevents or reduces binding of the
encoded HD-Zip
transcription factor to DNA, wherein the mutation is a substitution, insertion
and/or a deletion
that is introduced using an editing system that comprises a nucleic acid
binding domain that
binds to a target site in the HD-Zip transcription factor gene, and wherein
the HD-Zip
47

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
transcription factor gene encodes: (a) a polypeptide comprising a sequence
having at least
80% sequence identity to the amino acid sequence of SEQ ID NO:38 or SEQ ID
NO:83; (b)
a polypeptide comprising a sequence having at least 80% sequence identity to
the amino acid
sequence of: SEQ ID NO:! or SEQ ID NO:2; (c) a polypeptide comprising a
sequence
having at least 80% sequence identity to the amino acid sequence of SEQ ID
NO:3; (d) a
polypeptide comprising a sequence having at least 95% sequence identity to the
nucleotide
sequence of SEQ ID NO:4; (e) a polypeptide comprising a sequence having the
amino acid
sequence of SEQ ID NO:5, wherein X is L or S; (f) a polypeptide comprising:
(i) sequence
having the amino acid sequence of RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S
or
T, X2 is D or E and X3 is S or A; (ii) a sequence having the amino acid
sequence of PX1X2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V or M
and X4 is
Q, S or N; (iii) a sequence having the amino acid sequence of ENRRLX1X2EX3,
(SEQ ID
NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and (iv) a
sequence having the
amino acid sequence of VWFQNRRA (SEQ ID NO:9); and/or (g) a polypeptide
comprising
a sequence having the amino acid sequence of VWFQNRRA (SEQ ID NO:9).
In some embodiments, a plant or part thereof is provided, the plant or part
thereof
comprising a mutation in an endogenous HD-Zip transcription factor, which
mutation reduces
DNA binding by the endogenous HD-Zip transcription factor, wherein the
endogenous HD-
Zip transcription factor comprises a polypeptide comprising a sequence having
at least 80%
sequence identity to the amino acid sequence of SEQ ID NO:! or SEQ ID NO:2;
wherein
the mutation is a deletion, substitution, and/or insertion of at least one
amino acid residue of
amino acid residues 45-52 (VWFQNRRA) (SEQ ID NO:9)) of the amino acid sequence
of
SEQ ID NO:! or SEQ ID NO:2. In some embodiments, the mutation of the at least
one
amino acid residue of amino acid residues 45-52 of the amino acid sequence of
SEQ ID
NO:! or SEQ ID NO:2 is made following cleavage by a nuclease comprising a DNA-
binding domain that binds to a target site within a target nucleic acid
encoding a sequence
having at least 80% sequence identity to the amino acid sequence of SEQ ID
NO:!) (Zea
mays HB53) or SEQ ID NO:2) (Zea mays HB78).
A mutation of an endogenous HD-Zip transcription factor in a plant or part
thereof
.. may be an insertion, substitution and/or a deletion of at least one amino
acid. In some
embodiments, the mutation may comprise a deletion of all or a portion (e.g., a
deletion of at
least 1, 2, 3, 4, 5, 6, 7, or 8 amino acid residues of SEQ ID NO:9 (VWFQNRRA)
of a DNA
binding domain within the endogenous HD-Zip transcription factor.
48

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
In some embodiments, non-limiting examples of a plant or part thereof include
corn,
soy, canola, wheat, rice, cotton, sugarcane, sugar beet, barley, oats,
alfalfa, sunflower,
safflower, oil palm, sesame, coconut, tobacco, potato, sweet potato, cassava,
coffee, apple,
plum, apricot, peach, cherry, pear, fig, banana, citrus, cocoa, avocado,
olive, almond, walnut,
strawberry, watermelon, pepper, grape, tomato, cucumber, blackberry,
raspberry, black
raspberry or a Brassica spp. In some embodiments, the plant part may be a cell
from a plant
that includes but is not limited to, corn, soy, canola, wheat, rice, cotton,
sugarcane, sugar
beet, barley, oats, alfalfa, sunflower, safflower, oil palm, sesame, coconut,
tobacco, potato,
sweet potato, cassava, coffee, apple, plum, apricot, peach, cherry, pear, fig,
banana, citrus,
cocoa, avocado, olive, almond, walnut, strawberry, watermelon, pepper, grape,
tomato,
cucumber, blackberry, raspberry, black raspberry or a Brassica spp. In some
embodiments, a
plant may be regenerated from a cell or plant part of this invention. A plant
of this invention
comprising at least one mutation in an HD-Zip transcription factor comprises
an attenuated
Shade Avoidance Response (SAR).
In some embodiments, the invention provides a plant or plant part thereof
comprising
an HD-Zip transcription factor gene that comprises the nucleotide sequence of
SEQ ID
NO:202 and/or that encodes an amino acid sequence of any one of SEQ ID NO:201.
In
some embodiments, the invention provides a corn plant or plant part thereof
comprising an
HD-Zip transcription factor gene that comprises the nucleotide sequence of SEQ
ID NO:202
and/or that encodes an amino acid sequence of any one of SEQ ID NO:201.
The invention further provides a method of producing/breeding a transgene-free

genome-edited (e.g., base-edited) plant, comprising: (a) crossing a plant of
the present
invention with a transgene free plant, thereby introducing the mutation or
modification from
the plant of the present invention into the plant that is transgene-free; and
(b) selecting a
progeny plant that comprises the mutation or modification but is transgene-
free, thereby
producing a transgene free genome-edited (e.g., base-edited) plant.
In some embodiments, a method of providing a plurality of plants having
increased
yield when each plant of the plurality of plants is planted in close proximity
to one another is
provided, the method comprising planting two or more plants of the present
invention in
close proximity to one another, thereby providing a plurality of plants having
increased yield
as compared to a plurality of control plants (e.g., plants not having an
edited HD-Zip
transcription factor gene and reduced SAR) planted in close proximity to one
another.
"Close proximity" refers to a high planting density for any particular plant
species that
can result in SAR. For example, in some embodiments, "close proximity"
includes a density
49

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
of plants resulting from planting seeds of the plant about 6.1 inches or less
apart (e.g., about
6.1,6, 5.9, 5.8, 5.7, 5.6, 5.5, 5.4, 5.3, 5.2, 5.2, 5.1, 5, 4.9, 4.8, 4.7,
4.6, 4.5, 4.4, 4.3, 4.2, 4.1, 4
3.9, 3.8, 3.7, 3.6, 3.5, 3.4, 3.3, 3.2, 3.1, 3, 2.9, 2.8, 2.7, 2.6, 2.5, 2.4,
2.3, 2.2, 2.1, 2, 1.9, 1.8,
1.7, 1.6, 1.5, 1.4, 1.3, 1.2, 1.1, 1, 0.9, 0.8, 0.7, 0.6, 0.5, inches apart,
and the like or any range
or value therein). In some embodiments, a high density planting includes about
35K seeds
per acre at 36 in and 38 in row spacing; or anything more than 35K per acre at
30 in or more
row spacing. As would be understood by one of skill in the art, the number of
seeds planted
per acre to achieve a high density planting will vary by plant species.
In some embodiments, a method for editing a specific site in the genome of a
plant
cell is provided, the method comprising: cleaving, in a site specific manner,
a target site
within an endogenous HD-Zip transcription factor gene in the plant cell, the
endogenous HD-
Zip transcription factor gene encoding: (a) a polypeptide comprising a
sequence having at
least 80% sequence identity to the amino acid sequence of SEQ ID NO:38 or SEQ
ID
NO:83; (b) a polypeptide comprising a sequence having at least 80% sequence
identity to the
amino acid sequence of: SEQ ID NO:! or SEQ ID NO:2; (c) a polypeptide
comprising a
sequence having at least 80% sequence identity to the amino acid sequence of
SEQ ID
NO:3; (d) a polypeptide comprising a sequence having at least 95% sequence
identity to the
nucleotide sequence of SEQ ID NO:4; (e) a polypeptide comprising a sequence
having the
amino acid sequence of SEQ ID NO:5, wherein X is L or S; (f) a polypeptide
comprising: (i)
sequence having the amino acid sequence of RKKLRLX1KX2QX3(SEQ ID NO:6),
wherein
Xi is S or T, X2 is D or E and X3 is S or A; (ii) a sequence having the amino
acid sequence of
PX1X2 X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V
or M
and X4 is Q, S or N; (iii) a sequence having the amino acid sequence of
ENRRLX1X2EX3,
(SEQ ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and (iv) a
sequence
having the amino acid sequence of VWFQNRRA (SEQ ID NO:9); and/or (g) a
polypeptide
comprising a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9),
thereby generating an edit in the endogenous HD-Zip transcription factor gene
of the plant
cell. The method for editing a plant may further comprise regenerating a plant
from the plant
cell comprising the edit in the endogenous HD-Zip transcription factor gene to
produce a
plant comprising the edit in its endogenous HD-Zip transcription factor gene.
In some
embodiments, the edit results in a non-naturally occurring mutation in the
endogenous HD-
Zip transcription factor gene that produces an HD-Zip transcription factor
with reduced DNA
binding.

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
A plant comprising an endogenous HD-Zip transcription factor gene that is
edited as
described herein to provide an HD-Zip transcription factor with reduced DNA
binding has an
attenuated Shade Avoidance Response when compared to a control plant that has
does not
comprise the edited endogenous HD-Zip transcription factor gene. A plant
comprising an
edited endogenous HD-Zip transcription factor gene as described herein may be
compared to
a plant that is not so edited when grown under the same environmental
conditions, e.g., an
environment with a low R:FR light ratio, e.g., shaded conditions (e.g., an
R:FR ratio of about
0.16; or a range of an R:FR ratio of about 0.09 to about 0.7 (e.g., about
0.09, 0.10. 0.11, 0.12,
0.13, 0.14, 0.15, 0.16, 0.17. 018, 0.19, 0.2, 0.21, 0.23, 0.24, 0.25, to about
0.26, 0.27, 0.28,
0.29, 0.3, 0.35, 0.4, 0.45, 0.5, 0.55, 0.6, 0.65, 0.7, or any range or value
therein))
In some embodiments, a method for making a plant is provided, the method
comprising: (a) contacting a population of plant cells that comprising a wild-
type endogenous
gene encoding an HD-Zip transcription factor with a nuclease targeted to the
wild-type
endogenous gene, wherein the nuclease is linked to a DNA binding domain that
binds to a
nucleic acid sequence encoding: (i) a polypeptide comprising a sequence having
at least 80%
sequence identity to the amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83;
(ii) a
polypeptide comprising a sequence having at least 80% sequence identity to the
amino acid
sequence of: SEQ ID NO:! or SEQ ID NO:2; (iii) a polypeptide comprising a
sequence
having at least 80% sequence identity to the amino acid sequence of SEQ ID
NO:3; (iv) a
polypeptide comprising a sequence having at least 95% sequence identity to the
nucleotide
sequence of SEQ ID NO:4; (v) a polypeptide comprising a sequence having the
amino acid
sequence of SEQ ID NO:5, wherein X is L or S; (vi) a polypeptide comprising:
(1) a
sequence having the amino acid sequence of RKKLRLX1KX2QX3(SEQ ID NO:6),
wherein
Xi is S or T, X2 is D or E and X3 is S or A; (2) a sequence having the amino
acid sequence of
PX1X2 X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V
or M
and X4 is Q, S or N; (3) a sequence having the amino acid sequence of
ENRRLX1X2EX3,
(SEQ ID NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and (4) a
sequence
having the amino acid sequence of VWFQNRRA (SEQ ID NO:9); and/or (vii) a
polypeptide
comprising a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9);
(b) selecting a plant cell from said population comprising a mutation in the
wild-type
endogenous gene encoding an HD-Zip transcription factor, wherein the mutation
is a
substitution and/or a deletion of at least one amino acid residue in the
polypeptide of any one
of (i)-(v), wherein the mutation reduces or eliminates the ability of the HD-
Zip transcription
factor to bind DNA; and (c) growing the selected plant cell into a plant.
51

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
In some embodiments, a method for reducing a Shade Avoidance Response in a
plant
is provided, the method comprising (a) contacting a plant cell comprising a
wild-type
endogenous gene encoding an HD-Zip transcription factor with a nuclease
targeted to the
wild-type endogenous gene, wherein the nuclease is linked to a DNA binding
domain that
binds to a target site in the wild type endogenous gene, the wild type
endogenous gene
encoding: (i) a polypeptide comprising a sequence having at least 80% sequence
identity to
the amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83; (ii) a polypeptide
comprising
a sequence having at least 80% sequence identity to the amino acid sequence
of: SEQ ID
NO:! or SEQ ID NO:2; (iii) a polypeptide comprising a sequence having at least
80%
sequence identity to the amino acid sequence of SEQ ID NO:3; (iv) a
polypeptide
comprising a sequence having at least 95% sequence identity to the nucleotide
sequence of
SEQ ID NO:4; (v) a polypeptide comprising a sequence having the amino acid
sequence of
SEQ ID NO:5, wherein X is L or S; (vi) a polypeptide comprising: (1) a
sequence having the
amino acid sequence of RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S or T, X2
is D
or E and X3 is S or A; (2) a sequence having the amino acid sequence of PX1X2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V or M
and X4 is
Q, S or N; (3) a sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ
ID
NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and (4) a sequence
having the
amino acid sequence of VWFQNRRA (SEQ ID NO:9); and/or (vii) a polypeptide
comprising a sequence having the amino acid sequence of VWFQNRRA (SEQ ID
NO:9),
thereby producing a plant cell comprising a mutation in the wild-type
endogenous gene
encoding an HD-Zip transcription factor; and (b) growing the plant cell into a
plant, thereby
reducing the Shade Avoidance Response in the plant.
In some embodiments, a method for producing a plant or part thereof comprising
at
least one cell having a mutated endogenous HD-Zip transcription factor gene is
provided, the
method comprising contacting a target site in an endogenous HD-Zip
transcription factor
gene in the plant or plant part with a nuclease comprising a cleavage domain
and a DNA-
binding domain, wherein the DNA binding domain binds to a target site in the
endogenous
HD-Zip transcription factor gene, wherein the endogenous HD-Zip transcription
factor gene
encodes: (a) a polypeptide comprising a sequence having at least 80% sequence
identity to
the amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83; (b) a polypeptide
comprising
a sequence having at least 80% sequence identity to the amino acid sequence
of: SEQ ID
NO:! or SEQ ID NO:2; (c) a polypeptide comprising a sequence having at least
80%
sequence identity to the amino acid sequence of SEQ ID NO:3; (d) a polypeptide
comprising
52

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
a sequence having at least 95% sequence identity to the nucleotide sequence of
SEQ ID
NO:4; (e) a polypeptide comprising a sequence having the amino acid sequence
of SEQ ID
NO:5, wherein X is L or S; (f) a polypeptide comprising: (i) sequence having
the amino acid
sequence of RKKLRLX1KX2QX3 (SEQ ID NO:6), wherein Xi is S or T, X2 is D or E
and X3
is S or A; (ii) a sequence having the amino acid sequence of PX1X2
X2LTX3CPX4CER(SEQ
ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or
N; (iii) a
sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7),
wherein Xi
is Q or H, X2 is R or K and X3 is V or L; and (iv) a sequence having the amino
acid sequence
of VWFQNRRA (SEQ ID NO:9); and/or (g) a polypeptide comprising a sequence
having
the amino acid sequence of VWFQNRRA (SEQ ID NO:9), thereby producing a plant
or part
thereof comprising at least one cell having a mutation in the endogenous HD-
Zip
transcription factor gene. In some embodiments, the at least one cell in the
plant or part
thereof having a mutated endogenous HD-Zip transcription factor gene produces
an HD-Zip
transcription factor having reduced binding of DNA.
In some embodiments, a method of producing a plant or part thereof comprising
a
mutated endogenous HD-Zip transcription factor having reduced DNA binding, the
method
comprising contacting a target site in an endogenous HD-Zip transcription
factor gene in the
plant or plant part with a nuclease comprising a cleavage domain and a DNA-
binding
domain, wherein the DNA binding domain binds to a target site in the HD-Zip
transcription
factor gene, wherein the HD-Zip transcription factor gene encodes:
(a) a polypeptide comprising a sequence having at least 80% sequence identity
to the
amino acid sequence of SEQ ID NO:38 or SEQ ID NO:83; (b) a polypeptide
comprising a
sequence having at least 80% sequence identity to the amino acid sequence of:
SEQ ID
NO:! or SEQ ID NO:2; (c) a polypeptide comprising a sequence having at least
80%
sequence identity to the amino acid sequence of SEQ ID NO:3; (d) a polypeptide
comprising
a sequence having at least 95% sequence identity to the nucleotide sequence of
SEQ ID
NO:4; (e) a polypeptide comprising a sequence having the amino acid sequence
of SEQ ID
NO:5, wherein X is L or S; (f) a polypeptide comprising: (i) sequence having
the amino acid
sequence of RKKLRLX1KX2QX3 (SEQ ID NO:6), wherein Xi is S or T, X2 is D or E
and X3
is S or A; (ii) a sequence having the amino acid sequence of PX1X2
X2LTX3CPX4CER(SEQ
ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or
N; (iii) a
sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7),
wherein Xi
is Q or H, X2 is R or K and X3 is V or L; and (iv) a sequence having the amino
acid sequence
of VWFQNRRA (SEQ ID NO:9); and/or (g) a polypeptide comprising a sequence
having
53

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
the amino acid sequence of VWFQNRRA (SEQ ID NO:9), thereby producing a plant
or part
thereof having a mutated endogenous HD-Zip transcription factor having reduced
DNA
binding. In some embodiments, the endogenous HD-Zip transcription factor gene
encodes an
endogenous HD-Zip transcription factor that comprises the amino acid sequence
of SEQ ID
NO: 38 or SEQ ID NO:83, wherein the amino acid sequence of SEQ ID NO: 38 or
SEQ ID
NO:83 comprises the amino acid sequence of VWFQNRRA (SEQ ID NO:9) and the
mutated endogenous HD-Zip transcription factor comprises a mutation in the
amino acid
sequence of VWFQNRRA (SEQ ID NO:9). In some embodiments, an endogenous HD-Zip
transcription factor gene encodes an endogenous HD-Zip transcription factor
that comprises
the amino acid sequence of any one of SEQ ID NO: 10-98, wherein the amino acid
sequence
of any one of SEQ ID NO: 10-98 comprises the amino acid sequence of VWFQNRRA
(SEQ
ID NO:9), and the mutated endogenous HD-Zip transcription factor comprises a
mutation in
the amino acid sequence of VWFQNRRA (SEQ ID NO:9).
In some embodiments, the plant or part thereof comprising a mutated endogenous
HD-Zip transcription factor as described herein exhibits an attenuated/reduced
Shade
Avoidance Response as compared to a control plant that does not comprise the
mutation in
the endogenous HD-Zip transcription factor gene, e.g., the plant or plant part
has not been
contacted with the editing system. In some embodiments, the comparison with a
control
plant may be made between the edited plant and control plant when grown under
the same
environmental conditions; e.g., a shaded environment, e.g., a low R:FR ratio
environment. A
plant that comprises a mutated endogenous HD-Zip transcription factor that
results in an
attenuated/reduced Shade Avoidance Response in the plant exhibits phenotypes
that include,
but are not limited to, increased yield, decreased height, decreased
shoot:root ratio, decreased
leaf length; increased mechanical strength of stems; reduced lodging rate;
delayed
senescence; increased photosynthesis efficiency and grain filling; and/or
enhanced defense
responses against pathogens and herbivores when planted in close proximity
with one or
more other plants as compared to a plant that does not comprise mutated
endogenous HD-Zip
transcription factor that results in an attenuated/reduced Shade Avoidance
Response, which is
planted in close proximity with one or more other plants. In some embodiments,
a plant
having a reduced SAR is at least about 5% shorter than the control plant grown
under the
same environmental conditions (.g., a shaded environment, e.g., a low R:FR
ratio
environment) (e.g., about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19, 20, 21, 22, 23,
24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42,
43, 44, 45, 46, 47, 48,
49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67,
68, 69, 70, 71, 72, 73,
54

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92,
93, 94, 95, 96, 97, 98,
99, 100, 110, 120, 130, 140, 150%, or more shorter, or any range or value
therein).
In some embodiments, a nuclease contacting a plant cell, a population of plant
cells
and/or a target site cleaves an endogenous HD-Zip transcription factor gene
and a mutation is
introduced into the DNA binding site of an endogenous HD-Zip transcription
factor encoded
by the endogenous HD-Zip transcription factor gene.
In some embodiments, the mutation in an endogenous HD-Zip transcription factor

gene may be a non-naturally occurring mutation. In some embodiments, the non-
naturally
occurring mutation may be a substitution, an insertion and/or a deletion. In
some
embodiments, the non-naturally occurring mutation that is a substitution, an
insertion and/or a
deletion may result in a substitution, an insertion and/or a deletion of one
or more amino
acids in the endogenous HD-Zip transcription factor encoded by the endogenous
HD-Zip
transcription factor gene. A nuclease useful with the invention includes but
is not limited to a
zinc finger nuclease, a transcription activator-like effector nuclease
(TALEN), an
endonuclease (e.g., Fokl) or a CRISPR-Cas effector protein.
In some embodiments, an HD-Zip transcription factor useful with this invention
may
be a HD-Zip Type II (HD-Zip II) transcription factor, wherein the HD-Zip II
transcription
factor is capable of regulating a plant's the response to illumination (e.g.,
shade avoidance
response (SAR)). In some embodiments, the HD-Zip II transcription factor may
include, but
is not limited to, an ortholog of AtHB2, HB53, and/or HB78. In some
embodiments, the HD-
Zip II transcription factor may be HOMEOBOX PROTEIN 53 (HB53) or HOMEOBOX
PROTEIN 78 (HB78).
In some embodiments, the present invention provides a guide nucleic acid
(e.g.,
gRNA, gDNA, crRNA, crDNA) that binds to a target site in a HD-Zip
transcription factor
gene, the target site comprising a nucleotide sequence encoding: ((a) a
polypeptide
comprising a sequence having at least 80% sequence identity to the amino acid
sequence of
SEQ ID NO:! or SEQ ID NO:2; (b) a polypeptide comprising a sequence having at
least
80% sequence identity to the amino acid sequence of SEQ ID NO:3; (c) a
polypeptide
comprising a sequence having at least 95% sequence identity to the nucleotide
sequence of
SEQ ID NO:4; (d) a polypeptide comprising a sequence having the amino acid
sequence of
SEQ ID NO:5, wherein X is L or S; (e) a polypeptide comprising: (i) a sequence
having the
amino acid sequence of RKKLRLX1KX2QX3 (SEQ ID NO:6), wherein Xi is S or T, X2
is D
or E and X3 is S or A; (ii) a sequence having the amino acid sequence of PX1X2

X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V or M
and X4 is

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
Q, S or N; (iii) a sequence having the amino acid sequence of ENRRLX1X2EX3,
(SEQ ID
NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and a sequence
having the amino
acid sequence of VWFQNRRA (SEQ ID NO:9) and/or (f) a polypeptide comprising a
sequence having the amino acid sequence of VWFQNRRA (SEQ ID NO:9).
A spacer sequence of a guide of this invention may be complementary to a
fragment
or portion of the nucleotide sequence encoding (a) a polypeptide comprising a
sequence
having at least 80% sequence identity to the amino acid sequence of SEQ ID
NO:! or SEQ
ID NO:2; (b) a polypeptide comprising a sequence having at least 80% sequence
identity to
the amino acid sequence of SEQ ID NO:3; (c) a polypeptide comprising a
sequence having
at least 95% sequence identity to the nucleotide sequence of SEQ ID NO:4; (d)
a polypeptide
comprising a sequence having the amino acid sequence of SEQ ID NO:5, wherein X
is L or
S; (e) a polypeptide comprising: (i) a sequence having the amino acid sequence
of
RKKLRLX1KX2QX3 (SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S
or A;
(ii) a sequence having the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID
NO:8),
wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a
sequence having
the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H,
X2 is
R or K and X3 is V or L; and a sequence having the amino acid sequence of
VWFQNRRA
(SEQ ID NO:9) and/or (f) a polypeptide comprising a sequence having the amino
acid
sequence of VWFQNRRA (SEQ ID NO:9).
In some embodiments, a target nucleic acid is an endogenous HD-Zip
transcription
factor gene that is capable of regulating response to illumination in a plant.
In some
embodiments, a target site in a target nucleic acid may encode (a) a
polypeptide comprising a
sequence having at least 80% sequence identity to the amino acid sequence of
SEQ ID NO:!
or SEQ ID NO:2; (b) a polypeptide comprising a sequence having at least 80%
sequence
identity to the amino acid sequence of SEQ ID NO:3; (c) a polypeptide
comprising a
sequence having at least 95% sequence identity to the nucleotide sequence of
SEQ ID NO:4;
(d) a polypeptide comprising a sequence having the amino acid sequence of SEQ
ID NO:5,
wherein X is L or S; (e) a polypeptide comprising: (i) a sequence having the
amino acid
sequence of RKKLRLX1KX2QX3 (SEQ ID NO:6), wherein Xi is S or T, X2 is D or E
and X3
.. is S or A; (ii) a sequence having the amino acid sequence of PX1X2
X2LTX3CPX4CER(SEQ
ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or
N; (iii) a
sequence having the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7),
wherein Xi
is Q or H, X2 is R or K and X3 is V or L; and a sequence having the amino acid
sequence of
56

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
VWFQNRRA (SEQ ID NO:9) and/or (f) a polypeptide comprising a sequence having
the
amino acid sequence of VWFQNRRA (SEQ ID NO:9).
In some embodiments, the guide nucleic acid comprises a spacer having the
nucleotide sequence of any one of SEQ ID NOs:175 -182. In some embodiments,
the HD-
Zip transcription factor may be a HD-Zip Type II (HD-Zip II) transcription
factor, optionally
wherein the HD-Zip II transcription factor may be HB53 or HB78.
In some embodiments, a system is provided that comprises a guide nucleic acid
of the
present invention and a CRISPR-Cas effector protein that associates with the
guide nucleic
acid. In some embodiments, the system may further comprise a tracr nucleic
acid that
associates with the guide nucleic acid and a CRISPR-Cas effector protein,
optionally wherein
the tracr nucleic acid and the guide nucleic acid are covalently linked.
In some embodiments, a gene editing system is provided, the gene editing
system
comprising a CRISPR-Cas effector protein in association with a guide nucleic
acid, wherein
the guide nucleic acid comprises a spacer sequence that binds to a HD-Zip
transcription
factor gene. In some embodiments, a HD-Zip transcription factor gene useful
with the gene
editing system encodes: (a) a polypeptide comprising a sequence having at
least 80%
sequence identity to the amino acid sequence of SEQ ID NO:! or SEQ ID NO:2;
(b) a
polypeptide comprising a sequence having at least 80% sequence identity to the
amino acid
sequence of SEQ ID NO:3; (c) a polypeptide comprising a sequence having at
least 95%
sequence identity to the nucleotide sequence of SEQ ID NO:4; (d) a polypeptide
comprising
a sequence having the amino acid sequence of SEQ ID NO:5, wherein X is L or S;
(e) a
polypeptide comprising: (i) a sequence having the amino acid sequence of
RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S or
A;
(ii) a sequence having the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID
NO:8),
wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a
sequence having
the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H,
X2 is
R or K and X3 is V or L; and a sequence having the amino acid sequence of
VWFQNRRA
(SEQ ID NO:9) and/or (f) a polypeptide comprising a sequence having the amino
acid
sequence of VWFQNRRA (SEQ ID NO:9). In some embodiments, the HD-Zip
transcription
factor may be a HD-Zip Type II (HD-Zip II) transcription factor, optionally
wherein the HD-
Zip II transcription factor may be HB53 or HB78.
In some embodiments, the guide nucleic acid of a gene editing system may
comprise
a spacer sequence having a nucleotide sequence that is complementary to a
nucleotide
sequence encoding (a) a polypeptide comprising a sequence having at least 80%
sequence
57

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
identity to the amino acid sequence of SEQ ID NO:! or SEQ ID NO:2; (b) a
polypeptide
comprising a sequence having at least 80% sequence identity to the amino acid
sequence of
SEQ ID NO:3; (c) a polypeptide comprising a sequence having at least 95%
sequence
identity to the nucleotide sequence of SEQ ID NO:4; (d) a polypeptide
comprising a
sequence having the amino acid sequence of SEQ ID NO:5, wherein X is L or S;
(e) a
polypeptide comprising: (i) a sequence having the amino acid sequence of
RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S or
A;
(ii) a sequence having the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID
NO:8),
wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a
sequence having
the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H,
X2 is
R or K and X3 is V or L; and a sequence having the amino acid sequence of
VWFQNRRA
(SEQ ID NO:9) and/or (f) a polypeptide comprising a sequence having the amino
acid
sequence of VWFQNRRA (SEQ ID NO:9). In some embodiments, the guide nucleic
acid of
a gene editing system may comprise a spacer sequence having a nucleotide
sequence of any
one of SEQ ID NOs:175-182. In some embodiments, a gene editing system may
further
comprise a tracr nucleic acid that associates with the guide nucleic acid and
a CRISPR-Cas
effector protein, optionally wherein the tracr nucleic acid and the guide
nucleic acid are
covalently linked.
The present invention further provides a complex comprising a CRISPR-Cas
effector
protein comprising a cleavage domain and a guide nucleic acid, wherein the
guide nucleic
acid binds to a target site in a HD-Zip transcription factor gene encoding (a)
a polypeptide
comprising a sequence having at least 80% sequence identity to the amino acid
sequence of
SEQ ID NO:! or SEQ ID NO:2; (b) a polypeptide comprising a sequence having at
least
80% sequence identity to the amino acid sequence of SEQ ID NO:3; (c) a
polypeptide
comprising a sequence having at least 95% sequence identity to the nucleotide
sequence of
SEQ ID NO:4; (d) a polypeptide comprising a sequence having the amino acid
sequence of
SEQ ID NO:5, wherein X is L or S; (e) a polypeptide comprising: (i) a sequence
having the
amino acid sequence of RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S or T, X2
is D
or E and X3 is S or A; (ii) a sequence having the amino acid sequence of PX1X2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V or M
and X4 is
Q, S or N; (iii) a sequence having the amino acid sequence of ENRRLX1X2EX3,
(SEQ ID
NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and a sequence
having the amino
acid sequence of VWFQNRRA (SEQ ID NO:9) and/or (f) a polypeptide comprising a
58

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
sequence having the amino acid sequence of VWFQNRRA (SEQ ID NO:9),wherein the
cleavage domain cleaves a target strand in the HD-Zip transcription factor
gene.
Also provided herein are expression cassettes comprising a (a) polynucleotide
encoding CRISPR-Cas effector protein comprising a cleavage domain and (b) a
guide nucleic
acid that binds to a target site in a HD-Zip transcription factor gene,
wherein the guide
nucleic acid comprises a spacer sequence that is complementary to and binds to
a nucleotide
sequence encoding: (a) a polypeptide comprising a sequence having at least 80%
sequence
identity to the amino acid sequence of SEQ ID NO:! or SEQ ID NO:2; (b) a
polypeptide
comprising a sequence having at least 80% sequence identity to the amino acid
sequence of
SEQ ID NO:3; (c) a polypeptide comprising a sequence having at least 95%
sequence
identity to the nucleotide sequence of SEQ ID NO:4; (d) a polypeptide
comprising a
sequence having the amino acid sequence of SEQ ID NO:5, wherein X is L or S;
(e) a
polypeptide comprising: (i) a sequence having the amino acid sequence of
RKKLRLX1KX2QX3 (SEQ ID NO:6), wherein Xi is S or T, X2 is D or E and X3 is S
or A;
(ii) a sequence having the amino acid sequence of PX1X2 X2LTX3CPX4CER(SEQ ID
NO:8),
wherein Xi is P or A, X2 is T or A, X3 is V or M and X4 is Q, S or N; (iii) a
sequence having
the amino acid sequence of ENRRLX1X2EX3, (SEQ ID NO:7), wherein Xi is Q or H,
X2 is
R or K and X3 is V or L; and a sequence having the amino acid sequence of
VWFQNRRA
(SEQ ID NO:9) and/or (f) a polypeptide comprising a sequence having the amino
acid
.. sequence of VWFQNRRA (SEQ ID NO:9). In some embodiments, the HD-Zip
transcription
factor is a HD-Zip Type II (HD-Zip II) transcription factor, optionally
wherein the HD-Zip II
transcription factor is HB53 or HB78.
In some embodiments, a nucleic acid encoding HD-Zip transcription factor
(e.g., HD-
Zip II, e.g., HB53 or HB78) having a mutated DNA binding site is provided,
wherein the
mutated DNA binding site of the HD-Zip transcription factor comprises a
mutation that
disrupts DNA binding by the HD-Zip transcription factor. In some embodiments,
the
mutation may eliminate the binding of the HD-Zip transcription factor to DNA
or may reduce
the ability of the HD-Zip transcription factor to bind to DNA by at least 75%
(e.g., at least
about 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92,
93, 94, 95, 96, 97,
98, 99, 100%). The present invention further provides plants or parts thereof
comprising the
nucleic acid of the invention. In some embodiments, when the plant is a corn
plant, the corn
plant may comprise a short stature/semi-dwarf phenotype. In some embodiments,
the plant
may be a whet plant or part thereof, optionally wherein the nucleic acid may
be comprised in
the A genome, the B genome, the D genome or in any combination thereof. In
some
59

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
embodiments, a plant, a corn plant and/or a wheat plant of the invention
having a reduced
SAR may exhibit increased yield, decreased height, decreased shoot:root ratio,
decreased leaf
length; increased mechanical strength of stems; reduced lodging rate; delayed
senescence;
increased photosynthesis efficiency and grain filling; and/or enhanced defense
responses
.. against pathogens and herbivores when they are planted in close proximity
with one or more
other as compared to a control plant that does not comprise a nucleic acid
encoding HD-Zip
transcription factor (e.g., HD-Zip II, e.g., HB53 or HB78) having a mutated
DNA binding
site and therefore does not comprise a reduced Shade Avoidance Response, which
is planted
in close proximity with one or more other plants. In some embodiments, when
planted in
close proximity to one another, plants of this invention comprising reduced
SAR may be at
least about 5% shorter than a control plant grown under the same conditions
(e.g., a shaded
environment, e.g., a low R:FR ratio environment).
In some embodiments, a method of the present invention may further comprise
regenerating a plant from a plant cell or plant part comprising at least one
non-natural
mutation in an endogenous Homeodomain-leucine zipper (HD-Zip) transcription
factor,
wherein the mutation disrupts the binding of the HD-Zip transcription factor
to DNA. In
some embodiments, the plant comprise comprising at least one non-natural
mutation in an
endogenous Homeodomain-leucine zipper (HD-Zip) transcription factor may have
increased
yield, decreased height, decreased shoot:root ratio, decreased leaf length;
increased
mechanical strength of stems; reduced lodging rate; delayed senescence;
increased
photosynthesis efficiency and grain filling; and/or enhanced defense responses
against
pathogens and herbivores when they are planted in close proximity with one or
more other
plants compared to a control plant that does not comprise the at least one non-
natural
mutation in an endogenous Homeodomain-leucine zipper (HD-Zip) transcription
factor, and
therefore, does not comprise a reduced Shade Avoidance Response, which plant
is planted in
close proximity with one or more other plants. In some embodiments, the
mutation is a non-
naturally occurring mutation. In some embodiments, the mutation is a deletion.
In some
embodiments, the deletion is a dominant recessive mutation.
An editing system useful with this invention can be any site-specific
(sequence-
specific) genome editing system now known or later developed, which system can
introduce
mutations in target specific manner. For example, an editing system (e.g.,
site- or sequence-
specific editing system) can include, but is not limited to, a CRISPR-Cas
editing system, a
meganuclease editing system, a zinc finger nuclease (ZFN) editing system, a
transcription
activator-like effector nuclease (TALEN) editing system, a base editing system
and/or a

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
prime editing system, each of which can comprise one or more polypeptides
and/or one or
more polynucleotides that when expressed as a system in a cell can modify
(mutate) a target
nucleic acid in a sequence specific manner. In some embodiments, an editing
system (e.g.,
site- or sequence-specific editing system) can comprise one or more
polynucleotides and/or
one or more polypeptides, including but not limited to a nucleic acid binding
domain (DNA
binding domain), a nuclease, and/or other polypeptide, and/or a
polynucleotide.
In some embodiments, an editing system can comprise one or more sequence-
specific
nucleic acid binding domains (DNA binding domains) that can be from, for
example, a
polyrnicleotide-guided endonuclease, a CRISPR-Cas endonuci ease (e.g., CRISPR-
Cas
effector protein), a zinc finger nuclease, a transcription activator-like
effector nuclease
(TALEN) and/or an Argonaute protein. In some embodiments, an editing system
can
comprise one or more cleavage domains (e.g., nucleases) including, but not
limited to, an
endonuclease (e.g., Fokl), a polynucleotide-guided endonuclease, a CMSPR-Cas
endonuclease CRISPR-Cas effector protein), a zinc finger nuclease,
and/or a
transcription activator-like effector nuclease (TALEN). In some embodiments,
an editing
system can comprise one or more polypeptides that include, but are not limited
to, a
deaminase (e.g., a cytosine deaminase, an adenine deaminase), a reverse
transcriptase, a Dna2
polypeptide, and/or a 5' flap endonuclease (FEN). In some embodiments, an
editing system
can comprise one or more polynucleotides, including, but is not limited to, a
CRISPR array
(CRISPR guide) nucleic acid, extended guide nucleic acid, and/or a reverse
transcriptase
template.
In some embodiments, a method of modifying or editing an HD-Zip transcription
factor may comprise contacting a target nucleic acid (e.g., a nucleic acid
encoding an HD-Zip
transcription factor) with a base-editing fusion protein (e.g., a sequence
specific DNA
binding protein (e.g., a CRISPR-Cas effector protein or domain) fused to a
deaminase domain
(e.g., an adenine deaminase and/or a cytosine deaminase) and a guide nucleic
acid, wherein
the guide nucleic acid is capable of guiding/targeting the base editing fusion
protein to the
target nucleic acid, thereby editing a locus within the target nucleic acid.
In some
embodiments, a base editing fusion protein and guide nucleic acid may be
comprised in one
or more expression cassettes. In some embodiments, the target nucleic acid may
be contacted
with a base editing fusion protein and an expression cassette comprising a
guide nucleic acid.
In some embodiments, the sequence-specific DNA binding fusion proteins and
guides may be
provided as ribonucleoproteins (RNPs). In some embodiments, a cell may be
contacted with
more than one base-editing fusion protein and/or one or more guide nucleic
acids that may
61

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
target one or more target nucleic acids in the cell.
In some embodiments, a method of modifying or editing an HD-Zip transcription
factor may comprise contacting a target nucleic acid (e.g., a nucleic acid
encoding an HD-Zip
transcription factor) with a sequence-specific DNA binding fusion protein
(e.g., a sequence-
specific DNA binding protein (e.g., a CRISPR-Cas effector protein or domain)
fused to a
peptide tag, a deaminase fusion protein comprising a deaminase domain (e.g.,
an adenine
deaminase and/or a cytosine deaminase) fused to an affinity polypeptide that
is capable of
binding to the peptide tag, and a guide nucleic acid, wherein the guide
nucleic acid is capable
of guiding/targeting the sequence-specific DNA binding fusion protein to the
target nucleic
acid and the sequence-specific DNA binding fusion protein is capable of
recruiting the
deaminase fusion protein to the target nucleic acid via the peptide tag-
affinity polypeptide
interaction, thereby editing a locus within the target nucleic acid. In some
embodiments, the
sequence-specific DNA binding fusion protein may be fused to the affinity
polypeptide that
binds the peptide tag and the deaminase may be fuse to the peptide tag,
thereby recruiting the
deaminase to the sequence-specific DNA binding fusion protein and to the
target nucleic
acid. In some embodiments, the sequence-specific binding fusion protein,
deaminase fusion
protein, and guide nucleic acid may be comprised in one or more expression
cassettes. In
some embodiments, the target nucleic acid may be contacted with a sequence-
specific
binding fusion protein, deaminase fusion protein, and an expression cassette
comprising a
guide nucleic acid. In some embodiments, the sequence-specific DNA binding
fusion
proteins, deaminase fusion proteins and guides may be provided as
ribonucleoproteins
(RNPs).
In some embodiments, methods such as prime editing may be used to generate a
mutation in an endogenous HD-Zip transcription factor gene. In prime editing,
RNA-
dependent DNA polymerase (reverse transcriptase, RT) and reverse transcriptase
templates
(RT template) are used in combination with sequence specific DNA binding
domains that
confer the ability to recognize and bind the target in a sequence-specific
manner, and which
can also cause a nick of the PAM-containing strand within the target. The DNA
binding
domain may be a CRISPR-Cas effector protein and in this case, the CRISPR array
or guide
RNA may be an extended guide that comprises an extended portion comprising a
primer
binding site (PSB) and the edit to be incorporated into the genome (the
template). Similar to
base editing, prime editing can take advantageous of the various methods of
recruiting
proteins for use in the editing to the target site, such methods including
both non-covalent and
covalent interactions between the proteins and nucleic acids used in the
selected process of
62

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
genome editing.
In some embodiments, the mutation of an HD-Zip transcription factor gene may
be an
insertion, a deletion and/or a point mutation in that produces an HD-Zip
transcription factor
having reduced DNA binding (e.g., a mutated HD-Zip transcription factor). In
some
embodiments, a plant part may be a cell. In some embodiments, the plant or
plant part
thereof may be any plant or part thereof as described herein. In some
embodiments, a plant
useful with this invention may be corn, soy, canola, wheat, rice, cotton,
sugarcane, sugar beet,
barley, oats, alfalfa, sunflower, safflower, oil palm. sesame, coconut,
tobacco, potato, sweet
potato, cassava, coffee, apple, plum, apricot, peach, cherry, pear, fig,
banana, citrus, cocoa,
avocado, olive, almond, walnut, strawberry, watermelon, pepper, grape, tomato,
cucumber, or
a Brassica spp. In some embodiments, a plant comprising a mutated endogenous
HD-Zip
transcription factor having DNA binding may comprise increased yield,
decreased height,
decreased shoot:root ratio, decreased leaf length; increased mechanical
strength of stems;
reduced lodging rate; delayed senescence; increased photosynthesis efficiency
and grain
filling; and/or enhanced defense responses against pathogens and herbivores
when they are
planted in close proximity with one or more other plants compared to a control
plant that does
not comprise the at least one non-natural mutation in an endogenous
Homeodomain-leucine
zipper (HD-Zip) transcription factor, and therefore, does not comprise a
reduced Shade
Avoidance Response, which plant is planted in close proximity with one or more
other plants.
In some embodiments, the plant may be a corn plant, optionally wherein the
corn plant
comprises a short stature/semi-dwarf phenotype.
In some embodiments, a mutation that is introduced into an endogenous HD-Zip
transcription factor and resulting in reduced DNA binding may be a non-
naturally occurring
mutation. In some embodiments, a mutation that is introduced into an
endogenous HD-Zip
transcription factor and resulting in reduced DNA binding may be a
substitution, an insertion
and/or a deletion of one or more amino acid residues. In some embodiments, a
mutation that
is introduced into an endogenous HD-Zip transcription factor gene that results
in reduced
DNA binding may be a deletion, optionally a deletion of all or a portion of
the amino acid
sequence of SEQ ID NO:9 (e.g., 1, 2, 3, 4, 5, 6, 7, or 8 amino acid residues
of SEQ ID
NO:9).
In some embodiments, an HD-Zip transcription factor may be an HD-Zip Type II
(HD-Zip II) transcription factor, optionally wherein the HD-Zip II
transcription factor may
bea HOMEOBOX PROTEIN 53 (HB53) or a HOMEOBOX PROTEIN 78 (HB78)
63

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
In some embodiments, a sequence-specific nucleic acid binding domain (DNA
binding domains) of an editing system useful with this invention can be from,
for example, a
polynucleotide-guided endonuclease, a CRISPR-Cas endonuclease (e.g., CRISPR-
Cas
effector protein), a zinc finger nuclease, a transcription activator-like
effector nuclease
.. (TALEN) and/or an Argonaute protein.
In some embodiments, a sequence-specific DNA binding domain may be a CRISPR-
Cas effector protein, optionally wherein the CRISPR-Cas effector protein may
be from a
Type I CRISPR-Cas system, a Type II CRISPR-Cas system, a Type III CRISPR-Cas
system,
a Type IV CRISPR-Cas system, Type V CRISPR-Cas system, or a Type VI CRISPR-Cas
system. In some embodiments, a CRISPR-Cas effector protein of the invention
may be from
a Type II CRISPR-Cas system or a Type V CRISPR-Cas system. In some
embodiments, a
CRISPR-Cas effector protein may be Type II CRISPR-Cas effector protein, for
example, a
Cas9 effector protein. In some embodiments, a CRISPR-Cas effector protein may
be Type V
CRISPR-Cas effector protein, for example, a Cas12 effector protein.
In some embodiments, a CRISPR-Cas effector protein may include, but is not
limited
to, a Cas9, C2c1, C2c3, Cas12a (also referred to as Cpfl), Cas12b, Cas12c,
Cas12d, Cas12e,
Cas13a, Cas13b, Cas13c, Cas13d, Casl, Cas1B, Cas2, Cas3, Cas3', Cas3", Cas4,
Cas5, Cas6,
Cas7, Cas8, Cas9 (also known as Csnl and Csx12), Cas10, Csyl, Csy2, Csy3,
Csel, Cse2,
Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5,
Cmr6,
Csbl, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csxl, Csx15, Csfl,
Csf2, Csf3,
Csf4 (dinG), and/or Csf5 nuclease, optionally wherein the CRISPR-Cas effector
protein may
be a Cas9, Cas12a (Cpfl), Cas12b, Cas12c (C2c3), Cas12d (CasY), Cas12e (CasX),
Cas12g,
Cas12h, Cas12i, C2c4, C2c5, C2c8, C2c9, C2c10, Cas14a, Cas14b, and/or Cas14c
effector
protein.
In some embodiments, a CRISPR-Cas effector protein useful with the invention
may
comprise a mutation in its nuclease active site (e.g., RuvC, HNH, e.g., RuvC
site of a Cas12a
nuclease domain, e.g., RuvC site and/or HNH site of a Cas9 nuclease domain). A
CRISPR-
Cas effector protein having a mutation in its nuclease active site, and
therefore, no longer
comprising nuclease activity, is commonly referred to as "dead," e.g., dCas.
In some
embodiments, a CRISPR-Cas effector protein domain or polypeptide having a
mutation in its
nuclease active site may have impaired activity or reduced activity as
compared to the same
CRISPR-Cas effector protein without the mutation, e.g., a nickase, e.g, Cas9
nickase, Cas12a
nickase.
64

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
A CRISPR Cas9 effector protein or CRISPR Cas9 effector domain useful with this

invention may be any known or later identified Cas9 nuclease. In some
embodiments, a
CRISPR Cas9 polypeptide can be a Cas9 polypeptide from, for example,
Streptococcus spp.
(e.g., S. pyogenes, S. thermophilus), Lactobacillus spp., Bifidobacterium
spp., Kandleria spp.,
Leuconostoc spp., Oenococcus spp., Pediococcus spp., Weissella spp., and/or
Olsenella spp.
In some embodiments, the CRISPR-Cas effector protein may be a Cas9 polypeptide

derived from Streptococcus pyogenes and recognizes the PAM sequence motif NGG,
NAG,
NGA (Mali et al, Science 2013; 339(6121): 823-826). In some embodiments, the
CRISPR-
Cas effector protein may be a Cas9 polypeptide derived from Streptococcus
therm ophiles and
recognizes the PAM sequence motif NGGNG and/or NNAGAAW (W = A or T) (See,
e.g.,
Horvath et al, Science, 2010; 327(5962): 167-170, and Deveau et al, J
Bacteriol 2008; 190(4):
1390-1400). In some embodiments, the CRISPR-Cas effector protein may be a Cas9

polypeptide derived from Streptococcus mutans and recognizes the PAM sequence
motif
NGG and/or NAAR (R = A or G) (See, e.g., Deveau et al, J BACTERIOL 2008;
190(4):
1390-1400). In some embodiments, the CRISPR-Cas effector protein may be a Cas9
polypeptide derived from Streptococcus aureus and recognizes the PAM sequence
motif
NNGRR (R = A or G). In some embodiments, the CRISPR-Cas effector protein may
be a
Cas9 protein derived from S. aureus, which recognizes the PAM sequence motif N
GRRT (R
= A or G). In some embodiments, the CRISPR-Cas effector protein may be a Cas9
polypeptide derived from S. aureus, which recognizes the PAM sequence motif N
GRRV (R
= A or G). In some embodiments, the CRISPR-Cas effector protein may be a Cas9
polypeptide that is derived from Neisseria meningitidis and recognizes the PAM
sequence
motif N GATT or N GCTT (R = A or G, V = A, G or C) (See, e.g., Hou et ah, PNAS
2013,
1-6). In the aforementioned embodiments, N can be any nucleotide residue,
e.g., any of A, G,
C or T. In some embodiments, the CRISPR-Cas effector protein may be a Cas13a
protein
derived from Leptotrichia shahii, which recognizes a protospacer flanking
sequence (PFS)
(or RNA PAM (rPAM)) sequence motif of a single 3' A, U, or C, which may be
located
within the target nucleic acid.
In some embodiments, the CRISPR-Cas effector protein may be derived from
Cas12a,
which is a Type V Clustered Regularly Interspaced Short Palindromic Repeats
(CRISPR)-
Cas nuclease. Cas12a differs in several respects from the more well-known Type
II CRISPR
Cas9 nuclease. For example, Cas9 recognizes a G-rich protospacer-adjacent
motif (PAM)
that is 3' to its guide RNA (gRNA, sgRNA, crRNA, crDNA, CRISPR array) binding
site
(protospacer, target nucleic acid, target DNA) (3'-NGG), while Cas12a
recognizes a T-rich

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
PAM that is located 5' to the target nucleic acid (5'-TTN, 5'-TTTN. In fact,
the orientations
in which Cas9 and Cas12a bind their guide RNAs are very nearly reversed in
relation to their
N and C termini. Furthermore, Cas12a enzymes use a single guide RNA (gRNA,
CRISPR
array, crRNA) rather than the dual guide RNA (sgRNA (e.g., crRNA and
tracrRNA)) found
in natural Cas9 systems, and Cas12a processes its own gRNAs. Additionally,
Cas12a
nuclease activity produces staggered DNA double stranded breaks instead of
blunt ends
produced by Cas9 nuclease activity, and Cas12a relies on a single RuvC domain
to cleave
both DNA strands, whereas Cas9 utilizes an HNH domain and a RuvC domain for
cleavage.
A CRISPR Cas12a effector protein/domain useful with this invention may be any
known or later identified Cas12a polypeptide (previously known as Cpfl) (see,
e.g., U.S.
Patent No. 9,790,490, which is incorporated by reference for its disclosures
of Cpfl (Cas12a)
sequences). The term "Cas12a", "Cas12a polypeptide" or "Cas12a domain" refers
to an
RNA-guided nuclease comprising a Cas12a polypeptide, or a fragment thereof,
which
comprises the guide nucleic acid binding domain of Cas12a and/or an active,
inactive, or
partially active DNA cleavage domain of Cas12a. In some embodiments, a Cas12a
useful
with the invention may comprise a mutation in the nuclease active site (e.g.,
RuvC site of the
Cas12a domain). A Cas12a domain or Cas12a polypeptide having a mutation in its
nuclease
active site, and therefore, no longer comprising nuclease activity, is
commonly referred to as
deadCas12a (e.g., dCas12a). In some embodiments, a Cas12a domain or Cas12a
polypeptide
having a mutation in its nuclease active site may have impaired activity,
e.g., may have
nickase activity.
Any deaminase domain/polypeptide useful for base editing may be used with this
invention. In some embodiments, the deaminase domain may be a cytosine
deaminase
domain or an adenine deaminase domain. A cytosine deaminase (or cytidine
deaminase)
useful with this invention may be any known or later identified cytosine
deaminase from any
organism (see, e.g., U.S. Patent No. 10,167,457 and Thuronyi et al. Nat.
Biotechnol.
37:1070-4079 (2019), each of which is incorporated by reference herein for its
disclosure of
cytosine deaminases). Cytosine deaminases can catalyze the hydrolytic
deamination of
cytidine or deoxycytidine to uridine or deoxyuridine, respectively. Thus, in
some
embodiments, a deaminase or deaminase domain useful with this invention may be
a cytidine
deaminase domain, catalyzing the hydrolytic deamination of cytosine to uracil.
In some
embodiments, a cytosine deaminase may be a variant of a naturally occurring
cytosine
deaminase, including but not limited to a primate (e.g., a human, monkey,
chimpanzee,
gorilla), a dog, a cow, a rat or a mouse. Thus, in some embodiments, an
cytosine deaminase
66

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
useful with the invention may be about 70% to about 100% identical to a wild
type cytosine
deaminase (e.g., about 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%,
81%,
82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%,
97%,
98%, 99%, or 100% identical, and any range or value therein, to a naturally
occurring
cytosine deaminase).
In some embodiments, a cytosine deaminase useful with the invention may be an
apolipoprotein B mRNA-editing complex (APOBEC) family deaminase. In some
embodiments, the cytosine deaminase may be an APOBEC1 deaminase, an APOBEC2
deaminase, an APOBEC3A deaminase, an APOBEC3B deaminase, an APOBEC3C
deaminase, an APOBEC3D deaminase, an APOBEC3F deaminase, an APOBEC3G
deaminase, an APOBEC3H deaminase, an APOBEC4 deaminase, a human activation
induced deaminase (hAID), an rAPOBEC1, FERNY, and/or a CDA1, optionally a
pmCDA1,
an atCDA1 (e.g., At2g19570), and evolved versions of the same. In some
embodiments, the
cytosine deaminase may be an APOBEC1 deaminase having the amino acid sequence
of
SEQ ID NO:188. In some embodiments, the cytosine deaminase may be an APOBEC3A
deaminase having the amino acid sequence of SEQ ID NO:189. In some
embodiments, the
cytosine deaminase may be an CDA1 deaminase, optionally a CDA1 having the
amino acid
sequence of SEQ ID NO:190. In some embodiments, the cytosine deaminase may be
a
FERNY deaminase, optionally a FERNY having the amino acid sequence of SEQ ID
.. NO:191. In some embodiments, the cytosine deaminase may be an evolved
deaminase, e.g.,
SEQ ID NO:192, SEQ ID NO:193, or SEQ ID NO:194. In some embodiments, a
cytosine
deaminase useful with the invention may be about 70% to about 100% identical
(e.g., 70%,
71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%,
86%,
87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or 100%
identical) to the amino acid sequence of a naturally occurring cytosine
deaminase (e.g., an
evolved deaminase). In some embodiments, a cytosine deaminase useful with the
invention
may be about 70% to about 99.5% identical (e.g., about 70%, 71%, 72%, 73%,
74%, 75%,
76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%,
91%,
92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 99.5% identical) to the amino acid
sequence
of SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:192, SEQ ID NO:193, or SEQ ID NO:194 (e.g., at least 80%, at least 85%, at
least
90%, at least 92%, at least 95%, at least 96%, at least 97%, at least 98%, at
least 99%, or at
least 99.5% identical to the amino acid sequence of SEQ ID NO:188, SEQ ID
NO:189,
SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:192, SEQ ID NO:193, or SEQ ID
67

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
NO:194). In some embodiments, a polynucleotide encoding a cytosine deaminase
may be
codon optimized for expression in a plant and the codon optimized polypeptide
may be about
70% to 99.5% identical to the reference polynucleotide.
In some embodiments, a nucleic acid construct of this invention may further
encode a
uracil glycosylase inhibitor (UGI) (e.g., uracil-DNA glycosylase inhibitor)
polypeptide/domain. Thus, in some embodiments, a nucleic acid construct
encoding a
CRISPR-Cas effector protein and a cytosine deaminase domain (e.g., encoding a
fusion
protein comprising a CRISPR-Cas effector protein domain fused to a cytosine
deaminase
domain, and/or a CRISPR-Cas effector protein domain fused to a peptide tag or
to an affinity
polypeptide capable of binding a peptide tag and/or a deaminase protein domain
fused to a
peptide tag or to an affinity polypeptide capable of binding a peptide tag)
may further encode
a uracil-DNA glycosylase inhibitor (UGI), optionally wherein the UGI may be
codon
optimized for expression in a plant. In some embodiments, the invention
provides fusion
proteins comprising a CRISPR-Cas effector polypeptide, a deaminase domain, and
a UGI
and/or one or more polynucleotides encoding the same, optionally wherein the
one or more
polynucleotides may be codon optimized for expression in a plant. In some
embodiments,
the invention provides fusion proteins, wherein a CRISPR-Cas effector
polypeptide, a
deaminase domain, and a UGI may be fused to any combination of peptide tags
and affinity
polypeptides as described herein, thereby recruiting the deaminase domain and
UGI to the
CRISPR-Cas effector polypeptide and a target nucleic acid. In some
embodiments, a guide
nucleic acid may be linked to a recruiting RNA motif and one or more of the
deaminase
domain and/or UGI may be fused to an affinity polypeptide that is capable of
interacting with
the recruiting RNA motif, thereby recruiting the deaminase domain and UGI to a
target
nucleic acid.
A "uracil glycosylase inhibitor" useful with the invention may be any protein
that is
capable of inhibiting a uracil-DNA glycosylase base-excision repair enzyme. In
some
embodiments, a UGI domain comprises a wild type UGI or a fragment thereof. In
some
embodiments, a UGI domain useful with the invention may be about 70% to about
100%
identical (e.g., 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%,
82%,
83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%,
98%,
99%, 99.5% or 100% identical and any range or value therein) to the amino acid
sequence of
a naturally occurring UGI domain. In some embodiments, a UGI domain may
comprise the
amino acid sequence of SEQ ID NO: 195 or a polypeptide having about 70% to
about 99.5%
sequence identity to the amino acid sequence of SEQ ID NO: 195(e.g., at least
80%, at least
68

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
85%, at least 90%, at least 92%, at least 95%, at least 96%, at least 97%, at
least 98%, at least
99%, or at least 99.5% identical to the amino acid sequence of SEQ ID NO:195).
For
example, in some embodiments, a UGI domain may comprise a fragment of the
amino acid
sequence of SEQ ID NO:195 that is 100% identical to a portion of consecutive
nucleotides
.. (e.g., 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80
consecutive nucleotides; e.g.,
about 10, 15, 20, 25, 30, 35, 40, 45, to about 50, 55, 60, 65, 70, 75, 80
consecutive
nucleotides) of the amino acid sequence of SEQ ID NO:195. In some embodiments,
a UGI
domain may be a variant of a known UGI (e.g., SEQ ID NO:195) having about 70%
to about
99.5% sequence identity (e.g., 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%,
79%,
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%,
95%,
96%, 97%, 98%, 99%, 99.5% sequence identity, and any range or value therein)
to the known
UGI. In some embodiments, a polynucleotide encoding a UGI may be codon
optimized for
expression in a plant (e.g., a plant) and the codon optimized polypeptide may
be about 70%
to about 99.5% identical to the reference polynucleotide.
An adenine deaminase (or adenosine deaminase) useful with this invention may
be
any known or later identified adenine deaminase from any organism (see, e.g.,
U.S. Patent
No. 10,113,163, which is incorporated by reference herein for its disclosure
of adenine
deaminases). An adenine deaminase can catalyze the hydrolytic deamination of
adenine or
adenosine. In some embodiments, the adenine deaminase may catalyze the
hydrolytic
deamination of adenosine or deoxyadenosine to inosine or deoxyinosine,
respectively. In
some embodiments, the adenosine deaminase may catalyze the hydrolytic
deamination of
adenine or adenosine in DNA. In some embodiments, an adenine deaminase encoded
by a
nucleic acid construct of the invention may generate an A->G conversion in the
sense (e.g.,
"+"; template) strand of the target nucleic acid or a T->C conversion in the
antisense (e.g.,
"2, complementary) strand of the target nucleic acid.
In some embodiments, an adenosine deaminase may be a variant of a naturally
occurring adenine deaminase. Thus, in some embodiments, an adenosine deaminase
may be
about 70% to 100% identical to a wild type adenine deaminase (e.g., about 70%,
71%, 72%,
73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%,
88%,
89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical, and
any
range or value therein, to a naturally occurring adenine deaminase). In some
embodiments,
the deaminase or deaminase does not occur in nature and may be referred to as
an engineered,
mutated or evolved adenosine deaminase. Thus, for example, an engineered,
mutated or
evolved adenine deaminase polypeptide or an adenine deaminase domain may be
about 70%
69

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
to 99.9% identical to a naturally occurring adenine deaminase
polypeptide/domain (e.g.,
about 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%,
84%,
85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%,
99.1%, 99.2%, 99.3%, 99.4%, 99.5%, 99.6%, 99.7%, 99.8% or 99.9% identical, and
any
range or value therein, to a naturally occurring adenine deaminase polypeptide
or adenine
deaminase domain). In some embodiments, the adenosine deaminase may be from a
bacterium, (e.g., Escherichia coil, Staphylococcus aureus, Haemophilus
influenzae,
Caulobacter crescentus, and the like). In some embodiments, a polynucleotide
encoding an
adenine deaminase polypeptide/domain may be codon optimized for expression in
a plant.
In some embodiments, an adenine deaminase domain may be a wild type tRNA-
specific adenosine deaminase domain, e.g., a tRNA-specific adenosine deaminase
(TadA)
and/or a mutated/evolved adenosine deaminase domain, e.g., mutated/evolved
tRNA-specific
adenosine deaminase domain (TadA*). In some embodiments, a TadA domain may be
from
E. coll. In some embodiments, the TadA may be modified, e.g., truncated,
missing one or
more N-terminal and/or C-terminal amino acids relative to a full-length TadA
(e.g., 1, 2, 3, 4,
5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 6, 17, 18, 19, or 20 N-terminal and/or
C terminal amino
acid residues may be missing relative to a full length TadA. In some
embodiments, a TadA
polypeptide or TadA domain does not comprise an N-terminal methionine. In some

embodiments, a wild type E. coil TadA comprises the amino acid sequence of SEQ
ID
NO:183. In some embodiments, a mutated/evolved E. coil TadA* comprises the
amino acid
sequence of SEQ ID NOs:184-187 (e.g., SEQ ID NOs: 184, 185, 186, or 187). In
some
embodiments, a polynucleotide encoding a TadA/TadA* may be codon optimized for

expression in a plant.
A cytosine deaminase catalyzes cytosine deamination and results in a thymidine
(through a uracil intermediate), causing a C to T conversion, or a G to A
conversion in the
complementary strand in the genome. Thus, in some embodiments, the cytosine
deaminase
encoded by the polynucleotide of the invention generates a C->T conversion in
the sense
(e.g., "+"; template) strand of the target nucleic acid or a G ->A conversion
in antisense (e.g.,
"2, complementary) strand of the target nucleic acid.
In some embodiments, the adenine deaminase encoded by the nucleic acid
construct
of the invention generates an A->G conversion in the sense (e.g., "+";
template) strand of the
target nucleic acid or a T->C conversion in the antisense (e.g., "2,
complementary) strand of
the target nucleic acid.

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
The nucleic acid constructs of the invention encoding a base editor comprising
a
sequence-specific DNA binding protein and a cytosine deaminase polypeptide,
and nucleic
acid constructs/expression cassettes/vectors encoding the same, may be used in
combination
with guide nucleic acids for modifying target nucleic acid including, but not
limited to,
generation of C¨>T or G ¨>A mutations in a target nucleic acid including, but
not limited to,
a plasmid sequence; generation of C¨>T or G ¨>A mutations in a coding sequence
to alter an
amino acid identity; generation of C¨>T or G ¨>A mutations in a coding
sequence to generate
a stop codon; generation of C¨>T or G ¨>A mutations in a coding sequence to
disrupt a start
codon; generation of point mutations in genomic DNA to disrupt transcription
factor binding;
and/or generation of point mutations in genomic DNA to disrupt splice
junctions.
The nucleic acid constructs of the invention encoding a base editor comprising
a
sequence-specific DNA binding protein and an adenine deaminase polypeptide,
and
expression cassettes and/or vectors encoding the same may be used in
combination with
guide nucleic acids for modifying a target nucleic acid including, but not
limited to,
generation of A¨>G or T¨>C mutations in a target nucleic acid including, but
not limited to, a
plasmid sequence; generation of A¨>G or T¨>C mutations in a coding sequence to
alter an
amino acid identity; generation of A¨>G or T¨>C mutations in a coding sequence
to generate
a stop codon; generation of A¨>G or T¨>C mutations in a coding sequence to
disrupt a start
codon; generation of point mutations in genomic DNA to disrupt transcription
factor binding;
and/or generation of point mutations in genomic DNA to disrupt splice
junctions.
The nucleic acid constructs of the invention comprising a CRISPR-Cas effector
protein or a fusion protein thereof may be used in combination with a guide
RNA (gRNA,
CRISPR array, CRISPR RNA, crRNA), designed to function with the encoded CRISPR-
Cas
effector protein or domain, to modify a target nucleic acid. A guide nucleic
acid useful with
this invention comprises at least one spacer sequence and at least one repeat
sequence. The
guide nucleic acid is capable of forming a complex with the CRISPR-Cas
nuclease domain
encoded and expressed by a nucleic acid construct of the invention and the
spacer sequence is
capable of hybridizing to a target nucleic acid, thereby guiding the complex
(e.g., a CRISPR-
Cas effector fusion protein (e.g., CRISPR-Cas effector domain fused to a
deaminase domain
and/or a CRISPR-Cas effector domain fused to a peptide tag or an affinity
polypeptide to
recruit a deaminase domain and optionally, a UGI) to the target nucleic acid,
wherein the
target nucleic acid may be modified (e.g., cleaved or edited) or modulated
(e.g., modulating
transcription) by the deaminase domain.
71

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
As an example, a nucleic acid construct encoding a Cas9 domain linked to a
cytosine
deaminase domain (e.g., fusion protein) may be used in combination with a Cas9
guide
nucleic acid to modify a target nucleic acid, wherein the cytosine deaminase
domain of the
fusion protein deaminates a cytosine base in the target nucleic acid, thereby
editing the target
nucleic acid. In a further example, a nucleic acid construct encoding a Cas9
domain linked to
an adenine deaminase domain (e.g., fusion protein) may be used in combination
with a Cas9
guide nucleic acid to modify a target nucleic acid, wherein the adenine
deaminase domain of
the fusion protein deaminates an adenosine base in the target nucleic acid,
thereby editing the
target nucleic acid.
Likewise, a nucleic acid construct encoding a Cas12a domain (or other selected
CRISPR-Cas nuclease, e.g., C2c1, C2c3, Cas12b, Cas12c, Cas12d, Cas12e, Cas13a,
Cas13b,
Cas13c, Cas13d, Casl, Cas1B, Cas2, Cas3, Cas3', Cas3", Cas4, Cas5, Cas6, Cas7,
Cas8, Cas9
(also known as Csnl and Csx12), Cas10, Csyl, Csy2, Csy3, Csel, Cse2, Cscl,
Csc2, Csa5,
Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2,
Csb3,
Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csxl, Csx15, Csfl, Csf2, Csf3, Csf4
(dinG), and/or
Csf5) linked to a cytosine deaminase domain or adenine deaminase domain (e.g.,
fusion
protein) may be used in combination with a Cas12a guide nucleic acid (or the
guide nucleic
acid for the other selected CRISPR-Cas nuclease) to modify a target nucleic
acid, wherein the
cytosine deaminase domain or adenine deaminase domain of the fusion protein
deaminates a
cytosine base in the target nucleic acid, thereby editing the target nucleic
acid.
A "guide nucleic acid," "guide RNA," "gRNA," "CRISPR RNA/DNA" "crRNA" or
"crDNA" as used herein means a nucleic acid that comprises at least one spacer
sequence,
which is complementary to (and hybridizes to) a target DNA (e.g.,
protospacer), and at least
one repeat sequence (e.g., a repeat of a Type V Cas12a CRISPR-Cas system, or a
fragment or
portion thereof; a repeat of a Type II Cas9 CRISPR-Cas system, or fragment
thereof; a repeat
of a Type V C2c1 CRISPR Cas system, or a fragment thereof; a repeat of a
CRISPR-Cas
system of, for example, C2c3, Cas12a (also referred to as Cpfl), Cas12b,
Cas12c, Cas12d,
Cas12e, Cas13a, Cas13b, Cas13c, Cas13d, Casl, Cas1B, Cas2, Cas3, Cas3', Cas3",
Cas4,
Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csx12), Cas10, Csyl,
Csy2, Csy3,
Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3,
Cmr4,
Cmr5, Cmr6, Csbl, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csxl,
Csx15, Csfl,
Csf2, Csf3, Csf4 (dinG), and/or Csf5, or a fragment thereof), wherein the
repeat sequence
may be linked to the 5' end and/or the 3' end of the spacer sequence. The
design of a gRNA
72

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
of this invention may be based on a Type I, Type II, Type III, Type IV, Type
V, or Type VI
CRISPR-Cas system.
In some embodiments, a Cas12a gRNA may comprise, from 5' to 3', a repeat
sequence (full length or portion thereof ("handle"); e.g., pseudoknot-like
structure) and a
spacer sequence.
In some embodiments, a guide nucleic acid may comprise more than one repeat
sequence-spacer sequence (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, or more repeat-
spacer sequences)
(e.g., repeat-spacer-repeat, e.g., repeat-spacer-repeat-spacer-repeat-spacer-
repeat-spacer-
repeat-spacer, and the like). The guide nucleic acids of this invention are
synthetic, human-
made and not found in nature. A gRNA can be quite long and may be used as an
aptamer
(like in the MS2 recruitment strategy) or other RNA structures hanging off the
spacer.
A "repeat sequence" as used herein, refers to, for example, any repeat
sequence of a
wild-type CRISPR Cas locus (e.g., a Cas9 locus, a Cas12a locus, a C2c1 locus,
etc.) or a
repeat sequence of a synthetic crRNA that is functional with the CRISPR-Cas
effector protein
.. encoded by the nucleic acid constructs of the invention. A repeat sequence
useful with this
invention can be any known or later identified repeat sequence of a CRISPR-Cas
locus (e.g.,
Type I, Type II, Type III, Type IV, Type V or Type VI) or it can be a
synthetic repeat
designed to function in a Type I, II, III, IV, V or VI CRISPR-Cas system. A
repeat sequence
may comprise a hairpin structure and/or a stem loop structure. In some
embodiments, a
repeat sequence may form a pseudoknot-like structure at its 5' end (i.e.,
"handle"). Thus, in
some embodiments, a repeat sequence can be identical to or substantially
identical to a repeat
sequence from wild-type Type I CRISPR-Cas loci, Type II, CRISPR-Cas loci, Type
III,
CRISPR-Cas loci, Type IV CRISPR-Cas loci, Type V CRISPR-Cas loci and/or Type
VI
CRISPR-Cas loci. A repeat sequence from a wild-type CRISPR-Cas locus may be
.. determined through established algorithms, such as using the CRISPRfinder
offered through
CRISPRdb (see, Grissa et al. Nucleic Acids Res. 35(Web Server issue):W52-7).
In some
embodiments, a repeat sequence or portion thereof is linked at its 3' end to
the 5' end of a
spacer sequence, thereby forming a repeat-spacer sequence (e.g., guide nucleic
acid, guide
RNA/DNA, crRNA, crDNA).
In some embodiments, a repeat sequence comprises, consists essentially of, or
consists of at least 10 nucleotides depending on the particular repeat and
whether the guide
nucleic acid comprising the repeat is processed or unprocessed (e.g., about
10, 11, 12, 13, 14,
15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33,
34, 35, 36, 37, 38, 39,
40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 to 100 or more nucleotides, or any
range or value
73

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
therein). In some embodiments, a repeat sequence comprises, consists
essentially of, or
consists of about 10 to about 20, about 10 to about 30, about 10 to about 45,
about 10 to
about 50, about 15 to about 30, about 15 to about 40, about 15 to about 45,
about 15 to about
50, about 20 to about 30, about 20 to about 40, about 20 to about 50, about 30
to about 40,
about 40 to about 80, about 50 to about 100 or more nucleotides.
A repeat sequence linked to the 5' end of a spacer sequence can comprise a
portion of
a repeat sequence (e.g., 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19, 20, 21, 22, 23, 24,
25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35 or more contiguous nucleotides of a
wild type repeat
sequence). In some embodiments, a portion of a repeat sequence linked to the
5' end of a
spacer sequence can be about five to about ten consecutive nucleotides in
length (e.g., about
5, 6, 7, 8, 9, 10 nucleotides) and have at least 90% sequence identity (e.g.,
at least about 90%,
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more) to the same region
(e.g., 5' end)
of a wild type CRISPR Cas repeat nucleotide sequence. In some embodiments, a
portion of a
repeat sequence may comprise a pseudoknot-like structure at its 5' end (e.g.,
"handle").
A "spacer sequence" as used herein is a nucleotide sequence that is
complementary to
a target nucleic acid (e.g., target DNA) (e.g, protospacer) (e.g., consecutive
nucleotides of
any one of SEQ ID NOs:1-9; e.g., SEQ ID NOs:175-182). The spacer sequence can
be
fully complementary or substantially complementary (e.g., at least about 70%
complementary
(e.g., about 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%,
83%,
84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%,
99%,
or more)) to a target nucleic acid. Thus, in some embodiments, the spacer
sequence can have
one, two, three, four, or five mismatches as compared to the target nucleic
acid, which
mismatches can be contiguous or noncontiguous. In some embodiments, the spacer
sequence
can have 70% complementarity to a target nucleic acid. In other embodiments,
the spacer
nucleotide sequence can have 80% complementarity to a target nucleic acid. In
still other
embodiments, the spacer nucleotide sequence can have 85%, 90%, 95%, 96%, 97%,
98%,
99% or 99.5% complementarity, and the like, to the target nucleic acid
(protospacer). In
some embodiments, the spacer sequence is 100% complementary to the target
nucleic acid.
A spacer sequence may have a length from about 15 nucleotides to about 30
nucleotides (e.g.,
15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides,
or any range or
value therein). Thus, in some embodiments, a spacer sequence may have complete

complementarity or substantial complementarity over a region of a target
nucleic acid (e.g.,
protospacer) that is at least about 15 nucleotides to about 30 nucleotides in
length. In some
74

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
embodiments, the spacer is about 20 nucleotides in length. In some
embodiments, the spacer
is about 21, 22, or 23 nucleotides in length.
In some embodiments, the 5' region of a spacer sequence of a guide nucleic
acid may
be identical to a target DNA, while the 3' region of the spacer may be
substantially
complementary to the target DNA (e.g., Type V CRISPR-Cas), or the 3' region of
a spacer
sequence of a guide nucleic acid may be identical to a target DNA, while the
5' region of the
spacer may be substantially complementary to the target DNA (e.g., Type II
CRISPR-Cas),
and therefore, the overall complementarity of the spacer sequence to the
target DNA may be
less than 100%. Thus, for example, in a guide for a Type V CRISPR-Cas system,
the first 1,
2, 3, 4, 5, 6, 7, 8, 9, 10 nucleotides in the 5' region (i.e., seed region)
of, for example, a 20
nucleotide spacer sequence may be 100% complementary to the target DNA, while
the
remaining nucleotides in the 3' region of the spacer sequence are
substantially
complementary (e.g., at least about 70% complementary) to the target DNA. In
some
embodiments, the first 1 to 8 nucleotides (e.g., the first 1, 2, 3, 4, 5, 6,
7, 8, nucleotides, and
any range therein) of the 5' end of the spacer sequence may be 100%
complementary to the
target DNA, while the remaining nucleotides in the 3' region of the spacer
sequence are
substantially complementary (e.g., at least about 50% complementary (e.g.,
50%, 55%, 60%,
65%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%,
84%,
85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or
more)) to the target DNA.
As a further example, in a guide for a Type II CRISPR-Cas system, the first 1,
2, 3, 4,
5, 6, 7, 8, 9, 10 nucleotides in the 3' region (i.e., seed region) of, for
example, a 20 nucleotide
spacer sequence may be 100% complementary to the target DNA, while the
remaining
nucleotides in the 5' region of the spacer sequence are substantially
complementary (e.g., at
.. least about 70% complementary) to the target DNA. In some embodiments, the
first 1 to 10
nucleotides (e.g., the first 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 nucleotides, and
any range therein) of the
3' end of the spacer sequence may be 100% complementary to the target DNA,
while the
remaining nucleotides in the 5' region of the spacer sequence are
substantially
complementary (e.g., at least about 50% complementary (e.g., at least about
50%, 55%, 60%,
65%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%,
84%,
85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or
more or any range or value therein)) to the target DNA.

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
In some embodiments, a seed region of a spacer may be about 8 to about 10
nucleotides in length, about 5 to about 6 nucleotides in length, or about 6
nucleotides in
length.
As used herein, a "target nucleic acid", "target DNA," "target nucleotide
sequence,"
"target region," or a "target region in the genome" refers to a region of a
plant's genome that
is fully complementary (100% complementary) or substantially complementary
(e.g., at least
70% complementary (e.g., 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%,
80%,
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%,
96%,
97%, 98%, 99%, or more)) to a spacer sequence in a guide nucleic acid of this
invention. A
target region useful for a CRISPR-Cas system may be located immediately 3'
(e.g., Type V
CRISPR-Cas system) or immediately 5' (e.g., Type II CRISPR-Cas system) to a
PAM
sequence in the genome of the organism (e.g., a plant genome). A target region
may be
selected from any region of at least 15 consecutive nucleotides (e.g., 16, 17,
18, 19, 20, 21,
22, 23, 24, 25, 26, 27, 28, 29, 30 nucleotides, and the like) located
immediately adjacent to a
PAM sequence.
A "protospacer sequence" refers to the target double stranded DNA and
specifically
to the portion of the target DNA (e.g., or target region in the genome) that
is fully or
substantially complementary (and hybridizes) to the spacer sequence of the
CRISPR repeat-
spacer sequences (e.g., guide nucleic acids, CRISPR arrays, crRNAs).
In the case of Type V CRISPR-Cas (e.g., Cas12a) systems and Type II CRISPR-Cas
(Cas9) systems, the protospacer sequence is flanked by (e.g., immediately
adjacent to) a
protospacer adjacent motif (PAM). For Type IV CRISPR-Cas systems, the PAM is
located at
the 5' end on the non-target strand and at the 3' end of the target strand
(see below, as an
example).
5'- -3' RNA Spacer (SEQ ID NO:198)
1 1 1 1 1 1 1 111111 1 11 11 1 1
3'AAAN N-5' Target strand (SEQ ID NO:199)
1 1 1 1
5'TTT NNNN-3' Non-target strand (SEQ ID NO:200)
In the case of Type II CRISPR-Cas (e.g., Cas9) systems, the PAM is located
immediately 3' of the target region. The PAM for Type I CRISPR-Cas systems is
located 5'
of the target strand. There is no known PAM for Type III CRISPR-Cas systems.
Makarova
et al. describes the nomenclature for all the classes, types and subtypes of
CRISPR systems
(Nature Reviews Microbiology 13:722-736 (2015)). Guide structures and PAMs are
described in by R. Barrangou (Genome Biol. 16:247 (2015)).
76

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
Canonical Cas12a PAMs are T rich. In some embodiments, a canonical Cas12a
PAM sequence may be 5'-TTN, 5'-TTTN, or 5'-TTTV. In some embodiments,
canonical
Cas9 (e.g., S. pyogenes) PAMs may be 5'-NGG-3'. In some embodiments, non-
canonical
PAMs may be used but may be less efficient.
Additional PAM sequences may be determined by those skilled in the art through
established experimental and computational approaches. Thus, for example,
experimental
approaches include targeting a sequence flanked by all possible nucleotide
sequences and
identifying sequence members that do not undergo targeting, such as through
the
transformation of target plasmid DNA (Esvelt et al. 2013. Nat. Methods 10:1116-
1121; Jiang
et al. 2013. Nat. Biotechnol. 31:233-239). In some aspects, a computational
approach can
include performing BLAST searches of natural spacers to identify the original
target DNA
sequences in bacteriophages or plasmids and aligning these sequences to
determine
conserved sequences adjacent to the target sequence (Briner and Barrangou.
2014. Appl.
Environ. Microbiol. 80:994-1001; Mojica et al. 2009. Microbiology 155:733-
740).
In some embodiments, the present invention provides expression cassettes
and/or
vectors comprising the nucleic acid constructs of the invention (e.g, one or
more components
of an editing system of the invention). In some embodiments, expression
cassettes and/or
vectors comprising the nucleic acid constructs of the invention and/or one or
more guide
nucleic acids may be provided. In some embodiments, a nucleic acid construct
of the
invention encoding a base editor (e.g., a construct comprising a CRISPR-Cas
effector protein
and a deaminase domain (e.g., a fusion protein)) or the components for base
editing (e.g., a
CRISPR-Cas effector protein fused to a peptide tag or an affinity polypeptide,
a deaminase
domain fused to a peptide tag or an affinity polypeptide, and/or a UGI fused
to a peptide tag
or an affinity polypeptide), may be comprised on the same or on a separate
expression
cassette or vector from that comprising the one or more guide nucleic acids.
When the
nucleic acid construct encoding a base editor or the components for base
editing is/are
comprised on separate expression cassette(s) or vector(s) from that comprising
the guide
nucleic acid, a target nucleic acid may be contacted with (e.g., provided
with) the expression
cassette(s) or vector(s) encoding the base editor or components for base
editing in any order
from one another and the guide nucleic acid, e.g., prior to, concurrently
with, or after the
expression cassette comprising the guide nucleic acid is provided (e.g.,
contacted with the
target nucleic acid).
Fusion proteins of the invention may comprise sequence-specific DNA binding
domains, CRISPR-Cas polypeptides, and/or deaminase domains fused to peptide
tags or
77

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
affinity polypeptides that interact with the peptide tags, as known in the
art, for use in
recruiting the deaminase to the target nucleic acid. Methods of recruiting may
also comprise
guide nucleic acids linked to RNA recruiting motifs and deaminases fused to
affinity
polypeptides capable of interacting with RNA recruiting motifs, thereby
recruiting the
deaminase to the target nucleic acid. Alternatively, chemical interactions may
be used to
recruit polypeptides (e.g., deaminases) to a target nucleic acid.
A peptide tag (e.g., epitope) useful with this invention may include, but is
not limited
to, a GCN4 peptide tag (e.g., Sun-Tag), a e-Myc affinity tag, an HA affinity
tag, a His affinity
tag, an S affinity tag, a rnetiiionine-His affinity tag an RGD-His affinity
tag, a FLAG
octapeptide, a strep tag or strep tag 11, a V5 tag, and/or a VS V-G epitope.
Any epitope that
may be linked to a polypeptide and for which there is a corresponding affinity
polypeptide
that may be linked to another polypeptide may be used with this invention as a
peptide tag. In
some embodiments, a peptide tag may comprise 1 or 2 or more copies of a
peptide tag (e.g.,
repeat unit, multimerized epitope (e.g., tandem repeats)) (e.g., 1, 2, 3, 4,
5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or more repeat units.
In some
embodiments, an affinity polypeptide that interacts with/binds to a peptide
tag may be an
antibody. In some embodiments, the antibody may be a scFv antibody. In some
embodiments, an affinity polypeptide that binds to a peptide tag may be
synthetic (e.g.,
evolved for affinity interaction) including, but not limited to, an affibody,
an anticalin, a
.. monobody and/or a DARPin (see, e.g., Sha et al., Protein Sci. 26(5):910-924
(2017));
Gilbreth (Curr Opin Struc Blot 22(4):413-420 (2013)), U.S. Patent No.
9,982,053, each of
which are incorporated by reference in their entireties for the teachings
relevant to affibodies,
anticalins, monobodies and/or DARPins.
In some embodiments, a guide nucleic acid may be linked to an RNA recruiting
motif,
and a polypeptide to be recruited (e.g., a deaminase) may be fused to an
affinity polypeptide
that binds to the RNA recruiting motif, wherein the guide binds to the target
nucleic acid and
the RNA recruiting motif binds to the affinity polypeptide, thereby recruiting
the polypeptide
to the guide and contacting the target nucleic acid with the polypeptide
(e.g., deaminase). In
some embodiments, two or more polypeptides may be recruited to a guide nucleic
acid,
thereby contacting the target nucleic acid with two or more polypeptides
(e.g., deaminases).
In some embodiments, a polypeptide fused to an affinity polypeptide may be a
reverse
transcriptase and the guide nucleic acid may be an extended guide nucleic acid
linked to an
RNA recruiting motif. In some embodiments, an RNA recruiting motif may be
located on
the 3' end of the extended portion of an extended guide nucleic acid (e.g., 5'-
3', repeat¨spacer-
78

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
extended portion (RT template-primer binding site)-RNA recruiting motif). In
some
embodiments, an RNA recruiting motif may be embedded in the extended portion.
In some embodiments of the invention, an extended guide RNA and/or guide RNA
may be linked to one or to two or more RNA recruiting motifs (e.g., 1, 2, 3,
4, 5, 6, 7, 8, 9, 10
or more motifs, e.g., at least 10 to about 25 motifs), optionally wherein the
two or more RNA
recruiting motifs may be the same RNA recruiting motif or different RNA
recruiting motifs.
In some embodiments, an RNA recruiting motif and corresponding affinity
polypeptide may
include, but is not limited, to a telomerase Ku binding motif (e.g., Ku
binding hairpin) and
the corresponding affinity polypeptide Ku (e.g., Ku heterodimer), a telomerase
Sm7 binding
motif and the corresponding affinity polypeptide Sm7, an MS2 phage operator
stem-loop and
the corresponding affinity polypeptide MS2 Coat Protein (MCP), a PP7 phage
operator stem-
loop and the corresponding affinity polypeptide PP7 Coat Protein (PCP), an
SfMu phage
Com stem-loop and the corresponding affinity polypeptide Com RNA binding
protein, a PUF
binding site (PBS) and the affinity polypeptide Pumilio/fem-3 mRNA binding
factor (PUF),
and/or a synthetic RNA-aptamer and the aptamer ligand as the corresponding
affinity
polypeptide. In some embodiments, the RNA recruiting motif and corresponding
affinity
polypeptide may be an M52 phage operator stem-loop and the affinity
polypeptide M52 Coat
Protein (MCP). In some embodiments, the RNA recruiting motif and corresponding
affinity
polypeptide may be a PUF binding site (PBS) and the affinity polypeptide
Pumilio/fem-3
.. mRNA binding factor (PUF).
In some embodiments, the components for recruiting polypeptides and nucleic
acids
may those that function through chemical interactions that may include, but
are not limited to,
rapamycin-inducible dimerization of FRB - FKBP; Biotin-streptavidin; SNAP tag;
Halo tag;
CLIP tag; DmrA-DmrC heterodimer induced by a compound; bifunctional ligand
(e.g., fusion
of two protein-binding chemicals together, e.g., dihyrofolate reductase
(DHFR).
In some embodiments, the nucleic acid constructs, expression cassettes or
vectors of
the invention that are optimized for expression in a plant may be about 70% to
100%
identical (e.g., about 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%,
81%,
82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%,
97%,
.. 98%, 99%, 99.5% or 100%) to the nucleic acid constructs, expression
cassettes or vectors
comprising the same polynucleotide(s) but which have not been codon optimized
for
expression in a plant.
79

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
In some embodiments, the invention provides cells comprising one or more
polynucleotides, guide nucleic acids, nucleic acid constructs, expression
cassettes or vectors
of the invention.
In some embodiments, a method of editing an endogenous HD-Zip transcription
factor gene in a plant or plant part is provided, the method comprising
contacting a target site
in the HD-Zip transcription factor gene in the plant or plant part with a
cytosine base editing
system comprising a cytosine deaminase and a nucleic acid binding domain that
binds to a
target site in the HD-Zip transcription factor, the HD-Zip transcription
factor gene encoding
(a) a polypeptide comprising a sequence having at least 80% sequence identity
to the amino
acid sequence of SEQ ID NO:! or SEQ ID NO:2; (b) a polypeptide comprising a
sequence
having at least 80% sequence identity to the amino acid sequence of SEQ ID
NO:3; (c) a
polypeptide comprising a sequence having at least 95% sequence identity to the
nucleotide
sequence of SEQ ID NO:4; (d) a polypeptide comprising a sequence having the
amino acid
sequence of SEQ ID NO:5, wherein X is L or S; (e) a polypeptide comprising:
(i) a sequence
having the amino acid sequence of RKKLRLX1KX2QX3(SEQ ID NO:6), wherein Xi is S
or
T, X2 is D or E and X3 is S or A; (ii) a sequence having the amino acid
sequence of PX1X2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V or M
and X4 is
Q, S or N; (iii) a sequence having the amino acid sequence of ENRRLX1X2EX3,
(SEQ ID
NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and a sequence
having the amino
acid sequence of VWFQNRRA (SEQ ID NO:9) and/or (f) a polypeptide comprising a
sequence having the amino acid sequence of VWFQNRRA (SEQ ID NO:9), thereby
editing
the endogenous HD-Zip transcription factor gene in the plant or part thereof
and producing a
plant or part thereof comprising at least one cell having a mutation in the
endogenous HD-Zip
transcription factor gene.
In some embodiments, a method of editing an endogenous HD-Zip transcription
factor gene in a plant or plant part is provided, the method comprising
contacting a target site
in the HD-Zip transcription factor gene in the plant or plant part with a
adenosine base editing
system comprising an adenosine deaminase and a nucleic acid binding domain
that binds to a
target site in the HD-Zip transcription factor, the HD-Zip transcription
factor gene encoding
.. (a) a polypeptide comprising a sequence having at least 80% sequence
identity to the amino
acid sequence of SEQ ID NO:! or SEQ ID NO:2; (b) a polypeptide comprising a
sequence
having at least 80% sequence identity to the amino acid sequence of SEQ ID
NO:3; (c) a
polypeptide comprising a sequence having at least 95% sequence identity to the
nucleotide
sequence of SEQ ID NO:4; (d) a polypeptide comprising a sequence having the
amino acid

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
sequence of SEQ ID NO:5, wherein X is L or S; (e) a polypeptide comprising:
(i) a sequence
having the amino acid sequence of RKKLRLX1KX2QX3 (SEQ ID NO:6), wherein Xi is
S or
T, X2 is D or E and X3 is S or A; (ii) a sequence having the amino acid
sequence of PX1X2
X2LTX3CPX4CER(SEQ ID NO:8), wherein Xi is P or A, X2 is T or A, X3 is V or M
and X4 is
Q, S or N; (iii) a sequence having the amino acid sequence of ENRRLX1X2EX3,
(SEQ ID
NO:7), wherein Xi is Q or H, X2 is R or K and X3 is V or L; and a sequence
having the amino
acid sequence of VWFQNRRA (SEQ ID NO:9) and/or (f) a polypeptide comprising a
sequence having the amino acid sequence of VWFQNRRA (SEQ ID NO:9), thereby
editing
the endogenous HD-Zip transcription factor gene in the plant or part thereof
and producing a
plant or part thereof comprising at least one cell having a mutation in the
endogenous HD-Zip
transcription factor gene.
In some embodiments, a method of detecting a mutant HD-Zip (a mutation in an
endogenous HD-Zip transcription factor gene) is provide, the method comprising
detecting in
the genome of a plant a deletion in a nucleic acid encoding the amino acid
sequence of any
one of SEQ ID NOs:1-98, wherein the amino acid sequence of any one of SEQ ID
NOs:1-
98 comprises the amino acid sequence of SEQ ID NO:9, and the deletion is in
the nucleotide
sequence encoding the amino acid sequence of SEQ ID NO:9.
In some embodiments, the present invention provides a method of detecting a
mutation in an endogenous HD-Zip gene, comprising detecting in the genome of a
plant a
nucleotide sequence encoding at least one of any one of the polypeptide
sequences of SEQ
ID NOs:1-98.
In some embodiments, the present invention provides a method of producing a
plant
comprising a mutation in an endogenous HD-Zip transcription factor gene and at
least one
polynucleotide of interest, the method comprising crossing a plant of the
invention
comprising at least one mutation in an endogenous HD-Zip transcription factor
gene (a first
plant) with a second plant that comprises the at least one polynucleotide of
interest to produce
progeny plants; and selecting progeny plants comprising at least one mutation
in the HD-Zip
transcription factor gene and the at least one polynucleotide of interest,
thereby producing the
plant comprising a mutation in an endogenous HD-Zip transcription factor gene
and at least
one polynucleotide of interest.
The present invention further provides a method of producing a plant
comprising a
mutation in an endogenous HD-Zip transcription factor gene and at least one
polynucleotide
of interest, the method comprising introducing at least one polynucleotide of
interest into a
plant of the present invention comprising at least one mutation in a HD-Zip
transcription
81

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
factor gene, thereby producing a plant comprising at least one mutation in a
HD-Zip
transcription factor gene and at least one polynucleotide of interest.
In some embodiments, the present invention provides a method of producing a
plant
comprising a mutation in an endogenous HD-Zip transcription factor gene and at
least one
polynucleotide of interest, the method comprising introducing at least one
polynucleotide of
interest into a plant of the invention comprising at least one mutation in an
endogenous HD-
Zip transcription factor gene, thereby producing a plant comprising at least
one mutation in a
HD-Zip transcription factor gene and at least one polynucleotide of interest.
A polynucleotide of interest may be any polynucleotide that can confer a
desirable
phenotype or otherwise modify the phenotype or genotype of a plant. In some
embodiments,
a polynucleotide of interest may be polynucleotide that confers herbicide
tolerance, insect
resistance, disease resistance, increased yield, increased nutrient use
efficiency or abiotic
stress resistance.
In some embodiments, a method of producing a plant comprising a mutation in an
endogenous HD-Zip transcription factor gene and having a dwarf or short
stature phenotype
is provided, the method comprising crossing a plant of the invention (a first
plant) having at
least one mutation in an endogenous HD-Zip transcription factor gene with a
second plant
that comprises the dwarf or short stature phenotype to produce progeny plants;
and selecting
progeny plants comprising the at least one mutation in the HD-Zip
transcription factor gene
and the dwarf or short stature phenotype, thereby producing the plant having a
dwarf or short
stature and comprising at least one mutation in an endogenous HD-Zip
transcription factor
gene.
The present invention further provides a method of controlling weeds in a
container
(e.g., pot, or seed tray and the like), a growth chamber, a greenhouse, a
field (e.g., a
cultivated field), a recreational area, a lawn, and/or a roadside comprising
one or more (a
plurality) plants of the present invention, comprising applying an herbicide
to one or more (a
plurality) plants of the present invention growing in the container, growth
chamber, field or
greenhouse, thereby controlling the weeds in the container, growth chamber,
greenhouse,
field, recreational area, a lawn, and/or a roadside in which the one or more
plants are
growing.
In some embodiments, a method of reducing insect predation on a plant (or a
plurality
of plants) is provided, comprising applying an insecticide to one or more (a
plurality) plants
of the present invention, thereby reducing the insect predation on the one or
more (a plurality)
plants. In some embodiments the one or more plants may be growing in a
container, a
82

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
growth chamber, a field, a recreational area (e.g., playing field, golf
course), a lawn, roadside,
or a greenhouse.
In some embodiments, the present invention provides a method of reducing
fungal
disease on a plant, comprising applying a fungicide to one or more (a
plurality) plants of the
present invention, thereby reducing fungal disease on the on the one or more
(a plurality)
plants. In some embodiments the one or more plants may be growing in a
container, a
growth chamber, a field, a recreational area (e.g., playing field, golf
course), a lawn, a
roadside, or a greenhouse.
The nucleic acid constructs of the invention (e.g., a construct comprising a
sequence
specific DNA binding domain, a CRISPR-Cas effector domain, a deaminase domain,
reverse
transcriptase (RT), RT template and/or a guide nucleic acid, etc.) and
expression
cassettes/vectors comprising the same may be used as an editing system of this
invention for
modifying target nucleic acids and/or their expression.
A target nucleic acid of any plant or plant part (or groupings of plants, for
example,
into a genus or higher order classification) may be modified (e.g., mutated,
e.g., base edited,
cleaved, nicked, etc.) using the polypeptides, polynucleotides, RNPs, nucleic
acid constructs,
expression cassettes, and/or vectors of the invention including an angiosperm,
a gymnosperm,
a monocot, a dicot, a C3, C4, CAM plant, a bryophyte, a fern and/or fern ally,
a microalgae,
and/or a macroalgae. A plant and/or plant part that may be modified as
described herein may
be a plant and/or plant part of any plant species/variety/cultivar. In some
embodiments, a
plant that may be modified as described herein is a monocot. In some
embodiments, a plant
that may be modified as described herein is a dicot.
The term "plant part," as used herein, includes but is not limited to
reproductive
tissues (e.g., petals, sepals, stamens, pistils, receptacles, anthers, pollen,
flowers, fruits,
flower bud, ovules, seeds, embryos, nuts, kernels, ears, cobs and husks);
vegetative tissues
(e.g., petioles, stems, roots, root hairs, root tips, pith, coleoptiles,
stalks, shoots, branches,
bark, apical meristem, axillary bud, cotyledon, hypocotyls, and leaves);
vascular tissues (e.g.,
phloem and xylem); specialized cells such as epidermal cells, parenchyma
cells,
chollenchyma cells, schlerenchyma cells, stomates, guard cells, cuticle,
mesophyll cells;
callus tissue; and cuttings. The term "plant part" also includes plant cells,
including plant
cells that are intact in plants and/or parts of plants, plant protoplasts,
plant tissues, plant
organs, plant cell tissue cultures, plant calli, plant clumps, and the like.
As used herein,
"shoot" refers to the above ground parts including the leaves and stems. As
used herein, the
term "tissue culture" encompasses cultures of tissue, cells, protoplasts and
callus.
83

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
As used herein, "plant cell" refers to a structural and physiological unit of
the plant,
which typically comprise a cell wall but also includes protoplasts. A plant
cell of the present
invention can be in the form of an isolated single cell or can be a cultured
cell or can be a part
of a higher-organized unit such as, for example, a plant tissue (including
callus) or a plant
organ. In some embodiments, a plant cell can be an algal cell. A "protoplast"
is an isolated
plant cell without a cell wall or with only parts of the cell wall. Thus, in
some embodiments
of the invention, a transgenic cell comprising a nucleic acid molecule and/or
nucleotide
sequence of the invention is a cell of any plant or plant part including, but
not limited to, a
root cell, a leaf cell, a tissue culture cell, a seed cell, a flower cell, a
fruit cell, a pollen cell,
and the like. In some aspects of the invention, the plant part can be a plant
germplasm. In
some aspects, a plant cell can be non-propagating plant cell that does not
regenerate into a
plant.
"Plant cell culture" means cultures of plant units such as, for example,
protoplasts,
cell culture cells, cells in plant tissues, pollen, pollen tubes, ovules,
embryo sacs, zygotes and
embryos at various stages of development.
As used herein, a "plant organ" is a distinct and visibly structured and
differentiated
part of a plant such as a root, stem, leaf, flower bud, or embryo.
"Plant tissue" as used herein means a group of plant cells organized into a
structural
and functional unit. Any tissue of a plant in planta or in culture is
included. This term
includes, but is not limited to, whole plants, plant organs, plant seeds,
tissue culture and any
groups of plant cells organized into structural and/or functional units. The
use of this term in
conjunction with, or in the absence of, any specific type of plant tissue as
listed above or
otherwise embraced by this definition is not intended to be exclusive of any
other type of
plant tissue.
In some embodiments of the invention, a transgenic tissue culture or
transgenic plant
cell culture is provided, wherein the transgenic tissue or cell culture
comprises a nucleic acid
molecule/nucleotide sequence of the invention. In some embodiments, transgenes
may be
eliminated from a plant developed from the transgenic tissue or cell by
breeding of the
transgenic plant with a non-transgenic plant and selecting among the progeny
for the plants
.. comprising the desired gene edit and not the transgenes used in producing
the edit.
Any plant comprising an endogenous HD-Zip transcription factor gene that is
capable
of regulating a Shade Avoidance Response (SAR) in a plant may be modified as
described
herein to reduce/attenuate or eliminate SAR in the plant. In some embodiments,
a plant may
be a monocot. In some embodiments, a plant may be a dicot.
84

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
Non-limiting examples of plants that may be modified as described herein may
include, but are not limited to, turf grasses (e.g., bluegrass, bentgrass,
ryegrass, fescue),
feather reed grass, tufted hair grass, miscanthus, arundo, switchgrass,
vegetable crops,
including artichokes, kohlrabi, arugula, leeks, asparagus, lettuce (e.g.,
head, leaf, romaine),
malanga, melons (e.g., muskmelon, watermelon, crenshaw, honeydew, cantaloupe),
cole
crops (e.g., brussels sprouts, cabbage, cauliflower, broccoli, collards, kale,
chinese cabbage,
bok choy), cardoni, carrots, napa, okra, onions, celery, parsley, chick peas,
parsnips, chicory,
peppers, potatoes, cucurbits (e.g., marrow, cucumber, zucchini, squash,
pumpkin, honeydew
melon, watermelon, cantaloupe), radishes, dry bulb onions, rutabaga, eggplant,
salsify,
escarole, shallots, endive, garlic, spinach, green onions, squash, greens,
beet (sugar beet and
fodder beet), sweet potatoes, chard, horseradish, tomatoes, turnips, and
spices; a fruit crop
such as apples, apricots, cherries, nectarines, peaches, pears, plums, prunes,
cherry, quince,
fig, nuts (e.g., chestnuts, pecans, pistachios, hazelnuts, pistachios,
peanuts, walnuts,
macadamia nuts, almonds, and the like), citrus (e.g., clementine, kumquat,
orange, grapefruit,
tangerine, mandarin, lemon, lime, and the like), blueberries, black
raspberries, boysenberries,
cranberries, currants, gooseberries, loganberries, raspberries, strawberries,
blackberries,
grapes (wine and table), avocados, bananas, kiwi, persimmons, pomegranate,
pineapple,
tropical fruits, pomes, melon, mango, papaya, and lychee, a field crop plant
such as clover,
alfalfa, timothy, evening primrose, meadow foam, corn/maize (field, sweet,
popcorn), hops,
jojoba, buckwheat, safflower, quinoa, wheat, rice, barley, rye, millet,
sorghum, oats, triticale,
sorghum, tobacco, kapok, a leguminous plant (beans (e.g., green and dried),
lentils, peas,
soybeans), an oil plant (rape, canola, mustard, poppy, olive, sunflower,
coconut, castor oil
plant, cocoa bean, groundnut, oil palm), duckweed, Arabidopsis, a fiber plant
(cotton, flax,
hemp, jute), Cannabis (e.g., Cannabis sativa,Cannabis indica, and Cannabis
ruderalis),
lauraceae (cinnamon, camphor), or a plant such as coffee, sugar cane, tea, and
natural rubber
plants; and/or a bedding plant such as a flowering plant, a cactus, a
succulent and/or an
ornamental plant (e.g., roses, tulips, violets), as well as trees such as
forest trees (broad-
leaved trees and evergreens, such as conifers; e.g., elm, ash, oak, maple,
fir, spruce, cedar,
pine, birch, cypress, eucalyptus, willow), as well as shrubs and other nursery
stock. In some
embodiments, the nucleic acid constructs of the invention and/or expression
cassettes and/or
vectors encoding the same may be used to modify maize, soybean, wheat, canola,
rice,
tomato, pepper, or sunflower.
In some embodiments, a plant that may be modified as described herein may
include, but is not limited to, corn, soy, canola, wheat, rice, cotton,
sugarcane, sugar beet,

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
barley, oats, alfalfa, sunflower, safflower, oil palm, sesame, coconut,
tobacco, potato, sweet
potato, cassava, coffee, apple, plum, apricot, peach, cherry, pear, fig,
banana, citrus, cocoa,
avocado, olive, almond, walnut, strawberry, watermelon, pepper, grape, tomato,
cucumber, or
a Brass/ca spp (e.g., B. napus, B. oleraceae, B. rapa, B. juncea, and/or B.
nigra).
In some embodiments, a plant that may be modified as described herein is corn
(i.e.,
maize, Zea mays, optionally wherein the corn plant comprises a short
stature/semi-dwarf
phenotype.
In some embodiments, a plant that may be modified as described herein is wheat
(e.g.,
Triticum aestivum, T durum, and/or T compactum). In some embodiments, a wheat
plant
may comprise at least one non-natural mutation in an endogenous HD-Zip
transcription factor
in its A genome, in its B genome, and/or in its D genome.
Thus, plants or plant cultivars which are to be treated with preference in
accordance
with the invention include all plants which, through genetic modification,
received genetic
material which imparts particular advantageous useful properties ("traits") to
these plants.
Examples of such properties are better plant growth, vigor, stress tolerance,
standability,
lodging resistance, nutrient uptake, plant nutrition, and/or yield, in
particular improved
growth, increased tolerance to high or low temperatures, increased tolerance
to drought or to
levels of water or soil salinity, enhanced flowering performance, easier
harvesting,
accelerated ripening, higher yields, higher quality and/or a higher
nutritional value of the
harvested products, better storage life and/or processability of the harvested
products.
Further and particularly emphasized examples of such properties are an
increased
resistance against animal and microbial pests, such as against insects,
arachnids, nematodes,
mites, slugs and snails owing, for example, to toxins formed in the plants.
Among DNA
sequences encoding proteins which confer properties of tolerance to such
animal and
microbial pests, in particular insects, mention will particularly be made of
the genetic
material from Bacillus thuringiensis encoding the Bt proteins widely described
in the
literature and well known to those skilled in the art. Mention will also be
made of
proteins extracted from bacteria such as Photorhabdus (W097/17432 and
W098/08932).
In particular, mention will be made of the Bt Cry or VIP proteins which
include the
Cry1A, CryIAb, CrylAc, CryIIA, CryIIIA, CryIIIB2, Cry9c Cry2Ab, Cry3Bb and
CryIF
proteins or toxic fragments thereof and also hybrids or combinations thereof,
especially
the CrylF protein or hybrids derived from a CrylF protein (e.g. hybrid Cry1A-
CrylF
proteins or toxic fragments thereof), the Cry1A-type proteins or toxic
fragments thereof,
preferably the CrylAc protein or hybrids derived from the CrylAc protein (e.g.
hybrid
86

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
CrylAb-CrylAc proteins) or the CrylAb or Bt2 protein or toxic fragments
thereof, the
Cry2Ae, Cry2Af or Cry2Ag proteins or toxic fragments thereof, the Cry1A.105
protein or
a toxic fragment thereof, the VIP3Aa19 protein, the VIP3Aa20 protein, the
VIP3A
proteins produced in the C0T202 or C0T203 cotton events, the VIP3Aa protein or
a
toxic fragment thereof as described in Estruch et al. (1996), Proc Natl Acad
Sci US A.
28;93(11):5389-94, the Cry proteins as described in W02001/47952, the
insecticidal
proteins from Xenorhabdus (as described in W098/50427), Serratia (particularly
from S.
entomophila) or Photorhabdus species strains, such as Tc-proteins from
Photorhabdus as
described in W098/08932. Also, any variants or mutants of any one of these
proteins
differing in some amino acids (1-10, preferably 1-5) from any of the above-
named
sequences, particularly the sequence of their toxic fragment, or which are
fused to a
transit peptide, such as a plastid transit peptide, or another protein or
peptide, is included
herein.
Another and particularly emphasized example of such properties is conferred
tolerance to one or more herbicides, for example imidazolinones,
sulphonylureas, glyphosate
or phosphinothricin. Among DNA sequences encoding proteins (i.e.,
polynucleotides of
interest) which confer properties of tolerance to certain herbicides on the
transformed plant
cells and plants, mention will be particularly be made to the bar or PAT gene
or the
Streptomyces coelicolor gene described in W02009/152359 which confers
tolerance to
glufosinate herbicides, a gene encoding a suitable EPSPS (5-
Enolpyruvylshikimat-3-
phosphat-Synthase) which confers tolerance to herbicides having EPSPS as a
target,
especially herbicides such as glyphosate and its salts, a gene encoding
glyphosate-n-
acetyltransferase, or a gene encoding glyphosate oxidoreductase. Further
suitable herbicide
tolerance traits include at least one ALS (acetolactate synthase) inhibitor
(e.g.,
W02007/024782), a mutated Arabidopsis ALS/AHAS gene (e.g., U.S. Patent
6,855,533),
genes encoding 2,4-D-monooxygenases conferring tolerance to 2,4-D (2,4-
dichlorophenoxyacetic acid) and genes encoding Dicamba monooxygenases
conferring
tolerance to dicamba (3,6-dichloro-2- methoxybenzoic acid).
Further and particularly emphasized examples of such properties are increased
resistance
against phytopathogenic fungi, bacteria and/or viruses owing, for example, to
systemic
acquired resistance (SAR), systemin, phytoalexins, elicitors and also
resistance genes and
correspondingly expressed proteins and toxins.
Particularly useful transgenic events in transgenic plants or plant cultivars
which can
be treated with preference in accordance with the invention include Event 531/
PV-
87

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
GHBK04 (cotton, insect control, described in W02002/040677), Event 1143-14A
(cotton, insect control, not deposited, described in W02006/128569); Event
1143-51B
(cotton, insect control, not deposited, described in W02006/128570); Event
1445
(cotton, herbicide tolerance, not deposited, described in US-A 2002-120964 or
W02002/034946); Event 17053 (rice, herbicide tolerance, deposited as PTA-9843,
described in W02010/117737); Event 17314 (rice, herbicide tolerance, deposited
as
PTA-9844, described in W02010/117735); Event 281-24-236 (cotton, insect
control -
herbicide tolerance, deposited as PTA-6233, described in W02005/103266 or US-A

2005-216969); Event 3006-210-23 (cotton, insect control - herbicide tolerance,
deposited
as PTA-6233, described in US-A 2007-143876 orW02005/103266); Event 3272 (corn,
quality trait, deposited as PTA-9972, described in W02006/098952 or US-A 2006-
230473); Event 33391 (wheat, herbicide tolerance, deposited as PTA-2347,
described in
W02002/027004), Event 40416 (corn, insect control - herbicide tolerance,
deposited as
ATCC PTA-11508, described in WO 11/075593); Event 43A47 (corn, insect control -

herbicide tolerance, deposited as ATCC PTA-11509, described in W02011/075595);
Event 5307 (corn, insect control, deposited as ATCC PTA-9561, described in
W02010/077816); Event ASR-368 (bent grass, herbicide tolerance, deposited as
ATCC
PTA-4816, described in US-A 2006-162007 or W02004/053062); Event B16 (corn,
herbicide tolerance, not deposited, described in US-A 2003-126634); Event BPS-
CV127-
9 (soybean, herbicide tolerance, deposited as NCIMB No. 41603, described in
W02010/080829); Event BLR1 (oilseed rape, restoration of male sterility,
deposited as
NCIMB 41193, described in W02005/074671), Event CE43-67B (cotton, insect
control,
deposited as DSM ACC2724, described in US-A 2009-217423 or W02006/128573);
Event CE44-69D (cotton, insect control, not deposited, described in US-A 2010-
0024077); Event CE44-69D (cotton, insect control, not deposited, described in
W02006/128571); Event CE46-02A (cotton, insect control, not deposited,
described in
W02006/128572); Event COT102 (cotton, insect control, not deposited, described
in
US-A 2006-130175 or W02004/039986); Event C0T202 (cotton, insect control, not
deposited, described in US-A 2007-067868 or W02005/054479); Event C0T203
(cotton,
insect control, not deposited, described in W02005/054480); ); Event DAS21606-
3 /
1606 (soybean, herbicide tolerance, deposited as PTA-11028, described in
W02012/033794), Event DA540278 (corn, herbicide tolerance, deposited as ATCC
PTA-10244, described in W02011/022469); Event DAS-44406-6 / pDAB8264.44.06.1
(soybean, herbicide tolerance, deposited as PTA-11336, described in
W02012/075426),
88

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
Event DAS-14536-7 /pDAB8291.45.36.2 (soybean, herbicide tolerance, deposited
as
PTA-11335, described in W02012/075429), Event DAS-59122-7 (corn, insectcontrol
-
herbicide tolerance, deposited as ATCC PTA 11384, described in US-A 2006-
070139);
Event DAS-59132 (corn, insect control - herbicide tolerance, not deposited,
described in
W02009/100188); Event DA568416 (soybean, herbicide tolerance, deposited as
ATCC
PTA-10442, described in W02011/066384 or W02011/066360); Event DP-098140-6
(corn, herbicide tolerance, deposited as ATCC PTA-8296, described in US-A 2009-

137395 orWO 08/112019); Event DP-305423-1 (soybean, quality trait, not
deposited,
described in US-A 2008-312082 or W02008/054747); Event DP-32138-1 (corn,
hybridization system, deposited as ATCC PTA-9158, described in US-A 2009-
0210970
or W02009/103049); Event DP-356043-5 (soybean, herbicide tolerance, deposited
as
ATCC PTA-8287, described in US-A 2010-0184079 or W02008/002872); EventEE-I
(brinj al, insect control, not deposited, described in WO 07/091277); Event
Fil 17 (corn,
herbicide tolerance, deposited as ATCC 209031, described in US-A 2006-059581
or WO
98/044140); Event FG72 (soybean, herbicide tolerance, deposited as PTA-11041,
described in W02011/063413), Event GA21 (corn, herbicide tolerance, deposited
as
ATCC 209033, described in US-A 2005-086719 or WO 98/044140); Event GG25 (corn,

herbicide tolerance, deposited as ATCC 209032, described in US-A 2005-188434
or
W098/044140); Event GHB119 (cotton, insect control - herbicide tolerance,
deposited as
ATCC PTA-8398, described in W02008/151780); Event GHB614 (cotton, herbicide
tolerance, deposited as ATCC PTA-6878, described in US-A 2010-050282 or
W02007/017186); Event GJ11 (corn, herbicide tolerance, deposited as ATCC
209030,
described in US-A 2005-188434 or W098/044140); Event GM RZ13 (sugar beet,
virus
resistance, deposited as NCIMB-41601, described in W02010/076212); Event H7-1
(sugar beet, herbicide tolerance, deposited as NCIMB 41158 or NCIMB 41159,
described in US-A 2004-172669 or WO 2004/074492); Event JOPLIN1 (wheat,
disease
tolerance, not deposited, described in US-A 2008-064032); Event LL27 (soybean,

herbicide tolerance, deposited as NCIMB41658, described in W02006/108674 or US-
A
2008-320616); Event LL55 (soybean, herbicide tolerance, deposited as NCIMB
41660,
described in WO 2006/108675 or US-A 2008-196127); Event LLcotton25 (cotton,
herbicide tolerance, deposited as ATCC PTA-3343, described in W02003/013224 or
US-
A 2003-097687); Event LLRICE06 (rice, herbicide tolerance, deposited as ATCC
203353, described in US 6,468,747 or W02000/026345); Event LLRice62 ( rice,
herbicide tolerance, deposited as ATCC 203352, described in W02000/026345),
Event
89

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
LLRICE601 (rice, herbicide tolerance, deposited as ATCC PTA-2600, described in
US-A
2008-2289060 or W02000/026356); Event LY038 (corn, quality trait, deposited as

ATCC PTA-5623, described in US-A 2007-028322 or W02005/061720); Event M1R162
(corn, insect control, deposited as PTA-8166, described in US-A 2009-300784 or
W02007/142840); Event MIR604 (corn, insect control, not deposited, described
in US-A
2008-167456 or W02005/103301); Event MON15985 (cotton, insect control,
deposited
as ATCC PTA-2516, described in US-A 2004-250317 or W02002/100163); Event
MON810 (corn, insect control, not deposited, described in US-A 2002-102582);
Event
M0N863 (corn, insect control, deposited as ATCC PTA-2605, described in
W02004/011601 or US-A 2006-095986); Event M0N87427 (corn, pollination control,
deposited as ATCC PTA-7899, described in W02011/062904); Event M0N87460 (corn,

stress tolerance, deposited as ATCC PTA-8910, described in W02009/111263 or US-
A
2011-0138504); Event M0N87701 (soybean, insect control, deposited as ATCC PTA-
8194, described in US-A 2009-130071 or W02009/064652); Event M0N87705
(soybean, quality trait - herbicide tolerance, deposited as ATCC PTA-9241,
described in
US-A 2010-0080887 or W02010/037016); Event M0N87708 (soybean, herbicide
tolerance, deposited as ATCC PTA-9670, described in W02011/034704); Event
M0N87712 (soybean, yield, deposited as PTA-10296, described in W02012/051199),

Event M0N87754 (soybean, quality trait, deposited as ATCC PTA-9385, described
in
W02010/024976); Event M0N87769 (soybean, quality trait, deposited as ATCC PTA-
8911, described in US-A 2011-0067141 or W02009/102873); Event M0N88017 (corn,
insect control - herbicide tolerance, deposited as ATCC PTA-5582, described in
US-A
2008-028482 or W02005/059103); Event M0N88913 (cotton, herbicide tolerance,
deposited as ATCC PTA-4854, described in W02004/072235 or US-A 2006-059590);
Event M0N88302 (oilseed rape, herbicide tolerance, deposited as PTA-10955,
described
in W02011/153186), Event MON88701 (cotton, herbicide tolerance, deposited as
PTA-
11754, described in W02012/134808), Event M0N89034 (corn, insect control,
deposited
as ATCC PTA-7455, described in WO 07/140256 or US-A 2008-260932); Event
M0N89788 (soybean, herbicide tolerance, deposited as ATCC PTA-6708, described
in
US-A 2006-282915 or W02006/130436); Event MS11 (oilseed rape, pollination
control -
herbicide tolerance, deposited as ATCC PTA-850 or PTA-2485, described in
W02001/031042); Event M58 (oilseed rape, pollination control - herbicide
tolerance,
deposited as ATCC PTA-730, described in W02001/041558 or US-A 2003-188347);
Event NK603 (corn, herbicide tolerance, deposited as ATCC PTA-2478, described
in

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
US-A 2007-292854); Event PE-7 (rice, insect control, not deposited, described
in
W02008/114282); Event RF3 (oilseed rape, pollination control - herbicide
tolerance,
deposited as ATCC PTA-730, described in W02001/041558 or US-A 2003-188347);
Event RT73 (oilseed rape, herbicide tolerance, not deposited, described in
.. W02002/036831 or US-A 2008-070260); Event SYHT0H2 / SYN-000H2-5 (soybean,
herbicide tolerance, deposited as PTA-11226, described in W02012/082548),
Event
T227-1 (sugar beet, herbicide tolerance, not deposited, described in
W02002/44407 or
US-A 2009-265817); Event T25 (corn, herbicide tolerance, not deposited,
described in
US-A 2001-029014 or W02001/051654); Event T304-40 (cotton, insect control -
herbicide tolerance, deposited as ATCC PTA-8171, described in US-A 2010-077501
or
W02008/122406); Event T342-142 (cotton, insect control, not deposited,
described in
W02006/128568); Event TC1507 (corn, insect control - herbicide tolerance, not
deposited, described in US-A 2005-039226 or W02004/099447); Event VIP1034
(corn,
insect control - herbicide tolerance, deposited as ATCC PTA-3925, described in
W02003/052073), Event 32316 (corn, insect control-herbicide tolerance,
deposited as
PTA-11507, described in W02011/084632), Event 4114 (corn, insect control-
herbicide
tolerance, deposited as PTA-11506, described in W02011/084621), event EE-GM3 /

FG72 (soybean, herbicide tolerance, ATCC Accession N PTA-11041) optionally
stacked with event EE-GM1/LL27 or event EE-GM2/LL55 (W0201 1/063413A2), event
DAS-68416-4 (soybean, herbicide tolerance, ATCC Accession N PTA-10442,
W0201 1/066360A1), event DAS-68416-4 (soybean, herbicide tolerance, ATCC
Accession N PTA-10442, W0201 1/066384A1), event DP-040416-8 (corn, insect
control, ATCC Accession N PTA-11508, W0201 1/075593A1), event DP-043A47-3
(corn, insect control, ATCC Accession N PTA-11509, W0201 1/075595A1), event
DP-
004114-3 (corn, insect control, ATCC Accession N PTA-11506, W0201
1/084621A1),
event DP-032316-8 (corn, insect control, ATCC Accession N PTA-11507,
W0201 1/084632A1), event MON-88302-9 (oilseed rape, herbicide tolerance, ATCC
Accession N PTA-10955, W02011/153186A1), event DAS-21606-3 (soybean,
herbicide tolerance, ATCC Accession No. PTA-11028, W02012/033794A2), event
MON-87712-4 (soybean, quality trait, ATCC Accession N . PTA-10296,
W02012/051199A2), event DAS-44406-6 (soybean, stacked herbicide tolerance,
ATCC
Accession N . PTA-11336, W02012/075426A1), event DAS-14536-7 (soybean, stacked

herbicide tolerance, ATCC Accession N . PTA-11335, W02012/075429A1), event
SYN-000H2-5 (soybean, herbicide tolerance, ATCC Accession N . PTA-11226,
91

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
W02012/082548A2), event DP-061061-7 (oilseed rape, herbicide tolerance, no
deposit
N available, W02012071039A1), event DP-073496-4 (oilseed rape, herbicide
tolerance,
no deposit N available, US2012131692), event 8264.44.06.1 (soybean, stacked
herbicide
tolerance, Accession N PTA-11336, W02012075426A2), event 8291.45.36.2
(soybean,
.. stacked herbicide tolerance, Accession N . PTA-11335, W02012075429A2),
event
SYHT0H2 (soybean, ATCC Accession N . PTA-11226, W02012/082548A2), event
MON88701 (cotton, ATCC Accession N PTA-11754, W02012/134808A1), event
KK179-2 (alfalfa, ATCC Accession N PTA-11833, W02013/003558A1), event
pDAB8264.42.32.1 (soybean, stacked herbicide tolerance, ATCC Accession N PTA-
11993, W02013/010094A1), event MZDTO9Y (corn, ATCC Accession N PTA-13025,
W02013/012775A1).
The genes/events (e.g., polynucleotides of interest), which impart the desired
traits in
question, may also be present in combinations with one another in the
transgenic plants.
Examples of transgenic plants which may be mentioned are the important crop
plants, such as
.. cereals (wheat, rice, triticale, barley, rye, oats), maize, soya beans,
potatoes, sugar beet, sugar
cane, tomatoes, peas and other types of vegetable, cotton, tobacco, oilseed
rape and also fruit
plants (with the fruits such as apples, pears, citrus fruits and grapes), with
particular emphasis
being given to maize, soya beans, wheat, rice, potatoes, cotton, sugar cane,
tobacco and
oilseed rape. Traits which are particularly emphasized are the increased
resistance of the
plants to insects, arachnids, nematodes and slugs and snails, as well as the
increased
resistance of the plants to one or more herbicides.
Commercially available examples of such plants, plant parts or plant seeds
that may
be treated with preference in accordance with the invention include commercial
products,
such as plant seeds, sold or distributed under the GENUITY , DROUGHTGARD ,
SMARTSTAX , RIB COMPLETE , ROUNDUP READY , VT DOUBLE PRO , VT
TRIPLE PRO , BOLLGARD II , ROUNDUP READY 2 YIELD , YIELDGARD ,
ROUNDUP READY 2 XTENDTm, INTACTA RR2 PRO , VISTIVE GOLD , and/or
XTENDFLEXTm trade names.
The invention will now be described with reference to the following examples.
It
should be appreciated that these examples are not intended to limit the scope
of the claims to
the invention but are rather intended to be exemplary of certain embodiments.
Any variations
in the exemplified methods that occur to the skilled artisan are intended to
fall within the
scope of the invention.
92

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
EXAMPLES
Example 1. Design of the genomic editing construct for 11B53 and 11B78
The genomic sequences of Zm00001d002754 (HB53) and Zm00001d029934 (HB78)
(Zea mays) were identified from a proprietary maize line. From this reference
line, spacer
sequences SEQ ID NO: 175-182 were identified. The editing constructs (pWISE444

targeting HB53 and pWISE451 targeting both HB53/HB78) contained a CRISPR-Cas
effector protein (Fig. 13, pWISE451 includes all the guides from illustrated
pWISE). The
CRISPR-Cas effector protein associates with a spacer nucleic acid that is
specific to the DNA
sequence of HB53/HB78 transcription factor of maize. A Cpfl cutting enzyme was
used to
create unique disruptions of the DNA binding domain of the gene targets.
Example 2. Transformation and selection of edited EO plants
Dried excised maize embryos were transformed using Agrobacterium to deliver
the
editing construct. Healthy non-chimeric plants (EO) were selected and plugged
in growth
.. trays. Genotyping of EO plants was performed to assess transgene copy and
editing efficacy.
Plants identified to be (1) healthy, non-chimeric and fertile, with (2) low
transgene copy and
(3) a disrupted DNA binding domain were advanced to the next generation. EO
plants that
satisfy all the above criteria were selfed to produce El generation.
For pWISE444, 110 EO plants were derived from a single transformation
experiment.
From this pool of EO plants, we identified plants possessing out-of-frame
deletions leading to
disruption of the HB53 DNA binding domain. Table 1 provides plant identifiers,
identified
deletions and the ultimately advanced El alleles. Fig. 10 shows the genotypes
of El
progenies derived from advanced EO parents.
Table 1. IIB53 EO Parent plants and identified alleles
EO parent EO alleles El allele (advanced)
CE-9775 Heterozygous (5bp and Homozygous 5bp deletion
7bp deletion)
CE-9787 Heterozygous (11bp and Homozygous llbp
14bp deletion) deletion
CE-9778 Het -WT (23bp deletion) Homozygous 23bp
deletion
93

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
For pWISE451, a single transformation experiment produced 44 EO events. Of
these,
a single E0 plant with the desired out-of-frame deletions was advanced. Fig.
11 and Fig. 12
show the nature of the edits for HB53 and HB78 El progenies, respectively.
In addition to pWISE444 and pWISE451, other transformation experiments
produced
desired edits on HB78. Fig. 9 shows the genomic sequence of a representative
edited plant. It
shows that the 10-bp out-of-frame deletion upstream the target DNA binding
domain resulted
in a premature stop codon and therefore loss of the DNA binding domain.
Example 3. Phenotypic assessment of trait activity
For each El family, 100 seeds were planted and screened. Plants identified as
being
(1) healthy, non-chimeric and fertile, (2) transgene free, (3) having a
disrupted DNA binding
domain were advanced to the next generation. Trait activity is generally
assessed at the E2
stage. To assess trait activity of the advanced desired edits, 10-day old
seedlings were
exposed to a simulated shade environment. A custom lighting array was built
from 12 CREE
6500k 36V COBs supplemented with over 330 2.25V CREE Far Red LEDs (720-740)
with a
Bluefish controller. Other similar lighting may be used to simulate a shade
environment.
Edited corn seedlings and control (wild-type and GUS controls) corn seedlings
were
grown in a growth tent with simulated shade environment in which they
experienced red to
far red wavelength ratio of 0.15 at 4001.tmol m-2 s-1PAR (photosynthetically
active
radiation). The edited corn seedlings and control corn seedlings were also
grown in a
separate growth tent where they experienced red to far-red ratio of 1.3 and
4001.tmol m-2 5-1
PAR (non-shade). Plant height was measured at the three height markers
(coleoptile, V1
sheath, V2 sheath) and compared between controls and edited plants in the
simulated shade
and non-shade environment. An editing event was considered trait-active when
edited plants
grown in simulated shade were 5%, 10%, 15% (or more) shorter than the control
plants
(wild-type).
Example 4. Modification of endogenous maize Homeodomain-leucine zipper (HD-
Zip)
transcription factor represses shade avoidance response.
Editing of HD-Zip transcription factors represses the exaggerated elongation
of the
stem in simulated shade condition. Homozygous edits of HD-Zip transcription
factors HB53
and HB53/HB78 demonstrate statistically significant repressed shade avoidance
response as
shown in the V2 stage (Fig. 14) but no significant off-types (Fig. 15). In
contrast, wild type
plants (01DKD2 CONTROL) and plants transformed with guide-free vectors (GUS
94

CA 03165291 2022-06-17
WO 2021/155084
PCT/US2021/015609
CONTROL) show statistically significant shade avoidance response. In addition
to edited
alleles generated from pWISE444 (HB53) and pWISE451 (HB53/HB78), homozygous
HB78
edit (Fig. 8 and Fig. 9) were also tested. Our results indicate trait activity
(i.e., no shade
avoidance response) in the El stage but no significant difference in plant
height between
unedited vs edited HB78 in the E2 stage. As the two tests (El vs E2 stage)
vary in growth
chamber placement, further testing is warranted.
Taken together, our results show that HD-Zip transcription factors are potent
regulators of shade avoidance response. Furthermore, we have demonstrated that
the editing
these transcription factors lead to repression of an evolutionarily conserved
exaggerated
response to shading.
The foregoing is illustrative of the present invention and is not to be
construed as
limiting thereof. The invention is defined by the following claims, with
equivalents of the
claims to be included therein.

Representative Drawing
A single figure which represents the drawing illustrating the invention.
Administrative Status

For a clearer understanding of the status of the application/patent presented on this page, the site Disclaimer , as well as the definitions for Patent , Administrative Status , Maintenance Fee  and Payment History  should be consulted.

Administrative Status

Title Date
Forecasted Issue Date Unavailable
(86) PCT Filing Date 2021-01-29
(87) PCT Publication Date 2021-08-05
(85) National Entry 2022-06-17
Examination Requested 2022-06-17

Abandonment History

There is no abandonment history.

Maintenance Fee

Last Payment of $100.00 was received on 2023-12-05


 Upcoming maintenance fee amounts

Description Date Amount
Next Payment if small entity fee 2025-01-29 $50.00
Next Payment if standard fee 2025-01-29 $125.00

Note : If the full payment has not been received on or before the date indicated, a further fee may be required which may be one of the following

  • the reinstatement fee;
  • the late payment fee; or
  • additional fee to reverse deemed expiry.

Patent fees are adjusted on the 1st of January every year. The amounts above are the current amounts if received by December 31 of the current year.
Please refer to the CIPO Patent Fees web page to see all current fee amounts.

Payment History

Fee Type Anniversary Year Due Date Amount Paid Paid Date
Application Fee 2022-06-17 $407.18 2022-06-17
Request for Examination 2025-01-29 $814.37 2022-06-17
Maintenance Fee - Application - New Act 2 2023-01-30 $100.00 2022-12-13
Maintenance Fee - Application - New Act 3 2024-01-29 $100.00 2023-12-05
Owners on Record

Note: Records showing the ownership history in alphabetical order.

Current Owners on Record
PAIRWISE PLANTS SERVICES, INC.
Past Owners on Record
None
Past Owners that do not appear in the "Owners on Record" listing will appear in other documentation within the application.
Documents

To view selected files, please enter reCAPTCHA code :



To view images, click a link in the Document Description column. To download the documents, select one or more checkboxes in the first column and then click the "Download Selected in PDF format (Zip Archive)" or the "Download Selected as Single PDF" button.

List of published and non-published patent-specific documents on the CPD .

If you have any difficulty accessing content, you can call the Client Service Centre at 1-866-997-1936 or send them an e-mail at CIPO Client Service Centre.


Document
Description 
Date
(yyyy-mm-dd) 
Number of pages   Size of Image (KB) 
Abstract 2022-06-17 1 133
Claims 2022-06-17 19 811
Drawings 2022-06-17 19 860
Description 2022-06-17 95 6,019
Representative Drawing 2022-06-17 1 185
Patent Cooperation Treaty (PCT) 2022-06-17 2 194
International Search Report 2022-06-17 5 132
National Entry Request 2022-06-17 7 235
Cover Page 2022-10-13 1 154
Examiner Requisition 2023-06-22 5 283
Amendment 2023-10-17 237 16,809
Description 2023-10-17 95 8,458
Claims 2023-10-17 19 1,162

Biological Sequence Listings

Choose a BSL submission then click the "Download BSL" button to download the file.

If you have any difficulty accessing content, you can call the Client Service Centre at 1-866-997-1936 or send them an e-mail at CIPO Client Service Centre.

Please note that files with extensions .pep and .seq that were created by CIPO as working files might be incomplete and are not to be considered official communication.

No BSL files available.