Patent 3060613 Summary

Third-party information liability

Some of the information on this Web page has been provided by external sources. The Government of Canada is not responsible for the accuracy, reliability or currency of the information supplied by external sources. Users wishing to rely upon this information should consult directly with the source of the information. Content provided by external sources is not subject to official languages, privacy and accessibility requirements.

Claims and Abstract availability

Any discrepancies in the text and image of the Claims and Abstract are due to differing posting times. Text of the Claims and Abstract are posted:

  • At the time the application is open to public inspection;
  • At the time of issue of the patent (grant).
(12) Patent: (11) CA 3060613
(54) English Title: SYSTEM AND METHOD FOR GENERATING ADVERSARIAL EXAMPLES
(54) French Title: SYSTEME ET METHODE POUR GENERER DES EXEMPLES CONTRADICTOIRES
Status: Granted
Bibliographic Data
(51) International Patent Classification (IPC):
  • G06N 20/00 (2019.01)
  • G06N 3/08 (2006.01)
(72) Inventors :
  • DIA, OUSMANE (Canada)
(73) Owners :
  • SERVICENOW CANADA INC. (Canada)
(71) Applicants :
  • ELEMENT AI INC. (Canada)
(74) Agent: BCF LLP
(74) Associate agent:
(45) Issued: 2023-08-15
(22) Filed Date: 2019-10-28
(41) Open to Public Inspection: 2021-04-28
Examination requested: 2019-10-28
Availability of licence: N/A
(25) Language of filing: English

Patent Cooperation Treaty (PCT): No

(30) Application Priority Data: None

Abstracts

English Abstract

Methods and systems for generating adversarial examples are disclosed. The method comprises accessing a set of inputs and generating an instance of a variable auto-encoder (VAE), the instance of the VAE encoding the set of inputs into latent representation elements associated with a latent space. The method further comprises applying a manifold learning routine on the instance of the VAE to establish a characterization of a manifold in the latent space and applying a perturbation routine to generate perturbed latent representation elements while constraining the perturbed latent representation elements to remain within the manifold. The method further comprises generating adversarial examples based on the perturbed latent representation elements and outputting the adversarial examples.
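At a high level, the abstract describes a pipeline that encodes inputs with a VAE, perturbs the resulting latent codes while keeping them on the learned manifold, and decodes the perturbed codes into adversarial examples. The following is a deliberately simplified sketch of that flow; the fixed linear "encoder"/"decoder" and the unconstrained `perturb` function are illustrative stand-ins, not the patent's trained networks or its constrained perturbation routine.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-ins for a trained VAE: fixed linear maps instead of learned
# encoder/decoder networks (an assumption for illustration only).
D, Z = 8, 2                       # input and latent dimensionality
W_enc = rng.normal(size=(D, Z))   # "encoder" weights
W_dec = np.linalg.pinv(W_enc)     # "decoder" approximately inverts them

def encode(x):
    return x @ W_enc              # inputs -> latent representation elements

def decode(z):
    return z @ W_dec              # latent elements -> reconstructed inputs

def perturb(z, eps=0.05):
    # Placeholder for the perturbation routine: add small noise to the
    # latent codes (the patent additionally constrains the perturbed
    # codes to remain within the learned manifold).
    return z + eps * rng.normal(size=z.shape)

inputs = rng.normal(size=(4, D))           # a set of inputs
latent = encode(inputs)                    # encode into the latent space
latent_adv = perturb(latent)               # perturbed latent elements
adversarial_examples = decode(latent_adv)  # output adversarial examples
```

The key design point the abstract emphasizes is that the perturbation happens in the latent space, not in the input space, so the decoded examples inherit the semantics captured by the latent representation.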


French Abstract

Il est décrit des méthodes et systèmes de génération d'exemples contradictoires. La méthode comprend l'accès à un ensemble d'entrées et la génération d'une instance d'un autocodeur variationnel, l'instance de l'autocodeur variationnel codant l'ensemble d'entrées en éléments de représentation latents associés à un espace latent. La méthode comprend également l'application d'une routine d'apprentissage manifold sur l'instance de l'autocodeur variationnel pour établir une caractérisation d'un manifold dans l'espace latent et l'application d'une routine de perturbation pour générer des éléments de représentation latents perturbés tout en restreignant les éléments de représentation latents perturbés pour qu'ils demeurent à l'intérieur du manifold. La méthode comprend également la génération d'exemples contradictoires d'après les éléments de représentation latents perturbés et la transmission des exemples contradictoires.

Claims

Note: Claims are shown in the official language in which they were submitted.


What is claimed is:
1. A computer-implemented method for generating adversarial examples, the method comprising:
accessing a set of inputs;
generating an instance of a variable auto-encoder (VAE), the VAE comprising an encoder and a decoder, the instance of the VAE encoding the set of inputs into latent representation elements associated with a latent space, the latent representation elements representing low geometric summaries establishing semantics associated with the set of inputs;
applying a manifold learning routine on the instance of the VAE to establish a characterization of a manifold in the latent space, the manifold being defined by the latent representation elements;
applying a perturbation routine to generate perturbed latent representation elements while constraining the perturbed latent representation elements to remain within the manifold;
generating adversarial examples based on the perturbed latent representation elements, the adversarial examples sharing common semantics associated with the set of inputs; and
outputting the adversarial examples;
wherein the manifold learning routine comprises a Stein Variational Gradient Descent (SVGD) routine to learn the manifold.
2. The method of claim 1, wherein the adversarial examples are generated so as to fool a classifier associated with a neural network.
16713712.1
43078/8
Date Recue/Date Received 2021-04-07

3. The method of claim 1, wherein the adversarial examples are generated so as to reinforce a classifier associated with a neural network.
4. The method of claim 1, wherein the applying the manifold learning routine and the applying the perturbation routine are executed as a unified routine.
5. The method of claim 1, wherein the manifold learning routine comprises a variational inference routine executing learning of the manifold while minimizing assumptions about a distribution of the latent representation elements.
6. The method of claim 1, wherein the perturbation routine comprises establishing a neighborhood of the latent representation elements which preserves semantics associated with the set of inputs.
7. The method of claim 1, wherein the generating adversarial examples is performed by the decoder and comprises operating an inversion routine to reconstruct the adversarial examples from the perturbed latent representation elements.
8. The method of claim 1, wherein the generating adversarial examples comprises optimizing a loss.
9. A computer-implemented method for generating adversarial examples, the method comprising:
accessing a set of inputs;
generating an instance of a variable auto-encoder (VAE), the VAE comprising an encoder and a decoder, the instance of the VAE encoding the set of inputs into latent representation elements associated with a latent space, the latent representation elements representing low geometric summaries establishing semantics associated with the set of inputs;
applying a manifold learning routine on the instance of the VAE to establish a characterization of a manifold in the latent space, the manifold being defined by the latent representation elements;
applying a perturbation routine to generate perturbed latent representation elements while constraining the perturbed latent representation elements to remain within the manifold;
generating adversarial examples based on the perturbed latent representation elements, the adversarial examples sharing common semantics associated with the set of inputs; and
outputting the adversarial examples;
wherein constraining the perturbed latent representation elements to remain within the manifold comprises applying a Gram-Schmidt Basis Sign Method (GBSM) routine.
10. The method of claim 9, wherein constraining the perturbed latent representation elements to remain within the manifold further comprises applying a manifold alignment routine.
11. The method of claim 9, wherein the generating adversarial examples is performed by the decoder and comprises operating an inversion routine to reconstruct the adversarial examples from the perturbed latent representation elements.
12. The method of claim 9, wherein the generating adversarial examples comprises optimizing a loss.
13. The method of claim 9, wherein the adversarial examples are generated so as to fool a classifier associated with a neural network.
14. The method of claim 9, wherein the adversarial examples are generated so as to reinforce a classifier associated with a neural network.
15. The method of claim 9, wherein the applying the manifold learning routine and the applying the perturbation routine are executed as a unified routine.
16. The method of claim 9, wherein the manifold learning routine comprises a variational inference routine executing learning of the manifold while minimizing assumptions about a distribution of the latent representation elements.
17. The method of claim 9, wherein the perturbation routine comprises establishing a neighborhood of the latent representation elements which preserves semantics associated with the set of inputs.
18. A system for generating adversarial examples, the system comprising:
at least one processor, and
memory storing a plurality of executable instructions which, when executed by the at least one processor, cause the system to:
access a set of inputs;
generate an instance of a variable auto-encoder (VAE), the VAE comprising an encoder and a decoder, the instance of the VAE encoding the set of inputs into latent representation elements associated with a latent space, the latent representation elements representing low geometric summaries establishing semantics associated with the set of inputs;
apply a manifold learning routine on the instance of the VAE to establish a characterization of a manifold in the latent space, the manifold being defined by the latent representation elements;
apply a perturbation routine to generate perturbed latent representation elements while constraining the perturbed latent representation elements to remain within the manifold;
generate adversarial examples based on the perturbed latent representation elements, the adversarial examples sharing common semantics associated with the set of inputs; and
output the adversarial examples;
wherein the manifold learning routine comprises a Stein Variational Gradient Descent (SVGD) routine to learn the manifold.
19. The system of claim 18, wherein the adversarial examples are generated so as to fool a classifier associated with a neural network.
20. The system of claim 18, wherein the adversarial examples are generated so as to reinforce a classifier associated with a neural network.
21. The system of claim 18, wherein the applying the manifold learning routine and the applying the perturbation routine are executed as a unified routine.
22. The system of claim 18, wherein the manifold learning routine comprises a variational inference routine executing learning of the manifold while minimizing assumptions about a distribution of the latent representation elements.
23. The system of claim 18, wherein the perturbation routine comprises establishing a neighborhood of the latent representation elements which preserves semantics associated with the set of inputs.
24. A system for generating adversarial examples, the system comprising:
at least one processor, and
memory storing a plurality of executable instructions which, when executed by the at least one processor, cause the system to:
access a set of inputs;
generate an instance of a variable auto-encoder (VAE), the VAE comprising an encoder and a decoder, the instance of the VAE encoding the set of inputs into latent representation elements associated with a latent space, the latent representation elements representing low geometric summaries establishing semantics associated with the set of inputs;
apply a manifold learning routine on the instance of the VAE to establish a characterization of a manifold in the latent space, the manifold being defined by the latent representation elements;
apply a perturbation routine to generate perturbed latent representation elements while constraining the perturbed latent representation elements to remain within the manifold;
generate adversarial examples based on the perturbed latent representation elements, the adversarial examples sharing common semantics associated with the set of inputs; and
output the adversarial examples;
wherein constraining the perturbed latent representation elements to remain within the manifold comprises applying a Gram-Schmidt Basis Sign Method (GBSM) routine.
25. The system of claim 24, wherein constraining the perturbed latent representation elements to remain within the manifold further comprises applying a manifold alignment routine.
26. The system of claim 24, wherein the adversarial examples are generated so as to fool a classifier associated with a neural network.
27. The system of claim 24, wherein the adversarial examples are generated so as to reinforce a classifier associated with a neural network.
28. The system of claim 24, wherein the applying the manifold learning routine and the applying the perturbation routine are executed as a unified routine.
29. The system of claim 24, wherein the manifold learning routine comprises a variational inference routine executing learning of the manifold while minimizing assumptions about a distribution of the latent representation elements.
30. The system of claim 24, wherein the perturbation routine comprises establishing a neighborhood of the latent representation elements which preserves semantics associated with the set of inputs.

Description

Note: Descriptions are shown in the official language in which they were submitted.


SYSTEM AND METHOD FOR GENERATING ADVERSARIAL EXAMPLES
FIELD
[01] The present technology relates to systems and methods for generating adversarial examples. In particular, the present technology relates to systems and methods for generating adversarial examples that may be used in the context of training machine learning algorithms (MLAs).
BACKGROUND
[02] MLA techniques typically involve developing models and algorithms that
can learn
from and make predictions based on input data. MLA include deep learning
approaches
which typically involve artificial neural networks with multiple layers, often
referred to as
deep neural networks (DNNs). DNNs comprise multiple layers of artificial
neurons, which
may be implemented as non-linear processing units, in which each successive
layer uses the
output of the previous layer as an input.
[03] Recent developments in adversarial machine learning have raised concerns about the robustness of DNNs due to their vulnerability to attacks relying on the presence of subliminal signals in the input data, which may cause serious harm by influencing the DNNs' cognition functions. Such subliminal signals may take the form of low-intensity perturbations to input data (e.g., noise introduced to an image or permutations between words of a text) which may be unnoticeable to a human. Such subliminal signals may cause DNNs to misclassify their inputs and/or learn the wrong things from the environment and, as a result, make incorrect predictions. Input data perturbed by subliminal signals are typically referred to as adversarial examples.
[04] One of the current approaches to improving the resilience of DNNs to adversarial attacks involves synthetically generating adversarial examples. However, in order to provide any benefits, such synthetic adversarial examples need to be coherent, i.e., semantically coherent so as to convey the meaning of true input data. As a first example, in the image domain, adversarial examples need to be generated so as to be identical to real images and preserve the location of fine details of the image. As a second example, in the text domain, adversarial examples need to be generated so as to be grammatically and linguistically correct. Current
13940518.1
43078/8
CA 3060613 2019-10-28

methods of generating such synthetic adversarial examples present limitations, as they usually fail at generating coherent adversarial examples.
[05] There is therefore a need for methods and systems for generating adversarial examples for MLAs which are coherent and/or capable of evading defenses traditionally known to be resilient to adversarial attacks.
SUMMARY
[06] The present technology is directed to systems and methods for generating adversarial examples that may be deemed to be coherent and/or semantically meaningful. The adversarial examples generated in accordance with the present technology may be suitable for being used as part of a training routine of an MLA, such as, but without being limitative, a training routine of a neural network.
[07] In accordance with some broad aspects of the present technology, the generating of such adversarial examples may allow fooling a classifier associated with a neural network, for example, in the context of "black-box attacks" wherein an attacker only has access to the predictions of the classifier g and wants to construct adversarial examples without knowing the intricacies of g such as, for example but without being limitative, a loss function or a gradient. The adversarial examples generated in accordance with the present technology may allow evading defenses traditionally known to be resilient to adversarial attacks.
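A black-box attacker of this kind can only query g for its predicted labels. The following minimal sketch illustrates that setting; the toy classifier and the random-candidate search are illustrative assumptions only (the present technology instead produces candidates through perturbations in a VAE's latent space).

```python
import numpy as np

def black_box_attack(x, predict, candidates, true_label):
    # With only query access to the classifier's predictions (no loss,
    # no gradients), score candidate perturbed inputs and return the
    # first one that changes the predicted label.
    for cand in candidates:
        if predict(cand) != true_label:
            return cand
    return x  # no candidate fooled the classifier

# Toy classifier standing in for an unknown g: sign of the first feature.
predict = lambda x: int(x[0] > 0)

rng = np.random.default_rng(3)
x = np.array([0.2, 1.0])
candidates = [x + rng.normal(scale=0.3, size=2) for _ in range(50)]
x_adv = black_box_attack(x, predict, candidates, true_label=predict(x))
```

The attacker never inspects g's internals; it only compares returned labels, which is exactly the constraint the black-box setting imposes.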
[08] In accordance with some other broad aspects of the present technology, the generating of the adversarial examples may allow reinforcement of a classifier by augmenting a set of training inputs with the generated adversarial examples. This results in a more robust training of the neural networks generated from the augmented set of training inputs.
[09] In one broad aspect, there is provided a method for generating adversarial examples, the method comprising:
accessing a set of inputs;
generating an instance of a variable auto-encoder (VAE), the VAE comprising an encoder and a decoder, the instance of the VAE encoding the set of inputs into latent representation elements associated with a latent space, the latent representation elements representing low geometric summaries establishing semantics associated with the set of inputs;
applying a manifold learning routine on the instance of the VAE to establish a characterization of a manifold in the latent space, the manifold being defined by the latent representation elements;
applying a perturbation routine to generate perturbed latent representation elements while constraining the perturbed latent representation elements to remain within the manifold;
generating adversarial examples based on the perturbed latent representation elements, the adversarial examples sharing common semantics associated with the set of inputs; and
outputting the adversarial examples.
[10] In another aspect, the adversarial examples are generated so as to fool a classifier associated with a neural network.
[11] In another aspect, the adversarial examples are generated so as to reinforce a classifier associated with a neural network.
[12] In another aspect, the applying the manifold learning routine and the applying the perturbation routine are executed as a unified routine.
[13] In another aspect, the manifold learning routine comprises a variational inference routine executing learning of the manifold while minimizing assumptions about a distribution of the latent representation elements.
[14] In another aspect, the manifold learning routine comprises a Stein Variational Gradient Descent (SVGD) routine to learn the manifold.
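As a rough illustration of how an SVGD routine operates (the RBF kernel, step size, and target density below are assumptions for the example, not the patent's specific choices), each latent "particle" is moved by a kernel-weighted average of the particles' log-density gradients plus a repulsive gradient-of-kernel term that keeps the particles spread over the distribution's support:

```python
import numpy as np

def svgd_step(z, grad_logp, h=1.0, step=0.1):
    # One SVGD update. grad_logp holds the gradient of the log target
    # density at each particle; the kernel term pulls particles toward
    # high-density regions while the grad-kernel term pushes them apart.
    diff = z[:, None, :] - z[None, :, :]             # (n, n, d)
    k = np.exp(-(diff ** 2).sum(-1) / (2 * h ** 2))  # RBF kernel matrix
    grad_k = -diff / h ** 2 * k[:, :, None]          # d k(z_i, z_j) / d z_i
    phi = (k @ grad_logp + grad_k.sum(axis=0)) / z.shape[0]
    return z + step * phi

# Example target: a standard normal, for which grad log p(z) = -z.
rng = np.random.default_rng(1)
z0 = rng.normal(loc=5.0, size=(50, 2))  # particles start far from the mode
z = z0.copy()
for _ in range(200):
    z = svgd_step(z, grad_logp=-z)       # particles drift toward the mode
```

Because the updates need only gradients of the log density at the particle locations, SVGD makes few assumptions about the form of the latent distribution, which is consistent with the variational inference aspect recited above.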
[15] In another aspect, the perturbation routine comprises establishing a neighborhood of the latent representation elements which preserves semantics associated with the set of inputs.
[16] In another aspect, constraining the perturbed latent representation elements to remain within the manifold comprises applying a Gram-Schmidt Basis Sign Method (GBSM) routine.
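The GBSM routine itself is defined in the detailed description; reading only the name, its ingredients suggest orthonormalizing a set of latent directions via Gram-Schmidt and stepping along the signed coordinates of that basis, in the spirit of sign-based gradient methods. The sketch below should be taken as a speculative illustration of those two ingredients, not as the patented algorithm; the `directions` input is a hypothetical stand-in for the manifold's spanning directions.

```python
import numpy as np

def gram_schmidt(vectors):
    # Orthonormalize a set of row vectors with classical Gram-Schmidt.
    basis = []
    for v in vectors:
        for b in basis:
            v = v - (v @ b) * b
        norm = np.linalg.norm(v)
        if norm > 1e-12:
            basis.append(v / norm)
    return np.array(basis)

def gbsm_perturb(z, directions, eps=0.1):
    # Speculative reading of a "basis sign" step: express the latent
    # codes in an orthonormal basis of the manifold's directions and
    # move eps along the sign of each coordinate, so the perturbation
    # stays within the span of that basis.
    basis = gram_schmidt(directions)       # (k, d) orthonormal rows
    coords = z @ basis.T                   # coordinates of z in the basis
    step = eps * np.sign(coords) @ basis   # signed step inside the span
    return z + step

rng = np.random.default_rng(2)
z = rng.normal(size=(4, 3))                # latent representation elements
directions = rng.normal(size=(2, 3))       # assumed spanning directions
z_adv = gbsm_perturb(z, directions)
```

Confining the step to the span of an orthonormal basis is one way to realize the "remain within the manifold" constraint when the manifold is locally characterized by a set of directions.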
[17] In another aspect, constraining the perturbed latent representation elements to remain within the manifold further comprises applying a manifold alignment routine.
[18] In another aspect, the generating adversarial examples is performed by the decoder and comprises operating an inversion routine to reconstruct the adversarial examples from the perturbed latent representation elements.
[19] In another aspect, the generating adversarial examples comprises optimizing a loss.
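One common shape for such a loss (a generic formulation assumed here for illustration, not necessarily the patent's) trades off crossing the classifier's decision boundary against staying close to the original latent code, which is what preserves the input's semantics:

```python
import numpy as np

def adversarial_loss(z_pert, z, decode, margin, lam=0.1):
    # Generic adversarial objective (an assumed formulation): the margin
    # term drops as the decoded example crosses the decision boundary,
    # while the proximity term penalizes drifting far from the original
    # latent code.
    return margin(decode(z_pert)) + lam * np.sum((z_pert - z) ** 2)

# Toy setting: identity decoder; margin = signed distance to the
# decision boundary x0 = 0.
decode = lambda z: z
margin = lambda x: x[0]

z = np.array([0.5, 0.0])        # original latent code
z_pert = np.array([-0.1, 0.0])  # decoded example crosses the boundary
loss_orig = adversarial_loss(z, z, decode, margin)
loss_pert = adversarial_loss(z_pert, z, decode, margin)
```

Minimizing such a loss over the perturbed latent code drives the decoder's output across the classifier's boundary while keeping the perturbation small in the latent space.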
[20] In other aspects, various implementations of the present technology provide a non-transitory computer-readable medium storing program instructions for executing one or more methods described herein, the program instructions being executable by a processor of a computer-based system.
[21] In other aspects, various implementations of the present technology provide a computer-based system, such as, for example, but without being limitative, an electronic device comprising at least one processor and a memory storing program instructions for executing one or more methods described herein, the program instructions being executable by the at least one processor of the electronic device.
[22] In the context of the present specification, unless expressly provided otherwise, a computer system may refer, but is not limited to, an "electronic device," a "computing device," an "operation system," a "system," a "computer-based system," a "computer system," a "network system," a "network device," a "controller unit," a "monitoring device," a "control device," a "server," and/or any combination thereof appropriate to the relevant task at hand.
[23] In the context of the present specification, unless expressly provided otherwise, the expressions "computer-readable medium" and "memory" are intended to include media of any nature and kind whatsoever, non-limiting examples of which include RAM, ROM, disks (e.g., CD-ROMs, DVDs, floppy disks, hard disk drives, etc.), USB keys, flash memory cards, solid-state drives, and tape drives. Still in the context of the present specification, "a" computer-readable medium and "the" computer-readable medium should not be construed as being the same computer-readable medium. To the contrary, and whenever appropriate, "a" computer-readable medium and "the" computer-readable medium may also be construed as a first computer-readable medium and a second computer-readable medium.
[24] In the context of the present specification, unless expressly provided otherwise, the words "first", "second", "third", etc. have been used as adjectives only for the purpose of allowing for distinction between the nouns that they modify from one another, and not for the purpose of describing any particular relationship between those nouns.
[25] Additional and/or alternative features, aspects and advantages of implementations of the present technology will become apparent from the following description, the accompanying drawings, and the appended claims.
BRIEF DESCRIPTION OF THE DRAWINGS
[26] For a better understanding of the present technology, as well as other aspects and further features thereof, reference is made to the following description which is to be used in conjunction with the accompanying drawings, where:
[27] FIG. 1 is a block diagram of an example computing environment in accordance with embodiments of the present technology;
[28] FIGS. 2 and 3 are schematic illustrations of an operating environment which may be used to generate adversarial examples in accordance with embodiments of the present technology;
[29] FIGS. 4 and 5 are illustrations of steps performed by a manifold learning module and a perturbation module in accordance with embodiments of the present technology;
[30] FIGS. 6 to 8 are illustrations of examples of experiments based on embodiments of the present technology; and
[31] FIG. 9 is a schematic illustration of a computer-implemented method used in connection with generating adversarial examples in accordance with embodiments of the present technology.
DETAILED DESCRIPTION
[32] The examples and conditional language recited herein are principally intended to aid the reader in understanding the principles of the present technology and not to limit its scope to such specifically recited examples and conditions. It will be appreciated that those skilled in the art may devise various arrangements which, although not explicitly described or shown herein, nonetheless embody the principles of the present technology and are included within its spirit and scope.
[33] Furthermore, as an aid to understanding, the following description may describe relatively simplified implementations of the present technology. As persons skilled in the art would understand, various implementations of the present technology may be of greater complexity.
[34] In some cases, what are believed to be helpful examples of modifications to the present technology may also be set forth. This is done merely as an aid to understanding, and, again, not to define the scope or set forth the bounds of the present technology. These modifications are not an exhaustive list, and a person skilled in the art may make other modifications while nonetheless remaining within the scope of the present technology. Further, where no examples of modifications have been set forth, it should not be interpreted that no modifications are possible and/or that what is described is the sole manner of implementing that element of the present technology.
[35] Moreover, all statements herein reciting principles, aspects, and implementations of the present technology, as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof, whether they are currently known or developed in the future. Thus, for example, it will be appreciated by those skilled in the art that any block diagrams herein represent conceptual views of illustrative circuitry embodying the principles of the present technology. Similarly, it will be appreciated that any flowcharts, flow diagrams, state transition diagrams, pseudo-code, and the like represent various processes which may be substantially represented in computer-readable media and so executed by a computer or processor, whether or not such computer or processor is explicitly shown.
[36] The functions of the various elements shown in the figures, including any functional block labeled as a "processor", may be provided through the use of dedicated hardware as well as hardware capable of executing software in association with appropriate software. When provided by a processor, the functions may be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which may be shared. In some embodiments of the present technology, the processor may be a general-purpose processor, such as a central processing unit (CPU), or a processor dedicated to a specific purpose, such as a digital signal processor (DSP). Moreover, explicit use of the term "processor" should not be construed to refer exclusively to hardware capable of executing software, and may implicitly include, without limitation, application-specific integrated circuits (ASICs), field-programmable gate arrays (FPGAs), read-only memory (ROM) for storing software, random access memory (RAM), and non-volatile storage. Other hardware, conventional and/or custom, may also be included.
[37] Software modules, or simply modules which are implied to be software, may be represented herein as any combination of flowchart elements or other elements indicating performance of process steps and/or textual description. Such modules may be executed by hardware that is expressly or implicitly shown. Moreover, it should be understood that one or more modules may include, for example, but without being limitative, computer program logic, computer program instructions, software, stack, firmware, hardware circuitry, or a combination thereof which provides the required capabilities.
[38] With these fundamentals in place, we will now consider some non-limiting examples to illustrate various implementations of aspects of the present technology.
[39] FIG. 1 illustrates a computing environment in accordance with an embodiment of the present technology, shown generally as 100. In some embodiments, the computing environment 100 may be implemented by any of a conventional personal computer, a computer dedicated to managing network resources, a network device and/or an electronic device (such as, but not limited to, a mobile device, a tablet device, a server, a controller unit, a control device, etc.), and/or any combination thereof appropriate to the relevant task at hand. In some embodiments, the computing environment 100 comprises various hardware components including one or more single or multi-core processors collectively represented by processor 110, a solid-state drive 120, a random access memory 130, and an input/output interface 150. The computing environment 100 may be a computer specifically designed to operate a machine learning algorithm (MLA). The computing environment 100 may be a generic computer system.
[40] In some embodiments, the computing environment 100 may also be a subsystem of one of the above-listed systems. In some other embodiments, the computing environment 100 may be an "off-the-shelf" generic computer system. In some embodiments, the computing environment 100 may also be distributed amongst multiple systems. The computing environment 100 may also be specifically dedicated to the implementation of the present technology. As a person skilled in the art of the present technology may appreciate, multiple variations as to how the computing environment 100 is implemented may be envisioned without departing from the scope of the present technology.
[41] Those skilled in the art will appreciate that processor 110 is generally representative of a processing capability. In some embodiments, in place of one or more conventional Central Processing Units (CPUs), one or more specialized processing cores may be provided. For example, one or more Graphic Processing Units (GPUs), Tensor Processing Units (TPUs), and/or other so-called accelerated processors (or processing accelerators) may be provided in addition to or in place of one or more CPUs.
[42] System memory will typically include random access memory 130, but is more generally intended to encompass any type of non-transitory system memory such as static random access memory (SRAM), dynamic random access memory (DRAM), synchronous DRAM (SDRAM), read-only memory (ROM), or a combination thereof. Solid-state drive 120 is shown as an example of a mass storage device, but more generally such mass storage may comprise any type of non-transitory storage device configured to store data, programs, and other information, and to make the data, programs, and other information accessible via a system bus 160. For example, mass storage may comprise one or more of a solid-state drive, hard disk drive, a magnetic disk drive, and/or an optical disk drive.
[43] Communication between the various components of the computing environment 100 may be enabled by a system bus 160 comprising one or more internal and/or external buses (e.g., a PCI bus, universal serial bus, IEEE 1394 "Firewire" bus, SCSI bus, Serial-ATA bus, ARINC bus, etc.), to which the various hardware components are electronically coupled.
[44] The input/output interface 150 may enable networking capabilities such as wired or wireless access. As an example, the input/output interface 150 may comprise a networking interface such as, but not limited to, a network port, a network socket, a network interface controller and the like. Multiple examples of how the networking interface may be implemented will become apparent to the person skilled in the art of the present technology. For example, the networking interface may implement specific physical layer and data link layer standards such as Ethernet, Fibre Channel, Wi-Fi, Token Ring or serial communication protocols. The specific physical layer and the data link layer may provide a base for a full
13940518.1
43078/8
CA 3060613 2019-10-28

network protocol stack, allowing communication among small groups of computers
on the
same local area network (LAN) and large-scale network communications through
routable
protocols, such as Internet Protocol (IP).
[45] According to some implementations of the present technology, the solid-
state drive
120 stores program instructions suitable for being loaded into the random
access memory 130
and executed by the processor 110 for executing acts of one or more methods
described
herein. For example, at least some of the program instructions may be part of
a library or an
application.
[46] FIG. 2 is a schematic illustration of an operating environment 200 which
may be used
to generate adversarial examples which may also be referred to as "adversarial
input data" or
"adversarial training data". In some embodiments, adversarial examples may be
defined as
inputs to a neural network which comprise one or more subliminal signals. In
some
embodiments, the one or more subliminal signals may take the form of low-
intensity
perturbations to input data (e.g., noise introduced to an image or
permutations between words
of a text). In some embodiments, the low-intensity perturbations to the input
data may be
unnoticeable to a human. In some embodiments, the term "adversarial examples"
may also
refer to inputs that are not maliciously crafted, but represent collections of
inputs that, due to
unexpected factors (e.g., lighting, background noise), cause the neural
network to make
incorrect predictions. Even though reference is made to the training of neural
networks
throughout the present disclosure, this aspect is not limitative. It should be
understood that
other techniques of machine learning and/or other types of machine learning
algorithms
(MLAs) are also envisioned without departing from the scope of the present
technology.
[47] In some embodiments, the operating environment 200 is executed on a
computing
environment which may be similar to the computing environment 100. This aspect
is not
limitative and many variations of computing environments may be envisioned
without
departing from the scope of the present technology. In addition to generating
adversarial
examples, the operating environment 200 may also be used to train a neural
network based on
the generated adversarial examples. In some embodiments, the trained neural
network may be
a deep neural network (DNN). In the illustrated embodiments, the operating
environment 200
comprises a DNN training system 220. The DNN training system 220 may comprise
a
training set manager 222, a DNN trainer 224 and an adversarial example
generator 226. The
DNN training system 220 may receive input 210 and produce a trained DNN 230
based on
the input 210.
[48] In some embodiments, the input 210 may be accessed from a database of
labeled
training data points which are suitable to be used to train an MLA. Each
labeled training data
point in the database of labeled training data points may include an input and
one or more
labels corresponding to the input. For example, if the input is a picture of a
cat or a dog, and
the goal of the MLA is to predict whether the image is of a cat or a dog, the
label
corresponding to the input would indicate whether the input is a picture of a
cat or whether it
is a picture of a dog. The labels may be applied by humans. In the example
where the input is
a picture of a cat or a dog, a human may be provided the picture and asked to
select either
"cat" or "dog." The labels may be otherwise determined, such as from measured
data. In
alternative embodiments, the input 210 may be accessed from a database of unlabeled training data points, such as in a context wherein the DNN training system 220 is performing unsupervised learning.
[49] In accordance with some embodiments, components of the DNN training
system 220
may interoperate to produce the trained DNN 230. For example but without being
limitative,
creating the trained DNN 230 may comprise generating one or more adversarial
examples to
attack a deep learning algorithm used for image classification (e.g., in the context of object recognition, etc.) or natural language processing (e.g., in the context of automatic translation or text generation, etc.). Even though the previous examples refer to image classification or natural language processing, it should be understood that this aspect is not limitative and other fields of application may be envisioned without departing from the scope of the present technology.
[50] As a non-limiting example, the training set manager 222 may determine a
first
training set based on the input 210, the DNN trainer 224 may then cause a
first DNN to be
trained to classify examples using the first training set. In some such
examples, the adversarial examples generator 226 may generate one or more adversarial examples that are
misclassified
by the first DNN.
[51] In some embodiments, the one or more adversarial examples that are
misclassified by
the first DNN may then be provided to the training set manager 222 so as to be
included in a
second training set. In some non-limiting embodiments, the process of
generating adversarial
examples, determining an updated training set and training a DNN with the updated training set may be repeated until an iteration parameter is satisfied. For instance, the input 210 may include a numeric iteration parameter of five (5). In such instances, this may correspond to the DNN training system 220 performing five iterations of generating adversarial examples, determining an updated training set and training a DNN with the updated training set. In such embodiments, the DNN trained with a training set that includes the last round of adversarial examples generated may be output as the trained DNN 230. In some alternative embodiments, the process of generating adversarial examples, determining an updated training set and training a DNN with the updated training set may be repeated until a metric associated with the DNN is met. As an example, but without being limitative, such a metric may be associated with an accuracy of a classifier associated with the DNN (e.g., by measuring a loss of the DNN). In accordance with some embodiments, the DNN may output various types of output, including, but not limited to, a classification of the input 210.
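The iterative scheme of paragraphs [50]-[51] can be sketched as follows. This is a minimal illustration only: `train_dnn` and `generate_adversarial` are hypothetical placeholder callables, not components disclosed by the present technology.

```python
# Hedged sketch of the iterate-train-perturb loop of paragraphs [50]-[51].
# `train_dnn` and `generate_adversarial` are hypothetical placeholders.

def adversarial_training_loop(training_set, train_dnn, generate_adversarial,
                              n_iterations=5):
    """Train a DNN, generate adversarial examples that fool the current
    model, fold them into the training set, and retrain; repeat until
    the iteration parameter is satisfied."""
    dnn = train_dnn(training_set)
    for _ in range(n_iterations):
        adversarial = generate_adversarial(dnn, training_set)
        training_set = training_set + adversarial  # updated training set
        dnn = train_dnn(training_set)              # retrain on augmented set
    return dnn
```

With an iteration parameter of five, the loop would retrain the model five times, each time on a training set augmented with the latest adversarial examples.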
[52] Turning now to FIG. 3, an exemplary embodiment of the adversarial example generator 226 is depicted. The adversarial example generator 226 operates so as to produce coherent, i.e., semantically meaningful, adversarial examples. As a first example, in the context of image processing, coherent adversarial examples means adversarial examples that are identical to real images and preserve the location of fine details of the image. As a second example, in the context of text or language processing, coherent adversarial examples means adversarial examples that are grammatically and linguistically correct.
[53] In the illustrated embodiments, the adversarial examples generator 226 comprises a manifold learning module 302 and a perturbation module 304 operated so that a first manifold associated with the input and a second manifold associated with the adversarial examples are aligned, thereby allowing the adversarial examples to properly reflect semantics of the input.
[54] Broadly speaking, the manifold learning module 302 executes a manifold learning routine implementing a variational inference method to encode high-dimensional data into a low-dimensional dense representation while avoiding reparametrizing an encoder network. The
The
perturbation module 304 executes a perturbation routine implementing the
manifold
invariance concept by establishing a neighborhood of the learned manifold
elements which
preserves the semantics associated with the inputs. Exemplary steps that may
be performed
by the manifold learning module 302 and the perturbation module 304 are
described in
connection with the below description of FIG. 4, 5 and 9.
Exemplary framework
[55] In some embodiments, the adversarial examples generator 226 implements a
framework 400 similar to the framework illustrated in FIG. 4. Instances θ_m, wherein m ∈ {1, ..., M}, of an encoder E are represented. Instances θ_m define parameters of the encoder E. Similarly, instances θ'_m, wherein m ∈ {1, ..., M}, of an encoder E' are represented. Instances θ'_m define parameters of the encoder E'. In some embodiments, instances θ_m are based on a recognition model which takes as input random vectors and the parametrisation η. In some embodiments, the parametrisation η is chosen in such a way that the instance θ_m encodes inputs x into latent variables z. In some embodiments, instances θ'_m are based on a recognition model which takes as input random vectors and the parametrisation η'. A manifold M is composed of latent codes z (also equally referred to as latent representations z). In the illustrated embodiment, model instances θ_m and θ'_m are generated from recognition networks f_η(·) and f_η'(·) where η and η' are the learnt parameters. Model instances θ_m and θ'_m are used to sample z_m and z'_m given an input x ∈ D, for any m ∈ {1, ..., M}. In some embodiments, x' is generated from posterior sampling of a z' via Bayesian ensembling. In some embodiments, z' may be passed to a decoder p_φ to generate x'. It should be noted that throughout this disclosure reference is made to both instances and particles. It should be understood that both terms may be used interchangeably.
[56] In some embodiments, the framework 400 may be implemented via a
variational auto-
encoder (VAE). In some embodiments, the VAE comprises an encoder and a
decoder.
Implementation details of the encoder and of the decoder will become apparent
to the person
skilled in the art of the present technology. Inputs may comprise a dataset of
training
examples (such as the input 210) which may be represented as D. Classes associated with the dataset D may be represented as Y. A black-box classifier may be represented as g. The
black-box classifier typically refers to a classifier for which
parametrization is unknown,
typically in the context of "black-box attacks" wherein an attacker has only
access to the
predictions of the classifier g and wants to construct adversarial examples
not knowing the
intricacies of g such as, for example but without being limitative, a loss
function or a
gradient. It should be noted that the present technology may also be
applicable to "white-box
attacks" wherein an attacker may know at least some of the intricacies of a
classifier g.
[57] In some embodiments, the VAE learns to approximate a variational posterior over D.
Such learning may be implemented by a first inference routine and a second
inference
routine. The first inference routine may comprise a Stein Variational Gradient
Descent
(SVGD) routine to establish a topological structure of the manifold M. In some
embodiments,
the SVGD routine may be implemented in accordance with the technology
described in Liu
& Wang (see "Qiang Liu and Dilin Wang, Stein variational gradient descent: A
general
purpose Bayesian inference algorithm. Neural Information Processing Systems
(NIPS),
The second inference routine may comprise a Gram-Schmidt basis sign routine whose implementation details will become apparent to the person skilled in the art of the
present technology. The learning allows drawing instances of model parameters from the implicit distributions p(θ) and p(θ') based on which the two encoders E and E' are parameterized. In such embodiments, the two encoders E and E' optimize
uncertainty
inherent to embedding D while easing sampling via Bayesian ensembling.
[58] In some embodiments, the decoder p_φ acts as a generative model for generating adversarial examples and as a proxy for creating latent targets in the Z space in order to optimize the encoder E. To do so, the decoder p_φ may implement an inversion routine 500, such as depicted at FIG. 5. During an inner update of the inversion routine 500, a Gaussian noise is fed to f_η to generate θ. Given x ∈ D, a sampling of z ∼ p(z|x; θ) is conducted to reconstruct x̂. Then, using x̂, a sampling of ẑ ∼ p(z|x̂; θ) is conducted. At this step, ẑ becomes the target of z. At the next update, ẑ becomes the prediction and the process is repeated to create a target for ẑ. In some embodiments, the inversion routine 500 may be implemented in accordance with the following algorithm:
Algorithm 1 Inversion with one particle θ.
Require: Input x ∈ D
Require: Network f_η
1: Sample ξ ∼ N(0, I)
2: Sample θ ∼ f_η(ξ)
3: Given x, sample z ∼ p(z|x; θ)
4: Sample x̂ ∼ p(x|z; φ)
5: Sample ẑ ∼ p(z|x̂; θ)
6: Use x and ẑ to compute p(ẑ|x; θ)
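A toy numeric sketch of the inversion idea may look as follows; the linear `encode`/`decode` maps are illustrative stand-ins for the sampled Bayesian networks, not the disclosed parametrization.

```python
# Toy sketch of the inversion routine of Algorithm 1 with one particle:
# encode x to z, decode to a reconstruction, then re-encode the
# reconstruction; the re-encoded code becomes the target of z.

def inversion(x, encode, decode):
    z = encode(x)           # step 3: z ~ p(z|x; theta)
    x_hat = decode(z)       # step 4: x_hat ~ p(x|z; phi)
    z_hat = encode(x_hat)   # step 5: z_hat ~ p(z|x_hat; theta)
    return z, x_hat, z_hat  # step 6: z_hat serves as the target for z
```

With a well-matched encoder/decoder pair (here, maps that invert each other), the target ẑ coincides with z, which is the consistency the inner updates drive toward.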
[59] In accordance with the framework 400, the adversarial examples are not found for a given input x in the input space D as may be the case with existing approaches. Rather, the framework 400 learns to perturb the latent code z in a way that the perturbed version of the latent code z' and the non-perturbed version of the latent code z lie in a same manifold M. As previously explained, x' is then constructed by using the decoder p_φ. In other words, the framework 400 may be said to efficiently perturb the latent codes and then map the low-dimensional representations back onto the input space. Such an approach allows better control of the perturbations injected into the adversarial examples, thereby ensuring that they are more likely to be similar to the inputs.
Implicit manifold learning
[60] An exemplary embodiment of a manifold learning routine which may be
implemented
by the manifold learning module 302 will be further discussed in the paragraph
below.
Uncovering structure in high dimensional data D and understanding its meta-properties may be achieved by mapping D to a low dimensional subspace in which explanatory hidden features may become apparent. The assumption is made that data of interest lie on or near lower dimensional manifolds in their embedding space. To implement the VAE, datapoints x_n ∈ D are modeled via a decoder x_n | z_n ∼ p(x_n | z_n; θ) with a prior p(z) placed on the latent codes z_n. To learn the parameters θ of the decoder, an approach may consist of maximizing a variational approximation of an empirical expected log-likelihood, which may be called the evidence lower bound (ELBO) and represented by the following mathematical representation:

L(θ, ψ; x) = E_{z|x;ψ} [ log ( p(x|z; θ) p(z) / q(z|x; ψ) ) ] = −KL( q(z|x; ψ) ‖ p(z|x; θ) ) + log p(x; θ)
[61] The expectation E_{z|x;ψ} can be expressed as a sum of a reconstruction loss, or expected negative log-likelihood of x, and a term KL( q(z|x; ψ) ‖ p(z) ). The KL term acts as a regularizer and forces the encoder q(z|x; ψ) to follow a distribution similar to p(z). To avoid limitations of prior art approaches imposing a Gaussian form on p(z), the present technology applies a Stein Variational Gradient Descent (SVGD) routine to learn the manifold, instead of explicitly optimizing the ELBO.
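To make the KL regularizer concrete, the sketch below evaluates the textbook closed-form KL term for a diagonal-Gaussian encoder and a standard normal prior. This is precisely the Gaussian assumption on p(z) that the SVGD approach is designed to avoid; it is shown only as the baseline being generalized.

```python
import numpy as np

# Closed-form KL( N(mu, diag(exp(log_var))) || N(0, I) ) and a simple
# ELBO with a Gaussian reconstruction term (up to additive constants).

def kl_to_standard_normal(mu, log_var):
    return 0.5 * np.sum(np.exp(log_var) + mu**2 - 1.0 - log_var)

def elbo(x, x_recon, mu, log_var):
    reconstruction = -0.5 * np.sum((x - x_recon) ** 2)  # expected log-likelihood term
    return reconstruction - kl_to_standard_normal(mu, log_var)
```

When the encoder matches the prior exactly (mu = 0, log_var = 0), the KL penalty vanishes and the ELBO reduces to the reconstruction term alone.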
Stein Variational Gradient Descent (SVGD) routine
[62] An exemplary embodiment of an SVGD routine which may be implemented as part of the manifold learning module 302 will be further discussed in the paragraph below. The SVGD routine may be implemented as a nonparametric variational inference method which does not confine a target distribution p(z). In some embodiments, to approximate p(z), SVGD maintains M particles {z^j}_{j=1}^M, which may be initially sampled from a simple distribution, and which may be iteratively transported via a functional gradient descent. At iteration t, each particle z_t ∈ Z_t may be updated as follows:

z_{t+1} ← z_t + η_t τ(z_t) where τ(z_t) = (1/M) Σ_{j=1}^M [ k(z_t^j, z_t) ∇_{z_t^j} log p(z_t^j) + ∇_{z_t^j} k(z_t^j, z_t) ]

where η_t is a step-size and k(·, ·) is a positive-definite kernel.

[63] In the above equation, each particle determines its update direction by consulting the other particles and asking for their gradients. The importance of the latter particles may be weighted according to the distance measure k(·, ·). Closer particles are given higher consideration than those lying further away. The term ∇_{z_t^j} k(z_t^j, z_t) is a regularizer that acts as a repulsive force between the particles to prevent them from collapsing into one particle. Upon convergence, the particles z_t^j will be unbiased samples of the true implicit distribution p(z).
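A minimal one-dimensional sketch of the SVGD update may be written as follows; the RBF kernel, its bandwidth and the standard normal target used in the usage note are illustrative choices rather than requirements of the routine.

```python
import numpy as np

# One SVGD step for M scalar particles: each particle moves along a
# kernel-weighted average of all particles' score gradients (attraction
# toward high density) plus the kernel-gradient term (repulsion).

def svgd_step(z, grad_log_p, step=0.1, h=1.0):
    diff = z[:, None] - z[None, :]        # diff[j, i] = z_j - z_i
    k = np.exp(-diff**2 / (2 * h**2))     # RBF kernel k(z_j, z_i)
    grad_k = -diff / h**2 * k             # d k(z_j, z_i) / d z_j
    phi = (k @ grad_log_p(z) + grad_k.sum(axis=0)) / len(z)
    return z + step * phi
```

Iterating this step with the score of a standard normal target (grad log p(z) = −z) transports scattered particles toward a spread-out approximation of N(0, 1): the attraction term pulls them toward the mode while the repulsion term keeps them from collapsing.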
Manifold learning via SVGD
[64] An exemplary embodiment of a characterisation of a manifold M based on an SVGD routine which may be implemented as part of the manifold learning module 302 will be further discussed in the paragraph below. In some embodiments, faithfully characterizing the manifold M of D may involve optimizing a divergence KL( q(z|x; θ) ‖ p(z|x; φ) ) using the SVGD routine. In order to improve efficiency as to how M is learnt, a Bayesian method is used as it provides a principled way to model uncertainty through a posterior distribution over model parameters. In this regard, M instances of model parameters {θ_m}_{m=1}^M are introduced where every θ_m is a particle that defines the weights and biases of a Bayesian neural network. The SVGD routine is applied on the particles Θ. In some embodiments, the SVGD routine may maintain M particles. In some embodiments, to limit drawbacks such as computational expenses for large M, only one recognition network f_η, which takes as input ξ ∼ N(0, I) and outputs θ_m, is maintained. To obtain Θ, η is updated through a small number of gradient steps. If η_t parameterizes f at iteration t, then η_{t+1} is obtained by:

η_{t+1} ← arg min_η (1/M) Σ_{m=1}^M ‖ f_η(ξ_m) − θ_m^{t+1} ‖² with θ_m^{t+1} ← θ_m^t + α_t τ(θ_m^t)

where τ(θ_t) = (1/M) Σ_{j=1}^M [ k(θ_t^j, θ_t) ∇_{θ_t^j} log p(θ_t^j) + ∇_{θ_t^j} k(θ_t^j, θ_t) ]
[65] In some embodiments, SVGD_τ(Θ) may be used to denote an SVGD update of Θ using the operator τ(·). As the particles are Bayesian, upon observing D, a prior p(θ) may be updated to obtain a posterior p(θ|D) ∝ p(D|θ) p(θ) which captures the uncertainty. The data likelihood p(D|θ) is evaluated over all pairs (x, x̂) where x ∈ D and x̂ is a dependent variable. In accordance with embodiments of the present technology, x̂ is generated using an algorithm implementing:

p(D|θ_t^j) = Π_{(x, x̂)} p(x̂|x; θ_t^j) where x ∈ D

[66] In some embodiments, for any input x ∈ D the corresponding latent code z is sampled from p(z|x; D), which may be approximated by calculating a Monte Carlo approximation over Θ. In some embodiments, p(z|x; D) may be represented by the following equation:

p(z|x; D) = ∫ p(z|x; θ) p(θ|D) dθ ≈ (1/M) Σ_{m=1}^M p(z|x; θ_m) where θ_m ∼ p(θ|D)
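The Monte Carlo approximation above amounts to ensembling over sampled model instances. In the toy sketch below each instance θ_m is a scalar weight of a linear encoder, an illustrative stand-in for the particles of a Bayesian neural network.

```python
import numpy as np

# Ensemble approximation of p(z|x; D): average the latent codes
# produced by M sampled model instances theta_m ~ p(theta|D).

def ensemble_latent(x, thetas):
    codes = [theta * x for theta in thetas]  # stand-in for z ~ p(z|x; theta_m)
    return np.mean(codes, axis=0)            # Monte Carlo average over m
```

Averaging over the instances, rather than committing to a single encoder, is what lets the posterior uncertainty over model parameters flow into the latent code.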
Manifold preserving adversarial attacks
[67] Referring back to FIG. 3, this section will now describe how, in
accordance with
embodiments of the present technology, the manifold learning module 302 and
the
perturbation module 304 may interact as a unified learning procedure to
generate adversarial
examples.
[68] As previously discussed, the adversarial examples generator 226 aims at perturbing elements of M so that the perturbed elements reside in M and exhibit the semantics of D captured by M. In other words, a linear mapping h': M → M is sought so that M may be said to be preserved under h'. In order to achieve this, and rather than directly finding a linear span h', a new set of instances of model parameters Θ' = {θ'_m}_{m=1}^M is introduced. Each θ'_m denotes the weights and biases of a Bayesian neural network. Then, for any input x ∈ D and its latent code z ∼ p(z|x; D), a point in M, h'(z) = z' is set. z' is sampled from p(z|x; D) and is approximated by a Monte Carlo approximation using Θ'. In some embodiments, a local smoothness of M is leveraged in a way to encourage z' to reside in M in a close neighborhood of z using a Gram-Schmidt Basis Sign Method (GBSM) routine.
Gram-Schmidt Basis Sign Method (GBSM) routine
[69] This section will now describe an exemplary embodiment of a GBSM routine. Assume X is a minibatch of samples of D and Z_m a set of latent codes z_m ∼ p(z|x; θ_m) where x ∈ X and θ_m ∈ Θ. For any m ∈ {1, ..., M}, δ_m is learnt to generate perturbed versions of Z_m along directions of an orthogonal basis U_m. As M is locally Euclidean, dimensions of the subspace Z_m are computed by applying Gram-Schmidt to orthogonalize a span of representative local points. This step may be formalized as an optimisation routine which, once executed, allows learning of perturbations δ_m and directions sign(U_m) along which to perturb Z_m and θ'_m, in accordance with the following equation:

δ_m* = arg min_{δ_m} Σ_{z_m ∈ Z_m} ‖ z'_m − [ z_m + δ_m ⊙ sign(U_m) ] ‖² where z'_m ∼ p(z|x; θ'_m)

[70] In some embodiments, the GBSM routine relies upon the fact that topological spaces are closed under their basis vectors to render M invariant to the perturbations δ_m. The following steps are an example of how the GBSM routine may be executed. First, a model instance θ'_m is sampled. Then, z'_m ∼ p(z|x; θ'_m) is generated for all x ∈ X. Z_m is then orthogonalized and a noise tensor δ_m minimizing the objective along directions of the basis vector U_m is found. The perturbations δ_m are constrained to be small. With δ_m fixed, θ'_m is updated by minimizing the objective again. The notation GBSM(Θ', Δ), where Δ = {δ_1, ..., δ_M}, is used to denote one update of Θ' via the GBSM routine.
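The orthogonalization at the heart of the GBSM steps above can be sketched as follows; the fixed perturbation magnitude and the way basis signs are aggregated are simplifying assumptions, not the learned quantities of the routine.

```python
import numpy as np

# Gram-Schmidt orthonormalization of a local span of latent codes,
# followed by a small perturbation along the signs of the basis.

def gram_schmidt(vectors):
    basis = []
    for v in vectors:
        w = v - sum(np.dot(v, b) * b for b in basis)  # remove projections
        norm = np.linalg.norm(w)
        if norm > 1e-10:                              # skip dependent vectors
            basis.append(w / norm)
    return np.array(basis)

def gbsm_perturb(z_batch, delta=0.05):
    u = gram_schmidt(z_batch)           # orthogonal basis U_m of the local span
    direction = np.sign(u.sum(axis=0))  # aggregated sign(U_m), an assumption
    return z_batch + delta * direction  # small step, stays in the neighborhood
```

Because the perturbation moves each code only a small distance along directions spanned by the local basis, the perturbed codes stay in a close neighborhood of the originals.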
Manifold alignment
[71] This section will now describe an exemplary embodiment of a manifold alignment routine. Even though the GBSM routine may confer latent noise imperceptibility and sampling speed, Θ' may deviate from Θ, in which case the learnt manifolds may be misaligned. To mitigate this, a manifold alignment routine may be applied by regularizing each θ'_m ∈ Θ' after every GBSM update. In some embodiments, an SVGD update on Θ' may be applied to ensure that Θ' follow the transform maps constructed by Θ, in accordance with the following equation:

θ'_{t+1} ← θ'_t + α_t τ(θ'_t) where τ(θ'_t) = (1/M) Σ_{j=1}^M [ k(θ_t^j, θ'_t) ∇_{θ_t^j} log p(θ_t^j) + ∇_{θ_t^j} k(θ_t^j, θ'_t) ]

[72] A notation SVGD_Θ(Θ') may be used to denote one application of the gradient update rule of the equation above. During such an update, the model instances Θ' determine their own update direction by consulting the particles Θ alone instead of consulting each other. Maintaining Θ' for large M being potentially computationally prohibitive, only one recognition network f_η' may be maintained that takes as input ξ ∼ N(0, I) and generates θ'_m, m ∈ {1, ..., M}. In some embodiments, η' is updated through a small number of gradient steps to obtain good generalization performances, in accordance with the following equation:

η'_{t+1} ← arg min_{η'} (1/M) Σ_{m=1}^M ‖ f_η'(ξ_m) − θ'^{t+1}_m ‖² where θ'^{t+1}_m ← θ'^t_m + α'_t τ(θ'^t_m)
Generating adversarial attacks
[73] Amongst other benefits, the present technology allows generating
adversarial
examples in the context of a "black-box scenario" in which only predictions of
a classifier g
are accessible (as opposed to being able to access parametrization of the
classifier g).
Amongst yet other benefits, the present technology allows generating
adversarial examples in
the context of a "white-box scenario" in which parametrization of a classifier
g is known. In
some embodiments, adversarial examples are generated by optimizing a loss, in
accordance
with the following equation:
L = ‖x − x'‖₂ + min_{y' ∈ Y\{y}} 1_{y=ŷ} · log( 1 − P(y'|x') ) such that ‖x − x'‖₂ ≤ ε_attack
[74] In accordance with the above equation, the first term is a reconstruction loss which may account for a dissimilarity between any input x ∈ D and its adversarial counterpart x'. In some embodiments, the first term may be constrained to be smaller than ε_attack so that x' resides within an ε_attack-radius ball of x. The second term is a log-likelihood loss of a target class y' ∈ Y \ {y} where y is the class of x. The loss defines the cost incurred for failing to fool the classifier g.
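The loss above can be sketched numerically as follows; `probs_adv`, standing in for the predictions P(·|x') of the black-box classifier g, is a hypothetical input, as is the flattened vector form of x.

```python
import numpy as np

# Sketch of the attack loss: reconstruction distance plus, while the
# classifier still predicts the true class, the best log(1 - P(y'|x'))
# over wrong target classes y' (smallest when some wrong class is probable).

def attack_loss(x, x_adv, probs_adv, y_true, y_pred):
    recon = np.linalg.norm(x - x_adv)  # ||x - x'||_2 term
    cls_term = 0.0
    if y_pred == y_true:               # indicator 1_{y = y_hat}
        p_best_wrong = max(p for c, p in enumerate(probs_adv) if c != y_true)
        cls_term = np.log(1.0 - p_best_wrong)
    return recon + cls_term
```

Once the classifier is fooled (y_pred differs from y_true), only the reconstruction distance remains, which is what keeps x' within the ε_attack-radius ball of x.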
[75] In accordance with some embodiments of the present technology, an end-to-
end
adversarial examples generation procedure may be summarized as follows.
Algorithm 2 Generating Adversarial Examples. Lines 2 and 4 compute distances between sets keeping a one-to-one mapping between them. x' is adversarial to x when L_x' ≤ ε_attack and ŷ ≠ y'.
1: function InnerTraining(Θ, Θ', η, η', Δ, φ)   ▷ local gradient updates of f_η, f_η', Δ
Require: Learning rates β, β'.
2: η ← η − β ∇_η ‖Θ − SVGD_τ(Θ)‖²   ▷ apply inversion on Θ and update η
3: Δ, Θ' ← GBSM(Θ', Δ)   ▷ update Δ and Θ' using GBSM
4: η' ← η' − β' ∇_η' ‖Θ' − SVGD_Θ(Θ')‖²   ▷ align Θ' with Θ and update η'
5: return η, η', Δ
Require: Training samples (x, y) ∈ D × Y
Require: Number of model instances M
Require: Number of inner updates T
Require: Initialize weights η, η', φ   ▷ recognition nets f_η, f_η', decoder p_φ
Require: Initialize perturbations Δ := {δ_1, ..., δ_M}   ▷ latent (adversarial) perturbations
Require: Learning rates α, α', α'', and strength ε_attack
6: Sample ξ_1, ..., ξ_M from N(0, I)   ▷ inputs to recognition nets f_η, f_η'
7: for t = 1 to T do
8: Sample Θ = {θ_m}_{m=1}^M where θ_m ∼ f_η(ξ_m)
9: Sample Θ' = {θ'_m}_{m=1}^M where θ'_m ∼ f_η'(ξ_m)
10: Use Θ and Θ' in Equation 3 to sample z and z'
11: Sample x̂ ∼ p(x|z; φ) and x' ∼ p(x|z'; φ)   ▷ clean and perturbed reconstructions
12: η, η', Δ ← InnerTraining(Θ, Θ', η, η', Δ, φ)
13: L_x̂ := ‖x − x̂‖₂; L_x' := ‖x − x'‖₂   ▷ reconstruction losses on x̂ and x'
14: L_x' := L_x' + min_{y' ∈ Y} [ 1_{y=ŷ} · log(1 − P(y'|x')) ], if L_x' > ε_attack; otherwise leave L_x' unchanged
15: η ← η − α ∇_η L_x̂; η' ← η' − α' ∇_η' L_x'   ▷ SGD update using Adam optimizer
16: φ ← φ − α'' ∇_φ (L_x̂ + L_x')   ▷ SGD update using Adam optimizer
[76] In accordance with the above end-to-end adversarial examples generation procedure, at steps 1-5, vectors η, η' and φ are learnt by a model. At step 6, M random variables ξ_m are sampled. In other words, Ξ = {ξ_1, ..., ξ_M}. At steps 7-9, Θ and Θ' are generated. Θ = {θ_1, ..., θ_M} where θ_m = f_η(ξ_m) and Θ' = {θ'_1, ..., θ'_M} where θ'_m = f_η'(ξ_m). Particles θ_m and θ'_m are generated from recognition networks f_η(·) and f_η'(·) where η and η' are the learnt parameters. In some embodiments, particles θ_m and model instances θ'_m are encoder parameters of E and E'.
[77] At step 10, latent variables z and z' are sampled from the encoders. Latent variables z and z' are encoded representations of an input x. Latent variables z and z' may be approximated as z ∼ p(z|x; Θ) = g_θ(x) (e.g., the encoder E) and z' ∼ p(z|x; Θ') = g_θ'(x) (e.g., the encoder E'). Input x and perturbed variable x' are reconstructed from the decoder and may be approximated as x̂ ∼ p(x|z; φ) = g_φ(z) and x' ∼ p(x|z'; φ) = g_φ(z').

[78] In some embodiments, an inner training is performed wherein x = input (e.g., a portrait), z = g_θ(x) (the input is encoded into a latent representation z), x̂ = g_φ(z) (the input is decoded/reconstructed) and ẑ = g_θ(x̂) (the target x̂ is re-encoded into a latent representation ẑ). x̂ and ẑ may be deemed targets of x and z respectively. If g_θ(·) and g_φ(·) are properly trained, x (x̂) and z (ẑ) should be similar.
[79] In accordance with some embodiments of the present technology, the InnerTraining function called at step 12 may comprise (1) updating η with SVGD_τ(Θ) (i.e., the manifold learning set forth at step 2 above), (2) distorting the latent codes with GBSM (i.e., the manifold perturbation/distortion set forth at step 3 above), and (3) updating η' with SVGD_Θ(Θ') (i.e., the manifold alignment set forth at step 4 above).
[80] At steps 13 and 14, the loss functions L_x̂ and L_x' are computed. In some embodiments, L_x̂ is a function quantifying how similar x and x̂ are and L_x' is a function quantifying how similar x and x' are. For a trained model, i.e., when L_x̂ and L_x' are at a local minimum, then x̂, x' and x lie in a same manifold and x' is an adversarial example of x.

[81] At steps 12 and 14, if L_x' > ε_attack, the learnt parameters may be updated to fool a classifier; otherwise the learnt parameters may be updated to reinforce the classifier. Depending on L_x' and ε_attack, the model is trained to either fool or reinforce a classifier (decoder), for example, but without being limitative, in the context of generative adversarial networks (GAN).
Examples of manifold preservation
[82] Referring now to FIG. 6, examples 600 of an experiment based on the present technology are illustrated. The experiment is based on a 3D non-linear Swiss Roll dataset which comprises 1,600 data points grouped in 4 classes. Graph 602 illustrates 2D plots of a manifold learnt after execution of the manifold learning module 302. The manifold comprises learned manifold elements also referred to as latent codes. Graph 604 illustrates 2D plots of perturbed learned manifold elements generated by the perturbation module 304. As may be appreciated, the perturbed learned manifold elements have been generated so as to espouse the Swiss Roll manifold. To the contrary, graph 606 illustrates 2D plots of perturbed learned manifold elements generated in accordance with prior approaches, in this example the projected gradient descent (PGD) approach. As may be appreciated, the perturbed learned manifold elements generated with the PGD approach do not properly espouse the Swiss Roll manifold. Graph 608 illustrates a 3D plot of the dataset represented in graph 602. Graph 610 illustrates a 3D plot of the dataset represented in graph 604. Graph 612 illustrates a 3D plot of the dataset represented in graph 606.
[83] Referring to FIG. 7, an experiment 700 relating to generating adversarial examples in the context of image processing based on the present technology is illustrated. Inputs 702 are illustrated along with clean reconstructions 704. Adversarial examples 706 generated in accordance with the present technology are also illustrated.

[84] Referring to FIG. 8, an experiment 800 relating to generating adversarial examples in the context of language processing based on the present technology is illustrated. True inputs 1-7 are illustrated along with corresponding adversarial examples generated in accordance with the present technology.
Method for generating adversarial examples
[85] Referring now to FIG. 9, some non-limiting example instances of systems and computer-implemented methods used in connection with generating adversarial examples to be used for training of an MLA, such as, but without being limited to, for training of a DNN, are described. More specifically, FIG. 9 shows flowcharts illustrating a computer-implemented method 900 implementing embodiments of the present technology. The computer-implemented method of FIG. 9 may comprise a computer-implemented method executable by a processor of a
computing environment, such as the computing environment 100 of FIG. 1, the
method
comprising a series of steps to be carried out by the computing environment.
[86] Certain aspects of FIG. 9 may have been previously described with
references to FIG.
2-6. The reader is directed to that disclosure for additional details.
[87] The method 900 starts at step 902 by accessing a set of inputs. As
previously
discussed, the set of inputs may define a set of training data to be used for
the training of an
MLA, such as, for example, the input 210 illustrated at FIG. 2.
[88] Then, at step 904, the method 900 executes generating an instance of a
variable auto-
encoder (VAE). In some embodiments, the VAE comprises an encoder and a
decoder. In
some embodiments, the encoder and the decoder are each associated with weights
and biases.
In some embodiments, the VAE comprises a first encoder, such as the encoder E
depicted at
FIG. 4, and a second encoder, such as the encoder E' also depicted at FIG. 4.
In some
embodiments, the instance of the VAE is based on a recognition model for which
parameters
are learnt in such a way that the instance encodes the set of inputs into
latent representation
elements associated with a latent space. The latent representation elements represent
low-dimensional geometric summaries establishing semantics associated with the set of inputs. A
representation of an exemplary latent space along with low geometric summaries
is depicted
at FIG. 6.
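By way of a rough sketch of step 904, the encode/sample/decode path of a VAE can be illustrated as follows. This is a toy illustration under stated assumptions (NumPy, randomly initialized weights standing in for parameters the recognition model would actually learn, and illustrative dimensions); it is not the recognition model of the present technology:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative dimensions (assumptions): 8-d inputs, 2-d latent space.
D_IN, D_LATENT = 8, 2

# Random weights and biases stand in for learnt encoder/decoder parameters.
W_enc = rng.normal(size=(D_IN, 2 * D_LATENT)) * 0.1
b_enc = np.zeros(2 * D_LATENT)
W_dec = rng.normal(size=(D_LATENT, D_IN)) * 0.1
b_dec = np.zeros(D_IN)

def encode(x):
    """Map inputs to the mean and log-variance of a Gaussian posterior."""
    h = x @ W_enc + b_enc
    return h[..., :D_LATENT], h[..., D_LATENT:]

def reparameterize(mu, log_var):
    """Sample latent codes z = mu + sigma * eps (reparameterization trick)."""
    eps = rng.normal(size=mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def decode(z):
    """Reconstruct inputs from latent codes."""
    return z @ W_dec + b_dec

x = rng.normal(size=(5, D_IN))    # a toy "set of inputs"
mu, log_var = encode(x)
z = reparameterize(mu, log_var)   # latent representation elements
x_rec = decode(z)
```

The latent codes `z` play the role of the latent representation elements; in a trained VAE they would summarize the semantics of the inputs.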
[89] At step 906, the method 900 executes applying a manifold learning routine
on the
instance of the auto-encoder to establish a characterization of a manifold in
the latent space,
the manifold being defined by the latent representation elements. In some
embodiments, the
manifold learning routine comprises a variational inference routine executing
learning of the
manifold while minimizing assumptions about a distribution of the latent
representation
elements. In some embodiments, the manifold learning routine comprises a Stein
Variational
Gradient Descent (SVGD) routine to learn the manifold. In some embodiments, the SVGD
routine generates the particles θ that parametrize the encoder E, wherein E is understood as
composed of m encoders, each encoder being parametrized by θm. The manifold M is
defined by latent codes generated using the particles θ. Exemplary
embodiments of the
manifold learning routine and of the SVGD routine are provided in the above
description of
FIG. 3-6. In some embodiments, generating adversarial examples is performed by
the
decoder and comprises operating an inversion routine to reconstruct the
adversarial examples
from the perturbed latent representation elements. Exemplary embodiments of the inversion
routine are provided in connection with the above description of FIG. 4.
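The SVGD routine of step 906 can be sketched on a toy one-dimensional target. The kernel choice (RBF with a median bandwidth heuristic), the target distribution, the step size, and the particle count below are illustrative assumptions, not details taken from the present description:

```python
import numpy as np

def rbf_kernel(x):
    """RBF kernel matrix K and the repulsive kernel gradients used by SVGD."""
    diffs = x[:, None] - x[None, :]            # diffs[i, j] = x_i - x_j
    h = np.median(np.abs(diffs)) ** 2 / np.log(len(x) + 1) + 1e-8
    K = np.exp(-diffs ** 2 / h)
    grad_K = 2.0 * diffs / h * K               # d K(x_i, x_j) / d x_j
    return K, grad_K

def svgd_step(x, score, lr=0.1):
    """One SVGD update: attraction along the score plus kernel repulsion."""
    K, grad_K = rbf_kernel(x)
    phi = (K @ score(x) + grad_K.sum(axis=1)) / len(x)
    return x + lr * phi

# Toy target: a standard normal, whose score is d/dx log p(x) = -x.
score = lambda x: -x

rng = np.random.default_rng(1)
particles = rng.normal(loc=5.0, size=50)       # start far from the target
for _ in range(1000):
    particles = svgd_step(particles, score)
```

After enough steps the particle cloud approximates the target distribution; in the setting described above, the particles parametrize the encoder E rather than a scalar.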
[90] At step 908, the method 900 executes applying a perturbation routine to
generate
perturbed latent representation elements while constraining the perturbed
latent
representation elements to remain within the manifold. In some embodiments,
the
perturbation routine comprises establishing a neighborhood of the latent
representation
elements which preserves semantics associated with the set of inputs. In some
embodiments,
the neighborhood is a uniformly-bounded ball associated with a radius (εattack). In some
embodiments, the perturbation routine comprises generating perturbed instances
of the
encoder that define perturbed weights and biases of a perturbed neural
network, the perturbed
instances defining a perturbed manifold.
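One simple way to keep perturbed latent codes in a semantics-preserving neighborhood, as in step 908, is to project each perturbation back into the uniformly-bounded ball of radius εattack around the original latent code. The projection scheme below is an illustrative assumption, not the patented perturbation routine:

```python
import numpy as np

def perturb_in_ball(z, eps_attack, rng):
    """Perturb latent codes and project each perturbation back into the
    eps_attack-ball around z, keeping the perturbed codes in a bounded
    neighborhood of the original latent representation elements."""
    delta = rng.normal(size=z.shape)
    norms = np.linalg.norm(delta, axis=-1, keepdims=True)
    # Rescale any perturbation that would leave the uniformly-bounded ball.
    delta = delta * np.minimum(1.0, eps_attack / (norms + 1e-12))
    return z + delta

rng = np.random.default_rng(2)
z = rng.normal(size=(10, 4))                  # latent codes on the manifold
z_pert = perturb_in_ball(z, eps_attack=0.1, rng=rng)
dist = np.linalg.norm(z_pert - z, axis=-1)    # all distances stay <= 0.1
```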
[91] In some embodiments, constraining the perturbed latent representation
elements to
remain within the manifold comprises applying a Gram-Schmidt Basis Sign Method
(GBSM)
routine. In some embodiments, constraining the perturbed latent representation
elements to
remain within the manifold further comprises applying a manifold alignment
routine. In some
embodiments, the manifold alignment routine comprises regularizing each perturbed
instance of the encoder after each update of the GBSM. Exemplary embodiments
of the
GBSM routine and of the manifold alignment routine are provided in the above
description of
FIG. 3-6.
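As a loose illustration of the Gram-Schmidt ingredient of a GBSM-style routine (not the patented procedure itself), one can orthonormalize a set of directions with Gram-Schmidt and take signed steps along the resulting basis. All names, dimensions, and step sizes below are illustrative assumptions:

```python
import numpy as np

def gram_schmidt(V):
    """Orthonormalize the rows of V with classical Gram-Schmidt."""
    basis = []
    for v in V:
        for b in basis:
            v = v - (v @ b) * b          # remove components along earlier vectors
        n = np.linalg.norm(v)
        if n > 1e-10:                    # skip (near-)dependent directions
            basis.append(v / n)
    return np.array(basis)

def basis_sign_step(z, directions, grad, step):
    """Move z by 'step' along the sign of the gradient expressed in an
    orthonormal basis, loosely echoing a basis sign method."""
    B = gram_schmidt(directions)
    coords = grad @ B.T                  # gradient coordinates in the basis
    return z + step * np.sign(coords) @ B

rng = np.random.default_rng(3)
z = np.zeros(4)
directions = rng.normal(size=(4, 4))     # candidate perturbation directions
grad = rng.normal(size=4)
z_new = basis_sign_step(z, directions, grad, step=0.01)
```

Because the basis is orthonormal, the step has a fixed, controllable magnitude (step times the square root of the basis size), which makes the perturbation budget easy to reason about.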
[92] Then, at step 910, the method 900 executes generating adversarial
examples based on
the perturbed latent representation elements, the adversarial examples sharing common
semantics associated with the set of inputs.
[93] In some embodiments, generating adversarial examples also comprises
optimizing a
loss. In some embodiments, optimizing the loss comprises bounding the loss by a radius
(εattack) as previously detailed in connection with the description of FIG. 3-6.
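Under the simplifying assumption that bounding a loss by a radius amounts to clamping its magnitude (the actual bounding scheme is the one described in connection with FIG. 3-6), the idea can be sketched as:

```python
import numpy as np

def bounded_loss(loss, eps_attack):
    """Clamp per-example losses so their magnitude never exceeds eps_attack,
    one simple way to bound a loss by a radius."""
    return np.clip(loss, -eps_attack, eps_attack)

losses = np.array([-2.0, 0.05, 0.3])
bounded = bounded_loss(losses, eps_attack=0.5)
```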
[94] At step 912, the method 900 proceeds to outputting the adversarial
examples which
may be used, for example, but without being limitative, as part of a training
routine of a
DNN. In some embodiments, the adversarial examples are generated so as to fool
a classifier
associated with a neural network, for example, but without being limitative,
in the context of
a "black-box attack" scenario of a classifier whose associated parameters are unknown and/or
in the context of a "white-box attack" scenario of a classifier whose associated parameters are
known. In some other embodiments, the adversarial examples may be used for reinforcement
of a classifier (i.e., by augmenting the set of training inputs with the
generated adversarial
examples). In some embodiments, the steps 906 and 908 are executed as a
unified routine, in
other words, steps 906 and 908 may be executed in parallel and not in series.
[95] While some of the above-described implementations may have been described
and
shown with reference to particular acts performed in a particular order, it
will be understood
that these acts may be combined, sub-divided, or re-ordered without departing
from the
teachings of the present technology. At least some of the acts may be executed
in parallel or
in series. Accordingly, the order and grouping of the acts is not a limitation of the present
technology.
[96] It should be expressly understood that not all technical effects
mentioned herein need
be enjoyed in each and every embodiment of the present technology.
[97] As used herein, the wording "and/or" is intended to represent an
inclusive-or; for
example, "X and/or Y" is intended to mean X or Y or both. As a further
example, "X, Y,
and/or Z" is intended to mean X or Y or Z or any combination thereof. As used
herein, the
wording "at least one of X or Y" or "at least one of X and Y" is intended to
represent an
inclusive-or; for example, "at least one of X or Y" or "at least one of X and
Y" are intended
to mean X or Y or both. As a further example, "at least one of X, Y or Z" or
"at least one of
X, Y and Z" are intended to mean X or Y or Z or any combination thereof.
[98] The foregoing description is intended to be exemplary rather than
limiting.
Modifications and improvements to the above-described implementations of the
present
technology may be apparent to those skilled in the art.
Administrative Status


Title Date
Forecasted Issue Date 2023-08-15
(22) Filed 2019-10-28
Examination Requested 2019-10-28
(41) Open to Public Inspection 2021-04-28
(45) Issued 2023-08-15

Abandonment History

There is no abandonment history.

Maintenance Fee

Last Payment of $100.00 was received on 2023-10-02


 Upcoming maintenance fee amounts

Description Date Amount
Next Payment if standard fee 2024-10-28 $277.00
Next Payment if small entity fee 2024-10-28 $100.00

Note : If the full payment has not been received on or before the date indicated, a further fee may be required which may be one of the following

  • the reinstatement fee;
  • the late payment fee; or
  • additional fee to reverse deemed expiry.

Patent fees are adjusted on the 1st of January every year. The amounts above are the current amounts if received by December 31 of the current year.
Please refer to the CIPO Patent Fees web page to see all current fee amounts.

Payment History

Fee Type Anniversary Year Due Date Amount Paid Paid Date
Application Fee 2019-10-28 $400.00 2019-10-28
Request for Examination 2024-10-28 $800.00 2019-10-28
Maintenance Fee - Application - New Act 2 2021-10-28 $100.00 2021-10-27
Registration of a document - section 124 2021-12-21 $100.00 2021-12-21
Maintenance Fee - Application - New Act 3 2022-10-28 $100.00 2022-10-27
Final Fee 2019-10-28 $306.00 2023-06-13
Maintenance Fee - Patent - New Act 4 2023-10-30 $100.00 2023-10-02
Owners on Record

Note: Records showing the ownership history in alphabetical order.

Current Owners on Record
SERVICENOW CANADA INC.
Past Owners on Record
ELEMENT AI INC.
Past Owners that do not appear in the "Owners on Record" listing will appear in other documentation within the application.
Documents




Document Description   Date (yyyy-mm-dd)   Number of pages   Size of Image (KB)
Examiner Requisition 2020-12-09 5 269
Amendment 2021-04-07 23 687
Claims 2021-04-07 7 230
Maintenance Fee Payment 2021-10-27 1 33
Examiner Requisition 2021-11-30 6 269
Amendment 2022-03-29 7 180
Maintenance Fee Payment 2022-10-27 1 33
Representative Drawing 2023-02-08 1 13
Cover Page 2023-02-08 1 42
New Application 2019-10-28 6 130
Abstract 2019-10-28 1 18
Description 2019-10-28 24 1,165
Claims 2019-10-28 4 114
Drawings 2019-10-28 9 289
Final Fee 2023-06-13 5 125
Representative Drawing 2023-07-25 1 12
Cover Page 2023-07-25 1 44
Electronic Grant Certificate 2023-08-15 1 2,527