Patent 2723511 Summary

(12) Patent:	(11) CA 2723511
(54) English Title:	SYSTEM AND METHOD FOR ANALYZING COMPUTING SYSTEM RESOURCES
(54) French Title:	SYSTEME T PROCEDE D'ANALYSE DES RESSOURCES D'UN SYSTEME INFORMATIQUE
Status:	Granted and Issued

Bibliographic Data

(51) International Patent Classification (IPC):	G06F 11/34 (2006.01)
(72) Inventors :	HILLIER, ANDREW D. (Canada) YUYITUNG, TOM S. (Canada) SMITH, GERALD W. (Canada)
(73) Owners :	CIRBA IP INC.
(71) Applicants :	CIRBA INC. (Canada)
(74) Agent:	CPST INTELLECTUAL PROPERTY INC.
(74) Associate agent:
(45) Issued:	2018-02-20
(22) Filed Date:	2010-12-03
(41) Open to Public Inspection:	2012-06-03
Examination requested:	2015-11-05
Availability of licence:	N/A
Dedicated to the Public:	N/A
(25) Language of filing:	English

Patent Cooperation Treaty (PCT):	No

(30) Application Priority Data:	None

Abstracts

English Abstract

A system and method are provided for determining computing system efficiencies. The method comprises obtaining a first set of data pertaining to characteristic of each of a plurality of computing systems; obtaining a second set of data pertaining to one or more workloads utilizing the resource; generating a first data model using the first set of data, the first data model comprising resource capacities and capabilities of the computing systems; generating a second data model using the second set of data, the second data model comprising parameters associated with the workloads; performing a workload placement analysis by placing the one or more workloads onto the plurality of computing systems, using the first and second data models, to determine a theoretical minimum computing system capacity required to fulfill the one or more workloads; and computing a fully loaded utilization based on a ratio of the theoretical minimum computing system capacity relative to a total capacity of the plurality of computing systems.

French Abstract

Un système et un procédé sont présentés pour déterminer lefficacité dun système informatique. La méthode comprend lobtention dun premier ensemble de données portant sur une caractéristique de chacun dune pluralité de systèmes informatiques; lobtention dun deuxième ensemble de données portant sur une ou plusieurs charges de travail employant la ressource; la génération dun premier modèle de données au moyen du premier ensemble de données, le premier modèle de données comprenant les capacités et les fonctionnalités de ressource des systèmes informatiques; la génération dun deuxième modèle de données au moyen dun deuxième ensemble de données, le deuxième ensemble de données comprenant des paramètres associés aux charges de travail; lexécution dune analyse de placement de charge de travail en plaçant un ou plusieurs charges de travail dans la pluralité des systèmes informatiques, au moyen des premier et deuxième modèles de données, pour déterminer une capacité de système informatique minimale théorique requise pour satisfaire la une ou les plusieurs charges de travail; et le calcul dune utilisation pleine capacité fondée sur un rapport entre la capacité du système informatique minimale théorique et la capacité totale de la pluralité des systèmes informatiques.

Claims

Note: Claims are shown in the official language in which they were submitted.

Claims:
1. A method of analyzing computing system resources, the method comprising:
obtaining a first set of data pertaining to characteristics of each of a
plurality of
computing systems;
obtaining a second set of data pertaining to a plurality of workloads hosted
on the plurality of
computing systems;
generating a first data model using the first set of data, the first data
model
comprising resource capacities and capabilities of the computing systems;
generating a second data model using the second set of data, the second data
model
comprising requirements associated with the plurality of workloads;
performing a workload placement analysis by iteratively placing the plurality
of workloads
onto different ones of the plurality of computing systems to evaluate multiple
combinations, using the
first and second data models and at least one analysis parameter, to determine
a theoretical
minimum set of computing systems required to fulfill the plurality of
workloads to implement the
placements, the at least one analysis parameter comprising one or more
assumption, constraint or
optimization criterion, wherein if not all of the plurality of workloads can
be placed according to a first
iteration of the workload placement analysis, at least one hypothetical
computing system is
determined, and at least one additional iteration of the workload placement
analysis is performed
using the at least one hypothetical computing system, to determine the
theoretical minimum set of
computing systems required;
computing a theoretical minimum computing system capacity required by summing
the
capacity of the theoretical minimum set of computing systems required;
computing a value based on a ratio of the theoretical minimum computing system
capacity
relative to a total capacity of the plurality of computing systems currently
hosting the plurality of
workloads; and
outputting the value to indicate computing system capacity relative to
workload requirements.
2. The method according to claim 1 , wherein the parameters associated with
the workloads
comprise any one or more or resource utilization, performance requirements,
availability
requirements, and compatibilities.
3. The method according to claim 1 or claim 2, further comprising obtaining
one or more
assumptions, constraints, and optimization criteria indicative of how the one
or more workloads are
- 17 -

to be placed on the plurality of computing systems.
4. The method according to any one of claims 1 to 3, further comprising
obtaining one or more
business policies affecting the one or more workloads.
5. The method according to any one of claims 1 to 4, wherein the computing
systems comprise
one or more servers, additional servers required to host the workloads, or
both.
6. The method according to any one of claims 1 to 5, wherein the resource
is of one of the
following types: CPU, memory, disk I/O, network I/O, power consumption, and
number of virtual
processors.
7. The method according to any one of claims 1 to 6, further comprising
using the value to
conduct one or more what if scenarios to add or remove one or more constraints
or assumptions.
8. The method according to any one of claims 1 to 7, further comprising
repeating the method
to obtain a plurality of values for multiple time periods and comparing the
plurality of values.
9. The method according to claim 8, further comprising generating a
visualization showing the
value for a plurality of scenarios.
10. The method according to any one of claims 1 to 9, further comprising
computing a plurality of
values for multiple what if scenarios over multiple time periods.
11. A computer readable medium comprising computer executable instructions
for analyzing
computing system resources, the computer executable instructions comprising
instructions for:
obtaining a first set of data pertaining to characteristics of each of a
plurality of
computing systems;
obtaining a second set of data pertaining to a plurality of workloads hosted
on the plurality of
computing systems;
generating a first data model using the first set of data, the first data
model
comprising resource capacities and capabilities of the computing systems;
generating a second data model using the second set of data, the second data
model
comprising requirements associated with the plurality of workloads;
- 18 -

performing a workload placement analysis by iteratively placing the plurality
of workloads
onto different ones of the plurality of computing systems to evaluate multiple
combinations, using the
first and second data models and at least one analysis parameter, to determine
a theoretical
minimum set of computing systems required to fulfill the plurality of
workloads to implement the
placements, the at least one analysis parameter comprising one or more
assumption, constraint or
optimization criterion, wherein if not all of the plurality of workloads can
be placed according to a first
iteration of the workload placement analysis, at least one hypothetical
computing system is
determined, and at least one additional iteration of the workload placement
analysis is performed
using the at least one hypothetical computing system, to determine the
theoretical minimum set of
computing systems required;
computing a theoretical minimum computing system capacity required by summing
the
capacity of the theoretical minimum set of computing systems required;
computing a value based on a ratio of the theoretical minimum computing system
capacity
relative to a total capacity of the plurality of computing systems currently
hosting the plurality of
workloads; and
outputting the value to indicate computing system capacity relative to
workload requirements.
12. The computer readable medium according to claim 11, wherein the
parameters associated
with the workloads comprise any one or more or resource utilization,
performance
requirements, availability requirements, and compatibilities.
13. The computer readable medium according to claim 11 or claim 12, further
comprising
instructions for obtaining one or more assumptions, constraints, and
optimization criteria indicative of
how the one or more workloads are to be placed on the plurality of computing
systems.
14. The computer readable medium according to any one of claims 11 to 13,
further comprising
instructions for obtaining one or more business policies affecting the one or
more workloads.
15. The computer readable medium according to any one of claims 11 to 14,
wherein the
computing systems comprise one or more servers, additional servers required to
host the workloads,
or both.
16. The computer readable medium according to any one of claims 11 to 15,
wherein the
resource is of one of the following types: CPU, memory, disk I/O, network I/O,
power consumption,
- 19 -

and number of virtual processors.
17. The computer readable medium according to any one of claims 11 to 16,
further comprising
instructions for using the value to conduct one or more what if scenarios to
add or remove one or
more constraints or assumptions.
18. The computer readable medium according to any one of claims 11 to 17,
further comprising
instructions for repeating the method to obtain a plurality of values for
multiple time periods and
comparing the plurality of values.
19. The computer readable medium according to claim 18, further comprising
instructions for
generating a visualization showing the value for a plurality of scenarios.
20. The computer readable medium according to any one of claims 11 to 19,
further comprising
instructions for computing a plurality of values for multiple what if
scenarios over multiple time
periods.
21. An analysis system comprising a processor and memory, the memory
storing computer
executable instructions for analyzing computing system resources, the computer
executable
instructions comprising instructions for:
obtaining a first set of data pertaining to characteristics of each of a
plurality of
computing systems;
obtaining a second set of data pertaining to a plurality of workloads hosted
on the plurality of
computing systems;
generating a first data model using the first set of data, the first data
model
comprising resource capacities and capabilities of the computing systems;
generating a second data model using the second set of data, the second data
model
comprising requirements associated with the plurality of workloads;
performing a workload placement analysis by iteratively placing the plurality
of workloads
onto different ones of the plurality of computing systems to evaluate multiple
combinations, using the
first and second data models and at least one analysis parameter, to determine
a theoretical
minimum set of computing systems required to fulfill the plurality of
workloads to implement the
placements, the at least one analysis parameter comprising one or more
assumption, constraint or
optimization criterion, wherein if not all of the plurality of workloads can
be placed according to a first
- 20 -

iteration of the workload placement analysis, at least one hypothetical
computing system is
determined, and at least one additional iteration of the workload placement
analysis is performed
using the at least one hypothetical computing system, to determine the
theoretical minimum set of
computing systems required;
computing a theoretical minimum computing system capacity required by summing
the
capacity of the theoretical minimum set of computing systems required;
computing a value based on a ratio of the theoretical minimum computing system
capacity
relative to a total capacity of the plurality of computing systems currently
hosting the plurality of
workloads; and
outputting the value to indicate computing system capacity relative to
workload requirements.
22. The system according to claim 21, wherein the parameters associated
with the workloads
comprise any one or more or resource utilization, performance requirements,
availability
requirements, and compatibilities.
23. The system according to claim 21 or claim 22, further comprising
obtaining one or more
assumptions, constraints, and optimization criteria indicative of how the one
or more workloads are
to be placed on the plurality of computing systems.
24. The system according to any one of claims 21 to 23, further comprising
obtaining one or
more business policies affecting the one or more workloads.
25. The system according to any one of claims 21 to 24, wherein the
computing systems
comprise one or more servers, additional servers required to host the
workloads, or both.
26. The system according to any one of claims 21 to 25, wherein the
resource is of one of the
following types: CPU, memory, disk I/O, network 1/0, power consumption, and
number of virtual
processors.
27. The system according to any one of claims 21 to 26, further comprising
instructions for using
the value to conduct one or more what if scenarios to add or remove one or
more constraints or
assumptions.
- 21 -

28. The system according to any one of claims 21 to 27, further comprising
instructions for
repeating the method to obtain a plurality of values for multiple time periods
and comparing the
plurality of values.
29. The system according to claim 28, further comprising instructions for
generating a
visualization showing the value for a plurality of scenarios.
30. The system according to any one of claims 21 to 29, further comprising
instructions for
computing a plurality of values for multiple what if scenarios over multiple
time periods.
- 22 -

Description

Note: Descriptions are shown in the official language in which they were submitted.

CA 02723511 2010-12-03
SYSTEM AND METHOD FOR ANALYZING COMPUTING SYSTEM RESOURCES
TECHNICAL FIELD
[0001] The following relates generally to computing systems and more
particularly to
systems and methods for analyzing computing system resources.
BACKGROUND
[0002] Modern data centers typically comprise hundreds if not thousands of
servers. Each
server supplies a finite amount of resource capacity, typically in the form
of, but not limited to:
central processing unit (CPU) capacity, memory or storage capacity, disk
input/output (I/O)
throughput, and network I/O bandwidth. Workloads running on these servers
consume varying
amounts of these resources. With the advent of virtualization and cloud
technologies, individual
servers are able to host multiple workloads.
[0003] Percent CPU utilization, which corresponds to the ratio of CPU usage
relative to
CPU capacity, is a common measure of how effectively servers are being
utilized. CPU
utilization tends to vary over time and is often reported using average values
over a given time
interval (e.g. minute, hourly, daily, etc.).
[0004] The overall CPU utilization of a group of servers or an entire data
center can be
computed by aggregating the normalized utilization levels of the servers. CPU
benchmarks can
be used to normalize the percent CPU utilization of the servers to reflect the
relative processing
capabilities of each server.
[0005] Similar utilization metrics can be computed for other resources such
as memory,
storage, disk throughput and network bandwidth.
[0006] In many data centers, workloads do not fully utilize resources of
many of the servers,
for example, the average CPU utilization levels of servers can range from 10
to 20%. Other
server resources such as memory, disk I/O and network I/O tend to also be
underutilized.
[0007] Organizations often seek to reduce capital and operating costs of
data centers by
eliminating excess server capacity. A common strategy is to consolidate
workloads onto a
smaller number of servers. In most virtualized environments, workloads can be
migrated
between servers, easing the implementation of the consolidation strategy.
However, the actual
amount of consolidation that can be safely achieved is often not readily
determinable. For
22057261.6
- 1 -

CA 02723511 2010-12-03
instance, an IT environment where the workloads utilize an average of 20% of
the server CPU
capacity does not necessarily mean that 80% of the servers can be eliminated.
[0008] There are many factors that can affect why servers tend not to be
fully utilized. Such
factors may include, without limitation:
[0009] Peak demands ¨ The resource demands of most workloads are not
constant and
have peak demands that need to be serviced.
[0010] Unbalanced supply and demand of resources ¨ In many cases, one of
the resources
(e.g. memory) is the primary constraint, resulting in unused capacity of the
other resources.
[0011] Capacity fragmentation ¨ Resource capacity may be stranded on
servers due to the
indivisibility of some workloads and discrete supply of capacity from distinct
servers.
[0012] Expected growth ¨ Resource capacity may be reserved for anticipated
growth in
workloads.
[0013] Business policies ¨ Some workloads may need to be run together with
or apart from
other workloads due to data sensitivity, service level agreement (SLA)
requirements, etc.
[0014] Server redundancy ¨ Resource capacity is often reserved for critical
workloads to
handle server failures.
[0015] Thus, traditional metrics such as percent CPU utilization often do
not accurately
reflect what portion of the server capacity can be eliminated.
[0016] It is an object of the following to address the above-noted
disadvantages.
SUMMARY
[0017] In one aspect, there is provided a method of determining computing
system
efficiencies, the method comprising: obtaining a first set of data pertaining
to characteristics of
each of a plurality of computing systems; obtaining a second set of data
pertaining to one or
more workloads utilizing the resource; generating a first data model using the
first set of data,
the first data model comprising resource capacities and capabilities of the
computing systems;
generating a second data model using the second set of data, the second data
model
22057261.6
- 2 -

CA 02723511 2010-12-03
comprising parameters associated with the workloads; performing a workload
placement
analysis by placing the one or more workloads onto the plurality of computing
systems, using
the first and second data models, to determine a theoretical minimum computing
system
capacity required to fulfill the one or more workloads; and computing a fully
loaded utilization
based on a ratio of the theoretical minimum computing system capacity relative
to a total
capacity of the plurality of computing systems.
[0018] In another aspect, there may be provided a computer readable medium
comprising
computer executable instructions for performing the above method.
[0019] In yet another aspect, there may be provided an analysis system
configured to
perform the method according to the above method.
BRIEF DESCRIPTION OF THE DRAWINGS
[0020] Embodiments will now be described by way of example only with
reference to the
appended drawings wherein:
[0021] FIG. 1 is a block diagram of an analysis system configured to obtain
workload data
and system data associated with a computing environment comprising a plurality
of computing
systems.
[0022] FIG. 2 is a block diagram showing an example configuration for the
analysis system
of FIG. 1.
[0023] FIG. 3 is a screen shot illustrating set of workload data providing
a memory utilization
workload chart in hourly quartiles.
[0024] FIG. 4 is a screen shot illustrating a set of workload data
providing a percent CPU
utilization workload chart in hourly quartiles.
[0025] FIG. 5 is a flow chart illustrating an example set of computer
executable operations
for computing a fully loaded utilization metric.
[0026] FIG. 6 is a flow chart illustrating an example set of computer
executable operations
for performing a placement analysis.
22057261.6
- 3 -

CA 02723511 2010-12-03
[0027] FIG. 7 is an example chart showing fully loaded utilization versus
time.
DETAILED DESCRIPTION OF THE DRAWINGS
[0028] It has been recognized that to accurately measure the required
resource capacity of
computing systems such as servers, various relevant factors should be
considered, since the
capacity may be excess or insufficient. As will be described herein, this
measurement can be
reflected by the ratio of the theoretical minimum server capacity required
relative to the actual
server capacity provisioned, hereinafter also referred to as the "fully loaded
utilization".
[0029] By having this ratio, it can be determined that if the fully loaded
utilization is less than
100%, there is excess server capacity relative to the workload requirements,
based on the
defined policies, assumptions and other parameters and constraints. If the
fully loaded
utilization is 100%, the server capacity matches the workload requirements
based on the
defined policies, etc. If the fully loaded utilization is greater than 100%,
there is insufficient
server capacity relative to the workload requirements, based on the defined
policies,
assumptions, and other parameters and constraints.
[0030] It has been found that the fully loaded utilization metric can be
used for, among other
things: setting and measuring progress towards efficiency targets; computing
resource
requirements of a set of workloads for measuring costs and chargeback;
comparing what-if
scenarios (e.g. consolidate clusters, consolidate data centers, modify growth
projections,
change workload security requirements, etc.); and comparing group of servers
(e.g. clusters,
data centers, etc.); by accurately measuring the excess server capacity for a
collection of
workloads and servers.
[0031] Turning now to the figures, FIG. 1 illustrates an example computing
environment 8 to
be analyzed by an analysis system 10. The computing environment 8 in this
example
comprises a plurality of computing systems 12, e.g. servers. The computing
systems 12 are
operable to perform one or more computing tasks or operations and may or may
not work
cooperatively, e.g. in a cluster. In some embodiments, the computing
environment 8 is related
to a single organization, however, the computing environment 8 may also serve
several
organizations or entities, e.g. a computing cloud, "server farm", etc. The
computing systems 12
serve, facilitate, fulfill or otherwise provide resources for one or more
workloads 14. Each
22057261.6
- 4 -

CA 02723511 2010-12-03
workload 14 may generally refer to one or more tasks or operations expected to
be conducted
by one or more of the computing systems 12.
[0032] Each computing system 12 in this example has associated therewith, a
set of system
data 16. The system data 16 may describe, for example, the system's
capabilities and resource
capacities as they relate to the analyses discussed in greater detail below.
Each workload 14,
or the workloads 14 collectively, has/have associated therewith, workload data
18. The
workload data 18 may describe, for example, resource utilization, performance
and availability
requirements, and compatibilities as they relate to modeling constraints
associated with placing
the workloads 18 on the computing systems 12 as will be described in greater
detail below.
One example set of workload data 18 providing a memory utilization workload
chart in hourly
quartiles encompassing 24 hours is shown in FIG. 3. Another example set of
workload data 18
providing a percent CPU utilization workload chart in hourly quartiles
encompassing 24 hours is
shown in FIG. 4. Other embodiments of the resource utilization levels of
workloads are possible
and can include, without limitation, time-series data at 1 minute intervals,
hourly averages, daily
quintiles, etc.
[0033] The analysis system 10 is configured or otherwise operable to obtain
the system
data 16 and workload data 18, e.g. by downloading such data, having such data
uploaded
thereto, or otherwise effecting a transfer or access to the data over an
available interface such
as a network connection, disk drive, etc. The analysis system 10 also
typically considers
analysis parameters 20 in order to perform an efficiency analysis on the
system and workload
data 16, 18. The analysis parameters 20 may include, for example, assumptions,
constraints,
and optimization criteria governing how workloads 14 are to be placed. The
analysis
assumptions and constraints may also consider business policies in performing
an efficiency
analysis. The business policies may include, for example, workloads 14 that
may need to run
together with or apart from other workloads 14 due to data sensitivity, SLA
requirements, etc. as
noted above.
[0034] The analysis system 10 thus obtains or is otherwise provided with
system data 16,
workload data 18 and, if applicable analysis parameters 20, in order to
compute an analysis
output 25 which uses one or more fully loaded utilization 24 computations in
one or more
scenarios and/or uses of the metric. As will be explained in greater detail
below:
22057261.6
- 5 -

CA 02723511 2010-12-03
[0035] fully loaded utilization = theoretical minimum server capacity /
actual server capacity
provisioned * 100
[0036] This value reflects the actual amount of computing system capacity
that is required
for all of the workloads 14. If the fully loaded utilization is less than 100,
this value subtracted
from 100 thus reflects the amount of computing system capacity that can be
eliminated while
still being able to run the workloads 14 safely, i.e. within the parameters of
the analysis.
[0037] The fully loaded utilization 24 can be advantageously used for
estimating how much
capacity can be eliminated from the computing environment 8, and thus how much
cost can be
reduced. The fully loaded utilization 24 can also be advantageously used for
comparing
different environments and for setting targets for increasing efficiency over
time.
[0038] FIG. 2 illustrates an example configuration for the analysis system
10. It will be
appreciated that the example shown in FIG. 2 is for illustrative purposes only
and the analysis
system 10 may therefore be configured in various other ways in order to
operate according to
the principles herein discussed. In this example configuration, the analysis
system 10
comprises a data acquisition interface 28 that that obtains the system data 16
and workload
data 18. The data acquisition interface 28 may comprise, for example, a data
connection/network connection (wired or wireless), data bus, disk drive, etc.,
i.e. any suitable
interface 28 capable of enabling the analysis system 10 to obtain electronic
data. The analysis
system 10 also comprises a user interface 27 to allow users to specify one or
more sets of
analysis parameters 20.
[0039] The analysis system 10 comprises an analysis engine 26 which, in
this example, is
used to perform analyses in order to generate the fully loaded utilization 24,
which can be
provided as an output via an output interface 30. The analysis engine 26 may
be used to
incorporate one or more fully loaded utilizations 24 computations based for
multiple scenarios
defined through the various sets of analysis parameters 20 into an output such
as a series of
charts or other graphical outputs showing particular scenarios or comparing
outcomes. The
output interface 30 may also comprise a data/network connection (wired or
wireless), data bus,
disk drive, etc. as well as a user interface such as a display. As such, it
can be appreciated that
the output interface 30 may be the same as the data acquisition interface 28
or otherwise share
components and/or capabilities in order to provide a suitable output type. The
data acquisition
22057261.6
- 6 -

CA 02723511 2010-12-03
interface 28 and output interface 30 are therefore shown separately in FIG. 2
for illustrative
purposes only.
[0040] The electronic data that is obtained by the analysis engine 26 via
the data acquisition
interface 28 in this example, is provided to a data storage 32. The data
storage 32 may be a
component of the analysis system 10 or otherwise accessible thereto as shown
in FIG. 2. The
data storage 32 may store the electronic data in its native format or in the
format it is provided to
or obtained by the data acquisition interface 28. The data storage 32 may also
store the
electronic data in any other suitable or desired format. For ease of
illustration, the various
exemplary types of electronic data are shown separately and with distinct
reference numerals in
FIG. 2. These include, system data 16, workload data 18, and analysis
parameters 20. As will
be explained below, the system data 16 is used to generate a system data model
34, which is
also stored in the data storage 32 in this example. The workload data 18 is
similarly used to
generate a workload data model 36, which is also stored in the data storage 32
in this example.
The system data model 34 and workload data model 36 pertain to particular sets
of system data
16 and workload data 18 respectively. Therefore, if multiple sets of workload
data 18 and/or
system data 16 are stored in the data storage 32, the data storage 32 may be
organized to
group such sets of data 16, 18 and their respective data models 34, 36 in
order to perform
multiple independent analyzes. The analysis engine 26 comprises a placement
engine 56 for
performing placement analyses, further detail for which will be provided
below. It will be
appreciated that the placement engine 56 may also be distinct from the
analysis engine 26 and
is only shown as part thereof for illustrative purposes.
[0041] As noted above, the fully loaded utilization 24 corresponds to the
ratio of the
theoretical minimum computing system capacity required for the workloads,
relative to the
actual computing system capacity provisioned.
[0042] To compute the fully loaded utilization 24, the ratios of the
computing system
capacities can be expressed using any one of the computing system resources
(e.g. CPU,
memory, disk I/O, network I/O, power consumption, number of virtual
processors, etc.). The
CPU resource is often used when computing this ratio since computing system
capacities, in
particular with servers, are generally equated with their processing
capabilities.
22057261.6
- 7 -

CA 02723511 2010-12-03
[0043] For example, the fully loaded utilization 24 expressed as a ratio of
CPU capacities
can be computed as follows:
[0044] Fully Loaded Utilization cpu = Theoretical Minimum System Capacity
cpu / Actual
System Capacity Provisioned cpu* 100%
[0045] In the above equation, CPU capacities can be expressed using a
common
benchmark unit such as CINT2006 Rate from the Standard Performance Evaluation
Corporation
(SPEC) or Relative Performance Estimates (RPE2) from Ideas International
(IDEAS).
[0046] In another example, the fully loaded utilization 24, expressed as a
ratio of memory
capacities, can be computed as follows:
[0047] Fully Loaded Utilization mem = Theoretical Minimum System Capacity
mem / Actual
System Capacity Provisioned mem * 100%
[0048] In the above equation, the memory capacities can be expressed in a
common unit
such as Kilobytes (KB).
[0049] The theoretical minimum system capacity required by the workloads 14
is
determined by placing the workloads 14 onto the minimum system capacity,
subject to the
applicable constraints, assumptions and optimization criteria.
[0050] The optimization criterion for placing the workloads 14 on the
computing systems 12,
can be based on minimizing the total computing system capacity of one or more
of the
resources, such as CPU, memory, I/O bandwidth, power consumption, etc. For
example, a
common criterion is to minimize the processor capacity of the computing
systems 12 required to
host the workloads 14.
[0051] The criteria for placing the workloads 14 can also be based on
minimizing the
capacity of multiple computing system resources. For example, possible
criteria are to minimize
both the processor and memory capacity of the computing systems 12. Numerical
weights can
be used to indicate the relative importance of the optimization criteria.
[0052] As discussed above, the fully loaded utilization analysis requires
the system data 16
and workload data 18, which describe the computing systems 12 and workloads 14
to be
22057261.6
- 8 -

CA 02723511 2010-12-03
analyzed respectively. The data required for the computing systems 12 may
include the system
resource capacities and system capabilities. For each computing system 12, the
resource
capacities can include the following: number of processors, total processor
speed in MHz, total
processor benchmarks, total amount of memory, disk I/O throughput, network I/O
throughput,
total power consumption, number of virtual processors, etc.
[0053] The computing system capabilities can include such things as: host
operating
system, version and patch level, host compatibility with workload operating
systems, supported
service availability level, to name a few.
[0054] The workload data 18 required for analyzing the one or more
workloads 14 may
include workload utilization and workload requirements. For each workload 14,
historical and/or
projected resource utilization levels are typically used. This particular
portion of the workload
data 18 may be represented using time-series data, hourly quartiles, hourly
averages, etc. (e.g.
see FIGS. 3 and 4 discussed above).
[0055] Examples of resource utilization data can include: CPU utilization
in percent or MHz,
memory utilization (MB), disk I/O activity (e.g. bytes/sec), network I/O
activity (e.g. bytes/sec),
etc. For each workload 14, the following requirements and characteristics may
also be defined:
[0056] Resource allocations, reservations, shares ¨ e.g. number of virtual
processors,
processor resource reservations in MHz, processor shares based on relative
weights, memory
reservations in MB.
[0057] Performance and availability requirements ¨ e.g. service level
agreements, priorities,
etc.
[0058] Compatibility with computing systems 12 ¨ whether workload 14 is
supported to run
on the particular computing systems 12.
[0059] Compatibility with other workloads 14 ¨ whether workloads 14 can be
placed on the
same computing system 12 ¨ e.g. workloads 14 with different security
requirements should
typically not run on the same computing system 12 (in particular where the
computing system
12 is a server).
22057261.6
- 9 -

CA 02723511 2017-01-05
[0060] FIG. 5 illustrates an example set of computer executable operations
for computing
the fully loaded utilization 24 using the analysis engine 26. At 50, the
resource type (e.g. CPU)
that is to be used for the fully loaded utilization analysis is identified,
typically from the analysis
parameters 20 as shown using the dashed line in FIG. 5. The scope of the
computing systems
12 and workloads 14 (e.g. servers) to be included in the analysis is also
determined by
obtaining the system data 16 and workload data 18. The assumptions,
constraints, and
optimization criteria (i.e. analysis parameters 20), if applicable, are also
defined, determined, or
otherwise obtained. The assumptions, constraints, and optimization criteria
govern how the
workloads 14 are to be placed onto the computing systems 12 as discussed
above.
[0061] A placement analysis 70 is then performed using a placement engine
56 according
to the system scope determined using the system data 16 and workload data 18,
and the
analysis parameters 20. Further detail of an example placement analysis 70
using the
placement engine 56 is shown in FIG. 6. The placement analysis 70 is performed
in order to
determine the theoretical minimum system capacity required 58 to accommodate
the workloads
14 in the analysis (see both FIGS. 5 and 6).
[0062] Using the minimum computing systems 12 required to host the
workloads 14 at 58,
and according to the resource type specified at 50, the total capacity for the
minimum
computing systems 12 required can be computed at 60, which provides the total
capacity of the
minimum computing systems 12 that are required at 62.
[0063] Meanwhile, the total capacity for the provisioned computing systems
12 is computed
at 64 according to the resource type determined at 50. The total capacity of
the provisioned
computing system 12, provided at 66, along with the total capacity of the
minimum computing
systems 12 required determined at 62, are then used at 68 to compute the fully
loaded
utilization 24. As discussed above, the fully loaded utilization 24
corresponds to the ratio of the
theoretical minimum computing system capacity 62 relative to the total
computing system
capacity 62.
[0064] As shown in FIG. 6, the placement analysis 70 determines the
theoretical minimum
system capacity required by the workloads 14 in accordance with the analysis
parameters 20,
e.g. the analysis assumptions, constraints and optimization criteria. The
placement analysis 70
in one embodiment, seeks to optimize the total system capacity required for
the workloads 14
- 10 -
22996643.1

CA 02723511 2017-01-05
through a trial-and-error algorithm that evaluates various combinations
workload placements
onto the provisioned computing systems 12. Example algorithms for evaluating
various
combinations may be found in co-owned U.S. Patent Application No. 11/738,936
filed April 23,
2007 and entitled: "Method and System for Determining Compatibility of
Computer Systems",
now U.S. Patent No. 8,793,679. These placements are subject to the analysis
assumptions
and constraints.
[0065] The placement analysis 70 can employ a variety of methods to rate
the validity of the
placement of the workloads 14 on the systems 12 by ensuring that they do not
violate the
constraints and assumptions defined through the analysis parameters 20. The
combined
resource utilizations of the workloads 14 can be modeled on the systems 12 and
checked to
ensure that they do not exceed a specified threshold (e.g. 80% of the server
capacity). The
resource utilization model can incorporate specified growth assumptions. The
resource
utilizations can be modeled and assessed on the basis of any combination of
the peak
demands, average values, etc. Other constraints such as ensuring that certain
workloads 14
not running on the same system 12 can be enforced through affinity rules
processed by a rule
engine. Similarly, constraints such as ensuring the certain workloads 14 be
placed on systems
12 with specific characteristics can be enforced through compatibility rules.
It will be
appreciated that the above is only one example and various other methods may
be employed
according to the principles discussed herein.
[0066] In cases where at 74 it is determined that the existing workloads 14
cannot be
placed on the provisioned systems 12 due to the analysis constraints and
assumptions,
additional systems 12 can be introduced to host the unplaced workloads 14. The
additional
systems 12 can be based on the existing system configurations or other
hypothetical system
configurations as defined through the analysis parameters 20. As such, if not
all workloads 14
can be placed at 74, the placement engine 56 can determine the hypothetical
systems to add at
76. If it is determined at 78 that suitable hypothetical systems can be found,
the additional
hypothetical systems at 72 are provided to the placement analysis 70 to
perform an additional
analysis. Depending on the analysis parameters, the placement analysis may
incrementally
place the unplaced workloads on the additional systems, or redo the placement
analysis for all
workloads onto all the systems in scope.
-11-
22996643.1

CA 02723511 2010-12-03
[0067] In addition to the hypothetical system configurations, analysis
parameters 20 can
specify the maximum quantity and/or order of precedence when selecting the
systems 20 to be
added to the analysis scope. This models the notion that some systems 12 may
be preferred
over others due to availability, costs, etc. Like existing systems 12, the
placements of
workloads 12 on these additional systems 12 are subject to the analysis
assumptions,
constraints and optimization criteria.
[0068] In the event that one or more workloads 14 cannot be placed, even
with the
additional systems 12 determined at 76, the fully loaded utilization value 24
is considered
undefined, indicating that a suitable hypothetical system cannot be found at
78 and an output
indicating that no valid answer has been found provided at 80.
[0069] Example analysis assumptions include: model high priority workloads
14 based on
their peak utilization levels, model medium priority workloads 14 based on
their sustained (3rd
quartile) utilization levels, having to include an annualized growth of 10%
for the utilization levels
of all workloads 14, having to include system capacity to handle N-system
failures, the order of
precedence for selecting systems 12 when placing workloads 14.
[0070] In the event that the existing workloads 14 do not fit on the
provisioned systems 12,
analysis assumptions define how additional system capacity can be added to the
analysis. The
assumptions include the hypothetical system configurations to be used for
additional system
capacity and the maximum quantity of each type of system 12 that may be added
The
hypothetical system configurations may be based on provisioned systems 12 as
well as
systems 12 not found in the provisioned environment 8.
[0071] Example analysis constraints include: the total of the hourly
sustained CPU utilization
of the workloads 14 placed on the system 12 being required to not exceed 70%
of system
processor capacity, the total peak memory utilization of the workloads placed
on the system 12
being required to not exceed 90% of the system memory capacity, the total
number of virtual
processors of the workloads 14 placed on the system 12 relative to the total
physical processor
cores of the system 12 being required to not exceed a ratio of 10 to 1, the
total memory
reservations of the workloads 14 being required to not exceed 90% of the total
memory installed
on the computing system 12, ensuring that workloads 14 belonging to different
security zones
are not placed on the same computing system 12, etc.
22057261.6
- 12 -

CA 02723511 2010-12-03
[0072] The fully loaded utilization metric 24 may be used in various ways
as noted above.
Several example uses of the fully loaded utilization 24 will now be described.
It can be
appreciated that various other uses can apply.
[0073] One example use comprises comparing the fully loaded utilization 24
for multiple
"what if" scenarios for the same set of servers and workloads. In this
example, assuming peak
loads, one may include a high availability capacity for failure of 10% of
computing systems 12,
and a 5% growth in workloads 14 associated with a specific application. The
"what if" may be
that on top of these above assumptions and constraints, a constraint may wish
to be added
which specifies not mixing workloads with different security zones on the same
computing
system 12. If this occurs, the analysis aims to determine: What is the
incremental change to the
fully loaded utilization 24?
[0074] It may be noted that one can also perform what if analyses where a
constraint or
assumption is removed ¨ e.g. do not assume 5% growth in workloads.
[0075] Another example use comprises comparing the fully loaded utilization
24 for multiple
time periods (e.g. daily) for the same computing environment 8 (e.g. data
center, cluster). In
this example, the assumptions and constraints are held constant, but the
utilization levels of the
workloads 14, the actual number of workloads 14 running and computing systems
12 being
provisioned can change over time. The results can be visualized using an x-y
chart as shown in
FIG. 6, wherein the y-axis is the fully loaded utilization 24, and the x-axis
is a time scale.
[0076] Yet another example use comprises combining multiple what if
scenarios with the
multiple time periods. In this example, the fully loaded utilization 24 is
computed for the multiple
what if scenarios over multiple time periods. The results can be visualized
using stacked x-y
charts (e.g. as illustrated in FIG. 6), wherein the y-axis is the fully loaded
utilization 24 for
multiple scenarios, and the x-axis is a time scale.
[0077] Yet another example use comprises adding hypothetical workloads to
evaluate a
what if scenario for new workloads to be introduced into the computing
environment 8. In this
example, new workloads 14 can be added to the analysis scope to determine the
actual amount
of computing system capacity required to host the workloads 14.
22057261.6
- 13-

CA 02723511 2010-12-03
[0078] An example scenario will now be described, wherein the computing
environment 8
comprises identical computing systems 12, e.g. identical servers 12'.
[0079] Consider an environment comprised of 1000 workloads running on 100
servers with
identical resource capacities. Assume that there is one week (7 days) of
workload utilization
data 18 available. As such, the fully loaded utilization 24 can be computed
for the computing
environment 8 for each day of the week.
[0080] In addition, it may be assumed that the fully loaded utilization 24
is to be computed
for two scenarios. In the first scenario, the theoretical minimum server
capacity required may be
determined based on the daily peak utilization levels of the workloads 14. In
the second
scenario, the theoretical minimum server capacity required may be determined
based on the
daily peak utilization levels of the workloads 14 while also applying the
business policies. The
specific business policies in this example are that the workloads 14 with
different security zone
designations are to run on separate servers 12' and that there is needed
sufficient capacity to
handle the failure of 2% of the servers 12'. Thus, the fully loaded
utilization 24 can be computed
for each day of the week and for each scenario, generating fourteen (14) fully
loaded utilization
results.
[0081] For example, assume that based on the peak utilization levels of the
workloads 14
on Monday, the theoretical minimum servers required is thirty (30). And since
the servers have
identical resource capacities and there are 100 servers provisioned, the fully
loaded utilization is
30 / 100 * 100% = 30%. Next, assume that based on the peak utilization levels
of the workloads
14 on Monday and the business policies defined previously, the theoretical
minimum number of
servers 12' is computed to be thirty six (36). The fully loaded utilization 24
for this scenario is
36%. These computations can then be performed for the remaining days of the
week (Tuesday
to Sunday). The results are summarized in FIG. 7.
[0082] It may be noted that because the servers 12' are identical with
respect to server
resource capacities (i.e. same CPU benchmarks, memory, etc.), the fully loaded
utilization
metrics based on CPU capacity, memory, etc. are the same.
[0083] The fully loaded utilization analysis 24 can also be applied to
environments 8
comprising computing systems 12 (e.g. servers 12' in the following example),
with different
resource capacities.
22057261.6
- 14 -

CA 02723511 2010-12-03
[0084] For these cases, the placement analysis considers these differences
between the
servers 12' when placing the workloads 14 on the servers 12'. Furthermore, the
optimization
criteria which defined the resource type (e.g. CPU and/or memory) whose server
capacity is to
be optimized will affect the desired placements.
[0085] For example, consider an environment where the servers 12' have
different CPU and
memory capacities. Two servers 12' have CPU benchmarks and memory capacities
of 5000
RPE2 and 32 GB of memory, respectively. Two other servers 12' have CPU
benchmarks and
memory capacities of 7000 RPE2 and 16 GB of memory.
[0086] Assuming the workloads 14 will fit on any combination having two (2)
of the servers
12', the desired placements will vary depending upon the optimization
criteria.
[0087] For example, if the criterion is to minimize the memory capacity,
the servers 12' with
16 GB of memory will be used in this example. The fully loaded utilization 24
based on memory
capacity would be (16 + 16) / (32 + 32 + 16 + 16)* 100% = 33%.
[0088] If the criterion is to minimize the CPU capacity, the servers 12'
with RPE2 CPU
benchmarks of 5000 will be used in this example. The fully loaded utilization
based on CPU
capacity would be (5000 + 5000) / (5000 + 5000 + 7000 + 7000)* 100% = 42%.
[0089] It will be appreciated that any module or component exemplified
herein that executes
instructions may include or otherwise have access to computer readable media
such as storage
media, computer storage media, or data storage devices (removable and/or non-
removable)
such as, for example, magnetic disks, optical disks, or tape. Computer storage
media may
include volatile and non-volatile, removable and non-removable media
implemented in any
method or technology for storage of information, such as computer readable
instructions, data
structures, program modules, or other data. Examples of computer storage media
include
RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital
versatile
disks (DVD) or other optical storage, magnetic cassettes, magnetic tape,
magnetic disk storage
or other magnetic storage devices, or any other medium which can be used to
store the desired
information and which can be accessed by an application, module, or both. Any
such computer
storage media may be part of the analysis engine 10, or accessible or
connectable thereto. Any
application or module herein described may be implemented using computer
220572616
- 15-

CA 02723511 2010-12-03
readable/executable instructions that may be stored or otherwise held by such
computer
readable media.
[0090] Although the above has been described with reference to certain
specific
embodiments, various modifications thereof will be apparent to those skilled
in the art without
departing from the scope of the claims appended hereto.
22057261.6
- 16 -

Representative Drawing

A single figure which represents the drawing illustrating the invention.

Administrative Status

2024-08-01:As part of the Next Generation Patents (NGP) transition, the Canadian Patents Database (CPD) now contains a more detailed Event History, which replicates the Event Log of our new back-office solution.

Please note that "Inactive:" events refers to events no longer in use in our new back-office solution.

For a clearer understanding of the status of the application/patent presented on this page, the site Disclaimer , as well as the definitions for Patent , Event History , Maintenance Fee and Payment History should be consulted.

Event History

Description	Date
Inactive: Associate patent agent removed	2021-07-07
Maintenance Fee Payment Determined Compliant	2021-02-01
Inactive: Late MF processed	2021-02-01
Letter Sent	2020-12-03
Change of Address or Method of Correspondence Request Received	2020-10-23
Letter Sent	2020-07-16
Letter Sent	2020-07-16
Revocation of Agent Requirements Determined Compliant	2020-07-15
Appointment of Agent Requirements Determined Compliant	2020-07-15
Appointment of Agent Request	2020-06-10
Revocation of Agent Request	2020-06-10
Inactive: Associate patent agent added	2020-04-29
Appointment of Agent Request	2020-03-17
Revocation of Agent Requirements Determined Compliant	2020-03-17
Appointment of Agent Requirements Determined Compliant	2020-03-17
Revocation of Agent Request	2020-03-17
Common Representative Appointed	2019-10-30
Common Representative Appointed	2019-10-30
Grant by Issuance	2018-02-20
Inactive: Cover page published	2018-02-19
Pre-grant	2018-01-04
Inactive: Final fee received	2018-01-04
Letter Sent	2017-07-07
Notice of Allowance is Issued	2017-07-07
Notice of Allowance is Issued	2017-07-07
Inactive: Approved for allowance (AFA)	2017-06-30
Inactive: QS passed	2017-06-30
Amendment Received - Voluntary Amendment	2017-01-05
Inactive: Report - No QC	2016-09-15
Inactive: S.29 Rules - Examiner requisition	2016-09-15
Inactive: S.30(2) Rules - Examiner requisition	2016-09-15
Letter Sent	2016-04-05
Letter Sent	2015-11-16
All Requirements for Examination Determined Compliant	2015-11-05
Request for Examination Requirements Determined Compliant	2015-11-05
Request for Examination Received	2015-11-05
Inactive: Cover page published	2012-06-03
Application Published (Open to Public Inspection)	2012-06-03
Letter Sent	2011-04-29
Inactive: Single transfer	2011-04-05
Inactive: First IPC assigned	2011-01-25
Inactive: IPC assigned	2011-01-25
Inactive: Filing certificate - No RFE (English)	2010-12-22
Filing Requirements Determined Compliant	2010-12-22
Application Received - Regular National	2010-12-22

Abandonment History

There is no abandonment history.

Maintenance Fee

The last payment was received on 2017-11-13

Note : If the full payment has not been received on or before the date indicated, a further fee may be required which may be one of the following

the reinstatement fee;
the late payment fee; or
additional fee to reverse deemed expiry.

Please refer to the CIPO Patent Fees web page to see all current fee amounts.

Fee History

Fee Type	Anniversary Year	Due Date	Paid Date
Application fee - standard			2010-12-03
Registration of a document			2011-04-05
MF (application, 2nd anniv.) - standard	02	2012-12-03	2012-11-22
MF (application, 3rd anniv.) - standard	03	2013-12-03	2013-11-25
MF (application, 4th anniv.) - standard	04	2014-12-03	2014-11-18
MF (application, 5th anniv.) - standard	05	2015-12-03	2015-11-04
Request for examination - standard			2015-11-05
Registration of a document			2016-03-23
MF (application, 6th anniv.) - standard	06	2016-12-05	2016-11-02
MF (application, 7th anniv.) - standard	07	2017-12-04	2017-11-13
Final fee - standard			2018-01-04
MF (patent, 8th anniv.) - standard		2018-12-03	2018-11-19
MF (patent, 9th anniv.) - standard		2019-12-03	2019-10-08
Registration of a document			2020-06-30
Late fee (ss. 46(2) of the Act)		2021-02-01	2021-02-01
MF (patent, 10th anniv.) - standard		2020-12-03	2021-02-01
MF (patent, 11th anniv.) - standard		2021-12-03	2021-11-17
MF (patent, 12th anniv.) - standard		2022-12-05	2022-11-22
MF (patent, 13th anniv.) - standard		2023-12-04	2023-11-22

Owners on Record

Note: Records showing the ownership history in alphabetical order.

Current Owners on Record
CIRBA IP INC.

Past Owners on Record
ANDREW D. HILLIER
GERALD W. SMITH
TOM S. YUYITUNG

Past Owners that do not appear in the "Owners on Record" listing will appear in other documentation within the application.

Documents

To view selected files, please enter reCAPTCHA code :

To view images, click a link in the Document Description column. To download the documents, select one or more checkboxes in the first column and then click the "Download Selected in PDF format (Zip Archive)" or the "Download Selected as Single PDF" button.

List of published and non-published patent-specific documents on the CPD .

If you have any difficulty accessing content, you can call the Client Service Centre at 1-866-997-1936 or send them an e-mail at CIPO Client Service Centre.

Filter

Download Selected in PDF format (Zip Archive)

Download Selected as Single PDF

Document Description	Date (yyyy-mm-dd)	Number of pages	Size of Image (KB)
Description	2017-01-05	16	766
Claims	2017-01-05	6	230
Description	2010-12-03	16	770
Claims	2010-12-03	2	73
Drawings	2010-12-03	7	340
Abstract	2010-12-03	1	24
Representative drawing	2011-10-27	1	6
Cover Page	2012-05-31	2	45
Representative drawing	2018-01-24	1	6
Cover Page	2018-01-24	1	40
Filing Certificate (English)	2010-12-22	1	157
Courtesy - Certificate of registration (related document(s))	2011-04-29	1	104
Reminder of maintenance fee due	2012-08-06	1	111
Reminder - Request for Examination	2015-08-04	1	116
Acknowledgement of Request for Examination	2015-11-16	1	188
Commissioner's Notice - Application Found Allowable	2017-07-07	1	161
Courtesy - Acknowledgement of Payment of Maintenance Fee and Late Fee (Patent)	2021-02-01	1	434
Commissioner's Notice - Maintenance Fee for a Patent Not Paid	2021-01-21	1	545
Fees	2012-11-22	1	156
Request for examination	2015-11-05	3	81
Examiner Requisition	2016-09-15	5	379
Amendment / response to report	2017-01-05	19	818
Final fee	2018-01-04	3	76
Maintenance fee payment	2021-02-01	1	28

Language selection

Menus

English Abstract

French Abstract

Event History

Abandonment History

Maintenance Fee

Fee History

Your request is in progress.

Requested information will be available
in a moment.

Thank you for waiting.

Patent 2723511 Summary

English Abstract

French Abstract

Event History

Abandonment History

Maintenance Fee

Fee History

Your request is in progress.Requested information will be availablein a moment.Thank you for waiting.

Your request is in progress.

Requested information will be available
in a moment.

Thank you for waiting.