Patent 3141742 Summary

(12) Patent Application:	(11) CA 3141742
(54) English Title:	DIGITAL DUPLICATE
(54) French Title:	DUPLICATA NUMERIQUE
Status:	Report sent

Bibliographic Data

(51) International Patent Classification (IPC):	G06F 7/76 (2006.01) G06F 16/93 (2019.01) G06F 40/30 (2020.01)
(72) Inventors :	GUHA, SESHADRI (United States of America) SIKKA, VINAY (United States of America) TURLAPATI, SUBBARAO (United States of America)
(73) Owners :	TADA COGNITIVE SOLUTIONS, LLC (United States of America)
(71) Applicants :	TADA COGNITIVE SOLUTIONS, LLC (United States of America)
(74) Agent:	BERESKIN & PARR LLP/S.E.N.C.R.L.,S.R.L.
(74) Associate agent:
(45) Issued:
(86) PCT Filing Date:	2020-05-29
(87) Open to Public Inspection:	2020-12-03
Examination requested:	2022-09-26
Availability of licence:	N/A
(25) Language of filing:	English

Patent Cooperation Treaty (PCT):	Yes
(86) PCT Filing Number:	PCT/US2020/035104
(87) International Publication Number:	WO2020/243420
(85) National Entry:	2021-11-23

(30) Application Priority Data:

Application No.	Country/Territory	Date
16/425,886	United States of America	2019-05-29
16/544,701	United States of America	2019-08-19
16/544,702	United States of America	2019-08-19

Abstracts

English Abstract

Disclosed herein is new approach for structuring an organization's data, involving at a high level establishing a digital context and populating the digital context with digital content to thereby form what is referred to herein as a digital duplicate. In one aspect, the disclosed approach may be embodied in a computer-implemented method that involves: establishing a data structure comprising (i) a structural context that has at least one data component, where each component of the structural context has associated therewith one or more respective data properties (ii) a semantic context that has at least two data types that further describe individual data properties and; and populating underlying data into an instance of the data structure such that underlying data populated into each respective property of the at least one data component has each of the at least two data types of the semantic context.

French Abstract

L'invention concerne une nouvelle approche pour structurer des données d'une organisation, impliquant à un niveau élevé d'établir un contexte numérique et de remplir le contexte numérique avec un contenu numérique pour ainsi former ce que l'on appelle ici un duplicata numérique. Selon un aspect, l'approche de l'invention peut être mise en uvre dans un procédé mis en uvre par ordinateur qui consiste à : établir une structure de données comprenant (i) un contexte structurel qui a au moins un composant de données, chaque composant du contexte structurel ayant associé à celui-ci une ou plusieurs propriétés de données respectives (ii) un contexte sémantique qui a au moins deux types de données qui décrivent en outre des propriétés de données individuelles ; et remplir des données sous-jacentes dans une instance de la structure de données de telle sorte que des données sous-jacentes remplies dans chaque propriété respective du ou des composants de données aient chacun des au moins deux types de données du contexte sémantique.

Claims

Note: Claims are shown in the official language in which they were submitted.

CLAIMS
We claim:
1. A computing system comprising:
a network interface;
at least one processor;
a non-transitory computer-readable medium; and
program instructions stored on the non-transitory computer-readable medium
that are
executable by the at least one processor to cause the computing system to
perform functions
including:
establishing a data structure comprising (i) a semantic context having at
least two
data types, and (ii) a structural context having at least one data component,
where each
component of the structural context has associated therewith one or more
respective data
properties; and
populating underlying data into an instance of the data structure such that
underlying data populated into each respective property of the at least one
data component
has each of the at least two data types of the semantic context.
2. The computing system of claim 1, wherein the at least two data types of
the
semantic context comprise:
a structural data type configured to contain data describing how the
underlying data is
stored within the computing system; and
a semantic data type configured to contain data describing definitional
aspects of the
underlying data.
3. The computing system of claim 2, wherein the definitional aspects of the

underlying data include a primitive data type and at least one instance of
pointer data configured
to identify a location in data storage communicatively coupled to the
computing system that
contains data defining a function for application to the underlying data
having the same semantic
data type.
4. The computing system of claim 1, wherein the at least one data component

comprises a conceptual data component configured to contain data describing a
particular aspect
of an organization.
59

5. The computing device of claim 4, wherein the at least one data component

comprises an associative component configured to contain data describing a
particular aspect of
an organization and a relationship between two or more conceptual data
components.
6. The computing device of claim 1, wherein the program instructions are
further
executable to cause the computing system to perform functions including:
connecting to at least one data source;
ingesting into the computing system from the at least one data source
underlying
data; and
assigning to the underlying data both (i) a semantic data type, and (ii) a
structural
data type, wherein underlying data assigned to the structural data type
includes data that is
configured to describe how the underlying data is stored within the computing
system, and
wherein underlying data assigned to the semantic data type includes data that
is configured
to contain describe definitional aspects of the underlying data.
7. A method comprising:
establishing a data structure comprising (i) a semantic context having at
least two data
types, and (ii) a structural context having at least one data component, where
each component of
the structural context has associated therewith one or more respective data
properties; and
populating underlying data into an instance of the data structure such that
underlying data
populated into each respective property of the at least one data component has
each of the at least
two data types of the semantic context.
8. The method of claim 7, wherein the at least two data types of the
semantic context
compri se:
a structural data type configured to contain data describing how the
underlying data is
stored within the computing system; and
a semantic data type configured to contain data describing definitional
aspects of the
underlying data.
9. The method of claim 8, wherein the definitional aspects of the
underlying data
include a primitive data type and at least one instance of pointer data
configured to identify a
location in data storage communicatively coupled to the computing system that
contains data
defining a function for application to the underlying data having the same
semantic data type.

10. The method of claim 7, wherein the at least one data component
comprises a
conceptual data component configured to contain data describing a particular
aspect of an
organization.
11. The method of claim 10, wherein the at least one data component
comprises an
associative component configured to contain data describing a particular
aspect of an organization
and a relationship between two or more conceptual data components.
12. The method of claim 7, further comprising:
connecting to at least one data source;
ingesting into a computing system from the at least one data source underlying
data; and
assigning to the underlying data both (i) a semantic data type, and (ii) a
structural data
type, wherein underlying data assigned to the structural data type includes
data that is configured
to describe how the underlying data is stored within the computing system, and
wherein
underlying data assigned to the semantic data type includes data that is
configured to contain
describe definitional aspects of the underlying data.
13. A non-transitory computer-readable storage medium having program
instructions
stored thereon that are executable to cause a computing system to:
establish a data structure comprising (i) a semantic context having at least
two data types,
and (ii) a structural context having at least one data component, where each
component of the
structural context has associated therewith one or more respective data
properties; and
populate underlying data into an instance of the data structure such that
underlying data
populated into each respective property of the at least one data component has
each of the at least
two data types of the semantic context.
14. The computer-readable storage medium of claim 13, wherein the at least
two data
types of the semantic context comprise:
a structural data type configured to contain data describing how the
underlying data is
stored within the computing system; and
a semantic data type configured to contain data describing definitional
aspects of the
underlying data.
61

15. The computer-readable storage medium of claim 14, wherein the
definitional
aspects of the underlying data include a primitive data type and at least one
instance of pointer
data configured to identify a location in data storage communicatively coupled
to the computing
system that contains data defining a function for application to the
underlying data having the
same semantic data type.
16. The computer-readable storage medium of claim 13, wherein the at least
one data
component comprises a conceptual data component configured to contain data
describing a
particular aspect of an organization.
17. The computer-readable storage medium of claim 16, wherein the at least
one data
component comprises an associative component configured to contain data
describing a particular
aspect of an organization and a relationship between two or more conceptual
data components.
18. The computer-readable storage medium of claim 13, wherein the program
instructions are further executable to cause the computing system to perform
functions including:
connecting to at least one data source;
ingesting into the computing system from the at least one data source
underlying data; and
assigning to the underlying data both (i) a semantic data type, and (ii) a
structural data
type, wherein underlying data assigned to the structural data type includes
data that is configured
to describe how the underlying data is stored within the computing system, and
wherein
underlying data assigned to the semantic data type includes data that is
configured to contain
describe definitional aspects of the underlying data.
19. The computer-readable storage medium of claim 18, wherein the
definitional
aspects of the underlying data include a primitive data type and at least one
instance of pointer
data configured to identify a location in data storage communicatively coupled
to the computing
system that contains data defining a function for application to the
underlying data having the
same semantic data type.
20. The computer-readable storage medium of claim 19, wherein the at least
one data
component comprises (i) a conceptual data component configured to contain data
describing a
particular aspect of an organization, and (ii) an associative component
configured to contain data
62

describing a particular aspect of an organization and a relationship between
two or more
conceptual data components.
63

Description

Note: Descriptions are shown in the official language in which they were submitted.

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
DIGITAL DUPLICATE
CROSS-REFERENCE TO RELATED APPLICATIONS
[001] This application claims priority to: (1) U.S. non-provisional
application No.
16/425,886, filed on May 29, 2019 and titled "Digital Duplicate"; (2) U.S. non-
provisional
application No. 16/544,701, filed on August 19, 2019 and titled "Processes and
Systems for
Onboarding Data for a Digital Duplicate"; (3) U.S. non-provisional application
No. 16/544,702,
filed on August 19, 2019 and titled "Data Security Using Semantic Services,"
the contents of each
of which are incorporated by reference herein in their entirety.
BACKGROUND
[002] Businesses and other networks have a fundamental need to derive an
understanding
of their business/network at any moment in time, in order to engage in
strategic & operational
decision-making.
OVERVIEW
[003] Today, this need is served by a range of conventional systems for
storing,
manipulating, and accessing data. Such systems are generally limited in their
scope, flexibility,
and ability to integrate with other such systems that exist within a business
or across multiple
businesses.
[004] Part of this limitation arises from these conventional systems for
storing,
manipulating, and accessing data being built around specific business
functions. As examples,
such systems may include a CRM tool, inventory management system, accounting
system,
enterprise resource planning, payroll tool, among other examples. These
systems further suffer
from being confined to engaging in specific user functions (e.g., report
generation and
visualization, data input, etc.) that are associated with those business
functions.
[005] Further, "data warehousing" and "business intelligence" systems tend
to consume
data originating from various sources in a data network, and aggregate and pre-
process that data
to fit a predefined schema or set of dimensions. As a tool, data warehousing
is "rigid" by virtue
of the fact that the dimensions, metrics, aggregation, and delivery models
(e.g., dashboards) for
the data must be pre-defined prior to utilization. In addition, the data
contained within such
systems may also be used for the specialized simulation and modeling of
specific (narrow) areas
of the business (e.g., supply chain modeling, manufacturing planning,
financial modeling &
forecasting, etc.).
[006] Conventional systems - such as relational databases - are
advantageous for vertical
scaling (e.g., expanding a data table of 22 columns to billions of records),
but tend to be rather
limited in terms of horizontal linking and expansion across multiple tables.
1
SUBSTITUTE SHEET (RULE 26)

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[007] In order to address these shortcomings, and to help improve upon
these and other
problems, the present disclosure seeks to reduce fixed relationships between
data tables through
the disclosed digital duplicate data structure, which utilizes a dynamic model
and method that can
be implemented thru a plurality of techniques including dynamic entity
relationships. This allows
for the digital duplicate to ingest, access data, and adapt to an
organization's changes without the
burden of redesigning the data system from the ground up, as may be required
in conventional
data structures and conventional approaches for implementing data storage
systems and data
structures.
[008] From a user standpoint, conventional data structures and conventional
approaches
for implementing data storage systems may allow for data to be accessed in
response to specific
queries as permitted by the foundational design of database structures (e.g.,
based on requirements
analysis and design, as used to design a relational database system). One
drawback to this
approach, however, is that in order to obtain a desired output from the data
storage system (e.g.,
to obtain a desired query result), the user must have a priori knowledge of
the architecture of the
data storage system, including an understanding of the data structures
utilized in the data storage
system. With the approach disclosed herein, there are no such constraints.
Indeed, the digital
duplicate may replicate the real-world physical reality of the existence of
associations between
digital records (data) describing physical assets, events and other phenomena,
and as such may be
configured to provide to users desired outputs without requiring those users
to have a priori
knowledge of the data storage architecture.
[009] In some respects, the disclosed approaches for establishing new data
structures
provide other advantages and efficiencies. As one example, relationships in
the new data
structures can be established using minimal additional logic. Further, data
ingestion occurring
from multiple data sources can, with the benefit of the present approach for
establishing new data
structures, result in data that is efficiently synthesized and arranged in the
established data
structure, helping to ensure it is consistent across an organization's entire
data store. Additionally,
once relationships between data are established, changes in any underlying
data source (e.g.,
changes to the underlying data models or structure used by the data source) do
not require
changing the established relationships.
[010] Thus, in one aspect, disclosed herein is a computer-implemented
method that
involves: establishing a data structure comprising (i) a semantic context that
has at least two data
types, and (ii) a structural context that has at least one data component,
where each component of
the structural context has associated therewith one or more respective data
properties; and
populating underlying data into an instance of the data structure such that
underlying data
2

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
populated into each respective property of the at least one data component has
each of the at least
two data types of the semantic context.
10111 In another aspect, disclosed herein is another computer-
implemented method that
involves: retrieving a data model for a data source, establishing at least one
filter operation or at
least one clean operation that modifies an aspect of the retrieved data model,
onboarding
underlying data from the data source while applying the established at least
one filter operation or
at least one clean operation, defining at least one transformation operation
to apply to a portion of
the underlying data that has been onboarded, and applying the at least one
transformation
operation to the portion of the underlying data to thereby assign the data to
a semantic network,
the semantic network comprising conceptual data components and associative
data components.
[012] In another aspect, disclosed herein is another computer-implemented
method that
involves: receiving an indication of an instance of a semantic network, the
semantic network
comprising conceptual data components and associative data components,
receiving a selection
of one or more of the conceptual data components and associative data
components of the instance
of the semantic network, the selection comprising an indication to at least
(i) block the selected
one or more conceptual data components and associative data components or (ii)
selectively filter
the selected one or more conceptual data components and associative data
components, and
presenting a visualization of the semantic network, wherein the visualization
is configured to (i)
exclude data related to the selected one or more conceptual data components
and associative data
components or (ii) include data related to the selected one or more conceptual
data components
and associative data components and exclude data not related to the selected
one or more
conceptual data components and associative data components.
[013] In another aspect, disclosed herein is a computing system that
comprises at least
one processor, a non-transitory computer-readable medium, and program
instructions stored on
the non-transitory computer-readable medium that are executable by the at
least one processor to
cause the computing system to carry out the operations disclosed herein,
including but not limited
to the operations of the foregoing methods.
[014] In yet another aspect, disclosed herein is a non-transitory computer-
readable
medium comprising program instructions that are executable to cause a
computing system to carry
out the operations disclosed herein, including but not limited to the
operations of the foregoing
methods.
[015] One of ordinary skill in the art will appreciate these as well as
numerous other
aspects in reading the following disclosure.
3

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
BRIEF DESCRIPTION OF THE DRAWINGS
[016] FIG. 1A depicts an example high-level functional arrangement in which
example
embodiments may be implemented.
[017] FIG. 1B depicts an example network architecture in which example
embodiments
may be implemented.
[018] FIG. 2 depicts a simplified block diagram of an example computing
device in
which example embodiments may be implemented.
[019] FIG. 3 depicts a simplified block diagram of some example data
structures
according to example embodiments.
[020] FIG. 4 is an example output produced by a device executing one
embodiment of a
software tool according to the present disclosure.
[021] FIG. 5 is a flow diagram depicting example operations that may be
carried out in
accordance with one or more embodiments of the present disclosure.
[022] FIG. 6 is a flow diagram depicting example operations that may be
carried out in
accordance with one or more embodiments of the present disclosure.
[023] FIG. 7 is a flow diagram depicting example operations that may be
carried out in
accordance with one or more embodiments of the present disclosure.
[024] FIG. 8 is an example output produced by a device executing one
embodiment of a
software tool according to the present disclosure.
[025] FIG. 9 is an example output produced by a device executing one
embodiment of a
software tool according to the present disclosure.
[026] FIG. 10 is an example output produced by a device executing one
embodiment of
a software tool according to the present disclosure.
[027] FIG. 11A is a flow diagram depicting example operations that may be
carried out
in accordance with one or more embodiments of the present disclosure.
[028] FIG. 11B is a flow diagram depicting example operations that may be
carried out
in accordance with one or more embodiments of the present disclosure.
[029] FIG. 12A is an example output produced by a device executing one
embodiment
of a software tool according to the present disclosure.
[030] FIG. 12B is an example output produced by a device executing one
embodiment
of a software tool according to the present disclosure.
[031] FIG. 13 is an example output produced by a device executing one
embodiment of
a software tool according to the present disclosure.
4

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[032] FIG. 14A is an example output produced by a device executing one
embodiment
of a software tool according to the present disclosure.
[033] FIG. 14B is an example output produced by a device executing one
embodiment
of a software tool according to the present disclosure.
[034] FIG. 15A is an example output produced by a device executing one
embodiment
of a software tool according to the present disclosure.
[035] FIG. 15B is an example output produced by a device executing one
embodiment
of a software tool according to the present disclosure.
[036] FIG. 16A is an example output produced by a device executing one
embodiment
of a software tool according to the present disclosure.
[037] FIG. 16B is an example output produced by a device executing one
embodiment
of a software tool according to the present disclosure.

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
DETAILED DESCRIPTION
[038] The following disclosure references the accompanying figures and
several example
embodiments. One of ordinary skill in the art should understand that such
references are for the
purpose of explanation only and are therefore not meant to be limiting. Part
or all of the disclosed
systems, devices, and methods may be rearranged, combined, added to, and/or
removed in a
variety of manners, each of which is contemplated herein.
[039] The present disclosure is generally directed to technology for
implementing data
security operations in the context of semantic networks. In particular, a
"digital duplicate" may
utilize a semantic network that represents an organization's business
operations offering a unique
set of advantages over conventional systems. Specifically, by building a
digital duplicate using a
new data structure based on the neuro-synaptic model through which humans
combine and use
information in the brain, the digital duplicate may facilitate a more
efficient and dynamic means
of storing, retrieving, searching, securing, navigating, and synthesizing the
data associated with
the business or other network.
[040] When the digital duplicate is populated with the data (embodied as
digital content),
the digital duplicate may allow for the data to be contextualized in a way
that benefits from the
efficiencies realized by human cognition. Furthermore, the data may originate
from a plurality of
sources (e.g., conventional data stores or warehouses) and may be unified
and/or aggregated from
those distributed sources into the context provided by the digital duplicate.
[041] The disclosed system may be built in network-form, making large-scale

multidimensional nodes, associations, and properties of many different data
sources and types
lightweight in comparison with conventional systems. Notably, conventional
systems, such as the
semantic web, do not provide for associations to be formed automatically based
on semantic
alignment between two or more pieces of data. As disclosed, the present
architecture employs a
semantic data type, among other properties and property types, which allows
for associations to
be formed between different data from their shared semantic context,
automatically, without the
association having to be programmed into the system (as it may otherwise be in
existing systems,
such as those that utilize "triplet" form, like OWL, RDF, etc.). Accordingly,
the present disclosure
provides a technique that invention allows for rules, logic and associations
to be established and
utilized around stored data without the need for programmatic logic.
[042] In addition, the introduction of the semantic data type allows for
semantically-
identical information to be correlated even when different language is used by
different users
across a network or networks to describe that same information. This ability
to correlate
6

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
information by its semantics enables a wealth of novel functionality relating
to data consumption,
processing, association, manipulation and use, among others.
I. EXAMPLE SYSTEM ARCHITECTURE
[043] Turning now to the figures, FIG. 1A depicts a high-level arrangement
100 of some
of the functional components that may be involved in establishing, navigating
through, and
facilitating data security operations for a digital duplicate. In one example,
three different tools
may be used to establish and navigate through various parts of a digital
duplicate 105, namely a
designer tool 102, an architect tool 103, and an organizer tool 104, among
other possible tools. At
a high level, the architect tool 103 may be used to establish what is referred
to herein as a "digital
context," which can be thought of as the framework that replicates the
language of a business.
More particularly, but still by way of example, the architect tool 103 may be
used to establish a
"semantic network" 108 that relates the terminology and conceptual meanings
behind the data
collected and stored by an organization, such as various terms, metrics, key
performance
indicators, etc. that will be used within the digital replica of the business.
As will be described
further herein, the semantic network 108 may be a dynamic network of various
data structures
that are linked together, which replaces the typical relational data model of
rows and columns
contained within disparate databases, which provides cross-functional
visibility. A semantic
network 108 may comprise nodes, links, and properties that represent core-
business elements, and
is the foundation of the digital context.
[044] A designer tool 102 may be used to introduce business logic into the
semantic
network by creating "insights" 107 that traverse the network through one or
more "pathways."
The insights 107 may then be used as a basis for information and
visualizations provided to end
users in one or more forms. The insights 107 may be created at the semantic
level, and may thus
be abstracted away from underlying source data 106.
[045] An organizer tool 104 may be used to make a connection between the
semantic
network 108 and the organization's underlying data stores 106 (which, as
depicted, may span
across multiple disparate traditional databases or other data warehouses).
This functionality may,
in some embodiments, include functionality to link multiple data sources to
the semantic network
108, as well as onboard the underlying data from the organization's underlying
data stores 106 to
the organizer data store 109 and ultimately into the semantic network 108
after filtering, cleaning,
transforming, and/or validating the data as desired. These actions may serve
to provide the system
with what is referred to as "digital content," which together with the
"digital context" form what
is referred to as a "digital duplicate."
7

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[046] Another tool may be used to implement data security operations for
semantic
networks referred to as an admin tool. In particular, and as described further
herein below, an
admin tool may facilitate the creation of subsets of a semantic network such
that visualizations of
the subset of the semantic network restrict or hide portions of the semantic
network not included
in the subset. Additionally, the admin tool may facilitate establishing blocks
and/or filters for a
semantic network, such that visualizations of the semantic network either
block selected portions
of the semantic network and/or only show the selected portions of the semantic
network. Such
operations may be applicable when it is desired to present visualizations of
the semantic network
to particular users but restrict from these users view certain portions of the
semantic network,
including the underlying data (i.e., the digital content) assigned to the
semantic network. As
mentioned, this functionality may be embodied in an admin tool, which may in
turn be embodied
as a software tool configured to be executed by the example system
architecture described further
herein below.
[047] Turning now to FIG. 1B, depicted herein is an example network
configuration 110
in which example embodiments of the present disclosure may be implemented. As
shown in FIG.
1B, network configuration 110 includes a back-end platform 112 that may be
communicatively
coupled to one or more client stations, depicted here, for the sake of
discussion, as client stations
113.
[048] Broadly speaking, back-end platform 112 may comprise one or more
computing
systems that have been provisioned with software for carrying out one or more
of the functions
disclosed herein, including but not limited to establishing a digital context
and ingesting data to
form a digital duplicate. The one or more computing systems of back-end
platform 112 may take
various forms and be arranged in various manners.
[049] For instance, as one possibility, back-end platform 112 may comprise
a computing
infrastructure of a public, private, and/or hybrid cloud (e.g., computing
and/or storage clusters)
that has been provisioned with software for carrying out one or more of the
functions disclosed
herein. In this respect, an entity that owns and operates back-end platform
112 may either supply
its own cloud infrastructure or may obtain the cloud infrastructure from a
third-party provider of
"on demand" computing resources, such as Amazon Web Services (AWS) or the
like. As another
possibility, back-end platform 112 may comprise one or more dedicated servers
that have been
provisioned with software for carrying out one or more of the functions
disclosed herein. Other
implementations of back-end platform 112 are possible as well.
[050] In turn, client stations 113 may each be any computing device that is
capable of
running the front-end software disclosed herein. In this respect, client
stations 113 may each
8

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
include hardware components such as a processor, data storage, a user
interface, and a network
interface, among others, as well as software components that facilitate the
client station's ability
to run the front-end software disclosed herein (e.g., operating system
software, web browser
software, etc.). As representative examples, client stations 113 may each take
the form of a desktop
computer, a laptop, a netbook, a tablet, a smartphone, and/or a personal
digital assistant (PDA),
among other possibilities.
[051] As further depicted in FIG. 1B, back-end platform 112 is configured
to interact
with client stations 113 over respective communication paths 111. In this
respect, each
communication path 111 between back-end platform 112 and one of client
stations 113 may
generally comprise one or more communication networks and/or communications
links, which
may take any of various forms. For instance, each respective communication
path 111 with back-
end platform 112 may include any one or more of point-to-point links, Personal
Area Networks
(PANs), Local-Area Networks (LANs), Wide-Area Networks (WANs) such as the
Internet or
cellular networks, cloud networks, and/or operational technology (OT)
networks, among other
possibilities. Further, the communication networks and/or links that make up
each respective
communication path 111 with back-end platform 112 may be wireless, wired, or
some
combination thereof, and may carry data according to any of various different
communication
protocols. Although not shown, the respective communication paths 111 between
client stations
113 and back-end platform 112 may also include one or more intermediate
systems. For example,
it is possible that back-end platform 112 may communicate with a given client
station 113 via one
or more intermediary systems, such as a host server (not shown). Many other
configurations are
also possible.
[052] The interaction between client stations 113 and back-end platform 112
may take
various forms. As one possibility, client stations 113 may send certain user
input related to a digital
duplicate to back-end platform 112, which may in turn trigger back-end
platform 112 to take one
or more actions based on the user input. As another possibility, client
stations 113 may send a
request to back-end platform 112 for certain data and/or a certain front-end
software module, and
client stations 113 may then receive digital duplicate data (and perhaps
related instructions) from
back-end platform 112 in response to such a request. As yet another
possibility, back-end platform
112 may be configured to "push" certain types of digital duplicate data to
client stations 113, in
which case client stations 113 may receive digital duplicate data (and perhaps
related instructions)
from back-end platform 112 in this manner. As still another possibility, back-
end platform 112
may be configured to make certain types of digital duplicate data available
via an API, a service,
or the like, in which case client stations 113 may receive data from back-end
platform 112 by
9

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
accessing such an API or subscribing to such a service. The interaction
between client stations
113 and back-end platform 112 may take various other forms as well.
[053] As also shown in FIG. 1B, back-end platform 112 may also be
configured to
communicate with one or more data sources 114, such as external databases,
internal databases,
and/or another back-end platform or platforms. Such data sources ¨ and the
data output by such
data sources ¨ may take various forms. Further, back-end platform 112 and the
one or more
external data sources 114 may be configured to interact over a communication
path 111, which
may take the form or forms discussed above with respect to the other
communication paths 111.
[054] It should be understood that network configuration 110 is one example
of a
network configuration in which embodiments described herein may be
implemented. Numerous
other arrangements are possible and contemplated herein. For instance, other
network
configurations may include additional components not pictured and/or more or
less of the pictured
components.
EXAMPLE COMPUTING DEVICE
[055] FIG. 2 is a simplified block diagram illustrating some structural
components that
may be included in an example computing device 200, which could serve as, for
instance, the
back-end platform 112 and/or one or more of client stations 113 in FIG. 1B. In
line with the
discussion above, computing device 200 may generally include at least a
processor 202, data
storage 204, and a communication interface 206, all of which may be
communicatively linked by
a communication link 208 that may take the form of a system bus or some other
connection
mechanism.
[056] Processor 202 may comprise one or more processor components, such as
general-
purpose processors (e.g., a single- or multi-core microprocessor), special-
purpose processors (e.g.,
an application-specific integrated circuit or digital-signal processor),
programmable logic devices
(e.g., a field programmable gate array), controllers (e.g., microcontrollers),
and/or any other
processor components now known or later developed. In line with the discussion
above, it should
also be understood that processor 202 could comprise processing components
that are distributed
across a plurality of physical computing devices connected via a network, such
as a computing
cluster of a public, private, or hybrid cloud.
[057] In turn, data storage 204 may comprise one or more non-transitory
computer-
readable storage mediums, examples of which may include volatile storage
mediums such as
random-access memory, registers, cache, etc. and non-volatile storage mediums
such as read-only
memory, a hard-disk drive, a solid-state drive, flash memory, an optical-
storage device, etc. In
line with the discussion above, it should also be understood that data storage
204 may comprise

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
computer-readable storage mediums that are distributed across a plurality of
physical computing
devices connected via a network, such as a storage cluster of a public,
private, or hybrid cloud.
[058] As shown in FIG. 2, data storage 204 may be provisioned with software

components that enable the computing device 200 to carry out the operations
disclosed herein.
These software components may generally take the form of program instructions
that are
executable by the processor 202 to carry out the disclosed functions, which
may be arranged
together into software applications, virtual machines, software development
kits, toolsets, or the
like, all of which are referred to herein as a software tool or software
tools. Further, data storage
204 may be arranged to store data in one or more databases, file systems, or
the like. Data storage
204 may take other forms and/or store data in other manners as well.
[059] Communication interface 206 may be configured to facilitate wireless
and/or wired
communication with other computing devices or systems, such as one or more
client stations 113
when computing device 200 serves as back-end platform 112, or as back-end
platform 112 when
computing device 200 serves as one of client stations 113. As such,
communication interface 206
may take any suitable form for carrying out these functions, examples of which
may include an
Ethernet interface, a serial bus interface (e.g., Firewire, USB 3.0, etc.), a
chipset and antenna
adapted to facilitate wireless communication, and/or any other interface that
provides for wireless
and/or wired communication. Communication interface 206 may also include
multiple
communication interfaces of different types. Other configurations are possible
as well.
[060] Although not shown, computing device 200 may additionally include one
or more
other interfaces that provide connectivity with external user-interface
equipment (sometimes
referred to as "peripherals"), such as a keyboard, a mouse or trackpad, a
display screen, a touch-
sensitive interface, a stylus, a virtual-reality headset, speakers, etc.,
which may allow for direct
user interaction with computing device 200.
[061] It should be understood that computing device 200 is one example of a
computing
device that may be used with the embodiments described herein. Numerous other
arrangements
are possible and contemplated herein. For instance, other computing devices
may include
additional components not pictured and/or more or fewer of the pictured
components.
III. DIGITAL DUPLICATE DATA STRUCTURES
[062] As mentioned, the present disclosure is directed to a new approach
for structuring
an organization's, a business's, or a network's data as well as processes for
implementing data
security operations within this approach, all of which may help to facilitate
more efficient access
to this data. At a high level, this approach involves establishing a digital
context and populating
the digital context with digital content to thereby form what is referred to
herein as a digital
11

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
duplicate. Deploying a digital duplicate in practice includes the high-level
steps of first creating
the digital context, and second adding data to this digital context. The
digital duplicate may be
kept live or refreshed repeatedly over time by continuously updating the
digital context as the
organization's, business's, or network's data changes and the digital content
as the data and the
data sources change. While elements of the digital context and digital content
may change, the
core data structure of the digital duplicate does not typically change,
allowing the information to
remain consistent without having to change the design of the data structure.
[063] FIG. 3 is a simplified block diagram, illustrating an example digital
duplicate data
structure architecture 300 according to an example embodiment of the present
disclosure. At a
high level, and as depicted, digital duplicate data structures 300 may include
a digital context 310
and digital content 320, which together form what is referred to herein as an
instance of a digital
duplicate 301. The data structures 300 also include a registry 302 and a data
store 303. These
various data structures are described herein in further detail.
A. Digital Context
[064] At a more specific level, but still by way of example, FIG. 3 depicts
an example
architecture diagram illustrating certain data structures included within
digital context 310. As
mentioned, digital context 310 is a data structure that generally comprises a
network of individual
data components. This network of data components may include structural
context components
and semantic context components. These components may be stored in data store
as will be
described further herein.
[065] Turning first to the structural context components, these structural
context
components may generally describe how the data is structured and stored in the
digital context. In
one implementation, the structural context components may include conceptual
components 314
(sometimes referred to herein as concepts) and associative components 316
(sometimes referred
to herein as associations). And these components may have one or more
respective properties 315,
317. These components may be designed to hold data that describes various
aspects about how an
organization's information is structured within the digital duplicate 301 as
well as how this
information relates to itself. Although these components are depicted as
blocks in a simplified
block diagram, it should be understood that the underlying data represented by
these blocks may
be stored in an appropriate storage location of data store 303, which may at
time be referred to
herein as a directory.
[066] A conceptual component 314 may generally be a data structure that is
designed to
hold data that describes one aspect of an organization's business. To
illustrate with an example
for a particular organization in the medical services industry, one example
conceptual component
12

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
may be a "physician" component where this conceptual component may be designed
to hold data
that describes the physicians that are employed by the particular
organization. To this end, the
"physician" conceptual component may include various properties 315 for
holding such data,
including a "Last Name" property, a "First Name" property, a "Specialty"
property, a "Telephone
Number" property, and/or a "Years in Service" property, among other examples.
[067] In some cases, properties may be shared across multiple conceptual
components.
For example, the "specialty" property may be shared across multiple
"Physician" conceptual
components and/or the "Clinic" conceptual component. In situations in which a
property is widely
shared across multiple conceptual components, the digital context may be
configured to promote
the "specialty" property from a property to a separate concept. This may be
accomplished without
changing the underlying data structure but rather reconfiguring it. This
ability of the neuro-
semantic network to adapt and learn as the organization changes makes it a
scalable and learning
model. The method provides for the ability to promote properties into concepts
or to collapse them
into concepts and associations to best represent the current structure of the
organization.
[068] Another example conceptual component 314 may be a "patient" component
where
this conceptual component may be designed to hold data that describes the
individuals that are
patients of the various physicians who are employed by the particular
organization. To this end,
the "patient" conceptual component may include various properties 315 for
holding such data,
including a "Last Name" property, a "First Name" property, a "Home Address"
property, and/or
a "preferred Payment Method" property, among other examples.
[069] Yet another example conceptual component 314 may be a "clinic"
component
where this conceptual component may be designed to hold data that describes
the various clinical
facilities utilized by the particular organization. To this end, the "clinic"
conceptual component
may include various properties 315 for holding such data, including a "Clinic
Name" property, an
"Address" property, a "Services Offered" property, and/or a "Capacity"
property, among other
examples.
[070] As depicted, another type of structural component of the digital
context may be an
associative component 316. An associative component is similar to a structural
component in that
it is designed to hold data that describes one aspect of an organization's
business. But more
specifically, the associative component is also designed to hold data that (i)
describes an aspect of
the organization's business such as an activity or a metric and (ii) relates
together to two or more
conceptual components 314. As an example, one example associative component
for the particular
organization in the medical services industry may be a "visit" component
designed to hold data
that describes a particular patient's visit to a particular physician at a
particular clinic and is thus
13

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
associative of multiple conceptual components, including the example
"physician," "patient," and
"clinic" structural components described above. To this end, the "visit"
associative component
may include various properties 317, including a "Date of Visit" property, a
"Duration of Visit"
property, "Billed Value of Visit," and/or a "Diagnosis of Visit" property,
among other examples.
[071] As mentioned throughout the examples given above, structural context
components, including both conceptual components and associative components,
include various
properties 315, 317 for holding certain specific descriptive data for the
structural context
component. In some implementations, each individual property of a given
structural context
component may be described by a particular combination of a structural data
type 318 and a
semantic data type 313, which may thus form a semantic component.
[072] Generally, a structural data type 318 applied to information is data
that describes
how the information is stored within the system. Many different structural
data types are possible.
As one example, a structural data type may take the form of a "temporal" data
type, under which
a "Years in Service" property may fall. As another example, a structural data
type may take the
form of a "spatial" data type, under which a "Clinic Address" property may
fall. As another
example, a structural data type may take the form of a "physical" data type,
under which a "Clinic"
and the "Clinic Name" property may fall. As another example, a structural data
type may take the
form of a "Personal" data type, under which a "Last Name" data type may fall.
As another
example, a structural data type may take the form of a "Quantitative" data
type, under which a
"Billed Value of Visit" property may fall. As another example, a structural
data type may take the
form of a "Categorical" data type, under which a "Specialty" property may
fall. It should be
appreciated that other examples may be possible as well.
[073] Generally, a structural data type helps define how data is managed,
indexed, and
stored for all similar properties in the network. Properties with common
structural data type may
use common data structures to store and retrieve data across a digital
duplicate and provide an
efficient way to store, access and relate data; allowing for unique
computations; and provide better
methods to access, resolve and compare similar data. For example, all
"temporal" data types may
share or "index" to a common timeline data structure that allows independent
events like a sale
event and a marketing discount that happened during the same month without
having to explicitly
compare data. This provides an ability to not only perform unique computations
and analysis on
properties with similar structural data like "same month," or "same quarter,"
but also compare
financial results of two unrelated companies for the same quarter even though
they belong to
different business networks because they use the same temporal data type. In
another case, if two
separate networks provide the population and economic data for the same
spatial data type (such
14

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
as a zip code), it allows one to overlay and contrast population and GDP for
the same zip code
with minimal effort. Multiple similar storage and advantages can be added to
across all shared
structural data types by creating a shared structural data type and storage
model across properties
in a network.
[074] Structural data types like "temporal," "spatial," "personal," or
"organizational"
may allow data and methods to be shared across one or more properties in a
network or across
whole networks using a common data structure like a shared timeline, time
resolution, or temporal
methods; while semantic data types (discussed below) allows for data and
methods to be shared
across a network using common meaning. Shared structural data types may also
have shared
resolution and absolute values. For instance, "February 2015" will have a
resolution of 1 day and
may be a delivery date to a customer or the start date of an employee. This
allows shared
computations like "Start Month" or "Delivery Month" to be performed.
[075] As also indicated, each property may also have a semantic data type
313.
Generally, a semantic data type applied to underlying information is data that
describes what the
information means. A semantic data type may have various aspects that
facilitate describing what
the information means. One aspect that a semantic data type may have is called
a primitive data
type. A primitive data type may describe the general form of the information.
Example primitive
data types may include "integer," "Boolean," "string," "float," etc. Another
aspect that a semantic
data type may have is a pointer that points to a particular function that may
be associated with the
information. This pointer may be stored in the dictionary entry 312 for the
particular semantic
data type and may point to various kinds functionality. As one example, the
pointer may point to
a web method for utilizing the underlying information. A web method may be any
operation or
set of operations embodied in a web service, API, or the like. For instance,
one web method may
be a "mailto:EmailAddress" web method that refers to a web method that causes
an email client
to be invoked, generate a new email message, and populate the "To:" field with
the email address
represented by the data variable "EmailAddress." Other web methods are
possible as well.
[076] Another example of a function to which a pointer may point is
mathematical
operation performed using the underlying information represented by the
semantic data type. For
instance, one type of mathematical operation for a "date of birth" semantic
data type may be an
age computation function. With such a function, the system may compute the age
of an individual
represented by the underlying date of birth information by, for instance
subtracting the "date of
birth" date from "current date" data to arrive at "age" data.
[077] Another type of mathematical function for a "price per unit" semantic
data type
may be a total price aggregation function. With such a function, the system
may aggregate all of

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
the data values from various "price per unit" data types to arrive a total
price value. Such a function
may be useful in situations where a customer is purchasing products or
services in a single order
that stems from two or more aspects of a business, which may not have
aggregated their data
systems in advance. Applying the "price per unit" semantic data type (or, in
other examples, a
similar-functioning semantic data type) serves to link the pricing across what
may be disparate
aspects of the organization and/or disparate data systems.
[078] Another type of mathematical function for a "lead time" semantic data
type may
be a lead time aggregation function. A "lead time" semantic data type may be
associated with a
product, component of a product, subassembly, construction project, etc. With
such a function,
when a customer purchases multiple products at once, an aggregation function
may be executed
in which the system may automatically populate "lead time" data by selecting
the individual lead
time field for each of the purchased products that has the greatest lead time
value. In cases in
which a product may not have a lead time associated with it, the lead time of
each subassembly or
component that makes up the product may by summed to approximate the total
lead time of the
product.
[079] In one example, during data ingestion, the system may capture various
data fields
for an order, including a "deliveryDate" field for describing the delivery
date of an order, an
"orderDate" field for describing the date of the placement of the order, and a
"deliveryTime" field
for describing the time taken for the order to be fulfilled after the product
is fully manufactured
and stocked in inventory, all of which may be specified by various a logistics
or fulfillment
systems. At this stage, the system may compute the actual lead time of the
product to be the
function of (deliveryDate - orderDate) - deliveryTime. Therefore, in the case
where a product is
not built before it is ordered (as is common in the heavy equipment industry,
for example) lead
time may be a residual value, as calculated above. Once lead time is known,
the system may then
engage in a function that compares the actual lead time with the approximated
lead time, which
may be made possible by the existence the "lead time" semantic data type being
used across
multiple business systems that is semantically distinct from a "delivery time"
type. A further
function may add an "error" to the function for computation of approximated
lead times for all
other products, which in turn may propagate the new calculation of
approximated lead times
throughout the digital duplicate instantaneously. In this way, the system may
engage in a kind of
machine learning.
[080] Another example of a function to which a pointer may point is a
linking function
that may operate to link two or more semantic data types together and form a
new property of an
associative structural component. As one example of this, a distance function
may link together
16

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
an "address" property of a "patient" conceptual component and an "address"
property of a "clinic"
conceptual component and computes the distance between these two addresses.
The function may
then save this distance as a new property of a new associative component.
[081] Yet another example of a function to which a pointer may point is a
semantic search
function. With such a function, a search may be executed on a given semantic
data type, which
may retrieve data of the same semantic type from other areas of the
organization or other network.
[082] To help illustrate, consider an example in which respective digital
duplicates have
been established for different aspects of an organization. Each such digital
duplicate will have its
own set of data components stored separately from the data components of the
other digital
duplicates. In a situation in which a user desires to know all employees that
share duties or interact
across the organization's departments, a semantic query can be issued on an
"Employee" semantic
data type. In the context of the present disclosure, such a semantic search
may return all data
objects that are based on this semantic type, regardless of the content,
format, or location of the
data. In this way, the semantic search unifies various disconnected digital
representations. With
conventional approaches, by contrast, a typical search would fail here,
because the data may be
spread out across multiple different databases and arranged in multiple
tables; and as such, any
query would need to account for these multiple databases and the various
tables.
[083] Considering another example, say a user desires to know all entities
(e.g., dealers,
customers, vendors, employees, etc.) having a specific area code. In the
context of the present
disclosure, the user could issue a single query on a "Phone No." data type for
the specific area
code of interest. Such a query would return all data objects having the
specific area code of interest
no matter the location or format of the data. By contrast, with a conventional
approach, a user may
need a deep understanding of the organization's data storage structure in
order to carry out this
query. For instance, the user may need to know what table the employee records
are stored in and
what field and what format the phone number data is stored in. Likewise, the
user may need to
know this same information for the dealer records, the customer records, the
vendor records, etc.
Each additional storage location may add complexity to the query. And to the
extent that the data
is stored in disparate data stores (such as one data warehouse for employee
records and another
data warehouse for vendor records), then the user may need to issue separate
queries for each such
disparate data source further compounding the complexity and vulnerability for
user error. Thus,
with the benefit of the present disclosure, it should be understood how the
semantic data type
provides for more efficient data retrieval, among other advantages.
[084] In some embodiments, user interface elements presented by one or more
computing
devices disclosed herein (e.g., client stations 113) may reflect semantic data
types with specific
17

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
graphical elements, such as icons. As one example, on a user interface that is
displaying multiple
semantic data types for an organization, the user interface may display a
telephone icon adjacent
to data that is of a "phone number" semantic data type, and/or a map icon if
the data is of an
"address" semantic data type, although other examples are possible. It should
be understood that
the functions disclosed herein are merely examples, and that in other
implementations, other
functions may be possible as well.
[085] Depending on the organization, semantic data types can be arranged
into various
semantic groups. A semantic group is generally a set of one or more semantic
data types that are
relevant to a particular categorical aspect of the organization. For instance,
example semantic
groups for an organization may be "Financial & Accounting," "Production &
Manufacturing,"
"Purchasing," and/or "Logistics." In this way, an organization may arrange the
semantic data
types into groups that are reflective of the organization's operating
departments or sectors. Thus,
the "Financial & Accounting," semantic group may have semantic data types that
refer to aspects
of the organization's own financial & accounting department, the "Production &
Manufacturing,"
semantic group may have semantic data types that refer to the aspects of the
organization's own
production and manufacturing operations, etc. As such, these semantic data
types may more
accurately describe the organization's own business operations and may thus be
more useful to
the organization.
[086] Semantic data types may provide various advantages to organizations
who utilize
the digital duplicate schema set forth in FIG. 3 and generally described
herein. As one advantage,
the semantic data type 313 may serve to discriminate between (i) human
language used to describe
an aspect of the organization's operations (which can be stored as the name of
a property, in one
embodiment) and (ii) the underlying meaning of the human language used to
describe the aspect
of the organization's operations (which can be stored as the semantic data
type, in one
embodiment). This discrimination may thus allow for properties in the digital
duplicate to be
unified by their underlying meaning (i.e., unified by their semantic data
type) even when the
human language used to describe them (i.e., their property names) may not be
the same.
[087] More particularly, but still by way of example, the digital duplicate
architecture
300 encourages this unification by having data sets that are consistently
accurate and complete
because no data is mismatched within a given context. To illustrate, if one
property is called
"Digits," and another property is called "Phone No." but these properties
refer to the same thing,
they both may be pulled into a report, a visualization, a computation, or used
in some other way
by the computing system when the digital duplicate calls for the semantic data
type 'Telephone
Number' within a given context. This may be accomplished through an
arrangement where
18

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
"Telephone Number" is a semantic data type that is shared by both the "Digits"
and "Phone No."
properties. In this way, the semantic data type may be said to "unify" two (or
more) properties by
these properties' underlying meanings.
[088] Unification may also allow for functions to be associated with
different properties
of the same semantic data type. To illustrate, as indicated above, "Digits"
and "Phone No." may
be two different properties that share the same semantic data type "Telephone
Number."
Therefore, both "Digits" and "Phone No." may have a pointer that points to a
"Make-Call"
function, which is assigned to these properties by virtue of their shared
semantic data type.
[089] Unification may also occur by enriching the structural context of the
digital
duplicate as a result of automating through-computation of additional
properties based on the
semantic data type(s) of the original properties and the functions available
for the semantic data
type(s). To illustrate using the example from above, the function for
computing "Age" from the
"Date of Birth" semantic data type is a form of unification because "Birth
Date" and "Date of
Birth" may be distinct properties in the digital duplicate but share the same
semantic data type:
"Date of Birth." Other examples of how the digital duplicate architecture
results in unification
are possible as well. The combination of the concept (node) or association
(link) that describes a
property in combination with a semantic data type (and in many cases a
structural data type)
individually and combined also create a strong representation of the digital
context. When
combined, they provide not only a simple way to locate every piece of data in
the business
network, and to locate a relative position of the data to other pieces of data
for navigation and
comparison, but also may provide meaning to the data and structure for
storage. Once combined,
these data structures create ways to simply and efficiently create, manage,
and navigate data in a
business or network using the digital context.
[090] As also depicted in FIG. 3, digital context 310 may contain a
composite structure
319. A composite structure 319 may contain one or more indications of sets of
concepts and
associations that represent various aspects of an organization. One type of
composite structure
may be a layer of concepts and associations. The concepts and associations
that comprise a layer
may represent similar aspects of the organization. In one example, an
organization in the medical
services industry may have a "pharmaceutical" layer that comprises concepts
and associations
related to any pharmaceutical aspects of the organization, such as pharmacy
employees, pharmacy
inventories, and/or an employee layer that comprises concepts and associations
related to
employees across all departments. Another type of composite structure may be a
realm of concepts
and associations. The concepts and associations that comprise a realm may
represent aspects of
the organization that are grouped on a broader level. For instance, a large
organization that is made
19

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
up of or owns several smaller businesses may have a realm that comprises all
the concepts and
associations for one entire business and a realm that comprises all the
concepts and associations
for another entire business. Yet another type of composite structure may be an
insight. The
concepts and associations that comprise an insight may represent collections
of concepts and
insights that have been automatically identified by the system as having some
threshold number
of relationships. The system may identify such insights when certain patterns
develop in the
organization's digital context (e.g., a threshold number of associations
between various concepts,
and/or a threshold number of shared properties between multiple concepts or
associations), and in
this may be identify to users unique aspects of the organization's operations.
Other examples of
layers, realms, and insights are possible as are other types of composite
structures.
B. Digital Content
[091] As also depicted in FIG. 3, the digital duplicate 301 includes
digital content 320.
Generally, digital content 320 is the underlying data that populates one or
more instances of the
framework for the digital duplicate (i.e., the digital context 310) described
above. Such digital
content may comprise data that may be ingested into the system (in accordance
with, perhaps, the
functionality associated with the organizer software tool described further
herein below) from one
or more data sources, such as business systems (e.g. CRM systems, ERP systems,
POS systems,
accounting software, etc.), enterprise data stores, data warehouses, data
lakes, operational data
stores, as well as any other type of kind of databases or data store, such as
data inputted by a user,
data mined from research reports, among other examples.
[092] This underlying data may be either static data, data updated in a
batched manner,
such as on a periodic or aperiodic refresh cycle, or data updated in real-time
or near real-time (e.g.,
data provided to the system in the form of a data "stream", which may or may
not be buffered to
align with the update frequency of the Digital Duplicate's data ingestion
method). Other examples
of data ingestion may be possible as well.
[093] As depicted, digital content may generally be comprised of links and
nodes. In
particular digital content 320 may include node data 321, node properties 322,
and node instances
323. Further, digital content 320 may also include link data 325, link
properties 326, and link
instances 327.
[094] As a general matter, node data 321 and link data 325 may include
underlying
instances of an organization's information that populates a digital context
schema, examples of
which have been described above. Node data 321 in particular may include the
underlying
information that populates the conceptual components of the digital context.
Referring back to the
examples described above, one example conceptual component may be a
"physician" component

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
where this conceptual component may be designed to hold data that describes
the physicians that
are employed by a particular medical services organization. Node data 321 may
thus include
underlying organization information for the "physician" component, such as
individual instances
323 of particular physician information. Thus, for each instance of
information that populates the
"physician" conceptual component, node data 321 may include a corresponding
node. The
underlying information within each respective node may be arranged into node
properties 322 in
accordance with the property structure defined by the conceptual component.
For instance, one
"physician" node may include node property data "Smith" for the "Last Name"
property of the
conceptual component, "John" for the "First Name" property of the conceptual
component,
"Pediatrics" for the "Specialty" property of the conceptual component, "555-
867-5309" for the
"Telephone Number" property of the conceptual component, and "30" for the
"Years in Service"
property of the conceptual component, although other examples are possible.
[095] Similarly, link data 325 may include the underlying information that
populates the
associative components of the digital context. Referring back to the examples
described above,
one example associative component may be a "visit" component where this
associative component
may be designed to hold data that describes a particular patient's visit to a
particular physician at
a particular clinic. Link data 325 may thus include underlying organization
information for the
"visit" component, such as individual instances 327 of particular visit
information. Thus, for each
instance of information that populates the "visit" associative component, link
data 325 may
include a corresponding link. The underlying information within each
respective link may be
arranged into link properties 326 in accordance with the property structure
defined by the
associative component. For instance, one "visit" link may include link
property data "Jan. 2, 2020"
for the "Visit Date" property of the associative component, "1 hour" for the
"Duration of Visit"
property of the associative component, and "$110" for the "Billed Value"
property of the
conceptual component, although other examples are possible.
C. Storage Schema
[096] The network of individual data components described above may be
stored in one
or more data stores 303 in various ways. The structure of the digital context
and well as the storage
schema, as described herein, allows for network traversal as well as semantic
searches (described
above) while querying for data. As a result of traversal of the "data
network," subsets of the data
can be efficiently retrieved and presented to one or more users. Data stores
303 may take the form
of one or more of SQL Server, Oracle, Mongo DB, or other storage technologies.
[097] As one example of the various ways in which the individual data
components may
be stored in data stores 303, relationships between conceptual components 314
and associative
21

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
components 316 may be stored using what are referred to as unique identifiers
("UIDs"). In this
way, each element of each instance of the digital duplicate 301 may have an
associated UID
(which, depending on the situation, may or may not be unique). As outlined
above, the various
elements that may have a UID assigned thereto may be (i) domains, (ii)
subdomains, (iii)
directories, (iv) conceptual components, (v) associative components, (vi)
properties, (vii)
dictionaries, (viii) semantic groups, and/or (ix) semantic data types. In some
implementation, a
UID may take the form of a URI (Uniform Resource Identifier), or any other
standard identifier
type, among other examples.
[098] As an illustrative example the "Patient" conceptual component may
exist in data
storage 303 in, for instance, table form with underlying digital data
populated for the component
in the form of [P1, P2, P3, etc.]. Likewise, the "Physicians" conceptual
component may exist in
data storage 303 in, for instance, table form with underlying digital data
populated for the
component in the form of [H1, H2, H3, etc.]. Likewise, the "Clinics"
conceptual component may
exist in data storage in table form with underlying digital data populated for
the component in the
form of [C1, C2, C3, etc.].
[099] Using this arrangement, the "Visits" associative component may
accordingly exist
in data storage 303 in, for instance, table form with underlying digital data
populated for the
associative component in the form of a table containing intersecting data from
the other related
conceptual components. As an example, one specific instance of the "Visit"
component may have
data that takes the form [P1, H3, C2], where this instance describes a visit
that took place by
patient "P1" who was treated by physician "H3" at clinic "C3," although other
combinations are
be possible.
[100] Reciprocal data tables may be stored in data storage 303 as well. A
reciprocal table
may serve to identify, for the conceptual component data, whether and to what
extent there is
associative component data that relates in some way to the conceptual
component data. Using the
examples set forth above, the "Patient" conceptual component discussed above
may contain a
reciprocal table that intersects its dependent structural components for each
instance of a "Patient,"
where one instance for Patient "P1" may take the form of [V1, H3, C2]. Other
examples of
reciprocal tables may be possible as well.
[101] In this way, the data defining the schema for the digital duplicate
may be embodied
as a "data network" or form of neurosynaptic storage of information, where
associative
information (such as that described above) is stored at the intersection point
of the structural
components. Each instance of such data tables for corresponding "Visits,"
"Patients,"
"Physicians," and "Clinics" (as examples) may be created from source data by
an organizer part
22

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
of the data ingestion method, described below. This provides certain
advantages over traditional
data storage models, such as relational models that use fixed relationships
between data. As one
advantage, the present technique uses a single, atomic template to implement
each structural
association and its corresponding components in the appropriate storage model.
As such, this
technique allows for dynamic expansion to create as many associations as may
be desired to
represent the desired comprehensive network. In comparison to NoSQL databases
that store
entities as collection of key-value pairs and allow for each record to have a
variable structure in
each table, or graph databases that use key-value pairs to store relationships
between two values,
the digital duplicate architecture allows information to be stored within a
flexible neurosynaptic
data structure to describe the data using the directory, dictionary, and the
structural data types.
This provides dramatic flexibility both to store and locate data accurately
and to capture additional
business information within the network.
[102] Further, the data defining the schema for the digital duplicate can
be stored in data
storage 303 via tables that represent all relationships that comprise the
network of components
(referred to herein as the "Digital Context" 301). And in this way, data
ingested can be placed
within this digital context 301. In some implementations, this technique may
provide for
traceability between data sources and its corresponding data context using
UIDs for each source
of data and for each contextual element. As an example of this, a patient's
"First Name" data
element may reference the UID of the structural elements corresponding to this
data element (for
example, the patient's associated visits) and vice-versa.
[103] The system may be further configured to store a particular digital
context 310
and/or the underlying digital content 320 for the particular digital context
with an indication that
the particular digital context and/or the underlying digital content belongs
to unique domain 311
or subdomain. For instance, a unique domain (and/or subdomain) may be
established for each
instance of the digital duplicate and may be stored in a registry 302. A
registry 302 may contain
(a) a list of domains and (b) a list of all subdomains that exist within each
domain. For instance,
returning to the example organization in the medical services industry, the
list of domains may
contain a domain indicator (e.g., a URI) that is specific to this
organization. The domain indicator
may thus represent all the data that is stored as the digital content for a
digital duplicate related to
this organization. Within each domain, there may be one or more subdomains for
individual data
contexts for the organization. For instance, within the domain for the example
organization, in the
medical services industry, there may be a subdomain for "Purchasing," and a
subdomain for
"Marketing," among other examples. This, the list of subdomains may contain
subdomain
indicators (e.g., URIs) that identify these subdomains.
23

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[104] A registry 302 may also contain data describing locations and
identifiers of
authentication security services for users accessing data within a given
domain. For instance,
domains and subdomains may be private (accessible only to users within an
organization), and as
such may contain such authentication data that serves to describes the various
user that have
appropriate permissions to access the given domains and/or subdomains. Domains
and
subdomains may also be public, and therefore accessible to any users or
systems outside of an
organization. Other examples of data that may be stored in the registry are
possible as well.
[105] As explained, the schema for one instance of a digital context may be
stored in or
with what is referred to as a "dictionary" 312. In this way, a single
dictionary 312 may store data
that describes the digital context 310 for one specific organization. The
system may thus store
multiple dictionaries, with one dictionary being stored for each specific
organization that utilizes
the system to create and store an instance of a digital duplicate 301. In some
implementations,
however, dictionaries may be shared between domains and/or subdomains. For
instance, if a first
organization in the medical services industry has already established a
dictionary that stores its
schema data describing its digital context, then a second similar medial
services organization may
benefit from using this same dictionary already established for the first
organization. In this way,
a common set of semantics may be used across organizations in the same or
similar industries.
[106] The digital duplicate may be stored via data store 303 using any
appropriate data
storage technology, including by way of example, graphical databases,
relational databases (SQL,
Oracle), in-memory data storage, as well as other types of storage. Digital
duplicate information
may be stored in two or more such database technologies for redundancy and/or
performance
purposes.
[107] In some implementations an index file may be used as a separation of
concerns
measure. For instance, an index file that may contain data keys may reside in
one location and the
digital duplicate data may reside in another, perhaps remote, location. In
this way, a set of semantic
services may be employed to store and retrieve data specific to the underlying
digital duplicate
data by first accessing the data keys and then using those data keys to
identify and access the
location of the underlying digital duplicate data.
IV. EXAMPLE VISUALIZATIONS
[108] A computing device, such as computing device 200 (FIG. 2), which as
described
above, may serve as one or more of client stations 113 (FIG. 1B) and/or back-
end platform 112
(FIG. 1B), may be configured to generate various visualizations of an
established digital context.
In one example, a computing device may be configured to generate a
visualization of an entire
digital context (which as mentioned above may, from time to time, be referred
to herein as a
24

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
"semantic network") established for a particular organization, where the
structural components of
the semantic network are represented as "nodes" and the associative components
of the semantic
network are represented as "links" between the nodes thus forming a web-like
structure.
[109] To illustrate one example of this, FIG. 4 depicts an example snapshot
400 of a
graphical user interface (GUI) that provides a web visualization 402 and a
listing 404 of various
nodes and links that may comprise an example semantic network. As depicted,
the web
visualization 402 includes a number of nodes, such as example node 406 for
"Product" and
example node 410 for "Ship Date." As explained with reference to a different
example above, the
"Product" node 406 may include various properties that contain data for
describing a product,
such as product name, product model number, etc. Likewise, "Ship Date" node
410 may include
various properties that contain data for describing various shipping dates
whereon various
products were shipped to customers of the organization or the like, such as
month, day, year, etc.
As also depicted, web visualization 402 also includes a number of links, such
as "Orders" link
408, which may contain data that relates together data represented by two or
more nodes. The
"Orders" link 408, for instance, may contain data that describes various
instances of customer
orders for particular products.
[110] A computing device may take one or more actions responsive to
receiving user
inputs via a GUI that displays a visualization, such as the example
visualization depicted in FIG.
4. For instance, responsive to receiving a selection of one of the nodes or
links displayed in web
visualization 402 (e.g., through a mouse click or a tap on a touch-screen
interface, or the like), the
computing device may display further information related to the selected node
or link, such as
displaying the properties of the selected node or link, or displaying data
related to the underlying
data records that have been assigned to the selected node or link, where such
data may be in the
form of a data table, graph, or some other format outlining or summarizing the
underlying data
records. The computing device may take other actions responsive to receiving
various user input
via the GUI as well, such as by rearranging or repositioning the various nodes
and links of the
semantic network in response to receiving a click-and-drag input via a mouse
or a touch-and-drag
input via a touch screen, among other possible inputs and responsive actions.
V. EXAMPLE OPERATIONS
A. Example Digital Context and Digital Content Creation Operations
[111] Example operations that may be carried out by one or more computing
devices
running the disclosed software tool are discussed in detail below. For
purposes of illustration only,
these example operations are described as being carried out by a computing
device, such as
computing device 200 (FIG. 2), which as described above, may serve as one or
more of client

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
stations 112 (FIG. 1) and/or back-end platform 102 (FIG. 1). In this respect,
it should be
understood that, depending on the implementation, the operations discussed
herein below may be
carried out entirely by a single computing device, such as one or more of
client stations 112 or by
back-end platform 102, or may be carried out by a combination of computing
devices, with some
operations being carried out by back-end platform 102 (such as computational
processes and data-
access operations) and other operations being carried out by one or more of
client stations 112
(such as display operations and operations that receive user inputs). However,
other arrangements
are possible as well.
[112] To help describe some of these operations, flow diagrams may also be
referenced
to describe combinations of operations that may be performed by a computing
device. In some
cases, a block in a flow diagram may represent a module or portion of program
code that includes
instructions that are executable by a processor to implement specific logical
functions or steps in
a process. The program code may be stored on any type of computer-readable
medium, such as
non-transitory computer readable media (e.g., data storage 204 (FIG. 2)). In
other cases, a block
in a flow diagram may represent circuitry that is wired to perform specific
logical functions or
steps in a process. Moreover, the blocks shown in the flow diagrams may be
rearranged into
different orders, combined into fewer blocks, separated into additional
blocks, and/or removed,
based upon the particular embodiment. Flow diagrams may also be modified to
include additional
blocks that represent other functionality that is described expressly or
implicitly elsewhere herein.
1. Digital Context Creation
[113] To help illustrate how a computing system (such as back-end platform
102) may
establish a digital context, such as digital context 310, reference will now
be made to flow diagram
500 of FIG. 5, which depicts an example process for stabling a digital context
carried out in
accordance with the disclosed digital duplicate technology.
[114] Turning first to block 502, the computing device may first create a
domain for a
particular organization. As indicated a domain may serve to identify that a
particular set of data is
specific for a particular organization. To this end, as part of creating a
domain for a particular
organization, the computing device may also assign a unique identifier (e.g.,
a URI or the like) for
the domain. As also indicated, domains may include zero or more subdomains for
describing
additional aspects of an organization's business, such as individual
operational departments. Thus,
as part of creating the domain, the computing device may create one or more
subdomains, as
desired, and assign unique identifiers to these subdomains (e.g., UltIs or the
like).
[115] The computing device may create a domain by, for instance, receiving
a user input
or a set of user inputs from a client station 112. For instance, the computing
device may cause a
26

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
client station 112 to present a graphical user interface (GUI) of some kind,
through which a user
may provide a user input or a set of user inputs providing information or
instructions to facilitate
the computing device creating a domain for a particular organization. As one
example, a user input
may provide the computing system with various information to create the
domain, such as the
name of the domain, the name of the organization, and the name of any
subdomains, among other
examples. As another example, the user input may provide an indication and/or
instructions for
how the computing device should retrieve from one or more external data
sources various
information to create the domain. For instance, such an indication or
instructions may include an
identification of one or more external data sources at which domain
information can be retrieved.
In some implementations, a combination of user inputs may be provided to the
computing device
to facilitate creating the domain, including one or more user inputs that may
provide the computing
system with various information to create the domain, such as the name of the
domain, the name
of the organization, and the name of any subdomains, and one or more user
inputs that may provide
an indication and/or instructions for how the computing device should retrieve
from one or more
external data sources various information to create the domain. Other examples
of user inputs may
be possible as well.
[116] Turning next to block 504, the computing device may create or enroll
an existing
dictionary for the organization. As indicated, a dictionary may store data
that describes the digital
context for one specific organization. Depending on the implementation, a
dictionary may be
created for an organization from scratch, such as by engaging in one or more
functions described
by the remaining blocks in the flow diagram 500, or all or part of a
dictionary may be enrolled
based on a dictionary for an existing organization.
[117] In the case that all or part of a dictionary will be enrolled based
on a dictionary for
an existing organization, the computing device may do this by selecting the
organization or
organization from which all or part of a respective dictionary will be
enrolled, and then selecting
all or part of the dictionary for the selected organization. To facilitate,
the computing device may
cause a client station 112 to present a GUI of some kind, through which a user
may provide a user
input or a set of user inputs providing information or instructions to
facilitate the computing device
selecting the organization or organization from which all or part of a
respective dictionary will be
enrolled, and then selecting all or part of the dictionary for the selected
organization. As one
example, the client station 112 may present a selection menu presenting
possible organizations
that may be selected, and once an organization is selected, the client device
112 may present a
selection menu to select that presents possible dictionary entries for that
organization that may be
selected. Other ways to enroll all or part of an existing dictionary may be
possible as well.
27

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[118] In the case that a new dictionary for an organization will be created
from scratch,
the computing device may do this in various ways. As one possibility, the
computing device may
cause a client station 112 to present a GUI of some kind, through which a user
may provide a user
input or a set of user inputs providing information or instructions to
facilitate the computing device
creating a dictionary. Such a user input or user inputs may be provided as
described in the
following blocks.
[119] For instance, at block 506, the computing device may define the
conceptual
components and properties thereof. The computing device may define the
conceptual components
in various ways. As one possibility, the computing device may cause a client
station 112 to present
a GUI of some kind, through which a user may provide a user input or a set of
user inputs providing
information or instructions to facilitate the computing device defining the
conceptual components.
As one example of this, the client station 112 may present a selection menu
presenting possible
conceptual components and possible properties that may be selected. As another
example, the
computing device may cause a client station 112 to present a GUI with one or
more form fillable
fields though which a user may enter information defining the conceptual
components and
properties thereof. Other ways to define conceptual components may be possible
as well.
[120] At block 508, the computing device may define associative components
and
properties thereof. The computing device may define the conceptual components
in various ways,
which may be similar to the ways the computing device defined the conceptual
components. As
one possibility, the computing device may cause a client station 112 to
present a GUI of some
kind, through which a user may provide a user input or a set of user inputs
providing information
or instructions to facilitate the computing device defining the associative
components. As one
example of this, the client station 112 may present a selection menu
presenting possible
associative components and possible properties that may be selected. As
another example, the
computing device may cause a client station 112 to present a GUI with one or
more form fillable
fields though which a user may enter information defining the associative
components and
properties thereof. Other ways to define associative components may be
possible as well.
[121] And at block 510, the computing device may assign to each property of
each
component a semantic data type and a structural data type. The computing
device may define the
conceptual components in various ways, which may be similar to the ways the
computing device
defined the conceptual components. As one possibility, the computing device
may cause a client
station 112 to present a GUI of some kind, through which a user may provide a
user input or a set
of user inputs providing information or instructions to facilitate the
computing device defining the
associative components. As one example of this, the client station 112 may
present a selection
28

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
menu presenting possible associative components and possible properties that
may be selected.
As another example, the computing device may cause a client station 112 to
present a GUI with
one or more form fillable fields though which a user may enter information
defining the
associative components and properties thereof. Other ways to define
associative components may
be possible as well.
2. Digital Content Creation
[122] A computing system (such as back-end platform 102) may facilitate the
creation
of digital content by a process of ingesting organization data and storing it
as an instance of the
digital context. To illustrate how the system may ingest data into the system
and thus store it as
an instance of a digital duplicate in accordance with one example of the above-
described schema,
reference is made to a flow diagram 600 of FIG. 6. At a high level, the
ingestion process may
involve contextualizing and storing in the data store each piece of inbound
data. Generally, this
process may be used to ingest data for (a) an initial population of data to
form an initial instance
of a digital duplicate, and (b) subsequent automated or manual refreshes of
the digital duplicate
instance from all data sources on a regular cadence (e.g. every 10 minutes,
although other
examples may be possible).
[123] Turning first to block 602, a computing device may first identify and
enroll source
data that will be used to populate the digital context to form a digital
duplicate. As a general
matter, the computing device may do this by establishing connections to one or
more data sources
so that the computing device can begin to ingest an organization's data from
these one or more
data sources. For instance, the computing device may cause a client station
112 to present a GUI
through which a user may provide a user input or set of user inputs that
provides to the computing
device data source configuration information, including for instance, data
source location
information (e.g., IP address or other network address), data format
information, data timing
information identifying how often the computing device will ingest new data
from the data source
(e.g., 10 minutes), and any credentials necessary for accessing the
organization's data at the data
sources. Using this information, the computing device may connect to and begin
to ingest the
organization's data. Other ways to identify and enroll data sources may be
possible as well.
[124] After the data is initially ingested, the computing device may
contextualize the
ingested data, which at a high level generally includes assigning or updating
a version number or
timestamp to the data, assigning a semantic data type to the data, assigning a
structural data type
to the data, and/or assigning a URI to the data. Thus, turning next to block
604, the computing
device may assign or update a version number or timestamp to the data. For
instance, when the
computing device initially ingests a piece of data from a given data source,
the computing device
29

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
may assign a version number and/or mark or otherwise associate the data with a
timestamp that
indicates the time of ingestion. When the computing device updates this
ingested data by receiving
a new version of the piece of data from the data source, the computing device
may increment the
version number and/or update the timestamp.
[125] At block 606, the computing device may assign a semantic data type to
the ingested
data. The computing device may do this in various ways. As one possibility, as
part of the
identification and enrollment of the data source (block 602), the computing
device may have
established that a particular semantic data type should be applied to various
data ingested from
the particular data source. For instance, a user may have provided one or more
user inputs at this
step indicating that some or all data ingested from the particular data source
should have a
particular semantic data type assigned thereto. In this case, the computing
device may assign this
semantic data type to the ingested data. As another possibility, the computing
device may cause a
client station 112 to present to a user a GUI through which the user may
select certain ingested
data and then select or otherwise provide a user input that assigns a semantic
data type to that
ingested data. Other ways to assign a semantic data type to the ingested data
may be possible as
well.
[126] At block 608, the computing device may assign a structural data type
to the
ingested data. The computing device may do this in various ways, similar to
the possibilities
described above. As one possibility, as part of the identification and
enrollment of the data source
(block 602), the computing device may have established that a particular
structural data type
should be applied to various data ingested from the particular data source.
For instance, a user
may have provided one or more user inputs at this step indicating that some or
all data ingested
from the particular data source should have a particular structural data type
assigned thereto. In
this case, the computing device may assign this structural data type to the
ingested data. As another
possibility, the computing device may cause a client station 112 to present to
a user a GUI through
which the user may select certain ingested data and then select or otherwise
provide a user input
that assigns a structural data type to that ingested data. Other ways to
assign a structural data type
to the ingested data may be possible as well.
[127] At block 610, the computing device may assign URIs or other
identifying
indicators to the ingested data. Assigning URIs or other identifying
indicators to the ingested data
may serve various purposes and provide various advantages. For instance,
assigning URIs or other
identifying indicators to the ingested data may serve to associate the
ingested data with various
properties for the digital context and thus may associate the data with the
appropriate semantic
data type for each individual property. As another example, assigning URIs or
other identifying

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
indicators to the ingested data may, associate the data with appropriate
conceptual components of
the structural context. And as another example, assigning URIs or other
identifying indicators to
the ingested data may and serve to associate the ingested data with
associative components of the
structural context. The computing device may then persistently store the
ingested data in the data
store (e.g., data store 303) along with the assigned UltIs.
B. Example Organizer Operations
[128] As mentioned above, an organizer tool (which may be embodied by a
software tool
running on a computing device) may be used to make a connection between the
semantic network
and an organization's underlying data stores and may be configured to engage
in certain
functionality, including linking multiple data sources to the semantic
network, and/or onboarding
the underlying data from the organization's data stores to an organizer data
store and ultimately
into the semantic network after filtering, cleaning, transforming, and/or
validating the data as
desired.
[129] Example operations that may be carried out by one or more computing
devices
running the disclosed organizer software tool are discussed in detail below.
For purposes of
illustration only, these example operations are described as being carried out
by a computing
device, such as computing device 200 (FIG. 2), which as described above, may
serve as one or
more of client stations 113 (FIG. 1B) and/or back-end platform 112 (FIG. 1B).
In this respect, it
should be understood that, depending on the implementation, the operations
discussed herein
below may be carried out entirely by a single computing device, such as one or
more of client
stations 113 or by back-end platform 112, or may be carried out by a
combination of computing
devices, with some operations being carried out by back-end platform 112 (such
as computational
processes and data-access operations) and other operations being carried out
by one or more of
client stations 113 (such as display operations and operations that receive
user inputs). However,
other arrangements are possible as well.
[130] To help describe some of these operations, a flow diagram may also be
referenced
to describe combinations of operations that may be performed by a computing
device. In some
cases, a block in a flow diagram may represent a module or portion of program
code that includes
instructions that are executable by a processor to implement specific logical
functions or steps in
a process. The program code may be stored on any type of computer-readable
medium, such as
non-transitory computer readable media (e.g., data storage 204 (FIG. 2)). In
other cases, a block
in a flow diagram may represent circuitry that is wired to perform specific
logical functions or
steps in a process. Moreover, the blocks shown in the flow diagrams may be
rearranged into
different orders, combined into fewer blocks, separated into additional
blocks, and/or removed,
31

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
based upon the particular embodiment. Flow diagrams may also be modified to
include additional
blocks that represent other functionality that is described expressly or
implicitly elsewhere herein.
[131] As noted above, in one aspect, the organizer software tool may engage
in a "data
onboarding process" by which it may cause a computing device to, among other
functions, carry
out a process that facilitates connecting to various data sources that house
an organization's data,
establishing pipelines that operate to onboard the organization's data to an
organizer data store
and apply filter, clean, and/or transformation operations that help map the
organization's data to
a semantic network established for the organization This process may take
various forms.
[132] With reference now to flow diagram 700 of FIG. 7, one example of a
data
onboarding process carried out in accordance with the disclosed organizer
software tool is
illustrated and described. This process may generally involve the following
operations: (i) at block
702, the computing device may create a cloud instance for the digital content,
(ii) at block 704,
the computing device may identify and connect to the data sources used for the
digital content,
(iii) at block 706, the computing device may retrieve the data models (which
may generally be
information that identifies the column structure and general format of data
stored at the data
source) from the data sources and import the data models into the software
tool, (iv) at block 708,
the computing device may establish filter and/or clean operations to be
performed on the source
data, (v) at block 710, the computing device may import the source data to an
organizer data store
and apply the established filer and/or clean operations, (vi) at block 712,
the computing device
may define a set of transformations to apply to the source data, (vii) at
block 714, the computing
device may export the data from the organizer data store to the semantic data
store and apply the
defined transformations, and (viii) at block 716, the computing device may
establish an update
schedule by which the system will refresh the semantic data store. Each of
these operations will
now be discussed in further detail with reference, in some cases, to example
snapshots of GUIs
that may facilitate some or all of the disclosed functionality.
[133] Turning first to block 702, the computing device may begin the data
onboarding
process by creating a cloud instance for the digital content. In one
implementation, this may take
the form of establishing a cloud network storage location wherein a particular
organization's data
may be stored. This may include, for instance, establishing or reserving a
portion of a cloud
network storage location or other data store to serve as either or both of the
organizer data store
114B and/or the semantic network data store 114C (FIG. 1B). This cloud network
data store may
be provided either by the owner or operator of the back-end platform 112 or by
a third-party
provider of "on demand" cloud storage, such as Amazon Web Services (AWS), or
the like. To
facilitate this process, the organizer software tool may present one or more
GUI screens through
32

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
which a user may enter various identifying information for the organization
and/or business unit
or division of the organization and then use this information to establish an
appropriate cloud
instance for the digital content. A computing device may create a cloud
instance for digital content
in other ways as well.
[134] Next at block 704, the computing device may identify the
organization's various
disparate data sources (represented in one example in FIG. 1B by business data
sources 114A or
in another example in FIG. 1A by data sources 106), which are desired to be
used as sources for
the digital content. The computing device may do this in various ways. As one
possibility, the
computing device may present a GUI through which a user may provide various
user inputs that
indicate how the computing device can connect to the one or more
organization's various disparate
data sources. For instance, for each data source desired to be connected to
and used as a source
for the digital content, a user may provide a user input that indicates a name
for the data source
(e.g., "customer database," employee list," etc.), the type of data source
(e.g., Microsoft Excel
workbook, SAP HANA, Microsoft Access database, Oracle database, SQLServer
database, a web
service, TADA database, etc.), the IP address or other location-identifying
information of the data
source, credentials or other security information for accessing the data
contained within the data
source, and/or information identifying the type or format of the data stored
within the data source.
A user may provide other information that facilitates the computing device
connecting to an
organization's disparate data sources as well.
[135] As another possibility, the computing device may receive a file
upload that
comprises a data source. As an example, the computing device may receive a
spreadsheet provided
via computer readable media, such as a flash drive or disc. To facilitate
receiving a file upload
that comprises a data source, the computing device may present a GUI through
which a user may
provide various inputs that instruct the computing device where to retrieve
the file from. The
computing device may identify and connect to an organization's various
disparate data sources in
other ways as well. Once the computing device identifies and/or connects to a
given data source,
that data source may be said to be "enrolled." It should be appreciated that,
in practice, multiple
data sources may be "enrolled" by the disclosed software tool, each of which
may ultimately
provide underlying data to the platform.
[136] Next at block 706, the computing device may then connect to the
identified data
sources, identify the data models used at the data sources, and import the
data models into the
platform. . For example, for data stored in a typical relational database, the
data model for that
database may include information that identifies how many tables are included
within the
database, how many columns there are for each table, the name of each column,
the data type of
33

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
the records that are or will be stored within the data tables (e.g.,
"integer," "string," "decimal,"
"datetime," etc.), and/or other information that further defines the data
model, such as allowable
length of a data entry stored at each column, or other rules for storing data
at each column such as
whether leading or trailing spaces are allowed.
[137] In some implementations, once the computing device has retrieved the
data models
from the one or more data sources, the computing device may present a GUI that
displays a listing
of the one or more data sources, the various tables available at each data
source, and a navigable
interface that allows a user to explore the data model of each data source. To
illustrate one example
of this, FIG. 5 depicts an example snapshot 500 of a GUI that displays a
listing of the data sources,
the various tables available at each data source, and a navigable interface
that allows a user to
explore the data model of each data source. As depicted, the GUI may include a
"source table"
graphical location 506 that may display indications of the various data
sources that have been
enrolled and are thus available to the computing device, including what tables
are included in each
data source and which columns are included within each table. As depicted in
the example GUI
of FIG. 5, the source table location 506 shows a "VPData" data source that
includes several tables
including a "VPData AssetOper" table and a "VPData AssetOperDaily" table,
among other
displayed tables. The example GUI may use a "nested tree structure" layout to
arrange the source
table graphical location 506 such that the tables of a given data source are
nested underneath the
name of the data source, and columns of each table are nested underneath each
table name. Other
ways to display the columns, tables, and data sources available to the
computing device may be
possible as well.
[138] Returning to FIG.7 at block 708, the computing device may next
establish filter
and/or clean operations to be performed on the source data prior to or in
conjunction with
onboarding the data. Many types of filter/clean operations are possible. For
example, a filter/clean
operation may specify which portions of the source data to import from the
various data sources
and which portions not to import. A filter/clean operation may also specify,
for instance, whether
to change the name or data type of any aspect of the data model during the
onboard process. For
instance, one example filter/clean operation may be an operation that changes
the name of a
column from something like "ShNm" to "Shift Name" or from "AID" to "Asset ID."
Another
example filter/clean operation may be an operation that changes a data type
for a column from
"float" to "integer" or from "string" to "nvarchar." Yet another example
filter/clean operation
may be an operation that changes the length of the data input from, say, "500"
to, say, "20." And
yet another example filter clean operation may be an operation that specifies
to import only those
data records that match a particular query filter, such as records that have
an "InstID" value of
34

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
"28." These were merely examples, however, and other types and examples of
filter/clean
operations may be possible in practice. One advantage of applying a
filter/clean operation at this
point in the process is that the data can be preconditioned to a known data
type for further
manipulation, independent of its original data type. Additionally, any errors
in the data source can
be identified during the onboarding process (e.g., if not all items in a
column share the same data
type, an error will be raised at this point).
[139] To facilitate establishing filter and/or clean operations, such as
the forgoing
examples as well as other possible examples, the computing device may present
a GUI through
which a user may provide various user inputs in order to specify one or more
filter/clean operations
to employ. To illustrate with an example, returning to snapshot 800 of FIG. 8,
the GUI depicted
includes an example "filter/clean" graphical location 804 and a "table
interface" graphical location
802.
[140] Referring first to filter/clean graphical location 804, this location
shows various
information about the data model and any established filter/clean operations
for a selected table
from the data source. As depicted in FIG. 8, for instance, the example
filter/clean graphical
location 804 depicts various information related to the "VPData AssetOper"
source data table.
As further depicted in filter/clean graphical location 804, for instance, the
GUI may include a
"Name (Source Table)" column 810 that shows the name of the various columns as
they exist in
the source data table, a "Name (Column Interface)" column 808 that shows the
name of the
corresponding column as it will be identified in the (new) table and column
interface (i.e., when
imported into the organizer data store), a "Source Data Type" column 812 that
shows the data
type for the column as it exists in the source data table, an "Interface Data
Type" column 814 that
shows the (new) data type for the corresponding column as it will exist in the
table and column
interface (i.e., when imported into the organizer data store), and a "Length
or Scale" column 816
that shows the (new) length or scale of the data for the corresponding column
as it will exist in the
table and column interface (i.e., when imported into the organizer data
store).
[141] Portions of the filter/clean graphical location 804 may be configured
to receive
user inputs in order to establish certain filter/clean operations. For
instance, in this example
columns 808, 814, and 816 may be editable so as to provide the ability to
change the name of
columns, change the data type, and/or change the length of data, respectively,
of the data tables of
the data source prior to importing the data from the data source into the
organizer data store. As
an example, a user may enter text into the various rows of columns 808 or 816
in order to change
the name of a given column of a source data table or change the length or
scale of the data stored
for that column. The various rows of column 814 may include drop down boxes
that contain

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
various example data types that may be selected by a user to change the data
type for a given
column of a source data table. The filter/clean graphical location 804 may
include other columns
as well so as to provide the ability to view and/or edit one or more other
properties of the data
model of a data source prior to importing the data from the data source into
the organizer data
store. In this way, a user may navigate through the various data tables of the
data source via the
filter/clean graphical location 804 and provide various user inputs (e.g., via
columns 808, 814,
and/or 816, among, perhaps, other possible columns that are not shown) to
establish a set of
filter/clean operations to apply to the source data as it is imported into the
organizer data store.
[142] Once a set of filter/clean operations are established for one or more
data tables of
a particular data source, the computing device may engage in some additional
functionality. For
instance, in some implementations, the computing device may determine a set of
similar
filter/clean operations to apply to other data tables in the same or different
data source based on
the set of filter/clean operations that were established above by the
computing device, for instance,
receiving via a GUI one or more user inputs that define a set of filter/clean
operations. The
computing device may do this in any of a number of different ways. As one
possibility, for
instance, for a set of filter/clean operations that change the name of a
column from "AID" to
"Asset ID," and that change a data type for this column from "decimal" to
"integer," and that
changes the length of the data input from "500" to "20," the computing device
may first search
other source data tables that have columns that are similarly named (e.g., for
columns that are
named "AID," "AssetID," "Asset ID" or the like) and have similar source data
formats (e.g.,
columns that may have data formats of "decimal" and/or have length at or near
"500"). Once any
such columns are found in other source data tables, the computing device may
apply the same
filter/clean operations to these data columns, or the computing device may
present to a user though
a GUI a dialog box or the like recommending these filter/clean operations and
prompting the user
to approve the set of recommended filter/clean operations. As another
possibility, the computing
device may recognize an existing filter/clean operation to apply as a result
of a user entering in a
few characters of a particular column. For instance, for a new filter/clean
operation where a user
enters "Asse" into the "Name (Column Interfaces)" column 808, the computing
device may
suggest an existing set of filter/clean operations where at least the phrase
"Asse" appears, such as
a set of filter/clean operations that change the name of a column from "AID"
to "Asset ID," and
that change a data type for this column from "decimal" to "integer," and that
changes the length
of the data input from "500" to "20."
[143] In some implementations, the computing device may generate what is
referred to
as a "table interface" having one or more "column interfaces." The computing
device may
36

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
generate one table interface for every source data table that will be
onboarded into the system and
stored in the organizer data store. Likewise, for every column in a particular
source data table that
will be onboarded into the system and stored in the organizer data store, the
computing device
may generate a corresponding column interface. These column interfaces and
table interfaces are
generally data structures that define or contain, among other things, (i)
information regarding
where in the source data stores to get the data that forms the particular
column or table interface,
and (ii) the set of zero or more filter/clean operations that may be applied
to the source data as it
is imported into the organizer data store. In this way, a set of all table
interfaces for a particular
organization can be thought of together as a "data interface," which serves as
the input for the rest
of the data operations carried out (or facilitated) by the disclosed organizer
software tool.
[144] In some embodiments, the computing device may present a GUI or
portion of a
GUI that displays the list of table and column interfaces for a given project.
Returning to the
example GUI of FIG. 8, as mentioned this GUI may include a table interface
graphical location
802, which displays the list of table interfaces and column interfaces in a
nested format. Each table
interface may be selectable such that when a given table interface is
selected, the computing device
may display in filter/clean graphical location 804 a corresponding set of
information regarding the
data model and any selected filter/clean operations for the corresponding
table from the source
database.
[145] Generating a data interface, which comprises table interfaces and
column
interfaces as referred to above, may provide several advantages. For instance,
as mentioned, the
data interface can be thought of as serving as the input for the remainder of
the data operations
carried out (or facilitated) by the organizer software tool. In this way, the
remainder of the data
operations will not depend on the underlying data source directly but rather
will be "shielded" by
the data interface. In this way, if the organization makes a change to some
part of the underlying
source data (such as changing a column name or a column's data type), this
change may not
corrupt the rest of the downstream operations of carried out (or facilitated
by) the organizer
software tool, as those operations refer to and thus utilize the table and
column interfaces rather
than the source data directly.
[146] Returning to FIG. 7 at block 710, the computing device may next
import the
underlying source data records to an organizer data store (e.g., organizer
data store 114B) and
apply the filter/clean operations that were established in accordance with the
previous block 708.
For instance, in one embodiment, as the data is being imported into the
organizer data store, the
computing device may apply the filter/clean operations and store the data in
the organizer data
store after applying the set of established filter/clean operations.
37

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[147] In an alternative embodiment, the computing device may not import the
underlying
source data records to an organizer data store before applying the
filter/clean operations. Rather,
the computing device may apply the filter/clean operations to the data as it
is stored in the
underlying data source. As a result, the computing device may determine if
there are any
exceptions or errors that result from these operations and, if there are,
present these exceptions or
errors to a user for resolution. The set of filter/clean operations can be
stored in a data store, such
as organizer data store 114B.
[148] Moving on to block 712, the computing device may next define a set of

transformations to apply to the source data stored in the organizer data
store. As a general matter,
a transformation may be any change, manipulation, or combination of data,
and/or assignment of
the data to an aspect of the semantic data network, such as a node or a link
in a digital context.
Transformations can be thought of as operations that re-shape the input data
and form the basis of
pipelines that in turn can be thought of as moving data into the semantic
network. More specific
examples of transformations may include, for instance, removing duplicates,
converting data
types, removing certain characters from data, concatenating columns,
extracting dates, filtering
rows, aggregating rows, merging tables, creating a new virtual table or data
source, and/or
assigning data to an aspect of the semantic network, such as a node or link in
the semantic network,
among other possibilities. Transformations may be applied to the data stored
in the organizer data
store in various ways. As one possibility, transformations may be applied to
data from a single
data table from a single data source. As another possibility, transformations
may be applied to
data from multiple data tables from multiple data sources. A set of
transformations established for
a particular data table may be referred to as a "sequence" and a set of
sequences for all the tables
in a particular data source may be referred to as a "data contract."
[149] To facilitate establishing transformations to apply to the source
data stored in the
organizer data store, the computing device may present a GUI through which a
user may provide
various user inputs in order to select or otherwise define the set of
transformations to apply for a
particular data table or data source. To illustrate with an example, FIG. 9 is
an example snapshot
900 of a GUI that displays a "data source" graphical location 902 that is
configured to display data
tables and data sources that have been imported into the organizer data store
(or created via a
transformation, as will be described herein), a "data contract" graphical
location 904 that is
configured to display the set of transformations for each data table of the
data sources that have
been imported into the organizer data store, and a "node/link" graphical
location 906 that is
configured to display the nodes, links, and/or properties of the semantic
network.
38

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[150] Within the data contract graphical location 904, for instance, the
computing device
may display the set of transformations that have been established for a
particular data table. As
depicted in FIG. 9, for instance, the data contract graphical location 904
depicts indications that
transformations that have been established for the "VPData Assets," VPSYS
Instances," and
"VPData LoadCount" data tables. In some embodiments, the computing device may
provide the
ability to select indications of the transformations (e.g., "VPData Assets
Node Model") and
responsively display further details on the transformation as well as provide
an area within which
a user can modify the transformations or add new transformation. For instance,
upon the selection
of an "Add Transform" button or the like, the computing device may provide a
listing of possible
transformations to select from to apply to some or all the data in a
particular data table. As depicted
in FIG 9, for instance, an "Assign Node" transform has been established for
the "Model" column
in data table "VPData Assets" from data source "VPData." The data from the
"Model" column
has been assigned to the "Model" node in the semantic network.
[151] As mentioned, one type of transformation for a data table may result
in some or all
of the data table being resolved into a new, "virtual" data source referred to
in some examples as
a "from contract" data source. A virtual data source may then be treated as if
it were a traditional
data source connected to by the computing device, in that additional
transformations may be
applied to data tables and columns of the virtual data source, such as
assigning data columns to
nodes, links, and properties of the semantic network. As depicted in FIG. 9,
for instance, the data
source graphical location 902 depicts a "From Contract" virtual data source
having several virtual
data tables, including "AO Short Term," "AssetDaily," "LC Short Term," and
"Short TermDate,"
although in practice others are possible.
[152] It may be advantageous to apply a transformation to one or more data
tables of a
data source to form a virtual data source in instances where it has been
identified that a particular
kind of data would be desirable, but the organization does not store that type
of data. For instance,
an organization may store data in a first data table related to customer
information, such as the
customer name and customer address, and may store data in a second data table
related to
purchases made by customers, such as date of purchase, products purchased,
quantity purchased,
etc. The organization may not necessarily, however, store roll-up data that
aggregates purchase
information and customer information. It may be advantageous to establish such
roll-up data in a
virtual data source, in order to help prevent the number of transformations
applied to a given data
table from becoming unwieldy, help an operator or other subject matter expert
visualize the data,
help assist an operator in applying additional transformations, such as
assigning the roll-up data
to a node, link, or property in a semantic network, among other possible
advantages. As an
39

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
example, by establishing an appropriate set of transformations, a user may
enroll a virtual data
source that includes, say, a first table that aggregates by month all the
purchases made from
customers who are located in midwestern states, east-coast states, southern
states, etc., and, say, a
second table that aggregates by product all the purchases made by customers on
a more granular
geographic level, such as by state or even city. Of course, this is merely an
example; in practice
there may be many other types of virtual data sources that can be established
as well as many other
reasons for establishing a virtual data source.
[153] As transformations are established for the various data tables of the
enrolled data
sources, the computing device may engage in some additional functionality. For
instance, based
on a first set of transformations applied to a first set of data, the
computing device may identify a
second set of data with names or other properties similar to names or other
properties of the first
set of data and the computing device may establish a second set of transforms
for the second set
of data that are similar to the first set of transforms. To illustrate with an
example, one table of
one data source may contain data related to an organization's customers. This
data table may
include columns such as "CustomerID," "CustomerName," "CustomerAddress,"
"CustomerCity," "CustomerState," "CustomerCountry," etc. A second table of a
second data
source may also contain data related to the organization's customers and
include such columns as
"CustID," "CustName," "CustPaymentTerms," etc. In this example, if the
computing device
establishes a transformation (e.g., through a user providing a suitable user
input via a GUI)
whereby the column "CustomerID" from the first table of the first data source
is assigned to the
"customer" property of the "customer" node in the semantic database, then the
computing device
may use that transformation as a basis to establish similar transformations.
For instance, the
computing device may recognize that the other columns in the same data table
also contain the
same word "Customer" as the first column and may thus use this as a basis to
establish
transformations for these columns that assign these columns to various other
properties in the
"customer" node.
[154] Similarly, the computing device may recognize that the second table
of the second
data source also contains columns each with the prefix "Cust" in them (which
the computing
device may determine is similar to "Customer") and each column contains
similar suffixes to the
columns of the first data table in the first data source (e.g., "ID" and
"Name"). The computing
device may thus use this as a basis to also establish transformations for
these columns that are
similar to the transformations established for the first data table in the
first data source (i.e., assign
these columns to various other properties in the "customer" node).

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[155] The computing device may also take into account other factors in
determining
whether to establish certain transformations for a particular set of data. For
instance, in one
embodiment, when determining whether to establish or suggest transformations
for a first set of
data columns, the computing device may compare the underlying data records for
the first set of
data columns to underlying data records for another set of data columns (e.g.,
from the same or
different data source) to determine whether the records contain a threshold
number of similar
entries. If the computing device determines that the underlying data records
contain such a
threshold number of similar entries, then the computing device may use this as
a basis for applying
the same transformations to the first set of data columns that have been
established for the other
set of data columns. In some embodiments, the computing device may not
automatically establish
transformations but rather present suggested transformation to apply and,
through a GUI, present
a user with the option to adopt some or all of the suggested transformations.
The computing device
may establish or suggest transformations in other ways and may also engage in
other functionality
as well.
[156] In addition to establishing transformations, the computing device may
also
determine whether discrepancies exist in an organization's data and may
responsively highlight
these discrepancies and present an option or a suggestion to address the
determined discrepancy.
For instance, if one data table contains a first column name "CustomerName"
with a data record
of "AjaxPlumbing" and a second column name "Customer ID" with a corresponding
data record
of "255", and another data table contains a first column name "CustName" with
a data record of
"AjaxP" and a second column name "CustID" with a corresponding data record of
"255," the
computing device may determine that the columns "Customer ID" and "CustID" are
sufficiently
similar in name such that they refer to the same semantic concept. Thus, the
computing device
may recognize that data records for each column contain a similar entry
(namely, "255") and may
thus determine that the data corresponding to the other columns, namely
"CustName" and
"CustomerName" likely refer to the same entity. However, the computing device
may also
determine that underlying data records corresponding to the Customer ID "255"
for each data
table (namely, "AjaxP" and "AjaxPlumbing") are not sufficiently similar.
Responsively, the
computing device may indicate this discrepancy to a user, such as by
presenting the discrepancy
via a GUI. The computing device may also present the user with the option to
address this
discrepancy by, for instance, adopting one version of the customer name over
the other. For
instance, the computing device may provide the user with the option to select
one of the versions
of the customer name, and upon receiving a user input selecting one of the
customer names, the
computing device may establish a transformation that operates to apply the
selected customer
41

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
name in place of the non-selected customer name. The computing device may
address data
discrepancies in other ways as well.
[157] As mentioned, a set of transformations established for a particular
data table may
be referred to as a "sequence" and a set of sequences for all the tables in a
particular data source
may be referred to as a "data contract." In some embodiments, the computing
device may in one
or more ways present visualizations of the various data contacts established
for a given
organization. One example visualization may take the form of a data contract
tree view, which is
depicted by the example snapshot 1000 in FIG. 10. As depicted in location
1002, for instance, the
computing device may present the various transformations that have been
established for each
source data table in a nested tree format. This nested tree format may, among
other things, indicate
for each data table, the name and type of transformation established and may
further indicate any
additional details about each established transformation, such as the node,
link, or property of the
semantic network the data is assigned to. Other ways to present visualizations
of the various data
contacts established for a given organization may be possible as well.
[158] Returning to FIG. 7 at block 714, the computing device may next apply
the
established transformations to the source data stored in the organizer data
store and then export
the transformed data and store it in a semantic data store (e.g., sematic data
store 114C). This
action can be thought of as pairing the organization's data with the digital
context. As such, once
the data has been transformed and stored in the semantic data store, it
combines with the digital
context to form an instance of a digital duplicate. In one embodiment, as the
data is stored in the
semantic data store, the computing device may assign to each instance of the
data a unique
identifier that identifies the location within the semantic data store at
which each data instance is
located. For instance, this unique identifier may take the form of a unique
URL or an identifier of
a location within a SQL or NoSQL table, among other possibilities. In some
embodiments, the
transformations are not yet applied to the source data but, rather, data
describing the
transformations are stored in a data store, such as organizer data store 114B,
and later applied to
the source data as it is onboarded into the platform from the underlying data
sources 114A during
a batch update, or the like.
[159] Next at block 716, the computing device may establish an update
schedule by
which the computing device system will refresh the data stores. An
organization's data may
change over time. For instance, an organization may delete data records,
reformat data models,
and/or add new data sources altogether. To account for this, the disclosed
organizer software tool
may provide the ability to schedule updates to the various data stores,
whereby the computing
device periodically engages in some or all of the forgoing functionality
according to an established
42

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
schedule. For instance, the computing device may periodically engage in
applying filter/clean
operations and importing source data into the organizer data store as well as
engage in applying
transformations to data stored in the organizer data store and exporting the
transformed data from
the organizer data store to the semantic data store. One advantage to
establishing an update
schedule is that even when new underlying data sources are enrolled or have
modified filter/clean
operations or transformations established, the update can happen seamlessly
without a user having
to rebuild any filter/clean operations or transformations.
[160] In some embodiments for instance, the computing device may apply a
periodic
update schedule on a per-data-source basis. For instance, if an organization
updates data (e.g.,
purchasing data) in a first data source on an hourly basis, but updates data
(e.g., customer data) in
a second data source on a daily (or an even less-frequent basis) basis, the
computing device may
take advantage of this knowledge to engage in update actions that involve the
first data source
more frequently (e.g., hourly) than it may engage in update actions that
involve the second data
source (e.g., which may occur daily or weekly, as examples). To facilitate
this, the computing
device may present a GUI or a set of GUIs through which a user may provide
various user inputs
that the computing device may use to establish one or more update schedules
for an organization's
data. Other ways of establishing an update schedule by which the computing
device system will
refresh the data stores are possible as well.
[161] In some embodiments, the computing device may establish a trigger
event, upon
the occurrence of which the computing device will refresh the data stores.
Many different types
of trigger events are possible. For instance, one type of trigger event may
take the form of an
underlying data source being updated. When an underlying data source is
updated with new data
or data in the underlying data source has changed in some way, the computing
device may receive
an indication of such and may responsively engage in refresh actions, such as
applying filter/clean
operations and importing source data into the organizer data store as well as
applying
transformations to data stored in the organizer data store and exporting the
transformed data from
the organizer data store to the semantic data store.as one possibility. Other
types of trigger events
may be possible as well.
[162] In some embodiments, the computing device may cause the system to
store
multiple instances of an organization's data in the various data stores. In
this way, should some
change to the underlying data sources corrupt the organization's data as
stored in either or both of
the organizer data store or the semantic network data store, the computing
device may revert back
to using a previous (uncorrupted) version of the organization's data.
43

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[163] One way that the computing device may engage in updates to the
various data
stores is to selectively update just the data that has changed in the
underlying data source. To
facilitate this, the computing device may retain a timestamp or other
mechanism that indicates the
most recent time the computing device imported data into the organizer data
store and compare
that against a timestamp or other mechanism retrieved from the underlying data
store that indicates
when a data column or data table in the underlying data store was last
updated. When the
timestamp or other mechanism from the underlying data store indicates that a
set of data was
updated more recently than the computing device imported such data into the
organizer data store,
then the computing device may carry out an update process for just that set of
data that was updated
in the underlying data store more recently. Mechanisms other than a timestamp
may be used to
facilitate the computing device determining whether some set of data from the
underlying data
source has been updated more recently than was imported into the organizer
data store, such as
editable revision fields in the data source, update counters, or even data
fields that contain a flag
or other indication such as a text entry "finished," or the like, that
indicate when an update has
been completed.
[164] In some embodiments, the computing device may engage in data
validation
functions. For instance, as mentioned, an architect tool may be used to
establish the framework
by which an organization's data is set out in the semantic network (i.e., the
digital context). In
some cases, an operator may use the architect tool to define some portion of
the digital context,
switch to use the organizer tool to onboard some portion of the organization's
underlying data,
including establishing filter/clean operations and/or transformations, and
then switch back to the
architect tool to continue modifying or adding new aspects to the digital
context. To address
situations like this, the organizer tool may engage in a validation procedure
by which it identifies
any changes to the structure of the semantic network (such as the presence or
absence of nodes,
links, or properties) that affect or are related to established
transformations. For instance, if a
transformation were established that operated to assign a particular data
column to a particular
property of a particular node in the semantic network, and thereafter a change
occurred to the
semantic network such that the particular property and/or the particular node
were removed or
otherwise changed in the semantic network, then the computing device may
provide to a user an
indication of this change. In some embodiments, the computing device may
present though a GUI
an indication of the specifically affected transformation and may provide one
or more
recommendations on how to address the change (such as by revising the
transformation to assign
the data column to some other property of some other node, among other
possibilities). Other
ways to engage in data validation functions may be possible as well.
44

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[165] In some embodiments, the computing device may engage in a lineage
feature, by
which the computing device may use the list of transformations in a data
contact to determine the
data table and column(s) that feed each aspect of the semantic network, such
as the nodes,
properties, and links. The computing device may provide one or more
visualization tools that show
the "data lineage" between an aspect of the semantic network and one or more
underlying data
sources. In this way, a user can trace the lineage of an aspect of the
semantic network back to an
underlying data source and identifying transformations to that data along the
way so as to get a
better understanding of how the underlying data is sued to populate the
semantic network. This
lineage feature may be especially valuable in situations where many different
transformations are
applied to the underlying source data before it is mapped to aspects of the
semantic network.
C. Example Security Operations
[166] It may be advantageous to establish restrictions on accessing the
data as it exists in
the semantic network at the semantic network level. For instance, although as
set forth above, a
computing device may be configured to present one or more visualizations of an
entire semantic
network (i.e., presenting a visualization that includes each and every node
and link of the semantic
network), including displaying data related to the underlying data records
that have been assigned
to the nodes or links of the semantic network, it may be advantageous to, at
times, present a
visualization of just a portion of a semantic network (e.g., presenting a
visualization that includes
just a subset of the nodes and links of the entire semantic network),
including, for instance,
displaying data related to the underlying data records that have been assigned
to just a subset of
the nodes or links of the semantic network. This may be advantageous, for
instance, in situations
where an organization desires to give one type of user (e.g., employees)
access to one type of the
organization's data (e.g., employee data) but desires to restrict access of
that type of user to other
types of the organization's data (e.g., company-financial data).
[167] In some systems, data security operations may be applied at the data
source level.
For instance, access to entire data sources may be restricted on a user-by-
user basis with some
users having access to all data stored in a given data source and other users
not having access to
any of the data stored in the given data source. However, this may become
problematic when data
sources are used to store large amounts of an organization's data. In such
cases, a single data
source may, for instance, store multiple types of data (e.g., employee-related
data and company-
financial data), and thus it may be desirable to allow certain users to access
some but not all of the
data stored in a particular data source. Depending on the data source, this
may or may not be
possible. In situations where it is possible, the data security profiles may
be applied on a table-by-
table or column-by-column basis, which is a cumbersome and time-consuming
process to manage.

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
In situations where allowing certain users to access some but not all of the
data stored in a
particular data source is not possible, then different types of data may have
to be split out and
stored in separate data sources (e.g., with one data source storing employee-
related data and
another separate data source storing company-financial data, among other
possibilities). This too
may lead to added cost associated with implementing multiple data sources. And
in either case,
managing the security profiles when organizations implement changes in the
underlying data
sources may further increase the time and expense. For instance, when an
organization adds a
column or table in a data source or moves data from one column or table to
another column or
table in the same data source, all security profiles may have to be modified
to account for this
change.
[168] To address these situations, and perhaps others, disclosed herein is
an example
software tool that may facilitate engaging in data security operations for
semantic networks and
applying these data security operations at the semantic network level as
opposed to applying the
data security operations at the data source level. Applying the data security
operations at the
semantic network level rather than at the data source level may result in more
efficient application
of data security operations. Indeed, if data security operations are applied
at the semantic network
level, an organization may make a changes to the underlying data sources (such
as adding a data
source, moving data from one data source to another data source, or even
combining data from
multiple data sources into a single data source) without affecting the
existing application of the
data security operations.
[169] For instance, as will be appreciated with the benefit of the present
disclosure, if an
organization moves, say, employee data from one data source to another data
source that stores,
say, company-financial data, so long as the relationship between the
underlying data and the
semantic network remains unaffected (e.g., the employee-related data remains
assigned to the
various employee-related nodes and links of the semantic network), the
existing data security
profiles will remain unaffected too. In this way, for instance, users
authorized to access only the
employee-related data will continue to be able to access the employee-related
data (even though
it is now stored in another data source) and will not be granted access to the
company-financial
data as a result of the change to the underlying data source. Thus, an
organization may make
changes to the underlying data sources without having to reestablish or
otherwise modify the data
security profiles. These, and other advantages, will become apparent in view
of the entire
disclosure.
[170] As disclosed herein, data security operations applied at the semantic
network level
may include operations for establishing user permissions for accessing various
nodes and links of
46

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
a semantic network, which provides access to the underlying data assigned to
these nodes and
links (such as data related to a digital duplicate, including data related to
the digital context and
the digital content of a digital duplicate), and operations for generating
visualizations related to a
semantic network in accordance with these user permissions. In this respect,
and using the
software tool disclosed herein, an administrator or the like may authorize
particular authorized
users to access the underlying data assigned to certain selected portions of
the semantic network.
These authorized users may then be able to access the data assigned to the
selected portions of the
semantic network, access visualizations related to the underlying data
assigned to the selected
portions of the semantic network, determine insights related to the underlying
data assigned to the
selected portions of the semantic network, and manipulate data related to the
underlying data
assigned to the selected portions of the semantic network. However, the
authorized users may not
be able to access data assigned to other portions of the semantic network
(i.e., portions of the
semantic network that were not selected), meaning that the authorized users
may not be able to
access visualizations related to the underlying data assigned to the other
portions of the semantic
network, determine insights related to the underlying data assigned to the
other portions of the
semantic network, or manipulate data related to the underlying data assigned
to the other portions
of the semantic network.
11711 For purposes of illustration only, example operations are
described herein as being
carried out by a computing device, such as computing device 200 (FIG. 2),
which as described
above, may serve as one or more of client stations 113 (FIG. 1B) and/or back-
end platform 112
(FIG. 1B). In this respect, it should be understood that, depending on the
implementation, the
operations discussed herein below may be carried out entirely by a single
computing device, such
as one or more of client stations 113 or by back-end platform 112, or may be
carried out by a
combination of computing devices, with some operations being carried out by
back-end platform
112 (such as computational processes and data-access operations) and other
operations being
carried out by one or more of client stations 113 (such as display operations
and operations that
receive user inputs). However, other arrangements are possible as well.
[172] To help describe some of these operations, flow diagrams may also
be referenced
to describe combinations of operations that may be performed by a computing
device. In some
cases, a block in a flow diagram may represent a module or portion of program
code that includes
instructions that are executable by a processor to implement specific logical
functions or steps in
a process. The program code may be stored on any type of computer-readable
medium, such as
non-transitory computer readable media (e.g., data storage 204 (FIG. 2)). In
other cases, a block
in a flow diagram may represent circuitry that is wired to perform specific
logical functions or
47

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
steps in a process. Moreover, the blocks shown in the flow diagrams may be
rearranged into
different orders, combined into fewer blocks, separated into additional
blocks, and/or removed,
based upon the particular embodiment. Flow diagrams may also be modified to
include additional
blocks that represent other functionality that is described expressly or
implicitly elsewhere herein.
[173] Turning first to FIG. 11A, this figure presents a flow diagram 1100
depicting one
example of a process for engaging in data security operations for semantic
networks, and in
particular, for establishing, and presenting visualizations related to, a
"realm" of a semantic
network. As depicted, this process may generally involve the following
operations: (i) at block
1102, a computing device may receive an input indicating a desire to create a
realm of a semantic
network, (ii) at block 1104, the computing device may define a primary node
for the realm, and
(iii) at block 1106, the computing device may receive a selection of one or
mode nodes or links of
the semantic network to include in the realm. Each of these operations will
now be discussed in
further detail with reference, in some cases, to example snapshots of GUIs
that may facilitate some
or all of the disclosed functionality.
[174] Turning first to block 1102, the computing device may receive an
input indicating
a desire to create a "realm" of a semantic network. A realm of an
organization's semantic network
is generally a subset of nodes and links of the semantic network, where, in
some embodiments,
the subset of nodes and links relate to some aspect of the organization. For
instance, one type of
realm may be an "employee realm," which may include various nodes and links
that describe data
that ought to be accessible to various employees of the organization (such as
employee data) and
may not include nodes or links that describe data that ought not to be
accessible to various
employees (such as company financial data). As another example, another type
of realm may be
a "customer realm," which may include various nodes and links that describe
data that ought to
be accessible to various customers of the organization (such as customer data
and/or order data)
and may not include nodes or links that describe data that ought not to be
accessible to various
customers (such as employee data and/or company financial data). Other types
of realms are
possible as well.
[175] A computing device may receive an input indicating a desire to create
a realm in
various ways. As one example, a computing device may present via a GUI one or
more screens
through which a user may provide to the computing device various user inputs,
including an
instruction to create a new realm and other various identifying information
for the new realm. To
illustrate one example of this, FIG. 12A depicts an example snapshot 1200 of a
GUI through which
a user may provide a user input indicating a desire to create a new realm. For
instance, as depicted,
the GUI may include a realm navigation panel 1202 that includes an "Add Realm"
button 1204.
48

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
In operation, the computing device may receive a user selection of the "Add
Realm" button (e.g.,
by way of a mouse click or touchscreen tap, among other possibilities) and may
responsively
display an information window 1206 through which the user may provide
additional user inputs
to, for instance, to name the new realm and provide a text description of the
new realm. Other
ways for a computing device to receive an input indicating a desire to create
a new realm may be
possible as well.
[176] Returning to FIG. 11A at block 1104, the computing device may next
define what
is referred to as a "primary node" for the realm. A primary node for a realm
may generally be a
node of a semantic network that contains underlying data records desired to be
used as the basis
to limit access to the underlying data of the realm. For instance, for an
"employee realm," which
may be designed to include nodes and links of a semantic network that ought to
be accessible to
employees of an organization and not include nodes or links that ought not to
be accessible to
employees of the organization, a primary node for this realm may an "employee"
node. In this
way, and as will be described further herein, the underlying data records of
the employee node
may be used as a basis to limit access to the underlying data of the rest of
the realm. For instance,
in one example, a particular data record of the primary node may be selected
(e.g., a particular
employee) and, for a given user or set of users, access to the underlying data
of a realm may be
limited to just those data records that relate to the selected data record of
the primary node. Other
examples of primary nodes are possible as well.
[177] A computing device may define a primary node in various ways. As one
example,
a computing device may receive a user input indicating a particular node in
the semantic network
to use as the primary node for the realm being created. To facilitate this,
the computing device
may, as noted above, present via a GUI one or more screens through which a
user may provide to
the computing device various user inputs, including an indication or a
selection of a node to use
as the primary node for a realm being created. As one example of this, FIG.
12B depicts an
example snapshot 1210 of a GUI that is displaying an information window 1208
listing the various
nodes of the semantic network. Upon selection of one of the listed nodes
(e.g., the "employee"
node as depicted in FIG. 12B), the computing device may then define the
primary node for the
realm being created as this selected node as the primary node for the realm.
Other ways of defining
a primary node for a realm may be possible as well.
[178] Returning to FIG. 11A at block 1106, the computing device may next
receive a
selection of one or more nodes or links to define the realm. The computing
device may receive
this selection in a variety of ways. As one possibility, the computing device
may present a GUI
through which a user may provide a user input or series of user inputs that
comprise a selection of
49

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
one or more nodes or links. To illustrate one example of this, FIG. 13 depicts
an example snapshot
1300 of a GUI that displays an object selection panel 1302, a realm navigation
panel 1304, and a
network display location 1306. As depicted, the object selection panel 1302
may list the various
nodes and links of the entire semantic network. To select one or more of the
nodes or links of the
semantic network, the computing device may provide the ability to click on a
text listing of one
of the nodes or links and drag that text listing to the realm navigation panel
1304 to thus select
that node or link for inclusion into the realm. For instance, to select the
"Shipper" node for
inclusion into the realm being created, a user may click on the "Shipper" node
text entry and
corresponding icon listed in the object selection panel 1302 and drag it over
into the realm
navigation panel 1304. Likewise, to select the "Orders" link for inclusion
into the realm being
created, a user may click on the "Orders" link text entry and corresponding
icon listed in the object
selection panel 1302 and drag it over into the realm navigation panel 1304.
Other ways of
providing a user input to indicate a selection of one or more nodes or links
of a semantic network
is possible as well.
[179] In some embodiments, when a link (such as "Orders" link) is selected
for inclusion
in a realm (e.g., by a user dragging and dropping a text entry and
corresponding icon of a link
from the object selection panel 1302 to the realm navigation panel 1304), the
computing device
also includes in the realm being created any nodes that are associated by the
selected link. For
instance, in the example semantic network depicted in FIG. 13, if a user
selected the "Employee
Location" link (e.g., by dragging and dropping a text entry and corresponding
icon of the
"Employee Location" link from the object selection panel 1302 to the realm
navigation panel
1304), the computing device may also populate in the realm navigation panel
1304 the
"Employee" and "Address" nodes, thus selecting these nodes for inclusion into
the realm being
created as well.
[180] As also depicted in FIG. 13, the computing device may provide a
visualization to
help preview what nodes and links have been selected for inclusion into the
realm being created.
For instance, as depicted in display location 1306, the computing device may
display the
"Employee Location," "Employee Territories," and "Orders" links as smaller
circles and may also
display the "Employee," "Address," "Territory," "Product," "Customer,"
"Shipper," "Order
Date," "Ship Date," "Required Date," and "Vendor" nodes as each of these nodes
are associated
with one or more of the selected links. As depicted, this realm includes just
a subset of the nodes
and links of the full semantic network (e.g., as depicted in FIG. 4). The
computing device may
provide other visualizations to help preview and/or visualize the realm as
well. Notably,

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[181] Turning next to FIG. 11B, this figure presents a flow diagram 1120
depicting
another example of a process for engaging in data security operations for
semantic networks, and
in particular, for establishing and applying security profiles for instances
of a semantic network,
including realms. As depicted, this process may generally involve the
following operations: (i) at
block 1122, a computing device may receive an indication of a user for which
to apply a security
profile, (ii) at block 1124, the computing device may receive a section of a
permission profile to
apply for the user, (iii) at block 1126, the computing device may receive a
selection of a semantic
network instance, (iv) at blocks 1128 and 1130, the computing device may
either or both receive
a selection of semantic context information to block for the user or receive a
selection of semantic
context information to selectively filter for the user, and (v) at block 1132,
the computing device
may thereafter present a visualization of the semantic network to the user
implementing either or
both of the blocked context information or the filtered context information.
Each of these
operations will now be discussed in further detail with reference, in some
cases, to example
snapshots of GUIs that may facilitate some or all of the disclosed
functionality.
[182] Turning first to block 1122, the computing device may first receive
an indication
of a user for which to create and apply a security profile. To facilitate
receive this indication, the
computing device may present via a GUI one or more screens though which a user
may provide
various user inputs in order to provide the computing system with an
indication of a user for which
to create and apply a security profile. To illustrate one example of this,
FIG. 14A depicts an
example snapshot 1400 of an admin tool that presents a GUI through which a
user can provide
one or more user inputs that facilitate the computing device establishing and
applying security
profiles for instances of a semantic network, including realms. As depicted,
the GUI may provide
a user information area 1402 within which a user may provide various user
inputs to provide the
computing device with an indication of a user for which to create and apply a
security profile. For
example, as depicted, the user information area 1402 may include a location
for receiving a user
input to provide a user name and locations for receiving other information
about the user, such as
a last name, first name, email address, cell phone, etc. Other ways for
receiving an indication of a
user for which to create and apply a security profile may be possible as well.
[183] Returning to FIG. 11B at block 1124, the computing device may next
receive a
selection of a permission profile to apply for the user. A permission profile
may define what a
user can and cannot do with respect to modifying various aspects of a semantic
network or a realm
of a semantic network. For instance, one type of permission profile may be a
"user" profile,
whereby a user granted with a "user" permission profile would only be able to
view information
contained within the semantic network but may not be able to modify any
information, such as by
51

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
creating new realms, or creating or modifying nodes, links, or properties.
Another type of
permission profile may be a "super user" profile, whereby a user granted with
this profile would
be able to view information contained within the semantic network and also
modify insights
related to the information, and perhaps other aspect about the semantic
network, such as creating
or modifying realms and/or nodes, like, or properties. And yet another type of
permission profile
may be an "administrator" profile, whereby a user granted with this profile
would be able to view
and modify information contained within the semantic network as well as grant
and modify
permissions for other users. Finally, yet another type of permissions profile
may be a specific
group profile, whereby a user granted with a specific group profile may have
one of the various
types of permissions profiles discussed above (e.g., a user, super user, or
administrator profile),
perhaps with other individual security settings, the establishment of which is
described further
herein.
[184] The computing device may receive a selection of a permission profile
to apply for
a user in a variety of ways. As one possibility, the computing device may
present a GUI, such as
that depicted in snapshot 1400 of FIG. 14A, via which a user may provide a
user input selecting
one of a set of available user profiles to apply for the selected user. For
instance, the GUI may
include a drop-down list or the like that lists the available permission
profiles. A user may then
select one of the permissions profiles, through a mouse click or touchscreen
tap, or the like, to
thereby select one of these profiles. The computing device may receive a
selection of a permission
profile in other ways as well.
[185] Next at block 1126, the computing device may receive a selection of a
semantic
network instance. One type of semantic network instance may be an entire
semantic network, such
as a semantic network that may have been created for an organization, which
may include each
and every node, link, and property of the semantic network. Another type of
semantic network
instance may be just a subset of an entire semantic network, such as a realm,
described above. To
facilitate receiving a section of a semantic network instance, the computing
device may present
via a GUI, one or more drop-down lists that include the semantic network
instances (including
realms) that are available for selection. FIG. 14B depicts an example snapshot
1410 of a GUI that
depicts an example drop-down list that 1404 lists example semantic network
instances for
selection. Through this list, for instance, a user may select one or more of
the semantic network
instances listed, through a mouse click or touchscreen tap or the like, and
thereby provide the
computing device with a selection of a semantic network instance. The
computing device may
receive a section of a semantic network instance in other ways as well.
52

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[186] Next at block 1128, the computing device may receive a selection of
semantic
context information to block for the user. The computing device may ultimately
present
visualizations for the selected user and exclude from these visualizations any
semantic information
that has been blocked. For instance, the computing device may receive a
selection of one or more
nodes to block. In this case, when the computing device presents a
visualization related to the
semantic network (e.g., a visualization of the semantic network in a web-like
structure or a
visualization that displays data related to the underlying data records that
have been assigned to
nodes and links of the semantic network, where such data may be in the form of
a data table,
graph, or some other format outlining or summarizing the underlying data
records), the computing
device will omit from the visualization any data related the blocked node(s).
For instance, where
the visualization is a web-like structure illustrating the semantic network,
the computing device
will omit from this visualization any depiction of the blocked node(s). In
another example, where
the visualization displays data related to the underlying data records that
have been assigned to
nodes and links of the semantic network, such as a data table, graph, or some
other format
outlining or summarizing the underlying data records, the computing device
will omit from the
visualization any underlying data records that have been assigned to a blocked
node, and any
insights defined for the semantic network that contain or are related to the
data records assigned
to a blocked node.
[187] In another example, the computing device may receive a selection of
one or more
properties to block. In this case, the computing device may, like described
above, present
visualizations related to the semantic network that omit certain data, but the
computing device
may omit data on a more granular level. For instance, if the computing device
receives a selection
of one or more properties of one or more nodes to block, then the computing
device may present
visualizations for the selected user and exclude from these visualizations any
data related to the
blocked properties of the nodes, but include any data related to the other
properties of the nodes.
By way of example, if the computing device receives a selection of the
property "Last Name" of
the node "Employees" to block, then for any visualizations that the computing
device ultimately
presents to the given user, the computing device may omit any data related to
the "Last Name"
property of the "Employee" node (such as omitting from the visualization any
employee last
names) but including in the visualization the other properties of the
"Employee" node, such as
"First Name" and perhaps an "Employee ID," (assuming the computing device has
not also
received selections to block these properties as well), among other possible
properties.
[188] In some embodiments, the computing device may receive a selection of
semantic
context information to block for the user on a less granular level. In one
example of this, the
53

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
computing device may receive a selection of an entire semantic data type to
block for the user.
For instance, if the computing device receives a selection of an "Email
Address" semantic data
type to block for the user, then for any visualizations that the computing
device ultimately presents
to the given user, the computing device may omit any data that is assigned to
the "Email Address"
semantic data type no matter what node, property, or link the within which the
data happens to be
assigned.
[189] To facilitate receiving a selection of semantic context information
to block for the
user, the computing device may present via a GUI one or more graphical
locations within which
the computing device may receive user inputs to select semantic context
information to block for
the user. As depicted in FIG. 15A, for instance, snapshot 1500 depicts a GUI
that displays a
graphical location 1502 within which a user can select nodes of the semantic
network to block
and/or select properties of certain nodes of the semantic network to block. As
specifically depicted
in graphical location 1502, for instance, a user may have selected an "Agency"
node of a semantic
network to block. In this example, when the computing device ultimately
depicts a visualization
of the semantic network for this user, the computing device will omit from
this visualization any
depiction of the "Agency" node and/or any underlying data records that have
been assigned to the
"Agency" node, depending on the type of visualization, including any insights
that may have been
created that include data assigned to the "Agency" node
[190] Turning next to snapshot 1510 in FIG. 15B, this snapshot depicts a
GUI that
displays a similar graphical location 1512 within which a user can also select
properties of a node
or link of the semantic network to block. As specifically depicted in
graphical location 1512, for
instance, a user may have selected an "Agent Activity BusinessProcess" link
(which may, for
example, link an "Agent Activity" node and a "Business Process" node of the
semantic network),
and the user may have also selected a "TransactionPremiumAmt" property of the
selected link to
block. In this example, when the computing device ultimately depicts a
visualization of the
semantic network for this user, the computing device will omit from this
visualization any
depiction of the "TransactionPremiumAmt" property of the "Agent Activity
BusinessProcess"
link and/or any underlying data records that have been assigned to this
property, depending on the
type of visualization. Other ways to facilitate receiving a selection of
semantic context information
to block for the user are possible as well.
[191] Turning next to block 1130 in FIG. 11B, the computing device may
receive a
selection of semantic context information to selectively filter for the user.
As explained, the
computing device may ultimately present visualizations for the selected user.
However, if the
computing device has received a selection of semantic context information to
filter, then the
54

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
computing device may include in the visualization semantic information data
that conforms to the
filter but exclude from these visualizations any semantic information that
does not conform to the
filter.
[192] For instance, the computing device may receive a selection of one or
more nodes
and/or properties to selectively filter. In this case, when the computing
device presents a
visualization related to the semantic network (e.g., a visualization of the
semantic network in a
web-like structure or a visualization that displays data related to the
underlying data records that
have been assigned to nodes and links of the semantic network, where such data
may be in the
form of a data table, graph, or some other format outlining or summarizing the
underlying data
records), the computing device will include in the visualization only data
related to the one or
more nodes and/or properties that have been selected for the filter and
exclude from the
visualization data not related to the selectively-filtered nodes. For
instance, where the visualization
is a web-like structure illustrating the semantic network, the computing
device will only include
a depiction of the selected nodes or properties and will omit from this
visualization any depiction
of the other nodes or properties. In another example, where the visualization
displays data related
to the underlying data records that have been assigned to nodes and links of
the semantic network,
such as a data table, graph, or some other format outlining or summarizing the
underlying data
records, the computing device will only display data related to the selected
nodes or properties
and will omit from the visualization any underlying data records that are
related to other nodes or
properties.
[193] In another example, the computing device may receive a selection of
one or more
nodes, properties, and underlying data to filter. In this case, the computing
device may, like
described above, present visualizations related to the semantic network that
omit certain data, but
the computing device may include other those data records that match or relate
to the selected
underlying data records for the selected property of the selected node. For
instance, if the
computing device receives a selection of one or more data records of a
particular property of a
particular node, then the computing device may present visualizations for the
selected user that
include the underlying data records or data related to the underlying data
records, and exclude
from these visualizations any other data records. By way of example, if the
computing device
receives a selection of the "John Smith" underlying data record for the
"Employee Name" property
for the "Employee" node, then for any visualizations that the computing device
ultimately presents
to the given user, the computing device may only include in these
visualizations data records that
are or are related to the "John Smith" employee and will omit from the
visualization other data
records that are or that relate to other employees.

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[194] In some embodiments, the computing device may receive a selection of
semantic
context information to selectively filter for the user on a less granular
level. In one example of
this, the computing device may receive a selection of an entire semantic data
type to selectively
filter for the user. For instance, if the computing device receives a
selection of an "Email Address"
semantic data type to filter for the user, then for any visualizations that
the computing device
ultimately presents to the given user, the computing device may include in
these visualizations
only such data that is assigned to the "Email Address" semantic data type no
matter what node,
property, or link the within which the data happens to be assigned related to
the "Email Address"
semantic data type and omit from the visualization any data.
[195] To facilitate receiving a selection of semantic context information
to block for the
user, the computing device may present via a GUI one or more graphical
locations within which
the computing device may receive user inputs to select semantic context
information to selectively
filter for the user. As depicted in FIG. 16A, for instance, snapshot 1600
depicts a GUI that displays
a graphical location 1602 within which a user can select a node, property,
and/or underlying data
records assigned to the property and node to selectively filter for the user.
As specifically depicted
in graphical location 1602, for instance, a user may have selected the
"Agency" node of a semantic
network, and the "BrokerPartyID" property of that node. Further, the user may
have selected
underlying data records of the "BrokerPartyID" property that match data values
of "84" and "87."
In this example, when the computing device ultimately depicts a visualization
of the semantic
network for this user, the computing device will only include in this
visualization underlying data
related to these selected "BrokerPartyID" values. Depending on the embodiment,
including in this
visualization underlying data related to these selected "BrokerPartyID" values
may mean that the
computing device would also include in the visualization underlying data from
other nodes and
other properties of the semantic network, so long as this other data is
related to the selected
"BrokerPartyID" values. For instance, if another node, say, a "Sales" node,
contained data related
to sales made by particular brokers, including for instance brokers
represented by the selected
"BrokerPartyID" values, then the computing device may also include in the
visualization
underlying data assigned to these sales (e.g., data related to sales made by
the brokers represented
by these selected "BrokerPartyID" values).
[196] FIG. 16B depicts another snapshot 1610 of a GUI that includes a
graphical location
1612 that depicts various filters that have been applied for a particular
user. For instance, as
depicted, the computing device may have received multiple selections of
semantic context
information to selectively filter for the user. As depicted in this particular
example, the computing
device may have received a selection of data values "84" and "87" from the
"BrokerPartyID"
56

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
property of the "Agency" node and a selection of a data value "CN" from the
"POLBusinessType"
property of the "LOB" node. In practice, the computing device may receive many
different
selections of semantic context information to selectively filter for a given
user. When a computing
device receives multiple selections of semantic context information to filter
for a given user, such
as is depicted in graphical location 1612 in FIG. 16B, the computing device
may include in a
visualization underlying data so long as the underlying data relates to one
(or more) of the selected
semantic context information. Other ways to depict semantic context
information that has been
selectively filtered for a given user and other ways to facilitate receiving a
selection of semantic
context information to selectively filter for a given user are possible as
well.
[197] In embodiments in which, at block 1126, the computing device received
a selection
of a realm as the selection of a semantic network instance, then the computing
device may engage
in additional functionality in connection with the functionality described
above with respect to
block 1130. For instance, responsive to receiving a selection of a realm as
the selection of the
semantic network instance, the computing device may retrieve from data storage
the node that
was defined as the primary node during creation of the realm (e.g., as
described above with respect
to block 1104 (FIG. 11A). Further, the computing device may responsively
prompt the user (via
a GUI) to select one or more underlying data records assigned to a property of
this primary node
to selectively filter for the user, similar to the procedure described above
with respect to block
1130.
[198] For instance, in an example in which an "Employee" realm has its
primary node
defined to be the "Employee" node, then the computing device may ultimately
prompt the user
(via a GUI) to select one or more underlying data records assigned to the
"Employee" node (e.g.,
data records that represent individual employees) to selectively filter for
the user. In this way,
when the computing device presents to the user a visualization related to the
"Employee" realm,
the computing device may include in this visualization only those underlying
data records (e.g.,
data records related one or more employees) that are contained within the
nodes and links selected
for inclusion in the realm (e.g., as described above with respect to block
1106 (FIG. 11A)) and
that also relate to the underlying data records of the primary node selected
during the
aforementioned step. In this way, a realm (e.g., an "Employee" realm) can be
designed to include
only those nodes and links of the semantic network that include data that
ought to be viewed by
certain users (e.g. employees) and can be further designed to limit which data
records are
accessible depending on which user (e.g., which employee) is viewing the
visualization related to
the realm.
57

CA 03141742 2021-11-23
WO 2020/243420 PCT/US2020/035104
[199] Finally, returning to FIG. 11B, at block 1132 the computing device
may present to
the user a visualization of the semantic network implementing either or both
of the blocked context
information and/or the filtered context information. As indicated above, if
the computing device
received a selection of semantic context information to block, then the
computing device may
present a visualization related to the semantic network (e.g., a visualization
of the semantic
network in a web-like structure or a visualization that displays data related
to the underlying data
records that have been assigned to nodes and links of the semantic network,
where such data may
be in the form of a data table, graph, or some other format outlining or
summarizing the underlying
data records) but omit from the visualization any data related the blocked
semantic context
information. As also described above, if the computing device received a
selection of semantic
context information to selectively filter, then the computing device may
present a visualization
that displays data related to the underlying data records that have been
assigned to nodes and links
of the semantic network, where such data may be in the form of a data table,
graph, or some other
format outlining or summarizing the underlying data records), including in
this visualization only
data related to the one or more nodes and/or properties that have been
selected for the filter and
exclude from the visualization data not related to the selectively-filtered
nodes. Other ways to
present to the user a visualization of the semantic network implementing
either or both of the
blocked context information and/or the filtered context information are
possible as well.
VI. CONCLUSION
[200] Example embodiments of the disclosed innovations have been described
above.
Those skilled in the art will understand, however, that changes and
modifications may be made to
the embodiments described without departing from the true scope and spirit of
the present
invention, which will be defined by the claims.
[201] Further, to the extent that examples described herein involve
operations performed
or initiated by actors, such as "humans," "operators," "users" or other
entities, this is for purposes
of example and explanation only. The claims should not be construed as
requiring action by such
actors unless explicitly recited in the claim language.
58

Representative Drawing

A single figure which represents the drawing illustrating the invention.

Administrative Status

For a clearer understanding of the status of the application/patent presented on this page, the site Disclaimer , as well as the definitions for Patent , Administrative Status , Maintenance Fee and Payment History should be consulted.

Administrative Status

Title	Date
Forecasted Issue Date	Unavailable
(86) PCT Filing Date	2020-05-29
(87) PCT Publication Date	2020-12-03
(85) National Entry	2021-11-23
Examination Requested	2022-09-26

Abandonment History

There is no abandonment history.

Maintenance Fee

Last Payment of $100.00 was received on 2023-05-26

Upcoming maintenance fee amounts

Description	Date	Amount
Next Payment if small entity fee	2024-05-29	$50.00
Next Payment if standard fee	2024-05-29	$125.00

Note : If the full payment has not been received on or before the date indicated, a further fee may be required which may be one of the following

the reinstatement fee;
the late payment fee; or
additional fee to reverse deemed expiry.

Patent fees are adjusted on the 1st of January every year. The amounts above are the current amounts if received by December 31 of the current year.
Please refer to the CIPO Patent Fees web page to see all current fee amounts.

Payment History

Fee Type	Anniversary Year	Due Date	Amount Paid	Paid Date
Application Fee		2021-11-23	$408.00	2021-11-23
Maintenance Fee - Application - New Act	2	2022-05-30	$100.00	2022-05-23
Request for Examination		2024-05-29	$814.37	2022-09-26
Maintenance Fee - Application - New Act	3	2023-05-29	$100.00	2023-05-26

Owners on Record

Note: Records showing the ownership history in alphabetical order.

Current Owners on Record
TADA COGNITIVE SOLUTIONS, LLC

Past Owners on Record
None

Past Owners that do not appear in the "Owners on Record" listing will appear in other documentation within the application.

Documents

To view selected files, please enter reCAPTCHA code :

To view images, click a link in the Document Description column. To download the documents, select one or more checkboxes in the first column and then click the "Download Selected in PDF format (Zip Archive)" or the "Download Selected as Single PDF" button.

List of published and non-published patent-specific documents on the CPD .

If you have any difficulty accessing content, you can call the Client Service Centre at 1-866-997-1936 or send them an e-mail at CIPO Client Service Centre.

Filter

Download Selected in PDF format (Zip Archive)

Download Selected as Single PDF

Document Description	Date (yyyy-mm-dd)	Number of pages	Size of Image (KB)
Abstract	2021-11-23	2	78
Claims	2021-11-23	5	193
Drawings	2021-11-23	18	1,546
Description	2021-11-23	58	3,846
Representative Drawing	2021-11-23	1	25
International Search Report	2021-11-23	1	52
National Entry Request	2021-11-23	8	223
Cover Page	2022-01-14	1	54
Request for Examination / Amendment	2022-09-26	39	1,663
Claims	2022-09-26	33	2,000
Description	2022-09-26	58	5,402
Examiner Requisition	2024-02-09	7	330

Language selection

Menus

English Abstract

French Abstract

Administrative Status

Abandonment History

Maintenance Fee

Payment History

Your request is in progress.

Requested information will be available
in a moment.

Thank you for waiting.

Patent 3141742 Summary

English Abstract

French Abstract

Administrative Status

Abandonment History

Maintenance Fee

Payment History

Your request is in progress.Requested information will be availablein a moment.Thank you for waiting.

Your request is in progress.

Requested information will be available
in a moment.

Thank you for waiting.