Patent 1341154 Summary


(12) Patent: (11) CA 1341154
(21) Application Number: 1341154
(54) English Title: MULTIPROCESSOR DIGITAL DATA PROCESSING SYSTEM
(54) French Title: SYSTEME DE TRAITEMENT DE DONNEES NUMERIQUES A PLUSIEURS PROCESSEURS
Status: Expired and beyond the Period of Reversal
Bibliographic Data
(51) International Patent Classification (IPC):
  • G06F 15/167 (2006.01)
  • G06F 13/18 (2006.01)
(72) Inventors :
  • FRANK, STEVEN J. (United States of America)
  • BURKHARDT, HENRY III (United States of America)
  • GOODMAN, NATHAN (United States of America)
  • MARGULIES, BENSON I. (United States of America)
  • WEBER, FREDERICK D. (United States of America)
  • LEE, LINDA O. (United States of America)
(73) Owners :
  • SUN MICROSYSTEMS, INC.
(71) Applicants :
  • SUN MICROSYSTEMS, INC. (United States of America)
(74) Agent: RICHES, MCKENZIE & HERBERT LLP
(74) Associate agent:
(45) Issued: 2000-12-12
(22) Filed Date: 1988-11-08
Availability of licence: N/A
Dedicated to the Public: N/A
(25) Language of filing: English

Patent Cooperation Treaty (PCT): No

(30) Application Priority Data:
Application No. Country/Territory Date
136,930 (United States of America) 1987-12-22

Abstracts

English Abstract


A multiprocessor digital data processing
system comprises a plurality of processing cells
arranged in a hierarchy of rings. The system
selectively allocates storage and moves exclusive
data copies from cell to cell in response to access
requests generated by the cells. Routing elements
are employed to selectively broadcast data access
requests, updates and transfers on the rings.
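The exclusive-copy migration the abstract describes (storage allocated only in the requesting cell, with copies elsewhere deallocated, as claims 1 and 2 below recite) can be sketched roughly as follows. This is an illustrative model only, not the patented implementation; all names (`Cell`, `MemoryManager`, `request_ownership`) are hypothetical.

```python
# Illustrative sketch of exclusive-ownership migration: when a cell requests
# ownership of a datum, physical storage is allocated only in that cell's
# memory element, and any copy held by another cell is deallocated.
# All names here are hypothetical, not taken from the patent.

class Cell:
    """A processing cell: a CPU paired with its local memory element."""
    def __init__(self, cell_id):
        self.cell_id = cell_id
        self.memory = {}          # address -> value (locally allocated data)

class MemoryManager:
    """Tracks which cell exclusively owns each address."""
    def __init__(self, cells):
        self.cells = cells
        self.owner = {}           # address -> owning Cell

    def request_ownership(self, requester, address, initial=None):
        # Move the exclusive copy into the requester's memory element,
        # deallocating it from the previous owner (cf. claims 1 and 2).
        prev = self.owner.get(address)
        if prev is requester:
            return requester.memory[address]
        if prev is not None:
            value = prev.memory.pop(address)    # deallocate elsewhere
        else:
            value = initial
        requester.memory[address] = value       # allocate only here
        self.owner[address] = requester
        return value

cells = [Cell(i) for i in range(3)]
mm = MemoryManager(cells)
mm.request_ownership(cells[0], 0x100, initial=42)
mm.request_ownership(cells[1], 0x100)   # exclusive copy migrates to cell 1
```

In this sketch only one cell ever holds the datum, mirroring the "exclusive data copies" the abstract mentions; the real system adds read-only and atomic states (see claims 31 and 35) that this toy model omits.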


Claims

Note: Claims are shown in the official language in which they were submitted.


The embodiments of the invention in which an exclusive
property or privilege is claimed are defined as follows:
1. A digital data processing apparatus comprising
A. plural memory elements for storing information-representative
signals, each of at least one of which memory
elements is respectively coupled to an associated central
processing unit,
each said central processing unit and its respective
associated memory element comprising at least part of a
processing cell,
B. memory management means coupled to said plural memory
elements for accessing one or more of said information-representative
signals stored in said plural memory elements,
C. at least a requesting one of said central processing
units including access request means for generating an access
request signal representative of a request for access to an
information-representative signal,
said access request means including means for
generating an ownership-request signal representative of a
request for priority access to an information-representative
signal,
at least the memory element associated with the
requesting central processing unit including control means for
selectively transmitting said access-request signal to said
memory management means, and
D. said memory management means including means responsive
to selected ones of said ownership-request signals for
allocating, only within the memory element associated with the
requesting central processing unit and exclusive of the memory
elements associated with the other central processing units,
physical storage space for the requested information-representative
signal and for storing that signal therein.
2. A digital data processing apparatus according to claim 1,
wherein said memory management means comprises deallocation means
responsive to selected ones of said ownership-request signals for
deallocating physical storage space allocated in a memory
element, other than the one associated with the requesting
central processing unit, for storage of the requested
information-representative signal.

3. A digital data processing apparatus according to claim 1,
wherein at least one said memory element includes an associated
directory means for storing an SVA identifier signal uniquely
identifying at least selected information-representative signals
stored in that memory element.
4. A digital data processing apparatus according to claim 3,
wherein
A. said access request means includes means for
generating, along with said access request signal, an SVA request
signal identifying the requested information-representative
signal,
B. said memory management means includes interface means
coupled with said directory means for comparing an SVA request
signal with one or more SVA identifier signals stored in that
directory means for determining whether the requested
information-representative signal is stored in the memory element
associated with that directory means.
5. A digital data processing apparatus according to claim 3,
wherein
A. each said directory means includes one or more
identifier storage locations for storing SVA identifier signals,
B. said memory management means includes means for storing
in one of said identifier storage locations an SVA identifier
signal associated with an information-representative signal for
which physical storage space has been allocated within the
associated memory element, and
C. said memory management means includes means for
invalidating an SVA identifier signal corresponding to an
information-representative signal for which physical storage
space has been deallocated within the associated memory element.
6. A digital data processing apparatus according to claim 1,
wherein
said access request means includes means for generating a
read-only request signal representative of a request for
secondary access to an information-representative signal, and
wherein

said memory management means includes means selectively
responsive to a read-only signal generated by a requesting one of
said central processing units, said read-only signal being
representative of a request for secondary access to an
information-representative signal stored in a memory element
associated with another of said other central processing units,
for transferring a copy of that information-representative signal
to the memory element associated with the requesting central
processing unit.
7. A digital data processing apparatus according to claim 1,
wherein
said central processing units include means for selectively
generating an update signal representative of modification of an
information-representative signal,
said memory management means includes means responsive to an
update signal generated by an updating one of said central
processing units for modifying a corresponding information-representative
signal stored in the memory element associated
with that central processing unit.
8. A digital data processing apparatus according to claim 7,
wherein said memory management means includes means responsive to
an update signal generated by said updating central processing
unit, said update signal being representative of modification of
a corresponding information-representative signal stored in a
memory element associated with another said central processing
unit, for modifying that corresponding information-representative
signal.
9. A digital data processing apparatus according to claim 1,
wherein
A. said access request means includes atomic-ownership
means for generating an atomic-request signal representative of a
request for exclusive priority access to one or more
information-representative signals,
B. said memory management means includes means selectively
responsive to an atomic-request signal generated by a first one
of said central processing units for
i) allocating, exclusively, in the memory element
associated with the first central processing unit physical

storage space for said one or more requested information-representative
signals and for storing those information-representative
signals therein, while
ii) preventing access to any of those information-representative
signals in response to a first selected access
request signal generated by a second one of said central
processing units.
10. A digital data processing apparatus according to claim 9,
wherein said memory management means includes means for
responding to a second selected access request generated by said
second central processing unit for effecting exclusive transfer
of requested information-representative signals to the memory
element associated with that central processing unit and for
preventing access to any of those information-representative signals
in response to access request signals generated by the first or any
other central processing unit.
11. A digital data processing apparatus according to claim 9,
wherein
A. said access request means further includes
atomic-transaction termination means for generating a release signal
representative of termination of said request for priority access
to said one or more information-representative signals, and
B. said memory management means includes means responsive
to said transaction-termination signal for re-enabling selective
access to those information-representative signals in response to
access request signals generated by any of said central
processing units.
12. A digital data processing apparatus according to claim 1,
wherein
A. said plurality of processing cells includes at least a
remote processing cell including an associated memory element,
and
B. remote interface means for transferring signals to and
from the memory element associated with said remote processing
cell.
13. A digital data processing apparatus according to claim 12,
wherein said remote interface means includes fiber optic

transmission media for carrying signals to and from said remote
cell.
14. A digital data processing apparatus according to any one of
claims 1 to 13, characterized in that
at least one of said memory elements includes means for
storing a data subpage that comprises one or more information-
representative signals and that forms part of a data page, said
data page comprising plural data subpages,
said access request means includes subpage-request
means for generating a signal representative of a request for
access to a data subpage stored in one or more of said memory
elements, and
said memory management means includes means responsive
to at least selected ones of said subpage-request signals for
allocating, within the memory element associated with the
requesting central processing unit, physical storage space for
the data page associated with the requested data subpage, and for
storing the requested data subpage therein.
15. A digital data processing apparatus as claimed in claim 14,
comprising deallocation means for deallocating physical storage
space allocated to a selected data page in one or more of said
memory elements prior to, or substantially concurrent with, the
allocation of said physical storage space for the data page
associated with the requested data subpage.
16. A digital data processing apparatus as claimed in claim 15,
characterized in that said deallocation means includes
recombination means for selecting for deallocation a data page
for which physical storage space is allocated in at least a first
memory element and a second memory element.
17. A digital data processing apparatus as claimed in claim 16,
characterized in that said recombination means includes means for
deallocating physical storage space allocated to said selected
data page in said first memory element and for transferring, from
said first memory element to said second memory element for
storage therein, one or more selected subpages of said selected
data page prior to, or substantially concurrent with, the
deallocation of physical storage space from said first memory
element.

18. A digital data processing apparatus as claimed in claim 15,
characterized in that said deallocation means includes LRU table
means associated with one or more of said memory elements for
storing one or more access history-representative signals
representative of a history of access of respective ones of the
data pages for which physical storage space is allocated therein.
19. A digital data processing apparatus as claimed in claim 18,
characterized in that
at least one said processing cell includes subcache means,
coupled to the central processing unit and to the associated
memory element, for storing one or more information-
representative signals of a data subpage in current use by that
central processing unit,
said subcache means including means for transferring
information-representative signals to and/or from that subcache
means and the associated memory element, and
said LRU table means includes means coupled to said subcache
means for generating an access history-representative signal
having a first value in response to a transfer of one or more
information-representative signals of a first data subpage
between that subcache means and its associated memory element.
20. A digital data processing apparatus as claimed in claim 19,
characterized in that said LRU table means includes means for
aging the access history-representative signal associated with
the first data subpage from said first value to a second value in
response to the transfer of one or more information-
representative signals of a second data subpage between said
subcache means and its associated memory element.
21. A digital data processing apparatus as claimed in claim 20,
characterized in that said deallocation means includes means for
identifying as a candidate for deallocation a data page having an
access history-representative signal greater than or equal to a
third value, wherein said third value is between said first value
and said second value.
22. A digital data processing apparatus as claimed in claim 16,
characterized in that at least one said memory element includes
an associated directory means for storing one or more descriptor
signals, each identifying a corresponding data page for which

physical storage space is allocated within that memory element,
each said descriptor signal including
i) a tag component signal identifying the
corresponding data page,
ii) at least one field component signal representative
of a status of one or more of the subpages which form
the corresponding data page.
23. A digital data processing apparatus as claimed in claim 22,
characterized in that the directory means associated with a first
memory element and a second memory element, both memory elements
of which include physical storage space allocated to the same
data page, store substantially the same tag component signal
corresponding to that data page.
24. A digital data processing apparatus as claimed in claim 23,
characterized in that said memory management means includes means
coupled to said recombination means for transferring one or more
of said field component signals from the directory means
associated with the first memory element to the memory means
associated with the second memory element in conjunction with the
transfer of at least selected data subpages therebetween.
25. A digital data processing apparatus as claimed in claim 22,
characterized in that
said directory means includes means for storing a field
component signal representative of a status including an invalid
state associable with one or more invalid data subpages of the
corresponding data page, and wherein
said deallocation means includes means for deallocating
physical storage space allocated to a data page associated with
said invalid state.
26. A digital data processing apparatus as claimed in claim 22,
characterized in that
said directory means includes means for storing a field
component signal representative of a status including a modified
state associable with one or more modified data subpages of the
corresponding data page, and wherein

said deallocation means includes means for deallocating
physical storage space allocated to a data page in which none of
the data subpages are associated with said modified state.
27. A digital data processing apparatus as claimed in claim 22,
characterized in that
said directory means includes means for storing a field
component signal representative of a status including a read-only
state associable with one or more data subpages of the
corresponding data page which data subpages are subject to
read-only access by a central processing unit associated with a memory
element in which physical storage space for that data subpage is
allocated, and
said deallocation means includes means for deallocating
physical storage space allocated to a data page in which all of
the data subpages are associated with said read-only state.
28. A digital data processing apparatus as claimed in claim 22,
characterized in that
said directory means includes means for storing a field
component signal representative of a status including an
exclusive owner access state associable with one or more data
subpages of the corresponding data page, which data subpages are
modifiable only by a central processing unit associated with a
memory element in which physical storage space for those data
subpages is allocated, and
said deallocation means includes means for deallocating
physical storage space allocated to a data page in which all of
the data subpages are associated with said exclusive owner state
and, thereby, removing a last copy of that data page from the
memory elements of said processing cells.
29. A digital data processing apparatus as claimed in claim 14,
characterized in that
at least the requesting central processing unit includes
execution means for executing a sequence of instructions, a first
instruction of which comprises a memory access request
instruction for causing said requesting central processing unit
to issue said request for access to said requested data subpage,
said execution means including means for

i) responding to said memory access request
instruction for generating said request for access to
said requested subpage, and
ii) commencing execution of the remaining instructions
of said first sequence,
said memory management means includes means for processing
said request asynchronously with respect to the execution of the
remaining instructions of said sequence.
30. A digital data processing apparatus as claimed in claim 29,
characterized in that said processing means includes means for
performing at least one of the following functions asynchronously
with respect to the execution of the remaining instructions of
said sequence:
i) allocating physical storage space for the data
page associated with said requested subpage,
ii) storing said requested data subpage in physical
storage space allocated
for the page associated with that data subpage, and
iii) deallocating physical storage space associated
with a data page in one or more memory elements.
31. A digital data processing apparatus as claimed in claim 29,
characterized in that
said execution means includes means for responding to a
selected request instruction for generating a PREFETCH access
request signal representing a request for asynchronous access to
a requested subpage in a requested access state,
said access state being selected from a group of access
states including one or more of
i) a read-only access state associable with a copy of an
information-representative signal subject to read-only access by
a central processing unit associated with a memory element in
which that copy is stored, wherein said information-representative
signal is stored in a memory element associated
with another of said central processing units and is modifiable
by that other central processing unit,

ii) a non-exclusive owner access state associable with an
information-representative signal modifiable by a central
processing unit associated with a memory element in which that
information-representative signal is stored, a copy of which
information-representative signal is stored in a memory element
associated with another said central processing unit,
iii) an exclusive owner access state associable with an
information-representative signal modifiable by a central
processing unit associated with a memory element in which
exclusive physical storage space for that information-
representative signal is allocated and in which that information-
representative signal is stored, for which information-
representative signal no physical storage space is allocated in
any other memory element, that information-representative signal,
or a copy thereof, being accessible for transfer to another
memory element, and
iv) an atomic owner access state associable with an
information-representative signal modifiable by a central
processing unit associated with a memory element in which
exclusive physical storage space for that information-
representative signal is allocated and in which that information-
representative signal is stored, for which information-
representative signal no physical storage space is allocated in
another memory element, neither that information-representative
signal, nor a copy thereof, being selectively accessible for
transfer to another memory element.
32. A digital data processing apparatus as claimed in claim 14,
characterized in that said memory management means comprises
backing store means for transferring information-representative
signals forming a selected data page to a secondary storage
device.
33. A digital data processing apparatus as claimed in claim 32,
wherein said backing store means includes means for transferring
to the secondary storage device, substantially concurrently with
the transfer of information-representative signals forming a
selected data page, one or more field component signals of the
descriptor signal corresponding to that selected data page.
34. A digital data processing apparatus as claimed in claim 33,
characterized in that

said memory management means including state-defining means
coupled with said plural memory elements for associating an
access state with one or more information-representative signals
stored therein, said access state including an atomic owner
access state associable with a data subpage modifiable by a
central processing unit associated with a memory element in which
that data subpage is stored, no copy of which data subpage is
stored in a memory element associated with another said central
processing unit, neither that information-representative signal,
nor a copy thereof, being selectively accessible for transfer to
a memory element associated with another said central processing
unit,
said descriptor signal includes at least one field component
signal representative of an atomic-modified state associable with
one or more data subpages of the corresponding data page which
subpage undergoes a transition into or out of atomic state,
said backing store means includes means for transferring to
the secondary storage device, substantially concurrently with the
transfer of information-representative signals forming a selected
data page, said atomic-modified state-representative field
component signal corresponding to that selected data page.
35. A digital data processing apparatus of the type having
plural memory elements for storing information-representative
signals, each of at least one of which memory elements is
respectively coupled to an associated central processing unit,
each said central processing unit and its respective associated
memory element comprising at least part of a processing cell,
characterized in that
A. at least a requesting one of said central processing
units includes access request means for generating an access
request signal representative of a request for access to an
information-representative signal,
B. said apparatus further including memory management
means coupled to said plural memory elements for accessing one or
more information-representative signals stored therein, and
C. said memory management means including state-defining
means coupled with said plural memory elements for associating an
access state with one or more information-representative signals
stored therein, said access state being selected from a group of
access states including one or more of

i) an exclusive owner access state associable with an
information-representative signal modifiable by a central
processing unit associated with a memory element in which
exclusive physical storage space for that information-
representative signal is allocated and in which that information-
representative signal is stored, for which information-
representative signal no physical storage space is allocated in
any other memory element, that information-representative signal,
or a copy thereof, being accessible for transfer to another
memory element, and
ii) an atomic owner access state associable with an
information-representative signal modifiable by a central
processing unit associated with a memory element in which
exclusive physical storage space for that information-
representative signal is allocated and in which that information-
representative signal is stored, for which information-
representative signal no physical storage space is allocated in
any other memory element, neither that information-representative
signal, nor a copy thereof, being selectively accessible for
transfer to a memory element associated with another said central
processing unit.
36. A digital data processing apparatus according to claim 35,
the further improvement wherein said state-defining means
includes
A. means for selectively and exclusively associating any
one of
i) said atomic owner access state, and
ii) said exclusive owner access state
with an information-representative signal stored in said plural
memory elements, and
B. means for selectively and exclusively associating said
non-exclusive owner access state with one of (n) copies of an
information-representative signal stored in said plural memory
elements, where (n) is an integer greater than or equal to one.
37. A digital data processing apparatus according to claim 36,
the further improvement wherein said state-defining means
includes means for selectively associating any one of

i) said read-only access state, and
ii) said invalid access state
with (n-1) copies of an information-representative signal stored
in said plural memory elements.
38. A digital data processing apparatus according to claim 35,
the further improvement wherein said memory management means
includes means responsive to association of said atomic access
state with an information-representative signal stored in a
memory element associated with a first central processing unit
for preventing access to that information-representative signal
in response to a first selected access request signal generated
by any other said central processing unit.
39. A digital data processing apparatus according to claim 38,
wherein
A. said memory management means includes means for
responding to a second selected access request signal generated
by a second central processing unit, and directed to one or more
information-representative signals associated with the atomic
owner access state, for effecting transfer of those requested
signals to the memory element associated with said second central
processing unit, and
B. said state-defining means includes means for
associating the atomic owner state with those transferred signals
for allocating, exclusively, in the memory element associated
with the second central processing unit physical storage space
for said one or more requested information-representative
signals.
40. A digital data processing apparatus according to claim 35,
wherein
A. said state-defining means includes means responsive to
an access request signal generated by a requesting central
processing unit and directed to an information-representative
signal stored in a memory element associated with an owning
central processing unit, said information-representative signal
being associated with said atomic access state, for associating
with that information-representative signal a transient atomic
state,

B. said owning central processing unit includes means for
generating a release signal representative of a release of said
requested information-representative signal from said atomic
access state,
C. said memory management means includes means responsive
to a release signal generated by said owning central processing
unit and directed to said requested information-representative
signal for transmitting a copy of that information-representative
signal to the memory element associated with the requesting
central processing unit.
41. A digital data processing apparatus according to claim 35,
the further improvement wherein
A. said access request means includes means for generating
an exclusive access request signal representative of a request
for exclusive access to an information-representative signal, and
B. said state-defining means includes means responsive to
an exclusive access request signal generated by said requesting
central processing unit, said exclusive access request being
representative of a request for exclusive access to an
information-representative signal stored in a memory element
associated with another of said central processing units, wherein
that information-representative signal is associated with said
read-only access state, for effecting exclusive transfer of that
information-representative signal to said requesting central
processing unit and for associating the copy of that information-
representative signal stored in the memory element associated
with that other central processing unit with said invalid access
state.
42. A digital data processing apparatus according to claim 35,
the further improvement wherein
A. said access request means includes means for generating
a non-exclusive access-request signal representative of a request
for non-exclusive access to an information-representative signal,
and
B. said state-defining means includes means responsive to
a non-exclusive access request signal generated by said
requesting central processing unit, said non-exclusive access
request signal being representative of a request for
non-exclusive access to an information-representative signal stored

in a memory element associated with another of said central
processing units, wherein that information-representative signal
is associated with any one of an exclusive access state and a
non-exclusive access state, for effecting exclusive transfer of
that information-representative signal to said requesting central
processing unit and for associating the information-
representative signal stored in the memory element associated
with that other central processing unit with said read-only
access state.
43. A digital data processing apparatus according to claim 35,
the further improvement wherein
A. at least a requesting one of said central processing
units includes stop request means for generating a stop request
signal representative of a pending request for atomic owner
access to an information-representative signal,
B. said memory management means includes means responsive
to said stop request signal for effecting exclusive storage of a
next available copy of the requested information-representative
signal in the memory element associated with a requesting central
processing unit, and
C. said state-defining means includes means responsive to
said stop request signal for assigning said atomic owner access
state to said next available copy of the requested information-
representative signal stored in the memory element associated
with said requesting central processing unit.
44. A digital data processing apparatus according to claim 35,
wherein said memory management means includes means responsive to
a request for priority access to an information-representative
signal generated by a first central processing unit for
responding to subsequent requests for access to that same
information-representative signal generated by others of said central
processing units for generating a transient signal for
transmission to said other central processing units, said
transient signal being representative of current ownership of
said requested information-representative signal by said first
central processing unit.
45. A digital data processing apparatus according to any one
of claims 1 to 13, or 15 to 44, comprising:

a plurality of information transfer domains each including
one or more segments, said plurality of information transfer
domains including a first information transfer domain having a
plurality of domain(0) segments, each including an associated
unidirectional bus element coupled in a ring configuration to
said memory management means for transferring signals between a
respective plurality of said processing cells.
46. A digital data processing apparatus according to claim 45,
comprising:
a domain(1) segment comprising an associated unidirectional
bus element and a plurality of routing elements, each said
routing element being connected to the bus element associated
with the domain(1) segment and to the bus element associated with
one of said domain(0) segments for transferring signals between
the domain(1) segment and the associated domain(0) segment, the
processing cells associated with each of said domain(0) segments
being capable of signal transfer with the processing cells
associated with the remainder of said domain(0) segments only
through the domain(1) segment.
47. A digital data processing apparatus according to any one
of claims 1 to 13 or 15 to 44 comprising:
A. (n) information transfer domains, each respectively
designated as information transfer domain(k), wherein (n) is an
integer greater than or equal to two, and wherein (k) represents
successive integers between (0) and (n-1), inclusive,
B. information transfer domain(0) including a plurality of
domain(0) segments, each such segment including a bus element
connected to said memory management means for transferring
signals between a plurality of said processing cells,
C. each information transfer domain(k), for (k) between
(1) and (n-1), inclusive, including one or more corresponding
domain(k) segments, wherein the number of segments in information
domain(k) is less than the number of segments in domain(k-1), for
each value of (k), and wherein information transfer domain(n-1)
includes only one such segment.
48. A digital data processing apparatus according to claim 47
wherein each said domain(k) segment includes

A. a bus element for transferring signals within that
domain(k) segment,
B. plural domain routing elements for transferring
information representative signals between that domain(k) segment
and a domain(k-1) segment, each such routing element being
connected for signal transfer with the respective domain(k)
segment bus element and with the respective domain(k-1) segment
bus element.
49. A method of operating a digital data processing apparatus,
said apparatus including plural memory elements for storing
information-representative signals, each of at least one of which
memory elements is respectively coupled to an associated central
processing unit, each said central processing unit and its
respective memory element comprising at least part of a
processing cell, the method comprising the steps of
A. generating within a requesting one of said central
processing units an ownership-request signal representative of a
request for priority access to an information-representative
signal,
B. determining whether the requested information-
representative signal is stored within a memory element other
than one associated with the requesting central processing unit,
and
C. responding to a determination that the requested
information-representative signal is stored in a memory element
other than the one associated with the requesting central
processing unit for allocating, only within the memory element
associated with the requesting central processing unit and
exclusive of the memory elements associated with the other
central processing units, physical storage space for the
requested information-representative signal and for storing that
signal therein.
50. A method of operating a digital data processing system
according to claim 49, wherein said responding step includes the
further step of
deallocating physical storage space, if any, allocated to
the requested information-representative signal within the memory
elements associated with central processing units other than the
requesting one.

51. A method as claimed in claim 49, including the step of
storing one or more descriptor signals, each identifying a
corresponding data page for which physical storage space is
allocated within an associated one of said memory elements, each
said descriptor signal including
i) a tag component signal identifying the
corresponding data page,
ii) at least one field component signal representative
of a status of one or
more of the subpages which form the corresponding data
page.
52. A method of operating a digital data processing apparatus
according to claim 49, wherein
said responding step includes the step of responding to a
read-only signal generated by a requesting one of said central
processing units, said read-only signal being representative of a
request for secondary access to an information-representative
signal stored in a memory element associated with another of said
other central processing units, for transferring a copy of that
information-representative signal to the memory element
associated with the requesting central processing unit.
53. A method of operating a digital data processing apparatus
according to claim 49, wherein
A. said responding step including the step of responding
to an atomic-request signal generated by a first one of said
central processing units, said atomic-request signal being representative of a request
for exclusive priority access to one or more
information-representative signals, for
i) allocating, exclusively, in the memory element
associated with the first central processing unit physical
storage space for said one or more requested information-
representative signals and for storing those information-
representative signals therein, while
ii) preventing access to any of those information-
representative signals in response to a first selected access
request signal generated by a second one of said central
processing units.

54. A method as claimed in claim 51 characterized in that
A. said generating step includes the step of generating a
subpage-request signal representative of a request for access to
a data subpage stored in one or more of said memory elements,
B. said responding step includes the steps of responding to
at least selected ones of said subpage-request signals by
i) allocating, within the memory element associated
with the requesting central processing unit, physical
storage space for the data page associated with the
requested data subpage, said data subpage comprising one
or more information-representative signals and forming
at least part of a data page,
ii) storing the requested data subpage in that
allocated physical storage space, and
C. said method further including the step of
deallocating physical storage space allocated to a selected data
page in one or more of said memory elements,
said deallocating step being effected prior to, or
substantially concurrent with, the allocating step.
55. A method as claimed in claim 54, further characterized in
that said deallocating step includes the step of selecting for
deallocation a data page for which physical storage space is
allocated in at least a first memory element and a second memory
element.
56. A method as claimed in claim 51, wherein said deallocating
step includes the step of storing one or more access
history-representative signals representative of a history of access of
respective ones of data pages for which physical storage space is
allocated within one or more of said memory elements.
57. A method as claimed in any one of claims 49 to 56, comprising
the steps of
A. associating an access state with one or more
information-representative signals stored in said plural memory
elements, said access state being selected from a group of access
states including one or more of

i) a read-only access state associable with a copy of an
information-representative signal subject to read-only access by
a central processing unit associated with a memory element in
which that copy is stored, wherein said information-
representative signal is stored in a memory element associated
with another of said central processing units and is modifiable
by that other central processing unit,
ii) a non-exclusive owner access state associable with an
information-representative signal modifiable by a central
processing unit associated with a memory element in which that
information-representative signal is stored, a copy of which
information-representative signal is stored in a memory element
associated with another said central processing unit,
iii) an exclusive owner access state associable with an
information-representative signal modifiable by a central
processing unit associated with a memory element in which
exclusive physical storage space for that information-
representative signal is allocated and in which that information-
representative signal is stored, for which information-
representative signal no physical storage space is allocated in
any other memory element, that information-representative signal,
or a copy thereof, being accessible for transfer to another
memory element, and
iv) an atomic owner access state associable with an
information-representative signal modifiable by a central
processing unit associated with a memory element in which
exclusive physical storage space for that information-
representative signal is allocated and in which that information-
representative signal is stored, for which information-
representative signal no physical storage space is allocated in
another memory element, neither that information-representative
signal, nor a copy thereof, being selectively accessible for
transfer to another memory element.
58. A method as claimed in claim 49, the further improvement
comprising the steps of
A. generating, within a requesting one of said central
processing units, an exclusive access request signal
representative of a request for priority access to an
information-representative signal, and

B. responding to an exclusive access request signal
generated by said requesting central processing unit, said
exclusive access request being representative of a request for
exclusive access to an information-representative signal stored
in a memory element associated with another of said central processing
units, wherein that information-representative signal is
associated with an access state other than said atomic ownership
state, for effecting exclusive transfer of that information-
representative signal to said requesting central processing unit
and for associating the copy of that information-representative
signal stored in the memory element associated with that other
central processing unit with the invalid access state.

Description

Note: Descriptions are shown in the official language in which they were submitted.


1341 154
BACKGROUND OF THE INVENTION
This invention relates in this divisional
application and in the parent Canadian application 582,560
to digital data processing systems and, more particularly,
to multiprocessing systems with distributed hierarchical
memory architectures.
The art provides a number of configurations
for coupling the processing units of multiprocessing
systems. Among the earlier designs, processing units
that shared data stored in system memory banks were
coupled to those banks via high-bandwidth shared
buses or switching networks. During periods of heavy
usage, bottlenecks were likely to develop as multiple
processing units simultaneously contended for access
to the shared data.
In order to minimize the risk of formation
of transmission bottlenecks, distributed memory
systems were developed coupling individual processing
units with local memory elements to form
semi-autonomous processing cells. To achieve the
benefits of multiprocessing, some of the more
recently designed systems established cell
communications through utilization of hierarchical
architectures.
The distributed memory systems, however,
permit multiple copies of single data items to reside
within multiple processing cells; hence, it is
difficult to insure that all processing cells maintain
identical copies of like data elements. Conventional
efforts to resolve this problem, i.e., to preserve
data coherency, rely upon software oriented
techniques utilizing complex signalling mechanisms.
To avoid processing and signalling overhead
associated with these software oriented solutions,
Frank et al, United States Patent No. 4,622,631,

discloses a multiprocessing system in which a
plurality of processors, each having an associated
private memory, or cache, share data contained in a
main memory element. Data within that common memory
is partitioned into blocks, each of which can be
owned by any one of the main memory and the plural
processors. The current owner of a data block is
said to have the correct data for that block.
A hierarchical approach is disclosed by
Wilson Jr. et al, United Kingdom Patent Application
No. 2,178,205, wherein a multiprocessing system is
said to include distributed cache memory elements
coupled with one another over a first bus. A second,
higher level cache memory, attached to the first bus
and to either a still higher level cache or to the
main system memory, retains copies of every memory
location in the caches below it. The still higher
level caches, if any, and system main memory, in
turn, retain copies of each memory location of cache
below them. The Wilson Jr. et al processors are
understood to transmit modified copies of data from
their own dedicated caches to associated higher level
caches and to the system main memory, while
concurrently signalling other caches to invalidate
their own copies of that newly-modified data.
Notwithstanding the solutions proposed by
Frank et al and Wilson Jr. et al, data
coherency and bus contention remain significant
problems facing both designers and users of
multiprocessing systems. With respect to Wilson Jr.
et al, for example, these problems may be attributed,
at least in part, to the requirement that data in
main memory must always be updated to reflect
permanent modifications introduced to the data
* Published February 4, 1987

elements by each of the processors in the system.
Moreover, neither of the proposed designs is capable
of supporting more than a limited number of
processing units. This restriction in "scalability"
arises from a requirement of both the Wilson Jr. et
al and Frank et al systems that the size of main
memory must increase to accommodate each additional
processor.
It is therefore an object of this invention
to provide an improved multiprocessing system with
improved data coherency, as well as reduced latency
and bus contention. A further object is to provide a
multiprocessing system with unlimited scalability.
Other objects of the invention are to
provide a physically distributed memory
multiprocessing system which requires little or no
software overhead to maintain data coherency, as well
as to provide a multiprocessing system with increased
bus bandwidth and improved synchronization.

SUMMARY OF THE INVENTION
The invention attains the aforementioned
objects by providing, in one broad aspect, a digital
data processing system comprising a plurality of
processing cells arranged in a hierarchy of rings.
The system selectively allocates storage and moves
exclusive data copies from cell to cell in response
to access requests generated by the cells. Routing
elements are employed to selectively broadcast data
access requests, updates and transfers on the rings.
A system of the type provided by the
invention does not require a main memory element,
i.e., a memory element coupled to and shared by the
system's many processors. Rather, data maintained by
the system is distributed, both on exclusive and
shared bases, among the memory elements associated
with those processors. Modifications to a datum stored
exclusively in any one processing cell do not have to
be communicated along the bus structure to other
storage areas. As a result of this design, only that
data which the processors dynamically share, e.g.,
sharing required by the executing programs themselves,
must be transmitted along the bus structure. These
aspects, along with the system's hierarchical
structure, localize signal traffic greatly, thereby
reducing bus contention and bottlenecks.
With further attention to system structure
and element interconnection, the processing cells
include central processing units coupled with memory
elements, each including a physical data and control
signal store, a directory, and a control element.
Groups of cells are interconnected along
unidirectional intercellular bus rings, forming units

referred to as segments. These segments together
form a larger unit referred to as "information
transfer domain(0)." While cells residing within
each segment may communicate directly with one
another via the associated intercellular bus, the
associated central processing units are not
themselves interconnected. Rather, intersegment
communications are carried out via the exchange of
data and control signals stored in the memory
elements. A memory management element facilitates
this transfer of information.
Communications between cells of different
domain(0) segments are carried out on higher level
information transfer domains. These higher level
domains are made up of one or more segments, each
comprising a plurality of domain routing elements
coupled via a unidirectional bus ring. It will be
appreciated that the segments of higher level domains
differ from those of domain(0) insofar as the former
comprise a ring of routing elements, while the latter
comprise a ring of processing cells. Each routing
element is connected with an associated one of the
segments of the next lower information transfer
domain. These connected lower segments are referred
to as "descendants." Every information transfer
domain includes fewer segments than the next lower
domain. Apart from the single segment of the
system's highest level domain, signals are
transferred between segments of each information
transfer domain via segments of the next higher
domain.
An exemplary system having six domain(0)
segments includes two domain(1) segments, the first
of which transfers data between a first three of the

domain(0) segments, and the second of which transfers
data between the other three domain(0) segments.
Data is transferred between the two domain(1)
segments over a domain(2) segment having two domain
routing elements, each connected with a corresponding
one of the domain(1) segments.
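The example above can be sketched as a small construction routine. This is purely an illustrative aid, not part of the patent's disclosure; the function name, the fan-in parameter, and the list representation are all invented for the sketch.

```python
# Hypothetical sketch of the example hierarchy: six domain(0) segments,
# grouped three at a time into two domain(1) segments, which are in
# turn joined by a single domain(2) segment.

def build_hierarchy(n_domain0_segments=6, fan_in=3):
    """Return one list per domain level; each segment is represented
    by the list of domain(0) segment indices it ultimately serves."""
    level = [[i] for i in range(n_domain0_segments)]  # domain(0)
    levels = [level]
    while len(level) > 1:
        # each higher-level segment routes for 'fan_in' child segments
        level = [sum(level[i:i + fan_in], [])
                 for i in range(0, len(level), fan_in)]
        levels.append(level)
    return levels

levels = build_hierarchy()
# levels[0] has six segments, levels[1] has two, levels[2] has one
```

Running the sketch reproduces the described shape: the single domain(2) segment ultimately serves all six domain(0) segments, three through each domain(1) segment.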
The system's memory elements each include a
directory element that maintains a list of
descriptors reflecting the identity and state of each
datum stored in the corresponding memory. One
portion of each descriptor is derived from the
associated datum's system address, while another
portion represents an access state governing the
manner in which the local central processing unit may
utilize the datum. This access state may include any
one of an "ownership" state, a read-only state, and
an invalid state. The first of these states is
associated with data which can be modified by the
local central processing unit, i.e., that unit
included within the cell in which the datum is
stored. The read-only state is associated with data
which may be read, but not modified, by the local
central processing unit. The invalid state is
associated with invalid data copies.
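The descriptor structure just described can be illustrated with a minimal sketch. The field layout, the 4096-byte page size, and every name below are assumptions made for illustration only; the patent does not specify these encodings.

```python
# Illustrative descriptor: one component derived from the datum's
# system address (the tag), one component the access state governing
# how the local central processing unit may use the datum.

from dataclasses import dataclass
from enum import Enum

class AccessState(Enum):
    OWNER = "ownership"      # modifiable by the local CPU
    READ_ONLY = "read-only"  # readable but not modifiable locally
    INVALID = "invalid"      # stale copy

PAGE_BITS = 12  # assumed 4096-byte pages; purely illustrative

@dataclass
class Descriptor:
    tag: int             # high-order bits of the system address
    state: AccessState

def make_descriptor(system_address: int, state: AccessState) -> Descriptor:
    return Descriptor(tag=system_address >> PAGE_BITS, state=state)

d = make_descriptor(0x402A10, AccessState.READ_ONLY)
```

Under these assumptions the directory compares only the tag component when matching a request against the descriptors it holds.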
The domain routing elements themselves
maintain directories listing all descriptors stored
in their descendant domain(0) segments. Thus, in the
above example, the routing elements of the first
domain(1) segment maintain directories reflecting
the combined content of the cells of their respective
domain(0) segment. Moreover, the single routing
element of the domain(2) segment maintains a
directory listing all descriptors retained in all of
the system's processing cells.

Data access requests generated by a
processor are handled by the local memory element
whenever possible. More particularly, a controller
coupled with each memory monitors the cell's internal
bus and responds to local processor requests by
comparing the request with descriptors listed in the
corresponding directory. If found, matching data is
transmitted back along the internal bus to the
requesting processor.
Data requests that cannot be resolved
locally are passed from the processing cell to the
memory management system. The management element
selectively routes those unresolved data requests to
the other processing cells. This routing is
accomplished by comparing requested descriptors with
directory entries of the domain routing units.
Control elements associated with each of those other
cells, in turn, interrogate their own associated
directories to find the requested data. Data
satisfying a pending request is routed along the
domain segment hierarchy from the remote cell to the
requesting cell.
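The two-stage lookup described above can be sketched as follows. The sets of tags and the labels are simplified stand-ins invented for the sketch, not the patent's actual directory formats.

```python
# Hedged sketch of request resolution: a requested tag is compared
# against the local cell's directory first; an unresolved request is
# then compared against the routing elements' directories, each of
# which covers the cells of its descendant segments.

def resolve(tag, local_directory, routing_directories):
    """routing_directories: ordered list of (covered_tags, label)
    pairs for successively wider portions of the hierarchy."""
    if tag in local_directory:
        return "local"          # satisfied on the cell's internal bus
    for covered_tags, label in routing_directories:
        if tag in covered_tags:
            return label        # routed along the domain hierarchy
    return "miss"

where = resolve(0x7, {0x1, 0x2},
                [({0x3, 0x4}, "same domain(0) segment"),
                 ({0x5, 0x6, 0x7}, "remote domain(0) segment")])
```

The ordered search mirrors the text's locality property: the wider the portion of the hierarchy a routing directory covers, the later it is consulted.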
Data movement between processing cells is
governed by a protocol involving comparative
evaluation of each access request with the access
state associated with the requested item. The memory
management system responds to a request for exclusive
ownership of a datum by moving that datum to the
memory element of the requesting cell. Concurrently,
the memory management element allocates physical
storage space for the requested item within the
requesting cell's data storage area. The management
element also invalidates the descriptor associated
with the requested item within the data store of the
remote cell, thereby effecting subsequent
deallocation of the physical storage space which had
retained the requested item prior to its transfer to
the requesting cell.
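The ownership-transfer sequence above can be modelled with a deliberately simple sketch: each memory is a dict from tags to (value, state) pairs. This dict representation and the state names are illustrative assumptions, not the patent's mechanism.

```python
# Sketch of exclusive-ownership transfer: physical storage is
# allocated in the requester's memory, the datum is stored there, and
# the remote descriptor is removed, modelling its invalidation and
# the subsequent deallocation of the remote storage space.

def take_ownership(tag, requester_mem, owner_mem):
    """Each memory is a dict mapping tag -> (value, access_state)."""
    value, _ = owner_mem[tag]
    requester_mem[tag] = (value, "owner")  # allocate and store locally
    del owner_mem[tag]                     # invalidate; space reclaimed
    return value

cell_a, cell_b = {}, {0x42: ("payload", "owner")}
take_ownership(0x42, cell_a, cell_b)
```

After the call the datum is stored exclusively in the requesting cell, matching the invariant the text describes for owned data.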
While the aforementioned operations result
in exclusive storage of the requested datum within
the requesting cell, other cells may subsequently
gain concurrent access to that datum, for example, on
a read-only basis. Particularly, the memory
management system responds to a request by a first
cell for read-only access to a datum exclusively owned
by a second cell by transmitting a copy of that datum
to the first cell while simultaneously designating
the original copy of that data, stored in the second
cell, as "nonexclusively owned."
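Using the same dict-style model, the read-only sharing step can be sketched as below; the state strings are invented labels for the states named in the text.

```python
# Sketch of read-only sharing: the requester receives a read-only
# copy of the datum while the owner's original is re-marked as
# nonexclusively owned.

def share_read_only(tag, requester_mem, owner_mem):
    """Each memory is a dict mapping tag -> (value, access_state)."""
    value, _ = owner_mem[tag]
    owner_mem[tag] = (value, "nonexclusive-owner")  # original re-marked
    requester_mem[tag] = (value, "read-only")       # copy transmitted
    return value

owner, reader = {0x42: ("payload", "owner")}, {}
share_read_only(0x42, reader, owner)
```

Unlike the ownership transfer, both cells retain the datum afterwards, with the asymmetry of access rights that the two states encode.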
The system permits an owning cell to disable
the copying of its data by providing a further
ownership state referred to as the "atomic" state.
The memory management system responds to requests for
data in that state by transmitting a wait, or
"transient," signal to requestors and by broadcasting
the requested data over the hierarchy once atomic
ownership is relinquished.
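The atomic-state behaviour just described can be sketched with a small class; the class, its method names, and the "transient" string are illustrative assumptions standing in for the wait signal and broadcast the text describes.

```python
# Sketch of the atomic state: while a datum is atomically owned,
# access requests receive a "transient" (wait) reply and are queued;
# releasing atomic ownership broadcasts the datum to the queued
# requestors.

class AtomicDatum:
    def __init__(self, value):
        self.value = value
        self.atomic = True
        self.waiters = []

    def request(self, requester):
        if self.atomic:
            self.waiters.append(requester)
            return "transient"            # requester must wait and retry
        return self.value

    def release(self):
        self.atomic = False
        served, self.waiters = self.waiters, []
        return {r: self.value for r in served}  # broadcast on release

d = AtomicDatum("payload")
first = d.request("cell-1")
delivered = d.release()
```

Queuing requestors rather than rejecting them is one plausible reading of the broadcast-on-relinquish behaviour; the patent text does not commit to a particular queueing discipline.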
A system of the type described above
provides improved multiprocessing capability with
reduced bus and memory contention. The dynamic
allocation of exclusive data copies to processors
requiring exclusive access, as well as the sharing of
data copies required concurrently by multiple
processors, reduces bus traffic and data access
delays. Utilization of a hardware-enforced access
protocol further reduces bus and memory contention,
while simultaneously decreasing software overhead
required to maintain data coherency. The

interconnection of information transfer domain
segments permits localization of data access,
transfer and update requests.
Accordingly in one aspect the present invention
resides in a digital data processing apparatus comprising
A. plural memory elements for storing information-
representative signals, each of at least one of which memory
elements is respectively coupled to an associated central
processing unit,
each said central processing unit and its respective
associated memory element comprising at least part of a
processing cell,
B. memory management means coupled to said plural memory
elements for accessing one or more of said information-
representative signals stored in said plural memory elements,
C. at least a requesting one of said central processing
units including access request means for generating an access
request signal representative of a request for access to an
information-representative signal,
said access request means including means for
generating an ownership-request signal representative of a
request for priority access to an information-representative
signal,
at least the memory element associated with the
requesting central processing unit including control means for
selectively transmitting said access-request signal to said
memory management means, and
D. said memory management means including means responsive
to selected ones of said ownership-request signals for
allocating, only within the memory element associated with the
requesting central processing unit and exclusive of the memory
elements associated with the other central processing units,
physical storage space for the requested information-
representative signal and for storing that signal therein.

In a further aspect the present invention resides in
a digital data processing apparatus of the type having
plural memory elements for storing information-representative
signals, each of at least one of which memory elements is
respectively coupled to an associated central processing unit,
each said central processing unit and its respective associated
memory element comprising at least part of a processing cell,
characterized in that
A. at least a requesting one of said central processing
units includes access request means for generating an access
request signal representative of a request for access to an
information-representative signal,
B. said apparatus further including memory management
means coupled to said plural memory elements for accessing one or
more information-representative signals stored therein, and
C. said memory management means including state-defining
means coupled with said plural memory elements for associating an
access state with one or more information-representative signals
stored therein, said access state being selected from a group of
access states including one or more of
i) an exclusive owner access state associable with an
information-representative signal modifiable by a central
processing unit associated with a memory element in which
exclusive physical storage space for that information-
representative signal is allocated and in which that information-
representative signal is stored, for which information-
representative signal no physical storage space is allocated in
any other memory element, that information-representative signal,
or a copy thereof, being accessible for transfer to another
memory element, and
ii) an atomic owner access state associable with an
information-representative signal modifiable by a central
processing unit associated with a memory element in which
exclusive physical storage space for that information-
representative signal is allocated and in which that information-
representative signal is stored, for which information-
representative signal no physical storage space is allocated in
any other memory element, neither that information-representative
signal, nor a copy thereof, being selectively accessible for
transfer to a memory element associated with another said central
processing unit.

In another aspect the present invention resides in
a method of operating a digital data processing apparatus,
said apparatus including plural memory elements for storing
information-representative signals, each of at least one of which
memory elements is respectively coupled to an associated central
processing unit, each said central processing unit and its
respective memory element comprising at least part of a
processing cell, the method comprising the steps of
A. generating within a requesting one of said central
processing units an ownership-request signal representative of a
request for priority access to an information-representative
signal,
B. determining whether the requested information-
representative signal is stored within a memory element other
than one associated with the requesting central processing unit,
and
C. responding to a determination that the requested
information-representative signal is stored in a memory element
other than the one associated with the requesting central
processing unit for allocating, only within the memory element
associated with the requesting central processing unit and
exclusive of the memory elements associated with the other
central processing units, physical storage space for the
requested information-representative signal and for storing that
signal therein.
These and other aspects of the invention are
evident in the description which follows.

BRIEF DESCRIPTION OF DRAWINGS
A more complete understanding of the
invention may be attained by reference to the
drawings, in which:
Figure 1 depicts the structure of a
preferred multiprocessing system constructed in
accord with the invention;
Figures 2A and 2B depict a preferred
configuration of processing cells, including
illustration of data movement provided within
respective memory elements, in a digital data
processing system constructed according to a
preferred embodiment of the invention;
Figure 3 depicts a preferred interconnect
structure of processing cells within a domain(0)
segment in a digital data processing system
constructed according to a preferred embodiment of
the invention;
Figure 4 depicts a construction of an
exemplary processing cell in a digital data
processing system constructed according to a
preferred embodiment of the invention;
Figure 5 depicts a preferred
interrelationship between system virtual addresses,
descriptors, and cache directories in a digital data
processing system constructed according to a
preferred embodiment of the invention;
Figures 6A and 6B present state tables
depicting handling of processor access requests
directed to data stored in local caches in a digital
data processing system constructed according to a
preferred embodiment of the invention;

Figures 7A, 7B, 7C and 7D present state
tables depicting handling of memory management data
requests directed to data stored in the caches in a
digital data processing system constructed according
to a preferred embodiment of the invention; and
Figure 8 depicts a preferred
interrelationship between the LRU_Insert table and
the LRU set utilized in a digital data processing
system constructed according to a preferred
embodiment of the invention.
Figure 9 depicts a construction for a
preferred domain routing cell, including a remote
interface unit, for a digital data processing system
constructed according to a preferred practice of the
invention.

DETAILED DESCRIPTION OF THE ILLUSTRATED EMBODIMENT
Figure 1 depicts the structure of a
preferred multiprocessing system 10 constructed in
accord with the invention. The illustrated system 10
includes three information transfer domains:
domain(0), domain(1), and domain(2). Each
information transfer domain includes one or more
domain segments, characterized by a bus element and a
plurality of cell interface elements. Particularly,
domain(0) of the illustrated system 10 includes siz
segments, designated 12A, 12B, 12C, 12D, 12E and 12F,
respectively. Similarly, domain(1) includes segments
14A and 14B, while domain(2) includes segment 16.
Each segment of domain(0), i.e., segments
12A, 12B, ... 12F, comprises a plurality of processing
cells. For example, as shown in the illustration,
segment 12A includes cells 18A, 18B and 18C; segment
12B includes cells 18D, 18E and 18F; and so forth.
Each of those cells includes a central processing unit
and a memory element, interconnected along an
intracellular processor bus (not shown). In accord
with the preferred practice of the invention, the
memory element contained in each cell stores all
control and data signals used by its associated
central processing unit.
As further illustrated, each domain(0)
segment may be characterized as having a bus element
providing a communication pathway for transferring
information-representative signals between the cells
of the segment. Thus, illustrated segment 12A is
characterized by bus 20A, segment 12B by 20B, segment
12C by 20C, and so forth. As described in greater
detail below, information-representative signals are

passed between the cells 18A, 18B and 18C of
exemplary segment 12A by way of the memory elements
associated with each of those cells. Specific
interfaces between those memory elements and the bus
20A are provided by cell interface units 22A, 22B and
22C, as shown. Similar direct communication pathways
are established in segments 12B, 12C and 12D between
their respective cells 18D, 18E, ... 18R by cell
interface units 22D, 22E, ... 22R, as illustrated.
As shown in the illustration and noted
above, the remaining information transfer domains,
i.e., domain(1) and domain(2), each include one or
more corresponding domain segments. The number of
segments in each successive domain is less than the
number of segments in the prior one. Thus,
domain(1)'s two segments 14A and 14B number fewer
than domain(0)'s six, 12A, 12B ... 12F, while
domain(2), having only segment 16, includes the
fewest of all. Each of the segments in domain(1) and
domain(2), the "higher" domains, includes a bus
element for transferring information-representative
signals within the respective segments. In the
illustration, domain(1) segments 14A and 14B include
bus elements 24A and 24B, respectively, while
domain(2) segment 16 includes bus element 26.
The segment buses serve to transfer
information between the component elements of each
segment, that is, between the segment's plural domain
routing elements. The routing elements themselves
provide a mechanism for transferring information
between associated segments of successive domains.
Routing elements 28A, 28B and 28C, for example,
provide a means for transferring information to and
from domain(1) segment 14A and each of domain(0)

segments 12A, 12B and 12C, respectively. Similarly,
routing elements 28D, 28E and 28F provide a means for
transferring information to and from domain(1)
segment 14B and each of domain(0) segments 12D, 12E
and 12F, respectively. Further, domain routing
elements 30A and 30B provide an information transfer
pathway between domain(2) segment 16 and domain(1)
segments 14A and 14B, as shown.
The domain routing elements interface their
respective segments via interconnections at the bus
elements. Thus, domain routing element 28A
interfaces bus elements 20A and 24A at cell interface
units 32A and 34A, respectively, while element 28B
interfaces bus elements 20B and 24B at cell interface
units 32B and 34B, respectively, and so forth.
Similarly, routing elements 30A and 30B interface
their respective buses, i.e., 24A, 24B and 26, at
cell interface units 36A, 36B, 38A and 38B, as shown.
Figure 1 illustrates further a preferred
mechanism interconnecting remote domains and cells in
a digital data processing system constructed in
accord with the invention. Cell 18R, which resides
at a point physically remote from bus segment 20F, is
coupled with that bus and its associated cells (18P
and 18Q) via a fiber optic transmission line,
indicated by a dashed line. A remote interface unit
19 provides a physical interface between the cell
interface 22R and the remote cell 18R. The remote
cell 18R is constructed and operated similarly to the
other illustrated cells and includes a remote
interface unit for coupling the fiber optic link at
its remote end.

In a like manner, domain segments 12F and
14B are interconnected via a fiber optic link from
their parent segments. As indicated, the respective
domain routing units 28F and 30B each comprise two
remotely coupled parts. With respect to domain
routing unit 28F, for example, a first part is linked
directly via a standard bus interconnect with cell
interface 34F of segment 14B, while a second part is
linked directly with cell interface unit 32F of
segment 12F. These two parts, which are identically
constructed, are coupled via a fiber optic link,
indicated by a dashed line. As above, a physical
interface between the domain routing unit parts and
the fiber optic media is provided by a remote
interface unit (not shown).
Figure 2A depicts a preferred memory
configuration providing data coherence in a
multiprocessing system of the type, for example,
described above. The illustrated system includes
plural central processing units 40(A), 40(B) and
40(C) coupled, respectively, to associated memory
elements 42(A), 42(B) and 42(C). Communications
between the processing and memory units of each
pair are carried along buses 44A, 44B and 44C, as
shown. The illustrated system further includes
memory management element 46 for accessing
information-representative signals stored in memory
elements 42A, 42B and 42C via buses 48(A), 48(B) and
48(C), respectively.
In the illustrated embodiment, the central
processing units 40A, 40B and 40C each include an access
request element, labelled 50A, 50B and 50C,
respectively. These access request elements generate
signals representative of requests for access to

information stored in the memory elements 42A, 42B
and 42C. Among the types of access request signals
generated by elements 50A, 50B and 50C is the
ownership-request signal, representing requests for
priority access to an information-representative
signal stored in the memories. In a preferred
embodiment, access request elements 50A, 50B and 50C
comprise a subset of the instruction set
implemented on CPUs 40A, 40B and 40C. This
instruction subset is described below.
The memory elements 42A, 42B and 42C include
control elements 52A, 52B and 52C, respectively.
Each of these control units interfaces a data storage
area 54A, 54B and 54C via a corresponding directory
element 56A, 56B and 56C, as shown. Stores 54A, 54B
and 54C are utilized by the illustrated system to
provide physical storage space for data and
instruction signals needed by their respective
central processing units. Thus, store 54A maintains
data and control information used by CPU 40A, while
stores 54B and 54C maintain such information used by
central processing units 40B and 40C, respectively.
The information signals maintained in each of the
stores are identified by unique descriptors,
corresponding to the signals' system addresses.
Those descriptors are stored in address storage
locations of the corresponding directory. While the
descriptors are considered unique, multiple copies of
some descriptors may exist among the memory elements
42A, 42B and 42C where those copies themselves
identify copies of the same data element.
Access request signals generated by the
central processing units 40A, 40B and 40C include,
along with other control information, an SA request

portion matching the SA address of the requested
information signal. The control elements 52A, 52B
and 52C respond to access-request signals generated by
their respective central processing units 40A, 40B
and 40C for determining whether the requested
information-representative signal is stored in the
corresponding storage element 54A, 54B and 54C. If
so, that item of information is transferred for use
by the requesting processor. If not, the control
unit 52A, 52B, 52C transmits the access-request
signal to said memory management element along lines
48A, 48B and 48C.
In an effort to satisfy a pending
information access request, the memory management
element broadcasts an access-request signal received
from the requesting central processing unit to the
memory elements associated with the other central
processing units. By way of a cell interface unit,
described below, the memory management element
effects comparison of the SA of an access request
signal with the descriptors stored in the directories
56A, 56B and 56C of each of the memory elements to
determine whether the requested signal is stored in
any of those elements. If so, the requested signal,
or a copy thereof, is transferred via the memory
management element 46 to the memory element
associated with the requesting central processing
unit. If the requested information signal is not
found among the memory elements 42A, 42B and 42C, the
operating system can effect a search among the
system's peripheral devices (not shown) in a manner
described below.
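By way of illustration only, the local-lookup-then-broadcast sequence described above may be sketched in C; the types, sizes and names below are hypothetical stand-ins, not part of the disclosed hardware:

```c
#include <stdbool.h>
#include <string.h>

#define NCELLS 3
#define SLOTS  8

/* Hypothetical model of one cell's cache: valid descriptors associate an
 * SVA with a stored datum. */
typedef struct {
    bool          valid[SLOTS];
    unsigned long sva[SLOTS];
    int           datum[SLOTS];
} Cache;

static int find_slot(const Cache *c, unsigned long sva) {
    for (int i = 0; i < SLOTS; i++)
        if (c->valid[i] && c->sva[i] == sva)
            return i;
    return -1;
}

/* Resolve an access request from cell `req`: satisfy it locally if possible;
 * otherwise broadcast to the other cells and copy the datum into the
 * requestor's cache, as the memory management element does. */
static bool access_request(Cache cells[], int req, unsigned long sva, int *out) {
    int slot = find_slot(&cells[req], sva);
    if (slot >= 0) {
        *out = cells[req].datum[slot];
        return true;
    }
    for (int other = 0; other < NCELLS; other++) {
        if (other == req)
            continue;
        slot = find_slot(&cells[other], sva);
        if (slot >= 0) {
            for (int i = 0; i < SLOTS; i++) {     /* allocate local space */
                if (!cells[req].valid[i]) {
                    cells[req].valid[i] = true;
                    cells[req].sva[i]   = sva;
                    cells[req].datum[i] = cells[other].datum[slot];
                    break;
                }
            }
            *out = cells[other].datum[slot];
            return true;
        }
    }
    return false;  /* not resident: system software searches peripherals */
}
```

A request that misses in every cache returns false, corresponding to the case where the operating system must search the system's peripheral devices.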
Within the illustrated multiprocessor
system, data coherency is maintained through action

of the memory management element on memory stores
54A, 54B and 54C and their associated directories
56A, 56B and 56C. More particularly, following
generation of an ownership-access request by a first
CPU/memory pair (e.g., CPU 40C and its associated
memory element 42C), the memory management element 46
effects allocation of space to hold the requested
data in the store of the memory element of that pair
(e.g., data store 54C of memory element 42C).
Concurrent with the transfer of the requested
information-representative signal from the memory
element in which it was previously stored (e.g.,
memory element 42A), the memory management element
deallocates that physical storage space which had
been previously allocated for storage of the
requested signal.
The aforementioned actions of the memory
management element and, more particularly, the data
coherence element are illustrated in Figures 2A and
2B. In the first of those drawings, information
signals DATUM(0), DATUM(1) and DATUM(2) are stored in
the data store of the memory element 42A partnered
with CPU 40A. Descriptors "foo," "bar" and "bas"
which correspond, respectively, to those data signals
are stored in directory 56A. Each such descriptor
includes a pointer indicating the location of its
associated information signal in the store 54A.
In the memory element 42B partnered to CPU
40B, the illustrated system stores information
signals DATUM(3) and DATUM(2). Corresponding to each
of those data elements are descriptors "car" and
"bas," retained in directory 56B. DATUM(2), and its
descriptor "bas," are copied from element 42A and,
therefore, retain the same labels.

The system illustrated in Figure 2A does not
store any data in the store 54C of the memory element
partnered to CPU 40C.
Figure 2B illustrates actions effected by
the memory management system 46 following issuance
of an ownership-access request by one of the central
processing units. In particular, the illustration
depicts the movement of information signal DATUM(0)
following issuance of an ownership-access request for
that signal by CPU 40C. At the outset, in response
to the request signal, the memory management element
46 allocates physical storage space in the store 54C
of the memory element partnered with CPU 40C. The
memory management element 46 also moves the requested
information signal DATUM(0) from store 54A, where it
had previously been stored, to the requestor's store
54C, while concurrently deallocating that space in
store 54A which had previously held the requested
signal. Along with moving the requested information
signal, the memory management element 46 also effects
invalidation of the descriptor "foo" in directory
56A, where it had previously been used to identify
DATUM(0) in store 54A, and reallocation of that same
descriptor in directory 56C, where it will
subsequently be used to identify the signal in store
54C.
In the preferred embodiment, the memory
management element 46 includes a mechanism for
assigning access state information to the data and
control signals stored in the memory elements 42A,
42B and 42C. These access states, which include the
invalid, read-only, owner and atomic states, govern
the manner in which data may be accessed by specific
processors. A datum which is stored in a memory
element whose associated CPU maintains priority
access over that datum is assigned an ownership
state, while a datum which is stored in a memory
element whose associated CPU does not maintain
priority access over that datum is assigned a
read-only state. Further, a purported datum which is
associated with "bad" data is assigned the invalid
state.
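A minimal sketch of these access states and the local processor operations each permits; the C names are illustrative, not drawn from the patent itself:

```c
/* Hypothetical encoding of the four access states described above. */
typedef enum {
    ST_INVALID,    /* purported datum associated with "bad" data */
    ST_READ_ONLY,  /* local CPU does not maintain priority access */
    ST_OWNER,      /* local CPU maintains priority access */
    ST_ATOMIC      /* exclusive, transaction-style access */
} AccessState;

/* Any state other than invalid allows the local processor to read. */
static int may_read(AccessState s)  { return s != ST_INVALID; }

/* Only an ownership state allows the local processor to modify the datum. */
static int may_write(AccessState s) { return s == ST_OWNER || s == ST_ATOMIC; }
```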
Figure 3 depicts a preferred configuration
for exemplary domain(0) segment 12A of Figure 1. The
segment 12A includes processing cells 18A, 18B and
18C interconnected by cell interconnects 22A, 22B and
22C along bus segment 20A. Domain routing unit 28A
provides an interconnection between the domain(0)
segment 12A and its parent, domain(1) segment 14A of
Figure 1. This routing unit 28A is coupled along bus
20A by way of cell interconnect 32A, as shown. The
structure of illustrated bus segment 20A, as well as
its interrelationship with cell interconnects 22A,
22B, 22C and 32A, is more fully discussed in
copending, commonly assigned, Canadian Patent
Application Serial No. 582,550, entitled
"Interconnection System for Multiprocessing
Structure," filed on November 8, 1988. Further
description of a preferred cell interconnect
structure is provided in Appendix A, filed herewith.
Figure 4 depicts a preferred structure for
processing cells 18A, 18B . . . 18R. The illustrated
processing cell 18A includes a central processing
unit 58 coupled with external device interface 60,
data subcache 62 and instruction subcache 64 over
processor bus 66 and instruction bus 68,
respectively. Interface 60, which provides

communications with external devices, e.g., disk
drives, over an external device bus, is constructed in a
manner conventional to the art.
Processor 58 can comprise any one of several
commercially available processors, for example, the
Motorola 68000 CPU, adapted to interface subcaches 62
and 64, under control of a subcache co-execution unit
acting through data and address control lines 69A and
69B, in a manner conventional to the art, and further
adapted to execute memory instructions as described
below. Schematics for a preferred central processing
unit are provided in Appendix B, filed herewith. A
description of a preferred processor interface is
provided in Appendix C, filed herewith.
Processing cell 18A further includes data
memory units 72A and 72B coupled, via cache control
units 74A and 74B, to cache bus 76. Cache control
units 74C and 74D, in turn, provide coupling between
cache bus 76 and processor and instruction buses 66 and
68. As indicated in the drawing, bus 78 provides an
interconnection between cache bus 76 and the
domain(0) bus segment 20A associated with the illustrated
cell. A preferred construction for cache control
units 74A, 74B, 74C and 74D is provided in Appendix
D, filed herewith.
In a preferred embodiment, data caches 72A
and 72B are dynamic random access memory devices, each
capable of storing up to 8 Mbytes of data. The
subcaches 62 and 64 are static random access memory
devices, the former capable of storing up to 512k
bytes of data, the latter of up to 256k bytes of
instruction information. As illustrated, cache and
processor buses 76 and 66 provide 64-bit transmission
pathways, while instruction bus 68 provides a 32-bit

transmission pathway. A preferred construction of
cache bus 76 is provided in Appendix E, filed
herewith.
Those skilled in the art will understand
that illustrated CPU 58 represents a conventional
central processing unit and, more generally, any
device capable of issuing memory requests, e.g., an
I/O controller or other special purpose processing
element.
The Memory Management System
A multiprocessing system 10 constructed in
accord with a preferred embodiment of the invention
permits access to individual data elements stored
within processing cells 18A, 18B, . . . 18R by
reference to a unique system virtual address (SVA)
associated with each datum. Implementation of this
capability is provided by the combined actions of the
memory management system 46, the subcaches 62, 64 and
the caches 72A, 72B. In this regard, it will be
appreciated that the memory management system 46
includes cache control units 74A, 74B, 74C and 74D,
with their related interface circuitry. It will
further be appreciated that the aforementioned
elements are collectively referred to as the "memory
system."
A complete understanding of the structure
and operation of the memory system may be attained
through recognition of its architectural features,
enumerated below:
Data Storage -- The memory in each cache is
divided into pages, each of which may be dynamically
assigned to some page of SVA space. The memory
system maintains usage and status information about
the data in each cache to facilitate efficient
migration to and from secondary storage.

Data Locality -- The memory system keeps
data recently referenced by a processor in the
subcache or cache in the same cell of that processor.
Data Movement -- The memory system moves
data to the cache of the processor referencing it.
Data Sharing -- The memory system keeps
copies of SVA data in more than one cache to
facilitate efficient data sharing by parallel
programs.
Data Coherence -- The memory system
implements the strongly ordered coherent memory model
and the transaction model.
With regard to the last point, those skilled
in the art will appreciate that a system is
"sequentially consistent" if the result of any
execution is the same as if the operations of all the
processors were executed in some sequential order,
and the operations of each individual processor
appear in this sequence in the order specified by its
program.
Moreover, storage accesses are considered
"strongly ordered" if accesses to data by any one
processor are initiated, issued and performed in
program order and if, at the time when a store by
processor I is observed by processor K, all accesses
to data performed with respect to I before the
issuing of the store must be performed with respect
to K. By contrast, storage accesses are weakly
ordered if accesses to synchronizing variables are
strongly ordered; if no access to a synchronizing
variable is issued by a processor before all previous
data accesses have been performed; and if no access
to data is issued by a processor before a previous
access to a synchronizing variable has been performed.

A coherent system with strong ordering of
events is sequentially consistent.
In the illustrated embodiment, the memory
system stores data in units of pages and subpages,
with each page containing 4k bytes and each subpage
containing 64 bytes. The memory system allocates
storage in the caches 72A, 72B on a page basis. Each
page of SVA space is either entirely represented in
the system or not represented at all. The memory
system shares data between caches in units of
subpages. In the description which follows, the term
"caches" refers to the cache storage elements 72A,
72B of the respective processing cells.
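The granularity just described implies 4096 / 64 = 64 subpages per page. A short illustrative sketch of that arithmetic (the helper names are hypothetical):

```c
/* Page and subpage granularity as given in the text: 4k-byte pages and
 * 64-byte subpages, hence 64 subpages per page. */
#define PAGE_BYTES        4096u
#define SUBPAGE_BYTES     64u
#define SUBPAGES_PER_PAGE (PAGE_BYTES / SUBPAGE_BYTES)

/* Split an address into its page number and subpage index within the page. */
static unsigned long sva_page(unsigned long sva) {
    return sva / PAGE_BYTES;
}

static unsigned sva_subpage(unsigned long sva) {
    return (unsigned)((sva % PAGE_BYTES) / SUBPAGE_BYTES);
}
```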
The organization of SVA space within the
illustrated system is a major departure from ordinary
virtual memory schemes. Conventional architectures
include a software controlled page-level translation
mechanism that maps system addresses to physical
memory addresses or generates missing page exceptions.
In these schemes, the software is responsible for
multiplexing the page table(s) among all the segments
in use. In the architecture of the illustrated
system, there is no software controlled page-level
translation mechanism. The memory system can handle
a significant portion of the address space management
normally performed by software in conventional
architectures. These management responsibilities
include:
(1) maintaining page usage and status
information,
(2) reusing old pages,
(3) synchronizing and ensuring
coherence of shared data access
amongst multiple processors, and

(4) migrating data and copies of data
on a subpage basis from place to
place in the system to keep data
nearest to the processors that are
using it most frequently.
The illustrated system's processors, e.g.,
processors 40A, 40B, 40C, communicate with the memory
system via two primary logical interfaces. The first
is the data access interface, which is implemented by
the load and store instructions. In data access
mode, the processor presents the memory system with
an SVA and access mode information, and the memory
system attempts to satisfy that access by finding the
subpage containing the data and returning it.
The second logical interface mode is control
access, which is implemented by memory system control
instructions. In control access, the processor
instructs the memory system to perform some side
effect or return some information other than the
actual data from a page. In addition to the primary
interfaces, system software uses control locations in
SPA space for configuration, maintenance, fault
recovery, and diagnosis.
Cache Structure
The caches, e.g., elements 72A, 72B of cell
18A, store information in units of pages, i.e., 4096
bytes. Each page of SVA space is either entirely
present in the caches or not present at all. Each
individual cache, e.g., the combination of elements
72A and 72B of cell 18A, allocates space for data on
a page by page basis. Each cache stores data on a
subpage by subpage basis. Therefore, when a page of

SVA space is resident in the system, the following
are true:
(1) One or more caches allocate a page of
storage to the page; each subpage of the page is
stored on one or more of the caches with space
allocated, but
(2) Each cache with space allocated for a
page may or may not contain a copy of all of the
page's subpages.
The associations between cache pages and SVA
pages are recorded by each cache in its cache
directory, e.g., element 56A. Each cache directory
is made up of descriptors. There is one descriptor
for each page of memory in a cache. At a particular
time, each descriptor is either valid or invalid. If
a descriptor is valid, then the corresponding cache
memory page is associated with a page of SVA space,
and the descriptor records the associated SVA page
address and state information. If a descriptor is
invalid, then the corresponding cache memory page is
not in use.
Each cache directory, e.g., element 56A, acts as a
content-addressable memory. This permits a cache to
locate a descriptor for a particular page of SVA
space without an iterative search through all of its
descriptors. Each cache directory is implemented as
a 32-way set-associative memory with 128 sets. All
of the pages of SVA space are divided into 128
equivalence classes. A descriptor for a page can
only be stored in the set of a cache directory that
corresponds to the page's equivalence class. The
equivalence class is selected by SVA[18:12]. At any
given time, a cache can store no more than 32 pages

with the same value for SVA[18:12], since there are 32
elements in each set.
The organization of the cache directory is
shown in Figure 5. SVA[18:12] selects a set. Each
of the descriptors in the selected set is
simultaneously compared against SVA[63:19]. If one
of the elements of the set is a descriptor for the
desired page, the corresponding comparator will
indicate a match. The index in the set of the
matching descriptor, concatenated with the set
number, identifies a page in the cache. If no
descriptor in the set matches, the cache signals a
missing_page exception. If more than one
descriptor matches, the cache signals a
multiple_descriptor_match exception.
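The set selection and tag comparison of Figure 5 may be sketched as follows; the page-number encoding and all identifiers are illustrative assumptions, and the multiple-match case is not modeled:

```c
#include <stdint.h>

#define SETS 128
#define WAYS 32

/* SVA[18:12] selects one of 128 sets; SVA[63:19] is the tag. */
static unsigned set_of(uint64_t sva) { return (unsigned)((sva >> 12) & 0x7f); }
static uint64_t tag_of(uint64_t sva) { return sva >> 19; }

typedef struct {
    int      valid;
    uint64_t tag;
} Desc;

/* Compare the tag against all 32 descriptors of the selected set.
 * Returns a cache page number (way concatenated with set number, in a
 * hypothetical encoding), or -1 to signal a missing_page exception. */
static int dir_lookup(Desc dir[SETS][WAYS], uint64_t sva) {
    unsigned set = set_of(sva);
    for (int way = 0; way < WAYS; way++)
        if (dir[set][way].valid && dir[set][way].tag == tag_of(sva))
            return way * SETS + set;
    return -1;
}
```

In the hardware, the 32 comparisons of a set occur simultaneously; the loop here is a sequential stand-in.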
Preferably, SVA[18:12] is used as a hash
function over SVA addresses to select a set. System
software assigns SVA addresses so that this hash
function gives good performance in common cases. Two
important distribution cases are produced by
referencing many pages of a single segment and by
referencing the first page of many segments. The use
of SVA[18:12] to select a cache set produces good
cache behavior for contiguous groups of pages, since
128 contiguous pages can all reside in a set.
However, this key produces poor hashing behavior for
many pages with the same value in that field. System
software avoids this situation by applying a hash
function when allocating SVA space to context
segments.
According to a preferred practice,
descriptors contain the following fields, the
bit-size of each of which is indicated in parentheses:

descriptor.valid (1)
A cache sets this bit flag to one to
allocate the corresponding page of cache
memory to a page of SVA space, and zero
otherwise. If descriptor.valid is zero,
then none of the other fields are meaningful.
descriptor.tag (45)
Bits [63:19] of an SVA. System software
sets this field to identify the particular
page of SVA space specified by the corresponding
descriptor.
descriptor.modified (1)
A cache sets this bit flag to one when any
data is modified in the page. System
software sets descriptor.modified to zero to
acknowledge the modification of the page.
descriptor.atomic_modified (1)
A cache sets this bit flag to one when any
subpage of this page undergoes a transition
into or out of atomic state. System
software sets descriptor.atomic_modified to
zero to acknowledge the change in atomic
state.
descriptor.held (1)
Software sets the bit flag to indicate that
the descriptor may not be invalidated by the
cache even if no subpages are present in the
cache.
descriptor.LRU_position (5)
The cache maintains this field as the
current position of the descriptor in its
set from Most Recently Used (0) to Least
Recently Used (31).
descriptor.LRU_insert_index (2)
Software sets this field to bias the
treatment of the page by the cache's LRU
maintenance.
descriptor.no_write (1)
A flag. Software sets this field to prevent
modifications to the page by the local
processor. An attempt to modify the page
fails, and is signalled back to the

processor. The processor signals a
page_no_write exception.
descriptor.no_atomic (1)
A flag. Software sets this field to prevent
any cache from acquiring atomic or pending
atomic state on any subpage of this page.
An attempt to acquire an atomic state fails,
and is signalled back to the processor. The
processor signals a page_no_atomic
exception.
descriptor.no_owner (1)
Descriptor.no_owner prevents this cache from
acquiring ownership of this page. Any
attempt to acquire ownership fails, and is
signalled back to the processor. The
processor signals a page_no_owner
exception.
descriptor.owner_limit (2)
Descriptor.owner_limit limits ownership of
subpages of the page to a particular cache,
domain(0), or domain(1) in the domain hierarchy.
descriptor.subcache (1)
Set by cache to record that the
corresponding subpage is subcached in the
processor on the cache's cell.
descriptor.subpage_state (3)
The subpage state field is set by the cache
to record the state of each subpage.
descriptor.summary (2)
Summarizes the subpage state fields corresponding
to four consecutive subpages.
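One possible in-memory packing of the descriptor fields listed above, for illustration only; the bit widths follow the text, but the actual hardware layout is not specified by this sketch:

```c
#include <stdint.h>

/* Illustrative packing of the descriptor fields; widths in comments are
 * the bit-sizes given in the text. */
typedef struct {
    uint64_t tag;                     /* 45 bits used: SVA[63:19] */
    unsigned valid            : 1;
    unsigned modified         : 1;
    unsigned atomic_modified  : 1;
    unsigned held             : 1;
    unsigned LRU_position     : 5;    /* 0 = most, 31 = least recently used */
    unsigned LRU_insert_index : 2;
    unsigned no_write         : 1;
    unsigned no_atomic        : 1;
    unsigned no_owner         : 1;
    unsigned owner_limit      : 2;
    unsigned subcache         : 1;
    unsigned subpage_state    : 3;
} Descriptor;

/* If descriptor.valid is zero, none of the other fields are meaningful. */
static int descriptor_meaningful(const Descriptor *d) {
    return d->valid;
}
```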
If descriptor.no_write is set, write
accesses to the page result in a page_no_write
exception. System software can trap page reads by
keeping a table of pages to be trapped, and refusing
to create an SVA page for them. Then, it can
translate missing_page exceptions into software
generated page_no_read exceptions.

Descriptor.no_write can be used to implement
a copy-on-access scheme, which in turn can be used
as an approximation of 'copy-on-write.' When a
process forks, the pages of the forking process's
address space are set to take page_no_write
exceptions. The child process's address space
segments are left sparse. When the child process
references a page that has not yet been written by
the parent, the page fault is satisfied by making a
copy of the corresponding page of the parent process,
and descriptor.no_write is cleared for that
page. If the parent writes a page before the child
has copied it, the page_no_write handler copies the
page into the child address space and then clears
descriptor.no_write.
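The copy-on-access scheme may be sketched as follows; the fault-handler structure and names are hypothetical simplifications of the system software described:

```c
#include <string.h>

#define PAGE 4096

/* Simplified model of one page and its no_write protection bit. */
typedef struct {
    unsigned char data[PAGE];
    int no_write;   /* stand-in for descriptor.no_write */
    int present;    /* whether this address space has the page yet */
} Page;

/* On fork: the parent's pages take page_no_write exceptions; the child's
 * address space is left sparse. */
static void cow_fork(Page *parent, Page *child) {
    parent->no_write = 1;
    child->present = 0;
}

/* Fault handler: on the child's first reference, or on a parent write
 * before the child has copied, copy the page into the child's address
 * space and clear no_write. */
static void cow_fault(Page *parent, Page *child) {
    if (!child->present) {
        memcpy(child->data, parent->data, PAGE);
        child->present = 1;
        child->no_write = 0;
    }
    parent->no_write = 0;
}
```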
If descriptor.held is 1 in a descriptor,
then the descriptor's cache is prevented from
invalidating it. When the first subpage arrives
after the subpages of a page with a held descriptor
became invalid, all of the fields of the descriptor
except descriptor.tag, descriptor.held,
descriptor.LRU_insert_index and descriptor.LRU_insert_priority
are reinitialized as if the descriptor
had not existed when the subpage arrived.
Descriptor.held is not propagated from one cache to
another.
Descriptor.owner_limit limits ownership of
subpages of the page to a particular cache or
domain(0) in the system bus hierarchy. The following
list shows the values of descriptor.owner_limit, and
the semantics from the point of view of an owning
cache responding to requests from other caches.
(1) Cache_owner_limit - the local cache
will not relinquish ownership to any

other cache. If another cache requests
ownership, it receives an error
response.
(2) Domain0_owner_limit - the local cache
will not relinquish ownership to any
cache that is not in the same domain(0).
(3) Domain1_owner_limit - the local cache
will not relinquish ownership to any
cache that is not in the same domain(1).
(4) Default_owner_limit - any cache may
be the owner of the subpages of the
page.
Descriptor.owner_limit is propagated to
other caches as follows: so long as all of the
subpages of a descriptor are read-only copies,
descriptor.owner_limit is always
Default_owner_limit. When a new cache becomes the
owner of a subpage, it copies the value of
descriptor.owner_limit from the old owner.
If descriptor.no_owner is 1 in a descriptor,
then the descriptor's cache cannot acquire an
ownership state for any subpages of the page
described by the descriptor. A cache containing a
descriptor with descriptor.no_owner of 1 never
responds to requests from other caches except to
indicate that it is holding the copy.
Descriptor.no_owner is not propagated from one cache
to another.
If descriptor.no_atomic is 1 in a
descriptor, then the descriptor's cache cannot acquire
atomic or pending atomic ownership states for any
subpages of the page described by the descriptor. A
processor attempt to set atomic or pending atomic
ownership state fails, and is signalled back to the

processor. The processor signals a page_no_atomic
exception. Descriptor.no_atomic is propagated from
one cache to another.
Descriptor.summary summarizes the subpage state
fields corresponding to four consecutive subpages.
There is one two-bit field for each of the 16 sets of
four subpages represented by the descriptor. The
following is a list of summary states:
all invalid -- subpage state of all
four subpages are invalid.
all exclusive -- subpage state of all
four subpages are exclusive owner.
no owner -- subpage state of all four
subpages are either invalid or read
only.
owner -- one or more subpages are
either atomic owner, exclusive owner or
nonexclusive owner states. If all four
subpages are exclusive owner state, all
exclusive summary should be used.
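The mapping from four subpage states to a summary value can be sketched as follows. This is a hypothetical illustration in Python; the state names come from the list above, but the function and its encoding are assumptions, not the patent's actual implementation.

```python
# Illustrative sketch of computing one descriptor.summary value for a set of
# four consecutive subpages. Only the state names are taken from the text.
OWNER_STATES = {"atomic owner", "exclusive owner", "nonexclusive owner"}

def summarize(states):
    """Collapse the states of four consecutive subpages into one summary."""
    assert len(states) == 4
    if all(s == "invalid" for s in states):
        return "all invalid"
    if all(s == "exclusive owner" for s in states):
        return "all exclusive"
    if all(s in ("invalid", "read-only") for s in states):
        return "no owner"
    return "owner"  # at least one subpage is held in an owner state
```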
The illustrated memory elements, e.g., 42A,
42B, 42C, detect errors, for example, while executing
a synchronous request from the local processor. The
element signals the error in its response to the
request. The local processor then signals a
corresponding exception. When a memory element
detects an error while executing a request from a
remote cell, it sends an interrupt to its local
processor and responds to the request with an error
reply. In the descriptions that follow, the
expression "the cache signals an exception" is an
abbreviation for this process.
Each memory includes a Cache Activity
Descriptor Table (CART) (not shown), in which it

maintains the status of ongoing activities. When a
memory element detects an error in responding to a
request from its domain(0) or in executing an
asynchronous control instruction or a remote control
instruction, it notes the error in a descriptor in
the CART before sending an interrupt. Software reads
the CART to identify the particular source and type
of error. Software resets the CART to acknowledge
receipt of the error.
Subpages and Data Sharing
When a page is resident in the memory
system, each of its subpages is resident in one or
more of the caches. When a subpage is resident in a
cache, the descriptor in that cache for the page
containing that subpage records the presence of that
subpage in one of several states. The state of the
subpage in a cache determines two things:
1) What operations that cache's local processor
may perform on the data present in the
subpage; and
2) What responses, if any, that cache makes to
requests for that subpage received over the
domains from other caches.
The states of subpages in caches change over
time as user programs request operations that require
particular states. A set of transition rules specifies
the changes in subpage states that result from
processor requests and inter-cache domain
communications.
In order for a processor to complete a load
or store, two conditions must be satisfied:
1) The subpage containing the data must be
present in its local cache.

2) The local cache must hold the subpage in the
appropriate state. The state determines
whether the subpage can be modified and how
the local cache responds to requests from
other caches.
If either of these conditions is not
satisfied, the processor's local cache communicates
over the domains to acquire a copy of the subpage
and/or to acquire the necessary state for the
subpage. If the cache fails to satisfy the request,
it returns an error indication to the processor,
which signals an appropriate exception.
The instruction set includes several
different forms of load and store instructions that
permit programs to request subpage states appropriate
to the expected future data reference pattern of the
current thread of control, as well as protocol
between different threads of control in a parallel
application.
In the text which follows there is first
described the states and their transitions in terms
of processor instructions and their effect on
caches. This is followed by a description of the
domain messages that are sent between the caches to
implement those state transitions.
Subpage States
The subpage states and their transition
rules provide two general mechanisms to user programs
executing on the illustrated system:
1) They transparently implement the strongly
ordered sequentially consistent model of
memory access for ordinary load and store
accesses by the processors of the system.
2) They provide a set of transaction primitives
that are used by programs to synchronize

parallel computations. These primitives can
be applied to a variety of traditional and
non-traditional synchronization mechanisms.
The states and their transitions are
described in three groups. The first group are the
basic states and transitions that implement the
strongly ordered, sequentially consistent model of
memory access. Second are the additional states that
implement the transaction primitives. Finally, the
transient states, which improve the performance of
the memory system, are presented.
The processor subcaching system is divided
into two sides: data and instruction. The data
subcache 62 is organized in 64 bit words, like the
cache. The instruction subcache 64 is organized into
32 bit half-words, since there are two 32 bit
instructions in each 64 bit memory word. The data
subcache stores 0.5 Mbyte, and the instruction subcache
0.25 Mbyte. Since the items in the instruction
subcache are half-words, the two subcaches store the
same number of items. The two sides of the subcache
are similar in structure to the cache.
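The size arithmetic above can be checked directly: 0.5 Mbyte of 64-bit words and 0.25 Mbyte of 32-bit half-words hold the same number of items. A small sketch (variable names are illustrative only):

```python
# Verify that the data and instruction subcaches hold the same item count.
data_bytes = 512 * 1024          # 0.5 Mbyte data subcache
instr_bytes = 256 * 1024         # 0.25 Mbyte instruction subcache

data_items = data_bytes // 8     # 64-bit words are 8 bytes each
instr_items = instr_bytes // 4   # 32-bit half-words are 4 bytes each
```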
Subcache Data Units
Subcache descriptors do not describe entire
pages of SVA space. They describe different units,
called blocks. The size of a block is different on
the two sides of the subcache. On the data side,
blocks are half the size of pages. On the
instruction side, they are one quarter as large as
pages. On both sides, each block is divided into 32
subblocks. The following table shows the relative
sizes of blocks, subblocks and other items in the two
subcaches.

Comparison of Instruction and Data Subcaches

              total       item     subblocks  bytes    subblock
              size        size     /block     /block   size
Data          .50 Mbyte   64 bits  32         2.0K     64 bytes
Instruction   .25 Mbyte   32 bits  32         1.0K     32 bytes
In the same way that the cache allocates a
page of memory but copies data one subpage at a time,
the subcache allocates blocks and copies data one
subblock at a time.
Subcache Organization
The subcaches 62, 64 are organized similarly
to the caches. Where the caches are 32-way set
associative (each set contains 32 descriptors), the
subcaches are 4-way set-associative. For the data
side, the set number is bits [16:11] of the SVA, and
the tag is bits [63:17]. For the instruction side, the
set number is bits [15:10], and the tag is bits
[63:16]. The data subcaches maintain modification
information for each subblock.
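The SVA field extraction described above can be sketched as follows. Only the bit positions come from the text; the helper names are assumptions for illustration.

```python
# Split an SVA into (set number, tag) for each side of the subcache.
def data_side_index(sva):
    """Data subcache: set number is SVA bits [16:11], tag is bits [63:17]."""
    set_number = (sva >> 11) & 0x3F   # 6-bit field [16:11]
    tag = sva >> 17                   # remaining high bits [63:17]
    return set_number, tag

def instruction_side_index(sva):
    """Instruction subcache: set number is bits [15:10], tag is bits [63:16]."""
    set_number = (sva >> 10) & 0x3F   # 6-bit field [15:10]
    tag = sva >> 16
    return set_number, tag
```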
Subcache Block Replacement
The subcaches implement a simple
approximation of the cache LRU scheme. Within a set,
each subcache maintains the identity of the most
recently referenced descriptor. When a descriptor is
needed, one of the three descriptors that is not the
most recently referenced descriptor is selected at
random for replacement.
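The replacement approximation can be modeled in a few lines. This is a toy sketch under the stated policy: within a 4-way set, only the most recently referenced way is remembered, and a random one of the other three is evicted.

```python
import random

def pick_victim(mru_way, rng=random):
    """Return a way (0-3) to replace, never the most recently referenced one."""
    candidates = [w for w in range(4) if w != mru_way]
    return rng.choice(candidates)
```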
Subcache Block Writeback
The data subcaches write modified subblocks
to their caches as described above in the section
entitled "Updates from the Subcache to the Cache."

Basic States and Transitions
The basic model of data sharing is defined
in terms of three classes of subpage states:
invalid, read-only, and owner. These three classes
are ordered in strength according to the access that
they permit; invalid states permit no access,
read-only states permit load access, and owner states
permit load and store access. Only one cache may
hold a particular subpage in an owner state at any
given time. The cache that holds a subpage in an
owner state is called the owner of the subpage.
Ownership of each subpage moves from cache to cache
as processors request ownership via store
instructions and special load instructions that
request ownership. Any number of caches may hold a
particular subpage in a read-only state.
basic States
The sections below describe the state
classes and how they interact to implement the
strongly ordered, sequentially consistent model of
memory access.
Invalid States
When a subpage is not present in a cache, it
is said to be in an invalid state with respect to
that cache. If a processor requests a load or store
to a subpage which is in an invalid state in its
local cache, then that cache must request a copy of
the subpage in some other state in order to satisfy
the data access. There are two invalid states:
invalid descriptor and invalid. When a
particular cache has no descriptor for a particular
page, then all of the subpages of that page are said

to be in invalid descriptor state in that cache.
Thus, subpages in invalid descriptor state are not
explicitly represented. When a particular cache has
a descriptor for a particular page, but a particular
subpage is not present in that cache, then that
subpage is in invalid state. The two invalid
states are distinguished because it is much easier
for a subpage to undergo a transition to a
read-only or owner state from invalid than from
invalid descriptor. In the former case, a
descriptor is already present. In the latter case, a
descriptor must be allocated.
Read-Only States
There is only one read-only state:
read-only.
Owner States
There are two basic owner states:
non-exclusive and exclusive. When a particular
cache holds a particular subpage in non-exclusive
state, any number of other caches may simultaneously
hold that subpage in read-only state. When a
particular cache holds a particular subpage in
exclusive state, then no other cache may hold a copy
so long as that cache retains exclusive state. When
a cache holds a subpage in non-exclusive state, and
the data in that subpage are modified, then that
cache sends the modified data to all of the caches
with read-only copies.
Basic State Transitions
The basic state transitions can be
illustrated by considering a subpage in exclusive

state on a particular cache. The basic mechanism by
which data moves from this first cache to other
caches is the execution of load and store
instructions by processors other than the local
processor of that first cache. The different load
and store instructions, as well as the prefetch
instructions, permit programs to request that their
local cache acquire read-only, non-exclusive, or
exclusive state. If another cache requests read-only
state, then the first cache reduces its state from
exclusive to non-exclusive and grants read-only state
to the requestor. If another cache requests
non-exclusive state, then the first cache reduces its
state to read-only and grants non-exclusive state to
the requestor. If another cache requests exclusive
state, then the first cache reduces its state to
invalid and grants exclusive state to the requestor.
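The owner-side transitions in the paragraph above can be sketched as a small table, assuming the holding cache starts in exclusive state. This is a simplification for illustration, not the complete protocol given in the tables of Figures 7A through 7D.

```python
# Holder starts in exclusive state; map each requested state to the holder's
# new state and the state granted to the requestor.
RESPONSES = {
    "read-only":     ("non-exclusive", "read-only"),
    "non-exclusive": ("read-only",     "non-exclusive"),
    "exclusive":     ("invalid",       "exclusive"),
}

def respond_from_exclusive(requested):
    """Return (holder's new state, state granted) for one incoming request."""
    return RESPONSES[requested]
```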
Ownership moves from cache to cache as
processors request exclusive and non-exclusive
states. When a cache requests non-exclusive
ownership, any read-only copies are invalidated
(undergo a transition to an invalid state).
When a cache acquires ownership of a subpage
in order to satisfy a store instruction, it does not
grant that ownership to another cache until the store
instruction is complete. In the case of
non-exclusive state, a cache does not grant ownership
to another cache until the new data from the store is
sent to the caches with read-only copies. This rule
provides the strongly ordered nature of the memory
system, in that it ensures that readers of a memory
location see modifications in the order that they
are made.

When a particular subpage is in invalid
state in a particular cache (i.e., a descriptor is
already allocated, but the particular subpage is not
present), and a copy of that subpage is available on
the domain interconnection due to a request from some
other cache, and at least one other cache in that
cache's local domain(0) has a copy, that cache will
acquire a read-only copy of the subpage. The effect
of this mechanism is to accelerate parallel
computations, since it can remove the latency
associated with requesting a copy of a subpage from
another cache.
When a non-exclusive owner modifies a
subpage, it must send the modified data over the
domains to any read-only copies. This permits very
fast propagation of data from a producer to a
consumer. However, it consumes domain bandwidth.
Therefore, the memory system includes two mechanisms
for avoiding unnecessary non-exclusive owner states.
First, when a non-exclusive owner sends an update out
over the domains, it receives a return receipt that
includes whether any other caches actually hold
read-only copies. If the receipt indicates that
there are no read-only copies, then the owner changes
the subpage's state from non-exclusive to exclusive,
avoiding future updates. Second, when a cache
receives an update for a subpage that it holds in
read-only state, its action depends on whether that
subpage is currently resident in the CPU's subcache.
If the subpage is not subcached, then the
cache invalidates it. If the subpage is subcached, then
the cache removes it from the subcache. The effect
of these actions is as follows: So long as a subpage
is not modified, read-only copies of it propagate

throughout the memory system. When a subpage is
modified, each read-only copy persists only if that
copy is referenced at least as frequently as the
subpage is modified.
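The update rules above can be modeled as a toy sketch: a read-only copy survives an update only if it is currently subcached (in use), and the owner drops back to exclusive when no copy replies. The dictionary shapes and names are assumptions for illustration.

```python
def apply_update(copy):
    """Deliver an update to one read-only copy; return True if it replies."""
    if copy["subcached"]:
        copy["subcached"] = False      # removed from subcache, copy retained
        return True                    # reply: "I still hold a copy"
    copy["state"] = "invalid"          # not in use: invalidate silently
    return False

def owner_state_after_update(copies):
    """Owner becomes exclusive if the return receipt shows no surviving copies."""
    replies = [apply_update(c) for c in copies if c["state"] == "read-only"]
    return "non-exclusive" if any(replies) else "exclusive"
```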
Transitions
It is important to note that the basic
mechanisms provide the strongly ordered memory access
model to programs that use simple load and store
instructions. Programs may use the forms of the
load, store, and prefetch instructions that request
particular states in order to improve their
performance, and it is expected that in many cases
compilers will perform the necessary analysis.
However, this analysis is optional.
Synchronization States and Transitions
The synchronization states and related
transitions implement the KSR transaction model. The
transaction model is a primitive synchronization
mechanism that can be used to implement a wide
variety of synchronization protocols between
programs. All of these protocols share the purpose
of imposing an orderly structure in time on accesses
to shared data.
The transaction model is based on two
states, atomic and pending atomic, a set of
instructions that explicitly request transitions to
and from these states, and forms of the load and
store instructions whose semantics are dependent on
whether the subpage that they reference is currently
in atomic state.

Atomic State and Transactions
The atomic state is the central feature of
the transaction model. Atomic state is a stronger
form of ownership than exclusive state. Subpages only
enter and leave atomic state as a result of explicit
requests by programs.
Fundamentally, atomic state can be used to
single-thread access to any subpage in SVA space.
When a processor executes an instruction that
requests that a subpage enter atomic state, the
instruction will only complete normally if the
subpage is not in atomic state already. Thus, atomic
state on a subpage can be used as a simple lock. The
lock is locked by taking the subpage atomic, and
unlocked by releasing it to exclusive.
A program requests that a subpage enter
atomic state with one of the forms of the get
instruction, and releases it with the rsp
instruction. These instructions are described in
more detail below.
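The lock behavior described above can be sketched minimally: get succeeds only if the subpage is not already atomic, and rsp releases it back to exclusive. This models the semantics only, not the hardware mechanism; the class and its return values are assumptions.

```python
class Subpage:
    """Toy model of atomic state on a subpage used as a simple lock."""
    def __init__(self):
        self.state = "exclusive"

    def get(self):
        """Request atomic state; completes normally only if not already atomic."""
        if self.state == "atomic":
            return False               # in the real system: stalls, not False
        self.state = "atomic"
        return True

    def rsp(self):
        """Release atomic state back to exclusive."""
        self.state = "exclusive"
```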
A sequence of get, manipulate some protected
information, and rsp is the simplest form of
transaction. The following sections present more
complex features of the transaction mechanism that
permit the implementation of more sophisticated
protocols. These protocols provide high performance
for particular parallel programming applications.
In simple transactions, a subpage is used
purely as a lock. The data in the subpage is not
relevant. Some of the more sophisticated forms of
synchronization mechanisms make use of the data in a
subpage held in atomic state. The simplest case is

to use atomic state on a subpage as a lock on the
data in that subpage. Programs take one or more
subpages into atomic state, manipulate their
contents, and release them.
Producers and Consumers - Blocking and Non-Blocking
Load Instructions
In the transactions described above, access
to the protected data is strictly single-threaded.
However, there are important applications where one
program produces a value and many consume it, and
the consumers need not see a consistent view of
more than one full KSR word of data. In such a case,
it is undesirable for each of the consumers to
serially hold the subpage containing the data in
atomic state, one after the other. The consumers
must wait for the producer to release atomic state,
but they can all read the result simultaneously.
Applications like this can be implemented
using the blocking and non-blocking forms of load
instructions. Non-blocking load instructions access
the data in a subpage regardless of whether or not
that subpage is in atomic state. These are used by
ordinary programs and by the single-threaded
transactions described above. Blocking load
instructions only proceed normally if the subpage is
not in atomic state. If the subpage referenced by a
blocking load instruction is in atomic state, the
instruction blocks until the subpage leaves atomic
state. In a producer-consumer relationship, the
producer(s) hold the subpage(s) containing the data
in atomic state, while the consumers read the data
using blocking load instructions.

Passive and Active Atomic State Requests - Pending
Atomic State
The get instructions actively request
atomic state over the domains. In some applications,
a program may have absolute knowledge that a
particular subpage is already in atomic state. In
this case, sending a request across the domains is
pointless. Instead, the program can use the stop
instruction to place the subpage in pending atomic
state in the local cache, and depend upon another
program to expel the subpage using the rspe
instruction.
When a subpage is in pending atomic state in
a particular cache, this indicates that atomic state
is desired in that cache. If a message arrives over
the domains in a cache that holds a particular
subpage in pending atomic state that indicates that
atomic state is available for that subpage, then that
cache will take the subpage in atomic state. When a
processor executes a stop instruction for a
subpage, that subpage is placed in pending atomic
state on the local cache. When another processor
executes an rspe instruction, a message is sent
across the domains indicating that atomic state is
available for the subpage. When this message reaches
a cache with the subpage in pending atomic state,
that cache acquires atomic state.
It is important to note that messages of
this kind pass all of the caches in the system in a
single, well defined order. Thus, a series of caches
can use sequences of the form stop, manipulate,
rspe, to pass a synchronization token from one to
the next in turn.
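The expulsion order described above can be modeled as a toy sketch: the rspe message visits caches in one fixed, well-defined order, and the first cache in pending atomic state takes the subpage. All names and structures here are assumptions for illustration.

```python
def expel(holder, pending, n_caches):
    """Return the cache that acquires atomic state after `holder` expels it."""
    for step in range(1, n_caches + 1):
        c = (holder + step) % n_caches   # fixed traversal order
        if c in pending:
            pending.discard(c)           # leaves pending atomic, enters atomic
            return c
    return None                           # no cache was waiting
```

Starting with caches 1 and 2 in pending atomic state, successive expulsions hand the token to cache 1, then cache 2, in order.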

Transient States and Transitions
The transient states are used automatically
by the memory system to improve the performance of
accesses to subpages in case of contention. There
are three transient states: transient atomic,
transient exclusive, and transient non-exclusive.
These states correspond to atomic, exclusive, and
non-exclusive states, respectively. A particular
subpage enters a transient state in a particular
cache when that cache receives a request for the
subpage to which it cannot respond immediately. If a
subpage is in atomic state and another cache requests
that subpage, that subpage enters transient atomic
state in the holding cache. When the subpage is
later released with an rsp instruction, the
transient state forces the subpage to be expelled as
if an rspe had been used. If a subpage is in
exclusive or non-exclusive state and is subcached,
and another cache requests that subpage, that subpage
enters the corresponding transient state. When an
up-to-date copy of the subpage is available for the
subcache, the cache expels the subpage, making it
available to the other cache(s).
A subpage enters a transient state on a
cache due to a request by a single other cache.
However, any number of additional caches may make
requests for the same subpage before the holding
cache expels it. In this case, the single expulsion
satisfies all of the requesting caches with a single
message over the domains.
Detailed Transitions Between States
The following is another list of the
states. In this list, the most important conditions

for entering and leaving the state are described.
This list provides a more complete listing of the
transitions than the introduction above, but the
precise specifications of the state transitions are
given in the tables provided below. Some of the
transitions are conditioned by LRU information.
invalid A subpage enters invalid descriptor
descriptor state in a cache when the descriptor
for its page is deallocated. A subpage
leaves invalid descriptor state when a
descriptor for its page is allocated in
the cache. Unless descriptor.held is 1,
a descriptor is automatically
invalidated when all of its subpages are
in invalid state.
invalid A subpage enters invalid state in a
cache when another cache acquires
exclusive or atomic state. A subpage
leaves invalid state in a cache when
that cache acquires a copy of the
subpage in any state. A cache will
acquire a copy of a subpage to satisfy
a request from its local processor, in
response to a data movement control
instruction (see below) or when a copy
is available over the domains due to
communication between other caches.
read-only A subpage enters read-only state in a
cache from an invalid state when that
cache requests a read-only copy, or
when a copy is available over the
domains due to communication between
other caches and at least one other
cache in the same domain(0) has a copy
of the subpage. A subpage enters
read-only state from non-exclusive or
exclusive state when another cache
requests non-exclusive state. A
subpage leaves read-only state when
another cache requests exclusive or
atomic state, or when the cache
acquires an owner state.

When the nonexclusive owner of a
subpage modifies that subpage, the
owner sends the new data over the
domains to update any read-only
copies. At the time of such an update,
if a cache has a subpage in read-only
state and that subpage is in use by the
cache's local processor, then the cache
updates the copy and replies to the
update to indicate that it has a copy.
A subpage is defined to be in use in a
processor when it is resident in the
processor's subcache. If a cache has a
subpage in read-only state and the
subpage is not in use, then that cache
invalidates the subpage and does not
respond to the update.
nonexclusive
owner A subpage enters non-exclusive state in
a cache when that cache requests
ownership and some other cache has a
read-only copy. A subpage leaves
non-exclusive state as follows: When a
cache has a copy in non-exclusive state
and another cache requests
non-exclusive state, the holding cache
responds to the request and changes its
state to read-only, giving the
non-exclusive state to the requesting
cache. When a cache has a copy in
non-exclusive state and another cache
requests exclusive or atomic state, the
holding cache responds to the request
and invalidates its copy. When a cache
holding a subpage in non-exclusive
state receives an update from its local
processor, it sends the new data to the
other caches. If no other cache
indicates that it is holding a copy,
the holding cache changes the subpage's
state to exclusive.
exclusive A subpage enters exclusive state in a
owner cache when that cache requests
ownership and no other cache has a
read-only copy, or when that cache
explicitly requests exclusive state. A
subpage leaves exclusive state on any
request for a copy. The response to

requests for different states is as
follows:
read-only - If the page is below
BS_High in LRU priority, the holding cache
responds granting exclusive state and
invalidates its copy. If the page is
above BS_High in LRU priority, the
holding cache responds granting
read-only state and changes its copy's
state to non-exclusive.
non-exclusive - If the subpage is
subcached, the holding cache responds
granting non-exclusive state and
changes its copy's state to read-only.
If the subpage is not subcached, the
holding cache responds granting
exclusive state and invalidates its
copy.
exclusive or atomic - The holding
cache responds to the request and
invalidates its copy.
atomic A subpage enters atomic state in a
cache in one of two ways. First, if
the local processor executes a g.~
instruction, and the subpage is not in
atomic state, then the local cache of
the requesting processor will acquire
the subpage in atomic state. Second,
if a cache holds a subpage in pending
atomic state, and the subpage is
expelled from another cache holding it
in atomic state, then that first cache
will acquire atomic state.
A subpage leaves atomic state in a
cache only when it is released by
explicit request from the cache's local
processor.
pending A subpage enters pending atomic state
atomic via the stop instruction.
A subpage leaves pending atomic state
in two ways. If a subpage is in
pending atomic state in a cache, and
the local processor executes an rsp

instruction, then the subpage leaves
pending atomic and becomes invalid. If
a subpage is in pending atomic state in
a cache, and that subpage is made
available in atomic state via an
expulsion, then that subpage enters
atomic state from pending atomic state.
transient nonezclusive owner
transient exclusive owner
transient atomic owner
If a cache holding a subpage in any
owner state is unable to immediately
respond, the holding cache marks the
subpage transient. For example, when a
cache holds a subpage in atomic state
and another cache requests a copy in
any state, the holding cache marks the
subpage transient atomic, since a
response cannot be issued while in
atomic state.
Transient state is passed on responses
and expulsions. It is cleared only
after an expel traverses the domain
without being acquired by some other
cache.
Data Copying Strategy
The interaction among states described above
is a tradeoff between time and domain bandwidth spent
waiting for a copy from another cache and time and
bandwidth spent sending updated copies to other
caches. When there are many read-only copies of a
subpage in the system, then the chances that a read
will find the data already in the local cache are
increased. However, if there are any read-only
copies in the system, then the owner must send out
updates when modifying the subpage.
The following heuristics attempt to
dynamically distinguish, on a short-term basis,
multiple read/writer sharing from single read/writer access.

Multiple read/writer sharing is multiple read-only
copies with high temporal locality and write updates
with lower temporal locality. Retaining read-only
copies is most efficient since multiple copies are
read multiple times between updates. Updates take
place in a single domain operation. Single
read/writer access is multiple read-only copies
with low temporal locality and write
updates with much higher locality. Retaining
read-only copies is less efficient since copies are
updated multiple times between reads. A single
read/write copy (exclusive owner state) does not
require a domain operation for write update.
Applying these two cases independently to all
read-only copies allows transition from non-exclusive
ownership with multiple read-only copies to exclusive
ownership with no read-only copies. The strategy for
balancing these considerations is as follows:
a. When a copy of a subpage is sent across the
domains to satisfy a request, any cache that
has a descriptor for a page but no copy of
the subpage picks up a read-only copy from
the message. This mechanism accelerates
applications with high locality of reference.
b. When a cache sends an update for a subpage,
every other cache with a copy of the subpage
in read-only state which is not in use
invalidates that copy. If a copy is
subcached, it is considered "in use". When
a copy that is preserved is in a processor's
subcache, it is removed from the subcache.
This slows down the next reference to that
subpage from that processor. If it were
not removed, the subpage might remain in
subcache indefinitely, compelling the owner
to send updates. Since interconnect
bandwidth limits total system performance,
trading latency on one cache for global
throughput improves net system performance.

c. The caches periodically remove read-only
copies of subpages when the owner is in a
different domain(0). This reduces the cost
of updates, since intra-domain(0) messages
are faster than inter-domain(0) messages.
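Heuristic (a) above can be sketched as follows: when a copy of a subpage passes on the domain, a cache that already has a descriptor for the page but no copy of the subpage picks up a read-only copy. The dictionary representation is an assumption for illustration.

```python
def snoop_passing_copy(descriptors, page, subpage):
    """Return True if this cache picks up a read-only copy from the message."""
    desc = descriptors.get(page)
    if desc is not None and desc.get(subpage, "invalid") == "invalid":
        desc[subpage] = "read-only"    # descriptor present, subpage was absent
        return True
    return False                        # no descriptor, or copy already held
```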
The Processor Side
The tables shown in Figures 6A and 6B
provide the precise specification of the action that
a cache takes in response to data access requests
from its local processor. There is one row of the
table for each processor request to the cache. There
is a column for each possible state of the subpage in
the cache. The entry in the table states the
message, if any, sent over the domains by a cache to
satisfy the request when the subpage is in the
specified state in that cache. The messages are
defined below. When an entry includes "->state", the
local cache sets the subpage to that state after
receiving a successful response to the message.
The Domain Side
Caches send messages over the domains to
acquire copies of subpages in particular states.
Each message consists of a request type, a
descriptor, and the data for a subpage. The tables
shown in Figures 7, 7A, 7B, 7C, and 7D provide the
precise specification of how each cache responds to
messages on the domain. The tables are divided into
three sections: read operations, write operations,
and response operations. Each section includes the
definition of the operations. The tables give the
state that results when a cache with a subpage in a
specified state receives a particular message. In

addition to the states, the tables are annotated with
the following side effects and modifications:
respond The cache replies to the message with a
copy of the subpage.
error The cache signals an exception.
working-set If the LRU priority of the page is
lower than BS_High, then the subpage is
invalidated in favor of the requestor
unless the owner limit prohibits that
transition. Otherwise, as shown.
subcached? If the subpage is not subcached, then
it is invalidated in favor of the
requestor, unless the owner limit
prohibits that transition. Otherwise,
as shown.
owner-limit? If the value of descriptor.owner_limit
is Cache_owner_limit, then error. If
it is Domain0_owner_limit and the
source request is in a different
domain(0), then reject. Otherwise, as
specified.
update- If the subpage is subcached, remove it
flush? from subcache and respond to indicate
that the copy exists. If the subpage
is not subcached, invalidate it and do
not respond at all.
dom0-copy? If there are other copies in the local
domain(0), then if there is already a
copy in the cache, retain a read-only
copy. If there is no copy in the
cache, acquire a copy in read-only
state. Otherwise, if there is a copy,
invalidate it.
no change No change in state.
If both update-flush? and dom0-copy? are
specified, then either condition is sufficient to
retain a copy.

Read Messages
With particular reference to Figure 7A, read
operations are used to acquire the state necessary
for a processor operation. Once the subpage has been
'read' into the local cache, the operation can
proceed.
Most of the read messages are simply
requests for the subpage in a particular state, and
are named after the state. For example, read
atomic requests atomic state. There are two read
messages that have complex semantics: highest
read-only and highest nonexclusive. Highest
read-only searches the system in order of increasing
domain distance, and takes the strongest state
available in the nearest cache. If there are any
copies in the local domain(0), the copy in the
strongest state responds and invalidates itself.
Highest nonexclusive has similar semantics, except
that caches with subpages in read-only state do not
respond. Read one-time copy requests a copy of
subpage without changing subpage state.
Write Messages
Referring to Figure 7B, write operations are
used to send modified data out to other caches or to
force other caches to give up state.
Write Update
is sent by the nonexclusive owner when
the subpage is modified.
Write Invalidate
is sent by the nonexclusive owner when
the nonexclusive owner needs to acquire
exclusive state. (To acquire atomic
state, a nonexclusive owner uses write
invalidate to get exclusive state, and
then internally changes the state to
atomic before allowing any other cache
to request ownership.)
Write Exclusive Recombine
is used to expel a subpage that is in
transient exclusive state or which has
been explicitly expelled by a CPU
instruction. It is also sent by a
cache with a subpage in exclusive state
to find another cache to take
responsibility for the subpage on the
basis of LRU priority. Once one cache
with a descriptor has responded to this
message no other cache responds. If
the message has not been responded to,
and the cache has a descriptor for the
page with an LRU priority of less than
WS_Top, the cache sets the state to
exclusive owner and responds. This
message is used at the end of a
transaction to send a subpage to a
cache that has requested it during the
transaction. This message is also used
in LRU maintenance.
Write Non-Exclusive Recombine
is used to expel a subpage that is in
transient non-exclusive state or which
has been explicitly expelled by a CPU
instruction. It is also sent by a
cache with a subpage in non-exclusive
state to find another cache to take
responsibility for the subpage on the
basis of LRU priority. Once one cache
with a descriptor has responded to this
message no other cache responds. If
the message has not been responded to,
and the cache has a descriptor for the
page with an LRU priority of less than
WS_Top, the cache sets the state to
nonexclusive owner and responds. This
message is used in LRU maintenance.
Both of the recombine messages are limited
by descriptor.owner_limit. When descriptor.owner_limit
is Domain0_owner_limit, a recombine message is
not delivered outside of the originating domain(0).
When descriptor.owner_limit is Cache_owner_limit, a
recombine message is never sent. Note that the
indication "recombine?" indicates the LRU position
comparison described above.
Response Messages
With reference to Figures 7C and 7D,
response messages are sent by caches that respond to
read messages. The first table shows the action, if
any, taken for a response message by a cache that
already holds the subpage in the specified state.
The second table shows the action of a cache that is
awaiting a response to the specified type of request
for the subpage. There are two cases shown in the
tables:
1) A response may be detectable as an
error. For example, if a cache holds a
subpage exclusively and another cache
sends an exclusive response, there is
an inconsistency, and the holding cache
signals an exception.
2) If a cache has a descriptor for the
page but no copy of the subpage, it
will pick up a copy under some
conditions.
Descriptor Movement
When a cache receives a copy of a subpage in
invalid descriptor state, it initializes its
descriptor by copying most of the fields of the
descriptor on the source cache. LRU_position,
LRU_insert_index, subcache, subpage_state, held and
no_owner are never copied. Owner_limit is handled
specifically.

Processor Data Accesses and Domain Requests
A processor makes data requests to its local
cache to satisfy load and store instructions and
co-processor operations. A cache makes requests to
its local processor to force the processor to
invalidate its copy of a subpage in subcache.
Load and Store Instructions
A processor passes load and store
instructions to its local cache as requests when the
subpage containing the referenced address is not
present in the subcache in the required state. The
different types of load and store instructions pass
information to the local cache about the access
patterns of the following instructions. For example,
if the sequence of the instructions is a load
followed by a store, and the subpage containing the
data item is not yet resident in the local cache, it
is more efficient to acquire ownership for the load
than to get a read-only copy for the load instruction
and then communicate over the domains a second time
to acquire ownership for the store instruction.
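The saving described above can be rendered as a small decision routine. The following Python sketch is purely illustrative; the function and state names (state_to_request, "exclusive-owner") are our own and do not appear in the patent.

```python
# Illustrative sketch (not the patent's implementation): an
# access-pattern hint carried by the instruction form steers the
# state the local cache requests. Requesting ownership up front
# for a load-then-store sequence saves a second traversal of the
# domains.

def state_to_request(present_state, will_store_soon):
    """present_state: state already held locally, or None.
    will_store_soon: hint from the instruction form, e.g.
    load (exclusive) rather than load (default)."""
    if present_state is not None:
        return present_state        # already resident: keep state
    if will_store_soon:
        return "exclusive-owner"    # one round trip covers both
    return "read-only"              # plain load: a copy suffices
```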
The different forms of load and store
instructions are described below. Each description
begins with a brief summary of the semantics of the
instruction, and continues with a detailed
description of the cache's action.
All of the load instructions described here
have two forms: blocking and non-blocking. These
forms control the behavior of the load instructions
with respect to atomic state. If a processor
executes a blocking load instruction that references
a subpage in atomic state, that instruction will wait
until the subpage leaves atomic state before
proceeding. If a processor executes a non-blocking
load instruction that references a subpage in atomic
state, that instruction will acquire atomic state in
the local cache and proceed.
load (default) [ldd/cldd]
The program will continue the current
access pattern. If the subpage is
already present in the cache, the local
cache keeps the subpage in the same
state. If it is not already present,
the local cache requests the subpage in
read-only state. The ldbd/cldbd
forms of these instructions will block
if the subpage is in atomic state, and
wait for it to be released.
load (exclusive) [lde/clde]
The program will write the subpage in
the following instructions, and
exclusive state is preferable to
non-exclusive state. A program would
use this when the data was expected to
have little sharing, or when a series
of writes was upcoming, and it was
therefore worth extra work to acquire
exclusive state to avoid updating
read-only copies.
The local cache requests the subpage in
exclusive owner state. This allows the
processor to write the subpage without
updating read-only copies unless
another cache gets a copy of the
subpage in read-only state between the
load and the store. The ldbe/cldbe
forms of these instructions will block
if the subpage is in atomic state, and
wait for it to be released.
A particular example of the use of load
(exclusive) is per-program data such as
stacks. Generally, there will be no
read-only copies of such data, since
the only copy will be the one in use by
the program. However, if a program
moves from one processor to another,
the new processor's local cache will
have no copy, and the old processor's
local cache will continue to hold the
subpage in exclusive state. If the
program uses load (default), the local
cache acquires the subpage in non-
exclusive state, leaving a read-only
copy on the cache of the previous
processor, and requiring inefficient
updates.
Load (exclusive) instructions always
load the data into the subcache in the
same state that is acquired in the
cache.
store (default) [st/cst] The program is unlikely
to store into this subpage in the
following few instructions. The local
cache maintains the existing state on
the subpage.
If the subpage is atomic on some other
cache, then the local cache acquires
atomic state.
Store (default) instructions always
load the data into the subcache in
exclusive state.
load subpage (default) [ldspd/cldspd]
load subpage (exclusive) [ldspe/cldspe]
Load subpage is used to load a full
subpage of data into processor or
co-processor general registers. If the
subpage is present in the subcache, it
is loaded directly from the subcache.
If the subpage is not present in
subcache, it is loaded directly from
the local cell cache. An option to
these instructions specifies whether,
in the case that the data are loaded
from the cell cache, they are stored in
the subcache in addition to the target
registers. The ldspbe/cldspbe forms
of these instructions will block if the
subpage is in atomic state, and wait
for it to be released.
The load subpage (default) and load
subpage (exclusive) instructions have
corresponding semantics to the load
(default) and load (exclusive)
instructions.
load subpage (one time) [ldspo/cldspo]
Load subpage (one time) is used when
the program does not intend to make any
further references to the data in the
subpage at any time in the near
future. This instruction has no effect
on the state of the subcache or any of
the caches, except in some cases to set
transient state. If the data are
available in the subcache, they are
loaded from the subcache. If the data
are not available in the subcache, but
are available in the local cache, they
are loaded from the local cache without
being stored into the subcache. If the
data are not available in the local
cell at all, they are copied over the
domains without disturbing the existing
state.
store subpage (default) [stsp/cstsp]
store subpage (exclusive) [stspe/cstspe]
Store subpage is used to store a full
subpage of data from processor or
co-processor general registers. If the
subpage is present in the subcache, it
is stored directly into the subcache.
If the subpage is not present in
subcache, it is stored directly into
the local cell cache. An option to
these instructions specifies whether,
in the case that the data is stored
into the cell cache, it is also stored
in the subcache.
instruction fetch
Instruction fetches always fetch the
subpage containing the data in
read-only state.
Subpage Atomic State Instructions
The subpage atomic instructions are the
program interface to the get, stop, and release
operations described above. These instructions exist
in several forms to permit precise tuning of parallel
programs.
release subpage [rsp]
Release subpage is used to remove a
subpage from pending atomic or atomic
state. If the subpage is in pending
atomic state in the local cache, it is
set to invalid state. If the subpage
is not in pending atomic state in the
local cache, but is in atomic state in
some cache in the system, then it is
set into exclusive state from atomic
state in that cache. If the subpage is
in transient atomic state in that
cache, it is changed to transient
exclusive state and then it is
expelled, as per the release and expel
subpage instruction below. If the
subpage is not in pending atomic state
in the local cache and is not in atomic
state in any cache, then release
subpage has no effect.
release & expel subpage [reap]
Release and expel subpage has the same
semantics as release subpage, except
that if the subpage is changed from
atomic to exclusive state, it is always
expelled as if it were in transient
atomic state.
get subpage [gsp]
get subpage & wait [gspw]
get subpage, wait & load [gspwld]
get subpage, wait & load subpage [gwldsp]
Get subpage requests that a subpage be
set into atomic state. For all forms
of the get subpage instruction, if the
subpage is not in atomic state in any
cache, then the local cache acquires it
in atomic state.
For the get subpage instruction, if
the subpage is already atomic on the
local cache, then the instruction
signals an exception. If the subpage
is already atomic on some other cache,
then the instruction proceeds.
Programs must use the mcksp
instruction after gsp to determine if
the attempt to take the subpage into
atomic state succeeded.
For the other get subpage
instructions, if the subpage is already
atomic in any cache, the instruction
waits until the subpage is released.
The local cache then acquires the
subpage in atomic state.
The two load forms of the get subpage
instruction have the same semantics as
a gspw followed by the load
instruction. The only difference is
that the combined instructions are
faster.
stop subpage [ssp]
stop subpage & wait [sspw]
stop subpage, wait & load [sspwld]
stop subpage, wait & load subpage [swldsp]
Stop subpage sets the state of a
subpage in the local cache to pending
atomic.
Stop subpage & wait sets the state of a
subpage in the local cache to pending
atomic, and then blocks until the
subpage state changes from pending
atomic to atomic.
Stop subpage, wait, and load is an
indivisible combination of stop subpage
& wait and load (default).
Stop subpage, wait, and load subpage is
an indivisible combination of stop
subpage & wait and load subpage
(default).
release, expel & stop subpage [ressp]
wait, release, expel & stop subpage [wressp]
get subpage, wait, release, expel & stop subpage
[gwressp]
load, release, expel & stop subpage [ldressp]
Release, expel and stop subpage is an
indivisible combination of release &
expel subpage and stop subpage.
Wait, release & stop subpage is an
indivisible combination of waiting for
subpage to be in atomic state in local
cache, release & expel subpage and stop
subpage.
Get, wait, release, expel & stop
subpage is an indivisible combination
of get subpage & wait, release &
expel subpage and stop subpage.
Load, release and stop subpage is an
indivisible combination of load
(default), release & expel subpage and
stop subpage.
Other Subpage Instructions
Memcheck Subpage [mcksp]
Memcheck subpage checks the progress of
an asynchronous memory system
instruction for a subpage. It returns
two values: a binary indication of
whether an instruction was in progress
and the current subpage state.
Prefetch Subpage (copy) [pcspc, pdspc, pispc]
Prefetch Subpage (nonexclusive) [pcspn, pdspn, pispn]
Prefetch Subpage (exclusive) [pcspe, pdspe, pispe]
Prefetch Subpage requests that a copy
of a subpage be acquired on the local
cache in a specified state. Prefetch
subpage specifies whether or not the
subpage should be prefetched into the
processor's instruction or data
subcache. A subsequent load for the
subpage blocks until the prefetch
subpage has completed.
Prefetch Cache Page (copy) [pcpc]
Prefetch Cache Page (nonexclusive) [pcpn]
Prefetch Cache Page (exclusive) [pcpe]
Prefetch cache page requests that all
subpages of a page be acquired on the
local cache in the specified state.
A more detailed description of a preferred
memory instruction set, including processor load
instructions, processor store instructions, and page
manipulation instructions, is provided in Appendix F,
filed herewith.
Updates from the Subcache to the Cache
When a processor has a copy of a subpage in
subcache, and that subpage is owned by that
processor's local cache, the processor propagates
modifications to the subpage to its local cache as
follows:
If the local cache holds the subpage in
exclusive state, then the processor
propagates modifications to the cache when:
- the subpage is removed from
subcache, or
- the local cache receives a request
for a copy of the subpage. In this
case, the local cache explicitly
requests the updated copy.
- the processor is stalled. When the
processor is stalled, it updates
modified subpages in exclusive state
to its local cache.
If a processor's local cache holds the
subpage in non-exclusive state then the processor
propagates each modification as it is completed.
A processor propagates modified information
to its local cache with an Update data request.
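The propagation rules above amount to a small predicate. The following Python sketch is a hypothetical rendering only; the event names are our own invention, not terms from the patent.

```python
# Illustrative sketch of the subcache-to-cache update rules above.
# In non-exclusive state every modification is propagated as it
# completes; in exclusive state propagation is deferred until one
# of three events occurs.

def must_propagate(local_state, event):
    """Return True if a modified subcached subpage must be written
    back to the local cache when `event` occurs."""
    if local_state == "non-exclusive":
        return True                 # propagate each modification
    if local_state == "exclusive":
        # deferred: eviction, an external copy request, or a stall
        return event in ("evicted", "copy-requested", "stalled")
    return False
```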

Forcing Subcache Invalidation
A cache forces its local processor to remove
a subpage from subcache in order to invalidate the
subpage in response to a request from another cache.
Requests from One Cache to Another
In parallel to responding to requests from
its local processor, each cache responds to messages
from other caches delivered by its local domain(0).
There are three types of messages: read, write, and
response. A read message requests some other cache
to respond with the data for a subpage. Each read
message also requests a particular state, and both
the cache that responds with the data and other
caches with copies change the state of their copy of
the subpage in order to satisfy the state request. A
write message either supplies an updated copy of a
subpage to caches with read-only copies, or directs
other caches to change the state of their copies. A
response message is sent in response to a read
message. Caches other than the original requestor
take actions on response messages as specified below.
It is important to note that read and write
messages do not correspond to load and store
instructions. Both load and store instructions
result in read messages to acquire a copy of the
subpage in the appropriate state. A particular
store instruction will not result in an immediate
write message unless the subpage is held in
nonexclusive state.
Cache Page Usage and Replacement
The caches of a KSR system can be used by
system software as part of a multilevel storage
system. In such a system, physical memory is
multiplexed over a large address space via demand
paging. The caches include features that accelerate
the implementation of a multi-level storage system in
which software moves data between the caches and
secondary storage in units of SVA pages.
Caches as Two Storage Levels
All of the caches together make up a
system's primary storage. However, for some
purposes, it is necessary to treat each individual
cache as an independent primary store. This is
because each cache can only hold a limited number of
pages: 4096 pages in each cache, and 32 in any
particular set. Since each page can only be stored
in one set in each cache, a cache signals an
exception when a full set prevents it from allocating
a descriptor for a page. When such an exception is
signalled, software must take action to make room in
the full set.
When a particular set of a particular cache
is full, there is no reason for software to assume
that the entire memory system is correspondingly
full. Thus, it is desirable for software to respond
to a full set by moving a page from that set to the
corresponding set of another cache. In taking this
action, software treats the rest of the memory system
as an additional level of storage between a
particular cache with a full set and secondary
storage, called backing store.
Strategies for Backing Store Management
Referring to Figure 8, in order to use
memory efficiently, software must use some strategy
for identifying an appropriate page to remove from a
full set, and an appropriate target cache, if any, for
the page. The caches include facilities that
accelerate a class of strategies for this page
replacement. To accelerate software's selection of a
page for replacement within a set, each cache
approximately orders the pages from Most Recently
Used (MRU) to Least Recently Used (LRU). When a
page is referenced, it moves to MRU. As other pages
are referenced thereafter, it ages toward LRU. The
LRU information accelerates strategies that replace
the least recently used page.
To accelerate software's selection of a
target cache for replacement between caches, each
cache maintains an approximate measurement of the
working set of the cache. The working set is a
measurement of the number of pages which are in
steady use by programs running on a cache's local
processor over time, as distinct from pages
referenced a few times or infrequently. Software
measures the working set of each cache as a point
between MRU and LRU. Pages above the working set
point are in the working set, while pages below have
left the working set. The working set information
accelerates a software strategy that treats the
non-working set portion of each cache's memory as
system backing store.
Automatic Page Movement and Removal
A new page arrives when a cache's local
processor references data in a subpage that is in
invalid descriptor state. If the corresponding page
is resident elsewhere in the system, the cache will
copy its descriptor and the referenced subpage from
another cache. Eventually, this process will fill up
cache sets. When data is extensively shared by
programs running on multiple processors, this is a
very frequent event. Therefore, each cache includes
facilities to automatically move and remove pages in
parallel with other computation to avoid the need for
frequent software intervention required by full
sets. These facilities use the LRU and working set
information to:
recombine pages
To recombine a page is to collect all
of its subpages in one cache, and free
the descriptors in other caches.
drop pages
To drop a page is to remove an
unmodified page from all caches.
All of these automatic actions can be tuned
or disabled by software, and are inhibited by
descriptor.held. The following sections describe the
circumstances in which the caches recombine, move,
and drop pages.
Recombining Pages
Each cache recombines pages from LRU up to
the working set point. A page below the working
set point is significantly less likely to be
referenced again than if it were above the
working set point. Therefore, each cache recombines
pages when they pass the working set point. A cache
uses the write exclusive recombine or write
non-exclusive recombine message for each subpage to
recombine a page. If the recombine
messages fail to find another cache to
take over the page, the recombining
cache retains the data. In
effect, it has found itself as the target of the
recombine. Since pages are recombined as soon as
possible after they leave the working set, any other
cache with copies is likely to have the page in the
working set. (Were it not in the working set in some
other cache, that cache would have recombined it.)
Since pages below the working set point are
less likely to be referenced, most of the recombines
that actually move data to another cache will be of
pages that have recently left the working set. The
pages below the working set that get recombined
elsewhere will be pages that have been referenced
since they left the working set, and so the recombine
moves them to the cache that referenced them.
Dropping Pages
Caches invalidate subpages and pages to make
room for other pages. This is called dropping. A
cache will automatically drop a page which is below
the working set point and which has subpages in
read-only or invalid state (a read-only page). If a
cache has no free descriptors and cannot allocate a
descriptor by recombining a page, it will drop a
read-only page that is not subcached and is anywhere
in the MRU-LRU order. If a cache has no read-only
pages, it will drop an unmodified page with subpages
in invalid, read-only, or exclusive state that is not
subcached and is in the lower portion of the working
set, as defined by the WS_Low register defined below.
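The automatic drop tiers above can be sketched as a selection function. The dictionary field names and the numeric treatment of LRU slots here are assumptions for illustration, not the patent's representation.

```python
# Illustrative sketch of the automatic drop rules. Higher "lru"
# numbers mean closer to the LRU end; ws_low stands in for the
# WS_Low slot. Field names are our own.

def drop_candidate(pages, ws_low):
    """First tier: any non-subcached page whose subpages are all
    read-only or invalid (a read-only page), anywhere in the
    MRU-LRU order. Second tier: an unmodified, non-subcached page
    with only invalid/read-only/exclusive subpages at or below
    the WS_Low slot. Returns None if nothing qualifies."""
    def read_only(p):
        return all(s in ("read-only", "invalid") for s in p["subpages"])

    for p in pages:
        if read_only(p) and not p["subcached"]:
            return p
    for p in pages:
        if (not p["modified"] and not p["subcached"]
                and all(s in ("invalid", "read-only", "exclusive")
                        for s in p["subpages"])
                and p["lru"] >= ws_low):
            return p
    return None
```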

Software Working-Set-Related Actions
The following sections describe a software
strategy that makes use of the cache facilities to
implement a multi-level store.
Modified Pages
When a modified page crosses the working set
point, system software undertakes to write it to
disk, so that it will be pure by the time that it
reaches LRU. Since only some of the caches will be
able to write a given page, by virtue of having a
physical connection to the appropriate disk drive,
modified pages must be migrated to the cache in a
cell with an XIU connected to the secondary storage
device for the page. Software moves pages to be
written with copy or chng instructions.
Software sometimes has knowledge of the
expected reference pattern for some data. If
software expects a page to be referenced only once,
that page should be put into the LRU order someplace
below MRU, to avoid displacing information that is
more likely to be used later. In particular, if
software is prefetching data that it won't reference
for a while, and perhaps not reference it at all, it
should not be inserted at MRU. Software controls the
insertion point by setting the appropriate
LRU_insert_index in the prefetch and chng
instructions.
Each cache maintains LRU state for all of
the resident pages. The LRU data is maintained
separately for each of the 128 sets of the descriptor
associative memory, and orders the 32 pages in the
set according to their approximate time of last
reference.
Basic LRU Maintenance
Each cache maintains an LRU-MRU ordering
of the descriptors in each set. The ordering is
maintained in descriptor.LRU_priority. Each of the
descriptors in a set has a value from 0 (MRU) to 31
(LRU) in descriptor.LRU_priority. Conceptually, when
a page is referenced it moves to MRU. All of the
other pages from MRU down to the referenced page's
LRU priority then move down.
When a page is first subcached,
descriptor.LRU_priority is set to zero, which inserts
it at MRU. When the last subcached subpage of a page
is evicted from the subcache, descriptor.LRU_priority
is set as specified by descriptor.LRU_insert_index.
The insert index selects an entry in
LRU_insert_table, a per-cache table described below.
descriptor.LRU_priority is set to
LRU_insert_table[descriptor.LRU_insert_index]
and the descriptor.LRU_priority is changed for other
descriptors as appropriate to accommodate it. Note
that if an entry of the LRU_insert_table is set close
enough to MRU, then pages evicted from subcache will
be inserted closer to MRU than pages in subcache.
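The bookkeeping above (0 = MRU, 31 = LRU, with eviction-time insertion through LRU_insert_table) can be sketched as follows. The dictionary of per-page priorities is our own simplification of the per-set hardware state.

```python
# Illustrative sketch of the LRU maintenance described above.
# `priorities` maps page -> LRU priority, 0 = MRU, 31 = LRU.

def touch(priorities, page):
    """On reference, move `page` to MRU; pages that were between
    MRU and its old slot each age down by one."""
    old = priorities[page]
    for p in priorities:
        if priorities[p] < old:
            priorities[p] += 1
    priorities[page] = 0

def insert_on_evict(priorities, page, insert_table, insert_index):
    """On subcache eviction, place the page at the slot selected
    by insert_table[insert_index]. Assumes the page currently sits
    at MRU (priority 0), as a just-subcached page does; the pages
    it skips over each move one slot toward MRU."""
    target = insert_table[insert_index]
    old = priorities[page]
    for p in priorities:
        if old < priorities[p] <= target:
            priorities[p] -= 1
    priorities[page] = target
```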
Working Set Measurement
Each cache has an array of 32 working set
rate counters, each 16 bits wide. The counters freeze
at 2^16 - 1. When a page is subcached, the bucket
corresponding to its current LRU position is
incremented. By periodically reading and clearing
the buckets, software can determine the approximate
working set size. Software can attempt to maintain
the working set for each schedulable entity, or it
can just run a set of entities on a cache and
maintain the aggregate working set. The latter incurs
less cost at scheduling time.
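The counter mechanism can be sketched as below. The read-and-clear estimator is one plausible software-side use of the buckets, not a procedure the text prescribes.

```python
# Illustrative sketch: 32 saturating 16-bit counters, bucketed by
# the LRU position a page occupies at the moment it is subcached.

SAT = 2 ** 16 - 1   # counters freeze at 2^16 - 1

def record_subcache(counters, lru_position):
    """Bump the bucket for this LRU slot, saturating at 2^16 - 1."""
    if counters[lru_position] < SAT:
        counters[lru_position] += 1

def working_set_estimate(counters, threshold=1):
    """Read-and-clear: report the deepest LRU slot that saw recent
    subcache activity as an approximate working-set size."""
    deepest = max((i for i, c in enumerate(counters) if c >= threshold),
                  default=-1)
    for i in range(len(counters)):
        counters[i] = 0
    return deepest + 1
```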
Subcached pages complicate working set
measurement. The LRU value of a page can change any
time that some other page moves into the cache.
However, the LRU numbers do not consider whether or
not a page is subcached. Instead, all the hardware
mechanisms which use LRU consider all subcached pages
as one LRU level, and all non-subcached pages at
various other LRU levels.
LRU Insert Table
The LRU_insert_table maps from four logical
points in the LRU-MRU sequence to the 32 actual
slots in the set. The four slots are named:
1. WS_High - conventionally set close to
or at MRU.
2. WS_Low - a point low in the working set
of the cache. When software prefetches
low priority data into the cache, it is
usually inserted at this point. This
allows software to 'over prefetch'
without displacing more important data,
as inserting it at WS_High would. See
the description of prefetch strategy
below.
3. BS_High - the boundary between working
set and backing store. When software
migrates a page into backing store in a
cache it inserts it here.
4. BS_Low - an insertion point for
low-priority backing store items.

Descriptor Allocation Actions
When a new descriptor in a set is needed,
the cache proceeds through as many of the following
actions as needed to find a usable descriptor:
1. Looks for an invalid descriptor. If
one exists, it uses it.
2. Drops copies. If ctl$configuration.cde
is 1, the cache searches up from LRU,
looking for any page which has only
read-only subpages. If it finds one
that is not held (descriptor.held is 0)
and not subcached, it invalidates it and
uses it. The scan stops at BS_High.
3. Looks for an opportunity to do a
recombine. If ctl$configuration.are is
1, the cache scans up from LRU looking
for a page that has at least one owned
subpage, has descriptor.held of zero,
does not have descriptor.owner_limit of
Cache_owner_limit, and is not
subcached. The scan stops at BS_High.
If the scan finds one, the cache sends
a Write Recombine Exclusive or Write
Recombine Nonexclusive message as
appropriate for each of its subpages
which is owned. If all of the subpages
are successfully passed off, then the
descriptor is invalidated and used.
Software can disable this feature by
setting Recombine_high_limit to MRU.
4. Drops pure pages. If
ctl$configuration.ade is 1, the cache
scans up from LRU looking for pages
that are:
not modified
have only exclusively owned or
read-only subpages
not atomic modified
not held
not subcached.
5. Looks for a pure page (which is
subcached) to drop.
6. Signals a no_descriptor_available
exception if it cannot free a
descriptor by the means described above.
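The six-step fallback can be sketched as an ordered chain of strategies. The predicates and the default BS_High slot here are illustrative assumptions; the real checks involve hardware state (ctl$configuration bits, descriptor.held, owner_limit) that this sketch only stands in for.

```python
# Illustrative sketch of descriptor allocation: try each strategy
# in order, signalling the exception if none frees a descriptor.

def allocate_descriptor(strategies, pages):
    """Apply each strategy (steps 1-5 above) in order until one
    yields a usable descriptor; otherwise raise, standing in for
    the no_descriptor_available exception."""
    for strategy in strategies:
        d = strategy(pages)
        if d is not None:
            return d
    raise RuntimeError("no_descriptor_available")

# One strategy, roughly step 2 (drop a read-only copy):
def drop_read_only_copy(pages, bs_high=24):
    """Scan up from the LRU end toward BS_High (an assumed slot
    number) for an unheld, non-subcached page whose subpages are
    all read-only. The scan stops at BS_High."""
    for p in sorted(pages, key=lambda p: -p["lru"]):   # LRU end first
        if p["lru"] < bs_high:
            break                       # reached BS_High: give up
        if (not p["held"] and not p["subcached"]
                and all(s == "read-only" for s in p["subpages"])):
            return p
    return None
```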
Background Recombine
The cache uses otherwise idle time to
recombine pages. If Recombine_high_limit is not 31,
then the background task scans across the sets
looking for pages in each set that can be recombined
and recombines them.
Allocation recombines and automatic
recombines will generally recombine pages as they
leave the working set. There will be few
recombinable pages in backing store. If at some time
there are no recombinable pages in a cache, new
recombinable pages will appear in the form of pages
dropping out of the working set. If background and
allocation recombines keep up with this rate, the
only source of recombinable pages in the backing
store will be references by other caches to pages in
backing store. The referencing caches' descriptors
will probably be in their working sets, and so
recombining the pages to them is appropriate.
Software LRU Insertion Bias
Software can bias LRU insertion with
instructions. Some of the control instructions
described below include an LRU_insert_index field.
When one of these instructions moves the first
subpage for a page into subcache, the
LRU_insert_index in the instruction replaces
descriptor.LRU_insert_index for that page. Then,
when the page leaves subcache, it is inserted into
the LRU according to the specified entry in the
LRU_insert_table. The chng instruction and the
various prefetch instructions (described below)
permit the programmer to specify an
LRU_insert_index. When a cache executes one of these
instructions for a subpage of a page which has no
valid subpages, the specified LRU_insert_index
becomes descriptor.LRU_insert_index.
The LRU insert index specified in a control
instruction replaces the index previously stored in
the descriptor. Once the index is set in a cache, it
persists in that cache until it is reset or all of
the subpages become invalid. In effect, the memory
system has a limited memory for LRU bias
information. If one program indicates that a page is
likely to be referenced again, and shortly thereafter
another program indicates that it is unlikely to be
referenced again, the second indication persists.
System software can establish a different default
behavior. For example, system software might include
a background task that changes
descriptor.LRU_insert_index to WS_High for pages
below BS_High. This would implement a policy as
follows: once a page had aged to backing store, any
LRU insert index information was too old to be
valuable, and the ordinary default should apply
instead.
Prefetching requires care in the use of LRU
insertion bias. It is desirable for software to be
able to 'over prefetch,' to prefetch any data that it
may need. To avoid driving higher priority
information out of the LRU, the over-prefetched pages
should be fetched with an LRU_insert_index other than
WS_High.

Figure 9 depicts an exemplary domain routing
unit 28F constructed according to a preferred
practice of the invention. The unit 28F includes
domain directory section 80 and remote fiber
interface section 82 interconnected via cache bus
76. Directory section 80 includes dual routing
control units 84A and 84B coupled to memory stores
86A and 86B, as illustrated. The stores comprise 8
Mbyte dynamic random access memory elements arranged
for storing lists of the descriptors identifying data
maintained in the domain segments which descend from
the upper level domain segment to which the
illustrated routing unit 28F is attached. Routing
control units 84A and 84B are constructed and operate
similarly to the cache control units 74A, 74B, 74C
and 74D, described above. The units additionally
include hash encoding logic for controlling storage
and access of descriptors within stores 86A and 86B.
This encoding logic, as well as the descriptor
storage and access mechanisms, are conventional in
the art.
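A rough software analogue of the hash-encoded descriptor directory (assumed and, as the passage notes, conventional): page addresses hash to a bucket, and each bucket lists the descendant domain segments holding that page. The names, bucket count, and modular hash are illustrative only.

```python
NUM_BUCKETS = 1024   # illustrative; the actual stores are 8-Mbyte DRAM elements

class DescriptorDirectory:
    def __init__(self):
        self.buckets = [[] for _ in range(NUM_BUCKETS)]

    def _bucket(self, page_addr):
        # Hash encoding selects the bucket holding the descriptor list.
        return self.buckets[page_addr % NUM_BUCKETS]

    def add(self, page_addr, segment_id):
        self._bucket(page_addr).append((page_addr, segment_id))

    def lookup(self, page_addr):
        # Descendant segments known to hold data for this page.
        return [seg for addr, seg in self._bucket(page_addr) if addr == page_addr]

directory = DescriptorDirectory()
directory.add(0x1000, "segment 12F")
```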
The remote fiber interface section 82
includes remote interface unit 88, coupled with fiber
receiver and decoding section 90 and with fiber
encoding and transmitting section 92, in the manner
illustrated in Figure 9. The receiver 90 interfaces
the incoming fiber optic line 94, while the
transmitter 92 interfaces the outgoing line 96. In
addition to buffering information signal
transmissions, the unit 88 provides CRC encoding and
decoding for the fiber optic link. Receiver 90 and
transmitter 92 are constructed in accord with
techniques conventional in the art.
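The CRC framing described above can be sketched as follows. The width and polynomial here (CRC-8, polynomial 0x07) are illustrative assumptions; the patent does not specify which CRC the fiber link uses.

```python
def crc8(data: bytes, poly: int = 0x07) -> int:
    """Bitwise MSB-first CRC-8 with zero initial value and no output XOR."""
    crc = 0
    for byte in data:
        crc ^= byte
        for _ in range(8):
            # Conventional shift-and-xor form of CRC division.
            crc = ((crc << 1) ^ poly) & 0xFF if crc & 0x80 else (crc << 1) & 0xFF
    return crc

frame = b"payload"
encoded = frame + bytes([crc8(frame)])   # transmitter appends the checksum
```

With this formulation the receiver can verify a frame by computing the CRC over the payload and appended checksum together, which yields zero for an error-free transmission.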

While the illustrated domain routing unit
28F is specifically configured to interface remote
domain segments (see, for example, segments 12F and
14B of Figure 1), it will be appreciated that direct
interconnect units (i.e., those domain routing units
which provide interconnection between non-remote
segments, e.g., segments 14A and 12A of Figure 1) are
similarly constructed. In such units, the remote
fiber interface section 82 is replaced by a local
interface section, which provides buffering for
transmissions between respective domain segment buses.
Summary
It is seen that the aforementioned objects
are met by the invention, embodiments of which are
described above, which provides a digital data
processing system comprising a plurality of
processing cells arranged in a hierarchy of rings,
which selectively allocates storage and moves
exclusive data copies from cell to cell in response
to access requests generated by the cells, and which
employs routing elements to selectively broadcast
data access requests, updates and transfers on the
rings. A multiprocessing system constructed in
accord with the invention features improved data
coherency, reduced latency and bus contention, as
well as unlimited scalability.
It will be appreciated that the embodiments
depicted in the drawings and described above are
illustrative only and that those skilled in the art
may make changes in the illustrated constructions and
sequences without departing from the scope of the
invention. Thus, for example, special purpose
processing units capable of generating access
requests may be substituted for the illustrated
central processing units, remote cells and domains
may be coupled by media other than fiber optic
lines, and so forth.
Accordingly, what we claim is:


Administrative Status


Event History

Description Date
Time Limit for Reversal Expired 2004-12-13
Letter Sent 2003-12-12
Inactive: Cover page published 2000-12-13
Inactive: CPC assigned 2000-12-12
Grant by Issuance 2000-12-12
Inactive: CPC assigned 2000-12-12
Inactive: First IPC assigned 2000-12-12
Inactive: IPC assigned 2000-12-12

Abandonment History

There is no abandonment history.

Fee History

Fee Type Anniversary Year Due Date Paid Date
MF (category 1, 2nd anniv.) - standard 2002-12-12 2002-11-19
Owners on Record

Note: Records show the ownership history in alphabetical order.

Current Owners on Record
SUN MICROSYSTEMS, INC.
Past Owners on Record
BENSON I. MARGULIES
FREDERICK D. WEBER
HENRY III BURKHARDT
LINDA O. LEE
NATHAN GOODMAN
STEVEN J. FRANK
Past Owners that do not appear in the "Owners on Record" listing will appear in other documentation within the application.
Documents


List of published and non-published patent-specific documents on the CPD.



Document Description        Date (yyyy-mm-dd)   Number of pages   Size of Image (KB)
Claims                      2000-12-12          21                1,248
Drawings                    2000-12-12          11                373
Abstract                    2000-12-12          1                 14
Cover Page                  2000-12-12          1                 20
Descriptions                2000-12-12          79                3,307
Maintenance Fee Notice      2004-02-08          1                 175
PCT Correspondence          2000-11-07          1                 37
Prosecution correspondence  1992-10-08          355               25,061