Language selection

Search

Patent 2398579 Summary

Third-party information liability

Some of the information on this Web page has been provided by external sources. The Government of Canada is not responsible for the accuracy, reliability or currency of the information supplied by external sources. Users wishing to rely upon this information should consult directly with the source of the information. Content provided by external sources is not subject to official languages, privacy and accessibility requirements.

Claims and Abstract availability

Any discrepancies in the text and image of the Claims and Abstract are due to differing posting times. Text of the Claims and Abstract are posted:

  • At the time the application is open to public inspection;
  • At the time of issue of the patent (grant).
(12) Patent: (11) CA 2398579
(54) English Title: INTERMEDIATE VOICE AND DTMF DETECTOR DEVICE FOR IMPROVED SPEECH RECOGNITION UTILIZATION AND PENETRATION
(54) French Title: DISPOSITIF INTERMEDIAIRE DE DETECTION VOCALE ET DTMF POUR AMELIORER L'UTILISATION ET LA PENETRATION DE LA RECONNAISSANCE DE LA PAROLE
Status: Expired
Bibliographic Data
(51) International Patent Classification (IPC):
  • H04M 3/42 (2006.01)
  • G10L 15/26 (2006.01)
  • H04Q 3/42 (2006.01)
  • H04Q 3/62 (2006.01)
  • G10L 15/00 (2006.01)
(72) Inventors :
  • ERB, PAUL ANDREW (Canada)
  • SCHULZ, DIETER (Canada)
(73) Owners :
  • MITEL NETWORKS CORPORATION (Canada)
(71) Applicants :
  • MITEL KNOWLEDGE CORPORATION (Canada)
(74) Agent: PERRY + CURRIER
(74) Associate agent:
(45) Issued: 2005-11-15
(22) Filed Date: 2002-08-19
(41) Open to Public Inspection: 2004-02-19
Examination requested: 2002-08-19
Availability of licence: N/A
(25) Language of filing: English

Patent Cooperation Treaty (PCT): No

(30) Application Priority Data: None

Abstracts

English Abstract

A telephone system is provided comprising a plurality of telephone devices, a call control for establishing calls between the telephone devices in response to receipt of commands, at least one Voice and DTMF detector (VDD) resource allocated to the calls between the telephone devices for (i) detecting and transmitting DTMF tones received from the telephone devices to the call control for further call processing, and for (ii) detecting and buffering voice received from the telephone devices and notifying the call control. At least one Speech Recognition Engine is also provided for receiving the voice from theVoice and DTMF detector resource upon notification of the call control and in response performing speech recognition and notifying the call control for further call processing.


French Abstract

Un système téléphonique est fourni comprenant une pluralité de dispositifs téléphoniques, un contrôle d'appel pour établir des appels entre les dispositifs téléphoniques en réponse à la réception de commandes, au moins une ressource de détection Voix et DTMF (VDD) allouée aux appels entre les dispositifs téléphoniques pour (i) la détection et la transmission de tonalités DTMF reçues à partir des dispositifs téléphoniques au contrôle d'appel pour un traitement d'appel supplémentaire, et pour (ii) la détection et la mise en mémoire tampon de la voix reçue à partir des dispositifs téléphoniques et la notification du contrôle d'appel. Au moins un moteur de reconnaissance vocale est également prévu pour recevoir la voix depuis la ressource de détection Voix et DTMF sur notification du contrôle d'appel et en réponse en effectuant la reconnaissance vocale et en notifiant le contrôle d'appel pour un traitement d'appel supplémentaire.

Claims

Note: Claims are shown in the official language in which they were submitted.





What is Claimed is:
1. A telephone system comprising:
a plurality of telephone devices;
a call control for establishing calls between said telephone devices in
response to
receipt of commands from said telephone devices;
at least one Voice and DTMF detector resource automatically allocated to said
calls
between said telephone devices for (i) detecting and transmitting DTMF tones
received from said telephone devices to said call control for further call
processing,
and (ii) for detecting and buffering voice received from said telephone
devices and
notifying said call control; and
at least one Speech Recognition Engine for receiving said voice from said at
least one
Voice and DTMF detector resource only upon notification of said call control
and in
response performing speech recognition and notifying said call control for
further call
processing.
2. The telephone system of claim 1, wherein said at least one Voice and DTMF
detector
resource further comprises:
a DTMF detector for detecting DTMF tones received from said telephone devices;
a voice detector for detecting voice received from said telephone devices;
a voice buffer for buffering voice received from said telephone devices; and
decision logic connected to said DTMF detector and said voice detector for
detecting
between voice and DTMF tones received from said telephone devices and in
response
notifying said call control, whereupon said voice buffered within said voice
buffer is
transmitted to said at least one Speech Recognition Engine.
7




3. The telephone system of claim 2, wherein said voice buffer is one of either
a circular
buffer or shift register buffer for continuously updating samples of said
voice such
that each new sample over-writes an oldest buffered sample.
4. The telephone system of claim 2, wherein said at least one Voice and DTMF
detector
resource further comprises a narrow band frequency detector for preventing
spurious
switching of said voice to said at least one Speech Recognition Engine.

Description

Note: Descriptions are shown in the official language in which they were submitted.


CA 02398579 2002-08-19
INTERMEDIATE VOICE AND DTMF DETECTOR DEVICE FOR IMPROVED
SPEECH RECOGNITION UTILIZATION AND PENETRATION
Field of the Invention
The present invention relates in general to voice recognition systems for
telephony,
and more specifically to a method and apparatus for providing voice and tone
detection prior
to allocation of a speech recognition engine to a call.
Background of the Invention
The integration of speech recognition into modern day PBX systems provides new
user interface capabilities to augment traditional telephone device DTMF tones
and 'feature'
keys for call control. Speech recognition capabilities may be provided through
the allocation
of speech recognition engines (SREs) to a call in progress. For example, PBX
systems
manufactured by Mitel Networks Corporation may be configured with a number of
ports for
allocating Speak@EaseTM SRE resources. Each SRE resource is a general purpose
"device"
which provides all speech recognition and related capabilities (which may be
composed of
one or more processes). These capabilities include, but are not limited to,
voice detection,
DTMF detection, voice recognition, and application processing.
As speech recognition becomes more common, it is anticipated that a much
larger
number of SRE resources will be required to accommodate increased utilization.
The
provisioning of additional SRE resources to meet anticipated usage increases
the overall cost
of a PBX installation. As a result, the potential penetration of speech
recognition applications
is subject to cost considerations and is limited except where cost justified.
According to the existing state-of the-art, SRE resources are associated with
a call
whenever there is a potential need for speech recognition, regardless of
whether speech
recognition is actually invoked during the call. Consequently, PBX systems are
now
configured with a plurality of SRE resources that are dedicated to servicing
one or more
speech recognition applications, in a PBX network. When all of the SRE
resources are in use,
subsequent requests for the supported speech recognition applications are
denied or deferred

CA 02398579 2002-08-19
until an SRE resource becomes available. When the SRE is servicing a user, all
capabilities
are provided, regardless of utilization. For example, if a user initiates a
request for which an
SRE is allocated and simply dials digits at the telephone device (i.e. dialing
the destination
number rather than speaking the name) then the full capabilities of the SRE
are underutilized.
However, as indicated above call control allocates the SRE resource whenever
speech
recognition may be required, regardless of actual utilization.
Summar~of the Invention
According to the present invention, a voice and DTMF detector resource (VDD)
is
allocated to a call prior to allocating an SRE resource. The SRE resource is
only allocated
when speech recognition capabilities are required. The Voice and DTMF detector
resource
(VDD) is a limited capability digital signal processor that can be provided in
volume at
relatively low cost (using existing Digital Signal Processing (DSP)
technology). The presence
or absence of the Voice and DTMF detector resource (VDD) does not impact the
SRE
resource.
Brief Descr~tion of the Detailed Drawing
A description of a preferred embodiment of the present invention is provided
herein
below with reference to the accompanying drawings in which:
Figure 1 is a block diagram of a telephony system incorporating a plurality of
intermediate Voice and DTMF detector resources (VDDs) and Speech Recognition
Engines
(SREs), according to the present invention; and
Figures 2 shows the structure and operation of a VDD in accordance with the
preferred embodiment.
Detailed Description of Preferred Embodiments
Turning to Figure 1, a telephony system is shown incorporating the principles
of the
present invention. Specifically, a plurality of telephones 1A, 1B, etc. are
provided for
2

CA 02398579 2002-08-19
conducting telephone calls between parties under control of call control 3. In
accordance with
the present invention, a plurality of Voice and DTMF detector resources (VDDs)
SA, SB...SF
may be allocated to an originating call by call control 3 in a manner similar
to a traditional
DTMF receiver. Each VDD SA, SB, etc. is a special purpose digital signal
processing (DSP)
resource which provides DTMF detection, voice detection, voice buffering and
voice
streaming, as discussed in greater detail below. A plurality of SRE resources
7 is also
provided, as is known in the art.
In operation, call control 3 allocates a VDD (e.g. VDD SA) in response to one
of the
telephones (e.g. phone 1A) initially going off hook. The SRE resources 7 are
not immediately
allocated to the call, in contrast with prior art systems. Upon detection of
DTMF tone, the
VDD SA performs digit collection and reporting as is done by a traditional
DTMF receiver.
Upon detection of voice, however, the VDD SA provides an indication to call
control 3. Call
control 3 then allocates an SRE resource 7, establishes a connection between
the VDD SA
1 S and the SRE resource 7, and requests the VDD SA to begin streaming
buffered voice to the
connected SRE resource 7 (while buffering the received voice). The SRE
resource 7 receives
voice from the VDD SA as though connected directly to a telephone device and
performs the
usual speech recognition operations.
If an SRE resource 7 is unavailable, a warning is provided to the user by call
control
3. The user is prompted to be patient until an SRE resource 7 becomes
available or,
alternatively, to try their request later (i.e. defernng or rejecting the
request, respectively).
However, in the event that the VDD SA contains sufficient memory to buffer up
to ten
seconds of voice, (e.g. by applying real-time voice compression algorithms),
call control 3 is
configured to perform a second attempt to obtain an SRE resource 7 prior to
deferring or
rejecting the user request.
The VDD resource SA purges its voice buffer upon receipt of an instruction
from call
control 3.
Turning to Figure 2, the VDD consists of a DTMF receiver 10, a voice detector
12,
and optionally a narrow band frequency detector 14 to guard against spurious
triggers when
tones are present. The output of the detectors is fed to a decision logic
block 16 that
3

CA 02398579 2002-08-19
determines whether a voice phrase or a DTMF tone is present. In parallel with
the foregoing
blocks, the voice is buffered in a voice buffer 18, to compensate for
latencies in the detection
process and delays in switching in the SRE resource.
The voice buffer 18 is continuously updated such that the newest sample
overwrites
the oldest. This can be implemented either by a circular buffer or a shift
register buffer, both
of which are commonly known in the art. The length of the buffer depends on
the inherent
latencies of the detectors, the delays in the system and the typical time the
user wishes to
allow for the system to respond.
Upon detection of a DTMF digit, the digit is reported to call control 3. When
voice is
detected, the delayed voice stream is switched at 20 to an SRE 7 for speech
recognition.
Additional speech recognition functions can be performed by the VDDs SA, SB,
etc.,
such as voice end-pointing and hot word spotting, in order to further off load
the
requirements of the SRE's 7.
Exemplary pseudo-code for implementation of the DTMF detector 10, is as
follows:
DO Forever
Receive Signal
IF DTMF detected
Send DTMF detected event to decision logic
END
ENDDO
Exemplary pseudo-code for implementation of the voice detector 12, is as
follows:
DO Forever
Receive Signal
IF Voice detected
Send Voice detected event to decision logic
END
ENDDO
Exemplary pseudo-code for implementation of the optional narrow-band frequency
detector, is as follows:
DO Forever
Receive Signal
4

CA 02398579 2002-08-19
IF Narrow Band Freq detected
Send Narrow Band Freq detected event to decision logic
END
ENDDO
Exemplary pseudo-code for implementation of the decision logic 16, is as
follows:
DO Forever
IF DTMF detected
Send DTMF detected event to client
ELSE IF Narrow band freq detected (optional)
Do nothing
ELSE IF Voice detected
Send Voice detected event to client
END
ENDDO
A person of ordinary skill in the art will appreciate that by utilizing the
VDD
resources of the present invention, substantial savings may be obtained in the
cost of
deploying SRE resources. For example, in a networked telephone system having
1000 users
and allowing for 20% of the users to actively initiate call re-direction, 192
VDD resources
may be provided (6 banks X 32 VDDs per bank) for initial call allocation.
Allowing for 10%
of these calls to require actual speech recognition services, then 16 SRE
resources are needed.
The cost of 192 VDDs and 16 SREs is significantly less than the cost of 192
SRE resources.
It will be appreciated that, although embodiments of the invention have been
described and illustrated in detail, various changes and modifications may be
made. For
example, although the preferred embodiment utilizes a Mitel Speech Recognition
Engine
(sold under the name SpeakEasy or eak o asel, other SRE resources may be used.
Furthermore, the telephones 1A, 1B, etc. may be any suitable voice
communication device
such as a Plain Old Telephone Set (POTS), mobile phone, IP Phone, PC with
voice
communication functionality, etc. Also, the principles of the invention may be
applied to
other applications than speech recognition enhanced telephone directory
services. For
example, the invention may be used to advantage in a Call Center and/or
Interactive Voice
Response (IVR) application where a user may be prompted for information and
DTMF or
speech recognition is used to obtain caller responses. The invention as
described above has
been applied specifically to voice applications in the PBX domain. However,
the invention
may also be applied equally to voice applications in the CO domain and in
mixed media
5

CA 02398579 2002-08-19
communications, in either domain. Specifically, the invention may be applied
to any situation
where a large number of general purpose SRE resources can be displaced by a
similar
number of low cost VDD resources and a smaller number of SRE resources.
All of the forgoing changes and modifications may be made without departing
from
the sphere and scope of the invention as defined by the claims appended
herein.
6

Representative Drawing
A single figure which represents the drawing illustrating the invention.
Administrative Status

For a clearer understanding of the status of the application/patent presented on this page, the site Disclaimer , as well as the definitions for Patent , Administrative Status , Maintenance Fee  and Payment History  should be consulted.

Administrative Status

Title Date
Forecasted Issue Date 2005-11-15
(22) Filed 2002-08-19
Examination Requested 2002-08-19
(41) Open to Public Inspection 2004-02-19
(45) Issued 2005-11-15
Expired 2022-08-19

Abandonment History

There is no abandonment history.

Payment History

Fee Type Anniversary Year Due Date Amount Paid Paid Date
Request for Examination $400.00 2002-08-19
Registration of a document - section 124 $100.00 2002-08-19
Application Fee $300.00 2002-08-19
Maintenance Fee - Application - New Act 2 2004-08-19 $100.00 2004-08-16
Registration of a document - section 124 $100.00 2005-07-11
Registration of a document - section 124 $100.00 2005-07-18
Maintenance Fee - Application - New Act 3 2005-08-19 $100.00 2005-08-11
Final Fee $300.00 2005-09-01
Maintenance Fee - Patent - New Act 4 2006-08-21 $100.00 2006-07-05
Maintenance Fee - Patent - New Act 5 2007-08-20 $200.00 2007-07-06
Registration of a document - section 124 $100.00 2007-09-14
Registration of a document - section 124 $100.00 2007-09-14
Maintenance Fee - Patent - New Act 6 2008-08-19 $200.00 2008-07-10
Registration of a document - section 124 $100.00 2009-02-24
Maintenance Fee - Patent - New Act 7 2009-08-19 $200.00 2009-07-13
Registration of a document - section 124 $100.00 2010-01-14
Maintenance Fee - Patent - New Act 8 2010-08-19 $200.00 2010-07-15
Maintenance Fee - Patent - New Act 9 2011-08-19 $200.00 2011-07-12
Maintenance Fee - Patent - New Act 10 2012-08-20 $250.00 2012-07-16
Registration of a document - section 124 $100.00 2013-03-12
Registration of a document - section 124 $100.00 2013-03-12
Registration of a document - section 124 $100.00 2013-03-28
Registration of a document - section 124 $100.00 2013-03-28
Maintenance Fee - Patent - New Act 11 2013-08-19 $250.00 2013-07-11
Registration of a document - section 124 $100.00 2014-02-04
Registration of a document - section 124 $100.00 2014-02-04
Registration of a document - section 124 $100.00 2014-02-13
Maintenance Fee - Patent - New Act 12 2014-08-19 $250.00 2014-07-29
Registration of a document - section 124 $100.00 2015-05-04
Registration of a document - section 124 $100.00 2015-05-28
Maintenance Fee - Patent - New Act 13 2015-08-19 $250.00 2015-07-29
Maintenance Fee - Patent - New Act 14 2016-08-19 $250.00 2016-07-27
Registration of a document - section 124 $100.00 2017-03-10
Registration of a document - section 124 $100.00 2017-03-23
Maintenance Fee - Patent - New Act 15 2017-08-21 $450.00 2017-07-26
Maintenance Fee - Patent - New Act 16 2018-08-20 $450.00 2018-07-25
Registration of a document - section 124 $100.00 2018-12-03
Registration of a document - section 124 $100.00 2018-12-10
Registration of a document - section 124 $100.00 2018-12-10
Registration of a document - section 124 $100.00 2018-12-10
Registration of a document - section 124 $100.00 2018-12-10
Registration of a document - section 124 $100.00 2019-02-27
Maintenance Fee - Patent - New Act 17 2019-08-19 $450.00 2019-07-24
Maintenance Fee - Patent - New Act 18 2020-08-19 $450.00 2020-07-29
Maintenance Fee - Patent - New Act 19 2021-08-19 $459.00 2021-07-28
Registration of a document - section 124 $100.00 2022-10-19
Owners on Record

Note: Records showing the ownership history in alphabetical order.

Current Owners on Record
MITEL NETWORKS CORPORATION
Past Owners on Record
ERB, PAUL ANDREW
MITEL KNOWLEDGE CORPORATION
MITEL NETWORKS CORPORATION
MITEL NETWORKS ULC
MLN ACQUISITIONCO ULC
SCHULZ, DIETER
Past Owners that do not appear in the "Owners on Record" listing will appear in other documentation within the application.
Documents

To view selected files, please enter reCAPTCHA code :



To view images, click a link in the Document Description column. To download the documents, select one or more checkboxes in the first column and then click the "Download Selected in PDF format (Zip Archive)" or the "Download Selected as Single PDF" button.

List of published and non-published patent-specific documents on the CPD .

If you have any difficulty accessing content, you can call the Client Service Centre at 1-866-997-1936 or send them an e-mail at CIPO Client Service Centre.


Document
Description 
Date
(yyyy-mm-dd) 
Number of pages   Size of Image (KB) 
Representative Drawing 2003-01-13 1 9
Cover Page 2004-01-23 2 45
Cover Page 2005-10-26 1 41
Abstract 2002-08-19 1 21
Description 2002-08-19 6 255
Claims 2002-08-19 2 50
Drawings 2002-08-19 2 32
Assignment 2002-08-19 5 211
Prosecution-Amendment 2002-10-24 1 44
Prosecution-Amendment 2004-08-11 1 35
Fees 2004-08-16 1 61
Prosecution-Amendment 2004-05-03 2 55
Correspondence 2005-06-22 9 463
Correspondence 2005-07-19 1 13
Correspondence 2005-07-19 1 15
Correspondence 2005-07-13 9 524
Assignment 2005-07-11 70 4,393
Correspondence 2005-07-14 1 21
Assignment 2005-07-18 42 3,905
Correspondence 2005-09-01 1 36
Fees 2005-08-11 1 35
Assignment 2007-09-14 39 2,305
Assignment 2007-09-14 39 2,319
Assignment 2009-02-24 12 749
Assignment 2010-01-14 12 738
Assignment 2010-01-13 51 2,926
Assignment 2013-03-12 29 1,211
Assignment 2013-03-12 18 680
Assignment 2013-03-28 94 5,139
Assignment 2014-02-13 45 2,104
Assignment 2013-03-28 95 5,213
Assignment 2014-02-04 19 608
Assignment 2014-02-04 19 566
Assignment 2015-05-04 14 501
Assignment 2015-05-28 53 3,950