Language selection

Search

Patent 3153598 Summary

Third-party information liability

Some of the information on this Web page has been provided by external sources. The Government of Canada is not responsible for the accuracy, reliability or currency of the information supplied by external sources. Users wishing to rely upon this information should consult directly with the source of the information. Content provided by external sources is not subject to official languages, privacy and accessibility requirements.

Claims and Abstract availability

Any discrepancies in the text and image of the Claims and Abstract are due to differing posting times. Text of the Claims and Abstract are posted:

  • At the time the application is open to public inspection;
  • At the time of issue of the patent (grant).
(12) Patent Application: (11) CA 3153598
(54) English Title: METHOD OF AND DEVICE FOR PREDICTING VIDEO PLAYBACK INTEGRITY
(54) French Title: PROCEDE ET UN APPAREIL DE PREDICTION DE LA COMPLETUDE D'UNE LECTURE VIDEO
Status: Examination Requested
Bibliographic Data
(51) International Patent Classification (IPC):
  • G06F 16/732 (2019.01)
  • G06F 16/74 (2019.01)
  • G06F 16/783 (2019.01)
(72) Inventors :
  • XU, LIANGWU (China)
(73) Owners :
  • 10353744 CANADA LTD. (Canada)
(71) Applicants :
  • 10353744 CANADA LTD. (Canada)
(74) Agent: HINTON, JAMES W.
(74) Associate agent:
(45) Issued:
(86) PCT Filing Date: 2020-06-24
(87) Open to Public Inspection: 2021-03-11
Examination requested: 2022-09-16
Availability of licence: N/A
(25) Language of filing: English

Patent Cooperation Treaty (PCT): Yes
(86) PCT Filing Number: PCT/CN2020/097861
(87) International Publication Number: WO2021/042826
(85) National Entry: 2022-03-07

(30) Application Priority Data:
Application No. Country/Territory Date
201910845413.2 China 2019-09-05

Abstracts

English Abstract

A video playback completeness prediction method and apparatus, relating to the technical field of big data and deep learning. The method comprises: inputting data to be tested of a user's video playback feature vector (101); performing calculation by a preset video playback completeness prediction model (102); and outputting the video playback completeness value of said data (103), wherein the preset video playback completeness prediction model is obtained by means of training according to user's video playback training data, the user's video playback feature vector comprising at least a user feature vector and a video feature vector. According to the method, a playback completeness improvement strategy is introduced to predict user's video playback completeness, user's interest data closer to the reality is obtained in terms of viewing duration as an important information stream, and thus, the accuracy of identification of user's interest is improved, so as to improve the real relevance of recommendation, thereby greatly increasing user's viewing duration and degree of satisfaction.


French Abstract

L'invention concerne un procédé et un appareil de prédiction de la complétude d'une lecture vidéo, relatifs au domaine technique des mégadonnées et de l'apprentissage profond. Le procédé comprend les étapes suivantes : entrée des données à soumettre à essai d'un vecteur de caractéristiques de lecture vidéo d'un utilisateur (101) ; réalisation d'un calcul par un modèle prédéfini de prédiction de complétude d'une lecture vidéo (102) ; et délivrance en sortie de la valeur de complétude de la lecture vidéo desdites données (103). Le modèle de prédiction de complétude de lecture de vidéo prédéfini est obtenu au moyen d'un entraînement en fonction des données d'entraînement de lecture vidéo de l'utilisateur, et le vecteur de caractéristiques de lecture vidéo de l'utilisateur comprend au moins un vecteur de caractéristiques d'utilisateur et un vecteur de caractéristiques vidéo. Selon le procédé, une stratégie d'amélioration de la complétude de la lecture est adoptée en vue de prédire la complétude de la lecture vidéo de l'utilisateur, des données d'intérêt de l'utilisateur plus proches de la réalité sont obtenues en termes de durée de visualisation sous la forme d'un flux d'informations important et, de ce fait, la précision d'identification de l'intérêt de l'utilisateur est améliorée, de façon à améliorer la pertinence réelle de la recommandation, ce qui permet d'augmenter considérablement la durée de visualisation de l'utilisateur et le degré de satisfaction de l'utilisateur.

Claims

Note: Claims are shown in the official language in which they were submitted.


CLAIMS
What is claimed is:
1. A method of predicting video playback integrity, characterized in
comprising:
inputting to-be-tested data of a user video playback feature vector;
calculating through a preset video playback integrity prediction model; and
outputting a video playback integrity value of the to-be-tested data; wherein
the preset video playback integrity prediction model is obtained through
training by user
video playback training data, and the user video playback feature vector at
least includes a
user feature vector and a video feature vector.
2. The method according to Claim 1, characterized in further comprising:
collecting user video playback information data;
screening the user video playback information data, and obtaining a screening
result; and
performing a feature extraction on the screening result, and generating to-be-
tested data of
the user video playback feature vector.
3. The method according to Claim 2, characterized in that:
the step of collecting user video playback information data includes:
obtaining the user video
playback information data containing user information, user playback
historical information,
video information and user client side information; and/or that
the step of screening the user video playback information data, and obtaining
a screening
result includes: screening the user video playback information data by
employing a multi-
channel recalling mode including user collaboration, user searching, a topic
model, popular
recommendation, a user portrait and a video tag, and obtaining a screening
result; and/or that
the step of performing a feature extraction on the screening result, and
generating to-be-tested
data of the user video playback feature vector includes: performing word
segmentation on a
video title and a video classification tag in the screening result by
employing a word vector
23

obtained by training a preset massive corpus through a word2vec model and IDF
weight
training, generating a video word vector, thereafter performing word vector
calculation
according to the user playback historical information in conjunction with time
decay, and
generating a user word vector.
4. The method according to Claim 1, characterized in that the preset video
playback
integrity prediction model contains DNNs of three hidden layers.
5. The method according to Claim 4, characterized in that the preset video
playback
integrity prediction model is obtained through training by inputting the user
video playback
training data, wherein the user video playback training data is an independent
variable, while
a user watching history video playback integrity value is a dependent
variable, and the user
video playback training data is a feature vector combined by a historical user
vector and a
historical video vector created according to the user playback historical
information.
6. The method according to Claim 1, characterized in further comprising:
sorting video playback integrity values of the to-be-tested data in a
decreasing order,
obtaining topN video sorting results, and recommending the video sorting
results to a
corresponding user according to priority level, wherein N is an integer
greater than 1.
7. A device for predicting video playback integrity, characterized in that the
device
comprises a model calculating module, and that the model calculating module is
employed
for:
inputting to-be-tested data of a user video playback feature vector,
calculating through a
preset video playback integrity prediction model, and outputting a video
playback integrity
value of the to-be-tested data, wherein
the preset video playback integrity prediction model is obtained through
training by user
video playback training data, and the user video playback feature vector at
least includes a
user feature vector and a video feature vector.
24

8. The device according to Claim 7, characterized in further comprising a
data collecting
module, a data screening module, and a vector generating module, of which the
data
collecting module collects user video playback information data, the data
screening module
screens the user video playback information data, and obtains a screening
result, and the
vector generating module performs a feature extraction on the screening
result, and generates
to-be-tested data of the user video playback feature vector.
9. The device according to Claim 8, characterized in that:
the data collecting module obtains the user video playback information data
containing user
information, user playback historical information, video information and user
client side
information; and/or that
the data screening module screens the user video playback information data by
employing a
multi-channel recalling mode including user collaboration, user searching, a
topic model,
popular recommendation, a user poi tiait and a video tag, and obtains a
screening result;
and/or that
the vector generating module performs a feature extraction on the screening
result, and
generates to-be-tested data of the user video playback feature vector,
including: performing
word segmentation on a video title and a video classification tag in the
screening result by
employing a word vector obtained by training a preset massive corpus through a
word2vec
model and IDF weight training, generating a video word vector, thereafter
performing word
vector calculation according to the user playback historical information in
conjunction with
time decay, and generating a user word vector.
10. The device according to Claim 7, characterized in further comprising a
data
recommending module for sorting video playback integrity values of the to-be-
tested data in
a decreasing order, obtaining topN video sorting results, and recommending the
video sorting
results to a corresponding user according to priority level, wherein N is an
integer greater
than 1.

Description

Note: Descriptions are shown in the official language in which they were submitted.


CA 03153598 2022-03-07
METHOD OF AND DEVICE FOR PREDICTING VIDEO PLAYBACK INTEGRITY
BACKGROUND OF THE INVENTION
Technical Field
[0001] The present invention relates to the fields of big data and deep
learning technologies, and
more particularly to a method of and a device for predicting video playback
integrity.
Description of Related Art
[0002] The video recommendation system is created by researching user
interests and
predilections on the basis of large number of users and large quantities of
videos in
dependence of big data analysis and artificial intelligence technology, the
system
recommends user-interested, high-quality videos to target users, solves the
problem of
information overload, achieves the effect of customized services, and enhances
both time
duration of stay and satisfaction of users. the video recommendation system
usually
includes two phases of recalling and sorting, of which the recalling phase is
to select
certain candidate sets from massive videos, and the sorting phase is to
perform more
precise and unified calculation on the candidate sets selected in the
recalling phase, and
to screen out few videos of excellent quality that are most interesting to
users from the
candidate sets.
[0003] Currently, the number of users registered in some video playback
platforms reaches
hundred millions, with daily UV (UniqueVisitor) access exceeding ten millions,
and the
number of plays per day is even higher in mobile ends. In order that users
find out contents
of interest to them from massive videos, a recommendation system is created by

collecting data of plural dimensions (including basic information of users,
playback
histories of users, video attributes and environment attributes, etc.) to
associate the users
1
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
with potentially preferred videos. There is fewer information usable for the
recommendation of short videos, only such information as titles and video
categories is
usable for the purpose, and the currently frequently used sorting model
employs the
method of CTR (Click-Through-Rate) prediction. Clickbaits might be encouraged
on the
basis of the click model, and this cannot bring about longer time duration of
stay of users,
and would adversely affect the watching time duration and satisfaction of
users. However,
watching time duration is an important optimization target in the information
flow, so it
is urgently needed to introduce playback integrity optimization in the short
video sorting
model, so as to enhance reality relevancy of recommendations, and to achieve
enhancement in the watching time duration and satisfaction of users.
SUMMARY OF THE INVENTION
[0004] In order to address problems pending in the state of the art,
embodiments of the present
invention provide a method of and a device for predicting video playback
integrity, by
introducing a playback integrity improving policy to predict user video
playback integrity,
and acquiring interest data of the user more approaching to reality in such
important
information flow aspect as the watching time duration, the present invention
enhances the
precision in recognizing user interests, hence enhances reality relevancy of
recommendations, and achieves relatively great enhancement in the watching
time
duration and satisfaction of users.
[0005] The technical solutions are as follows:
[0006] According to one aspect, there is provided a method of predicting video
playback integrity,
and the method comprises:
[0007] inputting to-be-tested data of a user video playback feature vector;
[0008] calculating through a preset video playback integrity prediction model;
and
[0009] outputting a video playback integrity value of the to-be-tested data;
wherein
2
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0010] the preset video playback integrity prediction model is obtained
through training by user
video playback training data, and the user video playback feature vector at
least includes
a user feature vector and a video feature vector.
[0011] Moreover, the method further comprises:
[0012] collecting user video playback information data;
[0013] screening the user video playback information data, and obtaining a
screening result; and
[0014] performing a feature extraction on the screening result, and generating
to-be-tested data
of the user video playback feature vector.
[0015] Further, the step of collecting user video playback information data
includes: obtaining
the user video playback information data containing user information, user
playback
historical information, video information and user client side information;
and/or
[0016] the step of screening the user video playback information data, and
obtaining a screening
result includes: screening the user video playback information data by
employing a multi-
channel recalling mode including user collaboration, user searching, a topic
model,
popular recommendation, a user portrait and a video tag, and obtaining a
screening result;
and/or
[0017] the step of performing a feature extraction on the screening result,
and generating to-be-
tested data of the user video playback feature vector includes: performing
word
segmentation on a video title and a video classification tag in the screening
result by
employing a word vector obtained by training a preset massive corpus through a

word2vec model and IDF weight training, generating a video word vector,
thereafter
performing word vector calculation according to the user playback historical
information
in conjunction with time decay, and generating a user word vector.
[0018] Further, the preset video playback integrity prediction model contains
DNNs of three
hidden layers.
3
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0019] Further, the preset video playback integrity prediction model is
obtained through training
by inputting the user video playback training data, wherein the user video
playback
training data is an independent variable, while a user watching history video
playback
integrity value is a dependent variable, and the user video playback training
data is a
feature vector combined by a historical user vector and a historical video
vector created
according to the user playback historical information.
[0020] Moreover, the method further comprises:
[0021] sorting video playback integrity values of the to-be-tested data in a
decreasing order,
obtaining topN video sorting results, and recommending the video sorting
results to a
corresponding user according to priority level, wherein N is an integer
greater than 1.
[0022] According to another aspect, there is provided a device for predicting
video playback
integrity, the device comprises a model calculating module, and the model
calculating
module is employed for:
[0023] inputting to-be-tested data of a user video playback feature vector,
calculating through a
preset video playback integrity prediction model, and outputting a video
playback
integrity value of the to-be-tested data, wherein the preset video playback
integrity
prediction model is obtained through training by user video playback training
data, and
the user video playback feature vector at least includes a user feature vector
and a video
feature vector.
[0024] Moreover, the device further comprises a data collecting module, a data
screening module,
and a vector generating module, of which the data collecting module collects
user video
playback information data, the data screening module screens the user video
playback
information data, and obtains a screening result, and the vector generating
module
performs a feature extraction on the screening result, and generates to-be-
tested data of
the user video playback feature vector.
4
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0025] Further, the data collecting module obtains the user video playback
information data
containing user information, user playback historical information, video
information and
user client side information; and/or
[0026] the data screening module screens the user video playback information
data by employing
a multi-channel recalling mode including user collaboration, user searching, a
topic
model, popular recommendation, a user poi _________________________________ ti
ait and a video tag, and obtains a screening
result; and/or
[0027] the vector generating module performs a feature extraction on the
screening result, and
generates to-be-tested data of the user video playback feature vector,
including:
performing word segmentation on a video title and a video classification tag
in the
screening result by employing a word vector obtained by training a preset
massive corpus
through a word2vec model and IDF weight training, generating a video word
vector,
thereafter performing word vector calculation according to the user playback
historical
information in conjunction with time decay, and generating a user word vector.
[0028] Moreover, the device further comprises a data recommending module for
sorting video
playback integrity values of the to-be-tested data in a decreasing order,
obtaining topN
video sorting results, and recommending the video sorting results to a
corresponding user
according to priority level, wherein N is an integer greater than 1.
[0029] The technical solutions provided by the embodiments of the present
invention bring about
the following advantageous effects.
[0030] 1. By modifying the traditional CTR prediction method altogether, a
video playback
integrity indicator is introduced, video playback integrities of different
users are predicted
through a well trained preset video playback integrity prediction model,
interest data of
users more approaching to reality is obtained in such important information
flow aspect
as the watching time duration through prediction results of the video playback
integrities,
the precision in recognizing user interests is enhanced, reality relevancy of
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
recommendations is hence enhanced, and relatively great enhancement is
achieved in the
watching time duration and satisfaction of users.
[0031] 2. Through vectorized representation of the user portrait, interest
transfer of the user is
reflected in combination with time decay of user behaviors, and hotspot videos
and
inadvertently clicked videos are filtered out in the process of user poll" __
ait, whereby
interference with actual interest of the user is avoided, and the user poi __
tiait is made more
precise.
[0032] 3. By collecting such relevant data as user behavior data, video
quality and video
information, etc., vectorized representation of user features and video
attributes is
effectively made, as well as proportions of videos played back at various time
periods,
proportions of various categories and other environment information, different
features
and different data sources are merged in the application of short video
recommendation
sorting model through deep learning modeling and prediction of potential
playback
integrities of videos not watched by users, excellent effect is achieved, and
average
watching time duration of users is enhanced.
[0033] 4. By creating such features as user features, video features,
contextual features and client
side classification, deep learning modeling is employed, the playback
integrity prediction
mode is applied in a group of 10% randomly selected users through the AB test,
and such
indicators as CTR, daily average playback volume and user average playback
integrity
are compared through the final report. In the end, user average playback
integrity and
daily average playback volume are enhanced to a greater extent with slight
decrease in
the CTR.
[0034] 5. A TF-IDF algorithm is employed in terms of video recommendation, and
key
information of videos is effectively emphasized through IDF values.
6
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0035] 6. Reality relevancy of recommendations is enhanced through prediction
of short video
playback integrities, and increase in the time duration of stay of users is
attempted.
BRIEF DESCRIPTION OF THE DRAWINGS
[0036] To more clearly describe the technical solutions in the embodiments of
the present
invention, drawings required to illustrate the embodiments are briefly
introduced below.
Apparently, the drawings introduced below are merely directed to some
embodiments of
the present invention, while persons ordinarily skilled in the art may further
acquire other
drawings on the basis of these drawings without spending creative effort in
the process.
[0037] Fig. 1 is a flowchart illustrating the method of predicting video
playback integrity
provided by an embodiment of the present invention;
[0038] Fig. 2 is a flowchart illustrating the method of predicting video
playback integrity
provided by another embodiment of the present invention;
[0039] Fig. 3 is a view schematically illustrating a preferred mode of
execution of feature
engineering construction in Step 203;
[0040] Fig. 4 is a view schematically illustrating a preferred mode of
execution of a preset video
playback integrity prediction model provided by an embodiment of the present
invention;
[0041] Fig. 5 is a view schematically illustrating the structure of a device
for predicting video
playback integrity provided by an embodiment of the present invention; and
[0042] Fig. 6 is a view schematically illustrating the structure of a device
for predicting video
playback integrity provided by another embodiment of the present invention.
7
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
DETAILED DESCRIPTION OF THE INVENTION
[0043] To make more lucid and clear the objectives, technical solutions and
advantages of the
present invention, the technical solutions in the embodiments of the present
invention will
be clearly and comprehensively described below with reference to the
accompanying
drawings in the embodiments of the present invention. Apparently, the
embodiments as
described are merely partial, rather than the entire, embodiments of the
present invention.
Any other embodiments makeable by persons ordinarily skilled in the art on the
basis of
the embodiments in the present invention without creative effort shall all
fall within the
protection scope of the present invention. As should be noted, the wordings
"plural/more/a plurality of' mean "two and more" in the description of the
present
invention, unless otherwise definitely and specifically defined.
[0044] In the method of and device for predicting video playback integrity
provided by the
embodiments of the present invention, by modifying the traditional CTR
prediction
method altogether, a video playback integrity indicator is introduced, video
playback
integrities of different users are predicted through a well trained preset
video playback
integrity prediction model, interest data of users more approaching to reality
is obtained
in such important information flow aspect as the watching time duration
through
prediction results of the video playback integrities, the precision in
recognizing user
interests is enhanced, reality relevancy of recommendations is hence enhanced,
and
relatively great enhancement is achieved in the watching time duration and
satisfaction
of users. Accordingly, the method of and device for predicting video playback
integrity
are widely applicable to many network video application scenarios concerning
user
interests mining, user requirements matching or user recommendations.
[0045] Specific embodiments and accompanying drawings are combined below to
describe in
detail the method of and device for predicting video playback integrity
provided by the
embodiments of the present invention.
8
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0046] Fig. 1 is a flowchart illustrating the method of predicting video
playback integrity
provided by an embodiment of the present invention. As shown in Fig. 1, the
method of
predicting video playback integrity comprises the following steps:
[0047] 101 - inputting to-be-tested data of a user video playback feature
vector;
[0048] 102 - calculating through a preset video playback integrity prediction
model; and
[0049] 103 - outputting a video playback integrity value of the to-be-tested
data.
[0050] Different from the traditional user technology in which only few such
collected
information as titles, video categories or click-through-rate is used, the
user video
playback feature vector here at least includes user feature vectors and video
feature
vectors, the user feature includes user poi _______________________________ ti
aits, user historical playback records or other
information relevant to users, and the video information includes video
categories, video
time durations, video times, video playback integrity records or other
information
relevant to releasing videos. Besides user feature vectors and video feature
vectors, the
user video playback feature vector can further include such other information
relevant to
video playback as user client side classification information. In addition,
the preset video
playback integrity prediction model is obtained through training by user video
playback
training data, and the video playback integrity prediction model as
specifically used can
be either obtained by training a corresponding deep learning model designed
and
constructed according to requirements, or obtained by training any possible
deep learning
model available in the art, to which no particular restriction is made in the
embodiments
of the present invention.
[0051] Fig. 2 is a flowchart illustrating the method of predicting video
playback integrity
provided by another embodiment of the present invention. As shown in Fig. 2,
the method
of predicting video playback integrity comprises the following steps.
[0052] 201 - collecting user video playback information data.
9
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0053] Specifically, the user video playback information data containing user
information, user
playback historical information, video information and user client side
information is
obtained.
[0054] This process is a phase for collecting user video playback information
data, the user video
playback information mainly includes user information, user playback
historical
information, video information and user client side information, of which the
user
information mainly indicates user poll" ___________________________________
ait information, including basic attribute
information of a user (gender, age, etc.), the user playback historical
information includes
proportions of historical playbacks on a hourly basis by the user, and
proportions of
various types of videos watched by the user, etc., and the client side
information includes
user equipment types and operator types, etc. In addition to the above, such
contextual
information secondarily associated with videos played back by the user as the
time at
which the user watches each video and the user position information, etc., can
be further
collected for the user video playback information according to requirements.
[0055] As is notable, the process of collecting user video playback
information data in Step 201
can as well be realized by modes other than the mode recited in the
aforementioned step,
and these specific modes are not restricted in the embodiments of the present
invention.
[0056] 202 - screening the user video playback information data, and obtaining
a screening result.
[0057] Specifically, the step of screening the user video playback information
data, and obtaining
a screening result includes: screening the user video playback information
data by
employing a multi-channel recalling mode including user collaboration, user
searching,
a topic model, popular recommendation, a user portrait and a video tag, and
obtaining a
screening result.
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0058] This process is a phase for recalling coarsely screened user video
playback information
data, preferably, the screening is mainly directed to the video information in
the user
video playback information data. Since the video is colossal in scale,
possibly reaching
the order of several millions, direct input of the video into the model for
data
preprocessing would require extremely high cost, and the speed would also be
extremely
slow, so it is possible to coarsely screen out some video information with
higher quality
or in other words possibly more to the taste of users in the recalling phase.
Recalling is
usually embodied as multi-channel recalling, such as through user
collaboration, user
searching, topic models, popular recommendations, user portraits and video
tags, so as to
select certain desirable candidate sets from the massive amount of video.
[0059] As is notable, the process of screening the user video playback
information data in Step
202 can as well be realized by modes other than the mode recited in the
aforementioned
step, and these specific modes are not restricted in the embodiments of the
present
invention.
[0060] 203 - performing a feature extraction on the screening result, and
generating to-be-tested
data of the user video playback feature vector.
[0061] Specifically, the step of performing a feature extraction on the
screening result, and
generating to-be-tested data of the user video playback feature vector
includes:
performing word segmentation on a video title and a video classification tag
in the
screening result by employing a word vector obtained by training a preset
massive corpus
through a word2vec model and IDF weight training, generating a video word
vector,
thereafter performing word vector calculation according to the user playback
historical
information in conjunction with time decay, and generating a user word vector.
The user
word vector and the video word vector here correspond to the aforementioned
user feature
vector and video feature vector.
11
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0062] This process is a feature engineering phase, as shown in Fig. 3,
preferably, a word vector
with 200 dimensions per word is trained through word segmentation and a
word2vec
model on the massive corpus, potential meanings of words are characterized in
a
vectorized form, so as to express the relations between words, and word vector

representation of the video is calculated and obtained through a combination
of word
segmentation process of the video title with the information of IDF obtained
by training.
Word vector representation of the user is calculated according to the word
vector
representation of the user historical playback video in conjunction with time
decay; in the
process of calculating user vector, videos with top 3 tags of the user with a
proportion
exceeding 10% are counted according to video tag categories. As found
according to
playback history analysis of the user, videos to which lower proportional
video tags
correspond are not latent interest points of the user, playback of them is
usually because
they are hotspot videos or due to inadvertent clicking by the user, and they
can be
discarded through feature extraction.
[0063] As is notable, the process of performing a feature extraction on the
screening result, and
generating to-be-tested data of the user video playback feature vector in Step
203 can as
well be realized by modes other than the mode recited in the aforementioned
step, and
these specific modes are not restricted in the embodiments of the present
invention.
[0064] 204 - inputting to-be-tested data of a user video playback feature
vector.
[0065] The preset video playback integrity prediction model is obtained
through training by
inputting the user video playback training data, wherein the user video
playback training
data is an independent variable, while a user watching history video playback
integrity
value is a dependent variable, and the user video playback training data is a
feature vector
combined by a historical user vector and a historical video vector created
according to
the user playback historical information, for training to obtain a desirable
preset video
playback integrity prediction model.
12
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0066] Preferably, the preset video playback integrity prediction model
contains DNNs of three
hidden layers, and input information of the input layer includes word vector
representation of the user (various video word vectors are obtained by a
combination of
word segmentation of the user historical playback video with IDF weight
calculation, and
a word vector with 200 dimensions is subsequently calculated and obtained in
overall
consideration of time decay), basic portrait of the user (gender, age, etc.),
proportions of
videos played back at various time periods (on an hourly basis), and
proportions of
various categories of videos, etc.; word vector (200 dimensions) of the video,
quality of
the video (average playback integrity, video hits, etc.), releasing time of
the video, video
category, equipment type, operator type; region; current time period, etc.
[0067] As is notable, the data content and form of inputting to-be-tested data
of a user video
playback feature vector in Step 204 can as well be realized by modes other
than the mode
recited in the aforementioned step, and these specific modes are not
restricted in the
embodiments of the present invention.
[0068] 205 ¨ calculating through a preset video playback integrity prediction
model.
[0069] 206 - outputting a video playback integrity value of the to-be-tested
data.
[0070] Preferably, the following steps are further included after Step 206:
[0071] sorting video playback integrity values of the to-be-tested data in a
decreasing order,
obtaining topN video sorting results, and recommending the video sorting
results to a
corresponding user according to priority level, wherein N is an integer
greater than 1. As
should be noted, it is also possible to base on requirements to arrange the
step of sorting
video playback integrity values in the calculating process of the preset video
playback
integrity prediction model, as shown in Fig. 4, to which no particular
restriction is made
in the embodiments of the present invention.
13
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0072] Fig. 5 is a view schematically illustrating the structure of a device
for predicting video
playback integrity provided by an embodiment of the present invention. As
shown in Fig.
5, the device for predicting video playback integrity comprises a model
calculating
module 1, and the model calculating module 1 is employed for: inputting to-be-
tested
data of a user video playback feature vector, calculating through a preset
video playback
integrity prediction model, and outputting a video playback integrity value of
the to-be-
tested data, wherein the preset video playback integrity prediction model is
obtained
through training by user video playback training data, and the user video
playback feature
vector at least includes a user feature vector and a video feature vector.
[0073] Fig. 6 is a view schematically illustrating the structure of a device
for predicting video
playback integrity provided by another embodiment of the present invention. As
shown
in Fig. 6, the device 2 for predicting video playback integrity comprises a
data collecting
module 21, a data screening module 22, a vector generating module 23, a model
calculating module 24 and a data recommending module 25.
[0074] The data collecting module 21 collects user video playback information
data. Specifically,
the data collecting module 21 obtains user video playback information data
containing
user information, user playback historical information, video information and
user client
side information.
[0075] The data screening module 22 screens the user video playback
information data, and
obtains a screening result. Specifically, the data screening module 22 screens
the user
video playback information data by employing a multi-channel recalling mode
including
user collaboration, user searching, a topic model, popular recommendation, a
user poi Li ait
and a video tag, and obtains a screening result.
[0076] The vector generating module 23 performs a feature extraction on the
screening result,
14
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
and generates the user video playback feature vector. Specifically, the vector
generating
module 23 performs a feature extraction on the screening result, and generates
to-be-
tested data of the user video playback feature vector, including: performing
word
segmentation on a video title and a video classification tag in the screening
result by
employing a word vector obtained by training a preset massive corpus through a

word2vec model and IDF weight training, generating a video word vector,
thereafter
performing word vector calculation according to the user playback historical
information
in conjunction with time decay, and generating a user word vector. The user
word vector
and the video word vector here correspond to the following user feature vector
and video
feature vector.
[0077] The model calculating module 24 inputs to-be-tested data of a user
video playback feature
vector, calculates through a preset video playback integrity prediction model,
and outputs
a video playback integrity value of the to-be-tested data, wherein the preset
video
playback integrity prediction model is obtained through training by user video
playback
training data, and the user video playback feature vector at least includes a
user feature
vector and a video feature vector.
[0078] The data recommending module 25 sorts video playback integrity values
of the to-be-
tested data in a decreasing order, obtains topN video sorting results, and
recommends the
video sorting results to a corresponding user according to priority level,
wherein N is an
integer greater than 1.
[0079] A preferred mode of execution for the method of and device for
predicting video playback
integrity provided by the embodiments of the present invention is introduced
below.
[0080] Firstly, the word segmentation tool of the present embodiment carries a
lexicon therewith,
entertainment stars, film and TV drama names, sports stars and team
information are
additionally added as supplementary lexicon, Netease news, Baidu encyclopedia
and
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
Wikipedia obtained by a crawler system constitute a massive corpus, and word
segmentation and word vector training are performed with respect to the corpus
to finally
obtain word vector representation of each word (dimensions of the word vectors
are 200
dimensions, as determined by test effect, and normalization is then performed
on the
vectors).
[0081] Under the aforementioned corpus, TF-IDF training is carried out to
obtain IDF values,
normalization is then performed, weight enhancement is subsequently performed
on the
supplementary lexicon as 1, similar to the attention mechanism, more attention
is paid to
these words.
[0082] See the following Table 1 for video information, in which are carried
video IDs, video
title information, classification tags, video tag information, and releasing
times, etc. The
video information is word-segmented, a word vector table of words is searched,
and the
word vector representation of the current video is obtained in combination
with weighted
calculation of an IDF value table (performing normalization).
[0083] Table 1 Video Information
Table
id title cata name tag names release time
3714869 Wu lei has athletics athletics, 2019-08-01
become the European 10:07:32
Mr. Key, Championship,
winning odds Wu lei, sports
show Espanyol lottery, lottery
will bring information
surprise in the station, sports
new season lottery video
collection
[0084] The phase of obtaining user poi Li ait is a process of calculating
user word vector, and the
target user group directed is a group of active users, namely recently active
users (having
16
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
playback records within 7 days lately) with certain volume of playbacks
(exceeding 10
videos) within a period lately (such as the recent 30 days). Word vector
calculation of the
users is particularized according to tag categories, for instance, a user
played 100 videos
within a period lately, of which there are 60 relevant to athletics, 20
relevant to finance
and economics, 15 relevant to amusement, 4 relevant to the society, and 1
relevant to
health; during the process of user poll" __ ait, a poi ____________________ Li
ait is drawn for the user under tag
categories ranking top 3 in the proportions and with the proportions each
exceeding 10%,
through which method it is made possible to obtain the main interest points of
the user,
and to remove few operations by inadvertent clicking and hotspot videos that
cannot
represent the interest points of the user. In this example, athletics occupies
60%, finance
and economics occupy 20%, amusement occupies 15%, society occupies 4%, and
health
occupies 1%, so the portrait is drawn for the user in terms of the three
dimensions of
athletics, finance and economics, and amusement with respect to the current
user, and
word vector representations of the corresponding dimensions of the user are
calculated.
[0085] During the process of calculating user word vectors under different tag
categories of the
user, the time decay factor (for instance, the decay period is 5 days, the
decay coefficient
is 0.95, taking for example a video played back 12 days before the current
date, the video
crosses two decay periods, and should be decayed by 0.95^2) is combined to
calculate
the word vector representations of the user.
[0086] In the feature engineering constructing phase, taken into consideration
are the user word
vector (200 dimensions), video word vector (200 dimensions), proportion of a
category
watched by the user, proportions of historical playbacks of the user on a
hourly basis, user
gender, user age (divided according to the groups of over 20, 2030,¨ 30-
40, 40-50, and
over 50, and on-hot coded), current video classification tag, video time
duration (unit:
second), video releasing time (number of days from the current time), video
average
playback integrity (average playback integrity of videos played by the user
within the
recent 24 hours), hits level (divided into five levels according to the number
of playback
17
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
times, and one-hot coded), the time of the video watched by the user (which
day of the
week, the current time period, one-hot coded), position information (one-hot
coded
according to the Province), terminal type (one-hot coded), and operator type
(one-hot
coded).
[0087] The above features are constructed according to playback records of the
user within a
period lately (such as the recent 30 days), and the deep learning model is
trained in
combination with the video playback integrity of the user.
[0088] The possible playback integrity of videos not played back by the target
user is predicted
through the model with respect to the result set recommended to the user in
the recalling
phase, and inversion is carried out according to the playback integrity to
generate the final
recommended result set.
[0089] As should be noted, when the device for predicting video playback
integrity provided by
this embodiment performs a video playback integrity predicting business, the
division
into the aforementioned various functional modules is merely by way of
example, while
it is possible, in actual application, to base on requirements to assign the
functions to
different functional modules for completion, that is to say, to divide the
internal structure
of the device into different functional modules to complete the entire or
partial functions
described above. In addition, the device for predicting video playback
integrity provided
by this embodiment pertains to the same conception as the method of predicting
video
playback integrity provided by the method embodiment ¨ see the corresponding
method
embodiment for its specific realization process, while no repetition will be
made in this
context.
[0090] All the aforementioned optional technical solutions are randomly
combinable to form
optional embodiments of the present invention, to which no repetition is made
on a one-
by-one basis.
18
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0091] To sum it up, in comparison with prior-art technology, the method of
and device for
predicting video playback integrity provided by the embodiments of the present
invention
achieve the following advantageous effects.
[0092] 1. By modifying the traditional CTR prediction method altogether, a
video playback
integrity indicator is introduced, video playback integrities of different
users are predicted
through a well trained preset video playback integrity prediction model,
interest data of
users more approaching to reality is obtained in such important information
flow aspect
as the watching time duration through prediction results of the video playback
integrities,
the precision in recognizing user interests is enhanced, reality relevancy of
recommendations is hence enhanced, and relatively great enhancement is
achieved in the
watching time duration and satisfaction of users.
[0093] 2. Through vectorized representation of the user portrait, interest
transfer of the user is
reflected in combination with time decay of user behaviors, and hotspot videos
and
inadvertently clicked videos are filtered out in the process of user poll" __
ait, whereby
interference with actual interest of the user is avoided, and the user poi __
hait is made more
precise.
[0094] 3. By collecting such relevant data as user behavior data, video
quality and video
information, etc., vectorized representation of user features and video
attributes is
effectively made, as well as proportions of videos played back at various time
periods,
proportions of various categories and other environment information, different
features
and different data sources are merged in the application of short video
recommendation
sorting model through deep learning modeling and prediction of potential
playback
integrities of videos not watched by users, excellent effect is achieved, and
average
watching time duration of users is enhanced.
19
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0095] 4. By creating such features as user features, video features,
contextual features and client
side classification, deep learning modeling is employed, the playback
integrity prediction
mode is applied in a group of 10% randomly selected users through the AB test,
and such
indicators as CTR, daily average playback volume and user average playback
integrity
are compared through the final report. In the end, user average playback
integrity and
daily average playback volume are enhanced to a greater extent with slight
decrease in
the CTR.
[0096] 5. A TF-IDF algorithm is employed in terms of video recommendation, and
key
information of videos is effectively emphasized through IDF values.
[0097] 6. Reality relevancy of recommendations is enhanced through prediction
of short video
playback integrities, and increase in the time duration of stay of users is
attempted.
[0098] As understandable by persons ordinarily skilled in the art, realization
of the entire or
partial steps of the aforementioned embodiments can be completed by hardware,
or by a
program instructing relevant hardware, the program can be stored in a computer-
readable
storage medium, and the storage medium can be a read-only memory, a magnetic
disk, or
an optical disk, etc.
[0099] The embodiments of the present application are described with reference
to flowcharts
and/or block diagrams of the method, device (system), and computer program
product
embodied in the embodiments of the present application. As should be
understood, each
flow and/or block in the flowcharts and/or block diagrams, and any combination
of flow
and/or block in the flowcharts and/or block diagrams can be realized by
computer
program instructions. These computer program instructions can be supplied to a
general
computer, a dedicated computer, an embedded processor or a processor of any
other
programmable data processing device to form a machine, so that the
instructions executed
by the computer or the processor of any other programmable data processing
device
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
generate a device for realizing the functions designated in one or more
flow(s) of the
flowcharts and/or one or more block(s) of the block diagrams.
[0100] These computer program instructions can also be stored in a computer-
readable memory
enabling a computer or any other programmable data processing device to
operate by a
specific mode, so that the instructions stored in the computer-readable memory
generate
a product containing instructing means, and this instructing means realizes
the functions
designated in one or more flow(s) of the flowcharts and/or one or more
block(s) of the
block diagrams.
[0101] These computer program instructions can also be loaded onto a computer
or any other
programmable data processing device, so as to execute a series of operations
and steps
on the computer or the any other programmable device to generate computer-
realized
processing, so that the instructions executed on the computer or the any other

programmable device provide steps for realizing the functions designated in
one or more
flow(s) of the flowcharts and/or one or more block(s) of the block diagrams.
[0102] Although preferred embodiments in the embodiments of the present
application have
been described so far, it is still possible for persons skilled in the art to
make additional
modifications and amendments to these embodiments upon learning of the basic
inventive
conception. Accordingly, the attached Claims are meant to cover the preferred
embodiments and all modifications and amendments that fall within the scope of
the
embodiments of the present application.
[0103] Apparently, persons skilled in the art can make various amendments and
modifications to
the present invention without departing from the spirit and scope of the
present invention.
Thus, if such amendments and modifications to the present invention fall
within the
Claims of the present invention and equivalent technology, the present
invention is also
meant to cover these amendments and modifications.
21
Date Recue/Date Received 2022-03-07

CA 03153598 2022-03-07
[0104] What is described above is merely directed to preferred embodiments of
the present
invention, and they are not meant to restrict the present invention. Any
amendment,
equivalent replacement and improvement makeable within the spirit and scope of
the
present invention shall all be covered within the protection scope of the
present invention.
22
Date Recue/Date Received 2022-03-07

Representative Drawing
A single figure which represents the drawing illustrating the invention.
Administrative Status

For a clearer understanding of the status of the application/patent presented on this page, the site Disclaimer , as well as the definitions for Patent , Administrative Status , Maintenance Fee  and Payment History  should be consulted.

Administrative Status

Title Date
Forecasted Issue Date Unavailable
(86) PCT Filing Date 2020-06-24
(87) PCT Publication Date 2021-03-11
(85) National Entry 2022-03-07
Examination Requested 2022-09-16

Abandonment History

There is no abandonment history.

Maintenance Fee

Last Payment of $100.00 was received on 2023-12-15


 Upcoming maintenance fee amounts

Description Date Amount
Next Payment if small entity fee 2025-06-25 $100.00
Next Payment if standard fee 2025-06-25 $277.00

Note : If the full payment has not been received on or before the date indicated, a further fee may be required which may be one of the following

  • the reinstatement fee;
  • the late payment fee; or
  • additional fee to reverse deemed expiry.

Patent fees are adjusted on the 1st of January every year. The amounts above are the current amounts if received by December 31 of the current year.
Please refer to the CIPO Patent Fees web page to see all current fee amounts.

Payment History

Fee Type Anniversary Year Due Date Amount Paid Paid Date
Application Fee 2022-03-07 $407.18 2022-03-07
Maintenance Fee - Application - New Act 2 2022-06-27 $100.00 2022-03-07
Request for Examination 2024-06-25 $814.37 2022-09-16
Maintenance Fee - Application - New Act 3 2023-06-27 $100.00 2022-12-15
Advance an application for a patent out of its routine order 2023-05-08 $526.29 2023-05-08
Maintenance Fee - Application - New Act 4 2024-06-25 $100.00 2023-12-15
Owners on Record

Note: Records showing the ownership history in alphabetical order.

Current Owners on Record
10353744 CANADA LTD.
Past Owners on Record
None
Past Owners that do not appear in the "Owners on Record" listing will appear in other documentation within the application.
Documents

To view selected files, please enter reCAPTCHA code :



To view images, click a link in the Document Description column. To download the documents, select one or more checkboxes in the first column and then click the "Download Selected in PDF format (Zip Archive)" or the "Download Selected as Single PDF" button.

List of published and non-published patent-specific documents on the CPD .

If you have any difficulty accessing content, you can call the Client Service Centre at 1-866-997-1936 or send them an e-mail at CIPO Client Service Centre.


Document
Description 
Date
(yyyy-mm-dd) 
Number of pages   Size of Image (KB) 
Abstract 2022-03-07 1 25
Claims 2022-03-07 3 139
Drawings 2022-03-07 4 297
Description 2022-03-07 22 983
International Search Report 2022-03-07 4 148
Amendment - Abstract 2022-03-07 2 91
National Entry Request 2022-03-07 14 1,312
Representative Drawing 2022-06-03 1 20
Cover Page 2022-06-03 1 56
Request for Examination 2022-09-16 8 296
Correspondence for the PAPS 2022-12-23 4 149
Special Order / Amendment 2023-05-08 24 973
Early Lay-Open Request 2023-05-08 6 193
Claims 2023-05-08 18 1,082
Acknowledgement of Grant of Special Order 2023-06-02 1 175
Examiner Requisition 2023-12-20 5 278
Amendment 2024-04-22 47 2,471
Claims 2024-04-22 17 1,031
Examiner Requisition 2023-06-12 7 322
Amendment 2023-10-12 49 2,547
Claims 2023-10-12 18 1,062
Drawings 2023-10-12 4 169