Professional Documents
Culture Documents
Abstract-Personalized recommendation is an effective method semantic information, so these models can't accurately
to resolve the current problem of Internet information describe the users' interests [4-6]. Ontology is used to depict
overload. In the recommendation systems, user modeling is a the domain knowledge, provides the common understanding
crucial step. Whether the model can accurately describe the of the knowledge about one area, defines the common
users' interests directly determines the quality of the cognitive vocabulary, and gives the clear definition on
personalized recommendations. At present in most
different domain terms. This paper presents a new ontology
based user modeling method which uses ontology concept
personalized service systems keywords models or user-item
models are used to describe the users' preferences, but vectors
hierarchy tree to represent the users' interests, and we use the
or matrixes used in these models do not contain semantic
reasoning and extension technique of the ontology to mine
information, so it is difficult to accurately model the users'
the users' potential interests. Experiment results show that
interests and hobbies, and it is also hard to extend the users'
this method can more accurately describe the users' interests.
interests. Ontology as a tool used to describe the domain
knowledge is very powerful in conceptual describing and
In the recommendation systems similarity measure plays
logical reasoning. Computation of the neighbor set of users or an important role, which is the base procedure for finding out
resources is also an important step in the recommendation, but the neighbor set of users or resources. At present three
at present three commonly used similarity algorithms have commonly used similarity algorithms are: cosine-based
some shortcomings which lead the system sometimes difficulty similarity, correlation-based similarity and adjusted-cosine
to find similar users or resources. This paper presents a new similarity [7-9]. In this paper, we briefly mention the
ontology-based user modeling approach and an improved inherent drawbacks of the above three similarity algorithms
similarity algorithm. Our experiments show that the user and present an improved similarity algorithm, which can
model presented in this paper can effectively describe the effectively overcome these drawbacks.
users' personalized preferences, and we also prove that the The rest of the paper is organized as follows. In section 2,
improved similarity algorithm is better than other three domain ontology building approach is proposed. In section 3,
commonly used similarity algorithms. we show the ontology-based user modeling method. Section
4 presents an improved similarity algorithm called Simi-New.
Keywords-personalized recommendation; ontology; semantic
Experimental results are provided in section 5. Section 6
reasoning; user modeling; similarity measure
states the conclusion of this paper.
363
Layer
Given the access score vector on leaf nodes
One
v' {v \, V'2,···, V'I}, we use V'i
= to represent how much
Layer the user likes the i(1 � i � t) -th concept leaf node, and use
Two
the variable t denotes the number of leaf nodes in ontology
Layer
concept hierarchy tree. The variable V'i can be calculated as:
Three
modeling method. The following three steps take place to path Pi' b(ni(x+l)) means the number of ni(x+l) 's brother
build the user models: firstly, we analyze the web server logs
in the whole tree, and a is a reasoning factor which is
to obtain the users' access scores on the leaf nodes of
ascertained in applications(the parameter a in this paper is
ontology concept hierarchy tree, secondly, we make use of
ontology reasoning technology and access scores on leaf equal to 1.8). We can compute for all paths according to the
nodes in the ontology tree to get the access scores on non same way. The score of the node nx the user get is given by
leaf nodes, finally we merge the access score vector on leaf
nodes and score vector on non-leaf nodes to build the I
ontology-based user model denoted by V {vI' v2,", vJ. s(nJ L s(niJ (3)
i=l
=
=
how much the user likes the i -th concept node. The variable After that we can get the access score vector on non-leaf
s denotes the number of the nodes in ontology concept
nodes denoted by v" {v'\, V"2"'" v"r}.
hierarchy tree. We will elaborate on how to build the leaf
=
nodes score vector and non-leaf nodes score vector in the So far, we have obtained score vector V on leaf nodes I
following content. and vector v" on non-leaf nodes. After that, we can
combine vector V with v" to generate the ontology-based
I
364
user model denoted by corresponding resource average from each co-rated pair.
V = { VI'
" v2' , VI' ' V "I' V ''2" '" V " r } ,Wh'lCh can aIso be
Formally, the similarity between user i and j using this
scheme is given by
. . •
(7)
Where " . II denotes the dot-product of the two vectors.
In the correlation-based similarity algorithm, similarity
between two users i and j is measured by computing the
The closeness of the scores on the common resources is
Pearson - r correlation carr.t,J' . To make the correlation defined as:
both two users i and j, and Ri,u is the score that the user
365
Here Ri is the score vector rated by i -th user in the n algorithm. The result of the experiment
2.
1 is showed in Figure
Slm
. C'l,}). =1"ii"
e S 1.1
l,}
*Ry "f1+ C1 -wJ;-"S(") (10) 0.20 - --- ---
,...--
correlation-based similarity and adjusted-cosine similarity. cosine-based correlat ion- adjusted Simi-New
366
are 335 users in the experiment data, so the sparsity of the the growth of data sparsity the superiority of the Simi-New
data is 5.05%. We use the above four similarity algorithms to similarity algorithm is more obvious.
verify the superiority of the ontology-based user model
presented in this paper. The result of the experiment 3 is VI. CONCLUSION
showed in Figure 4. This paper presents a new kind of ontology-based user
modeling method. We use the ontology concept hierarchy
0.30 tree to build the user models. This paper newly introduces
the ontology and semantic concept to the user modeling
0.25
method, which makes use of semantic relationship between
0.20
the concept nodes, so the user model presented in this paper
can effectively describe the users' personalized preferences.
O. 15 We also propose an improved similarity algorithm, which
U.J
< effectively bridges the gap between the traditional similarity
::>; O. 10
algorithms and the preCISIOn of personalized
0.05 recommendation. The new ontology-based user models and
improved similarity algorithm effectively improve the
0.00
quality of the personalized service systems and satisfy the
users' growing personalized needs.
cosine-based adjusted correJation- Simi-New
cosine based
Figure 4. MAE of two user models The research is supported by the fund of the State Key
Laboratory of Software Development Environment
As shown in the above chart, we can see the ontology SKLSDE-2009ZX-12.
based user model is superior to user-resource matrix-based
user model in the condition the sparsity of the data is 5.05%.
In experiment 4, we select 7334 records from the first four REFERENCES
months Web logs as the training data, and select 2527 [I] O. Nasraoui, "World Wide Web Personalization," In J. Wang (ed),
records from the last month Web logs as the test set. There Encyclopedia of Data Mining and Data Warehousing, Idea Group,
are 889 users in the experiment data, so the sparsity of the 2005.
data is 2.62%. The result of the experiment 4 is showed in [2] Modi P. J. and Shen W. M., "Collaborative Multiagent learning for
Figure 5. classification tasks," In: Proceedings of the Fifth International
Conference on Autonomous Agents, 2001.
[3] Socha K. and Kisiel-Dorohinicki M, "Agent-based evolutionary
0.20 multi-objective optimisation," Hawaii, USA: Proceedings of CEC'02-
Congress on Evolutionary Computation, 2002.
[4] W Liu, F Jin, and X Zhang, "Ontology-Based User Modeling for E
0.15
Commerce System," In: Proc of the 3rd International Conference on
Pervasive Computing and Applications. Alexandria, Egypt, 2008,
pp.260-263.
�
""
0.10
[5] J. Trajkova and S. Gauch, "Improving Ontology-Based User
Profiles," In: Proc of 2004'RIAO. Avignon, France, 2004.
0.05 [6] S. Berkovsky, T. Kuflik, and F. Ricci, "Mediation of user models for
enhanced personalization in recommender systems," In: Proc of User
Modelling and User-Adapted Interaction. 2008, pp245-286.
0.00
[7] B. Sarwar, G. Karypis, J. Konstan, and J. Riedl, "Item-based
cosine-based adjusted correlation- Simi-New
collaborative filtering recommendation algorithms," In: Proc of the
cosine based
10th International World Wide Web Confence. New York, USA,
2001, pp.285-295.
[8] M. Deshpande and G. Karypis, "Item-based Top-N recommendation
Figure 5. MAE of two user models algorithms," ACM Transactions on Information Systems, 2004,
22(1):143-177.
As shown in the above chart, we can see the ontology [9] G. Karypis, "Evaluation of item-based top-n recommendation
based user model is superior to user-resource matrix-based algorithms," In: Proc of The Tenth International Conference on
Information and Knowledge Management(CIKM). New York, USA,
user model in the condition the sparsity of the data is 2.62%.
200I, pp.247-254.
From the above, we can get the conclusion that the ontology
[10] H. Dapeng, L. Qianhui, and Z. Jingmin, "An Improved Similarity
based user model presented in this paper is better than
Algorithm for Personalized Recommendation," International Forum
commonly used user-resource matrix-based user model. on Computer Science-Technology and Applications, IFSCTA, 2009
From the two charts we can also get the conclusion that with [II] L. Jinzhong, "Pattern Recognition Introduction," Beijing, Higher
Education Press, 1994: 300-301.
367