Next: A Web Mining Framework
Up: Integrating Web Usage and
Previous: Integrating Web Usage and
The intense competition among Internet-based businesses to acquire new
customers and retain the existing ones has made Web personalization an
indispensable part of e-commerce. Web personalization can be defined as
any action that tailors the Web experience to a particular user, or set
of users. The current challenge in electronic commerce is to develop
ways of gaining deep understanding into the behavior of customers based
on data which is, at least in part, anonymous. We believe that the
solution lies in the creation of a flexible framework and the
development of new techniques for unsupervised and undirected knowledge
discovery from Web usage data, and the integration of content
information and meta-data with the discovered usage patterns.
Personalization based on Web usage mining has several advantages over
more traditional techniques. The type of input is not a subjective
description of the users by the users themselves, and thus is not prone
to biases. The profiles are dynamically obtained from user patterns,
and thus the system performance does not degrade over time as the
profiles age. Furthermore, using content similarity alone as a way to
obtain aggregate profiles may result in missing important relationships
among Web objects based on their usage. Thus, Web usage mining will
reduce the need for obtaining subjective user ratings or
registration-based personal preferences. Web usage mining can also be
used to enhance the effectiveness of collaborative filtering
approaches [6,16]. Collaborative filtering is often based on
matching, in real-time, the current user's profile against similar
records (nearest neighbors) obtained by the system over time from other
users. However, as noted in recent studies [10], it becomes
hard to scale collaborative filtering techniques to a large number of
items, while maintaining reasonable prediction performance and
accuracy. One potential solution to this problem is to first cluster
user records with similar characteristics, and focus the search for
nearest neighbors only in the matching clusters. In the context of Web
personalization this task involves clustering user transactions
identified in the preprocessing stage.
Recent work in Web usage mining has focused on the extraction of usage
patterns from Web logs for the purpose of deriving marketing
intelligence [1,2,3,4,12,19,20], as well
as the discovery of aggregate profiles for the customization or
optimization of Web sites [9,11,14,17,18]. For
an up-to-date survey of Web usage mining systems see [13].
Despite the advantages, usage-based personalization can be problematic
when little usage data is available pertaining to some objects or when
the site content may change regularly. For more effective
personalization, both usage and content attributes of a site must be
integrated into a Web mining framework and used by the recommendation
engine in a uniform manner.
In [7,8], we presented a general framework for
usage-based Web personalization, and proposed specific techniques based
on clustering and association rule discovery to obtain dynamic
recommendations from aggregate usage data. In this paper, we extend
this framework to incorporate content profiles into the recommendation
process as way to enhance the effectiveness of personalization actions.
We discuss specific preprocessing tasks necessary for performing both
content and usage mining, and present techniques based on clustering to
derive aggregate profiles. Our goal is to create a uniform
representation for both content and usage profiles that can be
effectively used for personalization tasks by the recommendation engine
in a consistent and integrated fashion. We show how the discovered
knowledge can be combined with the current status of an ongoing Web
activity to perform real-time personalization. Finally, we provide an
experimental evaluation of the proposed techniques using real Web usage
data.
Next: A Web Mining Framework
Up: Integrating Web Usage and
Previous: Integrating Web Usage and
Bamshad Mobasher
2000-08-14