Sei sulla pagina 1di 4

Organizing user Search Histories

Under the guidance of Mrs.ANUSHA, Asst. Professor


[1]
NOWSHEEN BUTUL,[2]K. MAHESHWARI,[3]M.S.ABHISHEK,[4]K.R.SHILPA
[1]
Nowsheenbutul786@gmail.com, [2]mahikopnamoni@gmail.com ,[3]abhishek87121@gmail.com ,
[4]
shilpamuhesh@gmail.com
[1]
8686034099, [2]7097554815, [3]8712149401, [4]8977896809

ABSTRACT:
Internet users continuously make queries over web to obtain required information. They need
information about various tasks and sub tasks for which they use search engines. Over a
period of time they make plenty of related queries. Search engines save these queries and
maintain users search histories. Users can view their search histories in chronological order.
However, the search histories are not organized into related groups. In fact, there is no
organization made except the chronological order. Recently Hwang et al. studied the problem
of organizing historical search information of users into groups dynamically. This automatic
grouping of user search histories can help search engines also in various applications such as
collaborative search, sessionization, query alterations, result ranking and query suggestions.
They proposed various techniques to achieve this. In this paper we implemented those
techniques practically using a prototype web application built in Java technologies. The
experimental results revealed that the proposed application is useful to organize search
histories.

Indexterms: search engine, search history, click graph, query grouping

1.INTRODUCTION it is evident that only 20% of queries are


navigational while the rest are
Information is continuously being added to
transactional. This is because users make
World Wide Web. As the content is
task oriented searches like personal
dramatically increased and made available
finances, travel arrangements, online
to general public, users online make lot of
purchasing and so on. A common thread in
queries to meet their information needs.
all these things is that users make searches
There are many search engines that help by giving a keyword as input. Search
the users in this regard. From a study of engines respond with required information.
search logs of AltaVista [1] and Yahoo [2] Sometimes users may make queries that
are hierarchical and dependent in nature.
To reuse searches and save lot of time, of America}. This information will boost the
late, search engines came up with a feature performance of search engines while
known as Search History. They are able giving ranks. Task level search in
to maintain search histories which are collaborative fashion can be done using
associated with user credentials. The query groups. The search query group
authenticated users can view their search which is the goal of this paper is presented
histories. However, at present the browsers in fig. 2.
are showing search histories in
chronological order only. They do not
organize search histories in more useful
fashion. Fig. 1 shows the search history of
a user with labels such as Yesterday,
Today, and then date wise.

Fig-2
As can be seen in fig. 2, the search history
of a user is given in chronological order.
However, it can be organized more
meaningfully by grouping related queries.

Fig-1
As can be seen in fig. 1, Google search
history is shown in chronological order.
Google can also show the search history in
terms of various categories such as web,
images, news, shopping, Ads, videos,
maps, blogs, books, visual search, travel Fig-3
and finance. However, it does not organize As can be seen in fig. 3, the search history
the search history based on related presented in fig. 2 is grouped into four
similarity of the searches. Query groups categories based on the similarity of
help search engines in many applications. searches. In group 4 financial statement
The key features of search engine can be and Bank of America are grouped
improved by making query groups together as they are closely related. In the
meaningfully. The utilities of query groups same fashion, all the search strings in
include collaborative search, group 1 are closely related. In this paper
sessionization, query alterations, result we implemented the mechanisms proposed
ranking and query suggestions. For by Hwang et al. [3]in which we do not
instance, financial statement is the query depend on temporal properties or textural
which belongs to a group such as properties completely. We depend on the
{financial statement, Bank of behavioral data present in search engines
logs. First of all, we make a query to the URL www.youtube.com or
reformulation graph which contains www.facebook.com. Such a query usually
relationships among queries based on the has skewed click count on one URL, and
frequency. Then we build a query click the class membership of that URL can be
graph that reflects relationships based on excessively influenced by this single
user clicks. query. To avoid their adverse effect on our
fusion based graph algorithms, we identify
Then we combine both query
navigational queries based on measures
reformulation graph and query click graph
proposed in and remove them from our
to generate a query fusion graph. This kind
click graphs.
of approach is also followed in [4], [5] for
session identification and in [6], [7] for In the below fig 4 show the optimization
query clustering. However, in this paper of search based on some real tome
our work extends that in two ways. We use decision based on the Query Fusion Graph,
information from click graph and also Ranking Based Algorithm which is
query reformulation graph for capturing described below. In the context of flow of
similarity in better way. We built a capturing the event based search or click
prototype web application to demonstrate we use graph based algorithm, merged all
the proof of concept. The remainder of these to formulate based on the aspect of
this paper is organized into some sections. mechanism events.
Section II presents review of literature.
Section III provides the proposed approach
for organizing user search histories.
Section IV describes prototype
implementation details.

2. METHODOLOGY:
In the concept of Reliability of search
mechanism based on the historical data
which is the complex analysis of exploring
the fact may be extend with more than one
time. Before exploring particular strategies
for generating relevance judgments from
observed user behavior, we first verify that Fig-4
users react to the relevance of the
presented links. We use the reversed
condition as an intervention that
controllably decreases the quality of the
3.AN OVERVIEW OF PROPOSED
retrieval function and the relevance of the
SYSTEM:
highly ranked abstracts based on the
derived stored procedure. Then we call the 1.We investigate how signals from search
query to be considered as navigational logs such as query reformulations and
when a user is primarily interested in clicks can be used together to determine
visiting a specific web page in mind. For the relevance among query groups. We
example, YouTube or Facebook is study two potential ways of using clicks in
likely to be a navigational query that refers order to enhance this process by fusing the
query reformulation graph and the query [3]. Organizing user search histories have
click graph into a single graph that we very important utilities. They include
refer to as the query fusion graph, and by collaborative search ,sessionization, query
expanding the query set when computing alterations, result ranking and query
relevance to also include other queries suggestions. The application we built
with similar clicked URLs. demonstrates how the search histories of
users are grouped together. Such organized
2.We show through comprehensive
search results are valuable to search
experimental evaluation the effectiveness
engines for various applications mentioned
and the robustness of our proposed search
above.
log- based method, especially when
combined with approaches using other
signals such as text similarity.
REFERENCES
Advantages:
1.Broder, A taxonomy of web search,
1.We will focus on evaluating the SIGIR Forum,. (2002) . 36, 310.
effectiveness of the proposed algorithms in
2.J. Teevan, E. Adar, R. Jones, and M. A.
capturing query relevance.
S. Potts, (2007). Information reretrieval:
2.Relevance Measure
repeat queries in yahoos logs,. 151158.
3.Online query grouping process
4.Similarity function 3.Heasoo Hwang, Hady W. Lauw,
LiseGetoor and AlexandrosNtoulas,
Existing System:
(2012). Organizing User Search
However, this is impractical in our Histories. 24(5).
scenario for two reasons. First, it may have
4.R. Jones and K. L. Klinkner. (2008).
the undesirable effect of changing a users
Beyond the session timeout: Automatic
existing query groups, potentially undoing
hierarchical segmentation of search topics
the users own manual efforts in
in query logs.
organizing her history. Second, it involves
a high computational cost, since we would 5.P. Boldi, F. Bonchi, C. Castillo, D.
have to repeat a large number of query Donato, A. Gionis, and S. Vigna, (2008).
group similarity computations for every The query-flow graph: Model and
new query. applications, in CIKM,.
4.CONCLUSION 6. D. Beeferman and A. Berger. (2000).
Agglomerative clustering of a search
Search engines maintain historical data.
engine query log.
However, they do not organize search
histories well. They only present the 7. R. Baeza-Yates and A. Tiberi. (2007).
search histories in chronological order. In Extracting semantic relations from query
this paper we implemented the logs,
mechanisms to group or organize user
search history such as query reformulation 8. P. Anick. (2003). Using terminological
and click graphs proposed by Hwang etal. feedback for web search refinement: A
log-based study

Potrebbero piacerti anche