合作交流 / 学术报告

On Several Untypical Information Retrieval and Web Search Paradigms

Title: On Several Untypical Information Retrieval and Web Search Paradigms
Speaker: Dr. Byron J. Gao (Texas State University, USA)
Time
: 3:00pm, Friday,June 12
Venue: Lecture Room,State Key Lab of Computer Science, Level 3 Building #5, Institute of Software, CAS

Abstract: In the BoBo project, we study the two-box search paradigm that features two input boxes on the search interface. Besides a search box taking search terms as in normal search engines, a domain box is used to take domain knowledge in the form of keywords. As search terms are inherently ambiguous, domain terms can be optionally used to route search results towards a user-intended domain.

In the Cager project, we study cross-page web search. Existing search engines have page as the unit of retrieval of information. Generally, given a query as a set Q of keywords, they return a ranked list of web pages, each containing Q. However, quite often, users wish to have what we call “cage” as the unit of retrieval. A cage, crossing multiple pages, is a set of closely related web pages that collectively contain Q.

In the Rant project, we try to provide a framework for mass-collaboration-based social search. Google (login) and Microsoft (U Rank) are experimenting on search engines that allow people to organize, edit and annotate search results, as well as share information with others. Currently there is no much systematic research on how to interpret, store, preserve, and utilize user preferences.

Biography:Byron J. Gao received his Ph.D. and B.Sc. in Computer Science from Simon Fraser University, Canada, in 2007 and 2003 respectively. He was a postdoctoral fellow at the University of Wisconsin – Madison before joining Texas State University – San Marcos in 2008, where he currently holds an assistant professor
position. His general areas of research are data mining, databases, information retrieval, and bioinformatics.