The Past, Present and Future of Web Information Retrieval

 

Mehran Sahami

Google

 

Time: 3:45-5:00 p.m.

Date:  October 23, 2003

 

 

Abstract

 

Web search engines have emerged as one of the central applications on the Internet.  In fact, search has become one of the most important activities that people engage in on the Internet.  Even beyond becoming the number one source of information, a growing number of businesses aredepending on web search engines for customer acquisition.  The first generation of web search engines used text-only retrieval techniques. Google revolutionized the field by deploying the PageRank technology --an eigenvector-based analysis of the hyperlink structure -- to analyze the web in order to produce relevant results.  Moving forward, Google's goal is to better allow users to obtain relevant information through the deployment of a variety of technological advances in information analysis, understanding, and retrieval.  This presentation will describe some of the main challenges encountered in web information retrieval, some of the current techniques used, and will offer an overview of the search engine of the future.  We will discuss how foundational issues in information retrieval, large scale systems building, and artificial intelligence technologies all play an important role in building a successful search engine

 

Biography

 

Mehran Sahami is a Senior Research Scientist at Google and is also a Lecturer in the Computer Science Department at Stanford University.  At Google, Mehran conducts research in machine learning technologies to help improve information access.  At Stanford, he teaches classes in Programming Methodology and Discrete Mathematics.  Prior to his current positions, Mehran managed the data mining research and development group at Epiphany for several years, and was involved in a number of machine learning research projects at Stanford, Xerox PARC, SRI International, and Microsoft Research.  He received his BS, MS, and PhD in Computer Science from Stanford, and evidently still has issues with regard to leaving.