Description: After crawling and keyword indexing, the next wave that has made a significant impact on Web search is topic distillation: analyzing properties of the hyperlink graph for enhanced ranking of Web pages in response to a query. Hyperlink induced topic search (HITS) and PageRank (used in Google) are two examples.
We discuss two enhancements to the graph selection process. First we will describe a learning system called a "focused crawler". Second we will discuss a fine-grained model for 'micro-hubs' and new algorithms based on the Minimum Description Length principle.
Speaker(s):
Soumen Chakrabarti, Indian Institute of Technology
|