cloudbook: The Cloud Computing & SaaS Information Resource

Twitter @cloudbook Facebook Group LinkedIn Group Subscribe for Updates
Sign up and stay informed
Email:  
PDF Print E-mail


Carnegie-Mellon University

Multi-Tier Indexing for Web Search Engines

NSF Award 0841275
 
    Abstract: Researchers at Carnegie-Mellon University are using cloud computing to characterize the topicality of web content to more effectively process web searches. Routing searches topically requires less effort than traditional searches, enabling significant computational and financial savings. The project is using the Google/IBM cluster to "crawl" the web and perform the data cleansing and pre-processing necessary to develop a web dataset of 1 billion documents to support the research. The web dataset is also being made available to the larger information retrieval community to multiply the impact of the project on that discipline.

    Presentation: Topic-Partioned Search Engine Indexes - October 2009 PDF





Bookmark and Share
 


Joyent


CloudSwitch


OpSource

NetSuite

GenieDB
Burstorm


Software & Information Industry Association (SIIA)

CloudCamp

Smart Enterprise Exchange

Cloudbook Login



Login using Facebook

What readers are saying about Cloud


You are here  : Home Carnegie Mellon University Research