ClueWeb09
Batch Query Service for ClueWeb09: Use the Indri search engine to run batch queries against either the English part of the ClueWeb09 Category A dataset or the Category B dataset.
Category A - English Interactive Search: Use the Indri search engine to interactively search the English part of the ClueWeb09 Category A dataset.
Category B Interactive Search: Use the Indri search engine to interactively search the ClueWeb09 Category B dataset.
Wikipedia Interactive Search: Use the Indri search engine to interactively search the Wikipedia part of the ClueWeb09 dataset.
Page Rendering: Render selected ClueWeb09 web pages (text + images).
ClueWeb09 Attribute Lookup Service: Fast lookup of ClueWeb09 document attributes.
gov2 Search: Use the Indri search engine to search the full index of the entire Gov2 dataset (5,330,182 documents), including named-entity and part-of-speech tagging.
RCV1 Search: Use the Indri search engine to search Reuters Corpus, Volume 1, English language, 1996-08-20 to 1997-08-19 (806,791 documents).
wt10g Search: Use the Indri search engine to search the entire wt10g corpus (1,692,096 documents) as prepared by TREC and last updated in March 2000. The documents originally came from 11,680 web servers, with a minimum of 5 documents per server and an average of 144 documents per server. Four metadata files are associated with this index.
IMDB Search: Use the Indri search engine to search a per-movie index of IMDB entries from early 2006 (319,917 documents).