ClueWeb09 Wiki
User:
Password:
Backlinks...
ClueWeb09 Wiki
Dataset Information
Sample Files
Below is a list of sample files taken from the ClueWeb09 dataset. Each file has 100 pages (WARC response records).
ClueWeb09_English_Sample.warc.gz
(348k)
ClueWeb09_Chinese_Sample.warc.gz
(451k)
ClueWeb09_Spanish_Sample.warc.gz
(370k)
Created by:
mhoy
. Last Modification: Wednesday 29 of April, 2009 12:56:26 EDT by
mhoy
.
Powered by
TikiWiki CMS/Groupware