trec123-100-sample300-callan99.v1a:
(115 KB gzipped, 650 KB uncompressed):
- Description: A 100-collection testbed created by sampling
trec123-100-bysource-callan99.v2a.
- Publication: In progress.
trec123-100-bysource-callan99.v2a:
(3 MB gzipped, 23 MB uncompressed):
- Description: A 100-collection testbed created from TREC CDs 1, 2, and 3.
- Publication: A.L. Powell, J.C. French, J. Callan, M. Connell, and C.L. Viles,
"The impact of database selection on distributed searching."
In Proceedings of the 23rd International ACM SIGIR Conference on
Research and Development in Information Retrieval (SIGIR 00),
Athens, 2000.
trec123-100-bysource-callan99.v1b:
(3 MB gzipped, 23 MB uncompressed):
- Description: A 100-collection testbed created from TREC CDs 1, 2, and 3.
This version inadvertently omits 161 AP '90 documents from CD 3.
- Publication: J. French, A. Powell, J. Callan, C. Viles, T. Emmitt, K. Prey, and
Y. Mou, "Comparing the performance of database selection algorithms."
In Proceedings of the 22nd International ACM SIGIR Conference on
Research and Development in Information Retrieval (SIGIR 99),
Berkeley, CA, August 15-19, 1999.
trecvlc1-921-bysource-callan99:
(18 MB gzipped, 260 MB uncompressed):
- Description: A 921-collection testbed created from the first TREC VLC corpus.
- Authors: J. Callan and Y. Mou.
- Publication: J. French, A. Powell, J. Callan, C. Viles, T. Emmitt, K. Prey, and
Y. Mou, "Comparing the performance of database selection algorithms."
In Proceedings of the 22nd International ACM SIGIR Conference on
Research and Development in Information Retrieval (SIGIR 99),
Berkeley, CA, August 15-19, 1999.
trec4-bysource-xu99:
(1.5 MB gzipped, 11.5 MB uncompressed):
trec4-kmeans-xu99:
(2.2 MB gzipped, 11.9 MB uncompressed):
- Description: A 100-collection testbed created from TREC-4 data, in which
the collections were formed by clustering documents, so the documents
in each collection are on roughly the same topic.
- Authors: J. Xu and W.B. Croft.
- Publication: J. Xu and W.B. Croft,
"Cluster-based language models for distributed retrieval."
In Proceedings of the 22nd International ACM SIGIR Conference on
Research and Development in Information Retrieval (SIGIR 99),
1999.
trec6-bysource-xu99:
(1.4 MB gzipped, 10.5 MB uncompressed):
trec6-kmeans-xu99:
(2 MB gzipped, 11 MB uncompressed):
- Description: A 100-collection testbed created from TREC-6 data, in which
the collections were formed by clustering documents, so the documents
in each collection are on roughly the same topic.
- Authors: J. Xu and W.B. Croft.
- Publication: J. Xu and W.B. Croft,
"Cluster-based language models for distributed retrieval."
In Proceedings of the 22nd International ACM SIGIR Conference on
Research and Development in Information Retrieval (SIGIR 99),
1999.
trec123-17-bysource-callan99.v1a:
(?? MB gzipped, ?? MB uncompressed):
- Description: A 17-collection testbed created from TREC CDs 1, 2, and 3.
- Authors: J. Callan and Z. Lu.
- Publication: J. P. Callan, Z. Lu, and W. B. Croft,
"Searching distributed collections with inference networks."
In Proceedings of the 18th International ACM SIGIR Conference on
Research and Development in Information Retrieval (SIGIR 95),
Seattle, WA, 1995.