Table of Contents

Schedule
Preface
Probabilistic IR Models Based on Document and Query Generation
John Lafferty and Chengxiang Zhai (Carnegie Mellon University)
1
Language Models and Uncertain Inference in Information Retrieval
Norbert Fuhr (University of Dortmund)
6
LM vs PM: Where's the Relevance?
Karen Sparck Jones (Cambridge University) and Stephen Robertson (Microsoft Research, Cambridge)
12
Relevance-Based Language Models: Estimation and Analysis
Victor Lavrenko and Bruce Croft (University of Massachusetts, Amherst)
16
Language Models and Probability of Relevance
Stephen Robertson (Microsoft Research, Cambridge and City University London) and Djoerd Hiemstra (Microsoft Research, Cambridge and University of Twente)
21
Is it the Language Model in Language Modeling?
Warren Greiff (The Mitre Corp.)
26
The Dual Role of Smoothing in the Language Modeling Approach
Chengxiang Zhai and John Lafferty (Carnegie Mellon University)
31
Is Information Retrieval Anything More Than Smoothing?
Jay Ponte (The Mitre Corp.)
37
From Latent Semantic Indexing to Language Models and Back
Thomas Hofmann (Brown University)
42
Language Model Feature Induction via Discriminative Techniques
Jerry Xiaojin Zhu (Carnegie Mellon University)
47
Semantic Text Clusters and Word Classes -- The Dualism of Mutual Information and Maximum Likelihood
Jochen Peters (Philips Research Laboratories)
55
Using Language Models for Tracking Events of Interest Over Time
Martijn Spitters and Wessel Kraaij (TNO TPD)
60
Topic Models for Summarizing Novelty
James Allan, Rahul Gupta, and Vikas Khandelwal (University of Massachusetts, Amherst)
66
Unsupervised Topic Discovery
Rich Schwartz, Sreenivasa Sista, Timothy Leek (BBN Technologies)
72
Sub-Word-Based Language Models for Speech Recognition: Implications for Spoken Document Retrieval
Martha Larson (GMD-IMK)
78
Using Compression-Based Language Models for Text Categorization
William J. Teahan and David J. Harper (Robert Gordon University)
83
Language Modeling for Good Generation
Kevin Knight (USC/Information Sciences Institute)
88
Using Models of Score Distributions in Information Retrieval
R. Manmatha, F. Feng and T. Rath (University of Massachusetts, Amherst)
91
A Generative Model for Filtering Thresholds
Yi Zhang and Jamie Callan (Carnegie Mellon University)
97
Index