CS598CXZ Advanced Topics in Information Retrieval (Fall 2016)

Instructor: ChengXiang Zhai

| Home | Basic Information | Schedule |
| Readings | Assignments | Project | Resources |




Readings


Note: In general, the lecture slides are the best "definition" of the core contents -- the contents to be tested in the midterm. That is, you are expected to understand all the major points, models, and techniques that we have discussed in the class; anything beyond the slides can be regarded as optional. Thus, all the lecture slides are required readings.

Required Readings

  1. Rosenfeld's notes (estimation and information theory)

    The goal of reading these notes is to know about some basic concepts in probability, statistics, and information theory. You should read at least Section 3 of the estimation note and all of the information theory note except for section 1.1.6. You should fully understand the derivation of the maximum likelihood estimate for the binomial distribution, and most of the contents in the information theory notes. If you can't understand these, you may want to read relevant discussions in a textbook on probability and statistics, and a book on information theory. Any book on these topics should be sufficient.

  2. [Robertson and Zaragoza 2009]

    This is a nice introduction to how one of the most effective retrieval functions, BM25, has been developed and extended later. As a minimum, read chapters 1-3, but try to read the entire review if you can.

  3. SLMIR

    This book has a chapter (Chapter 2) on a general survey of major retrieval models and an extensive coverage of statistical language models for IR, which might be useful if you want to have a good picture of all kinds of retrieval models in general. Chapters 3-6 are most useful for understanding language models for retrieval. Chapter 7 is useful for understanding topic models.

    More to be posted later.