Advanced Topics in Information Retrieval (COP-6776)
Summer A 2007
Announcements
· midterm stats:
|
min |
19 |
|
max |
80 |
|
median |
59 |
· Welcome to the COP-6776 web page!
General Info
Instructor: Vagelis Hristidis
Lecture time: Tue/Thu 5 pm - 7:40 pm
Location: ECS 138
Office hours: Tuesday 4 pm - 5 pm
Grading
5% participation
35% midterm
20% presentation
40% project
Course Description
Information
Retrieval (IR) principles including indexing and searching document
collections, as well as advanced IR topics such as Web search and IR-style
search in databases.
Some of the topics which will be tentatively presented are:
· Vector Model
· Probabilistic IR
· IR evaluation methods
· Web search
· IR-style search of XML documents and relational databases
Presentation and
Project Assignments
Tentative
Lectures’ Schedule
|
Date |
Topic |
Book Chapters or Other Material |
|
5/8/2007 |
Class Overview, Traditional IR models I |
Chapter 1, intro; |
|
5/10/2007 |
Traditional IR models II |
|
|
5/15/2007 |
Retrieval Evaluation |
1. Chapter 3, 3. slides: IR Evaluation, top-k lists comparing |
|
5/17/2007 |
Query Languages, |
1. 4.1, 4.4 (up to page 109), 5.1-5.2.1, 6.3.3, 7.1-7.2, slides 2. (p2) G Salton, C Buckley. Improving retrieval performance by
relevance feedback. Journal of the American Society for Information
Science, 1990 |
|
5/22/2007 |
Indexing |
1. 8.1-8-3, slides 2. (p3) Zobel, J., Moffat, A., and Ramamohanarao, K. Inverted files versus signature files for text indexing. ACM Trans. Database Syst. 23, 4 (Dec. 1998), 453-490. |
|
5/24/2007 |
Searching the Web, PageRank |
1. (p4) L. Page, S. Brin, R. Motwani, T. Winograd. The
PageRank Citation Ranking: Bringing Order to the Web. 1999 slides: link-based search |
|
5/29/2007 |
MIDTERM |
|
|
5/31/2007 |
Web Search II |
1. (p6) 2. (p7) Heydon, A. and Najork, M. 1999. Mercator: A scalable, extensible Web crawler. World Wide Web 2, 4 (Apr. 1999), 219-229. 3. (instructor will briefly present this) Taher H. Haveliwala, "Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search," IEEE Transactions on Knowledge and Data Engineering, vol. 15, no. 4, pp. 784-796, Jul/Aug, 2003. |
|
6/5/2007 |
Document Clustering and Classification |
1. (p8) Zamir, O. and Etzioni, O. 1998. Web document clustering: a feasibility demonstration. ACM SIGIR '98 2. (p9) Oren Zamir, Oren Etzioni: Grouper: A Dynamic Clustering Interface
to Web Search Results. Computer Networks 31(11-16): 1361-1374 (1999) |
|
6/7/2007 |
XML Search |
1. (p10) Sara Cohen, Jonathan Mamou, Yaron Kanza, Yehoshua Sagiv: XSEarch: A Semantic Search Engine for XML. 45-56, VLDB 2004 2. (p11) L. Guo, F. Shao, C. Botev, J. Shanmugasundaram: XRANK:
Ranked Keyword Search over XML Documents. SIGMOD 2003 |
|
6/12/2007 |
Q&A systems |
1. (p12) Eric Brill, Susan Dumais, Michele Banko An Analysis of the AskMSR Question-Answering System (EMNLP 2002) 2. (p13) S. T. Dumais, E. Cutrell, E., J. J. Cadiz, G. Jancke, R. Sarin and D. C. Robbins. Stuff I've Seen: A system for personal information retrieval and re-use. SIGIR 2003 3. QA slides |
|
6/14/2007 |
Projects’ Presentations |
|
|
6/19/2007 |
Projects’ Presentations |
|
Other Resources
Textbook
By Ricardo Baeza-Yates, Berthier Ribeiro-Neto.
Published by Addison Wesley Professional.
ISBN: 020139829X; Published: May 15, 1999
Also recommended for reference:
Policies
Code of Academic Integrity:
http://www.fiu.edu/~oabp/misconductweb/2codeofacainteg.htm
University Policies: academic misconduct, sexual harassment, religious holidays, and information on services for students with disabilities.