CSC 483/583: Text Retrieval and Web Search (Spring 2017)

Most of the web data today consists of unstructured text. Of course, the fact that this data exists is irrelevant, unless it is made available such that users can quickly find information that is relevant for their needs. This course will cover the fundamental knowledge necessary to build these systems, such as web crawling, index construction and compression, Boolean, vector-based, and probabilistic retrieval models, text classification and clustering, link analysis algorithms such as PageRank, learning to rank, and computational advertising. The students will also complete one programming project, in which they will construct one complex application that combines multiple algorithms into a system that solves real-world problems.

Time and Place

Monday/Wednesday 12:30pm - 1:45pm in Gould-Simpson, Room 906

Instructor Information

Instructor: Mihai Surdeanu
msurdeanu AT email DOT arizona DOT edu
Office: Gould-Simpson 746
Office Hours: Mon/Wed 11 - noon

TA: Enrique Noriega
enoriega AT email DOT arizona DOT edu
Office: Gould-Simpson 931
Office Hours: Tue/Thu 11 - 12:30


This term we will be using Piazza for class discussion. The system is highly catered to getting you help fast and efficiently from classmates, the TA, and myself. Rather than emailing questions to the teaching staff, I encourage you to post your questions on Piazza. If you have any problems or feedback for the developers, email

Find our class page at: (log in, then go the Q & A section)