CIS 8590: Topics in Computer Science -- Text Mining and Language Processing [Spring 2008]

Prerequisites, Text, Description, Grading, Exams, Final Project.

Additional information about this course may be found on the Web at
http://knight.cis.temple.edu/~yates/cis8590/.

Lecture Time: Thursdays: 4:40pm to 7:10pm in Tuttleman 1B

Instructor : Alexander Yates Miscellaneous:

PREREQUISITES

TEXT

There is no textbook for this course. We will be reading extensively from the research literature.

DESCRIPTION

This course will give a broad overview of problems and techniques in natural language processing, and then move on to cover the latest research in selected topics. The overview part of the course will cover problems in: The in-depth part of the course will focus on the latest research in unsupervised information extraction. This part of the course will cover such techniques as stemming, pointwise mutual information, pattern-matching, bootstrapping, TF-IDF, n-gram models, Hidden Markov Models, Conditional Random Fields, statistical parsing, clustering, and language modeling.

GRADING

EXAMS AND QUIZZES

All exams and quizzes are closed book. Their content is cumulative, i.e. they address the material from the entire semester up to the day of the exam. If a student misses the midterm for an emergency [as agreed with instructor], there will be no makeup exam: the homeworks, quizzes, and final project will become proportionally more important. If a student misses the midterm without previous agreement and without definitive proof as to the medical or legal reasons, he or she will get a zero for that exam. Quizzes that are missed will not be made up. //The final exam is mandatory on the scheduled day.

FINAL PROJECT

Several project ideas will be suggested during the course of the semester, but students are free to suggest their own, especially if they relate to their current research. Students will be expected to come up with innovative, novel solutions to problems in text mining and language processing.

Course projects will be undertaken individually or in small teams (2-3 students). Each student on a team will receive the same grade for the project; it is up to the team members to divide the work fairly. More information on course projects will be provided soon.