![]() |
Dottorato in Ingegneria Informatica
e Elettronica Industriali |
A.A. 2003/2004
|
INTRODUCTION TO
A small course (twelve hours) in natural language processing (NLP)
is offered this year within the "Dottorato in Ingegneria
Informatica e Elettronica Industriali".
The course assumes general computer science background,
elementary knowledge of formal language theory, ability to analyze
algorithm complexity, and elementary knowledge of probability.
The teacher Giorgio Satta
can be contacted at
satta@dei.unipd.it
| ||||||||||||||||||
I. SCHEDULE
Twelve (12) hours distributed into three weeks.
| ||||||||||||||||||
II. PREREQUISITES
Before first lecture, please
read the following introductory material to natural language
processing, which offers an overview of the field and some
historical notes.
| ||||||||||||||||||
III. FINITE-STATE MODELS
Regular expressions, finite automata and finite-state transducers;
basic properties and algorithms.
We will look at two applications. First application shows
how to apply finite-state transducers to speed-up
part-of-speech tagging systems. Second application
shows how to implement a phonological theory using
finite-state transducers.
| ||||||||||||||||||
IV. CONTEXT-FREE MODELS
Context-free grammars, push-down automata
and push-down transducers. Parsing strategies and
tabular parsing algorithms.
As an application, we will look at the probabilistic extensions
of context-free grammars and maximum-likelihood estimators.
We will also closely look at a state-of-the art statistical
parser for natural language.
| ||||||||||||||||||
V. CREDITS
In order to get credits out of this course, candidates should
write and discuss with the teacher a short project paper
on a problem related to those presented in the class.
The specific subject of the paper should be first negotiated
with the teacher.
| ||||||||||||||||||
|
Last update: Fri Aug 29 2003