The Taub Faculty of Computer Science Events and Talks
Hanna Fadida (M.Sc. Thesis Seminar)
Wednesday, 21.03.2012, 14:30
Advisor: Prof. A. Itai and Prof. S. Wintner
Verb subcategorization frames determine the number and types of the syntactic arguments that verbs
select, or subcategorize for. Typically, verbs can be associated with subcategorization frames that
specify, for each subcategorized argument, information on the phrases that can realize this
argument. These can be noun phrases (in the case of direct objects), prepositional phrases with one
or more specific preposition, infinitival verb phrases, clauses introduced by a complementizer, etc.
We introduce a technique and an implemented system for automatically acquiring Hebrew verb
subcategorization frames from textual corpora. The system generates a large subcategorization lexicon
which includes subcategorization frames and frequency information for 3532 verbs. Using this lexicon
was shown to improve two tasks: resolving PP-attachment ambiguity, and translating prepositions in an
existing statistical machine translation system from Arabic to Hebrew.