אירועים
אירועים והרצאות בפקולטה למדעי המחשב ע"ש הנרי ומרילין טאוב
אביחי מאיר (הרצאה סמינריונית למגיסטר)
יום רביעי, 18.05.2011, 12:30
Structured classification tasks such as sequence labeling and dependency
parsing have seen much interest by the Natural Language Processing and
the machine learning communities. Several online learning algorithms were
adapted for structured tasks such as Perceptron, Passive-Aggressive and
the recently introduced Confidence-Weighted learning . These online algorithms
are easy to implement, fast to train and yield state-of-the-art performance.
However, unlike probabilistic models like Hidden Markov Model and Conditional
random fields, these methods generate models that output merely a prediction
with no additional information regarding confidence in the correctness of the
output. In this work we fill the gap proposing few alternatives to compute the confidence
in the output of non-probabilistic algorithms. We show how to compute confidence
estimates in the prediction such that the confidence reflects the probability that
the word is labeled correctly. We then show how to use our methods to detect
mislabeled words, trade recall for precision and active learning. We evaluate
our methods on four noun-phrase chunking and named entity recognition sequence
labeling tasks and on dependency parsing for 14 languages.