SciSurf: Index of "manual evaluation" in Proc. ACL 2009

Index of papers in Proc. ACL 2009 that mention

manual evaluation

Seen in text as:

manual evaluation (7)

Seen in 8 sentences in 1 papers.

1. DEPEVAL(summ): Dependency-based Evaluation for Automatic Summaries

Owczarzak, Karolina

In Proc. ACL 2009, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Current practice in summary evaluation	Since manual evaluation is still the undisputed gold standard, both at TAC and DUC there was much effort to evaluate manually as much data as possible.
Current practice in summary evaluation	2.1 Manual evaluation
Current practice in summary evaluation	Automatic metrics, because of their relative speed, can be applied more widely than manual evaluation .
Experimental results	The first question we have to ask is: which of the manual evaluation categories do we want our metric to imitate?
Experimental results	The Pyramid is, at the same time, a costly manual evaluation method, so an automatic metric that successfully emulates it would be a useful replacement.
Experimental results	Table 1: System-level Pearson’s correlation between automatic and manual evaluation metrics for TAC 2008 data.
Introduction	However, manual evaluation of a large number of documents necessary for a relatively unbiased view is often unfeasible, especially in the contexts where repeated evaluations are needed.
Introduction	A more detailed description of BE and ROUGE is presented in Section 2, which also gives an account of manual evaluation methods employed at TAC 2008.

manual evaluation is mentioned in 8 sentences in this paper.

Topics mentioned in this paper: