Experiments | (2009), we performed an automatic held-out evaluation and a manual evaluation. |
Experiments | 7.3.3 Manual Evaluation |
Experiments | For manual evaluation, we picked the 50 top-ranked relation instances for the 15 most frequent relations. |
Experimental Evaluation | The lack of ground truth annotation for inferred facts prevents an automated evaluation, so we resorted to a manual evaluation. |
Related Work | (2010) used a human judge to manually evaluate the quality of the learned rules before using them to infer additional facts. |
Results and Discussion | Since it is not feasible to manually evaluate all the inferences made by the MLN, we calculated precision using only the top 1000 inferences. |
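The precision computation over a top-ranked subset described in the last excerpt can be sketched as precision at k. This is a minimal illustration, not the authors' code: the function name `precision_at_k`, the `(fact, score)` pair layout, and the `judgments` dictionary of human labels are all assumptions introduced for the example.

```python
def precision_at_k(scored_inferences, judgments, k=1000):
    """Fraction of the k highest-scoring inferences judged correct.

    scored_inferences: list of (fact, score) pairs (hypothetical layout).
    judgments: dict mapping each manually evaluated fact to True/False.
    """
    # Rank by score, highest first, and keep only the top k.
    top = sorted(scored_inferences, key=lambda pair: pair[1], reverse=True)[:k]
    if not top:
        return 0.0
    # Only the top-k facts need a manual judgment.
    correct = sum(1 for fact, _ in top if judgments[fact])
    return correct / len(top)


# Toy data: three inferences, of which only the top two are judged.
scored = [("a", 0.9), ("b", 0.8), ("c", 0.1)]
labels = {"a": True, "b": False}
print(precision_at_k(scored, labels, k=2))
```

Because only the top k items are looked up in `judgments`, the annotators need to label k inferences rather than the full output, which is the point of restricting the evaluation to a top-ranked subset.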