Background | Figure 1: End-to-end question answering by GUSP for sentence get flight from toronto to san diego stopping in dtw. |
Experiments | Since our goal is not to produce a specific logical form, we directly evaluate on the end-to-end task of translating questions into database queries and measure question-answering accuracy. |
Experiments | The numbers for GUSP-FULL and GUSP++ are end-to-end question answering accuracy, whereas the numbers for ZC07 and FUBL are recall on exact match in logical forms. |
Grounded Unsupervised Semantic Parsing | Figure 1 shows an example of end-to-end question answering using GUSP. |
Introduction | We evaluated GUSP on end-to-end question answering using the ATIS dataset for semantic parsing (Zettlemoyer and Collins, 2007). |
Introduction | Despite these challenges, GUSP attains an accuracy of 84% in end-to-end question answering, effectively tying with the state-of-the-art supervised approaches (85% by Zettlemoyer & Collins (2007), 83% by Kwiatkowski et al. |
Introduction | We performed an end-to-end evaluation against a database of 15 million facts automatically extracted from general web text (Fader et al., 2011). |
Introduction | 0 We introduce PARALEX, an end-to-end open-domain question answering system. |
Question Answering Model | For the end-to-end QA task, we return a ranked list of answers from the k highest scoring queries. |
Conclusion | Performance remains good when resolving toponyms identified automatically, indicating that end-to-end systems based on our models may improve the experience of digital humanities scholars interested in finding and visualizing toponyms in large corpora. |
Introduction | However, it is important to consider the utility of an end-to-end toponym identification and resolution system, so we also demonstrate that performance is still strong when toponyms are detected with a standard named entity recognizer. |
Toponym Resolvers | 4States and countries are not annotated in CWAR, so we do not evaluate end-to-end using NER plus toponym resolution for it as there are many (falsely) false positives. |