Introduction | Specifically, you are given a corpus of news articles in which all tokens have been labeled as either belonging to personal name mentions or not. |
Introduction | Clearly the problems of identifying names in news articles and e-mails are closely related, and learning to do well on one should help your performance on the other. |
Introduction | When only the type of data being examined is allowed to vary (from news articles to e-mails, for example), the problem is called domain adaptation (Daumé III and Marcu, 2006). |
Investigation | These are: abstracts from biological journals [UT (Bunescu et al., 2004), Yapex (Franzen et al., 2002)]; news articles [MUC6 (Fisher et al., 1995), MUC7 (Borthwick et al., 1998)]; and personal e-mails [CSPACE (Kraut et al., 2004)]. |
Investigation | 0 person names in news articles and e-mails We chose this array of corpora so that we could evaluate our hierarchical prior’s ability to generalize across and incorporate information from a variety of domains, genres and tasks. |
Investigation | Figure 3 shows the results of an experiment in learning to recognize person names in MUC6 news articles . |
Abstract | We create a database of pictures that are naturally embedded into news articles and propose to use their captions as a proxy for annotation keywords. |
Abstract | We also demonstrate that the news article associated with the picture can be used to boost image annotation performance. |
BBC News Database | Many online news providers supply pictures with news articles , some even classify news into broad topic categories (e.g., business, world, sports, entertainment). |
BBC News Database | We downloaded 3,361 news articles from the BBC News website.2 Each article was accompanied with an image and its caption. |
Introduction | News articles associated with images and their captions spring readily to mind (e.g., BBC News, Yahoo News). |
Introduction | Importantly, our images are not standalone, they come with news articles whose content is shared with the image. |
Related Work | For example, news articles often contain images whose captions can be thought of as annotations. |