A crowdsourced database of event sequence descriptions for the acquisition of high-quality script knowledge

Lilian DA Wanzare, Alessandra Zarcone, Stefan Thater, Manfred Pinkal

dc.contributor.author	Lilian DA Wanzare, Alessandra Zarcone, Stefan Thater, Manfred Pinkal
dc.date.accessioned	2020-11-23T09:12:20Z
dc.date.available	2020-11-23T09:12:20Z
dc.date.issued	2016
dc.identifier.uri	https://repository.maseno.ac.ke/handle/123456789/2904
dc.description.abstract	Scripts are standardized event sequences describing typical everyday activities, which play an important role in the computational modeling of cognitive abilities (in particular for natural language processing). We present a large-scale crowdsourced collection of explicit linguistic descriptions of script-specific event sequences (40 scenarios with 100 sequences each). The corpus is enriched with crowdsourced alignment annotation on a subset of the event descriptions, to be used in future work as seed data for automatic alignment of event descriptions (for example via clustering). The event descriptions to be aligned were chosen among those expected to have the strongest corrective effect on the clustering algorithm. The alignment annotation was evaluated against a gold standard of expert annotators. The resulting database of partially-aligned script-event descriptions provides a sound empirical basis for inducing high-quality script knowledge, as well as for any task involving alignment and paraphrase detection of events.	en_US
dc.publisher	Universität des Saarlandes	en_US
dc.subject	scripts, events, crowdsourcing, paraphrase	en_US
dc.title	A crowdsourced database of event sequence descriptions for the acquisition of high-quality script knowledge	en_US
dc.type	Article	en_US

Files in this item

Name:: L16-1556.pdf
Size:: 440.8Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Department of Computer science [62]

Show simple item record