Overview of VideoCLEF 2009: New perspectives on speech-based multimedia content enrichment
Abstract VideoCLEF 2009 offered three tasks related to enriching video content for improved multimedia access in a multilingual environment. For each task, video data (Dutch-language television, predominantly documentaries) accompanied by speech recognition transcripts were provided. The Subject Classification Task involved automatic tagging of videos with subject theme labels. The best performance was achieved by approaching subject tagging as an information retrieval task and using both speech recognition transcripts and archival metadata. Alternatively, classifiers were trained using either the training data provided or data collected from Wikipedia or via general Web search. The Affect Task involved detecting narrative peaks, defined as points where viewers perceive heightened dramatic tension. The task was carried out on the "Beeldenstorm" collection containing 45 short-form documentaries on the visual arts. The best runs exploited affective vocabulary and audience-directed speech. Other approaches included using topic changes, elevated speaking pitch, increased speaking intensity and radical visual changes. The Linking Task, also called "Finding Related Resources Across Languages," involved linking video to material on the same subject in a different language. Participants were provided with a list of multimedia anchors (short video segments) in the Dutch-language "Beeldenstorm" collection and were expected to return target pages drawn from English-language Wikipedia. The best-performing methods used the transcript of the speech spoken during the multimedia anchor to build a query to search an index of the Dutch-language Wikipedia. The Dutch Wikipedia pages returned were used to identify related English pages. Participants also experimented with pseudo-relevance feedback, query translation and methods that targeted proper names.
Bibtextype inproceedings  +
Doi 10.1007/978-3-642-15751-6_46  +
Has author Larson M. + , Newman E. + , Jones G.J.F. +
Has extra keyword Automatic tagging + , Multilingual environments + , Multimedia access + , Multimedia contents + , Pseudo relevance feedback + , Query translations + , Subject classification + , Training data + , Video contents + , Video data + , Video segments + , Visual arts + , Web searches + , Wikipedia + , Concentration (process) + , Feedback + , Information retrieval + , Linguistics + , Metadata + , Natural language processing systems + , Video recording + , Websites + , Speech recognition +
Isbn 3642157505; 9783642157509  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 354–368  +
Published in Lecture Notes in Computer Science +
Title Overview of VideoCLEF 2009: New perspectives on speech-based multimedia content enrichment +
Type conference paper  +
Volume 6242 LNCS  +
Year 2010 +
Creation date 8 November 2014 03:09:44  +
Categories Publications without keywords parameter  + , Publications without license parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications without references parameter  + , Publications  +
Modification date 8 November 2014 03:09:44  +
Date 2010  +