Title: Using community-generated contents as a substitute corpus for metadata generation

Authors: M. Meyer, C. Rensing, R. Steinmetz

Addresses: SAP AG, SAP Research CEC Darmstadt, Bleichstr. 8, 64283 Darmstadt, Germany; Multimedia Communications Lab, Technical University Darmstadt, Merckstr. 25, 64283 Darmstadt, Germany; Multimedia Communications Lab, Technical University Darmstadt, Merckstr. 25, 64283 Darmstadt, Germany

Abstract: Metadata is crucial for the reuse of Learning Resources. However, in the area of e-Learning, suitable training corpora for automatic classification methods are scarce. This paper proposes the use of community-generated substitute corpora for classification methods. As an example of such a substitute corpus, the free online encyclopaedia Wikipedia is used as a training corpus for domain-independent classification and keyword extraction of Learning Resources.

Keywords: e-learning; classification; metadata generation; Wikipedia; substitute corpus; online learning; learning resources; reuse.

DOI: 10.1504/IJAMC.2008.016758

International Journal of Advanced Media and Communication, 2008 Vol.2 No.1, pp.59-72

Published online: 21 Jan 2008
