XML Schema driven Database Management of Speech Corpus Metadata

Autor:
Gasch, Joachim
Aufsatztitel:
XML Schema driven Database Management of Speech Corpus Metadata

Jahrgang:
32
Heft:
01 (2008)
Seiten:
23-33
Abstract:
Electronic speech corpora need to bring together several heterogeneous data formats like audio and video data, corpus-, event- and speaker documentation and time aligned media annotations. The metadata management system has to drive data capture, XML native database storage, dynamic publishing and information retrieval processes. This article describes an XML schema based standardization approach where metadata (documentation and annotation information) of different speech corpora is centrally validated and natively stored within an object-relational XML database.

Zurück