Choosing an XML database for linguistically annotated corpora

Author:
Eckart, Richard
Article title:
Choosing an XML database for linguistically annotated corpora

Volume:
32
Issue:
01 (2008)
Pages:
7-22
Abstract:
XML has become the de-facto standard for representing linguistically annotated corpora. It seems safe to assume that storing and querying an XML-encoded, annotated corpus in an XML database is a straightforward procedure. In reality, however, it is not. This article aims to provide guidelines for deciding whether to use an XML database and how to choose a suitable product. To this end we examine the following questions: Which aspects should be considered before choosing to store an XML-encoded annotated corpus in an XML database? Which facilities does a database need to provide in order to be suitable for storing and querying annotated corpora? Do current XML databases offer these facilities, and, if not, can they be added?
