Action decide Data Model with respect to Fedora 3

Assigned
KFC + ABR + EK

Prev assigned

Tasks adressed

TaskA.2.2,TaskA.2.1

Time estimated
8md

Time used
6md

Priority
6

Status
In progress

Iteration
11,12

Notes

The Fedora 3 Data Model seems to indicate a different datamodel to what we expected, and this has impact on how we should define the final DOMS data model. This needs to be decided, and soon, so MjĂžlner knows how our final datamodel looks.

The issues we have identified so far:

The purpose of this action is to establish how key concepts, mainly inheritance, should be implemented, and formalise the DOMS base collection. We should start manipulating the Fedora developers into accepting our wishes, mainly by emails to their mailinglists.

The product should be a datamodel description, and the foxml object for the Base collection.

Progress

DOMS Data model

Much of the conceptual work have been done, but there is still a huge amount of documentation lacking. The DataModel document should be updated to reflect the ontology advances, and the changes to the content models. A proper, up to date, description of the data model for the reel tape collection should be made.

DS-COMPOSITE

We use DS-COMPOSITE to store arbitrary metadata, as proposed. Fedora does not choke on it, and preserves it faithfully. A new schema for DS-COMPOSITE have been defined, and a schema for the elements we embed in it have also been defined.

Schema validation

There are objects in the tape collection, that need a subset of the qualified dublin core metadata set. This forced us to concretize how we intended schema validation to work.

Until we know the precise capabilities of the engine that should parse the schemas for the GUI, we are sorely restricted in what we can do. For qualified dublin core, admittedly a simple example, we first recreated the dcterms schema. We dropped all the substitution groups, and all the qualifiers, leaving us with just a long list of unique terms. Then we defined schemas, just sequences, restricting which field should be allowed. The idea is to use such a restriction list, combined with the "full" schema, which should be parsable for even simple systems.

PREMIS, and other more advanced schemas have not been considered yet.

View datastream

Have not been updated. There is a serious issue with the way we have chosen to implement it:

In order for incremental updates of a search index to be made, we must inform the search tool each time a "post" in the index is changed. A "post" is defined as the view of a main object. If one of the objects in this view is changed, it has no way of finding its main object, and thus we cannot inform the search index that this "post" is changed.

This issue is still outstanding.

DOMS owl ontology

The Fedora OWL ontology system we will use have been defined in FedoraOntology. Beside the lack of support for DataTypeProperties at current, we believe that it has no loose ends.

TODO

The TODO list from ActionDataModelTDRRequirements has been added to TaskA.2.3AnalysisDocument.

Checklist For Working On An Action

The Life Cycle of an Action:

Please make sure that you address the below issues, when working on an action:

ActionDataModelFedora3 (last edited 2010-03-17 13:12:54 by localhost)