Feature / Bug List

Modelling/Markup of Data: The modelling of the underlying raw data should follow a few basic principles: (1) it should be designed with human-readability in mind where possible, (2) it should attempt to minimize the total number of tags where possible (without losing )

Metadata Generation: While most of the metadata included in the final marked-up version of the court documents is simply an alternative representation of the information contained in the document's header, there is a significant amount of useful information that can be collected from outside sources. A primary example of this is a listing of cases that reference the case—this information is not contained in the document but can be gathered from the citing documents and incorporated into a document's RDF metadata expressions. Other examples are formal categories in which the document belongs and significant legal keywords draw from the document's text.

Native Display: Although it is expected that the XML version of the data will be converted into any number of formats and custom displays, the xhtml display presented on the OpenGavel site provides a good test case to work solutions to some common display issues. Most of these issues involve finding a standards-compliant CSS solution but they can sometimes reveal an underlying issue with how the data was originally marked-up.

Administrative Issues