[[Category:Code4Lib2011]]
== Using JHOVE2 for Policy Assessment of Files ==
Description: JHOVE2 is a Java framework and application for format-aware characterization of files, byte streams within files, and file containers or other file aggregations. JHOVE2 examines a digital source unit and extracts feature information about that source unit for purposes of classification, analysis, and use.
In addition to detailed output of the features of a format instance, JHOVE2 can provide a summary determination of the validity of an item (its conformance to the normative syntactic and semantic requirements defined by an authoritative specification). It can also be used to assess the level of acceptability of a digital object for a specific purpose on the basis of locally defined policy rules. This assessment capability is one of the significant enhancements of JHOVE2 over its predecessor.
This session will provide some examples of the structure of JHOVE2 format modules, the outputs produced by those modules, and the configuration of the JHOVE2 assessment module so that it can be used to perform rule-based analysis of the reportable properties previously generated during characterization of a source unit.
For more information, visit [http://jhove2.org http://jhove2.org].
Duration: half-day
Speaker Bio: Richard Anderson is a member of the JHOVE2 development team and a Software Engineer with the Digital Library Systems and Services unit of Stanford University.
Contact: Richard Anderson <rnanders at stanford dot edu>
== Creating a new JHOVE2 Format Module ==
Description: JHOVE2 is a Java framework and application for format-aware characterization of files, byte streams within files, and file containers or other file aggregations. JHOVE2 examines a digital source unit and extracts feature information about that source unit for purposes of classification, analysis, and use.
JHOVE2 is a significant re-engineering of its predecessor, JHOVE ([http://hul.harvard.edu/jhove/ http://hul.harvard.edu/jhove/]). Its highly modular structure is intended to facilitate the rapid creation of new characterization modules for many formats, which can easily be plugged into the JHOVE2 framework. The initial JHOVE2 distribution includes modules for UTF-8, SGML, Shapefile, TIFF, WAV, XML, and ICC color profiles, with ZIP, PDF, and JPEG-2000 modules expected to be deployed in the next few months. Developers at the Wegener Institute ([http://www.awi-potsdam.de http://www.awi-potsdam.de]) have already created new modules for netCDF and GRIB. Developers at the French National Library (La Bibliothèque nationale de France, [http://www.bnf.fr/fr/acc/x.accueil.html http://www.bnf.fr/fr/acc/x.accueil.html]) are currently working on GZIP and ARC modules.
This session will provide an overview of the JHOVE2 processing module and plug-in architecture, and will walk through the steps of creating a new format module.
For more information, visit http://jhove2.org.
Duration: half-day
Speaker Bio: Richard Anderson is a member of the JHOVE2 development team and a Software Engineer with the Digital Library Systems and Services unit of Stanford University.
Contact: Richard Anderson <rnanders@stanford.edu>