iPRES (International Conference on Preservation of Digital Objects), the most important international conference on long-term digital archiving, will take place from 25 to 29 September 2017 in Kyoto, Japan. Michelle Lindlar from the Technische Informationsbibliothek (TIB) – German National Library of Science and Technology will give a talk entitled “A Test-Set for Well-Formedness Validation in JHOVE – The Good, the Bad and the Ugly” in the “PDF Preservation” session on 27 September 2017 (14:10, Main Hall).
The paper deals with the validation of PDF files against the PDF standard ISO 32000. Validating files against their format standard is an important step in long-term archiving, as it is the only way to verify that a digital object actually conforms to the file format it claims to be. Although JHOVE, the de facto standard tool, is widely used in the community for validation, there has so far been no test set for checking whether its results are correct. The iPRES paper, written in cooperation with Yvonne Tunnat from the ZBW – Leibniz Information Centre of Economics and Carl Wilson from the OPF – Open Preservation Foundation, closes this gap with a simple synthetic test set: it contains around 90 files, each of which violates an individual requirement of the standard.
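To give a rough idea of what a synthetic test set looks like, the following Python sketch derives two broken variants from one intact sample, each violating exactly one requirement, and runs a very small checker over all three. It is only an illustration under stated assumptions: the function and variable names (`basic_pdf_checks`, `SYNTHETIC_CASES`, `run_cases`) are hypothetical, the sample bytes are a stand-in rather than a renderable PDF, and the two checks (header and end-of-file marker) are neither JHOVE's implementation nor the test set from the paper.

```python
import re
from typing import Dict, List

# A PDF file is expected to start with a header of the form %PDF-x.y
HEADER_RE = re.compile(rb"%PDF-\d\.\d")


def basic_pdf_checks(data: bytes) -> List[str]:
    """Return a list of problems found; an empty list means both checks passed.

    Hypothetical helper for illustration only. It covers two structural
    requirements and is nowhere near a full well-formedness validation.
    """
    problems = []
    # Check 1: the file must begin with the %PDF-x.y header.
    if not HEADER_RE.match(data):
        problems.append("missing or malformed %PDF-x.y header")
    # Check 2: an %%EOF marker must appear near the end of the file.
    if b"%%EOF" not in data[-1024:]:
        problems.append("missing %%EOF marker")
    return problems


# Deliberately tiny stand-in for an intact file (assumption: not a real PDF).
GOOD = b"%PDF-1.4\n% body omitted in this sketch\n%%EOF\n"

# Each synthetic case breaks exactly one requirement of the intact sample.
SYNTHETIC_CASES: Dict[str, bytes] = {
    "good": GOOD,
    "bad_header": GOOD.replace(b"%PDF-1.4", b"%PFD-1.4"),
    "no_eof": GOOD.replace(b"%%EOF\n", b""),
}


def run_cases() -> Dict[str, List[str]]:
    """Run the checker over every synthetic case and collect the findings."""
    return {name: basic_pdf_checks(data) for name, data in SYNTHETIC_CASES.items()}


if __name__ == "__main__":
    for name, findings in run_cases().items():
        print(name, "->", findings or "no problems found")
```

Because every broken file differs from the intact one in a single, known respect, it is clear in advance what a validator should report for it, and that expectation is what makes it possible to judge whether the validator's output is correct.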