Johan van der Knijff, the National Library of the Netherlands, presented his views on ‘PDF/A-3 for preservation’ based on notes on embedded files and JPEG2000.
The presentation was given at DPC briefing (http://bit.ly/1b487mD) which introduced and reviewed recent developments with the PDF / A standard, with particular emphasis on PDF/A version 3 published in October 2012. The meeting took place in Leeds on 13 March 2013.
Scaling API-first – The story of a global engineering organization
PDF/A-3 for preservation. Notes on embedded files and JPEG2000
1. SCAPE
Johan van der Knijff
Koninklijke Bibliotheek – National Library of the Netherlands
DPC, PDF/A-3 Briefing, Leeds, 13.3.2013
PDF/A-3 for preservation
Notes on embedded files and JPEG 2000
16. Not based on “embedded file stream”, but on
“Image XObject” data structure (allows
limited set of pre-defined formats)
What about inline images?
17. No impact on content that is meant to be
rendered by PDF viewer
But PDF/A-3’s may contain file of any possible
format as an attachment
Embedded files wrap-up:
22. ISO 19005-2 (PDF/A-2):
JPEG 2000 support based on subset of JPEG
2000 Part 2 (JPX baseline)
Only Part 1 of the standard (JP2) commonly
used for archival applications!
23. JP2 vs JPX
JP2
JPX
JPEG 2000 Part 1:
Basic still image format
JPEG 2000 Part 2:
= JP2 + assorted
advanced stuff …
25. OS PDF viewers – JPEG 2000 libraries
Ghostscript: OpenJPEG or JasPer
Evince: OpenJPEG
Mupdf: OpenJPEG
Firefox PDF viewer: built-in decoder
None of these libraries support fragmented
codestreams!
26. Is it really a problem?
Fragmented codestreams extremely rare
But why is this feature even allowed in a long-
term archival format?
OS support of JPEG 2000 in general remains
problematic