PDF/A for Long-Term Archiving: Standards and Best Practices
PDF/A is an ISO-standardized subset of PDF designed for digital preservation. Government agencies, legal firms, and healthcare organizations use PDF/A to ensure documents remain readable decades from now without special software.
Key Takeaways
- Standard PDFs can reference external fonts, use proprietary encryption, embed JavaScript, and link to external resources, and any of these dependencies can break over time.
- For most archiving needs, PDF/A-2b is the best choice.
- Every font used in the document must be embedded in the file, not just referenced; subsets are acceptable as long as they contain every glyph the document uses.
- After creating a PDF/A file, always validate it against the target standard.
What Makes PDF/A Different
Standard PDFs can reference external fonts, use proprietary encryption, embed JavaScript, and link to external resources. Any of these dependencies can break over time — a font gets discontinued, an encryption standard becomes unsupported, or a linked resource disappears. PDF/A eliminates these risks by requiring complete self-containment.
PDF/A Conformance Levels
| Level | Standard | Key Requirements |
|---|---|---|
| PDF/A-1a | ISO 19005-1 | Full text extraction, logical structure tags, Unicode mapping |
| PDF/A-1b | ISO 19005-1 | Visual reproduction only (no structure requirements) |
| PDF/A-2b | ISO 19005-2 | Adds JPEG 2000, transparency, layers, and PDF/A-compliant attachments |
| PDF/A-2u | ISO 19005-2 | Like 2b plus Unicode text mapping |
| PDF/A-3b | ISO 19005-3 | Allows embedding any file type (XML, CSV, CAD) |
Choosing the Right Level
For most archiving needs, PDF/A-2b is the best choice. It supports modern PDF features (transparency, JPEG 2000) while maintaining broad compatibility. Use PDF/A-1b when maximum compatibility with older validators is required. Choose PDF/A-3b when you need to embed source data files alongside the rendered document.
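The guidance above can be written down as a small decision helper. This is only a sketch: the function name and its two flags are hypothetical, and it assumes that the need to embed source files outranks legacy compatibility, since only PDF/A-3 allows arbitrary attachments.

```python
def choose_pdfa_level(embed_source_files: bool = False,
                      legacy_validators: bool = False) -> str:
    """Pick a conformance level using the rules of thumb from this section."""
    if embed_source_files:
        # Only PDF/A-3 allows attachments of any type (XML, CSV, CAD).
        return "PDF/A-3b"
    if legacy_validators:
        # Oldest part of the standard, for older validators.
        return "PDF/A-1b"
    # Default recommendation for most archiving needs.
    return "PDF/A-2b"


if __name__ == "__main__":
    print(choose_pdfa_level())
    print(choose_pdfa_level(embed_source_files=True))
```

A real policy would likely add further inputs, such as whether text must be searchable, which would point toward the "u" or "a" levels.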
Key Requirements
Embedded Fonts
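Every font used in the document must be embedded in the file, not just referenced by name. This ensures the text renders correctly even if the font is no longer available on the viewing system. Subset embedding is permitted as long as the subset contains every glyph the document uses. Fully embedding a font family typically adds 200-500 KB to the file; subsetting reduces that overhead.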
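To make the embedding check concrete, here is a minimal sketch that inspects one font dictionary. It assumes the dictionary has already been extracted by a PDF library and converted to a plain Python dict with keys written without the leading slash; `font_status` is a hypothetical helper name. It relies on two PDF conventions: an embedded font program sits under `FontFile`, `FontFile2`, or `FontFile3` in the font descriptor, and subset fonts carry a six-letter uppercase prefix followed by `+` in their name. Embedding and subset status are reported separately so either can feed into your own policy.

```python
import re

# Keys under which a font descriptor stores an embedded font program.
FONT_FILE_KEYS = ("FontFile", "FontFile2", "FontFile3")
# Subset fonts are named with six uppercase letters and a plus sign.
SUBSET_PREFIX = re.compile(r"^[A-Z]{6}\+")


def font_status(font: dict) -> dict:
    """Summarize one font dictionary given as a plain Python dict.

    Type3 fonts, which define glyphs in content streams, are not handled.
    """
    # Composite (Type0) fonts keep the descriptor on the descendant font.
    if font.get("Subtype") == "Type0":
        holder = font["DescendantFonts"][0]
    else:
        holder = font
    descriptor = holder.get("FontDescriptor", {})
    name = font.get("BaseFont", "")
    return {
        "name": name,
        "embedded": any(key in descriptor for key in FONT_FILE_KEYS),
        "subset": bool(SUBSET_PREFIX.match(name)),
    }


if __name__ == "__main__":
    print(font_status({"Subtype": "Type1", "BaseFont": "Helvetica"}))
```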
No External Dependencies
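PDF/A files cannot depend on external content for rendering, and they cannot contain multimedia that requires external codecs, JavaScript, or executable content. Every resource needed to display the document must be contained within the file itself. Ordinary hyperlinks are still allowed, because the document remains readable even if the link target disappears.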
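One part of this rule that is easy to check mechanically is the set of action types a document uses. The sketch below assumes the action type names have already been collected from the file by a PDF library, and `find_forbidden_actions` is a hypothetical helper. The list reflects the action types excluded by PDF/A-1 as I understand them; later parts extend it, so verify against the part you target.

```python
# Action types that PDF/A-1 does not permit; later parts extend this list.
FORBIDDEN_ACTIONS = frozenset(
    {"Launch", "Sound", "Movie", "ResetForm", "ImportData", "JavaScript"}
)


def find_forbidden_actions(action_types):
    """Return the forbidden action types found, sorted and de-duplicated."""
    return sorted(set(action_types) & FORBIDDEN_ACTIONS)


if __name__ == "__main__":
    # Internal navigation is fine; scripts and program launches are not.
    print(find_forbidden_actions(["GoTo", "JavaScript", "Launch", "JavaScript"]))
```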
Color Management
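All colors must be reproducible in a device-independent way. Each color is either defined in a device-independent color space, such as one based on an embedded ICC profile, or covered by an output intent whose ICC profile is embedded in the file. RGB and CMYK colors without a matching profile are not permitted, which ensures colors render consistently regardless of the viewing device.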
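The profile requirement can be sketched as a lookup. The example assumes the rule as I read it in ISO 19005-1: `DeviceRGB` needs an RGB output intent, `DeviceCMYK` needs a CMYK output intent, and `DeviceGray` is acceptable with any output intent. The function name and the string labels for the output intent are hypothetical, and the color space names are assumed to have been gathered by a PDF library beforehand.

```python
# Output intent profile type required by each device color space.
# None means any output intent is enough (assumed rule for DeviceGray).
REQUIRED_INTENT = {"DeviceRGB": "RGB", "DeviceCMYK": "CMYK", "DeviceGray": None}


def uncovered_color_spaces(used_spaces, output_intent=None):
    """Return device color spaces that the output intent does not cover.

    used_spaces: color space names found in the document.
    output_intent: "RGB", "CMYK", "GRAY", or None if no profile is embedded.
    """
    problems = []
    for space in sorted(set(used_spaces)):
        if space not in REQUIRED_INTENT:
            # Not a device space; this sketch does not inspect it further.
            continue
        needed = REQUIRED_INTENT[space]
        if output_intent is None or (needed is not None and needed != output_intent):
            problems.append(space)
    return problems


if __name__ == "__main__":
    print(uncovered_color_spaces(["DeviceRGB", "ICCBased"]))
```

Spaces such as Separation or Indexed depend on their underlying alternate or base space, which a complete check would also have to follow.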
Metadata
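PDF/A requires an embedded XMP metadata packet. The packet identifies the PDF/A part and conformance level the file claims, and it typically also records the title, creator, creation date, and modification date. For PDF/A-1, any of these properties that also appear in the document's Info dictionary must carry equivalent values in both places.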
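As an illustration of what lives in the XMP packet, the sketch below reads the PDF/A identification properties, `pdfaid:part` and `pdfaid:conformance`, which a conforming file declares in its metadata. It assumes the packet has already been extracted from the PDF as text; `read_pdfa_claim` is a hypothetical helper, and the sample packet is invented for the example.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
PDFAID_NS = "http://www.aiim.org/pdfa/ns/id/"


def read_pdfa_claim(xmp_packet):
    """Return (part, conformance) declared in an XMP packet, or None."""
    root = ET.fromstring(xmp_packet)
    for desc in root.iter(f"{{{RDF_NS}}}Description"):
        values = {}
        for field in ("part", "conformance"):
            key = f"{{{PDFAID_NS}}}{field}"
            # XMP allows simple properties as attributes or child elements.
            child = desc.find(key)
            values[field] = desc.get(key) or (child.text if child is not None else None)
        if values["part"]:
            return values["part"], values["conformance"]
    return None


if __name__ == "__main__":
    sample = (
        '<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">'
        '<rdf:Description rdf:about="" '
        'xmlns:pdfaid="http://www.aiim.org/pdfa/ns/id/" '
        'pdfaid:part="2" pdfaid:conformance="B"/>'
        "</rdf:RDF>"
    )
    print(read_pdfa_claim(sample))
```

Note that this only reads what the file claims about itself. Whether the file actually meets that level is a question for a validator.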
Validation
After creating a PDF/A file, always validate it against the target standard. Common issues that cause validation failures include: missing embedded fonts, RGB colors without ICC profiles, and JavaScript remnants from form fields.
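Validation is usually automated by calling an external validator from a script. The sketch below assumes the open-source veraPDF command-line tool is installed and on the PATH, and that its `--flavour` option selects the target level (for example `2b`); check the tool's own help output for the options and exit codes in your version. Both helper names are hypothetical.

```python
import shutil
import subprocess


def build_validator_command(pdf_path, flavour="2b", executable="verapdf"):
    """Build the argument list for one validation run (assumed veraPDF CLI)."""
    return [executable, "--flavour", flavour, str(pdf_path)]


def run_validator(pdf_path, flavour="2b"):
    """Run the validator and return (exit_code, report_text)."""
    if shutil.which("verapdf") is None:
        raise FileNotFoundError("verapdf was not found on PATH")
    completed = subprocess.run(
        build_validator_command(pdf_path, flavour),
        capture_output=True,
        text=True,
        check=False,  # a failed validation should not raise here
    )
    return completed.returncode, completed.stdout


if __name__ == "__main__":
    print(build_validator_command("archive/report.pdf"))
```

The report text is returned unparsed on purpose, because the output format differs between validator versions and configurations.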