An Introduction to PDF Tags

What PDF tags are, how they define structure, and why they are essential for accessibility

PDF tags are the foundation of accessible PDF documents. Without tags, assistive technologies cannot reliably understand the structure, order, or meaning of PDF content.

This article explains what PDF tags are, how they work, and why tagging is required for accessible PDFs.


What are PDF tags

PDF tags are structural markers embedded in a PDF file. They define how content is organized and how different elements relate to one another.

Tags tell assistive technologies:

  • What content is a heading

  • What content is a paragraph

  • Which items form a list

  • How tables are structured

  • What order content should be read in

A PDF without tags may look correct visually but provides little usable information to screen reader users.


What does a tagged PDF mean

A tagged PDF is a PDF that includes a complete tag structure describing the document’s logical organization.

In a tagged PDF:

  • Headings are identified as headings

  • Lists are identified as lists

  • Tables include row and column relationships

  • Decorative content can be hidden from assistive technologies

  • Reading order follows a logical sequence

Tagging allows screen readers to navigate a document efficiently rather than reading it as a flat stream of text.


Why PDF tags are required for accessibility

Accessibility standards require that content be programmatically determinable. In PDFs, this requirement is met through tagging.

Without proper tags:

  • Screen readers cannot identify headings or sections

  • Reading order may be incorrect

  • Lists and tables may be announced improperly

  • Navigation becomes slow and confusing

This is why untagged or poorly tagged PDFs consistently fail accessibility audits.


Common PDF tagging problems

Even tagged PDFs frequently contain tagging errors.

Common problems include:

  • Missing tags altogether

  • Incorrect heading hierarchy

  • Content tagged visually but not semantically

  • Reading order that does not match visual order

  • Decorative elements exposed to screen readers

  • Tables without proper header associations

These issues are often invisible to sighted reviewers but create significant barriers for assistive technology users.


How PDF tags are created

PDF tags are usually created during document authoring or PDF conversion.

Tags may originate from:

  • Structured Word or PowerPoint documents

  • Authoring tools that preserve semantic structure

  • Automated tagging tools in PDF editors

If the source document lacks proper structure, tagging tools cannot reliably create correct tags. Manual remediation is often required to fix or rebuild the tag tree.


Why tagging alone is not enough

While tagging is essential, it does not guarantee accessibility on its own.

Accessible PDFs also require:

  • Correct metadata

  • Defined document language

  • Meaningful alternative text

  • Logical reading order

  • Usable tables and forms

A PDF can be tagged and still fail accessibility standards if these other requirements are not met.


Conclusion

PDF tags provide the structural foundation that makes PDFs accessible. Without tags, assistive technologies cannot reliably interpret or navigate document content.

Organizations that publish PDFs for public, customer-facing, or regulated use should ensure that documents are properly tagged and reviewed as part of an accessibility process.


Accessibility Testing Note

Tagging issues are commonly identified during accessibility testing, particularly when PDFs are evaluated against WCAG and PDF/UA requirements.