Word-generated tagged PDFs do not have correct structure for paragraphs split across page boundaries

  • Question

  • My apologies for the cross-post.  I originally asked this in the Word for Developers forum, but was redirected here.

    We're post-processing PDFs generated by Office 2007 and up.  With the "Document structure tags for accessibility" option checked, it does a pretty good job of generating a description of the document structure in the PDF structure tree.

    However, if a paragraph splits across pages, the structure tree records this as two separate paragraphs with apparently nothing to tie them together.  Adobe's Acrobat plug-in for Word (version 9, the latest I have), by contrast, uses something called a Marked Content Reference to tie the second part of the paragraph to the first (see http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/pdf/pdfs/pdf_reference_archives/PDFReference15_v5.pdf, page 732).
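
    To make the difference concrete, here is a rough Python sketch of the two shapes described above. It uses plain dicts that mirror the /S, /Pg and /K keys of a PDF structure element; it does not read a real PDF, and the page labels, MCID values and helper names (content_pages, split_paragraphs) are made up for illustration rather than taken from actual Word or Acrobat output.

```python
def content_pages(elem):
    """Return the sorted page labels that a structure element's kids point at."""
    pages = set()
    for kid in elem["K"]:
        if isinstance(kid, int):
            # A bare integer is an MCID on the element's own /Pg page.
            pages.add(elem["Pg"])
        elif kid.get("Type") == "MCR":
            # A marked-content reference may name a different page via /Pg.
            pages.add(kid.get("Pg", elem["Pg"]))
    return sorted(pages)


def split_paragraphs(elems):
    """Return the P elements whose content spans more than one page."""
    return [e for e in elems if e["S"] == "P" and len(content_pages(e)) > 1]


# Assumed shape of the Word export: two unrelated P elements, one per page.
word_style = [
    {"S": "P", "Pg": "page1", "K": [7]},
    {"S": "P", "Pg": "page2", "K": [0]},
]

# Assumed shape of the Acrobat plug-in export: one P element whose second
# kid is a marked-content reference pointing at content on the next page.
acrobat_style = [
    {"S": "P", "Pg": "page1",
     "K": [7, {"Type": "MCR", "Pg": "page2", "MCID": 0}]},
]

print(len(split_paragraphs(word_style)))
print(len(split_paragraphs(acrobat_style)))
```

    In the first shape nothing in the tree says the two P elements belong together, which is the problem we see when post-processing.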

    Does anyone know if there is a way (either through the UI or programmatically) to generate the correct PDF tagging for split paragraphs in Word?

    Thursday, July 11, 2013 1:35 PM

All replies

  • Hi,

    I don't fully understand the issue, and I cannot see the "tagging for paragraphs". Could you share some pictures here to give a better understanding?

    Also, does this issue occur with all Word documents?


    Jaynet Zhang
    TechNet Community Support

    Friday, July 12, 2013 2:40 AM