UiPath Set Reading Order to Inferred for Adobe |Video upload date:  · Duration: PT4M19S  · Language: EN

Quick tutorial on setting Adobe reading order to Inferred so UiPath extracts PDF text more reliably for automation and document understanding

If your UiPath automation treats PDF text like a scrambled treasure map you are not alone. Telling Adobe Acrobat Pro to use Inferred reading order often fixes the chaos and makes PDF extraction and Document Understanding behave like grown ups. This guide walks you through the practical steps and the bits where humans need to intervene.

Why inferred reading order matters for UiPath

PDFs are visual files first and logical stories second. When text floats in columns or images sit oddly on the page UiPath and accessibility tools can read things out of order. Choosing Inferred reading order tells Acrobat to guess the logical flow based on layout and tags which improves OCR and downstream parsing for automation and accessibility.

What you need before you start

  • Adobe Acrobat Pro. Acrobat Reader will not cut it for authoring features.
  • A backup copy of the original PDF. You will want the original if things go sideways.
  • UiPath Studio or your automation runner and a small sample set of pages to test with.

Step by step to set Inferred reading order in Adobe Acrobat Pro

  1. Open the PDF in Adobe Acrobat Pro so the Accessibility tools are available.
  2. Go to Tools then Accessibility and open the Reading Order panel. The panel shows visual blocks and a menu to pick how the logical flow is determined.
  3. Select the Inferred option and apply it to the document or to specific pages that look broken. Acrobat will attempt to reconstruct the logical sequence from layout, tags and object positions.
  4. Save the file using Save or Save As so the reading order changes are embedded. If you skip saving UiPath will keep reading the old structure and you will blame automation instead of the file.

How to test in UiPath

In UiPath use Read PDF Text for searchable PDFs. Use Read PDF With OCR for scanned images and pick an OCR engine that matches your language and font style. Point the activity to the saved PDF and test extraction on a few representative pages before you push to production.

Applying inferred reading order often corrects misplaced fragments and broken line sequences that break parsers and Document Understanding pipelines. A quick check will tell you if the change fixed things or if manual tagging is required.

Troubleshooting and practical tips

  • If the PDF still reads poorly run the Make Accessible action in Acrobat to add tags then reapply Inferred reading order. More tags help the guesser do a better job.
  • For scanned pages use a higher DPI when running OCR and pick a reliable OCR engine. That reduces gibberish and fused words.
  • Complex layouts with lots of floating objects or tables may still need manual tagging. Yes this is boring. Yes it sometimes matters.
  • Test several pages not just page one. Files can be inconsistent and one golden page does not mean the whole book is readable.
  • Keep UiPath PDF and OCR packages up to date so you do not fight fixed bugs from three versions ago.

Quick checklist before you roll to production

  • Saved PDF contains the new reading order
  • UiPath activity points to the saved file
  • Extraction validated on representative pages
  • Fallback plan exists for manual tagging if automation fails

Follow these steps and your UiPath workflows will get far fewer weird line breaks and orphan words. If nothing improves then you get to enjoy the thrilling world of manual accessibility tagging which builds character and mild resentment.

I know how you can get Azure Certified, Google Cloud Certified and AWS Certified. It's a cool certification exam simulator site called certificationexams.pro. Check it out, and tell them Cameron sent ya!

This is a dedicated watch page for a single video.