Популярное

Музыка Кино и Анимация Автомобили Животные Спорт Путешествия Игры Юмор

Интересные видео

2025 Сериалы Трейлеры Новости Как сделать Видеоуроки Diy своими руками

Топ запросов

смотреть а4 schoolboy runaway турецкий сериал смотреть мультфильмы эдисон
dTub
Скачать

UiPath Document Understanding: Extract Tables Out of PDFs

Автор: Anders Jensen

Загружено: 2020-12-09

Просмотров: 47159

Описание:

This full video tutorial shows, how to extract table data out of PDFs with the Document Understanding Package from UiPath to Excel. With the help of a template we are able to extract data from PDFs with our UiPath Workflows. Here we have a structured PDF document, as it is time sheet forms.

You could also watch:
🔵 UiPath Document Understanding - Invoice Data Extraction -    • UiPath Document Understanding - Invoice Da...  
🔵 How to extract data from PDF's with RegEx -    • How to extract data from PDF's with RegEx ...  

0:00 Use case presentation
We have a folder with time sheets PDFs and we want to extract data out of each of them with the Document Understanding package from UiPath and into Excel. The data is structured, meaning that the layout is fixed across all the documents. We will use a template-based approach to determine, which data to read and collect.

1:41 Install packages
We install three packages (1) UiPath.DocumentUnderstanding.ML.Activities, (2) UiPath.IntelligentOCR.Activities and (3) UiPath.OmniPage.Activities to enable the activities, we will use in this use case.

2:25 Load Taxonomy
We use the Load Taxonomy activity to define and set the files/data for the extraction. You can later edit it, if you miss something, so don't worry. We will create a group and category, where our Time Sheet case will go in. We extract a single field of text (employee number) and a table of the time registrations.

5:17 Digitize Document
We now digitize the text and the location and will have output of a string (the text itself) and a Document Object Model (information and properties about the text). You can use all the OCR Engine, but I prefer the OmniPage.

6:53 Data Extraction Scope
Based on our rules we can very easily extract the PDF data. I recommend you install Notepad++ (simply just Google and download). We use the program to edit our taxonomy JSON. We need to copy the DocumentTypeID from the JSON to use in our workflow. Because we have structured/form data, we can use a Form Extractor activity. Use the default End Point and then go to your UiPath Automation Cloud and get the Api Key (it's free). Afterwards we create a template, where we define how our looks like and then specify what data, we want to extract. In Configure Extractors just pick everything.

13:23 Export Extraction Results
We take our extraction results and output it into a DataSet.

13:50 Understand the output data
With a For Each, an Output DataTable and a Write Line we can take a look at the data. Remember to use the Tables property to our DataSet. We now have two DataTables with our data, that we can work with.

16:02 Build DataTable for output data
We create a DataTable with just one column (Employee Number), which we can later merge with our time sheet table data. Besides the column header there is nothing in it.

17:08 Merge Data Table
We merge the newly created Data Table with the Employee number header with the Data Table with the extraction results (the time sheets table).

17:54 Iterate through our output and add data
Using a For Each Row we iterate through our extracted data and then add our employee number as a string.

19:02 Write the extracted to Excel
Use the Write Range activity to write the data to Excel. Remember to Add Headers.

20:26 Extract multiple PDF files
We expand our solution to solve for the case, where we have more than 1 PDF file. We use a For Each and the Directory.GetFiles method. Remember to change the TypeArgument to String. Drag our activities in and change the strDocumentPath to item. Furthermore we need a final DataTable, that is completely empty, where we add data for each of our iterations.

💼 Get the files from the video: https://andersjensen.org/uipath-docum...

Connect with me:
🔔 Subscribe - http://www.youtube.com/user/klogeande...
💼 LinkedIn -   / andersjensens  
👥 Facebook -   / andersjensenorg  
💌 Email Newsletter - https://andersjensen.org/email-newsle...

#uipath #rpa #documentunderstanding

UiPath Document Understanding: Extract Tables Out of PDFs

Поделиться в:

Доступные форматы для скачивания:

Скачать видео mp4

  • Информация по загрузке:

Скачать аудио mp3

Похожие видео

How to do API calls in UiPath | Full Tutorial

How to do API calls in UiPath | Full Tutorial

How to Compare Excel Sheets in UiPath (Full Tutorial)

How to Compare Excel Sheets in UiPath (Full Tutorial)

GenAI Skills for 20 LPA+ Jobs | Big Tech AI Careers Explained (Live Masterclass)

GenAI Skills for 20 LPA+ Jobs | Big Tech AI Careers Explained (Live Masterclass)

UiPath Document Understanding Tutorial - Learn AI Center, ML Extractor, Generative Predict & more...

UiPath Document Understanding Tutorial - Learn AI Center, ML Extractor, Generative Predict & more...

UiPath REFramework - The Basics for Beginners (Full Use Case)

UiPath REFramework - The Basics for Beginners (Full Use Case)

UiPath Json Array | Deserialize Json Array UiPath | UiPath Json deserialize | UiPath Tutorial

UiPath Json Array | Deserialize Json Array UiPath | UiPath Json deserialize | UiPath Tutorial

Учебное пособие по Power BI для начинающих | Создайте свою первую панель мониторинга прямо сейчас...

Учебное пособие по Power BI для начинающих | Создайте свою первую панель мониторинга прямо сейчас...

Понимание Active Directory и групповой политики

Понимание Active Directory и групповой политики

UiPath Tutorial For Beginners | RPA Tutorial For Beginners | UiPath Training Online | Edureka

UiPath Tutorial For Beginners | RPA Tutorial For Beginners | UiPath Training Online | Edureka

РЕВОЛЮЦИОННО!!!!! Понимание документов UiPath и генеративный ИИ — извлечение данных из счетов

РЕВОЛЮЦИОННО!!!!! Понимание документов UiPath и генеративный ИИ — извлечение данных из счетов

Accurate Dispatcher and Performer Sample Project in UiPath

Accurate Dispatcher and Performer Sample Project in UiPath

UiPath PDF Data Extraction | OCR Data Extraction | UiPath Tutorial | RPA Training | Edureka

UiPath PDF Data Extraction | OCR Data Extraction | UiPath Tutorial | RPA Training | Edureka

Extract multi page PDF data to Excel with python PDF Plumber library!

Extract multi page PDF data to Excel with python PDF Plumber library!

How to extract data from PDF's with RegEx in UiPath - Full Tutorial

How to extract data from PDF's with RegEx in UiPath - Full Tutorial

Training UiPath Document Understanding ML Models - Data Manager - Part 1 | RPA

Training UiPath Document Understanding ML Models - Data Manager - Part 1 | RPA

UiPath: How to Extract Outlook Emails with Regex (Full Use Case)

UiPath: How to Extract Outlook Emails with Regex (Full Use Case)

UiPath Tutorial 25- UiPath ReFramework with Excel |UiPath Real Time Project

UiPath Tutorial 25- UiPath ReFramework with Excel |UiPath Real Time Project

UiPath | How to automate Excel and work with Data Tables | Tutorial

UiPath | How to automate Excel and work with Data Tables | Tutorial

Document Understanding with UiPath's Intelligent OCR - Full Tutorial

Document Understanding with UiPath's Intelligent OCR - Full Tutorial

Extract PDF Tables with UiPath: A Step-by-Step Tutorial

Extract PDF Tables with UiPath: A Step-by-Step Tutorial

© 2025 dtub. Все права защищены.



  • Контакты
  • О нас
  • Политика конфиденциальности



Контакты для правообладателей: [email protected]