Популярное

Музыка Кино и Анимация Автомобили Животные Спорт Путешествия Игры Юмор

Интересные видео

2025 Сериалы Трейлеры Новости Как сделать Видеоуроки Diy своими руками

Топ запросов

смотреть а4 schoolboy runaway турецкий сериал смотреть мультфильмы эдисон
dTub
Скачать

UiPath Document Understanding: Extract Tables Out of PDFs

Автор: Anders Jensen

Загружено: 2020-12-09

Просмотров: 47159

Описание:

This full video tutorial shows, how to extract table data out of PDFs with the Document Understanding Package from UiPath to Excel. With the help of a template we are able to extract data from PDFs with our UiPath Workflows. Here we have a structured PDF document, as it is time sheet forms.

You could also watch:
🔵 UiPath Document Understanding - Invoice Data Extraction -    • UiPath Document Understanding - Invoice Da...  
🔵 How to extract data from PDF's with RegEx -    • How to extract data from PDF's with RegEx ...  

0:00 Use case presentation
We have a folder with time sheets PDFs and we want to extract data out of each of them with the Document Understanding package from UiPath and into Excel. The data is structured, meaning that the layout is fixed across all the documents. We will use a template-based approach to determine, which data to read and collect.

1:41 Install packages
We install three packages (1) UiPath.DocumentUnderstanding.ML.Activities, (2) UiPath.IntelligentOCR.Activities and (3) UiPath.OmniPage.Activities to enable the activities, we will use in this use case.

2:25 Load Taxonomy
We use the Load Taxonomy activity to define and set the files/data for the extraction. You can later edit it, if you miss something, so don't worry. We will create a group and category, where our Time Sheet case will go in. We extract a single field of text (employee number) and a table of the time registrations.

5:17 Digitize Document
We now digitize the text and the location and will have output of a string (the text itself) and a Document Object Model (information and properties about the text). You can use all the OCR Engine, but I prefer the OmniPage.

6:53 Data Extraction Scope
Based on our rules we can very easily extract the PDF data. I recommend you install Notepad++ (simply just Google and download). We use the program to edit our taxonomy JSON. We need to copy the DocumentTypeID from the JSON to use in our workflow. Because we have structured/form data, we can use a Form Extractor activity. Use the default End Point and then go to your UiPath Automation Cloud and get the Api Key (it's free). Afterwards we create a template, where we define how our looks like and then specify what data, we want to extract. In Configure Extractors just pick everything.

13:23 Export Extraction Results
We take our extraction results and output it into a DataSet.

13:50 Understand the output data
With a For Each, an Output DataTable and a Write Line we can take a look at the data. Remember to use the Tables property to our DataSet. We now have two DataTables with our data, that we can work with.

16:02 Build DataTable for output data
We create a DataTable with just one column (Employee Number), which we can later merge with our time sheet table data. Besides the column header there is nothing in it.

17:08 Merge Data Table
We merge the newly created Data Table with the Employee number header with the Data Table with the extraction results (the time sheets table).

17:54 Iterate through our output and add data
Using a For Each Row we iterate through our extracted data and then add our employee number as a string.

19:02 Write the extracted to Excel
Use the Write Range activity to write the data to Excel. Remember to Add Headers.

20:26 Extract multiple PDF files
We expand our solution to solve for the case, where we have more than 1 PDF file. We use a For Each and the Directory.GetFiles method. Remember to change the TypeArgument to String. Drag our activities in and change the strDocumentPath to item. Furthermore we need a final DataTable, that is completely empty, where we add data for each of our iterations.

💼 Get the files from the video: https://andersjensen.org/uipath-docum...

Connect with me:
🔔 Subscribe - http://www.youtube.com/user/klogeande...
💼 LinkedIn -   / andersjensens  
👥 Facebook -   / andersjensenorg  
💌 Email Newsletter - https://andersjensen.org/email-newsle...

#uipath #rpa #documentunderstanding

UiPath Document Understanding: Extract Tables Out of PDFs

Поделиться в:

Доступные форматы для скачивания:

Скачать видео mp4

  • Информация по загрузке:

Скачать аудио mp3

Похожие видео

How to do API calls in UiPath | Full Tutorial

How to do API calls in UiPath | Full Tutorial

Как разделить PDF-файлы с динамическими диапазонами в UiPath

Как разделить PDF-файлы с динамическими диапазонами в UiPath

UiPath REFramework Beginners Tutorial

UiPath REFramework Beginners Tutorial

КЛАССИЧЕСКАЯ МУЗЫКА ДЛЯ ВОССТАНОВЛЕНИЯ НЕРВНОЙ СИСТЕМЫ🌿 Нежная музыка успокаивает нервную систему 22

КЛАССИЧЕСКАЯ МУЗЫКА ДЛЯ ВОССТАНОВЛЕНИЯ НЕРВНОЙ СИСТЕМЫ🌿 Нежная музыка успокаивает нервную систему 22

Учебник по Excel за 15 минут

Учебник по Excel за 15 минут

Training UiPath Document Understanding ML Models - Data Manager - Part 1 | RPA

Training UiPath Document Understanding ML Models - Data Manager - Part 1 | RPA

How to do Excel VLOOKUPs in UiPath RPA (Full Tutorial)

How to do Excel VLOOKUPs in UiPath RPA (Full Tutorial)

Понимание Active Directory и групповой политики

Понимание Active Directory и групповой политики

How to Split PDF files into Multiple Files with Uipath

How to Split PDF files into Multiple Files with Uipath

Миллиарды на ветер: Су-57 - главный авиационный миф России

Миллиарды на ветер: Су-57 - главный авиационный миф России

UiPath Json Array | Deserialize Json Array UiPath | UiPath Json deserialize | UiPath Tutorial

UiPath Json Array | Deserialize Json Array UiPath | UiPath Json deserialize | UiPath Tutorial

UiPath Document Understanding - Full Tutorial

UiPath Document Understanding - Full Tutorial

4 Hours Chopin for Studying, Concentration & Relaxation

4 Hours Chopin for Studying, Concentration & Relaxation

Как извлечь данные из PDF-файла с помощью Power Automate

Как извлечь данные из PDF-файла с помощью Power Automate

UiPath | How to automate Excel and work with Data Tables | Tutorial

UiPath | How to automate Excel and work with Data Tables | Tutorial

Get Started with PDF Automation in UiPath | Beginners

Get Started with PDF Automation in UiPath | Beginners

Document Understanding with UiPath's Intelligent OCR - Full Tutorial

Document Understanding with UiPath's Intelligent OCR - Full Tutorial

UiPath Tutorial For Beginners | RPA Tutorial For Beginners | UiPath Training Online | Edureka

UiPath Tutorial For Beginners | RPA Tutorial For Beginners | UiPath Training Online | Edureka

Набор инструментов KoBo для начинающих

Набор инструментов KoBo для начинающих

Power Query для начинающих: преобразование данных Excel за считанные минуты (учебное пособие 2025...

Power Query для начинающих: преобразование данных Excel за считанные минуты (учебное пособие 2025...

© 2025 dtub. Все права защищены.



  • Контакты
  • О нас
  • Политика конфиденциальности



Контакты для правообладателей: [email protected]