TF-IDF | Introduction to Text Analytics with R | Part 5
Автор: Data Science Dojo
Загружено: 2017-07-03
Просмотров: 42515
This talk provides an overview of TF-IDF and includes:
1. Discussion of how the document-term frequency matrix representation can be improved:
– How to deal with documents of unequal lengths.
– What to do about terms that are very common across documents.
2. Introduction of the mighty term frequency-inverse document frequency (TF-IDF) to implement these improvements:
-TF for dealing with documents of unequal lengths.
-IDF for dealing with terms that appear frequently across documents.
3. Implementation of TF-IDF using R functions and applying TF-IDF to document-term frequency matrices.
4. Data cleaning of matrices post TF-IDF weighting/transformation.
The data and R code used in this series is available here:
https://code.datasciencedojo.com/data...
Table of contents:
0:00 Introduction
7:08 Term Frequency
10:12 Inverse document frequency
12:55 TDIDF
13:21 Setting up the environment
17:42 Combining the functions
19:03 Transform
21:27 Calculate
25:52 Transpose
27:04 Testing
--
At Data Science Dojo, we believe data science is for everyone. Our data science trainings have been attended by more than 10,000 employees from over 2,500 companies globally, including many leaders in tech like Microsoft, Google, and Facebook. For more information please visit: https://hubs.la/Q01Z-13k0
💼 Learn to build LLM-powered apps in just 40 hours with our Large Language Models bootcamp: https://hubs.la/Q01ZZGL-0
💼 Get started in the world of data with our top-rated data science bootcamp: https://hubs.la/Q01ZZDpt0
💼 Master Python for data science, analytics, machine learning, and data engineering: https://hubs.la/Q01ZZD-s0
💼 Explore, analyze, and visualize your data with Power BI desktop: https://hubs.la/Q01ZZF8B0
--
Unleash your data science potential for FREE! Dive into our tutorials, events & courses today!
📚 Learn the essentials of data science and analytics with our data science tutorials: https://hubs.la/Q01ZZJJK0
📚 Stay ahead of the curve with the latest data science content, subscribe to our newsletter now: https://hubs.la/Q01ZZBy10
📚 Connect with other data scientists and AI professionals at our community events: https://hubs.la/Q01ZZLd80
📚 Checkout our free data science courses: https://hubs.la/Q01ZZMcm0
📚 Get your daily dose of data science with our trending blogs: https://hubs.la/Q01ZZMWl0
--
📱 Social media links
Connect with us: / data-science-dojo
Follow us: / datasciencedojo
Keep up with us: / data_science_dojo
Like us: / datasciencedojo
Find us: https://www.threads.net/@data_science...
--
Also, join our communities:
LinkedIn: / 13601597
Twitter: / 1677363761399865344
Facebook: / aiandmachinelearningforeveryone
Vimeo: https://vimeo.com/datasciencedojo
Discord: / discord
_
Want to share your data science knowledge? Boost your profile and share your knowledge with our community: https://hubs.la/Q01ZZNCn0
#rprogramming #textanalytics
Доступные форматы для скачивания:
Скачать видео mp4
-
Информация по загрузке: