Faster DataFusion with StringView - Xiangpeng Hao (Aug 15, 2024)
Автор: Andrew Lamb
Загружено: 2024-08-16
Просмотров: 646
Xiangpeng Hao summarizes what Apache Arrow StringView is, why it can improve performance, and the practical challenges overcome when realizing the potential.
Xiangpeng Hao presents his 2024 Summer Intern project at @influxdata8893: improving performance in Apache DataFusion, the query engine used in InfluxDB 3.0.
Talk Abstract: We implemented a new string representation—StringView—in the Rust implementation of Apache Arrow, arrow-rs and integrated it into Apache DataFusion, significantly accelerating string-intensive queries in the ClickBench benchmark by 20%- 200%.
@influxdata8893
Доступные форматы для скачивания:
Скачать видео mp4
-
Информация по загрузке: