Skip to content

v2.18.0

Latest
Compare
Choose a tag to compare
@mjakubowski84 mjakubowski84 released this 19 May 18:35

This release introduces two significant changes:

  1. Improved internals responsible for reading content and statistics of Parquet files. The difference is especially noticeable in the case of Stats: it is faster and now you can also query for min and max of partition fields.

  2. Upgrades Parquet to 1.14.0. The biggest improvement is support for Hadoop's vectored IO, which you can optionally enable in ParquetReader.Options. It can significantly improve the performance of reading huge files.