Release 0.129
There is a performance regression in this release for and JOIN
queries when the length of the keys is between 16 and 31 bytes. This is fixed in Release 0.130.
Fix a planner issue that could cause queries involving
OUTER JOIN
to return incorrect results.Some queries, particularly those using or min_by(), now accurately reflect their true memory usage and thus appear to use more memory than before.
Fix hang in large queries with
ORDER BY
andLIMIT
.Fix an issue when casting empty arrays or arrays containing only
NULL
to other types.Minor UI improvements for query detail page.
Do not display useless stack traces for expected exceptions in verifier.
Improve performance of queries involving that write data.
Introduce the
P4HyperLogLog
type, which uses an implementation of the HyperLogLog data structure that trades off accuracy and memory requirements when handling small sets for an improvement in performance.
The driver now properly supports non-query statements. The
Statement
interface supports all variants of theexecute
methods. It also supports thegetUpdateCount
andgetLargeUpdateCount
methods.
Make client request timeout configurable.
The scheduler can now be configured to take network topology into account when scheduling splits. This is set using the node-scheduler.network-topology
config. See for more information.
The S3 region is no longer automatically configured when running in EC2. To enable this feature, use
hive.s3.pin-client-to-current-region=true
in your Hive catalog properties file. Enabling this feature is required to access S3 data in the China isolated region, but prevents accessing data outside the current region.Server-side encryption is now supported for S3. To enable this feature, use
hive.s3.sse.enabled=true
in your Hive catalog properties file.Add support for the
retention_days
table property.Add support for S3
EncryptionMaterialsProvider
.