Release 0.192#
General#
Fix performance regression in split scheduling introduced in 0.191. If a query scans a non-trivial number of splits (~1M splits in an hour), the coordinator CPU utilization can be very high, leading to elevated communication failures.
Fix correctness issue in the
geometry_to_bing_tiles()
function that causes it to return irrelevant tiles when bottom or right side of the bounding box of the geometry is aligned with the tile border.Fix handling of invalid WKT (well-known text) input in geospatial functions.
Fix an issue that can cause long-running queries to hang when writer scaling is enabled.
Fix cast from
REAL
orDOUBLE
toDECIMAL
to conform to the SQL standard. For example, previouslycast (double '100000000000000000000000000000000' as decimal(38))
would return100000000000000005366162204393472
. Now it returns100000000000000000000000000000000
.Fix bug in validation of resource groups that prevented use of the
WEIGHTED_FAIR
policy.Fail queries properly when the coordinator fails to fetch data from workers. Previously, it would return an HTTP 500 error to the client.
Improve memory tracking for queries involving
DISTINCT
orrow_number()
that could cause over-committing memory resources for short time periods.Improve performance for queries involving
grouping()
.Improve buffer utilization calculation for writer scaling.
Remove tracking of per-driver peak memory reservation.
Add
resource-groups.max-refresh-interval
config option to limit the maximum acceptable staleness of resource group configuration.Remove
dictionary-processing-joins-enabled
configuration option anddictionary_processing_join
session property.
Web UI#
Fix incorrect reporting of input size and positions in live plan view.
CLI#
Fix update of prompt after
USE
statement.Fix correctness issue when rendering arrays of Bing tiles that causes the first entry to be repeated multiple times.
Hive#
Fix reading partitioned table statistics from newer Hive metastores.
Do not treat file system errors as corruptions for ORC.
Prevent reads from tables or partitions with
object_not_readable
attribute set.Add support for validating ORC files after they have been written. This behavior can be turned on via the
hive.orc.writer.validate
configuration property.Expose ORC writer statistics via JMX.
Add configuration options to control ORC writer min/max rows per stripe and row group, maximum stripe size, and memory limit for dictionaries.
Allow reading empty ORC files.
Handle ViewFs when checking file system cache expiration.
Improve error reporting when the target table of an insert query is dropped.
Remove retry when creating Hive record reader. This can help queries fail faster.
MySQL#
Remove support for
TIME WITH TIME ZONE
andTIMESTAMP WITH TIME ZONE
types due to MySQL types not being able to store timezone information.Add support for
REAL
type, which maps to MySQL’sFLOAT
type.
PostgreSQL#
Add support for
VARBINARY
type, which maps to PostgreSQL’sBYTEA
type.
MongoDB#
Fix support for pushing down inequality operators for string types.
Add support for reading documents as
MAP
values.Add support for MongoDB’s
Decimal128
type.Treat document and array of documents as
JSON
instead ofVARCHAR
.
JMX#
Allow nulls in history table values.
SPI#
Remove
SliceArrayBlock
class.Add
offset
andlength
parameters toBlock.getPositions()
.