Beryl Chen
09/09/2024, 8:53 PM
Beryl Chen
09/12/2024, 1:30 PM
GitHub
07/02/2025, 2:19 AM
enable_trace_historical_node controls this behavior (Default: false). #57083
• Storage Volume adds native support for Google Cloud Storage (GCS): You can now use GCS as a backend storage volume and manage and access GCS resources through the native SDK. #58815
### Improvements
• Optimized error messages when creating Hive external tables fails. #60076
• Optimized count(1) query performance using the file_record_count in Iceberg metadata. #60022
• Refined the Compaction scheduling logic to avoid delayed scheduling when all subtasks succeed. #59998
• Added JAVA_OPTS="--add-opens=java.base/java.util=ALL-UNNAMED" to BE and CN after upgrading to JDK 17. #59947
• Supports modifying the kafka_broker_list property via the ALTER ROUTINE LOAD command when Kafka Broker endpoints change. #59787
• Supports reducing build dependencies of the Docker base image through parameters. #59772
• Supports accessing Azure using Managed Identity authentication. #59657
• Improved error messages when querying external data via the Files() function with duplicate path column names. #59597
• Optimized LIMIT pushdown logic. #59265
### Bug Fixes
Fixed the following issues:
• Partition pruning issue when queries include Max and Min aggregations and empty partitions. #60162
• Incorrect query results when rewriting queries with materialized views due to missing NULL partitions. #60087
• Refresh errors on Iceberg external tables when using partition expressions based on str2date. #60089
• Incorrect partition range when creating temporary partitions using the START END syntax. #60014
• Incorrect display of Routine Load metrics on non-leader FE nodes. #59985
• BE/CN crashes when executing queries containing COUNT(*) window functions. #60003
• Stream Load failures when the target table name contains Chinese characters. #59722
• Overall loading failures to triple-replica tables when loading to a secondary replica fails. #59762
• Missing parameters in SHOW CREATE VIEW output. #59714
### Behavior Changes
• Some FE metrics include the is_leader label. #59883
StarRocks/starrocks
Mehdi Sidi Boumedine
07/02/2025, 1:03 PM
GitHub
07/04/2025, 8:58 AM
slow_lock_print_stack to prevent process stalls in large clusters when printing thread stacks. #59967
• Reduced unnecessary locks during tablet scheduling. #59744
### Bug Fixes
Fixed the following issues:
• SplitOR fails to prune scan columns. #60223
• Incorrect query plan for null-aware left anti joins. #60119
• Incorrect query results when rewriting queries with materialized views due to missing NULL partitions. #60087
• Partition pruning errors when tables contain empty partitions. #60162
• Refresh errors on Iceberg external tables when using partition expressions based on str2date. #60089
• Unexpected behavior caused by materialized view schema changes. #60079
• Issues related to low-cardinality global dictionaries in UNION operators. #60075
• Incorrect partition ranges for temporary partitions created using the START END syntax. #60014
• Lock issues with SUBMIT TASK. #60026
• Partial updates fail on Primary Key tables under certain conditions. #60052
• Crashes caused by BE failing to create directories due to a lack of permissions to access storage paths. #60028
• Cache failures due to cache key duplication in concurrent scenarios. #60053
• Hive table metadata background refresh failure in Unified Catalog. #55215
• Query failures caused by incorrect return types of CASE WHEN. #59972
• Query failures when Delta Lake tables UNION themselves. #60030
• Partition creation failure when writing to multiple tables within the same transaction. #59954
• Queries could return empty results instead of errors when tablet versions were updated during execution. #53060
• Queries against modified columns in a table return null after upgrading to v3.4. #59941
• Authentication information is printed in logs. #59907
• Metadata refresh failures for external tables in Hive Catalog. #54596
• CACHE SELECT failures for tables after schema changes. #59812
• Broker Load could not recover after FE Leader shifts. #59732
• Stream Load failures when the target table name contains Chinese characters. #59722
• Incorrect query results in external tables due to search key hash collisions (affecting Iceberg/Delta/Paimon). #59781
StarRocks/starrocks
Beryl Chen
07/09/2025, 11:01 PM
GitHub
07/11/2025, 5:54 AM
information_schema.loads view. Users can view the execution details of all INSERT, Broker Load, Stream Load, and Routine Load subtasks in this view. Additional fields have been added to help users better understand the status of loading tasks and the association with parent jobs (PIPES, Routine Load Jobs).
• Support modifying kafka_broker_list via the ALTER ROUTINE LOAD statement.
### Bug Fixes
The following issues have been fixed:
• Under high-frequency loading scenarios, Compaction could be delayed. #59998
• Querying Iceberg external tables via Unified Catalog would throw an error: not support getting unified metadata table factory. #59412
• When using DESC FILES() to view CSV files in remote storage, incorrect results were returned because the system mistakenly inferred xinf as the FLOAT type. #59574
• INSERT INTO could cause BE to crash when encountering empty partitions. #59553
• When StarRocks reads Equality Delete files in Iceberg, it could still access deleted data if the data had already been removed from the Iceberg table. #59709
• Query failures caused by renaming columns. #59178
### Behavior Changes
• The default value of the BE configuration item skip_pk_preload has been changed from false to true. As a result, the system will skip preloading Primary Key Indexes for Primary Key tables to reduce the likelihood of Reached Timeout errors. This change may increase query latency for operations that require loading Primary Key Indexes.
StarRocks/starrocks
Ronit Kapoor
07/18/2025, 5:43 PM
GitHub
07/21/2025, 2:11 AM
enable_predicate_expr_reuse to control predicate pushdown. #60603
• Supports a retry mechanism when fetching Kafka partition information fails. #60513
• Removed the restriction requiring exact mapping of partition columns between materialized views and base tables. #60565
• Supports building Runtime In-Filters to enhance aggregation performance by filtering data during aggregation. #59288
### Bug Fixes
Fixed the following issues:
• COUNT DISTINCT queries crash due to low-cardinality optimization for multiple columns. #60664
• Incorrect matching of global UDFs when multiple functions share the same name. #60550
• Null pointer exception (NPE) issue during Stream Load import. #60755
• Null pointer exception (NPE) issue when starting FE during a recovery from a cluster snapshot. #60604
• BE crash caused by column mode mismatch when processing short-circuit queries with out-of-order values. #60466
• Session variables set via PROPERTIES in SUBMIT TASK statements did not take effect. #60584
• Incorrect results for SELECT min/max queries under specific conditions. #60601
• Incorrect bucket pruning when the left side of a predicate is a function, leading to incorrect query results. #60467
• Crash for queries against a non-existent query_id via Arrow Flight SQL. #60497
### Behavior Changes
• The default value of lake_compaction_allow_partial_success is set to true. Compaction operations can now be marked as successful even if partially completed, preventing blockage of subsequent compaction tasks. #60643
StarRocks/starrocks
Beryl Chen
07/22/2025, 4:00 PM
Beryl Chen
07/24/2025, 1:45 PM
GitHub
07/31/2025, 9:38 AM
cpu_core_used_permille limit in resource groups. #61177
• Conflict between ALTER jobs and partition creation tasks. #61167
• NPE caused by missing globalStateMgr in ConnectContext. #60880
• Partition creation failed when partition names matched case-insensitively but had different values. #60909
• Lock competition caused by synchronous access to partition statistics. #61041
• ANALYZE tasks stuck in pending state after FE restart. #61113
• Issue with JIT (Just-In-Time) compilation in BE. #61060
• Leader address issue in Starmgr. #61016
• CVE vulnerabilities in Broker. #60908
• Actual number of JDBC connections exceeded jdbc_connection_pool_size limit. #61004
• CVE-2022-41404 vulnerability. #59689
• CVEs related to Parquet and HttpClient5. #58750
• Partition not removed from _partition_map when physical partition ID was empty. #60842
• Missing version check in shared-data clusters. #59422
• Transaction log missing when publishing logs in batches in shared-data clusters. #60949
• Concurrent publishing of the same transaction when Batch Publish is enabled in shared-data clusters. #57574
• Statistics overwrite issue caused by lack of semi-synchronous mode. #60897
• Inaccurate maxInstantTime used for filtering Hudi files when retrieving latest merged file slices. #60927
• TaskRun state incompatible with earlier versions. #60438
• CVE-2025-52999 vulnerability. #60795
• Vulnerability caused by log4j-1.2.17-cloudera6 in Broker. #59579
• BE crash when loading OOM partitions. #60778
• Base Compaction tasks blocking other compaction tasks. #60711
• Inefficient handling of error string truncation. #60878
• Materialized view rewrite failed in multi-FE environments. #60841
• INSERT OVERWRITE failed on manually created partitions. #60750
• Issue caused by using random distribution in aggregate keys. #60702
• Crash caused by low cardinality rewrite in multi_distinct_count. #60664
• Issue with Pivot resolving fields. #60748
• Upgraded hudi-common to 1.0.2. #59501
• BE crash when CLONE and DROP TABLE run concurrently. #61359
StarRocks/starrocks
Beryl Chen
08/05/2025, 7:10 PM
GitHub
08/07/2025, 7:58 AM
INSERT INTO FILES, you can now specify the Parquet version via the </StarRocks/starrocks/blob/sql-reference/sql-functions/table-functions/files.md#parquetversion|`parquet.version`> property to improve compatibility with other tools when reading the exported files. #60843
### Bug Fixes
The following issues have been fixed:
• Loading jobs failed due to overly coarse lock granularity in TableMetricsManager. #58911
• Case sensitivity issue in column names when loading Parquet data via FILES(). #61059
• Cache did not take effect after upgrading a shared-data cluster from v3.3 to v3.4 or later. #60973
• A division-by-zero error occurred when the partition ID was null, causing a BE crash. #60842
• Broker Load jobs failed during BE scaling. #60224
### Behavior Changes
• The keyword column in the information_schema.keywords view has been renamed to word to align with the MySQL definition. #60863
StarRocks/starrocks
GitHub
08/11/2025, 7:30 AM
BALANCE type to cluster balance results. #61081
• Optimized materialized view rewrite for external tables. #61037
• Default value of system variable enable_materialized_view_agg_pushdown_rewrite is changed to true, enabling aggregation pushdown for materialized view queries by default. #60976
• Optimized partition statistics lock competition. #61041
### Bug Fixes
The following issues have been fixed:
• Inconsistent Chunk column size after column pruning. #61271
• Synchronous execution of partition statistics loading may cause deadlocks. #61300
• Crash when array_map processes constant array columns. #61309
• Setting an auto-increment column to NULL results in the system mistakenly rejecting valid data within the same Chunk. #61255
• The actual number of JDBC connections may exceed the jdbc_connection_pool_size limit. #61038
• FQDN mode did not use IP addresses as cache map keys. #61203
• Array column cloning error during array comparison. #61036
• Deploying serialized thread pool blockage led to query performance degradation. #61150
• OK hbResponse not synchronized after heartbeat retry counter reset. #61249
• Incorrect result for the hour_from_unixtime function. #61206
• Conflicts between ALTER TABLE jobs and partition creation. #60890
• Cache does not take effect after upgrading from v3.3 to v3.4 or later. #60973
• Vector index metric hit_count is not set. #61102
• Stream Load transactions fail to find the coordinator node. #60154
• BE crashes when loading OOM partitions. #60778
• INSERT OVERWRITE failed on manually created partitions. #60750
• Partition creation failed when partition names matched case-insensitively but had different values. #60909
• The system does not support PostgreSQL UUID type. #61021
• Case sensitivity issue with column names when loading Parquet data via FILES(). #61059
StarRocks/starrocks
Beryl Chen
08/14/2025, 1:45 PM
Beryl Chen
08/15/2025, 6:40 PM
Ronit Kapoor
08/19/2025, 11:41 PM
GitHub
08/22/2025, 11:48 AM
prepared_timeout configuration to Stream Load Transaction Interface. #61539
• Upgraded StarOS to v3.5‑rc3. #61685
### Bug Fixes
The following issues have been fixed:
• Incorrect Dict version of random distribution tables. #61933
• Incorrect query context in context conditions. #61929
• Publish failures caused by synchronous Publish for shadow tablets during ALTER operations. #61887
• CVE‑2025‑55163 issue. #62041
• Memory leak in real-time data ingestion from Apache Kafka. #61698
• Incorrect count of rebuild files in the lake persistent index. #61859
• Statistics collection on generated expression columns causes cross-database query errors. #61829
• Query Cache misaligns in shared-nothing clusters, causing inconsistent results. #61783
• High memory usage in CatalogRecycleBin due to retaining deleted partition information. #61582
• SQL Server JDBC connections fail when the timeout exceeds 65,535 milliseconds. #61719
• Security Integration fails to encrypt passwords, exposing sensitive information. #60666
• MIN() and MAX() functions on Iceberg partition columns return NULL unexpectedly. #61858
• Other predicates of Join containing non‑push‑down subfields were incorrectly rewritten. #61868
• QueryContext cancellation can lead to a use‑after‑free situation. #61897
• CBO’s table pruning overlooks other predicates. #61881
• Partial Updates in COLUMN_UPSERT_MODE may overwrite auto-increment columns with zero. #61341
• JDBC TIME type conversion uses an incorrect timezone offset that leads to wrong time values. #61783
• max_filter_ratio was not being serialized in Routine Load jobs. #61755
• Precision loss in the now(precision) function in Stream Load. #61721
• Cancelling a query may result in a “query id not found” error. #61667
• LDAP authentication may miss PartialResultException, causing incomplete query results. #60667
• Paimon Timestamp timezone conversion issue when the query condition contains DATETIME. #60473
StarRocks/starrocks
wxr xml file
08/25/2025, 6:38 PM
GitHub
08/28/2025, 7:43 AM
LakePersistentIndex initialization failed due to cleanup of _memtable. #62279
• A concurrency issue caused by missing locks when retrieving the maximum Tablet version in the replication transaction manager. #62238
• A hang issue in the phased scheduler, which waited indefinitely during synchronous Profile collection (after the fix, the system correctly terminates Profile collection when scheduling errors occur). #62140
• Exception handling issues in low-cardinality optimization under the ALLOW_THROW_EXCEPTION mode (after the fix, exceptions in expression evaluation are properly caught and returned). #62098
• The system failed to compute nested CTE statistics outside of the memo during table pruning when enable_rbo_table_prune was set to false. #62070
• CVE-2025-55163 issue. #62041
• An issue where split_morsel_queue nested inside partition_morsel_queue failed to correctly receive the Tablet Schema. #62034
• Incorrect handling of NULL arrays during Parquet writes, which could cause data inconsistency or crashes (after the fix, the system ensures the split function can correctly handle NULL input strings). #61999
• Failure when creating materialized views using CASE WHEN expressions due to incompatible return types of VARCHAR (after the fix, the system ensures consistency before and after refresh). #61996
• A concurrency safety issue caused by long operations holding shard-level locks while calculating compression scores. #61899
• An incomplete table pruning issue in CBO caused by pruning logic not considering all relevant predicates. #61881
StarRocks/starrocks
Beryl Chen
08/28/2025, 10:28 PM
Beryl Chen
08/28/2025, 10:36 PM
GitHub
09/01/2025, 11:20 AM
max_filter_ratio is not persisted for Routine Load jobs. #61755
• In Stream Load, the now(precision) function lost the precision parameter. #61721
• In Audit Log, the Scan Rows result for INSERT INTO SELECT statements was inaccurate. #61381
• After upgrading the cluster to v3.4.5, the fslib read iops metric increased compared to before the upgrade. #61724
• Queries against SQLServer using JDBC Catalog often got stuck. #61719
StarRocks/starrocks
Paul O'Brien
09/04/2025, 9:10 PM
GitHub
09/05/2025, 3:26 AM
enable_drop_table_check_mv_dependency (default: false). When set to true, if the object to be dropped is referenced by a downstream materialized view, the system prevents the execution of DROP TABLE / DROP VIEW / DROP MATERIALIZED VIEW. The error message lists the dependent materialized views and suggests checking the sys.object_dependencies view for details. #61584
• Logs now include the Linux distribution and CPU architecture of the build, to facilitate issue reproduction and troubleshooting. Log format: ... build <hash> distro <id> arch <arch>. #62017
• Persisted per-Tablet index and incremental column group file sizes are now cached, replacing on-demand directory scans. This accelerates Tablet status reporting in BE and reduces latency under high I/O scenarios. #61901
• Downgraded several high-frequency INFO logs in FE and BE to VLOG, and aggregated task submission logs, significantly reducing redundant storage-related logs and log volume under heavy load. #62121
• Improved query performance for External Catalog metadata through information_schema by pushing table filters before calling getTable, avoiding per-table RPCs. #62404
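The build-information log format mentioned above (... build <hash> distro <id> arch <arch>) can be picked apart with a small regular expression. The following is only a sketch: the pattern, the function name parse_build_info, and the sample line are assumptions made for illustration, and real log lines carry leading fields that the release note elides.

```python
import re

# Hypothetical pattern for the "build <hash> distro <id> arch <arch>" suffix.
# The field names (hash, distro, arch) follow the placeholders in the release note.
BUILD_INFO = re.compile(
    r"build\s+(?P<hash>\S+)\s+distro\s+(?P<distro>\S+)\s+arch\s+(?P<arch>\S+)"
)


def parse_build_info(line):
    """Return a dict with hash, distro and arch, or None if the suffix is absent."""
    match = BUILD_INFO.search(line)
    return match.groupdict() if match else None


# Made-up sample line; the values are not taken from a real StarRocks log.
sample = "... build abc1234 distro ubuntu arch x86_64"
info = parse_build_info(sample)
print(info)
```

A helper like this may be handy when collecting logs from mixed environments, since the distribution and architecture can then be grouped without reading each line by hand.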
### Bug Fixes
The following issues have been fixed:
• NullPointerException when fetching partition-level column statistics during the Plan stage due to missing data. #61935
• Fixed Parquet write issues with non-empty NULL arrays, and corrected SPLIT(NULL, …) behavior to consistently return NULL, preventing data corruption and runtime errors. #61999
• Failure when creating materialized views using CASE WHEN expressions due to incompatible VARCHAR type returns (fixed by ensuring consistency before and after refresh, and introducing a new FE configuration transform_type_prefer_string_for_varchar to prefer STRING and avoid length mismatch). #61996
• Statistics for nested CTEs could not be computed outside of memo when enable_rbo_table_prune was false. #62070
• In Audit Logs, inaccurate Scan Rows results for INSERT INTO SELECT statements. #61381
• ExceptionInInitializerError/NullPointerException during initialization caused FE startup failure when Query Queue v2 was enabled. #62161
• BE crash when LakePersistentIndex initialization failed and _memtable cleanup was triggered. #62279
• Permission issues during materialized view refresh due to creator roles not being activated (fixed by adding FE configuration mv_use_creator_based_authorization. When set to false, materialized views are refreshed as root, for compatibility with LDAP-authenticated clusters). #62396
• Materialized view refresh failures caused by case-sensitive List partition table names (fixed by enforcing case-insensitive uniqueness checks on partition names, aligning with OLAP table semantics). #62389
StarRocks/starrocks
GitHub
09/09/2025, 8:26 AM
rewrite_data_files.
• Supports Iceberg tables with hidden partitions, including creating, writing, and reading the tables. #58914
• Supports the TIME data type in the Paimon catalog. #58292
### Security and Authentication
• In scenarios where JWT authentication and the Iceberg REST Catalog are used, StarRocks supports the passthrough of user login information to Iceberg via the REST Session Catalog for subsequent data access authentication. #59611 #58850
• Supports vended credentials for the Iceberg catalog.
### Storage Optimization and Cluster Management
• Introduced the File Bundling optimization for the cloud-native table in shared-data clusters to automatically bundle the data files generated by loading, Compaction, or Publish operations, thereby reducing the API cost caused by high-frequency access to the external storage system. #58316
• Supports Kafka 4.0 for Routine Load.
• Supports full-text inverted indexes on Primary Key tables in shared-nothing clusters.
• Supports enabling case-insensitive processing on names of catalogs, databases, tables, views, and materialized views. #61136
• Supports blacklisting Compute Nodes in shared-data clusters. #60830
• Supports global connection ID. #57256
### Query and Performance Improvement
• Supports the DECIMAL256 data type, expanding the upper limit of precision from 38 to 76 digits. Its 256-bit storage provides better adaptability to high-precision financial and scientific computing scenarios, effectively mitigating DECIMAL128's precision overflow problem in very large aggregations and high-order operations. #59645
• Optimized the performance of the JOIN and AGG operators. #61691
• [Preview] Introduced SQL Plan Manager to allow users to bind a query plan to a query, thereby preventing the query plan from changing due to system state changes (mainly data updates and statistics updates), thus stabilizing query performance. #56310
• Introduced Partition-wise Spillable Aggregate/Distinct operators to replace the original Spill implementation based on sorted aggregation, significantly improving aggregation performance and reducing read/write overhead in complex and high-cardinality GROUP BY scenarios. #60216
• Flat JSON V2:
• Supports configuring Flat JSON on the table level. #57379
• Enhanced JSON columnar storage by retaining the V1 mechanism while adding page- and segment-level indexes (ZoneMaps, Bloom filters), predicate pushdown with late materialization, dictionary encoding, and integration of a low-cardinality global dictionary to significantly boost execution efficiency. #60953
• Supports an adaptive ZoneMap index creation strategy for the STRING data type. #61960
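The precision limits quoted for DECIMAL128 and DECIMAL256 above (38 and 76) follow from the storage width. The short calculation below is a sketch under one assumption that the release note does not state: that the unscaled value is held as a signed integer of the given bit width. The function name max_decimal_digits is made up for this example.

```python
def max_decimal_digits(storage_bits):
    """Largest digit count p such that every p-digit integer fits in a
    signed integer of storage_bits bits (assumed representation)."""
    max_value = 2 ** (storage_bits - 1) - 1  # largest signed value
    # max_value itself has one more digit than is guaranteed to fit,
    # because not every number with that many digits is <= max_value.
    return len(str(max_value)) - 1


decimal128_digits = max_decimal_digits(128)
decimal256_digits = max_decimal_digits(256)
print(decimal128_digits, decimal256_digits)  # prints: 38 76
```

Doubling the storage from 128 to 256 bits therefore doubles the guaranteed precision, which is where the headroom for very large aggregations comes from.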
### Functions and SQL Syntax
• Added the following functions:
• bitmap_hash64 #56913
• bool_or #57414
• strpos #57278
• to_datetime and to_datetime_ntz #60637
• regexp_count #57182
• tokenize #58965
• format_bytes #61535
• Provides the following syntactic extensions:
• Supports IF NOT EXISTS keywords in CREATE ANALYZE FULL TABLE. #59789
• Supports EXCLUDE clauses in SELECT. #57411
• Supports FILTER clauses in aggregate functions, improving readability and execution efficiency of conditional aggregations. #58937
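To illustrate what a FILTER clause on an aggregate function does, the sketch below runs the same conditional aggregation twice, once with FILTER and once with the older CASE WHEN form, and compares the results. It uses Python's built-in sqlite3 module (SQLite 3.30 or later) purely as a stand-in engine, so it shows standard SQL semantics rather than StarRocks behavior; the orders table and its rows are made up for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (region TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("east", 10), ("east", 250), ("west", 40), ("west", 300)],
)

# Conditional aggregation written with FILTER clauses.
with_filter = conn.execute(
    "SELECT region, "
    "       COUNT(*) FILTER (WHERE amount > 100), "
    "       SUM(amount) FILTER (WHERE amount <= 100) "
    "FROM orders GROUP BY region ORDER BY region"
).fetchall()

# The same aggregation written with CASE WHEN expressions.
with_case = conn.execute(
    "SELECT region, "
    "       COUNT(CASE WHEN amount > 100 THEN 1 END), "
    "       SUM(CASE WHEN amount <= 100 THEN amount END) "
    "FROM orders GROUP BY region ORDER BY region"
).fetchall()

print(with_filter)
print(with_filter == with_case)
```

The FILTER form keeps the condition next to the aggregate it applies to, which is the readability gain the release note refers to.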
StarRocks/starrocks
Beryl Chen
09/09/2025, 4:00 PM
Beryl Chen
09/10/2025, 3:00 PM
Andy Ye
09/12/2025, 9:34 PM