http://starrocks.io logo
Join Slack
Powered by
# announcements
  • g

    GitHub

    07/31/2025, 9:38 AM
    Release - 3.3.17 New release published by wangsimo0 ## v3.3.17 Release Date: July 30, 2025 ### Bug Fixes The following issues have been fixed: • Upgraded HttpClient5 to 5.4.3. #61298 • Incorrect
    cpu_core_used_permille
    limit in resource groups. #61177 • Conflict between ALTER jobs and partition creation tasks. #61167 • NPE caused by missing
    globalStateMgr
    in
    ConnectContext
    . #60880 • Partition creation failed when partition names matched case-insensitively but had different values. #60909 • Lock competition caused by synchronous access to partition statistics. #61041 • ANALYZE tasks stuck in
    pending
    state after FE restart. #61113 • Issue with JIT (Just-In-Time) compilation in BE. #61060 • Leader address issue in Starmgr. #61016 • CVE vulnerabilities in Broker. #60908 • Actual number of JDBC connections exceeded
    jdbc_connection_pool_size
    limit. #61004 • CVE-2022-41404 vulnerability. #59689 • CVEs related to Parquet and HttpClient5. #58750 • Partition not removed from
    _partition_map
    when physical partition ID was empty. #60842 • Missing version check in shared-data clusters. #59422 • Transaction log missing when publishing logs in batches in shared-data clusters. #60949 • Concurrent publishing of the same transaction when Batch Publish is enabled in shared-data clusters. #57574 • Statistics overwrite issue caused by lack of semi-synchronous mode. #60897 • Inaccurate
    maxInstantTime
    used for filtering Hudi files when retrieving latest merged file slices. #60927 • TaskRun state incompatible with earlier versions. #60438 • CVE-2025-52999 vulnerability. #60795 • Vulnerability caused by
    log4j-1.2.17-cloudera6
    in Broker. #59579 • BE crash when loading OOM partitions. #60778 • Base Compaction tasks blocking other compaction tasks. #60711 • Inefficient handling of error string truncation. #60878 • Materialized view rewrite failed in multi-FE environments. #60841 • INSERT OVERWRITE failed on manually created partitions. #60750 • Issue caused by using random distribution in aggregate keys. #60702 • Crash caused by low cardinality rewrite in
    multi_distinct_count
    . #60664 • Issue with Pivot resolving fields. #60748 • Upgraded
    hudi-common
    to 1.0.2. #59501 • BE crash when CLONE and DROP TABLE run concurrently. #61359 StarRocks/starrocks
    🚀 1
    🙌 2
  • b

    Beryl Chen

    08/05/2025, 7:10 PM
    Hi <!channel>, We recently hosted a webinar diving into the new features in StarRocks 3.5 with @Sida Shen—and had a great discussion with tons of thoughtful questions from the community! If you missed it (or just want to revisit the highlights), Sida also wrote two blog posts so you can catch up anytime: 📝 Blog: Introducing StarRocks 3.5 https://www.starrocks.io/blog/introducing-starrocks-3-5 💬 Q&A Recap: Security, Iceberg, and more https://www.starrocks.io/blog/35-webinar-qa-recap-starrocks-security-iceberg-support-and-more 🎬 Watch the full recording:

    https://www.youtube.com/watch?v=J0WJyXbWghU▾

    🙌 11
    👍 3
  • g

    GitHub

    08/07/2025, 7:58 AM
    Release - 3.4.6 New release published by jaogoy Release Date: August 7, 2025 ### Improvements • When exporting data to Parquet files using
    INSERT INTO FILES
    , you can now specify the Parquet version via the </StarRocks/starrocks/blob/sql-reference/sql-functions/table-functions/files.md#parquetversion|`parquet.version`> property to improve compatibility with other tools when reading the exported files. #60843 ### Bug Fixes The following issues have been fixed: • Loading jobs failed due to overly coarse lock granularity in
    TableMetricsManager
    . #58911 • Case sensitivity issue in column names when loading Parquet data via
    FILES()
    . #61059 • Cache did not take effect after upgrading a shared-data cluster from v3.3 to v3.4 or later. #60973 • A division-by-zero error occurred when the partition ID was null, causing a BE crash. #60842 • Broker Load jobs failed during BE scaling. #60224 ### Behavior Changes • The
    keyword
    column in the
    information_schema.keywords
    view has been renamed to
    word
    to align with the MySQL definition. #60863 StarRocks/starrocks
    👍 3
  • g

    GitHub

    08/11/2025, 7:30 AM
    Release - 3.5.3 New release published by yingtingdong Release Date: August 11, 2025 ### Feature Enhancements • Lake Compaction adds Segment write time statistics. #60891 • Avoid bRPC communication in local PassThrough exchange scenarios. #60538 • Disable inline mode for Data Cache writes to avoid performance degradation. #60530 • Iceberg metadata scan supports shared file I/O. #61012 • Support termination of all PENDING ANALYZE tasks. #61118 • Force reuse when there are too many CTE nodes to avoid excessive optimization time. #60983 • Added
    BALANCE
    type to cluster balance results. #61081 • Optimized materialized view rewrite for external tables. #61037 • Default value of system variable
    enable_materialized_view_agg_pushdown_rewrite
    is changed to
    true
    , enabling aggregation pushdown for materialized view queries by default. #60976 • Optimized partition statistics lock competition. #61041 ### Bug Fixes The following issues have been fixed: • Inconsistent Chunk column size after column pruning. #61271 • Synchronous execution of partition statistics loading may cause deadlocks. #61300 • Crash when
    array_map
    processes constant array columns. #61309 • Setting an auto-increment column to NULL results in the system mistakenly rejecting valid data within the same Chunk. #61255 • The actual number of JDBC connections may exceed the
    jdbc_connection_pool_size
    limit. #61038 • FQDN mode did not use IP addresses as cache map keys. #61203 • Array column cloning error during array comparison. #61036 • Deploying serialized thread pool blockage led to query performance degradation. #61150 • OK hbResponse not synchronized after heartbeat retry counter reset. #61249 • Incorrect result for the
    hour_from_unixtime
    function. #61206 • Conflicts between ALTER TABLE jobs and partition creation. #60890 • Cache does not take effect after upgrading from v3.3 to v3.4 or later. #60973 • Vector index metric
    hit_count
    is not set. #61102 • Stream Load transactions fail to find the coordinator node. #60154 • BE crashes when loading OOM partitions. #60778 • INSERT OVERWRITE failed on manually created partitions. #60750 • Partition creation failed when partition names matched case-insensitively but had different values. #60909 • The system does not support PostgreSQL UUID type. #61021 • Case sensitivity issue with column names when loading Parquet data via
    FILES()
    . #61059 StarRocks/starrocks
    👍 3
  • b

    Beryl Chen

    08/14/2025, 1:45 PM
    <!channel> Friendly reminder: Today’s session, _"_Why Customer-Facing Analytics Breaks—and How to Build It Right_,"_ is happening at 10 AM PT | 1 PM ET! Save your seat here: https://celerdata.wistia.com/live/events/uqt1a3umn1 Join us as we explore real-world strategies for delivering analytics directly to your customers—whether through dashboards, APIs, or AI agents. We’ll break down the common pitfalls of BI and OLAP systems and share examples of companies that overcame bottlenecks by rebuilding their architectures with StarRocks and Apache Iceberg. (No stress if you’re busy—register to get the recording afterward!)
    👍 8
    🙌 4
    • 1
    • 1
  • b

    Beryl Chen

    08/15/2025, 6:40 PM
    Good news for our Japanese community 🇯🇵 — StarRocks docs are available in Japanese 👉🏻 https://docs.starrocks.io/ja/docs/introduction/StarRocks_intro/ 日本語ドキュメントをご覧いただき、気づいたことや直したほうがいい点があればぜひ教えてください 🙌
    👍 9
    🥰 5
    🇯🇵 3
  • r

    Ronit Kapoor

    08/19/2025, 11:41 PM
    Hey <!channel>! Ron here. You might have seen me at a meetup or chatted with me on Slack, but I’m excited to officially introduce myself as CelerData’s new Developer Advocate! I joined the team almost two months ago and have been ramping up behind the scenes. To that end, I recently wrote a blog introducing myself to the community and wanted to share it here on Slack! https://medium.com/starrocks-engineering/hello-from-your-new-dev-advocate-9e27cec4ffe3 👋 I can’t wait to connect with you all, share what I’m working on, and hear your ideas!
    🙌 10
    dogdance 4
    👍 15
    bufo hello 7
    yay 7
    p
    s
    • 3
    • 2
  • g

    GitHub

    08/22/2025, 11:48 AM
    Release - 3.5.4 New release published by yingtingdong Release Date: August 22, 2025 ### Improvements • Added logs to clarify the reason that tablets cannot be repaired. #61959 • Optimized DROP PARTITION information in logs. #61787 • Assigned a large but configurable row count to tables with unknown stats for statistical estimation. #61332 • Added balance statistic according to label location. #61905 • Added colocate group balance statistics to improve cluster monitoring. #61736 • Skipped the Publish waiting phase when the number of healthy replicas exceeds the default replica count. #61820 • Included the tablet information collection time in the tablet report. #61643 • Supports writing Starlet files with tags. #61605 • Supports viewing cluster balance statistics via SHOW PROC. #61578 • Bumped librdkafka to 2.11.0 to support Kafka 4.0 and removed deprecated configurations. #61698 • Added
    prepared_timeout
    configuration to Stream Load Transaction Interface. #61539 • Upgraded StarOS to v3.5‑rc3. #61685 ### Bug Fixes The following issues have been fixed: • Incorrect Dict version of random distribution tables. #61933 • Incorrect query context in context conditions. #61929 • Publish failures caused by synchronous Publish for shadow tablets during ALTER operations. #61887 • CVE‑2025‑55163 issue. #62041 • Memory leak in real-time data ingestion from Apache Kafka. #61698 • Incorrect count of rebuild files in the lake persistent index. #61859 • Statistics collection on generated expression columns causes cross-database query errors. #61829 • Query Cache misaligns in shared-nothing clusters, causing inconsistent results. #61783 • High memory usage in CatalogRecycleBin due to retaining deleted partition information.#61582 • SQL Server JDBC connections fail when the timeout exceeds 65,535 milliseconds. #61719 • Security Integration fails to encrypt passwords, exposing sensitive information. #60666 •
    MIN()
    and
    MAX()
    functions on Iceberg partition columns return NULL unexpectedly. #61858 • Other predicates of Join containing non‑push‑down subfields were incorrectly rewritten. #61868 • QueryContext cancellation can lead to a use‑after‑free situation. #61897 • CBO’s table pruning overlooks other predicates. #61881 • Partial Updates in
    COLUMN_UPSERT_MODE
    may overwrite auto-increment columns with zero. #61341 • JDBC TIME type conversion uses an incorrect timezone offset that leads to wrong time values. #61783 •
    max_filter_ratio
    was not being serialized in Routine Load jobs. #61755 • Precision loss in the
    now(precision)
    function in Stream Load. #61721 • Cancelling a query may result in a “query id not found” error. #61667 • LDAP authentication may miss PartialResultException, causing incomplete query results. #60667 • Paimon Timestamp timezone conversion issue when the query condition contains DATETIME. #60473 StarRocks/starrocks
    🎉 4
    🙌 3
    ❤️ 1
  • w

    wxr xml file

    08/25/2025, 6:38 PM
    https://techhubshop.goodbarber.app/manage/cms/maps/add/
    • 1
    • 1
  • g

    GitHub

    08/28/2025, 7:43 AM
    Release - 3.3.18 New release published by wangsimo0 ## 3.3.18 Release Date: August 28, 2025 ### Bug Fixes The following issues have been fixed: • BE crashes when
    LakePersistentIndex
    initialization failed due to cleanup of
    _memtable
    . #62279 • A concurrency issue caused by missing locks when retrieving the maximum Tablet version in the replication transaction manager. #62238 • A hang issue in the phased scheduler, which waited indefinitely during synchronous Profile collection (after the fix, the system correctly terminates Profile collection when scheduling errors occur). #62140 • Exception handling issues in low-cardinality optimization under the
    ALLOW_THROW_EXCEPTION
    mode (after the fix, exceptions in expression evaluation are properly caught and returned). #62098 • FThe system failed to compute nested CTE statistics outside of the memo during table pruning when
    enable_rbo_table_prune
    was set to
    false
    . #62070 • CVE-2025-55163 issue. #62041 • An issue where
    split_morsel_queue
    nested inside
    partition_morsel_queue
    failed to correctly receive the Tablet Schema. #62034 • Incorrect handling of
    NULL
    arrays during Parquet writes, which could cause data inconsistency or crashes (after the fix, the system ensures the
    split
    function can correctly handle
    NULL
    input strings). #61999 • Failure when creating materialized views using
    CASE WHEN
    expressions due to incompatible return types of VARCHAR (after the fix, the system ensures consistency before and after refresh). #61996 • A concurrency safety issue caused by long operations holding shard-level locks while calculating compression scores. #61899 • An incomplete table pruning issue in CBO caused by pruning logic not considering all relevant predicates. #61881 StarRocks/starrocks
  • b

    Beryl Chen

    08/28/2025, 10:28 PM
    Shout-out to @Anton Borisov and the data team at Fresha for this fantastic deep dive 👏 https://medium.com/fresha-data-engineering/how-we-accidentally-became-one-of-uks-first-starrocks-production-pioneers-7db249f10010 — check it out before you dive in. Fresha—the world’s leading marketplace for beauty, wellness, and self-care, trusted by millions of consumers and businesses worldwide—hit a wall by mid-2024 when Postgres (OLTP) was overloaded by analytics and Snowflake turned out to be too slow and too costly for near real-time needs. See how adopting StarRocks as the middle layer brought together OLTP protection, open-format federation, and real-time performance—making customer dashboards render in ~200ms. Anton is joining the lineup at our #C090XCAJWQ2 Don’t miss his session! See who else is speaking at the Summit. <!channel>
    🎉 4
    starrocks 4
  • b

    Beryl Chen

    08/28/2025, 10:36 PM
    danceml We’d love for you to join our StarRocks Global Summit Swag Giveaway! StarRocks Summit 2025 is happening free and online on September 10, and we’re celebrating with some premium StarRocks swag packs. Here’s how you can get in on it: • Already registered? Enter the email you used to register [in this form] and you’re in! • Not registered yet? Sign up for Summit HERE, then drop the email you used [in this form] to join the drawing. We’ll pick 2 winners after Summit and ship the swag straight to you!
    👀 3
  • g

    GitHub

    09/01/2025, 11:20 AM
    Release - 3.4.7 New release published by jaogoy Release Date: September 1, 2025 ### Bug Fixes The following issues have been fixed: •
    max_filter_ratio
    is not persisted for Routine Load jobs. #61755 • In Stream Load, the
    now(precision)
    function lost the precision parameter. #61721 • In Audit Log, the Scan Rows result for
    INSERT INTO SELECT
    statements was inaccurate. #61381 • After upgrading the cluster to v3.4.5, the
    fslib read iops
    metric increased compared to before the upgrade. #61724 • Queries against SQLServer using JDBC Catalog often got stuck. #61719 StarRocks/starrocks
    🙌 3
  • p

    Paul O'Brien

    09/04/2025, 9:10 PM
    LAST CALL! starrocks*Make a 10-second StarRocks video, get $100*starrocks We’re inviting YOU to share 🎥 A quick 10–30 seconds on why you use, love, or contribute to StarRocks (phone video is perfect!). If your clip makes it into the Summit reel, you get a $100 Amazon gift card. Details are HERE. It takes 5 minutes 😎
    😎 3
  • g

    GitHub

    09/05/2025, 3:26 AM
    Release - 3.5.5 New release published by yingtingdong Release date: September 5, 2025 ### Improvements • Added a new system variable
    enable_drop_table_check_mv_dependency
    (default:
    false
    ). When set to
    true
    , if the object to be dropped is referenced by a downstream materialized view, the system prevents the execution of
    DROP TABLE
    /
    DROP VIEW
    /
    DROP MATERIALIZED VIEW
    . The error message lists the dependent materialized views and suggests checking the
    sys.object_dependencies
    view for details. #61584 • Logs now include the Linux distribution and CPU architecture of the build, to facilitate issue reproduction and troubleshooting. Log format:
    ... build <hash> distro <id> arch <arch>
    . #62017 • Persisted per-Tablet index and incremental column group file sizes are now cached, replacing on-demand directory scans. This accelerates Tablet status reporting in BE and reduces latency under high I/O scenarios. #61901 • Downgraded several high-frequency INFO logs in FE and BE to VLOG, and aggregated task submission logs, significantly reducing redundant storage-related logs and log volume under heavy load. #62121 • Improved query performance for External Catalog metadata through
    information_schema
    by pushing table filters before calling
    getTable
    , avoiding per-table RPCs. #62404 ### Bug Fixes The following issues have been fixed: • NullPointerException when fetching partition-level column statistics during the Plan stage due to missing data. #61935 • Fixed Parquet write issues with non-empty NULL arrays, and corrected
    SPLIT(NULL, …)
    behavior to consistently return NULL, preventing data corruption and runtime errors. #61999 • Failure when creating materialized views using
    CASE WHEN
    expressions due to incompatible VARCHAR type returns (fixed by ensuring consistency before and after refresh, and introducing a new FE configuration
    transform_type_prefer_string_for_varchar
    to prefer STRING and avoid length mismatch). #61996 • Statistics for nested CTEs could not be computed outside of memo when
    enable_rbo_table_prune
    was
    false
    . #62070 • In Audit Logs, inaccurate Scan Rows results for INSERT INTO SELECT statements. #61381 • ExceptionInInitializerError/NullPointerException during initialization caused FE startup failure when Query Queue v2 was enabled. #62161 • BE crash when
    LakePersistentIndex
    initialization failed and
    _memtable
    cleanup was triggered. #62279 • Permission issues during materialized view refresh due to creator roles not being activated (fixed by adding FE configuration
    mv_use_creator_based_authorization
    . When set to
    false
    , materialized views are refreshed as root, for compatibility with LDAP-authenticated clusters). #62396 • Materialized view refresh failures caused by case-sensitive List partition table names (fixed by enforcing case-insensitive uniqueness checks on partition names, aligning with OLAP table semantics). #62389 StarRocks/starrocks
    👍 2
    🚀 1
  • g

    GitHub

    09/09/2025, 8:26 AM
    Release - 4.0.0-RC New release published by wangsimo0 ## 4.0.0-RC Release date: September 9, 2025 ### Data Lake Analytics • Unified Page Cache and Data Cache for BE metadata, and adopted an adaptive strategy for scaling. #61640 • Optimized metadata file parsing for Iceberg statistics to avoid repetitive parsing. #59955 • Optimized COUNT/MIN/MAX queries against Iceberg metadata by efficiently skipping over data file scans, significantly improving aggregation query performance on large partitioned tables and reducing resource consumption. #60385 • Supports compaction for Iceberg tables via procedure
    rewrite_data_files
    . • Supports Iceberg tables with hidden partitions, including creating, writing, and reading the tables. #58914 • Supports the TIME data type in the Paimon catalog. #58292 ### Security and Authentication • In scenarios where JWT authentication and the Iceberg REST Catalog are used, StarRocks supports the passthrough of user login information to Iceberg via the REST Session Catalog for subsequent data access authentication. #59611 #58850 • Supports vended credentials for the Iceberg catalog. ### Storage Optimization and Cluster Management • Introduced the File Bundling optimization for the cloud-native table in shared-data clusters to automatically bundle the data files generated by loading, Compaction, or Publish operations, thereby reducing the API cost caused by high-frequency access to the external storage system. #58316 • Supports Kafka 4.0 for Routine Load. • Supports full-text inverted indexes on Primary Key tables in shared-nothing clusters. • Supports enabling case-insensitive processing on names of catalogs, databases, tables, views, and materialized views. #61136 • Supports blacklisting Compute Nodes in shared-data clusters. #60830 • Supports global connection ID. #57256 ### Query and Performance Improvement • Supports DECIMAL256 data type, expanding the upper limit of precision from 38 to 76 bits. Its 256-bit storage provides better adaptability to high-precision financial and scientific computing scenarios, effectively mitigating DECIMAL128's precision overflow problem in very large aggregations and high-order operations. #59645 • Optimized the performance of the JOIN and AGG operators. #61691 • [Preview] Introduced SQL Plan Manager to allow users to bind a query plan to a query, thereby preventing the query plan from changing due to system state changes (mainly data updates and statistics updates), thus stabilizing query performance. #56310 • Introduced Partition-wise Spillable Aggregate/Distinct operators to replace the original Spill implementation based on sorted aggregation, significantly improving aggregation performance and reducing read/write overhead in complex and high-cardinality GROUP BY scenarios. #60216 • Flat JSON V2: • Supports configuring Flat JSON on the table level. #57379 • Enhance JSON columnar storage by retaining the V1 mechanism while adding page- and segment-level indexes (ZoneMaps, Bloom filters), predicate pushdown with late materialization, dictionary encoding, and integration of a low-cardinality global dictionary to significantly boost execution efficiency. #60953 • Supports an adaptive ZoneMap index creation strategy for the STRING data type. #61960 ### Functions and SQL Syntax • Added the following functions: •
    bitmap_hash64
    #56913 •
    bool_or
    #57414 •
    strpos
    #57278 •
    to_datetime
    and
    to_datetime_ntz
    #60637 •
    regexp_count
    #57182 •
    tokenize
    #58965 •
    format_bytes
    #61535 • Provides the following syntactic extensions: • Supports IF NOT EXISTS keywords in CREATE ANALYZE FULL TABLE. #59789 • Supports EXCLUDE clauses in SELECT. #57411 • Supports FILTER clauses in aggregate functions, improving readability and execution efficiency of conditional aggregations. #58937 StarRocks/starrocks
    starrocks 10
    🥳 2
  • b

    Beryl Chen

    09/09/2025, 4:00 PM
    Quick heads-up — StarRocks Summit 2025 is tomorrow (Sept 10)! yay See the full agenda + grab your free pass if you haven’t already: [link] <!channel> We’ve shared a lot already, so I’ll keep this short: it’s one day, free, and virtual. 25+ engineer-to-engineer sessions. Real lessons. Real solutions. Real production wins. You’ll hear directly from engineers at Intuit, Coinbase, Pinterest, Demandbase, TRM Labs, Celonis, Eightfold, Fresha, and more — sharing what actually worked (and what broke) in their stacks. Expect deep dives into real-time customer-facing analytics, StarRocks + Iceberg powering lakehouse architectures, and hands-on lessons from engineers working with AI and agents. Excited to learn together with you all tomorrow! cat dance
    starrocks 2
    🎉 13
  • b

    Beryl Chen

    09/10/2025, 3:00 PM
    Summit goes live in 1 hour — 9 AM PT / 12 PM ET / 6 PM CEST / 9:30 PM IST danceml Grab a coffee ☕ and join the ride with us! <!channel> 👉 See the full agenda + grab your free pass if you haven’t already: [link]
    Summit promotion linkedin post.mov
    👍 2
    s
    v
    • 3
    • 2
  • a

    Andy Ye

    09/12/2025, 9:34 PM
    Hi folks, we hosted our very first StarRocks Global Summit. It was a successful event, with engineers from well-known companies such as Intuit, Celonis, Fanatics, and Pinterest sharing their use cases with StarRocks. The content was excellent, and the feedback from the audience was very positive. Many thanks to the colleagues who helped organize the summit, the speakers, and the audience from around the world. We also recorded a community voice session from our open-source users. You’re welcome to like and share it! 🎉 https://www.linkedin.com/posts/starrocks-oss_starrockssummit2025-dataanalytics-dataen[…]m=member_desktop&amp;rcm=ACoAAAEXiCEBMiHpzHjFkEm-Y43MqOWlMShBEAo
    💚 10
    starrocks 3
  • r

    Ronit Kapoor

    09/19/2025, 9:44 PM
    📢 New Video + Feedback Wanted! Hey <!channel>! Ron again - I just published a new beginner-friendly quickstart tutorial for those new to StarRocks: ▶️ StarRocks in Docker –

    https://www.youtube.com/watch?v=h7F4U6xEA5M&t=1s▾

    I’d also love your input on what kind of StarRocks content you’d like to see next. It only takes a minute to share your thoughts here: 📝 Feedback Form - https://forms.gle/VwmrMMrD7QVxv7Mg9 Your guys' feedback will help me decide what tutorials, blogs, and docs to make in the future, so I appreciate the support! Thanks for being part of the StarRocks community!
    👀 2
    👍 4
    yay 3
    rocky heart 3
    🙌 1
  • g

    GitHub

    09/22/2025, 1:26 AM
    Release - 3.5.6 New release published by yingtingdong Release date: September 22, 2025 ### Improvements • A decommissioned BE will be forcibly dropped when all its tablets are in the recycle bin, to avoid the decommission being blocked by those tablets. #62781 • Vacuum metrics will be updated when Vacuum succeeds. #62540 • Added thread pool metrics to the fragment instance execution state report, including active threads, queue count, and running threads. #63067 • Supports S3 path-style access in shared-data clusters to improve compatibility with MinIO and other S3-compatible storage systems. You can enable this feature by setting
    aws.s3.enable_path_style_access
    to
    true
    when creating a storage volume. #62591 • Supports resetting the starting point of the AUTO_INCREMENT value via
    Copy code
    ALTER TABLE`` <table_name>`` AUTO_INCREMENT`` = 10000;
    . #62767 • Supports using Distinguished Name (DN) in Group Provider for group matching, improving the user group solution for LDAP/Microsoft Active Directory environments. #62711 • Supports Azure Workload Identity authentication for Azure Data Lake Storage Gen2. #62754 • Added transaction error messages to the
    Copy code
    information_schema.``loads
    view to aid failure diagnosis. #61364 • Supports reusing common expressions for complex CASE WHEN expressions in Scan predicates to reduce repetitive computation. #62779 • Uses the REFRESH (instead of ALTER) privilege on the materialized view to execute REFRESH statements. #62636 • Disabled low-cardinality optimization on Lake tables by default to avoid potential issues. #62586 • Enabled tablet balancing between workers by default in shared-data clusters. #62661 • Supports reusing expressions in outer-join WHERE predicates to reduce repetitive computation. #62139 • Added Clone metrics in FE. #62421 • Added Clone metrics in BE. #62479 • Added an FE configuration item
    enable_statistic_cache_refresh_after_write
    to disable statistics-cache lazy refresh by default. #62518 • Masked credential information in SUBMIT TASK for better security. #62311 •
    json_extract
    in the Trino dialect returns a JSON type. #59718 • Supports ARRAY type in
    null_or_empty
    . #62207 • Adjusted the size limit for the Iceberg manifest cache. #61966 • Added a remote file-cache limit for Hive. #62288 ### Bug Fixes The following issues have been fixed: • Secondary replicas hang indefinitely due to negative timeout values, which cause incorrect timestamp comparisons. #62805 • PublishTask may be blocked when TransactionState is REPLICATION. #61664 • Incorrect repair mechanism for Hive tables that have been dropped and recreated during materialized view refresh. #63072 • Incorrect execution plans were generated after the materialized view aggregation push‑down rewrite. #63060 • ANALYZE PROFILE failures caused by PlanTuningGuide producing unrecognized strings (null explainString) in the query profiles. #63024 • Inappropriate return type of
    hour_from_unixtime
    and incorrect rewrite rule of
    CAST
    . #63006 • NPE in Iceberg manifest cache under data races. #63043 • Shared-data clusters lack support for colocation in materialized views. #62941 • Iceberg table Scan Exception during Scan Range deployment. #62994 • Incorrect execution plans were generated for view-based rewrite. #62918 • Errors and disrupted tasks due to Compute Nodes are not gracefully shut down on exit. #62916 • NPE when Stream Load execution status updates. #62921 • An issue with statistics when the column name and the name in the PARTITION BY clause differ in case. #62953 • Wrong results are returned when the
    LEAST
    function is used as a predicate. #62826 • Invalid ProjectOperator above the table-pruning frontier CTEConsumer. #62914 • Redundant replica handling after Clone. #62542 • Failed to collect Stream Load profiles. #62802 • Ineffective disk rebalancing caused by improper BE selection. #62776 • A potential NPE crash in LocalTabletsChannel when a missing
    tablet_id
    leads to a null delta writer. #62861 • KILL ANALYZE does not take effect. #62842 • SQL syntax errors in histogram stats when MCV values contain single quotes. #62853 • Incorrect output format of metrics for Prometheus. #62742 • NPE when querying
    information_schema.analyze_status
    after the database is dropped. #62796 • CVE-2025-58056. #62801 • When SHOW CREATE ROUTINE LOAD is executed, wrong results are returned because the database is considered null if not specified. #62745 • Data loss caused by incorrectly skipping CSV headers in
    files()
    . #62719 • NPE when replaying batch-transaction upserts. #62715 • Publish being incorrectly reported as successful during graceful shutdown in shared-nothing clusters. #62417 • Crash in asynchronous delta writer due to a null pointer. #62626 • Materialized view refresh is skipped because the materialized view version map is not cleared after a failed restore job. #62634 • Issues caused by case-sensitive partition column validation in the materialized view analyzer. #62598 • Duplicate IDs for statements with syntax errors. #62258 • StatisticsExecutor status is overridden due to redundant state assignment in CancelableAnalyzeTask. #62538 • Incorrect e… StarRocks/starrocks
    👍 5
    🎉 6
  • g

    GitHub

    09/29/2025, 7:32 AM
    Release - 4.0.0-RC02 New release published by wangsimo0 ## 4.0.0-RC02 Release Date: September 29, 2025 ### New Features • Supports setting sort keys when creating Iceberg tables. • Supports Multi-Table Write-Write Transactions, allowing users to atomically commit
    INSERT
    ,
    UPDATE
    , and
    DELETE
    operations. These transactions are compatible with both Stream Load and
    INSERT INTO
    interfaces, ensuring cross-table consistency in ETL and real-time ingestion scenarios. • Supports modifying aggregation keys of aggregate tables. ### Improvements • Optimized Delta Lake Catalog cache configuration: adjusted default values of
    DELTA_LAKE_JSON_META_CACHE_TTL
    and
    DELTA_LAKE_CHECKPOINT_META_CACHE_TTL
    to 24 hours, and simplified Parquet handler logic. #63441 • Improved Delta Lake Catalog error log format and content for better debugging. #63389 • External groups (e.g., LDAP Group) now support role grant/revoke and display, improving SQL syntax and test coverage for stronger access control. #63385 • Strengthened Stream Load parameter consistency checks to reduce risks caused by parameter drift. #63347 • Optimized Stream Load label passing mechanism to reduce dependencies. #63334 • Improved
    ANALYZE PROFILE
    format: ExplainAnalyzer now supports grouping metrics by operator. #63326 • Enhanced
    QueryDetailActionV2
    and
    QueryProfileActionV2
    APIs to return results in JSON format. #63235 • Improved predicate parsing in scenarios with large numbers of CompoundPredicates. #63139 • Adjusted certain FE metrics to be leader-aware. #63004 • Enhanced
    SHOW PROCESS LIST
    with Catalog and Query ID information. #62552 • Improved BE JVM memory monitoring metrics. #62210 • Optimized materialized view rewrite logic and log outputs. #62985 • Optimized random bucketing strategy. #63168 • Supports resetting
    AUTO_INCREMENT
    start value with
    ALTER TABLE <table_name> AUTO_INCREMENT = 10000;
    . #62767 • Group Provider now supports matching groups by DN. #62711 ### Bug Fixes The following issues have been fixed: • Incomplete
    Left Join
    results caused by ARRAY low-cardinality optimization. #63419 • Incorrect execution plan generated after materialized view aggregate pushdown rewrite. #63060 • Redundant warning logs printed in JSON field pruning scenarios when schema fields were not found. #63414 • Infinite loop caused by SIMD Batch parameter errors when inserting DECIMAL256 data in ARM environments. #63406 • Three storage-related issues: #63398 • Cache exception when disk path is empty. • Incorrect Azure cache key prefix. • S3 multipart upload failure. • ZoneMap filter invalidation after CHAR-to-VARCHAR schema change with Fast Schema Evolution. #63377 • ARRAY aggregation type analysis error caused by intermediate type
    ARRAY<NULL_TYPE>
    . #63371 • Metadata inconsistency in partial updates based on auto-increment columns. #63370 • Metadata inconsistency when deleting tablets or querying concurrently. #63291 • Failure to create
    spill
    directory during Iceberg table writes. #63278 • Ranger Hive Service permission changes not taking effect. #63251 • Group Provider did not support
    IF NOT EXISTS
    and
    IF EXISTS
    clauses. #63248 • Errors caused by using reserved keywords in Iceberg partitions. #63243 • Prometheus metric format issue. #62742 • Version check failure when starting replication transactions with Compaction enabled. #62663 • Missing Compaction Profile when File Bunding was enabled. #62638 • Issues handling redundant replicas after Clone. #62542 • Delta Lake tables failed to find partition columns. #62953 • Materialized views did not support Colocation in shared-data clusters. #62941 • Issues reading NULL partitions in Iceberg tables. #62934 • SQL syntax error caused by single quotes in Histogram statistics MCV (Most Common Values). #62853 •
    KILL ANALYZE
    command not working. #62842 • Failure collecting Stream Load profiles. #62802 • Incorrect CTE reuse plan extraction. #62784 • Rebalance failure due to incorrect BE selection. #62776 •
    User Property
    priority is lower than
    Session Variable
    . #63173 StarRocks/starrocks
    🎉 8
  • g

    GitHub

    09/30/2025, 10:18 AM
    Release - 3.4.8 New release published by jaogoy Release Date: September 30, 2025 ## Behavior Change • Lake internal tablet parallel scan (enable_lake_tablet_internal_parallel) is now enabled by default, increasing per‑query internal parallelism (may raise peak resource usage) #62360 ## Bug Fixes The following issues have been fixed: ### Data Lake Analytics • Delta Lake partition column names were forcibly converted to lowercase, causing mismatch with actual column names #62970 • Iceberg manifest cache eviction race could trigger a NullPointerException #63052 • Uncaught generic exceptions during Iceberg scan phase interrupted scan range submission and produced no metrics #63019 ### Materialized Views (MV) • Complex multi-layer projected views used in MV rewrite produced invalid plans or missing column statistics #63014 #62230 • Case mismatch of Hive external table MV partition columns was incorrectly rejected #62623 • MV refresh used only the creator’s default role, causing insufficient privilege in “no default role” or LDAP setups (role activation strategy & config introduced) #62461 • Case-insensitive conflicts in list-partitioned MV partition names led to duplicate name errors #62443 • Residual version mapping after failed MV restore caused subsequent incremental refresh to be skipped, returning empty results #62643 • Abnormal partitions after MV recovery caused FE restart NullPointerException #62563 • Non-global aggregation queries incorrectly applied aggregation pushdown rewrite producing invalid plans #63105 ### Storage / Metadata • Tablet deletion state was only updated in memory (shutdown) and not persisted, so GC still treated it as running and skipped reclamation #63623 #63620 • Concurrent query plus drop tablet led to early delvec cleanup and “no delete vector found” errors #63307 • Base and cumulative sstable sharing the same max_rss_rowid in PK index compaction were misordered, risking lost delete semantics #63362 • Possible BE crash when LakePersistentIndex destructor ran after a failed initialization #62297 • Graceful shutdown of publish thread pool silently discarded queued tasks without marking failures, creating version holes and a false “all succeeded” impression #62683 • Newly cloned replica on a newly added BE during rebalance was immediately judged redundant and removed, preventing data migration to the new node #62894 • Missing lock when reading tablet max version caused inconsistent replication transaction decisions #62280 ### Query & Optimization • Combination of date_trunc equality and raw column range predicate was reduced to a point interval, returning empty result sets (e.g. date_trunc('month', dt)='2025-09-01' AND dt>'2025-09-23') #63570 • Pushdown of non-deterministic predicates (random/time functions) produced inconsistent results #63533 • Missing consumer node after CTE reuse decision produced incomplete execution plans #63188 • Type mismatch crashes when table functions and low-cardinality (dictionary) encoding coexisted #62500 #62384 ### Ingestion & Export • Oversized CSV split into parallel fragments caused every fragment to skip header rows, leading to data loss (only the first fragment should skip) #62789 • SHOW CREATE ROUTINE LOAD without explicit DB returned job from another database with the same name #62792 • NullPointerException when sameLabelJobs became null during concurrent load job cleanup #63181 ### Cluster Operations & Management • CN normal restart or crash path incorrectly executed scale-in deregistration, harming topology consistency #63002 #63010 • Backend decommission blocked even when all tablets were already in recycle bin (no force completion) #63267 • OPTIMIZE TABLE task stuck in PENDING after thread pool rejection #62556 • Dirty tablet metadata cleanup used GTID arguments in the wrong order #62285 StarRocks/starrocks
  • b

    Beryl Chen

    10/09/2025, 4:03 AM
    <!channel> Friendly reminder: Today’s session, “Real-Time Analytics for Web3: Fraud Detection, Trading, and Growth at Scale,” starts at 11:30 AM IST | 1:00 PM ICT (VN/TH/Jakarta) | 2:00 PM SGT/CST/HKT/PHT/TWT | 3:00 PM KST/JST! 👉 Save your seat here: https://celerdata.wistia.com/live/events/60tbj98da5?utm_campaign=slack Join Sida Shen to explore the unique requirements of real-time analytics for Web3—why legacy stacks can’t keep up, and how companies like Coinbase are overcoming those limits. Discover how to achieve second-level data freshness (<10 s latency without batch jobs), run sub-second queries even under heavy concurrency, and query Iceberg directly without ETL overhead. The timing is perfect for participants in Asia, but the insights are global. Can’t join live? Register anyway, and we’ll send you the recording. danceml
    🙌 3
    party 8
  • g

    GitHub

    10/14/2025, 7:13 AM
    Release - 3.3.19 New release published by wangsimo0 ## 3.3.19 Release Date: October 14, 2025 ### Bug Fixes The following issues have been fixed: •
    UserProperty
    had lower priority than Session Variables. #63173 • Materialized view refresh failures that could occur when the Hive base table was dropped and recreated. #63072 • Issues with the aggregation pushdown rewrite rule. #63060 • Inconsistencies between null columns and data columns in Boolean extraction functions for JSON. #63054 • Issues when getting partition columns in Delta Lake format tables. #62953 • Lack of colocation support for materialized views in shared-data clusters. #62941 • Projection mapping errors in view-based materialized view rewrite. #62918 • SQL syntax errors in histogram statistics when Most Common Values (MCV) contained single quotes. #62853 •
    KILL ANALYZE
    did not work. #62842 • CVE-2025-58056 vulnerability. #62801 • Executing
    SHOW CREATE ROUTINE LOAD
    without specifying a database causes wrong results. #62745 • Data loss caused by incorrectly skipping CSV headers in
    files()
    . #62719 • Version check failures when Replication and Compaction transactions were committed together. #62663 • Materialized view refresh is skipped because the materialized view version map is not cleared after a failed restore job. #62634 • Issues caused by case-sensitive partition column validation in the materialized view analyzer. #62598 StarRocks/starrocks
    🎉 1
  • g

    GitHub

    10/17/2025, 8:53 AM
    Release - 4.0.0 New release published by wangsimo0 # StarRocks version 4.0 ## 4.0.0 Release date: October 17, 2025 ### Data Lake Analytics • Unified Page Cache and Data Cache for BE metadata, and adopted an adaptive strategy for scaling. #61640 • Optimized metadata file parsing for Iceberg statistics to avoid repetitive parsing. #59955 • Optimized COUNT/MIN/MAX queries against Iceberg metadata by efficiently skipping over data file scans, significantly improving aggregation query performance on large partitioned tables and reducing resource consumption. #60385 • Supports compaction for Iceberg tables via procedure
    rewrite_data_files
    . • Supports Iceberg tables with hidden partitions, including creating, writing, and reading the tables. #58914 • Supports setting sort keys when creating Iceberg tables. • Optimizes sink performance for Iceberg tables. • Iceberg Sink supports spilling large operators, global shuffle, and local sorting to optimize memory usage and address small file issues. #61963 • Iceberg Sink optimizes local sorting based on Spill Partition Writer to improve write efficiency. #62096 • Iceberg Sink supports global shuffle for partitions to further reduce small files. #62123 • Enhanced bucket-aware execution for Iceberg tables to improve concurrency and distribution capabilities of bucketed tables. #61756 • Supports the TIME data type in the Paimon catalog. #58292 • Upgraded Iceberg version to 1.10.0. #63667 ### Security and Authentication • In scenarios where JWT authentication and the Iceberg REST Catalog are used, StarRocks supports the passthrough of user login information to Iceberg via the REST Session Catalog for subsequent data access authentication. #59611 #58850 • Supports vended credentials for the Iceberg catalog. • Supports granting StarRocks internal roles to external groups obtained via Group Provider. #63385 #63258 • Added REFRESH privilege to external tables to control the permission to refresh them. #63385 ### Storage Optimization and Cluster Management • Introduced the File Bundling optimization for the cloud-native table in shared-data clusters to automatically bundle the data files generated by loading, Compaction, or Publish operations, thereby reducing the API cost caused by high-frequency access to the external storage system. #58316 • Supports Multi-Table Write-Write Transaction to allow users to control the atomic submission of INSERT, UPDATE, and DELETE operations. The transaction supports Stream Load and INSERT INTO interfaces, effectively guaranteeing cross-table consistency in ETL and real-time write scenarios. #61362 • Supports Kafka 4.0 for Routine Load. • Supports full-text inverted indexes on Primary Key tables in shared-nothing clusters. • Supports modifying aggregate keys of Aggregate tables. #62253 • Supports enabling case-insensitive processing on names of catalogs, databases, tables, views, and materialized views. #61136 • Supports blacklisting Compute Nodes in shared-data clusters. #60830 • Supports global connection ID. #57256 • Added the
    recyclebin_catalogs
    metadata view to Information Schema to display recoverable deleted metadata. #51007 ### Query and Performance Improvement • Supports DECIMAL256 data type, expanding the upper limit of precision from 38 to 76 bits. Its 256-bit storage provides better adaptability to high-precision financial and scientific computing scenarios, effectively mitigating DECIMAL128's precision overflow problem in very large aggregations and high-order operations. #59645 • Improved the performance for basic operators.#61691 #61632 #62585 #61405 #61429 • Optimized the performance of the JOIN and AGG operators. #61691 • [Preview] Introduced SQL Plan Manager to allow users to bind a query plan to a query, thereby preventing the query plan from changing due to system state changes (mainly data updates and statistics updates), thus stabilizing query performance. #56310 • Introduced Partition-wise Spillable Aggregate/Distinct operators to replace the original Spill implementation based on sorted aggregation, significantly improving aggregation performance and reducing read/write overhead in complex and high-cardinality GROUP BY scenarios. #60216 • Flat JSON V2: • Supports configuring Flat JSON on the table level. #57379 • Enhance JSON columnar storage by retaining the V1 mechanism while adding page- and segment-level indexes (ZoneMaps, Bloom filters), predicate pushdown with late materialization, dictionary encoding, and integration of a low-cardinality global dictionary to significantly boost execution efficiency. #60953 • Supports an adaptive ZoneMap index creation strategy for the STRING data type. #61960 • Enhanced query observability: • Optimized EXPLAIN ANALYZE output to display the execution metrics by group and by operator for better readability. #63326 •
    QueryDetailActionV2
    and
    QueryProfileActionV2
    now support JSON format, enhancing cross-FE query capabilities. #63235 • Supports retrieving Query Profile information across all FEs. #61345 • SHOW PROCESSLIST statements display Catalog, Query ID, and other information. #62552 • Enhanced query queue and process monitoring, supporting display of Running/Pending statuses.#62261 • Materialized view rewrites consider the distribution and sort keys of the original table, improving the selection of optimal materialized views. #62830 ### Functions and SQL Syntax • Added the following functions: •
    bitmap_hash64
    #56913 •
    bool_or
    #57414 •
    strpos
    #57278 •
    to_datetime
    and
    to_datetime_ntz
    #60637 •
    regexp_count
    #57182 •
    tokenize
    <https://github.com/Star… StarRocks/starrocks
    👍 10
    starrocks 8
    👍🏼 1
  • b

    Beryl Chen

    10/17/2025, 7:30 PM
    🎉 We did it — The StarRocks Community just crossed 5,000 members! <!channel> You’ve built, tested, benchmarked, shared, and shaped StarRocks into what it is today — a fast, open-source engine driving real-time analytics at scale. Let’s call it #5KFriday rocky confused💚 Thank you for being part of the StarRocks movement! rocky heart
    clapclap 3
    ❤️ 2
    😍 2
    👍 3
    🤩 2
    🥳 2
    starrocks 7
    party 13
    🙌 22
  • p

    Paul O'Brien

    10/20/2025, 6:04 PM
    🚀 StarRocks 4.0 is live! Open • Fast • Governed Highlights: - ~60% faster YoY with deep optimizations for JOINs, aggregations, and spill handling. - First-class Apache Iceberg: faster metadata parsing, hidden-partition reads/writes, a new compaction API, and optimized file writes. - JSON as a first-class type (Flat JSON V2): 3–15× faster queries without flattening. - Real-time at lower cost: file bundling, metadata caching, and smarter compaction cut cloud API calls by up to 90%. - Lakehouse governance: catalog-centric access control on Apache Iceberg (JWT passthrough + vended credentials). - Expanded workloads: Decimal256 for high-precision, large-scale aggregations; multi-statement transactions for financial and multi-stage pipelines; ASOF JOIN for time-series and AI use cases. - Operational improvements: node blacklisting, case-insensitive names, and global connection IDs. 📖 Launch blog: https://www.starrocks.io/blog/starrocks-4.0-now-available 📚 Docs: https://docs.starrocks.io/releasenotes/release-4.0/ Join the full walkthrough + live Q&A with @Sida Shen and @Ronit Kapoor this Thursday, Oct 23: https://celerdata.wistia.com/live/events/iot8atqjzj?utm_campaign=slack
    🚀 12
    party 7
    💪 7
    starrocks 16
  • g

    GitHub

    10/21/2025, 8:27 AM
    Release - 3.5.7 New release published by yingtingdong ### Improvements • Improved memory statistics accuracy for Scan operators by introducing retry backoff under heavy memory contention scenarios. #63788 • Optimized materialized view bucketing inference by leveraging existing tablet distribution to prevent excessive bucket creation. #63367 • Revised the Iceberg table caching mechanism to enhance consistency and reduce cache invalidation risks during frequent metadata updates. #63388 • Added the
    querySource
    field to
    QueryDetail
    and
    AuditEvent
    for better traceability of query origins across APIs and schedulers. #63480 • Enhanced Persistent Index diagnostics by printing detailed context when duplicate keys are detected in MemTable writes. #63560 • Reduced lock contention in materialized view operations by refining lock granularity and sequencing in concurrent scenarios. #63481 ### Bug Fixes The following issues have been fixed: • Materialized view rewrite failures caused by type mismatch. #63659 •
    regexp_extract_all
    has wrong behavior and lacks support for
    pos=0
    . #63626 • Degraded scan performance caused by the profitless simplification of CASE WHEN with complex functions. #63732 • Incorrect DCG data reading when partial updates switch from column mode to row mode. #61529 • A potential deadlock during initialization of
    ExceptionStackContext
    . #63776 • Crashes in Parquet numeric conversion for ARM architecture machines. #63294 • An issue caused by the aggregate intermediate type uses
    ARRAY<NULL_TYPE>
    . #63371 • Stability issue caused by incorrect overflow detection when casting LARGEINT to DECIMAL128 at sign-edge cases (for example, INT128_MIN) #63559 • LZ4 compression and decompression errors cannot be perceived. #63629 •
    ClassCastException
    when querying tables partitioned by
    FROM_UNIXTIME
    on INT-type columns. #63684 • Tablets cannot be repaired after a balance-triggered migration when the only valid source replica is marked
    DECOMMISSION
    . #62942 • Profiles lost SQL statements and Planner Trace when the PREPARE statement is used. #63519 • The
    extract_number
    ,
    extract_bool
    , and
    extract_string
    functions are not exception-safe. #63575 • Shutdown tablets cannot be garbage-collected properly. #63595 • Profiles showing SQL as
    omit
    for returns of the PREPARE/EXECUTE statements. #62988 •
    date_trunc
    partition pruning with combined predicates that mistakenly produced EMPTYSET. #63464 • Crashes in release builds due to the CHECK in NullableColumn. #63553 StarRocks/starrocks
  • b

    Beryl Chen

    10/21/2025, 9:06 PM
    Wanted to share the latest performance benchmark! yay StarRocks 4.0 delivers a 60% year-over-year performance improvement, thanks to deeper query optimizations, faster JOINs, smarter aggregations, and steady performance across JSON & Iceberg workloads. Take a look at the full breakdown → https://celerdata.com/blog/starrocks-4.0-zero-compromise-60-faster
    🚀 2
    party 1
    💪 1
    starrocks 1