Source: https://docs.starrocks.io/en-us/main/introduction/StarRocks_intro

StarRocks

StarRocks is a next-generation, high-performance analytical data warehouse that
enables real-time, multi-dimensional, and highly concurrent data analysis.
StarRocks has an MPP architecture and is equipped with a fully vectorized
execution engine and a columnar storage engine that supports real-time updates.
It is powered by a rich set of features, including a fully customized cost-based
optimizer (CBO) and intelligent materialized views. StarRocks supports real-time
and batch data ingestion from a variety of data sources. It also allows you to
analyze data stored in data lakes directly, without migrating the data.

StarRocks is also compatible with the MySQL protocol, so you can connect to it
with MySQL clients and popular BI tools. StarRocks is highly scalable, highly
available, and easy to maintain. It is widely adopted in the industry, powering
a variety of OLAP scenarios such as real-time analytics, ad-hoc queries, and
data lake analytics.
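
Because StarRocks speaks the MySQL protocol, a stock `mysql` command-line client
can be pointed at a StarRocks frontend (FE) node. The following is a minimal
sketch, not an official procedure: it only builds the command line. It assumes
the FE listens on query port 9030 and that the initial `root` account is in use,
which may not match your cluster. The helper name `mysql_cli_command` and the
host address are hypothetical.

```python
import shlex


def mysql_cli_command(host: str, port: int = 9030, user: str = "root") -> str:
    """Build a shell-safe `mysql` client invocation for a StarRocks FE node.

    Assumptions (adjust for your cluster): 9030 is the FE query port and
    `root` is the account to log in with.
    """
    argv = ["mysql", "-h", host, "-P", str(port), "-u", user]
    # shlex.join quotes any argument that contains shell metacharacters.
    return shlex.join(argv)


if __name__ == "__main__":
    # Hypothetical FE address on the local machine.
    print(mysql_cli_command("127.0.0.1"))
```

Running the printed command in a shell opens an ordinary MySQL session against
StarRocks, so BI tools that accept a MySQL connection can be configured with the
same host, port, and user.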

Join our Slack channel for live support, discussion, and the latest community
news. You can also follow us on LinkedIn for first-hand updates on new features,
events, and community sharing.

About StarRocks
 * Introduction
 * Architecture
 * Shared-data StarRocks cluster
 * Features

Get started
 * Deploy with Docker
 * Create a table
 * Ingest and query data

Table design
 * Understand table design
 * Table types
 * Data distribution
 * Sort keys and prefix indexes

Data loading
 * Overview of data loading
 * Load data from a local file system or a streaming data source using HTTP PUT
 * Load data from HDFS
 * Load data from cloud storage
 * Continuously load data from Apache Kafka®
 * Load data using Apache Spark™
 * Load data using INSERT
 * Realtime synchronization from MySQL
 * Continuously load data from Apache Flink®

Querying
Query acceleration
 * Cost-based optimizer
 * Synchronous materialized view
 * Asynchronous materialized views
 * Colocate Join
 * Query cache

Query semi-structured data
 * JSON
 * ARRAY

Query external data sources
 * Apache Hive™
 * Apache Hudi
 * Apache Iceberg
 * Delta Lake
 * MySQL
 * Elasticsearch
 * JDBC-compatible database

Integration
 * AWS
 * Microsoft Azure Storage
 * Google Cloud Storage
 * BI tools
 * IDE tools

Administration
 * Manage a cluster
 * Scale in and out a cluster
 * Resource group
 * Privileges
 * Data recovery
 * Tune query performance

References
 * SQL reference
 * Function reference
 * Data type

FAQ
 * Ingestion and export
 * Deployment
 * SQL

Benchmarks
 * SSB benchmark
 * TPC-H benchmark

Last updated: 7/24/2023, 8:33:57 AM