www.matillion.com Open in urlscan Pro
2a04:4e42:200::645  Public Scan

Submitted URL: https://email.matillion.com/OTkyLVVJVy03MzEAAAGGIly3CIqfxQkBeQrXuXk0iaEWNnsZNOg92a73ub7N6ZxkKF0iU0tLaAXFRu3J9vYYSLOFop8=
Effective URL: https://www.matillion.com/resources/matillion-snowflake-building-the-modern-enterprise-data-stack-in-the-data-cloud/snowfl...
Submission: On August 09 via manual from US — Scanned from DE

Form analysis 1 forms found in the DOM

https://www.matillion.com/resources/search

<form id="uf-search-form" class="uf-search-form" role="search" aria-label="Sitewide" action="https://www.matillion.com/resources/search">
  <input type="search" name="ufq" id="uf-search-input" class="uf-search-input uf-input" placeholder="Search" aria-label="Search">
  <input type="hidden" name="ufs" value="10354596">
  <button type="submit" id="uf-search-submit" class="uf-search-submit" title="Search sitewide">
    <i class="fas fa-search" aria-hidden="true"></i>
    <span class="sr-only">Search sitewide</span>
  </button>
  <button type="button" id="uf-search-close" class="uf-search-close" title="Close Search Box">
    <i class="fas fa-times" aria-hidden="true"></i>
    <span class="sr-only">Close search box</span>
  </button>
</form>

Text Content

MENUMENU
 * Products
   * Matillion ETL
   * Matillion Data Loader
 * Solutions
   
   * * * TYPES OF
         SOLUTIONS
         
          * Data Transformation
          * Data Analytics
          * Data Integration
          * Data Lakes
          * Data Preparation
     
     * * SOLUTIONS FOR
         
          * Enterprise
          * SMB
 * Technology
   * Cloud Data Warehouse
     * Amazon Redshift
     * Google BigQuery
     * Microsoft Azure Synapse
     * Snowflake
   * Lakehouse
     * Delta Lake on Databricks
   * ELT
   * Integrations
   * Security
 * Pricing
 * Partners
   * Become a Partner
   * Partner Log-In
 * Resources
   * Blog
   * Case Studies
   * What is ETL? The Ultimate Guide
   * What is Data Integration? The Ultimate Guide
   * Data Glossary
   * Documentation
   * Matillion ETL Community
   * Matillion Exchange
   * Matillion Academy
   * Certifications
 * About
   * Leadership
   * News
   * Events
   * Brand and Media Kit
   * Careers
   * Investors
   * Contact
 * Support
   * Mission Critical Support
   * Support Portal
 * Sign in
 * Get started

Close
 * Products
   * Matillion ETL
   * Matillion Data Loader
 * Solutions
   * Data Transformation
   * Data Analytics
   * Data Integration
   * Data Lakes
   * Data Preparation
   * SMB
   * Enterprise
 * Technology
   * Cloud Data Warehouse
     * Amazon Redshift
     * Google BigQuery
     * Microsoft Azure Synapse
     * Snowflake
   * ELT
   * Integrations
   * Security
 * Pricing
 * Partners
   * Become a Partner
 * Resources
   * Blog
   * Events
   * Case Studies
   * What is ETL? The Ultimate Guide
   * Data Glossary
   * Documentation
   * Certifications
 * About
   * Leadership
   * News
   * Events
   * Careers
   * Investors
   * Contact
 * Support
   * Mission Critical Support
   * Support Portal
 * Sign in
 * Get started

Skip to main content

Toggle menubar

Matillion Ltd

 * Home
 * Toggle submenu for: Product
   * Matillion Data Loader
   * Matillion ETL for Snowflake
   * Matillion ETL for Amazon Redshift
   * Matillion ETL for Google BigQuery
   * Matillion ETL for Azure Synapse
 * Toggle submenu for: Content Type
   * Blog post
   * Ebooks and white papers
   * Webinars and virtual events
   * Case study
   * Video
 * Toggle submenu for: Solution
   * Data migration
   * Data lakes
   * Business intelligence
   * Machine learning
 * Toggle submenu for: Topic
   * Modernization
   * Data loading
   * Cloud data warehousing
   * Security
   * Automation
   * Analytics
   * Data transformation
   * Developer relations
   * Data Governance

Open search box
Search sitewide Close search box
Share this Post


SHARE THIS POST

 * Share on facebook
 * Share on twitter
 * Share on linkedin
 * Share on email

 1. Home
 2. Matillion + Snowflake: Building the Modern Enterprise Data Stack in the Data
    Cloud
 3. Snowflake Cloud Data Platform Architecture & Basic Concepts


SNOWFLAKE CLOUD DATA PLATFORM ARCHITECTURE & BASIC CONCEPTS

Published Date October 27, 2020 Author Julie Polito



 

As big data continues to get bigger, more organizations are turning to cloud
data warehouses. The cloud is the only platform that provides the flexibility
and scalability that are needed to accommodate today’s massive data volumes.
Initially released in 2014, Snowflake is one of the top cloud data platforms on
the market..

 


WHAT IS SNOWFLAKE CLOUD DATA PLATFORM?

 

Snowflake is a cloud data platform that’s provided as a fully-managed service.
It can be used for data warehousing, data lakes, data engineering, data
analytics, data science, data application development, and for securely sharing
and consuming shared data. One cool feature of Snowflake is that it supports a
near-unlimited number of concurrent workloads, so your users can always do what
they need to do, when they need to do it.

 


WHAT IS THE SNOWFLAKE CLOUD DATA PLATFORM ARCHITECTURE?

 

Snowflake has a fairly unique architecture. The platform includes storage,
compute, and cloud services layers that are physically separated but logically
integrated. This means that you can enable virtually all of your users and data
workloads to access a single copy of your data without impacting performance.
With everyone accessing the same version of your data, there are no data silos.
Everyone has the same source of the truth.

Snowflake charges by credits. A Snowflake credit is a unit of measure that’s
used to pay for the consumption of resources on Snowflake. A Snowflake credit is
consumed when a customer is using resources, such as when a virtual warehouse is
running, the cloud services layer is being used, or serverless features are
being used.

 

Here are some other features of Snowflake:

 

Hybrid Snowflake’s architecture is a hybrid of shared-disk and shared-nothing
architectures. Shared-nothing architecture is a distributed architecture, where
each node is independent and self-sufficient. In shared-disk architectures, all
data is accessible from all cluster nodes. Snowflake combines these two
architectures, using a central data repository for persisted data that is
accessible from all compute nodes. When processing queries, Snowflake uses
massively parallel processing (MPP) compute clusters, and each node in the
cluster stores a portion of the data set locally. With this hybrid model,
Snowflake has the data management simplicity of a shared-disk architecture plus
the performance benefits of a shared-nothing architecture.

Cloud agnostic Unlike many cloud data warehouses, Snowflake doesn’t run on its
own cloud. It is available globally on Amazon Web Services (AWS), Google Cloud
Platform (GCP), and Microsoft Azure. Because it has a common and interchangeable
code base, you can move your data to any cloud in any region, without having to
re-do your application code. However, Snowflake cannot run on a private cloud
infrastructure, either on-premises or hosted.

Separate storage and compute Snowflake’s architecture separates storage from
compute. This means that users aren’t competing for resources. Further, there
are no limits on the number of queries or workloads that can be run
simultaneously, and no limits on the number of users accessing data. All
workloads can simultaneously leverage the compute power they need, when they
need it.

Three-layered Snowflake’s hybrid architecture has three layers that scale
independently of one another: the database storage layer, the cloud services
layer, and the query processing layer.

 * Database storage: Snowflake has a scalable cloud blob storage for storing
   structured and semi-structured data, including JSON, AVRO, and Parquet. The
   storage layer contains tables, schemas, databases, and diverse data. Tables
   can store multiple petabytes of data. Data in Snowflake tables is
   automatically divided into micro-partitions, which are contiguous units of
   storage. Each micro-partition contains between 50 MB and 500 MB of
   uncompressed data.
 * Cloud services layer: The cloud services layer provides services such as
   authentication, infrastructure management, and access control. It also
   provides metadata management.
 * Query processing layer: The query processing layer handles query execution.
   Snowflake processes queries using “virtual warehouses.” Each virtual
   warehouse is an MPP compute cluster made up of multiple compute nodes and
   each virtual warehouse is an independent compute cluster. As a result, each
   virtual warehouse operates independently and has no impact on the performance
   of the other virtual warehouses.

Supports a range of data types Snowflake supports a broad range of data types
and can store them in their native forms, so you’re not creating new data silos.

Scales elastically Snowflake provides automatic cloud elasticity, so when you
need more capacity resources, Snowflake automatically adds them. You only pay
for what you use.

 


SNOWFLAKE CLOUD DATA PLATFORM BEST PRACTICES

 

Here are a few best practices for using Snowflake efficiently and economically:

 


CHOOSE YOUR WAREHOUSE SIZE BASED ON QUERY TYPE

 

Snowflake uses per-second billing, so the size of the platform you choose
doesn’t necessarily matter. In fact, you can run larger platforms (sizes run
from X-Small to 4X-Large) and then just suspend them when they’re not in use.

What’s more important than the size of your warehouse is the type of queries
you’ll be running. Data engineers may want to create separate platforms for
different environments such as development, testing, and production. A smaller
platform is likely sufficient for development or testing environments. For
production environments, larger platforms sizes are usually necessary.

Snowflake recommends that users experiment with different types of queries and
different platform sizes to determine the combinations that best meet your
specific query needs and workload.

 


USE SEPARATE WAREHOUSES FOR LOAD AND QUERY OPERATIONS

 

Loading large datasets can impact your query performance. Snowflake therefore
recommends dedicating separate instances for loading and querying operations to
optimize performance for both loading and querying. Another issue to consider
regarding data loading is that loading performance is influenced more by the
number of files being loaded along with the size of each file than by the size
of the warehouse. A smaller instance may therefore be sufficient for data
loading purposes.

 


MINIMIZE SMALL QUERIES

 

As we’ve already learned, the Snowflake architecture separates its platform into
three distinct functions: compute resources (implemented as virtual warehouses),
data storage, and cloud services. The costs associated with using Snowflake are
based on your usage of each of these functions. Because Snowflake uses
per-second billing, it’s not cost-effective to run small queries. Using
Snowflake for small queries is kind of like using a backhoe when you really need
a hand shovel.

 


ENABLE AUTO-SUSPENSION AND AUTO-RESUMPTION

 

Snowflake provides features that can help you save credits and therefore reduce
costs. Warehouses do not accrue credit usage when they’re suspended. Therefore,
if you enable the auto-suspension and auto-resumption features, you can help cut
costs. With these features enabled, Snowflake will automatically suspend an
instance after a specified period of inactivity. It will also enable the
platform after you submit a query it, and the instance is the current one for
the session.


WANT TO LEARN MORE ABOUT SNOWFLAKE CLOUD DATA PLATFORM?

 

Modern businesses seeking a competitive advantage must harness their data to
gain better business insights. Matillion enables your data journey by extracting
and loading your data and transforming it in the cloud, allowing you to be more
productive, gain new insights and make better business decisions.

 

Effortlessly load source system data into your cloud data warehouse with
Matillion Data Loader, a free SaaS-based data integration tool.

 * Previous Article
   
   Matillion Named Snowflake Data Integration Partner of the Year for 2021
   
     There are some milestones in Matillion’s growth that are unforgettable.
   Debuting Matillion ETL to the world at Amazon re:Invent. Gaining the support
   of valuable investors and partners on our...

 * Next Video
   
   Case Study | Cisco Meraki - Why Matillion


OTHER CONTENT IN THIS STREAM

Show previous Show next
 * 4 months ago
   
   
   PACIFIC LIFE RAPIDLY ACCELERATES DATA DELIVERY WITH SNOWFLAKE AND MATILLION
   
   Read Flipbook

 * 7 months ago
   
   
   MATILLION & SNOWFLAKE: THE BUSINESS BENEFITS OF DATA TRANSFORMATION
   
   Read Flipbook

 * about 1 year ago
   
   
   MATILLION NAMED SNOWFLAKE DATA INTEGRATION PARTNER OF THE YEAR FOR 2021
   
     There are some milestones in Matillion’s growth that are unforgettable.
   Debuting Matillion ETL to the world at Amazon re:Invent. Gaining the support
   of valuable investors and partners on our...
   
   Read Article

 * 1:24
   6 months ago
   
   
   CASE STUDY | CISCO MERAKI - WHY MATILLION
   
   Watch Video

 * 7 months ago
   
   
   SUPERCHARGE YOUR SNOWFLAKE DATA CLOUD
   
   Matillion ETL for Snowake helps data teams get things done faster in the
   cloud. Unlock the value of your data cloud today with low-code data
   ingestion, transformation, and Snowflake platform control.
   
   Read Flipbook

 * 7 months ago
   
   
   SLACK REDUCES TIME TO CREATE CRITICAL REVENUE METRICS FROM 6 HOURS TO 30
   MINUTES
   
   Slack implemented an analytics data warehouse for business systems datasets
   with Snowflake and uses Matillion ETL for Snowflake to empower its employees
   with access to timely, accurate data...
   
   Read Flipbook

 * 0:29
   almost 2 years ago
   
   
   INTRODUCING MATILLION ETL FOR SNOWFLAKE
   
   Matillion ETL for Snowflake simplifies data management on Snowflake and
   unlocks Snowflake's power as a data transformation platform. Available today
   on the AWS and Azure Marketplaces.
   
   Watch Video

 * 7 months ago
   
   
   OPTIMIZING SNOWFLAKE: A REAL WORLD GUIDE
   
   You now need to make the most of the Snowflake platform to get the most out
   of your data. That’s where this eBook comes in handy. It introduces the
   Snowflake platform and helps you understand the...
   
   Read Flipbook

Return to Home
© Matillion Ltd
×
×
Matillion
 * Home
 * Products
 * Pricing
 * Customer Success
 * Cloud Data Warehouse
 * Lakehouse

About Us
 * Company
 * Leadership
 * Careers
 * Press

Solutions
 * Data Transformation
 * Data Analytics
 * Data Integration
 * Data Lakes
 * Data Preparation
 * SMB
 * Enterprise

Products
 * Matillion ETL
 * Matillion Data Loader

Resources
 * ELT Guide
 * Support
 * Integrations
 * Partners
 * Events
 * Matillion ETL Community
 * Matillion Exchange
 * Matillion Academy
 * Certifications

Terms & Privacy
 * 
 * 
 * 
 * 
 * 

 * Get a Demo
 * Get Matillion

Press again to continue 0/1
Notice

We and selected third parties use cookies or similar technologies for technical
purposes and, with your consent, for other purposes as specified in the cookie
policy. Denying consent may make related features unavailable.

In case of sale of your personal information, you may opt out by using the link
"Do Not Sell My Personal Information".

To find out more about the categories of personal information collected and the
purposes for which such information will be used, please refer to our privacy
policy.



You can consent to the use of such technologies by using the “Accept” button.

Learn more and customize
Accept
×