www.matillion.com
Open in
urlscan Pro
2a04:4e42:200::645
Public Scan
Submitted URL: https://email.matillion.com/OTkyLVVJVy03MzEAAAGGIly3CIqfxQkBeQrXuXk0iaEWNnsZNOg92a73ub7N6ZxkKF0iU0tLaAXFRu3J9vYYSLOFop8=
Effective URL: https://www.matillion.com/resources/matillion-snowflake-building-the-modern-enterprise-data-stack-in-the-data-cloud/snowfl...
Submission: On August 09 via manual from US — Scanned from DE
Effective URL: https://www.matillion.com/resources/matillion-snowflake-building-the-modern-enterprise-data-stack-in-the-data-cloud/snowfl...
Submission: On August 09 via manual from US — Scanned from DE
Form analysis
1 forms found in the DOMhttps://www.matillion.com/resources/search
<form id="uf-search-form" class="uf-search-form" role="search" aria-label="Sitewide" action="https://www.matillion.com/resources/search">
<input type="search" name="ufq" id="uf-search-input" class="uf-search-input uf-input" placeholder="Search" aria-label="Search">
<input type="hidden" name="ufs" value="10354596">
<button type="submit" id="uf-search-submit" class="uf-search-submit" title="Search sitewide">
<i class="fas fa-search" aria-hidden="true"></i>
<span class="sr-only">Search sitewide</span>
</button>
<button type="button" id="uf-search-close" class="uf-search-close" title="Close Search Box">
<i class="fas fa-times" aria-hidden="true"></i>
<span class="sr-only">Close search box</span>
</button>
</form>
Text Content
MENUMENU * Products * Matillion ETL * Matillion Data Loader * Solutions * * * TYPES OF SOLUTIONS * Data Transformation * Data Analytics * Data Integration * Data Lakes * Data Preparation * * SOLUTIONS FOR * Enterprise * SMB * Technology * Cloud Data Warehouse * Amazon Redshift * Google BigQuery * Microsoft Azure Synapse * Snowflake * Lakehouse * Delta Lake on Databricks * ELT * Integrations * Security * Pricing * Partners * Become a Partner * Partner Log-In * Resources * Blog * Case Studies * What is ETL? The Ultimate Guide * What is Data Integration? The Ultimate Guide * Data Glossary * Documentation * Matillion ETL Community * Matillion Exchange * Matillion Academy * Certifications * About * Leadership * News * Events * Brand and Media Kit * Careers * Investors * Contact * Support * Mission Critical Support * Support Portal * Sign in * Get started Close * Products * Matillion ETL * Matillion Data Loader * Solutions * Data Transformation * Data Analytics * Data Integration * Data Lakes * Data Preparation * SMB * Enterprise * Technology * Cloud Data Warehouse * Amazon Redshift * Google BigQuery * Microsoft Azure Synapse * Snowflake * ELT * Integrations * Security * Pricing * Partners * Become a Partner * Resources * Blog * Events * Case Studies * What is ETL? The Ultimate Guide * Data Glossary * Documentation * Certifications * About * Leadership * News * Events * Careers * Investors * Contact * Support * Mission Critical Support * Support Portal * Sign in * Get started Skip to main content Toggle menubar Matillion Ltd * Home * Toggle submenu for: Product * Matillion Data Loader * Matillion ETL for Snowflake * Matillion ETL for Amazon Redshift * Matillion ETL for Google BigQuery * Matillion ETL for Azure Synapse * Toggle submenu for: Content Type * Blog post * Ebooks and white papers * Webinars and virtual events * Case study * Video * Toggle submenu for: Solution * Data migration * Data lakes * Business intelligence * Machine learning * Toggle submenu for: Topic * Modernization * Data loading * Cloud data warehousing * Security * Automation * Analytics * Data transformation * Developer relations * Data Governance Open search box Search sitewide Close search box Share this Post SHARE THIS POST * Share on facebook * Share on twitter * Share on linkedin * Share on email 1. Home 2. Matillion + Snowflake: Building the Modern Enterprise Data Stack in the Data Cloud 3. Snowflake Cloud Data Platform Architecture & Basic Concepts SNOWFLAKE CLOUD DATA PLATFORM ARCHITECTURE & BASIC CONCEPTS Published Date October 27, 2020 Author Julie Polito As big data continues to get bigger, more organizations are turning to cloud data warehouses. The cloud is the only platform that provides the flexibility and scalability that are needed to accommodate today’s massive data volumes. Initially released in 2014, Snowflake is one of the top cloud data platforms on the market.. WHAT IS SNOWFLAKE CLOUD DATA PLATFORM? Snowflake is a cloud data platform that’s provided as a fully-managed service. It can be used for data warehousing, data lakes, data engineering, data analytics, data science, data application development, and for securely sharing and consuming shared data. One cool feature of Snowflake is that it supports a near-unlimited number of concurrent workloads, so your users can always do what they need to do, when they need to do it. WHAT IS THE SNOWFLAKE CLOUD DATA PLATFORM ARCHITECTURE? Snowflake has a fairly unique architecture. The platform includes storage, compute, and cloud services layers that are physically separated but logically integrated. This means that you can enable virtually all of your users and data workloads to access a single copy of your data without impacting performance. With everyone accessing the same version of your data, there are no data silos. Everyone has the same source of the truth. Snowflake charges by credits. A Snowflake credit is a unit of measure that’s used to pay for the consumption of resources on Snowflake. A Snowflake credit is consumed when a customer is using resources, such as when a virtual warehouse is running, the cloud services layer is being used, or serverless features are being used. Here are some other features of Snowflake: Hybrid Snowflake’s architecture is a hybrid of shared-disk and shared-nothing architectures. Shared-nothing architecture is a distributed architecture, where each node is independent and self-sufficient. In shared-disk architectures, all data is accessible from all cluster nodes. Snowflake combines these two architectures, using a central data repository for persisted data that is accessible from all compute nodes. When processing queries, Snowflake uses massively parallel processing (MPP) compute clusters, and each node in the cluster stores a portion of the data set locally. With this hybrid model, Snowflake has the data management simplicity of a shared-disk architecture plus the performance benefits of a shared-nothing architecture. Cloud agnostic Unlike many cloud data warehouses, Snowflake doesn’t run on its own cloud. It is available globally on Amazon Web Services (AWS), Google Cloud Platform (GCP), and Microsoft Azure. Because it has a common and interchangeable code base, you can move your data to any cloud in any region, without having to re-do your application code. However, Snowflake cannot run on a private cloud infrastructure, either on-premises or hosted. Separate storage and compute Snowflake’s architecture separates storage from compute. This means that users aren’t competing for resources. Further, there are no limits on the number of queries or workloads that can be run simultaneously, and no limits on the number of users accessing data. All workloads can simultaneously leverage the compute power they need, when they need it. Three-layered Snowflake’s hybrid architecture has three layers that scale independently of one another: the database storage layer, the cloud services layer, and the query processing layer. * Database storage: Snowflake has a scalable cloud blob storage for storing structured and semi-structured data, including JSON, AVRO, and Parquet. The storage layer contains tables, schemas, databases, and diverse data. Tables can store multiple petabytes of data. Data in Snowflake tables is automatically divided into micro-partitions, which are contiguous units of storage. Each micro-partition contains between 50 MB and 500 MB of uncompressed data. * Cloud services layer: The cloud services layer provides services such as authentication, infrastructure management, and access control. It also provides metadata management. * Query processing layer: The query processing layer handles query execution. Snowflake processes queries using “virtual warehouses.” Each virtual warehouse is an MPP compute cluster made up of multiple compute nodes and each virtual warehouse is an independent compute cluster. As a result, each virtual warehouse operates independently and has no impact on the performance of the other virtual warehouses. Supports a range of data types Snowflake supports a broad range of data types and can store them in their native forms, so you’re not creating new data silos. Scales elastically Snowflake provides automatic cloud elasticity, so when you need more capacity resources, Snowflake automatically adds them. You only pay for what you use. SNOWFLAKE CLOUD DATA PLATFORM BEST PRACTICES Here are a few best practices for using Snowflake efficiently and economically: CHOOSE YOUR WAREHOUSE SIZE BASED ON QUERY TYPE Snowflake uses per-second billing, so the size of the platform you choose doesn’t necessarily matter. In fact, you can run larger platforms (sizes run from X-Small to 4X-Large) and then just suspend them when they’re not in use. What’s more important than the size of your warehouse is the type of queries you’ll be running. Data engineers may want to create separate platforms for different environments such as development, testing, and production. A smaller platform is likely sufficient for development or testing environments. For production environments, larger platforms sizes are usually necessary. Snowflake recommends that users experiment with different types of queries and different platform sizes to determine the combinations that best meet your specific query needs and workload. USE SEPARATE WAREHOUSES FOR LOAD AND QUERY OPERATIONS Loading large datasets can impact your query performance. Snowflake therefore recommends dedicating separate instances for loading and querying operations to optimize performance for both loading and querying. Another issue to consider regarding data loading is that loading performance is influenced more by the number of files being loaded along with the size of each file than by the size of the warehouse. A smaller instance may therefore be sufficient for data loading purposes. MINIMIZE SMALL QUERIES As we’ve already learned, the Snowflake architecture separates its platform into three distinct functions: compute resources (implemented as virtual warehouses), data storage, and cloud services. The costs associated with using Snowflake are based on your usage of each of these functions. Because Snowflake uses per-second billing, it’s not cost-effective to run small queries. Using Snowflake for small queries is kind of like using a backhoe when you really need a hand shovel. ENABLE AUTO-SUSPENSION AND AUTO-RESUMPTION Snowflake provides features that can help you save credits and therefore reduce costs. Warehouses do not accrue credit usage when they’re suspended. Therefore, if you enable the auto-suspension and auto-resumption features, you can help cut costs. With these features enabled, Snowflake will automatically suspend an instance after a specified period of inactivity. It will also enable the platform after you submit a query it, and the instance is the current one for the session. WANT TO LEARN MORE ABOUT SNOWFLAKE CLOUD DATA PLATFORM? Modern businesses seeking a competitive advantage must harness their data to gain better business insights. Matillion enables your data journey by extracting and loading your data and transforming it in the cloud, allowing you to be more productive, gain new insights and make better business decisions. Effortlessly load source system data into your cloud data warehouse with Matillion Data Loader, a free SaaS-based data integration tool. * Previous Article Matillion Named Snowflake Data Integration Partner of the Year for 2021 There are some milestones in Matillion’s growth that are unforgettable. Debuting Matillion ETL to the world at Amazon re:Invent. Gaining the support of valuable investors and partners on our... * Next Video Case Study | Cisco Meraki - Why Matillion OTHER CONTENT IN THIS STREAM Show previous Show next * 4 months ago PACIFIC LIFE RAPIDLY ACCELERATES DATA DELIVERY WITH SNOWFLAKE AND MATILLION Read Flipbook * 7 months ago MATILLION & SNOWFLAKE: THE BUSINESS BENEFITS OF DATA TRANSFORMATION Read Flipbook * about 1 year ago MATILLION NAMED SNOWFLAKE DATA INTEGRATION PARTNER OF THE YEAR FOR 2021 There are some milestones in Matillion’s growth that are unforgettable. Debuting Matillion ETL to the world at Amazon re:Invent. Gaining the support of valuable investors and partners on our... Read Article * 1:24 6 months ago CASE STUDY | CISCO MERAKI - WHY MATILLION Watch Video * 7 months ago SUPERCHARGE YOUR SNOWFLAKE DATA CLOUD Matillion ETL for Snowake helps data teams get things done faster in the cloud. Unlock the value of your data cloud today with low-code data ingestion, transformation, and Snowflake platform control. Read Flipbook * 7 months ago SLACK REDUCES TIME TO CREATE CRITICAL REVENUE METRICS FROM 6 HOURS TO 30 MINUTES Slack implemented an analytics data warehouse for business systems datasets with Snowflake and uses Matillion ETL for Snowflake to empower its employees with access to timely, accurate data... Read Flipbook * 0:29 almost 2 years ago INTRODUCING MATILLION ETL FOR SNOWFLAKE Matillion ETL for Snowflake simplifies data management on Snowflake and unlocks Snowflake's power as a data transformation platform. Available today on the AWS and Azure Marketplaces. Watch Video * 7 months ago OPTIMIZING SNOWFLAKE: A REAL WORLD GUIDE You now need to make the most of the Snowflake platform to get the most out of your data. That’s where this eBook comes in handy. It introduces the Snowflake platform and helps you understand the... Read Flipbook Return to Home © Matillion Ltd × × Matillion * Home * Products * Pricing * Customer Success * Cloud Data Warehouse * Lakehouse About Us * Company * Leadership * Careers * Press Solutions * Data Transformation * Data Analytics * Data Integration * Data Lakes * Data Preparation * SMB * Enterprise Products * Matillion ETL * Matillion Data Loader Resources * ELT Guide * Support * Integrations * Partners * Events * Matillion ETL Community * Matillion Exchange * Matillion Academy * Certifications Terms & Privacy * * * * * * Get a Demo * Get Matillion Press again to continue 0/1 Notice We and selected third parties use cookies or similar technologies for technical purposes and, with your consent, for other purposes as specified in the cookie policy. Denying consent may make related features unavailable. In case of sale of your personal information, you may opt out by using the link "Do Not Sell My Personal Information". To find out more about the categories of personal information collected and the purposes for which such information will be used, please refer to our privacy policy. You can consent to the use of such technologies by using the “Accept” button. Learn more and customize Accept ×