www.alluxio.io
Open in
urlscan Pro
3.94.37.138
Public Scan
Submitted URL: https://alluxio.apms5.com/anywhere/m?s=alluxio&m=s_b2e01875-6f4a-48f5-835c-3592f1a86824&u=e1jq4wvfdtfkectk64wk4dtm5mwkjd1h...
Effective URL: https://www.alluxio.io/
Submission: On November 23 via api from SE — Scanned from DE
Effective URL: https://www.alluxio.io/
Submission: On November 23 via api from SE — Scanned from DE
Form analysis
2 forms found in the DOMGET /
<form role="search" method="get" class="search-form form-inline header-search" action="/"><input type="search" class="form-control form-control-sm" value="" name="s" placeholder="search"><a class="toolbar-icon-search-toggle"></a></form>
<form id="newsletter-submission-form" data-autopilot-anywhere="0001556912644788_3f1de995744343d0802a999a419685ca">
<div class="modal fade" id="newsletter-form" tabindex="-1" role="dialog" aria-labelledby="newsletter-form-label" aria-hidden="true">
<div class="modal-dialog" role="document">
<div class="modal-content">
<div class="modal-header">
<h5 class="modal-title" id="newsletter-form-label">Newsletter Signup</h5>
<button type="button" class="close" data-dismiss="modal" aria-label="Close">
<span aria-hidden="true">×</span>
</button>
</div>
<div class="modal-body">
<div>
<label for="newsletter-submission-first-name"> First Name</label>
<input type="text" name="newsletter-submission-first-name" id="newsletter-submission-first-name" required="" aria-required="true">
</div>
<div>
<label for="newsletter-submission-last-name"> Last Name</label>
<input type="text" name="newsletter-submission-last-name" id="newsletter-submission-last-name" required="" aria-required="true">
</div>
<div>
<label for="newsletter-submission-email"> Company Email</label>
<input type="text" name="newsletter-submission-email" id="newsletter-submission-email" required="" aria-required="true">
</div>
</div>
<div class="modal-footer">
<button type="button" class="btn btn-secondary" data-dismiss="modal">Close</button>
<button type="submit" class="btn btn-primary">Submit</button>
</div>
</div>
</div>
</div>
</form>
Text Content
* * Sign In * Blog * Docs * GitHub * Slack * Why Data Orchestration * Product * Alluxio Overview * Alluxio on AWS * Alluxio on GCP * Presto with Alluxio * Spark with Alluxio * Alluxio + Intel * Use Cases * Zero-copy Hybrid Bursting * Zero-copy Burst Across Datacenters * Cloud Analytics Caching * Accelerated Workloads for Object Stores * Community * Alluxio Community * Powered by Alluxio * Data Orchestration Summit * Alluxio Day * Newsletter * Enterprise * Editions * Pricing * Contact Us * Resources * Downloads * Documentation * FAQ * Learning Center * Videos * Tech Talks * Slides from Talks * White Papers * Case Studies * Solution Briefs * Events * Try Alluxio ALLUXIO COMMUNITY DAY VI Virtual community event held October 12th featuring fellow Alluxio community users on how they are solving complex data access challenges. Hear Alluxio users from Facebook, Princeton, Apache Hudi, Zendesk, and Uber. WATCH ON-DEMAND Alluxio 2.7 is now available! 8-12x Improvements in AI/ML Training Efficiency and Enhanced Scalability & Stability. Learn More > Explore today’s data access challenges and open source data orchestration solutions for modernizing your platform. Watch On-Demand > We’re hiring! Join our team and build the future of data orchestration. See open positions > ALLUXIO ENABLES COMPUTE DATA LOCALITY Bring your data close to compute. Make your data local to compute workloads for Spark caching, Presto caching, Hive caching and more. DATA ACCESSIBILITY Make your data accessible. No matter if it sits on-prem or in the cloud, HDFS or S3, make your files and objects accessible in many different ways. DATA ON-DEMAND Make your data as elastic as compute. Effortlessly orchestrate your data for compute in any cloud, even if data is spread across multiple clouds. Schedule a meeting “ZERO-COPY” BURST USER SPOTLIGHT: WALMART Why Walmart chose Alluxio’s “Zero-Copy” burst solution: * No requirement to persist data into the cloud * Improved query performance and no network hops on recurrent queries * Lower costs without the need for creating data copies See more on how Alluxio powers Walmart’s “zero-copy” burst solution in their presentation > FEATURED USE CASES AND DEPLOYMENTS MANAGING DATA COPIES/APP CHANGES WHEN BURSTING COMPUTE TO CLOUD? Zero-copy hybrid bursting with no app changes to intelligently make remote data accessible in the public cloud. EXPANDING COMPUTE CAPACITY ACROSS GEO-DISTRIBUTED DATA CENTERS? Zero-copy bursting across data centers for Presto, Spark, and Hive with no app changes on data stored in HDFS. INTERACT WITH ALLUXIO IN ANY STACK Pick a compute. Pick a storage. Alluxio just works. * Presto * Spark * Hive * HBase * MapReduce * ML via POSIX Tutorial –> Full Docs –> -- Pointing Table location to Alluxio CREATE SCHEMA hive.web WITH (location = 'alluxio://master:port/my-table/‘) Full Docs // Using Alluxio as input and output for RDD scala> sc.textFile("alluxio://master:19998/Input") scala> rdd.saveAsTextFile("alluxio://master:19998/Output") // Using Alluxio as input and output for Dataframe scala> df = sqlContext.read.parquet("alluxio://master:19998/Input.parquet") scala> df.write.parquet("alluxio://master:19998/Output.parquet”) Full Docs -- Pointing Table location to Alluxio hive> CREATE TABLE u_user ( userid INT, age INT) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LOCATION 'alluxio://master:port/table_data'; Full Docs Create and Query table stored in Alluxio hbase(main):001:0> create 'test', 'cf' hbase(main):002:0> list ‘test' Full Docs # Running a wordcount using Alluxio as input and output $ bin/hadoop jar hadoop-mapreduce-examples-2.7.3.jar wordcount \ -libjars /<ALLUXIO_HOME>/client/alluxio-<VERSION>-client.jar \ alluxio://master:19998/wordcount/input.txt \ alluxio://master:19998/wordcount/output Full Docs # Accessing Alluxio after mounting Alluxio service to local file system $ ls /mnt/alluxio_mount $ cat /mnt/alluxio_mount/mydata.txt ALLUXIO * S3 * HDFS * Azure Blob * Google Cloud * Ceph * NFS $ ./bin/alluxio fs mount \ --option aws.accessKeyId=<AWS_ACCESS_KEY_ID> \ --option aws.secretKey=<AWS_SECRET_KEY_ID> \ alluxio://master:port/s3 s3a://<S3_BUCKET>/<S3_DIRECTORY> Full Docs $ ./bin/alluxio fs mount \ alluxio://master:port/hdfs hdfs://namenode:port/dir/ Full Docs $ ./bin/alluxio fs mount \ --option fs.azure.account.key.<AZURE_ACCOUNT>.blob.core.windows.net=<AZURE_ACCESS_KEY> \ alluxio://master:port/azure wasb://<AZURE_CONTAINER>@<AZURE_ACCOUNT>.blob.core.windows.net/<AZURE_DIRECTORY>/ Full Docs $ ./bin/alluxio fs mount \ --option fs.gcs.accessKeyId=<GCS_ACCESS_KEY_ID> \ --option fs.gcs.secretAccessKey=<GCS_SECRET_ACCESS_KEY> \ alluxio://master:port/gcs gs://<GCS_BUCKET>/<GCS_DIRECTORY> Full Docs $ ./bin/alluxio fs mount \ --option aws.accessKeyId=<AWS_ACCESS_KEY_ID> \ --option aws.secretKey=<AWS_SECRET_KEY_ID> \ --option alluxio.underfs.s3.endpoint=http://<rgw-hostname>:<rgw-port> \ --option alluxio.underfs.s3.disable.dns.buckets=true \ alluxio://master:port/ceph s3a://<S3_BUCKET>/<S3_DIRECTORY> Full Docs $ ./bin/alluxio fs mount alluxio://master:port/nfs /mnt/nfs Full Docs 4,000+ STARS The Alluxio open source project has over 4,000 GitHub Stars, growing daily. 1,000+ CONTRIBUTORS Over 1,000 contributors have made close to 30,000 commits to the Alluxio open source project. APACHE 2.0 LICENSED Alluxio is open source software released under the Apache 2.0 license. Alluxio has a vibrant developer and user community. join the community POWERED BY ALLUXIO * * * * * * * * * * * * * see more WHAT’S HAPPENING News Alluxio Raises $50 Million In Funding, Launches New Release Of Its Data Orchestration Platform Alluxio has raised $50 million in a Series C round of funding, capital the company will use to fuel the growth of its global operations and continue building out the capabilities of its data orchestration software for managing large-scale distributed data workloads. November 19, 2021 CRN Solution Briefs Accelerate Machine Learning with Alluxio and Latest-Gen Intel® Xeon® Scalable Processors Intel and Alluxio collaborate to measure a 20-25% price/performance improvement over the prior generation for machine learning models with PyTorch on AWS. This collaboration … Continued November 10, 2021 Blog Presto with Alluxio Overview – Architecture Evolution for Interactive Queries Alluxio is the data orchestration platform to unify data silos across heterogeneous environments. The following blog will discuss the architecture combining Spark with Alluxio. November 10, 2021 Blog Speeding Up the Atlas Supercomputing Platform with Fluid + Alluxio Unisound is an artificial intelligence company focusing on Internet of Things services. Unisound’s AI technology stacks include the perception and expression capabilities of signals, voices, images, and texts, and the cognitive technologies such as knowledge, understanding, analysis, and decision-making, towards a multi-modal AI system. Atlas is the supercomputing platform supporting all kinds of AI applications including model training and reasoning inferencing. November 8, 2021 White Papers Alluxio Use Cases Overview Alluxio started as a virtual distributed file system, a research project out of the AMPLab at U.C. Berkeley. Alluxio foresaw the need for agility … Continued October 21, 2021 On-Demand Videos Best Practice in Accelerating Data Applications with Spark+Alluxio This talk shares the designs and use cases of the Alluxio and Spark integrated solutions, as well as the best practice and “what not … Continued October 12, 2021 * Resources * Blog * White Papers * Tech Talks * Case Studies * Events * Slides from talks * Videos * Open Source * Community * Download * Mailing List * Slack * Powered By Alluxio * Newsletter * Support * Documentation * Account Sign In * Pricing * Services & Support * Contact Us * Company * About * Careers * News & Press * Awards * Partners © Copyright 2021 Alluxio, Inc. All rights reserved. Alluxio is a trademark of Alluxio, Inc. Terms of Service | Privacy Policy NEWSLETTER SIGNUP × First Name Last Name Company Email Close Submit