www.alluxio.io
Open in
urlscan Pro
3.94.37.138
Public Scan
Submitted URL: http://d39kqat1wpn1o5.cloudfront.net/
Effective URL: https://www.alluxio.io/
Submission: On May 24 via api from GB — Scanned from GB
Effective URL: https://www.alluxio.io/
Submission: On May 24 via api from GB — Scanned from GB
Form analysis
2 forms found in the DOMGET /
<form role="search" method="get" class="search-form form-inline header-search" action="/"><input type="search" class="form-control form-control-sm" value="" name="s" placeholder="search"><a class="toolbar-icon-search-toggle"></a></form>
<form id="newsletter-submission-form" data-autopilot-anywhere="0001556912644788_3f1de995744343d0802a999a419685ca">
<div class="modal fade" id="newsletter-form" tabindex="-1" role="dialog" aria-labelledby="newsletter-form-label" aria-hidden="true">
<div class="modal-dialog" role="document">
<div class="modal-content">
<div class="modal-header">
<h5 class="modal-title" id="newsletter-form-label">Newsletter Signup</h5>
<button type="button" class="close" data-dismiss="modal" aria-label="Close">
<span aria-hidden="true">×</span>
</button>
</div>
<div class="modal-body">
<div>
<label for="newsletter-submission-first-name"> First Name</label>
<input type="text" name="newsletter-submission-first-name" id="newsletter-submission-first-name" required="" aria-required="true">
</div>
<div>
<label for="newsletter-submission-last-name"> Last Name</label>
<input type="text" name="newsletter-submission-last-name" id="newsletter-submission-last-name" required="" aria-required="true">
</div>
<div>
<label for="newsletter-submission-email"> Company Email</label>
<input type="text" name="newsletter-submission-email" id="newsletter-submission-email" required="" aria-required="true">
</div>
</div>
<div class="modal-footer">
<button type="button" class="btn btn-secondary" data-dismiss="modal">Close</button>
<button type="submit" class="btn btn-primary">Submit</button>
</div>
</div>
</div>
</div>
</form>
Text Content
* * Sign In * Blog * Docs * GitHub * Slack * Why Data Orchestration * Product * Alluxio Overview * Alluxio on AWS * Alluxio on GCP * Presto with Alluxio * Spark with Alluxio * Alluxio + Intel * Alluxio + NetApp * Use Cases * Zero-copy Hybrid Bursting * Zero-copy Burst Across Datacenters * Cloud Analytics Caching * Accelerated Workloads for Object Stores * Community * Alluxio Community * Powered by Alluxio * Data Orchestration Summit * Alluxio Day * Product School * Newsletter * Enterprise * Editions * Pricing * Contact Us * Resources * Downloads * Documentation * FAQ * Learning Center * Videos * Tech Talks * Slides from Talks * White Papers * Case Studies * Solution Briefs * Events * Try Alluxio PRODUCT SCHOOL COMMUNITY VIRTUAL EVENT Learn how Alluxio uses Apache Ranger’s centralized access policies to control access to virtual paths in the Alluxio virtual file system and enforce existing access policies for the HDFS under stores. register now Check out the talks from our virtual community event, Alluxio Day XII, featuring presenters from Websec, Shopee, and Alluxio. Watch on-demand > Alluxio 2.8 expands data access & security for data-driven applications in heterogeneous environments – Enhanced S3 API, data encryption & policy-driven data management, and more. Read the blog > We’re hiring! Join our team and build the future of data orchestration. See open positions > ALLUXIO ENABLES COMPUTE DATA LOCALITY Bring your data close to compute. Make your data local to compute workloads for Spark caching, Presto caching, Hive caching and more. DATA ACCESSIBILITY Make your data accessible. No matter if it sits on-prem or in the cloud, HDFS or S3, make your files and objects accessible in many different ways. DATA ON-DEMAND Make your data as elastic as compute. Effortlessly orchestrate your data for compute in any cloud, even if data is spread across multiple clouds. Schedule a meeting “ZERO-COPY” BURST USER SPOTLIGHT: WALMART Why Walmart chose Alluxio’s “Zero-Copy” burst solution: * No requirement to persist data into the cloud * Improved query performance and no network hops on recurrent queries * Lower costs without the need for creating data copies See more on how Alluxio powers Walmart’s “zero-copy” burst solution in their presentation > FEATURED USE CASES AND DEPLOYMENTS MANAGING DATA COPIES/APP CHANGES WHEN BURSTING COMPUTE TO CLOUD? Zero-copy hybrid bursting with no app changes to intelligently make remote data accessible in the public cloud. EXPANDING COMPUTE CAPACITY ACROSS GEO-DISTRIBUTED DATA CENTERS? Zero-copy bursting across data centers for Presto, Spark, and Hive with no app changes on data stored in HDFS. INTERACT WITH ALLUXIO IN ANY STACK Pick a compute. Pick a storage. Alluxio just works. * Presto * Spark * Hive * HBase * MapReduce * ML via POSIX Tutorial –> Full Docs –> -- Pointing Table location to Alluxio CREATE SCHEMA hive.web WITH (location = 'alluxio://master:port/my-table/‘) Full Docs // Using Alluxio as input and output for RDD scala> sc.textFile("alluxio://master:19998/Input") scala> rdd.saveAsTextFile("alluxio://master:19998/Output") // Using Alluxio as input and output for Dataframe scala> df = sqlContext.read.parquet("alluxio://master:19998/Input.parquet") scala> df.write.parquet("alluxio://master:19998/Output.parquet”) Full Docs -- Pointing Table location to Alluxio hive> CREATE TABLE u_user ( userid INT, age INT) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LOCATION 'alluxio://master:port/table_data'; Full Docs Create and Query table stored in Alluxio hbase(main):001:0> create 'test', 'cf' hbase(main):002:0> list ‘test' Full Docs # Running a wordcount using Alluxio as input and output $ bin/hadoop jar hadoop-mapreduce-examples-2.7.3.jar wordcount \ -libjars /<ALLUXIO_HOME>/client/alluxio-<VERSION>-client.jar \ alluxio://master:19998/wordcount/input.txt \ alluxio://master:19998/wordcount/output Full Docs # Accessing Alluxio after mounting Alluxio service to local file system $ ls /mnt/alluxio_mount $ cat /mnt/alluxio_mount/mydata.txt ALLUXIO * S3 * HDFS * Azure Blob * Google Cloud * Ceph * NFS $ ./bin/alluxio fs mount \ --option aws.accessKeyId=<AWS_ACCESS_KEY_ID> \ --option aws.secretKey=<AWS_SECRET_KEY_ID> \ alluxio://master:port/s3 s3a://<S3_BUCKET>/<S3_DIRECTORY> Full Docs $ ./bin/alluxio fs mount \ alluxio://master:port/hdfs hdfs://namenode:port/dir/ Full Docs $ ./bin/alluxio fs mount \ --option fs.azure.account.key.<AZURE_ACCOUNT>.blob.core.windows.net=<AZURE_ACCESS_KEY> \ alluxio://master:port/azure wasb://<AZURE_CONTAINER>@<AZURE_ACCOUNT>.blob.core.windows.net/<AZURE_DIRECTORY>/ Full Docs $ ./bin/alluxio fs mount \ --option fs.gcs.accessKeyId=<GCS_ACCESS_KEY_ID> \ --option fs.gcs.secretAccessKey=<GCS_SECRET_ACCESS_KEY> \ alluxio://master:port/gcs gs://<GCS_BUCKET>/<GCS_DIRECTORY> Full Docs $ ./bin/alluxio fs mount \ --option aws.accessKeyId=<AWS_ACCESS_KEY_ID> \ --option aws.secretKey=<AWS_SECRET_KEY_ID> \ --option alluxio.underfs.s3.endpoint=http://<rgw-hostname>:<rgw-port> \ --option alluxio.underfs.s3.disable.dns.buckets=true \ alluxio://master:port/ceph s3a://<S3_BUCKET>/<S3_DIRECTORY> Full Docs $ ./bin/alluxio fs mount alluxio://master:port/nfs /mnt/nfs Full Docs 4,000+ STARS The Alluxio open source project has over 4,000 GitHub Stars, growing daily. 1,000+ CONTRIBUTORS Over 1,000 contributors have made close to 30,000 commits to the Alluxio open source project. APACHE 2.0 LICENSED Alluxio is open source software released under the Apache 2.0 license. Alluxio has a vibrant developer and user community. join the community POWERED BY ALLUXIO see more WHAT’S HAPPENING Blog Deep Dive into the Implementation of Alluxio Metadata Storage This article introduces the design and implementation of metadata storage in Alluxio Master, either on heap and off heap (based on RocksDB). May 18, 2022 News Alluxio Expands Data Access and Security for Data-driven Applications in Heterogeneous Environments with v2.8 Release Alluxio, the developer of the open source data orchestration platform for data driven workloads such as large-scale analytics and AI/ML, announced the immediate availability of version 2.8 of its Data Orchestration Platform. May 16, 2022 AIThority Blog What’s New in Alluxio 2.8: Enhanced S3 API Functionality, Enterprise-grade Security and Data Migration With Better Usability and Low Cost The Alluxio 2.8 version focuses on the S3 API, enterprise-grade security, scalability and observability in data migration. Enhanced S3 API makes managing Alluxio easier than ever. Features such as encryption at rest and policy-driven data management further improve Alluxio’s functionality to support enterprise customers. May 4, 2022 Solution Brief Simplify and Accelerate Your Geo-Distributed Analytics Platform at Scale Today, many organizations are running a multitude of data-driven applications and data platforms that span multiple geographic regions and across heterogeneous environments – public, … Continued Blog From Zookeeper to Raft: How Alluxio Stores File System State with High Availability and Fault Tolerance Raft is an algorithm for state machine replication as a way to ensure high availability (HA) and fault tolerance. This blog shares how Alluxio has moved to a Zookeeper-less, built-in Raft-based journal system as a HA implementation. April 13, 2022 White Paper Spark + Alluxio Overview | Pair Spark with Alluxio to Modernize Your Data Platform By bringing Alluxio together with Spark, you can modernize your data platform in a scalable, agile, and cost-effective way. In this post, we provide … Continued * Resources * Blog * White Papers * Tech Talks * Case Studies * Events * Slides from talks * Videos * Open Source * Community * Download * Mailing List * Slack * Powered By Alluxio * Newsletter * Support * Documentation * Account Sign In * Pricing * Services & Support * Contact Us * Company * About * Careers * News & Press * Awards * Partners © Copyright 2021 Alluxio, Inc. All rights reserved. Alluxio is a trademark of Alluxio, Inc. Terms of Service | Privacy Policy NEWSLETTER SIGNUP × First Name Last Name Company Email Close Submit