www.alluxio.io Open in urlscan Pro
3.94.37.138  Public Scan

Submitted URL: http://d39kqat1wpn1o5.cloudfront.net/
Effective URL: https://www.alluxio.io/
Submission: On May 24 via api from GB — Scanned from GB

Form analysis 2 forms found in the DOM

GET /

<form role="search" method="get" class="search-form form-inline header-search" action="/"><input type="search" class="form-control form-control-sm" value="" name="s" placeholder="search"><a class="toolbar-icon-search-toggle"></a></form>

<form id="newsletter-submission-form" data-autopilot-anywhere="0001556912644788_3f1de995744343d0802a999a419685ca">
  <div class="modal fade" id="newsletter-form" tabindex="-1" role="dialog" aria-labelledby="newsletter-form-label" aria-hidden="true">
    <div class="modal-dialog" role="document">
      <div class="modal-content">
        <div class="modal-header">
          <h5 class="modal-title" id="newsletter-form-label">Newsletter Signup</h5>
          <button type="button" class="close" data-dismiss="modal" aria-label="Close">
            <span aria-hidden="true">×</span>
          </button>
        </div>
        <div class="modal-body">
          <div>
            <label for="newsletter-submission-first-name"> First Name</label>
            <input type="text" name="newsletter-submission-first-name" id="newsletter-submission-first-name" required="" aria-required="true">
          </div>
          <div>
            <label for="newsletter-submission-last-name"> Last Name</label>
            <input type="text" name="newsletter-submission-last-name" id="newsletter-submission-last-name" required="" aria-required="true">
          </div>
          <div>
            <label for="newsletter-submission-email"> Company Email</label>
            <input type="text" name="newsletter-submission-email" id="newsletter-submission-email" required="" aria-required="true">
          </div>
        </div>
        <div class="modal-footer">
          <button type="button" class="btn btn-secondary" data-dismiss="modal">Close</button>
          <button type="submit" class="btn btn-primary">Submit</button>
        </div>
      </div>
    </div>
  </div>
</form>

Text Content

 * 
 * Sign In
 * Blog
 * Docs
 * GitHub
 * Slack

 * Why Data Orchestration
 * Product
   * Alluxio Overview
   * Alluxio on AWS
   * Alluxio on GCP
   * Presto with Alluxio
   * Spark with Alluxio
   * Alluxio + Intel
   * Alluxio + NetApp
 * Use Cases
   * Zero-copy Hybrid Bursting
   * Zero-copy Burst Across Datacenters
   * Cloud Analytics Caching
   * Accelerated Workloads for Object Stores
 * Community
   * Alluxio Community
   * Powered by Alluxio
   * Data Orchestration Summit
   * Alluxio Day
   * Product School
   * Newsletter
 * Enterprise
   * Editions
   * Pricing
   * Contact Us
 * Resources
   * Downloads
   * Documentation
   * FAQ
   * Learning Center
   * Videos
   * Tech Talks
   * Slides from Talks
   * White Papers
   * Case Studies
   * Solution Briefs
   * Events
 * Try Alluxio


PRODUCT SCHOOL

COMMUNITY VIRTUAL EVENT

Learn how Alluxio uses Apache Ranger’s centralized access policies to control
access to virtual paths in the Alluxio virtual file system and enforce existing
access policies for the HDFS under stores.

register now

Check out the talks from our virtual community event, Alluxio Day XII, featuring
presenters from Websec, Shopee, and Alluxio.

Watch on-demand >

Alluxio 2.8 expands data access & security for data-driven applications in
heterogeneous environments – Enhanced S3 API, data encryption & policy-driven
data management, and more.

Read the blog >

We’re hiring! Join our team and build the future of data orchestration. See open
positions >

ALLUXIO ENABLES COMPUTE


DATA LOCALITY

Bring your data close to compute.
Make your data local to compute workloads for Spark caching, Presto caching,
Hive caching and more.


DATA ACCESSIBILITY

Make your data accessible.
No matter if it sits on-prem or in the cloud, HDFS or S3, make your files and
objects accessible in many different ways.


DATA ON-DEMAND

Make your data as elastic as compute.
Effortlessly orchestrate your data for compute in any cloud, even if data is
spread across multiple clouds.


Schedule a meeting

“ZERO-COPY” BURST USER SPOTLIGHT: WALMART

Why Walmart chose Alluxio’s “Zero-Copy” burst solution:

 * No requirement to persist data into the cloud

 * Improved query performance and no network hops on recurrent queries 

 * Lower costs without the need for creating data copies

See more on how Alluxio powers Walmart’s “zero-copy” burst solution in their
presentation >



FEATURED USE CASES AND DEPLOYMENTS

MANAGING DATA COPIES/APP CHANGES WHEN BURSTING COMPUTE TO CLOUD?




Zero-copy hybrid bursting with no app changes to intelligently make remote data
accessible in the public cloud.

EXPANDING COMPUTE CAPACITY ACROSS GEO-DISTRIBUTED DATA CENTERS?




Zero-copy bursting across data centers for Presto, Spark, and Hive with no app
changes on data stored in HDFS.

INTERACT WITH ALLUXIO IN ANY STACK

Pick a compute. Pick a storage. Alluxio just works.


 * Presto
 * Spark
 * Hive
 * HBase
 * MapReduce
 * ML via POSIX

Tutorial –> Full Docs –>

-- Pointing Table location to Alluxio

CREATE SCHEMA hive.web

WITH (location = 'alluxio://master:port/my-table/‘)

Full Docs

// Using Alluxio as input and output for RDD

scala> sc.textFile("alluxio://master:19998/Input")            

scala> rdd.saveAsTextFile("alluxio://master:19998/Output")

 

// Using Alluxio as input and output for Dataframe

scala> df = sqlContext.read.parquet("alluxio://master:19998/Input.parquet")

scala> df.write.parquet("alluxio://master:19998/Output.parquet”)

Full Docs

-- Pointing Table location to Alluxio

hive> CREATE TABLE u_user (

userid INT,

age INT)

ROW FORMAT DELIMITED

FIELDS TERMINATED BY '|'

LOCATION 'alluxio://master:port/table_data';

Full Docs

Create and Query table stored in Alluxio

hbase(main):001:0> create 'test', 'cf'

hbase(main):002:0> list ‘test'

Full Docs

# Running a wordcount using Alluxio as input and output

$ bin/hadoop jar hadoop-mapreduce-examples-2.7.3.jar wordcount \

  -libjars /<ALLUXIO_HOME>/client/alluxio-<VERSION>-client.jar \

  alluxio://master:19998/wordcount/input.txt \

  alluxio://master:19998/wordcount/output

Full Docs

# Accessing Alluxio after mounting Alluxio service to local file system

$ ls /mnt/alluxio_mount

$ cat /mnt/alluxio_mount/mydata.txt

ALLUXIO

 * S3
 * HDFS
 * Azure Blob
 * Google Cloud
 * Ceph
 * NFS

$ ./bin/alluxio fs mount \

--option aws.accessKeyId=<AWS_ACCESS_KEY_ID> \

--option aws.secretKey=<AWS_SECRET_KEY_ID> \

alluxio://master:port/s3 s3a://<S3_BUCKET>/<S3_DIRECTORY>

Full Docs

$ ./bin/alluxio fs mount \

alluxio://master:port/hdfs hdfs://namenode:port/dir/

Full Docs

$ ./bin/alluxio fs mount \

--option

fs.azure.account.key.<AZURE_ACCOUNT>.blob.core.windows.net=<AZURE_ACCESS_KEY> \

alluxio://master:port/azure

wasb://<AZURE_CONTAINER>@<AZURE_ACCOUNT>.blob.core.windows.net/<AZURE_DIRECTORY>/

Full Docs

$ ./bin/alluxio fs mount \

--option fs.gcs.accessKeyId=<GCS_ACCESS_KEY_ID> \

--option fs.gcs.secretAccessKey=<GCS_SECRET_ACCESS_KEY> \

alluxio://master:port/gcs gs://<GCS_BUCKET>/<GCS_DIRECTORY>

Full Docs

$ ./bin/alluxio fs mount \

--option aws.accessKeyId=<AWS_ACCESS_KEY_ID> \

--option aws.secretKey=<AWS_SECRET_KEY_ID> \

--option alluxio.underfs.s3.endpoint=http://<rgw-hostname>:<rgw-port> \

--option alluxio.underfs.s3.disable.dns.buckets=true \

alluxio://master:port/ceph s3a://<S3_BUCKET>/<S3_DIRECTORY>



Full Docs

$ ./bin/alluxio fs mount alluxio://master:port/nfs /mnt/nfs

Full Docs



4,000+
STARS


The Alluxio open source project has over 4,000 GitHub Stars, growing daily.




1,000+
CONTRIBUTORS

Over 1,000 contributors have made close to 30,000 commits to the Alluxio open
source project.



APACHE 2.0
LICENSED

Alluxio is open source software released under the Apache 2.0 license. Alluxio
has a vibrant developer and user community.

join the community



POWERED BY ALLUXIO

see more



WHAT’S HAPPENING


Blog
Deep Dive into the Implementation of Alluxio Metadata Storage

This article introduces the design and implementation of metadata storage in
Alluxio Master, either on heap and off heap (based on RocksDB).

May 18, 2022
News
Alluxio Expands Data Access and Security for Data-driven Applications in
Heterogeneous Environments with v2.8 Release

Alluxio, the developer of the open source data orchestration platform for data
driven workloads such as large-scale analytics and AI/ML, announced the
immediate availability of version 2.8 of its Data Orchestration Platform.

May 16, 2022

AIThority

Blog
What’s New in Alluxio 2.8: Enhanced S3 API Functionality, Enterprise-grade
Security and Data Migration With Better Usability and Low Cost

The Alluxio 2.8 version focuses on the S3 API, enterprise-grade security,
scalability and observability in data migration. Enhanced S3 API makes managing
Alluxio easier than ever. Features such as encryption at rest and policy-driven
data management further improve Alluxio’s functionality to support
enterprise customers. 

May 4, 2022
Solution Brief
Simplify and Accelerate Your Geo-Distributed Analytics Platform at Scale

Today, many organizations are running a multitude of data-driven applications
and data platforms that span multiple geographic regions and across
heterogeneous environments – public, … Continued

Blog
From Zookeeper to Raft: How Alluxio Stores File System State with High
Availability and Fault Tolerance

Raft is an algorithm for state machine replication as a way to ensure high
availability (HA) and fault tolerance. This blog shares how Alluxio has moved to
a Zookeeper-less, built-in Raft-based journal system as a HA implementation.

April 13, 2022
White Paper
Spark + Alluxio Overview | Pair Spark with Alluxio to Modernize Your Data
Platform

By bringing Alluxio together with Spark, you can modernize your data platform in
a scalable, agile, and cost-effective way.  In this post, we provide … Continued




 * Resources
   * Blog
   * White Papers
   * Tech Talks
   * Case Studies
   * Events
   * Slides from talks
   * Videos
 * Open Source
   * Community
   * Download
   * Mailing List
   * Slack
   * Powered By Alluxio
   * Newsletter
 * Support
   * Documentation
   * Account Sign In
   * Pricing
   * Services & Support
   * Contact Us
 * Company
   * About
   * Careers
   * News & Press
   * Awards
   * Partners

© Copyright 2021 Alluxio, Inc. All rights reserved.
Alluxio is a trademark of Alluxio, Inc.
Terms of Service | Privacy Policy

NEWSLETTER SIGNUP

×
First Name
Last Name
Company Email
Close Submit