Introduction to Alluxio

Video description

Alluxio is the solution of choice for big companies that need to manage data at multi-petabyte scale. In this course, PMC member Calvin Jia offers a full tour of Alluxio to any data scientist, developer, or system administrator looking to improve the performance of their workloads, develop applications with Alluxio, or deploy and manage Alluxio clusters.

He offers a high-level view (why Alluxio was developed, the problems it solves, who uses it, etc.) as well as a hands-on practicum. You'll set up your own deployment (locally and in a cluster) using a compute framework on top of Alluxio, connecting it to multiple persistent data stores while preserving a single namespace. Take this course and you'll come away knowing the benefits Alluxio brings to big data stacks.

  • Understand the features and benefits of Alluxio and master the basics of how to use it
  • Discover why companies like Intel, Baidu, and Alibaba use Alluxio for their big data needs
  • Learn how the storage unification layer bridges computation frameworks and storage systems
  • Gain practical experience deploying Alluxio in local and cluster modes
  • Learn how to use Alluxio tools like the command line and the web UI
  • Explore the Alluxio open source ecosystem and learn who the players are

Calvin Jia is a software engineer at Alluxio, Inc. who co-led the "Unified Namespace and Tiered Storage in Alluxio" session at Strata+Hadoop World 2016 San Jose. He holds a Bachelor of Science (BS) degree in Electrical Engineering and Computer Science from the University of California, Berkeley.

Product information

  • Title: Introduction to Alluxio
  • Author(s): Calvin Jia
  • Release date: June 2016
  • Publisher(s): Infinite Skills
  • ISBN: 9781771376006