Alluxio Advances Analytics and AI with NVIDIA Accelerated Computing
[ Back ]   [ More News ]   [ Home ]
Alluxio Advances Analytics and AI with NVIDIA Accelerated Computing

SAN MATEO, Calif., March 23, 2021 (GLOBE NEWSWIRE) -- Alluxio, the developer of open source cloud data orchestration software, today announced the integration of RAPIDS Accelerator for Apache Spark 3.0 with the Alluxio Data Orchestration Platform to accelerate data access on NVIDIA accelerated computing clusters for computation of both analytics and Artificial Intelligence (AI) pipelines. Validation testing of the integration for caching of large datasets and data availability for NVIDIA GPU processing showed 2x faster acceleration for a data analytics and business intelligence workload. At the same time, NVIDIA GPU clusters with Alluxio demonstrated 70% better return on investment (ROI) compared to CPU clusters.

Data processing is increasingly making use of NVIDIA GPUs for massive parallelism. This is the case for both analytics pipelines and AI / Machine Learning (ML) pipelines. Benefits from GPU acceleration for an end-to-end pipeline are limited if data access dominates the execution time. GPU-based processing drives higher data access throughput than a CPU-based cluster. With the separation of processing clusters for analytics and AI from data storage systems, accelerating data access allows for cost savings on agile business intelligence and data science workloads.

“With the advances made from the unrivaled processing power of NVIDIA’s software and hardware, the bottleneck for users is now storage access throughout the data pipeline,” said Haoyuan Li, Founder and CEO, Alluxio. “From this integration, users now benefit from the separation of processing clusters for analytics and AI from data storage systems, accelerating data access within milliseconds to make critical decisions, find efficiencies, lower cost, and improve customer experience.”

“Accelerating data processing compute speeds means that data also needs to be accessed more quickly by data science and AI applications so that the entire pipeline works in harmony,” said Scott McClellan, Senior Director, Data Science Product Group, NVIDIA. “Alluxio’s integration of RAPIDS for Apache Spark, combined with the accelerated computing power of NVIDIA GPUs, means that Alluxio Data Orchestration customers will be able to boost the efficiency of their analytics and AI workloads without any code changes.”

Key highlights of the Alluxio with RAPIDS Accelerator for Apache Spark 3.0 integration, include:

RAPIDS Accelerator for Apache Spark 3.0 with Alluxio Data Orchestration Platform integration is immediately available.

Resources

Tweet this: @Alluxio integrates @RAPIDSai to accelerate #analytics and #AI pipelines on @NVIDIAAI #GPU clusters https://bit.ly/3lEgtt0

About Alluxio

Proven at global web scale in production for modern data services, Alluxio is the creator of open source data orchestration software for the cloud. Alluxio orchestrates data closer to data analytics and AI/ML applications in any cloud across clusters, regions, and countries, providing memory-speed data access. Intelligent data tiering and data management deliver consistent high performance to customers in financial services, high tech, retail and telecommunications. Alluxio is in production use today at seven out of the top ten internet companies. Venture-backed by Andreessen Horowitz, Seven Seas Partners, and Volcanics Venture. Alluxio was founded at UC Berkeley’s AMPLab by the creators of the Tachyon open source project. For more information, contact info@alluxio.com or follow us on LinkedIn, or Twitter.

Media Contact:
Beth Winkowski
Winkowski Public Relations, LLC for Alluxio
978-649-7189
beth@alluxio.com


Primary Logo