March 26th, 2014 - 10:00am to 11:00am
Joyent Manta helps you process large data sets in parallel, and now it's easily accessible to R language users.
In Part 2 of this webinar series, Dr. Christopher Hogue will go into further detail on the Map/Reduce system in Manta as invoked from the R workspace. Watch how Manta compute jobs marshal hundreds of CPUs to do Map/Reduce operations without complex Hadoop setups and without moving data from the object store, all from the R command line. View Part 1 here.
R data and R workspaces are easily stored, managed, retrieved and published on Manta's hierarchical object storage file system. Using R and Manta together, R analysts can comb through extremely large datasets in a familiar environment. By using a cloud-based service, you are freed from system management and operational overhead and can focus on the data.
Joyent Manta is a secure, distributed object storage service with integrated high-performance computing. Big Data analytics on Manta takes advantage of the supercomputing capacity of the Manta system, which computes on data in place, where it is stored.
Interactive R support for Joyent Manta arrived in early January as the mantaRSDK, which provides R users on Windows, Mac OS X and Linux/Unix platforms with a fully documented R package, including help for each function and a simple-to-remember R command-line syntax.
Christopher Hogue's blog post on the mantaRSDK is here.