Introduction to RHadoop

The classroom session covered the following:

  • MapReduce in R – An overview
  • The rmr2 package – MapReduce jobs in R
  • The rhdfs package – Interacting with HDFS
  • Input/Output formats – Different options for reading and writing data
  • Examples for discussion
  • Exercises
  • Appendix
    A: Useful links
    B: Overview of R functions used in this session
    C: More on RHive vs. RHadoop
