Introduction to Map-Reduce

The session covers the following topics:

  • Introduction to Distributed computing
  • Example: Word count using map-reduce
  • Key-value pairs – The core of map-reduce
  • The map-reduce paradigm – The step-by-step approach
  • The map phase – Extreme parallelism
  • Shuffling, Sorting – The invisible middle operations
  • The reduce phase – Bringing it all together
  • Examples and exercises

Training material