0

I'm using the new Hadoop API and as there is no way to explicitly specify the number of mappers (unlike the old API), I need to change the size of the data chunks so that I can control the number of mappers. How to change the default size of the data split in the code?

HHH
  • 6,085
  • 20
  • 92
  • 164

1 Answers1

0

Number of Mappers is determined by (File) Splits
The split size is determined by the InputFormat being used.
mapred.max.split.size parameter will let you define the split size.

Jasper
  • 8,440
  • 31
  • 92
  • 133