I'm using the new Hadoop API, and since it offers no way to explicitly specify the number of mappers (unlike the old API), I need to change the size of the input splits in order to control how many mappers run. How can I change the default split size in code?
1 Answer
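The number of mappers is determined by the number of input splits: one map task runs per split.
The split size is determined by the InputFormat being used.
For FileInputFormat-based input formats, the mapred.max.split.size parameter sets an upper bound on the split size (on Hadoop 2 and later the same setting is named mapreduce.input.fileinputformat.split.maxsize). Setting it below the HDFS block size produces more, smaller splits and therefore more mappers.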
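In a job driver, the limit is normally set with FileInputFormat.setMaxInputSplitSize(job, bytes) from org.apache.hadoop.mapreduce.lib.input, or by setting the property on the job's Configuration with setLong. To show how that value translates into a mapper count, here is a minimal plain-Java sketch with no Hadoop dependency. It assumes a single splittable, non-empty file and mirrors the max(minSize, min(maxSize, blockSize)) rule and the 10% slack on the last split that FileInputFormat applies; the class and method names are hypothetical, and the sizes are example values only.

```java
// Plain-Java sketch of how FileInputFormat-style split sizing works.
// SplitSizeSketch and countSplits are illustrative names, not Hadoop classes.
final class SplitSizeSketch {
    // The last split may be up to 10% larger than the target split size.
    static final double SPLIT_SLOP = 1.1;

    // Split size rule: clamp the block size between the min and max split sizes.
    static long computeSplitSize(long blockSize, long minSize, long maxSize) {
        return Math.max(minSize, Math.min(maxSize, blockSize));
    }

    // Number of splits (and so map tasks) for one splittable, non-empty file.
    static long countSplits(long fileLength, long splitSize) {
        long splits = 0;
        long remaining = fileLength;
        // Cut full-size splits while more than 1.1 split sizes remain.
        while ((double) remaining / splitSize > SPLIT_SLOP) {
            splits++;
            remaining -= splitSize;
        }
        // Whatever is left becomes the final split.
        if (remaining != 0) {
            splits++;
        }
        return splits;
    }

    public static void main(String[] args) {
        long mb = 1024L * 1024L;
        long blockSize = 128 * mb; // assumed HDFS block size
        long maxSize = 32 * mb;    // example value for the max split size setting
        long splitSize = computeSplitSize(blockSize, 1L, maxSize);
        System.out.println(splitSize / mb + " MB per split");
        System.out.println(countSplits(1024 * mb, splitSize) + " splits for a 1 GB file");
    }
}
```

With these example numbers a 1 GB file is cut into 32 splits of 32 MB each, so 32 mappers would be scheduled instead of the 8 that a 128 MB block size would give by default. To get fewer mappers than blocks, the minimum split size has to be raised instead, since the maximum only caps the size.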

Jasper