Generated by
JDiff

Package org.apache.hadoop.mapred.lib

Added Classes
CombineFileInputFormat An abstract org.apache.hadoop.mapred.InputFormat that returns i int) method.
CombineFileRecordReader A generic RecordReader that can hand out different recordReaders for each chunk in a CombineFileSplit A CombineFileSplit can combine data chunks from multiple files.
CombineFileSplit A sub-collection of input files.
 

Changed Classes and Interfaces
ChainMapper The ChainMapper class allows to use multiple Mapper classes within a single Map task.
ChainReducer The ChainReducer class allows to chain multiple Mapper classes after a Reducer within the Reducer task.
DelegatingInputFormat An InputFormat that delegates behaviour of paths to multiple other InputFormats.
DelegatingMapper An Mapper that delegates behaviour of paths to multiple other mappers.
FieldSelectionMapReduce This class implements a mapper/reducer class that can be used to perform field selections in a manner similar to unix cut.
HashPartitioner Partition keys by their Object#hashCode()
IdentityMapper Implements the identity function mapping inputs directly to outputs.
IdentityReducer Performs no reduction writing all input values directly to the output.
InputSampler Utility for collecting samples and writing a partition file for org.apache.hadoop.mapred.lib.TotalOrderPartitioner
InputSampler.IntervalSampler Sample from s splits at regular intervals.
InputSampler.RandomSampler Sample from random points in the input.
InputSampler.Sampler Interface to sample using an org.apache.hadoop.mapred.InputFormat
InputSampler.SplitSampler Samples the first n records from s splits.
InverseMapper A Mapper that swaps keys and values.
KeyFieldBasedPartitioner Defines a way to partition keys based on certain key fields (also see KeyFieldBasedComparator The key specification supported is of the form -k pos1[ pos2] where pos is of the form f[.c][opts] where f is the number of the key field to use and c is the number of the first character from the beginning of the field.
LongSumReducer A Reducer that sums long values.
MultipleInputs This class supports MapReduce jobs that have multiple input paths with a different InputFormat and Mapper for each path
MultipleOutputFormat This abstract class extends the FileOutputFormat allowing to write the output data to different output files.
MultipleOutputs The MultipleOutputs class simplifies writting to additional outputs other than the job default output via the OutputCollector passed to the map() and reduce() methods of the Mapper and Reducer implementations.
MultipleSequenceFileOutputFormat This class extends the MultipleOutputFormat allowing to write the output data to different output files in sequence file output format.
MultipleTextOutputFormat This class extends the MultipleOutputFormat allowing to write the output data to different output files in Text output format.
MultithreadedMapRunner Multithreaded implementation for @link org.apache.hadoop.mapred.MapRunnable.
NLineInputFormat NLineInputFormat which splits N lines of input as one split.
NullOutputFormat Consume all outputs and put them in /dev/null.
RegexMapper A Mapper that extracts text matching a regular expression.
TokenCountMapper A Mapper that maps text values into pairs.
TotalOrderPartitioner Partitioner effecting a total order by reading split points from an externally generated source.