org.apache.hadoop.mapred.lib

hadoop 0.20.1 Overview Package Class Statistics Help

Generated by
JDiff

PREV PACKAGE NEXT PACKAGE FRAMES NO FRAMES DETAIL: REMOVED | ADDED | CHANGED

Package `org.apache.hadoop.mapred.lib`

Added Classes

CombineFileInputFormat An abstract org.apache.hadoop.mapred.InputFormat that returns i int) method.

CombineFileRecordReader A generic RecordReader that can hand out different recordReaders for each chunk in a CombineFileSplit A CombineFileSplit can combine data chunks from multiple files.

CombineFileSplit A sub-collection of input files.

Changed Classes and Interfaces

ChainMapper The ChainMapper class allows to use multiple Mapper classes within a single Map task.

ChainReducer The ChainReducer class allows to chain multiple Mapper classes after a Reducer within the Reducer task.

DelegatingInputFormat An InputFormat that delegates behaviour of paths to multiple other InputFormats.

DelegatingMapper An Mapper that delegates behaviour of paths to multiple other mappers.

FieldSelectionMapReduce This class implements a mapper/reducer class that can be used to perform field selections in a manner similar to unix cut.

HashPartitioner Partition keys by their Object#hashCode()

IdentityMapper Implements the identity function mapping inputs directly to outputs.

IdentityReducer Performs no reduction writing all input values directly to the output.

InputSampler Utility for collecting samples and writing a partition file for org.apache.hadoop.mapred.lib.TotalOrderPartitioner

InputSampler.IntervalSampler Sample from s splits at regular intervals.

InputSampler.RandomSampler Sample from random points in the input.

InputSampler.Sampler Interface to sample using an org.apache.hadoop.mapred.InputFormat

InputSampler.SplitSampler Samples the first n records from s splits.

InverseMapper A Mapper that swaps keys and values.

KeyFieldBasedPartitioner Defines a way to partition keys based on certain key fields (also see KeyFieldBasedComparator The key specification supported is of the form -k pos1[ pos2] where pos is of the form f[.c][opts] where f is the number of the key field to use and c is the number of the first character from the beginning of the field.

LongSumReducer A Reducer that sums long values.

MultipleInputs This class supports MapReduce jobs that have multiple input paths with a different InputFormat and Mapper for each path

MultipleOutputFormat This abstract class extends the FileOutputFormat allowing to write the output data to different output files.

MultipleOutputs The MultipleOutputs class simplifies writting to additional outputs other than the job default output via the OutputCollector passed to the map() and reduce() methods of the Mapper and Reducer implementations.

MultipleSequenceFileOutputFormat This class extends the MultipleOutputFormat allowing to write the output data to different output files in sequence file output format.

MultipleTextOutputFormat This class extends the MultipleOutputFormat allowing to write the output data to different output files in Text output format.

MultithreadedMapRunner Multithreaded implementation for @link org.apache.hadoop.mapred.MapRunnable.

NLineInputFormat NLineInputFormat which splits N lines of input as one split.

NullOutputFormat Consume all outputs and put them in /dev/null.

RegexMapper A Mapper that extracts text matching a regular expression.

TokenCountMapper A Mapper that maps text values into pairs.

TotalOrderPartitioner Partitioner effecting a total order by reading split points from an externally generated source.

hadoop 0.20.1 Overview Package Class Statistics Help

PREV PACKAGE NEXT PACKAGE FRAMES NO FRAMES

Added Classes
`CombineFileInputFormat`	An abstract org.apache.hadoop.mapred.InputFormat that returns i int) method.
`CombineFileRecordReader`	A generic RecordReader that can hand out different recordReaders for each chunk in a CombineFileSplit A CombineFileSplit can combine data chunks from multiple files.
`CombineFileSplit`	A sub-collection of input files.

Changed Classes and Interfaces
ChainMapper	The ChainMapper class allows to use multiple Mapper classes within a single Map task.
ChainReducer	The ChainReducer class allows to chain multiple Mapper classes after a Reducer within the Reducer task.
DelegatingInputFormat	An InputFormat that delegates behaviour of paths to multiple other InputFormats.
DelegatingMapper	An Mapper that delegates behaviour of paths to multiple other mappers.
FieldSelectionMapReduce	This class implements a mapper/reducer class that can be used to perform field selections in a manner similar to unix cut.
HashPartitioner	Partition keys by their Object#hashCode()
IdentityMapper	Implements the identity function mapping inputs directly to outputs.
IdentityReducer	Performs no reduction writing all input values directly to the output.
InputSampler	Utility for collecting samples and writing a partition file for org.apache.hadoop.mapred.lib.TotalOrderPartitioner
InputSampler.IntervalSampler	Sample from s splits at regular intervals.
InputSampler.RandomSampler	Sample from random points in the input.
InputSampler.Sampler	Interface to sample using an org.apache.hadoop.mapred.InputFormat
InputSampler.SplitSampler	Samples the first n records from s splits.
InverseMapper	A Mapper that swaps keys and values.
KeyFieldBasedPartitioner	Defines a way to partition keys based on certain key fields (also see KeyFieldBasedComparator The key specification supported is of the form -k pos1[ pos2] where pos is of the form f[.c][opts] where f is the number of the key field to use and c is the number of the first character from the beginning of the field.
LongSumReducer	A Reducer that sums long values.
MultipleInputs	This class supports MapReduce jobs that have multiple input paths with a different InputFormat and Mapper for each path
MultipleOutputFormat	This abstract class extends the FileOutputFormat allowing to write the output data to different output files.
MultipleOutputs	The MultipleOutputs class simplifies writting to additional outputs other than the job default output via the `OutputCollector` passed to the `map()` and `reduce()` methods of the `Mapper` and `Reducer` implementations.
MultipleSequenceFileOutputFormat	This class extends the MultipleOutputFormat allowing to write the output data to different output files in sequence file output format.
MultipleTextOutputFormat	This class extends the MultipleOutputFormat allowing to write the output data to different output files in Text output format.
MultithreadedMapRunner	Multithreaded implementation for @link org.apache.hadoop.mapred.MapRunnable.
NLineInputFormat	NLineInputFormat which splits N lines of input as one split.
NullOutputFormat	Consume all outputs and put them in /dev/null.
RegexMapper	A Mapper that extracts text matching a regular expression.
TokenCountMapper	A Mapper that maps text values into pairs.
TotalOrderPartitioner	Partitioner effecting a total order by reading split points from an externally generated source.

Package org.apache.hadoop.mapred.lib

Package `org.apache.hadoop.mapred.lib`