invoke the reducer cleanup if there is one
copy all output files to the destinations specified by sink files
copy the success file to the destinations specified by sink files
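The two copy steps above could be sketched as follows. This is only an illustration on the local file system using `java.nio.file`; the real job presumably goes through Hadoop's file system API. `OutputCopySketch`, `copyOutputs`, and `copySuccessFile` are hypothetical names, and `_SUCCESS` is assumed to be the marker file, following Hadoop's usual convention.

```scala
import java.nio.file.{Files, Path, StandardCopyOption}

// Hypothetical sketch: a local-file-system stand-in for the sink copy steps.
object OutputCopySketch {
  // Assumed marker name, following Hadoop's convention for completed jobs.
  val SuccessFile = "_SUCCESS"

  // Regular files directly under `dir` (empty if `dir` is not a directory).
  private def filesIn(dir: Path): Seq[java.io.File] =
    Option(dir.toFile.listFiles).map(_.toSeq).getOrElse(Seq.empty).filter(_.isFile)

  // Copy one file into `destination`, creating the directory if needed.
  private def copyInto(file: java.io.File, destination: Path): Unit = {
    Files.createDirectories(destination)
    Files.copy(file.toPath, destination.resolve(file.getName), StandardCopyOption.REPLACE_EXISTING)
  }

  // Copy every output file except the success marker to each sink destination.
  def copyOutputs(outputDir: Path, destinations: Seq[Path]): Unit =
    for (dest <- destinations; f <- filesIn(outputDir) if f.getName != SuccessFile)
      copyInto(f, dest)

  // Copy the success marker, if present, to each sink destination.
  def copySuccessFile(outputDir: Path, destinations: Seq[Path]): Unit =
    for (dest <- destinations; f <- filesIn(outputDir) if f.getName == SuccessFile)
      copyInto(f, dest)
}
```

Keeping the marker in a separate step lets a caller copy it last, so that a destination never shows a success file before its data files.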
use the output converter of a sink to convert a value to a key/value pair
create an emitter to output values on the current tag for each sink. Values are converted to (key, value) pairs using the sink's output converter. This emitter is used by both the GbkOutputChannel and the BypassOutputChannel
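A minimal sketch of such an emitter, assuming simplified stand-in types. `OutputConverter`, `Sink`, and `TaggedEmitter` as defined here are hypothetical and are not Scoobi's actual classes; they only illustrate the conversion of each value through the sink's converter before it is written on the channel's tag.

```scala
import scala.collection.mutable.ListBuffer

// Hypothetical stand-ins; these are not Scoobi's actual types.
object EmitterSketch {
  // Turns a value of type B into the key/value pair a sink can store.
  trait OutputConverter[K, V, B] { def toKeyValue(b: B): (K, V) }

  // A sink that records what was written to it, together with the tag.
  final class Sink[K, V, B](val converter: OutputConverter[K, V, B]) {
    val written: ListBuffer[(Int, K, V)] = ListBuffer.empty
    def write(tag: Int, kv: (K, V)): Unit = written += ((tag, kv._1, kv._2))
  }

  // Emits each value on `tag` to every sink, converting it with that sink's converter.
  final class TaggedEmitter[K, V, B](tag: Int, sinks: Seq[Sink[K, V, B]]) {
    def emit(b: B): Unit = sinks.foreach(s => s.write(tag, s.converter.toKeyValue(b)))
  }
}
```

Because the emitter only depends on a tag and a list of sinks, the same class could plausibly serve both channel types mentioned above.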
store the reducer environment during the setup if there is one
return the reducer environment if there is one
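The setup, lookup, and cleanup steps for the reducer environment might fit together as in the sketch below. The `Reducer` and `Channel` types and their members are hypothetical simplifications, chosen only to show an optional environment being stored during setup and reused at cleanup.

```scala
object ReducerEnvSketch {
  // Hypothetical reducer that needs an environment loaded once per task.
  final case class Reducer[E](loadEnv: () => E, cleanup: E => Unit)

  final class Channel[E](reducer: Option[Reducer[E]]) {
    private var environment: Option[E] = None

    // setup: only the reducer needs preparing, so load and store its environment.
    def setup(): Unit = {
      environment = reducer.map(_.loadEnv())
    }

    // The stored environment, if there is a reducer and setup has run.
    def reducerEnvironment: Option[E] = environment

    // cleanup: invoke the reducer cleanup with the stored environment, if any.
    def cleanup(): Unit =
      for (r <- reducer; env <- environment) r.cleanup(env)
  }
}
```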
the last node of this channel
output nodes for this channel
all the ProcessNodes for this channel
reduce all the key/values with either the reducer or the combiner; otherwise just emit key/value pairs.
The key and values are untagged. The emitter is in charge of writing them to the proper tag, which is the channel's tag
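The dispatch described above can be sketched as follows. The function signatures are assumptions made for illustration: a combiner is modelled as a binary merge, a reducer as a function from a key and its values to output pairs, and `emit` stands in for the emitter that writes to the channel's tag. How the real implementation treats the case with both a combiner and a reducer, or with neither, is not stated in the text, so the choices below are guesses.

```scala
object ReduceSketch {
  // Hypothetical signatures; `emit` writes an untagged pair to the channel's tag.
  def reduce[K, V](
      key: K,
      values: Iterable[V], // assumed non-empty, as for any grouped key
      reducer: Option[(K, Iterable[V]) => Iterable[(K, V)]],
      combiner: Option[(V, V) => V],
      emit: ((K, V)) => Unit): Unit = {
    // Apply the combiner first when there is one (assumption).
    val combined: Option[V] = combiner.map(c => values.reduce(c))
    reducer match {
      // Reducer present: feed it the combined value, or the raw values.
      case Some(r) => r(key, combined.map(Iterable(_)).getOrElse(values)).foreach(emit)
      case None =>
        combined match {
          // Combiner only: emit the single combined pair.
          case Some(v) => emit((key, v))
          // Neither: emit one pair per value (assumption).
          case None => values.foreach(v => emit((key, v)))
        }
    }
  }
}
```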
create a ScoobiConfiguration from a Hadoop one
only the reducer needs to be set up, if there is one
all the sinks defined by the nodes of the input channel
the tag identifying a GbkOutputChannel is the groupByKey id
Output channel for a GroupByKey.
It can optionally have a reducer and / or a combiner applied to the grouped key/values.
The possible combinations are
There cannot be gbk -> reducer -> combiner, because in that case the second combiner is transformed into a parallelDo by the Optimiser
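One way to read the constraint above is that the channel shape is fully determined by two optional stages, with the combiner always ahead of the reducer. The sketch below enumerates the shapes under that reading; `GbkChannel` here is a hypothetical model, not the real class, and the enumeration is inferred from the text rather than taken from it.

```scala
object GbkShapeSketch {
  // Hypothetical model: a GroupByKey channel with an optional combiner and reducer.
  final case class GbkChannel(hasCombiner: Boolean, hasReducer: Boolean) {
    // When both are present the combiner comes before the reducer;
    // the reverse order cannot be represented by this type.
    def stages: List[String] =
      "gbk" :: List("combiner" -> hasCombiner, "reducer" -> hasReducer).collect {
        case (name, true) => name
      }

    def describe: String = stages.mkString(" -> ")
  }

  // Every shape this model allows.
  val all: List[GbkChannel] =
    for (c <- List(false, true); r <- List(false, true)) yield GbkChannel(c, r)
}
```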