An association of a key to one or more values.
Store the output path of a Sink as a checkpoint.
Base trait for "computation nodes" with no generic type information for easier rewriting
Store the compression parameters for sinks.
A list that is distributed across multiple machines.
A wrapper around an object that is part of the graph of a distributed computation
An output store from a MapReduce job
Specify an object to which sinks can be added and whose sinks can be compressed.
DataSource for a computation graph.
Interface for specifying parallel operation over DLists in the absence of an environment
Interface for writing outputs from a DoFn
Interface for specifying parallel operation over DLists.
An object holder which can hold a distributed value
Define the expiry policy for checkpoint files
A distributed list of associations.
Specify the way in which key-values are "shuffled".
Implicit definitions of Grouping instances for common types.
Convert an InputFormat's key-value types to the type produced by a source
Fusion of the input and output conversion traits, for when bidirectional conversion is possible.
A non-empty iterable contains at least one element.
A non-empty iterator contains at least one element.
Convert the type consumed by a DataSink into an OutputFormat's key-value types.
This class wraps the Hadoop (mutable) configuration with additional configuration information such as the jars which should be added to the classpath.
This is a Sink which can also be used as a Source
Trait that is sub-classed by objects to provide sets of unique identifiers.
Typeclass for sending types across the Hadoop wire
Implicit definitions of WireFormat instances for common types.
Definition of the Equal instance for CompNodes
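
The "typeclass for sending types across the Hadoop wire" and its "implicit definitions ... for common types" can be illustrated with a minimal, self-contained sketch. The names below (SketchWireFormat, write, read, roundTrip) are hypothetical and chosen for illustration only; they are not the library's actual signatures. The sketch assumes only java.io from the standard library and shows how an instance for a pair can be derived implicitly from the instances of its components.

```scala
import java.io.{ByteArrayInputStream, ByteArrayOutputStream}
import java.io.{DataInput, DataInputStream, DataOutput, DataOutputStream}

// Hypothetical typeclass: describes how to serialize and deserialize an A.
// (Illustrative only; not the library's real trait.)
trait SketchWireFormat[A] {
  def write(a: A, out: DataOutput): Unit
  def read(in: DataInput): A
}

object SketchWireFormat {
  // Instances for common types, found automatically via the companion object.
  implicit val intFormat: SketchWireFormat[Int] = new SketchWireFormat[Int] {
    def write(a: Int, out: DataOutput): Unit = out.writeInt(a)
    def read(in: DataInput): Int = in.readInt()
  }

  implicit val stringFormat: SketchWireFormat[String] = new SketchWireFormat[String] {
    def write(a: String, out: DataOutput): Unit = out.writeUTF(a)
    def read(in: DataInput): String = in.readUTF()
  }

  // A pair can be sent whenever both of its components can be sent.
  implicit def pairFormat[A, B](implicit fa: SketchWireFormat[A],
                                fb: SketchWireFormat[B]): SketchWireFormat[(A, B)] =
    new SketchWireFormat[(A, B)] {
      def write(p: (A, B), out: DataOutput): Unit = {
        fa.write(p._1, out)
        fb.write(p._2, out)
      }
      def read(in: DataInput): (A, B) = {
        val a = fa.read(in) // read in the same order as written
        val b = fb.read(in)
        (a, b)
      }
    }

  // Serialize a value to bytes and read it back, using whichever instance is in scope.
  def roundTrip[A](a: A)(implicit f: SketchWireFormat[A]): A = {
    val bytes = new ByteArrayOutputStream()
    f.write(a, new DataOutputStream(bytes))
    f.read(new DataInputStream(new ByteArrayInputStream(bytes.toByteArray)))
  }
}
```

Calling SketchWireFormat.roundTrip(("key", 42)) resolves the pair instance from the String and Int instances without any explicit wiring, which is the main reason for expressing serialization as a typeclass rather than as a base class that every value type must extend.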