com.nicta.scoobi.application

InputsOutputs

trait InputsOutputs extends TextInput with TextOutput with AvroInput with AvroOutput with SequenceInput with SequenceOutput

This trait provides a way to create DLists from files and to add sinks to DLists so that the results of computations can be saved to files.

Type Members

  1. type AvroFixed[A] = io.avro.AvroFixed[A]

  2. type AvroSchema[A] = io.avro.AvroSchema[A]

  3. implicit class ConvertKeyListToSequenceFile[K] extends AnyRef

  4. implicit class ConvertKeyListToSequenceFile1[K, V] extends AnyRef

  5. implicit class ConvertKeyObjectToSequenceFile[K] extends AnyRef

  6. implicit class ConvertListToSequenceFile[T] extends AnyRef

  7. implicit class ConvertObjectToSequenceFile[T] extends AnyRef

  8. implicit class ConvertValueListToSequenceFile[V] extends AnyRef

  9. implicit class ConvertValueListToSequenceFile1[K, V] extends AnyRef

  10. implicit class ConvertValueObjectToSequenceFile[V] extends AnyRef

  11. implicit class ListToAvroFile[A] extends AnyRef

  12. implicit class ListToCheckpointFile[A] extends AnyRef

    checkpoints

  13. implicit class ListToDelimitedTextFile[A <: Product] extends AnyRef

  14. implicit class ListToTextFile[A] extends AnyRef

  15. implicit class ObjectToAvroFile[A] extends AnyRef

  16. implicit class ObjectToDelimitedTextFile[A <: Product] extends AnyRef

  17. implicit class ObjectToTextFile[A] extends AnyRef

  18. type SeqSchema[A] = io.sequence.SeqSchema[A]

    Sequence File I/O
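
The implicit classes listed above (for example `ListToTextFile`) follow Scala's usual enrichment pattern, in which a method-call form is added to an existing type and delegates to a plain function. The sketch below illustrates that general pattern only. It uses a plain `List` and hypothetical names (`EnrichmentSketch`, `Written`, `ListToTextFileSketch`) rather than Scoobi's `DList` and sinks, so it should not be read as the library's implementation.

```scala
object EnrichmentSketch {
  // Hypothetical stand-in for a sink: records what would be written and where.
  final case class Written(path: String, lines: List[String])

  // Plain function form, similar in shape to toTextFile(dl, path).
  def toTextFile[A](items: List[A], path: String): Written =
    Written(path, items.map(_.toString))

  // Implicit class adding a method-call form that delegates to the function above.
  // The call is qualified because the method shares the function's name.
  implicit class ListToTextFileSketch[A](items: List[A]) {
    def toTextFile(path: String): Written =
      EnrichmentSketch.toTextFile(items, path)
  }

  // Both forms produce the same result.
  val viaFunction: Written = toTextFile(List(1, 2, 3), "out/numbers")
  val viaMethod: Written   = List(1, 2, 3).toTextFile("out/numbers")
}
```

With the implicit class in scope, `list.toTextFile(path)` and `toTextFile(list, path)` are interchangeable, which is presumably why the trait exposes both an implicit class and a function for each output format.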

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. object ADouble

    Extract a Double from a String.

  7. object AFloat

    Extract a Float from a String.

  8. object ALong

    Extract a Long from a String.

  9. object AnInt

    Extract an Int from a String.

  10. val AvroSchema: io.avro.AvroSchema.type

    Avro I/O

  11. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  12. def avroSink[B](path: String, overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck, checkpoint: Boolean = false, expiryPolicy: ExpiryPolicy = ExpiryPolicy.default)(implicit sc: ScoobiConfiguration, schema: io.avro.AvroSchema[B]): AvroSink[AvroType, B]

    Definition Classes
    AvroOutput
  13. def checkedSource[K, V](paths: Seq[String], checkKeyValueTypes: Boolean = true)(implicit arg0: Manifest[K], arg1: Manifest[V]): CheckedSeqSource[K, V, (K, V)]

    Definition Classes
    SequenceInput
  14. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  15. def defaultSequenceInputFormat[K, V]: Class[SequenceFileInputFormat[K, V]]

    Definition Classes
    SequenceInput
  16. def defaultTextConverter: InputConverter[LongWritable, Text, String]

    Definition Classes
    TextInput
  17. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  18. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  19. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  20. def fromAvroFile[A](paths: Seq[String], checkSchemas: Boolean = true, check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck)(implicit arg0: WireFormat[A], arg1: io.avro.AvroSchema[A]): DList[A]

    Create a new DList from the contents of a list of one or more Avro files. The type of the DList must conform to the schema types allowed by Avro, as constrained by the 'AvroSchema' type class. If a directory is specified, the input consists of all the files in that directory.

    Definition Classes
    AvroInput
  21. def fromAvroFile[A](paths: String*)(implicit arg0: WireFormat[A], arg1: io.avro.AvroSchema[A]): DList[A]

    Create a new DList from the contents of one or more Avro files. The type of the DList must conform to the schema types allowed by Avro, as constrained by the 'AvroSchema' type class. If a directory is specified, the input consists of all the files in that directory.

    Definition Classes
    AvroInput
  22. def fromDelimitedTextFile[A](path: String, sep: String = "\t", check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck)(extractFn: PartialFunction[Seq[String], A])(implicit arg0: WireFormat[A]): DList[A]

    Create a distributed list from a text file whose lines consist of fields delimited by some separator. Use an extractor function to pull out the required fields to create the distributed list.

    Definition Classes
    TextInput
  23. def fromLazySeq[A](seq: ⇒ Seq[A], seqSize: Int = 1000)(implicit arg0: WireFormat[A]): DList[A]

    Create a DList from a stream of elements that will only be evaluated on the cluster.

  24. def fromSequenceFile[K, V](paths: Seq[String], checkKeyValueTypes: Boolean = true, check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck)(implicit arg0: WireFormat[K], arg1: io.sequence.SeqSchema[K], arg2: WireFormat[V], arg3: io.sequence.SeqSchema[V]): DList[(K, V)]

    Create a new DList from the contents of a list of one or more Sequence Files. Note that the type parameters K and V are the "converted" Scala types for the Writable key-value types that must be contained in the Sequence Files. If a directory is specified, the input consists of all the files in that directory.

    Definition Classes
    SequenceInput
  25. def fromSequenceFile[K, V](paths: String*)(implicit arg0: WireFormat[K], arg1: io.sequence.SeqSchema[K], arg2: WireFormat[V], arg3: io.sequence.SeqSchema[V]): DList[(K, V)]

    Definition Classes
    SequenceInput
  26. def fromSequenceSource[K, V, A](source: SeqSource[K, V, A])(implicit arg0: WireFormat[A]): DListImpl[A]

    Definition Classes
    SequenceInput
  27. def fromTextFile(paths: Seq[String], check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck): DList[String]

    Definition Classes
    TextInput
  28. def fromTextFile(paths: String*): DList[String]

    Create a distributed list from one or more files or directories. For a directory, the input consists of all files in that directory.

    Definition Classes
    TextInput
  29. def fromTextFileWithPath(path: String, check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck): DList[(String, String)]

    Create a distributed list from one or more files or directories. For a directory, the input consists of all files in that directory. Each element of the distributed list is a tuple whose first part is the path of the originating file and whose second part is a line of text.

    Definition Classes
    TextInput
  30. def fromTextFileWithPaths(paths: Seq[String], check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck): DList[(String, String)]

    Create a distributed list from a list of one or more files or directories. For a directory, the input consists of all files in that directory. Each element of the distributed list is a tuple whose first part is the path of the originating file and whose second part is a line of text.

    Definition Classes
    TextInput
  31. def fromTextSource[A](source: TextSource[A])(implicit arg0: WireFormat[A]): DListImpl[A]

    Definition Classes
    TextInput
  32. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  33. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  34. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  35. def keyFromSequenceFile[K](paths: Seq[String], checkKeyType: Boolean = true, check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck)(implicit arg0: WireFormat[K], arg1: io.sequence.SeqSchema[K]): DList[K]

    Create a new DList from the "key" contents of a list of one or more Sequence Files. Note that the type parameter K is the "converted" Scala type for the Writable key type that must be contained in the Sequence Files. If a directory is specified, the input consists of all the files in that directory.

    Definition Classes
    SequenceInput
  36. def keyFromSequenceFile[K](paths: String*)(implicit arg0: WireFormat[K], arg1: io.sequence.SeqSchema[K]): DList[K]

    Create a new DList from the "key" contents of one or more Sequence Files. Note that the type parameter K is the "converted" Scala type for the Writable key type that must be contained in the Sequence Files. If a directory is specified, the input consists of all the files in that directory.

    Definition Classes
    SequenceInput
  37. def keySchemaSequenceFile[K](path: String, overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck)(implicit convK: io.sequence.SeqSchema[K]): SeqSink[SeqType, NullWritable, K]

    Definition Classes
    SequenceOutput
  38. def keyToSequenceFile[K](dl: DList[K], path: String, overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck)(implicit convK: io.sequence.SeqSchema[K]): T

    Specify a distributed list to be persistent by converting its elements to Writables and storing it to disk as the "key" component in a Sequence File.

    Definition Classes
    SequenceOutput
  39. def lazyObject[A](o: ⇒ A)(implicit arg0: WireFormat[A]): DObject[A]

    Create a DObject that will only be evaluated on the cluster.

  40. def listToDelimitedTextFile[A <: Product](dl: DList[A], path: String, sep: String = "\t", overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck)(implicit arg0: Manifest[A]): DList[String]

    Persist a distributed list of 'Products' (e.g. Tuples) as a delimited text file.

    Definition Classes
    TextOutput
  41. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  42. final def notify(): Unit

    Definition Classes
    AnyRef
  43. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  44. def objectFromAvroFile[A](paths: Seq[String], checkSchemas: Boolean = true)(implicit arg0: WireFormat[A], arg1: AvroSchema[A]): DObject[A]

  45. def objectFromAvroFile[A](paths: String*)(implicit arg0: WireFormat[A], arg1: AvroSchema[A]): DObject[A]

  46. def objectFromDelimitedTextFile[A](path: String, sep: String = "\t", check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck)(extractFn: PartialFunction[Seq[String], A])(implicit arg0: WireFormat[A]): DObject[A]

  47. def objectFromSequenceFile[K, V](paths: Seq[String], checkKeyValueTypes: Boolean = true)(implicit arg0: WireFormat[K], arg1: SeqSchema[K], arg2: WireFormat[V], arg3: SeqSchema[V]): DObject[(K, V)]

  48. def objectFromSequenceFile[K, V](paths: String*)(implicit arg0: WireFormat[K], arg1: SeqSchema[K], arg2: WireFormat[V], arg3: SeqSchema[V]): DObject[(K, V)]

  49. def objectFromTextFile(paths: String*): DObject[String]

    Text file I/O

  50. def objectKeyFromSequenceFile[K](paths: Seq[String], checkKeyType: Boolean = true)(implicit arg0: WireFormat[K], arg1: SeqSchema[K]): DObject[K]

  51. def objectKeyFromSequenceFile[K](paths: String*)(implicit arg0: WireFormat[K], arg1: SeqSchema[K]): DObject[K]

  52. def objectToDelimitedTextFile[A <: Product](o: DObject[A], path: String, sep: String = "\t", overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck)(implicit arg0: Manifest[A]): DObject[String]

    Persist a distributed object of 'Products' (e.g. Tuples) as a delimited text file.

    Definition Classes
    TextOutput
  53. def objectValueFromSequenceFile[V](paths: Seq[String], checkValueType: Boolean = true)(implicit arg0: WireFormat[V], arg1: SeqSchema[V]): DObject[V]

  54. def objectValueFromSequenceFile[V](paths: String*)(implicit arg0: WireFormat[V], arg1: SeqSchema[V]): DObject[V]

  55. def schemaSequenceSink[K, V](path: String, overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck, checkpoint: Boolean = false, expiryPolicy: ExpiryPolicy = ExpiryPolicy.default)(implicit convK: io.sequence.SeqSchema[K], convV: io.sequence.SeqSchema[V], sc: ScoobiConfiguration): SeqSink[SeqType, SeqType, (K, V)] with SinkSource { def toSource: com.nicta.scoobi.io.sequence.SeqSource[convK.SeqType,convV.SeqType,(K, V)] }

    Definition Classes
    SequenceOutput
  56. def sequenceSink[K <: Writable, V <: Writable](path: String, overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck, checkpoint: Boolean = false, expiryPolicy: ExpiryPolicy = ExpiryPolicy.default)(implicit mk: Manifest[K], mv: Manifest[V], sc: ScoobiConfiguration): SeqSink[K, V, (K, V)] with SinkSource { def toSource: com.nicta.scoobi.io.sequence.SeqSource[K,V,(K, V)] }

    Definition Classes
    SequenceOutput
  57. def source[K, V](paths: Seq[String])(implicit arg0: WireFormat[K], arg1: io.sequence.SeqSchema[K], arg2: WireFormat[V], arg3: io.sequence.SeqSchema[V]): SeqSource[K, V, (K, V)]

    Definition Classes
    SequenceInput
  58. def source[A](paths: Seq[String], checkSchemas: Boolean = true, check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck)(implicit arg0: io.avro.AvroSchema[A]): AvroDataSource[io.avro.AvroInput.source.schema.type.AvroType, A] forSome {val schema: io.avro.AvroSchema[A]}

    Definition Classes
    AvroInput
  59. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  60. def textFileSink[A](path: String, overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck)(implicit arg0: Manifest[A]): TextFileSink[A]

    Definition Classes
    TextOutput
  61. def textSource(paths: Seq[String], check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck): TextSource[String]

    Create a text source.

    Definition Classes
    TextInput
  62. def toAvroFile[B](list: DList[B], path: String, overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck, checkpoint: Boolean = false, expiryPolicy: ExpiryPolicy = ExpiryPolicy.default)(implicit schema: io.avro.AvroSchema[B], sc: ScoobiConfiguration): T

    Specify a distributed list to be persistent by storing it to disk as an Avro File

    Definition Classes
    AvroOutput
  63. def toDelimitedTextFile[A <: Product](dl: DList[A], path: String, sep: String = "\t", overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck)(implicit arg0: Manifest[A]): DList[String]

  64. def toSequenceFile[K, V](dl: DList[(K, V)], path: String, overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck, checkpoint: Boolean = false)(implicit convK: io.sequence.SeqSchema[K], convV: io.sequence.SeqSchema[V], sc: ScoobiConfiguration): T

    Specify a distributed list to be persistent by converting its elements to Writables and storing it to disk as "key-values" in a Sequence File

    Definition Classes
    SequenceOutput
  65. def toString(): String

    Definition Classes
    AnyRef → Any
  66. def toTextFile[A](dl: DList[A], path: String, overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck)(implicit arg0: Manifest[A]): T

    Persist a distributed list as a text file.

    Definition Classes
    TextOutput
  67. def valueFromSequenceFile[V](paths: Seq[String], checkValueType: Boolean = true, check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck)(implicit arg0: WireFormat[V], arg1: io.sequence.SeqSchema[V]): DList[V]

    Create a new DList from the "value" contents of a list of one or more Sequence Files. Note that the type parameter V is the "converted" Scala type for the Writable value type that must be contained in the Sequence Files. If a directory is specified, the input consists of all the files in that directory.

    Definition Classes
    SequenceInput
  68. def valueFromSequenceFile[V](paths: String*)(implicit arg0: WireFormat[V], arg1: io.sequence.SeqSchema[V]): DList[V]

    Create a new DList from the "value" contents of one or more Sequence Files. Note that the type parameter V is the "converted" Scala type for the Writable value type that must be contained in the Sequence Files. If a directory is specified, the input consists of all the files in that directory.

    Definition Classes
    SequenceInput
  69. def valueSchemaSequenceFile[V](path: String, overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck, checkpoint: Boolean = false, expiryPolicy: ExpiryPolicy = ExpiryPolicy.default)(implicit convV: io.sequence.SeqSchema[V], sc: ScoobiConfiguration): SeqSink[NullWritable, SeqType, V]

    Definition Classes
    SequenceOutput
  70. def valueSource[V](paths: Seq[String])(implicit arg0: io.sequence.SeqSchema[V]): SeqSource[Writable, io.sequence.SequenceInput.valueSource.convV.type.SeqType, V] forSome {val convV: io.sequence.SeqSchema[V]}

    Definition Classes
    SequenceInput
  71. def valueToSequenceFile[V](dl: DList[V], path: String, overwrite: Boolean = false, check: (Path, Boolean, ScoobiConfiguration) ⇒ Unit = Sink.defaultOutputCheck, checkpoint: Boolean = false)(implicit convV: io.sequence.SeqSchema[V], sc: ScoobiConfiguration): T

    Specify a distributed list to be persistent by converting its elements to Writables and storing it to disk as the "value" component in a Sequence File.

    Definition Classes
    SequenceOutput
  72. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  73. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  74. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
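
The `fromDelimitedTextFile` entry above takes an `extractFn: PartialFunction[Seq[String], A]`, and the `AnInt`, `ALong`, `AFloat` and `ADouble` objects are described as extracting numbers from Strings. The following is a minimal, library-free sketch of how those two pieces can fit together. `IntField`, `DoubleField` and `DelimitedSketch.parse` are hypothetical stand-ins written for this illustration, not Scoobi's code, and the sketch assumes that rows the partial function does not cover are simply skipped.

```scala
// Hypothetical stand-in for an extractor such as AnInt:
// matches a String only if it parses as an Int.
object IntField {
  def unapply(s: String): Option[Int] =
    try Some(s.trim.toInt) catch { case _: NumberFormatException => None }
}

// Hypothetical stand-in for an extractor such as ADouble.
object DoubleField {
  def unapply(s: String): Option[Double] =
    try Some(s.trim.toDouble) catch { case _: NumberFormatException => None }
}

object DelimitedSketch {
  // Split each line on the (literal) separator, then keep only the rows
  // for which the partial function is defined.
  def parse[A](lines: Seq[String], sep: String)(
      extractFn: PartialFunction[Seq[String], A]): Seq[A] =
    lines
      .map(_.split(java.util.regex.Pattern.quote(sep), -1).toSeq)
      .collect(extractFn)

  // The second line is dropped because "x" is not an Int.
  val rows: Seq[(Int, String, Double)] =
    parse(Seq("1\tapple\t0.5", "x\tpear\t1.0", "2\tplum\t2.25"), "\t") {
      case Seq(IntField(id), name, DoubleField(price)) => (id, name, price)
    }
}
```

Matching on `Seq(...)` rather than on a list cons pattern keeps the partial function independent of the concrete sequence type that the fields arrive in.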
