com.nicta.scoobi.io.text

TextInput

trait TextInput extends AnyRef

Smart functions for materialising distributed lists by loading text files.

Linear Supertypes
AnyRef, Any
Known Subclasses
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. TextInput
  2. AnyRef
  3. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. object ADouble

    Extract a Double from a String.

  7. object AFloat

    Extract a Float from a String.

  8. object ALong

    Extract a Long from a String.

  9. object AnInt

    Extract an Int from a String.

  10. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  11. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  12. def defaultTextConverter: InputConverter[LongWritable, Text, String]

    INPUT CONVERTERS

  13. def defaultTextConverterToString: InputConverter[Text, Text, (String, String)]

  14. def defaultTextConverterWithPath: InputConverter[LongWritable, Text, (String, String)]

  15. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  16. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  17. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  18. def fromDelimitedTextFile[A](path: String, sep: String = "\t", check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck)(extractFn: PartialFunction[Seq[String], A])(implicit arg0: WireFormat[A]): DList[A]

    Create a distributed list from a text file that is a number of fields delimited by some separator.

    Create a distributed list from a text file that is a number of fields delimited by some separator. Use an extractor function to pull out the required fields to create the distributed list.

  19. def fromDelimitedTextFiles[A](paths: Seq[String], sep: String = "\t", check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck)(extractFn: PartialFunction[Seq[String], A])(implicit arg0: WireFormat[A]): DList[A]

    Create a distributed list from a list of one ore more files or directories (in the case of a directory, the input forms all files in that directory).

    Create a distributed list from a list of one ore more files or directories (in the case of a directory, the input forms all files in that directory). The file(s) contain a number of fields delimited by a separator. Use an extractor function to pull out the required fields to create the distributed list.

  20. def fromTextFile(paths: String*): DList[String]

    Create a DList from one or more files or directories (in the case of a directory, the input forms all files in that directory).

  21. def fromTextFileWithPath(path: String, check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck): DList[(String, String)]

    Create a distributed list from one or more files or directories (in the case of a directory, the input forms all files in that directory).

    Create a distributed list from one or more files or directories (in the case of a directory, the input forms all files in that directory). The distributed list is a tuple where the first part is the path of the originating file and the second part is a line of text.

  22. def fromTextFileWithPaths(paths: Seq[String], check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck): DList[(String, String)]

    Create a distributed list from a list of one or more files or directories (in the case of a directory, the input forms all files in that directory).

    Create a distributed list from a list of one or more files or directories (in the case of a directory, the input forms all files in that directory). The distributed list is a tuple where the first part is the path of the originating file and the second part is a line of text.

  23. def fromTextFiles(paths: Seq[String], check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck): DList[String]

  24. def fromTextSource[A](source: DataSource[_, _, A])(implicit arg0: WireFormat[A]): DListImpl[A]

    TEXT SOURCES

  25. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  26. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  27. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  28. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  29. final def notify(): Unit

    Definition Classes
    AnyRef
  30. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  31. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  32. def textSource(paths: Seq[String], check: (Seq[Path], ScoobiConfiguration) ⇒ Unit = Source.defaultInputCheck): TextSource[String]

    create a text source

  33. def toString(): String

    Definition Classes
    AnyRef → Any
  34. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  35. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  36. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from AnyRef

Inherited from Any

Ungrouped