Class com.intersystems.spark.DefaultSource

final class DefaultSource extends DataSource

Registers the InterSystems IRIS Spark Connector as a Spark SQL data source provider for the format "com.intersystems.spark", also known by its shorter alias "iris".

This allows clients to execute queries against a cluster by calling Spark's generic load and save functions. For example:

spark.read
     .format("com.intersystems.spark")
     .option("query","SELECT * FROM Owls")
     .load()

executes the query "SELECT * FROM Owls" on the default cluster, and hands its rows back in the form of an appropriately partitioned DataFrame.

Here read means 'execute a SELECT statement against the database', while write means 'execute batch INSERT statements against a database table'.
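
A corresponding minimal write (a sketch, assuming a DataFrame named df, using the "iris" alias and the "dbtable" option documented under createRelation below) might look like:

df.write
  .format("iris")                // shorter alias for "com.intersystems.spark"
  .option("dbtable", "Owls")     // target table for the batch INSERT statements
  .mode("append")                // what to do if the table already exists
  .save()

This appends the rows of df to the table "Owls" on the default cluster.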

See also

Apache Spark Documentation for more on how to use the generic load and save functions.
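
The "default cluster" referred to above is identified by the "spark.iris.master.url" configuration setting described under createRelation below. As a sketch (the address shown is hypothetical), it can be supplied when the session is built:

import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder()
  .config("spark.iris.master.url", "IRIS://localhost:51773/USER")
  .getOrCreate()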

Linear Supertypes
DataSource, Logging, RelationProvider, CreatableRelationProvider, DataSourceRegister, AnyRef, Any

Instance Constructors

  1. new DefaultSource()

Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  5. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  6. def createRelation(sc: SQLContext, mode: SaveMode, options: Map[String, String], df: DataFrame): BaseRelation

    Write the given DataFrame to the database and return a base relation with which it can subsequently be read.

    Arguments are supplied as a collection of (key,value) pairs. They include all of the options described below for the read variant of createRelation; in particular:

    • dbtable: The name of the target table to write to.

    Other optional arguments include:

    • url: A string of the form "IRIS://host:port/namespace" that specifies the cluster to which the data is to be written. If omitted, the default cluster specified via the "spark.iris.master.url" configuration setting is used instead.
    • user: The user account with which to make the connection to the cluster named in the "url" option above.
    • password: The password for the given user account.
    • batchsize: The number of rows to insert per server round trip. Default = 1000.
    • isolationlevel: The transaction isolation level: one of NONE, READ_UNCOMMITTED, READ_COMMITTED, REPEATABLE_READ, or SERIALIZABLE, corresponding to the standard transaction isolation levels specified by the JDBC Connection object. Default = READ_UNCOMMITTED.
    • description: An optional description for the newly created table.
    • publicrowid: Specifies whether or not the master row ID column for the newly created table is to be made publicly visible.
    • shard: Indicates that the records of the table are to be distributed across the instances of the cluster; the optional value specifies the shard key to use. See the InterSystems IRIS documentation for CREATE TABLE for more details.
    • autobalance: When writing a dataset to a table that is sharded on a system-assigned shard key, the value true specifies that the inserted records are to be evenly distributed amongst the available shards, while the value false specifies that they be sent to whichever shard is closest to where the partitions of the dataset reside. Default = true.

    sc        The context in which to create the new relation.
    mode      Describes the behavior if the target table already exists.
    options   The arguments with which to initialize the new relation.
    df        The source data frame that is to be written.
    returns   A new BaseRelation through which the newly written data can subsequently be read from the cluster.

    Definition Classes
    DataSource → CreatableRelationProvider
    Exceptions thrown

    Exception if the proposed write operation would certainly fail.

    IllegalArgumentException if passed an invalid parameter value.

    See also

    Using JDBC Transaction Isolation Levels

    InterSystems IRIS SQL Reference for more information on the options supported by the CREATE TABLE statement.
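
    As a concrete sketch of how these options combine (the cluster address, credentials, and table name are hypothetical, and df stands for any DataFrame):

    import org.apache.spark.sql.SaveMode

    df.write
      .format("com.intersystems.spark")
      .option("url", "IRIS://server:51773/USER")    // hypothetical target cluster
      .option("user", "tarzan")                     // hypothetical credentials
      .option("password", "jane")
      .option("dbtable", "Owls")                    // table to create or append to
      .option("batchsize", "2000")                  // insert 2000 rows per round trip
      .option("isolationlevel", "READ_COMMITTED")   // standard JDBC isolation level
      .mode(SaveMode.Append)                        // behavior if "Owls" already exists
      .save()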

  7. def createRelation(sc: SQLContext, options: Map[String, String]): BaseRelation

    Constructs a BaseRelation from the given arguments. These are supplied as a collection of (key,value) pairs, and must include a value for either:

    • query: The text of a query to be executed on the cluster, or
    • dbtable: The name of a database table within the cluster, in which case the entire table is loaded.

    Other optional arguments include:

    • url: A string of the form "IRIS://host:port/namespace" that specifies the cluster from which the data is to be read. If omitted, the default cluster specified via the "spark.iris.master.url" configuration setting is used instead.
    • user: The user account with which to make the connection to the cluster named in the "url" option above.
    • password: The password for the given user account.
    • mfpi: The maximum number of partitions per server instance to include in any implicit query factorization performed by the server. Default = 1.
    • fetchsize: The number of rows to fetch per server round trip. Default = 1000.
    • partitionColumn, lowerBound, upperBound, and numPartitions: An explicit description of how to partition the queries sent to each distinct instance; these have semantics identical to those of the similarly named arguments for the JDBC data source that is built into Spark.

    If both the mfpi and partitionColumn arguments are given, the explicit partitioning specification takes precedence.

    sc        The context in which to create the new relation.
    options   The arguments with which to initialize the new relation.
    returns   A new Relation that can be called upon to execute the given query on the cluster as needed.

    Definition Classes
    DataSource → RelationProvider
    Exceptions thrown

    IllegalArgumentException if passed an invalid parameter value.

    See also

    JDBC to Other Databases for more on the semantics of the partitionColumn, lowerBound, upperBound, and numPartitions parameters.
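
    For instance, an explicitly partitioned read might look like the following sketch (the table name and the integral column ID are hypothetical):

    val owls = spark.read
      .format("com.intersystems.spark")
      .option("dbtable", "Owls")           // load the entire table
      .option("partitionColumn", "ID")     // column on which to split the query
      .option("lowerBound", "1")           // assumed smallest value of ID
      .option("upperBound", "1000000")     // assumed largest value of ID
      .option("numPartitions", "8")        // yields a DataFrame with 8 partitions
      .load()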

  8. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  9. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  10. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  11. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  12. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  13. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  14. lazy val log: Logger

    A dedicated logger that instances of this class can write to.

    Definition Classes
    Logging
  15. def logName: String

    The name of the logger that instances of this class can write to.

    Definition Classes
    Logging
  16. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  17. final def notify(): Unit

    Definition Classes
    AnyRef
  18. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  19. val shortName: String

    A short user-friendly name for the data source.

    Definition Classes
    DataSource → DataSourceRegister
  20. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  21. def toString(): String

    Definition Classes
    AnyRef → Any
  22. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  23. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  24. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )


© 2024 InterSystems Corporation, Cambridge, MA. All rights reserved.