Specifies the connection details of the cluster to read from.
Specifies the connection details of the cluster to read from.
Overrides the default cluster specified in the Spark configuration for the duration of this read operation.
A string of the form "IRIS://host:port/namespace" that specifies the cluster to read from.
The user account with which to make the connection to the cluster named in the "url" option above.
The password for the given user account.
The same DataFrameReader on which this method was invoked.
Specifies the connection details of the cluster to read from.
Specifies the connection details of the cluster to read from.
Overrides the default cluster specified in the Spark configuration for the duration of this read operation.
The connection details of the cluster to read from.
The same DataFrameReader on which this method was invoked.
Executes a query on the given cluster to compute a suitably partitioned DataFrame.
Executes a query on the given cluster to compute a suitably partitioned DataFrame.
This enables one to write, for example:
spark.read.iris("SELECT * FROM Owls","column",0,10000,2)
as a convenient shorthand for the more explicit:
spark.read .format("com.intersystems.spark") .option("query","SELECT * FROM Owls") .option("paftitionCol","column") .option("lowerBound",0) .option("upperBound",10000) .option("numPartitions",2) .load()
The following options affect how the operation is performed:
url
: A string of the form "IRIS://host:port/namespace"
that specifies the cluster from which the data is to
be read. If omitted, the default cluster specified
via the "spark.iris.master.url" configuration setting
is used instead.user
: The account with which to make the connection to the
cluster named in the "url" option above.password
: The password for the given user account.fetchsize
: The number of rows to fetch per server round trip.Default = 1000.
The text of a query to be executed on the cluster or the name of an existing table in the cluster to load.
The name of the integral valued column in the result set with which to further partition the query.
The lower bound of the partitioning column.
The upper bound of the partitioning column.
The number of partitions per instance to create.
The results of the query in the form of a suitably partitioned DataFrame.
SQLException
if a database access error occurs.
JDBC to Other Databases
for more on the semantics of the column
, lo
, hi
, and partitions
parameters.
Executes a query on the given cluster to compute a suitably partitioned DataFrame.
Executes a query on the given cluster to compute a suitably partitioned DataFrame.
This enables one to write, for example:
spark.read.iris("SELECT * FROM table",2)
as a convenient shorthand for the more explicit:
spark.read .format("com.intersystems.spark") .option("query","SELECT * FROM table") .option("mfpi",2) .load()
The following options affect how the operation is performed:
url
: A string of the form "IRIS://host:port/namespace"
that specifies the cluster from which the data is to
be read. If omitted, the default cluster specified
via the "spark.iris.master.url" configuration setting
is used instead.user
: The account with which to make the connection to the
cluster named in the "url" option above.password
: The password for the given user account.fetchsize
: The number of rows to fetch per server round trip.Default = 1000.
The text of a query to be executed on the cluster or the name of an existing table in the cluster to load.
The maximum number of factors per distinct instance to include in the factorization implicitly performed by the server, or 0 if no limit is necessary.
The results of the query in the form of a suitably partitioned DataFrame.
SQLException
if a database access error occurs.
© 2024 InterSystems Corporation, Cambridge, MA. All rights reserved. Privacy & Terms Guarantee Accessibility
Extends the given reader with IRIS specific methods.