Data Source Details (2.13)

This page provides details for each type of data source for InterSystems Data Studio™. For guidance on how to create, edit, delete and test a data source, see Defining Data Sources.

Some data sources require credentials. Create the credentials before creating these data sources.

Structured Data Sources

Structured data sources provide access to data in a defined format, such as delimited files, database tables, APIs, and feeds.

Bloomberg

A Bloomberg data source retrieves data from Bloomberg. This source is only available within InterSystems Data Studio™ with the asset management module.

For a Bloomberg source, specify the following details:

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

Credential

Required. Select the saved credential that authorizes use of this data source.

Expiration Date

Required. Specify the expiration date of this data source.

Local FileDir Source

Optional. The local FileDir data source folder where files are staged.

To use the File Manager interface for staged files, create a FileDir data source before creating this data source, and select it here. If you do not specify a FileDir source, the system stages files in a temporary folder, and the File Manager interface is not available.

Commcise

A Commcise data source retrieves Commcise data. This source is only available within InterSystems Data Studio™ with the asset management module.

For a Commcise source, specify the following details:

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

Credential

Required. Select the saved credential that authorizes use of this data source.

Certificate File Path

Optional. The location of the certificate file used to obtain an access token from the Commcise Auth server.

Email for Client Id

Required. The email address used as the Commcise user ID.

Auth Server

Required. The authorization server to use when connecting to Commcise.

API Server

Required. The Commcise API server to use.

ExcelSingleFileDir

An ExcelSingleFileDir data source provides access to data stored in Microsoft Excel files that are periodically written to a directory accessible by Data Studio. Apart from the file format, this data source is the same as FileDir.

FileDir

A FileDir data source provides data via files that are periodically written to a specific directory within the file system. Each file should contain delimited data; a common example is a comma-separated values (CSV) file. Another common delimiter is the tab.

For a FileDir source, specify the following details:

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

Interface Path Location

Required. The directory in which these files are found, relative to the system base path. Because this is a relative path, it should not start with / or \; any leading path separator is removed automatically.

To avoid confusion, this directory should not be used by any other data source.

When you create a FileDir source, the system automatically creates the directory named by Interface Path Location, as well as the subdirectories Samples, Source, Work, and Archive. Using the File Manager explains the purpose of these directories.
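The path handling and directory layout described above can be sketched in Python. This is a minimal illustration, not Data Studio's implementation: the function name create_filedir_layout and the base_path argument are hypothetical, and only the four subdirectory names and the removal of leading separators come from the text.

```python
from pathlib import Path

# Subdirectory names taken from the description above.
SUBDIRECTORIES = ("Samples", "Source", "Work", "Archive")


def create_filedir_layout(base_path: str, interface_path: str) -> Path:
    """Create the FileDir directory and its subdirectories under base_path.

    Hypothetical helper: it mimics the documented behavior of removing any
    leading path separator so that the interface path stays relative.
    """
    relative = interface_path.lstrip("/\\")
    root = Path(base_path) / relative
    for name in SUBDIRECTORIES:
        # parents=True also creates the interface directory itself.
        (root / name).mkdir(parents=True, exist_ok=True)
    return root
```

Under these assumptions, calling create_filedir_layout with an interface path of /inbound/trades produces the same layout as inbound/trades, because the leading separator is dropped before the path is joined to the base path.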

GLEIF

A GLEIF data source provides access to data from the Global Legal Entity Identifier Foundation (GLEIF). This source is only available within InterSystems Data Studio™ with the asset management module.

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

API Server

Optional. The base URL for the GLEIF API endpoint.

Connection Test Path

Optional. A REST endpoint used to verify the connection. This value is appended to the API Server path.

Local FileDir Source

Optional. The local FileDir data source folder where files are staged.

To use the File Manager interface for staged files, create a FileDir data source before creating this data source, and select it here. If you do not specify a FileDir source, the system stages files in a temporary folder, and the File Manager interface is not available.

Guardian

A Guardian data source provides access to The Guardian OpenPlatform news feed, provided by The Guardian News & Media Limited; see https://open-platform.theguardian.com/documentation/.

For such a data source, provide the following details:

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

APIKEY

Required. The API key needed to access this data source.

JDBC

A JDBC data source provides access to a database via a JDBC connection.

For a JDBC data source, specify the following details:

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

Credential

Required. The credential that defines the username and password to access the database. Select the applicable credential from the dropdown list.

JDBC URL

Required. Enter the JDBC connection string needed to access the database.

JDBC Database

Required. Select the database vendor and version.

Enable Foreign Tables

If you select this check box, you can project tables from the selected database as foreign tables within Data Studio. This is useful when it is not feasible or reasonable to load data directly into Data Studio; for example, when a table is extremely large and is not queried frequently. A foreign table is read-only but can otherwise be accessed in the same way as a local table.

If you select this check box, also specify JDBC Foreign Table Local Schema, which is the default name of the schema to contain any foreign tables from this data source.

NYSE

An NYSE data source provides access to NYSE-related data. This source is only available within InterSystems Data Studio™ with the asset management module.

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

Entity Description

Optional. A description of the entity. The default value is US - NYSE.

NYT

An NYT data source provides access to a news feed API provided by The New York Times Company; see https://developer.nytimes.com/get-started.

For such a data source, provide the following details:

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

APIKEY

Required. The API key needed to access this data source.

REST

A REST data source provides access to a RESTful endpoint. Use this type of data source to connect to APIs that expose data over HTTP. For this type of data source, supply the following details:

Data Source Name

Required. The unique name of the data source. This is the name users see when browsing the Data Catalog.

Credential

Optional. Select the saved credential that authorizes access to this data source.

API Token/Key

Optional. The API key or token required to access this data source.

API Server

Optional. The base URL for the REST API endpoint (for example, https://test.example.com).

Connection Test Path

Optional. A REST endpoint used to verify the connection. This value is appended to the API Server path (for example, /oauth2/token).

Local FileDir Source

Optional. The local FileDir data source folder where files are staged.

To use the File Manager interface for staged files, create a FileDir data source before creating this data source, and select it here. If you do not specify a FileDir source, the system stages files in a temporary folder, and the File Manager interface is not available.

Known REST Endpoints

Optional. REST endpoints that require custom processing.

After creating the REST data source, navigate to Data Catalog > REST Registry to add additional metadata.
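As an illustration of how Connection Test Path is appended to API Server, the following Python sketch joins the two values. The function name build_test_url is hypothetical, and the slash normalization is an assumption; the text states only that the path is appended to the server URL.

```python
def build_test_url(api_server: str, test_path: str = "") -> str:
    """Append the Connection Test Path to the API Server base URL.

    Assumption: exactly one slash separates the two parts. How Data Studio
    itself normalizes slashes is not documented here.
    """
    base = api_server.rstrip("/")
    if not test_path:
        # Connection Test Path is optional; fall back to the base URL.
        return base
    return base + "/" + test_path.lstrip("/")
```

With the example values given above, build_test_url("https://test.example.com", "/oauth2/token") yields https://test.example.com/oauth2/token.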

RSS

An RSS data source provides access to an RSS feed. For such a data source, provide the following details:

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

RSSFeedURL

Required. The URL of the RSS feed.

TimeOut

Required. The length of time to wait, in seconds, for a response from the RSS feed URL before throwing an error.
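The effect of TimeOut can be pictured with a short Python sketch that fetches a feed and raises an error when no response arrives in time. This is not how Data Studio is implemented; fetch_feed, FeedTimeout, and the opener parameter are hypothetical names introduced for illustration, and only urllib.request.urlopen with its timeout argument is a real standard-library call.

```python
import socket
import urllib.error
import urllib.request


class FeedTimeout(Exception):
    """Raised when the feed does not respond within the configured time."""


def fetch_feed(url: str, timeout_seconds: float, opener=urllib.request.urlopen) -> bytes:
    """Return the raw feed body, or raise FeedTimeout if the wait is exceeded."""
    message = f"No response from {url} within {timeout_seconds} seconds"
    try:
        with opener(url, timeout=timeout_seconds) as response:
            return response.read()
    except urllib.error.URLError as exc:
        # urlopen wraps timeouts during connection setup in URLError.
        if isinstance(exc.reason, socket.timeout):
            raise FeedTimeout(message) from exc
        raise
    except socket.timeout as exc:
        # Timeouts while reading the response are raised directly.
        raise FeedTimeout(message) from exc
```

The opener parameter exists only so the function can be exercised without network access; in ordinary use it would be left at its default.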

S3Delimited

An S3Delimited data source provides access to an S3 bucket that contains delimited files.

For an S3Delimited data source, specify the following details:

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

Credential

Required. The credential that defines the username and password to access the given S3 bucket. Select the applicable credential from the dropdown list.

S3 Bucket Name

Required. The name of the S3 bucket to access.

AWS Session Token

Optional. The session token to use when accessing the S3 bucket.

Source Path

Optional. Location of the folder from which to load files. You can optionally include %RUNDATE in the path for the ISO date to be injected at the time the files are loaded. The root folder is used if one is not provided.

Samples Path

Optional. Location of the folder from which to import file schemas. You can optionally include %RUNDATE in the path for the ISO date to be injected at the time the files are listed. The root folder is used if one is not provided.

Archive Path

Optional. Location of the folder to archive previously loaded files. You can optionally include %RUNDATE in the path for the ISO date to be injected at the time of the archiving. The root folder is used if one is not provided.

Target Path

Optional. Location of the folder to write files to, in the case when a recipe promotes data to a file on the given S3 bucket.
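The %RUNDATE substitution used in Source Path, Samples Path, and Archive Path can be sketched as follows. The function name expand_rundate is hypothetical, and the sketch assumes the injected ISO date takes the extended form YYYY-MM-DD; the exact format Data Studio uses is not stated here.

```python
from datetime import date


def expand_rundate(path: str, run_date: date) -> str:
    """Replace every %RUNDATE token in path with the run date.

    Assumption: the date is rendered as YYYY-MM-DD (ISO 8601 extended form).
    """
    return path.replace("%RUNDATE", run_date.isoformat())
```

For example, under this assumption, expand_rundate("incoming/%RUNDATE/trades", date(2024, 3, 15)) returns incoming/2024-03-15/trades. A path without the token is returned unchanged.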

SAP4HanaAPI

A SAP4HanaAPI data source provides access to SAP S/4HANA data through OData v2 APIs.

For a SAP4HanaAPI data source, specify the following details:

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

Credential

Required. The credential that defines the username and password to access the given SAP server. Select the applicable credential from the dropdown list.

SAP Server Host

Required. The host name of the SAP server (for example, the {host} portion of https://{host}:{port}/sap/opu/odata/sap/...).

SAP Server Port

Required. The port to use on the SAP server.

Maximum Retries

Required. The number of times to retry if the SAP server returns a 500 error.

Schema Filter

Required. Specifies which schemas should appear as options.
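The retry behavior controlled by Maximum Retries can be sketched in Python. This is a sketch under stated assumptions rather than Data Studio's actual logic: get_with_retries and the send callable are hypothetical, the first attempt is assumed not to count as a retry, and no delay between attempts is modeled because the text does not describe one.

```python
from typing import Callable, Tuple

Response = Tuple[int, bytes]  # (HTTP status code, response body)


def get_with_retries(send: Callable[[], Response], max_retries: int) -> Response:
    """Call send() once, then retry while the server answers with HTTP 500.

    At most max_retries additional calls are made. Any other status,
    success or failure, is returned immediately.
    """
    status, body = send()
    retries = 0
    while status == 500 and retries < max_retries:
        retries += 1
        status, body = send()
    return status, body
```

With Maximum Retries set to 2, a server that fails twice with status 500 and then succeeds would be called three times in total. If it keeps failing, the last 500 response is returned to the caller.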

Salesforce

A Salesforce data source provides access to a Salesforce instance via the Salesforce API. Depending on the selected authorization flow, some connection details may differ.

For a Salesforce data source, specify the following details:

Authorization Flow

Required. Select the appropriate authorization flow for your Salesforce instance: either password or client_credentials. Your choice determines which of the following details you must provide when creating this data source.

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

Credential

Required for the password authorization flow. The credential that defines the username and password used to authenticate with the Salesforce API. Select the applicable credential from the dropdown list.

Client ID Credentials

Required. Select the appropriate SDS Datasource (DS) Credentials record for the ClientId and Client Secret.

Authentication Server

Required. Specify the server used for authentication (which does not have to be the same server on which Salesforce is running).

API end-point Server

Required. Specify the server on which the Salesforce API is running.

Port

Optional. Specify the web server port to use, if that is not the standard port number.
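How the authorization flow changes the required details can be summarized in a short Python sketch. The function name required_fields is hypothetical; the field names and their Required or Optional status are taken from the list above, and nothing else about the Salesforce connection is implied.

```python
from typing import List

# Fields marked Required above regardless of the authorization flow.
ALWAYS_REQUIRED = [
    "Authorization Flow",
    "Data Source Name",
    "Client ID Credentials",
    "Authentication Server",
    "API end-point Server",
]


def required_fields(authorization_flow: str) -> List[str]:
    """Return the fields that must be filled in for the given flow."""
    if authorization_flow not in ("password", "client_credentials"):
        raise ValueError(f"Unknown authorization flow: {authorization_flow}")
    fields = list(ALWAYS_REQUIRED)
    if authorization_flow == "password":
        # Credential (username and password) is required only for this flow.
        fields.append("Credential")
    return fields
```

In this sketch the only difference between the two flows is the Credential field, which matches the description above; Port remains optional in both.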

SftpDelimited

An SftpDelimited data source provides access to an SFTP server that contains delimited files.

For an SftpDelimited data source, specify the following details:

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

Credential

Required. The credential that defines the username and password to access the given SFTP server. Select the applicable credential from the dropdown list.

Passphrase

Optional. The passphrase for the key file.

Host

Required. The host name of the SFTP server.

Port

Required. The port to use on the SFTP server.

SFTP Command Options

Optional. Additional command options to use when connecting to the SFTP server.

Remote Source Path

Optional. Location of the folder on the SFTP server from which to load files. You can optionally include %RUNDATE in the path for the ISO date to be injected at the time the files are loaded. The root folder is used if one is not provided.

Remote Target Path

Optional. Location of the folder on the SFTP server in which to place promoted files. The root folder is used if one is not provided.

Local FileDir Source

Optional. The local FileDir data source folder where files are staged.

To use the File Manager interface for staged files, create a FileDir data source before creating this data source, and select it here. If you do not specify a FileDir source, the system stages files in a temporary folder, and the File Manager interface is not available.

Certificate File

Optional. Upload your certificate file. Either click Select files... to open your computer’s file explorer, or drag and drop files into the section of the screen that says Drop files here to upload.

Unstructured Data Sources

Unstructured data sources provide access to document files stored in external systems such as S3 buckets and SFTP servers.

S3Document

An S3Document data source provides access to an S3 bucket that contains document files.

For an S3Document data source, specify the following details:

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

Credential

Optional. The credential that defines the username and password to access the given S3 bucket. Select the applicable credential from the dropdown list.

S3 Bucket Name

Required. The name of the S3 bucket to access.

AWS Session Token

Optional. The session token to use when accessing the S3 bucket.

SftpDocument

An SftpDocument data source provides access to an SFTP server that contains document files.

For an SftpDocument data source, specify the following details:

Data Source Name

Required. The unique name of this data source. This is the name that users see when they browse the Data Catalog.

Credential

Required. The credential that defines the username and password to access the given SFTP server. Select the applicable credential from the dropdown list.

Passphrase

Optional. The passphrase for the key file.

Host

Required. The host name of the SFTP server.

Port

Required. The port to use on the SFTP server.

SFTP Command Options

Optional. Additional command options to use when connecting to the SFTP server.

Certificate File

Optional. Upload your certificate file. Either click Select files... to open your computer’s file explorer, or drag and drop files into the section of the screen that says Drop files here to upload.
