Data Asset node

You can use the Data Asset import node to pull in data from remote data sources using connections. First, you must create the connection.

You can also pull in data from an Excel file (.xls) or a .csv file. Only the first sheet is imported. In the node's properties, under DATA, select one or more data files to upload.

Note: You can import a stream (.str) into Watson Studio that was created in SPSS Modeler Subscription or SPSS Modeler client. If the imported stream contains one or more import or export nodes, you'll be prompted to convert the nodes. See Migrating import and export nodes.
Different countries use different symbols to separate the integer part from the fractional part of a number and to separate fields in data. For example, you might use a comma instead of a period to separate the integer part from the fractional part of numbers. And, rather than using commas to separate fields in your data, you might use colons or tabs. With a Data Asset import or export node, you can specify these symbols for field delimiter and decimal. Double-click the node to open its properties and select the Field delimiter and the Decimal symbol as desired. Available delimiters are Comma, Tab, Colon, or Other. Select Other if you need to specify your own custom delimiter.
Figure 1. Field delimiter and decimal symbol options
Field delimiter and decimal symbol options

Using SQL to pull in data

In the Data Asset import node properties, under Mode, you can select SQL Query if you want to use custom SQL to import data from a database. Use a SQL SELECT statement to pull in rows or columns of data from a database. Note that the Source path field doesn't apply if you're using the SQL Query mode.

The following example pulls in certain rows of data from a database table:
where  UNIT_COST > 40,000 LIMIT 4
And this example pulls in certain columns of data from a database table:
Note that the SQL syntax you use can vary depending on database platform. For example, if pulling in data from an Informix database, Informix requires field names to be surround by double quotes. For example:
select "Age", "Sex" from testuser.canvas_drug

This SQL feature should only be used to pull in data. Use caution so as not to manipulate the data in your database.

The following databases currently support this custom SQL feature:
  • Apache Hive
  • Cloudera Impala
  • Compose for PostgreSQL
  • Db2 on Cloud
  • Db2 Warehouse
  • Informix
  • Microsoft SQL Server
  • Oracle
  • Pivotal Greenplum
  • Snowflake
  • Sybase
  • Sybase IQ