About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Last updated: Jun 19, 2024
Use this information to resolve questions about using Data Refinery.
Cannot refine data from an Excel data asset
The Data Refinery flow might fail if it cannot read the data. Confirm the format of the Excel file. By default, the first line of the file is treated as the header. You can change this setting in the Flow settings. Click the Flow settings icon . Go to the Source data sets tab and click the Overflow icon
next to the data source, and select Edit format. You can also specify the first line property, which designates which row is the first row in the data set to be read. Changing
these properties affects how the data is displayed in Data Refinery as well as the Data Refinery job run and flow output.
Data Refinery flow job fails with a large data asset
If your Data Refinery flow job fails with a large data asset, try these troubleshooting tips to fix the problem:
- Instead of using a project data asset as the target of the Data Refinery flow (default), use Cloud storage. For example, IBM Cloud Object Storage, Amazon S3, or Google Cloud Storage.
- Select a Spark & R environment for the Data Refinery flow job or create a new Spark & R environment template.