The Data Refinery flow might fail if it cannot read the data. Confirm the format of the Excel file. By default, the first line of the file is treated as the header. You can change this setting in the Flow settings. Click the Flow settings icon . Go to the Source data sets tab and click the Overflow icon next to the data source, and select Edit format. You can also specify the first line property, which designates which row is the first row in the data set to be read. Changing
these properties affects how the data is displayed in Data Refinery as well as the Data Refinery job run and flow output.
Data Refinery flow job fails with a large data asset
Copy link to section
If your Data Refinery flow job fails with a large data asset, try these troubleshooting tips to fix the problem:
Instead of using a project data asset as the target of the Data Refinery flow (default), use Cloud storage. For example, IBM Cloud Object Storage, Amazon S3, or Google Cloud Storage.
Select a Spark & R environment for the Data Refinery flow job or create a new Spark & R environment template.
About cookies on this siteOur websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising.For more information, please review your cookie preferences options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.