Add very large objects to a project's Cloud Object Storage
The amount of data you can load to a project's Cloud Object Storage at any one time depends on where you load the data from. If you are using the Watson Studio app to load the data, the limit is 5 GB. To add larger objects to a project's Cloud Object Storage, you can use any of the following methods:
- The Cloud Object Storage API
- The FTP client called Cyberduck
- The IBM Cloud Object Storage Python SDK (in case you can't use the FTP client)
Load data in multiple parts by using the Cloud Object Storage API
With the Cloud Object Storage API, you can load data objects as large as 5 GB in a single PUT, and objects as large as 5 TB by loading the data into object storage as a set of parts which can be loaded independently in any order and in parallel. After all of the parts have been loaded, they are presented as a single object in Cloud Object Storage.
You can load files with these formats and mime types in multiple parts:
- text/plain; charset=utf-8
To load a data object in multiple parts:
Initiate a multipart load:
curl -X "POST" "https://(endpoint)/(bucket-name)/(object-key)?uploads" -H "Authorization: bearer (token)"
The values for
tokenare on the Watson Studio project's Settings page. Click Services > Data Services and Actions > Manage in IBM Cloud on the Cloud Object Storage for the endpoint value.
Load the parts by specifying arbitrary sequential part numbers and an UploadId for the object:
curl -X "PUT" "https://(endpoint)/(bucket-name)/(object-key)?partNumber=(sequential-integer)&uploadId=(upload-id)" -H "Authorization: bearer (token)" -H "Content-Type: (content-type)"
Complete the multipart load:
curl -X "POST" "https://(endpoint)/(bucket-name)/(object-key)?uploadId=(upload-id)" -H "Authorization: bearer (token)" -H "Content-Type: text/plain; charset=utf-8" -d \$'<CompleteMultipartUpload> <Part> <PartNumber>1</PartNumber> <ETag>(etag)</ETag> </Part> <Part> <PartNumber>2</PartNumber> <ETag>(etag)</ETag> </Part>
- Add your file to the project as an asset. From the Assets page of your project, click the Find and add data icon. Then, from the Files pane, click the action menu and select Add as data set.
- Refine the data
- Analyze the data with notebooks or dashboards
- Analyze the data with models or AI
- Publish the data asset to a catalog