Adding very large objects to a project’s Cloud Object Storage
The amount of data you can load to a project’s Cloud Object Storage at any one time depends on where you load the data from. If you are using Watson Studio to load the data, the limit is 5 GB. To add larger objects to a project’s Cloud Object Storage, you can use any of the following methods:
- The Cloud Object Storage API
- An FTP client
- The IBM Cloud Object Storage Python SDK (in case you can’t use an FTP client)
Load data in multiple parts by using the Cloud Object Storage API
With the Cloud Object Storage API, you can load data objects as large as 5 GB in a single PUT, and objects as large as 5 TB by loading the data into object storage as a set of parts which can be loaded independently in any order and in parallel. After all of the parts have been loaded, they are presented as a single object in Cloud Object Storage.
You can load files with these formats and mime types in multiple parts:
- text/plain; charset=utf-8
To load a data object in multiple parts:
- Initiate a multipart load:
curl -X "POST" "https://(endpoint)/(bucket-name)/(object-name)?uploads" -H "Authorization: bearer (token)"
The values for
tokenare on the Watson Studio project’s Settings page. Click Manage in IBM Cloud on the Watson Studio for the endpoint value.
- Load the parts by specifying arbitrary sequential part numbers and an UploadId for the object:
curl -X "PUT" "https://(endpoint)/(bucket-name)/(object-name)?partNumber=(sequential-integer)&uploadId=(upload-id)" -H "Authorization: bearer (token)" -H "Content-Type: (content-type)"
- Complete the multipart load:
curl -X "POST" "https://(endpoint)/(bucket-name)/(object-name)?uploadId=(upload-id)" -H "Authorization: bearer (token)" -H "Content-Type: text/plain; charset=utf-8" -d \$'<CompleteMultipartUpload> <Part> <PartNumber>1</PartNumber> <ETag>(etag)</ETag> </Part> <Part> <PartNumber>2</PartNumber> <ETag>(etag)</ETag> </Part>
- Add your file to the project as an asset. From the Assets page of your project, click the Find and add data icon. Then, from the Files pane, click the action menu and select Add as data set.
- Refining the data
- Analyzing the data with notebooks or dashboards
- Analyzing the data with models or AI
- Publishing the data asset to a catalog