Last updated: Nov 26, 2024
Description
The Watson Speech services for IBM® Software Hub offer speech recognition and speech synthesis capabilities for your applications:
- Watson Speech to Text for IBM Software Hub transcribes spoken audio into written text.
The service leverages machine learning to combine knowledge of grammar, language structure, and the
composition of audio and voice signals to accurately transcribe the human voice. It continuously
updates and refines its transcription as it receives more speech audio. The service is ideal for
applications that need to extract high-quality speech transcripts for use cases such as call
centers, customer care, agent assistance, and similar solutions.
For more information about the service, see About Watson Speech to Text.
- Watson Text to Speech for IBM Software Hub synthesizes natural-sounding speech from
written text. The service streams the results back to the client with minimal delay. The service is
appropriate for voice-driven and screenless applications, where audio is the preferred method of
output.
For more information about the service, see About Watson Text to Speech.
You can customize the services to suit your language and application needs. Both services offer HTTP and WebSocket programming interfaces that make them suitable for any application that produces or accepts audio.
The services add a tool or other type of interface that runs in IBM Cloud, outside of IBM Software Hub as a Service, and they provide APIs that you can run in notebooks.
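As a rough illustration of the HTTP and WebSocket interfaces mentioned above, the following Python sketch builds (but does not send) a speech recognition request and derives a matching WebSocket address. The `/v1/recognize` path, the bearer-token header, the `model` query parameter, and the helper names `build_recognize_request` and `websocket_url` are assumptions made for this example and are not taken from this page. Check the API reference for your deployment before relying on them.

```python
from urllib.parse import urlencode, urlsplit, urlunsplit
from urllib.request import Request


def build_recognize_request(service_url, token, audio, content_type="audio/flac", model=None):
    """Build, without sending, an HTTP POST request that carries audio to transcribe.

    Assumptions (not confirmed by this page): the endpoint path is /v1/recognize,
    authentication uses a bearer token, and the model is chosen with a query parameter.
    """
    query = urlencode({"model": model}) if model else ""
    url = f"{service_url.rstrip('/')}/v1/recognize" + (f"?{query}" if query else "")
    return Request(
        url,
        data=audio,  # raw audio bytes form the request body
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": content_type,
        },
    )


def websocket_url(service_url):
    """Derive a WebSocket URL from the HTTP service URL (assumed to share the same path)."""
    parts = urlsplit(service_url)
    scheme = "wss" if parts.scheme == "https" else "ws"
    path = parts.path.rstrip("/") + "/v1/recognize"
    return urlunsplit((scheme, parts.netloc, path, "", ""))


if __name__ == "__main__":
    # Placeholder values only; replace them with your instance URL and token.
    request = build_recognize_request(
        "https://example.invalid/stt", "YOUR_TOKEN", b"", model="en-US_Telephony"
    )
    print(request.get_method(), request.full_url)
    print(websocket_url("https://example.invalid/stt"))
```

Building the request separately from sending it keeps the sketch runnable without a live service instance. In a notebook, you could pass the resulting object to `urllib.request.urlopen` once the URL and token point at a real instance.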
Quick links
- Use: Work with the service
- Develop: Write code and build applications
- What's new: See what's new each week
- Create: Create the service instance
Integrated services
Service | Capability |
---|---|
watsonx Assistant | Build your own branded assistant into any device, application, or channel. Users interact with your application through the user interface that you implement. |