Speech to Text API
The VoxSigma REST API is so simple that you can integrate our speech-to-text
service in your application by adding only one command-line in your application
script. It can be used with command-line HTTP clients such as cURL, or with HTTP
processing functions: language
, and speech-text
You can integrate our speech-to-text technology today.
VoxSigma API Features
- Protocol : REST API over HTTPS;
POST, GET and PUT HTTP methods are accepted;
Both URI encoded requests and MIME multi-part requests are supported;
Three submission modes: file, streaming, and real-time.
- Audio file format : AAC, AIFF, ASF, FLAC,
MS-Wave, MPEG, Ogg/Vorbis, Nist Sphere, Sun AU
- Audio type : telephone or
broadcast quality, most sampling rates are supported.
- Functions : language identification,
audio and speaker segmentation, speech-to-text conversion, and speech-text alignment.
- Output : XML data with speaker
diarization, language identification tags, word transcription, punctuation,
confidence measures, numerical entities and other specific entities.
We provide hotline support (via email and phone) for our products and services to
help users and system integrators solve problems in the shortest