The VoxSigma software suite offers large vocabulary speech-to-text
capabilities in multiple languages. It includes adaptive features allowing the
transcription of noisy speech, such as speech over background music. The
software suite has been designed for professional users needing to transcribe
large quantities of audio and video documents such as broadcast data, either in
batch mode or in real-time. Versions can also be used to transcribe call-center
data.
The full speech-to-text conversion process (also called voice-to-text
conversion) is done in three steps. The software first identifies the audio
segments containing speech, then identifies the language being spoken if it
is not known a priori, and finally converts the speech segments to text.
The speech-to-text processing result is a fully annotated XML document
including labels for speech and non-speech segments, speaker labels,
words with time codes, and high-quality confidence scores. This XML
file can be directly indexed by a search engine, or alternatively can
be converted into plain text with capitalization and punctuation.
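As a rough illustration of the XML-to-text conversion described above, the following Python sketch reads a small annotated transcript and produces one line of plain text per speech segment, optionally dropping low-confidence words. The element and attribute names used here (`AudioDoc`, `SpeechSegment`, `Word`, `spkid`, `conf`) and the `segments_to_text` helper are assumptions made for the example, not the documented VoxSigma output schema; adapt them to the actual file you receive.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document: element and attribute names are illustrative
# assumptions, not the documented VoxSigma schema.
SAMPLE_XML = """<AudioDoc>
  <SegmentList>
    <SpeechSegment spkid="spk1" lang="eng" stime="0.00" etime="1.00">
      <Word stime="0.00" dur="0.30" conf="0.98">hello</Word>
      <Word stime="0.35" dur="0.40" conf="0.42">world</Word>
    </SpeechSegment>
    <SpeechSegment spkid="spk2" lang="eng" stime="1.20" etime="2.30">
      <Word stime="1.20" dur="0.30" conf="0.91">good</Word>
      <Word stime="1.55" dur="0.50" conf="0.88">morning</Word>
    </SpeechSegment>
  </SegmentList>
</AudioDoc>"""


def segments_to_text(xml_string, min_conf=0.0):
    """Return one 'speaker: words' line per speech segment.

    Words whose confidence score is below min_conf are skipped.
    """
    root = ET.fromstring(xml_string)
    lines = []
    for segment in root.iter("SpeechSegment"):
        words = [
            word.text.strip()
            for word in segment.iter("Word")
            if word.text and float(word.get("conf", "1.0")) >= min_conf
        ]
        if words:
            speaker = segment.get("spkid", "unknown")
            lines.append(f"{speaker}: {' '.join(words)}")
    return lines


if __name__ == "__main__":
    for line in segments_to_text(SAMPLE_XML, min_conf=0.5):
        print(line)
```

Filtering on the confidence score is one possible use of the per-word annotations; a search-engine indexer might instead keep every word and store the score and time codes alongside it.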
| Platforms | Unix-like x86 and x86_64 (OpenSuse, Fedora, CentOS, Ubuntu, SuSE, Red Hat, Mac OS X, ...) |
| API | command line tools, C++ library |
| Audio | studio (e.g. broadcast) and telephone bandwidths |
| Key functions | audio segmentation, speaker segmentation, language identification, spoken word transcription (speech-to-text) |
| Operating modes | batch, real-time, single or multi-threaded |
| Outputs | XML with speaker diarization, language identification tags, word transcription, punctuation, confidence measures, numeral entities and other specific entities |
| Supported Languages | Arabic, Dutch, English (US, UK), French, Finnish, German, Greek, Italian, Mandarin, Polish, Portuguese, Romanian, Russian, Spanish (other language options are under development; contact us for more information) |
To get more information about the VoxSigma software suite or to get a
price quote, you may fill out our online request form.