Vocapia Research announces the availability of a speech-text alignment
feature on its web service. Speech-text alignment is the process
of synchronizing a speech signal with a speech transcript or a closely
related text, providing timecodes for words and sentences. There are
many uses of this technology, including audio books, language
learning, and video subtitling.
The speech-text alignment process assigns timecodes to each word
and each punctuation mark in the audio transcript and provides
confidence scores to identify areas where the alignment may be
unreliable, in particular where the transcript differs from what was
actually said.
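As an illustration of how such confidence scores might be used, the following minimal sketch groups consecutive low-confidence tokens into time spans that a reviewer could check by ear. It does not reflect the actual output format of the Vocapia service: the `AlignedToken` structure, the `low_confidence_spans` function, the 0.5 threshold, and the sample values are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class AlignedToken:
    """Hypothetical aligned token: a word or punctuation mark with timecodes."""
    text: str
    start: float       # start time in seconds
    end: float         # end time in seconds
    confidence: float  # assumed to range from 0.0 to 1.0


def low_confidence_spans(tokens, threshold=0.5):
    """Group consecutive tokens whose confidence is below the threshold.

    Returns a list of (start, end, text) tuples, one per suspicious region.
    """
    spans = []
    current = []
    for tok in tokens:
        if tok.confidence < threshold:
            current.append(tok)
        elif current:
            # A confident token closes the current low-confidence region.
            spans.append((current[0].start, current[-1].end,
                          " ".join(t.text for t in current)))
            current = []
    if current:
        # Flush a region that runs to the end of the transcript.
        spans.append((current[0].start, current[-1].end,
                      " ".join(t.text for t in current)))
    return spans


# Made-up sample data: the transcript says "wonderful world" where the
# speaker may have said something else, so those tokens score low.
tokens = [
    AlignedToken("Hello", 0.00, 0.42, 0.98),
    AlignedToken(",", 0.42, 0.42, 0.95),
    AlignedToken("wonderful", 0.50, 1.10, 0.31),
    AlignedToken("world", 1.10, 1.55, 0.40),
    AlignedToken(".", 1.55, 1.55, 0.97),
]
spans = low_confidence_spans(tokens)
for start, end, text in spans:
    print(f"check {start:.2f}s to {end:.2f}s: {text}")
```

Grouping adjacent tokens rather than reporting them one by one keeps the review list short when a whole phrase in the transcript differs from the audio.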
This new functionality is offered on the Vocapia web service via a simple
and efficient REST API, as is Vocapia's speech-to-text technology.
Vocapia Research, founded in July 2000, is an R&D company and
software publisher developing and providing leading edge speech
technologies and solutions for many languages, including most major
European Union languages as well as Arabic, Mandarin, and Russian. The
Vocapia Research VoxSigma® software suite uses advanced
language technologies such as language identification, speech
recognition, and speaker identification to transform raw audio and
audiovisual data into structured and searchable XML documents. This
technology relies on over 25 years of research at LIMSI-CNRS, with
which Vocapia Research has a privileged partnership. Joint systems developed
with LIMSI have achieved top ranks in national and international
speech-to-text transcription challenges. The most common
applications of the VoxSigma software suite are audio and audiovisual
data mining (broadcast data, podcasts, call center data), media
monitoring, and media asset management. Vocapia Research is located in
the science cluster of the Saclay Plateau, France. For more
information about Vocapia Research, visit the Vocapia Research
website or use the contact information page at
http://www.vocapia.com/contact.