Speech to Text Services
Vocapia offers a variety of speech recognition services. In addition to
our speech transcription Web service, we offer services for batch
processing of large quantities of data and for the development of
customized models. We bring extensive experience and work with our
clients to provide tailored solutions to match their needs.
You can integrate VoxSigma software or service into your application today.
The VoxSigma sofware suite is offered as a Web service
via a REST API
over HTTPS, always providing customers access to
our latest systems thereby quickly benefiting from regular advances and take
advantage of additional features offered by the online environment.
service is available 24/7/365 with failover servers and geographic redundancy
- Supported languages : Arabic, Cantonese, Czech, Dutch, English, Finnish, French, German, Greek,
Hebrew, Hindi, Hungarian, Italian, Latvian, Lithuanian, Mandarin, Pashto, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Swahili, Swedish, Turkish, Ukrainian and Urdu.
- Main functionalities : Speech-to-text transcription, language
identification, speech-text synchronization.
Document based adaptation
Automatic on-the-fly adaptation allows the user to provide texts related to the
audio document being processed, what can be considered topic/domain
adaptation. These accompanying texts serve to increase the lexical coverage of
system and to adapt the language model to the specific domain
of the audio document with the aim of improving the transcription accuracy.
On-demand batch processing
Batch processing is offered as an
offline or online service to process audio and audiovisual archives, in
particular when specific needs and models are required
Tailoring models for your application is the best way to ensure you
get the best possible results for your needs. For speech-to-text
applications, high accuracy
is essential to maximize your
, as to a first
approximation, the cost of using automatic transcriptions in your
workflow is proportional to the system's error rate. Therefore using a
system with a 90% accuracy (i.e. 10% error) may cost almost twice that
of using a system with a 95% accuracy (i.e. 5% error).
If you are interested in a particular language or technology please
use our contact form
or our VoxSigma request form
, or send a note
directly to firstname.lastname@example.org
We provide hotline support (via email and phone) for our products and services
to help users and integrators solve problems in the shortest possible
timeframe [support form