Speech to Text Services

Vocapia offers a variety of speech recognition services. In addition to our speech transcription Web service, we offer services for batch processing of large quantities of data and for the development of customized models. We bring extensive experience and work with our clients to provide tailored solutions to match their needs.

Request Form
You can integrate VoxSigma software or service into your application today.

The VoxSigma sofware suite is offered as a Web service via a REST API over HTTPS, always providing customers access to our latest systems thereby quickly benefiting from regular advances and take advantage of additional features offered by the online environment. Our speech-to-text service is available 24/7/365 with failover servers and geographic redundancy [request form].

Supported languages : Arabic, Cantonese, Czech, Dutch, English, Finnish, French, German, Greek, Hebrew, Hindi, Hungarian, Italian, Latvian, Lithuanian, Mandarin, Pashto, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Swahili, Swedish, Turkish, Ukrainian and Urdu.
Main functionalities : Speech-to-text transcription, language identification, speech-text synchronization.

SaaS Status

Automatic on-the-fly adaptation allows the user to provide texts related to the audio document being processed, what can be considered topic/domain adaptation. These accompanying texts serve to increase the lexical coverage of the speech-to-text system and to adapt the language model to the specific domain of the audio document with the aim of improving the transcription accuracy.

Batch processing is offered as an offline or online service to process audio and audiovisual archives, in particular when specific needs and models are required [request form].

Tailoring models for your application is the best way to ensure you get the best possible results for your needs. For speech-to-text applications, high accuracy is essential to maximize your ROI, as to a first approximation, the cost of using automatic transcriptions in your workflow is proportional to the system's error rate. Therefore using a system with a 90% accuracy (i.e. 10% error) may cost almost twice that of using a system with a 95% accuracy (i.e. 5% error).

If you are interested in a particular language or technology please use our contact form or our VoxSigma request form, or send a note directly to contact@vocapia.com.

If your data processing needs are relatively low or are irregular, or if you need to process video data or want to manually adapt or correct the automatic transcriptions, please check out our partner's service

. This service pay-as-you-go also offers many export formats such as XML, CSV, SRT, SBV, RTF, VTT, PDF, DOC, DOCX.

We provide hotline support (via email and phone) for our products and services to help users and integrators solve problems in the shortest possible timeframe [support form].

VoxSigma SaaS

Document based adaptation

On-demand batch processing

Customized models

Support