| Home | About Us | Contact Us | Support | Twitter Linkedin Facebook RSS
Vocapia Logo Leading edge speech processing technology

Speech to Text Services

Vocapia offers a variety of speech recognition services. In addition to our speech transcription Web service, we offer services for batch processing of large quantities of data and for the development of customized models. We bring extensive experience and work with our clients to provide tailored solutions to match their needs.

Request Form You can integrate VoxSigma software or service into your application today.

VoxSigma SaaS

The VoxSigma sofware suite is offered as a Web service via a REST API over HTTPS, always providing customers access to our latest systems thereby quickly benefiting from regular advances and take advantage of additional features offered by the online environment. Our speech-to-text service is available 24/7/365 with failover servers and geographic redundancy [request form].
  • Supported languages : Arabic, Cantonese, Czech, Dutch, English, Finnish, French, German, Greek, Hebrew, Hindi, Hungarian, Italian, Latvian, Lithuanian, Mandarin, Pashto, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Swahili, Swedish, Turkish, Ukrainian and Urdu.
  • Main functionalities : Speech-to-text transcription, language identification, speech-text synchronization.
SaaS Status

Document based adaptation

Automatic on-the-fly adaptation allows the user to provide texts related to the audio document being processed, what can be considered topic/domain adaptation. These accompanying texts serve to increase the lexical coverage of the speech-to-text system and to adapt the language model to the specific domain of the audio document with the aim of improving the transcription accuracy.

On-demand batch processing

Batch processing is offered as an offline or online service to process audio and audiovisual archives, in particular when specific needs and models are required [request form].

Customized models

Tailoring models for your application is the best way to ensure you get the best possible results for your needs. For speech-to-text applications, high accuracy is essential to maximize your ROI, as to a first approximation, the cost of using automatic transcriptions in your workflow is proportional to the system's error rate. Therefore using a system with a 90% accuracy (i.e. 10% error) may cost almost twice that of using a system with a 95% accuracy (i.e. 5% error).

If you are interested in a particular language or technology please use our contact form or our VoxSigma request form, or send a note directly to contact@vocapia.com.


We provide hotline support (via email and phone) for our products and services to help users and integrators solve problems in the shortest possible timeframe [support form].


Tuesday July 16, 2024

© Vocapia Research SAS,
2006-2023. All rights reserved.

Legal Notice   Privacy
About Us
Apply for job
Contact Us
Request form
STT for Linux