| Home | About Us | Contact Us | Support | Twitter Linkedin Facebook RSS
Vocapia Logo Leading edge speech processing technology

Speech-to-Text Software

Broadcast Monitoring - Lecture and Seminar Transcription - Video Subtitling - Conference Call and Voicemail Transcription - Speech Analytics - Speech Recognition

Vocapia Research develops leading-edge, multilingual speech processing technologies. These technologies include large vocabulary continuous speech recognition, automatic audio segmentation, language identification, speaker diarization and audio-text synchronization. Vocapia's VoxSigma™ speech-to-text software suite delivers state-of-the-art performance in many languages for a variety of audio data types, including broadcast data, parliamentary hearings and conversational data. [REQUEST FORM]

voxsigma speech-to-text VoxSigma Software Suite

The VoxSigma software suite provides large vocabulary speech recognition capabilities in multiple languages, as well as audio segmentation and partitioning, speaker identification and language recognition. The software suite has been designed for professional users needing to transcribe large quantities of audio and video documents such as broadcast data, either in batch mode or in real-time. Versions specifically target the transcription of conversational telephone speech and call-center data. [MORE]
 

speech-to-text SaaS  VoxSigma SaaS

VoxSigma is available as a Web service via our REST speech-to-text API. The VoxSigma SaaS offers full speech transcription, audio indexing and speech-text alignment capabilities via a REST API over HTTPS allowing customers to quickly reap the benefits of regular improvements to the technology and take advantage of additional features offered by the online environment. The VoxSigma SaaS is available 24/7/365 with failover servers and geographic redundancy. [MORE]

Speech-to-Text Conversion

Large vocabulary continuous speech recognition, also called speech-to-text or voice-to-text conversion is the key technology for enabling content-based information access in audio and video documents. Once automatically processed the linguistic information and metadata in the structured document are available for further downstream processing, providing direct access to relevant portions of audio documents. Among the most common applications of our technology are audio and audiovisual data mining (broadcast and telephone data), speech analytics, media monitoring, media asset management, speech transcription and subtitling.

We provide solutions and expertise for core speech processing technologies in many languages. For example, speech-to-text transcription is available for the Arabic, Dutch, English, Finnish, French, German, Greek, Italian, Lithuanian, Mandarin, Polish, Portuguese, Romanian, Russian, Spanish and Turkish languages, with several others under development. Our language identification module identifies the spoken language from a set of 50 languages, and clients can create models for their desired language set. We also work with our clients to adapt, tune or create specific models or systems tailored to their application needs. [REQUEST FORM]

Building upon Speech-to-Text Software

audio indexing
Broadcast monitoring & audio visual archive indexing   The VoxSigma software suite offers advanced language technologies including speech recognition, language identification and speaker diarization to transform raw audio data into structured and searchable XML documents, enabling users to access content in video documents. [MORE]

transcription of speeches Debate and lecture transcription and indexing   VoxSigma helps reduce the production time and cost to produce transcripts, minutes and/or summaries of public presentations and meetings. VoxSigma also aligns existing transcriptions with audio files, thus significantly enhancing usability. This same alignment technology is used for audiobooks. [MORE]

speech analytics Telephone Speech Analytics   Vocapia's speech recognition software and language identification software process telephone data making the recorded calls searchable and analyzable via text-based methods. VoxSigma is used by call management companies and for defense applications. The transcripts are further analyzed and categorized, generating statistics about customer calls. [MORE]

 
teleconf transcription
Transcription of business conference calls   Vocapia's speech recognition software significantly reduces the cost of transcribing business conference calls. The audio document is converted to a fully annotated XML document including speech and non speech segments, speaker labels, words with time codes, high quality confidence scores, as well as punctuation. Vocapia offers services to adapt, tune or create specific models or systems tailored to exactly match the application needs. [MORE]

subtitling Video Subtitling   While fully automatic processing generally does not deliver high enough quality subtitles, Vocapia's speaker diarization, speech-to-text transcription and speech-text alignment technologies significantly reduce the effort entailed when closely integrated in the subtitle creation process. [MORE]

Discover More...

The VoxSigma speech recognition software suite is the latest generation of transcription software offered by Vocapia Research, building upon accurate statistical modeling techniques for speech production and perception. The VoxSigma software suite is offered as a stand-alone solution under Linux and as a Web service. [REQUEST FORM]

We offers services to adapt, tune or create specific models or systems tailored to exactly match your needs. Tailoring models for your application is the best way to ensure you get the best possible results for your needs and high accuracy is essential to maximize your ROI. In addition to our online speech recognition service, we offer services for batch processing of very large quantities of data such as archives.

 
Wednesday September 03, 2014

© Vocapia Research SAS,
2006-2014. All rights reserved.

Legal Notice   Privacy
About Us
API
Apply for job
Apps
Contact Us
Logos
FAQs
Glossary
News
Publications
Request form
Services
Speech-to-text
STT for Linux
Support
Technologies
Videos
VoxSigma