Speech synthesis is the artificial production of human speech. ... Speech recognition (in many contexts, also known as automatic speech recognition, computer speech recognition or voice recognition) is the process of converting a speech signal to a set of words, by means of an algorithm implemented as a computer program. ... The task of recognizing people from their voices is termed speaker recognition. ... Speech coding is the compression of speech (into a code) for transmission with speech codecs that use audio signal processing and speech processing techniques. ...
Loquendo is the only speech technology vendor that provides a complete product line for servers, desktops, PDAs and embedded devices, guaranteeing the same wide range of languages and the same core engine in all these environments.
As the speech market expands rapidly across the world, Loquendo, with over 30 years of experience in speech technology, has continually responded to market needs by expanding into new markets. Last July it moved to expand its presence in Spain and Portugal after recognizing the growing demand for speech technology in those countries.
Speech technology provider Loquendo has been continuously growing its TTS family of voices and, with its latest launch, has introduced two new female voices, Soledad and Zeynep.
Speech synthesis systems use two basic approaches to determine the pronunciation of a word based on its spelling, a process which is often called text-to-phoneme or grapheme-to-phoneme conversion, as phoneme is the term used by linguists to describe distinctive sounds in a language.
Speech synthesis systems for languages with a very regular spelling system often use the rule-based method as the core means of text-to-phoneme conversion, resorting to dictionaries only for those few words, like foreign names and borrowings, whose pronunciation is not obvious from the spelling.
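The rule-first, dictionary-as-fallback arrangement described above can be sketched roughly as follows. This is only a toy illustration: the function name `to_phonemes`, the rule table, the exception entry and the ASCII phoneme symbols are all assumptions made up for this example, not the rules of any real language or synthesis engine.

```python
# Toy grapheme-to-phoneme converter: an exception dictionary is consulted
# first, and letter-to-sound rules handle every other word.
# All entries and phoneme symbols are illustrative assumptions.

# Hypothetical exception dictionary for words (e.g. borrowings) whose
# pronunciation the spelling rules would get wrong.
EXCEPTIONS = {
    "jeep": ["dZ", "i", "p"],
}

# Hypothetical letter-to-sound rules. An empty string marks a silent letter.
RULES = {
    "ch": "tS", "ll": "L", "qu": "k", "rr": "R",
    "c": "k", "h": "", "j": "x", "v": "b", "z": "s",
}


def to_phonemes(word):
    """Return a list of phoneme symbols for a single word."""
    word = word.lower()

    # Dictionary lookup covers the few irregular words.
    if word in EXCEPTIONS:
        return list(EXCEPTIONS[word])

    phonemes = []
    i = 0
    while i < len(word):
        # Prefer two-letter graphemes over single letters.
        for size in (2, 1):
            chunk = word[i:i + size]
            if len(chunk) == size and chunk in RULES:
                phoneme = RULES[chunk]
                break
        else:
            # Default rule: the letter is pronounced as written.
            chunk = word[i]
            phoneme = chunk
        if phoneme:  # skip silent letters
            phonemes.append(phoneme)
        i += len(chunk)
    return phonemes


print(to_phonemes("chico"))
print(to_phonemes("hola"))
print(to_phonemes("jeep"))
```

A real system would use context-sensitive rules (for example, a letter pronounced differently before certain vowels) and would also assign stress, but the lookup order stays the same: exceptions first, rules for everything else.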
Speech synthesis markup languages should be distinguished from dialogue markup languages such as VoiceXML, which includes, in addition to text-to-speech markup, tags related to speech recognition, dialogue management and touchtone dialing.
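To make the distinction concrete, here is a minimal sketch of what pure text-to-speech markup can look like, assuming the W3C Speech Synthesis Markup Language (SSML) as the example; the passage above does not name a specific language. The helper `build_ssml` and its parameters are hypothetical, and the namespace and `xml:lang` attributes that a fully conformant SSML document would carry are omitted for brevity.

```python
import xml.etree.ElementTree as ET


def build_ssml(before, emphasized, after, pause_ms=300):
    """Build a small SSML-style fragment (illustrative helper, not a real API).

    The output contains only synthesis markup: emphasis and a pause.
    It has no recognition grammars, dialogue flow or touchtone handling,
    which is what a dialogue markup language would add on top.
    """
    speak = ET.Element("speak", {"version": "1.0"})
    speak.text = before

    # <emphasis> asks the synthesizer to stress the enclosed text.
    emphasis = ET.SubElement(speak, "emphasis", {"level": "strong"})
    emphasis.text = emphasized

    # <break> inserts a pause of the given duration.
    pause = ET.SubElement(speak, "break", {"time": f"{pause_ms}ms"})
    pause.tail = after

    return ET.tostring(speak, encoding="unicode")


print(build_ssml("Your flight leaves at ", "nine", " from gate four."))
```

Every element here only shapes how the text is spoken, which is the scope of a speech synthesis markup language as opposed to a dialogue markup language such as VoiceXML.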