Patent application number | Description | Published |
20080270139 | Converting text-to-speech and adjusting corpus - The present invention provides a method and apparatus for text-to-speech conversion, and a method and apparatus for adjusting a corpus. The text-to-speech method comprises: a text analysis step for parsing the text to obtain descriptive prosody annotations of the text based on a TTS model generated from a first corpus; a prosody parameter prediction step for predicting the prosody parameter of the text according to the result of the text analysis step; and a speech synthesis step for synthesizing speech of said text based on said prosody parameter of the text; wherein the descriptive prosody annotations of the text include a prosody structure for the text, and the prosody structure of the text is adjusted according to a target speech speed for the synthesized speech. The present invention adjusts the prosody structure of the text according to the target speech speed, so that the synthesized speech has improved quality. | 10-30-2008 |
20090037179 | Method and Apparatus for Automatically Converting Voice - The invention proposes a method and apparatus for significantly improving the quality of voice morphing and guaranteeing the similarity of converted voice. The invention sets several standard speakers in a TTS database, and selects the voices of different standard speakers for speech synthesis according to different roles, wherein the voice of the selected standard speaker is similar to the original role to a certain extent. Then the invention further performs voice morphing on the standard voice similar to the original voice to a certain extent, in order to accurately mimic the voice of the original speaker, so as to make the converted voice closer to the original voice features while guaranteeing the similarity. | 02-05-2009 |
20090089063 | VOICE CONVERSION METHOD AND SYSTEM - A method, system and computer program product for voice conversion. The method includes performing speech analysis on the speech of a source speaker to achieve speech information; performing spectral conversion based on said speech information, to at least achieve a first spectrum similar to the speech of a target speaker; performing unit selection on the speech of said target speaker at least using said first spectrum as a target; replacing at least part of said first spectrum with the spectrum of the selected target speaker's speech unit; and performing speech reconstruction at least based on the replaced spectrum. | 04-02-2009 |
20090299746 | METHOD AND SYSTEM FOR SPEECH SYNTHESIS - A method for performing speech synthesis to a textual content at a client. The method includes the steps of: performing speech synthesis to the textual content based on a current acoustical unit set S | 12-03-2009 |
20110054901 | METHOD AND APPARATUS FOR ALIGNING TEXTS - A method and apparatus for aligning texts. The method includes acquiring a target text and a reference text and aligning the target text and the reference text at word level based on phoneme similarity. The method can be applied to automatically archiving a multimedia resource and to automatically searching a multimedia resource. | 03-03-2011 |
20110214979 | REACTIVE DISTILLATION APPARATUS FOR A MULTISTAGE COUNTER-CURRENT ROTATING BED AND ITS APPLICATION - The present invention discloses a reactive distillation apparatus for multistage counter-current rotating bed and its application, the apparatus comprises a closed shell, in the center of which a revolving shaft linking each shell section is set, the said shaft is provided with two or more rotors in series connection, a feeding inlet, a reflux inlet and an outlet of the gas phase are mounted on the top end face of the shell while a waste liquid outlet and an inlet of the gas phase are set on the bottom end face of the shell, the said shell consists of an upper section of the shell and a lower section of the shell along the axial direction, the said rotor consists of a rotating disc firmly connecting with the revolving shaft and a static disc mounted to the shell, a group of concentric dynamic filler rings but with different diameters are installed at intervals along the radial direction, wherein the wall of the dynamic filler rings is holed, and the ring clearance between the dynamic filler rings is configured with static rings fastened on the static disc; the filler filled in the said dynamic filler ring includes a catalytic filler and a wire gauze filler with the catalyst filler filled in the dynamic filler ring of the outer circle of the upper rotor and the inner circle of the lower rotor and the wire gauze filler filled in the rest of the dynamic filler rings, to make the whole rotor structure equivalent to the distillation section, reactive distillation section and stripping section; a feeding inlet is arranged on the top cover of the shell corresponding to the spray nozzle of raw material liquid; a rotating liquid distributor is arranged on the inner side of the innermost dynamic filler ring of the said lower rotor. The catalyst of the present invention not only plays the role of catalytic reaction, but also increases the interphase mass transfer area; the present invention improves the mass transfer efficiency and the separation efficiency of the reactive distillation process. | 09-08-2011 |
20110270605 | ASSESSING SPEECH PROSODY - A method, system and computer readable storage medium for assessing speech prosody. The method includes the steps of: receiving input speech data; acquiring a prosody constraint; assessing prosody of the input speech data according to the prosody constraint; and providing an assessment result, where at least one of the steps is carried out using a computer device. | 11-03-2011 |
20130054244 | METHOD AND SYSTEM FOR ACHIEVING EMOTIONAL TEXT TO SPEECH - A method and system for achieving emotional text to speech. The method includes: receiving text data; generating an emotion tag for the text data by a rhythm piece; and achieving TTS to the text data corresponding to the emotion tag, where the emotion tag is expressed as a set of emotion vectors, and where each emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories. A system for the same includes: a text data receiving module; an emotion tag generating module; and a TTS module for achieving TTS, wherein the emotion tag is expressed as a set of emotion vectors, and wherein each emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories. | 02-28-2013 |
20140019121 | DATA PROCESSING METHOD, PRESENTATION METHOD, AND CORRESPONDING APPARATUSES - A data processing method includes obtaining text information corresponding to a presented content, the presented content comprising a plurality of areas; performing text analysis on the text information to obtain a first keyword sequence, the first keyword sequence including area keywords associated with at least one area of the plurality of areas; obtaining speech information related to the presented content, the speech information at least comprising a current speech segment; and using a first model network to perform analysis on the current speech segment to determine the area corresponding to the current speech segment, wherein the first model network comprises the first keyword sequence. | 01-16-2014 |
20140019133 | DATA PROCESSING METHOD, PRESENTATION METHOD, AND CORRESPONDING APPARATUSES - A data processing method includes obtaining text information corresponding to a presented content, the presented content comprising a plurality of areas; performing text analysis on the text information to obtain a first keyword sequence, the first keyword sequence including area keywords associated with at least one area of the plurality of areas; obtaining speech information related to the presented content, the speech information at least comprising a current speech segment; and using a first model network to perform analysis on the current speech segment to determine the area corresponding to the current speech segment, wherein the first model network comprises the first keyword sequence. | 01-16-2014 |
20140095160 | CORRECTING TEXT WITH VOICE PROCESSING - The present invention relates to voice processing and provides a method and system for correcting a text. The method comprises: determining a target text unit to be corrected in a text; receiving a reference voice segment input by the user for the target text unit; determining a reference text unit whose pronunciation is similar to a word in the target text unit based on the reference voice segment; and correcting the word in the target text unit in the text by the reference text unit. The present invention enables the user to easily correct errors in the text vocally. | 04-03-2014 |
20140129220 | SPEAKER AND CALL CHARACTERISTIC SENSITIVE OPEN VOICE SEARCH - Techniques disclosed herein include systems and methods for open-domain voice-enabled searching that is speaker sensitive. Techniques include using speech information, speaker information, and information associated with a spoken query to enhance open voice search results. This includes integrating a textual index with a voice index to support the entire search cycle. Given a voice query, the system can execute two matching processes simultaneously. This can include a text matching process based on the output of speech recognition, as well as a voice matching process based on characteristics of a caller or user voicing a query. Characteristics of the caller can include output of voice feature extraction and metadata about the call. The system clusters callers according to these characteristics. The system can use specific voice and text clusters to modify speech recognition results, as well as modifying search results. | 05-08-2014 |
20140136198 | CORRECTING TEXT WITH VOICE PROCESSING - The present invention relates to voice processing and provides a method and system for correcting a text. The method comprises: determining a target text unit to be corrected in a text; receiving a reference voice segment input by the user for the target text unit; determining a reference text unit whose pronunciation is similar to a word in the target text unit based on the reference voice segment; and correcting the word in the target text unit in the text by the reference text unit. The present invention enables the user to easily correct errors in the text vocally. | 05-15-2014 |
20140298186 | ADJUSTING INFORMATION PROMPTING IN INPUT METHOD - A computer-implemented method and apparatus for adjusting information prompting in an input method. The method includes: obtaining prompt information displayed in response to entering a word in an input box by a user; and adjusting the sequence of subsequent prompt words in a prompt box of the input method according to the prompt information. The method for adjusting information prompting in an input method according to the embodiments of the present invention can adjust the sequence of prompt words in the prompt box of the input method in real time based on prompt information in the prompt box to facilitate user selection. | 10-02-2014 |