Patent application number | Description | Published |
20090259461 | Gain Control System, Gain Control Method, and Gain Control Program - Disclosed is a gain control system in which speech model constituted from a sound pressure and a feature is stored in a speech model storage unit for each of a plurality of phonemes or for each of clusters into which a speech is divided. When an input signal is given, a feature conversion unit calculates a feature and a sound pressure of the input signal. A sound pressure comparison unit determines a sound pressure ratio between the input signal and each of speech models. A distance calculation unit calculates a distance between the feature of the input signal and the feature of each of the speech models. A gain calculation unit calculates a gain value from the sound pressure ratio and information on the distance. A sound pressure compensation unit thereby compensates for the sound pressure of the input signal. | 10-15-2009 |
20100070277 | VOICE RECOGNITION DEVICE, VOICE RECOGNITION METHOD, AND VOICE RECOGNITION PROGRAM - A voice recognition device that recognizes a voice of an input voice signal, comprises a voice model storage unit that stores in advance a predetermined voice model having a plurality of detail levels, the plurality of detail levels being information indicating a feature property of a voice for the voice model; a detail level selection unit that selects a detail level, closest to a feature property of an input voice signal, from the detail levels of the voice model stored in the voice model storage unit; and a parameter setting unit that sets parameters for recognizing the voice of an input voice according to the detail level selected by the detail level selection unit. | 03-18-2010 |
20100268532 | SYSTEM, METHOD AND PROGRAM FOR VOICE DETECTION - A system for voice detection includes a feature value calculation unit that calculates a feature value from an input signal sliced on a per frame basis, a provisional voice/non-voice decision unit that provisionally decides a voiced interval and a non-voiced interval from the feature value calculated on a per frame basis, and a voice/non-voice decision unit that determines a voiced interval duration threshold value or a non-voiced interval duration threshold value, using a ratio of the feature value found on a per frame basis to a threshold value for the feature value and that re-decides the voiced interval and the non-voiced interval, using the voiced interval duration threshold value determined and the non-voiced interval duration threshold value determined. By determining the voiced interval duration threshold value and the non-voiced interval duration threshold value, using the feature value found on a per frame basis and the threshold value for the feature value, the constraint of the shaping rule may be made weaker, or stronger in case the feature value found on a per frame basis can be regarded as being reliable or not, thereby allowing voice detection to be made without dependency upon a noise environment. | 10-21-2010 |
20110246185 | VOICE ACTIVITY DETECTOR, VOICE ACTIVITY DETECTION PROGRAM, AND PARAMETER ADJUSTING METHOD - A frame extracting means | 10-06-2011 |
20120116765 | SPEECH PROCESSING DEVICE, METHOD, AND STORAGE MEDIUM - A speech recognition unit ( | 05-10-2012 |
20120239401 | VOICE RECOGNITION SYSTEM AND VOICE RECOGNITION METHOD - Provided is a voice recognition system capable of, while suppressing negative influences from sound not to be recognized, correctly estimating utterance sections that are to be recognized. A voice segmenting means calculates voice feature values, and segments voice sections or non-voice sections by comparing the voice feature values with a threshold value. Then, the voice segmenting means determines, to be first voice sections, those segmented sections or sections obtained by adding a margin to the front and rear of each of those segmented sections. On the basis of voice and non-voice likelihoods, a search means determines, to be second voice sections, sections to which voice recognition is to be applied. A parameter updating means updates the threshold value and the margin. The voice segmenting means determines the first voice sections by using the one of the threshold value and the margin which has been updated by the parameter updating means. | 09-20-2012 |
20120310866 | DATA PROCESSING DEVICE, COMPUTER PROGRAM THEREFOR AND DATA PROCESSING METHOD - A plurality of pruning measures (PM) are calculated from a feature amount (CV) of test data (TD) which is input, a plurality of isopycnic surfaces (EC) are plotted and set on a threshold space (SS), a threshold curved surface (SC) in which a decrease in at least one of a plurality of pruning measures (PM) causes an increase in at least one thereof is generated using a portion of one isopycnic surface (EC) as a part, a hypothesis curved surface (HC) of subject data (CD) is generated on the threshold space (SS) to set a position intersecting the threshold curved surface (SC) to a pruning threshold (PS), and a plurality of hypotheses of the subject data (CD) are pruned. Thereby, there is provided a data processing device of which at least one of the recognition speed and the recognition accuracy is higher than in the related art. | 12-06-2012 |
20130024192 | ATMOSPHERE EXPRESSION WORD SELECTION SYSTEM, ATMOSPHERE EXPRESSION WORD SELECTION METHOD, AND PROGRAM - Disclosed is an information display system provided with: a signal analyzing unit which analyzes the audio signals obtained from a predetermined location and which generates ambient sound information regarding the sound generated at the predetermined location; and an ambient expression selection unit which selects an ambient expression which expresses the content of what a person is feeling from the sound generated at the predetermined location on the basis of the ambient sound information. | 01-24-2013 |
20130132078 | VOICE ACTIVITY SEGMENTATION DEVICE, VOICE ACTIVITY SEGMENTATION METHOD, AND VOICE ACTIVITY SEGMENTATION PROGRAM - Provided is a noise-robust voice activity segmentation device which updates parameters used in the determination of voice-active segments without burdening the user, and also provided are a voice activity segmentation method and a voice activity segmentation program. | 05-23-2013 |
20130144609 | TEXT PROCESSING SYSTEM, TEXT PROCESSING METHOD, AND TEXT PROCESSING PROGRAM - Provided is a text processing system capable of avoiding declining processing efficiency in analyses of text that does not contain breaks. | 06-06-2013 |
20130185068 | SPEECH RECOGNITION DEVICE, SPEECH RECOGNITION METHOD AND PROGRAM - The present invention provides a speech recognition device includes a threshold value candidate generation unit which extracts a feature indicating likeliness of being speech from a temporal sequence of input sound, and generates a plurality of threshold value candidates for discriminating between speech and non-speech; a speech determination unit which, by comparing the feature indicating likeliness of being speech with the plurality of threshold value candidates, determines respective speech sections, and outputs determination information as a result of the determination; a search unit which corrects each of the speech sections represented by the determination information, using a speech model and a non-speech model; and a parameter update unit which estimates a threshold value for determining a speech section, on the basis of distribution profiles of the feature respectively in utterance sections and in non-utterance sections, within each of the corrected speech sections, and makes an update with the threshold value. | 07-18-2013 |
20130231929 | SPEECH RECOGNITION DEVICE, SPEECH RECOGNITION METHOD, AND COMPUTER READABLE MEDIUM - The present invention can increase the types of noises that can be dealt with enough to enable speech recognition with a speech recognition rate of high accuracy. | 09-05-2013 |
20130282370 | SPEECH PROCESSING APPARATUS, CONTROL METHOD THEREOF, STORAGE MEDIUM STORING CONTROL PROGRAM THEREOF, AND VEHICLE, INFORMATION PROCESSING APPARATUS, AND INFORMATION PROCESSING SYSTEM INCLUDING THE SPEECH PROCESSING APPARATUS - An apparatus of this invention is a speech processing apparatus that acquires pseudo speech from a mixture sound including desired speech and noise. The speech processing apparatus includes a first microphone that inputs a first mixture sound including desired speech and noise and outputs a first mixture signal, a second microphone that is opened to the same sound space as that of the first microphone, inputs a second mixture sound including the desired speech and the noise at a ratio different from the first mixture sound, and outputs a second mixture signal, a first sound collector including a concave surface that collects the first mixture sound to the first microphone, a second sound collector including a concave surface that collects the second mixture sound to the second microphone and disposed in a direction different from the first sound collector, and a noise suppression circuit that suppresses an estimated noise signal based on the first mixture signal and the second mixture signal and outputs a pseudo speech signal. With this arrangement, it is possible to, in a single sound space where desired speech and noise mix, collect the desired speech and the noise, correctly estimate the noise, and reconstruct pseudo speech close to the desired speech. | 10-24-2013 |
20130297303 | SPEECH PROCESSING APPARATUS, CONTROL METHOD THEREOF, STORAGE MEDIUM STORING CONTROL PROGRAM THEREOF, AND VEHICLE, INFORMATION PROCESSING APPARATUS, AND INFORMATION PROCESSING SYSTEM INCLUDING THE SPEECH PROCESSING APPARATUS - An apparatus of this invention is a speech processing apparatus that acquires pseudo speech from a mixture sound including desired speech and noise. The speech processing apparatus includes a first microphone that inputs a first mixture sound including desired speech and noise and outputs a first mixture signal, a second microphone that is opened to the same sound space as that of said first microphone and disposed at a focus position of an interface that is part of a boundary of the sound space and has one of a quadratic surface shape and a pseudo surface shape approximating a quadratic surface, inputs a second mixture sound including the desired speech reflected by the interface and the noise reflected by the interface at a ratio different from the first mixture sound, and outputs a second mixture signal, and a noise suppression circuit that suppresses an estimated noise signal based on the first mixture signal and the second mixture signal and outputs a pseudo speech signal. | 11-07-2013 |
20130311175 | SPEECH PROCESSING APPARATUS, CONTROL METHOD THEREOF, STORAGE MEDIUM STORING CONTROL PROGRAM THEREOF, AND VEHICLE, INFORMATION PROCESSING APPARATUS, AND INFORMATION PROCESSING SYSTEM INCLUDING THE SPEECH PROCESSING APPARATUS - An apparatus of this invention is a speech processing apparatus that acquires pseudo speech from a mixture sound including desired speech and noise. The speech processing apparatus includes a first microphone that inputs a first mixture sound including desired speech and noise and outputs a first mixture signal, a second microphone that is opened to the same sound space as that of the first microphone, inputs a second mixture sound including the desired speech and the noise at a ratio different from the first mixture sound, and outputs a second mixture signal, a sound insulator that is disposed between the first microphone and the second microphone, and a noise suppression circuit that suppresses an estimated noise signal based on the first mixture signal and the second mixture signal and outputs a pseudo speech signal. With this arrangement, it is possible to, in a single sound space where desired speech and noise mix, correctly estimate the noise and reconstruct pseudo speech close to the desired speech. | 11-21-2013 |