Class / Patent application number | Description | Number of patent applications / Date published |
704210000 | Silence decision | 24 |
20100145689 | KEYSTROKE SOUND SUPPRESSION - An audio signal is received that might include keyboard noise and speech. The audio signal is digitized and transformed from a time domain to a frequency domain. The transformed audio is analyzed to determine whether there is a likelihood that keystroke noise is present. If it is determined that there is a high likelihood that the audio signal contains keystroke noise, a determination is made as to whether a keyboard event occurred around the time of the likely keystroke noise. If it is determined that a keyboard event occurred around the time of the likely keystroke noise, a determination is made as to whether speech is present in the audio signal around the time of the likely keystroke noise. If no speech is present, the keystroke noise is suppressed in the audio signal. If speech is detected in the audio signal or if the keystroke noise abates, the suppression gain is removed from the audio signal. | 06-10-2010 |
20100145690 | SOUND SIGNAL GENERATING METHOD, SOUND SIGNAL GENERATING DEVICE, AND RECORDING MEDIUM - A sound signal generating method includes: generating, using a computer, a plurality of unit waveform signals by dividing the original sound signal having a periodic length of repeating similar waveforms by the length of the waveform; generating, using a computer, a repetitive waveform signal for each of the generated unit waveform signals by repeating the waveform of the unit waveform signal a given number of times; and generating, using a computer, an output sound signal by shifting each of the repetitive waveform signals in each length with a sequence in which the unit waveform signals form the original sound signal and then superimposing on one another. | 06-10-2010 |
20100204985 | FREQUENCY AXIS WARPING FACTOR ESTIMATION APPARATUS, SYSTEM, METHOD AND PROGRAM - A warping factor estimation system comprises a label information generation unit that outputs voice/non-voice label information, a warp model storage unit in which a probability model representing voice and non-voice occurrence probabilities is stored, and a warp estimation unit that calculates a warping factor in the frequency axis direction using the probability model representing voice and non-voice occurrence probabilities, voice and non-voice labels, and a cepstrum. | 08-12-2010 |
20100268531 | Method and device for DTX decision - A DTX decision method includes: obtaining sub-band signal(s) according to an input signal; obtaining a variation of characteristic information of each of the sub-band signals; and performing DTX decision according to the variation of the characteristic information of each of the sub-band signals. With the invention, a complete and appropriate DTX decision result is obtained by making full use of the noise characteristic in the speech encoding/decoding bandwidth and using band-splitting and layered processing. As a result, the SID encoding/CNG decoding may closely follow the characteristic variation of the actual noise. | 10-21-2010 |
20100324891 | MUTE CONTROL IN AUDIO ENDPOINTS - Architecture that uses near-end speech detection and far-end energy level detection to notify a user when a local microphone and/or speaker that the user is using is muted. A voice activity detector is employed to detect the presence of near-end speech, sense the existing mute state of the near-end microphone, and then notify the user when the current microphone is muted. Separately or in combination therewith, received far-end voice signals are detected, the associated energy level computed, the existing mute state of the near-end audio speaker is sensed, and the user notified when the speaker is muted and/or at a reduced volume setting. These determinations enhance the user experience when the architecture is employed for communications sessions where participants connect via different communications modalities by automatically notifying the user of the audio device state, so that the user does not attempt to contribute only to find that a microphone or speaker was muted. | 12-23-2010 |
20110246189 | DICTATION CLIENT FEEDBACK TO FACILITATE AUDIO QUALITY - An audio quality feedback system and method is provided. The system receives audio from a client via a communication device such as a microphone. The audio quality feedback system compares the received audio to one or more parameters regarding the quality of the feedback. The parameters include, for example, clipping, periods of silence, and signal-to-noise ratios. Based on the comparison, feedback is generated to allow adjustment of the communication device or use of the communication device to improve the quality of the audio. | 10-06-2011 |
20120065966 | Voice Activity Detection Method and Apparatus, and Electronic Device - A voice activity detection method and apparatus, and an electronic device are provided. The method includes: obtaining a time domain parameter and a frequency domain parameter from an audio frame; obtaining a first distance between the time domain parameter and a long-term slip mean of the time domain parameter in a history background noise frame, and obtaining a second distance between the frequency domain parameter and a long-term slip mean of the frequency domain parameter in the history background noise frame; and judging whether the audio frame is a foreground voice frame or a background noise frame according to the first distance, the second distance and a set of decision inequalities based on the first distance and the second distance. The above technical solutions enable the judgment criterion to have an adaptive adjustment capability, thus improving the performance of the voice activity detection. | 03-15-2012 |
20120084080 | Machine for Enabling and Disabling Noise Reduction (MEDNR) Based on a Threshold - The present invention provides a novel system and method for monitoring audio signals, analyzing selected audio signal components, comparing the results of the analysis with a threshold value, and enabling or disabling the noise reduction capability of a communication device. | 04-05-2012 |
20120278068 | VOICE ACTIVITY DETECTION METHOD AND APPARATUS, AND ELECTRONIC DEVICE - A voice activity detection method and apparatus, and an electronic device are provided. The method includes: obtaining a time domain parameter and a frequency domain parameter from an audio frame; obtaining a first distance between the time domain parameter and a long-term slip mean of the time domain parameter in a history background noise frame, and obtaining a second distance between the frequency domain parameter and a long-term slip mean of the frequency domain parameter in the history background noise frame; and determining whether the audio frame is a foreground voice frame or a background noise frame according to the first distance, the second distance and a set of decision inequalities based on the first distance and the second distance. The above technical solutions enable the determination criterion to have an adaptive adjustment capability, thus improving the performance of the voice activity detection. | 11-01-2012 |
20130073281 | NON-SPEECH SECTION DETECTING METHOD AND NON-SPEECH SECTION DETECTING DEVICE - A non-speech section detecting device generating a plurality of frames having a given time length on the basis of sound data obtained by sampling sound, and detecting a non-speech section having a frame not containing voice data based on speech uttered by a person, the device including: a calculating part calculating a bias of a spectrum obtained by converting sound data of each frame into components on a frequency axis; a judging part judging whether the bias is greater than or equal to a given threshold or alternatively smaller than or equal to a given threshold; a counting part counting the number of consecutive frames judged as having a bias greater than or equal to the threshold or alternatively smaller than or equal to the threshold; a count judging part judging whether the obtained number of consecutive frames is greater than or equal to a given value. | 03-21-2013 |
20130103395 | OPERATING METHODS FOR VOICE ACTIVITY DETECTION/SILENCE SUPPRESSION SYSTEM - A Voice Activity Detection/Silence Suppression (VAD/SS) system is connected to a channel of a transmission pipe. The channel provides a pathway for the transmission of energy. A method for operating a VAD/SS system includes detecting the energy on the channel, and activating or suppressing activation of the VAD/SS system depending upon the nature of the energy detected on the channel. | 04-25-2013 |
20130144614 | Bandwidth Extender - An apparatus for extending the bandwidth of an audio signal, the apparatus being configured to: generate an excitation signal from an audio signal, wherein the audio signal comprises a plurality of frequency components; extract a feature vector from the audio signal, wherein the feature vector comprises at least one frequency domain component feature and at least one time domain component feature; determine at least one spectral shape parameter from the feature vector, wherein the at least one spectral shape parameter corresponds to a sub band signal comprising frequency components which belong to a further plurality of frequency components; and generate the sub band signal by filtering the excitation signal through a filter bank and weighting the filtered excitation signal with the at least one spectral shape parameter. | 06-06-2013 |
20130268265 | METHOD AND DEVICE FOR PROCESSING AUDIO SIGNAL - The present invention relates to a method for processing an audio signal, and the method comprises the steps of: receiving an audio signal; determining a coding mode corresponding to a current frame, by receiving network information for indicating the coding mode; encoding the current frame of said audio signal according to said coding mode; and transmitting said encoded current frame, wherein said coding mode is determined by the combination of a bandwidth and bitrate, and said bandwidth includes two or more bands among narrowband, wideband, and super wideband. | 10-10-2013 |
20130282367 | METHOD AND APPARATUS FOR PERFORMING VOICE ACTIVITY DETECTION - This application relates to a voice activity detection (VAD) apparatus configured to provide a voice activity detection decision for an input audio signal. The VAD apparatus includes a state detector and a voice activity calculator. The state detector is configured to determine, based on the input audio signal, a current working state of the VAD apparatus among at least two different working states. Each of the at least two different working states is associated with a corresponding working state parameter decision set which includes at least one voice activity decision parameter. The voice activity calculator is configured to calculate a voice activity detection parameter value for the at least one voice activity decision parameter of the working state parameter decision set associated with the current working state, and to provide the voice activity detection decision by comparing the calculated voice activity detection parameter value with a threshold. | 10-24-2013 |
20130325456 | SPEECH SPEED CONVERSION FACTOR DETERMINING DEVICE, SPEECH SPEED CONVERSION DEVICE, PROGRAM, AND STORAGE MEDIUM - A speech speed conversion factor determining device has a physical index calculation unit including a sound/silence judgment unit that distinguishes between sound and silent intervals of an input signal, a fundamental frequency calculation unit that calculates a fundamental frequency of the signal in the sound intervals and determines stable and unstable intervals, a frequency smoothing unit that smoothes the fundamental frequency in the stable intervals, a pseudo fundamental frequency calculation unit that calculates, for the intervals, a pseudo fundamental frequency by interpolation, and a fundamental frequency general shape connection unit that connects the smoothed and pseudo frequencies to obtain sampled values of a general shape of the frequency, such that the sampled values are output as an index, based on which conversion factors are calculated. | 12-05-2013 |
20140207444 | SYSTEM, DEVICE AND METHOD FOR DETECTING SPEECH - A mobile communication system comprises a mobile communication device provided with a touch screen; and a speech activity analyzer suitable to receive from said touch screen data indicative of pressure applied to an area of said touch screen, and of changes thereto with time. | 07-24-2014 |
20140337020 | Method and Apparatus for Performing Voice Activity Detection - A voice activity detection (VAD) apparatus configured to provide a voice activity detection decision for an input audio signal. The VAD apparatus includes a state detector and a voice activity calculator. The state detector is configured to determine, based on the input audio signal, a current working state of the VAD apparatus among at least two different working states. Each of the at least two different working states is associated with a corresponding working state parameter decision set which includes at least one voice activity detection parameter. The voice activity calculator is configured to calculate a voice activity detection parameter value for the at least one voice activity detection parameter of the working state parameter decision set associated with the current working state, and to provide the voice activity detection decision by comparing the calculated voice activity detection parameter value with a threshold. | 11-13-2014 |
20150051906 | Hierarchical Active Voice Detection - One or more audio signals are processed using a multi-stage (hierarchical) voice and/or signal activity detector (VAD/SAD). A first stage is capable of reducing the workload bandwidth by employing an inexpensive VAD/SAD processor. One or more subsequent stages may further process the audio signals from the first stage. Other implementations may include a first stage that also performs continuity preservation between last blocks of audio signal and the first blocks of audio after it is detected that relevant audio signals are resumed. In yet other implementations, the first stage may extract features from audio signals when they are presented in their coded domain, and possibly with little or no decoding of the audio signal. | 02-19-2015 |
20150073782 | VOICE ACTIVITY DETECTION/SILENCE SUPPRESSION SYSTEM - A Voice Activity Detection/Silence Suppression (VAD/SS) system is connected to a channel of a transmission pipe. The channel provides a pathway for the transmission of energy. A method for operating a VAD/SS system includes detecting the energy on the channel, and activating or suppressing activation of the VAD/SS system depending upon the nature of the energy detected on the channel. | 03-12-2015 |
20150088496 | MINUTES MAKING ASSISTANCE DEVICE, ELECTRONIC CONFERENCE DEVICE, ELECTRONIC CONFERENCE SYSTEM, MINUTES MAKING ASSISTANCE METHOD, AND STORAGE MEDIUM STORING MINUTES MAKING ASSISTANCE PROGRAM - A minutes making assistance device according to the present invention includes: a sound processing unit that performs processing regarding a voice and determines whether or not speaking is started; an operation processing unit that performs processing regarding an operation and determines whether or not the operation is performed; a display processing unit that performs processing regarding a display; and a control unit that stores speaking start time and warning time in a memory when the sound processing unit determines that the speaking is started, performs warning processing when the current time becomes the warning time, and terminates the processing when the operation processing unit determines that the operation is performed before the warning time. | 03-26-2015 |
20160035359 | SYSTEM AND METHOD TO REDUCE TRANSMISSION BANDWIDTH VIA IMPROVED DISCONTINUOUS TRANSMISSION - The present disclosure is directed towards a method for discontinuous transmission (“DTX”) bandwidth reduction. The method may include receiving, at a processor, a frame identified as speech and determining that the frame was mistakenly identified as speech based upon, at least in part, a voice activity detection algorithm. The method may further include labeling the frame as a silence indicator frame. | 02-04-2016 |
20160035360 | Method and Means of Encoding Background Noise Information - The invention relates to a method and means for encoding background noise information during voice signal encoding methods. A basic idea of the invention is to provide the scalability known for transmitting voice information in a similar manner when forming an SID frame. The invention provides encoding of a narrowband first component and of a broadband second component of a piece of background noise information and formation of an SID frame which describes the background noise with separate areas for the first and second components. | 02-04-2016 |
20160078885 | VOICE ACTIVITY DETECTION/SILENCE SUPPRESSION SYSTEM - A Voice Activity Detection/Silence Suppression (VAD/SS) system is connected to a channel of a transmission pipe. The channel provides a pathway for the transmission of energy. A method for operating a VAD/SS system includes detecting the energy on the channel, and activating or suppressing activation of the VAD/SS system depending upon the nature of the energy detected on the channel. | 03-17-2016 |
20180025732 | AUDIO CLASSIFIER THAT INCLUDES A FIRST PROCESSOR AND A SECOND PROCESSOR | 01-25-2018 |
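
The voice activity decision summarized in applications 20120065966 and 20120278068 above (two distances to long-term sliding means, followed by decision inequalities) might be sketched roughly as follows. This is only an illustration under stated assumptions: the abstracts do not say which time domain and frequency domain parameters are used, what form the sliding mean takes, or what the inequalities are, so the names `NoiseModel` and `is_voice_frame`, the exponential mean with factor `alpha`, and the two threshold inequalities are all hypothetical choices.

```python
from dataclasses import dataclass


@dataclass
class NoiseModel:
    """Long-term sliding means of both parameters over past background noise frames."""
    time_mean: float
    freq_mean: float
    alpha: float = 0.9  # assumed smoothing factor, not taken from the abstracts

    def update(self, time_param: float, freq_param: float) -> None:
        # Exponential sliding mean (an assumed form of the "long-term slip mean").
        self.time_mean = self.alpha * self.time_mean + (1 - self.alpha) * time_param
        self.freq_mean = self.alpha * self.freq_mean + (1 - self.alpha) * freq_param


def is_voice_frame(time_param: float, freq_param: float, model: NoiseModel,
                   time_limit: float = 1.0, freq_limit: float = 1.0) -> bool:
    """Return True for a foreground voice frame, False for a background noise frame."""
    # First distance: time domain parameter vs. its long-term mean in noise frames.
    d1 = abs(time_param - model.time_mean)
    # Second distance: frequency domain parameter vs. its long-term mean in noise frames.
    d2 = abs(freq_param - model.freq_mean)
    # Assumed decision inequalities: voice if either distance exceeds its limit.
    voice = d1 > time_limit or d2 > freq_limit
    if not voice:
        # Only noise frames feed the history, so the criterion adapts to the background.
        model.update(time_param, freq_param)
    return voice


model = NoiseModel(time_mean=1.0, freq_mean=2.0)
quiet = is_voice_frame(1.2, 2.1, model)  # close to the noise means
loud = is_voice_frame(5.0, 2.0, model)   # far from the time domain mean
```

Updating the means only on frames judged to be noise is one simple way to obtain the adaptive criterion the abstracts mention; a real detector would also need parameter extraction from the audio frame, which is omitted here.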