Patent application number | Description | Published |
20080205676 | Phase-Amplitude Matrixed Surround Decoder - A frequency domain method for phase-amplitude matrixed surround decoding of 2-channel stereo recordings and soundtracks, based on spatial analysis of 2-D or 3-D directional cues in the recording and re-synthesis of these cues for reproduction on any headphone or loudspeaker playback system. | 08-28-2008 |
20080232617 | MULTICHANNEL SURROUND FORMAT CONVERSION AND GENERALIZED UPMIX - An audio signal is processed in the frequency domain to convert an input signal format to an output signal format. That is, a multichannel audio signal intended for playback over a predefined speaker layout can be formatted to achieve spatial reproduction over a different layout comprising a different number of speakers. | 09-25-2008 |
20080247558 | Robust and Efficient Frequency-Domain Decorrelation Method - An audio signal is processed by transforming the signal into a frequency domain representation having a plurality of frequency subbands. A decorrelated signal is derived from the frequency domain representation using a phase rotation. | 10-09-2008 |
20090092258 | CORRELATION-BASED METHOD FOR AMBIENCE EXTRACTION FROM TWO-CHANNEL AUDIO SIGNALS - A method of ambience extraction includes analyzing an input signal to determine the time-dependent and frequency-dependent amount of ambience in the input signal, wherein the amount of ambience is determined based on a signal model and correlation quantities computed from the input signals and wherein the ambience is extracted using a multiplicative time-frequency mask. Another method of ambience extraction includes compensating a bias in the estimation of a short-term cross-correlation coefficient. In addition, systems having various modules for implementing the above methods are disclosed. | 04-09-2009 |
20090092259 | Phase-Amplitude 3-D Stereo Encoder and Decoder - A two-channel phase-amplitude stereo encoding and decoding scheme enabling flexible and spatially accurate interactive 3-D audio reproduction via standard audio-only two-channel transmission. The encoding scheme allows associating a 2-D or 3-D positional localization to each of a plurality of sound sources by use of frequency independent inter-channel phase and amplitude differences. The decoder is based on frequency-domain spatial analysis of 2-D or 3-D directional cues in a two-channel stereo signal and re-synthesis of these cues using any preferred spatialization technique, thereby allowing faithful reproduction of positional audio cues and reverberation or ambient cues over arbitrary multi-channel loudspeaker reproduction formats or over headphones, while preserving source separation despite the intermediate encoding over only two audio channels. | 04-09-2009 |
20090103749 | Microphone Array Processor Based on Spatial Analysis - An array processing system improves spatial selectivity by forming multiple steered beams and carrying out a spatial analysis of the acoustic scene. The analysis derives a time-frequency mask that, when applied to a reference look-direction beam (or other reference signal), enhances target sources and substantially improves rejection of interferers that are outside of the specified region. | 04-23-2009 |
20090198356 | Primary-Ambient Decomposition of Stereo Audio Signals Using a Complex Similarity Index - An audio signal is processed to derive primary and ambient components of the signal. The signal is first transformed to generate frequency-domain subband signals. Primary and ambient components are separated by comparing frequency subband content using a complex-valued similarity metric, wherein one of the primary and ambient components is determined to be the residual after the other is identified using the similarity metric. | 08-06-2009 |
20090252341 | Adaptive Primary-Ambient Decomposition of Audio Signals - A stereo audio signal is processed to determine primary and ambient components by transforming the signal into vectors corresponding to subband signals, and decomposing the left and right channel vectors into ambient and primary components by matrix and vector operations. Principal component analysis is used to determine a primary component unit vector, and ambience components are determined according to a correlation-based cross-fade or an orthogonal basis derivation. | 10-08-2009 |
20090252356 | SPATIAL AUDIO ANALYSIS AND SYNTHESIS FOR BINAURAL REPRODUCTION AND FORMAT CONVERSION - A frequency-domain method for format conversion or reproduction of 2-channel or multi-channel audio signals such as recordings is described. The reproduction is based on spatial analysis of directional cues in the input audio signal and conversion of these cues into audio output signal cues for two or more channels in the frequency domain. | 10-08-2009 |
20110142257 | Reparation of Corrupted Audio Signals - Corrupted portions of an audio signal are detected and repaired. An audio signal may be received from an audio input device. The audio signal may include numerous sequential frames. One or more corrupted frames included in the audio signal may be identified. A frame approximating an uncorrupted frame and corresponding to each corrupted frame may be constructed. Each corrupted frame may be replaced with a corresponding constructed frame to generate a repaired audio signal. The repaired audio signal may be outputted via an audio output device. | 06-16-2011 |
20120010881 | Monaural Noise Suppression Based on Computational Auditory Scene Analysis - The present technology provides a robust noise suppression system which may concurrently reduce noise and echo components in an acoustic signal while limiting the level of speech distortion. An acoustic signal may be received and transformed to cochlear-domain sub-band signals. Features such as pitch may be identified and tracked within the sub-band signals. Initial speech and noise models may then be estimated at least in part from a probability analysis based on the tracked pitch sources. Speech and noise models may be resolved from the initial speech and noise models, and noise reduction may be performed on the sub-band signals. An acoustic signal may be reconstructed from the noise-reduced sub-band signals. | 01-12-2012 |
20130231925 | Monaural Noise Suppression Based on Computational Auditory Scene Analysis - The present technology provides a robust noise suppression system that may concurrently reduce noise and echo components in an acoustic signal while limiting the level of speech distortion. A time-domain acoustic signal may be received and transformed to frequency-domain sub-band signals. Features, such as pitch, may be identified and tracked within the sub-band signals. Initial speech and noise models may then be estimated at least in part from a probability analysis based on the tracked pitch sources. Speech and noise models may be resolved from the initial speech and noise models, and noise reduction may be performed on the sub-band signals. An acoustic signal may be reconstructed from the noise-reduced sub-band signals. | 09-05-2013 |
20150025881 | SPEECH SIGNAL SEPARATION AND SYNTHESIS BASED ON AUDITORY SCENE ANALYSIS AND SPEECH MODELING - Provided are systems and methods for generating clean speech from a speech signal representing a mixture of noise and speech. The clean speech may be generated from synthetic speech parameters. The synthetic speech parameters are derived based on the speech signal components and a model of speech using auditory and speech production principles. The modeling may utilize a source-filter structure of the speech signal. One or more spectral analyses are performed on the speech signal to generate spectral representations. Feature data is derived based on a spectral representation. The features corresponding to the target speech according to a model of speech are grouped and separated from the feature data. The synthetic speech parameters, including spectral envelope, pitch data, and voice classification data, are generated based on the features corresponding to the target speech. | 01-22-2015 |
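
Several entries above (for example 20090092258) describe extracting a component of a two-channel signal by applying a multiplicative time-frequency mask derived from correlation quantities. The following is a minimal sketch of that general idea only, not the method claimed in any of the listed applications. It assumes the subband (for example STFT) coefficients are already available as lists of complex values, and the mask rule `1 - |phi|` and all function names are hypothetical choices made for illustration.

```python
def correlation_coefficient(left, right):
    """Magnitude of the normalized cross-correlation between two complex
    sequences (one subband, observed over a short window of frames)."""
    cross = sum(l * r.conjugate() for l, r in zip(left, right))
    energy_l = sum(abs(l) ** 2 for l in left)
    energy_r = sum(abs(r) ** 2 for r in right)
    if energy_l == 0.0 or energy_r == 0.0:
        return 0.0  # silent subband: treat as uncorrelated
    return abs(cross) / (energy_l * energy_r) ** 0.5


def ambience_mask(left_bins, right_bins):
    """One real-valued gain in [0, 1] per subband.

    Assumption for this sketch: highly correlated subbands are treated as
    'primary' (gain near 0) and weakly correlated ones as 'ambient'
    (gain near 1), using the illustrative rule gain = 1 - |phi|.
    """
    return [
        1.0 - min(1.0, correlation_coefficient(l, r))
        for l, r in zip(left_bins, right_bins)
    ]


def apply_mask(bins, mask):
    """Multiply every coefficient of subband k by mask[k]."""
    return [[gain * x for x in frames] for frames, gain in zip(bins, mask)]


# Two subbands, two frames each.
# Subband 0: identical in both channels (fully correlated).
# Subband 1: cross-correlation cancels to zero over the window.
left_bins = [[1 + 0j, 0.5j], [1 + 0j, 1j]]
right_bins = [[1 + 0j, 0.5j], [1 + 0j, -1j]]

mask = ambience_mask(left_bins, right_bins)
left_ambience = apply_mask(left_bins, mask)
```

In this toy input the first subband receives a gain close to 0 and the second a gain close to 1, so only the weakly correlated subband survives in `left_ambience`. A practical system would update the correlation estimate over time (for example with a running average per frame) rather than over one fixed window.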