53rd week of 2020 patent applcation highlights part 62 |
Patent application number | Title | Published |
20200410941 | DISPLAY PANEL - A display device includes a plurality of pixels disposed in an display area, and a pixel driver connected to at least two of the pixels, wherein the pixel driver drives the at least two pixels, where a portion of the pixel driver is disposed in the display area, and the display device includes the display area, on which an image is displayed, and a non-display area, on which no image is displayed. | 2020-12-31 |
20200410942 | DISPLAY DEVICE PERFORMING ADAPTIVE REFRESH - A display device includes display device includes: a display panel including a plurality of pixels; a data driver configured to generate data voltages based on output image data, and to provide the data voltages to the plurality of pixels; and a controller configured to receive input image data and input frequency information from a host processor, and to provide the output image data to the data driver, wherein the controller includes an adaptive refresh block configured to: determine a target frequency by analyzing the input image data; determine a masking ratio based on an input frequency represented by the input frequency information and the target frequency; and selectively output the input image data as the output image data by performing a masking operation on the input image data with the masking ratio. | 2020-12-31 |
20200410943 | STAGE AND SCAN DRIVER INCLUDING THE STAGE - The disclosure relates to a stage and a scan driver including the stage. The stage is connected to each of scan lines and supplies a scan signal and a sensing signal to the scan lines. The stage includes an input unit configured to control voltages of a first node and a second node based on a first control signal and a previous stage carry signal, and an output buffer including an eleventh node and a twelfth node electrically connected to the first node and the second node, respectively, in response to a second control signal, and configured to output a carry signal and the scan signal in response to a scan clock signal according to voltages of the eleventh node and the twelfth node and to output the sensing signal in response to a sensing clock signal. | 2020-12-31 |
20200410944 | SYSTEM FOR DISPLAYING INFORMATION TO A USER - The invention relates to a system for displaying information to a user, comprising: an emission device ( | 2020-12-31 |
20200410945 | METHODS FOR OBTAINING BACKLIGHT INTENSITY AND COMPENSATION VALUE, AND DISPLAY DEVICE - A method for obtaining a backlight intensity may improving data processing speed of a display device. The method includes: dividing image data into N sets of data; calculating a backlight intensity of each backlight block according to a corresponding set of data; for each group of pixels, calculating a backlight intensity corresponding to a first pixel according to a backlight intensity of each effective backlight block corresponding to the first pixel and a backlight diffusion weight of the effective backlight block corresponding to the first pixel; calculating backlight intensities corresponding to second to Mth pixels in the Tth group of pixels according to the backlight intensities corresponding to first pixels in the Tth group of pixels and a (T+1)th group of pixels; and for a Nth group of pixels, setting the backlight intensity corresponding to the first pixel as backlight intensities corresponding to second to Mth pixels. | 2020-12-31 |
20200410946 | IMAGE PROCESSING DEVICE AND LIQUID CRYSTAL PROJECTOR - An image processing device that uses pixels displayed in a liquid crystal panel to display, across a plurality of fields, pixels constituting an image specified with image data includes, a temporary determining unit configured to, based on a gradation level specified for one pixel of a liquid crystal panel and a gradation level specified for another pixel adjacent to the one pixel in one field, make temporary determination on whether to correct the gradation level of at least one of the one pixel or the other pixel, and a cancellation unit configured to cancel the temporary determination when the gradation level specified for each of the one pixel and the gradation level specified for the other pixel are identical to a gradation level in a field that precedes, by a plurality of the fields, the one field. | 2020-12-31 |
20200410947 | DISPLAY DEVICE DRIVING METHOD - A display device driving method is provided, which is applicable to a display device including pixel circuits coupled with a first node point, a source driving circuit for providing a data signal, and a reading circuit. The display device driving method includes operations: coupling the first node point with the source driving circuit or the reading circuit; supplying first control signals to the pixel circuits, where the first control signals sequentially provide a first impulse so that the pixel circuits sequentially receive the data signal from the first node point; supplying second control signals to optical sensing circuits, where the second control signals sequentially provide a second impulse so that the optical sensing circuits sequentially output a sensing signal to the first node point; amplifying the sensing signals and outputting the amplified sensing signals by the reading circuit, where durations of the first impulses the second impulses are not overlapped. | 2020-12-31 |
20200410948 | GATE DRIVER ON ARRAY (GOA) CIRCUIT AND DISPLAY PANEL - A gate driver on array (GOA) circuit display panel is disclosed and achieves control to a voltage of a leakage path by a gate electrode signal of a seventh thin film transistor in a voltage regulation module. The display panel eliminates a stop leakage path of a touch panel by changing a signal. | 2020-12-31 |
20200410949 | Display Device - A display device includes a substrate, a plurality of scan lines and a plurality of data lines. The data lines respectively have a first segment that overlaps one of the scan lines and a second segment that is located between adjacent two of the scan lines. A first segment of a first data line and a first segment of a second data line are separated by a distance Wa. A first segment of a third data line and a first segment of a fourth data line are separated by a distance Wc. A second segment of the first data line and a second segment of the second data line are separated by a distance W | 2020-12-31 |
20200410950 | Data Driver and Driving Method for Driving Display Panel - A data driver for driving a display panel includes a first driving channel coupled to a polarity inversion circuit and configured to generate a first data voltage signal having a positive polarity output to the display panel according to a plurality of first pixel data; a second driving channel coupled to the polarity inversion circuit and configured to generate a second data voltage signal having a negative polarity output to the display panel according to a plurality of second pixel data; wherein the first data voltage signal is output to first output node through the polarity inversion circuit during a first line period and the second data voltage signal is output to the first output node through the polarity inversion circuit during a second line period after the first line period, and the first line period and the second line period respectively belong to two consecutive frame periods. | 2020-12-31 |
20200410951 | DISPLAY DEVICE - A display device may include gate lines, clock lines, a gate driver, connection lines, and compensators. The gate driver may be electrically connected to the gate lines and disposed between the gate lines and the clock lines. The connection lines may be electrically connected to the clock lines and may transmit clock signals to the gate driver. The compensators may be respectively electrically connected to the connection lines. One of the clock lines may be electrically connected to one of the compensators and may be electrically connected to one of the connection lines. The one of the clock lines may be positioned between a first section of the one of the compensators and a first section of the one of the connection lines. | 2020-12-31 |
20200410952 | INFORMATION CONFIGURATION METHOD AND ELECTRONIC APPARATUS - An information configuration method includes transmitting a signal to a display of an electronic apparatus at a first transmission frequency, and, upon detecting that a current frequency band of the electronic apparatus is not in the first frequency band, switching the first transmission frequency to a second transmission frequency when the display is inactive. A parameter of the electronic apparatus associated with the first transmission frequency meets a predetermined condition when the electronic apparatus operates at a first frequency band. The parameter associated with the second transmission frequency meets the predetermined condition when the electronic apparatus operates at a second frequency band. | 2020-12-31 |
20200410953 | REDUCING LATENCY IN AUGMENTED REALITY (AR) DISPLAYS - Disclosed are systems, methods, and non-transitory computer-readable media for reducing latency in augmented reality displays. A display controller receives, from a GPU, a stream of image pixels of a frame of virtual content to be presented on a display of a display device. The stream of image pixels is received via a high-speed bulk interface that transfers data at least as fast as can be consumed by the display. As the stream of image pixel is received, the display controller converts each respective image pixel from a data format used to transmit the stream of image pixels via the high-speed bulk interface to a data format that is compatible for display by the display. Each converted image pixel is stored in a pixel cell of the display, after which the frame is presented on the display. | 2020-12-31 |
20200410954 | DISPLAY APPARATUS AND THE CONTROL METHOD THEREOF - A display apparatus, including a display; an interface configured to couple with a dongle, wherein the dongle and the display apparatus are configured to use different operating systems; a user input device; and at least one processor configured to, based on receiving a first execution image of an application from the dongle, display the first execution image on an area of the display, based on receiving a user operation at a point within the area, convert first coordinate information representing a location of the point relative to the display into second coordinate information representing the location of the point relative to the area, and transmit the second coordinate information to the dongle, and based on receiving a second execution image of the application from the dongle, display the second execution image, wherein the second execution image is generated by the dongle based on the second coordinate information. | 2020-12-31 |
20200410955 | AUTOMATICALLY ADAPT USER INTERFACE COLOR SCHEME FOR DIGITAL IMAGES AND VIDEO - A method for selecting a dominant color among a set of digital content. The dominant color is used to set the visual aspects of a user interface through which the user views the digital content, enabling a visually appealing application, rather than one reliant on neutral and possibly uncomplimentary colors. | 2020-12-31 |
20200410956 | DISPLAY DEVICE AND IMAGE PROCESSING METHOD THEREOF - A display device according to an embodiment of the disclosure comprises: a linear gamut mapping unit for deriving a linear gamut mapping result for matching a gamut of an input image signal to a target display gamut; a non-linear gamut mapping unit for deriving a non-linear gamut mapping result for matching the gamut of the input image signal to the target display gamut; and a mixing unit for generating an output image signal by mixing the linear gamut mapping result and the non-linear gamut mapping result. The disclosure may provide an optimal gamut mapping result that is intended by a user and an originator. | 2020-12-31 |
20200410957 | DISPLAY APPARATUS AND DISPLAY METHOD - A display apparatus includes: an obtainer that obtains HDR video data representing a luminance of each pixel by a code value; a converter that converts the HDR video data into HDR video using a first EOTF; a region extractor that extracts a first region including a pixel having a code value included in a first range of less than a first code value corresponding to a first point at which a slope of a tangent to the first EOTF is a predetermined slope and a second region including a pixel having a code value included in a second range of greater than or equal to the first code value; an adjuster that increases a sharpness gain of the first region relative to a sharpness gain of the second region in the HDR video data; and a display that displays the HDR video using adjusted data resulting from the adjustment. | 2020-12-31 |
20200410958 | PROJECTION BRIGHTNESS ADJUSTMENT METHOD AND PROJECTOR THEREOF - A projection brightness adjustment method includes a power source driving a projector to project an image, a temperature sensor detecting a working temperature of the projector, the projector calculating a pixel average amount corresponding to each pixel brightness level of the image by dividing a total pixel amount of the image by a total brightness level amount of the image, and the projector controlling the power source to output an overload current to the projector for image projection when determining the working temperature is less than an upper operating-temperature limit and determining a level amount of at least one pixel brightness level having a pixel amount larger than the pixel average amount is less than or equal to half of the total brightness level amount. The magnitude of the overload current is between a maximum current limit and the upper operating-current limit. | 2020-12-31 |
20200410959 | METHOD AND ELECTRONIC DEVICE FOR VERIFYING A CHARACTER TO BE DISPLAYED ON A SCREEN COMPARED TO A REFERENCE CHARACTER, ASSOCIATED COMPUTER PROGRAM PRODUCT AND GRAPHICS PROCESSOR - The invention relates to a method for verifying a character to be displayed on a screen, compared to a reference character. The character to be displayed and the reference character each comprise at least several characteristic points. The method comprises:
| 2020-12-31 |
20200410960 | INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND RECORDING MEDIUM - [Problem] An information processing device, an information processing method, and a recording medium that enable change in display of real space without being noticed by a communicatee are to be proposed. [Solution] An information processing device, including: a determining unit that determines a gazing state or a non-gazing state of a first user present in first real space, for a display object displayed by a display device of the first user, the display object being associated with second real space different from the first real space; and a display control unit that changes appearance of the display object when the gazing state has been changed to the non-gazing state. | 2020-12-31 |
20200410961 | DISPLAY SYSTEM AND DISPLAY METHOD THEREOF - A display system includes a display device and a host. The host is separately disposed on the display device and connected to the display device through a wireless or wired manner. The host transmits an image signal to the display device. The display device is configured to display a frame according to the image signal and enlarge a local portion of the frame. | 2020-12-31 |
20200410962 | INFORMATION DISPLAYING METHOD AND ELECTRONIC DEVICE THEREFOR - Disclosed is an electronic device comprising a display, a memory, and a processor operatively connected to the display and the memory. The processor may be configured to: display a partial image, corresponding to a view area to be watched, in an omnidirectional image stored in the memory, on the display; select a display attribute on the basis of the distance between the view area and an area of interest in the omnidirectional image; and display additional information associated with the area of interest, on the display on the basis of the selected display attribute. Various other embodiments found through the specification are also possible. | 2020-12-31 |
20200410963 | IMAGE DISPLAY SYSTEM, INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, PROGRAM, AND MOBILE OBJECT - An image display system according to an embodiment of the present technology includes an image display unit, an acquisition unit, and a display control unit. The image display unit is capable of displaying an image and moves in association with a movement of a user. The acquisition unit acquires movement information regarding a movement of the image display unit. The display control unit causes the image display unit to execute suppression image display for suppressing an influence of the movement of the image display unit with respect to an external space on the basis of the acquired movement information. | 2020-12-31 |
20200410964 | KEY ASSEMBLY AND KEYBOARD APPARATUS - A key assembly includes a capstan and a key. The key includes an upper surface, a hole part having a first inner side surface supporting the capstan, and an opening part arranged between the upper surface and the hole part, the opening part having a second inner side surface surrounding the capstan. | 2020-12-31 |
20200410965 | KEYBOARD APPARATUS - A keyboard apparatus includes a first key assembly, a second key assembly, and a third key assembly. The first key assembly includes a first key being slidably in contact with a first member at a first position and a second member at a second position. A first minimum distance between the first key assembly and the second key assembly at the rear ends thereof is larger than a second minimum distance between the first key assembly and the second key assembly at the second position within a range of rotation of the first key assembly. A third minimum distance between the first key assembly and the third key assembly at the rear ends thereof being larger than a fourth minimum distance between the first key assembly and the third key assembly at the second position within the range of rotation of the first key assembly. | 2020-12-31 |
20200410966 | MACHINE-CONTROL OF A DEVICE BASED ON MACHINE-DETECTED TRANSITIONS - Apparatus, methods, and systems that operate to provide interactive streaming content identification and processing are disclosed. An example apparatus includes a classifier to determine an audio characteristic value representative of an audio characteristic in audio; a transition detector to detect a transition between a first category and a second category by comparing the audio characteristic value to a threshold value among a set of threshold values, the set of threshold values corresponding to the first category and the second category; and a context manager to control a device to switch from a first fingerprinting algorithm to a second fingerprinting algorithm different than the first fingerprinting algorithm, responsive to the detected transition between the first category and the second category. | 2020-12-31 |
20200410967 | METHOD FOR DISPLAYING TRIGGERED BY AUDIO, COMPUTER APPARATUS AND STORAGE MEDIUM - This disclosure relates to a method of displaying triggered by an audio, a computer apparatus, and a storage medium. The method comprises acquiring a background audio carrying a sound effect; playing the background audio, and generating a to-be-triggered area in a display page in response to playing to the sound effect; receiving an input trigger instruction, and detecting whether the trigger instruction matches the to-be-triggered area; displaying the to-be-triggered area according to a first preset effect in response to the trigger instruction matching the to-be-triggered area. | 2020-12-31 |
20200410968 | METHOD OF COMBINING AUDIO SIGNALS - A method for automatically generating an audio signal, the method comprising receiving a source audio signal analyzing the source audio signal to identify a musical parameter characteristic thereof obtaining a supplemental audio signal based on the identified musical parameter characteristic and combining the source audio signal and the supplemental audio signal to form an extended audio signal. | 2020-12-31 |
20200410969 | SOUND ENHANCING ACCESSORY FOR A MUSICAL INSTRUMENT - An accessory for modifying sound output of a musical instrument. The body of the instrument has a soundboard. The accessory includes a sound sensor, an actuator, a fastener, and a controller. The sound sensor engages the body and senses vibration of the body representing the sound output of the musical instrument. The actuator engages the soundboard and deforms the soundboard of the musical instrument so as to modify the sound output of the musical instrument. The sound sensor is preferably arranged distally to the actuator. The fastener engages the accessory to the musical instrument, to locate the actuator against the soundboard of the musical instrument. The controller is connected to the actuator and the sound sensor for receiving and analysing the sound output sensed by the sound sensor, and controlling the actuator in dependence on the sound output sensed by the sound sensor. | 2020-12-31 |
20200410970 | More Embodiments for Common-Point Pickup Circuits in Musical Instruments Part C - This invention continues and adds to the embodiments under NPPA Ser. No. | 2020-12-31 |
20200410971 | ELECTROMAGNETIC MULTI-FUNCTION MULTI-PURPOSE CHORDOPHONE - An electromagnetic multi-function multi-purpose chordo-phone musical instrument upon which any form or style of music may be played in any position by a performer standing or seated, in a fixed location or moving throughout a performance venue. The player may initiating the vibrations of each string, any combination of strings, or all of the strings with a plectrum or the fingers of one or both hands, singly or in any combination and such vibrations will continue until their eventual natural termination unless damped and/or muted by the player using the fingers or the palm of one or both hands singly or in any combination. The shape and style of the rigid frame of the instrument is not limited by the need for frets, a fretboard, or keys. | 2020-12-31 |
20200410972 | REINFORCED SOUND-ABSORBING PANEL AND METHOD FOR THE PRODUCTION THEREOF - A sound-absorbing panel comprising a padding layer comprising heat-bonded synthetic fibers is described. The sound-absorbing panel has a first outer face and a second outer face which are spaced from each other so as to form a panel thickness between them. The sound-absorbing panel also comprises a channel within the panel thickness. The padding material between at least one of the outer faces of the panel and the channel has a density greater than the density of the padding material far from the channel so that the panel is more rigid in the region of said channel. | 2020-12-31 |
20200410973 | SOLID-STATE TRANSDUCER, SYSTEM, AND METHOD - The present disclosure includes solid-state transducers, a system, and a method. In one embodiment, a solid-state transducer includes a housing, a first end portion, a second end portion, a plurality of electrical conductors, and a thin-film resistive material. The thin-film resistive material is disposed between and in electrical communication with a plurality of electrical conductors. The thin-film resistive material is configured to receive one or more electrical signals from the plurality of electrical conductors, and generate thermal oscillations to create pressure waves in a medium in response to receiving the one or more electrical signals. | 2020-12-31 |
20200410974 | CAR CHARGING DEVICE WITH NOISE REDUCTION FUNCTION - A car charging device includes a main body, a charging interface, an input terminal, a conversion circuit, a microphone, a loudspeaker, and a noise reduction circuit. A main body includes an insertion portion and the top. The charging interface includes a charging terminal. The input terminal is located at the insertion portion. The conversion circuit is electrically connected to the input terminal and the charging terminal and configured to convert electric power. The microphone is located on the top and configured to convert a sound into an input audio. The loudspeaker is located on the top. The noise reduction circuit is configured to generate a noise reduction audio according to the input audio, and drive, by using the noise reduction audio, the loudspeaker to emit a noise reduction sound, where the noise reduction audio is an inversion of a regular audio continuing for a preset time in the input audio. | 2020-12-31 |
20200410975 | AUDIO SYNTHESIS METHOD, COMPUTER APPARATUS, AND STORAGE MEDIUM - The present disclosure relates to an audio synthesis method, a computer apparatus and storage medium for synthesizing the audio. The method includes: obtaining an original audio; identifying a rhythm point in the original audio, and labeling an audio effect area in the original audio according to the rhythm point; obtaining an audio effect audio corresponding to the audio effect area, and synthesizing an audio effect of the audio effect audio into the audio effect area of the original audio to obtain a synthesized audio. | 2020-12-31 |
20200410976 | SPEECH STYLE TRANSFER - Computer-implemented methods for speech synthesis are provided. A speech synthesizer may be trained to generate synthesized audio data that corresponds to words uttered by a source speaker according to speech characteristics of a target speaker. The speech synthesizer may be trained by time-stamped phoneme sequences, pitch contour data and speaker identification data. The speech synthesizer may include a voice modeling neural network and a conditioning neural network. | 2020-12-31 |
20200410977 | VOICE GENERATION BASED ON CHARACTERISTICS OF AN AVATAR - Methods and systems for generating voices based on characteristics of an avatar. One or more characteristics of an avatar are obtained and one or more parameters of a voice synthesizer for generating a voice corresponding to the one or more avatar characteristics are determined. The voice synthesizer is configured based on the one or more parameters and a voice is generated using the parameterized voice synthesizer. | 2020-12-31 |
20200410978 | COGNITIVE MODIFICATION OF VERBAL COMMUNICATIONS FROM AN INTERACTIVE COMPUTING DEVICE - A method includes: determining, by a computer device, a current context associated with a user that is the target audience of an unprompted verbal output of an interactive computing device; determining, by the computer device, one or more parameters that are most effective in getting the attention of the user for the determined current context; and modifying, by the computer device, the unprompted verbal output of the interactive computing device using the determined one or more parameters. | 2020-12-31 |
20200410979 | METHOD, DEVICE, AND COMPUTER-READABLE STORAGE MEDIUM FOR SPEECH SYNTHESIS IN PARALLEL - The disclosure provides a method, an apparatus, a device, and a computer-readable storage medium for speech synthesis in parallel. The method includes: splitting a piece of text into a plurality of segments; based on the piece of text, obtaining a plurality of initial hidden states of the plurality of segments for a recurrent neural network. The method further includes: synthesizing the plurality of segments in parallel based on the plurality of initial hidden states and input features of the plurality of segments. | 2020-12-31 |
20200410980 | INTERACTIVE ELECTRONIC APPARATUS, COMMUNICATION SYSTEM, METHOD, AND PROGRAM - An interactive electronic apparatus includes a controller. The controller is configured to acquire a privacy level corresponding to a person located in the vicinity of the interactive electronic apparatus. The controller performs a content modification operation for modifying the content to be verbally output from a speaker, based on the privacy level. The interactive electronic apparatus may be configured as a mobile terminal. The controller may perform the content modification operation when the interactive electronic apparatus is mounted on a charging stand. | 2020-12-31 |
20200410981 | TEXT-TO-SPEECH (TTS) PROCESSING - A speech model is trained using multi-task learning. A first task may correspond to how well predicted audio matches training audio; a second task may correspond to a metric of perceived audio quality. The speech model may include, during training, layers related to the second task that are discarded at runtime. | 2020-12-31 |
20200410982 | INFORMATION PROCESSING APPARATUS AND INFORMATION PROCESSING METHOD AND COMPUTER-READABLE STORAGE MEDIUM - An information processing apparatus and an information processing method as well as a computer readable storage medium are provided. The information processing apparatus includes a processing circuitry configured to: select, from a sound, sound elements which are related to scene features during making of the sound; establish a correspondence relationship including a first correspondence relationship between the scene features and the sound elements and between the respective sound elements, and store the scene features and the sound elements as well as the correspondence relationship in association in a correspondence relationship library; and generate, based on a reproduction scene feature and the correspondence relationship library, a sound to be reproduced. | 2020-12-31 |
20200410983 | SYSTEM AND METHOD FOR A LANGUAGE UNDERSTANDING CONVERSATIONAL SYSTEM - A virtual assistant device recognizes multiple wake-up phrases. In response to a particular wake-up phrase the device sends speech audio to either a default or a third party virtual assistant server. A virtual assistant server can receive speech audio and an indication of which of multiple wake-up phrases was used and, accordingly, send the speech audio, or text recognized from the speech audio using automatic speech recognition, to a third party server. A response from the third party server can be voice audio or text for the virtual assistant server to synthesize distinctively corresponding to the wake-up phrase. | 2020-12-31 |
20200410984 | EMERGENCY SERVICE REQUEST SYSTEMS AND METHODS - An emergency service request system that allows a user to effectively and/or efficiently provide information regarding an emergency situation to an emergency response center. The system presents a series of prompts to a user based on the user's preferred language, with each prompt having one or more prepopulated responses that are selectable by the user in response to the prompt. The user's responses to the prompts are prepared and formatted into a message that is transmitted to an emergency response center. The message contains the user-provided information regarding the emergency situation and the information is provided in a preferred language of the emergency response center. | 2020-12-31 |
20200410985 | METHOD, APPARATUS, AND STORAGE MEDIUM FOR SEGMENTING SENTENCES FOR SPEECH RECOGNITION - The present disclosure describes a method, apparatus, and storage medium for performing speech recognition. The method includes acquiring, by an apparatus, first to-be-processed speech information. The apparatus includes a memory storing instructions and a processor in communication with the memory. The method includes acquiring, by the apparatus, a first pause duration according to the first to-be-processed speech information; and in response to the first pause duration being greater than or equal to a first threshold, performing, by the apparatus, speech recognition on the first to-be-processed speech information to obtain a first result of sentence segmentation of speech, the first result of sentence segmentation of speech being text information, the first threshold being determined according to speech information corresponding to a previous moment. | 2020-12-31 |
20200410986 | SYSTEM AND METHOD FOR AUTOMATING NATURAL LANGUAGE UNDERSTANDING (NLU) IN SKILL DEVELOPMENT - A method includes receiving, from an electronic device, information defining a user utterance associated with a skill to be performed, where the skill is not recognized by a natural language understanding (NLU) engine. The method also includes receiving, from the electronic device, information defining one or more actions for performing the skill. The method further includes identifying, using at least one processor, one or more known skills having one or more slots that map to at least one word or phrase in the user utterance. The method also includes creating, using the at least one processor, a plurality of additional utterances based on the one or more mapped slots. In addition, the method includes training, using the at least one processor, the NLU engine using the plurality of additional utterances. | 2020-12-31 |
20200410987 | INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, PROGRAM, AND INFORMATION PROCESSING SYSTEM - An information processing device includes an input unit to which a predetermined voice is input, and a determination unit that determines whether or not a voice input after a voice including a predetermined word is input is intended to operate a device. | 2020-12-31 |
20200410988 | INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING SYSTEM, AND INFORMATION PROCESSING METHOD, AND PROGRAM - A device and method that enable the execution of processing requested by a user based on a natural user's speech without using an unnatural default speech start keyword are provided. The present disclosure has a keyword analysis unit that assesses whether or not a user's speech is a speech start keyword, and the keyword analysis unit has a user registration speech start keyword processing unit that assesses whether or not the user's speech is a user registration speech start keyword registered by the user in advance. The user registration speech start keyword processing unit assesses that the user's speech is the user registration speech start keyword, in a case where the user's speech is similar to a pre-registered keyword, and a pre-registered registration condition, such as an application being executed, or an input time or input timing of the user's speech, satisfies a registration condition. | 2020-12-31 |
20200410989 | SYSTEM AND METHOD FOR NATURAL LANGUAGE UNDERSTANDING - An electronic device for natural language understanding includes at least one memory and at least one processor coupled to the at least one memory. The at least one processor is configured to process an utterance using a trained model. The at least one processor is also configured to replace a first portion of the utterance with a first token, where the first token represents a semantic role of the first portion of the utterance based on a slot vocabulary. The at least one processor is further configured to determine a slot value in the utterance based on the first token. In addition, the at least one processor is configured to perform a task corresponding to the utterance based on the determined slot value. | 2020-12-31 |
20200410990 | Digital Media Environment for Conversational Image Editing and Enhancement - Conversational image editing and enhancement techniques are described. For example, an indication of a digital image is received from a user. Aesthetic attribute scores for multiple aesthetic attributes of the image are generated. A computing device then conducts a natural language conversation with the user to edit the digital image. The computing device receives inputs from the user to refine the digital image as the natural language conversation progresses. The computing device generates natural language suggestions to edit the digital image based on the aesthetic attribute scores as part of the natural language conversation. The computing device provides feedback to the user that includes edits to the digital image based on the series of inputs. The computing device also includes as feedback natural language outputs indicating options for additional edits to the digital image based on the series of inputs and the previous edits to the digital image. | 2020-12-31 |
20200410991 | SYSTEM AND METHOD FOR PREDICTIVE SPEECH TO TEXT - A method, computer program product, and computer system for receiving, by a computing device, speech from a user. A next word following a current word in the speech from the user may be predicted. The next word that is predicted following the current word recognized in the speech from the user may be presented to the user in real time. Feedback from the user may be received whether to one of accept and reject the next word that is predicted. The speech from the user may be processed to convert the speech to text, wherein the text may include the next word when the feedback from the user is to accept the next word that is predicted and wherein the text may exclude the next word when the feedback from the user is to reject the next word that is predicted. | 2020-12-31 |
20200410992 | DEVICE FOR RECOGNIZING SPEECH INPUT FROM USER AND OPERATING METHOD THEREOF - Provided are a device for recognizing a speech input including a named entity from a user and an operating method thereof. The device is configured to: generate a weighted finite state transducer model by using a vocabulary list including a plurality of named entities; obtain a first string from a speech input received from a user, by using a first decoding model; obtain a second string by using a second decoding model that uses the weighted finite state transducer model, the second string including a word sequence, which corresponds to at least one named entity, and an unrecognized word sequence not identified as a named entity; and output a text corresponding to the speech input by substituting the unrecognized word sequence of the second string with a word sequence included in the first string. | 2020-12-31 |
20200410993 | PRE-PROCESSING FOR AUTOMATIC SPEECH RECOGNITION - A method is provided that includes obtaining two or more microphone audio signals; analysing the two or more microphone audio signals for a defined noise type; and processing the two or more microphone audio signals based on the analysis to generate at least one audio signal suitable for automatic speech recognition. A corresponding apparatus is also provided. | 2020-12-31 |
20200410994 | VOICE-BASED TRANSACTION PROCESSING WITH LOCATION-BASED ACTIVATION - A natural-language voice chatbot is initiated and a voice session is established between the chatbot and a customer while the customer is operating a vehicle device within a vehicle. A pre-staged order is taken from a customer during the session and the session is suspended until the customer arrives at a store associated with the pre-staged order. A location-based trigger is raised when the customer is detected as being present at a transaction terminal of a store; the session is resumed on the transaction terminal and/or the vehicle device. The pre-stage order is confirmed during the resumed session and payment is obtained from the customer for the order when payment was not already obtained from the customer. The order is sent to a fulfillment station and, in an embodiment, the items associated with the order are delivered to the customer while the customer remains at the terminal. | 2020-12-31 |
20200410995 | SYSTEMS AND METHODS FOR DISAMBIGUATING A VOICE SEARCH QUERY BASED ON GESTURES - Systems and methods are described herein for disambiguating a voice search query by determining whether the user made a gesture while speaking a quotation from a content item and whether the user mimicked or approximated a gesture made by a character in the content item when the character spoke the words quoted by the user. If so, a search result comprising an identifier of the content item is generated. A search result representing the content item from which the quotation comes may be ranked highest among other search results returned and therefore presented first in a list of search results. If the user did not mimic or approximate a gesture made by a character in the content item when the quotation is spoken in the content item, then a search result may not be generated for the content item or may be ranked lowest among other search results. | 2020-12-31 |
20200410996 | VOICE ASSISTANT-ENABLED WEB APPLICATION OR WEB PAGE - Various embodiments discussed herein enable applications to seamlessly contribute to executing voice commands of users via voice assistant functionality. In response to receiving a user request to open an application or web page, the application can request and responsively receive a voice assistant runtime component along with the application or web page. The application, using a particular universal application interface component can compile or interpret the voice assistant runtime component from a source code format to an intermediate code format. In response to the application or web page being rendered and the detection of a key word or phrase, the application can activate voice assistant command execution functionality. The user can issue a voice command after which the application along with specific services can help execute the voice command. | 2020-12-31 |
20200410997 | ISSUE TRACKING SYSTEM HAVING A VOICE INTERFACE SYSTEM FOR FACILITATING A LIVE MEETING DIRECTING STATUS UPDATES AND MODIFYING ISSUE RECORDS - An issue tracking system configured to track issues, tickets, or tasks is described herein. The issue tracking system may include a voice interface system that may be used to create, modify, and delete issue records during a live meeting or event. The voice interface system may be configured to facilitate a live meeting conducted in a particular format or structure. The voice interface system may be adapted to determine a relevance score between a voice input and one or more respective issue records being tracked by the issue tracking system. If the relevancy score satisfies a threshold, a respective issue record may be selected for modification or editing during the live meeting through a series of responsive voice commands or other voice input. | 2020-12-31 |
20200410998 | VOICE INTERFACE SYSTEM FOR FACILITATING ANONYMIZED TEAM FEEDBACK FOR A TEAM HEALTH MONITOR - A team health monitor system having a voice interface system for monitoring and improving team dynamics is described herein. The systems and techniques are directed to a voice interface system that is configured to conduct a health monitor or health diagnostic meeting in which a graduated score is received for a set of key team attributes from each of the meeting participants. The voice interface system also collects score narratives and pairs narratives with associated team attributes by determining a relevance percentage or similar criteria. The voice interface system is also configured to facilitate a consensus vote for each of the team attributes and construct an anonymized report that includes consensus scoring and composite narratives without attributing content to a particular team member or participant. | 2020-12-31 |
20200410999 | VOICE PROCESSING METHOD AND APPARATUS - Provided are a voice processing method and an apparatus, the method including: acquiring, during playback of a content of a first type, a first voice inputted by a user, where the first voice instructs a terminal to switch a played content to a content of a second type; and where the terminal plays a content of a predefined type before playing the content of the first type; playing a first reply voice according to the first voice, prompting the user to determine whether to continue to play the content of the second type after the content of the predefined type during a predefined period; and continuing to play a content of a target type after the content of the predefined type during the predefined period, where the target type is related to the user's feedback on the first reply voice, thus improving a reliability for the terminal. | 2020-12-31 |
20200411000 | OPTIMIZATION METHOD, APPARATUS, DEVICE FOR WAKE-UP MODEL, AND STORAGE MEDIUM - Provided are an optimization method, apparatus, device for a wake-up model and a storage medium, which allow for: acquiring a training set and a verification set; performing an iterative training on the wake-up model according to the training set and the verification set; during the iterative training, periodically updating the training set and the verification set according to the wake-up model and a preset corpus database, and continuing performing the iterative training on the wake-up model according to the updated training set and verification set; and outputting the wake-up model when a preset termination condition is reached. The embodiments of the present disclosure, by periodically updating the training set and the verification set according to the wake-up model and the preset corpus database during an iteration, may improve optimization efficiency and effects of the wake-up model, thereby improving stability and adaptability of the wake-up model and avoiding overfitting. | 2020-12-31 |
20200411001 | METHOD FOR CONTROLLING THE OPERATION OF AN APPLIANCE BY A USER THROUGH VOICE CONTROL - A method for controlling operation of an appliance by a user through voice control includes at least the steps of: detecting, by the appliance, a control action performed by the user on the appliance; activating a voice control system by the appliance; capturing, by the voice control system, a voice input from the user as a captured voice input; recognizing, by the voice control system, a piece of information and/or an instruction in the captured voice input from the user as a recognized information and/or instruction; and executing, by the voice control system, a user control action on the appliance in accordance with the recognized information and/or instruction. | 2020-12-31 |
20200411002 | ELECTRONIC APPARATUS AND CONTROL METHOD THEREOF - An electronic apparatus is provided. The electronic apparatus includes: a memory configured to store at least one instruction; and a processor configured to execute the at least one instruction to: obtain usage information on an application installed in the electronic apparatus, obtain a natural language understanding model, among a plurality of natural language understanding models, corresponding to the application based on the usage information, perform natural language understanding of a user voice input related to the application based on the natural language understanding model corresponding to the application, and perform an operation of the application based on the preformed natural language understanding. | 2020-12-31 |
20200411003 | Smart Speaker System with Cognitive Sound Analysis and Response - Smart speaker system mechanisms, associated with a smart speaker device comprising an audio capture device, are provided for processing audio sample data captured by the audio capture device. The mechanisms receive, from the audio capture device of the smart speaker device, an audio sample captured from a monitored environment. The mechanisms classify a sound in the audio sample data as a type of sound based on performing a joint analysis of a plurality of different characteristics of the sound and matching results of the joint analysis to criteria specified in a plurality of sound models. The mechanisms determine, based on the classification of the sound, whether a responsive action is to be performed based on the classification of the sound. In response to determining that a responsive action is to be performed, the mechanisms initiate performance of the responsive action by the smart speaker system. | 2020-12-31 |
20200411004 | CONTENT INPUT METHOD AND APPARATUS - A content input method and a content input device are provided. The method includes the following steps. In a case where a display event of an input box is detected, the input box and a speech input control corresponding to the input box is displayed in response to the display event so that the user can directly perform a speech input operation on the first speech input control. Then, speech data inputted by the user is received in response to the speech input operation and the speech data inputted by the user is converted into display content displayable in a first input box, and the display content is displayed in the first input box. | 2020-12-31 |
20200411005 | VEHICLE FUNCTION CONTROL WITH SENSOR BASED VALIDATION - The present disclosure is generally related to a data processing system to validate vehicular functions in a voice activated computer network environment. The data processing system can improve the efficiency of the network by discarding action data structures and requests that invalid prior to their transmission across the network. The system can invalidate requests by comparing attributes of a vehicular state to attributes of a request state. | 2020-12-31 |
20200411006 | TRANSIT VOICE ASSISTANT - Transit voice assistant is a conversational voice-based assistant, accessible 24 hours a day and 7 days a week. It responds to user's request for real-time transit system information as well as transit alerts. Just say where you want to go to and transit voice assistant will make it happen. Transit voice assistant provides a unique experience for the customer by enabling the user to interact in a more intuitive way using only their voice. Transit voice assistant responds to the way users speak and think, without requiring users to type on a keyboard or screen. Transit voice assistant brings customers new levels of ease and convenience through voice technology, including natural language understanding and automatic speech recognition. The transit voice assistant is constantly learning and improves as more data is collected. Reach and delight more customers, where they are, through millions of voice powered devices. | 2020-12-31 |
20200411007 | TRANSCRIPTION OF COMMUNICATIONS - A method may include obtaining audio data originating at a first device during a communication session between the first device and a second device and providing the audio data to a first speech recognition system to generate a first transcript based on the audio data and directing the first transcript to the second device. The method may also include in response to obtaining a quality indication regarding a quality of the first transcript, multiplexing the audio data to provide the audio data to a second speech recognition system to generate a second transcript based on the audio data while continuing to provide the audio data to the first speech recognition system and direct the first transcript to the second device, and in response to obtaining a transfer indication that occurs after multiplexing of the audio data, directing the second transcript to the second device instead of the first transcript. | 2020-12-31 |
20200411008 | VOICE CONTROL METHOD AND DEVICE - A voice control method and a voice control device are provided. The method includes: receiving voice data in response to a trigger operation for an interaction interface, the trigger operation being an operation that triggers voice control and that is recognized by a client on the interaction interface; converting the voice data into text data; generating a control instruction based on the text data; and executing the control instruction. | 2020-12-31 |
20200411009 | ASYNCHRONOUS PROCESSING OF USER REQUESTS - Methods, systems, and apparatus, including computer programs stored on a computer-readable storage medium, for asynchronous execution of client requests. In some implementations, data indicating a user request to a digital assistant is received. An action corresponding to the user request is determined. It is determined that the action is classified as an action to be performed asynchronously to the user request. A confirmation message is sent, for output, and the action is performed asynchronously to the user request. | 2020-12-31 |
20200411010 | GRAPH-BASED APPROACH FOR VOICE AUTHENTICATION - Methods for voice authentication include receiving a plurality of mono telephonic interactions between customers and agents; creating a mapping of the plurality of mono telephonic interactions that illustrates which agent interacted with which customer in each of the interactions; determining how many agents each customer interacted with; identifying one or more customers an agent has interacted with that have the fewest interactions with other agents; and selecting a predetermined number of interactions of the agent with each of the identified customers. In some embodiments, the methods further include creating a voice print from first and second speaker components of each interaction; comparing the voice prints of a first selected interaction to the voice prints from a second selected interaction; calculating a similarity score between the voice prints; aggregating scores; and identifying the voice prints that are associated with the agent. | 2020-12-31 |
20200411011 | ELECTRONIC DEVICE, CONTROL METHOD THEREOF, AND COMPUTER READABLE RECORDING MEDIUM - An electronic device includes a communication interface that receives voice data and fingerprint data; and a processor that determines an access right to the electronic device based on at least one of a voice score obtained by comparing the received voice data with stored voice data and a fingerprint score obtained by comparing the received fingerprint data with stored fingerprint data. | 2020-12-31 |
20200411012 | SPEECH RECOGNITION DEVICE, SPEECH RECOGNITION SYSTEM, AND SPEECH RECOGNITION METHOD - A speech recognition device includes: a speech recognition unit for executing speech recognition on a spoken sound that is made for an operational input by a speaking person among multiple on-board persons seated on speech recognition target seats in a vehicle; a speaking person identification unit for executing at least one of personal identification processing of identifying the speaking person, and seat identification processing of identifying the seat on which the speaking person is seated; and a response mode setting unit for executing response mode setting processing of setting a mode for a response to the speaking person, according to a result identified by the speaking person identification unit; the response mode setting processing is processing in which the mode for the response is set as a mode that allows each of the multiple on-board persons to determine whether to be subjected to the response. | 2020-12-31 |
20200411013 | CALLER IDENTIFICATION IN A SECURE ENVIRONMENT USING VOICE BIOMETRICS - A method for passive enrollment and identification of one or more speakers in an audio file includes automatically converting audio data to a format suitable for biometric processing, separating different channels present in the converted audio data separating speakers in the converted audio data, generating audio files specific to individual speakers in the converted audio data, iteratively grouping the audio files of individual speakers according to a predetermined matching criteria, creating biometric voice prints from the groups of audio files, and authenticating speakers in the biometric voice prints by comparing the biometric voice prints to entries in a biometric voice print database. | 2020-12-31 |
20200411014 | USER AUTHENTICATION WITH AUDIO REPLY - Various implementations include approaches for authenticating user identity with audio-based verification. Certain approaches include: receiving a request to authenticate a user of an audio device; prompting the user of the audio device to speak a verification word or phrase in response to receiving the request; detecting an acoustic response at the audio device or a connected smart device; comparing the detected acoustic response with an acoustic signature of a known user associated with the audio device and the verification word or phrase, wherein the audio device is registered as an authentication device prior to receiving the request to authenticate the user of the audio device; and sending a confirmation response indicating the user of the audio device is the known user in response to the acoustic response corresponding with the acoustic signature and the verification word or phrase. | 2020-12-31 |
20200411015 | DEVICE FOR RECOGNIZING VOICE CONTENT, SERVER CONNECTED THERETO, AND METHOD FOR RECOGNIZING VOICE CONTENT - An artificial intelligence (AI) device, such as a robot, comprises: an output interface to output content in response to a request of a user; a camera to acquire an image of the user; a microphone to acquire a voice signal including a voice content uttered by the user; a processor to determine a characteristic of the user based on the content, the image, and/or the voice signal, and recognize the voice content through a voice recognition mode corresponding to the determined characteristic. The AI device may include a communication interface to forward the voice signal to a remote computer that identifies the characteristic and recognizes the voice content based on the characteristic. According to an embodiment, when an irregular voice is recognized from the acquired voice signal, the processor may recognize a regular voice corresponding to the irregular voice using an artificial intelligence-based learning model. | 2020-12-31 |
20200411016 | ENCODING APPARATUS, DECODING APPARATUS, FRICATIVE SOUND JUDGMENT APPARATUS, AND METHODS AND PROGRAMS THEREFOR - An encoding apparatus comprising an encoding part | 2020-12-31 |
20200411017 | AUDIO ENCODER AND DECODER - The present disclosure provides methods, devices and computer program products for encoding and decoding of a vector of parameters in an audio coding system. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system. According to the disclosure, a modulo differential approach for coding and encoding a vector of a non-periodic quantity may improve the coding efficiency and provide encoders and decoders with less memory requirements. Moreover, an efficient method for encoding and decoding a sparse matrix is provided. | 2020-12-31 |
20200411018 | HIERARCHICAL ENCODER FOR SPEECH CONVERSION SYSTEM - A speech conversion system is described that includes a hierarchical encoder and a decoder. The system may comprise a processor and memory storing instructions executable by the processor. The instructions may comprise to: using a second recurrent neural network (RNN) (GRU1) and a first set of encoder vectors derived from a spectrogram as input to the second RNN, determine a second concatenated sequence; determine a second set of encoder vectors by doubling a stack height and halving a length of the second concatenated sequence; using the second set of encoder vectors, determine a third set of encoder vectors; and decode the third set of encoder vectors using an attention block. | 2020-12-31 |
20200411019 | BACKWARD-COMPATIBLE INTEGRATION OF HIGH FREQUENCY RECONSTRUCTION TECHNIQUES FOR AUDIO SIGNALS - A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. | 2020-12-31 |
20200411020 | SPATIAL SOUND REPRODUCTION USING MULTICHANNEL LOUDSPEAKER SYSTEMS - An apparatus for spatial audio signal decoding associated with a plurality of speaker nodes ( | 2020-12-31 |
20200411021 | INFORMATION PROCESSING APPARATUS AND INFORMATION PROCESSING METHOD - The present disclosure relates to an information processing apparatus and an information processing method for transmitting audio data of higher quality. Given a file in a predetermined file format for storing encoded data derived from audio data, the encoded data being in groups of a predetermined number of blocks, a sample is set to the file, the sample being a minimum access unit in the file and including initialization information for decoding each of the groups of the blocks. The present disclosure may be applied to image processing apparatuses, image encoding apparatuses, or image decoding apparatuses, for example. | 2020-12-31 |
20200411022 | APPARATUS AND METHOD FOR ENCODING AND DECODING OF INTEGRATED SPEECH AND AUDIO - Provided are an apparatus and a method for integrally encoding and decoding a speech signal and a audio signal. The encoding apparatus may include: an input signal analyzer to analyze a characteristic of an input signal; a first conversion encoder to convert the input signal to a frequency domain signal, and to encode the input signal when the input signal is a audio characteristic signal; a Linear Predictive Coding (LPC) encoder to perform LPC encoding of the input signal when the input signal is a speech characteristic signal; and a bitstream generator to generate a bitstream using an output. | 2020-12-31 |
20200411023 | METHOD AND APPARATUS FOR IDENTIFYING TYPE OF VOCODER - In accordance with an aspect of the present disclosure, there is provided a method for identifying a type of a vocoder. The method comprises acquiring identification target bitstreams encoded with a voice signal, acquiring, for each of a plurality of vocoders, a probability that each of the plurality of vocoders is related to the identification target bitstreams from the identification target bitstreams, acquiring waveforms for each decoder of each of the plurality of vocoders by inputting the identification target bitstreams to the each decoder of each of the plurality of vocoders, acquiring intelligibility values for each of the waveforms obtained for the each decoder of each of the plurality of vocoders from the waveforms, and determining the type of the vocoder related to the voice signal from the probability and the intelligibility values for each waveform. | 2020-12-31 |
20200411024 | DECODING AUDIO BITSTREAMS WITH ENHANCED SPECTRAL BAND REPLICATION METADATA IN AT LEAST ONE FILL ELEMENT - Embodiments relate to an audio processing unit that includes a buffer, bitstream payload deformatter, and a decoding subsystem. The buffer stores at least one block of an encoded audio bitstream. The block includes a fill element that begins with an identifier followed by fill data. The fill data includes at least one flag identifying whether enhanced spectral band replication (eSBR) processing is to be performed on audio content of the block. A corresponding method for decoding an encoded audio bitstream is also provided. | 2020-12-31 |
20200411025 | METHOD, DEVICE, AND SYSTEM FOR AUDIO DATA PROCESSING - A method and apparatus that filters audio data received from a speaking person that includes a specific filter for that speaker. The audio characteristics of the speaker's voice may be collected and the specific filter may be formed to reduce noise while also enhancing voice quality. For instance, if a speaker's voice does not contain specific frequencies, then a filter may cancel the noise at such frequencies to ease noise cancellation and reduce processing sound spectrum for cleaning that is not needed. Additionally, the strength frequencies of a speaker's voice may be identified from the collected audio characteristics and those spectrums can be filtered with finer granularity to provide a speaker specific filter that enhances the voice quality of the speaker's voice data that is transmitted or output by a communication device. The audio data may also be output based upon a user's predefined hearing spectrum. | 2020-12-31 |
20200411026 | DYNAMIC BEAMFORMING TO IMPROVE SIGNAL-TO-NOISE RATIO OF SIGNALS CAPTURED USING A HEAD-WEARABLE APPARATUS - Method to perform dynamic beamforming to reduce SNR in signals captured by head-wearable apparatus starts with microphones generating acoustic signals. Microphones are coupled to first stem of the apparatus and to second stem of the apparatus. First and second beamformers generate first and second beamformer signals, respectively. Noise suppressor attenuates noise content from the first beamformer signal and the second beamformer signal. Noise content from first beamformer signal are acoustic signals not collocated in second beamformer signal and noise content from second beamformer signal are acoustic signals not collocated in first beamformer signal. Speech enhancer generates clean signal comprising speech content from first noise-suppressed signal and second noise-suppressed signal. Speech content are acoustic signals collocated in first beamformer signal and second beamformer signal. | 2020-12-31 |
20200411027 | SIGNAL ANALYSIS DEVICE, SIGNAL ANALYSIS METHOD, AND SIGNAL ANALYSIS PROGRAM - A signal analysis device includes an estimation unit that models a sound source position occurrence probability matrix Q using a product of a sound source position probability matrix B and a sound source existence probability matrix A, and estimates at least one of the sound source position probability matrix B and the sound source existence probability matrix A based on the modeling, the sound source position occurrence probability matrix Q being composed of probabilities of arrival of a signal from each sound source position candidate per frame, which is a time section, with respect to a plurality of sound source position candidates. The sound source position probability matrix B being composed of probabilities of arrival of a signal from each sound source position candidate per sound source with respect to a plurality of sound sources. | 2020-12-31 |
20200411028 | SIGNAL PROCESSING APPARATUS - A signal processing apparatus includes a generator, an output controller, and an echo canceller. The generator generates an output sound signal by combining an over-the-phone sound signal with a system sound signal different from the over-the-phone sound signal. The output controller outputs, to a loudspeaker, the output sound signal generated by the generator. The echo canceller cancels the output sound signal from an input sound signal input via a microphone located in a vicinity of the loudspeaker. The output controller suppresses a level of the system sound signal to be output from the loudspeaker so as not to be greater than a predetermined value within a range in which a volume value for the over-the-phone sound signal is settable. | 2020-12-31 |
20200411029 | METHOD AND DEVICE FOR UPDATING COEFFICIENT VECTOR OF FINITE IMPULSE RESPONSE FILTER - A method and a device for updating a coefficient vector of a finite impulse response filter are provided. The update method includes: obtaining an updated step-size diagonal matrix for a coefficient vector of the FIR filter; and obtaining an updated coefficient vector of the FIR filter based on the updated step-size diagonal matrix. | 2020-12-31 |
20200411030 | SYSTEM AND METHOD FOR ACOUSTIC ECHO CANCELATION USING DEEP MULTITASK RECURRENT NEURAL NETWORKS - A method for performing echo cancellation includes: receiving a far-end signal from a far-end device at a near-end device; recording a microphone signal at the near-end device including: a near-end signal; and an echo signal corresponding to the far-end signal; extracting far-end features from the far-end signal; extracting microphone features from the microphone signal; computing estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including: an echo estimator including a first stack of a recurrent neural network configured to compute estimated echo features based on the far-end features; and a near-end estimator including a second stack of the recurrent neural network configured to compute the estimated near-end features based on an output of the first stack and the microphone signal; computing an estimated near-end signal from the estimated near-end features; and transmitting the estimated near-end signal to the far-end device. | 2020-12-31 |
20200411031 | SIGNAL ANALYSIS DEVICE, SIGNAL ANALYSIS METHOD, AND RECORDING MEDIUM - A signal analysis device includes a memory and processing circuitry coupled to the memory and configured to obtain, for a spatial covariance matrix R | 2020-12-31 |
20200411032 | GENERATING AUDIO USING NEURAL NETWORKS - Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an output sequence of audio data that comprises a respective audio sample at each of a plurality of time steps. One of the methods includes, for each of the time steps: providing a current sequence of audio data as input to a convolutional subnetwork, wherein the current sequence comprises the respective audio sample at each time step that precedes the time step in the output sequence, and wherein the convolutional subnetwork is configured to process the current sequence of audio data to generate an alternative representation for the time step; and providing the alternative representation for the time step as input to an output layer, wherein the output layer is configured to: process the alternative representation to generate an output that defines a score distribution over a plurality of possible audio samples for the time step. | 2020-12-31 |
20200411033 | CONVERSATION ASPECT IMPROVEMENT - One embodiment provides a method, including: analyzing, using a digital assistant, a conversation between at least two users; identifying, using a processor, an improvement opportunity related to the conversation; and presenting, based on the identified improvement opportunity, a suggestion to improve an aspect of the conversation. Other aspects are described and claimed. | 2020-12-31 |
20200411034 | RESOLUTION OF EDIT CONFLICTS IN AUDIO-FILE DEVELOPMENT - A processor may store a first version of an audio file and fragment the audio file into at least a first time segment. The processor may receive a first edit to the audio file and identify a first edited version of the first time segment in the first edit. The processor may update the first version of the audio file with the first edit, resulting in a second version of the audio file comprising the first edited version of the first time segment. The processor may receive a second edit to the first version of the audio file and identify a second edited version of the first time segment in the second edit. The processor may determine, based on the second edited version, that the second edit alters an outdated version of the first time segment, resulting in an edit conflict. The processor may notify a user of the conflict. | 2020-12-31 |
20200411035 | Mobile Emulator Determination using Sound Detection - A method and apparatus for mobile emulator determination using sound fingerprinting is disclosed. The method includes a verification computer system receiving a transaction request from a computing device purporting to be a mobile device. Responsive to receiving the request, the verification computer system transmits a request for verification information to the computing device. The verification system includes information regarding a tone to be generated by a speaker of the computing device. Thereafter, verification information is received from the computing device. The verification information includes information tone information generated by the computing device, wherein the tone is, after generation, detected by a microphone. The verification system then verifies, based on the receive verification information, whether the information indicates that the computing device is a mobile device. | 2020-12-31 |
20200411036 | COUGH DETECTION DEVICE, COUGH DETECTION METHOD, AND RECORDING MEDIUM - A cough detection device including: an acoustic feature extractor that extracts at least one acoustic feature from acoustic data output by a microphone array according to a sound received; a first identifier that performs identification of the sound based on the at least one acoustic feature to determine whether the sound is a cough sound; a direction estimator that estimates an arrival direction of the sound from the acoustic data; an image selector that selects, from first image data indicating an image obtained by capturing a scene in which the sound occurs, second image data indicating an area corresponding to the arrival direction estimated; and a second identifier that performs identification of the image based on the second image data to determine whether a coughing action is shown in the image. | 2020-12-31 |
20200411037 | ALTERNATE RESPONSE GENERATION - Techniques for performing conversation recovery of a system/user exchange are described. In response to determining that an action responsive to a user input cannot be performed, a system may determine a topic to recommend to a user. The topic may be unrelated to the original substance of the user input. The system may have access to various data representing a context in which a user provides an input to the system. The system may use these inputs and various data at runtime to make a determination regarding whether a user should be recommended a topic, as well as what that topic should be. The system may cause a question be output to the user, with the question asking the user about the topic, for example whether the user would like a song played, whether the user would like to hear information about a particular individual (e.g., artist), whether the user would like to know about a particular skill (e.g., a skill having a significantly high popularity among users of the system), or whether the user would like to know about some other topic. If the user responds affirmatively to the recommended topic, the system may pass the user experience off to an appropriate component of the system (e.g., one that is configured to perform an action related to the topic). If the user responds negatively, does not respond at all, or the system is unsure whether the user's response was affirmative or negative, the system may cease interaction with the user, thereby enabling the user to interact with the system as the user desires. | 2020-12-31 |
20200411038 | SYSTEMS AND METHODS FOR IMPROVING AUDIO CONFERENCING SERVICES - Systems and methods are disclosed herein for improving audio conferencing services. One aspect relates to processing audio content of a conference. A first audio signal is received from a first conference participant, and a start and an end of a first utterance by the first conference participant are detected from the first audio signal. A second audio signal is received from a second conference participant, and a start and an end of a second utterance by the second conference participant is detected from the second audio signal. The second conference participant is provided with at least a portion of the first utterance, wherein at least one of start time, start point, and duration is determined based at least in part on the start, end, or both, of the second utterance. | 2020-12-31 |
20200411039 | SPIN ORBITAL TORQUE BASED ENERGY ASSISTED MAGNETIC RECORDING - A magnetic recording head includes a trailing shield, a main pole, and a spin Hall layer. The spin Hall layer is disposed between the trailing shield and the main pole. A first spin torque layer is disposed between the spin Hall layer and the trailing shield. A second spin torque layer is disposed between the spin Hall layer and the main pole. | 2020-12-31 |
20200411040 | METHOD FOR EVALUATING MAGNETIC HEAD AND EVALUATION APPARATUS OF MAGNETIC HEAD - According to one embodiment, a method for evaluating a magnetic head is disclosed. The method can include measuring an electrical characteristic of a current path when an alternating-current magnetic field is applied to the magnetic head. The magnetic head includes the current path. The current path includes an oscillator. The method can include, based on the electrical characteristic, deriving a frequency value relating to an oscillation frequency of the oscillator. | 2020-12-31 |