Patent application number | Description | Published |
20100057463 | System and Method for Generating Natural Language Phrases From User Utterances in Dialog Systems - Embodiments of a dialog system that employs a corpus-based approach to generate responses based on a given number of semantic constraint-value pairs are described. The system makes full use of the data from the user input to produce dialog system responses in combination with a template generator. The system primarily utilizes constraint values in order to realize efficiencies based on the more frequent tasks performed in real dialog systems although rhetorical or discourse aspects of the dialog could also be included in a similar way, that is, labeling the data with such information and performing a training process. The benefits of this system include higher quality user-aligned responses, broader coverage, faster response time, and shorter development cycles. | 03-04-2010 |
20100124325 | System and Method for Interacting with Live Agents in an Automated Call Center - Embodiments of an interface system that enables a call center agent to access and intervene in an interaction between an automated call center system and a caller whenever necessary for complex application tasks is described. The system includes a user interface that presents the agent with one or more categories of information, including the conversation flow, obtained semantic information, the recognized utterances, and access to the utterance waveforms. This information is cross-linked and attached with a confidence level for better access and navigation within the dialog system for the generation of appropriate responses to the caller. | 05-20-2010 |
20100125456 | System and Method for Recognizing Proper Names in Dialog Systems - Embodiments of a dialog system that utilizes contextual information to perform recognition of proper names are described. Unlike present name recognition methods on large name lists that generally focus strictly on the static aspect of the names, embodiments of the present system take into account of the temporal, recency and context effect when names are used, and formulates new questions to further constrain the search space or grammar for recognition of the past and current utterances. | 05-20-2010 |
20100204982 | System and Method for Generating Data for Complex Statistical Modeling for use in Dialog Systems - Embodiments of a dialog system that utilizes grammar-based labeling scheme to generate labeled sentences for use in training statistical models. During the process of training data development, a grammar is constructed manually based on the application domain or adapted from a general grammar rule. An annotation schema is created accordingly based on the application requirements, such as syntactic and semantic information. Such information is then included in the grammar specification. After the labeled grammar is constructed, a generation algorithm is then used to generate sentences for training various statistical models. | 08-12-2010 |
20100241431 | System and Method for Multi-Modal Input Synchronization and Disambiguation - Embodiments of a dialog system that utilizes a multi-modal input interface for recognizing user input in human-machine interaction (HMI) systems are described. Embodiments include a component that receives user input from a plurality of different user input mechanisms (multi-modal input) and performs certain synchronization and disambiguation processes. The multi-modal input components synchronizes and integrates the information obtained from different modalities, disambiguates the input, and recovers from any errors that might be produced with respect to any of the user inputs. Such a system effectively addresses any ambiguity associated with the user input and corrects for errors in the human-machine interaction. | 09-23-2010 |
20110022292 | METHOD AND SYSTEM FOR IMPROVING SPEECH RECOGNITION ACCURACY BY USE OF GEOGRAPHIC INFORMATION - A method for speech recognition includes providing a source of geographical information within a vehicle. The geographical information pertains to a current location of the vehicle, a planned travel route of the vehicle, a map displayed within the vehicle, and/or a gesture marked by a user on a map. Words spoken within the vehicle are recognized by use of a speech recognition module. The recognizing is dependent upon the geographical information. | 01-27-2011 |
20120101823 | SYSTEM AND METHOD FOR RECOGNIZING PROPER NAMES IN DIALOG SYSTEMS - Embodiments of a dialog system that utilizes contextual information to perform recognition of proper names are described. Unlike present name recognition methods on large name lists that generally focus strictly on the static aspect of the names, embodiments of the present system take into account of the temporal, recency and context effect when names are used, and formulates new questions to further constrain the search space or grammar for recognition of the past and current utterances. | 04-26-2012 |
20120303267 | METHOD AND SYSTEM FOR IMPROVING SPEECH RECOGNITION ACCURACY BY USE OF GEOGRAPHIC INFORMATION - A method for speech recognition includes providing a source of geographical information within a vehicle. The geographical information pertains to a current location of the vehicle, a planned travel route of the vehicle, a map displayed within the vehicle, and/or a gesture marked by a user on a map. Words spoken within the vehicle are recognized by use of a speech recognition module. The recognizing is dependent upon the geographical information. | 11-29-2012 |
Patent application number | Description | Published |
20120271631 | SPEECH RECOGNITION USING MULTIPLE LANGUAGE MODELS - In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating a frequency count of each utterance in the plurality of utterances, generating a high-frequency plurality of utterances from the plurality of utterances having a frequency that exceeds a predetermined frequency threshold, generating a low-frequency plurality of utterances from the plurality of utterances having a frequency that is below the predetermined frequency threshold, generating a grammar-based language model using the high-frequency plurality of utterances as training data, and generating a statistical language model using the low-frequency plurality of utterances as training data. | 10-25-2012 |
20130132086 | METHODS AND SYSTEMS FOR ADAPTING GRAMMARS IN HYBRID SPEECH RECOGNITION ENGINES FOR ENHANCING LOCAL SR PERFORMANCE - A speech recognition method includes providing a processor communicatively coupled to each of a local speech recognition engine and a server-based speech recognition engine. A first speech input is inputted into the server-based speech recognition engine. A first recognition result from the server-based speech recognition engine is received at the processor. The first recognition result is based on the first speech input. The first recognition result is stored in a memory device in association with the first speech input. A second speech input is inputted into the local speech recognition engine. The first recognition result is retrieved from the memory device. A second recognition result is produced by the local speech recognition engine. The second recognition result is based on the second speech input and is dependent upon the retrieved first recognition result. | 05-23-2013 |
20130173268 | SPEAKER VERIFICATION IN A HEALTH MONITORING SYSTEM - A method for verifying that a person is registered to use a telemedical device includes identifying an unprompted trigger phrase in words spoken by a person and received by the telemedical device. The telemedical device prompts the person to state a name of a registered user and optionally prompts the person to state health tips for the person. The telemedical device verifies that the person is the registered user using utterance data generated from the unprompted trigger phrase, name of the registered user, and health tips. | 07-04-2013 |
20140019522 | System And Method Of Conversational Assistance For Automated Tasks With Integrated Intelligence - A method of providing information assistance services includes generating a plurality of service requests for a plurality of request elements that are generated from a single client request received by a processor. The service requests are sent to both software application service providers that are executed by the processor and remote service providers that are connected to the local processor through a data network. The processor receives a plurality of service responses from the service providers, generating at least one output message element corresponding to the service responses, and sending the output message data to at least one output device that is operatively connected to the processor to produce a response to the client request. | 01-16-2014 |
20140058584 | System And Method For Multimodal Interaction With Reduced Distraction In Operating Vehicles - A method of interaction with an in-vehicle information system includes receiving first and second inputs from an operator with first and second input devices, respectively. The method further includes identifying a service request corresponding to the first input, and a parameter of the service request with a value that is included in the second input with a controller in the in-vehicle information system. The controller executes stored program instructions to perform the identified service request with reference to the identified parameter. | 02-27-2014 |
20140270382 | System and Method for Identifying Handwriting Gestures In An In-Vehicle Information System - An in-vehicle information system includes a camera and a controller that accept gesture input. A controller receives frames of video data and generates trajectory data for a movement of a hand in the video data. The controller uses a first hidden Markov model (HMM) to decode a sequence of strokes from the trajectory data, removes a starting and ending stroke to form an edited stroke sequence, and re-normalizes the strokes in the edited stroke sequence. The controller uses a second HMM corresponding to a predetermined set of characters to identify a character corresponding to the re-normalized edited stroke sequence. | 09-18-2014 |
20140365219 | Speaker Verification in a Health Monitoring System - A method for verifying that a person is registered to use a telemedical device includes identifying an unprompted trigger phrase in words spoken by a person and received by the telemedical device. The telemedical device prompts the person to state a name of a registered user and optionally prompts the person to state health tips for the person. The telemedical device verifies that the person is the registered user using utterance data generated from the unprompted trigger phrase, name of the registered user, and health tips. | 12-11-2014 |