Zhe Feng

Zhe Feng, Shanghai CN

Patent application number	Description	Published
20100057463	System and Method for Generating Natural Language Phrases From User Utterances in Dialog Systems - Embodiments of a dialog system that employs a corpus-based approach to generate responses based on a given number of semantic constraint-value pairs are described. The system makes full use of the data from the user input to produce dialog system responses in combination with a template generator. The system primarily utilizes constraint values in order to realize efficiencies based on the more frequent tasks performed in real dialog systems although rhetorical or discourse aspects of the dialog could also be included in a similar way, that is, labeling the data with such information and performing a training process. The benefits of this system include higher quality user-aligned responses, broader coverage, faster response time, and shorter development cycles.	03-04-2010
20100124325	System and Method for Interacting with Live Agents in an Automated Call Center - Embodiments of an interface system that enables a call center agent to access and intervene in an interaction between an automated call center system and a caller whenever necessary for complex application tasks are described. The system includes a user interface that presents the agent with one or more categories of information, including the conversation flow, obtained semantic information, the recognized utterances, and access to the utterance waveforms. This information is cross-linked and attached with a confidence level for better access and navigation within the dialog system for the generation of appropriate responses to the caller.	05-20-2010
20100125456	System and Method for Recognizing Proper Names in Dialog Systems - Embodiments of a dialog system that utilizes contextual information to perform recognition of proper names are described. Unlike present name recognition methods on large name lists that generally focus strictly on the static aspect of the names, embodiments of the present system take into account the temporal, recency, and context effects when names are used, and formulate new questions to further constrain the search space or grammar for recognition of the past and current utterances.	05-20-2010
20100204982	System and Method for Generating Data for Complex Statistical Modeling for use in Dialog Systems - Embodiments of a dialog system that utilizes a grammar-based labeling scheme to generate labeled sentences for use in training statistical models are described. During the process of training data development, a grammar is constructed manually based on the application domain or adapted from a general grammar rule. An annotation schema is created accordingly based on the application requirements, such as syntactic and semantic information. Such information is then included in the grammar specification. After the labeled grammar is constructed, a generation algorithm is then used to generate sentences for training various statistical models.	08-12-2010
20100241431	System and Method for Multi-Modal Input Synchronization and Disambiguation - Embodiments of a dialog system that utilizes a multi-modal input interface for recognizing user input in human-machine interaction (HMI) systems are described. Embodiments include a component that receives user input from a plurality of different user input mechanisms (multi-modal input) and performs certain synchronization and disambiguation processes. The multi-modal input component synchronizes and integrates the information obtained from different modalities, disambiguates the input, and recovers from any errors that might be produced with respect to any of the user inputs. Such a system effectively addresses any ambiguity associated with the user input and corrects for errors in the human-machine interaction.	09-23-2010
20110022292	METHOD AND SYSTEM FOR IMPROVING SPEECH RECOGNITION ACCURACY BY USE OF GEOGRAPHIC INFORMATION - A method for speech recognition includes providing a source of geographical information within a vehicle. The geographical information pertains to a current location of the vehicle, a planned travel route of the vehicle, a map displayed within the vehicle, and/or a gesture marked by a user on a map. Words spoken within the vehicle are recognized by use of a speech recognition module. The recognizing is dependent upon the geographical information.	01-27-2011
20120101823	SYSTEM AND METHOD FOR RECOGNIZING PROPER NAMES IN DIALOG SYSTEMS - Embodiments of a dialog system that utilizes contextual information to perform recognition of proper names are described. Unlike present name recognition methods on large name lists that generally focus strictly on the static aspect of the names, embodiments of the present system take into account the temporal, recency, and context effects when names are used, and formulate new questions to further constrain the search space or grammar for recognition of the past and current utterances.	04-26-2012
20120303267	METHOD AND SYSTEM FOR IMPROVING SPEECH RECOGNITION ACCURACY BY USE OF GEOGRAPHIC INFORMATION - A method for speech recognition includes providing a source of geographical information within a vehicle. The geographical information pertains to a current location of the vehicle, a planned travel route of the vehicle, a map displayed within the vehicle, and/or a gesture marked by a user on a map. Words spoken within the vehicle are recognized by use of a speech recognition module. The recognizing is dependent upon the geographical information.	11-29-2012

Zhe Feng, Xian CN

Patent application number	Description	Published
20090119104	Switching Functionality To Control Real-Time Switching Of Modules Of A Dialog System - Systems and methods are described that automatically control modules of dialog systems. The systems and methods include a dialog module that receives and processes utterances from a speaker and outputs data used to generate synthetic speech outputs as responses to the utterances. A controller is coupled to the dialog module, and the controller detects an abnormal output of the dialog module when the dialog module is processing in an automatic mode. The controller comprises a mode control for an agent to control the dialog module by correcting the abnormal output and transferring a corrected output to a downstream dialog module that follows, in a processing path, the dialog module. The corrected output is used in further processing the utterances.	05-07-2009
20090119586	Automatic Generation of Interactive Systems From a Formalized Description Language - Systems and methods are described that automatically generate interactive systems configured for collecting dialog data of human-machine interactions in dialog systems. The systems and methods comprise receiving a task flow that describes operations of a dialog system. A formal description of the task flow is generated, and an interactive system comprising a graphical user interface (GUI) is automatically generated from the formal description. The GUI consists of templates for control of the dialog system and real-time collection and annotating of dialog data during a live dialog between only the dialog system and callers to the dialog system. The dialog data consists of data of the live dialog.	05-07-2009

Zhe Feng, Palo Alto, CA US

Patent application number	Description	Published
20090030686	Method and system for computing or determining confidence scores for parse trees at all levels - In a confidence computing method and system, a processor may interpret speech signals as a text string or directly receive a text string as input, generate a syntactical parse tree representing the interpreted string and including a plurality of sub-trees which each represents a corresponding section of the interpreted text string, determine for each sub-tree whether the sub-tree is accurate, obtain replacement speech signals for each sub-tree determined to be inaccurate, and provide output based on corresponding text string sections of at least one sub-tree determined to be accurate.	01-29-2009
20140089864	Method of Fusing Multiple Information Sources in Image-based Gesture Recognition System - A method of interpreting input from a user includes providing a surface within reach of a hand of the user. A plurality of locations on the surface that are touched by the user are sensed. An alphanumeric character having a shape most similar to the plurality of touched locations on the surface is determined. The determining includes collecting information associated with hand region localized modules, and modeling the information using statistical models. The user is informed of the alphanumeric character and/or a word in which the alphanumeric character is included. Feedback is received from the user regarding whether the alphanumeric character and/or word is an alphanumeric character and/or word that the user intended to be determined in the determining step.	03-27-2014

Zhe Feng, Mountain View, CA US

Patent application number	Description	Published
20120271631	SPEECH RECOGNITION USING MULTIPLE LANGUAGE MODELS - In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating a frequency count of each utterance in the plurality of utterances, generating a high-frequency plurality of utterances from the plurality of utterances having a frequency that exceeds a predetermined frequency threshold, generating a low-frequency plurality of utterances from the plurality of utterances having a frequency that is below the predetermined frequency threshold, generating a grammar-based language model using the high-frequency plurality of utterances as training data, and generating a statistical language model using the low-frequency plurality of utterances as training data.	10-25-2012
20130132086	METHODS AND SYSTEMS FOR ADAPTING GRAMMARS IN HYBRID SPEECH RECOGNITION ENGINES FOR ENHANCING LOCAL SR PERFORMANCE - A speech recognition method includes providing a processor communicatively coupled to each of a local speech recognition engine and a server-based speech recognition engine. A first speech input is inputted into the server-based speech recognition engine. A first recognition result from the server-based speech recognition engine is received at the processor. The first recognition result is based on the first speech input. The first recognition result is stored in a memory device in association with the first speech input. A second speech input is inputted into the local speech recognition engine. The first recognition result is retrieved from the memory device. A second recognition result is produced by the local speech recognition engine. The second recognition result is based on the second speech input and is dependent upon the retrieved first recognition result.	05-23-2013
20130173268	SPEAKER VERIFICATION IN A HEALTH MONITORING SYSTEM - A method for verifying that a person is registered to use a telemedical device includes identifying an unprompted trigger phrase in words spoken by a person and received by the telemedical device. The telemedical device prompts the person to state a name of a registered user and optionally prompts the person to state health tips for the person. The telemedical device verifies that the person is the registered user using utterance data generated from the unprompted trigger phrase, name of the registered user, and health tips.	07-04-2013
20140019522	System And Method Of Conversational Assistance For Automated Tasks With Integrated Intelligence - A method of providing information assistance services includes generating a plurality of service requests for a plurality of request elements that are generated from a single client request received by a processor. The service requests are sent to both software application service providers that are executed by the processor and remote service providers that are connected to the local processor through a data network. The processor receives a plurality of service responses from the service providers, generates at least one output message element corresponding to the service responses, and sends the output message data to at least one output device that is operatively connected to the processor to produce a response to the client request.	01-16-2014
20140058584	System And Method For Multimodal Interaction With Reduced Distraction In Operating Vehicles - A method of interaction with an in-vehicle information system includes receiving first and second inputs from an operator with first and second input devices, respectively. The method further includes identifying a service request corresponding to the first input, and a parameter of the service request with a value that is included in the second input with a controller in the in-vehicle information system. The controller executes stored program instructions to perform the identified service request with reference to the identified parameter.	02-27-2014
20140270382	System and Method for Identifying Handwriting Gestures In An In-Vehicle Information System - An in-vehicle information system includes a camera and a controller that accept gesture input. The controller receives frames of video data and generates trajectory data for a movement of a hand in the video data. The controller uses a first hidden Markov model (HMM) to decode a sequence of strokes from the trajectory data, removes a starting and ending stroke to form an edited stroke sequence, and re-normalizes the strokes in the edited stroke sequence. The controller uses a second HMM corresponding to a predetermined set of characters to identify a character corresponding to the re-normalized edited stroke sequence.	09-18-2014
20140365219	Speaker Verification in a Health Monitoring System - A method for verifying that a person is registered to use a telemedical device includes identifying an unprompted trigger phrase in words spoken by a person and received by the telemedical device. The telemedical device prompts the person to state a name of a registered user and optionally prompts the person to state health tips for the person. The telemedical device verifies that the person is the registered user using utterance data generated from the unprompted trigger phrase, name of the registered user, and health tips.	12-11-2014

Zhe Feng, Shenyang CN

Patent application number	Description	Published
20130305199	IN-PLACE RESYNTHESIS AND REMAPPING TECHNIQUES FOR SOFT ERROR MITIGATION IN FPGA - In-place resynthesis for static memory (SRAM) based Field Programmable Gate Arrays (FPGAs) toward reducing sensitivity to single event upsets (SEUs). Resynthesis and remapping are described which have a low overhead and improve FPGA designs without the need of rerouting LUTs of the FPGA. These methods include in-place reconfiguration (IPR), in-place X-filling (IPF), and in-place inversion (IPV), which reconfigure LUT functions only, and can be applied to any FPGA architecture. In addition, for FPGAs with a decomposable LUT architecture (e.g., dual-output LUTs) an in-place decomposition (IPD) method is described for remapping a LUT function into multiple smaller functions leveraging the unused outputs of the LUT, and making use of built-in hard macros in programmable-logic blocks (PLBs) such as carry chain or adder. Methods are applied in-place to mapped circuits before or after routing without affecting placement, routing, and design closure.	11-14-2013