Patent application number | Description | Published |
20100169088 | AUTOMATED DEMOGRAPHIC ANALYSIS - A method of generating demographic information relating to an individual is provided. The method includes monitoring an environment for a voice activity of an individual and detecting the voice activity of the individual. The method further includes analyzing the detected voice activity of the individual and determining, based on the detected voice activity of the individual, a demographic descriptor of the individual. | 07-01-2010 |
20100281435 | SYSTEM AND METHOD FOR MULTIMODAL INTERACTION USING ROBUST GESTURE PROCESSING - Disclosed herein are systems, computer-implemented methods, and tangible computer-readable media for multimodal interaction. The method includes receiving a plurality of multimodal inputs associated with a query, the plurality of multimodal inputs including at least one gesture input, editing the at least one gesture input with a gesture edit machine. The method further includes responding to the query based on the edited gesture input and remaining multimodal inputs. The gesture inputs can be from a stylus, finger, mouse, and other pointing/gesture device. The gesture input can be unexpected or errorful. The gesture edit machine can perform actions such as deletion, substitution, insertion, and aggregation. The gesture edit machine can be modeled as a finite-state transducer. In one aspect, the method further includes generating a lattice for each input, generating an integrated lattice of combined meaning of the generated lattices, and responding to the query further based on the integrated lattice. | 11-04-2010 |
20110067059 | MEDIA CONTROL - Systems and methods to control media are disclosed. A particular method includes receiving a speech input at a mobile communications device. The speech input is processed to generate audio data. The audio data is sent, via a mobile data network, to a first server. The first server processes the audio data to generate text based on the audio data. Data related to the text is received from the first server. One or more commands are sent to a second server via the mobile data network. In response to the one or more commands, the second server sends control signals based on the one or more commands to a media controller. The control signals cause the media controller to control multimedia content displayed via a display device. | 03-17-2011 |
20110078720 | APPLIED AUTOMATIC DEMOGRAPHIC ANALYSIS - A method for managing a data stream that is transmitted from a stream transmitter to a stream receiver disposed in an environment that includes at least one individual is provided. The method includes detecting an action of the individual in the environment and determining a demographic descriptor of the individual based on the detected action. The method further includes correlating the determined demographic descriptor and a content of the data stream to determine whether a predetermined condition is satisfied, and, in response to the correlating the demographic descriptor of the individual and the content of the data stream satisfying the predetermined condition, automatically modifying the data stream. | 03-31-2011 |
20110082696 | SYSTEM AND METHOD FOR SPEECH-ENABLED ACCESS TO MEDIA CONTENT - Disclosed herein are systems, methods, and computer-readable storage media for generating a speech recognition model for a media content retrieval system. The method causes a computing device to retrieve information describing media available in a media content retrieval system, construct a graph that models how the media are interconnected based on the retrieved information, rank the information describing the media based on the graph, and generate a speech recognition model based on the ranked information. The information can be a list of actors, directors, composers, titles, and/or locations. The graph that models how the media are interconnected can further model pieces of common information between two or more media. The method can further cause the computing device to weight the graph based on the retrieved information. The graph can further model relative popularity information in the list. The method can rank information based on a PageRank algorithm. | 04-07-2011 |
20110099013 | SYSTEM AND METHOD FOR IMPROVING SPEECH RECOGNITION ACCURACY USING TEXTUAL CONTEXT - Disclosed herein are systems, methods, and computer-readable storage media for improving speech recognition accuracy using textual context. The method includes retrieving a recorded utterance, capturing text from a device display associated with the spoken dialog and viewed by one party to the recorded utterance, and identifying words in the captured text that are relevant to the recorded utterance. The method further includes adding the identified words to a dynamic language model, and recognizing the recorded utterance using the dynamic language model. The recorded utterance can be a spoken dialog. A time stamp can be assigned to each identified word. The method can include adding identified words to and/or removing identified words from the dynamic language model based on their respective time stamps. A screen scraper can capture text from the device display associated with the recorded utterance. The device display can contain customer service data. | 04-28-2011 |
20110153310 | MULTIMODAL AUGMENTED REALITY FOR LOCATION MOBILE INFORMATION SERVICE - In one or more embodiments, one or more methods and/or systems described can perform producing a lattice of object hypotheses based on multiple reference objects from image information; receiving input speech information that includes a request for information associated with at least one reference object of the multiple reference objects; producing a lattice of speech hypotheses based on at least a first possible description included in the speech information; producing a lattice of scored semantic hypotheses based on at least the lattice of object hypotheses and the lattice of speech hypotheses; determining that a single semantic interpretation score of the lattice of scored semantic hypotheses exceeds a predetermined value; and providing requested information associated with the at least the first reference object of the plurality of reference objects. | 06-23-2011 |
20110161341 | SYSTEM AND METHOD FOR AN ITERATIVE DISAMBIGUATION INTERFACE - Disclosed herein are systems, methods, and computer-readable storage media for an iterative disambiguation interface. A system practicing the method receives a search query formatted according to a standard XML markup language for containing and annotating interpretations of user input, the search query being based on a natural language spoken query from a user and retrieves search results based on the search query. The system transmits the search results to a user device and iteratively receives multimodal input from the user to change search attributes and transmits updated search results to the user device based on the changed search attributes. The search results can include a link to additional information, such as a video presentation, related to the search results. The standard XML markup language can be Extensible MultiModal Annotation (EMMA) markup language from W3C. The system can generate an iteration transaction history for each multimodal input and updated search result. | 06-30-2011 |
20110161347 | SYSTEM AND METHOD FOR AN N-BEST LIST INTERFACE - Disclosed herein are systems, methods, and computer-readable storage media for providing an N-best list interface. A system practicing the method receives a search query formatted according to a standard language for containing and annotating interpretations of user input, the search query being based on a natural language spoken query from a user and retrieves an N-best list of recognition results based on the search query. The system then transmits the N-best list of recognition results to a user device, receives multimodal disambiguation input from the user, the input indicating an entry in the N-best list, and transmits to the user device additional information associated with the selected entry. The additional information can be a map indicating an address for the selected entry. The standard language can be XML-based Extensible MultiModal Annotation (EMMA) markup language from W3C. | 06-30-2011 |
20110321098 | System and Method for Automatic Identification of Key Phrases during a Multimedia Broadcast - An Internet Protocol television system includes a user profile agent, a keyword detection agent, and an information search agent. The user profile agent is in communication with a multimedia device, and generates a user profile based on information received from the multimedia device. The keyword detection agent is in communication with the user profile agent, and searches text associated with a multimedia video stream transmitted to the multimedia device for keywords associated with the user profile. The information search agent is in communication with the keyword detection agent, and connects to an information source associated with the keywords detected by the keyword detection agent, and provides additional information associated with the keywords to the multimedia device. | 12-29-2011 |
20120030709 | Customized Interface Based on Viewed Programming - In one embodiment, a system generates a customized interface based on displayed programming. The system stores a program that a user displayed through a media device; searches through a network for information related to the displayed program; and extracts data associated with the information related to the displayed program. A custom interface is generated based substantially on the data associated with the information related to the displayed program. | 02-02-2012 |
20120072217 | SYSTEM AND METHOD FOR USING PROSODY FOR VOICE-ENABLED SEARCH - Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating relevant responses to a user query with voice-enabled search. A system practicing the method receives a word lattice generated by an automatic speech recognizer based on a user speech and a prosodic analysis of the user speech, generates a reweighted word lattice based on the word lattice and the prosodic analysis, approximates based on the reweighted word lattice one or more relevant responses to the query, and presents to a user the responses to the query. The prosodic analysis examines metalinguistic information of the user speech and can identify the most salient subject matter of the speech, assess how confident a speaker is in the content of his or her speech, and identify the attitude, mood, emotion, sentiment, etc. of the speaker. Other information not described in the content of the speech can also be used. | 03-22-2012 |
20120072219 | SYSTEM AND METHOD FOR ENHANCING VOICE-ENABLED SEARCH BASED ON AUTOMATED DEMOGRAPHIC IDENTIFICATION - Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating responses to a user speech query in voice-enabled search based on metadata that include demographic features of the speaker. A system practicing the method recognizes received speech from a speaker to generate recognized speech, identifies metadata about the speaker from the received speech, and feeds the recognized speech and the metadata to a question-answering engine. Identifying the metadata about the speaker is based on voice characteristics of the received speech. The demographic features can include age, gender, socio-economic group, nationality, and/or region. The metadata identified about the speaker from the received speech can be combined with or override self-reported speaker demographic information. | 03-22-2012 |
20120117112 | Systems, Methods, and Computer Program Products for Location Salience Modeling for Multimodal Search - Computational models of dialog context have often focused on unimodal spoken dialog or text, using the language itself as the primary locus of contextual information. But as spoken unimodal interaction is replaced by situated multimodal interaction on mobile platforms supporting a combination of spoken dialog with graphical interaction, touch-screen input, geolocation, and other non-linguistic contextual factors, a need arises for more sophisticated models of context that capture the influence of these factors on semantic interpretation and dialog flow. The systems, methods, and computer program products disclosed herein address this need. A method for multimodal search includes, in part, determining an intended location of search query based upon information received from a remote mobile device that issued the search query. | 05-10-2012 |
20120323566 | AUTOMATED DEMOGRAPHIC ANALYSIS BY ANALYZING VOICE ACTIVITY - Methods, systems, and media for determining a response to be generated in an environment are provided. The methods, systems, and media monitor the environment for a voice activity of an individual. The voice activity of the individual is detected and analyzed. A content descriptor of the voice activity is determined based on the voice activity of the individual. A demographic descriptor of the individual is determined based on the voice activity of the individual. The content descriptor, the demographic descriptor, and known information are correlated to determine the response to be generated in the environment. | 12-20-2012 |
20130144629 | SYSTEM AND METHOD FOR CONTINUOUS MULTIMODAL SPEECH AND GESTURE INTERACTION - Disclosed herein are systems, methods, and non-transitory computer-readable storage media for processing multimodal input. A system configured to practice the method continuously monitors an audio stream associated with a gesture input stream, and detects a speech event in the audio stream. Then the system identifies a temporal window associated with a time of the speech event, and analyzes data from the gesture input stream within the temporal window to identify a gesture event. The system processes the speech event and the gesture event to produce a multimodal command. The gesture in the gesture input stream can be directed to a display, but is remote from the display. The system can analyze the data from the gesture input stream by calculating an average of gesture coordinates within the temporal window. | 06-06-2013 |
20130218561 | System and Method for Enhancing Voice-Enabled Search Based on Automated Demographic Identification - Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating responses to a user speech query in voice-enabled search based on metadata that include demographic features of the speaker. A system practicing the method recognizes received speech from a speaker to generate recognized speech, identifies metadata about the speaker from the received speech, and feeds the recognized speech and the metadata to a question-answering engine. Identifying the metadata about the speaker is based on voice characteristics of the received speech. The demographic features can include age, gender, socio-economic group, nationality, and/or region. The metadata identified about the speaker from the received speech can be combined with or override self-reported speaker demographic information. | 08-22-2013 |
20150074702 | SYSTEM AND METHOD FOR AUTOMATIC IDENTIFICATION OF KEY PHRASES DURING A MULTIMEDIA BROADCAST - Aspects of the subject disclosure may include, for example, a process that determines information from a first media stream including a number of keywords associated with viewing habits of a user. A second media stream of a media program is scanned for one of a word or phrase corresponding to a keyword of the number of keywords. A second keyword is determined based on the one of the word or phrase, and additional information associated with the second keyword is identified. Results associated with the additional information are provided to a multimedia device. Other embodiments are disclosed. | 03-12-2015 |
20150088920 | System and Method for an Iterative Disambiguation Interface - Disclosed herein are systems, methods, and computer-readable storage media for an iterative disambiguation interface. A system practicing the method receives a search query formatted according to a standard XML markup language for containing and annotating interpretations of user input, the search query being based on a natural language spoken query from a user and retrieves search results based on the search query. The system transmits the search results to a user device and iteratively receives multimodal input from the user to change search attributes and transmits updated search results to the user device based on the changed search attributes. The search results can include a link to additional information, such as a video presentation, related to the search results. The standard XML markup language can be Extensible MultiModal Annotation (EMMA) markup language from W3C. The system can generate an iteration transaction history for each multimodal input and updated search result. | 03-26-2015 |
20150156521 | APPLIED AUTOMATIC DEMOGRAPHIC ANALYSIS - A system for managing a data stream that is transmitted to an environment is provided. The system includes a receiver that receives the data stream. The data stream includes a first program, with the first program configured to be displayed in the environment. An input receives information of an individual in the environment. A processor analyzes the information, determines a demographic descriptor of the individual based on the information, and correlates the demographic descriptor of the individual with a content of the first program to determine whether a predetermined condition is satisfied. The processor further determines a second program based on the demographic descriptor of the individual and modifies the first program based on the second program when the predetermined condition is satisfied. | 06-04-2015 |
20150178044 | System and Method for Controlling Presentations Using a Multimodal Interface - The invention provides for a system, method, and computer readable medium storing instructions related to controlling a presentation in a multimodal system. The method embodiment of the invention is a method for the retrieval of information on the basis of its content for real-time incorporation into an electronic presentation. The method comprises receiving from a presenter a content-based request for at least one segment of a first plurality of segments within a media presentation and while displaying the media presentation to an audience, displaying to the presenter a second plurality of segments in response to the content-based request. The computing device practicing the method receives a selection from the presenter of a segment from the second plurality of segments and displays to the audience the selected segment. | 06-25-2015 |
20150234800 | SYSTEM AND METHOD FOR CREATING A PRESENTATION USING NATURAL LANGUAGE - The invention provides for a system, method, and computer readable medium storing instructions related to controlling a presentation in a multimodal system. The method embodiment of the invention is a method for the retrieval of information on the basis of its content for incorporation into an electronic presentation. The method comprises receiving from a user a content-based request for at least one segment from a first plurality of segments within a media presentation preprocessed to enable natural language content searchability; in response to the request, presenting a subset of the first plurality of segments to the user; receiving a selection indication from the user associated with at least one segment of the subset of the first plurality of segments and adding the selected at least one segment to a deck for use in a presentation. | 08-20-2015 |
Patent application number | Description | Published |
20100055137 | Microemulsion & sub-micron emulsion process & compositions - An oil in water microemulsion or sub-micron emulsion composition for dermal delivery of at least one pharmaceutically active ingredient, is provided. The composition includes an oil phase dispersed throughout a water phase, the oil phase including at least one member selected from the group consisting of an animal oil, a mineral oil, a vegetable oil, a silane member, a siloxane, an ester, a fatty acid, a fat, a halogen compound, and an alkoxylated alcohol; and at least one lipophilic surfactant, the water phase including at least one hydrophilic surfactant, water and optionally a non-surfactant amphiphilic compound, the weight ratio of the at least one hydrophilic surfactant to the at least one lipophilic surfactant being approximately 9.0:1.0 to 2.0:3.0. | 03-04-2010 |
20130296284 | MICROEMULSION & SUB-MICRON EMULSION PROCESS & COMPOSITIONS - An oil in water microemulsion or sub-micron emulsion composition for dermal delivery of at least one pharmaceutically active ingredient, is provided. The composition includes an oil phase dispersed throughout a water phase, the oil phase including at least one member selected from the group consisting of an animal oil, a mineral oil, a vegetable oil, a silane member, a siloxane, an ester, a fatty acid, a fat, a halogen compound, and an alkoxylated alcohol; and at least one lipophilic surfactant, the water phase including at least one hydrophilic surfactant, water and optionally a non-surfactant amphiphilic compound, the weight ratio of the at least one hydrophilic surfactant to the at least one lipophilic surfactant being approximately 9.0:1.0 to 2.0:3.0. | 11-07-2013 |
20140151255 | TOPICAL GLYCOPYRROLATE FORMULATIONS - Individually packaged topical formulations comprising about 0.25 to about 6% w/w of glycopyrrolate for the treatment of hyperhidrosis, wherein said wipe is contained within a pouch resistant to leakage. The formulations may further comprise ethanol, a buffering agent and water. In addition, the formulations may further comprise a polymer system comprising a hydrophobic polymer in combination with a hydrophilic polymer. | 06-05-2014 |