Entries |
Document | Title | Date |
20080208587 | Document Session Replay for Multimodal Applications - Methods, apparatus, and computer program products are described for document session replay for multimodal applications, including identifying, by a multimodal browser in dependence upon a log produced by a Form Interpretation Algorithm (‘FIA’) during a previous document session with a user, a speech prompt provided by a multimodal application in the previous document session; identifying, by a multimodal browser in replay mode in dependence upon the log, a response to the prompt provided by a user of the multimodal application in the previous document session; retrieving, by the multimodal browser in dependence upon the log, an X+V page of the multimodal application associated with the speech prompt and the response; rendering, by the multimodal browser, the visual elements of the retrieved X+V page; replaying, by the multimodal browser, the speech prompt; and replaying, by a multimodal browser, the response. | 08-28-2008 |
20080208588 | Invoking Tapered Prompts In A Multimodal Application - Methods, apparatus, and computer program products are described for invoking tapered prompts in a multimodal application implemented with a multimodal browser and a multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes. Embodiments include identifying, by a multimodal browser, a prompt element in a multimodal application; identifying, by the multimodal browser, one or more attributes associated with the prompt element; and playing a speech prompt according to the one or more attributes associated with the prompt element. | 08-28-2008 |
20080208589 | Presenting Supplemental Content For Digital Media Using A Multimodal Application - Presenting supplemental content for digital media using a multimodal application, implemented with a grammar of the multimodal application in an automatic speech recognition (‘ASR’) engine, with the multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine, includes: rendering, by the multimodal application, a portion of the digital media; receiving, by the multimodal application, a voice utterance from a user; determining, by the multimodal application using the ASR engine, a recognition result in dependence upon the voice utterance and the grammar; identifying, by the multimodal application, supplemental content for the rendered portion of the digital media in dependence upon the recognition result; and rendering, by the multimodal application, the supplemental content. | 08-28-2008 |
20080208590 | Disambiguating A Speech Recognition Grammar In A Multimodal Application - Disambiguating a speech recognition grammar in a multimodal application, the multimodal application including voice activated hyperlinks, the voice activated hyperlinks voice enabled by a speech recognition grammar characterized by ambiguous terminal grammar elements, including maintaining by the multimodal browser a record of visibility of each voice activated hyperlink, the record of visibility including current visibility and past visibility on a display of the multimodal device of each voice activated hyperlink, the record of visibility further including an ordinal indication, for each voice activated hyperlink scrolled off display, of the sequence in which each such voice activated hyperlink was scrolled off display; recognizing by the multimodal browser speech from a user matching an ambiguous terminal element of the speech recognition grammar; selecting by the multimodal browser a voice activated hyperlink for activation, the selecting carried out in dependence upon the recognized speech and the record of visibility. | 08-28-2008 |
20080208591 | Enabling Global Grammars For A Particular Multimodal Application - Methods, apparatus, and computer program products are described for enabling global grammars for a particular multimodal application according to the present invention by loading a multimodal web page; determining whether the loaded multimodal web page is one of a plurality of multimodal web pages of the particular multimodal application. If the loaded multimodal web page is one of the plurality of multimodal web pages of the particular multimodal application, enabling global grammars typically includes loading any currently unloaded global grammars of the particular multimodal application identified in the multimodal web page and maintaining any previously loaded global grammars. If the loaded multimodal web page is not one of the plurality of multimodal web pages of the particular multimodal application, enabling global grammars typically includes unloading any currently loaded global grammars. | 08-28-2008 |
20080208592 | Configuring A Speech Engine For A Multimodal Application Based On Location - Methods, apparatus, and products are disclosed for configuring a speech engine for a multimodal application based on location. The multimodal application operates on a multimodal device supporting multiple modes of user interaction with the multimodal application. The multimodal application is operatively coupled to a speech engine. Configuring a speech engine for a multimodal application based on location includes: receiving a location change notification in a location change monitor from a device location manager, the location change notification specifying a current location of the multimodal device; identifying, by the location change monitor, location-based configuration parameters for the speech engine in dependence upon the current location of the multimodal device, the location-based configuration parameters specifying a configuration for the speech engine at the current location; and updating, by the location change monitor, a current configuration for the speech engine according to the identified location-based configuration parameters. | 08-28-2008 |
20080208593 | Altering Behavior Of A Multimodal Application Based On Location - Methods, apparatus, and products are disclosed for altering behavior of a multimodal application based on location. The multimodal application operates on a multimodal device supporting multiple modes of user interaction with the multimodal application, including a voice mode and one or more non-voice modes. The voice mode of user interaction with the multimodal application is supported by a voice interpreter. Altering behavior of a multimodal application based on location includes: receiving a location change notification in the voice interpreter from a device location manager, the device location manager operatively coupled to a position detection component of the multimodal device, the location change notification specifying a current location of the multimodal device; updating, by the voice interpreter, location-based environment parameters for the voice interpreter in dependence upon the current location of the multimodal device; and interpreting, by the voice interpreter, the multimodal application in dependence upon the location-based environment parameters. | 08-28-2008 |
20080208594 | Effecting Functions On A Multimodal Telephony Device - Methods, apparatus, and computer program products are described for effecting functions on a multimodal telephony device, implemented with the multimodal application operating on a multimodal telephony device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to an automated speech recognition engine. Embodiments include receiving the speech of a telephone call; identifying with the automated speech recognition engine action keywords in the speech of the telephone call; selecting a function of the multimodal telephony device in dependence upon the action keywords; identifying parameters for the function of the multimodal telephony device; and executing the function of the multimodal telephony device using the identified parameters. | 08-28-2008 |
20080208595 | System and method for capturing steps of a procedure - The present invention relates to a system and method for capturing the steps of a procedure in a workplace or other environment to assist with operations, knowledge transfer or regulatory compliance and for other general purposes. The system and method enable a person to capture a procedure while actively carrying out the procedure in its associated environment. | 08-28-2008 |
20080215335 | COMPUTER, DISPLAY CONTROL DEVICE, POINTER POSITION CONTROL METHOD, AND PROGRAM - To provide a pointer position control method and the like for manipulating a pointer more easily. The user moves the pointer P two-dimensionally and performs click and other operations by using only “voice”, that is, by varying the volume and pitch of the produced voice without uttering any specific command. The user moves the pointer P by varying the volume and switches the travel direction of the pointer P by changing the pitch. Also, by ceasing to vary the volume, the user can automatically enter a fine adjustment mode in which the user can make fine adjustments. Furthermore, the user can perform a click by suddenly ceasing to produce voice and return to normal speech recognition mode by keeping silent. | 09-04-2008 |
20080215336 | METHOD AND SYSTEM FOR ENABLING A DEVICE FUNCTION OF A VEHICLE - The current invention provides a method and system for enabling a device function of a vehicle. A speech input stream is received at a telematics unit. A speech input context is determined for the received speech input stream. The received speech input stream is processed based on the determination and the device function of the vehicle is enabled responsive to the processed speech input stream. A vehicle device in control of the enabled device function of the vehicle is directed based on the processed speech input stream. A computer usable medium with suitable computer program code is employed for enabling a device function of a vehicle. | 09-04-2008 |
20080215337 | SYSTEM, METHOD AND COMPUTER PROGRAM PRODUCT FOR ADDING VOICE ACTIVATION AND VOICE CONTROL TO A MEDIA PLAYER - A media player system, method and computer program product are provided. In use, an utterance is received. A command for a media player is then generated based on the utterance. Such command is utilized for providing wireless control of the media player. | 09-04-2008 |
20080221903 | Hierarchical Methods and Apparatus for Extracting User Intent from Spoken Utterances - Improved techniques are disclosed for permitting a user to employ more human-based grammar (i.e., free form or conversational input) while addressing a target system via a voice system. For example, a technique for determining intent associated with a spoken utterance of a user comprises the following steps/operations. Decoded speech uttered by the user is obtained. An intent is then extracted from the decoded speech uttered by the user. The intent is extracted in an iterative manner such that a first class is determined after a first iteration and a sub-class of the first class is determined after a second iteration. The first class and the sub-class of the first class are hierarchically indicative of the intent of the user, e.g., a target and data that may be associated with the target. The multi-stage intent extraction approach may have more than two iterations. By way of example only, the user intent extracting step may further determine a sub-class of the sub-class of the first class after a third iteration, such that the first class, the sub-class of the first class, and the sub-class of the sub-class of the first class are hierarchically indicative of the intent of the user. | 09-11-2008 |
20080228492 | Device Control Device, Speech Recognition Device, Agent Device, Data Structure, and Device Control - A language analyzer performs speech recognition on a speech input by a speech input unit, specifies a possible word which is represented by the speech, and the score thereof, and supplies word data representing them to an agent processing unit. The agent processing unit stores process item data which defines a data acquisition process to acquire word data or the like, a discrimination process, and an input/output process, and wires or data defining transition from one process to another and giving a transition constant to the transition, and executes a flow represented generally by the process item data and the wires to thereby control devices belonging to an input/output target device group. To which process in the flow the transition takes place is determined by the weighting factor of each wire, which is determined by the connection relationship between a point where the process has proceeded and the wire, and the score of word data. | 09-18-2008 |
20080228493 | Determining voice commands with cooperative voice recognition - A method of recognizing voice commands cooperatively includes generating a voice command from a user specifying a target machine and a desired action to be performed by the target machine, and a plurality of machines receiving the voice command, the plurality of machines comprising the target machine and at least one member machine. The method also includes each of the plurality of machines performing a recognition process on the voice command to produce a corresponding recognition result, each member machine sending its corresponding recognition result to the target machine, and the target machine evaluating its own recognition result together with the recognition result from each member machine to determine a most likely final recognition result for the voice command. | 09-18-2008 |
20080228494 | Speech-Enabled Web Content Searching Using A Multimodal Browser - Speech-enabled web content searching using a multimodal browser implemented with one or more grammars in an automatic speech recognition (‘ASR’) engine, with the multimodal browser operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal browser operatively coupled to the ASR engine, includes: rendering, by the multimodal browser, web content; searching, by the multimodal browser, the web content for a search phrase, including yielding a matched search result, the search phrase specified by a first voice utterance received from a user and a search grammar; and performing, by the multimodal browser, an action in dependence upon the matched search result, the action specified by a second voice utterance received from the user and an action grammar. | 09-18-2008 |
20080228495 | Enabling Dynamic VoiceXML In An X+V Page Of A Multimodal Application - Enabling dynamic VoiceXML in an X+V page of a multimodal application implemented with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to a VoiceXML interpreter, including representing by the multimodal browser an XML element of a VoiceXML dialog of the X+V page as an ECMAScript object, the XML element comprising XML content; storing by the multimodal browser the XML content of the XML element in an attribute of the ECMAScript object; and accessing the XML content of the XML element in the attribute of the ECMAScript object from an ECMAScript script in the X+V page. | 09-18-2008 |
20080228496 | SPEECH-CENTRIC MULTIMODAL USER INTERFACE DESIGN IN MOBILE TECHNOLOGY - A multi-modal human computer interface (HCI) receives a plurality of available information inputs concurrently, or serially, and employs a subset of the inputs to determine or infer user intent with respect to a communication or information goal. Received inputs are respectively parsed, and the parsed inputs are analyzed and optionally synthesized with respect to one or more of each other. In the event sufficient information is not available to determine user intent or goal, feedback can be provided to the user in order to facilitate clarifying, confirming, or augmenting the information inputs. | 09-18-2008 |
20080235029 | Speech-Enabled Predictive Text Selection For A Multimodal Application - Methods, apparatus, and products are disclosed for speech-enabled predictive text selection for a multimodal application, the multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to an automatic speech recognition (‘ASR’) engine through a VoiceXML interpreter, including: identifying, by the VoiceXML interpreter, a text prediction event, the text prediction event characterized by one or more predictive texts for a text input field of the multimodal application; creating, by the VoiceXML interpreter, a grammar in dependence upon the predictive texts; receiving, by the VoiceXML interpreter, a voice utterance from a user; and determining, by the VoiceXML interpreter using the ASR engine, recognition results in dependence upon the voice utterance and the grammar, the recognition results representing a user selection of a particular predictive text. | 09-25-2008 |
20080235030 | Automatic Method For Measuring a Baby's, Particularly a Newborn's, Cry, and Related Apparatus - The present invention concerns an automatic method for measuring a baby's cry, comprising the following step: A. having N samples ρ(i), for i=0, 1, ..., (N−1), of an acoustic signal p(t) representing the cry, sampled at a sampling frequency for a period of duration P; the method being characterised in that it assigns a score PainScore to the acoustic signal p(t) by means of a function AF of one or more acoustic parameters selected from the group comprising: a root-mean-square or rms value p_rms of the acoustic signal p(t) in the period P; a fundamental or pitch frequency F | 09-25-2008 |
20080235031 | Interface apparatus, interface processing method, and interface processing program - An interface apparatus according to an embodiment of the invention includes: an operation detecting section configured to detect a device operation; a status detecting section configured to detect a status change or status continuance of a device or in the vicinity of the device; an operation history accumulating section configured to accumulate an operation detection result and a status detection result in association with each other; an operation history matching section configured to match a status detection result for a newly detected against accumulated status detection results, and select a device operation that corresponds to the status detection result for the newly detected; and an utterance section configured to utter as sound a word corresponding to the selected device operation. | 09-25-2008 |
20080235032 | Method and Apparatus for Data Capture Using a Voice Activated Workstation - A method and apparatus for capturing data in a workstation, wherein a large number of data associated with a sample which is viewed, by a user, through an optical device, such as a microscope, is to be entered in a computer related file. The optical device can be moved to a data-sampling position utilizing voice commands. A pointer can then be moved to an appropriate place in the file to receive the data relating to the data-sampling position. Data can be then entered in the appropriate position utilizing a voice command. The steps of moving the pointer and entering the data can then be repeated until all data is provided with respect to the data-sampling positions. | 09-25-2008 |
20080243517 | SPEECH BOOKMARKS IN A VOICE USER INTERFACE USING A SPEECH RECOGNITION ENGINE AND ACOUSTICALLY GENERATED BASEFORMS - A system and method for navigating a dialog hierarchy from a voice user interface (VUI) using speech bookmarks. The method can detect a user spoken command for bookmarking a location within a dialog hierarchy of a voice response system. A user spoken bookmark can be received, which is added to a personalized bookmark grammar that is associated with a user who spoke the bookmark name. A database record can be used to associate the new bookmark name with a location within the dialog hierarchy. During a subsequent interaction between the user and the voice response system, the user can speak the bookmark name, which results in a match being detected between the spoken phrase and the personalized bookmark grammar. The voice response system can then navigate to the location within bookmark hierarchy that is associated with the speech bookmark. | 10-02-2008 |
20080249782 | Web Service Support For A Multimodal Client Processing A Multimodal Application - Web service support for a multimodal client processing a multimodal application, the multimodal client providing an execution environment for the application and operating on a multimodal device supporting multiple modes of user interaction including a voice mode and one or more non-voice modes, the application stored on an application server, includes: receiving, by the server, an application request from the client that specifies the application and device characteristics; determining, by a multimodal adapter of the server, modality requirements for the application; selecting, by the adapter, a modality web service in dependence upon the modality requirements and the characteristics for the device; determining, by the adapter, whether the device supports VoIP in dependence upon the characteristics; providing, by the server, the application to the client; and providing, by the adapter to the client in dependence upon whether the device supports VoIP, access to the modality web service for processing the application. | 10-09-2008 |
20080255850 | Providing Expressive User Interaction With A Multimodal Application - Methods, apparatus, and products are disclosed for providing expressive user interaction with a multimodal application, the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of user interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to a speech engine through a VoiceXML interpreter, including: receiving, by the multimodal browser, user input from a user through a particular mode of user interaction; determining, by the multimodal browser, user output for the user in dependence upon the user input; determining, by the multimodal browser, a style for the user output in dependence upon the user input, the style specifying expressive output characteristics for at least one other mode of user interaction; and rendering, by the multimodal browser, the user output in dependence upon the style. | 10-16-2008 |
20080255851 | Speech-Enabled Content Navigation And Control Of A Distributed Multimodal Browser - Speech-enabled content navigation and control of a distributed multimodal browser is disclosed, the browser providing an execution environment for a multimodal application, the browser including a graphical user agent (‘GUA’) and a voice user agent (‘VUA’), the GUA operating on a multimodal device, the VUA operating on a voice server, that includes: transmitting, by the GUA, a link message to the VUA, the link message specifying voice commands that control the browser and an event corresponding to each voice command; receiving, by the GUA, a voice utterance from a user, the voice utterance specifying a particular voice command; transmitting, by the GUA, the voice utterance to the VUA for speech recognition by the VUA; receiving, by the GUA, an event message from the VUA, the event message specifying a particular event corresponding to the particular voice command; and controlling, by the GUA, the browser in dependence upon the particular event. | 10-16-2008 |
20080255852 | APPARATUSES AND METHODS FOR VOICE COMMAND PROCESSING - An apparatus for voice command processing comprising a mobile agent execution platform is provided. The mobile agent execution platform comprises a native platform, at least one agent, a mobile agent execution context, and a mobile agent management unit. The mobile agent execution context provides an application interface, enabling the agent to access resources of the native platform via the application interface. The mobile agent management unit performs initiation, running, suspension, resumption and dispatch of the agent. The agent performs functions regarding voice command processing. | 10-16-2008 |
20080262847 | USER POSITIONABLE AUDIO ANCHORS FOR DIRECTIONAL AUDIO PLAYBACK FROM VOICE-ENABLED INTERFACES - The present invention discloses a concept and a use of audio anchors within voice-enabled interfaces. Audio anchors can be user configurable points from which audio playback occurs. In the invention, a user can identify an interface position at which an audio anchor is to be established. The computing device can determine an anchor direction setting, with values that include forward playback and backward playback. Interface items can then be audibly enumerated from the audio anchor in a direction indicated by the anchor direction setting. For example, if a set of interface items are alphabetically ordered items and if an audio anchor is set at a first item beginning with a letter “G” and an anchor direction is set to indicate backward playback, then the interface items beginning with letters “A-F” can be audibly played in reverse alphabetical order. Additionally, a rate of audio playback can be user adjustable. | 10-23-2008 |
20080262848 | Applications Server and Method - A speech applications server is arranged to provide a user driven service in accordance with an application program in response to user commands for selecting service options. The user is prompted by audio prompts to issue the user commands. The application program comprises a state machine operable to determine a state of the application program from one of a predetermined set of states defining a logical procedure through the user selected service options, transitions between states being determined in accordance with logical conditions to be satisfied in order to change between one state of the set and another state of the set. The logical conditions include whether a user has provided one of a set of possible commands. A prompt selection engine is operable to generate the audio prompts for prompting the commands from the user in accordance with predetermined rules. The prompt selected by the prompt selection engine is determined at run-time. Since the state machine and the prompt selection engine are separate entities and the prompts to be selected are determined at run-time, it is possible to effect a change to the prompt selection engine without influencing the operation of the state machine, enabling different customisations to be provided for the same user driven services. In particular, this allows multilingual support, with the possibility of providing rules to adapt the prompt structure so that grammatical differences between two languages are taken into account, thus providing higher quality multiple language support. | 10-23-2008 |
20080262849 | VOICE CONTROL SYSTEM - A voice control system allows a user to control a device through voice commands. The voice control system includes a speech recognition unit that receives a control signal from a mobile device and a speech signal from a user. The speech recognition unit configures speech recognition settings in response to the control signal to improve speech recognition. | 10-23-2008 |
20080275707 | Voice Based Network Management Method and Agent - A method of providing voice based device management, comprising defining a set of one or more status queries for a device, defining for each of the status queries a respective set of status responses for the device corresponding to the instantaneous status of the device, mapping the status queries to corresponding voice format status queries, and mapping the status responses to corresponding voice format status responses. | 11-06-2008 |
20080275708 | NETWORK-BASED VOICE ACTIVATED AUTO-ATTENDANT SERVICE WITH B2B CONNECTORS - A network-based voice activated auto-attendant service is disclosed. In a particular embodiment, a data processor is provided that can construct an enterprise voice directory by executing instructions to encrypt eXtended Markup Language (XML)-based files using an encryption key issued by a voice activated auto-attendant service provider network to form encrypted XML-based files. The instructions are further to store the encrypted XML-based files in a manner that is accessible to the voice activated auto-attendant service provider network, and to create the enterprise voice directory based on the encrypted XML-based files. The enterprise voice directory is configured to provide run-time access to the voice activated auto-attendant service provider network. | 11-06-2008 |
20080281601 | USER SPEECH INTERFACES FOR INTERACTIVE MEDIA GUIDANCE APPLICATIONS - A user speech interface for interactive media guidance applications, such as television program guides, guides for audio services, guides for video-on-demand (VOD) services, guides for personal video recorders (PVRs), or other suitable guidance applications is provided. Voice commands may be received from a user and guidance activities may be performed in response to the voice commands. | 11-13-2008 |
20080288259 | SPEECH RECOGNITION MACRO RUNTIME - The disclosed speech recognition system enables users to define personalized, context-aware voice commands without extensive software development. Command sets may be defined in a user-friendly language and stored in an eXtensible Markup Language (XML) file. Each command object within the command set may include one or more user configurable actions, one or more configurable rules, and one or more configurable conditions. The command sets may be managed by a command set loader that loads and processes each command set into computer executable code. The command set loader may enable and disable command sets. A macro processing component may provide a speech recognition grammar to an API of the speech recognition engine based on currently enabled commands. When the speech recognition engine recognizes user speech consistent with the grammar, the macro processing component may initiate the one or more computer executable actions. | 11-20-2008 |
20080288260 | Input/Output Apparatus Based on Voice Recognition, and Method Thereof - Provided is an input/output apparatus based on voice recognition, and a method thereof. An object of the apparatus and method is to improve a user interface by making pointing input and command execution, such as application program control, possible according to a voice command of a user, based on voice recognition technology and without a separate pointing input device such as a mouse or a touch pad. The apparatus includes: a voice recognizer for recognizing a voice command inputted from outside; a pointing controller for calculating a pointing location on a screen which corresponds to a voice recognition result transmitted from the voice recognizer; a displayer for displaying a screen; and a command controller for processing diverse commands related to a current pointing location. | 11-20-2008 |
20080300886 | SYSTEMS AND METHODS OF A STRUCTURED GRAMMAR FOR A SPEECH RECOGNITION COMMAND SYSTEM - In embodiments of the present invention, a system and method for enabling a user to interact with a computer platform using a voice command may comprise the steps of defining a structured grammar for handling a global voice command, defining a global voice command of the structured grammar wherein the global voice command enables access to an object of the computer platform using a single command, and mapping at least one function of the object to the global voice command, wherein upon receiving voice input from the user of the computer platform the object recognizes the global voice command and controls the function. | 12-04-2008 |
20080306740 | REMOTELY AND INTERACTIVELY CONTROLLING SEMI-AUTOMATIC DEVICES - An apparatus, system, method and computer program product are provided for enabling a user to remotely and interactively control, using voice commands, the processing tasks of multiple pieces of equipment, such as semi-automatic medication storing, dispensing and packaging devices. In particular, an apparatus may be configured to provide a user with a voice prompt associated with a dynamically prioritized task. In response, the apparatus may further be configured to receive a voice command from the user and to transmit an instruction associated with the voice command to one of the multiple pieces of equipment for performance of the prioritized task. | 12-11-2008 |
20080306741 | ROBOT AND METHOD FOR ESTABLISHING A RELATIONSHIP BETWEEN INPUT COMMANDS AND OUTPUT REACTIONS - The present invention relates to a robot and method for establishing a relationship between input commands and output reactions. When initiating an input configuration program, the robot fetches a predetermined motion output reaction and performs a corresponding motion. At this time, the robot receives a vocal input command from a user to obtain a vocal input profile, and establishes a relationship between the motion output reaction and the vocal input profile. When receiving the vocal input command again, the robot performs the corresponding motion according to the relationship. In addition, a sound assigned to the motion output reaction can be altered according to users' preferences. Accordingly, the motion output reaction may have different naming sound. | 12-11-2008 |
20080306742 | APPARATUS, METHOD, AND PROGRAM FOR SUPPORTING SPEECH INTERFACE DESIGN - For design of a speech interface accepting speech control options, speech samples are stored on a computer-readable medium. A similarity calculating unit calculates a certain indication of similarity of first and second sets of ones of the speech samples, the first set of speech samples being associated with a first speech control option and the second set of speech samples being associated with a second speech control option. A display unit displays the similarity indication. | 12-11-2008 |
20080306743 | SYSTEM AND METHOD OF USING MODULAR SPOKEN-DIALOG COMPONENTS - A system and method are disclosed for switching contexts within a spoken dialog between a user and a spoken dialog system. The spoken dialog system utilizes modular subdialogs that are invoked by at least one flow controller that is a finite state model and that is associated with a dialog manager. The spoken dialog system includes a dialog manager with a flow controller and a reusable subdialog module. The method includes, while the spoken dialog is being controlled by the subdialog module that was invoked by the flow controller, receiving context-changing input associated with speech from a user that changes a dialog context and comparing the context-changing input to at least one context shift. And, if any of the context shifts are activated by the comparing step, then passing control of the spoken dialog to the flow controller with a context shift message and destination state. | 12-11-2008 |
20080312934 | USING RESULTS OF UNSTRUCTURED LANGUAGE MODEL BASED SPEECH RECOGNITION TO PERFORM AN ACTION ON A MOBILE COMMUNICATIONS FACILITY - A user may control a mobile communication facility through recognized speech provided to the mobile communication facility. Speech that is recorded by a user using a mobile communication facility resident capture facility is transmitted through a wireless communication facility to a speech recognition facility. The speech recognition facility generates results using an unstructured language model based at least in part on information relating to the recording. The results are transmitted to the mobile communications facility where an action is performed on the mobile communication facility based on the results. | 12-18-2008 |
20080312935 | MEDIA DEVICE WITH SPEECH RECOGNITION AND METHOD FOR USING SAME - A media player utilizing speech recognition software to perform functions of the media player or make file selections that may be played by the media player. The media player may include one or more microphones to receive a voice command from the user. The one or more microphones may be actuated into a state for receiving a voice command and providing the voice command to one or more microprocessors which perform a function based on the voice command. | 12-18-2008 |
20080319761 | SPEECH PROCESSING METHOD BASED UPON A REPRESENTATIONAL STATE TRANSFER (REST) ARCHITECTURE THAT USES WEB 2.0 CONCEPTS FOR SPEECH RESOURCE INTERFACES - The present invention discloses a method of performing speech processing operations based upon Web 2.0 type interfaces with speech engines. The method can include a step of interfacing with a Web 2.0 server from a standard browser. A speech-enabled application served by the Web 2.0 server can be accessed. The browser can render markup of the speech-enabled application. Speech input can be received from a user of the browser. A RESTful protocol, such as the ATOM Publishing Protocol (APP), can be utilized to access a remotely located speech engine. The speech engine can accept GET, PUT, POST, and DELETE commands. The speech processing engine can process the speech input and can provide results to the Web 2.0 server. The Web 2.0 server can perform a programmatic action based upon the provided results, which results in different content being presented in the browser. | 12-25-2008 |
20080319762 | USING A WIKI EDITOR TO CREATE SPEECH-ENABLED APPLICATIONS - The present invention discloses a system and a method for creating and editing speech-enabled WIKIs. A WIKI editor can be served to client-side Web browsers so that end-users can utilize WIKI editor functions, which include functions to create and edit speech-enabled WIKI applications. A WIKI server can serve speech-enabled WIKI applications created via the WIKI editor. Each of the speech-enabled WIKI applications can include a link to at least one speech processing engine located in a speech processing system remote from the WIKI server. The speech processing engine can provide a speech processing capability for the speech-enabled WIKI application when served by the WIKI server. In one embodiment, the speech-enabled applications can include an introspection document, an entry collection of documents, and a resource collection of documents in accordance with standards specified by an ATOM PUBLISHING PROTOCOL (APP). | 12-25-2008 |
20080319763 | SYSTEM AND DIALOG MANAGER DEVELOPED USING MODULAR SPOKEN-DIALOG COMPONENTS - A dialog manager and spoken dialog service having a dialog manager generated according to a method comprising selecting a top level flow controller based on application type, selecting available reusable subdialogs for each application part, developing a subdialog for each application part not having an available subdialog, and testing and deploying the spoken dialog service using the selected top level flow controller, selected reusable subdialogs and developed subdialogs. The dialog manager is capable of handling context shifts in a spoken dialog with a user. Application dependencies are established in the top level flow controller, thus enabling the subdialogs to be reusable and to be capable of managing context shifts and mixed initiative dialogs. | 12-25-2008 |
20090006099 | Depicting a speech user interface via graphical elements - Depiction of a speech user interface via graphical elements is provided. One or more bits of a graphical user interface bitmask are re-designated as speech bits. When a software application processes the re-designated speech bits, a window manager responsible for generating and rendering a graphical user interface for the application passes information to a secondary window manager responsible for generating and rendering a speech user interface. The secondary speech window manager may load a text-to-speech engine, a speech recognizer engine, a lexicon or library of recognizable words or phrases and a set of “grammars” (recognizable words and phrasing) for building a speech user interface that will receive, recognize and act on spoken input to the associated software application. | 01-01-2009 |
20090006100 | Identification and selection of a software application via speech - An audible indication of a user's position within a given speech grammar framework is provided for a speech-enabled software application, and recognition of speech grammars is limited to use only when a software application that has requested a given set of speech grammars is in focus by a user of an associated mobile computing device. | 01-01-2009 |
20090006101 | Method to detect and assist user intentions with real time visual feedback based on interaction language constraints and pattern recognition of sensory features - A language model back-off system can be used with a user interface employing one or more language models to constrain navigation of selectable user interface input components. A user input interpretation module receives user input and interprets the user input to determine if a selection is made of one or more user interface input components. If a selection is not made, the user input interpretation module determines whether conditions are met for backing off one or more language models employed to constrain navigation of the user interface input components. If the conditions are met, a language model back-off module backs off the one or more language models. | 01-01-2009 |
20090012795 | METHOD AND SYSTEM FOR DYNAMIC CONDITIONAL INTERACTION IN A VOICEXML RUN-TIME SIMULATION ENVIRONMENT - A method and system for testing voice applications, such as VoiceXML applications, is provided. The system provides a run-time simulation environment for voice applications that simulates and automates user interaction. A user simulation script is provided in a customized mark-up language. The voice application is processed to derive a nominal output of the voice application. The user simulation script is processed to generate a simulated output for the voice application corresponding to the nominal output. Conditional logic may be applied to the nominal output to generate a simulated input in response thereto. The user simulation script is specified in a customized mark-up language having a set of one or more conditional tags and an internal variable for the nominal output of the voice application. | 01-08-2009 |
20090024394 | AUDIO GUIDANCE SYSTEM - A CPU of a speech ECU acquires vehicle position information. If it is determined from the position information and map data stored in a memory that the vehicle has moved between areas where different languages are spoken as dialects or official languages, the CPU determines a language corresponding to the vehicle position information and transmits a request signal to a speech information center to transmit speech information in the language. By receiving the speech information from the speech information center, the CPU updates speech information pre-stored in the memory with the speech information transmitted from the speech information center. | 01-22-2009 |
20090030695 | System And Method For Hazard Mitigation In Voice-Driven Control Applications - A speech recognition and control system including a sound card for receiving speech and converting the speech into digital data, the sound card removably connected to an input of a computer, recognizer software executing on the computer for interpreting at least a portion of the digital data, event detection software executing on the computer for detecting connectivity of the sound card, and command control software executing on the computer for generating a command based on at least one of the digital data and the connectivity of the sound card. | 01-29-2009 |
20090030696 | USING RESULTS OF UNSTRUCTURED LANGUAGE MODEL BASED SPEECH RECOGNITION TO CONTROL A SYSTEM-LEVEL FUNCTION OF A MOBILE COMMUNICATIONS FACILITY - A user may control a mobile communication facility through recognized speech provided to the mobile communication facility. Speech is recorded by a user using a mobile communication facility resident capture facility. A speech recognition facility generates results of the recorded speech using an unstructured language model based at least in part on information relating to the recording. A function of the operating system of the mobile communication facility is controlled based on the results. | 01-29-2009 |
20090030697 | USING CONTEXTUAL INFORMATION FOR DELIVERING RESULTS GENERATED FROM A SPEECH RECOGNITION FACILITY USING AN UNSTRUCTURED LANGUAGE MODEL - A user may control a mobile communication facility through recognized speech provided to the mobile communication facility. Speech is recorded by a user using a mobile communication facility resident capture facility. A speech recognition facility generates results of the recorded speech using an unstructured language model based at least in part on information relating to the recording. A context of the mobile communications facility at the time the speech is recorded is determined, and based on the context, the generated results are delivered to a facility for performing an action on the mobile communication facility. | 01-29-2009 |
20090030698 | USING SPEECH RECOGNITION RESULTS BASED ON AN UNSTRUCTURED LANGUAGE MODEL WITH A MUSIC SYSTEM - Speech recorded by an audio capture facility of a music facility is processed by a speech recognition facility to generate results that are provided to the music facility. When information related to a music application running on the music facility is provided to the speech recognition facility, the results generated are based at least in part on the application related information. The speech recognition facility uses an unstructured language model for generating results. The user of the music facility may optionally be allowed to edit the results being provided to the music facility. The speech recognition facility may also adapt speech recognition based on usage of the results. | 01-29-2009 |
20090043587 | SYSTEM AND METHOD FOR IMPROVING RECOGNITION ACCURACY IN SPEECH RECOGNITION APPLICATIONS - A speech recognition system and method are provided to correctly distinguish among multiple interpretations of an utterance. This system is particularly useful when the set of possible interpretations is large, changes dynamically, and/or contains items that are not phonetically distinctive. The speech recognition system extends the capabilities of mobile wireless communication devices that are voice operated after their initial activation. | 02-12-2009 |
20090076827 | Control of plurality of target systems - A system for controlling or operating a plurality of target systems via spoken commands is provided. The system includes a first plurality of target systems, a second plurality of controllers for controlling or operating target systems via spoken commands, a speech recognition system that stores interface information that is specific to a target system or a group of target systems that are to be controlled or operated. A first controller in the second plurality of controllers includes a microphone for picking up audible signals in the vicinity of the first controller and a device for transmitting the audible signals to a speech recognition system. The speech recognition system is operable to analyze the interface information to recognize spoken commands issued for controlling or operating said target system. | 03-19-2009 |
20090083039 | ROBOT APPARATUS WITH VOCAL INTERACTIVE FUNCTION AND METHOD THEREFOR - The present invention provides a robot apparatus with a vocal interactive function. The robot apparatus receives a vocal input, and recognizes the vocal input. The robot apparatus stores a plurality of output data, a last output time of each of the output data, and a weighted value of each of the output data. The robot apparatus outputs output data according to the weighted values of all the output data corresponding to the vocal input, and updates the last output time of the output data. The robot apparatus calculates the weighted values of all the output data corresponding to the vocal input according to the last output time. Consequently, the robot apparatus may output different and variable output data when receiving the same vocal input. The present invention also provides a vocal interactive method adapted for the robot apparatus. | 03-26-2009 |
20090089064 | SYSTEM, METHOD AND ARCHITECTURE FOR CONTROL AND MULTI-MODAL SYNCHRONIZATION OF SPEECH BROWSERS - Clients connecting to a VoiceXML browser obtain a control channel. Using this channel, clients may initialize a new VoiceXML session or attach to an existing VoiceXML session. The client after obtaining a session may perform a range of actions including controlling and monitoring actions. | 04-02-2009 |
20090089065 | ADJUSTING OR SETTING VEHICLE ELEMENTS THROUGH SPEECH CONTROL - A speech processing device includes an automotive device that filters data that is sent and received across an in-vehicle bus. The device selectively acquires vehicle data related to user settings or adjustments of an in-vehicle system. An interface acquires the selected vehicle data from one or more in-vehicle sensors in response to a user's articulation of a first code phrase. A memory stores the selected vehicle data with unique identifying data associated with a user. The unique identifying data establishes a connection between the selected vehicle data and the user when a second code phrase is articulated by the user. A data interface provides access to the selected vehicle data and relationship data retained in the memory and enables the processing of the data to customize the in-vehicle system. The data interface is responsive to a user's articulation of a third code phrase to process the selected vehicle data that enables the setting or adjustment of the in-vehicle system. | 04-02-2009 |
20090099849 | Voice input system, interactive-type robot, voice input method, and voice input program - A first voice input system according to the present invention includes: a voice input unit | 04-16-2009 |
20090106029 | VOICE ACQUISITION SYSTEM FOR A VEHICLE - A voice acquisition system for a vehicle includes an interior rearview mirror assembly. The mirror assembly may include a microphone for receiving audio signals within a cabin of the vehicle and generating an output indicative of these audio signals. The microphone may provide sound capture for a hands free cell phone system, an audio recording system and/or an emergency communication system. The system may include a control that is responsive to the output from the microphone and that distinguishes vocal signals from non-vocal signals present in the output. The microphone may provide sound capture for at least one accessory of the equipped vehicle, and the accessory may be responsive to a vocal signal captured by the microphone. The interior rearview mirror assembly may include at least one accessory, such as an antenna, a video device, a security system status indicator, a tire pressure indicator display and/or a loudspeaker. | 04-23-2009 |
20090112603 | CONTROL OF A NON-ACTIVE CHANNEL IN A MULTI-CHANNEL RECEIVER - In one embodiment, a satellite radio receiver is capable of simultaneously processing (i) a first radio channel that is playing on a first speaker and (ii) a second radio channel, different from the first radio channel, that is not playing on the first speaker. The second radio channel can simultaneously be playing on a second speaker, be recorded onto a non-volatile memory, and/or have its processing modified. A user can control the satellite radio receiver using vocal commands, while the first channel is playing on the first speaker. The radio receiver has a microphone connected to a voice-recognition command interpreter that includes an interfering-sound canceller, which reduces sounds interfering with the vocal commands, and a command-recognition module, which recognizes vocal commands and provides a control signal to a multi-channel control processor, which processes and controls the first and second radio channels, received from corresponding decoders connected to a satellite radio receiver antenna. | 04-30-2009 |
20090112604 | Automatically Generating Interactive Learning Applications - Systems and methods are described for generating an interactive voice response (IVR) application from a state transition table and set of extensible markup language templates. Embodiments include representing an interactive student-computer dialog as a state transition table. The target interactive voice and video response (IVVR) markup language is encoded as a discrete set of extensible templates. The dialog states are mapped to IVVR markup language by selecting appropriate extensible templates and instantiating parameterized elements of each template with dialog state constituents. Embodiments organize the extended templates coherently and package the extended templates for deployment on an IVVR delivery platform. | 04-30-2009 |
20090112605 | FREE-SPEECH COMMAND CLASSIFICATION FOR CAR NAVIGATION SYSTEM - The present invention provides a system and method for associating freeform speech commands with one or more predefined commands from a set of predefined commands. The set of predefined commands is stored, and alternate forms associated with each predefined command are retrieved from an external data source. The external data source receives the alternate forms associated with each predefined command from multiple sources so the alternate forms represent paraphrases of the predefined command. A representation including words from the predefined command and the alternate forms of the predefined command, such as a vector representation, is generated for each predefined command. A similarity value between received speech data and each representation of a predefined command is computed, and the speech data is classified as the predefined command whose representation has the highest similarity value to the speech data. | 04-30-2009 |
20090125311 | VEHICULAR VOICE CONTROL SYSTEM - A vehicular voice control system includes a first and a second microphone located on a vehicle external to a vehicle cabin. The microphones receive audio signals from an audio source external to the vehicle and generate microphone output signals. A signal processor processes the microphone output signals, generates a processed signal, and determines a location of the audio source. A speech recognition system receives the processed signal and obtains a recognition result. A controller controls one or more vehicular elements based on the recognition result and the determined location of the audio source. | 05-14-2009 |
20090132255 | Systems and Methods of Performing Speech Recognition with Barge-In for use in a Bluetooth System - Embodiments of the present invention improve methods of performing speech recognition with barge-in. In one embodiment, the present invention includes a speech recognition method comprising starting a synthesis of recorded speech, receiving a user speech input signal providing information regarding a user choice, detecting an initial portion of the user speech input signal, selectively altering the synthesis of recorded speech, and recognizing the user choice. | 05-21-2009 |
20090132256 | Command and control of devices and applications by voice using a communication base system - A first communication path for receiving a communication is established. The communication includes speech, which is processed. A speech pattern is identified as including a voice-command. A portion of the speech pattern is determined as including the voice-command. That portion of the speech pattern is separated from the speech pattern and compared with a second speech pattern. If the two speech patterns match or resemble each other, the portion of the speech pattern is accepted as the voice-command. An operation corresponding to the voice-command is determined and performed. The operation may perform an operation on a remote device, forward the voice-command to a remote device, or notify a user. The operation may create a second communication path that may allow a headset to join in a communication between another headset and a communication device, several headsets to communicate with each other, or a headset to communicate with several communication devices. | 05-21-2009 |
20090150159 | Voice Searching for Media Files - A consumer electronic device has a controller, a speech processing circuit, and a memory to store media files such as audio or video files. The device allows the user to use his or her voice to fast-forward or rewind through the media file to a desired position. Particularly, the device searches one or more selected media files for an audible sound such as a keyword or phrase uttered by the user. If the device locates the audible sound, the device renders the media file having the audible sound starting from that position. | 06-11-2009 |
20090150160 | SYSTEMS AND METHODS OF PERFORMING SPEECH RECOGNITION USING GESTURES - Embodiments of the present invention improve methods of performing speech recognition using human gestures. In one embodiment, the present invention includes a speech recognition method comprising detecting a gesture, selecting a first recognition set based on the gesture, receiving a speech input signal, and recognizing the speech input signal in the context of the first recognition set. | 06-11-2009 |
20090171667 | SYSTEMS AND METHODS FOR LANGUAGE ASSISTED PATIENT INTAKE - A method for assisting in the communication of a medical care provider and a patient is disclosed. The method may include displaying a first display section, the first display section including a plurality of anatomical features, each anatomical feature associated with an indicia indicating the location of the anatomical feature, the anatomical feature also associated with a first name provided in a first language and a second name provided in a second language. The method may also include displaying a second display section, the second display section including a plurality of questions relating to patient intake, where each question is provided in the first language and the second language. | 07-02-2009 |
20090171668 | Recursive Adaptive Interaction Management System - A management system for guiding an agent in a media-specific dialogue has a conversion engine for instantiating ongoing dialogue as machine-readable text, if the dialogue is in voice media, a context analysis engine for determining facts from the text, a rules engine for asserting rules based on fact input, and a presentation engine for presenting information to the agent to guide the agent in the dialogue. The context analysis engine passes determined facts to the rules engine, which selects and asserts to the presentation engine rules based on the facts, and the presentation engine provides periodically updated guidance to the agent based on the rules asserted. | 07-02-2009 |
20090171669 | Methods and Apparatus for Implementing Distributed Multi-Modal Applications - Embodiments of a system include a client device ( | 07-02-2009 |
20090177476 | Method, system and mobile device for registering voice data with calendar events - A system, method and apparatus for registering voice data with a calendar event are provided. Voice data is recorded during the calendar event with a mobile device. The voice data is associated with the calendar event using the mobile device. | 07-09-2009 |
20090177477 | Voice-Controlled Clinical Information Dashboard - A method provides a display area of a computer system for displaying a set of data. The data includes clinical data for one or more medical patients. The method provides multiple controls for performing multiple functions. The method provides an audio interface for controlling at least one of the controls through audio commands. | 07-09-2009 |
20090182562 | DYNAMIC USER INTERFACE FOR AUTOMATED SPEECH RECOGNITION - Techniques are described for generating a dynamic user interface for a position-determining device that may account for a variety of input modes. In one example, a position-determining device is initiated in a first input mode (e.g., a touch screen mode) and a graphical user interface (GUI) of the device is configured to accept input via the first input mode. The position-determining device then receives an indication to switch to a second input mode (e.g., a speech input mode) and the GUI is configured to receive input via the second input mode. The position-determining device can dynamically transition between GUI configurations based on a plurality of input modes. | 07-16-2009 |
20090192801 | SYSTEM AND METHOD FOR CONTROLLING AN ELECTRONIC DEVICE WITH VOICE COMMANDS USING A MOBILE PHONE - A method for controlling an electronic device with voice commands using a mobile phone ( | 07-30-2009 |
20090204409 | Voice Interface and Search for Electronic Devices including Bluetooth Headsets and Remote Systems - Systems and methods for improving the interaction between a user and a small electronic device such as a Bluetooth headset are described. The use of a voice user interface in electronic devices may be used. In one embodiment, recognition processing limitations of some devices are overcome by employing speech synthesizers and recognizers in series where one electronic device responds to simple audio commands and sends audio requests to a remote device with more significant recognition analysis capability. Embodiments of the present invention may include systems and methods for utilizing speech recognizers and synthesizers in series to provide simple, reliable, and hands-free interfaces with users. | 08-13-2009 |
20090204410 | VOICE INTERFACE AND SEARCH FOR ELECTRONIC DEVICES INCLUDING BLUETOOTH HEADSETS AND REMOTE SYSTEMS - Systems and methods for improving the interaction between a user and a small electronic device such as a Bluetooth headset are described. The use of a voice user interface in electronic devices may be used. In one embodiment, recognition processing limitations of some devices are overcome by employing speech synthesizers and recognizers in series where one electronic device responds to simple audio commands and sends audio requests to a remote device with more significant recognition analysis capability. Embodiments of the present invention may include systems and methods for utilizing speech recognizers and synthesizers in series to provide simple, reliable, and hands-free interfaces with users. | 08-13-2009 |
20090204411 | IMAGE PROCESSING APPARATUS, VOICE ASSISTANCE METHOD AND RECORDING MEDIUM - An image processing apparatus comprises: a voice input portion; a memory that stores in itself as voice data, voice of a plurality of users for voice assistance, which is inputted by the voice input portion; a selection portion that selects voice data applied for a login user among the voice data stored in the memory, if information should be given by voice; and a voice output portion that outputs voice corresponding to the selected voice data. | 08-13-2009 |
20090210232 | LAYERED PROMPTING: SELF-CALIBRATING INSTRUCTIONAL PROMPTING FOR VERBAL INTERFACES - A plurality of prompting layers configured to provide varying levels of detailed assistance in prompting a user are maintained. A prompt from a current prompting layer is presented to a user. Input is received from the user. A level of detail in prompting the user is adaptively changed based on user behavior. Upon the user making a hesitant verbal gesture that reaches a threshold duration, a transition is made from the current prompting layer to a more detailed prompting layer. Upon the user interrupting the prompt with a valid input, a transition is made from the current prompting layer to a less detailed prompting layer. | 08-20-2009 |
20090210233 | COGNITIVE OFFLOADING: INTERFACE FOR STORING AND COMPOSING SEARCHES ON AND NAVIGATING UNCONSTRAINED INPUT PATTERNS - One or more commands are configured to cause content to be stored for retrieval. The content to be stored includes one or more entries. The content may include event-triggered content stored for retrieval upon an occurrence of a specified event or other content. The content is retrieved in response to a retrieval command specifying a given pattern by comparing the given pattern with the stored content and, upon finding a match for the given pattern, wherein the match corresponds with the given pattern within a predetermined variance, retrieving additional content stored with the match for the given pattern. The content also may be retrieved by identifying the occurrence of the specified event and retrieving the event-triggered content upon the occurrence of the specified event. | 08-20-2009 |
20090216538 | Method for Interacting With Users of Speech Recognition Systems - A computer implemented method facilitates a user interaction via a speech-based user interface. The method acquires spoken input from a user in a form of a phrase of one or more words. It further determines, using a plurality of different domains, whether the phrase is a query or a command. If the phrase is a query, the method retrieves and presents relevant items from a plurality of databases. If the phrase is a command, the method performs an operation. | 08-27-2009 |
20090216539 | IMAGE CAPTURING DEVICE - An image capturing device includes a digital signal processor for processing an image captured by an imaging sensor, a display unit for displaying the image, a storage unit for storing the image and preset voice samples, and a voice processing unit for picking up sound waves and converting the sound waves into text information. Each voice sample represents a category. In a first operation mode, the digital signal processor assigns the image to the category if the text information approximately matches one of the voice samples, or establishes a new category and assigns the images to the new category if the text information does not match any of the voice samples. In a second operation mode, the digital signal processor causes the image in the category corresponding to the text information to be displayed by the display unit in a slideshow fashion or a thumbnail fashion. | 08-27-2009 |
20090216540 | Open Architecture For A Voice User Interface - A system and method for processing voice requests from a user for accessing information on a computerized network and delivering information from a script server and an audio server in the network in audio format. A voice user interface subsystem includes: a dialog engine that is operable to interpret requests from users from the user input, communicate the requests to the script server and the audio server, and receive information from the script server and the audio server; a media telephony services (MTS) server, wherein the MTS server is operable to receive user input via a telephony system, and to transfer the user input to the dialog engine; and a broker coupled between the dialog engine and the MTS server. The broker establishes a session between the MTS server and the dialog engine and controls telephony functions with the telephony system. | 08-27-2009 |
20090222270 | VOICE COMMAND INTERFACE DEVICE - A device includes a speech input device. A speech recognition processor connected to the speech input device receives speech input. The device includes a computer readable medium coupled to the speech recognition processor. A command table stored on the computer readable medium includes commands corresponding to a control on a manual input interface on a digital music player. The digital music player is separate from the speech input device. The speech recognition processor compares the speech input to the commands in the command table and generates instructions if the speech input matches a command in the command table. A programmable controller is coupled to the speech recognition processor and is configured to receive instructions and to convert the instructions into control signals. The device includes a standard interface connector coupled to the programmable controller. The programmable controller sends the control signals through the standard interface connector. | 09-03-2009 |
20090222271 | Method For Operating A Navigation System - A method for operating a navigation system analyzes several address components to determine the most likely address desired by a user. The navigation device includes a receiving device on which an acoustic address input consisting of several input components can be registered. The input components of the address are analyzed with a speech recognition module, wherein at least one geographical location, which is defined by an address with several address components, is selected from a database for further processing depending on the result of the speech recognition analysis. The method includes analyzing several address component combinations to determine the most likely address inputted by the user. | 09-03-2009 |
20090240502 | MULTIMEDIA CONTROLLER TABLET - A media controller tablet is disclosed comprising: an ergonomic housing including a pair of complementary curvilinear surfaces tapering from an upper end to a lower end, and a gripping surface in contact with one of the complementary curvilinear surfaces wherein the gripping surface and the complementary curvilinear surfaces are adapted to engage a human hand with a palm of the human hand in contact with one of the pair of complementary curvilinear surfaces and a plurality of fingers of the human hand wrapped around the gripping surface; a controller embedded in the ergonomic housing and adapted to receive an input and wirelessly control a media device in accordance with the input; and a display integrated with the ergonomic housing and connected with the controller for displaying information about media content available for viewing on the media device. | 09-24-2009 |
20090248418 | Speech Recognition and Statistics-Based Call Route Determination - A method of call route determination based upon a statistics-based business intelligence engine (BIE) queried by an IVR subsystem with caller parameters descriptive of the caller to determine a next best route for a received call, when the default or best route for the call exceeds a threshold time. A call is received at a contact center from a caller. Content and identity information of the caller is extracted from the received call. IVR determines a first estimated wait time associated with a default route of the received call. If the first estimated wait time is greater than a threshold time, and thus unacceptable, then the IVR queries a business intelligence engine (BIE) with caller parameters descriptive of the caller to determine a next best route of the received call, with the next best route having a second estimated wait time less than the first estimated wait time of the default route. The caller is then routed to the next best route. | 10-01-2009 |
20090248419 | SPEECH RECOGNITION ADJUSTMENT BASED ON MANUAL INTERACTION - A method of operating a speech recognition system on a vehicle having a visual display and manually-operated input device that includes initiating a speech recognition system, controlling menu selections on a visual display using a manually-operated input device, receiving a notification from the manually-operated input device indicating that the user is manipulating the device in conjunction with the menu selections on the visual display, and adjusting operation of the speech recognition system based on input received by the manually-operated input device. | 10-01-2009 |
20090248420 | MULTI-PARTICIPANT, MIXED-INITIATIVE VOICE INTERACTION SYSTEM - A voice interaction system includes one or more independent, concurrent state charts, which are used to model the behavior of each of a plurality of participants. The model simplifies the notation and provides a clear description of the interactions between multiple participants. These state charts capture the flow of voice prompts, the impact of externally initiated events and voice commands, and capture the progress of audio through each prompt. This system enables a method to prioritize conflicting and concurrent events leveraging historical patterns and the progress of in-progress prompts. | 10-01-2009 |
20090254351 | MOBILE TERMINAL AND MENU CONTROL METHOD THEREOF - A mobile terminal including an input unit configured to receive an input to activate a voice recognition function on the mobile terminal, a memory configured to store information related to operations performed on the mobile terminal, and a controller configured to activate the voice recognition function upon receiving the input to activate the voice recognition function, to determine a meaning of an input voice instruction based on at least one prior operation performed on the mobile terminal and a language included in the voice instruction, and to provide operations related to the determined meaning of the input voice instruction based on the at least one prior operation performed on the mobile terminal and the language included in the voice instruction and based on a probability that the determined meaning of the input voice instruction matches the information related to the operations of the mobile terminal. | 10-08-2009 |
20090271203 | Voice-activated remote control service - A method of remotely controlling operation of a controlled device involves receiving a telephone call from an owner via a telephone network; authenticating the telephone call to establish that the owner is authorized to control the controlled device; interpreting a voice command from the owner that issues instructions to the controlled device; identifying the controlled device based upon the authentication and identification by the owner of the controlled device; converting the voice command to one or more data packets capable of interpretation by the controlled device to execute the command; and delivering the one or more data packets to the controlled device via the Internet. This abstract is not to be considered limiting, since other embodiments may deviate from the features described in this abstract. | 10-29-2009 |
20090276224 | SYSTEM AND METHOD FOR MONITORING DELIVERY OF MEDIA CONTENT BY A MEDIA COMMUNICATION SYSTEM - A system that incorporates teachings of the present disclosure may operate according to, for example, a method involving recording audio feedback from a plurality of subscribers commenting on media content supplied by a media communication system on at least one of a plurality of media channels, detecting one or more trigger words in the recorded audio feedback having an association with a disruption of one or more media services supplied by the media communication system, selecting one or more network elements of the media communication system in at least one transmission path that supplies media services to one or more of the plurality of subscribers that supplied audio feedback matching the one or more trigger words, and directing the selected one or more network elements to record media content on one or more media channels selected from the plurality of media channels. Other embodiments are disclosed. | 11-05-2009 |
20090276225 | METHOD FOR AUTOMATED SENTENCE PLANNING IN A TASK CLASSIFICATION SYSTEM - The invention relates to a method for sentence planning ( | 11-05-2009 |
20090299751 | Robot apparatus and method for registering shortcut command thereof - A robot apparatus including an input unit to receive a voice command from a user, a determination unit to determine whether a voice command is repeated a predetermined number of times, and a control unit to register a shortcut command to shorten a voice command if it is determined a voice command is repeated a predetermined number of times. A shortcut command to shorten a voice command of a user is generated, and thus user convenience is enhanced. | 12-03-2009 |
20090299752 | Recognition of Voice-Activated Commands - Systems and methods for voice activated commands in a digital home communication terminal are disclosed. One example method includes storing a program audio signal corresponding to a program tuned by the digital home communication terminal. The method also includes storing an incoming audio signal carrying speech and removing from the incoming audio signal a portion of the incoming audio signal that corresponds to the program audio signal, thus producing an improved version of the incoming audio signal. The method also includes selecting one of a plurality of voice-activated commands that corresponds to the improved version of the incoming audio signal, and performing a function corresponding to the selected voice-activated command. | 12-03-2009 |
20090306990 | VOICE ACTUATED AND OPERATOR VOICE PROMPTED COORDINATE MEASURING SYSTEM - A vehicle coordinate measuring system and method including a coordinate measuring device operably connected to a computer, and a voice input device for receiving prompts from the computer and enabling an operator to transmit responses to the prompts to the computer, with the computer adapted to translate the responses to digital information. The coordinate measuring device may record and transmit point location data to the computer, and the computer may correlate the point location data from the coordinate measuring device with the digital information from the responses. | 12-10-2009 |
20090306991 | METHOD FOR SELECTING PROGRAM AND APPARATUS THEREOF - A program selection method and a display apparatus thereof are provided. The program selection method includes generating a program list including at least one program title, determining whether there is a voice input for a program selection; searching for a desired program title corresponding to the voice input for the program selection among the at least one program title in the program list, and selecting a program corresponding to the desired program title based on the searching for the desired program title. | 12-10-2009 |
20090313026 | CONVERSATIONAL COMPUTING VIA CONVERSATIONAL VIRTUAL MACHINE - A conversational computing system that provides a universal coordinated multi-modal conversational user interface (CUI) | 12-17-2009 |
20090319276 | Voice Enabled Remote Control for a Set-Top Box - A remote control device includes a digital audio storage device, a talk button, and an optical distance measurer. The digital audio storage device is configured to continually record an audio input for a specific amount of time. The talk button is coupled to the digital audio storage device and is configured to initiate a transmission of the audio input to a set-top box device. The optical distance measurer is coupled to the talk button and is configured to automatically measure a distance to a user in response to the talk button being pressed. | 12-24-2009 |
20090326956 | VOICE CONTROL SYSTEM AND METHOD FOR OPERATING DIGITAL PHOTO FRAME - A voice control system includes an acoustic sensor, and a digital photo frame. The acoustic sensor is configured to receive a voice signal, and transform the voice signal to an electronic signal. The digital photo frame includes a transforming module, an instruction module, and a comparing module. The transforming module receives the electronic signal sent from the acoustic sensor and transforms the electronic signal to a transformed electronic code. The instruction module defines a plurality of predetermined electronic codes for performing predetermined functions of the digital photo frame. The comparing module compares the transformed electronic code with the predetermined electronic codes. If the transformed electronic code matches one of the predetermined electronic codes, the digital photo frame performs a function of the predetermined functions associated with the matched predetermined electronic code. A method for operating the digital photo frame is also provided. | 12-31-2009 |
20090326957 | OPERATION METHOD OF INTERACTIVE REFRIGERATOR SYSTEM - An operation method of an interactive refrigerator system includes displaying information about stored items corresponding to a speech input by a user, generating and outputting a response message for the information about the stored items, checking whether or not storage periods of the stored items are expired, and outputting expiration information about storage periods of the stored items or expected expiration information about storage periods of the stored items. | 12-31-2009 |
20100017212 | TURN-TAKING MODEL - A method is claimed for managing interactive dialog between a machine and a user. In one embodiment, an interaction between the machine and the user is managed in response to a timing position of possible speech onset from the user. In another embodiment, the interaction between the machine and the user is dependent upon the timing of a recognition result, which is relative to a cessation of a verbalization of a desired sequence from the machine. In another embodiment, the interaction between the machine and the user is dependent upon a recognition result and whether the desired sequence was ceased or not ceased. | 01-21-2010 |
20100049527 | Method and Device for Voice Control of a Device or of a System in a Motor Vehicle - A method for voice controlling of a device or of a system in a motor vehicle, the device or the system being capable of being operated both by voice inputs and also by non-voice inputs, in particular through the actuation of switches and/or buttons and/or a touch screen, and in which the user of the device or system, in particular the driver of the motor vehicle, is alerted optically and/or acoustically and/or haptically that voice operation of the device or system is possible, dependent on the presence or absence of particular predefined conditions. In addition, the present invention relates to a device or system for supporting the voice controlling of a device or system in a motor vehicle, with which this method is able to be executed. | 02-25-2010 |
20100049528 | SYSTEM AND METHOD FOR CUSTOMIZED PROMPTING - A method for providing an audible prompt to a user within a vehicle. The method includes retrieving one or more data files from a memory device. The data files define certain characteristics of an audio prompt. The method also includes creating the audio prompt from the data files and outputting the audio prompt as an audio signal. | 02-25-2010 |
20100049529 | INTEGRATED SYSTEM AND METHOD FOR MOBILE AUDIO PLAYBACK AND DICTATION - A method and system provides for a single-pass review and feedback of a document. During audio playback of the document to be reviewed, voice-activated recording of feedback and submission of feedback relative to the location in the original document are accomplished. This provides for a fully integrated, single pass review and feedback of documentation to occur. | 02-25-2010 |
20100057468 | BINARY-CACHING FOR XML DOCUMENTS WITH EMBEDDED EXECUTABLE CODE - A method, system and voice browser execute voice applications to perform a voice-based function. A document is retrieved and parsed to create a parse tree. Script code is created from the parse tree, thereby consuming part of the parse tree to create a reduced parse tree. The reduced parse tree is stored in a cache for subsequent execution to perform the voice-based function. | 03-04-2010 |
20100057469 | METHOD AND SYSTEM FOR ORDERING CONTENT USING A VOICE MENU SYSTEM - A method and system for ordering content includes a voice menu system and a phone device communicating a phone signal to the voice menu system. The voice menu system determines the phone number associated with the phone device through the phone signal and generates a voice prompt for recording a content selection from the voice menu system. The phone device selects a recording content option. The voice menu system generates prompts for determining a content title. The phone device selects a content title by communicating a selection signal to the voice menu system. The voice menu system enables a content recording at a recording device in response to the selection signal. | 03-04-2010 |
20100057470 | SYSTEM AND METHOD FOR VOICE-ENABLED MEDIA CONTENT SELECTION ON MOBILE DEVICES - A system for voice-enabled location and execution for playback of media content selections stored on a media content playback device has a voice input circuitry for inputting voice-based commands into the playback device; codec circuitry for converting voice input from analog content to digital content for speech recognition and for converting voice-located media content to analog content for playback; and a media content synchronization device for maintaining at least one grammar list of names representing media content selections in a current state according to what is currently stored and available for playback on the playback device. | 03-04-2010 |
20100063823 | METHOD AND SYSTEM FOR GENERATING DIALOGUE MANAGERS WITH DIVERSIFIED DIALOGUE ACTS - A method to generate a dialogue manager (DM) is provided, in which a plurality of DMs with the same purpose but having different dialogue acts is automatically generated according to a DM designed by a designer. An automatic aiding tool facilitates the design of a dialogue flow and the adjustment of DM rules, and also helps a system designer to find out potential problems in the original DM. The method adopts the current DM combined with a user simulation technique and further employs a specially designed scoring function, so as to automatically generate a plurality of new DMs. The new DMs achieve the same dialogue purpose as the original DM, but differ from the original DM in system acts and responses during the dialogue process. The dialogue flow of the dialogue system is enhanced, and meanwhile, the design and improvement of the DM are also accelerated. | 03-11-2010 |
20100082351 | UNIVERSAL REMOTE CONTROLLER AND CONTROL CODE SETUP METHOD THEREOF - The present invention provides a universal remote controller that transmits and informs the learning of control codes, the setup of the control codes, and the setup of preference channels by voice commands, learns a voice command or key value input according to the voice command transmitted, and registers the learned key value as the control code. A control codes setup method includes detecting input of a voice command or a specific key signal requesting a control codes setup in a standby mode, starting, when the voice command or specific key signal is detected, a control codes setup mode and transmitting a control codes setup method step by step by voice information, recognizing a voice command or key signal input by a user according to the voice information transmitted, and starting a standby mode after registering and storing the control codes matching with the recognized voice commands or key signals and transmitting voice information on the registering and storing of the control codes. | 04-01-2010 |
20100114580 | Responding to a Call to Action Contained in an Audio Signal - An audio signal is monitored to detect the presence of a call to action contained therein. Addressing information is automatically extracted from the call to action and stored on a storage medium. An electronic message responding to the call to action may be automatically prepared, or a contact field may be automatically populated for inclusion in a contact list. The audio signal may be digitized or obtained from a broadcast transmission, and the process may be performed by a mobile communication device, a central system, or a combination thereof. | 05-06-2010 |
20100121645 | OPERATING DEVICE FOR A MOTOR VEHICLE - In a method for the operator control of a motor vehicle having a display for displaying variable information and having a microphone, the viewing direction of an operator of the motor vehicle is ascertained, it is checked whether the viewing direction of the operator is aimed toward the display, and information assigned to an acoustic command is shown on the display when a corresponding acoustic command is given while the viewing direction of the operator is aimed toward the display. | 05-13-2010 |
20100131280 | VOICE RECOGNITION SYSTEM FOR MEDICAL DEVICES - A system for transmitting voice commands to a medical device for carrying out those commands by the medical device. The system includes a remote control device that receives the voice commands from the caregiver and recognizes the caregiver as being authorized to give such commands. The recognized commands are then analyzed to determine the particular command, and the signals representing that command are transmitted in digital form by a wireless protocol, such as a ZigBee wireless protocol, to a receiving module incorporated into or in communication with the medical device. The receiving module decodes the wireless protocol, identifies the particular command, and interfaces that command to the patient device, whereby the command effects the operation of the patient device, such as by silencing an alarm on the medical device. | 05-27-2010 |
20100138224 | NON-DISRUPTIVE SIDE CONVERSATION INFORMATION RETRIEVAL - Information is exchanged between a user of a communications device and an application during an ongoing conversation between the user using the communications device and a party, without disrupting the conversation. An application associated with the communications device is accessed via the communications device in response to a command and keyword spoken by the user during the communications session. Information is retrieved from the application according to the keyword spoken by the user. When the information is retrieved from the application, the user is prompted in a manner transparent to the party, after which a response is sent to the user. | 06-03-2010 |
20100145710 | Data-Driven Voice User Interface - A method for developing a voice user interface for a statistical semantic system is described. A set of semantic meanings is defined that reflect semantic classification of a user input dialog. Then, a set of speech dialog prompts is automatically developed from an annotated transcription corpus for directing user inputs to corresponding final semantic meanings. The statistical semantic system may be a call routing application where the semantic meanings are call routing destinations. | 06-10-2010 |
20100161339 | METHOD AND SYSTEM FOR OPERATING A VEHICULAR ELECTRONIC SYSTEM WITH VOICE COMMAND CAPABILITY - Methods and systems for operating an avionics system with voice command capability are provided. A first voice command is received. A first type of avionics system function is performed in response to the receiving of the first voice command. A second voice command is received. A second type of avionics system function that has a hazard level higher than that of the first type of avionics system function is performed in response to the receiving of the second voice command only after a condition is detected that is indicative of a confirmation of the request to perform the second type of avionics function. The avionics system may also have the capability to test whether or not the voice command feature is functioning properly. | 06-24-2010 |
20100169097 | AUDIBLE LIST TRAVERSAL - Many embodiments may comprise logic such as hardware and/or code to implement a user interface for traversal of long sorted lists, via audible mapping of the lists, using sensor based gesture recognition, audio and tactile feedback and button selection while on the go. In several embodiments, such user interface modalities are physically small in size, enabling a user to be truly mobile by reducing the cognitive load required to operate the device. For some embodiments, the user interface may be divided across multiple worn devices, such as a mobile device, watch, earpiece, and ring. Rotation of the watch may be translated into navigation instructions, allowing the user to traverse the list while the user receives audio feedback via the earpiece to describe items in the list as well as audio feedback regarding the navigation state. Many embodiments offer the user a simple user interface to traverse the list without visual feedback. | 07-01-2010 |
20100169098 | SYSTEM AND METHOD OF A LIST COMMANDS UTILITY FOR A SPEECH RECOGNITION COMMAND SYSTEM - In embodiments of the present invention, a system and computer-implemented method for enabling a user to interact with a mobile device using a voice command may include the steps of defining a structured grammar for generating a global voice command, defining a global voice command of the structured grammar, wherein the global voice command enables access to an object of the mobile device using a single command, and mapping at least one function of the object to the global voice command, wherein upon receiving voice input from the user of the mobile device, the object recognizes the global voice command and controls the function. | 07-01-2010 |
20100174546 | Sound recognition apparatus of robot and method for controlling the same - Disclosed is a sound recognition apparatus of a robot and a method for controlling the same. The sound recognition apparatus senses sound and determines if the sound is for communication by comparing the sensed sound with a preset reference condition. If the sound is for conversation, the movement of the robot is controlled. The method includes comparing the sound sensed by the robot with a preset reference condition, thereby determining if the sound is for communication with a user. When a conversation is intended, recognition rate is increased, and the robot is moved according to the intention of communication. | 07-08-2010 |
20100185449 | METHOD AND SYSTEM FOR COMMUNICATING WITH AN INTERACTIVE VOICE RESPONSE (IVR) SYSTEM - Disclosed is a method and system for interacting with an IVR system. In one aspect, a computing device receives a user request to connect to an IVR system to perform an action. A request for information (e.g., a request to select from a plurality of menu options) is obtained from the IVR system. In response to the request, the computing device automatically supplies an answer to the request for information to the IVR system. In one embodiment, the answer is a dual-tone multi-frequency (DTMF) signal. The obtaining and supplying steps are repeated until the action has been performed. | 07-22-2010 |
20100191535 | SYSTEM AND METHOD FOR INTERRUPTING AN INSTRUCTIONAL PROMPT TO SIGNAL UPCOMING INPUT OVER A WIRELESS COMMUNICATION LINK - A voice interactive session includes detection of an input signaling an interrupt to the session. When the interrupt is detected, instructional and/or informational output is interrupted and detection of voice input begins. The voice input is not detected until the output is interrupted. Upon detection of a voice input (or other sound-based input), a determination may be made if the input was valid. If the input was valid, the input is processed; otherwise, instructional and/or informational output may be relayed again and/or the voice input may be redetected. | 07-29-2010 |
20100217604 | SYSTEM AND METHOD FOR PROCESSING MULTI-MODAL DEVICE INTERACTIONS IN A NATURAL LANGUAGE VOICE SERVICES ENVIRONMENT - A system and method for processing multi-modal device interactions in a natural language voice services environment may be provided. In particular, one or more multi-modal device interactions may be received in a natural language voice services environment that includes one or more electronic devices. The multi-modal device interactions may include a non-voice interaction with at least one of the electronic devices or an application associated therewith, and may further include a natural language utterance relating to the non-voice interaction. Context relating to the non-voice interaction and the natural language utterance may be extracted and combined to determine an intent of the multi-modal device interaction, and a request may then be routed to one or more of the electronic devices based on the determined intent of the multi-modal device interaction. | 08-26-2010 |
20100280829 | Photo Management Using Expression-Based Voice Commands - A system and method are provided for photo management using expression-based voice commands. The method interfaces a photo-image discovery device, having no dedicated display, to a display monitor. Expression-based user voice prompts are received and used to access a photo-image in storage at a storage site. The accessed photo-image is then presented on the display monitor. The photo-image in storage at the storage site can be accessed to perform an operation such as: selecting a storage site, selecting a photo-image, transforming a selected photo-image, converting a file format of a selected photo-image, and selecting a delivery option. In one aspect, a menu of photo-image user prompt options is presented on the display monitor, originating from the photo discovery device, and the expression-based user voice prompts are received in response to the presented menu. | 11-04-2010 |
20100292990 | AUDIO ENABLED CONTROL MENU SYSTEM, METHOD AND DEVICE - An audio enabled control menu system, method and device is provided. Embodiments of the present invention include an encoder including an input device for actuation of the encoder by an operator of the control menu device; memory including a menu structure and a plurality of audio segments stored in the memory; and a microcontroller in operable communication with the encoder and the memory, the microcontroller further configured to receive menu navigation input from the encoder and output one of the plurality of audio segments in response to the menu navigation input, the microcontroller further configured to execute predetermined control actions in response to the menu navigation input. Embodiments of the invention transmit menu options to an operator in an audio format such that the operator can browse and select menu options with one hand and does not need to look at a visually displayed menu. | 11-18-2010 |
20100292991 | METHOD FOR CONTROLLING GAME SYSTEM BY SPEECH AND GAME SYSTEM THEREOF - Embodiments of the present invention provide a method for controlling a game system by speech and a game system thereof. The method includes collecting a speech command, storing the speech command in association with a game command; receiving a speech command from a user during a game, searching for a game command associated with the speech command, and controlling a game system using the game command found. The game system includes a speech collecting module, an associated storage module, a speech command recognizing module and a game controlling module. The present invention can implement control of a game system using speech. | 11-18-2010 |
20100305951 | Methods And Systems For Resolving The Incompatibility Of Media Items Playable From A Vehicle - A system for monitoring hands-free accessibility of media items for play at a vehicle includes a vehicle entertainment computing system (VECS) configured to receive predetermined rules for voice-activated access of the media items. Violations of the rules are detected based on media item metadata. If a violation is detected, a prompt is outputted. Media items are retrieved and played based on voice-activated requests. One embodiment includes a method for monitoring hands-free accessibility of media items for play at a vehicle. A system for formatting media items for accessibility at a VECS includes a media item incompatibility resolution system (MIIRS) configured to resolve violations of the predetermined rules by receiving additional rules relating to formatting violating media items. The media items are searched and the violations addressed by reformatting the media items for voice-activated access. The media items are outputted to the MIIRS. | 12-02-2010 |
20100318366 | Touch Anywhere to Speak - The present invention provides a user interface for providing press-to-talk-interaction via utilization of a touch-anywhere-to-speak module on a mobile computing device. Upon receiving an indication of a touch anywhere on the screen of a touch screen interface, the touch-anywhere-to-speak module activates the listening mechanism of a speech recognition module to accept audible user input and displays dynamic visual feedback of a measured sound level of the received audible input. The touch-anywhere-to-speak module may also provide a user a convenient and more accurate speech recognition experience by utilizing and applying the data relative to a context of the touch (e.g., relative location on the visual interface) in correlation with the spoken audible input. | 12-16-2010 |
20100332234 | Dynamically Extending The Speech Prompts Of A Multimodal Application - Dynamically extending the speech prompts of a multimodal application including receiving, by the prompt generation engine, a media file having a metadata container; retrieving, by the prompt generation engine from the metadata container, a speech prompt related to content stored in the media file for inclusion in the multimodal application; and modifying, by the prompt generation engine, the multimodal application to include the speech prompt. | 12-30-2010 |
20100332235 | INTELLIGENT HOME AUTOMATION - An intelligent home automation system answers questions of a user speaking “natural language” located in a home. The system is connected to, and may carry out the user's commands to control, any circuit, object, or system in the home. The system can answer questions by accessing the Internet. Using a transducer that “hears” human pulses, the system may be able to identify, announce and keep track of anyone entering or staying in the home or participating in a conversation, including announcing their identity in advance. The system may interrupt a conversation to implement specific commands and resume the conversation after implementation. The system may have extensible memory structures for term, phrase, relation and knowledge, question answering routines and a parser analyzer that uses transformational grammar and a modified three hypothesis analysis. The parser analyzer can be dormant unless spoken to. The system has emergency modes for prioritization of commands. | 12-30-2010 |
20100332236 | VOICE-TRIGGERED OPERATION OF ELECTRONIC DEVICES - A system and method for operating features of telecommunications, audio headsets, speakers, and other communications and electronic devices, such as mobile telephones, personal digital assistants and cameras, using voice-activated, voice-triggered or voice-enabled operation. In accordance with an embodiment, the electronic device is capable of operating in an idle mode, in which the device listens for verbal commands from a user. When the user speaks or otherwise issues a command, the device recognizes the command and responds accordingly, including, depending on the context in which the command is issued, following a series of prompts to guide the user through operating one or more features of the device, such as accessing menus or other features. In accordance with an embodiment, this allows the user to operate the device in a hands-free mode if desired. | 12-30-2010 |
20110004477 | Facility for Processing Verbal Feedback and Updating Digital Video Recorder(DVR) Recording Patterns - A method, a system and a computer program product for using speech/voice recognition technology to update digital video recorder (DVR) program recording patterns, based on program viewer/listener feedback. A speech controlled pattern modification (SCPM) utility utilizes a DVR recording sub-system integrated with speech processing functionality to compare control phrases with phrases uttered by a viewer. If a control phrase matches a phrase uttered by the viewer, the SCPM utility modifies the DVR recording patterns, according to a set of pre-programmed governing rules. For example, the SCPM utility may avoid modifying the recording patterns for programs within a list of “favorite” programs but may modify the recording patterns for programs excluded from the list. The SCPM utility determines priority of the uttered phrases by identifying users and retrieving a preset priority level of the identified users. The priority level is then used to control changes to the recording patterns. | 01-06-2011 |
20110010180 | Speech Enabled Media Sharing In A Multimodal Application - Speech enabled media sharing in a multimodal application including parsing, by a multimodal browser, one or more markup documents of a multimodal application; identifying, by the multimodal browser, in the one or more markup documents a web resource for display in the multimodal browser; loading, by the multimodal browser, a web resource sharing grammar that includes keywords for modes of resource sharing and keywords for targets for receipt of web resources; receiving, by the multimodal browser, an utterance matching a keyword for the web resource, a keyword for a mode of resource sharing and a keyword for a target for receipt of the web resource in the web resource sharing grammar thereby identifying the web resource, a mode of resource sharing, and a target for receipt of the web resource; and sending, by the multimodal browser, the web resource to the identified target for the web resource using the identified mode of resource sharing. | 01-13-2011 |
20110015932 | METHOD FOR SONG SEARCHING BY VOICE - The present invention relates to a method for song searching by voice, in particular a method in which users complete settings and then start searching, so that the users' spoken search conditions are acquired for voice recognition, and the recognition results are compared with the instruction data and song attribute data in the voice recognition database to obtain comparison data. If the comparison data do not correspond with the preset conditions, the next search condition generated from the comparison data is broadcast by voice, and the users are allowed to speak the next search condition so that search conditions can be compared in the next process. If the comparison data correspond with the preset conditions, one or more song files are read according to the comparison data and previewed. With this method, the users will not touch buttons or knobs by mistake, do not need to spend time searching for song files one by one, and do not need to free one or both of their hands to press the buttons or knobs. In addition, the users can decide on such matters as search conditions, initial position of previews, whether to play immediately after choices are made, preview period, sequential or shuffle play, etc., thus promoting convenience for users in searching for songs and meeting the preferences and needs of different users. | 01-20-2011 |
20110022396 | METHOD, SYSTEM AND USER INTERFACE FOR AUTOMATICALLY CREATING AN ATMOSPHERE, PARTICULARLY A LIGHTING ATMOSPHERE, BASED ON A KEYWORD INPUT - The invention relates to the automatic creation of an atmosphere, particularly a lighting atmosphere, based on a keyword input such as a keyword typed or spoken by a user. A basic idea of the invention is to enable a user of an atmosphere creation system such as a lighting system to automatically create a specific atmosphere by simply using a keyword which is input to the system. The keyword, for example “eat”, “read”, “relax”, “sunny”, “cool”, “party”, “Christmas”, “beach”, may be spoken or typed by the user and may enable the user to find and explore numerous atmospheres in an interactive and playful way in embodiments of the invention. Finding atmosphere elements related to the keyword may be done in various ways according to embodiments of the invention. The invention also allows a non-expert in designing or creating atmosphere scenes to control the creation of a desired atmosphere in an atmosphere creation system. | 01-27-2011 |
20110029315 | VOICE DIRECTED SYSTEM AND METHOD FOR MESSAGING TO MULTIPLE RECIPIENTS - A method for sending messages in a voice-enabled system and a voice-enabled system to communicate a message are provided. The method comprises generating a message with a message generating device, analyzing the message to determine a voice-enabled device to send the message, and determining whether the voice-enabled device is available to receive the message. The method further comprises sending the message to the voice-enabled device in response to determining that the voice-enabled device is available to receive the message and, in response to determining that the voice-enabled device is not available, escalating the message based on an escalation protocol. | 02-03-2011 |
20110029316 | SPEECH RECOGNITION SYSTEM AND METHOD - According to the present invention, a method for integrating processes with a multi-faceted human centered interface is provided. The interface is facilitated to implement a hands free, voice driven environment to control processes and applications. A natural language model is used to parse voice initiated commands and data, and to route those voice initiated inputs to the required applications or processes. The use of an intelligent context based parser allows the system to intelligently determine what processes are required to complete a task which is initiated using natural language. A single window environment provides an interface which is comfortable to the user by preventing distracting windows from appearing. The single window has a plurality of facets which allow distinct viewing areas. Each facet has an independent process routing its outputs thereto. As other processes are activated, each facet can reshape itself to bring a new process into one of the viewing areas. All activated processes are executed simultaneously to provide true multitasking. | 02-03-2011 |
20110046962 | VOICE TRIGGERING CONTROL DEVICE AND METHOD THEREOF - A voice triggering control device for enabling a data collection host on which it is assembled comprises a processing unit, a speaker, a control module, a power supply module and a housing containing these elements. The control device controls the processing unit to output a high-frequency audio signal corresponding to an act command. The speaker then broadcasts high-frequency audio generated from the high-frequency audio signal, and the data collection host is enabled to perform the act command upon receiving and decoding the high-frequency audio. Using high-frequency audio to make the data collection host perform a functional action can thereby solve the contact fault problem in the prior art. | 02-24-2011 |
20110054907 | AUDIO INTERFACE UNIT FOR SUPPORTING NETWORK SERVICES - Techniques for providing network services at an audio interface unit include determining, based on spoken sounds of a user of an apparatus received at a microphone of the apparatus, whether to present audio data received from a different apparatus. If it is determined to present the received audio data, then presentation of the received audio data at a speaker of the apparatus is initiated. In some embodiments, an apparatus includes a data communications bus; and logic encoded in one or more tangible media configured to perform the above steps. In some embodiments, the apparatus does not include a visual display and does not include a keypad of multiple buttons. | 03-03-2011 |
20110054908 | IMAGE PROCESSING SYSTEM, IMAGE PROCESSING APPARATUS AND INFORMATION PROCESSING APPARATUS - An image processing system includes an information processing apparatus and an image processing apparatus connected to each other via a network. The information processing apparatus has an application installed thereon to give a new function to the image processing apparatus. The image processing apparatus transmits to the information processing apparatus, voice data obtained by a microphone of the image processing apparatus and data set via an operation screen customized according to the application. The information processing apparatus determines answer information indicating an action to be taken by the image processing apparatus, based on the received voice data, a dictionary owned by the application and the data set via the operation screen, and then transmits the determined answer information to the image processing apparatus. The image processing apparatus takes an action according to the answer information received therefrom. | 03-03-2011 |
20110054909 | LOCALIZING THE POSITION OF A SOURCE OF A VOICE SIGNAL - The invention relates to localizing the position of a person speaking by using pictures of a pattern ( | 03-03-2011 |
20110060592 | IPTV SYSTEM AND SERVICE METHOD USING VOICE INTERFACE - Provided is an IPTV system using voice interface which includes a voice input device, a voice processing device, a query processing and content search device, and a content providing device. The voice processing device performs voice recognition to convert voice into a text. The voice processing device includes a voice preprocessing unit, a sound model database, a language model database, and a decoder. The voice preprocessing unit performs preprocessing which includes improving the quality of sound or removing noise for the received voice, and extracts a feature vector. The decoder converts the feature vector into a text by using a sound model and a language model. Moreover, the voice processing device stores the profile and preference of a user to provide personalized service. Because the result of voice recognition is updated in a sound model database and a user profile database each time service for a user is provided, the performance of voice recognition and the performance of personalized service can continuously be improved. | 03-10-2011 |
20110077947 | CONFERENCE BRIDGE SOFTWARE AGENTS - Systems and methods are provided to generate a software agent that is initiated to continue the business process flow during a conference. Upon initiating a teleconference in response to a selection associated with the business process or predefined rule associated with the business process that requires a conference, an instance of a software agent is instantiated and associated with the teleconference. The software agent may be a sub-process of the conference bridge that conducts the teleconference or a separate process that interacts with the conference bridge as another party to the teleconference. The software agent is initiated with information about the business process step that requires an action or a decision. During the teleconference, the software agent listens for a command from one of the parties and acts on any command given. The commands can send another event or action back to a business process application to continue or complete the business process. The event or action sent back may be based on the commands or a result of an action on the business process step that initiated the conference. | 03-31-2011 |
20110077948 | METHOD AND SYSTEM FOR CONTAINMENT OF USAGE OF LANGUAGE INTERFACES - Client software is modified by a translator to use a unique variant of a linguistic interface of a service. An interceptor pre-processes subsequent client service requests from the translated unique linguistic interface to the standard linguistic interface implemented by the service. Usage of the linguistic interfaces of the service is contained, rendering the service incapable of executing arbitrary input, even if such input is crafted specifically for the service interface. | 03-31-2011 |
20110099017 | System and method for interactive communication with a media device user such as a television viewer - A personalized television or internet video viewing environment, where the user can respond to messages. Messages are received over the internet and overlaid onto the video program. A light and vibrator on the remote control alert the viewer to respond by speaking into a microphone in the remote control unit. Voice recognition techniques are used to interpret the user's response, and biometric voice analysis can be used to identify the user. Successive interactions can be related and tailored to the particular user. | 04-28-2011 |
20110119062 | Voice-recognition/voice-activated vehicle signal system - A control system is operable within a host vehicle to control the operation of signaling apparatus indicative of a driver intent to execute right, left or U-turn actions. The control system includes a voice recognition circuit for activating turn signal devices within the vehicle. In some embodiments, a wireless link facilitates aftermarket applications while in other embodiments original equipment manufacture is accommodated. | 05-19-2011 |
20110119063 | REMOTE NOTIFICATION SYSTEM AND METHOD AND INTELLIGENT AGENT THEREFOR - The invention relates to remote access systems and methods using automatic speech recognition to access a computer system. The invention also relates to an intelligent agent resident on the computer system for facilitating remote access to, and receipt of, information on the computer system through speech recognition or text-to-speech read-back. The remote access systems and methods can be used by a user of the computer system while traveling. The user can dial into a server system which is configured to interact with the user by automatic speech recognition and text-to-speech conversion. The server system establishes a connection to an intelligent agent running on the user's remotely located computer system by packet communication over a public network. The intelligent agent sources information on the user's computer system or a network accessible to the computer system, processes the information and transmits it to the server system over the public network. The server system converts the information into speech signals and transmits the speech signals to a telephone operated by the user. | 05-19-2011 |
20110125503 | METHODS AND SYSTEMS FOR UTILIZING VOICE COMMANDS ONBOARD AN AIRCRAFT - Methods and systems are provided for utilizing audio commands onboard an aircraft. A method comprises identifying a flight phase for the aircraft, resulting in an identified flight phase, receiving an audio input, resulting in received audio input, filtering the received audio input in a manner that is influenced by the identified flight phase for the aircraft, resulting in filtered audio input, and validating the filtered audio input as a first voice command of a first plurality of possible voice commands. | 05-26-2011 |
20110125504 | MOBILE DEVICE AND METHOD AND COMPUTER-READABLE MEDIUM CONTROLLING SAME - A mobile device moves by calculating a distance between a sound source and the mobile device using a sound source direction estimation technique. The mobile device moves by a reference distance in a direction perpendicular to a direction in which the mobile device faces the sound source when a call sound of the sound source is generated, outputs voice to instruct the sound source to generate a recall sound, checks a directional angle of the mobile device when the recall sound is generated by the sound source, calculates the distance between the sound source and the mobile device according to the reference distance and the directional angle of the mobile device, and moves to the vicinity of the sound source. | 05-26-2011 |
20110137657 | KITCHEN AND/OR DOMESTIC APPLIANCE - The invention relates to a kitchen and/or domestic appliance comprising input means, which are connected to a voice-recognition system, for acoustic operator commands. The invention is characterised in that means for executing command-dependent actions are provided and that the voice-recognition system is used to identify and check the authorisation of a user. | 06-09-2011 |
20110145000 | Apparatus, System and Method for Voice Dialogue Activation and/or Conduct - An apparatus, a system and a method for voice dialogue activation and/or conduct. The apparatus for voice dialogue activation and/or conduct has a voice recognition unit, a speaker recognition unit and a decision-maker unit. The decision-maker unit is designed to activate a result action on the basis of results from the voice and speaker recognition units. | 06-16-2011 |
20110153332 | Device and Method for Booting Handheld Apparatus by Voice Control - A device for booting a handheld apparatus by voice control includes a base, a power-on device, a trigger switch, and an acoustic sensor. Upon the handheld apparatus being placed at the base to trigger the trigger switch, the trigger switch controls the power-on device to power on the handheld apparatus. After the handheld apparatus is powered on, the acoustic sensor detects a sound of the handheld apparatus and then controls a pressure head of the power-on device to move away. The device and its method for booting a handheld apparatus by voice control come with the advantages of a simple and easy operation and a high efficiency. | 06-23-2011 |
20110166863 | RELEASE OF TRANSACTION DATA - For clearing transaction data selected for a processing, there is generated in a portable data carrier ( | 07-07-2011 |
20110178804 | VOICE RECOGNITION DEVICE - A voice recognition device includes a voice input unit | 07-21-2011 |
20110184740 | Integration of Embedded and Network Speech Recognizers - A method, computer program product, and system are provided for performing a voice command on a client device. The method can include translating, using a first speech recognizer located on the client device, an audio stream of a voice command to a first machine-readable voice command and generating a first query result using the first machine-readable voice command to query a client database. In addition, the audio stream can be transmitted to a remote server device that translates the audio stream to a second machine-readable voice command using a second speech recognizer. Further, the method can include receiving a second query result from the remote server device, where the second query result is generated by the remote server device using the second machine-readable voice command and displaying the first query result and the second query result on the client device. | 07-28-2011 |
20110191109 | METHOD OF CONTROLLING A SYSTEM AND SIGNAL PROCESSING SYSTEM - A method of controlling a system which includes the steps of obtaining at least one signal representative of information communicated by a user via an input device in an environment of the user, wherein a signal from a first source is available in a perceptible form in the environment; estimating at least a point in time when a transition between information flowing from the first source and information flowing from the user is expected to occur; and timing the performance of a function by the system in relation to the estimated time. | 08-04-2011 |
20110196683 | System, Method And Computer Program Product For Adding Voice Activation And Voice Control To A Media Player - A media player system, method and computer program product are provided. In use, an utterance is received. A command for a media player is then generated based on the utterance. Such command is utilized for providing wireless control of the media player. | 08-11-2011 |
20110202351 | Audio system and method for coordinating tasks - A system includes a hands free mobile communication device. Software stored on a machine readable storage device is executed to cause the hands free mobile communication device to communicate audibly with a field operator performing field operations. The operator receives instructions regarding operations to be performed. Oral communications are received from the operator and are processed automatically to provide further instructions in response to the received oral communications. | 08-18-2011 |
20110246204 | IMAGE DISPLAY DEVICE AND METHOD THEREOF - An image display device includes a display unit, a storage unit, a voice receiving unit and a processing unit. The storage unit stores a plurality of image data, a plurality of voice data and a plurality of image files, wherein each of the image data is corresponding to one of the voice data respectively. The voice receiving unit receives a current voice. The processing unit judges whether the current voice is similar to one of the voice data, so as to determine one image data corresponding to the current voice. When the current voice is similar to one of the voice data, the processing unit determines whether each of the image files contains the image data corresponding to the current voice and then displays the image file(s), which contain the image data corresponding to the current voice, on the display unit. | 10-06-2011 |
20110270615 | GLOBAL SPEECH USER INTERFACE - A global speech user interface (GSUI) comprises an input system to receive a user's spoken command, a feedback system along with a set of feedback overlays to give the user information on the progress of his spoken requests, a set of visual cues on the television screen to help the user understand what he can say, a help system, and a model for navigation among applications. The interface is extensible to make it easy to add new applications. | 11-03-2011 |
20110276335 | METHODS FOR SYNCHRONOUS AND ASYNCHRONOUS VOICE-ENABLED CONTENT SELECTION AND CONTENT SYNCHRONIZATION FOR A MOBILE OR FIXED MULTIMEDIA STATION - A system is provided for enabling voice-enabled selection and execution for playback of media files stored on a media content playback device. The system includes a voice input circuitry and speech recognition module for enabling voice input recognizable on the device as one or more voice commands for task performance; a push-to-talk interface for activating the voice input circuitry and speech recognition module; and a media content synchronization device for maintaining synchronization between stored media content selections and at least one list of grammar sets used for speech recognition by the speech recognition module, the names identifying one or more media content selections currently stored and available for playback on the media content playback device. | 11-10-2011 |
20110282673 | INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND PROGRAM - Provided is an information processing apparatus including: a voice analysis unit which performs an analysis process for a user speech; and a data processing unit which is input with analysis results of the voice analysis unit to determine a process which is to be performed by the information processing apparatus, wherein in the case where a factor of inhibiting process continuation occurs in a process based on the user speech, the data processing unit performs a process of generating and outputting feedback information corresponding to a process stage in which the factor of inhibiting occurs. | 11-17-2011 |
20110288871 | INFORMATION PRESENTATION SYSTEM - In an in-vehicle navigation apparatus, an in-vehicle BT communications device receives a speech recognition result via a BT communications link from a cellular phone. Based on the received speech recognition result, an in-vehicle control circuit outputs a talk-back sound about the speech recognition result via an in-vehicle sound output device. | 11-24-2011 |
20110301958 | System-Initiated Speech Interaction - Whenever an event occurs on a computing system which will accept a response from a user of the system, the system automatically determines whether or not to enable speech interaction with the system for the event response. Whenever speech interaction is enabled with the system for the event response, the system provides a notification to the user which informs the user of the event and their options for responding thereto, where these options include responding verbally. Whenever the user responds within a prescribed period of time via a voice command (VC), the system attempts to recognize the VC. Whenever the VC is successfully recognized, the system responds appropriately to the VC. | 12-08-2011 |
20110301959 | VOICE ACQUISITION SYSTEM FOR A VEHICLE - A voice acquisition system for a vehicle includes an interior rearview mirror assembly attached at an inner portion of the windshield of a vehicle equipped with the interior rearview mirror assembly. The interior rearview mirror assembly includes at least two microphones for receiving audio signals within a cabin of the vehicle and generating an output indicative of the audio signals. A control is in the vehicle and is responsive to the output from the at least two microphones. The control at least partially distinguishes vocal signals from non-vocal signals present in the output. The at least two microphones provide sound capture for at least one of a hands free cell phone system, an audio recording system and a wireless communication system. | 12-08-2011 |
20110307260 | MULTI-MODAL GENDER RECOGNITION - Gender recognition is performed using two or more modalities. For example, depth image data and one or more types of data other than depth image data is received. The data pertains to a person. The different types of data are fused together to automatically determine gender of the person. A computing system can subsequently interact with the person based on the determination of gender. | 12-15-2011 |
20110313774 | Methods, Systems, and Products for Measuring Health - Methods, systems, and products measure health data related to a user. A spoken phrase is received and time-stamped. The user is identified from the spoken phrase. A window of time is determined from a semantic content of the spoken phrase. A sensor measurement is received and time-stamped. A difference in time between the time-stamped spoken phrase and the time-stamped sensor measurement is determined and compared to the window of time. When the difference in time is within the window of time, then the sensor measurement is associated with the user. | 12-22-2011 |
20110313775 | Television Remote Control Data Transfer - A computer-implemented method for information sharing between a portable computing device and a television system includes receiving a spoken input from a user of the portable computing device, by the portable computing device, submitting a digital recording of the spoken query from the portable computing device to a remote server system, receiving from the remote server system a textual representation of the spoken query, and automatically transmitting the textual representation from the portable computing device to the television system. The television system is programmed to submit the textual representation as a search query and to present to the user media-related results that are determined to be responsive to the spoken query. | 12-22-2011 |
20110313776 | System and Method for Controlling Devices that are Connected to a Network - A system, method and computer-readable medium for controlling devices connected to a network. The method includes receiving an utterance from a user for remotely controlling a device in a network; converting the received utterance to text using an automatic speech recognition module; accessing a user profile in the network that governs access to a plurality of devices on the network and identifiers which control a conversion of the text to a device specific control language; identifying based on the text a device to be controlled; converting at least a portion of the text to the device control language; and transmitting the device control language to the identified device, wherein the identified device implements a function based on the transmitted device control language. | 12-22-2011 |
20120010890 | POWER-OPTIMIZED WIRELESS COMMUNICATIONS DEVICE - The present invention is an Always On, Hands-free, Speech Activated, Power-optimized Wireless Communications Device with associated base. The unique value of the device is that a person can use the device at any time, | 01-12-2012 |
20120016678 | Intelligent Automated Assistant - An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone, and the like, or any combination thereof. In one embodiment, the system is based on sets of interrelated domains and tasks, and employs additional functionality powered by external services with which the system can interact. | 01-19-2012 |
20120022873 | Speech Recognition Language Models - Methods, computer program products and systems are described for forming a speech recognition language model. Multiple query-website relationships are determined by identifying websites that are determined to be relevant to queries using one or more search engines. Clusters are identified in the query-website relationships by connecting common queries and connecting common websites. A speech recognition language model is created for a particular website based on at least one of analyzing queries in a cluster that includes the website or analyzing webpage content of web pages in the cluster that includes the website. | 01-26-2012 |
20120022874 | DISAMBIGUATION OF CONTACT INFORMATION USING HISTORICAL DATA - Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for disambiguating contact information. A method includes receiving an audio signal, generating an affinity score based on a frequency with which a user has previously communicated with a contact associated with an item of contact information, and further based on a recency of one or more past interactions between the user and the contact associated with the item of contact information, inferring a probability that the user intends to initiate a communication using the item of contact information based on the affinity score generated for the item of contact information, and generating a communication initiation grammar. | 01-26-2012 |
20120022875 | SYNCHRONIZING VISUAL AND SPEECH EVENTS IN A MULTIMODAL APPLICATION - Exemplary methods, systems, and products are disclosed for synchronizing visual and speech events in a multimodal application, including receiving from a user speech; determining a semantic interpretation of the speech; calling a global application update handler; identifying, by the global application update handler, an additional processing function in dependence upon the semantic interpretation; and executing the additional function. Typical embodiments may include updating a visual element after executing the additional function. Typical embodiments may include updating a voice form after executing the additional function. Typical embodiments also may include updating a state table after updating the voice form. Typical embodiments also may include restarting the voice form after executing the additional function. | 01-26-2012 |
20120022876 | Voice Actions on Computing Devices - A computer-implemented method includes receiving spoken input at a computing device from a user of the computing device, the spoken input including a carrier phrase and a subject to which the carrier phrase is directed, providing at least a portion of the spoken input to a server system in audio form for speech-to-text conversion by the server system, the portion including the subject to which the carrier phrase is directed, receiving from the server system instructions for automatically performing an operation on the computing device, the operation including an action defined by the carrier phrase using parameters defined by the subject, and automatically performing the operation on the computing device. | 01-26-2012 |
20120029921 | SPEECH RECOGNITION SYSTEM AND METHOD - According to the present invention, a method for integrating processes with a multi-faceted human centered interface is provided. The interface is facilitated to implement a hands free, voice driven environment to control processes and applications. A natural language model is used to parse voice initiated commands and data, and to route those voice initiated inputs to the required applications or processes. The use of an intelligent context based parser allows the system to intelligently determine what processes are required to complete a task which is initiated using natural language. A single window environment provides an interface which is comfortable to the user by preventing the occurrence of distracting windows from appearing. The single window has a plurality of facets which allow distinct viewing areas. Each facet has an independent process routing its outputs thereto. As other processes are activated, each facet can reshape itself to bring a new process into one of the viewing areas. All activated processes are executed simultaneously to provide true multitasking. | 02-02-2012 |
20120029922 | METHOD OF ACCESSING A DIAL-UP SERVICE - A method of accessing a dial-up service is disclosed. An example method of providing access to a service includes receiving a first speech signal from a user to form a first utterance; recognizing the first utterance using speaker independent speaker recognition; requesting the user to enter a personal identification number; and when the personal identification number is valid, receiving a second speech signal to form a second utterance and providing access to the service. | 02-02-2012 |
20120035935 | APPARATUS AND METHOD FOR RECOGNIZING VOICE COMMAND - An apparatus and method for recognizing a voice command for use in an interactive voice user interface are provided. The apparatus includes a command intention belief generation unit that is configured to recognize a first voice command and that may generate one or more command intention beliefs for the first voice command. The apparatus also includes a command intention belief update unit that is configured to update each of the command intention beliefs based on a system response to the first voice command and a second voice command. The apparatus also includes a command intention belief selection unit that is configured to select one of the updated command intention beliefs for the first voice command. The apparatus also includes an operation signal output unit that is configured to select a final command intention from the selected updated command intention belief and to output an operation signal based on the selected final command intention. | 02-09-2012 |
20120041766 | VOICE-CONTROLLED NAVIGATION DEVICE AND METHOD - In a voice-controlled navigation device and method, a voice command is received, and divided into voice segments Vi (i=1˜n) by comparing with one or more keywords. A voice segment Vi (i=1˜n) is obtained in sequence to be compared with tree nodes in a search tree of place names. A weight value of each tree node is computed according to a comparison, to select one or more tree nodes whose weight values are greater than a predetermined value. Routes formed by all the selected tree nodes are obtained to select a route whose total weight value is the greatest. A navigation to a destination is given by indicating the selected route on an electronic map according to place names represented by the tree nodes of the selected route. | 02-16-2012 |
20120046952 | REMOTE CONTROL SYSTEM AND METHOD - A remote control system includes a receiving and recognition module, a converting module, and a control interface module. The receiving and recognition module is used for receiving a signal from a user and recognizing the signal as a user command associated with an electronic device. The converting module is used for converting the user command into a control command identifiable by the electronic device. The control interface module is used for sending the control command to the electronic device to control the electronic device. | 02-23-2012 |
20120046953 | ESTABLISHING A MULTIMODAL PERSONALITY FOR A MULTIMODAL APPLICATION - Methods, apparatus, and computer program products are described for establishing a multimodal personality for a multimodal application that include selecting, by the multimodal application, matching vocal and visual demeanors and incorporating, by the multimodal application, the matching vocal and visual demeanors as a multimodal personality into the multimodal application. | 02-23-2012 |
20120130719 | REMOTE CONTROL SIGNALING USING AUDIO WATERMARKS - A system for using a watermark embedded in an audio signal to remotely control a device. Various devices such as toys, computers, and appliances, equipped with an appropriate detector, detect the hidden signals, which can trigger an action, or change a state of the device. The watermarks can be used with a “time gate” device, where detection of the watermark opens a time interval within which a user is allowed to perform an action, such as pressing a button, typing in an answer, turning a key in a lock, etc. | 05-24-2012 |
20120136666 | AUTOMATED PERSONAL ASSISTANCE SYSTEM - An automated personal assistance system employing artificial intelligence technology that includes speech recognition and synthesis, situational awareness, pattern and behavioral recognition, and the ability to learn from the environment. Embodiments of the system include environmental and occupant sensors and environmental actuators interfaced to an assistance controller having the artificial intelligence technology incorporated therein to control the environment of the system. An embodiment of the invention is implemented as a vehicle which reacts to voice command for movement and operation of the vehicle and detects objects, obstructions, and distances. This invention provides the ability to monitor for the safety of operation and modify dangerous maneuvers as well as to learn locations in the environment and to automatically find its way to them. The system may also incorporate communication capability to convey patterns of environmental and occupant parameters to a monitoring center. | 05-31-2012 |
20120136667 | VOICE ASSISTANT SYSTEM - Methods and apparatuses to assist a user in the performance of a plurality of tasks are provided. The invention includes storing at least one care plan for a resident, the care plan defining a plurality of tasks to be performed for providing care to the resident. The invention also includes capturing speech inputs from the user and providing speech outputs to the user to provide a speech dialog with the user reflective of the care plan. Information is captured with a contactless communication interface and is used for engaging the care plan. | 05-31-2012 |
20120136668 | ELEVATOR CONTROL DEVICE - An elevator control device makes call registration by voice recognition by using a microphone outputting a user's voice as a sound signal and includes: an indicator controller that causes an image which specifies one of objects selectable as a call registration object to be displayed; a storage which stores, in advance, the sound signal of a predetermined voice, which is used for designating the call registration object, as a registered sound signal; and a voice recognizing mechanism that compares the sound signal delivered from the microphone with the registered sound signal, and delivers a control signal if these sound signals coincide with each other. The indicator controller outputs registration information, in which the object specified by the image displayed on the indicator is the call registration object, when receiving the control signal sent from the voice recognizing mechanism. | 05-31-2012 |
20120150546 | APPLICATION STARTING SYSTEM AND METHOD - A computing device and method starts applications via voice commands. The computing device records a sound input by a microphone of the computing device and sends the recorded sound input to a sound sensor of the computing device. Furthermore, the computing device reads a voice command by an embedded controller of the computing device from the sound sensor, in response to a determination that the recorded sound input matches a predetermined verbal statement of the voice command. The computing device notifies an operating system of the computing device to start the application corresponding to the voice command. | 06-14-2012 |
20120158407 | VOICE CONTROL SYSTEM FOR AN IMPLANT - A system for the control of an implant ( | 06-21-2012 |
20120166203 | System and Method for Mobile Workflow Processing - A system and method of wirelessly serving a work flow protocol to agents for use with respect to subjects. The agents wear headsets, each with a display and a microphone coupled to a portable controller. The work flow protocol causes presentation of queries through the headsets based on a logical tree structure. Data generated by speech of the agents is received and stored. | 06-28-2012 |
20120166204 | Navigation System and Radio Receiving System - An object of the invention is also a navigation system having an input device for the input of an input scale value, having a display device for displaying road map information according to a selected display scale value and having a processor device, wherein the number of enterable input scale values is larger than the number of the selectable display scale values. | 06-28-2012 |
20120173244 | APPARATUS AND METHOD FOR VOICE COMMAND RECOGNITION BASED ON A COMBINATION OF DIALOG MODELS - Provided are a voice command recognition apparatus and method capable of figuring out the intention of a voice command input through a voice dialog interface, by combining a rule based dialog model and a statistical dialog model rule. The voice command recognition apparatus includes a command intention determining unit configured to correct an error in recognizing a voice command of a user, and an application processing unit configured to check whether the final command intention determined in the command intention determining unit comprises the input factors for execution of an application. | 07-05-2012 |
20120173245 | NAVIGATION SYSTEM - A navigation system is provided which facilitates discrimination between an icon of a facility associated with a route, along which the user is expected to move from now on, and an ordinary icon. To achieve this, it includes a destination estimating unit for acquiring information about a driving history and for estimating a destination from the information about the driving history acquired; a drawing decision changing unit for drawing a destination candidate estimated by the destination estimating unit in a form different from an icon of a non-destination candidate; and an information display unit for causing the icon drawn by the drawing decision changing unit to be displayed. | 07-05-2012 |
20120179472 | ELECTRONIC DEVICE CONTROLLED BY A MOTION AND CONTROLLING METHOD THEREOF - An electronic device is provided. The electronic device includes a motion recognition unit which recognizes motion of an object and a control unit which, if a push motion in which the object located in front of the electronic device is moved in a direction of the electronic device is sensed by the motion recognition unit, activates a motion recognition mode, tracks the motion of the object, and performs a control operation of the electronic device corresponding to a subsequent motion of the object. The control unit may inactivate the motion recognition mode if an end motion in which the motion of the object is in a direction to contact a body part of a user or an additional object is recognized by the motion recognition unit while the motion recognition mode is activated. | 07-12-2012 |
20120179473 | SPEECH INTERACTIVE APPARATUS AND COMPUTER PROGRAM PRODUCT - According to an embodiment, a speech interactive apparatus includes an output unit to output a first response; a receiving unit to receive a start instruction of a speech input as a reply to the first response; a response control unit to stop the output of the first response when the start instruction is received while the first response is being output; and a deciding unit to decide on a first determination period, which is used in determining whether a silent state has occurred, based on whether the start instruction is received while the first response is being output or based on the timing of receiving the start instruction. When the input speech is not input during a period starting from the reception of the start instruction till an elapse of the first determination period, the response control unit instructs the output unit to output the first response again. | 07-12-2012 |
20120191461 | Method and Apparatus for Voice Controlled Operation of a Media Player - A system and methods for voice controlled operation of a media player are provided. In one embodiment, a method includes detecting user positioning of a microphone power switch to an off position, detecting user positioning of the microphone power switch to an on position within a predetermined period of time and entering a voice recognition mode, by the media player, based on the user positioning of the microphone power switch to the on position within the predetermined period of time. The method may further include detecting one or more output signals of the microphone, detecting a voice command based on the one or more output signals of the microphone, and controlling operation of the media player based on the voice command, wherein the media player outputs a graphical display associated with the voice command. | 07-26-2012 |
20120197647 | COMPUTERIZED INFORMATION PRESENTATION APPARATUS - A computerized information system and computer readable apparatus. In one embodiment, the apparatus is configured for use in a transport apparatus and comprises a computer readable medium having at least one computer program disposed thereon, the at least one program being configured to provide the user with requested information (such as for example directions to a desired business or other entity). At least a portion of the information is obtained via a wireless link with a remote server. | 08-02-2012 |
20120203559 | ACTIVATING FUNCTIONS IN PROCESSING DEVICES USING START CODES EMBEDDED IN AUDIO - Apparatus, system and method for performing an action such as accessing supplementary data and/or executing software on a device capable of receiving multimedia are disclosed. After multimedia is received, a monitoring code is detected and a signature is extracted in response thereto from an audio portion of the multimedia. The ancillary code includes a plurality of code symbols arranged in a plurality of layers in a predetermined time period, and the signature is extracted from features of the audio of the multimedia. Supplementary data is accessed and/or software is executed using the detected code and/or signature. | 08-09-2012 |
20120215543 | Adding Speech Capabilities to Existing Computer Applications with Complex Graphical User Interfaces - At design time of a graphical user interface (GUI), a software component (VUIcontroller) is added to the GUI. At run time of the GUI, the VUIcontroller analyzes the GUI from within a process that executes the GUI. From this analysis, the VUIcontroller automatically generates a voice command set, such as a speech-recognition grammar, that corresponds to controls of the GUI. The generated voice command set is made available to a speech recognition engine, thereby speech-enabling the GUI. Optionally, a GUI designer may add properties to ones of the GUI controls at GUI design time, without necessarily writing a voice command set. These properties, if specified, are then used at GUI run time to control or influence the analysis of the GUI and the automatic generation of the voice command set. | 08-23-2012 |
20120215544 | COMPUTERIZED INFORMATION PRESENTATION APPARATUS - A computerized information apparatus useful for providing directions and other information to a user. In one embodiment, the apparatus comprises a processor and network interface and computer readable medium having at least one computer program disposed thereon, the at least one program being configured to receive a speech input from the user regarding an organization or entities, and provide a graphic or visual representation of the organization or entity to aid them in finding the organization or entity. At least a portion of the information is obtained via the network interface from a remote server. | 08-23-2012 |
20120215545 | ROBUST VOICE BROWSER SYSTEM AND VOICE ACTIVATED DEVICE CONTROLLER - The present invention relates to a system for acquiring information from sources on a network, such as the Internet. A voice browsing system maintains a database containing a list of information sources, such as web sites, connected to a network. Each of the information sources is assigned a rank number which is listed in the database along with the record for the information source. In response to a speech command received from a user, a network interface system accesses the information source with the highest rank number in order to retrieve information requested by the user. | 08-23-2012 |
20120221341 | MOTOR-VEHICLE VOICE-CONTROL SYSTEM AND MICROPHONE-SELECTING METHOD THEREFOR - A voice-control system for motor vehicles has a plurality of spaced microphones emitting respective microphone signals, and an evaluation unit connected to the microphones. This unit serves for assembling correlation pairs from the signals of two of the microphones, calculating a correlation coefficient for each correlation pair, detecting an energy value for each microphone, detecting a respective delay time of a voice signal between a voice signal source and each of the microphones, and selecting in dependence on current correlation coefficients of the correlation pairs, on the current energy values of the microphones, and on the current delay time of the voice signal to the microphones, that microphone whose signal is optimal as a basis for the operation of the voice-control system. | 08-30-2012 |
20120226502 | TELEVISION APPARATUS AND A REMOTE OPERATION APPARATUS - According to one embodiment, a television apparatus includes a speech input unit, an indication input unit, a speech recognition unit, and a control unit. The speech input unit is configured to input a speech. The indication input unit is configured to input an indication to start speech recognition from a user. The speech recognition unit is configured to recognize the user's speech inputted after the indication is inputted. The control unit is configured to execute an operation command corresponding to a recognition result of the user's speech. The control unit, if a volume of the television apparatus at a timing when the indication is inputted is larger than or equal to a threshold, temporarily sets the volume to a value smaller than the threshold while the speech recognition unit is recognizing. | 09-06-2012 |
20120226503 | INFORMATION PROCESSING APPARATUS AND METHOD - An information processing apparatus comprising an information output unit configured to switch among a plurality of languages at each given time interval while outputting guidance information set in the plurality of languages, a response detection unit configured to detect a response to the guidance information when the guidance information is output while the languages are switched, and a processing language determination unit configured to take the language in which the response to the guidance information is detected as a processing language. | 09-06-2012 |
20120245945 | IN-VEHICLE APPARATUS AND INFORMATION DISPLAY SYSTEM - An in-vehicle apparatus receives an image data representative of a screen image from a portable terminal with a touch panel. The apparatus extracts a text code data from the image data, and identifies a text-code display area in the screen image. The apparatus determines a command text based on a user-uttered voice command. The apparatus identifies a text-code display area as a subject operation area in the screen image of the portable terminal, based on the command text, the text code data extracted from image data, and information on the text-code display area corresponding to the text code data. An area of the screen image of the touch panel corresponding to the text-code display area is identified as the subject operation area, and a signal indicative of the subject operation area identified is transmitted to the portable terminal. | 09-27-2012 |
20120245946 | Reusable Multimodal Application - A method and system are disclosed herein for accepting multimodal inputs and deriving synchronized and processed information. A reusable multimodal application is provided on the mobile device. A user transmits a multimodal command to the multimodal platform via the mobile network. The one or more modes of communication that are inputted are transmitted to the multimodal platform(s) via the mobile network(s) and thereafter synchronized and processed at the multimodal platform. The synchronized and processed information is transmitted to the multimodal application. If required, the user verifies and appropriately modifies the synchronized and processed information. The verified and modified information are transferred from the multimodal application to the visual application. The final result(s) are derived by inputting the verified and modified results into the visual application. | 09-27-2012 |
20120253824 | METHODS AND SYSTEM OF VOICE CONTROL - This invention relates to a system with different modes of operation or performance that integrates all the key components for the control of most domestic services, such as telephone, lighting and audio/video system, through audio inputs such as words or phrases by a user. | 10-04-2012 |
20120253825 | RELEVANCY RECOGNITION FOR CONTEXTUAL QUESTION ANSWERING - Disclosed are systems, methods and computer-readable media for controlling a computing device to provide contextual responses to user inputs. The method comprises receiving a user input, generating a set of features characterizing an association between the user input and a conversation context based on at least a semantic and syntactic analysis of user inputs and system responses, determining with a data-driven machine learning approach whether the user input begins a new topic or is associated with a previous conversation context and if the received question is associated with the existing topic, then generating a response to the user input using information associated with the user input and any previous user input associated with the existing topic. | 10-04-2012 |
20120259641 | METHODS AND APPARATUS FOR INITIATING ACTIONS USING A VOICE-CONTROLLED INTERFACE - Methods and apparatus for initiating an action using a voice-controlled human interface. The interface provides a hands free, voice driven environment to control processes and applications. According to one embodiment, a method comprises electronically receiving first user input, parsing the first user input to determine whether the first user input contains a command activation statement that cues a voice-controlled human interface to enter a command mode in which a second user input comprising a voice signal is processed to identify at least one executable command and, in response to determining that the first user input comprises the command activation statement, identifying the at least one executable command in the second user input. | 10-11-2012 |
20120265538 | VOICE REMOTE CONTROL - A device may include a display and logic. The logic may be configured to receive, from a user, a selection of a first control action associated with an application stored in the device, provide, via the display, a number of choices associated with the first control action, and receive, from the user, a word or a phrase to use as a voice command corresponding to the first control action, wherein the word or phrase is selected from the choices. The logic may also associate the word or phrase with the first control action, receive voice input from the user, identify the voice input as corresponding to the word or phrase, and perform the first control action based on the identified voice input. | 10-18-2012 |
20120271639 | PERMITTING AUTOMATED SPEECH COMMAND DISCOVERY VIA MANUAL EVENT TO COMMAND MAPPING - An input from a manually initiated action within a computing system can be received. The system can be associated with a speech component. The input can be associated with a system function. The function can be an operation within the computing system and can be linked to a function identifier. The identifier can be translated to a command data. The command data can be associated with a command identifier, a command, and an alternative command. The command data can be a speech command registered within the speech component. The command data can be presented within a speech interface responsive to the translating. The speech interface can be associated with the speech component. | 10-25-2012 |
20120271640 | Implicit Association and Polymorphism Driven Human Machine Interaction - A voice based user-system interaction may take advantage of implicit association and/or polymorphism to achieve smooth and effective discoursing between the user and the voice enabled system. This user-system interaction may occur at a local control unit, at a remote server, or both. | 10-25-2012 |
20120271641 | METHOD AND APPARATUS FOR EDUTAINMENT SYSTEM CAPABLE FOR INTERACTION BY INTERLOCKING OTHER DEVICES - An apparatus and method provide interactive edutainment through connection of a smart TV and other devices (e.g., a tablet PC, a smart phone, and a projector). The method includes connecting with a control device, and when at least one main story for interactivity is stored, receiving from a user a selection of the main story to be executed through the control device. The method also includes executing the selected main story, and when a control command is received from the control device, processing the control command. | 10-25-2012 |
20120271642 | ESTABLISHING A MULTIMODAL ADVERTISING PERSONALITY FOR A SPONSOR OF A MULTIMODAL APPLICATION - Establishing a multimodal advertising personality for a sponsor of a multimodal application, including associating one or more vocal demeanors with a sponsor of a multimodal application and presenting a speech portion of the multimodal application for the sponsor using at least one of the vocal demeanors associated with the sponsor. | 10-25-2012 |
20120271643 | INFERRING SWITCHING CONDITIONS FOR SWITCHING BETWEEN MODALITIES IN A SPEECH APPLICATION ENVIRONMENT EXTENDED FOR INTERACTIVE TEXT EXCHANGES - The disclosed solution includes a method for dynamically switching modalities based upon inferred conditions in a dialogue session involving a speech application. The method establishes a dialogue session between a user and the speech application. During the dialogue session, the user interacts using an original modality and a second modality. The speech application interacts using a speech modality only. A set of conditions indicative of interaction problems using the original modality can be inferred. Responsive to the inferring step, the original modality can be changed to the second modality. A modality transition to the second modality can be transparent to the speech application and can occur without interrupting the dialogue session. The original modality and the second modality can be different modalities; one including a text exchange modality and another including a speech modality. | 10-25-2012 |
20120278083 | VOICE CONTROLLED DEVICE AND METHOD - A voice control device includes a storage module, a voice recording module, and a processing module. The storage module stores a number of computerized voice commands. The voice recording module records audio signals of a user. The processing module processes the recorded voice signals to a machine readable command, determines whether the determined machine readable command matches one stored computerized voice command, and controls the device to execute a function according to the machine readable command if the determined machine readable command matches one stored computerized voice command. The processing module stores the determined machine readable command as a history command. The processing module further obtains all the history commands and determines which function the voice controlled device is to do according to the history commands if the determined machine readable command is partially the same as at least two of the stored computerized voice commands. | 11-01-2012 |
20120278084 | METHOD FOR SELECTING ELEMENTS IN TEXTUAL ELECTRONIC LISTS AND FOR OPERATING COMPUTER-IMPLEMENTED PROGRAMS USING NATURAL LANGUAGE COMMANDS - A method for controlling a program by natural language allows a user to efficiently operate a computer-implemented target program through intuitive natural language commands. A list of natural language commands related to the target program is compiled. Each natural language command is stored as an element in an electronic list. Natural language commands generally consist of short sentences comprising at least a predicate (a verb) and an object (a noun). A user can filter the list of natural language commands by entering the initials of a natural language command. The user enters the first character of the first word to be filtered, followed by the first character of the second word to be filtered, and so forth. Filtering by initials very rapidly reduces the number of choices presented to a user and minimizes the number of keystrokes required to select a particular list element. | 11-01-2012 |
20120284031 | METHOD AND DEVICE FOR OPERATING TECHNICAL EQUIPMENT, IN PARTICULAR A MOTOR VEHICLE - A method and device for operating technical equipment, in particular in a motor vehicle. Speech inputs are fed by a speech input unit and manual inputs are fed by means of a manual input unit as operating instructions to a controller by which a command corresponding to the operating instruction is generated and fed to the corresponding technical equipment, which then executes the operating procedure associated with the operating instruction. A basic structure of the command is established by the speech input unit or the manual input unit, and then the basic structure of the command is supplemented by the manual input unit or the speech input unit. | 11-08-2012 |
20120296655 | PREDICTIVE PRE-RECORDING OF AUDIO FOR VOICE INPUT - Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for providing predictive pre-recording of audio for voice input. In one aspect, a method includes obtaining sensor data from one or more sensors of a mobile device while the mobile device is operating in an inactive state, determining that a user of the mobile device is interacting with the mobile device based on the sensor data, invoking voice input functionality of the mobile device in response to determining that the user of the mobile device is interacting with the mobile device, detecting a voice input, and activating the mobile device in response to detecting the voice input. | 11-22-2012 |
20120303373 | ELECTRONIC APPARATUS AND METHOD FOR CONTROLLING THE ELECTRONIC APPARATUS USING VOICE - An electronic apparatus includes a microphone, a processor, a motherboard, and a voice recognition microchip. The voice recognition microchip compares a voice command with a pre-stored voice command. If the voice command is identical with the pre-stored voice command, the processor outputs a control signal to the motherboard. The motherboard controls the electronic apparatus to perform an action corresponding to the control signal. | 11-29-2012 |
20120303374 | APPARATUS AND METHOD FOR TRANSMITTING VIDEO DATA FROM MOBILE COMMUNICATION TERMINAL - A mobile terminal includes an input unit receiving an input; a data storage unit storing data; a communication unit communicating signals; and a controller. The controller is configured to receive a selection input of a video data, the selection input being processed to select the video data among a plurality of video data stored in the data storage unit; temporarily store a selected portion of the video data for transmission based on a start position and a stop position specifying the selected portion in the video data; automatically attach the selected portion of the video data for transmission to a message without receiving any further user input when the selected portion of the video data is specified; transmit the message with the selected portion of the video data; and delete the selected portion of the video data from the data storage unit when the transmission of the message is completed. | 11-29-2012 |
20120316884 | Wheelchair System Having Voice Activated Menu Navigation And Auditory Feedback - A personal mobility vehicle, such as a wheelchair system, includes an input audio transducer having an output coupled to a speech recognition system and an output audio transducer having an input coupled to a speech synthesis system. The wheelchair system further includes a control unit having a data processor and a memory. The data processor is coupled to the speech recognition system and to the speech synthesis system and is operable in response to a recognized utterance made by a user to present the user with a menu containing wheelchair system functions. The data processor is further configured in response to at least one further recognized utterance made by the user to select from the menu at least one wheelchair system function, to activate the selected function and to provide audible feedback to the user via the speech synthesis system. | 12-13-2012 |
20120323580 | EDITING TELECOM WEB APPLICATIONS THROUGH A VOICE INTERFACE - Systems and associated methods for editing telecom web applications through a voice interface are described. Systems and methods provide for editing telecom web applications over a connection, as for example accessed via a standard phone, using speech and/or DTMF inputs. The voice based editing includes exposing an editing interface to a user for a telecom web application that is editable, dynamically generating a voice-based interface for a given user for accomplishing editing tasks, and modifying the telecom web application to reflect the editing commands entered by the user. | 12-20-2012 |
20130006643 | Devices and Methods for Identifying a Prompt Corresponding to a Voice Input in a Sequence of Prompts - This is directed to processing voice inputs received by an electronic device while prompts are provided. In particular, this is directed to providing a sequence of prompts to a user (e.g., voice over prompts) while monitoring for a voice input. When the voice input is received, a characteristic time stamp can be identified for the voice input, and can be compared to periods or windows associated with each of the provided prompts. The electronic device can then determine that the prompt corresponding to a window that includes the characteristic time stamp was the prompt to which the user wished to apply the voice input. The device can process the voice input to extract a user instruction, and apply the instruction to the identified prompt (e.g., and perform an operation associated with the prompt). | 01-03-2013 |
20130013318 | USER INPUT BACK CHANNEL FOR WIRELESS DISPLAYS - As part of a communication session, a wireless source device can transmit audio and video data to a wireless sink device, and the wireless sink device can transmit user input data received at the wireless sink device back to the wireless source device. In this manner, a user of the wireless sink device can control the wireless source device and control the content that is being transmitted from the wireless source device to the wireless sink device. The input data received at the wireless sink device can be a voice command. | 01-10-2013 |
20130013319 | METHODS AND APPARATUS FOR INITIATING ACTIONS USING A VOICE-CONTROLLED INTERFACE - Methods and apparatus for initiating an action using a voice-controlled human interface. The interface provides a hands free, voice driven environment to control processes and applications. According to one embodiment, a method comprises electronically receiving first user input, parsing the first user input to determine whether the first user input contains a command activation statement that cues a voice-controlled human interface to enter a command mode in which a second user input comprising a voice signal is processed to identify at least one executable command and, in response to determining that the first user input comprises the command activation statement, identifying the at least one executable command in the second user input. | 01-10-2013 |
20130013320 | MULTIMODAL AGGREGATING UNIT - In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input includes interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality. | 01-10-2013 |
20130018659 | Systems and Methods for Speech Command Processing - Methods and apparatus related to processing speech input at a wearable computing device are disclosed. Speech input can be received at the wearable computing device. Speech-related text corresponding to the speech input can be generated. A context can be determined based on database(s) and/or a history of accessed documents. An action can be determined based on an evaluation of at least a portion of the speech-related text and the context. The action can be a command or a search request. If the action is a command, then the wearable computing device can generate output for the command. If the action is a search request, then the wearable computing device can: communicate the search request to a search engine, receive search results from the search engine, and generate output based on the search results. The output can be provided using output component(s) of the wearable computing device. | 01-17-2013 |
20130024200 | METHOD FOR SELECTING PROGRAM AND APPARATUS THEREOF - A program selection method and a display apparatus thereof are provided. The program selection method includes generating a program list including at least one program title; determining whether there is a voice input for a program selection; searching for a desired program title corresponding to the voice input for the program selection among the at least one program title in the program list; and selecting a program corresponding to the desired program title based on the searching for the desired program title. | 01-24-2013 |
20130030814 | SYSTEMS AND METHODS FOR IMPROVING QUALITY OF USER GENERATED AUDIO CONTENT IN VOICE APPLICATIONS - Methods and arrangements for improving quality of content in voice applications. A specification is provided for acceptable content for a voice application, and user generated audio content for the voice application is inputted. At least one test is applied to the user generated audio content, and it is thereupon determined as to whether the user generated audio content meets the provided specification. | 01-31-2013 |
20130030815 | MULTIMODAL INTERFACE - Provided is a multimodal graphical user interface. The multimodal graphical user interface includes a menu with at least one menu item, wherein the at least one menu item is displayed as command name along with a unique hand shape, wherein the at least one menu item is configured to receive a combination of cursor and selection gesture input. | 01-31-2013 |
20130030816 | OFFLINE DELIVERY OF CONTENT AVAILABLE IN VOICE APPLICATIONS - Methods and arrangements for facilitating offline delivery of content available in voice applications. User access to a voice application is permitted, and the user is accorded a capability to select content in the voice application for offline delivery. The selected content is stored in a holding arrangement, and the selected content is availed for delivery to the user. | 01-31-2013 |
20130035941 | METHOD FOR CONTROLLING ELECTRONIC APPARATUS BASED ON VOICE RECOGNITION AND MOTION RECOGNITION, AND ELECTRONIC APPARATUS APPLYING THE SAME - An electronic apparatus and a method for controlling the same are provided. The method recognizes one of a user voice and a user motion through one of a voice recognition module and a motion recognition module, and, if a user voice is recognized through the voice recognition module, performs a voice task corresponding to the recognized user voice, and, if a user motion is recognized through the motion recognition module, performs a motion task corresponding to the recognized user motion. | 02-07-2013 |
20130035942 | ELECTRONIC APPARATUS AND METHOD FOR PROVIDING USER INTERFACE THEREOF - An electronic apparatus and a method for providing a user interface (UI) thereof are provided. Specifically, an electronic apparatus which displays an executable icon of an application which is controllable through voice recognition distinctively from an executable icon of an application which is uncontrollable through voice recognition in a voice task mode, and a method for providing UI thereof are provided. Some of the disclosed exemplary embodiments provide an electronic apparatus which is capable of recognizing a user voice command and a user motion gesture, and displays an executable icon of an application which is controllable through voice recognition and a name of the executable icon distinctively from an executable icon of an application which is uncontrollable through voice recognition and a name of the executable icon in a voice task mode, and a method for providing a UI thereof. | 02-07-2013 |
20130041670 | SPEECH COMMAND INPUT RECOGNITION SYSTEM FOR INTERACTIVE COMPUTER DISPLAY WITH INTERPRETATION OF ANCILLARY RELEVANT SPEECH QUERY TERMS INTO COMMANDS - In an interactive computer controlled display system with speech command input recognition and visual feedback including means for predetermining a plurality of speech commands for respectively initiating each of a corresponding plurality of system actions in combination with means for providing for each of the plurality of speech commands an associated set of speech terms, each term having relevance to its associated command. Also included are means responsive to a detected speech term having relevance to one of the speech commands for displaying a relevant command. The system preferably may display basic speech commands simultaneously along with relevant commands. The means for providing the associated set of speech terms may comprise a stored relevance table of universal speech input commands and universal computer operation terms conventionally associated with system actions initiated by the input commands, and means for relating operation terms of the system with terms in the relevance table. | 02-14-2013 |
20130041671 | Event Driven Motion Systems - A motion system for allowing a person to cause a desired motion operation to be performed, comprising a network, a motion machine, a speech to text converter, a message protocol generator, an instant message receiver, and a motion services system. The motion machine is capable of performing motion operations. The speech to text converter generates a digital representation of a spoken motion message spoken by the person. The message protocol generator generates a digital motion command based on the digital representation of the spoken motion message and causes the digital motion command to be transmitted over the network. The instant message receiver receives the digital motion command. The motion services system causes the motion machine to perform the desired motion operation based on the digital motion command received by the instant message receiver. | 02-14-2013 |
20130046544 | MULTIMODAL TEXT INPUT SYSTEM, SUCH AS FOR USE WITH TOUCH SCREENS ON MOBILE PHONES - A system and method for entering text from a user includes a programmed processor that receives inputs from the user and disambiguates the inputs to present word choices corresponding to the text. In one embodiment, inputs are received in two or more modalities and are analyzed to present the word choices. In another embodiment, a keyboard is divided into zones each of which represents two or more input characters. A sequence of zones selected by the user is analyzed to present word choices corresponding to the zone selected. | 02-21-2013 |
20130054246 | Voice-Activated Measurement System - A voice-activated instrument performs a measurement and displays the measured value when commanded by voice. The system also resets under voice control. The measurement trigger is any single-syllable command such as “Count” or “Go”. The reset trigger is any two-syllable command such as “Reset”. Any type of momentary measurement device may be controlled in this way, including time interval measurements, event counting, length measuring, weighing, and electronic metering measurements, and many others. | 02-28-2013 |
20130054247 | FACILITATING TANGIBLE INTERACTIONS IN VOICE APPLICATIONS - Methods and arrangements for facilitating tangible interactions in voice applications. At least two tangible objects are provided, along with a measurement interface. The at least two tangible objects are disposed to each be displaceable with respect to one another and with respect to the measurement interface. The measurement interface is communicatively connected with a voice application. At least one of the two tangible objects is displaced with respect to the measurement interface, and the displacement of at least one of the at least two tangible objects is converted to input for the voice application. | 02-28-2013 |
20130054248 | PROJECTOR, PROJECTION SYSTEM, AND RETRIEVED INFORMATION DISPLAYING METHOD - A projector includes a display part configured to display a first image by projection; a retrieval object specifying part configured to cause a user of the projector to specify an object of retrieval; a result display area specifying part configured to cause the user to specify an area for displaying the result of the retrieval in the displayed first image; and an image combining part configured to receive a second image of the result of the retrieval from a server that has performed the retrieval with respect to the object of retrieval specified by the user, and to display the second image by combining the second image with the area for displaying the result of the retrieval in the displayed first image. | 02-28-2013 |
20130066636 | Apparatus and method for a wireless extension collar device for use with mobile and fixed end-user wireless devices - A wireless extension device to end-user wireless device has a collar that is worn around the neck. The collar has two end-members that are positioned on the two collar bone areas next to the neck. The end-members have positioned directional speakers therein that radiate sound in the direction of two ears of the human wearing the collar around the neck. The end-members have positioned microphones that pick up voice commands of a human wearing the collar around the neck. The wireless collar extension device is used for hands free communication with end-user wireless device, without having to plug a prior art BLUETOOTH earpiece into one of the ears. | 03-14-2013 |
20130066637 | INFORMATION PROCESSOR - Information processor | 03-14-2013 |
20130073293 | ELECTRONIC DEVICE AND METHOD FOR CONTROLLING THE SAME - An electronic device, a system including the same, and a method for controlling the same are provided. The electronic device may select a specific electronic device to perform a user's voice command in an environment including a plurality of electronic devices capable of voice recognition. The embodiments of the present disclosure allow for interaction between the user and the plurality of electronic devices so that the electronic devices can be efficiently controlled in the N screen environment. | 03-21-2013 |
20130073294 | Voice Controlled Wireless Communication Device System - A wireless communication device that accepts recorded audio data from an end-user. The audio data can be in the form of a command requesting user action. The audio data is reduced to a digital file in a format that is supported by the device hardware. The digital file is sent via wireless communication to at least one server computer for further processing. The command includes a unique device identifier that identifies the wireless communication device. The server computer determines required additional processing for the command based on the unique device identifier. The server computer constructs an application command based on the processed command, and transmits the application command to the wireless communication device. The application command includes at least one instruction that causes a corresponding application on the wireless communication device to execute the application command. | 03-21-2013 |
20130080177 | SPEECH RECOGNITION REPAIR USING CONTEXTUAL INFORMATION - A speech control system that can recognize a spoken command and associated words (such as “call mom at home”) and can cause a selected application (such as a telephone dialer) to execute the command to cause a data processing system, such as a smartphone, to perform an operation based on the command (such as look up mom's phone number at home and dial it to establish a telephone call). The speech control system can use a set of interpreters to repair recognized text from a speech recognition system, and results from the set can be merged into a final repaired transcription which is provided to the selected application. | 03-28-2013 |
20130080178 | USER INTERFACE METHOD AND DEVICE - A user interface method and corresponding device, where the user interface method includes waiting for detection of an event, which is a function of the user interface device, performing the event detection in the user interface device and notifying a user that the event has been detected, activating a voice input unit configured to allow the user to input his or her voice therethrough, receiving a voice command from the user with respect to the event through the voice input unit, and processing a function according to the received voice command from the user. | 03-28-2013 |
20130080179 | USING A PHYSICAL PHENOMENON DETECTOR TO CONTROL OPERATION OF A SPEECH RECOGNITION ENGINE - A device may include a physical phenomenon detector. The physical phenomenon detector may detect a physical phenomenon related to the device. In response to detecting the physical phenomenon, the device may record audio data that includes speech. The speech may be transcribed with a speech recognition engine. The speech recognition engine may be included in the device, or may be included with a remote computing device with which the device may communicate. | 03-28-2013 |
20130085761 | Voice Control For Asynchronous Notifications - A computing device may receive an incoming communication and, in response, generate a notification that indicates that the incoming communication can be accessed using a particular application on the communication device. The computing device may further provide an audio signal indicative of the notification and automatically activate a listening mode. The computing device may receive a voice input during the listening mode, and an input text may be obtained based on speech recognition performed upon the voice input. A command may be detected in the input text. In response to the command, the computing device may generate an output text that is based on at least the notification and provide a voice output that is generated from the output text via speech synthesis. The voice output identifies at least the particular application. | 04-04-2013 |
20130090930 | Speech Recognition for Context Switching - Various embodiments provide techniques for implementing speech recognition for context switching. In at least some embodiments, the techniques can enable a user to switch between different contexts and/or user interfaces of an application via speech commands. In at least some embodiments, a context menu is provided that lists available contexts for an application that may be navigated to via speech commands. In implementations, the contexts presented in the context menu include a subset of a larger set of contexts that are filtered based on a variety of context filtering criteria. A user can speak one of the contexts presented in the context menu to cause a navigation to a user interface associated with one of the contexts. | 04-11-2013 |
20130090931 | MULTIMODAL COMMUNICATION SYSTEM - The present invention, in various embodiments, comprises systems and methods for providing a communication system. In one embodiment, the system is an assistive technology (AT) in a single, highly integrated, multimodal, multifunctional, multipurpose, minimally invasive, unobtrusive, wireless, wearable, easy to use, low cost, and reliable AT that can potentially provide people with severe disabilities with flexible and effective computer access and environmental control in various conditions. In one embodiment, a multimodal Tongue Drive System (mTDS) is disclosed that uses tongue motion as its primary input modality. Secondary input modalities including speech, head motion, and diaphragm control are added to the tongue motion as additional input channels to enhance the system speed, accuracy, robustness, and flexibility, which are expected to address many of the aforementioned issues with traditional ATs that have a limited number of input channels/modalities and can only be used in certain conditions by a certain group of users. | 04-11-2013 |
20130090932 | VEHICULAR APPARATUS - A hands-free conversation vehicular apparatus coupling with a communication terminal includes: a communication device; a sound output device; a sound input device inputting a transmission speech sound; a vehicle information acquisition device; a first sound extraction device setting a first direction for a directionality of the sound input device, and extracting a first sound along the first direction; a sound recognition device; a second sound extraction device specifying a second direction for the transmission speech sound recognized by the sound recognition device, and extracting a second sound along the second direction; a sound quality comparison unit comparing a sound quality of the first and second sounds; a changeover device for selecting one of the first and second sounds as the transmission speech sound; and a control device for allowing the changeover device to perform a changeover when a determination condition is fulfilled. | 04-11-2013 |
20130096925 | SYSTEM FOR PROVIDING A SOUND SOURCE INFORMATION MANAGEMENT SERVICE - Disclosed is a system for providing a sound source information management service. The system for providing a sound source information management service manages sound source information transmitted from a driver terminal and extracts the sound source information corresponding to voice input data via voice recognition according to the voice input data transmitted from the driver terminal and provides the extracted sound source information to the driver terminal. | 04-18-2013 |
20130103404 | MOBILE VOICE PLATFORM ARCHITECTURE - A mobile voice platform providing a user speech interface to computer-based services uses a device having a processor, communication circuitry, an operating system, and applications that are run using the operating system and that utilize the computer-based services via the communication circuitry. The mobile voice platform includes a non-transient digital storage medium storing first and second program modules. Upon execution by the processor the first program module receives speech recognition results, determines a desired service based on the speech recognition results, and provides at least some of the speech recognition results to the second program module. The second program module, when executed, generates a service request based on the speech recognition results provided from the first program module, provides the service request to one or more of the computer-based services, obtains a service result from the computer-based service(s), and supplies the first program module with a response. | 04-25-2013 |
20130103405 | OPERATING SYSTEM AND METHOD OF OPERATING - An operation determination processing section of a center extracts words included in the utterance of a driver and an operator, reads an attribute associated with each word from a synonym and related word in which an attribute is stored so as to be associated with each word, reads a domain of a candidate or the like for the task associated with the attribute from the synonym and related word in which domains of a candidate for a task associated with the read attribute or domains of a task to be actually performed are stored, totals the domains read for each word for words included in the utterance of the driver or the like, and estimates those related to a domain with a highest total score as the candidate for the task and the task to be actually performed. In this manner, it is possible to estimate the task with high accuracy. | 04-25-2013 |
20130110517 | ENABLING SPEECH WITHIN A MULTIMODAL PROGRAM USING MARKUP | 05-02-2013 |
20130110518 | Active Input Elicitation by Intelligent Automated Assistant | 05-02-2013 |
20130110519 | Determining User Intent Based on Ontologies of Domains | 05-02-2013 |
20130110520 | Intent Deduction Based on Previous User Interactions with Voice Assistant | 05-02-2013 |
20130117027 | ELECTRONIC APPARATUS AND METHOD FOR CONTROLLING ELECTRONIC APPARATUS USING RECOGNITION AND MOTION RECOGNITION - An electronic apparatus and a method thereof are provided. A method for controlling an electronic apparatus includes recognizing a voice signal that is input; displaying text corresponding to the recognized voice signal on a display unit of the electronic apparatus; and deleting selected text of the text displayed on the display unit in response to a deletion motion that is input while the text is displayed on the display unit. | 05-09-2013 |
20130124207 | VOICE-CONTROLLED CAMERA OPERATIONS - A computing device (e.g., a smart phone, a tablet computer, digital camera, or other device with image capture functionality) causes an image capture device to capture one or more digital images based on audio input (e.g., a voice command) received by the computing device. For example, a user's voice (e.g., a word or phrase) is converted to audio input data by the computing device, which then compares (e.g., using an audio matching algorithm) the audio input data to an expected voice command associated with an image capture application. In another aspect, a computing device activates an image capture application and captures one or more digital images based on a received voice command. In another aspect, a computing device transitions from a low-power state to an active state, activates an image capture application, and causes a camera device to capture digital images based on a received voice command. | 05-16-2013 |
20130124208 | REAL-TIME DISPLAY OF SYSTEM INSTRUCTIONS - A system and method for reviewing inputted voice instructions in a vehicle-based telematics control unit. The system includes a microphone, a speech recognition processor, and an output device. The microphone receives voice instructions from a user. Coupled to the microphone is the speech recognition processor that generates a voice signal by performing speech recognition processing of the received voice instructions. The output device outputs the generated voice signal to the user. The system also includes a user interface for allowing the user to approve the outputted voice signal, and a communication component for wirelessly sending the generated voice signal to a server over a wireless network upon approval by the user. | 05-16-2013 |
20130124209 | INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND PROGRAM - An information processing apparatus includes: a plurality of information input units; an event detection unit that generates event information including estimated position information and estimated identification information of users present in the real space based on analysis of the information from the information input unit; and an information integration processing unit that inputs the event information, and generates target information including a position of each user and user identification information based on the input event information, and signal information representing a probability value of the event generation source, wherein the information integration processing unit includes an utterance source probability calculation unit, and wherein the utterance source probability calculation unit performs a process of calculating an utterance source score as an index value representing an utterance source probability of each target by multiplying weights based on utterance situations by a plurality of different information items from the event detection unit. | 05-16-2013 |
20130124210 | INFORMATION TERMINAL, CONSUMER ELECTRONICS APPARATUS, INFORMATION PROCESSING METHOD AND INFORMATION PROCESSING PROGRAM - An information terminal connectable to a target apparatus includes a determining unit and a control unit. The determining unit determines whether or not the information terminal is held by a user. When the status changes from being held to not being held, the control unit outputs a control signal to instruct accepting an operation given from the user to the target apparatus. When the status changes from not being held to being held, the control unit performs at least one of displaying a remote controller to operate the target apparatus on a display screen of the information terminal or acquiring information on a status of the target apparatus from the target apparatus to display the information on the display screen. | 05-16-2013 |
20130124211 | SYSTEM AND METHOD FOR ENHANCED COMMUNICATIONS VIA SMALL DATA RATE COMMUNICATION SYSTEMS - A system and method for interacting with an interactive communication system include processing a profile associated with an interactive communication system; generating a user interface based on the processing of the profile to solicit a user response correlating to a response required by the interactive communication system; receiving the user response via the user interface; updating the user interface using the profile based on the user response; and sending a signal to the interactive communication system based on one or more user responses. | 05-16-2013 |
20130132094 | SYSTEM AND METHOD FOR VOICE ACTUATED CONFIGURATION OF A CONTROLLING DEVICE - A speech recognition engine is provided voice data indicative of at least a brand of a target appliance. The speech recognition engine uses the voice data indicative of at least a brand of the target appliance to identify within a library of codesets at least one codeset that is cross-referenced to the brand of the target appliance. The at least one codeset so identified is then caused to be provisioned to the controlling device for use in commanding functional operations of the target appliance. | 05-23-2013 |
20130132095 | AUDIO PATTERN MATCHING FOR DEVICE ACTIVATION - A system and method are disclosed for activating an electric device from a standby power mode to a full power mode. The system may include one or more microphones for monitoring audio signals in the vicinity of the electric device, and a standby power activation unit including a low-power microprocessor and a non-volatile memory. Audio captured by the one or more microphones is digitized and compared by the microprocessor against predefined activation pattern(s) stored in the non-volatile memory. If a pattern match is detected between the digital audio pattern and a predefined activation pattern, the electric device is activated. | 05-23-2013 |
20130132096 | Systems and Techniques for Producing Spoken Voice Prompts - Methods and systems are described in which spoken voice prompts can be produced in a manner such that they will most likely have the desired effect, for example to indicate empathy, or produce a desired follow-up action from a call recipient. The prompts can be produced with specific optimized speech parameters, including duration, gender of speaker, and pitch, so as to encourage participation and promote comprehension among a wide range of patients or listeners. Upon hearing such voice prompts, patients/listeners can know immediately when they are being asked questions that they are expected to answer, and when they are being given information, as well as the information that is considered sensitive. | 05-23-2013 |
20130138444 | MODIFICATION OF OPERATIONAL DATA OF AN INTERACTION AND/OR INSTRUCTION DETERMINATION PROCESS - It is inter alia disclosed to perform at least one of operating an interaction process with a user of the medical apparatus and determining, based on a representation of at least one instruction given by the user, at least one instruction operable by the medical apparatus. Therein, the at least one of the operating and the determining at least partially depends on operational data. It is further disclosed to receive modification information for modifying at least a part of the operational data, wherein the modification information is at least partially determined based on an analysis of a representation of at least one instruction given by the user. | 05-30-2013 |
20130144629 | SYSTEM AND METHOD FOR CONTINUOUS MULTIMODAL SPEECH AND GESTURE INTERACTION - Disclosed herein are systems, methods, and non-transitory computer-readable storage media for processing multimodal input. A system configured to practice the method continuously monitors an audio stream associated with a gesture input stream, and detects a speech event in the audio stream. Then the system identifies a temporal window associated with a time of the speech event, and analyzes data from the gesture input stream within the temporal window to identify a gesture event. The system processes the speech event and the gesture event to produce a multimodal command. The gesture in the gesture input stream can be directed to a display, but is remote from the display. The system can analyze the data from the gesture input stream by calculating an average of gesture coordinates within the temporal window. | 06-06-2013 |
20130159002 | VOICE APPLICATION ACCESS - A system may include a mobile computing device configured to receive voice input; identify, in the voice input, a navigate command including a sequence indication; determine, based on a sequence control map, a control of a user interface corresponding to the sequence indication; and activate the control of the user interface corresponding to the sequence indication. | 06-20-2013 |
20130159003 | METHOD AND APPARATUS FOR PROVIDING CONTENTS ABOUT CONVERSATION - Disclosed are a method and an apparatus for providing contents about conversation, which collect voice information from conversation between a user and another person, search contents on the basis of the collected voice information, and provide contents about the conversation between the user and the person. The method of providing contents about conversation includes: a voice information collecting step of collecting voice information from conversation between a user and another person; a keyword creating control step of creating search keywords by using the collected voice information; and a contents providing control step of searching contents by using the created search keywords, and providing the searched contents. | 06-20-2013 |
20130166305 | SPEECH RECOGNITION ADJUSTMENT BASED ON MANUAL INTERACTION - A method of operating a speech recognition system on a vehicle having a visual display and manually-operated input device that includes initiating a speech recognition system, controlling menu selections on a visual display using a manually-operated input device, receiving a notification from the manually-operated input device indicating that the user is manipulating the device in conjunction with the menu selections on the visual display, and adjusting operation of the speech recognition system based on input received by the manually-operated input device. | 06-27-2013 |
20130173270 | ELECTRONIC APPARATUS AND METHOD OF CONTROLLING ELECTRONIC APPARATUS - An electronic apparatus and a method of controlling the electronic apparatus are provided. The method includes: receiving a voice command; and if the voice command is a first voice start command, changing a mode of the electronic apparatus to a first voice task mode in which the electronic apparatus is controlled according to further voice input, and if the voice command is a second voice start command, changing the mode of the electronic apparatus to a second voice task mode in which the electronic apparatus is controlled according to the further voice input received via an external apparatus which operates with the electronic apparatus. This provides efficiency and flexibility in controlling the electronic apparatus by using a microphone of the electronic apparatus or a microphone of the external apparatus. | 07-04-2013 |
20130179172 | IMAGE REPRODUCING DEVICE, IMAGE REPRODUCING METHOD - An image reproducing device connected to a reproducing unit that reproduces image data includes an extraction unit configured to extract first-condition-satisfying-image data that satisfies a first extraction condition from image data stored in a storage unit; a voice keyword extraction unit configured to extract a keyword that matches a voice input to a voice input unit; and a presentation unit configured to determine, while the first-condition-satisfying-image data is being reproduced by the reproducing unit, a second extraction condition based on a relationship between the first extraction condition applied when extracting the first-condition-satisfying-image data being reproduced and the keyword that has been extracted, and present information pertinent to second-condition-satisfying-image data that satisfies the second extraction condition among the image data stored in the storage unit. | 07-11-2013 |
20130179173 | METHOD AND APPARATUS FOR EXECUTING A USER FUNCTION USING VOICE RECOGNITION - A method and an apparatus for executing a user function using voice recognition. The method includes displaying a user function execution screen; confirming a function to be executed according to voice input; displaying a voice command corresponding to the confirmed function on the user function execution screen; recognizing a voice input by a user, while a voice recognition execution request is continuously received; and executing the function associated with the input voice command, when the recognized voice input is at least one of the displayed voice commands. | 07-11-2013 |
20130179174 | MACHINE, SYSTEM AND METHOD FOR USER-GUIDED TEACHING AND MODIFYING OF VOICE COMMANDS AND ACTIONS EXECUTED BY A CONVERSATIONAL LEARNING SYSTEM - A machine, system and method for user-guided teaching and modifications of voice commands and actions to be executed by a conversational learning system. The machine includes a system bus for communicating data and control signals received from the conversational learning system to a computer system, a vehicle data and control bus for connecting devices and sensors in the machine, a bridge module for connecting the vehicle data and control bus to the system bus, machine subsystems coupled to the vehicle data and control bus having a respective user interface for receiving a voice command or input signal from a user, a memory coupled to the system bus for storing action command sequences learned for a new voice command and a processing unit coupled to the system bus for automatically executing the action command sequences learned when the new voice command is spoken. | 07-11-2013 |
20130185078 | METHOD AND SYSTEM FOR USING SOUND RELATED VEHICLE INFORMATION TO ENHANCE SPOKEN DIALOGUE - Sound related vehicle information representing one or more sounds may be received in a processor. The sound related vehicle information may or may not include an audio signal. Spoken dialogue of a spoken dialogue system associated with the vehicle may be modified based on the sound related vehicle information. | 07-18-2013 |
20130185079 | HOME APPLIANCE, HOME APPLIANCE SYSTEM, AND METHOD FOR OPERATING SAME - The present invention relates to a home appliance, to a home appliance system, and to a method for operating same, wherein the home appliance and a mobile terminal are connected to one another to add or update data in the home appliance through the mobile terminal connected thereto, diagnose the state of the home appliance by means of the mobile terminal, and supplement the function of the home appliance by means of the mobile terminal, thus expanding the functions of the home appliance to enable the easy control of the home appliance, and more conveniently controlling the home appliance. | 07-18-2013 |
20130185080 | USER SPEECH INTERFACES FOR INTERACTIVE MEDIA GUIDANCE APPLICATIONS - A user speech interface for interactive media guidance applications, such as television program guides, guides for audio services, guides for video-on-demand (VOD) services, guides for personal video recorders (PVRs), or other suitable guidance applications is provided. Voice commands may be received from a user and guidance activities may be performed in response to the voice commands. | 07-18-2013 |
20130185081 | Maintaining Context Information Between User Interactions with a Voice Assistant - Methods, systems, and computer readable storage medium related to operating an intelligent digital assistant are disclosed. A first task is performed using a first parameter. A text string is obtained from a speech input received from a user. Based at least partially on the text string, a second task different from the first task or a second parameter different from the first parameter is identified. The first task is performed using the second parameter or the second task is performed using the first parameter. | 07-18-2013 |
20130191132 | VEHICLE-TO-VEHICLE COMMUNICATION DEVICE - A vehicle-to-vehicle communication device generates voice information that includes a voice message and added information regarding an output of the voice message. The voice information is transmitted in one direction of a subject vehicle via a transmission unit, and voice information from another vehicle is received via a reception unit. The vehicle-to-vehicle communication device plays the voice message of the voice information received by the reception unit based on the added information of the voice information. In such manner, information regarding a travel situation is appropriately transmitted by the vehicle-to-vehicle communication device. | 07-25-2013 |
20130197914 | VOICE ACTIVATED AUDIO CONTROL SYSTEM AND ASSOCIATED METHOD OF USE - A voice activated system for operating electronic devices in an environment includes a microphone for receiving a verbal command that requests the addition of a new voice command, a first processor, that is electrically connected to the microphone, for receiving a customized command input regarding a preexisting user for the voice activated system that should be associated with the new verbal command, input involving a new verbal command, and input involving a system command, where the first processor is then able to receive verbal input to recognize a user, a verbal command, and then determine an associated action, an appropriate command for that action and then generate an associated system command, and a second processor, in electronic communication with the first processor, and two or more electronic devices in an environment, where the second processor is capable of receiving the system command and operating the two or more devices. | 08-01-2013 |
20130197915 | SPEECH-BASED USER INTERFACE FOR A MOBILE DEVICE - A method of providing hands-free services using a mobile device having wireless access to computer-based services includes carrying out a completed speech session via a mobile device without any physical interaction with the mobile device, wherein the speech session includes receiving a speech input from a user, and obtaining from a cloud service a service result responsive to the speech input, and providing the service result as a speech response presented to the user. | 08-01-2013 |
20130197916 | TERMINAL DEVICE, SPEECH RECOGNITION PROCESSING METHOD OF TERMINAL DEVICE, AND RELATED PROGRAM - According to one embodiment, a terminal device including a main body includes: a sound input module configured to receive a voice, convert the voice into a digital signal, and output the digital signal; a state detecting module having an acceleration sensor, configured to detect one or both of a movement and a state of the main body and output a detection result; and an executing module, which is capable of executing plural speech recognition response processes, configured to execute one of the speech recognition response processes on the digital signal according to the detection result detected by the state detecting module. | 08-01-2013 |
20130197917 | METHODS AND SYSTEMS FOR UTILIZING VOICE COMMANDS ONBOARD AN AIRCRAFT - Methods and systems are provided for utilizing audio commands onboard an aircraft. A method comprises identifying a flight phase for the aircraft, resulting in an identified flight phase, receiving an audio input, resulting in received audio input, filtering the received audio input in a manner that is influenced by the identified flight phase for the aircraft, resulting in filtered audio input, and validating the filtered audio input as a first voice command of a first plurality of possible voice commands. | 08-01-2013 |
20130204629 | VOICE INPUT DEVICE AND DISPLAY DEVICE - A voice input device includes a wave guide unit for guiding an incident sound wave, a microphone unit for converting a sound wave guided through the wave guide unit to an electrical sound signal, and a signal processing unit for processing the sound signal obtained by the microphone unit, using an acoustic characteristic given by the wave guide unit to the sound wave, in which the wave guide unit has a structure which gives the acoustic characteristic that is different between direct sound, which is sound that reaches the microphone unit without reflecting off an internal surface of the wave guide unit, and indirect sound, which is sound that is reflected off the internal surface before reaching the microphone unit, and the signal processing unit determines whether or not the direct sound is input based on a difference in the acoustic characteristic between the direct sound and the indirect sound. | 08-08-2013 |
20130211842 | Method For Quick Scroll Search Using Speech Recognition - A method for a computing device to search for data entails receiving first user input that initiates a quick scrolling action and activates a speech recognition subsystem, receiving second user input by recognizing voice input using the speech recognition subsystem to determine a search query, and searching for data that corresponds to the search query. The quick scrolling action and activation of the speech recognition subsystem may be triggered, for example, by a swiping gesture on an optical jog pad, on a touch screen, or on a touch-sensitive mouse, or by a contactless three-dimensional gesture. | 08-15-2013 |
20130211843 | ENGAGEMENT-DEPENDENT GESTURE RECOGNITION - Methods, apparatuses, systems, and computer-readable media for performing engagement-dependent gesture recognition are presented. According to one or more aspects, a computing device may detect an engagement of a plurality of engagements, and each engagement of the plurality of engagements may define a gesture interpretation context of a plurality of gesture interpretation contexts. Subsequently, the computing device may detect a gesture. Then, the computing device may execute at least one command based on the detected gesture and the gesture interpretation context defined by the detected engagement. In some arrangements, the engagement may be an engagement pose, such as a hand pose, while in other arrangements, the detected engagement may be an audio engagement, such as a particular word or phrase spoken by a user. | 08-15-2013 |
20130211844 | Solar Powered Portable Control Panel - A solar powered portable control panel is disclosed herein for wirelessly controlling one or more lights or other devices. An embodiment of the control panel includes a solar panel, a regulator connected to the solar panel, a power storage device connected to the regulator, a wireless transceiver, a controller connected to the power storage device, and a user interface connected to the controller. The user interface is adapted to accept control input and provide it to the controller. The controller is adapted to transmit commands on the wireless transceiver. | 08-15-2013 |
20130218572 | METHOD AND APPARATUS FOR SMART VOICE RECOGNITION - A display device with a voice recognition capability may be used to allow a user to speak voice commands for controlling certain features of the display device. As a means for increasing operational efficiency, the display device may utilize a plurality of voice recognition units where each voice recognition unit may be assigned a specific task. | 08-22-2013 |
20130218573 | VOICE COMMAND RECOGNITION METHOD AND RELATED ELECTRONIC DEVICE AND COMPUTER-READABLE MEDIUM - An electronic device for browsing a document is disclosed. The document being browsed includes a plurality of command-associated text strings. First, a text string selector of the electronic device selects a plurality of candidate text strings from the command-associated text strings. Afterward, an acoustic string provider of the electronic device prepares a candidate acoustic string for each of the candidate text strings. Thereafter, a microphone of the electronic device receives a voice command. Next, a speech recognizer of the electronic device searches the candidate acoustic strings for a target acoustic string that matches the voice command, wherein the target acoustic string corresponds to a target text string of the candidate text strings. Finally, a document browser of the electronic device executes a command associated with the target text string. | 08-22-2013 |
20130218574 | Management and Prioritization of Processing Multiple Requests - Systems and methods are described that utilize an interaction manager to manage interactions (also known as requests or dialogues) from one or more applications. The interactions are managed properly even if multiple applications use different grammars. The interaction manager maintains a priority for each of the interactions, such as via an interaction list, where the priority of the interactions corresponds to an order in which the interactions are to be processed. Interactions are normally processed in the order in which they are received. However, the systems and methods described herein may provide a grace period after processing a first interaction and before processing a second interaction. If a third interaction that is chained to the first interaction is received during this grace period, then the third interaction may be processed before the second interaction. | 08-22-2013 |
20130218575 | AUDIO INPUT APPARATUS, COMMUNICATION APPARATUS AND CONDITION NOTIFICATION METHOD - The used condition of a simplex communication apparatus is notified with a light-emitting device attached to the communication apparatus. It is determined whether a communication mode of the simplex communication apparatus is a transmission mode or a standby mode. A sound pick-up state of a sound carried by a speech signal to be transmitted is determined if the communication mode is the transmission mode. The light-emitting device is controlled so that it is turned off, turned on or repeatedly turned on and off based on determination results of the communication-mode determination and the sound pick-up state determination. | 08-22-2013 |
20130226588 | Simulated Conversation by Pre-Recorded Audio Navigator - A method is provided for a simulated conversation by a pre-recorded audio navigator, with particular application to informational and entertainment settings. A monitor may utilize a navigation interface to select pre-recorded responses in the voice of a character represented by a performer. The pre-recorded responses may then be queued and sent to a speaker proximate to the performer. By careful organization of an audio database including audio buckets and script-based navigation with shifts for tailoring to specific guest user profiles and environmental contexts, a convincing and dynamic simulated conversation may be carried out while providing the monitor with a user-friendly navigation interface. Thus, highly specialized training is not necessary and flexible scaling to large-scale deployments is readily supported. | 08-29-2013 |
20130226589 | CONTROL USING TEMPORALLY AND/OR SPECTRALLY COMPACT AUDIO COMMANDS - A sound-activated control system includes an audio receiver and a command discriminator. The receiver is configured to receive an audio waveform and to produce a digital audio waveform therefrom. The command discriminator is configured to detect a temporally and/or spectrally compact nonphonetic audio command within the digital audio waveform and to control a voice-activated system to perform an action in response to the nonphonetic command. | 08-29-2013 |
20130226590 | VOICE INPUT APPARATUS AND METHOD - Provided is a voice input method and apparatus that may, when executing an application, select and drive an execution screen that is requested to be executed instead of executing a default screen of the application. When executing an application, a user may more conveniently and quickly execute the user's selected function and display an execution screen by decreasing a plurality of touch input operations. | 08-29-2013 |
20130226591 | METHOD AND APPARATUS FOR CONTROLLING LOCK/UNLOCK STATE OF TERMINAL THROUGH VOICE RECOGNITION - A method for controlling a terminal through a voice input is provided. The method includes receiving a voice input when the terminal is in a state in which the terminal is locked and performing an operation corresponding to the voice input if the voice input corresponds to a preset command. | 08-29-2013 |
20130226592 | METHOD FOR BROWSING WITHIN A CONTENT DISPLAYABLE BY BROWSING COMMANDS, BROWSING DEVICE AND ASSOCIATED PROGRAM - A method for browsing in a visual content such as a document or a list. The content is available on a terminal having a browsing command. Part of the content is displayed on a display of the terminal. The browsing commands enable the contents displayed on the screen to be made to scroll in the direction specified by the command introduced. The displayed part is duplicated into two identical images when one end of the content situated in the direction of movement specified by the browsing command is displayed on the means for displaying. A first image remains still and a second image moves in the direction of movement specified by the browsing command so long as the command is active. In this way, the user sees that the command has indeed been taken into account and notes visually that the end of the visual content has been reached. | 08-29-2013 |
20130231937 | Context Sensitive Overlays In Voice Controlled Headset Computer Displays - In headset computers that leverage voice commands, often the user does not know what voice commands are available. In one embodiment, a method includes providing a user interface in a headset computer and, in response to user utterance of a cue toggle command, displaying at least one cue in the user interface. Each cue can correspond to a voice command associated with code to execute. In response to user utterance of the voice command, the method can also include executing the code associated with the voice command. The user can therefore ascertain what voice commands are available. | 09-05-2013 |
20130231938 | Method and Apparatus for Communication Between Humans and Devices - This invention relates to methods and apparatus for improving communications between humans and devices. The invention provides a method of modulating operation of a device, comprising: providing an attentive user interface for obtaining information about an attentive state of a user; and modulating operation of a device on the basis of the obtained information, wherein the operation that is modulated is initiated by the device. Preferably, the information about the user's attentive state is eye contact of the user with the device that is sensed by the attentive user interface. | 09-05-2013 |
20130238341 | DEVICE CAPABLE OF PLAYING MUSIC AND METHOD FOR CONTROLLING MUSIC PLAYING IN ELECTRONIC DEVICE - An electronic device includes a music play module that plays music and a voice recorder that records ambient voice around the electronic device. The electronic device further includes a music control module that identifies voice characteristics of the ambient voice, and controls the music play module to pause the playing of the music when the voice characteristics of the ambient voice match pre-configured voice reference information. | 09-12-2013 |
20130246071 | ELECTRONIC DEVICE AND METHOD FOR CONTROLLING POWER USING VOICE RECOGNITION - An electronic apparatus and a power controlling method are provided. The electronic apparatus includes: a voice input unit which receives an audio input in a stand-by mode of the electronic apparatus; a voice sensing unit which determines whether the received audio input is a user voice, and if the user voice is input, outputs a power control signal; and a power control voice recognition unit which, if the power control signal is received from the voice recognition unit, turns on and performs voice recognition regarding the input user voice. | 09-19-2013 |
20130246072 | System and Method for Customized Voice Response - Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating an accent source. A system practicing the method collects data associated with customer specific services, generates country-specific or dialect-specific weights for each service in the customer specific services list, generates a summary weight based on an aggregation of the country-specific or dialect-specific weights, and sets an interactive voice response system language model based on the summary weight and the country-specific or dialect-specific weights. The interactive voice response system can also change the user interface based on the interactive voice response system language model. The interactive voice response system can tune a voice recognition algorithm based on the summary weight and the country-specific weights. The interactive voice response system can adjust phoneme matching in the language model based on a possibility that the speaker is using other languages. | 09-19-2013 |
20130253937 | METHOD AND APPARATUS FOR SMART VOICE RECOGNITION - A display device with a voice recognition capability may be used to allow a user to speak voice commands for controlling certain features of the display device. As a means for increasing operational efficiency, the display device may utilize a plurality of voice recognition units where each voice recognition unit may be assigned a specific task. | 09-26-2013 |
20130262126 | Systems and Methods for Off-Board Voice-Automated Vehicle Navigation - A method of providing navigational information includes processing destination information spoken by a vehicle occupant on-board. The processed voice information is transmitted to a remote center wirelessly. The processed voice information is voice recognition analyzed at the remote data center to recognize components of the destination information spoken. The remote center generates a list of hypothetical recognized components of the destination information listed by confidence levels as calculated for each component of the destination information analyzed by the voice recognition system. The hypothetical recognized component list is displayed with confidence levels at the remote center for selective checking by a human data center operator. A component set is selected based on the confidence levels and accuracy of the selected set is confirmed by interactive voice exchanges between the vehicle driver and the remote data center. A destination is determined from confirmed components of the destination information. | 10-03-2013 |
20130268276 | Menu Hierarchy Skipping Dialog for Directed Dialog Speech Recognition - A method and a processing device for managing an interactive speech recognition system are provided. Whether a voice input relates to expected input, at least partially, of any one of a group of menus different from a current menu is determined. If the voice input relates to the expected input, at least partially, of any one of the group of menus different from the current menu, skipping to the one of the group of menus is performed. The group of menus different from the current menu includes menus at multiple hierarchical levels. | 10-10-2013 |
20130275139 | VOICE RESPONSIVE FLUID DELIVERY, CONTROLLING AND MONITORING SYSTEM AND METHOD - A voice-actuated system and methodology for delivering fluids and monitoring their status. This system has application where a hands-free environment is preferred. Voice commands are given by the user via a Bluetooth® headset and received typically by the user's Smartphone. Voice recognition circuitry is programmed to recognize the simple commands and, through complementing electronics and electro-mechanical and mechanical elements, delivery at corresponding flow rates is accomplished. A further feature allows for respective voice commands to initiate a monitoring function where the status of any particular characteristic of the fluid can be relayed back to the user via the headset. | 10-17-2013 |
20130282380 | Method And System For Facilitating Communications For A User Transaction - Current human-to-machine interfaces enable users to interact with a company's database and enter into a series of transactions (e.g., purchasing products/services and paying bills). Each transaction may require several operations or stages requiring user input or interaction. Some systems enable a user to enter a voice input parameter providing multiple operations of instruction (e.g., single natural language command). However, users of such a system do not know what types of commands the system is capable of accepting. Embodiments of the present invention facilitate communications for user transactions by determining a user's goal transaction and presenting a visual representation of a voice input parameter for the goal transaction. The use of visual representations notifies the user of the system's capability of accepting single natural language commands and the types of commands the system is capable of accepting, thereby enabling a user to complete a transaction in a shorter period of time. | 10-24-2013 |
20130282381 | METHODS AND SYSTEMS FOR SPEECH-ENABLING A HUMAN-TO-MACHINE INTERFACE - Generally, human-to-machine interfaces are configured to accept speech input from a user. However, such interfaces, e.g., web browsers, must be configured to enable acceptance of speech input from the user. Some interfaces, such as mobile browsers, have less configuration adaptability and are not able to be configured to accept speech input from a user. Embodiments of the present invention speech-enable human-to-machine interfaces by loading content of the human-to-machine interface and adding logic configured to enable speech interaction with the content to the interface. The embodiment then activates speech interaction with the content via the logic for the user. Thus, embodiments of the present invention enable speech interaction with interfaces that are not configured to be adapted to allow speech interaction and are able to enable the speech interaction in a seamless manner. | 10-24-2013 |
20130290000 | Voiced Interval Command Interpretation - A method is disclosed for controlling a voice-activated device by interpreting a spoken command as a series of voiced and non-voiced intervals. A responsive action is then performed according to the number of voiced intervals in the command. The method is well-suited to applications having a small number of specific voice-activated response functions. Applications using the inventive method offer numerous advantages over traditional speech recognition systems including speaker universality, language independence, no training or calibration needed, implementation with simple microcontrollers, and extremely low cost. For time-critical applications such as pulsers and measurement devices, where fast reaction is crucial to catch a transient event, the method provides near-instantaneous command response, yet versatile voice control. | 10-31-2013 |
20130290001 | IMAGE PROCESSING APPARATUS, VOICE ACQUIRING APPARATUS, VOICE RECOGNITION METHOD THEREOF AND VOICE RECOGNITION SYSTEM - Disclosed are an image processing apparatus, a voice acquiring apparatus, a voice recognition method and a voice recognition system. The image processing apparatus includes an image processor which processes an image signal, a communication unit which communicates with at least one electronic apparatus, and a controller which includes a voice recognition engine to recognize a voice command, and controls the communication unit to transmit a command to the at least one electronic apparatus corresponding to the voice command recognized by the voice recognition engine. | 10-31-2013 |
20130290002 | VOICE CONTROL DEVICE, VOICE CONTROL METHOD, AND PORTABLE TERMINAL DEVICE - A voice control device includes a calculation section configured to calculate a response time representing a time difference between a voice in a received signal and a voice in a sending signal; a hearing estimate section configured to estimate hearing of a user based on the calculated response time; and a voice control section configured to control the received signal by a compensation quantity responsive to the estimated hearing. | 10-31-2013 |
20130297318 | SPEECH RECOGNITION SYSTEMS AND METHODS - A method of enabling speech commands in an application includes identifying, by a computer processor, a user interaction element within a resource of the application; extracting, by the computer processor, text associated with the identified user interaction element; generating, by the computer processor, a voice command corresponding to the extracted text; and adding the generated voice command to a grammar associated with the application. | 11-07-2013 |
20130297319 | MOBILE DEVICE HAVING AT LEAST ONE MICROPHONE SENSOR AND METHOD FOR CONTROLLING THE SAME - A mobile device having at least one microphone sensor and a method for controlling the same are disclosed. The method includes receiving at least two audio signals through the at least one microphone sensor within a predetermined time period, recognizing input directions and a voice command from the at least two audio signals sequentially, determining whether the recognized input directions and voice command match preset input directions and a preset voice command mapped to the preset directions, sequentially for the at least two received audio signals, and executing a preset control command if the recognized input directions and voice command match the preset input directions and voice command. | 11-07-2013 |
20130297320 | VOICE-CONTROLLED THREE-DIMENSIONAL FABRICATION SYSTEM - An additive three-dimensional fabrication system includes voice control for user interaction. This voice-controlled interface can enable a variety of voice-controlled functions and operations, while supporting interactions specific to consumer-oriented fabrication processes. | 11-07-2013 |
20130297321 | LANDMARK-BASED LOCATION BELIEF TRACKING FOR VOICE-CONTROLLED NAVIGATION SYSTEM - An utterance is received from a user specifying a location attribute and a landmark. A set of candidate locations is identified based on the specified location attribute, and a confidence score can be determined for each candidate location. A set of landmarks is identified based on the specified landmark, and confidence scores can be determined for the landmarks. An associated kernel model is generated for each landmark. Each kernel model is centered at the location of the associated landmark on a map, and the amplitude of the kernel model can be based on landmark attributes, landmark confidence scores, characteristics of the user, and the like. The candidate locations are ranked based on the amplitudes of overlapping kernel models at the candidate locations, and can also be ranked based on confidence scores associated with the candidate locations. A candidate location is selected and presented to the user based on the candidate location ranking. | 11-07-2013 |
20130304479 | Sustained Eye Gaze for Determining Intent to Interact - Methods and systems for determining intent in voice and gesture interfaces are described. An example method includes determining that a gaze direction is in a direction of a gaze target, and determining whether a predetermined time period has elapsed while the gaze direction is in the direction of the gaze target. The method may also include providing an indication that the predetermined time period has elapsed when the predetermined time period has elapsed. According to the method, a voice or gesture command that is received after the predetermined time period has elapsed may be determined to be an input for a computing system. Additional example systems and methods are described herein. | 11-14-2013 |
20130317828 | CONTENT RANKING AND SERVING ON A MULTI-USER DEVICE OR INTERFACE - The effectiveness of targeted content delivery at a multi-user interface can be directly linked to a proper targeting of users. A way of improving targeted content delivery at a multi-user interface can be to determine which users should be targeted based on one or more criteria. The present technology provides various methodologies for selecting one or more users associated with a multi-user interface to receive targeted content. Such users can be selected based on criteria associated with a ranking or priority of the users, criteria associated with an analysis of their interactions with the multi-user interface, criteria based on their most common characteristics, or any combination thereof. The user characteristics associated with such identified users can then be utilized to determine which content should be delivered to the multi-user interface. | 11-28-2013 |
20130325479 | SMART DOCK FOR ACTIVATING A VOICE RECOGNITION MODE OF A PORTABLE ELECTRONIC DEVICE - A dock for a portable electronic device including a housing, a connector extending from the housing to connect the portable electronic device to the dock, a microphone integrated within the housing, and a processor. The processor is operatively coupled to receive audio input from the microphone, and in response to the audio input, transmit a message to the portable electronic device via the connector to activate a voice recognition mode of the portable electronic device. | 12-05-2013 |
20130325480 | REMOTE CONTROLLER AND CONTROL METHOD THEREOF - A remote controller includes a housing, a direction sensor, a microphone, a controller, and a wireless transmitter. A control method of the remote controller includes detecting an angle between an axis of a remote controller and a vertical axis, enabling a microphone of the remote controller when the angle is within a predetermined range in order to generate a voice signal according to a voice command, and generating a first control signal according to the voice signal and transmitting the first control signal wirelessly. | 12-05-2013 |
20130325481 | VOICE INSTRUCTIONS DURING NAVIGATION - A method of providing navigation on an electronic device when the display screen is locked. The method receives a verbal request to start navigation while the display is locked. The method identifies a route from a current location to a destination based on the received verbal request. While the display screen is locked, the method provides navigational directions on the electronic device from the current location of the electronic device to the destination. Some embodiments provide a method for processing a verbal search request. The method receives a navigation-related verbal search request and prepares a sequential list of the search results based on the received request. The method then provides audible information to present a search result from the sequential list. The method presents the search results in a batch form until the user selects a search result, the user terminates the search, or the search items are exhausted. | 12-05-2013 |
20130325482 | ESTIMATING CONGNITIVE-LOAD IN HUMAN-MACHINE INTERACTION - Estimating cognitive-load of a user in human-machine interaction by identifying an expression of cognitive-load within a user expression captured by a dialogue system and using a user model to estimate a level of the cognitive-load based on the expression of cognitive-load. | 12-05-2013 |
20130325483 | DIALOGUE MODELS FOR VEHICLE OCCUPANTS - Methods and apparatus for creating and managing multiple dialogue models in a statistical dialogue modeling system capable of learning, and conducting human-machine dialogues based on selected models. Dialogue models are selected according to feature vectors that describe characteristics of the dialogue participants and their current situation. Mobile apparatus in motor vehicles can provide optimized dialogue service to occupants of the motor vehicles according to vehicle location and route, in addition to personal characteristics of the occupants, whether driver or passenger. When networked via a remote dialogue server, a large pool of dialogue participants is available for automatic building of dialogue models suitable for handling a variety of situations and participants. | 12-05-2013 |
20130325484 | METHOD AND APPARATUS FOR EXECUTING VOICE COMMAND IN ELECTRONIC DEVICE - An apparatus and method for executing a voice command in an electronic device. In an exemplary embodiment, a voice signal is detected and speech thereof is recognized. When the recognized speech contains a wakeup command, a voice command mode is activated, and a signal containing at least a portion of the detected voice signal is transmitted to a server. The server generates a control signal or a result signal corresponding to the voice command, and transmits the same to the electronic device. The device receives and processes the control or result signal, and awakens. Thereby, voice commands are executed without the need for the user to physically touch the electronic device. | 12-05-2013 |
20130325485 | DETECTION AND USE OF ACOUSTIC SIGNAL QUALITY INDICATORS - A computer-driven device assists a user in self-regulating speech control of the device. The device processes an input signal representing human speech to compute acoustic signal quality indicators indicating conditions likely to be problematic to speech recognition, and advises the user of those conditions. | 12-05-2013 |
20130339027 | DEPTH BASED CONTEXT IDENTIFICATION - A method or system for selecting or pruning applicable verbal commands associated with speech recognition based on a user's motions detected from a depth camera. Depending on the depth of the user's hand or arm, the context of the verbal command is determined and verbal commands corresponding to the determined context are selected. Speech recognition is then performed on an audio signal using the selected verbal commands. By using an appropriate set of verbal commands, the accuracy of the speech recognition is increased. | 12-19-2013 |
20130339028 | Power-Efficient Voice Activation - A voice activation system is provided. The voice activation system includes a first stage configured to output a first activation signal if at least one energy characteristic of a received audio signal satisfies at least one threshold and a second stage configured to transition from a first state to a second state in response to the first activation signal and, when in the second state, to output a second activation signal if at least a portion of a profile of the audio signal substantially matches at least one predetermined profile. | 12-19-2013 |
20130339029 | REMOTE CONTROL SIGNALING USING AUDIO WATERMARKS - A system for using a watermark embedded in an audio signal to remotely control a device. Various devices such as toys, computers, and appliances, equipped with an appropriate detector, detect the hidden signals, which can trigger an action, or change a state of the device. The watermarks can be used with a “time gate” device, where detection of the watermark opens a time interval within which a user is allowed to perform an action, such as pressing a button, typing in an answer, turning a key in a lock, etc. | 12-19-2013 |
20130339030 | INTERACTIVE SPOKEN DIALOGUE INTERFACE FOR COLLECTION OF STRUCTURED DATA - A multimodal dialog interface for data capture at point of origin is disclosed. The interface is designed to allow loading of forms needed for task record keeping, and is therefore customizable to a wide range of record keeping requirements such as medical record keeping or recording clinical trials. The interface has a passive mode that is able to capture data while the user is performing other tasks, and an interactive dialog mode that ensures completion of all required information. | 12-19-2013 |
20130339031 | DISPLAY APPARATUS, METHOD FOR CONTROLLING THE DISPLAY APPARATUS, SERVER AND METHOD FOR CONTROLLING THE SERVER - A display apparatus is disclosed. The display apparatus includes a voice collecting unit which collects a user's voice; a first communication unit which transmits the user's voice to a first server, and receives text information corresponding to the user's voice from the first server; a second communication unit which transmits the received text information to a second server, and receives response information corresponding to the text information; an output unit which outputs a response message corresponding to the user's voice based on the response information; and a control unit which controls the output unit to output a response message differentiated from a response message corresponding to a previously collected user's voice, when a user's voice having a same utterance intention is re-collected. | 12-19-2013 |
20130339032 | SERVER AND METHOD OF CONTROLLING THE SAME - A server which interacts with a display apparatus is provided. The server includes a storage unit configured to store conversation patterns for each service domain, a communication unit configured to receive a user's voice from the display apparatus, and a control unit configured to determine a service domain including the user's voice, generate response information corresponding to the user's voice based on a conversation pattern of the determined service domain, and to control the communication unit to transmit the response information to the display apparatus. When it is determined that a currently received user's voice is included in another service domain which is different from a service domain including a previously received user's voice, the control unit generates the response information corresponding to the currently received user's voice based on a conversation pattern of the other service domain. | 12-19-2013 |
20130339033 | DYNAMICALLY EXTENDING THE SPEECH PROMPTS OF A MULTIMODAL APPLICATION - A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt. | 12-19-2013 |
20130346084 | Enhanced Accuracy of User Presence Status Determination - Technologies are described herein for enhancing a user presence status determination. Visual data may be received from a depth camera configured to be arranged within a three-dimensional space. A current user presence status of a user in the three-dimensional space may be determined based on the visual data. A previous user presence status of the user may be transformed to the current user presence status, responsive to determining the current user presence status of the user. | 12-26-2013 |
20130346085 | MOUTH CLICK SOUND BASED COMPUTER-HUMAN INTERACTION METHOD, SYSTEM AND APPARATUS - Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, and including hardware devices performing a mouth click sound based human-device interaction. One aspect includes receiving at least one mouth click sound signal from a human user, by an acoustic-to-electric sensor of a computing device, and processing the received signal. The received mouth click sound signal may be accompanied by other mouth click sound signals and other interaction signals. | 12-26-2013 |
20130346086 | Method and System for Providing an Automated Web Transcription Service - A system, method and computer readable medium that provides an automated web transcription service is disclosed. The method may include receiving input speech from a user using a communications network, recognizing the received input speech, understanding the recognized speech, transcribing the understood speech to text, storing the transcribed text in a database, receiving a request via a web page to display the transcribed text, retrieving transcribed text from the database, and displaying the transcribed text to the requester using the web page. | 12-26-2013 |
20140006033 | METHOD AND APPARATUS FOR PROCESSING MULTIPLE INPUTS | 01-02-2014 |
20140006034 | CALL REGISTRATION DEVICE FOR ELEVATOR | 01-02-2014 |
20140012587 | METHOD AND APPARATUS FOR CONNECTING SERVICE BETWEEN USER DEVICES USING VOICE - A method of connecting a service between a device and at least one other device is provided. The method includes recording, by the device, a user voice input in a state where a voice command button has been input, outputting first information based on the recorded user voice when an input of the voice command button is cancelled, receiving, by the device, second information corresponding to the first information, recognizing a service type according to the first information and the second information, connecting the device to a subject device in an operation mode of the device determined according to the recognized service type, and performing a service with the connected subject device. | 01-09-2014 |
20140019140 | METHOD FOR CONTROLLING EXTERNAL INPUT AND BROADCAST RECEIVING APPARATUS - A method for controlling an external input and a broadcast receiving apparatus are provided. The method includes: setting a call word of an external input apparatus connected through an external input terminal; associating the call word with the external input terminal and storing the call word and the external input terminal in association with each other; in response to a voice of a user being input, recognizing the voice to determine whether the voice includes the call word; and in response to determining the voice includes the call word, enabling the external input terminal corresponding to the call word to communicate with the external input apparatus using the external input terminal corresponding to the call word. | 01-16-2014 |
20140019141 | METHOD FOR PROVIDING CONTENTS INFORMATION AND BROADCAST RECEIVING APPARATUS - A method of providing contents information and broadcast receiving apparatus are provided. The method of providing contents information includes requesting, according to user input, a contents providing server to perform a contents search; receiving contents data on contents searched in response to the contents search request from the contents providing server; converting the contents data into audio data using a Text-To-Speech technology; and processing the audio data and outputting the processed audio data, according to at least one characteristic of the searched contents and/or user input. | 01-16-2014 |
20140032223 | VOICE ACTIVATED PHARMACEUTICAL PROCESSING SYSTEM - The embodiments disclosed herein relate to a system and method for processing a prescription through voice-activated commands. The system and method efficiently and effectively process the prescription so that a pharmacy may handle the increasing prescription processing demands. | 01-30-2014 |
20140032224 | METHOD OF CONTROLLING ELECTRONIC APPARATUS AND INTERACTIVE SERVER - A method of controlling an electronic apparatus is provided. The method includes: inputting a user message; comparing the input user message with stored information; and outputting a response message and one of a plurality of inquiry messages in response to the input user message and based on a result of the comparing, wherein the outputting the one of the plurality of inquiry messages is also based on a plurality of priorities. | 01-30-2014 |
20140039898 | METHODS AND APPARATUS FOR VOICED-ENABLING A WEB APPLICATION - Methods and apparatus for voice-enabling a web application, wherein the web application includes one or more web pages rendered by a web browser on a computer. At least one information source external to the web application is queried to determine whether information describing a set of one or more supported voice interactions for the web application is available, and in response to determining that the information is available, the information is retrieved from the at least one information source. Voice input for the web application is then enabled based on the retrieved information. | 02-06-2014 |
20140039899 | INDEXING DIGITIZED SPEECH WITH WORDS REPRESENTED IN THE DIGITIZED SPEECH - Indexing digitized speech with words represented in the digitized speech, with a multimodal digital audio editor operating on a multimodal device supporting modes of user interaction, the modes of user interaction including a voice mode and one or more non-voice modes, the multimodal digital audio editor operatively coupled to an ASR engine, including providing by the multimodal digital audio editor to the ASR engine digitized speech for recognition; receiving in the multimodal digital audio editor from the ASR engine recognized user speech including a recognized word, also including information indicating where, in the digitized speech, representation of the recognized word begins; and inserting by the multimodal digital audio editor the recognized word, in association with the information indicating where, in the digitized speech, representation of the recognized word begins, into a speech recognition grammar, the speech recognition grammar voice enabling user interface commands of the multimodal digital audio editor. | 02-06-2014 |
20140039900 | Systems and Methods For Haptic Confirmation Of Commands - Systems and methods for haptic confirmation of commands are disclosed. For example, a system for generating haptic effects to confirm receipt of a voice command includes a microphone; a housing configured to be contacted by a user; and an actuator in communication with the housing, the actuator configured to output a haptic effect to the housing. The system also includes a processor in communication with the microphone and the actuator, the processor configured to receive speech information from the microphone, recognize the speech information, and determine a command associated with the speech information. If the speech information is recognized and the command is determined, the processor is configured to generate a first actuator signal configured to cause the actuator to output a first haptic effect, and transmit the first actuator signal to the actuator. Otherwise, the processor is configured to generate a second actuator signal configured to cause the actuator to output a second haptic effect, and transmit the second actuator signal to the actuator. | 02-06-2014 |
20140052450 | USER INTERFACE FOR ENTERTAINMENT SYSTEMS - Methods and apparatus for providing a search interface for an electronic device including a tuner configured to tune the electronic device to receive scheduled programming content. A search query is received and one or more data sources including information about media content are searched based, at least in part, on the search query. The results of the search are presented on a user interface using a time-based axis and a time-independent axis. | 02-20-2014 |
20140052451 | USER INTERFACE FOR ENTERTAINMENT SYSTEMS - Methods and apparatus for providing a search interface for an electronic device including a tuner configured to tune the electronic device to receive scheduled programming content. A search query is received and one or more data sources including information about media content are searched based, at least in part, on the search query. The results of the search are presented on a user interface using a time-based axis and a time-independent axis. | 02-20-2014 |
20140052452 | USER INTERFACE FOR ENTERTAINMENT SYSTEMS - Methods and apparatus for providing a search interface for an electronic device including a tuner configured to tune the electronic device to receive scheduled programming content. A search query is received and one or more data sources including information about media content are searched based, at least in part, on the search query. The results of the search are presented on a user interface using a time-based axis and a time-independent axis. | 02-20-2014 |
20140052453 | USER INTERFACE FOR ENTERTAINMENT SYSTEMS - Methods and apparatus for searching for content to display on a digitally-tunable electronic device configured to display scheduled programming content. The method comprises receiving a search query from a user, and determining, based on the search query, an action the user wants to perform. The method further comprises determining one or more data sources to search based, at least in part, on the action the user wants to perform, and searching based, at least in part, on the search query, the one or more data sources for the content to display on the electronic device. | 02-20-2014 |
20140067403 | MANAGING SPEECH INTERFACES TO COMPUTER-BASED SERVICES - A method of managing speech interfaces to computer-based services includes beginning a first speech session that is carried out in a vehicle over a short-range wireless connection between a vehicle occupant and a mobile device; detecting an initiation of a second speech session while the first speech session is being carried out; determining an assigned priority level of the first speech session relative to an assigned priority level of the second speech session; and when the assigned priority level of the second speech session has a higher priority than the assigned priority level of the first speech session, carrying out a session-appropriate action on the first speech session. | 03-06-2014 |
20140074480 | VOICE STAMP-DRIVEN IN-VEHICLE FUNCTIONS - In-vehicle functions are implemented using a plurality of microphones disposed in a vehicle. Each of the microphones is disposed in a portion of the vehicle defined by a zone. The in-vehicle functions are also implemented via a central controller of the vehicle. The central controller includes a computer processor executing logic. The logic receives a voice communication from an individual via one of the microphones, identifies the zone in the vehicle occupied by the individual, identifies the individual by comparing a voice stamp from the voice communication to a database of voice stamps, and implements at least one vehicle electronic component in the zone based on user preferences associated with the voice stamp. | 03-13-2014 |
20140074481 | Wave Analysis for Command Identification - A method is disclosed for identifying a spoken command by detecting intervals of voiced and unvoiced sound, and then comparing the order of voiced and unvoiced sounds to a set of templates. Each template represents one of the predetermined acceptable commands of the application, and is associated with a predetermined action. When the order of voiced and unvoiced intervals in the spoken command matches the order in one of the templates, the associated action is thus selected. Silent intervals in the command may also be included for enhanced recognition. Efficient protocols are disclosed for discriminating voiced and unvoiced sounds, and for detecting the beginning and ending of each sound interval in the command, and for comparing the command sequence to the templates. In a sparse-command application, this method provides fast and robust recognition, and can be implemented with low-cost hardware and extremely minimal software. | 03-13-2014 |
20140074482 | VOICE GUIDANCE SYSTEM AND ELECTRONIC EQUIPMENT - A voice guidance system is provided in which the voice guidance is enabled to easily follow a trend of change intervals, a rapid change of change intervals, etc. in a menu operation. The voice guidance system is configured with an input analyzing unit which inputs and analyzes an operation instruction signal of a menu item, a voice guidance control unit which controls voice guidance of the menu item according to the analysis result by the input analyzing unit, and a textual guidance control unit which performs display control of the menu item according to the analysis result by the input analyzing unit. The voice guidance control unit determines reproduction speed of the voice guidance according to the analysis result, on the basis of a speed trend obtained from a speed history as a set of plural pieces of reproduction speed information. | 03-13-2014 |
20140074483 | Context-Sensitive Handling of Interruptions by Intelligent Digital Assistant - Methods and systems related to intelligent interruption handling by digital assistants are disclosed. In some embodiments, a first information provision process is initiated in response to a first speech input. The first information provision process comprises preparing a first response and a second response to the first speech input. After or concurrent with the provision of the first response to the user, but before provision of the second response to the user, an event operable to initiate a second information provision process is detected. The second information provision process is initiated in response to detecting the event. The second information provision process comprises preparing a third response to the event. A relative urgency between the second response and the third response is determined. One of the second response and the third response is provided to the user in an order based on the determined relative urgency. | 03-13-2014 |
20140081644 | Method and Device for Voice Operated Control - At least one exemplary embodiment is directed to a method and device for voice operated control. The method can include measuring a first sound received from a first microphone, measuring a second sound received from a second microphone, detecting a spoken voice based on an analysis of measurements taken at the first and second microphone, mixing the first sound and the second sound to produce a mixed signal, and controlling the production of the mixed signal based on one or more aspects of the spoken voice. | 03-20-2014 |
20140095171 | SYSTEMS AND METHODS FOR PROVIDING A VOICE AGENT USER INTERFACE - Some embodiments provide techniques performed by at least one voice agent. The techniques include receiving voice input specifying a requested action; and identifying a subject of the requested action from the voice input and information relating to a prior action invoked by the at least one voice agent, wherein the information identifies a subject of the prior action. | 04-03-2014 |
20140095172 | SYSTEMS AND METHODS FOR PROVIDING A VOICE AGENT USER INTERFACE - Some embodiments provide techniques performed by at least one voice agent. The techniques include receiving voice input from a user at least partially specifying a requested action to be performed at least in part by an application program, wherein the requested action requires a plurality of inputs to be fully specified; and in response to receiving the voice input, making the application program accessible to the user prior to completion of performance of the requested action, so as to enable the user to provide and/or edit at least one input of the plurality of inputs by directly interacting with the application program. | 04-03-2014 |
20140095173 | SYSTEMS AND METHODS FOR PROVIDING A VOICE AGENT USER INTERFACE - Some embodiments provide techniques performed by at least one voice agent. The techniques include receiving voice input; identifying at least one application program as relating to the received voice input; and displaying at least one selectable visual representation that, when selected, causes focus of the computing device to be directed to the at least one application program identified as relating to the received voice input. | 04-03-2014 |
20140095174 | ELECTRONIC DEVICE, SERVER AND CONTROL METHOD THEREOF - Provided are a display apparatus, a control method thereof, a server, and a control method thereof. The display apparatus includes: a processor which processes a signal; a display which displays an image based on the processed signal; a command receiver which receives a voice command; a communicator which communicates with a first server; a storage; and a controller which receives, from the first server, a voice recognition command list comprising a voice recognition command and control command information corresponding to the voice recognition command, and stores the received voice recognition command list in the storage, the voice recognition command being among user's voice commands which have successfully been recognized a predetermined number of times or more, determines whether the voice command corresponds to the voice recognition command included in the voice recognition command list, and if so, controls the processor to operate based on the control command information, and if not, transmits the voice command to the first server, receives corresponding control command information from the first server, and controls the processor to operate based on the received control command information. | 04-03-2014 |
20140095175 | IMAGE PROCESSING APPARATUS AND CONTROL METHOD THEREOF AND IMAGE PROCESSING SYSTEM - An image processing apparatus including: image processor which processes broadcasting signal, to display image based on processed broadcasting signal; communication unit which is connected to a server; a voice input unit which receives a user's speech; a voice processor which processes a performance of a preset corresponding operation according to a voice command corresponding to the speech; and a controller which processes the voice command corresponding to the speech through one of the voice processor and the server if the speech is input through the voice input unit. If the voice command includes a keyword relating to a call sign of a broadcasting channel, the controller controls one of the voice processor and the server to select a recommended call sign corresponding to the keyword according to a predetermined selection condition, and performs a corresponding operation under the voice command with respect to the broadcasting channel of the recommended call sign. | 04-03-2014 |
20140095176 | ELECTRONIC DEVICE, SERVER AND CONTROL METHOD THEREOF - Provided are a display apparatus, a control method thereof, a server, and a control method thereof. The display apparatus includes: a processor which processes a signal; a display which displays an image based on the processed signal; a first command receiver which receives a voice command; a storage which stores a plurality of voice commands said by a user; a second command receiver which receives a user's manipulation command; and a controller which, upon receiving the voice command, displays a list of the stored plurality of voice commands, selects one of the plurality of voice commands of the list according to the received user's manipulation command and controls the processor to process based on the selected voice command. | 04-03-2014 |
20140095177 | ELECTRONIC APPARATUS AND CONTROL METHOD OF THE SAME - An electronic apparatus includes a voice acquirer which receives a first voice, a voice processor which processes a voice signal, a communication unit which communicates with at least one external electronic apparatus and receives information on at least one second voice, and a controller which determines whether the first voice is a user's command based on the information on at least one second voice transmitted by the communication unit, and if the first voice is not the user's command, does not perform an operation according to the first voice. | 04-03-2014 |
20140100854 | SMART SWITCH WITH VOICE OPERATED FUNCTION AND SMART CONTROL SYSTEM USING THE SAME - A smart switch applied to a smart control system in a smart house, includes a storage, a voice input unit configured to receive vocal commands and convert the vocal commands to electronic data, and a remote control unit. A processor unit which includes a voice identifying module, a determining module, and a control module is also included. The smart switch recognizes a voice command and sends a remote control command to the target electronic devices, thereby controlling the electronic devices to execute an operation. A smart control system is also provided. | 04-10-2014 |
20140108018 | SUBSCRIPTION UPDATES IN MULTIPLE DEVICE LANGUAGE MODELS - Systems and methods for intelligent language models that can be used across multiple devices are provided. Some embodiments provide for a client-server system for integrating change events from each device running a local language processing system into a master language model. The change events can be integrated, not only into the master model, but also into each of the other local language models. As a result, some embodiments enable restoration to new devices as well as synchronization of usage across multiple devices. In addition, real-time messaging can be used on selected messages to ensure that high priority change events are updated quickly across all active devices. Using a subscription model driven by a server infrastructure, utilization logic on the client side can also drive selective language model updates. | 04-17-2014 |
20140108019 | Smart Home Automation Systems and Methods - A smart home interaction system is presented. It is built on a multi-modal, multithreaded conversational dialog engine. The system provides a natural language user interface for the control of household devices, appliances or household functionality. The smart home automation agent can receive input from users through sensing devices such as a smart phone, a tablet computer or a laptop computer. Users interact with the system from within the household or from remote locations. The smart home system can receive input from sensors or any other machines with which it is interfaced. The system employs interaction guide rules for processing reaction to both user and sensor input and driving the conversational interactions that result from such input. The system adaptively learns based on both user and sensor input and can learn the preferences and practices of its users. | 04-17-2014 |
20140114665 | KEYWORD VOICE ACTIVATION IN VEHICLES - Systems and methods for keyword voice activation in vehicles are provided. In one example, a system comprises one or more microphones, a voice monitoring device, and an automatic speech recognition (ASR) system. The voice monitoring device can receive an acoustic signal from the microphones. A noise in the acoustic signal is reduced or suppressed to obtain a clean speech component. The ASR system may detect one or more keywords in the clean speech component and provide a command associated with the one or more keywords to vehicle systems. The system can associate a profile with the one or more keywords. The profile can include parameters specific to one operator or a group of operators. The parameters associated with the operator's profile can be used in the noise suppression, identification of the operator, and/or detecting keywords in the clean speech component. | 04-24-2014 |
20140122084 | Data Search Service - In an embodiment, speech may be acquired from a user. A concept, that may be associated with the user, may be identified from the acquired speech. The concept may be identified by fuzzy matching one or more words in the acquired speech with data contained in a data store. The data store may be associated with the user. An action may be performed based on the identified concept. | 05-01-2014 |
20140122085 | Voice Controlled Vibration Data Analyzer Systems and Methods - Embodiments of the present general inventive concept provide a voice controlled vibration data analyzer system, including a vibration sensor to detect vibration data from a machine-under-test, a data acquisition unit to receive the vibration data from the vibration sensor, and a control unit having a user interface to receive manual and audio input from a user, and to communicate information relating to the machine-under-test, the control unit executing commands in response to the manual or audio input to control the data acquisition unit and/or user interface to output an audio or visual message relating to a navigation path of multiple machines to be tested, to collect and process the vibration data, and to receive manual or audio physical observations from the user to characterize collected vibration data. | 05-01-2014 |
20140122086 | AUGMENTING SPEECH RECOGNITION WITH DEPTH IMAGING - Embodiments related to the use of depth imaging to augment speech recognition are disclosed. For example, one disclosed embodiment provides, on a computing device, a method including receiving depth information of a physical space from a depth camera, receiving audio information from one or more microphones, identifying a set of one or more possible spoken words from the audio information, determining a speech input for the computing device based upon comparing the set of one or more possible spoken words from the audio information and the depth information, and taking an action on the computing device based upon the speech input determined. | 05-01-2014 |
20140122087 | METHOD AND APPARATUS FOR ACTIVATING A PARTICULAR WIRELESS COMMUNICATION DEVICE TO ACCEPT SPEECH AND/OR VOICE COMMANDS - An apparatus, method, and computer program for initiating a word spotting algorithm ( | 05-01-2014 |
20140122088 | IMAGE PROCESSING APPARATUS AND CONTROL METHOD THEREOF AND IMAGE PROCESSING SYSTEM - An image processing apparatus is provided, the image processing apparatus includes: a voice input which receives a user's speech; a voice processor which performs a preset operation according to a voice command corresponding to the user's speech; and a controller which adjusts the preset operation of the voice command if the user's speech input into the voice input does not match the preset operation determined by the voice processor, and performs the adjusted preset operation that matches the user's speech according to the adjustment result. | 05-01-2014 |
20140122089 | IMAGE PROCESSING APPARATUS AND CONTROL METHOD THEREOF AND IMAGE PROCESSING SYSTEM - An image processing apparatus is provided, the image processing apparatus includes: a voice input which receives a user's speech; a voice processor which performs a preset operation according to a voice command corresponding to the user's speech; and a controller which adjusts the preset operation of the voice command if the user's speech input into the voice input does not match the preset operation determined by the voice processor, and performs the adjusted preset operation that matches the user's speech according to the adjustment result. | 05-01-2014 |
20140122090 | ELECTRONIC DEVICE AND METHOD FOR RECOGNIZING VOICE - An electronic device and a method for recognizing a voice are provided. An operating method of the electronic device includes detecting, at at least one of two or more first sensors disposed in a preset region, an amount of charge transfer over a preset value; when detecting the amount of the charge transfer over the preset value, detecting, at one of two or more second sensors disposed in a preset distance from two or more microphones, an object in a preset distance; and collecting, at one of the two or more microphones, the one disposed in a preset distance from the second sensor detecting the object in the preset distance, a voice. | 05-01-2014 |
20140122091 | ESTABLISHING A MULTIMODAL PERSONALITY FOR A MULTIMODAL APPLICATION IN DEPENDENCE UPON ATTRIBUTES OF USER INTERACTION - Establishing a multimodal personality for a multimodal application, including evaluating, by the multimodal application, attributes of a user's interaction with the multimodal application; selecting, by the multimodal application, a vocal demeanor in dependence upon the values of the attributes of the user's interaction with the multimodal application; and incorporating, by the multimodal application, the vocal demeanor into the multimodal application. | 05-01-2014 |
20140122092 | PERSONAL AUDIO ASSISTANT DEVICE AND METHOD - A computer readable medium containing instructions for controlling an electronic device causes one or more processors to perform operations including receiving an ambient signal, receiving a desired signal, combining the ambient signal with the desired signal to generate a mixed signal, and initiating control of audio content or at least one operation of the electronic device in response to at least one voice command detected in the mixed signal. In another aspect, the one or more processors apply at least one among active noise reduction, echo cancellation, or signal cancellation to the ambient signal using an environmentally customized filter to provide a filtered signal and initiate control of audio content or at least one operation of an electronic device. Other embodiments are disclosed. | 05-01-2014 |
20140129232 | Automatic Display of User-Specific Financial Information Based on Audio Content Recognition - Aspects herein describe at least a new method, system, and computer readable storage media for recognizing the content of the audio. A computing device determines whether the content comprises one or more financial products and services offered by a financial institution, correlates the one or more financial products and services with a profile of a person, determines a subset of the one or more financial products and services that are of interest to the person based on the correlation, and transmits data related to the subset to a television for viewing by the person. The subset of the one or more products and services are displayed on a portion of the screen of the television. | 05-08-2014 |
20140129233 | APPARATUS AND SYSTEM FOR USER INTERFACE - Disclosed are an apparatus and a system for a user interface. The apparatus for the user interface comprises a body unit including a groove which corresponds to a structure of an oral cavity and is operable to be mounted on an upper part of the oral cavity; a user input unit receiving a signal from the user's tongue in a part of the body unit; a communication unit transmitting the signal received from the user input unit; and a charging unit supplying electrical energy generated from vibration or pressure caused by movement of the user's tongue. | 05-08-2014 |
20140129234 | ELECTRONIC APPARATUS AND METHOD OF CONTROLLING ELECTRONIC APPARATUS - An electronic apparatus and a method of controlling the electronic apparatus are provided. The method includes: receiving a voice command; and if the voice command is a first voice start command, changing a mode of the electronic apparatus to a first voice task mode in which the electronic apparatus is controlled according to further voice input, and if the voice command is a second voice start command, changing the mode of the electronic apparatus to a second voice task mode in which the electronic apparatus is controlled according to the further voice input received via an external apparatus which operates with the electronic apparatus. This provides efficiency and flexibility in controlling the electronic apparatus by using a microphone of the electronic apparatus or a microphone of the external apparatus. | 05-08-2014 |
20140136210 | SYSTEM AND METHOD FOR ROBUST PERSONALIZATION OF SPEECH RECOGNITION - Personalization of speech recognition while maintaining privacy of user data is achieved by transmitting data associated with received speech to a speech recognition service and receiving a result from the speech recognition service. The speech recognition service result is generated from a general purpose speech language model. The system generates an input finite state machine from the speech recognition result and composes the input finite state machine with a phone edit finite state machine, to yield a resulting finite state machine. The system composes the resulting finite state machine with a user data finite state machine to yield a second resulting finite state machine, and uses a best path through the second resulting finite state machine to yield a user specific speech recognition result. | 05-15-2014 |
20140136211 | VOICE CONTROL ON MOBILE INFORMATION DEVICE - A method for controlling a mobile information device based on verbal input from a user is presented. The method comprises waiting for a predetermined verbal input from a user. The method further comprises controlling a functional module of the mobile information device to determine a value within a predetermined range for a functional parameter in response to a first portion of the verbal input. Finally, the method comprises executing a functional operation by the functional module based on a determined value, in response to a second portion of the verbal input, wherein the second portion follows the first portion. | 05-15-2014 |
20140136212 | SPOKEN DIALOG SYSTEM BASED ON DUAL DIALOG MANAGEMENT USING HIERARCHICAL DIALOG TASK LIBRARY - The present invention relates to a spoken dialog system and method based on dual dialog management using a hierarchical dialog task library that may increase reutilization of dialog knowledge by constructing and packaging the dialog knowledge based on a task unit having a hierarchical structure, and may construct and process the dialog knowledge using a dialog plan scheme about relationship therebetween by classifying the dialog knowledge based on a task unit to make design of a dialog service convenient, which is different from an existing spoken dialog system in which it is difficult to reuse dialog knowledge since a large amount of construction costs and time is required. | 05-15-2014 |
20140136213 | MOBILE TERMINAL AND CONTROL METHOD THEREOF - A mobile terminal according to an embodiment of the present disclosure may include a microphone configured to receive a user's voice; a user input unit configured to sense a user's input; a controller configured to start a first operation in response to the user's input, and execute a voice recognition mode prior to completing the first operation, and recognize voice received through the microphone during the execution of the voice recognition mode to generate recognition result information, and execute a second operation based on the recognition result information; a display unit configured to display a loading screen image until at least one of the first and the second operation is completed, and display a second execution screen image based on the second operation more preferentially than a first execution screen image based on the execution result of the first operation when the second operation is completed. | 05-15-2014 |
20140136214 | ADAPTATION METHODS AND SYSTEMS FOR SPEECH SYSTEMS - Methods and systems are provided for adapting a speech system of a vehicle. In one example a method includes: logging data from the vehicle; logging speech data from the speech system; processing the data from the vehicle and the data from the speech system to determine a pattern of context and a relation to user interaction behavior; and selectively updating a user profile of the speech system based on the pattern of context. | 05-15-2014 |
20140136215 | Information Processing Method And Electronic Apparatus - The present invention provides information processing method and electronic apparatus. The method is applied in an electronic apparatus having voice recognition service, and the method includes: obtaining first voice information; recognizing the first voice information by a first recognition model to obtain a first recognition result; deciding whether the first recognition result conforms to a first preset condition; recognizing the first voice information by a second recognition model different from the first recognition model to obtain a second recognition result when the first recognition result conforms to the first preset condition; and controlling the electronic apparatus to execute a corresponding control instruction based on the second recognition result. | 05-15-2014 |
20140142949 | Voice-Activated Signal Generator - A voice-activated signal generator is a device to produce output signals responsive to spoken commands. The device accepts only predetermined commands and responsively generates specific output signals such as a pulse, a series of pulses, a voltage level, or a periodic waveform. The device is suitable for triggering an oscilloscope, or controlling a circuit under test, or activating another instrument. The invention also enables safely controlling a hazardous system such as a high voltage system, hands-free and with precise timing determined by the user. Also disclosed are fast, compact, robust algorithms for analyzing spoken commands, and particularly for detecting voiced and unvoiced sound, and for identifying commands by comparing the order of sound intervals in the spoken command to templates that represent the predetermined commands. The device may have one output or multiple outputs in parallel, all controlled by voice commands with precision output timing. | 05-22-2014 |
20140142950 | INTERLEAVING VOICE COMMANDS FOR ELECTRONIC MEETINGS - A method, computer program product, and system for identifying collaborators is described. A command precursor associated with delivery of a voice command associated with an electronic meeting is received. An audio signal including the voice command is received. A portion of the audio signal is identified as representing the voice command, based upon, at least in part, receiving the command precursor. The voice command is interpreted. The interpreted voice command is caused to be executed. | 05-22-2014 |
20140142951 | INTERLEAVING VOICE COMMANDS FOR ELECTRONIC MEETINGS - A method, computer program product, and system for identifying collaborators is described. A command precursor associated with delivery of a voice command associated with an electronic meeting is received. An audio signal including the voice command is received. A portion of the audio signal is identified as representing the voice command, based upon, at least in part, receiving the command precursor. The voice command is interpreted. The interpreted voice command is caused to be executed. | 05-22-2014 |
20140142952 | ENHANCED INTERFACE FOR USE WITH SPEECH RECOGNITION - Improved methods of presenting speech prompts to a user as part of an automated system that employs speech recognition or other voice input are described. The invention improves the user interface by providing in combination with at least one user prompt seeking a voice response, an enhanced user keyword prompt intended to facilitate the user selecting a keyword to speak in response to the user prompt. The enhanced keyword prompts may be the same words as those a user can speak as a reply to the user prompt but presented using a different audio presentation method, e.g., speech rate, audio level, or speaker voice, than used for the user prompt. In some cases, the user keyword prompts are different words from the expected user response keywords, or portions of words, e.g., truncated versions of keywords. | 05-22-2014 |
20140142953 | MOBILE TERMINAL AND CONTROLLING METHOD THEREOF - A mobile terminal including a microphone configured to receive a voice input; a touchscreen configured to display information; and a controller configured to activate a voice recognition mode on the mobile terminal for receiving the voice input from the microphone, receive the voice input indicating a particular function on the mobile terminal is to be executed, execute the particular function indicated by the received voice input, if the voice recognition mode is interrupted while the particular function is being executed, determine whether the particular function is in a complete state or an incomplete state, if the particular function is in the incomplete state, display a display object corresponding to the particular function in the incomplete state, and resume the particular function and activate the microphone for receiving additional voice input to complete the particular function. | 05-22-2014 |
20140149122 | VOICE CONTROL DEVICE AND VOICE CONTROL METHOD - A voice control device and a corresponding voice control method are provided. The voice control device includes a sound receiver, a sound converter, a voice identifier, and a central processing unit (CPU). The sound receiver receives a first sound signal. The sound converter converts the first sound signal from analog signal to digital signal. The voice identifier identifies a first voice signal from the first sound signal, performs a first comparison on the first voice signal and a second voice signal, and generates a wake-up signal according to the first comparison. When receiving the wake-up signal, the CPU enters a working state from a sleeping state, performs a second comparison on the first voice signal and the second voice signal, and takes over the voice input from the sound receiver and the sound converter according to the second comparison. | 05-29-2014 |
20140156281 | VOICE-CONTROLLED CONFIGURATION OF AN AUTOMATION SYSTEM - Methods and apparatus are provided for configuring control of an automation system for a home or other space, using audio input to a controller. Activation of an appliance in the automation system initiates the providing of the capabilities of the appliance to the controller and a data collection process via an audible interface. Audible user input is converted to an audio signal, and then processed by the controller to determine control input for the appliance. The audible input may also be used for user authentication. Subsequently, the controller controls the appliance based on the control input. | 06-05-2014 |
20140156282 | METHOD AND SYSTEM FOR CONTROLLING TARGET APPLICATIONS BASED UPON A NATURAL LANGUAGE COMMAND STRING - Disclosed is a method and system for controlling applications based upon a natural language command string. Embodiments may utilize skills of expert users of one or more target applications to create a domain specific language definition. An embodiment may then permit a less sophisticated user to control target applications using natural language command strings. An embodiment may process the natural language command string to obtain the complex code and/or configurations necessary to control the target applications. During the processing, each word (i.e., token/element) of the natural language command string is processed and compared with the domain specific language definition, which provides cardinal, order-of-operation, and other applicable data for each token/element, as well as translation procedures (i.e., jobs) that when run for each token/element provide the translation for the natural language command string. An embodiment may also permit a job to create new grammar to be evaluated recursively with additional jobs. | 06-05-2014 |
20140156283 | ACCESSING AN AUTOMOBILE WITH A TRANSPONDER - An automobile to detect a signal, having a security code, from a mobile transponder within range of the automobile is disclosed. The automobile may determine that the security code is valid to process an audible command from a user. The audible command may correspond to an automobile function. The automobile may also determine if the audible command matches a voiceprint of the user and process the automobile function accordingly. The automobile function may provide customized user settings for utilization of the automobile. | 06-05-2014 |
20140163994 | METHOD OF IDENTIFYING CONTACTS FOR INITIATING A COMMUNICATION USING SPEECH RECOGNITION - A method and system on an electronic device which uses speech recognition to initiate a communication from a mobile device having access to contact information for a number of contacts. The method includes: receiving through an audio input interface a voice input for initiating a communication, extracting from the voice input a type of communication and at least part of a contact name, and outputting, to an output interface, a selectable list of all contacts from the contact information which have the part of the contact name and which have a contact address associated with the type of communication. The mobile device may also be configured to access remote contact information from a remote server. | 06-12-2014 |
20140163995 | VOICE CONTROLLED WIRELESS COMMUNICATION DEVICE SYSTEM - A wireless communication device that accepts recorded audio data from an end-user. The audio data can be in the form of a command requesting user action. The audio data is reduced to a digital file in a format that is supported by the device hardware. The digital file is sent via wireless communication to at least one server computer for further processing. The command includes a unique device identifier that identifies the wireless communication device. The server computer determines required additional processing for the command based on the unique device identifier. The server computer constructs an application command based on the processed command, and transmits the application command to the wireless communication device. The application command includes at least one instruction that causes a corresponding application on the wireless communication device to execute the application command. | 06-12-2014 |
20140163996 | CONTROLLING A SET-TOP BOX VIA REMOTE SPEECH RECOGNITION - A device may receive over a network a digitized speech signal from a remote control that accepts speech. In addition, the device may convert the digitized speech signal into text, use the text to obtain command information applicable to a set-top box, and send the command information to the set-top box to control presentation of multimedia content on a television in accordance with the command information. | 06-12-2014 |
20140163997 | METHOD AND DEVICE FOR CHANGING DYNAMIC DISPLAY EFFECT OF MOBILE PHONE APPLICATION BY WAY OF VOICE CONTROL - This disclosure relates to methods and devices for changing dynamic display effect of mobile phone application by way of voice control. The method includes a recording step, recording an audio file of a voice external to a mobile phone; a judgment step, calculating a voice energy value linearly dependent upon sound volume in the audio file, comparing the voice energy value to a pre-set noise threshold value and performing a rate calculation step when it is greater than the pre-set noise threshold value; a rate calculation step, calculating a corresponding changing rate according to the voice energy value; and an application display step, setting the changing rate of the current application as a calculated changing rate and displaying same for the current application. This disclosure increases the approaches whereby a user controls the dynamic application changing effect and it can be applied to a mobile phone without a touch screen. | 06-12-2014 |
20140172431 | MUSIC PLAYING SYSTEM AND MUSIC PLAYING METHOD BASED ON SPEECH EMOTION RECOGNITION - A music playing system and a music playing method suitable for playing music based on speech emotion recognition are provided. The music playing method includes following steps. A plurality of songs and song emotion coordinates of the songs mapping on an emotion coordinate graph are stored in a first database. Emotion recognition parameters are stored in a second database. A voice data is received and analyzed, and a current emotion coordinate of the voice data mapping on the emotion coordinate graph is obtained according to the second database. The setting of a target emotion coordinate is received. At least one specific song emotion coordinate closest to a cheer-up line connecting the current emotion coordinate and the target emotion coordinate is found. Songs corresponding to aforementioned emotion coordinates are sequentially played. | 06-19-2014 |
20140180697 | IDENTIFICATION OF UTTERANCE SUBJECTS - Features are disclosed for generating markers for elements or other portions of an audio presentation so that a speech processing system may determine which portion of the audio presentation a user utterance refers to. For example, an utterance may include a pronoun with no explicit antecedent. The marker may be used to associate the utterance with the corresponding content portion for processing. The markers can be provided to a client device with a text-to-speech (“TTS”) presentation. The markers may then be provided to a speech processing system along with a user utterance captured by the client device. The speech processing system, which may include automatic speech recognition (“ASR”) modules and/or natural language understanding (“NLU”) modules, can generate hints based on the marker. The hints can be provided to the ASR and/or NLU modules in order to aid in processing the meaning or intent of a user utterance. | 06-26-2014 |
20140180698 | INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD AND STORAGE MEDIUM - According to one embodiment, an information processing apparatus includes a display, a touch panel on the display, and a voice recognition module. The display is configured to display video. The touch panel is configured to detect a touch. The voice recognition module is configured to perform voice recognition processing based on a position of the touch detected by the touch panel. | 06-26-2014 |
20140188482 | VOICE CONTROL METHOD, DEVICE, AND RECORDING MEDIUM FOR THE SAME - A voice control method is provided. At least one object name-action prompt correspondence document is received and processed into an object name-action prompt correspondence document set that defines at least one object name and at least one corresponding action prompt. The object name-action prompt correspondence document set is processed to establish an object name-action prompt correspondence list. A voice is recognized as one or multiple voice recognition results to generate one or multiple corresponding candidate object names. At least one corresponding candidate action prompt is outputted according to the candidate object name(s) and the object name-action prompt correspondence list. A selected action prompt is received, and a module providing the selected action prompt is requested to execute an operation. | 07-03-2014 |
20140188483 | AUDIO DEVICE AND STORAGE MEDIUM - An audio device transfers voice data which requests playback, generated by a user to a portable player. The portable player performs voice recognition of the transferred voice data. Control for playback start is started in accordance with the content as the result of the voice recognition. The audio device starts monitoring for a playback state of the portable player. If the start of the playback operation is detected within a predetermined period, an output source of sound to be output is changed to the portable player. | 07-03-2014 |
20140188484 | USER INTERFACE FOR A REMOTE CONTROL APPLICATION - A hand-held electronic device having a remote control application user interface that functions to display operational mode information to a user. The graphical user interface may be used, for example, to set up the remote control application to control appliances for one or more users in one or more rooms, to perform activities, and to access favorites. The remote control application is also adapted to be upgradeable. Furthermore, the remote control application provides for the sharing of operational mode information. | 07-03-2014 |
20140188485 | CENTRAL CONTROLLER AND METHOD FOR CONTROLLING THE SAME - Disclosed is a central controller, comprising: a microphone; an audio output unit configured to output audible information; a wireless communication unit configured to communicate with an electronic device using wireless signals; an event unit configured to receive event information on an event generated from the electronic device, through the wireless communication unit; and a controller configured to control the audio output unit based on the received event information, such that a guide voice indicating a status of the electronic device is output, wherein once a voice for controlling the electronic device is input to the microphone, the wireless communication unit is controlled such that a control command corresponding to the voice is transmitted to the electronic device. | 07-03-2014 |
20140188486 | DISPLAY APPARATUS AND CONTROLLING METHOD THEREOF - Provided are a display apparatus and controlling method thereof. The display apparatus includes: a voice receiver which collects a user's utterance; a communicator which transmits the user's utterance to a dialogue type server, and receives response information generated based on the user's utterance; a storage unit which stores control information corresponding to each user's utterance; and a controller which determines whether the control information corresponding to the collected user's utterance is stored in the storage unit and performs operations corresponding to the user's utterance based on a result of the determination. | 07-03-2014 |
20140195246 | MIRROR WITH AUDIO-EMITTING DEVICE - A mirror has an audio-emitting device. A pre-recorded message plays aloud through a speaker on the frame of the mirror. The mirror has a voice or sound recognition actuation means. The message is preferably one of positive encouragement, and the preferred embodiment is for use as a therapeutic tool for treating individuals with low self-esteem or depression. | 07-10-2014 |
20140195247 | Bifurcated Speech Recognition - Presented are improvements for speech recognition systems used to control devices. Features include two-stage confirmation, two-stage limited speech recognition mode, and two-stage wake-up for speech driven applications and systems. A headset computer device includes such staged confirmation operation. | 07-10-2014 |
20140195248 | INTERACTIVE SERVER, DISPLAY APPARATUS, AND CONTROL METHOD THEREOF - An interactive server, a display apparatus, and a control method thereof are disclosed. An interactive server includes a communication unit configured to perform communication with a display apparatus and receive a voice command signal including a first command element representing a target and a second command element representing an execution command; a storage unit configured to store indicators and command words; an extraction unit configured to extract an indicator corresponding to the first command element and a command word corresponding to the second command element from the storage unit; and a controller configured to generate response information corresponding to the voice command signal by combining the extracted indicator and command word, and send the response information to the display apparatus, wherein the first command element is a command element that is determined based on a displaying status of objects displayed on a screen of the display apparatus. | 07-10-2014 |
20140195249 | INTERACTIVE SERVER, CONTROL METHOD THEREOF, AND INTERACTIVE SYSTEM - An interactive server, a control method thereof, and an interactive system are provided. The interactive server includes: a communicator which communicates with a display apparatus to receive a first uttered voice signal; a storage device which stores utterance history information of a second uttered voice signal received from the display apparatus before the first uttered voice signal is received; an extractor which extracts uttered elements from the received first uttered voice signal; and a controller which generates response information based on the utterance history information stored in the storage device and the extracted uttered elements and transmits the response information to the display apparatus. Therefore, the interactive server comprehends intentions of the user with respect to various uttered voices of the user to generate response information according to the intentions and transmits the response information to the display apparatus. | 07-10-2014 |
20140195250 | VOICE REMOTE CONTROL - A device may include a display and logic. The logic may be configured to receive a selection of a first control action associated with an application stored in the device, provide a number of choices associated with the first control action, and receive a word or a phrase to use as a voice command corresponding to the first control action, wherein the word or phrase is selected from the choices. The logic may also associate the word or phrase with the first control action, receive voice input from a user, identify the voice input as corresponding to the word or phrase, and perform the first control action based on the identified voice input. | 07-10-2014 |
20140195251 | SYSTEM AND METHOD FOR CUSTOMIZED PROMPTING - A method for providing an audible prompt to a user within a vehicle. The method includes retrieving one or more data files from a memory device. The data files define certain characteristics of an audio prompt. The method also includes creating the audio prompt from the data files and outputting the audio prompt as an audio signal. | 07-10-2014 |
20140195252 | SYSTEMS AND METHODS FOR HANDS-FREE NOTIFICATION SUMMARIES - A method includes outputting an alert corresponding to an information item. In some implementations, the alert is a sound. In some implementations, the alert is ambiguous (e.g., the sound indicates several possible information items). The method further includes receiving a speech input after outputting the alert. The method further includes determining whether the speech input includes a request for information about the alert. The method further includes, in response to determining that the speech input includes a request for information about the alert, providing a first speech output including information about the alert. | 07-10-2014 |
20140200898 | METHOD FOR CONTROLLING FUNCTIONAL DEVICES IN A VEHICLE DURING VOICE COMMAND OPERATION - For retrofitting an infotainment system of a motor vehicle which has the option of operating using voice commands, such voice commands are subsequently defined in an SCXML file. The voice commands are assigned state transitions, which are intended to run through the functional devices after the corresponding voice command is received. The SCXML file is interpreted by an interpreter. | 07-17-2014 |
20140207465 | Method and Apparatus for Incoming Audio Processing - A system includes a processor configured to receive a verbal request to activate an audio playback application on a remote device wirelessly connected to the processor. The processor is also configured to relay the request to the remote device for handling. The processor is further configured to receive a request from the remote device for audio playback. The processor is also configured to select a source channel for incoming audio. The processor is additionally configured to receive incoming audio from the remote device over the selected source channel and play back the incoming audio over a vehicle output. | 07-24-2014 |
20140207466 | METHOD AND SYSTEM FOR AUTOMATICALLY IDENTIFYING VOICE TAGS THROUGH USER OPERATION - A method for automatically identifying voice tags on an electronic device. After failure to initiate a communication using a voice input command, the user may then subsequently contact the recipient using an application program of the electronic device. The original audio of the voice input command may be identified as a potential voice tag for the now-identified recipient. The method includes: receiving, through a voice interface program, a voice input command, the voice input command including a command element and a content element; ending the voice interface program without performing the voice input command; receiving, through an application program, a user input which identifies data for executing an application program command; performing the application program command; and identifying audio of the content element as a voice tag associated with the data identified by the user input. | 07-24-2014 |
20140207467 | Hybrid Input Device For Touchless User Interface - An apparatus includes a sensor comprising a sensing film configured to provide a signal based upon a user's breath and a controller operably associated with the sensor. The controller is configured to receive the signal based upon the user's breath. | 07-24-2014 |
20140207468 | EVENT-TRIGGERED HANDS-FREE MULTITASKING FOR MEDIA PLAYBACK - A system and method are provided for hands-free operation of a device based on a context of an event. An example system configured to practice the method can detect an event during playback of media content to a user, and optionally output a first audible indication of the event. Based on the event, the system can activate a speech recognition application using a custom speech recognition grammar for recognizing a set of speech commands associated with the event. Then the system can optionally output a second audible indication of readiness to process speech in association with the event. The system can monitor, for a predetermined duration of time after the second audible indication, audio input received via the microphone to recognize a command via the speech recognition application, and execute the command. | 07-24-2014 |
20140207469 | REDUCING SPEECH SESSION RESOURCE USE IN A SPEECH ASSISTANT - A method of utilizing a speech assistant, the speech assistant designed to provide a voice input and speech output capability, the method comprising, enabling the use of the speech assistant for communication with a user, and terminating the speech assistant when the communication is complete. The method further comprises receiving a notification from a native application associated with the communication, and activating a sub-portion of the speech assistant, to enable outputting of the notification using speech output, thereby enabling the use of speech output for periodic announcements without enabling the speech assistant. | 07-24-2014 |
20140207470 | ELECTRONIC APPARATUS AND VOICE PROCESSING METHOD THEREOF - Apparatuses and methods related to an electronic apparatus and a voice processing method thereof are provided. More particularly, the apparatuses and methods relate to an electronic apparatus capable of recognizing a user's voice and a voice processing method thereof. An electronic apparatus includes: a voice recognizer configured to recognize a user's voice; a storage configured to have previously stored instructions; a function executor which performs a predetermined function; and a controller configured to control the function executor to execute the function in response to the instruction in response to a user's voice corresponding to the instruction being input, and to control the function executor to execute the function in accordance with results of an external server which analyzes a user's voice in response to a preset dialogue selection signal and a dialogue voice for executing the function being input by a user. | 07-24-2014 |
20140207471 | ARRANGEMENT FOR FACILITATING SELECTION AND ACTIVATION OF A VOICE CONTROL SYSTEM BY A VEHICLE OPERATOR - An arrangement is provided for facilitating selection and activation of a voice control system by a vehicle operator. The arrangement may include a switch selectively switchable between a first switch position (P | 07-24-2014 |
20140207472 | AUTOMATED COMMUNICATION INTEGRATOR - An apparatus includes a plurality of applications and an integrator having a voice recognition module configured to identify at least one voice command from a user. The integrator is configured to integrate information from a remote source into at least one of the plurality of applications based on the identified voice command. A method includes analyzing speech from a first user of a first mobile device having a plurality of applications, identifying a voice command based on the analyzed speech using a voice recognition module, and incorporating information from the remote source into at least one of a plurality of applications based on the identified voice command. | 07-24-2014 |
20140214428 | VOICE INPUT AND OUTPUT DATABASE SEARCH METHOD AND DEVICE - A voice input and output database search method that includes: extracting at least one candidate keyword represented by a word or phrase from text information included in a search result; memorizing the extracted candidate keyword and a search state including the search result from which the candidate keyword is extracted in correspondence; and after a new search condition that is different from the search condition and is provided by a voice message has been recognized and a search has been conducted in accordance with this recognized search condition, when a backtracking keyword represented by a word or phrase is recognized together with a backtracking directive that is represented by a word or phrase and backtracks to a search state, controlling on the basis of memory contents so as to backtrack to a search state memorized in correspondence with a candidate keyword that corresponds to the recognized backtracking keyword. | 07-31-2014 |
20140214429 | Method for Voice Activation of a Software Agent from Standby Mode - A method for voice activation of a software agent from a standby mode. In one embodiment, an audio recording ( | 07-31-2014 |
20140214430 | REMOTE CONTROL SYSTEM AND DEVICE - The present invention is for a system, method and device which provides a command signal corresponding to voice commands of a user. The invention includes an audio acquisition device for receiving an audible signal including a command and providing an electrical signal to a processor. The processor generates commands according to predetermined criteria, based on auditory association between the command of the plurality of commands. A visual display device displays a plurality of indicia, and a user operable selection and input device provides for user selection of commands by selection of the indicia corresponding to the command. | 07-31-2014 |
20140222435 | NAVIGATION SYSTEM WITH USER DEPENDENT LANGUAGE MECHANISM AND METHOD OF OPERATION THEREOF - A method of operation of a navigation system includes: providing a history list including a request having a tag; assigning a probability to the request based on the tag to create a speaker dependent model; providing a returned result generated from the speaker dependent model; and updating the request and the tag of the history list based on a user's confirmation of the returned result for displaying on a device. | 08-07-2014 |
20140222436 | VOICE TRIGGER FOR A DIGITAL ASSISTANT - A method for operating a voice trigger is provided. In some implementations, the method is performed at an electronic device including one or more processors and memory storing instructions for execution by the one or more processors. The method includes receiving a sound input. The sound input may correspond to a spoken word or phrase, or a portion thereof. The method includes determining whether at least a portion of the sound input corresponds to a predetermined type of sound, such as a human voice. The method includes, upon a determination that at least a portion of the sound input corresponds to the predetermined type, determining whether the sound input includes predetermined content, such as a predetermined trigger word or phrase. The method also includes, upon a determination that the sound input includes the predetermined content, initiating a speech-based service, such as a voice-based digital assistant. | 08-07-2014 |
20140244266 | Interaction with a Portion of a Content Item through a Virtual Assistant - Techniques for interacting with a portion of a content item through a virtual assistant are described herein. The techniques may include identifying a portion of a content item that is relevant to user input and causing an action to be performed related to the portion of the content item. The action may include, for example, displaying the portion of the content item on a smart device in a displayable format that is adapted to a display characteristic of the smart device, performing a task for a user that satisfies the user input, and so on. | 08-28-2014 |
20140244267 | INTEGRATION OF USER ORIENTATION INTO A VOICE COMMAND SYSTEM - Embodiments disclosed herein provide systems and methods for integrating user orientation into a voice command system. In a particular embodiment, a method provides receiving audio information spoken by a user during a time period and determining whether the audio information includes a voice command. The method further provides determining a first orientation of the user during the time period and complying with the voice command based on the first orientation. | 08-28-2014 |
20140244268 | METHOD AND APPARATUS FOR VOICE CONTROL OF A MOBILE DEVICE - A method and apparatus for voice control of a mobile device are provided. The method establishes a connection between the mobile device and a voice-control module. Responsive to establishing the connection, the mobile device enters into an intermediate mode; and the voice-control module monitors for verbal input comprising a verbal command from among a set of predetermined verbal commands. The voice-control module sends instructions to the mobile device related to the verbal command received; and the mobile device acts on the received instructions. An apparatus/voice control module (VCM) for voice control of a mobile device, wherein the VCM includes a connection module configured for establishing a connection between the VCM and the mobile device; a monitoring module configured for monitoring for a verbal command from among a set of predetermined verbal commands; and a communications module configured for sending instructions to the mobile device related to the verbal command received. | 08-28-2014 |
20140244269 | DEVICE AND METHOD FOR ACTIVATING WITH VOICE INPUT - An information processing apparatus that detects a voice command via a microphone in order to activate the device and execute certain applications. The apparatus comprises a digital signal processor (DSP) and a host controller which are responsible for processing the voice commands. The DSP recognizes and processes voice commands intermittently while the host processor is in a sleep state, thereby reducing the overall power consumption of the apparatus. Further, when the DSP is configured to recognize voice commands intended only to activate the device, a memory having a sufficiently lower storage capacity suffices. | 08-28-2014 |
20140244270 | METHOD AND SYSTEM FOR IMPROVING RESPONSIVENESS OF A VOICE RECOGNITION SYSTEM - A system includes a voice converter converting a first voice command into a first electrical command and a command library having library contents. A language responsiveness module (LRM) stores the first electrical command in a temporary set when a first control command cannot be determined from the library contents. A voice prompt module receives a second voice command when the first control command cannot be determined from the library contents. The voice converter converts a second voice command into a second electrical command corresponding to the second voice command. The LRM compares the second electrical command to the command library. The LRM determines a second control command corresponding to the second electrical command in response to comparing the second voice command to the command library and stores the first voice command in the command library after determining the control command corresponding to the second voice command. | 08-28-2014 |
20140244271 | Electronic Devices with Voice Command and Contextual Data Processing Capabilities - An electronic device may capture a voice command from a user. The electronic device may store contextual information about the state of the electronic device when the voice command is received. The electronic device may transmit the voice command and the contextual information to computing equipment such as a desktop computer or a remote server. The computing equipment may perform a speech recognition operation on the voice command and may process the contextual information. The computing equipment may respond to the voice command. The computing equipment may also transmit information to the electronic device that allows the electronic device to respond to the voice command. | 08-28-2014 |
20140244272 | CONTROL METHOD AND ELECTRONIC DEVICE - The present invention discloses a control method and an electronic device, which are capable of solving the technical problem in the prior art that it is not rapid enough when controlling a voice recognition engine to enter an operating state. The control method is applied in an electronic device which comprises a voice recognition engine and comprises or is connected to a microphone, wherein the method comprises: acquiring first airflow information collected by the microphone; determining whether the first airflow information satisfies a first preset condition; and controlling the voice recognition engine to enter a second state when the first airflow information satisfies the first preset condition. | 08-28-2014 |
20140244273 | VOICE-CONTROLLED COMMUNICATION CONNECTIONS - Systems and methods for voice-controlled communication connections are provided. An example system includes a mobile device being operated consecutively in listen, wakeup, authentication, and connect modes. Each of the subsequent modes consumes more power than a preceding mode. The listen mode consumes less than 5 mW. In the listen mode, the mobile device listens for an acoustic signal, determines whether the acoustic signal includes voice, and upon the determination, selectively enters the wakeup mode. In the wakeup mode, the mobile device determines whether the acoustic signal includes a spoken word and, upon the determination, enters the authentication mode. In the authentication mode, the mobile device identifies a user using the spoken command and, upon the identification, enters the connect mode. In the connect mode, the mobile device receives an acoustic signal, determines whether the acoustic signal includes a spoken command and performs one or more operations associated with the spoken command. | 08-28-2014 |
20140249825 | REMOTE COMMUNICATION SYSTEMS AND METHODS FOR COMMUNICATING WITH A BUILDING GATEWAY CONTROL TO CONTROL BUILDING SYSTEMS AND ELEMENTS - A wearable monitoring device, worn by a user, has one or more sensors which acquire at least one of a user's activities, behaviors and habit information, an antenna and a unique user ID. The monitoring device includes a wireless user interface with one or more input selection elements, which are accessible by the user to control at least a portion of one or more controllable devices housed in a building. One or more controllable systems or devices are at the building. At least a first portion of the one or more controllable systems or devices have an interface with a receiver in communication with the monitoring device that enables the monitoring device to communicate with the receiver. | 09-04-2014 |
20140249826 | SPEECH DIALOGUE SYSTEM AND SPEECH DIALOGUE METHOD - A speech dialogue system generates a response sentence in a way to improve the efficiency of the dialogue with the user, based on a result of estimation on an attribute of a proper name in an utterance of a user. The system includes a database attribute estimation unit to estimate the attribute of the input proper name by utilizing a database, and a web attribute estimation unit to estimate an attribute of an input proper name by utilizing information on the web. A reliability integration unit calculates integrated reliability of estimation for each of possible attributes obtained from the estimation by the units, by integrating first reliability of the estimation. A response generation unit generates a response sentence to an input utterance based on the integrated reliabilities of the possible attributes. | 09-04-2014 |
20140257821 | SYSTEM AND METHOD FOR PROCESSOR WAKE-UP BASED ON SENSOR DATA - A system for processor wake-up based on sensor data includes an audio buffer, an envelope buffer, and a processor. The audio buffer is configured to store a first data from a sensor. The first data is generated according to a first sampling rate. The envelope buffer is configured to store a second data, which is derived from the first data according to a second sampling rate, which is less than the first sampling rate. The processor is configured to wake up periodically from an idle state and read the second data from the envelope buffer. If the second data indicates an activity, the processor is configured to read the first data from the audio buffer. If the second data does not indicate an activity, the processor is configured to return to the idle state. | 09-11-2014 |
20140278435 | METHODS AND APPARATUS FOR DETECTING A VOICE COMMAND - Some aspects include a method of monitoring an acoustic environment of a mobile device operating in a low power mode, the mobile device having a first and second processor, the method comprises receiving acoustic input while the mobile device is operating in the low power mode, performing at least one first processing stage on the acoustic input using the first processor, prior to engaging the second processor, to evaluate whether the acoustic input includes a voice command, performing at least one second processing stage on the acoustic input using the second processor to evaluate whether the acoustic input includes a voice command if further processing is needed to determine whether the acoustic input includes a voice command, and initiating responding to the voice command when either the at least one first processing stage or the at least one second processing stage determines that the acoustic input includes a voice command. | 09-18-2014 |
20140278436 | VOICE INTERFACE SYSTEMS AND METHODS - A voice-controlled system is described that can be accessed by a mobile computing device. A user can communicate requests using natural language utterances. A microphone can collect the utterances and provide them to the mobile computing device. The mobile computing device can transmit the human utterance to a voice interface system. The voice interface system can utilize user preferences when executing the request to provide a personalized user experience. Computer-implemented methods are also described herein. | 09-18-2014 |
20140278437 | USER SENSING SYSTEM AND METHOD FOR LOW POWER VOICE COMMAND ACTIVATION IN WIRELESS COMMUNICATION SYSTEMS - A method of activating voice control on a wireless device includes sampling signals from a plurality of sensors on the device, determining if the device is in a hands-on state by a user on the basis of the signal sampling, and enabling a voice activated detection (VAD) application on the device on the basis of the determination. A voice controlled apparatus in a wireless device includes a plurality of sensors arranged on the device, a microphone, a controller to sample signals from one or more of the plurality of sensors, a processor coupled to the controller, and a voice activated detection (VAD) application running on the processor coupled to the controller and the microphone. | 09-18-2014 |
20140278438 | Providing Content on Multiple Devices - Techniques for receiving a voice command from a user and, in response, providing audible content to the user using a first device and providing visual content for the user using a second device. In some instances, the first device includes a microphone for generating audio signals that include user speech, as well as a speaker for outputting audible content in response to identified voice commands from the speech. However, the first device might not include a display for displaying graphical content. As such, the first device may be configured to identify devices that include displays and that are proximate to the first device. The first device may then instruct one or more of these other devices to output visual content associated with a user's voice command. | 09-18-2014 |
20140278439 | VOICE BASED AUTOMATION TESTING FOR HANDS FREE MODULE - An electronic control unit (ECU) of a hands-free module may be tested by an automated voice based testing tool in a first device. The tool reads test input data from an Excel input file. The tool generates simulated audible voice commands in a specified language, accent, pitch, volume or speed to test the hands-free module. The voice commands are transmitted via a speaker to a hands-free module microphone. The hands-free ECU is coupled to a CAN bus and the tool receives CAN bus information corresponding to hands-free module operations. The tool outputs test verdict information and/or CAN bus message logs as text in an Excel file. | 09-18-2014 |
20140278440 | FRAMEWORK FOR VOICE CONTROLLING APPLICATIONS - A system for voice control of applications includes an electronic device that receives speech signals and converts the speech signals into words. A voice navigation module analyzes an application and determines application type and enabled features. A command registration module registers commands based on the determined application type and enabled features. The commands control the application when matched with associated speech. A speech command interpretation module receives the words and detects a speech mode for matching commands with interpreted speech, and executes matched commands for navigating through and controlling the application. | 09-18-2014 |
20140278441 | SYSTEMS AND METHODS FOR SWITCHING PROCESSING MODES USING GESTURES - Systems and methods for switching between voice dictation modes using a gesture are provided so that an alternate meaning to a dictated word may be applied. The provided systems and methods time stamp detected gestures and detected words from the voice dictation and compare the time stamp at which a gesture is detected to the time stamp at which a word is detected. When it is determined that a time stamp of a gesture approximately matches a time stamp of a word, the word may be processed to have an alternate meaning, such as a command, punctuation, or action. | 09-18-2014 |
20140278442 | VOICE TRANSMISSION STARTING SYSTEM AND STARTING METHOD FOR VEHICLE - A voice transmission starting system and starting method for a vehicle are provided to start a voice transmission device through recognition of a motion. The method includes outputting, by a controller, an ultrasonic wave having substantially uniform amplitude and recognizing the output ultrasonic wave as an input. In addition, the controller generates a signal based on information regarding the recognized ultrasonic wave and transmits the signal to start the voice transmission device. | 09-18-2014 |
20140278443 | Voice Control User Interface with Progressive Command Engagement - A method includes placing a first processor in a sleep operating mode and running a second processor that is operative to wake the first processor from the sleep operating mode in response to a speech command phrase. The method includes identifying, by the second processor, a speech command phrase segment and performing a control operation in response to detecting the segment in detected speech. The control operation is performed while the first processor is maintained in the sleep operating mode. | 09-18-2014 |
20140278444 | CONTEXT-SENSITIVE HANDLING OF INTERRUPTIONS - A speech output to be provided to a user of a device is received. Thereafter, it is determined if the device is currently receiving speech input from a user. Upon determining that the device is not currently receiving speech input from the user, the speech output to the user is provided. On the other hand, upon determining that the device is receiving speech input from the user it is determined if provision of the speech output is urgent. When the speech output is urgent, the speech output is provided to the user. When the speech output is not urgent, provision of the speech output to the user is stayed. | 09-18-2014 |
20140278445 | INTEGRATED SENSOR-ARRAY PROCESSOR - An integrated sensor-array processor and method includes sensor array time-domain input ports to receive sensor signals from time-domain sensors. A sensor transform engine (STE) creates sensor transform data from the sensor signals and applies sensor calibration adjustments. Transducer time-domain input ports receive time-domain transducer signals, and a transducer output transform engine (TTE) generates transducer output transform data from the transducer signals. A spatial filter engine (SFE) applies suppression coefficients to the sensor transform data, to suppress target signals received from noise locations and/or amplification locations. A blocking filter engine (BFE) applies subtraction coefficients to the sensor transform data, to subtract the target signals from the sensor transform data. A noise reduction filter engine (NRE) subtracts noise signals from the BFE output. An inverse transform engine (ITE) generates time-domain data from the NRE output. | 09-18-2014 |
20140297287 | Voice-Activated Precision Timing - A method is disclosed for generating a voice-activated responsive action, such as a pulse or a measurement, with improved speed and time resolution. First, an operator calls an arming command to prepare the application, and then calls a trigger command to initiate the responsive action. The responsive action is generated immediately upon detecting the leading edge of the trigger command, resulting in an extremely fast response with precise timing. The particular responsive action may be determined by the type of sound detected, or by other information contained in the trigger command, or by the arming command, or by another prior command. Other types of preparatory events are disclosed. Efficient methods for identifying commands are disclosed. For all applications requiring an extremely prompt, precisely timed, hands-free response to a voice command, this invention is enabling. | 10-02-2014 |
20140297288 | TELEPHONE VOICE PERSONAL ASSISTANT - A system and associated method are provided for using a voice activated voice personal assistant (VPA) for a first user equipment, comprising: detecting establishment of a voice communication with a second user equipment; monitoring the voice communications using the VPA for commands relevant to the VPA; identifying, by the VPA, the commands within the voice communication; and implementing an action related to the commands during the ongoing voice communication. | 10-02-2014 |
20140297289 | VOICE CONTROL DEVICE, VOICE CONTROL METHOD AND PROGRAM - According to an illustrative embodiment, an information processing apparatus is provided. The information processing apparatus includes a communication device to receive plural pieces of tag information corresponding to respective positions within a target area, the target area having a position defined by the position of the apparatus; and an output device to output a plurality of sounds such that for each sound at least a portion of the sound overlaps with at least a portion of another of the sounds, each of the sounds being indicative of a respective piece of tag information. | 10-02-2014 |
20140310004 | VOICE CONTROL METHOD, MOBILE TERMINAL DEVICE, AND VOICE CONTROL SYSTEM - A voice control method, a mobile terminal device, and a voice control system are provided. The voice control method includes the following steps. An application provides at least one operating parameter for a speech software development module. The speech software development module receives a voice signal and parses the voice signal, and thus a voice recognition result is obtained. The speech software development module determines whether the voice recognition result matches the operating parameters. When the voice recognition result matches the operating parameters, the speech software development module provides an operating signal for the application. | 10-16-2014 |
20140310005 | Virtual assistant conversations for ambiguous user input and goals - Ambiguous input of a user received during an interactive session with a virtual agent may be processed. The virtual agent may be presented via a computing device to facilitate the interactive session with the user. The user may provide the ambiguous input, which is processed to determine a response to the input. The virtual agent may provide the response to the user. The virtual agent may also carry out a goal-based dialogue where a goal to be accomplished is identified. The virtual agent may prompt the user for information related to the goal. | 10-16-2014 |
20140330569 | DEVICE VOICE RECOGNITION SYSTEMS AND METHODS - Device voice recognition systems and methods are described herein. One example of a method for device voice recognition includes receiving a voice command, determining a number of devices relating to the voice command, and adjusting a setting of the number of devices based on the received voice command. | 11-06-2014 |
20140330570 | SATISFYING SPECIFIED INTENT(S) BASED ON MULTIMODAL REQUEST(S) - Techniques are described herein that are capable of satisfying specified intent(s) based on multimodal request(s). A multimodal request is a request that includes at least one request of a first type and at least one request of a second type that is different from the first type. Example types of request include but are not limited to a speech request, a text command, a tactile command, and a visual command. A determination is made that one or more entities in visual content are selected in accordance with an explicit scoping command from a user. In response, speech understanding functionality is automatically activated, and audio signals are automatically monitored for speech requests from the user to be processed using the speech understanding functionality. | 11-06-2014 |
20140337036 | LOW POWER ACTIVATION OF A VOICE ACTIVATED DEVICE - In a mobile device, a bone conduction or vibration sensor is used to detect the user's speech and the resulting output is used as the source for a low power Voice Trigger (VT) circuit that can activate the Automatic Speech Recognition (ASR) of the host device. This invention is applicable to mobile devices such as wearable computers with head mounted displays, mobile phones and wireless headsets and headphones which use speech recognition for the entering of input commands and control. The speech sensor can be a bone conduction microphone used to detect sound vibrations in the skull, or a vibration sensor, used to detect sound pressure vibrations from the user's speech. This VT circuit can be independent of any audio components of the host device and can therefore be designed to consume ultra-low power. Hence, this VT circuit can be active when the host device is in a sleeping state and can be used to wake the host device on detection of speech from the user. This VT circuit will be resistant to outside noise and react solely to the user's voice. | 11-13-2014 |
20140337037 | Systems and Methods for Speech Command Processing - Methods and apparatus related to processing speech input at a wearable computing device are disclosed. Speech input can be received at the wearable computing device. Speech-related text corresponding to the speech input can be generated. A context can be determined based on database(s) and/or a history of accessed documents. An action can be determined based on an evaluation of at least a portion of the speech-related text and the context. The action can be a command or a search request. If the action is a command, then the wearable computing device can generate output for the command. If the action is a search request, then the wearable computing device can: communicate the search request to a search engine, receive search results from the search engine, and generate output based on the search results. The output can be provided using output component(s) of the wearable computing device. | 11-13-2014 |
20140343949 | SMART MICROPHONE DEVICE - A smart microphone device is provided. The smart microphone device is coupled to a host, and includes: an analog microphone unit, receiving sounds; a voice detection unit, coupled to the analog microphone unit, detecting voices from the sounds; a speech detection unit, coupled to the voice detection unit, detecting a speech from the voices; and a channel select pin, coupled between the smart microphone device and the host, wherein an interrupt signal is sent from the smart microphone device to the host via the channel select pin to enable the host to operate in the normal mode when the speech detection unit detects the speech. | 11-20-2014 |
20140343950 | INTERACTIVE USER INTERFACE FOR AN INTELLIGENT ASSISTANT - A system, method and computer program for performing voice commands on a mobile device and presenting the results on an interactive timeline is disclosed. A user may utter a voice command into the microphone of their mobile device while an application is running. The voice command is processed to derive the intention of the user, specifically by determining the domain, at least one task and at least one parameter for the task from the voice command. A services component performs the task identified and presents the results on the mobile device screen. In various embodiments, the results are presented on a timeline and may be grouped together by domains and sorted by the time that the results were obtained. A search history view may also be viewed that includes search results sorted chronologically each of which is represented graphically by an icon that represents the category of each search. When a user utters a voice command, the text representation is displayed together with an edit button, a resay button, and a progress bar. A user may modify the text representation at any time while the natural language processing is being performed. | 11-20-2014 |
20140343951 | Simplified Decoding of Voice Commands Using Control Planes - Systems and methods for training voice activation control of electronic equipment are disclosed. One example method includes receiving a selection corresponding to at least one command used to control the electronic equipment. The method further includes instructing a user to speak, and responsive to the instruction, receiving a digitized speech stream. The method further includes segmenting the speech stream into speech segments, storing at least one of the speech segments as an entry in a dictionary, and associating the dictionary entry with the selected command. | 11-20-2014 |
20140343952 | SYSTEMS AND METHODS FOR LIP READING CONTROL OF A MEDIA DEVICE - Systems and methods of generating device commands based upon spoken user commands are disclosed. An exemplary embodiment captures a series of images of a user of a media device, generates image information corresponding to the series of captured images, determines lip movement of the user from the generated image information, determines at least one spoken user command based upon the determined lip movement of the user, and determines a device command based upon the determined spoken user command. Then, the device command is communicated to, for example, a media presentation device, wherein an operation of the media presentation device is controlled in accordance with the determined spoken user command. | 11-20-2014 |
20140350941 | Method For Finding Elements In A Webpage Suitable For Use In A Voice User Interface (Disambiguation) - A disambiguation process for a voice interface for web pages or other documents. The process identifies interactive elements such as links, obtains one or more phrases of each interactive element, such as link text, title text and alternative text for images, and adds the phrases to a grammar which is used for speech recognition. A group of interactive elements are identified as potential best matches to a voice command when there is no single, clear best match. The disambiguation process modifies a display of the document to provide unique labels for each interactive element in the group, and the user is prompted to provide a subsequent spoken command to identify one of the unique labels. The selected unique label is identified and a click event is generated for the corresponding interactive element. | 11-27-2014 |
20140350942 | VEHICLE HUMAN MACHINE INTERFACE WITH GAZE DIRECTION AND VOICE RECOGNITION - A human machine interface (HMI) system for a vehicle equipped with a plurality of voice activated devices. An occupant monitor is used to determine a gaze direction or gesture of an occupant of the vehicle. The system determines to which of the voice activated devices a voice command is directed based on the gaze direction or gesture. | 11-27-2014 |
20140350943 | PERSONAL AUDIO ASSISTANT DEVICE AND METHOD - A server includes one or more processors, and a computer readable memory coupled to the one or more processors. The computer readable memory contains instructions which when executed by the one or more processors cause the one or more processors to perform the operations of receiving captured audio via a microphone of a mobile or wearable device at a location remote from the server via a communications module operatively coupled to the mobile or wearable device, analyzing the captured audio to provide analyzed captured audio, and sending information to the communications module in response to the analyzed captured audio, the information including instructions to initiate control of media content or initiate operations of the mobile or wearable device. A method operating at the server and other embodiments are disclosed. | 11-27-2014 |
20140358552 | LOW-POWER VOICE GATE FOR DEVICE WAKE-UP - A staged processing system may be configured to reduce power consumption during voice detection in an audio signal. A first stage may include detecting a minimal threshold of sound in an audio signal. A second stage may then be activated to apply a Teager operator to determine a signal-to-noise ratio of speech energy in an audio signal. When a minimum SNR is detected, a third stage may be activated to detect periodicity in the audio signal and identify a voice signal in the audio signal. When a voice signal is detected, a fourth stage may be activated to process the voice command. | 12-04-2014 |
20140358553 | VOICE COMMAND FOR CONTROL OF AUTOMATION SYSTEMS - Software for controlling automation systems by using voice commands. A user may speak a command into a microphone that is operatively connected to a computer. The software loaded on the computer may translate the voice command to computer readable commands. The computer may then schedule a specified day and time to implement the commands. During the specified time and day, the computer may control the automation system based on the specified command. | 12-04-2014 |
20140365225 | ULTRA-LOW-POWER ADAPTIVE, USER INDEPENDENT, VOICE TRIGGERING SCHEMES - Methods and systems are provided for ultra-low-power adaptive, user independent, voice triggering in electronic devices. A voice trigger, which may be configured as an ultra-low-power function, may be run in an electronic device, when the electronic device transitions to a power-saving state, and may be used to control the electronic device based on audio inputs. The controlling may comprise capturing an audio input, and processing the audio input to determine when the audio input corresponds to a triggering command, to trigger transitioning of the electronic device from the power-saving state. The processing of audio input, to determine that it corresponds to the triggering command, may be based on use of an adaptively configured state machine. The state machine may be based on a Hidden Markov Model (HMM), and may be configured as a two-dimensional state machine that comprises a plurality of lines of incantations, each of which corresponds to the triggering command. | 12-11-2014 |
20140365226 | SYSTEM AND METHOD FOR DETECTING ERRORS IN INTERACTIONS WITH A VOICE-BASED DIGITAL ASSISTANT - The method is performed at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors. A speech input containing a request is received from a user. At least one action in furtherance of satisfying the request is performed. A user interaction is detected, such as a speech input to a digital assistant or a physical interaction with a device. It is determined whether the user interaction is indicative of a problem in the performing of the at least one action. Upon determining that the user interaction is indicative of a problem, information relating to the request is stored in a repository for error analysis. | 12-11-2014 |
20140365227 | INTERPRETING AND ACTING UPON COMMANDS THAT INVOLVE SHARING INFORMATION WITH REMOTE DEVICES - An electronic device with one or more processors and memory includes a procedure for sharing information with a third party recipient. In some embodiments, the device receives a speech input from a first user, the speech input specifying a second user different from the first user, and an information item to be shared with the second user. In response to the speech input, the device initiates a background process during which a digital assistant searches for the information item and causes the information item to be sent to the second user without further review and instruction from the first user. | 12-11-2014 |
20140365228 | INTERPRETATION OF AMBIGUOUS VEHICLE INSTRUCTIONS - Various exemplary embodiments relate to a command interpreter for use in a vehicle control system in a vehicle for interpreting user commands, a vehicle interaction system including such a command interpreter, a vehicle including such a vehicle interaction system, and related method and non-transitory machine-readable storage medium, including: a memory and a processor, the processor being configured to: receive, from at least one human via a first input device, a first input having a first type; receive a second input having a second type via a second input device, wherein the second type comprises at least one of sensed information describing a surrounding environment of the vehicle and input received from at least one human; interpret both the first input and the second input to generate a system instruction; and transmit the system instruction to a different system of the vehicle. | 12-11-2014 |
20140365229 | SYSTEM AND METHOD FOR EXCERPT CREATION BY DESIGNATING A TEXT SEGMENT USING SPEECH - An apparatus includes at least one input device configured to receive a speech input, a display configured to present predetermined content acquired by the apparatus from which excerpts may be extracted, and a processor configured to execute computer readable program code. The computer readable program code is configured to collect a speech recognition vocabulary set that corresponds to content visible on the display such that at least one vocabulary word in the speech recognition vocabulary set is the same as a word presented on the display, designate a segment of the content to be excerpted based on the speech input, and create a link to a source of the excerpted content and display the link with the excerpted content. | 12-11-2014 |
20140379353 | Environmentally aware dialog policies and response generation - Environmental conditions, along with other information, are used to adjust a response of a conversational dialog system. The environmental conditions may be used at different times within the conversational dialog system. For example, the environmental conditions can be used to adjust the dialog manager's output (e.g., the machine action). The dialog state information that is used by the dialog manager includes environmental conditions for the current turn in the dialog as well as environmental conditions for one or more past turns in the dialog. The environmental conditions can also be used after receiving the machine action to adjust the response that is provided to the user. For example, the environmental conditions may affect the machine action that is determined as well as how the action is provided to the user. The dialog manager and the response generation components in the conversational dialog system each use the available environmental conditions. | 12-25-2014 |
20140379354 | METHOD, APPARATUS AND SYSTEM FOR PAYMENT VALIDATION - A method, apparatus and system for payment validation have been disclosed. The method includes: receiving a payment validation request from a terminal, wherein the payment validation request includes identification information and a current voice signal; detecting whether the identification information is identical to a pre-stored identification information; if identical: extracting voice characteristics associated with an identity information and a text password from the current voice signal; matching the current voice characteristics to a pre-stored speaker model; if successfully matched: sending a validation reply message to the terminal to indicate that the payment request has been authorized. The validation reply message is utilized by the terminal to proceed with a payment transaction. The identity information identifies an owner's current voice signal, and the text password is indicated by the current voice signal. The method eliminates the requirement of the server sending an SMS message with a validation code to the terminal. | 12-25-2014 |
20150012279 | METHOD AND APPARATUS FOR ASSIGNING KEYWORD MODEL TO VOICE OPERATED FUNCTION - A method, performed in an electronic device, for assigning a target keyword to a function is disclosed. In this method, a list of a plurality of target keywords is received at the electronic device via a communication network, and a particular target keyword is selected from the list of target keywords. Further, the method may include receiving a keyword model for the particular target keyword via the communication network. In this method, the particular target keyword is assigned to a function of the electronic device such that the function is performed in response to detecting the particular target keyword based on the keyword model in an input sound received at the electronic device. | 01-08-2015 |
20150012280 | SERVER, CONTROL METHOD THEREOF, IMAGE PROCESSING APPARATUS, AND CONTROL METHOD THEREOF - A system includes a server and an image processing apparatus, and the server is provided that includes a communication interface, a storage, and a processor. The communication interface is configured to communicate with the image processing apparatus. The storage is configured to store data. The processor may provide a result of processing a first event that includes a speech of a user to the image processing apparatus in response to the first event being received from the image processing apparatus, store a record of the first event in the storage according to processing of the first event, determine a relation between the first and second events that includes a user input by a non-speech method in response to the second event being received from the image processing apparatus, and process the second event based on the record of the first event stored in the storage in response to the relation. | 01-08-2015 |
20150019229 | Using Voice Commands To Execute Contingent Instructions - Apparatus, systems and/or methods can be configured to identify a desired action from a user input, identify a contingency from an audible verbal instruction, initiate a check for satisfaction of the contingency, and initiate the desired action if the contingency is deemed to be satisfied. | 01-15-2015 |
20150025893 | IMAGE PROCESSING APPARATUS AND CONTROL METHOD THEREOF - An image processing apparatus and control method are provided. The image processing apparatus includes: a communication interface which is configured to communicably connect to a server; a voice input interface which is configured to receive a speech of a user and generate a voice signal corresponding to the speech; a storage which is configured to store at least one user account of the image processing apparatus and signal characteristic information of a voice signal that is designated corresponding to the user account; and a controller which is configured to, in response to an occurrence of a log-in event with respect to the user account, determine a signal characteristic of the voice signal corresponding to the speech received by the voice input interface, select and automatically log in to a user account corresponding to the determined signal characteristic from among the at least one user account stored in the storage, and control the communication interface to connect to the server with the selected user account. | 01-22-2015 |
20150032456 | INTELLIGENT PLACEMENT OF APPLIANCE RESPONSE TO VOICE COMMAND - Systems and methods for intelligent placement of appliance response to a voice command are provided. An exemplary system includes a plurality of appliances. An exemplary method includes connecting each of the plurality of appliances over a local area network and generating a location map providing a location of each of the plurality of appliances. The method includes receiving the human voice signal at a plurality of microphones respectively included in the plurality of appliances and determining an originating location of the human voice signal based at least in part on the location map. The method includes selecting one of the plurality of appliances to respond to the human voice signal based at least in part on the location map and the originating location. | 01-29-2015 |
20150032457 | APPARATUS AND METHOD OF CONTROLLING VOICE INPUT IN ELECTRONIC DEVICE SUPPORTING VOICE RECOGNITION - A method of controlling a voice input of a terminal supporting a voice recognition function is provided. The method includes controlling a microphone to be in a turn-off state during operation of a voice recognition mode; detecting a first user input for requesting turn-on of the microphone based on at least one of a touch input, a touch pen input, and a key input; controlling the microphone in the turn-off state to be in a turn-on state when the first user input is detected; collecting voice input data of a user through the microphone in the turn-on state; and controlling the microphone in the turn-on state to be in the turn-off state when a second user input for requesting the turn-off of the microphone is detected, and terminating collecting the voice input data. | 01-29-2015 |
20150032458 | COMPUTERIZED INFORMATION PRESENTATION APPARATUS - A computerized information system and computer readable apparatus. In one embodiment, the apparatus is configured for use in a transport device and comprises a computer readable medium having at least one computer program disposed thereon, the at least one program being configured to provide the user with requested information (such as for example directions to a desired business or other entity) via speech query. The user may also be provided with other topical information such as weather, parking rates, and the like. In one embodiment, at least a portion of the information is obtained via a wireless link, such as from a remote server. | 01-29-2015 |
20150032459 | COMPUTERIZED INFORMATION AND DISPLAY APPARATUS - Computerized apparatus useful for obtaining and presenting information to users. In one embodiment, the computerized apparatus includes a display device and speech recognition apparatus configured to receive user speech input and enable performance of various tasks, such as obtaining desired information relating to an entity, maps or directions, weather, news, or any number of other topics. The obtained data may also, in one variant, be displayed with contextually related content. In another variant, retrieved data can be downloaded to a portable user device. | 01-29-2015 |
20150039316 | SYSTEMS AND METHODS FOR MANAGING DIALOG CONTEXT IN SPEECH SYSTEMS - Methods and systems are provided for managing spoken dialog within a speech system. The method includes establishing a spoken dialog session having a first dialog context, and receiving a context trigger associated with an action performed by a user. In response to the context trigger, the system changes to a second dialog context. In response to a context completion condition, the system then returns to the first dialog context. | 02-05-2015 |
20150039317 | SYSTEM WITH MULTIPLE SIMULTANEOUS SPEECH RECOGNIZERS - A speech recognition system interprets both spoken system commands and application commands. Users may speak commands to an open microphone of a computing device that may be interpreted by at least two speech recognizers operating simultaneously. The first speech recognizer interprets operating system commands and the second speech recognizer interprets application commands. The system commands may include at least opening and closing an application and the application commands may include at least a game command or navigation within a menu. A reserve word may be used to identify whether the command is for the operating system or application. A user's cadence may also indicate whether the speech is a global command or application command. A speech recognizer may include a natural language software component located in a remote computing device, such as in the so-called cloud. | 02-05-2015 |
20150039318 | APPARATUS AND METHOD FOR SELECTING CONTROL OBJECT THROUGH VOICE RECOGNITION - There are provided an apparatus and a method for selecting a control object through voice recognition. The apparatus for selecting a control object according to the present invention that is an apparatus for selecting a control object through voice recognition includes one or more processing devices, in which the one or more processing devices are configured to obtain input information on the basis of a voice of a user, to match the input information to at least one identification information obtained based on a control object, to obtain matched identification information matched to the input information among the identification information, and to select a control object corresponding to the matched identification information. | 02-05-2015 |
20150039319 | Command Handling Method, Apparatus, and System - A command handling method, apparatus, and system. The method includes receiving multiple voice instructions sent by a voice parsing server, where the multiple voice instructions are generated after the voice parsing server parses source voice commands that are from different voice control devices; separately determining whether any two voice instructions in the multiple voice instructions are similar instructions, where the similar instructions are voice instructions corresponding to source voice commands that are obtained by the different voice control devices by collecting same voice information; and when two voice instructions that are similar instructions exist in the multiple voice instructions, discarding one voice instruction in the two similar voice instructions. The embodiments of the present invention further provide a command handling apparatus and system. The embodiments eliminate a control error caused by repeated execution of a command. | 02-05-2015 |
20150046168 | Method and Apparatus for a Multi I/O Modality Language Independent User-Interaction Platform - Automated user-machine interaction is gaining traction in many applications and services. However, implementing and offering smart automated user-machine interaction services still present technical challenges. According to at least one example embodiment, a dialogue manager is configured to handle multiple dialogue applications independent of the language, the input modalities, or output modalities used. The dialogue manager employs generic semantic representation of user-input data. At a step of a dialogue, the dialogue manager determines whether the user-input data is indicative of a new request or a refinement request based on the generic semantic representation and at least one of a maintained state of the dialogue, general knowledge data representing one or more concepts, and data representing history of the dialogue. The dialogue manager then responds to the determined user request with multi-facet output data to a client dialogue application indicating action(s) to be performed. | 02-12-2015 |
20150046169 | INFORMATION PROCESSING METHOD AND ELECTRONIC DEVICE - The present disclosure provides information processing methods and electronic devices in view of the problem in the conventional technology that accuracy of adjusting parameters via voice input is low. The information processing method is applied in an electronic device comprising an output unit. The method comprises: outputting, by the output unit, first data corresponding to a first application when the electronic device executes the first application; acquiring a first voice input that is inputted in a voice input approach; performing voice recognition on the first voice input to acquire a first operation instruction; controlling the output unit to output second data based on the first operation instruction, wherein a first parameter of the second data is different from that of the first data; and setting, based on the first operation instruction, a response unit in a first operation area on the electronic device as a first function response unit to adjust the first parameter, wherein the input approach for the first operation area is different from the voice input approach. | 02-12-2015 |
20150046170 | INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM - There is provided an information processing device including a processor configured to achieve a function of registering implicit preference information about a user on the basis of an activity of the user, a function of outputting a question to the user by voice on the basis of the implicit preference information, a function of acquiring an answer to the question from a voice spoken by the user, and a function of registering explicit preference information about the user depending on the answer. | 02-12-2015 |
20150058020 | Using Voice Recognition for Recording Events - Various implementations described herein are directed to a wearable device used to determine whether audio data corresponds to a fishing event. The wearable device may include at least one microphone. The wearable device may include a computer system with a processor and memory. The memory may have a plurality of executable instructions. When the executable instructions are executed by the processor, the processor may receive audio data from the at least one microphone, determine whether the received audio data corresponds to a fishing event and store a record of the fishing event, and a timestamp corresponding to the fishing event. | 02-26-2015 |
20150058021 | Miniature, Wearable, Kitchen Ordering Apparatus - A miniature, wearable, kitchen ordering apparatus having a wearable bracket, a control box coupled to the bracket, a circuit board in the control box, a display output port, wireless communications unit, data processing unit, voice analyzer unit, and an audio pickup module, all operably connected to the circuit board. The output terminal of the audio pickup module is operably connected to the voice analyzer unit input terminal, the output terminal from the voice analyzer unit connects to the data processing unit input terminal, the digital communications port on the data processing unit connects to the wireless communications unit, and the data processing unit display interface connects to the display output port, which in turn connects to a flexible display panel installed on the wearable bracket. There is a battery connected to the data processing unit. | 02-26-2015 |
20150058022 | METHOD FOR PROCESSING DATA AND ELECTRONIC DEVICE THEREOF - An operation method of an electronic device is provided. The method includes detecting audio information from all or some of media data, determining a setting duration as at least one duration which satisfies a reference condition using the audio information, and displaying the setting duration on a display. | 02-26-2015 |
20150066513 | MECHANISM FOR PERFORMING SPEECH-BASED COMMANDS IN A SYSTEM FOR REMOTE CONTENT DELIVERY - A method for performing speech-based commands in a system for remote content delivery, includes receiving speech, recognizing the speech, transmitting the speech to a speech server, receiving a device-based signal corresponding to the speech from the speech server when the speech is a speech-based command, forwarding the device-based signal to a streaming server; and receiving content from the streaming server corresponding to the device-based signal. | 03-05-2015 |
20150066514 | INFORMATION PROCESSING METHOD AND ELECTRONIC DEVICE - The disclosure discloses an information processing method and an electronic device, which relate to the field of electronic technologies, to improve a match degree between a recognition result of a voice recognition engine and a result required by the user and thus to improve the user experience. The electronic device includes N objects, each object corresponds to a weight value, and the weight value of each object is used to indicate a weight of the object in a search space of the voice recognition engine. The method provided by the disclosure includes: acquiring a first input operation; acquiring an execution object according to the first input operation; responding to the first input operation with the execution object; after the first input operation is responded to, determining L objects that have been displayed by the display unit in a first time period. | 03-05-2015 |
20150066515 | EQUIPMENT CONTROL METHOD AND SPEECH-BASED EQUIPMENT CONTROL SYSTEM - A method is disclosed for controlling a supply device according to a user's speech in a speech-based equipment control system. The system periodically updates a current quantity of a material that has already been supplied, and ascertains an already-supplied quantity. A first specific quantity is stored in the system, which is a quantity of the prescribed material to be supplied as designated by a user using speech. The instruction is received to change the first specific quantity to a second specific quantity, and if the second specific quantity is greater than the already-supplied quantity at the current time, the prescribed material is supplied until the already-supplied quantity reaches the second specific quantity. | 03-05-2015 |
20150066516 | APPLIANCE CONTROL METHOD, SPEECH-BASED APPLIANCE CONTROL SYSTEM, AND COOKING APPLIANCE - In a case of receiving, from an audio input device, instruction information including first audio information indicating operation instructions for a cooking appliance when first and second cooking units are executing first and second cooking programs, respectively, operation instructions are recognized from the first audio information. In a case where it is determined that the instruction information includes second audio information related to the first cooking menu information or the second cooking menu information, a control command is transmitted to the cooking appliance to cause the cooking appliance to execute a process corresponding to the operation instructions, without executing a process according to the first cooking program or the second cooking program corresponding to one of the first cooking menu information or the second cooking menu information to which the second audio information is related. | 03-05-2015 |
20150073808 | REMOTE CONTROL AND PAYMENT TRANSACTIONING SYSTEM USING NATURAL LANGUAGE, VEHICLE INFORMATION, AND SPATIO-TEMPORAL CUES - A system enables a mobile platform to issue commands using natural language dialog in order to control and/or monitor the functionality of remote systems according to a desired set of criteria and/or meta-criteria. | 03-12-2015 |
20150073809 | TRANSPORT APPARATUS WITH COMPUTERIZED INFORMATION AND DISPLAY APPARATUS - Transport apparatus which includes computerized apparatus useful for obtaining and displaying information. In one embodiment, the computerized apparatus includes a display device and speech recognition apparatus configured to receive user speech input and enable performance of various tasks, such as obtaining desired information relating to a location of an entity, maps or directions, or any number of other topics. The obtained data may also, in one variant, be displayed with contextually related content. In another variant, retrieved data can be downloaded to a portable user device. | 03-12-2015 |
20150073810 | MUSIC PLAYING METHOD AND MUSIC PLAYING SYSTEM - The music playing method comprises acquiring an inputted voice, detecting that any one registered character string from among a plurality of registered character strings stored in a recording medium is contained in the voice, and outputting music corresponding to the registered character string after a delay time stored in the recording medium has passed from the time when the voice containing the registered character string is detected in the character string detecting. | 03-12-2015 |
20150073811 | Systems and Methods for Generating Markup-Language Based Expressions from Multi-Modal and Unimodal Inputs - When using finite-state devices to perform various functions, it is beneficial to use finite state devices representing regular grammars with terminals having markup-language-based semantics. By using markup-language-based symbols in the finite state devices, it is possible to generate valid markup-language expressions by concatenating the symbols representing the result of the performed function. The markup-language expression can be used by other applications and/or devices. Finite-state devices are used to convert strings of words and gestures into valid markup-language, for example, XML, expressions that can be used, for example, to provide an application program interface to underlying system applications. | 03-12-2015 |
20150081309 | COMPUTERIZED INFORMATION AND DISPLAY APPARATUS - Computerized apparatus capable of providing a user, such as a passenger of a transportation device, with various types of information. In one embodiment, the apparatus includes speech processing and speech synthesis apparatus, as well as a wireless interface and a database, to enable the user to obtain information both locally and remotely (such as from a remote server and/or the Internet) while being transported. In one variant, the user can engage in a verbal interchange with the apparatus in order to obtain desired information. | 03-19-2015 |
20150088523 | Systems and Methods for Designing Voice Applications - Examples disclose a method and system for designing voice applications. The method may be executable to receive a verbal input, parse the verbal input to recognize a keyword, and identify a plurality of applications associated with the recognized keyword. Moreover, the method may be further executable to determine a relevance and/or payment associated with the verbal input and/or keyword, to identify one or more applications that are already installed on a computing device, and to initiate or offer to initiate the identified installed application based on the determined relevance and/or payment. When an installed application is not identified, the method may be executable to identify one or more applications from a plurality of relevant candidate applications and present one or more of the identified relevant candidate applications to a user for possible installation. The payment may be based on whether the identified application is already installed on the computing device. | 03-26-2015 |
20150088524 | APPARATUS AND METHOD FOR GENERATING AN EVENT BY VOICE RECOGNITION - There are provided an apparatus and a method for generating an event through voice recognition. The apparatus for generating an event through voice recognition according to the present invention includes one or more processing devices, in which the one or more processing devices are configured to obtain input information on the basis of a voice of a user, to match the input information to at least one identification information obtained based on application screen information, to obtain matched identification information matched to the input information among the identification information, and to generate an event in at least a partial area of areas corresponding to the matched identification information. | 03-26-2015 |
20150088525 | METHOD AND APPARATUS FOR CONTROLLING APPLICATIONS AND OPERATIONS ON A TERMINAL - A method and apparatus for controlling an application startup and its functions on a terminal have been disclosed. The method including: acquiring a first speech data input by a user, wherein speech recognition is being performed on the first speech data to obtain a first speech recognition result; determining whether the first speech recognition result includes a startup command word for a particular installed application which has not been started on a terminal, wherein the particular installed application includes at least a social networking application; if the first speech recognition result includes the startup command word for the particular installed application, then the particular installed application is regarded as a controlled application, and the startup command word is converted into a startup command for the controlled application; and starting the controlled application utilizing the startup command of the controlled application. | 03-26-2015 |
20150088526 | Synthesis and Display of Speech Commands Method and System - A construction and display of speech commands system that allows a user to simply read what is on an application that involves visual elements with which the user interacts, and in doing so, gives the appropriate commands to the speech recognition system for the task at hand. The construction and display of speech commands system may include a speech recognition system, a grammar builder module, and a speech enablement module. The construction and display of speech commands system may automatically generate a speech enabled application from generated speech grammar. | 03-26-2015 |
20150095036 | CONTROLLING A SYSTEM USING VOICELESS ALARYNGEAL SPEECH - Apparatus, including a sensor which is configured to be fixed to a neck of an operator of equipment in a location suitable for sensing a voiceless alaryngeal speech vibration generated by the operator during operation of the equipment. The apparatus further includes a processor which is configured to receive and process a signal output by the sensor so as to measure the voiceless alaryngeal speech vibration and so as to generate a control signal for the equipment responsively to the measured voiceless alaryngeal speech vibration. | 04-02-2015 |
20150095037 | VEHICULAR DEVICE, SERVER, AND INFORMATION PROCESSING METHOD - The technique is implemented by a vehicular device for performing dialogs with the driver. This device has: a communication portion for communicating with a server; an output portion for outputting speech information to the driver; an input portion for inputting information based on speech uttered by the driver; and a controller for controlling the communication portion, the output portion, and the input portion. When trigger information for starting a dialog process in the vehicular device or in the server is generated, the controller receives information indicative of the type of a first dialog forming a starting point of the dialog process and information indicative of the type of a second dialog forming an ending point of the dialog process from the server and carries out the dialog process based on the received information. The second dialog is different in type from the first dialog. | 04-02-2015 |
20150100321 | INTELLIGENT STATE AWARE SYSTEM CONTROL UTILIZING TWO-WAY VOICE / AUDIO COMMUNICATION - The embodiments provide a method and system for enabling an intelligent state aware system control utilizing two-way voice/audio communication using an electronic device. The method includes receiving voice commands from a user and identifying one or more actions associated with the voice command. Further, the method includes maintaining internal states of the actions based on one or more rules, where the internal states are dynamically defined based on a response to the voice command and the action. Further, the method includes computing application commands by performing the actions in accordance to the internal state, and providing a voice response to the user from the electronic device in response to execution of the application commands on corresponding applications. | 04-09-2015 |
20150100322 | REMOTE CONTROL APPARATUS FOR INPUTTING USER VOICE AND METHOD THEREOF - A remote control apparatus is disclosed. The remote control apparatus includes a movement detector which is configured to detect a movement of the remote control apparatus, a microphone which is configured to receive a voice input, a controller which is configured to activate the microphone in response to the remote control apparatus moving for a preset first time by at least a threshold angle, and a communicator which is configured to transmit the voice input to an external device in response to the voice input being input through the activated microphone. | 04-09-2015 |
20150100323 | WEARABLE TERMINAL AND METHOD FOR CONTROLLING THE SAME - A wearable terminal includes a voice data generation unit configured to generate audio data, a sensing unit configured to sense a motion of a user's upper limb in a first axis direction perpendicular to a plane defined by a vertically downward oriented direction of the upper limb and a direction of movement of the user, and to generate motion data concerning the motion, a determination unit configured to determine, based on the motion data, whether or not the user is going to perform remote control of a home electric appliance, and a data processing unit configured to process the audio data. The data processing unit includes a transmission data generation unit configured to generate transmission data corresponding to the audio data if the determination unit determines that the user is going to perform the remote control, and a transmission unit configured to transmit the transmission data to a network. | 04-09-2015 |
20150106105 | Automatic Door - In some implementations a microcontroller is coupled to a storage device, the storage device having a voice-recognition engine stored thereon, and the microcontroller is operably coupled to a device-controller of an automatic door. | 04-16-2015 |
20150112690 | LOW POWER ALWAYS-ON VOICE TRIGGER ARCHITECTURE - The description is directed to systems and methods for a low-power, hands-free voice triggering of a main processing complex of a computing system to wake from a suspended state. An always-on voice activity detection module samples output received from a microphone in the computing system and determines whether a portion of the sampled output potentially contains a triggering keyphrase. A special purpose audio processing engine is turned on to confirm the presence of the triggering keyphrase in the sampled output before triggering the main processing complex of the computing system to wake from the suspended state. | 04-23-2015 |
20150112691 | Automatically Monitoring for Voice Input Based on Context - In one implementation, a computer-implemented method includes detecting a current context associated with a mobile computing device and determining, based on the current context, whether to switch the mobile computing device from a current mode of operation to a second mode of operation during which the mobile computing device monitors ambient sounds for voice input that indicates a request to perform an operation. The method can further include, in response to determining whether to switch to the second mode of operation, activating one or more microphones and a speech analysis subsystem associated with the mobile computing device so that the mobile computing device receives a stream of audio data. The method can also include providing output on the mobile computing device that is responsive to voice input that is detected in the stream of audio data and that indicates a request to perform an operation. | 04-23-2015 |
20150120305 | SPEECH COMMUNICATION SYSTEM FOR COMBINED VOICE RECOGNITION, HANDS-FREE TELEPHONY AND IN-CAR COMMUNICATION - A multi-mode speech communication system is described that has different operating modes for different speech applications. A speech service compartment contains multiple system users, multiple input microphones that develop microphone input signals from the system users to the system, and multiple output loudspeakers that develop loudspeaker output signals from the system to the system users. A signal processing module is in communication with the speech applications and includes an input processing module and an output processing module. The input processing module processes the microphone input signals to produce a set of user input signals for each speech application that are limited to currently active system users for that speech application. The output processing module processes application output communications from the speech applications to produce loudspeaker output signals to the system users, wherein for each different speech application, the loudspeaker output signals are directed only to system users currently active in that speech application. The signal processing module dynamically controls the processing of the microphone input signals and the loudspeaker output signals to respond to changes in currently active system users for each application. | 04-30-2015 |
20150127353 | ELECTRONIC APPARATUS AND METHOD FOR CONTROLLING ELECTRONIC APPARATUS THEREOF - A method for controlling the electronic apparatus including: receiving an input of an audio which includes a user's voice; processing the audio and generating a user voice signal; transmitting the user voice signal to a first server; receiving text information corresponding to the user voice signal from the first server; and controlling the electronic apparatus, according to the text information. | 05-07-2015 |
20150134340 | VOICE INTERNET SYSTEM AND METHOD - A system and method is provided for voice activated Web based infrastructure (Voice Portal) which accepts spoken input from a variety of devices, including desktop and laptop computers, tablets, smart phones, standard mobile phones, and ordinary hard-wired telephones. | 05-14-2015 |
20150134341 | DISPLAY CONTROL APPARATUS, DISPLAY CONTROL METHOD, PROGRAM, AND INFORMATION STORAGE MEDIUM - A display control apparatus includes: a voice message acceptance block configured to accept a voice message; an option identification block configured to identify, from among a plurality of options related with information indicative of voice messages, in accordance with acceptance of a voice message by the voice message acceptance block, an option of attention that is an option related with information indicative of the accepted voice message and an alternative option other than this option of attention identified on the basis of the information indicative of this voice message or this option of attention; and a display control block configured to display information indicative that the option of attention is in a selected state and information indicative of a voice message by which the option identification block identifies the alternative option as the option of attention in accordance with the acceptance by the voice message acceptance block. | 05-14-2015 |
20150142447 | SYSTEM AND METHOD FOR AN INTEGRATED, MULTI-MODAL, MULTI-DEVICE NATURAL LANGUAGE VOICE SERVICES ENVIRONMENT - A system and method for an integrated, multi-modal, multi-device natural language voice services environment may be provided. In particular, the environment may include a plurality of voice-enabled devices each having intent determination capabilities for processing multi-modal natural language inputs in addition to knowledge of the intent determination capabilities of other devices in the environment. Further, the environment may be arranged in a centralized manner, a distributed peer-to-peer manner, or various combinations thereof. As such, the various devices may cooperate to determine intent of multi-modal natural language inputs, and commands, queries, or other requests may be routed to one or more of the devices best suited to take action in response thereto. | 05-21-2015 |
20150142448 | FUNCTION EXECUTION INSTRUCTION SYSTEM, FUNCTION EXECUTION INSTRUCTION METHOD, AND FUNCTION EXECUTION INSTRUCTION PROGRAM - To appropriately execute a function based on a plurality of words, a function-execution instruction server of a function-execution instruction system includes: a function-execution instruction unit that issues an instruction of the execution of one or more tasks; a word input unit that inputs information containing a plurality of words that are arranged in order; and an executed-function determination unit that determines a task the execution of which is instructed on the basis of the order of words input. | 05-21-2015 |
20150142449 | Method and Device for Operating a Speech-Controlled Information System for a Vehicle - A voice input by a vehicle user is taken as a basis for determining at least one keyword from a set of prescribed keywords. The at least one keyword is taken as a basis for determining at least one event and/or at least one state from a set of events and/or states of the vehicle that are stored during a prescribed period of time. This involves the respective event and/or the respective state being stored in conjunction with at least one condition occurrence that characterizes a respective condition that needs to be met in order for the event to occur and/or the respective state to exist. In addition, a response is determined from a set of prescribed responses on the basis of the condition occurrence that is associated with the determined event and/or state. Furthermore, a signaling signal is determined on the basis of the ascertained response. | 05-21-2015 |
20150149182 | Sharing Intents to Provide Virtual Assistance in a Multi-Person Dialog - A computing system is operable as a virtual personal assistant (VPA) to understand relationships between different instances of natural language dialog expressed by different people in a multi-person conversational dialog session. The VPA can develop a common resource, a shared intent, which represents the VPA's semantic understanding of at least a portion of the multi-person dialog experience. The VPA can store and manipulate multiple shared intents, and can alternate between different shared intents as the multi-person conversation unfolds. With the shared intents, the computing system can generate useful action items and present the action items to one or more of the participants in the dialog session. | 05-28-2015 |
20150302846 | MOBILE DEVICE EXECUTING FACE-TO-FACE INTERACTION MONITORING, METHOD OF MONITORING FACE-TO-FACE INTERACTION USING THE SAME, AND INTERACTION MONITORING SYSTEM INCLUDING THE SAME, AND MOBILE INTERACTION MONITORING APPLICATION EXECUTED ON THE SAME - Disclosed herein is a mobile face-to-face interaction monitoring device and method using the same and system including the same, for supporting accurate and efficient turn monitoring. One embodiment of the mobile face-to-face interaction monitoring device may comprise a conversation group detector for scanning mobile devices in a surrounding area and setting a conversation group, a turn detector for determining (conversational) turn using volume topography created based on sound signals detected in the mobile devices in the conversation group, and a meta-linguistic information processor for extracting meta-linguistic context of participants or interactants in the conversation group based on the turn. Other embodiments are described and shown. | 10-22-2015 |
20150302854 | SMARTPHONE CONTROL OF ELECTRICAL DEVICES - A wireless mobile device controls at least one controllable electrical device from audio speech, by receiving the audio speech that is associated with the at least one controllable electrical device at a microphone of the wireless mobile device, generating speech data that is associated with a vocabulary set of the at least one controllable electrical device from the audio speech, converting the speech data to an ASCII string that is associated with the at least one controllable electrical device at the wireless mobile device, the ASCII string representing control data of the at least one controllable electrical device that is indicated by the audio speech, establishing a WIFI communication path with a WIFI wireless router that is associated with the at least one controllable electrical device at the wireless mobile device, and sending the ASCII string from the wireless mobile device to the WIFI wireless router that is associated with the at least one controllable electrical device through the WIFI communication path. | 10-22-2015 |
20150302855 | METHOD AND APPARATUS FOR ACTIVATING APPLICATION BY SPEECH INPUT - A method, which is performed in an electronic device, for activating a target application is disclosed. The method may include receiving an input sound stream including an activation keyword for activating the target application and a speech command indicative of a function of the target application. The method may also detect the activation keyword from the input sound stream. If the activation keyword is detected, a portion of the input sound stream including at least a portion of the speech command may be buffered in a buffer memory. In addition, in response to detecting the activation keyword, the target application may be activated to perform the function of the target application. | 10-22-2015 |
20150302857 | DEVICE CONTROL METHOD, DISPLAY CONTROL METHOD, AND PURCHASE SETTLEMENT METHOD - A device control method includes acquiring voice information, obtaining a spoken command indicating a control instruction as to a device based on the acquired voice information, identifying speaker information relating to a speaker which has uttered the acquired voice information, based on the acquired voice information, identifying, out of a plurality of devices, a device to be controlled, based on the spoken command and the speaker information, and controlling the identified device to be controlled. | 10-22-2015 |
20150310861 | PROCESSING NATURAL LANGUAGE USER INPUTS USING CONTEXT DATA - An embodiment provides a method, including: receiving, at a device, user input; identifying, using a processor, elements included in the user input; determining, using a processor, that at least one of the identified elements renders the user input ambiguous; identifying, using a processor, a source of context data; accessing, using a processor, context data associated with the user input from the source of context data; disambiguating, using a processor, the user input based on the context data associated with the user input; and forming, using a processor, an altered input based on the disambiguating. Other embodiments are described and claimed. | 10-29-2015 |
20150317299 | METHOD OF CONTROLLING A TEXT INPUT AND ELECTRONIC DEVICE THEREOF - According to various embodiments, a method for an electronic device includes receiving an input of a first word from a keypad, recognizing a voice input and converting the voice input into a text including a second word, in response to determining that the second word is erroneously recognized based on the first word, correcting the text by replacing the second word with the first word, and entering the corrected text. An electronic device for recognizing a voice includes a keypad configured to receive an input of a first word, a sensor configured to recognize a voice input, a controller configured to convert the voice input into a text including a second word, in response to determining that the second word is erroneously recognized based on the first word, correct the text by replacing the second word with the first word, and enter the corrected text. | 11-05-2015 |
20150317978 | CONTACT PRIORITIZED COMMUNICATION FOR VOICE COMMANDS - Methods and electronic devices for facilitating communication are described. In one example, the present application describes a processor-implemented method. The method includes: receiving an audio signal; determining that the audio signal includes a voice command to communicate with a contact using a first communication type; determining that communication with the contact using the first communication type is unavailable; and, after determining that communication with the contact using the first communication type is unavailable, facilitating communication with the contact using a second communication type. | 11-05-2015 |
20150324179 | VOICE CONTROL COMPONENT INSTALLATION - A method for voice control component installation is described. In one embodiment, a speech recognizable input spoken by an installer is identified, the speech recognizable input relating to installation of a system component. The system component is in communication with a control panel. An installation task for the system component is performed according to the speech recognizable input. | 11-12-2015 |
20150325241 | METHOD FOR PROCESSING DATA AND ELECTRONIC DEVICE THEREOF - An operation method of an electronic device is provided. The method includes detecting audio information from all or some of media data, determining a setting duration as at least one duration which satisfies a reference condition using the audio information, and displaying the setting duration on a display. | 11-12-2015 |
20150331664 | VOICE RECOGNITION DEVICE AND DISPLAY METHOD - Because a voice recognition device in accordance with the present invention can adjust the output of a voice recognition result according to the priority of a display of the recognition result with respect to display information other than the voice recognition result at all times while recognizing an uttered voice, the voice recognition device prevents the acquisition of other information important for the user from being blocked due to the display of the recognition result, and improves the user's convenience. | 11-19-2015 |
20150331666 | System and Method for Processing Control Commands in a Voice Interactive System - A system and method for processing user speech commands in a voice interactive system is disclosed. Users issue speech phrases on a local device in a premises network, and the local devices first determine if the speech phrases match any commands in a set of local control commands. The control commands, in examples, can activate and deactivate premises devices such as “smart” televisions and simpler lighting devices connected to home automation hubs. In the event of a command match, local actions associated with the commands are executed directly on the premises devices in response. When no match is found on the local device, the speech phrases are sent in messages to a remote server over a network cloud such as the Internet for further processing. This can save on bandwidth and cost as compared to current voice recognition systems. | 11-19-2015 |
20150338833 | System Combining an Audio Mixing Unit and a Lighting Control Unit - The present invention relates to a system configured to control a lighting of at least one light fixture, where the system comprises an audio mixing unit configured to mix audio signals, at least one actuating element provided to control the lighting of the at least one light fixture; and a first processing unit configured to determine an operating status of the at least one actuating element. The system furthermore comprises a light control unit configured to generate a light control signal for the at least one light fixture. The user of the system can use the actuating elements of the audio mixing unit to control the lighting control unit. | 11-26-2015 |
20150340042 | METHODS AND APPARATUS FOR DETECTING A VOICE COMMAND - According to some aspects, a method of monitoring an acoustic environment of a mobile device, at least one computer readable medium encoded with instructions that, when executed, perform such a method and/or a mobile device configured to perform such a method is provided. The method comprises receiving, by the mobile device, acoustic input from the environment of the mobile device, detecting whether the acoustic input includes a voice command from a user without requiring receipt of an explicit trigger from the user, and initiating responding to the detected voice command. | 11-26-2015 |
20150348554 | INTELLIGENT ASSISTANT FOR HOME AUTOMATION - This relates to systems and processes for using a virtual assistant to control electronic devices. In one example process, a user can speak an input in natural language form to a user device to control one or more electronic devices. The user device can transmit the user speech to a server to be converted into a textual representation. The server can identify the one or more electronic devices and appropriate commands to be performed by the one or more electronic devices based on the textual representation. The identified one or more devices and commands to be performed can be transmitted back to the user device, which can forward the commands to the appropriate one or more electronic devices for execution. In response to receiving the commands, the one or more electronic devices can perform the commands and transmit their current states to the user device. | 12-03-2015 |
20150348556 | DIGITAL MEDIA FRAME - A method and a device for displaying images on a digital media frame is disclosed. In one embodiment, the device includes a memory, a processing unit, a display, an interface circuit, and a display circuit. The interface circuit has at least one receiving port capable of identifying various types of networking protocols that are used to transfer the image data. The processing unit attaches auxiliary information to each image before images are stored in a memory. The display circuit displays images according to the image data received. The digital media frame further contains a user input device, which allows a user to alter the image display sequence. The user input device is an input device other than a keyboard or a cursor control device. | 12-03-2015 |
20150348557 | VOICE-CONTROLLED THREE-DIMENSIONAL FABRICATION - An additive three-dimensional fabrication system includes voice control for user interaction. This voice-controlled interface can enable a variety of voice-controlled functions and operations, while supporting interactions specific to consumer-oriented fabrication processes. | 12-03-2015 |
20150356982 | SPEECH DETECTION CIRCUIT AND METHOD - A speech detection circuit (SDC). The SDC includes a first-in, first-out (FIFO) memory array, a multiplier, a summer, a fast Fourier transformer, a counter, an RMS comparator, and a sparsity comparator. The FIFO stores a plurality of data samples. The multiplier squares the data samples. The summer sums the plurality of squared data samples. The fast Fourier transformer performs an FFT on the plurality of data samples. The counter counts a quantity of the plurality of data samples that exceed a spectral threshold. The RMS comparator compares the summed plurality of squared data samples to an RMS threshold, and the sparsity comparator compares the counted quantity to a sparsity threshold. The SDC then outputs a wakeup signal when the summed plurality of squared data samples exceeds the RMS threshold and the quantity of the plurality of data samples that exceed the spectral threshold is less than the sparsity threshold. | 12-10-2015 |
20150364142 | PLANT CONTROL SYSTEM USING VOICE AS A CONTROL MECHANISM - A system and method for controlling processing equipment. The system includes a control computer communicatively coupled to a terminal computer. Voice data for each of several authorized operators at a plant is stored. The control computer is programmed to implement a voice recognition and authenticated voice-activated control program. The control computer, responsive to receiving a voice-derived input, analyzes the voice-derived input to determine if the voice-derived input matches the voice data for any of the authorized operators. Provided the voice-derived input matches the voice data, the control computer determines at least one command from the voice-derived input for controlling the processing equipment to modify an operation at the plant. The control computer executes the command to control the processing equipment. | 12-17-2015 |
20150365510 | Command Prefix For Voice Commands - Methods, systems, and products describe hands-free operation of automotive features. A user defines a command prefix that is recognized as preceding one or more voice commands. When the user speaks the command prefix, a processor identifies the spoken command prefix and treats a next spoken word as one of the voice commands. The voice command may then be executed for control. | 12-17-2015 |
20150370530 | RECEIVING AT A DEVICE AUDIBLE INPUT THAT IS SPELLED - In one aspect, a device includes a processor, a display accessible to the processor, and a memory accessible to the processor. The memory bears instructions executable by the processor to receive first input pertaining to second input to the device that will be spelled, receive the second input, and execute a function based on the second input. The second input is audible input. | 12-24-2015 |
20150370531 | DEVICE DESIGNATION FOR AUDIO INPUT MONITORING - A computing device comprises at least one processor, and at least one module operable by the at least one processor to designate a particular computing device from a plurality of computing devices to process audio input, wherein the computing device comprises a first computing device from the plurality of computing devices. The at least one module may be further operable by the at least one processor to, if the particular computing device is not the first computing device, cease processing of audio input, and if the particular computing device is the first computing device, receive first audio input and process the first audio input to determine whether the first audio input includes a predetermined audio command. | 12-24-2015 |
20150371638 | Context Aware Sound Signature Detection - A low power sound recognition sensor is configured to receive an analog signal that may contain a signature sound. Sparse sound parameter information is extracted from the analog signal. The extracted sound parameter information is sampled in a periodic manner and a context value is updated to indicate a current environmental condition. The sparse sound parameter information is compared to both the context value and a signature sound parameter database stored locally with the sound recognition sensor to identify sounds or speech contained in the analog signal, such that identification of sound or speech is adaptive to the current environmental condition. | 12-24-2015 |
20150373393 | DISPLAY DEVICE AND OPERATING METHOD THEREOF - An operating method of a display device is provided. The method includes: recognizing, by the display device, a user's function control voice for controlling a function of a peripheral device; controlling, by the display device, the peripheral device to perform a function corresponding to the recognized function control voice in the peripheral device through a remote control device; and providing, by the display device, a control state of the peripheral device representing that the function corresponding to the recognized function control voice is performed in the peripheral device. | 12-24-2015 |
20150378671 | SYSTEM AND METHOD FOR ALLOWING USER INTERVENTION IN A SPEECH RECOGNITION PROCESS - A system and method for allowing user intervention in a speech recognition pipeline is presented. Embodiments may include receiving, at a computing device, a speech signal at a speech recognition engine, the speech signal being associated with an application. Embodiments may further include generating one or more suggested speech results at the speech recognition engine, the suggested speech results based upon, at least in part, the speech signal. Embodiments may also include displaying, at a graphical user interface associated with the computing device, the one or more suggested speech results prior to applying a final speech result. Embodiments may further include receiving a non voice-based selection of at least one of the one or more suggested speech results and applying the non voice-based selection to the application. | 12-31-2015 |
20150378672 | INFORMATION COMMUNICATION TERMINAL AND DIALOGUE PRESENTATION METHOD - An information communication terminal includes the following. An input receiving unit receives an input from a user. A communication unit obtains presentation information corresponding to an input by the user from a server according to a dialogue scenario, every time the input is received. A dialogue processing unit presents the user with the presentation information obtained by the communication unit. A communication state determination unit determines a communication state between the communication unit and the server. When the communication state determination unit makes a first determination that the communication is deteriorated during a dialogue, the dialogue processing unit causes the communication unit to obtain, as candidate presentation information, at least one presentation information with a possibility of being presented to the user after the first determination according to the dialogue scenario. | 12-31-2015 |
20150379992 | OPERATING METHOD FOR MICROPHONES AND ELECTRONIC DEVICE SUPPORTING THE SAME - An electronic device which includes a plurality of microphones and an audio data processing module is provided. The plurality of microphones is operatively coupled to the electronic device, and the audio data processing module is capable of being implemented with at least one processor. The audio data processing module recognizes a specified command, based on first audio data collected using a portion of the plurality of microphones and executes a function or an application corresponding to second audio data collected using all the plurality of microphones, when the specified command is recognized. | 12-31-2015 |
20150379993 | METHOD OF PROVIDING VOICE COMMAND AND ELECTRONIC DEVICE SUPPORTING THE SAME - An electronic device, a method, and a chip set are provided. The electronic device includes a memory configured to store at least one of audio feature data of audio data and speech recognition data obtained by speech recognition of audio data; and a control module connected to the memory, wherein the control module is configured to update a voice command that is set to execute a function through voice, the function being selected based on at least one of the audio feature data, the speech recognition data, and function execution data executed in relation to the audio data. | 12-31-2015 |
20160005404 | DEVICE CONTROL METHOD AND ELECTRIC DEVICE - A method for controlling an operation of a target device using a plurality of input devices is disclosed. The method comprises: receiving from one of the plurality of the input devices a first operation instruction issued to the target device, with a first data format; recognizing the first operation instruction and the first data format; determining that the one of the plurality of the input devices is a first input device corresponding to the first data format; and providing to a user of the target device a recommendation for a second input device, a type of the second input device being different from a type of the first input device, when it is determined that a type of the first operation instruction is identical to a type of a second operation instruction received from the second input device earlier than the reception of the first operation instruction. | 01-07-2016 |
20160011853 | METHODS AND SYSTEMS FOR MANAGING SPEECH RECOGNITION IN A MULTI-SPEECH SYSTEM ENVIRONMENT | 01-14-2016 |
20160019891 | AUDIO COMMAND ADAPTIVE PROCESSING SYSTEM AND METHOD - A system and method are provided for adaptively processing audio commands supplied by a user in an aircraft cabin, and includes receiving ambient noise in the aircraft cabin via one or more audio input device, sampling, with a processor, the received ambient noise, and analyzing, in the processor, the sampled ambient noise and, based on the analysis, selecting one or more filter functions and adjusting one or more filter parameters associated with the one or more selected filter functions. Audio and ambient noise are selectively received via the one or more audio input devices, and are filtered, through the selected one or more filter functions, to thereby supply filtered audio. | 01-21-2016 |
20160026433 | SPEECH RECOGNITION INTERFACE FOR VOICE ACTUATION OF LEGACY SYSTEMS - Methods and apparatus are disclosed for a technician to access a systems interface to back-end legacy systems by voice input commands to a speech recognition module. Generally, a user logs a computer into a systems interface which permits access to back-end legacy systems. Preferably, the systems interface includes a first server with middleware for managing the protocol interface. Preferably, the systems interface includes a second server for receiving requests and generating legacy transactions. After the computer is logged-on, a request for voice input is made. A speech recognition module is launched or otherwise activated. The user inputs voice commands that are processed to convert them to commands and text that can be recognized by the client software. The client software formats the requests and forwards them to the systems interface in order to retrieve the requested information. | 01-28-2016 |
20160028878 | METHODS AND ARRANGEMENTS EMPLOYING SENSOR-EQUIPPED SMART PHONES - The present technology concerns improvements to smart phones and related sensor-equipped systems. Some embodiments involve spoken clues, e.g., by which a user can assist a smart phone in identifying what portion of imagery captured by a smart phone camera should be processed, or identifying what type of image processing should be conducted. Some arrangements include the degradation of captured content information in accordance with privacy rules, which may be location-dependent, or based on the unusualness of the captured content, or responsive to later consultation of the stored content information by the user. A great variety of other features and arrangements are also detailed. | 01-28-2016 |
20160034246 | Transmitting Method and Transmitting Device, Receiving Method and Receiving Device, and Transfer Method and Transfer System - Data broadcast data, which is broadcast in data broadcasts, is constructed by disposing, for example, EMD (Electric Music Distribution) links required to acquire song data as actual broadcast data, which is broadcast in actual broadcasts by a transmitting device, the actual broadcast data is transmitted, and the data broadcast data wherein the EMD links for the song data in the actual broadcasts are disposed, is transmitted periodically during the transmission of the actual broadcast data. The actual broadcast data and the data broadcast data are received by a user terminal, and the EMD links disposed in the data broadcast data are stored whenever there is an input of an operation to attach a “bookmark”. Thus, audio data such as songs in programs broadcast can easily be acquired by radio. | 02-04-2016 |
20160034249 | SPEECHLESS INTERACTION WITH A SPEECH RECOGNITION DEVICE - Embodiments for interacting with speech input systems are provided. One example provides an electronic device including an earpiece, a speech input system, and a speechless input system. The electronic device further includes instructions executable to present requests to a user via audio outputs, and receive user inputs in response to the requests via a first input mode in which user inputs are made via the speech input system, and also receive user inputs in response to the requests via a second input mode in which responses to the requests are made via the speechless input system. | 02-04-2016 |
20160034250 | FLIGHT DECK MULTIFUNCTION CONTROL DISPLAY UNIT - Systems and methods for controlling a flight deck multifunction control display unit are disclosed. In various embodiments, the systems may comprise a flight management system or other MCDU driven devices, a command database that stores a plurality of voice commands and a plurality of multifunction control display unit commands. In various embodiments, each voice command is associated with one of the plurality of multifunction control display unit commands. The systems may further comprise a pilot voice interface configured to receive a voice command from a pilot and transmit the voice command to the multifunction control display unit. The multifunction control display unit can receive the voice command from the pilot voice interface and, in response, access the command database to identify a multifunction control display unit command in the command database that is associated with the voice command. | 02-04-2016 |
20160034254 | MULTI-LEVEL VOICE MENU - Methods, apparatus, and computer-readable media are described herein related to a user interface (UI) that can be implemented on a head-mountable device (HMD). The UI can include a voice-navigable UI. The voice-navigable UI can include a voice navigable menu that includes one or more menu items. The voice-navigable UI can also present a first visible menu that includes at least a portion of the voice navigable menu. In response to a first utterance comprising one of the one or more menu items, the voice-navigable UI can modify the first visible menu to display one or more commands associated with the first menu item. In response to a second utterance comprising a first command, the voice-navigable UI can invoke the first command. In some embodiments, the voice-navigable UI can display a second visible menu, where the first command can be displayed above other menu items in the second visible menu. | 02-04-2016 |
20160035350 | ELECTRONIC APPARATUS AND CONTROL METHOD THEREOF - An electronic apparatus and a controlling method thereof are disclosed. The electronic apparatus includes a voice input unit configured to receive a user voice, a storage unit configured to store a plurality of voice print feature models representing a plurality of user voices and a plurality of utterance environment models representing a plurality of environmental disturbances, a controller, in response to a user voice being input through the voice input unit, configured to extract utterance environment information of an utterance environment model among the plurality of utterance environment models corresponding to a location where the user voice is input, compare a voice print feature of the input user voice with the plurality of voice print feature models, revise a result of the comparison based on the extracted utterance environment information, and recognize a user corresponding to the input user voice based on the revised result. | 02-04-2016 |
20160035351 | DISPLAY DEVICE, METHOD OF CONTROLLING DISPLAY DEVICE, AND PROGRAM - A head mounted display device includes an image display unit that allows a user to visually recognize an image and through which outside scenery is transmitted and a microphone that detects a voice. In addition, the head mounted display device further includes a data acquisition unit that acquires data and an additional data display control unit that allows the image display unit to display an image based on the voice detected by the microphone and the data acquired by the data acquisition unit when the outside scenery is visually recognized by the user through the image display unit. | 02-04-2016 |
20160041811 | SHARED SPEECH DIALOG CAPABILITIES - The disclosure includes a speech-enabled device and method to share speech dialog capabilities of the speech-enabled device with a dumb device. The speech-enabled device includes a processor and a memory storing instructions that, when executed by the processor, cause the speech-enabled device to: receive speech dialog data of the dumb device that indicates a function of the dumb device; receive speech input; determine the function of the dumb device to be invoked based on the speech input by using the speech dialog data; generate a command effective to invoke the function of the dumb device based on the speech dialog data; and send the command to the dumb device to invoke the function of the dumb device. | 02-11-2016 |
20160042736 | METHOD FOR PROCESSING DIALOGUE BASED ON PROCESSING INSTRUCTING EXPRESSION AND APPARATUS THEREFOR - Disclosed are a method for processing a dialogue based on processing instructing expression in a multi-modal environment and an apparatus therefor. The method for processing a dialogue in an information processing device capable of processing digital signals includes the steps of: extracting an instructing expression from an inputted sentence; generating an intermediate instructing expression representing the modifying relations between the words constituting the extracted instructing expression; and searching the object corresponding with the intermediate instructing expression in a predetermined object search range. Thus, a terminal can be effectively and conveniently used without separately clarifying various instructing expressions representing things or objects with the terminal. | 02-11-2016 |
20160049147 | DISTRIBUTED VOICE INPUT PROCESSING BASED ON POWER AND SENSING - Techniques to coordinate the processing of audio in a distributed audio processing system are described. A power preferred computing device includes an audio processing coordinator to coordinate the capture and processing of audio signals by the power preferred device and secondary computing devices in a network. The network may be a personal area network. The audio processing coordinator may wake secondary devices to capture or process audio based on determinations that the power preferred device is not adequate to capture or process the audio. | 02-18-2016 |
20160049148 | SMART INPUTTING DEVICE, SETTING METHOD AND CONTROLLING METHOD THEREOF - A smart inputting device, a setting method and a controlling method thereof are provided. The smart inputting device includes a voice receiving unit and a plurality of buttons. The setting method of the smart inputting device includes the following steps. A voice command from a user is received by the voice receiving unit. A pressing signal generated from the buttons is sensed. A mapping data between the voice command and the pressing signal is recorded. | 02-18-2016 |
20160049150 | SPEECH RECOGNITION METHOD AND SPEECH RECOGNITION DEVICE - A speech recognition method that recognizes speech for causing equipment to operate includes: a speech signal acquiring step of acquiring a speech signal from a microphone disposed in a designated space; a spatial sound pressure distribution detecting step of detecting a spatial sound pressure distribution indicating a distribution of sound pressure in the space, on the basis of the acquired speech signal; a point sound source detecting step of detecting a point sound source in the space on the basis of the detected spatial sound pressure distribution; and a speech recognition controlling step of judging to conduct a speech recognition process on the acquired speech signal when the point sound source is detected. | 02-18-2016 |
20160055847 | SYSTEM AND METHOD FOR SPEECH VALIDATION - A system and method for validating a wake-up-word. Embodiments of the present disclosure may include receiving, at a first computing device, an audio signal from a second computing device, the audio signal being identified as possibly including a wake-up-word. Embodiments may further include rewinding the audio signal to a starting point of the wake-up-word, to generate a rewound audio signal. Embodiments may also include determining if the rewound audio signal includes the wake-up-word. Embodiments may further include transmitting feedback to the second computing device, wherein the feedback includes at least one of a go-back-to-sleep directive and an accepted detection directive. | 02-25-2016 |
20160055848 | SPEECH ENABLED MANAGEMENT SYSTEM - A speech-enabled management system is described herein. One system includes a grammar building tool configured to create a set of grammar keys based on ontology analytics corresponding to data received from a digital video manager (DVM) server, a speech recognition engine configured to recognize a speech command from a set of grammar files, a command translator configured to translate the recognized speech command to an executable command, and a processor configured to execute the speech command based on a particular grammar key from the set of grammar keys. | 02-25-2016 |
20160070533 | SYSTEMS AND METHODS FOR SIMULTANEOUSLY RECEIVING VOICE INSTRUCTIONS ON ONBOARD AND OFFBOARD DEVICES - To allow a user to provide a voice instruction to either a portable device or a computing device embedded within a vehicle, both the portable device and the embedded computing device receive the voice instruction such as, “Direct me to Kansas City.” Moreover, both the portable device and the embedded computing device may determine the likelihoods that the portable device and the computing device, respectively, can carry out the voice instruction. The portable device and the computing device may then communicate with each other to compare the determined likelihoods. Based on the comparison, either the portable device or the computing device may respond to the voice instruction by, for example, playing a requested song, turning on the radio in the vehicle, providing navigation directions from the current location to a destination, etc. | 03-10-2016 |
20160073350 | USER SENSING SYSTEM AND METHOD FOR LOW POWER VOICE COMMAND ACTIVATION IN WIRELESS COMMUNICATION SYSTEMS - A method of activating voice control on a wireless device includes sampling signals from a plurality of sensors on the device, determining if the device is in a hands-on state by a user on the basis of the signal sampling, and enabling a voice activated detection (VAD) application on the device on the basis of the determination. A voice controlled apparatus in a wireless device includes a plurality of sensors arranged on the device, a microphone, a controller to sample signals from one or more of the plurality of sensors, a processor coupled to the controller, and a voice activated detection (VAD) application running on the processor coupled to the controller and the microphone. | 03-10-2016 |
20160077574 | Methods and Apparatus for Unsupervised Wakeup with Time-Correlated Acoustic Events - Methods and apparatus for unsupervised wakeup of a device including receiving a first acoustic event at a first time and a second acoustic event at a second time, wherein the first and second acoustic events have scores above a first threshold identifying the first and second acoustic events as wakeup candidates for a wakeup phrase for an unsupervised wakeup of a device. It can be determined that the first acoustic event score is below a second threshold, which is higher than the first threshold and whether a difference between the first and second times is within a range to check for correlation in time between the first and second acoustic events. Occurrence of a wakeup event can be determined based upon the first and second times. | 03-17-2016 |
20160077788 | Systems and Methods for Interactive Communication Between an Object and a Smart Device - Methods for interactive communication between an object and a smart device are provided. Signals can be transmitted from the smart device to the object to control movement of a movable part at the object. Signals can also be transmitted from the smart device to the object to broadcast words and/or songs at a speaker at the object. In addition, in response to a user's touching the object, the object's speaker can broadcast words and/or songs. The signals transmitted from the smart device to the object transceiver can be audio signals so as to create a two-way interactive and live communication. In addition, voice instructions can be spoken into the microphone of the object, and then transmitted from the object to the smart device to initiate an activity at the smart device. The activity can be broadcast of the voice instructions at the speaker of the smart device, or the broadcast of a story or music at the speaker of the smart device. | 03-17-2016 |
20160077792 | METHODS AND APPARATUS FOR UNSUPERVISED WAKEUP - Methods and apparatus for unsupervised wakeup of a device including receiving a first acoustic event at a first time and a second acoustic event at a second time, wherein scores of the first and second acoustic events are above a first threshold identifying the first and second acoustic events as wakeup candidates for a wakeup phrase for an unsupervised wakeup of a device. It is determined that the first acoustic event is above a second threshold, which is higher than the first threshold, and that the second acoustic event is above a third threshold, which is higher than the first threshold. Occurrence of a wakeup event can be determined based upon acoustic similarity of the events. | 03-17-2016 |
20160077794 | DYNAMIC THRESHOLDS FOR ALWAYS LISTENING SPEECH TRIGGER - Systems and processes are disclosed for dynamically adjusting a speech trigger threshold, which can be used in triggering a virtual assistant. Audio input can be received via a microphone. The received audio input can be sampled, and a confidence level can be determined of whether the sampled audio input includes a portion of a spoken trigger. In response to the confidence level exceeding a threshold, a virtual assistant can be triggered to receive a user command from the audio input. The threshold can be dynamically adjusted in response to perceived events (e.g., events indicating a user may be more or less likely to initiate speech interactions, events indicating a trigger may be difficult to detect, events indicating a trigger was missed, etc.), thereby minimizing both missed triggers and false positive triggering events. | 03-17-2016 |
20160078083 | IMAGE DISPLAY DEVICE, METHOD FOR DRIVING THE SAME, AND COMPUTER READABLE RECORDING MEDIUM - An image display device, a method for driving the same, and a computer readable recording medium are provided. The image display device includes a speech acquirer configured to acquire a speech query associated with a query created by a user, a display configured to display a query list composed of candidate queries having the same or similar semantics as the acquired speech query, and an operation performer configured to perform an operation related to the query selected from the displayed query list. | 03-17-2016 |
20160078870 | METHOD FOR INITIATING A WIRELESS COMMUNICATION LINK USING VOICE RECOGNITION - A method for establishing a wireless mobile communication link between a vehicle communication system and a mobile network includes a vehicle communication system receiving a single continuous user verbal command string consisting of a first verbal command and at least a second verbal command. The first verbal command identifies a selected communication device for establishing the wireless mobile communication link. The communication device is selected from a group including at least two communication devices. The vehicle communication system determines if the selected communication device is communicatively connected to the vehicle communication system and proceeds to establish the wireless mobile communication link using the selected communication device. | 03-17-2016 |
20160085506 | SYSTEM AND METHOD FOR SPEECH-ENABLED ACCESS TO MEDIA CONTENT - Disclosed herein are systems, methods, and computer-readable storage media for generating a speech recognition model for a media content retrieval system. The method causes a computing device to retrieve information describing media available in a media content retrieval system, construct a graph that models how the media are interconnected based on the retrieved information, rank the information describing the media based on the graph, and generate a speech recognition model based on the ranked information. The information can be a list of actors, directors, composers, titles, and/or locations. The graph that models how the media are interconnected can further model pieces of common information between two or more media. The method can further cause the computing device to weight the graph based on the retrieved information. The graph can further model relative popularity information in the list. The method can rank information based on a PageRank algorithm. | 03-24-2016 |
20160086603 | Power-Efficient Voice Activation - A voice activation system is provided. The voice activation system includes a first module configured to receive an audio signal and output an activation signal if an energy characteristic of the audio signal satisfies a threshold stored in a memory, a control module configured to enable or disable a third state using a control signal, and a speech recognition engine coupled to the first module and the control module, the speech recognition engine configured to transition between a first state, a second state, and the third state. The speech recognition engine transitions from the first state to the second state in response to the activation signal and in response to the control signal being disabled. The speech recognition engine transitions from the first state to the third state in response to the activation signal and in response to the control signal being enabled. The speech recognition engine transitions from the third state to the second state in response to detection of a wake-up word by the speech recognition engine. | 03-24-2016 |
20160088262 | Method For Managing Storage Product In Refrigerator Using Image Recognition, And Refrigerator For Same - A refrigerator is provided. The refrigerator includes an imaging unit generating a goods loading/unloading video through video recording of storage goods loaded into or unloaded from the refrigerator; a data storage unit storing the goods loading/unloading video, goods information that may be stored in the refrigerator, and storage items information; a control unit recognizing loading/unloading of goods based on an optical flow detected through vision recognition from the goods loading/unloading video, and updating the storage items information in the refrigerator based on recognition information generated through vision recognition on loading/unloading of the goods and the goods information stored in the data storage unit; and a display unit displaying a managed state of storage items for a user based on the updated storage items information. | 03-24-2016 |
20160098990 | Information-Sharing System - An information-sharing system has a server connected to a network, the server having a processor and a coupled data repository, with software executing on the processor from a non-transitory medium providing system intelligence, a plurality of computerized communication devices coupled to the network, each having a microphone, a speaker, and a display screen, each executing coded instructions providing local intelligence at least presenting interactive interfaces to a user. The user is enabled by the coded instructions executing at the computerized communication device to record an audio or audio-video input which the system, by use of natural language processing, determines to be either a voice command or a Shout, as a situational report local to the user, and, if the enunciation is determined to be a Shout, it is transmitted to the server, where it is processed and transmitted to other users according to determinations made by the server intelligence. | 04-07-2016 |
20160098992 | Voice and Connection Platform - A system and method for providing a voice assistant including receiving, at a first device, a first audio input from a user requesting a first action; performing automatic speech recognition on the first audio input; obtaining a context of the user; performing natural language understanding based on the speech recognition of the first audio input; and taking the first action based on the context of the user and the natural language understanding. | 04-07-2016 |
20160098997 | MANAGEMENT OF VOICE COMMANDS FOR DEVICES IN A CLOUD COMPUTING ENVIRONMENT - Provided is a lightweight computational device that is configured to be in communication with a cloud both directly and via a smart computational device. The lightweight computational device receives a voice command from a user, wherein the lightweight computational device does not have adequate processing power to convert the voice command to a text command. The voice command is transmitted from the lightweight computational device to a smart computational device, wherein the smart computational device uses voice recognition to convert the voice command to a text command in the smart computational device, and transmits the text command for processing by the cloud, which provides at least one of voice recognition service and other services. The lightweight computational device receives a data response for the user from the cloud, via the smart computational device, based on the other services provided by the cloud. | 04-07-2016 |
20160103655 | Co-Verbal Interactions With Speech Reference Point - Example apparatus and methods improve efficiency and accuracy of human device interactions by combining speech with other input modalities (e.g., touch, hover, gestures, gaze) to create multi-modal interactions that are more natural and more engaging. Multi-modal interactions expand a user's expressive power with devices. A speech reference point is established based on a combination of prioritized or ordered inputs. Co-verbal interactions occur in the context of the speech reference point. Example co-verbal interactions include a command, a dictation, or a conversational interaction. The speech reference point may vary in complexity from a single discrete reference point (e.g., single touch point) to multiple simultaneous reference points to sequential reference points (single touch or multi-touch), to analog reference points associated with, for example, a gesture. Establishing the speech reference point allows surfacing additional context-appropriate user interface elements that further improve human device interactions in a natural and engaging experience. | 04-14-2016 |
20160104483 | HOTWORD DETECTION ON MULTIPLE DEVICES - Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio data that corresponds to an utterance. The actions further include determining a likelihood that the utterance includes a hotword. The actions further include determining a loudness score for the audio data. The actions further include, based on the loudness score, determining an amount of delay time. The actions further include, after the amount of delay time has elapsed, transmitting a signal that indicates that the computing device will initiate speech recognition processing on the audio data. | 04-14-2016 |
20160110159 | INPUT INFORMATION SUPPORT APPARATUS, METHOD FOR SUPPORTING INPUT INFORMATION, AND COMPUTER-READABLE RECORDING MEDIUM - A buffer receives input from a plurality of objects. A display controller performs control to cause a display unit to display received pieces of input content of the objects in divided frames in time-series order of reception and display received input content additionally in a frame that displays past input content when a specific condition is satisfied. | 04-21-2016 |
20160111088 | AUDIO VIDEO NAVIGATION DEVICE, VEHICLE AND METHOD FOR CONTROLLING THE AUDIO VIDEO NAVIGATION DEVICE - An Audio Video Navigation (AVN) device includes a voice receiver for receiving a command from a user in a voice recognition mode; a storage for storing Help; and a controller for providing the Help for the user if the number of times a same pattern has occurred is equal to or greater than a threshold in the voice recognition mode. | 04-21-2016 |
20160111091 | SYSTEM AND METHOD FOR OPERATING DEVICES USING VOICE COMMANDS - System and method for operating electric devices based on voice commands, as well as electric devices that can be controlled via voice commands. An electric device comprises an audio sensor to capture audio that contains speech; and a transmitter to transmit the captured audio to a remote server, together with a dictionary identifier that indicates to the remote server which particular dictionary or vocabulary-set to utilize for performing speech recognition on the recorded audio. The remote server performs speech recognition using the relevant dictionary table; and selects a command-code that is transmitted back to the electric device, to trigger an operational modification of the electric device. | 04-21-2016 |
20160118043 | ELECTRONIC DEVICE AND ALARM CONTROL METHOD OF THE ELECTRONIC DEVICE - In an alarm control method executed in an electronic device, voice commands are set using a voice capturing device and stored into a storage device. The voice commands are set for controlling an alarm of the electronic device. In the event the alarm rings, real-time audio data is captured using the voice capturing device. The voice commands are read from the storage device, and the voice commands are compared with the audio data to determine that at least one voice command matches the audio data. The alarm is controlled according to a matched voice command. | 04-28-2016 |
20160118048 | PROVIDING VOICE RECOGNITION SHORTCUTS BASED ON USER VERBAL INPUT - A method for determining a voice command shortcut includes receiving a first voice command providing instructions for performing a particular task and a second voice command providing additional instructions for performing the same task. The voice command shortcut may be used in place of the first and second voice commands, which are typically submitted in response to system prompts. The availability of a voice command shortcut is determined based on the first and second voice commands. If a voice command shortcut is available, an audible and/or visual notification may be provided to inform the user of the available voice command shortcut. | 04-28-2016 |
20160124706 | SYSTEM AND METHOD FOR INITIATING MULTI-MODAL SPEECH RECOGNITION USING A LONG-TOUCH GESTURE - A system, method and computer-readable storage devices are disclosed for multi-modal interactions with a system via a long-touch gesture on a touch-sensitive display. A system operating per this disclosure can receive a multi-modal input comprising speech and a touch on a display, wherein the speech comprises a pronoun. When the touch on the display has a duration longer than a threshold duration, the system can identify an object within a threshold distance of the touch, associate the object with the pronoun in the speech, to yield an association, and perform an action based on the speech and the association. | 05-05-2016 |
20160125879 | AUGMENTATION OF KEY PHRASE USER RECOGNITION - Examples for augmenting user recognition via speech are provided. One example method comprises, on a computing device, monitoring a use environment via one or more sensors including an acoustic sensor, detecting utterance of a key phrase via data from the acoustic sensor, and based upon the selected data from the acoustic sensor and also on other environmental sensor data collected at different times than the selected data from the acoustic sensor, determining a probability that the key phrase was spoken by an identified user. The method further includes, if the probability meets or exceeds a threshold probability, then performing an action on the computing device. | 05-05-2016 |
20160125880 | METHOD AND SYSTEM FOR IDENTIFYING LOCATION ASSOCIATED WITH VOICE COMMAND TO CONTROL HOME APPLIANCE - The present invention relates to a method for controlling a home appliance located in an assigned room with voice commands in a home environment. The method comprises the steps of: receiving a voice command by a user; recording the received voice command; sampling the recorded voice command and extracting features from the recorded voice command; determining a room label by comparing the extracted features of the voice command with feature references, wherein the room label is associated with the feature references; assigning the room label to the voice command; and controlling the home appliance located in the assigned room in accordance with the voice command. | 05-05-2016 |
20160125895 | VOICE INTERACTIVE SYSTEM FOR INDUSTRIAL FIELD INSTRUMENTS AND FIELD OPERATORS - A device performs a method in an industrial control and automation system. The method includes receiving a command audio signal generated by a verbal command, wherein the command audio signal includes one or more instructions. The method also includes transmitting one or more command signals to a controller to implement the one or more instructions. The method further includes receiving one or more update signals from the controller, wherein the one or more update signals are based on at least one of a parameter measured by one or more sensors or a status of one or more actuators. The method includes transmitting the one or more update signals for speech output. | 05-05-2016 |
20160132290 | GAZE TRIGGERED VOICE RECOGNITION - One embodiment provides a method, involving: detecting, at an electronic device, a location of user gaze; activating, based on the location of the user gaze, a voice input module; detecting, at the electronic device, a voice input; evaluating, using the voice input module, the voice input; and performing, based on evaluation of the voice input, at least one action. Other aspects are described and claimed. | 05-12-2016 |
20160133254 | CONTEXT-BASED ACTIONS - A computing device receives voice command inputs from a user. The device obtains a language processing result based on the voice command input. The result includes an intent and a set of arguments. The device also obtains a variety of different types of contextual information. An action is identified based on the intent, the arguments, and the contextual information, and the device then suggests the action by displaying a user selectable input mechanism that can be actuated by the user to perform the action. The device can automatically perform the action as well. | 05-12-2016 |
20160133255 | VOICE TRIGGER SENSOR - A method for voice triggering, the method may include coupling, by an interface of a voice trigger sensor, the voice trigger sensor to a computer; receiving, by the voice trigger sensor, from the computer configuration information; configuring the voice trigger sensor by using the configuration information; coupling, by the interface, the voice trigger sensor to a target device during a voice activation period; receiving, by a processor of the voice trigger sensor, during the voice activation period, input signals; applying, by the processor, on the input signals a voice activation process to detect a voice command; and | 05-12-2016 |
20160138858 | REFRIGERATOR, TERMINAL, MANAGEMENT SYSTEM AND MANAGEMENT METHOD FOR REFRIGERATOR CONTENTS - A contents management method for an item in the refrigerator includes: receiving voice information input by a user when detecting that a refrigerator door is opened, where the voice information includes basic item change information corresponding to a change operation of the item in the refrigerator by a user; and identifying the voice information input by the user, performing pretreatment to generate complete item change information corresponding to the change operation of the item in the refrigerator by the user, and transmitting the complete item change information to a terminal. This enables the terminal to generate contents management information after the item in the refrigerator has been changed. By using the refrigerator, the terminal, and the management system and the management method for an item in the refrigerator, it can be more convenient to perform intelligent management on the item in the refrigerator. | 05-19-2016 |
20160140960 | VOICE RECOGNITION SYSTEM, SERVER, DISPLAY APPARATUS AND CONTROL METHODS THEREOF - A voice recognition system includes a server storing a plurality of manuals and a display apparatus that, when a spoken voice of a user is recognized, transmits to the server a spoken voice signal corresponding to the spoken voice together with characteristic information of the display apparatus. The server transmits a response signal to the spoken voice signal to the display apparatus based on a manual corresponding to the characteristic information among the plurality of manuals, and the display apparatus processes an operation corresponding to the received response signal. As a result, user convenience increases. | 05-19-2016 |
20160140962 | PROMOTING VOICE ACTIONS TO HOTWORDS - Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, where a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. Further actions include, in response to determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, designating the voice command as a hotword. | 05-19-2016 |
20160140967 | METHOD PERFORMED BY AN APPLICATION FOR COMMUNICATION WITH A USER INSTALLED AT A MOBILE TERMINAL AND A MOBILE TERMINAL FOR COMMUNICATING WITH A USER - Disclosed are a method performed by an application for communication with a user and a mobile terminal for communicating with a user. The method, performed by the application installed at the mobile terminal for communication with the user, comprises: outputting, via an output unit of the mobile terminal, one or more questions based on a predetermined psychology algorithm; receiving, via a mic of the mobile terminal, one or more verbal answers to the questions from the user; analyzing information related to the verbal answers using the predetermined psychology algorithm; editing a response pattern using the analysis result of the information related to the verbal answers; and outputting, via the output unit of the mobile terminal, if a verbal request of the user is received through the mic of the mobile terminal, one or more responses to the verbal request using the edited response pattern. | 05-19-2016 |
20160148615 | METHOD AND ELECTRONIC DEVICE FOR VOICE RECOGNITION - Disclosed are a method and electronic device for voice recognition. The voice recognition method includes recognizing, in a first processor using a low power mode, a voice signal inputted through a microphone, entering an active state and performing voice recording in a second processor if the recognized voice signal is a previously set keyword, and performing voice recognition in the second processor if the end of a voice input is determined during the voice recording. | 05-26-2016 |
20160155442 | EXTENDING DIGITAL PERSONAL ASSISTANT ACTION PROVIDERS | 06-02-2016 |
20160155443 | DEVICE ARBITRATION FOR LISTENING DEVICES | 06-02-2016 |
20160163311 | COMMUNICATION SYSTEM - Systems and methods for responding to spoken language input or multi-modal input are described herein. More specifically, one or more user intents are determined or inferred from the spoken language input or multi-modal input to determine one or more user goals via a dialogue belief tracking system. The systems and methods disclosed herein utilize the dialogue belief tracking system to perform actions based on the determined one or more user goals and allow a device to engage in human-like conversation with a user over multiple turns of a conversation. Preventing the user from having to explicitly state each intent and desired goal while still receiving the desired goal from the device improves a user's ability to accomplish tasks, perform commands, and get desired products and/or services. Additionally, the improved response to spoken language inputs from a user improves user interactions with the device. | 06-09-2016 |
20160163313 | INFORMATION PROCESSING METHOD AND ELECTRONIC DEVICE - The present disclosure discloses an information processing method and an electronic device. The electronic device comprises a first processing unit and a second processing unit. The second processing unit is capable of executing at least one application program. The information processing method comprises: collecting first sound information; when the first sound information comprises first information which matches with audio data preset in the first processing unit, generating a first instruction, and transmitting the first instruction to the second processing unit; when it is determined that there is a first application which meets a predetermined condition in the at least one application program, generating, by the second processing unit, a second instruction based on the first instruction and the first application; and executing the second instruction in the first application. | 06-09-2016 |
20160163314 | DIALOG MANAGEMENT SYSTEM AND DIALOG MANAGEMENT METHOD - An intention estimated-weight determination processor | 06-09-2016 |
20160163315 | WIRELESS CONTROLLER INCLUDING INDICATOR - Embodiments disclosed herein provide a wireless controller which shows a voice, motion, or an image complying with or not complying with a user's command and controls an external device in accordance with the user's command. According to an embodiment, a wireless controller includes a main body provided in a shape of a flowerpot, and includes a voice recognition unit, a control unit generating a signal for controlling an object to be controlled, which is designated by a voice recognized in the voice recognition unit, in accordance with the voice, and a communication unit outputting the control signal generated in the control unit to the object to be controlled; and an indicator provided at the main body in a shape of at least one of a stem, a leaf, a flower, and a tree, and showing a motion corresponding to the voice recognized in the voice recognition unit. | 06-09-2016 |
20160163319 | DETERMINING A DEGREE OF AUTOMATICITY FOR A MOBILE SYSTEM OPERATION - The disclosure includes a system and method for determining a degree of automaticity for a mobile system operation. The system includes a processor and a memory storing instructions that, when executed, cause the system to: receive sensor data from one or more sensors communicatively coupled to the processor, determine a current state of a mobile system based on the sensor data, determine that the current state of the mobile system is a candidate to be changed to a target state based on a comparison of the current state to the target state, and determine a degree of automaticity for an operation to change the current state of the mobile system to the target state. | 06-09-2016 |
20160170710 | METHOD AND APPARATUS FOR PROCESSING VOICE INPUT | 06-16-2016 |
20160171980 | DIGITAL ASSISTANT VOICE INPUT INTEGRATION | 06-16-2016 |
20160179462 | CONNECTED DEVICE VOICE COMMAND SUPPORT | 06-23-2016 |
20160180174 | COMMODITY REGISTRATION DEVICE AND COMMODITY REGISTRATION METHOD | 06-23-2016 |
20160180844 | EXECUTING A VOICE COMMAND DURING VOICE INPUT | 06-23-2016 |
20160180853 | APPLICATION FOCUS IN SPEECH-BASED SYSTEMS | 06-23-2016 |
20160182938 | System for Controlling Electronic Devices by Means of Voice Commands, More Specifically a Remote Control to Control a Plurality of Electronic Devices by Means of Voice Commands | 06-23-2016 |
20160189717 | DISCOVERING CAPABILITIES OF THIRD-PARTY VOICE-ENABLED RESOURCES - Techniques are described for discovering capabilities of voice-enabled resources. A voice-controlled digital personal assistant can respond to user requests to list available voice-enabled resources that are capable of performing a specific task using voice input. The voice-controlled digital personal assistant can also respond to user requests to list the tasks that a particular voice-enabled resource can perform using voice input. The voice-controlled digital personal assistant can also support a practice mode in which users practice voice commands for performing tasks supported by voice-enabled resources. | 06-30-2016 |
20160203816 | SOCKET AND VOICE-RECOGNITION METHOD USING SAME | 07-14-2016 |
20160253149 | Method and Apparatus for Voice Control User Interface with Discreet Operating Mode | 09-01-2016 |
20160253996 | ACTIVATING VOICE PROCESSING FOR ASSOCIATED SPEAKER | 09-01-2016 |
20160253998 | Method and Apparatus for Voice Control User Interface with Discreet Operating Mode | 09-01-2016 |
20160379631 | SYSTEM AND METHODS FOR VOICE-CONTROLLED SEAT ADJUSTMENT - A control system for a vehicle having a seat with a first moveable portion and an adjustment actuator coupled with the first moveable seat portion includes a voice input device, a touchscreen input device, and a controller in communication with the adjustment actuator, the voice input device, and the touchscreen input device. The controller has a processor programmed to interpret a first adjustment command from one of a voice command received from the voice input device and a manual command received from the touchscreen input device, carry out a first seat adjustment by causing the adjustment actuator to move the first moveable seat portion according to the first adjustment command, and present information related to the first adjustment command on the touchscreen. | 12-29-2016 |
20160379633 | Speech-Controlled Actions Based on Keywords and Context Thereof - A device includes a plurality of components, a memory having a keyword recognition module and a context recognition module, a microphone configured to receive an input speech spoken by a user, an analog-to-digital converter configured to convert the input speech from an analog form to a digital form and generate a digitized speech, and a processor. The processor is configured to detect, using the keyword recognition module, a keyword in the digitized speech, initiate, in response to detecting the keyword by the keyword recognition module, an action to be taken by one of the plurality of components, wherein the keyword is associated with the action, determine, using the context recognition module, a context for the keyword, and execute the action if the context determined by the context recognition module indicates that the keyword is a command. | 12-29-2016 |
20160379637 | COMMUNICATION SYSTEM - Systems and methods for responding to spoken language input or multi-modal input are described herein. More specifically, one or more user intents are determined or inferred from the spoken language input or multi-modal input to determine one or more user goals via a dialogue belief tracking system. The systems and methods disclosed herein utilize the dialogue belief tracking system to perform actions based on the determined one or more user goals and allow a device to engage in human-like conversation with a user over multiple turns of a conversation. Preventing the user from having to explicitly state each intent and desired goal while still receiving the desired goal from the device improves a user's ability to accomplish tasks, perform commands, and get desired products and/or services. Additionally, the improved response to spoken language inputs from a user improves user interactions with the device. | 12-29-2016 |
20160379642 | ELECTRONIC DEVICE AND AUDIO CONVERTING METHOD THEREOF - An electronic device and an audio converting method thereof are provided. The method includes determining state information of the electronic device, receiving or outputting an audio signal based on a first mode, when the state information is first state information, and receiving or outputting the audio signal based on a second mode, when the state information is second state information. | 12-29-2016 |
20170236515 | Model for Enabling Service Providers to Address Voice-Activated Commands | 08-17-2017 |
20180024811 | METHOD AND APPARATUS FOR PROXIMITY DETECTION FOR DEVICE CONTROL | 01-25-2018 |
20180024985 | INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM | 01-25-2018 |
20180025727 | VOICE INTERACTIVE DEVICE AND UTTERANCE CONTROL METHOD | 01-25-2018 |
20180025733 | ACTIVATING VOICE ASSISTANT BASED ON AT LEAST ONE OF USER PROXIMITY AND CONTEXT | 01-25-2018 |
20190147051 | INTELLIGENT PLAYING METHOD AND APPARATUS BASED ON PREFERENCE FEEDBACK | 05-16-2019 |
20190147858 | METHODS, SYSTEMS AND APPARATUSES FOR IMPROVING SPEECH RECOGNITION USING TOUCH-BASED PREDICTIVE MODELING | 05-16-2019 |
20190147863 | METHOD AND APPARATUS FOR PLAYING MULTIMEDIA | 05-16-2019 |
20190147864 | VOICE INTERACTION BASED METHOD AND APPARATUS FOR GENERATING MULTIMEDIA PLAYLIST | 05-16-2019 |
20190147865 | CONTENT RECOGNIZING METHOD AND APPARATUS, DEVICE, AND COMPUTER STORAGE MEDIUM | 05-16-2019 |
20190147871 | INTELLIGENT INTERACTION PROCESSING METHOD AND APPARATUS, DEVICE AND COMPUTER STORAGE MEDIUM | 05-16-2019 |
20190147874 | INFORMATION CHOICE AND SECURITY VIA A DECOUPLED ROUTER WITH AN ALWAYS LISTENING ASSISTANT DEVICE | 05-16-2019 |
20190147883 | BUILDING AUTOMATION SYSTEM WITH NLP SERVICE ABSTRACTION | 05-16-2019 |
20190147905 | SECURE AND PRIVATE PROCESSING OF GESTURES VIA VIDEO INPUT | 05-16-2019 |
20220137921 | Communication System and Method - A method, computer program product, and computing system for defining a communication computing system within a computing network, wherein the computing network includes a plurality of disparate platforms configured to provide information concerning various topics; enabling a user to issue a verbal command concerning one or more of the plurality of disparate platforms; processing the verbal command to generate a platform-useable command based, at least in part, upon the verbal command; and providing the platform-useable command to at least a portion of the plurality of disparate platforms via the communication computing system. | 05-05-2022 |
20220139376 | PERSONAL SPEECH RECOMMENDATIONS USING AUDIENCE FEEDBACK - Aspects of the present invention disclose a method for generating speech recommendations for a user based on feedback data corresponding to a plurality of viewers of the user. The method includes one or more processors identifying speech of a user in audio data of the user. The method further includes identifying feedback of one or more audience members of the user associated with the speech of the user. The method further includes generating an assessment of the speech of the user, wherein the assessment is based at least in part on the feedback of the one or more audience members. The method further includes generating a speech recommendation for the speech of the user based at least in part on the assessment of the speech. | 05-05-2022 |
20220139391 | ACTIVATION MANAGEMENT FOR MULTIPLE VOICE ASSISTANTS - Systems and methods include activation of a first voice assistant application to execute a first user dialog session, the first application associated with a first voice keyword and, while the first application is active and executing the first session, reception of second audio signals representing a second voice keyword associated with a second voice assistant application, determination, in response to reception of the second audio signals, that the first application is uninterruptable, wherein the second application remains inactive in response to reception of the second audio signals, reception of a signal from the first application indicating that the first application is interruptable, reception of third audio signals representing the second keyword and, in response to reception of the third audio signals, determination that the first application is interruptable and transmission of an activation signal to the second application to activate the second application and execute a second user dialog session. | 05-05-2022 |