Patent application number | Description | Published |
20080208585 | Ordering Recognition Results Produced By An Automatic Speech Recognition Engine For A Multimodal Application - Ordering recognition results produced by an automatic speech recognition (‘ASR’) engine for a multimodal application implemented with a grammar of the multimodal application in the ASR engine, with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine through a VoiceXML interpreter, includes: receiving, in the VoiceXML interpreter from the multimodal application, a voice utterance; determining, by the VoiceXML interpreter using the ASR engine, a plurality of recognition results in dependence upon the voice utterance and the grammar; determining, by the VoiceXML interpreter according to semantic interpretation scripts of the grammar, a weight for each recognition result; and sorting, by the VoiceXML interpreter, the plurality of recognition results in dependence upon the weight for each recognition result. | 08-28-2008 |
20080208592 | Configuring A Speech Engine For A Multimodal Application Based On Location - Methods, apparatus, and products are disclosed for configuring a speech engine for a multimodal application based on location. The multimodal application operates on a multimodal device supporting multiple modes of user interaction with the multimodal application. The multimodal application is operatively coupled to a speech engine. Configuring a speech engine for a multimodal application based on location includes: receiving a location change notification in a location change monitor from a device location manager, the location change notification specifying a current location of the multimodal device; identifying, by the location change monitor, location-based configuration parameters for the speech engine in dependence upon the current location of the multimodal device, the location-based configuration parameters specifying a configuration for the speech engine at the current location; and updating, by the location change monitor, a current configuration for the speech engine according to the identified location-based configuration parameters. | 08-28-2008 |
20080208593 | Altering Behavior Of A Multimodal Application Based On Location - Methods, apparatus, and products are disclosed for altering behavior of a multimodal application based on location. The multimodal application operates on a multimodal device supporting multiple modes of user interaction with the multimodal application, including a voice mode and one or more non-voice modes. The voice mode of user interaction with the multimodal application is supported by a voice interpreter. Altering behavior of a multimodal application based on location includes: receiving a location change notification in the voice interpreter from a device location manager, the device location manager operatively coupled to a position detection component of the multimodal device, the location change notification specifying a current location of the multimodal device; updating, by the voice interpreter, location-based environment parameters for the voice interpreter in dependence upon the current location of the multimodal device; and interpreting, by the voice interpreter, the multimodal application in dependence upon the location-based environment parameters. | 08-28-2008 |
20080235029 | Speech-Enabled Predictive Text Selection For A Multimodal Application - Methods, apparatus, and products are disclosed for speech-enabled predictive text selection for a multimodal application, the multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to an automatic speech recognition (‘ASR’) engine through a VoiceXML interpreter, including: identifying, by the VoiceXML interpreter, a text prediction event, the text prediction event characterized by one or more predictive texts for a text input field of the multimodal application; creating, by the VoiceXML interpreter, a grammar in dependence upon the predictive texts; receiving, by the VoiceXML interpreter, a voice utterance from a user; and determining, by the VoiceXML interpreter using the ASR engine, recognition results in dependence upon the voice utterance and the grammar, the recognition results representing a user selection of a particular predictive text. | 09-25-2008 |
20080255850 | Providing Expressive User Interaction With A Multimodal Application - Methods, apparatus, and products are disclosed for providing expressive user interaction with a multimodal application, the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of user interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to a speech engine through a VoiceXML interpreter, including: receiving, by the multimodal browser, user input from a user through a particular mode of user interaction; determining, by the multimodal browser, user output for the user in dependence upon the user input; determining, by the multimodal browser, a style for the user output in dependence upon the user input, the style specifying expressive output characteristics for at least one other mode of user interaction; and rendering, by the multimodal browser, the user output in dependence upon the style. | 10-16-2008 |
20120166199 | HOSTED VOICE RECOGNITION SYSTEM FOR WIRELESS DEVICES - Methods, systems, and software for converting the audio input of a user of a hand-held client device or mobile phone into a textual representation by means of a backend server accessed by the device through a communications network. The text is then inserted into or used by an application of the client device to send a text message, instant message, email, or to insert a request into a web-based application or service. In one embodiment, the method includes the steps of initializing or launching the application on the device; recording and transmitting the recorded audio message from the client device to the backend server through a client-server communication protocol; converting the transmitted audio message into the textual representation in the backend server; and sending the converted text message back to the client device or forwarding it on to an alternate destination directly from the server. | 06-28-2012 |
20140208210 | DISPLAYING SPEECH COMMAND INPUT STATE INFORMATION IN A MULTIMODAL BROWSER - Methods, systems, and products are disclosed for displaying speech command input state information in a multimodal browser including displaying an icon representing a speech command type and displaying an icon representing the input state of the speech command. In typical embodiments, the icon representing a speech command type and the icon representing the input state of the speech command also includes attributes of a single icon. Typical embodiments include accepting from a user a speech command of the speech command type, changing the input state of the speech command, and displaying another icon representing the changed input state of the speech command. Typical embodiments also include displaying the text of the speech command in association with the icon representing the speech command type. | 07-24-2014 |
Patent application number | Description | Published |
20090055175 | CONTINUOUS SPEECH TRANSCRIPTION PERFORMANCE INDICATION - A method of providing speech transcription performance indication includes receiving, at a user device data representing text transcribed from an audio stream by an ASR system, and data representing a metric associated with the audio stream; displaying, via the user device, said text; and via the user device, providing, in user-perceptible form, an indicator of said metric. Another method includes displaying, by a user device, text transcribed from an audio stream by an ASR system; and via the user device, providing, in user-perceptible form, an indicator of a level of background noise of the audio stream. Another method includes receiving data representing an audio stream; converting said data representing an audio stream to text via an ASR system; determining a metric associated with the audio stream; transmitting data representing said text to a user device; and transmitting data representing said metric to the user device. | 02-26-2009 |
20090076917 | FACILITATING PRESENTATION OF ADS RELATING TO WORDS OF A MESSAGE - Targeted delivery of contextually relevant ad impressions to a mobile device is provided. The ad impressions are delivered within text messages and/or instant message chat threads. Monetizing of text messaging and instant messaging by providers of such services is achieved, while providing unobtrusive and contextually relevant information to users of such services. | 03-19-2009 |
20090083032 | METHODS AND SYSTEMS FOR DYNAMICALLY UPDATING WEB SERVICE PROFILE INFORMATION BY PARSING TRANSCRIBED MESSAGE STRINGS - Systems, methods, and software for parsing and/or filtering message strings of text messages and/or instant messages in order to identify keywords, phrases, or fragments as a function of which user preferences of user profiles are dynamically updated are disclosed. Such systems, methods, and software are utilized in the context of a communication system including text messaging, instant messaging, or both. Furthermore, such communication system preferably includes an automatic speech recognition (ASR) system. Additionally, ad impressions are selected and delivered to users based, at least in part, on the parsing and/or filtering and/or data maintained in user profiles as dynamically updated from time to time. The ad impression preferably is delivered within a text message or within an instant message conversation and is generally unobtrusive. Revenues preferably may be generated from the delivering of the ad impressions, whereby a provider of instant messaging or text messaging may further derive monetary benefit from providing such service and whereby users of such service may be provided with contextually relevant information in an unobtrusive manner. | 03-26-2009 |
20090240488 | CORRECTIVE FEEDBACK LOOP FOR AUTOMATED SPEECH RECOGNITION - A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“ASR”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier. | 09-24-2009 |
20090248415 | USE OF METADATA TO POST PROCESS SPEECH RECOGNITION OUTPUT - A method of utilizing metadata stored in a computer-readable medium to assist in the conversion of an audio stream to a text stream. The method compares personally identifiable data, such as a user's electronic address book and/or Caller/Recipient ID information (in the case of processing voice mail to text), to the n-best results generated by a speech recognition engine for each word that is output by the engine. A goal of this comparison is to correct a possible misrecognition of a spoken proper noun such as a name or company with its proper textual form or a spoken phone number to correctly formatted phone number with Arabic numerals to improve the overall accuracy of the output of the voice recognition system. | 10-01-2009 |
20100058200 | FACILITATING PRESENTATION BY MOBILE DEVICE OF ADDITIONAL CONTENT FOR A WORD OR PHRASE UPON UTTERANCE THEREOF - A method for presenting additional content for a word that is part of a message, and that is presented by a mobile communication device, includes the steps of: presenting the message, including emphasizing one or more words for which respective additional content is available for presenting by the mobile communication device; receiving an utterance that includes an emphasized word for which additional content is available for presenting by the mobile communication device; and presenting the additional content for the emphasized word included in the utterance received by the mobile communication device. These steps are performed by the mobile communication device. | 03-04-2010 |
20120303445 | FACILITATING PRESENTATION OF ADS RELATING TO WORDS OF A MESSAGE - Targeted delivery of contextually relevant ad impressions to a mobile device is provided. The ad impressions are delivered within text messages and/or instant message chat threads. Monetizing of text messaging and instant messaging by providers of such services is achieved, while providing unobtrusive and contextually relevant information to users of such services. | 11-29-2012 |
20150025884 | CORRECTIVE FEEDBACK LOOP FOR AUTOMATED SPEECH RECOGNITION - A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“ASR”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier. | 01-22-2015 |
Patent application number | Description | Published |
20100307647 | Metastable Beta-Titanium Alloys and Methods of Processing the Same by Direct Aging - Metastable beta titanium alloys and methods of processing metastable β-titanium alloys are disclosed. For example, certain non-limiting embodiments relate to metastable β-titanium alloys, such as binary β-titanium alloys comprising greater than 10 weight percent molybdenum, having tensile strengths of at least 150 ksi and elongations of at least 12 percent. Other non-limiting embodiments relate to methods of processing metastable β-titanium alloys, and more specifically, methods of processing binary β-titanium alloys comprising greater than 10 weight percent molybdenum, wherein the method comprises hot working and direct aging the metastable β-titanium alloy at a temperature below the β-transus temperature of the metastable β-titanium alloy for a time sufficient to form α-phase precipitates in the metastable β-titanium alloy. Articles of manufacture comprising binary β-titanium alloys according to various non-limiting embodiments disclosed herein are also disclosed. | 12-09-2010 |
20110038751 | METASTABLE BETA-TITANIUM ALLOYS AND METHODS OF PROCESSING THE SAME BY DIRECT AGING - Metastable beta titanium alloys and methods of processing metastable β-titanium alloys are disclosed. For example, certain non-limiting embodiments relate to metastable β-titanium alloys, such as binary β-titanium alloys comprising greater than 10 weight percent molybdenum, having tensile strengths of at least 150 ksi and elongations of at least 12 percent. Other non-limiting embodiments relate to methods of processing metastable β-titanium alloys, and more specifically, methods of processing binary β-titanium alloys comprising greater than 10 weight percent molybdenum, wherein the method comprises hot working and direct aging the metastable β-titanium alloy at a temperature below the β-transus temperature of the metastable β-titanium alloy for a time sufficient to form α-phase precipitates in the metastable β-titanium alloy. Articles of manufacture comprising binary β-titanium alloys according to various non-limiting embodiments disclosed herein are also disclosed. | 02-17-2011 |
20120166199 | HOSTED VOICE RECOGNITION SYSTEM FOR WIRELESS DEVICES - Methods, systems, and software for converting the audio input of a user of a hand-held client device or mobile phone into a textual representation by means of a backend server accessed by the device through a communications network. The text is then inserted into or used by an application of the client device to send a text message, instant message, email, or to insert a request into a web-based application or service. In one embodiment, the method includes the steps of initializing or launching the application on the device; recording and transmitting the recorded audio message from the client device to the backend server through a client-server communication protocol; converting the transmitted audio message into the textual representation in the backend server; and sending the converted text message back to the client device or forwarding it on to an alternate destination directly from the server. | 06-28-2012 |
20140065010 | TITANIUM ALLOYS INCLUDING INCREASED OXYGEN CONTENT AND EXHIBITING IMPROVED MECHANICAL PROPERTIES - One aspect of the present disclosure is directed to a metastable β titanium alloy comprising, in weight percentages: up to 0.05 nitrogen; up to 0.10 carbon; up to 0.015 hydrogen; up to 0.10 iron; greater than 0.20 oxygen; 14.00 to 16.00 molybdenum; titanium; and incidental impurities. Articles of manufacture including the alloy also are disclosed. | 03-06-2014 |
20140076468 | METASTABLE BETA-TITANIUM ALLOYS AND METHODS OF PROCESSING THE SAME BY DIRECT AGING - Metastable beta titanium alloys and methods of processing metastable (β-titanium alloys are disclosed. For example, certain non-limiting embodiments relate to metastable (β-titanium alloys, such as binary β-titanium alloys comprising greater than 10 weight percent molybdenum, having tensile strengths of at least 150 ksi and elongations of at least 12 percent. Other non-limiting embodiments relate to methods of processing metastable β-titanium alloys, and more specifically, methods of processing binary (β-titanium alloys comprising greater than 10 weight percent molybdenum, wherein the method comprises hot working and aging the metastable β-titanium alloy at a temperature below the (β-transus temperature of the metastable (β-titanium alloy for a time sufficient to form α-phase precipitates in the metastable β-titanium alloy. The metastable β-titanium alloys are not solution heat treated after hot working and prior to aging. Articles of manufacture comprising binary β-titanium alloys according to various non-limiting embodiments disclosed herein are also disclosed. | 03-20-2014 |
Patent application number | Description | Published |
20090076917 | FACILITATING PRESENTATION OF ADS RELATING TO WORDS OF A MESSAGE - Targeted delivery of contextually relevant ad impressions to a mobile device is provided. The ad impressions are delivered within text messages and/or instant message chat threads. Monetizing of text messaging and instant messaging by providers of such services is achieved, while providing unobtrusive and contextually relevant information to users of such services. | 03-19-2009 |
20090083032 | METHODS AND SYSTEMS FOR DYNAMICALLY UPDATING WEB SERVICE PROFILE INFORMATION BY PARSING TRANSCRIBED MESSAGE STRINGS - Systems, methods, and software for parsing and/or filtering message strings of text messages and/or instant messages in order to identify keywords, phrases, or fragments as a function of which user preferences of user profiles are dynamically updated are disclosed. Such systems, methods, and software are utilized in the context of a communication system including text messaging, instant messaging, or both. Furthermore, such communication system preferably includes an automatic speech recognition (ASR) system. Additionally, ad impressions are selected and delivered to users based, at least in part, on the parsing and/or filtering and/or data maintained in user profiles as dynamically updated from time to time. The ad impression preferably is delivered within a text message or within an instant message conversation and is generally unobtrusive. Revenues preferably may be generated from the delivering of the ad impressions, whereby a provider of instant messaging or text messaging may further derive monetary benefit from providing such service and whereby users of such service may be provided with contextually relevant information in an unobtrusive manner. | 03-26-2009 |
20090240488 | CORRECTIVE FEEDBACK LOOP FOR AUTOMATED SPEECH RECOGNITION - A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“ASR”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier. | 09-24-2009 |
20090248415 | USE OF METADATA TO POST PROCESS SPEECH RECOGNITION OUTPUT - A method of utilizing metadata stored in a computer-readable medium to assist in the conversion of an audio stream to a text stream. The method compares personally identifiable data, such as a user's electronic address book and/or Caller/Recipient ID information (in the case of processing voice mail to text), to the n-best results generated by a speech recognition engine for each word that is output by the engine. A goal of this comparison is to correct a possible misrecognition of a spoken proper noun such as a name or company with its proper textual form or a spoken phone number to correctly formatted phone number with Arabic numerals to improve the overall accuracy of the output of the voice recognition system. | 10-01-2009 |
20100058200 | FACILITATING PRESENTATION BY MOBILE DEVICE OF ADDITIONAL CONTENT FOR A WORD OR PHRASE UPON UTTERANCE THEREOF - A method for presenting additional content for a word that is part of a message, and that is presented by a mobile communication device, includes the steps of: presenting the message, including emphasizing one or more words for which respective additional content is available for presenting by the mobile communication device; receiving an utterance that includes an emphasized word for which additional content is available for presenting by the mobile communication device; and presenting the additional content for the emphasized word included in the utterance received by the mobile communication device. These steps are performed by the mobile communication device. | 03-04-2010 |
20120303445 | FACILITATING PRESENTATION OF ADS RELATING TO WORDS OF A MESSAGE - Targeted delivery of contextually relevant ad impressions to a mobile device is provided. The ad impressions are delivered within text messages and/or instant message chat threads. Monetizing of text messaging and instant messaging by providers of such services is achieved, while providing unobtrusive and contextually relevant information to users of such services. | 11-29-2012 |
20150025884 | CORRECTIVE FEEDBACK LOOP FOR AUTOMATED SPEECH RECOGNITION - A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“ASR”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier. | 01-22-2015 |