Patent application number | Description | Published |
20110314449 | METHOD AND SYSTEM FOR ESTIMATING EFFORT FOR MAINTENANCE OF SOFTWARE - The present invention provides a method, a system, and a computer program product for determining an effort associated with the maintenance of software. The method, the system, and the computer program product enable receiving values corresponding to predefined factors, which are segregated into corrective factors, preventive factors, perfective factors, and adaptive factors. A corrective effort is determined based on the corrective factors and predefined rules. Thereafter, a preventive effort is determined based on the preventive factors, the predefined rules, and the corrective effort. Thereafter, a perfective effort is determined based on the perfective factors, the predefined rules, and the corrective effort. Subsequently, an adaptive effort is determined based on the adaptive factors, the predefined rules, the corrective effort, the preventive effort, and the perfective effort. A total effort is then generated based on the corrective effort, the preventive effort, the perfective effort, and the adaptive effort. | 12-22-2011 |
Patent application number | Description | Published |
20110270808 | Systems and Methods for Discovering Synonymous Elements Using Context Over Multiple Similar Addresses - A clustering-based approach to data standardization is provided. Certain embodiments take as input a plurality of addresses, identify one or more features of the addresses, cluster the addresses based on the one or more features, utilize the cluster(s) to provide a data-based context useful in identifying one or more synonyms for elements contained in the address(es), and standardize the address(es) to an acceptable format, with one or more synonyms and/or other elements being added to or taken away from the input address(es) as part of the standardization process. | 11-03-2011 |
20130332407 | IN-QUERYING DATA CLEANSING WITH SEMANTIC STANDARDIZATION - The present invention relates to data cleansing, and in particular performing the semantic standardization process within a database before the transform portion of the extract-transform-load (ETL) process. Provided are a method, system and computer program product for standardizing data within a database engine, configuring the standardization function to determine at least one standardized value for at least one data value by applying the standardization table in a context of at least one data value, receiving a database query identifying the standardization function, at least one database value and the context of the data, and invoking the standardization function. | 12-12-2013 |
20130332408 | IN-QUERYING DATA CLEANSING WITH SEMANTIC STANDARDIZATION - The present invention relates to data cleansing, and in particular performing the semantic standardization process within a database before the transform portion of the extract-transform-load (ETL) process. Provided are a method, system and computer program product for standardizing data within a database engine, configuring the standardization function to determine at least one standardized value for at least one data value by applying the standardization table in a context of at least one data value, receiving a database query identifying the standardization function, at least one database value and the context of the data, and invoking the standardization function. | 12-12-2013 |
Patent application number | Description | Published |
20080256063 | TECHNIQUE FOR SEARCHING FOR KEYWORDS DETERMINING EVENT OCCURRENCE - A keyword search system including a text input unit for inputting subtexts obtained by dividing each text into parts, while associating the subtexts with an event through a process recorded in the text; a prediction device adjuster for adjusting a corresponding event prediction device to maximize the percentage of text in which the inputted event is identical to a prediction result in a first text group selected from the subtexts; a prediction processor for generating a prediction result for each section, by inputting each text in a second text group selected from the corresponding subtexts in the adjusted event prediction device; and a search unit for calculating the prediction precision for the second text group of the event prediction device using a comparison between the inputted event and the prediction result for each subtext, and searching for keywords in sections with a certain degree of prediction precision. | 10-16-2008 |
20090063150 | METHOD FOR AUTOMATICALLY IDENTIFYING SENTENCE BOUNDARIES IN NOISY CONVERSATIONAL DATA - Sentence boundaries in noisy conversational transcription data are automatically identified. Noise and transcription symbols are removed, and a training set is formed with sentence boundaries marked based on long silences or on manual markings in the transcribed data. Frequencies of head and tail n-grams that occur at the beginning and ending of sentences are determined from the training set. N-grams that occur a significant number of times in the middle of sentences in relation to their occurrences at the beginning or ending of sentences are filtered out. A boundary is marked before every head n-gram and after every tail n-gram occurring in the conversational data and remaining after filtering. Turns are identified. A boundary is marked after each turn, unless the turn ends with an impermissible tail word or is an incomplete turn. The marked boundaries in the conversational data identify sentence boundaries. | 03-05-2009 |
20090132442 | Method and Apparatus for Determining Decision Points for Streaming Conversational Data - A method for determining a decision point in real-time for a data stream from a conversation includes receiving streaming conversational data; and determining when to classify the streaming conversational data, using a measure of certainty, by performing certainty calculations at a plurality of time instances during the conversation and by selecting a decision point in response to the certainty calculations, the decision point not being based on a fixed window of conversational data but being based on accumulated conversational data available at different ones of the plurality of time instances. Systems and computer program products are also provided. | 05-21-2009 |
20110078158 | Automatic Taxonomy Enrichment - Techniques for enriching a taxonomy using one or more additional taxonomies are provided. The techniques include receiving two or more taxonomies, wherein the two or more taxonomies comprise a destination taxonomy and one or more additional taxonomies, determining one or more relevant portions of the two or more taxonomies by identifying one or more common terms between the two or more taxonomies, importing one or more relevant portions from the one or more additional taxonomies into the destination taxonomy, and using the one or more imported taxonomy portions to enrich the destination taxonomy. | 03-31-2011 |
20110191781 | RESOURCES MANAGEMENT IN DISTRIBUTED COMPUTING ENVIRONMENT - A method, system and a computer program product for determining resources allocation in a distributed computing environment. An embodiment may include identifying resources in a distributed computing environment, computing provisioning parameters, computing configuration parameters and quantifying service parameters in response to a set of service level agreements (SLA). The embodiment may further include iteratively computing a completion time required for completion of the assigned task and a cost. Embodiments may further include computing an optimal resources configuration and computing at least one of an optimal completion time and an optimal cost corresponding to the optimal resources configuration. Embodiments may further include dynamically modifying the optimal resources configuration in response to at least one change in at least one of provisioning parameters, computing parameters and quantifying service parameters. | 08-04-2011 |
20120047179 | SYSTEMS AND METHODS FOR STANDARDIZATION AND DE-DUPLICATION OF ADDRESSES USING TAXONOMY - Systems and associated methods for address standardization and applications related thereto are described. Embodiments exploit a common context in a taxonomy and a given address to detect and correct deviations in the address. Embodiments establish a possible path from a root of the taxonomy to a leaf in the taxonomy that can possibly generate a given address. Given a new address, embodiments use complete addresses, and/or segments or elements thereof, to compute the representations of the elements and find a closest matching leaf in the taxonomy. Embodiments then traverse the path to a root node to detect the agreement and disagreement between the path and the address entry. Taxonomical structured is thus used to detect, segregate and standardize the expected fields. | 02-23-2012 |
20120150825 | Cleansing a Database System to Improve Data Quality - According to one embodiment of the present invention, a system controls cleansing of data within a database system, and comprises a computer system including at least one processor. The system receives a data set from the database system, and one or more features of the data set are selected for determining values for one or more characteristics of the selected features. The determined values are applied to a data quality estimation model to determine data quality estimates for the data set. Problematic data within the data set are identified based on the data quality estimates, where the cleansing is adjusted to accommodate the identified problematic data. Embodiments of the present invention further include a method and computer program product for controlling cleansing of data within a database system in substantially the same manner described above. | 06-14-2012 |
20120179658 | Cleansing a Database System to Improve Data Quality - According to one embodiment of the present invention, a system controls cleansing of data within a database system, and comprises a computer system including at least one processor. The system receives a data set from the database system, and one or more features of the data set are selected for determining values for one or more characteristics of the selected features. The determined values are applied to a data quality estimation model to determine data quality estimates for the data set. Problematic data within the data set are identified based on the data quality estimates, where the cleansing is adjusted to accommodate the identified problematic data. Embodiments of the present invention further include a method and computer program product for controlling cleansing of data within a database system in substantially the same manner described above. | 07-12-2012 |
20120221508 | SYSTEMS AND METHODS FOR EFFICIENT DEVELOPMENT OF A RULE-BASED SYSTEM USING CROWD-SOURCING - Described herein are methods, systems, apparatuses and products for efficient development of a rule-based system. An aspect provides a method including accessing data records; converting said data records to an intermediate form; utilizing intermediate forms to compute similarity scores for said data records; and selecting as an example to be provided for rule making at least one record of said data records having a maximum dissimilarity score indicative of dissimilarity to already considered examples. | 08-30-2012 |
20120323866 | EFFICIENT DEVELOPMENT OF A RULE-BASED SYSTEM USING CROWD-SOURCING - Described herein are methods, systems, apparatuses and products for efficient development of a rule-based system. An aspect provides a method including accessing data records; converting said data records to an intermediate form; utilizing intermediate forms to compute similarity scores for said data records; and selecting as an example to be provided for rule making at least one record of said data records having a maximum dissimilarity score indicative of dissimilarity to already considered examples. | 12-20-2012 |
20130238610 | Automatically Mining Patterns For Rule Based Data Standardization Systems - Computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group. | 09-12-2013 |
20130238611 | Automatically Mining Patterns for Rule Based Data Standardization Systems - Methods, computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group. | 09-12-2013 |
20140156673 | MEASURING AND ALTERING TOPIC INFLUENCE ON EDITED AND UNEDITED MEDIA - Methods and arrangements for measuring and utilizing media topic influence. A publically disseminated media transmission is received. Public influence of the media transmission is measured via: identifying one or more media sources used to disseminate the media transmission; and obtaining one or more predetermined influence values associated with the one or more media sources. | 06-05-2014 |
20140214832 | INFORMATION GATHERING VIA CROWD-SENSING - Methods and arrangements for gathering and managing crowd-sourced information. An event is identified using crowd-sourced information, and component parts of the event are identified using the crowd-sourced information. Information missing from the event is identified using the crowd-sourced information. Individuals associated with the event are identified, and additional crowd-sourced information on the event is harvested from the individuals. | 07-31-2014 |
20140244611 | KEYWORD REFINEMENT IN TEMPORALLY EVOLVING ONLINE MEDIA - Methods and arrangements for keyword refinement and enhancement. There is received an initial keyword list comprising one or more keywords. Information is harvested from one or more information feeds, and an item is ascertained from the harvested information. One or more keywords from the initial keyword list are associated with the item. One or more new keywords are developed based on the associating of one or more keywords from the initial keyword list with the item. Other variants and embodiments are broadly contemplated herein. | 08-28-2014 |
20150066990 | SYSTEMS AND METHODS FOR DISCOVERING TEMPORAL PATTERNS IN TIME VARIANT BIPARTITE GRAPHS - Systems and methods for identifying entities sharing a temporal pattern using bipartite graphs are described. In one embodiment, a method includes identifying a temporal pattern in a sequence of bipartite graphs for a sequence of records involving two entity types, where records of the sequence of bipartite graphs vary according to time. An embodiment may color code the edges between entity types in the sequence of bipartite graphs according to the at least one temporal pattern identified (e.g., increasing sales between a business representative and a customer). An embodiment may therefore identify a time-based relationship between at least two entities according to the coded edges. Given the identification of entities having a time-based relationships, groups of these entities may be identified and trends may be derived therefrom (e.g., increasing sales for business units of a particular geographic region). | 03-05-2015 |
20150379525 | DEVELOPING A CUSTOMER BASE THROUGH SOCIAL NETWORKS AND OTHER SOURCES - Methods and arrangements for identifying a potential customer base of a concern. A contemplated method includes: identifying a prospective customer of a concern, via a social media network; identifying connections of the prospective customer within the social media network; determining a subset of the connections, wherein members of the subset comprise members of the concern; determining connections of the members of the subset; determining whether the connections of the members of the subset comprise the prospective customer; and if the connections of the members of the subset comprise the prospective customer, determining contact information of the prospective customer. Other variants and embodiments are broadly contemplated herein. | 12-31-2015 |
Patent application number | Description | Published |
20130151487 | AUTOMATIC SELECTION OF BLOCKING COLUMN FOR DE-DUPLICATION - Blocking column selection can include determining a first parameter for each column set of a plurality of column sets, wherein the first parameter indicates distribution of blocks in the column set, and determining a second parameter for each column set. The second parameter can indicate block size for the column set. For each column set, a measure of blockability that is dependent upon at least the first parameter and the second parameter can be calculated using a processor. The plurality of column sets can be ranked according to the measures of blockability. | 06-13-2013 |
20130151490 | AUTOMATIC SELECTION OF BLOCKING COLUMN FOR DE-DUPLICATION - A method of blocking column selection can include determining a first parameter for each column set of a plurality of column sets, wherein the first parameter indicates distribution of blocks in the column set, and determining a second parameter for each column set. The second parameter can indicate block size for the column set. For each column set, a measure of blockability that is dependent upon at least the first parameter and the second parameter can be calculated using a processor. The plurality of column sets can be ranked according to the measures of blockability. | 06-13-2013 |
20140297585 | Processing Spatial Joins Using a Mapreduce Framework - Techniques, systems, and articles of manufacture for processing spatial joins using a MapReduce framework. A method includes partitioning a spatial data domain based on a distribution of spatial data objects across multiple nodes of a cluster of machines, defining at least one operation to be performed on the partitioned spatial data domain based on one or more predicates of a query, and executing the at least one defined operation on the partitioned spatial data domain to determine a response to the query. | 10-02-2014 |
20140358843 | Determining Related Data Points from Multi-Modal Inputs - Techniques, systems, and articles of manufacture for determining related data points from multi-modal inputs. A method includes collecting multiple items of multi-modal data comprising at least one dimension from multiple data sources, wherein said at least one dimension comprises a geographic dimension, a temporal dimension and/or an event-related dimension, determining a window of relevance for each of the multiple items of multi-modal data with respect to the at least one dimension, and identifying two or more of the multiple items of multi-modal data as related, by determining an overlap of the window of relevance corresponding to each of the two or more items of multi-modal data with respect to the at least one dimension. | 12-04-2014 |
20140358912 | Identifying Event-Specific Social Discussion Threads - Techniques, systems, and articles of manufacture for identifying event-specific social discussion threads. A method includes identifying a spatial relationship and one or more additional relationships across two or more topical clusters derived from a text source, extracting one or more temporally evolving discussion sequences across the two or more topical clusters, identifying at least one social discussion thread across the two or more topical clusters by identifying a correlation between the one or more additional relationships and the one or more temporally evolving discussion sequences, and to identifying a geographically-constrained social discussion thread among the at least one identified social discussion thread by identifying a correlation between the spatial relationship across the two or more topical clusters and the at least one identified social discussion thread. | 12-04-2014 |
Patent application number | Description | Published |
20140269446 | SYSTEM AND METHOD FOR GEO-LOCATION BASED MEDIA RECORDING FOR A CONTACT CENTER - A system and method for recording media for a contact center includes establishing via a processor in a first geographic location, a communication session between first and second communication devices. The processor determines that media exchanged during the media session is to be recorded. The processor identifies a second geographic location associated with a resource involved in the media session, and identifies a media controller associated with the second geographic location. The media controller bridges the media path between the first and second communication devices, and the media exchanged during the media session is recorded. | 09-18-2014 |
20140270093 | SYSTEM AND METHOD FOR HANDLING CALL RECORDING FAILURES FOR A CONTACT CENTER - A system and method for handling call recording failures for a contact center. A processor receives information on a first media controller currently assigned to a telephony call. The first media controller bridges a first media path between the first and second communication devices and records, into a storage device, media exchanged in the first media path during the telephony call. The processor detects failure of the first media controller during the telephony call, where the failure of the first media controller tears down the first media path. In response to detecting the failure, the processor bridges a second media path between the first and second communication devices until a second media controller is identified. In response to the second media controller being identified, the second media controller is signaled to bridge and record media exchanged during the telephony call. | 09-18-2014 |
20140270105 | NETWORK RECORDING AND SPEECH ANALYTICS SYSTEM AND METHOD - A system and method for network recording and speech analytics wherein a recording system receives media exchanged between first and second communication devices during a telephony call. The media is received by the recording system over a wide area network. The recording system bridges a media path between the first and second communication devices, and replicates media exchanged in the media path for storing the replicated media in a mass storage device. The recording system further captures metadata associated with the call, and stores the captured metadata in association with the stored media. The stored media and metadata may then be provided to a requesting device over the wide area network. The recording system may also be configured to analyze the call recording along with the associated metadata for detecting key words or phrases and/or triggering actionable events. | 09-18-2014 |
20140270141 | CALL EVENT TAGGING AND CALL RECORDING STITCHING FOR CONTACT CENTER CALL RECORDINGS - A system and method for recording media for a contact center allows for call event tagging and call recording stitching. A processor receives call metadata for a telephony call. The call includes a link to a recording of media exchanged during the telephony call. The processor receives a call event associated with the telephony call. The call event includes a timestamp of when the event occurred during the telephony call. The processor stores the call metadata and the call event in a database record. The processor retrieves the database record for displaying the call event on a display device. A user command is received for identifying the call event in response to the display on the display device. A portion of the recording associated with the call event is retrieved in response to the user command. An audible rendering is then provided of the retrieved portion of the recording. | 09-18-2014 |
20140270154 | SYSTEM AND METHOD FOR ENCRYPTING AND RECORDING MEDIA FOR A CONTACT CENTER - A system and method for recording media for a contact center where a processor is configured to determine that media exchanged between first and second communication devices during a telephony call is to be recorded; bridge a media path between the first and second communication devices; cause replicating of the media exchanged in the media path; encrypt the replicated media via a first cryptographic key for storing the encrypted media in a data storage device; and encrypt the first cryptographic key via a second cryptographic key for storing the encrypted first cryptographic key as metadata for the encrypted media. | 09-18-2014 |
20150189074 | SYSTEM AND METHOD FOR HANDLING CALL RECORDING FAILURES FOR A CONTACT CENTER - A system and method for handling call recording failures for a contact center. A processor receives information on a first media controller currently assigned to a telephony call. The first media controller bridges a first media path between the first and second communication devices and records, into a storage device, media exchanged in the first media path during the telephony call. The processor detects failure of the first media controller during the telephony call, where the failure of the first media controller tears down the first media path. In response to detecting the failure, the processor bridges a second media path between the first and second communication devices until a second media controller is identified. In response to the second media controller being identified, the second media controller is signaled to bridge and record media exchanged during the telephony call. | 07-02-2015 |
20150312410 | NETWORK RECORDING AND SPEECH ANALYTICS SYSTEM AND METHOD - A system and method for network recording and speech analytics wherein a recording system receives media exchanged between first and second communication devices during a telephony call. The media is received by the recording system over a wide area network. The recording system bridges a media path between the first and second communication devices, and replicates media exchanged in the media path for storing the replicated media in a mass storage device. The recording system further captures metadata associated with the call, and stores the captured metadata in association with the stored media. The stored media and metadata may then be provided to a requesting device over the wide area network. The recording system may also be configured to analyze the call recording along with the associated metadata for detecting key words or phrases and/or triggering actionable events. | 10-29-2015 |