Patent application number | Description | Published |
20100094829 | DATABASE QUERY PROFILER - A query profiling system and associated operating methods implement a database query profiler. The illustrative database system comprises a parser, a query model, a catalog manager, a binder and view rewriter, and a query analyzer. The parser parses statements by checking syntax of a statement according to a structured query language (SQL) grammar and representing the statements in a syntax-oriented internal format. The catalog manager processes data definition SQL statements (DDL) that are transformed to the internal representation; transform the representations of the DDL to database objects abstractions; groups the database objects into a Catalog. The binder and view rewriter take the internal representations obtained by parsing workload queries (data manipulation SQL statements or DML) and rewrite table references by their corresponding View definitions; bind database object references to their actual database objects contained in the catalog. The query analyzer analyzes the queries for relevant attributes and stores the attributes in a profile repository. | 04-15-2010 |
20100114976 | Method For Database Design - Embodiments in accordance with the present invention include methods and systems for database design. A method includes analyzing tables to generate recommendations for an initial design of a database before the database is constructed. | 05-06-2010 |
20100228783 | Desensitizing Character Strings - A method for desensitizing character strings comprises ordering an original set of character strings and generating a tree structure that has nodes corresponding to the ordered set of sensitive strings. The sensitive characters on the nodes of the tree are then desensitized in depth-first order such that a set of desensitized strings is generated that preserves the ordering of the original sensitive set. | 09-09-2010 |
20100280990 | ETL FOR PROCESS DATA WAREHOUSE - One embodiment is a method extract information technology (IT) events that indicate start and completion times of a business process. The method transforms the IT events into business data changes that are agnostic to multiple different ETL implementation languages and transforms the business data changes into execution data. Execution data is stored in a data warehouse. | 11-04-2010 |
20110047525 | QUALITY-DRIVEN ETL DESIGN OPTIMIZATION - A method for quality objective-based ETL pipeline optimization is provided. An improvement objective is obtained from user input into a computing system. The improvement objective represents a priority optimization desired by a user for improved ETL flows for an application designed to run in memory of the computing system. An ETL flow is created in the memory of the computing system. The ETL flow is restructured for flow optimization with a processor of the computing system. The flow restructuring is based on the improvement objective. Flow restructuring can include application of flow rewriting optimization or application of an algebraic rewriting optimization. The optimized ETL flow is stored as executable code on a computer readable storage medium. | 02-24-2011 |
20110209149 | OPTIMIZATION OF INTEGRATION FLOW PLANS - Computer-based methods, computer-readable storage media and computer systems are provided for optimizing integration flow plans. An initial integration flow plan, one or more objectives and/or an objective function related to the one or more objectives may be received as input. A computing cost of the initial integration flow plan may be compared with the objective function. Using one or more heuristics, a set of close-to-optimal integration flow plans may be identified from all possible integration flow plans that are functionally equivalent to the initial integration flow plan. A close-to-optimal integration flow plan with a lowest computing cost may be selected from the set as a replacement for the initial integration flow plan. | 08-25-2011 |
20120072413 | SYSTEM AND METHOD FOR COMPARING DATABASE QUERY PLANS - A method and device may accept a first database query plan including a plurality of rows and a second database query plan including a plurality of rows, compare each row in the first database query plan with a corresponding row in the second database query plan, and based on each comparison, calculate a difference between the compared rows. Each difference may be recorded in an output file. | 03-22-2012 |
20120076416 | DETERMINING CORRELATIONS BETWEEN SLOW STREAM AND FAST STREAM INFORMATION - A collection of documents are correlated with information items in a fast stream of information using categorical hierarchical neighborhood trees (C-HNTs). First data entities extracted from the documents are inserted into corresponding C-HNTs. The first data entities that are neighbors in the C-HNTs of second data entities extracted from the fast stream items are identified. Similarities between the documents and the fast stream items are determined based on the location at which the neighbors are located. | 03-29-2012 |
20120089620 | EXTRACTING DATA - Information can be extracted from unstructured documents using embodiments described herein. An entity recognition may be performed on an unstructured document and found entities may be annotated. Annotating includes inserting tags around the found entities to generate marked entities. A rule is applied to each of the marked entities in the unstructured document to generate a confidence value for every marked entity, wherein the rule comprises a plurality of prefixes for a target entity and a plurality of suffixes for the target entity. A marked entity with the highest confidence value is selected as an extraction target. | 04-12-2012 |
20120197903 | OBJECTIVE-FUNCTION BASED SENTIMENT - A system and article are disclosed for objective-function based sentiment. In one example, the system includes a set of domain information, and a computer programmed with executable instructions which operate a set of modules. The modules include a sentiment polarization module for identifying a domain-aspect opinion-word pair within a set of domain data, and assigning a sentiment polarity score to the domain-aspect opinion-word pair based on an objective function which includes sentiment data from the domain information. | 08-02-2012 |
20120197950 | SENTIMENT CUBE - A sentiment cube system is disclosed. In one example, the system discloses a sentiment storage, including a sentiment cube data structure having a set of cells arranged by a set of dimensions. The system includes a computer programmed with executable instructions which operate a set of modules, wherein the modules comprise: a sentiment storage module which receives sentiment values associated with a set of entity features, and then populates a hierarchy of the cells in the sentiment cube with the sentiment values. A sentiment analysis module effecting a set of operations on the sentiment cube. | 08-02-2012 |
20130018892 | Visually Representing How a Sentiment Score is ComputedAANM Castellanos; Maria G.AACI SunnyvaleAAST CAAACO USAAGP Castellanos; Maria G. Sunnyvale CA USAANM Ruiz; PerlaAACI HermosilloAACO MXAAGP Ruiz; Perla Hermosillo MXAANM Dayal; UmeshwarAACI SaratogaAAST CAAACO USAAGP Dayal; Umeshwar Saratoga CA USAANM Dekhil; MohamedAACI Santa ClaraAAST CAAACO USAAGP Dekhil; Mohamed Santa Clara CA US - A method of visually representing how a sentiment score is computed comprises, with a sentiment scoring device, determining a number of sentiment scores for each of a number of attributes within a forum, writing a visualization file in a database based on metadata representing the sentiment scores, and outputting, to an output device, a representation of how the sentiment score was computed based on the visualization file. A system for displaying to a user how a sentiment score is computed comprises a sentiment scoring device, a forum source communicatively coupled to the sentiment scoring device, and an output device communicatively coupled to the sentiment scoring device, in which the sentiment scoring device obtains text from the forum source, determines sentiment scores for a number of attributes within the text, and outputs, to the output device, a representation of how the sentiment score was computed. | 01-17-2013 |
20130191306 | Providing Operational Business Intelligence - Embodiments described herein can be used to provide business intelligence. For example, a tangible, computer-readable medium may include code configured to direct a processor to create a conceptual model of an business process. The code may be configured to direct the processor to parse the conceptual model to create a logical model of the business process, and to parse the logical model to create a physical model of the business process. | 07-25-2013 |
20130253907 | INTENTION STATEMENT VISUALIZATION - An example system includes an extraction module, an intention processing module, and an intention visualization module. The extraction module is configured to ingest textual data from a text source. The intention processing module is configured to process the textual data and identify one or more intention statements within the textual data. The intention visualization module is configured to provide an interactive interface that facilitates filtering and visualization of aspects of the one or more intention statements. | 09-26-2013 |
20130259362 | ATTRIBUTE CLOUD - An example system includes at least one memory and at least one processing device. The at least one memory comprises instructions that when executed by the at least one processing device cause the system to receive a text corpus, filter the text corpus to obtain a plurality of attributes, determine the semantic cohesion between the plurality of attributes, and create an attribute cloud including the plurality of attributes, where the placement of the plurality of attributes within the attribute cloud is based at least in part on the determined semantic cohesion between the plurality of attributes. | 10-03-2013 |
20130263019 | ANALYZING SOCIAL MEDIA - A system, method and a non-transitory computer readable medium comprising instructions for automated analysis of for analyzing social media, the method comprising a processor to acquire data as a snapshot or a continuous stream from one or more online sites via adapters. Storing data in a database, the database configured for rapid acquisition of data and rapid responses to queries from one or a plurality of users. Analyzing the data using one or a plurality of algorithms, the algorithms configured to distill insight at an attribute level, and presenting one or a plurality of graphical user interfaces on a user-configurable, and temporal-view adjustable dashboard, the dashboard configured to present one or more results of said one or a plurality of algorithms, said one or more results depicted through one or a plurality of paradigms of data visualization. | 10-03-2013 |
20130290232 | IDENTIFYING NEWS EVENTS THAT CAUSE A SHIFT IN SENTIMENT - A method identifies news events that cause shifts in sentiments. The method includes compiling a sentiment time series, the sentiment time series expressing a shift in sentiment; compiling a news events time series; correlating the sentiment and news events time series; identifying from the correlation news events that caused a shift in sentiment and predicting if a selected news event may cause a shift in sentiment in the future. | 10-31-2013 |
20130325440 | GENERATION OF EXPLANATORY SUMMARIES - A method for generating sum maries of text is described. The method includes the step of extracting features from text of text lists from summaries. The explanatoriness of the text is then evaluated, wherein evaluating the explanatoriness of text includes evaluating the features of the text, including at least the step of evaluating the discriminativeness of the features of the text by comparing the text to a first text data set, wherein the first text data set is derived from a topic label. The evaluated text is then ranked based on the explanatoriness evaluation. | 12-05-2013 |
20140039782 | DETERMINING A SPATIOTEMPORAL IMPACT OF A PLANNED EVENT ON TRAFFIC - Systems and methods are provided for determining an impact of a planned event. Traffic flow velocity is measured at each of a plurality of traffic sensors over a period of time before the planned event, with each traffic sensor having an associated geographical location. A plurality of measured traffic flow velocities for each traffic sensor are stored on a non-transitory computer readable medium. A traffic flow velocity at each traffic sensor is measured at each of a plurality of representative times within a defined time period including a known start time and an expected end time of the event. A threshold velocity is calculated from the stored traffic flow velocities for each traffic sensor corresponding to each representative time. For each representative time, a set of traffic sensors is determined at which the measured traffic flow velocity during the defined time period is less than the threshold velocity. | 02-06-2014 |
20140108455 | Capturing Intentions Within Online Text - A method of capturing intentions within online text comprises with a data mining device ( | 04-17-2014 |
20140156707 | STREAMING SYSTEM PERFORMANCE OPTIMIZATION - According to an example, a dataflow directed acyclic graph including nodes and edges connecting the nodes. Parameters for the nodes are calculated, and the parameters may include batch size and a degree of parallelism. | 06-05-2014 |
20140304549 | RECOVERING A FAILURE IN A DATA PROCESSING SYSTEM - A technique of recovering a failure in a data processing system comprises recording a number of input channels and sequence numbers for a number of input tuples transferred to a recipient task, recording a number of output channels and sequence numbers for a number of output tuples, and if a failure occurs, resolving the input and output channels. | 10-09-2014 |
Patent application number | Description | Published |
20110202387 | Data Prediction for Business Process Metrics - Embodiments in accordance with the present invention include methods and systems for data prediction. A method includes analyzing time-series data in a business process with a single-metric technique and with a multiple-metric technique; and combining predictions from the single-metric technique and the multiple-metric technique to predict a predetermined change in the business process | 08-18-2011 |
20130283233 | MULTI-ENGINE EXECUTABLE DATA-FLOW EDITOR AND TRANSLATOR - A system, and a corresponding method, that allow a programmer to create and edit a data-flow employing multiple execution engines are provided. The system includes a data-flow editor and a data-flow translator. The method includes providing an illustration of the data-flow and metadata associated with the data-flow on a graphical user interface; representing the data-flow and the metadata by a first code language; dividing the data-flow illustrated on the graphical user interface into fragments; and translating the first code language into the execution code language of the execution engine corresponding to each of the fragments. Each of the fragments are executable on different execution engines and each of the different execution engines are supported by a different execution code language | 10-24-2013 |
20140149417 | CAUSAL TOPIC MINER - Causal topic mining can include incorporating non-text time series data with a number of articles based on a time relationship and analyzing the incorporated non-text time series data and the number of articles at a particular time to determine a causal relationship. | 05-29-2014 |
20140325527 | RESENDING MESSAGES - A technique for resending messages can include determining if a particular message with a first unique identifier has been received by a recipient task among a number of potential recipient tasks in a distributed streaming system, notifying the number of potential recipient tasks to ignore the particular message with the first unique identifier, and resending the particular message with a second unique identifier. | 10-30-2014 |
20150106155 | Determining and Visualizing Social Media Expressed Sentiment - A technique includes determining social media expressed sentiment, including processing data indicative of a plurality of social media messages to decompose each of the social media messages into a plurality of attributes; for each social media message, identifying attributes of the plurality of attributes, which are associated with user selected attribute categories and are part of the message, and for each identified attribute, assigning a sentiment to the attribute and updating statistics for the selected attribute categories based on the assigned sentiment. The technique includes visualizing the social media expressed sentiment, including displaying at least some of the statistics. | 04-16-2015 |
20160087759 | TUPLE RECOVERY - A technique for recovering tuples can include sending or receiving a request to reseed a tuple. | 03-24-2016 |