Patent application number | Description | Published |
20080222087 | System and Method for Optimizing Query Access to a Database Comprising Hierarchically-Organized Data - An cost based optimizer optimizes access to at least a portion of hierarchically-organized documents, such as those formatted using eXtensible Markup Language (XML), by estimating a number of results produced by the access of the hierarchically-organized documents. Estimating the number of results comprises computing the cardinality of each operator executing query language expressions and further computing a sequence size of sequences of hierarchically-organized nodes produced by the query language expressions. Access to the hierarchically-organized documents is optimized using the structure of the query expression and/or path statistics involving the hierarchically-organized data. The cardinality and the sequence size are used to calculate a cost estimation for execution of alternate query execution plans. Based on the cost estimation, an optimal query execution plan is selected from among the alternate query execution plans. | 09-11-2008 |
20090106244 | DISCOVERING INTERESTINGNESS IN FACETED SEARCH - Exemplary embodiments of the present invention relate to enhanced faceted search support for OLAP queries over unstructured text as well as structured dimensions by the dynamic and automatic discovery of dimensions that are determined to be most “interesting” to a user based upon the data. Within the exemplary embodiments “interestingness” is defined as how surprising a summary along some dimensions is from a user's expectation. Further, multi-attribute facets are determined and a user is optionally permitted to specify the distribution of values that she expects, and/or the distance metric by which actual and expected distributions are to be compared. | 04-23-2009 |
20090132568 | DATA CLASSIFICATION BY KERNEL DENSITY SHAPE INTERPOLATION OF CLUSTERS - A method for representing a dataset comprises clustering the dataset using an unsupervised, non-parametric clustering method to generate a set of clusters each comprising a set of data points in an image; clustering the data points of each cluster using a supervised, partitional clustering method to partition each cluster into a specified number of sub-clusters; generating a density estimate value of each grid point of a set of grid points sampled from the image at a specified resolution for each sub-cluster using a kernel density function; identifying a maximum density estimate value and a sub-cluster associated with the maximum density estimate value for the grid point; adding each grid point for which the maximum density estimate value exceeds a specified threshold to the sub-cluster associated with the maximum density estimate value; and, for each cluster, merging the sub-clusters of the cluster into a corresponding cluster region in the image. | 05-21-2009 |
20090132594 | DATA CLASSIFICATION BY KERNEL DENSITY SHAPE INTERPOLATION OF CLUSTERS - A data processing system is provided that comprises a processor, a random access memory for storing data and programs for execution by the processor, and computer readable instructions stored in the random access memory for execution by the processor to perform a method for obtaining a shape interpolated representation of shapes of clusters in an image of a clustered dataset. The method comprises generating a density estimate value of each grid point of a set of grid points sampled from the image at a specified resolution for each cluster using a kernel density function; evaluating the density estimate value of each grid point for each cluster to identify a maximum density estimate value of each grid point and a cluster associated with the maximum density estimate value; and adding each grid point for which the maximum density estimate value exceeds a specified threshold to the associated cluster to form a shape interpolated representation. | 05-21-2009 |
20090271664 | METHOD FOR MONITORING DEPENDENT METRIC STREAMS FOR ANOMALIES - A method for monitoring dependent metric streams for anomalies including identifying a plurality of sets of dependent metric streams from a plurality of metric streams of a computer system by measuring an association of the plurality of metric streams using a statistical dependency measure analysis, wherein each set includes a plurality of the dependent metric streams and each metric stream includes a plurality of data, determining a subset of the plurality of sets of dependent metric streams to monitor by selecting a quantity of the sets of dependent metric streams that have a highest statistical dependency, cleaning the data of each set of dependent metric streams of the subset by identifying and removing outlier data, fitting a probability density function to the cleaned data of each set of dependent metric streams of the subset, wherein the probability density function is a likelihood function that provides a likelihood of an occurrence of the cleaned data, determining a detection threshold that is a lower threshold on the likelihood of the occurrence of the cleaned data of each set of dependent metric streams of the subset based on the likelihood function, detecting an anomaly if a likelihood of an occurrence of a new data of one of the sets of dependent metric streams of the subset is less than the detection threshold, and transmitting an alert signal in response to detecting the anomaly. | 10-29-2009 |
20110047144 | SYSTEM, METHOD, AND APPARATUS FOR PARALLELIZING QUERY OPTIMIZATION - A computer program product that includes a computer useable storage medium to store a computer readable program that, when executed on a computer, causes the computer to perform operations, including operations to receive a query for which a query execution plan (QEP) is to be computed, divide a search space into a plurality of subproblems for which constituent QEPs are to be created, partition the plurality of subproblems into a plurality of partitions, and allocate each of the plurality of partitions to a thread of a plurality of threads within a multiple thread architecture. Possible QEPs describe a search space. Each subproblem references one or more quantifiers and each of the subproblems within a partition references the same number of quantifiers. A partition containing subproblems referencing fewer quantifiers is executed before a partition containing subproblems referencing more quantifiers. | 02-24-2011 |
20130103655 | MULTI-LEVEL DATABASE COMPRESSION - Embodiments of the invention relate to a multi-level database compression technique to compress table data objects stored in pages. A compact dictionary structure is encoded that represents frequent values of data at any level of granularity. More than one level of compression is provided, wherein input to a finer level of granularity is an output of a coarser level of granularity. Based upon the encoded dictionary structure, a compression technique is applied to a stored page to compress each row on the page. Similarly, a de-compression technique may be applied to decompress the compressed data, utilizing the same dictionary structures at each level of granularity. | 04-25-2013 |
20130325900 | INTRA-BLOCK PARTITIONING FOR DATABASE MANAGEMENT - A method for storing database information includes storing a table having data values in a column major order. The data values are stored in a list of blocks. The method also includes assigning a tuple sequence number (TSN) to each data value in each column of the table according to a sequence order in the table. The data values that correspond to each other across a plurality of columns of the table have equivalent TSNs. The method also includes assigning each data value to a partition based on a representation of the data value. The method also includes assigning a tuple map value to each data value. The tuple map value identifies the partition in which each data value is located. | 12-05-2013 |
20130325901 | INTRA-BLOCK PARTITIONING FOR DATABASE MANAGEMENT - A method for storing database information, including: storing a table having data values in a column major order, wherein the data values are stored in a list of blocks, assigning a tuple sequence number (TSN) to each data value in each column of the table according to a sequence order in the table, wherein data values that correspond to each other across a plurality of columns of the table have equivalent TSNs; assigning each data value to a partition based on a representation of the data value; and assigning a tuple map value to each data value, wherein the tuple map value identifies the partition in which each data value is located. | 12-05-2013 |
20140006379 | EFFICIENT PARTITIONED JOINS IN A DATABASE WITH COLUMN-MAJOR LAYOUT | 01-02-2014 |
20140006380 | EFFICIENT PARTITIONED JOINS IN A DATABASE WITH COLUMN-MAJOR LAYOUT | 01-02-2014 |
20140074818 | MULTIPLICATION-BASED METHOD FOR STITCHING RESULTS OF PREDICATE EVALUATION IN COLUMN STORES - A system joins predicate evaluated column bitmaps having varying lengths. The system includes a column unifier for querying column values with a predicate and generating an indicator bit for each of the column values that is then joined with the respective column value. The system also includes a bitmap generator for creating a column-major linear bitmap from the column values and indicator bits. The column unifier also determines an offset between adjacent indicator bits. The system also includes a converter for multiplying the column-major linear bitmap with a multiplier to shift the indicator bits into consecutive positions in the linear bitmap. | 03-13-2014 |
20140214796 | EFFICIENT JOIN WITH ONE OR MORE LARGE DIMENSION TABLES - Embodiments of the invention relate to processing queries that utilize fact and/or dimension tables. In one aspect, a pre-join filtering phase precedes a star join. The necessary conditions for the pre-join filtering are considered for a given SQL query, including an estimated size of the hash table exceeding a threshold and presence of a local predicate either on the fact table or one or more dimension tables that is not a large dimension table. Once the necessary conditions are satisfied, the execution of the query exploits the pre-join filtering to build a pre-join output filter from columns of a reduced fact table that joins with each large dimension table. Thereafter, all the dimension tables and the fact table are joined in a star join while exploiting each pre-join filter. Accordingly, the order of when joins occur is changed in order to reduce the size of the fact table and to work from the fact table to reduce the size of large dimension tables. | 07-31-2014 |
20140214900 | SUPPORTING FLEXIBLE TYPES IN A DATABASE - Providing database support. A first group of data are received, the first group of data being expressed in a first format, and a second group of data are received, the second group of data being expressed in a second format, the second format being different from the first format. The first and second groups of data are merged, and are represented in at least one common column. Such representing includes: maintaining the first and second formats; and providing a tuple map which provides reference to the first and second formats. | 07-31-2014 |
20140330801 | LOCK-FREE CREATION OF HASH TABLES IN PARALLEL - A hash table is created in parallel without requiring a lock or random accesses to memory. The hash table of a database system is logically partitioned and a separate thread is assigned to each partition of the hash table. As many separate threads as can fit their corresponding hash table partitions into the processor's cache are executed in parallel with other threads without a lock. Execution of a number of separate threads includes: scanning an input data table for a thread's partition and applying a hash function to each key, inserting data of keys that hash to the thread's partition into the thread's partition, and ignoring keys that do not hash to the thread's partition. | 11-06-2014 |
20140372389 | Data Encoding and Processing Columnar Data - Aspects of the invention are provided for accessing a plurality of data elements. A page of column data is stored in a format that includes compressed and/or non-compressed elements, with the format including a plurality of arrays and a vector. Each of the arrays stores elements with common characteristics, with the vector functioning as a mapping to the stored data elements. The vector is leveraged to identify an array and determine an offset to support access to one or more of the data elements. | 12-18-2014 |
20140372411 | ON-THE-FLY ENCODING METHOD FOR EFFICIENT GROUPING AND AGGREGATION - Embodiments include a method and computer program product for encoding data while it is being processed as part of a query is provided. The method includes receiving a query request and determining a set of values associated with data to be encoded for completing the query request. The method also includes encoding those values such that any subsequent processing operations can be performed on the encoded values to complete the requested query. After performing the subsequent processing operations to complete the requested query, each value is decoded back to its original value. | 12-18-2014 |
20140372470 | ON-THE-FLY ENCODING METHOD FOR EFFICIENT GROUPING AND AGGREGATION - Embodiments include a system for encoding data while it is being processed. The system includes a processor, an encoder and a decoder. The processor is configured to process a query request by determining a set of values. The encoder is configured for encoding the set of values, such that a subsequent processing operation can be performed on the encoded values. The processor performs the subsequent processing operations. The decoder is configured for decoding each value back to its value prior to being encoded upon completion of the processor completing the requested query. | 12-18-2014 |
Patent application number | Description | Published |
20110245257 | HETEROCYCLIC COMPOUNDS AND THEIR USES - Substituted bicyclic heteroaryls and compositions containing them, for the treatment of general inflammation, arthritis, rheumatic diseases, osteoarthritis, inflammatory bowel disorders, inflammatory eye disorders, inflammatory or unstable bladder disorders, psoriasis, skin complaints with inflammatory components, chronic inflammatory conditions, including but not restricted to autoimmune diseases such as systemic lupus erythematosis (SLE), myestenia gravis, rheumatoid arthritis, acute disseminated encephalomyelitis, idiopathic thrombocytopenic purpura, multiples sclerosis, Sjoegren's syndrome and autoimmune hemolytic anemia, allergic conditions including all forms of hypersensitivity, The present invention also enables methods for treating cancers that are mediated, dependent on or associated with p110δ activity, including but not restricted to leukemias, such as Acute Myeloid leukaemia (AML) Myelo-dysplastic syndrome (MDS) myelo-proliferative diseases (MPD) Chronic Myeloid Leukemia (CML) T-cell Acute Lymphoblastic leukaemia (T-ALL) B-cell Acute Lymphoblastic leukaemia (B-ALL) Non Hodgkins Lymphoma (NHL) B-cell lymphoma and solid tumors, such as breast cancer. | 10-06-2011 |
20120107275 | FUSED HETEROCYCLIC DERIVATIVES AND METHODS OF USE - Selected compounds are effective for prophylaxis and treatment of diseases, such as HGF mediated diseases. The invention encompasses novel compounds, analogs, prodrugs and pharmaceutically acceptable salts thereof, pharmaceutical compositions and methods for prophylaxis and treatment of diseases and other maladies or conditions involving, cancer and the like. The subject invention also relates to processes for making such compounds as well as to intermediates useful in such processes. | 05-03-2012 |
Patent application number | Description | Published |
20120083502 | HETEROCYCLIC COMPOUNDS AND THEIR USES - Substituted bicyclic heteroaryls and compositions containing them, for the treatment of general inflammation, arthritis, rheumatic diseases, osteoarthritis, inflammatory bowel disorders, inflammatory eye disorders, inflammatory or unstable bladder disorders, psoriasis, skin complaints with inflammatory components, chronic inflammatory conditions, including but not restricted to autoimmune diseases such as systemic lupus erythematosis (SLE), myestenia gravis, rheumatoid arthritis, acute disseminated encephalomyelitis, idiopathic thrombocytopenic purpura, multiples sclerosis, Sjoegren's syndrome and autoimmune hemolytic anemia, allergic conditions including all forms of hypersensitivity, The present invention also enables methods for treating cancers that are mediated, dependent on or associated with p110 activity, including but not restricted to leukemias, such as Acute Myeloid leukaemia (AML) Myelo-dysplastic syndrome (MDS) myelo-proliferative diseases (MPD) Chronic Myeloid Leukemia (CML) T-cell Acute Lymphoblastic leukaemia (T-ALL) B-cell Acute Lymphoblastic leukaemia (B-ALL) Non Hodgkins Lymphoma (NHL) B-cell lymphoma and solid tumors, such as breast cancer. | 04-05-2012 |
20120094972 | HETEROCYCLIC COMPOUNDS AND THEIR USES - Substituted bicyclic heteroaryls and compositions containing them, for the treatment of general inflammation, arthritis, rheumatic diseases, osteoarthritis, inflammatory bowel disorders, inflammatory eye disorders, inflammatory or unstable bladder disorders, psoriasis, skin complaints with inflammatory components, chronic inflammatory conditions, including but not restricted to autoimmune diseases such as systemic lupus erythematosis (SLE), myestenia gravis, rheumatoid arthritis, acute disseminated encephalomyelitis, idiopathic thrombocytopenic purpura, multiples sclerosis, Sjoegren's syndrome and autoimmune hemolytic anemia, allergic conditions including all forms of hypersensitivity, The present invention also enables methods for treating cancers that are mediated, dependent on or associated with p110 activity, including but not restricted to leukemias, such as Acute Myeloid leukaemia (AML) Myelo-dysplastic syndrome (MDS) myelo-proliferative diseases (MPD) Chronic Myeloid Leukemia (CML) T-cell Acute Lymphoblastic leukaemia (T-ALL) B-cell Acute Lymphoblastic leukaemia (B-ALL) Non Hodgkins Lymphoma (NHL) B-cell lymphoma and solid tumors, such as breast cancer. | 04-19-2012 |
20130231352 | HETEROCYCLIC COMPOUNDS AND THEIR USES - Substituted bicyclic heteroaryls having the general formula (I) and compositions containing them, for the treatment of general inflammation, arthritis, rheumatic diseases, osteoarthritis, inflammatory bowel disorders, inflammatory eye disorders, inflammatory or unstable bladder disorders, psoriasis, skin complaints with inflammatory components, chronic inflammatory conditions, including but not restricted to autoimmune diseases such as systemic lupus erythematosis (SLE), myestenia gravis, rheumatoid arthritis, acute disseminated encephalomyelitis, idiopathic thrombocytopenic purpura, multiples sclerosis, Sjoegren's syndrome and autoimmune hemolytic anemia, allergic conditions including all forms of hypersensitivity, The present invention also enables methods for treating cancers that are mediated, dependent on or associated with p1 108 activity, including but not restricted to leukemias, such as Acute Myeloid leukaemia (AML), Myelodysplastic syndrome (MDS), myelo-proliferative diseases (MPD), Chronic Myeloid Leukemia (CML), T-cell Acute Lymphoblastic leukaemia (T-ALL), B-cell Acute Lymphoblastic leukaemia (B-ALL), Non Hodgkins Lymphoma (NHL), B-cell lymphoma and solid tumors, such as breast cancer. | 09-05-2013 |