Patent application number | Description | Published |
20110145220 | SYSTEM AND METHOD FOR EXECUTING A QUERY - There is provided a computer-implemented method of executing a query. An exemplary method comprises optimizing the query. The query may specify multiple operations on corresponding multiple column sets of a table in a database. The exemplary method further comprises partitioning data of the table based on the operations within a single pass of the table. Additionally, the exemplary method comprises performing the operations on the partitioned data in parallel. | 06-16-2011 |
20120084287 | ESTIMATION OF UNIQUE DATABASE VALUES - Estimation of unique values in a database can be performed where a data field having multiple information values is provided in the database. The data field can be partitioned into multiple intervals such that each interval includes a range of information values. An interval specific Bloom filter can be calculated for each of the multiple intervals. A binary Bloom filter value can be calculated for an information value within an interval specific Bloom filter. The binary Bloom filter value can represent whether the information value is unique. A number of unique values in the database can be determined based on calculated binary Bloom filter values. | 04-05-2012 |
20130226941 | SYSTEM AND METHOD FOR CLASSIFYING SIGNALS USING THE BLOOM FILTER - The present disclosure generally relates to data processing. Bloom filters are used to process data at high speed. A Bloom filter that is initialized based on a source string can be used to quickly determine the similarity between the source string and a query string. | 08-29-2013 |
20130226972 | METHODS AND SYSTEMS FOR PROCESSING DATA ARRAYS USING BLOOM FILTERS - The present disclosure relates to computing techniques. Data arrays are processed using Bloom filters to determine aggregate count, maximum, and minimum. These methods can be used on different types of data, including data groups, partial groups, data cubes, hypercubes, and others. | 08-29-2013 |
20130290378 | ADAPTIVE PROBABILISTIC INDEXING WITH SKIP LISTS - A method of indexing in a skip list is disclosed. Key/value pairs are randomly inserted at an appropriate page in a skip list. A new page is created at the lowest level in the skip list. When creating the new page, the page is incremented to a higher level with a write probability. The new page is read during a search. When reading the new page, the page is incremented to a higher level with a read probability. The read probability is not equal to the write probability. | 10-31-2013 |
20140149433 | Estimating Unique Entry Counts Using a Counting Bloom Filter - A method of estimating a number of unique entry counts of an attribute in a database comprises, with a processor: identifying a sample of entries from an attribute database, determining frequencies of a number of input observations of the sample of entries, determining a number of high frequency values of the sample of entries, and estimating a number of unique entry counts of an attribute within the attribute database using a counting Bloom filter and based on the frequencies of the input observations and the high frequency values. | 05-29-2014 |
20140330768 | INCREMENTALLY UPDATED SAMPLE TABLES - An example apparatus may include a processor and a memory device including computer program code. The memory device and the computer program code may be for, with the processor, causing the apparatus to delete, in a sample table, rows corresponding to a predicate, wherein rows in the sample table are representative of a random sample of rows in a base table of a database; generate sample rows representative of a random sample of rows in the base table corresponding to the predicate; and add the sample rows to the sample table to generate an incrementally updated sample table. | 11-06-2014 |
20160110417 | Incrementally Updating Statistics - Incrementally updating statistics includes sampling rows from a database column in a database to generate a first sample, sampling a subset of modified rows from the database column after generating the first sample to generate a second sample, determining whether distribution changes occurred to the database column based on the first and second samples, and updating a database statistic about the database column in response to determining that a distribution change exists. | 04-21-2016 |
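
The three steps named in the abstract of application 20140330768 (delete the sample rows that match a predicate, draw a fresh random sample of the matching base-table rows, add those rows back) can be illustrated with a minimal Python sketch. This is not the claimed implementation: the function name `refresh_sample`, the in-memory lists standing in for tables, and the fixed sampling `rate` are all assumptions made for illustration.

```python
import random


def refresh_sample(base_rows, sample_rows, predicate, rate, rng=None):
    """Return a sample table in which rows matching `predicate` are resampled.

    Hypothetical helper: tables are plain lists of dicts, and `rate` is an
    assumed uniform sampling fraction between 0 and 1.
    """
    rng = rng or random.Random()
    # Step 1: delete the sample rows that correspond to the predicate.
    kept = [row for row in sample_rows if not predicate(row)]
    # Step 2: draw a fresh random sample of the matching base-table rows.
    matching = [row for row in base_rows if predicate(row)]
    k = min(len(matching), int(rate * len(matching)))
    fresh = rng.sample(matching, k)
    # Step 3: add the fresh rows to the rows that were kept.
    return kept + fresh


if __name__ == "__main__":
    base = [{"id": i, "region": "east" if i % 2 else "west"} for i in range(100)]
    old_sample = base[:10]
    new_sample = refresh_sample(
        base, old_sample, lambda r: r["region"] == "east", rate=0.1,
        rng=random.Random(0),
    )
    print(len(new_sample), "rows in the updated sample")
```

Only the slice of the sample covered by the predicate is rebuilt, so rows outside it are carried over unchanged. That is the sense in which the update is incremental in this sketch.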