Entries |
Document | Title | Date |
20100057807 | Processing data from genotyping chips - Processing genetic data is disclosed, including: receiving two or more genetic data sets for an individual from one or more genetic data sources; merging the genetic data sets from the one or more genetic data sources, including identifying a duplicate SNP between the genetic data sets and determining one or more data values to be stored for the duplicate SNP; and storing a single set of merged genetic data for the individual. | 03-04-2010 |
20100057808 | INTELLIGENT CONTAINER INDEX AND SEARCH - Aspects are introduced for indexing contents of a shell namespace extension. A plurality of files are indexed, and it is encountered that one of the files contains at least one file reference. The file is then accessed as a shell namespace extension so that the file reference can be indexed. | 03-04-2010 |
20100063970 | METHOD FOR MANAGING AND PROCESSING INFORMATION OF AN OBJECT FOR PRESENTATION OF MULTIPLE SOURCES AND APPARATUS FOR CONDUCTING SAID METHOD - When preparing meta data for a stored arbitrary content, the present method creates meta data including protocol information and access location information of the arbitrary content, creates an item for an auxiliary content that shall be played in synchronization with the arbitrary content, and incorporates identifying information of the item into the meta data. Further, information on language data of the auxiliary content is written in the created item. auxiliary item | 03-11-2010 |
20100063971 | UNIVERSAL ANNOTATION CONFIGURATION AND DEPLOYMENT - Systems and articles of manufacture for managing annotations made for a variety of different type data objects manipulated (e.g., created, edited, and viewed) by a variety of different type applications are provided. Some embodiments allow users collaborating on a project to create, view, and edit annotations from within the applications used to manipulate the annotated data objects, which may facilitate and encourage the capturing and sharing of tacit knowledge through annotations. Further, annotations may be stored separate from the application data they describe, decoupling the tacit knowledge captured in the annotations from the applications used to manipulate the annotated data. | 03-11-2010 |
20100070507 | HYBRID CONTENT RECOMMENDING SERVER, SYSTEM, AND METHOD - A content recommending server includes: a content information collecting section collecting content information including metadata of contents from a content server through a network; a content database storing the content information collected by the content information collecting section; a user profile collecting section collecting user profiles of users from user terminals through the network, each of the user profiles including each user's preference; a user profile database storing the user profiles, the user profiles including a subject user profile; a content indexer acquiring the metadata and generating content indices of the contents; a user indexer acquiring the user profiles from the user profile database and generating user indices of each of the users; an index database storing the content indices and the user indices; and a content recommending section receiving the subject user profile, searching the index database for an certain index corresponding to the subject user profile, and determining a recommend content. | 03-18-2010 |
20100076980 | TOKENIZED JAVASCRIPT INDEXING SYSTEM - A client generates an index token for each of a plurality of data objects received from a server as a function of at least one of the plurality of fields of the data object. The client creates an index for the plurality of data objects based on the generated index token for each data object. The client may then utilize the index to search plurality of data objects to identify and render a subset of the data objects. | 03-25-2010 |
20100076981 | Method and Apparatus for Efficient Indexed Storage for Unstructured Content - A method and apparatus for efficient indexed storage for unstructured content have been disclosed. | 03-25-2010 |
20100082629 | SYSTEM FOR ASSOCIATING DATA ITEMS WITH CONTEXT - A system is described for associating data items with context. The system may include a processor, a memory and an interface. The processor may identify an action performed by a user and may determine the spatial, temporal, social and topical attributes of the action. The spatial attribute of the action may relate to the user's location, the temporal attribute may relate to the time the action was performed, the social attribute may relate to a social relation of the user, and the topical attribute may relate to a topic of interest to the user. The processor may store an association between a descriptor of the action, the spatial attribute of the action, the temporal attribute of the action, the social attribute of the action and the topical attribute of the action in the memory. The processor may use the stored association to provide a contextually relevant data item via the interface. | 04-01-2010 |
20100082630 | PERSISTING EXTERNAL INDEX DATA IN A DATABASE - Systems, methods, and computer program products to persist external index data in a database are disclosed. In an embodiment, a method includes maintaining a database including a first data table that is configured to store data in a database format and a second data table that is configured to store index data. The index data represents an index of a first portion of the first data table, and the index maintained externally to the database by an index manager. The method also includes generating a data update representing a modification to the first data table and communicating the data update to the index manager. The method further includes receiving index update data from the index manager representing a modification to the index as a result of the data update and applying the index update data to the second data table. | 04-01-2010 |
20100082631 | SYSTEM AND METHOD FOR DYNAMICALLY CONFIGURING CONTENT-DRIVEN RELATIONSHIPS AMONG DATA ELEMENTS - A computer system configures data elements based on textual sources by identifying subunits of a textual source, indexing the subunits into a sequence comprised of terms, identifies based on a target a base subsequence of the sequence, and stores the terms in such a way that they can be expanded or contracted and a user can rapidly and efficiently derive relevant information and context even from a vast amount of information including by navigable display to the user. Other methods and systems of configuring and displaying data elements from textual sources are provided. | 04-01-2010 |
20100082632 | SYSTEMS AND METHODS FOR HANDLING PERFORMANCE MONITORING DATA - The present disclosure provides systems and methods for handling performance monitoring (PM) data in a database in a manner that supports different PM data sets from different network element (NE) device types in an integrated manner. The present invention defines a single database structure that can be utilized to store, handle, retrieve, etc. PM counters from multiple NEs including NEs of different types, e.g. SONET/SDH NE, DWDM NE, optical switch, Ethernet switch, etc. The single database enables an efficient mechanism to handle PM data in a uniform way across a network with different device types. | 04-01-2010 |
20100082633 | Database index and database for indexing text documents - The present invention concerns a database index for indexing one or more text documents ( | 04-01-2010 |
20100082634 | System, Method, and User Interface for Organizing and Searching Information - A computer-implemented system, method, and user interface for searching and organizing information, particularly large information sets such as those found on the Internet and World Wide Web. Information is organized and searched according to content, and this organization is reflected directly in the user interface provided to users for searching as well as the search results they are shown. | 04-01-2010 |
20100082635 | AUDIO/VIDEO ARCHIVING SYSTEM AND METHOD - A method of archiving audio/video content is presented. In the method, a plurality of audio/video content streams is received. Each of the received audio/video content streams is indexed according to time. Each of the indexed audio/video content streams is transferred to a data storage system for storage. A request for a portion of one of the stored audio/video content streams associated with a time period is received. The requested portion of the one of the stored audio/video content streams is retrieved from the data storage system. The requested portion of the one of the stored audio/video content streams is transferred. | 04-01-2010 |
20100088318 | INFORMATION SEARCH SYSTEM, METHOD, AND PROGRAM - Disclosed is a system in which an index registration unit registers an index, which will be used for search processing, as a partitioned index which is partitioned on a time series basis, and a search means reads indexes older than a specified point in time, which is used as a search base point, to perform search processing, thereby searching for information based on a point in time in the past. | 04-08-2010 |
20100088319 | System and Method for Efficiently Storing Information Related to a Telephone number - Information related to telephone numbers is efficiently stored and retrieved by generating an index based on a portion of a telephone number for which information is to be stored or retrieved. For example, the portion of the telephone number can be a prefix of the telephone number. Information related to the telephone number is stored and retrieved in accordance with the generated index. | 04-08-2010 |
20100094875 | Method and system for classifying text - A content classification system, method and computer product is presented. In connection with the invention, a data structure is created by identifying a plurality of words and mapping each word to one or more categories. The data structure is indexed. An item of content is identified and classified based on the data structure. The classification includes identifying all one—or more—word combinations in the item of content; for each word of at least a pre-determined number of characters in length in each of the word combinations, identifying each of the categories to which it is mapped; and determining a weight for each of the words based on an inverse proportion to the number of categories to which it is mapped. | 04-15-2010 |
20100094876 | METHOD, APPARATUS AND SYSTEM OF SEARCHING AND DOWNLOADING MOBILE TELEPHONE FILE - A method of searching and downloading mobile telephone file, a server of the locality domain of a mobile terminal receives a keyword information of a searching request, searches the resource in the mobile communication network, and returns the searched resource list to the mobile terminal, the mobile terminal receives data resource information selected by user according as the resource list, he mobile terminal sends the request of downloading the needed resource to the server of the locality domain, and the server of the domain of the mobile terminal sends the resource to the mobile terminal. An apparatus of searching and downloading mobile telephone lie is provided also, which includes a storage, a receiving mould, a sending mould, a request mould, and can also include a data base, a communication interface and a statistic mould. A system of searching and downloading mobile telephone file is provided also. | 04-15-2010 |
20100100548 | STORING CHANGE FEATURES DETECTED FROM HIGH-RESOLUTION, REMOTELY SENSED IMAGERY - A method, system, and medium are provided for identifying facilitating storage and retrieval of information associated with high-resolution, remotely sensed imagery. In one embodiment, a data structure includes information derived from change-detection information associated with remotely sensed imagery, which change-detection information is useable by the application program, and wherein the data structure includes a plurality of change indexes, each of which (1) indicates change in a given item from a first state to a second state and (2) is associated with a feature set that includes one or more of item-texture features and item-spectral features, wherein each of the change indexes is an entropy balanced statistical (EBS) k-d tree index that describes features associated with items within a geographic area that is reflected in the imagery, and wherein the features include one or more of the item-texture features and item-spectral features. | 04-22-2010 |
20100100549 | CONTENTS SPACE FORMING APPARATUS, METHOD OF THE SAME, COMPUTER, PROGRAM, AND STORAGE MEDIA - A space providing server analyzes content to generate meta data. Further the space providing server generates contents space information defining the coordinates of a plurality of main bodies of contents data on a network according to the amount of features thereof, based on the meta data. The terminal apparatus displays a screen which includes image corresponding to the plurality of contents, based on the coordinates included in the contents space information. | 04-22-2010 |
20100100550 | Apparatus, System and Method For Providing Contents and User Interface Program - A character folder generator generates one or more character folders, each of which corresponds to a specific character, at each layer of a hierarchical folder structure, and provides a terminal with information of said one or more character folders generated at each layer. A folder ID obtainer obtains identification information of the selected character folders from the terminal, when one of the character folders at each layer is selected at the terminal. A character sequence generator generates a character sequence identified based on a sequence of the character folders, each of which has been selected at each layer. A searcher searches for files by using as a keyword the generated character sequence and generates one or more files retrieved by search in the character folder finally selected at a certain layer and providing the terminal with information of the generated files in the finally selected character folder. | 04-22-2010 |
20100100551 | SYSTEM AND METHOD OF DYNAMICALLY GENERATING INDEX INFORMATION - A system and method of generating index information for electronic documents. The system includes a client, one or more information retrieval (IR) engines, such as a search engine, which are each in communication with each other via a network. In one embodiment of the invention, the server maintains a plurality or data objects that are protected by digital rights management (DRM) software. Upon receiving a network request from one of the IR systems, the server dynamically generates an electronic document that provides index information that is associated with one of the data objects. In one embodiment of the invention, the server dynamically generates the contents of the electronic document based upon the indexing characteristics of the IR system. Furthermore, upon receiving a network request from one of the client, the server determines whether the client is authorized to access the data object that is associated with the network request. If the client is authorized to access the data object, the server transmits the data object to the user. Alternatively, if the client is not authorized to access the data object, the server dynamically prepares instructions to the client, the instructions describing additional steps the user at the client may perform to get authorized to access the data object. | 04-22-2010 |
20100106727 | SYSTEM AND METHOD FOR ENHANCING KEYWORD RELEVANCE BY USER'S INTEREST ON THE SEARCH RESULT DOCUMENTS - A system and method are provided for enabling a user to search for documents that the user has previously viewed on its local machine. The system includes three main components: the desktop integration module, the index module, and the graphical user interface module. The desktop integration module is an application which monitors documents with which the user interacts for predetermined events, and obtains content data and metadata from the monitored documents. The index module indexes the content data and metadata received from the desktop integration module. The graphical user interface module then permits a user to utilize the desktop integration module and index module by allowing a user to search for a document. | 04-29-2010 |
20100106728 | System and Method for Indexing, Searching and Presenting Technical Concepts - A data processing system and method are provided to index, search and present information about defined, “fuzzy” concepts across language, chemical and product identity, composition, biological classification and legal reference. The database system and method permits the use of combinations of concepts, such as labeling, health claims and/or nutritional claims, across linguistic, scientific and legal usages. The user is able to search for concepts in a defined way across any language or means of reference. The definition of the concepts used in the database is accepted as a standard by the database system (user(s). Such standardization then permits one user to pass the results of a search to another user with common meaning, and permits value-added services, such as the capability to provide news services about changes in worldwide regulation of goods and services, i.e., changes in information applicable to the concepts of interest to specific users. | 04-29-2010 |
20100114896 | METHOD, SYSTEM AND APPARATUS FOR HANDLING A REQUEST FOR A MEDIA-OVER-PACKET COMMUNICATION SESSION - Embodiments of the present invention are directed to a method, system and apparatus handling a request for a media-over-packet communication session. The method comprises receiving a request for a media-over-packet communication session destined for a network address. Responsive to determining existence of an active communication session with a first communication client registered to the network address, the method further comprises determining an identifier associated with a second commmunication client registered to the network address, the second communication client different from the first communication client; and routing the request to the second communication client using the identifier. | 05-06-2010 |
20100114897 | INDEXING AND SEARCHING A NETWORK OF MULTI-FACETED ENTITY DATA - Some aspects include determination of a first set of attribute-value pairs associated with an entity, the first set of attribute-value pairs comprising a plurality of attributes and first values, and respective ones of the first values corresponding to each of the plurality of attributes, determination of a second set of attribute-value pairs associated with the entity, the second set of attribute-value pairs comprising the plurality of attributes and second values, and respective ones of the second values corresponding to each of the plurality of attributes, and generation of index entries, each of the index entries indicating one of the first set or the second set of attribute-value pairs, an attribute of the indicated set of attribute-value pairs, a value of the attribute of the indicated set of attribute-value pairs, and the entity. | 05-06-2010 |
20100114898 | PUBLICATION SERVICES - An apparatus, method, and article of manufacture provide the ability to publish information to an external source as part of an integrated workflow in a computer system. The computer system executes a relational database management system (RDBMS). A publication services processing engine utilizes the RDBMS to publish the information based on a publication node. A publication object defines a collection of information that is published to the external source. A publication action defines a specification of a manner in which the information in the publication object is to be published to the external source. The publication node defines a workflow data process that specifies the publication object and the publication action. | 05-06-2010 |
20100114899 | Method and system for business intelligence analytics on unstructured data - Various embodiments of the present invention disclose a method for Business Intelligence (BI) metrics on unstructured data. Unstructured data is collected from numerous data sources that include unstructured data as ingested data. The ingested data is indexed and represents hyperlink and extracted data and metadata for each document. Thereafter, the ingested data is automatically classified into one or more relevance classes. Further, numerous analytics are performed on the classified data to generate business intelligence metrics that may be presented on an access device operated by a user. | 05-06-2010 |
20100114900 | Apparatus, System and Method for Rapid Cohort Analysis - An apparatus, system, and method for rapid cohort analysis. In one embodiment, the apparatus includes an interface and a processor. The interface may receive an identifier of a first index attribute. The processor may search the database for a first group of records associated with the first index attribute, search the database for a second group of records, each record in the second group of records sharing a common second index attribute with a record in the first group of records, but not associated with the first index attribute, and calculate a statistic in response to information associated with the first group of records and the second group of records. | 05-06-2010 |
20100114901 | COMPUTER-READABLE RECORDING MEDIUM, CONTENT PROVIDING APPARATUS COLLECTING USER-RELATED INFORMATION, CONTENT PROVIDING METHOD, USER-RELATED INFORMATION PROVIDING METHOD AND CONTENT SEARCHING METHOD - Content providing and searching technology provides content capable of collecting user-related information for searching to which a user's intention is reflected, and collects user-related information from user devices that receive the content. By providing user-related information to user devices, it may be possible to process various user-related events. Moreover, since searching to which a user's intention is reflected may be possible, the user may easily search for desired content. | 05-06-2010 |
20100114902 | HIDDEN-WEB TABLE INTERPRETATION, CONCEPTULIZATION AND SEMANTIC ANNOTATION - Indexing hidden web information. First and second web pages are accessed, which include data organized in table format. The tables from the first and second web page are compared. Based on the comparison, a determination is made as to which table cells contain category labels and which contain instance data. The category labels from the first web page are compared to the category labels from the second web page. A general structure of individual tables is inferred based on the act of comparing the category labels. The general structure is chosen from among standard table templates. Data in two or more web pages organized according to the selected table templates is identified. Data from the two or more web pages is stored by associating the table data from two or more web pages to one or more of the selected table templates. | 05-06-2010 |
20100114903 | Internet-based targeted information retrieval system - The invention is a system and process, whereby using source data, facts related to the source data are found primarily on the open internet. These related facts may be found on totally unrelated websites. The facts are indexed to create a high value database of profiles of individuals or organizations. In a specific disclosed embodiment, the facts relate to donations made to non-profit organizations, creating a database highly valuable to those soliciting donations. | 05-06-2010 |
20100114904 | SYSTEM AND METHOD FOR GEOGRAPHICALLY ORGANIZING AND CLASSIFYING BUSINESSES ON THE WORLD-WIDE WEB - A method and search engine for classifying a source publishing a document on a portion of a network, includes steps of electronically receiving a document, based on the document, determining a source which published the document, and assigning a code to the document based on whether data associated with the document published by the source matches with data contained in a database. An intelligent geographic- and business topic-specific resource discovery system facilitates local commerce on the World-Wide Web and also reduces search time by accurately isolating information for end-users. Distinguishing and classifying business pages on the Web by business categories using Standard Industrial Classification (SIC) codes is achieved through an automatic iterative process. | 05-06-2010 |
20100121855 | Lookup Partitioning Storage System and Method - A method, system, and computer-readable medium for accessing and managing an online storage system is disclosed. Access to a storage resource in a multiple server storage system is provided by sending to a lookup partitioning service server a resource identifier that is associated with a storage resource stored in a particular storage partition of a storage server. The LPS returns the looked-up partition that stores the storage resource associated with the resource identifier. Access to the storage resource is then enabled to the looked-up storage partition. | 05-13-2010 |
20100125582 | MUSIC SEARCH METHOD BASED ON QUERYING MUSICAL PIECE INFORMATION - A method for searching music based on music segment information inquiry comprises: a) analyzing certain music or song to obtain music rhythm and note information of any segment, and converting it to digital data as a basis for searching the music or the song after quantification; b) storing indexes of any segment of music rhythm and note information for the music or song in database; c) Take the inquiry requirement as a basis for searching and comparing to find the required music or song. | 05-20-2010 |
20100131512 | System and methods for selective local database access restriction - A nonintrusive database access monitoring mechanism employs a hybrid approach that disallows, or blocks, the access mediums which are not feasible to intercept or analyze, as well as intercepting and analyzing access mediums for which interception and interrogation is available. Accordingly, various configurations provide the hybrid coverage approach to identifying access mediums, and either block or intercept the access attempts. In this manner, access mediums, such as interprocess communication (IPC) system calls, which may be efficiently intercepted and analyzed are captured and substantively processed, while other access mediums that are excessively burdensome or intrusive to capture are unselectively blocked from any communication, avoiding the need to analyze such access attempts. | 05-27-2010 |
20100131513 | PATENT MAPPING - System and method permit patent mapping. A method may comprise maintaining a database of patent portfolios and a database of patents with each patent stored in the database of patents associated with one or more patent portfolios stored in the database of patent portfolios. A search query may be received associated with a first patent portfolio and the first portfolio may be searched as a function of the search query. Search results may be generate which include one or more patent claims associated with the search query. The one or more patent claims may be mapped to a patent concept. | 05-27-2010 |
20100131514 | REAL-TIME AUTOMATIC SEARCHING SYSTEM FOR MEDICAL IMAGE AND METHOD FOR USING THE SAME - A real-time automatic searching method for a medical image is disclosed. The method includes in response to a search command for a first image identifying a difference between a first searchable information and a second searchable information, applying a predetermined set of weights to the first searchable information according to the difference, calculating a weighted index for the first image after applying the predetermined weights to the first searchable information, and locating the first image according to the weighted index. | 05-27-2010 |
20100138423 | CONTENT DISCOVERY USING GATEWAY BROWSING DATA - An internet browsing gateway is utilized to locate mobile web content and improve search capabilities for wireless devices. Mobile browsing data is logged as web pages are requested from and returned to wireless devices associated with the same wireless network. An index server continually updates search indexes in real time as users request searches and search results are returned. As a result, the mobile content that is discovered becomes more relevant for users of the same wireless network or mobile carrier. In other words, the search results are more localized in accordance with the location of the wireless network that services the mobile users within a specific geographic region. When a web search is requested from a wireless device, the index server queries the search indexes. The search results are ranked using a search algorithm that incorporates the browsing data to improve the accuracy and relevance of searches. | 06-03-2010 |
20100138424 | Methods and Apparatus for the Creation and Editing of Media Intended for the Enhancement of Existing Media - Methods and apparatus are provided for simplifying the creation and editing of media intended for the enhancement consumer media. A software-based authoring tool is disclosed that allows users to interact directly with and control a DVD or other media format and/or storage medium, specify media content that is intended to enhance the DVD, and output a program and associated media that would be available to other applications to combine with the DVD media. The authoring tool, identified as QC the Interactive Media Enhancement Editor, allows the author to play the DVD5 stop it at certain desired times or events and then allow the author to specify the enhancing media (identified as Secondary Media) that is to be associated with that time or event. The development of Secondary Media content relies on precise coordination with certain events (such as entering a specific menu or chapter of the DVD) or on certain time events (such as 25 seconds into chapter three of the DVD). | 06-03-2010 |
20100138425 | ENHANCED SEARCH RESULTS - A method includes receiving a search query from a user and generating search results based on the search query. The method may also include providing the search results and information identifying at least one of a telephone number or an address associated with a first one of the search results to the user. The method may further include providing a link to a map associated with at least the first search result to the user. | 06-03-2010 |
20100138426 | INDEX GENERATING SYSTEM, INFORMATION RETRIEVAL SYSTEM, AND INDEX GENERATING METHOD - In an index generating device, a similarity calculating unit calculates access similarities indicating similarities of access histories between documents, based on history information indicating the access histories to the respective documents by users, and a similar document specifying unit specifies a similar document similar to a given document as a retrieval target, based on the access similarities or the like. A retrieval index generating unit generates a retrieval index for the given document as the retrieval target from words appearing in a document set consisting of the similar document and the given document as the retrieval target. | 06-03-2010 |
20100145950 | Realtime Unification Management Information Data Conversion and Monitoring Apparatus and Method for Thereof - Provided are an apparatus and method for converting and monitoring management information data in an integrated manner and in real time. More particularly, the present invention relates to an apparatus and method for converting and monitoring management information data in an integrated manner and in real time which are capable of reducing the number of processes and stably operating an IT infrastructure through integrated management by flexibly collecting and converting management information data having a variety of formats and transmitting it to a destination integrated management solution when existing point-specific management solutions are managed in an integrated manner using an integrated management solution. | 06-10-2010 |
20100153402 | METHODS FOR EFFICIENTLY AND SYSTEMATICALLY SEARCHING STOCK, IMAGE, AND OTHER NON-WORD-BASED DOCUMENTS - One embodiment of a non-word-based information retrieval system includes searching stock or image documents in a huge data source. A non-word-based document is first divided into a series of elements or an array of cells. Each element or cell is matched against a series of predefined token patterns, so that a match will generate a token having a name. The collection of the generated named tokens is a word-based representation of the non-word-based document. After tokens from all documents are collected in a master collection of tokens, the non-word-based documents can be efficiently and systematically searched in a manner analogous to a document search in a word-based search system | 06-17-2010 |
20100161613 | SYSTEM AND METHOD FOR DYNAMICALLY MONETIZING KEYWORD VALUES - A system and method for dynamically monetizing keyword values based on a correlation between one or more indices. The method includes generating a first index for a subject being searched by users via web server, and a second index for the subject based on a corresponding level of popularity. The first index is correlated with the second index to predict a future second index for the subject based on the first index. | 06-24-2010 |
20100161614 | DISTRIBUTED INDEX SYSTEM AND METHOD BASED ON MULTI-LENGTH SIGNATURE FILES - A distributed index system and method based on multi-length signature files are provided. The distributed index system includes a feature vector extracting unit, a high-dimensional index unit, a high-dimensional index managing unit. The feature vector extracting unit extracts N-dimensional feature vectors from multimedia object and identifier. The high-dimensional index unit establishes a tree-based distributed index according to the identifier of the multimedia object and the N-dimensional feature vectors, and determines a signature length by comparing number of leaf nodes of the established distributed index tree and a reference cluster size. The high-dimensional index managing unit generates signatures for each leaf node, on which the determined length is reflected, and stores the generated signatures by matching with the N-dimensional feature vectors. | 06-24-2010 |
20100161615 | INDEX ANAYSIS APPARATUS AND METHOD AND INDEX SEARCH APPARATUS AND METHOD - Provided are an index analysis apparatus and method, and an index search apparatus and method. The index analysis apparatus extracts indexes from digital data, which is included in a disk image of a virtual drive, by using pattern matching, and the index search apparatus receives the extracted indexes and searches the digital data, which includes the received indexes, using a keyword keyed in by a user. Accordingly, the accuracy of digital forensic analysis can be increased, and digital forensic search can be sped up. | 06-24-2010 |
20100161616 | SYSTEMS AND METHODS FOR COUPLING STRUCTURED CONTENT WITH UNSTRUCTURED CONTENT - A method of coupling structured content, such as that found in an enterprise resource planning system, with unstructured content, such as that stored via an electronic content management system, is presented. In the method, mapping information relating at least one type of structured content with indexing data of at least one type of unstructured content is received. The indexing data is configured to facilitate access to the at least one type of unstructured content in a data storage system. The unstructured content is then received, as well as indexing data associated with the unstructured content. Structured content associated with the unstructured content is identified based on the indexing data. The unstructured content is stored in the data storage system. The identified structured content is then linked with the unstructured content stored in the data storage system via the indexing data to allow access to the unstructured content in the data storage system via the identified structured content. | 06-24-2010 |
20100161617 | INDEX SERVER ARCHITECTURE USING TIERED AND SHARDED PHRASE POSTING LISTS - An information retrieval system uses phrases to index, retrieve, organize and describe documents. Phrases are extracted from the document collection. Documents are the indexed according to their included phrases, using phrase posting lists. The phrase posting lists are stored in an cluster of index servers. The phrase posting lists can be tiered into groups, and sharded into partitions. Phrases in a query are identified based on possible phrasifications. A query schedule based on the phrases is created from the phrases, and then optimized to reduce query processing and communication costs. The execution of the query schedule is managed to further reduce or eliminate query processing operations at various ones of the index servers. | 06-24-2010 |
20100169321 | METHOD AND APPARATUS FOR CIPHERTEXT INDEXING AND SEARCHING - The present invention provides a method and apparatus for ciphertext indexing and searching. Indices of multiple levels are created for the encrypted files. Each item in the primary index includes a primary index item identifier and the ciphertext of the primary indexing information of the related file. The primary indexing information each includes an identifier(s) of the related secondary index item identifier(s) and the corresponding decryption information. Each item in the secondary index includes a secondary index item identifier and the ciphertext of the secondary indexing information. Information necessary for obtaining a file is included in the corresponding secondary indexing information. With the decryption information of the secondary indexing information in the decrypted primary indexing information, the ciphertext of the related secondary indexing information is decrypted so as to obtain information such as the decryption key of the file. | 07-01-2010 |
20100174717 | Interative serialisation procedure for structured software objects - This invention concerns an iterative procedure for conversion of structured software objects into a raw data stream and vice versa, providing for their direct transfer using simple communication resources such as those of an embedded computer station, and reset of said software objects or reutilisation of memory space allocated to them. This procedure can be used by an embedded platform ( | 07-08-2010 |
20100174718 | Indexing for Regular Expressions in Text-Centric Applications - A method, system, and article are provided for evaluating regular expressions over large data collections. A general purpose index is built to handle complex regular expressions at the character level. Characters, character classes, and associated metadata are identified and stored in an index of a collection of documents. Given a regular expression, a query is generated based on the contents of the index. This query is executed over the index to identify a set of documents in the collection of documents over which the regular expression can be evaluated. Based upon the query execution, the identified set of documents is returned for evaluation by the regular expression responsive to execution of the query over the index. | 07-08-2010 |
20100174719 | SYSTEM, METHOD, AND PROGRAM PRODUCT FOR PERSONALIZATION OF AN OPEN NETWORK SEARCH ENGINE - A system for personalization of a search engine for a network includes a least one search account. A first data structure stores index data for words each having a number of resources less than a first number. A second data structure stores index data for words each having a number of resources greater than the first number and less than a second number. The second data structure can be personalized for the search account. A third data structure stores index data for words each having a number of resources greater than the second number. The third data structure can be personalized for search account. At least one index includes the first data structure, the second data structure and the third data structure where when the search engine responds to a query from a user of a search account, the search engine uses an index corresponding to the search account. | 07-08-2010 |
20100174720 | COHERENT DATA IDENTIFICATION METHOD AND APPARATUS FOR DATABASE TABLE DEVELOPMENT - A method, apparatus, and system for configuring, designing, and/or implementing database tables is detailed that provides a framework into which a remainder of database tables are developed. Also detailed is a method to develop this framework of database tables. This so developed framework provides a platform for integrating data from multiple databases. A method is also provided for maintaining and managing master data as a single source of reference data to multiple databases that are based upon this framework. | 07-08-2010 |
20100174721 | METHOD AND SYSTEM FOR MULTI-DIMENSIONAL AND GEOGRAPHIC SEARCH - One embodiment of the present invention is directed to a multi-dimensional and geographic-search system that includes a search-region-receiving component that receives coordinates that define a region to be searched, an indexing component that generates an index for each of the received coordinates, a recursive-decomposition component that decomposes the region into sub-regions based on the indices generated by the indexing component, and a search component that selects points of interest from a points-of-interest database by comparing indices computed for the sub-regions to indices computed and stored in the database for each point of interest. Additional embodiments are directed to carrying out efficient, index-based searching for various additional types index-characterized entities. | 07-08-2010 |
20100179953 | INFORMATION PRESENTATION SYSTEM, INFORMATION PRESENTATION METHOD, AND PROGRAM FOR INFORMATION PRESENTATION - An information presentation system comprises a time-series search processing unit | 07-15-2010 |
20100185621 | Systems and methods for performing authorization - Systems, methods, and computer program products are provided for performing an authorization check for a user. In one implementation, a data processing system is provided that includes means for storing first data descriptive of at least one directed acyclic graph and for storing second data descriptive of an assignment of a user to a first node of the graph. The data processing system also includes means for receiving an access request of the user, the access request specifying a second node of the graph. Furthermore, the data processing system includes means for determining a least common ancestor of the first and second nodes of the graph and means for performing an authorization check adapted to grant authorization for the access request if the least common ancestor of the first and second nodes is the first node. | 07-22-2010 |
20100185622 | Attributes of Captured Objects in a Capture System - Regular expressions used for searching for patterns in captured objects can be grouped into attributes. Such attributes can be associated with captured objects using tags stored in a database. In one embodiment, the present invention includes capturing an object being transmitted over a network, and determining that a regular expression appears in the object, the regular expression belonging to a group of one or more regular expressions associated with an attribute. If a regular expression associated with the attribute is found in the object, then an attribute field of a tag containing metadata related to the captured object is set to indicate the presence of the attribute in the captured object. The presence of the attribute in the captured object can now be determined from the tag, which can be stored in a database. | 07-22-2010 |
20100191738 | APPARATUS, SYSTEM, AND METHOD FOR MODIFYING DATA SET NAMES - An apparatus, system, and method are disclosed for modifying data set names. A communication module receives a data set name as an index of a catalog entry for a catalog. The data set name comprises at least two qualifiers. A modification module modifies at least one data set name qualifier using a naming algorithm, wherein an alias qualifier is not modified. A store module stores the catalog entry indexed with the modified data set name in order to optimize the location of that entry within the catalog. | 07-29-2010 |
20100198829 | METHOD AND COMPUTER-PROGRAM PRODUCT FOR RANGED INDEXING - A method for generating and searching a ranged index provides a computer-readable medium which is adapted to store a database including a data chunk, and a ranged index including a data chunk index; generating the data chunk index by determining a high value in the data chunk and a low value in the data chunk; generate the ranged index from such data chunk index; and storing the ranged index on the computer-readable medium. A search value or values may then be provided; comparing the search value or values to the high value and the low value from the data chunk index for the data chunk in the ranged index for the database; and searching the data chunk to determine if the search value or values is lower than or equal to the high value and higher than or equal to the low value. By using inexpensive, quick comparisons of minima and maxima, the method and computer-program product avoids more costly sequential searches of larger data chunks where possible. | 08-05-2010 |
20100198830 | DYNAMIC DATA DISTRIBUTION AGGREGATION - Systems and methods reduce a time to calculate an aggregate of data values associated with a range of index entries. The system manages aggregate values within data stored in the entries of an index during insertion, modification, and deletion operations, and provides new index operations to generate aggregate values associated with a range of index entries. Multiple simultaneous operations can be executed efficiently. | 08-05-2010 |
20100198831 | KNOWLEDGE ANNOTATION RESULT CHECKING METHOD AND SYSTEM - The present invention discloses a knowledge annotation result checking system and method. The knowledge annotation result checking system comprises: a receiving unit for receiving a knowledge annotation result; and a checking unit, for comparing the received knowledge annotation result with information within a tag registration database to obtain comparison information. According to the present invention, inconsistent tag information can be detected. | 08-05-2010 |
20100198832 | DATA PROCESSING APPARATUS AND METHOD OF PROCESSING DATA - Data processing apparatus comprising: a chunk store configured for containing specimen data chunks, a discrete manifest store, configured for containing at least one manifest that represents at least a part of a data set and that comprises at least one reference to at least one of said specimen data chunks, the processing apparatus being operable to process input data into input data chunks and to identify at least one manifest in the manifest store that includes at least one said reference to a said specimen data chunk corresponding to at least one of said input data chunks. | 08-05-2010 |
20100198833 | SYSTEM AND METHOD FOR SOCIAL NETWORKING - A method of social networking including the step of providing a networking system with a website on a web network. The method further includes the steps of receiving from a claimant entering the website an agreement to the terms of service and thereafter receiving the claimant as a member, receiving from the claimant the name of a person claimed and other relevant information, and indexing and storing the claims so as to permit searching by the public and/or members of the networking system. In addition the networking system is constructed to receive facts arising from a dispute proving the claim is invalid or erroneous, and notify the claimant of the dispute in some convenient manner. | 08-05-2010 |
20100198834 | System for Creating and Maintaining a Database of Information Utilizing User Options - A system for automatically creating and maintaining a database of information utilizing user opinions about subjects, particularly exceptional experiences. Described is an Internet system assisting/motivating a population of users interested in information about certain categories of subjects to automatically maintain the database content and to improve the usefulness and quality of the database information without any substantial management by the website owner-manager. The user opinions are primarily in the form of both comments and ratings about which natural-language terms best describe a particular subject, enabling user searches of the subject database to be by way of preferred such descriptive natural-language terms, which terms are further preferred to be evaluative and approving. | 08-05-2010 |
20100205181 | AVERAGE CASE ANALYSIS FOR EFFICIENT SPATIAL DATA STRUCTURES - A computer performed method models a spatial index having n spatial regions defined in a multidimensional space using a tree-based model representing an infinite number of arrangements of n spatial regions in the multidimensional space allowable by the spatial index using a finite number of tree representations, computes an average retrieval complexity measure for content retrieval using the spatial index based on the tree based model, and provides a spatial index recommendation based on the average retrieval complexity measure. In some embodiments a spatial index selection module selects the spatial index based on average retrieval complexity measures for candidate spatial indices that are functionally dependent upon a number of spatial regions to be defined by the spatial index. | 08-12-2010 |
20100211571 | SYSTEMS AND METHODS FOR CLASSIFYING AND TRANSFERRING INFORMATION IN A STORAGE NETWORK - Systems and methods for data classification to facilitate and improve data management within an enterprise are described. The disclosed systems and methods evaluate and define data management operations based on data characteristics rather than data location, among other things. Also provided are methods for generating a data structure of metadata that describes system data and storage operations. This data structure may be consulted to determine changes in system data rather than scanning the data files themselves. | 08-19-2010 |
20100217766 | Mapping Courses to Program Competencies - A mapping device configured to map accreditation data to curriculum data. A mapping device may include an accreditation module, a curriculum module and/or a mapping module. An accreditation module may be configured to retrieve accreditation data. A curriculum module may be configured to retrieve curriculum data. A mapping module may be configured to map one or more competencies and/or one or more accreditation content areas to one or more course contents and/or one or more course objectives. A mapping device may include an analytical module, which may be configured to identify deficiencies, and/or an alignment module, which may be configured to address one or more deficiencies. A mapping device may be configured to employ a linkage template, which may include an academic level, and/or may employ a leveling rubric, which may be multi-leveled. A mapping module may be configured to generate an output graph, which may implement weights. | 08-26-2010 |
20100217767 | SYSTEM AND METHOD FOR ANALYZING REMOTE TRAFFIC DATA IN A DISTRIBUTED COMPUTING ENVIRONMENT - A system, method and storage medium embodying computer-readable code for analyzing traffic data in a distributed computing environment are described. The distributed computing environment includes a plurality of interconnected systems operatively coupled to a server, a source of traffic data hits and one or more results tables categorized by an associated data type. Each results table includes a plurality of records. The server is configured to exchange data packets with each interconnected system. Each traffic data hit corresponds to a data packet exchanged between the server and one such interconnected system. Each traffic data hit is collected from the traffic data hits source as access information into one such record in at least one results table according to the data type associated with the one such results table. Each of the records in the results table corresponds to a different type of access information for the data type associated with the results table. The access information collected into the results tables during a time slice is summarized periodically into analysis results. The time slice corresponds to a discrete reporting period. The access information is analyzed from the results tables in the analysis results to form analysis summaries according to the data types associated with the results tables. | 08-26-2010 |
20100228734 | MECHANISM FOR EFFICIENTLY SEARCHING XML DOCUMENT COLLECTIONS - The techniques presented herein are directed towards providing a user-directed keyword-based search on a large collection of XML documents, and displaying a summary of results to the user. Prior to receiving search requests from a user, an offline analysis of a large collection of XML documents is performed to construct an inverted index of keywords. For each keyword, the index stores a set of location indicators that identify all the instances of the keyword found in the collection of documents. A location indicator may comprise a document identifier, an indication of the position of the node in the hierarchy of nodes within the XML document containing the keyword, and an indication of the pathname of the node containing the keyword. Once the index is constructed, keyword searching can be done efficiently by a keyword lookup in the index. Various display strategies enable the user to see the specific portion of a large XML document containing the keyword and/or path frequency information allowing the user to easily refine the search to specific paths within the collection of documents. | 09-09-2010 |
20100235359 | FILE RETRIEVAL METHOD AND DEVICE AND TIME STREAM FILE PROCESSOR - A file retrieval method of a file allocation table (FAT) system is provided, which includes steps of: A: acquiring cluster numbers of all clusters of a file; B: sorting the cluster numbers to create a cluster number index table; and C: looking up the cluster number index table to obtain a cluster number of a file cluster to be retrieved, and acquiring a content of the file cluster to be retrieved from a physical storage address corresponding to the cluster number. A file retrieval device is further provided, which includes a central processing unit (CPU) and an address index storage unit. The address index storage unit stores a cluster number index table of sorted cluster numbers of all clusters of a file. The CPU is adapted to obtain a cluster number of a file cluster to be retrieved according to the cluster number index table during retrieving and acquire a content of the file cluster to be retrieved from a physical storage address corresponding to the cluster number. A time stream file processor is further provided. Therefore, a file retrieving speed is enhanced. | 09-16-2010 |
20100235360 | SYNCHRONIZED RELAY MESSAGING AND COORDINATED NETWORK PROCESSING USING SNMP - Methods, devices, and systems are provided for utilizing asynchronous messaging protocols, such as the Simple Network Management Protocol (SNMP), to implement relay messaging through a network, gather data from and/or set parameters on a plurality of managed devices in a network through the issuance and transmission of a single SNMP message, and/or cause multiple managed devices to perform cooperative and coordinated computations through the issuance of a single SNMP message. | 09-16-2010 |
20100241628 | User generated image facades on virtual 3D structures displays on mobile platforms for mobile users - The present invention relates to the creation of user's generated elements for 3D virtual reality scenery for navigating users on computing devices. The present invention allows for an efficient gathering of content such as images of structures such as buildings external and/or internal facades by the users computing devices and to provide a means to individuals and/or entities to deliver possibly timely actual image and/or images of structures such as buildings external and/or internal facades to end users who are traveling with computing devices by vehicle and/or by foot in an urban and/or suburban region and/or inside structures such as buildings private and/or public areas such as for example office building, shopping malls and/or sport areas. | 09-23-2010 |
20100241629 | System and Methods for Database Distribution and Querying over Key-based Scalable Storage - A system includes one or more application nodes sending SQL queries to one or more query execution nodes. Each query execution node translates the SQL queries into key-based requests to the storage engine, which comprises one or more storage nodes. The system provides an abstraction layer with an external schema and an internal schema, maps the external schema into the internal schema; and generates one or more microshards for the storage node. | 09-23-2010 |
20100241630 | Methods for indexing and retrieving information - A preferred method for providing an indexing methodology, an index table and method for retrieving information are disclosed. In a preferred method, an association between a plurality of word elements from a first data corpus such as a query is identified through an identification information such as a number. Then, a preferred index table comprising additional information for identifying the association information such as numbers between the word elements and/or multiple word elements of other or target data corpuses such as a data source is also disclosed, which in conjunction lead to retrieval of irrelevance-free information. | 09-23-2010 |
20100241631 | Methods for indexing and retrieving information - A preferred method for providing an indexing methodology, an index table and method for retrieving information are disclosed. In a preferred method, an association between a plurality of word elements from a first data corpus such as a query is identified. Then, a preferred index table comprising additional information for identifying the associations between the several word elements of other data corpuses such as a data source is also disclosed, which in conjunction lead to retrieval of irrelevance-free information. | 09-23-2010 |
20100250548 | INFORMATION TERMINAL EQUIPPED WITH CONTENT SEARCH SYSTEM - An information terminal equipped with a content search system is provided. The information terminal receives a query from a user, analyzes the query, determines a search type on the basis of the result of the analysis, searches for content items corresponding to the query according to the search type, and displays the content items, wherein the information terminal receives a content item selected by the user from the displayed content items, and reads and provides content corresponding to the content item from the content database. Therefore, the information terminal organically combines an information search technique with a content providing technique so that a user can quickly and conveniently search for his or her desired content. | 09-30-2010 |
20100250549 | STORING A VARIABLE NUMBER OF INSTANCES OF DATA OBJECTS - Described are systems and methods for storing a variable number of instances of data objects (e.g., 1, 2, 3, or up to N-1 instances, where N is the number of instances of the data object included in primary data) in secondary storage across a data storage network. In some examples, a system for storing a variable number of instances of data objects includes, one or more computing devices storing a set of data objects and multiple storage devices distinct from the one or more computing devices. Each of the multiple storage devices is configured to store at least a single instance of a data object. The system also includes a database configured to store information associated with the data objects. This information includes substantially unique identifiers for the data objects and, for each of the data objects, a number of instances of the data object stored on the multiple storage devices. | 09-30-2010 |
20100250550 | SYSTEM, METHOD, AND DATA STRUCTURE FOR PROVIDING ACCESS TO INTERRELATED SOURCES OF INFORMATION - A system, method, and data structure for storing and accessing interrelated data pertaining to a given subject is disclosed. The system includes a user module and a service module, wherein said service module contains a data storage component. The system facilitates reviewing sources of information that relate to particular subject. Where there sources of information can or must be understood in the context of other sources of information, the present invention also comprises a data structure and method of populating said data structure that facilitates searching and access to any related sources of information. The disclosed invention includes embodiments wherein the sources of information can consist of many different formats. Whereas many user modules can access the service module from any location, searching and accessing of any of said information sources can be reviewed in their full context, as defined by the related sources of information, from any location that has access to the communication medium connecting the user and service modules. | 09-30-2010 |
20100250551 | XML DATA PROCESSING SYSTEM, DATA PROCESSING METHOD AND XML DATA PROCESSING CONTROL PROGRAM USED FOR THE SYSTEM - XML analyzer | 09-30-2010 |
20100250552 | INDEXING DOCUMENTS ACCORDING TO GEOGRAPHICAL RELEVANCE - A local search engine efficiently indexes documents relevant to a geographical area by indexing, for each document, multiple location identifiers that collectively define an aggregate geographic region. When creating the index, the search engine may determine a set of geographical areas surrounding a geographical area relevant to a document and associate references to the set of geographical areas with the document index. | 09-30-2010 |
20100262606 | Method for Scoring Content of Nodes in a Database - The following disclosure contains a method and system for establishing, maintaining, reporting and presenting data regarding the scoring of content and entities, specifically levels of veracity in information or content and the credibility of an evaluating entity or entities that communally determine this veracity. An aspect of the invention permits reporting on an active node, file or files with an associated communally derived veracity score. Scores can be filtered contextually allowing for veracity scores to reflect specific communal or contextual values which are likely to vary from general scores. Scores are generated through a weighted system of consumption, verifications and disputes. The weighted system is comprised of a communally derived credibility scores of each evaluating entity within the system. Evaluating entities are awarded credibility scores through communal verifications of authored public content, referential treatment to this public content as well as a demonstrated awareness of overall existing content. | 10-14-2010 |
20100268715 | INDEX DATABASE CREATING APPARATUS AND INDEX DATABASE RETRIEVING APPARATUS - An index database creating apparatus includes a feature extracting unit for extracting features from a content; an index creating unit for creating feature indices of the extracted features; an index selector for selecting one or more of the feature indices based on frequency-based importance levels thereof; and a feature index database for storing therein the selected feature indices along with locations thereof in the content. The index selector obtains the frequency-based importance levels based on frequency distributions of the feature indices in the content and in other contents to be stored in the feature index database. | 10-21-2010 |
20100268716 | SYSTEM AND METHODS OF PROVIDING INTERACTIVE EXPERTIZED COMMUNICATIONS RESPONSES USING SINGLE AND MULTI-CHANNEL SITE-SPECIFIC INTEGRATION - A real-time search system enables askers to identify and submit questions to topically and skill level relevant potential answerers. A computer server receives and analyzes short text questions, determines a corresponding set of informational facets semantically and topically characterizing the question. The informational facets are evaluated against a database index of informational facets identified from prior analyzed messages correlated by profile identifiers of message originators to provide an identification of a plurality of potential answerers. The question is distributed to the plurality of potential answerers and ensuing message conversations between the asker responsive answerers are monitored for quality and sufficiency of response. The stored profiles of responsive answerers are updated to reflect the occurrence and quality of response. | 10-21-2010 |
20100281028 | SYSTEM AND METHOD FOR INDEXING A DATA STREAM - There are provided methods, computer program products, and systems for indexing a data stream. A method for indexing a data stream having attribute values includes the steps of parsing the data stream, and forming an index of tuples for a subset of attribute values of the data stream. The index is configured for retrieving the top-K tuples that optimize linearly weighted sums of at least some of the attribute values in the subset. | 11-04-2010 |
20100281029 | RECOMMENDATIONS BASED ON BRANDING - A method and a system for providing recommendations based on branding are disclosed. For example, a brand preference corresponding to a first brand and a first category may be identified based on user activity. A recommendation is provided to the user based on the brand preference. The recommendation may be provided based on a predetermined brand relationship comprising the first brand associated with the first category, a second brand associated with a second category, and a recommendation score between the first and second categories and brands. The recommendation may provided by accessing a relationships database to determine at least one brand relationship of the brand relationships corresponding to the brand preference. | 11-04-2010 |
20100281030 | DOCUMENT MANAGEMENT & RETRIEVAL SYSTEM AND DOCUMENT MANAGEMENT & RETRIEVAL METHOD - A document management & retrieval system is configured to: store, for each word in a set of words, appearance positions of the each word in a set of documents as a word index; store, for each tag in a set of tags attached to words, a set of words that appear to a right and left of the each tag, and also store, as a tag LR index, appearance positions of the each tag in a set of documents with a combination of the each tag and a word appearing to its right or a combination of the each tag and a word appearing to its left as a key; and, in a tag search where a query phrase contains words and a tag next to each other, refer to the index with a tag and the word to the right or left of the tag as a key, thereby reducing the size of a document list to be read without needing to have a tag name as a secondary key. A tag is updated by just updating two places in the tag LR index. | 11-04-2010 |
20100281031 | METHODS AND SYSTEM FOR INFORMATION STORAGE ENABLING FAST INFORMATION RETRIEVAL - Methods and apparatus for judicially storing information to enable fast information retrieval are disclosed. The information is organized in information fields each field having a respective set of information elements. Cells of information elements are defined and corresponding information records are cached. The cells may be user defined or formed based on affinity coefficients of pairs of information elements. With a large number of fields, each having a large number of information elements, cells are generated recursively. Each cell is associated with at least one pre-defined query. | 11-04-2010 |
20100281032 | INDEX COMPRESSION - Systems and methods for compressing an index are described. In one exemplary method, the results of a search are annotated and then encoded into one or more chunks of compressed data in accordance with the annotations of the results. The annotations include an indication of a best encoding method selected from a set of available encoding methods, and an indication of whether to switch to a new chunk during encoding or to continue encoding in the current chunk. Other methods are described and data processing systems and machine readable media are also described. | 11-04-2010 |
20100287164 | INSTALLATION FOR MANAGING A DATABASE - An installation for managing a database, the database having a database filter to provide the data sought in response to a query defining the filter and at least one data hierarchy of at least three hierarchical levels. The installation includes an interrogator to provide at least one query defining a filter to select data sought by using the database filter. The installation includes a representation of a hierarchy table that includes, for each hierarchised datum, the data on which it is dependent, regardless of the number of hierarchical levels separating them, the data being contained in a separate record of the table which also contains the hierarchised datum. During an interrogation on the basis of a query pertaining to hierarchised data, the database filter provides a response to the query formulated on the basis of the hierarchy table without resorting to external interrogation tools. | 11-11-2010 |
20100287165 | INDEXING A REFERENCE SEQUENCE FOR OLIGOMER SEQUENCE MAPPING - Generating an index includes receiving a reference sequence and applying one or more key patterns to the reference sequence to obtain a plurality of keys in the index. Each of the one or more key patterns is derived based on a corresponding set of oligomer sequence relationships of a plurality of oligomer sequences that are expected to be generated from the reference, and the keys correspond to a plurality of candidate and/or validated locations in the reference sequence. | 11-11-2010 |
20100287166 | Method and system for search engine indexing and searching using the index - Data indexing includes receiving data from a data source; classifying the data into one of a plurality of categories according to a predetermined data classification criteria; establishing a corresponding relationship between the data and an index associated with the data, the index having a preset maximum capacity; and recording the relationship between the data and the index. The index is one of a plurality of indices, and each of the plurality of indices is exclusively written by an index writing device. | 11-11-2010 |
20100293167 | BIOLOGICAL DATABASE INDEX AND QUERY SEARCHING - Methods and system for biological database indexing and query searching are described. In one embodiment, one or more words may be extracted from a biological sequence using a spacer. The spacer may be one or more characters within the biological sequence. The word and a position of the word within the biological sequence may be stored in a sequence index associated with the spacer. The sequence index may be capable of being used for an operation associated with the biological sequence. | 11-18-2010 |
20100299331 | ACCESSING DEEP WEB INFORMATION ASSOCIATED WITH TRANSPORTATION SERVICES USING A SEARCH ENGINE - Methods, apparatuses, and articles for receiving a search request associated with a transportation service from a client device, the search request including a plurality of search criteria, are described herein. Additionally, the methods, apparatuses, and articles further return to the client device an answer page having a plurality of answers potentially associated with the transportation service, the plurality of answers identifying a plurality of information locations having information potentially associated with at least a one of the plurality of search criteria, where at least one of the answers includes at least one input field of a query answer page for entry of at least one feature of the transportation service, the query answer page to be dynamically generated by one of the information locations in response to a query. | 11-25-2010 |
20100299332 | METHOD AND SYSTEM OF INDEXING NUMERICAL DATA - The present invention provides a computer-implemented method for indexing numerical information embedded in one or more electronic files. The method comprises determining whether an electronic file comprises one or more images containing embedded numerical data, including the steps of inputting the one or more images into a classification system comprising a plurality of interconnected classifiers; and classifying the one or more images using the classification system to output data classifying each image. The output data classifies each image as one of: containing embedded numerical data or not containing embedded numerical data. The method further comprises analysing the file to output data classifying it as one of: containing tabulated numerical data or not containing tabulated numerical data. If the outputted data indicates that the file comprises one or more images with embedded numerical data and/or contains tabulated numerical data, and the method further comprises extracting text and/or other data associated with the numerical data and indexing this text and/or other data in a database. | 11-25-2010 |
20100306200 | Mobile Image Search and Indexing System and Method - A computer-implemented system and method are described for image searching and image indexing that may be incorporated in a mobile device that is part of an object identification system. A computer-implemented system and method relating to a MISIS client and MISIS server that may be associated with mobile pointing and identification system for the searching and indexing of objects in in situ images in geographic space taken from the perspective of a system user located near the surface of the Earth including horizontal, oblique, and airborne perspectives. | 12-02-2010 |
20100306201 | NEIGHBOR SEARCHING APPARATUS - To provide a neighbor searching apparatus that can select an index suitable for each search target. A neighbor searching apparatus has: a storage part that stores a meta table containing index-dependent meta data associated with a data structure of each index; a database managing part that searches for an index associated with an instruction when receiving the instruction from a user and makes an indexing part perform a processing associated with the instruction using the index-dependent meta data associated with the index; and the indexing part that performs the processing associated with the instruction using the index-dependent meta data based on the instruction from the managing database part. | 12-02-2010 |
20100306202 | File Format Converting Method - A file format converting method for converting a first format file in a first file system of a storage apparatus to a second format file in a second file system is provided. The method includes creating a temporary file and defining a data amount of the temporary file as a first predetermined value; defining a start position of the temporary file to be the same as that of the first format file; and defining the data amount of the temporary file to be the same as that of the first format file to generate the second format file. | 12-02-2010 |
20100306203 | SYSTEMATIC PRESENTATION OF THE CONTENTS OF ONE OR MORE DOCUMENTS - Disclosed herein, in certain embodiments, is a method of systematically presenting the contents of at least one document, comprising: (a) a user providing an electronic version of at least one document to a computer; (b) a user accepting or modifying noise words generated by a computer module; (c) generating a list of every non-noise word by means of a computer module wherein the list indicates every page on which a non-noise word appears; and (d) displaying the entire list of non-noise words. In some embodiments, the list of non-noise words further indicates the number of times a word occurs on a page. In some embodiments, the list of non-noise words further indicates each line on which a non-noise word appears. | 12-02-2010 |
20100312770 | System and method for computer-assisted manual and automatic logging of time-based media - A customizable logging and content management system for indexing multimedia, including a synchronized timer object that provides a time reference upon request in connection with the media, and a logger object that logs predefined events that occur in the media by associating the events with respective time references from the timer object. A video server is provided that captures and digitally stores events logged by the logging application as media segments, and a search and retrieval engine is provided that enables the media segments to be located, retrieved and viewed based on the indexes. The system includes a graphical user interface generator that enables customized user interfaces and logging databases to be created from database tables for use in the logging application. | 12-09-2010 |
20100318518 | CONTENTS MANAGING METHOD AND APPARATUS - A method of managing contents is provided. Contents are searched from one or more indexing files that manage contents stored in one or more devices and displayed in the form of a list. The contents may be displayed along with devices storing the contents. If the contents are stored in an external device, a connecting state of the external device may be displayed. When contents requested to be reproduced by a user is stored in an unconnected external device, a message that requests the user to connect the unconnected external device may be displayed. When an external device is connected, the indexing file of the external device is searched and the searched indexing file is synchronized with indexing file managed in the device. Accordingly, the user can easily detect the device storing contents that the user desires and selectively reproduce the contents with efficiency. | 12-16-2010 |
20100332479 | PERFORMING DATA STORAGE OPERATIONS IN A CLOUD STORAGE ENVIRONMENT, INCLUDING SEARCHING, ENCRYPTION AND INDEXING - Systems and methods are disclosed for performing data storage operations, including content-indexing, containerized deduplication, and policy-driven storage, within a cloud environment. The systems support a variety of clients and cloud storage sites that may connect to the system in a cloud environment that requires data transfer over wide area networks, such as the Internet, which may have appreciable latency and/or packet loss, using various network protocols, including HTTP and FTP. Methods are disclosed for content indexing data stored within a cloud environment to facilitate later searching, including collaborative searching. Methods are also disclosed for performing containerized deduplication to reduce the strain on a system namespace, effectuate cost savings, etc. Methods are disclosed for identifying suitable storage locations, including suitable cloud storage sites, for data files subject to a storage policy. Further, systems and methods for providing a cloud gateway and a scalable data object store within a cloud environment are disclosed, along with other features. | 12-30-2010 |
20100332480 | SYSTEMS, METHODS AND SOFTWARE FOR DISTRIBUTED LOADING OF DATABASES - The invention relates to an online information retrieval system having a queue for storing load requests and a set of two or more load managers for retrieving data from the queue and indexing documents based on the request retrieved from the queue. Each load manager resides in a different geographical location. A set of candidate documents comprise a unique identifier and a version indicator, wherein the unique identifier for each candidate document is identical for a given document and the version indicator is associated with a determination of which document within the set of candidate documents shall ultimately be communicated to a user. | 12-30-2010 |
20110004597 | SYSTEM AND METHOD FOR CONTENT AWARE ROUTING OF DATA WITHIN A PLATFORM HUB - A method, system and platform hub for content aware routing of data, the platform hub comprising: a processor port for bidirectional data communication with a processing platform; at least one network ports for bidirectional data communication with at least one corresponding network agents; and at least one content aware units configured for performing content aware classification of data incoming into the platform hub, wherein the platform hub is configured for routing data to a suitable destination based on the content aware classification. The method for content aware routing of data comprises: receiving in a platform hub a data packet sent from a network towards a processing platform; performing a content aware classification in the platform hub; and routing the data packet based on the content aware classification. | 01-06-2011 |
20110004598 | SERVICE RESPONSE PERFORMANCE ANALYZING DEVICE, METHOD, PROGRAM, AND RECORDING MEDIUM CONTAINING THE PROGRAM - A device for analyzing service response performance, a method and a program which can take action with high immediacy according to the communication state between a service provider's terminal and a service requester's terminal, and a recording medium containing the program are provided. | 01-06-2011 |
20110010369 | METHOD, SYSTEM AND PROGRAM FOR INFORMATION RE-ORGANIZATION - An information re-organization system includes a plurality of counters coordinated to meaning attributes, and a re-organization incentive notification unit that updates, in case the information stored in preset storage unit has been updated, value of a counter out of the multiple counters that has the meaning attribute associated with contents updated. The information re-organization system also includes an information re-organization processor that executes, in case the value of the counter section updated has met one of a number of predetermined conditions for information re-organization, a processing for information re-organization corresponding to the condition for information re-organization on the information stored in the preset storage unit. | 01-13-2011 |
20110022599 | SCALABLE INDEXING FOR LAYOUT BASED DOCUMENT RETRIEVAL AND RANKING - A computer-based method and a system for indexing, querying, and ranking documents based on layout are provided. The method includes providing a plurality of documents to computer memory, extracting layout blocks from the provided documents, clustering the layout blocks into a plurality of layout block clusters, computing a representative block for each of the layout block clusters, generating a document index for each provided document based on the layout blocks of the document and the computed representatives blocks, clustering the created document indexes into a plurality of document index clusters, and generating a representative cluster index for each of the document index clusters. The indexes generated, together with the representative blocks and document index clusters, can be stored and used for retrieval of documents responsive to a layout query. | 01-27-2011 |
20110029538 | SYSTEM FOR CREATION OF CONTENT WITH CORRELATED GEOSPATIAL AND VIRTUAL LOCATIONS BY MOBILE DEVICE USERS - A system and method for a user of a mobile device or computer to easily and directly create content with correlated virtual and geospatial locations and associated context information. The content may include audio, video, and images, and be exposed via a unique URL based on content identifier. This content is indexed, including using a geospatial, “place-based” index, enabling users to easily share content with their matrix of social networks, friends, and communities, or to discover content created by others. | 02-03-2011 |
20110029539 | METADATA AS COMMENTS FOR SEARCH PROBLEM DETERMINATION AND ANALYSIS - Techniques for using metadata as comments to assist with search problem determination and analysis are provided. Before an action is taken on a search, contextual information is gathered as metadata about the action and actor requesting the action. The metadata is embedded in the search as comments and the comments are subsequently logged when the action is performed on the search. The comments combine with other comments previously recorded to permit subsequent analysis on searches. | 02-03-2011 |
20110035382 | Associating Information with Media Content - Information is associated with media content. Upon an upload of a portion of the media content, a media fingerprint is derived therefrom. The media fingerprint includes a unique representation of the media content portion, which is derived from a characteristic component of the media content portion. The information is associated with the media content portion based on the derived media fingerprint. Upon streaming the media content portion, the associated information is linked and presented in real time with the media content portion. Upon upload of the media content portion, the media fingerprint may be derived therefrom at upload time or at any time subsequent to upload time and prior to presentation time. The media content may comprise an original instance of content or a derivative instance of the original content. | 02-10-2011 |
20110040765 | Using The Z/OS Load Module System Status Index To Distinguish Product Tag Files - A method, system, and computer-usable medium are disclosed for managing software product information in a z/OS environment. Product information associated with a software product is determined and then processed to generate product tag information. A System Status Index (SSI) attribute is generated with a unique identifier signifying that the SSI attribute comprises SSI product tag information. The SSI product tag information is then associated with the SSI attribute with the unique identifier. The z/OS environment is then scanned for SSI attributes. Those SSI attributes comprising a unique identifier are processed to read their associated SSI product tag information. The product tag information is then processed to generate software product information, which is then stored in a repository of software product information. | 02-17-2011 |
20110047163 | Relevance-Based Image Selection - A system, computer readable storage medium, and computer-implemented method presents video search results responsive to a user keyword query. The video hosting system uses a machine learning process to learn a feature-keyword model associating features of media content from a labeled training dataset with keywords descriptive of their content. The system uses the learned model to provide video search results relevant to a keyword query based on features found in the videos. Furthermore, the system determines and presents one or more thumbnail images representative of the video using the learned model. | 02-24-2011 |
20110047164 | Universal Data Relationship Inference Engine - While a user is viewing content on a computer display, the universal data relationship inference engine presents related information from disparate data sources. A normalized index is maintained that indexes content to a set of standard taxonomy terms. The inference engine parses content being viewed by the user. If the content includes tags for some of the standard taxonomy terms, then the system may provide the user with the ability to view the related content that is indexed by the normalized index. If there are not taxonomy tags then the system may attempt to recognize non-standard taxonomy terms in the content in order to provide the user with related content. The inference engine may also identify related content by identifying synonyms to the taxonomy terms. | 02-24-2011 |
20110055217 | SYSTEM TO MODIFY WEBSITES FOR ORGANIC SEARCH OPTIMIZATION - A system is configured to modify a website to optimize the website for an organic search of a topic. The system includes a data management module configured to receive search terms for the topic, receive related terms to the search terms for the topic, and receive website input data. The system also includes a scanning module configured to scan and index web pages and web objects in the website and identify a subset of web pages and web objects associated with the search terms and the related terms. The system also includes an optimizing module configured to process the subset of web pages and web objects from the website by prioritizing the subset of web pages and web objects or measuring a metric with respect to the subset of web pages and web objects. The search terms and related terms are deployed into the subset of web pages and web objects on the website based on the processing. | 03-03-2011 |
20110055218 | ACCESSING CONTENT IN A NETWORK - A computer-implemented method, a computer program product and a computer system assist a user to access a content item in a network such as a home network, the method comprises gathering capability information from a device, such as a digital music player in the network, wherein the device includes a directory of content items, wherein the directory includes a first container and a second container, wherein at least one of the plurality of content items is organized in the first container, and wherein at least one of the plurality of content items is organized in the second container; providing an index referring to the content items, the first container, and the second container; determining whether the directory has been modified; performing a content item search if the directory has been modified; and updating the index based on the results of the content item search. | 03-03-2011 |
20110055219 | DATABASE MANAGEMENT DEVICE AND METHOD - A database system includes a database device for managing a database including data records, each data record including identification information and an attribute value, index devices for managing an index database including index records, each index record including search information determined by the attribute value and the identification information, each index devices managing each of parts of the index database, and a management device executing a process including detecting an update frequency of the attribute value stored in the database, generating search information by combining the attribute values, each attribute values having the update frequency higher than a predetermined frequency, generating the index record including the generated search information and the identification information associated with the combined attribute values included in the generated search condition information, and determining one of the plurality of index devices for managing the generated index record. | 03-03-2011 |
20110060741 | DISTRIBUTION AND USAGE OF MEDIA BUNDLES - Media bundles described herein can associate digital media assets so as to be able to provide media presentations involving the digital media assets at a client device. A media bundle can be used on a client device to support media presentation of digital media assets at the client device. The media presentation can provide a multimedia experience for the user of the client device. Conversion data can be provided to the client device to facilitate mapping from provider identifiers within the media bundle to identifiers used by a media distribution system. | 03-10-2011 |
20110060742 | Digital Media Bundles for Media Presentation Playback - Media bundles that can be submitted, distributed and utilized to facilitate media presentations at client devices. Each media presentation is typically associated with a digital media asset. The media bundles can include or identify various digital media assets for use in the media presentations and can provide computer instructions for controlling the media presentations. The resulting media presentations can provide multimedia experiences for users of client devices. | 03-10-2011 |
20110060743 | Locating and Retrieving Data Content Stored in a Compressed Digital Format - A method and apparatus is provided for locating and retrieving specified data content in a database. The data comprises compressed digital audio or video data files associated with the recorded speech. Retrieval of the specified content requires decompression of only a portion of the compressed data. A method for locating specified content of the above type is provided. A compressed audio file comprising recorded speech is converted into a corresponding text file. A searchable index is constructed from the text file. One or more specified search arguments are used to search respective elements of the searchable index in order to detect one or more text segments. The identifiers of respective detected segments are then used to locate the specified content in the audio file. Only portions of the audio file that contain specified content require decompression, in order to retrieve the content. | 03-10-2011 |
20110066620 | Automated Boolean Expression Generation for Computerized Search and Indexing - A computer implemented method of indexing a plurality of responses for later retrieval and presentation to a user in response to queries, includes, for each of the plurality of responses, receiving at least one representative query for that response. The representative query(s) represent text (e.g. natural language) query(s) to be input by an end user searching for information addressed by that response. Each representative query is parsed into terms. The terms are analyzed to determine which of these terms are more likely to uniquely identify queries for the particular response among terms in representative queries for all indexed responses. Boolean expression(s) satisfied by a text query containing one of the terms determined to more likely uniquely identify that response, and another one of the parsed terms are formed. | 03-17-2011 |
20110066621 | SYSTEM AND METHOD FOR MODELING AND ANALYZING COMPLEX SCENARIOS - An embodiment of the present invention includes a method for analyzing and solving possibility tree. A possibility tree having a plurality of programmable nodes is constructed and solved with a solver module executed by a processor element. The solver module executes the programming of said nodes, and tracks the state of at least a variable through a branch. When a variable of said branch is out of tolerance with a parameter, the solver disables remaining nodes of the branch and marks the branch as an invalid solution. The valid solutions are then aggregated and displayed as valid tree solutions. | 03-17-2011 |
20110066622 | PRODUCT LINE EXTRACTION - Methods, systems and computer readable media for extracting product lines from a plurality of product titles are provided. In one embodiment, the plurality of product titles are broken into tokens. Association rules are calculated for individual tokens and pairs of tokens. Brand specific terms and product class specific terms within the product titles are identified. In one embodiment, a token tree is used to identify product lines within the list of product titles using the association rules, the brand specific terms, and the product class specific terms. | 03-17-2011 |
20110072023 | Detect, Index, and Retrieve Term-Group Attributes for Network Search - In one embodiment, concept tag a network document comprising document words based on a set of document concepts, each of the document words being indexed with its position within the network document, such that for each of the document words, if the document word represents one of the document concepts, index a document concept tag corresponding to the one document concept with the position of the document word within the network document. Concept tag a search query based on a set of query concepts by associating appropriate query concept tags with selected query words. For each of the query words associated with the query concept tags, determine zero or more first positions within the network document at which the document words match the query word or its synonym and zero or more second positions within the network document at which the document concept tags correspond to the query concept tag. | 03-24-2011 |
20110078148 | SYSTEM, METHOD AND APPARATUS FOR INFORMATION RETRIEVAL AND DATA REPRESENTATION - A system, method, and apparatus for information retrieval are provided. Embodiments of the present invention may generate data structures that may be used to process user queries. According to embodiments of the present invention, a processor component configured to perform the operations of an indexing module and a storage module, the indexing module configured to generate a term list and a term-file matrix from information stored on the storage module, the indexing module further configured to generate an adjacency matrix from the one or more files, wherein the adjacency matrix represents a relationship of the one or more terms in each of the one or more files; and the indexing module further configured to generate a probability matrix using the adjacency matrix and a one-step or two-step random walk. | 03-31-2011 |
20110078149 | Management of Access to Data Distributed Across Multiple Computing Devices - Methods, systems and apparatus for a distributed data environment in which data can be seamlessly accessed from remote computing devices are disclosed. For a given user having multiple computing devices, a personal distributed data bank can be formed from data storage capacities resident at the multiple computing device. The given user can access data on any of the multiple computing devices without knowing on which of the multiple computing devices the data resides. In one embodiment, an index for the user's data within the personal distributed data bank can be provided to facilitate searching, browsing and accessing of user data by the given user. | 03-31-2011 |
20110078150 | Intelligent Sorting and Correlation of Email Traffic - A method comprises receiving a plurality of emails. Each of the plurality of emails comprises message information and header information. The header information comprises subject information and one or more email addresses. The method further comprises generating filtered email data comprising email addresses by at least filtering message information and subject information from each of the plurality of emails. The method further comprises correlating sender information and recipient information in the filtered email data, the correlation based at least in part on the one or more email addresses. The method further comprises storing in memory one or more email addresses from the filtered email data. | 03-31-2011 |
20110082864 | Method for generating indexes for downloading data - A method for bi-directional data replication between a Web Service application and a relational database are provided. In an electronic communications network, a computer implemented method is provided for generating indexes of relationship fields by accessing the metadata of a remote web service. A plurality of records may be downloaded from the remote web service to a computer. The plurality of records may include metadata related to the identity of data fields involved in parent-child record relationships. An index may be built at least partially on the data fields involved in parent-child relationships. | 04-07-2011 |
20110082865 | Method for Digitally Labelling Websites - A host website, method and system for digitally labelling websites or subscribers with digital labels which represent characteristics or qualities of the website is disclosed. Digital labels are stored on host websites, each of which is directed to a specific subject or activity. The host website can also provide digital labels for storage in other host websites. The host website is provided with artificial intelligence capabilities. Methods and systems for asymmetric access in a host website database are disclosed. | 04-07-2011 |
20110087670 | SYSTEMS AND METHODS FOR CONCEPT MAPPING - Concepts relevant to natural language content may be identified using an ontology. The natural language content may be tokenized and normalized. Using the tokenized content, one or more candidate concepts within the ontology may be identified. Concepts relevant to the natural language content may be selected using the identified concepts and the relationships between concepts defined within the ontology. A spreading activation process may be used to identify related concepts. The spreading activation process may be iterative and/or may reach multiple generations of connected concepts within the ontology. The relevant concepts associated with the natural language content may be used to index the natural content, identify related content, provide targeting advertising related to the natural language content, and the like. | 04-14-2011 |
20110087671 | Document Processing System and Method Thereof - In a document processing system and a document processing method, the document processing method includes the steps of: obtaining a document and dividing the document into a plurality of terms and giving a syntactic index to each of the terms; dividing the document into a plurality of sentences; selecting a key syntactic index from the syntactic indexes, and transforming each of the sentences into a preliminary key terms sequence which includes a preliminary key term corresponding to the key syntactic indexes according to the key syntactic indexes; selectively expanding the preliminary key term of a representative sentence of the document into an advanced key term; and determining at least one core candidate sentence among the sentences to generate a core abstract of the document according to the preliminary key terms and the advanced key term. | 04-14-2011 |
20110093467 | SELF-INDEXING DATA STRUCTURE - A machine based tool and associated logic and methodology are used in converting data from an input form to a target form using context dependent conversion rules, and in efficiency generating an index that may be utilized to access the converted data in a database. Once the data has been converted, an index data structure for each data object may be automatically generated that encodes one or more characteristics or attributes of the converted data so that an entity may access the data using the index structure. As an example, the one or more characteristics may include categories, subcategories, or other attributes of the data. | 04-21-2011 |
20110093468 | SYSTEM AND METHOD FOR IMPROVING STORAGE AND RETREIVAL OF DOCUMENTS - A system and method for enhancing the storage and retrieval of documents includes employing information about documents filed in a retained document filing arrangement and information about document processing channels. The information is used in the system and method to implement a document processing channel that enhances the further storage and retrieval of document filed in the retained document filing arrangement. A controller computer may be in communications with sensors connected to the retained document filing arrangement to provided attribute data for retained documents. The document processing channels can employ scanning, digitizing and indexing. | 04-21-2011 |
20110093469 | TECHNIQUES FOR EXTRACTING SEMANTIC DATA STORES - Techniques for enforcing policies. A set of data is stored in one or more data stores. A plurality of semantic concepts for an ontology are defined and a map is constructed from the set of data to the concepts. The map is executed in order to transform data from the set of data to a second set of data. The second set of data is stored according to the ontology. The second set of data is reasoned in order to determine compliance with one or more policies. | 04-21-2011 |
20110093470 | METHOD AND SYSTEM FOR OFFLINE INDEXING OF CONTENT AND CLASSIFYING STORED DATA - A method and system for creating an index of content without interfering with the source of the content includes an offline content indexing system that creates an index of content from an offline copy of data. The system may associate additional properties or tags with data that are not part of traditional indexing of content, such as the time the content was last available or user attributes associated with the content. Users can search the created index to locate content that is no longer available or based on the associate attributes. | 04-21-2011 |
20110099172 | DOCUMENT EXPOSURE TRACKING PROCESS AND SYSTEM - A system and process to allow more accurate tracking of attention received by an electronic document via a display screen. The process, and the system that implements the process: maps a machine readable document into data segments; defines a list of data segment indicators corresponding to the data segments; determines a portion of the machine readable document truncated to a occupy view port; and associates an attention value with the data segment indicators corresponding to the presence of the data segments available to the view port. The present invention may further utilize the track list to monitor the availability of machine readable documents, truncated or not, in the view port, and tint the machine readable document accordingly. | 04-28-2011 |
20110099173 | Indexing In A Resource Description Framework Environment - Indexing in a resource description framework data system including a search key set of ranking relationships for the subject, predicate, and object of RDF statements, generation of a single nodal index including a node for each of the RDF statements, each node having a plurality of rank tracking indicators that include a position indicator representing the relative position of the RDF statement in each ranking relationship. A computer readable storage medium and a system for indexing a resource description framework data system are also provided. | 04-28-2011 |
20110099174 | Data Recovery System and Method In A Resource Description Framework Environment - A method of optimizing data recovery in an RDF data system in which the occurrence of data values across one or more of subject, predicate, and object are ranked, a handle is assigned to each of one or more of the ranked data values wherein the handle is directly based on the ranking. The rank-based handle can then be used to index the RDF statements. An RDF data recovery system includes an RDF statement processor configured to rank data values and assign handles to data values based on ranking. | 04-28-2011 |
20110106810 | Aggregating Content from Multiple Content Contributors - Utilizing logical identifiers or sets of metadata to describe, organize, and index content associated with a software application system from one or more content contributors is provided. When a request is made for specific content, the request may point to a logical identifier, and pieces of content applicable to the requested content may be retrieved and presented to a user. By utilizing schematized metadata associated with individual content items, a user's experience may be improved by providing content available from a plurality of content contributors applicable to the user's query. | 05-05-2011 |
20110106811 | Efficient XML Tree Indexing Structure Over XML Content - A method and apparatus are provided for building and using a persistent XML tree index for navigating an XML document. The XML tree index is stored separately from the XML document content, and thus is able to optimize performance through the use of fixed-sized index entries. The XML document hierarchy need not be constructed in volatile memory, so creating and using the XML tree index scales even for large documents. To evaluate a path expression including descendent or ancestral syntax, navigation links can be read from persistent storage and used directly to find the nodes specified in the path expression. The use of an abstract navigational interface allows applications to be written that are independent of the storage implementation of the index and the content. Thus, the XML tree index can index documents stored at least in a database, a persistent file system, or as a sequence of in memory. | 05-05-2011 |
20110106812 | XPath-Based Creation Of Relational Indexes And Constraints Over XML Data Stored In Relational Tables - Techniques and approaches are provided for creating indexes and column constraints on structured XML data that is stored in a relational database. Data Definition Language (DDL) Create Index and Create Constraint commands have extended syntax that allows the specification of a path-based expression instead of requiring a column and table name. A mapping created by the system when an XML Schema is registered stores the correspondence of XML data elements to automatically-created database tables and columns that are given names only useful for the internal system. When a user provides a path-based expression in a DDL when creating an index or constraint, the path-based expression is translated to the underlying database constructs using the mapping. Issues are addressed for handling path-based expressions that evaluate to more than one element. Additional index optimization is described using data type information available in the XML schema to select the optimal index type. | 05-05-2011 |
20110106813 | Database System and Method for Data Acquisition and Perusal - A data acquisition and perusal system and method including a database selection module, a database index generator module and a search module. The database selection module enables selection of a plurality of files for inclusion into at least one selectable database. The database index generator module enables generation of a searchable index of the data contained in the selectable database. The search module enables a search to be performed of the searchable index according to search criteria. The system allows for the capture of HTML data which is automatically indexed without human intervention and has the ability to automatically and accurately locate or “pinpoint,” and highlight specific text or groups of text designated by the user within the resulting database. | 05-05-2011 |
20110106814 | SEARCH DEVICE, SEARCH INDEX CREATING DEVICE, AND SEARCH SYSTEM - A search device includes a partial character string extracting unit for acquiring partial character strings for search from a search query inputted, a partial character string searching unit for acquiring name text candidates and pieces of partial character string appearance position information respectively showing the appearance positions of the partial character strings within the name text candidates according to the partial character strings for search, a candidate counting unit for counting an accumulated score for each name text candidates by providing consistency among the appearance positions in consideration of the pieces of partial character string appearance position information in such a way that the appearance positions do not overlap one another in each name text candidate, a candidate-to-be-presented selecting unit for determining a candidate to be presented according to the accumulated score, and a candidate presentation unit for presenting the candidate to be presented. | 05-05-2011 |
20110113035 | FILE SYSTEM - Embodiments of the present invention provide a computer system, comprising a personal file system module for determining a file system folder structure based upon a user profile and mapping user content stored on one or more storage devices to one or more folders of the file system. | 05-12-2011 |
20110131211 | METHODS AND SYSTEMS FOR VISUALIZING TOPIC LOCATION IN A DOCUMENT REDUNDANCY GRAPH - Methods, systems, and computer program products can visualize topic location in a document redundancy graph. Document subcomponents such as, for example, document paragraphs of a document or document set can be searched for user interface specified search data and a determination can be made as to which document subcomponents contain information relating to the search data. Based on the redundancy graph data, a determination can then be made as to which particular user selectable node(s) of the graph contain the particular document subcomponent(s). Data for identifying the determined particular user selectable node(s) on a user interface can be generated and rendered to allow a user to quickly locate the particular user selectable node(s) on the redundancy graph. The particular user selectable nodes themselves can for example be visually highlighted on the graph. Keyword indexing can be used to assure chosen keywords are keywords relating to particular topics contained in the document set. | 06-02-2011 |
20110131212 | INDEXING DOCUMENTS - A document to be indexed is initially indexed in dependence upon language-specific rules of a single language. A success metric is used to assess the effectiveness of the single language indexing. If a threshold level of success is not attained, the document is identified as multi-lingual. In response to identifying the document as multi-lingual, the document is queued for multi-lingual indexing. A document may be fragmented into a number of smaller documents, each of which is indexed separately. | 06-02-2011 |
20110131213 | Apparatus and Method for Mining Comment Terms in Documents - This invention discloses a method for mining a comment term in a document. The method comprises, first, to build a document database and a keyword database, wherein the document database includes at least one digital document, the keyword database includes at least one keyword. Then, a language of the digital document is determined. The digital document is processed based on the language to form a first document. Next, word groups are gathered from the first document based on a gathering range and apart-of-speech, wherein each word group includes the keyword and a word with the part-of-speech. | 06-02-2011 |
20110131214 | INFORMATION RETRIEVAL METHOD, COMPUTER READABLE MEDIUM AND INFORMATION RETRIEVAL APPARATUS - According to an aspect of the invention, a computer readable medium stores a program causing a computer to execute a process for retrieving information. The process includes an extracting process, an executing process, a first creating process, a second creating process, a determining process. The extracting process extracts, from a first composition that is an object to be searched for and that includes first sentence elements and a second composition that indicates a retrieval condition and that includes second sentence elements, the first sentence elements, the second sentence elements, and sentence element relations indicating relations between the first sentence elements and relations between the second sentence elements. | 06-02-2011 |
20110137909 | LOCATION INDEPENDENT EXECUTION OF USER INTERFACE OPERATIONS - This disclosure relates to systems, methods, and software that involve location-independent execution of user interface operations. This execution can include identifying a user interaction through a client-side user interface that is communicably coupled with a server-side computer. It is then dynamically determined whether to process the identified user interaction on the client-side or the server-side. If the user interaction is to be processed on the client-side, then a client-side action handler is invoked. If the user interaction is to be processed on the server-side, then it is requested that a server-side action handler process the interaction. | 06-09-2011 |
20110137910 | LAZY EVALUATION OF SEMANTIC INDEXING - A method for searching a database of digital media assets, comprising: designating a database of digital media assets, wherein the database of digital media assets has been indexed according to a set of general indexers; receiving a search query; defining specialized search conditions by identifying one or more elements of the search query corresponding to one or more specialized indexers; defining general search conditions by identifying elements of the search query corresponding to the general indexers; identifying a subset of the digital media assets by applying the general search conditions; indexing the subset of the digital media assets using the identified specialized indexers; and ranking the subset of the digital media assets by applying the specialized search conditions. | 06-09-2011 |
20110137911 | SYSTEM AND METHOD FOR PROVIDING INTEGRATED MANAGEMENT OF ELECTRONIC INFORMATION - A system and method for managing electronic information using a computer system for one or more users, comprising creating an electronic folder and storing the folder in an electronic folder management system, creating files for the electronic folder, maintaining a collection of electronic folders and the files stored therein, and associating the electronic folders with one or more files such that each of the files may be accessed by identifying the related electronic folder. | 06-09-2011 |
20110137912 | SYSTEM, METHOD AND COMPUTER PROGRAM PRODUCT FOR DOCUMENTS RETRIEVAL - The invention provides a system and method for retrieving documents from a collection of documents that match a word search query. A word index is generated for each document in which each entry is an enriched-term string built from the stemmed form of the word to be searched and a separator character followed by the original form of the word to be searched. During a retrieving operation, a search query is processed depending the original form or the stemmed form of a word to be searched. Cross-documents tables are addressed to find documents that match the enriched-term string of the word to be searched. | 06-09-2011 |
20110137913 | Systems and Methods for Interest-Driven Stock market Segmentation and Stock Trading - Systems and methods for indexing and searching companies for use in a stock trading application are provided. Indexing may assign a scoring system based on the several pre-determined factors as well as the data source from which the data has been received. Additionally, the indexing methodology supports the creation of minimarkets of stocks based on a term-based or user interest-based groupings of companies. | 06-09-2011 |
20110137914 | CUSTOM CACHING - Methods and systems are presented for custom caching. Application threads define caches. The caches may be accessed through multiple index keys, which are mapped to multiple application thread-defined keys. Methods provide for the each index key and each application thread-defined key to be symmetrical. The index keys are used for loading data from one or more data sources into the cache stores on behalf of the application threads. Application threads access the data from the cache store by providing references to the caches and the application-supplied keys. Some data associated with some caches may be shared from the cache store by multiple application threads. Additionally, some caches are exclusively accessed by specific application threads. | 06-09-2011 |
20110145255 | Systems and methods for distribution of data in a database index - Systems and methods consistent with the invention may include storing, in a memory device of the computer system, a node including existing data values stored in a sequence, including in the node an insert position value and an append count value, the insert position value providing a position in the node where one of the existing data values is included, and the append count value being a counter value that is modified based on the inclusion of the existing data values in the node; receiving a new data value; determining a location in the memory device for storing the new data value, wherein the location is determined based on at least one of a type of the new data value, the sequence, and the append count value; and storing the new data value in the determined location. | 06-16-2011 |
20110153616 | System and Method for Providing Access to Log Data Files - A system and method for providing a user with efficient access to a log data file. The system takes a computer generated log data file and applies a user defined filter criteria to the log data file. The indexed and filtered log data file is then displayed to the user in an application player where the user can interact with the log data file. | 06-23-2011 |
20110153617 | METHOD AND SYSTEM FOR DESCRIBING AND ORGANIZING IMAGE DATA - The present invention is directed to methods and systems for describing and organizing image data. An analysis system includes a sensor, a processing unit, a machine, a database, and/or a display. The analysis system is connected to an electronic device. The sensor generates sensor data which is imported by the processing unit. The processing unit analyzes the sensor data to determine features of the sensor data such as the subject, the time, the location, the object, the event, and/or the action in the sensor data. The processing unit creates entries for the sensor data in a features index. The processing unit receives a query for an item and uses the features index to determine search results including the item, such as a location and a time of the item. The processing unit can also determine whether a machine has completed its mission using the features in the sensor data. | 06-23-2011 |
20110161325 | SYSTEM, METHOD AND COMPUTER-READABLE STORAGE MEDIUM FOR GENERATION AND REMOTE CONTENT MANAGEMENT OF COMPILED FILES - An apparatus is provided that includes a processor configured to perform or cause the apparatus to perform at least a number of operations. The operations include receiving a compiled media file including a plurality of objects, where the compiled file is received via a remoting gateway configured to post the compiled file to the apparatus. The operations also include parsing the compiled file to identify its objects and produce an index of the objects, and include mapping one or more data or files to one or more of the objects based on the index. The compiled file is servable to a viewer device, which includes receiving a request for the compiled file, and in response to the request, determining one or more data or files mapped to one or more of the objects, and serving the compiled file and the determined one or more data or files. | 06-30-2011 |
20110161326 | SYSTEM AND METHOD FOR SERVING SEARCH RESULTS OF TEXTUAL DATA IN RESPONSE TO A QUERY AS IT IS TYPED - A system and a method provide in memory two level of indices to a data set, respectively referred to as an instance layer and an entry layer. The instance layer is brought from disk into memory when an application is initialized to allow insertion of new data into the data set. A temporary data structure augments the instance layer and is provided to keep track of the new data. When query processing is required, the entry layer is brought in, so that query processing may be achieved using both the instance layer indices and the entry layer indices. The temporary data is merged into the instance and entry layers and the data set is updated upon termination of the application or at predetermined times, whereupon the resulting data set and index layers are written out to disk. | 06-30-2011 |
20110161327 | ASYNCHRONOUS METHODS OF DATA CLASSIFICATION USING CHANGE JOURNALS AND OTHER DATA STRUCTURES - Aspects of the present invention are generally concerned with systems and methods for generating a database of metadata that describes system data and storage operations. The database of metadata may be referred to herein as a “metabase.” For example, to generate a metabase, a data agent may traverse a file system to obtain certain characteristics of data managed by the file system while substantially simultaneously detecting and recording change notifications. These actions may be performed even if the actions of the data agent are interrupted one or more times during the traversal of the file system. The data agent may process the characteristics and change notifications to generate and update a metabase. Once formed, the metabase may be consulted to determine changes in system data rather than determining the changes by scanning the data files themselves. | 06-30-2011 |
20110167069 | SYSTEM AND METHOD FOR CREATING AND PROVIDING MEDIA OBJECTS IN A NAVIGABLE ENVIRONMENT - A system and method for storing and providing media objects in a navigable environment implemented on at least one computer executing computer readable program code stored in a tangible memory medium. The at least one computer is configured to access a plurality of media objects comprising a media recording and metadata. The at least one computer is further configured to generate an index of said plurality of media objects based on said metadata. | 07-07-2011 |
20110167070 | REUSING ASSETS FOR PACKAGED SOFTWARE APPLICATION CONFIGURATION - Reusing assets for packaged software application configurations may include defining one or more query criteria based on information associated with a project, querying an asset repository stored in a storage device using the one or more query criteria, receiving a list of one or more assets from the asset repository in response to the querying, and using the one or more assets on the list to compose a document defining deployment procedures associated with deploying a packaged software application. | 07-07-2011 |
20110167071 | METHOD FOR SCORING INDIVIDUAL NETWORK COMPETITIVENESS AND NETWORK EFFECT IN AN ONLINE SOCIAL NETWORK - The present invention relates to a method for scoring individual network competitiveness and network effect by network analysis in an online social network, particularly, to embody a social network in an online way, to measure network competitiveness and network effect of each node in an online social network based on results of mutual evaluation for trust, integrity, solidarity, openness, importance, and intimacy among the 1 | 07-07-2011 |
20110173203 | PROVIDING WEB SERVICES FROM BUSINESS INTELLIGENCE QUERIES - In an embodiment of the invention, analytical data may be extracted from a business intelligence (BI) system on criteria specified in metadata of a business intelligence query. Once the data is extracted from the BI system, the extracted content may be provided in the context of semantic identifiers. In an embodiment, the semantic identifiers may provide a description of the extracted content associated with the identifier. In one embodiment, the semantic identifiers may be obtained from the criteria specified in the query. In another embodiment, the semantic identifiers may be defined in and obtained from a separate enterprise service repository. In an embodiment, a proposed mapping between query elements and semantic identifiers may be suggested by a processor. | 07-14-2011 |
20110173204 | ASSIGNING GESTURE DICTIONARIES - Techniques for assigning a gesture dictionary in a gesture-based system to a user comprise capturing data representative of a user in a physical space. In a gesture-based system, gestures may control aspects of a computing environment or application, where the gestures may be derived from a user's position or movement in a physical space. In an example embodiment, the system may monitor a user's gestures and select a particular gesture dictionary in response to the manner in which the user performs the gestures. The gesture dictionary may be assigned in real time with respect to the capture of the data representative of a user's gesture. The system may generate calibration tests for assigning a gesture dictionary. The system may track the user during a set of short gesture calibration tests and assign the gesture dictionary based on a compilation of the data captured that represents the user's gestures. | 07-14-2011 |
20110173205 | METHOD AND SYSTEM FOR DYNAMICALLY PROCESSING AMBIGUOUS, REDUCED TEXT SEARCH QUERIES AND HIGHLIGHTING RESULTS THEREOF - A method and system are provided of processing a search query entered by a user of a device having a text input interface with overloaded keys. The search query is directed at identifying an item from a set of items. Each of the items has a name including one or more words. The system receives from the user an ambiguous search query directed at identifying a desired item. The search query comprises a prefix substring of at least one word in the name of the desired item. The system dynamically identifies a group of one or more items from the set of items having one or more words in the names thereof matching the search query as the user enters each character of the search query. The system also orders the one or more items of the group in accordance with given criteria. The names of the one or more items of the identified group are output to be displayed on the device operated by the user as ordered with the characters of the one or more words in the names corresponding to the prefix substring of the search query being highlighted. | 07-14-2011 |
20110173206 | METHOD AND APPARATUS FOR IDENTIFYING A PIECE OF CONTENT - A system and method for identifying a piece of content is disclosed. In one embodiment, an identifier for a piece of digital content can be generated using the title of the piece of digital content, the size of the piece of digital content, and a hash of other attributes of the piece of digital content such as title, album, and artist names. | 07-14-2011 |
20110173207 | MANAGING COPIES OF DATA - A method in a computer system for retrieving data from one of multiple copies of the data is provided, referred to as the data management system. The data management system receives a request identifying at least one data object to be accessed. Then, the data management system queries a metabase to locate data copies that contain the identified at least one data object, wherein the data copies are created from similar source data, and wherein for each data copy the metabase contains an indication of the availability of the copy relative to other copies. Next, the data management system determines one of the located data copies to use to access the identified at least one data object, wherein the determination is made based on the indicated availability contained in the metabase for each of the located data copies. Then, the data management system accesses the identified at least one data object using the determined one of the located data copies. | 07-14-2011 |
20110179038 | PER-THREAD, PER-INSTANCE DATA STORAGE - A method of storing per-thread, per-instance data, includes identifying a unique index value corresponding to a first instance, identifying type parameters based on the identified index value, and instantiating a generic holder object based on the identified type parameters. The generic holder object includes a thread local field configured to store per-thread data that is local to the first instance. | 07-21-2011 |
20110179039 | METHOD AND SYSTEM FOR SEARCHING STORED DATA - A complete document management system is disclosed. Accordingly, systems and methods for managing data associated with a data storage component coupled to multiple computers over a network are disclosed. Systems and methods for managing data associated with a data storage component coupled to multiple computers over a network are further disclosed. Additionally, systems and methods for accessing documents available through a network, wherein the documents are stored on one or more data storage devices coupled to the network, are disclosed. | 07-21-2011 |
20110184956 | ACCESSING DIGITALLY PUBLISHED CONTENT USING RE-INDEXING OF SEARCH RESULTS - Illustrated is a system and method to identify, using an identification module, indexed digitally published content responsive to a search query. The system and method further includes generating an index value, using a indexing engine, based upon a characteristic of the indexed digitally published content. Additionally, the system and method includes re-indexing, using a re-indexing module, the indexed digitally published content based upon the index value. | 07-28-2011 |
20110191346 | DYNAMICALLY-CREATED PYRAMID TO DELIVER CONTENT - Dynamically-created content may be rendered in the form of a virtual image pyramid, and tiles of the pyramid may be provided for display. In one example, a user interacts with a server-based application, and the application dynamically generates visual content to be shown to the user. Based on the application's interaction with the user, or based on some other type of state information, the application determines the appropriate magnification level at which to show content to the user, and also determines the particular spatial region of the content to show to the user. This content is rendered onto one or more tiles, and the one or more tiles are delivered to the user. In this way, tiles are created and may be rendered as if there were an image pyramid of the dynamically-created content. | 08-04-2011 |
20110191347 | ADAPTIVE ROUTING OF DOCUMENTS TO SEARCHABLE INDEXES - Documents are assigned to one or more indexes in a document indexing system on the basis of document properties such as total number of tokens in the document, number of numeric tokens in the document, number of alphabetic tokens in the document, size of the document, and metadata associated with the document. Based on statistical distributions of document properties (over a large number of documents), different indexes can be defined, and a document router can direct a particular document to one index or another based on the properties of the particular document. In some implementations, certain document properties may be used to identify a nonrelevant document, or garbage document, so that it is either not indexed or assigned to an index dedicated for such documents. | 08-04-2011 |
20110191348 | METHOD OF INDEXING DATA IN DATA STORAGE DEVICE AND APPARATUSES USING THE METHOD - A method of indexing data in a data storage device includes generating index information of data using the data storage device and storing the index information in a storage region using the data storage device. | 08-04-2011 |
20110191349 | Method and System For Indexing, Relating and Managing Information About Entities - Systems and methods for use in association with a master entity index system may allow data records to be grouped together into various entities, where each of the entities may represent a logical or physical item. These entities may also be associated with one another in a manner such that relationships between entities may likewise be represented. In one embodiment, an interface may be provided for use in conjunction with the master entity index system such that these various entities and relationships may be better managed, manipulated or visualized. This interface may allow a user to search for or otherwise obtain an entity, where a representation of this entity and one or more associated entities may be presented to the user along with representations of the relationships between these entities. | 08-04-2011 |
20110196873 | System and Method for Replicating Objects In A Distributed Storage System - A system and method for inserting an object into a distributed database is provided. An object to be inserted into a priority queue is received, wherein the object includes a unique identifier and a priority. Next, an index for the object is generated. A row name for the object is then generated based on the index, the priority of the object, and the unique identifier of the object, wherein a lexicographical order of the row name for a higher priority object is smaller than the lexicographical order of the row name for a lower priority object. The object is then inserted into a row of a distributed database using the row name. | 08-11-2011 |
20110202537 | SYSTEM AND METHOD FOR USING TOPIC MESSAGES TO UNDERSTAND MEDIA RELATING TO AN EVENT - A system and method for using topic messages to understand media relating to an event. An event (such as a live event or broadcast event), is selected where at least a portion of the content of the event is stored as at least one media object, such as, for example a video or audio recording in digital format. A plurality of topic messages relating to the broadcast event is retrieved over a network and are analyzed to identify a plurality of content segments within the event. The topic messages are further analyzed to identify at least one key actor related to the event. Segment and key actor data relating to the event is stored to a computer readable medium in association with the media objects, whereby the segment data identifies the plurality of content segments within the media objects and the key actor data identifies key actors in the event. | 08-18-2011 |
20110202538 | VOXEL APPROACH TO TERRAIN REPOSITORIES FOR MODELING AND SIMULATION - A set of sensors can capture raw data that geospatially corresponds to a real world volumetric space. The raw data can include point cloud data encoded in a light detecting and ranging (LiDAR) information format, imagery and video data, and elevation data encoded in a digital elevation model (DEM) or digital surface model (DSM) format. The real-world volumetric space can be segmented into a set of volumetric units, wherein datum of the raw data is indexed against the volumetric units. Each of the volumetric units of the real-world volumetric space can be mapped to a voxel in a storage volumetric space of a voxel database. The raw data can then be stored and fused in the voxel database such that each voxel in the voxel database represents a combination of volumetrically stored data for all source products that includes providing point cloud data, imagery data, and elevation data. | 08-18-2011 |
20110202539 | QUERY SYSTEM FOR A HYBRID VOXEL AND FEATURE DATABASE - A hybrid database can receive a hybrid query for an object having a real world analog. A feature of a set of feature records can be determined that corresponds to the object. Feature-level attribute values of the feature can be extracted from the feature records. A volumetric envelope forming an outer boundary of the feature in a volumetric storage space of the hybrid database can be determined. A set of uniquely indexed voxels can be determined. Each voxel can be a volumetric unit of the volumetric storage space that represent the volume contained by the volumetric envelope. Voxel-level semantic values can be extracted from the set of uniquely indexed voxels from voxel records. Extracted voxel-level semantic values and extracted feature-level attribute values can be combined to generate a hybrid result. The hybrid result can be conveyed to a requestor from which the hybrid query was received. | 08-18-2011 |
20110202540 | METHOD AND APPARATUS FOR EFFICIENT INDEXED STORAGE FOR UNSTRUCTURED CONTENT - A method and apparatus for efficient indexed storage for unstructured content have been disclosed. | 08-18-2011 |
20110208744 | METHODS FOR DETECTING AND REMOVING DUPLICATES IN VIDEO SEARCH RESULTS - Duplicate video search results are detected and removed. Digital signatures are generated for each video content item of a video content corpus. Duplicates are determined for the top n previously received queries by determining the similarity of video content items that are within the same results set of each particular query of the top n previously received queries. Similarities are calculated between any two video documents of the result set of the particular query by measuring the difference between the digital signatures of two video documents. If a similarity between two videos is determined to be above a particular threshold, then the two videos are considered duplicates of each other and the search index is updated by retaining the most relevant of the video documents to the particular query. The less relevant video documents are flagged as duplicates with respect to the particular query. | 08-25-2011 |
20110213781 | FILTERED REMOTE JOURNAL - An operating system on a source computer system includes a filtered remote journal mechanism that generates a target map of a remote journal to specify where in the remote journal entries will be stored, then generates a vector array that represents the data of interest in the database journal based on specified selection criteria and the target map. The vector array and location information from the target map is then sent to a communication manager, which sends the data indicated in the vector array to the target computer system. On the target computer system, the communication manager reads the data sent from the source computer system and writes the data to the remote journal. The filtered remote journal mechanism may operate on an existing database journal, or may operate iteratively as journal entries are written on the source computer system. | 09-01-2011 |
20110213782 | INDEXING OF DATABASE QUERIES - Making data available from a database is disclosed. Making data available includes specifying a query function having a query function name, wherein the query function includes a structure and a member, determining the structure and the member included in the query function, wherein the query function has a query function name and includes the structure and the member, creating an index for the structure on the member, and compiling the query function to be available to a user by invoking the query function name without the user having to specify the structure and the member. Retrieving data from a database is disclosed. Retrieving includes invoking a query function that specifies a plurality of structures and a value, accessing a cross index of the plurality of structures, and using the cross index to access the data. | 09-01-2011 |
20110213783 | METHOD AND APPARATUS FOR GATHERING, CATEGORIZING AND PARAMETERIZING DATA - An apparatus for and a method of gathering new data form a specific data source, categorizing this data into a hierarchical directory structure, and building parameters to further describe this data. A harvesting system is an application that can organize and locate external data; the harvesting system can be used to navigate, organize and locate this data. The harvesting system utilizes a defined data model, referred to as a core data structure, that describes the logical structure of the data. Within the core data structure are pointers, which refer back to the source data. The loading process can be manually done, or can be automatically accomplished using this harvesting process. | 09-01-2011 |
20110219007 | METHOD EXECUTED BY MANAGEMENT SERVER, SYSTEM, AND COMPUTER-READABLE RECORDING MEDIUM - A method executed by a management server controlling a plurality of index servers for searching a database storing an object identifier and an attribute value, each of the plurality of index servers handling a part of the database, storing index information including the attribute value within a range predefined for the each of the plurality of index servers and the associated object identifier stored in the database, the method has deleting one of index information stored in one of the index servers in reference to the range of each of the index servers when an updated attribute value coming to be out of the range of the one of the index servers, obtaining a frequency of the deleting and the storing the one of the index information, and enabling the index server to obtain an attribute value from the database when the obtained frequency being more than a predetermined value. | 09-08-2011 |
20110225164 | GRANULAR AND WORKLOAD DRIVEN INDEX DEFRAGMENTATION - This patent application relates to granular and workload driven database index defragmentation techniques. These techniques allow for defragmenting individual index ranges, performing benefit analyses to estimate the impact of defragmenting indexes or index ranges, and leveraging such benefit analyses to provide automated workload-driven recommendations of index(es) or index range(s) to defragment. | 09-15-2011 |
20110225165 | METHOD AND SYSTEM FOR PARTITIONING SEARCH INDEXES - Methods and systems for partitioning search indexes are described. In one embodiment, such a method includes identifying, via a partition manager, a search index to be partitioned, where the search index includes a plurality of entity types, and wherein each of the plurality of entity types associates records stored in a data store with the respective entity type within the search index. The method further includes partitioning, via the partition manager, the search index into a first partitioned search index and a second partitioned search index, correlating, via a partitioning scheme, a first subset of the plurality of entity types with the first partitioned search index, and correlating, via the partitioning scheme, a second subset of the plurality of entity types with the second partitioned search index. | 09-15-2011 |
20110231403 | SCALABLE INDEX BUILD TECHNIQUES FOR COLUMN STORES - Architecture that includes an index creation algorithm that utilizes available resources and dynamically adjusts to successfully scale with increased resources and be able to do so for any data distribution. The resources can be processing resources, memory, and/or input/output, for example. A finer level of granularity, called a segment, is utilized to process tuples in a partition while creating an index. The segment also aligns with compression techniques for the index. By choosing an appropriate size for a segment and using load balancing the overall time for index creation can be reduced. Each segment can then be processed by a single thread thereby limiting segment skew. Skew is further limited by breaking down the work done by a thread into parallelizable stages. | 09-22-2011 |
20110231404 | File storage and retrieval method - A file storage and retrieval method or technique for processing alpha numeric information that has particular advantages when accessing data in a database on a computer. The retrieval technique uses the ASCII values of characters in a search string concatenated together to form a numeric value which serves as the index to the data itself or to the index which holds the computer address of the location of the data. This technique allows data to be accessed with only one access when searching for a word or phrase within a database and lends itself for use on static storage systems of the future as well as on current disk based systems. | 09-22-2011 |
20110238666 | METHOD AND APPARATUS FOR ACCESSING AN AUDIO FILE FROM A COLLECTION OF AUDIO FILES USING TONAL MATCHING - There is provided a method for accessing at least one audio file from a collection comprising more than one audio file stored within or accessible with an electronic device. The method includes generating one index comprising of information entries obtained from each of the more than one audio file in the collection, with each audio file in the collection information being linked to at least one information entry; receiving an input during an input reception mode; converting the input into a digital signal using an analog-digital converter; analysing the digital signal using either frequency spectrum analysis or melody contour analysis into discrete portions; and comparing the discrete portions with the entries in the index. It is advantageous that the audio file is accessed when the discrete portions substantially coincide with at least one of the information entries in the index. The discrete portions may be musical notes, melody contours or waveforms. The at least one information entry may also be musical notes, melody contours or waveforms. An apparatus employing the aforementioned method is also disclosed. | 09-29-2011 |
20110238667 | B-Tree Ordinal Approximation - A system, method, and computer program product are provided for generating a range of ordinals corresponding to key values represented in a B-Tree data structure. This allows for a boundary point, usable, for example, for partitioning the B-Tree for parallel processing, to be specified using an ordinal value of the range of ordinals. Then an ordinal-to-key transformation can be performed in order to identify a key value corresponding to the boundary point. Additionally, ordinal values can be identified through the use of key-to-ordinal transformations, allowing for specification of ranges, for example, in terms of key values. | 09-29-2011 |
20110238668 | DOCUMENT MANAGEMENT SYSTEM - In a document management system that manages index item definition and document data by cabinet, an index can be easily provided. A user that can log into a first database can use an index item defined by the first database to provide an index value to document data stored in a second database. | 09-29-2011 |
20110238669 | METHOD AND SYSTEM TO SEARCH OBJECTS IN PUBLISHED LITERATURE FOR INFORMATION DISCOVERY TASKS - The present invention relates to the identification, extraction, linking, storage and provisioning of data that constitute the captioned components of published or “print ready” literature for computerized information discovery activities including search, browse and data mining. These components, or objects, include the tabular presentation of data (“tables”) and graphics such as “figures”, “images” and “illustrations” typically used to supplement the textual narrative of the publication. | 09-29-2011 |
20110246470 | FAST SEARCH - Data records in one or more databases may be scanned to create a search index. The search index may include an identifier for each data record scanned, and a list of fragments of data in one or more data fields of each scanned data record. Each fragment in the list may be created by extracting a fixed number of adjoining characters from data in each data field of a corresponding data record. The list of fragments may include all fixed number of adjoining character sequences in each data field of a data record. Multiple search indexes may be included, with each search index including a different fixed number of adjoining characters in the fragment list. Systems and methods may be provided. | 10-06-2011 |
20110246471 | RETRIEVING VIDEO ANNOTATION METADATA USING A P2P NETWORK - A method of annotating video programs (media) with metadata, and making the metadata available for download on a P2P network. Program annotators will analyze a video media and construct annotator index descriptors or signatures descriptive of the video media as a whole, annotator scenes of interest, and annotator items of interest. This will serve as an index to annotator metadata associated with specific scenes and items of interest. Viewers of these video medias on processor equipped, network capable, video devices will select scenes and items of interest as well, and the video devices will construct user indexes also descriptive of the video media, scenes and areas of interest. This user index will be sent over the P2P network to annotation nodes, and will be used as a search tool to find the appropriate index linked metadata. This will be sent back to the user video device over the P2P network. | 10-06-2011 |
20110246472 | DATA STORE INTERFACE THAT FACILITATES DISTRIBUTION OF APPLICATION FUNCTIONALITY ACROSS A MULTI-TIER CLIENT-SERVER ARCHITECTURE - Methods and systems for facilitating distribution of application functionality across a multi-tier client-server architecture are provided. According to one embodiment, a method is provided for instantiating a DataMap. A data store interface reads a set of definitions and instructions from a datastore that describe the structure of the DataMap. The data store interface interprets the set of definitions and instructions to instantiate the DataMap. According to another embodiment, a method is provided for indexing into a DataMap. A data store interface receives an expression. The data store interface parses the expression to identify a set of keys suitable for indexing into the DataMap and corresponding DataPoints. | 10-06-2011 |
20110246473 | COMPUTERIZED METHOD FOR ANALYIZING INNOVATION INTERRELATIONSHIPS WITHIN AND BETWEEN LARGE PATENT PORTFOLIOS - The method of the present invention provides a labor and time saving ability to determine interrelationships within patents determined by searching, via a computer system, through patent fields to see if one or more particular pieces of alphanumeric data are common to any of the patents in the database in which the field indicia are located. Such commonality is searchable in backward or forward direction, or both, from, for example, one patent of particular interest. The method allows for presentation of families of interrelated patents within minutes rather than hours, weeks or longer by utilizing computer based technology. Further, the methodology allows for determinations of interrelationships within desired degrees of separation by manipulation of the indicative data to be searched. | 10-06-2011 |
20110246474 | DATA MANAGEMENT APPARATUS, DATA MANAGEMENT METHOD, AND DATA MANAGEMENT PROGRAM - [Problems to be Solved by the Invention] It is an object of the present invention to provide a data management apparatus capable of efficiently managing various data that is transmitted and received via a network, by automatically and adequately generating an ID with which the identity of pieces of data can be verified and associating the ID with the pieces of data, even if specifying information such as a marking assigned to the piece of data is modified by a completely independent server. | 10-06-2011 |
20110246475 | SYSTEM AND METHOD FOR LOCATING AND RETRIEVING PRIVATE INFORMATION ON A NETWORK - A system for document retrieval in a network environment is provided where documents are stored with corresponding privacy codes. A query server computer is in communication with the network and is programmed to generate a privacy index of all documents available on the network indexed by their corresponding privacy codes. The privacy codes define document access permissions that are securely associated with the documents and are assigned by document custodians. A search engine in communication with the network is configured to receive a query from a requester and generate a list of documents from the privacy index which match search parameters of the query and privacy codes of the requester. | 10-06-2011 |
20110246476 | METHOD AND SYSTEM FOR PERFORMING A SEARCH OF A FEED IN AN ON-DEMAND ENTERPRISE SERVICES ENVIRONMENT - A method of searching in a database of feed items enables efficient searching through a large corpus of data. Each feed item is associated with at least one object in the database. An index of feed items is generated. For each feed item in the database, the index includes a body of the feed item and at least one foreign key that identifies an object associated with the feed item. A search request with one or more search terms is received from a first user. Using the index of feed items, a pre-filtered set of feed items is generated for the first user. The search is executed, using the index of feed items and the pre-filtered set of feed items. Search results are limited to feed items in the pre-filtered set of feed items. | 10-06-2011 |
20110246477 | FILESYSTEM HAVING A FILENAME CACHE - A system comprising a processor, a data storage device that is accessible by the processor, and filesystem software that is executable by the processor to organize files on the data storage device are provided. The filesystem software is executable to maintain a filename cache comprising filename entries. The filename entries may include a filename header section and each filename entry identifies a filename of a corresponding file. | 10-06-2011 |
20110252037 | GENERATING SERVICE SPECIFICATION OF A SERVICE ORIENTED ARCHITECTURE (SOA) SOLUTION - A system and associated method for automatically generating a service specification of a Service Oriented Architecture (SOA) solution. A process model framework and a data model framework are received as inputs. Processes in the process model framework perform services of various complexity levels. Processes are decomposed into a respective set of atomic service processes in the lowest complexity level and data objects are extracted from the decomposed atomic service processes. The data objects are associated with data elements of the data model framework. The data model framework is extended and flexibility patterns are added for reusability of the service specification. The service specification of the SOA solution is generated as process interfaces represented with the data objects according to inputs from a user customizing aspects of the service specification, for either a desired service of the SOA solution or a desired process in the process model framework. | 10-13-2011 |
20110252038 | SEARCH EXTENSIBILITY TO THIRD PARTY APPLICATIONS - At least certain embodiments of the present disclosure include a method to extend search capabilities to third party applications installed on a device. In one embodiment, records associated with a third party application are indexed in a process isolated from other third party applications installed on the device using a search plugin specific to the third party application. Furthermore, the indexed records can be searched in response to a user search query without invoking the third party application. | 10-13-2011 |
20110252039 | SYSTEM, APPARATUS AND METHOD FOR PROCESSING CONTENT ON A COMPUTING DEVICE - A system, apparatus and method for processing content on a computing device is provided whereby, in one embodiment, a domain object model and associated style sheet are received at a computing device. The computing device is configured to parse the domain object model and determine styles from the style sheet and to store those styles in an interim storage for later use. | 10-13-2011 |
20110258196 | METHOD AND SYSTEM OF CONTENT RECOMMENDATION - A method of content recommendation, includes: generating a first digital mathematical representation of contents to associate the contents with a first plurality of words describing the contents; generating a second digital mathematical representation of text documents different from the contents to associate the documents with a second plurality of words; processing the first and second pluralities of words to determine a common plurality of words; processing the first and second digital mathematical representations to generate a common digital mathematical representation of the contents and the text documents based on the common plurality of words; and providing content recommendation by processing the common digital mathematical representation. | 10-20-2011 |
20110258197 | METHOD AND APPARATUS FOR DATA CAPTURE AND ANALYSIS SYSTEM - Content leaving a local network can be captured and indexed so that queries can be performed on the captured data. In one embodiment, the present invention comprises an apparatus that connects to a network. In one embodiment, this apparatus includes a network interface module to connect the apparatus to a network, a packet capture module to intercept packets being transmitted on the network, an object assembly module to reconstruct objects being transmitted on the network from the intercepted packets, an object classification module to determine the content in the reconstructed objects, and an object store module to store the objects. This apparatus can also have a user interface to enable a user to search objects stored in the object store module. | 10-20-2011 |
20110264664 | IDENTIFYING LOCATION NAMES WITHIN DOCUMENT TEXT - Concepts and technologies are described herein for identifying location names within document text. Through an implementation of the concepts and technologies presented herein, functionality can be provided for identifying location names within articles, websites, travelogues, or other such documents. For instance, documents containing the names of cities, regions, countries, landmarks, or other locations may be associated with those locations. The location names may be unambiguously identified even when the location names may also have common word meanings that are not location associated or when the location name may be associated with more than one location. | 10-27-2011 |
20110264665 | INFORMATION RETRIEVAL SYSTEM WITH CUSTOMIZATION - A data search and retrieval system that, in response to a search query, dynamically selects and applies a model of information to be returned to a user. The model may be selected based on the search query directly or indirectly based on data returned by a search engine applying the query. For this purpose, the system may include an index of models, similar to a search index. Models may be authored and contributed to the search and retrieval system by third parties, and an association between each such contributed model and characteristics of a search query, such as specific search query terms, may be stored in the index of models. A user of the search and retrieval system may provide feedback on a model that was used to generate information in response to the user's search query, and such feedback may be used to update the index of models. | 10-27-2011 |
20110264666 | METHOD AND APPARATUS FOR INDEX GENERATION AND USE - An approach is provided for generating a searchable index including an associated object graph. Data objects of a database are converted into corresponding rows in a set of searchable columns. An object graph corresponding to each row of the searchable columns are determined. An index including the set of searchable columns and the corresponding object graphs are generated. | 10-27-2011 |
20110270841 | Distributed Tag-Based Correlation Engine - Systems may use explicit ratings from users to construct user to user correlations. This technique may reduce the user-content correlation to a single dimension, i.e., the content that a plurality of users may rate similarly. Embodiments of the present invention may use DHT as an underlying distributed signaling mechanism, but may also make the rating implicit. Furthermore, embodiments of the present invention may construct the user to content correlation based on multi-dimensional metadata related to the content. | 11-03-2011 |
20110270842 | WEB SERVICE DISCOVERY VIA DATA ABSTRACTION MODEL WITH INPUT ASSISTANCE - A computer-implemented method includes receiving a selection of a particular logical field via a database query interface. The method includes querying a discovery registry associated with a data abstraction model of a database to discover one or more web services that are resolvable based on the particular logical field. The data abstraction model includes a plurality of logical fields, where each logical field corresponds to a particular data structure of the database. A web service input and a web service output of a particular discovered web service are associated with logical fields of the data abstraction model via the discovery registry. A modified database query interface includes an input assistance selectable input associated with the discovered web services. In response to a selection of the input assistance selectable input, a form input interface may be displayed that includes one or more input forms associated with the discovered web services. | 11-03-2011 |
20110270843 | SPECIALIZED SEARCH ENGINES - A specialized search engine tool designed for subject matter experts facilitates access to information relevant to their area of expertise available on public domains over the Internet. The specialized search engine represents the collection of thousands of links that are sorted, resorted, categorized and placed into databases that interact with one another. The specialized search engine may permit a user to compare results from multiple databases and automatically submit their search query to many popular searchable databases and web sites from a central web page, without having to individually visit each site. In one example, the specialized search engine is a medical search engine. | 11-03-2011 |
20110276575 | Query Generation for a Capture System - A document accessible over a network can be registered. A registered document, and the content contained therein, is not transmitted undetected over and off of the network. In one embodiment, the invention includes a manager agent to maintain signatures of registered documents and a match agent to detect the unauthorized transmission of the content of registered documents. | 11-10-2011 |
20110282880 | METHODS, APPARATUS, AND ARTICLES OF MANUFACTURE TO RANK WEB SITE INFLUENCE - Methods, apparatus, and articles of manufacture to rank a web site are disclosed herein. An example method to rank the influence of a web site involves receiving a topic term, filtering out a post that is not relevant to the topic term from a first plurality of posts to a web site, determining a number of inbound links to the web site based on the topic term, determining a time between the inbound links to the web site based on the topic term, and generating a rank of the web site based on the number of inbound links and the time between the inbound links. | 11-17-2011 |
20110282881 | METHODS AND SYSTEMS FOR DETERMINING CANDIDATES FOR A CUSTOM INDEX IN A MULTI-TENANT DATABASE ENVIRONMENT - Methods and systems are described for determining candidates for a custom index in a multi-tenant database environment. In one embodiment, a method includes, capturing a query that is directed to a multi-tenant database, determining whether the captured query is a candidate for an additional filter, determining operators used by the captured query if the query is a candidate, determining data types of the database used by the captured query if the query is a candidate, determining whether there is a current filter for the operator and data types used by the captured query if the query is a candidate, selecting the captured query based on the determined operators, data types, and the determined current filters, and generating a custom index for the selected query. | 11-17-2011 |
20110282882 | INTELLIGENT CONTAINER INDEX AND SEARCH - Aspects are introduced for indexing contents of a shell namespace extension. A plurality of files are indexed, and it is encountered that one of the files contains at least one file reference. The file is then accessed as a shell namespace extension so that the file reference can be indexed. | 11-17-2011 |
20110282883 | INDEXING SERVER AND METHOD THEREFOR - An indexing server of a P2P network and a method therefor are provided. The indexing server comprises: a metadata storage unit, which stores one or more entries, each of which is associated with a data file and includes a plurality of information items each indicating a node offering the data file and a location of the node; and a node information managing unit, which monitors the metadata storage unit to identify an entry stored in the metadata storage unit in which the number of information items exceeds a threshold, and transfers a portion of the information items included in the identified entry to another server, the transferred portion including as many as possible such information items that indicate nodes whose locations are close to each other. | 11-17-2011 |
20110282884 | Method and Device for Recording Media - A method and a device for recording media are provided. The method includes performing a media recording process. During the media recording process, periodically media index information are generated according to currently written media chunks. The generated media index information is saved. | 11-17-2011 |
20110289091 | Methods and Systems for Providing Multiple Column Custom Indexes In A Multi-Tenant Database Environment - In accordance with embodiments, there are provided methods and systems for providing multiple column custom indexes in a multi-tenant database environment. A method embodiment provides defining a multi-tenant data structure having a plurality of data columns data fields and a plurality of rows for each of multiple tenants, each row including a data column for a tenant identifier, defining a first data field for a tenant, the first data field having a first data type, defining a second data field for the tenant, the second field having a second data type, and defining an index table including a tenant identifier for the tenant, a copy of data from the first data field and the second data field, and a key to the corresponding rows of the multi-tenant data structure. | 11-24-2011 |
20110289092 | METHOD AND SYSTEM TO MONITOR A DIVERSE HETEROGENEOUS APPLICATION ENVIRONMENT - In various embodiments, a system and related method for organizing transactional data from a diverse and heterogeneous application environment is disclosed. In an example embodiment, a system includes a file system and one or more daemon indexers in electrical communication with the file system. The file system is arranged as a non-relational and serverless file system to allow for cost-effectiveness with ready scalability. The file system is to receive, in substantially real-time, unsorted transactional data from a publishing module. The one or more daemon indexers are arranged to receive the unsorted transactional data from the file system, organize the unsorted transactional data by operational characteristics, and store the organized transactional data on the file system. | 11-24-2011 |
20110295858 | METHOD AND APPARATUS FOR SEARCHING NUCLEIC ACID SEQUENCE - A method and apparatus for searching compressed nucleic acid sequences are disclosed. In the method, a reference sequence is compared with a subject sequence to be encoded, the subject sequence is compressed, an index is created with respect to the reference sequence and the compressed subject sequence, a position corresponding to a query is searched for in the compressed subject sequence using the index, a character found at the position within the compressed sequence is converted into a sequence, and the sequence is output as the response to the query. | 12-01-2011 |
20110307489 | METHOD AND APPARATUS FOR USER BASED SEARCH IN DISTRIBUTED INFORMATION SPACE - An approach is provided for enabling dynamic user based search within a distributed information space. A request for conducting a search over one or more information spaces is distributed to one or more autonomous agents. The autonomous agents process the request according to one or more functions specific to the one or more autonomous agents. Results are rendered to an interface of a user device in response to the search request. | 12-15-2011 |
20110307490 | Context Level Protocols And Interfaces - An invention for dissemination or retrieval of digital resources or online information via context layer or context-level protocols and interfaces is described. According to one embodiment, an interface or protocol that a computer uses to communicate with other computers is associated with a subject matter context. User-level contents or digital resources received across that interface or protocol are then associated with that subject matter context, and the computer may respond accordingly. For instance, a computer may associate a given network port with a subject matter context of shopping, and treat all digital resource requests received on that port as applying to only a shopping subject matter context. A web server may also listen on a network port associated with a subject matter context, thereby contextualizing the overall nature of the website that the web server hosts. | 12-15-2011 |
20110307491 | DIGITAL PHOTO ORGANIZING AND TAGGING METHOD - The digital photo organizing and tagging method includes software and a database that organizes digitized versions of dispersed collections of images such as photographs, pictures and other graphical objects from a variety of sources. The method facilitates describing, resizing, indexing, and sorting the different aspects contained in images. Features include documentation, e-mail, updating, version control, inclusion of video, embellishment, sound, animation, text to speech and, dynamic search. An Internet application facilitates annotation, organization, researching and sharing images and corresponding information collected at different times and places. Desktop/Internet applications link and relate images that share common elements. The same person found as a child in one image, as an adult in another, and referenced in a newspaper article is a relation example. The method collects the knowledge embedded in multiple images and produces well documented, organized, and clearly explained printable images, as well as dynamic albums of related images. | 12-15-2011 |
20110314026 | System and Method for Retrieving Information Using a Query Based Index - A search system generates a plurality of pseudo-documents for a set of documents. Each pseudo-document includes a representation one or more documents from the set of documents that would be retrieved in response to executing a respective basis query over the set of documents. The search system generates an index of the plurality of pseudo-documents. The index provides an indication, for a plurality of respective documents from the set of documents, of the pseudo-documents that include a representation of the respective document. After generating the index, the search system receives a document-query that includes a representation of a first document; and in response to receiving the document-query, the search system identifies one or more pseudo-documents that include representations of at least one document having a predefined relationship to the first document; and generates a result that is based on the first set of one or more pseudo-documents. | 12-22-2011 |
20110320458 | IDENTIFICATION OF NAME ENTITIES VIA SEARCH, DETERMINATION OF ALTERNATIVE SEARCHES, AND AUTOMATIC INTEGRATION OF DATA ACROSS A COMPUTER NETWORK FOR DYNAMIC PORTAL GENERATION - A dynamic portal generation system includes an indexing module that indexes structured and unstructured data in a database. The database includes information residing in associated standalone applications having documents from information sources, and a name-entity repository that includes name entities and their corresponding name-entity types. A search module searches the information residing in the indexed information to obtain a search result. A name-entity extraction module extracts a matching name-entity that corresponds to a name-entity in the name-entity repository. A portal generation module dynamically generates a portal triggered by the search query. The portal generation module aggregates the search result by a type or a source to form search result segments, renders data extracted from the search result segments in corresponding formats, integrates relevant actions corresponding to search result segments and displays actionable portlets that enable the user to perform additional operations on the information residing in associated standalone applications. | 12-29-2011 |
20120005213 | METHODS AND APPARATUS TO OBTAIN ANONYMOUS AUDIENCE MEASUREMENT DATA FROM NETWORK SERVER DATA FOR PARTICULAR DEMOGRAPHIC AND USAGE PROFILES - Methods and apparatus to obtain anonymous audience measurement data from network server data for particular demographic and usage profiles are disclosed. An example method to provide anonymous audience measurement data to an audience measurement entity disclosed herein comprises obtaining a demographic profile and a network usage profile, sampling customer data stored in a customer database not accessible by the audience measurement entity to generate a customer sample representative of the demographic profile and the network usage profile without customer intervention, the customer sample including customer identification information, processing log data obtained from a network server not accessible by the audience measurement entity using the customer identification information to determine audience measurement data associated with customers in the customer sample, and removing the customer identification information from the audience measurement data to prepare the anonymous audience measurement data for the audience measurement entity. | 01-05-2012 |
20120011126 | SYSTEMS AND METHODS FOR FACILITATING COMMUNICATION BETWEEN A PLURALITY OF BUILDING AUTOMATION SUBSYSTEMS - A computer-implemented system for facilitating communication between a plurality of building automation subsystems and a plurality of applications includes a first database storing a hierarchical model of a building automation system. The system further includes a second database storing information for the plurality of building automation subsystems. The second database stores each record with a projected semantic type string. The projected semantic type string includes multiple levels of type information and based on the hierarchical model stored in the first database. The system further includes a messaging engine configured to receive a message from one of the disparate building automation subsystems and for one of the plurality of applications. The messaging engine is configured to use the information in the second database to transform the message into a standard format, to append the projected semantic type string to the transformed message, and to provide the transformed message to one of the plurality of applications. | 01-12-2012 |
20120023105 | SportZeus Mobile Computing Apparatus and Worldwide Website Racing Event Database System - A worldwide mobile computing apparatus and database of race events indexed for mobile applications used on Smart phones, computers and computer peripherals for customers and enhanced with a proprietary news magazine, online and in print, and promoted with “AppWear” clothing and consumer goods. | 01-26-2012 |
20120030208 | Facilitating People Search in Video Surveillance - Techniques for facilitating a video surveillance search of a person are provided. The techniques include maintaining a database of one or more attributes of one or more people captured on one or more video cameras, indexing the one or more attributes in the database extracted from the one or more video cameras, and pruning one or more images captured from the one or more video cameras using the one or more attributes and one or more items of qualifying information to facilitate a video surveillance search of a person. | 02-02-2012 |
20120030209 | METADATA/CONTENT RE-ASSOCIATION SYSTEM AND METHOD - Digital content, such as video, audio, multimedia, and similar files may be associated with metadata that is in some manner descriptive of the content by means of a unique identifier generated based on the content or a portion of the content. The two may be packaged in a wrapper. If the content is separated from the metadata, such as by circulation in a non-compliant environment, the content may be recognized by a regeneration of the identifier. Based upon the re-created identifier, the metadata may be re-associated with the content. Other scenarios may include generation of compilations for files that are not included in a wrapper with content and metadata, content that becomes dissociated with metadata by transcoding, and so forth. | 02-02-2012 |
20120030210 | DISCERNING HUMAN INTENT BASED ON USER-GENERATED METADATA - Methods and systems for discerning human intent based on user-generated metadata are provided. Communications from a plurality of social networks are evaluated to determine whether they are associated with contexts indicative of user intent. When a social network communication is determined to be associated with such a context, a tag may be generated associating the social network communication with the indicated human intent. In addition, information regarding the social network communication may be exported to an application responsible for responsive action. | 02-02-2012 |
20120030211 | MESSAGE PROCESSING METHOD AND SYSTEM - A message processing method and system. The message processing method includes: acquiring messages and position information of the messages; clustering the messages according to the position information of the message to obtain message clusters; extracting addresses in contents of the messages in the message cluster; and building classifiers of the addresses based on the contents of the messages in the same message cluster. By sufficiently utilizing the position information of the related message, etc., the system can conveniently provide the message users with related accurate address information and can provide useful information for management decision. | 02-02-2012 |
20120030212 | Systems and Methods for Video Cache Indexing - Methods and systems for indexing content in a cache data structure within a content cache are disclosed. The methods and systems identify at least one characterization data associated with a received content request and input the at least one characterization data into a hash function to generate an index that is used for identifying an entry associated with the requested content in the cache data structure. The at least one characterization data may include a portion of content associated with the received content request, a portion of a Uniform Resource Locator (URL) associated with the received content request, an IP address associated with the received content request, the length of the content associated with the received content request, and one or more parameters parsed from the URL associated with the received content request. | 02-02-2012 |
20120030213 | METHODS AND SYSTEMS FOR MANAGING COMPOSITE DATA FILES - Systems and methods for managing data, such as metadata or non-metadata such as content. In one exemplary method, a composite document is received and it is determined whether the composite document contains at least one subdocument and if it does, the method captures metadata and/or content from the subdocument and stores the captured metadata and/or content for use in future searches (or an immediate search). The metadata and/or content from the composite document is typically combined together with information about the hierarchy of the subdocuments in the document. The type of information in metadata for one type of file differs from the type of information in metadata for another type of file. Other methods are described and data processing systems and machine readable media are also described. | 02-02-2012 |
20120041956 | SYSTEM AND METHOD FOR COMPUTER-ASSISTED MANUAL AND AUTOMATIC LOGGING OF TIME-BASED MEDIA - A customizable logging and content management system for indexing multimedia, including a synchronized timer object that provides a time reference upon request in connection with the media, and a logger object that logs predefined events that occur in the media by associating the events with respective time references from the timer object. A video server is provided that captures and digitally stores events logged by the logging application as media segments, and a search and retrieval engine is provided that enables the media segments to be located, retrieved and viewed based on the indexes. The system includes a graphical user interface generator that enables customized user interfaces and logging databases to be created from database tables for use in the logging application. | 02-16-2012 |
20120041957 | EFFICIENTLY INDEXING AND SEARCHING SIMILAR DATA - Techniques for efficiently indexing and searching similar data are described herein. According to one embodiment, in response to a query for one or more terms received from a client, a query index is accessed to retrieve a list of one or more super files. Each super file is associated with a group of similar files. Each super file includes terms and/or sequences of terms obtained from the associated group of similar files. Thereafter, the super files representing groups of similar files are presented to the client, where each of the super files includes at least one of the queried terms. Other methods and apparatuses are also described. | 02-16-2012 |
20120047145 | ATTRIBUTED SEMANTIC SEARCH - A method and system for performing a semantic search on structured data. An unstructured search query is received from a requestor. The query is evaluated within a computer to identify a best structured request based on the unstructured search query. The selected structured request is applied to a set of structured data. The result of the application of the structured request is then returned to the requestor. | 02-23-2012 |
20120059827 | Enterprise Data Duplication Identification - Systems, methods, and computer program products are provided for identifying duplicate data. In one exemplary embodiment, there is provided a method for identifying duplicate data. The method may include identifying one or more reference fields that include one or more data values. The method may include retrieving the one or more reference fields and one or more data values. The method may also include transforming the one or more reference fields into one or more reference fingerprint patterns. The method may also include identifying one or more target fields that include one or more target field values. The method may also include retrieving the one or more target fields. The method may also include transforming the one or more target field values into one or more target fingerprint patterns. The method may also include comparing the one or more reference fingerprint patterns with the one or more target fingerprint patterns. The method may further include determining an overlap between the one or more reference fingerprint patterns and the one or more target fingerprint patterns. | 03-08-2012 |
20120066227 | E-MAIL THREAD HIERARCHY DETECTION - A plurality of segments in an e-mail collection by parsing content of e-mails is generated. Corresponding segment signature for each segment is created and a signature index is populated using the generated segment signatures. After receiving a query e-mail, a plurality of query segments in the query e-mail is generated using content of the query e-mail and corresponding query segment signature for each query segment is generated. A query root segment is identified and corresponding query root segment signature is generated. A set of root segment signatures of the signature index is identified and the query root segment signature is compared with each root segment signature from the signature index. A subset of the signature index is identified, using a match between the root segment signature and the query root segment signature. An e-mail thread hierarchy is built using the identified subset of the signature index. | 03-15-2012 |
20120066228 | BASELINES OVER INDEXED, VERSIONED DATA - Provided are techniques for receiving a request to create a current baseline with a baseline identifier and selecting data elements that are to be included in the current baseline. For each of the data elements, for a version of the data element to be included in the current baseline, a reference to the version of the data element is added to the current baseline and a baseline index element for the version of the data element is maintained by one of creating a new baseline index element or updating an existing baseline index element. | 03-15-2012 |
20120072425 | CUSTOMER FOCUSED KEYWORD SEARCH IN AN ENTERPRISE - A method, system, and computer readable storage medium are provided for performing a centralized search to locate information having a common context in an enterprise. Such search can be provided by defining a logical object that groups customer profile and related objects such as contacts, leads, opportunities, notes, interactions, and the like for a search in a customer context. A customer identifier can serve as a key to link the related objects. A keyword search of a logical group is also provided that allows for a single unified search across customers and related objects, or to search within a single customer and related objects. One aspect of the logical group keyword search provides for refining the search to show a subset of objects and to use filtering of object attributes. Search results indicate the customer context by highlighting the customer name in the result. | 03-22-2012 |
20120072426 | SELF-SERVICE SOURCES FOR SECURE SEARCH - A flexible and extensible architecture allows for secure searching across an enterprise. Such an architecture can provide a simple Internet-like search experience to users searching secure content inside (and outside) the enterprise. The architecture allows for the crawling and searching of a variety of sources across an enterprise, regardless of whether any of these sources conform to a conventional user role model. The architecture further allows for security attributes to be submitted at query time, for example, in order to provide real-time secure access to enterprise resources. The user query also can be transformed to provide for dynamic querying that provides for a more current result list than can be obtained for static queries. | 03-22-2012 |
20120072427 | EFFECTIVE PRODUCT RECOMMENDATION USING THE REAL-TIME WEB - A method for generating product recommendations comprises analyzing a database of messages, comprising a set of messages posted by users of a micro-blogging service to generate a user index and a product index. The user index comprises for each of a plurality of users of the system, a ranked set of terms included by the user in their posted messages. The product index comprises for each product which is to be potentially recommended, a ranked set of terms derived from messages posted by users and referencing the product. Responsive to a query identifying a user, the user index for the user is compared to the product indices to return a limited set of product identifiers corresponding to product indices most similar to the user index. The set of product identifiers are provided as recommendations to a service provider. | 03-22-2012 |
20120078914 | SEARCHABLE SYMMETRIC ENCRYPTION WITH DYNAMIC UPDATING - A Searchable Symmetric Encryption (SSE) mechanism is described which allows efficient dynamic updating of encrypted index information. The encrypted index information includes pointer information that is encrypted using a malleable encryption scheme. The SSE mechanism updates the encrypted index information by modifying at least one instance of the pointer information without decrypting the pointer information, and thereby without revealing the nature of the changes being made. In one implementation, the SSE mechanism includes a main indexing structure and a deletion indexing structure. An updating operation involves patching applied to both the main indexing structure and deletion indexing structure. | 03-29-2012 |
20120084291 | APPLYING SEARCH QUERIES TO CONTENT SETS - Queries applied to content sets (e.g., files in a filesystem) often produce search results including many content items having identifiers that match the keywords of the query. However, many search techniques do not account for the relevance of the matching, e.g., whether the match is predictably relevant to the user, or whether the content item only tangentially matches the query. The techniques presented herein involve indexing the content items in a content index according to various identifiers having an identifier weight indicating the predicted relevance if a token of a query matches the identifier. Candidate content items may then be presented as search results sorted by the aggregated identifier weights of the matching identifiers, thereby promoting highly relevant content items and demoting incidentally matching content items. Additional adjustments may be made (e.g., promoting content items that match a particularly infrequent token or that match a phrase in the query). | 04-05-2012 |
20120084292 | RECOMMENDING MOBILE DEVICE ACTIVITIES - Techniques for recommending mobile device activities, such as accessing mobile applications and/or mobile Web pages, are described. Some embodiments provide an Activity Recommendation System (“ARS”) configured to recommend relevant activities for a user to perform with a mobile device, based on context of the mobile device. In one embodiment, the ARS recommends mobile applications based content items (e.g., Web pages, images, videos) that are being currently accessed via the mobile device. The ARS may process information about mobile applications and content items to determine semantic information, such as entities and/or categories referenced or associated therewith. The ARS may then use the semantic information to determine mobile applications that have semantic information that is at least similar to that of a content item accessed via a mobile device. | 04-05-2012 |
20120084293 | PROVIDING ANSWERS TO QUESTIONS INCLUDING ASSEMBLING ANSWERS FROM MULTIPLE DOCUMENT SEGMENTS - A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, identifying a plurality of candidate answers to the query; and for at least one of these candidate answers, identifying at least one proof of the answer. This proof includes a series of premises, and a multitude of documents are identified that include references to the premises. A set of these documents is selected that include references to all of the premises. This set of documents is used to generate one or more scores for the one of the candidate answers. A defined procedure is applied to the candidate answers to determine a ranking for the answers, and this includes using the one or more scores for the at least one of the candidate answers in the defined procedure to determine the ranking for this one candidate answer. | 04-05-2012 |
20120084294 | METHOD AND APPARATUS FOR PROVIDING INDEXING SERVICE IN PORTABLE TERMINAL - Provided is a method and apparatus for providing an indexing service in a portable terminal. The method includes recognizing at least one data selected by a user from a first file provided through a text viewer of the portable terminal and extracting absolute locations of the recognized data in the first file, storing the extracted locations of the data and arranging the data in a predetermined order to create a first sub file related to the first file, and indexing the first file to allow a search in the first file based on particular data by displaying the locations stored for the respective data arranged in the created first sub file. The first sub file is further linked to data from other files according to the data selected by a user so that a user can determine location of a particular data. | 04-05-2012 |
20120084295 | METHOD AND SYSTEM FOR GENERATING AN INDEX OF SECURITIES - A method for generating an index of securities includes selecting index constituents and weighing the selected index constituents by a measure (“an intangible asset measure”) of quality and/or value of the intangible assets of constituent entities. The index constituents can be selected from a set of potential constituents based on an intangible asset measure of each constituent in the set, and the selected index constituents may then be weighted by a desired measure or methodology or weighted equally, or by an intangible asset measure. Another method for generating an index of securities includes combining groups of constituents, which are selected based on an intangible asset measure, to be index constituents and weighing the index constituents by a desired measure or methodology. | 04-05-2012 |
20120089610 | Structured Data In A Business Networking Feed - Disclosed are systems, apparatus, methods, and computer readable media for presenting updates to associated records to publish on an information feed in an on-demand database service environment. In one embodiment, data is received at a computing device configured to provide computing services via a network. The data may be separated into a plurality of data entries to define structured data. Each data entry may have a plurality of data fields. Each data field may be capable of being assigned a respective data value. The structured data may be stored in a database in an on-demand computing services environment. References to the structured data may be capable of being stored as feed items in one or more feed tables in a database. The feed items may be capable of being published in an information feed displayed on a display device. | 04-12-2012 |
20120096005 | System and method for remote source data verification - A remote source data verification system. The inventive system includes a remote server running a novel source data verification software application adapted to electronically acquire images of source documents, index the images and store the images in a source database in a predetermined order based on the indexing, select an image from the source database, interface with a clinical database, and simultaneously display data from the clinical database alongside the selected image. In a preferred embodiment, the system uses an electronic pen or other real time data capture device that simultaneously creates physical source documents and corresponding electronic recordings. The recordings are automatically transmitted to the remote server and processed to generate images of the original source documents. | 04-19-2012 |
20120096006 | BUILDING A LIBRARY OF SPECTRA FOR OPTICAL MONITORING - A method of controlling polishing includes storing a library having a plurality of reference spectra, polishing a substrate, measuring a sequence of spectra of light from the substrate during polishing, for each measured spectrum of the sequence of spectra, finding a best matching reference spectrum using a matching technique other than sum of squared differences to generate a sequence of best matching reference spectra, and determining at least one of a polishing endpoint or an adjustment for a polishing rate based on the sequence of best matching reference spectra. Finding a best matching reference spectrum may include performing a cross-correlation of the measured spectrum with each of two or more of the plurality of reference spectra from the library and selecting a reference spectrum with the greatest correlation to the measured spectrum as a best matching reference spectrum. | 04-19-2012 |
20120096007 | EMBEDDING CONTENT-BASED SEARCHABLE INDEXES IN MULTIMEDIA FILES - Content-based analysis is performed on multimedia content prior to encoding the multimedia content in the rendering chain of processing. A content-based index stream is generated based on the content-based analysis and the content-based index stream is embedded in the multimedia file during rendering. The content-based index stream can be used to generate a content-based searchable index when necessary. | 04-19-2012 |
20120102039 | METHODS AND SYSTEMS FOR PROVIDING IMPROVED ACCESS TO DATA AND MEASUREMENTS IN A MANAGEMENT SYSTEM - A method of managing device data related to a data center infrastructure includes generating, by a first server, a first portion of the device data in response to a change in a status of at least one of a plurality of data center infrastructure devices, storing, by a second server, the first portion of the device data in a database, generating, by the first server, a second portion of the device data in response to a request for data that is not contained in the database, the second portion of the device data being different than the first portion of the device data and being generated in a lightweight format, and displaying, in response to a user request, at least a portion of the device data using a user interface that is provided to the client computer by the second server. | 04-26-2012 |
20120102040 | SYSTEMS AND METHODS FOR MAPPING RECORDS IN A MANUFACTURER LINE, SERIES, MODEL HIERARCHY - A computer-implemented method, apparatus, and computer-readable media for mapping product records according to attributes in a standard schema, the standard schema mapping product records from a plurality of content providers according to a product hierarchy from each respective content provider, the method including: receiving product data from a plurality of feeds including feeds from one or more content providers; identifying a plurality of product records received from each of the plurality of feeds; identifying a taxonomy of products received in each of the plurality of feeds; and storing product records and corresponding product attributes in a standard schema in a data set for each feed according to the taxonomy identified for each respective feed. | 04-26-2012 |
20120102041 | GENETIC INFORMATION MANAGEMENT SYSTEM AND METHOD - A system and method for efficiently managing a vast amount of read data and genetic information obtained from sequencing data are provided. The system includes a parsing unit reading the read data and converting the read data into a table-embedded index, a table management unit classifying the table-embedded index into a master table which contains basic items about the genetic information and a slave table which contains additional items about the genetic information and managing the master table and the slave table, a search unit retrieving data from the master table and/or the slave table and outputting the retrieved data. | 04-26-2012 |
20120109967 | METHODS FOR PREFIX INDEXING - According to one aspect of the invention, in response to one or more terms to be indexed, each of the terms is indexed in a regular index. In addition, for each of the terms having multiple characters, at least one prefix portion of the term is indexed in a prefix index, where the regular index is used for regular searches and the prefix index is used for prefix searches without having to combine a plurality of postings lists of the regular index at the point in time. | 05-03-2012 |
20120109968 | INFORMATION PROCESSING APPARATUS, INFORMATION CREATING APPARATUS, INFORMATION PROCESSING METHOD, INFORMATION CREATING METHOD, INFORMATION PROCESSING POROGRAM, INFORMATION CREATINGN PROGRAM, AND RECORDING MEDIUM - An information processing apparatus includes a point database of position information and address information (text data of an address without a predetermined area name) for a point and with which a point search process is executed; a map database that includes text data of an area name for map display and with which a map display process is executed; and a control unit that, when a process is executed that uses text data of an address of a search point retrieved by the point search process, acquires from the point database, the position information and address information of the search point, acquires based on the position information of the search point and from the map database, the text data of the area name in the address of the search point, combines the acquired address information and acquired text data, generating the text data of the address of the search point. | 05-03-2012 |
20120109969 | METHOD FOR DATA ANALYSIS AND DIGITAL FORENSICS AND SYSTEM USING THE SAME - A system and method for data analysis and digital forensics is provided. The system for data analysis and digital forensics may include: an online data forensic server for collecting and analyzing usage history information from an object device, which is subject to data collection, downloading and collecting data on the Internet based on the usage history information, requesting issuance of a timestamp token for the collected data, and receiving the issued timestamp token; a timestamp token issuing server for issuing the timestamp token for the collected data in response to the request for issuance of the timestamp token and proving the issued timestamp token to the online data forensic server; and a storage device for storing the collected data. | 05-03-2012 |
20120117075 | SYSTEMS AND METHODS FOR GROUPED REQUEST EXECUTION - A server architecture performs concurrent information processing in a server system on a multi-core processor environment. The architecture supports simultaneous processing requests comprising multiple classes of queries and/or executing transactions in an application server and/or database server. Requests, which are made in an asynchronous manner, are structured with hash values to enable similar requests to be grouped together. The similar requests are grouped into a group session. All of the requests in the group session are executed at the same time. Accordingly, similar database inserts can be grouped and executed as a single request. The architecture minimizes thread-switching overhead by exploiting inherent parallelism in the inflowing requests. The threads and requests are de-coupled and hence any lock request only makes the execution threads take up another request instead of waiting until the lock is acquired. As such, the threads never go into sleep/wait mode and system resources are utilized more efficiently. | 05-10-2012 |
20120117076 | Suffix array candidate selection and index data structure - A method and system for identifying a candidate subset of a data set comprises comparing suffixes of query field values to data field values of records in the data set. Sufficiently similar records are included in the candidate subset. Query field value suffixes may range in length from the query field value itself down to a minimum suffix length. The longest suffix may be processed first, and then successively shorter suffixes may be processed until a satisfactory number of candidates are identified. Entries in an index data structure derived from the data set may associate various suffixes found in the data set with individual records. The data structure entries may include record keys identifying records with data field values identical to the suffix and may also include suffix pointers identifying related data structure entries with suffixes similar to the entry's suffix. | 05-10-2012 |
20120117077 | Annotation Framework - A fact repository contains facts having attributes and values and further having associated annotations, which are used, among other things, to vet facts in the repository and which can be returned in response to a query. | 05-10-2012 |
20120117078 | Method and System for Indexing and Searching Timed Media Information Based Upon Relevant Intervals - A method and system for indexing, searching, and retrieving information from timed media files based upon relevance intervals. The method and system for indexing, searching, and retrieving this information is based upon relevance intervals so that a portion of a timed media file is returned, which is selected specifically to be relevant to the given information representations, thereby eliminating the need for a manual determination of the relevance and avoiding missing relevant portions. The timed media includes streaming audio, streaming video, timed HTML, animations such as vector-based graphics, slide shows, other timed media, and combinations thereof | 05-10-2012 |
20120124053 | Annotation Framework - A fact repository contains facts having attributes and values and further having associated annotations, which are used, among other things, to vet facts in the repository and which can be returned in response to a query. | 05-17-2012 |
20120124054 | RECORDING MEDIUM, RECORDING DEVICE, RECORDING METHOD, DATA SEARCH DEVICE, DATA SEARCH METHOD, AND DATA GENERATING DEVICE - A data search device includes a playback unit playing back subjective data representing a subjectivity for predetermined content data from a recording medium on which the subjective data is recorded in association with the content data; a receiving unit receiving subjective data input by a user; and a searching unit searching for the content data in accordance with the subjective data played back by the playback unit and the subjective data received by the receiving unit. | 05-17-2012 |
20120124055 | SEARCHING FOR DATA OBJECTS - Method and system to search for data objects are provided. The method comprises generating an attribute index and a content index. The attribute index comprises entries associated with respective access control information for a set of data objects and generating. The content index comprises words from the set of data objects. A search query directed at the set of data objects is processed by searching the content index, searching the attribute index, and obtaining a search result based on the results of searching of the content index and the attribute index. | 05-17-2012 |
20120131008 | INDENTIFYING REFERRING EXPRESSIONS FOR CONCEPTS - Referring expressions are identified for concepts by analyzing search query and result selection information. An ontology of concepts is identified, and at least one document is associated with each concept. The document associated with a concept is representative of the concept. Search query information from a search engine is analyzed to identify search queries that resulted in user selections of documents associated with the concepts. Referring expressions that refer to the concepts are identified based on the search queries that resulted in user selections of documents corresponding with the concepts. After identifying referring expressions for concepts, search queries may be mapped to referring expressions to identify concepts to which the search queries pertain, and search result pages may be generated based on knowledge of the concepts. | 05-24-2012 |
20120131009 | ENHANCING PERSONAL DATA SEARCH WITH INFORMATION FROM SOCIAL NETWORKS - The personal data search technique uses data input by users for a given user's personal data on a social networking site to enrich the given user's personal data. The technique annotates personal data stored on a personal computing device or in a computing cloud with data obtained from social networking sites (for example, tags, comments, likes/dislikes and so forth) provided by friends/other users in the given user's social network or networks. Such annotations can later are used by search engine to enhance the search functionality and/or to improve the ranking of search results. Since the data is entered by actual human users it is very accurate and since the data is already readily available on social networks the cost to obtain it is very inexpensive. | 05-24-2012 |
20120136868 | EFFICIENTLY REGISTERING A RELATIONAL SCHEMA - A method, device, and non-transitory computer-readable storage medium are provided for efficiently registering a relational schema. In co-compilation and data guide approaches, a subset of entities from schema descriptions are selected for physical registration, and other entities from the schema descriptions are not physically registered. In the co-compilation approach, a first schema description references a second schema description, and the subset includes a set of entities from the second schema description that are used by the first schema description. In the data guide approach, the subset includes entities that are used by a set of structured documents. In a pay-as-you-go approach, schema registration includes logically registering entities without creating relational database structures corresponding to the entities. A database server may execute database commands that reference the logically registered entities. A request to store data for the entities may be executed by creating relational database structures to store the data. | 05-31-2012 |
20120136869 | System and Method of Processing Information Stored in Databases - In one embodiment the present invention includes a computer implemented method of processing information stored in a data structure. The method includes continuously generating individual records and storing the individual records in a database as a plurality of records. The method further includes selecting, according to a time period, a set of the plurality of records that qualify according to the time period. The method further includes moving the set to a table in the database, wherein the table excludes the plurality of records that are not in the set. In this manner, the records may be stored in a number of parallel tables, and this process may be managed at the application level instead of at the database level. Using parallel tables instead of a single large table may offer performance improvements, for example regarding deleting (or archiving) data. | 05-31-2012 |
20120136870 | SYSTEMS AND METHODS FOR TWO STREAM INDEXING OF AUDIO CONTENT - Systems and methods provide for indexing audio content by fusing the indexes derived from a keyword stream and a large vocabulary stream search. For example, systems and methods provide for two stream searching of Spoken Web VoiceSites, wherein metadata is extracted from the VoiceSite and is used to determine a set of keywords for high precision search while a traditional standard vocabulary set is used to perform a high results, low precision search. The results of the keyword search and the standard vocabulary search are fused together to form a comprehensive, ranked list of results. | 05-31-2012 |
20120136871 | SKIP LIST GENERATION - A method of generating a skip list is disclosed. The skip list comprises a data structure for referencing a plurality of ordered nodes, the data structure having a plurality of linked lists, a total number of the plurality of linked lists being constrained by an available memory of a memory system, the method comprising the steps of: inserting a first node of the plurality of nodes into a predetermined linked list of the plurality of linked lists of the data structure of the skip list in the available memory; promoting the first node to one or more other linked lists based on a first set of criteria; and generating the skip list by inserting at least a second node of the plurality of nodes into the data structure and promoting the second node based on a second set of criteria, said second set of criteria being different from the first set of criteria. | 05-31-2012 |
20120136872 | Fast Database Matching - A method of improving the speed with which a sample data record can be matched against records in a database comprises defining a list of possible key values ( | 05-31-2012 |
20120136873 | REAL-TIME PROCESS FOR DEFINING, PROCESSING AND DELIVERING A HIGHLY CUSTOMIZED CONTACT LIST OVER A NETWORK - A system and method of generating a contact list based on a geographic definition and, in certain embodiments, other screening criteria. In an embodiment, a geographic definition, specifying a geographic area, is received. The geographic definition is translated into at least one linkage key. A contact list, comprising a plurality of records associated with the geographic area, is then generated from one or more databases using the at least one linkage key as an index into the one or more databases. | 05-31-2012 |
20120143873 | METHOD AND APPARATUS FOR UPDATING A PARTITIONED INDEX - Techniques for enhanced updating of a partitioned index include first data that indicates a plurality of fields for each entry in an index for a data store. A current number of partitions for the index is determined. Second data that indicates at least one value for at least one field of at least a first entry in the index is received. A next number of partitions for the index based on the second data is determined automatically. | 06-07-2012 |
20120150861 | HIGHLIGHTING KNOWN ANSWERS IN SEARCH RESULTS - Answers to search queries are identified and highlighted in search result snippets for documents containing the answers. When a user submits a search query to a search engine, the search engine may query a search index to identify a set of documents as relevant search results for the search query. Additionally, an answer data store may be queried to determine if there is a short answer to the search query. If an answer is found, the answer may be used when generating search result snippets. As such, the search result snippets for documents containing the answer may include the answer. The answer may also be highlighted in the search result snippets. | 06-14-2012 |
20120150862 | SYSTEM AND METHOD FOR AUGMENTING AN INDEX ENTRY WITH RELATED WORDS IN A DOCUMENT AND SEARCHING AN INDEX FOR RELATED KEYWORDS - A method for enhancing a search of a set of documents is described. The method allows a user to present a word of interest. The word is then matched to related words in a larger corpus of words and the related words are matched against an index of the document to identify words that appear in both the matched words and the document index. The word selected by the user may be taken from a previously generated index of the document or the word may be presented by the user based on a topic of interest. | 06-14-2012 |
20120150863 | BOOKMARKING OF MEETING CONTEXT - Architecture that facilitates the ability to trigger the capture and storing of meeting state (or context) by way of a single user interaction (a “one-click” operation), referred to herein as a bookmark operation, and then to store and access the state for subsequent use. The state is captured relative to a point of reference, such as time, user, keywords, and reference to a document, for example. Thus, all state elements such as meeting activities, participants, and content (e.g., audio, video, images, text, documents, etc.). The bookmark assigned to the state at a particular reference can be selected to rehydrate all the state elements captured and associated with that bookmark (e.g., getting back to the point in the meeting to perceive a relevant portion of a document, part of the meeting video, or other recorded feed), as well as all other allowed state elements. | 06-14-2012 |
20120150864 | TEXT INDEXING FOR UPDATEABLE TOKENIZED TEXT - Systems, methods, and other embodiments associated with text indexing for updateable tokenized text are described. One example method includes receiving revised tokenized text intended to replace existing tokenized text in an indexed document. Token location information corresponding to the revised tokenized text is stored in an allocated free space portion of a text index posting. | 06-14-2012 |
20120150865 | Techniques For Efficient Queries On A File System-Like Repository - Techniques are provided to achieve performance improvements for path-based access to hierarchical data and for utilizing an extensible indexing framework. Extensible operators may be optimized by evaluating the operators using functions native to the database system instead of using functions defined through an extensible indexing mechanism. Furthermore, the database system may resolve a resource path of a query when the query is compiled, and may store the mapping of the resource path to one or more resources in a cursor for the query execution plan to avoid resolving the resource path each time the cursor is used to run the query. The cursor may be made dependent on the one or more resources to which the resource path refers at compile time. This takes advantage of the observation that the mapping of the resource path to particular resources rarely changes between evaluations of a particular query. | 06-14-2012 |
20120150866 | SYSTEMS AND METHODS FOR DOCUMENT SEARCHING - Systems and methods are provided for document searching. In one implementation, a computer-implemented method provides keyword searching. The method may receive a plurality of noisy keywords for a document collection. A server may generate tokens for a plurality of keywords in the document collection and merge the tokens to create an index. A search query may be received. The search query may include at least one search phrase. For the at least one search phrase, an indication may be received from a user specifying to perform one of a noisy phrase search or a noiseless phrase search. The method may search the index for the at least one search phrase based on the indication received from the user. | 06-14-2012 |
20120158731 | DERIVING DOCUMENT SIMILARITY INDICES - The present invention extends to methods, systems, and computer program products for deriving document similarity indices. Embodiments of the invention include scalable and efficient mechanisms for deriving and updating a document similarity index for a plurality of documents. The number of maintained similarities can be controlled to conserve CPU and storage resources. | 06-21-2012 |
20120158732 | BUSINESS APPLICATION PUBLICATION - A data marketplace infrastructure provides a crowd sourcing solution to development, discovery and publication of decision applications. Applications can be submitted from a user to a data warehouse in association with a data feed. One or more discovery properties are determined with regard to each application. The applications are made available to other client systems in association with the data feed. A relevant data feed and a relevant application can be identified based on satisfaction of a discovery request by the one or more determined discovery properties of the application. The application can be selected and downloaded to the user for evaluation and customization. The customized application can then be submitted to the data warehouse for publication with the other applications associated with the data feed. | 06-21-2012 |
20120158733 | SYSTEM AND METHOD OF MANAGING COMMUNITY-BASED AND CONTENT-BASED INFORMATION NETWORKS - A system for storing, managing, and accessing information on a network by providing an interface between a social network and a content network includes an applications platform. The system provides messaging and social networking facility incorporating enhanced instant messaging, file synchronization, network presence, interactive chat capabilities, text messaging, voice and video messaging, blogging, and email. The system includes a viewer, an indexing facility, and a storage facility. The viewer enables users to traverse content and provides services based upon context of time, place, structure, node, and observed user behavior. The viewer provides a means for users to interact with information on the network and services to manipulate information and transact activities. The indexing facility manages the structure of the network and tracks attributes and controlled vocabularies. The indexing facility supports navigation across the structure and resolves the logical index to a physical storage location. The storage facility provides physical persistence for an information component. | 06-21-2012 |
20120158734 | DATA MANAGEMENT SYSTEM AND METHOD - A data management apparatus includes an index generation unit configured to subdivide an entire interval of data into bucket intervals, allocate indices for the respective bucket intervals, transform the bucket intervals having the allocated indices into bucket intervals of specific lengths, and generate bucket-based indices for pieces of data included in the bucket intervals of the specific lengths. The data management apparatus further includes a data management unit configured to transmit the encrypted data and the bucket-based indices to a server-side data management apparatus in order to store the encrypted data, transmit a user query to the server-side data management apparatus in order to search for a desired encrypted data, and decrypt encrypted data corresponding to the user query from the server-side data management apparatus. The user query includes the index of first bucket interval and the index of second bucket interval neighboring to the first bucket interval. | 06-21-2012 |
20120158735 | Method and System for Aggregating Reviews and Searching within Reviews for a Product - The embodiments disclosed herein include new, more efficient ways to collect product reviews from the Internet, aggregate reviews for the same product, and provide an aggregated review to end users in a searchable format. One aspect of the invention is a graphical user interface on a computer that includes a plurality of portions of reviews for a product and a search input area for entering search terms to search for reviews of the product that contain the search terms. | 06-21-2012 |
20120166444 | CO-MAP COMMUNICATION OPERATOR - A high level programming language provides a co-map communication operator that maps an input indexable type to an output indexable type according to a function. The function maps an index space corresponding to the output indexable type to an index space corresponding to the input indexable type. By doing so, the co-map communication operator lifts a function on an index space to a function on an indexable type to allow composability with other communication operators. | 06-28-2012 |
20120173535 | Allowing External Access to Private Information of a Local Data Store - Techniques provided for allowing external access by other users to private information that is maintained on local storage of a computer and owned by an information owner. The private information is uploaded from the local storage to an externally accessible information source that is accessible by the other users. A request from a user to access the private information is received by the owner, who determines whether to allow access the private information. If so, the owner sends a private information sharing authorization to a collaboration orchestrator, which retrieves the private information from the external source and provides the private information to the user. The owner optionally requests to collaborate with the user before deciding whether to allow access to the private information. One or both of the identities of the owner and user can remain anonymous until agreeing on revealing identities. A system and program product is also provided. | 07-05-2012 |
20120173536 | Index of Locally Recorded Content - A method to index recorded content at a media device includes extracting, at a remote service provider, event index data from an event being recorded at a media device and associating the event index data with locator code data of the event. The method further includes storing, at the remote service provider, the extracted event index data and the associated locator code data; searching the extracted event index data for a plurality of segments associated with the event, the search being associated with a search request; determining index display data for a presentation of the plurality of segments based on the search request; and transmitting, to the media device, the locator code data associated with the plurality of segments, and the index display data. | 07-05-2012 |
20120173537 | ONE NUMBER, INTELLIGENT CALL PROCESSING SYSTEM - Systems and methods for retrieving household data based on an origination identifier. In an embodiment, an origination identifier of a communication is captured. The origination identifier is indexed into a master table comprising a plurality of records. Each of the records comprises an association between an origination identifier and a universal database linkage key, and each universal database linkage key comprises an index into one or more databases. A universal database linkage key associated with the captured origination identifier is retrieved and indexed into one or more databases. Household data associated with the captured origination identifier is retrieved from the one or more databases and communicated to at least one recipient. | 07-05-2012 |
20120173538 | STRUCTURAL ANALYSIS OF PROTEINS - The present invention is directed to systems and methods for fast and accurate structural representation and comparison of proteins. Specifically, the present invention provides a method for retrieval of a candidate set of near structural neighbors or structurally similar proteins of a query protein. The method is based on a representation of a protein structure as a “bag of words”—a collection of small disjoint backbone protein fragments. The representation allows quick comparison procedures of the query protein structure to a large number of known protein structures obtained for example, from a repository or database of proteins. | 07-05-2012 |
20120173539 | METHODS AND SYSTEMS FOR MANAGING DATA - Methods and systems for managing an index database. In one exemplary method, an index database is stored on a machine readable volume with an operating system and the files which have been indexed, and then the volume is, after the storing, made available for distribution to licensees or customers. In this manner, the volume will include a previously created index database, allowing a user to begin use of the index database without having to perform an indexing operation. | 07-05-2012 |
20120179687 | SYSTEM AND METHOD FOR DATA MINING AND SECURITY POLICY MANAGEMENT - A system and method to generate and maintain controlled growth DAG are described. The controlled growth DAG conveys information about objects captured by a capture system. | 07-12-2012 |
20120179688 | SYSTEM AND METHOD FOR BROKERING INFORMATION BETWEEN A PLURALITY OF COMMERCIALLY DISTINCT CLIENTS - A method, apparatus, article of manufacture, and a memory structure for brokering information between a plurality of clients using identifiers defining a plurality of data constructs is disclosed. An exemplary method comprises accepting a new data construct from an authoring entity, assigning a globally unique identifier to the new data construct, storing the new data construct and the assigned globally unique identifier in a database, and brokering between the authoring entity and a second entity commercially distinct from the authoring entity to provide the second entity access to the new data construct by reference to the assigned globally unique identifier of the new data construct or to provide the authoring entity access to an at least one of a plurality of pre-existing data constructs for use with the new data construct by reference to a globally unique identifier of the existing data construct. | 07-12-2012 |
20120185486 | SYSTEMS AND METHODS FOR UTILIZING AND SEARCHING SOCIAL NETWORK INFORMATION - Methods and systems for indexing, storing, recalling and displaying social network user profiles, event calendar postings and user feed postings are described. A single, discrete set of keywords, can be utilized and assigned to both user profiles and postings and can operate as a method of indexing. The assignment of these keywords may allow users to control the display of calendar and feed content by matching assigned posting keywords to assigned profile keywords via a matching algorithm. Matched event-related postings may also be automatically displayed in a user's calendar. Searches of profiles and postings may also be performed by constructing queries using the same set of discrete keywords attached to profiles and postings. Users may have the ability to control the display of specific profile information and postings via privacy settings, which utilize unique methods of tracking relationship segmentation and social distance. | 07-19-2012 |
20120191721 | Method and System for Efficiently Locating in a Database a User Profile in an IMS Network - Method and system for processing a request associated with a user from a requesting node to an answering node in a telecommunications network. A repository is associated with the answering node, the repository including a data structure including a plurality of user profiles associated with a plurality of users. In the answering node a user profile of the plurality of user profiles is associated with the user. The method comprising the steps of assigning a unique user index to each user profile in the data structure, wherein the user index is representative of the location of the user profile within the data structure, communicating at least one user index to the requesting node, incorporating the user index in the request by the requesting node, transmitting the request from the requesting node to the answering node, and retrieving the user profile associated with the user associated with the request by the answering node on the basis of the user index. | 07-26-2012 |
20120191722 | ATTRIBUTES OF CAPTURED OBJECTS IN A CAPTURE SYSTEM - Regular expressions used for searching for patterns in captured objects can be grouped into attributes. Such attributes can be associated with captured objects using tags stored in a database. In one embodiment, the present invention includes capturing an object being transmitted over a network, and determining that a regular expression appears in the object, the regular expression belonging to a group of one or more regular expressions associated with an attribute. If a regular expression associated with the attribute is found in the object, then an attribute field of a tag containing metadata related to the captured object is set to indicate the presence of the attribute in the captured object. The presence of the attribute in the captured object can now be determined from the tag, which can be stored in a database. | 07-26-2012 |
20120191723 | QUERY SYSTEM FOR A HYBRID VOXEL AND FEATURE DATABASE - A hybrid database can receive a hybrid query for an object having a real world analog. A feature of a set of feature records can be determined that corresponds to the object. Feature-level attribute values of the feature can be extracted from the feature records. A set of uniquely indexed voxels can be determined. Each voxel can be a volumetric unit of the volumetric storage space that represents the volume of real-world geography. Voxel-level semantic values can be extracted from the set of uniquely indexed voxels from voxel records. Extracted voxel-level semantic values and extracted feature-level attribute values can be combined to generate a hybrid result. The hybrid result can be conveyed to a requestor from which the hybrid query was received. | 07-26-2012 |
20120197898 | Indexing Sensor Data - In particular embodiments, a method includes, from an indexer in a sensor network, accessing a set of sensor data that includes sensor data aggregated together from sensors in the sensor network, one or more time stamps for the sensor data, and metadata for the sensor data identifying one or more pre-determined attributes of the sensor data. The method includes, at the indexer, generating an index of the set of sensor data according to a multi-dimensional array configured for querying of the set of sensor data along with a plurality of other sets of sensor data. One or more first ones of the dimensions of the multi-dimensional array include time, and one or more second ones of the dimensions of the multi-dimensional array include one or more of the pre-determined sensor-data attributes. The method includes, from the indexer, communicating the index of the set of sensor data for use in responding to one or more queries of the set of sensor data along with a plurality of other sets of sensor data. | 08-02-2012 |
20120197899 | METHOD AND APPARATUS FOR RECOMMENDING A SHORT MESSAGE RECIPIENT - A method and apparatus for recommending a short message recipient. The method includes parsing history short messages of a user to generate data associated with contacts, constructing a semantic association database by using the data, identifying a critical object in a new short message text of the user, analyzing an association between the critical object and the contacts by using the semantic association database, and recommending a short message recipient to the user according to a strength of association. | 08-02-2012 |
20120203786 | SYSTEM, METHOD AND COMPUTER PROGRAM PRODUCT FOR PROVIDING CONTENT BASED UPON A REPRESENTATION OF THE SAME - A system for providing content includes a source and a client. The source is capable of generating a representation of the content, the content being hierarchically-structured and including at least one item having an associated type. The source is capable of generating a representation including an identity table and at least one type table, each item of content being included in the identity table, and included in a type table based upon the type of the item. Thereafter, the source can be capable of providing the representation of the content. The client is capable of receiving the representation of the content, and accessing at least a portion of the content based upon the representation. The client can also be capable of receiving a query for at least one item of content, searching the representation for the item(s), and accessing at least a portion of the content including the item(s). | 08-09-2012 |
20120203787 | INFORMATION MANAGEMENT APPARATUS, DATA PROCESSING METHOD AND COMPUTER PROGRAM - An information management apparatus includes: a data receiving section, a collected data storage section, an aggregating section, a feature extracting section, a determining section, and an evaluation data storage section. The data receiving section periodically receives action data showing an action of a user. The collected data storage section stores the action data received by the data receiving section every user. The aggregating section generates a data set every user by aggregating action data containing an approximate content, of the action data stored in the collected data storage section. The feature extracting section extracts an index and a reference showing privacy confidentiality of the data set as a feature to incorporate in the data set. The determining section determines whether or not the privacy confidentiality of the feature of the data set is equal to or higher than a predetermined level. The evaluation data storage section stores the data set which passed the determining section. | 08-09-2012 |
20120209853 | METHODS AND SYSTEMS TO EFFICIENTLY FIND SIMILAR AND NEAR-DUPLICATE EMAILS AND FILES - A set of trigrams can be generated for each document in a plurality of documents processed by an e-discovery system. Each trigram in the set of trigrams for a given document is a sequence of three terms in the given document. A set of trigrams for each similar document is then determined based on the set of trigrams for the original document. To facilitate identification of the similar documents, a full text index is then generated for the plurality of documents and the set of trigrams for each document are indexed into the full text index, as individual terms. Queries can be generated into the full text index based on trigrams of a document to determine other similar or near-duplicate documents. After a set of potentially similar documents are identified, a separate distance criteria can be applied to evaluate the level of similarity between the two documents in an efficient way. | 08-16-2012 |
20120209854 | Mass Analysis Data Processing Method and Mass Spectrometer Using the Same - Provided is a method for quickly obtaining an intensity value at a desired m/z value in a compressed data obtained by run-length encoding of a mass analysis data. An index is created by pairing either the start position of a section where zero-intensity consecutively occurs two or more times in an array of an original spectrum data, or the start position of a sequence of data having significant intensity values in an array of the original spectrum data, with the corresponding position in an array of a compressed data. This index is stored separate from the compressed data. The creation of the index does not affect the array of the compressed data. Therefore, the data can be decompressed even by a data processing system that does not use the index. The index helps to quickly locate a compressed data corresponding to the desired m/z and obtain the necessary intensity value. | 08-16-2012 |
20120215785 | Composite Term Index for Graph Data - An indexing system for graph data. In particular implementations, the indexing system provides for denormalization and replica index functionality to improve query performance. | 08-23-2012 |
20120215786 | Server-Side Search Of Email Attachments - An email server receives an email sent to an email account and including one or more attachments. The one or more attachments are extracted from the email into one or more temporary storage elements. Content from the temporary storage elements is indexed. The indexed content is stored along with an identifier of the email in an index corresponding to the email account. The one or more temporary storage elements is disposed of after the indexing. | 08-23-2012 |
20120215787 | System and Method for Analyzing Data Records - A method and system for analyzing data records includes allocating groups of records to respective processes of a first plurality of processes executing in parallel. In each respective process of the first plurality of processes, for each record in the group of records allocated to the respective process, a query is applied to the record so as to produce zero or more values. Zero or more emit operators are applied to each of the zero or more produced values so as to add corresponding information to an intermediate data structure. Information from a plurality of the intermediate data structures is aggregated to produce output data. | 08-23-2012 |
20120215788 | Data Processing - A method comprising: receiving sample data for a plurality of channels, wherein the sample data comprises a plurality of separate sample values and each sample value may be identified using at least a channel index that differentiates between channels and a sampling index that differentiates between sample values; performing energy compaction with respect to at least one of the channel indexes and the sampling indexes to create compacted sample values where each compacted sample value may be identified using at least a channel index that differentiates between channels and a sampling index that differentiates between sample values; and selecting some but not all of the compacted sample values for further program. | 08-23-2012 |
20120221576 | COMPRESSED JOURNALING IN EVENT TRACKING FILES FOR METADATA RECOVERY AND REPLICATION - Embodiments are directed towards employing compressed journaling for event tracking files for metadata recovery and replication. Event data and related metadata are received from one or more client devices. When a feature within the received metadata is detected that is previously unwritten to a journal, then the previously unwritten feature is written to the journal. Further, any feature is detected for the received event data that is determined to be different from a feature associated with an immediately preceding event data that is written in the journal, then the detected different feature is identified in the journal. In one embodiment, the identification employs writing to the journal an effective feature record that may employ indices identifying the different feature. The received event data is also written to the journal and may further employ string arguments to minimize recording of redundant information into the journal. | 08-30-2012 |
20120221577 | METHOD OF AND APPARATUS FOR ORGANIZING DATA RECORDS IN A RELATIONAL DATABASE - Embodiments of the invention relate to organizing data records in a relational database. An aspect of the invention includes creating index items for a plurality of data records. Each index item includes a counter and the creating results in a plurality of counters. The numerical values of counters in corresponding index items are updated for data records in the plurality of data records that are subjected to random access. The plurality of data records are reorganized based upon the numerical values of the plurality of counters. | 08-30-2012 |
20120226698 | FOOD OR DRINK PRODUCT SEARCHING AND MATCHING SYSTEM AND METHOD - A system and method for providing an information repository that optimizes profiles of sensory characteristics of food or drink products. The system receives user preferences or search criteria of similar sensory characteristics to match against food or drink products in a database with a very high degree of certainty or accuracy. The system and method also provide personalization to users, i.e. personal recommendations based on personal preferences, as well as product matching processes. | 09-06-2012 |
20120233174 | SYSTEM AND METHOD FOR USING AN ORDINARY ARTICLE OF COMMERCE TO ACCESS A REMOTE COMPUTER - A system and method for using identification codes found on ordinary articles of commerce to access remote computers on a network. In accordance with one embodiment of the invention, a computer is provided having a database that relates Uniform Product Code (“UPC”) numbers to Internet network addresses (or “URLs”). To access an Internet resource relating to a particular product, a user enters the product's UPC symbol manually, by swiping a bar code reader over the UPC symbol, or via other suitable input means. The database retrieves the URL corresponding to the UPC code. This location information is then used to access the desired resource. | 09-13-2012 |
20120233175 | DATABASE, SLIP DATA MANAGEMENT SERVER, AND INDEX DATA MANAGEMENT PROGRAM - In a database storing an index table in which index data used for retrieval of slip data that are generated for every business unit in a business process are registered. The index data are data containing a plurality of slip processed data respectively corresponding to the slip data. The slip processed data are data in which a specific item, which contains a predetermined item suitable for grasp of a business process in each business and a key item defined in advance in each business. The content of the specific item among items respectively set up to the slip data on various kinds of businesses are associated with each other in unit of slip data. | 09-13-2012 |
20120239660 | DISTRIBUTED LICENSE MANAGEMENT FOR A DATA LIMITED APPLICATION - The invention is directed towards enabling data volume and data type based licensing of software in a distributed system of a plurality of remote and/or local nodes. The invention enables measuring and optionally restricting the use of software based on one or more provided licenses that restrict the amount and type of data that may be processed by the software. New and older licenses may be added together for a single, bulk entitlement for a given volume of data processing for one or all types of data. Different users in the same enterprise may combine license entitlements too. Also, a new license can be acquired repeatedly, without requiring the issuance of combined licenses by the issuing authority and/or the revocation of prior licenses. | 09-20-2012 |
20120239661 | Method and System for Meta-Tagging Media Content and Distribution - A unique application within Video that allows for user generated Meta-tagging to be delivered in real time to individual clips is provided. This meta-tag creates its own tail based upon the user generated words population, which is searchable via a “spider network” that runs invisible behind the web pages of the site. These same “tagged” words are broadcast over various delivery networks including but not limited to live feeds from SMS, MMS, News Feeds within Community Web Sites, Video Sharing Web Sites, Widget applications any other forms of electronic communication that are dropped in real time to identified users and friends of users. | 09-20-2012 |
20120239662 | DOCUMENT MANAGEMENT APPARATUS AND DOCUMENT MANAGEMENT METHOD - According to the present invention, a document template and a plurality of document generated based on the document template are registered in association with each other. A document including a search term is searched. A document which includes the search term in a portion corresponding to the document template and a document which includes the search term in a portion other than the document template are displayed in an identifiable manner. In a full-text search, a user can identify a document hit based on a text originally included in the document template and a document hit based on the portion (portion specific to a document input by the user) other than the document template. | 09-20-2012 |
20120239663 | PERSPECTIVE-BASED CONTENT FILTERING - An automated system and a method are described for providing a personalized set and stream of content from the system to a user of an end-user device. The content may include multiple items of various sorts and media types, including textual items, video, audio and/or proprietary or composite content elements that originate from various sources. The content may be consumed by the end-user or by multiple applications on behalf of the end-user, whether on an end-user device or on a server for further processing. The system filters the content made available to end-users in a manner unique to each end-user. Filtering is based on characteristics associated with the content element, the originating source and author, and unique preferences relevant to the end-user. Filtering can be done in real time or be based on prior processing. | 09-20-2012 |
20120246166 | UNIFORM POINT CLOUD DECIMATION - A method, apparatus, system, article of manufacture, and computer readable medium provide the ability to create a point cloud indexed file. A grid (of cells that are divided into subcells) is mapped over points in a point cloud dataset. An occupancy value, that indicates whether a subcell contains a point, is computed for each subcell. A surface area contribution factor is computed for each cell and identifies a count of subcells that are occupied divided by a total number of subcells. The surface area contribution factor for each cell and points for each cell are written to the point cloud indexed file. | 09-27-2012 |
20120246167 | Reducing Bookmark File Search Time - A player utilizes a bookmark file to process content. The bookmark file includes a sequential series of bookmarks that each include information specifying how to process content. In one or more implementations, only some bookmarks are indicated as searchable. When the player searches for a particular bookmark, it only searches bookmarks indicated as searchable. In various other implementations, the player utilizes an index to search for a particular bookmark. In some cases, the index contains an entry for each bookmark in the bookmark file that includes only the type and location of the respective bookmark. In other cases, only some of the bookmarks may be indicated as searchable and the index may only include entries for the searchable bookmarks. However, in these cases, the index may not include entries for bookmarks indicated as non-searchable. In either set of cases, the index may be updated if the bookmark file changes. | 09-27-2012 |
20120246168 | SYSTEM AND METHOD FOR CONTEXTUAL RESUME SEARCH AND RETRIEVAL BASED ON INFORMATION DERIVED FROM THE RESUME REPOSITORY - A computer-based system and method for intelligent resume search on online repositories is disclosed. The parameters in the resumes and the attributes related to the said parameters are identified and extracted by scanning the resumes sequentially and are stored in an index file. Search queries are constructed based on accepted query parts as input. The index file is indexed to locate the parameters relevant to the search queries. An initial score is assigned to the parameters located which is transformed to new score based on identifying additional domain intelligence in the derived attributes related to the located parameters. Finally, the resumes relevant to the parameters with the transformed score are retrieved and displayed. | 09-27-2012 |
20120246169 | QUERYING COMPRESSED TIME-SERIES SIGNALS - Technologies pertaining to compressing time-series signals are described herein. Groups of time-series signals are generated based upon similarities between time-series signals. Each group of time-series signals includes a respective base time-series signal. Ratio signals that are representative of time-series signals are computed, wherein the ratio signals are based upon the base time-series signal and other respective time-series signals in a group of time-series signals. | 09-27-2012 |
20120254189 | MULTILEVEL INDEXING SYSTEM - A multilevel indexing system for indexing documents including structure and content information. The system may include a structure index module generating a structure index for the documents based on a document structure. A content index module may generate a content index for the documents based on a document type and document content. A computerized tree generation module may generate a multilevel indexing tree including the structure and content indexes. A search into the structure index may drive a search into the content index. | 10-04-2012 |
20120254190 | EXTRACTING METHOD, COMPUTER PRODUCT, EXTRACTING SYSTEM, INFORMATION GENERATING METHOD, AND INFORMATION CONTENTS - An extracting method includes storing to a storage device: files that include character units; first index information indicating which file includes at least one character unit in a character unit group having a usage frequency less than a predetermined frequency and among character units having common information in a predetermined portion, the usage frequency indicating the extent of files having a given character unit; second index information indicating which file includes a first character unit having a usage frequency at least equal to the predetermined frequency and among the character units having common information in a predetermined portion; and referring to the first and second index information to extract a file having character units in the first and second index information, when a request is received for extraction of a file having the first character unit and a second character unit that is included in the character unit group. | 10-04-2012 |
20120259860 | LOCK-FREE INDEXING OF DOCUMENTS - Methods and systems for serving documents are provided. Documents may be served to users, for example, in response to search query inputs. Documents may be individually received for serving to users. In order to serve documents, documents are typically indexed. To serve documents in a real-time environment, a data structure is configured to index documents that are received while simultaneously being queried for documents to serve to a user. Documents that have not yet completed the indexing process are not available for serving and, thus, are not queried. Additionally, the data structure includes a memory optimizer such that indexing does not waste memory of the data structure. | 10-11-2012 |
20120259861 | METHOD AND APPARATUS FOR STORING AND SEARCHING FOR INDEX INFORMATION - The present invention discloses a method for storing index information. The method includes: after storing a content, sending index information pointing to the stored content to index nodes in an aggregation network that belong to the same geographical area as the stored content for storing; and sending the index information pointing to the stored content to index nodes in a core network for storing. The present invention also discloses a method for searching for index information, an apparatus for storing index information, an apparatus for searching for index information, and a communication network system. | 10-11-2012 |
20120265762 | System and method for indexing electronic discovery data - Systems and methods for efficiently processing electronically stored information (ESI) are described. The systems and methods describe processing ESI in preparation for, or association with, litigation. The invention preserves the contextual relationships among documents when processing and indexing data, allowing for increased precision and recall during data analytics. | 10-18-2012 |
20120265763 | SYSTEM AND METHOD FOR DYNAMICALLY CONFIGURING CONTENT-DRIVEN RELATIONSHIPS AMONG DATA ELEMENTS - A computer system configures data elements based on textual sources by identifying subunits of a textual source, indexing the subunits into a sequence comprised of terms, identifies based on a target a base subsequence of the sequence, and stores the terms in such a way that they can be expanded or contracted and a user can rapidly and efficiently derive relevant information and context even from a vast amount of information including by navigable display to the user. Other methods and systems of configuring and displaying data elements from textual sources are provided. | 10-18-2012 |
20120271831 | Dynamically Generating Recommendations Based on Social Graph Information - In one embodiment, a method includes maintaining access to information comprising nodes and edges; receiving a request from a first user corresponding to a first user node for a structured document corresponding to a first concept node; determining a first data set that identifies concept nodes connected by edges with user nodes that are each connected by edges with both the first user node and the first concept node; determining a second data set that identifies concept nodes connected by edges with the first concept node and user nodes that are each connected to the first user node; generating a score for each concept node in the data sets; selecting one or more concept nodes based on their scores as recommended nodes; and transmitting to the client device the structured document and code executable by a client application to render node names or identifiers of the recommended nodes for display. | 10-25-2012 |
20120271832 | METHOD AND SYSTEM FOR SEARCHING STORED DATA - A complete document management system is disclosed. Accordingly, systems and methods for managing data associated with a data storage component coupled to multiple computers over a network are disclosed. Systems and methods for managing data associated with a data storage component coupled to multiple computers over a network are further disclosed. Additionally, systems and methods for accessing documents available through a network, wherein the documents are stored on one or more data storage devices coupled to the network, are disclosed. | 10-25-2012 |
20120278333 | COMPUTERIZED DATA SET SEARCH METHOD - A method provides computerized searching of a data set. A method determines the location of an item in a contiguous data set including a plurality of items, the data set is stored in a computer system, and the method assigns a numeric value to each of the plural items in the data set. A least squares polynomial may be created for the data set using ordered pairs of (x, y), wherein x is the assigned numeric value of the item the data set and y is the location of the item in the data set. A command is received to search for the location of a target item in the data set, and a calculated target item location in the data set is generated using a target item assigned numeric value and the least squares polynomial. | 11-01-2012 |
20120278334 | Database System - A data structure. The data structure includes a record containing data, a record identifier associated with the record, a user identifier associated with the record, and a linking identifier containing the record identifier of a single record with which the linking identifier is associated and the user identifier associated with the record. | 11-01-2012 |
20120284276 | Access to Annotated Digital File Via a Network - Apparatuses, systems, and processes provide remote access to virtual objects abstracted from an electronic document. A publisher's electronic document is abstracted into its component virtual objects that are each, in turn, identified. Corresponding computer readable files are formed for respective virtual objects and the computer readable files are indexed. The index is encoded in a non-transitory computer readable medium. Users annotate the virtual objects and the annotations and corresponding annotation metadata are indexed and encoded in the non-transitory computer readable medium. The user annotations are shared amount a plurality of users. | 11-08-2012 |
20120284277 | EVENT INFORMATION MANAGEMENT SYSTEM, EVENT MANAGEMENT METHOD AND PROGRAM - When multiple types of monitored objects are sorted into main monitored objects and smaller objects accommodated therein, the states of the smaller objects can be made searchable by only monitoring information about said main monitored objects. The event information management system is provided with: an association information storage unit which stores association information in which association lists comprising main monitored object IDs and subordinate monitored object IDs associated therewith are marked with specific list identifiers; an event management module which, if the attribute information of an input event shows a physical quantity, selects the list identifiers of the association lists containing main monitored object IDs matching the entity ID of said input event, and creates index information in which the list identifiers and the input event attribute information are associated; and an index storage unit which stores the index information for purposes of searching for events. | 11-08-2012 |
20120284278 | MONITORING STATUS DISPLAY DEVICE, MONITORING STATUS DISPLAY METHOD AND MONITORING STATUS DISPLAY PROGRAM - A monitoring status display device according to the present invention has: use index number determination means for, when displaying an arrangement of indices representing monitoring subjects in a predetermined display region, determining use index numbers that can be used with respect to importance levels set for the monitoring subjects, based on a ratio of the importance levels and a total number of the indices; importance level allocation means for determining an index to be allocated to each of the importance levels, based on the arrangement of the indices and the use index numbers set for the respective importance levels, in a manner that indices with the same importance level are disposed close to each other; and monitoring subject allocation means for allocating, substantially evenly to the indices, the monitoring subjects having an importance level same as that of the indices. | 11-08-2012 |
20120284279 | CODE STRING SEARCH APPARATUS, SEARCH METHOD, AND PROGRAM - An index data configuration adapted to a code-string search method for a structured code string having data codes, first separator codes that separate a data code or a data code string and second separator codes that divide a code string into partial code strings. The configuration has a code ID range table holding the code ID ranges for each code and a next code ID table holding next code IDs. Using the configuration, a partial code string is searched for in the search target code string by a first search code string consisting of the data code or the data code string and a first separator code. Next, using a second search code string consisting of first separator codes, the data code or the data code string separated by each of the first separator codes is searched from the found partial code string. | 11-08-2012 |
20120290582 | METHOD AND APPARATUS FOR PROVIDING DISTRIBUTED KEY RANGE MANAGEMENT - An approach is provided for providing distributed key range management. A key management platform determines to rebalance a node for storing a key value range by splitting at least one other node from the node, wherein the at least one other node stores at least a portion of the key value range. The key management platform also causes, at least in part, a creation of one or more node lists for the node, the at least one other node, or a combination thereof. The key management platform further causes, at least in part, a storage of the one or more node lists at the node and the at least one other node, wherein any one of the one or more node lists can be used to initiate a routing of one or more requests between the node and the at least one other node to operate on the key value range. | 11-15-2012 |
20120290583 | Using Application Metadata To Identify Applications Of Interest - Methods, systems, apparatus, and computer storage mediums for receiving an initial index, the initial index associating each application of one or more applications with one or more applications of interest; for each application, receiving metadata associated with the application and receiving metadata associated with each of the one or more applications of interest; processing the metadata to selectively remove at least one of the one or more applications of interest from the initial index to provide a final index, the final index associating each application of the one or more applications with a remaining one or more applications of interest; storing the final index in memory; receiving a user selection of a particular application of the one or more applications; retrieving the one or more applications of interest based on the particular application using the final index; and providing the one or more applications of interest for display. | 11-15-2012 |
20120290584 | Recommending Applications for Mobile Devices Based on Installation Histories - Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving installation data, the installation data corresponding to one or more applications that can be installed and executed on mobile computing devices, receiving metadata corresponding to each of the one or more applications, for each application: generating a time series based on a number of installs, processing the time series and the metadata, calculating a score, determining that the score is greater than or equal to a threshold score, and in response to the determining, adding the respective application to an index of trending applications, storing the index of trending applications in computer-readable memory, retrieving the index of trending applications, and transmitting indications of one or more applications for display based upon the index of trending applications. | 11-15-2012 |
20120296913 | System and Method of Partitioned Lexicographic Search - A search method is provided. The method comprises, building a first plurality of collectors, associating a first plurality of index entries with the first plurality of collectors, wherein each index entry comprises a search target and is associated with at least one of the first plurality of collectors based on a similarity between the search target of the index entry and the at least one collector, selecting a second plurality of collectors from among the first collectors based on a similarity between a search term and each of the second plurality of collectors, selecting a second plurality of index entries from among the index entries associated with the second plurality of collectors based on a similarity between the search term and the search target of each of the index entries associated with the second plurality of collectors, and presenting information related to the second plurality of index entries. | 11-22-2012 |
20120296914 | TEMPORALLY-CORRELATED ACTIVITY STREAMS FOR CONFERENCES - Techniques are provided for capturing events and activities that occur during a conference, generating metadata related to the events, and correlating the metadata with specific points in time, within the conference, at which the corresponding events occurred. The resulting temporally-correlated event metadata may be stored as part of the conference recording, or separate from the conference recording. Once the temporally-correlated event metadata has been stored for a conference, the conference may be indexed based on the metadata. The index may then be used to not only to locate a conference that satisfies specified search criteria, but to identify the points or snippets, within the conference, that correspond to the search criteria. | 11-22-2012 |
20120296915 | Collective Acceleration Unit Tree Structure - A mechanism is provided in a collective acceleration unit for performing a collective operation to distribute or collect data among a plurality of participant nodes. The mechanism receives an input collective packet for a collective operation from a neighbor node within a collective tree. The input collective packet comprises a tree identifier and an input data field and wherein the collective tree comprises a plurality of sub trees. The mechanism maps the tree identifier to an index within the collective acceleration unit. The index identifies a portion of resources within the collective acceleration unit and is associated with a set of neighbor nodes in a given sub tree within the collective tree. For each neighbor node the collective acceleration unit stores destination information. The collective acceleration unit performs an operation on the input data field using the portion of resources to effect the collective operation. | 11-22-2012 |
20120296916 | METHOD, APPARATUS AND SOFTWARE FOR PROCESSING DATA ENCODED AS ONE OR MORE DATA ELEMENTS IN A DATA FORMAT - An index of data elements is stored with a file. The file encodes the data elements. The index indicates memory locations of the data elements within the file. The index and the file are stored in a storage device. A parser or decoder accesses the index to identify data elements. At least one of the storing and accessing are performed by a computing device. | 11-22-2012 |
20120303627 | RESPONDING TO A QUERY IN A DATA PROCESSING SYSTEM - A data processing system includes a plurality of processing stages. In response to a query, a membership structure is accessed to determine whether partially processed data from a particular one of the processing stages. | 11-29-2012 |
20120303628 | PARTITIONED DATABASE MODEL TO INCREASE THE SCALABILITY OF AN INFORMATION SYSTEM - A database includes data tables and indexes that are partitioned. Searches against the data table are performed in parallel over the multiple partitions. The indexes on each partition maintain indexes associated with the data on the given partition. Data tables storing string data include a string data file and index files for each word stored in the string data file. | 11-29-2012 |
20120303629 | SYSTEMS AND METHODS FOR ASSISTING PERSONS IN STORING AND RETRIEVING INFORMATION IN AN INFORMATION STORAGE SYSTEM - A system is disclosed for assisting users in storing and retrieving information in an information storage system. The system includes selection means for permitting a user to choose selected information to be stored; convening means for convening a tagging tribunal, the tagging tribunal providing a graphical aggregation of available tags and information nuggets that may be selected by a user to create an association between at least one selected tag or information nugget and the selected information based on the user's decision to create such an association; and linking means for creating an associative link between the at least one selected tag or information nugget and the selected information, the associative link being part of an associative network. | 11-29-2012 |
20120303630 | APPARATUS AND METHOD FOR ORGANIZING, STORING AND RETRIEVING DATA USING A UNIVERSAL VARIABLE-LENGTH DATA STRUCTURE - A method and apparatus implementing a “hadron” data structure architecture. In some embodiments, each hadron includes a frame identifier and a holder for a payload of data, wherein the plurality of hadron data structures are stored in a hadron block that provides a specification that includes metadata that specifies the payload. Some embodiments include a plurality of hadron blocks first and second hadron data blocks, each block including one or more hadron pages. Upon receiving data particles of a dataset, the system forms a first plurality of hadron data structures (hadrons) by creating a first frame identifier and associating the first frame identifier with a first data particle to form the first hadron, and creating a second frame identifier and associating the second frame identifier with a second data particle to form the second hadron, and stores the first plurality of hadron data structures in the first hadron page. | 11-29-2012 |
20120303631 | System and Method for Storing Files in a Distributed Environment - A system and method for storing data-files stored on distributed devices connected to a network. Data-elements of the data-files are allocated to data-blocks stored on the distributed devices. Key-identifiers are calculated for each of the data-blocks based on the allocated data-elements. The key-identifiers are stored in distributed tables stored on the distributed devices. Index-nodes are generated for the data-files based on the data-blocks. A Paxos algorithm is executed for the index-nodes based on the key-identifiers to provide a consensus of the data-files that are stored on the plurality of distributed devices. | 11-29-2012 |
20120310945 | Allocation of Assessments - Apparatuses, computer readable media, methods, and systems are described for processing a workload record for each of a plurality of assessors, each of the workload records identifying an assessment previously assigned to a particular one of the assessors, calculating a complexity score for each of the assessments, calculating a workload index for each of the assessors based on the complexity score of the assessment previously assigned to that assessor, and assigning a new assessment to a particular one of the assessors based on the workload indexes. | 12-06-2012 |
20120310946 | IDENTIFYING AN ITEM BASED ON DATA ASSOCIATED WITH THE ITEM - There are provided methods and systems to identify an item based on data associated with the item. Information is received that is descriptive of an item to be listed on an information storage and retrieval platform. The information is associated with an aspect-value pair, which is an aspect of the item and a value of the aspect. An expression of an interest of a user in the item is received. The expression is associated with the aspect-value pair. In response to the associating of the expression with the aspect-value pair, the item is identified. | 12-06-2012 |
20120310947 | ADAPTIVE HYBRID REASONING DECISION SUPPORT SYSTEM - A method for indexing a plurality of nodes using a computer system is provided. The computer system includes data storage and a processor coupled to the data storage. The method includes acts of storing the plurality of nodes in the data storage, each of the plurality of nodes having a hit count, a link count and an outcome, creating a qualitative index ordering a plurality of nodes according to the hit count, the link count and the outcome of each node and storing the qualitative index in the data storage. The hit count of each node indicates a number of times a case attribute associated with the node is presented to a user. The link count of each node indicates a number of times the case attribute associated with the node is affirmed as useful. The outcome of each node indicates a desirability of the outcome. | 12-06-2012 |
20120310948 | Indexing for Regular Expressions in Text-Centric Applications - A method, system, and article are provided for evaluating regular expressions over large data collections. A general purpose index is built to handle complex regular expressions at the character level. Characters, character classes, and associated metadata are identified and stored in an index of a collection of documents. Given a regular expression, a query is generated based on the contents of the index. This query is executed over the index to identify a set of documents in the collection of documents over which the regular expression can be evaluated. Based upon the query execution, the identified set of documents is returned for evaluation by the regular expression responsive to execution of the query over the index. | 12-06-2012 |
20120317121 | METHODS AND SYSTEMS FOR USING DISTRIBUTED MEMORY AND SET OPERATIONS TO PROCESS SOCIAL NETWORKS - Systems and methods for managing and evaluating a social network. The social network is represented as a graph structure and stored in distributed memory. A viable path from one node that is not directly connected to another node in the graph structure may be determined by traversing the graph in stages, moving outward from each node in stages until common midpoint nodes are found providing a connection between the nodes. When midpoint nodes are found, the paths connecting the one node to the other node may be reconstructed. | 12-13-2012 |
20120317122 | DISPERSED STORAGE NETWORK VIRTUAL ADDRESS FIELDS - A dispersed storage network includes a dispersed storage device to store data. The dispersed storage device includes a processing module operable to slice a data segment of a data object into data slices. The processing module further creates a slice name for each of the data slices. The slice name includes routing information containing a vault identifier that identifies a vault associated with the data object, in which the vault identifies at least one user. The slice name further includes a source data name containing an identifier of the data object. | 12-13-2012 |
20120323923 | Sorting Data in Limited Memory - A system for sorting tables comprises an interface operable to receive a first segment of an index column and a first segment of a key column from an on-disk database (ODDB), wherein a value in the index column represents a row of information in the ODDB and a value in the key column represents data to be sorted and each index value is associated with a key value, and wherein the ODDB is operable to store the sorted index values and key values in the first segments, a processor communicatively coupled to the interface, the processor is operable to sort the index values in the first segment and key values in the first segment by the key values according to sorting criteria, remove the sorted index values and key values in the first segments from an in-memory database in a sorting module, and the interface is operable to receive a second segment of the index column and a second segment of the key column from the ODDB. | 12-20-2012 |
20120323924 | METHOD AND SYSTEM FOR A MULTIPLE DATABASE REPOSITORY - Method, system, and programs for creating partitioned or fragmented log files during data logging to better manage file size, more easily facilitate data retrieval and file optimization. In an embodiment, typically large monolithic log files are fragmented or divided into smaller files that can be searched, stored, vacuumed and retrieved more easily. | 12-20-2012 |
20120323925 | System and Method for Synchronously Generating An Index to a Media Stream - A computer-implemented system and method for generating an index to a captured media stream. The system includes an output device configured to play a media stream. The system further includes an automatic tagging system for generating at least one auto tag based on the content of the received media stream, the auto tag associated with a portion of the received media stream and a user driven tagging system for generating at least one user tag based on a command received from a user, the user tag associated with a portion of the received media stream being provided at the time the command is received. The system yet further includes a non-transitory storage medium for capturing the received media stream in a media data file associated with a media index file, the media index file including the at least one auto tag and the at least one user tag. | 12-20-2012 |
20120323926 | Efficient Optimization over Uncertain Data - The subject disclosure is directed towards using fingerprints, comprising lists of simulation results corresponding to partial (random sampled) simulation results, to determine whether a full simulation may be avoided by reusing simulation results from a previous full simulation. Before running a full simulation, a current fingerprint is obtained via a partial simulation. If a previous fingerprint matches (is identical or similar to) the current fingerprint, the associated previous results are reused. Also described is indexing fingerprint data to facilitate efficient lookup-based fingerprint matching. | 12-20-2012 |
20120330964 | SMART INDEX CREATION AND RECONCILIATION IN AN INTERCONNECTED NETWORK OF SYSTEMS - A tool for using an interconnected network of systems to create an index for a database table. An index advisor on a primary server recommends one or more indexes to improve efficiency. While resources of the primary server are being used by various queries and processes, the primary server sends the recommendations to a secondary server (with available resources) so that the recommended indexes may be built in parallel with the processes executing on the primary server. The secondary server builds the recommended indexes based on its own copies of the database tables. The secondary server sends the built indexes to the primary server, where the primary server must reconcile the indexes with any changes that took place to the database tables subsequent to the replication of the tables on the secondary server. The primary server makes the associations between the new indexes and the tables they were built for. | 12-27-2012 |
20120330965 | METHOD AND APPARATUS FOR STORING AND SEARCHING FOR KEYWORD - A method for storing a keyword includes: performing a first Hash function operation and a second Hash function operation on the keyword to obtain an addresses of a first Hash bucket and an address of a second Hash bucket respectively; searching for the first Hash bucket and the second Hash bucket according to the address of the first Hash bucket and the address of the second Hash bucket; when the first Hash bucket has remaining space, storing the compressed keyword of the keyword and a pointer of the keyword into the first Hash bucket; and when the first Hash bucket has no remaining space, the second Hash bucket has remaining space, and no compressed keyword in the second Hash bucket conflicts with the compressed keyword of the keyword, storing the compressed keyword of the keyword and the pointer of the keyword into the second Hash bucket. | 12-27-2012 |
20120330966 | INTERFACE SYSTEMS AND METHODS FOR ACCESSING STORED DATA - A modular data and storage management system. The system includes a time variance interface that provides for storage into a storage media of data that is received over time. The time variance interface of the modular data and storage management system provides for retrieval, from the storage media, of an indication of the data corresponding to a user specified date. The retrieved indication of the data provides a user with an option to access specific information relative to the data, such as content of files that are included in the data. | 12-27-2012 |
20130006998 | INTERESTINGNESS OF DATA - Provided are techniques for analyzing fields. Statistical metrics for each field in a data set are received. A general interestingness index is generated for each field using one or more combination functions that aggregate standardized interestingness sub-indexes. One or more fields are identified as interesting for further analysis using the general interestingness index. One or more expert recommendations for field transformations are constructed for the identified one or more fields. | 01-03-2013 |
20130006999 | METHOD AND APPARATUS FOR PERFORMING A SEARCH FOR ARTICLE CONTENT AT A PLURALITY OF CONTENT SITES - In order to retrieve article level content from a plurality of content providers, a federated search program receives a generic query from a user and dispatches the query simultaneously to a plurality of connector objects. Each connector object that is associated with a particular content source and contains source specific code that reformats the generic query into a proprietary format required for the associated content source. The proprietary query is then dispatched to the content source. When the results at the content source are ready, the result set is fetched by the connector. The fetched results are then mapped into a standard format. The standard result sets from the different content sources are then merged into a single consolidated result set. Duplicate documents are removed from the consolidated result set and the final results are sorted in accordance with criteria specified by the user and presented to the user. | 01-03-2013 |
20130007000 | Method and System for High Performance Integration, Processing and Searching of Structured and Unstructured Data Using Coprocessors - Disclosed herein is a method and system for integrating an enterprise's structured and unstructured data to provide users and enterprise applications with efficient and intelligent access to that data. In accordance with exemplary embodiments, the generation of metadata indexes about unstructured data can be hardware-accelerated by processing streaming unstructured data through a reconfigurable logic device to generate the metadata about the unstructured data for the index. | 01-03-2013 |
20130007001 | SEARCHING WITH EXCLUSION TOKENS - An aspect includes phrase searching using exclusion tokens. A token division unit is configured to divide an input character string to be searched into a plurality of tokens. A token position definition unit is configured to set each token to be excluded in an occurrence position calculation as an exclusion token and to set each token to be included in the occurrence position calculation as a headword token, and define an occurrence position for each headword token. A position offset information assigning unit is configured to assign, to each of the exclusion tokens, position information obtained with the headword token followed by the exclusion tokens and to assign the headword token followed by the exclusion tokens as a starting point. An indexing processing unit is configured to perform indexing on the plurality tokens such that whether or not the exclusion tokens follow one of the plurality of tokens is identifiable. | 01-03-2013 |
20130007002 | SYSTEMS AND METHODS FOR USING METADATA TO ENHANCE DATA IDENTIFICATION OPERATIONS - Systems and methods for managing electronic data are disclosed. Various data management operations can be performed based on a metabase formed from metadata. Such metadata can be identified from an index of data interactions generated by a journaling module, and obtained from their associated data objects stored in one or more storage devices. In various embodiments, such processing of the index and storing of the metadata can facilitate, for example, enhanced data management operations, enhanced data identification operations, enhanced storage operations, data classification for organizing and storing the metadata, cataloging of metadata for the stored metadata, and/or user interfaces for managing data. In various embodiments, the metabase can be configured in different ways. For example, the metabase can be stored separately from the data objects so as to allow obtaining of information about the data objects without accessing the data objects or a data structure used by a file system. | 01-03-2013 |
20130007003 | INTERESTINGNESS OF DATA - Provided are techniques for analyzing fields. Statistical metrics for each field in a data set are received. A general interestingness index is generated for each field using one or more combination functions that aggregate standardized interestingness sub-indexes. One or more fields are identified as interesting for further analysis using the general interestingness index. One or more expert recommendations for field transformations are constructed for the identified one or more fields. | 01-03-2013 |
20130013615 | PROVIDING ANSWERS TO QUESTIONS INCLUDING ASSEMBLING ANSWERS FROM MULTIPLE DOCUMENT SEGMENTS - A method, system and computer program product for generating answers to questions. In one embodiment, the method comprises receiving an input query, identifying a plurality of candidate answers to the query; and for at least one of these candidate answers, identifying at least one proof of the answer. This proof includes a series of premises, and a multitude of documents are identified that include references to the premises. A set of these documents is selected that include references to all of the premises. This set of documents is used to generate one or more scores for the one of the candidate answers. A defined procedure is applied to the candidate answers to determine a ranking for the answers, and this includes using the one or more scores for the at least one of the candidate answers in the defined procedure to determine the ranking for this one candidate answer. | 01-10-2013 |
20130018889 | Lossless compression of high nominal-range dataAANM Jagmohan; AshishAACI IrvingtonAAST NYAACO USAAGP Jagmohan; Ashish Irvington NY USAANM Knight; Joshua W.AACI Mohegan LakeAAST NYAACO USAAGP Knight; Joshua W. Mohegan Lake NY USAANM Lastras-Montano; Luis A.AACI Cortlandt ManorAAST NYAACO USAAGP Lastras-Montano; Luis A. Cortlandt Manor NY US - A method for receiving a data stream that includes data samples, each data sample having one of a plurality of actual values. For each data sample in the data stream, a first index in a dictionary is selected. The dictionary includes indices corresponding to each of the plurality of actual values. The first index corresponds to an actual value of the data sample. A predicted value of the data sample is generated in response to previously received data samples in the data stream and to a prediction algorithm. A second index in the dictionary that corresponds to an actual value in the dictionary that is closest to the value of the predicted value is selected. The difference between the first index and the second index is calculated and compressed. The compressed difference between the first index and the second index is then output. This process is performed for each data sample in the data stream. | 01-17-2013 |
20130018890 | CREATING A CUSTOM INDEX IN A MULTI-TENANT DATABASE ENVIRONMENT - Methods and systems are described for creating a custom index in a multi-tenant database environment. In one embodiment, a method includes obtaining query for a multi-tenant database that is recommended as a candidate for creating an additional filter, evaluating the query against criteria to determine whether to select the query for creating the additional filter, and creating the additional filter for the query, if the query is selected. | 01-17-2013 |
20130024458 | System and Method for Efficiently Representing and Managing A Computer Facility - A method of representing and managing hierarchical relationship configuration in a computing facility is described. The method includes providing and storing a first index of hardware identifier assigned to each object in the computing facility; providing and storing a second index of ancestry identifiers of each object in the computing facility, the ancestry identifier of an object being the hardware identifier of an ancestor object at 1 to n hierarchy levels above the object; providing and storing a type information element for each ancestor object indicative of a type of ancestor object; and identifying an ancestor object of a particular object in the computing facility by accessing the first index of hardware identifier of the particular object, and identifying an ancestor object thereof of a particular type by accessing the ancestry identifiers and the type information element of the particular object. | 01-24-2013 |
20130024459 | Combining Full-Text Search and Queryable Fields in the Same Data Structure - A method for creating a search index is disclosed. A plurality of words found in one or more documents is identified. For each word of the plurality of words, one or more fields of the one or more documents in which the word can be found is identified. Using a computing device, a search index is created for each word of the plurality of words. The search index for each word of the plurality of words provides a mapping between the word and each occurrence of the word in each field of the one or more documents in which the word is found. | 01-24-2013 |
20130024460 | APPARATUS, SYSTEM, AND METHOD FOR INDEXING DATA OF AN APPEND-ONLY, LOG-BASED STRUCTURE - Methods for indexing data of an append-only, log-based structure include writing a plurality of data packets to a storage medium by sequentially appending the data packets to a log-based structure of the storage medium, the data packets associated with different logical identifiers belonging to a logical address space that is independent of physical storage locations on the storage media. The methods may further include writing an index segment associated with the plurality of data packets to the log-based structure, the index segment comprising index entries for determining the logical identifiers of the data packets and recording, on the storage media, information indicating where the index segment is written on the storage medium. | 01-24-2013 |
20130024461 | SYSTEM AND METHOD FOR PROVIDING LOCATION-SENSITIVE AUTO-COMPLETE QUERY - Provided is a system and method for providing a location-sensitive auto-complete query. The system includes at least one processor; a memory to store at least one program configured to be executed by the at least one processor; a keyword generating unit to generate a keyword based on the query inputted from a user terminal and location information received from the user terminal; and a query providing unit to provide an auto-complete query corresponding to the generated keyword among the auto-complete queries indexed according to location information. | 01-24-2013 |
20130031103 | System and Method for using a Device Description Repository - A method, computer program product, and computer system for using a device description repository comprises calculating an index by a processor based upon, at least in part, at least a portion of a plurality of user preferences associated with a computing device. One of a first experience and a second experience may be provided to the computing device based upon, at least in part, the index. | 01-31-2013 |
20130036122 | ASSESSING APPLICATION PERFORMANCE WITH AN OPERATIONAL INDEX - The present invention relates to a system and method for assessing application performance and user satisfaction. In one embodiment, the system and method calculates an Operational Index (OPdex) representing user satisfaction with an application. The OPdex may be a number quantifying user satisfaction with an application, such as a web application, and system performance. The OPdex may be based on one or more measurable metrics having a range of values that may affect user satisfaction or performance of an application. The OPdex may comprise calculating the index based on a soft threshold, a hard threshold, and measurements indicating a perceived application response time. The OPdex calculation may also account for sensitivity of user satisfaction to response time. Based on the OPdex, the system and methods also provide information indicating the relationship among application response time thresholds set by the users, the user satisfaction level, and the mean response time. | 02-07-2013 |
20130036123 | ANTI-TAMPER PROCESS TOOLSET - A tool set is provided that uses a series of modules (e.g. spreadsheet workbooks) for assessing risks/threats to and attacks against a project and developing countermeasures against such risks/threats/attacks. The project may cover a wide berth of functionality from alarm systems, computer security, building projects, etc. The roll up feature of the spreadsheet may be employed to evaluate changing scenarios in a short period of time. Ranked threats and countermeasures operable to defeat such threats may be displayed in a tree diagram generated by one or more of the modules. | 02-07-2013 |
20130046765 | SEARCHING ENCRYPTED ELECTRONIC BOOKS - An e-book server creates index information for encrypted e-book content. The index information is used by a reader module to efficiently search the encrypted content. To create the index information, the e-book server identifies logical partitions of an e-book that include phrases, and creates encrypted segments including a portion of the e-book content that is not necessarily aligned with the logical partitions of the e-book. The e-book server then stores index information including a mapping between the encrypted segments and the phrases located in the logical partitions. | 02-21-2013 |
20130046766 | ITEM SELECTING APPARATUS, ITEM SELECTING METHOD AND ITEM SELECTING PROGRAM - In an item selecting apparatus performing a selection of an item to be recommended for each user, it is performed to calculate, with respect to each usage registration of an item by a user, an elapsed value as a difference between a time point of creating the item or staring providing of the item and a predetermined time point, acquire a usage characteristics of each user based on the elapsed value and calculate a freshness value representing a degree of freshness about each item. Further, using correspondence rules of different characteristics corresponding to the usage characteristics, it is performed to calculate a novelty index by applying the freshness value of each item on the correspondence rule corresponding to the usage characteristics of each user (S | 02-21-2013 |
20130054607 | Method and System for Preparing Digital Information for Long-Term Preservation - A system and method are provided for creating or extending a preservation-ready digital document. This document is represented so as to be durably intelligible and reliably trustworthy. It includes within itself standardized metadata, provenance information, and reliable links to chosen documents within a world-wide network of digital repositories. These links and the documents' own identifier(s) are chosen to uniquely, unambiguously, and forever identify what they refer to. This system provides a robustly durable method of preserving an unbounded number of digital objects for as long as their representing bit-strings are kept in existence and findable by now-conventional digital library technology, as first described in [Gladney 2000] and publications by the same author. The overall system herein described provides this service without requiring that pre-existing software be modified, and without requiring that any information object that it is intended to protect be modified from what its declared authors, editors, and producers created and provided as input for this preservation packaging service. | 02-28-2013 |
20130054608 | Computer-Implemented Systems And Methods For Processing A Multi-Dimensional Data Structure - Computer-implemented systems and methods are provided for processing a multi-dimensional data structure. For example, systems and methods are provided for processing the multi-dimensional data structure and allowing cell selection rules related to the multi-dimensional data structure to be resolved efficiently, so that computational cost associated with processing the multi-dimensional data structure can be saved. | 02-28-2013 |
20130054609 | Accessing Anchors in Voice Site Content - A method, an apparatus and an article of manufacture for accessing a specific location in voice site audio content. The method includes indexing, in a voice site index, a specific location in the voice site that contains the audio content, mapping the audio content with information regarding the location and adding the mapped content to the index of the voice site, using the index to determine content and location of an input query in the voice site, automatically marking the specific location in the voice site that contains the determined content and location of the input query, and automatically transferring to the marked location in the voice site. | 02-28-2013 |
20130054610 | Sharing Parts of a Document Using Search Framework - Embodiments are configured to provide sharing of business logic items. A document may contain business logic items, for example, sets, members, or measures. Some business logic items may be created by a publisher who wants to make the business logic available to other users so that others can access the business logic. Embodiments provide for using an integrated server platform search component to automatically retrieve business logic items which exist in one or more documents stored in a document library. This may allow for a publisher to provide business logic to other users without having to rely on the other users to retrieve the business logic from a specific document, and without requiring the other users to know of the existence of the business logic. Restrictions may be placed so that a publisher can control what specific pieces of business logic may be made available. | 02-28-2013 |
20130054611 | APPARATUS AND METHOD FOR PROCESSING PARTITIONED DATA FOR SECURING CONTENT - An apparatus to process partitioned data includes a storage unit to store a first data portion of a data file; a communication unit to receive a second data portion of the data file from a device in response to an authentication between the apparatus and the device; and a recovery processing unit to obtain the data file, by merging the first data portion and the second data portion of the data file using partition information. | 02-28-2013 |
20130054612 | Universal Document Similarity - Described herein are methods for finding substantially similar/different sources (files and documents), and estimating similarity or difference between given sources. Similarity and difference may be found across a variety of formats. Sources may be in one or more languages such that similarity and difference may be found across any number and types of languages. A variety of characteristics may be used to arrive at an overall measure of similarity or difference including determining or identifying syntactic roles, semantic roles and semantic classes in reference to sources. | 02-28-2013 |
20130060780 | Column Domain Dictionary Compression - In column domain dictionary compression, column values in one or more columns are tokenized by a single dictionary. The domain of the dictionary is the entire set of columns. A dictionary may not only map a token to a tokenized value, but also to a count (“token count”) of the number of occurrences of the token and corresponding tokenized value in the dictionary's domain. Such information may be used to compute queries on the base table. | 03-07-2013 |
20130060781 | PERFORMING INDEX SCANS IN A DATABASE - Methods and apparatus, including computer program products, implementing and using techniques for performing an index scan in a database system. A sequence of pages visited during an index scan are dynamically tracked. The tracking includes tracking the number of page accesses requiring a wait for input or output. It is determined whether prefetching of an object is desirable. The determination being made at least in part based on the number of page accesses requiring a wait for input or output. If prefetching is desirable, a prefetching method is selected among a set of available prefetching methods. An effectiveness of the selected prefetching method is continuously monitored by tracking the sequence of pages accessed and available access methods. A different prefetching method is selected from the set of available prefetching methods if the different prefetching method has a better effectiveness for a portion of the index that is being scanned. | 03-07-2013 |
20130060782 | DETERMINING INDEXES FOR IMPROVING DATABASE SYSTEM PERFORMANCE - Methods and systems for identifying one or more indexes are described. A system can collect a set of database operations that were performed during a given time window. Next, the system can partition the set of database operations based on a set of database tables that were accessed by the set of database operations. Then, for each database table in the set of database tables, the system can determine a set of indexes that are candidates for improving database system performance. Next, based on the candidate indexes, the system can identify one or more indexes, one or more combinations of indexes, and/or a consolidated index that improve overall database system performance. | 03-07-2013 |
20130066876 | IDENTIFYING UNSEEN CONTENT OF INTEREST - Making an information retrieval process public, so that it can be followed by others, allows capturing of an interest graph that allows people to learn more about shared interests with other people. This also allows items of interest to a trusted resource (such as an expert) to be identified. These items can then be brought to the attention of other users that share the same interest as the expert. In addition, by keeping track of what particular content a user has already seen, the system can bring items of interest to the user's attention, where the user has not yet seen those items. | 03-14-2013 |
20130066877 | DATA STREAMING FOR INTERACTIVE DECISION-ORIENTED SOFTWARE APPLICATIONS - Providing data to a computer software application by intercepting a request made by a computer software application to retrieve a first data set from a first data storage location, accessing a record, defined in advance of the request, that includes a reference to the first data set and a reference to a second data set, retrieving the first data set from a data storage location other than the first data storage location, and retrieving the second data set from a data storage location other than the first data storage location in advance of a request by the computer software application to access the second data set at the first data storage location. | 03-14-2013 |
20130066878 | METHOD AND APPARATUS FOR ENABLING VIRTUAL TAGS - A method and apparatus for enabling virtual tags is described. The method may include receiving a first digital image data and virtual tag data to be associated with a real-world object in the first digital image data, wherein the first digital image data is captured by a first mobile device, and the virtual tag data includes metadata received from a user of the first mobile device. The method may also include generating a first digital signature from the first digital image data that describes the real-world object, and in response to the generation, inserting in substantially real-time the first digital signature into a searchable index of digital images. The method may also include storing, in a tag database, the virtual tag data and an association between the virtual tag data and the first digital signature inserted into the index of digital images. | 03-14-2013 |
20130066879 | METHOD AND APPARATUS FOR MANAGING NUMEROUS MESSAGES - A method and apparatus for managing numerous messages are provided. The method includes: A. pulling, by a client terminal, a message index list from a server; B. pulling, by the client terminal, message bodies corresponding to message indexes in the message index list in batches; C. managing, by the client terminal, the pulled message bodies and message indexes by using a message pool mode; D. after receiving a finish indication triggered by a user, storing, by the client terminal, the message indexes and the message bodies in the message pool to a local disk. By using the present invention, numerous messages are managed. | 03-14-2013 |
20130073557 | MATCHING OF CASES BASED ON ATTRIBUTES INCLUDING AN ATTRIBUTE RELATING TO FLOW OF ACTIVITIES - A particular case is matched to further cases, where the matching is based on plural attributes contained in the particular case and in the further cases, wherein one of the plural attributes relates to a flow of activities taken to address the respective case. | 03-21-2013 |
20130073558 | SYSTEM AND PROGRAM PRODUCT FOR PROVIDING HIGH PERFORMANCE DATA LOOKUP - Under the present invention, index keys are generated for a set of documents. This is typically accomplished by examining the set of documents, and connecting data values extracted from the set of documents to yield the index keys. Once the index keys are generated, an index view will be generated into which the index keys are populated. Using the index keys in the index view, an agent will automatically obtain the set of documents (i.e., in the background). Then, when a user requests one of the documents, the document will already have been retrieved from storage. As such, it can readily be provided to the user. It should be understood that as used herein, the term “document” is intended to refer to any type of electronically stored data. | 03-21-2013 |
20130080439 | Systems and Methods for Contextual Analysis and Segmentation of Information Objects - Systems and methods are disclosed for contextual analysis and segmentation of information objects. In accordance with one implementation, information objects, such as web pages and user profiles, may be processed to obtain a list of key terms. An index may be created containing each of the information objects and associated key terms. Information objects may then be matched to contextual segments (i.e., categories of information objects) by submitting terms associated with the contextual segments against the index. Further, thresholding may be applied, so that only the most relevant information objects for a contextual segment are assigned to the contextual segment. | 03-28-2013 |
20130080440 | DOCUMENT ELEMENT INDEXING SYSTEM - Devices, methods and systems are disclosed herein to describe a document element indexing system which efficiently organizes different files or the constituent parts of different files into an index, thereby allowing users to easily and efficiently access, annotate and/or review the desired information. The indexing system may be applied to organize components of homogeneous and/or heterogeneous file types. | 03-28-2013 |
20130080441 | INDEX SELECTION FOR XML DATABASE SYSTEMS - A method, computer-implemented system, and computer program product for creating indexes over XML data managed by a database system are provided. The method, computer-implemented system, and computer program product provide for receiving a workload for the XML data, the workload including one or more database statements, utilizing an optimizer of the database system to enumerate a set of one or more path expressions by creating a virtual universal index based on the workload received and matching a path expression to the virtual universal index, and recommending one or more path expressions from the set of one or more candidate path expressions to create the indexes over the XML data. | 03-28-2013 |
20130080442 | METHODS AND SYSTEMS FOR A CLOSEST MATCH SEARCH - A system to generate an index for a closest match search is described. The system receives a corpus of information that includes member information. The system parses the member information to generate signatures for each of the members and stores the signatures in the index. The signatures are unique to the members. Accordingly, the signatures signify the respective members. The system subsequently utilizes the index to identify input information that matches signatures in the index to identify a closest match of the input information to one or more members in the corpus of information. | 03-28-2013 |
20130086070 | PRIOR ART MANAGEMENT - System and method permit prior art management. A method may comprise maintaining a database of patent portfolios and a database of patents with each patent stored in the database of patents associated with one or more patent portfolios stored in the database of patent portfolios. A method and system of prior art management may include maintaining a database of prior art portfolios, a database of patent matters, and a database of prior art reference citations, at least some of the prior art reference citations being associated with at least one of the patent matters stored in the database. Prior art references may be cross-cited between patent matters and portfolios. | 04-04-2013 |
20130091140 | Multi-Shot Scheduling System - A Multi-Shot Scheduling System chooses from multiple candidate playlists of positions to select a broadcast playlist. Candidate playlists are generated based upon scoring and selecting content items for the positions through the use of index values. Various embodiments of the Multi-Shot Scheduling System can select broadcast playlists for multiple groups of content and can provide different methods of controlling scheduling performance by restricting the range of candidate playlists from which the best playlist can be selected. | 04-11-2013 |
20130091141 | CONTENT QUALITY AND USER ENGAGEMENT IN SOCIAL PLATFORMS - Systems and methods for enhancing content quality and user engagement in social platforms are described. In one implementation, a method includes assessing a quality of each of a plurality of user contributed contents related to at least one topic associated with a web based social platform based at least on topic quality assessment parameters to generate a topic quality score for the at least one topic. Further, the method comprises generating a credibility score for each of a plurality of users who contributed the user contributed contents based in part on the topic quality score. Based on the credibility scores, at least one credible user from the plurality of users is identified to enhance at least one of quality of the web based social platform and user engagement in the web based social platform. | 04-11-2013 |
20130097172 | METHOD AND APPARATUS FOR INDEXING AND RETRIEVING MULTIMEDIA WITH OBJECTIVE METADATA - A method and system for indexing, searching, retrieving, and viewing multimedia content employs metadata. The multimedia content is stored in the system, along with associated two types of metadata: (1) objective metadata and (2) subjective metadata. The associated objective metadata contains the values of various objective parameters that pertain to the creation of the multimedia content. Examples of such objective parameters include focal length of the camera used, the recording medium (film or digital), dimensional view (2D or 3D), source of light, light temperature, selected focus (fore, mid, or background), camera motion, recorded frame rate, and rendered playback frame rate. The associated subjective metadata pertains to the subjective content of the multimedia and typically consists of at least one keyword or tag that describes the subjects, themes, or genres depicted by the multimedia content. A user may search for, retrieve, and view the multimedia content by employing search techniques to both the objective and subjective metadata stored in the system. | 04-18-2013 |
20130097173 | Methods and Apparatus for Searching a Message Store - Methods and apparatus for searching a message store on a portable device are described herein. The methods allow a plurality of possible search criteria to be generated and displayed. One or more of the possible search criteria may be selected and added to a search string that is used to search the message store. | 04-18-2013 |
20130103691 | USING A DATABASE TO TRANSLATE A NATURAL KEY TO A SURROGATE KEY - A technique includes, in response to an access to a database involving access to a table and specifying a natural key, using the database to translate the natural key to a surrogate key based at least in part on a mapping | 04-25-2013 |
20130103692 | Predicting User Responses - Predicting user responses to items is useful in many application domains, such as personalized information retrieval and recommendation systems. In an embodiment a contacts service identifies contacts of a target user and predictions are elicited from the contacts about the target user's response to an item. In various examples, the predictions are combined taking into account weights of the contacts to produce a prediction of the target user's response. For example, the response may be one or more of: a numerical rating, a word or phrase describing the targets user's opinion of the item and a word or phrase stating a reason that the target user holds the opinion. In examples, accuracy of the predictions is calculated after observing the target user's actual response. The accuracy may be used to calculate and display scores and rankings of the contact's prediction abilities and to update the weights of the contacts. | 04-25-2013 |
20130103693 | INFORMATION SEARCH DEVICE, INFORMATION SEARCH METHOD, COMPUTER PROGRAM, AND DATA STRUCTURE - An information search device includes: a path field generating part that, for each node, generates a path field in which paths with the node as an origin are connected; an index generating part that, for each element configuring graph structure information, generates a posting list as a list of information composed of a node having a path field including the element and location information representing a location where the element appears in the path field, and generates an index repository in which the element and the posting list are associated; a search path generating part that generates a search path representing a search condition; and a searching part that extracts a node having a path field satisfying an appearance sequence of elements included in the search path based on the location information of the posting list. | 04-25-2013 |
20130110841 | METHOD AND APPARATUS FOR QUERYING MEDIA BASED ON MEDIA CHARACTERISTICS | 05-02-2013 |
20130110842 | TOOLS AND TECHNIQUES FOR EXTRACTING KNOWLEDGE FROM UNSTRUCTURED DATA RETRIEVED FROM PERSONAL DATA SOURCES | 05-02-2013 |
20130110843 | METHOD AND SYSTEM FOR CLASSIFYING DOCUMENTS | 05-02-2013 |
20130110844 | DOCUMENT INDEXING | 05-02-2013 |
20130110845 | BULK MATCHING WITH UPDATE | 05-02-2013 |
20130110846 | DATA PROCESSING METHOD AND APPARATUS | 05-02-2013 |
20130117272 | SYSTEMS AND METHODS FOR HANDLING ATTRIBUTES AND INTERVALS OF BIG DATA - Data management techniques are provided for handling of big data. A data management process can account for attributes of data by analyzing or interpreting the data, assigning intervals to the attributes based on the data, and effectuating policies, based on the attributes and intervals, that facilitate data management. In addition, the data management process can determine relations among data in a data collection and generate and store approximate results concerning the data based on the attributes, intervals, and the policies. | 05-09-2013 |
20130117273 | FORENSIC INDEX METHOD AND APPARATUS BY DISTRIBUTED PROCESSING - Provided is a forensic index method by distributed processing, including: generating data to be divided by dividing data to be indexed according to predetermined division setting for distributed processing; allocating the generated data to be divided to a plurality of data processing units according to the predetermined division setting, extracting an index by filtering the allocated data to be divided in the plurality of data processing units, and generating divided index data including the extracted index; and generating an index database by merging the generated divided index data. | 05-09-2013 |
20130117274 | ELECTRONIC DEVICE WITH ADDRESS BOOK AND ADDRESS BOOK MANAGEMENT METHOD - In an address book management method of an electronic device, a directory of members of an address book is created. A communication bulk and a communication count of each of the members listed in the address book are obtained. The communication bulk for each member is a total quantity of electronic communication in a predetermined time period between a predetermined user of the electronic device and the member, the total quantity measured according to a predetermined criterion, and the communication count for each member is a total number of occasions of electronic communication between the user and the member in the predetermined time period. An accumulative contact quantity index of each member is calculated according to the calculated communication bulk and communication count of the member. Thus, the members in the directory of the address book are ordered according to the accumulative contact quantity indexes. | 05-09-2013 |
20130117275 | INDEX MONITORING SYSTEM, INDEX MONITORING METHOD AND PROGRAM - Provided are an index monitoring system, an index monitoring method and a program which enable to understand a state of a monitoring target object, including a retention state of an index value. | 05-09-2013 |
20130124531 | SYSTEMS FOR EXTRACTING RELEVANT AND FREQUENT KEY WORDS FROM TEXTS AND THEIR PRESENTATION IN AN AUTO-COMPLETE FUNCTION OF A SEARCH SERVICE - Systems for searching and reviewing text files among a plurality of users are disclosed. The systems include a server that is configured to receive, index, and store a plurality of text files, which are received by the server from a plurality of sources, within at least one database in communication with the server. In addition, the server is configured to provide users with the ability to search for certain text files stored within the system. The search functionality will include an auto-complete feature, which provides a user of the system with a list of proposed key words to use when conducting the search. The proposed key words will represent the most frequently searched and information-rich key words that the system identifies over a period of time. | 05-16-2013 |
20130138658 | DATABASE QUERY ACROSS PROCESSES WITH CHANGE NOTIFICATION - Indexes for predefined search orders of items in a database are generated and stored. When a client issues a database query a responsive pre-generated index list is retrieved and provided to the client for use in, e.g., populating a U/I view for a user. Only those items that a client needs, e.g., for populating a current U/I view, are retrieved from the database and output to the client. When a change is rendered to the database, e.g., an item is added or deleted or an existing item is altered, only the change is output to the client, rather than the entire modified index or altered item. In this manner clients can more quickly and efficiently respond to user data query requests by performing some processing upfront and by limiting communications traffic to communications relevant to the client's current processing. | 05-30-2013 |
20130138659 | METHOD AND SYSTEM FOR RETRIEVING INFORMATION FROM SEMANTIC DATABASE - Embodiments of the present disclosure set forth a method for retrieving information from a semantic database having a plurality of semantic data. The method includes selecting a hub that is resolved with a number of semantic data based on a resolution rule, when the number of the semantic data is greater than a threshold in an offline environment; calculating a semantic data set that is associated with the hub based on a searching approach and the resolution rule in the offline environment; indexing the semantic data set in the offline environment; modifying the semantic database to include the semantic data set in the offline environment; and retrieving information from the semantic data set in an online environment. | 05-30-2013 |
20130144885 | FILE SEARCH APPARATUS AND METHOD USING ATTRIBUTE INFORMATION - A file search apparatus using attribute information, includes an attribute extraction unit configured to extract attribute information by analyzing a file; and a distributed index generation unit configured to generate an attribute-based index database on the basis of the attribute information of the file. Further, the file search apparatus includes a storage unit configured to store the attribute-based index database; and a file search unit configured to search, when a query is input, an index database corresponding to the query in the storage unit to generate a search result. | 06-06-2013 |
20130159315 | METHODS FOR PREFIX INDEXING - According to one aspect of the invention, in response to one or more terms to be indexed, each of the terms is indexed in a regular index. In addition, for each of the terms having multiple characters, at least one prefix portion of the term is indexed in a prefix index, where the regular index is used for regular searches and the prefix index is used for prefix searches without having to combine a plurality of postings lists of the regular index at the point in time. | 06-20-2013 |
20130166566 | ACCELERATED QUERY OPERATORS FOR HIGH-SPEED, IN-MEMORY ONLINE ANALYTICAL PROCESSING QUERIES AND OPERATIONS - An additional data structure can be initialized for a column of compressed data to include a prefix storing, for each block of values in the column, a total number of bits set in previous blocks in the bit vector. A block number can be determined for a target block of the plurality of blocks, for example by checking whether or not a specified row number is located in the prefix. If the specified row number is located in the prefix, the prefix value of the prefix is returned, the most frequently occurring value is returned if a corresponding bit in the bit vector in the specified row number is not located in the prefix, or a position of the specified row in an index vector for the column is returned. | 06-27-2013 |
20130166567 | PROCESSING AND PUBLISHING DIGITAL CONTENTS INCLUDING ENCYCLOPEDIA - Techniques, devices and systems for publishing digital contents such as encyclopedia are provided. One method for such digital publishing includes obtaining entries of encyclopedic resource files; indexing attributes of each of the entries by using preset indexing rules; reorganizing the entries based on the attributes; and publishing the reorganized entries by applying a preset template. | 06-27-2013 |
20130166568 | SCALABLE ANALYSIS PLATFORM FOR SEMI-STRUCTURED DATA - A method of operating a query system includes retrieving objects from a data source, wherein each of the retrieved objects includes (i) data and (ii) metadata describing the data. The method includes dynamically creating a cumulative schema by inferring a schema from each of the retrieved objects and merging the inferred schema with the cumulative schema. The method includes storing the data of each of the retrieved objects in a storage service. The method includes receiving, from a user, a query, and responding to the query based on data stored by the storage service. | 06-27-2013 |
20130173627 | Efficient Deduplicated Data Storage with Tiered Indexing - A deduplicated data storage system provides high performance storage to heterogeneous clients that connect to it via a communications network. The deduplicated data storage system provides fast access to deduplication data by caching the most frequently accessed deduplication data in a hyperindex. Updates to the non-cached deduplication data are serialized by use of a store queue and hold queue. | 07-04-2013 |
20130173628 | SYSTEM AND METHOD FOR GEOGRAPHICALLY ORGANIZING AND CLASSIFYING BUSINESSES ON THE WORLD-WIDE WEB - A method and search engine for classifying a source publishing a document on a portion of a network, includes steps of electronically receiving a document, based on the document, determining a source which published the document, and assigning a code to the document based on whether data associated with the document published by the source matches with data contained in a database. An intelligent geographic- and business topic-specific resource discovery system facilitates local commerce on the World-Wide Web and also reduces search time by accurately isolating information for end-users. Distinguishing and classifying business pages on the Web by business categories using Standard Industrial Classification (SIC) codes is achieved through an automatic iterative process. | 07-04-2013 |
20130173629 | SYSTEM AND METHOD FOR GEOGRAPHICALLY ORGANIZING AND CLASSIFYING BUSINESSES ON THE WORLD-WIDE WEB - A method and search engine for classifying a source publishing a document on a portion of a network, includes steps of electronically receiving a document, based on the document, determining a source which published the document, and assigning a code to the document based on whether data associated with the document published by the source matches with data contained in a database. An intelligent geographic- and business topic-specific resource discovery system facilitates local commerce on the World-Wide Web and also reduces search time by accurately isolating information for end-users. Distinguishing and classifying business pages on the Web by business categories using Standard Industrial Classification (SIC) codes is achieved through an automatic iterative process. | 07-04-2013 |
20130173630 | SYSTEM AND METHOD FOR GEOGRAPHICALLY ORGANIZING AND CLASSIFYING BUSINESSES ON THE WORLD-WIDE WEB - A method and search engine for classifying a source publishing a document on a portion of a network, includes steps of electronically receiving a document, based on the document, determining a source which published the document, and assigning a code to the document based on whether data associated with the document published by the source matches with data contained in a database. An intelligent geographic- and business topic-specific resource discovery system facilitates local commerce on the World-Wide Web and also reduces search time by accurately isolating information for end-users. Distinguishing and classifying business pages on the Web by business categories using Standard Industrial Classification (SIC) codes is achieved through an automatic iterative process. | 07-04-2013 |
20130173631 | System and Method for Importing NetCDF Data - System and method for providing an automated method of importing NetCDF formatted data from a variety of sources that contain a variety of internal datasets. Options for processing and analysis can be provided and can occur simultaneously as the data are being processed. Storage and organization options provide efficient viewing and analysis of NetCDF datasets. | 07-04-2013 |
20130191396 | DOCUMENT INDEXING AND DELIVERY SYSTEM - A document indexing and delivery system is disclosed. Embodiments of the present invention provide an agent-based delivery platform to supply financial document images in a variety of formats to fulfill requests from various clients of an image processing system at a financial institution. Workflows can be created using extensible markup language (XML) to direct the operation of the system. Software agents on the system can include a client agent, a delivery agent, an index agent, a format agent, an image agent, and/or a workflow agent. The workflow agent manages the execution of the workflow. Images and relevant data can be located using a master index residing in partitioned, high-speed databases. | 07-25-2013 |
20130198197 | METHOD AND APPARATUS FOR CORRELATING AND VIEWING DISPARATE DATA - A computer-implemented method comprising collecting data from a plurality of information sources, identifying a geographic location associated with the data and forming a corresponding event according to the geographic location, correlating the data and the event with one or more topics based at least partly on the identified geographic location and storing the correlated data and event and inferring the associated geographic location if the data does not comprise explicit location information, including matching the data against a database of geo-referenced data. | 08-01-2013 |
20130198198 | GENERATING METHOD, GENERATING SYSTEM, AND RECORDING MEDIUM - A computer-readable recording medium stores a program for causing an apparatus to execute a procedure. The procedure includes: regarding each data in a data set having a key and a value corresponding to the key, calculating a key characteristic value of the data as an index value indicating an existence of the key in the data and another key included in the value based on the value of the key and the value of the other key included in the value; generating a segment in which data blocks of which the calculated key characteristic values are similar to each other are grouped in one group; and setting a storage destination of the data in the generated segment. | 08-01-2013 |
20130198199 | METHOD OF INDEXING A DATABASE - A method of sorting a database of records and data items, in which each record has an identifier, data variables and paths pointing to data items being the value of the data variables is disclosed. The database has a first and second frequency for each path of the first and second data variables, respectively. The method includes creating an intermediate array having a section for each value of the second data variable. Storing the identifier of each record. Creating a final array having sections for each value of the first data variable. Storing the identifier of the records into the section of the final array corresponding to the value of its first data variable. Identifying break points in the final array. Repeating the previous two steps for each section of the intermediate array. Creating a break point index. | 08-01-2013 |
20130198200 | MEDICAL IMAGE PROCESSING APPARATUS, PROGRAM, AND MEDICAL APPARATUS - According to one embodiment, a medical image processing apparatus includes a reception unit, determination unit, dividing unit, processing control unit, and storage processing unit. The reception unit receives first data including supported and unsupported tags, and individual data added to the supported or unsupported tag. The determination unit determines whether a tag included in the first data is supported or not. The dividing unit divides the first data into second data and third data. The processing control unit executes data processing by referring to the second data. The storage processing unit generates fourth data by adding second data to data acquired by the processing. | 08-01-2013 |
20130198201 | Specification Document Check Method, Program, and System - A system for storing a specification document to be checked on a storage device in a computer readable format. A computer implemented method which stores a graph structure of rules for performance of checking, and creates a rule application table of whether a rule is an application subject. For each node rule of the graph structure, average entropy of whether a specification satisfies the rules is derived by searching the graph structure of rules. While performing a depth-first searching of the graph structure of rules, a priority order index is derived from the rule application table for each node rule. Amendment candidates are then displayed according to value of the priority order index. | 08-01-2013 |
20130198202 | METHOD AND COMPUTER PROGRAM PRODUCT FOR GEOPHYSICIAL AND GEOLOGIC DATA IDENTIFICATION, GEODETIC CLASSIFICATION, ORGANIZATION, UPDATING, AND EXTRACTING SPATIALLY REFERENCED DATA RECORDS - A method of geodetic information acquisition and a computer program product having computer code for performing the method are provided. The method includes identifying a file having multiple records, where the records contain spatially referenced data. The records are classified into header records and data records. Each data record is divided into fields. Patterns of fields consistent with sets of the data records are identified. The records are associated with a format based on the patterns of fields. At least some of the spatially referenced data is extracted from the records according to the format. | 08-01-2013 |
20130204878 | MUSIC RECOMMENDATION METHOD WITH RESPECT TO MESSAGE SERVICE - A music recommendation method and a music recommendation system are provided. The music recommendation method includes: selecting music files according to a theme of the message service and music, a mood of the music, a similarity between content of the message service and content of the music; and recommending selected music files to a user. | 08-08-2013 |
20130212112 | METHOD AND APPARATUS FOR MANAGING FILES IN AN ONLINE ACCOUNT - An approach is provided for managing files in an online account. A file access platform causes, at least in part, retrieval of a file associated with a first communication stored in an online account. Next, the file access platform determines whether the file is modified after the retrieval and generates a second communication including a modified version of the file based, at least in part, on the determination. Then, the file access platform causes, at least in part, transmission of the second communication including the modified version to the online account. | 08-15-2013 |
20130218896 | Indexing Quoted Text in Messages in Conversations to Support Advanced Conversation-Based Searching - A conversation server system having one or more processors and memory stores a plurality of index components in an index. The server receives a first message, associates the first message with a conversation having one or more other messages and identifies quoted text in the message based on text that occurs in one or more of the other messages. The server stores, in the index, a plurality of first-message index components including one or more index components that correspond to terms in original text of the first message and one or more index components that correspond to terms that occur in the quoted text, where the first-message index components for original text of the first message are distinguished from the first-message index components for quoted text of the first message in the index. | 08-22-2013 |
20130218897 | Enabling Search for Conversations with Two Messages Each Having a Query Term - A conversation server system having one or more processors and memory stores a plurality of index components in an index. The server associates a first message having a first term with a conversation that includes at least a second message. The first term is not included in the second message and the second message includes a second term that is not included in the first message. The server stores, in the index, a plurality of index components for a same referenced object, including an index component indicative of the first term and an index component indicative of the second term. In some embodiments the same referenced object is associated with index components for a first sender of the first message and a second sender of the second message, so that a search for a conversation with messages from the first sender and the second sender retrieves the referenced object. | 08-22-2013 |
20130218898 | MECHANISMS FOR METADATA SEARCH IN ENTERPRISE APPLICATIONS - Metadata search is enhanced by utilizing relationship data indicating relationships between metadata items. A server generates an index mapping metadata items to terms associated with the metadata items and a graph describing relationships between each of the metadata items. When the server receives a search request, the server locates a candidate set of the metadata items based on the search term(s) and the index. The server performs a link analysis of the graph to determine a relationship score for each metadata item. For each particular metadata item in the candidate set of the metadata items, the server calculates a ranking score based at least on the relationship score for the particular metadata item. The server generates a ranked result set based on comparing the ranking scores for the candidate set of metadata items. The server then provides information indicating the ranked result set in response to the search request. | 08-22-2013 |
20130218899 | MECHANISMS FOR SEARCHING ENTERPRISE DATA GRAPHS - Highly relevant search results for unstructured queries are efficiently identified from normalized data. A server graphs relationships between each of the data objects. For each search term, the server identifies a candidate set of data objects mapped to the term. The server calculates priority scores for the candidate data objects based at least in part on one or more of: a link analysis of the graph; or an analysis of metadata describing structural constraints upon the candidate data objects. Based on the graph, the server identifies one or more search result subgraphs, each comprising at least one data object from each of the candidate sets. The server looks for subgraphs in an order that is based on the priority scores for the candidate data objects. The server may terminate its search early, in part because prioritizing the candidate data objects increases the likelihood of receiving relevant search results first. | 08-22-2013 |
20130218900 | INDEX SEARCHING USING A BLOOM FILTER - A system for index searching comprises a first index, a second index, a processor, and a memory. The processor is configured to determine, using a Bloom filter for the second index of fingerprints, whether the Bloom filter indicates that an entry corresponding to a segment has been previously stored in the second index. The processor is further configured to, in the event that the Bloom filter indicates that an entry corresponding to the segment has been previously stored in the second index, retrieve a second index entry associated with the segment that has been previously stored. The memory is coupled to the processor and configured to provide the processor with instructions. | 08-22-2013 |
20130226928 | System And Method For Mapping Text Phrases To Geographical Locations - A system and method for mapping text phrases to geographical locations is provided. Entities, each comprising one of a location, person, and place, are identified in one or more documents. Possible candidate locations associated with each entity are determined. An initial score is assigned to each location. The initial scores are adjusted and the candidate location with the highest adjusted score is selected for each entity. The selected candidate location is applied to all occurrences of the entity in the documents. | 08-29-2013 |
20130226929 | SYSTEM AND METHOD FOR GENERATING POSTAL ADDRESSED INDEXED EMAIL DATABASE - A system and method for providing a database of physical postal addresses indexed to a general email identifier involves obtaining a plurality of physical postal addresses. The system and method validates whether each of the plurality of physical postal addresses are a valid physical postal address. The system and method next generates a general email address for each of the validated plurality of physical postal addresses and indexes each of the general email addresses to one of the plurality of physical postal addresses. | 08-29-2013 |
20130226930 | Apparatus and Methods For Indexing Multimedia Content - A method, medium, and apparatus are disclosed for indexing multimedia content by a computer. The method comprises segmenting the multimedia content into a plurality of segments. For each segment, the method identifies one or more features present in the segment, wherein the features are of respective media types. The method then identifies, for each identified feature in each segment, one or more respective keywords associated the identified feature. Then, the method determines, for each identified keyword associated with an identified feature in a given segment, a respective relevance of the keyword to the given segment. The respective relevance is dependent on a weight associated with the respective media type of the identified feature. | 08-29-2013 |
20130226931 | METHOD AND SYSTEM FOR APPEND-ONLY STORAGE AND RETRIEVAL OF INFORMATION - A method, a system, and a computer program product for storing and retrieving information are provided. The system receives information. Thereafter, the system organizes the information for optimal storage and retrieval based on qualities of a storage medium. The organizing may include optimizing the information for sequential read and/or write access. The information may be organized in an append-only manner. Once the information has been organized, the apparatus presents and/or stores the organized information. | 08-29-2013 |
20130226932 | Systems and Methods for Real Time Data Management in a Collaborative Environment - Systems and methods for real time data management in a collaborative environment. In one embodiment, the systems and methods dynamically link stakeholders, through a collaborative environment, to current and updated data. The systems and methods also cross-reference data objects between stakeholders for calling data objects by a common name, and determine a preferred method for each operation related to processing data for a data object. | 08-29-2013 |
20130232151 | Aggregation of Load Profiles for Consumption Management Systems - Methods and systems using aggregated electrical system load profiles in determining additional load profiles, and determining consumption management system characteristics, features, and operating requirements at a site are disclosed, along with methods of generating and maintaining databases of load profile information and consumption management system requirements. By using some embodiments of the methods and systems described herein, it is faster and easier to design and implement effective consumption management systems, to determine problematic electrical systems at a site, and to diminish anomalous consumption patterns in an aggregate unprofiled electrical system load profile. | 09-05-2013 |
20130232152 | LISTING DATA OBJECTS USING A HIERARCHICAL DISPERSED STORAGE INDEX - A method begins by a dispersed storage (DS) processing module receiving a request for a data index list that identifies data objects having one or more common attributes, searching a hierarchical ordered index structure to identify a data object level index node corresponding to a first data object boundary match, and determining whether the data object level index node corresponds to a second data object boundary match. When the data object level index node does not correspond to the second data object boundary match, the method continues with the DS processing module searching an adjacent data object level index node to determine whether the adjacent data object level index node corresponds to the second data object boundary match. When the adjacent data object level index node corresponds to the second data object boundary match, the method continues with the DS processing module generating the data index list. | 09-05-2013 |
20130232153 | MODIFYING AN INDEX NODE OF A HIERARCHICAL DISPERSED STORAGE INDEX - A method begins by a dispersed storage (DS) processing module determining to create or modify a data object index key of a data object regarding an attribute of a plurality of attribute categories. The method continues with the DS processing module entering a process to update a data object level index node by retrieving the data object level index node from a set of dispersed storage (DS) units. The method continues with the DS processing module updating and encoding the data object level index node to produce a set of slices. The method continues with the DS processing module generating a set of write commands to write the set of slices to the set of DS units. When less than a threshold number of the set of write commands are successfully executed by the set of DS units, repeating the process to update the data object level index node. | 09-05-2013 |
20130238627 | INTEGRATING SEARCHES - Methods, systems, and computer-storage media having computer-usable instructions embodied thereon, for integrating searches are provided. An entity index may be compiled that includes entity files for a plurality of identified entities such that any information known about a single entity is contained in a single entity file and is easily accessible. Web indexes, including web page information, may be referenced in order to associate web pages with entities, or entity files. Once identified as related to an entity, a web page may be associated with an entity identifier that is associated with the related entity such that a search query for the identified entity results in both entity information for the entity and web pages associated with the entity. | 09-12-2013 |
20130238628 | REFRESHING A FULL-TEXT SEARCH INDEX IN A PARTITIONED DATABASE - A full-text search index for a table stored in a set of partitions of a database is refreshed. At a first point in time, table row distribution information and a table partitioning list are captured. At a second point in time, it is determined that an index refresh operation is to be performed. The captured table row distribution information is compared with current table row distribution information to determine whether there is at least one difference. In response to the determining that there is at least one difference, a text index distribution is re-aligned for any table rows that are affected by the change in table data row distribution by: generating a new table partitioning list, and indexing documents associated with table data rows that moved from one partition in the captured table partitioning list to another partition in the new table partitioning list. | 09-12-2013 |
20130238629 | SEARCHING NETWORK CONFIGURATION DATA - A programmed hardware network configuration file repository indexer is configured with a network-configuration-specific index-operation rule set. In another example, a network-configuration-specific index-operation rule set can be used in generating an index to a network configuration file repository. In the latter example, the index and the index-operation rule set is used in searching the network configuration file repository. | 09-12-2013 |
20130238630 | SPLITTING AN INDEX NODE OF A HIERARCHICAL DISPERSED STORAGE INDEX - A method begins by a dispersed storage (DS) processing module determining to remove data object index entries from a data object level index node, identifying data object index entries to extract, and creating a temporary sibling data object level index node to include the identified data object index entries. The method continues with the DS processing module initiating updating of an hierarchical ordered index structure by identifying an address for storing the temporary sibling data object level index node, setting up linking the temporary sibling data object level index node to a next level node and to an adjacent data object level index node, and determining whether a change has occurred. When the change has not occurred, the method continues with the DS processing module removing the data object index entries from the data object level index node and commencing the updating of the hierarchical ordered index structure. | 09-12-2013 |
20130246435 | FRAMEWORK FOR DOCUMENT KNOWLEDGE EXTRACTION - A knowledge extraction framework may iteratively enrich an ontology that is used to classify structured knowledge obtained from web pages based on structured knowledge previously acquired from other web pages. The framework may enable a user to define the ontology for extracting structured knowledge from a plurality of web pages. The framework applies the ontology using a supervised extraction algorithm to extract seed information from a set of web pages. The framework further applies an unsupervised extraction algorithm to extract the structured knowledge from an additional set of web pages. The framework subsequently maps the structured knowledge to the ontology based on the seed information to enrich the ontology. | 09-19-2013 |
20130246436 | SYSTEM AND METHOD FOR DOCUMENT INDEXING AND DRAWING ANNOTATION - A system and method for parsing a machine-readable document having associated drawing figures with components labeled by references, to identify the occurrence of the references for generating a dynamic reference index table, and for either automatically annotating the references in the associated drawing figures with descriptive words or phrases cross-referenced to the references within the generated dynamic reference index table, or generating a reference usage report identifying inconsistencies and/or errors within the document associated with the identified reference occurrences. | 09-19-2013 |
20130246437 | EXTENDED DATABASE SEARCH - Methods, program products, and systems implementing integrated repository of structured and unstructured data are disclosed. An integrated environment is provided for accessing structured data (e.g., data of a relational database) and unstructured data (e.g., data stored in a text or binary file), including creating, managing, modifying, and searching the structured data and unstructured data. The integrated environment can include an integrated user interface, a set of commands and application programming interface (API), and storage for a relational database and a document repository. The integrated environment can include a database abstraction layer that allows database operations on both the structured data and the unstructured data. | 09-19-2013 |
20130254208 | METHOD AND SYSTEM FOR INDEXING IN DATASTORES - A method, system, apparatus, and computer program product for indexing information stored in data-stores. The system receives a new index request. The system creates an index in response to the request. The new index includes at least one segment, a first flag, and a last flag. Each segment comprises index summary information. The system then stores the index in memory. | 09-26-2013 |
20130254209 | CONSENSUS SEARCH DEVICE AND METHOD - A consensus search device is provided, which includes a semantic data indexing unit configured to divide text data of an electronic document written about at least one object into segments, to extract at least one semantic descriptor from the each segmented text data, and to generate a semantic data index matching each of the extracted semantic descriptor to the object and the each segmented text data. The consensus search device also includes a semantic searching unit configured to retrieve an object related to a query, based on the semantic data index. The text data is divided into the segments by units of meaning. | 09-26-2013 |
20130254210 | INDEX SELECTION IN A MULTI-SYSTEM DATABASE MANAGEMENT SYSTEM - A system includes a multi-system database management system having a plurality of database systems. An index selection subsystem combines sets of query information from respective ones of the plurality of database systems into a workload. The index selection subsystem then generates candidate indexes from the workload, and selects recommended indexes from the candidate indexes based on one or more criteria. | 09-26-2013 |
20130262469 | DEMOGRAPHIC AND MEDIA PREFERENCE PREDICTION USING MEDIA CONTENT DATA ANALYSIS - Methods, systems and computer program products are provided for predicting data. A name or title is obtained from a taste profile. There is an index into a data set based on the name or title, and a set of terms and corresponding term weights associated with the name or title are retrieved. A sparse vector is constructed based on the set of terms and term weights. The sparse vector is input to a training model including target data. The target data includes a subset of test data which has a correspondence to a predetermined target metric of data. A respective binary value and confidence level is output for each term, corresponding to an association between the term and the target metric. | 10-03-2013 |
20130268536 | RANKING AND ORDERING OF USER GENERATED CONTENT - Ordering user generated content includes: calculating a rank of the user generated content associated with an on-line property, said the calculating based on a calculated freshness score and reputation metrics; storing the calculated rank as an indexed field; receiving a request to serve the on-line property; at run-time, sorting the user generated content by rank using the indexed field; and presented presenting the sorted user generated content along with the requested on-line property. | 10-10-2013 |
20130268537 | METHOD, SYSTEM AND PROGRAM FOR INFORMATION RE-ORGANIZATION - An information re-organization system includes a plurality of counters coordinated to meaning attributes, and a re-organization incentive notification unit that updates, in case the information stored in preset storage unit has been updated, value of a counter out of the multiple counters that has the meaning attribute associated with contents updated. The information re-organization system also includes an information re-organization processor that executes, in case the value of the counter section updated has met one of a number of predetermined conditions for information re-organization, a processing for information re-organization corresponding to the condition for information re-organization on the information stored in the preset storage unit. | 10-10-2013 |
20130268538 | Database and Index Organization for Enhanced Document Retrieval - A customized, topical database and methods for constructing and using such a database are provided. Selection and indexing of articles is done by experts in the topic with which the database is concerned. As a result, articles are indexed in a manner that allows facile, rapid retrieval of highly relevant articles with few or no false positives. | 10-10-2013 |
20130275434 | DEVELOPING IMPLICIT METADATA FOR DATA STORES - A system enables metadata to be gathered about a data store beginning from the creation and generation of the data store, through subsequent use of the data store. This metadata can include keywords related to the data store and data appearing within the data store. Thus, keywords and other metadata can be generated without owner/creator intervention, with enough semantic meaning to make a discovery process associated with the data store much easier and efficient. Usage of or communication regarding a data store are monitored and keywords are extracted from the usage or communication. The keywords are then written to otherwise associated with metadata of the data store. During searching, keywords in the metadata are made available to be used to attempt to match query terms entered by a searcher. | 10-17-2013 |
20130275435 | CODE BASED INDEXING AND INFORMATION RETRIEVAL SYSTEM - An indexing and information retrieval system includes a database configured for storing a record of content and indexing the record with a code and property of predetermined location information, a member device configured for inputting the record to the database with the code and the corresponding property of predetermined location information, and a client device configured for retrieving the records from the database with the code. The code is associated with a predetermined location. | 10-17-2013 |
20130282728 | METHOD OF QUERYING A DATA - A method of querying a data structure having a plurality of records and data items is disclosed. Each record has an index identifying the record and a collection of paths pointing to a selection of data items. Each path corresponds to a data variable. The method includes receiving a query to sort the records according to a data variable. Creating a first array and defining start points for data items are performed. Storing the index of the record after the start point and determining an end point for each start point are performed. Determining locations in each sequence of locations and returning the locations to designate where the record indexes may be found are performed. | 10-24-2013 |
20130282729 | INTELLIGENT HIGHLIGHTING SYSTEM - An intelligent highlighting system for automatically determining where the author has marked a manuscript for insertion of multimedia, and then inserting the multimedia into digital publications is provided. In the system, digital publication are parsed to identify multimedia references, such as footnotes, comments and hyperlinks. A database table index containing the parsed results is generated and stored in a projected identification database. Next, metadata specific to the parsing results are also stored in the project identification database and a pre-determined highlighting method is identified. The metadata and highlighting method are compiled into an intelligent highlighting object, which is used to embed a hidden multimedia path within the keywords. The keywords with the embedded multimedia are then stored as a binary file. Using the data stored in the project identification database, the binary file and e-reader specifications, the multimedia files are converted to the appropriate format compatible with the e-readers. | 10-24-2013 |
20130282730 | DERIVING DOCUMENT SIMILARITY INDICES - Methods, systems, and computer program products are provided for deriving and updating document similarity indices for a plurality of documents. The number of maintained similarities can be controlled to conserve CPU and storage resources. | 10-24-2013 |
20130282731 | INFORMATION SEARCH DEVICE - An information search device in accordance with the present invention is configured such that a search request acceptance means accepts, object search condition information and associated information search condition information while a search means stores, into the main storage portion from the data storage portion, the index of the analysis object information including the analysis object corresponding to the object search condition information, and the index of the associated information including the associated information corresponding to the associated information search condition information, searches for the analysis object identified by the object search condition information from the main storage portion and stores, into the main storage portion, the index associating the associated information with the analysis object information corresponding to the searched analysis object, and searches for the information corresponding to the search condition information from the respective indexes stored in the main storage portion. | 10-24-2013 |
20130290343 | MULTIPLE VARIABLE COVERAGE MEMORY FOR DATABASE INDEXING - Technologies related to multiple variable coverage memory for database indexing are generally described. Disclosed methods may be performed to implement high-speed database access to digital service provider customer data as the digital service provider builds an optimized index for a database. Initially, the digital service provider may maintain an appropriate level of service by keeping a relatively slow performing, basic index in a relatively high performance first memory. As the digital service provider builds the optimized index, the digital service provider may maintain the appropriate level of service by gradually shifting from use of the first memory to the use of a relatively lower performance second memory. | 10-31-2013 |
20130290344 | UPDATING A SEARCH INDEX USED TO FACILITATE APPLICATION SEARCHES - Techniques are disclosed for maintaining a search index used to support application searches. The techniques include receiving application data from one or more data sources, identifying an application referenced in the application data. The techniques further include extracting a plurality of attributes corresponding to the application from the application data based on an application specific search schema. The techniques also include generating an application representation of the application based on the plurality of attributes and the application specific search schema. The techniques also include updating a search index with the application representation. | 10-31-2013 |
20130297613 | INDEXING BASED ON KEY RANGES - The present invention is a fast indexing technique that builds an indexing structure based on multi-level key ranges typically for large data storage systems. The invention is explained based on the B | 11-07-2013 |
20130297614 | METHODS FOR FACILITATING PRESERVATION AND RETRIEVAL OF HETEROGENEOUS CONTENT AND DEVICES THEREOF - A method, non-transitory computer readable medium, and content management apparatus receives a storage request including content and context information associated with the received content, the context information comprising at least metadata and information for one or more user tags, wherein the user tags are customizable and established by an administrator. One of a plurality of types of content is identified for the received content. Searchable information is extracted from the received content based on the identified one of the plurality of types of content. A searchable index is generated for the received content based on at least the extracted searchable information and the context information associated with the received content. The received content is stored in a manner which is retrievable based on one or more associations in the generated searchable index. | 11-07-2013 |
20130297615 | CLUSTERED INDEX WITH DIFFERENTIATED SUBFIELDS - A database system uses indexes to improve performance. The system can use one or more of the following: differentiated subfields for index clustering; set function calculations to reduce the amount of I/O; and/or using an inferred index in a circular table. | 11-07-2013 |
20130297616 | WEB DISCOVERY METHOD FOR ASSESSING INGREDIENTS FOR REGULATORY COMPLIANCE - A regulatory information retrieval system is disclosed. In one embodiment, the method comprises building ingredient data for manufacturing businesses. Then, those ingredients are compared with regulatory requirements. Relevant requirements are identified regarding permitted usage are extracted and converted to a guideline document. The guideline document can be viewed in a variety of ways, such as a detailed view, or an overview of repositories searched. The guidelines may then be converted to a file format that can be imported by other business applications. | 11-07-2013 |
20130304744 | DIFFERENTIAL DATAFLOW - The techniques discussed herein efficiently perform data-parallel computations on collections of data by implementing a differential dataflow model that performs computations on differences in the collections of data. The techniques discussed herein describe defined operators for use in a data-parallel program that performs the computations on the determined differences between the collections of data by creating a lattice and indexing the differences in the collection of data according to the lattice. | 11-14-2013 |
20130304745 | STORING INDEXED DATA TO A DISPERSED STORAGE NETWORK - A method begins by a dispersed storage (DS) processing module storing a data portion in a dispersed storage network (DSN), where the data portion is associated with a set of data index keys, a version index key, and with a set of portion index keys. The method continues with the DS processing module traversing, based on the set of data index keys, a plurality of index structures to determine whether an object retrieval structure exists for the data portion. When the object retrieval structure exists, the method continues with the DS processing module updating the object retrieval structure by determining whether a version record exists that corresponds to the version index key and when the version record exists, creating, within the version record, a new data portion record for the data portion to include an identifier of the data portion and the set of portion index keys. | 11-14-2013 |
20130311478 | SYSTEMS, METHODS, AND APPARATUS TO MONITOR MEDIA PRESENTATIONS - Systems, methods and apparatus to monitor media presentations are disclosed. An example method includes identifying a media presentation device associated with a Domain Name Service (DNS) query based on a public Internet Protocol (IP) address that originated the DNS query and a destination address of the DNS query. A media source is identified based on a domain name requested in the DNS query. Identification of the media presented in association with the DNS query based on at least one of tagging data and metering data is attempted. A panelist associated with the media presentation device is credited with exposure to the identified media from the media source via the media presentation device when the media is identified. The panelist associated with the media presentation device is credited with exposure to media from the media source via the media presentation device when the media is not identified. | 11-21-2013 |
20130311479 | Determining a Cause of an Incident Based on Text Analytics of Documents - According to one embodiment of the present invention, a system analyzes one or more change records based on text analytics using dictionaries and rules for the analysis in order to generate an index of analyzed data that represents the one or more change records. The change records each include a change and corresponding time frame for occurrence of the change. Information from a request is applied to the index of analyzed data to determine one or more candidate causes for the incident and the corresponding time frame for occurrence of the change. A time associated with the request is correlated with the corresponding time frame for occurrence of the change to identify the one or more candidate causes in the one or more change records as causes for the incident. Embodiments of the present invention further include a method and computer program product for determining causes of an incident. | 11-21-2013 |
20130311480 | SENSOR DATA LOCATING - A method, an apparatus, and a system for locating sensor data. The method includes the steps of: obtaining an index table; intercepting a query for sensor data in runtime; extracting a characteristic parameter from a query condition; locating a block identifier of matching sensor data storage blocks in the index table by using the characteristic parameter; and loading the storage blocks into a memory space of a working processor; where the index table contains mapping relationships between block identifiers of sensor data storage blocks and characteristic attributes of sensor data. | 11-21-2013 |
20130311481 | Determining a Cause of an Incident Based on Text Analytics of Documents - A system for determining a cause of an incident analyzes one or more change records based on text analytics using a dictionary and rules for the analysis in order to generate an index of analyzed data that represents the one or more change records. The change records each include a change and corresponding time frame for occurrence of the change. Information from a request is applied to the index of analyzed data to determine one or more candidate causes for the incident and the corresponding time frame for occurrence of the change. A time associated with the request is correlated with the corresponding time frame for occurrence of the change to identify the one or more candidate causes in the one or more change records as causes for the incident. Embodiments of the present invention further include a method and computer program product for determining causes of an incident. | 11-21-2013 |
20130318090 | SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS FOR FAST AND SCALABLE PROXIMAL SEARCH FOR SEARCH QUERIES - Embodiments of the invention provide a system, method and computer program products for information retrieval from multiple documents by proximity searching for search queries. A method includes generating an index for the multiple documents, wherein the index includes words in snippets in the documents. An input search query is processed against the index by searching query terms over the snippets to introduce term proximity information implicitly in the information retrieval. Results of multiple sentence level search operations are combined as output. | 11-28-2013 |
20130318091 | SYSTEMS, METHODS, AND COMPUTER PROGRAM PRODUCTS FOR FAST AND SCALABLE PROXIMAL SEARCH FOR SEARCH QUERIES - Embodiments of the invention provide a method and computer program products for information retrieval from multiple documents by proximity searching for search queries. A method includes generating an index for the multiple documents, wherein the index includes words in snippets in the documents. An input search query is processed against the index by searching query terms over the snippets to introduce term proximity information implicitly in the information retrieval. Results of multiple sentence level search operations are combined as output. | 11-28-2013 |
20130318092 | Method and System for Efficient Large-Scale Social Search - To answer search queries on a social network rich with user-generated content, it is desirable to give a higher ranking to content that is closer to the individual issuing the query. Queries occur at nodes in the network, documents are also created by nodes in the same network, and a goal is to find the document that matches the query and is closest in network distance to the node issuing the query. Embodiments of the present invention provide solutions to this problem. After a some offline pre-processing, the system according to an embodiment of the present invention allows for social index operations (e.g., social search queries and insertion and deletion of words into and from a document at any node). | 11-28-2013 |
20130318093 | SHORT STRING COMPRESSION - Systems and techniques are disclosed to express sequences of codes, and in particular sequences of ASCII characters, in a lossless compressed format. The techniques may include dividing a universe of expressible codes into smaller subsets, called code sets, such that every code exists within one code set, but no code exists within two code sets. The code sets are then utilized for compression based on the heuristic that it is more likely that a next code in the sequences of codes is in the same code set as a previous code in the sequences of codes, rather than that the next code in sequences of codes being in any other code set (sentence structure). | 11-28-2013 |
20130318094 | INDEXING AND EXECUTION OF SOFTWARE APPLICATIONS IN A NETWORK - A method is provided for managing the indexing of a software application, implemented by a multimedia content server (DMS). The method includes the following phases: indexing at least one software application as content in a content indexing database, by recording metadata indicating that the thus indexed content is a software application and recording a location address of a data file comprising a software module designed to execute the at least one software application; and transmitting the metadata and the location address to a drive device (DMC), serving to select an indexed content to be delivered and to drive the delivery of a selected content. | 11-28-2013 |
20130318095 | DISTRIBUTED COMPUTING ENVIRONMENT FOR DATA CAPTURE, SEARCH AND ANALYTICS - An application engine of a distributed data management system includes acquisition applications which execute to obtain portions of source data from different data sources. Each portion of source data is mapped to an interlingual representation. The application engine transmits data objects including the portions of source data and corresponding interlingual representations to a data container. For each data object, the data container stores the source data and the interlingual representation in one or more databases. The data container also parses the source data of the data object according to one or more of a full-text indexing technique, a semantic indexing technique, or a structured metadata indexing technique, and stores the indexed data. A database client may receive a search query and search the source data and interlingual representations stored in the databases. | 11-28-2013 |
20130325869 | CREATING A CONTENT INDEX USING DATA ON USER ACTIONS - Personalized content is generated from different media items using a content index. The content index is generated or updated by identifying segments of media items that are of particular interest to users. User interactions with the media items are analyzed and metadata of segments of media items that are determined to be of particular interest to the users is recorded. The parameters associated with a request for personalized content for a user are matched with the recorded metadata to identify relevant media items or segments of media items which are transmitted to the user as the personalized content. | 12-05-2013 |
20130325870 | USING CONTENT - Among other things, the ability of people and entities to produce, distribute, and use text, images, video, and other items of digital content is enhanced by providing software tools that enable them to (a) clip items of the digital content on any platform that is capable of presenting the digital content, (b) store copies of the clipped items along with copies of items clipped by other people or entities, in a common storage place controlled by a host, (c) form and store meshes of tags to represent their mindsets about items of content. The tags include primary tags that express their direct observations about the content and secondary tags that express their observations about the primary tags and the secondary tags. Meshes of the tags are made available to the people who formed them and, if permitted by them, to other people and entities for use in understanding their mindsets and in producing, delivering, and using digital content. | 12-05-2013 |
20130325871 | Method and System for Collecting and Organizing Data Corresponding to an Event - A system and method for analyzing data from a plurality of computer environments. The computer environments are authenticated and data is imported to a memory location. The data is converted into a uniform format to enable expedited searching by one or more authenticated users. The data may be marked so that a user may determine which computer environment provided the data. The system may also create one or more indexes of the data to assist one or more users in searching the data. | 12-05-2013 |
20130325872 | Method and System for Collecting and Organizing Data Corresponding to an Event - A system and method for analyzing data from a plurality of computer environments. The computer environments are authenticated and data is imported to a memory location. The data is converted into a uniform format to enable expedited searching by one or more authenticated users. The data may be marked so that a user may determine which computer environment provided the data. The system may also create one or more indexes of the data to assist one or more users in searching the data. | 12-05-2013 |
20130325873 | SYSTEMS AND METHODS FOR LOAD-BALANCING BY SECONDARY PROCESSORS IN PARALLELIZED INDEXING - The invention relates to electronic indexing, and more particularly, to the parallelization of indexing. Systems and methods of the invention index data archives by breaking a job into work items and sending the work items to multiple processors that can each determine whether to index data associated with the work item or to create a new work item and have a different processor index the data. This gives the system an internal load-balancing that results in indexing jobs during which no processor stands idle while another processor indexes data of unexpected complexity. | 12-05-2013 |
20130332461 | COMPUTER-BASED CONFIDENTIAL DISCLOSURE SEARCH TOOL - A computer-based apparatus for searching confidential documents, including a computer with a memory element and a processor to execute instructions stored in the memory to receive a confidential document and related non-confidential information from a source entity. The processor executes the instructions to: store the confidential document and non-confidential information in the memory element; and restrict access to the confidential document stored in the memory element to the source entity and a library entity only, or to the first source entity only. The processor executes the computer readable instructions to: receive a search request from a searching entity including a search parameter; identify the search parameter as being applicable to the confidential document; and transmit for access by the searching entity, the non-confidential information. The library entity is different from the source entity. | 12-12-2013 |
20130332462 | GENERATING CONTENT RECOMMENDATIONS - A system to generate content recommendations by identifying content and selecting a content entry for the content. The system comprises identifying a keyword in the content entry, generating a tag for the content based on the keyword, generating a plurality of recommendations based on the tag, and displaying the recommendations. | 12-12-2013 |
20130332463 | Method and System of Content Distribution and Broadcast - Methods and systems allowing content owners to make their content available to broadcasters over a network or communications medium such as via the Internet. | 12-12-2013 |
20130332464 | Adding Information or Functionality to a Rendered Document via Association with an Electronic Counterpart - An action plan data structure for one or more selected rendered documents is described. The data structure contains information specifying an action to perform automatically in response to a text capture from any of the selected rendered documents. | 12-12-2013 |
20130332465 | DATABASE MANAGEMENT DEVICE AND DATABASE MANAGEMENT METHOD - A database management device includes: plural index blocks that have an index entry and an access counter including a read counter and a write counter; an access management unit that updates the read counter in response to a read access to an index block and updates the write counter in response to an update access to the index block; an extended block manipulation unit that acquires an extended block not identified with an index entry of another index block and serving as a destination of storing a new index entry on the basis of a result of comparison between the read counter and the write counter of a leaf block serving as a target of inserting the new index entry, and sets identification information for identifying the extended block to the leaf block serving as the target of insertion. | 12-12-2013 |
20130339365 | INTEGRATED DEVELOPMENT ENVIRONMENT-BASED REPOSITORY SEARCHING IN A NETWORKED COMPUTING ENVIRONMENT - Embodiments of the present invention provide an approach for integrated development environment (IDE)-based repository searching (e.g., for library elements such as classes and/or functions) in a networked computing environment. In a typical embodiment, a first program code file is received from a first integrated development environment (IDE). The first program file may be associated with a set of attributes as stored in an annotation, header, or the like. Regardless, the first program file may be parsed and indexed into a repository based on the set of attributes. A search request may then be received from a second IDE. Based on the search request and the set of attributes, a matching program code file may then be identified as stored in the repository. Once identified, the matching program code file may be transmitted/communicated to the second IDE to fulfill the search request. | 12-19-2013 |
20130339366 | METHOD AND SYSTEM FOR CREATING INDICES AND LOADING KEY-VALUE PAIRS FOR NOSQL DATABASES - Systems and methods are provided for creating indices and loading key-value pairs for NoSQL databases. Attributes are created that correspond to records in a NoSQL database based on corresponding record fields. An index is created based on the attributes. A memory is loaded with attributes that correspond to a subset of the index as keys in a key-value pair and identifiers that correspond to records that correspond to the attributes as values in the key-value pair. The attributes that correspond to the subset of the index are sorted in the memory. Any duplicate attributes are identified from the sorted attributes in the memory. Any identifiers that correspond to any duplicate attributes also identify records in the NoSQL database to be evaluated as potential duplicate records. | 12-19-2013 |
20130339367 | METHOD AND SYSTEM FOR PREFERENTIAL ACCESSING OF ONE OR MORE CRITICAL ENTITIES - Embodiment is related to a method and system for preferentially accessing of one or more critical entities. One or more event data sets, related to one or more entities stored in a database, are received from one or more data sources by a processing engine. The entities, which are associated to the received event data sets are identified. The identified entities are compared with a preconfigured list of critical entities to determine at least one of the one or more entities matching with at least one critical entity. The preconfigured list of critical entities is stored in a pattern based selective index. The critical entities are stored in the graph database and indexed by the pattern based selective index. The entities are accessed, using address information associated to the respective entities, from the database when identified entities are matched with the critical entity in preconfigured list of critical entities. | 12-19-2013 |
20130339368 | METHOD AND SYSTEM FOR HANDLING NON-PRESENCE OF ELEMENTS OR ATTRIBUTES IN SEMI-STRUCTURED DATA - The invention relates to detecting non-presence of nodes in a document of semi-structured data and to creating a values index which properly reflects this non-presence. The document of semi-structured data, comprising multiple hierarchically organized nodes, is scanned for values (step | 12-19-2013 |
20130346414 | Correlation Engine and Method for Granular Meta-Content Having Arbitrary Non-Uniform Granularity - One disclosed method includes receiving correlation instructions related to a plurality of meta-content elements that are associated with a primary content. The primary content may be multimedia content such as, but not limited to, an audiovisual content. The method includes performing a correlation in response to receiving the instructions. The correlation is between the meta-content elements, where the meta-content elements each have an arbitrary granularity defining meta-content segments. The method returns a result based on the correlation. Another disclosed method include receiving a request having correlation instructions related to a plurality of meta-content elements, where the meta-content elements are associated with a primary content. Again, each meta-content element has an arbitrary granularity defining meta-content segments. The method includes determining, in response to receiving the request, a composite of meta-content segments of the plurality of meta-content elements, based on the correlation instructions. | 12-26-2013 |
20130346415 | SYSTEMS AND METHODS MULTI-KEY ACCESS TO DATA - A computer-implemented method of storing data for fast lookup comprises forming a first and a second array of pointers, forming a record to store, the record comprising fields for, a first list pointer, a second list pointer, which is not the first field in the record, a first key, and a second key. The method further comprises determining a first index based at least in part the first key, setting the value of the pointer at the first index in the first array to the location of the first pointer field of the record, determining a second index based at least in part the second key, and setting the value of the pointer at the second index in the second array to the location of the second pointer field of the record. | 12-26-2013 |
20130346416 | Long-Query Retrieval - Described herein is a technology that facilitates efficient large-scale similarity-based retrieval. In several embodiments documents, images, and/or other multimedia files are compactly represented and efficiently indexed to enable robust search using a long-query in a large-scale corpus. As described herein, these techniques include performing decomposition of a file, e.g., an image, a document containing an image, or a document-like representation of an image. The techniques use dimension reduction to obtain three parts, low-dimensional representations (major semantics), file specific terms (minor semantics), and background words, representing the major semantics in a feature vector and the minor semantics as keywords. Using the techniques described, file vectors are matched in a topic model and the results ranked based on the keywords. | 12-26-2013 |
20130346417 | STREAM DATA ANOMALY DETECTION METHOD AND DEVICE - The present invention is a stream data anomaly detection method for detecting an anomaly in input data received by a computer. The computer has: an input module for receiving the input data; an anomaly detection unit for detecting an anomaly in the input data; and an index management unit for calculating an index for the detection of an anomaly in the input data. The anomaly detection unit compares the received input data with the index and detects an anomaly in the input data when the difference between the value of the input data and the index satisfies a predetermined condition. The index management unit determines whether or not the index should be recalculated, and, when the index is determined to be recalculated, the index management unit recalculates a new index on the basis of the input data and substitutes the index with the calculated index. | 12-26-2013 |
20140006410 | REAL-TIME, COOPERATIVE, ADAPTIVE AND PERSISTENT SEARCH SYSTEM | 01-02-2014 |
20140006411 | METHOD AND APPARATUS FOR MULTIDIMENSIONAL DATA STORAGE AND FILE SYSTEM WITH A DYNAMIC ORDERED TREE STRUCTURE | 01-02-2014 |
20140006412 | Composite Term Index for Graph Data | 01-02-2014 |
20140012857 | Retrieving Electronic Documents by Converting Them to Synthetic Text - The present invention relies on the two-dimensional information in documents and encodes two-dimensional structures into a one-dimensional synthetic language such that two-dimensional documents can be searched at text search speed. The system comprises: an indexing module, a retrieval module, an encoder, a quantization module, a retrieval engine and a control module coupled by a bus. Electronic documents are first indexed by the indexing module and stored as a synthetic text library. The retrieval module then converts an input image to synthetic text and searches for matches to the synthetic text in the synthetic text library. The matches can be in turn used to retrieve the corresponding electronic documents. In one or more embodiments, the present invention includes a method for comparing the synthetic text to documents that have been converted to synthetic text for a match. | 01-09-2014 |
20140019454 | Systems and Methods for Caching Data Object Identifiers - A server receives a query specifying filter criteria from a client. The server obtains the object identifiers (IDs) for the data objects satisfying the query from one or more object identifier caches. The data objects are retrieved from one or more data sources using lookups based on object identifiers (IDs) obtained from the one or more object identifier (ID) caches. The retrieved data objects are returned to the client in response to the query. | 01-16-2014 |
20140019455 | HISTORICAL VIEW OF OPEN FILES - Managing versions of an electronic entity comprising many independently managed, but mutually-dependent, subcomponents can be challenging. File management functionality is provided for use with an integrated development environment to produce a visual indication of the relationships among the subcomponents. The approach described herein provides an improvement over source code control systems and backup systems in the ability to revert the state of one or more files as their content existed at an historical time point. The technique does not require a user to predict in advance at which time points the content state of one or more files will be interesting as historical time points for future use. | 01-16-2014 |
20140019456 | Application Certification and Search System - In one embodiment, a method searches a plurality of application marketplaces for applications where each application marketplace includes a plurality of applications available for download to a mobile device. The method then compares a set of applications in different marketplaces to determine certified instances of applications and uncertified instances of applications. Information is stored for the certified instances of the applications that are found in different marketplaces in a database. The method indexes the information for the certified instances of the applications in the database and stores an index based on the indexing. The index is used to provide a search result for a search query for an application. | 01-16-2014 |
20140019457 | SYSTEM AND METHOD FOR INDEXING, RANKING, AND ANALYZING WEB ACTIVITY WITHIN AN EVENT DRIVEN ARCHITECTURE - Disclosed is a system for organizing a web activity including a parsing module for receiving the web activity, a concept indexing module for indexing the web activity according to a plurality of concepts in a concept index, a web event creation module for generating a plurality of web events from the web activity, a web activity indexing module for indexing the web activity according to the plurality of web events in a web event index, a ticker management module for generating a plurality of tickers each respectively associated with at least one of the plurality of concepts, and a database for storing the concept index, the web event index, and the plurality of tickers. | 01-16-2014 |
20140025684 | INDEXING AND SEARCHING A DATA COLLECTION - A system searches a data collection using a bloom filter index to reduce the search scope to improve search efficiency, while at the same time allowing more flexible searches of the data collection using search terms containing any arbitrary string of data. | 01-23-2014 |
20140025685 | MANAGING STORAGE OF INDIVIDUALLY ACCESSIBLE DATA UNITS - Managing data by: receiving a group of individually accessible data units, each data unit identified by a key value, with key values determined such that the key value identifying a first data unit received before a second data unit occurs earlier in a sort order than the key value identifying the second data unit; and processing the data units for storage in a data storage system. The processing includes: storing blocks of data, the blocks being generated by combining a plurality of the data units; providing an index with entries that enable location, based on a provided key value, of a block that includes a data unit corresponding to the provided key value; and generating one or more screening data structures associated with the blocks for determining, based on a given key value, whether to search the stored blocks for a data unit corresponding to the given key value. | 01-23-2014 |
20140032562 | APPARATUS AND METHODS FOR USER GENERATED CONTENT INDEXING - A method and client device is disclosed for indexing content of a multimedia file. The method comprises using a client device to segment the content of the multimedia file into a plurality of segments and to determine structure-searchable data for each segment. Determining structure searchable data for a segment comprises (1) identifying one or more features of respective multimedia types in the segment; (2) correlating each of the identified features to one or more respective keywords; and (3) calculating one or more respective relevance factors for each of the keywords, where at least one of the relevance factors is based on one or more characteristics of the client device. The method also comprises the client device transmitting the structure-searchable data (including the keywords, relevance factors, and respective media types of the identified features) to an indexing server. | 01-30-2014 |
20140032563 | Indexing Based on Object Type - In one embodiment, a method includes maintaining data stores for objects of a social-networking system, each data store being configured to store objects of a particular one of multiple object types indexed by a search index; receiving an object; identifying an object type of the received object; storing the received object in at least one data store being configured to store objects of the identified object type; and updating the search index of the at least one data store based on the received object. | 01-30-2014 |
20140032564 | Social Static Ranking for Search - In one embodiment, one or more computing devices assign each of a plurality of nodes of a graph of a social-networking system to one of a plurality of search indices. Each search index corresponds to a node type, and each node assigned to a search index is of the node type that the search index corresponds to. For each search index, the one or more computing devices determine a value for each node assigned to the search index based at least in part on edges connected to the node in the graph and rank the nodes assigned to the search index based at least in part on their values. The one or more computing devices provide the search indices for storage to facilitate responding to queries encompassing objects represented by the nodes assigned to the search indices. | 01-30-2014 |
20140032565 | SECONDARY CONTENT - A method includes receiving primary content, wherein the primary content includes a video program. The method also includes identifying attributes of the primary content, identifying secondary content based on the identified attributes of the primary content, and indexing the primary content to the secondary content. The method also includes identifying a primary user device associated with the primary content and an associated secondary user device. A request for secondary content may be received from the secondary user device. The method further includes providing the secondary content at the secondary user device synchronized with receipt of the primary content at the primary user device. | 01-30-2014 |
20140032566 | SYSTEMS AND METHODS TO BUILD AND UTILIZE A SEARCH INFRASTRUCTURE - Methods and systems to build and utilize a search infrastructure are described. The system generates index information components in real-time based on a database that is time-stamped. The system updates index information at a plurality of query node servers based on the index information components. A query engine receives a search query from a client machine and identifies search results based on the query and the index information. The system communicates the search results, over the network, to the client machine. | 01-30-2014 |
20140040273 | HYPERVIDEO BROWSING USING LINKS GENERATED BASED ON USER-SPECIFIED CONTENT FEATURES - A method for generating content links between a first digital file and a second digital file by detecting a content feature of a first digital file segment of the first digital file during playback of the first digital file segment of the first digital file, searching an index of a plurality of content features for a plurality of segments including a second digital file segment of the second digital file, and dynamically generating a link between the first digital file one segment of the first digital file and the second digital file segment of the second digital file when a content feature of the first digital file segment of the first digital file is related to the content feature of the at least one segment of the second digital file. | 02-06-2014 |
20140040274 | DISAMBIGUATING USER INTENT IN CONVERSATIONAL INTERACTION SYSTEM FOR LARGE CORPUS INFORMATION RETRIEVAL - A method of disambiguating user intent in conversational interactions for information retrieval is disclosed. The method includes providing access to a set of content items with metadata describing the content items and providing access to structural knowledge showing semantic relationships and links among the content items. The method further includes providing a user preference signature, receiving a first input from the user that is intended by the user to identify at least one desired content item, and determining an ambiguity index of the first input. If the ambiguity index is high, the method determines a query input based on the first input and at least one of the structural knowledge, the user preference signature, a location of the user, and the time of the first input and selects a content item based on comparing the query input and the metadata associated with the content item. | 02-06-2014 |
20140040275 | SEMANTIC SEARCH TOOL FOR DOCUMENT TAGGING, INDEXING AND SEARCH - A semantic tagging engine automatically generates semantic tags for the given documents and enables semantic search, based on meanings of search terms and content tags. A Semantic Knowledge Management Tool (SKMT) forms a semantic search and knowledge management platform to search, analyze and manage enterprise content. SKMT scans different content sources and generates indexes of semantic keywords. Its interface allows users to manage various data sources, search, explore and visualize search results at semantic level. SKMT provides high precision of semantic search and semantic data visualization. | 02-06-2014 |
20140046949 | INCREMENTAL DYNAMIC DOCUMENT INDEX GENERATION - New text associated with an updated document is compared with existing embedded index items within the updated document and with existing contextual text within the updated document associated with the existing embedded index items. A determination is made as to whether any of the new text includes candidate text for at least one new index item that is not already indexed within the existing embedded index items using a contextual index compendium including contextual index item generation rules and the existing contextual text within the updated document associated with the existing embedded index items. At least one new index item is generated, using the contextual index item generation rules, in response to determining that at least one portion of the new text includes the candidate text that is not already indexed within the existing embedded index items. | 02-13-2014 |
20140046950 | SYSTEMS AND METHODS FOR EFFICIENT PAGING OF DATA - Paging of data includes querying a table of data to determine an item identifier for each item in the table. An index table including an index identifier column of sequential integers and a corresponding item identifier column for each item in the table is generated and stored in a memory. A page index table including the item identifier that corresponds to a first index identifier and each item identifier that corresponds to one plus a pagesize multiple of the index identifier is generated and stored in a memory. A desired page of data is retrieved using the determined item identifier corresponding to the first item on the desired page and either the determined item identifier corresponding to the first item on a next page after the desired page or a determined item identifier corresponding to the last item on the desired page. | 02-13-2014 |
20140046951 | AUTOMATED SUBSTITUTION OF TERMS BY COMPOUND EXPRESSIONS DURING INDEXING OF INFORMATION FOR COMPUTERIZED SEARCH - Methods, software and devices for indexing responses for later providing to users in response to queries are disclosed. For each stored response, representative queries are stored in association with that response, where each representative query represents a possible query for searching for information addressed by that response. Representative queries are selectively modified by substituting terms by corresponding chosen substitute expressions, where a substitute expression is chosen for a particular term in one of the representative queries based on past substitutions in others of said representative queries. For each response, a Boolean expression is formed from those representative queries associated with that response, as selectively modified, where the Boolean expression is satisfied by each of those representative queries. The Boolean expression is stored in association with that response for later applying to each query, that response to be provided to the user in response of those queries satisfying the Boolean expression. | 02-13-2014 |
20140046952 | APPARATUS, SYSTEM AND METHOD FOR DATA COLLECTION, IMPORT AND MODELING - A method of and system for data analysis, including interrogating a plurality of computer systems to generate for each computer system at least one dump file, each dump file including configuration and state information; extracting predetermined configuration and state information from the respective dump files; and storing the extracted configuration and state information in a database in a normalized format. | 02-13-2014 |
20140046953 | TEXT PROCESSING METHOD, SYSTEM AND COMPUTER PROGRAM - A method includes hierarchically identifying occurrences of some of the words in the set of sentences; creating a first index for each of some of the words based on the upper hierarchy of occurrences identified for each word; receiving input of a queried word; hierarchically identifying occurrences of the queried word in the set of sentences; creating a second index based on the upper hierarchy of occurrences identified for the queried word; comparing the first index and the second index to calculate an estimated value for the number of occurrences of a word in the neighborhood of the queried word; and calculating the actual value of the number of occurrences of a word in the neighborhood of the queried word based on an upper hierarchy and lower hierarchy of the occurrences on condition that the estimated value is equal to or greater than a predetermined number. | 02-13-2014 |
20140052732 | Analytics Data Indexing System and Methods - Provided is a method that includes a method for updating index data. The method includes receiving index data, including an index value indicative of user activity on a network site and an index time corresponding to a time used for calculating the index value, receiving an update index time corresponding to a time used for updating the index data, determining an updated index value using an exponential decay of the index value from the index time to the update index time, wherein the updated index value comprises a decayed value of the index value corresponding to the update time, and storing updated index data including the updated index value and the update index time. | 02-20-2014 |
20140052733 | INDEXING PREVIEW - Embodiments are directed towards previewing results generated from indexing data raw data before the corresponding index data is added to an index store. Raw data may be received from a preview data source. After an initial set of configuration information may be established, the preview data may be submitted to an index processing pipeline. A previewing application may generate preview results used on the preview index data and the configuration information. The preview results may enable previewing how the data is being processed by the indexing application. If the preview results are not acceptable, the configuration information may be modified. The preview application enables modification of the configuration information until the generated preview results may be acceptable. If the configuration information is acceptable, the preview data may be processed and indexed in one or more index stores. | 02-20-2014 |
20140052734 | COMPUTING DEVICE AND METHOD FOR CREATING DATA INDEXES FOR BIG DATA - In a method for creating data indexes for big data of a computing device, data lists are obtained from a data pool in a storage device, and a priority is set for each of the data lists. Data queues are created in the storage device, and the data lists are assigned to the data queues according to the set priorities. A node index is created for each data list stored in each of the data queues, and the data lists are deleted from the data queue after the node indexes creation. The method obtains a data list having a highest priority from the data pool if such a data list needs to be processed first, combines the node indexes to generate a root index for the data pool, and stores the root index of the data pool and the node indexes of the data lists in the storage device. | 02-20-2014 |
20140052735 | Propagating Information Among Web Pages - Web pages of a Website may be processed to improve search results. For example, information likely to pertain to more than just the Web page it is directly associated with may be identified. One or more other, related, Web pages that such information is likely to pertain to is also identified. The identified information is associated with the identified other Web page(s) and this association is saved in a way to affect a search result score of the Web page(s). | 02-20-2014 |
20140059052 | PARTITIONING SORTED DATA SETS - Techniques described herein generally relate to processing data sets. The data sets can each include multiple indexed data values that can be partitioned into first and second portions. Each data value in each of the first portions of the data sets may have a greater magnitude than each data value in each of the second portions of the data sets. The first portions and second portions of the data values can be further processed by either different processors, or different processor cores as may be desirable. | 02-27-2014 |
20140059053 | Method and System for Meta-Tagging Media Content and Distribution - A unique application within Video that allows for user generated Meta-tagging to be delivered in real time to individual clips is provided. This meta-tag creates its own tail based upon the user generated words population, which is searchable via a “spider network” that runs invisible behind the web pages of the site. These same “tagged” words are broadcast over various delivery networks including but not limited to live feeds from SMS, MMS, News Feeds within Community Web Sites, Video Sharing Web Sites, Widget applications any other forms of electronic communication that will be dropped in real time to identified users and friends of users. | 02-27-2014 |
20140067819 | EFFICIENT XML TREE INDEXING STRUCTURE OVER XML CONTENT - A method and apparatus are provided for building and using a persistent XML tree index for navigating an XML document. The XML tree index is stored separately from the XML document content, and thus is able to optimize performance through the use of fixed-sized index entries. The XML document hierarchy need not be constructed in volatile memory, so creating and using the XML tree index scales even for large documents. To evaluate a path expression including descendent or ancestral syntax, navigation links can be read from persistent storage and used directly to find the nodes specified in the path expression. The use of an abstract navigational interface allows applications to be written that are independent of the storage implementation of the index and the content. Thus, the XML tree index can index documents stored at least in a database, a persistent file system, or as a sequence of in memory. | 03-06-2014 |
20140067820 | SYSTEM AND METHOD FOR PHONETIC SEARCHING OF DATA - A method of phonetically searching media information comprises receiving a plurality of search queries from one or more client systems and providing a phonetic representation of each search query. One or more search jobs are instantiated, each search job comprising a plurality of tasks, each task being arranged to sequentially read a block from an archive file. The archive file is stored within a distributed filing system (DFS) in which sequential blocks of data comprising the archive file are replicated to be locally available to one or more processors from a cluster of processors for executing the tasks. Each block stores index files corresponding to a plurality of source media files, each index file containing a phonetic stream corresponding to audio information for a given source media file. Each task obtains phonetic representations of outstanding search queries for a block and sequentially searches the block for each outstanding search query. | 03-06-2014 |
20140067821 | STORAGE AND RETRIEVAL OF SENSOR DATA AND COMPUTED PARAMETERS FOR USE IN CONDITION BASED MAINTENANCE SYSTEMS - A system and method for storing and accessing data in an embedded system of an aircraft extracts identifiers from headers in stored data, and stores the identifiers in a separately indexable array. | 03-06-2014 |
20140067822 | SYSTEM AND METHOD FOR SELECTIVE INDEXING OF CONTENT - Provided are a system and method for selective indexing of content. The system includes an indexing mode identifier to identify an indexing mode of content in a storage device selected by a user, and a content indexing unit to index the content, based on the indexing mode. By providing multiple indexing modes, the system can provide multiple options of indexing the content based on speed of indexing and level of detail of the indexing. | 03-06-2014 |
20140067823 | Textual Search for Numerical Properties - A textual search index is used to identify data objects that have numerical properties that satisfy search queries. As described herein, a search index contains search index entries, each of which specifies a reference string and a set of data object identifiers. When an original search query is received, the search query is transformed into a transformed search query such that numerical comparison query elements in the original search query are transformed into equivalent textual comparison query elements. The search index is then used to identify ones of the data objects that satisfy the transformed search query. A list of the identified data objects may then be provided in response to the original search query. | 03-06-2014 |
20140074849 | REMOTE ARTIFACT REPOSITORY - System for generating a pseudo-repository. The system scans a directory to detect compiled binary files, and assembles an index of the compiled binary files based on metadata describing the compiled binary files. Then the system generates a pseudo-repository based on the index that maps each compiled binary file with at least one associated artifact, wherein the pseudo-repository responds to client requests for one of the binary files. | 03-13-2014 |
20140074850 | VISUALIZATION OF DATA FROM CLUSTERS - Embodiments are directed towards the visualization of machine data received from computing clusters. Embodiments may enable improved analysis of computing cluster performance, error detection, troubleshooting, error prediction, or the like. Individual cluster nodes may generate machine data that includes information and data regarding the operation and status of the cluster node. The machine data is received from each cluster node for indexing by one or more indexing applications. The indexed machine data including the complete data set may be stored in one or more index stores. A visualization application enables a user to select one or more analysis lenses that may be used to generate visualizations of the machine data. The visualization application employs the analysis lens to produce visualizations of the computing cluster machine data. | 03-13-2014 |
20140074851 | DYNAMIC DATA ACQUISITION METHOD AND SYSTEM - Embodiments of the present application relate to a dynamic data acquisition method, a dynamic data acquisition system, and a computer program product for dynamically acquiring data. A dynamic data acquisition method is provided. The method includes extracting a search term from a search request string that is received, looking up the search term in a threshold value dictionary to acquire a dynamic threshold score corresponding to the search term, using the search term as a query condition and the dynamic threshold score corresponding to the search term as a filter condition to acquire, in an index data table, one or more corresponding pieces of index information, acquiring data information corresponding to the search term based on the index information in the index data table, and sending the data information to be displayed in a page of a website. The dynamic threshold score varies based on a characteristic factor. | 03-13-2014 |
20140074852 | Visual Search Using Multiple Visual Input Modalities - Systems, methods, and computer-readable storage media for web-scale visual search capable of using a combination of visual input modalities are provided. An edgel index is created that includes shape-descriptors, including edgel-based representations, that correspond to each of a plurality of images. Each edgel-based representation includes pixels that depicts edges or boundary contours of an image and is created, at least in part, by segmenting the image into a plurality of image segments and performing a multi-phase contour detection on each segment. Upon receiving a search query having a visual query input, the visual query input is converted into shape-descriptors, including an edgel-based representation, and the shape-descriptors, including the edgel-based representation, of each of the plurality of images is compared with the shape-descriptors, including the edgel-based representation, of the visual query input to identify at least one image of the plurality of images that matches the visual query input. | 03-13-2014 |
20140081979 | Creation and Access of Quota Trees in a File System - Exemplary embodiments provide various techniques for the creation and access of quota trees in a file system. In one example, a method of identifying metadata referenced by a quota tree is provided. A quota tree metafile is accessed, and this quota tree metafile includes references to locations of inode grouped data structures associated with quota trees. Here, each quota tree is allocated an inode grouped data structure. A reference to a location of an inode grouped data structure that is allocated to the quota tree is identified from the quota tree metafile. The inode grouped data structure is accessed based on the reference, and this inode grouped data structure defines a set of inode identifiers associated with the quota tree. An inode grouped data structure that stores the metadata is then located using the set of inode identifiers as index | 03-20-2014 |
20140081980 | METHOD AND APPARATUS FOR ACCESSING AND DISPLAYING PRIVATE USER INFORMATION - Methods and apparatuses are provided for accessing and displaying private user information. A privacy-preserving overlay platform determines a request, from at least one application, for user data. The privacy-preserving overlay platform processes and/or facilitates a processing of the data to generate at least one index value of the data. Further, the privacy-preserving overlay platform causes, at least in part, a return of the at least one index value to the at least one application in response to the request. | 03-20-2014 |
20140081981 | Methods, Systems, and Program Products for Identifying a Matched Tag Set - Methods and systems are described for identifying a matched tag set. In one aspect, a first resource is identified that is first tagged with a first tag associated with a first tagging criterion by a first matched tag and that is second tagged with the first tag associated with a second tagging criterion by a second matched tag. Selection information is received identifying the first tag. Based on the first tagging criterion, a first set is identified that includes at least one resource when the selection information identifies the first matched tag, wherein each resource in the first set is tagged with the first tag. Based on the second tagging criterion, a second resource, not included in the first set, is identified that is tagged with the first tag when the selection information identifies the second matched tag. | 03-20-2014 |
20140081982 | Method and Computer for Indexing and Searching Structures - A method for indexing a plurality of structures derived from a plurality of externalizations of users' mental modelings is disclosed. The method includes receiving at least one of the plurality of structures; analyzing the structure according to a predetermined principle of normalization to obtain a plurality of index analysis results in a form of a plurality of tuples comprising a plurality of elements and/or a plurality of relations thereof related to the plurality of structures; and obtaining an index store according to the index analysis result. | 03-20-2014 |
20140081983 | Session-Based Processing Method and System - A log file processing system sorts records from large log files and groups them by session without making a complete copy of the log files by capturing a subset of the log files in a sliding memory window and identifying all records in the window that form a complete user session. Records belonging to a complete session are output for analyzing, and the remaining records are output as raw log data for additional processing. Using a ring buffer to implement the sliding memory window, data structures are used to group records by session, to identify completed sessions, and to index into the ring buffer to retrieve records for completed sessions that are to be directly analyzed. Any records remaining in the ring buffer at the end of slide window processing may be output as raw log file data and are processed as incomplete or malformed session records. An embodiment of the log file processing system provides a significant improvement on the speed of data extraction from log files into analyzable session data. | 03-20-2014 |
20140081984 | SYSTEMS AND METHODS FOR SCALABLE DELOCALIZED INFORMATION GOVERNANCE - The invention relates to electronic indexing, and more particularly, to the indexing, in a cloud, data held in a cloud. Systems and methods of the invention index data by accessing the data in place in the cloud and breaking a job into work items and sending the work items to multiple cloud processes that can each determine whether to index data associated with the work item or to create a new work item and have a different cloud process index the data. Each cloud process is proximal to an item that it indexes. This gives the system scale as well as an internal load-balancing. | 03-20-2014 |
20140089315 | Method and Apparatus for Enhancing Electronic Reading by Identifying Relationships between Sections of Electronic Text - An apparatus, method and article of manufacture of the present invention detects the presence of references to the same concept in separate sections of text, and, with no input required from the reader, presents the reader with information concerning the detected references to the concept. The information provided may comprise information related to the location of the reference to the concept in other sections of text, and the reader also is provided the ability to move from one reference to a concept directly to another reference to the same concept. | 03-27-2014 |
20140089316 | Method and Apparatus for Enhancing Electronic Reading by Identifying Relationships between Sections of Electronic Text - An apparatus, method and article of manufacture of the present invention detects the presence of references to the same concept in separate sections of text, and, with no input required from the reader, presents the reader with information concerning the detected references to the concept. The information provided may comprise information related to the location of the reference to the concept in other sections of text, and the reader also is provided the ability to move from one reference to a concept directly to another reference to the same concept. | 03-27-2014 |
20140089317 | ASSOCIATING OPERATIONS INFORMATION AND COMMUNICATIONS INFORMATION - A method can include providing operations information associated with a coordinate of a subterranean formation; associating communications information with the coordinate; indexing the provided operations information and the associated communications information; and storing a search index based at least in part on the indexing. Various other apparatuses, systems, methods, etc., are also disclosed. | 03-27-2014 |
20140089318 | METADATA QUERYING METHOD AND APPARATUS - Embodiments of the present invention provide a metadata querying method and apparatus. The method includes: sampling at least one piece of first metadata from to-be-searched-for metadata; using at least a part of feature values in each piece of the sampled first metadata as an index, and searching a sparse index table preset in a memory for a corresponding container identifier; selecting, according to the number of times that a same container identifier is found, a container corresponding to a container identifier that meets a set condition; loading metadata in the selected container into a metadata cache; and searching the metadata cache for a data block that is the same as the to-be-searched-for metadata. In the embodiments of the present invention, querying performance can be improved and occupied memory space can be reduced. | 03-27-2014 |
20140095511 | MULTIPLE INDEX BASED INFORMATION RETRIEVAL SYSTEM - An information retrieval system uses phrases to index, retrieve, organize and describe documents. Phrases are identified that predict the presence of other phrases in documents. Documents are the indexed according to their included phrases. The document index is partitioned into multiple indexes, including a primary index and a secondary index. The primary index stores phrase posting lists with relevance rank ordered documents. The secondary index stores excess documents from the posting lists in document order. | 04-03-2014 |
20140101165 | SEARCHABLE AND SIZE-CONSTRAINED LOCAL LOG REPOSITORIES FOR TRACKING VISITORS' ACCESS TO WEB CONTENT - A method and system for tracking visitors' access to web content using a searchable and size-constrained local log repository is disclosed. A repository indexer receives visitor logs from a remote source and indexes the logs to include a usage field indicating when or how frequently an associated visitor log was accessed from a local log repository by a report request manager. The local log repository stores the logs and is size constrained. A repository manager removes a subset of the logs in the local log repository based on parameters within the subset of the logs' usage field. The report request manager receives a request from a requestor for a report indicating visitors' access to a content object, searches for logs within the local log repository pertinent to the request, aggregates data across the logs responsive to the search, and generates a report presenting the aggregated data. | 04-10-2014 |
20140101166 | Methods And Systems For Processing Media Files - Systems and methods for processing media files are described. In one embodiment, one or more events are captured having associated event data and associated with a client device, wherein each event is associated with an article and at least one of the articles is a media file, wherein at least one of the events is captured in real time upon the occurrence of the event, at least some of the event data and articles associated with the events are indexed and stored, a search query is received, and the at least one media file is determined as relevant to the search query. | 04-10-2014 |
20140108413 | Responsive Images Service - A system, method, and computer-readable medium are disclosed for automating the management of a device description repository (DDR). A device properties detection script embedded in a web page is executed when the web page is processed by a browser. Upon execution, the embedded script determines various properties associated with the user's device, which are then provided along with the device's user-agent identifier for processing. In turn, the provided user-agent identifier is used to search a predetermined DDR for a matching user-agent identifier. If a matching user-agent identifier is not found, then the provided user-agent identifier and its corresponding device properties are stored in the DDR. The device properties associated with the user-agent identifier are then used to initiate the provision of device-optimized images to the browser. | 04-17-2014 |
20140108414 | SCALABLE DISTRIBUTED PROCESSING OF RDF DATA - In general, techniques are described for an RDF (Resource Description Framework) database system which can scale to huge size for realistic data sets of practical interest. In some examples, a database system includes a Resource Description Framework (RDF) database that stores a plurality of data chunks to one or more storage drives, wherein each of the plurality of data chunks includes a plurality of triples of the RDF database. The database system also includes a working memory, a query interface that receives a query for the RDF database, a SPARQL engine that identifies a subset of the data chunks relevant to the query, and an index interface that includes one or more bulk loaders that load the subset of the data chunks to the working memory. The SPARQL engine executes the query only against triples included within the loaded subset of the data chunks to obtain a query result. | 04-17-2014 |
20140108415 | METHOD AND SYSTEM OF MAPREDUCE IMPLEMENTATIONS ON INDEXED DATASETS IN A DISTRIBUTED DATABASE ENVIRONMENT - In one exemplary embodiment, a method of a distributed database system includes the step receiving a query in a query language from a client with a distributed database system. An index that matches the query is located. The index is pre-generated from a database table in the distributed database system. A map function of a MapReduce programming model is implemented using the index. A reduce function of the MapReduce programming model is implemented using the output of the map function. Optionally, a finalize function can be implemented using the output of the reduce function. The distributed database system can be a scalable NoSQL database. The reduce function can be optional when the value of the output of the map function is guaranteed to be unique. | 04-17-2014 |
20140108416 | INDEXING USING A LOCKLESS BURST TRIE - Lockless burst trie indexing inserts marker nodes into a linked list of data and marker nodes, each marker node corresponding to a symbol in an ordered data sequence stored in the data nodes. A trie structure contains entries that represent the symbols of the marker nodes and point to the corresponding marker nodes. The trie structure can contain multiple levels of entries that are linked together in a hierarchy to represent subsets of the data sequences. | 04-17-2014 |
20140108417 | METHOD AND APPARATUS FOR WEB AD MATCHING - A method and apparatus are provided for better web ad matching by combining relevance with consumer click feedback. In one example, the method includes receiving a query page, extracting features from the query page, re-weighting the query page, evaluating the query page in light of each ad in order to score each ad and pick substantially best ad matches of the indexed ads, and returning the substantially best ad matches to the consumer computer. | 04-17-2014 |
20140108418 | SEARCHING CODE BY SPECIFYING ITS BEHAVIOR - Systems and methods are disclosed for receiving a first specification that identifies program code behavior associated with a plurality of documents. The specification includes an input-output pair with a first data entity and a second data entity. The systems and methods further include identifying one or more documents, within the plurality of documents, that are configured to (i) use at least a portion of the first data entity as an input to program code associated with particular ones of the documents, and (ii) provide at least a portion of the second data entity as output associated with the program code, wherein the particular ones of the documents correspond to a positive matching between one or more constraints associated with each document and one or more constraints associated with the specification, and generating search results comprising the identified one or more documents. | 04-17-2014 |
20140114980 | Mining Developer Directories from Project Change History - Methods and arrangements for creating a searchable developer directory. A developer profile is generated relative to a project, the developer profile including information from change history with respect to the project. Metrics related to developer participation in the project are included in the developer profile, and the developer profile is indexed with at least one other developer profile to provide a search basis for search queries. | 04-24-2014 |
20140122498 | ENHANCED VIEWS OF TAGGED ITEMS IN A GALLERY - Disclosed herein are systems, methods, and software for facilitating gallery environments and views. In at least one implementation an initial view is presented of tagged items arranged in tag groups. The tag groups correspond to tags and the tagged items are arranged in the tag groups based on with which of the tags each of the tagged items is associated. One of the groups may be identified for enhanced viewing. Accordingly, tagged items associated with the identified group, by way of their association with a tag corresponding to the group, are identified. In some implementations they may be referred to as enhanced tagged items. An enhanced view may then be presented of at least the enhanced tagged items. | 05-01-2014 |
20140122499 | OPTIMIZED QUERIES FOR FILE PATH INDEXING IN A CONTENT REPOSITORY - Techniques for indexing file paths of items in a content repository may include taking turns in querying each different item type or folder type in a round robin schedule to visit select nodes of the folder tree of that type to update and maintain the file path indexes. Item types or folder types may be associated with a count of instances or children of instances that are missing indexes. For each item type or folder type, a query may be performed for instances of the item type or folder type having children that are missing indexes, the instances or children of the instances returned may be associated with file path indexes, and the count of instances or children of instances may be adjusted based on the associating. | 05-01-2014 |
20140129564 | PROVIDING FILE INDEXES AND PER-FILE VIEWING MODES WITHIN A FILE MANAGEMENT APPLICATION - Various embodiments present file indexes within a file managing and navigation interface. In one embodiment, a set of files is presented within a user interface of a file managing and navigation application. A visual indicator is associated with at least one file in the set of files. The visual indicator indicates to a user that the at least one file is associated with an index. The index includes a set of index components associated with a content set of the at least one file. A request from the user to is received to display the index. The index is presented to the user within the user interface based on receiving the request. | 05-08-2014 |
20140129565 | INFORMATION PROCESSING DEVICE - An information processing device of the present invention includes: an index generating means for generating index data distributedly stored into a plurality of servers, each index data being generated for each data point information including metadata representing a characteristic of data to be searched, and each index data being associated with reference point information that are other data point information such that distances to the each data point information based on a preset reference are within a preset range; and a communication cost storing means for storing, for each of the servers storing the index data, communication cost information representing a preset cost that costs in communication with the server. The index generating means selects the reference point information to be associated with specific one of the data point information, based on the communication cost information of the server storing the index data corresponding to the reference point information. | 05-08-2014 |
20140136543 | AUTOCOMPLETE SEARCHING WITH SECURITY FILTERING AND RANKING - A system that provides secure autocomplete searching receives an autocomplete query from a user, the autocomplete query including a prefix of a search phrase, and retrieves security information of the user. The system searches one or more prefix indexes to find a set of matching objects, where the matching objects each include associated object security information. The system excludes matching objects that the user is not authorized to access from the set of matching objects based on the object security information and the user security information. The system then returns the set of matching objects to the user. | 05-15-2014 |
20140136544 | NATURAL LANGUAGE PROCESSING OPTIMIZED FOR MICRO CONTENT - A system and a method for microcontent natural language processing are presented. The method comprising steps of receiving a microcontent message from a social networking server, tokenizing the microcontent message into one or more text tokens, detecting the language of the microcontent message and selecting the property dictionary for part-of-speech tag, part-of-speech tagging the microcontent message to identify related pronouns and nouns based on the selected dictionary, and extracting topics form the microcontent messages and assigning confidence values to the topics. | 05-15-2014 |
20140143256 | SEARCHING ENTITY-KEY ASSOCIATIONS USING IN-MEMORY OBJECTS - Disclosed are a method and system for searching entities and keys that are associated with each other using entity-key associations stored in an in-memory object. When a query is issued for retrieving entity-key associations, the disclosed technique searches an in-memory object, such as a binary large object (BLOB) containing a plurality of entity-key association strings that represent the entity-key associations. An entity-key association string includes an entity name of an entity concatenated with a key name of a key that are associated with each other. Searching for the entity-key associations using the entity-key association strings in the BLOB is faster than a conventional search that includes querying multiple tables in a database. The disclosed technique creates the entity-key association strings by reading necessary data from a data structure stored in data storage unit. Both literal and regular expressions are supported without adversely affecting the query performance. | 05-22-2014 |
20140143257 | ANNOTATION SYSTEM FOR CREATING AND RETRIEVING MEDIA AND METHODS RELATING TO SAME - The invention described herein is generally directed to a method and apparatus for creating and retrieving audio data. In one implementation the invention comprises an annotation system configured to record, store, and retrieve media. The annotation system contains a set of client-processing devices configured to capture media for subsequent playback. Each client-processing device typically contains a record button to initiate the capture and is configured upon performing the capture operation to trigger an association of a unique ID with the media. The client-processing devices are further configured to upload the media and a unique ID to a server for purposes of storage. The server obtains the media and unique ID for subsequent retrieval and provides the media and the unique ID to at least one client-processing device from the set of client processing devices. | 05-22-2014 |
20140149421 | DYNAMICALLY QUANTIFYING THE DEMAND OF EACH SOFTWARE COMPONENT OF EACH SOFTWARE STACK DEPLOYED IN THE CLOUD ENVIRONMENT - A method, system and computer program product for dynamically quantifying a demand for the software components deployed in a cloud environment. An administrative server generates a table mapping the software levels of the software components of the deployed software stacks with the number of instances of the deployed software stacks tracked over a period of time. The depth weight based on subtracting a depth index (zero-based) from a depth size is calculated for each software component, where a depth index refers to the software level of the software component in question and a depth size refers to the number of software levels for the software stack bearing the software component in question. A metric used in quantifying the demand for the software component (“popularity index”) is then determined for each software component based on the number of deployed instances and the depth weight for that software component. | 05-29-2014 |
20140149422 | Automating Management of Social Media Data - A system, method, and computer-readable medium are disclosed for automating the management of social media data. Customer identification information associated with a target individual is processed to generate corresponding social media identification information, which in turn is processed to identify associated social media data stored in a social media data source. Once identified, the associated social media data is retrieved, indexed to the customer and social media identification data, and stored in a social profile associated with the target individual. In various embodiments, the customer identification data comprises one or more of a customer's name, a unique customer identifier, a physical address, an electronic mail address, a telephone number, or a customer demographic. | 05-29-2014 |
20140149423 | REPORT ACCELERATION USING INTERMEDIATE RESULTS IN A DISTRIBUTED INDEXER SYSTEM FOR SEARCHING EVENTS - A method and system for managing searches of a data set that is partitioned based on a plurality of events. A structure of a search query may be analyzed to determine if logical computational actions performed on the data set is reducible. Data in each partition is analyzed to determine if at least a portion of the data in the partition is reducible. In response to a subsequent or reoccurring search request, intermediate summaries of reducible data and reducible search computations may be aggregated for each partition. Next, a search result may be generated based on at least one of the aggregated intermediate summaries, the aggregated reducible search computations, and a query of adhoc non-reducible data arranged in at least one of the plurality of partitions for the data set. | 05-29-2014 |
20140156667 | Increasing Internet Protocol Version 6 Host Table Scalability in Top of Rack Switches for Data Center Deployments - Techniques are provided for optimizing storage of address information in switch device databases. A control packet is received from a source host. An address associated with a host is determined. The address comprises a first and second address component. The first address component is stored in a first database and is mapped to an index value that is stored in the first database. The index value is also stored in a second database along with the second address component. The first database may also be examined to determine whether the first address component is stored in the first database. If the first address component is stored in the first database, the index value mapped to the first address component is retrieved. The second database is examined to determine whether the index value and the second address component are stored in the second database. | 06-05-2014 |
20140156668 | APPARATUS AND METHOD FOR INDEXING ELECTRONIC CONTENT - Apparatus and method are provided for indexing electronic content to be served to users that operate mobile and/or stationary communications and computing devices. An index is composed of multiple slices, with each slice capable of storing multiple entries and each entry representing one content item. A content item's index entry is populated with integer values representing attribute/value pairs of a target audience of the content item. A query or request to identify content items for serving to a particular user is similarly formatted with integers representing attribute/value pairs of the user. Queries can then be executed rapidly across any or all index entries in any or all slices. Within a slice, entries may be sorted by value or score, and integer components within an individual entry may be sorted to facilitate rapid comparison with a query. | 06-05-2014 |
20140156669 | APPARATUS AND METHOD FOR INDEXING ELECTRONIC CONTENT - Apparatus and methods are provided for indexing electronic content to be served to users' mobile and/or stationary communications and computing devices. An index is composed of multiple slices, with each slice storing multiple entries and each entry representing one content item or one campaign or collection of content items. An entry is populated with tokens representing attribute/value pairs of a target audience of the content item and/or property/value pairs of the item or the item's campaign. A query or request to identify content items for serving to a particular user is similarly formatted with tokens representing attribute/value pairs of the user and/or item/campaign. Queries can then be executed rapidly across any or all index entries in any or all slices. Within a slice, entries may be sorted by value or score, and integer components within an individual entry may be sorted to facilitate rapid comparison with a query. | 06-05-2014 |
20140156670 | NUMERIC RANGE SEARCH DEVICE, NUMERIC RANGE SEARCH METHOD, AND NUMERIC RANGE SEARCH PROGRAM - A numeric range search device includes: interval index storage means for storing, for numeric ranges to be searched, an interval index that is made up of records each including at least an interval filter and reference information and in which the records are organized in units of records whose interval filters are at least partially common, the interval filter being data indicating a correspondence relationship between an input numeric value or numeric range and subintervals obtained by dividing a range including all of the numeric ranges by a predetermined boundary value, and the reference information being information for referring to a numeric range to be searched; interval query generation means for generating an interval query which is data indicating a correspondence relationship between the input numeric value or numeric range and the subintervals; and candidate selection means for selecting, from the records, a record for which an AND of an interval filter included in the record and the interval query is equal to the interval query. | 06-05-2014 |
20140156671 | Index Constructing Method, Search Method, Device and System - The present disclosure discloses an index constructing method, search method, device and system. The index constructing method includes: constructing data obtained in each preset time period into a primary index segment, respectively, and merging the constructed primary index segments into a primary index segment set after each preset time period is over; copying and merging each primary index segment in the primary index segment set into a corresponding large index segment, to obtain a large index segment set; and sending the primary index segments in the primary index segment set or the large index segments in the large index segment set to a search device to provide a search service. The present disclosure constructs the primary index segments on the basis of time, merges the constructed primary index segments into the large index segment, and provides the search service simultaneously with the primary index segments or the large index segments, thus, improving search speed. | 06-05-2014 |
20140172866 | SYSTEM FOR STORAGE, QUERYING, AND ANALYSIS OF TIME SERIES DATA - A system for storing time series data includes an ingester that prepares metadata indices associated with blocks of incoming time series data and stores the blocks of data in a time series database and the indices in a separate index database. The time series database distributes storage of the data blocks among multiple data nodes. A query layer receives queries and uses the index database to determine which data blocks are needed to process the query, and then requests only those data blocks from the time series database. Processing of the query is performed within the time series database only on those data nodes that contain relevant data, and partial results are passed to an output layer for formation into a final query result. | 06-19-2014 |
20140172867 | METHOD FOR STORAGE, QUERYING, AND ANALYSIS OF TIME SERIES DATA - A method for performing queries on a distributed time series data storage system is presented. The time series data storage system has a time series database that stores data blocks containing time stamped data across a plurality of computing devices. The system also includes an index database that stores an index associated with the time stamped data in each data block. The method includes the steps of sending a query, requesting indices, returning the indices, preparing a sub-query, forwarding the sub-query to an evaluator, evaluating the sub-query, performing a logical operation on each sub-query's result, receiving the sub-results at an output handler, and combining the sub-results. | 06-19-2014 |
20140172868 | SYSTEM AND METHOD FOR STORAGE, QUERYING, AND ANALYSIS SERVICE FOR TIME SERIES DATA - A service for storing time series data provides a data pipe for receiving time series data, a query pipe for making requests to the service, and a result pipe for receiving output from the service. Data sent to the query pipe is processed by an ingester that prepares metadata indices associated with blocks of incoming time series data and stores the blocks of data in a time series database and the indices in a separate index database. A query layer receives queries from the query pipe and uses the index database to determine which data blocks are needed to process the query, and then requests only those data blocks from the time series database. Processing of the query is performed within the time series database only on those data nodes that contain relevant data, and partial results are passed to an output layer for formation into a final query result which is sent out by the results pipe. | 06-19-2014 |
20140172869 | INDEXING OF LARGE SCALE PATIENT SET - Systems and methods for indexing data include formulating an objective function to index a dataset, a portion of the dataset including supervision information. A data property component of the objective function is determined, which utilizes a property of the dataset to group data of the dataset. A supervised component of the objective function is determined, which utilizes the supervision information to group data of the dataset. The objective function is optimized using a processor based upon the data property component and the supervised component to partition a node into a plurality of child nodes. | 06-19-2014 |
20140172870 | INDEXING OF LARGE SCALE PATIENT SET - Systems and methods for indexing data include formulating an objective function to index a dataset, a portion of the dataset including supervision information. A data property component of the objective function is determined, which utilizes a property of the dataset to group data of the dataset. A supervised component of the objective function is determined, which utilizes the supervision information to group data of the dataset. The objective function is optimized using a processor based upon the data property component and the supervised component to partition anode into a plurality of child nodes. | 06-19-2014 |
20140172871 | METHODS AND AUTOMATED SYSTEMS FOR TESTING, OPTIMIZATION, AND ANALYSIS THAT USE ROBUST STATISTICAL PROCESSING OF NON-BINOMIAL EXPERIMENTAL RESULTS - The current document is directed to automated electronic testing, optimization, and/or analysis systems that perform testing that results in non-binomial experimental results that are processed by using efficient and robust processing methods. In one implementation, efficient and robust processing methods are employed to process non-binomial results produced from single-factor tests. In a second implementation, robust and computationally efficient processing methods are employed to analyze non-binomial results from multi-factor/multi-level tests. | 06-19-2014 |
20140172872 | RELATIONSHIP COLLABORATION SYSTEM - A computer implemented method of processing data containing information about relationships between contacts and a community of contact owners is provided, which includes the steps of: collecting data having contact information, contact owner information and one or more values related to the strength of a relationship between the contact and the contact owner; evaluating the strength of the relationship based on the one or more values; and storing the collected data and evaluated strength in a computer database. | 06-19-2014 |
20140181115 | CONTENT DELIVERY SYSTEM WITH PROFILE GENERATION MECHANISM AND METHOD OF OPERATION THEREOF - A method of operation of a content delivery system includes: generating a first user profile for representing a user's identity in a context; generating a second user profile for representing the user's identity in a different context; selecting the first user profile or the second user profile based on the context or the different context; and generating a display content based on the first user profile or the second user profile for displaying on a device. | 06-26-2014 |
20140181116 | Method and device of cloud storage - Embodiments of the present invention disclose a cloud storage method and device, to provide efficient storage of huge amounts of data. The cloud storage method includes: calculating the hash value of a file, converting the hash value of the file into the string, and using the string as the filename; calculating the storage path of the file by using the hash value of the file according to a predefined rule; looking up a physical storage location of the storage path of the file in an index table and storing the file in the physical storage location, wherein the index table stores corresponding relationship tables between all possible storage paths and physical locations in the storage disk in advance. | 06-26-2014 |
20140181117 | PERSON SEARCH METHOD AND APPARATUS - According to an example, a search request is received, and the search request comprises a search keyword and information of a current user; interpersonal relationship chain data of the current user are obtained according to the information of the current user; person information corresponding to the interpersonal relationship chain data of the current user is searched for a person matching the search keyword; and the person matching the search keyword is taken as a search result. | 06-26-2014 |
20140188891 | CONTENT CREATION - Described herein is a technology for facilitating content creation. In some implementations, profiles are collected. The profiles are analyzed to build a profile parameter index. A dummy profile is created based on the profile parameter index. The dummy profile is a fictitious character having profile parameters based on input from a user of the profile parameter index. The control of the dummy profile is under the user. | 07-03-2014 |
20140188892 | SOCIAL NETWORK RESOURCE INTEGRATION - A method can include providing operations information associated with a coordinate of a subterranean formation; associating communications information with the coordinate; indexing the provided operations information and the associated communications information; and storing a search index based at least in part on the indexing. Various other apparatuses, systems, methods, etc., are also disclosed. | 07-03-2014 |
20140195541 | MESSAGE EVALUATION TOOL - In one aspect, there is provided a method. The method may include configuring, at a message evaluator, one or more factors to enable monitoring of a plurality of messages exchanged between a sending object and a receiving object, the one or more factors including a key generated based on a plurality of attributes obtained by accessing at least one of the sending object or the receiving object, the plurality of attributes representing data fields in at least one of the sending object or the receiving object; selecting, by the message evaluator, at least one of the plurality of messages, when at least one of a match or a substantial match exists between the generated key and a portion of the at least one of the plurality of messages; and presenting, by the message evaluator, the selected at least one of the plurality of message. | 07-10-2014 |
20140195542 | ADAPTIVE RANGE FILTERS FOR RANGE AND POINT QUERIES - The technique described herein provides a way to summarize data and can also minimize unnecessary accesses to a data store. In one embodiment, the technique creates and stores an adaptive range filter that contains a compact summary of the contents of an index for a data store in the form of a trie data structure. Before accessing the index of the data store in response to a query, the technique checks the filter to determine whether the data store does not contain any keys for a specific range of data. If the adaptive range filter indicates that the index contains no keys satisfying the query predicate, the index of the data store and the data itself is not accessed. The adaptive range filter of the technique supports both range predicates and equality predicates. It is adaptive to changes in data and queries by learning the query and data distribution. | 07-10-2014 |
20140195543 | System And Method For Providing Orientation Into Subject Areas Of Digital Information For Augmented Communities - A system and method for providing orientation into subject areas of digital information for augmented communities is provided. Topics are provided for subject areas. Each subject area is associated with an augmented community and each topic is hierarchically organized under a fine-grained topic category. Evergreen indexes are maintained and includes topic models that are each matched to topics associated with each augmented community. Interrelatedness of the evergreen indexes is determined by identifying the topics found in the topic hierarchy for each evergreen index that are shared. A social network including neighbors made up of each augmented community associated with each interrelated evergreen index is formed. A user interest is determined and each of the plurality of topic models for the evergreen indexes is evaluated against the digital information. Access to the digital information via the identified best matched evergreen index and the identified other evergreen index is provided. | 07-10-2014 |
20140201211 | Apparatus and Method for Computing N-Way Co-Occurrences of Data Tuples in Scalar Indexes - A non-transitory computer readable storage medium includes executable instructions to receive primary matched values with corresponding document identifications. The primary matched values correspond to a specified primary value, for example expressed in a search query. A secondary scalar index is scanned to match the document identifications with secondary matched values with corresponding secondary values. A tertiary scalar index is scanned to match the document identifications with tertiary matched values with corresponding tertiary values. The secondary values and tertiary values are provided as tuples. Each tuple has the specified primary value, a secondary value, a tertiary value and frequency indicia. | 07-17-2014 |
20140201212 | METHODS AND ARRANGEMENTS FOR DEVICE PROFILES IN WIRELESS NETWORKS - Embodiments provide a device profile mechanism for wireless devices. Many embodiments comprise a medium access control (MAC) sublayer logic to build frames comprising a device profile index element for a first device. Embodiments may facilitate access by a second device to a device profile for the first device without communication of the entire device profile from the first device. In some embodiments, the second device may access a storage medium integrated with or accessible to the second device to determine the device profile. Some embodiments may store the device profile index element in memory, in logic, or in another manner that facilitates transmission of the device profile index element in frames. Some embodiments may receive and detect communications with the device profile index element. Further embodiments may generate and transmit a communication with the device profile index element. | 07-17-2014 |
20140201213 | SYSTEM AND METHOD FOR RANKING ASSET DATA PROBABILITY OF RECOVERY - A system for determining the probability of recovering an asset is provided. The system includes a data server that receives data corresponding to a plurality of vehicles from a plurality of client sources. A database in operative communication with the data server and stores the received data as information records corresponding to each of the plurality of vehicles. A user interface is in operative communication with the data server and receives a request for historical vehicle data relating to a plurality of vehicle assets. An output processor operatively coupled to the data server and generates recoverability indices for the plurality of vehicle assets. | 07-17-2014 |
20140207787 | Multi-tenant system for consolidating, searching and sharing distributed user-specific digital content - This invention is a system for consolidating distributed user-specific digital content, making the content available for search through a user-specific search index, and enabling sharing of selected user-specific digital content with other users through dynamically created ad hoc shared search indexes. The preferred embodiment of this invention is a multi-tenant user subscription based service hosted on the internet. Another embodiment of this invention is a multi-tenant system hosted within an organization. | 07-24-2014 |
20140207788 | SYSTEM AND METHOD FOR ORGANIZING DATA - A system and method for organizing raw data from one or more sources uses an improved mechanism for identifying duplicate data between fields (e.g., columns) in the databases. The fields may be similar fields within a single database or similar or identical fields within a pair of databases and as organized as arrays or field vectors. The present invention sorts each of the field vectors and if necessary, partitions them by common value. A number of comparisons required to identify the duplicate data between the field vectors is reduced by feeding back a difference between the compared values. This difference is used to adjust indices into the field vectors for subsequent comparison. | 07-24-2014 |
20140207789 | SYSTEM AND METHOD FOR PROVIDING LOCATION-SENSITIVE AUTO-COMPLETE QUERY - A system for providing an auto-complete query to a user terminal includes a query indexing unit configured to index search queries. Each of the search queries is associated with location information. The indexing includes incorporating the location information into an associated one of the search queries. At least one of the search queries being a previously provided auto-complete query. The system includes a keyword generating unit configured to generate a keyword by incorporating location information associated with the user terminal into a query inputted by the user terminal. The system includes a query providing unit configured to determine that at least one of the indexed search queries corresponds to the generated keyword, and provide, to the user terminal, the determined at least one of the indexed search queries as the auto-complete query. | 07-24-2014 |
20140214851 | ELECTRONIC DEVICE AND INDEX MANAGING METHOD - An electronic device records access records when a social network was accessed by the electronic device and operation records of operating the electronic device, acquires a first keyword from the access records and the operation records, creates an index based on the first keyword and an index item that records a relationship between the index and the corresponding files. The electronic device receives a search term from a user, creates a second keyword based on the search term, searches an index item corresponding to the second keyword from index items created by the index managing module and provides corresponding files according to the index item. | 07-31-2014 |
20140214852 | METHOD FOR PRODUCING AND USING A RECURSIVE INDEX OF SEARCH ENGINES - The present invention pertains to extraction of text from an index of a search engine starting at an arbitrary position in the text, and to analysis of texts for co-occurrence of words, and to the use of said extraction and analysis for inferring implicit (causal, associative, etc) relationships among objects in sequences thereof. | 07-31-2014 |
20140222824 | METHOD AND APPARATUS FOR PORTABLE INDEX ON A REMOVABLE STORAGE MEDIUM - Techniques for supporting a portable index include, in response to determining that a removable computer-readable storage medium is recently mounted on a particular drive having a particular drive identifier, determining a particular media identifier for the removable computer readable storage medium. Before harvesting metadata from each file of a set of one or more files on the removable computer-readable storage medium, it is determined whether a portable index file is stored on the removable computer-readable storage medium. The portable index file includes, for each file of the set, data that associates the media identifier ( | 08-07-2014 |
20140222825 | ELECTRONIC DEVICE AND METHOD FOR SEARCHING HANDWRITTEN DOCUMENT - According to one embodiment, an electronics device generates first and second index information, the first index information including codes of character strings corresponding to strokes, the second index information including characteristic quantity of strokes. The device executes at least either one of a first search and a second search, according to character string likelihood of first strokes which is a search key. The first search is performed by using the second index information and characteristic quantity of the first strokes. The second search is performed by using the first index information and a code of a character string corresponding to the first strokes. | 08-07-2014 |
20140222826 | DATA SYSTEM AND METHOD - A system and method for content sharing includes acquiring, by a processing device, a plurality of data objects from data sources, storing the plurality of data objects in a data warehouse, generating a high-level index that is shared by the plurality of data objects, generating a plurality of low-level indices that each provides a respective low-level index for a respective one of the plurality of data objects, and providing the plurality of data objects on the content sharing platform for query or search using the high-level index and the plurality of low-level indices. | 08-07-2014 |
20140222827 | STORING DATA AND METADATA IN A DISTRIBUTED STORAGE NETWORK - A method begins with receiving a data search criteria. The method continues by accessing a master database that includes a plurality of entries, wherein an entry includes a data name field, a metadata field, a data DSN address, and a metadata DSN address. The method continues by indexing the database based on a comparison of the data search criteria with metadata contained in the metadata field of the entries. The method continues by, when one or more entries of the database have the metadata that substantially matches the data search criteria, utilizing the data DSN address of the one or more entries to retrieve one or more sets of encode data slices. The method continues by decoding the one or more sets of encoded data slices to retrieve one or more data segments corresponding to the data search criteria. | 08-07-2014 |
20140236957 | System and method for terminal capacity management - A system and method for determining rail terminal capacity needs. In one embodiment, terminal railcar inventory is determined periodically throughout a specific time interval based on terminal traffic data. Train traffic data is accessed and indexed into discrete time periods, such as daily and hourly intervals. Dwell occupancy indexes are created with periodic intervals, and assigned dwell occupancy values based on said dwell occupancy definitions and traffic data. The railcar inventory of the train terminal at any particular time may then be determined by cross-tabulating the occupancy dwell values, and applying business rules. The number and size of tracks needed to accommodate a rail terminal may be determined by assigning blocks of railcars to terminal tracks. Using the railcar inventory, a list of railcar blocks, railcar block sizes, railcar block classifications, and track classifications is created. After specifying switch parameters, the list of railcar blocks is evaluated based on the list of railcar block sizes and the list of railcar block classifications. The list of railcar blocks is then assigned to the terminal tracks, based on the list of railcar block classifications, the list of track classifications, and the switching parameters. After assigning the blocks to tracks, a final list of tracks that have been assigned blocks is generated. | 08-21-2014 |
20140236958 | EVOLVING METADATA - Described herein are techniques related to inferring context information for a piece of content. A client context agent infers the context information by associating the piece of content with information in the computing environment. The association may be made in response to a user interaction with the piece of content in an application, and the information in the computing environment is external to the application. For example, if a user sends an email, the client context went may use a keyword analysis to associate the email with a web page that is open concurrently, and then embed as evolving metadata the Uniform Resource Identifier (URI) of the web page in the email. This Abstract is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. | 08-21-2014 |
20140236959 | SYSTEM AND METHOD FOR PROVIDING INTEGRATED MANAGEMENT OF ELECTRONIC INFORMATION - A system and method for managing electronic information using a computer system for one or more users, comprising creating an electronic folder and storing the folder in an electronic folder management system, creating files for the electronic folder, maintaining a collection of electronic folders and the files stored therein, and associating the electronic folders with one or more files such that each of the files may be accessed by identifying the related electronic folder. | 08-21-2014 |
20140236960 | System and Method for Database Searching - In one embodiment, a method for searching a database includes receiving, by a processor from a user, a message, indicating a query, where the query comprises a pattern and determining, by the processor, a first threshold in accordance with a data set of the database. The method also includes comparing, by the processor, the pattern to a first key of the data set to produce a comparison and determining, by the processor, whether to jump to a second key of the data set or scan to a third key of the data set in accordance to the comparison and the first threshold including jumping to the second key of the data set when an absolute value of the comparison is greater than the first threshold, and scanning to the third key of the data set when the absolute value of the comparison is less than or equal to the first threshold, where the first key and the third key are sequential. | 08-21-2014 |
20140236961 | Methods, Systems, and Products for Indexing Scenes in Digital Media - Methods, systems, and products index digital scenes in digital media. A uniform resource locator is assigned to each different digital scene within the digital media. The uniform resource locator uniquely identifies a resource from which each different digital scene may be retrieved. Individual scenes may thus be retrieved, thus conserving bandwidth and memory. | 08-21-2014 |
20140244652 | Communication Aggregator - A communication aggregator capable of obtaining and displaying a record of party-to-party communication records across one or more communication mediums. This aggregation application serves to provide a user with simplified communication records between one or more parties. | 08-28-2014 |
20140244653 | Manual Activity Prioritization - A computer system enables a user to manually order messages in a list on a per-message basis. More generally, the system enables data elements of a plurality of types, such as messages, appointments, and tasks, to be combined into a single list, which may be manually ordered by the user on a per-element basis. The system also enables the user to mark any element in a list as complete. The system takes an appropriate action in response to a list element being marked as complete, such as removing the element from the list. | 08-28-2014 |
20140244654 | DATA MIGRATION - In one embodiment, a method for data migration includes: receiving intermediate data and corresponding key value-row location information mapping list; importing each row of the intermediate data to a corresponding row of a target data table, where each row of the target data table has a target physical row ID; and establishing, by a computer processor, a target index of the target data table based on the target physical row IDs and the key value-row location information mapping list. With the method or system of this invention, the efficiency of reestablishing a data index after data migration can be improved. | 08-28-2014 |
20140244655 | MULTI-SHOT SCHEDULING SYSTEM - A Multi-Shot Scheduling System chooses from multiple candidate playlists of positions to select a broadcast playlist. Candidate playlists are generated based upon scoring and selecting content items for the positions through the use of index values. Various embodiments of the Multi-Shot Scheduling System can select broadcast playlists for multiple groups of content and can provide different methods of controlling scheduling performance by restricting the range of candidate playlists from which the best playlist can be selected. | 08-28-2014 |
20140244656 | TRANS-COMMUNITY SEARCH METHOD AND APPARATUS - A trans-community search method and apparatus is disclosed. The method includes: receiving a search request from a user and determining online communities used by the user according to user attribute information contained in the search request; generating a sub-search request corresponding to each online community used by the user according to the search request and acquiring a search result corresponding to the sub-search request; combining search results corresponding to all sub-search requests and sending the combined search results to the user. The invention can improve search efficiency for users. | 08-28-2014 |
20140250129 | MANAGING OWNERSHIP OF REDUNDANT DATA - According to one embodiment, a method includes storing instances of a file in a first storage tier and a second storage tier of a storage system, wherein the second storage tier uses less expensive and/or slower-to-access storage media than the first storage tier. Each instance of the file on the first and second storage tier is associated with a unique program identifier (UPI). A request to access the file or instance thereof is received from a user in a group of users, and a UPI associated with an instance of the file that exists on the first storage tier is searched for. Remote access to the instance of the file on the first storage tier is provided to the user requesting access to the file. Additional embodiments are also presented. | 09-04-2014 |
20140250130 | SYSTEM AND METHODS FOR UNITS-BASED NUMERIC INFORMATION RETRIEVAL - An information retrieval and analysis system for numeric data which provides high precision and recall for numeric search and uses a methodology for determining contextualization of the extracted data. The capabilities include extracting, parsing, and contextualizing numeric data including both a numeric value and an accompanying unit. This system facilitates the organization of largely unstructured numeric data into an inverted index and other database formats. An information retrieval system which enables the exploration and refinement of an extracted numeric data set defined by a search input that may be precise or initially vague. This system also facilitates analyzing and portraying numeric data graphically, creating knowledge by combining data from multiple sources, extracting correlations between seemingly disparate variables, and recognizing numeric data trends. This system uses local natural language processing, mathematical analysis, and expert-based scientific heuristics to score the numeric and contextual relevancy of the data to the query parameters. | 09-04-2014 |
20140258305 | SYSTEMS AND METHODS FOR PROVIDING CONTEXTUAL TRUST SCORES - The disclosed embodiments include methods and systems for providing contextual trust scores. The disclosed embodiments include, for example, a system for providing a contextual trust score including a memory storing software instructions and one or more processors configured to execute the software instructions. In one aspect, the one or more processors may be configured to perform operations including receiving a user scenario associated with a user. The operations may also include selecting one or more key variables from one or more data sources based on the user scenario, and may measure the key variables across one or more contextual dimensions. The operations may further include comparing the results of the measuring, and generating a contextual trust score associated with a user. The operations may also include continuously monitoring the data sources, updating the generated trust score, and providing the update trust score to the user or a third party. | 09-11-2014 |
20140258306 | Novel Simulation and Permutation Methods for the Determination of Temporal Association between Two Events - An embodiment in accordance with the present invention provides methods and software for determining an association between two health events, temporally. The methods can be implemented on a computing device either individually, or as a group, and are noted as simulation, permutation shuffling, and permutation sliding. Simulation, permutation shuffling, and permutation sliding each use the comparison of the experimentally found occurrence of association between the two health events to the null distribution of the association statistics, obtained by independently simulating the two health events. In particular, these methods of determining association between two health events can be used for determining an association between reflux and apnea in infants. However, these methods can also be applied more generally to determining a potential relationship between health or other events in the temporal plane. | 09-11-2014 |
20140280182 | METHOD AND SYSTEM FOR CALCULATING AND CHARTING WEBSITE PERFORMANCE - A web performance index tool along with related systems and methods for calculating and charting website performance that provide a single metric snapshot indicating how an end user would perceive the performance of a website. The metric provides an objective way to evaluate website performance by using an algorithm that measures the speed, quantity, and quality of the web content. | 09-18-2014 |
20140280183 | Method For Cross-Domain Feature Correlation - A method for correlating information across distinct domains without requiring feature co-occurrence. The disparate information collections are broken down into features, and a correlation index with correlation score is created. To determine the correlation between distinct domains, an information artifact collection is reduced to a representational set of features, these features are replaced with correlated features using the correlation index, and the new set of features is matched against the second information artifact collection using an appropriate comparison technique. The correlation method allows a single input artifact to be matched against an existing collection, resulting in a set of correlated artifacts from the disparate collection, each ranked by correlation score. | 09-18-2014 |
20140280184 | Updating Index Information When Adding Or Removing Documents - In several aspects, an electronic device and method index a repository of N documents by W words, by not storing between queries, N*W numbers that are specific to each word i and each document j, normally used to compute a score of relevance to a query, of each document j. Instead, the electronic device and method generate the N*W word-specific-document-specific numbers dynamically at query time, based on a set of W numbers corresponding to the W words, and one or more sets (e.g. x sets) of N numbers corresponding to the N documents. Query-time generation of word-specific-document-specific numbers reduces memory otherwise required, e.g. to store these numbers. Hence, in certain aspects W+xN numbers are maintained between queries, and these numbers are changed incrementally when a new document is added to the set or an existing document is removed. Maintaining W+xN numbers reduces processing otherwise required, to start from scratch. | 09-18-2014 |
20140280185 | Providing Task-Based Information - Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for providing task-based information are disclosed. In one aspect, a method includes identifying a set of previous interactive sessions for a particular task. For each identified interactive session in the set of previous interactive sessions, a set of implements used during the interactive session are identified based on content of the interactive session. At least one implement in the identified set of implements is classified as a required implement for the particular task. The classification is based, at least in part, on a portion of the previous interactive sessions in which the implement was identified. Classified implements are indexed according to the particular task. Data identifying the indexed implements are provided in response to receipt of a search query referencing the particular task. | 09-18-2014 |
20140280186 | CROWDSOURCING AND CONSOLIDATING USER NOTES TAKEN IN A VIRTUAL MEETING - Arrangements relate to crowdsourcing and consolidating user notes taken within a virtual meeting. Notes from one or more meeting attendees can be received. The received user notes can be analyzed to identify a key element therein using natural language processing. The analysis of received user notes can be performed by a processor. A consolidated system notes can be generated. The consolidated system notes can include the key element. | 09-18-2014 |
20140280187 | DATA STORAGE SYSTEM HAVING MUTABLE OBJECTS INCORPORATING TIME - A data storage system having mutable objects incorporating time is described herein. According to the systems and methods described herein, a data item may be partitioned into parts (data objects) and stored as an index object. As the object storage system provides immutable objects, when a new version of a data item needs to be stored, only those parts (data objects) of the data item that changed need be saved rather than the entire data item. The systems and methods described herein allow for efficient storage, access and manipulation of mutable data items using an underlying immutable object system. | 09-18-2014 |
20140280188 | System And Method For Tagging Filenames To Support Association Of Information - A system and methods to associate information with a file are described. A system including memory, one or more processors, and one or more modules stored in memory and configured for execution by the one or more processors. The system including a unique identifier determination module configured to determine a unique identifier of a file. Further, the system includes an index management module configure to lookup a unique identifier. In addition, the system includes a tag generating configured to generate a tag based on a filename. And, the system includes a tagging module configured to associate the tag with the file. | 09-18-2014 |
20140280189 | FAST APPROACH TO FINDING MINIMUM AND MAXIMUM VALUES IN A LARGE DATA SET USING SIMD INSTRUCTION SET ARCHITECTURE - Systems and methods may determine a boundary value data unit in a large data set in parallel with determining an associated index of the determined boundary value data unit into the large data set using a single instruction multiple data (SIMD) instruction set architecture and a specialized data layout of array entries. In one example, the specialized data layout of array entries combines a data value and its associated index to an array into a single array entry. | 09-18-2014 |
20140280190 | SYSTEMS, METHODS, AND APPARATUSES FOR IMPLEMENTING DATA UPLOAD, PROCESSING, AND PREDICTIVE QUERY API EXPOSURE - Disclosed herein are systems and methods for implementing data upload, processing, and predictive query API exposure including means for receiving a dataset in a tabular form, the dataset having a plurality of rows and a plurality of columns; processing the dataset to generate indices representing probabilistic relationships between the rows and the columns of the dataset; storing the indices in a database; exposing an Application Programming Interface (API) to query the indices in the database; receiving a request for a predictive query or a latent structure query against the indices in the database; querying the database for a prediction based on the request via the API; and returning the prediction responsive to the request. Other related embodiments are further disclosed. | 09-18-2014 |
20140280191 | SYSTEMS, METHODS, AND APPARATUSES FOR IMPLEMENTING A PREDICT COMMAND WITH A PREDICTIVE QUERY INTERFACE - Disclosed herein are systems and methods for implementing a PREDICT command with a predictive query interface including means for generating indices from a dataset of columns and rows, the indices representing probabilistic relationships between the rows and the columns of the dataset; storing the indices within a database of a host organization; exposing the database of the host organization via a request interface; receiving, at the request interface, a query for the database specifying at least (i) a PREDICT command term, (ii) one or more specified columns to be predicted, and (iii) one or more column name=value pairs specifying column names to be fixed and the values by which to fix them; and querying the database using the PREDICT command term and passing the one or more specified columns to be predicted and the one or more column name=value pairs to generate a representation of a joint conditional distribution of the one or more specified columns to be predicted fixed according to the column name=value pairs using the indices stored in the database. Other related embodiments are further disclosed. | 09-18-2014 |
20140280192 | SYSTEMS, METHODS, AND APPARATUSES FOR IMPLEMENTING A RELATED COMMAND WITH A PREDICTIVE QUERY INTERFACE - Disclosed herein are systems and methods for implementing a RELATED command with a predictive query interface including means for generating indices from a dataset of columns and rows, the indices representing probabilistic relationships between the rows and the columns of the dataset; storing the indices within a database of a host organization; exposing the database of the host organization via a request interface; receiving, at the request interface, a query for the database specifying a RELATED command term and a specified column as a parameter for the RELATED command term; querying the database using the RELATED command term and passing the specified column to generate a predictive record set; and returning the predictive record set responsive to the query, the predictive record set having a plurality of elements therein, each of the returned elements including a column identifier and a confidence indicator for the specified column passed with the RELATED command term, wherein the confidence indicator indicates whether a latent relationship exists between the specified column passed with the RELATED command and the column identifier returned for the respective element. Other related embodiments are further disclosed. | 09-18-2014 |
20140280193 | SYSTEMS, METHODS, AND APPARATUSES FOR IMPLEMENTING A SIMILAR COMMAND WITH A PREDICTIVE QUERY INTERFACE - Disclosed herein are systems and methods for implementing a SIMILAR command with a predictive query interface including means for generating indices from a dataset of columns and rows, the indices representing probabilistic relationships between the rows and the columns of the dataset; storing the indices within a database of a host organization; exposing the database of the host organization via a request interface; receiving, at the request interface, a query for the database specifying a SIMILAR command term, a specified row as a parameter for the SIMILAR command term, and a specified column as a parameter for the SIMILAR command term; querying the database using the SIMILAR command term and passing the specified row and the specified column to generate a predictive record set; and returning the predictive record set responsive to the query, the predictive record set having a plurality of elements therein, each of the returned elements of the predictive record set including (i) a row identifier which corresponds to a row of the dataset assessed to be similar, according to a latent structure, to the specified row passed with the SIMILAR command term based on the specified column and (ii) a confidence indicator which indicates a likelihood of a latent relationship between the specified row passed with the SIMILAR command and the row identifier returned for the respective element. Other related embodiments are further disclosed. | 09-18-2014 |
20140280194 | METHOD AND SYSTEM FOR GENERATING AND USING A MASTER ENTITY ASSOCIATIVE DATA NETWORK - A method and system for generating and using a master entity associative data network. The master entity associative data network system includes data sources storing data entities. Each data entity is of a specific entity type having specific entity properties, associations to other data entities, and an assigned common identifier label. Also, the master entity associative data network system includes a data processor configured to extract data entities from the data sources and extract associations to other data entities. The data processor stitches the data entities to each other based on at least one or more of the extracted associations, specific entity properties, and the common identifier label to generate one or more master data entities. | 09-18-2014 |
20140280195 | Educational Hub - In an example implementation, a global educational hub provides academic users with collaboration tools to exchange and build on the accumulated knowledge of subjects of interest. Users are provided with a central location to collect, categorize, and rank resources, and store notes and comments related to a particular section of an electronic textbook being studied. Users also can refer to online resources, such as web services, multimedia, website pages, newsgroups, search engine results, RSS, any other current and future external resources, and other user's notes. | 09-18-2014 |
20140280196 | METHOD, USER TERMINAL, AND WEB SERVER FOR PROVIDING SERVICE AMONG HETEROGENEOUS SERVICES - A method of providing a service among heterogeneous services may include verifying whether a second web application associated with an external web service is installed in a user agent, when data of the external web service is requested from a first web application executed in the user agent, requesting key information to be used for accessing the requested data from a first server providing a web service associated with the first web application, when the second web application is installed in the user agent, receiving the key information from the first server, and accessing the requested data existing on the second web application, using the received key information, in the first web application. | 09-18-2014 |
20140280197 | LOG FILE MANAGEMENT TOOL - A system includes: a processor; and a memory, wherein the memory stores instructions that, when executed by the processor, cause the processor to: retrieve a first log over a network from a first server of a plurality of servers for at least one contact center; generate an index of values in the first log; update an index database based on the index; and save the first log on a storage device. | 09-18-2014 |
20140280198 | OBJECT STORE MANAGEMENT OPERATIONS WITHIN COMPUTE-CENTRIC OBJECT STORES - Object store management operations within compute-centric object stores are provided herein. An exemplary method may include transforming an object storage dump into an object store table by a table generator container, wherein the object storage dump includes at least objects within an object store that are marked for deletion, transmitting records for objects from the object store table to reducer containers, such that each reducer container receives object records for at least one object, the object records comprising all object records for the at least one object, generating a set of cleanup tasks by the reducer containers, and executing the cleanup tasks by a cleanup agents. | 09-18-2014 |
20140289257 | METHODS AND SYSTEMS FOR PROVIDING FILE DATA FOR MEDIA FILES - A method of providing file data for a media file from a file system, where the format of the file requires that the file comprise a plurality of segments of file data located at pre-declared locations within the file. The file data making up the file is based upon derived file data. After receiving a request to open the file, a maximum segment length for the file is determined based on the desired properties of the file. In response to a request for the location of a segment within the file, a location calculated by considering each segment of the file to have the maximum segment length is returned. In response to a request for file data from a segment of the file, generated file data for the segment is returned, where the generated file data comprises the derived file data and padding data to give the generated file data the maximum segment length. | 09-25-2014 |
20140289258 | EMAIL TAGS - Suggesting email tags. A non-transitory machine-readable storage device includes executable instructions that when executed, cause one or more processor to provide a suggestion for at least one suggested tag based on content of an email, receive a selection of a selected tag store the email in a computer database, thus creating a stored email, and associate the selected tag with the stored email in the computer database. | 09-25-2014 |
20140297650 | METHOD AND APPARATUS FOR MANAGING INDEX IN A SHARED MEMORY - The present invention relates to a method of managing an index in a shared memory in a multi-process environment. An apparatus for managing an index in a shared memory is implemented to log an address of an index node where an operation is to be performed, information on a number of slots stored in a slot count unit, and information on moved slots stored in a slot movement count unit. | 10-02-2014 |
20140297651 | USING HASH TABLE TO RETRIEVE LEAF INDEX BLOCKS - In a Database Management System (DBMS), an index is used to organize the data such that the data can be efficiently retrieved to satisfy queries. The index may be organized in blocks which are stored in memory or on disk of a computer system. To satisfy a DBMS request, such as an update or query of data, multiple index blocks may need to be retrieved and processed. The claimed subject matter reduces the number of index blocks that needs to be retrieved and processed to satisfy the DBMS requests. | 10-02-2014 |
20140297652 | Hierarchical, Parallel Models for Extracting in Real-Time High-Value Information from Data Streams and System and Method for Creation of Same - A computer system identifies high-value information in data streams. The computer system receives a receiving a plurality of data streams. Each of the data streams includes a plurality of posts. Each of the posts includes a content portion and one or more source characteristics, In real time, for each post in a particular data stream: the system assigns the post a post identifier; generates a content packet and one or more source packets; queries memory to access a source profile using a respective source identifier included in the content packet; correlates the content packet with information from the source profile to produce a correlated content packet; and broadcasts the correlated content packet to a plurality of filter graph definitions. | 10-02-2014 |
20140304273 | Systems And Methods For Providing Hi-Fidelity Contextual Search Results - Systems and methods for providing hi-fidelity contextual search results are described. In one described embodiment, a method for providing hi-fidelity contextual search results includes receiving a query comprising a search term, determining a location on a page that is responsive to the query, wherein the page has a native appearance, determining a contextual area associated with the location on the page, and causing the contextual area to be output in a hi-fidelity result set, wherein the contextual area has an appearance the same as the native appearance of the page. | 10-09-2014 |
20140304274 | Systems and Methods for Publishing Datasets - Systems and methods for publishing datasets are provided herein. According to some embodiments, methods for publishing datasets may include receiving a request to publish a dataset to at least one of an internal environment located within a secured zone and an external environment located outside the secured zone, the request comprising at least one selection criteria, selecting the dataset based upon the at least one selection criteria, the dataset being selected from an index of collected datasets, and responsive to the request, publishing the dataset to at least one of the internal environment and the external environment. | 10-09-2014 |
20140310287 | METHOD AND SYSTEM FOR STORING DATA IN COMMODITY COMPUTING - Disclosed herein is a method and a system for data storage and retrieval that is agnostic to the database models of any of the databases in embodiments of the common infrastructure. Data may be stored via one database, according to a corresponding model, and then stored such that the data may be retrieved by a second database with a different database model. The data is stored in key-value pairs, and is thus not stored according to any database model. However, metadata perspectives construct a set of retrieved data into a format expected by the database retrieving the set of data. | 10-16-2014 |
20140317123 | INDEXING OF SIGNIFICANT MEDIA GRANULARS - Identifying significant media granulars may comprise partitioning a media document into one or more media granulars. A score may be assigned to each of the one or more media granulars. The one or more media granulars may be monitored to detect one or more user actions on the one or more media granular. The score associated with a respective one of the media granulars may be updated based on a number of user actions detected on the respective one of the media granulars. A media granular may be identified as being significant based on the score meeting a threshold. | 10-23-2014 |
20140324874 | MANAGEMENT OF A DATABASE SYSTEM - A method, system, and computer program product to manage a database is disclosed. The method, system, and computer program product may include structuring the database to have a first table having an index and a second table. A first key of the first table may be related to a second key of the second table. The method, system, and computer program product may include creating an entry locator in the index. The method, system, and computer program product may include maintaining an association between the second key of the second table and the entry locator of the index. | 10-30-2014 |
20140324875 | INDEX FOR FAST BATCH UPDATES OF LARGE DATA TABLES - Systems and processes for managing data using a composite index formed from a major sub-index and zero or more minor sub-indexes are described. Updates to the data may be cached in memory. When the cache memory becomes full, the contents of the cache may be sorted and stored as entries in a minor sub-index in a hard-disk drive with a single streaming disk write. In response to a threshold condition, the major sub-index may be updated using streaming disk accesses based on the entries in the minor sub-indexes. Once the major sub-index is updated to include all of the updates from the minor sub-indexes, the minor sub-indexes may be deleted. | 10-30-2014 |
20140324876 | MANAGEMENT OF A DATABASE SYSTEM - A method, system, and computer program product to manage a database is disclosed. The method, system, and computer program product may include structuring the database to have a first table having an index and a second table. A first key of the first table may be related to a second key of the second table. The method, system, and computer program product may include creating an entry locator in the index. The method, system, and computer program product may include maintaining an association between the second key of the second table and the entry locator of the index. | 10-30-2014 |
20140324877 | SEARCH INDEX - Method of searching comprising applying a function to individual elements within a digital work to form a set of index elements. Storing the index elements as an index for the digital work. Receiving a search term. Applying the function to one or more individual elements within the search term to convert the search term into one or more converted search term elements. Identifying a digital work having an index containing one or more index elements that match one or more of the converted search term elements. Returning search results of the identified digital work. Searchable index for a digital work formed by applying a function to individual elements within the digital work to form a set of index elements. | 10-30-2014 |
20140324878 | SYSTEM AND METHOD FOR SEARCH INDEXING - A system includes circuitry configured to: read a plurality of character information and a plurality of identifiers that are included in a text file; determine whether a character information among the plurality of character information is included between the at least one pair of identifiers among the plurality of identifiers in the text file; and associate the character information with the at least one pair of identifiers when it is determined that the character information is included between the at least one pair of identifiers. | 10-30-2014 |
20140324879 | CONTENT BASED SEARCH ENGINE FOR PROCESSING UNSTRUCTURED DIGITAL DATA - Systems and methods for receiving and indexing native digital data and generating signature vectors for subsequent storage and searching for such native digital data in a database of digital data are disclosed. Native digital data may be transformed into associated transform data sets. Such transformation may comprise entropy-like transforms and/or spatial frequency transforms. The native and associated transform data sets may then be partitioned in to spectral components and those spectral components may have statistical moments applied to them to create a signature vector. Other systems and methods for processing non-image digital data are disclosed. Non-image digital data may be transformed into an amplitude vs time data set and a spectrogram may then be applied to such data sets. Such transformed data sets may then be processed as described. | 10-30-2014 |
20140324880 | INDEX SEARCHING USING A BLOOM FILTER - A system for index searching comprises a first index, a second index, a processor, and a memory. The processor is configured to determine, using a Bloom filter for the second index of fingerprints, whether the Bloom filter indicates that an entry corresponding to a segment has been previously stored in the second index. The processor is further configured to, in the event that the Bloom filter indicates that an entry corresponding to the segment has been previously stored in the second index, retrieve a second index entry associated with the segment that has been previously stored. The memory is coupled to the processor and configured to provide the processor with instructions. | 10-30-2014 |
20140324881 | METHOD AND APPARATUS FOR STRESS MANAGEMENT IN A SEARCHABLE DATA SERVICE - Method and apparatus for stress management in a searchable data service. The searchable data service may provide a searchable index to a backend data store, and an interface to build and query the searchable index, that enables client applications to search for and retrieve locators for stored entities in the backend data store. Embodiments of the searchable data service may implement a distributed stress management mechanism that may provide functionality including, but not limited to, the automated monitoring of critical resources, analysis of resource usage, and decisions on and performance of actions to keep resource usage within comfort zones. In one embodiment, in response to usage of a particular resource being detected as out of the comfort zone on a node, an action may be performed to transfer at least part of the resource usage for the local resource to another node that provides a similar resource. | 10-30-2014 |
20140330834 | COMPARING MARKUP LANGUAGE FILES - Processing markup language files can include pre-processing an original file and a new file, wherein the original file and the new file are markup language files. Pre-processing includes removing tags from the original file and the new file resulting in a pure text version of the original file and a pure text version of the new file. Using a processor, the pure text version of the original file is compared with the pure text version of the new file to determine differences. The differences are specified in a comparison result. The comparison result is post-processed by applying formatting to the differences and tags from the new file. | 11-06-2014 |
20140330835 | DOCUMENT ORDER MANAGEMENT VIA RELAXED NODE INDEXING - An illustrative embodiment includes processing parse tree data. A parse tree data structure that is representative of a document object model (DOM) tree data structure is received. Concomitant to receiving the parse tree data structure, an assignment of index values for the DOM nodes consisting of distinct index values for each existing DOM node is received by the processor. Requests to manipulate the parse tree data structure that include node inserts and document order comparisons are also performed. | 11-06-2014 |
20140330836 | OFFLINE SEARCHING OF ENCRYPTED CONTENT - The present disclosure is directed towards systems and methods for searching of encrypted content. Upon receiving a search request on an access device, the systems and methods of the present disclosure determine a requested content item type based on the search request and identify one or more responsive indices according to the requested content item type. The systems and methods of the present disclosure then interrogate the one or more responsive indices to determine one or more responsive index entries to the search request and generate one or more search results responsive to the search request comprising one or more responsive index entries and a document identifier for each responsive index entry. The systems and methods of the present disclosure then receive a selected search result, identify the encrypted content item associated with the selected search result according to the document identifier, decrypt and transmit the content item. | 11-06-2014 |
20140330837 | METHOD, APPARATUS AND SYSTEM FOR PUSHING MICRO-BLOGS - A method for pushing micro-blogs is provided. The method is performed at micro-blog pushing server. In this method, a set of keywords is determined, each keyword is associated with at least one of micro-blogs received at a micro-blog receiving server, an index of the plurality of micro-blogs is created using the set of keywords, a subset of keywords, each having an interest value greater than a preset threshold, are obtained, micro-blogs associated with the obtained keywords are searched for, the searched out micro-blogs are sorted based on a timeline or based on relevancies of the micro-blogs to the keywords having interest values greater than a preset threshold, and then the sorted micro-blogs are pushed to the user. In addition, an apparatus, a system and a non-transitory computer-readable storage medium related to the method are also provided. | 11-06-2014 |
20140337353 | Bio Data Filter Interpretation Apparatus - A therapeutic optimization platform is described that encompasses a bio data filter interpretation apparatus for the purpose of selecting, validating and optimizing digital health content by way of linking the clinician and patient with meaningful medical data. | 11-13-2014 |
20140337354 | Indexing Preview - Embodiments are directed towards previewing results generated from indexing data raw data before the corresponding index data is added to an index store. Raw data may be received from a preview data source. After an initial set of configuration information may be established, the preview data may be submitted to an index processing pipeline. A previewing application may generate preview results based on the preview index data and the configuration information. The preview results may enable previewing how the data is being processed by the indexing application. If the preview results are not acceptable, the configuration information may be modified. The preview application enables modification of the configuration information until the generated preview results may be acceptable. If the configuration information is acceptable, the preview data may be processed and indexed in one or more index stores. | 11-13-2014 |
20140344281 | SYSTEM AND METHOD FOR MEASURING THE VIRALITY OF INTERNET CONTENT USING INDICES - A system may measure the virality of content items on a network. The virality may be measured for Internet content using indices. Indices may be generated that represent the share or news worthiness of content. An indexer may monitor the content items and generate a score which may be used to identify which content items are most likely to receive a user access request or referral from the remote server. A logic generator may display visual elements as graphical representations of the index score. | 11-20-2014 |
20140344282 | SYSTEMS, METHODS AND DEVICES FOR DETERMINING SLEEP QUALITY WITH WEARABLE DEVICES - The methods and systems described herein may involve determining at least one lifeotype of at least one individual, analyzing the at least one lifeotype, and delivering content to at least one individual based on the analysis. The methods and systems described herein may involve providing a game, determining at least one lifeotype of at least one player of the game, analyzing the at least one lifeotype, and affecting the game play based on the analysis. The methods and systems described herein may involve providing an interactive space, determining at least one lifeotype of at least one individual in the space, analyzing the at least one lifeotype, and modifying at least one attribute of the space based on the analysis. | 11-20-2014 |
20140344283 | METHOD OF SERVER-BASED APPLICATION HOSTING AND STREAMING OF VIDEO OUTPUT OF THE APPLICATION - A server-based application hosting and streaming of video output across a network to a client terminal. A method receives, via the network, an initiation request from a client terminal to stream an application from an application server to the client terminal. Retrieving from an application database a common application data image of the application, and retrieving from a profile database incremental application data uniquely associated with the user indicative of a recent application state for the user. The method includes modifying the common application data image in accordance with the incremental application data to create a modified application image which is unique to the user and indicative of the recent application state. The method includes spawning and hosting a user-specific instance of the application based on the modified application image followed by streaming at least video output of the user-specific instance of the application to the client terminal. | 11-20-2014 |
20140344284 | COMMAND LINE INTERPRETER FOR ACCESSING A DATA OBJECT STORED IN A DISTRIBUTED STORAGE NETWORK - A user device includes a DSN interface operably coupled with a DSN memory and a DS processing module for storing and retrieving a data object from the DSN memory, wherein the data object is partitioned into a plurality of data segments and wherein each of the plurality of data segments is stored in the DSN memory as a plurality of encoded data slices that are generated based on an error encoding dispersal function. The user device also includes a command line interpreter operable to interpret a file command as a request to display the data object by an application program, determine the data object is stored in the DSN memory and request the DSN processing module to retrieve the data object from the DSN memory and provide the data object to the application program. | 11-20-2014 |
20140351260 | ENCRYPTED SEARCH ACCELERATION - Techniques are disclosed for using a false positive-prone data structure to store normalization data for use in an encrypted search context. A file host server in a cloud provider network may provide search functionality for a user's encrypted data by use of a search index. In updating the search index, the server consolidates common identities of the user on different web services (e.g., an e-mail service or a social media service) into a normalized identifier. These normalization values are stored in the false positive-prone data structure (e.g., a colliding hash table, a Bloomier filter, etc.). Consequently, the data structure may return false positives after a search operation. The server may filter out these false positives before returning search results. | 11-27-2014 |
20140351261 | REPRESENTING ENTERPRISE DATA IN A KNOWLEDGE GRAPH - A knowledge base provides a mechanism for storing an organization's data in a way that represents the semantics of the data being stored. The knowledge base may include a knowledge graph that represents relationships between the different classes of data comprising the organization's data. Data that is loaded into the knowledge graph may be stored in data tables associated with the knowledge graph, and cross referenced with node identifiers that contain the data. Searching the knowledge base includes parsing a search input to identify terms in the search input, and mapping the terms to nodes in the knowledge base using the cross referenced information. The relationships among the identified nodes are used to identify a suitable application for processing the search. | 11-27-2014 |
20140351262 | METHODS AND SYSTEMS FOR MANAGING DATA - Systems and methods for managing data, such as metadata or index databases. In one exemplary method, a notification that an existing file has been modified or that a new file has been created is received by an indexing software component, which then, in response to the notification performs an indexing operation, where the notification is either not based solely on time or user input or the notification includes an identifier that identifies the file. Other methods in data processing systems and machine readable media are also described. | 11-27-2014 |
20140358932 | Entity Integration Using High-Level Scripting Languages - Embodiments of the present invention relate to a new method of entity integration using high-level scripting languages. In one embodiment, a method of and computer product for entity integration is provided. An entity declaration is read from a machine readable medium. The entity declaration describes an entity including at least one nested entity. An index declaration is read from a machine readable medium. The index declaration describes an index of nested entities. An entity population rule is read from a machine readable medium. The entity population rule describes a mapping from an input schema to an output schema. The output schema conforms to the entity declaration. A plurality of input records is read from a first data store. The input records conform to the input schema. The entity population rule applies to the plurality of records to create a plurality of output records complying with the output schema. An index of nested entities is populated. The index complies with the index declaration. | 12-04-2014 |
20140358933 | Information Association And Suggestion - Methods and systems for processing, analyzing, and managing information are disclosed. An example method can comprise receiving at least two data groups. Each of the at least two data groups can comprise one or more values. An example method can also comprise associating the at least two data groups based on a first value being in each of the at least two data groups. Associating the at least two data groups can comprise iteratively performing a union operation on the at least two data groups. An example method can further comprise providing a suggestion to add a second value from one data group of the at least two data groups to another data group of the at least two data groups. | 12-04-2014 |
20140358934 | DATABASE SYSTEM AND METHOD FOR SEARCHING DATABASE - A plurality of servers are each coupled to a different one of a plurality of databases that are synchronized with each other. A search request specifying a search range is sent from a search-requesting device to the servers. Each server calculates a search subrange from the search range specified in the search request received from the search-requesting device, performs a data search operation only within the calculated search subrange of the database coupled thereto, and sends a search result back to the search-requesting device. | 12-04-2014 |
20140358935 | SYSTEMS AND METHODS FOR USING METADATA TO ENHANCE DATA IDENTIFICATION OPERATIONS - Systems and methods for managing electronic data are disclosed. Various data management operations can be performed based on a metabase formed from metadata. Such metadata can be identified from an index of data interactions generated by a journaling module, and obtained from their associated data objects stored in one or more storage devices. In various embodiments, such processing of the index and storing of the metadata can facilitate, for example, enhanced data management operations, enhanced data identification operations, enhanced storage operations, data classification for organizing and storing the metadata, cataloging of metadata for the stored metadata, and/or user interfaces for managing data. In various embodiments, the metabase can be configured in different ways. For example, the metabase can be stored separately from the data objects so as to allow obtaining of information about the data objects without accessing the data objects or a data structure used by a file system. | 12-04-2014 |
20140365498 | Finding A Data Item Of A Plurality Of Data Items Stored In A Digital Data Storage - A computer implemented method for finding at least one data item of a plurality of data items ( | 12-11-2014 |
20140365499 | System and Method for Determining Concepts in a Content Item Using Context - The present invention is directed towards systems and methods for indexing one or more items of content. The method of the present invention comprises extracting one or more items of text from a given item of content. The one or more items of extracted text are tokenized into one or more concepts. One or more related concepts associated with the one or more concepts are identified. A support score is generated for the one or more concepts, and the item of content is index with the one or more concepts and the one or more associated support scores. | 12-11-2014 |
20140372447 | KNOWLEDGE INDEX SYSTEM AND METHOD OF PROVIDING KNOWLEDGE INDEX - A knowledge index system and a method of providing a knowledge index are provided. The knowledge index system includes: a knowledge graph storage unit that includes a plurality of nodes representing a core word or a subject word about knowledge of a specific field and an edge representing an association relationship between the plurality of nodes with a line, and that stores a knowledge index representing a structure of the knowledge field in a graph form having a cycle in which more than one path may exist between any pair of nodes by displaying a related degree between two nodes at the edge; a data storage unit that stores data constituting the knowledge index; and a display unit that outputs a knowledge index of the graph form on a screen. | 12-18-2014 |
20140372448 | SYSTEMS AND METHODS FOR SEARCHING CHEMICAL STRUCTURES - Systems, methods, and computer-readable media are provided for distributing structured data sets. In accordance with one implementation, a computer-implemented method is provided that comprises operations performed by one or more processors, including receiving structured data, the structured data including a plurality of entity data elements and one or more relationship data elements; assigning universal identifiers to the entity data elements; and determining one or more relationship instances, the one or more relationship instances corresponding to one or more relationships between the assigned universal identifiers according to the one or more relationship data elements. The method also includes segmenting the entity data elements into sub elements having types, and distributing the sub elements among a plurality of entity partitions and distributing the determined one or more relationship instances among one or more relationship partitions. | 12-18-2014 |
20140372449 | IMAGE-BASED INDEXING IN A NETWORK-BASED MARKETPLACE - A method and system to index and search item listings in a network-based publication system, such as a network-based marketplace. For example, a listing query may be received from a user, the listing query including a query image. The query image may then be processed to generate a plurality of index sets. Each index set of the plurality of index sets may correspond to a respective portion of the query image and may comprise a set of index values. Each index value of the set of index values may be indicative of an image attribute, such as light intensity, for a respective sub-portion of the corresponding portion of the query image. At least one result image from a database of listing images may be identified by comparing the plurality of index sets for the query image to pre-compiled index sets in an index database. Each pre-compiled index set may be associated with at least one listing image which, in turn, may be associated with an item listing on a network-based marketplace. | 12-18-2014 |
20140379725 | ON DEMAND PARALLELISM FOR COLUMNSTORE INDEX BUILD - The degree of parallel processing used to build a database index can be dynamically adjusted based on actual memory usage of individual parallel processing units. Memory can be reserved to prevent an out-of-memory condition. A predetermined number of initial parallel processing units can be activated. The actual usage of resources by the initial activated parallel processing unit(s) can be measured to establish an initial baseline for resource consumption per parallel processing unit. The baseline for resource consumption per parallel processing unit can be used to determine how many additional parallel processing units are activated. The actual resource usage of each parallel processing unit can be measured and used to refine the baseline memory usage. The refined average memory usage can be used to determine how many additional parallel processing units are activated. | 12-25-2014 |
20140379726 | Associative Memory and Data Searching System and Method - A method for searching sequences includes storing a probe sequence and a target sequence expressed in a first orthogonal domain. The target sequence includes potential probe match sequences each characterized by the length of the target sequence. The probe sequence representation and the target sequence are transformed into an orthogonal domain. In the orthogonal domain, the target sequence is encoded with modulation functions to produce a plurality of encoded target sequences, each of the modulation functions having a position index corresponding to one of the potential probe match sequences. The plurality of encoded target sequences is interfered with the probe sequence in the orthogonal domain and an inverse transform result is obtained. If the inverse transform result exceeds a threshold, information is output indicating a match between the probe sequence and a corresponding one of the potential probe match sequences. | 12-25-2014 |
20140379727 | DATABASE MANAGEMENT METHOD AND INFORMATION PROCESSING APPARATUS - An information processing apparatus includes a determination unit and an output unit. If data included in a database with an index is updated, the determination unit determines whether to perform an index update process so that the index update process will be performed with a determined probability. If the determination unit determines to perform the index update process, the output unit outputs an index update request. | 12-25-2014 |
20150012543 | REGION LABELING METHOD AND DEVICE OF DATA DOCUMENTS - A region labeling method of data documents and a device thereof are provided. The region labeling method includes: obtaining a tree structure, which has a plurality of nodes including a plurality of administrative division names and iconic names with a hierarchical relationship therebetween; receiving the data document and retrieving at least one keyword from the data document; comparing the at least one keyword with the nodes to find a first node matching the at least one keyword; and labeling the first node and at least one father node of the first node to the data document. | 01-08-2015 |
20150019561 | Method for generating indexes for downloading data - A method for bi-directional data replication between a Web Service application and a relational database are provided. In an electronic communications network, a computer implemented method is provided for generating indexes of relationship fields by accessing the metadata of a remote web service. A plurality of records may be downloaded from the remote web service to a computer. The plurality of records may include metadata related to the identity of data fields involved in parent-child record relationships. An index may be built at least partially on the data fields involved in parent-child relationships. | 01-15-2015 |
20150019562 | METHOD AND SYSTEM OF MAPREDUCE IMPLEMENTATIONS ON INDEXED DATASETS IN A DISTRIBUTED DATABASE ENVIRONMENT - In one exemplary embodiment, a method of a distributed database system includes the step receiving a query in a query language from a client with a distributed database system. An index that matches the query is located. The index is pre-generated from a database table in the distributed database system. A map function of a MapReduce programming model is implemented using the index. A reduce function of the MapReduce programming model is implemented using the output of the map function. Optionally, a finalize function can be implemented using the output of the reduce function. The distributed database system can be a scalable NoSQL database. The reduce function can be optional when the value of the output of the map function is guaranteed to be unique. | 01-15-2015 |
20150026186 | Enterprise Asset Management - Methods and apparatuses for asset management are disclosed. In one example, a system and method includes receiving an indication of a coupling of an asset to a computing device. An asset identifier stored on the asset is received and an asset data record corresponding to the asset is located. A data associated with the asset is recorded in the asset data record. | 01-22-2015 |
20150026187 | INFERRING RELEVANCE BASED ON USER INTERACTIONS WITH EMAIL - Embodiments are directed towards employing an index of relevant items to determine and provide items that are relevant to a query. A data store that includes a plurality of items may be generated for each user. The user data store may be employed to generate an index of relevant items for each user. Relevant items may be pre-ranked in the index based on a degree of relevancy to a corresponding item. The index of relevant items may be employed to determine ranked relevant items associated with at least one item associated with a query. In response to the query, the relevant items may be further ranked or re-ranked based on at least a current time of the query and a time decay function associated the relevant items. A subset of the determined ranked relevant items may be provided to a client device for display to the user. | 01-22-2015 |
20150026188 | SYSTEM AND METHOD FOR MATHEMATICS ONTOLOGY EXTRACTION AND RESEARCH - An extensive computer based online math research system (the “Research System”) having as its foundation an Ontology of mathematics, and utilizing unique and intensive computer support, coordination, data structuring, data storage, computer processing, retrieval capabilities, and data-mining capabilities, and an Ontology editing system that runs on computer software with computer processors and data storage capabilities (the “Ontology Editor System”). The Research System also includes a methodology to enable online reference and data manipulation of the Ontology, and an Internet based search of the concepts of mathematics and applications of mathematics to the sciences on the basis of the Ontology. | 01-22-2015 |
20150026189 | INDEX STRUCTURE FOR A RELATIONAL DATABASE TABLE - Embodiments are directed to building an index structure for a relational database (RDB) table and for querying the RDB table. An aspect includes obtaining JSON documents from a RDB table, identifying JSON objects in the JSON documents and assigning identifiers for them, and building an index structure according to values in the JSON objects such that the index structure includes at least one node that records values in a JSON object and includes the identifier of the specific JSON object. Querying the RDB table includes receiving a query request including an indicator which indicates that the query is to be performed on JSON objects, and searching the index structure based on query conditions in the query request so as to determine a target JSON object that meets the query conditions. | 01-22-2015 |
20150032756 | NORMALIZED SEARCHABLE CLOUD LAYER - An example system for indexing heterogeneous resources includes a data module that invokes one or more APIs, and responsive to the invoked one or more APIs, receives a first set of attributes of one or more cloud resources of a first subset of the set of cloud resources and a second set of attributes of one or more cloud resources of a second subset of the set of cloud resources. The system also includes a normalization module that for at least one attribute of the first and second sets of attributes, searches a database for a normalized attribute corresponding to the respective one attribute, normalizes based on the uniform schema the respective attribute, and places the normalized attribute in a data structure. The system further includes an indexing module that indexes the normalized attributes in the data structure. | 01-29-2015 |
20150032757 | SYSTEMS AND METHODS FOR DATA COMPRESSION - Event data comprising an unordered string set may be received. String set dictionary indexes may be assigned for strings of the unordered string set in a string set dictionary. The unordered string set may be sorted to provide a sorted series based on the string set dictionary indexes for the unordered string set. A differential series may be computed from the sorted series. The differential series may be encoded into binary code words. In an embodiment, the event data also may comprise strings. A schema version associated with the strings in a row may be determined. Computing resources may be allocated based on the schema version. | 01-29-2015 |
20150032758 | High Performance Index Creation - High performance index creation using parallel query plans with repartitioning scan and vector-based repartitioning scan is described. An example method includes extracting index key columns from data rows of the database table to build a set of index rows, wherein the index on the database table is defined by a plurality of index key columns including a first index key column and a second index key column. Partition boundary values are generated to divide up the index rows into range-partitioned sets, and the index rows are sorted based on values of the index key columns. A repartitioning scan, including a SARG-based or a vector-based partitioning scan is performed on the index rows, using a plurality of worker threads executing in parallel to build sub-indexes. Subsequently, each range-partitioned set of index rows are assigned to a worker thread in the plurality of worker threads. Accordingly, the sub-indexes generated from the plurality of work threads are merged to build the index for the database table. | 01-29-2015 |
20150032759 | SYSTEM AND METHOD FOR ANALYZING RESULT OF CLUSTERING MASSIVE DATA - Disclosed are a system and a method for analyzing a result of clustering massive data. An open-source map/reduce framework named Hadoop is used to calculate a silhouette coefficient corresponding to a significance verification index capable of evaluating a result of clustering massive data. To implement the system and the method for analyzing a result of clustering massive data, clustered data is divided into blocks. For all of the blocks, input splits are generated. Then, the generated input splits are assigned to multiple computers. Each computer stores only data of blocks included in an input split assigned in a memory, and calculates a silhouette coefficient for each record. Each computer provides only the calculated silhouette coefficient to an index coefficient calculation apparatus, and enables the index coefficient calculation apparatus to calculate a silhouette coefficient for a cluster. Therefore, the result of clustering the massive data can be rapidly and objectively analyzed. | 01-29-2015 |
20150046463 | SYSTEM AND METHOD FOR CONTENT STORAGE - A system and method of content storage includes a content store for storing a plurality of first content blocks and one or more first indexes, a content manager including one or more first content analyzers, and an interface for receiving access requests for the plurality of first content blocks. Each of the one or more first indexes includes a plurality of first entries for organizing a plurality of second content blocks from the plurality of first content blocks according to a respective organizational structure. Each of the first entries includes a link to a corresponding content block from the plurality of second content blocks. Each of the plurality of content analyzers monitors the plurality of first content blocks for changes and updates a second index from the one or more first indexes based on information associated with the monitored changes. | 02-12-2015 |
20150046464 | DETERMINATION OF DIFFERENCES IN HIERARCHICAL DATA - A system includes determination of a first tree comprising a first hierarchy of nodes and a second tree comprising a second hierarchy of nodes, determination of first subtrees of the first tree which are identical to first subtrees of the second tree, generation of a first mapping between nodes of the first subtrees of the first tree and nodes of the first subtrees of the second tree, generation of a first feature vector for each of second subtrees of the first tree, insertion of the first feature vectors into an index structure, generation of a second feature vector for each of second subtrees of the second tree, and generation of a second mapping between nodes of second subtrees of the first tree and nodes of the second subtrees of the second tree based on the second feature vectors and the index structure. | 02-12-2015 |
20150046465 | SYSTEM AND METHOD FOR TARGETING RELEVANT RESEARCH ACTIVITY IN RESPONSE TO DIAGNOSTIC MARKER ANALYSES - A system and method for targeting relevant research activity for clinical application in response to diagnostic markers analyses is described. Diagnostic analysis is performed to detect the level of each of at least three diagnostic markers. The levels of the tested markers are used to identify relevant publications from among a large database of articles. The most relevant literature, such as, one which reports research and studies that have been conducted to identify, moderate, and define the mechanisms unique to individual and combinations of diagnostic markers for various disease states, is then provided to the patient and/or the patient's physician, optionally with a summarization of the treatment recommendations from the provided literature. The customized information delivery provides a range of published peer-reviewed therapeutic options and/or published research studies. | 02-12-2015 |
20150052147 | System And Method For Analyzing And Reporting Gateway Configurations And Rules - A method for analyzing and reporting gateway configurations and rules includes receiving configuration data from gateway devices providing access to an enterprise network. The gateway devices may be associated with multiple vendors. At least one computer processor parses the configuration information associated with each of the gateway devices to identify configuration output data for each of the gateway devices. The configuration output data for each of the gateway devices is stored in a single data structure. The configuration output data stored in the single data structure is indexed to generate one or more optimized data tables. | 02-19-2015 |
20150052148 | Method and System for High Performance Integration, Processing and Searching of Structured and Unstructured Data Using Coprocessors - Disclosed herein is a method and system for integrating an enterprise's structured and unstructured data to provide users and enterprise applications with efficient and intelligent access to that data. In accordance with exemplary embodiments, the generation of metadata indexes about unstructured data can be hardware-accelerated by processing streaming unstructured data through a reconfigurable logic device to generate the metadata about the unstructured data for the index. | 02-19-2015 |
20150058351 | QUERIES FOR THIN DATABASE INDEXING - A method of database indexing is disclosed. Two functions (f and g) from row values to row number values are set. The functions are utilized to determine a row number in a database column containing a target search value, wherein the target search value comprises a search value being sought in the database column. A candidate row number variable is set initially to the function g of the target search value by a processor. Iteratively the following is performed: a current value of the candidate row number variable is used as an address to read a value in a corresponding row in the database column, and the current value of the candidate row number variable is updated to the function f of the most recently read value in a corresponding row in the database column. | 02-26-2015 |
20150058352 | THIN DATABASE INDEXING - A method of indexing a database column is disclosed. A permutation function f is determined. A shortcut that connects two non-adjacent elements of a permutation cycle based on f is created. A traversal of the permutation cycle without the shortcut comprises starting at a first element of the permutation cycle by updating a candidate row number to an initial value, wherein the initial value comprises a value stored in the database column. Traversing through one element of the permutation cycle comprises reading a row value in the database column using the candidate row number and then updating the candidate row number with function f of the most recently read row value. The step of traversing through one element of the permutation cycle is repeated until the most recently read row value is equal to the initial value. Creating the shortcut comprises storing a relation between the two non-adjacent elements. | 02-26-2015 |
20150058353 | Report Acceleration Using Intermediate Summaries - A method and system for managing searches of a data set that is partitioned based on a plurality of events. A structure of a search query may be analyzed to determine if logical computational actions performed on the data set is reducible. Data in each partition is analyzed to determine if at least a portion of the data in the partition is reducible. In response to a subsequent or reoccurring search request, intermediate summaries of reducible data and reducible search computations may be aggregated for each partition. Next, a search result may be generated based on at least one of the aggregated intermediate summaries, the aggregated reducible search computations, and a query of adhoc non-reducible data arranged in at least one of the plurality of partitions for the data set. | 02-26-2015 |
20150066941 | PHOTO CATALOGING, STORAGE AND RETRIEVAL USING RELATIONSHIPS BETWEEN PEOPLE - A computer system includes a photo processing mechanism that allows cataloging and storing a user's photos using relationships between people that allow the user's photos to be retrieved using a search engine. A user enters people and specifies relationships, and may also enter locations, events, and other information. Photos are then processed, and indexing info is generated for each photo that may include any or all of the following: user-defined relationships, system-derived relationships, user-defined locations, system-defined locations, user-defined events, and system-derived events and ages for the people in the photos. The indexing info may be stored as metadata with the photo or may be stored separately from the photo. The indexing info allows photos to be retrieved using a powerful search engine. | 03-05-2015 |
20150066942 | UNIT OF WORK BASED INCREMENTAL DATA PROCESSING - Systems, methods, and other embodiments associated with a unit of work for incremental data processing are described. In one embodiment, a method includes selecting a unit of work key for a program that processes records in a source table. The unit of work key is selected such that modifications to a record having a certain unit of work key value will not affect the program's processing of records having a different unit of work key value. The selected unit of work key is associated with the program. When a record in the source table is created, modified, or deleted, a unit of work key value is identified for the record and a selected set of records having the identified unit of work key value is provided to the program. Thus, records in the data source that do not have the identified unit of work key values are not provided to the program. | 03-05-2015 |
20150066943 | DATABASE SYSTEM AND DATABASE MANAGEMENT METHOD - A database system comprises an interface for a storage device, and a control device for accessing the storage device through the interface. The storage device stores a table that manages a plurality of rows comprising a plurality of item values, and an index configured in a tree structure based on a plurality of nodes enabling to identify the rows by using one or more item values. The control device identifies an item value condition included in a search condition of a search query, determines whether or not the node of the index corresponding to the item value condition is reference-disabled, and when the node is reference-disabled, identifies a range of item values managed by the reference-disabled node, and searches for a row that satisfies the search condition for the identified range. | 03-05-2015 |
20150066944 | DATA PROCESSING, APPARATUS AND METHODS - The invention provides an apparatus for processing data items in a data source, and an equivalent method. More specifically, the apparatus is configured to reduce a lattice representation of data items in a data source. The apparatus has a processor that is operable to retrieve data items from the data source and to determine a context defining relationships between data items and one or more properties associated with the data items. An initial lattice data structure is generated based on the determined context. The processor then reduces the size of the lattice by dividing the initial lattice data structure into a plurality of subgroup lattice, reducing the size of each of the subgroups of lattice, and combining them to form a reduced lattice data structure. | 03-05-2015 |
20150066945 | LIBRARY INDEXING SYSTEM AND METHOD - A system and method for collectively indexing a collection of documents in paper or electronic form. The collective index is created for all documents, which may come from a variety of sources and be initially presented in a variety of formats. Documents are converted to a standard electronic format, then analyzed and processed to identify, locate, and tag objects of interest. Hierarchical menus are created with multiple categories. | 03-05-2015 |
20150066946 | Selectively Retrieving Search Results in Accordance with Different Logical Relationships - A search server system having one or more processors and memory receives a search query including filter criteria. The filter criteria specify operands and a logical relationship. In response to the search query the server identifies index portions sorted in a same sort order. The server identifies matching entities that match the search query by identifying index entries having a first identifier and matching filter(s) of a first operand and identifying, index entries having a second identifier that is sequentially adjacent to the first identifier in the sort order and matching filter(s) of a second operand. When index portion(s) associated with the first operand include index entries having a respective identifier between the first identifier and the second identifier and matching the filter(s) of the first operand, the server marks a respective entity associated with the respective identifier as matching the search query and sends search results to the client. | 03-05-2015 |
20150074115 | DISTRIBUTED STORAGE OF DATA - A file generation system for storage of structured data onto a distributed database includes an intermediate data generation module to generate a set of intermediate key-value pairs for each of one or more records in at least one subset of the structured data. A key-value pair includes a key and a value corresponding to the key, where the key is a unique identifier of the value. A file generation system further includes an intermediate data sorting module to sort the set of intermediate key-value pairs to generate a plurality of output files. Each of the plurality of output files includes at least one key-value pair. Further, the file generation system includes a file storing module to store the plurality of output files in the distributed database, where the plurality of output files are representative of the structured data. | 03-12-2015 |
20150074116 | INDEXING ATTACHABLE APPLICATIONS FOR COMPUTING SYSTEMS - Examples disclosed herein provide systems, methods, and software for indexing attachable applications for computing systems. In one example, a method of attaching an application to a computing system includes identifying an application attach triggering event for the computing system. The method further includes, identifying a storage location for the application based on the application attach triggering event and an application index, and attaching at least one storage volume to the computing system corresponding to the storage location. | 03-12-2015 |
20150081717 | SYSTEMS AND METHODS FOR GENERATING AND USING AGGREGATED SEARCH INDICES AND NON-AGGREGATED VALUE STORAGE - Systems, methods and computer program products for using searchable aggregate indices associated with non-aggregated value storage. In one method, a search system stores metadata values for each of a plurality of objects in a storage unit. The metadata values are stored in corresponding value storage locations that are associated with an identifiable metadata fields. An aggregate index is provided which includes a dictionary of terms that are contained in metadata values associated with a designated set of the metadata fields. The aggregate index is searched for one or more specific search terms, and one or more of the metadata values are retrieved from the value storage locations in response to the search, where the individual metadata fields associated with the retrieved metadata values are identified. | 03-19-2015 |
20150081718 | IDENTIFICATION OF ENTITY INTERACTIONS IN BUSINESS RELEVANT DATA - The present disclosure describes methods, systems, and computer program products for extracting entity interaction information from business relevant data. One computer-implemented method includes receiving a dataset comprising information about a plurality of entities and comprising a plurality of non-overlapping data subsets, each of the data subsets having the same predetermined size, analyzing the dataset to identify a plurality of interactions in the dataset, each identified interaction associated with two or more entities from the plurality of entities, receiving a query regarding a specific interaction for a specific entity, determining whether one of the identified interactions for the specific entity matches the specific interaction, and providing information from one or more non-overlapping data subsets that each comprise data about the specific interaction and the specific entity based on determining that at least one of the identified interactions for the specific entity matches the specific interaction. | 03-19-2015 |
20150088898 | Methods And Systems For Data Management And Analysis - Provided are methods comprising receiving a query for information from the database, determining particular data element types and data element values that are the subject of the query, instantiating a query data structure containing the data element types and the data element values that are the subject of the query, identifying records within the database that contain one or more data element types and/or data element values that are included in the query data structure, and instantiating a results data structure comprising information relating to the identified records. | 03-26-2015 |
20150088899 | SYSTEM AND METHOD FOR IDENTIFYING A SEGMENT OF A FILE THAT INCLUDES TARGET CONTENT - An electronic device with one or more processors, memory and a display obtains a file header for a file that corresponds to a plurality of separately indexed clusters. The file is divided into segments one or more of which include multiple sequential clusters. The file header includes a segment index that enables identification of a segment that includes requested content. The device receives a request to seek to a respective position within the file. In response, the device identifies a first segment that includes content that corresponds to the respective position based on the segment index, obtains the first segment, and obtains a content index for a respective cluster within the first segment. After obtaining the first segment and the content index, the device identifies respective content within the respective cluster that corresponds to the respective position based on a content index and provides the respective content to a user. | 03-26-2015 |
20150088900 | AGGREGATING KEYS OF DEPENDENT OBJECTS FOR A GIVEN PRIMARY OBJECT - Keys are obtained and aggregated by storing a primary object as an entry in a parent keys storage and a child keys storage, the entry identified as unvisited in each. An object evaluation process is then performed until all unique entries in the parent keys storage and all unique entries in the child keys storage have been visited and by committing the keys of at least one related object as an entry to the hierarchical database. The object evaluation process visits each unvisited object in the parent keys storage and child keys storage by selecting, for the unvisited object, objects in the parent direction that have not already been visited and objects in the child direction that have not already been visited and by inserting the keys of the selected related objects as entries in the parent keys storage or child keys storage. | 03-26-2015 |
20150095340 | Information Sets for Data Management - According to one embodiment of the present invention, a system selects member objects from among data objects stored in one or more data sources by evaluating criteria against information about the data objects in an index, wherein the information about the data objects in the index is derived from the data sources. The system creates a set of data objects comprising a collection of references to the selected member objects, and associates the set with a set-identifier. The system receives a request to apply an action to a subset of the data objects stored in the one or more data sources, wherein the request specifies the subset by including the set-identifier, and applies the action to the set of data objects associated with the set-identifier within the request. Embodiments of the present invention further include methods and computer program-products for managing data in substantially the same manners described above. | 04-02-2015 |
20150095341 | SYSTEM AND A METHOD FOR HIERARCHICAL DATA COLUMN STORAGE AND EFFICIENT QUERY PROCESSING - An embodiment provides intermediate data derived in the form of column stores which are in turn based on hierarchical data stores. This intermediate data represents a reduced subset of data matched appropriately to a query (or modified query) such that the amount of data handled in a query processing task on large data is greatly reduced. An embodiment may appropriately choose column data stores and/or modify queries in order leverage parallelization techniques such as map-reduce in order to query large data. The result is the ability to query large data stores in parallel while reducing the amount of data that must be handled. | 04-02-2015 |
20150095342 | Virtual table index mechanism and method for multi-attribute compound condition query - A virtual table index mechanism and a method for a multi-attribute compound condition query relate to a field of computer application technology. The virtual table index mechanism for a multi-attribute compound condition query includes: an index manager; a condition analyzer; and a pre-execution engine. The present invention forms index key values corresponding to a plurality of attributes in a virtual table. When querying, a query condition for the virtual table is calculated. Referring to a key value index, whether a mapping physical table of the virtual table should be executed is independently pre judged according to the attributes, so as to ensure that query is applied only to a virtual sub-table satisfying a corresponding key value condition. The present invention is effective for multi-attribute compound condition query, and is suitable for indexing virtual tables. | 04-02-2015 |
20150095343 | CLOUD DATABASE CONNECTION MULTIPLEXING - Techniques are described herein for routing requests for database cloud services. In an embodiment, a different Uniform Resource Identifier (URI) is assigned to each respective database schema of a plurality of database schemas in one or more databases. For each respective database schema, mapping data is stored that maps at least a portion of the URI to the respective database schema. In response to receiving a first request that includes a URI and a set of one or more database commands, a target database schema in a target database is identified based on the mapping data and the URI. A connection is established for accessing the target database schema in the target database. The set of one or more database commands is sent over the connection to access the target database schema in the target database. | 04-02-2015 |
20150095344 | Database Access Using Partitioned Data Areas - Provided are techniques that ensure efficient database accesses by partitioning. The techniques includes a partition generating unit which generates a value partition in which a unit subtree including target data to be partitioned is separated and registered according to the target data, and a base partition which includes an index for uniquely identifying the unit subtree, an XML parser which identifies positions of a start to and an end tag defining an attribute value of the structured document and generates a cutting position list Corresponding to the hierarchical structure of the attribute value for registration in a storage area, an XML cutting unit which identifies the start tag position and the end tag position of the unit subtree to be cut, and separates the unit subtree and its index from the structured document, and a cut-XML registration unit which registers the unit subtree and the index in the separate partitions. | 04-02-2015 |
20150106380 | DATABASE MANAGING METHOD, DATABASE MANAGING SYSTEM, AND DATABASE TREE STRUCTURE - Provided are a database (DB) managing method and system, wherein, while forming an index of a DB, a lower value and an upper value of key values of a plurality of records included in one page are stored as separators and an overlapping part of the key values is deleted from the plurality of records by using the separators to save a storage space where pages of the index is stored, and thus the performance of the DB is improved. | 04-16-2015 |
20150112996 | PERVASIVE SEARCH ARCHITECTURE - A pervasive search architecture that indexes personal content of a querying user and made accessible to the user by other users. A compute node of a personal content location facilitates index generation and serve of the index. The index is generated for personal content stored at the personal content location. For a given content location, the index may encapsulate content stored in a set of locations with access permissions. The indexing application runs periodically at the personal content location and incrementally indexes content that is added to the shared locations. The same application allows the user to configure locations with the desired access permissions for participation in the search. | 04-23-2015 |
20150112997 | METHOD FOR CONTENT CONTROL AND ELECTRONIC DEVICE THEREOF - A method for controlling a content in an electronic device is provided. The method includes detecting first tag information based on a content accessible by the electronic device, detecting second tag information based on an image included in the content, comparing the first tag information with the second tag information, and displaying information indicating that the content is not original on the content, when the first tag information and the second tag information do not satisfy a reference condition. | 04-23-2015 |
20150112998 | SYSTEMS AND USER INTERFACES FOR DYNAMIC AND INTERACTIVE SIMULTANEOUS QUERYING OF MULTIPLE DATA STORES - Embodiments of the present disclosure relate to a computer system and interactive user interfaces configured to enable efficient and rapid access to multiple different data sources simultaneously, and by an unskilled user. The unskilled user may provide simple and intuitive search terms to the system, and the system may thereby automatically query multiple related data sources of different types and present results to the user. Data sources in the system may be efficiently interrelated with one another by way of a mathematical graph in which nodes represent data sources and/or portions of data sources (for example, database tables), and edges represent relationships among the data sources and/or portions of data sources. For example, edges may indicate relationships between particular rows and/or columns of various tables. The table graph enables a compact and memory efficient storage of relationships among various disparate data sources. | 04-23-2015 |
20150112999 | SYSTEM AND METHOD FOR IMPLEMENTING A SCALABLE DATA STORAGE SERVICE - A system that implements a scalable data storage service may maintain tables in a non-relational data store on behalf of clients. The system may provide a Web services interface through which service requests are received, and an API usable to request that a table be created, deleted, or described; that an item be stored, retrieved, deleted, or its attributes modified; or that a table be queried (or scanned) with filtered items and/or their attributes returned. An asynchronous workflow may be invoked to create or delete a table. Items stored in tables may be partitioned and indexed using a simple or composite primary key. The system may not impose pre-defined limits on table size, and may employ a flexible schema. The service may provide a best-effort or committed throughput model. The system may automatically scale and/or re-partition tables in response to detecting workload changes, node failures, or other conditions or anomalies. | 04-23-2015 |
20150120745 | DYNAMIC DATABASE INDEXES FOR ENTITY ATTRIBUTE VALUE STORES - A system and method for database storage may be dynamically optimized for entity-attribute-value (EAV) data. In one embodiment, this dynamic EAV database may include one column for the entity identification (“ID”) and a second column for the serialized data for that ID (each attribute and the values for those attributes). The database may be dynamically updated to move or copy certain attributes to a separate column. The dynamic updating of one or more attributes may be based on certain factors (seasonal, temporal, usage, etc.), such as frequent requests for a particular attribute. This dynamic updating and indexing of particular attributes may improve certain queries and data requests. | 04-30-2015 |
20150120746 | Constant Mapping Optimization in a Database - A database server receives a query associated with a calculation scenario that defines a data flow model that includes one or more calculation nodes. Each calculation node defines one or more operations to execute on the database server, the database server comprising a column-oriented database. Thereafter, the database server instantiates the calculation scenario and executes the operations defined by the calculation nodes of the instantiated calculation scenario to result in a responsive data set having a result table with a row for each column in the column-oriented database implicated by the query having a constant value. The database server then provides the data set to the application server. Related apparatus, systems, methods, and articles are also described. | 04-30-2015 |
20150120747 | TECHNIQUES FOR SEARCHING DATA ASSOCIATED WITH DEVICES IN A HETEROGENEOUS DATA CENTER - Examples are disclosed for searching data associated with devices in a heterogeneous data center. In some examples, data may be gathered for a database. The data may be associated with a heterogeneous data center having devices from multiple vendors. The database may be indexed to link data for a given device to an attribute, association or usage context for the given device. Responsive to a search request, data for a given device from among the devices in the heterogeneous data center may be provided based on a comparison of the search request to the indexed database. Other examples are described and claimed. | 04-30-2015 |
20150120748 | INDEXING SPREADSHEET STRUCTURAL ATTRIBUTES FOR SEARCHING - Technologies are described herein for extracting and/or calculating statistics regarding attributes present in a spreadsheet document and indexing the structural or functional attributes. A search component is also provided for receiving search queries of attributes to search an index to find spreadsheet documents having attributes that meet the search query requirements. | 04-30-2015 |
20150120749 | DATA MANAGEMENT FOR CONNECTED DEVICES - Techniques and constructs to facilitate data management can provide improved response time and space efficiency for time-series data such as from connected devices. The constructs may enable receiving a stream of time-series data comprising a plurality of objects and a time identification associated with each of the objects. One or more tags are associated with the objects. The constructs may also chunk the stream into a plurality of contiguous chunks, each including a plurality of objects, create an index associating the time identification and the one or more tags, transmit the chunks to a first, remote storage, and then store the index. | 04-30-2015 |
20150120750 | SYSTEM AND METHOD TO STORE VIDEO FINGERPRINTS ON DISTRIBUTED NODES IN CLOUD SYSTEMS - A method to design, implement and create distributed indexes for storing and comparing fingerprints of videos is presented. The method effectively utilises cloud computing platforms that offer varying amounts of computing resources. The method enables the distributed index to scale to large numbers of data points and the distributed index is robust to failures within the computing resources maintaining the index. The method minimizes the memory required to maintain the distributed index and reduces the I/O operations needed to process operations performed on the index. The method improves the efficiency of the index to process queries. | 04-30-2015 |
20150120751 | APPARATUS AND METHOD FOR PROCESSING MAP DATA BY REAL-TIME INDEX CREATION AND SYSTEM THEREOF - Disclosed is a system and method capable of minimizing user-set search data and local filtering data by creating index data in real time. A search data processing system includes an index generation unit adapted to generate index data for a search on map-associated data of an item selected by a user; and a search execution unit adapted to execute a search operation on the map-associated data of the selected item by means of the index data. | 04-30-2015 |
20150120752 | INFORMATION PROCESSING SYSTEM, RECORDING MEDIUM, AND INDEX MANAGEMENT METHOD - An information processing system including: a plurality of second information processing apparatuses connected to a first information processing apparatus via a network; and a management apparatus. The plurality of second information processing apparatuses dispersedly manage indexes established for searches using data and metadata of the data. The management apparatus includes a processor configured to monitor a search request transmitted from the first information processing apparatus, and to specify, on the basis of a monitoring result by the processor, a combination of types of the metadata, for which the processor should cause a same second information processing apparatus to manage the indexes, and the same second information processing apparatus and cause the specified same second information processing apparatus to manage the indexes of the type of the metadata of the specified combination. | 04-30-2015 |
20150127656 | IN-ADVANCE INDEXING - A method for in-advance obtaining properties of objects of a computerized system, the method comprising receiving a request for properties of a first object, and responsively providing the requested properties along with properties of an at least one another object that are similar to the properties of the first object, wherein the properties of the at least one another object are similar to the properties of the first object by having at least one element of properties that are common therebetween, and wherein the method is carried out by an at least one apparatus component of the computerized system, and an apparatus for performing the same. | 05-07-2015 |
20150127657 | Method and Computer for Indexing and Searching Structures - A method for indexing a plurality of structures, which are derived from a plurality of externalizations of users' mental models, is provided. The method comprises receiving at least one of the plurality of structures; analyzing the at least one structure according to a predetermined principle of normalization to obtain a plurality of index analysis results in a form of a plurality of tuples comprising a plurality of elements and/or a plurality of relations thereof related to the plurality of structures; and generating an index store according to the index analysis result; wherein each externalized relation indicates each externalized sequence from one of externalized elements to another one of the externalized elements; wherein each relation indicates each sequence from one of a plurality of elements to another one of the plurality of elements. | 05-07-2015 |
20150134668 | Index of Video Objects - A system for indexing physical objects, locations and people, collectively referred to as video objects, which appear in videos. The system enables video object-level identification of TV and video content, and makes those video objects indexable, linkable, and searchable. | 05-14-2015 |
20150134669 | ELEMENT IDENTIFICATION IN A TREE DATA STRUCTURE - A computer-implemented method of providing unique identification information for an element in a tree data structure where the element does not have an independent, unique identifier associated therewith by which it can be identified without reference to any further element. The method comprises determining ( | 05-14-2015 |
20150134670 | Generic Indexing for Efficiently Supporting Ad-Hoc Query Over Hierarchically Marked-Up Data - Hierarchical data objects are indexed using an index referred to herein as a hierarchy-value index. A hierarchy-value index has, as index keys, tokens (tag name, a word in node string value) that are extracted from hierarchical data objects. Each token is mapped to the locations that correspond to the data for the token in hierarchical data objects. A token can represent a non-leaf node, such as an XML element or a JSON field. A location can be a region covering and subsuming child nodes. For a token that represents a non-leaf node, a location to which the token is mapped contains the location of any token corresponding to a descendant node of the non-leaf node. Thus, token containment based on the locations of tokens within a hierarchical data object may be used to determine containment relationships between nodes in a hierarchical data object. | 05-14-2015 |
20150134671 | METHOD AND APPARATUS FOR DATA DISTRIBUTION AND CONCURRENCE - Methods and apparatus for data distribution and concurrence are provided. Data and a primary key of the data are obtained. A sub-key is assigned according to the primary key. The data are routed to a corresponding queue according to the sub-key. A writing operation is processed to the data in the queue. Because multiple sub-keys can exist under a same primary key and each sub-key can have one place for the writing operation, the writing operation can be processed simultaneously to the multiple sub-keys without affecting one another. The success rate of data writing concurrence can be improved and timeliness of the data can be ensured. | 05-14-2015 |
20150142816 | Managing Searches for Information Associated with a Message - A method for managing information about a product. A processor searches documents for a location of a message of the product using a set of rules that are based on instructions for generating the message. The instructions are in a resource of the product. The processor then adds the location to an index of locations of the message in the documents. | 05-21-2015 |
20150142817 | DENSE TREE VOLUME METADATA UPDATE LOGGING AND CHECKPOINTING - In one embodiment, a node of a cluster executing a storage input/output (I/O) stack having a volume layer, stores a multi-level dense tree metadata structure. Each level of the dense tree metadata structure includes volume metadata entries for storing volume metadata. One or more non-volatile logs (NVLogs) are updated. The one or more NVLogs including a volume layer log configured to record changes to the volume metadata, wherein volume metadata entries inserted into a top-level of the dense tree metadata structure are recorded in the volume layer log. The node writes volume metadata entries from the volume layer log to one or more storage devices to be stored as extents. | 05-21-2015 |
20150142818 | PAGED COLUMN DICTIONARY - A request for an entry of a dictionary having multiple dictionary blocks may be received, where the dictionary stores string values associated with corresponding value identifiers (IDs) for dictionary compression. One of the dictionary blocks may be selectively loaded into memory, and the dictionary block that has been loaded into memory may be searched. A value ID directory may be constructed in memory, where the value ID directory includes last value IDs for the dictionary blocks, and each of the last value IDs is mapped to an index of one of the dictionary blocks that includes a string value for that last value ID. A separator directory may also be constructed in memory, where the separator directory includes separators for the dictionary blocks, and each of the separators is mapped to an index of one of the dictionary blocks that includes a string value for that separator. | 05-21-2015 |
20150142819 | LARGE STRING ACCESS AND STORAGE - Techniques for handling large string values in a string dictionary within an in-memory database system are presented. The string dictionary can be a paged-column main dictionary. For example, a value block of a dictionary may be loaded into memory, where the dictionary stores values for dictionary compression. The value block may include a first portion of a large string value and one or more logical pointers to one or more large string pages containing a remainder of the large string value. The large string value may be materialized into a contiguous location in memory, and a pointer to the materialized large string value may be created. | 05-21-2015 |
20150142820 | Association strengths and value significances of ontological subjects of networks and compositions - The present invention discloses methods, systems, and tools for evaluating a number of value significance measures of ontological subjects of compositions or networks. The method breaks a composition into its constituent ontological subjects of different orders and builds a participation matrix indicating the participation of ontological subjects of the composition in other ontological subjects, i.e. the partitions, of the composition. Using the participation information of the OSs into each other, an association strength matrix is built from which the value significance measures of the partitions of the composition are calculated. The methods systematically calculate the value significances of the ontological subjects of different orders of the composition. Various systems for implementing the methods and some exemplary applications and services are disclosed. | 05-21-2015 |
20150149472 | FOR ALL ENTRIES PROCESSING - A system includes determination of a value identifier for each value of a first table, based on a data dictionary of a second table, generation of a hash function, the hash function mapping, for each row of the first table, one or more value identifiers of one or more columns of the row to a pointer to the row, application of the hash function to one or more value identifiers of a row of a second table, determination of whether the output of the hash function is a valid hash, and, if it is determined that the output of the hash function is a valid hash, determination that the row of the second table is associated with the first table. | 05-28-2015 |
20150149473 | SYSTEMS AND METHODS FOR ASSOCIATING TAGS WITH MEDIA ASSETS BASED ON VERBAL INPUT - Systems and methods for tagging a media asset are provided. A plurality of communications is received from a plurality of users. Each of the plurality of communications includes words spoken by a respective one of the users while accessing the media asset and is associated with a media asset play position during which the respective words were spoken. A subset of the plurality of communications is selected for which the associated media asset play position is within a range of play positions, the range of play positions being shorter than the duration of the media asset. A word that a threshold number of the selected communications have in common is identified. The word is cross-referenced with an attribute database to retrieve an attribute associated with the word. The retrieved attribute is assigned to the media asset within the range of play positions. | 05-28-2015 |
20150149474 | METHOD AND APPARATUS FOR INDEXING AND SEARCHING DOCUMENTS - Methods and apparatus for indexing and searching documents are disclosed. For example, a user of an electronic record management system may deploy a background indexing processes using a certain amount of parallel computing resources that may take several hours to complete. Subsequently, the user may change the number of computing resources dedicated to the indexing process without interrupting the indexing process. Upon completion, the indexing process creates a plurality of subindexes. The user may then deploy a background searching processes using a selected amount of parallel computing resources that may take several hours to complete. Subsequently, the user may change the number of computing resources dedicated to the searching process without interrupting the searching process. Upon completion, the searching process creates a plurality of partial search results that are combined into a final search result. | 05-28-2015 |
20150149475 | DATA HARMONIC ANALYSIS METHOD AND DATA ANALYSIS DEVICE - The present invention provides a data harmonic analysis method and a data analysis device for data analysis in which a plurality of data items to be analyzed are acquired; similarities among a plurality of data sources that generate the data values of the acquired plurality of data items are obtained; a hierarchical graph is generated as a graph structure indicating the acquired plurality of data items, with a plurality of child nodes corresponding to the plurality of data items being located in a lower layer and a parent node that has no data item being located in an upper layer; the connection rate between the parent node and each of the plurality of child nodes is calculated by using the information of the obtained similarities in the generated hierarchical graph; and harmonic analysis is applied, on the basis of the generated hierarchical graph, to the data values in the graph. | 05-28-2015 |
20150149476 | DATA STORAGE PLANNING - A planning device and method of data storage are disclosed. The planning device includes a format-processing sub-logic and a space-allocation sub-logic. The format-processing sub-logic is for creating a formatting information area, for creating an index area for storing index data, and for creating a data area on a designated physical storage disk, and wherein the formatting information defines the index area and the data area. The space-allocation sub-logic is for allocating at least one data block to an upper-layer application in accordance with a size of a document needed to be stored. By adopting the formatting process, users may create the index area and data area on different physical storage disks so as to separate the index area and the data area. In this way, the users may store the information regarding the index area on different storage disks. | 05-28-2015 |
20150149477 | PERSONALIZED DATA SEARCH SYSTEM AND METHOD THEREOF - Disclosed are systems and methods for conducting a personalized data search including selecting, by a cloud server, one of a plurality of data servers that is allocated to a desired user, the selecting being in response to a search request that is received via a cloud service platform from the desired user; conducting a search, by the cloud server, using the selected data server; and providing, by the cloud server, a search result based on the conducted search to the desired user via the cloud service platform. | 05-28-2015 |
20150149478 | Unified Server for Managing a Heterogeneous Mix of Devices - A system and method for integrating one or more resource servers for one or more types of devices are described using an enterprise service bus and network appliance as a service (NAaaS) application services. The enterprise service bus receives a request for one or more types of media, determines a first communication protocol of the request and one or more types of second communication protocols of the one or more resource servers, translates the request from a first communication protocol into one or more types of second communication protocols for one or more resource servers and retrieves one or more results matching the translated request from the one or more resource servers. A NAaaS application services determine one or more resource servers that the request is for based on the request matching a global index. | 05-28-2015 |
20150293937 | SYSTEM AND METHOD FOR DATA STORAGE - In some examples, a system configured to provide data storage is described. The system may include a write server configured to receive data from a first client for storing on the system and computer storage media configured to receive and to store the data. The system may also include a read server configured to receive the data from the write server and to store the data in the read server for a duration. The read server may be further configured to provide the data from the read server to a second client during the duration and, after the duration, to access the data from the computer storage media and to provide the data accessed from the computer storage media to the second client. | 10-15-2015 |
20150293940 | IMAGE TAGGING METHOD AND APPARATUS THEREOF - An image tagging method of an electronic device according to certain embodiments of the present disclosure includes transmitting a selected image to a server, receiving, from the server, tag information extracted on the basis of the image, generating information on a tag area on the image on the basis of the tag information, comparing the tag information with a pre-stored tag database to extract connection information matched to the tag information, and tagging the extracted connection information to the tag area. | 10-15-2015 |
20150293955 | DISTRIBUTED PROCESSING OF NETWORK DATA USING REMOTE CAPTURE AGENTS - The disclosed embodiments provide a method and system for processing network data. During operation, the system obtains one or more event streams from one or more remote capture agents over one or more networks, wherein the one or more event streams include event data generated from network packets captured by the one or more remote capture agents. Next, the system applies one or more transformations to the one or more event streams to obtain transformed event data from the event data. The system then enables querying of the transformed event data. | 10-15-2015 |
20150293960 | REAL-TIME INDEX CONSISTENCY CHECK - A real-time index system is provided for maintaining the consistency of an index of a database. The real-time index system can receive data associated with an inconsistency detected in the index, compute an index key value from the received data, and obtain the index record and data record associated with the computed index key value. The real-time index system can further compare the data in the data record with the corresponding data in the index record to identify a mismatch in data between the data in the index record and the data in the data record. The real-time index system can maintain the consistency of the index by modifying a portion of data in the index record to be consistent with a corresponding portion of data in the data record. | 10-15-2015 |
20150302006 | ADVANCED SEARCH FOR MEDIA CONTENT - Methods and systems for indexing and efficiently retrieving media content in a database use subtitle data of media content items, including subtitle text and timestamps, for indexing the items. A media server coupled to a communication network identifies keywords in the subtitle data, and stores the media content items in association with metadata including the keywords. When a search request for media content is received, a search for media content includes searching the metadata of the stored media content to identify media content items having subtitles matching the search request. In one example, keywords are translated into multiple languages to enable searching of the metadata in multiple languages. In another example, timestamp information included in the subtitle data is also included in the metadata so as to enable a search to return a time point within a media content item at which a keyword matches the search request. | 10-22-2015 |
20150302035 | PARTIAL INDEXES FOR PARTITIONED TABLES - Techniques for creating and using partial indexes are provided. A partial index is an index that indexes one or more partitions of a partitioned table and does not index one or more other partitions of the partitioned table. Thus, if a partition of a partitioned table is rarely used, then an index may index all other partitions of the partitioned table or at least only those partitions that are written to or read from relatively frequently. Also, in preparation for loading data into a partition, indexing for the partition may be “turned off”, effectively making a full index a partial index. While the data is loaded into the partition, the partial index is still available for query processing. After the data is loaded into the partition, indexing for the partition is “turned on”, which causes the data in the partition to be indexed by the partial index. | 10-22-2015 |
20150302427 | DATA PROCESSING METHOD FOR SITUATIONAL ANALYSIS - The present invention relates to a method for processing data by a server ( | 10-22-2015 |
20150309998 | METHOD AND APPARATUS FOR PLAYING A MP4 FILE CONTAINER WHILE GENERATING SUCH A FILE - A method for enabling playback of an MP4 file wrapper/container during generation of the same includes the creation of an auxiliary index file at initiation of the MP4 file generation and the updating of the auxiliary index file simultaneously with the MP4 file being generated. The auxiliary index file provides the instantaneous playback of a partially generated MP4 file container up to a last data block written to the MP4 file wrapper. Furthermore, by fixing the size of the index records that make up the auxiliary index file, a device can seek to any index record within the auxiliary index file at constant time and read the attributes of a frame such as data block location and frame offset. | 10-29-2015 |
20150310036 | MAPPING A LOCATION TO AN AREA - An apparatus receives coordinates of a location and selects an object of a first layer of a tree structure based on the coordinates. The apparatus determines whether data assigned to the selected object comprises an identifier of an area or a pointer to a set of objects of a respective next layer of the tree structure. If the data comprises a pointer to a set of objects, the apparatus selects one of the objects from the set of objects based on the received coordinates and continues with determining whether data assigned to the selected object comprises an identifier or a pointer. If the data comprises an identifier of an area, the apparatus reads the data comprising the identifier and provides the identifier as a result of a mapping of a location to an identifier of an area. A computer readable storage medium stores a database having a corresponding tree structure. | 10-29-2015 |
20150310045 | MANAGING AN INDEX OF A TABLE OF A DATABASE - A method, a system, and a computer program product for managing a database are disclosed. Managing the database includes managing an index of a table of the database. Managing the index includes determining the table includes a first field having a first input value configured to identify a first dynamic data function utilized to determine a first output value for the first field. In embodiments, managing the index includes determining the first dynamic data function is deterministic, determining the first output value for the first field using the first dynamic data function, and storing the first output value for the first field. In embodiments, managing the index includes determining the first dynamic data function is nondeterministic, determining a first special value to represent the first output value for the first field, and storing the first special value for the first field. | 10-29-2015 |
20150310048 | METHOD AND/OR SYSTEM FOR TRANSFORMING BETWEEN TREES AND ARRAYS - Embodiments of articles, apparatuses, devices and/or systems for manipulating hierarchical sets of data are disclosed. | 10-29-2015 |
20150310049 | MANAGING AN INDEX OF A TABLE OF A DATABASE - A method, a system, and a computer program product for managing a database are disclosed. Managing the database includes managing an index of a table of the database. Managing the index includes determining the table includes a first field having a first input value configured to identify a first dynamic data function utilized to determine a first output value for the first field. In embodiments, managing the index includes determining the first dynamic data function is deterministic, determining the first output value for the first field using the first dynamic data function, and storing the first output value for the first field. In embodiments, managing the index includes determining the first dynamic data function is nondeterministic, determining a first special value to represent the first output value for the first field, and storing the first special value for the first field. | 10-29-2015 |
20150310052 | MANAGING A TABLE OF A DATABASE - A method, a system, and a computer program product for managing a database are disclosed. Managing the database includes managing a table of the database, the table having a set of fields including a first field in a first row of the table. Managing the table includes selecting, in the first field, a first input value configured to identify a first dynamic data function utilized to determine a first output value for the first field. Managing the table includes determining the first output value for the first field using the first dynamic data function. Managing the table includes returning (for a read request) the first output value for the first field. | 10-29-2015 |
20150310072 | DEVICES, SYSTEMS, AND METHODS FOR CONTEXT MANAGEMENT - Systems, devices, and methods for contextual management extract context information and content information from a collection of stored digital content items; generate an index of the extracted context information and content information; store the indexed context information and content information; receive a query from a computing device, wherein the query is associated with a user; determine a context of the user; determine a context of a user activity; determine contents of the query; generate a recommendation in response to the query based on the context of the user, the context of the user activity, and the contents of the query; and return the recommendation to the computing device. | 10-29-2015 |
20150317343 | FACILITATING CROSS-PLATFORM CONTENT ACCESS - A messaging system includes features of gathering information regarding content accessed across multiple applications and/or devices and making that information available to account holders of the messaging system. The messaging system stores data related to accessed content in a plurality of indices and makes that data available to enable account holders to access previously-accessed content, even if the previous access occurred in a different software application, on a different device, or on a device with a different operating system. Account holders may also be provided with recommendations based on gathered information. | 11-05-2015 |
20150317345 | MULTIPLE FIELDS PARALLEL QUERY METHOD AND CORRESPONDING STORAGE ORGANIZATION - It is provided a method, comprising associating value ranges to each of a predefined number of fields, wherein the value ranges for each of the fields are continuous; associating, for each of the fields, bijectively rowkey field values to the value ranges of the respective field, wherein the rowkey field values for each of the fields are continuous; generating rowkeys, wherein each rowkey comprises one of the rowkey field values for each of the fields, and wherein a rowkey is generated for each of the corresponding combinations of the rowkey field values; wherein the associating of the rowkey field values is further adapted to associate the rowkey field values such that for each of the fields and for each of the rowkeys: a first rowkey field value for the respective field of the respective rowkey is neighbored to a second rowkey field value for the respective field of a second rowkey of the rowkeys, and a first value range of the respective field of the respective rowkey is continuous with a second value range of the respective field of the second rowkey, wherein the rowkey field values of the respective field comprise the first and second rowkey field values, and the value ranges of the respective field comprise the first and second value ranges. | 11-05-2015 |
20150317384 | Methods and Systems for Graph Database Processing of Complex Entity Relationships - The present invention relates to systems and methods of graph database processing and displaying complex relationships in a computer implemented program, for ease of review by a user. | 11-05-2015 |
20150324407 | METHOD FOR INDEXING GEOGRAPHICAL DATA - A method for indexing geographical data by defining at least one input point allowing access to all or part of the geographical data stored in a storage place accessible from the input point. The input point is in the form of a root path, a uniform resource locator or a connection chain in accordance with the storage mode of the geographical data. Information layers containing at least one piece of geographical data in the storage place defined by the input point is detected. A reference index is constructed by adding into a table, for each detected information layer, a piece of metadata characterizing the detected information layer. | 11-12-2015 |
20150324480 | LOCK-FREE PARALLEL DICTIONARY ENCODING - A system includes reception of a value, determination of whether the value is associated with a respective value identifier in a dictionary index associating each of a plurality of values with a respective value identifier, and in response to a determination that the value is not associated with a respective value identifier in the dictionary index: reservation of a slot of a reservation array comprising a plurality of slots, writing of the value into the reserved slot, insertion of a reserved value identifier of the reserved slot and a version counter of the reserved slot into a position of the dictionary index corresponding to the value, insertion of the value into a position of a dictionary vector storing a respective value in each of a plurality of vector positions, insertion of a first value identifier corresponding to the position of the dictionary vector into the position of the dictionary index corresponding to the value, and returning of the first value identifier. | 11-12-2015 |
20150324483 | IDENTIFYING A SUBSET OF NETWORK RELATIONSHIPS BASED ON DATA RECEIVED FROM EXTERNAL DATA SOURCES - Techniques are provided for determining which entities, in a population of entities, is most like a given entity. In the context of companies, the techniques involve constructing a peer network graph based on company-to-company relationship data. Once the graph is constructed, the weights of the edges are determined, and values for the paths are determined based on the edge weights. Peer connection scores are generated for a particular company based on the number and values of the paths between the node that represents the company and the nodes that represent other companies. Based on the peer connection scores between the particular company and other companies, a subset of the other companies are selected as members of a peer group for the company. | 11-12-2015 |
20150339295 | METHODS AND SYSTEMS FOR STORAGE OF MEDIA ITEM METADATA - The various embodiments described herein include methods and systems for storage of media item metadata. In one aspect, a method is performed at a server system with one or more processors and memory. The method includes receiving, from a client device, metadata corresponding to a modified media item, where the modified media item is a modified version of a media item corresponding to a particular node in a family tree within a database of media items. The method further includes, in response to receiving the metadata corresponding to the modified media item, appending, to the family tree, a new leaf node that is linked to the particular node, where the new leaf node corresponds to the modified media item. | 11-26-2015 |
20150339308 | MANAGING SITE-BASED SEARCH CONFIGURATION DATA - Techniques are described for managing data within a multi-site clustered data intake and query system. A data intake and query system as described herein generally refers to a system for collecting, retrieving, and analyzing data. In this context, a clustered data intake and query system generally refers to a system environment that is configured to provide data redundancy and other features that improve the availability of data stored by the system. For example, a clustered data intake and query system may be configured to store multiple copies of data stored by the system across multiple components such that recovery from a failure of one or more of the components is possible by using copies of the data stored elsewhere in the cluster. | 11-26-2015 |
20150347476 | CONTENT MANAGEMENT IN A TRAVEL MANAGEMENT SYSTEM - Methods, apparatus, and computer program products for managing content in a travel management system. A standard data element and a non-standard data element comprising the content are received from one or more content providers. A first data container for the standard data element and a second data container for the non-standard data element are created in an extended record data structure. The first data container includes a common record identifier and first data values for first attributes corresponding to the standard data element. The second data container includes the common record identifier and second data values corresponding to second attributes for the non-standard data element. The travel management system manages access to the first container and the second container in the extended record data structure based on the common record identifier. | 12-03-2015 |
20150347520 | USING CROWDSOURCING CONSENSUS TO DETERMINE NUTRITIONAL CONTENT OF FOODS DEPICTED IN AN IMAGE - A method, system, and computer program product for wellness maintenance programs configured for user-specific tracking of food consumption to goals. Upon receiving a digital image of food or beverage items, the digital image is transmitted to a repository configured to serve a plurality of accesses by a plurality of human members (e.g., a crowdsource repository). Members of the crowd generate food description annotations pertaining to aspects of the pictured food or beverage items. The food description annotations (e.g., menu picks, text descriptions) are used to look-up nutrition records. Some use cases correlate a set of food description annotations that are shared between two or more of the human members to generate a confidence score, and some use cases also receive food intake recommendations from the human members. The food intake recommendations can be recorded in a wellness profile which in turn can be used for progress tracking against nutrition goals. | 12-03-2015 |
20150347691 | SYSTEMS AND METHODS FOR EVENT STREAM PLATFORMS WHICH ENABLE APPLICATIONS - Systems and methods to generate a final event stream are provided. The system collects information from a wide variety of sources, and then parses, normalizes, and indexes the information. This generates an initial event stream that can be tagged and then iteratively processed to generate a final event stream. The processing includes first order logic querying and knowledge extraction to infer additional events which is added to the event stream. The final event stream is used by a knowledge exchange for consumption by applications. These applications may be internal applications and/or third party applications. This system may be particularly useful in use with medical information, or any other big data enterprise system. | 12-03-2015 |
20150356129 | INDEX GENERATING DEVICE AND METHOD, AND SEARCH DEVICE AND SEARCH METHOD - An index generating device ( | 12-10-2015 |
20150356130 | DATABASE MANAGEMENT SYSTEM - A method of operating a database system access to plurality of heterogeneous databases, each of said databases having a structure based on a respective database model and storing a plurality of data entities having attributes and occurrences within the structure; providing an interface database, the interface database populated with a plurality of unique, multi-character expressions associated with the data entities of plurality of databases, whereby the plurality of unique, multi-character expressions are defined by assigning to every entity, every attribute and every entity occurrence a unique, multi-character expression, the expression having a predetermined hierarchical structure which defines the relationship between each entity, attribute and entity occurrence with every other entity, attribute and entity occurrence in the interface database and storing said expressions in an expression set table linking each element of each expression to a hierarchical level and a position in a data model. | 12-10-2015 |
20150356144 | Cognitive Media Content - A method, system and computer-usable medium for providing composite cognitive insights comprising receiving streams of data from a plurality of data sources; processing the streams of data from the plurality of data sources, the processing the streams of data from the plurality of data sources performing data enriching and generating a sub-graph for incorporation into a cognitive graph; processing the cognitive graph, the processing the cognitive graph providing a plurality of individual cognitive insights; generating a composite cognitive insight, the composite cognitive insight being composed of the plurality of individual cognitive insights; and, providing the composite cognitive insight to a user via a set of cognitive media content. | 12-10-2015 |
20150356168 | Method for Refining Cognitive Insights Using Travel-Related Cognitive Graph Vectors - A method, system and computer-usable medium are disclosed for using travel-related cognitive graph vectors. | 12-10-2015 |
20150356201 | System for Refining Cognitive Insights Using Travel-Related Cognitive Graph Vectors - A method, system and computer-usable medium are disclosed for using travel-related cognitive graph vectors. | 12-10-2015 |
20150363404 | MINIMIZING INDEX MAINTENANCE COSTS FOR DATABASE STORAGE REGIONS USING HYBRID ZONE MAPS AND INDICES - Elements of a database object are stored among a plurality of different storage regions, each storage region being associated with a first value range indicating upper and lower range values for element values within that storage region. An index is generated and maintained for one or more second value ranges that include range values that comprise subsets of the first value ranges. A query is processed that includes a query value and one or more storage regions to be read is determined by using one of the index and the first value ranges based on the query value relative to a second value range. | 12-17-2015 |
20150363415 | GENERATING AND ACCESSING A DATA TABLE - An index shared between a plurality of tables is used to access a data sharing space (DSS) storing a set of rows from a base table having a plurality of rows with stored data. First data values of the index are identified in response to a query for a table. A subset of rows is then retrieved from the DSS using the index elements corresponding to the first data values. Second data values and some of the first data values are then identified in response to a second query for a table. A second subset of rows are then retrieved from the DSS using the index elements corresponding to the second data values and some of the first data values and the query is then applied to the subset of rows. | 12-17-2015 |
20150363439 | DIVIDING INCOMING DATA INTO MULTIPLE DATA STREAMS AND TRANSFORMING THE DATA FOR STORAGE IN A LOGICAL DATA OBJECT - Various embodiments for storing a logical object are provided. In one such embodiment, by way of example only, incoming data is divided corresponding to a logical data object into a plurality of independent streams, associating each data chunk of a plurality of obtained data chunks with a corresponding stream among the plurality of independent streams. At least one of the obtained data chunks and derivatives thereof is sequentially accommodated in accordance with an order the obtained chunks are received, while keeping the association with the corresponding streams. A global index is generated as a single meta-data stream accommodated in the logical data object and comprising information common to the plurality of independent streams and related to mapping between data in the logical data object and the obtained data chunks. | 12-17-2015 |
20150363469 | HIGH PERFORMANCE REAL-TIME RELATIONAL DATABASE SYSTEM AND METHODS FOR USING SAME - A database system supporting persistent queries, comprising a persistent query service that receives connections and requests from client software applications; and a plurality of network-attached data sources; and a method for providing persistent queries using a persistent query service. | 12-17-2015 |
20150363483 | POPULATING TEXT INDEXES - Provided are techniques for populating a new text index. In response to determining that a limit for indexing a set of documents to the new text index has been reached, a commit is performed, a restart key is updated to identify a next document to be indexed, and the next document is indexed in a next commit cycle. | 12-17-2015 |
20150370795 | METHOD AND APPARATUS FOR STRESS MANAGEMENT IN A SEARCHABLE DATA SERVICE - Method and apparatus for stress management in a searchable data service. The searchable data service may provide a searchable index to a backend data store, and an interface to build and query the searchable index, that enables client applications to search for and retrieve locators for stored entities in the backend data store. Embodiments of the searchable data service may implement a distributed stress management mechanism that may provide functionality including, but not limited to, the automated monitoring of critical resources, analysis of resource usage, and decisions on and performance of actions to keep resource usage within comfort zones. In one embodiment, in response to usage of a particular resource being detected as out of the comfort zone on a node, an action may be performed to transfer at least part of the resource usage for the local resource to another node that provides a similar resource. | 12-24-2015 |
20150370839 | BUILT-IN SEARCH INDEXING FOR NAS SYSTEMS - Methods, computing systems and computer program products implement embodiments of the present invention that include storing multiple files to a file-level storage system that includes one or more storage devices, and maintaining, by the file-level storage system, one or more searchable indexes for the multiple files, each of the indexes indexing a respective key field. In some embodiments the file-level storage system may be configured as a network attached storage system. Upon receiving, via a network, a search query from a computer, the file-level storage system can identify, using the one or more searchable indexes, one or more of the multiple files that match the search query, and convey, to the computer, the respective name and the respective location of each of the one or more identified files. In some embodiments, the file-level storage system includes a web server exposing a representational state transfer application programming interface. | 12-24-2015 |
20150370841 | METHOD AND APPARATUS FOR GUARANTEEING AND OPTIMIZING DATA EXCHANGE IN MOBILE M2M COMMUNICATION WITH RESTRICTED CONNECTIVITY - A method for transmitting data from a mobile unit to a backend system includes, when connectivity is present between the mobile unit and the backend system: receiving, by the mobile unit, a master index from the backend system, deleting, by the mobile unit, data of the local database which are contained in the master database based on a comparison of a local index with the master index and updating the local index, transmitting, by the mobile unit, data stored in the local database which are not yet contained in the master database, receiving and storing a new master index from the backend system, and deleting the transmitted data of the local database based on a comparison of the updated local index with the new master index and further updating the local index. | 12-24-2015 |
20150379055 | GROUP MEMBER MANAGEMENT SUPPORT SYSTEM AND METHOD - A group member management support system and method are disclosed for effectively supporting group member management such as analysis of psychological states of students in a school and figuring out friend relations of students accordingly. The group member management support system includes a database storing self-expression index representing psychology or feeling of each member of a group, an interface server receiving the self-expression index through a network from the member to store the self-expression index in the database, and a member's terminal receiving the self-expression index to transmit the self-expression index to the interface server through the network; and a manager's terminal requesting and receiving the self-expression index from the interface server through the network to display or output the self-expression index. | 12-31-2015 |
20150379127 | FUZZY SUBSTRING SEARCH - The present disclosure describes methods, systems, and computer program products for performing fuzzy substring search. One computer-implemented method includes determining, by operation of a hardware processor, one or more tokens of an input string received from a client application, identifying, from a database, a reference string based on the one or more tokens of the input string, determining a plurality of character index values of the reference string, each character index value indicating a position of a character in the reference string, determining one or more hop index values of the input string, each hop index value indicating a position of a separator in the input string, and calculating a similarity score of the reference string relative to the input string based on the plurality of character index values of the reference string and the one or more hop index values of the input string. | 12-31-2015 |
20160004732 | EXTENDED MEMORY SYSTEM - Described herein are technologies that are configured to assist a user in recollection information about people, places, and things. Computer-readable data is captured, and contextual data that temporally corresponds to the computer-readable data is also captured. In a database, the computer-readable data is indexed by the contextual data. Thus, when a query is received that references the contextual data, the computer-readable data is retrieved. | 01-07-2016 |
20160004737 | RESOLVING IN-MEMORY FOREIGN KEYS IN TRANSMITTED DATA PACKETS FROM SINGLE-PARENT HIERARCHIES - A web service interface may receive a web service request that describes multiple objects. Each of the multiple objects may be associated with a common object, such as in a parent-child relationship. The web service can identify the common object and the multiple relationships between objects, and instead of creating duplicate versions of the common object, the web service can create a single record representing the common object in, for example, a database. Records for each of the multiple objects referring to the common object can reference the common object record in the database using, for example, foreign keys. | 01-07-2016 |
20160004773 | SYSTEMS AND METHODS FOR ORGANIZING AND ANALYZING AUDIO CONTENT DERIVED FROM MEDIA FILES - Systems for receiving, analyzing, and organizing audio content contained within a plurality of media files are disclosed. The systems generally include a server that is configured to receive, index, and store a plurality of media files, which are received by the server from a plurality of sources, within at least one database in communication with the server. In addition, the server is configured to make one or more of the media files accessible to and searchable by, one or more persons other than the original sources of such media files. Still further, the server may be configured to organize audio content included within each of the plurality of media files into bipartite graphs; segment media files into parts that exhibit similar attributes; extract and present meta data to a user that pertain to each media file; and employ multi-variable ranking methods to prioritize media file search results. | 01-07-2016 |
20160004928 | TEXTUAL REPRESENTATION OF AN IMAGE - At least a computer-implemented method and an apparatus for processing an image are described. In examples, numeric values for at least one property of the image are determined. These values are then converted into at least one corresponding text character, said conversion being independent of any text content within the image. This enables a text representation of the image to be generated that contains said plurality of text characters. This text representation may be used to index and search for the image. | 01-07-2016 |
20160012073 | METHOD OF MANAGING, WRITING, AND READING FILE ON TAPE | 01-14-2016 |
20160012086 | LINEAR RUN LENGTH ENCODING: COMPRESSING THE INDEX VECTOR | 01-14-2016 |
20160012087 | DYNAMIC UPDATE OF CORPUS INDICES FOR QUESTION ANSWERING SYSTEM | 01-14-2016 |
20160012133 | METHOD AND APPARATUS FOR SETTING INFLUENCE INDEX OF USER IN NETWORK SERVICE | 01-14-2016 |
20160026663 | DISTRIBUTING AND PROCESSING STREAMS OVER ONE OR MORE NETWORKS FOR ON-THE-FLY SCHEMA EVOLUTION - In an embodiment, a method comprises receiving a first record comprising one or more first fields from a first source computer; in response to determining that the first record is a first type and a first node is associated with the first type, sending the first record to the first node to be processed; receiving a second record comprising the one or more first fields and one or more second fields from a second source computer; in response to determining that the second record is a second type and a second node is associated with the second type, sending the second record to the second node; in response to determining that the second type is a first subtype of the first type, sending the second record to the first node to be processed, without suspending the first node. | 01-28-2016 |
20160026665 | GENERATION OF SEARCH INDEX - Technologies are generally described for generating search indices. In some examples, a method performed under control of a storage device including a first memory and a second memory may include generating a first search index that corresponds to a first data stored in the first memory, based on analysis of the first data; and generating a second search index that corresponds to a second data stored in the second memory in parallel with the generating of the first search index, based on analysis of the second data. | 01-28-2016 |
20160034497 | INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING METHOD, AND INFORMATION PROCESSING APPARATUS - An information processing system includes an information processing apparatus and one or more terminal devices. The information processing system includes: an image transmitting unit configured to transmit an image; an image acquiring unit configured to acquire the image transmitted by the image transmitting unit, and assign unique identification information to the acquired image; a feature information extracting unit configured to extract feature information from the image acquired by the image acquiring unit; a first image storage unit configured to store therein the feature information and the identification information in an associated manner; and a managing unit configured to, when an amount of the feature information stored in the first image storage unit exceeds a threshold, change the first image storage unit into a second image storage unit, generate a new first image storage unit, and prohibit the second image storage unit from storing therein new feature information. | 02-04-2016 |
20160041989 | INFORMATION SYSTEM AND METHOD FOR DEVELOPING AN INFORMATION SYSTEM - An information system, for a motor vehicle workshop, has a user input unit for a use to input at least one feature; a database contains data sets and at least one link table having fields for linking the data sets, each data set including at least one feature and at least one first allocation list having at least one allocation list element linking the data set with at least one field in the at least one link table; a selection unit to select from the database at least one data set having at least one of the features inputted as the first data set; and an allocation unit to evaluate the first allocation lists of the data sets and to select as second data sets those data sets whose first allocation list contains at least one allocation list element linking the respective data set with a field in the link table that is linked with the first data set by at least one allocation list element in the first allocation list of the first data set. | 02-11-2016 |
20160042020 | Using Mote-Associated Indexes - Systems and related methods utilizing one or more mote-related content indexes. | 02-11-2016 |
20160042022 | DATA COORDINATION SUPPORT APPARATUS AND DATA COORDINATION SUPPORT METHOD - A data coordination support apparatus is provided to efficiently detect between which table and a table being a coordination destination there is a reference relation, in data coordination. The data coordination support apparatus identifies, with respect to each table other than the table being the coordination destination, a first column designated as the primary key and a second column not designated as the primary key, from a plurality of columns included in a table concerned. The data coordination support apparatus extracts such a table that a third column similar to the first column is included in the table being the coordination destination, and a fourth column similar to the second column is included the table being the coordination destination, from tables other than the table being the coordination destination. The data coordination support apparatus outputs recommendation information that recommends using a combination of the table being the coordination destination and the table extracted, in the data coordination. | 02-11-2016 |
20160042091 | System And Method Of Forming An Index - The invention discloses a computer implemented method of forming an index that comprises of a set of variables that are formed by filtering a first set of variables on condition set to form a second set of variables, wherein a resource is proportionately allocated to each of the variables of the second set of variables in proportion to the weights assigned, and analyzing the performance of each of the variables in the second set of variables after an exit time interval and rebalancing the second set of variables, wherein the components of the second set of variables are rebalanced with the components of the first set of variables based on exit conditions. | 02-11-2016 |
20160048571 | APPARATUS AND METHODS FOR RELATING INTRA-ORGANIZATION OBJECTS - The present disclosure provides a professional, lite, and fast internal platform for an individual to share a professional profile, role and specialty, and work relationships inside a company for its own use. Furthermore, a user may be able to move beyond simple contact information and be enabled to find co-workers by role, team, or project. Users may be able to find co-workers by specialized focus, affinities, or personal characteristics such as similar hobbies or educational institutions. The platform provides both current and historical data to a company and users in an organized and efficient manner with minimal dedicated resources from the user or company. | 02-18-2016 |
20160048582 | DYNAMIC ALTERNATE KEYS FOR USE IN FILE SYSTEMS UTILIZING A KEYED INDEX - In one embodiment, a system includes at least one processor and logic integrated with and/or executable by the at least one processor, the logic configured to create a first base record in a keyed index of a database, the first base record including a first unique key and a first data record, wherein the first data record includes at least one sub key and at least one first value, each sub key being correlated with a different one of the at least one first value in a sub key/value pair, and create one or more alternate key records in the database, each of the alternate key records including one of the at least one sub key which is correlated with the first base record and the first unique key of the first base record. More embodiments of systems, methods, and computer program products for providing alternate keys are also presented. | 02-18-2016 |
20160055124 | SYSTEMS AND METHODS FOR LOW-RANK MATRIX APPROXIMATION - Systems and methods for generating a low-rank approximation of a matrix are provided. The various systems and methods may identify at least a first set of right singular vectors and a first set of singular values of a subset of the matrix, reduce the subset by an amount of energy of a selected data entry of the subset based on the first set of right singular vectors and the first set of singular values, incorporate a new data entry from the matrix into the subset, update the first set of right singular vectors and the first set of singular values of the subset based on the new data entry by a singular value decomposition (SVD) update, and generate the low-rank approximation of the matrix based on the updated first set of right singular vectors and the updated first set of singular values. | 02-25-2016 |
20160055188 | MULTI-USER SEARCH SYSTEM WITH METHODOLOGY FOR INSTANT INDEXING - A multi-user search system with methodology for instant indexing. In one embodiment, for example, a system for instant indexing includes a token store storing sets of tokens for current versions of documents. The system further includes a tokenizer server configured to tokenize new versions of the documents and to generate sets of tokens for the new versions of the documents, an instant indexer configured to determine tokens to use to index the documents based on identified differences between the sets of tokens for the new versions of the documents and the sets of tokens for the current versions of the documents, and to generate index mutations including the tokens to use to index the documents, an index mutation journal configured to store the generated index mutations in association with timestamps, and an index mutation server configured to provide, to index servers, from the index mutation journal, generated index mutations for the index servers that are associated with timestamps that are newer than specified timestamps. | 02-25-2016 |
20160055189 | Database Storage System based on Optical Disk Library, and Method Using Same - A database storage system based on Jukebox comprises a server, a jukebox and a magnetic disk storage. The database management system comprises a field structure generation module, a record index generation module, a storage space allocation module, a storage space status module, a record linkage module, a record split module; further comprises a database creation and input on optical disk module, a search-term storage module, a search-term index generation module. By means of the present invention, a database is directly created on a write-once optical disk or an optical disk of other types, data is input into the database on the optical disk; full-text search for the databases on the optical disks in the jukebox can be performed within seconds, an original file of a database file on an optical disk can be retrieved within seconds, solving effectively the technical problems of managing and accessing large-scale optical disks with databases. | 02-25-2016 |
20160063045 | Method and Apparatus for Road Risk Indices Generation - A system includes a processor configured to gather historical risk-affecting data with respect to a current road. The processor is also configured to gather current risk-affecting data with respect to the current road. Further, the processor is configured to generate a baseline risk index for the road based on the historical data. The processor is additionally configured to modify the baseline risk index based on the current data and provide a risk index value for the current road based on the modified baseline risk index | 03-03-2016 |
20160070727 | METHODS AND APPARATUS FOR BUILDING A SEARCH INDEX FOR A DATABASE - Methods and apparatus for building a search index for a database are disclosed. When an incremental build trigger is detected (e.g., a threshold number of documents are added to database), the system determines which sub-indexes need to be updated and which sub-indexes do not need to be updated. Rather than update the affected sub-indexes directly, the system builds new sub-indexes to replace the affected sub-indexes. Database queries that occur during the generation of the replacement sub-indexes use the old sub-indexes. When the new sub-indexes are ready, the system moves pointers from the old sub-indexes to the new sub-indexes so that subsequent database queries use the new sub-indexes. | 03-10-2016 |
20160070728 | CELL-BASED DATABASE MANAGEMENT SYSTEM - Provided are systems and methods for managing a database. In an example system, information stored in cells includes a row identifier, a column identifier, a value identifier, and a transaction identifier. The system includes a cell domain and one or more cell buffers. Both the cell domain and cell buffer include a plurality of cells, a row sequence ordering the row identifiers of cells, and a column sequence ordering the column identifiers. Additionally, the cell domain includes row and column identifier generators and transaction and value sequences. The system further includes a transfer module operable to transfer cells from a cell buffer to the cell domain, and a reader module operable to output cells, upon receiving a request buffer, from the cell domain to a response buffer. A set of constraints is applied when outputting cells to the response buffer based on the information stored in the request buffer. | 03-10-2016 |
20160070731 | ANALYTICS BASED ON SCALABLE HIERARCHICAL CATEGORIZATION OF WEB CONTENT - Various methods and systems for performing analytics based on hierarchical categorization of content are provided. Analytics can be performed using an index building workflow and a classification workflow. In the index building workflow, documents are received and analyzed to extract features from the documents. Hierarchical category paths can be identified for the features. The documents are indexed to support searching the documents for the hierarchical category paths. In the classification workflow, a query, that includes or references content, may be received and analyzed to extract features from the content. The features are executed against a search engine that returns search result documents associated with hierarchical category paths. The hierarchical category paths from the search result documents may be used to generate a topic model of the content associated with the query. The topic model, used for web analytics, includes scores for the hierarchical category paths and for enumerated category topics. | 03-10-2016 |
20160070749 | MULTI-TIERED TARGETED QUERYING - Technology is disclosed for a multi-tiered querying system to target queries to systems storing data relevant to the query. A multi-tiered targeted query system comprises at least three tiers: a web tier, an aggregator tier, and a shards tier. Servers at the web tier can be configured to service user data requests and pass them to servers at the aggregator tier. Servers at the aggregator tier can be configured to determine which selected shard servers have the requested information; formulate queries for the selected shard servers; send the queries to the selected shard servers; and aggregate results from the selected shard servers. Servers at the shard tier can be configured to store data, receive queries on that data, and return results for received queries. | 03-10-2016 |
20160070815 | SYSTEM AND METHOD FOR XML SUBDOCUMENT SELECTION - Methods for XML subdocument selection and corresponding systems and computer-readable mediums. A method includes receiving a document having fragments with attribute/value pairs and receiving logical expressions that define relationships between fragments of the document. The method includes analyzing the logical expressions according to the document and creating an index based on the analysis that includes names of the fragments to be candidates for selection into subdocuments. The method includes extracting, from the document, all fragments named in the index and creating, in the index, an entry for each attribute/value pair. The method includes creating a plurality of subdocuments corresponding to the document and storing the subdocuments, including the respective related fragments. | 03-10-2016 |
20160077800 | METHODS, SYSTEMS, AND COMPUTER READABLE MEDIA FOR PROCESSING DATA CONTAINING TYPE-LENGTH-VALUE (TLV) ELEMENTS - Methods, systems, and computer readable media for processing messages containing type-length-value (TLV) elements are disclosed. According to one exemplary method for processing a message containing TLV elements, the method includes receiving an ingress message containing a plurality of TLV elements. The method also includes decoding a TLV element from the plurality of TLV elements. The method further includes storing an entry containing information about the TLV element in a multi-view data structure implemented using a memory, wherein the multi-view data structure includes an associative container and at least one linked list data structure for ordering the entry with regard to other entries in the multi-view data structure. | 03-17-2016 |
20160078071 | LARGE SCALE OFFLINE RETRIEVAL OF MACHINE OPERATIONAL INFORMATION - A computer-implemented method of retrieving information stored on a computer readable storage medium and related to operation of a machine includes creating one or more lists of a plurality of data files stored on the computer readable storage medium and containing data derived from sensors measuring one or more machine operational characteristics. The method may further include creating one or more lists of a plurality of channels of information with signals indicative of the one or more machine operational characteristics recorded in each data file, creating an index of variables contained in one or more of the channels of information recorded in each data file, and searching for and locating data files that meet requestor-specified conditions by employing at least one of the lists of data files, at least one of the lists of channels of information, and the index of variables recorded in each data file to focus a search for relevant data files meeting the requestor-specified conditions. | 03-17-2016 |
20160078072 | TERM VARIANT DISCERNMENT SYSTEM AND METHOD THEREFOR - A term variant discernment system identifies terms in content and executes one or more discernment processes to determine a meaning for each term. An ID is assigned to each term based on its meaning, with terms and their variant terms being assigned a distinct ID when they have different meanings and with terms and their variant terms being assigned the same ID when they have the same meaning. The terms and variants can then be individually queried via a query even though the terms and their variants may have the same spelling, abbreviation, or other characteristics. | 03-17-2016 |
20160078073 | SYSTEM AND METHOD FOR CONTENT STORAGE - A system and method of content storage includes a content store for storing a plurality of first content blocks and one or more first indexes, a content manager including one or more first content analyzers, and an interface for receiving access requests for the plurality of first content blocks. Each of the one or more first indexes includes a plurality of first entries for organizing a plurality of second content blocks from the plurality of first content blocks according to a respective organizational structure. Each of the first entries includes a link to a corresponding content block from the plurality of second content blocks. Each of the plurality of content analyzers monitors the plurality of first content blocks for changes and updates a second index from the one or more first indexes based on information associated with the monitored changes. | 03-17-2016 |
20160085774 | CONTEXT BASED IMAGE SEARCH - A method comprising receiving an image, the image including associated contextual information; converting the received image into searchable image data, the searchable image data being descriptive of the received image; filtering information from a search database based on the contextual information associated with the received image to create a filtered information set; collecting a plurality of images from the filtered information set to create a seed data set; comparing the received image to the plurality of images from the seed data set using the searchable image data; and determining whether one of the plurality of images is related to the received image. | 03-24-2016 |
20160085782 | UPDATE METHOD AND UPDATING DEVICE - An update method executed by a computer includes: recording a history of processes executed on records in a database in accordance with a request; generating a second index, respectively, for a plurality of new records added to the database in accordance with another request, by applying a storage position to a first index of the plurality of records respectively; and reflecting, to the second index, at least one process from among the processes, the at least one process being executed after a start of the generating of the second index. | 03-24-2016 |
20160085790 | STRATIFIED SAMPLING OF LOG RECORDS FOR APPROXIMATE FULL-TEXT SEARCH - A log record from a host machine node includes an invariant string and a term. A template identifier is selected, from among template identifiers within a template repository, for a template string matching the invariant string. A sampling count threshold is selected from among a set of sampling count thresholds based on the template identifier and the term. A template-term count is obtained based on a number of earlier log records that were received since the count was reset and have a template identifier and a term that match the template identifier and the term of the log record. Based on the template-term count satisfying the sampling count threshold, an index entry is generated in a sampled log records index based on the log record and the template-term count is reset to a defined value. Based on the template-term count not satisfying the sampling count threshold, the template-term count is incremented. | 03-24-2016 |
20160085851 | GUIDED DATA EXPLORATION - A system for exploring data receives the data from a database and indexes the data in a server. The system displays one or more selectable datasets from the indexed data, where the selected dataset includes one or more attributes. The system then sorts the one or more attributes by level of interestingness and displays the sorted attributes. | 03-24-2016 |
20160085875 | Method for configuring browser bookmarks, device and terminal thereof - A setting method and device for a browser bookmark, and a terminal are described. The method includes: a first network address is acquired from a bookmark to be stored, and a first root website is extracted from the first network address; if a tree-like structure cluster has been locally established, the tree-like structure cluster is obtained, wherein the tree-like structure cluster includes a domain name tree of at least one root website, and corresponding bookmarks are stored on nodes in the domain name tree; a domain name tree of the first root website is modified or constructed according to whether there is the domain name tree of the first root website in the tree-like structure cluster or not, and the bookmark to be stored is stored on a corresponding node of the domain name tree of the first root website; and the domain name tree of the first root website and the first network address associated with the bookmark to be stored are stored. According to the disclosure, the problems of bookmark setting disorder and searching difficulty can be improved, and user experiences in bookmarks can be improved. | 03-24-2016 |
20160085880 | GUIDED DATA EXPLORATION - A system for exploring data receives the data from a database and indexes the data in a server. The system displays one or more selectable datasets from the indexed data, where the selectable datasets include a plurality of attributes. The system receives a selection of one of the plurality of attributes. The system then sorts the one or more attributes by level of interestingness relative to the selected attribute, and displays the sorted attributes. | 03-24-2016 |
20160092481 | INFORMATION INTEGRATION AND ASSESSMENT - Aspects of information integration and assessment are described. In one embodiment, one or more individuals are identified for information research. The identification may be made with reference to an attendee list, contact list, facial recognition, or user input among other ways. At least one source is identified to survey for information related to the individuals, and the source is surveyed for the information. In various embodiments, the source may comprise one or more social media or enterprise networks, services, or indexed search results, for example. The information is assembled into one or more integrated records, and various reports may be generated using the records. The reports and records may be ranked, displayed, compared, annotated, indexed and stored based on a surrounding context metric, for example. With reference to the reports and records, a user may be able to easily assemble and identify relevant information for reference in various contexts. | 03-31-2016 |
20160092584 | SELECTIVE INDEXING TO IMPROVE COMPLEX QUERYING OF ONLINE PROFESSIONAL NETWORK DATA - The disclosed embodiments provide a system for processing data. During operation, the system obtains a query containing two or more hops in a graph of online professional network data and one or more predicates containing one or more attributes of the online professional network data. The graph includes nodes representing members and companies, and a set of edges representing relationships between pairs of nodes. The query also includes two or more input nodes, including a first input member and/or an input company, that are used to form a path containing the two or more hops, along with an output that matches the one or more predicates and includes one or more output members. Next, the system executes the query to obtain a subset of the online professional network data matching the query. Finally, the system provides the subset of the online professional network data in response to the query. | 03-31-2016 |
20160098398 | Method For Preserving Conceptual Distance Within Unstructured Documents - A method, system and computer-usable medium are disclosed for preserving conceptual distance within unstructured documents by characterizing conceptual relationships. Natural language processing is applied to content in a plurality of documents to identify topics and subjects. Analytic analysis is then applied to the identified topics and subjects to identify concepts. The content in each of the plurality of documents is partitioned into a first structured hierarchy, preserving at least one structure in each document inherent in the each document. Access is then provided to the content through a first index based upon utilizing the first structured hierarchy and through a second index utilizing a second structured hierarchy. The conceptual relationship criteria are based upon a directed graph with weights based upon a similarity and a distance based upon concepts. | 04-07-2016 |
20160098415 | CONFIGURATION MANAGEMENT OF INDUSTRIAL DATA - Systems and techniques for storing, managing and/or displaying industrial data are presented. A controller component stores configuration data associated with a plurality of product types for a product generated based on at least one manufacturing process in a nested data structure. A data interface component manages data corresponding to the configuration data stored in the nested data structure for a data table associated with a spreadsheet application. | 04-07-2016 |
20160098433 | METHOD FOR FACET SEARCHING AND SEARCH SUGGESTIONS - Methods for faceted searching within clustered in-memory databases are disclosed. Faceted searching may be used to generate search suggestions. The faceted search engine may be able to use non-literal key algorithms for a partial prefix fuzzy matching and may include a feature disambiguation module. The disclosed search engine may be capable of processing large amounts of unstructured data in real time to generate search suggestions. | 04-07-2016 |
20160098435 | DISTRIBUTED INDEXING SYSTEM FOR DATA STORAGE - A distributed indexing system spreads out the load on an index of stored data in a data storage system. Rather than maintain a single index, the distributed indexing system maintains an index in each media agent of a federated data storage system and a master index that points to the index in each media agent. In some embodiments, the distributed indexing system includes an index server (or group of servers) that handles indexing requests and forwards the requests to the appropriate distributed systems. Thus, the distributed indexing system, among other things, increases the availability and fault tolerance of a data storage index. | 04-07-2016 |
20160098436 | CONTENT CREATION - A system for facilitating content creation includes collecting profiles which are analyzed to build a profile parameter index. A dummy profile is created based on the profile parameter index. The dummy profile is a fictitious character having profile parameters based on input from a user of the profile parameter index. The control of the dummy profile is under the user. | 04-07-2016 |
20160103863 | METHOD OF MANAGING RELATIONAL DATA IN A SINGLE MATRIX REPRESENTATION - A method, system, and computer program for managing relational data in a single matrix representation that is generically useful and application-independent. The method, system, and computer program provide for the structured use of single matrix representations for maintaining and managing normalized relational databases in various applications. | 04-14-2016 |
20160103864 | Structured Information Differentiation in Naming - Structured information differentiation in naming techniques are described. In one or more embodiments, first and second collections of structured information are tokenized to generate a first ordered list of tokens that correspond to the first collection of structured information and a second ordered list of tokens that correspond to the second collection of structured information. The first and second ordered list of tokens are inserted into a tree structure in order by assigning tokens from the first and second ordered lists that match in the order to a same node and assigning non-matching tokens to different nodes. The first and second collections of the structured information are tagged by traversing the tree structure such that both the first and second collections of the structured information are tagged using at least one said token that matches and the first collection of structured information is tagged using one or more other said tokens that are different than one or more other said tokens that are used to tag the second collection of structured information. | 04-14-2016 |
20160110391 | Composite Partition Functions - Composite partition function techniques are described herein that may be used to repartition a database without moving existing data. In one or more implementations, a database is partitioned in a first arrangement associated with a first partition function. The database is repartitioned to form a second arrangement associated with a second partition function. A record of key membership for data entries in the partitions existing at the time of repartitioning is created, which can be used to recognize data corresponding to the first arrangement. A composite partition function is defined for subsequent database operations that is configured to apply the first partition function for data having keys found in the record of key membership and apply the second partition function for other data having keys that are not found in the record of key membership. | 04-21-2016 |
20160110394 | Obituary Alerting System and Method of Use - A method of using an obituary alerting system, comprising: associating a plurality of users with one another in said Obituary Alerting System 100 based on a one or more associations; indexing a one or more obituary sources; and updating a one or more living users of the deal and related information for a one or more deceased users among said plurality of users based on said one or more obituary source and said one or more associations. | 04-21-2016 |
20160110396 | DATA PROCESSING APPARATUSES, METHODS, AND NON-TRANSITORY TANGIBLE MACHINE-READABLE MEDIUM THEREOF - Data processing apparatuses, methods, and non-transitory tangible machine-readable medium thereof are provided. The data processing method accesses a dimension table. The dimension table is defined with a plurality of attributes and includes at least one member. Each of the at least one member includes a plurality of attribute values corresponding to the attributes. The data processing method generates a smart index for each of the distinct attribute values. Each of the smart indexes includes a first value equivalent to one of the attribute values, a second value equivalent to the attribute corresponding to the first value, and a third value indicating a rank of the first value comparing to the rest attribute values corresponding to the second value, the first values are distinct. The data processing method integrates the smart indexes into a smart index record, wherein each of the smart indexes has an index within the smart index record. | 04-21-2016 |
20160110476 | RELATIONSHIP GRAPH INTERLINKAGE SYSTEM - A relationship graph interlinkage system includes a plurality of relationship graph databases | 04-21-2016 |
20160125001 | AUTOMATIC SCREEN GENERATION DEVICE, AUTOMATIC SCREEN GENERATION PROGRAM, AND AUTOMATIC SCREEN GENERATION METHOD - According to a conventional method, there is only one kind of a database access screen pattern (single sheet format), the screen layout cannot be changed from one user to another, a portion that produces an output to a database needs to make an external program call, and, there is no user-by-user utilization right which is generally required for a database access screen, not is there a function provided for transitioning from database access screen A to database access screen B involving value transfer. The present invention is provided with two kinds of screen patterns of a single sheet format and a list format, eliminates the need for a programming language and a database operation language, and is further provided with user-by-user utilization right and a function for transitioning from data access screen A to data access screen involving value transfer on the basis of a setting made by a setting person without requiring programming. Further, in accordance with a setting content by the setting person is stored, and a database access screen corresponding to a utilization mode is automatically generated in accordance with the setting content selected by the user and a screen program. | 05-05-2016 |
20160125003 | SECONDARY QUEUE FOR INDEX PROCESS - An index server may be configured to collect content associated with an entity, and process the content for indexing at a primary index queue. At least one portion of the content may be marked for exclusion from the primary index queue, and the marked content may be transferred to a secondary index queue to be processed for indexing. A comprehensive index may be stored based on the processing performed at the primary and secondary index queues such that the content associated with the entity is wholly represented by indexed content. If the secondary index queue is disabled, automatically or in response to a request from the entity, the content processed for indexing at the primary index queue may be stored in an index excluding the marked content. In some examples, the comprehensive index and/or the index with excluded marked content may be provided to a compliance server for analysis. | 05-05-2016 |
20160125004 | METHOD OF INDEX RECOMMENDATION FOR NOSQL DATABASE - In one embodiment, a method for indexing a document database includes determining, according to user's access status on a plurality of documents in the document database, index indicators from attributes of respective fields of the plurality of documents, wherein the index indicators are used for indexing the plurality of documents, dividing the plurality of documents into a plurality of document sets, indexing the plurality of document sets according to the index indicators. The method may include dividing a plurality of documents in a document database into document sets by utilizing an attribute that a document database is easy to be divided into chunks, and may effectively implement indexing of the document database for the document sets according to another embodiment. | 05-05-2016 |
20160125005 | Apparatus and Method for Profiling Activities and Transitions - A method and apparatus for generating a structured profile of activities and transitions is provided. The method includes receiving data related to a plurality of activities and a plurality of transitions, wherein each of the plurality of transitions is a path between a pair of activities among the plurality of activities, storing the data related to the plurality of activities and the plurality of transitions into the storage device, reducing a dimension of the data by indexing the data using state variables, generating a plurality of nodes, wherein each of the plurality of nodes corresponds to each of the plurality of activities, generating a plurality of links, wherein each of the plurality of links corresponds to each of the plurality of transitions, and storing the plurality of nodes and the plurality of links as the structured profile at the storage device. | 05-05-2016 |
20160125009 | PARALLELIZED EXECUTION OF WINDOW OPERATOR - A window operator can be processed according to a variety of techniques that introduce parallelism. Window function sub-results can be calculated separately on different nodes. Overall superior performance can result. Skewness in input data can be accounted for by controlling a degree of parallelism at nodes. | 05-05-2016 |
20160125032 | PARTITION-AWARE DISTRIBUTED EXECUTION OF WINDOW OPERATOR - Partition-aware calculation of a window operator can be supported. Different nodes can calculate window function sub-results on database partitions locally, in parallel and independently. Recognition of scenarios in which such parallelism is permissible can be performed. Overall superior performance can result. | 05-05-2016 |
20160125084 | ELECTRONIC DEVICE, SEARCH SERVER, AND DATA SEARCH METHOD THEREFOR - An electronic device, a search server, and a data search method are provided herein. The search server includes a communication device configured to receive a plurality of indices for setting a search item. The search server also includes a memory configured to store an index map in which link relations among a plurality of indices are defined. The search server further includes a controller configured to detect at least one path, including the plurality of received indices, using the index map and to set the search item according to the at least one detected path. | 05-05-2016 |
20160125094 | METHOD AND SYSTEM FOR BEHAVIOR QUERY CONSTRUCTION IN TEMPORAL GRAPHS USING DISCRIMINATIVE SUB-TRACE MINING - A method and system for constructing behavior queries in temporal graphs using discriminative sub-trace mining. The method includes generating system data logs to provide temporal graphs, wherein the temporal graphs include a first temporal graph corresponding to a target behavior and a second temporal graph corresponding to a set of background behaviors, generating temporal graph patterns for each of the first and second temporal graphs to determine whether a pattern exists between a first temporal graph pattern and a second temporal graph pattern, wherein the pattern between the temporal graph patterns is a non-repetitive graph pattern, pruning the pattern between the first and second temporal graph patterns to provide a discriminative temporal graph, and generating behavior queries based on the discriminative temporal graph. | 05-05-2016 |
20160132515 | SOCIAL GENOME - Systems and methods are provided for processing a data stream in real time to identify connections between one or more social elements social elements and aggregating and storing the identified connections over a rolling time window. Social elements may be stored as nodes and connection between social elements may be stored as labeled connections between nodes. Other embodiments are provided. | 05-12-2016 |
20160132538 | CRAWLER FOR DISCOVERING CONTROL SYSTEM DATA IN AN INDUSTRIAL AUTOMATION ENVIRONMENT - An industrial data indexing system facilitates discovery and indexing of plant-wide data residing on multiple different data platforms. The indexing system automatically inventories industrial devices and other data sources located throughout a plant, and identifies available data items on each data source. The indexing system indexes the discovered data items in a federated data model that can subsequently be searched to locate data items or tags of interest. The federated data model records references to data items found on different types of data sources, including but not limited to industrial controller programs, human-machine interface applications, data historians, device documentation stores, inventory tracking systems, and other such data sources. The indexing system also discovers and records interdependencies between data items, which can be used to improve relevancy of search results returned to the user. | 05-12-2016 |
20160132540 | PRE-CACHING OF RELATIONAL DATABASE MANAGEMENT SYSTEM BASED ON DATA RETRIEVAL PATTERNS - A processor tracks a frequency of access requests of a first index corresponding to a first data page of a plurality of data pages stored in a database. The processor determines the first index corresponding to the first data page having a frequency of access requests that exceeds a configurable target, and the processor retains, with preference, the first data page that corresponds to the first index, within the cache memory. | 05-12-2016 |
20160132541 | EFFICIENT IMPLEMENTATIONS FOR MAPREDUCE SYSTEMS - Techniques for use with a processor configured to function as at least a Mapper in a MapReduce system include generating a set of [key, value] pairs by executing a Map function on input data. The set of [key, value] pairs may be stored in a storage system implemented on at least one data storage medium, the storage system being organized into a plurality of divisions with different divisions of the storage system storing [key, value] pairs corresponding to different keys. A first [key, value] pair corresponding to a first key handled by a first Reducer in the MapReduce system and a second [key, value] pair corresponding to a second key handled by a second Reducer in the MapReduce system may both be stored in a first division of the plurality of divisions. | 05-12-2016 |
20160132543 | AUTOMATICALLY AGGREGATING DATA IN DATABASE TABLES - A method for aggregating data in database tables. A computer receives a request to create an aggregation object comprising a set of related tables and aggregation rules. The computer system creates an aggregation object comprising multiple tables with related key and data columns and managed as a single entity with a single definition. Upon receiving data in the form of a new, inserted, or updated row, the computer system automatically aggregates the data to all objects in the aggregation object hierarchy by creating a summarized row for the current table using the inserted, merged or updated row and defined aggregation rules for each object. | 05-12-2016 |
20160140146 | Systems and Methods of Building and Using an Image Catalog - A method of managing an image catalog is performed by one or more servers. The process receives from a first user identification of one or more images in an image database. The image database is distinct from the servers. For each of the images, the process analyzes the image to extract keywords that describe the image and creates an index entry in the image catalog. The index entry includes the keywords. The process receives a query from a second user and matches the query to an index entry in the image catalog. The index entry corresponds to a first image in the image database. The process determines whether the second user is authorized to view the first image. When the second user is authorized to view the first image, the process retrieves the first image from the image database and transmits the first image to the second user. | 05-19-2016 |
20160140156 | DISTRIBUTED STORAGE SYSTEM WITH PLUGGABLE QUERY PROCESSING - As part of a query-processing technique, in response to receiving queries for information stored in a distributed storage system and associated query-processing information identifying sets of predefined operations to perform on one or more databases in the distributed storage system, the distributed storage system accesses sets of predefined operations. Then, for the sets of predefined operations on the one or more databases, the distributed storage system uses one or more adaptors for multiple storage subsystems in the distributed storage system and one or more indexes corresponding to collections of information in the one or more databases. Moreover, the distributed storage system executes the sets of predefined operations on the multiple storage subsystems to obtain results for the queries. The sets of predefined operations may be defined by different users, so that the distributed storage system can be adapted to these users' needs. | 05-19-2016 |
20160147479 | METADATA STORING TECHNIQUE - A computer-implemented metadata processing method and a system includes writing metadata to or reading metadata from a storage medium in which an index that contains metadata of each of one or more pieces of data is stored. The method and system include determining, in response to a processing request, which includes a write request or a read request, when to communicate with a storage area outside the index. At least a portion of content of the metadata relating to the processing request is processed, and accessed, including storing or obtaining, in the index, location information indicating the storage area of the portion of content of the metadata. | 05-26-2016 |
20160147803 | PATTERN-BASED PROBLEM DETERMINATION GUIDANCE - Embodiments in accordance with the present invention disclose a method and system for pattern-based problem determination guidance. The method involves receiving data with respect to a computer system and determining a pattern index based on the data, searching a database to find a matching pattern index, creating problem determination guidance based on the matching pattern index and an associated PCI triplet, sending the guidance to the computer system and receiving feedback from the computer system indicating the corrective action that was implemented, along with a response of the computer system, and storing in the database, data indicating the corrective action, and the response of the computer system to the corrective action. | 05-26-2016 |
20160147805 | SELF-OPTIMIZING TABLE DISTRIBUTION WITH TRANSPARENT REPLICA CACHE - Storing data in a distributed database management system. The distributed database management system includes a first set of database tables, wherein data of a logical database table is distributed among the first set of database tables according to a first distribution key. A second set of database tables is maintained using a second distribution key that is different from the first distribution key. Furthermore, at least one performance metric value for the stored query when executing a query is maintained, and the second distribution key is determined based on the at least one performance metric value. | 05-26-2016 |
20160147904 | FAST ROW TO PAGE LOOKUP OF DATA TABLE USING CAPACITY INDEX - The subject matter disclosed herein provides methods for determining the page on which a desired row position value is located. A table having a plurality of rows can be accessed. The rows can be distributed across one or more pages in an in-memory database. Each row can be associated with a unique row position value. Each page can be associated with a capacity that is representative of an amount of data stored on the page. A capacity index having a plurality of entries can be created to record changes in capacity between pages. Neighboring entries in the capacity index can have a different capacity. Each entry in the capacity index can correspond to a page. A page directory can be created based on the capacity index. The page directory can indicate all possible row position values associated with each page. Related apparatus, systems, techniques, and articles are also described. | 05-26-2016 |
20160154833 | CONTENT CLASS FOR OBJECT STORAGE INDEXING SYSTEM | 06-02-2016 |
20160154851 | COMPUTING DEVICE, STORAGE MEDIUM, AND DATA SEARCH METHOD | 06-02-2016 |
20160154896 | PROCESSING A DATA FLOW GRAPH OF A HYBRID FLOW | 06-02-2016 |
20160162520 | Data Storage Method and Apparatus for Distributed Database - A data storage method and apparatus for a distributed database which includes receiving, by a first node, a record insertion request, where the record insertion request carries a first record, and determining, by the first node according to a mapping relationship between a foreign key identifier and a storage node, a first storage node on which a first associated record that corresponds to the first record is located, where a primary key identifier of the first associated record is same as a foreign key identifier of the first record, and inserting, by the first node, the first record to a second node in the first storage nodes. | 06-09-2016 |
20160162522 | SYSTEMS AND METHODS FOR DETECTING DATA LEAKAGE IN AN ENTERPRISE - Systems and methods for detecting data leakage in an enterprise are described. In one implementation, the method comprises receiving files metadata from at least one device associated with the enterprise. The files metadata comprises a plurality of file parameters. Further, the method comprises processing the files metadata to generate indexed metadata. The indexed metadata comprises at least one of the plurality of file parameters. Further, the method comprises analyzing the indexed metadata based on predefined leakage patterns to detect the data leakage. | 06-09-2016 |
20160162524 | Real Time Indexing - Methods and systems for generating a database index and searching the database index are described herein. In one aspect this may include receiving an item of database data, wherein the item of database data comprises a sequence of data elements. Groups of data elements may be identified within the received item of database data. If any one or more of the identified groups of data elements have not been previously identified within the item of database data and/or previous items database data then the one or more identified group may be stored as index entries in a database index. Information describing the sequence of the groups of neighbouring data elements may be recorded within the database index as they are located within the item of database data. The database index may be searched based on a received search term comprising a sequence of data elements. | 06-09-2016 |
20160162526 | FAST STREAMS AND SUBSTREAMS - Various embodiments provide data structures and algorithms for efficiently computing a time ordered stream that is the combination of a time ordered substream for individual persons of interest. In addition, various embodiments provide an approach for identifying how many changes of interest have happened to a particular stream since it was last viewed. | 06-09-2016 |
20160171000 | Adjusting Extension Size of a Database Table Using a Volatile Database Table Attribute | 06-16-2016 |
20160171031 | COMPUTER-READABLE RECORDING MEDIUM | 06-16-2016 |
20160171032 | Managing a Computerized Database Using a Volatile Database Table Attribute | 06-16-2016 |
20160171033 | Managing a Computerized Database Using a Volatile Database Table Attribute | 06-16-2016 |
20160171052 | METHOD AND SYSTEM FOR DOCUMENT INDEXING AND DATA QUERYING | 06-16-2016 |
20160171089 | SYSTEMS AND METHODS FOR RESOLVING OVER MULTIPLE HIERARCHIES | 06-16-2016 |
20160171092 | Framework for Annotated-Text Search using Indexed Parallel Fields | 06-16-2016 |
20160179857 | DATABASE JOINS USING UNCERTAIN CRITERIA | 06-23-2016 |
20160179931 | System And Method For Supplementing Search Queries | 06-23-2016 |
20160188618 | CLIENT GENERATED AGGREGATED INDICES - Disclosed are various embodiments for generating an index. A computing device provides a location of a requested file to a client device in response to receiving a search query of a master index from the client device. The computing device updates the master index with index data associated with the requested file, wherein the index data associated with the requested file is received from the client device. | 06-30-2016 |
20160188642 | INCREMENTAL UPDATE OF EXISTING PATENTS WITH NEW TECHNOLOGY - A computer-implemented method for combining a primary document with one or more candidate documents, the method comprising: extracting process steps disclosed in the primary document and extracting candidate process steps disclosed in the one or more candidate documents; constructing a primary data structure corresponding to the primary document, wherein the primary data structure comprises interconnected nodes and each node corresponds to an extracted process step disclosed in the primary document; identifying one or more candidate processes to combine with the primary data structure; and inserting the one or more identified candidate process steps into the primary data structure. | 06-30-2016 |
20160188643 | METHOD AND APPARATUS FOR SCALABLE SORTING OF A DATA SET - Embodiments of the present invention pertain to a method and apparatus for a scalable sorting of a data set in a database on a computer system. A number of contiguous ranges spanning the data set are defined. Each individual data value of the data set is assigned to a range to which it falls into. The values in the ranges are then sorted. The sorting can be performed by different nodes in parallel. Once the sorting is completed, the results are stored in contiguous memory locations. This results the overall data set being sorted. | 06-30-2016 |
20160188645 | Performing keyword-based search and retrieval on data that has been losslessly reduced using a prime data sieve - Some embodiments can factorize input data into a sequence of elements. Next, for at least one element in the sequence of elements, the embodiments can identify one or more prime data elements and determine a reconstitution program based on the element and the one or more prime data elements. The embodiments can then use the one or more prime data elements and the reconstitution program to generate a losslessly reduced representation of the element, and store the losslessly reduced representation of the element at a storage location. Next, in response to determining that a keyword is present in the element, the embodiments can (i) create a link that associates the keyword with the storage location, and (ii) store the link in the keyword index. The link can additionally associate the keyword with an offset where the keyword occurs in the element. | 06-30-2016 |
20160188664 | Rapid Searching and Matching of Data to a Dynamic Set of Signatures Facilitating Parallel Processing and Hardware Acceleration - A mechanism is provided in a data processing system for matching data to a dynamic set of signatures. The mechanism creates a state transition, result, and mask (STR&M) table based on a set of signatures. The mechanism executes scanner code in each of a plurality of hardware devices in parallel to form a plurality of matching engines. The mechanism loads the STR&M table into each of the plurality of matching engines. Responsive to receiving a plurality of input records, the mechanism distributes input strings from the plurality of input records among the plurality of matching engines and receiving signature set match results from the plurality of matching engines. | 06-30-2016 |
20160188751 | Rapid Searching and Matching of Data to a Dynamic Set of Signatures Facilitating Parallel Processing and Hardware Acceleration - A mechanism is provided in a data processing system for matching data to a dynamic set of signatures. The mechanism creates a state transition, result, and mask (STR&M) table based on a set of signatures. The mechanism executes scanner code in each of a plurality of hardware devices in parallel to form a plurality of matching engines. The mechanism loads the STR&M table into each of the plurality of matching engines. Responsive to receiving a plurality of input records, the mechanism distributes input strings from the plurality of input records among the plurality of matching engines and receiving signature set match results from the plurality of matching engines. | 06-30-2016 |
20160191295 | Semantics Annotation And Semantics Repository For M2M Systems - Methods, systems, and devices may be used to annotate and store “semantics of normal resources” (hereinafter resource semantics). These methods, systems, and devices may be a part of a functional architecture for semantics support and allow for semantics based query. Also methods, systems, and devices associated with a semantics child resource may be created within a resource in order to represent semantics information. | 06-30-2016 |
20160196286 | METHOD AND APPARATUS FOR MANAGING IMAGE METADATA | 07-07-2016 |
20160196290 | Alleviation of Index Hot Spots in Datasharing Environment with Remote Update and Provisional Keys | 07-07-2016 |
20160203130 | METHOD AND SYSTEM FOR IDENTIFYING AND EVALUATING SEMANTIC PATTERNS IN WRITTEN LANGUAGE | 07-14-2016 |
20160203146 | Ecosystem Services Index, Exchange and Marketplace and Methods of Using Same | 07-14-2016 |
20160203170 | ORDER ENCODED MANIFESTS IN A CONTENT CENTRIC NETWORK | 07-14-2016 |
20160203171 | INDEXING HETEROGENEOUS SERCHABLE DATA IN A MULTI-TENANT CLOUD | 07-14-2016 |
20160378802 | PROBABILISTIC DATA STRUCTURES FOR DELETION - A method for deleting a set of keys from a storage server is provided. The method includes generating a probabilistic data structure for a first set of keys and for each key in a second set of keys, determining whether a key of the second set of keys is found in the probabilistic data structure. The method includes identifying the key as a candidate for deletion if the key is not found in the probabilistic data structure. A system is also provided. | 12-29-2016 |
20160378810 | MULTIDIMENSIONAL CORRELATED DATA EXTRACTING DEVICE AND MULTIDIMENSIONAL CORRELATED DATA EXTRACTING METHOD - A method of extracting subsets from the whole population of data configured by values of many elements in a case where the subsets have a correlation for a plurality of elements and finding out correlated elements. More specifically, the method comprises modeling data as a vector based on values of all the elements configuring individual data, and, in a multidimensional space in which all the data included in the population is plotted, extracting subsets each having a multidimensional correlation based on the densities of plots, and finding out featured elements having a correlation in the subsets. | 12-29-2016 |
20160378811 | APPARATUS, SYSTEM AND METHOD OF PRESENTING VIEWS OF A USER'S DATA USING SUMMARY TABLES - An article, apparatus and methods are disclosed. The device and related methods involve a computing device including a program, such as a word processing program or spreadsheet program, configured to organize, through primary and other designations, table data and display, or otherwise use, the table of data. The program also provides a mechanism whereby by rows and columns may be prioritized for display, particularly on a relatively smaller mobile display. The prioritization(s) are then used by a device, whether the same or different than used to create provide the designations, to use the prioritizations to display the prioritized information, with other information being nested and displayable upon an interaction from a user, such as touching a displayed primary column cell to cause a secondary nested and related column cell to be displayed. In some instances, table designations occur automatically based on some attribute of the table or table data. | 12-29-2016 |
20160378833 | QUERY PROCESSING USING A DIMENSION TABLE IMPLEMENTED AS DECOMPRESSION DICTIONARIES - According to one embodiment of the present invention, a system may execute database queries and may comprise a computer system including at least one processor. The system may select a column of a first table as a candidate column for creating a dictionary, create tokens for a foreign key in a second table that corresponds to a primary key in the first table, generate the dictionary for the selected column using the created tokens as indexes in the dictionary and modify a query accessing the column of the first table to retrieve column values based on the dictionary. Embodiments of the present invention further include a method and computer program product executing database queries in substantially the same manner described above. | 12-29-2016 |
20170235747 | Electronic Document and Electronic File | 08-17-2017 |
20170235775 | LOCATING DATA IN A SET WITH A SINGLE INDEX USING MULTIPLE PROPERTY VALUES | 08-17-2017 |
20170235845 | NON-UNIQUE SECONDARY INDEXING OF SEMI-STRUCTURED DATA IN DATABASES | 08-17-2017 |
20180024988 | METHOD FOR GENERATING SEARCH INDEX AND SERVER UTILIZING THE SAME | 01-25-2018 |
20180025015 | ESTIMATING MOUNT TIME COMPLETION IN FILE SYSTEMS | 01-25-2018 |
20180025037 | Big Data Computing Architecture | 01-25-2018 |
20190146960 | DATA ENRICHMENT AND AUGMENTATION | 05-16-2019 |
20190147076 | FEATURE GENERATION AND STORAGE IN A MULTI-TENANT ENVIRONMENT | 05-16-2019 |
20220138244 | MEDIUM RECOMMENDATION SYSTEM, TABLE CREATION SYSTEM, MEDIUM RECOMMENDATION DEVICE, AND TABLE CREATION METHOD - One or more embodiments of the invention is a medium recommendation system for recommending media or sets of media in a game that proceeds as a user selects media from a possessed medium group constructed to include D media and places the media in a game field, the recommended media or sets of media being media or sets of media to be added to a medium in order to construct at least a portion of the possessed medium group from the medium, the medium recommendation system including: an input acceptance unit that accepts the input of a single medium among the T kinds of media; a table storage unit that stores first tables; and a medium recommendation unit that is capable of outputting, as recommendation information, media or sets of media with which the single medium accepted by the input acceptance unit is associated in the first tables. | 05-05-2022 |
20220138262 | TECHNIQUES TO GENERATE AND STORE GRAPH MODELS FROM STRUCTURED AND UNSTRUCTURED DATA IN A CLOUD-BASED GRAPH DATABASE SYSTEM - Embodiments include systems, methods, articles of manufacture, and computer-readable media configured process data in a structured format and an unstructured format and applying one or more algorithms to detect elements and links between the elements in the data. Embodiments are further configured to generate a graph model comprising nodes comprising the elements and edges comprising the links. | 05-05-2022 |