Entries |
Document | Title | Date |
20100082618 | CLUSTERED SEARCH PROCESSING - Methods and apparatus for searching data and grouping search results into clusters that are ordered according to search relevance. Each cluster comprises one or more data types, such as images, web pages, local information, news, advertisements, and the like. In one embodiment, a search term is evaluated for related concepts indicating categories of data sources to search. Data sources may also be identified by context information such as a location of a client device, a currently running application, and the like. Search results in each cluster are ordered by relevance and each cluster is given a score based on an aggregate of the relevance within the cluster. Each cluster score may be modified based on one or more corresponding concepts and/or context information. The clusters are ordered based on the modified scores. Content, including advertisements, may also be added to the ordered list to appear as another cluster. | 04-01-2010 |
20100100541 | INFORMATION RETRIEVAL APPARATUS - An information retrieval apparatus, which can present to a user only a related word matching a user search intent, includes: an associative dictionary storage unit ( | 04-22-2010 |
20100114883 | SYSTEM FOR DYNAMIC PRODUCT SUMMARY BASED ON CONSUMER-CONTRIBUTED KEYWORDS - A system for presenting keywords obtained from users in a review process. The keywords are displayed along with a use value that reflects the number of times users have voted or selected the keyword as being relevant or effective in a review of a product. The keywords can be used to assist consumers in deciding whether to purchase a product or a service, in determining a brand's reputation, or for other purposes. Keywords can be ranked according to usage criteria such as the frequency of use of the keyword in reviews, the reputation of a user/reviewer who created or used the keyword, etc. Rankings can be dynamically updated when keyword usage changes, such as when a keyword declines in popularity, when words change in meaning or become obsolete or irrelevant with respect to their original intent, etc. Keywords can be used as filters for product searches. | 05-06-2010 |
20100114884 | INTERACTIVE PROGRAM SEARCH APPARATUS - In an interactive program search apparatus ( | 05-06-2010 |
20100211567 | Word Association Method and Apparatus - A method for creating and using a cross-idea association database that includes a method for associating words and word strings in a language by analyzing word formations around a word or word string to identify other words or word strings that are equivalents or near equivalents semantically. One method for associating words and word strings includes querying a collection of documents with a user-supplied word or word string, determining a user-defined amount of words or word strings to the left and right of the query string, determining the frequency of occurrence of words or word strings located on the left and right of the query string, and ranking the located words. | 08-19-2010 |
20100228729 | Detecting Real Word Typos - Systems and methods for detecting real word typos are provided. Received text is designated for evaluation. A plurality of words in the received text is parsed into word pairs. A word pair is two consecutive words found in the designated text. A database is identified for comparison to the text. The database includes word pairs previously identified in one or more source texts. The word pairs in the received text are analyzed based on a comparison to the word pairs in the identified database. Based on the analysis, an indication may be generated that a word pair from the designated text may include an error. | 09-09-2010 |
20100325108 | Compiling Co-associating Bioattributes - A bioinformatics method, software, database and system are presented in which attribute profiles of query-attribute-positive individuals and query-attribute-negative individuals are compared, and combinations of pangenetic and non-pangenetic attributes that occur at a higher frequency in the group of query-attribute-positive individuals are identified and stored to generate a compilation of bioattribute combinations that co-associate with the query attribute (i.e., an attribute of interest). | 12-23-2010 |
20110016118 | METHOD AND APPARATUS FOR DETERMINING RELEVANT SEARCH RESULTS USING A MATRIX FRAMEWORK - A method and apparatus are provided for ranking documents according to relevancy scoring. In one implementation, a computer-implemented method is provided for receiving search results identifying a plurality of documents resulting from a search, the plurality of documents containing one or more words. The method generates a first matrix containing a term column and a document column, wherein at least one row of the first matrix correlates one of the plurality of documents with one of the terms. The method selects a sort preference, and sorts the two-column matrix according to the sort preference. The method further generates a second matrix containing values representing a measure of overlap between the plurality of documents and the terms. The method further calculates cumulative confidence scores according to the values of the second matrix and ranks the search results according to the cumulative confidence scores. | 01-20-2011 |
20110131207 | TEXT MESSAGING HOT TOPICS - A device includes a memory to store instructions; and a processor to execute the instructions to implement a data collector to collect text messages, a keyword extractor to extract keywords or key phrases from the collected text messages, and a user interface to present one or more of the extracted keywords or key phrases to a particular user. The device further includes a filter to filter the collected text messages based on one or more criteria, and a keyword ranker to rank the extracted keywords or key phrases based on one or more criteria. | 06-02-2011 |
20110145235 | Determining Core Geographical Information in a Document - A method determines core geographical information in a document by computing a score for each geographical name found in the document. The computation of the score uses the appearance frequency of the respective geographical name and positional weights assigned to various types of appearance positions of the geographical name in the document. The system determines the core geographical information in the document based on the scores of the geographical names found in the document. The method may further compute aggregated scores of geographical regions related to the geographical names and determine the core geographical information using both the aggregated scores of geographical regions and the scores of individual geographical names to increase accuracy. | 06-16-2011 |
20110208735 | Learning Term Weights from the Query Click Field for Web Search - Described is a technology by which a term frequency function for web click data is machine learned from raw click features extracted from a query log or the like and training data. Also described is combining the term frequency function with other functions/click features to learn a relevance function for use in ranking document relevance to a query. | 08-25-2011 |
20120005202 | Method for Acceleration of Legacy to Service Oriented (L2SOA) Architecture Renovations - A method, system, and program product are presented for identifying similar functional segments of code to a service oriented architecture transition team. The method, system and program product comprise identifying, by a processor of a computer, a number of functionally equivalent segments in a number of lines of code by analyzing tag files associated with each of a number of functional segments in the number of lines of code. | 01-05-2012 |
20120150857 | BOOKMARK EXTRACTING APPARATUS, METHOD AND COMPUTER PROGRAM - Disclosed is a bookmark extracting apparatus which accurately selects, from bookmarks registered in advance, the most suitable bookmarks related to a website being browsed at present and provides them to a user. This bookmark extracting apparatus includes a keyword extraction unit which extracts a keyword based on browsing history information of a website up to now, and a providing unit which provides a bookmark related to the keyword extracted by the extraction unit from a plurality of registered bookmarks based on the keyword. | 06-14-2012 |
20120158718 | INVERTED INDEXES WITH MULTIPLE LANGUAGE SUPPORT - A search query for a collection of electronic documents is parsed to identify one or more terms and such identified terms are associated with one or more languages (i.e., spoken languages such as English, German, Spanish, etc.). A terms inverted index and a language inverted index are accessed to identify documents responsive to the query. Related apparatus, systems, techniques and articles are also described. | 06-21-2012 |
20120203777 | Finding and Disambiguating References to Entities on Web Pages - A system and method for disambiguating references to entities in a document. In one embodiment, an iterative process is used to disambiguate references to entities in documents. An initial model is used to identify documents referring to an entity based on features contained in those documents. The occurrence of various features in these documents is measured. From the number of occurrences of features in these documents, a second model is constructed. The second model is used to identify documents referring to the entity based on features contained in the documents. The process can be repeated, iteratively identifying documents referring to the entity and improving subsequent models based on those identifications. Additional features of the entity can be extracted from documents identified as referring to the entity. | 08-09-2012 |
20120254166 | Signature Detection in E-Mails - In an electronic discovery search tool, non-substantive information, such as signatures in e-mail, can bias a search tool and add processing time. A method and system for identifying recurring non-substantive text in documents has been developed so that non-substantive text may be processed or ignored by the search tool, as needed. | 10-04-2012 |
20120317106 | Information Providing System - An internal state of a user and a situation of the user are complementarily used to recommend optimal information for the individual user. An information providing system according to the present invention estimates a current strength of a desire of the user and a current situation of the user to refer to a database describing combinations of the strength and the situation to thereby present items that can satisfy both the desire of the user and the situation of the user. | 12-13-2012 |
20130066863 | Indicating a content preference - Recording a user's preference for content is disclosed. A first indication that a user has a first preference for the content is received. In response to receiving the first indication, the content is associated with the first preference. A second indication that the user has a second preference for the content is received. In response to receiving the second indication, the content is additionally associated with the second preference. | 03-14-2013 |
20130086056 | GESTURE BASED CONTEXT MENUS - Methods, systems, and techniques for providing context menus based upon gestured input are provided. Example embodiments provide a Gesture Based Context Menu System, which enables a gesture-based user interface to invoke a context menu to present one or more choices of next actions and/or entities based upon the context indicated by the gestured input and a set of criteria. In overview, the GBCMS allows an area of electronically presented content to be dynamically indicated by a gesture and then examines the indicated area in conjunction with a set of criteria to determine and present a context menu of further choices available to the user. The choices may be presented in the form of, for example, a pop-up menu, a pull-down menu, an interest wheel, or a rectangular or non-rectangular menu. In some embodiments the menus dynamically change as the gesture is modified. | 04-04-2013 |
20130110830 | RANKING OF ENTITY PROPERTIES AND RELATIONSHIPS | 05-02-2013 |
20130132381 | TAGGING ENTITIES WITH DESCRIPTIVE PHRASES - A plurality of description phrases associated with a first domain may be determined, based on an analysis of a first plurality of documents to determine co-occurrences of the description phrases with one or more name labels associated with the first domain. An entity associated with the first domain may be obtained. An analysis of a second plurality of documents may be initiated to identify co-occurrences of mentions of the obtained entity and one or more of the plurality of description phrases, and contexts associated with each of the co-occurrences of the mentions and description phrases, in each one of the second plurality of documents. A description tag association between the obtained entity and one of the description phrases may be determined, based on an analysis of the identified contexts. | 05-23-2013 |
20130138641 | CONSTRUCTION OF TEXT CLASSIFIERS - Methods, systems, and apparatus, including computer program products, for constructing text classifiers. The method includes receiving a collection of candidate phrases for a given topic; filtering the received candidate phrases to remove erroneously included candidate phrases; assigning weights to the candidate phrases including scoring each candidate phrase using an initial classifier and assigning weights to the candidate phrases based on the scores; and generating a linear classifier using the filtered and weighted candidate phrases, where the linear classifier varies the weights for each phrase candidate depending on the length of the document being classified. | 05-30-2013 |
20130144874 | METHOD AND SYSTEM FOR DOCUMENT CLASSIFICATION OR SEARCH USING DISCRETE WORDS - A method of operating a computerized document search system where information is matched against a database containing documents in response to user queries includes receiving a query identifying a source document that has information content related to the documents within the database. Important words within the source document are detected automatically, where at least one of the important words has been processed using at least two dictionary functions consisting of Derived Words, Acronym, Word Capitalization, and Hyphenation. An importance value is generated for important words in a processed document using a WordRatio and at least one of a selected set of values. A score is generated for a processed document based partly on the importance value of at least one important word in that document. A document list is created for identifying documents that are related to a source document. | 06-06-2013 |
20130151514 | EXTRACTING TIPS - Embodiments disclosed herein may relate to extracting tips from online sources and/or selecting tips for display to a user on a computing platform. | 06-13-2013 |
20130159300 | Computer-Implemented System and Method for Clustering Similar Documents - A computer-implemented system and method for clustering similar documents is provided. Concepts are identified for a set of documents and occurrence frequencies are determined for each concept in the documents set. A distance quantifying a similarity for each of the documents in the set with one or more clusters of documents is calculated. Each document is mapped to at least one of the one or more document clusters. | 06-20-2013 |
20130173611 | GENERATION OF NICKNAME DICTIONARY - Methods, apparatuses and systems for generating a name-word dictionary that includes associations between names of users and candidate words (e.g., nicknames) based on statistical analysis of user communications observed at a network communications facility, such as a social network system, an email provider and the like. | 07-04-2013 |
20130212093 | GENERATING VISUALIZATIONS OF A DISPLAY GROUP OF TAGS REPRESENTING CONTENT INSTANCES IN OBJECTS SATISFYING A SEARCH CRITERIA - Provided are a computer program product, method, and system for rendering search results. A search request is received having a search criteria to perform with respect to objects having content instances. A determination is made of the objects having qualifying content instances that satisfy the search criteria, an attribute value of the qualifying content instances for a specified attribute, and appearance settings for the qualifying content instances based on the determined attribute values. The appearance settings vary based on the attribute values. Tags are generated indicating the content instances and appearance settings for the content instances. A visualization of the tags in a display group is generated to provide visualization of the qualifying content instances in the objects according to the appearance settings, wherein visualizations of the tags are varied based on the determined appearance settings. | 08-15-2013 |
20130212094 | VISUAL SIGNATURES FOR INDOOR POSITIONING - Systems and methods for managing and utilizing visual signature (VS) databases are described herein. A method for managing a VS database as described herein includes obtaining a plurality of images of objects represented by a VS; obtaining context information associated with the plurality of images; grouping the plurality of images into one or more context classifications according to the context information associated with the plurality of images; for respective ones of the one or more context classifications, selecting an image representative of the VS according to one or more criteria; and adding the selected images for the respective ones of the one or more context classifications to entries of the VS database corresponding to the VS. | 08-15-2013 |
20130212095 | SYSTEM AND METHOD FOR MARK-UP LANGUAGE DOCUMENT RANK ANALYSIS - A system and method for mark-up language document rank analysis that may be performed automatically and that may also determine one or more differences between mark-up language documents with regard to their relative rank. | 08-15-2013 |
20130212096 | METHOD FOR MATCHING QUERIES WITH ANSWER ITEMS IN A KNOWLEDGE BASE - A system for providing answers to questions presented in the form of electronic signals representing natural language words conveyed to said system by way of a network connected to a computer. The system includes a plurality of search indexes relating to a field of knowledge, each in a specific natural language. A storehouse of natural words, in which a list of natural words is maintained in an order reflecting the usage frequency of said words in that list, is associated with each one of the search indexes. In addition, the system includes a language storehouse of natural words common to each of the search indexes, each of which is associated with a specific natural language. The search index includes a list of score ordered keywords, indexed answer items each associated with an internal list of references (ILOR) pointing to it, and a list of ordered numerical references associated with each of the ordered keywords. Each such reference represents quantitatively an association between the keyword and an indexed answer item. | 08-15-2013 |
20130246412 | RANKING SEARCH RESULTS USING RESULT REPETITION - Ranking search results using result repetition is described. In an embodiment, a set of results generated by a search engine is ranked or re-ranked based on whether any of the results were included in previous sets of results generated in response to earlier queries by the same user in one or more searching sessions. User behavior data, such as whether a user clicks on a result, skips a result or misses a result, is stored in real-time and the stored data is used in performing the ranking. In various examples, the ranking is performed using a machine-learning algorithm and various parameters, such as whether a result in a current set of results has previously been clicked, skipped or missed in the same session, are generated based on the user behavior data for the current session and input to the machine-learning algorithm. | 09-19-2013 |
20130268526 | DISCOVERY ENGINE - A searching/discovery engine is disclosed wherein the searching methodology may involve selecting at least one category of sources; selecting at least one source (i.e. a collection of documents) within at least one category of sources; utilizing search terms to search the at least one source; returning related documents from the at least one source based on the search terms; collecting any of the related documents into a collection; permitting at least one related document returned to be selected for a further search utilizing the entire text of the at least one related document as the search criteria in a selected source to return additional related documents; and exporting the collection of related documents by creating a Uniform Resource Locator (URL) with all of the collected related documents stored at a location referenced in the URL. | 10-10-2013 |
20130325858 | PERSONALIZED PROFESSIONAL CONTENT RECOMMENDATION - A personalized content recommendation system includes a client interface configured to automatically monitor a user's information data stream transmitted on the Internet. A hybrid contextual behavioral and collaborative personal interest inference engine resident to a non-transient media generates automatic predictions about the interests of individual users of the system. A database server retains the user's personal interest profile based on a plurality of monitored information. The system also includes a server programmed to filter items in an incoming information stream with the personal interest profile and is further programmed to identify only those items of the incoming information stream that substantially match the personal interest profile. | 12-05-2013 |
20130332454 | DICTIONARY ENTRY NAME GENERATOR - A method for building dictionary entry names for data elements of a canonical data model includes identifying candidate terms for the dictionary entry name of a node or equivalence class of the canonical data model. The method includes counting a frequency of occurrence of candidate terms in use and based on the use counts creating a candidate ordering of terms for the complete ordered dictionary entry name of the node or equivalence class. The method further includes validating the candidate ordering of terms for the complete ordered dictionary entry name of the node or equivalence class by comparison of the ordering with reliable dictionary entry name entries in a database and/or by usage counts in search engine results. | 12-12-2013 |
20140136532 | SEMI-AUTOMATIC INDEX TERM AUGMENTATION IN DOCUMENT RETRIEVAL - Disclosed are methods and systems for indexing or retrieving materials accessible through computer networks. | 05-15-2014 |
20140136533 | INDEXING AND SEARCH QUERY PROCESSING - A method for processing a search query according to one embodiment includes receiving a search query containing terms; combining at least some consecutive terms in the search query to create biwords; looking up at least some of the terms and biwords in a search index for identifying sections of documents containing the at least some of the terms and/or biwords; generating a content score for each of the identified sections based at least in part on a number of the terms and biwords found in the sections of each document, wherein the biwords are given a higher priority than matched terms, wherein the priority affects the content score; and selecting and outputting an indicator of at least one of the sections, or portion thereof, based at least in part on the content score. | 05-15-2014 |
20140164370 | METHOD FOR RETRIEVAL OF ARABIC HISTORICAL MANUSCRIPTS - The method for retrieval of Arabic historical manuscripts using Latent Semantic Indexing approaches the problem of manuscripts indexing and retrieval by automatic indexing of Arabic historical manuscripts through word spotting, using “Text Image” similarity of keywords. The similarity is computed using Latent Semantic Indexing (LSI). The method involves a manuscript page preprocessing step, a segmentation step, and a feature extraction step. Feature extraction utilizes a circular polar grid feature set. Once the salient features have been extracted, indexing of historical Arabic manuscripts using LSI is performed in support of content-based image retrieval (CBIR). | 06-12-2014 |
20140188864 | SYSTEM AND METHOD OF SEMANTIC BASED SEARCHING - A computer-implemented method is provided for searching documents containing complex bodies of knowledge, such as patents and research papers. The computer-implemented method and related hardware and software provides methodology to interpret the intent of the searcher (the meaning of the searcher's query) into a MetaLanguage, including but not limited to the use of Fundamental Nature Attributes, Fundamental Action Attributes and Weighting of these attributes as it pertains to the intent of the searcher. The invention relates to semantic based searches. The same methodology that is used on the searcher's query is also used to mine and store the existing databases of patents and research papers into databases of MetaLanguage for the purpose of producing search results that better match search inquiries. | 07-03-2014 |
20140207770 | System and Method for Identifying Documents - A system for determining a similarity between a first document and a potential matching document is provided, wherein the system comprises a processor that is configured to perform steps of: determining a first identifier associated with the first document; identifying at least one potential matching document; for each document of the at least one potential matching documents: determining a second identifier; and determining a document similarity score, the document similarity score being indicative of a similarity between the first identifier and the second identifier. | 07-24-2014 |
20140214821 | SYSTEM AND METHOD FOR ADAPTIVE TEXT RECOMMENDATION - A network system provides a real-time adaptive recommendation set of documents with a high statistical measure of relevancy to the requestor device. The recommendation set is optimized based on analyzing text of documents of the interest set, categorizing these documents into clusters, extracting keywords representing the themes or concepts of documents in the clusters, and filtering a population of eligible documents accessible to the system utilizing site and/or Internet-wide search engines. The system is either automatically or manually invoked and it develops and presents the recommendation set in real-time. The recommendation set may be presented as a greeting, notification, alert, HTML fragment, fax, voicemail, or automatic classification or routing of customer e-mail, personal e-mail, job postings, and offers for sale or exchange. | 07-31-2014 |
20140236940 | SYSTEM AND METHOD FOR ORGANIZING SEARCH RESULTS - Some embodiments concern a method for organizing two or more search results. The method includes: receiving at least one search parameter from a user; using at least one computer processor to determine a search type based upon the at least one search parameter; using the at least one computer processor to determine potential search results based upon the at least one search parameter; using the at least one computer processor to determine one or more qualitative traits of the potential search results; using the at least one computer processor to organize the two or more search results based upon the search type and the one or more qualitative traits of the potential search results; and displaying the two or more search results to the user. Other embodiments are disclosed. | 08-21-2014 |
20140236941 | DISCOVERY ENGINE - A method that is relatively inexpensive to implement and that permits a user to conduct searches of electronically stored documents using an entire document, multiple documents or portions of a document as the search criteria and to collect, store and to share the relevant documents from the search. | 08-21-2014 |
20140280114 | QUESTION ANSWERING USING ENTITY REFERENCES IN UNSTRUCTURED DATA - Methods, systems, and computer-readable media are provided for collective reconciliation. In some implementations, a query is received, wherein the query is associated at least in part with a type of entity. One or more search results are generated based at least in part on the query. Previously generated data is retrieved associated with at least one search result of the one or more search results, the data comprising one or more entity references in the at least one search result corresponding to the type of entity. The one or more entity references are ranked, and an entity result is selected from the one or more entity references based at least in part on the ranking. An answer to the query is provided based at least in part on the entity result. | 09-18-2014 |
20150039606 | SEARCH PHRASE MODIFICATION - A user may submit a search string to a system. Before processing the search, the system may analyze the search string and modify it. For example, the search string may be modified by treating some terms as a phrase, by dropping some terms, by treating some terms as attributes, or any suitable combination thereof. The modification of the search string may be based on an analysis of prior search strings and user actions. The results of a search based on the modified search string may be returned to the user. | 02-05-2015 |
20150066921 | SYSTEMS AND METHODS FOR IDENTIFYING WORD PHRASES BASED ON STRESS PATTERNS - The present disclosure provides systems and methods for generating matching phrases based on user-defined criteria including a desired stress pattern of a phrase. The system may determine a stress pattern based on user-defined criteria including an auditory file of a melody, and use the determined stress pattern to generate a plurality of matching phrases that include the same stress pattern. | 03-05-2015 |
20150112981 | Entity Review Extraction - Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for entity review extraction. In one aspect, a method includes receiving documents identified as containing potential reviews of entities and extracting individual review candidates from one or more of the received documents wherein each individual review candidate contains at most one review and providing one or more of the review candidates to a sentiment analysis process wherein the sentiment analysis process is configured to calculate a sentiment magnitude for each of the review candidates based on words in the review candidates. | 04-23-2015 |
20150134652 | METHOD OF EXTRACTING AN IMPORTANT KEYWORD AND SERVER PERFORMING THE SAME - A method of extracting an important keyword by an important keyword extracting server includes receiving a set of one or more documents from a network, receiving one or more user defined keywords from a user terminal, calculating, by the server, a relative importance value for each of words detected in the set of documents, determining, by the server, a weight for each of the words based on the one or more user defined keywords, applying, by the server, the weight for each of the words to the relative importance value for each of the words, determining, by the server, at least one of the words to be the important keyword based on the relative importance value to which the weight is applied, and transmitting, by the server, the important keyword to the user terminal. Therefore, the method may effectively detect a user defined keyword from at least one document. | 05-14-2015 |
20150142791 | GENERATING VISUALIZATIONS OF A DISPLAY GROUP OF TAGS REPRESENTING CONTENT INSTANCES IN OBJECTS SATISFYING A SEARCH CRITERIA - Provided are a computer program product, method, and system for rendering search results. A search request is received having a search criteria to perform with respect to objects having content instances. A determination is made of the objects having qualifying content instances that satisfy the search criteria, an attribute value of the qualifying content instances for a specified attribute, and appearance settings for the qualifying content instances based on the determined attribute values. The appearance settings vary based on the attribute values. Tags are generated indicating the content instances and appearance settings for the content instances. A visualization of the tags in a display group is generated to provide visualization of the qualifying content instances in the objects according to the appearance settings, wherein visualizations of the tags are varied based on the determined appearance settings. | 05-21-2015 |
20150293918 | Web Searching Software Promoting Results Of Websites Formatted For Mobile Devices - A system for promoting results of websites formatted for mobile devices including a processor, software executing on the processor receiving a search term via a communications network from a mobile device, software executing on the processor identifying a plurality of websites responsive to the search term, software executing on the processor determining a mobile compatibility of each of the plurality of websites, software executing on the processor generating and presenting, via the communications network to a user interface of the mobile device, a list of at least a portion of the plurality of websites ranked at least in part according to the mobile compatibility of each of the plurality of websites. | 10-15-2015 |
20150294017 | Systems and Methods for Paragraph-Based Document Searching - A computerized method of searching a collection of electronic documents may include comparing search terms to sets of paragraph terms associated with paragraphs in the documents. Search terms and paragraph terms may be standardized prior to the comparison. The method may also include generating paragraph scores for the paragraphs using term weight values associated with paragraph terms that match search terms, and using the paragraph scores to generate overall document scores. The method may also include using the overall document scores to determine a set of search results and providing the search results to a display. | 10-15-2015 |
20150294018 | METHOD AND APPARATUS FOR RECOMMENDING KEYWORDS - A method for recommending keywords can receive a first search term entered by a user, search a keyword library comprising a plurality of keywords and retrieve a preset number of keywords based on a similarity coefficient between each keyword and the first search term. After receiving a second search term entered by the user, the method obtains a correlation value between the second search term and the first search term based on whether a webpage in a search result of the first search term visited by the user includes the second search term, and determines the similarity coefficient between the second search term and the first search term in accordance with the correlation value. The method then updates the keyword library to save the similarity coefficient between the second search term and the first search term. | 10-15-2015 |
20150310010 | SYSTEMS AND METHODS FOR MULTIMEDIA IMAGE CLUSTERING - Computer image clustering systems and methods for conducting effective media searches by grouping multimedia documents tagged by keywords into a hierarchy of images configured to: (1) maintain a first database, (2) maintain an initial occurrence matrix, (3) maintain an occurrence matrix, (4) maintain a media file activation score for each media file in the first database, (5) generate a log version of the occurrence matrix, (6) maintain an inverse media file frequency value for each descriptive term in the first database, (7) generate a descriptive term frequency matrix and generate a list of document vectors in multidimensional space (list), and (8) organize and process each media file in the list into a high activation score category and a low activation score category. | 10-29-2015 |
20150310020 | METHODS, SYSTEMS, AND DEVICES FOR OUTCOME PREDICTION OF TEXT SUBMISSION TO NETWORK BASED ON CORPORA ANALYSIS - Computationally implemented methods and systems include receiving input of a message that is configured to be submitted to a network for publication, facilitating performance of text-based analysis on the acquired message to determine an objective message prediction, wherein the text-based analysis is at least partially based on a corpus of one or more related texts, and acquiring the determined objective message prediction. In addition to the foregoing, other aspects are described in the claims, drawings, and text. | 10-29-2015 |
20150317313 | SEARCHING LOCALLY DEFINED ENTITIES - A user can select a name of an entity such as a character in a book. In response to the selection, the passages of the book are processed using entity frequency and passage length to determine passages that are relevant to the entity. These relevant passages are processed to determine which of the relevant passages are descriptive and are most likely to help a user understand the entity by identifying characteristics of helpful passages such as words that indicate particular actions, words that are associated with biographical information, or the location of the passage in the book. The most descriptive passages can be shown to the user on the computing device that he is using to view the book. | 11-05-2015 |
20150324363 | METHODS AND SYSTEMS FOR PROVIDING AN AUTO-GENERATED REPAIR-HINT TO A VEHICLE REPAIR TOOL - Methods and systems pertaining to auto-generated repair-hints are described. A computer-readable processor can compare terms identified on a repair order (RO) to a taxonomy term database to determine standard terms associated with the terms on the RO, to store the standard terms as meta-data associated with the RO, to select pre-drafted text strings with gaps, and to insert the meta-data into the text string gaps to create a complete text string that forms at least part of an auto-generated repair-hint. The processor can receive a set of standard search terms to search for the auto-generated repair-hint from among multiple repair-hints. The processor can cause the auto-generated repair-hint to be transmitted to a vehicle repair tool for displaying the auto-generated repair-hint. A machine including the processor can receive a set of non-standard search terms (NSST) and identify a set of standard search terms associated with the NSST. | 11-12-2015 |
20150339298 | DOCUMENT MANAGEMENT SYSTEM, DOCUMENT MANAGEMENT METHOD, AND DOCUMENT MANAGEMENT PROGRAM - It is possible to reduce a review load of a reviewer. A document management system includes a screen display unit that displays a document group having a plurality of pieces of document data extracted from digital information to be determined for relevance to a lawsuit by a user and classification buttons allowing the user to select classification conditions for classifying the document group under predetermined conditions, a selection information reception unit that receives information relating to a classification button selected by the user among the classification buttons displayed by the screen display unit as selection information, and a classification instruction unit that analyzes the document group based on the selection information, classifies document data in the document group using the analysis result, and instructs the screen display unit to display the document group based on the classification result. | 11-26-2015 |
20150347418 | INFORMATION PROVISION DEVICE, INFORMATION PROVISION METHOD, AND INFORMATION PROVISION PROGRAM - An information provision device according to one embodiment includes an acquisition unit, a counting unit, and a presentation unit. The acquisition unit refers to data containing a location of a facility, a comment posted by a user of the facility, and a date of use of the facility by the user in association with one another, and acquires a set of an area where the facility is located, a keyword extracted from the comment, and a period corresponding to the date of use. The counting unit counts the number of sets in each time during a specified period for each pair of the area and the keyword and thereby obtains a distribution of the number. The presentation unit outputs information about the pair having a burst time where the number is larger than in other times by a specified criterion or more, in association with the burst time. | 12-03-2015 |
20150356152 | TEXT MINING DEVICE, TEXT MINING METHOD, AND RECORDING MEDIUM - A text mining device includes: an analysis unit which acquires, from data including text and one or more attributes including an attribute name and an attribute value and associated with the text, the attributes as analysis viewpoints, analyzes the data using the respective analysis viewpoints to obtain an analysis result from each analysis viewpoint, and generates result vectors of the respective analysis viewpoints; a similarity acquisition unit which acquires a vector similarity between the result vectors of the plural analysis viewpoints; and a recommendation unit which extracts and outputs a combination of the analysis viewpoints as a recommendation candidate on the basis of the vector similarity. | 12-10-2015 |
20150370805 | Suggested Keywords - A suggested keywords system is configured for identifying phrases, which are most relevant to experience and expertise of a professional network member, and which the member may be interested in weaving into their profile summary. The suggested keywords system generates a model, for each phrase, that calculates probability of that phrase being present in a profile that is characterized by the absence of certain attributes and by the presence of certain attributes. Based on the model, the suggested keywords system calculates a ranking value for the phrase for a particular target profile. The phrases with the higher rank are considered to be more relevant in describing professional background of the target member. A certain number of phrases that have the highest ranking are presented to the member as suggested keywords to be included in their professional summary. | 12-24-2015 |
20160004769 | ESTIMATING FULL TEXT SEARCH RESULTS OF LOG RECORDS - A method by a computer includes receiving a search query from a user equipment, where the search query defines a logical combination of terms to be searched within a defined interval of records of a log stream. An estimate is generated for the number of occurrences of the logical combination of terms in the defined interval of records. A message containing the estimate for the number of occurrences of the logical combination of terms in the defined interval of records is communicated toward the user equipment. | 01-07-2016 |
20160012054 | COMPUTING THE RELEVANCE OF A DOCUMENT TO CONCEPTS NOT SPECIFIED IN THE DOCUMENT | 01-14-2016 |
20160012057 | COMPUTING THE RELEVANCE OF A DOCUMENT TO CONCEPTS NOT SPECIFIED IN THE DOCUMENT | 01-14-2016 |
20160012115 | COMBINATIONAL DATA MINING | 01-14-2016 |
20160034575 | VOCABULARY-EFFECTED E-CONTENT DISCOVERY - A computing device includes a housing and a display assembly having a screen and a set of touch sensors. The housing at least partially circumvents the screen so that the screen is viewable. A processor is provided within the housing to display content pertaining to an e-book on the screen of the display assembly. The processor further detects a first user interaction with the set of touch sensors and interprets the first user interaction as a first user input corresponding with a selection of a word or phrase in the displayed content. In response to the first user input, the processor searches an e-book library for e-books containing the selected word or phrase and presents a set of search results on the display assembly. The set of search results includes one or more e-books that contain the selected word or phrase. | 02-04-2016 |
20160070707 | KEYWORD SEARCH ON DATABASES - Systems and methods for keyword based searching in a database are described herein. In one implementation, the method comprises receiving a keyword based query, comprising at least one keyword, from a user. The method further comprises searching an inverted index associated with the database to detect the presence of at least one of the keywords in documents, identified by a document ID, present in the inverted index. Based on the searching, the documents in which at least one of the keywords is present are identified. The identified documents are then ranked in a descending order of relevancy. | 03-10-2016 |
20160070748 | METHOD AND APPARATUS FOR IMPROVED SEARCHING OF DIGITAL CONTENT - Improved searching of digital content using a large corpus of content collected from content generating websites is described. A search query received from a user is compared to the collected content to determine how often the elements of the search query are repeated in the collected content and whether these elements have frequently co-occurred with other elements in the content. Co-occurring elements are presented to the user so that the user can select one or more elements that best describe her intent in conducting the search. An updated search query is formed based on the information received from the user. The updated query is used to retrieve a number of documents and the retrieved documents are classified to distinguish relevant documents from those irrelevant to the user's intent. Documents classified as relevant are presented to the user. | 03-10-2016 |
20160070803 | CONCEPTUAL PRODUCT RECOMMENDATION - A conceptual product recommendation service that allows users to define the parameters that drive a search for one or more target products as a concept that can be specified in a variety of different ways, ranging from the specification of an abstract or generic idea to the specification of a particular instance of a product that embodies one or more conceptual elements sought by the user. In the process of matching the user-specified concept to a set of target products, the conceptual product recommendation service compares a word vector based representation of a multi-document compilation relating to the user-specified concept to respective word vector based representations of multi-document compilations relating to the target products to produce respective match scores corresponding to degrees of match between the user-specified concept and the target products. | 03-10-2016 |
20160085755 | IDENTIFYING AND SCORING DATA VALUES - Text including at least a first term can be presented on a display. An enterprise glossary is queried to identify other terms that match the first term. Data assets to which each of the other terms are linked and which include data values for the other terms can be identified. A first score indicating a level of relevance of the respective data asset to an enterprise is assigned to each of the data assets. A frequency distribution of the data values in the data assets is determined. Based at least on the first scores indicating the level of relevance of the respective data assets to the enterprise and the frequency distribution of the data values in the data assets, second scores are assigned to each of the data values. A plurality of the data values which are assigned the highest of the second scores are presented on the display. | 03-24-2016 |
20160085756 | IDENTIFYING AND SCORING DATA VALUES - Text including at least a first term can be presented on a display. An enterprise glossary is queried to identify other terms that match the first term. Data assets to which each of the other terms are linked and which include data values for the other terms can be identified. A first score indicating a level of relevance of the respective data asset to an enterprise is assigned to each of the data assets. A frequency distribution of the data values in the data assets is determined. Based at least on the first scores indicating the level of relevance of the respective data assets to the enterprise and the frequency distribution of the data values in the data assets, second scores are assigned to each of the data values. A plurality of the data values which are assigned the highest of the second scores are presented on the display. | 03-24-2016 |
20160085830 | REPUTATION-BASED DISCOVERY OF CONTENT OBJECTS - A content-discovery system allows a node in a Content Centric Network (CCN) to discover content over CCN. The CCN node can generate an Interest that includes a query for discovering content associated with a given name prefix, and after disseminating the Interest over CCN, can receive a query-result Content Object that includes a listing of matching Content Objects and their reputation information. The CCN node can also process Interests issued by other CCN nodes that would like to discover content. After receiving an Interest comprising a query for discovering content, the CCN node searches a repository for a set of Content Objects that match the query. The CCN node generates a results list that includes the Content Objects in the search results and their reputation information. The CCN node then generates and returns a query-result Content Object that includes the Interest's name, and whose payload includes the results list. | 03-24-2016 |
20160085866 | SYSTEM FOR PERFORMING TEXT MINING AND TOPIC DISCOVERY FOR ANTICIPATORY IDENTIFICATION OF EMERGING TRENDS - This disclosure describes a system that facilitates analyzing a broad and continuously updated sample of recent written communications for the purpose of identifying emerging and important news and topic developments before they attract broad attention. The system continuously discovers topics and relationships between topics in the written communications as the communications are received. Individual topics and topic relationships are continuously analyzed for the purpose of detecting changes in the frequency and manner in which the topics are addressed and changes in the content that accompanies the topics. The system uses this analysis as the basis for identifying certain topics and the communications that address them as emerging, and therefore, important. The system uses a display feature to highlight these topics and communications for a user to review or investigate further. | 03-24-2016 |
20160098407 | Systems and Methods for Identifying Documents Based on Citation History - Systems, methods, and computer-executable instructions described herein generally relate to increasing user productivity in reviewing query results by surfacing a set of documents ranked by their relative value calculated as a tabulation of how often they are cited for a specific purpose. A database management system can access an index of metadata corresponding to a set of content items in a corpus/corpora of electronically stored content. A sub-system is configured to receive a query request entered by a user in said interactive GUI. A computer machine is configured to receive said query as computer machine input; automatically determine at least one concept contained within said query; automatically normalize said at least one concept contained within said query thus creating at least one normalized concept; automatically compare said at least one normalized concept to a set of metadata comprising at least one document centric concept profile associated with said set of content items in said corpora; and automatically surface a set of documents, comprising at least one first document, matching said document centric concept profile via said GUI wherein said set of documents are ranked according to a reference value assigned to each document for a normalized concept. | 04-07-2016 |
20160103837 | SYSTEM FOR, AND METHOD OF, RANKING SEARCH RESULTS OBTAINED BY SEARCHING A BODY OF DATA RECORDS - A weighting processor and a method for ranking search results obtained by searching a body of data records. The ranking is carried out in relation to at least one selected search term contained in a taxonomy in which search terms have associated metadata which, for each search term, identifies a category and includes any measure of relatedness to at least one different search term in the same category, the measure being based on co-occurrences of the search terms in individual ones of a plurality of data records. The search results identify data records containing one or more search terms from the taxonomy and the results are ranked by summing, for each data record of the results, the measures of relatedness of search terms present in the data record to the selected search term(s). | 04-14-2016 |
20160117324 | RANKING LABELED INSTANCES EXTRACTED FROM TEXT - Technologies for development of IsA repositories are described that can be applied to the interpretation of text by computing devices in a variety of settings. The use of features other than those computed over an underlying document collection, such as popularity in search queries of the terms in class labels, is described, for the purpose of determining, or improving, the relative ranking of various class labels, given a class instance. | 04-28-2016 |
20160117360 | Contextual Search Disambiguation - Methods, systems, computer-readable media, and apparatuses for providing search disambiguation using contextual information and domain ontologies are presented. In some embodiments, a computing device may receive a natural language input from a user. The computing device may identify a plurality of hypotheses for the natural language input. The computing device may map the plurality of hypotheses to one or more concepts of a plurality of concepts of an ontology by annotating the one or more concepts. The ontology may include the plurality of concepts respectively connected by a plurality of relations. The computing device may determine that there is an imperfect match between the annotated one or more concepts and annotations of answers. In response, the computing device may disambiguate the annotated one or more concepts using the ontology. The computing device may present output to the user based on the disambiguation. | 04-28-2016 |
20160125038 | SYSTEMS AND METHODS FOR ENTERPRISE DATA SEARCH AND ANALYSIS - A system and method for enterprise searching of documents. The system comprises a computing system configured to receive one or more search terms, and responsively analyze a group of documents to return analysis results. A method for enterprise searching includes indexing the group of documents, determining relevant terms and measuring the context between terms. Relevant portions of documents, also called passages of interest, are determined as part of the analysis process. The analysis includes analyzing the passages of interest for words, repeating term sequences, non-consecutive repeating root term sequences, and non-word terms. The terms/sequences are scored and sorted, resulting in a set of high-importance items, allowing a user to quickly subselect search results without reading through the results. | 05-05-2016 |
20160132563 | METHOD AND DEVICE FOR PROVIDING CONTENT RECOMMENDING INFORMATION TO DEVICES - A method and a device for providing content recommending information are provided. The method includes receiving first log information of an external device, generating content recommending information based on the first log information and second log information of the first device, and displaying the content recommending information. | 05-12-2016 |
20160147768 | DEVICE AND METHOD FOR PROVIDING MEDIA RESOURCE - Provided is a display device for providing a media resource. The display device includes a communicator and a controller. The communicator collects background media resource database (DB) information. The controller extracts text information from each of media resources included in a background media resource DB, acquires one or more feature words based on the extracted text information, generates a feature word weight matrix of the background media resource DB which includes a respective weight of each acquired feature word, calculates a clustering similarity between each media resource included in the background media resource DB and a current media resource, which is being watched by a user, by using the feature word weight matrix, and provides a media resource recommendation list which includes one or more media resources based on the clustering similarity. | 05-26-2016 |
20160154797 | Keyword Frequency Analysis System | 06-02-2016 |
20160154798 | METHOD OF AUTOMATICALLY CONSTRUCTING CONTENT FOR WEB SITES | 06-02-2016 |
20160154805 | ESTIMATING MOST FREQUENT VALUES FOR A DATA SET | 06-02-2016 |
20160170985 | IDENTIFYING A THUMBNAIL IMAGE TO REPRESENT A VIDEO | 06-16-2016 |
20160171090 | Systems and Methods for Collaborative Project Analysis | 06-16-2016 |
20160179967 | SEARCHING FOR IDEOGRAMS IN AN ONLINE SOCIAL NETWORK | 06-23-2016 |
20160188723 | CLOUD WEBSITE RECOMMENDATION METHOD AND SYSTEM BASED ON TERMINAL ACCESS STATISTICS, AND RELATED DEVICE - The present invention discloses a method and system for recommending cloud websites based on terminal access statistics, wherein, the method mainly comprises: storing websites accessed by a terminal, and sorting the websites according to the accessed frequency by the terminal; capturing corresponding website descriptive information according to a website sorting result; storing the websites and corresponding descriptive information to a cloud storage medium; and when receiving a query request, querying the stored websites and corresponding website descriptive information according to the query keywords in the query request, returning a query result, and returning one or more websites whose frequency ranks on the top in the query result as recommended items. The present invention calculates the popularity of websites and sorts the websites based on the statistics about access behaviors, thereby significantly improving the quality and relevance of the recommended result. | 06-30-2016 |
20160196343 | AUDIO MATCHING BASED ON HARMONOGRAM | 07-07-2016 |
20160378759 | ACCOUNT ROUTING TO USER ACCOUNT SETS - New account routing to user account sets is described. A system creates multiple accounts profiles corresponding to multiple sets of accounts, based on multiple attributes associated with each account of the multiple sets of accounts. The system calculates multiple account scores for an account based on comparing multiple attributes associated with the account against the corresponding multiple accounts profiles, wherein the account is not in the multiple sets of accounts. The system identifies a highest account score of the multiple account scores. The system routes the account to a user associated with a set of accounts corresponding to the highest account score. | 12-29-2016 |
20160378763 | METHOD AND SYSTEM FOR ASSIGNING PUBLISHED SUBJECTS - A 2- or 3-dimensional buzzword map that arranges buzzwords on the map depending on the frequency of combined appearance with other buzzwords on the map, measured in certain contexts, is provided in which each of the buzzwords is assigned to a 2- or 3-dimensional element which is arranged at a defined location in the 2- or 3-dimensional map. Respective positions of the elements or positional relations of the elements to each other reflect a relation between the contents of the respective buzzwords. The plurality of the elements associated to the pre-defined plurality of buzzwords is displayed on a display screen as 2- or 3-dimensional images, with the elements having a pre-defined extension in each of the dimensions of the map. The elements can be shaped as an ellipse, circle, rectangle, or square in a 2-dimensional map or as an ellipsoid, sphere, brick, or cube in a 3-dimensional map. | 12-29-2016 |
20160378766 | SYSTEM AND METHOD FOR PATENT AND PRIOR ART ANALYSIS - Various embodiments of the present disclosure include systems and methods for analyzing patents and prior art in a patent management system. In an example embodiment, a computer-implemented method of determining a potential point of novelty for an identified patent comprises retrieving at least one independent claim of the patent in the claim form as issued; retrieving at least one independent claim of the patent in the claim form as published; and automatically comparing the issued claim to the published claim by identifying unique keywords present in the issued claim but not present in the published claim and flagging the unique keywords to a user. | 12-29-2016 |
20160378769 | PRELIMINARY RANKER FOR SCORING MATCHING DOCUMENTS - The technology described herein provides for preliminary ranking of matching documents for a search query. A preliminary ranker uses score tables for scoring each matching document based on its relevance to a search query. The score table for a document stores pre-computed data used to derive a frequency of terms and other information in the document. The preliminary ranker uses the score table for each matching document and the terms from the search query to determine a score for each matching document. The lowest scoring documents are removed from further consideration by a final ranker. | 12-29-2016 |
20170235726 | INFORMATION IDENTIFICATION AND EXTRACTION | 08-17-2017 |
20170235736 | SYSTEM AND METHOD FOR CONFIDENTIALITY-PRESERVING RANK-ORDERED SEARCH | 08-17-2017 |
20170235836 | INFORMATION IDENTIFICATION AND EXTRACTION | 08-17-2017 |
20180024998 | INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND PROGRAM | 01-25-2018 |
20180025364 | INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND PROGRAM | 01-25-2018 |
20190146976 | CONTINUOUS EVALUATION AND ADJUSTMENT OF SEARCH ENGINE RESULTS | 05-16-2019 |
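
Several entries above, for example 20160070707 (KEYWORD SEARCH ON DATABASES), describe searching an inverted index for documents that contain at least one query keyword and ranking the hits in descending order of relevancy. The following is a minimal illustrative sketch of that general idea, not the method claimed in any listed application. The function names, the whitespace tokenization, and the use of "number of distinct query keywords matched" as the relevancy measure are all assumptions made for illustration.

```python
from collections import defaultdict


def build_inverted_index(documents):
    """Map each lowercased term to the set of document IDs containing it.

    `documents` is a dict of {doc_id: text}. Whitespace tokenization is an
    assumption; the abstract does not specify how terms are extracted.
    """
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def keyword_search(index, query):
    """Return IDs of documents containing at least one query keyword.

    Results are sorted in descending order of a hypothetical relevancy
    score: the count of distinct query keywords present in the document.
    Ties are broken by document ID so the output is deterministic.
    """
    hits = defaultdict(int)
    for keyword in set(query.lower().split()):
        for doc_id in index.get(keyword, ()):
            hits[doc_id] += 1
    return sorted(hits, key=lambda doc_id: (-hits[doc_id], doc_id))


# Example data (made up for illustration only).
documents = {
    1: "keyword search on databases",
    2: "ranking search results",
    3: "audio matching",
}
index = build_inverted_index(documents)
print(keyword_search(index, "keyword search"))
```

Here document 1 matches both query keywords and document 2 matches one, so document 1 is listed first; document 3 matches neither and is omitted.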