Patent application number | Description | Published |
20120203777 | Finding and Disambiguating References to Entities on Web Pages - A system and method for disambiguating references to entities in a document. In one embodiment, an iterative process is used to disambiguate references to entities in documents. An initial model is used to identify documents referring to an entity based on features contained in those documents. The occurrence of various features in these documents is measured. From the number of occurrences of features in these documents, a second model is constructed. The second model is used to identify documents referring to the entity based on features contained in the documents. The process can be repeated, iteratively identifying documents referring to the entity and improving subsequent models based on those identifications. Additional features of the entity can be extracted from documents identified as referring to the entity. | 08-09-2012 |
20140289177 | FINDING AND DISAMBIGUATING REFERENCES TO ENTITIES ON WEB PAGES - A system and method for disambiguating references to entities in a document. In one embodiment, an iterative process is used to disambiguate references to entities in documents. An initial model is used to identify documents referring to an entity based on features contained in those documents. The occurrence of various features in these documents is measured. From the number of occurrences of features in these documents, a second model is constructed. The second model is used to identify documents referring to the entity based on features contained in the documents. The process can be repeated, iteratively identifying documents referring to the entity and improving subsequent models based on those identifications. Additional features of the entity can be extracted from documents identified as referring to the entity. | 09-25-2014 |
20140379743 | FINDING AND DISAMBIGUATING REFERENCES TO ENTITIES ON WEB PAGES - A system and method for disambiguating references to entities in a document. In one embodiment, an iterative process is used to disambiguate references to entities in documents. An initial model is used to identify documents referring to an entity based on features contained in those documents. The occurrence of various features in these documents is measured. From the number of occurrences of features in these documents, a second model is constructed. The second model is used to identify documents referring to the entity based on features contained in the documents. The process can be repeated, iteratively identifying documents referring to the entity and improving subsequent models based on those identifications. Additional features of the entity can be extracted from documents identified as referring to the entity. | 12-25-2014 |
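
The iterative process described in the three abstracts above can be illustrated with a small sketch. It is not taken from the applications: the representation of a document as a set of feature strings, the scoring rule (sum of feature weights), the weighting rule (fraction of matched documents containing a feature), the threshold, and all function names are assumptions chosen only to make the loop concrete.

```python
from collections import Counter


def score(doc_features, model):
    """Sum the model weights of the features a document contains."""
    return sum(model.get(feature, 0.0) for feature in doc_features)


def build_model(docs, matched_ids):
    """Weight each feature by the fraction of matched documents containing it.

    This weighting rule is an assumption; the abstracts only say that the
    next model is built from feature occurrence counts.
    """
    counts = Counter()
    for doc_id in matched_ids:
        counts.update(docs[doc_id])  # each feature counted once per document
    n = len(matched_ids)
    return {feature: c / n for feature, c in counts.items()} if n else {}


def disambiguate(docs, initial_model, threshold=1.0, max_rounds=5):
    """Alternate between identifying documents and rebuilding the model."""
    model = dict(initial_model)
    matched = set()
    for _ in range(max_rounds):
        new_matched = {
            doc_id for doc_id, features in docs.items()
            if score(features, model) >= threshold
        }
        if new_matched == matched:  # no change, so stop iterating
            break
        matched = new_matched
        model = build_model(docs, matched)
    return matched, model


# Hypothetical toy corpus: document id -> set of features found in it.
docs = {
    "a": {"michael jackson", "thriller", "singer"},
    "b": {"michael jackson", "singer", "moonwalk"},
    "c": {"michael jackson", "beer", "whisky"},
    "d": {"moonwalk", "thriller", "pop"},
}
matched, model = disambiguate(docs, {"thriller": 1.0})
print(sorted(matched))
```

In this toy run the initial model only knows the feature "thriller", so the first pass matches documents "a" and "d". The rebuilt model then gives weight to co-occurring features such as "singer" and "moonwalk", which brings in document "b" on the second pass, while document "c" (a different person with the same name) stays below the threshold.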
Patent application number | Description | Published |
20090125371 | Domain-Specific Sentiment Classification - A domain-specific sentiment classifier that can be used to score the polarity and magnitude of sentiment expressed by domain-specific documents is created. A domain-independent sentiment lexicon is established and a classifier uses the lexicon to score sentiment of domain-specific documents. Sets of high-sentiment documents having positive and negative polarities are identified. The n-grams within the high-sentiment documents are filtered to remove extremely common n-grams. The filtered n-grams are saved as a domain-specific sentiment lexicon and are used as features in a model. The model is trained using a set of training documents which may be manually or automatically labeled as to their overall sentiment to produce sentiment scores for the n-grams in the domain-specific sentiment lexicon. This lexicon is used by the domain-specific sentiment classifier. | 05-14-2009 |
20090193011 | Phrase Based Snippet Generation - Disclosed herein is a method, a system and a computer product for generating a snippet for an entity, wherein each snippet comprises a plurality of sentiments about the entity. One or more textual reviews associated with the entity are selected. A plurality of sentiment phrases are identified based on the one or more textual reviews, wherein each sentiment phrase comprises a sentiment about the entity. One or more sentiment phrases from the plurality of sentiment phrases are selected to generate a snippet. | 07-30-2009 |
20100185653 | POPULATING A STRUCTURED PRESENTATION WITH NEW VALUES - Methods, systems, and apparatus, including computer programs stored on computer storage media, for populating a structured presentation with new values. One aspect can be embodied in machine-implemented methods that include the actions of receiving description data describing a preexisting structured presentation, comparing characteristics of the preexisting structured presentation with content of electronic documents in an unstructured collection of electronic documents to locate electronic documents that identify a new value that is relevant to the preexisting structured presentation, adding the new value to the preexisting structured presentation to form a new structured presentation, and outputting instructions for visually presenting the new structured presentation. | 07-22-2010 |
20100185654 | ADDING NEW INSTANCES TO A STRUCTURED PRESENTATION - Methods, systems, and apparatus, including computer programs stored on computer storage media, for retrieval and display of information from an electronic document collection. In one aspect, a machine-implemented method includes the actions of receiving description data describing a preexisting structured presentation, comparing characteristics of the preexisting structured presentation with content of electronic documents in an unstructured collection of electronic documents to locate electronic documents that identify a new instance that is relevant to the preexisting structured presentation, adding an identifier of the new instance to the preexisting structured presentation to form an expanded structured presentation, and outputting instructions for presenting the expanded structured presentation. | 07-22-2010 |
20100185934 | ADDING NEW ATTRIBUTES TO A STRUCTURED PRESENTATION - Methods, systems, and apparatus, including computer programs stored on computer storage media, for retrieval and display of information from an electronic document collection. One aspect can be embodied in machine-implemented methods that include the actions of receiving description data describing a preexisting structured presentation, comparing characteristics of the preexisting structured presentation with content of electronic documents in an unstructured collection of electronic documents to locate electronic documents that identify a new attribute that is relevant to the preexisting structured presentation, adding an identifier of the new attribute to the preexisting structured presentation to form an expanded structured presentation, and outputting instructions for presenting the expanded structured presentation. | 07-22-2010 |
20100281034 | Query-Independent Entity Importance in Books - A corpus contains information including text from books and metadata about the books. The book texts mention entities of various types, such as location, date, event, and person entities. An entity importance engine analyzes the information in the corpus to identify the entities mentioned therein, and ranks the entities using query-independent importance scores. The importance scores can be based in part on the contexts in which the entities are mentioned by the books. A presentation engine enables searching of the corpus to identify books satisfying a search query. The presentation engine presents information about a designated book. Entities mentioned in the book are presented in a style selected to enhance comprehension and utility. For example, location entities can be presented on a map with markers showing the locations of the entities. The number of entities presented is limited, and the entities are ordered based on the query-independent importance scores. | 11-04-2010 |
20120131021 | Phrase Based Snippet Generation - Disclosed herein is a method, a system and a computer product for generating a snippet for an entity, wherein each snippet comprises a plurality of sentiments about the entity. One or more textual reviews associated with the entity are selected. A plurality of sentiment phrases are identified based on the one or more textual reviews, wherein each sentiment phrase comprises a sentiment about the entity. One or more sentiment phrases from the plurality of sentiment phrases are selected to generate a snippet. | 05-24-2012 |
20130104026 | SYSTEMS AND METHODS FOR WEB SITE CUSTOMIZATION BASED ON TIME-OF-DAY - Systems and methods are provided for delivering customized versions of web pages to users. In one implementation, a method is provided for customizing a delivered version of a web page to reflect a current time-of-day at a geographic location of the user. According to the method, a request for a web page is received from a client device of the user. The request for the web page includes an IP address of the client device. Based on the IP address, a current time is determined for the received request. Thereafter, a version of the requested web page corresponding to the current time is generated, and the generated version of the requested web page is delivered to the client device. | 04-25-2013 |
20140320499 | DATA OBJECT VISUALIZATION USING GRAPHS - A presentation engine presents facts in a UI having a graph. Through manipulation of the UI, an end-user can add or remove facts from the graph, and can sort the facts shown in the graph based on values that are shown, or not shown, in the graph. The presentation engine determines a graph type that best facilitates interpretation of the facts by the end-user and automatically creates a graph of that type. Possible graph types include bar graphs, scatter plots, timelines, and maps. | 10-30-2014 |
20140359409 | Learning Synonymous Object Names from Anchor Texts - A repository contains objects representing entities. The objects also include facts about the represented entities. The facts are derived from source documents. A synonymous name of an object is determined by identifying a source document from which one or more facts of the entity represented by the object were derived, identifying a plurality of linking documents that link to the source document through hyperlinks, each hyperlink having an anchor text, processing the anchor texts in the plurality of linking documents to generate a collection of synonym candidates for the entity represented by the object, and selecting a synonymous name for the entity represented by the object from the collection of synonym candidates. | 12-04-2014 |
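
The anchor-text procedure in the last abstract above (application 20140359409) can be sketched as follows. This is only an illustration under stated assumptions: hyperlinks are given as already-extracted `(target_url, anchor_text)` pairs, candidates are normalized by lowercasing and collapsing whitespace, a small stoplist of generic anchors is filtered out, and the most frequent remaining candidate that is not an already known name is selected. The abstract does not specify any of these rules, and the names `GENERIC_ANCHORS`, `synonym_candidates`, and `select_synonym` are hypothetical.

```python
from collections import Counter

# Assumed stoplist of anchors that say nothing about the linked entity.
GENERIC_ANCHORS = {"click here", "here", "link", "read more"}


def synonym_candidates(hyperlinks, source_url):
    """Count normalized anchor texts of hyperlinks pointing at source_url."""
    counts = Counter()
    for target, anchor in hyperlinks:
        text = " ".join(anchor.lower().split())  # lowercase, collapse spaces
        if target == source_url and text and text not in GENERIC_ANCHORS:
            counts[text] += 1
    return counts


def select_synonym(candidates, known_names):
    """Return the most frequent candidate that is not already a known name."""
    known = {name.lower() for name in known_names}
    for text, _count in candidates.most_common():
        if text not in known:
            return text
    return None


# Hypothetical hyperlinks collected from linking documents.
source = "http://example.com/ibm"
links = [
    (source, "IBM"),
    (source, "Big Blue"),
    (source, "big  blue"),
    (source, "click here"),
    ("http://example.com/other", "Big Blue"),
    (source, "International Business Machines"),
]
print(select_synonym(synonym_candidates(links, source), ["IBM"]))
```

Frequency is used here as the selection signal only because it is the simplest choice; a fuller system could weigh the linking documents differently or return several synonyms rather than one.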