Patent application number | Description | Published |
20110022597 | System And Method For Thematically Grouping Documents Into Clusters - A system and method for thematically grouping documents into clusters is provided. Concepts are extracted from a plurality of documents. The concepts include nouns or noun phrases. A number of occurrences for each concept are determined within each document. A bounded range is applied to the concepts and a subset of the concepts is selected by removing the concepts that fall outside the bounded range. The bounded range includes upper edge conditions and lower edge conditions. Themes are generated from the subset of concepts by identifying two or more concepts with common semantic meaning. Clusters of the documents are generated based on the themes. | 01-27-2011 |
20110221774 | System And Method For Reorienting A Display Of Clusters - A system and method for reorienting a display of clusters is provided. Clusters are maintained within a display. Each cluster includes a center located at a distance relative to a common origin for the display. A location of each cluster is compared to each other cluster. Two or more clusters that overlap are identified. At least one of the overlapping clusters is reoriented until no overlap occurs. | 09-15-2011 |
20110320453 | System And Method For Grouping Similar Documents - A system and method for grouping similar documents is provided. Frequencies of occurrences are determined for terms and noun phrases within a set of documents. A subset of the documents is selected by removing those documents having terms and noun phrases that fall outside a bounded range of upper and lower conditions for frequency of occurrence. Each of the documents in the subset is mapped to a cluster of documents based on a similarity of the documents to the cluster documents. | 12-29-2011 |
20130159300 | Computer-Implemented System and Method for Clustering Similar Documents - A computer-implemented system and method for clustering similar documents is provided. Concepts are identified for a set of documents and occurrence frequencies are determined for each concept in the documents set. A distance quantifying a similarity for each of the documents in the set with one or more clusters of documents is calculated. Each document is mapped to at least one of the one or more document clusters. | 06-20-2013 |
20130212098 | Computer-Implemented System And Method For Generating A Display Of Document Clusters - A computer-implemented system and method for generating a display of document clusters is described. Clusters of documents are presented in a multi-dimensional concept space. At least one document is selected from a collection of documents to be clusters. An angle θ of the document relative to a common origin of the multi-dimensional concept space is computed. The selected document is compared with each of the clusters. An angle σ from the common origin is determined for each cluster. A difference between the angle θ for the document and the angle σ for the cluster is determined. The difference is compared to the variance, and a new cluster is created when the difference exceeds the variance for all the clusters. | 08-15-2013 |
20140156664 | Computer-Implemented System And Method For Populating Clusters Of Documents - A computer-implemented system and method for populating clusters of documents is provided. A set of clusters is placed in a display in relation to a common origin. One of a plurality of unclustered documents in the display is selected and an angle θ of the document from the common origin is determined. An angle σ of the cluster relative to the common origin is computed for each cluster. A difference is determined between the document angle θ and one such cluster angle σ. A predetermined variance is applied to the difference. The document is placed into the cluster when the difference is less than the variance. | 06-05-2014 |
20140176556 | Computer-Implemented System and Method For Correcting A Rendering Of Clusters - A computer-implemented system and method for correcting a rendering of clusters is provided. A pair of clusters is selected within a representation. A span between centers of the clusters is determined. Radii for each of the clusters in the pair is identified. The radii are summed and at least one of the clusters in the pair is moved within the representation when the span exceeds the sum. | 06-26-2014 |
20140250087 | Computer-Implemented System And Method For Identifying Relevant Documents For Display - A computer-implemented system and method for identifying relevant documents for display are provided. Themes for a set of documents are generated. The documents are clustered based on the themes. A matrix including an inner product of document frequency occurrences and cluster concept weightings for each theme is generated for the documents. From the matrix, documents most relevant to a particular theme are identified, and the relevant documents are displayed. | 09-04-2014 |