Patent application number | Description | Published |
20080263021 | Methods of object search and recognition - The proposed technical solution allows processing of machine-readable forms of unfixed format. An auxiliary brief description may be optionally specified to determine the spatial orientation of the image. A method of searching for elements of a document comprises the following main operations in addition to the operations of preliminary image processing: selecting the varieties of structural description from several available variants, determining the orientation of the image, selecting the text objects, where the text must be recognized, and determining the minimal required volume of recognition, recognizing the text objects, searching for elements of the form. Searching for elements of the form comprises the following actions: selecting a searched element in the structural description, gaining the algorithm of search constraints from the structural description, searching for the element, testing the obtained variants. | 10-23-2008 |
20090070099 | METHOD FOR TRANSLATING DOCUMENTS FROM ONE LANGUAGE INTO ANOTHER USING A DATABASE OF TRANSLATIONS, A TERMINOLOGY DICTIONARY, A TRANSLATION DICTIONARY, AND A MACHINE TRANSLATION SYSTEM - In one embodiment, the invention provides a method for translating a document in an input language into an output language comprising: a) for each document fragment for which a translation is readily available, translating said document fragment based on said readily available translation; and b) for each remaining untranslated fragment for which a translation is not readily available, translating said untranslated fragment based on a model-based machine translation technique. A translation is readily available if a search reveals at least one matching translation for the document fragment in a translation database. | 03-12-2009 |
20090132477 | Methods of object search and recognition. - The proposed technical solution allows processing of machine-readable forms of unfixed format. An auxiliary brief description may be optionally specified to determine the spatial orientation of the image. A method of searching for elements of a document comprises the following main operations in addition to the operations of preliminary image processing: selecting the varieties of structural description from several available variants, determining the orientation of the image, selecting the text objects, where the text must be recognized, and determining the minimal required volume of recognition, recognizing the text objects, searching for elements of the form. Searching for elements of the form comprises the following actions: selecting a searched element in the structural description, gaining the algorithm of search constraints from the structural description, searching for the element, testing the obtained variants. | 05-21-2009 |
20090175532 | Method and System for Creating Flexible Structure Descriptions - In one embodiment, the invention provides a method, comprising detecting data fields on a scanned image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training the flexible document description using a search algorithm to detect the data fields on additional training images based on the set of search elements. | 07-09-2009 |
20110013806 | Methods of object search and recognition - Embodiments of the invention disclose techniques for processing of machine-readable forms of unfixed or flexible format. An auxiliary brief description may be optionally specified to determine the spatial orientation of the image. A method of searching for elements of a document comprises the following main operations in addition to the operations of preliminary image processing: selecting the varieties of structural description from several available variants, determining the orientation of the image, selecting the text objects, where the text must be recognized, and determining the minimal required volume of recognition, recognizing the text objects, searching for elements of the form. Searching for elements of the form comprises the following actions: selecting a searched element in the structural description, gaining the algorithm of search constraints from the structural description, searching for the element, testing the obtained variants. | 01-20-2011 |
20110091109 | METHOD OF PRE-ANALYSIS OF A MACHINE-READABLE FORM IMAGE - In one embodiment, the invention provides a method for a machine to perform machine-readable form pre-recognition analysis. The method comprises preliminarily assigning at least one graphic image in a form for identification of form type, preliminarily creating at least one model of the said graphic image for identification of the form type, parsing a form image into regions, determining an image form type for the form image, comprising: (a) detecting on the form image at least one of said graphic images for identification of the form type, (b) performing a primary identification of the form image type based on a comparison of the detected graphic image with the said model, and(c) performing a profound analysis using a supplementary data said-primary identification results in multiple possibilities for the form image type. | 04-21-2011 |
20110257963 | METHOD AND SYSTEM FOR SEMANTIC SEARCHING - A method comprising a preliminary automated analysis of at least one corpus of natural language text is disclosed. For each sentence of a corpus, the method includes performing a syntactic analysis using linguistic descriptions to generate at least one syntactic structure for the sentence, building a semantic structure for the sentence, associating each generated syntactic and semantic structure with the sentence, and saving each structure. For each corpus text that was preliminary analyzed, performing an indexing operation to index lexical meanings and values of linguistic parameters of each syntactic structure and each semantic structure associated with sentences in the corpus text. A semantic search includes at least one automatic preliminary analyzed corpus of sentences comprising searched values of linguistic, syntactic and semantic parameters. Due to a deep semantic analysis of a corpus, the search may be executed in various languages, in resources of various languages, and in the text of corpora of various languages regardless of the language of the query. | 10-20-2011 |
20110270607 | METHOD AND SYSTEM FOR SEMANTIC SEARCHING OF NATURAL LANGUAGE TEXTS - A method and system comprising an automated analysis of at least one corpus of natural language text is disclosed. For each sentence of a corpus, the analysis includes performing a syntactic analysis using linguistic descriptions to generate at least one syntactic structure for the sentence, building a semantic structure for the sentence, associating each generated syntactic and semantic structure with the sentence, and saving each generated syntactic and semantic structure. For each corpus text that was preliminary analyzed, performing an indexing operation to index lexical meanings and values of linguistic parameters of each syntactic structure and each semantic structure associated with sentences in the corpus text. A semantic search as disclosed herein includes at least one automatic preliminary analyzed corpus of sentences comprising searched values of linguistic, syntactic and semantic parameters. Due to deep semantic analysis of one or more corpora, the search may be executed in various languages, in resources of various languages, and in text corpora of various languages, regardless of the language of the query. | 11-03-2011 |
20120010872 | Method and System for Semantic Searching - In one embodiment, there is provided a computer-implemented method and system for implementing the method. The method comprises: preliminarily analyzing at least one corpus of natural language text comprising for each sentence of each natural language text of the corpus, performing syntactic analysis using linguistic descriptions to generate at least one syntactic structure for the sentence; building a semantic structure for the sentence; associating each generated syntactic and semantic structure with the sentence; and saving each generated syntactic and semantic structure; for each corpus of natural language text that was preliminarily analyzed, performing an indexing operation to index lexical meanings and values of linguistic parameters of each syntactic structure and each semantic structure associated with sentences in the corpus; and searching in at least one preliminarily analyzed corpora for sentences comprising searched values for the linguistic parameters. | 01-12-2012 |
20120011434 | Method for Object Recognition and Describing Structure of Graphical objects - The invention involves a method for processing of machine-readable forms or documents of non-fixed format. The method makes use of, for example, a structural description of characteristics of document elements, a description of a logical structure of the document, and methods of searching for document elements by using the structural description. A structural description of the spatial and parametric characteristics of document elements and the logical connections between elements may include a hierarchical logical structure of the elements, specification of an algorithm of determining the search constraints, specification of characteristics of every searched element, and specification of a set of parameters for a compound element identified on the basis of the aggregate of its components. The method of describing the logical structure of a document and methods of searching for elements of a document may be based on the use of the structural description. | 01-12-2012 |
20120109640 | METHOD AND SYSTEM FOR ANALYZING AND TRANSLATING VARIOUS LANGUAGES WITH USE OF SEMANTIC HIERARCHY - A method and computer system for analyzing sentences of various languages and constructing a language-independent semantic structure are provided. On the basis of comprehensive knowledge about languages and semantics, exhaustive linguistic descriptions are created, and lexical, morphological, syntactic, and semantic analyses for one or more sentences of a natural or artificial language are performed. A computer system is also provided to implement, analyze and store various linguistic structures and to perform lexical, morphological, syntactic, and semantic analyses. As result, a generalized data structure, such as a semantic structure, is generated and used to describe the meaning of one or more sentences in language-independent form, applicable to automated abstracting, machine translation, control systems, Internet information retrieval, etc. | 05-03-2012 |
20120123766 | Indicating and Correcting Errors in Machine Translation Systems - The preferred embodiments provide an automated machine translation from one language to another. The source language may contain expressions or words that are not readily handled by the translation system. Such problematic words or word combinations may, for example, include the words not found in the dictionary of the translation system, as well as text fragments corresponding to structures with low ratings. To improve translation quality, such potentially erroneous words or questionable word combinations are identified by the translation system and displayed to a user by distinctive display styles in the display of a document in the source language and in its translation to a target language. A user is provided with a capability to correct erroneous or questionable words so as to improve the quality of translation. | 05-17-2012 |
20120173224 | Deep Model Statistics Method for Machine Translation - In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options. | 07-05-2012 |
20120201420 | Object Recognition and Describing Structure of Graphical Objects - Methods for processing machine-readable forms or documents of non-fixed format are disclosed. The methods make use of, for example, a structural description of characteristics of document elements, a description of a logical structure of the document, and methods of searching for document elements by using the structural description. A structural description of the spatial and parametric characteristics of document elements and the logical connections between elements may include a hierarchical logical structure of the elements, specification of an algorithm of determining the search constraints, specification of characteristics of searched elements, and specification of a set of parameters for a compound element identified on the basis of the aggregate of its components. The method of describing the logical structure of a document and methods of searching for elements of a document may be based on the use of the structural description. | 08-09-2012 |
20120232883 | METHOD AND SYSTEM FOR TRANSLATING SENTENCES BETWEEN LANGAUGES - A method and computer system for translating sentences between languages from an intermediate language-independent semantic representation is provided. On the basis of comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, to build syntactic structures and language independent semantic structures and representations, and to synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and to perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language. The method and computer system can be applied to in automated abstracting, machine translation, natural language processing, control systems, Internet information retrieval, etc. | 09-13-2012 |
20120239378 | Methods and Systems for Alignment of Parallel Text Corpora - Computer-implemented systems and methods align fragments of a first text with corresponding fragments of a second text, which is a translation of the first text. One preferred embodiment preliminarily divides the first and second texts into fragments; generates a hypothesis about the correspondence between the fragments of the first and second texts; performs a lexico-morphological analysis of the fragments using linguistic descriptions; performs a syntactic analysis of the fragments using linguistic descriptions and generates syntactic structures for the fragments; generates semantic structures for the fragments; and estimates the degree of correspondence between the semantic structures. | 09-20-2012 |
20120259621 | Translating Texts Between Languages - Methods and computer systems for translating sentences between languages from an intermediate language-independent semantic representation are provided. Based on a comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, build syntactic structures and language independent semantic structures and representations, and synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language. The methods and systems can be applied to automated abstracting, machine translation, natural language processing, control systems, Internet information retrieval, etc. | 10-11-2012 |
20120271627 | CROSS-LANGUAGE TEXT CLASSIFICATION - Methods are described for performing classification (categorization) of text documents written in various languages. Language-independent semantic structures are constructed before classifying documents. These structures reflect lexical, morphological, syntactic, and semantic properties of documents. The methods suggested are able to perform cross-language text classification which is based on document properties reflecting their meaning. The methods are applicable to genre classification, topic detection, news analysis, authorship analysis, etc. | 10-25-2012 |
20130024180 | Deep Model Statistics Method for Machine Translation - In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options. | 01-24-2013 |
20130024186 | Deep Model Statistics Method for Machine Translation - In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options. | 01-24-2013 |
20130041652 | CROSS-LANGUAGE TEXT CLUSTERING - Methods are described for performing clustering or classification of texts of different languages. Language-independent semantic structures (LISS) are constructed before clustering is performed. These structures reflect lexical, morphological, syntactic, and semantic properties of texts. The methods suggested are able to perform cross-language text clustering which is based on the meaning derived from texts. The methods are applicable to genre classification, topic detection, news analysis, authorship analysis, internet searches, and creating corpora for other tasks, etc. | 02-14-2013 |
20130054612 | Universal Document Similarity - Described herein are methods for finding substantially similar/different sources (files and documents), and estimating similarity or difference between given sources. Similarity and difference may be found across a variety of formats. Sources may be in one or more languages such that similarity and difference may be found across any number and types of languages. A variety of characteristics may be used to arrive at an overall measure of similarity or difference including determining or identifying syntactic roles, semantic roles and semantic classes in reference to sources. | 02-28-2013 |
20130191108 | Translation of a Selected Text Fragment of a Screen - Disclosed is a method for translating text fragments displayed on a screen from an input language into an output language and displaying the result. Translation may use electronic dictionaries, machine translation, natural language processing, control systems, information searches, (e.g., search engine via an Internet protocol), semantic searches, computer-aided learning, and expert systems. For a word combination, appropriate local or network accessible dictionaries are consulted. The disclosed method provides a translation in grammatical agreement in accordance with grammatical rules of the output language in consideration of the context of the text. | 07-25-2013 |
20130191109 | Translating Sentences Between Languages - A method and computer system for translating sentences between languages from an intermediate language-independent semantic representation is provided. On the basis of comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, to build syntactic structures and language independent semantic structures and representations, and to synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and to perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language. The method and computer system can be applied to in automated abstracting, machine translation, natural language processing, control systems, Internet information retrieval, etc. | 07-25-2013 |
20130198615 | Creating Flexible Structure Descriptions - In one embodiment, the invention provides a method, comprising detecting data fields on a scanned document image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training or modifying the flexible document description using, for example, a search algorithm to detect the data fields on additional training images based on the set of search elements. | 08-01-2013 |
20130211816 | Deep Model Statistics Method for Machine Translation - In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options. | 08-15-2013 |
20130211819 | Displaying examples from texts in dictionaries - In one embodiment, the invention provides a method for a system to provide information based on a query, the method comprising: performing a first search of at least one first source for information responsive to the query; providing a result of the search to a user; searching documents using at least a part of the result of the search; providing the user with at least one example of usage of the result of the search obtained from the searching of stored documents; based on user input, performing a second search of at least one second source for information responsive to the query; and providing a result of said second search to the user. The invention provides ways of showing the most relevant examples from parallel text corpora according to a ranking. | 08-15-2013 |
20130322773 | METHODS OF OBJECT SEARCH AND RECOGNITION - Embodiments of the invention disclose techniques for processing of machine-readable forms of unfixed or flexible format. An auxiliary brief description may be optionally specified to determine the spatial orientation of the image. A method of searching for elements of a document comprises the following main operations in addition to the operations of preliminary image processing: selecting the varieties of structural description from several available variants, determining the orientation of the image, selecting the text objects, where the text must be recognized, and determining the minimal required volume of recognition, recognizing the text objects, searching for elements of the form. Searching for elements of the form comprises the following actions: selecting a searched element in the structural description, gaining the algorithm of search constraints from the structural description, searching for the element, testing the obtained variants. | 12-05-2013 |
20140101171 | Similar Document Search - Described herein are methods for finding substantially similar/different sources (files and documents), and estimating similarity or difference between given sources. Similarity and difference may be found across a variety of formats. Sources may be in one or more languages such that similarity and difference may be found across any number and types of languages. A variety of characteristics may be used to arrive at an overall measure of similarity or difference including determining or identifying syntactic roles, semantic roles and semantic classes in reference to sources. | 04-10-2014 |
20140114649 | METHOD AND SYSTEM FOR SEMANTIC SEARCHING - A method and system for facilitating a semantic search based on one or more corpuses of natural language texts are provided. One or more corpuses of natural language texts are received including indexed linguistic parameters and semantic structures of lexical units. The linguistic parameters and semantic structures are generated during a preliminary syntactico-semantic analysis. Searching for text fragments satisfying a query in the one or more corpuses is performed. Relevance of the search results is estimated. | 04-24-2014 |
20140129212 | Universal Difference Measure - Described herein are methods for finding substantially similar/different sources (files and documents), and estimating similarity or difference between given sources. Similarity and difference may be found across a variety of formats. Sources may be in one or more languages such that similarity and difference may be found across any number and types of languages. A variety of characteristics may be used to arrive at an overall measure of similarity or difference including determining or identifying syntactic roles, semantic roles and semantic classes in reference to sources. | 05-08-2014 |
20140257786 | INDICATING AND CORRECTING ERRORS IN MACHINE TRANSLATION SYSTEMS - The preferred embodiments provide an automated machine translation from one language to another. The source language may contain expressions or words that are not readily handled by the translation system. Such problematic words or word combinations may, for example, include the words not found in the dictionary of the translation system, as well as text fragments corresponding to structures with low ratings. To improve translation quality, such potentially erroneous words or questionable word combinations are identified by the translation system and displayed to a user by distinctive display styles in the display of a document in the source language and in its translation to a target language. A user is provided with a capability to correct erroneous or questionable words so as to improve the quality of translation. | 09-11-2014 |
20150057992 | EXHAUSTIVE AUTOMATIC PROCESSING OF TEXTUAL INFORMATION - A system for natural language processing is provided. A first natural language processing program may be constructed using language-independent semantic descriptions, and language-dependent morphological descriptions, lexical descriptions, and syntactic descriptions of one or more target languages. The natural language processing program may include any of machine translation, fact extraction, semantic indexing, semantic search, sentiment analysis, document classification, summarization, big data analysis, or another program. Additional sets of natural language processing programs may be constructed. | 02-26-2015 |
20150220515 | DEEP MODEL STATISTICS METHOD FOR MACHINE TRANSLATION - In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options. | 08-06-2015 |