Class / Patent application number | Description | Number of patent applications / Date published |
382185000 | Ideographic characters (e.g., Japanese or Chinese) | 24 |
20080205761 | Radical Set Determination For HMM Based East Asian Character Recognition - Exemplary techniques are described for selecting radical sets for use in probabilistic East Asian character recognition algorithms. An exemplary technique includes applying a decomposition rule to each East Asian character of the set to generate a progressive splitting graph where the progressive splitting graph comprises radicals as nodes, formulating an optimization problem to find an optimal set of radicals to represent the set of East Asian characters using maximum likelihood and minimum description length and solving the optimization problem for the optimal set of radicals. Another exemplary technique includes selecting an optimal set of radicals by using a general function that characterizes a radical with respect to other East Asian characters and a complex function that characterizes complexity of a radical. | 08-28-2008 |
20080219556 | Radical-Based HMM Modeling for Handwritten East Asian Characters - Exemplary methods, systems, and computer-readable media for developing, training and/or using models for online handwriting recognition of characters are described. An exemplary method for building a trainable radical-based HMM for use in character recognition includes defining radical nodes, where a radical node represents a structural element of an character, and defining connection nodes, where a connection node represents a spatial relationship between two or more radicals. Such a method may include determining a number of paths in the radical-based HMM using subsequence direction histogram vector (SDHV) clustering and determining a number of states in the radical-based HMM using curvature scale space-based (CSS) corner detection. | 09-11-2008 |
20080232689 | Coding systems for Chinese characters and uses thereof - User friendly coding systems are provided for Chinese characters, either complicated or simplified. Each Chinese character is assigned a code based on the shape of the character. In particular, the characters sharing the same beginning strokes are grouped together. The coding systems are useful for searching or sorting Chinese characters, as well as for typing Chinese characters on a computer or word processor. | 09-25-2008 |
20080310724 | Text conversion apparatus capable of relieving inputting load and a method therefor - A text input device receives, in its information input circuit, a letter indicating a destination of transmission as information on the destination of transmission. The text input device stores, in its word-finder with learning function, an input text and an output text in a state correlated with the information on the destination of transmission or its attribute. The text input device in its text learning circuit controls a change in storage caused by correlating an input text matched to a text entered with the information on the destination of transmission or its attribute stored and coincident with the information on the destination of transmission or its attribute entered. When a text matched to the text entered is output, the text input device in its text converter takes out and outputs at least one output text stored. | 12-18-2008 |
20090060338 | Method of indexing Chinese characters - In practicing the present invention, in analyzing a Chinese character, a 3×3 square grid of 9 boxes is superimposed over the character. The character is analyzed based upon the shape of the stroke that is at the lowest elevation within the lower right-hand corner. A Table is consulted consisting of a plurality of elements including horizontal strokes, and the element most closely resembling the corresponding portion of the character is chosen. The user then consults a Root Table where characters all having in common the same part of the character immediately on top are displayed. From examination of the Root Table, the user narrows down the identity of the character to a smaller group. The pages to which the user is directed are carefully reviewed and the entire character may be found in a Form Block including pertinent information concerning the character. When the entire character is found, reference is made to a page in a dictionary where the same character may be found along with its definition and examples of proper use. | 03-05-2009 |
20090060339 | Method of organizing chinese characters - A method of organizing Chinese characters includes the steps of: generating Stroke Set; generating Symbol Set; generating Stroke Code Set; generating a sequential code for each of the Chinese characters to be organized; generating a spatial code for each of the Chinese characters to be organized; generating a character code for each of the Chinese characters to be organized; and organizing said character codes together with related the Chinese characters to be organized such that a Chinese character is adapted to be located by first locating the related character code of the Chinese character, then locating the Chinese character in responsive to the related character code of the Chinese character. | 03-05-2009 |
20090103809 | INPUT METHOD TRANSFORM - Illustrative embodiments provide a computer implemented method, a data processing system and a computer program product for transforming character data input between a first writing system and a second writing system. The computer implemented method comprises receiving character data input of a first writing system and ensuring the character data input contains normalized characters. A predefined transform is selected based on the character data input of the first writing system and output to a second writing system to transform the normalized characters of the first writing system to character data output of the second writing system, and providing the character data output to a display process. | 04-23-2009 |
20090202152 | AREA EXTRACTION PROGRAM, CHARACTER RECOGNITION PROGRAM, AND CHARACTER RECOGNITION DEVICE - An area extraction method including obtaining a character lattice showing a connection relation between unit areas, which are obtained by separating a character string pattern in an image into patterns each recognized as corresponding to a single character, judging whether or not all combinations of each of the unit areas in the obtained character lattice and each of the unit areas in a regular lattice defining a regular connection relation between the unit areas are likely to be established, generating a path coupling between nodes corresponding to the combination of the unit areas which is determined as likely to be established, determining an optimum path from the generated paths based on a degree of coincidence with the regular lattice or the character lattice, and extracting from an image the unit areas in the character lattice corresponding to the determined optimum path. | 08-13-2009 |
20090324082 | CHARACTER AUTO-COMPLETION FOR ONLINE EAST ASIAN HANDWRITING INPUT - An exemplary method includes receiving stroke information for a partially written East Asian character, the East Asian character representable by one or more radicals; based on the stroke information, selecting a radical on a prefix tree wherein the prefix tree branches to East Asian characters as end states; identifying one or more East Asian characters as end states that correspond to the selected radical for the partially written East Asian character; and receiving user input to verify that one of the identified one or more East Asian characters is the end state for the partially written East Asian character. In such a method, the selection of a radical can occur using radical-based hidden Markov models. Various other exemplary methods, devices, systems, etc., are also disclosed. | 12-31-2009 |
20100008583 | APPARATUS, SYSTEM, AND METHOD FOR IDENTIFYING VISUAL STRUCTURES - A “gliding” interface operates in a space of visual perceptions and tries to predict an intended pattern sequence to enable a high-speed recognition of Asian writing with a simple interface. When a search begins, the user is presented with a collection of visual patterns within a box. The user can “zoom in” on a visual pattern by moving the cursor toward it. As the user zooms closer, a new layer of patterns appears within the box. The new layer includes more complex visual patterns than those in the previous layer. The system is mistake-tolerant, with no single cumulation of visual patterns required to achieve a specific visual pattern; rather, a statistical algorithm selects the visual patterns most likely to match the unknown structure. The algorithm is updated as the user searches, tracking every visual pattern that the user traverses while using the interface. A database contains groups of visual patterns that describe each Kanji character. The tracked visual patterns are correlated against the database to determine the visual patterns with the most visual similarity. The visual patterns with the highest correlation are displayed in the new layer. | 01-14-2010 |
20100061635 | IMAGE PROCESSING APPARATUS, IMAGE PROCESSING SYSTEM AND COMPUTER READABLE MEDIUM STORING PROGRAM - An image processing apparatus includes: a recognition unit that recognizes a layout of a line including a character string in an image read from an original; a determination unit that determines a size of a region in which additional information is embedded so as to include at least a part of a line including a character string in the region, based on the layout recognized by the recognition unit; a dividing unit that divides the image read from the original based on the size of the region determined by the determination unit; and an embedding unit that embeds the additional information in the image divided by the dividing unit. | 03-11-2010 |
20100239168 | SEMI-TIED COVARIANCE MODELLING FOR HANDWRITING RECOGNITION - Described is a technology by which handwriting recognition is performed using a semi-tied covariance modeling (STC) that requires far less memory than other models such as MQDF. Offline training, such as via maximum likelihood and/or minimum classification error techniques, provides classification data. The classification data includes semi-tied transforms that are shared by classes, along with a class-dependent diagonal matrix and a mean vector corresponding to each class. The semi-tied transforms and class-dependent diagonal matrices are obtained by processing a precision matrix for each class. In online recognition, received handwritten input (e.g., an East Asian character) is classified into a class, based upon the class-dependent diagonal matrices and the semi-tied transforms, by a STC recognizer that outputs similarity scores for candidates and a decision rule that selects the most likely class. | 09-23-2010 |
20100246963 | Automatic arabic text image optical character recognition method - The automatic Arabic text image optical character recognition method includes training a text recognition system using Arabic printed text, using the produced models for classification of newly unseen Arabic scanned text, and generating the corresponding textual information. Scanned images of Arabic text and copies of minimal Arabic text are used in the training sessions. Each page is segmented into lines. Features of each line are extracted and input to Hidden Markov Model (HMM). All training data training features are used. HMM runs training algorithms to produce codebook and language models. In the classification stage new Arabic text is input in scanned form. Line segmentation where lines are extracted is passed through. In the feature stage, line features are extracted and input to the classification stage. In the classification stage the corresponding Arabic text is generated. | 09-30-2010 |
20100246964 | RECOGNIZING HANDWRITTEN WORDS - Recognizing handwritten words at an electronic device. A plurality of strokes is received at a common input region of an electronic device. The plurality of strokes in combination defines a word comprising a plurality of symbols, a relative geometry of a first subset of the plurality of strokes defines a first symbol and a relative geometry of a second subset of the plurality of strokes defines a second symbol such that the relative geometry of the first subset of the plurality of strokes is not related to the relative geometry of the second subset of the plurality of strokes, and at least one stroke of the first subset of the plurality of strokes is spatially superimposed over at least one stroke of the second subset of the plurality of strokes. The word is determined using a processor of the electronic device based on the plurality of strokes without requiring recognition of the plurality of symbols, wherein a word is determined based at least in part on an entry sequence of subsets of the plurality of strokes. | 09-30-2010 |
20110123115 | On-Screen Guideline-Based Selective Text Recognition - A live video stream captured by an on-device camera is displayed on a screen with an overlaid guideline. Video frames of the live video stream are analyzed for a video frame with acceptable quality. A text region is identified in the video frame approximate to the on-screen guideline and cropped from the video frame. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and generates text in an editable symbolic form (the OCR'ed text). A confidence score is determined for the OCR'ed text and compared with a threshold value. If the confidence score exceeds the threshold value, the OCR'ed text is outputted. | 05-26-2011 |
20110188756 | E-DICTIONARY SEARCH APPARATUS AND METHOD FOR DOCUMENT IN WHICH KOREAN CHARACTERS AND CHINESE CHARACTERS ARE MIXED - A method for providing a correct e-dictionary search result for a document recognition result includes performing character recognition of a document in which Korean characters (Hangul) and Chinese characters are mixed and displaying a recognition result. If a character string to be searched is selected by a user from the recognition result, determining whether the selected character string corresponds to Hangul or Chinese characters, detecting a Hangul word or a Chinese word included in the selected character string, and outputting an e-dictionary search result corresponding to the detected Hangul or a Chinese word. Accordingly, the user can use an e-dictionary function without directly inputting a search word and obtain a correct e-dictionary search result for a document in which Hangul and Chinese characters are mixed. | 08-04-2011 |
20110229038 | Feature Design for HMM Based Eastern Asian Character Recognition - An exemplary method for online character recognition of East Asian characters includes acquiring time sequential, online ink data for a handwritten East Asian character, conditioning the ink data to produce conditioned ink data where the conditioned ink data includes information as to writing sequence of the handwritten East Asian character and extracting features from the conditioned ink data where the features include a tangent feature, a curvature feature, a local length feature, a connection point feature and an imaginary stroke feature. Such a method may determine neighborhoods for ink data and extract features for each neighborhood. An exemplary Hidden Markov Model based character recognition system may use various exemplary methods for training and character recognition. | 09-22-2011 |
20110280484 | FEATURE DESIGN FOR HMM-BASED HANDWRITING RECOGNITION - The disclosed architecture is a new feature extraction approach to handwriting recognition. Given an handwriting sample (e.g., from an online source), a sequence of time-ordered dominant points are extracted, which include stroke-endings, points corresponding to local extrema of curvature, and points with a large distance to the chords formed by pairs of previously identified neighboring dominant points. At each dominant point, a multi-dimensional feature vector is extracted, which includes a combination of coordinate features, delta features, and double-delta features. | 11-17-2011 |
20120134591 | IMAGE PROCESSING APPARATUS, IMAGE PROCESSING METHOD AND COMPUTER-READABLE MEDIUM - An image processing apparatus includes a cutout position extraction unit, a character candidate extraction unit, a graph generation unit, a link value generation unit, a path selection unit and an output unit. The cutout position extraction unit extracts a cutout position. The character candidate extraction unit recognizes each character for each character image divided by the cutout position and extracts a plurality of character candidates for each recognized character. The graph generation unit sets each of the plurality of extracted character candidates as a node and generates a graph by establishing links between the nodes of adjacent character images. The link value generation unit generates a link value based on a value of character-string-hood representing a relationship between character candidates. The path selection unit selects a path in the generated graph based on the link value. The output unit outputs a character candidate string in the selected path. | 05-31-2012 |
20130077864 | SYSTEM AND METHODS FOR ARABIC TEXT RECOGNITION BASED ON EFFECTIVE ARABIC TEXT FEATURE EXTRACTION - A method for automatically recognizing Arabic text includes digitizing a line of Arabic characters to form a two-dimensional array of pixels each associated with a pixel value, wherein the pixel value is expressed in a binary number, dividing the line of the Arabic characters into a plurality of line images, defining a plurality of cells in one of the plurality of line images, wherein each of the plurality of cells comprises a group of adjacent pixels, serializing pixel values of pixels in each of the plurality of cells in one of the plurality of line images to form a binary cell number, forming a text feature vector according to binary cell numbers obtained from the plurality of cells in one of the plurality of line images, and feeding the text feature vector into a Hidden Markov Model to recognize the line of Arabic characters. | 03-28-2013 |
20130182956 | Methods and Devices for Processing Handwriting Input - A method for processing handwriting input includes determining a first boundary point and a second boundary point corresponding to each target track point, forming an enclosed area by connecting all first boundary points determined for all target track points, connecting all second boundary points determined for all the target track points, connecting the first boundary point corresponding to the first target track point with the second boundary point corresponding to the first target track point, and connecting the first boundary point corresponding to the last target track point with the second boundary points corresponding to the last target track point, and filling the enclosed area. | 07-18-2013 |
20140056523 | MOBILE APPARATUS HAVING HAND WRITING FUNCTION USING MULTI-TOUCH AND CONTROL METHOD THEREOF - A method of controlling a mobile apparatus having a hand writing function using a multi touch includes detecting a hand writing input that is input to a hand writing input window on a touch screen of the mobile apparatus, determining whether the detected hand writing input is a multi touch input or a single touch input, generating a hand writing output including multi touch output corresponding to the multi touch input and single touch output corresponding to the single touch input; and displaying the hand writing output in an output window on the touch screen. | 02-27-2014 |
20150055868 | CHARACTER DATA PROCESSING METHOD, INFORMATION PROCESSING METHOD, AND INFORMATION PROCESSING APPARATUS - A character data processing method executed by a computer includes detecting glyph variant information from an input character data string, and converting detected glyph variant information to extended expression data, the extended data and the detected glyph variant information, the basic character data being associated with the detected glyph variant information in the input character string, wherein the extended expression data can be converted to the basic character data by specific bit arithmetic processing. | 02-26-2015 |
20160048728 | METHOD AND SYSTEM FOR OPTICAL CHARACTER RECOGNITION THAT SHORT CIRCUIT PROCESSING FOR NON-CHARACTER CONTAINING CANDIDATE SYMBOL IMAGES - The current document is directed to methods and systems for identifying Chinese, Japanese, Korean, or similar language symbols that correspond to symbol images in a scanned-document image or other text-containing image. In a first processing phase, each symbol image is associated with a set of candidate graphemes. In a second processing phase, each symbol image is evaluated with respect to the set of candidate graphemes identified for the symbol image during the first phase. As candidate graphemes are processed, the currently described methods and systems monitor progress towards identifying a matching grapheme and, when insufficient progress is observed, terminate processing of the candidate graphemes and identify the symbol image as a non-symbol-containing area of the scanned-document image or other text-containing image. | 02-18-2016 |