Class / Patent application number | Description | Number of patent applications / Date published |
358462000 | Text and image detection and processing | 20 |
20080239410 | IMAGE PROCESSING APPARATUS AND IMAGE PROCESSING METHOD - A pixel in a photograph region image to be subjected to blacking is subjected to a brightness modulation processing so that the pixel is modulated to any of “a pixel having the brightness to be subject to blacking” and “a pixel having the brightness not to be subject to blacking”. Specifically, the modulation by noise addition converts some pixels for which original pixel values are to be subjected to blacking to a pixel having the brightness not subjected to blacking. Thereby, “a pixel having the brightness not to be subject to blacking” thus converted has no change in brightness in the blacking processing. Consequently, a pixel at which the blacking processing is not generated can be caused to exist in the photograph image. As a result, even when the blacking processing set for character/line region is similarly set for the photograph region, the effect of the blacking can be reduced. | 10-02-2008 |
20080309988 | MULTIFUNCTION PRINTER SYSTEM AND METHOD FOR AUTOMATED BATCH PROCESSING OF DOCUMENTS - A Method of batch processing a group of hardcopy documents scans a stack of documents. Each document in the stack has a cover sheet is placed thereon. The method performs optical character recognition on each of the cover sheets in the stack. The method performs an operation on each of the documents in the stack in accordance with instructions on the cover sheet on each document. Examples of operations that may be performed include printing the document, sending the document by fax to a recipient, sending an image file of the document by email to a recipient, and the like. | 12-18-2008 |
20090034016 | Method of Conferring Interactivity on Previously Printed Graphic Images - A method of conferring interactivity on a pre-printed image containing a URI text string. The method comprises the steps of: (i) receiving association data indicating an association between an impression identity, absolute positions and a scanned image; (ii) performing Optical Character Recognition on the scanned image to convert text images into computer text; (iii) identifying a URI text string in the computer text; (iv) generating an input description for the scanned image, and (v) storing a page description comprising the input description and the scanned image. The page description is indexed with the impression identity and, further, is retrievable so as to confer interactivity on the image. | 02-05-2009 |
20090080033 | IMAGE PROCESSING APPARATUS, IMAGE FORMING APPARATUS, IMAGE FORMING METHOD, IMAGE PROCESSING PROGRAM, AND RECORDING MEDIUM - Disclosed is an image processing apparatus that receives image data scanned by first and second image scanning units and performs various image processing. The apparatus includes first and second blank-sheet detection units that detect whether the scanned image data represent a blank sheet. The image processing apparatus determines storage or deletion of the scanned image data based on detection results of the blank-sheet detection units. | 03-26-2009 |
20100067067 | INFORMATION PROCESSING METHODOLOGY - An information processing methodology gives rise to an application program interface which includes an automated digitizing unit, such as a scanner, which inputs information from a diversity of hard copy documents and stores information from the hard copy documents into a memory as stored document information. Portions of the stored document information are selected in accordance with content instructions which designate portions of the stored document information required by a particular application program. The selected stored document information is then placed into the transmission format required by a particular application program in accordance with transmission format instructions. After the information has been transmission formatted, the information is transmitted to the application program. In one operational mode, the interface interactively prompts the user to identify, on a display, portions of the hard copy documents containing information used in application programs or for storage. | 03-18-2010 |
20100073735 | CAMERA-BASED DOCUMENT IMAGING - A process and system to transform a digital photograph of a text document into a scan-quality image is disclosed. By extracting the document text from the image, and analyzing visual clues from the text, a grid is constructed over the image representing the distortions in the image. Transforming the image to straighten this grid removes distortions introduced by the camera image-capture process. Variations in lighting, the extraction of text line information, and the modeling of curved lines in the image may be corrected. | 03-25-2010 |
20110007366 | SYSTEM AND METHOD FOR CLASSIFYING CONNECTED GROUPS OF FOREGROUND PIXELS IN SCANNED DOCUMENT IMAGES ACCORDING TO THE TYPE OF MARKING - Methods and systems for classifying markings on images in a document are undertaken according to marking types. The document containing the images is supplied to a segmenter which breaks the images into fragments of foreground pixel structures that are identified as being likely to be of the same marking type by finding connected components, extracting near-horizontal or -vertical rule lines and subdividing some connected components to obtain the fragments. The fragments are then supplied to a classifier, where the classifier provides a category score for each fragment, wherein the classifier is trained from the groundtruth images whose pixels are labeled according to known marking types. Thereafter, a same label is assigned to all pixels in a particular fragment, when the fragment is classified by the classifier. | 01-13-2011 |
20110116141 | IMAGE PROCESSING METHOD AND IMAGE PROCESSING APPARATUS - An image processing method, for receiving an input image and separating pixels having text characteristics and pixels having figure characteristics, includes: applying a first filtering processing for the input image to derive a first image processing result; applying a second filtering processing for the first image processing result to derive a second image processing result, wherein a distribution of filtering parameters of the first filtering processing is different from a distribution of filtering parameters of the second filtering processing; deriving a set of first reference values according to the first image processing result and the second image processing result; and determining whether each pixel within the input image is a text pixel or a figure pixel according to at least the set of the first reference values and a predetermined threshold. | 05-19-2011 |
20110134492 | IMAGE PROCESSING APPARATUS AND CONTROLLING METHOD FOR THE SAME - When originals whose types are mixed are checked, problems exist as follows. If an original is missing from the originals, or if the order of some originals is wrong in the originals, the check is performed in accordance with a processing instruction which is different from a processing instruction originally expected to be applied. The reliability of the check result is accordingly deteriorated. Pieces of processing instruction information, which are as many as originals having different formats, for performing a check process on predetermined entry items in originals having a predetermined format are beforehand stored. A piece of image data on each original is obtained by reading the multiple originals to be checked. Subsequently, the check process is performed on the obtained piece of image data on an original by sequentially applying the as many pieces of processing instruction information as the originals having the different form. | 06-09-2011 |
20110149350 | DOCUMENT PROCESSING APPARATUS - A document processing apparatus comprises an image reader for scanning an original manuscript which is not updated to generate first image data on the original manuscript and for scanning an updated manuscript to generate second image data on the updated manuscript, a text information extraction part for extracting first text information from the first image data and extracting second text information from the second image data, an updated portion detector for detecting an updated portion of the updated manuscript on the basis of the first text information and the second text information, an electronic document generator for generating an electronic document of the updated manuscript on the basis of the second image data, and a storage controller for generating display data of the updated portion on the basis of a detection result on the updated portion and storing the display data into the electronic document. | 06-23-2011 |
20110292463 | SYSTEM FOR IDENTIFYING PHYSICAL PAGE CONTAINING PRINTED TEXT - A system for identifying a physical page containing printed text from a plurality of page fragment images. The system includes: (A) a handheld electronic device having: a camera for capturing a plurality of page fragment images at a plurality of different capture points when the device is moved across the physical page; motion sensing circuitry for measuring a displacement or a direction of movement; and a transceiver; (B) a processing system configured for: performing OCR on each captured page fragment image to identify a plurality of glyphs in a two-dimensional array; and creating a glyph group key for each page fragment image; and (C) an inverted index of the glyph group keys. | 12-01-2011 |
20120105918 | AUGMENTING PAGE ORIENTATION DIRECTION DETECTION IN A DIGITAL DOCUMENT PROCESSING ENVIRONMENT - What is disclosed is a novel system and method for augmenting present methods used for determining the orientation direction automatically being detected of digital pages of a plurality of scanned documents in a digital document processing environment. The present method takes advantage of the observation that pages scanned in data processing centers are often highly correlated. The present method contains five primary steps. 1) Page orientation (i.e., up/down) is detected using a traditional method. 2) Each page is classified as either directional or non-directional. 3) The pages classified as directional are clustered into groups. 4) The direction for each group is determined. 5) The directional group's direction is used to revise the orientation for pages contained in the group. Through the implementation of the teachings hereof, performance, in terms of both speed and accuracy, are very high relative to current methods and detection error rates can be reduced significantly. | 05-03-2012 |
20120243056 | PAPER-SHEET MANAGEMENT METHOD AND PAPER-SHEET MANAGEMENT SYSTEM - A first identification number obtained by performing character recognition of each of a plurality of digits of an identification number and allocating predetermined characters to digits where characters cannot be determined and a second identification number representing an identification number that is the target of character recognition processing or search processing are compared with each other by calculating a matching ratio between the two numbers. When the matching ratio is lower than a predetermined reference value, a shifted identification number obtained by shifting each character forming the first identification number one digit in a predetermined direction and the second identification number are compared with each other by calculating a matching ratio between the two numbers. If this matching ratio is equal to or higher than the predetermined reference value, subsequent processing is continued, so that the shifted identification number is treated as the first identification number. | 09-27-2012 |
20120274991 | SYSTEM AND METHOD FOR DOCUMENT ORIENTATION DETECTION - In one embodiment, a method of detecting document orientation includes capturing a document image, binarizing each subimage of the document image to retain textual content and eliminate graphic and noise content from the document image, detecting portrait or landscape orientation based on values computed from strip-based projection profiles, and detecting up or down text orientation based on a text-asymmetry ratio computed from strip-based projection profiles. | 11-01-2012 |
20130088757 | SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS FOR DETERMINING DOCUMENT VALIDITY - A method according to one embodiment includes performing optical character recognition (OCR) on an image of a first document; generating a list of hypotheses mapping the first document to a complementary document using: textual information from the first document, textual information from the complementary document, and predefined business rules; at least one of: correcting OCR errors in the first document, and normalizing data from the complementary document, using at least one of the textual information from the complementary document and the predefined business rules; determining a validity of the first document based on the hypotheses; and outputting an indication of the determined validity. Additional systems, methods and computer program products are also presented. | 04-11-2013 |
20140185106 | APPARATUS, METHOD AND PROGRAM FOR CHARACTER RECOGNITION - A character recognition apparatus may include an imaging element configured to read a character string placed on an information recording medium; an image memory configured to store image data of the character string; and a character segmenting unit configured to segment a character constituting the character string. The character segmenting unit may include a minimum intensity curve creating unit configured to detect a minimum intensity value among light intensity values, and create a minimum intensity curve of the image data according to the minimum intensity value of each pixel row; a character segmenting position detecting unit configured to calculate a space between the characters neighboring in the created minimum intensity curve, in order to detect a character segmenting position between the characters; and a character segmenting process unit configured to segment each character according to the detected character segmenting position between the characters. | 07-03-2014 |
20140268250 | SYSTEMS AND METHODS FOR RECEIPT-BASED MOBILE IMAGE CAPTURE - Systems and methods of capturing data from mobile images of receipts implemented are provided herein. One of the most important tasks behind the mobile receipt capture technology is understanding and utilizing category-specific rules in the form of known document sizes, relationships between different document fields, etc. For example, knowledge that many receipts have 3 inch widths helps to alter an image to restore the actual size of a receipt, which in turn improves a printing function and, most importantly, accuracy of content extraction such as optical character recognition. | 09-18-2014 |
20150009542 | APPARATUS AND METHOD FOR SCANNING AND DECODING INFORMATION IN AN IDENTIFIED LOCATION IN A DOCUMENT - A imaging scanner identifies first and second locations in a first and second captured image of a document, analyzes each character in the identified locations, and produces a first and second string, each including a character and a confidence value. The device determines that a first measurement of the confidence values in each of the first and second string is beyond a range of a first threshold. The device compares the confidence value for each character in the first string with a corresponding confidence value in the second string, selects a character from one of the first or second string with a higher confidence value; and produces a combined string including the selected characters and the confidence value associated with each selected character. | 01-08-2015 |
20160028901 | IMAGE FORMING APPARATUS SUITABLE FOR READING OF DOCUMENT - Provided is an image forming apparatus that eliminates the necessity for resetting a single-sided document even if a set error of the single-sided document is occurred. The automatic document feeder includes a switchback mechanism that inverts the front and back of the single-sided document to switch a surface of the single-sided document. The automatic document feeder control circuit controls to activate the switchback mechanism. When a blank paper is detected from the image data before inversion, the automatic document feeder control circuit controls to continue the activation of the switchback mechanism for a second document and succeeding documents of the single-sided document. Otherwise, when a blank paper is not detected from the image data before inversion, the automatic document feeder control circuit controls to stop the activation of the switchback mechanism for the second document and the succeeding documents of the single-sided document. | 01-28-2016 |
20160180166 | IN-SCANNER DOCUMENT IMAGE PROCESSING | 06-23-2016 |