This mode enables you to perform OCR (optical character recognition) to extract data that can be recognized as text from the scanned image and create a PDF/XPS/OOXML (pptx/docx) file that is searchable. You can also set <Compact> if you select PDF or XPS as the file format. |
1 | Select <PDF> press <Set Details> <OCR (Text Searchable)>. |
2 | To change a language to use for OCR, press <OCR Language> select a language press <OK>. |
1 | Select <XPS> press <Set Details> <OCR (Text Searchable)>. |
2 | To change a language to use for OCR, press <OCR Language> select a language press <OK>. |
1 | Select <OOXML> select <Word> from the drop-down list. |
1 | Select <OOXML> select <PowerPoint> from the drop-down list. |
2 | Press <Set Details> select <OCR (Text Searchable)>. |
3 | Select the language to use with OCR in <OCR Language> press <OK>. |
If you select <PDF; OCR>, <XPS; OCR>, or <OOXML; OCR> as the file format, and <Smart Scan> is set to <On> in <OCR (Text Searchable) Settings>, the orientation of the original is detected, and the document is automatically rotated if necessary before it is sent. <OCR (Text Searchable) Settings> If you select <PDF> or <XPS> as the file format, you can set <Compact> and <OCR (Text Searchable)> at the same time. In that case, <PDF; Compact> or <XPS; Compact> is displayed as the file format on the Scan and Send Basic Features screen. If you select <Word> for <OOXML>, you can set to delete the scanned background images. You can generate Word files which are easy to edit without unwanted images. <Include Background Images in Word File> Select one language or one group according to the language used in the originals to scan. Settings and Languages for OCR Processing |
Item | Details |
Language Settings for Character Recognition | When a language is specified with OCR selected in <File Format>: Characters are recognized based on the language you select for each file format. When a language is not specified with OCR selected in <File Format>: Characters are recognized based on the language you select in <Switch Language/Keyboard> (<Switch Language/Keyboard>).*1 |
Recognizable Asian Languages | Japanese, Chinese (Simplified), Chinese (Traditional), Korean Recognizable Character Types and Fonts (Asian Languages) |
Recognizable European Languages and Language Groups | Languages: English, French, Italian, German, Spanish, Dutch, Portuguese, Albanian, Catalan, Danish, Finnish, Icelandic, Norwegian, Swedish, Croatian, Czech, Hungarian, Polish, Slovak, Estonian, Latvian, Lithuanian, Russian, Greek, Turkish Language Groups: Western European (ISO)*2, Central European (ISO)*3, Baltic (ISO)*4 Recognizable Character Types and Fonts (European Languages) |
Item | Details |
Recognizable Character Types | Japanese: Alphanumeric characters, Kana characters, Kanji characters (JIS first level, and some of the JIS second level), Symbols Chinese (Simplified): Alphanumeric characters, Chinese characters, Symbols (GB2312-80) Chinese (Traditional): Alphanumeric characters, Chinese characters, Symbols (Big5) Korean: Alphanumeric characters, Chinese characters, Hangul characters, Symbols (KSC5601) |
Recognizable Fonts | Multiple fonts are supported. (Ming-cho type is recommended.) Italicized characters cannot be recognized. |
Fonts Used for Converted Characters (Only when Word is selected as the file format) | Japanese: Asian characters: MS Mincho European characters: Century Chinese (Simplified): Asian characters: SimSun European characters: Calibri Chinese (Traditional): Asian characters: PMingLiU European characters: Calibri |
Item | Details |
Recognizable Character Types | Alphanumeric characters, Special characters of the recognized language*, Symbols |
Recognizable Fonts | Multiple fonts are supported. (Times, Century, and Arial are recommended.) Italicized characters can be recognized. |
Fonts Used for Converted Characters (Only when Word is selected as the file format) | Calibri Italic style is not reproduced. |
Item | Details |
Original Format | Printed documents, Word processor documents (documents consisting of text, graphics, photographs, or tables, and with no character slant) |
Text Format | Horizontal and vertical writing (documents containing both horizontal and vertical writing can also be recognized) Only horizontal writing can be recognized for European languages and Korean text. One to three column documents with no complex column settings |
Character Size | 8 to 40 point |
Table Format (For Word Format Only) | Tables that meet the following conditions: Tables consist of squares divided with solid lines Tables with up to 32 columns Tables with up to 32 rows |
Some originals suitable for OCR processing may not be processed properly.High accuracy may not be achieved with originals including a large amount of text on each page. Characters may be replaced with unintended characters or be missing due to the background color of the original, form and size of characters, or slanted characters.* Paragraphs, line breaks, or tables may not be reproduced.* Some parts of illustrations, photographs, or seal impressions may be recognized as characters and be replaced with characters.* * When Word is selected as the file format. |