digitizing books

13
STEPS, HINTS, VIDEOS Digitizing books 1

Upload: nyugat-magyarorszagi-egyetem-savaria-egyetemi-koezpont

Post on 22-Nov-2014

1.087 views

Category:

Education


4 download

DESCRIPTION

A digitalizálás fogalma, lépései, jó tanácsok, videók - angol nyelven. Description of digitization, steps, hints, videos.

TRANSCRIPT

1

STEPS, HINTS, VIDEOS

Digitizing books

2

What is the digitization?

Digitization is the process of converting information into a digital format. In this format, information is organized into discrete units of data that can be separately addressed. This is the binary data that computers and many devices with computing capacity can process.

http://whatis.techtarget.com/definition/0,,sid9_gci896692,00.html

See also: http://en.wikipedia.org/wiki/Digitizing

3

Steps of digitization

1. Choose the book you want to digitize.2. Choose an OCR software (GO!)3. Scan your book (Choose the devise.

Scanner, compact device, digital camera, IRIScan) (GO!)

4. Optical Character Recognition (image)5. Correction (image1) (image2)6. Save as a text searchable PDF documentSee another versions:

http://www.inquisition.ca/en/info/artic/comment_numeriser.htm

http://dlg.galileo.usg.edu/guide.html#01

4

Text and images

Text and images can be digitized similarly: a scanner captures an image (which may be an image of text) and converts it to an image file, such as a bitmap. An optical character recognition (OCR) program analyzes a text image for light and dark areas in order to identify each alphabetic letter or numeric digit, and converts each character into an ASCII code.

5

Choose an OCR software

There are a lot of softwares to digitize your documents.

On Wikipedia there is comparison list of optical character recognition softwares. Check it out!

http://en.wikipedia.org/wiki/List_of_optical_character_recognition_software

(I recommend you the ABBYY FineReader.)If you don’t want to buy (or download) a

software, here’s a free online OCR: http://www.newocr.com/

6

What is OCR?

OCR (optical character recognition) is the recognition of printed or written text characters by a computer. This involves photoscanning of the text character-by-character, analysis of the scanned-in image, and then translation of the character image into character codes, such as ASCII, commonly used in data processing.

http://searchcio-midmarket.techtarget.com/definition/OCR

Read more: http://en.wikipedia.org/wiki/Optical_character_recognition

7

What is ASCII?

ASCII (American Standard Code for Information Interchange) is the most common format for text files in computers and on the Internet. In an ASCII file, each alphabetic, numeric, or special character is represented with a 7-bit binary number (a string of seven 0s or 1s). 128 possible characters are defined.

In: http://searchcio-midmarket.techtarget.com/definition/ASCII

8

How to scan the book

With scanner: http://www.wikihow.com/Scan-a-Book

http://www.proportionalreading.com/scan.html

With one compact device: http://www.ehow.com/how_6950098_scan-book-pdf-format.html

With digital camera: http://www.wikihow.com/Scan-a-Book-With-a-Digital-Camera

With IRIScan: http://www.youtube.com/watch?v=9bgcDHLe3Xg

9

Optical Character Recognition

10

Correction image 1

11

Correction image 2

12

Videos

How to digitize a book: http://www.youtube.com/watch?v=-M95Ob4kIak

How to chop and scan a book:http://www.youtube.com/watch?v=8tx2JmW_

p4cScanning text using OCR software:http://www.youtube.com/watch?v=_SwrGtSY4

-cHow to OCR PDFs easily with Acrobat Batch

OCR:http://www.youtube.com/watch?v=V6Iz3U5X-

SUHow to digitize a million bookshttp://www.youtube.com/watch?v=OlKhKyTS

23E

13

How to put a scanned doc into PDF format

http://www.ehow.com/how_8563246_put-scanned-document-pdf-format.html

Some OCR softwares includePDF format to save.

Have a good reading onyour digital device!

Made by Mario Laskovics (2012.04.03)