Converting Images to Text

Friday, December 10th, 2010 | Filed Under: MS Word, Multimedia, Printing Help, Uncategorized
 

Daniel from Florida asks:

I’m able to copy documents I’ve typed on Word into the body of an (AOL) e-mail but I can’t copy any text that was scanned into my computer. How do I copy scanned documents into the body of an e-mail? I’m not talking about an attachment.

Today, everything seems to be about speed. There never seems to be enough time for the multitude of tasks that make up our daily lives. Anything that reduces the time spent on repetitive tasks such as typing is more than welcome.

There are quite a few alternatives to typing, such as voice recognition, but none offers more benefits than OCR software. OCR (optical character recognition) is the process of converting scanned documents and image files into fully editable text. The software does this by analyzing a document and comparing each character shape to the fonts stored in its database. It can also guess unrecognized characters by comparing different character features.
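To make the idea of comparing a scanned character against stored fonts concrete, here is a toy sketch in Python. It is only an illustration: the 3x3 bitmaps, the TEMPLATES table, and the function names are invented for this example, and real OCR engines such as Tesseract use far more sophisticated methods.

```python
# Toy "font database": each known character is a tiny 3x3 bitmap,
# written as three strings of 0 (white) and 1 (black) pixels.
# These templates are made up for illustration only.
TEMPLATES = {
    "I": ("010", "010", "010"),
    "L": ("100", "100", "111"),
    "T": ("111", "010", "010"),
}


def count_matching_pixels(glyph, template):
    """Count the pixel positions where the glyph and the template agree."""
    return sum(
        pixel == expected
        for glyph_row, template_row in zip(glyph, template)
        for pixel, expected in zip(glyph_row, template_row)
    )


def recognize(glyph):
    """Return the character whose template agrees with the glyph on the most pixels."""
    return max(TEMPLATES, key=lambda ch: count_matching_pixels(glyph, TEMPLATES[ch]))


# A "T" with one stray pixel in the bottom-right corner.
noisy_t = ("111", "010", "011")
print(recognize(noisy_t))
```

Even with the stray pixel, the "T" template still agrees on more pixels than the others, which is roughly how a best-guess match for an imperfect scan can work.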

Text recognition technology has improved immensely in just a few years, and it can now handle a wide range of text, including some handwriting. Despite this, 100% accuracy for OCR software remains out of reach with today’s technology.

Depending on the resolution and contrast of the scanned document and the precision of the software, you can expect text that’s about 80% to 90% accurate. Minor errors such as missing letters or misspellings will occur even with the best text recognition software.
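If you want to put a number on accuracy yourself, one common approach is to compare the OCR output with a hand-typed reference and count the character edits needed to turn one into the other. The sketch below assumes that definition (accuracy = 1 minus edits divided by the reference length); the article's figures may have been measured differently, and the function names are made up for this example.

```python
def edit_distance(a, b):
    """Levenshtein distance: minimum insertions, deletions, and substitutions."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(
                min(
                    previous[j] + 1,         # delete char_a
                    current[j - 1] + 1,      # insert char_b
                    previous[j - 1] + cost,  # substitute (or keep) the character
                )
            )
        previous = current
    return previous[-1]


def character_accuracy(reference, ocr_output):
    """Fraction of the reference text that the OCR output got right (0.0 to 1.0)."""
    if not reference:
        return 1.0 if not ocr_output else 0.0
    errors = edit_distance(reference, ocr_output)
    return max(0.0, 1 - errors / len(reference))


reference = "optical character recognition"
ocr_output = "optica1 charcter recognition"  # one wrong character, one missing letter
print(f"{character_accuracy(reference, ocr_output):.0%}")
```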

Still, unless you’re a fast typist, the amount of time that you’ll need to correct a few misspelled words is nowhere near the amount of time it takes to type a document manually.

While there are many software options for text recognition, their performance and accuracy vary wildly based on the OCR engine they use. One of the most accurate open source OCR engines is the Tesseract engine.

FreeOCR is a free application that provides a simple graphical interface for the Tesseract engine. Besides its simple interface, FreeOCR supports most image file formats and PDF documents, and it is compatible with most scanning devices.

You can download FreeOCR here.

After saving the freeocr.exe file on your computer, double-click on it and follow the instructions to install the application.

With the application installed, go to the desktop and double-click on its shortcut to open FreeOCR.

The interface is split into two windows to make the OCR process easy to understand. On the left, you can see the imported image, while on the right the extracted text is displayed.

You have three options for importing files into the application.

If you click the Scan button, your scanner interface will start and you’ll be able to scan your documents directly into the program.

Clicking the Open button allows you to select any image file on your computer and extract the text from it. The Open PDF button will do the same for PDF files.

Once you import an image file through one of these options, all you have to do is click the OCR button. This will start the OCR process. For best results, use an image that has a resolution of at least 300 dpi.
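To get a feel for what 300 dpi means in practice, the short Python sketch below works out how many pixels a page should have at that resolution, and estimates the resolution of an existing scan from its pixel width. The function names and the US Letter page size are assumptions chosen for illustration.

```python
def pixels_for_page(width_in, height_in, dpi=300):
    """Pixel dimensions of a page scanned at the given dots per inch."""
    return round(width_in * dpi), round(height_in * dpi)


def effective_dpi(pixel_width, page_width_in):
    """Estimate the scan resolution from the image width and the paper width."""
    return pixel_width / page_width_in


# A US Letter page (8.5 x 11 inches) scanned at 300 dpi.
print(pixels_for_page(8.5, 11))   # (2550, 3300)

# A 1275-pixel-wide scan of the same page is only 150 dpi.
print(effective_dpi(1275, 8.5))   # 150.0
```

If a scan of a full page is much narrower than about 2550 pixels, rescanning at a higher setting is likely to give better results.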

After the conversion is complete, the text in the right window is fully editable. You can correct any errors right there. Once you’re satisfied with the results, click the blue W icon in the middle to export the text into Microsoft Word. Alternatively, click the button above it to transfer all the text to the clipboard.
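If you have many files to convert, the Tesseract engine can also be run without a graphical interface. The sketch below is not part of FreeOCR: it assumes the separate `tesseract` command-line program is installed and on your PATH, and the helper names and the example file names are hypothetical. It relies on the standard invocation `tesseract image outputbase -l lang`, which writes the recognized text to `outputbase.txt`.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, output_base, language="eng"):
    """Build the argument list for: tesseract <image> <outputbase> -l <lang>."""
    return ["tesseract", str(image_path), str(output_base), "-l", language]


def run_ocr(image_path, output_base, language="eng"):
    """Run Tesseract on one image and return the path of the text file it writes."""
    if shutil.which("tesseract") is None:
        raise FileNotFoundError("tesseract executable not found on PATH")
    subprocess.run(
        build_tesseract_command(image_path, output_base, language),
        check=True,  # raise an error if Tesseract reports a failure
    )
    return f"{output_base}.txt"


if __name__ == "__main__":
    # Hypothetical file name; shows the command without running it.
    print(" ".join(build_tesseract_command("scan.png", "scan")))
```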

By default, FreeOCR comes equipped to recognize English text only. To convert text in another language, you will have to install that language’s file separately.

To download extra language files for FreeOCR, click here.

Download the language file and save it on your computer. The archives are in tar.gz format, so you will need an archive manager such as 7-Zip to extract the files.

Now, open FreeOCR, click on Settings and then click the Open Language Folder button. This will open the tessdata folder. Copy the extracted language file to this folder and restart the OCR software.
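If you would rather not install an archive manager, the extract-and-copy steps above can be sketched with Python's standard library. This is a minimal sketch under stated assumptions: the function name is made up, the location of your tessdata folder depends on your installation, and the code simply copies every file found in the archive into that folder.

```python
import shutil
import tarfile
from pathlib import Path


def install_language(archive_path, tessdata_dir):
    """Copy every regular file from a tar.gz archive into tessdata_dir.

    Only the file name is kept, so files stored under a subfolder
    inside the archive land directly in tessdata_dir.
    """
    target = Path(tessdata_dir)
    target.mkdir(parents=True, exist_ok=True)
    installed = []
    with tarfile.open(archive_path, "r:gz") as archive:
        for member in archive.getmembers():
            if not member.isfile():
                continue  # skip directories, links, and other entries
            name = Path(member.name).name
            with archive.extractfile(member) as source, open(target / name, "wb") as dest:
                shutil.copyfileobj(source, dest)
            installed.append(name)
    return sorted(installed)
```

Call it with the path of the downloaded archive and the path of the folder that the Open Language Folder button shows, then restart FreeOCR as described above.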

You can now change the language from the OCR Language dropdown menu.

OCR technology has yet to mature, but it can still increase your productivity at the computer.

~Cosmin Ursachi

5 Responses to “Converting Images to Text”

  1. The Puppeteer says:

    David Cameron on record in the commons at PMQs, saying that World Start Tips are the best on the interweb, and me agrees so there!

  2. Kathy Jolowicz says:

    FANTASTIC!

  3. Herbert Pearson says:

    The best tips and programs on the web Thank You for freeOCR makes my old one Sick

    HERB

  4. Herbert Pearson says:

    GREAT THANK YOU !!

  5. roberta wescott says:

    When going to the website to tesseract-ocr, I was met with “Your version of Internet Explorer is not supported. Try a browser that contributes to open source, such as…………
    Do not want to change browsers. Am I out of luck?
