Convert Image to Text using Google Drive Free OCR Software

In my previous study post I mentioned how you can turn your commuting time into productive time by using an online text-to-speech reader.
Some of you have asked me whether it is possible to have it read an image file or PDF document.

The answer is yes~
But! For the reader to perform text-to-speech conversion, you'll first have to convert your PDF/image file into a plain text file using Optical Character Recognition (OCR) software~


So what is Optical Character Recognition, and what is it used for?

OCR is a character recognition technology that converts the text in images or PDF files into a plain text document. The algorithm scans the image, detects the characters, tells them apart and converts them to plain text, far faster than retyping everything by hand.
The extracted text can then be used for various applications:
  • Data mining and text mining
  • Text to speech
  • Language translation (who needs an interpreter, I want my カレーライス)
  • Assistive tool for the blind and visually impaired
  • File keeping and saving storage space. Typically an image file takes up more bytes than a text file.
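
If you are curious what "scan, detect, differentiate and convert" can look like in code, here is a toy sketch in Python. It is definitely not how Google Drive does it (real OCR engines are far more sophisticated). It only matches tiny 3x5 bitmaps against a handful of templates. The `TEMPLATES` table, the function names and the `SAMPLE` image are all made up for illustration.

```python
# Toy template-matching OCR. An illustration of the idea only,
# NOT the algorithm Google Drive uses.
# Each glyph is a 3x5 bitmap: '#' is ink, '.' is background.
TEMPLATES = {
    "O": ["###", "#.#", "#.#", "#.#", "###"],
    "C": ["###", "#..", "#..", "#..", "###"],
    "R": ["##.", "#.#", "##.", "#.#", "#.#"],
}

# A hypothetical "scanned image" spelling OCR, glyphs separated by a blank column.
SAMPLE = [
    "###.###.##.",
    "#.#.#...#.#",
    "#.#.#...##.",
    "#.#.#...#.#",
    "###.###.#.#",
]


def split_glyphs(image):
    """Detect: cut the image into glyphs wherever a column has no ink."""
    height = len(image)
    glyphs, current = [], []
    for x in range(len(image[0])):
        column = [row[x] for row in image]
        if all(pixel == "." for pixel in column):
            if current:
                glyphs.append(current)
                current = []
        else:
            current.append(column)
    if current:
        glyphs.append(current)
    # Turn each glyph's list of columns back into a list of row strings.
    return [["".join(col[y] for col in glyph) for y in range(height)]
            for glyph in glyphs]


def distance(a, b):
    """Differentiate: count the pixels that differ between two glyphs."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def recognise(image):
    """Convert: pick the closest template for every glyph found."""
    return "".join(
        min(TEMPLATES, key=lambda ch: distance(glyph, TEMPLATES[ch]))
        for glyph in split_glyphs(image)
    )


print(recognise(SAMPLE))  # prints OCR
```

Because the match is "closest template" rather than "exact template", a glyph with a stray pixel or two can still be recognised, which is roughly why clean, high-contrast scans give better results.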


How to extract text using Google Drive OCR software 

  1. Log in to Google Drive
  2. Click on the gear icon on the right-hand side of your window
  3. Go to upload settings and check these two settings
    ☑ Convert text from uploaded PDF and image files
    ☑ Confirm settings before each upload
    (Screenshot: Google Drive OCR settings)
  4. Now when you upload a file, a prompt window will appear and ask you to confirm your settings
  5. Leave the settings as they are if you are converting an English document
    For this example I am going to convert a short Japanese passage
  6. Once the upload is done, open up your new file
  7. The extracted plain text will be shown below your image
(Image: raw image on top, OCRed words below)
See, it works for Japanese characters as well~ I was able to get 100% accuracy on this image of Japanese characters.
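
If you want to put a number on accuracy instead of eyeballing it, one simple way is to type out the expected text once and compare it with what the OCR returned. The sketch below uses Python's standard `difflib` module. The function name `char_accuracy` and the sample strings are assumptions for illustration, and this is only one of several possible ways to measure accuracy.

```python
from difflib import SequenceMatcher


def char_accuracy(expected, ocr_text):
    """Share of the expected characters that also appear, in order, in ocr_text.

    Returns a value from 0.0 to 1.0. Extra characters added by the OCR are
    not penalised here, so treat this as a rough check only.
    """
    if not expected:
        return 1.0 if not ocr_text else 0.0
    matcher = SequenceMatcher(None, expected, ocr_text, autojunk=False)
    matched = sum(block.size for block in matcher.get_matching_blocks())
    return matched / len(expected)


# Hypothetical example: the OCR confused a full-size イ with a small ィ.
print(char_accuracy("カレーライス", "カレーライス"))  # 1.0
print(char_accuracy("カレーライス", "カレーラィス"))  # 5 of 6 characters match
```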

Things to note:
Google Drive's file size limit for OCR is 2MB (.jpg, .gif, .png and PDF).
It will only look at the first 10 pages of a PDF when searching for text to extract.
Always check the result. An OCRed document may not always be accurate.
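
If you have a whole folder of scans, the limits above can be checked before uploading. Here is a minimal Python sketch. The function name `ocr_upload_problems` is made up, the 2MB figure and file types are simply the ones quoted in this post, and treating 2MB as 2 x 1024 x 1024 bytes is an assumption.

```python
import os

# Limits as quoted in this post; treating 2MB as 2 * 1024 * 1024 bytes is an assumption.
MAX_OCR_BYTES = 2 * 1024 * 1024
OCR_EXTENSIONS = {".jpg", ".gif", ".png", ".pdf"}


def ocr_upload_problems(filename, size_bytes):
    """Return a list of reasons why a file may be skipped by OCR (empty list = looks fine)."""
    problems = []
    extension = os.path.splitext(filename)[1].lower()
    if extension not in OCR_EXTENSIONS:
        problems.append(f"unsupported file type: {extension or '(none)'}")
    if size_bytes > MAX_OCR_BYTES:
        problems.append(f"file is larger than {MAX_OCR_BYTES} bytes")
    return problems


print(ocr_upload_problems("scan.PNG", 500_000))             # []
print(ocr_upload_problems("notes.docx", 3 * 1024 * 1024))   # two problems listed
```

For a real file on disk you could pass `os.path.getsize(path)` as `size_bytes`.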

For best accuracy:
Use a high-resolution, well-lit image with good contrast between the characters and the background.
Orient your text in the correct direction. Typically the text will be scanned from left to right.
Multi-column documents don't work well with this OCR.
Select the correct language (there are 38).
Use a top-down view of the page, like I did.
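
"Good contrast" is a bit vague, so here is one rough way to measure it: the Michelson contrast, (brightest - darkest) / (brightest + darkest), calculated over grayscale pixel values. This is only a sketch. The function name is made up, how you get the pixel values out of your image is up to you, and there is no official threshold that Google Drive requires.

```python
def michelson_contrast(gray_pixels):
    """Michelson contrast of grayscale values (0 = black, 255 = white).

    Returns 0.0 for a flat image and 1.0 when pure black sits next to any
    brighter value. Higher generally means the text stands out more.
    """
    values = list(gray_pixels)
    if not values:
        raise ValueError("need at least one pixel value")
    darkest, brightest = min(values), max(values)
    if brightest + darkest == 0:
        return 0.0  # an all-black image has no contrast
    return (brightest - darkest) / (brightest + darkest)


# Hypothetical pixel samples: crisp black text on white vs. a washed-out photo.
print(michelson_contrast([0, 255, 250, 5]))   # 1.0
print(michelson_contrast([120, 130, 125]))    # 0.04
```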
