Free OCR software? You may already have it...
Fri Jul 20, 2007 5:25 pm
OCR (Optical Character Recognition) can really come in handy. For example, I previously wrote about how I use Timesnapper as a black box to recover work which would otherwise be lost. Since most of my work is text based (C#, SQL, HTML, documentation, communications, etc.), the obvious next step is to grab the code from a screenshot. Of course I can retype it, but OCR would be better.


There are some great commercial OCR packages out there. My company recently used OmniPage Pro in a project which loaded data from hundreds of PowerPoint slides into SQL Server for reporting and analysis. OmniPage is great software, but it costs $149 for the basic version, which doesn't really make sense if you're just using it to avoid retyping a little text from a screenshot every now and then.
I looked around for free OCR software, and was a little bit surprised that there wasn't much out there. Here's a rundown of what I found, wrapping up with a program that wasn't technically free, but I already had it. There's a good chance you've got it, too.
GOCR
I first tried out GOCR (a.k.a. JOCR). The easiest way to try it out is the GOCR Win Frontend, which installs GOCR as well. My opinion matched Pitor's:

To let things be clear - gocr is not ready, to say the least. Personally I'd even say the effect of trying to OCR a page is so crappy it is not even worth installing the gocr engine (seems like the total rewrite in 0.40 did not help much). And I am talking about an ascii black text on a white page, without other elements. Gocr seems to go all the way down here - error in 98% of recognized characters, randomly added spaces, etc. For example: content is C unrir in gocr, sounds like drunken elvish to me.
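A figure like "error in 98% of recognized characters" can be checked on your own samples. The sketch below is one common way to do it, not the method used in the quote: it computes a character error rate as the edit distance between the expected text and the OCR output, divided by the length of the expected text. The function names are hypothetical and chosen only for illustration.

```python
def levenshtein(a, b):
    """Return the edit distance (insertions, deletions, substitutions) between a and b."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete a character from a
                current[j - 1] + 1,      # insert a character into a
                previous[j - 1] + cost,  # substitute (or keep a match)
            ))
        previous = current
    return previous[-1]


def char_error_rate(expected, recognized):
    """Edit distance normalised by the length of the expected text."""
    if not expected:
        raise ValueError("expected text must not be empty")
    return levenshtein(expected, recognized) / len(expected)


if __name__ == "__main__":
    # The sample pair mentioned in the quote above.
    print(char_error_rate("content", "C unrir"))
```

Because insertions count as errors, the rate can exceed 1.0 when the OCR output contains a lot of spurious characters.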
Tesseract OCR
Yeah, there's been some chatter in the blogospheres and internets about Tesseract since Google assisted in re-releasing it as an open source project. I have no doubts that the press alone (not to mention Google's involvement) will propel Tesseract towards OCR fame and fortune, but it sounds like it's not usable at this point:
It only is configured to build under MSVC++6 for Windows.
It only accepts uncompressed bitonal tiffs.
It's command-line only.
No GUI.
It performed abysmally on the provided testimage.tif
But it did build.
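Since the points above describe a command-line-only tool that takes TIFF input, a small wrapper script is one way to make it less painful. This is a minimal sketch, assuming a `tesseract` executable is on the PATH and that it follows the usual `tesseract <image> <outputbase>` form, writing its result to `<outputbase>.txt`; the build discussed here may behave differently. The helper names are hypothetical.

```python
import subprocess
from pathlib import Path


def build_tesseract_command(image_path, output_base):
    """Build the argument list for a basic Tesseract run on a TIFF image."""
    image = Path(image_path)
    # The points above say only TIFF input is accepted, so reject anything else early.
    if image.suffix.lower() not in {".tif", ".tiff"}:
        raise ValueError("expected a .tif or .tiff file, got: " + str(image))
    return ["tesseract", str(image), str(output_base)]


def run_ocr(image_path, output_base):
    """Run Tesseract and return the recognized text (assumes it writes <output_base>.txt)."""
    subprocess.run(build_tesseract_command(image_path, output_base), check=True)
    return Path(str(output_base) + ".txt").read_text(encoding="utf-8", errors="replace")


if __name__ == "__main__":
    print(run_ocr("testimage.tif", "testimage"))
```

The extension check only guards the file name. It does not verify that the TIFF is uncompressed and bitonal, so that conversion still has to happen in a graphics application first.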
Microsoft Office Document Imaging
By accident, I stumbled across Microsoft Office Document Imaging. It's included in Microsoft Office Tools ("Microsoft Office \ Microsoft Office Tools" folder in the start menu, default installation location is "C:\Program Files\Common Files\Microsoft Shared\MODI\11.0\"). The interface looks like a "My First VB5 Application" reject, but it works great.
It handles scanned documents via TWAIN. The image import's a bit lame - it only handles TIF files. You can convert to TIF in just about any graphics application (e.g. MSPAINT - open the file, Save As TIF file). An easier method is to just copy the image to the clipboard and paste as a new page into MODI.

Contributed by Editorial Team, Executive Management Team
© 2006 - 2008 iVirtua Community (UK), Part of iVirtua Media Group, London (UK). Tel: 020 8144 7222
