Announcement

Collapse
No announcement yet.

scan in a page as a document, not an image... possible?

Collapse
This topic is closed.
X
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

    scan in a page as a document, not an image... possible?

    i'm looking to see if it's possible to scan in a document (computer print out, book page, etc...) and have it be able to open into OpenOffice or Okular (or some other pdf viewer) but i don't want it opened as an image.. i'm going to want to be able to search for words once it's scanned in like it was a regular text document.
    i've used programs like gscan2pdf just to try but obviously with no luck.

    any ideas or help is greatly appreciated
    thank you
    Thermaltake Armor chassis<br />Intel Pentium D 960 @ 3.6Ghz<br />4GB DDR2 memory @ PC2-5300<br />6x internal SATA HDD [4.27TB total]<br />1x external 1,000GB HDD via firewire<br />nVidia GeForce 7950 GX2 Extreme<br />52&quot; LCD HDTV | 1920 X 1080p rez<br />Saitek Cyborg keyboard | SilverStone Raven mouse<br />running Kubuntu 10.04

    #2
    Re: scan in a page as a document, not an image... possible?

    You can take a look at OCRFeeder. It's a Gnome app, but...

    Available for install in the standard repositories, so you can install it from your package manager or from the command line (sudo apt-get install ocrfeeder).
    Description: Document layout analysis and optical character recognition system
    OCRFeeder is a document layout analysis and optical character
    recognition system.
    .
    Given the images it will automatically outline its contents,
    distinguish between what's graphics and text and perform OCR over the
    latter. It generates multiple formats being its main one ODT.
    .
    It features a complete GTK graphical user interface that allows the
    users to correct any unrecognized characters, defined or correct
    bounding boxes, set paragraph styles, clean the input images, import
    PDFs, save and load the project, export everything to multiple
    formats, etc.
    Windows no longer obstructs my view.
    Using Kubuntu Linux since March 23, 2007.
    "It is a capital mistake to theorize before one has data." - Sherlock Holmes

    Comment


      #3
      Re: scan in a page as a document, not an image... possible?

      As an alternative you can use ocr from commandline with the image scanned.

      Or using this: http://kde-look.org/content/show.php...content=121289

      Comment

      Working...
      X