Monday, July 30, 2007

Using MOSS to eliminate paper: multifunctionals and scanners

This is the first post of a planned series of postings on the topic of using SharePoint to get rid of paper. My intention is to share gathered knowledge on the matter, which means that I don't have a clear idea on what to write in the following articles, but that I share knowledge as it comes along. I welcome all input from the community so please let me know your thoughts and experiences.

This first posting is on the use of multifunctional devices (MFDs) and other scanning devices.

Multifunctionals

MFDs (also called MFPs) are printing devices that have other capabilities, like scanning, as well (For this article, I want to focus the machines that have scanning capabilities as this is what will make the connection with MOSS). MFDs come in various sizes: from a SoHo small footprint type, to a large printing production environment one that even staples your printed books. MFDs can be found in the repro area or on the work floor. MFDs are centralized and you'll have only a few or perhaps no more than one on a site.


Multifunctional solutions usually require some middleware of the MFD supplier to route scanned documents to another system, like MOSS. The MFD itself will have functionality build in to determine the type of content and the middleware will take care of the output type (XML, PDF, TIFF etc) and routing to a document management system. I have not seen a solution yet that can handle MOSS content types. In fact, some MFD manufacturers only mention SharePoint 2003 support.

The big plus for MFDs is that they have simple interfaces that can be used by nearly everyone. As nearly every office will have a printer/copier, an MFD with scanning capabilities can easily replace the old printer/copier.


pros

  • easy to use
  • nearly every office will require a printer/copier, so a machine with scanning capabilities does not require extra space or extra investment
  • can handle bulk

cons

  • expensive machines
  • often pay per print/scan
  • often proprietary software

Scanners

Scanners are office territory and usually reside at someone's desk. You also need a computer to access its functionality and to transfer the scanned documents to a document management system. The scanned documents are first stored in a local cache so they can be worked on before transferring them. This includes enhancing the scan and adding metadata, also referred to as indexing. Not all solutions integrate with the MOSS content types, so if this is a requirement, please take that into account when making your choice.


Some solutions offer routing, which means you can target a document to a specific location in MOSS.


pros

  • often a cheap solution
  • flexible, as the scanner can be moved easily
  • easy to install software
  • scans can be enhanced before sending them to a document management system

cons


  • mostly used decentralized, so in an office
  • device can only scan
  • needs a PC to interface, so the user should know how to operate. This makes the solution less suitable for non-office use (like factories)

Integration with MOSS

As far as tying into the MOSS features, the local scanners offer more. I have not seen an MFD solution that fully uses content types or site columns, whereas local scanning solutions do offer it. Sometimes, the middleware in the MFD solution offers a client connection to review the scans before sending them to MOSS. The picture below provides a graphical overview of how the two types work. I know I'm leaving out all the detailed steps, but at least you'll get the idea.







If this were a competition, the MFDs would win when it comes to handling bulk, like in a mail room. They would also be the preferred solution to handle standard forms in a factory like environment.

Here's a (limited) feature comparison:





Sources


http://www.knowledgelake.com/solutions/9_sharepoint-2007-software.asp http://www.ecopy.com/Products_eCopy_Connector_for_Microsoft_SharePoint.asp http://www.nuance.com/paperport/professional/features.asp http://www.omtool.com/products/accuroute.cfm http://www.hp.com/large/ipg/docmanage/doc_capture.html http://www.searchexpress.com/sharepoint.htm http://www.kn.nl/navigatie/ (Dutch) http://www.kofax.com/products/ascent/capture/index.asp http://www.nsius.com/Products/AutoStoreWorkflow/Overview/tabid/61/Default.aspx http://www.office.xerox.com/software-solutions/xerox-smartsend/engb.html

12 comments:

Henry Ong said...

Awesome write up, thanks for the information!

Renatas Lauzadis [MSFT] said...

1. In Windows you already have scanned of fax document routing.

2. It is worth mentioning that there is a problem with multipage document scanning. It is oftern possible to PDF or TIFF only, . I would like to have an option to save to XPS too. Problem is that there no OOTB IFilter to index such documents.

3. Routing middleware could be SharePoint too. In that case possible routing endpoints may be determined by user. See Records management functionality.

4. You can get rid of Middleware because most of serious network MFD can email scanned document and that means they can email documents to MOSS document library, where document may be processed using handlers or workflows.

5. Most serious need is digital forms OCR software, which is hard to find. Such software could OCR scan directly to InfoPath form.

Sjoert Ebben said...

A quick reply before my holidays:

1. OK, but I'm not sure how MFDs can cope with that.

2. This might depend on the software used. I'll have to look into that.

3. Could be, but some MFD suppliers will bring in their own middleare so you can't always get rid of it. Also, some middleware includes editing functionality that SharePoint doesn't provide. Furthermore, Records Management iin MOSS is still v1.0 and not that easy to set up. I don't say it can't be done, but other routing software is way ahead.

4. I've played around with this scenario but I'm not convinced it's as powerfull as the existing middleware solutions. However, I'm sure it can be build. This also depends on the MFD supplier as some will tie in their middleware with their MFDs.

5. Scanning to an InfoPath form would be cool. I know that Konica has software that can handle forms. They have done a trial at a European transport company where they have set up their local MFDs to scan cargo forms to a central location. Forms OCR software is provided by, amongst others, CharacTell (http://www.charactell.com).

Mike said...

Sjoert, nice blog you have here!

What I am missing in your story is whether documents are OCR'ed or not, apart from OCR forms I think it is important that you do OCR anyway.

"I have not seen a solution yet that can handle MOSS content types"

Just a thought... Most MFD's I know can email, you could let them mail your scanned document to a doc. lib that has a workflow and a content type attached. The workflow would take care for the fact that the required fields of the content type were filled in.

Another one for your link list and a tool we used for a client: http://www.magsoft.nl/producten/orion/scan/sharepoint.htm.

Oddbjørn said...

Another one for your link list ;

http://www.scanflowstore.com/en-US/Default.aspx?PageRootId=56&ContentType=3&ContentTitle=Microsoft%20SharePoint&Id=43

Anonymous said...

Dark Blue Duck provide a cool Scanning Enabler product designed for Windows SharePoint Services and MOSS http://www.darkblueduck.com/Products/ScanningEnablerWSS.aspx

Gustavo said...

Hi,
Take a look to this general solution:
http://www.spsproparts.net/products/synchronizer
http://www.spsproparts.net/products/synchronizer/synchronizerconsole

wss101 said...

what is the best ocr avaliable , i use a xerox scanner/pritn/copies, ot scans a pdf image, iw ant to aocr it and be able tobe indexed bu moss. what is the best s/w or best scanner availabel to do this.
thanks
sunny

Sjoert Ebben said...

Wss101: I'm not in a position to call one piece of software the best. I do know that some multifunctional vendors use OmniPage by Nuance (http://www.nuance.com/omnipage/). OmniPage is available as an SDK version as well to be used in your own software.

Anonymous said...

re: what is the best ocr avaliable

The Dark Blue Duck Scanning Enabler product uses Microsoft Office's OCR (using MODI).

The recognised text is added to the saved .pdf file.

The Windows SharePoint Services and MOSS search engine indexes the recognised text. Matching pdf files are shown in the normal SharePoint Search results page.

The OCR functionality provided by Microsoft Office is pretty good.

Dark Blue Duck said...

Great to see that our product has been recommended here.

re: I have not seen a solution yet that can handle MOSS content types

Please see:

http://www.darkblueduck.com/Screencast/coversheetRoutingDocumentLibrary.aspx

and please let us know what you think.

Kind regards

Dark Blue Duck

Admero, Inc. said...

Another one site for Document Imaging

http://www.ademero.com/products/document-imaging-software-system/