contentCrawler identifies image-based documents in a content repository and converts them to text-searchable PDFs, ignoring text-based PDFs and documents with limited text. It can run in Backlog mode, Active Monitoring mode, or both. It saves the new documents into the DMS, either as an automated end-to-end process or with “hold for review” stages. In this way, it ensures 100% searchability of documents without requiring staff to run OCR manually.
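The selection step described above can be pictured with a small sketch. This is not contentCrawler's actual logic or API: the `Document` record, the `extracted_chars` field, and the rule that "image-based" means zero extractable characters are all assumptions made purely for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Document:
    """Hypothetical record for a file found in the repository."""
    name: str
    extracted_chars: int  # characters of text already extractable from the file


def needs_ocr(doc: Document) -> bool:
    # Assumption: a document with no extractable text is image-based.
    # Anything that already carries text, even a little, is skipped.
    return doc.extracted_chars == 0


def next_stage(doc: Document, hold_for_review: bool) -> str:
    """Return the stage a document moves to: 'skip', 'review', or 'save'."""
    if not needs_ocr(doc):
        return "skip"
    # With a hold-for-review stage, converted documents wait for approval
    # before being saved; otherwise they go straight to the DMS.
    return "review" if hold_for_review else "save"


def select_for_ocr(docs: List[Document]) -> List[str]:
    """Names of the documents that would be queued for conversion."""
    return [d.name for d in docs if needs_ocr(d)]
```

For example, `select_for_ocr` applied to a scanned contract with no extractable text and a text-based report would queue only the contract, and `next_stage` would route it to review or straight to saving depending on the `hold_for_review` flag.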
contentCrawler is available as a downloadable installation, with global help available. It integrates with:
- MS SharePoint 2007 and 2010
- OpenText Content Server CS10 up to patch 5
- OpenText eDOCS DM 5.3.0/5.3.1
- OpenText Livelink 9.7.1