Data Capture

September 1996 - January 1997.

The Industry.Net site received documents for Web-publication from outside clients in diverse formats. Handling these documents speedily and cost-effectively is critical.

I put together a proof-of-concept and feasibility evaluation for semi-automated acquisition of announcements and press releases received in paper format. I headed a team of four people to put together a system involving two PCs with scanners to scan, categorize by company, and OCR documents, and thence FTP them automatically to a Solaris box for indexing (via Excite) and automated Web publishing. The system was used at two trade-shows, and revealed very good throughput and cost measures.

Back
Rujith de Silva 1997-05-13