Warning: include(/include/othersites.inc): failed to open stream: No such file or directory in /home/content/13/11000713/html/imagemaker-webs/discovery/Nav_Top/TechNotes.php on line 67

Warning: include(): Failed opening '/include/othersites.inc' for inclusion (include_path='.:/usr/local/php5_6/lib/php') in /home/content/13/11000713/html/imagemaker-webs/discovery/Nav_Top/TechNotes.php on line 67

Warning: include(/include/menu.inc): failed to open stream: No such file or directory in /home/content/13/11000713/html/imagemaker-webs/discovery/Nav_Top/TechNotes.php on line 69

Warning: include(): Failed opening '/include/menu.inc' for inclusion (include_path='.:/usr/local/php5_6/lib/php') in /home/content/13/11000713/html/imagemaker-webs/discovery/Nav_Top/TechNotes.php on line 69
Tech Specs

ImageMAKER Discovery Assistant Product Specification

Discovery Assistant is a software product designed to process email, electronic documents and image files to produce TIFF/PDF, metadata, and extracted text.

Processed documents can be exported from Discovery Assistant as TIFF/PDF with extracted text and metadata, and loaded into case management tools such as Summation or Concordance for review and production.

Discovery Assistant is capable of handling millions of documents and scaling across multiple machines.

The EDD Discovery Process

The goal of the electronic discovery process is to produce the smallest set of meaningful documents, in a usable format, that meets the required criteria.

The e-discovery process includes the following activities:

  1. Gather documents - identify document and email repositories.
  2. Review documents - use industry standard Search Tools to reduce data volume.
  3. Process documents- use Discovery Assistant to convert to TIFF/PDF, Text and Metadata.
  4. Produce Documents - Load into Case Management Tool for final review.

Product Features Overview

Discovery Assistant supports the following product features:

  • One click install with user-friendly tabbed interface.
  • Batch processing with real-time progress monitoring and reporting.
  • Native file support for Outlook (PST, MSG), Lotus Notes (NSF), Outlook Express EML, Zip, Office (DOC, XLS, PPT), PDF, Html, Txt, TIF, Jpeg, BMP, Gif, PNG.
  • Full extraction of documents from PST, MSG, NSF, EML, ZIP, and Office compound documents (OLE).
  • Full tracking of parent/child relationships.
  • Configurable processing options for Word, Excel, PowerPoint, PDF and HTML documents.
  • Local and Global De-Duplication using MD5 Hash.
  • Native and OCR based text extraction.
  • Removal of blank pages.
  • Bates Stamping, Bates Numbering.
  • Built in Quality Control Review Tool includes the ability to manually replace TIFF images.
  • Built in support for pass-thru for documents that can’t be conveted.
  • Export TIFF, PDF, TEXT, Source and Metadata to CSV, Concordance, Summation, RingTail, and Introspect Case Management load file formats.

Discovery Assistant Downloads:

Main Application:

  • Discovery Assistant.

Add-Ins:

  • Postscript Add-on - Supports text searchable PDF output.
  • 24 bit Color Add-on - Supports 24 bit TIFF color output.

Additional Tools:

  • Report Manager - Converts native XML projects to MDB, XLS and TXT.
  • TeraBite - Break millions of files into manageableimport file lists.
  • TIFF print driver - stand alone TIFF print driver.
  • Introspec IDX tool - conversion add-in tool for Introspec IDX format.

Review and search tools:

  • QuickReview - HTML based client review tool with built in search.

Summary Sheet:

Batch Processing

Load, Process, Review, Export
Supported File Types Outlook (PST, MSG), Lotus Notes (NSF), EML, Zip, DOC, XLS, PPT, PDF, Html, Txt, TIF, Jpeg, BMP, Gif, Png + many more.
Specialized processing XLS, DOC, PPT, OLE, email, zip.
File De-Duplication At File and Message level.  Local and Global.
Metadata Extraction 99 separate Metadata fields
Export Load Files CSV, Summation, Concordance, Opticon, Ipro, Ringtail, Introspec.
Export File Formats Single and Multipage B&W and Color TIFF, scanned PDF, searchable PDF, Postscript, Paper, Text.
Export Files Image, Metadata, Text, original Source
Export Folder Format Options Flat, Volume/Box, Mirror Source, Bates Folder.
Status Reporting Load, Convert, Process, Export,
Blank Page Removal Extremely fast
Assign Doc ID’s Up to 20 alphanumeric chars
Assign Bates Numbers Up to 20 alphanumeric chars
Bates Stamping Supports force white space, 6 page locations.
Print to Paper TIFF or Postscript Blowback
Document Pass-Through For files that can’t be converted
Quality Control Review Module Accept, Reject, Skip, Replace, OCR, View Source
Native Text Extraction From file, or from file print stream
OCR Text Extraction Handles portrait and Landscape images.
OLE Embedded Object ExtractionDOC, PPT and XLS OLE extraction.
Support for Foreign Character SetsMBCS only
Intelligent Process MonitoringMonitors 6 timeout values, auto-close dialogs.
Parent Child RelationshipsMaintains the parent/child/sibling hierarchy
Advanced Email HandlingPST, EML, NSF + 15 MSG types,
Upgrade Path to Multiple ServersScalable architecture
Microsoft Excel Spreadsheet formatting20+ print control settings
Microsoft Word formatting10+ print control settings
Microsoft PowerPoint formatting4+ print control settings
Built in Review and Search ToolDistributable HTML based client review tool with support for indexed search.

Batch Processing

Discovery Assistant allows the following functions to be batched:

  • File Import and local de-duplication.
  • File Conversion to TIFF, PDF, Text, and Metadata.
  • Global De-duplication.
  • Removal of Blank Pages.
  • OCR text extraction.
  • Bates Stamping.
  • Export.
  • Batch Printing

All processed data is stored within a project directory.

Files can be loaded into a Discovery Assistant project in one of 4 ways:

  • Add File
  • Add Folder
  • Add from file list
  • Drag from Windows Explorer into the All Files tab.

To batch load large file sets, download and install the TeraBite utility. This program allows the user to create an ordered file list for loading into Discovery Assistant.

Documents are categorized as follows:

ConvertibleDocument can be converted on conversion machine
Non-ConvertibleDocument cannot be converted. Requires user analysis.
Queued for ConversionIn list of files to convert.
ConvertedConverted to TIFF, Text and Metadata
StampedBates Stamped
FailedFailed while converting. Requires user analysis.

During the conversion process, files are moved from the 'all files' category through to the 'stamped' category through a series of steps. Each step is represented by a tab, which contains a subset of the 'all files' list - representing what stage the conversion process has reached for those files.

Processing can be stopped and re-started at any time.

At any point during the batch process the user can click on a processed file to view:

  • Converted TIFF / PDF file
  • Extracted Text
  • Extracted MetaData
  • Native Source Document
  • Children, Parent, and Duplicates

Supported File Types

Discovery Assistant processes a wide variety of input file types. These include: Microsoft Word documents, Excel spreadsheets, PowerPoint presentations, Outlook email files (PST, MSG), Outlook Express Files (EML), Lotus Notes files (NSF), WordPerfect documents, rich text format files (RTF), Microsoft Visio files, Corel Draw files, Cad Cam files, Lotus 1, 2 and 3 spreadsheets, text files, HTML documents, Adobe Acrobat documents (PDF), compressed archives (ZIP), images (TIF, JPG, BMP, etc.), scanned files and more.

Discovery Assistant uses the native application to petrify documents so any documents that possess a print or printto command on your system can be processed. At time of installation, Discovery Assistant produces a list that identifies any document type with a ‘print’ or ‘printto’ file association.

Discovery Assistant contains a conditional filter to exclude executable files, hidden files, system files, and an optional feature to not process email attachments or sub-directories.

Common Supported file types:
Microsoft Word (DOC, DOCX)All versions, including Office 2010
Excel (XLS, XLSX, CSV)All versions, including Office 2010
PowerPoint (PPT, PPTX)All versions, including Office 2010
Outlook Database (PST, MSG)Recognizes 14 different message types
Outlook Express (DBX, EML)EML only. DBX must be converted to PST first
Rich Text Format (RTF)Uses Word to render
Visio (VSD)2000/2003/2007
Corel Draw (CDR)Corel Draw
Corel Photo Paint (CPT)Corel Photo Paint
Cad Cam (DWG, DXF, DWF)ABViewer from Cad Soft Tools
WordPerfectUses Word Perfect
Lotus 1-2-3 (WKS)Uses Lotus 1-2-3
Lotus Notes Database (NSF)Requires Notes be installed
Text (TXT, DAT, LOG, BAT)ASCII and UNICODE
HTMLUses Internet Explorer to render
Adobe Acrobat (PDF)Uses latest version of Adobe Reader
Compressed Archives (ZIP)Maintains parent/child relationships
Images (TIF, JPG, BMP, PNG, GIF, DCX)Internal built-in Viewer
Scanned filesPass-through option to improve speed
OLE documentsWord, Excel, PowerPoint OLE embedded docs
Complete supported files listing »

File De-duplication

De-duplication can be done at the file level, or message level (multiple attachments). Global De-duplication is supported across multiple projects.

Uses MD5 Hash code to identify duplicates, and a full binary confirm on matches.

Extraction of Metadata and Text

Discovery Assistant supports full metadata extraction from source documents, including MS Office specific tags, Microsoft Outlook email specific tags, and Lotus Notes specific tags. Standard email tags include Date Sent, Time Sent, Subject, Text Body, Html Body, Filename, Author, File Size, File Date, File Time, email header information, To, From. Lotus Notes specific metadata includes IMLog, Appointment, Bookmark, Notice, Phone Message, Return Receipt, and Task Form.

Document text is extracted at time of printing, either from the document itself, or from the print stream. This ensures 100% accuracy. If the document text cannot be extracted (perhaps because the document is an image file) then the user has the option to OCR the document after conversion to TIFF (petrification). Discovery Assistant uses the Microsoft Office OCR engine to extract text from image files (99% accuracy). If Word 2003 or Word 2007 OCR engine is not available, then Discovery Assistant uses its own built-in OCR engine.

The full set of extractable Metadata fields is:
1 ITEMID Discovery Assistant file ID
2 BEGINDOC  Export file title of first page
3 ENDDOC  Export file title of last page
4 APPLICATION_NAME  Name of creating application
5 ATTACHMENTSCOUNT  Count of attachments
6 ATTACHLIST  List of export file titles of attachments
7 ATTACHMENTRANGE  Range of export file titles of attachments
8 GROUPRANGE  Range of export file titles that belong as a group. e.g. an email and its  attachments or a zip file and its contents.
9 BATESGROUPRANGE  Range of Bates Numbers that belong as a group. e.g. an email and its  attachments or a zip file and its contents
10 BEGATTACH  Export file title of first page of group. e.g. an email and its attachments  or a zip file and its contents
11 ENDATTACH  Export file title of last page of group. e.g. an email and its attachments  or a zip file and its contents
12 ATTACHTITLE  File title of attachment
13 AUTHOR  Document author
14 LASTAUTHOR  Last Document author
15 REVNUM  Last Document author
16 BATESBEG  Beginning Bates number
17 BATESEND  Ending Bates number
18 BATESBEGGROUP  Beginning Bates number for group. e.g. an email and its attachments  or a zip file and its contents
19 BATESENDGROUP  Ending Bates number for group.  e.g. an email and its attachments  or a zip file and its contents
20 BCC  Blind Carbon Copy recipient
21 CC  Carbon Copy recipient
22 DACOMMENT  Discovery Assistant Pass-Through comment
23 DOCTEXT  Document Text
24 FILECREATIONDATE  Source document creation date
25 FILECREATIONTIME  Source document creation time
26 FILEMODIFYDATE  Source document modified date
27 FILEMODIFYTIME  Source document modified time
28 FILEACCESSDATE  Source Document Last Access Date
29 FILEACCESSTIME  Source Document Last Access Time
30 FILEPRINTDATE  Source Document Last Print Date
31 FILEPRINTTIME  Source Document Last Print Time
32 SENTDATE  Email sent date
33 SENTTIME  Email sent time
34 RECEIVEDDATE  Email received date
35 RECEIVEDTIME  Email received time
36 DOCTITLE  Document Title
37 DUPPATHS  Source document paths of duplicate items
38 BODY  Body of email
39 FILEEXTENSION  Source file extension
40 FILEPATHNAME  Source file path
41 EXPORTEDSOURCEFILEPATHNAME  Exported source file path
42 FILENAME  Source file name (including extension)
43 FILEDISPLAYNAME  Source file title
44 FILETYPENAME  Source file type name
45 PARENT  Email parent folder name
46 FROM  Email From address
47 HASHCODE  MD5 hash code value for source document
48 ISDUP  True/False is duplicate
49 MSGID  Email message ID
50 DOCUMENTPAGES  Output file page count
51 PARENTID  Export file title of parent item
52 SHORTFILETITLE  Short file title
53 OBJECTSIZE  Source file size on disk
54 STOREID  Message store identifier
55 STORENAME  Message store source file name
56 SUBJECT  Email subject
57 SENTTO  Email To address
58 ITEMINDEX  Item Index
59 INETHEADER  Internet Header
60 DOCID  Document ID
61 ALTRCPALLOW  Alternate Recipient Allowed
62 AUTOFWD  Auto Forwarded
63 BILLINFO  Billing Information
64 CATEGOR  Categories
65 COMPANIES  Companies
66 CNVINDEX  Conversation Index
67 CNVTOPIC  Conversation Topic
68 DEFDLVDATE  Deferred Delivery Date
69 DEFDLVTIME  Deferred Delivery Time
70 DELAFTSUB  Delete After Submit
71 EXPIRYDATE  Expiry Date
72 EXPIRYTIME  Expiry Time
73 HTMLBODY  HTML Message Body
74 IMPORTANCE  Importance
75 MSGCLASS  Message Class
76 MSGMLG  Message Mileage
77 NOAGING  No Aging
78 DLVRPTREQ  Originator Delivery Report Requested
79 OLINTVER  Outlook Internal Version
80 OLVER  Outlook Version
81 RDRECREQ  Read Receipt Requested
82 RCVBYNAME  Received By Name
83 RCVONBEHALFNAME  Received On Behalf Of Name
84 RCPREASSPROHIB  Recipient Reassignment Prohibited
85 REPLRECIPS  Reply Recipients
86 SAVED  Saved
87 SENSITIVITY  Sensitivity
88 SENT  Sent
89 SNTONBEHALFNAME  Sent On Behalf Of Name
90 SUBMITTED  Submitted
91 UNREAD  UnRead
92 READ  Message read y/n?
93 VOTINGOPT  Voting Options
94 VOTINGRESP  Voting Response
95 GLOBALPRIMARY  'Yes' if this is the first occurrence of this item in the global table.
96 GLOBALCOUNT  Count of occurrences of this item in the Global Project table.
97 SRCCUSTOD  Source Custodian.  Obtained from third to last directory name in source  file path.
98 SRCBOX  Source Box. Obtained from second to last directory name in source  file path.
99 SRCFOLDER  Source Folder.  Obtained from last directory name in source file path.

Export Options

Discovery Assistant supports a variety of export options including:

Export Formats:

  • Summation eDii Load File
  • Concordance DAT or DCB files
  • Opticon Log files
  • Ringtail Load File
  • Ipro Load File
  • Comma Separate Value (CSV)
  • Introspec IDX Load File

File formats:

  • Single and multi page B&W TIFF.
  • Single and multi page Color TIFF.
  • Multi-page scanned PDF.
  • Multi-page text searchable PDF.
  • Multi-page Postscript.
  • Paper only (print direct to printer).
  • Text only

Supported Output resolutions and page size (determined at time of conversion):

  • Resolutions: 100x100, 200x200, 300x300, 400x400
  • Letter, Legal and A4 output page sizes

Additional export file types:

  • Metadata Text.
  • Body Text.
  • Original source file.

Folder Support:

  • Flat
  • Volume / Box
  • Mirror Source
  • Bates Folder

Export File Naming Schemes:

  • Project ID
  • File ID
  • File Title
  • Short File Title
  • File Extension
  • Bates Start
  • Bates End
  • Page Number
  • Bates Number
  • Document ID

Status Reporting

Discovery Assistant generates reports for each type of batch process. Reports are stored with the project, and can be reviewed chronologically using Windows Explorer file manager.

Reports indicate any warnings, errors, time to complete the function, and any additional information that might be necessary to track the progress of one or more files through the process.

Reports are generated at time of:

  • File Import
  • File Conversion
  • File Export
  • Batch de-blanking
  • Batch OCR
  • Batch Bates Stamping

Blank Page Removal (Page Deblanking)

Spreadsheet files by default are set to print the whole sheet, not the default print range. The printing process does generate a large number of blank pages.

Discovery Assistant can batch process files to detect and eliminate blank pages. The process is extremely fast, and can go through 1000’s of pages in a second looking for blank pages.

Document ID's and Bates Numbers

Converted documents can be assigned both a Bates Number and a Document ID. Assigned ID’s support up to 20 alpha-numeric characters.

Bates Range values are automatically assigned to parent items when Bates Numbers are assigned in ‘child next’ order.

Stamping

Discovery Assistant includes the ability to stamp converted files with specific information. The user is able to control the stamp content from the Stamp Options. Possible stamps are: Bates number, document ID, file name, file path, file type, page number, number of pages and many more. Discovery Assistant supports a user selectable option to shrink the document image so that stamps do not obscure image data.

Stamp fields can be place in one of 6 coordinate positions. Top left, Top Center, Top Right, Bottom Left, Bottom Center, Bottom Right.

Stamp text fields can include a combination of the following data fields (plus user defined strings)

  • Project ID
  • File ID
  • File Name
  • File Path
  • File Type
  • Page Number
  • Total Pages
  • Bates Page
  • Bates Number
  • Document ID
  • Confidential
  • Global Primary
  • Global Count
  • Hash Code

Sort Options

Any field type tracked and displayed by Discovery Assistant can also be sorted. Sort fields include status, filename, source path, size, date/time to name just a few.

Printing a Hard Copy

Stamped or Unstamped processed TIFF / Postscript files can be spooled to a standard printer. Metadata pages can be printed as slip sleeves.

Pass Through

Some files cannot be converted, perhaps because they are password protected, corrupt, or unprintable (such as WAV files). These files can be ‘passed through’ as a converted file by creating a placeholder file. The placeholder file includes the file’s metadata fields and a user configurable message.

Quality Control Review Module

The QC module allows users to review the conversion results, and to re-queue any incorrectly converted documents, and/or to manually replace TIFF files.

Functional features include:

  • Accept
  • Reject
  • Skip
  • Replace
  • OCR
  • View original native file

Documents can be reviewed one page at a time. Text scrolls to match the current page.

Manual TIFF Replacement

Users can manually replace converted TIFF files using the 'Replace TIFF' function in the Quality Control Review Module. Users can create alternate TIFF renderings by printing source documents to the ‘Print To Tiff’ print driver supplied with Discovery Assistant. TIFF and Text files for the current document can then be manually replaced.

Native Text Extraction

Discovery Assistant extracts the text directly from the native document at time of import, or if not easily accessible, at time of conversion. Foreign language sets are handled as MBCS characters (Multi-byte Character Set)

OCR Text Extraction

Discovery Assistant extracts text at time of import (most email messages), or at time of conversion (most email attachments). If the file is an image file, then the text can only be extracted by doing OCR. By default Discovery Assistant uses the OCR engine installed with Office 2003 or Office 2007 (Nuance Omnipage). If you have Office 2000, then a simpler OCR engine is used as the default (recommend upgrading to Office 2003).

OLE Embedded Object Extraction

If you select the Options 'Extract OLE objects' in the Options tab / MS Office / Extract OLE embedded objects, Discovery Assistant will identify and extract all OLE objects embedded in an office document.

Typically OLE objects include embedded spreadsheets, or embedded pictures or documents. These embedded objects may be displayed as an icon, or a portion of the document only. By extracting OLE objects, you are assured of getting every searchable string in the document.

Support for Foreign Character Sets

Discovery Assistant supports MBCS (Multi-Byte Character Strings) in extracted Metadata. Printed images will retain the original font and font characteristics.

Intelligent Process Monitoring

Discovery Assistant monitors file conversion status in real time, and includes intelligence to 'auto-close' and 'auto-kill' stuck applications. This ensures that the conversion process runs un-attended for hundreds of thousands of conversions.

5 timeout values (user configurable) are continuously monitored to ensure that no print job gets stuck: Job Start, First Page, Next Page, Max Pages, and Total Job.

Parent Child Relationships

Parent-Child relationships are fully maintained throughout the process. The user interface includes a ‘goto parent’, and ‘list children’ buttons that are accessible at all times. Unlimited nesting of parent/child relationships is fully supported.

Bates Number and Document ID number assignments track parent/child relationships.

De-duplication and Global De-duplication can be configured to only skip duplicates if parent item is also a skipped duplicate, so no document will be orphaned in the output set.

Exported MetaData fields include multiple Parent/Child group range fields.

Advanced Email Handling

Discovery Assistant fully supports email and attachment extraction from Outlook PST and MSG files, Lotus Notes NSF databases, Outlook Express EML files, and iCalendar and vCalendar files (ICS/VCS).

Specialized code is in place to support:

  • Conversation Topic and Conversation Index in Outlook
  • UTC Date/Times
  • iCalendar and vCalendar File Formats (ICS / VCS):
  • Maintaining Parent / Child relationships
  • Maintain all linking between emails
  • Keep folder structure within email stores
  • Preserve color fonts of native emails.

Recognized Microsoft Outlook message types include:

  • MailItem
  • ApptItem
  • ContactItem
  • DistListItem
  • JournalItem
  • MeetingItem
  • NoteItem
  • PostItem
  • ReportItem
  • TaskItem
  • TaskRequestAcceptItem
  • TaskRequestDeclineItem
  • TaskRequestItem
  • TaskRequestUpdateItem
  • Attachment

Recognized Lotus Notes message types include:

  • Standard Email form type
  • IMLog
  • Appointment
  • Bookmark
  • Notice
  • Phone Message
  • Return Receipt
  • Task

Upgrade path to support multiple servers

For large batch jobs that are too big for a single machine, we provide an upgrade path that allows multiple servers to attach to the same conversion database to help process jobs and improve throughput.

Additional tools required are:

  • TeraBite - takes large datasets, and breaks them down into smaller projects
  • Global Dedup – allows you to de-duplicate across projects

Microsoft Excel Spreadsheet formatting control

Discovery Assistant incorporates special spreadsheet controls to improve the quality of printed spreadsheets. Some of these controls include un-hiding cells columns and rows, printing formulas out separate, automatic resizing, setting scaling to fit, defining relative scale and suppressing headers/footers.

Unhide Cells Yes
Unhide Columns Yes
Unhide Rows Yes
Unhide Worksheet Yes
Clear grid lines ON/OFF/Default
Remove headers and footers Yes
Set Print order Yes
Set Print orientation Landscape / Portrait
Set Print order Yes
Set Paper size Yes
Print Comments Yes
Unhide Formula's Formulas can be printed out separate
Automatic resizing Yes
Set scaling to fit Yes
Define relative scale Yes
Suppress Headers / Footers Yes
Suppress Row/Col headings Next Version
Date/Time fields: show field codes instead Next Version
Filename fields: show field codes insteadNext Version

Microsoft Word formatting control

Specialized handling for Microsoft Word files includes:
Unhide Cells Yes
Unhide Columns Yes
Do Next Versiont print headers Next Version
Do Next Versiont print footers Next Version
Force manual repagination before printing Next Version
Log warnings for Word 95/6.0 documents Next Version
Show Comments Next Version
Log warnings for Word 95/6.0 documents Next Version
Date/Time fields: Show field codes Next Version
Filename fields: Show field codes Next Version

Microsoft PowerPoint formatting control

Specialized handling for Microsoft PowerPoint files includes:

Set Scaling Yes
Frame Slides Yes
Print hidden slides Yes
Force output to Grey/B&W Yes

Built-in Review and Search Tool

ImageMAKER has developed a separate distributable HTML based Review Tool with support for indexed search. This tool must be licensed separately for distribution to clients.

Clients can review produced documents using a simple HTML browser interface. There is no need for clients to purchase and learn a case management tool in order to review data.

Indexed search can be done on the data by downloading and installing an inexpensive third party search tool that supports indexed searching. Indexed searching allows users to perform complex search queries with very fast response rates.

Speed Tests

Discovery Assistant is rated at one gigabyte per day per machine. One gig of data averages out to approximately 70,000 pages, and about 5 gigs of converted files storage space.

Actual per page conversion speed is rated at 3,500 pages per hour of straight conversion (20 hours a day). We factor in 4 hours of overhead per day to handle the other house-keeping tasks, like file import / de-duplication / deblanking / bates labeling / exporting etc.

In addition to straight conversion is the time required to:

  • set up the machines and install all appropriate software.
  • quality control review of output data.
  • exception handling.
  • trouble-shootint.

Discovery Assistant comfortably handles the conversion of up to 100,000 files per project. If you are attempting to convert a million files, recommendation is to break the job down into 10 separate projects.

These projects can then be farmed out to up to 10 separate machines.

Internal DataBase Format

The Discovery Assistant internal database format is XML. The XML Project File is loaded into memory for processing, and updated back to the disk after significant project changes, and regularly during the document conversion process.

Discovery Assistant provides an add-on Reporting Tool to convert from XML project files to XLS, MDB, SQL, or text for review and long-term documentation.

Development Language

Discovery Assistant was developed using the latest Microsoft .NET software tools. Code is written in C# (main application), C++ (object modules), and C (print driver components).

Hardware Specifications

Operating Systems:

  • 32 bit Windows 2000, XP, 2003, Vista, Server 2008, Windows 7
  • 64 bit Windows XP, 2003, Vista, Server 2008 (GA and R2), Windows 7
  • Windows Terminal Services / Citrix supported

Processor:

  • Single or multi-processor

Processor Speed:

  • 1.7 Ghz processor (3+ Ghz recommended).

Memory:

  • 1Gb or more recommended, 2Gb for Vista and Windows 7, 4Gb for server operating systems

Hard Drive:

  • At least 1 gig available on system disk and 20 gigs available on data disk.

Database Support:

  • Uses XML. No additional database software required.

Applications:

  • Microsoft .NET run time 1.1, and 2.0 or later for optional components.
  • Lotus Notes
  • Microsoft Outlook
  • Microsoft Office
  • Microsoft Office Tools (OCR engine)
  • Acrobat Reader
  • Internet Explorer
  • Any other applications for file types to be converted.

Windows 7 note: The .Net Framework 1.1 is not supported by Microsoft on Windows 7, but in our experience, Discovery Assistant, which currently uses that framework, has worked flawlessly on both 32- and 64-bit versions.

Scanner Support:

  • No direct scanner support.
  • Can import scanned images as TIFF or PDF files.
  • Recommend that files be scanned to Custodian\Box\Folder directory to maintain source info.

Custom Development

ImageMAKER custom development services are designed to insure that if you are relying on our product to meet your customer’s needs, that we are there to help you through any problems.

Our custom development services include hot fixes, custom features and modifications, and development of new custom modules and export formats.

Maintenance & Support Services

Our Maintenance Support Services ensure that you get the answers you need, when you need them. Includes immediate access to senior development personnel, product fixes and updates.

Contact information

ImageMAKER Development Inc.
#102 - 416 6th Street
New Westminster, B.C.
Canada V3L 3B2

Sales: +1.604.525.2170
Support: +1.604.525.2108
Fax: +1.604.520.0029
E-mail: info@imgmaker.com
Web sites: http://www.discoveryassistant.com

Local (Pacific) time: GMT-8

About ImageMAKER

ImageMAKER Development Inc. consists of a team of experienced and highly qualified individuals that over the last 15 years has developed market leading OEM conversion software for fax, unified messaging, document conversion, -delivery and -storage applications, including Microsoft Mail, Microsoft Small Business Server, Lotus Domino FAX Server and products from T-Mobile, Tiscali, Eastman Kodak, FileNet, Kofax, Canon, EasyLink (Mail.com), MessageClick, MCI, IBM, Hewlett Packard, Cable & Wireless, Motorola AirCommunications, Nortel Networks, Cisco, Lucent Technologies & Octel (now Avaya), Nippon Telephone and Telegraph International, Telecom Finland (Telia Sonera), Alcatel and many more with over 55 million client installs around the world.

ImageMAKER Development current focus is on the delivery of innovative solutions for the next generation of Electronic Data Discovery applications by leveraging its market proven core document conversion technology.

TIFF File Format

Detailed TIFF file format specification more details »

Glossary

Detailed alphabetic list Of terms used on ImageMAKER website. more details »