首页>
外国专利>
SYSTEMS AND METHODS FOR ENABLING MANUAL CLASSIFICATION OF UNRECOGNIZED DOCUMENTS TO COMPLETE WORKFLOW FOR ELECTRONIC JOBS AND TO ASSIST MACHINE LEARNING OF A RECOGNITION SYSTEM USING AUTOMATICALLY EXTRACTED FEATURES OF UNRECOGNIZED DOCUMENTS
SYSTEMS AND METHODS FOR ENABLING MANUAL CLASSIFICATION OF UNRECOGNIZED DOCUMENTS TO COMPLETE WORKFLOW FOR ELECTRONIC JOBS AND TO ASSIST MACHINE LEARNING OF A RECOGNITION SYSTEM USING AUTOMATICALLY EXTRACTED FEATURES OF UNRECOGNIZED DOCUMENTS
A method in a document analysis system automatically extracts image and text features from each received electronic document and compares the extracted features with feature sets associated with each category of document to determine whether the document is recognizable as belonging to a document category. If an electronic document is recognized as belonging to one of the document categories, the method classifies the electronic document as belonging to that document category. If, however, an electronic document is unrecognized, the method submits the unrecognized document to a learning phase, in which the unrecognized document is presented to a human trainer for manual classification of the unrecognized electronic document into a document category, and automatically modifies at least one of the features and the weights of the feature set of the document category corresponding to the manually-classified electronic document using the automatically extracted features of the manually-classified document.
展开▼