Document Analysis

Monday, 4:00 pm - 5:45 pm

Session Chair: Johan Bollen

A Hierarchical, HMM-based Automatic Evaluation of OCR Accuracy for a Digital Library of Books

Shaolei Feng, R. Manmatha

Combining DOM Tree and Geometric Layout Analysis for Online Medical Journal Article Segmentation

Jie Zou, Daniel Le and George R. Thoma

Automatically Categorizing Figures in Scientific Documents

Xiaonan Lu, James Z. Wang, Prasenjit Mitra and C. Lee Giles

XML Views for Electronic Editions

Ionut E. Iacob and Alex Dekhtyar