OCR is an error-prone process, and manually proofreading OCR results is time-consuming and expensive. The errors remaining in OCRed texts can cause serious problems in reading and understanding if the texts carry no reference to the original image representation. As demonstrated in this paper, a hybrid document that combines symbolic representation and image representation may relieve the problem. If an OCRed document is represented properly in HTML, OCR errors will not have much negative effect on the human reading process in an HTML browser and can be corrected with an HTML authoring tool. Using this approach, an experiment evaluating a Japanese OCR system developed at CEDAR is also reported in this paper.