This repository provides a comparative evaluation of DeepSeek-OCR and OLMOCR-2 on the OmniDocBench benchmark. The evaluation assesses document parsing capabilities across text, formulas, tables, and reading order.
- DeepSeek-OCR: A vLLM-based multimodal pipeline for document understanding.
- OLMOCR-2: An efficient OCR system using open visual language models.
- OmniDocBench: A comprehensive benchmark with 1,355 annotated PDF pages covering diverse document types.
- Follow the setup instructions in `OmniDocBench/README.md`.
- Follow the setup instructions in `olmocr/README.md`.
- Follow the installation guide in `DeepSeek-OCR-master/README.md`.
We used the HuggingFace version of OmniDocBench and based all our evaluations on it (available at [link]).
For OLMOCR-2, first convert the OmniDocBench images to PDFs:

```bash
python utils/image_to_pdf.py
```

Our outputs can be found in the `markdown_olmo_ocr_2` folder.
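If you need a reference for the conversion step, here is a minimal sketch of what an image-to-PDF pass can look like (an illustration only, not the actual contents of `utils/image_to_pdf.py`; paths are examples):

```python
# Illustrative sketch only -- NOT the actual utils/image_to_pdf.py.
# Converts every image in a directory into a single-page PDF.
from pathlib import Path
from PIL import Image

def images_to_pdfs(image_dir: str, pdf_dir: str) -> None:
    out = Path(pdf_dir)
    out.mkdir(parents=True, exist_ok=True)
    for img_path in sorted(Path(image_dir).iterdir()):
        if img_path.suffix.lower() not in {".jpg", ".jpeg", ".png"}:
            continue
        with Image.open(img_path) as img:
            # PDF export needs RGB; images with an alpha channel would fail.
            img.convert("RGB").save(out / f"{img_path.stem}.pdf")

if __name__ == "__main__":
    images_to_pdfs("OmniDocBench/images", "OmniDocBench/pdfs")
```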
- Navigate to the DeepSeek-OCR directory:

  ```bash
  cd DeepSeek-OCR-master/DeepSeek-OCR-vllm
  ```

- Configure paths in `config.py` (a sketch follows below):

  - Set `INPUT_PATH` to the OmniDocBench images directory (e.g., `../../OmniDocBench/images/`)
  - Set `OUTPUT_PATH` to a directory for output `.md` files (e.g., `../../outputs/deepseek_ocr/`)
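  A minimal sketch of these two settings (values are the example paths above; any other variables in `config.py` stay as the repository defines them):

  ```python
  # config.py -- illustrative values; only INPUT_PATH and OUTPUT_PATH are
  # documented above, everything else in the file is left as shipped.
  INPUT_PATH = "../../OmniDocBench/images/"    # OmniDocBench page images
  OUTPUT_PATH = "../../outputs/deepseek_ocr/"  # where .md predictions land
  ```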
- Run inference on images:

  ```bash
  python run_dpsk_ocr_eval_batch.py
  ```

  This will process all images and generate corresponding `.md` files in the output directory. For evaluation, remember to use the cleaned `.md` files, which we generated and placed in `./tools/cleaned_markdown/`; a hypothetical cleaning pass is sketched below.
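  The cleaned files ship in `tools/cleaned_markdown/`; as an assumption about what such a cleaning pass can look like (not the repository's actual method), here is a sketch that strips image references and collapses extra blank lines:

  ```python
  # Hypothetical cleaning pass (NOT the repository's actual script):
  # strip markdown image references and collapse runs of blank lines.
  import re
  from pathlib import Path

  def clean_markdown(src_dir: str, dst_dir: str) -> None:
      out = Path(dst_dir)
      out.mkdir(parents=True, exist_ok=True)
      for md in Path(src_dir).glob("*.md"):
          text = md.read_text(encoding="utf-8")
          text = re.sub(r"!\[[^\]]*\]\([^)]*\)", "", text)  # drop image refs
          text = re.sub(r"\n{3,}", "\n\n", text)            # collapse blanks
          (out / md.name).write_text(text, encoding="utf-8")
  ```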
- Navigate to the olmocr directory:

  ```bash
  cd olmocr
  ```

- Run inference on PDFs:

  ```bash
  python -m olmocr.pipeline ./localworkspace --markdown --pdfs tests/gnarly_pdfs/*.pdf
  ```

  Replace `tests/gnarly_pdfs/` with the directory that contains your PDF files. The `--markdown` flag ensures `.md` files are generated in the workspace's `markdown/` subdirectory.
Use OmniDocBench's evaluation scripts to compare the generated outputs.
- Configure `OmniDocBench/configs/md2md.yaml` (a sketch follows below):

  - Set `ground_truth.data_path` to `OmniDocBench/OmniDocBench.json`
  - Set `ground_truth.page_info` to `OmniDocBench/OmniDocBench.json`
  - Set `prediction.data_path` to the directory containing model outputs (e.g., `outputs/deepseek_ocr/` or `olmocr_workspace/markdown/`)
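  For reference, a sketch of the relevant portion of the config (nesting is inferred from the dotted keys above; the exact schema and all other fields are as OmniDocBench ships them):

  ```yaml
  # Sketch only: nesting inferred from the dotted keys above.
  ground_truth:
    data_path: OmniDocBench/OmniDocBench.json
    page_info: OmniDocBench/OmniDocBench.json
  prediction:
    data_path: outputs/deepseek_ocr/  # or olmocr_workspace/markdown/
  ```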
- Run the evaluation:

  ```bash
  cd OmniDocBench
  python pdf_validation.py --config configs/md2md.yaml
  ```
After evaluation, results are stored in `OmniDocBench/result/`. Use the notebooks in `OmniDocBench/tools/` to generate comparison tables and visualizations. Our results are also available in the `results/` folder.
Key metrics include:
- Text accuracy (normalized edit distance)
- Formula accuracy (edit-distance score)
- Table TEDS score
- Reading order accuracy
- Overall score: `((1 - text_edit) × 100 + table_teds + (1 - edit_distance) × 100) / 3`, i.e., the mean of three per-task scores on a 0-100 scale (a sketch of this computation follows the list)
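A minimal sketch of that computation (parameter names mirror the formula above; `edit_distance` is whichever edit-distance metric the formula's third term refers to):

```python
def overall_score(text_edit: float, table_teds: float, edit_distance: float) -> float:
    """Mean of three per-task scores on a 0-100 scale.

    text_edit and edit_distance are normalized edit distances in [0, 1]
    (lower is better); table_teds is a TEDS score already on [0, 100].
    """
    return ((1 - text_edit) * 100 + table_teds + (1 - edit_distance) * 100) / 3

# Example: text edit 0.08, TEDS 85.0, other edit distance 0.25 -> 84.0
print(round(overall_score(0.08, 85.0, 0.25), 2))
```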
Based on the evaluation:
- DeepSeek-OCR achieves an overall score of 84.24%
- OLMOCR-2 achieves an overall score of 81.56%
- DeepSeek-OCR shows strengths in text and table recovery
- Both models perform well on reading order but have room for improvement in formula parsing
See REPORT.md for detailed results and visualizations.