Hebrew OCR
Flask-based OCR server for the Shimush Tehillim manuscript (Békéscsaba 1936)
שמוש תהלים — OCR App
Full-featured Tesseract OCR with image preprocessing, word-level overlay, batch processing, and a book reader mode
▶ Launch OCR App
How to Run
1. Open a terminal
   Navigate to the OCR app directory:
   cd Shimush/hebrew-ocr-app
2. Install dependencies
   The app requires Flask and Pillow, plus Tesseract-OCR, which must be installed on your system separately (pip does not install it):
   pip install flask pillow
3. Run the server
   The server starts on http://localhost:5000:
   python app.py
4. Open the app
   The app opens automatically in your browser. If it does not, click the Launch button above.
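Before starting the server, it can help to confirm that the prerequisites from step 2 are actually present. The following is a minimal sketch, not part of the app itself: the function names `check_prerequisites` and `has_hebrew_data` are hypothetical, and it assumes the Tesseract executable is named `tesseract` and is on your PATH, and that Hebrew recognition uses Tesseract's `heb` language data.

```python
import importlib.util
import shutil
import subprocess


def check_prerequisites():
    """Return a dict mapping each prerequisite to True if it was found."""
    return {
        # Python packages installed by `pip install flask pillow`
        "flask": importlib.util.find_spec("flask") is not None,
        "pillow": importlib.util.find_spec("PIL") is not None,
        # Tesseract is a system program, so look for it on PATH
        # (assumption: the executable is called `tesseract`)
        "tesseract": shutil.which("tesseract") is not None,
    }


def has_hebrew_data():
    """Return True if `tesseract --list-langs` reports the `heb` language.

    Assumption: the app relies on the `heb` language data.
    """
    if shutil.which("tesseract") is None:
        return False
    result = subprocess.run(
        ["tesseract", "--list-langs"],
        capture_output=True,
        text=True,
        check=False,
    )
    # Older Tesseract builds print the list to stderr, newer ones to stdout
    return "heb" in (result.stdout + result.stderr).split()


if __name__ == "__main__":
    for name, found in check_prerequisites().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
    print(f"hebrew data: {'found' if has_hebrew_data() else 'MISSING'}")
```

If any entry is reported as missing, install it before running `python app.py`.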
Features
🖼 Image Preprocessing
🔤 Hebrew OCR (Tesseract)
👆 Word-Level Click Overlay
⚡ Batch 16-Page Processing
📖 Flippable Book Reader
✏️ Inline Correction Tool
📤 Export (TXT, JSON, CSV)
📡 SSE Live Progress
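To illustrate how batch processing and SSE live progress could fit together, here is a small sketch of a generator that emits one Server-Sent Events message per page. It is an illustration under stated assumptions, not the app's actual code: the event names (`progress`, `done`), the payload fields, and the `recognize` callable are all hypothetical. Only the message framing (`event:` and `data:` lines ending with a blank line) follows the SSE wire format.

```python
import json

TOTAL_PAGES = 16  # batch size named in the feature list


def sse_event(payload, event=None):
    """Format one Server-Sent Events message, terminated by a blank line."""
    lines = []
    if event is not None:
        lines.append(f"event: {event}")
    # json.dumps emits a single line, so one `data:` field is enough;
    # ensure_ascii=False keeps Hebrew text readable in the stream
    lines.append(f"data: {json.dumps(payload, ensure_ascii=False)}")
    return "\n".join(lines) + "\n\n"


def batch_progress(pages, recognize):
    """Yield one 'progress' message per page, then a final 'done' message.

    `recognize` is a hypothetical callable that takes a page and returns text.
    """
    total = len(pages)
    for index, page in enumerate(pages, start=1):
        text = recognize(page)
        yield sse_event(
            {"page": index, "total": total, "text": text}, event="progress"
        )
    yield sse_event({"total": total}, event="done")


if __name__ == "__main__":
    # Stand-in page names and a fake recognizer, for demonstration only
    fake_pages = [f"page-{n}.png" for n in range(1, TOTAL_PAGES + 1)]
    for message in batch_progress(fake_pages, recognize=lambda p: f"(text of {p})"):
        print(message, end="")
```

In a Flask app, a generator like this can be returned as `Response(batch_progress(pages, recognize), mimetype="text/event-stream")`, and the browser can consume it with `EventSource`.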