Free Hebrew OCR — עברית PDF to Text
Free Hebrew OCR for PDF and images. Preserves right-to-left layout, supports Hebrew עברית script with nikud diacritics. Browser-based, no upload.
About Hebrew OCR
עברית OCR — Hebrew OCR extracts right-to-left Hebrew text from scanned PDFs and images. The Hebrew alphabet (אלפבית) has 22 consonants with 5 final forms (ך ם ן ף ץ), plus optional niqqud vowel pointing (נקוד) and cantillation marks (טעמים) used in religious texts. Our engine handles both modern Israeli Hebrew and Biblical/Rabbinic Hebrew.
אנחנו משתמשים במודל Tesseract 5 עברי LSTM — מאומן על מסמכים ממשלתיים ישראליים, עיתונים (הארץ, ידיעות אחרונות, ישראל היום), וספרי לימוד. כל העיבוד מתבצע בדפדפן שלך — תעודת זהות, דרכון, חוזים לעולם לא מועלים לשרתים. 100% חינם, ללא רישום, ללא סימן מים.
Key Features
- Full 22-letter Hebrew alphabet plus 5 sofit (final) forms
- Niqqud (נקוד) — optional vowel pointing for religious and instructional texts
- Cantillation marks (טעמי המקרא) — captured in Biblical Hebrew when present
- Mixed Hebrew-English — very common in Israeli academic and business documents
- RTL text flow with proper Unicode bidirectional markers in output
- Handles both modern Israeli Hebrew and classical/Rabbinic Hebrew (Rashi script also partially)
- In-browser only — תעודת זהות, דרכון, חוזים never upload
How to Use Free Hebrew OCR — עברית PDF to Text
- Step 1: גרור את קובץ ה-PDF או התמונה בעברית (multi-page supported)
- Step 2: Hebrew (עברית) is pre-selected as the OCR language
- Step 3: Add English/Arabic as secondary for mixed bilingual Israeli documents
- Step 4: לחץ על "חלץ" — Hebrew script recognised page-by-page
- Step 5: העתק טקסט או הורד כ-.docx / searchable PDF
Who Uses This Tool
- עורכי דין digitising Hebrew legal contracts and court filings
- Yeshiva students extracting text from תלמוד, משנה, and Rabbinic commentaries
- Researchers processing Israeli government gazettes and historical Zionist archives
- Businesses converting scanned Hebrew חוזים (contracts) and דוחות כספיים
- Genealogists working with old Jewish community records and Yizkor books
Why Choose PDF AI Tools
We've built PDF AI Tools to replace expensive desktop software like Adobe Acrobat for 95% of common document workflows — at zero cost to you. Unlike competitors who gate features behind paywalls, add watermarks, or limit file sizes, our tools are genuinely free and genuinely unlimited. Your privacy matters: files processed client-side in your browser never touch our servers, and even AI-powered features use encrypted, auto-deleting processing pipelines.
Frequently Asked Questions
האם אפשר לקרוא כתב יד עברי?
לא. Tesseract Hebrew model supports printed text only. Handwritten Hebrew (including traditional כתב סת"ם calligraphy) is not reliably recognised. Modern Israeli cursive (כתב יד מודרני) may reach 40-60% on clean samples but is not reliable.
Does it handle Biblical Hebrew with niqqud?
Yes. Biblical Hebrew (תנ"ך) text with full niqqud vowel pointing works at 88-93% accuracy on clean prints (BHS, JPS editions). Cantillation marks are captured when present. Rashi script (used for medieval commentary) works at ~80% accuracy.
דיוק על עיתונים ישראליים?
Clean printed Israeli newspapers (הארץ, ידיעות אחרונות, ישראל היום, מעריב) reach 93-96% accuracy. Ultra-Orthodox papers using traditional fonts (המודיע, יתד נאמן) drop slightly to 88-92%.
Can I OCR Yiddish documents?
Partially. Yiddish uses Hebrew script with additional diacritics and different orthographic conventions. The Hebrew model catches ~80% of standard Yiddish. For dedicated Yiddish OCR, specialised models exist — we're evaluating adding one.
האם המסמכים שלי מוגנים?
בהחלט. כל ה-OCR פועל בדפדפן שלך — תעודת זהות, דרכון, חוזים, דוחות בנק לעולם לא עוזבים את המכשיר שלך. Privacy enforced by the architecture itself, not just policy.