Free Hebrew OCR — עברית PDF to Text

Free Hebrew OCR for PDFs and images. Preserves right-to-left layout and supports Hebrew (עברית) script with niqqud diacritics. Runs in your browser; files are never uploaded.

About Hebrew OCR

Hebrew (עברית) OCR extracts right-to-left Hebrew text from scanned PDFs and images. The Hebrew alphabet (אלפבית) has 22 consonants, five of which have final forms (ך ם ן ף ץ), plus optional niqqud vowel pointing (נקוד) and cantillation marks (טעמים) used in religious texts. Our engine handles both modern Israeli Hebrew and Biblical/Rabbinic Hebrew.
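The categories described above correspond to fixed ranges in the Unicode Hebrew block, so OCR output can be inspected without any OCR library. The following is a minimal sketch, not the tool's own code: the function name `classify_hebrew` is hypothetical, and it assumes the output is plain Unicode text in which points and accents are stored as separate combining characters.

```python
import unicodedata

# The five final (sofit) forms: ך ם ן ף ץ
FINAL_FORMS = set("\u05da\u05dd\u05df\u05e3\u05e5")


def classify_hebrew(text: str) -> dict:
    """Count Hebrew letters, final forms, niqqud points and cantillation marks."""
    counts = {"letters": 0, "final_forms": 0, "niqqud": 0, "cantillation": 0}
    for ch in text:
        cp = ord(ch)
        if 0x05D0 <= cp <= 0x05EA:
            # Alef..tav: the 22 consonants plus their 5 final forms
            counts["letters"] += 1
            if ch in FINAL_FORMS:
                counts["final_forms"] += 1
        elif 0x0591 <= cp <= 0x05AF:
            # Hebrew accents (cantillation marks)
            counts["cantillation"] += 1
        elif 0x05B0 <= cp <= 0x05C7 and unicodedata.category(ch) == "Mn":
            # Vowel points and related dots; the category check skips
            # punctuation in the same range, such as maqaf and sof pasuq
            counts["niqqud"] += 1
    return counts


# "shalom" with full pointing, written with escapes to keep the order explicit
pointed = "\u05e9\u05c1\u05b8\u05dc\u05d5\u05b9\u05dd"
print(classify_hebrew(pointed))
```

A check like this can help confirm whether niqqud or cantillation marks actually survived recognition on a given page.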

We use the Tesseract 5 Hebrew LSTM model, trained on Israeli government documents, newspapers (הארץ, ידיעות אחרונות, ישראל היום), and textbooks. All processing happens in your browser: ID cards, passports, and contracts are never uploaded to a server. 100% free, no registration, no watermark.

Key Features

Full 22-letter Hebrew alphabet plus 5 sofit (final) forms
Niqqud (נקוד) — optional vowel pointing for religious and instructional texts
Cantillation marks (טעמי המקרא) — captured in Biblical Hebrew when present
Mixed Hebrew-English — very common in Israeli academic and business documents
RTL text flow with proper Unicode bidirectional markers in output
Handles both modern Israeli Hebrew and classical/Rabbinic Hebrew, with partial support for Rashi script
In-browser only: ID cards (תעודת זהות), passports (דרכון), and contracts (חוזים) are never uploaded
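The feature list mentions Unicode bidirectional markers in the output but does not say which ones are used. As an illustration only, the sketch below assumes directional isolates (RLI/LRI with PDI) and picks each line's direction from its first strongly directional character. The function names are hypothetical.

```python
import unicodedata

RLI = "\u2067"  # RIGHT-TO-LEFT ISOLATE
LRI = "\u2066"  # LEFT-TO-RIGHT ISOLATE
PDI = "\u2069"  # POP DIRECTIONAL ISOLATE


def base_direction(line: str) -> str:
    """Return 'rtl', 'ltr' or 'neutral' from the first strong character."""
    for ch in line:
        bidi_class = unicodedata.bidirectional(ch)
        if bidi_class in ("R", "AL"):  # Hebrew letters are R, Arabic letters AL
            return "rtl"
        if bidi_class == "L":          # Latin letters and other LTR scripts
            return "ltr"
    return "neutral"                   # digits, spaces, punctuation only


def isolate_line(line: str) -> str:
    """Wrap a line in an isolate matching its base direction (assumed scheme)."""
    direction = base_direction(line)
    if direction == "rtl":
        return RLI + line + PDI
    if direction == "ltr":
        return LRI + line + PDI
    return line


mixed = "\u05e9\u05dc\u05d5\u05dd PDF 2024"  # Hebrew word followed by Latin text
print(base_direction(mixed))
```

Isolating each line keeps an embedded English word or number from reordering neighbouring Hebrew text when the output is pasted into a left-to-right document.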

How to Use Free Hebrew OCR — עברית PDF to Text

Step 1: Drag in your Hebrew PDF or image file (multi-page supported)
Step 2: Hebrew (עברית) is pre-selected as the OCR language
Step 3: Add English or Arabic as a secondary language for mixed bilingual Israeli documents
Step 4: Click "Extract" (חלץ); Hebrew script is recognised page by page
Step 5: Copy the text or download it as .docx or searchable PDF

Who Uses This Tool

Lawyers (עורכי דין) digitising Hebrew legal contracts and court filings
Yeshiva students extracting text from the Talmud (תלמוד), the Mishnah (משנה), and Rabbinic commentaries
Researchers processing Israeli government gazettes and historical Zionist archives
Businesses converting scanned Hebrew contracts (חוזים) and financial statements (דוחות כספיים)
Genealogists working with old Jewish community records and Yizkor books

Why Choose PDF AI Tools

We've built PDF AI Tools to replace expensive desktop software like Adobe Acrobat for 95% of common document workflows — at zero cost to you. Unlike competitors who gate features behind paywalls, add watermarks, or limit file sizes, our tools are genuinely free and genuinely unlimited. Your privacy matters: files processed client-side in your browser never touch our servers, and even AI-powered features use encrypted, auto-deleting processing pipelines.

Frequently Asked Questions

Can it read Hebrew handwriting?

No. The Tesseract Hebrew model supports printed text only. Handwritten Hebrew (including traditional כתב סת"ם calligraphy) is not reliably recognised. Modern Israeli cursive (כתב יד מודרני) may reach 40-60% accuracy on clean samples, which is too low to rely on.

Does it handle Biblical Hebrew with niqqud?

Yes. Biblical Hebrew (תנ"ך) text with full niqqud vowel pointing works at 88-93% accuracy on clean prints (BHS, JPS editions). Cantillation marks are captured when present. Rashi script (used for medieval commentary) works at ~80% accuracy.

How accurate is it on Israeli newspapers?

Clean printed Israeli newspapers (הארץ, ידיעות אחרונות, ישראל היום, מעריב) reach 93-96% accuracy. Ultra-Orthodox papers using traditional fonts (המודיע, יתד נאמן) drop slightly to 88-92%.
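This page does not state how its accuracy percentages are measured. One common convention, assumed here purely for illustration, is character accuracy: one minus the edit distance between a hand-checked reference and the OCR output, divided by the reference length. The sketch below can be used to check results on your own documents; the function names are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: insertions, deletions and substitutions cost 1."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def char_accuracy(reference: str, ocr_output: str) -> float:
    """Character accuracy in [0, 1], relative to the reference length."""
    if not reference:
        return 1.0 if not ocr_output else 0.0
    errors = edit_distance(reference, ocr_output)
    return max(0.0, 1.0 - errors / len(reference))


# Reference "shalom" vs. an output where final mem was misread as samekh
reference = "\u05e9\u05dc\u05d5\u05dd"
misread = "\u05e9\u05dc\u05d5\u05e1"
print(char_accuracy(reference, misread))
```

Because niqqud and cantillation marks are separate code points, a dropped vowel point counts as one character error under this measure, which is one reason pointed text tends to score lower than unpointed text.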

Can I OCR Yiddish documents?

Partially. Yiddish uses Hebrew script with additional diacritics and different orthographic conventions. The Hebrew model catches ~80% of standard Yiddish. For dedicated Yiddish OCR, specialised models exist — we're evaluating adding one.

Are my documents protected?

Absolutely. All OCR runs in your browser: ID cards, passports, contracts, and bank statements never leave your device. Privacy is enforced by the architecture itself, not just by policy.