FastOCR

Vietnamese OCR

Extract Vietnamese text from images and scanned documents

Free · No registration for images · AI-powered

Drop your file here

PNG, JPG, PDF

Tone mark recognition

Handles all six Vietnamese tones (five written tone marks plus the unmarked level tone) along with the vowel diacritics.

Stacked diacritics

Correctly reads characters that carry two marks, such as ệ, ồ, and ử.

Latin-based script

Recognizes the full Latin-based Vietnamese alphabet, including ă, â, ê, ô, ơ, ư, and đ.

Document processing

Works with Vietnamese legal, academic, and business documents.

Searchable PDF output

Creates PDFs with an invisible text layer for full-text search.

Translate after extraction

Extract Vietnamese text, then translate it to any language.

Why Vietnamese OCR Is Challenging

  • Recognizing stacked diacritics where tone marks appear above or below vowels that already have accent marks
  • Handling the five tone marks (sắc, huyền, hỏi, ngã, nặng) that, together with the unmarked level tone (ngang), distinguish the six tones and are critical for meaning in Vietnamese
  • Preserving the đ/Đ character (d with stroke) distinct from regular d/D
  • Processing the full Vietnamese vowel set with diacritics: ă, â, ê, ô, ơ, ư and their toned variants
  • Correctly recognizing combinations like ầ, ẩ, ẫ, ậ, where a circumflex and a tone mark sit on the same vowel
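To make the stacking concrete, here is a minimal sketch using Python's standard `unicodedata` module. It is an illustration of how Unicode represents these characters, not a description of how FastOCR works internally. The helper name `describe` is hypothetical.

```python
import unicodedata


def describe(ch: str) -> list[str]:
    """Return the Unicode names of the code points in the NFD form of ch."""
    decomposed = unicodedata.normalize("NFD", ch)
    return [unicodedata.name(c) for c in decomposed]


# A vowel with two marks decomposes into a base letter plus two combining marks.
print(describe("\u1ea7"))  # the character ầ
# ['LATIN SMALL LETTER A', 'COMBINING CIRCUMFLEX ACCENT', 'COMBINING GRAVE ACCENT']

# đ is a separate letter: it has no decomposition into d plus a mark.
print(describe("\u0111"))  # the character đ
# ['LATIN SMALL LETTER D WITH STROKE']

# Composed (NFC) and decomposed (NFD) spellings look identical on screen
# but compare unequal until both are normalized to the same form.
composed = "\u1ea7"
decomposed = "a\u0302\u0300"
print(composed == decomposed)                                # False
print(unicodedata.normalize("NFC", decomposed) == composed)  # True
```

Normalizing extracted text to one form (NFC is a common choice) before searching or comparing avoids mismatches between visually identical strings.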

How to Extract Vietnamese Text from PDFs and Images

  1. Go to fastocr.org
  2. Upload your Vietnamese image or PDF. Language is detected automatically.
  3. Wait for processing — images take seconds, PDFs show a progress bar.
  4. Download results: searchable PDF, raw text file, or copy text directly.

Tips for Better Vietnamese OCR Accuracy

  1. Scan at 300+ DPI — Vietnamese diacritics are small and easily lost in low-resolution images
  2. Verify tone marks after OCR — missing or wrong tones completely change word meaning in Vietnamese
  3. Check that the đ character is preserved and not converted to a plain d
  4. For stacked diacritics (e.g., ầ, ổ, ữ), zoom in to verify both marks are correctly recognized
  5. Use clean, high-contrast scans to preserve the fine details of Vietnamese diacritical marks
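The manual checks above can be partly automated when you have a short passage that was retyped by hand to compare against. The following is a minimal sketch under that assumption, using only the Python standard library. The function names are hypothetical, and the word pairing is deliberately naive: it assumes both texts contain the same number of words in the same order.

```python
import unicodedata


def strip_vietnamese_marks(text: str) -> str:
    """Remove tone and vowel marks, and map đ/Đ to plain d/D."""
    decomposed = unicodedata.normalize("NFD", text)
    base = "".join(c for c in decomposed if not unicodedata.combining(c))
    # đ/Đ have no Unicode decomposition, so they are mapped explicitly.
    return base.replace("\u0111", "d").replace("\u0110", "D")


def diacritic_mismatches(reference: str, ocr: str) -> list[tuple[str, str]]:
    """Return (reference, ocr) word pairs whose base letters agree
    but whose tone marks, vowel marks, or đ differ."""
    ref_words = unicodedata.normalize("NFC", reference).split()
    ocr_words = unicodedata.normalize("NFC", ocr).split()
    return [
        (r, o)
        for r, o in zip(ref_words, ocr_words)  # naive positional pairing
        if r != o and strip_vietnamese_marks(r) == strip_vietnamese_marks(o)
    ]


# Hand-typed reference vs. a made-up OCR result that lost some marks.
reference = "\u0110\u01b0\u1eddng \u0111i kh\u00f3"  # "Đường đi khó"
ocr_text = "\u0110\u01b0\u1eddng di kho"             # đ became d, tone on "khó" lost
for ref_word, ocr_word in diacritic_mismatches(reference, ocr_text):
    print(f"check: expected {ref_word!r}, got {ocr_word!r}")
```

Only words that differ purely in their marks are reported, which keeps the list focused on tone and đ errors rather than general misreads.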

Common Use Cases for Vietnamese OCR

  • Digitizing Vietnamese legal documents, contracts, and notarial records
  • Extracting text from Vietnamese government forms and identity documents
  • Converting scanned Vietnamese academic papers and educational materials
  • Processing Vietnamese business invoices and import/export documentation
  • Archiving Vietnamese newspaper articles and literary publications

Frequently Asked Questions

Does Vietnamese OCR handle tone marks correctly?

Yes. FastOCR recognizes all five Vietnamese tone marks (which, with the unmarked level tone, cover the six tones) and stacked diacritics with 96% accuracy on clean printed text.

Can it handle the đ character?

Yes. FastOCR correctly distinguishes đ (d with stroke) from regular d, which is essential for Vietnamese text.

How accurate is Vietnamese OCR?

FastOCR achieves 96% accuracy on printed Vietnamese. Stacked diacritics on small text may occasionally be misread.
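If you want to measure accuracy on your own documents, one common approach is character-level accuracy based on edit distance against a hand-checked reference. The sketch below illustrates that general technique. It is an assumption that this resembles how the 96% figure is defined, since the source does not say how the figure is computed, and the function names are hypothetical.

```python
import unicodedata


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum insertions, deletions, substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def char_accuracy(reference: str, ocr: str) -> float:
    """1 minus (edit distance / reference length), floored at 0."""
    # Normalize both sides so composed and decomposed spellings compare equal.
    ref = unicodedata.normalize("NFC", reference)
    hyp = unicodedata.normalize("NFC", ocr)
    if not ref:
        return 1.0 if not hyp else 0.0
    return max(0.0, 1.0 - edit_distance(ref, hyp) / len(ref))


# Made-up example: one wrong tone mark in a 10-character reference.
print(char_accuracy("ti\u1ebfng Vi\u1ec7t", "ti\u1ebfng Vi\u1ebft"))  # 0.9
```

Because a single wrong tone counts as one character error, a text can score well on this metric and still contain words whose meaning has changed, which is why the spot checks for tone marks remain useful.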

Is Vietnamese OCR free?

Image OCR is free with no registration. PDF processing requires a free account and includes 3 free PDFs per month.

Upload Vietnamese text →

Free for images. No registration required.