Vietnamese OCR
Trích xuất văn bản tiếng Việt từ hình ảnh và tài liệu quét
Free · No registration for images · AI-powered
Drop your file here
PNG, JPG, PDF
Tone mark recognition
Handles all 6 Vietnamese tone marks and diacritics accurately.
Stacked diacritics
Reads characters with multiple marks like ệ, ồ, ử correctly.
Latin-based script
Recognizes the Vietnamese alphabet with all special characters.
Document processing
Works with Vietnamese legal, academic, and business documents.
Searchable PDF output
Creates PDFs with invisible text layer for full-text search.
Translate after extraction
Extract Vietnamese text then translate to any language.
Why Vietnamese OCR Is Challenging
- Recognizing stacked diacritics where tone marks appear above or below vowels that already have accent marks
- Handling six tone marks (sắc, huyền, hỏi, ngã, nặng) that are critical for meaning in Vietnamese
- Preserving the đ/Đ character (d with stroke) distinct from regular d/D
- Processing the full Vietnamese vowel set with diacritics: ă, â, ê, ô, ơ, ư and their toned variants
- Correctly rendering double-stacked marks like ầ, ẩ, ẫ, ậ where circumflex and tone mark combine
How to Extract Vietnamese Text from a PDF & Images
- Go to fastocr.org
- Upload your Vietnamese image or PDF. Language is detected automatically.
- Wait for processing — images take seconds, PDFs show a progress bar.
- Download results: searchable PDF, raw text file, or copy text directly.
Tips for Better Vietnamese OCR Accuracy
- Scan at 300+ DPI — Vietnamese diacritics are small and easily lost in low-resolution images
- Verify tone marks after OCR — missing or wrong tones completely change word meaning in Vietnamese
- Check the đ character is preserved and not converted to a plain d
- For stacked diacritics (e.g., ầ, ổ, ữ), zoom in to verify both marks are correctly recognized
- Use clean, high-contrast scans to preserve the fine details of Vietnamese diacritical marks
Common Use Cases for Vietnamese OCR
- Digitizing Vietnamese legal documents, contracts, and notarial records
- Extracting text from Vietnamese government forms and identity documents
- Converting scanned Vietnamese academic papers and educational materials
- Processing Vietnamese business invoices and import/export documentation
- Archiving Vietnamese newspaper articles and literary publications
Frequently Asked Questions
Does Vietnamese OCR handle tone marks correctly?
Yes. FastOCR recognizes all six Vietnamese tone marks and stacked diacritics with 96% accuracy on clean printed text.
Can it handle the đ character?
Yes. FastOCR correctly distinguishes đ (d with stroke) from regular d, which is essential for Vietnamese text.
How accurate is Vietnamese OCR?
FastOCR achieves 96% accuracy on printed Vietnamese. Stacked diacritics on small text may occasionally be misread.
Is Vietnamese OCR free?
Image OCR is free with no registration. PDF processing requires a free account and includes 3 free PDFs per month.
Free for images. No registration required.