Document Processing Engineer

To apply, email your resume and a short note on why you want to work at Orbii to hello@orbii.ai.

Location

Remote / UAE / KSA

Employment Type

Full-time

Department

Data

Responsibilities

Build pipelines that extract structured data from PDFs, scanned documents, and financial statements.
Apply OCR, NLP, and ML techniques to improve extraction accuracy and handle noisy inputs.
Optimize pipelines for Arabic-language documents, including right-to-left text handling and Arabic language models.
Continuously benchmark extraction performance and improve precision and recall.

Tech Skills

OCR: Tesseract, ABBYY, PaddleOCR.
NLP: spaCy, Hugging Face Transformers, BERT/ArabicBERT.
Python stack: PyPDF2, pdfplumber, Camelot/Tabula.
Cloud ML services: Azure Cognitive Services, AWS Textract.
Strong regex/text parsing skills.
Familiarity with information extraction evaluation metrics (F1, precision, recall).
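
For candidates who want a concrete picture of the evaluation metrics named above, here is a minimal sketch of field-level precision, recall, and F1 for a single document. The function name, the field names, and the sample values are all hypothetical, and exact-match scoring is an assumption made for illustration; it is not a description of how Orbii scores extractions.

```python
from typing import Dict, Tuple


def field_prf(predicted: Dict[str, str], gold: Dict[str, str]) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) for one document's extracted fields.

    Assumption: a predicted field is correct only when its value equals
    the gold value exactly, ignoring leading and trailing whitespace.
    """
    correct = sum(
        1
        for name, value in predicted.items()
        if name in gold and value.strip() == gold[name].strip()
    )
    # Precision: share of predicted fields that are correct.
    precision = correct / len(predicted) if predicted else 0.0
    # Recall: share of gold fields that were recovered correctly.
    recall = correct / len(gold) if gold else 0.0
    # F1: harmonic mean of precision and recall (0 when both are 0).
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Hypothetical gold labels and extractor output for one invoice.
gold = {
    "invoice_number": "INV-1001",
    "total": "4,250.00",
    "currency": "AED",
    "due_date": "2024-03-31",
}
predicted = {
    "invoice_number": "INV-1001",
    "total": "4,250.00",
    "currency": "SAR",  # wrong value: hurts precision and recall
    # "due_date" is missing: hurts recall only
}

precision, recall, f1 = field_prf(predicted, gold)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

In this sketch a wrong value lowers both precision and recall, while a missing field lowers recall only, which is why the two numbers are tracked separately.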

Values Fit

Humility fuels growth: each edge case is a lesson.