Mirrai Careers
Resume BuilderCareer Test
InsightsPricing
Get Started Free
Jobs/Senior Machine Learning Engineer - AI-Assisted Data Annotation

Senior Machine Learning Engineer - AI-Assisted Data Annotation

abbyy

Bangalore, India (Hybrid) Posted 4w ago
Apply on company site
Join ABBYY and be part of a team that celebrates your unique work style. With flexible work options, a supportive team, and rewards that reflect your value, you can focus on what matters most – driving your growth, while fueling ours. Our commitment to respect, transparency, and simplicity means you can trust us to always choose to do the right thing. As a trusted partner for purpose-built AI and intelligent automation, we solve highly complex problems for our enterprise customers and put their information to work to transform the way they do business. Over 10,000 customers trust ABBYY, including many Fortune 500 ones. You will work on further developing a portfolio already containing client names such as DHL, Johnson & Johnson, FDA, DMV, PwC, KeyBank, Spotify, and H&R BLOCK. About the Role  We are seeking a Senior Machine Learning Engineer – AI-Assisted Data Annotation to own the automated annotation track within ABBYY’s Document AI Data team.  This role sits at the intersection of large model capabilities and production data engineering, leveraging LLMs and vision-language models to generate high-quality training data at scale. You will design and build AI-assisted annotation pipelines, ensuring outputs are accurate, measurable, and reliable for downstream model training.  This is an ideal role for engineers who combine deep model expertise with strong system-building instincts and thrive in fast-moving, experimental environments.  Key Responsibilities  Technical Development & Innovation  * Design and implement AI-powered annotation pipelines using large models to generate ground truth labels at scale  * Develop and refine prompting strategies, few-shot examples, and fine-tuning approaches to improve accuracy and consistency  * Build systems for label verification, confidence scoring, and quality validation  * Evaluate which tasks are suitable for automated annotation vs. human review, and define decision criteria  * Create evaluation frameworks to benchmark automated annotations against human-labeled data  * Continuously improve annotation quality using feedback from human review workflows  Project Ownership & Leadership  * Own the automated annotation track end-to-end, from architecture through production monitoring  * Drive technical decisions across model selection, pipeline design, and validation strategies  * Define integration points with platform infrastructure and model serving systems  * Collaborate with Data Operations to design human-in-the-loop workflows for efficient review  * Contribute to roadmap planning with Principal-level technical leadership  Infrastructure & Scale  * Build and optimize large-scale inference pipelines for processing millions of documents  * Implement monitoring and alerting for quality degradation and system failures  * Design batching, caching, and fallback mechanisms to balance cost, throughput, and accuracy  * Collaborate with Platform teams on model serving, APIs, and infrastructure scaling  * Maintain clear documentation of annotation strategies, metrics, and known limitations  Qualifications  Education & Experience  * MS or PhD in Computer Science, Engineering, Mathematics, or related field  * 5+ years of experience in Machine Learning / AI, with focus on:   * Large Language Models (LLMs)  * Vision-Language Models (VLMs)  * Data annotation or labeling systems  * Demonstrated success using large AI models to automate annotation at production scale  * Strong background in evaluation design and quality measurement  Technical Expertise  * Deep expertise in LLMs and VLMs, including prompting, instruction tuning, and output evaluation  * Strong understanding of document understanding tasks (classification, extraction, layout analysis, semantic parsing)  * Experience designing label quality metrics, confidence scoring, and agreement analysis  * Strong programming skills in Python and proficiency with PyTorch or similar frameworks  * Experience with large-scale inference pipelines and model serving systems  * Familiarity with human-in-the-loop annotation systems and automation trade-offs  Leadership & Communication  * Proven ability to independently own complex technical workstreams  * Strong collaboration with data operations, platform, and modeling teams  * Ability to clearly communicate quality trade-offs and system behavior to diverse stakeholders  * Rigorous, data-driven problem-solving approach  Here are some of our local benefits:  * Comprehensive medical, accidental, and life insurance  * Weekly wellness sessions to support your physical and mental well-being  * A generous paid time off policy      Join ABBYY, and you will: Love how you work * We provide remote and hybrid working options to fit all lifestyles. * We use flexible hours across most of our teams to allow you to find your own definition of balance. * Encouraging a culture of giving, we provide two paid volunteering days off every year so you can take time to contribute to the causes you care about. * To ensure your family is cared for, we offer paid parental leave in all our locations. Love whom you work with * We are a global team of 600+ colleagues, spread across 15 countries on four continents. * With colleagues representing 30+ nationalities, our workforce reflects the world. * Innovation and excellence run through our veins. Our teams gather the expertise which has garnered ABBYY more than 140 technology patents. * We are guided by the values of respect, transparency, and simplicity. * "Team Environment" is in the top three highest-scoring drivers of engagement across all of our departments. Love what you work on * We are a company with more than 35 years of experience in the technology market; * Over 10,000 customers trust ABBYY, including many Fortune 500 ones, with names such as DHL, Johnson & Johnson, FDA, DMV, PwC, KeyBank, Spotify, and H&R BLOCK; * We have modernized the capture market by creating the first low-code/no-code IDP platform. * Our Machine Learning, Natural Language Processing, Computer Vision Technologies, and a marketplace built with AI, can transform any document in any process; * Top Analyst firms recognize ABBYY's market leadership, including Gartner, Everest PEAK Matrix ® Assessment, ISG Intelligent Automation Lens, and NelsonHall, amongst others. ABBYY is an Equal Employment Opportunity employer that values the strength that diversity brings to the workplace. To learn more about our commitment to Diversity and Inclusion, check out the careers section on our website.

See how well you match this job

Upload your resume and we’ll score your fit for this role and 6 similar roles — then tailor your CV to it with AI. Free, no credit card.

Check your match

Similar jobs

  • Senior Machine Learning Engineer, Synthetic Data & Document Understanding

    abbyy

    Bangalore, India (Hybrid)
  • Senior Machine Learning Engineer, Model Training & Evaluation

    abbyy

    Bangalore, India (Hybrid)
  • Senior Machine Learning Engineer

    cognite

    India (Bengaluru)
  • Machine Learning Engineer

    cognite

    India (Bengaluru)
  • Software Engineer, Machine Learning

    Glean

    Bangalore, India
  • Sr. Data Scientist

    6sense

    Bengaluru, Karnataka, India
Apply on company site

Want more roles like this? Browse fresh jobs or tailor your resume with AI.

Mirrai Careers

AI-powered career platform: build resumes, match jobs, and plan your career.

Product

  • All Tools
  • Resume Builder
  • Career Test
  • Pricing

Legal

  • Privacy Policy
  • Terms of Service
  • Fair Use Policy

Company

MIRRAI CHAT LTD (Company No. 16403306)

71-75 Shelton Street, Covent Garden

London, WC2H 9JQ, UNITED KINGDOM

[email protected]

© 2026 Mirrai Careers. All rights reserved.