AI Benchmark Engineer | Native Language Specialist - Serbian - Remote

Lilt

We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encoding edge cases in terminal workflows.

We are seeking experienced native-speaking software engineers to design, build, and validate these benchmarks. You will create high-signal, high-quality tasks that genuinely test a model's ability to handle multilingual environments without relying on English translation crutches.

Note: this is a remote, freelance opportunity.

Key Responsibilities

Task Engineering: Design tasks that evaluate coding agents.

Asset Creation: Build realistic task environments using datasets and files in your native language. Crucially, these assets must remain in the target language to genuinely measure multilingual handling.

Prompting & Translation: Find the failure points where AI models break down in your native language.

Implementation & Verification: Support the development of robust solutions (reference implementations) and write highly reliable, deterministic verifier scripts (using rubric-based judging only when strictly necessary).

Calibration & Execution: Analyze execution logs and calibrate task difficulty (Easy to Very Hard) using standard Terminal-Bench run configurations against various model tiers (Haiku, Sonnet, Opus).

Quality Assurance: Participate in a rigorous, 4-layer human quality control process (creation, human review, calibration review, and audit) alongside automated LLM-based checks to ensure fairness, grammatical accuracy, and benchmark integrity.
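
To illustrate what a deterministic verifier script (as named under Implementation & Verification) might look like, here is a minimal sketch in Python. It is not Terminal-Bench's own API or Lilt's tooling; the function names `canonical_digest` and `verify` and the canonicalization rules (NFC normalization, trailing-whitespace stripping, uniform line endings) are assumptions chosen for the example.

```python
import hashlib
import unicodedata


def canonical_digest(text: str) -> str:
    """Hash text after removing differences that should not affect a verdict.

    Assumed canonicalization: NFC normalization, trailing whitespace stripped
    from each line, and all line endings treated as a single newline.
    """
    normalized = unicodedata.normalize("NFC", text)
    lines = [line.rstrip() for line in normalized.splitlines()]
    return hashlib.sha256("\n".join(lines).encode("utf-8")).hexdigest()


def verify(actual: str, expected: str) -> bool:
    """Return True when the two texts are identical after canonicalization."""
    return canonical_digest(actual) == canonical_digest(expected)


# Same content, different line endings and trailing whitespace.
same = verify("Ниш \r\nБеоград\n", "Ниш\nБеоград")
# Transliterated into ASCII: not the same text, so the check fails.
different = verify("Ниш", "Nis")
```

The same inputs always yield the same verdict, which is the property that separates a check like this from rubric-based judging.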

Required Qualifications

Experience:

5+ years of industry experience in software engineering.

Background:

Proven track record at leading technology companies and/or graduation from top-tier engineering universities.

Language:

Native or near-native fluency in Serbian, with a deep understanding of its grammar, register, and phrasing rules. High English proficiency.

Technical Stack:

Strong proficiency in Python, standard shell scripting, and data processing.

Workflow:

Extensive experience with Terminal/CLI-based development workflows and a working familiarity with coding agents.

Domain Expertise:

Deep technical understanding of multilingual text processing pitfalls, including:

Encoding/decoding robustness and Unicode normalization.

Locale-dependent conventions (collation, casing, non-Gregorian dates).

Text I/O, toolchain interoperability, and safe string operations.

Bidirectional/RTL handling, font fallbacks, and rendering/typography in UI or artifacts.
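
As a small illustration of the Unicode normalization pitfall listed above, the following Python sketch shows how two visually identical strings can differ at the code-point level. The helper name `same_text` is hypothetical, and the sample strings are made up for the example rather than taken from any benchmark task.

```python
import unicodedata


def same_text(a: str, b: str) -> bool:
    """Compare two strings after NFC normalization and case folding."""
    def canon(s: str) -> str:
        return unicodedata.normalize("NFC", s).casefold()
    return canon(a) == canon(b)


composed = "\u010d"      # "č" as a single precomposed code point
decomposed = "c\u030c"   # "c" followed by a combining caron

# The two forms render identically but are different strings.
raw_equal = composed == decomposed
# After normalization they compare equal.
normalized_equal = same_text(composed, decomposed)
# Case folding also handles upper-case input with decomposed letters.
city_equal = same_text("ČAČAK", "c\u030cac\u030cak")
```

A naive `==` comparison, a dictionary lookup, or a `grep` over raw bytes would treat the two forms as different, which is the kind of edge case these tasks are meant to expose.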

If interested, please submit your application, including the latest copy of your CV in English.

AI is changing how the world communicates — and LILT is leading that transformation.

LILT's mission is to make the world's information available to everyone, no matter the language they speak. Join our global community of people who thrive on innovation and excellence. Our collective knowledge, uniqueness, and skills deliver multilingual AI and human-verified services to Enterprises, Governments, and AI Developers around the world.

Earn money. Have fun. Advance human knowledge. Work on diverse projects from anywhere, any time you want. Get paid quickly and fairly, and build your professional network in a supportive community—all through a streamlined application process tailored to your expertise.

Information collected and processed as part of your application process, including any job applications you choose to submit, is subject to LILT's Privacy Policy at

At LILT, we are committed to a fair, inclusive, and transparent hiring process. As part of our recruitment efforts, we may use artificial intelligence (AI) and automated tools to assist in the evaluation of applications, including résumé screening, assessment scoring, and interview analysis. These tools are designed to support human decision-making and help us identify qualified candidates efficiently and objectively. All final hiring decisions are made by people. If you have any concerns, require accommodations, or would like to opt out of the use of AI in our hiring process, please let us know.

LILT is an equal opportunity employer. We extend equal opportunity to all individuals without regard to an individual’s race, religion, color, national origin, ancestry, sex, sexual orientation, gender identity, age, physical or mental disability, medical condition, genetic characteristics, veteran or marital status, pregnancy, or any other classification protected by applicable local, state or federal laws. We are committed to the principles of fair employment and the elimination of all discriminatory practices.

Vacancy posted 2 days ago