Staff AI Engineer, Model Post-Training and Alignment
OKX
OKX will be prioritising applicants who have a current right to work in Singapore, and do not require OKX's sponsorship of a visa.
Who We Are
At OKX, we believe that the future will be reshaped by crypto, and ultimately contribute to every individual's freedom. OKX is a leading crypto exchange, and the developer of OKX Wallet, giving millions access to crypto trading and decentralized crypto applications (dApps). OKX is also a trusted brand by hundreds of large institutions seeking access to crypto markets. We are safe and reliable, backed by our Proof of Reserves. Across our multiple offices globally, we are united by our core principles: We Before Me , Do the Right Thing , and Get Things Done . These shared values drive our culture, shape our processes, and foster a friendly, rewarding, and diverse environment for every OK-er. OKX is part of OKG, a group that brings the value of Blockchain to users around the world, through our leading products OKX, OKX Wallet, OKLink and more.
About the Opportunity
We are seeking a highly skilled and hands-on Machine Learning Engineer specializing in large model post-training and alignment . This role focuses on designing, executing, and optimizing post-training pipelines to improve model performance, controllability, domain adaptation, and reasoning capabilities.
You will work across the full lifecycle of post-training—from data strategy and reward modeling to reinforcement learning–based optimization and production-grade inference deployment.
What You’ll Be Doing
- Lead and execute the full post-training pipeline for large language models (LLMs), including supervised fine-tuning, preference optimization, and reinforcement learning–based methods.
- Design and implement advanced training paradigms such as DPO (Direct Preference Optimization) and GRPO (Generalized Reward Policy Optimization) .
- Develop domain-specific data recipes, curation strategies, and augmentation pipelines to optimize task performance.
- Conduct post-training of specialized small models from scratch, including architecture selection, dataset construction, and optimization strategy.
- Build and refine Reward Models to support alignment and downstream optimization.
- Design and implement RLAIF (Reinforcement Learning from AI Feedback) closed-loop systems.
- Optimize inference efficiency and deploy models using low-latency serving frameworks such as vLLM and SGLang .
- Evaluate model performance using both automated benchmarks and human/AI feedback loops.
- Collaborate with research and infrastructure teams to productionize training and deployment workflows.
What We Look For In You
- Bachelor's in Computer Science, AI, Machine Learning, or related fields with at least 8 years of industry experience .
- Strong hands-on experience across the full post-training pipeline for large models.
- Deep familiarity with preference learning and alignment techniques, including DPO, GRPO, and RL-based post-training methodologies .
- Proven experience designing domain-specific data strategies and training methodologies.
- Experience training and post-training specialized small models from scratch .
- Solid understanding of reinforcement learning fundamentals and their application to model alignment.
- Experience deploying models in low-latency production environments using frameworks such as vLLM, SGLang, or similar .
Perks & Benefits
- Competitive total compensation package
- L&D programs and Education subsidy for employees' growth and development
- Various team building programs and company events
- Wellness and meal allowances
- Comprehensive healthcare schemes for employees and dependants
- More that we love to tell you along the process!
All official OKX vacancies are published on this website. While roles may appear on selected third-party platforms from time to time, information on other sites may be inaccurate or outdated. If in doubt, please apply directly through our official careers website.
Information collected and processed as part of the recruitment process of any job application you choose to submit is subject to OKX's [Candidate Privacy Notice](
- ...Title and Summary Principal AI Engineer Overview As a Principal... ...systems, including scalable training and evaluation pipelines, deployment... ...concept drift), and automate model/agent retraining, policy... ...Influence: ability to align platform, security, governance...TrainingFull timeWorldwide
- ...error. Incident management and post-mortem optimization: Lead online... ...scheduling strategies, network models, and storage mounting; able to... ...certification; experience with AI/LLM workload scheduling (GPU scheduling, distributed training). Perks & Benefits Competitive...TrainingFull timeOverseas
- .... About the Opportunity We are looking for a Principal AI Engineer to lead the architecture and deployment of large-scale, LLM-powered... .... Design multi-level intent routing, classifier training, and fallback strategies. Building dialogue quality evaluation...TrainingFull time
- ...more. ** Responsibilities** ~ AI-Driven Code Security Detection Engine 1. Design and implement a... ...protection framework for large language model applications, covering three... ...domain fine-tuning experience, such as training and evaluating security detection models...TrainingFull timeLocal area
- ...About the opportunity The AI Engineering team is responsible for integrating AI models with different business lines, across... ...We are looking for a Senior Staff Engineer to lead the design, development... ...across the company to develop, train, deploy, and operate AI models...TrainingFull timeImmediate start
- ...us. About the team The Engineering team at Airwallex is a diverse... .... What you’ll do As a Staff Site Reliability Engineer ,... ...Reliability Engineering at Airwallex, aligned with business objectives and... ...incidents, facilitating post-mortems and driving...Worldwide
- ...million people in 190 countries. The Engineering (Tech) Team is responsible for all Feedzai... ...(risks, designs, estimates) and align with Customer Success, Security, Sales,... ...You will be immersed in our brand with training, connections, and one-on-one time with your...TrainingRemote jobContract workWork at office
- ...the intersection of engineering rigour, financial... ...looking for a Staff Data Engineer to... ...team operates in an AI-native way. You... ...and enforce data modelling standards for Finance... ...recognition alignment, and audit trail completeness... ...structured post-mortems and drive...Permanent employmentFull time
- ...administrative functions related to the Centre’s professional training programmes. The incumbent will play a key role in ensuring seamless... ...Lead the design and development of training programmes aligned with industry needs and organisational goals. Conduct in-depth...TrainingWork experience placement
- ...collaborative IP Logic Design Engineer to join our Chipsets Logic... ...flows (VCS/Synopsys tools), RTL model builds, and DFT/DFV... ...strategies, and validation content aligned to high level silicon or IP architecture... ...of computing experiences. Posting Statement: All qualified...Remote jobLocal areaImmediate startWork from homeWorldwideShift work
- ...activities for local MBS IT Projects whilst aligning to standards & best practices followed... ...Design, develop, and deploy scalable AI/ML/DL models using state-of-the-art techniques.... ...capabilities. Collaborate with data engineers to ensure seamless data pipeline integration...Local area
- ...and Summary Senior SRE Engineer Senior SRE Engineer,... ...Mastercard’s Program aligned Site Reliability Engineering... ...automation and AI technologies to enhance... ...recovery exercises and post-maintenance activities... ...team capabilities. Lead training initiatives for team members...TrainingFull timeWorldwide
- ...Summary Lead Network Engineer, Site Reliability Engineering... ...Mastercard’s Program aligned Site Reliability... ...Leverage automation and AI technologies to enhance... ...recovery exercises and post-maintenance activities... ...team capabilities. Lead training initiatives for team members...TrainingFull timeWorldwide
- ...SOP manuals) and lead internal technical training sessions. What We Look For In You 1.... ...degree or higher in Computer Science, Network Engineering, or related fields. Nice-To-Haves... ...Premium, AWS Shield). Exposure to big data/AI operations (e.g., Alibaba Cloud...Full time
- ...technology protects 900 million people in 190 countries. The Engineering (Tech) Team is responsible for all Feedzai product... ...30-Days at Feedzai: You will be immersed in our brand with training, connections, and one-on-one time with your manager. You may shadow...TrainingRemote jobContract workWork at office
- ...continuous integration in mind. Leverage AI coding agents and tools (e.g., Claude... ...improvements to developer experience and engineering velocity - build tooling, CI/CD pipelines... ...conversational UX. Familiarity with MCP (Model Context Protocol), custom skill/tool...Full time
- ...responsible for the core architecture design and technical delivery of an enterprise-grade AI Native engineering platform, driving the deep integration of large language models, AI Agents, knowledge capabilities, and the software development lifecycle. The platform will...Full time
- ...standards. The team owns the full compliance engineering stack: KYC onboarding and refresh,... ...management, and regulatory reporting — including AI/ML-powered features such as intelligent... ...completion — including delivery of AI model integration work within KYC, screening, and...Full time
- ...The candidate should be familiar with the training and adult education landscape, be adept... ...of the courses. Achieve constructive alignment of the learning outcomes with the... ...be notified. Other Information NIE staff can take chartered buses at their own expense...TrainingContract work
- ...The appointee should be familiar with the training and adult education landscape.... ...of the courses. Achieve constructive alignment of the learning outcomes with the learning... ...be notified. Other Information NIE staff can take chartered buses at their own expense...Training
- ...and integrated to understand issues from various perspectives. Aligned with the learning outcomes of the desired attributes of an NTU... ...following qualifications are encouraged to apply: MA or PhD training across STEM and humanities and social science fields Training...TrainingWork at office
- ...opportunity to work with world-class ML engineers and research scientists to make self-... ...development (e.g. data transformation, model training, and model evaluation) of our on-car and... ...team goals, priorities, and roadmap in alignment with company objectives Work closely...TrainingFull time
- ...Auros - Senior DevOps Engineer Location: Hong Kong OR Remote APAC Type: Full-Time On-... ...trading, engineering, and security teams to align infrastructure improvements with... ...closely with developers, traders and other staff to accomplish our firm’s goals. Requirements...Remote jobFull timeFlexible hours
$22 - $44 per hour
...threats. No prior security experience is required. TSA provides paid training to get you job-ready. Position Details Openings: Multiple... ...from the Transportation Security Administration (TSA). This posting promotes an independent resource that helps applicants prepare...TrainingShift workNight shift- ...Wallet, OKLink and more. ### Job Overview: We are seeking a Senior Model Risk Quant Manager to join OKX's Model Risk Management (MRM)... ...defense (2LOD), this role independently validates FinCrime, AML, and AI/ML models, including FinCrime and AML models, that are core to...Full timeContract work
- ...managers with the training to coach their teams... ...The Site Reliability Engineer (Singapore) is a... ...follow-the-sun support model for its US Global... ...Support Center staff across time zones,... ...and contribute to post-incident reviews and... ...Python tools. Leverage AI to maximize efficiency...TrainingFull timeShift workNight shiftWeekend work
- ...Trading Service team as a Senior Staff Software Engineer, where you will be a pivotal... ...components, ensuring alignment with business goals and operational... ...(e.g., new concurrency models, hardware acceleration), and... ...alerting, incident response, and post-mortem analysis for mission-...Full time
- ...kind of problem from most ML engineering work. The data spans on-chain... ...engineering work to matter beyond model accuracy metrics, this is an... ...are building the analytical and AI infrastructure that powers the... ...and responsibilities will be aligned with your experience. ~...Full time
- ...ambitious work of your career, join us. About the team The Engineering team at Airwallex is a diverse group of innovators, builders,... ...cloud infrastructure in support of key product initiatives. Aligned to the roadmap, you’ll lead on infrastructure design and delivery...Worldwide
- ...department, our Risk & Analytics Engineering team plays a pivotal role in... ...role is open to Senior and Staff-level candidates; scope and title... ...to amplify themselves with AI tooling, not just talk about it... ...Proven ability to organize and align resources to solve complex problems...Full timeContract work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff AI Engineer, Model Post-Training and Alignment. Be the first to apply!

