Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior AI Inference Engineer - Model Optimization & Deployment

$242k - $290k

Zoox Inc.

The Perception team is pioneering the development of a multi-modality foundation model to drive the next generation of autonomous system intelligence.

As a Model Optimization & Deployment Engineer, you will focus on bringing highly efficient, production-ready large-scale models to our on-vehicle stack. We are looking for experts with hands-on experience in compressing, accelerating, and deploying complex models (LLMs, VLMs, or FMs) for power- and thermal-constrained vehicle SOCs. You will optimize the ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.

In this role, you will:

  • Optimize large-scale models (Multi-Modal Sensor Fusion models, LLMs, VLMs) using advanced quantization (PTQ, QAT), pruning, mixed-precision inference frameworks, and parameter-efficient fine-tuning (LoRA, QLoRA).
  • Architect and implement model conversion and compilation pipelines using TensorRT for edge deployment.
  • Perform rigorous parity checking, accuracy recovery, and latency benchmarking between PyTorch frameworks and compiled edge binaries.
  • Develop and optimize custom ML OPs and TensorRT Plugins with efficient CUDA kernels to minimize latency and maximize memory bandwidth on AI accelerators.
  • Write production-level, low latency, and memory-safe C++ and CUDA code for real-time inference on vehicle systems.
Qualifications:
  • Deep expertise in model quantization (PTQ, QAT) and mixed-precision inference frameworks (INT8, FP8, FP4, BF16/FP16).
  • Proven experience optimizing large-scale models (Multi-Modal Sensor Fusion models, LLMs, VLMs/VLAs) utilizing Efficient Attention mechanisms (e.g., FlashAttention, Linear Attention), KV-cache optimization (e.g., PagedAttention) and Speculative Decoding.
  • Extensive experience with model conversion/compilation pipelines (e.g., ONNX, TensorRT, torch.compile) and performing rigorous latency benchmark and model quality parity valuation.
  • Proficiency in low-level programming for AI accelerators, specifically developing and optimizing custom ML OPs and TensorRT Plugins with efficient CUDA kernel implementations.
  • Production-level C++ (14/17/20) and Python programming skills, with experience developing concurrent, memory-safe, real-time inference code for edge devices.
Bonus Qualifications:
  • Familiarity with SOTA autonomous driving perception algorithms (temporal 3D object detection, BEV, 3D Occupancy Networks) and multi-modal sensor processing (Vision, LiDAR, Radar).
  • Experience with distributed training pipelines and model/tensor parallelism (PyTorch Distributed, Ray, DeepSpeed, Megatron-LM) and runtime efficiency optimization for GPU clusters.
  • Experience with end-to-end autonomous driving paradigms (VLM/VLA models, Foundation models) and edge deployment technologies (e.g., TensorRT-LLM).

$242,000 - $290,000 a year

Base Salary Range

There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign-on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.

Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.

About Zoox

Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We're looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.

Follow us on LinkedIn

Accommodations

If you need an accommodation to participate in the application or interview process please reach out to [email protected] or your assigned recruiter.

A Final Note:

You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior AI Inference Engineer - Model Optimization & Deployment in Nacogdoches, TX vacancy
  • $158.4k - $237.6k

     ..., Inc. Job Area: Engineering Group, Engineering Group...  ...Join the Qualcomm AI Hub team and help developers...  ...tools to help developers optimize and deploy machine learning models on edge and mobile hardware...  ...or similar families) for inference optimization ~ Familiarity... 
    Suggested
    Work experience placement
    Immediate start
    Work from home

    Qualcomm

    Nacogdoches, TX
    5 days ago
  • $178.4k - $267.6k

     ...Technologies, Inc. is seeking a Machine Learning Engineer in San Diego, California, to work with cutting-edge AI technologies and frameworks. The ideal...  ...workflows. Responsibilities include architecting model optimization techniques and collaborating with various teams... 
    Senior

    Stryker

    Nacogdoches, TX
    3 days ago
  • $139.87k - $250.38k

     ...here! PURPOSE OF THE JOB The Senior Applied AI Engineer is responsible for leading the design...  ...-edge AI technologies, ensures models are deployed securely, cost-effectively, and in...  ...regulated environments. Monitor and optimize AI workloads for cost efficiency in... 
    Senior
    Full time
    Work at office
    Local area

    ICW Group

    Nacogdoches, TX
    5 days ago
  • $111.3k - $166.9k

     ...Technologies, Inc. Job Area: Engineering Group, Engineering Group...  ...-generation on-device Voice AI capabilities on Windows-on-...  ...multiple audio features. Optimize and validate performance...  .../or UMDF , including build, deployment, and debugging fundamentals.... 
    Senior
    Work experience placement
    Work from home

    Qualcomm

    Nacogdoches, TX
    4 days ago
  • $162.6k - $244k

     ...Qualcomm Datacenter AI Systems and Solutions Engineer Job Description...  ...research, develop, optimize, and validate software...  ...solutions that enable the deployment of cutting-edge AI...  ...-class Qualcomm AI inference accelerators for...  ...the implementation of model fine-tuning, distillation... 
    Senior
    Work experience placement
    Work from home

    Qualcomm

    Nacogdoches, TX
    10 hours ago
  • $178.4k - $267.6k

     .... Job Area: Engineering Group, Engineering Group...  ...Summary: AI Performance Engineer...  ...software solutions for Inference Acceleration....  ...development to commercial deployment-and demands strategic...  ...: ~ Convert, optimize and deploy models for efficient... 
    Senior
    Work experience placement
    Work from home

    Qualcomm

    Nacogdoches, TX
    1 day ago
  • $124k - $280k

     ...Data, Analytics & AI Industry/Sector...  ...data and analytics engineering focus on...  ...optimising algorithms, models, and systems to enable...  ...health plans. As a Senior Manager, you will...  ...) and operational optimization Foster a collaborative...  ...of PHI-compliant deployment patterns and HIPAA... 
    Senior
    Full time
    H1b

    PwC

    Nacogdoches, TX
    2 days ago
  • $165k - $175k

     ...Lead Software & AI Engineer San Diego, California, United...  ..., developing, and deploying advanced software solutions...  ..., and Large Language Models (LLMs) into secure...  ...frameworks, model deployment, inference pipelines, and...  ...Experience developing and optimizing CI/CD pipelines using:... 
    For contractors

    G2IT LLC

    Nacogdoches, TX
    2 days ago
  • $168k - $211k

     ...Get AI-powered advice on this job...  ...companies design optimal clinical trials...  .... As a Senior AI Data Scientist...  ...of the art AI models and a range of...  ...and production deployment. Responsibilities...  ...with software engineers to develop...  ...pipelines and inference models to meet... 
    Senior
    Full time
    Temporary work
    Work at office
    Remote work
    Work from home
    Home office
    Flexible hours

    Faro Health

    Nacogdoches, TX
    5 days ago
  •  ...Nutanix is seeking a skilled engineer for a role focused on developing software solutions for Inference Acceleration. This position requires experience in delivering...  ...encompasses the entire product lifecycle from R&D to deployment. Ideal candidates will have strong... 
    Senior

    Nutanix

    Nacogdoches, TX
    4 days ago
  •  ...Country: USA Summary The DevTestOps engineer handles daily requests from the...  ...infrastructure issues, manage VMs, explore new AI Tools and develop new automated engineering...  ...and tools to automate build, testing, and deployment processes.Monitoring: Designing custom dashboards... 
    Senior
    Temporary work
    Work at office
    Local area
    Remote work
    Worldwide
    Shift work

    Celestica

    Nacogdoches, TX
    1 day ago
  • $124k - $280k

     ...Competency: Data, Analytics & AI Industry/Sector:...  ...in data and analytics engineering focus on leveraging...  ...optimising algorithms, models, and systems to enable...  ...for health systems. As a Senior Manager, you will serve...  ...AI in a HIPAA-compliant deployment context... 
    Senior
    Full time
    H1b

    PwC

    Nacogdoches, TX
    5 days ago
  • $140k - $150k

     ...are seeking an innovative and hands-on AI Engineer to join our Data Science team within Business...  ...You'll Do Design and deploy scalable AI/ML and LLM-powered...  ...lifecycle including deployment, monitoring, optimization, and retraining Implement evaluation... 

    Goal Solutions

    Nacogdoches, TX
    3 days ago
  •  ...About the job AI Engineer AI / Machine Learning Engineer...  ...expertise with hands-on experience deploying LLM-based solutions into...  ...ecosystems and understands how to optimize, evaluate, and productionize...  ...and benchmark ML and language models using structured... 
    Full time
    Work at office

    Calqulate

    Nacogdoches, TX
    4 days ago
  • $158.4k - $237.6k

     ...Qualcomm in San Diego is seeking an AI Software Engineer to develop cutting-edge machine learning techniques and implement AI solutions across...  ...include designing software for Qualcomm AI Stack SDKs and optimizing performance for resource-constrained systems. The ideal... 

    Qualcomm

    Nacogdoches, TX
    3 days ago
  •  ...Qualcomm in San Diego is seeking an AI Software Engineer to develop and implement cutting-edge machine learning techniques. In this role, you...  ...SDK into applications and collaborate with various teams to optimize performance across technology verticals. The ideal candidate... 
    Senior

    Qualcomm

    Nacogdoches, TX
    3 days ago
  •  ...A leading technology employer is looking for a Senior Software Engineer specializing in Optimization & Decision Support in San Diego, California. The role involves developing and maintaining a multi-objective optimization system, working closely with stakeholders to define... 
    Senior

    Military, Veterans and Diverse Job Seekers

    Nacogdoches, TX
    3 days ago
  • $178.4k - $267.6k

     ...A leading technology company in San Diego seeks a Sr. Staff Engineer to join their Machine Learning Engineering team, focusing on model optimization and enabling on-device AI. Candidates should have strong experience in software engineering and AI frameworks, as well... 
    Senior

    Qualcomm

    Nacogdoches, TX
    3 days ago
  •  ...Senior Software Engineer (Optimization & Decision Support) Job Openings Senior Software Engineer (Optimization...  ...Job Functions: Develop, deploy, and maintain a containerized multi-...  ...understanding and creating complex mathematical models Performance benchmarking and... 
    Senior

    Military, Veterans and Diverse Job Seekers

    Nacogdoches, TX
    5 days ago
  • $123k - $209k

     ...The Sr. Machine Learning Engineer, with minimal guidance from more...  ...machine learning algorithms and models to solve problems involving biological...  ..., design, implementation and deployment. This role strongly...  ...the application of cutting-edge AI methodologies at Exact Sciences... 
    Senior
    Full time
    For contractors
    Local area
    Night shift

    Exact Sciences

    Nacogdoches, TX
    2 days ago
  • $134k - $184k

     ...through implementation and deployment, while leveraging novel technologies...  ...in C++14 and software engineering techniques including multi-...  ...management, and performance optimization Have experience integrating...  ...processing or mathematical modeling Have experience with GPU... 
    Senior
    Full time
    Local area

    STR

    Nacogdoches, TX
    3 days ago
  •  ...mighty team works with senior leaders and...  ...strategic and rigorous engineering leader who is...  ...innovation and approaches AI agent design as an...  ..., develop, and deploy AI agents and...  ...production-including model selection, prompt...  ...availability and optimal performance of applications... 
    Senior
    Temporary work
    Local area

    Intuit

    Nacogdoches, TX
    5 days ago
  •  ...SoftClouds is seeking a technical specialist to design, deploy, migrate, manage, and support Oracle Agile PLM environments. The role...  ...Oracle Database schemas, perform backups, restores, cloning, and optimize queries. Implement security configurations including SSL/TLS,... 
    Senior
    Shift work

    SoftClouds

    Nacogdoches, TX
    3 days ago
  •  ...Overview AI is one of the fastest growing product...  ...Seismic Aura, our leading AI engine, is powering this...  ...successful sales outcomes. As a Senior Software Engineer – AI/...  ...role in developing and optimizing backend systems that...  ...and Continuous Deployment (CI/CD) with expertise... 
    Senior

    Seismic

    Nacogdoches, TX
    3 days ago
  • $190k - $240k

     ...This role is a hands‑on engineering position inside...  ...CIAM infrastructure and deployments using Infrastructure as...  ...Monitor, debug, and optimize CIAM services for performance...  ...API design, data modeling, latency, error handling...  ...such as Cursor and other AI‑augmented development... 
    Senior
    Work at office
    Remote work
    Flexible hours

    Affirm

    Nacogdoches, TX
    3 days ago
  • $160k - $240k

     ...Senior Software Engineer Founded in 2015, Shield AI is a venture-backed deep-tech company with the mission of protecting...  ..., robotics, control systems, optimization, and data analysis to create...  ...mission autonomy software stack and deploying them across a breadth of... 
    Senior
    Full time
    Temporary work
    Part time
    Worldwide

    Shield AI

    Nacogdoches, TX
    5 days ago
  • $164k - $273k

     ...year Department Engineering Location San Diego...  ...libraries, agentic AI solutions, and analytics...  ...in threat‑modeling sessions. Engineer...  ...and data structures to optimize performance and scalability...  ...Experience with containerized deployment technologies (... 
    Senior
    Full time
    Immediate start

    Innovative Defense Technologies

    Nacogdoches, TX
    4 days ago
  • $160k - $240k

     ...Flight Integration Engineer Founded in 2015, Shield AI is a venture-backed deep-tech company with the mission...  ...candidate will be skilled at deploying autonomy solutions onto unmanned platforms...  ...the ability to troubleshoot and optimize system performance. Excellent... 
    Senior
    Full time
    Temporary work
    Part time
    Work experience placement
    Work at office
    Worldwide

    Shield AI

    Nacogdoches, TX
    1 day ago
  • $135k - $185k

     ...such as Cameo Systems Modeler, MagicDraw, IBM Rhapsody...  ..., and digital engineering activities for Navy and...  ...government stakeholders, senior leaders, or cross-functional...  ...work-life balance. AI at G2 Ops. At G2 Ops,...  ...infrastructures, optimizing resiliency in system design... 
    Senior
    Full time
    Temporary work
    Work at office
    Local area
    Remote work
    Flexible hours

    G2 Ops Inc

    Nacogdoches, TX
    5 days ago
  • $111.3k - $166.9k

     ...Technologies, Inc. Job Area: Engineering Group, Engineering...  ...across subsystems— AI/Gen AI, Multimedia and...  ...experience with AI and GenAI inference frameworks such as...  ...in AI concepts, model architectures, tensor...  ...design, implementation, deployment, and support. Principal... 
    Senior
    Work experience placement
    Work from home

    Qualcomm

    Nacogdoches, TX
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Inference Engineer - Model Optimization & Deployment. Be the first to apply!