Lead Real-Time Voice AI & LLM Engineer — Hybrid

$170k - $190k

ASAPP

At ASAPP, our mission is simple: deliver the best AI-powered customer experience—faster than anyone else. To achieve that, we’re guided by principles that shape how we think, build, and execute. We value customer obsession, purposeful speed, ownership, and a relentless focus on outcomes. We work in tight, skilled teams, prioritize clarity over complexity, and continuously evolve through curiosity, data, and craftsmanship.We’re seeking technologists and problem solvers who thrive in fast-paced environments, love collaborating with great talent, and approach every day like it’s Day 1. We're a globally diverse team with hubs in New York City, Mountain View, Latin America, and India—embracing both hybrid and remote work to bring the best minds together, wherever they are. If you're driven by continuous learning, rapid pivots, and the challenges of building in a high-growth startup, we’d love to talk. This is more than a job—it’s a journey. You will lead the design and delivery of end-to-end voice AI solutions, combining large language models with speech technologies such as speech-to-text, text-to-speech, and real-time streaming audio pipelines. This role requires a hands-on technical leader who can architect low-latency, highly reliable conversational voice systems and guide a team through ambiguity toward production excellence. We are looking for someone who understands the unique constraints of voice experiences, latency, turn-taking, interruption handling, streaming inference, and audio quality, and can translate these into scalable, enterprise-grade systems. This is a hybrid role with weekly in-person responsibilities. We have offices in New York City and Mountain View, CA What you'll do Build real-time conversational AI systems, including voice interfaces powered by speech-to-text, text-to-speech, and streaming inference pipelines Design and optimize low-latency inference workflows for multimodal applications involving text, speech, and real-time interactions Integrate and apply foundation models from major providers (OpenAI, AWS Bedrock, Anthropic, etc.) for prototyping and production use cases Adapt, evaluate, and optimize LLMs for domain-specific enterprise applications Build and maintain infrastructure for experimentation, deployment, and monitoring of AI models in production Improve model performance and inference workflows with attention to latency, cost, and reliability Provide technical leadership within the team, mentoring engineers and promoting best practices in ML engineering Partner with product and cross-functional stakeholders to translate requirements into scalable ML solutions Contribute to the evolution of internal standards for experimentation, evaluation, and deployment What you'll need 6+ years of experience in Machine Learning or AI systems, with hands-on experience in LLMs, speech, or conversational AI systems Experience building on integrating speech-to-text and text-to-speech systems Strong experience integrating voice models into production applications Proficiency on Python and ML frameworks like PyTorch or TensorFlow Proven experience leading complex, cross-functional AI initiatives Deep understanding of latency-sensitive system design and distributed architectures Strong proficiency in Python and ML frameworks such as PyTorch or TensorFlow Understanding of RAG pipelines, prompt engineering, and vector search Experience deploying and scaling AI systems using AWS (required), Docker, Kubernetes, and CI/CD practices Strong communication skills with the ability to align engineering, product, and executive stakeholders Comfortable operating in fast-paced environments and driving clarity in ambiguous problem spaces What we'd like to see Experience with speech model fine-tuning and acoustic/language model optimization Experience with production applications of S2S models Hands-on experience with real-time or streaming audio systems (WebRTC, gRPC streaming, or similar architectures) Experience optimizing TTS prosody, pronunciation control, and voice customization Background in MLOps, experimentation platforms, or evaluation frameworks for speech and conversational systems Contributions to open-source AI or speech tooling Graduate degree (MS or PhD) in Computer Science, Machine Learning, Speech Processing, or related field $170,000 - $190,000 a year Compensation package also includes a performance bonus on top of the listed salary range Separately, we also offer a compelling equity grant comprised of stock options Benefits include: Competitive compensation with stock options Comprehensive medical, vision, and dental insurance 401k matching Fitness and wellness stipend Mental well-being benefits Professional learning and development stipend Parental leave, including adoptive and foster parents 3 weeks paid time off (increases with tenure) along with sick leave, bereavement and jury duty ASAPP is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status. If you have a disability and need assistance with our employment application process, please email us at View email address on click.appcast.io to obtain assistance. #LI-SL1 #LI-Hybrid #J-18808-Ljbffr ASAPP

Apply

Vacancy posted 15 hours ago

Similar jobs that could be interesting for youBased on the Lead Real-Time Voice AI & LLM Engineer — Hybrid in Mountain View, CA vacancy

AI Inference Engineer: Real-Time ML, Hybrid, Equity
$190k - $250k
A leading financial technology firm in California seeks an AI Inference engineer to join its team. The role involves developing APIs for AI... ...system reliability, and optimizing LLM performance. Required... ...programming. The position offers a hybrid work model and competitive...
Suggested
Pantera Capital
Palo Alto, CA
5 days ago
Real-Time AI Diagnostics & Observability Engineer
...infrastructure that makes Sage Care’s AI assistant Role Overview Own and... ...trustworthy and debuggable —in real time and post-call. This engineer builds the visibility layer... ...role sits at the intersection of LLM orchestration, voice pipelines, transcription, SOP engines...
Suggested
Immediate start
Sage Care
Palo Alto, CA
1 day ago
Hybrid Cloud DevOps Engineer for Real-Time Analytics
A leading technology company is seeking a Software DevOps Engineer in Sunnyvale, CA to develop and support large hybrid cloud infrastructure for a real-time analytics platform. The ideal candidate will have a Bachelor’s degree and over 4 years of experience with hybrid...
Suggested
Redolent Infotech Pvt. Ltd.
Sunnyvale, CA
4 days ago
Hybrid Real-Time Software Engineer, BCI Data Pipeline
$168k - $210k
...neuroscience company in California seeks a Real-Time Software Engineer to advance brain-computer interface... ...and work collaboratively in a hybrid environment. Ideal candidates have a Bachelor... ...experience, and experience in AI tooling. Join a team focused on transforming...
Suggested
Neara
Santa Clara, CA
3 days ago
Real-Time Edge AI Robotics Engineer
...A leading AI technology company in Sunnyvale is seeking engineers proficient in deploying deep learning models for robotics platforms. You'll work on real-time inference of AI systems, optimizing latency and performance. Ideal candidates will have strong C++ skills, experience...
Suggested
Coram AI
Sunnyvale, CA
3 days ago
Multimodal AI Engineer — Real-Time Reasoning & Tooling
$180k
...firm based in Palo Alto is seeking to hire an experienced engineer to work on multimodal AI systems. The ideal candidate will have hands-on... ...developing data pipelines, and advancing capabilities in real-time interactions. The role offers competitive compensation ranging...
Pantera Capital
Palo Alto, CA
1 day ago
Applied AI Engineer — Real-Time Cyber Ops Platform
...in Sunnyvale, California, is looking for an ambitious Applied AI Engineer to build and maintain AI integrated backend services. You'll collaborate... ..., product managers, and AI researchers to design scalable real-time systems. Ideal candidates will have a background in computer...
Zoomcar
Sunnyvale, CA
4 days ago
Senior AI Engineer — Autonomous Agents & Real‑Time Data
Illumio is hiring a Senior Software Engineer in Sunnyvale, California, to architect high-scale distributed systems, focusing on data processing and real-time analytics. The role requires expertise in backend engineering with Java, Python, or Go, and strong knowledge in...
Illumio
Sunnyvale, CA
2 days ago
AI/ML Engineering Intern — Real-Time Fraud Platform
A leading AI fraud detection company in Mountain View, CA is seeking AI / ML Engineering Interns. Ideal for recent MS or Ph.D. graduates in Computer Science or related fields... ...AI applications using LLMs, and working on real-time system optimization. Candidates should have...
Internship
DataVisor Inc.
Mountain View, CA
5 days ago
AI Inference Co-Design Engineer for Real-Time HW
$132k - $330k
Software Engineer, AI Inference Codesign The AI inference co-design team's goal is to take research models and make them run efficiently on our AI-ASIC to power real-time inference for Autopilot and Optimus programs. This unique role lies at the intersection of AI research...
Hourly pay
Full time
Temporary work
Flexible hours
Tesla Motors, Inc.
Palo Alto, CA
2 days ago
Real-time Video Research Lead for Agentic Platforms
An innovative technology company is seeking a Real-time Video Researcher to drive advancements in real-time video generation and intelligent... ...salary, equity, and a dynamic team environment with hybrid working options. Join to shape the future of creative technology...
Pika
Palo Alto, CA
4 days ago
Founding DevRel Lead - Real-Time AI Voice
$220k - $290k
Inworld AI in Mountain View is seeking a Developer Relations position to lead the developer relations strategy and enhance developer experience. You will create engaging... ..., and is familiar with AI/ML APIs. This full-time role offers a competitive salary between $220,00...
Full time
Inworld AI
Mountain View, CA
2 days ago
Backend AI Engineer - Real-time LLM & Cloud
LiveX AI Inc. is seeking a Backend Software Engineer to innovate customer retention through AI technology. The role involves developing backend systems that enhance customer engagement, embedding advanced AI models for personalized interactions, and collaborating with...
LiveX AI Inc.
Palo Alto, CA
1 day ago
AI Engineer III - Agentic AI
$103.75k - $174.75k
...AI Engineer III - Agentic AI New York, NY, United... ...that operate on real financial data and... ...implementation of LLM-powered and agentic... ...technical ownership over time. ~ Comfort... ...either virtual or hybrid, depending on the selected... ...career. Here, your voice and ideas matter,...
Full time
Internship
Work at office
Local area
Remote work
Visa sponsorship
Flexible hours
Shift work
3 days per week
American Express
Palo Alto, CA
1 day ago
Senior Agentic AI Engineer
$300k - $400k
...Senior Agentic AI Engineer $300,000 - $400,000 Remote / Hybrid, Palo Alto Full time / Permanent A well-known,... ...modalities (e.g. image/video, voice, browsing, code execution) Own LLM orchestration and model... ...consistency, and real user impact What Great...
Permanent employment
Full time
Remote work
Shift work
DeepRec.ai
Palo Alto, CA
2 days ago
Voice AI Engineer - Multilingual Models & Low-Latency
$150k
...xAI xAI’s mission is to create AI systems that can accurately... ...highly motivated, and focused on engineering excellence. This organization... ...ROLE: You will join the Grok Voice Model team to help build the... ...reliable across devices and real-time scenarios. We own the full...
Temporary work
Pantera Capital
Palo Alto, CA
2 days ago
Senior iOS Engineer: Real-Time, Scalable Mobile Apps
$130.6k - $285k
Fairygodboss is looking for a Senior iOS Engineer to join our team in Sunnyvale, California. In this hybrid role, you will work on building and maintaining a large... ...features, improving code quality, and solving complex real-time data issues. We offer a competitive salary range...
Fairygodboss
Sunnyvale, CA
4 days ago
Remote Full-Stack AI Engineer - Real-Time LLM UIs
$104.9k - $218.55k
NTT DATA is seeking a skilled developer to design and develop scalable full-stack AI applications using React, Next.js, TypeScript, and Node.js. You will oversee the entire UI lifecycle while ensuring performance optimization and accessibility. The ideal candidate should...
Remote job
NTT DATA
Santa Clara, CA
1 day ago
Applied AI Engineer: Real-Time GenAI & Backend Systems
Zoomcar in Sunnyvale is seeking an Applied AI Engineer to join their cybersecurity team. The ideal candidate will hold a Bachelor's degree... ...opportunities for growth, mentorship, and hands-on experience with real-world applications, making it ideal for recent graduates...
Zoomcar
Sunnyvale, CA
3 days ago
Grid Modernization Lead: Real-Time T&D Controls & SCADA
...looking for a professional to join their Utility Industry team in California. This role focuses on enhancing utility operations through real-time system design and implementation, especially in Transmission and Distribution sectors. The ideal candidate will have at least four...
Accenture
Mountain View, CA
3 days ago
Senior Gen AI Platform Engineer (Hybrid, Equity)
$160k - $240k
...possible with Generative AI in healthcare. Our... ...specific agent creation, and real-time algorithm monitoring—working alongside leading health systems to drive... ...for a skilled Platform Engineer to contribute to the development... ...flexible working hours, hybrid work options and an...
Live in
Flexible hours
3 days per week
Qualified Health PBC
Palo Alto, CA
1 day ago
Staff AI Application Engineer Lead
$183k - $224k
...Responsibilities Architect, design and implement scalable AI agent and backend systems for high-volume, real-time operational workloads, and build resilient... ...with multiple teams. Experience with LLM service, prompt engineering and evaluation techniques. Proficiency with...
Full time
Night shift
Zoomcar
Sunnyvale, CA
3 days ago
GSOC Security Operator - 24/7 Real-Time Incident Lead
$30 per hour
A veteran-owned security firm is seeking a Security Officer GSOC Operator in Sunnyvale, CA. This critical role involves real-time monitoring and incident response for client assets and personnel. The ideal candidate should have 2-5 years of GSOC experience and strong communication...
Full time
Worldwide
Surefox North America
Sunnyvale, CA
3 days ago
Full Stack Engineer — Real-Time AI Platform
...building neuron™ , a unified AI-native platform for data... .... neuron™ processes real-time telemetry from thousands of... ...Principal Software & Architecture Lead, Product, and Solution Engineering in a fast-moving, small-... ...tools (Cursor, Copilot, LLM‑based workflows) to accelerate...
Full time
Teserac, Inc.
Sunnyvale, CA
5 days ago
Senior AI Engineer
$188k - $237.5k
...transformation to AI-enabled software-defined... .... That's why leading OEMs trust Sonatus... ...motivated Senior AI Engineer to join our team... ...millions of vehicles in real-time. We need a "builder... ...optimization, and LLM security. You will... ...+ paid holidays Hybrid office work...
Work at office
Local area
Worldwide
Flexible hours
Shift work
Sonatus
Sunnyvale, CA
3 days ago
Lead Secure AI & LLM Platform Engineer
Zoomcar is seeking a Principal Software Development Engineer in Sunnyvale to architect and implement functions for monitoring LLM requests and filter for prompt injection... ...skills in Python and C/C++, with experience in AI/ML deployments, particularly with large language...
Zoomcar
Sunnyvale, CA
1 day ago
Senior AI Engineer
...AI Engineer at Hippocratic AI As an AI Engineer at Hippocratic AI,... ...role in shaping the future of voice-based generative AI in healthcare... ...of large language models, real-time voice, and human-centered... ...Experience building or deploying LLM-based or multi-agent systems...
Work at office
Hippocratic AI
Palo Alto, CA
5 days ago
Software Engineer, Behavior Planning (Real-Time Autonomy)
$126k - $201k
A leading self-driving technology company based in Mountain View is looking for a Software Engineer to join their Behavior Planning Team. This role involves developing real-time algorithms for self-driving vehicles, designing software for the Aurora Driver, and analyzing...
Australian Competition and Consumer Commission
Mountain View, CA
1 day ago
Senior HPC Software Engineer, Real-Time Beam Planning
$160k - $225k
SpaceX is seeking a Senior Software Engineer for its Starlink program in Sunnyvale, California. This role involves developing reliable, real-time software to enhance user experience for satellite internet. Candidates should have extensive software development experience...
SpaceX
Sunnyvale, CA
2 days ago
Real-Time Low-Latency Software Engineer
$135k - $185k
SpaceX is looking for a Software Engineer for their Starlink project, focusing on low-latency computing solutions. You will develop software that enhances satellite internet technology, ensuring real-time communications for users. The position requires a Bachelor’s degree...
jobr.pro
Palo Alto, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead Real-Time Voice AI & LLM Engineer — Hybrid. Be the first to apply!