Lead Real-Time Voice AI & LLM Engineer — Hybrid
$170k - $190k
ASAPP
At ASAPP, our mission is simple: deliver the best AI-powered customer experience, faster than anyone else. To achieve that, we’re guided by principles that shape how we think, build, and execute. We value customer obsession, purposeful speed, ownership, and a relentless focus on outcomes. We work in tight, skilled teams, prioritize clarity over complexity, and continuously evolve through curiosity, data, and craftsmanship.

We’re seeking technologists and problem solvers who thrive in fast-paced environments, love collaborating with great talent, and approach every day like it’s Day 1. We're a globally diverse team with hubs in New York City, Mountain View, Latin America, and India, embracing both hybrid and remote work to bring the best minds together, wherever they are. If you're driven by continuous learning, rapid pivots, and the challenges of building in a high-growth startup, we’d love to talk. This is more than a job; it’s a journey.

You will lead the design and delivery of end-to-end voice AI solutions, combining large language models with speech technologies such as speech-to-text, text-to-speech, and real-time streaming audio pipelines. This role requires a hands-on technical leader who can architect low-latency, highly reliable conversational voice systems and guide a team through ambiguity toward production excellence. We are looking for someone who understands the unique constraints of voice experiences (latency, turn-taking, interruption handling, streaming inference, and audio quality) and can translate these into scalable, enterprise-grade systems.

This is a hybrid role with weekly in-person responsibilities. We have offices in New York City and Mountain View, CA.

What you'll do
- Build real-time conversational AI systems, including voice interfaces powered by speech-to-text, text-to-speech, and streaming inference pipelines
- Design and optimize low-latency inference workflows for multimodal applications involving text, speech, and real-time interactions
- Integrate and apply foundation models from major providers (OpenAI, AWS Bedrock, Anthropic, etc.) for prototyping and production use cases
- Adapt, evaluate, and optimize LLMs for domain-specific enterprise applications
- Build and maintain infrastructure for experimentation, deployment, and monitoring of AI models in production
- Improve model performance and inference workflows with attention to latency, cost, and reliability
- Provide technical leadership within the team, mentoring engineers and promoting best practices in ML engineering
- Partner with product and cross-functional stakeholders to translate requirements into scalable ML solutions
- Contribute to the evolution of internal standards for experimentation, evaluation, and deployment

What you'll need
- 6+ years of experience in Machine Learning or AI systems, with hands-on experience in LLMs, speech, or conversational AI systems
- Experience building or integrating speech-to-text and text-to-speech systems
- Strong experience integrating voice models into production applications
- Strong proficiency in Python and ML frameworks such as PyTorch or TensorFlow
- Proven experience leading complex, cross-functional AI initiatives
- Deep understanding of latency-sensitive system design and distributed architectures
- Understanding of RAG pipelines, prompt engineering, and vector search
- Experience deploying and scaling AI systems using AWS (required), Docker, Kubernetes, and CI/CD practices
- Strong communication skills with the ability to align engineering, product, and executive stakeholders
- Comfortable operating in fast-paced environments and driving clarity in ambiguous problem spaces

What we'd like to see
- Experience with speech model fine-tuning and acoustic/language model optimization
- Experience with production applications of S2S models
- Hands-on experience with real-time or streaming audio systems (WebRTC, gRPC streaming, or similar architectures)
- Experience optimizing TTS prosody, pronunciation control, and voice customization
- Background in MLOps, experimentation platforms, or evaluation frameworks for speech and conversational systems
- Contributions to open-source AI or speech tooling
- Graduate degree (MS or PhD) in Computer Science, Machine Learning, Speech Processing, or related field

$170,000 - $190,000 a year
The compensation package also includes a performance bonus on top of the listed salary range. Separately, we also offer a compelling equity grant consisting of stock options.

Benefits include:
- Competitive compensation with stock options
- Comprehensive medical, vision, and dental insurance
- 401k matching
- Fitness and wellness stipend
- Mental well-being benefits
- Professional learning and development stipend
- Parental leave, including adoptive and foster parents
- 3 weeks paid time off (increases with tenure) along with sick leave, bereavement and jury duty

ASAPP is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status. If you have a disability and need assistance with our employment application process, please email us at the address shown on click.appcast.io to obtain assistance.

