Lead Real-Time Voice AI & LLM Engineer — Hybrid

$170k - $190k

ASAPP

At ASAPP, our mission is simple: deliver the best AI-powered customer experience, faster than anyone else. To achieve that, we’re guided by principles that shape how we think, build, and execute. We value customer obsession, purposeful speed, ownership, and a relentless focus on outcomes. We work in tight, skilled teams, prioritize clarity over complexity, and continuously evolve through curiosity, data, and craftsmanship.

We’re seeking technologists and problem solvers who thrive in fast-paced environments, love collaborating with great talent, and approach every day like it’s Day 1. We're a globally diverse team with hubs in New York City, Mountain View, Latin America, and India, embracing both hybrid and remote work to bring the best minds together, wherever they are. If you're driven by continuous learning, rapid pivots, and the challenges of building in a high-growth startup, we’d love to talk. This is more than a job; it’s a journey.

You will lead the design and delivery of end-to-end voice AI solutions, combining large language models with speech technologies such as speech-to-text, text-to-speech, and real-time streaming audio pipelines. This role requires a hands-on technical leader who can architect low-latency, highly reliable conversational voice systems and guide a team through ambiguity toward production excellence. We are looking for someone who understands the unique constraints of voice experiences (latency, turn-taking, interruption handling, streaming inference, and audio quality) and can translate these into scalable, enterprise-grade systems.

This is a hybrid role with weekly in-person responsibilities. We have offices in New York City and Mountain View, CA.

What you'll do
  • Build real-time conversational AI systems, including voice interfaces powered by speech-to-text, text-to-speech, and streaming inference pipelines
  • Design and optimize low-latency inference workflows for multimodal applications involving text, speech, and real-time interactions
  • Integrate and apply foundation models from major providers (OpenAI, AWS Bedrock, Anthropic, etc.) for prototyping and production use cases
  • Adapt, evaluate, and optimize LLMs for domain-specific enterprise applications
  • Build and maintain infrastructure for experimentation, deployment, and monitoring of AI models in production
  • Improve model performance and inference workflows with attention to latency, cost, and reliability
  • Provide technical leadership within the team, mentoring engineers and promoting best practices in ML engineering
  • Partner with product and cross-functional stakeholders to translate requirements into scalable ML solutions
  • Contribute to the evolution of internal standards for experimentation, evaluation, and deployment

What you'll need
  • 6+ years of experience in Machine Learning or AI systems, with hands-on experience in LLMs, speech, or conversational AI systems
  • Experience building or integrating speech-to-text and text-to-speech systems
  • Strong experience integrating voice models into production applications
  • Strong proficiency in Python and ML frameworks such as PyTorch or TensorFlow
  • Proven experience leading complex, cross-functional AI initiatives
  • Deep understanding of latency-sensitive system design and distributed architectures
  • Understanding of RAG pipelines, prompt engineering, and vector search
  • Experience deploying and scaling AI systems using AWS (required), Docker, Kubernetes, and CI/CD practices
  • Strong communication skills with the ability to align engineering, product, and executive stakeholders
  • Comfortable operating in fast-paced environments and driving clarity in ambiguous problem spaces

What we'd like to see
  • Experience with speech model fine-tuning and acoustic/language model optimization
  • Experience with production applications of S2S models
  • Hands-on experience with real-time or streaming audio systems (WebRTC, gRPC streaming, or similar architectures)
  • Experience optimizing TTS prosody, pronunciation control, and voice customization
  • Background in MLOps, experimentation platforms, or evaluation frameworks for speech and conversational systems
  • Contributions to open-source AI or speech tooling
  • Graduate degree (MS or PhD) in Computer Science, Machine Learning, Speech Processing, or related field

$170,000 - $190,000 a year

The compensation package also includes a performance bonus on top of the listed salary range. Separately, we also offer a compelling equity grant composed of stock options.

Benefits include:
  • Competitive compensation with stock options
  • Comprehensive medical, vision, and dental insurance
  • 401k matching
  • Fitness and wellness stipend
  • Mental well-being benefits
  • Professional learning and development stipend
  • Parental leave, including adoptive and foster parents
  • 3 weeks paid time off (increases with tenure) along with sick leave, bereavement and jury duty

ASAPP is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status.

If you have a disability and need assistance with our employment application process, please email us at [email address available on click.appcast.io] to obtain assistance.

Vacancy posted 15 hours ago
