Member of Technical Staff - Data & ML Infra Engineer
Moonlake AI
Introducing Moonlake, AI for creating world simulations.
Overview
- Embodied AI
- Robotics simulation
- Interactive 3D worlds
- World models
- Real-time generation
- AI infrastructure
- ML systems
- Distributed infrastructure
- GPU optimization
- Production AI deployment
- Performance engineering
Responsibilities
- Optimize large-scale model training and inference systems
- Improve GPU utilization, latency, throughput, and deployment efficiency
- Build infrastructure that supports real-time world-model and multimodal workloads
- Develop and optimize serving pipelines for frontier AI systems
- Work closely with research teams to productionize high-performance models
- Build scalable orchestration and observability systems for distributed AI infrastructure
- Improve reliability, rollout safety, autoscaling, and production monitoring
- Design systems that support fast experimentation without sacrificing stability
Technical areas
- CUDA / Triton kernels
- FlashAttention family
- Paged attention
- CUDA Graphs
- Memory optimization
- Kernel-level performance tuning
- TensorRT-LLM
- Triton Inference Server
- vLLM / TGI
- Continuous batching
- On-GPU KV cache reuse
- Speculative decoding / Medusa
- Mixture-of-agents routing
- FSDP / ZeRO
- Tensor parallelism
- Pipeline parallelism
- Expert parallelism
- NCCL tuning
- Multi-node GPU orchestration
- AWQ / GPTQ / FP8
- LoRA / DoRA serving
- Efficient deployment pipelines
- Ray
- Kubernetes
- Argo
- Autoscaling systems
- Canary deployments & rollback infrastructure
- A/B experimentation systems
- Observability stack: Prometheus, Grafana, OpenTelemetry
The difference between:
- 200 ms and 2 s latency
- 40% and 90% GPU utilization
- Stable rollout and catastrophic regression
...directly impacts the company's ability to train, deploy, and scale world-model systems. You'll help define the infrastructure foundation behind the next generation of interactive AI systems.
We are committed to being an on-site, in-person team currently based in San Francisco.
Vacancy posted more than 2 months ago
