AI Infrastructure Engineer
$190k - $270kTogether AI
AI Infrastructure Engineer
As an AI Infrastructure Engineer at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline, and mature automation to our operating environments and codebase. You specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems.
Responsibilities
- Participate in on-call rotation (Pagerduty) to respond to production incidents
- Build and run our infrastructure with Ansible, Terraform, and Kubernetes to enable scaling to a massive number of concurrent users
- Build monitoring systems to ensure the highest quality service for our customers
- Design and implement operational processes (such as deployments and upgrades)
- Debug production issues across all services and levels of the stack
- Identify improvements for the product architecture from the reliability, performance and availability perspectives
- Plan the growth of Together AI's infrastructure
Requirements
- 5+ years of professional AI Infra or related experience
- Bachelor's degree in Computer Science or a related field or equivalent work experience
- Knowledge of Ansible (roles, playbooks), Terraform, and Kubernetes
- Proficiency in programming/scripting languages
- Direct experience in monitoring and observability practices
- Knowledge of cloud services
- Ability to thrive in a collaborative environment involving different stakeholders and subject matter experts
About Together AI
Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure.
Compensation
We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $190,000 - $270,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.
Equal Opportunity
Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.
$150k - $200k
...AI Infrastructure Specialist As vCluster’s AI Infrastructure Specialist, you will work directly with customers at the earliest and most... ...next customer’s head start. Feedback Loop: Collaborate with Engineering and Product to surface recurring infrastructure challenges,...SuggestedRemote workFlexible hours- ...AI Infrastructure Engineer Spellbrush, the world's leading generative AI studio behind nijijourney, is looking for an AI Infrastructure Engineer to join us in building out end-to-end ML infrastructure to run our models on all platforms. What You'll Do Design...SuggestedWork experience placementWork at officeVisa sponsorship
- An innovative AI infrastructure startup is seeking a Sales Engineer to lead technical discovery and drive successful evaluations with clients. The ideal candidate will have significant experience in customer-facing technical roles focused on AI and machine learning infrastructure...SuggestedRemote work
- ...About Brain Co. Brain Co. is an applied AI startup co-founded by Jared Kushner and Elad Gil, and backed by leading Silicon... ...millions of people. About the Role: As our Security Engineer, Infrastructure, you'll secure the platform layer end-to-end including cloud...SuggestedWorldwide
$50 - $70 per hour
...Mercor is looking for a full-time Network Engineer in San Francisco to work with AI systems. You will manage network data, analyze behaviors, and create scripts for data processing. Ideal candidates should have experience in network engineering and programming skills in...SuggestedHourly payFull timeContract work- AI Chopping Block, Inc. seeks a Senior Software Engineer for their Agentic Infrastructure team in San Francisco. This role involves architecting and building AI systems that enable autonomous planning and execution across the platform. Ideal candidates have 4-7 years of...Remote jobFlexible hours
$190k - $270k
AI Chopping Block, Inc. is hiring an AI Infrastructure Engineer to ensure smooth operations of user-facing services and production systems in San Francisco. The ideal candidate will have over 5 years of relevant experience and a Bachelor's degree in Computer Science or...- ...Corp, based in San Francisco, is looking for a Core Engineer to design and operate foundational infrastructure for their autonomous agent platform. This role emphasizes... ...and systems thinking, crucial for ensuring AI reliability. Qualified candidates will have a strong...
- An innovative AI company is seeking a Software Engineer to develop infrastructure that supports AI training and inference workflows. This role requires strong object-oriented programming skills and a solid foundation in data structures and algorithms. The ideal candidate...
- Handshake is seeking a Senior Software Engineer for its Agentic Infrastructure team in San Francisco. You will build the backbone for AI agents, designing key systems that ensure functionality and safety across Handshake's platform. The ideal candidate has 4-7 years of...Remote jobFlexible hours
- An innovative AI lab is seeking an experienced engineer to manage and optimize large-scale training infrastructure. You will build core systems that support researchers, focusing on distributed training, performance optimization, and data pipelines. Ideal candidates should...
- AI Chopping Block, Inc. is seeking engineers to build and operate the next generation of compute infrastructure. You will handle large-scale clusters and high-performance networks while solving real-time operational challenges. Ideal candidates have experience in distributed...
- Roboflow in San Francisco is seeking a versatile Infrastructure Engineer to enhance our core infrastructure and scale our cloud operations. You will engage with cutting-edge AI technologies and collaborate with product, operations, and security teams. The role demands expertise...
- A well-funded AI infrastructure startup in San Francisco is seeking a Founding Engineer to design and scale distributed backend systems integral to training advanced AI agents. Ideal candidates will have experience in ML pipelines, systems thinking, and a strong foundation...
$190k - $270k
AI Chopping Block, Inc. is hiring an AI Infrastructure Engineer in San Francisco, California. This full-time role involves ensuring smooth operation of user-facing services and production systems, alongside building and running infrastructure with Ansible, Terraform, and...Full time- MintMCP, located in San Francisco, is looking for a versatile builder to own end-to-end features on our AI infrastructure. This role involves backend work and some frontend tasks, where you'll leverage AI tools to enhance productivity. The ideal candidate thrives in a horizontal...
- A California-based technology company is seeking a Software Engineer for Frontier AI Infrastructure to create secure, scalable backend systems and collaborate with government agencies. The ideal candidate must have an active secret clearance and a strong background in full...
- A leading AI fashion-tech company is seeking a Software Engineer Intern to focus on building infrastructure for AI systems. This role involves designing scalable models, developing APIs, and optimizing for performance and reliability. An ideal candidate will have a strong...InternshipImmediate start
- A leading AI research firm in San Francisco seeks a Staff Infrastructure Engineer to identify and resolve infrastructure bottlenecks and design large-scale systems for AI training. The ideal candidate has over 3 years of experience in infrastructure engineering and strong...
- ...Francisco, CA (Onsite | Remote) About Virtue AI Virtue AI sets the standard for... .... What You'll Do As an AI infra Engineer, you will own the reliability, scaling,... ...with product developers to align infrastructure and inference behavior with product requirements...Remote work
- ...Meet Eloquent AI At Eloquent AI, we're building the next generation of AI Operators... ...alongside world-class talent in AI, engineering, and product as we redefine the future... ...languages. ~ Strong knowledge of cloud infrastructure (AWS, GCP, or Azure) and scalable architectures...
- ...AI Infra Engineer We are looking for an AI Infra engineer to join our growing team. We work with Kubernetes, Slurm, Python, C++, PyTorch, and primarily on AWS. As an AI Infrastructure Engineer, you will be partnering closely with our Inference and Research teams to...
$216k - $270k
...As a Software Engineer on the Machine Learning Infrastructure team, you will build the "Operating System" for our large-scale GPU clusters. You will architect... ...that transforms raw compute into breakthrough AI. You will: Architect and scale a multi-tenant orchestration...Full time- ...Rapidata Job Opportunity Compute is no longer AI's largest bottleneck, it now is human knowledge and feedback. At Rapidata we... ...looking for a super driven person at the intersection of product, engineering and customer use cases, deeply understanding our platform and...Work at office
$190k - $270k
AI Chopping Block, Inc. in San Francisco is looking for an AI Infrastructure Engineer responsible for maintaining user-facing services and production systems. This role requires 5+ years of related experience, proficiency in Ansible, Terraform, and Kubernetes, and offers...$269.1k - $307.2k
...Distinguished AI Engineer (Agentic AI Platform) At Capital One, we are creating responsible and reliable AI systems, changing banking... ...customer experiences. Our investments in technology infrastructure and world-class talent - along with our deep experience in machine...Full timePart timeWork at officeLocal area$150k - $180k
...expertise, scale, and technology. Job Description Senior AI Platform Engineer Locations : San Francisco, CA / Jacksonville, FL /... ..., ensuring it seamlessly integrates with our big data infrastructure, and enabling scalable, intelligent, and autonomous data-...Ongoing contractCasual workFlexible hours$160k - $235k
...Senior AI Engineer, AI Platform San Francisco, CA; USA (Remote) Affinity stitches together billions of data points from massive datasets to create a powerful, accurate representation of the world's professional relationship graph. Based on this data, we offer our...Work at officeRemote workWorldwideFlexible hours2 days per week3 days per week$255k - $405k
A leading AI research firm in San Francisco is seeking a Software Engineer for the Agent Infrastructure team. This role involves building scalable systems for training AI models and launching agentic products. Candidates should have deep experience in AI infrastructure,...Work at office- Xterraai is looking for an AI Research Engineer to enhance its geospatial intelligence systems. This role involves building agent infrastructure, developing evaluation frameworks, and designing data systems while collaborating with researchers and geoscientists. The ideal...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Infrastructure Engineer. Be the first to apply!
- ai research engineer San Francisco, CA
- machine learning ai engineer San Francisco, CA
- ai engineer remote San Francisco, CA
- ai prompt engineer San Francisco, CA
- ai developer San Francisco, CA
- ai engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- senior ai engineer San Francisco, CA
- principal infrastructure engineer San Francisco, CA
- lead infrastructure engineer San Francisco, CA

