Platform Engineer - Reliability & Scale at LangChain - San Francisco, CA
$145k - $195kVictrays
Platform Engineer – Reliability & Scale at LangChain – San Francisco, CA About LangChain At LangChain, our mission is to make intelligent agents ubiquitous. We help developers build mission-critical AI applications across the entire agent development lifecycle. Our open source frameworks — LangChain and LangGraph — see over 70+ million downloads per month. Developers rely on LangChain for composable integrations and LangGraph for controllable agent orchestration. Our commercial agent platform, consisting of LangSmith and LangGraph Platform, enables teams to build, test, run, and manage agents at scale across their organization. Founded in 2023, LangChain powers top engineering teams at companies like Replit, Lovable, Clay, Klarna, LinkedIn, and more. About the role In person 5 days/week in San Francisco, CA or New York, NY Join our platform engineering team as we scale LangSmith and LangGraph Platform products. You’ll architect and operate the critical systems that power our customers’ AI observability and LangGraph app deployments, working directly with cutting-edge technologies at the intersection of AI and distributed systems. Scale critical systems : Design and implement high throughput data-intensive systems supporting our flagship SaaS products (LangSmith and LangGraph Platform) Drive reliability : Build monitoring, alerting, and automated recovery systems that maintain high uptime Solve complex problems : Debug performance bottlenecks, optimize database queries, and architect solutions for distributed system challenges Shape platform strategy : Influence technical decisions around infrastructure, tooling, and operational practices as we grow from startup to enterprise scale Respond to incidents : Participate in on-call rotation with focus on post-incident learning, automation and prevention How to be successful in this role Experience : 5+ years building and operating production systems at scale Infrastructure expertise : Deep knowledge of Kubernetes, containerized infrastructure, cloud platforms (e.g. GCP) Database expertise : Production experience with OSS datastores (PostgreSQL, Redis, Kafka) Observability mastery : Hands-on experience with observability stacks (Datadog, Prometheus/Grafana, OpenTelemetry or similar) Programming proficiency : Strong hands-on software engineering skills (Python, Go, Rust) Operational mindset : “You build it, you run it, you own it” philosophy with the focus on sustainable practices Nice to Have Proficiency with analytical databases (e.g. ClickHouse) Background in high-growth startups Previous experience in AI/ML infrastructure Competitive salary and equity stake for role and stage of company. Commensurate with experience. Annual salary range: $145,000-$195,000 USD for Senior Engineers #J-18808-Ljbffr
$207k - $345k
...Senior Engineering Manager - Payroll Platform Rippling gives businesses one place to run... ...90 seconds. Based in San Francisco, CA, Rippling has raised $1.... ...You’ll Do Architect for Scale: Lead the predominant backend... ..." produces accurate, reliable outputs for high-stakes...SuggestedWork at office3 days per week$204k - $233k
...Staff DevOps Engineer San Francisco, CA (Hybrid) | Full-Time We're partnering... ..., developing large-scale systems that support the rapid... ...scale and evolve their cloud platform and developer... ...(GitHub Actions) to enable reliable, self-service deployments...SuggestedFull timeLocal area$117.2k - $229.2k
Senior Software Engineer - Azure Object Storage job at Microsoft Corporation. San Francisco, CA. Azure Object Storage team is looking for... ...storage system, designed to scale out and serve the entire... ...challenges related to scale and reliability for a distributed system....SuggestedLocal area$199k
As an Engineering Manager on Chime’s Data Platform, leading the Data Storage team, you will own... ...building and operating reliable, scalable, and secure... ...that support both high-scale batch workloads and latency... ...local laws, including the San Francisco Fair Chance Ordinance,...SuggestedFull timeLocal area$145k - $186k
...Staff Engineer, Cloud Engineering | Phoenix, AZ or San Francisco, CA (Hybrid, 3 days/week) A leading FinTech is in the final stretch of a multi-year cloud... ...critical systems • Championing observability at scale (Prometheus, Grafana, ELK, Splunk) • Leading root...SuggestedWork at officeFlexible hours3 days per week- AWS DevOps Engineer - INTL India (Bangalore) - Insight Global, San Francisco, CA A large IoT company is looking for an AWS focused DevOps Engineer to assist with multiple ongoing projects within their IAM and ProdSec groups. We are a company committed to creating diverse...
$80 per hour
...VMware Cloud Architect / Consultant - San Francisco, CA - $80/hr Location: San... ...design, modernize, and optimize large-scale infrastructure platforms. The ideal candidate brings deep experience... ..., infrastructure teams, and engineering organizations to develop roadmaps,...Contract workRelocationVisa sponsorship$300k - $405k
...mission is to create reliable, interpretable, and steerable... ...researchers, engineers, policy experts, and business... ...Research Data Platform team, you will design,... ...team on just a few large-scale research efforts. And... ...corporation headquartered in San Francisco. We offer competitive...Work at officeVisa sponsorshipFlexible hours$145k - $195k
...AI development company is seeking a Platform Engineer to enhance its LangSmith and LangGraph... ...maintain critical systems. With a focus on reliability and scalability, you will participate... ...on experience, and requires daily onsite work in San Francisco. #J-18808-Ljbffr...- ...OpenArt AI in San Francisco is seeking a Senior Platform & Reliability Engineer to design and improve the reliability of its infrastructure. The role emphasizes building and operating production systems while collaborating with product engineers to ensure platform scalability...
- Staff Software Engineer - Machine Learning Platform (San Francisco) Replicate makes it easy for software engineers to... ...packaging and deployment to serving, scaling, and monitoring. You’ll be... ...to maximize the utilization and reliability of our Kubernetes clusters and GPUs...Full timeWork at officeShift work3 days per week
- ...Brain Co., an applied AI startup in San Francisco, seeks a backend engineer to build scaleable technical capabilities supporting AI products for vital... ...designing backend services, optimizing systems for reliability, and collaborating across teams. Ideal candidates have...
$170k - $220k
...Staff Cloud Engineer / AWS / Hybrid in San Francisco San Francisco, California... ...undergoing a large-scale cloud... ...-critical payments platform from on-premise infrastructure... ...best practices around reliability and security, and... ...in San Francisco, CA. Required Skills...Full time3 days per week- Oracle Integration Cloud Consultant (San Francisco, CA) Job Description: Lead and support Oracle Integration Cloud (OIC) implementations and optimizations to ensure seamless data integration across various financial systems. Work closely with our internal IT, finance,...
- ...this way. The Role The Director of Platform & Reliability Engineering will lead a critical engineering... ...ensure Forge's platform capabilities scale with the business. Location: This... ...2-3 days a week in office in San Francisco, CA. Responsibilities This leader will...Full timeWork at officeLocal area2 days per week3 days per week
$102.5k - $188.9k
...Oracle Cloud Security - Senior Consultant job at Deloitte. San Francisco, CA. Our Deloitte Cyber team understands the unique challenges and... ...in Computer Science, Cyber Security, Information Security, Engineering, Information Technology, Management Information Systems, Finance...Visa sponsorship- ...on SS&C for expertise, scale, and technology. Job... ...Description Senior AI Platform Engineer Locations : San Francisco, CA / Jacksonville, FL /... ...to ensure secure, reliable, and multi-tenant deployment... ...agent frameworks (LangChain, LlamaIndex, DSPy, OpenAI...Ongoing contractCasual workFlexible hours
$160k - $225k
...A leading cybersecurity firm in San Francisco is looking for a Platform / Infrastructure Engineer to build and scale core systems for its data workflows. This role involves developing reliable and scalable backend systems, optimizing performance, and collaborating with...- ...Join Saviynt as a Staff Platform Engineer based in San Francisco, CA, where you will be instrumental in ensuring the reliability and scalability of our cloud-native SaaS platform. Your role involves designing and building core infrastructure services, primarily using...
- ...A technology company in San Francisco is seeking a DevOps Engineer to enhance the reliability and operational health of their production systems. You will set observability... ...proficiency in observability tools and cloud platforms. Join this innovative team to make a...
$190.7k - $329.6k
CloudKit Client Engineering Manager - Apple Cloud Services Location: San Francisco, CA Responsibilities Lead the team focused on CloudKit syncing and related features... ...track record, particularly on Apple platforms. Hands‑on software development experience in Objective...Relocation package- ...is the modern AI procurement platform for global enterprises.... ...their supplier relationships. Engineering at Levelpath: We are looking... ...Engineer to help us build and scale our public-facing products.... ...position is based onsite in our San Francisco office. In this role, you...Work at officeFlexible hours
- ...economics of data integration at scale. And now Airbyte is... ...be the infrastructure and reliability engineer on the Data Replication team... ...underpinning the Data Replication platform - Kubernetes clusters, CI/... ...: Onsite 5 days/week in San Francisco, CA If you find this role...Local area
- ...businesses. We’re based in San Francisco, CA, but built as a remote‑... ...the Role As a Senior Site Reliability Engineer (SRE) at Fieldguide, you will... ...available, and capable of scaling with rapid growth. You’ll... ...closely with product and platform engineering teams to define...Remote workWork from homeFlexible hours
- Principal Cloud Security Operations Engineer (Scripting, AWS, DevOps,... ..., CCIE Security, CEH) in San Francisco, CA AWS, CEH, CISA, CISSP,... ...operations (SOC). --Architect large-scale cloud environments using... .... --Educates product and platform teams on secure coding practices...Permanent employmentFull timeWork experience placementRemote workRelocation
$405k
...mission is to create reliable, interpretable, and steerable... ...researchers, engineers, policy experts, and business... ...own the foundational platform that powers all of... ...leadership to ensure Claude.ai scales gracefully as we grow.... ...headquartered in San Francisco. We offer competitive...Work at officeVisa sponsorshipFlexible hoursShift work$200k - $250k
A leading visual creation platform in San Francisco is seeking a Senior Owner of Stability and Infrastructure. This... ...leadership role demands expertise in service reliability to ensure the platform's performance as it scales. Responsibilities include setting reliability...$125k - $195k
...of exceptional, hands-on engineers to make this happen.... ...an Infrastructure & Site Reliability Engineer to design, build... ...compatible storage, VPNs Scale our observability platform: Build systems to ingest... ...advisors, and a lab/office in San Francisco, CA. Compensation & Benefits...Work at officeVisa sponsorshipNight shift$238k - $290k
...AI, an enterprise-grade platform, and deep domain expertise... ...investor support, we're scaling fast and defining a new category... ...As a Staff Software Engineer on the Site Reliability team at Harvey, you will... ...This role is based in San Francisco, CA. We use an in-person work...Relocation package- ...About the Team The Scaling team designs, builds, and operates critical... ...workloads, while remaining reliable and easy to use. About the... ...Site Reliability Engineer to own production-critical infrastructure... ...those laws, including the San Francisco Fair Chance Ordinance, the...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Platform Engineer - Reliability & Scale at LangChain - San Francisco, CA. Be the first to apply!
- client platform engineer San Francisco, CA
- platform engineer San Francisco, CA
- senior platform engineer San Francisco, CA
- platform engineering manager San Francisco, CA
- data platform engineer San Francisco, CA
- platform developer San Francisco, CA
- digital platform specialist San Francisco, CA
- director of digital platform San Francisco, CA
- platform product manager San Francisco, CA
- platform manager San Francisco, CA


