Senior Infra Engineer: Observability
Railway
Job description Our core mission at Railway is to make software engineers higher leverage. We believe that people should be given powerful tools so that they can spend less time setting up to do, and more time doing. Many infrastructure platforms simply focus on how you deploy your singular application, and now how these applications function in concert. Questions like “How do you build systems for zero downtime deployment”, “How do you do service-to-service communications”, etc are usually left up to the engineers to define. At Railway, our goal is to be an all encompassing solution to all these problems. As such, we take special care as we define our networking infrastructure. Note: Networking falls under the platform engineering umbrella. If you’re specialized, we’d love to chat! That said, we’d also like it noted you’re probably going to do a lot of non-networking + platform things “But the world would be a better place if more engineers, like me, hated technology. The stuff I design, if I'm successful, nobody will ever notice. Things will just work, and will be self-managing” - Radia Perlman About The Role Build ingestion pipelines to consume 1M+ RPS streams of logs, metrics, and other telemetry Build scalable, fault tolerant alerting engines for notifying users, in real-time, of threshold breaches Craft rich backend observability APIs, working with product to build amazing experiences for instantly grokking their application Provide APIs to access realtime log/metrics streams to be consumed by the Dashboard and Product Teams Build Golang/Rust GRPC services from scratch capable of supporting tens of thousands of users, and the million+ to come. Define infrastructure that can be torn down, failed over, and reconstituted from scratch using principle of immutable infrastructure using Terraform and Ansible. Write Engineering Requirement Documents to take something from idea, to defined tasks, to implementation, to monitoring it’s success. Interface with our TypeScript and GraphQL edge to expose your microservice APIs for both internal and potentially external consumption This is a high impact, high agency role with direct effect on company culture, trajectory, and outcome. About You A strong understanding of distributed systems. You enjoy building fault tolerant, resilient, and scalable services Interests in VictoriaMetrics, ClickHouse, and other systems for building observability stacks from the ground up A solid intuition about how long your solutions will last. All systems age. In startups, we can hope for 2-3 orders of magnitude, or 12-18mo. The tact to implement your solution, creator monitors for it’s error boundaries, and document any requirements for when you’re not around A great sense of direction and prioritization when it comes to dealing with the ambiguity of an early stage startup A sense of grit to dive into a problem, implement a solution, scale that solution, and replace it when needed A great set of communication skills for getting your point across, solution implemented, and beyond We value and love to work with diverse persons from all backgrounds Benefits and perks At Railway, we provide best in class benefits. Great salary, full health benefits including dependents, strong equity grants, equipment stipend, and much more. For more details, check back on the main careers page . Beyond compensation, there are a few things that we believe that make working at Railway truly unique: Autonomy : We have very few meetings. Just a Monday and a Friday to go over the Company Board. We think your time is sacred, whether it's at work, or outside of work. Ownership : We're a company with a high ownership, high autonomy culture. We hope that you'll come in, help us, and over the course of many years do the best work of your life. When we bring you onboard, we expect you to change the company. Novel problems/solutions : We're a startup that's well funded, with cool problems, which lets us implement novel solutions! We abhor “busywork” and think, whether it's community, engineering, operations, etc there's always opportunity for creative and high leverage solutions. Growth : We want you to grow with us, but we know that talent is loaned, so when you figure out what area you want to grow in next, whether it's at Railway or outside, we'll make sure you land there. #J-18808-Ljbffr Railway
- ...health technology company based in NYC is seeking an experienced engineer to lead infrastructure efforts. You will manage and enhance... ...infrastructure, spearheading initiatives for compliance and observability while supporting the company's growth. The ideal candidate...SeniorFlexible hours
- EPAM Systems, Inc. is seeking a Senior Observability Engineer to resolve technical monitoring issues in New Relic and drive observability practices across platforms. This strategic role will develop capabilities for end-to-end observability, automation, and integration,...SeniorRemote workFlexible hours
- jobr.pro is seeking a Senior Site Reliability Engineer in New York, NY, to enhance platform reliability and engineering excellence. You will be instrumental in implementing observability, security, and CI/CD practices. This role involves coaching teams and optimizing workflows...Senior
- ...description Our core mission at Railway is to make software engineers higher leverage. We believe that people should be given powerful... ...use to interact with our fleet every day Build out internal observability and alerting so we catch fleet problems before customers feel...SeniorMonday to Friday
$144.2k - $288.4k
CVS Health® is seeking a Principal AIOps Engineer in Georgia, USA. This full-time role involves leading the AIOps strategy and operational... ...or production operations and experience with ServiceNow and observability platforms. The salary range for this position is $144,200 - $...SeniorFull time$144.2k - $288.4k
CVS Health is seeking a Principal AIOps Engineer in New York, NY to modernize IT operations with a focus on building an intelligent... ...experience, scripting skills in Python, and experience with observability platforms. This full-time position offers a competitive salary...SeniorFull time$145k - $200k
...Palantir Technologies is seeking a Senior Software Engineer in New York to design and build managed Kubernetes product offerings. The ideal candidate will have expertise in Golang and infrastructure automation tools, with 4+ years of experience in software development...Senior- A technology company based in the United States is seeking a Sr. Platform Engineer to manage AWS, GCP, and cloud infrastructure. In this role, you will plan monitoring and observability mechanisms, develop tooling in Rust, and ensure operations meet reliability standards...SeniorRemote jobFlexible hours
- Y99000 General Electric Company is seeking a Sr. Data Engineer to enhance automation and observability for the EDAS platform. You will be automating workflows and ensuring platform reliability through various observability tools. The ideal candidate should possess a Bachelor...SeniorRemote job
$160k - $240k
Bloomberg is seeking a Senior Software Engineer for its ClickHouse Infrastructure team in New York. You will design and implement ClickHouse platform services that provide analytics and real-time insights. Candidates should have 4+ years of experience in software engineering...Senior$160k - $220k
Aura Home, Inc. is looking for a seasoned backend engineer to join their infrastructure team in New York City. This role involves designing and operating backend systems for their Rails API, ensuring they can scale reliably while managing performance and security. The...SeniorFlexible hours- Overview Discover exciting DevOps job opportunities and connect with 28,396 DevOps professionals. Responsibilities As a Senior Infrastructure Engineer at Remotive, you will play a pivotal role in enhancing our cloud infrastructure, ensuring seamless deployment and...SeniorRemote work
$166.6k - $250.9k
...A prominent fintech company in New York seeks a Reliability Engineer to drive technical projects from concept to production. The ideal candidate has extensive expertise in PostgreSQL, experience with event streaming, and familiarity with tracing. You'll focus on improving...Senior- Railway is seeking a Software Engineer to build scalable services and manage complex distributed systems. This high-impact role offers an environment rich in autonomy and ownership, where engineers can thrive while addressing novel problems. The position emphasizes collaboration...Senior
- Luminare Health Benefits Inc. is seeking a Performance and Capacity Engineer to architect infrastructure stability and scalability. You will drive operational excellence by enhancing monitoring capabilities, ensuring systems are optimized for efficiency. With a focus on...SeniorRemote work
- A leading technology company is seeking a Senior Software Engineer to enhance its self-hosted product, ensuring seamless installation and operation for customers. With a collaborative culture, the role involves driving technical direction and mentoring team members. Ideal...SeniorRemote job
- Axon Enterprise is seeking a Senior NOC Engineer in New York City. In this role, you will resolve complex technical issues and enhance operational efficiency for emergency response technologies. Ideal candidates will have a minimum of 3 years' experience in a SaaS environment...SeniorNight shift
- ...A leading financial firm in New York is seeking a highly skilled Senior Infrastructure Security Engineer to design and enhance the security of their IT systems. This permanent role involves collaborating across teams, conducting security assessments, and developing security...SeniorPermanent employment
- A growing infrastructure company is seeking a Senior Systems Engineer to support the Department of Energy. This role involves guiding national labs and research institutions in deploying VAST's data platform for critical workloads. Candidates should have over 5 years in...Senior
$80k - $90k
Axon is seeking a Senior NOC Engineer in New York to enhance operational efficiency and support emergency response technologies. This role requires hands-on expertise in troubleshooting and infrastructure management, as well as a strategic mindset for developing the NOC...SeniorNight shift$150k - $180k
01031 TPC Civil - East in Midtown Manhattan is seeking a skilled Change Order Engineer to manage the entire lifecycle of change orders. This role involves analyzing project impacts, preparing cost estimates, and collaborating with various stakeholders to ensure accurate...Senior- ...Commure, located in New York, NY, is seeking a skilled backend engineer to join their Ambient Scribe team. This role involves building Ambient AI solutions to revolutionize healthcare technology. The successful candidate will contribute to a leading healthcare platform...Senior
$148.7k - $199.4k
...A leading entertainment technology company seeks a Senior Software Engineer to develop essential infrastructure and tools in New York, NY. You will enhance CI/CD processes for thousands of engineers, ensuring fast and reliable software delivery. With expertise in modern...Senior$123k - $130k
A leading home service provider in Idaho is seeking a Systems Engineer L4 responsible for ensuring the stability and performance of enterprise infrastructure. Key duties incluyen managing Active Directory, Okta, VMware, and Veeam in a predominantly Windows environment....Senior- Softswiss is seeking a hands-on System Engineer / DevOps - Senior to ensure the design, automation, and maintenance of scalable infrastructure and deployment pipelines. The ideal candidate will have strong Kubernetes experience, knowledge of configuration management, and...Senior
$110.76k - $158.23k
A compliance technology firm in the United States is seeking a Senior Site Reliability Engineer who will design, implement, and manage business-critical Linux infrastructure. This role involves collaborating with multiple teams to enhance system performance and reliability...Senior$120k - $200k
BIP is seeking experienced AI Systems Engineers in New York to develop AI infrastructure solutions. The ideal candidates have over 8 years in Python engineering, strong knowledge of vector databases, and the ability to optimize AI workflows. This role includes responsibilities...SeniorRemote job- Iron Mountain is looking for a Senior Systems Engineer to join its Global Digital Solutions Service Operations team in Royersford, Pennsylvania. This role includes managing IT operations, solving technical issues, and ensuring security compliance. Ideal candidates will...SeniorShift work
$150k - $200k
QCT LLC is seeking experienced Pre-Sales Engineers to support and grow in the AI and Cloud Hardware sectors, including Cloud Services Provider and Enterprise Data Center. You will report to the Sales Director and play a crucial role in driving growth. The ideal candidate...Senior- Clutch Canada is seeking a Senior Backend Engineer to contribute to the reliability of production systems. You will design resilient infrastructure and partner with various engineering teams to enhance operational excellence. The ideal candidate has strong software development...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Infra Engineer: Observability. Be the first to apply!
- senior cost analyst New York, NY
- senior computer engineer New York, NY
- senior electrical estimator New York, NY
- senior process manager New York, NY
- senior development engineer New York, NY
- senior program specialist New York, NY
- senior manager quality engineering New York, NY
- senior software test automation engineer New York, NY
- senior design technologist New York, NY
- senior director corporate development New York, NY

