Senior Infra Engineer: Observability
RAIL-WAY INC
Job description Our core mission at Railway is to make software engineers higher leverage. We believe that people should be given powerful tools so that they can spend less time setting up to do, and more time doing. Many infrastructure platforms simply focus on how you deploy your singular application, and now how these applications function in concert. Questions like “How do you build systems for zero downtime deployment”, “How do you do service-to-service communications”, etc are usually left up to the engineers to define. At Railway, our goal is to be an all encompassing solution to all these problems. As such, we take special care as we define our networking infrastructure. Note: Networking falls under the platform engineering umbrella. If you’re specialized, we’d love to chat! That said, we’d also like it noted you’re probably going to do a lot of non-networking + platform things “But the world would be a better place if more engineers, like me, hated technology. The stuff I design, if I'm successful, nobody will ever notice. Things will just work, and will be self-managing” - Radia Perlman About The Role Build ingestion pipelines to consume 1M+ RPS streams of logs, metrics, and other telemetry Build scalable, fault tolerant alerting engines for notifying users, in real-time, of threshold breaches Craft rich backend observability APIs, working with product to build amazing experiences for instantly grokking their application Provide APIs to access realtime log/metrics streams to be consumed by the Dashboard and Product Teams Build Golang/Rust GRPC services from scratch capable of supporting tens of thousands of users, and the million+ to come. Define infrastructure that can be torn down, failed over, and reconstituted from scratch using principle of immutable infrastructure using Terraform and Ansible. Write Engineering Requirement Documents to take something from idea, to defined tasks, to implementation, to monitoring it’s success. Interface with our TypeScript and GraphQL edge to expose your microservice APIs for both internal and potentially external consumption This is a high impact, high agency role with direct effect on company culture, trajectory, and outcome. About You A strong understanding of distributed systems. You enjoy building fault tolerant, resilient, and scalable services Interests in VictoriaMetrics, ClickHouse, and other systems for building observability stacks from the ground up A solid intuition about how long your solutions will last. All systems age. In startups, we can hope for 2-3 orders of magnitude, or 12-18mo. The tact to implement your solution, creator monitors for it’s error boundaries, and document any requirements for when you’re not around A great sense of direction and prioritization when it comes to dealing with the ambiguity of an early stage startup A sense of grit to dive into a problem, implement a solution, scale that solution, and replace it when needed A great set of communication skills for getting your point across, solution implemented, and beyond We value and love to work with diverse persons from all backgrounds Benefits and perks At Railway, we provide best in class benefits. Great salary, full health benefits including dependents, strong equity grants, equipment stipend, and much more. For more details, check back on the main careers page . Beyond compensation, there are a few things that we believe that make working at Railway truly unique: Autonomy : We have very few meetings. Just a Monday and a Friday to go over the Company Board. We think your time is sacred, whether it's at work, or outside of work. Ownership : We're a company with a high ownership, high autonomy culture. We hope that you'll come in, help us, and over the course of many years do the best work of your life. When we bring you onboard, we expect you to change the company. Novel problems/solutions : We're a startup that's well funded, with cool problems, which lets us implement novel solutions! We abhor “busywork” and think, whether it's community, engineering, operations, etc there's always opportunity for creative and high leverage solutions. Growth : We want you to grow with us, but we know that talent is loaned, so when you figure out what area you want to grow in next, whether it's at Railway or outside, we'll make sure you land there. #J-18808-Ljbffr
- ...health technology company based in NYC is seeking an experienced engineer to lead infrastructure efforts. You will manage and enhance... ...infrastructure, spearheading initiatives for compliance and observability while supporting the company's growth. The ideal candidate...SeniorFlexible hours
- Ekloud Data Labs is seeking an experienced AIOps Engineer to enhance the reliability of cloud-hosted services. The role focuses on observability, automating incident responses, and optimizing CI/CD processes to improve operational intelligence. Ideal candidates will have...Senior
- ...jobr.pro is seeking a Senior Site Reliability Engineer in New York, NY, to enhance platform reliability and engineering excellence. You will be instrumental in implementing observability, security, and CI/CD practices. This role involves coaching teams and optimizing...Senior
- ...description Our core mission at Railway is to make software engineers higher leverage. We believe that people should be given powerful... ...use to interact with our fleet every day Build out internal observability and alerting so we catch fleet problems before customers feel...SeniorMonday to Friday
$144.2k - $288.4k
CVS Health is seeking a Principal AIOps Engineer in New York, NY to modernize IT operations with a focus on building an intelligent... ...experience, scripting skills in Python, and experience with observability platforms. This full-time position offers a competitive salary...SeniorFull time$144.2k - $288.4k
CVS Health® is seeking a Principal AIOps Engineer in Georgia, USA. This full-time role involves leading the AIOps strategy and operational... ...or production operations and experience with ServiceNow and observability platforms. The salary range for this position is $144,200 - $...SeniorFull time- ...A technology company based in the United States is seeking a Sr. Platform Engineer to manage AWS, GCP, and cloud infrastructure. In this role, you will plan monitoring and observability mechanisms, develop tooling in Rust, and ensure operations meet reliability standards...SeniorRemote workFlexible hours
- ...Space Executive is seeking a Fullstack Engineer to develop core product experiences for their AI observability platform. This role encompasses frontend engineering, distributed systems, and applied AI. You will work on building fullstack features across TypeScript, React...SeniorRemote work
- ...Y99000 General Electric Company is seeking a Sr. Data Engineer to enhance automation and observability for the EDAS platform. You will be automating workflows and ensuring platform reliability through various observability tools. The ideal candidate should possess a Bachelor...SeniorRemote work
$160k - $240k
...Bloomberg is seeking a Senior Software Engineer for its ClickHouse Infrastructure team in New York. You will design and implement ClickHouse platform services that provide analytics and real-time insights. Candidates should have 4+ years of experience in software engineering...Senior- ...Grafana Labs is seeking a Senior Product Manager for Infrastructure Observability to define product vision and strategy. You will be responsible for product roadmaps and collaborative efforts across engineering to monitor core services effectively. This remote role emphasizes...SeniorRemote work
- ...Overview Discover exciting DevOps job opportunities and connect with 28,396 DevOps professionals. Responsibilities As a Senior Infrastructure Engineer at Remotive, you will play a pivotal role in enhancing our cloud infrastructure, ensuring seamless deployment and...SeniorRemote work
- ...Grafana Labs is seeking a Senior Product Manager for Infrastructure Observability to define the vision and strategy for Grafana Cloud products. The role involves collaborating across globally distributed teams, managing product roadmaps, and ensuring customer satisfaction...SeniorRemote work
- ...Grafana Labs is seeking a Senior Product Manager for Infrastructure Observability. This fully remote position requires a strong leader to define vision and... ...responsibilities that include collaborating with engineering and conducting customer discovery, this role is crucial...SeniorRemote work
$160k - $220k
Aura Home, Inc. is looking for a seasoned backend engineer to join their infrastructure team in New York City. This role involves designing and operating backend systems for their Rails API, ensuring they can scale reliably while managing performance and security. The...SeniorFlexible hours- ...Luminare Health Benefits Inc. is seeking a Performance and Capacity Engineer to architect infrastructure stability and scalability. You will drive operational excellence by enhancing monitoring capabilities, ensuring systems are optimized for efficiency. With a focus...SeniorRemote work
- ...Scale is seeking a Senior Infrastructure Software Engineer to develop and scale core infrastructure for enterprise-grade Generative AI products. You will define architectural patterns and lead the infrastructure roadmap focused on compliance, privacy, and security. The...Senior
- ...A progressive educational technology firm in New York is seeking a DevOps Engineer to improve its infrastructure. The role requires 5+ years of experience in software engineering, with expertise in Java, Python, and proficiency in developing REST APIs. The ideal candidate...Senior
$162k - $219k
...Catalight Foundation seeks a Senior Infrastructure Engineer (AWS) to design, implement, and support AWS infrastructure. You'll leverage your expertise in cloud services, networking, and automation to ensure system reliability and security. With responsibilities ranging...SeniorFull timeRemote work- ...Axon Enterprise is seeking a Senior NOC Engineer in New York City. In this role, you will resolve complex technical issues and enhance operational efficiency for emergency response technologies. Ideal candidates will have a minimum of 3 years' experience in a SaaS environment...SeniorNight shift
- ...A leading technology company is seeking a Senior Software Engineer to enhance its self-hosted product, ensuring seamless installation and operation for customers. With a collaborative culture, the role involves driving technical direction and mentoring team members. Ideal...SeniorRemote work
- ...A leading blockchain technology firm is seeking a Senior Infrastructure Engineer to design and scale secure systems. This remote position demands extensive experience in cloud-native infrastructure, automation, and DevOps practices. Ideal candidates will have over 8 years...SeniorRemote work
$87.88k - $120k
...A leading health solutions provider is seeking a Senior Talent Acquisition Partner for a full-time remote position. This role involves leading AWS environment setups, building Infrastructure as Code using Terraform, and troubleshooting infrastructure issues. The successful...SeniorFull timeRemote work$150k - $180k
01031 TPC Civil - East in Midtown Manhattan is seeking a skilled Change Order Engineer to manage the entire lifecycle of change orders. This role involves analyzing project impacts, preparing cost estimates, and collaborating with various stakeholders to ensure accurate...Senior- Railway is seeking a Software Engineer to build scalable services and manage complex distributed systems. This high-impact role offers an environment rich in autonomy and ownership, where engineers can thrive while addressing novel problems. The position emphasizes collaboration...Senior
$110.76k - $158.23k
...Harbor-Compliance seeks a skilled Site Reliability Engineer to manage critical Linux infrastructure. This role involves designing resilient systems, implementing automation strategies, and ensuring system performance and reliability. Candidates should have 4-7 years of...Senior- ...Clutch Canada is seeking a Senior Backend Engineer to contribute to the reliability of production systems. You will design resilient infrastructure and partner with various engineering teams to enhance operational excellence. The ideal candidate has strong software development...Senior
$80k - $90k
Axon is seeking a Senior NOC Engineer in New York to enhance operational efficiency and support emergency response technologies. This role requires hands-on expertise in troubleshooting and infrastructure management, as well as a strategic mindset for developing the NOC...SeniorNight shift- A growing infrastructure company is seeking a Senior Systems Engineer to support the Department of Energy. This role involves guiding national labs and research institutions in deploying VAST's data platform for critical workloads. Candidates should have over 5 years in...Senior
- ...A leading financial firm in New York is seeking a highly skilled Senior Infrastructure Security Engineer to design and enhance the security of their IT systems. This permanent role involves collaborating across teams, conducting security assessments, and developing security...SeniorPermanent employment
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Infra Engineer: Observability. Be the first to apply!
- senior fund accountant New York, NY
- senior office manager New York, NY
- senior director ecommerce New York, NY
- senior automation controls engineer New York, NY
- senior accounts payable New York, NY
- senior brand designer New York, NY
- senior financial advisor New York, NY
- senior underwriter New York, NY
- senior cost analyst New York, NY
- senior business analyst contract New York, NY

