Senior Infra Engineer: Observability
Railway
Job description

Our core mission at Railway is to make software engineers higher leverage. We believe that people should be given powerful tools so that they can spend less time setting up and more time doing.

Many infrastructure platforms focus only on how you deploy a single application, and not on how these applications function in concert. Questions like “How do you build systems for zero-downtime deployment?” and “How do you do service-to-service communication?” are usually left up to the engineers to define. At Railway, our goal is to be an all-encompassing solution to these problems. As such, we take special care as we define our networking infrastructure.

Note: Networking falls under the platform engineering umbrella. If you’re specialized, we’d love to chat! That said, we’d also like it noted that you’re probably going to do a lot of non-networking + platform things.

“But the world would be a better place if more engineers, like me, hated technology. The stuff I design, if I'm successful, nobody will ever notice. Things will just work, and will be self-managing” - Radia Perlman

About The Role

- Build ingestion pipelines to consume 1M+ RPS streams of logs, metrics, and other telemetry
- Build scalable, fault-tolerant alerting engines for notifying users, in real time, of threshold breaches
- Craft rich backend observability APIs, working with product to build amazing experiences that let users instantly grok their applications
- Provide APIs to access real-time log/metrics streams to be consumed by the Dashboard and Product Teams
- Build Golang/Rust gRPC services from scratch, capable of supporting tens of thousands of users, and the million+ to come
- Define infrastructure that can be torn down, failed over, and reconstituted from scratch, following the principles of immutable infrastructure with Terraform and Ansible
- Write Engineering Requirement Documents to take something from idea, to defined tasks, to implementation, to monitoring its success
- Interface with our TypeScript and GraphQL edge to expose your microservice APIs for both internal and potentially external consumption

This is a high-impact, high-agency role with a direct effect on company culture, trajectory, and outcome.

About You

- A strong understanding of distributed systems. You enjoy building fault-tolerant, resilient, and scalable services
- An interest in VictoriaMetrics, ClickHouse, and other systems for building observability stacks from the ground up
- A solid intuition about how long your solutions will last. All systems age. In startups, we can hope for 2-3 orders of magnitude, or 12-18 months
- The tact to implement your solution, create monitors for its error boundaries, and document any requirements for when you’re not around
- A great sense of direction and prioritization when it comes to dealing with the ambiguity of an early-stage startup
- A sense of grit to dive into a problem, implement a solution, scale that solution, and replace it when needed
- A great set of communication skills for getting your point across, getting your solution implemented, and beyond

We value and love to work with diverse persons from all backgrounds.

Benefits and perks

At Railway, we provide best-in-class benefits: great salary, full health benefits including dependents, strong equity grants, equipment stipend, and much more. For more details, check back on the main careers page.

Beyond compensation, there are a few things that we believe make working at Railway truly unique:

- Autonomy: We have very few meetings. Just a Monday and a Friday to go over the Company Board. We think your time is sacred, whether it's at work or outside of work.
- Ownership: We're a company with a high-ownership, high-autonomy culture. We hope that you'll come in, help us, and over the course of many years do the best work of your life. When we bring you onboard, we expect you to change the company.
- Novel problems/solutions: We're a well-funded startup with cool problems, which lets us implement novel solutions! We abhor “busywork” and think that, whether it's community, engineering, operations, etc., there's always opportunity for creative and high-leverage solutions.
- Growth: We want you to grow with us, but we know that talent is loaned, so when you figure out what area you want to grow in next, whether it's at Railway or outside, we'll make sure you land there.

