Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Scalability Engineer - Observability

$110.4k - $213k

Capital Rx

About Judi Health

Judi Health is an enterprise health technology company providing a comprehensive suite of solutions for employers and health plans, including:

  • Capital Rx , a public benefit corporation delivering full-service pharmacy benefit management (PBM) solutions to self-insured employers,
  • Judi Health , which offers full-service health benefit management solutions to employers, TPAs, and health plans, and
  • Judi , the industry's leading proprietary Enterprise Health Platform (EHP), which consolidates all claim administration-related workflows in one scalable, secure platform.

Together with our clients, we're rebuilding trust in healthcare in the U.S. and deploying the infrastructure we need for the care we deserve. To learn more, visit

Location: Remote

Position Summary:

OurScalability team as a Senior Scalability Engineer focused on observability platform development and engineering productivity.In this role, you will define, own, and build Judi Health's organization-wide observability strategy, tooling, and platform products. Beyondmaintaininginfrastructure,you'llarchitect and develop a custom observability platform that gives engineering teams powerful, fast, and cost-effective visibility into every layer of our infrastructure-from application logs and metrics to distributed traces.You'llbuild production-grade internal products using React/TypeScript frontends with Python and Rust backends, creating tools that fundamentally improve how engineers at Judi Health debug, monitor, andoptimizetheir systems. Working closely with leadership and cross-functional teams, your work will be foundational to platform stability, performance optimization, and developer productivity across our rapidly growing healthcare platform.

Position Responsibilities:

In this role,you'llown the observability infrastructure that powers our engineering organization. You will:

  • Architect observability platform: Design, implement, andmaintainthe LGTM stack (Loki, Grafana, Tempo, Mimir/Prometheus) as the primary observability platform across all engineering teams, making architectural decisions that balance cost, performance, and developer experience.
  • Build internal observability products: Design and develop production-grade internal platform products with React/TypeScript frontends and Python/Rust backends that provide engineers with powerful log search, metrics visualization, and trace analysis capabilities.
  • Develop custom log indexing systems: Architect and build high-performance log indexing solutions using Rust that process logs and provide sub-second search across billions of log lines at a fraction of the cost.
  • Integrate SQL analytics for logs: Design and implement solutions leveraging AWS Athena or similar SQL query engines (DuckDB, ClickHouse) for ad-hoc log analysis and historical queries, enabling engineers to run complex SQL queries over S3-based log data for deep investigations and trend analysis.
  • Create advanced query interfaces: Build sophisticated web interfaces that allow engineers to query logs, metrics, and traces with features like saved queries, query templates, correlation analysis, and pattern detection, supporting both full-text search and SQL-based analytics.
  • Balance cloud-native and open-source: Architect solutions that thoughtfullyleverageboth AWS-managed services (CloudWatch, Athena, Kinesis) and open-source tooling (LGTM stack,Quickwit) tooptimizefor cost, performance, and operational flexibility based on use case requirements.
  • Integrate AWS observability: Design seamless integration between AWS CloudWatch Logs/Metrics and our custom observability platform, providing unified visibility across managed and self-hosted infrastructure.
  • Build intelligent alerting: Develop smart dashboards, monitors, and alerting systems that reduce noise, detect anomalies, and help teams respond to incidents quickly.
  • Partner with engineering teams: Work directly with product teams to integrate observability into their services,establishlogging and metrics standards, and instrument code effectively, serving as the observability subject matter expert.
  • Enable performance optimization: Provide the observability foundation that allows the Scalability team toidentifyperformance bottlenecks, track optimization impact, and measure platform stability with data-driven insights.
  • Establish observability standards: Define and document comprehensive observability standards including structured logging patterns, metric naming conventions, trace instrumentation, dashboard design principles, and query best practices.
  • Drive platform adoption: Lead workshops, create documentation, and build self-service tooling that democratizes observability across engineering, making it easy for teams to adopt best practices.
  • Demonstrate technical leadership: Mentor engineers on observability practices, lead architecture reviews for instrumentation approaches, and represent the Scalability team in cross-functional planning.
  • Work in an Agile/Scrum environment to continually deliver value to stakeholders and clients.
  • Code of Conduct: Responsible for adherence to the Capital Rx Code of Conduct including reporting of noncompliance.

Required Qualifications:

  • 10+ years of software engineering or infrastructure engineering experience withdemonstratedprogression into technical leadership roles.
  • Several years of experience leading technical initiatives, building platform products, or serving as a subject matter expert on observability infrastructure.
  • Strong experience with React/TypeScript for frontend development and Python (Flask/SQLAlchemy) for backend services.
  • LGTM stackexpertise: Deep production experience with Loki, Grafana, Tempo, and Prometheus/Mimir for logs, metrics, and distributed tracing at scale.
  • AWS observability: Extensive experience with AWS CloudWatch Logs and Metrics, including custom metrics, log insights, dashboard creation, and integration patterns.
  • SQL analytics for logs: Production experience with SQL-based log analytics using AWS Athena,DuckDB, or similar query engines for analyzing structured and semi-structured data at scale.
  • Cloud-native and open-source balance:Demonstratedability to architect solutionsleveragingboth managed cloud services and open-source tooling, understanding trade-offs between operational overhead, cost, flexibility, and vendor lock-in.
  • Search and indexing experience: Hands-on experience building or operating search systems using OpenSearch, Elasticsearch, Lucene, Tantivy, or similar search and analytics engines.
  • Performance-critical systems: Experience building high-performance systems that process large volumes of data efficiently (millions of log lines, high-cardinality metrics).
  • Systems thinking: Deep understanding of distributed systems,microservicesarchitectures, and the complex observability challenges they present.
  • Data at scale: Proventrack recordhandling high-volume structured and unstructured logging data,identifyingpatterns, and building efficient search/query solutions that perform well under load.
  • Product mindset: Ability to build internal platform products that engineers love to use, with attention to UX, performance, and reliability.

Preferred Qualifications:

  • Rust development experience: Production experience with Rust for building high-performance data processing, indexing, or search systems. Strong interest in learning Rust is acceptable if combined with systems programming experience in C/C++/Go.
  • Infrastructure as code: Experience with Terraform for managing observability infrastructure and AWS resources.
  • Additionalobservability platforms: Experience architecting or operating Datadog, New Relic, Splunk, or other enterprise observability platforms.
  • Advanced query languages: DeepexpertisewithPromQL,LogQL, SQL optimization, and query optimization for high-cardinality data.
  • Columnar storage formats: Experience with Parquet, ORC, or other columnar storage formats for efficient log storage and analytics on S3.
  • Incident management: Experience designing incident response workflows, postmortem processes, and SLO/SLI frameworks that drive reliability improvements.
  • Cost optimization: Track record of reducing observability costs whilemaintainingor improving capabilities (e.g., CloudWatch S3/custom indexing migration).
  • Data pipelines:Experience withstreaming data pipelines, ETL processes, or real-time data processing.
  • Distributed tracing: Deep knowledge ofOpenTelemetry, Jaeger, Zipkin, or distributed tracing architectures.
  • Gitexpertiseand experience working in a mono repository.
  • PreviousPharmacy Benefits Manager (PBM) or healthcare technology experience.
  • Experience building developer tools or internal platforms that improve engineering productivity.

This rangerepresentsthe low and high end of theanticipatedbase salary range for the NY - based position. The actual base salary will depend on several factorssuch as:experience, knowledge, and skills, and if the location of the job changes.

Nothing in this position description restricts management's right to assign or reassign duties and responsibilities to this job at any time.

This range represents the low and high end of the anticipated base salary range. The actual base salary will depend on several factors such as: experience, knowledge, skills, and location of the job.

Remote, US Salary Range $110,400—$213,000 USD

All employees are responsible for adherence to the Capital Rx Code of Conduct including the reporting of non-compliance. This position description is designed to be flexible, allowing management the opportunity to assign or reassign duties and responsibilities as needed to best meet organizational goals.

Judi Health values a diverse workplace and celebrates the diversity that each employee brings to the table. We are proud to provide equal employment opportunities to all employees and applicants for employment and prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, medical condition, genetic information, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

By submitting an application, you agree to the retention of your personal data for consideration for a future position at Judi Health. More details about Judi Health's privacy practices can be found at

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior Scalability Engineer - Observability in New York, NY vacancy
  •  ...mission at Railway is to make software engineers higher leverage. We believe that people...  ...logs, metrics, and other telemetry Build scalable, fault tolerant alerting engines for...  ...threshold breaches Craft rich backend observability APIs, working with product to build amazing... 
    Senior
    Monday to Friday

    RAIL-WAY INC

    New York, NY
    3 days ago
  • $149k - $186k

     ...an opening with your name on it FanDuel is looking for a Senior Observability Engineer to design, build, and mature the observability ecosystem...  ...partner closely with engineering and product teams to deliver scalable observability capabilities, serve as a subject matter... 
    Senior
    Full time
    Temporary work
    Local area
    Worldwide

    FanDuel

    New York, NY
    3 hours ago
  • $160k - $200k

     ...administration-related workflows in one scalable, secure platform. Together with our...  ...Join our Scalability team as a Senior Scalability Engineer focused on streaming and realtime systems...  ...processing, error handling, and observability. Partner with product teams: Work directly... 
    Senior
    Local area
    Remote work
    Flexible hours

    Judi Health

    New York, NY
    1 day ago
  •  ...Luminare Health Benefits Inc. is seeking a Performance and Capacity Engineer to architect infrastructure stability and scalability. You will drive operational excellence by enhancing monitoring capabilities, ensuring systems are optimized for efficiency. With a focus... 
    Senior
    Remote work

    Luminare Health Benefits Inc.

    New York, NY
    12 hours ago
  • $160k - $240k

     ...A global financial technology company in New York seeks a Senior Software Engineer to enhance its Buy-Side technology platform. You will own the...  ...software development lifecycle, ensuring the delivery of scalable solutions that influence global markets. The ideal candidate... 
    Senior

    Bloomberg

    New York, NY
    12 hours ago
  •  ...jobr.pro is seeking a Senior Site Reliability Engineer in New York, NY, to enhance platform reliability and engineering excellence. You will be instrumental in implementing observability, security, and CI/CD practices. This role involves coaching teams and optimizing... 
    Senior

    Jobr

    New York, NY
    12 hours ago
  •  ...health technology company based in NYC is seeking an experienced engineer to lead infrastructure efforts. You will manage and enhance...  ...infrastructure, spearheading initiatives for compliance and observability while supporting the company's growth. The ideal candidate... 
    Senior
    Flexible hours

    Amperos Health

    New York, NY
    3 days ago
  • Railway is seeking a Software Engineer to build scalable services and manage complex distributed systems. This high-impact role offers an environment rich in autonomy and ownership, where engineers can thrive while addressing novel problems. The position emphasizes collaboration... 
    Senior

    Railway

    New York, NY
    3 days ago
  • $83.43k - $222.48k

     ...The Hispanic Alliance for Career Enhancement is looking for a Senior Software Engineer in Georgia. You will lead technical initiatives ensuring the scalability and performance of enterprise systems. Your role involves designing and maintaining systems with a focus on innovation... 
    Senior
    Full time

    Hispanic Alliance for Career Enhancement

    New York, NY
    3 days ago
  • $140k - $160k

     ...Eliassen Group is looking for a Senior Software Engineer for a remote position focused on backend Java development. You will design, build, and maintain scalable systems while participating in the full product development lifecycle from design to ongoing support. The role... 
    Senior
    Remote work

    Eliassen Group

    New York, NY
    2 days ago
  • EPAM Systems, Inc. is seeking a Senior Operational Intelligence Developer to enhance the Elastic & Observability Platform. This role includes managing platform operations and ensuring optimal performance while supporting on-call rotations. Ideal candidates will possess... 
    Senior
    Remote job

    EPAM Systems, Inc.

    New York, NY
    3 days ago
  • $148.7k - $199.4k

    5014 Disney Entertainment & Sports LLC is seeking a Senior Software Engineer - AI and Observability in New York. You will lead the design of AI-driven systems crucial for Disney’s streaming services, ensuring reliability and performance. With a strong background in backend... 
    Senior

    5014 Disney Entertainment & Sports LLC

    New York, NY
    1 day ago
  • Tavily Inc. in New York City is seeking a Senior Site Reliability Engineer to manage Kubernetes clusters and own the full infrastructure. You will...  ...improve CI/CD pipelines and ensure systems are reliable and scalable. This role offers the chance to work on real scaling... 
    Senior

    Tavily Inc.

    New York, NY
    1 day ago
  •  ...Commerce.com US, Inc. is seeking an experienced Senior Software Engineer – Backend to join the fully remote engineering team. In this role, you...  ...will collaborate on architecture, design, and development of scalable applications while solving complex infrastructure problems.... 
    Senior
    Remote work

    Commerce.com US, Inc.

    New York, NY
    12 hours ago
  • $178k - $267k

     ...AlphaSense is looking for a passionate software engineer to join their AI & Search mission in New York. In this role, you'll architect and implement scalable services for AI and search products, improving reliability and cost efficiency. You should have solid back-end... 
    Senior

    BetterCloud

    New York, NY
    12 hours ago
  •  ...Highlevel is seeking a technical contributor to manage the core infrastructure of our CRM platform. This role involves designing scalable backend services and optimizing user interfaces. You will work closely with cross-functional teams to drive high-impact initiatives... 
    Senior
    Full time

    High Level Services

    New York, NY
    2 days ago
  •  ...Space Executive is seeking a Fullstack Engineer to develop core product experiences for their AI observability platform. This role encompasses frontend engineering, distributed systems, and applied AI. You will work on building fullstack features across TypeScript, React... 
    Senior
    Remote work

    Space Executive

    New York, NY
    2 days ago
  • $160k - $200k

    Numero is seeking experienced engineers to scale their backend infrastructure and develop new products. In a collaborative team of 6,...  ...Ideal candidates will have 6+ years of experience in building scalable systems, strong SQL skills, and a passion for AI tools. The position... 
    Senior

    Numero

    New York, NY
    12 hours ago
  • $148.7k - $199.4k

    5014 Disney Streaming Technology LLC is seeking a Sr Software Engineer to build high-performance, scalable software systems. The role involves designing backend architectures and mentoring junior developers while participating in code reviews and system support during... 
    Senior

    5014 Disney Streaming Technology LLC

    New York, NY
    3 days ago
  • $87.88k - $120k

     ...A leading health solutions provider is seeking a Senior Talent Acquisition Partner for a full-time remote position. This role involves leading AWS environment setups, building Infrastructure as Code using Terraform, and troubleshooting infrastructure issues. The successful... 
    Senior
    Full time
    Remote work

    Enlyte

    New York, NY
    3 days ago
  •  ...A leading tech firm is seeking a Sr. Performance Engineer for a 90-day contract-to-hire position. This fully remote role involves building a performance testing environment, leading various testing methods, and developing automation frameworks. The ideal candidate should... 
    Senior
    Contract work
    Remote work

    Elevait Solutions

    New York, NY
    3 days ago
  •  ...A technology company is seeking an Engineer to build and maintain software features across CLI, backend, and frontend components. The ideal candidate has strong experience with Rust and familiarity with cloud infrastructure such as Google Cloud Platform. This is a full... 
    Senior
    Full time
    Remote work

    Fuku

    New York, NY
    12 hours ago
  • $120k - $170k

    SAR Engineer - Advanced Radar Systems / Remote Sensing Location:...  ...to integrate algorithms into scalable, operational pipelines. Troubleshoot...  ...engineering, and Earth observation. Opportunities for...  ...flexibility across US locations. Seniority level Mid‑Senior level... 
    Senior
    Permanent employment
    Full time
    Remote work

    EVONA

    New York, NY
    3 days ago
  •  ...The Walt Disney Company (France) is seeking a Senior Software Engineer to lead the design of intelligent, AI-driven systems for its global streaming ecosystem. The role involves architecting solutions that reduce operational overhead and ensure system reliability across... 
    Senior

    The Walt Disney Company

    New York, NY
    1 day ago
  •  ...Avalara is seeking a Senior Software Engineer to enhance its compliance platform, focusing on AI integration for improved efficiency and delivery. This role requires strong English communication skills, hands-on experience with .NET (C#), and expertise in distributed... 
    Senior

    Socotra

    New York, NY
    3 days ago
  •  ...RekNomics is seeking a Senior Software Engineer in New York to help design, build, and scale modern software solutions for a growing technology platform. The ideal candidate will have over 5 years of experience, a strong background in software engineering, and a passion... 
    Senior

    RekNomics

    New York, NY
    3 days ago
  •  ...mission at Railway is to make software engineers higher leverage. We believe that people...  ...our fleet every day Build out internal observability and alerting so we catch fleet problems...  ...building fault‑tolerant, resilient, and scalable services, and you care about what... 
    Senior
    Monday to Friday

    RAIL-WAY INC

    New York, NY
    3 days ago
  • A technology company is seeking a Staff Site Reliability Engineer to ensure the reliability and performance of critical systems. The successful candidate will focus on building scalable infrastructure and enhancing system availability through automation and monitoring.... 
    Senior

    DevOpsChat

    New York, NY
    3 days ago
  • $83.43k - $222.48k

     ...The Hispanic Alliance for Career Enhancement is looking for a Senior Software Engineer based in the United States, Kentucky. This role involves leading technical initiatives to solve complex problems, designing and maintaining critical distributed systems, and mentoring... 
    Senior

    Hispanic Alliance for Career Enhancement

    Brooklyn, NY
    2 days ago
  • $170k - $190k

     ...Duetti, Inc. is seeking a Senior QA Automation Engineer to build and scale the quality engineering function. This role involves designing automated testing frameworks across products and defining quality standards to ensure reliability as the company grows. The ideal candidate... 
    Senior

    Duetti, Inc.

    New York, NY
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Scalability Engineer - Observability. Be the first to apply!