Get new jobs by email
- ...optimization Networking ulimits & OS-level tuning Experience with monitoring & alerting tools: Prometheus / Grafana Datadog Splunk ELK Strong SQL expertise and scripting (Python/Bash). Experience in Cloud/Container environments (AWS/Azure/GCP,...SuggestedFull time
- ...Observability & DevOps: Take ownership of the full lifecycle of your code, from CI/CD deployment to monitoring production health via Datadog. Technical Skills Primary Requirements Languages & Frameworks: Expert-level Java and Spring Boot. Data Layer:...Suggested
- ...and chaos testing. CI/CD: Automation and pipeline implementation (CI/CD) and integration with tools like Jenkins. Agile: Proven experience in Scrum/Kanban environments. Monitoring: Experience with DataDog and Splunk for application monitoringSuggested
- ...updates across OCI-hosted applications and services. Design and operate an end to end monitoring, alerting, and reliability stack (Datadog, OCI Metrics, PagerDuty) with SLO/SLA tracking and cost optimization. Monitor and respond to security alerts and events from OCI tools...Suggested
- ...using Azure Automation and CI/CD pipelines. Expertise in monitoring platforms such as SCOM, SquaredUp, or equivalent (e.g., Dynatrace, Datadog, Splunk). Knowledge of API integration and secure authentication. Process & Frameworks Working knowledge of ITIL 4 practices (...SuggestedRelocation
- ...LoadRunner, NeoLoad and/or LoadComplete Experience with monitoring tools (Dynatrace, Performance Center, Splunk, AppDynamics, JProfiler, Datadog, NinjaOne) Experience with data analytics tools (Google Analytics, Adobe Analytics, Open Web Analytics) Experience with...SuggestedLocal areaRemote work
- ...and services (AWS, Azure, GCP, etc.) in a production environment. Solid understanding of monitoring and logging tools, such as Datadog and Cloudwatch. Solid knowledge of containerization technologies (Docker, Kubernetes) and microservices architecture....Suggested
- ...systems and applications including RAID, NAS, SAN, Veeam backup software. Systems and network monitoring tools such as Logic Monitor, Datadog, etc. Network technologies and products including Cisco routers, Cisco switches, Fortinet SD-WAN, TCP/IP, SMTP, SNMP, and Cisco...SuggestedContract workWork at officeLocal area
- ...Service Now (event, incident, and workflow integrations) Experience with advanced monitoring and correlation platforms such as Datadog and Big Panda (highly desirable). Experience in configuring and managing: Monitoring profiles Polling policies...SuggestedTraineeshipNight shift
- ...Azure Automation and CI/CD pipelines. Expertise in monitoring platforms such as SCOM, SquaredUp, or equivalent (e.g., Dynatrace, Datadog, Splunk). Knowledge of API integration and secure authentication. Process & Frameworks Working knowledge of ITIL...Suggested
- ...and data quality tooling, such as: Great Expectations / Delta Expectations Databricks Quality Flows Monte Carlo Datadog Preferred Certifications: Databricks Certified Data Engineer Professional or Databricks Architect...SuggestedFull timeRemote work
- ...CD pipelines and automated deployment strategies. ~ Knowledge of Application Performance Monitoring (APM) tools like New Relic or Datadog. ~ Experience working within Agile development methodologies. ~ Familiarity with Domain-Driven Design (DDD) principles is a...Suggested
- ...reliability. Monitoring, Logging & Troubleshooting Implement and manage robust monitoring and logging systems using AWS CloudWatch, Datadog, Dynatrace, or custom solutions. Proactively identify, troubleshoot, and resolve infrastructure and application issues before...SuggestedRemote jobFlexible hours
$146k - $170k
...within a small, dynamic team Preferred Skills: Experience with AWS ECS and Lambda Familiarity with Open API, GitHub Actions, DataDog, and Kong Gateway Knowledge of the Stripe API Soft Skills & Culture Fit: Strong collaboration skills with a willingness to...Suggested- ...environments with Kubernetes and Docker. - Implement monitoring, logging, and observability solutions (Prometheus, Grafana, ELK, Datadog). - Embed DevSecOps practices into pipelines and infrastructure. - Ensure high availability, disaster recovery, and cost optimization...SuggestedFull timeFlexible hours
$114k - $133k
...and experience working with Kubernetes, Docker, and/or Lambda (preferred) ~ Experience with various monitoring tools like Splunk, Datadog, Elastic, New Relic, etc. (preferred) ~ Comfortable working in a dynamic, fast-paced startup environment and experience at a successful...Remote jobFull time- ...Ansible, Puppet, Chef. Containers & orchestration: Docker, Kubernetes, Helm. Monitoring: Prometheus, Grafana, ELK/EFK, Splunk, Datadog. Strong scripting skills in Python, Bash, PowerShell, or Go. Soft Skills : Strong analytical and problem-solving...Full timeH1bLocal areaRemote work
$57 per hour
...systems expertise * Advanced Kubernetes experience; CKA preferred * Experience with modern observability tools (Prometheus, Grafana, Datadog, Splunk, ELK, Jaeger) * CI/CD (GitLab), Linux OS expertise, troubleshooting, and RCA excellence * Strong communication and cross-...Contract work- ...Kubernetes). Experience with cloud platforms such as AWS, Google Cloud, or Azure. Familiarity with monitoring and logging tools like Datadog and OpenTelemetry. Qualifications Minimum of 5 years of experience in software development, with a focus on backend...Full timeWork at officeLocal areaFlexible hours
$46 per hour
...provide guidance to a larger community * 3+ years of AppDynamics experience * 3+ years of Batch Monitoring experience * 3+ years of Datadog experience * 3+ years of ServiceNow experience * Very strong candidate with Observability experience; Experience should include...Contract work- ...monitoring for model performance and drift Design comprehensive observability and monitoring using Prometheus, Grafana, ELK, or Datadog with distributed tracing, APM, and real-time alerting aligned to SLIs/SLOs Implement security best practices including least-...Live outWork at office
$180k - $250k
...in data center operations, colocation, or hyperscale environments Familiarity with observability stacks (Prometheus, Grafana, Datadog) in infrastructure contexts Prior experience in a forward deployed, solutions, or field engineering role at a startup or infrastructure...Full timeRemote workShift work- ...gateways. Monitoring & Observability: Experience with tools like Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Datadog, Dynatrace, etc., for monitoring application health and performance. Security: Awareness and application of secure coding...Full time
- ...optimize multiple Kubernetes clusters for enterprise-scale applications. Oversee observability and monitoring solutions using Datadog (APM, Infrastructure Monitoring, etc.) and other related tools. Partner with Engineering and QA teams to streamline CI/CD...Remote jobFull timeFor contractorsWork at office
- ...and Spring MVC. Deploy via Docker on AWS EKS Kubernetes with AWS ALB , using Harness/TeamCity for CI/CD. Monitor with Datadog and Elasticsearch ; handle bug fixing , log analysis, and root-cause resolution. Participate in daily scrum calls ,...Full time
- ...through Java/J2EE practices like annotations, reflection, Lombok, and Spring MVC patterns. Monitor app and service health using Datadog and Elasticsearch , performing root-cause analysis, bug fixing , and proactive optimizations based on production logs....Full time
- ...technologies to meet your team's objectives. Deploying, monitoring and maintaining a set of critical services in our infrastructure using Datadog and FireHydrant Signals. Building user experiences that adhere to a high level of accessibility standards. Applying...Full timeWork at officeMonday to Friday3 days per week
- ...ticketing, or reservation systems. Exposure to observability tools with focus on New Relic (other examples include CloudWatch, Datadog, or ELK stack). Understanding of security best practices for cloud-based microservices (IAM, least privilege, encryption). Experience...Remote jobFull timeTemporary workWork at officeWork from homeFlexible hours
- ...Observability & Reliability Implement OpenTelemetry for distributed tracing and metrics. Configure APM monitoring tooling (e.g Datadog ), including dashboards, alerts, and SLOs for application health and performance. Improve MTTR through automated incident...
- ...Implement automated unit, integration, and regression testing frameworks. Enable observability through monitoring tools such as Datadog, New Relic, or Prometheus. Mentor mid-level engineers and provide technical direction across delivery teams. Contribute to technical...Full timeLocal area
