AI / ML Engineer (with Observability)
Staffing the Universe
We are seeking a passionate, hands-on AI/ML Engineer to accelerate our Enterprise Observability strategy. In this role you will design, build, and operationalize AI/ML capabilities that enhance end-to-end telemetry pipelines, anomaly detection, intelligent alerting, and proactive system resiliency.
You will work at the intersection of AI/ML engineering, Observability platforms, and automation, developing solutions that improve detection, diagnosis, and prevention of operational issues across distributed systems.
Key Responsibilities
• Design and deploy AI/ML models supporting anomaly detection, baselining, event correlation, and predictive operational analytics.
• Build and integrate AI-enabled capabilities into enterprise Observability platforms, including Grafana, APM/RUM tools, network telemetry systems, and data observability tools.
• Develop AI Agents that can autonomously triage issues, recommend corrective actions, and initiate automated remediation workflows to reduce recovery time and improve system resilience.
• Implement self-healing automation using AI-driven decisioning, integrating with orchestration frameworks, service APIs, and infrastructure automation pipelines.
• Engineer and maintain real-time and batch data pipelines using Snowflake ML Jobs, Snowflake Cortex, streams, tasks, and UDFs.
• Implement and manage OpenTelemetry-based telemetry ingestion for logs, metrics, traces, and spans across distributed systems.
• Build asynchronous Python APIs and services for model inferencing and operational integration.
• Enhance observability intelligence with AI-powered capabilities such as root-cause acceleration, chatbot/search enablement, and automated insights.
• Contribute to SLO/SLI modeling, Golden Signals instrumentation, and Observability NFR adoption.
• Collaborate across engineering, SRE, platform, and business teams to embed proactive intelligence and Observability standards throughout the ecosystem.
Required Skills & Qualifications
Core Technical Skills
• Strong proficiency in Python and data science/ML libraries: NumPy, Pandas, scikit-learn, TensorFlow, PyTorch, Matplotlib, Seaborn.
• Experience with Generative AI, LLM fine-tuning, prompt engineering, RAG pipelines, and LLM evaluation frameworks.
• Expertise in developing and deploying ML models in production (batch & streaming).
• Strong understanding of statistics, time-series modeling, and anomaly detection.
Observability & Telemetry
• Experience with OpenTelemetry for logs, metrics, traces, and spans.
• Familiarity with Observability concepts: Golden Signals, SLO/SLI design, APM, RUM, Synthetics, event correlation, baselining.
• Experience with Observability tools such as Grafana (Alloy agents, dashboards, ML capabilities), Dynatrace, Monte Carlo (Data Observability), Netscout, ThousandEyes, SolarWinds, NetBrain.
Cloud, Data & Platform
• Hands-on experience with AWS (SageMaker, Bedrock), Snowflake ML, Snowflake Openflow, and Snowflake AI Observability tooling.
• Experience building Snowflake data pipelines (streams, tasks, UDFs); experience with Cortex features is a plus.
• Strong understanding of distributed systems and microservices telemetry requirements.
Automation & Engineering Quality
• Experience with automation pipelines, CI/CD, and infrastructure-as-code patterns supporting Observability adoption.
• Ability to build asynchronous Python APIs or services for model inference and operational integration.
Preferred Qualifications
• Experience developing agentic AI systems that analyze telemetry, generate action recommendations, or execute automated operational responses.
• Experience building self-healing patterns, including automated rollback, service restarts, configuration corrections, and predictive maintenance.
• Experience with Snowflake ML workflows, Snowflake Cortex Agents, and data pipeline automation.
• Exposure to AI-enabled alerting, RCA automation, and operational self-healing concepts.
• Experience with large-scale operational telemetry and multi-cloud ecosystems.
Soft Skills
• Strong analytical thinking and problem-solving skills.
• Excellent communication skills for cross-functional collaboration with infrastructure, SRE, engineering, business, and leadership teams.
• Curiosity, a continuous-learning mindset, and passion for applied AI and Observability.
EEO: "Mindlance is an Equal Opportunity Employer and does not discriminate in employment on the basis of – Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans."


