Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI / ML Engineer (with Observability)

Staffing the Universe

AI/ML Engineer

We are seeking a passionate and hands-on AI/ML Engineer to accelerate our Enterprise Observability strategy. This role will design, build, and operationalize AI/ML capabilities that enhance end to end telemetry pipelines, anomaly detection, intelligent alerting, and proactive system resiliency.

You will work at the intersection of AI/ML engineering, Observability platforms, and automation, developing solutions that improve detection, diagnosis, and prevention of operational issues across distributed systems.

Key Responsibilities

• Design and deploy AI/ML models supporting anomaly detection, baselining, event correlation, and predictive operational analytics.

• Build and integrate AI-enabled capabilities into enterprise Observability platforms, including Grafana, APM/RUM tools, network telemetry systems, and data observability tools.

• Develop AI Agents that can autonomously triage issues, recommend corrective actions, and initiate automated remediation workflows to reduce recovery time and improve system resilience.

• Implement self-healing automation using AI-driven decisioning, integrating with orchestration frameworks, service APIs, and infrastructure automation pipelines.

• Engineer and maintain real-time and batch data pipelines using Snowflake ML Jobs, Snowflake Cortex, streams, tasks, and UDFs.

• Implement and manage OpenTelemetry-based telemetry ingestion for logs, metrics, traces, and spans across distributed systems.

• Build asynchronous Python APIs and services for model inferencing and operational integration.

• Enhance observability intelligence with AI-powered capabilities such as root-cause acceleration, chatbot/search enablement, and automated insights.

• Contribute to SLO/SLI modeling, Golden Signals instrumentation, and Observability NFR adoption.

• Collaborate across engineering, SRE, platform and business teams to embed proactive intelligence and Observability standards throughout the ecosystem.

Required Skills & Qualifications Core Technical Skills

• Strong proficiency in Python and data science/ML libraries: NumPy, Pandas, scikit learn, TensorFlow, PyTorch, Matplotlib, Seaborn.

• Experience with Generative AI, LLM fine tuning, prompt engineering, RAG pipelines, and LLM evaluation frameworks.

• Expertise in developing and deploying ML models in production (batch & streaming).

• Strong understanding of statistics, time series modeling, and anomaly detection.

Observability & Telemetry

• Experience with OpenTelemetry for logs, metrics, traces, spans.

• Familiarity with Observability concepts: Golden Signals, SLO/SLI design, APM, RUM, Synthetics, event correlation, baselining.

• Experience with Observability tools such as Grafana (Alloy agents, dashboards, ML capabilities), Dynatrace, Monte Carlo (Data Observability), Netscout, ThousandEyes, SolarWinds, NetBrain.

Cloud, Data & Platform

• Hands on with AWS (SageMaker, Bedrock), Snowflake ML, Snowflake/Openflow, Snowflake AI Observability tooling.

• Experience building Snowflake data pipelines (streams, tasks, UDFs) – plus for Cortex features.

• Strong understanding of distributed systems and microservices telemetry requirements.

Automation & Engineering Quality

• Experience with automation pipelines, CI/CD, and infrastructure as code patterns supporting Observability adoption.

• Ability to build asynchronous Python APIs or services for model inference and operational integration.

Preferred Qualifications

• Experience developing agentic AI systems that analyze telemetry, generate action recommendations, or execute automated operational responses.

• Experience building self-healing patterns, including automated rollback, service restarts, configuration corrections, and predictive maintenance.

• Experience in Snowflake ML workflows, Snowflake Cortex Agents, and data pipeline automation.

• Exposure to AI-enabled alerting, RCA automation, and operational self-healing concepts.

• Experience with large-scale operational telemetry and multi-cloud ecosystems.

Soft Skills

• Strong analytical thinking and problem solving.

• Excellent communication skills for cross functional collaboration with infrastructure, SRE, engineering, business, and leadership teams.

• Curiosity, continuous learning mindset, and passion for applied AI and Observability.

EEO: "Mindlance is an Equal Opportunity Employer and does not discriminate in employment on the basis of – Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans."

Vacancy posted 22 days ago
Similar jobs that could be interesting for youBased on the AI / ML Engineer (with Observability) in Coppell, TX vacancy
  •  ...NTT DATA, we know that with the right people on board...  ...), Google Vertex AI (Gemini, RAG, Vector Search...  ...PGVector, Vertex Matching Engine, Pinecone), Familiarity...  ...Sub eventing patterns. Observability stacks: OpenTelemetry,...  ...experience in AI/ML. 2 years Chatbot development... 
    Suggested
    Hourly pay
    Temporary work
    Remote work
    Flexible hours

    NTT Data Americas, Inc.

    Coppell, TX
    3 days ago
  •  ...our team, you'll work in a mission-focused environment with specialized teams, including Engineering, Threat Intelligence, Vulnerability Management,...  ...Responsibilities About the Role: The Cybersecurity AI_ML Engineer is responsible for developing, deploying, monitoring... 
    Suggested
    Work experience placement
    Work at office
    Visa sponsorship
    Flexible hours

    GM Financial

    Irving, TX
    4 days ago
  •  ...JOB SUMMARY Seeking a Sr. AI Engineer with deep expertise in building AI-driven and agentic automation...  ...patterns. • Implement and maintain observability stacks: OpenTelemetry, Prometheus/...  ...• 5+ years of strong experience in AI/ML. • Experience with LangGraph (graphs,... 
    Suggested

    Compunnel

    Coppell, TX
    4 days ago
  •  ...highly skilled and passionate AI/MLOps Engineer to design, build, and...  ...Design, develop, and deploy AI/ML applications using Azure Machine...  .... Collaborate with data scientists and data engineers...  ...monitoring tools and model observability best practices. Additional... 
    Suggested
    Contract work
    Work at office
    Local area
    Remote work

    Vaco

    Farmers Branch, TX
    4 days ago
  •  ...About the job Lead AI/ML Engineer Job Description: Position Summary: As a Lead ML/AI Engineer, you will drive the design...  ...Platform, leveraging and integrating the cloud native services with other standard operational and automation tools. Key... 
    Suggested

    Inizio Partners

    Irving, TX
    2 days ago
  •  ...JOB SUMMARY The Lead AI Engineer will be responsible for building AI-...  ...orchestration, and multi-agent systems, with ideal candidates having...  ...or Pub/Sub. • Utilize observability stacks including OpenTelemetry...  ...years of strong experience in AI/ML. • Experience with... 

    Compunnel

    Coppell, TX
    4 days ago
  •  ...8 + year We are seeking a highly skilled and experienced AI/ML Platform Engineer to build and manage our end to end Machine Learning (ML) and...  ...Modelling and their Deployment • Good knowledge of working with Git • Working knowledge of Google Agentic Framework... 
    Contract work

    ECHO IT SOLUTIONS INC .

    Farmers Branch, TX
    2 days ago
  •  ...part of our investment in technology, we are looking for a Lead AI/ML Engineer to play a key role in developing critical back-end systems...  ...transactions, and customer rewards. This individual will have experience with dynamic pricing engines with multiple pricing sources. Expect... 

    GameStop Texas LTD

    Grapevine, TX
    9 days ago
  • $143.5k - $275k

     ...Want in? Join the #VTeamLife. What you’ll be doing... The Director of AI/ML Engineering oversees the critical bridge between experimental AI science and highly scalable, production-grade systems with a heavy emphasis on Large Language Models (LLMs) and enterprise-... 
    Full time
    Temporary work
    Part time
    Work experience placement
    Work at office
    Work from home
    Shift work
    3 days per week

    Verizon

    Irving, TX
    4 days ago
  •  ...Accenture’s Global Responsible AI team within the Global Data &...  ...powerful, and more accessible. With these new opportunities come...  ...if you’re an experienced RAI Engineer with a Responsible AI background...  ...for a disability or religious observance, please call us toll free at 1... 
    Work experience placement
    Live in
    Work at office
    Local area

    Accenture

    Irving, TX
    1 day ago
  •  ...• Build and deploy full ML pipelines: data ingestion, feature engineering, model training, evaluation...  ...MLOps practices with MLflow/Kubeflow/SageMaker...  ...translate business needs into AI solutions; provide...  ...systems. • Experience with observability tools (Splunk, Grafana)... 

    ClifyX

    Irving, TX
    4 days ago
  •  ...Senior AI Engineer Opportunity At Schwab, you're empowered to make an impact on your career...  ...high-performing AI systems that align with Schwab's innovation strategy and...  ...special emphasis on reliability, monitoring, observability, and orchestration across products.... 
    Full time
    Work at office

    Charles Schwab

    Southlake, TX
    1 day ago
  •  ...Technology Centers (ATCs) is the engine for reinvention in our clients...  ...ATCs will provide our clients with seamless access to industry...  ...knowledge, the latest in Gen AI solutions, and tech expertise...  ...evaluation harnesses, and lifecycle observability. AI Platform Integration:... 
    Work experience placement
    Live in
    Work at office
    Local area
    3 days per week

    Accenture

    Irving, TX
    3 days ago
  •  ...AI Native Software Engineer (Senior / Lead IC Only) Location: 5205 North O'Connor Blvd, West Tower...  ...Production-deployed AI (not POCs) Hands-on with: Agents and/or RAG, Orchestration...  ...Infrastructure as Code Monitoring / observability tools Required Skills &... 

    RIT Solutions

    Irving, TX
    20 hours ago
  • $87.52k - $140.77k

    Job Description Role: Senior Software Engineer ( Gen AI) Location : Coppell, TX - Hybrid role...  ...s and Texas's "Best Places to Work." With over a decade of experience in implementing...  ...2025 focus includes: Enhancing AI and ML architecture for faster, more impactful... 
    Local area
    Remote work
    Relocation
    Flexible hours

    Blue Yonder

    Coppell, TX
    20 hours ago
  •  ...D & A Data Engineer As our Data Engineer, you will function as a consultant between our technology unit and other business units to understand...  ...and present ideas that enable them to solve those data problems with code. This is a hybrid role and must be driving distance to our... 
    Work at office

    AAA Auto Club Group

    Coppell, TX
    16 hours ago
  • $73.8k - $220.4k

     ...Centers (ATCs) are the engine for reinvention in our...  ...ATCs provide our clients with seamless access to...  ...knowledge, the latest in Gen AI solutions, and tech...  ...harnesses, and lifecycle observability. • Build RAG and multi...  ...with CI/CD across the ML and LLM lifecycle. This... 
    Work experience placement
    Live in
    Work at office
    Local area
    3 days per week

    Accenture

    Irving, TX
    20 hours ago
  • $73.8k - $220.4k

     ...Centers (ATCs) are the engine for reinvention in our...  ...ATCs provide our clients with seamless access to...  ...knowledge, the latest in Gen AI solutions, and tech...  ...harnesses, and lifecycle observability. • Build RAG and multi...  ...with CI/CD across the ML and LLM lifecycle. This... 
    Work experience placement
    Live in
    Work at office
    Local area
    3 days per week

    Accenture

    Irving, TX
    4 days ago
  • $73.8k - $218.8k

     ...Oracle Business Group AI Center of Excellence is...  ...You might come from engineering, consulting, product, or...  ...sales teams win faster with AI-generated content, live...  ...software engineering, AI/ML, or enterprise...  ...disability or religious observance, please call us toll free... 
    Work experience placement
    Live in
    Work at office
    Local area

    Accenture

    Irving, TX
    3 days ago
  • $73.8k - $220.4k

     ...Centers (ATCs) are the engine for reinvention in our...  ...ATCs provide our clients with seamless access to...  ...knowledge, the latest in Gen AI solutions, and tech...  ...You Are: An AI/ML Architect with strong...  ...harnesses, and lifecycle observability. Build LLM and agentic... 
    Work experience placement
    Live in
    Work at office
    Local area
    3 days per week

    Accenture

    Irving, TX
    2 days ago
  •  ...Title: Principal AI Software Engineer - Java Location: Westlake, TX - Onsite Type...  .... These roles are 100% hands-on, with direct ownership over architecture, coding...  ...system design, testing, and observability. Mentor senior engineers and influence... 
    Contract work

    Korn Ferry

    Westlake, TX
    3 days ago
  • $102.4k - $179k

     ...design, build, and deploy scalable AI and machine learning solutions...  ..., developing both traditional ML models and LLM-powered systems...  .... You will collaborate with cross-functional teams, translate...  ...degree in Computer Science, Engineering, Data Science, or a related field... 
    Full time

    Vizient

    Irving, TX
    20 hours ago
  • $70 per hour

     ...Position Title:  Sr Azure AI Engineer Location: Coppell, TX (Onsite, In Office Daily) Job...  ...workflows, and backend services Partner with product and business teams to identify...  ...architecture patterns ~ Exposure to AI/ML or generative AI integration in real-world... 
    Contract work
    Work at office
    Local area

    Addison Group

    Coppell, TX
    1 day ago
  •  ...seeking a highly motivated Principal Cloud Engineer to join our Observability Platform team within Fidelity...  ...systems on AWS (EKS, core services) with strong focus on reliability, performance...  ...Experience or interest in applying AI/ML techniques to observability (e.g., anomaly... 

    Fidelity Investments

    Southlake, TX
    1 day ago
  •  ...sponsorship for this position Principal AI Site Reliability Engineer, EI Production Services...  ...you will drive operational excellence, observability, and intelligent automation for mission...  ...This position requires a self-starter with strong communication skills, capable... 

    Fidelity Investments

    Westlake, TX
    4 days ago
  •  ...Position : Lead Observability Platform Engineer Location: Remote As a Lead Observability Platform Engineer, you will design, build, and...  ...the enterprise. In this role, you will partner closely with SRE, Cloud Engineering, CI/CD, Infrastructure, Security, and... 
    Work experience placement
    Remote work

    United IT Solutions

    Irving, TX
    20 hours ago
  •  ...looking for a Senior Applied AI Engineer to be part of revolutionizing...  ...system design, and integration with enterprise platforms—turning...  .... Ensure reproducibility, observability, and version control across model...  .... Partner with Applied ML Engineers and Software Engineers... 
    Full time

    Paradigm

    Irving, TX
    13 hours ago
  •  ...Senior Full Stack Engineer – AI/LLM Platform 5 days onsite Pittsburgh, PA / Farmers Branch, TX / Strongsville, OH / Birmingham, AL...  ...candidate will combine strong full-stack engineering capabilities with modern AI/ML system design experience to deliver intelligent, scalable,... 
    Contract work
    Work experience placement
    Work at office
    Flexible hours

    System One Holdings, LLC

    Farmers Branch, TX
    8 days ago
  •  ...Senior Software Engineer, Full Stack Chadds...  ...and back end, integrate with mission-critical partners...  ...iteration. Drive observability, performance,...  ...adopts. Leverage AI-assisted development practices...  ...Experience integrating ML-driven decisioning, scoring... 
    Contract work
    Work at office
    Remote work

    Flagship Financial Group

    Coppell, TX
    2 days ago
  •  ...entrepreneurial professionals across more than 30 countries. Title: AI/ML / Generative AI Engineer Location: Irving, Tx Strong Data...  ...on coding in the interview Realtime for given scenario with right solution approach Proficient in Data... 

    E-Solutions

    Irving, TX
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI / ML Engineer (with Observability). Be the first to apply!