Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

LLM AIOps Development Engineer - Data Center Networking

$202.16k - $368.22k

ByteDance

Responsibilitie

About the team Networking brings together innovative ideas and technologies from network architecture, software defined networking (SDN), network virtualization, switch software and hardware co-design, and high-speed networking, to create hyper-scale data-center networking solutions that power several of the most popular apps of the world such as Douyin and TikTok which serve hundreds of millions of users around the globe. Network Observation team is committed to building a world-leading hyperscale data center network infrastructure that supports hundreds of millions of users' real-time access and explosive growth of massive data volumes. We believe that the next generation of network operations will be fundamentally powered by artificial intelligence technologies, particularly Large Language Models (LLMs). We are seeking a passionate development engineer who combines deep networking expertise with innovative AIOps capabilities to join us in defining and building "autonomous" data center networks. Together, we will transform network operations from a reactive "firefighting" mode into a proactive, data-driven intelligent ecosystem with predictive and self-healing capabilities. Responsibilities: As a core member of our team, you will collaborate closely with our NetOps, SRE, and platform engineering teams to tackle the complexities of one of the world's largest data center networks. You will design and implement a closed-loop AIOps for NetWork platform, covering: - Build a Panoramic Network Observability Platform: Develop a streaming telemetry data pipeline for both physical and virtual networks, integrating multi-source data from gNMI, Netconf, IPFIX/NetFlow, and SNMP to provide a high-quality, real-time data foundation for AIOps. - Develop an Intelligent Diagnostics and Root Cause Analysis System: Apply machine learning and deep learning algorithms to perform anomaly detection, correlation analysis, and intelligent noise reduction on massive volumes of network metrics, logs, and events. Swiftly pinpoint root causes of failures across the entire stack, from optical transceivers and switch hardware to protocol adjacencies and application traffic. - Explore Innovative Applications of LLMs and Agents: - Intelligent Operations Assistant: Build a conversational chatbot powered by Retrieval-Augmented Generation (RAG) that understands natural language queries, automatically queries knowledge bases and monitoring data, and provides precise troubleshooting guidance and network status reports. - Automated Remediation and Smart Runbooks: Train operational Agents to safely and controllably invoke network change tools and APIs. Empower them to autonomously generate, recommend, or even execute remediation plans and emergency runbooks based on their understanding of failure scenarios. - Establish Capacity and Risk Prediction Capabilities: Forecast network capacity bottlenecks, high-risk links, and "sub-healthy" devices based on historical data and business growth models, enabling proactive scaling and preventative maintenance. - Forge a Rock-Solid Engineering System: Adhere to engineering best practices to design and develop a highly available and scalable AIOps platform. Guarantee the stability and performance of the entire pipeline, from data collection and model training to online inference and automated closed-loop actions.

Qualification

Minimum Qualifications: - Solid Fundamentals in Computer Science and Networking: A deep understanding of data center network architectures (e.g., Spine-Leaf Fabric), and proficiency in key protocols such as EVPN/VXLAN and BGP/OSPF. In-depth knowledge of the Linux network stack is essential. - Excellent Software Engineering Skills: Mastery of Golang or Python with outstanding coding and system design abilities. Familiarity with modern software development workflows, including microservices, containerization (Docker/Kubernetes), and CI/CD. - Rich Platform Development Experience: Practical experience in one or more of the following areas is highly desirable: - Big Data Processing: Familiarity with Kafka, Flink, ClickHouse/TSDB, and experience building real-time data pipelines and analytics systems. - Observability Technologies: Experience with Prometheus/OpenTelemetry, graph databases (e.g., Neo4j), and developing alert and event platforms. - A Passion for AIOps/ML/LLM Practices: - A keen interest in the latest advancements in Large Models and Agent technologies, with thoughtful insights or hands-on experience in their application to operations (e.g., RAG, tool use, safety evaluation). Preferred Qualifications: - Experience in operating or developing for hyperscale (100,000+ servers) data center networks. - Proven experience leading or making significant contributions to an LLM/Agent-based intelligent operations project with measurable business impact. - Active contributions to open-source communities such as SONiC, P4/PINS, eBPF, Prometheus, or OpenTelemetry. - In-depth research or practical experience in high-performance networking (RDMA/RoCE), SmartNICs (NIC Offload), or DPDK/eBPF. - Experience building network configuration and control systems (e.g., based on SONiC, gNMI, Netconf).

Job Information

[For Pay Transparency]Compensation Description (Annually)

The base salary range for this position in the selected city is $202160 - $368220 annually.


Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.


Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).


The Company reserves the right to modify or change these benefits programs at any time, with or without notice.


For Los Angeles County (unincorporated) Candidates:


Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:


1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;


2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and


3. Exercising sound judgment.

About U

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join ByteDance

Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect - and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.


As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.


Diversity & Inclusion


ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

Reasonable Accommodation

ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the LLM AIOps Development Engineer - Data Center Networking in Seattle, WA vacancy
  • $202.16k - $368.22k

     ...LLM AIOps Development Engineer - Data Center Networking Location: Seattle Employment Type: Regular Job Code: A220006 Responsibilities: About the team Networking brings together innovative ideas and technologies from network architecture, software defined... 
    Network
    Temporary work
    Local area

    Tik Tok

    Seattle, WA
    2 days ago
  • $132.1k - $178.8k

     ...Description AWS Data Center Capacity Delivery (DCCD)...  ...is looking for a Data Engineer to support data center...  ...software, hardware, and network engineers,...  ...Develop and integrate LLM-based solutions (e.g.,...  ...customers and software development teams to gather and document... 
    Network
    Flexible hours

    Amazon

    Seattle, WA
    1 day ago
  • $162k - $242k

     ...inference. Our stack is engineered for speed, scale,...  ...scale performance data warehouse. You...  ...across every data center in our global...  ...familiarity with networked systems and performance...  ...Apache Spark, Trino, llm-d, vLLM, or...  ...collaboration and enables the development of innovative... 
    Network
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Bellevue, WA
    2 days ago
  •  ...cable assemblies that support next-generation data centers, enterprise servers, storage systems, networking equipment, and high-speed computing environments...  ...We are seeking an experienced Connector Development Engineer to join our team and contribute to the design... 
    Network
    Work at office
    Remote work
    Worldwide

    Amphenol Corporation

    Seattle, WA
    5 days ago
  • $200k - $250k

     ...About The Role Controls Commissioning Engineer will take the lead in commissioning, startup...  ...5+ years in controls commissioning, data center MEP systems, or industrial automation (experience...  ...logic. Experience with industrial network protocols (Ethernet/IP, BACnet, Modbus)... 
    Network
    For contractors
    Local area

    Fluidstack

    Seattle, WA
    1 day ago
  • $165k - $242k

     ...Senior Software Engineer, Data Center Infrastructure Tooling CoreWeave is The Essential Cloud...  ...performance internal platform that gives network engineers, fleet engineers, and...  ...encourages collaboration and enables the development of innovative solutions to complex problems... 
    Network
    Temporary work
    Flexible hours

    CoreWeave

    Bellevue, WA
    3 days ago
  • $109k - $160k

     ...About The Role: The Data Platforms Team serves...  ...is responsible for the development of use cases,...  ...are seeking a senior engineer with specialization in...  ...the Linux storage and networking stacks. You can transform...  ...in our office and data center locations ~ A casual... 
    Network
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Bellevue, WA
    3 days ago
  • $136k - $184k

     ...Do you like to use network and Unix systems engineering to deliver simple, sustainable and repeatable solutions...  ...cloud running. We support all AWS data centers and all of the servers, storage,...  ...is looking for a Network Development Engineer to join our EDGE team. Network... 
    Network
    Worldwide
    Flexible hours

    Amazon

    Seattle, WA
    2 days ago
  • $136k - $184k

     ...Amazon's Event Network team is looking to hire Network Engineers to help scale and automate of one of the worlds...  ...seeking an experienced Network Development Engineer to join a highly skilled...  ...infrastructure, wired and wireless, and data center infrastructure equipment that... 
    Network
    Flexible hours

    Amazon

    Seattle, WA
    2 days ago
  • $159.1k - $215.3k

     ...design, operate, and implement networks of large scale? Would you...  ...connectivity between Amazon's data centers and services to design and...  ...software, hardware, and network engineers, supply chain specialists,...  ...is looking for a Network Development Engineer to join our EDGE team... 
    Network
    Worldwide
    Flexible hours

    Amazon

    Seattle, WA
    5 hours ago
  • $100k - $175k

     ...trusted advisor. Putting the customer at the center of every engagement, our mission is to...  ...‑sharing and more. Senior Solutions Engineer - Data Center Focus Our engineers are the...  ...maintenance vSphere Hyper‑V Virtual network connectivity Cloud (some understanding... 
    Network
    Flexible hours

    Compunet,-Inc.

    Seattle, WA
    4 days ago
  •  ...cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment...  ..., hardware, and network engineers, supply chain specialists, security...  ...during innovation and research and development projects. Work with regional... 
    Network
    Worldwide

    Amazon Data Services, Inc.

    Seattle, WA
    2 days ago
  • $165k - $242k

    About the Team The Business Systems Engineering team partners closely with Data Center Operations, Infrastructure, Facilities, and IT to design and scale the...  ...infrastructure security best practices (RBAC, network policies, pod security standards, admission controllers... 
    Network
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Bellevue, WA
    3 days ago
  • $250k - $417k

     ...leader in AI cloud infrastructure, is looking for a senior network engineer to join their Seattle team. The ideal candidate will have over...  ...10 years in IT and networking, with experience in designing Data Center networks and managing Next-Generation Firewalls.... 
    Network

    Lambda

    Seattle, WA
    1 day ago
  • Armada is seeking a Controls Engineer to take ownership of the controls layer for Modular Data Center. You will be responsible for integrating packaged equipment controls...  ...sequences, and familiarity with controls networking. The position is remote with a competitive salary... 
    Network
    Remote job

    Armada

    Bellevue, WA
    4 days ago
  •  ...cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment...  ..., electrical and mechanical engineers, as well as supply chain specialists...  ...Amazon is looking for a System Development Engineer to become part of a... 
    Network
    Temporary work
    Internship

    Amazon Data Services, Inc.

    Seattle, WA
    4 days ago
  • $113k - $175k

     .... Reporting to the Regional Engineering Manager, you will partner with...  ...customers, ensuring their network designs are robust, their deployments...  ...Management, and Software Development teams to represent the...  ...Arista solutions in large-scale Data Center, Campus, and WAN... 
    Network
    Remote work

    Arista Networks Inc

    Seattle, WA
    4 days ago
  •  ...talented Team. Job Title: Senior Data Engineer Location: Seattle, WA Job...  .... We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment...  ...business customers and software development teams to gather and document requirements... 
    Network

    Ampcus

    Seattle, WA
    1 day ago
  • $128k - $161k

     ...builders in the world. At DigitalOcean, Data Center Engineers play a critical role in building and...  ..., and scaling the servers and networking equipment that enable millions of developers...  ...engineers and contributing to team development What You'll Add To DigitalOcean 6-8+... 
    Network
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    Seattle, WA
    1 day ago
  •  ...wins across its portfolio. The role involves providing engineering support for DoD data center and cloud projects, requiring a Top Secret security clearance...  ...at least 10 years in IT, with proven capabilities in network and cloud design. #J-18808-Ljbffr Essnova Solutions,... 
    Network
    For contractors

    Essnova Solutions, Inc.

    Seattle, WA
    4 days ago
  • A leading technology company in Seattle seeks a Senior Software Engineer to join their AI Networking team. This role involves building ML tools for optimizing AI workloads across data centers, focusing on large-scale deep learning. Candidates should have a PhD or equivalent... 
    Network

    NVIDIA Corporation

    Seattle, WA
    5 days ago
  •  ...Position- System Administrator / Data Center Support Engineer Duration-Contract Location- Bellevue, W JD Data Center...  ...4+ Yrs Data Center Support Experience Knowledge of Networking Devices and Concept Experience on Windows Server Admin, AD... 
    Network
    Contract work
    Work experience placement
    Immediate start
    Remote work

    Syntricate Technologies

    Bellevue, WA
    5 hours ago
  • $136.6k - $184.8k

     ...cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment...  ..., hardware, and network engineers, supply chain specialists, security...  ...participate in the research and development of new technologies and designs... 
    Network
    For contractors
    Flexible hours

    Amazon

    Seattle, WA
    1 day ago
  •  ...Opportunity We are seeking a highly motivated and skilled Data Center Optical Engineer to lead work in customer environments and co-locations. The...  ...(MoPs), engineering documents, vendor documentation, and network diagrams. Read and interpret cut sheets and physical network... 
    Network
    Work at office
    Immediate start
    Remote work

    Ericsson

    Bellevue, WA
    5 days ago
  • $71k - $93.45k

     ...opportunity We are seeking a highly motivated and skilled Data Center Optical Engineer to lead work in customer environments and co-locations. The...  ...(MoPs), engineering documents, vendor documentation, and network diagrams. Read and interpret cut sheets and physical network... 
    Network
    Temporary work
    Work at office
    Immediate start
    Remote work

    Ericsson GmbH

    Bellevue, WA
    4 days ago
  • $183k - $247.6k

     ...Amazon Web Services (AWS) Hardware Engineering is a leading-edge product development team that creates enterprise...  ...00 fully featured services from data centers globally. Whether customers need...  ...and all of the servers, storage, networking, power, and cooling equipment that... 
    Network
    Local area
    Overseas
    Flexible hours

    Amazon

    Seattle, WA
    5 hours ago
  • $157.3k - $212.8k

     ...deadline: Jun 1, 2026 As a Cloud Hardware Development Engineer, you will be an end-to-end owner of...  ...) to bring these servers to the data center. After launch, you own the fleet — monitoring...  ..., and operations (compute, storage, network, GPU) Design and implement solutions... 
    Network
    Internship
    Local area
    Flexible hours

    Amazon

    Seattle, WA
    12 hours ago
  • $155.6k - $210.5k

     ...s low Earth orbit satellite broadband network. Its mission is to deliver fast, reliable...  ...will be performed.. As a Network Development Engineer at Amazon Leo, you will lead the...  ...launch sites, and mission operations centers worldwide. You will ensure high availability... 
    Network
    Contract work
    Work at office
    Immediate start
    Remote work
    Worldwide
    Flexible hours

    Amazon

    Bellevue, WA
    5 hours ago
  • $122k - $174k

     ...degree in Civil/Structural Engineering, or equivalent practical experience...  ...of everything we do. The Data Center Engineering team takes the...  ...lab mirrors a research and development department - cutting‑edge...  ..., Architectural, Telecoms, Networking, and Utilities. US base... 
    Network
    Full time
    Temporary work

    Google Inc.

    Seattle, WA
    3 days ago
  • $113k - $175k

     ...for key customers, focusing on post-sales activities including network design and automation. Candidates should hold a Bachelor's...  ...in Computer Science and possess strong experience in network engineering roles. The salary range is $113,000 to $175,000, with additional... 
    Network
    Remote work

    Arista Networks Inc

    Seattle, WA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to LLM AIOps Development Engineer - Data Center Networking. Be the first to apply!