Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

LLM AIOps Development Engineer - Data Center Networking

$202.16k - $368.22k

Tik Tok

Responsibilities

About the team Networking brings together innovative ideas and technologies from network architecture, software defined networking (SDN), network virtualization, switch software and hardware co-design, and high-speed networking, to create hyper-scale data-center networking solutions that power several of the most popular apps of the world such as Douyin and TikTok which serve hundreds of millions of users around the globe. Network Observation team is committed to building a world-leading hyperscale data center network infrastructure that supports hundreds of millions of users' real-time access and explosive growth of massive data volumes. We believe that the next generation of network operations will be fundamentally powered by artificial intelligence technologies, particularly Large Language Models (LLMs). We are seeking a passionate development engineer who combines deep networking expertise with innovative AIOps capabilities to join us in defining and building "autonomous" data center networks. Together, we will transform network operations from a reactive "firefighting" mode into a proactive, data-driven intelligent ecosystem with predictive and self-healing capabilities. Responsibilities: As a core member of our team, you will collaborate closely with our NetOps, SRE, and platform engineering teams to tackle the complexities of one of the world's largest data center networks. You will design and implement a closed-loop AIOps for NetWork platform, covering: - Build a Panoramic Network Observability Platform: Develop a streaming telemetry data pipeline for both physical and virtual networks, integrating multi-source data from gNMI, Netconf, IPFIX/NetFlow, and SNMP to provide a high-quality, real-time data foundation for AIOps. - Develop an Intelligent Diagnostics and Root Cause Analysis System: Apply machine learning and deep learning algorithms to perform anomaly detection, correlation analysis, and intelligent noise reduction on massive volumes of network metrics, logs, and events. Swiftly pinpoint root causes of failures across the entire stack, from optical transceivers and switch hardware to protocol adjacencies and application traffic. - Explore Innovative Applications of LLMs and Agents: - Intelligent Operations Assistant: Build a conversational chatbot powered by Retrieval-Augmented Generation (RAG) that understands natural language queries, automatically queries knowledge bases and monitoring data, and provides precise troubleshooting guidance and network status reports. - Automated Remediation and Smart Runbooks: Train operational Agents to safely and controllably invoke network change tools and APIs. Empower them to autonomously generate, recommend, or even execute remediation plans and emergency runbooks based on their understanding of failure scenarios. - Establish Capacity and Risk Prediction Capabilities: Forecast network capacity bottlenecks, high-risk links, and "sub-healthy" devices based on historical data and business growth models, enabling proactive scaling and preventative maintenance. - Forge a Rock-Solid Engineering System: Adhere to engineering best practices to design and develop a highly available and scalable AIOps platform. Guarantee the stability and performance of the entire pipeline, from data collection and model training to online inference and automated closed-loop actions.

Qualifications

Minimum Qualifications: - Solid Fundamentals in Computer Science and Networking: A deep understanding of data center network architectures (e.g., Spine-Leaf Fabric), and proficiency in key protocols such as EVPN/VXLAN and BGP/OSPF. In-depth knowledge of the Linux network stack is essential. - Excellent Software Engineering Skills: Mastery of Golang or Python with outstanding coding and system design abilities. Familiarity with modern software development workflows, including microservices, containerization (Docker/Kubernetes), and CI/CD. - Rich Platform Development Experience: Practical experience in one or more of the following areas is highly desirable: - Big Data Processing: Familiarity with Kafka, Flink, ClickHouse/TSDB, and experience building real-time data pipelines and analytics systems. - Observability Technologies: Experience with Prometheus/OpenTelemetry, graph databases (e.g., Neo4j), and developing alert and event platforms. - A Passion for AIOps/ML/LLM Practices: - A keen interest in the latest advancements in Large Models and Agent technologies, with thoughtful insights or hands-on experience in their application to operations (e.g., RAG, tool use, safety evaluation). Preferred Qualifications: - Experience in operating or developing for hyperscale (100,000+ servers) data center networks. - Proven experience leading or making significant contributions to an LLM/Agent-based intelligent operations project with measurable business impact. - Active contributions to open-source communities such as SONiC, P4/PINS, eBPF, Prometheus, or OpenTelemetry. - In-depth research or practical experience in high-performance networking (RDMA/RoCE), SmartNICs (NIC Offload), or DPDK/eBPF. - Experience building network configuration and control systems (e.g., based on SONiC, gNMI, Netconf).

Job Information

[For Pay Transparency]Compensation Description (Annually)

The base salary range for this position in the selected city is $202160 - $368220 annually.


Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.


Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).


The Company reserves the right to modify or change these benefits programs at any time, with or without notice.


For Los Angeles County (unincorporated) Candidates:


Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:


1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;


2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and


3. Exercising sound judgment.


About TikTok

TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and we also have offices in New York City, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.


Why Join Us

Inspiring creativity is at the core of TikTok's mission. Our innovative product is built to help people authentically express themselves, discover and connect - and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and bring joy - a mission we work towards every day.


We strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. Every challenge is an opportunity to learn and innovate as one team. We're resilient and embrace challenges as they come. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company, and our users. When we create and grow together, the possibilities are limitless. Join us.


Diversity & Inclusion


TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.


TikTok Accommodation

TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at
Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the LLM AIOps Development Engineer - Data Center Networking in Seattle, WA vacancy
  • $132.1k - $178.8k

     ...Description AWS Data Center Capacity Delivery (DCCD)...  ...is looking for a Data Engineer to support data center...  ...software, hardware, and network engineers,...  ...Develop and integrate LLM-based solutions (e.g.,...  ...customers and software development teams to gather and document... 
    Network
    Flexible hours

    Amazon

    Seattle, WA
    4 days ago
  • $153k - $204k

     ...inference. Our stack is engineered for speed, scale,...  ...scale performance data warehouse. You...  ...across every data center in our global...  ...familiarity with networked systems and performance...  ...Apache Spark, Trino, llm-d, vLLM, or...  ...collaboration and enables the development of innovative... 
    Network
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Bellevue, WA
    5 days ago
  • $104.5k - $160k

     ...running. We manage the critical data center components - from servers and networking to power and cooling - that ensure...  ...and motivated Systems Engineer to join our team of Windows and...  ...networking components - Automation & Development: Create and maintain automation... 
    Network
    Work experience placement
    Flexible hours
    Shift work

    Amazon Web Services, Inc.

    Seattle, WA
    12 hours ago
  •  ...cable assemblies that support next-generation data centers, enterprise servers, storage systems, networking equipment, and high-speed computing environments...  ...We are seeking an experienced Connector Development Engineer to join our team and contribute to the design... 
    Network
    Full time
    Work at office
    Remote work
    Worldwide

    Amphenol TCS

    Seattle, WA
    4 days ago
  • $104.5k - $160k

     ...people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our...  ...a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations... 
    Network
    Work experience placement
    Worldwide
    Flexible hours

    Amazon

    Seattle, WA
    9 hours ago
  • $120k - $150k

     ...Senior Electrical Engineer Fleet Data Centers designs, builds and operates mega-scale data center campuses...  ...capable of upleveling data center development scale and operations in the face of...  ..., structural engineering, network, controls, and operations teams to integrate... 
    Network
    For contractors
    Work at office
    Local area
    Remote work

    Fleet Data Centers

    Seattle, WA
    8 hours ago
  • $165k - $242k

     ...Senior Business Systems Engineer- Data Center Systems II Livingston, NJ /Bellevue, WA / Sunnyvale, CA CoreWeave is The Essential Cloud for...  ...Implement infrastructure security best practices (RBAC, network policies, pod security standards, admission controllers) in... 
    Network
    Temporary work
    Casual work
    Work at office
    Immediate start
    Remote work
    Flexible hours

    CoreWeave

    Bellevue, WA
    6 hours ago
  • $109k - $160k

     ...About The Role: The Data Platforms Team serves...  ...is responsible for the development of use cases,...  ...are seeking a senior engineer with specialization in...  ...the Linux storage and networking stacks. You can transform...  ...in our office and data center locations ~ A casual... 
    Network
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Bellevue, WA
    1 day ago
  • $165k - $242k

     ...Senior Software Engineer, Data Center Infrastructure Tooling CoreWeave is The Essential Cloud...  ...performance internal platform that gives network engineers, fleet engineers, and...  ...encourages collaboration and enables the development of innovative solutions to complex problems... 
    Network
    Temporary work
    Flexible hours

    CoreWeave

    Bellevue, WA
    1 day ago
  • $136k - $184k

     ...Do you like to use network and Unix systems engineering to deliver simple, sustainable and repeatable solutions...  ...cloud running. We support all AWS data centers and all of the servers, storage,...  ...is looking for a Network Development Engineer to join our EDGE team. Network... 
    Network
    Worldwide
    Flexible hours

    Amazon

    Seattle, WA
    2 days ago
  • $159.1k - $215.3k

     ...design, operate, and implement networks of large scale? Would you...  ...connectivity between Amazon's data centers and services to design and...  ...software, hardware, and network engineers, supply chain specialists,...  ...is looking for a Network Development Engineer to join our EDGE team... 
    Network
    Worldwide
    Flexible hours

    Amazon

    Seattle, WA
    3 days ago
  • $136k - $184k

     ...Application deadline: Jun 1, 2026 Amazon Web Services Networking is searching for hands-on Network Development Engineer to join our network team that owns critical...  ...alertness and attention to detail. Travel to data center/network sites and Amazon/customer offices as... 
    Network
    Flexible hours

    Amazon

    Seattle, WA
    3 days ago
  • $113k - $175k

     .... Reporting to the Regional Engineering Manager, you will partner with...  ...customers, ensuring their network designs are robust, their deployments...  ...Management, and Software Development teams to represent the...  ...Arista solutions in large-scale Data Center, Campus, and WAN... 
    Network
    Remote work

    Arista Networks Inc

    Seattle, WA
    2 days ago
  •  ...Data Center Engineer The engineer in this position should have a wealth of experience engineering/installing datacenter equipment. The engineer...  ...infrastructure, overhead and underfloor racking, cabinets, network and server equipment, cable distribution, power (AC or DC),... 
    Network
    For contractors
    Work at office
    Local area

    ADEX

    Seattle, WA
    2 days ago
  •  ...talented Team. Job Title: Senior Data Engineer Location: Seattle, WA Job...  .... We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment...  ...business customers and software development teams to gather and document requirements... 
    Network

    Ampcus

    Seattle, WA
    4 days ago
  •  ...System Administrator - Data Center Support Location: Bellevue, WA (Onsite) Duration: Contract Experience: 6+ Years Job Description Data Center Support Experience Knowledge of Networking Devices and Concept Experience on Windows Server Admin, AD Troubleshooting... 
    Network
    Contract work
    Work experience placement
    Remote work

    Syntricate Technologies

    Bellevue, WA
    5 hours ago
  • $128k - $161k

     ...builders in the world. At DigitalOcean, Data Center Engineers play a critical role in building and...  ..., and scaling the servers and networking equipment that enable millions of developers...  ...engineers and contributing to team development What You'll Add To DigitalOcean 6-8+... 
    Network
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    Seattle, WA
    4 days ago
  • $160k - $200k

     ...Principal Architect Of Data Center Engineering Fleet Data Centers designs, builds and operates...  ...uniquely capable of upleveling data center development scale and operations in the face of...  ...mechanical, structural engineering, network, controls, and operations teams to... 
    Network
    For contractors
    Work at office
    Local area

    Fleet Data Centers

    Seattle, WA
    21 hours ago
  •  ...Position- System Administrator / Data Center Support Engineer Duration-Contract Location- Bellevue, W JD Data Center...  ...4+ Yrs Data Center Support Experience Knowledge of Networking Devices and Concept Experience on Windows Server Admin, AD... 
    Network
    Contract work
    Work experience placement
    Immediate start
    Remote work

    Syntricate Technologies

    Bellevue, WA
    3 days ago
  • $136.6k - $184.8k

     ...cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment...  ..., hardware, and network engineers, supply chain specialists, security...  ...participate in the research and development of new technologies and designs... 
    Network
    For contractors
    Flexible hours

    Amazon

    Seattle, WA
    4 days ago
  • $183k - $247.6k

     ...Amazon Web Services (AWS) Hardware Engineering is a leading-edge product development team that creates enterprise...  ...00 fully featured services from data centers globally. Whether customers need...  ...and all of the servers, storage, networking, power, and cooling equipment that... 
    Network
    Local area
    Overseas
    Flexible hours

    Amazon

    Seattle, WA
    3 days ago
  • $157.3k - $212.8k

     ...Description As a Cloud Hardware Development Engineer, you will be an end-to-end owner of storage...  ...) to bring these servers to the data center. After launch, you own the fleet - monitoring...  ..., and operations (compute, storage, network, GPU) Design and implement solutions... 
    Network
    Internship
    Local area
    Flexible hours

    Amazon

    Seattle, WA
    4 days ago
  • $129.2k - $174.8k

     ...cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment...  ..., hardware, and network engineers, supply chain specialists, security...  ...define customer requirements, development compelling product strategies to... 
    Network
    Flexible hours

    Amazon

    Seattle, WA
    5 days ago
  • $155.6k - $210.5k

     ...s low Earth orbit satellite broadband network. Its mission is to deliver fast, reliable...  ...will be performed.. As a Network Development Engineer at Amazon Leo, you will lead the...  ...launch sites, and mission operations centers worldwide. You will ensure high availability... 
    Network
    Contract work
    Work at office
    Immediate start
    Remote work
    Worldwide
    Flexible hours

    Amazon

    Bellevue, WA
    3 days ago
  •  ...Data Center Optical Engineer We are seeking a highly motivated and skilled Data Center Optical Engineer to join our team. The ideal candidate...  ...break-fix, remote hands services, hardware replacement, and network transport optical troubleshooting. The Optical Engineer... 
    Network
    Local area
    Remote work
    Shift work
    Night shift

    WIVERSE

    Bellevue, WA
    1 day ago
  • $275k - $300k

     ...Vice President – Facilities Engineering Fleet's owner for quality...  ...details, parts lists, and asset data standards Internal Audit...  ..., QA/QC, or data center operations or similar infrastructure...  ...Build strong partnerships and networks Location and Travel:... 
    Network
    Flexible hours

    Fleet Data Centers

    Seattle, WA
    7 hours ago
  • $57 per hour

     ...Team Introduction ByteDance Networking brings together innovative ideas...  ..., to create hyperscale data-center networking solutions that power...  ...gain marketable software development and/or network operation experience...  ...technologies to support AI/LLM applications. - Design and... 
    Network
    Hourly pay
    Internship
    Local area

    ByteDance

    Seattle, WA
    4 days ago
  • $202.16k - $368.22k

     ...AI/LLM Network Software Development Engineer Location: Seattle Team: Technology Employment Type: Regular Job Code: JE3HP Responsibilities...  ..., and high-speed networking, to create hyperscale data-center networking solutions that power several of the most popular... 
    Network
    Temporary work
    Local area

    ByteDance

    Seattle, WA
    4 days ago
  • $127.1k - $172k

     ...cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment...  ..., hardware, and network engineers, supply chain specialists, security...  ...program vision, and own the product development roadmap for critical segments... 
    Network
    Flexible hours

    Amazon

    Seattle, WA
    5 hours ago
  • $202.16k - $368.22k

     ...deduplication/clustering. - Responsible for data construction, instruction fine-tuning, CoT...  ...NLP, vision, multimodal, search, graph, LLM, etc. to provide support for governance business...  ..., and influencers. Model large-scale networks to support business scenarios like content... 
    Network
    Temporary work
    Local area
    Overseas

    Tik Tok

    Seattle, WA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to LLM AIOps Development Engineer - Data Center Networking. Be the first to apply!