Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior AI Data Pipeline Engineer

42dot

We Are Looking For The Best

42dot AI . (PB) GPU, AI.

At 42dot, our AI Data Pipeline Engineer architect and scale global data pipelines that ingest and process data from worldwide sources. You will design and operate high-throughput systems to reliably deliver petabyte-scale data to our large-scale GPU infrastructure, powering mission-critical AI workloads.

Responsibilities
  • Design and build high-performance, scalable data pipelines to support diverse AI and Machine Learning initiatives across the organization.
  • Architect and implement multi-region data infrastructure to ensure global data availability and seamless synchronization.
  • Develop flexible pipeline architectures that allow for complex branching and logic isolation to support multiple concurrent AI projects.
  • Optimize large-scale data processing workloads using Databricks and Spark to maximize throughput and minimize processing costs.
  • Maintain and evolve the containerized data environment on Kubernetes, ensuring robust and reliable execution of data workloads.
  • Collaborate with AI researchers and platform teams to streamline the flow of high-quality data into training and evaluation pipelines.
Qualifications
  • AI/ML
  • Apache Spark Databricks
  • Apache Airflow
  • Kubernetes
  • Apache Kafka
  • Python
  • Best Practices
  • Extensive professional experience in building and operating production-grade data pipelines for massive-scale AI/ML datasets.
  • Strong proficiency in distributed processing frameworks, particularly Apache Spark and the Databricks ecosystem.
  • Deep hands-on experience with workflow orchestration tools like Apache Airflow for managing complex dependency graphs.
  • Solid understanding of Kubernetes and containerization for deploying and scaling data processing components.
  • Proficiency in distributed messaging systems such as Apache Kafka for high-throughput data ingestion and event-driven architectures.
  • Expert-level programming skills in Python for system-level optimizations.
  • Strong knowledge of cloud-native services and best practices for building secure and scalable data infrastructure.
  • Logical approach to problem-solving with the persistence to identify and resolve root causes in complex, large-scale systems.
  • Strong communication skills to effectively collaborate with cross-functional teams and external partners.
Preferred Qualifications
  • / (Latency)
  • Ray AI
  • Spark Streaming Flink /(Near Real-time)
  • Terraform Infrastructure as Code(IaC)
  • ML (MLOps)
  • Experience in architecting global, multi-region data pipelines and solving challenges related to cross-border data transfer and latency.
  • Practical experience or a strong interest in implementing distributed computing frameworks like Ray for AI workloads.
  • Experience in building real-time or near-real-time pipelines using Spark Streaming or Flink.
  • Familiarity with Infrastructure as Code (IaC) tools such as Terraform to manage complex data environments.
  • Understanding of the end-to-end ML lifecycle (MLOps) and how data infrastructure supports model experimentation and deployment.
Interview Process
  • Resume Screening - Coding Test - Virtual Interview (approximately 1 hour) - Onsite or Virtual Interview (approximately 3 hours) - Final Offer
  • Please note that the interview process may vary depending on the position and is subject to change based on scheduling and other circumstances.
  • Interview schedules and results will be communicated individually via the email address provided in your application.
Additional Information
  • Please upload all required documents in PDF format.
  • Veterans and applicants eligible for employment protection will receive preferential consideration in accordance with applicable laws and regulations.
  • In compliance with the Act on Employment Promotion and Vocational Rehabilitation for Persons with Disabilities, registered individuals with disabilities will receive preferential consideration.
  • 42dot does not accept unsolicited resumes from search firms. We will not pay any fees for resumes submitted without prior agreement.
  • A 3-month probationary period may apply.
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior AI Data Pipeline Engineer in United States vacancy
  • $125k - $180k

    CrowdStrike Holdings, Inc. is looking for a Sr. Data Pipeline Engineer to join their remote team. The role involves building and optimizing data pipelines, ensuring reliability, and collaborating with global engineering teams. The ideal candidate will have 12+ years of... 
    Senior
    Remote job

    CrowdStrike Holdings, Inc.

    California, MO
    2 days ago
  • Role, Inc. is seeking a Sr. Data Pipeline Engineer in the United States to join a global team responsible for building and optimizing data ingestion pipelines. This role involves ensuring the scalability and reliability of the enterprise data platform. The ideal candidate... 
    Senior

    Role, Inc.

    New York, NY
    1 day ago
  • $125k - $180k

    CrowdStrike, Inc. is looking for an experienced Sr. Data Pipeline Engineer to join their global team of data engineers. The role involves building...  ...pipelines to integrate diverse data sources and drive AI-driven insights. The ideal candidate will possess over 12 years... 
    Senior

    CrowdStrike, Inc.

    California, MO
    5 days ago
  • $133k - $254k

     ...We are looking for the best About Us 42dot is a mobility AI company committed to solving mobility challenges with software...  ...self‑managing urban transportation operating system. Our AI Data Pipeline Engineers build up the core data processing pipelines and datasets readiness... 
    Senior
    Work experience placement

    42dot

    Sunnyvale, CA
    5 days ago
  • Fullstory is hiring a Senior Data Engineer for a hybrid role based in Atlanta. You will design and optimize data pipelines processing billions of records daily, collaborating with teams to unlock insights through advanced data systems. Your responsibilities include maintaining... 
    Senior

    Fullstory

    Atlanta, GA
    5 days ago
  • $125k - $180k

    CrowdStrike Holdings, Inc. is looking for an experienced Sr. Data Pipeline Engineer to join their global team. This role will focus on building and...  ...processing pipelines essential for enabling analytics and AI-driven insights. Ideal candidates will have 12+ years of experience... 
    Senior

    CrowdStrike Holdings, Inc.

    New York, NY
    1 day ago
  • $160k - $170k

     ...Fullstory's rich digital experience data directly into customers'...  ...their warehouses, in their AI workflows, and in the tools...  ...teams already use. As our Senior Data Engineer, you will report to the Senior...  ...will design and optimize pipelines that process 30 billion+ records... 
    Senior
    Work at office
    Flexible hours
    1 day per week

    Fullstory

    Atlanta, GA
    10 days ago
  • midpage AI Inc. is hiring a senior software engineer to develop the largest case law dataset, impacting our lawyer-...  ...platform. The role involves building pipelines, ensuring the reliability of web...  ...technical generalist comfortable with data pipelines. Our competitive cash,... 
    Senior
    Visa sponsorship

    midpage AI Inc.

    New York, NY
    2 days ago
  • BankUnited is seeking a Pipeline & Integration Engineer to develop, maintain, and support ETL pipelines, primarily using Informatica. This role ensures reliable data pipelines and collaborates with senior engineers on data quality. Candidates should have at least 4 years... 
    Senior

    BankUnited

    Florida, NY
    1 day ago
  • Health Care Service Corp. is seeking a highly skilled Cribl Engineer to design, implement, and optimize data pipelines for observability and security platforms. This role will manage Cribl Stream/Edge deployments and enable efficient log routing and transformation. The... 
    Senior

    Health Care Service Corp.

    Richardson, TX
    1 day ago
  • Motion Recruitment Partners LLC is seeking a Data Pipeline Engineer for a long-term contract opportunity in Charlotte, NC or Irving, TX. You will work at a well-known Financial Services Company, developing and optimizing ETL/ELT workflows and data pipelines using cloud... 
    Senior
    Long term contract

    Motion Recruitment Partners LLC

    Charlotte, NC
    2 days ago
  • $183k - $276k

    Amplitude is seeking a Senior Software Engineer to join their Data Pipeline team in San Francisco. You will tackle complex infrastructure challenges and collaborate with product teams to shape their roadmap. Ideal candidates have at least 5 years of Software Engineering... 
    Senior
    Flexible hours

    Amplitude

    San Francisco, CA
    2 days ago
  • A leading zero trust security company in Palo Alto, CA seeks a Principal Software Engineer specializing in data pipelines. This role involves collaboration with engineers to build internal data systems, design traffic analysis pipelines, and mentor team members. Ideal candidates... 
    Senior

    xage, inc

    Palo Alto, CA
    5 days ago
  • A leading behavioral data platform is seeking a Senior Software Engineer to enhance their ingestion pipeline. This hybrid position requires one day of in-office attendance weekly in Atlanta. Ideal candidates will have experience in distributed systems, Golang, and Kubernetes... 
    Senior
    Work at office

    Fullstory

    Atlanta, GA
    11 days ago
  • $189.59k

    Barclays is seeking a Senior ETL Developer to work in Whippany, NJ. The role involves developing and maintaining ETL applications for Market Surveillance, ensuring high-quality software solutions, and collaborating with international teams. Minimum salary of $189,592 and... 
    Senior
    Remote work

    Barclays

    New York, NY
    1 day ago
  • $189.59k - $208.55k

    Barclays Services Corp. is seeking a Senior ETL Developer in Whippany, NJ to develop and maintain ETL applications for the Market Surveillance Area. You will collaborate with global teams and engage in various development projects while ensuring high-quality software delivery... 
    Senior

    慨正橡扯

    Morristown, NJ
    3 days ago
  • United States Digital Space LLC is seeking a Senior Software Engineer to join their Data Pipeline team in San Francisco, California. This role involves tackling complex infrastructure challenges and ensuring the reliability of event ingestion and processing. The ideal... 
    Senior
    Flexible hours

    United States Digital Space LLC

    San Francisco, CA
    3 days ago
  • Nuance Labs based in Seattle is seeking a Member of Technical Staff — ML Infra (Data) to design and operate large-scale data pipelines. You will work on processing and curating multimodal training data, ensuring high-quality standards at scale. The ideal candidate has... 
    Senior

    Nuance Labs

    Seattle, WA
    1 day ago
  • $166k - $174.5k

    Edgewaterit located in Washington, DC, is seeking a Senior IT Big Data Developer to support data architecture and analytics initiatives. The ideal candidate will design and optimize data pipelines, ensuring high-performance data delivery for various analytical needs. Responsibilities... 
    Senior

    Edgewaterit

    Washington DC
    2 days ago
  • $122.13k - $183.2k

    Axcelis Technologies, Inc. is seeking a Senior Data Infrastructure & Machine Learning Engineer to design scalable data systems and pipelines for advanced analytics. This hybrid role emphasizes data pipeline engineering and Python-based processing, requiring strong database... 
    Senior

    Axcelis Technologies

    Beverly, MA
    1 day ago
  • $135.68k - $203.53k

    Blueface Ltd is seeking a Senior Software Engineer to enhance FreeWheel’s data ingestion systems. This role focuses on optimizing pipeline performance and scalability, leveraging strong Python skills, distributed systems knowledge, and AWS experience. The ideal candidate... 
    Senior

    Blueface Ltd

    Englewood, CO
    2 days ago
  •  ...The Data Pipeline Engineer owns the systems that manage all data moving through pocstock — from intake to processing to delivery. This role combines data engineering, AI tooling, and quality control to ensure data flows reliably and is structured correctly at every stage... 

    Pocstock, Inc.

    Newark, NJ
    4 days ago
  • $184k - $287.5k

    Senior Data Center Performance Engineer - Benchmarking and Optimization page is loaded## Senior Data Center Performance Engineer - Benchmarking and Optimizationlocations...  .... Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $174.72k - $295.68k

     ...forefront of innovation, integrating advanced AI and autonomous driving technologies into...  ...responsible for building the end-to-end data pipeline for autonomous driving, covering the...  ...or higher in Computer Science, Software Engineering, Artificial Intelligence, or related... 
    Senior
    Full time
    Overseas

    XPENG

    California
    27 days ago
  • $91.7k - $163.7k

     ...UnitedHealth Group is looking for a Senior Data Engineer in Eden Prairie, MN. This role involves the architecture and optimization of Oracle Exadata Database Machines. Candidates should have over 5 years of experience in Oracle database administration and performance... 
    Senior
    Remote work

    UnitedHealth Group

    Eden Prairie, MN
    5 days ago
  •  ...Pattern AI is seeking a Data Engineer in Hayward Park, California, to join our team. The role focuses on expanding and optimizing data pipelines crucial for our automated machine learning platform. Candidates should have a minimum of 4 years of experience and proficiency... 
    Senior

    Pattern AI

    San Mateo, CA
    5 days ago
  • $180k - $220k

     ...behalf of an early stage start-up who are leveraging AI to modernize the data engineering industry. They're looking for a Senior Data Engineer to join their team in San Francisco and work on PB scale data pipelines, scaling their cloud infrastructure, and build their intelligent... 
    Senior

    Evolve Group, Inc.

    San Francisco, CA
    5 days ago
  •  ...We are seeking a Senior Data Engineer with 10+ years of experience to support a growing Data & Analytics...  ...that enable advanced analytics and AI/ML use cases. Key Responsibilities: Design, build, and optimize data pipelines and workflows across modern data platforms... 
    Senior

    Gardner Resources Consulting

    Dallas, TX
    4 days ago
  •  ...Role We are seeking a hands‑on Cloud Native Data Engineer with experience building and supporting enterprise‑scale data platforms, data pipelines, and cloud‑based data solutions. This...  ...project experience Experience supporting AI/ML initiatives from a data engineering perspective... 
    Senior

    Motion Recruitment

    Charlotte, NC
    5 days ago
  • $85 - $91.5 per hour

     ...Get AI-powered advice on this job and more exclusive features. Direct message the job poster from Milestone Technologies, Inc. Seeking a Senior Data Engineer for the following role: Basic Qualifications 5+ years of relevant data engineering experience with specific experience... 
    Senior
    Contract work
    Work experience placement

    Milestone Technologies

    New York, NY
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Data Pipeline Engineer. Be the first to apply!