Senior AI Data Pipeline Engineer
42dot
We Are Looking For The Best
42dot AI . (PB) GPU, AI.
At 42dot, our AI Data Pipeline Engineer architect and scale global data pipelines that ingest and process data from worldwide sources. You will design and operate high-throughput systems to reliably deliver petabyte-scale data to our large-scale GPU infrastructure, powering mission-critical AI workloads.
Responsibilities
- Design and build high-performance, scalable data pipelines to support diverse AI and Machine Learning initiatives across the organization.
- Architect and implement multi-region data infrastructure to ensure global data availability and seamless synchronization.
- Develop flexible pipeline architectures that allow for complex branching and logic isolation to support multiple concurrent AI projects.
- Optimize large-scale data processing workloads using Databricks and Spark to maximize throughput and minimize processing costs.
- Maintain and evolve the containerized data environment on Kubernetes, ensuring robust and reliable execution of data workloads.
- Collaborate with AI researchers and platform teams to streamline the flow of high-quality data into training and evaluation pipelines.
Qualifications
- AI/ML
- Apache Spark Databricks
- Apache Airflow
- Kubernetes
- Apache Kafka
- Python
- Best Practices
- Extensive professional experience in building and operating production-grade data pipelines for massive-scale AI/ML datasets.
- Strong proficiency in distributed processing frameworks, particularly Apache Spark and the Databricks ecosystem.
- Deep hands-on experience with workflow orchestration tools like Apache Airflow for managing complex dependency graphs.
- Solid understanding of Kubernetes and containerization for deploying and scaling data processing components.
- Proficiency in distributed messaging systems such as Apache Kafka for high-throughput data ingestion and event-driven architectures.
- Expert-level programming skills in Python for system-level optimizations.
- Strong knowledge of cloud-native services and best practices for building secure and scalable data infrastructure.
- Logical approach to problem-solving with the persistence to identify and resolve root causes in complex, large-scale systems.
- Strong communication skills to effectively collaborate with cross-functional teams and external partners.
Preferred Qualifications
- / (Latency)
- Ray AI
- Spark Streaming Flink /(Near Real-time)
- Terraform Infrastructure as Code(IaC)
- ML (MLOps)
- Experience in architecting global, multi-region data pipelines and solving challenges related to cross-border data transfer and latency.
- Practical experience or a strong interest in implementing distributed computing frameworks like Ray for AI workloads.
- Experience in building real-time or near-real-time pipelines using Spark Streaming or Flink.
- Familiarity with Infrastructure as Code (IaC) tools such as Terraform to manage complex data environments.
- Understanding of the end-to-end ML lifecycle (MLOps) and how data infrastructure supports model experimentation and deployment.
Interview Process
- Resume Screening - Coding Test - Virtual Interview (approximately 1 hour) - Onsite or Virtual Interview (approximately 3 hours) - Final Offer
- Please note that the interview process may vary depending on the position and is subject to change based on scheduling and other circumstances.
- Interview schedules and results will be communicated individually via the email address provided in your application.
Additional Information
- Please upload all required documents in PDF format.
- Veterans and applicants eligible for employment protection will receive preferential consideration in accordance with applicable laws and regulations.
- In compliance with the Act on Employment Promotion and Vocational Rehabilitation for Persons with Disabilities, registered individuals with disabilities will receive preferential consideration.
- 42dot does not accept unsolicited resumes from search firms. We will not pay any fees for resumes submitted without prior agreement.
- A 3-month probationary period may apply.
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior AI Data Pipeline Engineer in United States vacancy
$125k - $180k
CrowdStrike Holdings, Inc. is looking for a Sr. Data Pipeline Engineer to join their remote team. The role involves building and optimizing data pipelines, ensuring reliability, and collaborating with global engineering teams. The ideal candidate will have 12+ years of...SeniorRemote job- Role, Inc. is seeking a Sr. Data Pipeline Engineer in the United States to join a global team responsible for building and optimizing data ingestion pipelines. This role involves ensuring the scalability and reliability of the enterprise data platform. The ideal candidate...Senior
$125k - $180k
CrowdStrike, Inc. is looking for an experienced Sr. Data Pipeline Engineer to join their global team of data engineers. The role involves building... ...pipelines to integrate diverse data sources and drive AI-driven insights. The ideal candidate will possess over 12 years...Senior$133k - $254k
...We are looking for the best About Us 42dot is a mobility AI company committed to solving mobility challenges with software... ...self‑managing urban transportation operating system. Our AI Data Pipeline Engineers build up the core data processing pipelines and datasets readiness...SeniorWork experience placement- Fullstory is hiring a Senior Data Engineer for a hybrid role based in Atlanta. You will design and optimize data pipelines processing billions of records daily, collaborating with teams to unlock insights through advanced data systems. Your responsibilities include maintaining...Senior
$125k - $180k
CrowdStrike Holdings, Inc. is looking for an experienced Sr. Data Pipeline Engineer to join their global team. This role will focus on building and... ...processing pipelines essential for enabling analytics and AI-driven insights. Ideal candidates will have 12+ years of experience...Senior$160k - $170k
...Fullstory's rich digital experience data directly into customers'... ...their warehouses, in their AI workflows, and in the tools... ...teams already use. As our Senior Data Engineer, you will report to the Senior... ...will design and optimize pipelines that process 30 billion+ records...SeniorWork at officeFlexible hours1 day per week- midpage AI Inc. is hiring a senior software engineer to develop the largest case law dataset, impacting our lawyer-... ...platform. The role involves building pipelines, ensuring the reliability of web... ...technical generalist comfortable with data pipelines. Our competitive cash,...SeniorVisa sponsorship
- BankUnited is seeking a Pipeline & Integration Engineer to develop, maintain, and support ETL pipelines, primarily using Informatica. This role ensures reliable data pipelines and collaborates with senior engineers on data quality. Candidates should have at least 4 years...Senior
- Health Care Service Corp. is seeking a highly skilled Cribl Engineer to design, implement, and optimize data pipelines for observability and security platforms. This role will manage Cribl Stream/Edge deployments and enable efficient log routing and transformation. The...Senior
- Motion Recruitment Partners LLC is seeking a Data Pipeline Engineer for a long-term contract opportunity in Charlotte, NC or Irving, TX. You will work at a well-known Financial Services Company, developing and optimizing ETL/ELT workflows and data pipelines using cloud...SeniorLong term contract
$183k - $276k
Amplitude is seeking a Senior Software Engineer to join their Data Pipeline team in San Francisco. You will tackle complex infrastructure challenges and collaborate with product teams to shape their roadmap. Ideal candidates have at least 5 years of Software Engineering...SeniorFlexible hours- A leading zero trust security company in Palo Alto, CA seeks a Principal Software Engineer specializing in data pipelines. This role involves collaboration with engineers to build internal data systems, design traffic analysis pipelines, and mentor team members. Ideal candidates...Senior
- A leading behavioral data platform is seeking a Senior Software Engineer to enhance their ingestion pipeline. This hybrid position requires one day of in-office attendance weekly in Atlanta. Ideal candidates will have experience in distributed systems, Golang, and Kubernetes...SeniorWork at office
$189.59k
Barclays is seeking a Senior ETL Developer to work in Whippany, NJ. The role involves developing and maintaining ETL applications for Market Surveillance, ensuring high-quality software solutions, and collaborating with international teams. Minimum salary of $189,592 and...SeniorRemote work$189.59k - $208.55k
Barclays Services Corp. is seeking a Senior ETL Developer in Whippany, NJ to develop and maintain ETL applications for the Market Surveillance Area. You will collaborate with global teams and engage in various development projects while ensuring high-quality software delivery...Senior- United States Digital Space LLC is seeking a Senior Software Engineer to join their Data Pipeline team in San Francisco, California. This role involves tackling complex infrastructure challenges and ensuring the reliability of event ingestion and processing. The ideal...SeniorFlexible hours
- Nuance Labs based in Seattle is seeking a Member of Technical Staff — ML Infra (Data) to design and operate large-scale data pipelines. You will work on processing and curating multimodal training data, ensuring high-quality standards at scale. The ideal candidate has...Senior
$166k - $174.5k
Edgewaterit located in Washington, DC, is seeking a Senior IT Big Data Developer to support data architecture and analytics initiatives. The ideal candidate will design and optimize data pipelines, ensuring high-performance data delivery for various analytical needs. Responsibilities...Senior$122.13k - $183.2k
Axcelis Technologies, Inc. is seeking a Senior Data Infrastructure & Machine Learning Engineer to design scalable data systems and pipelines for advanced analytics. This hybrid role emphasizes data pipeline engineering and Python-based processing, requiring strong database...Senior$135.68k - $203.53k
Blueface Ltd is seeking a Senior Software Engineer to enhance FreeWheel’s data ingestion systems. This role focuses on optimizing pipeline performance and scalability, leveraging strong Python skills, distributed systems knowledge, and AWS experience. The ideal candidate...Senior- ...The Data Pipeline Engineer owns the systems that manage all data moving through pocstock — from intake to processing to delivery. This role combines data engineering, AI tooling, and quality control to ensure data flows reliably and is structured correctly at every stage...
$184k - $287.5k
Senior Data Center Performance Engineer - Benchmarking and Optimization page is loaded## Senior Data Center Performance Engineer - Benchmarking and Optimizationlocations... .... Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our...Senior$174.72k - $295.68k
...forefront of innovation, integrating advanced AI and autonomous driving technologies into... ...responsible for building the end-to-end data pipeline for autonomous driving, covering the... ...or higher in Computer Science, Software Engineering, Artificial Intelligence, or related...SeniorFull timeOverseas$91.7k - $163.7k
...UnitedHealth Group is looking for a Senior Data Engineer in Eden Prairie, MN. This role involves the architecture and optimization of Oracle Exadata Database Machines. Candidates should have over 5 years of experience in Oracle database administration and performance...SeniorRemote work- ...Pattern AI is seeking a Data Engineer in Hayward Park, California, to join our team. The role focuses on expanding and optimizing data pipelines crucial for our automated machine learning platform. Candidates should have a minimum of 4 years of experience and proficiency...Senior
$180k - $220k
...behalf of an early stage start-up who are leveraging AI to modernize the data engineering industry. They're looking for a Senior Data Engineer to join their team in San Francisco and work on PB scale data pipelines, scaling their cloud infrastructure, and build their intelligent...Senior- ...We are seeking a Senior Data Engineer with 10+ years of experience to support a growing Data & Analytics... ...that enable advanced analytics and AI/ML use cases. Key Responsibilities: Design, build, and optimize data pipelines and workflows across modern data platforms...Senior
- ...Role We are seeking a hands‑on Cloud Native Data Engineer with experience building and supporting enterprise‑scale data platforms, data pipelines, and cloud‑based data solutions. This... ...project experience Experience supporting AI/ML initiatives from a data engineering perspective...Senior
$85 - $91.5 per hour
...Get AI-powered advice on this job and more exclusive features. Direct message the job poster from Milestone Technologies, Inc. Seeking a Senior Data Engineer for the following role: Basic Qualifications 5+ years of relevant data engineering experience with specific experience...SeniorContract workWork experience placement
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI Data Pipeline Engineer. Be the first to apply!
Related searches
- ai engineer United States
- machine learning ai engineer United States
- ai research engineer United States
- ai ml engineer United States
- senior ai engineer United States
- ai prompt engineer United States
- ai developer United States
- ai engineer remote United States
- vp data engineering United States
- senior cloud data engineer United States


