Senior AI Data Pipeline Engineer
42dot
We Are Looking For The Best
At 42dot, our AI Data Pipeline Engineers architect and scale global data pipelines that ingest and process data from worldwide sources. You will design and operate high-throughput systems that reliably deliver petabyte-scale data to our large-scale GPU infrastructure, powering mission-critical AI workloads.
Responsibilities
- Design and build high-performance, scalable data pipelines to support diverse AI and Machine Learning initiatives across the organization.
- Architect and implement multi-region data infrastructure to ensure global data availability and seamless synchronization.
- Develop flexible pipeline architectures that allow for complex branching and logic isolation to support multiple concurrent AI projects.
- Optimize large-scale data processing workloads using Databricks and Spark to maximize throughput and minimize processing costs.
- Maintain and evolve the containerized data environment on Kubernetes, ensuring robust and reliable execution of data workloads.
- Collaborate with AI researchers and platform teams to streamline the flow of high-quality data into training and evaluation pipelines.
Qualifications
- Extensive professional experience in building and operating production-grade data pipelines for massive-scale AI/ML datasets.
- Strong proficiency in distributed processing frameworks, particularly Apache Spark and the Databricks ecosystem.
- Deep hands-on experience with workflow orchestration tools like Apache Airflow for managing complex dependency graphs.
- Solid understanding of Kubernetes and containerization for deploying and scaling data processing components.
- Proficiency in distributed messaging systems such as Apache Kafka for high-throughput data ingestion and event-driven architectures.
- Expert-level programming skills in Python for system-level optimizations.
- Strong knowledge of cloud-native services and best practices for building secure and scalable data infrastructure.
- Logical approach to problem-solving with the persistence to identify and resolve root causes in complex, large-scale systems.
- Strong communication skills to effectively collaborate with cross-functional teams and external partners.
Preferred Qualifications
- Experience in architecting global, multi-region data pipelines and solving challenges related to cross-border data transfer and latency.
- Practical experience or a strong interest in implementing distributed computing frameworks like Ray for AI workloads.
- Experience in building real-time or near-real-time pipelines using Spark Streaming or Flink.
- Familiarity with Infrastructure as Code (IaC) tools such as Terraform to manage complex data environments.
- Understanding of the end-to-end ML lifecycle (MLOps) and how data infrastructure supports model experimentation and deployment.
Interview Process
- Resume Screening > Coding Test > Virtual Interview (approximately 1 hour) > Onsite or Virtual Interview (approximately 3 hours) > Final Offer
- Please note that the interview process may vary depending on the position and is subject to change based on scheduling and other circumstances.
- Interview schedules and results will be communicated individually via the email address provided in your application.
Additional Information
- Please upload all required documents in PDF format.
- Veterans and applicants eligible for employment protection will receive preferential consideration in accordance with applicable laws and regulations.
- In compliance with the Act on Employment Promotion and Vocational Rehabilitation for Persons with Disabilities, registered individuals with disabilities will receive preferential consideration.
- 42dot does not accept unsolicited resumes from search firms. We will not pay any fees for resumes submitted without prior agreement.
- A 3-month probationary period may apply.


