Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior AI Data Infrastructure Engineer

$124.09k - $210k

XPENG

Senior AI Data Infrastructure Engineer

Santa Clara, CA

XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and landing (eVTOL) aircraft, and robotics. With a strong focus on intelligent mobility, XPENG is dedicated to reshaping the future of transportation through cutting-edge R&D in AI, machine learning, and smart connectivity. As a core member of our AI Infrastructure team, you will work at the intersection of Autonomous Driving and Foundation Models. We don't just process EB-scale perception data from tens of thousands of production vehicles; we are building the high-performance Data Engine that powers our next-generation AI. Your work will directly determine how our self-driving systems "learn" from massive datasets and define the cognitive ceiling of multi-modal models in the physical world.

Key Responsibilities
  • Scalable Data Pipelines: Architect and build scalable, end-to-end pipelines to automate the ingestion, cleaning, and processing of PB-scale raw data for both production autonomy and multi-modal LLMs.
  • Modern Lakehouse Architecture: Evolve our data storage solutions based on Apache Iceberg and Lance to implement efficient semantic indexing, metadata management, and data versioning.
  • Training Throughput Optimization: Deeply optimize data loading and pre-fetching strategies to ensure maximum throughput for large-scale training on 10,000+ GPU clusters.
  • Infrastructure Evolution: Support the seamless transition of foundation model data into actionable training sets, bridging the gap between raw vehicle logs and model-ready tokens.
Basic Qualifications
  • Engineering Excellence: BS/MS/PhD in Computer Science or a related field, with a proven track record of building large-scale distributed systems.
  • Work Experience: 3-5 years of industry experience.
  • Programming Mastery: Proficient in Python, C++, or Java, with a deep understanding of high-performance concurrent programming and systems design.
  • Distributed Frameworks: Hands-on experience with at least one distributed processing framework, such as Ray and Spark.
  • Lakehouse Expertise: Familiarity with Data Lakehouse concepts and practical experience with technologies like Iceberg and Lance.
Preferred Qualifications
  • Experience building data warehouses for Trillion-token datasets or PB-scale multi-modal data.
  • Deep understanding of data access patterns in deep learning frameworks like PyTorch, DeepSpeed, or Megatron.
  • Practical experience with Vector Databases, automated labeling toolchains, or data-centric AI workflows.
  • Knowledge of storage formats optimized for AI (e.g., Parquet, Lance) and high-performance file systems.

The base salary range for this full-time position is $124,091-$210,000, in addition to bonus, equity and benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. We are an Equal Opportunity Employer. It is our policy to provide equal employment opportunities to all qualified persons without regard to race, age, color, sex, sexual orientation, religion, national origin, disability, veteran status or marital status or any other prescribed category set forth in federal or state regulations.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior AI Data Infrastructure Engineer in Santa Clara, CA vacancy
  • A leading technology company in California seeks a Senior Software Engineer to develop innovative video analytics solutions using advanced AI technologies. The role demands strong experience in C++ and Python, along with a deep understanding of machine learning systems... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $179.06k - $198.95k

     ...location. Cohesity is a leader in AI‑powered data security and management. Aided by an...  ...are looking for a highly motivated Senior Performance Engineer to join our Data Protection and...  ...automation frameworks, tools, and infrastructure to improve performance testing coverage... 
    Senior
    Full time

    Cohesity

    Santa Clara, CA
    4 days ago
  •  ...Senior GCP Data Engineer Location: Sunnyvale CA Duration: 4 months JOB DESCRIPTION: Skills Required: ~10+ years of hands-on...  ...call support. Learn our business domain and technology infrastructure quickly and share your knowledge freely and actively with... 
    Senior

    JConnect Infotech

    Sunnyvale, CA
    4 days ago
  •  ...Kai is the AI company rebuilding cybersecurity for the machine...  ...team: Our Heads of AI, Engineering, and Product bring extensive...  ...Role We are looking for a Senior Data Engineer (AI Platform) to...  ...engineering and modern AI data infrastructure - including large-scale data... 
    Senior

    Kai Cyber, Inc.

    San Jose, CA
    4 days ago
  • $94.43k - $202.75k

    KPMG Careers is looking for a Senior Associate, Data Engineer, to join their Consulting practice in Santa Clara, California. The role involves technical design and leading implementation of data solutions using Databricks. Candidates should have at least three years of... 
    Senior

    KPMG Careers

    Santa Clara, CA
    1 day ago
  • $130.75k - $181k

     ...About the Role/Team Join Eightfold’s core Data Platform team to design, develop, and...  ...scalability, and load balancing. Performance Engineering: Drive the scaling, optimization, and...  ...products and strong teams. Eightfold.ai provides equal employment opportunities (... 
    Senior
    Permanent employment
    Work at office
    3 days per week

    Eightfold LLC

    Santa Clara, CA
    3 days ago
  • ExpertHiring is looking for an experienced data engineering leader based in Menlo Park, California, to guide a team in developing data solutions...  ..., architecting data warehouses, and implementing innovative AI technologies. Candidates should have extensive experience in... 
    Senior
    Full time

    ExpertHiring

    Menlo Park, CA
    21 hours ago
  • A technology firm is seeking a skilled Back-End Engineer with a strong focus on data engineering to design and maintain data pipelines and back-end systems. The successful candidate will work with technologies like Spark, Python, and AWS. Responsibilities include writing... 
    Senior

    Robotics Technologies LLC

    Sunnyvale, CA
    4 days ago
  • A data analytics technology firm is seeking a skilled Back-End Engineer to design and maintain data pipelines. Key responsibilities include developing robust ETL processes using Spark, Python, and SQL while leveraging cloud technologies like AWS and GCP. Ideal candidates... 
    Senior

    Cloud Analytics Technologies, LLC

    Sunnyvale, CA
    21 hours ago
  • A technology company is seeking an experienced backend engineer to join their Data Layer and Marketing AI platform. You will focus on building distributed systems that manage high-volume batch and real-time data across global customers. The ideal candidate has over 5 years... 
    Senior

    Uniphore Technologies Inc.

    Palo Alto, CA
    3 days ago
  • $228.4k - $303.55k

    Databricks is seeking a Sr. Staff Software Engineer to work on their Data Platform team. This role involves designing the Data Intelligence Platform...  ...operations at scale. You will lead initiatives that leverage AI to enhance the performance of large-scale distributed... 
    Senior

    I did my part and supported the Regular Toilet

    Mountain View, CA
    21 hours ago
  • $45 - $48 per hour

     ...Only  Duration: 12+ Months  Location: Sunnyvale, CA Pay Range: $45 - $48 per hour on W2 Job Summary: We are seeking a Senior Data Engineer & Platform Architect to design, develop, and oversee a contemporary data platform aimed at enhancing engineering and... 
    Senior
    Hourly pay
    Contract work

    Akraya

    Sunnyvale, CA
    21 hours ago
  • A leading technology company is seeking a web developer to lead web development, gather requirements, and innovate solutions using modern technologies. The successful candidate will have extensive experience in Java and web application integration. This full-time role offers...
    Senior
    Full time

    Google Inc.

    Sunnyvale, CA
    1 day ago
  • BBG Ventures, LLC is seeking a Senior Data Engineer for their Sunnyvale, CA location. This hybrid position involves designing and operating a modern data platform to support analytics. Responsibilities include developing data pipelines, building interactive visualizations... 
    Senior
    For contractors

    BBG Ventures, LLC

    Sunnyvale, CA
    3 days ago
  • $200k - $220k

     ...intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we...  ...across energy, manufacturing, data center construction, and cloud...  ...This Role: Join Crusoe Energy as a Senior Data Engineer, an early and pivotal hire on our growing... 
    Senior
    Full time
    Temporary work
    Work at office
    Remote work

    Crusoe

    Sunnyvale, CA
    4 days ago
  • $147k - $178k

    Epoch Biodesign is seeking a Data Integration Engineer in Sunnyvale, CA to design and maintain scalable data pipelines. You will play a crucial role in data-driven decision-making and integrating systems across various platforms. The ideal candidate should possess 5+ years... 
    Senior

    Epoch Biodesign

    Sunnyvale, CA
    21 hours ago
  • A technology company based in Sunnyvale, California, seeks a skilled Back-End Engineer to design and maintain scalable data pipelines. The ideal candidate will have over 5 years of experience in back-end development and proficiency in Spark, Python, Scala, and Java. The... 
    Senior

    Quantum Technologies. LLC

    Sunnyvale, CA
    2 days ago
  • $173.5k - $331.05k

     ...SDKs and platform libraries that power data-driven insights and AI-enabled experiences across Creative Cloud. We are seeking a software engineer with strong development and computer science...  ...be fully defined, with guidance from senior engineers and architects. ~- Strong... 
    Senior
    Temporary work
    Local area
    Worldwide

    Adobe

    San Jose, CA
    2 days ago
  • $173.5k - $331.05k

     ...SDKs and platform libraries that power data-driven insights and AI-enabled experiences across Creative Cloud. We are seeking a software engineer with strong development and computer science...  ...be fully defined, with guidance from senior engineers and architects. ~- Strong... 
    Senior
    Temporary work
    Local area
    Relocation

    Adobe

    San Jose, CA
    2 days ago
  • $248k - $349k

    A leading technology company is looking for a software engineer to take part in developing projects that impact billions of users. You'll work extensively on cloud technologies, leading a team while ensuring high standards in design and architecture. The ideal candidate... 
    Senior

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • A technology company specializing in robotic products seeks a Senior Data Engineer to support their analytics needs in Sunnyvale, CA. This hybrid position involves architecting data pipelines, developing data-driven front-end applications, and collaborating closely with... 
    Senior
    Contract work
    For contractors

    The Mom Project

    Sunnyvale, CA
    4 days ago
  •  ..., Oklahoma, Pennsylvania, South Carolina, and Tennessee. Senior Data Engineer - Data Platform & Analytics (Snowflake, Kafka, React) on a...  ...control, and testing practices Implement CI/CD pipelines, infrastructure‑as‑code (e.g., Terraform), and observability frameworks... 
    Senior
    Contract work
    For contractors
    Remote work
    Flexible hours
    1 day per week

    The Mom Project

    Sunnyvale, CA
    4 days ago
  • Kindredventures is looking for a Senior Perception ML Data Infrastructure Engineer based in Mountain View, California. This role involves taking ownership of the perception data platform, managing complex sensor data, and establishing systems that optimize data quality... 
    Senior

    Kindredventures

    Mountain View, CA
    21 hours ago
  • $147.4k - $272.1k

     ...Machine Learning Infrastructure And Data Engineer Join us as an ML Data and Infrastructure Engineer and become the architect behind the data infrastructure that power tomorrow's breakthrough AI/ML innovations. You'll be the critical link between ambitious algorithmic... 
    Relocation

    Apple

    Sunnyvale, CA
    2 days ago
  •  ...accomplish. We are seeking an exceptional engineer with a background in large scale software...  ...using cutting-edge technologies to build data intensive systems, designing and optimizing...  ...big data platforms ~ Familiarity with ML/AI workflows and feature engineering to support... 
    Senior

    Apple

    Sunnyvale, CA
    4 days ago
  •  ...Role Summary We are seeking a highly skilled Senior Data Engineer - Full Stack to build and maintain internal tools, automation frameworks...  ...Streamlit, or similar frameworks Familiarity with Databricks AI/BI or other data visualization tools Exposure to data... 
    Senior

    Codvo.ai

    Santa Clara, CA
    21 hours ago
  •  ...Core Responsibilities 1. Data Architecture & Pipeline Design - Identify good data set and data candidates for building AI based recommendation engine - Design denormalized data structures optimized for LLM consumption - Aggregate data from multiple CRM modules... 
    Senior

    Diverse Lynx

    Sunnyvale, CA
    21 hours ago
  •  ...Huawei Research America The AI Platform team at Huawei...  ...applied research in Intelligent data analytics, Hardware/Co-processor...  ...We are leading researchers and engineers from worldwide to create adaptive...  ...papers. We are recruiting senior data scientist/engineers with... 
    Senior
    For contractors
    Worldwide

    Netpace

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...re tapping into the unlimited potential of AI to define the next era of computing. An...  ...NVIDIA has a rapidly expanding ecosystem of data center platform designs. From single node...  .... We are searching for a highly motivated engineer to lead performance benchmarking and optimization... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  •  ...in Mountain View, California is looking for an experienced Data Engineer to design large-scale data pipelines and advanced data systems...  ...in Python, and experience working with large-scale data infrastructure. This role offers a competitive salary range and a robust benefits... 
    Senior

    I did my part and supported the Regular Toilet

    Mountain View, CA
    21 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Data Infrastructure Engineer. Be the first to apply!