Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Data Engineer (Scala, Spark, & Gen AI)

DISQO

Job Description

Job Description

DISQO is a leading provider of advertising intelligence, measuring brand and performance outcomes across every media channel to power data-driven marketing decisions. Trusted by 500+ of the world’s largest brands and 150+ agency and media partners, and recognized by Inc., Deloitte, Ad Age, Digiday, Forbes, and Cynopsis, DISQO is redefining the power of measurement in advertising. 

Joining DISQO Nation means being part of a team that moves fast, thinks boldly, and is passionate about solving meaningful problems. Innovation isn't just something we talk about, it's how we operate. We challenge assumptions, embrace new ideas, and continuously push ourselves to build better solutions for our customers and each other.

Our values guide how we work and win together. We believe in winning as one team , fostering a culture of collaboration, trust, and shared accountability. We pursue outsized impact by focusing on opportunities that drive meaningful results for our customers and our business. We champion the customer by putting their needs at the center of every decision and delivering solutions that create real value. And we are relentlessly all in, bringing energy, passion, and commitment to every challenge, every day.

If you're energized by high-growth environments, motivated by innovation, and excited by the opportunity to do impactful work alongside talented, driven teammates, you'll feel right at home at DISQO.

This is a great opportunity to join a fun, highly motivated team and lead the development of intelligent data products that directly power how brands measure advertising effectiveness. At DISQO, we use modern cloud infrastructure, Generative AI, and expert-level data engineering to solve complex, real-world problems at scale.

We are looking for a visionary technical leader who is a master of distributed data processing (Scala/Spark) and passionate about the intersection of data engineering and Artificial Intelligence. You’ll serve as a force multiplier, working closely with engineering leadership, product managers, and analysts in a collaborative environment where rapid innovation and systemic impact matter.

We believe the best software is built by highly aligned, autonomous teams that take ownership and move quickly. We use agile development practices, modern tooling, and strong engineering discipline to deliver early and often. We care deeply about architectural excellence, data correctness, system reliability, and building intelligent systems the right way.

Position Description

As a Staff Data Engineer, you will set the technical direction for DISQO’s ad measurement platform. You will architect, build, and scale our most complex data pipelines while spearheading the integration of Generative AI capabilities directly into our core data infrastructure and products. You will tackle our hardest scalability challenges, utilizing expert-level Spark and Scala to process massive datasets, while leveraging LLMs to unlock new value from unstructured and structured data.

Operating with a high degree of autonomy, you will lead cross-functional technical initiatives, drive architectural decisions, and pioneer how we use AI to enrich data, automate pipelines, and improve data quality. You will mentor senior and mid-level engineers, raising the technical bar for the entire team while expanding DISQO's technical depth across big data systems, cloud infrastructure, and applied AI.

 

What you will do:
  • Architect and Lead: Design, build, and maintain highly scalable, fault-tolerant data pipelines using expert-level Scala and Apache Spark.

  • Gen AI Integration: Pioneer the use of Generative AI within our data ecosystem—incorporating LLMs to enrich datasets, extract value from unstructured data, automate metadata generation, and build intelligent data products.

  • Cross-Functional Strategy: Partner with Product and Engineering leadership to translate complex business requirements into forward-looking data and AI-augmented architectures.

  • Optimize Systems: Architect and aggressively optimize large-scale ETL/ELT workflows. Dive deep into Spark internals to resolve complex performance bottlenecks, memory issues, and data skew.

  • Modern AI Tooling: Implement and manage infrastructure to support AI integration, including vector databases, embeddings pipelines, and Retrieval-Augmented Generation (RAG) architectures.

  • Set the Standard: Write clean, highly optimized, and maintainable code, while establishing standards for code quality, testing, and system architecture across the organization.

  • Ensure Operational Excellence: Champion data quality, observability, and system health to consistently meet enterprise SLAs and customer commitments.

  • Mentorship: Actively mentor engineers, lead technical design reviews, and foster a culture of continuous learning and technical rigor.

What we're looking for:

  • 8+ years of experience building, architecting, and supporting complex production data pipelines, distributed systems, and backend infrastructure.

  • Expert-Level Scala & Spark: Deep, hands-on expertise in Scala and Apache Spark. You must understand Spark internals, query plans, memory management, and advanced performance tuning for massive-scale batch processing.

  • Applied Generative AI Experience: Proven experience integrating Gen AI / LLMs (e.g., OpenAI APIs, Anthropic, Bedrock) into data products or data engineering workflows. Hands on experience developing with AI dev tools such as Claude code, etc

  • Strong Python Skills: Proficiency in Python specifically to interface with modern AI ecosystems, data APIs, and orchestration tools.

  • Cloud Mastery: Extensive architectural experience within the AWS ecosystem (EMR, Glue, Athena, S3, Bedrock, etc.).

  • Core Data Foundations: Deep understanding of advanced ETL/ELT concepts, complex data modeling, and performance-tuning SQL.

  • Orchestration: Expert-level experience with workflow orchestration tools such as Airflow.

  • Leadership: Proven track record of leading technical initiatives, making architectural decisions, and mentoring teams in an agile, fast-moving environment.

Nice to have:

  • Experience with Snowflake or other modern cloud data warehouses.

  • Deep exposure to streaming or real-time event processing (Kafka, Flink, Kinesis, etc.).

  • Experience utilizing AI for automated data observability, anomaly detection, or data quality tooling.

  • Background in ad tech, measurement, attribution modeling, or specialized analytics platforms.

Why DISQO?

  • Lead the architecture of intelligent data products that directly influence how the world's top brands measure advertising impact.

  • Work with bleeding-edge data and Gen AI infrastructure at a highly meaningful scale.

  • Shape the technical culture and elevate a talented engineering organization while owning massive-scale production systems.

Your pay will be determined by your experience, work location, and other applicable factors.

#LI-MV1

 

At DISQO, we pride ourselves on having a positive, performance-oriented workplace that includes a flexible hybrid approach, competitive medical benefits, and an amazing vacation policy. Read more about our culture on Glassdoor.

 

You can learn more about what’s happening at DISQO by visiting the DISQO Company Blog.

 

Perks & Benefits:

 

·100% covered Medical/Dental/Vision for employee, competitive dependent coverage

·Stock options

·401K

·Generous PTO policy

·Team offsites, social events & happy hours

·Life Insurance

·Health FSA

·Catered lunch and fully stocked kitchen

·Paid Maternity/Paternity leave

·Disability Insurance

·Travel Assistance Program

·24/7 Counseling Services offered to Employees

 

Note: The benefits noted above are for full time US based employees only.

 

DISQO is an equal opportunity employer. Discovery, innovation, and growth are possible when we open ourselves to new possibilities, perspectives, and approaches. That’s why, at DISQO, we welcome, support, and empower individuals from diverse backgrounds. Exceptional teams are rooted in extraordinary people, each with a unique story and a compelling set of skills. DISQO does not discriminate against employees based on race, color, religion, sex, national origin, gender identity or expression, age, disability, pregnancy (including childbirth, breastfeeding, or related medical condition), genetic information, protected military or veteran status, sexual orientation, or any other characteristic protected by applicable federal, state or local laws.

 

*Recruiting firms that submit resumes to DISQO without first entering into a written contract will not be entitled to any compensation on candidates referred by that firm.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Vacancy posted 28 days ago
Similar jobs that could be interesting for youBased on the Staff Data Engineer (Scala, Spark, & Gen AI) in Los Angeles, CA vacancy
  •  ...expert insight and AI-driven intelligence...  ...development of intelligent data products that...  ...expert-level data engineering to solve complex,...  ...distributed data processing (Scala/Spark) and passionate...  ...Description As a Staff Data Engineer, you...  ...Apache Spark. Gen AI Integration:... 
    Suggested
    Full time
    Contract work
    Local area
    Flexible hours

    Disqo

    Los Angeles, CA
    16 hours ago
  • $150k - $170k

     ...Senior Data Engineer, Scala New York City, NY Boston, MA Los Angeles, CA Broomfield, CO Seattle, WA Hybrid Schedule...  ...: We are seeking someone with proficiency in Scala and Spark to build and optimize large-scale batch and streaming data... 
    Suggested
    Hourly pay
    Work experience placement
    Summer work
    Seasonal work
    Work at office
    Local area
    Remote work
    Shift work

    Magnite

    Los Angeles, CA
    4 days ago
  •  ...architect, build, and run the data backbone that powers...  ...streaming ETL/ELT with Apache Spark (PySpark/Scala), Databricks Workflows, and/or...  ...and production ML features. Engineer for production. Implement observability...  ...you agree to receive calls, AI-generated calls, text... 
    Suggested
    Contract work
    Local area
    Worldwide
    Flexible hours

    Jobot

    Los Angeles, CA
    8 hours ago
  • $125k - $159k

     ...Applying For: Edmunds is looking for a data engineer to help us manage the data explosion...  ...that power Edmunds' analytics, AI/ML initiatives, and business intelligence...  ...and optimizing data workflows using Spark, Databricks, SQL, Scala, and Python. * Process Engineering:... 
    Suggested
    Full time
    Immediate start
    Remote work
    Flexible hours

    Edmunds.com

    Santa Monica, CA
    4 days ago
  • $106.9k - $176.5k

     ...working world. Technology – Data and Decision Science – Data Engineering – Senior We are...  ...Databricks and experience with Spark for big data processing....  ...languages such as Python, Scala, or SQL. Experience...  ...markets. Enabled by data, AI and advanced technology,... 
    Suggested
    Summer holiday
    Flexible hours

    EY

    Los Angeles, CA
    5 days ago
  •  ...About the job Data Engineer Build the Data Backbone of a Healthcare AI Startup Join a stealth-mode healthcare venture as their first...  ...Bring Expert-level Python or Scala skills Hands-on experience with Spark, dbt, Airflow Proficiency in AWS (Glue... 
    Immediate start
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    AllyNd Partners

    Los Angeles, CA
    3 days ago
  • $170k

     ...Knowledge Graph team within Data Science & Engineering equips scientists, engineers...  ...driven decisions to equipping AI/ML scientists with signals...  ...languages (e.g. Python, Java, or Scala) with at least 4 years of...  ...(e.g. Hive, Presto, Spark, Iceberg, etc) on medium to... 
    Hourly pay
    Full time
    Immediate start
    Flexible hours

    Netflix

    Los Angeles, CA
    3 days ago
  • $52 - $75 per hour

     ...Our client is seeking a Big Data Engineer to join their team! This position...  ...experience using Java, Scala, Python, or similar programming...  ...experience with technologies such as Spark, Flink, SingleStore, Kafka,...  ..., you agree to receive calls, AI-generated calls, text messages... 
    Local area

    KellyMitchell Group

    Santa Monica, CA
    3 days ago
  •  ...looking for a Sr. Consultant, Data Engineer to join our growing team of experts...  ...Exadata, Netezza, SQL Server, Spark) ~ Experience building ETL /...  ...as well) ~ Python Scripting, Scala is required. ~ Ability to...  ...implement complex data and AI usecases on Snowflake platforms... 

    IBM Computing

    Los Angeles, CA
    1 day ago
  • $99k - $232k

     ...Specialty/Competency: Data, Analytics & AI Industry/Sector: Not Applicable...  ...in data and analytics engineering focus on leveraging advanced...  ...languages such as Python, Java, or Scala - Familiarity with big data technologies like Hadoop, Spark, or Kafka is a plus -... 
    Full time
    H1b

    PwC

    Los Angeles, CA
    10 days ago
  • $125.5k - $230.2k

     ...working world. Technology – Data and Decision Science – Data Engineering – Manager We are...  ...Databricks and experience with Spark for big data processing....  ...languages such as Python, Scala, or SQL. Excellent...  ...markets. Enabled by data, AI and advanced technology,... 
    Summer holiday
    Flexible hours

    EY

    Los Angeles, CA
    2 days ago
  • $177.8k - $240.5k

     ...Description GDSP is seeking a Data Engineer to own the data...  ...leverage generative AI and AWS services to raise...  ...such as Python, Java, Scala, or NodeJS ~ Experience...  ...such as: Hadoop, Hive, Spark, EMR Experience...  ...employees, supervisors, and staff; adhere to standards of... 
    Local area
    Flexible hours

    Amazon

    Santa Monica, CA
    5 days ago
  • $164k - $213k

     ...Principal Machine Learning Data Scientist, Gen AI Los Angeles, CA Xometry powers the industries of today and tomorrow by connecting...  ...deployment. Collaborate with cross-functional teams, including engineering and business teams, to align generative AI solutions with... 

    Xometry

    Los Angeles, CA
    4 days ago
  •  ...critical asset support for hospitals, data centers, remote sites, and military...  ...development leverages modern software engineering to rapidly deliver safe, factory-built...  ..., executive analytics, and AI capabilities. As a Staff Data Platform Engineer, you'll be hands... 
    Full time
    Summer work
    Immediate start
    Remote work
    Flexible hours
    Weekend work

    Radiant

    El Segundo, CA
    1 day ago
  • $199.1k - $223.4k

     ...with clients developing Big Data strategy and roadmaps, while...  ...using modern toolsets such as Spark on Scala, Storm, Flume, Sqoop Design...  ...Architect and implement scalable AI/ML and Generative AI...  ...Language Models (LLMs), prompt engineering, embeddings, vector databases... 
    Full time
    Local area
    Immediate start
    Flexible hours

    West Monroe

    Los Angeles, CA
    4 days ago
  •  ...Lead Data Scientist & Gen AI Working at Citi is far more than just a job. A career with us means joining a team of more than 230,000 dedicated people from around the globe. At Citi, you'll have the opportunity to grow your career, give back to your community and make... 
    Work experience placement
    Work at office

    Citi

    Bell Gardens, CA
    8 hours ago
  • $229k - $343k

     ..., Spectacles. Snap Engineering teams build fun and...  ...We’re looking for a Staff Machine Learning...  ...product managers, data scientists, and engineers...  ..., generative AI, LLM-based ranking,...  ...Python, C++, Java, Scala, or similar languages...  ...infrastructure, such as Spark, Flink, Beam,... 
    Full time
    Work experience placement
    Live in
    Work at office
    Local area

    Snap Inc.

    Santa Monica, CA
    1 day ago
  •  ...Job Description Samba is an AI-powered media intelligence company...  ...web pages, combining that data with third-party signals...  ...We are seeking a skilled Data Engineer to strengthen our data platform...  ...transformations using Apache Spark (PySpark/Scala) Build procedures and guidelines... 

    SAMBA

    Los Angeles, CA
    15 hours ago
  •  ...Data Pipeline Engineer ByteDance is a technology company operating a range of content platforms that...  ...advantage of cutting-edge big data and AI technologies, we are aiming at building...  ...such as Kafka Streaming, Flink, Spark. Familiar with big data open-source... 

    Adapt Talent

    Los Angeles, CA
    1 day ago
  •  ...unique taste. Position Overview: The Data Engineer will be instrumental in constructing and...  .... Proficient in Python, Java, Scala, or similar programming languages. Skilled...  ...with big data tools (e.g., Hadoop, Spark) and real-time data processing technologies... 

    Identified Talent Solutions

    Los Angeles, CA
    2 days ago
  • $165k - $190k

     ...Senior Data Engineer Playa Vista, CA or Remote Thrive Market was founded in 2014 with a mission...  ...Experience using Claude or similar AI agents for development is a plus. Experience...  ...data ingestion, machine-learning, Apache Spark a plus Adept in the ability to elicit,... 
    Summer work
    Work at office
    Remote work
    Flexible hours

    Thrive Market

    Playa Vista, CA
    17 days ago
  • $140k - $160k

     ...the cultural fabric, igniting passions, sparking conversations, and connecting people to...  ...ever-evolving world. Job Description The Data Engineering team is seeking a Senior Data Engineer to...  ..., audience metrics) Comfortable using AI-assisted development tools (e.g., ChatGPT... 
    Work at office
    Local area
    3 days per week

    Versant Media Inc

    Los Angeles, CA
    1 day ago
  •  ...Sr Data Engineer Glendale, California, United States About the Job Sr Data Engineer...  ...Work with technologies including Airflow, Spark, Databricks, Delta Lake, and Snowflake....  ...programming language (Python, Java, or Scala). ~ Hands-on experience with distributed... 

    Pipe Recruit

    Glendale, CA
    4 days ago
  •  ...Job Title: Sr Data Engineer Location: Glendale, CA Onsite (91201) Job Duration: Contract...  ...developing large-scale data pipelines Spark, Airflow, Databricks or Snowflake, SQL,...  ...programming language (e.g. Python, Java, Scala) Hands-on production environment experience... 
    Contract work

    Trispoke Managed Services Pvt Ltd

    Glendale, CA
    1 day ago
  •  ...developers, Python/Java developers, data analysts/data scientists, data engineers, machine learning engineers for full...  ...experience. For data science/data analyst/AI/machine learning positions preferred...  ...aptitude knowledge of statistics, gen AI, LLM, sagemaker, python, computer... 
    Full time

    SynergisticIT

    Los Angeles, CA
    1 day ago
  • $130k - $170k

     ...About The Role We are seeking a Senior Data Engineer to help strengthen Arena Club's strategic...  ...pipelines built on AWS Glue (Python Shell & Spark ETL) • Manage Redshift cluster...  ...and alerting • Hands-on experience using AI tools (Claude or Cursor preferred; other... 

    Arena Club

    Los Angeles, CA
    15 hours ago
  •  ...Data Engineer Portland, Los Angeles, Las Vegas, Denver, Vancouver, BC What You Will Do:...  ...At least 4-5 years of experience in Scala/Java or Python programming AWS data products...  ...chart ~4+ years of experience with Spark, Scala and/or Akka ~4+ years of experience... 

    1872 Consulting

    Los Angeles, CA
    3 days ago
  • $100k

     ...2023 Synergisticit at Gartner Data & Analytics summit Why do Tech...  ...Statistics, Computer Science or Engineering or candidates with gaps in...  ...For data Science/Data Analyst/AI/Machine learning Positions Preferred...  ...Knowledge of Statistics, Gen AI, LLM, Sagemaker, Python, Computer... 
    H1b

    SynergisticIT

    Los Angeles, CA
    2 days ago
  • $138.9k - $186.2k

     ...Senior Data Engineer Disney Entertainment and ESPN Product & Technology is a global organization...  ...video through the power of data and AI. We design and build innovative solutions...  ...streaming data pipelines (e.g., Kafka, Flink, Spark, Kinesis). ~ Experience with embedding... 

    The Walt Disney Studios

    Glendale, CA
    1 day ago
  •  ...Data Analyst With Amazon Web Services (AWS) Exp. Location: LA...  ...Requirements: Professional engineer with experience in production...  ...and their eco-systems (Spark, EMR, Hadoop, Hive). Strong...  ...several development languages (Python, Java, Scala, NodeJs).... 
    Contract work

    ClifyX

    Los Angeles, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Data Engineer (Scala, Spark, & Gen AI). Be the first to apply!