Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Engineer, Data Acquisition (Web Crawling & Pipelines)

$120k - $135k

Pamper Yourself With Karuna

Why Vagaro? At Vagaro, we believe in fostering a collaborative and inclusive work environment where every team member can thrive. Our culture is built on innovation, continuous learning, and a passion for making a positive impact. We support our employees' growth and vision for themselves, offering opportunities for professional development and career advancement. Join us and be part of a team that values creativity, teamwork, and a commitment to excellence. Plus, we know how to have fun while getting the job done! About the Role We are seeking a Data Acquisition Engineer to design, build, and scale systems that collect, process, and maintain high-quality external data from the web and third-party sources. This role combines hands‑on web crawling, scraping, and data engineering with strong data quality and business alignment for sales, marketing, and product teams. This role is based onsite in Pleasanton, CA Monday through Friday.

SPONSORSHIP NOT AVAILABLE FOR THIS POSITION

Compensation Base Annual Salary: $120,000 - $135,000 Annual Bonus: Up to 10% Key Responsibilities Design and build scalable web crawlers and scraping systems to extract structured and unstructured data Collect data from websites, APIs, and databases to support sales and marketing initiatives Handle dynamic content, pagination, and JavaScript‑heavy sites Perform data cleaning, preprocessing, and validation to ensure accuracy and usability Deliver clean, structured datasets aligned to business requirements Collaborate with sales, marketing, and product teams to align data efforts with business goals Optimize pipelines for performance, scalability, and reliability Ensure ethical, compliant data collection practices Data Engineering & Infrastructure Build ETL/ELT pipelines for ingestion, transformation, and storage Implement schema design, deduplication, and change detection Manage orchestration tools (Airflow, Prefect, etc.) Work with cloud platforms (AWS, GCP, Azure) and databases Core Requirements 3+ years of experience in web crawling, scraping, or data engineering Strong programming skills in Python or JavaScript Experience with tools like Scrapy, BeautifulSoup, Selenium, or Playwright Understanding of DOM parsing, and web architectures Strong data cleaning and preprocessing skills Proficiency in SQL and working with databases Strong problem‑solving skills and attention to detail Excellent written and verbal communication skills Preferred Qualifications Experience with distributed systems or large‑scale data processing Familiarity with CRM systems like HubSpot or Salesforce Knowledge of sales and marketing data use cases Experience with proxy management and crawler scaling Exposure to AI/ML tools in data workflows What Success Looks Like High‑quality, reliable data pipelines supporting business decisions Accurate and clean datasets delivered to stakeholders Strong cross‑functional collaboration with sales, marketing, and product teams Scalable and efficient data acquisition systems Why Join Collaborative, inclusive, and innovative work culture Opportunities for learning, growth, and career advancement High‑impact role with direct business influence Why You’ll Love Working Here Attractive Compensation & Performance Bonuses : Enjoy a competitive salary paired with performance‑based bonuses Generous PTO : 15 accrued days, plus 10 company holidays annually Health & Wellness : Comprehensive healthcare, dental, and vision plans for you and your family Exclusive Perks : Discounts on attractions, theme parks, shows, sports events, movies, hotels, and more through TicketsAtWork Beauty Perks : $30/month reimbursement for any Vagaro service, including health, beauty, or wellness treatments Food Perks : $50 monthly stipend for our onsite microkitchen and a complimentary DoorDash DashPass subscription Growth Opportunities : College Assistance Reimbursement, access to EAP & Work/Life Programs, and a LinkedIn Learning account Financial Security : 401k program with 4% matching and optional life/supplemental insurance Stay Active : Access to our onsite gym, flavored water dispenser, and basketball court to keep you fit and energized Equal Opportunity Employer Vagaro is proud to be an Equal Employment Opportunity and affirmative action employer. We foster an inclusive environment where individuals are evaluated without discrimination based on gender, race, ethnicity, age, disability, religion, sexual orientation, gender identity, veteran status, or any other characteristics protected by law. #J-18808-Ljbffr Vagaro

Vacancy posted 23 hours ago
Similar jobs that could be interesting for youBased on the Data Engineer, Data Acquisition (Web Crawling & Pipelines) in Pleasanton, CA vacancy
  • Vagaro in Pleasanton, CA is seeking a Data Acquisition Engineer to design and build scalable systems for collecting and processing high-quality...  ...data. Candidates should have 3+ years of experience in web crawling and scraping, along with strong programming skills in... 
    Pipeline
    Web

    Vagaro

    Pleasanton, CA
    4 days ago
  • $80 per hour

     ...Job Title: Agentic Analytics Engineer (contract) PR: $80/hr Contract...  ...scientific and business data from multiple sources to generate...  ...an AI can follow. RAG Pipeline Maintenance: Build and optimize...  ...Nice to Haves: Experience in web application development... 
    Pipeline
    Web
    Contract work
    Work experience placement
    Immediate start

    Medasource

    Hayward, CA
    1 day ago
  •  ...Provide software application engineering and maintenance for all phases...  ...search application including crawling, indexing, query tuning,...  .... Create and update CI/CD pipelines to build and test new features...  ...trends and best practices in web architecture and design in one... 
    Pipeline
    Web

    Vets Hired

    Livermore, CA
    1 day ago
  •  ...computer science, applied math, physics, engineering, statistics, economics or related field....  ...3+ years of industry experience in Data Engineering 3+ years of work experience...  ...experience on building modern data pipelines - Ingestion, Profiling, Integration, Summarization... 
    Pipeline
    Work experience placement

    Apex Informatics

    Pleasanton, CA
    4 days ago
  •  ...Title: Senior Data Engineer Location: Pleasanton, California (hybrid work) Role Overview: As a Senior/Lead Data Engineer...  ...development, and ownership of core data infrastructure-from pipelines to storage to data products. You'll be a strategic partner... 
    Pipeline

    SnapCode, Inc.

    Pleasanton, CA
    23 hours ago
  • $88.1k - $141k

    Cornerstone OnDemand is looking for a Data Engineer for a hybrid role based in Dublin, CA. The ideal candidate will have strong SQL skills...  .... Responsibilities include designing and maintaining data pipelines, optimizing infrastructure, and collaborating on machine learning... 
    Pipeline

    Cornerstone OnDemand

    Dublin, CA
    2 days ago
  •  ...fast-growing organization in life sciences is seeking a Senior Data Engineer to lead the design and implementation of a scalable Data Lakehouse. You will build real-time and batch data ingestion pipelines and develop complex ETL workflows to optimize large-scale data processing... 
    Pipeline

    Veeva Systems

    Pleasanton, CA
    1 day ago
  • $115k - $175k

     ..., employees, and communities. The Role A Senior Data Engineer who can lead the design and implementation of our next-generation...  .... Build and manage real-time and batch data ingestion pipelines using Kafka and Spark Deploy, manage, and scale data... 
    Pipeline
    Work at office
    Local area
    Remote work
    Work from home
    Flexible hours
    3 days per week

    Veeva Systems

    Pleasanton, CA
    4 days ago
  •  ...We are looking for a UI Engineer to design and develop high-performance, scalable, and user-friendly web applications. The ideal candidate should have strong expertise...  ...management. • Hands-on experience with CI/CD pipelines for frontend deployments with GitHub Actions... 
    Pipeline
    Web

    Texas State Library and Archives Commision

    Dublin, CA
    3 days ago
  •  ...Data Analytics Engineer VentureSoft Global, Inc. is a technology services company specializing in software development and IT consulting....  ...Data Analytics Engineer, you will build and maintain data pipelines, tools, and visualizations to enable organizational insights... 
    Pipeline
    Remote work
    Flexible hours

    Venturesoft

    Pleasanton, CA
    1 day ago
  •  ...more than 50% of the Fortune 500. Learn more at Data Scientist - LLM & Data Pipeline Engineering (LegalTech / RegTech AI) Overview: We are seeking...  ...faced with tough decisions. Layoffs, reorganizations, acquisitions, and mergers. Yet, throughout these challenging... 
    Pipeline
    Local area

    Archer Technologies

    Livermore, CA
    23 hours ago
  • A technology services company is seeking a Data Analytics Engineer to develop and maintain data pipelines and visualizations to drive organizational insights. The role requires a Master's degree and 5+ years of experience in data analytics. The candidate must be proficient... 
    Pipeline
    Flexible hours

    Premier Inn Hotels LLC (UAE)

    Pleasanton, CA
    23 hours ago
  • $88.1k - $141k

    Position Overview We're looking for a Data Engineer - United States for a hybrid role in the Dublin, CA office. We are seeking a talented...  ...Design, build, and maintain batch or real‑time data pipelines in production. Maintain and optimize the data infrastructure... 
    Pipeline
    Work at office
    Local area

    Cornerstone OnDemand

    Dublin, CA
    2 days ago
  •  ...6+ Build RESTful APIs and microservices using ASP.NET Core Web API Implement secure authentication and authorization using...  ...skills Preferred Qualifications Experience with CI/CD pipelines Knowledge of Infrastructure as Code (Terraform/Bicep) is a... 
    Pipeline
    Web
    Permanent employment
    Contract work
    Local area
    Remote work

    Tekfortune Inc

    Pleasanton, CA
    4 days ago
  •  ...skilled Java Full Stack Developer with expertise in building scalable web applications using React JS+ CRACO (Modern UI) on the front end...  ...JMeter, Gatling) Experience with containerization (Docker/Kubernetes) Understanding of CI/CD pipelines #J-18808-Ljbffr TechDigital Group
    Pipeline
    Web

    TechDigital Group

    Pleasanton, CA
    1 day ago
  •  ...Overview: Position Title - Java Engineer Position Responsibilities Dexian has been...  ...deployment, proficiency working on CI/CD pipelines, and with well-developed organizational,...  ...Design, develop, implement and support web applications and other technology solutions... 
    Pipeline
    Web
    Work experience placement

    Guru Schools

    Pleasanton, CA
    1 day ago
  • A leading technology firm is seeking an experienced Data Scientist to support their next-generation AI platform. The role involves AI model integration, data pipeline development, and knowledge base engineering with a focus on LegalTech and RegTech. The ideal candidate... 
    Pipeline
    Remote job

    emergemarket.com

    Livermore, CA
    3 days ago
  •  ...Salesforce CRM to enable intelligent data retrieval, personalized...  ...Retrieval-Augmented Generation (RAG) pipelines using vector databases and...  ...Pages, Apex APIs and web services. MUST HAVE SKILLS 1...  ...to work with product managers, engineers, and data teams for AI‑driven... 
    Pipeline
    Web

    TechDigital Group

    Pleasanton, CA
    23 hours ago
  • Ross Stores in Dublin, CA is seeking a Data Engineer to develop data pipelines supporting analytics. The ideal candidate will have 5-8 years of data engineering experience, proficiency in modern data architecture, and be familiar with tools like Snowflake and Airflow. This... 
    Pipeline

    Ross Stores

    Dublin, CA
    4 days ago
  •  ...Cloud, Marketing Cloud, Service Cloud, and Data Cloud. Proven hands-on experience...  ...APIs, Apex programming, SOQL, Lightning Web Components (LWC), and AppExchange solutions...  ...Experience with Salesforce DevOps tools, CI/CD pipelines, and release management best practices.... 
    Pipeline
    Web
    Work at office

    Perfict Global, Inc.

    Pleasanton, CA
    3 days ago
  • $135k - $150k

     ...role involves collaborating with stakeholders, developing dashboards with Power BI, and managing ETL pipelines. Ideal candidates will have over 7 years of experience in data analysis and strong SQL skills. The position offers a competitive salary from $135,000 to $150,000... 
    Pipeline

    Vagaro Inc

    Pleasanton, CA
    1 day ago
  •  ...Title and Summary Software Development Engineer II - Data and Analytics The Business...  ...to the development and modernization of web-based data and analytics applications,...  ...PySpark), for building or consuming data pipelines and analytical workloads. Ability to... 
    Pipeline
    Web
    Full time
    Immediate start
    Worldwide

    Mastercard

    Dublin, CA
    a month ago
  • $30 - $35 per hour

     ...Full Stack Engineering Intern Pleasanton, California, United States...  ...contextualizes and connects operational data across siloed systems,...  ...-on experience with modern web technologies and cloud...  ...with CI/CD tools and deployment pipelines Familiarity with AWS services... 
    Pipeline
    Web
    Full time
    Internship
    Local area

    Avathon

    Pleasanton, CA
    1 day ago
  •  ...storyteller, you’ll transform data into magic and inspire action....  ...and events, drive demand and pipeline, and partner closely with Sales...  ...sales, and build a marketing engine. Craft compelling stories, build...  ...by our campaigns. 50% YoY web traffic increase post-Rockstar... 
    Pipeline
    Web
    Local area
    Worldwide
    Flexible hours

    Workday

    Pleasanton, CA
    1 day ago
  • $164.1k - $222.1k

     ...elusive questions. Our web experience is how the...  ...and hands off customer data to the systems behind...  ...partner closely with engineering and commercial leaders...  ...delight users and drive pipeline. You will also shape...  ...releases that move both new acquisition and existing customer... 
    Pipeline
    Web
    Full time

    10X Genomics

    Pleasanton, CA
    4 days ago
  • $148k - $222k

     ...happen. We are the central nervous system for usage data across the enterprise. By engineering seamless data ingestion pipelines, pinpoint-accurate billing calculations, and...  ...applications 3 + years of experience with UML, Web application development or SaaS (Software as a... 
    Pipeline
    Web
    Work experience placement
    Work at office
    Immediate start
    Remote work
    Home office
    Flexible hours

    HR Tech Job

    Pleasanton, CA
    23 hours ago
  •  ...Jayaraman from Info Way Solutions, LLC We have job opening for Data Engineer with QE experience and the detailed Job description is given...  ...Responsibilities: 1. Data Quality Management : Write Pipelines and help Automate the Quality engineering aspects in there... 
    Pipeline
    Remote work

    Info Way Solutions

    Fremont, CA
    2 days ago
  •  ...Senior Dev Operations Engineer Blackstone Talent Group, an award-winning technology consulting...  ...integration tools Jenkins, Azure Pipelines ~ Experience with system automation and...  ...understanding of and experience with managing web applications in a highly available... 
    Pipeline
    Web
    Weekend work

    Blackstone Restaurant

    Pleasanton, CA
    23 hours ago
  • $156k - $204.7k

     ...ensuring seamless, secure, and scalable data flow across our global ecosystem as Snowflake...  ..., EIB, Core Connectors, and Workday Web Services (SOAP/REST) Partner with internal...  ...Snowflake to enable scalable people data pipelines, reporting, and advanced analytics... 
    Pipeline
    Web
    Work at office
    Flexible hours
    3 days per week

    Streamlit

    Dublin, CA
    1 day ago
  •  ...Solutions, LLC We have job opening for Data Scientist and the detailed Job description...  ...the JD and share your view Role: Data Engineer Location: Montreal Canada ( Hybrid - 3...  ...to learn tools to create data pipelines using Airflow Thanks & Regards, Sangeetha... 
    Pipeline

    Info Way Solutions

    Fremont, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Engineer, Data Acquisition (Web Crawling & Pipelines). Be the first to apply!