Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Engineer Data Acquisition Web Crawling Pipelines

$120k - $135k

Vagaro Inc

****** Why Vagaro? ******At Vagaro, we believe in fostering a collaborative and inclusive work environment where every team member can thrive. Our culture is built on innovation, continuous learning, and a passion for making a positive impact. We support our employees' growth and vision for themselves, offering opportunities for professional development and career advancement. Join us and be part of a team that values creativity, teamwork, and a commitment to excellence. Plus, we know how to have fun while getting the job done! ****

About the Role

We are seeking a Data Acquisition Engineer to design, build, and scale systems that collect, process, and maintain high-quality external data from the web and third-party sources.

This role combines hands-on web crawling, scraping, and data engineering with strong data quality and business alignment for sales, marketing, and product teams.

*** This role is based onsite in Pleasanton, CA Monday through Friday***

_ SPONSORSHIP NOT AVAILABLE FOR THIS POSITION
_

Compensation

  • Base Annual Salary: $120,000 - $135,000
  • Annual Bonus: Up to 10%

Key Responsibilities

  • Design and build scalable web crawlers and scraping systems to extract structured and unstructured data
  • Collect data from websites, APIs, and databases to support sales and marketing initiatives
  • Handle dynamic content, pagination, and JavaScript-heavy sites
  • Perform data cleaning, preprocessing, and validation to ensure accuracy and usability
  • Deliver clean, structured datasets aligned to business requirements
  • Collaborate with sales, marketing, and product teams to align data efforts with business goals
  • Optimize pipelines for performance, scalability, and reliability
  • Ensure ethical, compliant data collection practices

**Data Engineering & Infrastructure **

  • Build ETL/ELT pipelines for ingestion, transformation, and storage
  • Implement schema design, deduplication, and change detection
  • Manage orchestration tools (Airflow, Prefect, etc.)
  • Work with cloud platforms (AWS, GCP, Azure) and databases

Core Requirements

  • 3+ years of experience in web crawling, scraping, or data engineering
  • Strong programming skills in Python or JavaScript
  • Experience with tools like Scrapy, BeautifulSoup, Selenium, or Playwright
  • Understanding of DOM parsing, and web architectures
  • Strong data cleaning and preprocessing skills
  • Proficiency in SQL and working with databases
  • Strong problem-solving skills and attention to detail
  • Excellent written and verbal communication skills

Preferred Qualifications

  • Experience with distributed systems or large-scale data processing
  • Familiarity with CRM systems like HubSpot or Salesforce
  • Knowledge of sales and marketing data use cases
  • Experience with proxy management and crawler scaling
  • Exposure to AI/ML tools in data workflows

What Success Looks Like

  • High-quality, reliable data pipelines supporting business decisions
  • Accurate and clean datasets delivered to stakeholders
  • Strong cross-functional collaboration with sales, marketing, and product teams
  • Scalable and efficient data acquisition systems

Why Join

  • Collaborative, inclusive, and innovative work culture
  • Opportunities for learning, growth, and career advancement
  • High-impact role with direct business influence

Why You'll Love Working Here:

  • **Attractive Compensation & Performance Bonuses: **Enjoy a competitive salary paired with performance-based bonuses
  • Generous PTO: 15 accrued days, plus 10 company holidays annually.
  • Health & Wellness: Comprehensive healthcare, dental, and vision plans for you and your family.
  • Exclusive Perks: Discounts on attractions, theme parks, shows, sports events, movies, hotels, and more through TicketsAtWork.
  • Beauty Perks: $30/month reimbursement for any Vagaro service, including health, beauty, or wellness treatments.
  • Food Perks: $50 monthly stipend for our onsite microkitchen and a complimentary DoorDash DashPass subscription.
  • Growth Opportunities: College Assistance Reimbursement, access to EAP & Work/Life Programs, and a LinkedIn Learning account.
  • Financial Security: 401k program with 4% matching and optional life/supplemental insurance.
  • Stay Active: Access to our on-site gym, flavored water dispenser, and basketball court to keep you fit and energized!

Equal Opportunity Employer:
Vagaro is proud to be an Equal Employment Opportunity and affirmative action employer. We foster an inclusive environment where individuals are evaluated without discrimination based on gender, race, ethnicity, age, disability, religion, sexual orientation, gender identity, veteran status, or any other characteristics protected by law.

Privacy Policy:
Your privacy matters! At Vagaro, we are committed to protecting your personal information. Before proceeding with your application, please review our Employee and Applicant Privacy Notice here. By submitting your application, you acknowledge that you have read and understood our Privacy Notice, which outlines how we collect, use, disclose, and protect your information during the recruitment and employment process.

Vagaro is an E-Verify employer. Learn more at

Learn More About Vagaro:
Visit us at vagaro.com/pro and vagaro.com to learn more.]

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Data Engineer Data Acquisition Web Crawling Pipelines in Pleasanton, CA vacancy
  • Vagaro in Pleasanton, CA is seeking a Data Acquisition Engineer to design and build scalable systems for collecting and processing high-quality...  ...data. Candidates should have 3+ years of experience in web crawling and scraping, along with strong programming skills in... 
    Pipeline
    Web

    Vagaro

    Pleasanton, CA
    1 day ago
  • $140k - $200k

     ...Mac App, Chrome Extension, and Web App. Google recently named...  ...include frontend and backend engineers, AI research scientists, and others...  ...We're looking to hire for our Data side of our AI team at...  ...and bring it into our ingestion pipeline Operate and extend the cloud... 
    Pipeline
    Web
    Full time
    Work at office
    Shift work

    Speechify

    Fremont, CA
    1 day ago
  • $93k - $124k

     ...Job Description Summary As the Sr. Data Engineer, you will play a critical role in building...  ...logging, and alerting, optimizing data pipelines for scalability, and ensuring compliance...  ...this job. Desired Characteristics: Web application stacks (Node.js, Vue, React)... 
    Pipeline
    Web
    Permanent employment
    Contract work
    Remote work
    Visa sponsorship
    Work visa
    Relocation package

    GE Aerospace

    San Ramon, CA
    2 days ago
  • $100k - $150k

     ...Account Executive (SAE) to join our Acquisition Sales organization. This role...  ..., and closing. Master pipeline management: maintain disciplined...  ...Salesforce. Leverage data-driven insights: use metrics,...  ...visitors, while a full range of web presence offerings has established... 
    Pipeline
    Web
    Base plus commission
    Remote work

    Martindale-Avvo

    Pleasanton, CA
    1 day ago
  •  ...Provide software application engineering and maintenance for all phases...  ...search application including crawling, indexing, query tuning,...  .... Create and update CI/CD pipelines to build and test new features...  ...trends and best practices in web architecture and design in one... 
    Pipeline
    Web

    Vets Hired

    Livermore, CA
    3 days ago
  •  ...are looking for a highly skilled Senior Azure Databricks (ADB) Developer to join our Data Engineering team. This role involves developing large-scale batch and streaming data pipelines on Azure Cloud. The ideal candidate will have strong expertise in Python, Databricks... 
    Pipeline

    InterSources

    Pleasanton, CA
    2 days ago
  •  ...Data Engineer (Hybrid) Dublin, CA / Houston, TX (onsite 1-2 days a week) Duration: 6 month CTH Top skill sets: Snowflake Must...  ...develop Experienced with designing and implementing data pipelines that move data from various sources (such as database) into Snowflake... 
    Pipeline
    2 days per week
    1 day per week

    Staffing the Universe

    Dublin, CA
    2 days ago
  •  ...Overview: Job Summary: The Senior Data Engineer will be responsible for designing, building, and maintaining robust data pipelines and architectures on AWS to support scalable data processing, storage, and analytics. The ideal candidate will possess deep expertise... 
    Pipeline

    Purple Drive

    Pleasanton, CA
    10 hours ago
  •  ...computer science, applied math, physics, engineering, statistics, economics or related field....  ...3+ years of industry experience in Data Engineering 3+ years of work experience...  ...experience on building modern data pipelines - Ingestion, Profiling, Integration, Summarization... 
    Pipeline
    Work experience placement

    Apex Informatics

    Pleasanton, CA
    1 day ago
  •  ...Senior Data Engineer Location: Pleasanton, California (hybrid work) Role Overview As a Senior/Lead Data Engineer, you will lead...  ...development, and ownership of core data infrastructure—from pipelines to storage to data products. You'll be a strategic partner across... 
    Pipeline

    SnapCode Inc

    Pleasanton, CA
    2 days ago
  •  ...We are seeking a Data Analyst to serve as a hands-on contributor supporting data pipelines, reporting, and process automation. This role will act as a "second pair of...  ...position • Opportunity to grow skills across data engineering, automation, and analytics as team needs... 
    Pipeline

    Insight Global

    Dublin, CA
    5 days ago
  • $88.1k - $141k

     ...We're looking for a Data Engineer - United States This role is Hybrid, Dublin Office Data Engineer - Hybrid(Dublin, CA Office...  ...... • Design, build and maintain batch or real-time data pipelines in production. • Maintain and optimize the data... 
    Pipeline
    Full time
    Work at office
    Local area

    Cornerstone OnDemand

    Dublin, CA
    10 hours ago
  •  ...fast-growing organization in life sciences is seeking a Senior Data Engineer to lead the design and implementation of a scalable Data Lakehouse. You will build real-time and batch data ingestion pipelines and develop complex ETL workflows to optimize large-scale data processing... 
    Pipeline

    Veeva Systems

    Pleasanton, CA
    3 days ago
  • $128k - $252.2k

     ...: We are seeking an experienced and highly skilled senior data engineer to join our enterprise data strategy and operations team. The...  ...extensive expertise in designing, building and maintaining data pipelines and data solution architectures on cloud platforms,... 
    Pipeline
    For contractors
    Summer work
    Work at office
    Work from home
    Flexible hours

    The Clorox Company

    Pleasanton, CA
    4 days ago
  •  ...more than 50% of the Fortune 500. Learn more at Data Scientist - LLM & Data Pipeline Engineering (LegalTech / RegTech AI) Overview: We are seeking...  ...faced with tough decisions. Layoffs, reorganizations, acquisitions, and mergers. Yet, throughout these challenging... 
    Pipeline
    Local area

    Archer Technologies

    Livermore, CA
    2 days ago
  •  ...aggressive growth strategy through the acquisition of new clients, additional offerings...  ...existing clients and a robust development pipeline to ensure continued short and long...  ...develop product collateral (brochures, data sheets, web content, direct mail campaign cards, presentations... 
    Pipeline
    Web
    Temporary work
    Work at office

    Key Solutions

    Fremont, CA
    1 day ago
  •  ...skilled Java Full Stack Developer with expertise in building scalable web applications using React JS on the front end and Java on the...  ...testing tools (e.g., JMeter, Gatling). Experience with containerization (Docker/Kubernetes). Understanding of CI/CD pipelines.... 
    Pipeline
    Web

    Yochana

    Pleasanton, CA
    3 days ago
  •  ...We are looking for a UI Engineer to design and develop high-performance, scalable, and user-friendly web applications. The ideal candidate should have strong expertise...  ...management. • Hands-on experience with CI/CD pipelines for frontend deployments with GitHub Actions... 
    Pipeline
    Web

    Texas State Library and Archives Commision

    Dublin, CA
    10 hours ago
  • $127.9k - $204.6k

     ...Principal Data Engineer We are seeking a Principal Data & Advanced Analytics Engineer to help us unlock the next frontier of insight...  ...storytelling. Design and build advanced dashboards and analytics pipelines, leveraging AI/ML to solve high-impact business problems.... 
    Pipeline
    Full time

    Cornerstone OnDemand

    Dublin, CA
    2 days ago
  • A leading technology firm is seeking an experienced Data Scientist to support their next-generation AI platform. The role involves AI model integration, data pipeline development, and knowledge base engineering with a focus on LegalTech and RegTech. The ideal candidate... 
    Pipeline
    Remote work

    emergemarket.com

    Livermore, CA
    9 days ago
  • $115k - $175k

     ...its customers, employees, and communities. The Role A Senior Data Engineer who can lead the design and implementation of our next-...  ...management. Build and manage real-time and batch data ingestion pipelines using Kafka and Spark. Deploy, manage, and scale data workloads... 
    Pipeline
    Work at office
    Local area
    Work from home

    Veeva Systems

    Pleasanton, CA
    4 days ago
  • $63 - $68 per hour

     ...Experience developing and maintaining full-stack web applications. Experience integrating...  ...build tools. Experience with CI/CD pipelines and DevOps practices. Experience with...  ...Science, Information Technology, Engineering, or related field preferred. Relevant... 
    Pipeline
    Web

    Cynet Systems

    Pleasanton, CA
    2 days ago
  •  ...Full Stack Automation Engineer Job Location: Pleasanton CA Job...  ...to understand large disparate data sets and transform the information...  ...for speed and scalability of web applications Demonstrated...  ...Building and maintaining CI/CD Pipelines Demonstrated experience... 
    Pipeline
    Web
    Contract work

    InterSources

    Pleasanton, CA
    2 days ago
  •  ...6+ Build RESTful APIs and microservices using ASP.NET Core Web API Implement secure authentication and authorization using...  ...skills Preferred Qualifications Experience with CI/CD pipelines Knowledge of Infrastructure as Code (Terraform/Bicep) is a... 
    Pipeline
    Web
    Permanent employment
    Contract work
    Local area
    Remote work

    Tekfortune Inc

    Pleasanton, CA
    1 day ago
  •  ..., good to have) Create prompt engineering frameworks and templates to ensure consistent...  ...Build responsive, accessible web interfaces for AI-powered applications...  ...inference Implement efficient data processing pipelines for AI training and inference... 
    Pipeline
    Web
    Contract work

    AceStack LLC

    Pleasanton, CA
    2 days ago
  •  ...Implement services using Spring Boot, Quarkus, and modern web technologies, following 12-factor app principles for cloud-native...  ...GitHub Actions for version control, CI/CD automation, and deployment pipelines. Build and deploy containerized applications using Docker and... 
    Pipeline
    Web

    Ampcus

    Pleasanton, CA
    4 days ago
  •  ...Overview: Position Title - Java Engineer Position Responsibilities Dexian has been...  ...deployment, proficiency working on CI/CD pipelines, and with well-developed organizational,...  ...Design, develop, implement and support web applications and other technology solutions... 
    Pipeline
    Web
    Work experience placement

    Guru Schools

    Pleasanton, CA
    3 days ago
  •  ...Role: Web Lead / Staff Engineer - CGEMJP00308604 Work location: Atlanta, GA - Onsite Contract: 12+ months contract JOB DESCRIPTION...  ...Webpack, Babel, and NPM • Hands-on experience with Git, CI/CD pipelines, and automated testing • Excellent problem-solving,... 
    Pipeline
    Web
    Contract work

    Kasmo Global

    Pleasanton, CA
    4 days ago
  •  ...Developer to build AI/ML-driven cloud data solutions for enterprise clients. You'll develop scalable web applications using Node.js,...  ..., deploy AI models and data pipelines on AWS and Azure, and optimize...  ...with data scientists and engineers to integrate machine learning... 
    Pipeline
    Web
    Full time

    Right Skale, Inc.

    Pleasanton, CA
    1 day ago
  •  ...Salesforce CRM to enable intelligent data retrieval, personalized...  ...Retrieval-Augmented Generation (RAG) pipelines using vector databases and...  ...Pages, Apex APIs and web services. MUST HAVE SKILLS 1...  ...to work with product managers, engineers, and data teams for AI‑driven... 
    Pipeline
    Web

    TechDigital Group

    Pleasanton, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Engineer Data Acquisition Web Crawling Pipelines. Be the first to apply!