Data Engineer, Data Acquisition (Web Crawling & Pipelines)

$120k - $135k

Vagaro Inc

Why Vagaro?

At Vagaro, we believe in fostering a collaborative and inclusive work environment where every team member can thrive. Our culture is built on innovation, continuous learning, and a passion for making a positive impact. We support our employees' growth and vision for themselves, offering opportunities for professional development and career advancement. Join us and be part of a team that values creativity, teamwork, and a commitment to excellence. Plus, we know how to have fun while getting the job done!

About the Role

We are seeking a Data Acquisition Engineer to design, build, and scale systems that collect, process, and maintain high-quality external data from the web and third-party sources. This role combines hands‑on web crawling, scraping, and data engineering with strong data quality and business alignment for sales, marketing, and product teams.

This role is based onsite in Pleasanton, CA Monday through Friday.

SPONSORSHIP NOT AVAILABLE FOR THIS POSITION

Compensation
  • Base Annual Salary: $120,000 - $135,000
  • Annual Bonus: Up to 10%

Key Responsibilities
  • Design and build scalable web crawlers and scraping systems to extract structured and unstructured data
  • Collect data from websites, APIs, and databases to support sales and marketing initiatives
  • Handle dynamic content, pagination, and JavaScript‑heavy sites
  • Perform data cleaning, preprocessing, and validation to ensure accuracy and usability
  • Deliver clean, structured datasets aligned to business requirements
  • Collaborate with sales, marketing, and product teams to align data efforts with business goals
  • Optimize pipelines for performance, scalability, and reliability
  • Ensure ethical, compliant data collection practices

Data Engineering & Infrastructure
  • Build ETL/ELT pipelines for ingestion, transformation, and storage
  • Implement schema design, deduplication, and change detection
  • Manage orchestration tools (Airflow, Prefect, etc.)
  • Work with cloud platforms (AWS, GCP, Azure) and databases

Core Requirements
  • 3+ years of experience in web crawling, scraping, or data engineering
  • Strong programming skills in Python or JavaScript
  • Experience with tools like Scrapy, BeautifulSoup, Selenium, or Playwright
  • Understanding of DOM parsing and web architectures
  • Strong data cleaning and preprocessing skills
  • Proficiency in SQL and working with databases
  • Strong problem‑solving skills and attention to detail
  • Excellent written and verbal communication skills

Preferred Qualifications
  • Experience with distributed systems or large‑scale data processing
  • Familiarity with CRM systems like HubSpot or Salesforce
  • Knowledge of sales and marketing data use cases
  • Experience with proxy management and crawler scaling
  • Exposure to AI/ML tools in data workflows

What Success Looks Like
  • High‑quality, reliable data pipelines supporting business decisions
  • Accurate and clean datasets delivered to stakeholders
  • Strong cross‑functional collaboration with sales, marketing, and product teams
  • Scalable and efficient data acquisition systems

Why Join
  • Collaborative, inclusive, and innovative work culture
  • Opportunities for learning, growth, and career advancement
  • High‑impact role with direct business influence

Why You’ll Love Working Here
  • Attractive Compensation & Performance Bonuses: Enjoy a competitive salary paired with performance‑based bonuses
  • Generous PTO: 15 accrued days, plus 10 company holidays annually
  • Health & Wellness: Comprehensive healthcare, dental, and vision plans for you and your family
  • Exclusive Perks: Discounts on attractions, theme parks, shows, sports events, movies, hotels, and more through TicketsAtWork
  • Beauty Perks: $30/month reimbursement for any Vagaro service, including health, beauty, or wellness treatments
  • Food Perks: $50 monthly stipend for our onsite microkitchen and a complimentary DoorDash DashPass subscription
  • Growth Opportunities: College Assistance Reimbursement, access to EAP & Work/Life Programs, and a LinkedIn Learning account
  • Financial Security: 401k program with 4% matching and optional life/supplemental insurance
  • Stay Active: Access to our onsite gym, flavored water dispenser, and basketball court to keep you fit and energized

Equal Opportunity Employer

Vagaro is proud to be an Equal Employment Opportunity and affirmative action employer. We foster an inclusive environment where individuals are evaluated without discrimination based on gender, race, ethnicity, age, disability, religion, sexual orientation, gender identity, veteran status, or any other characteristics protected by law.

Vacancy posted 3 hours ago