Data/Infrastructure Advocate Engineer - US Remote

Hugging Face

Data/Infrastructure Advocate Engineer

At Hugging Face, we're on a journey to democratize good AI. We're building the fastest-growing platform for AI builders, with over 5 million users and 100k organizations who have shared more than 1M models, 300k datasets, and 300k apps. Our open-source libraries have more than 400k stars on GitHub.

About the Role

As our first Data/Infrastructure Advocate Engineer, you'll bridge the gap between cutting-edge data infrastructure and the global community of data engineers, researchers, and developers. You'll champion Xet storage on the Hugging Face Hub, helping users efficiently store, version, and collaborate on large-scale datasets. This role is for someone who thrives at the intersection of technical depth (storage, Parquet, deduplication) and community advocacy, helping define the future of open data workflows.

You'll collaborate with teams like Datasets, Hub, and Infrastructure to shape how developers interact with data on our platform, and inspire a community to build better, faster, and more scalable data pipelines.

Your Main Missions
  • Grow and nurture the open-source data/infra community: launch initiatives, collaborate with data-focused groups, and organize events or challenges. Engage with communities like Apache Parquet, Open Table Formats, and data engineering forums to promote best practices and Hugging Face tools.
  • Promote the Hugging Face Hub as the go-to platform for data storage, versioning, and collaboration, curating and showcasing datasets, benchmarks, and tools like Xet.
  • Highlight use cases like efficient large-dataset updates, Parquet editing, and deduplication to demonstrate the Hub's value for data workflows.
  • Create demos, benchmarks, and tools (for example Colab notebooks) that illustrate best practices for data storage and versioning, and experiment with Xet, Parquet, and other formats.
  • Produce high-quality tutorials, blog posts, and videos that make complex topics accessible.
  • Share insights on storage optimization, dataset versioning, and deduplication to empower developers.
  • Actively participate in online communities (Discord, GitHub, forums) to highlight contributions, answer questions, and foster collaboration.
  • Make sure datasets and tools released on the Hub are well-documented, with clear examples, benchmarks, and use cases.

About You

You're already an active voice in the data and ML community. You build in public, you publish, and people follow your work on LinkedIn and X.

You're a hands-on builder who loves experimenting with data tools, storage optimization, and dataset versioning. You can take a complex topic like deduplication, compression, or Parquet editing and make it click for other developers through writing, demos, or talks. You're passionate about open source and knowledge sharing, and you thrive in fast-moving environments.

What You'll Need
  • 3+ years in developer relations or developer advocacy, ideally for data engineering, infrastructure, or ML tools and platforms
  • An established public presence as a technical voice, with a track record of regularly publishing data/infra/ML content and a demonstrable, engaged audience on LinkedIn and X (Twitter)
  • A portfolio of developer-facing content you can point to: tutorials, blog posts, videos, demos, benchmarks, or conference talks
  • Hands-on experience building and engaging open-source or developer communities (Discord, GitHub, forums)
  • Strong Python skills
  • Hands-on experience with data libraries such as pandas, pyarrow, and huggingface/datasets
  • Practical experience with storage systems and formats: Parquet, Open Table Formats, and S3
  • Working knowledge of dataset versioning, deduplication, and compression
  • Ability to explain complex technical topics clearly through writing, demos, or talks
  • Fluent written and spoken English

Nice to Have
  • Experience with the Hugging Face Hub and datasets ecosystem, or with Xet
  • Open-source maintainer or contributor experience
  • Familiarity with large-scale data pipelines and data engineering workflows
  • Experience producing notebooks (for example Colab) for tutorials and benchmarks

A Note on Fit

If you're interested in joining us but don't tick every box above, we still encourage you to apply. We're building a diverse team whose skills, experiences, and backgrounds complement one another, and we're happy to consider where you might make the biggest impact.

How to Apply

At Hugging Face, we believe great AI shouldn't require a massive cluster; we build for everyone, especially the GPU-poor. Because we genuinely read every application, here's a small way to show that you read this one too: start your cover letter with the words "GPU-poor and proud of it". No trick, no catch. It just tells us a real person is on the other side.

More About Hugging Face

We are actively working to build a culture that values diversity, equity, and inclusivity. We are intentionally building a workplace where you feel respected and supported—regardless of who you are or where you come from. We believe this is foundational to building a great company and community, as well as the future of machine learning more broadly. Hugging Face is an equal opportunity employer, and we do not discriminate based on race, ethnicity, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or ability status.

We value development. You will work with some of the smartest people in our industry. We are an organization that has a bias for impact and is always challenging ourselves to grow continuously. We provide all employees with reimbursement for relevant conferences, training, and education.

We care about your well-being. We offer flexible working hours and remote options. We offer health, dental, and vision benefits for employees and their dependents. We also offer parental leave and flexible paid time off.

We support our employees wherever they are. While we have office spaces in NYC and Paris, we're very distributed, and all remote employees have the opportunity to visit our offices. If needed, we'll also outfit your workstation to ensure you succeed.

We want our teammates to be shareholders. All employees have company equity as part of their compensation package. If we succeed in becoming a category-defining platform in machine learning and artificial intelligence, everyone enjoys the upside.

Vacancy posted 2 days ago