Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior AI Data Engineer/ Data Scientist

Billennium

Billennium is a global technology company with over 20 years of experience, committed to innovation and empowering businesses. As an employer, we offer a supportive, growth-focused environment where collaboration and creativity thrive. Join us to shape the future of technology together ! About the Role: We are looking for a Senior AI Data Engineer / Data Scientist who can turn messy enterprise data into AI-ready, high-quality knowledge assets. You will lead the cleanup, preparation, and enrichment of unstructured content (SharePoint/document repositories) and structured/semi-structured data (data lakes, databases) so our agents, copilots, and RAG systems are accurate, trustworthy, and scalable. This is a senior, hands-on role. You will own data quality outcomes end-to-end: discovery - cleanup - enrichment - ingestion - refresh cycles - governance. We value AI-native generalists who can remove bottlenecks by working directly with AI Engineers, Architects, and business stakeholders to decide what data is worth using and how to structure it for retrieval and reasoning. Our standardized stack includes (and this role actively uses it): ingestion/ETL foundations, Postgres + pgvector as default RAG store, Redis caching, LLM gateway patterns, Langfuse observability, DeepEval/RAGAS evaluation, and Presidio for PII detection/masking when required. Must-have requirements: 5+ years in data engineering / applied data science / analytics engineering with ownership of production pipelines. Proven experience working with unstructured enterprise data (documents, PDFs, Office files, wikis, knowledge bases). Solid understanding of data quality engineering: validation, monitoring, lineage, refresh cycles. Strong stakeholder skill : can work with business to define what data matters and what “good” looks like. Nice to have: Experience with Postgres + pgvector (or similar vector stores), retrieval optimization, and hybrid search concepts. Familiarity with observability practices for AI pipelines and the use of RAG evaluation metrics (RAGAS-style). Experience with governance tooling and privacy controls for enterprise AI (e.g., PII workflows). What you will do: Lead “data triage” for AI use cases: identify authoritative sources, duplicates, outdated content, and low-quality documents. Clean, normalize, deduplicate, and standardize enterprise content at scale (documents, PDFs, Word/Excel, wiki pages, etc.). Define what data should be excluded from AI systems (stale, contradictory, low-trust, or sensitive content). Unstructured ingestion (SharePoint + document repositories) Build robust ingestion pipelines for SharePoint and file repositories: parsing, text extraction, structure recovery, and metadata capture. Implement document normalization strategies (naming, taxonomy, metadata standards, canonical IDs). Design chunking strategies, metadata enrichment, and document structuring optimized for retrieval performance and cost. Improve retrieval quality through practical techniques such as filtered retrieval and post-retrieval optimization where appropriate (e.g., reranking), collaborating with AI Engineers on the retrieval interface. Prepare and maintain “AI-ready knowledge sets” that can be embedded and served via Postgres + pgvector (default). Data quality, evaluation, and feedback loops (non-negotiable) Define and implement data quality gates (freshness, completeness, relevance, dedupe rate, metadata coverage). Partner with AI Engineers to evaluate retrieval and RAG performance using frameworks like RAGAS (answer correctness, context recall/precision) and to monitor trust metrics over time. Establish human feedback loops where needed (review queues, sampling, targeted audits) to continuously improve data usefulness and user trust. Governance, privacy, and auditability Apply privacy and enterprise constraints; where required, implement PII detection/masking using Presidio patterns. Reuse Package reusable “data cleanup + RAG readiness” recipes: ingestion templates, metadata schemas, chunking playbooks, dedupe strategies. Build a repeatable data foundation that accelerates future use cases (not a one-off cleanup project). Our offer: Comprehensive benefits - enjoy Udemy for Business, private medical care, Multisport card, veterinary package, language lessons, and shopping vouchers. Flexibility - adaptable working hours and remote/hybrid work options to suit your lifestyle & location. Career growth - access opportunities for professional development and learning, including perks related to our official partnerships with global IT giants: Microsoft, AWS, Snowflake, Salesforce & more. Global collaboration - work with a diverse, international team. Innovative environment part of a forward-thinking and growth-oriented workplace. Engaging community - Work with passionate professionals and participate in team-building events, hackathons, and CSR initiatives to make an impact beyond work. Team-building events including our company tradition (annual company event in Mazury). A pleasant surprise to start your journey with us in the form of a welcome pack. Recruitment process: HR call Technical Interview Final Interview Decision/ Feedback Sounds interesting? Click "Apply" and have a chance to hear more! #J-18808-Ljbffr

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior AI Data Engineer/ Data Scientist in Poland, NY vacancy
  •  ...seeking candidates for a role focused on developing Python-based solutions for its Global Finance teams. The position requires strong Data Science experience and Python programming skills, aiming to improve outputs through innovative techniques. Successful candidates... 
    Senior

    HSBC

    Poland, NY
    11 hours ago
  •  ...Luxoft Poland is seeking a skilled Data Engineer to enhance an internal LLM-powered assistant. You will focus on improving data ingestion, retrieval performance, and ensuring operational excellence. The ideal candidate has at least 8 years in Data Science and 5 in Machine... 
    Senior

    Luxoft Poland

    Poland, NY
    4 days ago
  •  ...Join the Data Engineering team to contribute to the ongoing maintenance and improvement of an internal LLM-powered assistant that uses hosted LLM APIs and internal knowledge sources, with a focus on reliability, retrieval quality, and operational excellence. Maintain... 
    Senior
    Contract work

    Luxoft Poland

    Poland, NY
    3 days ago
  •  ...SoftwareOne is looking for an AI Data Engineer to join our innovative AI team. This role involves designing and building data pipelines for AI solutions and collaborating with cross-functional teams. The ideal candidate will have a proven background in data engineering... 
    Suggested
    Remote work

    SoftwareONE

    Poland, NY
    11 hours ago
  •  ...Practical Information Location: Poland | Reports to: AI Data Science Lead | Work Arrangement: Remote / Hybrid | Contract type: Full-time...  ...and machine learning for our internal employees. As an AI Data Engineer in our AI Team, you will play a critical role in shaping the... 
    Suggested
    Full time
    Contract work
    Work at office
    Local area
    Remote work
    Flexible hours

    SoftwareONE

    Poland, NY
    11 hours ago
  •  ...SoftServe is looking for an experienced Data Engineer to develop scalable cloud-based data solutions and processing pipelines on AWS. In this role, you will design and maintain data workflows while collaborating with stakeholders to deliver effective solutions. The ideal... 
    Senior

    SoftServe

    Poland, NY
    11 hours ago
  • $40 per hour

     ...Mindrift is seeking a Senior Python Data Scraping Engineer for the Tendem project, focusing on specialized data scraping workflows within an AI-driven system. This part-time remote role requires at least 5 years of experience in web scraping and automation, demanding a... 
    Senior
    Hourly pay
    Part time
    Remote work

    Mind Rift

    Poland, NY
    11 hours ago
  •  ...Exadel open positions is looking for a Senior Data Engineer to join their team in the Town of Poland, New York. In this role, you'll design and optimize ETL/ELT pipelines and collaborate with cross-functional teams to develop impactful data solutions. The ideal candidate... 
    Senior
    Work at office
    Remote work

    Exadel open positions

    Poland, NY
    4 days ago
  •  ...Exadel open positions is looking for an experienced Data Engineer located in the Town of Poland, New York. The ideal candidate will have over 5 years of experience in data engineering, strong skills in Databricks and SQL, and a solid understanding of data warehousing principles... 
    Senior
    Work at office
    Remote work

    Exadel open positions

    Poland, NY
    3 days ago
  •  ...Senior Data Engineer Proxify is a platform that connects top developers worldwide to remote full‑time opportunities. The Role We are looking for a Senior Data Engineer specializing in modern, cloud‑native data platforms, with a strong focus on Amazon Web Services (AWS... 
    Senior
    Full time
    Remote work
    Worldwide
    Flexible hours

    Proxify

    Poland, NY
    4 days ago
  •  ...We are looking for a motivated Senior Data Engineer (ADB + Python) who is willing to dive into the new project with a modern stack. If you’re...  ...produce meaningful results, please apply! Why Join Exadel We’re an AI-first global tech company with 25+ years of engineering... 
    Senior
    Contract work
    Work at office
    Remote work

    Exadel

    Poland, NY
    18 hours ago
  •  ...A leading data-focused company is seeking a Senior Data Engineer to work remotely. The candidate will be responsible for migrating and optimizing data models, fixing SQL syntax differences, and collaborating with teams to ensure successful data integration. Required qualifications... 
    Senior
    Remote work

    Pipe Recruit

    Poland, NY
    3 days ago
  •  ...Dotlinkers IT recruitment is seeking a Senior Big Data Engineer who will contribute to building a next-generation data platform and services. The successful candidate will possess over 4 years of software development experience and proficiency in Java, Scala, or Python... 
    Senior
    Remote work
    Flexible hours

    Dotlinkers IT recruitment

    Poland, NY
    3 days ago
  •  ...Exadel is searching for a skilled Data Engineer to design and implement scalable data solutions, utilizing technologies like Databricks and Azure Synapse. The role involves optimizing data systems for analytics and BI, collaborating with cross-functional teams to deliver... 
    Senior
    Work at office
    Remote work
    Flexible hours

    Exadel open positions

    Poland, NY
    18 hours ago
  •  ...A software development company in New York is looking for a Data Engineer with 4+ years of experience in Data Engineering and SQL. The role involves developing databases, designing solutions, and collaborating with various stakeholders. Proficiency in Python and Databricks... 
    Senior

    Eleks

    Poland, NY
    3 days ago
  •  ...Grape Up is seeking an experienced Data Engineer to implement scalable architectures and manage high volumes of simulation data. The ideal candidate will have a Master’s degree in a related field and over 6 years of experience in Data Engineering, specifically with AWS... 
    Senior

    Grape Up

    Poland, NY
    4 days ago
  •  ...A global IT consulting company is seeking a Senior Data Engineer to develop innovative data solutions for international clients. The role requires a minimum of 7 years of experience and expertise in data processing systems and tools such as Python, SQL, and cloud services... 
    Senior
    Remote work

    STX Next Sp z.o.o

    Poland, NY
    4 days ago
  •  ...We are seeking a highly skilled and experienced Senior Data Engineer with solid expertise in AWS, Azure & GCP, along with proficiency in modern data transformation tools such as dbt and Databricks. Tasks In the role of Senior Data Engineer, you will be entrusted with... 
    Senior
    Flexible hours

    Insightify

    Poland, NY
    3 days ago
  •  ...Grape Up, we transform businesses by unlocking the potential of AI and data through innovative software solutions. We partner with...  ...following industry best practices Collaborate effectively with data engineering team members while partnering closely with analytics and data... 
    Senior
    Flexible hours

    Grape Up

    Poland, NY
    3 days ago
  •  ...Position: Senior Big Data Engineer Working model: Remote or hybrid Form of employment: contract of employment Join our client which makes software...  ...issues during litigation and internal investigations. The AI-powered communication surveillance product proactively detects... 
    Senior
    Contract work
    Remote work
    Home office

    Dotlinkers IT recruitment

    Poland, NY
    3 days ago
  •  ...we're accelerating the transformation of global data landscapes. We're seeking highly experienced Senior Data Engineers to help deliver a robust and scalable data architecture...  ...about building end-to-end pipelines, enabling AI/BI solutions, and thriving in fast-paced, high-... 
    Senior

    Agentic Dream

    Poland, NY
    4 days ago
  •  ...functionality analysis and development, maintenance and rework of existing web applications. Requirements 3+ years of experience as a Data Engineer Knowledge and experience with: Python, SQL, Azure Synapse, Azure One Lake, Data Factory, and Spark jobs Experience with: NoSQL... 
    Senior
    Relocation
    Flexible hours

    GlobalLogic

    Poland, NY
    3 days ago
  •  ...Akamai is seeking a Senior Security Engineer to protect AI inference environments from emerging threats. In this role, you will define and implement security practices across the AI inference stack, focusing on runtime protection and threat modeling. Successful candidates... 
    Senior
    Remote work

    Akamai

    Poland, NY
    3 days ago
  •  ...Simple Life is the #1 AI-powered health coaching app for adults who want to lose weight and enjoy a healthier lifestyle—without...  ...to—new healthy habits. To learn more, visit simple.life. Senior Data Engineer Push the pace of innovation and build a future of a healthier... 
    Senior
    Work at office
    Remote work
    Flexible hours

    Palta

    Poland, NY
    11 hours ago
  •  ...A leading engineering company in Poland is seeking a Senior Artificial Intelligence/Machine Learning Engineer to join their innovative team. This role involves leading the ML infrastructure strategy, architecting scalable pipelines, and mentoring engineers. You will work... 
    Senior
    Remote work
    Flexible hours

    Ciklum

    Poland, NY
    4 days ago
  •  ...We are looking for a Senior Data Engineer to join our agile team and help design, develop, and evolve the strategic integration backbone between Dealstores, Operations, and Regulatory systems within our Digital Operations Stream. In this role, you will drive solution... 
    Senior
    Work at office
    Worldwide
    Flexible hours
    3 days per week

    EPAM Systems Inc

    Poland, NY
    1 day ago
  •  ...SQL BigQuery GCP Data Fusion Data modeling Inmon Data Vault API Data JSON XML CSV REST SOAP Git CI/CD Angielski Data Major 3+ years of expert-level SQL & BigQuery hands-on experience (complex optimization, cost control, data integrity). 3+ years of experience in ELT/ETL... 
    Senior
    Immediate start

    Link Group

    Poland, NY
    4 days ago
  •  ...EPAM Systems is seeking a Senior Data Engineer to enhance their Digital Operations Stream. Responsibilities include managing solution architecture and collaborating with agile teams to ensure robust trade flow systems. The ideal candidate should possess a strong Java... 
    Senior
    Work at office

    EPAM Systems Inc

    Poland, NY
    18 hours ago
  •  ...skilled professional in Town of Poland, NY to design and implement advanced computer vision solutions. You will collaborate with a team of AI experts, focusing on image generation and multimodal intelligence, contributing to impactful projects for world-leading clients. The... 
    Senior
    Flexible hours

    SoftServe

    Poland, NY
    4 days ago
  •  ...TL;DR: Senior Software Engineer specializing in TypeScript while using AI daily to build browser automations, seeking a fully remote role within ±3 hours of CET (Berlin). You thrive on solving real problems, are eager to grow, and want to work closely with a small, tight... 
    Senior
    Full time
    Remote work
    Worldwide
    Flexible hours

    AccessOwl

    Poland, NY
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Data Engineer/ Data Scientist. Be the first to apply!