Senior AI Data Engineer/ Data Scientist
Billennium
Billennium is a global technology company with over 20 years of experience, committed to innovation and empowering businesses. As an employer, we offer a supportive, growth-focused environment where collaboration and creativity thrive. Join us to shape the future of technology together ! About the Role: We are looking for a Senior AI Data Engineer / Data Scientist who can turn messy enterprise data into AI-ready, high-quality knowledge assets. You will lead the cleanup, preparation, and enrichment of unstructured content (SharePoint/document repositories) and structured/semi-structured data (data lakes, databases) so our agents, copilots, and RAG systems are accurate, trustworthy, and scalable. This is a senior, hands-on role. You will own data quality outcomes end-to-end: discovery - cleanup - enrichment - ingestion - refresh cycles - governance. We value AI-native generalists who can remove bottlenecks by working directly with AI Engineers, Architects, and business stakeholders to decide what data is worth using and how to structure it for retrieval and reasoning. Our standardized stack includes (and this role actively uses it): ingestion/ETL foundations, Postgres + pgvector as default RAG store, Redis caching, LLM gateway patterns, Langfuse observability, DeepEval/RAGAS evaluation, and Presidio for PII detection/masking when required. Must-have requirements: 5+ years in data engineering / applied data science / analytics engineering with ownership of production pipelines. Proven experience working with unstructured enterprise data (documents, PDFs, Office files, wikis, knowledge bases). Solid understanding of data quality engineering: validation, monitoring, lineage, refresh cycles. Strong stakeholder skill : can work with business to define what data matters and what “good” looks like. Nice to have: Experience with Postgres + pgvector (or similar vector stores), retrieval optimization, and hybrid search concepts. Familiarity with observability practices for AI pipelines and the use of RAG evaluation metrics (RAGAS-style). Experience with governance tooling and privacy controls for enterprise AI (e.g., PII workflows). What you will do: Lead “data triage” for AI use cases: identify authoritative sources, duplicates, outdated content, and low-quality documents. Clean, normalize, deduplicate, and standardize enterprise content at scale (documents, PDFs, Word/Excel, wiki pages, etc.). Define what data should be excluded from AI systems (stale, contradictory, low-trust, or sensitive content). Unstructured ingestion (SharePoint + document repositories) Build robust ingestion pipelines for SharePoint and file repositories: parsing, text extraction, structure recovery, and metadata capture. Implement document normalization strategies (naming, taxonomy, metadata standards, canonical IDs). Design chunking strategies, metadata enrichment, and document structuring optimized for retrieval performance and cost. Improve retrieval quality through practical techniques such as filtered retrieval and post-retrieval optimization where appropriate (e.g., reranking), collaborating with AI Engineers on the retrieval interface. Prepare and maintain “AI-ready knowledge sets” that can be embedded and served via Postgres + pgvector (default). Data quality, evaluation, and feedback loops (non-negotiable) Define and implement data quality gates (freshness, completeness, relevance, dedupe rate, metadata coverage). Partner with AI Engineers to evaluate retrieval and RAG performance using frameworks like RAGAS (answer correctness, context recall/precision) and to monitor trust metrics over time. Establish human feedback loops where needed (review queues, sampling, targeted audits) to continuously improve data usefulness and user trust. Governance, privacy, and auditability Apply privacy and enterprise constraints; where required, implement PII detection/masking using Presidio patterns. Reuse Package reusable “data cleanup + RAG readiness” recipes: ingestion templates, metadata schemas, chunking playbooks, dedupe strategies. Build a repeatable data foundation that accelerates future use cases (not a one-off cleanup project). Our offer: Comprehensive benefits - enjoy Udemy for Business, private medical care, Multisport card, veterinary package, language lessons, and shopping vouchers. Flexibility - adaptable working hours and remote/hybrid work options to suit your lifestyle & location. Career growth - access opportunities for professional development and learning, including perks related to our official partnerships with global IT giants: Microsoft, AWS, Snowflake, Salesforce & more. Global collaboration - work with a diverse, international team. Innovative environment part of a forward-thinking and growth-oriented workplace. Engaging community - Work with passionate professionals and participate in team-building events, hackathons, and CSR initiatives to make an impact beyond work. Team-building events including our company tradition (annual company event in Mazury). A pleasant surprise to start your journey with us in the form of a welcome pack. Recruitment process: HR call Technical Interview Final Interview Decision/ Feedback Sounds interesting? Click "Apply" and have a chance to hear more! #J-18808-Ljbffr
- ...seeking candidates for a role focused on developing Python-based solutions for its Global Finance teams. The position requires strong Data Science experience and Python programming skills, aiming to improve outputs through innovative techniques. Successful candidates...Senior
- ...Luxoft Poland is seeking a skilled Data Engineer to enhance an internal LLM-powered assistant. You will focus on improving data ingestion, retrieval performance, and ensuring operational excellence. The ideal candidate has at least 8 years in Data Science and 5 in Machine...Senior
- ...Join the Data Engineering team to contribute to the ongoing maintenance and improvement of an internal LLM-powered assistant that uses hosted LLM APIs and internal knowledge sources, with a focus on reliability, retrieval quality, and operational excellence. Maintain...SeniorContract work
- ...SoftwareOne is looking for an AI Data Engineer to join our innovative AI team. This role involves designing and building data pipelines for AI solutions and collaborating with cross-functional teams. The ideal candidate will have a proven background in data engineering...SuggestedRemote work
- ...Practical Information Location: Poland | Reports to: AI Data Science Lead | Work Arrangement: Remote / Hybrid | Contract type: Full-time... ...and machine learning for our internal employees. As an AI Data Engineer in our AI Team, you will play a critical role in shaping the...SuggestedFull timeContract workWork at officeLocal areaRemote workFlexible hours
- ...SoftServe is looking for an experienced Data Engineer to develop scalable cloud-based data solutions and processing pipelines on AWS. In this role, you will design and maintain data workflows while collaborating with stakeholders to deliver effective solutions. The ideal...Senior
$40 per hour
...Mindrift is seeking a Senior Python Data Scraping Engineer for the Tendem project, focusing on specialized data scraping workflows within an AI-driven system. This part-time remote role requires at least 5 years of experience in web scraping and automation, demanding a...SeniorHourly payPart timeRemote work- ...Exadel open positions is looking for a Senior Data Engineer to join their team in the Town of Poland, New York. In this role, you'll design and optimize ETL/ELT pipelines and collaborate with cross-functional teams to develop impactful data solutions. The ideal candidate...SeniorWork at officeRemote work
- ...Exadel open positions is looking for an experienced Data Engineer located in the Town of Poland, New York. The ideal candidate will have over 5 years of experience in data engineering, strong skills in Databricks and SQL, and a solid understanding of data warehousing principles...SeniorWork at officeRemote work
- ...Senior Data Engineer Proxify is a platform that connects top developers worldwide to remote full‑time opportunities. The Role We are looking for a Senior Data Engineer specializing in modern, cloud‑native data platforms, with a strong focus on Amazon Web Services (AWS...SeniorFull timeRemote workWorldwideFlexible hours
- ...We are looking for a motivated Senior Data Engineer (ADB + Python) who is willing to dive into the new project with a modern stack. If you’re... ...produce meaningful results, please apply! Why Join Exadel We’re an AI-first global tech company with 25+ years of engineering...SeniorContract workWork at officeRemote work
- ...A leading data-focused company is seeking a Senior Data Engineer to work remotely. The candidate will be responsible for migrating and optimizing data models, fixing SQL syntax differences, and collaborating with teams to ensure successful data integration. Required qualifications...SeniorRemote work
- ...Dotlinkers IT recruitment is seeking a Senior Big Data Engineer who will contribute to building a next-generation data platform and services. The successful candidate will possess over 4 years of software development experience and proficiency in Java, Scala, or Python...SeniorRemote workFlexible hours
- ...Exadel is searching for a skilled Data Engineer to design and implement scalable data solutions, utilizing technologies like Databricks and Azure Synapse. The role involves optimizing data systems for analytics and BI, collaborating with cross-functional teams to deliver...SeniorWork at officeRemote workFlexible hours
- ...A software development company in New York is looking for a Data Engineer with 4+ years of experience in Data Engineering and SQL. The role involves developing databases, designing solutions, and collaborating with various stakeholders. Proficiency in Python and Databricks...Senior
- ...Grape Up is seeking an experienced Data Engineer to implement scalable architectures and manage high volumes of simulation data. The ideal candidate will have a Master’s degree in a related field and over 6 years of experience in Data Engineering, specifically with AWS...Senior
- ...A global IT consulting company is seeking a Senior Data Engineer to develop innovative data solutions for international clients. The role requires a minimum of 7 years of experience and expertise in data processing systems and tools such as Python, SQL, and cloud services...SeniorRemote work
- ...We are seeking a highly skilled and experienced Senior Data Engineer with solid expertise in AWS, Azure & GCP, along with proficiency in modern data transformation tools such as dbt and Databricks. Tasks In the role of Senior Data Engineer, you will be entrusted with...SeniorFlexible hours
- ...Grape Up, we transform businesses by unlocking the potential of AI and data through innovative software solutions. We partner with... ...following industry best practices Collaborate effectively with data engineering team members while partnering closely with analytics and data...SeniorFlexible hours
- ...Position: Senior Big Data Engineer Working model: Remote or hybrid Form of employment: contract of employment Join our client which makes software... ...issues during litigation and internal investigations. The AI-powered communication surveillance product proactively detects...SeniorContract workRemote workHome office
- ...we're accelerating the transformation of global data landscapes. We're seeking highly experienced Senior Data Engineers to help deliver a robust and scalable data architecture... ...about building end-to-end pipelines, enabling AI/BI solutions, and thriving in fast-paced, high-...Senior
- ...functionality analysis and development, maintenance and rework of existing web applications. Requirements 3+ years of experience as a Data Engineer Knowledge and experience with: Python, SQL, Azure Synapse, Azure One Lake, Data Factory, and Spark jobs Experience with: NoSQL...SeniorRelocationFlexible hours
- ...Akamai is seeking a Senior Security Engineer to protect AI inference environments from emerging threats. In this role, you will define and implement security practices across the AI inference stack, focusing on runtime protection and threat modeling. Successful candidates...SeniorRemote work
- ...Simple Life is the #1 AI-powered health coaching app for adults who want to lose weight and enjoy a healthier lifestyle—without... ...to—new healthy habits. To learn more, visit simple.life. Senior Data Engineer Push the pace of innovation and build a future of a healthier...SeniorWork at officeRemote workFlexible hours
- ...A leading engineering company in Poland is seeking a Senior Artificial Intelligence/Machine Learning Engineer to join their innovative team. This role involves leading the ML infrastructure strategy, architecting scalable pipelines, and mentoring engineers. You will work...SeniorRemote workFlexible hours
- ...We are looking for a Senior Data Engineer to join our agile team and help design, develop, and evolve the strategic integration backbone between Dealstores, Operations, and Regulatory systems within our Digital Operations Stream. In this role, you will drive solution...SeniorWork at officeWorldwideFlexible hours3 days per week
- ...SQL BigQuery GCP Data Fusion Data modeling Inmon Data Vault API Data JSON XML CSV REST SOAP Git CI/CD Angielski Data Major 3+ years of expert-level SQL & BigQuery hands-on experience (complex optimization, cost control, data integrity). 3+ years of experience in ELT/ETL...SeniorImmediate start
- ...EPAM Systems is seeking a Senior Data Engineer to enhance their Digital Operations Stream. Responsibilities include managing solution architecture and collaborating with agile teams to ensure robust trade flow systems. The ideal candidate should possess a strong Java...SeniorWork at office
- ...skilled professional in Town of Poland, NY to design and implement advanced computer vision solutions. You will collaborate with a team of AI experts, focusing on image generation and multimodal intelligence, contributing to impactful projects for world-leading clients. The...SeniorFlexible hours
- ...TL;DR: Senior Software Engineer specializing in TypeScript while using AI daily to build browser automations, seeking a fully remote role within ±3 hours of CET (Berlin). You thrive on solving real problems, are eager to grow, and want to work closely with a small, tight...SeniorFull timeRemote workWorldwideFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI Data Engineer/ Data Scientist. Be the first to apply!

