Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Engineer

North Eastern Services

About Fusemachines Founded in 2013, Fusemachines is a global provider of enterprise AI products and services, on a mission to democratize AI. Leveraging proprietary AI Studio and AI Engines, the company helps drive the clients’ AI Enterprise Transformation, regardless of where they are in their Digital AI journeys. With offices in North America, Asia, and Latin America, Fusemachines provides a suite of enterprise AI offerings and specialty services that allow organizations of any size to implement and scale AI. Fusemachines serves companies in industries such as retail, manufacturing, and government. Fusemachines continues to actively pursue the mission of democratizing AI for the masses by providing high-quality AI education in underserved communities and helping organizations achieve their full potential with AI. Immigration Sponsorship Policy This position is not eligible for employment visa sponsorship or transfer sponsorship now or in the future. Direct Company Sponsorship: Such as H-1B, J-1, or TN visas Employer of Record: Listing Fusemachines as the immigration employer on any government documentation Written Documentation: Providing letters or other support for any work authorization (e.g., OPT, STEM OPT, CPT) About The Role This is a remote full-time consulting position responsible for designing, building, and maintaining the infrastructure required for data integration, storage, processing, and analytics (BI, visualization and Advanced Analytics). We are looking for a skilled Senior Data Engineer with a strong background in Python, SQL, PySpark, Azure, Databricks, Synapse, Azure Data Lake, DevOps and cloud-based large scale data applications with a passion for data quality, performance and cost optimization. The ideal candidate will develop in an Agile environment, contributing to the architecture, design, and implementation of Data products, including migration from Synapse to Azure Data Lake. This role involves hands‑on coding, mentoring junior staff and collaboration with multi‑disciplined teams to achieve project objectives. Qualification & Experience Must have a full‑time Bachelor's degree in Computer Science or similar At least 3 years of experience as a data engineer with strong expertise in Databricks, Azure, DevOps, or other hyperscalers 3+ years of experience with Azure DevOps, GitHub Proven experience delivering large‑scale projects and products for Data and Analytics, as a data engineer, including migrations Following certifications: Databricks Certified Associate Developer for Apache Spark Databricks Certified Data Engineer Associate Microsoft Certified: Azure Fundamentals Microsoft Certified: Azure Data Engineer Associate Microsoft Exam: Designing and Implementing Microsoft DevOps Solutions (nice to have) Required Skills/Competencies Strong programming skills in one or more languages such as Python (must have), Scala, and proficiency in writing efficient and optimised code for data integration, migration, storage, processing and manipulation Strong understanding and experience with SQL and writing advanced SQL queries Thorough understanding of big data principles, techniques, and best practices Strong experience with scalable and distributed Data Processing Technologies such as Spark/PySpark (must have: experience with Azure Databricks), DBT and Kafka, to be able to handle large volumes of data Solid Databricks development experience with significant Python, PySpark, Spark SQL, Pandas, NumPy in Azure environment Strong experience in designing and implementing efficient ELT/ETL processes in Azure and Databricks and using open source solutions being able to develop custom integration solutions as needed Skilled in Data Integration from different sources such as APIs, databases, flat files, event streaming Expertise in data cleansing, transformation, and validation Proficiency with Relational Databases (Oracle, SQL Server, MySQL, Postgres, or similar) and NonSQL Databases (MongoDB or Table) Good understanding of Data Modeling and Database Design Principles. Being able to design and implement efficient database schemas that meet the requirements of the data architecture to support data solutions Strong experience in designing and implementing Data Warehousing, data lake and data lake house, solutions in Azure and Databricks Good experience with Delta Lake, Unity Catalog, Delta Sharing, Delta Live Tables (DLT) Strong understanding of the software development lifecycle (SDLC), especially Agile methodologies Strong knowledge of SDLC tools and technologies Azure DevOps and GitHub, including project management software (Jira, Azure Boards or similar), source code management (GitHub, Azure Repos or similar), CI/CD system (GitHub actions, Azure Pipelines, Jenkins or similar) and binary repository manager (Azure Artifacts or similar) Strong understanding of DevOps principles, including continuous integration, continuous delivery (CI/CD), infrastructure as code (IaC – Terraform, ARM including hands‑on experience), configuration management, automated testing, performance tuning and cost management and optimisation Strong knowledge in cloud computing specifically in Microsoft Azure services related to data and analytics, such as Azure Data Factory, Azure Databricks, Azure Synapse Analytics, Azure Data Lake, Azure Stream Analytics, SQL Server, Azure Blob Storage, Azure Data Lake Storage, Azure SQL Database, etc Experience in Orchestration using technologies like Databricks workflows and Apache Airflow Strong knowledge of data structures and algorithms and good software engineering practices Proven experience migrating from Azure Synapse to Azure Data Lake, or other technologies Strong analytical skills to identify and address technical issues, performance bottlenecks, and system failures Proficiency in debugging and troubleshooting issues in complex data and analytics environments and pipelines Good understanding of Data Quality and Governance, including implementation of data quality checks and monitoring processes to ensure that data is accurate, complete, and consistent Experience with BI solutions including PowerBI is a plus Strong written and verbal communication skills to collaborate and articulate complex situations concisely with cross‑functional teams, including business users, data architects, DevOps engineers, data analysts, data scientists, developers, and operations teams Ability to document processes, procedures, and deployment configurations Understanding of security practices, including network security groups, Azure Active Directory, encryption, and compliance standards Ability to implement security controls and best practices within data and analytics solutions, including proficient knowledge and working experience on various cloud security vulnerabilities and ways to mitigate them Self‑motivated with the ability to work well in a team, and experienced in mentoring and coaching different members of the team A willingness to stay updated with the latest services, Data Engineering trends, and best practices in the field Comfortable with picking up new technologies independently and working in a rapidly changing environment with ambiguous requirements Care about architecture, observability, testing, and building reliable infrastructure and data pipelines Responsibilities Architect, design, develop, test and maintain high-performance, large-scale, complex data architectures, which support data integration (batch and real‑time, ETL and ELT patterns from heterogeneous data systems: APIs and platforms), storage (data lakes, warehouses, data lake houses, etc), processing, orchestration and infrastructure. Ensuring the scalability, reliability, and performance of data systems, focusing on Databricks and Azure Contribute to detailed design, architectural discussions, and customer requirements sessions Actively participate in the design, development, and testing of big data products Construct and fine‑tune Apache Spark jobs and clusters within the Databricks platform Migrate out of Azure Synapse to Azure Data Lake or other technologies Assess best practices and design schemas that match business needs for delivering a modern analytics solution (descriptive, diagnostic, predictive, prescriptive) Design and implement data models and schemas that support efficient data processing and analytics Design and develop clear, maintainable code with automated testing using Pytest, unittest, integration tests, performance tests, regression tests, etc Collaborating with cross‑functional teams and Product, Engineering, Data Scientists and Analysts to understand data requirements and develop data solutions, including reusable components meeting product deliverables Evaluating and implementing new technologies and tools to improve data integration, data processing, storage and analysis Evaluate, design, implement and maintain data governance solutions: cataloging, lineage, data quality and data governance frameworks that are suitable for a modern analytics solution, considering industry‑standard best practices and patterns Continuously monitor and fine‑tune workloads and clusters to achieve optimal performance Provide guidance and mentorship to junior team members, sharing knowledge and best practices Maintain clear and comprehensive documentation of the solutions, configurations, and best practices implemented Promote and enforce best practices in data engineering, data governance, and data quality Ensure data quality and accuracy Design, Implement and maintain data security and privacy measures Be an active member of an Agile team, participating in all ceremonies and continuous improvement activities, being able to work independently as well as collaboratively Fusemachines is an Equal Opportunities Employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, colour, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local law. #J-18808-Ljbffr North Eastern Services

Vacancy posted 13 hours ago
Similar jobs that could be interesting for youBased on the Data Engineer in New York, NY vacancy
  • $140k - $160k

     ...Fitch Group is currently seeking a Associate Director/Lead Data Engineer based out of our Chicago office. As a leading, global financial information services provider, Fitch Group delivers vital credit and risk insights, robust data, and dynamic tools to champion more... 
    Suggested
    Temporary work
    Work at office
    Immediate start
    Worldwide
    Shift work
    2 days per week

    Fitch Ratings

    New York, NY
    4 days ago
  • $220k

     ...Specialist Recruiter | Databricks Data Engineer Recruitment | Connecting Top Talent with Leading US Opportunities Job Title: Lead Data Engineer Location: Remote Employment Type: Full-time Compensation: Up to $220k + benefits About the Role We’re hiring experienced Lead... 
    Suggested
    Full time
    Remote work
    Flexible hours

    KDR Talent Solutions USA

    New York, NY
    3 days ago
  • $174k - $230k

     ...The Job We are looking for our first in-house Data Engineer to own and evolve our core data infrastructure. This is an early and high-impact role. As the data engineering function grows under Engineering, you’ll have a real voice in shaping how it’s built - the processes... 
    Suggested
    For contractors
    Remote work

    Atticus Inc

    New York, NY
    3 days ago
  • $125k - $140k

     ...the vision and achieving the goals of our three core lines of business: Indexing, Digital Distribution, and Data & Analytics. Made up of developers, data engineers, designers, and project managers, the platform team is the engine that drives forward the technical... 
    Suggested

    VettaFi

    New York, NY
    4 days ago
  • $125k - $140k

     ...achieving the goals of our three core lines of business: Indexing, Digital Distribution, and Data & Analytics. This role focuses on driving technical excellence in data engineering, statistical techniques, and domain‑specific applications within VettaFi’s digital... 
    Suggested

    TMX Group

    New York, NY
    13 hours ago
  • $135k - $150k

     ...New York, United States | Posted on 08/30/2023 The position of Lead Data Engineer is open in New YorkCity, New York, within the Insurance industry. The role offers flexiblework-from-home days while requiring occasional on-site presence for clientmeetings. The Lead Data... 

    Career-Mover

    New York, NY
    4 days ago
  •  ...Must have skills: Azure Cloud, MS Fabric, ADF, Azure SQL, Synapse, DBT Preferred skills: Data Mesh implementation, Insurance domain Detailed Job Description Designs, builds, and maintains scalable data pipelines and architectures, leading a team to deliver robust data... 

    TechDigital Group

    New York, NY
    4 days ago
  • $100k - $200k

     ...Who We Are Galaxy is a global leader in digital assets and data center infrastructure, delivering solutions that accelerate progress...  .... Build Dream Teams. Who You Are You’re a senior, hands‑on data engineer with 8+ years of experience designing, building, and operating production... 
    Local area
    Flexible hours

    Galaxy USA

    New York, NY
    13 hours ago
  • $140k - $150k

     ...Piper Companies is seeking a Lead Data Engineer to support a company focused on enterprise digital transformation and advanced cloud‑based data modernization initiatives. This position is hybrid in Ft. Washington, PA . The Sr Data Engineer will provide strategic direction... 

    Piper Companies

    New York, NY
    3 days ago
  • $70 - $85 per hour

     ...Lead Data Engineer Location: Remote Duration: 6+ months Compensation: $70.00 - 85.00/hr Work Requirements: US Citizen, GC Holders or Authorized to Work in the U.S. Summary We are looking for a Lead Data Engineer who excels in data modeling and can build end-to-end data... 
    Contract work
    Local area
    Remote work
    Flexible hours

    INSPYR Solutions

    New York, NY
    4 days ago
  •  ...Must have: 10+ years of relevant experience in Data Engineering and delivery. 10+ years of relevant work experience in Big Data Concepts. Worked on cloud implementations. Strong experience with SQL, python and PySpark Good understanding of Data ingestion and data processing... 
    Work experience placement

    Inizio Partners Corp

    Jersey City, NJ
    3 days ago
  •  ...small businesses with property and casualty insurance and risk engineering services; affluent and high net worth individuals with substantial...  ...Overview: We are seeking a highly skilled and experienced lead data engineer to join our dynamic team. The ideal candidate will be... 
    Work experience placement
    Local area

    Chubb

    Jersey City, NJ
    13 hours ago
  • $94.43k - $202.75k

     ...services firms. Spark your curiosity and ignite your career at The Lighthouse. KPMG is currently seeking a Senior Associate, Data Engineer for our  Consulting practice. Responsibilities: Assist with technical design and development activities and lead a small workstream... 
    Local area
    Visa sponsorship

    KPMG

    New York, NY
    2 days ago
  •  ...A technology solutions firm is looking for a Lead Data Engineer to work remotely on building end-to-end data solutions. The ideal candidate will design and maintain scalable data pipelines and collaborate with AI/ML teams. The position requires strong skills in Databricks... 
    Remote work

    INSPYR Solutions

    New York, NY
    4 days ago
  • $229.9k - $262.4k

    ## Senior Lead Data Engineer (Bank Tech)Applylocations: Wilmington, DE: McLean, VAtime type: Full timeposted on: Posted Todayjob requisition id: R243479Senior Lead Data Engineer (Bank Tech)Do you love building and pioneering in the technology space? Do you enjoy solving... 
    Full time
    Part time
    Internship
    Local area

    Capital One

    New York, NY
    13 hours ago
  • $160k - $200k

     ...story while AI agents handle operationally intensive work such as data management, analytics, campaign generation, measurement, and...  ...We have two agentic systems built with OpenAI: an Agentic Data Engineer that unifies and standardizes a brand's first party data in hours... 

    Minerva Inc

    New York, NY
    2 days ago
  •  ...Join Our Team At Anteriad and innovate the way B2B marketers make data-driven business decisions. About Anteriad We are not just...  ...play a vital role within a growing part of Anteriad. As our Data Engineer, you will have an important role in receiving, organizing, and loading... 
    Work at office
    Work from home
    Flexible hours

    Anteriad, LLC

    New York, NY
    4 days ago
  •  ...the analysis of biological, clinical, agronomical, and marketing data, driving innovation in life sciences, food, nutrition, and...  ...impactful solutions. Role Description We are currently seeking a Data Engineer with a critical thinking and problem solving mindset to join our... 
    Flexible hours

    Deus ex Machina

    Brooklyn, NY
    4 days ago
  •  ...and challenging projects supporting the US Navy- Serco has a great opportunity for you! Serco has an exciting opportunity for a Data Engineer/Scientist to support U.S. Navy’s Team Submarine Program Offices at the Washington Navy Yard in Washington, DC! This position will... 
    Contract work
    Internship
    Work at office
    Local area
    Flexible hours

    Serco

    New York, NY
    4 days ago
  • $140k - $175k

     ...powered Proactive Documentation platform that reviews all patient data in the EHR to recommend diagnoses and surface clinical evidence...  ...reliably deliver data to downstream consumers Partner with engineering teams to identify and resolve data quality issues, helping ensure... 
    Work at office
    Local area
    Home office
    Visa sponsorship
    Relocation package

    REGARD

    New York, NY
    1 day ago
  • $125k - $163.8k

     ...The Position Our roster has an opening with your name on it. We are looking for a Data Engineer to join our growing data platform team and take end-to-end ownership of designing, building, and scaling the foundational data infrastructure that powers analytics, machine... 
    Temporary work
    Local area

    FanDuel

    New York, NY
    13 hours ago
  •  ...Transformeer data naar waardevolle inzichten met Azure en moderne BI-tools. Werk aan uitdagende projecten en groei continu in jouw...  ...betekenisvol. Klaar om de leiding te nemen in de wereld van data-engineering? Jouw avontuur begint hier bij Centric! Maak werk van deze... 

    Centric

    New York, NY
    13 hours ago
  •  ...Primary . The team is headquartered in New York and brings deep expertise in finance and AI. About the role You’ll be the first Data Engineer at Tabs, building the core data infrastructure that powers our internal KPIs, customer insights, and our AI systems. Your initial... 
    Full time
    Contract work
    Work at office

    TABS inc.

    New York, NY
    13 hours ago
  •  ...Texture is revolutionizing the energy sector by creating a unified data network. Backed by experienced founders and ample funding, we'...  ...and availability for complex analysis by data scientists, engineers, and other stakeholders. Who we are looking for You have expertise... 

    Texture

    New York, NY
    13 hours ago
  • $150k

     ...AVP, Data Engineer OTC Markets Group - New York, NY - Full Time OTC Markets Group Inc., operator of premier US financial marketplaces, is seeking a Data Engineer to join our IT Infrastructure team. We’re looking for a talented and driven Data Engineer to help power our... 
    Full time
    Temporary work
    Work at office
    Local area
    Flexible hours

    OTC Markets

    New York, NY
    13 hours ago
  • $190k - $250k

     ...tool. Hebbia is the competitive advantage that drives performance, alpha, and market leadership. The Role We are seeking our first Data Engineer, someone who can refine our data infrastructure, drive best practices for building data pipelines, and collaborate closely with... 

    Hebbia

    New York, NY
    4 days ago
  •  ...About the Opportunity DMI, LLC is seeking a Data Engineer to join us. Duties and Responsibilities: Builds and modernizes data pipelines integrations to improve processing efficiency across on-prem, hybrid, and multi-cloud TSA environments supports testing, documentation... 
    Remote work

    Digital Criterion

    New York, NY
    3 days ago
  •  ...As a Data Engineer at Cape, your mission is to build the analytics and reporting infrastructure that lets us understand our product and business without compromising the privacy guarantees we make to our customers. Privacy isn't a constraint layered on top of our data... 

    Cape

    New York, NY
    3 days ago
  • $150k

     ...Title: Data Engineer Location: New York, NY – Hybrid 3x per week Client: Leading US financial markets operator and data services company Role: A well‑established financial markets company is looking for a hands‑on Data Engineer to join their IT Infrastructure team. This... 

    Talener

    New York, NY
    3 days ago
  •  ...Overview Lead a team in research and implementation of complex data engineering projects, including multiple data models, maps, and workflows. Act as primary contact for the team, managing prioritization and execution of tasks. Serve as Subject Matter Expert (SME), deeply... 
    Contract work
    Remote work

    Largeton Group

    New York, NY
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Engineer. Be the first to apply!