Data Engineer
SGA
Software Guidance & Assistance, Inc. (SGA) is searching for a Data Engineer for a contract assignment with one of our premier regulatory clients in Rockville, MD or Tysons, VA. The Data Engineer works with moderate supervision across two equally weighted domains: (1) large-scale data pipeline development processing market events in a cloud environment, and (2) design and development of agentic AI systems, including LLM-powered regulatory data assistants, MCP servers, and agent harness architectures. This position contributes to overall product quality throughout the software development lifecycle.
Responsibilities:
- Build and maintain ETL/ELT pipelines using Apache Spark, Hive, and Trino across S3-based data lake environments
- Develop and optimize SQL for large-scale surveillance datasets including window functions, multi-table joins, and complex aggregations
- Build and engineer big data systems (EMR-on-EC2, EMR-on-EKS) and develop solutions on analytical platforms (SageMaker, Domino, Dataiku)
- Participate in data quality monitoring, anomaly detection, and production incident investigation
- Develop AI agent systems using AWS Bedrock and agent frameworks (Strands Agents SDK, LangChain/LangGraph, or equivalent)
- Build agent harness architectures combining LLM reasoning with deterministic execution - skill/RAG-based SQL generation and structured output validation
- Implement agent memory, context management, and tool integration (MCP servers, API connectors, data catalog lookups) across the data lake
- Build evaluation frameworks for agent accuracy - paraphrase robustness, routing precision, and structural consistency
- Stay informed of advances in LLM frameworks (LangGraph, Google ADK, AWS Strands) and emerging AI capabilities
- Write clean, well-tested code; contribute to CI/CD Jenkins pipelines and infrastructure-as-code on AWS
- Ensure secure handling of RCI and sensitive regulatory data across both data pipelines and agent outputs, including auditable execution traces
- Adhere to FINRA and team standards for secure development practices and technology policies
- Partner across teams, communicate technical information at the appropriate level, and maintain documentation on Confluence/Wiki
- Actively learn from senior team members; contribute to process improvement in line with FINRA's values of collaboration, expertise, innovation, and responsibility
Required Skills:
Data Engineering & Big Data Technologies
- Experience building data pipelines using Apache Spark (PySpark preferred) and SQL
- Experience with SQL query engines (Hive, Trino/Presto, or similar) and cloud data platforms (AWS S3, EMR, Lambda)
- Understanding of common issues like data skew and strategies to mitigate it, working with large data volumes, and troubleshooting job failures due to resource limitations, bad data, and scalability challenges
- Real-world experience with debugging and mitigation strategies
Generative AI & Agentic Systems
- Practical experience building LLM-powered agent systems that use tools and produce structured outputs (not just chatbot interfaces)
- Hands-on experience with at least one agent framework: LangChain, LangGraph, AWS Strands, or equivalent
- Working knowledge of prompt engineering, RAG architectures, and context/memory management
- Experience with foundation model APIs (Anthropic Claude, Amazon Nova, OpenAI, or similar)
- Memory Architecture: Understanding of agent memory tiers - working memory, episodic memory, semantic memory - and strategies for context persistence, pruning, and retrieval across sessions
- Agent Harness Design: Familiarity with harness patterns that wrap LLM reasoning with deterministic guardrails, tool routing, verification loops, and graceful degradation
AI Tool Proficiency
- Hands-on experience with AI development tools (GitHub Copilot, Q Developer, ChatGPT, Claude, etc.)
- Experience with spec-driven development - using structured specifications to guide AI code generation, review, and validation
- Ability to leverage AI pair programming for code suggestions, debugging, refactoring, and automated test generation
Cloud Technologies
- Experience with AWS services like S3, EMR, EMR on EKS, Lambda, Bedrock, Step Functions, etc.
- Hands-on experience using S3 with Spark (e.g., dealing with file formats, consistency issues)
- Familiarity with AWS Bedrock for foundation model invocation, knowledge bases, guardrails, and agent orchestration
- Exposure to Google Cloud Vertex AI (model garden, grounding, agent builder) or equivalent managed AI platforms
- Familiarity with AWS monitoring and logging tools (CloudWatch, CloudTrail) for production workloads
Programming - Python
- Proficiency in Python for data engineering and automation
- Ability to write clean, modular, and performant code
- Experience with functional programming concepts (e.g., immutability, higher-order functions)
- Strong understanding of collections, concurrency, and memory management
SQL Skills (Window Functions, Joins, Complex Queries)
- Proficiency with SQL window functions, multi-table joins, and aggregations
- Ability to write and optimize complex SQL queries
- Experience handling edge cases like NULLs, duplicates, and ordering
Good to Have
- AWS Bedrock AgentCore (memory, identity, tool gateway)
- Model Context Protocol (MCP) server development and integration
- Agent evaluation harnesses and agentic patterns (draft-verification, compile-style generation)
- Fine-tuning foundation models for domain-specific tasks (LoRA, PEFT, or managed fine-tuning via Bedrock/Vertex AI)
- Local model execution with Ollama, vLLM, or similar for development and experimentation
- Vector databases (FAISS, Pinecone, OpenSearch)
- Docker, Kubernetes, and Amazon EKS for containerized workloads
- Infrastructure as Code (Terraform, CloudFormation)
- Experience with CI/CD pipelines (Jenkins, GitLab CI, GitHub Actions, ArgoCD)
- Experience with monitoring and observability tools (Prometheus, Grafana, ELK stack)
- AWS certifications (AI Practitioner, Solutions Architect) or Kubernetes certifications (CKA/CKAD)
Education / Experience Requirements
- Bachelor's degree in Computer Science, Data Science, Information Systems, or related discipline with at least two (2) years of related experience; or equivalent training and/or work experience; past Financial Services industry experience preferred
- Demonstrated technical expertise in object-oriented and database technologies/concepts that resulted in deployment of enterprise-quality solutions
- Extensive knowledge of industry-leading software engineering approaches, including Test Automation, Build Automation, and Configuration Management frameworks
- Strong written and verbal technical communication skills
- Demonstrated ability to develop effective working relationships that improved the quality of work products
- Ability to maintain focus and develop proficiency in new skills rapidly
- Ability to work in a fast-paced environment


