Data Engineer
SGA
Software Guidance & Assistance, Inc. (SGA) is searching for a Data Engineer for a contract assignment with one of our premier regulatory clients in Rockville, MD or Tysons, VA. The Data Engineer works with moderate supervision across two equally weighted domains: (1) development of large-scale data pipelines that process market events in a cloud environment, and (2) design and development of agentic AI systems, including LLM-powered regulatory data assistants, MCP servers, and agent harness architectures. This position contributes to overall product quality throughout the software development lifecycle.
Responsibilities:
- Build and maintain ETL/ELT pipelines using Apache Spark, Hive, and Trino across S3-based data lake environments
- Develop and optimize SQL for large-scale surveillance datasets including window functions, multi-table joins, and complex aggregations
- Build and engineer big data systems (EMR-on-EC2, EMR-on-EKS) and develop solutions on analytical platforms (SageMaker, Domino, Dataiku)
- Participate in data quality monitoring, anomaly detection, and production incident investigation
- Develop AI agent systems using AWS Bedrock and agent frameworks (Strands Agents SDK, LangChain/LangGraph, or equivalent)
- Build agent harness architectures that combine LLM reasoning with deterministic execution, such as skill/RAG-based SQL generation and structured output validation
- Implement agent memory, context management, and tool integration (MCP servers, API connectors, data catalog lookups) across the data lake
- Build evaluation frameworks for agent accuracy, covering paraphrase robustness, routing precision, and structural consistency
- Stay informed of advances in LLM frameworks (LangGraph, Google ADK, AWS Strands) and emerging AI capabilities
- Write clean, well-tested code; contribute to CI/CD Jenkins pipelines and infrastructure-as-code on AWS
- Ensure secure handling of RCI and sensitive regulatory data across both data pipelines and agent outputs, with auditable execution traces
- Adhere to FINRA and team standards for secure development practices and technology policies
- Partner across teams, communicate technical information at the appropriate level, and maintain documentation on Confluence/Wiki
- Actively learn from senior team members; contribute to process improvement in line with FINRA's values of collaboration, expertise, innovation, and responsibility
Required Skills:
Data Engineering & Big Data Technologies
- Experience building data pipelines using Apache Spark (PySpark preferred) and SQL
- Experience with SQL query engines (Hive, Trino/Presto, or similar) and cloud data platforms (AWS S3, EMR, Lambda)
- Understanding of common issues like data skew and strategies to mitigate it, working with large data volumes, and troubleshooting job failures due to resource limitations, bad data, and scalability challenges
- Real-world experience with debugging and mitigation strategies
Generative AI & Agentic Systems
- Practical experience building LLM-powered agent systems that use tools and produce structured outputs (not just chatbot interfaces)
- Hands-on experience with at least one agent framework: LangChain, LangGraph, AWS Strands, or equivalent
- Working knowledge of prompt engineering, RAG architectures, and context/memory management
- Experience with foundation model APIs (Anthropic Claude, Amazon Nova, OpenAI, or similar)
- Memory Architecture: Understanding of agent memory tiers (working, episodic, and semantic memory) and strategies for context persistence, pruning, and retrieval across sessions
- Agent Harness Design: Familiarity with harness patterns that wrap LLM reasoning with deterministic guardrails, tool routing, verification loops, and graceful degradation
AI Tool Proficiency
- Hands-on experience with AI development tools (GitHub Copilot, Q Developer, ChatGPT, Claude, etc.)
- Experience with spec-driven development - using structured specifications to guide AI code generation, review, and validation
- Ability to leverage AI pair programming for code suggestions, debugging, refactoring, and automated test generation
Cloud Technologies
- Experience with AWS services like S3, EMR, EMR on EKS, Lambda, Bedrock, Step Functions, etc.
- Hands-on experience using S3 with Spark (e.g., dealing with file formats, consistency issues)
- Familiarity with AWS Bedrock for foundation model invocation, knowledge bases, guardrails, and agent orchestration
- Exposure to Google Cloud Vertex AI (model garden, grounding, agent builder) or equivalent managed AI platforms
- Familiarity with AWS monitoring and logging tools (CloudWatch, CloudTrail) for production workloads
Programming - Python
- Proficiency in Python for data engineering and automation
- Ability to write clean, modular, and performant code
- Experience with functional programming concepts (e.g., immutability, higher-order functions)
- Strong understanding of collections, concurrency, and memory management
SQL Skills (Window Functions, Joins, Complex Queries)
- Proficiency with SQL window functions, multi-table joins, and aggregations
- Ability to write and optimize complex SQL queries
- Experience handling edge cases like NULLs, duplicates, and ordering
Good to Have
- AWS Bedrock AgentCore (memory, identity, tool gateway)
- Model Context Protocol (MCP) server development and integration
- Agent evaluation harnesses and agentic patterns (draft-verification, compile-style generation)
- Fine-tuning foundation models for domain-specific tasks (LoRA, PEFT, or managed fine-tuning via Bedrock/Vertex AI)
- Local model execution with Ollama, vLLM, or similar for development and experimentation
- Vector databases (FAISS, Pinecone, OpenSearch)
- Docker, Kubernetes, and Amazon EKS for containerized workloads
- Infrastructure as Code (Terraform, CloudFormation)
- Experience with CI/CD pipelines (Jenkins, GitLab CI, GitHub Actions, ArgoCD)
- Experience with monitoring and observability tools (Prometheus, Grafana, ELK stack)
- AWS certifications (AI Practitioner, Solutions Architect, or Kubernetes certifications like CKA/CKAD)
Education / Experience Requirements
- Bachelor's degree in Computer Science, Data Science, Information Systems, or related discipline with at least two (2) years of related experience; or equivalent training and/or work experience; past Financial Services industry experience preferred
- Demonstrated technical expertise in object-oriented and database technologies/concepts that resulted in deployment of enterprise-quality solutions
- Extensive knowledge of industry-leading software engineering approaches, including Test Automation, Build Automation, and Configuration Management frameworks
- Strong written and verbal technical communication skills
- Demonstrated ability to develop effective working relationships that improved the quality of work products
- Ability to maintain focus and develop proficiency in new skills rapidly
- Ability to work in a fast-paced environment

