Data Engineer
Role details
Job location
Tech stack
Job description
-
Build and maintain ingestion pipelines into Snowflake from a range of source systems (Dataverse, SQL Server, APIs, files) using Azure Data Factory Develop transformation logic in Snowflake following a medallion architecture (bronze silver
-
gold), using Coalesce for orchestrated SQL transformations
-
Write Python for orchestration, automation, custom connectors, and lightweight services
-
Partner with AI coding agents (GitHub Copilot, Claude Code, Cortex) as part of your daily workflow - you're expected to be effective at directing them, not just using autocomplete
-
Apply data governance and lineage standards through Atlan
-
Participate in code reviews, CI/CD via Azure DevOps, and on-call rotations for production pipelines
-
Contribute to internal documentation and runbooks
Requirements
Do you have experience in YAML?, We're optimizing for learning velocity over years of experience. If you have 0-2+ years and a strong foundation, you're in scope., * Python - comfortable writing clean, modular code; familiar with pandas, requests, basic OOP, virtualenvs/uv
- SQL - strong fundamentals (joins, window functions, CTEs, query optimization)
- Azure Data Factory - hands-on pipeline development, parameterization, triggers, linked services
- Azure services - working knowledge of Storage (Blob/ADLS), Key Vault, DevOps (pipelines, repos)
- Snowflake - has built or contributed to real warehouses; understands warehouses, roles, micro-partitions
- Source control - Git workflows, PRs, branching
AI / Agentic Skills (Required at conceptual level)
You don't need to have shipped LLM apps in production, but you should be conversant in:
- AI coding agents - used Copilot, Cursor, Claude Code, or equivalent for real work (not just demos)
- LLMs - understand prompting, context windows, tool use / function calling, structured outputs
- RAG - know what it is, when to use it, the basic architecture (embeddings, vector store, retrieval, generation)
- MCP (Model Context Protocol) - bonus if you've used it, but willingness to learn quickly is required; we use Atlan MCP and Snowflake MCP
Nice to Have
- Coalesce, dbt, or another transformation framework
- Experience with Dataverse / Dynamics 365 / Power Platform data sources
- Atlan or another data catalog/governance tool
- CI/CD experience with Azure DevOps YAML pipelines
- Exposure to vector databases (Pinecone, Weaviate, pgvector, Snowflake Cortex)
- Personal projects with LLMs, agents, or AI tooling - show us your GitHub
Mindset We're Hiring For
- Learning mindset. Tooling will change every 6 months. We need someone who's energized by that, not exhausted by it.
- Bias to ship. Pragmatic over perfect. Get it working, get it reviewed, iterate.
- AI-leveraged. You treat AI agents as force multipliers. You know when to delegate to them and when to think yourself.
- Ownership. When a pipeline breaks at 2am, you don't wait to be told.
- Clear written communicator. Async-friendly, document-as-you-go.
Education
- Bachelor's degree in CS, Engineering, Math, Stats, or related field - or equivalent practical experience. Bootcamp grads and self-taught engineers with solid portfolios welcome.
Logistics
- Location: Atlanta, GA - hybrid
- Experience: 2+ years preferred; fresh graduates with strong projects encouraged to apply
Benefits & conditions
Pulled from the full job description
- Health insurance
- Paid time off, Your base pay is dependent upon your skills, education, qualifications, professional experience, and location. In addition to base pay, some roles are eligible for variable compensation, commission, and/or annual bonus based on your individual performance and/or the company's performance. We also offer eligible employees health, wellbeing, retirement, and other financial benefits, paid time off, overtime pay for non-exempt employees, and robust learning and development programs. You will receive reimbursement of job-related expenses per the company policy and may receive employee perks and discounts.