0000001423.AI ENGINEER II.INFO TECH - DATA AND AI

County of Dallas

Dallas, United States of America

11 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Dallas, United States of America

Tech stack

API

Agile Methodologies

Artificial Intelligence

Amazon Web Services (AWS)

Data analysis

Azure

Big Data

Cloud Computing

Information Systems

Databases

Continuous Integration

Data as a Services

Data Validation

Data Dictionary

Information Engineering

Data Governance

Data Integration

Data Transformation

Data Profiling

Data Warehousing

Software Debugging

DevOps

Graph Database

Design of User Interfaces

Python

Machine Learning

Metadata Repositories

Microsoft SQL Server

Language Modeling

Next.js

SQL Databases

Data Streaming

Systems Integration

Enterprise Data Management

Parquet

Data Processing

Google Cloud Platform

Data Storage Technologies

React

Large Language Models

Snowflake

Multi-Agent Systems

Spark

IT Architecture

HybridCloud

GIT

Containerization

Data Lake

Kubernetes

Information Technology

Data Lineage

Data Analytics

Amazon Web Services (AWS)

Kafka

Data Management

Machine Learning Operations

Data Delivery

Api Gateway

Data Inconsistencies

Software Version Control

Data Pipelines

Docker

Databricks

Job description

Designs and maintains robust AI agents and data pipelines. Performs data orchestrations and supports enterprise AI and Data efforts. Works across departments to build scalable AI solutions that ensure reliable, secure, and high-quality data is available to business users, analysts, upstream and downstream applications. Responsible for the full lifecycle of AI Development - from selecting foundation models (FM's) to deploying scalable orchestration layers on hybrid cloud environments. Contributes to data labelling, MLOps integration, AI Observability, Language Models testing, and documentation in collaboration with AI analysts, AI Architects, data scientists, developers, and system owners., Designs, develops, and maintains scalable AI Agents and Orchestration workflows across structured and semi-structured data sources. Ensures consistent design and delivery of data and AI platforms supporting Data Engineering, Cloud, and AI centers of excellence. Integrates internal and external data sources with enterprise data platforms, lakes, or warehouses. Designs and develops multi-agent systems using frameworks like LangGraph, CrewAI, or Amazon Bedrock to automate complex enterprise reviews and workflows. Performs data profiling, cleansing, and standardization to improve data quality. Monitors data pipeline health and troubleshoots failures or anomalies. Documents AI architecture, APIs, AI Business rules, and data logic for internal users. Collaborates with DevOps or infrastructure teams to implement automated AI processing workflows. Collaborates with Enterprise Architecture teams to ensure AI solutions align with internal policies, vendor questionnaires, and ethical AI guidelines. Maintains data access controls, validation rules, and retention policies. Translates business and AI requirements into technical specifications and AI pipeline designs. Participates in Agile planning, backlog grooming, and technical design sessions. Develops data and AI flow diagrams, Machine learning models, and transformation logic. Supports dataset design and delivery for dashboards, reports, or self-service analytics. Collaborates with application owners to understand source system structures and data changes. Contributes to solution architecture decisions related to Language model performance, security, storage, and data delivery. Assists in scoping and estimating new data initiatives and enhancement requests. Identifies reuse opportunities for data components, tools, or models. Builds in validation and error-handling logic into data and AI pipelines to support reliability. Performs root cause analysis for data inconsistencies and recommends preventive actions. Contributes to and follows testing procedures for data validation, performance, and integrity. Implements version control, data lineage, and reproducibility practices. Identifies performance bottlenecks and refactor inefficient data processes. Recommends improvements to schema design, data granularity, and source-system integration. Maintains awareness of industry standards for data governance, security, and accessibility. Supports automation of routine data workflows and manual reporting processes. Works closely with analysts, data scientists, application developers, and stakeholders to deliver high-quality datasets. Coordinates with system owners and system administrators to manage source data access and schema changes. Supports QA and testing teams by validating expected output and data quality criteria. Participates in data and AI design reviews, standups, retrospectives, and sprint demos. Communicates technical limitations or trade-offs to business stakeholders in an understandable way. Partners with cybersecurity teams to ensure sensitive data is handled securely and in compliance with County policy. Continues building technical proficiency in cloud platforms, big data tools, and AI frameworks. Stays current with trends in data engineering, streaming pipelines, and ML Ops practices. Contributes to internal wikis, playbooks, and best practices documentation. Mentors junior data engineers or interns on development and testing practices. Participates in knowledge-sharing sessions, communities of practice, or hackathons. Proactively seeks opportunities for cross-training with related disciplines (e.g., AI, Big Data, DevOps and MLOps). Implements robust LLM engineering practices using tools like Langfuse or Weights & Biases for tracing, debugging, and evaluating model outputs. Tracks personal learning goals and reflects on performance improvement opportunities. Communicates progress, risks, and needs to project leads or data managers. Documents data sources, logic, and transformations in data dictionaries or metadata repositories. Supports stakeholder training or onboarding on new datasets and data services. Assists in writing user guides, technical diagrams, and documentation for AI Orchestrations and data pipelines. Participates in requirement gathering and feedback sessions with business users. Supports audit and compliance documentation as needed. Provides timely responses to questions or data requests from supported teams. Coordinates deployment of data updates with impacted teams or systems. Performs other duties as assigned.

Requirements

Do you have a valid Driver's License license?, Do you have experience in Version control?, Do you have a Master's degree?, Education, Experience and Training: Education and experience equivalent to a Bachelor's degree from an accredited college or university in Computer Science, Information Systems, Data Science, AI and Analytics, or in a job-related field of study. Master's degree preferred. Five (5) years of work-related experience in data engineering, data analytics, or AI/ML data processing. Certifications (Preferred): * Certifications in cloud architecture (Azure, AWS, GCP), data modeling, and governance tools. * Amazon Certified: AWS Data Engineer Associate * AWS Certified Data Analytics - Specialty * Snowflake or Databricks certification Special Requirements/Knowledge, Skills & Abilities: Must have a valid Texas Driver's License and good driving record. Will be required to provide a copy of 10-year driving history. Must maintain a good driving record and remain in compliance with Article II, Subdivision II of Chapter 90 of the Dallas County Code. "Individuals holding or considered for a position which has, or may have, access to criminal justice databases including the FBI Criminal Justice Information Systems, NCIC/TCIC and similar databases, must pass a national fingerprint-based records check prior to placement in such position and may be denied placement in such positions and/or access to such systems. Incumbents must also maintain the ability to pass the records check while in the position or until such time that the Commissioners Court and the County Civil Service Commission deem this position no longer has this requirement." * Excellent analytical and problem-solving abilities. * Strong communication and documentation skills. * Ability to work independently and collaboratively on technical projects. * Strong collaboration and communication skills. * Ability to work independently and mentor junior team members. * Knowledge of DevOps, CI/CD, and containerized applications (Docker, Kubernetes). * Ability to design and optimize scalable data workflows. * Knowledge of Sovereign Cloud requirements or GovCloud environments. * Knowledge of big data frameworks (Snowflake, Spark, Databricks, Vector Databases, Graph Databases). * Knowledge of data warehousing, data lakes, and data modeling best practices. * Skill in SQL, Rust, Go, Python, and/or Scala for data transformation. * Knowledge of data privacy, compliance regulations (HIPAA, GDPR, CJIS). * Skill in implementing AI within county/government policy frameworks. * Knowledge of Git, CI/CD pipelines, data catalogs, Containers (Kubernetes, Docker) and business intelligence tools. * Knowledge of cloud platforms (Azure, AWS, or GCP) and UI/UX (ReactJS/NextJS) including data storage technologies (e.g., SQL Server, Snowflake, Parquet, etc.). * Skill in Python, AWS Sage maker, Lang chain, Pydantic, Model Context Protocols, Amazon Bedrock, Vector Databases, RAGs and data integration tools (e.g., Jupiter Notebooks, API Gateways etc.). * Knowledge of streaming data technologies (Kafka, Kinesis, Pub/Sub). Physical/Environmental Requirements: Occasional travel to County sites. Ability to work in a fast-paced, evolving technology environment.

Role details

Job location

Tech stack

Job description

Requirements

Apply for this position

Good distractions

Moments

Videos View all