Junior Data Engineer (Faculty Specialist)

The University of Maryland
College Park, United States of America
yesterday

Role details

Contract type: Internship / Graduate position
Employment type: Full-time (> 32 hours)
Working hours: Shift work
Languages: English
Experience level: Junior
Compensation: $80K

Job location

Remote
College Park, United States of America

Tech stack

Java
Agile Methodologies
Amazon Web Services (AWS)
Data analysis
Confluence
JIRA
Bash
System Configuration
Data Infrastructure
ETL
Relational Databases
Database Servers
Linux
Failover
Design of User Interfaces
Hadoop
Python
PostgreSQL
Performance Tuning
PostGIS
Red Hat Enterprise Linux - RHEL
Data Streaming
Backup and Restore
Spark
Git
Information Technology
Cassandra
Data Analytics
Bitbucket
Data Management
Network Server

Job description

The A. James Clark School of Engineering at the University of Maryland serves as the catalyst for high-quality research, innovation, and learning, delivering on a promise that all graduates will leave ready to impact the Grand Challenges (e.g., energy, environment, security, and human health) of the 21st century. The Clark School is dedicated to leading and transforming the engineering discipline and profession, to accelerating entrepreneurship, and to transforming research and learning activities into innovations that benefit millions.

The Center for Advanced Transportation Technology (CATT) Laboratory is the industry leader for transportation information analysis, visualization, and user interface design. We provide cutting-edge analytics products and an integrated suite of situational awareness tools for transportation practitioners. These products and services are rapidly changing the way governments operate and make decisions. You can learn more about our products at https://ritis.org/.

We receive hundreds of gigabytes of transportation data daily, making our petabytes of archived data likely the largest collection of traffic data in the world. Our clients use our software to monitor real-time operations and analyze historical data to generate valuable insights. Our work saves taxpayers money, improves the environment, and saves lives!

We're as passionate about transportation as we are about building great software. We care about building usable, stable, and secure software to analyze massive amounts of data. We use cutting-edge tech to build and maintain our software. We have a mature development process and use industry best practices to build the best software possible. Our team is composed of application developers, analysts, UX designers, data scientists, IT, quality assurance specialists, and customer support operating in an Agile environment.

Our office is in College Park near the University of Maryland, easily accessible by DC Metro, MARC train, bus, car, and bike. Local employees are welcome to work in our office, or other locations, with a flexible schedule around our core hours. We also have many employees who are fully remote and work from different states. UMD requires all employees to live in the US, and we periodically bring remote employees to work with colleagues on-site.

We believe varied perspectives build better products, are proud to have a diverse team, and encourage people of all backgrounds to apply.

When you join our team, you will work to define, document, and test a wide variety of transportation data analytics and operations applications. You will learn new skills and stay current with industry best practices and emerging technologies.

The CATT Lab is seeking an early-career Data Engineer to join our Data Platform team. This individual will work to design, develop, and support over 40 relational database instances and distributed Cassandra and Hadoop instances. They will apply their knowledge, skills, and experience to create stable, secure, and reliable data platforms supporting custom data analytics applications. They will learn new skills and stay current with industry best practices and emerging technologies. If you're looking for complex data problems in the GIS domain to solve collaboratively with a brilliant research and development team, please apply!

Can be 100% remote (US-based only)

Physical Demands: Sedentary work performed in an office environment. Regularly required to communicate and exchange information and to use technology/devices. Position can be 100% remote (US-based only).

Requirements

  • B.S. degree in Computer Science or a related field
  • Relevant academic or internship experience with design, development, and administration of data platform solutions
  • Experience writing ETL scripts and standalone applications in Python and/or Java
  • Expertise writing and tuning SQL queries
  • Experience deploying, configuring, maintaining, and upgrading production-level database servers and services

Preferences:

  • Experience administering PostgreSQL servers, performance tuning, streaming replication, and PgBouncer failover configuration
  • Experience with non-relational data storage and processing solutions like Cassandra, Hadoop, Spark, Iceberg, Sedona, etc.
  • Experience working with geospatial datasets, using geospatial extensions (such as PostGIS) and querying features
  • Experience with Linux (RHEL), including writing robust Bash scripts
  • Experience architecting and implementing backup and recovery strategies
  • Experience with Git, Jira, Confluence, and Bitbucket
  • Experience with cloud platforms including AWS and GCP

Benefits & conditions

  • Flexible schedule