System Engineer

SofTech, Inc.
3 days ago

Role details

Contract type
Permanent contract
Employment type
Part-time (≤ 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Remote

Tech stack

Artificial Intelligence
Systems Engineering
Bash
C++
Databases
Distributed Systems
Fault Tolerance
Python
Linux kernel
Software Engineering
SQL Databases
System Programming
High Performance Computing
Large Language Models
Perf (Linux)
Information Technology
REST
Go

Job description

The Infrastructure Systems Engineering team is building our next-generation private cloud infrastructure as part of a greenfield initiative to bring existing internal use cases in-house from the public cloud and to support new initiatives.

The historical deployment primarily supported our database test system: a system that runs over 7.25 million tests per month and writes petabytes to storage daily, and that we are scaling up several-fold within the next year alone.

As a Software Engineer, Storage Systems, you will be tasked with the design, implementation, and continuous improvement of systems at lower abstraction levels. You'll collaborate closely with a small, highly skilled team to drive a careful balance between battle-hardened best practices and creative innovation in an environment that embraces working from first principles, celebration of failure, and egoless technical discussion.

Role And Responsibilities

  • Engage in the design and development of major infrastructure components and systems, focusing on fault tolerance and performance.

  • Practice the principles of distributed leadership: be comfortable shifting between leading, collaborating with, and following your peers on different projects.

  • Own your systems end-to-end from design to production.

  • Work closely with other teams to ensure the successful integration and operation of our infrastructure systems.

  • Do your part to foster an environment where people take chances because they aren't punished for mistakes and improve over time because they learn from not just their mistakes but those of

Requirements

  • At least 5 years of experience developing and optimizing systems software.

  • Deep expertise (can speak both to system level and implementation detail) in at least one of:

      • virtualization

      • storage

      • service modeling & REST API/SDK generation

      • scientific or high-performance computing

      • kernel development

  • Natural curiosity or drive to learn about new or adjacent technologies.

  • Experience designing, developing, and/or troubleshooting distributed systems.

  • Comfort with at least one systems programming language (C/C++, Go) and one scripting language (Python, Bash).

  • Comfort with shells on *nix family systems.

  • B.S. degree or equivalent experience in Computer Science or a related field.

Preferred Qualifications

  • Experience with low-level monitoring & profiling (e.g. pcm, perf, bpf).

  • Experience with, or hobbyist interest in, distributed elastic AI / LLM CPU inference (e.g. OpenVINO, llama.cpp).

  • Familiarity with, or interest in, SingleStore or other distributed SQL databases.
