Research Data Engineer
Role details
Job location
Tech stack
Job description
We are seeking a hands-on, execution-driven Senior Research Data Engineer to scale and secure HotSpot's growing data operations infrastructure. This role serves as a vital operational bridge between our core software/cloud architecture and our computational biology and laboratory teams. You will partner closely with our Senior Data Engineer to drive alignment, eliminate single points of failure, and establish an automated, production-grade data paradigm that accelerates our drug discovery workflows., * Lead AI & LLM Data Readiness: Architect and execute the strategy to structure, clean, and index our historical and incoming research datasets, making them fully ready for machine learning and advanced ontology initiatives.
- Automate External Data Pipelines: Own and maintain robust Python ETL pipelines, ensuring seamless automated ingestion of chemistry and biology experiments from external CROs.
- System Stewardship & Data Ops: Act as the primary technical steward for our core research platforms (including Revvity Signals ELN, Certara D360, and our internal data ingestion tool, Ladle), handling Oracle schema updates, routine database maintenance, and direct user support for our scientists.
- Bridge Science and Engineering: Partner across disciplines to help scientists map biology and chemistry workflows into production templates, serving as a data continuity bridge that translates laboratory progress into scalable data models.
- Drive Operational Follow-Through: Identify dependencies and critical paths in our data delivery pipelines, actively manage infrastructure risks, and provide clear technical context to leadership.
Requirements
- Advanced Scientific Background: PhD degree in Bioinformatics, Computational Biology, Computer Science, or a related quantitative life sciences discipline and a minimum of 2+ years of relevant industry experience or a combination of post-doctoral and industry experience; MS degree in bioinformatics or related discipline with 5+ years of relevant industry experience may be considered.
- Production-Grade Engineering: Proven commercial or advanced hands-on experience writing clean, maintainable Python code and advanced SQL queries. You know how to build software that stands up to production environments.
- Cloud & Database Fluency: Demonstrated experience navigating cloud environments (AWS/GCP) and managing relational database schemas (Oracle, Postgres).
- Execution-Driven Planner: An autonomous problem-solver who brings clarity to complex data dependencies, actively tracks technical risk, and handles firefighting or data cleaning with equal dedication.
- Excellent Communicator: Ability to synthesize complex technical data constraints across disciplines and drive follow-through in a lean, fast-paced startup environment., At this time, HotSpot is not able to offer Visa sponsorship for this position. Candidates must be authorized to work in the United States without current or future sponsorship.
Benefits & conditions
We believe that people are our greatest resource and foster a supportive environment that provides growth and development for all teammates. We recognize and reward performance and incentivizes long-term success. From benefits that focus on your health and well-being to competitive compensation to ownership in the company, we want to inspire our employees.
Our Benefits Include, But Are Not Limited To:
- Competitive salary and bonus plans
- New hire stock option award
- Comprehensive package of benefits plans and fringe benefits
- Generous paid holidays, time off, including 2 company wide shutdowns
- Flexible working arrangements; hybrid work model
- Amazing team of supportive colleagues
At HotSpot, we have a bold mission to establish new drug discovery paradigm. If this appeals to you, please let us know at [email protected].
HotSpot is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.