Data Integration Specialist - R&D

Lipman Family Farms
Estero, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Shift work
Languages
English, Spanish
Experience level
Intermediate
Compensation
$ 50K

Job location

Estero, United States of America

Tech stack

Microsoft Excel
API
Data analysis
Computer Vision
Big Data
Microsoft Outlook
Cloud Computing
Databases
Image Analysis
Data Integration
ETL
Data Transformation
Relational Databases
Digital Data
Python
Microsoft Office
Network Attached Storage (Server Appliance)
Operational Data Store
Pivot Tables
Power BI
DataOps
SharePoint
Data Import/Export
Data Processing
Scripting (Bash/Python/Go/Ruby)
Cloud Platform System
Data Strategy
Build Management
Data Management
Streamlit Framework

Job description

Company Overview: Lipman Family Farms grows a variety of vegetables and fruits backed by over 80 years of experience. With locations across the continent, strong local partnerships, and tens of thousands of acres of land, we grow and source our produce year-round in optimal conditions to ensure surety of supply. So, whether you're slicing one of our top-notch tomatoes, biting into a crisp bell pepper, or enjoying the freshness of any one of our offerings, you'll know that you're getting produce grown with care and expertise, by family for family.

Job Summary: This position supports the day-to-day data operations and long-term data strategy of the Tomato breeding program at Lipman Family Farms R&D, located in Estero, Florida. The role sits at the intersection of plant breeding and data operations. Core responsibilities keep the breeding team running efficiently, while expanded responsibilities grow the program's data capabilities over time.

Day-to-day work includes setting up and managing seasonal trials and breeding activities, maintaining shared records, and fulfilling data requests from breeders and team members. Expanded work includes running image analysis pipelines, building data integrations, updating historical records, and developing new methods of analysis and reporting. The ability to gather, organize, maintain, track and transfer data between a relational database and other data platforms will be essential.

The position involves working with a range of tools and platforms, including PhenomeOne (the program's central breeding database), Microsoft Teams and SharePoint, a shared network drive, Excel, Python, and cloud-based image analysis services. Training on program-specific tools, naming conventions, and workflows is provided on the job. The position closely supports the Lead Breeder, Predictive Breeding Specialist, and other team members in data organization, analysis, and reporting., * Set up and maintain trials in PhenomeOne each season, including populating germplasm lists, configuring plot assignments through the map tab, creating observation sets, and confirming trial readiness prior to data collection.

  • Create and manage new germplasm & hybrid records. Ensure essential data is properly inherited and tracked throughout multiple generations & seasons in PhenomeOne.
  • Print field tags and labels as needed to support trial operations, crossing activities, and sample tracking.
  • Respond to data requests from breeders and team members by pulling observations, germplasm attributes, field results, and other records from PhenomeOne. Ensure all users have mobile access to data, on either PhenoTop or Teams, for reference and note taking.
  • Maintain the team's shared record system on the local server, while helping guide the full transition to Microsoft Teams/SharePoint. This includes exporting data from PhenomeOne, keeping trial lists current, updating records as new information is assigned each season, and ensuring all breeding program activity is reflected in the appropriate folders.
  • Assist Product Development team with analyzing and reporting trial results

Expanded Responsibilities

  • Run the in-house image analysis pipeline, including managing cloud GPU resources, processing field images through computer vision workflows, and retrieving structured output data for integration into breeding records.
  • Upload historical data into PhenomeOne to build a unified, searchable record of past trial performance.
  • Design and build field templates and variable structures in PhenomeOne to support the integration of new data, ensuring consistency with existing data organization standards.
  • Build and maintain data integrations using the PhenomeOne API, including relational data integrations, ETL processes, and R or Python-based analysis pipelines.
  • Identify and develop new methods of data analysis and reporting using Excel, Power BI, PhenomeOne Insights, or custom scripts to surface meaningful patterns in breeding, evaluation, and image analysis data.

Requirements

Do you have experience in Teamwork?, To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed below are representative of the knowledge, skill, and ability required. Preference will be given to candidates with 2-3 years of experience in most of these areas. Reasonable accommodation may be made to enable individuals with disabilities to perform the essential functions.

  • Strong background in data gathering, organization of large data sets, analysis, and reporting.
  • Comfortable learning and navigating relational database platforms. Ability to manage records, configure workflows, and extract data as needed.
  • Proficient in the Microsoft Office Suite, particularly Excel (including formulas, pivot tables, and data manipulation), as well as Teams, SharePoint, and Outlook.
  • Foundational knowledge of Python or a similar scripting language for data transformation and automation. Willingness to learn additional tools and languages on the job.
  • Comfortable working with cloud-based platforms and willing to learn new tools including APIs, GPU-based computing services, and computer vision workflows.
  • Interest in data visualization and reporting. Familiarity with tools such as Power BI, Streamlit, or similar platforms is a plus.
  • Thorough and detail-oriented in completing tasks on time, with strict attention to data accuracy and consistency.
  • Ability to read, understand, and carry out written protocols and develop efficient workflows for data collection, integration, and organization.
  • Creative and proactive in problem-solving, particularly under new or evolving circumstances.
  • Fluent in English. Ability to speak Spanish is desirable.
  • Non-tobacco user in any form or at any time.
  • Ability to Relocate to Estero, FL 33928 before starting work (Required), * Highly organized, self-motivated, and detail-focused.
  • Able to effectively set and meet goals, timelines, and project milestones.
  • Able to collect, organize, interpret, store, and retrieve experimental and operational data in secure written and digital formats.
  • Strong, proactive communication skills across various work settings, including team meetings, cross-functional discussions, and one-on-one interactions with breeders and scientists.
  • Able to plan, coordinate, and execute multiple concurrent tasks and shift priorities as seasonal demands change.
  • Able to work effectively in a team-oriented and multicultural environment.
  • Able to work accurately with large data sets, long record lists, and complex naming or numbering systems.
  • Able to follow oral and written directions, and to read, comprehend, interpret, apply, and explain internal protocols, methodologies, and results to staff and leadership.

Benefits & conditions

Pulled from the full job description

  • 401(k)
  • Health insurance
  • 401(k) matching
  • Paid time off
  • Vision insurance
  • Dental insurance
  • Life insurance, Schedule:
  • 8 hour shift
  • Monday to Friday, * 401(k)
  • 401(k) matching
  • Dental insurance
  • Employee assistance program
  • Flexible schedule
  • Health insurance
  • Life insurance
  • Paid time off
  • Vision insurance

Apply for this position