AI FinOps Engineer
Job description
Nelnet is seeking an AI FinOps Engineer to own the token economics and cost optimization engine of our Enterprise AI program. Reporting to the IT Director of AI Delivery, this role is embedded in our Shared Services department and focuses on driving efficiency across our Enterprise AI platforms, starting with Anthropic Claude and extending to the broader EA portfolio.
This is a technical, hands-on role. You will work at the API level to instrument workloads, identify inefficiencies, and engineer solutions that reduce organizational cost without degrading capability. A key output of this work is translating token-level findings into best practices that our AI enablement team can distribute across the organization.
What You Will Own
- Token Engineering: Track, model, and optimize token costs across Enterprise AI platforms. Own prompt efficiency patterns, caching strategies, and model-tier selection guidance.
- Best Practice Development: Define and document token optimization best practices. Partner with the AI enablement team to translate findings into org-wide guidance.
- Utilization Reporting: Build and maintain dashboards that surface usage trends, cost anomalies, and efficiency metrics for IT leadership.
- Cost Optimization: Go beyond reporting: identify waste, propose tier or model changes, and quantify savings. Own recommendations from analysis through implementation.
You Will Thrive Here If
- You believe "if you can't measure it, you can't improve it," and you build the measurement yourself.
- You find token optimization a fun challenge to solve.
- You can hold your own in a conversation with both engineers and non-technical stakeholders.
The annual compensation range for this role is $77,000 - $170,000, depending on experience.
This position offers a hybrid work option. Nelnet values flexibility and understands the importance of work-life integration. Our hybrid work environment allows associates living within 30 miles of an office location to work remotely for part of the week, while also fostering collaboration and team connection through in-office presence three days per week.
Please note that we are unable to provide visa sponsorship for this position. To be considered, candidates must already be authorized to work in the United States without the need for current or future sponsorship.
This position requires work in support of the Company's contract with the United States Department of Education ("ED"). As such, the United States Government requires that any applicant for this position must complete United States Government security clearance. Effective June 1, 2018, ED has informed Nelnet that security clearance applications for foreign nationals are not being accepted or processed. In light of this direction from ED, Nelnet will be unable to hire applicants without United States citizenship for such positions.
Requirements
- 1-2 years of hands-on experience with LLM APIs (Claude, OpenAI, or equivalent) at the token level: not just usage, but optimization
- Deep familiarity with LLM pricing mechanics: context windows, caching, batching, input/output token splits, and tier structures
- Experience with prompt engineering techniques focused on efficiency and cost reduction
- Proficiency in Python or SQL for instrumentation and pipeline work
- Ability to communicate technical findings to non-technical stakeholders
Preferred:
- 2-4 years of industry experience
- Experience with prompt caching, batch API usage, or model-tier switching in production environments
- Cloud FinOps background or FinOps Foundation certification
- Experience with multiple LLM providers and their cost/capability tradeoffs