About Us: EvolutionIQ's mission is to deliver state of the art technology that helps insurance claims teams make claims handling more accurate, fair, and efficient, so that more people impacted by injury or illness can continue their lives with dignity and stability. We are currently experiencing massive growth and to accomplish our goals, we are hiring world-class talent who want to help build and scale internally, and transform the insurance space. Our team is our #1 priority, and we have been named one of Inc.'s Best Workplaces 3 years in a row and Built In's Best Places to work in 2025 and 2026!
About the Role & You: As a Senior Data Scientist on the Medhub team, you will be the primary architect of our LLM evaluation framework. Think of this role as the "Evaluation Expert" and will define the rigorous statistical standards they must meet before they ever touch production. You believe that LLMs should be held to the same (or higher) scientific standards as traditional supervised models. You enjoy the challenge of turning subjective outputs (like chat and summarization) into objective, measurable data. You are a "Data Scientist's Data Scientist"-someone who leans heavily into statistical significance, inter-rater reliability, and robust experimental design to ensure our AI products are safe, accurate, and reliable.
What You'll Achieve (Performance Outcomes):
In this Role You Will: