Chainforge.aiComprehensive evaluation of LLM diagnostic reasoning capabilities using real clinical cases. Measures diagnostic accuracy, reasoning coherence, and consideration of alternative diagnoses.
Comprehensive evaluation of LLM diagnostic reasoning capabilities using real clinical cases. Measures diagnostic accuracy, reasoning coherence, and consideration of alternative diagnoses.