The HealthBench test can’t possibly tell us the critical factor: How humans would respond to chatbots under real-world conditions.