Artificial Intelligence vs Human Evaluation of Anesthesia Education Videos: A Comparative Analysis Using Validated Quality Scales

Artificial Intelligence vs Human Evaluation of Anesthesia Education Videos: A Comparative Analysis Using Validated Quality Scales

Artificial Intelligence vs Human Evaluation of Anesthesia Education Videos: A Comparative Analysis Using Validated Quality Scales

https://www.frontiersin.org/journals/medicine/articles/10.3389/fmed.2026.1752664/full

Publish Date: 2026-01-19 02:34:00

Source Domain: www.frontiersin.org

Certainly! Here are four key points from the article provided:

  • Educational Quality Comparison: The study compared the educational quality of anesthesia-related YouTube videos produced by human educators and AI tools, focusing on human-generated content as higher quality based on certain criteria.

  • Assessment Tools: Videos were evaluated using validated scales including DISCERN, JAMA, and the Global Quality Scale (GQS) to measure the educational and overall quality of the content.

  • Human vs. AI Ratings: Human-generated videos scored significantly higher than AI-generated ones on the DISCERN and JAMA scales, though no significant difference was observed in GQS scores.

  • Agreement and Reliability: There was excellent inter-rater reliability between human experts in their evaluations, indicated by an Intraclass Correlation Coefficient (ICC) ranging from 0.81–0.86.

Those are four of the key takeaways from the provided article summary.