Tiffany H. Kung, Morgan Cheatham, Arielle Medenilla, Czarina Sillos, Lorie De Leon, Camille Elepaño, Maria Madriaga, Rimel Aggabao, Giezel Diaz-Candido, James Maningo, Victor Tseng
We evaluated the performance of a large language model called ChatGPT on the
United States Medical Licensing Exam (USMLE), which consists of three exams:
Step 1, Step 2CK, and Step 3. ChatGPT performed at or near the passing threshold
for all three exams without any specialized training or reinforcement.
Additionally, ChatGPT demonstrated a high level of concordance and insight in
its explanations... hiện toàn bộ