<?xml version="1.0" encoding="iso-8859-1" standalone="no"?>
<!DOCTYPE GmsArticle SYSTEM "http://www.egms.de/dtd/2.0.34/GmsArticle.dtd">
<GmsArticle xmlns:xlink="http://www.w3.org/1999/xlink">
  <MetaData>
    <Identifier>25dkou481</Identifier>
    <IdentifierDoi>10.3205/25dkou481</IdentifierDoi>
    <IdentifierUrn>urn:nbn:de:0183-25dkou4811</IdentifierUrn>
    <ArticleType>Meeting Abstract</ArticleType>
    <TitleGroup>
      <Title language="en">FractureGPT: A controlled comparative study of ChatGPT-4 and an orthopedic surgeon in detecting lower leg and ankle fractures</Title>
    </TitleGroup>
    <CreatorList>
      <Creator>
        <PersonNames>
          <Lastname>Hofmann</Lastname>
          <LastnameHeading>Hofmann</LastnameHeading>
          <Firstname>Max</Firstname>
          <Initials>M</Initials>
        </PersonNames>
        <Address>
          <Affiliation>Klinik f&#252;r Unfallchirurgie, Orthop&#228;die und Handchirurgie, Helios Klinikum Erfurt, Erfurt, Deutschland</Affiliation>
          <Affiliation>Health and Medical University Erfurt, Erfurt, Deutschland</Affiliation>
        </Address>
        <Creatorrole corresponding="no" presenting="yes">author</Creatorrole>
      </Creator>
      <Creator>
        <PersonNames>
          <Lastname>Fahr</Lastname>
          <LastnameHeading>Fahr</LastnameHeading>
          <Firstname>Robert</Firstname>
          <Initials>R</Initials>
        </PersonNames>
        <Address>
          <Affiliation>Klinik f&#252;r Unfallchirurgie, Orthop&#228;die und Handchirurgie, Helios Klinikum Erfurt, Erfurt, Deutschland</Affiliation>
          <Affiliation>Health and Medical University Erfurt, Erfurt, Deutschland</Affiliation>
        </Address>
        <Creatorrole corresponding="no" presenting="no">author</Creatorrole>
      </Creator>
      <Creator>
        <PersonNames>
          <Lastname>Elsaghir</Lastname>
          <LastnameHeading>Elsaghir</LastnameHeading>
          <Firstname>Ahmed</Firstname>
          <Initials>A</Initials>
        </PersonNames>
        <Address>
          <Affiliation>Klinik f&#252;r Unfallchirurgie, Orthop&#228;die und Handchirurgie, Helios Klinikum Erfurt, Erfurt, Deutschland</Affiliation>
          <Affiliation>Health and Medical University Erfurt, Erfurt, Deutschland</Affiliation>
        </Address>
        <Creatorrole corresponding="no" presenting="no">author</Creatorrole>
      </Creator>
      <Creator>
        <PersonNames>
          <Lastname>M&#252;ckley</Lastname>
          <LastnameHeading>M&#252;ckley</LastnameHeading>
          <Firstname>Thomas</Firstname>
          <Initials>T</Initials>
        </PersonNames>
        <Address>
          <Affiliation>Klinik f&#252;r Unfallchirurgie, Orthop&#228;die und Handchirurgie, Helios Klinikum Erfurt, Erfurt, Deutschland</Affiliation>
        </Address>
        <Creatorrole corresponding="no" presenting="no">author</Creatorrole>
      </Creator>
    </CreatorList>
    <PublisherList>
      <Publisher>
        <Corporation>
          <Corporatename>German Medical Science GMS Publishing House</Corporatename>
        </Corporation>
        <Address>D&#252;sseldorf</Address>
      </Publisher>
    </PublisherList>
    <SubjectGroup>
      <SubjectheadingDDB>610</SubjectheadingDDB>
    </SubjectGroup>
    <DatePublishedList>
      <DatePublished>20251031</DatePublished>
    </DatePublishedList>
    <Language>engl</Language>
    <License license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
      <AltText language="en">This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 License.</AltText>
      <AltText language="de">Dieser Artikel ist ein Open-Access-Artikel und steht unter den Lizenzbedingungen der Creative Commons Attribution 4.0 License (Namensnennung).</AltText>
    </License>
    <SourceGroup>
      <Meeting>
        <MeetingId>M0634</MeetingId>
        <MeetingSequence>481</MeetingSequence>
        <MeetingCorporation>Deutsche Gesellschaft f&#252;r Orthop&#228;die und Unfallchirurgie</MeetingCorporation>
        <MeetingCorporation>Deutsche Gesellschaft f&#252;r Orthop&#228;die und Orthop&#228;dische Chirurgie</MeetingCorporation>
        <MeetingCorporation>Deutsche Gesellschaft f&#252;r Unfallchirurgie</MeetingCorporation>
        <MeetingCorporation>Berufsverband f&#252;r Orthop&#228;die und Unfallchirurgie</MeetingCorporation>
        <MeetingName></MeetingName>
        <MeetingTitle>Deutscher Kongress f&#252;r Orthop&#228;die und Unfallchirurgie (DKOU 2025)</MeetingTitle>
        <MeetingSession>Abstracts &#124; Digitalisierung 3</MeetingSession>
        <MeetingCity>Berlin</MeetingCity>
        <MeetingDate>
          <DateFrom>20251028</DateFrom>
          <DateTo>20251031</DateTo>
        </MeetingDate>
      </Meeting>
    </SourceGroup>
    <ArticleNo>AB76-4039</ArticleNo>
  </MetaData>
  <OrigData>
    <TextBlock name="Text" linked="yes">
      <MainHeadline>Text</MainHeadline><Pgraph><Mark1>Objectives and questions: </Mark1>Large Language Models have demonstrated a broad range of applications, including text editing, content creation and coding. In the medical field, ChatGPT has been explored in various contexts, such as answering patient questions, analyzing radiological reports, and fracture detection.</Pgraph><Pgraph>The aim of this study was to conduct a randomized, controlled comparative study to evaluate the diagnostic accuracy of ChatGPT-4 in detecting fractures, compared to an Orthopedic Surgeon. </Pgraph><Pgraph><Mark1>Material and methods: </Mark1>Radiographs from the FracAtlas database were used to evaluate ChatGPT-4&#39;s diagnostic performance in fracture detection against an experienced orthopedic surgeon under controlled conditions. A total of 80 images (30 with fractures, 50 without) depicting the ankle region were selected, randomly sorted, and uploaded to FractureGPT, a custom-made GPT specifically designed for fracture identification. FractureGPT was instructed to describe the depicted anatomical region and to report the presence of a fracture.</Pgraph><Pgraph>The same images were presented to a trained orthopedic surgeon, who was tasked with determining the presence or absence of fractures. Accuracy, sensitivity, specificity, and the area under the curve (AUC) of the receiver operating characteristic (ROC) were calculated and compared.</Pgraph><Pgraph><Mark1>Results: </Mark1>FractureGPT demonstrated a strong capability to detect and describe the depicted anatomical region, with an accuracy in identifying the correct projection of 97.54&#37;. Overall accuracy for FractureGPT was 66.25&#37; &#177; 5.29&#37; and 93.75&#37; &#177; 2.71&#37; for the trained orthopedic surgeon. Sensitivity was 50&#37; &#177; 9.13&#37; for FractureGPT and 96.67&#37; &#177; 3.28&#37; for the orthopedic surgeon. Specificity for FractureGPT was 76&#37; &#177; 6.04&#37; and 92.00&#37; &#177; 3.84&#37; for the trained orthopedic surgeon. </Pgraph><Pgraph>Accuracy and sensitivity of the orthopedic surgeon was significantly better than FractureGPT when compared using McNemar&#8217;s test (p&#60;0.000027 and p&#61;0.00012, respectively). The difference in specificity was not significant when tested using McNemar&#8217;s test (p&#61;0.057) but still did show a strong trend.</Pgraph><Pgraph>To compare diagnostic performance of FractureGPT and the orthopedic surgeon, the receiver operating characteristic (ROC) curve and the respective area under the curve (AUC) was calculated. AUC was 0.63 for FractureGPT and 0.94 for the orthopedic surgeon, which was significantly higher when compared using DeLong test (p&#60;0.0000001).</Pgraph><Pgraph><Mark1>Discussion and conclusions: </Mark1>While ChatGPT-4 demonstrates strong anatomical recognition, it falls significantly short of a trained orthopedic surgeon in fracture detection accuracy and sensitivity. While promising as a quick screening tool, ChatGPT-4 cannot replace trained medical expertise. However, integrating AI into fracture detection holds significant potential to enhance diagnostic workflows and enable rapid preliminary assessments.</Pgraph></TextBlock>
    <Media>
      <Tables>
        <NoOfTables>0</NoOfTables>
      </Tables>
      <Figures>
        <NoOfPictures>0</NoOfPictures>
      </Figures>
      <InlineFigures>
        <NoOfPictures>0</NoOfPictures>
      </InlineFigures>
      <Attachments>
        <NoOfAttachments>0</NoOfAttachments>
      </Attachments>
    </Media>
  </OrigData>
</GmsArticle>