2025, Number 3
Deep learning applications in orthopaedics: a systematic review and future directions
Language: English
References: 88
Page: 152-163
PDF size: 263.41 Kb.
ABSTRACT
Introduction: artificial intelligence and deep learning in orthopedics have gained mass interest in recent years. In prior studies, researchers have demonstrated different applications, from radiographic assessment to bone tumor diagnosis. The purpose of this review is to analyze the current literature on AI and deep learning tools to identify the most used tools in the risk assessment, outcome assessment, imaging, and basic science fields. Material and methods: searches were conducted in PubMed, EMBASE and Google Scholar from January 2020 up to October 31st, 2023. We identified 862 studies, 595 of which were included in the systematic review. A total of 281 studies about radiographic assessment, 102 about spine-oriented surgery, 95 about outcome assessment, 84 about fundamental AI orthopedic education, and 33 basic science applications were included. Primary outcomes were diagnostic accuracy, study design and reporting standards reported in the literature. Estimates were pooled using random effects meta-analysis. Results: 53 different imaging methods were used to measure radiographic aspects. A total of 185 different machine learning algorithms were used, with the convolutional neural network architecture being the most common (73%). To improve diagnostic accuracy and speed were the most commonly achieved results (62%). Conclusion: heterogeneity was high among the studies, and extensive variation in methodology, terminology and outcome measures was noted. This can lead to an overestimation of the diagnostic accuracy of DL algorithms for medical imaging. There is an immediate need for the development of artificial intelligence-specific guidelines to provide guidance around key issues in this field.INTRODUCTION
Significant advances have been made in deep learning (DL), a subset of machine learning and artificial intelligence (AI), in the field of healthcare, particularly in orthopedic surgery. By comprehending complex algorithms, deep learning has the potential to revolutionize diagnosis, treatment planning, prediction of surgical outcomes, and even surgical procedures.
DL and machine learning (ML) methods enable computers to learn from data and make decisions or predictions without being programmed to perform specific tasks. In simple terms, ML involves teaching computers to recognize patterns or trends in data, while DL, a subset of ML, uses layers of algorithms called neural networks to analyze these data patterns more deeply.1
In the field of medical science, deep learning has become particularly valuable in analyzing medical images. By using neural networks, DL algorithms can detect intricate patterns in images such as X-Rays or MRIs, enabling them to diagnose diseases such as cancer with a high degree of accuracy. This technology acts like an expert radiologist but can work around the clock and process information at speeds and volumes far beyond human capabilities.2,3
The integration of DL and ML into medical science, especially in orthopedics, could revolutionize how we diagnose, treat, and develop interventions. These technologies significantly advance patient care, improve treatment outcomes, and drive innovation in medical research and practices. As this field continues to evolve rapidly, it holds great promise for even more remarkable breakthroughs and applications in enhancing health and quality of life.4
ADVANCEMENTS IN DEEP LEARNING IN ORTHOPEDICS
Preoperative planning
Deep learning algorithms have been successful in preoperative planning, particularly in predicting outcomes and complications. It also helps in selecting the most appropriate surgical approach.5
Image interpretation
Deep learning algorithms have been developed for image interpretation and have achieved expert-level accuracy in diagnosing different orthopedic conditions through MRI, X-Ray, or CT scan readings.1,2,6
Prosthesis design
AI can aid in the customization of prosthetics, leading to a better fit and increased functionality for individual patients.3,7
Robotic orthopedic surgery
Artificial intelligence-based robotics systems have gained ground in orthopedic surgery, such as hip and knee replacements, leading to greater precision and potentially better patient outcomes.8
MATERIAL AND METHODS
This systematic review was carried out following the procedures highlighted in the 'PRISMA-DTA' extension for diagnostic validity studies, as well as following systematic review guidelines.9
SELECTION CRITERIA
We sought studies that reveal the diagnostic accuracy and uses of DL algorithms in identifying pathologies or diseases of orthopedic interest. The primary objective was to identify the type of studies and range of diagnostic accuracy metrics. The secondary objectives included the study design and quality of reporting.
DATA COLLECTION AND SEARCHES
Electronic literature searches in PubMed, EMBASE and Google Scholar were also conducted from January 1st, 2020, up to October 31st, 2023. The search terms used consisted of Mesh terms and all-field search terms for "orthopedic deep learning" and "neural networks" (examples are DL, convolutional networks, etc.) and specific terms such as "imaging" (such as magnetic resonance, computed tomography, ultrasound, or X-Ray), and the results were used as "diagnostic accuracy metrics" (sensitivity and specificity). We identified 862 studies, 595 of which were included in the systematic review. A total of 281 studies about radiographic assessment, 102 about spine-oriented surgery, 95 about outcome assessment, 84 about basic AI orthopedic education and 33 about basic science application were included for review. Peer-reviewed studies that reported on the accuracy of DL algorithms for identifying pathology using medical imaging were included. Primary outcomes were diagnostic accuracy, study design and reporting standards reported in the literature. Estimates were pooled using random effects meta-analysis (Figure 1).
INCLUSION CRITERIA
Eligible studies included those evaluating the diagnostic accuracy of a DL algorithm in the orthopedic field in general and in subspecialties. Only studies that reported either diagnostic accuracy raw data or sensitivity and specificity data were incorporated in the systematic review. No restrictions were imposed on the date range, and the most recent search took place in November 2023.
EXCLUSION CRITERIA
Articles not written in English were excluded. Abstracts, conference articles, preprints, reviews, and meta-analyses were not considered as the purpose was to scrutinize the methodology, reporting standards, and quality of primary research studies appearing in peer-reviewed journals.
SELECTION PROCESS
The study selection process entailed an initial review by two different reviewers of all titles and abstracts. This was followed by a second review by the third and fourth authors of the full-text articles. This process ensured that the remaining articles met the inclusion criteria.
A total of 153 different imaging methods for radiographic aspects were identified. A total of 185 different ML algorithms were used, with the convolutional neural network architecture being the most common (73%). Improving diagnostic accuracy and speed was the most common method used (62%)
REVIEW
This review focuses on five principal applications of machines and deep learning methodologies in orthopedics. The key areas of focus include (1) evaluation through radiography, (2) spine-focused surgical interventions, (3) basic orthopedic science, (4) evaluation of outcomes in general orthopedic surgeries, and (5) fundamental artificial intelligence (AI) instruction for orthopedic surgeons. Given the expansive use of AI in orthopedic surgery, it is impractical to cover every facet of this review. Instead, we have opted to concentrate on a select few applications that we believe encapsulate and effectively summarize the enhancements associated with AI and deep learning; we summarize these reviews with examples in Table 1.
APPLICATION 1: RADIOGRAPHIC ASSESSMENT
DL, an AI technology, possesses the ability to analyze and interpret complex medical imaging, a capability that paves the way for more accurate diagnoses and patient-specific treatments.10 Studies have shown that DL can process X-Rays, CT scans, and MRI images effectively to identify specific pathologies, such as fractures, osteoarthritis, bone tumors, deformities, and degenerative diseases.11,12 Radiographic parameters are instrumental for accurately determining disease progression, planning surgeries, and predicting postoperative outcomes. Moreover, this approach has the potential to analyze biomechanical data and perform automated measurements, which are traditionally time-consuming.13
In addition to improving diagnostic accuracy, its application also brings forth the potential for expedited and optimized medical workflows. With its ability to swiftly analyze large volumes of radiographic data, clinicians can rapidly plan treatment trajectories and focus intensively on critical patient care.14,15 Moreover, DL models can easily facilitate the detection of subtle or complex patterns in images, which may be challenging even for experienced radiologists. This could enhance early detection and intervention for debilitating conditions, eventually improving patient outcomes. Additionally, DL can serve as an educational tool to aid trainees and junior clinicians in enhancing their understanding and interpretation of orthopedic imaging.16 Certainly, key challenges must be considered, such as ensuring the ethical use of patient data, maintaining transparency in AI decision-making processes, and eliminating interoperability issues among various healthcare systems.
However, despite its promising advantages, DL implementation in orthopedics must be approached with caution due to its dependency on the quality and size of the dataset used for training. Hence, additional research is necessary to overcome these potential limitations and standardize the use of DLs in orthopedic radiographic assessments in pursuit of precision medicine. Overcoming these obstacles will be crucial for fully realizing the potential benefits and advancements that DL offers to the field of orthopedic radiographic assessment.
APPLICATION 2: SPINE-ORIENTED SURGERY
DL has presented significant advancements, particularly in spine-oriented surgical procedures. Researchers and practitioners are actively exploring and utilizing DLs to assist in areas ranging from diagnosis to surgical planning and prognosis prediction.17 Specific examples include the detection and classification of spinal disorders such as scoliosis, spondylolisthesis, and intervertebral disc degeneration from radiographic images. DL models have also been designed to assist in surgical planning by predicting pedicle screw placement, reducing the chances of iatrogenic injury.18,19
In addition to its use in diagnostic and treatment planning, the role of DL in enhancing intraoperative guidance and postoperative rehabilitation in spinal surgeries also demonstrates significant promise. With its advanced pattern recognition capabilities, DL can potentially provide real-time feedback during spinal surgeries, thereby increasing the precision and safety of procedures.20,21 For instance, DL algorithms can aid in identifying anatomic landmarks, enhancing visualization, and assisting tool navigation, subsequently reducing the risk of medical errors.
However, complete adoption of DL in spine-related orthopedic practices remains a challenging task. Despite the considerable progress, issues such as data privacy, algorithmic transparency, and the need for interdisciplinary collaboration between data scientists and clinicians need to be adequately addressed. As the potential of DL continues to be explored, more comprehensive guidelines governing its application in spinal surgeries have yet to be established to leverage its benefits effectively.
APPLICATION 3: BASIC SCIENCE (BIOMECHANICS/PATHOGENESIS)
In the realm of basic science in orthopedics, the incorporation of DL has amplified the possibility of understanding intricate details and processes at the molecular, cellular, and tissue levels. The established DL algorithms can supplement traditional research methods by analyzing complex patterns and interrelationships among numerous biological variables. They can handle vast multidimensional datasets, aiding in biomarker discovery, modeling disease progression, and identifying potential therapeutic targets.22
DL can aid genomic research in orthopedics, where it could be used to understand patterns of gene expression pertaining to bone growth, determine the underlying genetic causes behind orthopedic diseases, or predict patient responses to treatments at the genetic level. Similarly, at the tissue level, DL-powered image analysis can provide a more in-depth analysis of histopathological samples, potentially enabling early detection of degenerative diseases and comprehensive assessment of cellular responses to different interventions.25
Moreover, DL can complement biomechanical studies by facilitating the analysis of complex motion patterns and forces, providing deeper insights into the effects of various physical activities on the musculoskeletal system.26
APPLICATION 4: OUTCOMES ASSESSMENT
DL has proven to be a critical tool in orthopedics for assessing patient outcomes. By analyzing vast sets of patient data, DL models can predict patient-specific outcomes following various interventions, thereby enabling a more personalized approach to patient care. Moreover, this approach can help identify factors contributing to optimal and suboptimal outcomes, facilitating improvements in therapeutic strategies.27,28 DL algorithms can be used to mine data to uncover complex, non intuitive correlations between patient characteristics, intervention details, and postoperative outcomes. Such correlations could be used to predict future patient recovery patterns, incidence of complications, or even the likelihood of rehospitalization, allowing clinicians to effectively plan and adjust treatments and follow-up schedules.29
Furthermore, DL can be used to analyze real-time patient data collected through wearable technology to provide a comprehensive understanding of patient function and recovery in real-world settings. By mining these rich data sources, DL can potentially uncover nontraditional metrics of orthopedic outcomes, which may prove relevant in achieving patient-centered care.29,30 DL can decipher patterns in movement data to accurately assess rehabilitation progress following joint replacement or reconstructive surgeries. Likewise, this study can provide insights into the adherence of patients to prescribed rehabilitation protocols and enable tailored interventions for improving patient compliance.31,32
While this fusion of DL and wearable technology opens a new dimension in outcome research in orthopedics, key issues, including patient acceptance, data privacy, and data validity, need careful deliberation. Nonetheless, with continuous refinement and stringent validation practices, the integration of DL can undoubtedly drive a paradigm shift in outcome assessment, taking orthopedic care a step closer to the aim of optimized, patient-centered care. As we move forward, the effective application of DLs can significantly enhance the quality and efficacy of orthopedic care by ensuring that interventions align with individuals' expectations and desired outcomes.
APPLICATION 5: FUNDAMENTAL AI EDUCATION FOR ORTHOPEDIC SURGEONS
DL, as a subset of AI, has been progressively weaving its way into medical education, including orthopedics. With its capacity to process vast and complex datasets, DL can contribute significantly to bolstering both theoretical knowledge and practical skills among medical students and professionals in the orthopedic field. For example, DL algorithms can assist in creating immersive, personalized learning experiences by identifying individual learning patterns and offering tailored educational content.33
Beyond direct educational functions, DL can serve as an integral tool in fostering research literacy among medical students and professionals in orthopedics. With increasing focus on evidence-based practice in the current healthcare landscape, having skills to conceive, conduct, and interpret research is now considered equally important as clinical skills. DL can equip learners to handle big data analytic tasks, analyze complex research data, and more accurately interpret the findings of a study.34 Additionally, DL can be used as a pedagogical tool to teach foundational concepts of bioinformatics relevant to orthopedics, such as genomic studies in osteoarthritis or proteomic data analysis in bone healing.
The introduction of AI-driven research concepts in medical education faces several hurdles, including the need for curricular adaptations, lack of skilled educators, and the risk of overreliance on algorithms at the expense of contextual decision-making. Despite these challenges, the incorporation of DL in medical education is a necessary advancement that can mold a generation of medical professionals competent in both clinical and research domains of orthopedics.
Importantly, familiarizing future orthopedic practitioners with AI concepts and their uses also becomes fundamental to nurturing a workforce that is adept at leveraging DL in clinical problem solving.
LIMITATIONS AND CHALLENGES
Certain challenges surfaced in the literature under review include the 'black box' nature of neural network models, which makes it difficult for users and orthopedic surgeons to comprehend and explain the basis of the outcomes produced. Moreover, there are hindrances to the applicability and implementation of these models across various healthcare institutions due to variations in standardization, procedures, and dataset parameter availability. Even when dataset parameters are available, their values may differ because of varying demographics and geographical locations. These limitations may influence the overall accuracy and predictive potency of the artificial intelligence model and could pose a risk to patients if implemented indiscriminately. Thus, further investigations and trials are indispensable for confirming the efficacy of the developed model.
Despite its impressive potential, it is crucial to note that the success of DL applications relies primarily on the quality, size, and diversity of the data used for model training. The full integration of DL into surgery, therefore, requires additional studies, robust data handling techniques, and stringent validation measures to ensure accurate and reliable utilization in clinical practice.
FUTURE CONSIDERATIONS
The integration of advanced technologies such as DL and AI into orthopedics marks a significant shift toward more precise, personalized, and efficient care in the field. Currently, these technologies are being applied to improve diagnostic processes, such as interpreting X-Rays and MRI scans with remarkable accuracy and modeling and simulating surgical planning. In the future, orthopedics could be profoundly impacted by further advancements in DL and AI. These technologies hold the promise of enhancing diagnostic accuracy, streamlining treatment planning, optimizing surgical interventions through robotic assistance, and tailoring patient care to individual needs with unprecedented precision. Furthermore, AI-driven predictive models could pave the way for proactive management of orthopedic conditions, potentially improving long-term outcomes.
The field of radiology research has widely embraced deep learning, a testament to the potential of this technology to revolutionize medical imaging analysis. A noticeable trend in the plethora of medical literature is the frequent use of deep learning models applied through different conception methodologies. The replication observed in radiology studies emblematic of the community's determination to fine-tune the accuracy of these models, emphasizing the importance of deriving diverse methodologies tailored to distinct datasets and clinical scenarios, despite the commonality of the underlying algorithms. While the repetitive use of similar models facilitates a deeper understanding of their capabilities and limitations, it also poses the challenge of potentially stalling innovation by focusing on incremental improvements rather than groundbreaking discoveries.
However, this approach has undeniably propelled radiology forward, improving image analysis, helping early and accurate diagnoses, and initiating a continual dialog on best practices. In sum, although the repetitive nature prevails in radiological deep learning research, its presence is pivotal both for incremental progress and for highlighting areas where novel contributions are crucial for further advancements in the field, urging researchers to persistently seek innovative solutions.
The widespread adoption of AI in orthopedic practice must carefully address potential challenges, including data privacy concerns, the need for substantial training datasets, and ensuring equitable access to AI-driven care. Ethical considerations around patient autonomy and the role of AI in clinical decision-making also require thoughtful addressing.
CONCLUSION
Advancements in deep learning offer immeasurable opportunities in orthopedic surgery, from aiding in diagnosing complex conditions to performing surgical procedures with greater precision than ever before. However, integrating deep learning algorithms into orthopedics is associated with numerous challenges, including ethical concerns about data privacy and regulating the use of AI in clinical environments. More research is needed to validate and refine these tools and strategies so they can be safely and effectively incorporated into practice.
Heterogeneity was high between studies, and extensive variation in methodology, terminology and outcome measures was noted. This can lead to an overestimation of the diagnostic accuracy of DL algorithms for medical imaging. There is an immediate need for the development of artificial intelligence-specific guidelines to provide guidance around key issues in this field.
DL is a burgeoning field with immense potential across all areas of healthcare, particularly radiology. The present systematic review and meta-analysis scrutinized the quality of the literature and summarized the diagnostic accuracy of DL techniques. Despite the high diagnostic accuracy of DL currently available, interpreting these findings is vital considering the poor design, conduct, and reporting of studies, which could result in bias and overestimation of the power of these algorithms. Improving DL application requires standardized instructions related to study design and reporting, which can further clarify its clinical utility.
REFERENCES
McInnes MDF, Moher D, Thombs BD, McGrath TA, Bossuyt PM; and the PRISMA-DTA Group, et al. Preferred reporting items for a systematic review and meta-analysis of diagnostic test accuracy studies: the PRISMA-DTA statement. JAMA. 2018; 319(4): 388-96. doi: 10.1001/jama.2017.19163. Erratum in: JAMA. 2019; 322(20): 2026.
Mandel W, Parent S, Kadoury S. Intraoperative forecasting of growth modulation spine surgery outcomes with spatio-temporal dynamic networks. In: Martel AL, Abolmaesumi P, Stoyanov D, Mateus D, Zuluaga MA, Zhou SK, et al., editors. Medical Image Computing and Computer Assisted Intervention-MICCAI 2020. Cham: Springer International Publishing; 2020. p. 751-60.
Tran VL, Lin HY, Liu HW. MBNet: a multi-task deep neural network for semantic segmentation and lumbar vertebra inspection on X-ray images. Computer Vision–ACCV 2020: 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30–December 4, 2020, Revised Selected Papers, Part I | SpringerLink [Internet]. [Cited 2024 Jan 14].
AFFILIATIONS
1 Cirujano Ortopédico y Traumatología, Centro de Ortopedia y Traumatología, Centro Médico ABC Santa Fe. Ciudad de México.
2 Cirujano Ortopédico y Traumatólogo, Departamento de Ortopedia Hospital Español de México. Ciudad de México.
3 Cirujano Ortopédico y Traumatólogo, Departamento de Ortopedia Hospital Ángeles Lomas. Ciudad de México.
ORCID:
4 0000-0003-1663-9645
5 0009-0006-8158-7879
6 0009-0000-6730-2570
7 0009-0006-2622-5963
Funding: the authors declare that they received no funding, grants or other support during the preparation of this manuscript. the authors have no relevant financial or non-financial interests to disclose.
CORRESPONDENCE
Ramón González-Pola, M.D. E-mail: rgonzalezpola@gmail.comReceived: 09-24-2024. Accepted: 11-04-2024.