THE INTEGRATION OF ARTIFICIAL INTELLIGENCE IN SPINAL CARE ASSESSMENT AND SURGERY: A COMPREHENSIVE NARRATIVE REVIEW
PDF
Cite
Share
Request
REVIEW ARTICLE
VOLUME: 37 ISSUE: 1
P: 49 - 59
February 2026

THE INTEGRATION OF ARTIFICIAL INTELLIGENCE IN SPINAL CARE ASSESSMENT AND SURGERY: A COMPREHENSIVE NARRATIVE REVIEW

J Turk Spinal Surg 2026;37(1):49-59
1. Ege University Faculty of Medicine, Department of Orthopedics and Traumatology, İzmir, Türkiye
2. University of Health Sciences Türkiye, İzmir City Hospital, Clinic of Orthopedics and Traumatology, İzmir, Türkiye
3. Independent Researcher, İzmir, Türkiye
4. Dokuz Eylül University Faculty of Medicine, Department of Orthopedics and Traumatology, İzmir, Türkiye
No information available.
No information available
Received Date: 23.01.2026
Accepted Date: 03.02.2026
Online Date: 13.02.2026
Publish Date: 13.02.2026
PDF
Cite
Share
Request

ABSTRACT

rtificial intelligence (AI) and machine learning (ML) are driving a paradigm shift in spine surgery, augmenting surgical decision-making with data-driven insights. This review synthesizes the current landscape of AI applications across the surgical care continuum and evaluates its potential to enhance precision, personalization, and value. A narrative review was conducted through a critical analysis of contemporary literature, including original research, systematic reviews, and editorials from high-impact orthopaedic and spine surgery journals. Key themes were identified and organized to provide a coherent overview of AI’s role in preoperative planning, intraoperative execution, and postoperative economics. AI demonstrates significant utility in automating spinal imaging analysis, with convolutional neural networks enabling rapid vertebral segmentation and accurate measurement of alignment parameters. Predictive ML models excel in forecasting individualized patient risks, with specific algorithms outperforming surgeons in predicting complications and long-term outcomes. Intraoperatively, AI-driven navigation and robotic systems achieve a pedicle screw placement accuracy exceeding 94% while reducing radiation exposure. Furthermore, AI applications are emerging in health economics, effectively predicting costs and automating administrative tasks. Despite this, various challenges continue to hinder progress, notably the black-box nature of algorithms, data bias, ethical dilemmas, and barriers to clinical adoption.

The available evidence positions AI not as a proven superior alternative, but as a promising adjunct with proof-of-concept applications across the spine care continuum. AI serves as a powerful adjunctive tool in spine surgery, promising to enhance procedural precision, personalize patient care, and improve economic efficiency. While limitations regarding transparency, data diversity, and ethical frameworks must be addressed, the ongoing development of explainable AI and robust datasets indicates a transformative future for spinal surgical practice. To ensure safe and equitable adoption, the next steps require prospective multicenter validation, active surgeon participation in governance and education, and global collaborations to develop diverse datasets.

Keywords:
Artificial intelligence, machine learning, spine surgery, predictive analytics, surgical navigation, value-based care, explainable AI

INTRODUCTION

From its conceptual origins in Alan Turing’s theoretical work of the 1950s, artificial intelligence (AI), characterized by its capacity to emulate human intelligent behavior, has matured into a transformative force within modern healthcare. The foundational event was the 1956 Dartmouth College conference, which formally established AI as a field of study. Machine learning (ML), a core element of AI, allows systems to learn from experience and enhance their performance by discerning complex relationships in data, thereby producing inferences and predictions without being explicitly programmed for every individual scenario. The rapid expansion of literature, technology, and clinical use makes understanding AI/ML applications increasingly imperative in spine surgery, where their capacity for sophisticated pattern recognition and prediction is uniquely suited to the field’s intricate and multifactorial nature (Figure 1)(1).

The management of complex spinal pathologies, such as adult spinal deformity (ASD), tumors, and infections, demands the synthesis of a vast array of factors, from intricate radiographic parameters and biomechanical considerations to patient-specific comorbidities and goals, making surgical decision-making a highly nuanced process, particularly for conditions like ASD which require a holistic assessment of the entire skeletal structure for comprehensive radiographic evaluation(2). While traditional statistical methods are powerful for hypothesis testing and establishing associations in well-understood domains with structured datasets, such as public health, ML is better suited for generating individualized predictions from high-dimensional data in innovative fields like omics, radiodiagnostics, and personalized medicine. AI and ML algorithms excel in this predictive capacity, offering the potential to personalize care, enhance surgical precision, improve risk stratification, and optimize resource allocation. As emphasized by Ali et al.(3) technologies are driving significant transformations in spinal surgery. Neural networks enhance the accuracy of preoperative planning, while the use of augmented reality refines intraoperative navigation and reduces radiation exposure. Furthermore, postoperative predictive analytics enable risk stratification, thereby enabling improved precision in surgery, optimization of clinical workflows, and personalization of patient care.

The drive for innovation is further underscored by the alarmingly high complication rates in complex procedures. Effective presurgical planning must address critical patient-specific risk factors, such as age, body mass index (BMI), smoking, and osteoporosis, to mitigate complications, as evidenced by Akıntürk et al.(4) whose analysis of 26,207 patients revealed a 34.5% complication rate predominantly from implant failure (e.g., screw loosening, junctional kyphosis), neurologic deficits (10.8%), infection (3.6%), and cardiopulmonary events (4.8%), all of which adversely impact patient outcomes, length of stay, and readmission rates. This stark reality necessitates moving beyond traditional risk assessment and underscores the critical need for tools that can optimize every phase of care, from patient selection to postoperative management.

The proliferation of large, multi-institutional datasets, enhanced computational resources, and advanced algorithms are accelerating the adoption of AI in spine surgery, where it is enhancing diagnostics, increasing surgical precision, and enabling personalized rehabilitation through early risk assessment and adaptive therapies, despite persistent challenges such as data limitations and ethical considerations(5). The aim of this review is to synthesize recent literature findings and provide a comprehensive overview of the current state of AI in spinal surgery. It will explore the fundamental types of ML, detail its applications in imaging, surgical planning, outcome prediction, and health economics, and discuss the significant ethical and practical challenges that must be addressed for its successful integration into routine clinical practice.

MATERIALS AND METHODS

This narrative review was conducted through a synthesis of contemporary literature identified from the provided articles, which represent a cross-section of recent editorials, reviews, and original research in high-impact orthopaedic and spine surgery journals. The provided documents were systematically analyzed to extract information on the principles of AI/ML, specific applications in spine surgery (e.g., imaging, prediction models, surgical techniques, health economics), and discussed limitations.

Key themes and sub-themes were identified and organized into logical sections to construct a coherent overview of the field. The focus was placed on applications with direct clinical relevance, including:

a. The use of AI for automated measurement of spinal parameters and image segmentation.

b. The development of predictive models for surgical outcomes, complications, and cost.

c. The integration of AI into surgical navigation, robotics, and augmented reality systems.

d. The role of AI in health economics and value-based care.

e. The ethical and practical challenges facing implementation.

This approach offers a comprehensive, detailed analysis of AI’s current role in spinal surgery, incorporating the latest consensus and innovations from recent literature.

RESULTS

Fundamentals of ML in Spine Surgery

ML is broadly categorized into four main paradigms: supervised learning, which uses labeled data to map inputs to outputs for tasks such as classification and regression; unsupervised learning, which identifies hidden patterns and structures in unlabeled data through clustering and dimensionality reduction; semi-supervised learning, which leverages both labeled and unlabeled data to improve prediction accuracy when labeled data is scarce; and reinforcement learning, which enables an agent to learn optimal behaviors through environmental feedback based on rewards and penalties, a method particularly suited for complex domains such as robotics and autonomous systems(6). Understanding these paradigms is crucial for interpreting the literature. Recent reviews have highlighted an increasing emphasis on transparency and interpretability in clinical settings. In this context, explainable AI (XAI) not only provides the underlying algorithmic prediction but also supplies explanations that offer insights into the prediction’s reliability(7). Furthermore, generative adversarial networks (GANs), which employ two competing AI models (a generator and a discriminator) to produce high-quality synthetic data, are emerging as a powerful tool for medical imaging and data augmentation (Table 1)(8).

Supervised Learning: Algorithms are trained on a labeled dataset in which the target output (e.g., “fracture“ or “no fracture“) is predefined. The model acquires the ability to map input data to their correct labels and is later evaluated on unlabeled datasets to assess its performance. Common supervised models include:

Decision Trees (DT) and Random Forests (RF): These models use a tree-like structure of decisions (e.g., “Is the posterior ligamentous complex intact?”) to reach an outcome (e.g., “stable” or “unstable”). RF is an ensemble learning technique that operates by constructing a multitude of DT. This approach improves overall accuracy and mitigates the danger of overfitting, which is common in single DT. They are highly interpretable and have been used for risk stratification and classification, such as the AOSpine fracture classification, need of blood transfusion, preoperative planning/selection, patient type clustering, adverse events and serious complications(5, 9).

Support Vector Machines (SVM): SVMs are a supervised learning model used for classification, regression, and outlier detection. Their mechanism involves finding the mathematically optimal decision boundary (hyperplane) that maximizes the margin between different classes in a high-dimensional feature space. These models demonstrate particular efficacy in image-based diagnostic and prognostic tasks, including the classification of disc degeneration and scoliosis types, the automated detection and localization of lumbar spine and vertebral compression fractures, and the prediction of postoperative outcomes(5, 10).

Unsupervised Learning: Algorithms process unlabeled datasets autonomously without human guidance, discovering hidden patterns or intrinsic structures. A common application is clustering patients into novel subgroups based on a combination of clinical and radiographic features, which may predict distinct outcomes or complication profiles(11).

Artificial Neural Networks (ANN) and Deep Learning (DL): ANNs are composed of layered, interconnected nodes (neurons) designed to process input data, mirroring the structure and function of the human brain. DL refers to ANNs with many hidden layers, capable of learning complex, hierarchical representations of data. A specialized type of ANN, the convolutional neural network (CNN), is particularly powerful for image processing. Inspired by the visual cortex, CNNs are adept at processing pixel data and are the backbone of most modern medical imaging AI applications, from vertebral segmentation to automated Cobb angle measurement(12). Beyond image analysis, CNNs are increasingly employed for advanced prognostic modeling, demonstrating strong predictive utility in forecasting favorable postoperative outcomes, estimating the risk of relapse following discectomy, the diagnosis of cervical myelopathy, calculating mortality rates after surgery for spinal epidural abscess, and predicting probabilities of readmission or reoperation after posterior lumbar interlaminar fusion, thereby directly informing preoperative planning and surgical candidate selection, particularly in complex cases(6).

Semi-supervised Learning: To overcome the scarcity of annotated fracture data in spinal computed tomography (CT) segmentation, Pan et al.(13) developed a semi-supervised 2.5D U-Net framework that leverages both labeled and unlabeled datasets. Their approach incorporates a cascade design aligned with clinical workflows to enhance segmentation precision across vertebrae. In addressing computational constraints, Huang et al.(14) strategically employed 2D network training supplemented with 2.5D inputs to optimize performance. The model utilizes a dual-branch encoder with multi-scale Swin Transformer modules for improved feature extraction and introduces a level set function to ensure consistency between pixel classification and geometric regularization. This method demonstrates strong performance across evaluation metrics, highlighting the efficacy of semi-supervised learning and advanced architectural designs in medical image segmentation. In a separate clinical prediction task, Park et al.(15) evaluated several supervised ML algorithms to forecast whether patients with cervical spondylotic myelopathy would achieve a minimum clinically important difference (MCID) in neck pain following surgery. They emphasized that model selection should be guided by dataset characteristics and the specific clinical question. For their balanced dataset, precision was identified as the most relevant metric to optimize the identification of true MCID achievers. Logistic regression achieved the highest precision across both short- and long-term follow-up intervals, demonstrating consistent superiority among the tested models and reaffirming its utility for clinical classification problems.

Reinforcement Learning: In their study, Ao et al.(16) introduce SafeRPlan, a safety-aware deep reinforcement learning approach for autonomous pedicle screw placement in robotic spine surgery. This method incorporates an uncertainty-aware safety filter to ensure safe actions, uses pre-trained neural networks to compensate for incomplete intraoperative anatomical information, and employs domain randomization to improve generalization under noise. Experimental results demonstrated that SafeRPlan achieved over 5% higher safety rates compared to baseline methods, even under realistic surgical conditions.

XAI: As AI models, particularly complex DL systems, become more integral to clinical decision-making, the demand for transparency and interpretability has surged. XAI refers to a suite of techniques designed to make the predictions of these “black box” models understandable to human experts. This is achieved by providing insights into the model’s confidence, highlighting the features most influential to a decision (e.g., specific image regions in a CT scan), and generating a rationale for its output. In spine surgery, XAI is critical for fostering clinical trust and facilitating adoption, as it allows surgeons to validate an AI’s recommendation for fracture classification, surgical planning, or risk prediction before integrating it into patient care(7).

GANs: GANs represent a category of DL frameworks wherein two neural networks operate in opposition, a generator that produces synthetic data instances, and a discriminator that distinguishes between authentic and generated data. Through this iterative competition, the system progressively improves its ability to generate convincingly realistic synthetic outputs. In medical imaging, GANs address the critical challenge of data scarcity and privacy by creating high-quality synthetic spine CT or magnetic resonance imaging (MRI) images(8). These generated datasets can be used to augment limited training data, improving model robustness and generalizability, or to create anonymized data for research without compromising patient confidentiality. Applications include data augmentation for segmentation models and simulating anatomical variations for training purposes(17).

Applications in Spinal Imaging and Diagnostics

AI has made significant strides in automating and enhancing the interpretation of spinal images, reducing inter-observer variability and surgeon workload.

Automated Vertebral Segmentation and Identification: CNNs form a fundamental framework for diagnostic and therapeutic planning by allowing highly accurate, automated detection and localization of vertebrae in various imaging modalities such as X-Ray, CT, MRI, and ultrasound. These systems significantly outperform manual methods in consistency and precision, reducing the mean absolute error in Cobb angle measurements to less than 3° compared to manual variability of 2.8°-8°. AI-based approaches also demonstrate robustness in analyzing spinal curvature from suboptimal images, such as off-center, angulated, or smartphone-captured images, and support radiation-free scoliosis screening via ultrasound through automatic extraction of anatomical landmarks for 3D spinal reconstruction. Additional applications include quantitative assessment of thoracolumbar compression fractures to inform clinical management(18). This is crucial for surgical navigation systems, as it allows for automatic registration of the patient’s anatomy to preoperative images, facilitating the planning of pedicle screw trajectories. Burström et al.(19) created an automated spine segmentation algorithm for this purpose, based on 3D reconstructions obtained from cone-beam CT.

Classification of Pathology: ML algorithms excel at classifying spinal pathologies through medical imaging analysis, demonstrating particular strength in automatically grading intervertebral disc degeneration according to standardized systems such as Pfirrmann classification, with CNNs achieving remarkable agreement (up to 95.6%) with expert radiologists(20). These techniques have been successfully extended to identify various spinal conditions including stenosis, fractures, sacroileitis, and tumors. For neural compression pathologies, AI systems analyze morphological features to diagnose disc herniation and nerve root compression with high accuracy and exceptional reliability(21-23). Additionally, AI models demonstrate sophisticated diagnostic capabilities in distinguishing benign from malignant vertebral fractures on CT scans, matching or surpassing radiology residents’ performance, and in grading metastatic spinal cord compression by precisely delineating margins of involvement(24).

Automated Measurement of Radiographic Parameters: AI enables automated measurement of key spinopelvic parameters, including coronal and sagittal vertical axes, as well as key sagittal alignments such as thoracic kyphosis, lumbar lordosis, and the pelvic parameters of incidence, tilt, and sacral slope, from standing whole-spine radiographs. These AI-derived measurements demonstrate excellent agreement with expert surgical assessments, achieving intraclass correlation coefficients exceeding 0.90 and mean absolute errors below 3° or 3 mm, thereby providing a rapid and reliable alternative to manual methods(25).

Generative AI for Enhanced Imaging: Recent advances have introduced the use of GANs for anatomical image reconstruction. Santilli et. al.(26) developed a publicly available GAN model that generates synthetic STIR sequences of the lumbar spine from standard T1- and T2-weighted MRI scans. Expert radiologists assessed these synthetic datasets and judged them to be of comparable or superior quality in approximately 77% of cases, underscoring their potential to streamline and improve imaging workflows for preoperative evaluation. Importantly, the generated images were shown to be diagnostically equivalent to conventional acquisitions while demonstrating superior overall image quality, supporting their possible integration into routine clinical practice.

Predictive Modeling for Surgical Outcomes and Complications

AI enables personalized risk stratification and outcome prediction in spine surgery, advancing the field toward truly individualized patient care (Table 2)(5).

The potential of AI is not merely theoretical but now demonstrates tangible superiority in specific domains. A compelling example lies in outcome prediction, where an algorithm developed by the International Spine Study Group demonstrated 89% accuracy in forecasting risks. This stands in stark contrast to a study of 39 experienced deformity surgeons, whose predictions for the same set of cases were highly discordant and inconsistent, with estimates for complication rates ranging from 0% to 100%. This highlights the inherent limitations of human cognition when processing multivariate data and the confounding role of emotional bias, where a recent negative outcome can unconsciously skew a surgeon’s prediction for a subsequent, similar patient. This concept is further explored by Martin and Bono(27), who note that while traditional regression techniques are well-suited for assessing causation, they are poorly optimized for prediction, a gap that ML specifically aims to fill.

Predicting Complications: ML models have been developed to predict a wide range of complications with high accuracy. These include:

Reoperation and Major Complications: ML algorithms synthesize high-dimensional data from clinical, imaging, and patient sources to produce personalized risk assessments and predictions for surgical results. For instance, Scheer et al.(9) developed a model predicting major complications after ASD surgery with 87.6% accuracy, while Pellisé et al.(28) employed random forest models trained on more than 100 variables to forecast major complications, reoperations, and hospital readmissions, with model performance yielding area under the curve (AUC) scores between 0.67 and 0.92. Building upon this, sophisticated ML techniques, including LightGBM and RF, have been leveraged to generate probabilistic forecasts for ideal surgical outcomes. These are defined as a clinically significant enhancement in quality of life without major complications, achieved by incorporating modifiable risk factors into their analytical architecture.

Proximal Junctional Kyphosis/Failure (PJK/PJF): AI and ML models hold considerable promise for predicting PJK and PJF after ASD surgery, with some studies reporting prediction accuracies as high as 86%(29). For instance, research by Lee et al.(30) and Ryu et al.(31) has shown that random forest models deliver notably high accuracy and AUC values in forecasting PJK/PJF occurrence and pinpointing major reoperation risk factors. Nevertheless, Tretiakov et al.(32) note a critical limitation: although powerful, RF models may overestimate target outcomes in binary classification tasks due to elevated out-of-bag error, underscoring the importance of transparency and rigorous methodology in predictive modeling.

Pseudarthrosis: Recent advances in ML demonstrate strong predictive capabilities for postoperative complications in spine surgery. Johnson et al.(33) identified adipose tissue features on MRI as potential biomarkers for pseudarthrosis risk, independent of BMI. Further advancing this domain, Scheer et al.(34) devised ensemble decision tree-based models capable of predicting PJK/PJF with 86% accuracy (AUC: 0.89) and pseudarthrosis with 91% accuracy (AUC: 0.94) in a multicenter ASD patient population. Similarly, a separate model for predicting pseudarthrosis at 2-year follow-up after ASD surgery demonstrated 91% accuracy(35). Complementary to these approaches, Wang et al.(36) developed a nomogram model showing clinical utility for predicting pseudarthrosis probability, highlighting the growing sophistication of AI-driven prognostic tools in spinal surgery outcomes.

Surgical Site Infection (SSI): AI demonstrates promising capabilities in predicting SSI risk following spinal procedures. While a systematic review by Ndjonko et al.(37) noted that AI models show potential for excellent classification accuracy in predicting spinal SSI, the authors caution that most studies remain in early developmental stages, and reported performance metrics should be interpreted with appropriate scrutiny.

Other Outcomes: Models also predict transfusion requirements, length of hospital stay, and prolonged opioid use(5).

Predicting Patient-reported Outcomes Measures (PROMs): AI is increasingly used to predict PROMs following spine surgery, with common targets including the modified Japanese Orthopaedic Association score for cervical, Oswestry disability index for lumbar, and scoliosis research society-22 questionnaire (SRS-22) for deformity pathologies, alongside pain assessments like visual analog scale and numeric rating scale. Predictive models incorporate diverse features ranging from demographics and surgical characteristics to preoperative PROMs, imaging findings, and psychosocial factors. Research by Ames et al.(38) and Oh et al.(39) demonstrates ML’s capability to forecast quality-of-life improvements, such as achieving MCID on SRS-22 or predicting quality-adjusted life years (QALYs). A significant challenge remains the lack of PROM standardization, which complicates comparison across studies and limits consensus on optimal implementation.

Risk Stratification and Surgical Planning: AI significantly enhances risk stratification and surgical planning in spine care. Unsupervised learning models analyze hundreds of variables to create novel ASD classification systems, predicting distinct risk profiles and patient-reported outcomes to improve preoperative counseling and patient selection. For surgical planning, algorithms automate critical decisions, such as selecting the upper instrumented vertebra with 87.5% accuracy or optimizing the proximal junctional angle to prevent mechanical complications(40).

AI-enhanced Surgical Techniques: Navigation, Robotics, and Augmented Reality

AI is the engine behind several advanced intraoperative technologies that are increasing surgical precision and safety.

Augmented Reality Surgical Navigation (ARSN): ARSN systems, use CNN-based segmentation of intraoperative 3D cone-beam CT images. The system then projects the preoperatively planned screw trajectories directly onto the patient’s anatomy via a headset or display, creating an “X-ray vision” effect. This approach has been demonstrated to increase the accuracy of percutaneous pedicle screw placement to over 94%, while significantly reducing radiation exposure compared to conventional fluoroscopy(41). Recent innovations include marker-less registration that uses deep neural networks to autonomously identify spinal structures and determine their positional configuration in real-time, yielding a median angulation error of 1.6° with a translational error of 2.3 mm at the screw entry site, all without the time and radiation exposure of traditional methods(42).

Robotics: Robotic-assisted spine surgery systems rely on AI algorithms for planning and executing screw placement. The robotic arm guides the surgeon to the pre-planned trajectory based on intraoperative imaging. Studies report optimal placement rates exceeding 97-98%, comparable to the best results achieved with navigation. The robot adds a layer of precision and eliminates human tremor, standardizing a key step of the procedure. A significant learning curve exists; success rates improve and conversions to manual placement decrease with increased surgeon experience(43).

The integration of AI into preoperative planning is becoming increasingly seamless and accessible. Emerging platforms now allow surgeons to upload radiographic images via mobile applications, where algorithms automatically perform all necessary measurements and synthesize relevant risk variables to generate a patient-specific surgical plan. The efficacy of such tools is significant; they have been shown to reduce the risk of critical complications like implant failure and rod breakage following osteotomy from historical rates of up to 22% down to 4.7%, representing a monumental improvement in procedural safety and reliability(44).

AI in Health Economics and Value-based Care

AI advances value-based spine surgery through three core mechanisms: enhancing patient agency via improved health literacy and remote monitoring, automating administrative and operational tasks to reduce costs, and augmenting clinical decision-making through precise diagnostics, surgical planning, and outcome prediction. Despite its potential, AI implementation faces significant challenges including professional resistance, data quality and privacy concerns, and substantial financial investment in infrastructure(45).

Predicting Cost and Resource Utilization: ML models demonstrate significant capability in predicting financial outcomes in spine surgery. Karnuta et al.(46) implemented a Naïve Bayes algorithm that accurately predicts perioperative outcomes, including hospitalization costs, duration of admission, and discharge destination for patients undergoing lumbar fusion procedures, demonstrating good-to-excellent predictive reliability.

Cost-effectiveness Analysis: AI enables sophisticated cost-effectiveness analysis for spine surgery by integrating predictions of QALYs gained with cost projections, creating a robust framework for evaluating economic value beyond mere procedural expenses. Robotic spine surgery demonstrates cost-effectiveness through reduced revision rates, lower infections, decreased length of stay, and shorter operative times.

Operational Efficiency: AI extends its economic impact beyond the operating room into hospital administration, where algorithms can automatically extract billing codes from operative notes with approximately 90% accuracy, reducing financial losses from human coding errors and streamlining healthcare economic infrastructure. Clinically, AI enhances surgical precision through personalized interventions, particularly in scoliosis treatment where analysis of preoperative imagery helps determine the optimal level of surgical intervention tailored to individual patient needs.

DISCUSSION

The adoption of AI in spinal surgery signifies a fundamental transformation, providing new tools to improve care across all stages, including diagnosis, preoperative planning, intraoperative guidance, postoperative management, and health economic analysis. The evidence presented demonstrates that AI is moving from a research curiosity to a tangible clinical tool with validated applications in imaging, prediction, execution, and health economics.

The ability of ML models to analyze vast, complex datasets allows a more nuanced understanding of diseases like ASD. Traditional classification systems are being supplemented by data-driven clustering models that can identify patient subtypes with unique outcome profiles, enabling more personalized and effective treatment strategies. Predictive models for complications and PROMs empower surgeons to conduct detailed risk-benefit analyses with patients, setting realistic expectations and potentially avoiding high-risk surgeries in those unlikely to benefit(1, 3, 5, 6).

In the operating room, AI-driven navigation and robotics are mitigating human error and elevating the level of precision to new heights. The high accuracy rates of percutaneous screw placement with ARSN and robotics promise to improve patient safety and reduce revision rates(41, 43). Furthermore, the reduction in fluoroscopy time benefits both the patient and the surgical team. Recent advancements, such as marker-less registration and machine-vision systems, are pushing this further, reducing radiation exposure by up to 90% and significantly cutting down procedural time(42).

Perhaps most critically for the future sustainability of spine care, AI provides tools for navigating the shift to value-based care. By predicting both outcomes and costs, AI enables a more sophisticated approach to resource allocation and reimbursement, ensuring that interventions are not only clinically effective but also economically viable(47).

However, the path to widespread adoption is fraught with challenges that the spine community must address conscientiously, many of which are underscored in the latest literature (Table 3)(1, 3):

The “Black Box” Problem and the Need for XAI: The complexity of some DL models can make it difficult to understand how a specific prediction was made, which can erode clinician trust. Efforts to improve model interpretability through XAI are therefore not just a technical necessity but a cornerstone for building trust and facilitating ethical clinical adoption.

Data Bias and Equity: If training data is not representative of the broader population (e.g., lacking diversity in race, ethnicity, or socioeconomic status), algorithms can perpetuate and even amplify existing healthcare disparities. Vigilant curation of diverse datasets is essential. Chen et al.(48) pointed to the challenge of limited dataset diversity, which adversely affects the external validity and generalizability of AI-based systems.

Data Privacy and Security: The implementation of such systems necessitates access to vast quantities of sensitive patient health information. Ensuring stringent cybersecurity protocols and strict compliance with data governance regulations, such as the general data protection regulation and health insurance portability and accountability act, is essential.

Validation and Generalizability: Most models are developed and validated on retrospective data from single or limited institutions. Broader external validation in diverse, real-world settings is essential before they can be relied upon for routine clinical decision-making. Mandate external validation in independent cohorts before clinical implementation. Emerging techniques, such as federated learning frameworks, enable continuous validation and model refinement across institutions while preserving data privacy and addressing the central challenge of data heterogeneity.

Clinical Integration and Workflow: Integrating these tools seamlessly into clinical workflows, perhaps through electronic health records systems (EHR) using standards like substitutable medical applications, reusable technologies on fast healthcare interoperability resources, is another significant hurdle that must be overcome to avoid adding to clinician burden(49). This is particularly relevant given the spine surgery community’s historical reluctance to adopt new technologies that are perceived to disrupt established workflows or offer unclear cost-benefit advantages.

Ethical and Legal Liability: The issue of liability arising from errors produced by AI systems, such as a diagnostic error by a CNN, remains legally and ethically unresolved. A framework for human oversight and liability must be established.

De-skilling: There is a concern that over-reliance on AI could lead to the erosion of fundamental surgical skills and clinical acumen among surgeons(50). AI must be viewed as an augmentative tool, not a replacement for expertise.

Human Factors and Emotional Bias: Beyond processing power, AI systems offer a unique advantage: freedom from cognitive and emotional bias. AI algorithms, devoid of emotional feedback loops, provide consistent, objective predictions based solely on the empirical data of thousands of historical cases, plotting a patient’s risk on a precise curve rather than a wide, subjective range.

Limitations and Challenges

The adoption of AI technologies in spine surgery continues to encounter substantial implementation barriers, including the “black box” nature of complex algorithms, which may undermine clinical trust; limited generalizability due to data bias and homogeneity; unresolved ethical and legal concerns regarding privacy, security, and liability; and practical barriers to workflow integration and potential de-skilling. The historical reluctance of spine surgeons to adopt disruptive technologies further complicates implementation. As a narrative review, this study offers a valuable qualitative synthesis but is inherently susceptible to selection bias. Greater transparency regarding the literature search strategy and inclusion criteria would enhance reproducibility. While the review is well-structured and supported by effective tables and figures, the technical descriptions of ML architectures (e.g., CNNs, GANs) may challenge clinicians without a data science background. Incorporating a glossary or expanded contextual definitions could improve accessibility without compromising technical depth. The review thoroughly identifies adoption barriers but would benefit from discussing actionable solutions. Concrete strategies, such as interoperability standards for EHR integration, structured AI training programs for surgeons, and guidance on regulatory compliance, would provide a more practical roadmap for translating AI technologies into clinical practice.

Future Directions

Looking ahead, the role of AI in spinal procedures will probably see a more advanced and seamless integration throughout the care pathway. Current investigations are increasingly directed toward refining intraoperative techniques through real-time feedback, forecasting the most effective surgical strategies, and suggesting customized implants tailored to individual anatomical requirements. The development and adoption of XAI will be paramount to building trust and understanding model decisions. Furthermore, the use of generative AI, like GANs, for creating synthetic data to augment limited datasets is a promising frontier to combat data bias. The creation of large, diverse, multi-institutional datasets and open-access web applications that integrate ML predictions directly into the clinical workflow represent the next critical steps toward the equitable and practical point-of-care use of AI. For this future to be realized, the spine surgery community must actively engage in the development, validation, and ethical governance of these powerful tools. The journey has just begun, but the fusion of human expertise and AI marks the dawn of a new, more precise, and value-driven era in spine care.

CONCLUSION

AI is steadily transforming spine surgery, shifting practice from an experience-driven discipline toward one that is increasingly supported by objective, data-based insights. Applications in imaging, risk prediction, navigation, robotics, and economic modeling already illustrate how AI can refine precision, tailor treatment, and streamline workflows. Rather than replacing the surgeon, these tools should be understood as complementary, providing consistency and augmenting clinical judgment. For this transformation to progress responsibly, several priorities must be addressed. First, prospective multicenter trials are needed to validate algorithms in everyday clinical environments and across heterogeneous patient groups. Second, active involvement of spine surgeons in AI development and governance will ensure clinical relevance, accountability, and ethical oversight. Third, international cooperation to establish large, diverse datasets is essential to reduce bias and guarantee that innovations benefit patients globally rather than selectively. By combining rigorous validation with professional leadership and collaborative data sharing, AI can move beyond experimental promise to become a trusted partner in surgical care. This integration offers a pathway toward more precise, equitable, and value-driven spine surgery in the years ahead.

Acknowledgements

During the preparation of this work, the authors used ChatGPT for English language editing and proofreading. After using this tool, the authors reviewed and edited the content as needed and take full responsibility for the content of the publication.
The authors wish to honor the memory of Prof. Dr. Emin Alıcı, whose invaluable contributions to the establishment and development of the Turkish Spine Society and the Journal of Turkish Spinal Surgery laid the foundation for scientific progress in our field. His leadership, mentorship, and dedication continue to guide future generations. This work is dedicated to his memory with deepest respect.

Authorship Contributions

Surgical and Medical Practices: A.M.Ö., C.A., O.S., E.S., Ö.A., Concept: A.M.Ö., C.A., O.S., E.S., Ö.A., Design: A.M.Ö., C.A., O.S., E.S., Ö.A., Data Collection or Processing: A.M.Ö., C.A., O.S., E.S., Ö.A., Analysis or Interpretation: A.M.Ö., C.A., O.S., E.S., Ö.A., Literature Search: A.M.Ö., C.A., O.S., E.S., Ö.A., Writing: A.M.Ö., C.A., O.S., E.S., Ö.A.
Conflict of Interest: No conflict of interest was declared by the authors.
Financial Disclosure: The authors declared that this study received no financial support.

References

1
Kalanjiyam GP, Chandramohan T, Raman M, Kalyanasundaram H. Artificial intelligence: a new cutting-edge tool in spine surgery. Asian Spine J. 2024;18:458-71.
2
Yoshida G, Hayashi K, Boissiere L, Martin A, Berven S, Lenke LG, et al. Perioperative medical complications in adult spine deformity surgery: classification and prevention strategies. Global Spine J. 2025;15:148s-58.
3
Ali IS, Bakaes Y, MacLeod JS, Lee TY, Cho S, Hsu WK. Artificial intelligence in planning for spine surgery. Curr Rev Musculoskelet Med. 2025;18:627-34.
4
Akıntürk N, Zileli M, Yaman O. Complications of adult spinal deformity surgery: a literature review. J Craniovertebr Junction Spine. 2022;13:17-26.
5
Jawed AM, Zhang L, Zhang Z, Liu Q, Ahmed W, Wang H. Artificial intelligence and machine learning in spine care: advancing precision diagnosis, treatment, and rehabilitation. World J Orthop. 2025;16:107064.
6
Kumar R, Dougherty C, Sporn K, Khanna A, Ravi P, Prabhakar P, et al. Intelligence architectures and machine learning applications in contemporary spine care. Bioengineering (Basel). 2025;12:967.
7
Brandenburg JM, Müller-Stich BP, Wagner M, van der Schaar M. Can surgeons trust AI? Perspectives on machine learning in surgery and the importance of explainable artificial intelligence (XAI). Langenbecks Arch Surg. 2025;410:53.
8
Hussain J, Båth M, Ivarsson J. Generative adversarial networks in medical image reconstruction: a systematic literature review. Comput Biol Med. 2025;191:110094.
9
Scheer JK, Smith JS, Schwab F, Lafage V, Shaffrey CI, Bess S, et al. Development of a preoperative predictive model for major complications following adult spinal deformity surgery. J Neurosurg Spine. 2017;26:736-43.
10
Mehta SD, Sebro R. Computer-aided detection of incidental lumbar spine fractures from routine dual-energy X-Ray absorptiometry (DEXA) studies using a support vector machine (SVM) classifier. J Digit Imaging. 2020;33:204-10.
11
Staartjes VE, Klukowska AM, Stumpo V, Vandertop WP, Schröder ML. Identifying clusters of objective functional impairment in patients with degenerative lumbar spinal disease using unsupervised learning. Eur Spine J. 2024;33:1320-31.
12
Zhu Y, Yin X, Chen Z, Zhang H, Xu K, Zhang J, et al. Deep learning in Cobb angle automated measurement on X-rays: a systematic review and meta-analysis. Spine Deform. 2025;13:19-27.
13
Pan R, Wang X, Lin Z, Ho CL, James Nickalls O, Peter CA, et al. A three-stage semi-supervised learning approach to spine image segmentation. Annu Int Conf IEEE Eng Med Biol Soc. 2024;2024:1-4.
14
Huang X, Zhu Y, Shao M, Xia M, Shen X, Wang P, et al. Dual-branch transformer for semi-supervised medical image segmentation. J Appl Clin Med Phys. 2024;25:e14483.
15
Park C, Mummaneni PV, Gottfried ON, Shaffrey CI, Tang AJ, Bisson EF, et al. Which supervised machine learning algorithm can best predict achievement of minimum clinically important difference in neck pain after surgery in patients with cervical myelopathy? A QOD study. Neurosurg Focus. 2023;54:e5.
16
Ao Y, Esfandiari H, Carrillo F, Laux CJ, As Y, Li R, et al. SafeRPlan: safe deep reinforcement learning for intraoperative planning of pedicle screw placement. Med Image Anal. 2025;99:103345.
17
Islam S, Aziz MT, Nabil HR, Jim JR, Mridha MF, Kabir MM, et al. Generative adversarial networks (GANs) in medical imaging: advancements, applications, and challenges. IEEE Access. 2024;12:35728-53.
18
Kim DH, Jeong JG, Kim YJ, Kim KG, Jeon JY. Automated vertebral segmentation and measurement of vertebral compression ratio based on deep learning in X-Ray images. J Digit Imaging. 2021;34:853-61.
19
Burström G, Buerger C, Hoppenbrouwers J, Nachabe R, Lorenz C, Babic D, et al. Machine learning for automated 3-dimensional segmentation of the spine and suggested placement of pedicle screws based on intraoperative cone-beam computer tomography. J Neurosurg Spine. 2019;31:147-54.
20
Liawrungrueang W, Cholamjiak W, Sarasombath P, Jitpakdee K, Kotheeranurak V. Artificial intelligence classification for detecting and grading lumbar intervertebral disc degeneration. Spine Surg Relat Res. 2024;8:552-9.
21
Compte R, Granville Smith I, Isaac A, Danckert N, McSweeney T, Liantis P, et al. Are current machine learning applications comparable to radiologist classification of degenerate and herniated discs and Modic change? A systematic review and meta-analysis. Eur Spine J. 2023;32:3764-87.
22
Liu K, Ning J, Qin S, Xu J, Hao D, Lang N. Identifying primary sites of spinal metastases: expert-derived features vs. ResNet50 model using nonenhanced MRI. J Magn Reson Imaging. 2025;62:176-86.
23
Hallinan JTPD, Zhu L, Yang K, Makmur A, Algazwi DAR, Thian YL, et al. Deep learning model for automated detection and classification of central canal, lateral recess, and neural foraminal stenosis at lumbar spine MRI. Radiology. 2021;300:130-8.
24
Hallinan JTPD, Zhu L, Zhang W, Kuah T, Lim DSW, Low XZ, et al. Deep learning model for grading metastatic epidural spinal cord compression on staging CT. Cancers (Basel). 2022;14:3219.
25
Zerouali M, Parpaleix A, Benbakoura M, Rigault C, Champsaur P, Guenoun D. Automatic deep learning-based assessment of spinopelvic coronal and sagittal alignment. Diagn Interv Imaging. 2023;104:343-50.
26
Santilli AML, Fontana MA, Xia EE, Igbinoba Z, Tan ET, Sneag DB, et al. AI-generated synthetic STIR of the lumbar spine from T1 and T2 MRI sequences trained with open-source algorithms. AJNR Am J Neuroradiol. 2025;46:552-8.
27
Martin BI, Bono CM. Artificial intelligence and spine: rise of the machines. The Spine Journal. 2021;21:1604-5.
28
Pellisé F, Serra-Burriel M, Smith JS, Haddad S, Kelly MP, Vila-Casademunt A, et al. Development and validation of risk stratification models for adult spinal deformity surgery. J Neurosurg Spine. 2019;31:587-99.
29
Brigato P, Vadalà G, De Salvatore S, Oggiano L, Papalia GF, Russo F, et al. Harnessing machine learning to predict and prevent proximal junctional kyphosis and failure in adult spinal deformity surgery: a systematic review. Brain Spine. 2025;5:104273.
30
Lee CH, Jo DJ, Oh JK, Hyun SJ, Park JH, Kim KH, et al. Development and validation of an online calculator to predict proximal junctional kyphosis after adult spinal deformity surgery using machine learning. Neurospine. 2023;20:1272-80.
31
Ryu SJ, So JY, Ha Y, Kuh SU, Chin DK, Kim KS, et al. Risk factors for unplanned reoperation after corrective surgery for adult spinal deformity. Bone Joint Res. 2023;12:245-55.
32
Tretiakov PS, Lafage R, Smith JS, Line BG, Diebo BG, Daniels AH, et al. Calibration of a comprehensive predictive model for the development of proximal junctional kyphosis and failure in adult spinal deformity patients with consideration of contemporary goals and techniques. J Neurosurg Spine. 2023;39:311-9.
33
Johnson GW, Chanbour H, Ali MA, Chen J, Metcalf T, Doss D, et al. Artificial intelligence to preoperatively predict proximal junction kyphosis following adult spinal deformity surgery: soft tissue imaging may be necessary for accurate models. Spine (Phila Pa 1976). 2023;48:1688-95.
34
Scheer JK, Osorio JA, Smith JS, Schwab F, Hart RA, Hostin R, et al. Development of a preoperative predictive model for reaching the Oswestry disability index minimal clinically important difference for adult spinal deformity patients. Spine Deform. 2018;6:593-9.
35
Scheer JK, Oh T, Smith JS, Shaffrey CI, Daniels AH, Sciubba DM, et al. Development of a validated computer-based preoperative predictive model for pseudarthrosis with 91% accuracy in 336 adult spinal deformity patients. Neurosurg Focus. 2018;45:e11.
36
Wang D, Wang Q, Cui P, Wang S, Han D, Chen X, et al. Machine-learning models for the prediction of ideal surgical outcomes in patients with adult spinal deformity. Bone Joint J. 2025;107-B:337-45.
37
Ndjonko LCM, Chakraborty A, Petri F, Alavi SMA, Matsuo T, Borgonovo F, et al. Evaluating predictive performance and generalizability of traditional and artificial intelligence models in predicting surgical site infections postspinal surgery: a systematic review. Spine J. 2026;26:280-91.
38
Ames CP, Smith JS, Pellisé F, Kelly M, Gum JL, Alanay A, et al. Development of predictive models for all individual questions of SRS-22R after adult spinal deformity surgery: a step toward individualized medicine. Eur Spine J. 2019;28:1998-2011.
39
Oh T, Scheer JK, Smith JS, Hostin R, Robinson C, Gum JL, et al. Potential of predictive computer models for preoperative patient selection to enhance overall quality-adjusted life years gained at 2-year follow-up: a simulation in 234 patients with adult spinal deformity. Neurosurg Focus. 2017;43:e2.
40
Lafage R, Ang B, Alshabab BS, Elysee J, Lovecchio F, Weissman K, et al. Predictive model for selection of upper treated vertebra using a machine learning approach. World Neurosurg. 2021;146:e225-32.
41
Taylor C, Lam C, Manoj N, Divekar O. The clinical impact of augmented reality surgical navigation on pedicle screw placement and its effect on perioperative outcomes: a systematic review. Spine Surg Relat Res. 2024;9:269-82.
42
Liebmann F, von Atzigen M, Stütz D, Wolf J, Zingg L, Suter D, et al. Automatic registration with continuous pose updates for marker-less surgical navigation in spine surgery. Med Image Anal. 2024;91:103027.
43
Samprón N, Lafuente J, Presa-Alonso J, Ivanov M, Hartl R, Ringel F. Advancing spine surgery: evaluating the potential for full robotic automation. Brain Spine. 2025;5:104232.
44
Abolfotouh S. AI and machine learning in spine surgery—a potential game changer [Internet]. AO Foundation. 2024 June 09. Available from: https://www.aofoundation.org/spine/about-aospine/blog/2024_06-blog-abolfotouh-ai-in-spine-surgery
45
Shah R, Bozic KJ, Jayakumar P. Artificial intelligence in value-based health care. HSS Journal. 2025;21:307-13.
46
Karnuta JM, Golubovsky JL, Haeberle HS, Rajan PV, Navarro SM, Kamath AF, et al. Can a machine learning model accurately predict patient resource utilization following lumbar spinal fusion? Spine J. 2020;20:329-36.
47
Ames CP, Smith JS, Gum JL, Kelly M, Vila-Casademunt A, Burton DC, et al. Utilization of predictive modeling to determine episode of care costs and to accurately identify catastrophic cost nonwarranty outlier patients in adult spinal deformity surgery: a step toward bundled payments and risk sharing. Spine (Phila Pa 1976). 2020;45:e252-65.
48
Chen T, Su ZH, Liu Z, Wang M, Cui ZF, Zhao L, et al. Automated magnetic resonance image segmentation of spinal structures at the L4-5 Level with Deep Learning: 3D reconstruction of lumbar intervertebral foramen. Orthop Surg. 2022;14:2256-64.
49
Karhade AV, Schwab JH, Del Fiol G, Kawamoto K. SMART on FHIR in spine: integrating clinical prediction models into electronic health records for precision medicine at the point of care. Spine J. 2021;21:1649-51.
50
Taghipour N, Mostafavinia A, Hosseini S, Javaherinasab M, Mehran H, Alighadr A, et al. Surgeon attitudes toward ai-enhanced robotic surgery: a survey on autonomy, liability, and skill erosion. InfoSci Trend. 2025;2:29-41.