All published articles of this journal are available on ScienceDirect.
Volumetric Measurement of Alveolar Clefts for Bone Graft Planning: A Systematic Review
Abstract
Introduction
To investigate protocols for volumetric measurement of alveolar defect in alveolar cleft cases, including innovations in Artificial Intelligence (AI).
Methods
Searches were conducted using PubMed, Embase, and Scopus databases, along with a hand search. Based on inclusion and exclusion criteria, 17 studies were selected. Additionally, discussions on the protocols included workflow and anatomical landmarks associated with the measurement.
Results
Thirty workflows were identified and categorized into virtual and 3D printing-based approaches. A 3D U-Net architecture was employed for segmentation and measurement using artificial intelligence. Various anatomical landmarks for defining alveolar cleft boundaries were described. The average volume of the alveolar cleft was 1.61 cm3.
Discussion
The majority of the studies were published within 3 years of the article search, indicating an increased desire to optimize the utilization of 3D imaging beyond simple assessments. Recent developments in AI have simplified complex imaging tasks; hence, volumetric assessments are expected to increase in the future.
Conclusion
The expert workflow with the most supporting evidence is manual tracing on the axial slice. Studies using AI are emerging and need to be explored. The anatomical landmarks advocated by this review are cementoenamel junction, anterior nasal spine, and continuity with the alveolar segments with adequate labio-palatal thickness as superior, inferior, and labio-palatal borders, respectively. Nonetheless, more studies are needed to help create a technical guideline.
1. INTRODUCTION
Cleft lip, alveolus, and/or palate are the most common craniofacial abnormalities [1]. Oral clefts occur in 0.3–0.45 per 1,000 births globally [2], and about 75% of cleft cases involve the alveolar bone and primary palate [3]. In those cases, Alveolar Bone Graft (ABG) surgery is recommended to ensure the eruption of lateral incisors and canines, maintain periodontal health of adjacent teeth, support orthodontic movement, provide a foundation for dental implants, reinforce the base of the nose, and close oronasal fistula [1]. However, both overfilling and underfilling of alveolar bone grafts have been reported, which lead to reduced success [4, 5]. Although there are several other factors that may influence the success of alveolar bone graft surgery, such as cleft type, eruption stage of canines, presence of fistulae, age, and preoperative orthodontics [3, 6, 7], any mismatch in the bone filling can be prevented by preoperatively determining the volume of graft needed. The clinical advantages of volumetric measurement in alveolar bone graft surgical planning, especially with iliac ridge donors, are discussed in the study conducted by Virani et al. [1]. Additionally, some institutions use donor sites with more limited bone resources, such as chin bone. In these cases, volumetric assessment informs surgeons about the amount of bone required, whether the selected donor site can provide sufficient volume, and whether additional allograft material is needed or a more suitable donor site should be chosen [2]. Additionally, alveolar cleft volume is also useful in predicting prognosis of ABG surgery [3-5].
Three-dimensional imaging for individualized evaluation is currently recommended due to measurement inaccuracies of 2D imaging [8, 9]. Unlike most other pathologies, alveolar defects lack borders on certain sides, which complicates measurement. Therefore, the definition of boundary landmarks is especially critical to ensure consistent and reliable surgical practice and measurement. The authors believe that a clear definition of boundary landmarks is needed for true evidence-based surgery, and this review is presented as the first step to precisely define and achieve consensus on alveolar cleft boundaries.
Volumetric measurement of alveolar cleft requires particular software and specialised skills. Radiologists must have full information in order to properly decide on a suitable measurement method instead of being limited by the software they are familiar with. It is also anticipated that volumetric analysis and planning of ABG will be more commonplace in the future due to the ease provided by semi-automatic/automatic volumetric segmentation [10]. It is imperative that automated volumetric segmentation is created with evidence-based data and methodology, as proposed in this review.
Therefore, this review is intended to compile and evaluate information on different types of available workflows and landmarks for volumetric preoperative measurement of alveolar defects in cleft alveolus and palate, including innovations in AI. To our knowledge, this is the first review to evaluate them in detail to establish more standardized landmarks and protocols for future studies.
2. METHODS
The review was registered with the International Prospective Register of Systematic Reviews (PROSPERO ID: CRD42023479149) and conducted according to Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) [11]. The question posed in this review was: What expert-based and AI-based protocols are available for volumetric alveolar defect measurement in cleft alveolus and/or palate? The scope of the study was predetermined using the PICO framework. The population included patients with cleft alveolus and/or palate planned for alveolar bone graft. The intervention involved protocols utilising CT/CBCT images for measuring alveolar cleft defects by an expert or AI. There was no specific comparison, but comparisons were made between protocols (either by expert or AI) or with the actual graft volume. The outcome of the study focused on the preoperative three-dimensional measurement of alveolar defect that resulted in volumetric sizes, such as cm3, mm3, or ml. By incorporating studies that employed one or more measurement methods, this review provides a comprehensive overview of the available protocols. Further, discussions on the protocol in this review included workflow and landmarks associated with the measurement.
2.1. Eligibility Criteria
Inclusion criteria were cross-sectional research journal articles that reported defect volumetric size utilizing imaging data from CT or CBCT, focused on the protocols of preoperative volumetric measurements for alveolar bone graft in cleft cases, and were full text and written in English.
The exclusion criteria included studies involving syndromic cleft, gingiva- or periosteoplasty, and bone graft for purposes other than filling the alveolar defect in the cleft.
Studies were also excluded if they had no description of the workflow for volume measurement, involved simulated defect and animal models, and were published as review articles, case reports, case series, expert opinions, and conference poster abstracts. Additionally, protocols described in volumetric measurements to evaluate or compare clinical interventions and assessments of bone graft outcomes or post-operative assessments were considered a minor focus and, therefore, excluded.
2.2. Search Strategy
Searches were conducted on PubMed, Embase, and Scopus databases in November 2023, with access provided through the author’s institution library access. Search strategies are listed in Table 1, which were optimised based on the rules for the search engine of each database to obtain the most relevant articles and the least amount of non-relevant articles. MesH terms in PubMed and Emtree in Embase were included in the search strategy. Additional relevant articles were also hand-searched through Google Scholar and reference lists. Titles and abstracts were screened by two examiners following inclusion and exclusion criteria. Afterward, full-text articles were retrieved and assessed for inclusion and exclusion by the same investigators. All reasons for exclusion were recorded. Consensus between the first, second, and third authors was sought if there was any ambiguity.
- | PubMed | Embase | Scopus |
---|---|---|---|
#1 | ((cleft lip[MeSH Terms] OR "cleft palate"[MeSH Terms])) AND (("alveolar bone grafting"[MeSH Terms] OR "bone transplantation"[MeSH Terms])) | ('cleft lip'/exp OR 'cleft palate'/exp) AND ('bone graft'/exp OR 'bone transplantation'/exp OR 'alveolar bone grafting'/exp OR 'alveolar bone'/exp OR 'alveolar bone defect'/exp) | (KEY(cleft AND lip) OR KEY(cleft AND palate) OR KEY (cleft AND lip AND palate)) AND (KEY(bone AND graft) OR KEY(alveolar AND bone AND graft*) OR KEY(bone AND transplant*)) |
#2 | (Cleft lip* OR cleft palate*) AND (alveolar bone graft* OR bone transplant*) | (cleft* AND lip* OR (cleft* AND palate*)) AND (alveol* AND bone AND graft* OR (bone AND transplant*)) | (TITLE-ABS-KEY(cleft* AND lip*) OR TITLE-ABS-KEY(cleft* AND palate*)) AND (TITLE-ABS-KEY(alveol* AND bone AND graft*) OR TITLE-ABS-KEY(bone AND transplant*)) |
#3 | (3D OR 3-D OR three*dimension* OR cone*beam ct OR cone*beam computed tomography OR CBCT OR computed*tomography OR CT OR CT*scan) | (3D OR '3-D' OR three?dimension* OR (cone?beam AND ct) OR (cone?beam AND computed AND tomography) OR cbct OR computed?tomography OR ct OR ct?scan) | TITLE-ABS-KEY(3d) OR TITLE-ABS-KEY(3-d) OR TITLE-ABS-KEY(three dimension*) OR TITLE-ABS-KEY(cone beam ct) OR TITLE-ABS-KEY(cone beam computed tomography) OR TITLE-ABS-KEY(cbct) OR TITLE-ABS-KEY(computed tomography) OR TITLE-ABS-KEY(ct) OR TITLE-ABS-KEY(ct scan) |
#4 | (volume OR volumetric) | volum* | TITLE-ABS-KEY(volum*) |
#5 | (AI OR A.I OR artificial intelligence OR deep learning OR machine learning OR neural network OR CNN OR ANN OR support vector machine OR support vector network OR random forest OR decision tree) | (ai OR 'a.i' OR (artificial NEAR/2 intelligence) OR (deep NEAR/2 learning) OR (machine NEAR/2 learning) OR (neural NEAR/2 network) OR CNN OR ANN OR (support NEAR/2 vector NEAR/2 machine) OR (support NEAR/2 vector NEAR/2 network) OR (random NEAR/2 forest) OR (decision NEAR/2 forest)) | (TITLE-ABS-KEY(AI) OR TITLE-ABS-KEY(A.I) OR TITLE-ABS-KEY(artificial AND intelligence) OR TITLE-ABS-KEY(deep AND learning) OR TITLE-ABS-KEY(machine AND learning) OR TITLE-ABS-KEY(neural AND network) OR TITLE-ABS-KEY(CNN) OR TITLE-ABS-KEY(ANN) OR TITLE-ABS-KEY(support AND vector AND machine) OR TITLE-ABS-KEY(support AND vector AND network) OR TITLE-ABS-KEY(random AND forest) OR TITLE-ABS-KEY(decision AND tree) |
#6 | #1 AND #3 AND #4 | #1 AND #3 AND #4 | #1 AND #3 AND #4 |
#7 | #2 AND #3 AND #4 | #2 AND #3 AND #4 | #2 AND #3 AND #4 |
#8 | #1 AND #3 AND #5 | #1 AND #3 AND #5 | #1 AND #3 AND #5 |
#9 | #2 AND #3 AND #5 | #2 AND #3 AND #5 | #2 AND #3 AND #5 |
2.3. Data Extraction
The articles were noted for their inclusion and exclusion criteria, samples, patient characteristics, cleft type, study results, modality, workflow, anatomical landmarks of the defect borders, software and hardware, and reliability assessment by two investigators. The workflows were summarized by dividing them into several general types. They were all summarised, tabulated, noted if missing, presented, and discussed in the paper.
2.4. Study Quality Assessment
The quality appraisal tool, JBI Critical Appraisal Checklist for Analytical Cross-Sectional Studies [12], was used for quality assessment. Some of the parameters were modified for the study to clarify essential requirements for a robust and reproducible measurement protocol, specifically including assessments of the gold standard, reliability, and objective anatomical landmarks.
Q4 was considered positively assessed only if landmarks for the superior, inferior, and buccopalatal borders were clearly described. If either one or two sides of the borders were missing, they were considered unclear. Q7 received a positive assessment if all steps were specified with clarity that allowed for easy and unambiguous repetition. If there was no description of workflow, the study was excluded as part of the exclusion criteria. If multiple methods were addressed in the study, all methods were considered in unison for quality assessment.
3. RESULTS
3.1. Study Selection
A total of 1,594 titles and abstracts were imported into the Zotero reference manager (version 6.0.27) to be manually reviewed for duplicate removal. One study was flagged and retracted due to self-plagiarism, resulting in 233 articles. After reviewing titles and abstracts based on inclusion and exclusion criteria, 202 articles were removed. A total of 31 studies were read in full text, and 14 of them were excluded based on the same criteria. At the end of the study, 17 articles were included. The search process diagram is presented in Fig. (1).
3.2. Study Quality Assessment
The most common quality issues identified were a lack of comparison to the gold standard (only 24% of the studies were positively assessed in Q3a), lack of inter- and intraobserver reliability (29% in Q3b), and incomplete anatomical landmarks (24% in Q4) (Fig. 2). It is worth noting that three studies failed to report reliable results despite having multiple observers/measurements, leading to questions about selective reporting [13-15].
3.3. Study Characteristics
A summary of the sampling methods, patient characteristics, cleft types, and main findings from all included studies is presented in Table 2. The average defect volume for unilateral cleft only, bilateral cleft only, and all types of clefts were 1.38 cm3, 1.72 cm3, and 1.61 cm3, respectively. Most studies did not differentiate based on the involvement of the palate; therefore, the volumetric difference between CL and CLP could not be evaluated.
Table 3 summarises technical details, including modality, software and hardware, type of workflow, and anatomic landmarks. A total of 10 studies used CBCT [10, 15-23], and 7 studies used CT [5, 13, 14, 24-27]. Thirty workflows for alveolar defect volume measurement were found among 17 studies. Measurement by experts can be broadly divided into virtual and 3D printing methods. The most often used workflow for virtual measurement was manual segmentation in each axial slice [5, 13, 15, 16, 18, 20, 21, 23-27], followed by orthogonal tracing or editing [15, 17], mirrored template using automatic superimposition [14, 27], geometric estimation [22], region growing [19], and custom algorithms [19]. Most of the studies used commercial medical image segmentation, analysis, and design software (Mimics, Materialise, Leuven, Belgium) for virtual measurements. With 3D printing, 6 studies printed the patient’s maxilla [13, 14, 17, 21, 24, 26], while one study made a physical model of the defect itself [19]. One additional study investigated the use of artificial intelligence through a 3D U-Net architecture for segmentation and measurement [10].
Various anatomical landmarks were described in the literature (Table 3). Six studies provided a complete description of virtual landmarks for superior, inferior, and buccopalatal boundaries of the alveolar cleft [5, 16, 17, 21, 23, 27]. Six studies only included partial virtual landmarks [10, 15, 19, 20, 25, 26], whereas 5 others offered none [13, 14, 18, 22, 24]. Only one landmark description was described for 3D printing [21]. None were needed for AI.
Additionally, 4 studies showed comparisons to actual graft volume, which varied from measuring the actual graft intra-operatively [5, 15] to making impressions during surgery [18] or imaging 1 month after surgery [10]. Intraclass correlation coefficient (ICC) measurements for reliability ranged from 0.305 – 0.80.
4. DISCUSSION
This systematic review aimed to investigate existing protocols for the three-dimensional measurement of alveolar defects in individuals with cleft alveolus and palate. Most ABG surgeries are conducted with autologous bone material. Therefore, accurate determination of the defect volume before bone harvesting is important to achieve optimal outcomes with minimal morbidity [4, 5]. Overharvesting leads to an increased risk of morbidity, such as infection, hematoma, bleeding, nerve injury, fracture, and extended hospitalization [17, 28]. Conversely, underfilling of the defect could lead to the failure of canine eruption, failed orthodontic movement or implant placement, and compromised facial aesthetics due to reduced bony support [1, 29]. The amount of bone graft used is directly proportional to the alveolar bone graft success [30]. Some studies have reported success with synthetic or allograft materials, where preoperative volume determination also aids in thorough planning, ensuring cost-effectiveness, and reducing operator dependency [31, 32].

Search process diagram.

Quality assessment of included studies.
Author, Year | Sample Size (Sampling Technique) | Cleft Type | Patient Characteristics | Main Findings |
---|---|---|---|---|
Shirota et al., 2010 | 13 | UCL UCLP BCL BCLP |
- 61.5% males, 38.5% females - Mean age 22.10 years - 76.9% unilateral cleft lip and palate, 15.4% bilateral cleft lip and palate, 7.7% unilateral cleft lip and alveolus |
Average volume (3.8 ± 0.8 cm3), average volume of gold standard (3.5 ± 0.8 cm3), comparison with gold standard not statistitically significant and closely correlated |
Choi et al., 2012 | 47 | UCL UCLP BCL BCLP |
- 61.7% males, 38.3% females - Mean age 9.8, ranged 8 - 11 years - Unilateral cleft lip and alveolus 27.6% - Unilateral cleft lip and palate 38.3% - Bilateral cleft lip and alveolus 4.3% - Bilateral cleft lip and palate 29.8% |
Average volume (1.2 ± 0.4 cm3), average volume of gold standard (1.3 ± 0.5 cm3), unilateral vs bilateral cleft, comparison with gold standard. (Intraobserver reliability result were not reported despite having multiple measurements) |
Quereshy et al., 2012 | 14 (randomised) | UCL UCLP |
not specified (all demographic data were anonymized for the investigators) | Average volume (0.4890 ± 0.1516 cm3); intraobserver ICC for cleft width: 0.879, cleft height: 0.827, cleft length: 0.305 |
Lee et al., 2013 | 7 | Complete UCLP | - 42.86% males, 57.14% females - Mean age 11.4 ± 2.0 years, ranged 8.8 - 14.8 years |
Average volume (2.4 ± 1.2 cm3), interobserver ICC > 0.90, average volume of gold standard (2.5 ± 1.3 cm3), difference with gold standard |
Du et al., 2017 | 10 | UCL UCLP |
- 70% males, 30% females - age ranges 8-12 years - 60% cleft lip and palate, 40% cleft lip only |
Average volume and difference (A: 1.47 cm3; B: 1.52 cm3). (Interobserver and intraobserver reliability result were not reported despite having multiple observers and measurements) |
Kasaven et al., 2017 | 15 (consecutive) | UCLP | not specified | Average volume and difference (A: 0.57556 cm3; B: 0.54043 cm3; C: 0.66262 cm3) |
Chen, G. C. et al., 2018 | 10 | UCLP | - 70% males, 30% females - age ranges 8-12 years |
Average volume (1.81197 ± 0.81790 cm3), time, correlation between volume and time |
- | - | - | - | - |
Chou et al., 2019 | 32 (consecutive, determined by sample size calculation) | UCL BCL |
68.75% unilateral cleft, of which 59.1% is male and 40.9% is female. Mean age is 9.1 ± 0.2 years 31.25% bilateral cleft, of which 60% is male and 40% is female. Mean age is 9.6 ± 0.7 years |
Average volume unilateral (A: 1.09 ± 0.25 cm3; B: 1.09 ± 0.24 cm3) vs bilateral cleft (A: 2.02 ± 0.27 cm3; B: 2.05 ± 0.22 cm3), volume difference, Intraobserver ICC method A (0.824-0.838) and method B (0.765-0.896), interobserver ICC method A (0.650- 0.769) and method B (0.7699-0.873) |
Chen, S. et al., 2020a | 12 | BCL BCLP |
- 66.6% males, 33.3% females - age ranges 8-11 years - 75% cleft lip and palate, 25% cleft lip only |
Average volume (A: 1.52 cm3; B: 1.45 cm3), volume left vs right, time. (Interobserver and intraobserver reliability result were not reported despite having multiple observers and measurements) |
Chen, S. et al., 2020b | 10 | UCL UCLP |
- 70% males, 30% females - age ranges 8-13 years - 80% cleft lip and palate, 20% cleft lip only |
Average volume and difference (A: 1.42 cm3; B: 1.39 cm3), intraobserver ICC: 0.95 - 0.97, Interobserver ICC: 0.94 - 0.98, time |
Kochhar et al., 2021 | 31 (determined by sample size calculation) | UCL UCLP |
- 54.8% males, 45.2% females - Mean age 11 ± 0.98 years, ranged 8-12 years - 58% unilateral cleft on right side, 42% unilateral cleft on left side |
Average volume between non-oriented (Left: 2.26 ± 1.16 cm3; Right: 1.75 ± 0.69 cm3) and oriented (Left: 2.73 ± 1.27 cm3; Right: 2.25 ± 0.72 cm3), left vs right, difference, intraobserver ICC > 0.90, interobserver ICC > 0.80 |
Liu et al., 2021 | 20 | UCL UCLP |
- 65% males, 35% females - Age ranged 8-12 years - 70% cleft lip and palate, 30% cleft lip |
Average volume (A: 1.27 ± 0.35 cm3; B: 1.23 ± 0.32 cm3), interobserver ICC method A: 0.966 and method B: 0.980 |
Sh et al., 2021 | 20 (convenience sampling, determined by sample size calculation) | UCL UCLP |
- 40% males, 60% females - Mean age 10 ± 1.02 years |
Average volume by expertise and slice thickness (radiologist: 1.15; 1.14; 1.14 cm3, surgeon: 1.17 cm3), volume by gold standard (1.08 ± 0.60 cm3), difference to gold standard statistically significant |
Wang et al., 2021 | 60 Group 1: 30 samples for expert measurement and development of AI (training: 24; validation: 3; testing: 3) Group 2: 30 samples for evaluation by AI |
UCLP | - 65% males, 35% females - Mean age 11.52 ± 3.27 years, ranged 8 - 18 years) - 68.3% unilateral left defect, 31.5% unilateral right defect |
Average volume (1.24 ± 0.29 cm3), height vs width vs length, intraobserver ICC method A: > 0.90, interobserver measurements were not statistically different (p = 0.39), similarity (Dice coefficient) |
Abdelhamid et al., 2022 | 12 | UCL UCLP |
- 58% males, 42% females - 50% unilateral left alveolar cleft, 50% unilateral right alveolar cleft - Mean age 10.6 +/- 2.1 |
Average volume (A: 1.19 ± 0.04 cm3; B: 1.17 ± 0.04 cm3) and difference, interobserver ICC method A: 0.998 and method B: 0.626, time |
Li et al., 2023 | 100 (randomised) | UCL UCLP |
- 61% males, 39% females - Mean age 19.32 years, ranged 13-42 years |
Average volume (A: 1.55 ± 0.42 cm3; B: 1.58 ± 0.41 cm3), intraobserver ICC (both methods): 0.98 - 0.99, interobserver ICC (both methods): 0.95 - 0.99, time |
Phienwej et al., 2023 | 20 | UCLP | - 65% males, 35% females - Mean age 12.10 ± 3.92 years, ranged 8-20 years |
Average volume (A: 1.00 ± 0.31 cm3; B: 1.03 ± 0.31 cm3), mean difference, Intraobserver ICC method A: 0.996 and method B: 0.949 |
- | - | - | - | - |
Detailed discussions on the protocol in this review include workflow and landmarks. These workflows can be generally divided into virtual and 3D-printing workflows, which are described below. Only one study that explored AI was found in this review, which was published in 2021. This is consistent with the rising development of AI in general [33]. Recent developments in deep learning have enabled complex imaging tasks that were previously impossible with conventional or older AI technologies [33]; hence, more studies involving AI are expected to emerge in the future.
More than half of the studies were published within 3 years of the article search, indicating an increased desire to optimise the utilisation of 3D imaging beyond simple visual assessment or linear measurements. While the majority of
Author, Year | Modality | Software and Hardware | Summary of Workflow | Anatomical Landmarks |
---|---|---|---|---|
Abdelhamid et al., 2022 | CBCT | A: OnDemand3D (Cybermed Inc., Seoul, South Korea) | A: traced axially slice by slice | S: ANS level I: CEJ of tooth mesial to the cleft M/D: cleft borders L: continuity of mesiolabial and distolabial dentoalveolar margins P: continuity of mesiopalatal and distopalatal bony margins |
B: InVesalius 3 (CTI, Campinas, Brazil) | B: traced axially slice by slice |
|||
Chen, G. C. et al., 2018 | CT | Mimics (Materialise, Leuven, Belgium) | Traced axially slice by slice, subtracting maxilla |
L: a line connecting mesial and distal alveolar segments P: a line connecting mesial and distal alveolar segments |
Chen, S. et al., 2020a | CT | A: Mimics | A: traced axially slice by slice, subtracting maxilla | - |
B: Mimics + Mass Portal XD30 (Mass Portal SIA, Riga, Latvia) | B: 3D-printing of maxilla, water displacement technique | - | ||
Chen, S. et al., 2020b | CT | A: Mimics | A: traced axially slice by slice, subtracting maxilla | - |
B: Mimics + Mass Portal XD30 | B: 3D-printing of maxilla, water displacement technique | - | ||
Chou et al., 2019 | CBCT | A: SimPlant Pro (Materialise Dental, Leuven, Belgium) | A: traced orthogonally | S: plane between ANS up-tilted to the lateral segments I: plane between CEJs L/P: continuity to the maxillary arch |
B: Objet30 OrthoDesk 3D Printer, (Stratasys, Rehovot, Israel) | B: 3D-printing of maxilla, water displacement technique | - | ||
Du et al., 2017 | CT | A: Mimics for STL conversion; Geomagic Studio 2013 (Geomagic, Morrisville, USA) | A: Mirrored template | - |
B: Mimics + Zprinter 350 (Z Corporation, Burlington, USA) | B: 3D-printing of maxilla, water displacement technique | - | ||
Etemadi et al., 2021 | CBCT |
A: OnDemand 3D | A: traced axially slice by slice | - |
B: - | B: Intra-operative gold standard, using silicone impression material | - | ||
Kasaven et al., 2017 | CBCT | A: MATLAB (The Mathworks Inc, Natik, USA) | A: semiautomatic algorithm, axial slice by slice | S: slice in which the alveolar defect was first seen I: slice in which bifurcation of first molar was seen in cleft side |
B: Volume Graphics Studio Max 2.2 (Volume Graphics, Heidelberg, Germany) | B: region growing, axial slice by slice | |||
C: Mimics; CatalystEx 4.2 (Stratasys, Eden Prairie, USA); Dimension Printing (Stratasysinc, Eden Prairies, Minnesota, USA); microCT X-Tek BT 160 UF (Nikon Metrology X-Tek Systems Ltd, Tring, UK) | C: 3D-printing of defect, volume measured with microCT | - | ||
Kochhar et al., 2021 | CBCT | Osirix (Pixmeo Inc., Genève, Switzerland) | A: traced axially slice by slice, non-oriented | S: at sight of bone defect I: CEJ of teeth adjacent to the defect M/D: margin of the cleft L: following the contour of the contralateral side P: following the contour of the contralateral side |
B: traced axially slice by slice, oriented | ||||
Lee et al., 2013 | CBCT | Dolphin Imaging Version 11.0.3.9 (Dolphin Imaging, Chatsworth, USA); XnView (XnSoft, Reims, France); SkyScan Dataviewer (Bruker-microCT, Kontich, Belgium) | A: traced axially slice by slice | S: ANS I: inferior border of alveolar crest |
B: Post-operative gold standard from CBCT imaging (steps not specified) | - | |||
Li et al., 2023 | CT | A: Mimics | A: multiple slice edit, subtracting maxilla | S: most inferior part of piriform aperture at non-cleft side I: CEJ of central incisor |
B: Mimics; Mass Portal XD30 | B: 3D-printing of maxilla, water displacement technique | - | ||
Liu et al., 2021 | CT | A: Mimics; Geomagic Wrap 2017 | A: mirrored template, subtracting maxilla | Not needed (automatic superimposition) |
B: Mimics | B: traced axially slice by slice, subtracting maxilla | S: 6 slices (3 mm) inferior to ANS, parallel to reference plane I: A plane parallel to greater palatine foramens and CEJ of buccal surface central incisor on the cleft side. Also serves as reference plane. L/P: following outlines of contralateral side |
||
Phienwej et al., 2023 | CBCT | A: Mimics v.17 | A: traced multiple slices axially | S: plane from ANS, most lateral and inferior point of pyriform aperture in cleft side, and the most superior point of mesiopalatal margin of the lateral alveolar segment I: plane from inferior of mesiolabial margin of the medial alveolar segment, inferior of mesiolabial margin of the lateral alveolar segment, inferior of mesiopalatal margin of the medial alveolar segment M/D: margins of the cleft L: linear line connecting the labial edges of the alveolar segments P: linear line connecting the palatal edges of the alveolar segments |
B: Mimics v.17 ); Form 2 3D printer (Formlabs Inc., Massachusetts, USA) | B: 3D-printing of maxilla, water displacement technique | S: same as method A I: same as method A |
||
Quereshy et al., 2012 | CBCT | InVivo (Anatomage Inc, San Jose, USA) | Geometric estimation | - |
Shirota et al., 2010 | CBCT | A: SimPlant Pro ver. 8.1 | A: traced axially, edited orthogonally | S: inferior margin of anterior nasal aperture I: inferior margin of alveolar bone adjacent to the defect |
B: - | B: Intraoperative actual bone volume using syringe method | - | ||
Choi et al., 2012 | CT | A: Radipia 3D 2.8 (Infinitt Healthcare, Seoul, South Korea) | A: traced axially slice by slice | S: floor of pyriform aperture I: alveolar crest of adjacent alveolar segments L/P: the labiopalatal dimension was made as thick as the adjacent normal alveolar bone |
B: Ondemand 3D 1.0 | B: traced axially slice by slice | |||
C: - | C: Intraoperative actual bone volume using syringe method | - | ||
Wang et al., 2021 | CBCT | A: ITK-Snap (Penn Image Computing and Science Laboratory, Philadelphia, USA)) | A: manual and semi-automatic segmentation | L/P: following the contour of contralateral maxillary arch |
B: PyTorch, 3D U-Net | B: AI Deep learning | Not needed |
the studies used CBCT, about 41% used CT. Additionally, discussions on the measurement accuracy associated with modality, X-ray parameters, voxel size, and slice thickness are outside the scope of the review. However, studies have shown that larger bony anatomical structures, such as cleft, are equally visible and dimensionally accurate in both modalities [34-36]. For these reasons, we consider the protocols described in this review to be viable for both modalities with various parameters.
Current studies revealed that the determination of landmark of cleft boundaries was seen as a low priority and was rarely reported. Precise operational definition of each landmark is crucial to reduce errors [37]. Therefore, this article is intended to help future researchers formulate precise and appropriate landmarks for alveolar cleft defect measurement.
4.1. Evaluation of Workflow for Virtual Measurements
The most commonly used workflow involved manually tracing the defect boundaries slice by slice in the axial view [5, 13, 16, 18, 20, 23-25, 27]. This technique is simple although possibly tedious, but can be achieved by a lot of software programs. Manual tracing has been tested against actual bone volume with clinically acceptable accuracy. Due to the manual nature of this technique, reproducibility is of particular concern. The studies utilising various software reported reliability ICC values ranging from 0.626-0.980 [5, 16, 20, 21, 23, 24, 26, 27].
An open-sourced software (Invesalius, CTI, Campinas, Brazil), which tends to be harder to use, was an outlier [16], presenting significantly worse reliability. Excluding this software, the reliability for axial tracing was at least 0.80. This indicates that the choice of software must also be considered, and it is best to use software programs that have been validated for clinical use.
Based on the findings in this review, commercial medical image processing software (Mimics, Materialise, Leuven, Belgium) offers multiple methods and has a reasonable amount of supporting evidence. Multiple-slice tracing was also used to accelerate the measurement process. However, both studies that used multiple-slice tracing did not mention the spacing distance between traced slices. Nevertheless, tracing in the axial slice seemed to be an adequately reliable method based on available evidence, although more studies are needed to confirm its accuracy.
Two studies evaluated a dental implant planning software (SimPlant Pro, Dentsply Sirona, Charlotte, USA) using manual orthogonal tracing or editing [15, 17]. Reported reliability was suboptimal, with intraclass correlation coefficients ranging from 0.650 to 0.838. However, the workflow and anatomical landmarks in the virtual method were described with good clarity. Based on these results and missing reliability data, orthogonal tracing or editing with this implant planning software could not be advocated at present due to the lack of evidence.
Mirroring the maxilla to use as a template in unilateral cases was explored in two studies, and both used automated superimposition [14, 27]. A personalised template made from the patient’s anatomy has the potential to reduce operator variability in shaping optimal alveolar shape in filling the defect, especially using automated superimposition. However, asymmetry of the nasal pyriform associated with unilateral clefts potentially complicates superimposition. Although there is no comparison to actual graft volume, this review found comparable accuracy to tracing slice-by-slice and 3D printing [14, 27]. However, reporting must be improved, especially in terms of reliability and comparison to actual graft volume.
Quereshy et al. measured defect width, height, and thickness to estimate volume using a geometric formula [22]. Intraobserver ICC was poor for defect thickness (0.305). Workflow and landmark anatomy were both unclear, which may have contributed to decreased reliability. Additionally, the measurements seemed to have been identified from a 3D-rendered virtual model rather than from sliced images, which decreases accuracy.
Barbosa et al. also found that simple geometric estimation is significantly inaccurate compared to true volumetric measurements [28]. Consequently, this technique should be avoided.
Semi-automatic techniques were used in two studies [10, 19]. Kasaven et al. explored a region-growing tool and a custom algorithm with industrial and materials science image analysis software (Volume Graphics Studio Max, Hexagon, Stockholm, Sweden) and a programmable image processing and analysis software (MATLAB, The MathWorks, Inc., Natick, USA), respectively [19]. These two techniques are highly dependent on density boundaries; thus, they have limited suitability to cleft images and require significant manual refinement. Wang et al. used a combination of manual and semi-automatic processes in an open-source medical image segmentation software (ITK-Snap, Penn Image Computing and Science Laboratory, Philadelphia, USA) [10]. However, the lack of clarity in the workflow described by Wang et al. left some uncertainty in the exact tools used. The study reported 10 hours of work per CBCT, which is a considerable difference from the minutes of work reported with other software (Mimics, Materialise, Leuven, Belgium) [16]. Therefore, time versus cost might be an important consideration in deciding which software to use. Currently, there is not enough evidence to promote these semi-automatic techniques for clinical purposes.
Wang et al. also investigated AI technology by using 3D U-Net, an AI architecture based on convolutional neural network [10]. The architecture was designed to work with small training datasets and large images, specifically to segment medical images [38]. During the training of the AI system, Wang et al. split each CBCT image into patches in the sliding window technique to be trained and create an output of labels for the segmentation. Optimization was done via generalised dice loss [10]. The similarity of AI segmentation compared to manual segmentation was 0.77 +/- 0.06, which was considered moderate to high [39]. Several other studies have also recently used AI for cleft [40, 41]. The use of AI for identifying and measuring the volume of alveolar cleft is promising, and further development is needed. It should be noted that to have any clinical value. AI systems must be developed using evidence-based data and applied under the supervision and approval of clinicians.
4.2. Evaluation of Workflow for 3D Printing
3D printing itself has been established to have adequate accuracy for medical and dental use [42, 43]. For alveolar cleft measurement, typically, plasticine was filled by observers into a 3D-printed skull. Thus, measurement was based on the plasticine filling rather than any 3D-printed object. The printed skull itself may be useful for education or precise, personalised planning [44, 45]. An exception to this was the research by Kasaven et al., in which a defect-shaped object was printed. This protocol could create 3D-printed biomaterial or scaffold [46] to aid alveolar bone graft surgery in addition to mere measurement. The time required for the 3D printing method was consistently longer than virtual measurements [13, 24, 26], which could be a significant drawback. Although cost was not part of the research from any of the included studies, it should also be part of consideration when choosing the technique.
In terms of cleft volume measurement by 3D-printed methods, this review found comparable accuracy to many types of virtual workflow, such as axial tracing [13, 21, 24, 26], orthogonal tracing [17], mirrored template [14], and region-growing techniques [19]. Yet, no studies were found that compared 3D printing-based cleft measurement to actual bone graft volume. Reported reliability ranged from 0.765 to 0.949 [19, 21, 24], although two studies did not provide reliable data [13, 14]. One study found moderate interobserver reliability between 2 cleft surgeons [17]. This was possibly due to the absence of defined landmarks for cleft filling in all 3D-printed protocols, except for the one proposed by Phienwej et al., who created superior and inferior boundaries for the defect, printed as part of the skull model [21]. While this attempt was appreciated, such physical boundaries would not be useful if the maxilla model is to be used as a simulation for surgery due to altered anatomy. Even though the research was done on 3D-printed objects, the fact that suboptimal reliability was found between cleft surgeons using physical models [17] could be significant. An argument could be made that this unreliability could extend to the surgery itself, supported by studies that found operator dependency of alveolar bone graft surgery [47, 48] and evidence of under- or overfilling [4, 5].
Additionally, literature on alveolar bone graft surgery frequently gives little description of landmarks or guidance for adequate defect filling. This highlights the importance of preoperative cleft volume measurements to guide surgeons and reduce operator dependency. Nonetheless, 3D printing for cleft care is promising and warrants future exploration with particular regard to landmark description and reliability.
4.3. Landmarks
Landmarks to determine borders of the alveolar defect should preferably be reliable, fast, and easy to determine. More importantly, they should have clinical significance and be compatible with good functional and aesthetic outcomes. Landmarks chosen for pre-operative determination of defect borders should reflect landmarks seen during surgery. However, none of the studies that provided a complete protocol for anatomical landmarks in this review had a comparison to actual bone volume.
In the current review, five studies used Cemento-enamel Junction (CEJ) [16, 17, 23, 26, 27], and another four used alveolar ridge [5, 15, 20, 21] as landmarks for the inferior margin. The normal alveolar ridge is 1-2 mm below CEJ and even farther in periodontitis or in cases of the alveolar cleft with bone dehiscence. CEJ has the advantage of being relatively constant and easily recognizable in imaging. Normal bone height and CEJ are both used as benchmarks for success evaluation [6, 49-52] and are agreeable with one another [53]. Therefore, CEJ may be preferable as a landmark for the inferior margin of alveolar defect.
For the superior margin, four studies used landmarks associated with the Anterior Nasal Spine (ANS) [15-17, 20], and two studies used a base of pyriform aperture [5, 26]. Phienwej et al. used both landmarks, describing the superior limit as a plane defined by ANS, the most lateral and inferior point of the pyriform aperture on the cleft side, and the most superior point of the mesiopalatal margin of the lateral alveolar segment [21]. Liu et al. set the superior limit at 3 mm below ANS [27]. Both ANS and pyriform aperture have been reported in the literature to be used as surgical landmarks and success measures [6, 54-58]. In cleft cases, the base of the pyriform aperture is harder to identify because it is absent in a complete alveolar defect and often asymmetrical compared to the normal side in unilateral clefts. ANS, as the superior limit, has the most supporting evidence and may be more desirable.
Alveolar thickness and shape may be of concern in the labio-palatal dimension. Three studies followed the contour of the contralateral side [10, 23, 27], two studies followed the continuity of the maxillary arch [16, 17], and another two studies used straight lines connecting the alveolar segments [21, 25]. One study prioritised thickness by dictating the defect filling to have equal thickness with adjacent alveolar segments [5]. However, not many studies evaluating volumetric ABG outcomes have described labio-palatal margins. Nagashima et al. used a mirror image of the contralateral side as a benchmark for the outcome, while Kibe et al. used a tangent line connecting the labial and palatal edges of the defect [31, 59]. As long as good continuity of the arch is achieved, literature is sparse to determine if graft shape (contoured versus straight line) affects functional or facial aesthetic outcomes. Additionally, the choice between ideal contour and straight lines might also be practical as some software may not be able to create contoured lines.
Furthermore, the majority of the studies have used graft thickness as a success measure. Padwa et al. determined graft success if the labio-palatal thickness of the graft is ≥75% of adjacent tooth root width [6]. Stasiak et al. determined a moderate outcome if the thickness is ≥50% - ≤100% of central incisor root width and a good outcome if the thickness is ≥100% [30]. Ideally, it should be at least 8 mm thick to accommodate the size of canines [29]. To comply with the outcome assessment, we recommend prioritizing good continuity with the arch and appropriate thickness as labiopalatal guidance.
Creating planes for defect limit would be more time-consuming than point landmarks, and current evidence does not clearly indicate whether it improves reliability. Point landmarks would be faster and easier to create but are more prone to patient positioning errors. One study reported that non-oriented and oriented data had equal performance [23]. With current studies, there is no evidence to discourage the use of point landmarks.
Among articles that used 3D printing, only one study attempted to provide some guidance for identifying the borders of the filling [21]. Planning the operational definition of anatomical landmarks for the defect borders is important to obtain the most reliable measurements. Although most of the reported reliability was good to excellent, a lot of the inter- and/or intraobserver reliability data in these studies were incomplete. In this regard, standardisation of border landmarks would create comparability between studies. Therefore, future studies with more thorough reporting of the landmarks and reliability are needed.
4.4. Limitations
Several studies were identified that included indirect comparisons to actual bone volume, but the validity of these appointed gold standard measurements was out of the scope of this study. Factors such as modality, X-ray parameters, voxel size, and slice thickness were also not evaluated in detail. However, evidence has shown that the choice of modality (CT and CBCT) and parameters do not significantly affect measurements of large structures, such as cleft [34, 60]. While some aspects of treatment flow (such as time and cost) were briefly discussed in the review, this aspect is rarely discussed in the studies, especially in regard to the surgery itself. Moreover, further research is needed to address important deficiencies in the current literature, such as investigating the validity and accuracy of each measurement method, reliability, and standardising landmarks to ensure volume comparability between studies.
Furthermore, the influence of preoperative volume determination on treatment flow and surgical consistency must be covered in areas of investigation. Recommendations in this review are based on available literature and professional judgement from experts with decades of experience; however, some aspects still require further evidence. Other factors that may impact surgical success and consistency are beyond the scope of this review. Nevertheless, the findings of this review would be important to pinpoint gaps in current knowledge and establish more standardized landmarks and protocols for future studies.
CONCLUSION
Diverse protocols for measuring the volume of alveolar defects in cleft alveolus and/or palate are available in the literature, employing various workflow and anatomical landmarks. The workflow with the most supporting evidence is manual tracing in the axial slice. Landmarks advocated by this review are CEJ, ANS, and continuity with the alveolar segments exhibiting adequate labio-palatal thickness as superior, inferior, and labio-palatal borders, respectively. Additionally, studies using AI to help measure alveolar defect volume are emerging and need to be explored.
The absence of a widely accepted consensus for the preoperative volumetric measurements of alveolar defect in cleft alveolus and/or palate has been observed in this review. Thus, the creation of valid, reliable, and standardized guidelines is needed for clinical and research applications of alveolar cleft volume. To help create a technical guideline, future studies must consider methodological and reporting quality as a priority, especially regarding the presence of gold standard, inter- and intraobserver reliability, and complete description of anatomical landmarks.
AUTHORS’ CONTRIBUTIONS
The authors confirm their contribution to the paper as follows: I.S., M.P., N.N., and H.S.: Study conception and design; I.S., M.P., and B.K.: Data collection; I.S., M.P., N.N., D.A., and B.K.: Analysis and interpretation of results; and I.S., M.P., N.N., and H.S.: Draft manuscript. All authors reviewed the results and approved the final version of the manuscript.
STANDARDS OF REPORTING
PRISMA guidelines and methodology were followed.
PRISMA checklist is available on the publisher’s website
AVAILABILITY OF DATA AND MATERIALS
The data and supportive information is available within the article.
FUNDING
This study was financially supported by the Universitas Indonesia (NKB-663/UN2.RST/HKP.05.00/2024).
ACKNOWLEDGEMENTS
The authors would like to acknowledge Universitas Indonesia (NKB-663/UN2.RST/HKP.05.00/2024) for supporting the publication fees of this article.