Authors: Niha Adnan ( Department of Surgery, Aga Khan University Hospital, Karachi, Pakistan. )
Fahad Umer ( Department of Surgery, Aga Khan University Hospital, Karachi-Pakistan. )
February 2022, Volume 72, Issue 2
Narrative Review
Abstract
The developments in Artificial Intelligence have been on the rise since its advent. The advancements in this field have been the innovative research area across a wide range of industries, making its incorporation in dentistry inevitable. Artificial Intelligence techniques are making serious progress in the diagnostic and treatment planning aspects of dental clinical practice. This will ultimately help in the elimination of subjectivity and human error that are often part of radiographic interpretations, and will improve the overall efficiency of the process. The various types of Artificial Intelligence algorithms that exist today make the understanding of their application quite complex. The current narrative review was planned to make comprehension of Artificial Intelligence algorithms relatively straightforward. The focus was planned to be kept on the current developments and prospects of Artificial Intelligence in dentistry, especially Deep Learning and Convolutional Neural Networks in diagnostic imaging. The narrative review may facilitate the interpretation of seemingly perplexing research published widely in dental journals.
Keywords: Artificial Intelligence, Deep learning, Machine learning, Dentistry, Imaging, Neural networks, Convolutional neural network, Intraoral radiography, Object detection, Semantic segmentation, Instance segmentation, Big data.
DOI: https://doi.org/10.47391/JPMA.AKU-12
Introduction
Artificial Intelligence (AI) deals with the science of simulating human behaviour in machines.1The exponential development in technology over the past few decades has led to the evolution of AI, making it an innovative research area across a wide array of industries.2The two subsets of AI are Machine Learning (ML) and Deep Learning (DL) (Figure-1) .3

ML is a subdomain of AI, and DL is a further subdivision of ML. These techniques can be used to develop AI architectures and subsequently trained on an abundant set of existing data.4The consequence is an architecture capable of making accurate predictions on new, unseen data.4Both techniques are widely used today, revolutionising the AI industry.
AI originated as a single-layered neural network called 'Perceptron', laying the groundwork for the progression of DL techniques.5However, it could not experience the growth that was anticipated due to challenging technological barriers that existed at the time of its incipience.6These included a lack of computing power and the unavailability of extensive annotated datasets for training these architectures.6
The work in this field continued to some extent under different aliases until 2006 when AI experienced a paradigm shift, giving precedence to 'big data' in developing AI architectures.7 This was after the ImageNet database worked for the advancement of computer vision techniques.7,8 ImageNet is a collection of over a million annotated images accumulated from the internet and has been leveraged into training various algorithms since its commencement.7,8 This instigated a rampant growth that both ML and DL experienced for the first time since their inception.3
The very premise of AI is to learn from the data that it collects; the more data that is collected and analysed, the more accurate the algorithm becomes at making predictions.9 Big data has been the driver for AI advancements, but no amount of data will ever replace human intuition.
A detailed discussion on ML can be found in literature.3 The current narrative review planned to keep its focus on DL techniques.
Deep Learning techniques
DL comprises multi-layered artificial neural networks (ANNs), modelled after the rudimentary picture of a human brain. The basic structure of an ANN consists of an input layer, one or more hidden layer(s) and an output layer, with each layer composed of nodes, also known as artificial neurons (Figure-2).5

The 'deeper' the DL algorithm, the more hidden layers it contains, allowing for more 'features' to be extracted from a given dataset. Every layer is highly interconnected with the others; and the passing of data from one layer to the next is a process converting the output of one layer into input for the next.5 The algorithm learns the 'maps' between the given inputs and outputs and with sufficient training, it is able to apply those learned patterns on unseen input data to derive accurate output predictions.1
DL algorithms can process large amounts of data and its performance improves as it analyses more data.9 The 'feature extraction' (Table) step is already incorporated into DL algorithms, eliminating the need for manually determining relevant features for it to extract and learn, which is a prerequisite for ML techniques.9 After an appropriate level of training, the DL algorithm can extract hidden features autonomously with little to no explicit programming.9

Training a DL algorithm
The process of developing an AI algorithm involves three basic steps: training, validating and testing (Figure-3).

Training and validation employ the same annotated dataset, whereas testing is conducted on a different, unseen, annotated dataset.9 By running the training data through an algorithm, it is trained to produce a particular prediction with each epoch. Usually, the data is fed through it in iterations. These iterations are run through the algorithm in distinct patterns, allowing it to learn every possible aspect of the 'features' embodied in the training dataset.9 This leads to better 'generalisation' of the important features contained within the dataset, enabling its applicability on a wide range of unseen data in the future.9 With each epoch, the parameters and hyperparameters of the algorithm are adjusted to ensure greater accuracy of the predictions. A parameter is a variable that is internal to the algorithm and the value is determined by the algorithm itself.9 Whereas a hyperparameter is a value external to the algorithm set manually by the data scientist operating it, and can be tuned manually to further increase accuracy.9 After the efficacy of the algorithm has been validated on the same dataset, testing data is run through it.9 The results make the adequacy of training evident via the accuracy of predictions made by the algorithm.9 This process can be repeated and the performance can be improved further by running more annotated training data until the desired accuracy level is achieved.
Convolutional Neural Network (CNN)
The most established algorithm among DL architectures is the Convolutional Neural Network (CNN).2 It has been in use predominantly for computer vision tasks. CNNs explicitly process data with complex patterns, such as images, with the goal to mimic functions of the human visual cortex.2,10 'Image processing' is the specific feature extraction task performed by these neural networks. This is executed by the special convolutional layer that is incorporated into an otherwise traditional DL algorithm (Figure-4).2,10

The role of CNNs is to reduce the image according to pixels, making the image easier to process without losing important features.2 Hence, these neural networks can make accurate predictions whilst simultaneously allowing for a reduction in the computational power required to effectuate these tasks.9 These are being trained widely to be applied on radiographic images in the fields of medicine and dentistry.1
Evaluating performance of DL algorithms
Different metrics are utilised in DL to discern the performance of algorithms assessed against the ground truth determined by subject specialists/ annotators in the 'testing' dataset.1 The application of various metrics is imperative to ensure the optimal quality of the algorithm.1
These performance metrics are run on the testing dataset, giving true positives and negatives, as well as false positives (FP) and false negatives (FN) of the predictions generated by the architecture.1 The values are consequently used to determine the precision, accuracy, recall/sensitivity, and F1-score of the trained algorithms measuring their true capabilities.11 Other metrics include the receiver operating curve (ROC), area under the curve (AUC) and dice index to evaluate the performance of architectures.1,11 The pertinent selection of relevant metrics for evaluation of the various AI architectures is a crucial task. It is dependent upon the specific task being carried out by the algorithm as well the aspect of its training that requires evaluation. For example, F1 score is used to determine the robustness of an algorithm.1 This aspect of algorithm training requires the proficiency of data scientists11 and a thorough discussion on this aspect of algorithm training is far beyond the scope of the current paper.
CNNs in dentistry
The evolution of AI in medicine and dentistry have been evident in the studies published in the past few years.1 AI has mostly been involved in the diagnostic aspects of these fields, approaching the level of expertise of the clinicians.10
In dentistry, CNNs can detect and segment anatomical structures and pathological conditions on radiographic images.1 This makes it relatively easy for clinicians to distinguish the positions and characteristics of teeth and any associated pathology as part of their preliminary clinical examination.2 In general, CNNs have performed well in recognising and classifying teeth on both two-dimensional (2D) and 3D radiographs.1 They are being studied as adjuncts in the identification of dental pathology, including caries, periapical lesions, periodontal bone loss, vertical root fractures, osteoporosis, cancers of the head and neck region, as well as for working length determination in endodontics, etc.6,12-18
Theoretically, adequately-trained CNN architectures can perform as well as clinicians, with the potential to surpass the level of their performance and diagnostic abilities. The aim is to simplify the provision of dental care, make it cost-effective, and concurrently eliminate human error in diagnoses and treatment plans devised by clinicians.2
Tasks performed by CNNs
The various computer vision tasks performed by CNNs include Object Classification and Localisation which entailes assigning labels to a single object in an image and localising their positions via a bounding box (Figure-5A); Object Detection which includes assigning labels to every object in an image belonging to a particular group via bounding boxes (Figure-5B); Semantic Segmentation in which individual pixels are labelled into groups via fluid margins (Figure-5C); and Instance Segmentation which employs Object Detection as well as Semantic Segmentation, labelling each pixel in every group via fluid margins (Figure-5D).9

Challenges and Prospects
The applications of AI in dentistry is lagging behind by a few years compared to that of medicine.12 This is largely due to the cost of acquisition and annotation of big data required for sufficient training of AI algorithms, which is needed to ensure robustness of the architecture.12
The ability of the algorithm to make predictions is where the 'generalisability' of an algorithm is important to consider.1 The key to improving the generalisability of an algorithm is determined by the extensiveness of the dataset used to train the model.3 'Overfitting' of data is one of the main challenges in the applicability of AI, as an overfitted algorithm is not generalisable on unseen data.19 A method to reduce overfitting is via data augmentation, which includes flipping, translation, cropping, rotating and applying filters on the existing images in a dataset.8 This ensures the training of the algorithm on every possible presentation of the training data.8
An abundance of labelled data is desirable but rarely available in medical and dental imaging, and, hence, another method called 'transfer learning' can be applied.20 Transfer learning is a technique where a neural network that is pre-trained to predict a 'general feature', like a car, in an image is then utilised on a different dataset with seemingly incompatible features, like a tooth, for the architecture to recognise after minimal preliminary training on the relevant features.20 The exact mechanism of this 'smart behaviour' remains unknown, giving rise to the 'black-box effect' where the exact workings that allow DL architectures to map input to output remain unclear.19 Employing this technique, however, significantly reduces training time and computational resources while producing quality architectures.20
The lack of open-access, standardised and annotated datasets to train AI algorithms is the main impediment in the exponential growth of AI in dentistry.12 The annotation of medical images adds to the cost of time as well as the cost of subject specialists needed for accurate annotation of datasets. The provision of standard quality datasets for training algorithms will allow for their consistent clinical implementation. The limitations of computing power has largely been overcome, but can be improved further by incorporating quantum computation which is exponentially faster and makes it a valuable platform that should be explored to cause further advancement in AI.12
There is a need for better understanding of the processes involved in the AI as well as overcoming its vulnerabilities to allow for its applicability in crucial, patient-related diagnostic problems.12 The ultimate goal is to train a machine to think like humans and perform tasks with more accuracy and greater speed, lowering resource utilisation in the long run.
Conclusion
AI has several potential applications in the fields of medicine and dentistry and there is a need for clinicians to be cognizant of all the past and current advancements. Being aware of the key concepts, workings, strengths as well as limitations of AI in clinical practice is crucial for clinicians to work towards further improving diagnoses and patient care.
Disclaimer: None.
Conflict of Interest: None.
Source of Funding: None.
References
1. Pethani F. Promises and perils of artificial intelligence in dentistry. Aust Dent J 2021;66:124-35. doi: 10.1111/adj.12812.
2. Pauwels R. A brief introduction to concepts and applications of artificial intelligence in dental imaging. Oral Radiol 2021;37:153-60. doi: 10.1007/s11282-020-00468-5.
3. Silva G, Oliveira L, Pithon M. Automatic segmenting teeth in X-ray images: Trends, a novel data set, benchmarking and future perspectives. Expert Syst Appl 2018;107:15-31.
4. Schwendicke F, Samek W, Krois J. Artificial Intelligence in Dentistry: Chances and Challenges. J Dent Res 2020;99:769-74. doi: 10.1177/0022034520915714.
5. Ramchoun H, Idrissi MAJ, Ghanou Y, Ettaouil M. Multilayer Perceptron: Architecture Optimization and Training. Int J Interact Multimed Artif Intell 2016;4:26-30. DOI: 10.9781/ijimai.2016.415
6. Shan T, Tay FR, Gu L. Application of Artificial Intelligence in Dentistry. J Dent Res 2021;100:232-44. doi: 10.1177/0022034520969115.
7. Deng J, Dong W, Socher R, Li L, Li K, Fei-Fei L. ImageNet: A large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, USA: IEEE, 2009; pp 248-55. doi: 10.1109/CVPR.2009.5206848.
8. Shin HC, Roth HR, Gao M, Lu L, Xu Z, Nogues I, et al. Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning. IEEE Trans Med Imaging 2016;35:1285-98. doi: 10.1109/TMI.2016.2528162.
9. Yamashita R, Nishio M, Do RKG, Togashi K. Convolutional neural networks: an overview and application in radiology. Insights Imaging 2018;9:611-29. doi: 10.1007/s13244-018-0639-9.
10. Hwang JJ, Jung YH, Cho BH, Heo MS. An overview of deep learning in the field of dentistry. Imaging Sci Dent 2019;49:1-7. doi: 10.5624/isd.2019.49.1.1.
11. Lee JH, Han SS, Kim YH, Lee C, Kim I. Application of a fully deep convolutional neural network to the automation of tooth segmentation on panoramic radiographs. Oral Surg Oral Med Oral Pathol Oral Radiol 2020;129:635-42. doi: 10.1016/j.oooo.2019.11.007.
12. Schwendicke F, Golla T, Dreher M, Krois J. Convolutional neural networks for dental image diagnostics: A scoping review. J Dent 2019;91:e103226. doi: 10.1016/j.jdent.2019.103226.
13. Kim J, Lee HS, Song IS, Jung KH. DeNTNet: Deep Neural Transfer Network for the detection of periodontal bone loss using panoramic dental radiographs. Sci Rep 2019;9:17615. doi: 10.1038/s41598-019-53758-2.
14. Mahmood H, Shaban M, Indave BI, Santos-Silva AR, Rajpoot N, Khurram SA. Use of artificial intelligence in diagnosis of head and neck precancerous and cancerous lesions: A systematic review. Oral Oncol 2020;110:e104885. doi: 10.1016/j.oraloncology.2020.104885.
15. Lee JS, Adhikari S, Liu L, Jeong HG, Kim H, Yoon SJ. Osteoporosis detection in panoramic radiographs using a deep convolutional neural network-based computer-assisted diagnosis system: a preliminary study. Dentomaxillofac Radiol 2019;48:e20170344. doi: 10.1259/dmfr.20170344.
16. Aminoshariae A, Kulild J, Nagendrababu V. Artificial Intelligence in Endodontics: Current Applications and Future Directions. J Endod 2021;47:1352-7. doi: 10.1016/j.joen.2021.06.003.
17. Johari M, Esmaeili F, Andalib A, Garjani S, Saberkari H. Detection of vertical root fractures in intact and endodontically treated premolar teeth by designing a probabilistic neural network: an ex vivo study. Dentomaxillofac Radiol 2017;46:e20160107. doi: 10.1259/dmfr.20160107.
18. Umer F, Habib S. Critical Analysis of Artificial Intelligence in Endodontics: A Scoping Review. J Endod 2021:S0099- 2399(21)00802-5. doi: 10.1016/j.joen.2021.11.007. [ahead of print]
19. Umer F, Khan M. A call to action: concerns related to artificial intelligence. Oral Surg Oral Med Oral Pathol Oral Radiol 2021;132:e255. doi: 10.1016/j.oooo.2021.04.056.
20. Tan C, Sun F, Kong T, Zhang W, Yang C, Liu C. A Survey on Deep Transfer Learning. In: K?rková V, Manolopoulos Y, Hammer B, Iliadis L, Maglogiannis I, eds. Artificial Neural Networks and Machine Learning - ICANN 2018. Cham, Switzerland: Springer, 2018; pp 270-9. Doi: 10.1007/978-3-030-01424-7_27.
Related Articles
Journal of the Pakistan Medical Association has agreed to receive and publish manuscripts in accordance with the principles of the following committees:




