Skip to main content
  • Research article
  • Open access
  • Published:

Application of Neural Networks for classification of Patau, Edwards, Down, Turner and Klinefelter Syndrome based on first trimester maternal serum screening data, ultrasonographic findings and patient demographics

Abstract

Background

The usage of Artificial Neural Networks (ANNs) for genome-enabled classifications and establishing genome-phenotype correlations have been investigated more extensively over the past few years. The reason for this is that ANNs are good approximates of complex functions, so classification can be performed without the need for explicitly defined input-output model. This engineering tool can be applied for optimization of existing methods for disease/syndrome classification. Cytogenetic and molecular analyses are the most frequent tests used in prenatal diagnostic for the early detection of Turner, Klinefelter, Patau, Edwards and Down syndrome. These procedures can be lengthy, repetitive; and often employ invasive techniques so a robust automated method for classifying and reporting prenatal diagnostics would greatly help the clinicians with their routine work.

Methods

The database consisted of data collected from 2500 pregnant woman that came to the Institute of Gynecology, Infertility and Perinatology “Mehmedbasic” for routine antenatal care between January 2000 and December 2016. During first trimester all women were subject to screening test where values of maternal serum pregnancy-associated plasma protein A (PAPP-A) and free beta human chorionic gonadotropin (β-hCG) were measured. Also, fetal nuchal translucency thickness and the presence or absence of the nasal bone was observed using ultrasound.

Results

The architectures of linear feedforward and feedback neural networks were investigated for various training data distributions and number of neurons in hidden layer. Feedback neural network architecture out performed feedforward neural network architecture in predictive ability for all five aneuploidy prenatal syndrome classes. Feedforward neural network with 15 neurons in hidden layer achieved classification sensitivity of 92.00%. Classification sensitivity of feedback (Elman’s) neural network was 99.00%. Average accuracy of feedforward neural network was 89.6% and for feedback was 98.8%.

Conclusion

The results presented in this paper prove that an expert diagnostic system based on neural networks can be efficiently used for classification of five aneuploidy syndromes, covered with this study, based on first trimester maternal serum screening data, ultrasonographic findings and patient demographics. Developed Expert System proved to be simple, robust, and powerful in properly classifying prenatal aneuploidy syndromes.

Peer Review reports

Background

A normal human cell is made up of 46 chromosomes that are grouped into homologous pairs (or classes): 44 autosomes, and two sex chromosomes, which specify gender (XX for female and XY for male) [1]. Each chromosomal homologous pair consists of one maternal and one paternal chromosome that pair up with each other inside a cell during meiosis [2]. Chromosomal disorders fall into two main categories such as numerical and structural abnormalities [3]. Chromosomal anomalies or aneuploidy, represented primarily by numerical change, are the single greatest contributor to prenatal morbidity and mortality [4]. Since chromosomal abnormalities are powerful in detection and diagnosis of various genetic disorders, chromosome analysis (karyotyping) is a fundamental clinical procedure most frequently performed in genetic laboratories. There are several reasons for referral for cytogenetic analysis, but advanced maternal age is still the major reason. Maternal age is a major factor in producing aneuploidy in humans. The most frequent anomaly associated with maternal age is Trisomy 21 [5]. Affected fetuses can be identified early in pregnancy through amniocentesis, thus providing the woman with the option for selective termination or continuation of the pregnancy. It is most important that accurate genetic testing and counseling is provided.

All obstetricians generally offer first trimester maternal serum screening for aneuploidy to their pregnant patients, irrespective of patient’s age. This minimally invasive screening test provides patient with a risk assessment and is not to be used as definitive diagnosing tool. If the results of maternal serum screening are concerning and suggestive of trisomies the patient may opt for confirmatory diagnostic methods, which would require patient to undergo an invasive procedure amniocentesis or chorionic villus sampling (CVS). A diverse range of diagnostic tests are currently available for the detection of prenatal chromosomal aberrations. Karyotyping, fluorescence in situ hybridization (FISH), quantitative fluorescence polymerase chain reaction (QF-PCR), array comparative genomic hybridization (aCGH), and the next-generation sequencing (NGS) are the common methods used for prenatal diagnostics [4,5,6,7,8,9,10].

Karyotyping analysis aims to assess the possible presence of genetic defects, identify individual chromosomes in a metaphase cell and arrange them in order based on the established atlas [11]. Today, use of image processing and artificial intelligence techniques has considerably increased in many medical practice fields. In automated cytogenetics, general computerized image processing and analysis techniques as well as rule based classification algorithms for karyotyping have been in use, since the 70s, replacing the human based cutting up chromosome photographs with scissors and their human based arrangements.

Developing tools for disease classification can be extremely extensive and challenging task, especially when the association between input and target values is non-linear and depending on multiple factors [12]. Machine learning methods such as Artificial Neural Networks (ANNs) have been considered as promising tools for overcoming these difficulties since they do not require analytical model of observed process [12]. The theory of neural networks is still growing field due to their ability to derive meaning from complicated or imprecise data and because they use different approach, parallel data processing instead of algorithmic approach to problem solving like conventional computers. Different ANN architectures have been used for various purposes, such as classification, pattern recognition, prediction, control and optimization [13]. The neural networks, in terms of data processing, mimic physical structure of human nervous system consisting of artificial neurons. These units serve as processors that are interconnected and organized into layers [14]. The relationship between input and output is determined by the network architecture and learning algorithm [14]. Learning is an iterative process of adjusting ANN inner parameters, weights and biases until the performance criteria is met. Most usually that performance criteria is threshold of error function. The ability that differentiates ANNS to other data processing tools is the ability to learn and improve its performance from examples. Once trained, ANN is able to predict unknown future outcomes of the same process. ANNs can be classified into two groups based on internal information flow: feedforward and feedback neural networks. Most commonly used type of feedforward architecture is the one with back propagation learning algorithm and as for feedback neural network, Elman’s architecture is most commonly used.

The usage of ANN in disease classification happens very often [15,16,17,18,19,20,21,22,23,24,25,26,27], though there have been only few studies investigating neural networks in genome-enabled predictions and classifications [28, 29]. As for cytogenetic analysis, in recent years, several research groups have developed and tested different ANNs for the classification of metaphase chromosomes. The main goal in these studies was to develop automated computer-assisted banded chromosome detection, using various methods and various ANNs architectures for classification of all 24 types of chromosomes [30,31,32,33,34]. Although many efforts have been made to develop computerized schemes for automated karyotyping and syndrome diagnosis, no schemes can get be performed without substantial human intervention [31, 35]. On the other hand, to date ANNs have not been used for analyzing cytogenetic data or screening data in determining or predicting trisomies or any prenatal syndromes.

In this paper, instead of developing an automated method to classify chromosome classes used for determining and distinguishing between Turner, Patau, Klinefelter, Edwards and Down syndrome, we focus on developing an automatic scheme for classifying chromosomal trisomies using results from combined/double screening tests in the first trimester of pregnancy, ultrasonographic findings and maternal age. Validation of system output was done by karyotyping or known pregnancy outcome.

Methods

Sample collection

The database consisted of 2500 pregnant women data who received a routine antenatal care between January 2000 and December 2016. The first trimester combined test was offered routinely at 11 to 13.6 weeks of gestation, to measure the values of maternal serum levels of maternal serum pregnancy-associated plasma protein A (PAPP-A) and free beta human chorionic gonadotropin (β-hCG). All women underwent ultrasound examination also to acquire information on fetal nuchal translucency thickness and the presence or absence of the nasal bone.

Ultrasound verification was done on the General Electric 730 Volusion Exp, with convex probe 3.5–7 MgHz (multidimensional scanning). Additional parameter in the database was maternal age since the correlation between maternal age and presence of the prenatal syndrome has been proven [5]. After ultrasound examination, maternal blood was sampled using serum separator tubes (4 ml each) for the analysis. The serum was separated by centrifugation and stored at 4 °C until being tested. The blood samples were analyzed by using an automated Siemens Immulite 1000 system (Siemens Healthcare, Erlangen, Germany). Analysis of NT thickness, PAPPA-A and β-hCG, along with patient demographics was performed using the Prisca software (Siemens Healthcare, Erlangen, Germany). The pregnancy outcome was known for all the subjects included in the study known, either by amniocentesis or by childbirth. Those women with an elevated risk of (≥1 in 250) of carrying a fetus with trisomy 13, 18, or 21, and those women with advanced maternal age were offered counseling with the option for invasive diagnostic test. In cases when combined screening test estimated lower risk for trisomies, the outcome was further validated by childbirth.

Cytogenetic analysis on long-term cultured amniocytes was performed using standard manufacturer’s protocol (Amniomax, Invitrogen, Carlsbad, CA, USA). Metaphase chromosome spreads were prepared on a glass microscope slide in accordance with standard cytogenetic procedure and in accordance with the European Society of Human Genetics (ESHG) and European Cytogenetics Association (E.C.A) guidelines [36,37,38]. Chromosomes were aged and banded using G-Bands by pancreatin and Giemsa staining technique. To investigate the total number and structure of the chromosomes, twenty metaphase cells were visualized and analyzed by an experienced cytogenetics technologists using Zeiss microscope Axioskop2 plus (Zeiss, Jena, Germany) with the assistance of the MetaSystems imaging system. Results were reported in accordance with the International Standing Committee on Human Cytogenetic Nomenclature [11]. In the case of pathological karyogram findings, spouses were invited for genetic counseling at the Institute where multidisciplinary team (genetics, obstetrician and biologist) explained relevant medical facts regarding the findings. This diagnostic method was chosen due to availability since other invasive diagnostic testing methods such as chronic villi sampling (CVS) is not performed in Bosnia and Herzegovina, therefore it was not offered to any of our patients.

The dataset containing patient characteristics, test results, and pregnancy outcomes needed for development of the system was obtained from genetics laboratory at the Institute of Gynecology, Infertility and Perinatology “Mehmedbasic” in Sarajevo. In this dataset 1500 samples were of healthy subjects and 1000 samples were with diagnosis of aneuploidy.

Summary statistics of database is presented in Table 1. Out of 2500 observed samples, 52.5% were male and 47.5% were female samples. Minimum maternal age in this dataset is 16 years and the maximum age is 49 years. Mean maternal age of pregnant women who underwent amniocentesis is 31.5 years. Out of 2500 samples, 40% were with disease classification and 60% were of healthy subjects. Out of samples with confirmed diagnosis of prenatal syndrome most were of Klinefelter Syndrome (26.83%), followed with Down syndrome (24.31%), Edwards Syndrome (19.17%), Turner Syndrome (16.11%) and Patau Syndrome (13.58%). According to these samples, training dataset folds were created.

Table 1 Summary statistics for dataset

The Expert System based on Artificial Neural Network consists of 5 input parameters that are: maternal serum pregnancy-associated plasma protein A (PAPP-A), free beta human chorionic gonadotropin (β-hCG), mother’s age, fetal nuchal translucency thickness, the presence or absence of the nasal bone and one output parameter indicating that the tested subject has one of the prenatal syndromes or is healthy.

In designing neural networks for solving specific problem, factors such as neural network architecture, number of hidden neurons, training dataset distribution and training algorithm have significant impact on overall accuracy of developed system [39]. We investigated two different neural network architectures, feedforward and feedback, for various number of neurons in hidden layer, which are according to the application experts, sufficient to properly perform the classification [15, 40,41,42].

Single hidden layer feedforward ANN with back-propagation

A feedforward neural network is probably the most common type of neural networks for classification and prediction [13]. As it can be seen from Fig. 1, network inputs are not affected with network output in any way. The output is result of modifiable synapses, represented as summations of signals from hidden and input layer.

Fig. 1
figure 1

Architecture of a single layer feedforward neural network. n is number of inputs and m is number of neurons in hidden layer; f(·) and g(·) are transfer functions in hidden and output layer respectfully. Connections between neurons are represented with weight factors w; a is bias (internal neural network parameter); Σ indicates synapses – summation of signals from previous neurons

In hidden layer, the inputs are linearly combined with vector of weights and biases (inner neural network parameter). The resulting linear combination of such input is then transformed to neuron output by transfer function. The output of single neuron in hidden layer is represented with following equation:

$$ {z}_i=f\left({a}_i+{\sum}_{k=1}^n{w}_{ik}{x}_k\right) $$
(1)

where w ik is weight of x k input to z i output, and i is the number of neuron in hidden layer (1-m).

The output of ANN is formed as linear combination of hidden layer neuron outputs. These connections between hidden and output layer are also weighted. The output is calculated as:

$$ y=g\left({a_o}^{\hbox{'}}+{\sum}_{k=1}^m{w_i}^{\hbox{'}}{z}_i\right) $$
(2)

The transfer functions can be linear and nonlinear. To model non-linear relationship between inputs and outputs, like in cytogenetic analysis result and syndrome classification, non-linear transfer function such as logsig in hidden layer was used \( \left( logsig(x)=\frac{1}{1+{e}^{-x}}\right) \). In output layer linear function (purelin(x) = x. Computationally, learning in this type of ANN architecture is regression process of adapting weights and biases until minimum error value is achieved. Simple iterative algorithm that can be used in training of this type of neural network is back-propagation algorithm. The output values are compared with the correct answer to compute the value of some predefined error-function. The weights and biases of neural network are corrected in each iteration. Most commonly used error function if mean square error (MSE). After training is completed, neural network parameters are held constant for subsequent analysis. This type of ANN can be used for modeling most problems while keeping simple architecture and low computational complexity. As a performance measure for trained neural network, absolute deviation from the actual output value can be used.

Elman recurrent artificial neural network

Elman neural network has feedback (recurrent) architecture where connections between neurons form a directed cycle. The main difference between feedforward and feedback neural network is that feedback neural networks beside the input and output layer consist of recurrent layer which introduces one-step delay in hidden layer. This recurrent layer acts like memory in network architecture. This type of neural networks, because of this ability, is mostly used for recognition. Elman neural network is a three-layer network with vertically architecture shown in Fig. 2.

Fig. 2
figure 2

Elman neural network Architectures are context units that can be treated as the memory units. There are connections from the middle (hidden) layer to these context units fixed with a weight of one

This neural network architecture also consists of input layer, hidden layer and output layer called the feedforward loop and context layer which makes back-forward loop with hidden layer which makes the neural networks sensitive to the history of input data. The hidden neuron outputs are calculated as follows:

$$ {z}_i=f\left({a}_i+{\sum}_{k=1}^n{w}_{ik}{x}_k(j)+{\sum}_{k=1}^m{w}_{ik}{z}_k\left(j-1\right)\right) $$
(3)

Where j stands for discrete time, w ik is weight of x k input to z i output, and i is the number of neuron in hidden layer (1-m). The output is calculated as:

$$ y=g\left({a_o}^{\hbox{'}}+{\sum}_{k=1}^m{w_i}^{\hbox{'}}{z}_i\right) $$
(4)

The gradient descent (GD) algorithm is most commonly used for training of Elman’s neural networks. The error between the network output and the desired outputs minimized in the steepest descent [43]. At each iteration the input is propagated in a standard feed-forward architecture, and then a learning rule is applied. The back-forward loop results in the context units always maintaining a copy of the previous values of the hidden units, thus the network can maintain a sort of state, allowing it to perform such tasks as sequence-prediction that are beyond the power of a standard multilayer perceptron.

Results

Before developing neural network architecture, total database consisted of 2500 samples was divided into twodisjoint subsets, training and subsequent validation data. The training data consisted of 2000 samples (80% of overall database) and subsequent validation (testing of the system) was performed with the rest of the data as it can be seen from the Fig. 3. Out of 2000 samples 1200 were of healthy subjects and the rest were as follows: Klinefelter Syndrome 268 samples, Down syndrome 243 samples, Edwards Syndrome 192 samples, Turner Syndrome 161 samples and Patau Syndrome 136 samples, therefore in total 800 samples as presented in Table 1. This dataset distribution is common among researchers in the field when dealing with relatively small number of samples [44].

Fig. 3
figure 3

Training data set consisted of 1200 samples of normal subjects, 161 of Turner syndrome, 268 Klinefelter syndrome, 192 Edwards syndrome, 243 Down syndrome, 136 Patau Syndrome

Due to database size and non-linearity of provided data for neural network training k-fold cross validation was used. The 2000 training samples were divided into 10 folds. This number of folds was chosen since the training dataset is comprised of 80% of database and according to experts in this division is sufficient [45]. In each iteration Levenberg – Marquardt algorithm (LMA) was used, which is common training algorithm in data classification [46]. The starting network weights were initialized with random values. At each training iteration train/test performance was evaluated as Mean Square Error between the predicted and actual values (MSE), where n is total number of samples:

$$ MSE=\frac{1}{n}{\sum}_{i=1}^n{\left({X}_{predicted}-{X}_{actual}\right)}^2 $$
(5)

The single performance estimation of neural network training process was generated by averaging results from k-folds. By using the k-fold cross validation the problem of over fitting was reduced. In case of over fitting the developed system has very good training performance but it’s predictive ability on unseen data is very poor. Developed neural network architectures were compared based on accuracy of classification of prenatal syndromes. To compare performance on neural network architectures, absolute error was used also. For validation purpose, absolute error is calculated as follows:

$$ {Error}_{absolute}={Output}_{target}-{ANN}_{output} $$
(6)

Dependency of classification accuracy in both architecture types to the number of neurons in hidden layer is examined. While number of input and output neurons is determined with data structure and process modeling, performance of training is dependent on complexity of neural network and number of neurons in hidden layer [30, 32]. Poorly defined number of neurons in hidden layer can cause over fitting problem, which leads to good training performance and very bad testing performance. There are various methods for choosing fixed number of neurons in hidden layer [47, 48], but there is no generally accepted one for determining the number of neurons in single hidden layer that would efficiently approximate any given function or process. Despite the new methods developed for this purpose, the most researchers use trial rule. This rule was used in this study also. Number of neurons in hidden layer is set to be 5, 10, 15, 17 and 20 and performance was calculated.

For each step detailed results are presented in Fig. 4 for feedforward neural network. The output performance is presented in Table 2. From the data presented in Fig. 4 and Table 3 it can be concluded that the best performance is achieved with 15 neurons for feedforward architecture.

Fig. 4
figure 4

Training performance of Feedforward neural network with different numbers of hidden neurons

Table 2 Feedforward model comparison based on different numbers of neurons in hidden layer
Table 3 Elman model comparison based on different numbers of neurons in hidden layer

For each number of neurons, performance and absolute error of training dataset was calculated for feedback architecture also. Detailed results are presented in Fig. 5 and Table 3. From the data presented it can be concluded that the best performance is achieved with 17 neurons for feedback architecture.

Fig. 5
figure 5

Training performance of Elman neural network with different numbers of hidden neurons

Figure 6 shows the performance of architectures of neural networks chosen for further development of system for classification between prenatal syndromes. Those networks are feedforward neural network with 15 neurons and Elman neural network with 17 neurons. MSE for feedforward ANN was 15.6316 and for Elman’s ANN 1.5752.

Fig. 6
figure 6

Training performance of Feedforward neural network (15 neurons) and Elman neural network (17 neurons)

Neural network subsequent validation performance for classification of five prenatal syndromes based on cytogenetic analysis and patient demographics is performed with 1000 samples out of which 1200 were healthy subjects, and 800 were subjects with disease with syndrome sample distribution as indicated in Table 1.

Based on data presented in Table 4, sensitivity of 92.00% was achieved for feedforward neural network architecture. The sensitivity of feedback neural network was higher and the value was 99.00%. Sensitivity was calculated based on following formula:

$$ Sensitivity=\frac{True\ postitive}{True\ postivie+ False\ negative} $$
(7)
Table 4 Feedforward neural network classification accuracy during subsequent validation

While specificity was calculated based on:

$$ Specificity=\frac{True\ negative}{False\ positive+ True\ negative} $$
(8)

Detailed classification hits and misses of both neural networks are presented in Tables 4 and 5. Average accuracy of feedforward neural network was 89.6% and for feedback was 98.8% (Table 6).

Table 5 Elman neural network classification accuracy during subsequent validation
Table 6 Subsequent validation performance results of developed neural networks

Discussion

To design an expert diagnostic system based on neural networks for classification of prenatal aneuploidy syndromes, we presented a novel method based on the results of combined/double test in the first trimester of pregnancy, ultrasonographic findings and maternal levels of free β-hCG and PAPP-A. The Expert System technique proposed in this paper potentially opens new avenues for the development of inexpensive, yet effective, prenatal aneuploidy screening tests. The simplicity and accuracy of this method make it a good candidate for clinical implementation as standard software for screening procedure.

There has been significant increase in usage of artificial neural networks in medicine [49], and specifically, in pediatrics. There are several studies based on determining one out of five prenatal syndromes analyzed in this study. Wojtowicz et al. [50] in their study use phenotypic features in diagnosing Down syndrome, postnatal, based on features versus genotype data and patient demographics that were done prenatal. In other study Wojtowicz et al. [51] discuss a new system that is used to solve the problem of the recognition of the dermatoglyphic pattern and the understanding of the classification process of the symptoms of Down syndrome. This system is based on the combination of text knowledge found in the scientific literature describing Down syndrome with the knowledge obtained from the analysis of dermatoglyphic indices characteristic of Down syndrome with the use of digital pattern recognition techniques. Instead of numerical values used for classification of prenatal syndromes in this study Wojtowicz et al. [51] use pattern recognition algorithms for determining the Down syndrome.

Also, for postnatal determination of Turner syndrome Naïve Bayesian neural network has been proposed in study by Pereira et al. [52]. The syndrome classification was based on nine input parameters, phenotype attributes. The problem encountered in building the Bayesian network and the forming of an expert probabilistic system was to obtain knowledge, i.e. database of useful experiment results.

A large number of different neural network structures have been constructed, trained and tested to a large database of pregnant women characteristics, aiming at generating aclassifier-predictor for the presence of chromosomal abnormalities. Soleimani et al. [53] in their study used 3-layer artificial neural network with back propagation learning algorithm to predict development disorders based only on perinatal data. True prediction of developmental disorder, obtained in this study was 83.1%. In our study, both feedforward and feedback neural network architecture used Levenberg – Marquadat training algorithm and exhibit higher accuracy than this multilayer neural network explored in previously mentioned study.

Nicolaides et al. [54] in their study used artificial neural network for non-invasive chromosomal abnormality screening. The best results were obtained when using a multi-layer neural structure having an input, an output and three hidden layers. The percentage of abnormal cases correctly predicted was 85.1%. In the same study, Turner syndrome was predicted with 42.9%. In our study, classification of Turner syndrome was higher when Elman architecture was used and lower than mentioned one when feedforward architecture was used. Also mentioned researchers incorporated “genetics” results to compare or to show “false negative and false positive” results.

Other analysis application of neural networks in prenatal diagnostics relay on using these systems for chromosome identification. For instance, Wang et al. [40] built multi-feature ANN metaphase chromosome classifier based on 150 metaphase chromosome cell images (2300 individual chromosomes) from both normal and abnormal peripheral blood and amniotic fluid samples. Similar to our ANN design, their ANNs had feed-forward structure with three layers composed of: input, hidden and output neurons.

In the future work, the aim of the researchers is to develop Graphical User Interface for implemented Expert System, as it is practice in the other similar studies [55], so physicians will be able to use it in friendlier environment.

Conclusions

A chromosome disorder is caused by an alteration in the number or genetic structure of chromosomes. A trisomy is a chromosomal disorder characterized by an additional chromosome, meaning that the affected person has 47 chromosomes instead of 46. The most common forms of trisomy are: Down syndrome, Edwards syndrome, Patau syndrome and Klinefelter. Children affected by trisomy usually have a range of birth defects, including delayed development and intellectual disabilities.

Cytogenetic and advanced molecular analyses are convincing prenatal diagnostic investigation along with clinical suspicions and biochemical screening tests. Microscope analysis and computer imaging are the most common methods used for prenatal diagnostics. However, visual karyotyping using microscopic images is time-consuming and labor intensive, which can reduce the diagnostic efficiency and accuracy. Hence, using computer-based expert diagnostic systems can significantly contribute to the evaluation of prenatal diagnostic screening test results and can be an adjunct to routine chromosomal analysis.

Aneuploidies involving 13, 18, 21 and sex (X and Y) chromosomes account for the majority of all chromosome abnormalities in live-born infants. The risk of chromosomal abnormality in the fetus increases with increasing maternal age. Rapid diagnosis of fetal chromosome anomalies may facilitate clinical decision-making, especially when a fetal abnormality is detected late in pregnancy.

In most cases, the loss of a whole chromosome is incompatible with life and will result in early miscarriage or stillbirth. If the extra genetic material is an entire chromosome, the effect is largely lethal.

In this study, we used two neural network architectures to classify between five prenatal syndromes based on the results of maternal serum screening tests, ultrasonographic findings and patient demographic. The aim of this work was to examine efficiency of different neural network architectures for this task. This study has proven that relatively simple neural network architecture, such as feedforward, can have high classification accuracy. Because of the non-linear input-output relationship better accuracy of classification can be achieved with recursive neural network architecture, such as Elman architecture. To our knowledge, this is the first application of ANN developed for classification offirst trimester maternal serum screening samples. Nonetheless, we expect that this ANN system can be expanded to other syndromes and other sample types. This supports the potential use for the fast and reliable detection of genetic disorders and fetal well-being.

There are several limitations in thisstudy. Forinstance, the use of a single health center and single equipment during the study. The second important limitation of the study is the limited number of “abnormal” samples. Nevertheless, the analysis program’s performance is limited to input information and knowledge and medical expert cannot get more than he or she has donated the system.

The impact of developed system is that it enables classification of prenatal syndromes based on input parameters that can be acquired with relatively non-invasive and low cost methods. Developed system comprises of expert knowledge in this field since all diagnosis were previously revised by field professional. By enlarging the database with various datasets regarding demographics system accuracy can be improved and the usage of the system expanded to other regions. This system can be used instead of commercial prenatal syndrome screening software’s especially in low-income countries where healthcare systems are dealing with rising costs of diagnostic and disease treatment.

Abbreviations

aCGH:

Array comparative genomic hybridization

ANN:

Artificial Neural Network

A-PAPP-A:

Pregnancy-associated plasma protein

CVS:

Chorionic villus sampling

FISH:

Fluorescence in situ hybridization

GD:

Gradient descent

LMA:

Levenberg-Marquardt algorithm

MSE:

Mean Square Error

NGS:

Next-generation sequencing

QF-PCR:

Quantitative fluorescence polymerase chain reaction

β-hCG:

beta human chorionic gonadotropin

References

  1. Tjio JH, Levan A. The chromosome number in man. Hereditas. 1956;42:1.

    Article  Google Scholar 

  2. Ford CE, Hamerton JL. The chromosomes of man. Nature. 1956;178:1020–3.

    Article  CAS  PubMed  Google Scholar 

  3. Mikkelsen M, Brondum-Nielsen K. Karyotype analysis and chromosome disorders. In: Brock DJH, Rodeck CH, Ferguson-Smith MA, editors. Prenatal diagnosis and screening. London: Churchill Livingstone; 1992. p. 99–125.

    Google Scholar 

  4. Schonberg SA. Cytogenetic analysis in prenatal diagnosis. West J Med. 1993;159:360–5.

    CAS  PubMed  PubMed Central  Google Scholar 

  5. Allen EG, Freeman SB, Druschel C, et al. Maternal age and risk for trisomy 21 assessed by the origin of chromosome nondisjunction: a report from the Atlanta and National down Syndrome Projects. Hum Genet. 2009;125:41–52.

    Article  PubMed  Google Scholar 

  6. Hixson L, Goel S, Schuber P, Faltas V, Lee J, Narayakkadan A, Leung H, Osborne J. An overview on prenatal screening for chromosomal aberrations. J Lab Automat. 2015;20(5):562–73.

    Article  CAS  Google Scholar 

  7. Badenas C, Rodríguez-Revenga L, Morales C, Mediano C, Plaja A, Pérez-Iribarne MM, Soler A, Clusellas N, Borrell A, Sánchez MÁ, Miró E. Assessment of QF-PCR as the first approach in prenatal diagnosis. J Mol Diagn. 2010;12(6):828–34.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Chiu RW, Chan KA, Gao Y, Lau VY, Zheng W, Leung TY, Foo CH, Xie B, Tsui NB, Lun FM, Zee BC. Noninvasive prenatal diagnosis of fetal chromosomal aneuploidy by massively parallel genomic sequencing of DNA in maternal plasma. Proc Natl Acad Sci. 2008;105(51):20458–63.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Wilson KL, Czerwinski JL, Hoskovec JM, Noblin SJ, Sullivan CM, Harbison A, Campion MW, Devary K, Devers P, Singletary CN. NSGC practice guideline: prenatal screening and diagnostic testing options for chromosome aneuploidy. J Genet Couns. 2013;22(1):4–15.

    Article  CAS  PubMed  Google Scholar 

  10. Jia CW, Wang SY, Ma YM, Lan YL, Si YM, Yu L, Zhou LY. Fluorescence in situ hybridization in uncultured amniocytes for detection of aneuploidy in 4210 prenatal cases. Chin Med J. 2011;124(8):1164–8.

    PubMed  Google Scholar 

  11. Shaffer LG, Tommerup N, editors. ISCN 2005: An international system for human cytogenetic nomenclature. Basel: S. Karger; 2005.

  12. Guyon I, Elisseeff A. An introduction to variable and feature selection. J Mach Learn Res. 2003:1157–82.

  13. Guenther FH. Neural networks: Biological models and applications. In: Smel- ser NJ, Baltes PB, editors. Oxford: International Encyclopedia of the Social & Behavioral Sciences; 2001. p. 10534–7.

  14. Eluyode OS, Akomolafe AT. Comparative study of biological and artificial neural networks. Eur J Appl Eng Sci Res. 2013;2:36–46.

    Google Scholar 

  15. Badnjevic A, Cifrek M, Koruga D, Osmankovic D. Neuro-fuzzy classification of asthma and chronic obstructive pulmonary disease. BMC Med Inform Decis Mak. 2015;15(Suppl 3):S1. https://doi.org/10.1186/1472-6947-15-S2-S1.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Badnjevic A, Cifrek M, Koruga D. Classification of Chronic Obstructive Pulmonary Disease (COPD) using integrated software suite. IFMBE XIII Mediterranean Conference on Medical and Biological Engineering and Computing (MEDICON), 25–28. Sevilla; 2013.

  17. Badnjevic A, Cifrek M. Classification of asthma utilizing integrated software suite. 6th European Conference of the International Federation for Medical and Biological Engineering (MBEC), 07–11. Dubrovnik; 2014.

  18. Aljovic A, Badnjevic A, Gurbeta L. Artificial Neural Networks in the Discrimination of Alzheimer's disease Using Biomarkers Data. IEEE 5th Mediterranean Conference on Embedded Computing (MECO), 12–16. 2016. Bar, Montenegro.

  19. Alic B, Sejdinovic D, Gurbeta L, Badnjevic A. Classification of Stress Recognition using Artificial Neural Network. IEEE 5th Mediterranean Conference on Embedded Computing (MECO), 12–16 June 2016, Bar, Montenegro.

  20. Fojnica A, Osmanovic A, Badnjevic A. Dynamical Model of Tuberculosis-Multiple Strain Prediction based on Artificial Neural Network. IEEE 5th Mediterranean Conference on Embedded Computing (MECO), 12–16 June 2016, Bar, Montenegro.

  21. Badnjevic A, Koruga D, Cifrek M, Smith HJ, Bego T. „Interpretation of pulmonary function test results in relation to asthma classification using integrated software suite“, IEEE 36. International convention on information and communication technology, electronics and microelectronics (MIPRO), pp: 140–144, 21.-25. May 2013. Opatija.

  22. Badnjevic A, Gurbeta L, Cifrek M, Marjanovic D, Classification of Asthma Using Artificial Neural Network“, IEEE 39th International convention on information and communication technology, electronics and microelectronics (MIPRO), 30. May to 03. June 2016. Opatija.

  23. Granulo E, Becar L, Gurbeta L, Badnjevic A, “Telemetry system for diagnosis of Asthma and Chronical Obstructive Pulmonary Disease (COPD)”, The 3rd EAI International Conference on IoT Technologies for HealthCare, 18–19 October 2016, Vasteras.

  24. Veljovic E, Spirtovic Halilovic S, Muratovic S, Osmanovic A, Badnjevic A, et al. Artificial Neural Network and Docking Study in Design and Synthesis of Xanthenes as Antimicrobial Agents, CMBEBIH 2017. IFMBE Proceedings, vol 62. pp 617–626, Springer, Singapore. https://doi.org/10.1007/978-981-10-4166-2_93

  25. Alic B, Gurbeta L, Badnjevic A, et al. Classification of Metabolic Syndrome patients using implemented Expert System”, CMBEBIH 2017. IFMBE Proceedings, vol 62. pp 601–607, Springer, Singapore. https://doi.org/10.1007/978-981-10-4166-2_91

  26. Badnjevic A, Gurbeta L, Cifrek M, Pecchia L. Pre-classification process symptom questionnaire based on fuzzy logic for pulmonary function test cost reduction. CMBEBIH 2017. IFMBE Proceedings, vol 62. pp 608–616, Springer, Singapore. https://doi.org/10.1007/978-981-10-4166-2_92

  27. Sejdinovic D, Gurbeta L, Badnjevic A, Malenica M, Dujic T, Causevic A, Bego T, Divovic L. Classification of prediabetes and Type 2 Diabetes using Artificial NeuralNetwork. CMBEBIH 2017. IFMBE Proceedings, vol 62. pp 685–689, Springer, Singapore. https://doi.org/10.1007/978-981-10-4166-2_103

  28. Gianola D, Okut H, Weigel KA, Rosa GJM. Predicting complex quantitative traits with neural networks: a case study with Jersey cows and wheat. BMC Genetics. 2011. https://doi.org/10.1186/1471-2156-12-87.

  29. Ehret A, Hochstuhl D, Gianola D, Thaller G. Application of neural networks with back-propagation to genome-enabled prediction of complex traits in Holstein-Friesian and German Fleckvieh cattle. Genetics Selection Evolution. 2015. https://doi.org/10.1186/s12711-015-0097-5.

  30. Wang X, Zheng B, Li S, Mulvihill JJ, Wood MC, Liu H. Automated classification of metaphase chromosomes: optimization of an adaptive computerized scheme. J Biomed Inform. 2009;42:22–31.

    Article  CAS  PubMed  Google Scholar 

  31. Wang X, Zheng B, Wood M, Li S, Chen W, Liu H. Development and evaluation of automated systems for detection and classification of banded chromosomes: current status and future perspectives. J Phys D Appl Phys. 2005;38:2536.

    Article  CAS  Google Scholar 

  32. Beksaç MS, et al. an expert diagnostic system based on neural networks and image analysis techniques in the field of automated cytogenetics. Technol Health Care 1996;3:217-229.

  33. Eskiizmirliler S, Erkmen AM, Basaran F, Cakar AN. A hybrid intelligent diagnostic system based on neural networks and image analysis techniques in the field of automated cytogenetics. In Image Processing 1996. Proceedings.International Conference on. Vol. 1. IEEE. 1996:315–318.

  34. Jennings AM, Graham J. A neural network approach to automatic chromosome classification. Phys Med Biol. 1993;38:959.

    Article  CAS  PubMed  Google Scholar 

  35. Braaten O, Friestad J. Syndrome diagnosis: human intuition or machine intelligence? Open Med Inform J. 2008;2:149.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Dagna-Bricarelli F, Hastings RJ, Kristoffersson U, Cavani S. Cytogenetic Guidelines and Quality Assurance. A common European framework for quality assessment for constitutional and acquired cytogenetic investigations. ECA Permanent Working Group for Cytogenetics and Society. Guidelines Version 1.2. http://www.iss.it/binary/tege4/cont/Linee_Guida_ECA.pdf

  37. Hastings RJ, Cavani S, Bricarelli FD, Patsalis PC, Kristoffersson U. Cytogenetic guidelines and quality assurance: a common European framework for quality assessment for constitutional and acquired cytogenetic investigations. Eur J Hum Genet. 2007;15:525–7.

    Article  PubMed  Google Scholar 

  38. Kroisel PM, Fryns JP, Matthijs G, Brondum-Nielsen K, Vejerslev L, von Koskull H, Giraudon E, Saura R, Held K, Muller-Reible CR, Metaxotou C. Quality guidelines and standards for genetic laboratories/clinics in prenatal diagnosis on fetal samples obtained by invasive procedures-an attempt to establish a common European framework for quality assessment. Eur J Hum Genet. 1997;5:342–50.

    Google Scholar 

  39. GnanaSheela K, Deepa SN. Review on Methods to Fix Number of Hidden Neurons in Neural Networks. Math Probl Eng. 2013. https://doi.org/10.1155/2013/425740.

  40. Byvatov E, Fechner U, Sadowski J, Schneider G. Comparison of support vector machine and artificial neural network systems for drug/nondrug classification. J Chem Inf Comput Sci. 2003;43(6):1882–9.

    Article  CAS  PubMed  Google Scholar 

  41. Khan J, Wei JS, Ringner M, Saal LH, Ladanyi M, Westermann F, Meltzer PS. Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks. Nat Med. 2001;7:673–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Güler NF, Übeyli ED, Güler İ. Recurrent neural networks employing Lyapunov exponents for EEG signals classification. Expert Syst Appl. 2005;29:506–14.

    Article  Google Scholar 

  43. Lin F-J, Kung Y-S, Chen S-Y, Liu Y-H. Recurrent wavelet-based Elman neural network control for multi-axis motion control stage using linear ultrasonic motors. IET Electric Power Applications. 2010;4(5):314–332.IEPAAN000004000005000314000001.

  44. Pasini A. Artificial neural networks for small dataset analysis. J Thorac Dis. 2015;7(5):953.

    PubMed  PubMed Central  Google Scholar 

  45. Kohavi R. A study of cross-validation and bootstrap for accuracy estimation and model selection. Ijcai. 1995;14(2):1137–45.

    Google Scholar 

  46. Hao Y, Wilamowski M. Levenberg-Marquardt Training. 2011, 1–16, eBook ISBN: 978–1–4398-0284-7, Print ISBN: 978–1–4398-0283-0.

  47. Huang SC, Huang YF. Bounds on the Number of Hidden Neurons in Multilayer Perceptrons. IEEE Trans. Neural Netw. 1991.

  48. Sheela KG, Deepa SN. Review on Methods to Fix Number of Hidden Neurons in Neural Networks. J Math Probl Eng. 2013;425740.

  49. Tu J. Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J Clin Epidemiol. 1996;49:1225–31.

    Article  CAS  PubMed  Google Scholar 

  50. Wojtowicz H, Wajs W. Intelligent information system for interpretation of Dermatoglyphic patterns of Down’s syndrome in infants. Intell Inf Database Sys. 2012:284–93.

  51. Wojtowicz H, Wajs W. Medical decision support system for assessment of dermatoglyphic indices and diagnosis of Down’s syndrome. InIntelligent decision technologies. Berlin Heidelberg: Springer; 2012. p. 69–78.

  52. Pereira H, Leite F. Applicability of Data Mining Technique using Bayesians Network in Diagnosis of Genetic Diseases. 2013.

  53. Soleimani F, Teymouri R, Biglarian A. Predicting developmental disorder in infants using an artificial neural network. Acta Med Iran. 2013;51:347.

  54. Nicolaides K. The 11–13+6 weeks scan. London: Fetal Medicine Foundation; 2004.

    Google Scholar 

  55. Badnjevic A., Cifrek M., Classification of asthma utilizing integrated software suite. 6th European Conference of the International Federation for Medical and Biological Engineering (MBEC). Dubrovnik; 2014. p. 7–11.

Download references

Acknowledgements

The authors would like to thank the editor and reviewers for the thoughtful comments and constructive suggestions, which greatly helped us improve the quality of this manuscript.

Funding

Not applicable.

Availability of data and materials

The data generated and/or analyzed during the current study is available from the corresponding author on reasonable request.

Author information

Authors and Affiliations

Authors

Contributions

AC selected, directed, and reported testing, wrote manuscript. LG designed, validated and tested artificial neural networks. LG designed figures and tables, and assisted in writing manuscript. AK, molecular cytogeneticist at the Department of Pathology, Clinical Center of the University of Sarajevo contributed her expertise in prenatal syndromes. MS, director at “Mehmedbasic” Institute of Gynecology, Infertility and Perinatology discussed and approved tests, performed ultrasounds and amniocenteses. AB, director at Verlab Ltd. Sarajevo, provided guidance and expertise in developing artificial neural networks. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Aida Catic.

Ethics declarations

Ethics approval and consent to participate

Because this was a retrospective chart review study no ethics committee approval was required. By the rules of the private clinic “Mehmedbasic” informed consent was obtained from all participants included in the study prior to amniocentesis procedure.

Consent for publication

Not applicable.

Competing interests

All authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Catic, A., Gurbeta, L., Kurtovic-Kozaric, A. et al. Application of Neural Networks for classification of Patau, Edwards, Down, Turner and Klinefelter Syndrome based on first trimester maternal serum screening data, ultrasonographic findings and patient demographics. BMC Med Genomics 11, 19 (2018). https://doi.org/10.1186/s12920-018-0333-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12920-018-0333-2

Keywords