• <tr id="yyy80"></tr>
  • <sup id="yyy80"></sup>
  • <tfoot id="yyy80"><noscript id="yyy80"></noscript></tfoot>
  • 99热精品在线国产_美女午夜性视频免费_国产精品国产高清国产av_av欧美777_自拍偷自拍亚洲精品老妇_亚洲熟女精品中文字幕_www日本黄色视频网_国产精品野战在线观看 ?

    Offline Urdu Nastaleeq Optical Character Recognition Based on Stacked Denoising Autoencoder

    2017-05-08 11:32:25IbrarAhmadXiaojieWangRuifanLiShahidRasheed
    China Communications 2017年1期

    Ibrar Ahmad , Xiaojie Wang, Ruifan Li, Shahid Rasheed

    1 Center for Intelligence of Science and Technology (CIST), School of Computer Science,Beijing University of Posts and Telecommunications, No.10 Xitucheng Road, Beijing 100876, China.

    2 Department of Computer Science, University of Peshawar, Peshawar 25120, Pakistan.

    3 Pakistan Telecommunication Company Limited (PTCL), Islamabad 44000, Pakistan

    * The corresponging author, email: toibrar@yahoo.com, ibrar@upesh.edu.pk

    I.INTRODUCTION

    Urdu is the national language of Pakistan and is widely used in Southern Asia.Two different writing systems for Urdu namely Nastaleeq and Naskh are primarily in vogue.Nastaleeq system of writing is widely adopted in traditional Urdu books and newspaper writings.Other regional languages such as Persian,Pashto, Punjabi, Baluchi, and Siraiki also practice Nastaleeq writing style [1].All such applications increase the importance of Optical Character Recognition (OCR) for Nastaleeq script.OCR for printed Urdu Nastaleeq text therefore becomes imperative for digitalization.

    During the last two decades, a lot work has been done on character based [2-8] and ligature based [1, 9-14] Urdu OCR.Despite of their success, there remain two major limitations: lack of suitable feature extraction techniques and scarcity of available data.Almost all researchers have applied hand-engineered feature extraction techniques which may not fully represent the data.Secondly,the largest reported dataset used comprised of approximately 83000 ligatures [15], which is not large enough for evaluating practical Urdu OCR.This paper addresses two main shortcomings that currently Urdu OCR systems exhibit.Firstly, we have used autoencoder based feature extraction method which is already proved very successful [16-20] for recognizing many image datasets by extracting features automatically from raw pixel values.And secondly, we have trained our system on 178573 ligatures from un-degraded version of Urdu Printed Text Image (UPTI) dataset [9]and tested on almost same number of ligatures from degraded versions of UPTI with 93% to 96% accuracy.

    This paper proposes the use of stacked denoising autoencoder and softmax for automatic feature extraction directly from raw pixel values and classification, respectively.

    1.1 Previous work

    Previous work done on OCR for Printed Urdu Nastaleeq text can be classified into two types:characters based method and ligature based method.In literature, the former is known as segmented oriented while the latter as segmented free techniques also.A ligature may be a word or a sub-word which is composed of one or more connected characters.A ligature may usually compose of 1 to 8 characters.The Nastaleeq text recognition, as compared to Naskh, is more complex due to its overlapping, compactness, context sensitiveness, cursive nature, diagonality, and many other[21]properties shown in Figure 1.

    Initially, considerable efforts were devoted to the recognition of Urdu basic character set.Multilayer Layer Perceptron (MLP) Network[2], Support Vector Machines (SVM) [3], Neural Network (NN) [4], Hidden Morkov Model(HMM) [5], Principal Component Analysis[6], Back Propagation (BP) based NN [7],and pattern matching classifier [8] have been used for recognition of Urdu characters.All of these OCR systems have used man designed features like invariant moments [3], structural features (width, height and checksum) [4],Discrete Cosine Transforms (DCT)[5], topological features [7] and chain code [8].The largest reported dataset is 36800 characters [3]with the accuracy of 93.59%.

    Difficulties and complexities [21] of Urdu Nastaleeq text segmentation into characters cause the paradigm shift from character to ligature based Urdu text recognition systems.Most of the prevalent Urdu OCR techniques work on ligature based recognition [1, 9-10,12-15, 22].

    Ligature based OCR systems produce superior results as compared to the counterpart character based recognition schemes.

    Various models and features are explored in literature on ligature based Urdu text recognition.Many choices of learning models have been used for this purpose, such as HMM [1,13-15, 22], K-Nearest Neighbor (KNN) [9,23], SVM [12], and Bidirectional Long Short Term Memory (BLSTM) structure[10-11] with a layer of Connectionist Temporal Classification (CTC) at output.From the previous work,it is evident that the DCT is the most dominant features extraction technique used [1, 12-15, 22] for Urdu OCRs.Some other features include shape context [9], raw pixel values[10], sliding window [11], and Hu invariant moments [23], which are all man-designed.In Urdu OCR systems, no one has so far used any of the deep learning methods e.g.autoen-coders for automatic feature extraction [16].

    Fig.1 Complexities of Nastaleeq writing style

    1.2 Motivation

    Scarcity of data for training is one of the major drawbacks of all Urdu OCR systems to-date.The largest data set for Urdu OCR system consists of 83000 ligatures [15].Also, one can easily detect that all available Urdu OCR systems are heavily relying on hand-engineered feature except [10-11].Such features are very hard to design, expensive, domain dependent and do not accurately mimic the properties of a dataset.Manual labeling like [10-11] of an enormous data set is almost impossible.

    Autoencoders have been used to extract feature representation directly from the raw pixel values of an image [16-17, 29].With noisy input, Denoising Autoencoders (DAs)have been used to extract robust hidden representation of data independent of domain [19-20, 30].It can represent the intrinsic features of data which are very difficult to uncover by human-designed features.

    Stacked Denoising Autoencoders (SDAs)have been successfully applied for character recognition [18] and recognition of Bangla Lanuage [24-25].Therefore, with the success of automatic feature extraction by autoencoders in many image recognition fields, it is a reasonable selection to let SDAs learn features itself from raw data of Urdu Nastaleeq text.

    1.3 Contribution

    In this paper, we presented a ligature based Urdu OCR application by using the basic SDA[18].No one in Urdu OCR research, thus far has reported the use of DA or SDA for feature learning.Here, different SDA networks with softmax layer on top are trained and tested on a very large dataset, known as Urdu Printed Text Image (UPTI) dataset [9].Detail about UPTI is presented in Section 3.1.

    In the referenced work, the largest data set for Urdu OCR system consists of 83000 ligatures (224 pages *371 ligatures per page) [15].Similarly, over 10, 000 ligatures have been used in [9] from UPTI data set.Although [8]reports 98% accuracy, it uses very small data set i.e.noise free 9262 ligatures of 2190 classes.

    We have used 178573 ligatures from 3732 classes of un-degraded version of UPTI for training the SDA and almost same number of ligatures from degraded versions for validation and testing.So, our training and testing set is more than twice in size than the largest dataset[15] used for Urdu OCR, so far.Validation and testing the trained SDA gives 4.14% error which is also lesser than the reported error in the referenced work.

    The structure of this paper is as follows.In Section II, learning architecture of the recognition system is discussed.In Section III,experimental setup is explained.In Section IV,results are analyzed for evaluating the performance of recognition system.Finally, Section V concludes on the basis of results and analysis done.

    I.LEARNING ARCHITECTURE

    This section presents the introduction of basic autoencoder, denoising autoencoder and the architecture of proposed SDA model for ligature recognitions of Urdu language.Then the Algorithm 1 illustrates the training procedure of proposed SDA model.

    2.1 Autoencoder

    The most well-known use of deep neural networks is feature learning [16-18, 20].It offers a way to learn useful feature vector for the posed data [26].It may learn a compact,meaningful representation of the posed data,typically aiming for reducing dimensions [17].Autoencoder is basically a feed-forward and non-recurrent neural network.There may be one or more hidden layers in between input and output layer as shown in Figure 2.An autoencoder is a MLP except that it has equal number of inputs and output nodes.Moreover,an autoencoder predicts the input value x at the output.It does a feed-forward pass and predicts the value ofmeasures the difference between x andand back propagates by performing weight updates.

    If the hidden layers are less than the input/output layers, then the final hidden layer represents the compressed representation of the input [27].As mentioned earlier, autoencoder is a subtype of MLP, therefore it can use all the activation functions used in MLP.

    Denoising Autoencoder is a type of autoencoder, which tries to learn the input x at the output asby working on partially corrupted version of input.In this way, the learnt x may be more stable, robust and gives better higher level representation [19].

    Different methods are available for corruption of input data, like MN, a suitable proportion of image pixels are masked to 0; Salt-and-Pepper Noise (SP), where randomly selected pixels of an image are set to maximum and minimum values of the image uniformly [18].We have used MN corruption method in our experiments.

    Fig.2 Basic autoencoder structures

    2.2 Urdu ligature recognition stacked denoising autoencoder(ULR-SDA)

    In this work, denoising autoencoders is used for learning Urdu ligature features.The architecture of the Stacked Denoising Autoencoder is illustrated in Figure 3.SDA network training consists of two steps: pre-training and fine-tuning.Former is performed in unsupervised way and latter in supervised manner.All layers of SDA are first trained layer wise,getting the input from latent representation of the previous network except the first hidden layer, which gets its input from outside.Pre-training is very helpful in initializing the network nodes by good representation instead of initializing them randomly [16].A MLP is made of all pre-trained layers and a back propagation algorithm is used for fine-tuning the network.

    The process of the proposed Urdu Ligature Recognition Stacked Denoising Autoencoder (ULR-SDA) is illustrated in Algorithm 1.ULR-SDA also follows the two stage general setup of SDA as already explained.A deep stack of denoising autoencoders network is first pre-trained on the images of ligatures in an unsupervised manner, layer by layer as illustrated by first for-loop in Algorithm1.Afterwards, the trained layers are connected together to form a MLP.At the top of this MLP network, a logistic regression layer is added that uses softmax function for classification.This resultant network, ULR-SDA is then fine-tuned to anticipate the target ligatures as depicted by pseudo code lines 16-27 of Algorithm1.

    Fig.3 Training process of denoising autoencoder

    The flowchart of proposed ULR-SDA is represented by Figure 4.It consists of three basic steps, namely, preprocessing, training,and testing.Preprocessing performs segmentation of image and ground truth sentences into ligatures.Training is carried out according to Algorithm 1.Testing performs classification with accuracy.

    As proposed by Vincent et al.[16], for pre-training the stack of denoising autoencoder, the deep networks can be learnt using denoising autoencoders.The underlying model is trained with the intent to learn the hidden representation for reconstructing the input as normally done by standard auto encoder network.The only difference is that the denoising autoencoder is fed with the noisy input with the objective to learn more generalized hidden representation.

    Here, the tied weight W and the bias vector b are used for encoding.The reconstructionis computed by using transposed weight matrixtransposed bias vectorand non-linear function s as follows:

    Thus, the derived autoencoder is trained on the noisy version of the input ligature x for its reconstruction.Therefore, the next autoencoder is trained in the same fashion, but the input for training of the next autoencoder is the hidden representation of the previous autoencoder.During this process, the reconstruction error between non-corrupted input ligaturexand reconstructedis computed by cross-entropy as presented in [16]:

    Once all autoencoders are pre-trained, they are connected layer wise to form a feed-for-ward MLP network.At the top of this MLP network, a layer of logistic regression added for the classification.The last layer uses softmax activation function for estimating class probabilities.This resulting complete MLP network of denoising autoencoders is finetuned through backpropagation algorithm.

    Algorithm 1. Pseudocode for training ULR-SDA

    Fig.4 Preprocessing, training and testing process of urdu ligature recognition stacked denoising autoencoder (ULR-SDA)

    Fig.5 UPTI sample sentence

    Fig.6 UPTI segmented ligatures

    III.EXPERIMENTS

    This section discusses the experiments for evaluating ULR-SDA architecture on printed offline Urdu script.

    Two types of experiments based on different input dimensions were performed on UPTI dataset.ULR-SDA was trained on 178573 ligatures and tested on almost same number of ligatures with 3732 classes for 80*80 and 15*15 input dimensions.

    3.1 Dataset and feature extraction

    UPTI (Urdu Printed Text Images)1UPTI (Urdu Printed Text Images) dataset is provided by faisal.shafait@uwa.edu.au and adnan@cs.uni-kl.dedataset [9]is used for experimental purpose.This dataset contains 10063 sentence images.A sample sentence of UPTI is shown in Figure 5.

    Four degradation techniques [28] have been applied, namely: jitter, elastic elongation,threshold and sensitivity.Every degradation technique has been applied with four different parameters value.So, UPTI contains 12 degradation versions of original 10063 sentence images.The sentence images of UPTI dataset from un-degraded as well as jitter, elongation and sensitivity degraded versions were segmented into ligatures as shown in Figure 6.The number of ligatures of un-degraded, jitter degraded, sensitivity degraded, and elastic elongation degraded versions are 189262,189265, 189260, and 189262 respectively.Segmented ligatures are then resized according to requirement as shown in Figure 6(a, b).

    The ground truths are then tokenized into ligatures, having 3732 number of tokenized ligature classes after segmentation.Only ligatures from sentences are considered valid where a sentence contains equal number of images ligatures and label ligatures.This is because of the irregularities induced during printing as shown in Figure 7.Two ligatures are incorrectly connected by the extra length of the strokes, for exampleand.Fourth part consists of all secondary components of third part.Fourth partrepresents eight dots, out of that six are connected incorrectly.Size of these connected dots become more than the maximum threshold size of a secondary component and assumes as primary component by segmentation algorithm.Because of such incorrect connections,number of the image ligatures becomes less or more in numbers as compared to ligatures segmented from text lines of ground truth.

    Only 3732 unique Urdu ligatures have been used in the UPTI dataset after dropping out the sentences where any sort of irregularity exists,as illustrated in preprocessing stage.Resized and normalized images of ligatures are then vectored into their pixel values and stored with their labels in tuple form for feeding as input to ULR-SDA.SVM classifier was also trained and tested with this format.

    3.2 Experiment setup

    For performing the experiments, ULR-SDA is trained on 178573 ligatures of un-degraded version and 60, 000 each from jitter degraded and sensitivity degraded versions of UPTI dataset for validation and testing respectively.There are 3732 classes of Urdu ligatures2A repository of 18, 000 Urdu ligatures: http://www.cle.org.pk/software/ling_resources/UrduLigatures.htm,which are used in UPTI dataset.Two different versions of UPTI dataset, i.e.; ligature images of 80*80 and 15*15 dimensions, have been constructed for experiments.For comparison study, a SVM classifier has also been trained and tested as explained earlier for ULR-SDA.In all cases, unsupervised pre-training and supervised fine-tuning (with simple stochastic gradient descent) procedures were applied,with early stopping based on validation set performance.In Figure 8 and 9, the “h” stands for 100 and ‘k’ for 1000.For example in Figure 8, 25h-16h-9h and 7k-5k-4k stand for SDA with hidden layers [2500, 1600, 900] and[7000, 5000, 4000] respectively.

    3.3 Training ULR-SDA

    Different ULR-SDA networks are trained and tested for ligature recognition as shown in Table I and Figure 8.Best ULR-SDA network has three hidden layers with [7000, 5000,4000] units for 80*80 input dimensions and[2500, 1600, 900] units for 15*15 input dimensions.All hidden layers were pre-trained as denoising autoencoders by stochastic gradient descent, using the cross-entropy cost method with the learning rate of 0.001.

    For all experiments during pre-training, 10 epochs were executed for ligatures of 80*80 as well as for 15*15 dimensions.Fine tuning of the entire MLP network was done by stochastic gradient descent using cross entropy loss function while adopting the learning rate of 0.1.The fine tuning executed until number of epochs were less than100 or the validation error did not fall below 0.1 % as shown in Fig-ure 8.All experiments were conducted using Theano library 6 on GPU.

    Fig.7 Wrongly connected components of ligatures

    Fig.8 Number of epochs and training error of ULR-SDA

    Fig.9 Effect of the number of hidden units in each layer on error

    Table I Comparison of ULR-SDA and SVM based on same training and test data Table I(a) Results for 80*80 pictures

    Table I(b) Results for 15*15 pictures

    IV.RESULTS

    In this section, the results of ULR-SDA and SVM are compared.After the comparison, the analysis of different structures of ULR-SDA is presented.At the end, the effects of input and middle layer dimensions on error are discussed.

    4.1 Comparison of ULR-SDA and SVM

    In order to evaluate the performance of ULRSDA, the system is trained on the un-degraded ligatures of UPTI dataset.The trained network is then tested on elastic elongation, jitter and sensitivity degraded versions of the same dataset.

    For comparison purpose, a multiclass SVM with Radial basis function(rbf) kernel is trained and tested in the similar fashion as ULR-SDA.Both classifiers namely ULR-SDA and SVM are trained on 80*80 and 15*15 input dimensions for 3732 output classes.

    Three layered ULR-SD achieved the ligature recognition accuracy of 96% where the SVM got 95% accuracy for 80*80 input dimensions as shown in Table I(a).Similarly,three layered ULR-SDA recognized ligature with the accuracy of 95.86% and SVM got 85% accuracy for 15*15 input dimensions Table I(b).The ULR-SDA shows superior recognition result as compare to state of the art SVM algorithm as shown in Table I.Furthermore, both classifiers achieved better accuracy on 80*80 input dimensions as compared to 15*15, which is obvious because former contains more pixel information of every ligature as input.

    4.2 Structure of ULR-SDA

    The structure of the proposed ULR-SDA is evaluated on two parameters: firstly by increasing the number of neurons of hidden layers and secondly by increasing the number of hidden layers as are shown in Figures 9 and 10 respectively.

    For this evaluation, 15*15 input dimensions have been used for all networks.Figure 9(a) shows the performance deteriorates with the decrease in the number of hidden units per layer.Three layered network [7000, 5000,4000] performs better than the network with hidden layers [5000, 4500, 400], [4000, 3000,2000], and [2500, 1600, 900].Similarly, Figure 9(b) shows two layered network [4900,3600] and [3600, 2500] performs better than[2500, 1600] and [1600, 900].

    Figure 10 shows the increased performance of ULR-SDA, as we increase the number of hidden layers from 1 to 3, for three different networks.Apart from Figure 10, Figure 9 also shows that the performance of the 3 layered networks is better than that of two layered networks.

    4.3 Dimensions

    The performance of ULR-SDA varies by input data dimensions and dimension of middle layer as shown in Table II and Figure 11.ULRSDA has been trained on examples of 80*80 and 15*15 dimensions.ULR-SDA outputs better results when trained and tested on examples of 80*80 input dimensions as shown in Table II.

    The recognition accuracies of ULR-SDA upon training and testing the examples of 80*80 and 15*15 dimensions with hidden layers [7000, 5000, 4000] are 96% and 95.86%respectively as shown in Table II(a).

    Similarly, the recognition accuracies for 80*80 and 15*15 input data dimensions with hidden layers [5000, 4500, 4000] are 96% and 94.77% respectively as shown in Table II(b).The accuracy will be increased with the increase in dimensions of input data.

    In Figure 11(b), the general trend is showing an increase in error for two layered network as well.More informative middle layer representation results in better accuracy in URL-SDA can, therefore, be confirmed.

    V.CONCLUSION AND FUTURE WORK

    This work is inspired by the success of recent SDA deep networks for characters and digits recognition.At the same time, the recent work on SDA for recognition of Bangla Lanuage[24-25] motivated us towards the implementation of deep SDA for Urdu Nastaleeq recognition.This works mainly introduces the use of deep neural networks (SDA) in the field of Urdu OCR.Also, it demonstrates that the pro-cess of recognition may become very straightforward and easy, if raw pixel values are used instead of calculating different features.In this way, better representation of all aspects of input data may be efficiently achieved, which may enhance the performance of existing Urdu OCR systems.

    Fig.10 Effect of the number of hidden layers on error

    Fig.11 Dimension of middle layer and error

    Table II Effect of number of input data dimensions on accuracy (3732 classes,180k training examples)

    In this paper, stacked denoising autoencoders and softmax at output layer have been used for automatic feature extraction directly from raw pixel values and classification respectively.Different stacked denoising autoencoders have been trained on 178573 ligatures with 3732 classes from un-degraded UPTI (Urdu Printed Text Image) data set.The trained networks are then validated and tested on degraded versions of UPTI data set.To compare the performance of ULR-SDA, multi class SVM classifiers have been trained and tested on the same train and test sets.Test results show that the accuracies of ULR-SDA networks are 93%to 96%, while the accuracies of SVM classifiers are in the range of 80% to 95%.The results of ULR-SDA based ligature recognition achieved are therefore better than the precision realized by the prevailing Urdu OCR systems for such a large dataset of ligatures.

    In the future, we will pay more attention towards a better segmentation algorithm.It would also be interesting to investigate deep denoising autoencoders with more than three hidden layers.We have used static corruption process, it also needs further investigation.

    Note

    In this paper, we introduce stacked denoising autoencoders and softmax for automatic feature extraction directly from raw pixel values and classification respectively.

    ACKNOWLEDGEMENTS

    The financial supports from National Natural Science Foundation of China (Project No.61273365) and 111 Project (No.B08004) are gratefully acknowledged.

    [1] S.HUSSAIN, S.ALI, and Q.AKRAM, “Nastalique Segmentation-Based Approach for Urdu OCR,” International Journal on Document Analysis and Recognition (IJDAR), vol.18, no.4, pp.357-374, 2015.

    [2] I.Shamsher, Z.Ahmad , J.K.Orakzai, and A.Adnan, “OCR for printed Urdu script using feed forward neural network, ” In the Proceedings of World Academy of Science, Engineering and Technology, vol.23, pp.172-175, 2007.

    [3] K.P.Imran, Abdulbari A.A, Ali, and R.J Ramteke,“Recognition of offline handwritten isolated Urdu character, ” International Journal on Advances in Computational Research, vol.4, no.1,pp.117-121, 2012.

    [4] J.Tariq, U.Nauman, and M.U.Naru, “Softconverter: a novel approach to construct OCR for printed urdu isolated characters, ” In International Conference on Computer Engineering and Technology (ICCET), vol.3, pp.495-498,April.2010.

    [5] Q.U.Akram, S.Hussain, Z.Habib, “Font size independent OCR for Noori Nastaleeq, ” Proceedings of Graduate Colloquium on Computer Sciences (GCCS), Department of Computer Science, FAST-NU Lahore, vol.1, 2010.

    [6] K.Khan, R.Ullah, N.A.Khan, and K.Naveed,“Urdu character recognition using principal component analysis, ” International Journal of Computer Applications, vol.60, no.11, pp 1-4,2012.

    [7] Ahmad Z, Orakzai JK, Shamsher I, Adnan A.“Urdu Nastaleeq optical character recognition,” In Proceedings of world academy of science,engineering and technology, vol.26, pp.249-252, Dec, 2007.

    [8] T.Nawaz, S.A.Naqvi, H.urRehman, and A.Faiz,“Optical character recognition system for urdu(naskh font) using pattern matching technique,” International Journal of Image Processing(IJIP), vol.3, no.3, p.92, Jun, 2009.

    [9] N.Sabbour and F.Shafait, “A segmentation-free approach to Arabic and Urdu OCR, ” InIS&T/SPIE Electronic Imaging, pp.86580N-86580N,Feb, 2013.

    [10] A.Ul-Hasan, S.B.Ahmed, F.Rashid, F.Shafait,and T.M.Breuel, “Offline printed Urdu Nastaleeq script recognition with Bidirectional LSTM networks, ” In Document Analysis and Recognition (ICDAR), Conf.12, pp.1061-1065, Aug,2013.

    [11] S.Naz, A.I.Umar, R.Ahmad, S.B.Ahmad, S.H.Shirazi, I.Siddiqi, and M.I.Razzak, “Offline cursive urdu nastaliq script recognition using multidimensional recurrent neural networks, ”Neurocomputing, vol.177, pp.228–241, Feb,2016.

    [12] G.S.Lehal, “Ligature segmentation for urdu OCR, ” In 12th International Conference on Document Analysis and Recognition (ICDAR),pp.1130–1134, Aug, 2013, doi: 10.1109/ICDAR.2013.229

    [13] S.A.Husain, “A multi-tier holistic approach for Urdu Nastaliq recognition, ” In International Multi topic conference (INMIC), pp.84-84, Dec,2002.

    [14] S.T.Javed and S.Hussain, “Segmentation Based Urdu Nastalique OCR, ” In Iberoamerican Congress on Pattern Recognition, pp.41-49, 2013.

    [15] Q.Akram, S.Hussain, F.Adeeba, S.Rehman,and M.Saeed, “Framework of Urdu Nastalique Optical Character Recognition System, ” In the Proceedings of Conference on Language and Technology (CLT 14), Karachi, Pakistan, 2014.

    [16] P.Vincent, H.Larochelle, Y.Bengio, and P.A.Manzagol, “Extracting and composing robust features with denoising autoencoders, ” In Proceedings of the 25th international conference on Machine learning, pp.1096-1103, Jul, 2008.

    [17] G.E.Hinton and R.R.Salakhutdinov, “Reducing the dimensionality of data with neural networks,” Science.313(5786), pp.504-507, Jul, 2006.

    [18] P.Vincent, H.Larochelle, I.Lajoie, Y.Bengio Y,and P.A.Manzagol, “Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion, ”The Journal of Machine Learning Research, vol.11, no.Dec, pp.3371-3408, Mar, 2010

    [19] Y.Bengio, “Learning deep architectures for AI,” Foundations and trends in Machine Learning,vol.2, no.1, pp.1-27.Jan, 2009.

    [20] H.Larochelle, D.Erhan, and P.Vincent, “Deep learning using robust interdependent codes, ”In International Conference on Artificial Intelligence and Statistics, pp.312-319, 2009.

    [21] S.Naz, K.Hayat, I.Razzak, W.Anwar, M.Madani,and U.Khan.“The optical character recognition of Urdu-like cursive scripts, ” Pattern Recognition, vol.47, no.3, pp.1229-1248, 2014.

    [22] S.T.Javed, S.Hussain, A.Maqbool, S.Asloob,S.Jamil, and H.Moin, “Segmentation free nastalique urdu ocr, ” World Academy of Science,Engineering and Technology, vol.46, pp.456-461, 2010.

    [23] S.Sardar and A.Wahab, “Optical character recognition system for Urdu, ” In International Conference on Information and Emerging Technologies (ICIET), pp.1-5, Jun, 2010.

    [24] A.Pal and J.D.Pawar, “Recognition of online handwritten Bangla characters using hierarchical system with Denoising Autoencoders, ” In International Conference on Computation of Power, Energy Information and Communication,pp.47-51, Apr, 2015.

    [25] A.Pal, “Bengali handwritten numeric character recognition using denoising autoencoders, ” In IEEE International Conference on Engineering and Technology, pp.1-6, Mar, 2015.

    [26] Y.Bengio, P.Lamblin, D.Popovici, and H.Larochelle, “Greedy layer-wise training of deep networks, ” Advances in neural information processing systems, vol.19, pp.153, Dec, 2007.

    [27] P.Baldi, “Autoencoders, unsupervised learning,and deep architectures, ” Unsupervised and Transfer Learning Challenges in Machine Learning, vol.7, no.1, pp.37-50, 2012.

    [28] H.S.Baird, “Document image defect models, ”In Structured Document Image Analysis, vol.1,pp.546-556, Jan, 1992.

    [29] F.Feng, X.Wang, R.Li, and I.Ahmad, “Correspondence Autoencoders for Cross-Modal Retrieval, ” ACM Transactions on Multimedia Computing, Communications, and Applications,vol.12, no.1s, pp.26, Oct, 2015.

    [30] O.K.Oyedotun, E.O.Olaniyi, and A.Khashman, “Deep Learning in Character Recognition Considering Pattern Invariance Constraints, ”International Journal of Intelligent Systems and Applications, vol.7, no.7, pp.1, Jun, 2015.

    国内少妇人妻偷人精品xxx网站 | 午夜免费观看网址| av视频在线观看入口| 99精品久久久久人妻精品| 久久草成人影院| netflix在线观看网站| 成人特级黄色片久久久久久久| 午夜a级毛片| 极品教师在线免费播放| 久久这里只有精品19| 亚洲在线观看片| 麻豆成人av在线观看| 国产一区在线观看成人免费| cao死你这个sao货| 97超级碰碰碰精品色视频在线观看| 黑人欧美特级aaaaaa片| 黑人巨大精品欧美一区二区mp4| 国产一区在线观看成人免费| 亚洲国产欧美一区二区综合| 色尼玛亚洲综合影院| 成人高潮视频无遮挡免费网站| 亚洲精品在线观看二区| 亚洲av日韩精品久久久久久密| 午夜精品在线福利| 99精品在免费线老司机午夜| 精品电影一区二区在线| 亚洲精品粉嫩美女一区| 精品一区二区三区四区五区乱码| 免费一级毛片在线播放高清视频| 日韩欧美国产一区二区入口| 淫秽高清视频在线观看| 国产亚洲欧美98| 一区二区三区国产精品乱码| 国产三级中文精品| 成在线人永久免费视频| 首页视频小说图片口味搜索| 亚洲最大成人中文| 无遮挡黄片免费观看| 久久久精品欧美日韩精品| 国产精品美女特级片免费视频播放器 | 精品欧美国产一区二区三| 国产人伦9x9x在线观看| 国产亚洲av高清不卡| 日本 av在线| 19禁男女啪啪无遮挡网站| 91av网站免费观看| 免费在线观看亚洲国产| av黄色大香蕉| 不卡一级毛片| 亚洲av日韩精品久久久久久密| 一进一出抽搐动态| 在线观看舔阴道视频| 此物有八面人人有两片| 欧美日韩中文字幕国产精品一区二区三区| xxxwww97欧美| 欧美日韩国产亚洲二区| 亚洲国产看品久久| 日本a在线网址| 国产成人欧美在线观看| 亚洲自偷自拍图片 自拍| 国产三级在线视频| 91在线精品国自产拍蜜月 | 亚洲va日本ⅴa欧美va伊人久久| 国产野战对白在线观看| 欧美日韩乱码在线| 国产成人啪精品午夜网站| 婷婷亚洲欧美| 国产免费av片在线观看野外av| 成人无遮挡网站| 日韩欧美国产在线观看| 国产视频一区二区在线看| 香蕉av资源在线| 久久久久国内视频| 久久国产精品人妻蜜桃| 免费在线观看影片大全网站| 一个人免费在线观看的高清视频| 亚洲精品456在线播放app | 中国美女看黄片| 一本精品99久久精品77| 欧美极品一区二区三区四区| 国产精品一区二区精品视频观看| 国产91精品成人一区二区三区| 久久久久国内视频| 免费在线观看日本一区| 一进一出抽搐gif免费好疼| 19禁男女啪啪无遮挡网站| 国产麻豆成人av免费视频| 99久久精品一区二区三区| 日本免费一区二区三区高清不卡| 偷拍熟女少妇极品色| 欧美丝袜亚洲另类 | 精品免费久久久久久久清纯| 国产成人啪精品午夜网站| 国产精品98久久久久久宅男小说| 老汉色av国产亚洲站长工具| 久久这里只有精品19| 欧美日韩国产亚洲二区| 日韩欧美一区二区三区在线观看| 麻豆国产av国片精品| 精品国内亚洲2022精品成人| 亚洲 国产 在线| 成在线人永久免费视频| 国产久久久一区二区三区| av天堂中文字幕网| 88av欧美| 一进一出抽搐动态| 一本综合久久免费| 精品国产三级普通话版| 黄片大片在线免费观看| 99热精品在线国产| 午夜福利18| 男女之事视频高清在线观看| 日韩欧美国产在线观看| 久久欧美精品欧美久久欧美| а√天堂www在线а√下载| 午夜福利成人在线免费观看| 亚洲 欧美 日韩 在线 免费| 国产三级黄色录像| 女人被狂操c到高潮| 岛国在线观看网站| 一二三四社区在线视频社区8| 久99久视频精品免费| 久久久久久久久中文| 日本与韩国留学比较| 狠狠狠狠99中文字幕| 蜜桃久久精品国产亚洲av| 在线a可以看的网站| 88av欧美| 桃色一区二区三区在线观看| 国产毛片a区久久久久| 久久久成人免费电影| 美女cb高潮喷水在线观看 | 亚洲精品中文字幕一二三四区| 熟女少妇亚洲综合色aaa.| 成年版毛片免费区| 19禁男女啪啪无遮挡网站| 美女高潮的动态| 亚洲午夜精品一区,二区,三区| 1024香蕉在线观看| 成年免费大片在线观看| 黑人巨大精品欧美一区二区mp4| 天堂√8在线中文| 非洲黑人性xxxx精品又粗又长| 亚洲va日本ⅴa欧美va伊人久久| 久久99热这里只有精品18| 国产亚洲精品久久久com| 国产精品野战在线观看| 男女下面进入的视频免费午夜| 十八禁人妻一区二区| 免费观看精品视频网站| 欧美日本亚洲视频在线播放| 国产亚洲av嫩草精品影院| 国产精品野战在线观看| 九九久久精品国产亚洲av麻豆 | 亚洲欧美精品综合一区二区三区| 亚洲熟妇熟女久久| 欧美日韩亚洲国产一区二区在线观看| 午夜福利高清视频| 最近最新中文字幕大全免费视频| 精品一区二区三区av网在线观看| 在线视频色国产色| 嫁个100分男人电影在线观看| 国产欧美日韩精品一区二区| 午夜福利18| 国模一区二区三区四区视频 | 制服丝袜大香蕉在线| 成人18禁在线播放| 波多野结衣高清无吗| 岛国在线免费视频观看| 天堂影院成人在线观看| 最好的美女福利视频网| 午夜福利视频1000在线观看| 欧美乱色亚洲激情| 1024香蕉在线观看| 日本撒尿小便嘘嘘汇集6| 老司机在亚洲福利影院| 欧美日韩中文字幕国产精品一区二区三区| 久久久久精品国产欧美久久久| 亚洲成人久久性| 久久中文字幕人妻熟女| 国产精品一区二区免费欧美| 午夜福利18| 99riav亚洲国产免费| 国产激情偷乱视频一区二区| 97人妻精品一区二区三区麻豆| 国模一区二区三区四区视频 | 啪啪无遮挡十八禁网站| 午夜精品一区二区三区免费看| 亚洲av成人不卡在线观看播放网| 俺也久久电影网| 亚洲av中文字字幕乱码综合| 丰满的人妻完整版| 无人区码免费观看不卡| 午夜两性在线视频| 中文字幕熟女人妻在线| 精品国产乱子伦一区二区三区| 男人舔奶头视频| 日本 欧美在线| 99久久精品一区二区三区| 日韩 欧美 亚洲 中文字幕| 久久九九热精品免费| 搡老熟女国产l中国老女人| 国产蜜桃级精品一区二区三区| 在线观看免费午夜福利视频| 亚洲欧美精品综合一区二区三区| 日本免费a在线| 中文在线观看免费www的网站| 久久久久亚洲av毛片大全| 天天躁日日操中文字幕| 真人做人爱边吃奶动态| 国产精品免费一区二区三区在线| 久久久久性生活片| 夜夜看夜夜爽夜夜摸| 精品国产乱码久久久久久男人| 国产 一区 欧美 日韩| 午夜免费成人在线视频| 黄片大片在线免费观看| 白带黄色成豆腐渣| 叶爱在线成人免费视频播放| 精品国内亚洲2022精品成人| 宅男免费午夜| 91麻豆av在线| 人人妻,人人澡人人爽秒播| a级毛片在线看网站| 看黄色毛片网站| 91麻豆精品激情在线观看国产| www.精华液| 久久久久久九九精品二区国产| 两个人视频免费观看高清| 国内精品久久久久精免费| 欧美在线黄色| 国产欧美日韩一区二区精品| 成年版毛片免费区| 我要搜黄色片| 一区二区三区国产精品乱码| 免费高清视频大片| 亚洲欧洲精品一区二区精品久久久| 久久久久亚洲av毛片大全| 一本久久中文字幕| 婷婷亚洲欧美| 精品免费久久久久久久清纯| 在线观看舔阴道视频| 五月玫瑰六月丁香| 国内久久婷婷六月综合欲色啪| 免费看a级黄色片| 两性夫妻黄色片| 日韩国内少妇激情av| 亚洲精品国产精品久久久不卡| 欧美中文综合在线视频| 亚洲精品在线观看二区| 日韩有码中文字幕| 精品国产超薄肉色丝袜足j| 亚洲精品中文字幕一二三四区| 可以在线观看毛片的网站| 精品国产三级普通话版| 婷婷丁香在线五月| 91字幕亚洲| a在线观看视频网站| 亚洲专区字幕在线| 欧美性猛交黑人性爽| 中文字幕最新亚洲高清| 一区福利在线观看| 亚洲精品456在线播放app | 婷婷丁香在线五月| 亚洲真实伦在线观看| 两性夫妻黄色片| 白带黄色成豆腐渣| 极品教师在线免费播放| 国产探花在线观看一区二区| 久久精品aⅴ一区二区三区四区| xxxwww97欧美| 国产三级中文精品| 两性夫妻黄色片| 国产黄片美女视频| 日本a在线网址| 中文字幕av在线有码专区| 色吧在线观看| 久久久久精品国产欧美久久久| 国产精品久久久av美女十八| 18禁黄网站禁片午夜丰满| 国产亚洲av高清不卡| 91在线精品国自产拍蜜月 | 国产三级在线视频| 免费在线观看成人毛片| 亚洲专区中文字幕在线| 久9热在线精品视频| 又紧又爽又黄一区二区| 色播亚洲综合网| 国产成+人综合+亚洲专区| 国产黄a三级三级三级人| 国产人伦9x9x在线观看| 国产精品久久久人人做人人爽| 欧美3d第一页| 女人高潮潮喷娇喘18禁视频| 欧美色欧美亚洲另类二区| 两个人的视频大全免费| 给我免费播放毛片高清在线观看| 女同久久另类99精品国产91| 嫩草影院入口| 国产乱人伦免费视频| 男女之事视频高清在线观看| 熟妇人妻久久中文字幕3abv| 成在线人永久免费视频| 韩国av一区二区三区四区| 亚洲精品美女久久av网站| 在线播放国产精品三级| 国产精品av视频在线免费观看| 狂野欧美激情性xxxx| 久久精品人妻少妇| 香蕉国产在线看| 黄色日韩在线| 免费搜索国产男女视频| 网址你懂的国产日韩在线| 亚洲性夜色夜夜综合| 天堂av国产一区二区熟女人妻| 国产精品一区二区三区四区久久| 两个人看的免费小视频| 久久中文看片网| 国产激情久久老熟女| 美女大奶头视频| 久久久久性生活片| 最好的美女福利视频网| 99热6这里只有精品| 1000部很黄的大片| 后天国语完整版免费观看| 三级男女做爰猛烈吃奶摸视频| 亚洲欧美一区二区三区黑人| 99国产精品一区二区三区| a级毛片a级免费在线| 一级毛片精品| 欧美性猛交黑人性爽| 国产激情欧美一区二区| 琪琪午夜伦伦电影理论片6080| 91老司机精品| 国产精品一区二区三区四区久久| 日韩大尺度精品在线看网址| 五月玫瑰六月丁香| 欧美最黄视频在线播放免费| 久久精品影院6| 国产成人av激情在线播放| 美女黄网站色视频| 精品一区二区三区av网在线观看| 长腿黑丝高跟| 免费在线观看日本一区| 亚洲中文av在线| 国产精品一区二区免费欧美| 亚洲熟妇中文字幕五十中出| 亚洲欧美日韩高清在线视频| 天堂√8在线中文| 国内精品久久久久精免费| 亚洲av美国av| 精品一区二区三区av网在线观看| 国产av不卡久久| 中文字幕久久专区| 床上黄色一级片| 人妻丰满熟妇av一区二区三区| 99精品欧美一区二区三区四区| 一本精品99久久精品77| 亚洲精品色激情综合| 国产高清三级在线| 亚洲18禁久久av| xxx96com| 亚洲国产欧洲综合997久久,| 99热精品在线国产| 午夜福利免费观看在线| 在线国产一区二区在线| 国产精品精品国产色婷婷| 亚洲国产欧美网| 久久久国产精品麻豆| 少妇的逼水好多| 久久久久久国产a免费观看| 99热精品在线国产| 午夜福利免费观看在线| 一本综合久久免费| 成人午夜高清在线视频| 亚洲专区国产一区二区| 国产成人影院久久av| 免费av不卡在线播放| 国产精品 国内视频| 老司机在亚洲福利影院| 国产成人影院久久av| 国产精品久久久久久久电影 | 天堂网av新在线| 亚洲av美国av| 丁香六月欧美| 中文字幕熟女人妻在线| 日韩国内少妇激情av| 国产高清有码在线观看视频| 免费av不卡在线播放| 欧美成人一区二区免费高清观看 | 亚洲色图 男人天堂 中文字幕| 一区二区三区激情视频| 好男人在线观看高清免费视频| 12—13女人毛片做爰片一| 欧美日韩一级在线毛片| 国产三级中文精品| 久久草成人影院| 男人舔女人下体高潮全视频| 一本精品99久久精品77| 亚洲熟妇熟女久久| 久久精品国产综合久久久| 免费看美女性在线毛片视频| 国内精品一区二区在线观看| 免费看十八禁软件| 麻豆久久精品国产亚洲av| 国产精品美女特级片免费视频播放器 | 免费看美女性在线毛片视频| 亚洲va日本ⅴa欧美va伊人久久| 日本a在线网址| 成年版毛片免费区| 国产亚洲av嫩草精品影院| 久久香蕉精品热| 桃红色精品国产亚洲av| ponron亚洲| 午夜激情欧美在线| 一级a爱片免费观看的视频| 婷婷丁香在线五月| 1024香蕉在线观看| 可以在线观看毛片的网站| 综合色av麻豆| 国产伦一二天堂av在线观看| 国产精品女同一区二区软件 | 高清毛片免费观看视频网站| 一级黄色大片毛片| 久久久色成人| 精品人妻1区二区| 成年免费大片在线观看| 日韩成人在线观看一区二区三区| 欧洲精品卡2卡3卡4卡5卡区| 18禁国产床啪视频网站| 看免费av毛片| 久久中文字幕一级| 亚洲色图av天堂| 亚洲精品久久国产高清桃花| 身体一侧抽搐| e午夜精品久久久久久久| 国产一区在线观看成人免费| 亚洲第一电影网av| 一二三四社区在线视频社区8| 啦啦啦观看免费观看视频高清| 亚洲精品国产精品久久久不卡| 亚洲国产日韩欧美精品在线观看 | 91麻豆av在线| 国产精品,欧美在线| 看免费av毛片| 欧美一区二区国产精品久久精品| 久久精品亚洲精品国产色婷小说| 手机成人av网站| 欧美在线一区亚洲| 一区二区三区激情视频| 中文字幕精品亚洲无线码一区| 在线视频色国产色| 午夜久久久久精精品| 变态另类丝袜制服| 熟妇人妻久久中文字幕3abv| 又大又爽又粗| 国产精品综合久久久久久久免费| 久久久国产成人精品二区| 黄色女人牲交| 看免费av毛片| 观看美女的网站| 在线观看66精品国产| 亚洲成av人片免费观看| 久久久久亚洲av毛片大全| 精品午夜福利视频在线观看一区| 国产免费av片在线观看野外av| 国内精品久久久久精免费| 嫩草影院精品99| 激情在线观看视频在线高清| 两性夫妻黄色片| 婷婷精品国产亚洲av在线| 亚洲avbb在线观看| 男女床上黄色一级片免费看| 香蕉久久夜色| 嫩草影院入口| 日韩免费av在线播放| 国产97色在线日韩免费| 国产不卡一卡二| 精品久久久久久久久久免费视频| or卡值多少钱| 国产精品98久久久久久宅男小说| 成人性生交大片免费视频hd| 亚洲乱码一区二区免费版| 国产精品久久电影中文字幕| 亚洲熟妇熟女久久| 国产精品乱码一区二三区的特点| 俺也久久电影网| 此物有八面人人有两片| 欧洲精品卡2卡3卡4卡5卡区| 床上黄色一级片| 超碰成人久久| 欧美成狂野欧美在线观看| 国产男靠女视频免费网站| 国产精品永久免费网站| 国产av不卡久久| 欧美绝顶高潮抽搐喷水| 俺也久久电影网| 神马国产精品三级电影在线观看| 亚洲avbb在线观看| 日韩三级视频一区二区三区| 国产精品乱码一区二三区的特点| 亚洲成av人片免费观看| 精品久久久久久,| 国产高清videossex| 久久精品综合一区二区三区| 老熟妇乱子伦视频在线观看| 欧美日韩福利视频一区二区| 男女下面进入的视频免费午夜| 怎么达到女性高潮| 日本免费a在线| 夜夜看夜夜爽夜夜摸| 欧美色欧美亚洲另类二区| 久久久成人免费电影| 精品久久久久久久人妻蜜臀av| 亚洲aⅴ乱码一区二区在线播放| 亚洲电影在线观看av| 一进一出抽搐动态| 亚洲国产看品久久| 国产精品一区二区免费欧美| 在线观看美女被高潮喷水网站 | 两个人看的免费小视频| 亚洲九九香蕉| 久久香蕉精品热| 97超级碰碰碰精品色视频在线观看| 色综合亚洲欧美另类图片| 国产成人精品久久二区二区免费| 久久精品91无色码中文字幕| 久久精品人妻少妇| 亚洲第一电影网av| 天天添夜夜摸| 日本黄色视频三级网站网址| 夜夜夜夜夜久久久久| 黄片小视频在线播放| 黑人操中国人逼视频| 亚洲精品久久国产高清桃花| av中文乱码字幕在线| 亚洲天堂国产精品一区在线| 久久中文字幕人妻熟女| 久久精品91蜜桃| aaaaa片日本免费| 男女下面进入的视频免费午夜| 成人欧美大片| 色播亚洲综合网| 好男人电影高清在线观看| 美女扒开内裤让男人捅视频| 成年女人永久免费观看视频| 不卡av一区二区三区| 母亲3免费完整高清在线观看| 亚洲国产高清在线一区二区三| 99在线视频只有这里精品首页| 国内少妇人妻偷人精品xxx网站 | 欧美另类亚洲清纯唯美| 99国产极品粉嫩在线观看| 欧美色欧美亚洲另类二区| 国产一区二区在线观看日韩 | 99精品欧美一区二区三区四区| 国产高清三级在线| 变态另类丝袜制服| 国产成人精品久久二区二区免费| 日日干狠狠操夜夜爽| 又黄又爽又免费观看的视频| 母亲3免费完整高清在线观看| 日本黄色视频三级网站网址| 可以在线观看的亚洲视频| 欧美又色又爽又黄视频| 国产v大片淫在线免费观看| 亚洲av成人不卡在线观看播放网| 成年女人毛片免费观看观看9| 宅男免费午夜| 人人妻人人看人人澡| 久久久久久久精品吃奶| 老汉色∧v一级毛片| 美女高潮的动态| 一进一出好大好爽视频| 一级毛片精品| 51午夜福利影视在线观看| 欧美日韩瑟瑟在线播放| www.www免费av| 12—13女人毛片做爰片一| 国产日本99.免费观看| 午夜福利在线在线| 一本一本综合久久| 国产亚洲av高清不卡| 午夜精品在线福利| 黑人巨大精品欧美一区二区mp4| 69av精品久久久久久| 757午夜福利合集在线观看| 国产成人影院久久av| 国内少妇人妻偷人精品xxx网站 | 国产男靠女视频免费网站| 亚洲av电影不卡..在线观看| 日韩欧美三级三区| 一区二区三区国产精品乱码| 51午夜福利影视在线观看| 女同久久另类99精品国产91| 亚洲人成电影免费在线| 欧美又色又爽又黄视频| 精品99又大又爽又粗少妇毛片 | 亚洲成av人片免费观看| 久久欧美精品欧美久久欧美| 99久久综合精品五月天人人| 久久婷婷人人爽人人干人人爱| 99热这里只有是精品50| a级毛片a级免费在线| 国产熟女xx| 中亚洲国语对白在线视频| 久久国产乱子伦精品免费另类| 啦啦啦观看免费观看视频高清| 嫩草影院入口| 日韩欧美精品v在线| 亚洲av中文字字幕乱码综合| 男插女下体视频免费在线播放| 国产精品香港三级国产av潘金莲| 亚洲中文日韩欧美视频| 美女cb高潮喷水在线观看 |