UWSpace - Waterloo's institutional repository

  •   UWSpace Home
  • University of Waterloo

Convolutional Neural Network for Sentence Classification

Thumbnail

View Google statistics

Collections

  • Computer Science

Cite this version of the work

Other formats

Search UWSpace

Information.

Help | Advanced Search

Computer Science > Computation and Language

Title: convolutional neural networks for sentence classification.

Abstract: We report on a series of experiments with convolutional neural networks (CNN) trained on top of pre-trained word vectors for sentence-level classification tasks. We show that a simple CNN with little hyperparameter tuning and static vectors achieves excellent results on multiple benchmarks. Learning task-specific vectors through fine-tuning offers further gains in performance. We additionally propose a simple modification to the architecture to allow for the use of both task-specific and static vectors. The CNN models discussed herein improve upon the state of the art on 4 out of 7 tasks, which include sentiment analysis and question classification.

Submission history

Access paper:.

  • Other Formats

References & Citations

  • Google Scholar
  • Semantic Scholar

5 blog links

Dblp - cs bibliography, bibtex formatted citation.

BibSonomy logo

Bibliographic and Citation Tools

Code, data and media associated with this article, recommenders and search tools.

  • Institution

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs .

Malaria Detection Using Convolutional Neural Network

  • Conference paper
  • First Online: 04 January 2022
  • Cite this conference paper

thesis on convolutional neural network

  • Khaled Almezhghwi   ORCID: orcid.org/0000-0001-5755-7297 15  

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 362))

Included in the following conference series:

  • International Conference on Theory and Application of Soft Computing, Computing with Words and Perceptions

388 Accesses

2 Citations

Malaria detection and classification is still time and money costly. Identification of malaria cells can be done through some costly techniques. Those techniques are good, but they require time and high cost. Hence, there is a need of discovering alternative techniques to identify blood cells, that saves both time and reduce cost. In addition to time and cost, those new techniques should also be accurate and effective. Thus, in this work, we propose a transfer learning based GoogleNet approach for the classification of Malaria cells. The depth and inception of GoogleNet made it a very robust deep network that can classify accurately if trained and fine-tuned on enough amount of data. Thus, in this study, 27558 of the 2 types of cells are used for fine-tuning and testing the pre-trained network GoogleNet. Experimentally, the employed GoogleNet fine-tuned to classify Malaria, showed a great capability in generalizing accurate and correct diagnosis of images that were not seen during training, in which it achieved a testing accuracy of 95% with a relatively short time and small number of epochs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
  • Available as EPUB and PDF
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

thesis on convolutional neural network

Efficient Malaria Cell Image Classification Using Deep Convolutional Neural Network

thesis on convolutional neural network

Malaria Cell Image Classification Using Convolutional Neural Networks (CNNs)

thesis on convolutional neural network

Razzak, M.I.: Malarial parasite classification using recurrent neural network. Int. J. Image Process 9 (2), 27–32 (2015)

Google Scholar  

World Health Organization. Disease burden of malaria. http://www.who.int/mediacentre/factsheets/fs094/en/

Almezhghwi, K., Serte, S., Al-Turjman, F.: Convolutional neural networks for the classification of chest X-rays in the IoT era. Multim. Tool. Appl. 80 (19), 29051–29065 (2021). https://doi.org/10.1007/s11042-021-10907-y

Kaymak, S., Almezhghwi, K., Shelag, A.A.S.: Classification of diseases on chest X-rays using deep learning. In: Aliev, R.A., Kacprzyk, J., Pedrycz, W., Jamshidi, M.O., Sadikoglu, F.M. (eds.) ICAFS 2018. AISC, vol. 896, pp. 516–523. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-04164-9_69

Abiyev, R.H., Maaitah, M.K.S.: Deep convolutional neural networks for chest diseases detection. J. Healthc. Eng. 2018 , 4168538 (2018). https://doi.org/10.1155/2018/4168538

Nguyen, D., et al.: A comparison of Monte Carlo dropout and bootstrap aggregation on the performance and uncertainty estimation in radiation therapy dose prediction with deep learning neural networks. Phys. Med. Biol. 66 (5), 054002 (2021)

Malaria Cell Images Dataset. https://www.kaggle.com/iarunava/cell-images-for-detecting-malaria

Szegedy, C., et al.: Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p. 12 (2015). https://doi.org/10.1109/CVPR.2015.7298594

Download references

Author information

Authors and affiliations.

Electrical and Electronics Engineering, College of Electronics Technology, Tripoli, Libya

Khaled Almezhghwi

You can also search for this author in PubMed   Google Scholar

Editor information

Editors and affiliations.

Department of Control Systems, Azerbaijan State Oil and Industry University, Baku, Azerbaijan

Rafik A. Aliev

System Research Institute, Polish Academy of Sciences, Warsaw, Poland

Janusz Kacprzyk

Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, Canada

Witold Pedrycz

Department of Electrical and Computer Engineering, University of Texas at San Antonio, San Antonio, TX, USA

Mo Jamshidi

Azerbaijan State Oil and Industry University, Baku, Azerbaijan

Mustafa Babanli

Department of Mechatronics, Near East University, North Cyprus, Turkey

Fahreddin M. Sadikoglu

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper.

Almezhghwi, K. (2022). Malaria Detection Using Convolutional Neural Network. In: Aliev, R.A., Kacprzyk, J., Pedrycz, W., Jamshidi, M., Babanli, M., Sadikoglu, F.M. (eds) 11th International Conference on Theory and Application of Soft Computing, Computing with Words and Perceptions and Artificial Intelligence - ICSCCW-2021. ICSCCW 2021. Lecture Notes in Networks and Systems, vol 362. Springer, Cham. https://doi.org/10.1007/978-3-030-92127-9_19

Download citation

DOI : https://doi.org/10.1007/978-3-030-92127-9_19

Published : 04 January 2022

Publisher Name : Springer, Cham

Print ISBN : 978-3-030-92126-2

Online ISBN : 978-3-030-92127-9

eBook Packages : Intelligent Technologies and Robotics Intelligent Technologies and Robotics (R0)

Share this paper

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Publish with us

Policies and ethics

  • Find a journal
  • Track your research

Image Classification using Quantum Convolutional Neural Network

Ieee account.

  • Change Username/Password
  • Update Address

Purchase Details

  • Payment Options
  • Order History
  • View Purchased Documents

Profile Information

  • Communications Preferences
  • Profession and Education
  • Technical Interests
  • US & Canada: +1 800 678 4333
  • Worldwide: +1 732 981 0060
  • Contact & Support
  • About IEEE Xplore
  • Accessibility
  • Terms of Use
  • Nondiscrimination Policy
  • Privacy & Opting Out of Cookies

A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity. © Copyright 2024 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.

Convolutional Neural Network

A convolutional neural network is a type of deep learning network used primarily to identify and classify images and to recognize objects within images.

How Does a Convolutional Neural Network Work?

An artificial neural network is a system of hardware and/or software patterned after the way neurons operate in the human brain. Convolutional neural networks (CNNs) apply a variation of multilayer perceptrons (algorithms that classify visual inputs), usually across multiple convolutional layers that are either entirely connected or pooled.

CNNs learn in the same way humans do. People are born without knowing what a cat or a bird looks like. As we mature, we learn that certain shapes and colors correspond to elements that collectively correspond to an element. Once we learn what paws and beaks look like, we’re better able to differentiate between a cat and a bird.

Neural networks essentially work the same way. By processing training sets of labeled images, the machine is able to learn to identify elements that are characteristic of objects within the images.

A CNN is one of the most popular types of deep learning algorithms. Convolution is the simple application of a filter to an input that results in an activation represented as a numerical value. By repeatedly applying the same filter to an image, a map of activations called a feature map is produced. This indicates the locations and strengths of detected features.

A convolution is a linear operation that involves multiplying a set of weights with the input to yield a two-dimensional array of weights called a filter. If the filter is tuned to detect a specific type of feature in the input, then the repetitive use of that filter across the entire input image can discover that feature anywhere in the image.

A convolution.

For example, one filter may be designed to detect curves of a certain shape, another to detect vertical lines, and a third to detect horizontal lines. Other filters may detect colors, edges, and degrees of light intensity. Connecting the output of multiple filters can reveal complex shapes that matched known elements in the training data.

A CNN usually consists of three layers: 1) an input layer, 2) an output layer, and 3) a hidden layer that includes multiple convolutional layers. Within the hidden layers are pooling layers, fully connected layers, and normalization layers.

The three layers of a convolutional neural network.

The first layer is typically devoted to capturing basic features such as edges, color, gradient orientation, and basic geometric shapes. As layers are added, the model fills in high-level features that progressively determine that a large brown blob first is a vehicle, then a car, and then a Buick.

The pooling layer progressively reduces the spatial size of the representation for more efficient computation. It operates on each feature map independently. A common approach used in pooling is max pooling, in which the maximum value of an array is captured, reducing the number of values needed for calculation. Stacking convolutional layers allows the input to be decomposed into its fundamental elements.

Normalization layers regularize the data to improve the performance and stability of neural networks. Normalization makes the inputs of each layer more manageable by converting all inputs to a mean of zero and a variance of one.

Fully connected layers are used to connect every neuron in one layer to all the neurons in another layer.

Neural Networks

Why Convolutional Neural Networks?

There are three basic types of neural networks:

  • Multilayer Perceptrons are good at classification prediction problems using labeled inputs. They’re flexible networks that can be applied to a variety of scenarios, including image recognition.
  • Recurrent Neural Networks are optimized for sequence prediction problems using one or more steps as input and multiple steps as output. They’re strong at interpreting time series data but are not considered effective for image analysis.
  • Convolutional Neural Networks were specifically designed to map image data to output variables. They’re particularly strong at developing internal representations of two-dimensional images that can be used to learn position and scale invariant structures. This makes them especially good at working with data that has a spatial relationship component.

CNNs have become the go-to model for many of the most advanced computer vision applications of deep learning, such as facial recognition, handwriting recognition, and text digitization. They also have applications in recommendation systems. The turning point was in 2012, when Alex Krizhevsky , who was then a graduate student at the University of Toronto, used the CNN model to win that year’s ImageNet competition by dropping the classification error record from 26% to 15%—an astounding achievement at the time.

For applications involving image processing, the CNN model has been shown to deliver the best results and the greatest computational efficiency. While it isn’t the only deep learning model that’s appropriate to this domain, it is the consensus choice and will be the focus of continuous innovation in the future.

Key Use Cases

CNNs are the image crunchers now used by machines to identify objects and today’s eyes of autonomous vehicles , oil exploration , and fusion energy research. They can also help spot diseases faster in medical imaging and save lives .

AI-driven machines of all types are becoming powered with eyes like ours, thanks to CNNs and RNNs. Much of these applications of AI are made possible by decades of advances in deep neural networks and strides in high-performance computing from GPUs to process massive amounts of data.

Why Convolutional Neural Networks Matter to You

Data Science Teams

Image recognition has a broad range of applications and needs to be a core competency of many data science teams. CNNs are an established standard that provides a baseline of skills that data science teams can learn and acquire to address current and future image processing needs.

Data Engineering Teams

Engineers who understand the training data needed for CNN processing are a step ahead of the game in supporting their organizations’ requirements. Datasets follow a prescribed format and numerous public datasets are available for engineers to learn from. This streamlines the process of getting deep learning algorithms into production.

Accelerating Convolutional Neural Networks using GPUs

State-of-the-art neural networks can have from millions to well over one billion parameters to adjust via back-propagation. They also require a large amount of training data to achieve high accuracy, meaning hundreds of thousands to millions of input samples will have to be run through both a forward and backward pass. Because neural nets are created from large numbers of identical neurons, they are highly parallel by nature. This parallelism maps naturally to GPUs , which provide a significant computation speedup over CPU-only training.

Deep learning frameworks allow researchers to create and explore Convolutional Neural Networks (CNNs) and other Deep Neural Networks (DNNs) easily, while delivering the high speed needed for both experiments and industrial deployment. The NVIDIA Deep Learning SDK accelerates widely-used deep learning frameworks such as Caffe, CNTK, TensorFlow, Theano, and Torch, as well as many other machine learning applications. The deep learning frameworks run faster on GPUs and scale across multiple GPUs within a single node. To use the frameworks with GPUs for Convolutional Neural Network training and inference processes, NVIDIA provides cuDNN and TensorRT™ respectively. cuDNN and TensorRT provide highly tuned implementations for standard routines such as convolution, pooling, normalization, and activation layers.

Click here for a step-by-step installation and usage guide. You can also find a fast C++/NVIDIA® CUDA® implementation of convolutional neural networks here .

To develop and deploy a vision model in no-time, NVIDIA offers the DeepStream SDK for vision AI developers, as well as Transfer Learning Toolkit (TLT) to create accurate and efficient AI models for a computer vision domain.

  • Learn how a CNN detects brain hemorrhages with accuracy rivaling experts
  • For a more technical deep dive: Deep Learning in a Nutshell: Core Concepts ,  Understanding Convolution in Deep Learning and the difference between a CNN and an RNN
  • NVIDIA provides optimized software stacks to accelerate training and inference phases of the deep learning workflow. Learn more on the NVIDIA deep learning home page .
  • The NVIDIA Deep Learning Institute (DLI) offers hands-on training for developers, data scientists, and researchers in AI and accelerated computing
  • For developer news and resources, check out the  NVIDIA developers site .
  • Company Overview
  • Venture Capital (NVentures)
  • NVIDIA Foundation
  • Social Responsibility
  • Technologies
  • Company Blog
  • Technical Blog
  • Stay Informed
  • Events Calendar
  • GTC AI Conference
  • NVIDIA On-Demand
  • Executive Insights
  • Startups and VCs
  • Documentation
  • Technical Training
  • Training for IT Professionals
  • Professional Services for Data Science
  • Privacy Policy
  • Manage My Privacy
  • Do Not Sell or Share My Data
  • Terms of Service
  • Accessibility
  • Corporate Policies
  • Product Security

IMAGES

  1. Convolutional Neural Networks The Why What And How Of Convolutional

    thesis on convolutional neural network

  2. Overview and details of a convolutional neural network (CNN

    thesis on convolutional neural network

  3. convolutional neural network thesis

    thesis on convolutional neural network

  4. Introduction To Convolutional Neural Networks

    thesis on convolutional neural network

  5. Using Deep Learning Models / Convolutional Neural Networks

    thesis on convolutional neural network

  6. Convolutional Neural Networks

    thesis on convolutional neural network

VIDEO

  1. What are Convolutional Neural Networks?

  2. DeepSteadyFlows

  3. Lecture 8 Part 1

  4. CNN for Traffic Sign Recognition

  5. Convolutional Neural Network

  6. Three Minute Thesis (3MT) 2021

COMMENTS

  1. PDF IMAGE CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORKS

    Oulu University of Applied Sciences Information Technology, Internet Services. Author: Hung Dao Title of the bachelor's thesis: Image Classification Using Convolutional Neural Networks Supervisor: Jukka Jauhiainen Term and year of completion: Spring 2020 Number of pages: 31. The objective of this thesis was to study the application of deep ...

  2. PDF Deep Learning: An Overview of Convolutional Neural Network(CNN)

    An overview of Convolutional Neural Network (CNN) and its applications in deep learning.

  3. A Comprehensive Literature Review on Convolutional Neural Networks

    Artificial Intelligence, Computer Vision, Convolutional Neural Networks, Deep Neural Networks, Image Processing, Machine Learning, Machine Vision, Pattern Recognition. 1. INTRODUCTION cognitive neuroscience. The review . in this paper covers a lot of topics and is structured as follows: Analysis of few related hardware research works,

  4. PDF Examining the Structure of Convolutional Neural Networks

    uses a speci c model called a neural network [2]. What follows in this thesis is an introduction to supervised learning, an introduction to neural networks, and my work on Convolutional Neural Networks, a speci c class of neural networks. 1.2 Supervised Learning

  5. A review of convolutional neural networks in computer vision

    A convolutional neural network (Li et al. 2021 ), known for local connectivity of neurons, weight sharing, and down-sampling, is a deep feed-forward multilayered hierarchical network inspired by the receptive field mechanism in biology. As one of the deep learning models, a CNN can also achieve "end-to-end" learning.

  6. PDF ROBUST CLASSIFICATION WITH CONVOLUTIONAL NEURAL NETWORKS A Thesis

    Chapter 2 of this thesis will present a literature review about the convolutional neural network. I shall present some techniques that increase the accuracy for Convolutional Neural Networks (CNNs). To test system performance, the Modified NIST or MNIST dataset demonstrated in [1] was chosen.

  7. PDF Analysis and Optimization of Convolutional Neural Network Architectures

    Convolutional Neural Network Architectures Master Thesis of Martin Thoma Department of Computer Science Institute for Anthropomatics and FZI Research Center for Information Technology Reviewer: Prof. Dr.-Ing. R. Dillmann Second reviewer: Prof. Dr.-Ing. J. M. Zöllner Advisor: Dipl.-Inform. Michael Weber Research Period: 03. May 2017 ...

  8. Designing a Convolutional Neural Network for Image Recognition: A

    A powerful tool for image recognition, Convolutional Neural Networks (CNNs) have been successfully applied in various fields including computer vision, medical image analysis, and self-driving cars.

  9. [1707.09725] Analysis and Optimization of Convolutional Neural Network

    View a PDF of the paper titled Analysis and Optimization of Convolutional Neural Network Architectures, by Martin Thoma. ... Master's thesis. 73 pages + 24 pages appendix; 39 figures; 33 tables: Subjects: Computer Vision and Pattern Recognition (cs.CV) Cite as: arXiv:1707.09725 [cs.CV]

  10. An Introduction to Convolutional Neural Networks

    An Introduction to Convolutional Neural Networks Keiron O'Shea1 and Ryan Nash2 1 Department of Computer Science, Aberystwyth University, Ceredigion, SY23 3DB [email protected] 2 School of Computing and Communications, Lancaster University, Lancashire, LA1 4YW [email protected] Abstract. The field of machine learning has taken a dramatic twist in re-

  11. PDF UvA-DARE (Digital Academic Repository)

    this thesis we explore ways to leverage symmetries to improve the ability of convolutional neural networks to generalize from relatively small samples. We argue and show empirically that in the context of deep learning it is bet-ter to learn equivariant rather than invariant representations, because invari-

  12. Application of Convolutional Neural Network Framework on Generalized

    Marseet, Akram, "Application of Convolutional Neural Network Framework on Generalized Spatial Modulation for Next Generation Wireless Networks" (2018). Thesis. Rochester Institute of Technology. Accessed from This Thesis is brought to you for free and open access by the RIT Libraries. For more information, please contact [email protected]. RIT

  13. (PDF) Convolutional Neural Networks

    Convolutional neural networks (CNNs or ConvNets) are a speci alized form of deep. neural networks for analyzing input data that contain some form of spatial structure. (Goodfellow et al. 2016 ...

  14. PDF Thesis One-shot Learning With Pretrained Convolutional Neural Network

    This thesis proposes an approximate nearest neighbor-based method for one-shot learning. This. method makes use of the features produced by a pretrained convolutional neural network and builds a. proximity forest to classify new classes. The algorithm is tested in two datasets with differetttiCVUCCbDDDDGDiODDOnt.

  15. PDF Deep Convolutional Neural Network for Effective Image Analysis

    Deep Convolutional Neural Network for Effective Image Analysis DESIGN AND IMPLEMENTATION OF A DEEP PIXEL-WISE SEGMENTATION ... This master thesis presents the process of designing and implementing a CNN-based architecture for image recognition included in a larger project in the field of fashion recommendation with deep learning. ...

  16. PDF Convolutional Neural Network (CNN)

    Convolutional Neural Network (CNN) by Vinay K. Chawla May, 2021 Director of Thesis: Carol Massarra, PhD Major Department: Construction Management Assessing pavement condition is extremely essential in any effort to reduce future economic losses and improve the structural reliability and resilience. Data resulting from pavement

  17. MSc THESIS

    MSc THESIS Exploring Convolutional Neural Networks on the ˆ-VEX architecture Jonathan Tetteroo Abstract Faculty of Electrical Engineering, Mathematics and Computer Science CE-MS-2018-10 As machine learning algorithms play an ever increasing role in to-day's technology, more demands are placed on computational hard-

  18. Convolutional Neural Network for Sentence Classification

    This thesis aims at recognizing the relation candidates inside a question. We define a multi-label classification problem for this challenging task. Based on the word2vec representation of words, we propose two convolutional neural networks (CNNs) to solve the multi-label classification problem, namely Parallel CNN and Deep CNN.

  19. Convolutional Neural Networks for Sentence Classification

    We report on a series of experiments with convolutional neural networks (CNN) trained on top of pre-trained word vectors for sentence-level classification tasks. We show that a simple CNN with little hyperparameter tuning and static vectors achieves excellent results on multiple benchmarks. Learning task-specific vectors through fine-tuning offers further gains in performance. We additionally ...

  20. Convolutional Neural Network for Sentence Classification

    This thesis aims at recognizing the relation candidates inside a question by proposing two convolutional neural networks (CNNs) to solve the multi-label classification problem, namely Parallel CNN and Deep CNN. The goal of a Knowledge Base-supported Question Answering (KB-supported QA) system is to answer a query natural language by obtaining the answer from a knowledge database, which ...

  21. Malaria Detection Using Convolutional Neural Network

    In this thesis, a transfer learning based automatic classification system that is applied for the Classification of Malaria Infected Cells is presented. In order to achieve fully automated diagnosis without any manual feature extraction, we chose deep convolutional neural network (CNN) as the classifier.

  22. PDF Convolutional Neural Network (Cnn) for Digital Radio Frequency Memory

    Convolutional Neural Networks (CNN) have gained popularity in the last few years with the advent of faster high-performance computer systems which rely on GPUs for the best computational performance. A CNN operates from a mathemat-ical perspective and is used for non-trivial tasks such as image classiĄcation. CNNs

  23. Deep Convolution Neural Network in Clustering Explanation

    This paper analyze the effectiveness of feature extraction of deep convolutional neural networks in image classification from the perspective of the basic statistical pattern recognition method, K-means clustering. This clustering works on those features extracted from each convolution layer of the VGG16 network, as well as the visualization of those features. The visualization graphs show the ...

  24. Image Classification using Quantum Convolutional Neural Network

    In this thesis, neural networks will be trained to determine parameters for various parametric quantum circuits to perform important classification tasks, such as image classification. But for image classification, features from the images must also be extracted and epresented in terms of qubits, requiring convolutional layers tailored for ...

  25. PDF Exploring Graph Neural Networks for Molecular Activity Prediction

    Convolutional Neural Network (MGCN) for molecular property prediction, empha- sizing its generalizability and transferability. Lastly,Wang et al.(2023) experimented

  26. Convolutional Neural Network

    An artificial neural network is a system of hardware and/or software patterned after the way neurons operate in the human brain. Convolutional neural networks (CNNs) apply a variation of multilayer perceptrons (algorithms that classify visual inputs), usually across multiple convolutional layers that are either entirely connected or pooled.