An Introduction to Computer Vision

Sources, shadows and shading, linear filters and edge detection, pyramids and texture, segmentation by clustering, fitting and segmentation, segmentation and fitting using probabilistic methods, tracking using linear dynamic models and the kalman filter, model-based vision, recognition by template matching, recognition by relations between templates.

CS4670/5670 - Introduction to Computer Vision

and
Mon. / Wed. / Fri. 1:25pm - 2:10pm
Phillips Hall 101


  • Geometry / Physics of image formation
  • Properties of images and basic image processing
  • 3D reconstruction
  • Grouping (of image pixels into objects)
  • Machine learning in computer vision: basics, hand-designed feature vectors, convolutional networks
  • Detecting and localizing objects

Office Hour Calendar

Lectures / notes:.

Date Topic (with linked notes / slides) Additional reading Assignments etc
Jan 24 Introduction [ | ] Szeliski 1 -
Jan 26 The visual world [ | ] Szeliski 2 -
Jan 29 Image filtering [ | ] Szeliski 3.1-3.2 -
Jan 31 Image filtering and Fourier transforms [ | ] Szeliski 3.4 -
Feb 2 Fourier transforms and resizing and resampling [ | ] Szeliski 3.4, 2.3.1 -
Feb 5 Resizing, resampling and pyramids [ | ] Szeliski 3.5, 2.3.1 -
Feb 7 Grouping I - Edge detection [ | ] Szeliski 3.5, 4.2 -
Feb 9 Numpy / scipy tutorial [ | ] - -
Feb 12 Grouping II - Edge detection and k-means [ | ] Szeliski 4.1, 5.3 -
Feb 14 Grouping III - Images as graphs[ | ] Szeliski 5.3 PA1 due
Feb 16 Grouping IV | The correspondence problem [ | ] - -
Feb 19 - -
Feb 21 Feature detection [ | ] Szeliski 4.1 -
Feb 23 Harris corner detector [ | ] Szeliski 4.1 -
Feb 26 Feature descriptors and matching - I [ | ] Szeliski 4.1 -
Feb 28 Feature descriptors and matching - II [ | ] Szeliski 4.1 -
Mar 2 -- -- Szeliski 2.1 -
Mar 5 Feature descriptors and matching - III | Geometry of image formation - I [ | ] Szeliski 2.1 -
Mar 7 Geometry of image formation - II [ | ] Szeliski 2.1 -
Mar 9 Homogenous coordinates | Camera calibration - I [ | ] Szeliski 2.1, 6.1, 6.2 -
Mar 12 Prelim review - -
Mar 14 Camera calibration - II | Triangulation [ | ] - -
Mar 16 Homographies | RANSAC [ | ] Szeliski 6.1 -
Mar 19 RANSAC and Hough transforms [ | ] Szeliski 6.1 out
Mar 21 Prelim Discussion | Two-view stereo - I [ | ] Szeliski 7.2 -
Mar 23 Two-view stereo [ | ] Szeliski 7.2 -
Mar 26 Epipolar geometry [ | ] Szeliski 7.1-7.4 -
Mar 28 Epipolar geometry - II [ | ] Szeliski 7.1-7.4 -
Mar 30 Radiometry [ | ] Szeliski 2.2 due
Apr 2 - -
Apr 4 - -
Apr 6 - -
Apr 9 Photometric stereo - I [ | ] Szeliski 12.1.1 -
Apr 11 Photometric stereo - II [ | ] Szeliski 12.1.1
-
Apr 13 Photometric stereo - III | Intro to recognition [ | ] - -
Apr 16 Intro to machine learning - optimization | the ERM principle [ | ] - -
Apr 18 Machine learning and optimization | the ERM principle [ | ]
-
Apr 20 Regularization | Linear classifiers and HOG / SIFT Bag-of-words [ | ] - -
Apr 23 Non-linear classifiers [ | ] - -
Apr 25 Convolutional networks | Backpropagation [ | ]
-
Apr 27 Backpropagation [ | ] - -
Apr 30 Image classification | transfer learning [ | ] - -
May 2 Transfer learning | Object detection - I [ | ] - -
May 4 Object detection - II [ | ] - -
May 7 Object detection - III | Semantic segmentation [ | ] - -
May 9 Conclusion

-

SlidePlayer

  • My presentations

Auth with social network:

Download presentation

We think you have liked this presentation. If you wish to download it, please recommend it to your friends in any social system. Share buttons are a little bit lower. Thank you!

Presentation is loading. Please wait.

CSCE 643: Introduction to Computer Vision

Published by Darrell Carroll Modified over 9 years ago

Similar presentations

Presentation on theme: "CSCE 643: Introduction to Computer Vision"— Presentation transcript:

CSCE 643: Introduction to Computer Vision

Review of AI from Chapter 3. Journal May 13  What advantages and disadvantages do you see with using Expert Systems in real world applications like business,

computer vision presentation ppt

Introduction to Computer Vision Dr. Chang Shu COMP 4900C Winter 2008.

computer vision presentation ppt

Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

computer vision presentation ppt

CSCE 689: Computer Animation Jinxiang Chai. CSCE 689: Computer Animation Course Instructor - Dr. Jinxiang Chai - Richardson 912B - Meeting time: TR 2:20PM-3:35PM.

computer vision presentation ppt

CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.

computer vision presentation ppt

C280, Computer Vision Prof. Trevor Darrell

computer vision presentation ppt

Computer Vision (CSE P 576)

computer vision presentation ppt

Math. Meth. in Vision and Imaging cmput 613/498 Lecture 1: Introduction and course overview Martin Jagersand.

computer vision presentation ppt

CS/CMPE 535 – Machine Learning Outline. CS Machine Learning (Wi ) - Asim LUMS2 Description A course on the fundamentals of machine.

computer vision presentation ppt

Overview of Computer Vision CS491E/791E. What is Computer Vision? Deals with the development of the theoretical and algorithmic basis by which useful.

computer vision presentation ppt

A Brief Overview of Computer Vision Jinxiang Chai Many slides are borrowed from James Hays and Steve Seitz.

computer vision presentation ppt

CPSC : Data-driven Character Animation Jinxiang Chai.

computer vision presentation ppt

Algorithms and Applications in Computer Vision

computer vision presentation ppt

2007Theo Schouten1 Introduction. 2007Theo Schouten2 Human Eye Cones, Rods Reaction time: 0.1 sec (enough for transferring 100 nerve.

computer vision presentation ppt

Introduction to Computer Vision CS223B, Winter 2005.

computer vision presentation ppt

Computer Vision CS302 Data Structures Dr. George Bebis

computer vision presentation ppt

Computer Vision (CSE P576) Staff Prof: Steve Seitz TA: Jiun-Hung Chen Web Page

computer vision presentation ppt

Computer Vision. Computer vision is concerned with the theory and technology for building artificial Computer vision is concerned with the theory and.

computer vision presentation ppt

Computer Vision (CSE/EE 576) Staff Prof: Steve Seitz TA: Aseem Agarwala Web Page

computer vision presentation ppt

Instructors TAs Web Page CSE 455: Computer Vision Neel Joshi Ian Simon Ira Kemelmacher

About project

© 2024 SlidePlayer.com Inc. All rights reserved.

CS 143 Introduction to Computer Vision

Fall 2011, mwf 11:00 to 11:50, cit 368. instructor: james hays, tas: evan wallace (hta), sam birch, paul sastrasinh, libin "geoffrey" sun, and vazheh moussavi..

Computer Vision, art by kirkh.deviantart.com

Course Description

Prerequisites.

  • CS 195-F, Introduction to Machine Learning
, ,
, ,
, ,
, ,
or ...
,
  • 80% 5 programming projects
  • 20% 2 written quizzes

Important Links:

  • Mailing List
  • Collaboration Policy
  • Matlab Tutorial
  • All Available Assignments
  • All Project Results

Contact Info and Office Hours:

  • James: hays[at]cs.brown.edu
  • HTA and Professor: cs143headtas[at]cs.brown.edu
  • TAs and Professor: cs143tas[at]cs.brown.edu
  • James Hays (hays), Monday and Wednesday 1:00-2:00
  • Libin "Geoffrey" Sun (lbsun), Monday 7-9pm
  • Paul Sastrasinh (psastras), Tuesday 7-9pm
  • Sam Birch (sbirch), Wednesday 7-9pm
  • Evan Wallace (edwallac), Thursday 7-9pm
  • Vazheh Moussavi (vmoussav), Friday 5-7pm

Tentative Syllabus

W, Sept 7th Introduction to computer vision , Szeliski 1
F, Sep 9th Cameras and optics , Szeliski 2.1, especially 2.1.5
M, Sep 12th Light and color , Szeliski 2.2 and 2.3
W, Sep 14th Pixels and image filters , Szeliski 3.2
F, Sep 16th Thinking in frequency , Szeliski 3.4
M, Sep 19th Image pyramids and applications , Szeliski 3.5.2 and 8.1.1
W, Sep 21st Machine learning: overview ,
F, Sep 23rd Machine learning: clustering , Szeliski 5.3
M, Sep 26th Machine learning: classification , Project 1 due
W, Sep 28th Edge detection and line fitting w/ Hough transform , Szeliski 4.2 Project 2 out
F, Sep 30th Robust fitting (Hough Transform) , Szeliski 4.3
M, Oct 3rd Robust fitting (RANSAC and others) , Szeliski 4.3
W, Oct 5th Mixture of Gaussians and EM ,
F, Oct 7th Gestalt cues, MRFs, and graph cuts , Szeliski 5.5
M, Oct 10th Project 2 due
W, Oct 12th Recognition Overview and History , Szeliski 14 Project 3 out
F, Oct 14th Image features and bag of words models , Szeliski 4.1.2, 14.4.1, and 14.3.2
M, Oct 17th Interest points: corners , Szeliski 4.1.1
W, Oct 19th
F, Oct 21st Interest points and instance recognition , Szeliski 14.3
M, Oct 24th Large-scale instance recognition , Szeliski 14.3.2 Project 3 due
W, Oct 26th Detection with sliding windows , Szeliski 14.1
F, Oct 28th
M, Oct 31st Detection with sliding windows continued , Szeliski 14.2 Project 4 out
W, Nov 2nd Context and Spatial Layout , Szeliski 14.5
F, Nov 4th
M, Nov 7th Feature Tracking , Szeliski 4.1.4
W, Nov 9th Optical Flow see above Szeliski 8.4
F, Nov 11th Project 4 due
M, Nov 14th Epipolar Geometry , Szeliski 11
W, Nov 16th Stereo Correspondence , Project 5 out
F, Nov 18th Structure from Motion , Szeliski 7 Final Project out
M, Nov 21st Activity Recognition ,
W, Nov 23rd
F, Nov 25th
M, Nov 28th Internet Scale Vision ,
W, Nov 30th
F, Dec 2nd Crowdsourcing ,
M, Dec 5th Attributes and Course Summary ,
W, Dec 7th
F, Dec 9th
M, Dec 12th Final Project / Project 5 due
T, Dec 13th, 9:00 AM - final presentations

Acknowledgements

Previous versions of course, similar courses at other universities.

Comments, questions to James Hays .

Navigation Menu

Search code, repositories, users, issues, pull requests..., provide feedback.

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly.

To see all available qualifiers, see our documentation .

  • Notifications You must be signed in to change notification settings

Slides of "First Principles of Computer Vision" Lectures by Prof. Shree K. Nayar || Code for Automatic Slides Generation from Video Lectures.

surajiitd/fpcv_slides

Folders and files.

NameName
9 Commits

Repository files navigation

First principles of computer vision lecture slides, description.

  • Uploaded some sample slides of Youtube Lectures by Prof. Shree K. Nayar in this repo.
  • The code to extract the slides from videos is also present in this repo and it is mainly inspired from this repository
  • The code uses OpenCV's Background Subtraction algorithm to detect the change in the frame.
  • I have modified to work it with Prof. Shree K. Nayar's Youtube lectures .
  • This project converts a video presentation into a deck of pdf slides by capturing screenshots of unique frames. Note: Before uploading the slides' PDFs to this repository, I compressed them using some online tool.

Steps to run the code

python video2pdfslides.py <video_path>

Run for multiple .mp4 files in a directory

python video2pdfslides_for_multiple_vids.py <directory_path>

Future work

  • will use Tesseract for OCR in the slides.
  • Python 100.0%
,
Aug 29 (Tuesday) Introduction to Computer Vision [ ] [ ]
Aug 31 (Thursday) Light, Shading, and Color [ ] [ ] S2.2 (light)
S2.3.2 (color)
Sept 5 (Tuesday) Image Filters in Spatial Domain [ ] [ ] S3.2 (linear filtering)
S3.3 (non-linear filtering)
Sep 7 (Thursday) Image Filters in Frequency Domain [ ] [ ] S3.4 (fourier transforms)
S2.3.3 (compression)
Sep 12 (Tuesday) Templates and Image Pyramids [ ] [ ] S3.5.2 (image pyramids)
S8.1.1 (pyramid alignment)
Sep 14 (Thursday) Edge Detection [ ] [ ] S4.2
Sep 19 (Tuesday) Interest Points [ ] [ ] S4.1
Sep 21 (Thursday) Feature Tracking and Optical flow [ ] [ ] S4.1.4
S8.1, S8.4
Sep 26 (Tuesday) Fitting and Alignment [ ] [ ] S6.1 and S2.1
Sep 28 (Thursday) Object instance recognition [ ] [ ] S14.3.2
Oct 3 (Tuesday) Camera Models [ ] [ ] S2.1.5
Oct 5 (Thursday) Single-view Geometry and Calibration [ ] [ ]
Oct 10 (Tuesday) Image Stitching [ ] [ ] S9
Oct 12 (Thursday) Epipolar Geometry, Stereo [ ] [ ] S11
Oct 17 (Tuesday) Structure from Motion [ ] [ ] S7
Oct 19 (Thursday) Clustering and Image Segmentation [ ] [ ]
Oct 24 (Tuesday) EM Algorithm, Mixture of Gaussians [ ] [ ]
Oct 26 (Thursday) MRFs and Graph Cut [ ] [ ]
Oct 31 (Tuesday) Categorization and Classifiers [ ] [ ]
Nov 2 (Thursday) Convolutional Neural Networks [ ] [ ]
Nov 7 (Tuesday) Object Detection [ ] [ ] S5.3
Nov 9 (Thursday) Part and Pixel Labeling [ ] [ ] S14.1
S14.2
Nov 14 (Tuesday) Action Recognition [ ] [ ]
Nov 16 (Thursday) 3D Scenes and Context [ ] [ ]
Nov 21,23 No class (Happy Thanksgiving!)
Nov 28 (Tuesday) Class Summary and Important Open Problems [ ] [ ]
Nov 30 (Thursday) Final Project Presentation [ ] [ ]
Dec 5 (Tuesday) Vision and Language [ ] [ ]
Dec 7 (Thursday) No class (Jia-Bin at NIPS)
Dec 12,14 No class (Reading day)
Dec 18 (Monday)

Acknowledgements

The lecture slides build upon many preceding efforts by other instructors, including Derek Hoiem (UIUC), James Hays (Georgia Tech), Steve Seitz (UW), Kristen Grauman (UT Austin), and Devi Parikh (Georgia Tech). Feel free to use and modify any of the slides for academic and research purposes. Please do credit the original sources where appropriate

First Principles of Computer Vision

computer vision presentation ppt

This lecture series on computer vision is presented by Shree Nayar , T. C. Chang Professor of Computer Science at Columbia Engineering. It has been designed for students, practitioners and enthusiasts who have no prior knowledge of computer vision.

  • What is Computer Vision?
  • What is Vision Used For?
  • How Do Humans Do it?
  • Topics Covered
  • About the Lecture Series
  • References and Credits
  • Pinhole & Perspective Projection
  • Image Formation using Lenses
  • Depth of Field
  • Lens Related Issues
  • Wide Angle Cameras
  • Animal Eyes
  • A Brief History of Imaging
  • Types of Image Sensors
  • Resolution, Noise, Dynamic Range
  • Sensing Color
  • Camera Response & HDR Imaging
  • Nature’s Image Sensors
  • Geometric Properties
  • Segmenting Binary Images
  • Iterative Modification
  • Pixel Processing
  • LSIS and Convolution
  • Linear Image Filters
  • Non-Linear Image Filters
  • Template Matching
  • Fourier Transform
  • Convolution Theorem
  • Filtering in Frequency Domain
  • Deconvolution
  • Sampling Theory and Aliasing
  • What is an Edge?
  • Edge Detection Using Gradients
  • Edge Detection Using Laplacian
  • Canny Edge Detector
  • Corner Detection
  • Fitting Lines and Curves
  • Active Contours
  • Hough Transform
  • Generalized Hough Transform
  • What is an Interest Point?
  • Detecting Blobs
  • SIFT Detector
  • SIFT Descriptor
  • 2x2 Image Transformations
  • 3x3 Image Transformations
  • Computing Homography
  • Dealing with Outliers: RANSAC
  • Warping and Blending Images
  • Uses of Face Detection
  • Haar Features for Face Detection
  • Integral Image
  • Nearest Neighbor Classifier
  • Support Vector Machine
  • Radiometric Concepts
  • Scn. Radiance & Img. Irradiance
  • Reflectance Models
  • Reflection from Rough Surfaces
  • Dichromatic Model
  • Gradient Space & Reflectance Map
  • Photometric Stereo
  • Lambertian Case
  • Calibration Based Photo. Stereo
  • Shape from Normals
  • Interreflections
  • Human Perception of Shading
  • Stereographic Projection
  • Shape from Shading Algorithm
  • Shading Illusions
  • Point Spread Function
  • Depth from Focus
  • Depth from Defocus
  • Photometric Stereo Systems
  • Structured Light Range Finding
  • Phase Shifting Method
  • Structured Light Systems
  • Time of Flight Method
  • Linear Camera Model
  • Camera Calibration
  • Intrinsic and Extrinsic Matrices
  • Simple Stereo
  • Problem of Uncalibrated Stereo
  • Epipolar Geometry
  • Estimating Fundamental Matrix
  • Finding Correspondences
  • Computing Depth
  • Stereo Vision in Nature
  • Motion Field & Optical Flow
  • Optical Flow Constraint Equation
  • Lucas-Kanade Method
  • Coarse-to-Fine Flow Estimation
  • Application of Optical Flow
  • Structure from Motion Problem
  • Observation Matrix
  • Rank of Observation Matrix
  • Tomasi-Kanade Factorization
  • Change Detection
  • Gaussian Mixture Model
  • Object Tracking using Template Matching
  • Tracking by Feature Detection
  • Segmentation by humans
  • Segmentation as Clustering
  • k-Means Segmentation
  • Mean-Shift Segmentation
  • Graph Based Segmentation
  • Shape vs. Appearance
  • Learning Appearance
  • Principal Component Analysis
  • Finding Principal Components
  • PCA and SVD
  • Parametric Appearance Representation
  • Appearance Matching
  • Perceptron Network
  • Activation Function
  • Neural Network
  • Gradient Descent
  • Backpropagation Algorithm
  • Example Applications
  • When to Use Machine Learning?

computer vision

Computer Vision

Dec 20, 2019

1.39k likes | 1.79k Views

Computer Vision. Chapter 1 Introduction. The goal of computer vision is to make useful decisions about real physical objects and scenes based on sensed images. Applications areas. Industrial inspection Medical imaging Image database and query Satellite and surveillance imagery

Share Presentation

rayjones

Presentation Transcript

Computer Vision Chapter 1 Introduction

The goal of computer vision is to make useful decisions about real physical objects and scenes based on sensed images.

Applications areas • Industrial inspection • Medical imaging • Image database and query • Satellite and surveillance imagery • Entertainment • Handwriting and printed character recognition

Image dimensionality • 1D • audio (sound) • 2D • digital camera picture, chest x-ray, ultrasound • 3D • video sequence of 2D images • multispectral 2D images • volumetric medical imagery (CT, MRI) • 4D • PET-CT • MRI

Image types • Binary • Grayscale • Color • Multispectral

Operations on images • Neighborhood (local) operations • Enhancing the entire image • Combining multiple images • Ex. differences, noise reduction, blending • Feature extraction • Ex. area, centroid (center of mass), orientation, lines • invariants

Extracting features

Example features

General hardware discussion • General purpose vs. special purpose (DSP, GPU) • Uniprocessors vs. parallel processors (COWs, multiprocessors) • Sensors (discussed later)

General software discussion • Android SDK • Java-based • freely available from http://developer.android.com/sdk/index.html • Albie start app • doxygen for source code documentation • freely available from doxygen.org • code format • http://www.oracle.com/technetwork/java/codeconv-138413.html

General software discussion C# use Visual C# (Express Edition is freely available from Microsoft) CSImageViewer starter app main course web page has links doxygen for source code documentation code format

Introduction to doxygen

What’s in a program file? • Comments • Code

What’s a compiler? • A program • Input • Processing • Output

What’s a compiler? • A program • Input: • Text file (your program) • Processing: • Convert HLL statements into machine code (or similar) • Ignore comments • Output: • A binary file of machine code (or similar)

Traditional documentation • Code files are separate from design documents. • Wouldn’t it be great if we could bring code and documentation together into the same file(s)?

Tools like doxygen and javadoc • A program • Input: • Text file (your program) • Processing: • Convert (specially formatted) comments into documentation • Ignore HLL statements • Output: • Documentation (typically in HTML)

Getting started with doxygen • Download from doxygen.org. • Do this only once in directory (folder) containing your source code: (already done for you) doxygen –g • This creates a doxygen configuration file called Doxyfile which you may edit to change default options. • Edit Doxyfile and make sure all EXTRACTs are YES • Then whenever you change your code and wish to update the documentation: doxygen • which updates all documentation in html subdirectory • Demonstrate.

Usingdoxygen: document every (source code) file /** * \file ImageData.java * \brief contains ImageData class definition (note that this * class is abstract) * * <more verbose description here> * \author George J. Grevera, Ph.D. */ . . .

Using doxygen: document every class //---------------------------------------------------------------------- /** \brief CSImageViewer class. * * Longer description goes here. */ public class CSImageViewer : Form { . . .

Using doxygen: document every function //---------------------------------------------------------------- /** \brief Given a pixel's row and column location, this * function returns the gray pixel value. * \param row image row * \param col image column * \returns the gray pixel value at that position */ public int getGray ( int row, int col ) { int offset = row * mW + col; return mOriginalData[ offset ]; }

Using doxygen: document every function (parameters) //---------------------------------------------------------------- /** \brief Given a pixel's row and column location, this * function returns the gray pixel value. * \param row image row * \param col image column * \returns the gray pixel value at that position */ public int getGray ( int row, int col ) { int offset = row * mW + col; return mOriginalData[ offset ]; }

Using doxygen: document every function (return value) //---------------------------------------------------------------- /** \brief Given a pixel's row and column location, this * function returns the gray pixel value. * \param row image row * \param col image column * \returns the gray pixel value at that position */ public int getGray ( int row, int col ) { int offset = row * mW + col; return mOriginalData[ offset ]; }

Using doxygen: document all class members (and global and static variables in C/C++) protected bool mIsColor; ///< true if color (rgb); false if gray protected bool mImageModified; ///< true if image has been modified protected int mW; ///< image width protected int mH; ///< image height protected int mMin; ///< overall min image pixel value protected int mMax; ///< overall max image pixel value protected String mFname; ///< (optional) file name

doxygen(lengthier example including html) /** \brief Actual original (unmodified) unpacked (1 component per * array entry) image data. * * If the image data are gray, each entry in this array represents a * gray pixel value. So mImageData[0] is the first pixel's gray * value, mImageData[1] is the second pixel's gray value, and so * on. Each value may be 8 bits or 16 bits. 16 bits allows for * values in the range [0..65535]. * <br> <br> * If the image data are color, triples of entries (i.e., 3) represent * each color rgb value. So each value is in [0..255] for 24-bit * color where each component is 8 bits. So mImageData[0] is the * first pixel's red value, mImageData[1] is the first pixel's green * value, mImageData[2] is the first pixel's blue value, mImageData[3] * is the second pixel's red value, and so on. */ protected int[] mOriginalData;

Required documentation rules • Each file, class, method, and member variable must be documented w/ doxygen. • Exception is when we follow the one-class-per-file rule. In that case only the class or file needs to be documented. • The contents of the body of each method should contain comments, but none of these comments should be in the doxygen format. (Not every comment is a doxygen comment.)

Not every comment should be a doxygen comment. Required: • every file/class • every function/method • every class member (data) • (in C/C++, every static and/or global variable) Use regular, plain comments in the body of a function/method. (One exception is the \todo.)

int mColorImageData[][][]; ///< should be mColorImageData[mH][mW][3] //---------------------------------------------------------------------- /** \brief Given a buffered image, this ctor reads the image data, stores * the raw pixel data in an array, and creates a displayable version of * the image. Note that this ctor is protected. The user should only * use ImageData.load( fileName ) to instantiate an object of this type. * \param bi buffered image used to construct this class instance * \param w width of image * \param h height of image * \returns nothing (constructor) */ protected ColorImageData ( final BufferedImage bi, final int w, final int h ) { mW = w; mH = h; mOriginalImage = bi; mIsColor = true; //format TYPE_INT_ARGB will be saved to mDisplayData mDisplayData = mOriginalImage.getRGB(0, 0, mW, mH, null, 0, mW); mImageData = new int[ mW * mH * 3 ]; mMin = mMax = mDisplayData[0] & 0xff; for (int i=0,j=0; i<mDisplayData.length; i++) { mDisplayData[i] &= 0xffffff; //just to insure that we only have 24-bit rgb final int r = (mDisplayData[i] & 0xff0000) >> 16; final int g = (mDisplayData[i] & 0xff00) >> 8; final int b = mDisplayData[i] & 0xff; if (r<mMin) mMin = r; if (g<mMin) mMin = g; …

Summary of most useful tags \file \author \brief \param \returns \todo (not used in assignments) And many, many others.

Back to images and imaging… The good, the bad, and the ugly

The good, the bad, and the ugly. • Success is usually hard won! • Problems: • Matching models to reality • Lighting variation • Sensor noise • Occlusion & rotation/translation/scale • Limited resolution • An image is a discrete model of an underlying continuous function • Spatial discretization • Sensed values quantization • Levels Of Detail (LOD)

So let’s try to recognize chairs. Task that is trivial for us.

The good, the bad, and the ugly. • Problem: Matching models to reality

The good, the bad, and the ugly. • Problem: Matching models to reality • Maybe CAD/CAM models can help! • http://www.3dcadbrowser.com/browse.aspx?category=59

The good, the bad, and the ugly. • Problems: Lighting variation

The good, the bad, and the ugly. • Problem: Sensor noise

The good, the bad, and the ugly. • Problem: Occlusion

The good, the bad, and the ugly. • Problem: Rotation, reflection, translation, & scale

The good, the bad, and the ugly. • Problem: • Limited resolution • An image is a discrete model of an underlying continuous function • Spatial discretization (above and below) • Sensed values quantization (next slide) • Too much of a good thing can be a problem too!

  • More by User

Computer Vision

Computer Vision. Spring 2006 15-385,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm. Boundary Detection: Hough Transform Lecture #9 Reading: Computer Vision (Ballard and Brown): Chapter 4

814 views • 43 slides

Computer Vision

Computer Vision. Spring 2006 15-385,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm. Aliasing - Really bad in video. Text Aliasing. Edge Detection Lecture #7. Edge Detection. Convert a 2D image into a set of curves Extracts salient features of the scene

798 views • 39 slides

Computer Vision

Computer Vision. Spring 2006 15-385,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm Lecture #17. Structured Light + Range Imaging Lecture #17. (Thanks to Slides from Levoy, Rusinkiewicz, Bouguet, Perona). 3D Scanning. Structured Light Reconstruction.

680 views • 34 slides

Computer Vision

Computer Vision . Geometric Camera Models and Camera Calibration. Coordinate Systems. Let O be the origin of a 3D coordinate system spanned by the unit vectors i, j, and k orthogonal to each other. i. P. O. k. j. Coordinate vector. Homogeneous Coordinates. n. H. P. O.

633 views • 29 slides

Computer Vision

458 views • 17 slides

Computer Vision

Computer Vision. Spring 2006 15-385,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm Lecture #13. Announcements. Homework 4 will be out today. Due 4/4/06. Please start early. Midterm stats: A range  40+, B range  30+ 40+  13 students 30+  9 students

481 views • 26 slides

Computer Vision

Computer Vision. Spring 2006 15-385,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm Lecture #18. Polyhedral Objects and Line Drawing Lecture #18. Line Drawings. We often communicate using Line Drawings. Engineering Drawings. Topics.

405 views • 26 slides

Computer Vision

Computer Vision. Spring 2010 15-385,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #18. Principal Components Analysis Lecture #18. Example: 53 Blood and urine measurements (wet chemistry) from 65 people (33 alcoholics, 32 non-alcoholics). Matrix Format.

643 views • 52 slides

Vision Computer

Vision Computer

DS,JX. Vision Computer. DS, JX. Cameras. DS, JX. Everything Else. NF, VG. Router. System Block Diagram for Vehicle Lab (7/8/04). NF, VG. Control Computers. Arbiter. Other Team. Help!. Needs Work. Working. System Block Diagram -Rooftop Vision System- 7/8/04. RFID Reader

271 views • 2 slides

Computer Vision

Computer Vision. Filename: eie426-computer-vision-0809.ppt. Contents. Perception generally Image formation Color vision Edge detection Image segmentation Visual attention 2D  3D Object recognition. Perception generally. Stimulus (percept) S, World W S = g(W)

700 views • 42 slides

Computer Vision

Computer Vision. Ronald Frazier CIS 479 April 20, 1999. Introduction. Important area of study Ease of use for new users Managing large quantities of images. Combines a Variety of Disciplines. Standard Programming Techniques Artificial Intelligence Techniques Neural Networks Fuzzy Logic

331 views • 19 slides

Computer Vision

Computer Vision. Spring 2010 15-385,-685 Instructor: S. Narasimhan PH A18B T-R 10:30am – 11:50am Lecture #13. Mechanisms of Reflection. source. incident direction. surface reflection. body reflection. surface. Surface Reflection: Specular Reflection Glossy Appearance

586 views • 40 slides

Computer Vision

Computer Vision. CMSC 25000 Artificial Intelligence March 11, 2008. Roadmap. Motivation Computer vision applications Is a Picture worth a thousand words? Low level features Feature extraction: intensity, color High level features Top-down constraint: shape from stereo, motion,..

712 views • 43 slides

Computer Vision

Lecture 10 Roger S. Gaborski. Computer Vision. Algorithm Development: Edges. We would like to develop algorithms that detect important edges in images The ‘quality’ of edges will depend on the image characteristics Noise can result in false edges. Profiles of an Ideal Edge.

576 views • 48 slides

Computer Vision

Computer Vision. Stereo Vision. Pinhole Camera. Perspective Projection. Stereo Vision. Two cameras. Known camera positions. Recover depth. scene point. p. p’. image plane. optical center. Correspondences. p. p’. Matrix form of cross product. a =a x i +a y j +a z k.

447 views • 23 slides

Computer Vision

Computer Vision. Spring 2010 15-385,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #19. Principal Components Analysis on Images Lecture #19. The Space of Faces. An image is a point in a high dimensional space An N x M image is a point in R NM

416 views • 39 slides

Computer Vision

Computer Vision. The 2D projective plane and it’s applications HZ Ch 2. In particular: Ch 2.1-4, 2.7, Szelisky: Ch 2.1.1, 2.1.2 Estimation:HZ: Ch 4.1-4.2.5, 4.4.4-4.8 cursorly. Richard Hartley and Andrew Zisserman, Multiple View Geometry , Cambridge University Publishers, 2 nd ed. 2004.

692 views • 66 slides

Computer Vision

Computer Vision. http://www.cvl.iis.u-tokyo.ac.jp/ class2018/2018w/class2018w.html. Contents. Papers on Patch-based Object Recognition Using Images This week and next week This week Basic techniques on recent object recognition Comparison with 20Q. What is “ Object Recognition ” ?.

795 views • 69 slides

Got any suggestions?

We want to hear from you! Send us a message and help improve Slidesgo

Top searches

Trending searches

computer vision presentation ppt

49 templates

computer vision presentation ppt

44 templates

computer vision presentation ppt

61 templates

computer vision presentation ppt

american history

85 templates

computer vision presentation ppt

el salvador

34 templates

computer vision presentation ppt

63 templates

Computer Vision Research Poster

It seems that you like this template, computer vision research poster presentation, free google slides theme, powerpoint template, and canva presentation template.

Soon, computers will rule over the whole world. Until that day comes, we're free to design new templates, like this one for research posters (one of our newest structures). We've decided that the theme will be "computer vision", you know, that field in computer science that refers to how machines understand images from the real world. Customize this techie design and then print it if necessary. Our computer overlords will allow it... for now!

Features of this template

  • 100% editable and easy to modify
  • 8 different slides to impress your audience
  • Contains easy-to-edit graphics such as graphs, maps, tables, timelines and mockups
  • Includes 500+ icons and Flaticon’s extension for customizing your slides
  • Designed to be used in Google Slides, Canva, and Microsoft PowerPoint
  • 4:3 standard format for screens
  • Includes information about fonts, colors, and credits of the resources used

How can I use the template?

Am I free to use the templates?

How to attribute?

Attribution required If you are a free user, you must attribute Slidesgo by keeping the slide where the credits appear. How to attribute?

computer vision presentation ppt

Register for free and start downloading now

Related posts on our blog.

How to Add, Duplicate, Move, Delete or Hide Slides in Google Slides | Quick Tips & Tutorial for your presentations

How to Add, Duplicate, Move, Delete or Hide Slides in Google Slides

How to Change Layouts in PowerPoint | Quick Tips & Tutorial for your presentations

How to Change Layouts in PowerPoint

How to Change the Slide Size in Google Slides | Quick Tips & Tutorial for your presentations

How to Change the Slide Size in Google Slides

Related presentations.

Simple Gradients Research Poster presentation template

Premium template

Unlock this template and gain unlimited access

Elegant Research Poster presentation template

CS 231A Course Project

Course Project Overview

Important Dates

Oct 21 (11:59pm) , Finalizing team members : Maximum team size: 2 . Send us an email with your team name and team members.

Oct 21 (11:59pm) , Proposal submission : Submit a 0.5 page course project proposal in our provided template. Send a PDF file to [email protected]

Nov 18 (11:59pm) , Project milestone : Submit a 2-3 page course project milestone report.

Dec 13 (11:59pm) , Final report and code submission : No late days allowed.

Dec 14 (14:00pm) , Project presentation submission : No late days allowed.

Dec 15 (10am - 12pm) , Course project presentation. Location: Room 200-305 (Room 305 of Building 200)

Grading Policy

presentation: 5% write-up: 10%  •  clarity, structure, language, references: 3%  •  background literature survey, good understanding of the problem: 3%  •  good insights and discussions of methodology, analysis, results, etc.: 4% technical: 15%  •  correctness: 5%  •  depth: 5%  •  innovation: 5% evaluation and results: 10%  •  sound evaluation metric: 3%  •  thoroughness in analysis and experimentation: 3%  •  results and performance: 4%

Project Submission Details

You must use our provided templates . Email your project proposal, milestone report, final report and zipped code to: [email protected] , with the following format: Subject Line: Course Project Proposal/Milestone/Report Body: Full names of all group members, SUNet ID's and Project title Attachments: Write-up as LastName_LastName_Paper.pdf, Code as LastName_LastName_Code.zip, where the titles have all the last names of the group members.

You must use our provided presentation template . Email your Powerpoint slides (.ppt file) to [email protected] , with the following format: Subject Line: Course Project Presentation Body: Full names of all group members, SUNet ID's and Project title Attachments: Powerpoint slides as LastName_LastName_Presentation.ppt, where the titles have all the last names of the group members.

Final Report Write-up Guidelines

Your final write-up should be between 8 - 10 pages using the template provided. After the class, we will post all the final reports online (restricted to CS231a students only) so that you can read about each others’ work. If you do not want your writeup to be posted online, then please let us know at least a week in advance of the final writeup submission deadline. The following is a suggested structure for your report:

• Title, Author(s)

• Abstract: It should not be more than 300 words;

• Introduction: this section introduces your problem, and the overall plan for approaching your problem

• Background/Related Work: This section discusses relevant literature for your project

• Approach: This section details the framework of your project. Be specific, which means you might want to include equations, figures, plots, etc

• Experiment: This section begins with what kind of experiments you're doing, what kind of dataset(s) you're using, and what is the way you measure or evaluate your results. It then shows in details the results of your experiments. By details, I mean both quantitative evaluations (show numbers, figures, tables, etc) as well as qualitative results (show images, example results, etc).

• Conclusion: What have you learned? Suggest future ideas.

• References: This is absolutely necessary. Reports without references will not receive a score higher than 20 points (total is 40 points).

• Supplementary materials: This is NOT counted toward your 8-10 page limit. Please submit your codes as supplementary materials.

Project Presentation Guidelines

Each team should give a two minutes project presentation. After your presentation, there will be one minute for audiences to ask questions. You must use the template provided by us ( Download it here ) and make sure that your presentation contains exactly two slides. We will compile the presentation slides from all teams into a single big .ppt file and show it using our laptop, so you do not need to worry about bringing computers on the presentation day.

Project Proposal

We have provided the template for your final write-up. Your proposal should follow the same template , and should be no more than 1 page. Your proposal should describe as clearly as possible the following:

• What is the computer vision problem that you will be investigating? Why is it interesting?

• What image or video data will you use? If you are collecting new datasets, how do you plan to collect them?

• What method or algorithm are you proposing? If there are existing implementations, will you use them and how? How do you plan to improve or modify such implementations?

• Which reading will you examine to provide context and background?

• How will you evaluate your results? Qualitatively, what kind of results do you expect (e.g. plots or figures)? Quantitatively, what kind of analysis will you use to evaluate and/or compare your results (e.g. what performance metrics or statistical tests)?

Project Milestone

Your project milestone report should be between 2 - 3 pages using the template provided. The following is a suggested structure for your report:

• Problem statement: Describe your problem precisely specifying the dataset to be used, expected results and evaluation

• Technical Approach: Describe the methods you intend to apply to solve the given problem

• Intermediate/Preliminary Results: State and evaluate your results upto the milestone

You may consult any papers, books, online references, or publicly available implementations (such as SIFT) for ideas and code that you may want to incorporate into your strategy or algorithm, so long as you clearly cite your sources in your code and your writeup. However, under no circumstances may you look at another group’s code or incorporate their code into your project.

Project Reports of Previous Years

Winter, 2010-2011

3D Model Segmentation and Labeling

Generic Object/Scene recognition for the Smart Album Project

Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning

Learning Slow Features for Object Recognition

RoboGrader: Scoring Multiple Choice Tests with a Smartphone

Joint Subclassing and Classification

Heuristics for Decision Tree Selection and Weight Assignment in Random Forest for Fine-Grained Image Classification

Hole Filling Method Using Edge Based Interpolated Depth for View Synthesis

RGB-Z Segmentation of Objects in a Cluttered Scene Using a Kinect Sensor

Simultaneous Segmentation and Tracking in Point Cloud Data using the Iterative Closest Point Algorithm

KFace3D: Facial Recognition using RGBD Data

Comparison of Aircraft Tracking Using Top-Down and Bottom-Up Approaches

Geometric Understanding of Indoor Scenes

Object Pose Estimation using Optical Flow and POSIT

Real Time Subcutaneous Vein Recognition of Forearm Veins

Face Detection and Tracking for BabyCam

Image Retrieval, Semantic & Geographic Annotation using visual/multimedia representations and textual information

Smart Album: Face Recognition and Landmark Recognition in Album

Computer-assisted Detection of Defects during the Fabrication of PDMS chips

Unsupervised Learning of Invariances with Temporal Coherence

Image-based Web Page Classification

Winter, 2009-2010

Using a Functionality Model for Chair Detection

Fusing Multi-Channel Cues for Image Organization

Generalizing ImageNet to SmartPhones

Motion-sensitive Low-noise Imaging

Unsupervised Image Segmentation using Deep Belief Nets

The Retinal Algorithm to Detect, Segment and Track Moving Objects with Observer Motion

Unsupervised Feature Learning of Bi-modal Features

Efficient Classification and Segmentation of Specular Objects

Feature Descriptors for Tiny Image Categorization

A feature tracking approach to painted aperture

Baseline Scene Classifications

Camera Tracking with Fixed Point Math for Mobile Devices

Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities

Sub-meter Indoor Localization in Unmodified Environments with Inexpensive Sensors

Segmentation of seismic images

Object Detecting in Images using Time Series Ensemble Methods

Learning Visual Invariance in a 2-Layer Neural Network

The "Find Mii" Challenge

Find Mii on Wii

"Find Mii" is a game on Nintendo Wii Play. It basically involves identifying certian avatars (Miis) from a bunch of them, standing still or moving around in various styles. If you are not yet proficient in this game, let Elvis show you how to play it in the videos below.

As we know computers are designed to work in certain areas of human endeavor that are not terribly challenging to human intelligence but sometimes beyond human patience. In this course project you will be programming the computer to play the game as good as Elvis did.

You don't have to worry about any "interface" issue (as some of you asked in class). Actually you will only be dealing with images instead of programming the Wii game controller.

Project Mission

In the course project, you will be focusing on 4 tasks in "Find Mii" as described below.

Task 1 : Find this Mii! ... You will be given a reference picture of one Mii. Identify that one in a crowd.

Task 2 : Find 2 look-alikes! ... Pay attention to the faces (and hair styles?) as they might be wearing different sweaters.

Task 3 : Find n odd Miis out! ... Some Miis are odd in their styles of shaking heads or footsteps. Find them out.

Task 4 : Find the fastest Mii! ... Someone is running (or swimming) fast. Catch that one.

For each task you have 3 levels: easy, medium, and hard.

For each task & level we will give you a video file from gamplay (that would be 12 in total), and you need to identify a given number of Miis.

Minimum requirement: 3 different tasks, 1 level for each.

You will be handling different tasks & levels separately (of course you can have functionalities shared among them, just follow the comments in the infrastructure code). You don't have to do all the 12 task & levels, the minimum requirement is that you complete at least three different tasks, one level for each. However you are strongly encouraged to do all 4 tasks on higher levels to earn more points for the final challenge!

IMAGES

  1. Applications of Computer Vision PowerPoint Template

    computer vision presentation ppt

  2. Computer Vision Ppt Show Example Introduction

    computer vision presentation ppt

  3. Applications of Computer Vision PowerPoint and Google Slides Template

    computer vision presentation ppt

  4. Applications of Computer Vision PowerPoint Template

    computer vision presentation ppt

  5. Applications of Computer Vision PowerPoint Template

    computer vision presentation ppt

  6. PPT

    computer vision presentation ppt

COMMENTS

  1. PDF Lecture 1: "Computer Vision"

    Our job is to interpret the cues! Depth cues: Linear perspective. Depth cues: Aerial perspective. Depth ordering cues: Occlusion. Shape cues: Texture gradient. Shape and lighting cues: Shading. Position and lighting cues: Cast shadows. Grouping cues: Similarity (color, texture, proximity) Grouping cues: "Common fate".

  2. PPT

    Attention in Computer Vision. Attention in Computer Vision. Mica Arie-Nachimson and Michal Kiwkowitz May 22, 2005 Advanced Topics in Computer Vision Weizmann Institute of Science. Problem definition - Search Order. Vision applications apply "expensive" algorithms (e.g. recognition) to image patches. 1.48k views • 110 slides

  3. Powerpoint slide sets for "Computer Vision: A Modern Approach"

    Powerpoint slide sets for "Computer Vision: A Modern Approach". Back to main book page. An Introduction to Computer Vision. Cameras. Radiometry. Sources, Shadows and Shading. Color. Linear Filters and Edge Detection. Pyramids and Texture.

  4. CS4670/5670

    Overview: This course will serve as a detailed introduction to computer vision. The emphasis will be on covering the fundamentals which underly both computer vision research and applications. A tentative list of topics is below: Geometry / Physics of image formation. Properties of images and basic image processing. 3D reconstruction.

  5. CSCE 643: Introduction to Computer Vision

    Computer vision is the science and technology of machines that see. Concerned with the theory for building artificial systems that obtain information from images. The image data can take many forms, such as a video sequence, depth images, views from multiple cameras, or multi-dimensional data from a medical scanner 3.

  6. PDF Lecture 1

    Formalize computer vision applications into tasks - Formalize inputs and outputs for vision-related problems - Understand what data and computational requirements you need to train a model Develop and train vision models - Learn to code, debug, and train convolutional neural networks. - Learn how to use software frameworks like PyTorch and ...

  7. CS 143 Introduction to Computer Vision

    This offering of CS 143 will emphasize the core vision task of recognition in particular. We will train and evaluate classifiers to recognize various visual phenomena. The course will consist of five programming projects, two written quizzes, and a self-chosen final project. Students can earn graduate credit for the course but will need to meet ...

  8. First Principles of Computer Vision Lecture Slides

    Uploaded some sample slides of Youtube Lectures by Prof. Shree K. Nayar in this repo.; The code to extract the slides from videos is also present in this repo and it is mainly inspired from this repository; The code uses OpenCV's Background Subtraction algorithm to detect the change in the frame.; I have modified to work it with Prof. Shree K. Nayar's Youtube lectures.

  9. PPTX PowerPoint Presentation

    Using Computer Vision: Facial Expressions. Here are pictures of people and their expressions. As you can see, below the faces, the camera can sense where the main features change in the face. Camera Mouse. The Camera Mouse can detect your head's motions and move along on the computer screen. "Instead of using a mouse, a webcam or built-in ...

  10. ECE 5554 / ECE 4554: Computer Vision Fall 2017

    The lecture slides build upon many preceding efforts by other instructors, including Derek Hoiem (UIUC), James Hays (Georgia Tech), Steve Seitz (UW), Kristen Grauman (UT Austin), and Devi Parikh (Georgia Tech). Feel free to use and modify any of the slides for academic and research purposes. Please do credit the original sources where appropriate.

  11. PPT PowerPoint Presentation

    PowerPoint Presentation. CS201 Lecture 02. Computer Vision: Image Formation and Basic Techniques. John Magee. * Computer Vision Recall: Computer graphics in general Description of scene Visual representation (Image) Computer Vision in general: Image(s) Some description of the scene How are Computer Graphics and Computer Vision Related? Example ...

  12. Computer Vision

    Computer Vision.ppt - Free download as Powerpoint Presentation (.ppt), PDF File (.pdf), Text File (.txt) or view presentation slides online. Computer vision is the study of how to extract information from images and videos to understand and interact with the visual world. It has many useful applications such as 3D reconstruction, object recognition, and automated surveillance.

  13. Columbia University

    This lecture series on computer vision is presented by Shree Nayar, T. C. Chang Professor of Computer Science at Columbia Engineering.It has been designed for students, practitioners and enthusiasts who have no prior knowledge of computer vision.

  14. PPT

    Presentation Transcript. Computer Vision Chapter 1 Introduction. The goal of computer vision is to make useful decisions about real physical objects and scenes based on sensed images. Applications areas • Industrial inspection • Medical imaging • Image database and query • Satellite and surveillance imagery • Entertainment ...

  15. Computer Vision Research Poster

    Explain in a visual poster what computer vision is about. You can use this editable design for Google Slides & PPT if you need help! ... Computer Vision Research Poster Presentation . Multi-purpose . Free Google Slides theme, PowerPoint template, and Canva presentation template . Soon, computers will rule over the whole world. Until that day ...

  16. Stanford University CS 231A: Introduction to Computer Vision

    Email your Powerpoint slides (.ppt file) to [email protected], with the following format: Subject Line: Course Project Presentation Body: Full names of all group members, SUNet ID's and Project title Attachments: Powerpoint slides as LastName_LastName_Presentation.ppt, where the titles have all the last names of the group members.