My Account Log in

1 option

OpenCV 3 computer vision with Python cookbook : leverage the power of OpenCV 3 and Python to build computer vision applications / Aleksei Spizhevoi, Aleksandr Rybnikov.

Ebook Central Academic Complete Available online

View online
Format:
Book
Author/Creator:
Spizhevoi, Aleksei, author.
Rybnikov, Aleksandr, author.
Language:
English
Subjects (All):
Python (Computer program language).
Physical Description:
1 online resource (296 pages) : illustrations
Edition:
1st ed.
Place of Publication:
Birmingham ; Mumbai : Packt, 2018.
Biography/History:
Spizhevoi Aleksei: Alexey Spizhevoy has over 7 years of experience in computer vision R&D. He has worked for 5 years at Itseez, the main OpenCV contributor, before it was acquired by Intel. He has contributed to video stabilization and photo stitching modules into OpenCV library. He has successfully participated in numerous Computer Vision projects in such areas as 3D reconstruction, video conferencing, object detection and tracking, semantic segmentation, driving assistance, and others. He holds a master's degree in computer science, and he is currently pursuing PhD. Rybnikov Aleksandr: Aleksandr Rybnikov has over 5 years of experience in C++ programming, including 3 years in the Computer Vision (CV) domain. He worked at Itseez, a company that supported and developed OpenCV, and then at Intel. He enriched OpenCV's dnn module by adding support of another two Deep Learning (DL) frameworks and many features, along with improving the existing functionality. As an engineer, he participated in CV and DL projects such as iris recognition, object detection, semantic segmentation, 6-DOF pose estimation, and digital hologram reconstruction. He has a master's degree in physics.
Summary:
OpenCV 3 is a native cross-platform library for computer vision, machine learning, and image processing. OpenCV's convenient high-level APIs hide very powerful internals designed for computational efficiency that can take advantage of multicore and GPU processing. This book will help you tackle increasingly challenging computer vision problems.
Contents:
Cover
Copyright and Credits
Packt Upsell
Contributors
Table of Contents
Preface
Chapter 1: I/O and GUI
Introduction
Reading images from files
Getting ready
How to do it...
How it works...
Simple image transformations-resizing and flipping
Saving images using lossy and lossless compression
Showing images in an OpenCV window
Working with UI elements, such as buttons and trackbars, in an OpenCV window
Drawing 2D primitives-markers, lines, ellipses, rectangles, and text
Handling user input from a keyboard
Making your app interactive through handling user input from a mouse
Capturing and showing frames from a camera
Playing frame stream from video
Obtaining a frame stream properties
Writing a frame stream into video
Jumping between frames in video files
Chapter 2: Matrices, Colors, and Filters
Manipulating matrices-creating, filling, accessing elements, and ROIs
Converting between different data types and scaling values
Non-image data persistence using NumPy
How to do it.
How it works...
Manipulating image channels
Converting images from one color space to another
Gamma correction and per-element math
Mean/variance image normalization
Computing image histograms
Equalizing image histograms
Removing noise using Gaussian, median, and bilateral filters
Computing gradients using Sobel operator
Creating and applying your own filter
Processing images with real-valued Gabor filters
Going from the spatial domain to the frequency domain (and back) using the discrete Fourier transform
Manipulating image frequencies for image filtration
Processing images with different thresholds
Morphological operators
Image masks and binary operations
Chapter 3: Contours and Segmentation
Binarization of grayscale images using the Otsu algorithm
Finding external and internal contours in a binary image
Extracting connected components from a binary image.
Getting ready
Fitting lines and circles into two-dimensional point sets
Calculating image moments
Working with curves - approximation, length, and area
Checking whether a point is within a contour
Computing distance maps
Image segmentation using the k-means algorithm
Image segmentation using segment seeds - the watershed algorithm
Chapter 4: Object Detection and Machine Learning
Obtaining an object mask using the GrabCut algorithm
Finding edges using the Canny algorithm
Detecting lines and circles using the Hough transform
Finding objects via template matching
The medial flow tracker
Tracking objects using different algorithms via the tracking API
Computing the dense optical flow between two frames
Detecting chessboard and circle grid patterns
A simple pedestrian detector using the SVM model
Optical character recognition using different machine learning models
Getting ready.
How to do it...
Detecting faces using Haar/LBP cascades
Detecting AruCo patterns for AR applications
Detecting text in natural scenes
QR code detector
Chapter 5: Deep Learning
Representing images as tensors/blobs
Loading deep learning models from Caffe, Torch, and TensorFlow formats
Getting input and output tensors' shapes for all layers
Preprocessing images and inference in convolutional networks
Measuring inference time and contributions to it from each layer
Classifying images with GoogleNet/Inception and ResNet models
Detecting objects with the Single Shot Detection (SSD) model
Segmenting a scene using the Fully Convolutional Network (FCN) model
Face detection using Single Shot Detection (SSD) and the ResNet model
Age and gender prediction
Chapter 6: Linear Algebra
The orthogonal Procrustes problem
Rank-constrained matrix approximation
Principal component analysis.
Solving systems of linear equations (including under- and over-determined)
Solving polynomial equations
Linear programming with the simplex method
Chapter 7: Detectors and Descriptors
Finding corners in an image - Harris and FAST
Selecting good corners in an image for tracking
Drawing keypoints, descriptors, and matches
Detecting scale invariant keypoints
Computing descriptors for image keypoints - SURF, BRIEF, ORB
Matching techniques for finding correspondences between descriptors
Finding reliable matches - cross-check and ratio test
Model-based filtering of matches - RANSAC
BoW model for constructing global image descriptors
Chapter 8: Image and Video Processing
Warping an image using affine and perspective transformations
How to do it
How it works
Remapping an image using arbitrary transformation
Tracking keypoints between frames using the Lucas-Kanade algorithm
Background subtraction
How it works.
Stitching many images into panorama.
Notes:
Description based on print version record.
ISBN:
9781788478755
1788478754
OCLC:
1030817073

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account