Lecture 4
Machine Learning for Images. Part 2
A single instance of an object can be oriented in many ways to the camera.
Many objects of interest are not rigid bodies and can be deformed in extreme ways.
The objects of interest can be occluded. Sometimes only a tiny portion of an object (as few pixels) could be visible.
The effects of illumination can be drastic on the pixel level.
The objects of interest may blend into their environment, making them hard to identify.
A relevant piece of information about the content of an image
-e.g., edges, corners, blobs (regions), ridgesExperiment to create features that make machine learning algorithms work better
But not rotation and scaling invariance!
The generator’s architecture looks like an inverted CNN that starts with a narrow input and is upsampled a few times until it reaches the desired size
The discriminator
’s model is a typical classification neural network that aims to classify images generated by the generator as real or fake
Generated from Synthesia.io
Lecture 4
Machine Learning for Images. Part 2
CMU Computer Vision course - Matthew O’Toole.
Grokking Machine Learning. Luis G. Serrano. Manning, 2021
[CIS 419/519 Applied Machine Learning]. Eric Eaton, Dinesh Jayaraman.
Deep Learning Patterns and Practices - Andrew Ferlitsch, Maanning, 2021
Machine Learning Design Patterns - Lakshmanan, Robinson, Munn, 2020
Deep Learning for Vision Systems. Mohamed Elgendy. Manning, 2020