Offered at Georgia Tech as CS 6476
Become a Computer Vision Expert
Accelerate your career with the credential that fast-tracks you to job success.
This course provides an introduction to computer vision including fundamentals of image formation, camera imaging geometry, feature detection and matching, multiview geometry including stereo, motion estimation and tracking, and classification. We’ll develop basic methods for applications that include finding known models in images, depth recovery from stereo, camera calibration, image stabilization, automated alignment (e.g. panoramas), tracking, and action recognition. We focus less on the machine learning aspect of CV as that is really classification theory best learned in an ML course.
The focus of the course is to develop the intuitions and mathematics of the methods in lecture, and then to learn about the difference between theory and practice in the problem sets. All algorithms work perfectly in the slides. But remember what Yogi Berra said: In theory there is no difference between theory and practice. In practice there is. (Einstein said something similar but who knows more about real life?) In this course you do not, for the most part, apply high-level library functions but use low to mid level algorithms to analyze images and extract structural information.
Rich Learning Content
Interactive Quizzes
Taught by Industry Pros
Self-Paced Learning
Student Support Community
This course is your first step towards a new career with the Computer Vision Program.
Enhance your skill set and boost your hirability through innovative, independent learning.
Instructor
Instructor
Instructor
See the Technology Requirements for using Udacity.
Rich Learning Content
Interactive Quizzes
Self Paced Learning
Taught by Industry Professionals
1-1 Coaching and Mentorship
Rich Learning Content
Interactive Quizzes
Self Paced Learning
Taught by Industry Professionals
Images have become ubiquitous in computing. Sometimes we forget that images often capture the light reflected from a physical scene. This course gives you both insight into the fundamentals of image formation and analysis, as well as the ability to extract information much above the pixel level. These skills are useful for anyone interested in operating on images in a context-aware manner or where images from multiple scenarios need to be combined or organized in an appropriate way.