Computer Vision - Jacob Hume

# Topics - Introduction to computer vision - Cameras and image formation - Simple edge detection - Linear filtering and convolution - Gradient filters and blurring - Image pyramids: Gaussian and Laplacian - Nonlinear filtering (bilateral filtering) - Frequency domain and Fourier transforms - Image bases and Fourier analysis - Machine-learning fundamentals for vision - Nearest-neighbor and linear regression - Overfitting and generalization - Linear classifiers and logistic regression - Stochastic gradient descent - Neural networks: nonlinearities, architectures, regularization - Optimization and backpropagation - Computation graphs, momentum, autodiff - Convolutional neural networks - Convolution layers, pooling, normalization - Scene understanding and semantic segmentation - Object detection - Sliding-window and region-based CNNs - Instance segmentation (Mask R-CNN) - Image synthesis - Texture synthesis - Generative adversarial networks (GANs, conditional GANs) - VQ-VAEs and diffusion models - Temporal models - 3-D convolutions, recurrent nets, LSTMs - Representation learning - Transfer learning, autoencoders, self-supervised learning - Language and vision - Vision–language models (e.g., CLIP) - Sound and touch - Multimodal self-supervision - Image formation revisited: camera models, projection, plenoptic function - Multi-view geometry - Fitting geometric models (e.g., RANSAC) - Structure from motion and stereo - Motion estimation and optical flow - Aperture problem, keypoint tracking - Light, shading, and color - Shape-from-shading, intrinsic images, color perception - Embodied vision - Learning from demonstration, reinforcement learning for vision - Recent network architectures - Implicit representations, Vision Transformers, NeRFs - Ethics, bias, and disinformation in computer vision - Dataset bias, algorithmic fairness, image forensics