GradCAM Walkthrough

Date: October 31, 2024
Lecture Duration: 2.5 hours
Topic Overview: This lecture dives into the interpretability of deep learning models, specifically Convolutional Neural Networks (CNNs). We explore Grad-CAM (Gradient-weighted Class Activation Mapping), a powerful technique used to visualize and understand which regions of an image a model relies on to make its predictions.


1. Understanding Model Interpretability

We started by discussing the “black box” nature of deep learning and why interpretability matters in computer vision tasks.

2. Implementing Grad-CAM with PyTorch

The core of the lecture was a hands-on walkthrough implementing Grad-CAM from scratch using a pre-trained VGG19 model.


Lecture Slides

Core Reading


Interactive Demonstration

Below is the complete Jupyter Notebook used in class. It contains the step-by-step PyTorch implementation of Grad-CAM applied to various sample images.


← Back to Computer Vision