# Deep Learning for Computer Vision ## from image classification to actual computer vision **ISAE-SUPAERO, SDD, 18 Nov. 2025** Florient CHOUTEAU Slides : https://fchouteau.github.io/isae-intro-to-cnns Notebooks : https://github.com/SupaeroDataScience/deep-learning/tree/main/vision # Use cases of CNNs We've done image classification that we applied in a sliding window fashion on larger images  We can solve other types of tasks with ConvNets  Keypoint Detection  3D (VGGT)
Image/Video Restoration-Upsampling  Image/Video Restoration-Upsampling  And of course image and video generation
# How ? It's just about architectures and loss functions Image Segmentation  Object Detection (intuition)  Object Detection (in practice)  Instance Segmentation  To learn more about this, see [this cs231n class](http://cs231n.stanford.edu/slides/2022/lecture_9_jiajun.pdf) # Encoder Decoder and U-nets How to solve a dense task with CNNs ? The problem : Context  The Encoder Decoder Concept  ) How to upsample ? Static upsampling  How to upsample ? Learnable upsampling  How to upsample ? Learnable upsampling  Is it enough ? How to keep spatial information ?  They are called [U-Net (2015)](https://arxiv.org/abs/1505.04597)s  Imagen & co are based on unets  # Hands-on 2 : From image classification to image segmentation ### Dataset description - 12800 train images (6400 cloudy, 6400 clear), size 64x64 - 640 test images (320 cloudy, 320 clear), size 64x64  ### Dataset description - 64 big test images (256x256)  ### Let's go ! - 3_classification_to_segmentation.ipynb : From image classification to dense tasks - 4_sliding_window_unet.ipynb : How to apply our detector to a big satellite image ? - Extra : Artificial Intelligence in practice in a space systems context [ISAE SPAPS 2023-2024 class](https://docs.google.com/presentation/d/10zd65eg0X_aqydggKRvC3s20AXSu3WI9RWWUKJVvfAI/edit?usp=sharing) Extra : Other keywords that are important for CNN in Computer Vision - [Self-Supervised Learning](http://cs231n.stanford.edu/slides/2023/lecture_13.pdf) - [Vision Transformers](https://www.v7labs.com/blog/vision-transformer-guide) - [Generative Adversarial Networks](https://github.com/SupaeroDataScience/deep-learning/tree/main/GAN) - [Diffusion Models](http://cs231n.stanford.edu/slides/2023/lecture_15.pdf) (StableDiffusion, MidJourney ...)