DeepLearning.AI

Generative Adversarial Networks (GANs) Specialization

up to 3 months
Intermediate

The Generative Adversarial Networks (GANs) Specialization provides an exciting introduction to image generation with GANs, charting a path from foundational concepts to advanced techniques. Gain hands-on experience in building and evaluating GANs using PyTorch, and explore their applications in data augmentation and privacy preservation.

GAN components
Basic GANs using PyTorch
Advanced DCGANs
Conditional GANs
FID method for GAN evaluation

Overview

In this specialization, you will learn to understand GAN components, build basic and advanced GANs using PyTorch, and control your GAN to build conditional GANs. You will compare generative models, use the FID method to assess GAN fidelity and diversity, and learn to detect bias in GANs. Additionally, you will explore GAN applications in data augmentation and privacy preservation, and implement Pix2Pix and CycleGAN for image translation.

Online
course location
English
course language
Professional Certification
upon course completion
Self-paced
course format
Live classes
delivered online

Who is this course for?

Data Scientists

Professionals looking to enhance their skills in generative models and image generation.

Machine Learning Engineers

Engineers aiming to implement advanced GAN techniques in their projects.

AI Enthusiasts

Individuals interested in understanding and applying GANs for various applications.

This specialization offers a comprehensive introduction to GANs, covering both foundational concepts and advanced techniques. It is ideal for data scientists, machine learning engineers, and AI enthusiasts looking to enhance their skills in generative models and image generation. By completing this course, learners will gain hands-on experience in building and evaluating GANs, and explore their applications in data augmentation and privacy preservation.

Pre-Requisites

1 / 3

Basic knowledge of machine learning and deep learning concepts
Familiarity with Python programming
Understanding of neural networks and PyTorch

What will you learn?

Course 1: Build Basic Generative Adversarial Networks (GANs)

Learn about GANs and their applications; Understand the intuition behind the fundamental components of GANs; Explore and implement multiple GAN architectures; Build conditional GANs capable of generating examples from determined categories.

Week 1: Intro to GANs

Learn about GANs and their applications, understand the intuition behind the basic components of GANs, and build your very own GAN using PyTorch.

Week 2: Deep Convolutional GAN

Build a more sophisticated GAN using convolutional layers. Learn about useful activation functions, batch normalization, and transposed convolutions to tune your GAN architecture and apply them to build an advanced DCGAN specifically for processing images.

Week 3: Wasserstein GANs with Normalization

Reduce instances of GANs failure due to imbalances between the generator and discriminator by learning advanced techniques such as WGANs to mitigate unstable training and mode collapse with a W-Loss and an understanding of Lipschitz Continuity.

Week 4: Conditional and Controllable GANs

Understand how to effectively control your GAN, modify the features in a generated image, and build conditional GANs capable of generating examples from determined categories.

Course 2: Build Better Generative Adversarial Networks (GANs)

Assess the challenges of evaluating GANs and compare different generative models; Use the Fréchet Inception Distance (FID) method to evaluate the fidelity and diversity of GANs; Identify sources of bias and the ways to detect it in GANs; Learn and implement the techniques associated with the state-of-the-art StyleGANs.

Week 1: GAN Evaluation

Understand the challenges of evaluating GANs, learn about the advantages and disadvantages of different GAN performance measures, and implement the Fréchet Inception Distance (FID) method using embeddings to assess the accuracy of GANs.

Week 2: GAN Disadvantages and Bias

Find out the disadvantages of GANs when compared to other generative models, discover the pros/cons of these models — plus, learn about the many places where bias in machine learning can come from, why it’s important, and an approach to identify it in GANs.

Week 3: StyleGAN and Advancements

Understand how StyleGAN improves upon previous models and implement the components and the techniques associated with StyleGAN, currently the most state-of-the-art GAN with powerful capabilities.

Course 3: Apply Generative Adversarial Networks (GANs)

Explore the applications of GANs and examine them wrt data augmentation, privacy, and anonymity; Leverage the image-to-image translation framework and identify applications to modalities beyond images; Implement Pix2Pix, a paired image-to-image translation GAN, to adapt satellite images into map routes (and vice versa); Compare paired image-to-image translation to unpaired image-to-image translation and identify how their key difference necessitates different GAN architectures; Implement CycleGAN, an unpaired image-to-image translation model, to adapt horses to zebras (and vice versa) with two GANs in one.

Week 1: GANs for Data Augmentation and Privacy Preservation

Explore the applications of GANs and examine them wrt data augmentation, privacy, and anonymity. Improve your downstream AI models with GAN-generated data.

Week 2: Image-to-Image Translation

Leverage the image-to-image translation framework and identify extensions, generalizations, and applications of this framework to modalities beyond images. Implement Pix2Pix, a paired image-to-image translation GAN, to adapt satellite images to map routes (and vice versa) with advanced U-Net generator and PatchGAN discriminator architectures.

Week 3: Image-to-Image Unpaired Translation

Compare paired image-to-image translation to unpaired image-to-image translation and identify how their key difference necessitates different GAN architectures. Implement CycleGAN, an unpaired image-to-image translation model, to adapt horses to zebras (and vice versa) with two GANs in one.

Meet your instructors

Sharon Zhou
Cofounder & CEO, Lamini
Sharon Zhou is the Cofounder & CEO of Lamini, an LLM startup based on both her PhD dissertation in generative AI and her love for building delightful products as a product manager.
Eda Zhou
Full Stack Engineer, Lamini
Eda Zhou is a passionate and curious graduate from Worcester Polytechnic Institute who enjoys the possibilities computer science has to offer. She is always eager to try something new.
Eric Zelikman
CS PhD Student @ Stanford, DeepLearning.AI
Eric Zelikman is a student researcher at Blueshift at Google. He is currently pursuing a PhD in Computer Science at Stanford University.

Upcoming cohorts

Cost
Free
Duration
3 months
Dates
start now
Location
Online

Free