Safety in Generative AI
The rapid advancement of generative AI has brought remarkable innovations, from creating realistic images to generating human-like text. However, this power comes with significant responsibility. AI safety is crucial to ensuring that these systems are used ethically and fairly, and that they do not cause unintended harm. Without proper safety measures, generative AI can spread misinformation, reinforce biases, or be exploited for malicious purposes. By prioritizing AI safety, researchers and developers can build trust, establish guidelines for responsible use, and mitigate risks before they affect society at large. This course covers the following topics:
Introductory Videos
Part 1: Introduction to Modern Generative AI
Foundations of Generative AI
Lectures
- Lecture 1: Introduction and Basics of Deep Learning
- Lecture 2: Foundation Models and Emerging Abilities
Reading
Additional Recommended Reading
Large Language Models and Diffusion Models
Lectures
- Lecture 3: Large Language Models and Pretraining
- Lecture 4: Post-Training and Safety Alignment
Reading
Additional Recommended Reading
Generative AI as Agents
Lectures
- Lecture 5: Image Generation Models
- Lecture 6: AI Agents
Reading
Additional Recommended Reading
- Learning Transferable Visual Models from Natural Language Supervision
- Score-based Generative Modeling through Stochastic Differential Equations (SDEs)
- Gemini 2.5 Pro Capable of Winning Gold at the 2025 International Math Olympiad (IMO)
- Evaluating Large Language Models Trained on Code
- Competition-Level Code Generation with AlphaCode
- Intelligent Agents (chapter 2 of Artificial Intelligence: A Modern Approach)
- Language Agents: Foundations, Prospects, and Risks
Part 2: Safety Risks and Mitigation
Inference-Time Adversarial Attacks
Lectures
- Lecture 7: Adversarial Examples
- Lecture 8: Jailbreaking and Prompt Injection
Reading
Additional Recommended Reading
Training/Post-Training Time Attacks
Lectures
- Lecture 9: Data Poisoning (part 1 of 2)
- Lecture 10: Data Poisoning (part 2 of 2) and Model Collapse
Reading
Additional Recommended Reading
- Data Poisoning
- Poisoning Attacks on Neural Networks
- Targeted Backdoor Attacks Using Data Poisoning
- Clean-Label Data Poisoning Attacks
- LLM Data Poisoning
- Universal Jailbreak Backdoors with Poisoned Human Feedback
- Poisoning Language Models During Instruction Tuning
- Fine-tuning Aligned Language Models Compromises Safety
- Persistent Pre-training Poisoning of LLMs
- Backdooring Language Models at Pre-Training with Indirect Data Poisoning
- Model Collapse
Societal Risks
Lectures
- Lecture 11: Privacy and Copyright
- Lecture 12: Existential Threats
Reading
- Extracting Training Data from ChatGPT
- Foundation Models and Fair Use
- LawZero Initiative: Safe AI for Humanity
Additional Recommended Reading
- Data Privacy Risks
- Copyright
- Existential Risk
Deepfakes, Plagiarism, AI Detectors
Lectures
- Lecture 13: Towards Detecting AI-Generated Content
Reading
- CNN-generated images are surprisingly easy to spot... for now
- DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
Additional Recommended Reading
- Are AI Text Detectors Reliable?
- AI Image Detectors
Part 3: Watermarking
Watermarking Generative AI
Lectures
- Lecture 14: Watermarking Generative AI (part 1 of 3)
- Lecture 15: Watermarking Generative AI (part 2 of 3)
Reading
Additional Recommended Reading
- Invisible Watermarks are Provably Removable Using GenAI
- Provable Robust Watermarking for AI-Generated Text
- On the Reliability of Watermarks for LLMs
- The Impossibility of Strong Watermarking for Generative Models
- Stable Signature: A new method for watermarking images created by open source generative AI
- Watermarking for AI-Generated Content
- Scalable watermarking for identifying large language model outputs
Watermarking LLMs and Beyond
Lectures
- Lecture 16: Watermarking Generative AI (part 3 of 3)
Reading
Additional Recommended Reading
