20810522 - Deep Learning and Generative Models

The course aims to illustrate the foundation concepts underlying discriminative and generative deep neural networks.
The student will acquire the ability to employ deep networks, with particular reference to the state of the art, for the recognition and classification of images and signals, and for the generation of content, such as images and text. The fundamental techniques underlying Large Language Models, and recent prompt-based paradigms, will be explored. Applications in various domains will be illustrated, including computer vision, speech recognition, natural language analysis, machine translation. At the end of the course the student will be able to write Python code to train deep learning networks and test them in both discriminative and generative domains.

teacher profile | teaching materials

Programme

Course Program

Introduction to Deep Learning and the main paradigms of modern Artificial Intelligence. Fundamentals of deep neural networks: artificial neurons, activation functions, optimization methods, and training techniques. Discriminative and generative neural architectures.

Convolutional Neural Networks (CNNs) for image and signal analysis: convolution, pooling, feature extraction, and major applications in computer vision. Embeddings and distributed data representations.

Neural models for Natural Language Processing and introduction to Transformer architectures: attention mechanisms, tokenization, Large Language Models (LLMs), pre-training, and the use of generative models for text and multimodal content generation. Modern prompting techniques and interaction paradigms with generative models.

Introduction to modern generative models for images and text, with particular focus on autoregressive models and diffusion models. Applications to image generation, multimedia content generation, and intelligent conversational systems.

Recent paradigms based on intelligent agents and tool orchestration: agent-based architectures, external tool usage, memory mechanisms, and planning of complex tasks. Introduction to multi-agent systems and fundamental concepts of decision making and reinforcement learning for AI-based autonomous systems.

Applications of Deep Learning in several domains, including computer vision, speech recognition, natural language processing, machine translation, and intelligent autonomous systems.

Practical activities in Python using modern Deep Learning frameworks for the design, training, evaluation, and application of discriminative and generative neural models.

Core Documentation

Lecture notes by the professor.

Attendance

Attendance is not compulsory but recommended.

Type of evaluation

Written test, practical test.