深度学习

计算机科学与技术系研究生选修课, 厦门大学, 2024-09

该课程为厦门大学信息学院/人工智能研究院研究生选修课,以基于深度学习平台TensorFlow和PyTorch的编程实践为主,算法理论为辅,使学生能够领悟深度学习的基本原理以及适用场景,并且对使用深度学习方法来解决问题具有一定的动手能力,为学生今后开展科研工作和业界求职打下基础。

课程大纲

章节主要内容Notebook视频链接
Lecture 1 Introduction to Deep LearningDeep learning applications, impacts, researchers, history, and course description. Lecture 1
Lecture 2 Basics of Machine LearningBasics of machine learning, linear models, neural networks, back-propagation, model selection, model evaluation. Lecture 2
Lecture 3 Regularization and OptimizationGeneralization, overfitting and underfitting, regularization, optimization for deep models, batch normalization, parameter initialization. Lecture 3
Lecture 4 Hardware and SoftwareDeep learning hardware, PyTorch, TensorFlow, Keras.Lecture 4Lecture 4
Lecture 5 Basics of Convolutional Neural NetworksConvolution, padding, stride, parameter sharing, pooling, common CNN patterns. Lecture 5
Lecture 6 CNN ArchitecturesAlexNet, VGG, GoogLeNet, ResNet, SENet, DenseNet.Lecture 6Lecture 6
Lecture 7 Basics of Recurrent Neural NetworksRNN, Seq2seq, Attention models, LSTM Lecture 7
Lecture 8 Language ModelWord2vec, ELMo, Transformer, BERTLecture 8Lecture 8
Lecture 10 Deep Reinforcement LearningMarkov Decision Process, Q-Learning, Deep Q Network, Policy Gradient, Actor-Critic, DDPG. Lecture 10
Lecture 11 Generative ModelsGAN, DCGAN, CGAN, WGAN, SAGAN, pix2pix, CycleGAN, SRGANLecture 11Lecture 11
Lecture 12 Deep Learning on GraphsDeepwalk, LINE, Node2vec, GCN, GraphSAGE, GAT Lecture 12
Lecture 13 Self-Supervised LearningGeneration-Based Methods, Context-Based Methods, Free Semantic Label-Based Methods, Cross Modal-Based Methods, Contrastive Learning Lecture 13
Lecture 14 Meta-LearningOptimization-Baesd Method, Model-Based Method, Metric-Based Method, MAML, Few-Shot Learning Lecture 14
Lecture 15 Special Topics in Deep LearningKnowledge Distillation, Adversarial Samples, Model Interpretation, Fairness, Privacy Lecture 15

大作业展示

标题
A Training-Free Localized Video Style Transfer Method Based on Diffusion Models
All-In-One Image Restoation
An Adversarial Example Attack Method Based on Ensemble Learning
Application of Large Models on Edge Devices From Voice Input to Text-to-Image and Intelligent Dialogue Based on Raspberry Pi
ATN A Multi-Source Light Correction Network Using Attention for True-to-Life Color Constancy
Automatic ICD Coding with Pretrained Language Models
Construction of real-time live digital human based on deep learning neural network
Cross-platform Recommendation Technology Based on Prototype Alignment
DHFusion Dual Dynamic Hypergraph-Driven Fusion for GAN Inversion
Dynamic Contextual Adjustment and Multi-Step Self-Correction Framework for Few-Shot Knowledge Base Question Answering
Enhancing Text-to-Image Synthesis with CLIP-Integrated Diffusion Models
ESTJ-GD Visual Reconstruction via EEG-Image Joint Space Learning with Guided Diffusion.pdf
Extraction of HUA Metabolic Features Based on Deep Learning
GNFormer Structral Transformer for Large Graphs
High-Resolution Facial Visual Dubbing Technology Based on Spatial Deformation and Inpainting
HyperDSC Reject Outliers with High-Order Deep Spatial Compatibility Learning for 3D Registration
IDMS Information Density-driven Multi-scale Segmentation
Image Matching in 3D Space with Transformer
Image restoration based on Spiking Neural Network
Learning CLIP guided video person recogonization
Low Rank Quantization Adaptation for Large Language Model
Mamba-Based Public Figure Recognition in Complex Environments
Medical Image Anomaly Detection
Meta-Learning for Multimodal Cross-Domain Recommendation Systems
MSA-UDA Multi-scale Alignment Guided method in Cross-modal Unsupervised Domain Adaptation in 3D Semantic Segmentation
Multi-Emotional Expression Generation from a Single Facial Expression
Multimodal Fusion Remote Sensing Semantic Segmentation Based on CNN and Transformer
Multimodal Human Motion Capture Based on Pre-Trained Models
Named Entity Recognition of Thyroid Diseases Based on the BioBERT Pre-trained Model
New Generation Conversational Recommendation System Powered by Large Language Model Agent
OC3D Weakly Supervised Outdoor 3D Object Detection with Only Coarse Click Annotation
Optimizing Protein Sequence Design Based on the Advanced ProteinMPNN Deep Learning Model
Pet Age Automatic Recognition
Physical Shape-aware Poser(PSP) Tracking Various-shaped Human Motion From Sparse Inertial Sensors
Plant disease identification using Vanillanet
Product Summarization Extraction Model with Image Information
Real time 3D Human Pose Estimation from Video Based on Transformer
Real-time Prediction of Fire Smoke Propagation Research Proposal
Research on Improvements to Industrial Defect Detection Algorithms Based on YOLOv8
Research on Person Re-identification Based on Frequency Domain Adaptation
Rumor Detection Based on Transformer
Stain Normalization for Whole Slide Image Classification Based on Multiple Instance Learning
TalNet Text-based Semi-Supervised Referring Camouflaged Object Detection with Active Learning
Task Planning and Execution Based on LLM-powered Mobile Agents
Text-Guided Image Fusion
Text-to-Speech Integration With Voice Cloning
Toward Open-Vocabulary Video Object Detection
Towards an Enhanced Audio Watermarking Approach Balancing Robustness and Imperceptibility
Towards High Quality multi-speaker TTS of Chinese
Towards Open-Vocabulary 3D Scene Understanding Using Gaussian Splatting
Towards Robust Multimodal Sentiment Detection with LLaVA
UAV Path Planning Based on Point Cloud Deep Reinforcement Learning
Urban Street View Semantic Segmentation Based on Multi-scale Network and Ghost Attention Head
Using Multi-cues Fusion in Language-Guided Multiple Object Tracking

参考资料

本课程的课件参考了许多著名的深度学习课程,非常感谢这些课程的教授对课件进行无私的分享。

CS231n: Convolutional Neural Networks for Visual Recognition, Stanford University

CS224n: Natural Language Processing with Deep Learning, Stanford University

CS224w: Machine Learning with Graphs, Stanford University

CMSC 35246: Deep Learning, University of Chicago

Introduction to Reinforcement Learning with David Silver, DeepMind

MGMTMSA-434: Advanced Workshop on Machine Learning, UCLA

期末合影

IMG_7043 IMG_7194

往年资料

2023 2022 2021 2020