MultiModal Machine Learning 笔记_No.0 课程介绍

企业开发 2023-04-08 01:26:35 阅读次数: 0

课程简介：

Multimodal machine learning (MMML) is a vibrant multi-disciplinary research field which addresses some of the original goals of artificial intelligence by integrating and modeling multiple communicative modalities, including linguistic, acoustic, and visual messages. With the initial research on audio-visual speech recognition and more recently with language & vision projects such as image and video captioning, this research field brings some unique challenges for multimodal researchers given the heterogeneity of the data and the contingency often found between modalities. This course will teach fundamental mathematical concepts related to MMML including multimodal alignment and fusion, heterogeneous representation learning and multistream temporal modeling. We will also review recent papers describing state-of-the-art probabilistic models and computational algorithms for MMML and discuss the current and upcoming challenges.

The course will present the fundamental mathematical concepts in machine learning and deep learning relevant to the five main challenges in multimodal machine learning: (1) multimodal representation learning, (2) translation & mapping, (3) modality alignment, (4) multimodal fusion and (5) co-learning. These include, but not limited to, multimodal auto-encoder, deep canonical correlation analysis, multi-kernel learning, attention models and multimodal recurrent neural networks. The course will also discuss many of the recent applications of MMML including multimodal affect recognition, image and video captioning and cross-modal multimedia retrieval.

课程主页：https://cmu-multicomp-lab.github.io/mmml-course/fall2020/

课程视频与PPT：11-777 MMML | Schedule

猜你喜欢

转载自blog.csdn.net/like_jmo/article/details/127628963

MultiModal Machine Learning 笔记_No.0 课程介绍

Multimodal Machine Learning

Multimodal Machine Learning:A Survey and Taxonomy 综述阅读笔记

ML | Machine Learning课程笔记

Multimodal Machine Learning: A Survey and Taxonomy【未完待续】

Multimodal Machine Learning: A Survey and Taxonomy/多模态机器学习综述

structure machine learning projects 课程笔记

深度学习神经网络学习笔记-多模态方向-13- Multimodal machine learning: A survey and taxonomy

Machine Learning Yearning介绍

深度学习笔记（一）_Machine Learning (2020)_ Course Introduction【课程介绍】

Machine Learning 笔记一

Machine Learning 笔记二

Machine Learning 学习笔记

Machine Learning 笔记 (一)

Coursera课程《Machine Learning》吴恩达课堂笔记

Andrew Ng machine learning 课程笔记--牛顿方法

Andrew Ng machine learning课程笔记--机器学习的动机与应用

Andrew Ng machine learning 课程笔记--特征选择

Andrew Ng machine learning 课程笔记--顺序最小优化算法

Andrew Ng machine learning 课程笔记--生成学习算法

多模态机器学习研究分类总结-Multimodal Machine Learning A Survey and Taxonomy

李宏毅老师机器学习课程笔记_ML Lecture 0-1: Introduction of Machine Learning

0.Overview----Machine Learning

Machine Learning Yearning - Ng 笔记

Machine Learning Yearning 要点笔记

Machine Learning 学习笔记（tensorflow)

coursera课程《Neural Networks for Machine Learning》

李宏毅老师机器学习课程笔记_ML Lecture 0-2: Why we need to learn machine learning?

【Machine Learning】梯度下降算法介绍_02

【Machine Learning】梯度下降算法介绍_02

今日推荐

Apache Doris 2.0.10 版本正式发布！

开源日报 | 大模型开战；大模型独角兽被曝卖身；周鸿祎建议谷歌开源所有产品；最大开源AI社区提供1000万美元共享GPU

开源日报 | Chrome内置Gemini的意义不在于Gemini；中国AI追随之路的五大误区；ECharts创始人“下海”养鱼；谷歌I/O开发者大会什么都有，只是没有惊喜

微软回应中国区AI团队“打包赴美”传闻

基于大语言模型的开源知识库问答系统 MaxKB GitHub Star 数量突破 5,000 个！

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

周排行

阿里云短信服务平台注册

Windows下的字符串处理(1)

sqoop: mysql导入数据到hdfs, hive, hbase

commons.lang中常用的工具类

离线安装PostgreSQL11.6

使用PyTorch简单实现卷积神经网络模型

一文彻底搞定谱聚类

一道面试题引发的血案

One Chat for Mac(聊天工具)

TCP/IP的底层队列是如何实现的？

每日归档

更多

2024-05-17(34)

2024-05-16(6)

2024-05-15(24)

2024-05-14(0)

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)