Learning world representation - 代码天地

Learning world representation

其他 2018-07-21 23:04:52 阅读次数: 0

很难将物体与其他物体分别开：

分割(segmentation)
光线(lighting)
形变(deformation)
功能可见性(affordances)
观察点(viewpoint)：维度跳跃(dimension-hopping):由于观察点变化，信息从一个维度跳跃到了另一个维度。

_______________________________________________________________________________________

视角变化解决办法

使用冗余的不变特征(use redundant invariant features)
在物体上放一个盒子来正则化像素点(normalized pixels)
使用重复特征，将他们池化，叫做卷积神经网(replicated features with pooling, called 'convolutional neural nets')
使用层次结构，清晰地展示了照相机或视网膜的各个部分的位置。(use a hierarchy of parts that have explicit poses relative to the camera)

不变特征：提取一个很大且冗余的特征集，并且在平移或者缩放或者旋转中不变。

需要大量特征，因为有了冗余的特征，一个特征才会告诉你另外两个特征如何相互关联。

在物体识别中，避免从不同物理提取特征。

放盒子：通过一些形状的知识来标记盒子的方向，进而识别物体。

测试的时候需要尝试所有不同角度和大小的盒子。

_____________________________________________________________________________________________

卷积神经网络在手写数字识别中的应用

Yann LeCun 1980s几个应用很好的神经网络之一。

卷积神经网络起源于重复特征的思想。如果一个特征检测器在图像的某个位置起作用，有很大可能这个特征检测器能够在其他位置派上用场。

所以在不同的位置采用相同的特征检测器。用很多特征，得到很多特征图(feature maps)，图像的每块都能够表示为很多不同特征的集合。

在不同比例和方向上复制特征，会更困难复杂；

在不同位置复制特征大大减少了需要学习的自由参数的数量。

卷积神经网络与反向传播并不冲突：可以用反向传播来进行训练

重复特征达到的效果：同变性(Equivariant activities)

在神经元激活层面达到的是同变性
在权重方面达到的是不变性

如果想要在神经元活动层面上达到不变性，则对重复特征进行池化(pooling)

池化：对输入的特征图进行压缩，一方面使特征图变小，简化网络计算复杂度；一方面进行特征压缩，提取主要特征。

但是池化操作丢失了图片中物体的精确位置信息。

子采样：池化操作

S2：把C1中的重复特征合并到一起

如果在机器学习问题中加入先验知识

通过设计网络结构：

局部连接

对权重进行约束

选择合适的激活函数

还可以使用先验知识来合成数据。

在LeNet中加入合成数据，使错误率变小。

如何估计错误率：检测特定的错误

在第一个表格中，模型2要比模型1好。

__________________________________________________________________________________________________

识别彩色图像中的物体，比识别手写字体难很多。

物体种类多
很多像素(256*256 color vs 28*28 gray)
需要对3D图片进行处理，会丢失很多信息
需要分割不同杂乱的场景
每张图片中不同的物体

在识别手写字体中表现的很好的神经网络在识别彩色图片也会表现很好吗？

李飞飞创造了ImageNet数据集和比赛，推荐李飞飞的计算机视觉课程。

AlexKrizhevsky的神经网络：16%的错误率

GPU很擅长处理矩阵乘法，30倍速度

猜你喜欢

转载自blog.csdn.net/ll523587181/article/details/78865598

Learning world representation

[转载]Deep Learning·NLP·Representation

iCaRL: Incremental Classifier and Representation Learning

Model Representation--machine learning

《Graph Representation Learning》【1】——Introduction

# Representation Learning with Contrastive Predictive Coding

Building Program Vector Representation for Deep Learning

Representation Learning: A Review and New Perspectives阅读笔记

Machine Learning - Neural Networks Representation Part II

Improved Representation Learning for Question Answer Matching

Video Representation Learning Using Discriminative Pooling 阅读

表示学习(representation learning)的初印象

（IS 19）Unsupervised Raw Waveform Representation Learning for ASR

论文阅读——Knowledge-Embedded Representation Learning

Hierarchical Graph Representation Learning with Differentiable Pooling摘要

《Graph Representation Learning》【3】——Neighborhood Reconstruction Methods

NLP 3.2 Representation learning and about DL

《Graph Representation Learning》【2】——Background and Traditional Approaches

Inductive and Unsupervised Representation Learning on Graph Structured Objects

Generative Models as a Data Source for Multiview Representation Learning

[论文阅读] iCaRL: Incremental Classifier and Representation Learning

[论文翻译] iCaRL: Incremental Classifier and Representation Learning

Momentum Contrast for Unsupervised Visual Representation Learning

MOCO： Momentum Contrast for Unsupervised Visual Representation Learning

MoCO ——Momentum Contrast for Unsupervised Visual Representation Learning

Unsupervised Visual Representation Learning by Context Prediction（2015

详解VQVAE：Neural Discrete Representation Learning

论文阅读笔记 Improved Word Representation Learning with Sememes

Incoherent dictionary learning for sparse representation based image denoising（一）

Deep Direct Reinforcement Learning for Financial Signal Representation and Trading

今日推荐

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

报告：Django 仍然是 74% 开发者的首选

《2024 年一季度互联网投融资运行情况》研究报告

15 年前上了“FFmpeg 耻辱柱”，今天他还得谢谢咱——腾讯QQPlayer一雪前耻？

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

周排行

curl的POST请求，封装方法

8.1.1. Integer Types

Java基础 Day05(个人复习整理)

Python - Django - 中间件 process_exception

小L的试卷

【Shell编程】（函数）判断用户是否存在

python(css样式)

spring ant path 匹配原则 - 【笔记】

《JavaScript与JScript从入门到精通》(美)James.Jaworski.中译本.扫描版.pdf

Eclipse运行带参数的java程序

每日归档

更多

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)