8.17 Paper Skim Notes

Enterprise Development 2023-09-09 19:11:07

Table of Contents

MixMatch: A Holistic Approach to Semi-Supervised Learning (2019)
CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features (2019)
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution
U2-Net: Going Deeper with Nested U-Structure for Salient Object Detection
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
CBAM: Convolutional Block Attention Module
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence
Res2Net: A New Multi-scale Backbone Architecture (2019)
Barlow Twins: Self-Supervised Learning via Redundancy Reduction (2021)
Emerging Properties in Self-Supervised Vision Transformers (2021)
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer (2022)
Supervised Contrastive Learning (2020)
RepVGG: Making VGG-style ConvNets Great Again (2021)
Pay Attention to MLPs (2021)
Dual Path Networks (2017)
Visual Attention Network (2022)
PVT v2: Improved Baselines with Pyramid Vision Transformer (2021)
Swin Transformer V2: Scaling Up Capacity and Resolution
MetaFormer Is Actually What You Need for Vision (2022)
CvT: Introducing Convolutions to Vision Transformers (2021)

MixMatch: A Holistic Approach to Semi-Supervised Learning (2019)

[Figure]

The sharpening formula.
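
A minimal sketch of the sharpening step, assuming the usual temperature form (raise each class probability to the power 1/T, then renormalize). The function name sharpen and the default T = 0.5 are choices made for this illustration only.

```python
def sharpen(probs, temperature=0.5):
    """Temperature sharpening: p_i^(1/T) / sum_j p_j^(1/T).

    Lower temperatures push the distribution toward one-hot.
    """
    powered = [p ** (1.0 / temperature) for p in probs]
    total = sum(powered)
    return [p / total for p in powered]


# Example: an averaged prediction over augmentations of one unlabeled image.
guess = [0.5, 0.3, 0.2]
sharpened = sharpen(guess, temperature=0.5)
```

With T < 1 the largest entry grows and the others shrink, which lowers the entropy of the guessed label.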

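CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features (2019)

[Figure]

Mixes images together: a rectangular patch cut from one training image is pasted onto another, and the labels are mixed in proportion to the patch area.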
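
The mixing can be sketched roughly as follows, assuming single-channel images stored as nested lists and a mixing ratio lam that would normally be sampled from a Beta distribution. The helper names cutmix_box and cutmix are hypothetical, and this is not the authors' implementation.

```python
import math
import random


def cutmix_box(height, width, lam, rng=random):
    """Pick a box whose area is roughly (1 - lam) of the image."""
    cut_h = int(height * math.sqrt(1.0 - lam))
    cut_w = int(width * math.sqrt(1.0 - lam))
    cy = rng.randrange(height)  # box center, uniform over the image
    cx = rng.randrange(width)
    top = max(cy - cut_h // 2, 0)
    bottom = min(cy + cut_h // 2, height)
    left = max(cx - cut_w // 2, 0)
    right = min(cx + cut_w // 2, width)
    return top, bottom, left, right


def cutmix(img_a, img_b, lam, rng=random):
    """Paste a patch of img_b into a copy of img_a.

    Returns the mixed image and the adjusted lam (fraction of img_a kept),
    since clipping at the border can shrink the pasted box.
    """
    height, width = len(img_a), len(img_a[0])
    top, bottom, left, right = cutmix_box(height, width, lam, rng)
    mixed = [row[:] for row in img_a]
    for y in range(top, bottom):
        for x in range(left, right):
            mixed[y][x] = img_b[y][x]
    lam_adj = 1.0 - (bottom - top) * (right - left) / (height * width)
    return mixed, lam_adj


img_a = [[0.0] * 8 for _ in range(8)]
img_b = [[1.0] * 8 for _ in range(8)]
mixed, lam_adj = cutmix(img_a, img_b, lam=0.75, rng=random.Random(0))
# The mixed label would then be lam_adj * y_a + (1 - lam_adj) * y_b.
```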

Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution

[Figure]

The high- and low-frequency feature maps are each updated, and the two exchange information with each other.
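
As I understand the formulation, the update and exchange can be written as below. Here X^H and X^L are the high- and low-frequency input feature maps (the low-frequency one at half the spatial resolution), f(X; W) is a convolution with weights W, and upsample and pool change resolution by a factor of 2.

```latex
\begin{aligned}
Y^{H} &= f\big(X^{H}; W^{H \to H}\big) + \mathrm{upsample}\big(f(X^{L}; W^{L \to H}),\, 2\big) \\
Y^{L} &= f\big(X^{L}; W^{L \to L}\big) + f\big(\mathrm{pool}(X^{H}, 2); W^{H \to L}\big)
\end{aligned}
```

The first term in each line is the within-frequency update, and the second term is the exchange from the other frequency.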

U2-Net: Going Deeper with Nested U-Structure for Salient Object Detection

[Figure]

Essentially a nested U-Net.

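Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

[Figure]
Uses a mean-teacher-style model (the target network is an exponential moving average of the online network) and computes a loss between the two networks' outputs.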
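
A rough sketch of the two ingredients, assuming flat lists stand in for parameters and embeddings. The names ema_update and byol_loss and the default tau are illustrative assumptions, not taken from the paper's code.

```python
import math


def ema_update(target_params, online_params, tau=0.99):
    """Target weights follow the online weights as an exponential moving average."""
    return [tau * t + (1.0 - tau) * o for t, o in zip(target_params, online_params)]


def byol_loss(online_pred, target_proj):
    """Squared distance between L2-normalized vectors, i.e. 2 - 2 * cosine similarity."""
    dot = sum(a * b for a, b in zip(online_pred, target_proj))
    norm_a = math.sqrt(sum(a * a for a in online_pred))
    norm_b = math.sqrt(sum(b * b for b in target_proj))
    return 2.0 - 2.0 * dot / (norm_a * norm_b)


new_target = ema_update([1.0, 2.0], [0.0, 0.0], tau=0.5)
aligned = byol_loss([1.0, 0.0], [2.0, 0.0])
opposite = byol_loss([1.0, 0.0], [-1.0, 0.0])
```

Only the online network receives gradients; the target network changes only through the moving average.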

CBAM: Convolutional Block Attention Module

[Figure]

[Figure]

import math

import torch
import torch.nn as nn


class GhostModule(nn.Module):
    def __init__(self, inp, oup, kernel_size=1, ratio=2, dw_size=3, stride=1, relu=True):
        super().__init__()
        self.oup = oup
        # Only about oup / ratio channels come from the ordinary ("primary") convolution;
        # the remaining channels are generated from them by a cheap depthwise convolution.
        init_channels = math.ceil(oup / ratio)
        new_channels = init_channels * (ratio - 1)

        self.primary_conv = nn.Sequential(
            nn.Conv2d(inp, init_channels, kernel_size, stride, kernel_size // 2, bias=False),
            nn.BatchNorm2d(init_channels),
            nn.ReLU(inplace=True) if relu else nn.Sequential(),
        )

        # groups=init_channels makes this a depthwise convolution over the primary features.
        self.cheap_operation = nn.Sequential(
            nn.Conv2d(init_channels, new_channels, dw_size, 1, dw_size // 2, groups=init_channels, bias=False),
            nn.BatchNorm2d(new_channels),
            nn.ReLU(inplace=True) if relu else nn.Sequential(),
        )

    def forward(self, x):
        x1 = self.primary_conv(x)
        x2 = self.cheap_operation(x1)
        out = torch.cat([x1, x2], dim=1)
        # ceil() can overshoot the requested width, so trim back to exactly oup channels.
        return out[:, :self.oup, :, :]

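The code makes this easier to follow: in essence, it reduces the number of convolution parameters. Note that the snippet above is a GhostModule (the Ghost module from GhostNet), not CBAM's channel and spatial attention.
Code link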
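
To make the parameter saving concrete, here is a small back-of-the-envelope count for the GhostModule above compared with a plain convolution of the same input and output width. BatchNorm parameters are ignored, and the helper names are hypothetical.

```python
import math


def conv_params(inp, oup, kernel_size):
    """Weights of an ordinary convolution without bias."""
    return inp * oup * kernel_size * kernel_size


def ghost_params(inp, oup, kernel_size=1, ratio=2, dw_size=3):
    """Weights of the primary conv plus the depthwise 'cheap' conv (no bias, no BN)."""
    init_channels = math.ceil(oup / ratio)
    new_channels = init_channels * (ratio - 1)
    primary = inp * init_channels * kernel_size * kernel_size
    # Depthwise: each output channel sees exactly one input channel.
    cheap = new_channels * dw_size * dw_size
    return primary + cheap


plain = conv_params(64, 128, kernel_size=1)
ghost = ghost_params(64, 128, kernel_size=1, ratio=2, dw_size=3)
```

For 64 input and 128 output channels with a 1x1 primary kernel, the plain convolution has 8192 weights and the Ghost version 4672.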

FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence

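[Figure]
Applies different augmentations to the same image and uses cross-entropy to compute a loss between the resulting predicted probability distributions.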
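
A minimal sketch of the unlabeled loss, assuming the model's softmax outputs for the weakly and strongly augmented views are already available as lists. The function name and the 0.95 threshold default are assumptions for this example.

```python
import math


def fixmatch_unlabeled_loss(weak_probs, strong_probs, threshold=0.95):
    """Cross-entropy on strong views against pseudo-labels from confident weak views.

    Samples whose weak-view confidence is below the threshold contribute zero,
    but the loss is still averaged over the whole unlabeled batch.
    """
    total = 0.0
    for weak, strong in zip(weak_probs, strong_probs):
        confidence = max(weak)
        if confidence >= threshold:
            pseudo_label = weak.index(confidence)
            total += -math.log(max(strong[pseudo_label], 1e-12))
    return total / len(weak_probs)


weak = [[0.97, 0.03], [0.60, 0.40]]    # only the first sample is confident
strong = [[0.50, 0.50], [0.10, 0.90]]
loss = fixmatch_unlabeled_loss(weak, strong)
```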

Res2Net: A New Multi-scale Backbone Architecture (2019)

[Figure]
Adds multi-scale feature extraction inside a single block.
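
As I understand it, the multi-scale part works by splitting the block's feature map into s channel groups x_1, ..., x_s and chaining 3x3 convolutions K_i across them, roughly:

```latex
y_i =
\begin{cases}
x_i, & i = 1 \\
K_i(x_i), & i = 2 \\
K_i(x_i + y_{i-1}), & 2 < i \le s
\end{cases}
```

Each later group sees the output of the previous one, so its receptive field is larger, and concatenating all y_i mixes several scales.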

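Barlow Twins: Self-Supervised Learning via Redundancy Reduction (2021)

[Figure]
Actually fairly simple: the loss is computed between the outputs of the same network on two different transformations of the same image.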
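
A rough pure-Python sketch of that loss, assuming the two views' embeddings are given as lists of rows. The cross-correlation matrix between the standardized embeddings is pushed toward the identity; the function names and the off-diagonal weight lambd are illustrative assumptions.

```python
import math


def standardize(batch):
    """Zero-mean, unit-variance normalization of each feature over the batch."""
    n, d = len(batch), len(batch[0])
    out = [[0.0] * d for _ in range(n)]
    for j in range(d):
        col = [row[j] for row in batch]
        mean = sum(col) / n
        std = math.sqrt(sum((v - mean) ** 2 for v in col) / n) or 1.0
        for i in range(n):
            out[i][j] = (col[i] - mean) / std
    return out


def barlow_twins_loss(z_a, z_b, lambd=0.005):
    """Sum of (1 - C_ii)^2 on the diagonal plus lambd * C_ij^2 off the diagonal."""
    z_a, z_b = standardize(z_a), standardize(z_b)
    n, d = len(z_a), len(z_a[0])
    loss = 0.0
    for i in range(d):
        for j in range(d):
            c_ij = sum(z_a[k][i] * z_b[k][j] for k in range(n)) / n
            loss += (1.0 - c_ij) ** 2 if i == j else lambd * c_ij ** 2
    return loss


z = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
z_neg = [[-v for v in row] for row in z]
same = barlow_twins_loss(z, z)
flipped = barlow_twins_loss(z, z_neg)
```

The diagonal term makes the two views agree, and the off-diagonal term discourages different feature dimensions from carrying the same information.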

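Emerging Properties in Self-Supervised Vision Transformers (2021)

[Figure]

Different augmented versions of an image are fed through the networks, and the loss is computed against a mean-teacher network.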
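
A minimal sketch of how such a loss can be computed for one pair of views, assuming raw output logits as lists. The teacher output is centered and sharpened with a low temperature, and the student is trained with cross-entropy against it; the names and default values here are assumptions for illustration.

```python
import math


def log_softmax(logits, temperature):
    """Numerically stable log-softmax of logits / temperature."""
    scaled = [v / temperature for v in logits]
    m = max(scaled)
    log_z = m + math.log(sum(math.exp(v - m) for v in scaled))
    return [v - log_z for v in scaled]


def dino_loss(student_logits, teacher_logits, center, student_temp=0.1, teacher_temp=0.04):
    """Cross-entropy between the centered, sharpened teacher and the student."""
    centered = [t - c for t, c in zip(teacher_logits, center)]
    teacher_probs = [math.exp(v) for v in log_softmax(centered, teacher_temp)]
    student_log_probs = log_softmax(student_logits, student_temp)
    return -sum(p * s for p, s in zip(teacher_probs, student_log_probs))


def update_center(center, teacher_batch, momentum=0.9):
    """Moving average of the teacher's mean output over a batch."""
    batch_mean = [sum(col) / len(teacher_batch) for col in zip(*teacher_batch)]
    return [momentum * c + (1.0 - momentum) * m for c, m in zip(center, batch_mean)]


agree = dino_loss([2.0, 0.0], [2.0, 0.0], center=[0.0, 0.0])
disagree = dino_loss([0.0, 2.0], [2.0, 0.0], center=[0.0, 0.0])
new_center = update_center([0.0, 0.0], [[2.0, 0.0], [0.0, 2.0]], momentum=0.5)
```

The teacher's weights themselves would be an exponential moving average of the student's, and no gradient flows through the teacher.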

MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer (2022)

[Figure]
Effectively brings convolutions into the transformer to reduce the parameter count.

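Supervised Contrastive Learning (2020)

[Figure]
Roughly: self-supervised contrastive learning treats a different image of a dog as a negative too, and the supervised version fixes this by using the labels.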
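
The idea can be sketched as follows, assuming embeddings and integer class labels for one batch. Every other sample with the same label counts as a positive for the anchor. The function names and the temperature default are assumptions, and this is not the paper's reference code.

```python
import math


def normalize(v):
    """Scale a vector to unit L2 norm."""
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]


def supcon_loss(features, labels, temperature=0.1):
    """Average over anchors of the mean negative log-probability of their positives."""
    feats = [normalize(f) for f in features]
    n = len(feats)
    total, anchors = 0.0, 0
    for i in range(n):
        # Similarity of the anchor to every other sample in the batch.
        sims = {
            a: sum(x * y for x, y in zip(feats[i], feats[a])) / temperature
            for a in range(n) if a != i
        }
        positives = [a for a in sims if labels[a] == labels[i]]
        if not positives:
            continue  # anchors without a same-class partner are skipped
        m = max(sims.values())
        log_denom = m + math.log(sum(math.exp(s - m) for s in sims.values()))
        total += -sum(sims[p] - log_denom for p in positives) / len(positives)
        anchors += 1
    return total / max(anchors, 1)


features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
matching = supcon_loss(features, labels=[0, 0, 1, 1])      # labels follow the clusters
mismatched = supcon_loss(features, labels=[0, 1, 0, 1])    # labels cut across them
```

When the labels agree with the geometry, nearby samples are positives rather than negatives, so the loss is much lower.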

RepVGG: Making VGG-style ConvNets Great Again (2021)

[Figure]
The network diagram makes it clear.
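
For readers without the diagram: as I understand it, the training-time block has a 3x3 branch, a 1x1 branch and an identity branch, which are merged into a single 3x3 convolution for inference. Below is a simplified single-channel sketch of that merge, ignoring BatchNorm folding; all names are hypothetical.

```python
def conv3x3(image, kernel):
    """'Same' 3x3 cross-correlation with zero padding on a 2D list."""
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    yy, xx = y + dy, x + dx
                    if 0 <= yy < h and 0 <= xx < w:
                        acc += image[yy][xx] * kernel[dy + 1][dx + 1]
            out[y][x] = acc
    return out


def three_branch_output(image, kernel3x3, weight1x1):
    """Training-time form: 3x3 branch + 1x1 branch + identity branch."""
    conv_out = conv3x3(image, kernel3x3)
    return [
        [conv_out[y][x] + weight1x1 * image[y][x] + image[y][x] for x in range(len(image[0]))]
        for y in range(len(image))
    ]


def merge_branches(kernel3x3, weight1x1):
    """Inference-time form: fold the 1x1 and identity branches into the kernel center."""
    merged = [row[:] for row in kernel3x3]
    merged[1][1] += weight1x1 + 1.0
    return merged


image = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]
kernel = [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6], [0.7, 0.8, 0.9]]
multi_branch = three_branch_output(image, kernel, weight1x1=0.5)
single_branch = conv3x3(image, merge_branches(kernel, weight1x1=0.5))
```

Both forms give the same output, which is why the deployed network can be a plain stack of 3x3 convolutions.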

Pay Attention to MLPs (2021)

[Figure]
Somewhat similar in spirit to MLP-Mixer.

Dual Path Networks (2017)

[Figure]
Not sure whether the two branches are obtained by splitting along the channel dimension.

Visual Attention Network (2022)

[Figure]

PVT v2: Improved Baselines with Pyramid Vision Transformer (2021)

[Figure]

Swin Transformer V2: Scaling Up Capacity and Resolution

[Figure]
The main change is in how attention is computed from q, k and v.
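
My reading is that this refers to scaled cosine attention, where the dot product between q and k is replaced by their cosine similarity divided by a learnable temperature, plus a position bias. A minimal sketch under that assumption, with hypothetical names and plain lists in place of tensors:

```python
import math


def unit(vector):
    """Scale a vector to unit L2 norm."""
    norm = math.sqrt(sum(x * x for x in vector)) or 1.0
    return [x / norm for x in vector]


def cosine_attention_logits(queries, keys, tau=0.1, bias=None):
    """Attention logits as cos(q, k) / tau + bias instead of a scaled dot product."""
    tau = max(tau, 0.01)  # keep the temperature away from zero
    q_unit = [unit(q) for q in queries]
    k_unit = [unit(k) for k in keys]
    logits = []
    for i, q in enumerate(q_unit):
        row = []
        for j, k in enumerate(k_unit):
            cos = sum(a * b for a, b in zip(q, k))
            b_ij = bias[i][j] if bias is not None else 0.0
            row.append(cos / tau + b_ij)
        logits.append(row)
    return logits


queries = [[1.0, 0.0], [1.0, 1.0]]
keys = [[2.0, 0.0], [0.0, 3.0]]
logits = cosine_attention_logits(queries, keys)
scaled_queries = [[10.0 * x for x in q] for q in queries]
logits_scaled = cosine_attention_logits(scaled_queries, keys)
```

Because only the direction of q and k matters, the logits do not grow with activation magnitude; a softmax over each row and the weighted sum over v follow as usual.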

MetaFormer Is Actually What You Need for Vision (2022)

[Figure]

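CvT: Introducing Convolutions to Vision Transformers (2021)

[Figure]

Some models generate tokens with an MLP and others with convolution; when using convolution, pay attention to the dimension changes between feature maps and token sequences.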
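
The dimension bookkeeping can be sketched with plain shape arithmetic. The kernel, stride, padding and embedding width below are example values assumed for illustration, and the helper names are hypothetical.

```python
def conv_output_size(size, kernel, stride, padding):
    """Standard convolution output size along one spatial dimension."""
    return (size + 2 * padding - kernel) // stride + 1


def conv_token_shape(batch, height, width, embed_dim, kernel=7, stride=4, padding=2):
    """Shapes before and after turning a conv feature map into a token sequence.

    The convolution yields (B, C, H', W'); a transformer layer expects
    (B, H' * W', C), so the spatial dims are flattened and moved before channels.
    """
    out_h = conv_output_size(height, kernel, stride, padding)
    out_w = conv_output_size(width, kernel, stride, padding)
    feature_map_shape = (batch, embed_dim, out_h, out_w)
    token_shape = (batch, out_h * out_w, embed_dim)
    return feature_map_shape, token_shape


feature_map_shape, token_shape = conv_token_shape(1, 224, 224, embed_dim=64)
```

In PyTorch this reshape is commonly written as x.flatten(2).transpose(1, 2), and the inverse is needed before the next convolutional stage.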

Reposted from blog.csdn.net/qq_45745941/article/details/132336518
