Literature Notes (5) (ISSCC 2017, 14.1)

IoT 2018-10-23 13:11:10

Contents

1 Abbreviations
2 overall architecture
2 accelerator subsystem
3 convolutional accelerator
4 DSP cluster

These notes are excerpted from "A 2.9TOPS/W Deep Convolutional Neural Network SoC in FD-SOI 28nm for Intelligent Embedded Systems" (ISSCC 2017, paper 14.1).

1 Abbreviations

DCNN: deep convolutional neural networks
HW: hardware
DMA: direct memory access
CAF: configurable accelerator framework
CA: convolution accelerator
MAC: multiply-accumulate

2 overall architecture

We present a state-of-the-art performance and energy efficient HW-accelerated DCNN processor with the following features:

An energy-efficient set of DCNN HW convolutional accelerators supporting kernel compression
an on-chip reconfigurable data-transfer fabric
a power-efficient array of DSPs to support complete real-world computer vision applications
an ARM-based host subsystem with peripherals
a range of high-speed IO interfaces for imaging and other types of sensors
a chip-to-chip multilink to pair multiple devices together

The SoC integrates:

ARM Cortex microcontroller with 128KB of memory
various peripherals
8 DSP clusters
- Each DSP cluster has 2 DSPs, 4-way 16KB instruction caches, 64KB local RAMs and a 64KB shared RAM
Image & DCNN co-processor subsystem
- 8 convolution accelerators

2 accelerator subsystem

a configurable fully connected switch to/from different kinds of sources/sinks
various types of accelerators
Various kernel sizes (up to 12×12), batch sizes (up to 16), and parallel kernels (up to 4) can be handled by a single CA instance, but any size kernel can be accommodated with the accumulator input.
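The remark about the accumulator input can be illustrated with a small sketch. The idea shown here is purely mathematical: a kernel larger than 12×12 is split into tiles of at most 12×12, each tile is convolved separately, and the partial results are summed. The function names, the tiling order, and the use of plain Python lists are assumptions for illustration; the notes do not describe how the chip actually schedules these passes.

```python
from typing import List

Matrix = List[List[float]]

MAX_K = 12  # largest kernel side a single CA pass handles, per the notes


def conv2d_valid(image: Matrix, kernel: Matrix) -> Matrix:
    """Plain 'valid' 2-D correlation (no padding, stride 1)."""
    kh, kw = len(kernel), len(kernel[0])
    oh = len(image) - kh + 1
    ow = len(image[0]) - kw + 1
    return [
        [
            sum(image[y + i][x + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for x in range(ow)
        ]
        for y in range(oh)
    ]


def conv2d_tiled(image: Matrix, kernel: Matrix, max_k: int = MAX_K) -> Matrix:
    """Same result, computed as a sum of passes over tiles of at most max_k x max_k."""
    kh, kw = len(kernel), len(kernel[0])
    oh = len(image) - kh + 1
    ow = len(image[0]) - kw + 1
    # Running sum of partial results (stands in for the accumulator input).
    acc = [[0.0] * ow for _ in range(oh)]
    for ty in range(0, kh, max_k):
        for tx in range(0, kw, max_k):
            tile = [row[tx:tx + max_k] for row in kernel[ty:ty + max_k]]
            th, tw = len(tile), len(tile[0])
            # Crop the image so this tile's outputs line up with the full kernel's outputs.
            sub = [row[tx:tx + ow + tw - 1] for row in image[ty:ty + oh + th - 1]]
            part = conv2d_valid(sub, tile)
            for y in range(oh):
                for x in range(ow):
                    acc[y][x] += part[y][x]
    return acc
```

For a 14×14 kernel this sketch performs four passes (tiles of 12×12, 12×2, 2×12 and 2×2), and the accumulated output matches the direct 14×14 result.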

3 convolutional accelerator

a line buffer to fetch up to 12 feature map data words in parallel with a single memory access
Maximum kernel size: 12×12, but the kernel buffer has 36 read ports
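As a rough illustration of what a line buffer does, the sketch below keeps the most recent rows of a streamed feature map and returns one word from every buffered row in a single read. Treating the "12 words in parallel" as a vertical column across 12 stored lines is an assumption made for this example, as are the class and method names; the notes do not say how the buffer on the chip is organised.

```python
from collections import deque
from typing import Deque, Iterable, Iterator, List

Row = List[int]


class LineBuffer:
    """Keeps the most recent `depth` rows of a streamed feature map (hypothetical model)."""

    def __init__(self, depth: int = 12) -> None:
        self.rows: Deque[Row] = deque(maxlen=depth)

    def push_row(self, row: Row) -> None:
        # Once the buffer is full, appending drops the oldest row automatically.
        self.rows.append(list(row))

    def read_column(self, x: int) -> List[int]:
        """One 'access': the word at column x from every buffered row."""
        return [row[x] for row in self.rows]


def windows(feature_map: Iterable[Row], k: int) -> Iterator[List[Row]]:
    """Yield every k x k window in raster order, built from column reads."""
    buf = LineBuffer(depth=k)
    for row in feature_map:
        buf.push_row(row)
        if len(buf.rows) < k:
            continue  # not enough rows buffered yet
        width = len(row)
        cols = [buf.read_column(x) for x in range(width)]
        for x0 in range(width - k + 1):
            # Transpose k adjacent columns back into k rows of k words.
            yield [[cols[x0 + j][i] for j in range(k)] for i in range(k)]
```

With `depth=12`, each `read_column` call returns at most 12 words, which is the property the example is meant to show: a full window column is available without re-reading earlier rows from memory.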

4 DSP cluster

32-bit DSPs
The DSPs are tasked with max or average pooling, nonlinear activation, cross-channel response normalization and classification.
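The operations listed above are standard DCNN layers, and a minimal reference sketch in Python may help make them concrete. Nothing here reflects the DSP firmware: the 2×2 pooling window, the ReLU activation, the softmax-plus-argmax classifier, and the AlexNet-style normalization constants (`n=5`, `k=2`, `alpha=1e-4`, `beta=0.75`) are all assumptions chosen for illustration.

```python
import math
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def relu(x: Vector) -> Vector:
    """Nonlinear activation: clamp negatives to zero."""
    return [max(0.0, v) for v in x]


def max_pool_2x2(fm: Matrix) -> Matrix:
    """Non-overlapping 2x2 max pooling; an odd trailing row or column is dropped."""
    return [
        [max(fm[y][x], fm[y][x + 1], fm[y + 1][x], fm[y + 1][x + 1])
         for x in range(0, len(fm[0]) - 1, 2)]
        for y in range(0, len(fm) - 1, 2)
    ]


def avg_pool_2x2(fm: Matrix) -> Matrix:
    """Non-overlapping 2x2 average pooling; an odd trailing row or column is dropped."""
    return [
        [(fm[y][x] + fm[y][x + 1] + fm[y + 1][x] + fm[y + 1][x + 1]) / 4.0
         for x in range(0, len(fm[0]) - 1, 2)]
        for y in range(0, len(fm) - 1, 2)
    ]


def cross_channel_lrn(a: Vector, n: int = 5, k: float = 2.0,
                      alpha: float = 1e-4, beta: float = 0.75) -> Vector:
    """AlexNet-style response normalization across channels at one pixel position."""
    c = len(a)
    out = []
    for i in range(c):
        lo, hi = max(0, i - n // 2), min(c - 1, i + n // 2)
        s = sum(a[j] ** 2 for j in range(lo, hi + 1))
        out.append(a[i] / (k + alpha * s) ** beta)
    return out


def softmax(z: Vector) -> Vector:
    """Numerically stable softmax over class scores."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    total = sum(e)
    return [v / total for v in e]


def classify(scores: Vector) -> int:
    """Index of the highest-scoring class."""
    return max(range(len(scores)), key=scores.__getitem__)
```

A typical use would chain these per layer, for example `max_pool_2x2` on each feature map after `relu`, then `classify(softmax(scores))` on the final layer's output.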

Reposted from blog.csdn.net/tiaozhanzhe1900/article/details/83213914
