(Sigcomm'19)Verifying Deep-RL-Driven Systems

其他 2019-10-26 22:20:47 阅读次数: 0

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接： https://blog.csdn.net/Hesy_H/article/details/101314550

@Huji

文章目录

abstract
introduction
inspiration

abstract

深度学习难以解释，但是我们可以验证深度RL驱动的系统是否符合期望的，设计师指定的行为。为此，我们启动了深度RL形式验证的研究，并提出了Verily，这是一种基于深度RL的系统验证系统，它利用了深度神经网络验证的最新进展。我们聘请Verily来验证最近引入的深度RL驱动的系统，以用于自适应视频流，云资源管理和Internet拥塞控制。
**我们的结果揭示了在深度RL驱动的决策中产生不良行为的场景。**我们讨论了构建更安全，更易于验证的深度RL驱动系统的准则。

introduction

我们在三个深度RL驱动的系统上对Verily进行评估：Pensieve自适应视频流方案[24]，用于云资源管理的DeepRM调度程序[23]和Custard Internet拥塞控制器[12]。
- 我们为这些系统中的每一个制定自然要求，并通过Verily确定是否始终满足这些要求，如果不是，则生成反例。我们的初步评估结果暴露了测试系统中的几个问题.

inspiration

Custeard使用了的input:
- 对过去网络状况的观察，包括吞吐量，丢失率和时延变化；
- 以前的发送费率；
- 先前的奖励【???】

猜你喜欢

转载自blog.csdn.net/Hesy_H/article/details/101314550

(Sigcomm'19)Verifying Deep-RL-Driven Systems

Deep Learning for Recommender Systems资料

Paper Reading:Wide & Deep Learning for Recommender Systems

论文阅读：《Wide & Deep Learning for Recommender Systems》

论文阅读: Wide & Deep Learning for Recommender Systems

Wide & Deep Learning for Recommender Systems 模型实践

Wide & Deep Learning for Recommender Systems 翻译

《Wide & Deep Learning for Recommender Systems》论文总结

《Wide and Deep Learning for Recommender Systems》学习笔记

Model-Reuse Attacks on Deep Learning Systems

Wide & Deep Learning for Recommender Systems【论文记录】

Recommender Systems Based on Generative Adversarial Networks: A Problem-Driven Perspective

《Wide and deep learning in Recommender Systems》论文阅读笔记

论文笔记 - Wide & Deep Learning for Recommender Systems

推荐系统——A Hybrid Collaborative Filtering Model with Deep Structure for Recommender Systems

（转）Understanding Memory in Deep Learning Systems: The Neuroscience, Psychology and Technology Perspectives

论文笔记之 Collaborative Deep Learning for Recommender Systems

Wide & Deep Learning for Recommender Systems 论文阅读总结

[论文解读] DeepMutation: Mutation Testing of Deep Learning Systems

推荐系统综述：A review on deep learning for recommender systems: challenges and remedies

论文笔记：Deep Matrix Factorization Models for Recommender Systems

《Deep Matrix Factorization Models for Recommender Systems》DMF模型及python代码

Systems biology informed deep learning for inferring parameters and hidden dynamics

Distributed Systems

Expert systems

Type Systems

ECS：Systems

#Reading Paper# 【序列推荐综述】IJCAI‘19:Sequential Recommender Systems: Challenges, Progress and Prospects

论文笔记-Loop Closure Detection for Visual SLAM Systems Using Deep Neural Networks

『论文阅读』A Multi-View Deep Learning Approach for Cross Domain User Modeling in Recommendation Systems

今日推荐

基于大语言模型的开源知识库问答系统 MaxKB GitHub Star 数量突破 5,000 个！

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

《2024 年一季度互联网投融资运行情况》研究报告

报告：Django 仍然是 74% 开发者的首选

15 年前上了“FFmpeg 耻辱柱”，今天他还得谢谢咱——腾讯QQPlayer一雪前耻？

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

周排行

记一下去大梅沙的准备（2018-05-26）

Spring 注解事务

基于HTTP协议的客户端缓存

阿里云rds 备份和还原

[PHP] 几个拖慢 PHP 程序/API 运行速度的点

python 代码风格------------PEP8规则

js控制json生成菜单——自制菜单（一）

将字符串: 'k:1|k1:2|k2:3|k3:4 ' ,处理成 python 字典: {'k':1, 'k1':2, ...}

微信小程序转支付宝小程序

Qt551.窗口滚动条

每日归档

更多

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)