[转]Deep Reinforcement Learning Based Trading Application at JP Morgan Chase

其他 2018-07-22 11:45:16 阅读次数: 0

Deep Reinforcement Learning Based Trading Application at JP Morgan Chase

https://medium.com/@ranko.mosic/reinforcement-learning-based-trading-application-at-jp-morgan-chase-f829b8ec54f2

FT released a story today about the new application that will optimize JP Morgan Chase trade execution ( Business Insider article on the same topic for readers that do not have FT subscription ). The intent is to reduce market impact and provide best trade execution results for large orders.

It is a complex application with many moving parts:

Its core is an RL algorithm that learns to perform the best action ( choose optimal price, duration and order size ) based on market conditions. It is not clear if it is Sarsa ( On-Policy TD Control) or Q-learning (Off-Policy Temporal Difference Control Algorithm ) as both algorithms are present in JP Morgan slides:

Sarsa

Q-learning

State consists of price series, expected spread cost, fill probability, size placed, as well as elapsed time, %progress, etc. Rewards are immediate rewards ( price spread ) and terminal ( end of episode ) rewards like completion, order duration and market penalties ( obviously those are negative rewards that punish the agent along these dimensions ).

Actions are memorized as weights of a Deep Neural Network — function approximation via NN is used since state, action space is too big to be handled in tabular form. We assume stochastic gradient descent is used for both feed forward and backprop operation operation ( hence Deep designation ):

JP Morgan is convinced this is the very first real time trading AI/ML application on Wall Street. We are assuming this is not true i.e. there are surely other players operating in this space as RL implementation to order execution is known for quite a while now ( Kearns and Nevmyvaka 2006 ).

The latest LOXM developments will be presented at QuantMinds Conference in Lisbon (May of 2018).

Instinet is also using Q-learning, probably for the same purpose ( market impact reduction ).

猜你喜欢

转载自www.cnblogs.com/freebird92/p/9349492.html

[转]Deep Reinforcement Learning Based Trading Application at JP Morgan Chase

Deep Reinforcement Learning for AutomatedStock Trading: An Ensemble Strategy

Deep Direct Reinforcement Learning for Financial Signal Representation and Trading

Deep Reinforcement Learning for Automated Stock Trading An Ensemble Strategy

QUANT[22]论文2:Deep Direct Reinforcement Learning for Financial Signal Representation and Trading

Policy-based Reinforcement learning

Relational Deep Reinforcement Learning

022 Deep Reinforcement Learning

DeepFlow: Deep Learning-Based Malware Detection by Mining Android Application

Review on the Recent Welding Research with Application of CNN-Based Deep Learning

读书笔记5：Deep Progressive Reinforcement Learning for Skeleton-based Action Recognition

CAPES:Unsupervised Storage Performance Tuning Using Neural Network-Based Deep Reinforcement Learning

《2018-Deep Progressive Reinforcement Learning for Skeleton-based Action Recognition》

读论文 DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills

【5分钟 Paper】Reinforcement Learning with Deep Energy-Based Policies

论文翻译：Deep Progressive Reinforcement Learning for Skeleton-based Action Recognition

Albert-Z-Guo/Deep-Reinforcement-Stock-Trading

Value-Based Reinforcement Learning-DQN

An Application of Reinforcement Learning to Aerovbatic Helicopter Flight

Deep Reinforcement Learning is a waste of time

Random Thoughts on Deep Reinforcement Learning

# Asynchronous Methods for Deep Reinforcement Learning

Asynchronous Methods for Deep Reinforcement Learning

Deep Reinforcement Learning with Double Q-learning

DeepFlow: Deep Learning-Based Malware Detection by Mining Android Application for Abnormal Usage 2

[转]Introduction to Learning to Trade with Reinforcement Learning

论文阅读:Reinforcement Learning Based Dynamic Resource Migration for Virtual Networks

《Reinforcement learning based parameters adaption method for particleswarm optimization》代码复现

Deep Reinforcement Learning: Pong from Pixels

Deep Reinforcement Learning 深度增强学习资源

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

Java自定义时间格式

同步整形电路

在开发中最最最常用的字符串的属性大集合

Linux 查看端口占用并杀掉

Java基础四：ArrayList

多线程之死锁就是这么简单

mysql 基础命令集

awk 命令详解

Centos6.3编译安装nginx+php步骤

OCR （Optical Character Recognition，光学字符识别）

每日归档

更多

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)