【二】最新多智能体强化学习文章如何查阅{顶会:AAAI、 ICML }

2023-12-29 20:32

本文主要是介绍【二】最新多智能体强化学习文章如何查阅{顶会:AAAI、 ICML },希望对大家解决编程问题提供一定的参考价值,需要的开发者们随着小编来一起学习吧!

相关文章:

【一】最新多智能体强化学习方法【总结】

【二】最新多智能体强化学习文章如何查阅{顶会:AAAI、 ICML }

【三】多智能体强化学习(MARL)近年研究概览 {Analysis of emergent behaviors(行为分析)_、Learning communication(通信学习)}

【四】多智能体强化学习(MARL)近年研究概览 {Learning cooperation(协作学习)、Agents modeling agents(智能体建模)}

1.中国计算机学会(CCF)推荐国际学术会议和期刊目录

CCF官方网站

CCF推荐国际学术会议(参考链接:链接点击查阅具体分类)

类别如下计算机系统与高性能计算,计算机网络,网络与信息安全,软件工程,系统软件与程序设计语言,数据库、数据挖掘与内容检索,计算机科学理论,计算机图形学与多媒体,人工智能与模式识别,人机交互与普适计算,前沿、交叉与综合

2021 ICML 多智能体强化学习论文整理汇总

类别名称数量
投稿量5513​
接收量1184
强化学习方向文章163
其中多智能体强化学习文章15

ICML地位:

1.1 中国计算机学会推荐国际学术会议
(人工智能与模式识别)

1.1.1 A类

序号

会议简称

会议全称

出版社

网址

1

AAAI

AAAI Conference on Artificial Intelligence

AAAI

http://www.aaai.org

2

CVPR

IEEE Conference on Computer Vision and 
Pattern Recognition

IEEE

http://www.pamitc.org/cvpr13/

3

ICCV

International Conference on Computer
Vision

IEEE

http://www.iccv2013.org/

4

ICML

International Conference on Machine 
Learning

ACM

http://icml.cc/2013/

5

IJCAI

International Joint Conference on Artificial
Intelligence

Morgan Kaufmann

http://www.ijcai.org

1.1.2 B类

序号

会议简称

会议全称

出版社

网址

1

COLT

Annual Conference on Computational
Learning Theory

Springer

http://orfe.princeton.edu/conferences/colt2013/

2

NIPS

Annual Conference on Neural Information
Processing Systems

MIT Press

http://www.nips.cc

1.1.3 B、C类更多见附录

2.推荐深度强化学习实验室及链接

2.1 arXiv

arXiv是一个免费的分发服务和开放存取的档案,收录了物理、数学、计算机科学、定量生物学、定量金融、统计学、电气工程和系统科学以及经济学等领域的1,917,177篇学术文章。本网站上的材料没有经过arXiv的同行评审。

链接:https://arxiv.org/

 2.2 深度强化学习实验室

DeepRL——github:https://github.com/neurondance

微信公众号:Deep-RL

官网:http://www.neurondance.com/

论坛http://deeprl.neurondance.com/

2.3 AI 会议Deadlines

: https://aideadlin.es

2.4 ICML官网:

https://icml.cc/

3.最新多智能体强化学习方向论文

3.1 ICML  International Conference on Machine Learning

[1]. Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

作者: Shariq Iqbal (University of Southern California) · Christian Schroeder (University of Oxford) · Bei Peng (University of Oxford) · Wendelin Boehmer (Delft University of Technology) · Shimon Whiteson (University of Oxford) · Fei Sha (Google Research)

[2]. UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

作者: Tarun Gupta (University of Oxford) · Anuj Mahajan (Dept. of Computer Science, University of Oxford) · Bei Peng (University of Oxford) · Wendelin Boehmer (Delft University of Technology) · Shimon Whiteson (University of Oxford)

[3]. Emergent Social Learning via Multi-agent Reinforcement Learning

作者: Kamal Ndousse (OpenAI) · Douglas Eck (Google Brain) · Sergey Levine (UC Berkeley) · Natasha Jaques (Google Brain, UC Berkeley)

[4]. DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning

作者: Wei-Fang Sun (National Tsing Hua University) · Cheng-Kuang Lee (NVIDIA Corporation) · Chun-Yi Lee (National Tsing Hua University)

[5]. Cooperative Exploration for Multi-Agent Deep Reinforcement Learning

作者: Iou-Jen Liu (University of Illinois at Urbana-Champaign) · Unnat Jain (UIUC) · Raymond Yeh (University of Illinois at Urbana–Champaign) · Alexander Schwing (UIUC)

[6]. Large-Scale Multi-Agent Deep FBSDEs

作者: Tianrong Chen (Georgia Institute of Technology) · Ziyi Wang (Georgia Institute of Technology) · Ioannis Exarchos (Stanford University) · Evangelos Theodorou (Georgia Tech)

[7]. Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning

作者: Anuj Mahajan (Dept. of Computer Science, University of Oxford) · Mikayel Samvelyan (University College London) · Lei Mao (NVIDIA) · Viktor Makoviychuk (NVIDIA) · Animesh Garg (University of Toronto, Vector Institute, Nvidia) · Jean Kossaifi (NVIDIA) · Shimon Whiteson (University of Oxford) · Yuke Zhu (University of Texas - Austin) · Anima Anandkumar (Caltech and NVIDIA)

[8]. Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing

作者: Filippos Christianos (University of Edinburgh) · Georgios Papoudakis (The University of Edinburgh) · Muhammad Arrasy Rahman (The University of Edinburgh) · Stefano Albrecht (University of Edinburgh)

[9]. Parallel Droplet Control in MEDA Biochips using Multi-Agent Reinforcement Learning

作者: Tung-Che Liang (Duke University) · Jin Zhou (Duke University) · Yun-Sheng Chan (National Chiao Tung University) · Tsung-Yi Ho (National Tsing Hua University) · Krishnendu Chakrabarty (Duke University) · Cy Lee (National Chiao Tung University)

[10]. A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

作者: Dong Ki Kim (MIT) · Miao Liu (IBM) · Matthew Riemer (IBM Research) · Chuangchuang Sun (MIT) · Marwa Abdulhai (MIT) · Golnaz Habibi (MIT) · Sebastian Lopez-Cot (MIT) · Gerald Tesauro (IBM Research) · Jonathan How (MIT)

[11]. Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot

作者: Joel Z Leibo (DeepMind) · Edgar Duenez-Guzman (DeepMind) · Alexander Vezhnevets (DeepMind) · John Agapiou (DeepMind) · Peter Sunehag () · Raphael Koster (DeepMind) · Jayd Matyas (DeepMind) · Charles Beattie (DeepMind Technologies Limited) · Igor Mordatch (Google Brain) · Thore Graepel (DeepMind)

[12]. Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers

作者: Luke Marris (DeepMind) · Paul Muller (DeepMind) · Marc Lanctot (DeepMind) · Karl Tuyls (DeepMind) · Thore Graepel (DeepMind)

[13]. Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition

作者: Bo Liu (University of Texas, Austin) · Qiang Liu (UT Austin) · Peter Stone (University of Texas at Austin) · Animesh Garg (University of Toronto, Vector Institute, Nvidia) · Yuke Zhu (University of Texas - Austin) · Anima Anandkumar (California Institute of Technology)

[14]. Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning

作者: Matthieu Zimmer (Shanghai Jiao Tong University) · Claire Glanois (Shanghai Jiao Tong University) · Umer Siddique (Shanghai Jiao Tong University) · Paul Weng (Shanghai Jiao Tong University)

[15]. FOP: Factorizing Optimal Joint Policy of Maximum-Entropy Multi-Agent Reinforcement Learning

作者: Tianhao Zhang (Peking University) · yueheng li (Peking university) · Chen Wang (Peking University) · Zongqing Lu (Peking University) · Guangming Xie (1. State Key Laboratory for Turbulence and Complex Systems, College of Engineering, Peking University; 2. Institute of Ocean Research, Peking University)

3.2 AAAI Conference on Artificial Intelligence

会议时间节点

  • August 15 – August 30, 2020: Authors register on the AAAI web site
  • September 1, 2020: Electronic abstracts due at 11:59 PM UTC-12 (anywhere on earth)
  • September 9, 2020: Electronic papers due at 11:59 PM UTC-12 (anywhere on earth)
  • September 29, 2020: Abstracts AND full papers due for revisions of rejected NeurIPS/EMNLP submissions by 11:59 PM UTC-12 (anywhere on earth)
  • AAAI-21 Reviewing Process: Two-Phase Reviewing and NeurIPS/EMNLP Fast Track Submissions
  • November 3-5, 2020: Author Feedback Window (anywhere on earth)
  • December 1, 2020: Notification of acceptance or rejection

具体论文见链接:http://deeprl.neurondance.com/d/191-82aaai2021

接收论文列表(共84篇)

4.附录

4.1 B类

序号

会议简称

会议全称

出版社

网址

1

COLT

Annual Conference on Computational
Learning Theory

Springer

http://orfe.princeton.edu/conferences/colt2013/

2

NIPS

Annual Conference on Neural Information
Processing Systems

MIT Press

http://www.nips.cc

3

ACL

Annual Meeting of the Association for 
Computational Linguistics

ACL

http://acl2013.org/site/index.html

4

EMNLP

Conference on Empirical Methods in Natural
Language Processing

ACL

http://www.sigdat.org/

5

ECAI

European Conference on Artificial 
Intelligence

IOS Press

http://www.ecai2013.upit.ro/?i=2542

6

ECCV

European Conference on Computer Vision

Springer

http://eccv2012.unifi.it/

7

ICRA

IEEE International Conference on Robotics
and Automation

IEEE

http://www.icra2013.org/

8

ICAPS

International Conference on Automated
Planning and Scheduling

AAAI

http://www.icaps-conference.org/

9

ICCBR

International Conference on Case-Based
Reasoning

Springer

http://www.iccbr.org/

10

COLING

International Conference on Computational
Linguistics

ACM

 http://www.coling2012-iitb.org/

11

KR

International Conference on Principles of
Knowledge Representation and Reasoning

Morgan Kaufmann

http://www.kr.org/

12

UAI

International Conference on Uncertainty
in Artificial Intelligence

AUAI

http://auai.org/

13

AAMAS

International Joint Conference
on Autonomous Agents and Multi-agent
Systems

Springer

http://www.aamas-conference.org/

4.2 C类

序号

会议简称

会议全称

出版社

网址

1

ACCV

Asian Conference on Computer Vision

Springer

http://www.accv2012.org/

2

CoNLL

Conference on Natural Language Learning

CoNLL

http://www.clips.ua.ac.be/conll/

3

GECCO

Genetic and Evolutionary Computation
Conference

ACM

http://www.sigevo.org/gecco-2013/

4

ICTAI

IEEE International Conference on Tools with
Artificial Intelligence

IEEE

http://ictai12.unipi.gr/

5

ALT

International Conference on Algorithmic
Learning Theory

Springer

http://www-alg.ist.hokudai.ac.jp/~thomas/ALT13/

6

ICANN

International Conference on Artificial Neural
Networks

Springer

https://www.waset.org/conferences/2013/
amsterdam/icann/

7

FGR

International Conference on Automatic Face
and Gesture Recognition

IEEE

http://fg2013.cse.sc.edu/

8

ICDAR

International Conference on Document
Analysis and Recognition

IEEE

http://www.icdar2013.org/

9

ILP

International Conference on Inductive Logic
Programming

Springer

http://ilp13.cos.ufrj.br/

10

KSEM

International conference on Knowledge
Science,Engineering and Management

Springer

http://ksem.dlut.edu.cn/

11

ICONIP

International Conference on Neural 
Information Processing

Springer

http://iconip2013.org/

12

ICPR

International Conference on Pattern 
Recognition

IEEE

http://www.icpr2014.org/

13

ICB

International Joint Conference on Biometrics

IEEE

http://atvs.ii.uam.es/icb2013/

14

IJCNN

International Joint Conference on Neural
Networks

IEEE

http://www.ijcnn2013.org/

15

PRICAI

Pacific Rim International Conference on 
Artificial Intelligence

Springer

http://ktw.mimos.my/pricai2012/

16

NAACL

The Annual Conference of the North
American Chapter of the Association 
for Computational Linguistics

NAACL

http://naacl2013.naacl.org/

17

BMVC

British Machine Vision Conference

British Machine
Vision 
Association

http://bmvc2013.bristol.ac.uk/

这篇关于【二】最新多智能体强化学习文章如何查阅{顶会:AAAI、 ICML }的文章就介绍到这儿,希望我们推荐的文章对编程师们有所帮助!



http://www.chinasem.cn/article/550638

相关文章

PyCharm 接入 DeepSeek最新完整教程

《PyCharm接入DeepSeek最新完整教程》文章介绍了DeepSeek-V3模型的性能提升以及如何在PyCharm中接入和使用DeepSeek进行代码开发,本文通过图文并茂的形式给大家介绍的... 目录DeepSeek-V3效果演示创建API Key在PyCharm中下载Continue插件配置Con

Java深度学习库DJL实现Python的NumPy方式

《Java深度学习库DJL实现Python的NumPy方式》本文介绍了DJL库的背景和基本功能,包括NDArray的创建、数学运算、数据获取和设置等,同时,还展示了如何使用NDArray进行数据预处理... 目录1 NDArray 的背景介绍1.1 架构2 JavaDJL使用2.1 安装DJL2.2 基本操

MySQL 缓存机制与架构解析(最新推荐)

《MySQL缓存机制与架构解析(最新推荐)》本文详细介绍了MySQL的缓存机制和整体架构,包括一级缓存(InnoDBBufferPool)和二级缓存(QueryCache),文章还探讨了SQL... 目录一、mysql缓存机制概述二、MySQL整体架构三、SQL查询执行全流程四、MySQL 8.0为何移除查

MySql9.1.0安装详细教程(最新推荐)

《MySql9.1.0安装详细教程(最新推荐)》MySQL是一个流行的关系型数据库管理系统,支持多线程和多种数据库连接途径,能够处理上千万条记录的大型数据库,本文介绍MySql9.1.0安装详细教程,... 目录mysql介绍:一、下载 Mysql 安装文件二、Mysql 安装教程三、环境配置1.右击此电脑

在 Windows 上安装 DeepSeek 的完整指南(最新推荐)

《在Windows上安装DeepSeek的完整指南(最新推荐)》在Windows上安装DeepSeek的完整指南,包括下载和安装Ollama、下载DeepSeekRXNUMX模型、运行Deep... 目录在www.chinasem.cn Windows 上安装 DeepSeek 的完整指南步骤 1:下载并安装

深入理解Apache Airflow 调度器(最新推荐)

《深入理解ApacheAirflow调度器(最新推荐)》ApacheAirflow调度器是数据管道管理系统的关键组件,负责编排dag中任务的执行,通过理解调度器的角色和工作方式,正确配置调度器,并... 目录什么是Airflow 调度器?Airflow 调度器工作机制配置Airflow调度器调优及优化建议最

Spring Boot统一异常拦截实践指南(最新推荐)

《SpringBoot统一异常拦截实践指南(最新推荐)》本文介绍了SpringBoot中统一异常处理的重要性及实现方案,包括使用`@ControllerAdvice`和`@ExceptionHand... 目录Spring Boot统一异常拦截实践指南一、为什么需要统一异常处理二、核心实现方案1. 基础组件

Golang的CSP模型简介(最新推荐)

《Golang的CSP模型简介(最新推荐)》Golang采用了CSP(CommunicatingSequentialProcesses,通信顺序进程)并发模型,通过goroutine和channe... 目录前言一、介绍1. 什么是 CSP 模型2. Goroutine3. Channel4. Channe

Python基于火山引擎豆包大模型搭建QQ机器人详细教程(2024年最新)

《Python基于火山引擎豆包大模型搭建QQ机器人详细教程(2024年最新)》:本文主要介绍Python基于火山引擎豆包大模型搭建QQ机器人详细的相关资料,包括开通模型、配置APIKEY鉴权和SD... 目录豆包大模型概述开通模型付费安装 SDK 环境配置 API KEY 鉴权Ark 模型接口Prompt

Spring Boot 中整合 MyBatis-Plus详细步骤(最新推荐)

《SpringBoot中整合MyBatis-Plus详细步骤(最新推荐)》本文详细介绍了如何在SpringBoot项目中整合MyBatis-Plus,包括整合步骤、基本CRUD操作、分页查询、批... 目录一、整合步骤1. 创建 Spring Boot 项目2. 配置项目依赖3. 配置数据源4. 创建实体类