Many real-world problems (e.g., resource management, autonomous driving, drug discovery) require optimizing multiple conflicting objectives. Multi-objective reinforcement learning (MORL) extends classic reinforcement learning to handle multiple objectives simultaneously, yielding a set of policies that capture different trade-offs. However, the MORL field lacks complex, realistic environments and benchmarks. We introduce a case study on water resource management in the Nile river basin and model it as a MORL environment. We then benchmark existing MORL algorithms on this task. Our results show that specialized water management methods outperform state-of-the-art MORL approaches, underscoring the scalability challenges MORL algorithms face in real-world scenarios.