Our work is out on "deployment" efficient RL algorithm based on model-based offline method! done in collaboration with @frt03_ @ymatsuo @ofirnachum and @shaneguML Tweet added by Tatsuya Matsushima @ICRA2024 🍣 @__tmats__

Tatsuya Matsushima @ICRA2024 🍣

4 years

Our work is out on "deployment" efficient RL algorithm based on model-based offline method! done in collaboration with @frt03_ @ymatsuo @ofirnachum and @shaneguML

Shane Gu

@shaneguML

4 years

Can we solve Gym tasks with only 5-10 "trials"? Yes we can. We propose deployment efficiency as a new metric for RL, counting # of distinct data collection policies in learning. Most prior methods use 100s-1Ms. Joint w/ @Matsuo_Lab @ofirnachum 1/

290

Replies

Tatsuya Matsushima @ICRA2024 🍣

@__tmats__

4 years

@frt03_ @ymatsuo @ofirnachum @shaneguML この研究に関連して，オフラインモデルベース強化学習に関して，明日の人工知能学会全国大会「世界モデルと知能」セッションにおいて発表しますおそらく日本では初めての「世界モデル」を中心トピックとしたセッションですので，是非ご参加ください！

JSAI2020　OS-18　世界モデルと知能

■ 概要

sites.google.com