더 크게, 더 좋게, 더 빠르게: 인간 수준의 효율성을 갖춘 인간 수준의 Atari

AI/Google&DeepMind

더 크게, 더 좋게, 더 빠르게: 인간 수준의 효율성을 갖춘 인간 수준의 Atari

유로파물고기 2023. 6. 1. 13:23

https://arxiv.org/abs/2305.19452

Bigger, Better, Faster: Human-level Atari with human-level efficiency

We introduce a value-based RL agent, which we call BBF, that achieves super-human performance in the Atari 100K benchmark. BBF relies on scaling the neural networks used for value estimation, as well as a number of other design choices that enable this sca

arxiv.org

https://github.com/google-research/google-research/tree/master/bigger_better_faster

GitHub - google-research/google-research: Google Research

Google Research. Contribute to google-research/google-research development by creating an account on GitHub.

github.com

Atari 100K 벤치마크에서 초인적인 성능을 달성하는 BBF라고 하는 가치 기반 RL 에이전트를 소개합니다. BBF는 값 추정에 사용되는 신경망 확장과 샘플 효율적인 방식으로 이러한 확장을 가능하게 하는 여러 가지 다른 디자인 선택에 의존합니다. 우리는 이러한 디자인 선택에 대한 광범위한 분석을 수행하고 향후 작업에 대한 통찰력을 제공합니다. ALE에서 샘플 효율적인 RL 연구를 위해 골대 업데이트에 대한 논의로 끝납니다. 우리는 코드와 데이터를 이 https URL 에서 공개적으로 사용할 수 있도록 합니다 .

'AI > Google&DeepMind' 카테고리의 다른 글

SQL-PaLM: Text-to-SQL을 위한 개선된 대규모 언어 모델 적응 (0)	2023.06.02
브레인포머: 효율성을 위한 거래 단순성 (0)	2023.06.02
PaLI-X: 다국어 비전 및 언어 모델 확장 (0)	2023.05.31
온라인 비확률적 모델 없는 강화학습 (0)	2023.05.30
Q-러닝에 대한 보다 효율적인 대안인 VA-러닝 (0)	2023.05.30

현재글더 크게, 더 좋게, 더 빠르게: 인간 수준의 효율성을 갖춘 인간 수준의 Atari

Foundation Models for Robotics,

Today :
Yesterday :

일	월	화	수	목	금	토
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31

SUI