분류 전체보기 110

대규모 언어모델로 반복 게임하기

https://arxiv.org/abs/2305.16867 Playing repeated games with Large Language Models Large Language Models (LLMs) are transforming society and permeating into diverse applications. As a result, LLMs will frequently interact with us and other agents. It is, therefore, of great societal value to understand how LLMs behave in interactive soci arxiv.org 1. 대규모 언어 모델(LLMs)이 사회를 변화시키고 다양한 응용 분야에 퍼지고 있습니..

AI/etc 2023.05.29

Voyager: 대규모 언어 모델을 사용하는 개방형 구현 에이전트

설명 -> https://twitter.com/DrJimFan/status/1662115266933972993?s=20 트위터에서 즐기는 Jim Fan “What if we set GPT-4 free in Minecraft? ⛏️ I’m excited to announce Voyager, the first lifelong learning agent that plays Minecraft purely in-context. Voyager continuously improves itself by writing, refining, committing, and retrieving *code* from twitter.com 1. 'Voyager'는 Minecraft에서 인간의 개입 없이 지속적으로 세계를 탐색하고, ..

AI/Nvidia 2023.05.29

시뮬레이션된 인간 사회에서 사회적으로 정렬된 언어 모델 교육

https://arxiv.org/abs/2305.16960 Training Socially Aligned Language Models in Simulated Human Society Social alignment in AI systems aims to ensure that these models behave according to established societal values. However, unlike humans, who derive consensus on value judgments through social interaction, current language models (LMs) are trained to rigidl arxiv.org 1. 인공지능 시스템에서의 사회적 조정은 이러한 모델..

AI/Google&DeepMind 2023.05.29

BiomGPT: 비전, 언어 및 멀티모달 작업을 위한 통합 및 제너럴리스트 Biomedic GPT

https://arxiv.org/abs/2305.17100 BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks In this paper, we introduce a unified and generalist Biomedical Generative Pre-trained Transformer (BiomedGPT) model, which leverages self-supervision on large and diverse datasets to accept multi-modal inputs and perform a range of downstr..

AI/etc 2023.05.29

도구 제작자로서의 대규모 언어 모델

설명 https://twitter.com/tianle_cai/status/1662988435114852352?s=20 트위터에서 즐기는 Tianle Cai “LLMs can make their own tools just like humans🤖! Thrilled to share my intern work @Google. We introduced a closed-loop framework to let LLMs make and utilize reusable new tools🛠️ (implemented as programs). Paper: https://t.co/cOk3VZ47ka More det twitter.com https://arxiv.org/abs/2305.17126 Large Language Mode..

AI/Google&DeepMind 2023.05.29

대규모 언어모델을 사용한 역할극

https://arxiv.org/abs/2305.16367 Role-Play with Large Language Models As dialogue agents become increasingly human-like in their performance, it is imperative that we develop effective ways to describe their behaviour in high-level terms without falling into the trap of anthropomorphism. In this paper, we foreground the conc arxiv.org 대화형 에이전트가 인간과 같은 행동을 점점 더 보여주면서, 인간의 특성을 과잉적으로 부여하는 인간화(anthr..

AI/Google&DeepMind 2023.05.29

행동하기 전에 생각하기: 내부 작업 기억을 가진 의사 결정 트랜스포머

https://arxiv.org/abs/2305.16338 Think Before You Act: Decision Transformers with Internal Working Memory Large language model (LLM)-based decision-making agents have shown the ability to generalize across multiple tasks. However, their performance relies on massive data and compute. We argue that this inefficiency stems from the forgetting phenomenon, in whic arxiv.org 1. 대형 언어 모델(LLM) 기반의 결정 생..

AI/etc 2023.05.29