AI/etc 48

Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge

https://arxiv.org/abs/2305.18842 Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge The open-ended Visual Question Answering (VQA) task requires AI models to jointly reason over visual and natural language inputs using world knowledge. Recently, pre-trained Language Models (PLM) such as GPT-3 have been applied to the task and shown to be arxiv.org 1. Open-ended visual question answ..

AI/etc 2023.05.31

Inferring the Future by Imagining the Past

https://arxiv.org/abs/2305.17195 Inferring the Future by Imagining the Past A single panel of a comic book can say a lot: it shows not only where characters currently are, but also where they came from, what their motivations are, and what might happen next. More generally, humans can often infer a complex sequence of past and fut arxiv.org 1. A single comic book panel can convey a lot of information: it shows where the characters currently are ..

AI/etc 2023.05.30

Partially Personalized Federated Learning: Breaking the Curse of Data Heterogeneity

https://arxiv.org/abs/2305.18285 Partially Personalized Federated Learning: Breaking the Curse of Data Heterogeneity We present a partially personalized formulation of Federated Learning (FL) that strikes a balance between the flexibility of personalization and cooperativeness of global training. In our framework, we split the variables into global parameters, which are arxiv.org 1. We ... the flexibility of personalization..

AI/etc 2023.05.30

How Does Naver View AGI?

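https://themiilk.com/articles/a6946e73d?u=15ed0645&t=abef9db15&from&fbclid=IwAR1I6NThOH6Dnb3hrL-ZG_3qbFfC8QEs2dgzaHmwGhNSBkkXIEetHED7_lQ Will We Cede 'Digital Territory' Yet Again to the US Big Tech Sweeping AI? - The Miilk [Interview with Ha Jung-woo, head of the Naver Cloud AI Innovation center] ● Naver plans to build sovereign AI services and infrastructure for global markets including Japan, Southeast Asia, and the Middle East. ● Preventing dependence on existing Big Tech services such as ChatGPT themiilk.com Ha says that if AI keeps advancing, AGI is possible in theory, but that the answer may depend on how 'intelligence' is defined. Also ..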

AI/etc 2023.05.30

Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors

https://arxiv.org/abs/2305.18274 github: https://medarc-ai.github.io/mindeye/ Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors We present MindEye, a novel fMRI-to-image approach to retrieve and reconstruct viewed images from brain activity. Our model comprises two parallel submodules that are specialized for retrieval (using contrastive learning) and re..

AI/etc 2023.05.30

Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory

abs: https://arxiv.org/abs/2305.17144 github: https://github.com/OpenGVLab/GITM Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge The captivating realm of Minecraft has attracted substantial research interest in recent years, serving as a rich platform for developing intelligent agents capable of functioning in open-wo..

AI/etc 2023.05.30