Invited Talks

8월 12일 (화)
13:00 - 13:45	장형진 교수 (버밍엄대) When AI Sees People: Human-Centred Vision and Interaction (45분)
13:45 - 14:30	정다샘 교수 (서강대) Deep Learning for Diverse Music Data (45분)
15:00 - 15:45	최성준 교수 (고려대) Towards Human-Centered Robotics: Coexistence of Human and Robots (45분)
15:45 - 16:30	김선 교수 (서울대) Navigating Chemical Space with Deep Learning Technologies (45분)

8월 13일 (수)
13:45 - 14:30	이종민 교수 (연세대) RL Meets Convex Optimization (45분)
15:00 - 15:45	윤세영 교수 (KAIST) Training for Efficient LLM Inference (45분)
15:45 - 16:30	신원용 교수 (연세대) 초고속 추천용 그래프 필터링 방법 (45분)

장형진 교수
(버밍엄대)

Biography
2022-현재	Associate Professor @ University of Birmingham
2025-현재	Affiliated Researcher @ The Institute of Data and AI (IDAI)
2021-2023	Turing Fellow @ The Alan Turing Institute
2018-2022	Assistant Professor @ University of Birmingham
2013-2017	Post-doc Researcher @ Imperial College London

When AI Sees People: Human-Centred Vision and Interaction (45분)

AI is created by humans and for humans. In this talk, I will introduce our recent research on human-centred visual learning — building AI systems that understand people better by focusing on how we move, look, and interact with the world. Key visual cues like body pose, hand movements, gaze direction, and object interaction provide important insights into human actions and intentions. Our team has been working on combining these visual signals with language models to help AI systems understand context more effectively and interact more naturally with people. I will share our latest work on integrating body motion, hand gestures, and gaze with language to build smarter and more responsive systems. We also explore how different types of information — such as audio, text, and images — can work together to improve tasks like estimating hand-object shapes or generating realistic face and gaze images. This multimodal approach makes AI more accurate, flexible, and easier to communicate with. Finally, I will highlight recent trends in computer vision and discuss how these ideas can shape the future of human-AI interaction. These technologies have the potential to change the way AI sees and connects with people in everyday life.

정다샘 교수
(서강대)

Biography
2021-현재	서강대학교 아트&테크놀로지학과 조교수
2024-현재	ISMIR 2025 공동조직위원장
2025-현재	한국음악정보학회 부회장
2024-현재	한국음악지각인지학회 부회장
2025-현재	판소리학회 섭외이사
2020-2021	SK Telecom T-Brain 연구원
2015-2020	KAIST 문화기술대학원 공학박사
2013-2015	KAIST 문화기술대학원 공학석사
2008-2013	KAIST 기계공학과 공학사

Deep Learning for Diverse Music Data (45분)

Music exists in various formats. It can be in the form of a musical score (either scanned or in a machine-readable format like MusicXML), or as data with more performance-related aspects, such as MIDI or audio. Furthermore, different music genres often use different forms of representation. For example, in the case of Korean court music, music exists in Jeongganbo. This talk introduces two studies concerning the handling of different formats of music data. The first study focuses on reviving 15th-century Korean court music. This involves two key contributions: first, constructing the first machine-readable Jeongganbo dataset using optical music recognition, and second, proposing a novel encoding method that imitates Jeongganbo notation to represent melodies. The second study introduces a unified framework for cross-modal translation among various music formats, including scanned score images, machine-readable scores, MIDI performance data, and audio recordings.

최성준 교수
(고려대)

Biography
2025.01 - 현재	Principal Scientist at RLWRLD
2020.09 - 현재	Assistant Professor at Korea University
2018.06 - 2020.08	Postdoctoral Associate at Disney Research
2018.02 - 2018.05	Research Scientist at Kakao Brain
2012.09 - 2018.02	Ph.D. in EECS, Seoul National University

Towards Human-Centered Robotics: Coexistence of Human and Robots (45분)

Recent advancements in large language models and vision-language models have opened up new possibilities for human-centered robotics. This presentation aims to explore the potential of these cutting-edge technologies in enhancing human-robot interaction and enabling robots to better serve human needs. Firstly, we will see the rapid development of humanoid robots and their potential to replace human labor in various domains. Secondly, we will see the integration of artificial intelligence techniques with these humanoid robots to facilitate their deployment in everyday life scenarios. Furthermore, I will showcase our laboratory's ongoing research projects including utilizing Vision Language Models (VLMs) for human-robot interaction, combining VLMs with simulators, and developing robot agents with distinct personas.

김선 교수
(서울대)

Biography
2011-현재	서울대 컴퓨터공학과 교수
2022-현재	서울대 생명과학부 겸임교수
2024-현재	서울대병원 AI Healthcare 교수
2021-현재	(주) 아이겐드럭 CEO/CTO
2024-2025	국가인공지능위원회 인재/인프라 분과 위원장
2022-2024	(재) 목암생명과학연구소, 소장
2017-2019	한국정보과학회 인공지능소사이어티, 회장
2009-2011	Indiana University School of Informatics and Computing, 학과장
2001-2011	Indiana University School of Informatics and Computing, 조/부교수
1998-2001	DuPont Central Research, Senior Computer Scientist
1997-1998	UIUC Keck Center Bioinformatics Unit, Director

Navigating Chemical Space with Deep Learning Technologies (45분)

Drug development is a long and complex discovery process. Now advanced deep learning technologies have begun to realize AI-based drug discovery with a pronounced example of protein structure prediction (2024 Novel prize). In this talk, I will share our recent works on navigating chemical space with deep learning technologies which occurs in early stages of drug discovery. First, we show how drug-likeness can be re-defined with deep learning technologies (ISMB 2025, ICML 2025). Then, we will share our works on predicting drug target affinity prediction (ICLR 2025, ISMB 2025).

이종민 교수
(연세대)

Biography
2025-현재	연세대학교 인공지능학과 조교수
2022-2025	UC Berkeley, 박사후 연구원
2021	Google DeepMind, Research Scientist Intern
2022	KAIST 전산학 박사
2017	KAIST 전산학 석사
2014	서울대학교 컴퓨터공학 학사

RL Meets Convex Optimization (45분)

Reinforcement learning (RL) has achieved impressive results in domains like games and robotics, but its reliance on large amounts of on-policy interaction limits its real-world applicability. In this talk, I will present a unified framework that enables data-efficient policy learning from arbitrary off-policy data, built on top of convex optimization. By directly optimizing the stationary distribution of policies, rather than relying on unstable off-policy actor-critic methods, our approach achieves stable and principled learning across diverse settings such as reward-maximizing RL, constrained RL, imitation learning, state entropy maximization, and more. This perspective simplifies algorithm design and demonstrates that the objective for many complex sequential decision-making problems can be reduced to solving a single convex optimization.

윤세영 교수
(KAIST)

Biography
2017	KAIST 김재철AI대학원 부교수
2016-2017	Los Alamos National Lab. (미국) 박사후연구원
2015-2016	Microsoft Research, Cambridge (영국) 방문연구원
2014-2015	Microsoft Research-INRIA (프랑스) 박사후연구원
2013-2014	KTH (스웨덴) 박사후연구원
2012	KAIST 전기및전자공학 박사
2006	KAIST 전기및전자공학 학사

Training for Efficient LLM Inference (45분)

Recently, large language models (LLMs) have significantly impacted our lives. Services such as ChatGPT, Gemini, and Claude are powered by LLMs and offer AI capabilities for a wide range of applications. However, these models typically require substantial computational resources and are primarily deployed in the cloud, rather than on-device. For reasons such as privacy concerns and offline use cases, on-device deployment is becoming increasingly important. This highlights the need for methods that can reduce inference costs without compromising performance. In this talk, I will present several training strategies aimed at achieving this goal, including knowledge distillation, block decoding, and weight reuse techniques.

신원용 교수
(연세대)

Biography
2019-현재	연세대학교 수학계산학부(계산과학공학) 부교수/교수
2024-현재	연세대학교 배터리공학과 겸임교수
2022-현재	POSTECH 인공지능대학원 겸임교수
2021-현재	㈜카이로스랩 공동창업자
2024-현재	㈜프리딕션 공동창업자
2012-2019	단국대학교 컴퓨터학과 조교수/부교수
2009-2012	Harvard University, 박사후연구원/Research Associate
2008	KAIST 전자전산학과 공학박사

초고속 추천용 그래프 필터링 방법 (45분)

Graph convolutional network (GCN) has received considerable attention as a neural network structure that learns the optimal weight parameters by combining node attribute information with graph connection information for various graph downstream tasks. However, GCN-based recommender systems have the disadvantage in that they take too long to learn to reflect the large-scale interaction data and the user's preferences that change in real time. In this talk, I present a new methodology that performs recommendation based on pre-defined low-pass filters without expensive training processes. Built upon the pioneering work, dubbed GF-CF, we newly design Turbo-CF, a more cost-effective graph filtering method that does not necessitate costly matrix decomposition. Through comprehensive experiments, I demonstrate the effectiveness of Turbo-CF in terms of runtime efficiency and recommendation accuracy.

Invited Talks

AI Korea 2025강연 자료 다운로드

AI Korea 2025
강연 자료 다운로드