
My research centers on developing a system for conversing with humans, specifically in the realm of conversational AI. The advent of large language models (LLMs) has significantly influenced their integration into smartphone applications and conversational robots. My primary focus is on creating "human-like natural conversation," aiming to design conversational robots capable of engaging in flexible and profound interactions akin to human communication. While the main modality used in my research is audio-based, my approach incorporates multimodal data. I am leveraging technologies such as large-scale foundation models and continuous prediction models. Additionally, I draw insights from conversation analysis, linguistics, and cognitive science. Through this research, my ultimate objective is to reveal the mechanisms underlying human-to-human communication.
Book

Research Highlights
Yeah, Well, Haha: Generating Non-linguistic Behaviors
Keynote, SIGDIAL 2024 (September 18th, 2024)