Hyeonsu Kang is a PhD student in Human-Computer Interaction Institute at Carnegie Mellon University (CMU), where he is advised by Dr. Niki Kittur. His research interests are in Human-Computer Interaction, Natural Language Processing, and Collective Intelligence, where he uses human-centered methods to study, design, and build interactive and intelligent systems that unlock new forms of creativity, empower scientists to build mental models of others and use such mental models to personalize and configure their literature discovery process, bring experts with complementary expertise and fresh insights together to tackle wicked problems, and optimize workflows for reviewing the prior art in different scientific fields. His work has been published in premier venues in HCI and NLP including ACM CHI, UIST, TOCHI, and NAACL. His work is generously supported by the National Science Foundation (NSF), Allen Institute for Artificial Intelligence, Google Cloud, the Office of Naval Research, and the Center for Knowledge Acceleration at CMU.

He was named a Google Cloud Research Innovator (2021) and was previously supported by the South Korean National Scholarship for Science and Engineering.

He received his BS in Computer Science and Engineering at Seoul National University. During the course of his graduate studies, he has interned at Allen Institute for Artificial Intelligence (Summer 2022 and 2021) and Tableau Software.

CV | Google Scholar | Semantic Scholar | Email: hyeonsuk@cs.cmu.edu


Journal Papers

a purpose query 'Facilitate heat transfer in semiconductors' is shown to match to two different papers with diverse mechanisms that inspired a user study participant to come up with creative adaptation and direct application ideas
Hyeonsu B. Kang, Xin Qian, Tom Hope, Dafna Shahaf, Joel Chan, and Aniket Kittur
TOCHI 2022
Analogies have been central to creative problem-solving throughout the history of science and technology. As the number of scientific papers continues to increase exponentially, there is a growing opportunity for finding diverse solutions to existing problems. However, realizing this potential requires the development of a means for searching through a large corpus that goes beyond surface matches and simple keywords. Here we contribute the first end-to-end system for analogical search on scientific papers and evaluate its effectiveness with scientists' own problems. Using a human-in-the-loop AI system as a probe we find that our system facilitates creative ideation, and that ideation success is mediated by an intermediate level of matching on the problem abstraction (i.e., high versus low). We also demonstrate a fully automated AI search engine that achieves a similar accuracy with the human-in-the-loop system. We conclude with design implications for enabling automated analogical inspiration engines to accelerate scientific innovation.

Refereed Conference Proceedings Papers

Hyeonsu B. Kang, Joseph Chee Chang, Yongsung Kim, Aniket Kittur
UIST 2022
Reviewing the literature to understand relevant threads of past work is a critical part of research and vehicle for learning. However, as the scientific literature grows the challenges for users to find and make sense of the many different threads of research grow as well. Previous work has helped scholars to find and group papers with citation information or textual similarity using standalone tools or overview visualizations. Instead, in this work we explore a tool integrated into users' reading process that helps them with leveraging authors' existing summarization of threads, typically in introduction or related work sections, in order to situate their own work's contributions. To explore this we developed a prototype that supports efficient extraction and organization of threads along with supporting evidence as scientists read research articles. The system then recommends further relevant articles based on user-created threads. We evaluate the system in a lab study and find that it helps scientists to follow and curate research threads without breaking out of their flow of reading, collect relevant papers and clips, and discover interesting new articles to further grow threads.
An example indirect author-based relevance message augmenting the incoming new paper recommendaiton.
Hyeonsu B. Kang, Rafal Kocielnik, Andrew Head, Jiangjiang Yang, Matt Latzke, Aniket Kittur, Daniel Weld, Doug Downey, and Jonathan Bragg
CHI 2022
Finding and engaging with the relevant scientific knowledge is foundational for intellectual progress in a society. Yet, with an exponential growth in publication rates, this becomes a challenging task. While personalized recommendations can help, they still may lack explanations of how certain papers are relevant and thus should be prioritized or attended to. To combat this, we developed a citation-based and two kinds of social relation-based approaches to boost user engagement with scholarly paper recommendations. For users who opted in, these approaches augmented paper recommendations included in email alerts with textual relevance descriptions underneath the recommendations. We evaluated our approaches in a randomized field experiment that ran for over two months and with 7,000+ users, and also in a controlled lab study (N=14) for deeper qualitative insights. We report on our findings and implications for the design of future approaches that aim to augment scholarly recommendations.
A functional graph representation using the extracted purposes of product ideas
Tom Hope, Ronen Tamari, Hyeonsu Kang, Daniel Hershcovich, Joel Chan, Aniket Kittur, and Dafna Shahaf
CHI 2022 
We explore a novel representation for automatically breaking up product ideas described in natural language into fine-grained functional aspects. This representation can capture the core purposes and mechanisms in ideas, and support the backbone interactions (e.g., functional search of ideas, mapping and exploration of the design space around a focal problem) for augmenting human intelligence and accelerating the rate of innovation.
A diagrammatic representation of the idea of Paragon
Hyeonsu B. Kang, Gabriel Amoako, Neil Sengupta, Steven Dow
CHI 2018
“A picture is worth a thousand words.” We developed Paragon, a system that supports crowdworkers and peers during feedback exchange by enabling search of design examples that supplement the written feedback. In two lab studies, we found that i) feedback providers select poster examples that complement their feedback and align with a provided rubric and that ii) feedback providers give significantly more specific, actionable, and novel input when using an example-centric approach, as opposed to text alone.
An example of bidirectional code and visualization linking.
Hyeonsu Kang, Philip Guo
UIST 2017
We developed Omnicode, a programming environment with an always-on run-time visualization that helps novice programmers directly see how the variables and their relations change in real-time, in response to the changes they make in the program code. In our lab study, we found Omnicode to be useful for debugging, forming proper mental models, explaining their code to others, and discovering moments of serendipity that would not have been likely within an ordinary IDE.

Lightly Refereed Workshop Papers

A diagrammatic representation of the system implementation consisting of three main components: Aspect-based querying; Global domain cluster generation; Local domain cluster generation
Hyeonsu B. Kang*, Sheshera Mysore*, Kevin Huang*, Haw-Shiuan Chang, Thorben Prein, Andrew McCallum, Aniket Kittur, Elsa Olivetti
NAACL 2022 HCI + NLP Workshop
Exposure to ideas in domains outside a scientist's own may benefit her in reformulating existing research problems in novel ways and discovering new application domains for existing solution ideas. While improved performance in scholarly search engines can help scientists efficiently identify relevant advances in domains they may already be familiar with, it may fall short of helping them explore diverse ideas outside such domains. In this paper we explore the design of systems aimed at augmenting the end-user ability in cross-domain exploration with flexible query specification. To this end, we develop an exploratory search system in which end-users can select a portion of text core to their interest from a paper abstract and retrieve papers that have a high similarity to the user-selected core aspect but differ in terms of domains. Furthermore, end-users can 'zoom in' to specific domain clusters to retrieve more papers from them and understand nuanced differences within the clusters. Our case studies with scientists uncover opportunities and design implications for systems aimed at facilitating cross-domain exploration and inspiration.
A 2-D distribution of project ideas based on their similarity to the source project's problem and solution ideas
Matching Open Innovation Projects for Analogical Feedback Exchange
Hyeonsu Kang, Felicia Ng, Aniket Kittur
Collective Intelligence 2019
We developed an algorithm for matching teams in open innovation contests that tackle related conservataion challenges using diverse approaches, thereby encouraging the transfer of analogical inspirations between teams. To this end, our algorithm used pre-trained language models to encode the natural language text descriptions of team challenges and their solution approaches into a vector similarity space, then computed semantic similarity between them to systematically find teams tackling similar problems using diverse approaches, shown as a conducive mechanism for the transfer.
a While block program block in Starlogo Nova
Custom Blocks in StarLogo Nova: A Template-Based Approach to Abstraction for Improved Ease of Use and Expressive Power
Hyeonsu Kang, David Wu, David Wendel
We developed a general extension to the StarLogo Nova language to support end-user programming in various disciplines such as evolutionary biology, physics, and ecosystem sciences. This extension allowed end-users to select blocks that correspond to low-level programming constructs such as looping and variable assignment statements, and group them to create abstraction blocks that hide the low-level implementation details that oft-times distract learners from disciplinary learning objectives and system-level conceptual understanding. Using such abstraction blocks can also reduce the complexity of the programming language itself and lower the barrier to entry for novice learners.