The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces

Scholarly publications are key to the transfer of knowledge from scholars to others. However, research papers are information-dense, and as the volume of the scientific literature grows, the greater the need for new technology to support scholars. In contrast to the process of finding papers, which...

Full description

Bibliographic Details
Main Authors: Lo, Kyle, Chang, Joseph, Head, Andrew, Bragg, Jonathan, Zhang, Amy, Trier, Cassidy, Anastasiades, Chloe, August, Tal, Authur, Russell, Bragg, Danielle, Bransom, Erin, Cachola, Isabel, Candra, Stefan, Chandrasekhar, Yoganand, Chen, Yen-Sung, Cheng, Evie, Chou, Yvonne, Downey, Doug, Evans, Rob, Fok, Raymond
Format: Article
Language:English
Published: ACM 2024
Online Access:https://hdl.handle.net/1721.1/157322
Description
Summary:Scholarly publications are key to the transfer of knowledge from scholars to others. However, research papers are information-dense, and as the volume of the scientific literature grows, the greater the need for new technology to support scholars. In contrast to the process of finding papers, which has been transformed by Internet technology, the experience of reading research papers has changed little in decades. For instance, the PDF format for sharing papers remains widely used due to its portability but has significant downsides, inter alia, static content and poor accessibility for low-vision readers. This paper explores the question "Can recent advances in AI and HCI power intelligent, interactive, and accessible reading interfaces, even for legacy PDFs?" We describe the Semantic Reader Project, a collaborative effort across multiple institutions to explore automatic creation of dynamic reading interfaces for research papers. Through this project, we've developed a collection of novel reading interfaces and evaluated them with study participants and real-world users to show improved reading experiences for scholars. We've also released a production research paper reading interface that will continuously incorporate novel features from our research as they mature. We structure this paper around five key opportunities for AI assistance in scholarly reading---discovery, efficiency, comprehension, synthesis, and accessibility---and present an overview of our progress and discuss remaining open challenges.