MultiHATHI: A Complete Collection of Multilingual Prose Fiction in the HathiTrust Digital Library

This dataset provides detailed metadata on ca. 10.2 million works of fiction and non-fiction written after 1799 in 521 different languages available in the HathiTrust Digital Library. The dataset bolsters the May 2022 Hathifile by supplying missing predicted fiction tags with a bespoke BERT-based mu...

Full description

Bibliographic Details
Main Authors: Sil Hamilton, Andrew Piper
Format: Article
Language:English
Published: Ubiquity Press 2023-02-01
Series:Journal of Open Humanities Data
Subjects:
Online Access:https://account.openhumanitiesdata.metajnl.com/index.php/up/article/view/95