MultiHATHI: A Complete Collection of Multilingual Prose Fiction in the HathiTrust Digital Library
This dataset provides detailed metadata on ca. 10.2 million works of fiction and non-fiction written after 1799 in 521 different languages available in the HathiTrust Digital Library. The dataset bolsters the May 2022 Hathifile by supplying missing predicted fiction tags with a bespoke BERT-based mu...
Main Authors: | Sil Hamilton, Andrew Piper |
---|---|
Format: | Article |
Language: | English |
Published: |
Ubiquity Press
2023-02-01
|
Series: | Journal of Open Humanities Data |
Subjects: | |
Online Access: | https://account.openhumanitiesdata.metajnl.com/index.php/up/article/view/95 |
Similar Items
-
HATHI 1M: Introducing a Million Page Historical Prose Dataset in English from the Hathi Trust
by: Sunyam Bagga, et al.
Published: (2022-03-01) -
HathiTrust Ingest of Locally Managed Content: A Case Study from the University of Illinois at Urbana-Champaign
by: Kyle R. Rimkus, et al.
Published: (2014-07-01) -
HathiTrust and Local Digital Stewardship: A Case Study in How Massive Digital Libraries Affect Local Digital Resources Decisions
by: Heidi M. Winkler, et al.
Published: (2017-07-01) -
The Ghost of My Mother: A Short Book of Fiction
by: Milcho Manchevski
Published: (2002-01-01) -
Horizontal slice of Russian prose-2012
Published: (2012-04-01)