Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • harvard-cite-them-right
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Comparing Feature Engineering Techniques for the Time Period Categorisation of Novels
University of Borås, Faculty of Librarianship, Information, Education and IT.
2024 (English)In: KO KNOWLEDGE ORGANIZATION, ISSN 0943-7444, Vol. 51, no 5, p. 330-339Article in journal (Refereed) Published
Abstract [en]

The growing number of literary works being produced and published has emphasised the importance of better cataloguing methods to handle the increasing volume effectively. One specific issue is the lack of organising works by time periods, which is crucial for understanding and organising literature. In this study, "time" refers to when the story's events occur or the narrative's temporal setting, like specific historical periods or events, rather than the publication date. Categorising literary works based on their historical settings can significantly improve accessibility for library patrons navigating online catalogues. However, time period categorisation is uncommon, primarily due to the resource-intensive nature of the process, which necessitates extensive analysis by librarians and cataloguers. To address this issue, this paper proposes evaluating different machine learning workflows to predict time periods for novels. The workflow comprises preprocessing, feature engineering, classification, and evaluation. The feature engineering techniques used are Latent Dirichlet Allocation (LDA), Word Embedding with Sentence-BERT (WE SBERT), and Term Frequency-Inverse Document Frequency (TF-IDF), and the classification algorithm used is Logistic Regression. The models are assessed using the F1 score, precision, and recall metrics. The time period categories used are Medieval, Era of Great Power, Age of Liberty, and Gustavian periods. The objective is to determine how effectively each model categorises Swedish historical fiction novels into their appropriate time period categories. By leveraging machine learning techniques, the research seeks to supplement the time period categorisation process, aiding cataloguers and ultimately enhancing the accessibility and usability of library collections.

Place, publisher, year, edition, pages
2024. Vol. 51, no 5, p. 330-339
Keywords [en]
literary categorization, machine learning techniques, time period categorization
National Category
Information Systems
Identifiers
URN: urn:nbn:se:hb:diva-32621DOI: 10.5771/0943-7444-2024-5ISI: 001309320900006OAI: oai:DiVA.org:hb-32621DiVA, id: diva2:1900867
Available from: 2024-09-25 Created: 2024-09-25 Last updated: 2025-09-24Bibliographically approved

Open Access in DiVA

fulltext(756 kB)72 downloads
File information
File name FULLTEXT01.pdfFile size 756 kBChecksum SHA-512
264c603f1435cdb17a8196c46adbc2cdeecd83105bdf6f2d89b8d86ace18ac78a66d1d19c5ed7bbe65c227b7ffe747439d0bfb1e30a09529068dd62c5f4ba82f
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

Westin, Fereshta

Search in DiVA

By author/editor
Westin, Fereshta
By organisation
Faculty of Librarianship, Information, Education and IT
Information Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 72 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 69 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • harvard-cite-them-right
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf