Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • harvard-cite-them-right
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Time Period Categorization in Fiction: A Comparative Analysis of Machine Learning Techniques
University of Borås, Faculty of Librarianship, Information, Education and IT. University of Borås, Borås, Sweden.
2024 (English)In: Cataloging & Classification Quarterly, ISSN 0163-9374, E-ISSN 1544-4554, p. 1-30Article in journal (Refereed) Published
Abstract [en]

This study investigates the automatic categorization of time period metadata in fiction, a critical but often overlooked aspect of cataloging. Using a comparative analysis approach, the performance of three machine learning techniques, namely Latent Dirichlet Allocation (LDA), Sentence-BERT (SBERT), and Term Frequency-Inverse Document Frequency (TF-IDF) were assessed, by examining their precision, recall, F1 scores, and confusion matrix results. LDA identifies underlying topics within the text, TF-IDF measures word importance, and SBERT measures sentence semantic similarity. Based on F1-score analysis and confusion matrix outcomes, TF-IDF and LDA effectively categorize text data by time period, while SBERT performed poorly across all time period categories.

Place, publisher, year, edition, pages
2024. p. 1-30
Keywords [en]
Cataloging for digital resources; fiction, LDA, machine learning, SBERT, text analysis, TF-IDF, time period categorization
National Category
Computer and Information Sciences Information Studies
Research subject
Library and Information Science; Library and Information Science
Identifiers
URN: urn:nbn:se:hb:diva-31755DOI: 10.1080/01639374.2024.2315548ISI: 001189786900001Scopus ID: 2-s2.0-85189510207OAI: oai:DiVA.org:hb-31755DiVA, id: diva2:1851616
Available from: 2024-04-15 Created: 2024-04-15 Last updated: 2024-10-01Bibliographically approved

Open Access in DiVA

fulltext(7970 kB)38 downloads
File information
File name FULLTEXT01.pdfFile size 7970 kBChecksum SHA-512
d83dc9fe239fec67044391b1a82fd715bc77b4fc65f1ca344a10ca56e1aad3caa31659e1afc4aafe72f3111f2627155611ed409073488a1e7fa4adff1d1e58cb
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Westin, Fereshta

Search in DiVA

By author/editor
Westin, Fereshta
By organisation
Faculty of Librarianship, Information, Education and IT
In the same journal
Cataloging & Classification Quarterly
Computer and Information SciencesInformation Studies

Search outside of DiVA

GoogleGoogle Scholar
Total: 38 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 60 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • harvard-cite-them-right
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf