Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • harvard-cite-them-right
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Automated fiction classification: an explorative study offiction classification using machine-learning techniques
University of Borås, Faculty of Librarianship, Information, Education and IT.
2019 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

This thesis aims to explore the possibilities and components of employing automated text classification techniques to classify collections of narrative fiction by genre, and also, what linguistic features are prominent in distinguishing genres of fiction. The historical traditions and current practices and theories in the field of fiction classification are outlined, along with central concepts of classification and genre theory. Linguistic features are also introduced, and hypothesized to carry capabilities of distinguishing genres of fiction. The thesis also reviews the foundations and current state of automated text classification, and reasons on what constitutes topical and stylistic features in relation to fiction. Knowledge gaps are identified between automated text classification and traditional fiction classification, and also, concerning the potentially genre distinguishing qualities of topical and stylistic features. The main experiment, around which the thesis is centered, is divided into two parts. The first part employs and evaluates kNN and SVM classifiers on a collection of fiction documents across four genres of fiction. In the second part, some feature selection methods are employed for inspection of distinguishing features across the collection. Findings suggest a potential of using automated techniques to classify fiction, and also illustrates feature patterns that are argued to distinguish each of the four different genres of fiction. Some suggestions for further research are also proposed.

Place, publisher, year, edition, pages
2019.
Keywords [sv]
skönlitteratur, klassifikation, genrer, särdrag, ämne, stil, maskininlärning
National Category
Information Studies
Identifiers
URN: urn:nbn:se:hb:diva-22862OAI: oai:DiVA.org:hb-22862DiVA, id: diva2:1395257
Available from: 2020-02-26 Created: 2020-02-21 Last updated: 2022-03-02Bibliographically approved

Open Access in DiVA

fulltext(1094 kB)796 downloads
File information
File name FULLTEXT01.pdfFile size 1094 kBChecksum SHA-512
37267753310307002814f45667ff3abd7e05e83c6acae04935212c92f822c7780386f5cc5056a37ecf156912244fec6031505035de6c8b5171dc2d51b61bbefb
Type fulltextMimetype application/pdf
avtal(37 kB)56 downloads
File information
File name FULLTEXT02.pdfFile size 37 kBChecksum SHA-512
a1d2b0bdaf3463bd4073a106c039b7e7a88dd82ab88e82d4e5e3a67f3081cc487de6c16b6de31f03803d5f184239b5fc0617f9ac2a8e8d3b2364d1f91412f9b9
Type attachmentMimetype application/pdf

By organisation
Faculty of Librarianship, Information, Education and IT
Information Studies

Search outside of DiVA

GoogleGoogle Scholar
Total: 852 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 962 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • harvard-cite-them-right
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf