Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • harvard-cite-them-right
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Rocchio, Ide, Okapi och BIM: En komparativ studie av fyra metoder för relevance feedback
Högskolan i Borås, Institutionen Biblioteks- och informationsvetenskap / Bibliotekshögskolan.
2008 (svensk)Independent thesis Advanced level (degree of Master (One Year))OppgaveAlternativ tittel
Rocchio, Ide, Okapi and BIM : A comparative study of four methods for relevance feedback (engelsk)
Abstract [en]

This thesis compares four relevance feedback methods. The Rocchio and Ide dec-hi algorithms for the vector space model and the binary independence model and Okapi BM25 within the probabilistic framework. This is done in a custom-made Information Retrieval system utilizing a collection containing 131 896 LA-Times articles which is part of the TREC ad-hoc collection. The methods are compared on two grounds, using only the relevance information from the 20 highest ranked documents from an initial search and also by using all available relevance information. Although a significant effect of choice of method could be found on the first ground, post-hoc analysis could not determine any statistically significant differences between the methods where Rocchio, Ide dec-hi and Okapi BM25 performed equivalent. All methods except the binary independence model performed significantly better than using no relevance feedback. It was also revealed that although the binary independence model performed far worse on average than the other methods it did outperform them on nearly 20 % of the topics. Further analysis argued that this depends on the lack of query expansion in the binary independence model which is advantageous for some topics although has a negative effect on retrieval efficiency in general. On the second ground Okapi BM25 performed significantly better than the other methods with the binary independence model once again being the worst performer. It was argued that the other methods have problems scaling to large amounts of relevance information where Okapi BM25 has no such issues.

sted, utgiver, år, opplag, sider
University of Borås/Swedish School of Library and Information Science (SSLIS) , 2008.
Serie
Magisteruppsats i biblioteks- och informationsvetenskap vid institutionen Biblioteks- och informationsvetenskap, ISSN 1654-0247 ; 2008:45
Emneord [en]
relevance feedback, information retrieval, rocchio, ide dec-hi, okapi bm25, vektormodellen, sökfrågeexpansion
Emneord [sv]
klassiska probabilistiska modellen
HSV kategori
Identifikatorer
URN: urn:nbn:se:hb:diva-18877Lokal ID: 2320/3699OAI: oai:DiVA.org:hb-18877DiVA, id: diva2:1310811
Merknad
Uppsatsnivå: DTilgjengelig fra: 2019-04-30 Laget: 2019-04-30

Open Access i DiVA

fulltekst(305 kB)238 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 305 kBChecksum SHA-512
5837455ecfa625d29ae09d9ab68497bfffc64751d3deef59599ef6ab2257b2ba3348f08121359968e78d1d34004cc7ca2fc288191bd139165279eb829923e9a7
Type fulltextMimetype application/pdf

Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 238 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

urn-nbn

Altmetric

urn-nbn
Totalt: 176 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • harvard-cite-them-right
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf