Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient
University of Borås, Swedish School of Library and Information Science.
University of Borås, Swedish School of Library and Information Science.
2003 (English)In: Journal of The American Society For Information Science And Technology, ISSN 1532-2882, E-ISSN 1532-2890, Vol. 54, no 6, p. 550-560Article in journal (Other academic) Published
Abstract [en]

Ahlgren, Jarneving, and. Rousseau review accepted procedures for author co-citation analysis first pointing out that since in the raw data matrix the row and column values are identical i,e, the co-citation count of two authors, there is no clear choice for diagonal values. They suggest the number of times an author has been co-cited with himself excluding self citation rather than the common treatment as zeros or as missing values. When the matrix is converted to a similarity matrix the normal procedure is to create a matrix of Pearson's r coefficients between data vectors. Ranking by r and by co-citation frequency and by intuition can easily yield three different orders. It would seem necessary that the adding of zeros to the matrix will not affect the value or the relative order of similarity measures but it is shown that this is not the case with Pearson's r. Using 913 bibliographic descriptions form the Web of Science of articles form JASIS and Scientometrics, authors names were extracted, edited and 12 information retrieval authors and 12 bibliometric authors each from the top 100 most cited were selected. Co-citation and r value (diagonal elements treated as missing) matrices were constructed, and then reconstructed in expanded form. Adding zeros can both change the r value and the ordering of the authors based upon that value. A chi-squared distance measure would not violate these requirements, nor would the cosine coefficient. It is also argued that co-citation data is ordinal data since there is no assurance of an absolute zero number of co-citations, and thus Pearson is not appropriate. The number of ties in co-citation data make the use of the Spearman rank order coefficient problematic.

Place, publisher, year, edition, pages
New York : Wiley , 2003. Vol. 54, no 6, p. 550-560
National Category
Information Studies
Identifiers
URN: urn:nbn:se:hb:diva-2242DOI: 10.1002/asi.10242Local ID: 2320/2707OAI: oai:DiVA.org:hb-2242DiVA, id: diva2:870332
Available from: 2015-11-13 Created: 2015-11-13 Last updated: 2017-12-01Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records BETA

Ahlgren, PerJarneving, Bo

Search in DiVA

By author/editor
Ahlgren, PerJarneving, Bo
By organisation
Swedish School of Library and Information Science
In the same journal
Journal of The American Society For Information Science And Technology
Information Studies

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 833 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf