building matrices and normalization. in order to normalize co-occurences you will need first to...

9
Building matrices and normalization

Upload: benjamin-gardner

Post on 03-Jan-2016

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Building matrices and normalization. In order to normalize co-occurences you will need first to build a matrix with units (words, cited authors etc) in

Building matrices and normalization

Page 2: Building matrices and normalization. In order to normalize co-occurences you will need first to build a matrix with units (words, cited authors etc) in

In order to normalize co-occurences you will need first to build a matrix with units (words, cited authors etc) in the columns and document numbers in the rows. BibExcel will fill the matrix with numbers and then you could calculate Salton’s or Jaccard Index.

Page 3: Building matrices and normalization. In order to normalize co-occurences you will need first to build a matrix with units (words, cited authors etc) in

Make a co-word analysis based on the ID-field. The low file has a nicer look than the out-file after running Edit out-files/Convert Upper Lower Case/Good for reference strings on the outfile

Page 4: Building matrices and normalization. In order to normalize co-occurences you will need first to build a matrix with units (words, cited authors etc) in

Calculate frequencies on the low-file and the cit-file looks like this

Page 5: Building matrices and normalization. In order to normalize co-occurences you will need first to build a matrix with units (words, cited authors etc) in

Select the most frequent units, down to frequencies=20, sort them in Excel and then paste them into The List. Then select the low-file containing the id-words, and then run Analyze/Docs and units matrix/Make docnr+units matrix without zero row sum.

Page 6: Building matrices and normalization. In order to normalize co-occurences you will need first to build a matrix with units (words, cited authors etc) in

The ma5-file now contains the matrix!

Page 7: Building matrices and normalization. In order to normalize co-occurences you will need first to build a matrix with units (words, cited authors etc) in

To calculate Salton’s index select the ma5-file and run Analyze/Docs and units/Calculate Salton cosine from a ma5-file

Page 8: Building matrices and normalization. In order to normalize co-occurences you will need first to build a matrix with units (words, cited authors etc) in

Answer Yes (Ja) to this question:

Answer No (Nej) to this question:

Page 9: Building matrices and normalization. In order to normalize co-occurences you will need first to build a matrix with units (words, cited authors etc) in

…and the result is in the sal-file, with Salton index values, multiplied by 1000 (good for some applications)

Instead of Salton you may choose Jaccard or Vladutz & Cook normalization and apply them to the ma5-file.