Fractional Counting of Authors and Addresses: Fraction.EXE
This website provides a routine to compute fractional values for the numbers of publications and citations in a set of papers downloaded from the Web-of-Science (v5). First, this set can be organized in a relational database using ISI.exe. ISI.exe uses as input the download in the tagged format of the WoS which is available in the same folder and named “data.txt” (see at http://www.leydesdorff.net/software/isi for further explanation). The output is a set of databases (.dbf) which can be read using Excel or SPSS. For example, authors are organized into au.dbf and addresses into cs.dbf. (The various files are related in terms of the field “nr;” MSAccess can be used for relational database management.)
The resulting files can be used by fraction.exe as input. This program will add au2.dbf and cs2.dbf to the dbf-files generated by isi.exe.
1. Au2.dbf contains additional to the fields in au.dbf, the fields “tc” (times cited), “tc_au” (times cited/author), and “npub_au” (fractional count of the publication). One can, for example, read au2.dbf into excel and use a pivot table for generating a fractional count of the authorship of the authors in the set.
2. Cs2.dbf contains similar fields, but counts in terms of the number of addresses attributed to the publication in the C1-field of WoS. The RP-field of WoS is not used since it usually extends on one of the other addresses (in WoS v5).
Records before 2008 are not used because the address
fields before this date are not consistently organized in WoS.
Note that when two different addresses are listed for the same author, both are counted in the current version. (This can further be refined.) If one wishes an address not to be counted, it is perhaps easiest to remove it in the input file of ISI.exe (“data.txt”) using a text-editor (e.g., NotePad or WordPad; be careful to save in the DOS format.) One can add the RP address manually to the address list under C1.
3. Additionally, a scratch file core2.dbf is generated that is equal to core.dbf, but to which the various fields under 1. and 2. are added at the end. This does not add new information and can be disregarded.
The routine overwrites files with the same names in each next run.
August 26, 2015.
A further routine instfrac.exe produces the matrices for institutional collaboration: matrix.txt, matrix.dbf, fractionally counted, and on the basis of these matrices the network files coocc.dat and cosine.dat for processing in Pajek. See for further explanation at http://www.leydesdorff.net/software/instcoll (September 23, 2015).
For institutional networks with more than 1,000 institutions, click here.
Similarly, intfrac.exe produces the matrices for international collaboration: matrix.txt, matrix.dbf, fractionally counted, and on the basis of these matrices the network files coocc.dat and cosine.dat for processing in Pajek. See for further explanation at http://www.leydesdorff.net/software/intcoll (November 2, 2015)
November 2, 2015.