<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.umiacs.umd.edu/adapt/index.php?action=history&amp;feed=atom&amp;title=Webarc%3ATrec_Eval_%28modified%29</id>
	<title>Webarc:Trec Eval (modified) - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.umiacs.umd.edu/adapt/index.php?action=history&amp;feed=atom&amp;title=Webarc%3ATrec_Eval_%28modified%29"/>
	<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/adapt/index.php?title=Webarc:Trec_Eval_(modified)&amp;action=history"/>
	<updated>2026-04-07T10:54:36Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.43.7</generator>
	<entry>
		<id>https://wiki.umiacs.umd.edu/adapt/index.php?title=Webarc:Trec_Eval_(modified)&amp;diff=2519&amp;oldid=prev</id>
		<title>Scsong at 01:32, 10 November 2009</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/adapt/index.php?title=Webarc:Trec_Eval_(modified)&amp;diff=2519&amp;oldid=prev"/>
		<updated>2009-11-10T01:32:19Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;== What It Does ==&lt;br /&gt;
We added Kendall&amp;#039;s Tau (http://en.wikipedia.org/wiki/Kendall_tau_rank_correlation_coefficient) support in the existing Trec_Eval tool (http://trec.nist.gov/trec_eval/).&lt;br /&gt;
&lt;br /&gt;
== How To Build ==&lt;br /&gt;
In the Trec_Eval directory,&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
make&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== How To Run ==&lt;br /&gt;
Below is copied from the README file that came with trec_eval.8.1&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
trec_eval [-h] [-q] [-a] [-o] [-c] [-l&amp;lt;num&amp;gt;  [-N&amp;lt;num&amp;gt;] [-M&amp;lt;num&amp;gt;] [-Ua&amp;lt;num&amp;gt;] [-Ub&amp;lt;num&amp;gt;] [-Uc&amp;lt;num&amp;gt;] [-Ud&amp;lt;num&amp;gt;] [-T] trec_rel_file trec_top_file &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Calculate and print various evaluation measures, evaluating the results  &lt;br /&gt;
in trec_top_file against the relevance judgements in trec_rel_file. &lt;br /&gt;
 &lt;br /&gt;
There are a fair number of options, of which only the lower case options are &lt;br /&gt;
normally ever used.   &lt;br /&gt;
 -h: Print full help message and exit &lt;br /&gt;
 -q: In addition to summary evaluation, give evaluation for each query &lt;br /&gt;
 -a: Print all evaluation measures calculated, instead of just the &lt;br /&gt;
     main official measures for TREC. &lt;br /&gt;
 -o: Print everything out in old, nonrelational format (default is relational) &lt;br /&gt;
 -c: Average over the complete set of queries in the relevance judgements  &lt;br /&gt;
      instead of the queries in the intersection of relevance judgements &lt;br /&gt;
      and results.  Missing queries will contribute a value of 0 to all &lt;br /&gt;
      evaluation measures (which may or may not be reasonable for a  &lt;br /&gt;
      particular evaluation measure, but is reasonable for standard TREC &lt;br /&gt;
      measures.) &lt;br /&gt;
 -l&amp;lt;num&amp;gt;: Num indicates the minimum relevance judgement value needed for &lt;br /&gt;
      a document to be called relevant. (All measures used by TREC eval are &lt;br /&gt;
      based on binary relevance).  Used if trec_rel_file contains relevance &lt;br /&gt;
      judged on a multi-relevance scale.  Default is 1. &lt;br /&gt;
 -N&amp;lt;num&amp;gt;: Number of docs in collection &lt;br /&gt;
 -M&amp;lt;num&amp;gt;: Max number of docs per topic to use in evaluation (discard rest). &lt;br /&gt;
 -Ua&amp;lt;num&amp;gt;: Value to use for &amp;#039;a&amp;#039; coefficient of utility computation. &lt;br /&gt;
                        relevant  nonrelevant &lt;br /&gt;
      retrieved            a          b &lt;br /&gt;
      nonretrieved         c          d &lt;br /&gt;
 -Ub&amp;lt;num&amp;gt;: Value to use for &amp;#039;b&amp;#039; coefficient of utility computation. &lt;br /&gt;
 -Uc&amp;lt;num&amp;gt;: Value to use for &amp;#039;c&amp;#039; coefficient of utility computation. &lt;br /&gt;
 -Ud&amp;lt;num&amp;gt;: Value to use for &amp;#039;d&amp;#039; coefficient of utility computation. &lt;br /&gt;
 -J: Calculate all values only over the judged (either relevant or  &lt;br /&gt;
     nonrelevant) documents.  All unjudged documents are removed from the &lt;br /&gt;
     retrieved set before any calculations (possibly leaving an empty set). &lt;br /&gt;
     DO NOT USE, unless you really know what you&amp;#039;re doing - very easy to get &lt;br /&gt;
     reasonable looking, but invalid, numbers.  &lt;br /&gt;
 -T: Treat similarity as time that document retrieved.  Compute &lt;br /&gt;
      several time-based measures after ranking docs by time retrieved &lt;br /&gt;
      (first doc (lowest sim) retrieved ranked highest).  &lt;br /&gt;
      Only done if -a selected. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Input File ==&lt;br /&gt;
Example trec_rel_file:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
3 Q0 Politics_of_the_Falkland_Islands04-29-2001 1 11.12743 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_the_Falkland_Islands04-29-2001 2 11.12743 qts.1_tw.1&lt;br /&gt;
3 Q0 History_of_the_Falkland_Islands04-29-2001 3 11.12743 qts.1_tw.1&lt;br /&gt;
3 Q0 Foreign_relations_of_Cyprus04-26-2001 4 8.11532 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_Finland04-29-2001 5 7.12460 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_the_Faroe_Islands04-29-2001 6 6.77037 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_Canada03-26-2001 7 5.95059 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_Germany03-10-2001 8 5.74032 qts.1_tw.1&lt;br /&gt;
3 Q0 Paul_Robeson04-20-2001 9 4.97028 qts.1_tw.1&lt;br /&gt;
3 Q0 Politics_of_Dominica04-26-2001 10 4.09356 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_the_Republic_of_the_Congo04-26-2001 11 3.78614 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_Ethiopia04-29-2001 12 3.74733 qts.1_tw.1&lt;br /&gt;
3 Q0 Politics_of_Cyprus04-26-2001 13 2.08520 qts.1_tw.1&lt;br /&gt;
4 Q0 Politics_of_the_Falkland_Islands04-29-2001 1 12.70288 qts.1_tw.1&lt;br /&gt;
4 Q0 Christ&amp;#039;s_College,_Cambridge05-20-2001 2 12.24870 qts.1_tw.1&lt;br /&gt;
4 Q0 Politics_of_Gibraltar05-17-2001 3 12.16519 qts.1_tw.1&lt;br /&gt;
4 Q0 History_of_Hong_Kong05-04-2001 4 11.67782 qts.1_tw.1&lt;br /&gt;
4 Q0 Vittorio_De_Sica05-13-2001 5 11.17646 qts.1_tw.1&lt;br /&gt;
4 Q0 Aphex_Twin05-03-2001 6 11.02514 qts.1_tw.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Example trec_top_filefiles:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
3 Q0 Politics_of_Dominica04-26-2001 1 11.09356 qts.1_tw.1&lt;br /&gt;
3 Q0 Paul_Robeson04-20-2001 2 10.97028 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_Germany03-10-2001 3 9.74032 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_Canada03-26-2001 4 8.95059 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_the_Faroe_Islands04-29-2001 5 7.77037 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_Finland04-29-2001 6 6.12460 qts.1_tw.1&lt;br /&gt;
3 Q0 Foreign_relations_of_Cyprus04-26-2001 7 5.11532 qts.1_tw.1&lt;br /&gt;
3 Q0 History_of_the_Falkland_Islands04-29-2001 8 4.37087 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_the_Falkland_Islands04-29-2001 9 3.01286 qts.1_tw.1&lt;br /&gt;
3 Q0 Politics_of_the_Falkland_Islands04-29-2001 10 2.12743 qts.1_tw.1&lt;br /&gt;
3 Q0 Politics_of_Cyprus04-26-2001 13 2.08520 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_Ethiopia04-29-2001 12 1.74733 qts.1_tw.1&lt;br /&gt;
3 Q0 Economy_of_the_Republic_of_the_Congo04-26-2001 11 0.78614 qts.1_tw.1&lt;br /&gt;
4 Q0 Politics_of_the_Falkland_Islands04-29-2001 1 12.70288 qts.1_tw.1&lt;br /&gt;
4 Q0 Christ&amp;#039;s_College,_Cambridge05-20-2001 2 12.24870 qts.1_tw.1&lt;br /&gt;
4 Q0 Politics_of_Gibraltar05-17-2001 3 12.16519 qts.1_tw.1&lt;br /&gt;
4 Q0 History_of_Hong_Kong05-04-2001 4 11.67782 qts.1_tw.1&lt;br /&gt;
4 Q0 Vittorio_De_Sica05-13-2001 5 11.17646 qts.1_tw.1&lt;br /&gt;
4 Q0 Aphex_Twin05-03-2001 6 11.02514 qts.1_tw.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Output Files ==&lt;br /&gt;
Example output file for the example input files above (Note Kendall&amp;#039;s Tau&amp;#039;s in the middle):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
num_q          	all	2&lt;br /&gt;
num_ret        	all	19&lt;br /&gt;
num_rel        	all	19&lt;br /&gt;
num_rel_ret    	all	19&lt;br /&gt;
map            	all	1.0000&lt;br /&gt;
gm_ap          	all	1.0000&lt;br /&gt;
R-prec         	all	1.0000&lt;br /&gt;
bpref          	all	1.0000&lt;br /&gt;
recip_rank     	all	1.0000&lt;br /&gt;
num_nonrel_judged_ret	all	0&lt;br /&gt;
exact_prec     	all	1.0000&lt;br /&gt;
exact_recall   	all	1.0000&lt;br /&gt;
11-pt_avg      	all	1.0000&lt;br /&gt;
3-pt_avg       	all	1.0000&lt;br /&gt;
avg_doc_prec   	all	1.0000&lt;br /&gt;
exact_relative_prec	all	1.0000&lt;br /&gt;
avg_relative_prec	all	1.0000&lt;br /&gt;
exact_unranked_avg_prec	all	1.0000&lt;br /&gt;
exact_relative_unranked_avg_prec	all	1.0000&lt;br /&gt;
map_at_R       	all	0.8782&lt;br /&gt;
int_map        	all	1.0000&lt;br /&gt;
exact_int_R_rcl_prec	all	1.0000&lt;br /&gt;
int_map_at_R   	all	0.8782&lt;br /&gt;
bpref_allnonrel	all	1.0000&lt;br /&gt;
bpref_retnonrel	all	1.0000&lt;br /&gt;
bpref_topnonrel	all	1.0000&lt;br /&gt;
bpref_top5Rnonrel	all	   nan&lt;br /&gt;
bpref_top10Rnonrel	all	   nan&lt;br /&gt;
bpref_top10pRnonrel	all	1.0000&lt;br /&gt;
bpref_top25pRnonrel	all	1.0000&lt;br /&gt;
bpref_top50pRnonrel	all	1.0000&lt;br /&gt;
bpref_top25p2Rnonrel	all	1.0000&lt;br /&gt;
bpref_retall   	all	1.0000&lt;br /&gt;
bpref_5        	all	1.0000&lt;br /&gt;
bpref_10       	all	1.0000&lt;br /&gt;
bpref_num_all  	all	0.0000&lt;br /&gt;
bpref_num_ret  	all	0.0000&lt;br /&gt;
bpref_num_correct	all	0&lt;br /&gt;
bpref_num_possible	all	0&lt;br /&gt;
old_bpref      	all	1.0000&lt;br /&gt;
old_bpref_top10pRnonrel	all	1.0000&lt;br /&gt;
infAP          	all	1.0000&lt;br /&gt;
gm_bpref       	all	1.0000&lt;br /&gt;
ircl_prn.0.00  	all	1.0000&lt;br /&gt;
ircl_prn.0.10  	all	1.0000&lt;br /&gt;
ircl_prn.0.20  	all	1.0000&lt;br /&gt;
ircl_prn.0.30  	all	1.0000&lt;br /&gt;
ircl_prn.0.40  	all	1.0000&lt;br /&gt;
ircl_prn.0.50  	all	1.0000&lt;br /&gt;
ircl_prn.0.60  	all	1.0000&lt;br /&gt;
ircl_prn.0.70  	all	1.0000&lt;br /&gt;
ircl_prn.0.80  	all	1.0000&lt;br /&gt;
ircl_prn.0.90  	all	1.0000&lt;br /&gt;
ircl_prn.1.00  	all	1.0000&lt;br /&gt;
P5             	all	1.0000&lt;br /&gt;
P10            	all	0.8000&lt;br /&gt;
P20            	all	0.4750&lt;br /&gt;
P30            	all	0.3167&lt;br /&gt;
P50            	all	0.1900&lt;br /&gt;
P100           	all	0.0950&lt;br /&gt;
recall5        	all	0.6090&lt;br /&gt;
recall10       	all	0.8846&lt;br /&gt;
recall20       	all	1.0000&lt;br /&gt;
recall30       	all	1.0000&lt;br /&gt;
recall50       	all	1.0000&lt;br /&gt;
recall100      	all	1.0000&lt;br /&gt;
0.20R-prec     	all	1.0000&lt;br /&gt;
0.40R-prec     	all	1.0000&lt;br /&gt;
0.60R-prec     	all	1.0000&lt;br /&gt;
0.80R-prec     	all	1.0000&lt;br /&gt;
1.00R-prec     	all	1.0000&lt;br /&gt;
1.20R-prec     	all	0.7812&lt;br /&gt;
1.40R-prec     	all	0.6754&lt;br /&gt;
1.60R-prec     	all	0.6095&lt;br /&gt;
1.80R-prec     	all	0.5436&lt;br /&gt;
2.00R-prec     	all	0.5000&lt;br /&gt;
relative_prec5 	all	1.0000&lt;br /&gt;
relative_prec10	all	1.0000&lt;br /&gt;
relative_prec20	all	1.0000&lt;br /&gt;
relative_prec30	all	1.0000&lt;br /&gt;
relative_prec50	all	1.0000&lt;br /&gt;
relative_prec100	all	1.0000&lt;br /&gt;
unranked_avg_prec5	all	0.6090&lt;br /&gt;
unranked_avg_prec10	all	0.6846&lt;br /&gt;
unranked_avg_prec20	all	0.4750&lt;br /&gt;
unranked_avg_prec30	all	0.3167&lt;br /&gt;
unranked_avg_prec50	all	0.1900&lt;br /&gt;
unranked_avg_prec100	all	0.0950&lt;br /&gt;
relative_unranked_avg_prec5	all	1.0000&lt;br /&gt;
relative_unranked_avg_prec10	all	0.6800&lt;br /&gt;
relative_unranked_avg_prec20	all	0.2562&lt;br /&gt;
relative_unranked_avg_prec30	all	0.1139&lt;br /&gt;
relative_unranked_avg_prec50	all	0.0410&lt;br /&gt;
relative_unranked_avg_prec100	all	0.0102&lt;br /&gt;
Kendall&amp;#039;s_tau5 	all	0.0000&lt;br /&gt;
Kendall&amp;#039;s_tau10	all	0.0000&lt;br /&gt;
Kendall&amp;#039;s_tau20	all	0.3846&lt;br /&gt;
Kendall&amp;#039;s_tau30	all	0.3846&lt;br /&gt;
Kendall&amp;#039;s_tau50	all	0.3846&lt;br /&gt;
Kendall&amp;#039;s_tau100	all	0.3846&lt;br /&gt;
utility_1.0_-1.0_0.0_0.0	all	9.5000&lt;br /&gt;
rcl_at_142_nonrel	all	1.0000&lt;br /&gt;
fallout_recall_0	all	0.0000&lt;br /&gt;
fallout_recall_14	all	1.0000&lt;br /&gt;
fallout_recall_28	all	1.0000&lt;br /&gt;
fallout_recall_42	all	1.0000&lt;br /&gt;
fallout_recall_56	all	1.0000&lt;br /&gt;
fallout_recall_71	all	1.0000&lt;br /&gt;
fallout_recall_85	all	1.0000&lt;br /&gt;
fallout_recall_99	all	1.0000&lt;br /&gt;
fallout_recall_113	all	1.0000&lt;br /&gt;
fallout_recall_127	all	1.0000&lt;br /&gt;
fallout_recall_142	all	1.0000&lt;br /&gt;
int_0.20R-prec 	all	1.0000&lt;br /&gt;
int_0.40R-prec 	all	1.0000&lt;br /&gt;
int_0.60R-prec 	all	1.0000&lt;br /&gt;
int_0.80R-prec 	all	1.0000&lt;br /&gt;
int_1.00R-prec 	all	1.0000&lt;br /&gt;
int_1.20R-prec 	all	0.7812&lt;br /&gt;
int_1.40R-prec 	all	0.6754&lt;br /&gt;
int_1.60R-prec 	all	0.6095&lt;br /&gt;
int_1.80R-prec 	all	0.5436&lt;br /&gt;
int_2.00R-prec 	all	0.5000&lt;br /&gt;
micro_prec     	all	1.0000&lt;br /&gt;
micro_recall   	all	1.0000&lt;br /&gt;
micro_bpref    	all	   nan&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Notes ==&lt;br /&gt;
Use -a option to generate Kendall&amp;#039;s Tau values.&lt;br /&gt;
&lt;br /&gt;
== Source Codes ==&lt;br /&gt;
svn co http://narasvn.umiacs.umd.edu/repository/src/webarc/trec_eval.8.1&lt;/div&gt;</summary>
		<author><name>Scsong</name></author>
	</entry>
</feed>