Providers¶
accuracy
¶
Accuracy provider Supported Measures:
Accuracy(rel=ANY)@ANY
compat
¶
Version of the compatibility measure desribed in:
@article{10.1145/3451161,
author = {Clarke, Charles L. A. and Vtyurina, Alexandra and Smucker, Mark D.},
title = {Assessing Top-k Preferences},
journal = {ACM Transactions on Information Systems},
volume = {39},
number = {3},
articleno = {33},
numpages = {21},
year = {2021},
url = {https://doi.org/10.1145/3451161},
}
Supported Measures:
Compat(p=ANY,normalize=ANY)
cwl_eval
¶
cwl_eval, providing C/W/L (“cool”) framework measures.
@inproceedings{azzopardi2019cwl,
author = {Azzopardi, Leif and Thomas, Paul and Moffat, Alistair},
title = {cwl\_eval: An Evaluation Tool for Information Retrieval},
booktitle = {SIGIR},
year = {2019}
}
Supported Measures:
P(rel=ANY,judged_only=False)@ANY
RR(rel=ANY,judged_only=False)@NOT_PROVIDED
AP(rel=ANY,judged_only=False)@NOT_PROVIDED
RBP(rel=REQUIRED,p=ANY)@NOT_PROVIDED
BPM(T=ANY,min_rel=ANY,max_rel=REQUIRED)@ANY
SDCG(dcg='log2',min_rel=ANY,max_rel=REQUIRED)@REQUIRED
NERR8(min_rel=ANY,max_rel=REQUIRED)@REQUIRED
NERR9(min_rel=ANY,max_rel=REQUIRED)@REQUIRED
NERR10(p=ANY,min_rel=ANY,max_rel=REQUIRED)
NERR11(T=ANY,min_rel=ANY,max_rel=REQUIRED)
INST(T=ANY,min_rel=ANY,max_rel=REQUIRED)
INSQ(T=ANY,min_rel=ANY,max_rel=REQUIRED)
gdeval
¶
gdeval
Supported Measures:
nDCG(dcg='exp-log2',gains=NOT_PROVIDED,judged_only=False)@REQUIRED
ERR@REQUIRED
judged
¶
python implementation of judgment rate
Adapted from OpenNIR’s implementation: https://github.com/Georgetown-IR-Lab/OpenNIR/blob/master/onir/metrics/judged.py
Supported Measures:
Judged@ANY
msmarco
¶
MS MARCO’s implementation of RR
Supported Measures:
RR(rel=ANY,judged_only=False)@ANY
pyndeval
¶
pyndeval
Supported Measures:
ERR_IA(rel=ANY,judged_only=ANY)@ANY
nERR_IA(rel=ANY,judged_only=ANY)@ANY
alpha_DCG(alpha=ANY,rel=ANY,judged_only=ANY)@ANY
alpha_nDCG(alpha=ANY,rel=ANY,judged_only=ANY)@ANY
NRBP(alpha=ANY,beta=ANY,rel=ANY)
nNRBP(alpha=ANY,beta=ANY,rel=ANY)
AP_IA(rel=ANY,judged_only=ANY)
P_IA(rel=ANY,judged_only=ANY)@ANY
StRecall(rel=ANY)@ANY
pytrec_eval
¶
pytrec_eval
https://github.com/cvangysel/pytrec_eval
@inproceedings{VanGysel2018pytreceval,
title={Pytrec\_eval: An Extremely Fast Python Interface to trec\_eval},
author={Van Gysel, Christophe and de Rijke, Maarten},
publisher={ACM},
booktitle={SIGIR},
year={2018},
}
Supported Measures:
P(rel=ANY,judged_only=ANY)@ANY
RR(rel=ANY,judged_only=ANY)@NOT_PROVIDED
Rprec(rel=ANY,judged_only=ANY)
AP(rel=ANY,judged_only=ANY)@ANY
nDCG(dcg='log2',gains=ANY,judged_only=ANY)@ANY
R(judged_only=ANY)@ANY
Bpref(rel=ANY)
NumRet(rel=ANY)
NumQ
NumRel(rel=1)
SetAP(rel=ANY,judged_only=ANY)
SetF(rel=ANY,beta=ANY,judged_only=ANY)
SetP(rel=ANY,relative=ANY,judged_only=ANY)
SetR(rel=ANY)
Success(rel=ANY,judged_only=ANY)@ANY
IPrec(judged_only=ANY)@ANY
infAP(rel=ANY)
ranx
¶
ranx
https://amenra.github.io/ranx/
@misc{ranx2021,
title = {ranx: A Blazing-Fast Python Library for Ranking Evaluation and Comparison},
author = {Bassani, Elias},
year = {2021},
publisher = {GitHub},
howpublished = {\url{https://github.com/AmenRa/ranx}},
}
Supported Measures:
P(rel=ANY,judged_only=False)@ANY
SetP(rel=ANY,judged_only=False)
RR(rel=ANY,judged_only=False)@NOT_PROVIDED
Rprec(rel=ANY,judged_only=False)
AP(rel=ANY,judged_only=False)@ANY
nDCG(dcg=('log2', 'exp-log2'),gains=NOT_PROVIDED,judged_only=False)@ANY
R(judged_only=False)@ANY
SetR(rel=ANY)
NumRet(rel=REQUIRED)
Success(rel=ANY,judged_only=False)@REQUIRED
runtime
¶
Supports measures that are defined at runtime via ir_measures.define() and ir_measures.define_byquery().
Supported Measures:
trectools
¶
trectools
https://github.com/joaopalotti/trectools
@inproceedings{palotti2019,
author = {Palotti, Joao and Scells, Harrisen and Zuccon, Guido},
title = {TrecTools: an open-source Python library for Information Retrieval practitioners involved in TREC-like campaigns},
series = {SIGIR'19},
year = {2019},
location = {Paris, France},
publisher = {ACM}
}
Supported Measures:
P(rel=1,judged_only=False)@ANY
RR(rel=1,judged_only=False)@NOT_PROVIDED
Rprec(rel=1,judged_only=False)
AP(rel=1,judged_only=False)@ANY
nDCG(dcg=ANY,gains=NOT_PROVIDED,judged_only=False)@ANY
Bpref(rel=1)
RBP(p=ANY,rel=ANY)@ANY