← smac.pub home

A Python Interface to PISA!

link bibtex poster demonstration paper

Authors: Sean MacAvaney, Craig Macdonald

Appeared in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2022)

Links/IDs:
DOI 10.1145/3477495.3531656 DBLP conf/sigir/MacAvaneyM22 ACM 3477495.3531656 Google Scholar 7wWfoDgAAAAJ:TQgYirikUcIC Semantic Scholar b7da554d9f1f51e13a852ab0270dcd0d824c52e8 smac.pub sigir2022-pisa

Abstract:

PISA (Performant Indexes and Search for Academia) provides very efficient implementations of various retrieval algorithms over sparse inverted indices. The highly-optimized C++ implementation, however, has previously only been accessible via command line tools. From indexing to retrieval, 5–6 commands need to be executed in sequence, making the process relatively involved. Further complications when using PISA include a lengthy build process and minimal interoperability with other tools. In this work, we demonstrate a new tool that provides a native Python wrapper around PISA. The wrapper features a simplified interface that adheres to the PyTerrier API, making it easy to use (e.g., via Pandas DataFrames), apply to a multitude of datasets (e.g., those from the ir_datasets package) and combine with other methods (e.g., neural re-ranking and dense retrieval methods).

BibTeX @inproceedings{macavaney:sigir2022-pisa, author = {MacAvaney, Sean and Macdonald, Craig}, title = {A Python Interface to PISA!}, booktitle = {Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval}, year = {2022}, doi = {10.1145/3477495.3531656} }