← smac.pub home

Genetic Generative Information Retrieval

link bibtex code slides short conference paper

Authors: Hrishikesh Kulkarni, Zachary Young, Nazli Goharian, Ophir Frieder, Sean MacAvaney

Appeared in: The 23rd ACM Symposium on Document Engineering (DocEng 2023)

DOI 10.1145/3573128.3609340 DBLP conf/doceng/KulkarniYGFM23 Google Scholar 7wWfoDgAAAAJ:lSLTfruPkqcC Semantic Scholar 21de7594eeefe160b9cf5ee32df8b2671e063e82 Enlighten 303221 smac.pub doceng2023-gen2ir


We demonstrate that a simple genetic algorithm can improve generative information retrieval by using a document's text as a genetic representation, a relevance model as a fitness function, and a large language model as a genetic operator that introduces diversity through random changes to the text to produce new documents. By 'mutating' highly-relevant documents and 'crossing over' content between documents, we produce new documents of greater relevance to a user's information need - validated in terms of estimated relevance scores from various models and via a preliminary human evaluation. We also identify challenges that demand further study.

BibTeX @inproceedings{kulkarni:doceng2023-gen2ir, author = {Kulkarni, Hrishikesh and Young, Zachary and Goharian, Nazli and Frieder, Ophir and MacAvaney, Sean}, title = {Genetic Generative Information Retrieval}, booktitle = {The 23rd ACM Symposium on Document Engineering}, year = {2023}, doi = {10.1145/3573128.3609340} }