Machine Learning for Retrieval

This research area applies machine learning techniques to improve various aspects of information retrieval, from ranking and matching to cross-market adaptation and personalization. Key work includes developing neural ranking models that combine local and distributed text representations, creating density-based user representations using Gaussian process regression for multi-interest personalization, and learning to rank with labeled features rather than document labels. The research encompasses cross-market model adaptation using pairwise preference data, methods for transferring retrieval knowledge across collections with non-overlapping vocabularies, and techniques for generating quick links and proactive suggestions.

Publications

T.-E. Kim and F. Diaz
SIGIR LiveRAG Workshop, 2025
H. Wu, O. Meshi, M. Zoghi, F. Diaz, X. Liu, C. Boutilier, M. Karimzadehgan
NeurIPS 2024
B. Mitra, F. Diaz, N. Craswell
WWW 2017
F. Diaz
ICTIR 2016
J. Seo, F. Diaz, E. Gabrilovich, V. Josifovski, B. Pang
WWW 2011
J. Bai, F. Diaz, Y. Chang, Z. Zheng
COLING 2010
F. Diaz and D. Metzler
IJCAI 2007