Retrieval-Enhanced Machine Learning
Retrieval-Enhanced Machine Learning (REML) explores how machine learning models can be augmented with retrieval components to improve performance, interpretability, and scalability. This framework treats information access systems as supporting both human users and task-driven machines, applying core principles of indexing, representation, retrieval, and ranking to enhance model capabilities. REML has applications across diverse domains including natural language processing, computer vision, time series prediction, and computational biology. Recent work explores fairness considerations in retrieval-augmented generation (RAG) systems, ensuring equitable exposure of sources and balanced attribution in generated content.
Publications
J. Kalra, X. Zhao, T.-E. Kim, F. Cai, F. Diaz, T. Wu
EMNLP, 2025
T.-E. Kim and F. Diaz
ICTIR, 2025
F. Diaz, A. Drozdov, T.-E. Kim, A. Salemi, H. Zamani
SIGIR, 2025
T.-E. Kim and F. Diaz
SIGIR LiveRAG Workshop, 2025
T.-E. Kim, A. Salemi, A. Drozdov, F. Diaz, H. Zamani
arxiv, 2024
H. Zamani, F. Diaz, M. Dehghani, D. Metzler, M. Bendersky
SIGIR 2022 (Perspectives)