Fernando Diaz

Associate Professor, Language Technologies Institute, Carnegie Mellon University
Research Scientist, Google Research
diazf [at] acm [dot] org
Introduction
My research focuses on the design and evaluation of information access systems, including information retrieval (e.g., web search) and recommender systems. This has included developing evaluation metrics, designing algorithms, and understanding the broader, societal implications of these technologies.

My recent projects include:
Evaluation	preference-based evaluation: improving the sensitivity of evaluation metrics by measuring pairwise preferences between systems.
	fairness in ranking: designing evaluation metrics for ranked lists of items capturing different notions of fairness.
	artificial intelligence and culture: understanding and measuring the impact of artificial intelligence on culture industries such as music, film, and literature.
Algorithm Design	retrieval-enhanced machine learning: designing algorithms to support machine learning systems.
	tip of the tongue retrieval: designing algorithms to help people re-find previously consumed content.

My previous projects have included text summarization in crisis informatics contexts, inferring user intent from mouse cursor behavior, aggregated search, time-sensitive ranking, and core retrieval algorithms. Detailed biographical information can be found on my curriculum vitae.
Publications
Organization
Conferences	ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2021), General Co-Chair
	ACM Conference on Fairness, Accountability, and Transparency (FAccT 2019), Program Co-Chair
	ACM International Conference on Web Search and Data Mining (WSDM 2014), General Co-Chair
Workshops	NeurIPS 2022 Workshop on Cultures in AI/AI in Culture 2022
	NeurIPS 2020 Workshop on Algorithmic Fairness through the Lens of Causality and Interpretability 2020
	CIFAR Workshop on AI, Recommendation, and the Curation of Culture 2019
	Workshop on Fairness, Accountability, and Transparency on the World Wide Web (FATWEB) 2017
	WSDM Workshop on the Ethics of Online Experimentation 2016
	SIGIR Workshop on Reproducibility, Inexplicability, and Generalizability of Results (RIGOR) 2015
	Social Web for Disaster Management (SWDM) 2015, 2016
	SIGIR Workshop on Time-Aware Information Access 2012, 2013, 2014
	ACM Workshop on Social Web Search and Mining: Analysis of User Generated Content Under Crisis 2011
TREC	Tip of The Tongue Retrieval 2023
	Fair Ranking 2019, 2020
	Real-Time Summarization 2016
	Temporal Summarization 2013, 2014, 2015
	Web 2013, 2014
Teaching
	Search Engines Language Technologies Institute CMU Spring 2025
	Quantitative Evaluation of Language Technologies Language Technologies Institute CMU Spring 2024, Fall 2025
	Web Search Engines Department of Computer Science Courant Institute of Mathematical Sciences NYU Spring 2013, Fall 2014, Fall 2016
	Experimental Design for Information Systems University of Trento Summer 2012
	Advanced Information Retrieval and Databases Department of Computer Science School of Engineering NYU Spring 2011
Code
	pref_eval Implementation of multiple preference-based evaluation methods for ranked lists and binary relevance.
	notation Maintain notation across LaTeX documents; can also generate a notation table.
	ranking-meta-evaluation-data Ranking metric meta-evaluation data across several retrieval and recommendation domains.
	expeval Code to compute expected exposure metrics.
	indri A clone of indri-5.12 with minor customizations.
	trec-data Scripts to download and standardize TREC query and document sets.
	latex-dependencies.py Generate .tex dependencies for a root LaTeX file.
	latex-merge Merge a collection of LaTeX sources into a single LaTeX file.
	kstem Standalone Krovetz stemmer.
FAQ
	Q: Why 841.io? A: Because it is short.