Tristan Miller

Department of Computer Science · University of Manitoba
+1 204 474 6792 tristan@logological.org ()

I'm a computational linguist with research interests in lexical semantics, historical online corpora, and computational detection and interpretation of humour. I currently head the Computational Linguistics at Manitoba (CLAM) Lab at the University of Manitoba's Department of Computer Science.

Research news

2026-09-21

Co-organizer of the CLEF 2026 workshop "JOKER: Automatic Humour Analysis"

2026-07-02

Co-chair for 2nd Workshop on Computational Humor (CHum 2026)

2025-12-16

Invited talk at the University of Tübingen: "Inclusive English in Science Communication"

2025-09-09

Co-organizer of the CLEF 2025 workshop "JOKER: Automatic Humour Analysis"

2025-09-05

Senior area co-chair for Semantics: Lexical, Sentence-level Semantics at EMNLP 2025

2025-08-06

Co-editor of EJHR special issue "AI Meets Humour"

2025-07-30

Interview for Undark and Smithsonian Magazine

2025-07-27

Faculty advisor to the Student Research Workshop Chairs at ACL 2025

2025-07-11

Co-convenor of the Humor and AI Panel at ISHS 2025

2025-04-02

Co-convenor of the panel "Beyond the Web: Usenet as an Archive of Digital Discourse" at BDCAM25

Publications

Liana Ermakova, Igor Kuzmin, Poojan Vachharajani, Tristan Miller, Anne-Gwenn Bosser, and Jaap Kamps.
CLEF 2026 JOKER track: Humour detection, search, and translation.
In Ricardo Campos, Adam Jatowt, Yanyan Lan, Mohammad Aliannejadi, Christine Bauer, Sean MacAvaney, Avishek Anand, Zhaochun Ren, Suzan Verberne, Nan Bai, and Masoud Mansoury, editors, Advances in Information Retrieval: 48th European Conference on Information Retrieval, ECIR 2026, Delft, The Netherlands, March 29–April 2, Proceedings, Part IV, volume 16486 of Lecture Notes in Computer Science (ISSN 0302-9743), pages 242–250, Cham, Switzerland, March 2026. Springer. ISBN 978-3-032-21320-4. DOI: 10.1007/978-3-032-21321-1_34.

Over the last few years, the JOKER Track has created an active community of researchers in NLP and IR working together on the non-literal use of language in text—which is still challenging for both AI models and humans, as it requires understanding implicit cultural references and double meanings. Its benchmarks on humorous text analysis, retrieval, and translation have become standard references. We made significant changes to the track's setup and tasks in 2024 and 2025, and propose continuing these to complete the test collections. The CLEF 2026 JOKER track will contain the following four tasks. Task 1 (Humor-aware Information Retrieval): retrieve short humorous texts for a query. Task 2 (Pun Translation): translate puns from English to French and Spanish. Task 3 (Onomastic Wordplay Translation): translate onomastic wordplay from English to French. Task 4 (Humor Generation): guided creativity.

@inproceedings{ermakova2026clef,

author = {Liana Ermakova and Igor Kuzmin and Poojan Vachharajani and Tristan Miller and Anne-Gwenn Bosser and Jaap Kamps},

editor = {Ricardo Campos and Adam Jatowt and Yanyan Lan and Mohammad Aliannejadi and Christine Bauer and Sean MacAvaney and Avishek Anand and Zhaochun Ren and Suzan Verberne and Nan Bai and Masoud Mansoury},

title = {{CLEF} 2026 {JOKER} Track: Humour Detection, Search, and Translation},

booktitle = {Advances in Information Retrieval: 48th {European} {Conference} on {Information} {Retrieval}, {ECIR}~2026, {Delft}, {The} {Netherlands}, {March}~29–{April}~2, Proceedings, Part~{IV}},

volume = {16486},

pages = {242--250},

series = {Lecture Notes in Computer Science},

month = mar,

year = {2026},

publisher = {Springer},

address = {Cham, Switzerland},

isbn = {978-3-032-21320-4},

issn = {0302-9743},

doi = {10.1007/978-3-032-21321-1_34},

}

Citation Abstract BibTeX

HTML PDF

Anna Palmann and Tristan Miller.
What's in a pun? assessing the relationship between phonological distance and perceived funniness of punning jokes.
Humor: International Journal of Humor Research, 38(4):643–658, 2025. ISSN 0933-1719. DOI: 10.1515/humor-2024-0060.

Punning is a form of humorous wordplay based on semantic ambiguity between two phonologically similar words – the pun and the target – in a context where both meanings are more or less acceptable. While the pun is expressed explicitly, the target is invoked implicitly in the text. Previous work has attempted to quantify and compare phonological features of puns and their targets, looking at correlations with the understandability of the jokes in which they occur. Our study quantifies the phonological distance between pun and target words and assesses possible correlations with funniness ratings of the corresponding jokes. Our statistical analyses on a large dataset of puns reveal a significant negative correlation between phonological distance and perceived funniness for two of the four phonological distance measures we applied. This finding supports the hypothesis, often (implicitly) made in previous research but never verified at this scale, that lower phonological distance between a pun and its target is associated with higher funniness ratings. The parameters of our study suggest that future work should examine the semantic features of pun and target in order to create a more holistic understanding of what contributes to the perceived funniness of punning jokes.

@article{palmann2025whats,

author = {Anna Palmann and Tristan Miller},

title = {What's in a Pun? Assessing the Relationship Between Phonological Distance and Perceived Funniness of Punning Jokes},

journal = {Humor: International Journal of Humor Research},

volume = {38},

number = {4},

pages = {643--658},

year = {2025},

issn = {0933-1719},

doi = {10.1515/humor-2024-0060},

}

Citation Abstract BibTeX

HTML PDF

Wei Zhao, Jennifer D'Souza, Steffen Eger, Anne Lauscher, Yufang Hou, Nafise Sadat Moosavi, Tristan Miller, and Chenghua Lin, editors.
Proceedings of the First Workshop on Human–LLM Collaboration for Ethical and Responsible Science Production (SciProdLLM).
Association for Computational Linguistics, Kerville, TX, December 2025. ISBN 979-8-89176-307-4.

Large language models (LLMs) are on the rapid rise to empower human researchers in science production at all stages, from the initial conception of research problems to reporting scientific discoveries. In 2025, American publisher Wiley surveyed 5,000 researchers across 70 countries and found that majority support LLM adoption in scientific production. While LLMs could enable faster, cost-effective research addressing global challenges, they raise ethical and trust concerns. To explore these issues, we organized the SciProdLLM workshop with the goal of proving a forum for presenting and discussing research on integrating LLMs into the typical research workflow: from ideation to experimentation to scientific writing, with a particular focus on human-centered approaches that ensure ethical and responsible use of LLMs. We also invite work that evaluates the quality of LLM-assisted research workflows and the resulting outputs.

@book{zhao2025first,

editor = {Wei Zhao and Jennifer D'Souza and Steffen Eger and Anne Lauscher and Yufang Hou and Nafise {Sadat Moosavi} and Tristan Miller and Chenghua Lin},

title = {Proceedings of the First Workshop on Human--{LLM} Collaboration for Ethical and Responsible Science Production ({SciProdLLM})},

month = dec,

year = {2025},

publisher = {Association for Computational Linguistics},

address = {Kerville, TX},

isbn = {979-8-89176-307-4},

}

Citation Abstract BibTeX

HTML PDF

Steffen Eger, Yong Cao, Jennifer D'Souza, Andreas Geiger, Christian Greisinger, Stephanie Gross, Yufang Hou, Brigitte Krenn, Anne Lauscher, Yizhi Li, Chenghua Lin, Nafise Sadat Moosavi, Wei Zhao, and Tristan Miller.
Transforming science with large language models: a survey on AI-assisted scientific discovery, experimentation, content generation, and evaluation.
ArXiv e-prints, 2502.05151, February 2025. DOI: 10.48550/arXiv.2502.05151.

With the advent of large multimodal language models, science is now at a threshold of an AI-based technological transformation. Recently, a plethora of new AI models and tools has been proposed, promising to empower researchers and academics worldwide to conduct their research more effectively and efficiently. This includes all aspects of the research cycle, especially (1) searching for relevant literature; (2) generating research ideas and conducting experimentation; generating (3) text-based and (4) multimodal content (e.g., scientific figures and diagrams); and (5) AI-based automatic peer review. In this survey, we provide an in-depth overview over these exciting recent developments, which promise to fundamentally alter the scientific research process for good. Our survey covers the five aspects outlined above, indicating relevant datasets, methods and results (including evaluation) as well as limitations and scope for future research. Ethical concerns regarding shortcomings of these tools and potential for misuse (fake science, plagiarism, harms to research integrity) take a particularly prominent place in our discussion. We hope that our survey will not only become a reference guide for newcomers to the field but also a catalyst for new AI-based initiatives in the area of “AI4Science”.

@article{eger2025transforming,

author = {Steffen Eger and Yong Cao and Jennifer D'Souza and Andreas Geiger and Christian Greisinger and Stephanie Gross and Yufang Hou and Brigitte Krenn and Anne Lauscher and Yizhi Li and Chenghua Lin and Nafise Sadat Moosavi and Wei Zhao and Tristan Miller},

title = {Transforming Science with Large Language Models: a Survey on {AI}-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation},

journal = {{ArXiv} e-prints},

volume = {2502.05151},

month = feb,

year = {2025},

doi = {10.48550/arXiv.2502.05151},

}

Citation Abstract BibTeX

PDF

More publications

Projects

Funded research projects

Computational Pun-derstanding

OFAI • 2019–

Principal investigator

An FWF-funded research project on the computer-assisted translation of wordplay

eFISK

DFKI • 2004–2005

Principal investigator

A study on attention-based information retrieval using eye tracking

@VISOR

DFKI • 2004–2005

Named investigator

A holistic context- and content-sensitive approach to information navigation

Events & organizations

ACL Student Research Workshop 2025

Vienna • 2025

Co-organizer

Computational linguistics workshop for students

CHum 2025

Abu Dhabi and online • 2025

Co-organizer

Workshop on computational humour

JOKER 2025

CLEF • 2025

Co-organizer

Workshop on automatic wordplay analysis

Beyond the Web: Usenet as an Archive of Digital Discourse

BDCAM25 • 2025

Co-convenor

Panel at the Born-Digital Collections, Archives and Memory Conference

Humor & Artificial Intelligence

ISHS • 2025

Co-convenor

Panel at the 2025 International Society for Humor Studies Conference

OFAI 2024 Spring Lecture Series

OFAI • 2024

Co-organizer

Public guest lecture series on artificial intelligence

Mawachihitotaak 2024

University of Manitoba • 2024

Co-organizer

Métis studies symposium

JOKER 2024

CLEF • 2024

Co-organizer

Workshop on automatic wordplay analysis

Humor & Artificial Intelligence

ISHS–HRC • 2024

Co-convenor

Panel at the 2024 International Society for Humor Studies Conference and 14th Humor Research Conference

OFAI 2023 Fall Lecture Series

OFAI • 2023

Co-organizer

Public guest lecture series on artificial intelligence

OFAI 2023 Lecture Series

OFAI • 2023

Co-organizer

Public guest lecture series on artificial intelligence

JOKER 2023

CLEF • 2023

Co-organizer

Workshop on automatic wordplay analysis

Humor & Artificial Intelligence

ISHS • 2023

Co-convenor

Panel at the 2023 International Society for Humor Studies Conference

OFAI 2022 Lecture Series

OFAI • 2022

Co-organizer

Public guest lecture series on artificial intelligence

JOKER 2022

CLEF • 2022

Co-organizer

Workshop on automatic pun and humour translation

Abusive and Offensive Humour

ISHS • 2022

Co-convenor

Reinhold Aman Memorial Panel at the 2022 International Society for Humor Studies Conference

Humor & Artificial Intelligence

ISHS • 2022

Co-convenor

Panel at the 2022 International Society for Humor Studies Conference

SemEval-2021 Task 12

ACL • 2021

Co-chair

Shared task on learning from disagreements

Big-8 Management Board

Usenet • 2020–

Co-chair

Administration of Usenet's original discussion hierarchies

Humor & Artificial Intelligence

ISHS • 2019

Co-convenor

Panel at the 2019 International Society for Humor Studies Conference

Humor & Artificial Intelligence

ISHS • 2018

Co-convenor

Panel at the 2018 International Society for Humor Studies Conference

SemEval-2017 Task 7

ACL • 2017

Co-chair

Shared task on the computational detection and interpretation of puns

GermEval 2015: LexSub

GSCL • 2015

Co-chair

Workshop for German-language lexical substitution

Software

sshrc-insight

University of Manitoba • 2024–

Lead developer

A LaTeX class for SSHRC Insight proposals

heria

OFAI • 2023–

Lead developer

A LaTeX class for Horizon Europe proposals

STUMP & WebSTUMP

The GNU Project • 2020–

Co-maintainer

Usenet robomoderation software and a Web-based front end

PunCAT

OFAI • 2020–

Co-developer

Interactive prototype tool for the computer-assisted translation of puns

UBY

TU Darmstadt • 2015–

Contributor

A large-scale unified lexical-semantic resource for natural language processing based on LMF

DKPro WSD

TU Darmstadt • 2012–

Lead developer

A modular, extensible Java framework for word sense disambiguation based on Apache UIMA

TWSI Sense Substituter

TU Darmstadt • 2012–

Contributor

A tool that produces lexical substitutions in context for over 1000 frequent nouns in English

DKPro Core

TU Darmstadt • 2011–

Contributor

A collection of UIMA software components for natural language processing

Biblet

DFKI • 2005–

Lead developer

A set of BibTeX bibliography styles (bst) which generate XHTML

openSUSE

The openSUSE Project • 2005–

PackagerQA

A complete, multi-purpose GNU/Linux distribution

GPP

DFKI • 2004–

Lead maintainer

A general-purpose preprocessor with customizable syntax

eoconv

DFKI • 2004–

Lead developer

Convert text files to and from various Esperanto text encodings

dlg2html

DFKI • 2004–

Lead developer

Convert DLG Pro message bases to HTML for archiving on the Web

SeaMonkey

Mozilla • 2001–

PackagerQA

An integrated web browser, composer, mail/news client, and IRC client (formerly the Mozilla Application Suite)

DELORES

Griffith University • 1999–2003

Lead developer

A forward-chaining reasoning engine for defeasible logic

WEBWEAVR-III

University of Regina • 1998–1999

Contributor

A Bayesian network research toolkit

CHEOPS

University of Regina • 1998–

Lead developer

A fully-functional chess engine capable of human-vs-human, human-vs-computer, and computer-vs-computer play

Publishing & documentation

Journal of Open Source Software

Open Journals • 2024–

Topic editor

A developer-friendly, open-access journal for research software

HUMOR

De Gruyter • 2020–

Consulting editor

International Journal of Humor Research

Maledicta article index

OFAI • 2020

Editor

Title and author index for Maledicta: The International Journal of Verbal Aggression

Babel: The Language Magazine

University of Huddersfield • 2012–

Advisory panelColumnist

A quarterly pop-science magazine that delivers cutting-edge linguistic research in an accessible and colourful format

The PracTeX Journal

TeX Users Group • 2004–2006

Editorial board

A journal on the practical use of TeX and friends

Miscellany

My interests in language, math, and computers were sparked and strengthened by exposure to the works of Willard R. Espy, Louis Phillips, Mike Keith, Dmitri Borgmann, Jim Butterfield, and others. These writers share a great talent for making technical or linguistic topics fun and accessible to a general audience. You can check out my own contributions to popular and recreational mathematics and linguistics, plus a few other odds and ends.

I also maintain an index of miscellaneous documents and websites I've produced which don't really fit into any other section.