academics

Memories of being an NL researcher in 1990

Jul 20, 2026 | ehudreiter

Since I am about to retire, I decided to “reminisce” about what it was like to be an NL researcher in 1990, when I got my PhD. The community was much smaller than in 2026, but in many ways it was nicer, with less pressure on early-career researchers and a more open research culture.

academics

What is the purpose of ACL conferences?

Jul 9, 2026 | ehudreiter | 5 Comments

The reviewing system for ACL conferences is struggling. In order to fix it, we should be clear about what the main purpose of the conferences is: meeting people, enhancing CVs, identifying good papers, or providing a home for exciting science. The best reviewing system depends on the goal of our conferences.

evaluation

Future of NLG evaluation

Jun 26, 2026 | ehudreiter

In a recent position paper, I argued that NLG evaluation in the future needs to become more rigorous. It also needs to move beyond benchmarks, and focus more on impact, qualitative, and safety evaluation.

academics

I am worried by NLP research culture

Jun 8, 2026 | ehudreiter | 3 Comments

In most ways NLG and NLP are much better in 2026 than when I got my PhD in 1990. Unfortunately, research culture has gotten *worse* in this period, which really worries me as I retire. We have a culture which does not value scientific rigour, tolerates cheating and fraud, and in many ways is closed to new ideas and new people.

building NLG systems

Software engineering of prompts

May 20, 2026 | ehudreiter

When we create complex prompts for LLMs, we face software engineering challenges similar to those in conventional software development (requirements, design, implementation and debugging, testing, maintenance). We need to better understand good software engineering for prompts.

academics

AI and CS Teaching

May 5, 2026 | ehudreiter

I am often asked how AI will impact Computer Science teaching. The biggest challenge is adapting what we teach so that it is relevant to a world where AI assistants are heavily used in software development. We should also use AI tutors to help teach. Least important is making assessments more resistant to AI cheating.

AI in Healthcare

Achieving my vision of personal AI health assistants

Apr 20, 2026 / Apr 23, 2026 | ehudreiter | 1 Comment

25 years ago I proposed personal health assistants as a grand challenge for computer science. LLMs have brought this vision closer to reality, but many challenges remain. These include understanding requirements, adapting to individual users, showing effectiveness in RCTs, and running on cheap phones with limited Internet access.

AI in Healthcare

Real-world safety and harms from patient-facing LLMs

Apr 6, 2026 / Apr 22, 2026 | ehudreiter | 2 Comments

There is very limited data on harms to real patients from using AI health chatbots. The limited data we have (from incident reports, clinical trials with patients, and health providers) suggests that bots are usually safe, but can cause harm in a few cases. More data is badly needed!

academics

Comparing performance of LLMs is not very interesting

Mar 24, 2026 | ehudreiter | 1 Comment

Quantitative comparisons of different LLMs are not very interesting in research papers, because the LLMs in question will probably be out of date by the time the paper is published. However, looking for behaviour which is shared by several LLMs is definitely interesting and worthwhile.

academics

Please follow the rules for ARR/ACL papers

Mar 16, 2026 / Mar 18, 2026 | ehudreiter | 3 Comments

ACL/ARR have rules and guidelines for how papers are written. Unfortunately many authors (and reviewers) ignore these, which makes their papers harder to read and less useful. Please follow the rules!

Ehud Reiter's Blog

Ehud's thoughts about Natural Language Generation. Also see my book on NLG.
