Scientific Context

The current lack of standardized practices and definitions in NLP systems hinders the progress of the field. Indeed, there is not always consensus on which evaluation methods are meaningful and fruitful, or which of their implementations are to be used with which parameters (eg. SacreBLEU, Post 2018). In some cases, there is no general agreement on the very definition of a task. This situation calls for work on standardizing NLP practices.

The International Organization for Standardization (ISO) has just created a dedicated working group on NLP (as a joint effort of the AI and Language committees), and 2 standards are already under way. Topics under consideration by the ISO standardization committees include NLP terminology, evaluation metrics, interoperability, annotation guidelines, good practices in NLP development/evaluation/corpora, documentation.

These topics are already heavily discussed in academia, and a number of informal guidelines have already been proposed. We believe that the creation of NLP standards can significantly benefit from the input of both NLP academics and industry NLP practitioners. Reciprocally, NLP researchers would benefit from getting involved in the standardization effort, thus ensuring that academia’s views are listened to, in particular in the context of the AI Act (the European regulation on AI that has been finalized in December), whose enforcement will strongly rely on those standards.

The objective of the STAND workshop is

to foster discussion on existing standards, their creation and use
to assess the current needs of the community for standardization
to share experience on the impact on the research activities when lacking good practices
to collect existing good practices (and propose new ones)

We invite contributions from NLP practitioners from both the industry and academia, as well as standardization experts.

See the call for contributions and the program.

Invited Speakers

Joakim Nivre
Matt Post
Dirk Hovy
APIL (French association of NLP companies)

Organizing committee

Lauriane Aufrant, Inria
Rania Wazir, leiwand.ai
Timothée Bernard, Université Paris Cité, LLF
Taras Holoyad, BNetzA
Yoann Dupont, Université Sorbonne Nouvelle
Maximin Coavoux, CNRS, LIG
Arnaud Ferré, Inrae, MAIAGE