KiPT: Knowledge-injected Prompt Tuning for Event Detection

Paper · Source

A diagram of a war

Event detection aims to detect events from the text by identifying and classifying event triggers (the most representative words). Most of the existing works rely heavily on complex downstream networks and require sufficient training data. Thus, those models may be structurally redundant and perform poorly when data is scarce. Prompt-based models are easy to build and are promising for few-shot tasks. However, current prompt-based methods may suffer from low precision because they have not introduced event-related semantic knowledge (e.g., part of speech, semantic correlation, etc.). To address these problems, this paper proposes a Knowledge-injected Prompt Tuning (KiPT) model. Specifically, the event detection task is formulated into a condition generation task. Then, knowledge-injected prompts are constructed using external knowledge bases, and a prompt tuning strategy is leveraged to optimize the prompts. Extensive experiments indicate that KiPT outperforms strong baselines, especially in few-shot scenarios.

Introduction. Events describe state changes of participating entities. The Event Detection (ED) task is one of the essential tasks in the Information Extraction field. Event triggers are the most representative words or phrases in events, and they are usually composed of verbs or nouns. There is a one-to-one correspondence between events and event triggers, so the ED task is equivalent to identifying and classifying event triggers. The ED task has a wide range of applications, providing helpful information for downstream tasks such as text summarization, auto summarization, machine question and answer (QA), etc. Meanwhile, with the vigorous development of Internet news and social media, ED has become a practical approach for extracting information from massive texts. Therefore, the ED task has attracted increasing attention with great academic and applied value in recent years. Most current ED models use a pre-trained language model to build complex downstream networks (including CNN, RNN, GCN, etc.)

Discussion / Conclusion. This paper proposes a prompt-based learning method for ED by introducing knowledge-injected prompt tuning. External knowledge and soft tokens are used to construct knowledge-injected prompts, which can be optimized through training. Comprehensive experiments demonstrate that KiPT outperforms current prompt-based ED models and strong baselines, especially in data-scarce scenarios. Through our method, prompt-based models can introduce task-related knowledge more conveniently and effectively. In the future, we will explore more knowledge injection approaches and their applications in other tasks.

Lines of inquiry this paper opens 24

Research framings built by reading the notes related to this paper — the questions it feeds into.

Can prompting inject entirely new knowledge into language models?

How do training data properties shape reasoning capability development?

What makes a good in-context learning example for a given task?

Why do continual learning scenarios trigger catastrophic forgetting and interference?

Why does reinforcement learning suppress output diversity compared to supervised fine-tuning?

How does active selection of training content differ from random reinforcement sampling?

How does example difficulty affect learning efficiency in language models?

Can learned priors effectively select and weight ensemble members by inference budget?

How do training priors constrain what context information can override?

Why is in-context learning brittle to the order of examples presented?

How do evaluation biases undermine LLM quality assessment systems?

Do accurate-looking LLM outputs hide structural failures in learning and reasoning?

Why does analytical depth demand trigger fabrication over transparent uncertainty?

Can prompting strategies overcome LLM biases without model fine-tuning?

What dimensions of recommendation quality do standard metrics miss?

Why does sophisticated measurement not validate the underlying scientific inference?

KiPT: Knowledge-injected Prompt Tuning for Event Detection

Synthesis notes from this paper's topics 8

Lines of inquiry this paper opens 24