📙 Vocabulary Reference

Please refer to this page for a list of terms and concepts that we will use throughout this course.

Large Language Models (LLMs), Pretrained Language Models (PLMs)1, Language Models (LMs), and foundation models

These terms all refer to roughly the same thing: large AI models (neural networks), which have usually been trained on a huge amount of text.

Masked Language Models (MLMs)

MLMs are a type of NLP model that use a special token, usually [MASK], to stand in for a hidden word in the input; the model then predicts the masked word from its vocabulary. For example, given the sentence "The dog is [MASK] the cat", the model will predict "chasing" with high probability.
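As an illustration, here is a minimal sketch using the Hugging Face transformers fill-mask pipeline; the bert-base-uncased checkpoint is just one common MLM chosen for this example, not something the course depends on.

```python
from transformers import pipeline

# bert-base-uncased is one common masked language model;
# any fill-mask checkpoint would work here.
fill_mask = pipeline("fill-mask", model="bert-base-uncased")

# The pipeline returns the most likely fillers for the [MASK] position.
for prediction in fill_mask("The dog is [MASK] the cat."):
    print(prediction["token_str"], round(prediction["score"], 3))
```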

Labels

The concept of labels is best understood with an example.

Say we want to classify some tweets as mean or not mean. If we have a list of tweets and their corresponding labels (mean or not mean), we can train a model to classify whether new tweets are mean. In general, labels are the possible answers for a classification task.

Label Space

All of the possible labels for a given task ('mean' and 'not mean' in the example above).
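To make these two terms concrete, here is a minimal scikit-learn sketch; the toy tweets and the TF-IDF plus logistic-regression pipeline are illustrative assumptions, not part of the course.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Hypothetical toy dataset: each tweet is paired with a label drawn
# from the label space {'mean', 'not mean'}.
tweets = [
    "Have a great day!",
    "Nobody likes you.",
    "You did so well!",
    "That was a dumb take.",
]
labels = ["not mean", "mean", "not mean", "mean"]

# Train a simple classifier on the labeled examples.
classifier = make_pipeline(TfidfVectorizer(), LogisticRegression())
classifier.fit(tweets, labels)

print(classifier.predict(["What a lovely photo!"]))  # e.g. ['not mean']
```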

Sentiment Analysis

Sentiment analysis is the task of classifying text into positive, negative, or other sentiments.
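For example, here is a minimal sketch with the Hugging Face transformers sentiment-analysis pipeline; the default checkpoint it downloads is an implementation detail assumed for this example.

```python
from transformers import pipeline

# The default sentiment-analysis checkpoint classifies text as
# POSITIVE or NEGATIVE with a confidence score.
classifier = pipeline("sentiment-analysis")

print(classifier("I love hotpockets"))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```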

"Model" vs. "AI" vs. "LLM"​

These terms are used somewhat interchangeably throughout this course, but they do not always mean the same thing: LLMs are a type of AI, as noted above, but not all AIs are LLMs. When we mention models in this course, we are referring to AI models, so you can consider the terms "model" and "AI" to be interchangeable here.

Verbalizer

In the classification setting, verbalizers are mappings from labels to words in a language model's vocabulary2. For example, consider performing sentiment classification with the following prompt:

Tweet: "I love hotpockets"
What is the sentiment of this tweet? Say 'pos' or 'neg'.

Here, the verbalizer is the mapping from the conceptual labels of positive and negative to the tokens pos and neg.
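One way to apply a verbalizer in code is to compare the language model's next-token scores for each label's token. This sketch assumes gpt2 purely for illustration and approximates multi-token verbalizations by their first token.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# gpt2 is assumed here purely for illustration.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = (
    'Tweet: "I love hotpockets"\n'
    "What is the sentiment of this tweet? Say 'pos' or 'neg'.\n"
    "Answer:"
)

# The verbalizer: conceptual labels -> tokens in the model's vocabulary.
verbalizer = {"positive": " pos", "negative": " neg"}

inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    next_token_logits = model(**inputs).logits[0, -1]

for label, word in verbalizer.items():
    # Approximate multi-token verbalizations by their first token.
    token_id = tokenizer.encode(word)[0]
    print(label, next_token_logits[token_id].item())
```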

Reinforcement Learning from Human Feedback (RLHF)

RLHF is a method for fine-tuning LLMs according to human preference data: human raters compare model outputs, a reward model is trained on those comparisons, and the LLM is then optimized against that reward model with reinforcement learning.
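As a rough illustration, human preference data is often collected as comparisons between two model outputs for the same prompt; the record below is hypothetical, not a format the course prescribes.

```python
# A hypothetical preference record of the kind used to train a reward
# model in RLHF: annotators mark which of two responses they prefer.
preference_example = {
    "prompt": "Explain photosynthesis in one sentence.",
    "chosen": "Plants use sunlight to turn water and CO2 into sugar and oxygen.",
    "rejected": "Photosynthesis is a thing plants do.",
}

print(preference_example["chosen"])
```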


  1. Branch, H. J., Cefalu, J. R., McHugh, J., Hujer, L., Bahl, A., del Castillo Iglesias, D., Heichman, R., & Darwishi, R. (2022). Evaluating the Susceptibility of Pre-Trained Language Models via Handcrafted Adversarial Examples. ↩
  2. Schick, T., & SchΓΌtze, H. (2020). Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference. ↩