Hugging Face ecosystem | HF NLP course | 2. Using Hugging Face Transformers | Other Flashcards
[page] Models: [page section] Creating a Transformer: [q] To initialize a BERT model from scratch, you first need to initialize a ? and a ?
from transformers import BertConfig, BertModel

# Building the config
config = BertConfig()

# Building the model from the config
model = BertModel(config)
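Note: a model built from the default configuration like this is randomly initialized; it needs to be trained (or loaded with from_pretrained instead) before it produces useful outputs.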
[page] Models: [page section] Creating a Transformer: [section] Different loading methods: [q] Code to load an already-trained Transformer model with a specific architecture.
from transformers import BertModel

model = BertModel.from_pretrained("bert-base-cased")
[page] Models: [page section] Creating a Transformer: [section] Different loading methods: [q] How can you customize your cache folder?
HF_HOME environment variable
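For example (a minimal sketch; the cache path is a placeholder), set the variable before transformers is imported:

import os
os.environ["HF_HOME"] = "/path/to/my_cache"  # placeholder path; set before importing transformers

from transformers import BertModel
model = BertModel.from_pretrained("bert-base-cased")  # weights are now cached under the custom folder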
[page] Models: [page section] Creating a Transformer: [section] Saving methods: [q] How do you save a model?
model.save_pretrained("directory_on_my_computer")
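A minimal round-trip sketch: save_pretrained writes the configuration (config.json) and the weights to the directory, and from_pretrained can reload the model from it:

from transformers import BertModel

model = BertModel.from_pretrained("bert-base-cased")
model.save_pretrained("directory_on_my_computer")  # writes config.json plus the weights file

# Reload later from the saved directory
model = BertModel.from_pretrained("directory_on_my_computer")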
[page] Models: [page section] Using a Transformer model for inference: [q] What does the model accept as inputs? What creates these inputs?
Tensors (of rectangular shape). Tokenizers.
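A minimal sketch (using a BERT checkpoint as an example): the tokenizer pads the batch so it is rectangular and returns tensors the model accepts:

from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
model = AutoModel.from_pretrained("bert-base-cased")

# padding=True pads shorter sequences so the batch is rectangular;
# return_tensors="pt" returns PyTorch tensors
inputs = tokenizer(["Hello!", "A much longer second sentence."], padding=True, return_tensors="pt")
outputs = model(**inputs)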
[page] Models: [video] Instantiate a transformers model: [q] How do you load a model configuration from a checkpoint?
from transformers import AutoConfig

bert_config = AutoConfig.from_pretrained("bert-base-cased")
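A model can then be built from that configuration (a sketch; unlike from_pretrained, from_config gives randomly initialized weights):

from transformers import AutoConfig, AutoModel

bert_config = AutoConfig.from_pretrained("bert-base-cased")
model = AutoModel.from_config(bert_config)  # same architecture as the checkpoint, random weights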
[page] Tokenizers: [page section] Loading and saving: [q] Code to load the BERT tokenizer trained with the same checkpoint as BERT.
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-cased")
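Equivalently, the checkpoint-agnostic Auto class picks the right tokenizer class from the checkpoint:

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")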
[page] Handling multiple sequences: [page section] Models expect a batch of inputs: [q] For a single sequence, initialize a model for classification and show the tokenizer's intermediate methods for generating the input IDs (3 steps).
"import torch from transformers import AutoTokenizer, AutoModelForSequenceClassification checkpoint = ""distilbert-base-uncased-finetuned-sst-2-english"" tokenizer = AutoTokenizer.from_pretrained(checkpoint) model = AutoModelForSequenceClassification.from_pretrained(checkpoint) sequence = ""I've been waiting for a HuggingFace course my whole life."" tokens = tokenizer.tokenize(sequence) ids = tokenizer.convert_tokens_to_ids(tokens) input_ids = torch.tensor([ids]) print(""Input IDs:"", input_ids) output = model(input_ids) print(""Logits:"", output.logits)"