Domain 3 Flashcards

1
Q

T or F: LLMs are cost-effective and easy to maintain

A

False. The duration and cost of training a model are important considerations because training can be expensive in terms of hardware, storage, and more.

2
Q

Name five considerations of using foundation models

A

Latency constraints, inference speed, real-time requirements, architecture, and complexity.

3
Q

T or F: Accuracy is recommended with datasets that are not evenly distributed or imbalanced.

A

False. Accuracy is not recommended for datasets that are not evenly distributed or imbalanced. For example, a model that always predicts the majority class of a 95/5 dataset scores 95% accuracy while never detecting the minority class.

4
Q

Name some metrics you can use to evaluate model performance

A

Such metrics might include accuracy, precision, recall, F1 score, root mean squared error (RMSE), mean average precision (MAP), and mean absolute error (MAE).
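As an illustration (not from the source material), here is a minimal sketch computing several of these metrics with scikit-learn; the label and prediction arrays are made-up toy data.

```python
# Toy illustration of common evaluation metrics using scikit-learn.
# The labels and predictions below are made-up example data.
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, mean_squared_error, mean_absolute_error)

# Classification example
y_true = [1, 0, 1, 1, 0, 1]
y_pred = [1, 0, 0, 1, 0, 1]
print("accuracy :", accuracy_score(y_true, y_pred))
print("precision:", precision_score(y_true, y_pred))
print("recall   :", recall_score(y_true, y_pred))
print("F1 score :", f1_score(y_true, y_pred))

# Regression example
y_true_reg = [3.0, 5.0, 2.5, 7.0]
y_pred_reg = [2.8, 5.4, 2.0, 7.5]
print("RMSE:", mean_squared_error(y_true_reg, y_pred_reg) ** 0.5)
print("MAE :", mean_absolute_error(y_true_reg, y_pred_reg))
```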

5
Q

Be aware of biases that might be present in the training data. It’s important to understand how to mitigate risks, address ethical concerns, and make informed decisions about model selection and fine-tuning.

A
6
Q

Another consideration is the availability and compatibility of the pre-trained model

A

You should check whether the model is compatible with your framework, language, and environment, and confirm that it has a license and documentation. You should also check whether the model has been updated and maintained regularly and whether it has any known issues or limitations.

7
Q

interpretability

A

The ability to interpret and explain model outcomes is important. Being transparent refers to interpretability: it means being able to explain mathematically, through coefficients and formulas, why a model makes a certain prediction. This interpretability is possible if the model is simple enough, but foundation models are not interpretable by design because they are extremely complex. If interpretability is a requirement, then pre-trained foundation models might not be the best choice.

8
Q

How is explainability different from interpretability?

A

Explainability attempts to explain this black box by approximating it locally with a simpler model that is interpretable.
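To make the idea concrete, here is a hedged, minimal sketch of the local-surrogate approach (in the spirit of techniques such as LIME, not any specific library's API): perturb the input around one instance, query the black-box model, and fit a simple interpretable model to those local predictions.

```python
# Minimal sketch of local surrogate explainability: approximate a black-box
# model around one instance with a linear model. The black_box function is
# a stand-in for any opaque model.
import numpy as np
from sklearn.linear_model import LinearRegression

def black_box(X):
    # Stand-in for an uninterpretable model (e.g., a large ensemble or NN).
    return np.sin(X[:, 0]) + X[:, 1] ** 2

instance = np.array([0.5, 1.0])

# Sample perturbations in a small neighborhood of the instance.
rng = np.random.default_rng(0)
neighborhood = instance + rng.normal(scale=0.1, size=(200, 2))
targets = black_box(neighborhood)

# Fit a simple, interpretable surrogate locally.
surrogate = LinearRegression().fit(neighborhood, targets)
print("local feature weights:", surrogate.coef_)
```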

9
Q

Greater complexity might lead to enhanced performance…

A

…but it can also increase costs. The more complicated the model, the harder it is to explain its outputs. There are other considerations too, such as hardware constraints, maintenance and updates, data privacy, transfer learning, and more.

10
Q

What is inference?

A

Inference is where you process new data through the model to make predictions. It is the process of generating an output from an input that you provide to the model.

11
Q

Amazon Bedrock foundation models support what inference parameters?

A

Temperature, Top K, and Top P, to control randomness and diversity in the response. Amazon Bedrock also supports parameters such as response length, penalties, and stop sequences to limit the length of the response.
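For illustration, a minimal boto3 sketch of passing these parameters when invoking a model on Amazon Bedrock. The request-body fields vary by model provider; the Anthropic Claude Messages format and the model ID below are assumptions for the sketch, not details given by the source.

```python
# Sketch: setting temperature, top_k, and top_p on an Amazon Bedrock invocation.
# Request-body fields are provider-specific; this assumes an Anthropic Claude model.
import json
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

body = {
    "anthropic_version": "bedrock-2023-05-31",
    "max_tokens": 300,            # response length limit
    "temperature": 0.5,           # randomness
    "top_k": 250,                 # cut-off on candidate tokens
    "top_p": 0.9,                 # probability-mass cut-off
    "messages": [
        {"role": "user",
         "content": [{"type": "text", "text": "Summarize the benefits of RAG."}]}
    ],
}

response = bedrock.invoke_model(
    modelId="anthropic.claude-3-sonnet-20240229-v1:0",  # assumed model ID
    body=json.dumps(body),
)
print(json.loads(response["body"].read()))
```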

12
Q

What is a prerequisite to creating a vector database?

A

A vector database is filled with dense vectors by processing input data, generally text, with an ML model, generally an embedding model. So it’s important to understand that a machine learning model, along with the indexing technology itself, is a prerequisite to creating a vector database. Vector databases are the factual reference of foundation model-based applications, helping the model retrieve trustworthy data. Foundation models use vector databases as an external data source to improve their capabilities in search, recommendation, and text generation use cases. Vector databases add capabilities for efficient and fast lookup, and they provide data management, fault tolerance, authentication and access control, and a query engine.
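As a hedged illustration of the "embedding model as prerequisite" point, here is a sketch that turns text into a dense vector with an embedding model on Amazon Bedrock before it would be stored in a vector database. The Titan Embeddings request format and model ID are assumptions based on common usage, not details given by the source.

```python
# Sketch: producing a dense vector from text with an embedding model,
# the prerequisite step before populating a vector database.
# Assumes the Amazon Titan Embeddings request/response format.
import json
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

def embed(text: str) -> list:
    response = bedrock.invoke_model(
        modelId="amazon.titan-embed-text-v1",       # assumed model ID
        body=json.dumps({"inputText": text}),
    )
    payload = json.loads(response["body"].read())
    return payload["embedding"]                     # dense vector to store/index

vector = embed("Our return policy allows refunds within 30 days.")
print(len(vector), vector[:5])
```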

13
Q

Knowledge Bases for Amazon Bedrock

A

give you the ability to collect data sources into a repository of information. This way, you can build an application that takes advantage of retrieval augmented generation (RAG).

14
Q

What two components does RAG have?

A

RAG combines two components: a retriever component, which searches through a knowledge base, and a generator component, which produces outputs based on the retrieved information. This combination helps the model access up-to-date and domain-specific knowledge beyond its training data.
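A minimal sketch of the two components, assuming embeddings are already available as NumPy vectors and that a generate() helper wraps whatever foundation model you use; both are illustrative stand-ins, not APIs named by the source.

```python
# Sketch of the two RAG components: a retriever (vector similarity search)
# and a generator (a foundation model prompted with the retrieved context).
import numpy as np

# --- Retriever: tiny in-memory "knowledge base" of (text, embedding) pairs ---
documents = ["Refunds are allowed within 30 days.",
             "Shipping takes 3-5 business days."]
doc_vectors = np.random.rand(len(documents), 384)   # placeholder embeddings

def retrieve(query_vector, k=1):
    sims = doc_vectors @ query_vector / (
        np.linalg.norm(doc_vectors, axis=1) * np.linalg.norm(query_vector))
    top = np.argsort(sims)[::-1][:k]
    return [documents[i] for i in top]

# --- Generator: stand-in for a call to a foundation model ---
def generate(prompt):
    return f"<model completion for: {prompt[:60]}...>"

query = "How long do I have to return an item?"
query_vector = np.random.rand(384)                  # placeholder query embedding
context = "\n".join(retrieve(query_vector, k=1))
prompt = f"Use the context to answer.\nContext:\n{context}\n\nQuestion: {query}"
print(generate(prompt))
```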

15
Q

AWS services that help store embeddings within vector databases.

A

Examples include Amazon OpenSearch Service, Amazon Aurora, Redis, Amazon Neptune, Amazon DocumentDB (with MongoDB compatibility), and Amazon RDS for PostgreSQL.

16
Q

Amazon RDS for PostgreSQL also supports the pgvector extension

A

to store embeddings and perform efficient searches.
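A hedged sketch of what using pgvector on PostgreSQL (for example, Amazon RDS for PostgreSQL) might look like with psycopg2; the connection details, table name, and the tiny three-dimensional vectors are made up for brevity.

```python
# Sketch: storing embeddings and running a similarity search with pgvector.
# Connection details, table name, and the 3-dimensional vectors are illustrative.
import psycopg2

conn = psycopg2.connect(host="my-rds-endpoint", dbname="mydb",
                        user="myuser", password="mypassword")
cur = conn.cursor()

cur.execute("CREATE EXTENSION IF NOT EXISTS vector;")
cur.execute("""CREATE TABLE IF NOT EXISTS items (
                   id bigserial PRIMARY KEY,
                   content text,
                   embedding vector(3));""")
cur.execute("INSERT INTO items (content, embedding) VALUES (%s, %s)",
            ("refund policy", "[0.1, 0.2, 0.3]"))

# Nearest-neighbor search with the L2 distance operator (<->).
cur.execute("SELECT content FROM items ORDER BY embedding <-> %s LIMIT 5",
            ("[0.1, 0.2, 0.25]",))
print(cur.fetchall())

conn.commit()
cur.close()
conn.close()
```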

17
Q

Agents for Amazon Bedrock

A

is a fully managed AI capability from AWS that helps you build applications with foundation models. Agents can automatically break down tasks and generate the required orchestration logic or write custom code. Agents can securely connect to your databases through APIs, ingest and structure the data for machine consumption, and augment it with contextual details to produce more accurate responses and fulfill requests.

18
Q

Temperature:

A

Adjusts the randomness of the model’s response. A lower temperature results in more focused responses, while a higher temperature leads to more diverse outputs.

19
Q

Top-k

A

defines the cut-off for the number of words (tokens) for each completion to choose from, ordered by their probabilities. A lower Top K value reduces the chance of an unusual word being selected.

20
Q

Top-p

A

Top P works similarly to Top K. It is the percentage of most-likely candidates that the model considers for the next token.

Choose a lower value to decrease the size of the pool and limit the options to more likely outputs.

Choose a higher value to increase the size of the pool and allow the model to consider less likely outputs.
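To make temperature, Top K, and Top P concrete, here is an illustrative NumPy sketch of how the three parameters shape the next-token distribution before sampling; it is a conceptual toy, not the sampler any particular provider uses.

```python
# Toy illustration of temperature, top-k, and top-p (nucleus) sampling
# over next-token probabilities. Conceptual only; real samplers differ in detail.
import numpy as np

logits = np.array([2.0, 1.5, 0.5, 0.1, -1.0])       # scores for 5 candidate tokens

def sample(logits, temperature=1.0, top_k=None, top_p=None, seed=0):
    rng = np.random.default_rng(seed)
    probs = np.exp(logits / temperature)             # lower temperature -> sharper
    probs /= probs.sum()

    if top_k is not None:                            # keep only the k most likely
        cutoff = np.sort(probs)[-top_k]
        probs = np.where(probs >= cutoff, probs, 0.0)
        probs /= probs.sum()

    if top_p is not None:                            # smallest set with mass >= p
        order = np.argsort(probs)[::-1]
        cumulative = np.cumsum(probs[order])
        keep = order[: np.searchsorted(cumulative, top_p) + 1]
        mask = np.zeros_like(probs)
        mask[keep] = probs[keep]
        probs = mask / mask.sum()

    return rng.choice(len(probs), p=probs)

print(sample(logits, temperature=0.3, top_k=3, top_p=0.9))
```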

21
Q

The BERT score is a metric developed to assess the quality of generated responses compared to a set of reference responses. It uses pre-trained models to calculate semantic similarity between the generated responses and the reference answers.

A
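As an illustrative aside (not from the source), the open-source bert-score package can compute this kind of semantic-similarity score; the sentences below are toy examples.

```python
# Toy example of computing BERTScore between generated and reference text
# using the open-source bert-score package (pip install bert-score).
from bert_score import score

candidates = ["The cat sat on the mat."]
references = ["A cat was sitting on the mat."]

# Returns precision, recall, and F1 tensors, one entry per candidate/reference pair.
P, R, F1 = score(candidates, references, lang="en")
print(f"BERTScore F1: {F1[0].item():.3f}")
```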
22
Q

ROUGE, or Recall-Oriented Understudy for Gisting Evaluation, is a set of metrics and a software package used for evaluating automatic summarization and machine translation software in natural language processing. The metrics compare an automatically produced summary or translation against a reference summary or translation (or a set of references) produced by humans.

A
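For illustration, a small sketch with the open-source rouge-score package; the summary strings are toy data, not from the source.

```python
# Toy example of ROUGE scoring with the rouge-score package
# (pip install rouge-score). Compares a generated summary to a reference.
from rouge_score import rouge_scorer

reference = "The committee approved the budget for next year."
generated = "The committee approved next year's budget."

scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)
scores = scorer.score(reference, generated)
for name, result in scores.items():
    print(name, f"F1={result.fmeasure:.3f}")
```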
23
Q

What is few-shot prompting?

A

It is when you provide a few examples to help LLMs perform better and calibrate their output to meet your expectations.
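A small illustrative few-shot prompt (the reviews and labels are made up):

```python
# Illustrative few-shot prompt: a handful of labeled examples followed by the
# new input the model should classify. The examples are made up.
few_shot_prompt = """Classify the sentiment of each review as Positive or Negative.

Review: "The battery lasts all day and the screen is gorgeous."
Sentiment: Positive

Review: "It stopped working after a week and support never replied."
Sentiment: Negative

Review: "Setup took five minutes and everything just worked."
Sentiment:"""
print(few_shot_prompt)
```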

24
Q

What is zero-shot prompting?

A

It is when you provide a prompt with no examples; for instance, a sentiment classification prompt that includes no labeled examples.

25
Q

You can also use a prompt template.

A

Templates might include instructions, few-shot examples, and specific content and questions for different use cases
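A minimal sketch of such a template using plain Python string formatting; the instruction text and field names are arbitrary placeholders.

```python
# Minimal prompt template sketch using plain string formatting.
# The instruction text and field names are arbitrary placeholders.
TEMPLATE = """You are a customer-support assistant.
Answer using only the context below. If the answer is not in the context, say so.

Context:
{context}

Question:
{question}

Answer:"""

prompt = TEMPLATE.format(
    context="Refunds are accepted within 30 days of purchase.",
    question="Can I return an item after three weeks?",
)
print(prompt)
```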

26
Q

Use chain-of-thought prompting to break down the reasoning process into intermediate steps. This kind of prompting can improve the quality and coherence of the final output.

A
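As an illustration, a chain-of-thought style prompt simply asks for intermediate steps before the final answer; the arithmetic word problem below is made up.

```python
# Illustrative chain-of-thought prompt: the instruction asks the model to
# reason through intermediate steps before giving the final answer.
cot_prompt = """Question: A bakery sold 14 muffins in the morning and twice as many
in the afternoon. How many muffins were sold in total?

Think through the problem step by step, showing each intermediate calculation,
and then state the final answer on its own line."""
print(cot_prompt)
```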
27
Q

prompt tuning,

A

where the actual prompt text is replaced with a continuous embedding vector that is optimized during training. This technique helps the prompt be fine-tuned for a specific task while keeping the rest of the model parameters frozen, which can be more efficient than full fine-tuning.
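For illustration, the Hugging Face PEFT library exposes this technique; the base model name and the configuration values below are assumptions for the sketch, not prescribed by the source.

```python
# Sketch of prompt tuning with the Hugging Face PEFT library: a small set of
# virtual prompt embeddings is trained while the base model stays frozen.
# Model name and hyperparameters are illustrative assumptions.
from transformers import AutoModelForCausalLM
from peft import PromptTuningConfig, PromptTuningInit, TaskType, get_peft_model

base_model = AutoModelForCausalLM.from_pretrained("gpt2")

config = PromptTuningConfig(
    task_type=TaskType.CAUSAL_LM,
    prompt_tuning_init=PromptTuningInit.TEXT,
    prompt_tuning_init_text="Classify the sentiment of this review:",
    num_virtual_tokens=8,
    tokenizer_name_or_path="gpt2",
)

model = get_peft_model(base_model, config)
model.print_trainable_parameters()   # only the prompt embeddings are trainable
```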

28
Q

What are common tasks supported by LLMs on Amazon Bedrock?

A

A few are classification, question and answer with and without context, summarization, open-ended text generation, code generation, math, and reasoning or logical thinking.

29
Q

Latent space

A

is the encoded knowledge of language in a large language model. It’s the stored patterns of data that capture relationships and, when prompted, reconstruct language from those patterns.

30
Q

If you prompt a language model and get a dissatisfactory or negative response, then your prompt might be insufficient for the model.

A

But it’s also possible, especially if your model is smaller, that the model’s latent space might not have enough information about the topic of your prompt. That situation can cause the model to hallucinate: a model that doesn’t know the exact specifics of a prompt, because the knowledge isn’t in its latent space, will choose the closest match. This result might be interpreted as a mistake, but the model is actually functioning correctly.

31
Q

Prompt engineering has several key techniques.

First, be specific and provide clear instructions or specifications for the task at hand; for example, include the desired format, examples, comparisons, style, tone, output length, and detailed context.

Second, include examples of the desired behavior and direction, such as sample texts, data formats, templates, code, graphs, charts, and more.

Third, experiment and use an iterative process to test prompts and understand how the modifications alter the responses.

Fourth, know the strengths and weaknesses of your model.

Fifth, balance simplicity and complexity in your prompts to avoid vague, unrelated, or unexpected answers.

Sixth, especially for prompt engineers, use multiple comments to offer more context without cluttering the prompt.

Seventh, add guardrails.

A
32
Q

What is prompt injection?

A

It describes attacks that manipulate the prompt.

33
Q

jailbreaking

A

When an attacker tries to bypass the guardrails that you have established, this is called jailbreaking. It differs from other prompt attacks because it specifically targets the safety measures that have been put in place, such as guardrails.

34
Q

Hijacking

A

is an attempt to change or manipulate the original prompt with new instructions.

35
Q

Poisoning

A

is another risk of prompt engineering where harmful instructions are embedded in messages, emails, web pages, and more

36
Q

Guardrails

A

Guardrails provide safety and privacy controls to manage interactions in your generative AI applications. You can define topics within the context of your application that are not desirable, and you can set words to be blocked. You can configure thresholds to filter content across categories that might be harmful and to detect prompt attacks such as jailbreaks and prompt injections. You can also filter inputs that might contain sensitive data.

37
Q

What are the key elements of training a foundation model?

A

They include pre-training, fine-tuning, and continuous pre-training

38
Q

pre-training,

A

which is a complex process. It requires millions of GPU (graphics processing unit) compute hours, terabytes to petabytes of data, trillions of tokens, trial and error, and time.

39
Q

Fine-tuning

A

is a process that extends the training of the model to improve the generation of completions for a specific task. It is a supervised learning process and you use a dataset of labeled examples to update the weights of the LLM

40
Q

This happens when the whole fine-tuning process modifies the weights of the original LLM. It can improve performance on the single fine-tuned task, but it can degrade performance on other tasks.

A

Catastrophic forgetting

41
Q

Parameter-efficient fine-tuning, PEFT,

A

is a process and set of techniques that freeze or preserve the parameters and weights of the original LLM and fine-tune or train a small number of task-specific adapter layers and parameters. PEFT reduces the compute and memory that are needed because it fine-tunes only a small set of model parameters.

42
Q

Low-rank adaptation or LoRA,

A

is a popular PEFT technique that also preserves or freezes the original weights of the foundation model and injects new trainable low-rank matrices into each layer of the transformer architecture.
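For illustration, a minimal LoRA configuration with the Hugging Face PEFT library; the base model, target modules, and rank are assumptions that vary by architecture.

```python
# Sketch of LoRA with the Hugging Face PEFT library: the base weights stay
# frozen and small low-rank matrices are added to chosen layers.
# Model name, target modules, and rank are illustrative assumptions.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, TaskType, get_peft_model

base_model = AutoModelForCausalLM.from_pretrained("gpt2")

lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                        # rank of the low-rank update matrices
    lora_alpha=16,              # scaling factor
    lora_dropout=0.05,
    target_modules=["c_attn"],  # attention projection in GPT-2; varies by model
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()   # a small fraction of the original weights
```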

43
Q

T or F: PEFT and LoRA modify the weights of your model, but not the representations.

A

True

44
Q

Multitask fine-tuning

A

is an extension of fine-tuning a single task. Multitask fine-tuning requires a lot of data. For example, the dataset might contain examples that instruct a model to complete multiple tasks such as reviews or ratings, summarization, translating code, and more. This produces an instruction tuned model that has learned how to complete many different tasks simultaneously.

45
Q

What fine-tuning process modifies the weights of the model to adapt to domain-specific data?

A

Domain adaptation fine-tuning gives you the ability to use the pre-trained foundation models and adapt them to specific tasks by using limited domain-specific data. You can use domain adaptation fine-tuning to help your model work with domain-specific language such as industry jargon, technical terms, or other specialized data

46
Q

Amazon SageMaker JumpStart provides the capability to fine-tune a large language model, particularly a text generation model, on a domain-specific dataset. You can fine-tune models with your custom dataset to improve performance in specific domains.

A
47
Q

reinforcement learning from human feedback, or RLHF

A

RLHF uses reinforcement learning to fine-tune the LLM with human feedback data to better align the model with human preferences

48
Q

If you need low-code data preparation, you can use this to create data flows that define your ML data pre-processing and feature engineering workflows with little to no coding.

A

Amazon SageMaker Canvas

49
Q

Suppose that your data preparation needs to include detecting bias in your data. You can use this to analyze your data and detect potential biases across multiple facets.

A

Amazon SageMaker Clarify

50
Q

One optimization technique is to improve application performance by reducing the size of the LLM. This action can reduce inference latency because a smaller model loads more quickly. However,

A

remember that reducing the size of the model might decrease its performance.

51
Q

Other optimization techniques include

A

making the prompt more concise, reducing the size and number of the retrieved snippets, and reducing the length of the generation through inference parameters and the prompt.

52
Q

Metrics such as accuracy or root mean square error (RMSE) are more straightforward to calculate because the predictions are deterministic and can be compared against the labels. Can you use these for generative AI?

A

No. The output of generative AI models is non-deterministic, which makes the evaluation more difficult.

53
Q

Recall Oriented Understudy for Gisting Evaluation, or ROUGE,

A

is a set of metrics and a software package. It is used to evaluate automatic summarization tasks and machine translation software in natural language processing. It evaluates how well the generated output compares to a reference.

54
Q

Bilingual Evaluation Understudy, or BLEU,

A

is an algorithm that is used for translation tasks. It evaluates the quality of text which has been machine translated from one natural language to another.
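A toy example with NLTK's sentence-level BLEU; the sentences are made up, and a smoothing function is used because very short examples otherwise yield zero n-gram counts.

```python
# Toy example of BLEU with NLTK (pip install nltk). Smoothing is applied
# because very short sentences otherwise produce zero higher-order n-gram counts.
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

reference = ["the", "cat", "is", "on", "the", "mat"]
candidate = ["the", "cat", "sat", "on", "the", "mat"]

smoothie = SmoothingFunction().method1
print(sentence_bleu([reference], candidate, smoothing_function=smoothie))
```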

55
Q

You can use this to evaluate large language models, LLMs, and create model evaluation jobs. A model evaluation job helps to evaluate and compare model quality and metrics for text-based foundation models from SageMaker JumpStart.

A

Amazon SageMaker Clarify

56
Q

Amazon Bedrock provides this, which can automatically compare generated responses against a human reference and calculate a semantic similarity score (BERTscore). It is suitable for evaluating faithfulness and hallucinations in text-generation tasks.

A

an evaluation module

57
Q

General Language Understanding Evaluation, GLUE,

A

It is a collection of natural language tasks, such as sentiment analysis and question answering. You can use these tasks to evaluate and compare model performance across a set of language tasks. Then, you can use the benchmark to measure and compare the model performance

58
Q

Massive Multitask Language Understanding, MMLU,

A

evaluates the knowledge and problem-solving capabilities of the model. To perform well, models must have extensive world knowledge and problem-solving ability. The models are tested on more than basic language understanding, such as history, mathematics, laws, computer science, and more.

59
Q

The Beyond the Imitation Game Benchmark, BIG-bench,

A

focuses on tasks that are beyond the capabilities of the current language models. It contains tasks such as math, biology, physics, bias, linguistics, reasoning, childhood development, software development, and more.

60
Q

Holistic Evaluation of Language Models, HELM,

A

which is a benchmark to help improve model transparency. It offers users guidance on which model performs well for a given task. HELM is a combination of metrics for tasks such as summarization, question and answer, sentiment analysis, and bias detection

61
Q

Instruction tuning

A

refers to the process of providing specific labeled examples to train a model on a specific task. Instruction tuning can help models follow specific tasks or responses to prompts in a specific way

62
Q

Length

A

Foundation models typically support parameters that limit the length of the response. Examples of these parameters are provided below.

Response length – An exact value to specify the minimum or maximum number of tokens to return in the generated response.

Penalties – Specify the degree to which to penalize outputs in a response. Examples include the following.

The length of the response.

Repeated tokens in a response.

Frequency of tokens in a response.

Types of tokens in a response.

Stop sequences – Specify sequences of characters that stop the model from generating further tokens. If the model generates a stop sequence that you specify, it will stop generating after that sequence.
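As an illustrative sketch of such length controls, here is a boto3 call using the Amazon Titan Text request format; the model ID and field names are assumptions, and other providers name these parameters differently.

```python
# Sketch: limiting response length and setting stop sequences on Amazon Bedrock.
# Assumes the Amazon Titan Text request format; field names differ per provider.
import json
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

body = {
    "inputText": "Write a short product description for a travel mug.",
    "textGenerationConfig": {
        "maxTokenCount": 150,          # response length limit
        "stopSequences": ["User:"],    # stop generating after this sequence
        "temperature": 0.5,
        "topP": 0.9,
    },
}

response = bedrock.invoke_model(
    modelId="amazon.titan-text-express-v1",   # assumed model ID
    body=json.dumps(body),
)
print(json.loads(response["body"].read()))
```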
