Midterm Flashcards

1
Q

What are the most common data formats for data analysis?

A
  • Structured data (databases)
  • Semi-structured data (XML/JSON data, email, web pages)
  • Unstructured data (audio, video, image data, natural language)
2
Q

What are the types of qualitative data?

A
  • Nominal data: data that can be labelled or classified into mutually exclusive categories within a variable (hair color, gender, ethnicity)
  • Ordinal data: data grouped into ordered categories (grades, economic status)
3
Q

What are the types of quantitative data?

A
  • Discrete data: data that includes nondivisible figures and statistics that can be counted (number of people)
  • Continuous data: data that can take any value, including decimals (height, length, temperature)
4
Q

What are alternative data sources?

A

Non-traditional or unconventional sources of information that can be used to gain insights and make informed decisions. These sources complement or supplement traditional data sources such as official statistics or financial reports.

5
Q

Where can you find alternative data sources?

A
  • Development partners (CAF, European Bank, OECD, UNDP)
  • The Development Data Partnership (solving development challenges through data science collaboration between companies and international organizations)
  • Data providers (GitHub, Google, Meta, LinkedIn, etc.)
6
Q

What is the Humanitarian Data Exchange (HDX)?

A

An alternative data source: an open platform for sharing data across crises and organizations. The goal of HDX is to make humanitarian data easy to find and use for analysis.

7
Q

What is ETL (Extract, Transform, Load)?

A

It’s a process used in data integration and data warehousing to extract data from various sources, transform it into a desired format, and load it into a target system or data repository for further analysis or storage.
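
A minimal end-to-end sketch in Python with pandas, using an in-memory CSV and an in-memory SQLite database so it runs as-is (the column and table names are made up for illustration):

```python
import io
import sqlite3

import pandas as pd

# Extract: read raw data from a source (an in-memory CSV stands in for a real file)
raw = pd.read_csv(io.StringIO("household_id,income_usd\n1,500\n1,500\n2,\n3,800"))

# Transform: clean and reshape into the desired format
clean = raw.drop_duplicates().dropna(subset=["income_usd"])
clean["income_usd"] = clean["income_usd"].astype(float)

# Load: write the result into a target repository (here, SQLite)
con = sqlite3.connect(":memory:")
clean.to_sql("households", con, if_exists="replace", index=False)
con.close()
```
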

8
Q

What are the possible variants of the extract step in the ETL process?

A
  1. Database extraction: extracting data from relational databases such as Oracle, MySQL, or SQL Server. This involves querying the database using SQL statements to retrieve the required data.
  2. File extraction: extracting data from flat files, such as CSV, Excel, XML, JSON, or text files. The extraction process involves reading the file content and parsing it to extract the relevant data.
  3. Web scraping: extracting data from websites by crawling web pages and scraping the required information. This could involve using tools and libraries like BeautifulSoup or Scrapy to navigate web pages, locate specific elements, and extract the desired data.
  4. API extraction: extracting data from web APIs (Application Programming Interfaces). APIs provide a structured way to access and retrieve data from various sources (see the sketch after this list).
  5. Log file extraction: extracting data from log files generated by systems, applications, or devices. Log files often contain valuable information that can be extracted and analyzed for troubleshooting, performance monitoring, or security purposes.
  6. Sensor or IoT data extraction: extracting data from sensors or Internet of Things devices. This involves capturing data from sensors or devices that collect and transmit real-time data, such as temperature sensors, GPS devices, or smart meters.
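
A sketch of two of these variants, file extraction (2) and API extraction (4); the file name and URL below are placeholders, not real sources:

```python
import pandas as pd
import requests

# File extraction: read and parse a flat file (hypothetical CSV)
file_df = pd.read_csv("indicators.csv")

# API extraction: request structured data from a web API
# (placeholder URL; a real API documents its endpoints and parameters)
response = requests.get(
    "https://api.example.org/v1/indicators",
    params={"country": "BR", "format": "json"},
)
response.raise_for_status()             # fail early on HTTP errors
api_df = pd.DataFrame(response.json())  # tabulate the JSON payload for analysis
```
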
9
Q

What are the most common transformation processes?

A

Data cleaning, data integration, data enrichment, data standardization, aggregation and summarization, and feature engineering.

10
Q

What is data cleaning?

A

The transformation process includes cleaning the extracted data to ensure its quality and consistency. This may involve handling missing values, removing duplicates, correcting data formatting errors, and validating data against predefined rules or constraints.
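
A minimal pandas sketch of these operations (the data and column names are made up):

```python
import pandas as pd

df = pd.DataFrame({
    "name": ["Ana", "Ana", "Luis", None],
    "age":  ["34", "34", "29", "41"],
})

df = df.drop_duplicates()               # remove duplicates
df = df.dropna(subset=["name"])         # handle missing values
df["age"] = df["age"].astype(int)       # correct a data formatting error
assert df["age"].between(0, 120).all()  # validate against a predefined rule
```
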

11
Q

What is data standardization?

A

Transforming the data into a consistent format is often necessary for efficient analysis or loading into a target system. This may involve converting data types, harmonizing units of measurement, and applying standard coding schemes to ensure data consistency across different sources.
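
A short pandas sketch, assuming toy data with inconsistent units and country codes:

```python
import pandas as pd

df = pd.DataFrame({"height_in": [65, 70], "country": ["br", "BR "]})

# Harmonize units of measurement: inches -> centimeters
df["height_cm"] = df["height_in"] * 2.54

# Apply a standard coding scheme: trimmed, uppercase country codes
df["country"] = df["country"].str.strip().str.upper()
```
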

12
Q

What is data integration?

A

If data is extracted from multiple sources, the transformation process involves integrating the data into a unified format. This may include resolving inconsistencies, merging overlapping data, and reconciling differences in data structures or naming conventions.
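
A small pandas sketch: two made-up sources describe the same households under different naming conventions, so the columns are reconciled before merging:

```python
import pandas as pd

surveys = pd.DataFrame({"hh_id": [1, 2], "income": [500, 800]})
census = pd.DataFrame({"household_id": [1, 2], "size": [4, 3]})

# Reconcile naming conventions, then merge into a unified format
census = census.rename(columns={"household_id": "hh_id"})
unified = surveys.merge(census, on="hh_id", how="outer")
```
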

13
Q

What is feature engineering/data enrichment?

A

In some cases, additional data may need to be added to the extracted data to enhance its value or context. This could involve merging with external data sources, performing data lookups, or applying algorithms or advanced analytics techniques to derive new attributes or insights.

14
Q

Examples of feature engineering?

A

-Calculate the time difference between two events
-Calculate the average age in a group of people
-Find the closest supermarket to a certain point
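
The first two examples in pandas (made-up data; the supermarket example would additionally need coordinates and a distance calculation):

```python
import pandas as pd

events = pd.DataFrame({
    "start": pd.to_datetime(["2024-01-01 08:00", "2024-01-01 09:30"]),
    "end":   pd.to_datetime(["2024-01-01 08:45", "2024-01-01 11:00"]),
    "group": ["A", "A"],
    "age":   [25, 35],
})

# Time difference between two events, in minutes
events["duration_min"] = (events["end"] - events["start"]).dt.total_seconds() / 60

# Average age in a group of people
avg_age = events.groupby("group")["age"].mean()
```
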

15
Q

Examples of data enrichment?

A

Using natural language processing (NLP) to extract insights from a tweet
Using machine learning algorithms to extract insights from a picture
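
A sketch of the first example, assuming the third-party textblob package is installed (pip install textblob); its built-in sentiment analyzer is used purely as an illustration of deriving new attributes from raw text:

```python
from textblob import TextBlob

tweet = "The new water point saved our village hours of walking every day!"

# Enrich the raw text with derived sentiment attributes
sentiment = TextBlob(tweet).sentiment
print(sentiment.polarity)      # -1.0 (negative) .. 1.0 (positive)
print(sentiment.subjectivity)  # 0.0 (objective) .. 1.0 (subjective)
```
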

16
Q

What is the load step in the ETL process?

A

This step refers to loading the transformed and processed data into a target system or data repository for storage, analysis, or further processing.
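
A minimal sketch loading a transformed DataFrame into a SQLite database (the file and table names are illustrative):

```python
import sqlite3

import pandas as pd

transformed = pd.DataFrame({"region": ["North", "South"], "avg_income": [520.0, 610.0]})

con = sqlite3.connect("warehouse.db")  # the target repository
transformed.to_sql("regional_income", con, if_exists="replace", index=False)
con.close()
```
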

17
Q

What is an API?

A

API: an Application Programming Interface is a software intermediary that allows two applications to talk to each other. APIs are an accessible way to extract and share data within and across organizations.
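
A short example with Python's requests library against the World Bank's public data API (the endpoint pattern is shown as a typical illustration; check the API's documentation for current parameters):

```python
import requests

# Request Brazil's total population series from the World Bank API
url = "https://api.worldbank.org/v2/country/BR/indicator/SP.POP.TOTL"
response = requests.get(url, params={"format": "json", "per_page": 5})
response.raise_for_status()

metadata, records = response.json()  # this API returns [metadata, data]
for row in records:
    print(row["date"], row["value"])
```
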

18
Q

Who is a chief data scientist?

A

They focus on applying advanced analytics and machine learning algorithms to solve complex problems, build predictive models, and develop data-driven solutions. They have a deep understanding of statistical modeling, data mining, and programming in R and Python.

19
Q

Who is a data engineer?

A

They focus on building and maintaining the infrastructure required for data processing and storage. They are responsible for data extraction, setting up databases and data pipelines, and ensuring data quality and reliability. They work with technologies like Hadoop, Spark, SQL, and data integration tools.

20
Q

Who is a data analyst?

A

They act as a bridge between data scientists/data engineers and business needs. They interact with customers to understand their business challenges and solve them through data. They usually work with Power BI, Tableau, and reporting tools, and have data storytelling skills.

21
Q

What are the types of algorithms?

A
  • Logic rule-based systems
  • AI systems
  • Generative AI systems
22
Q

What is a logic rule-based system?

A

A type of computer program or AI that operates by following predefined rules and logical principles to make decisions or draw conclusions: if X, then Y.
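
A tiny illustration of the "if X, then Y" pattern (the rules themselves are made up):

```python
def loan_decision(income: float, has_default: bool) -> str:
    """Hypothetical rule-based decision: predefined rules, no learning."""
    if has_default:          # if X, then Y
        return "reject"
    if income >= 3000:
        return "approve"
    return "manual review"

print(loan_decision(income=4500, has_default=False))  # approve
```
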

23
Q

What are AI systems?

A

Artificial Intelligence (AI) systems refer to computer programs or machines that exhibit intelligence and perform tasks that typically require human intelligence. These systems are designed to simulate human cognitive functions such as learning, reasoning, problem-solving, perception, and decision-making. AI systems can range from simple rule-based programs to complex neural networks and deep learning algorithms.

24
Q

What is a generative AI system?

A

Generative AI systems are a type of artificial intelligence that focuses on generating new content, such as images, text, music, or even entire pieces of software code. Unlike traditional AI systems that primarily analyze existing data or make predictions based on patterns, generative AI systems have the ability to create entirely new content that was not explicitly present in the training data.

25
Q

What are image generation models?

A

Machine learning (ML) models trained on large amounts of images that are able to generate visual content from a text description.

26
Q

What are large language models?

A

Large language models are advanced artificial intelligence systems designed to understand and generate human-like text based on vast amounts of data. These models, like GPT-3 (Generative Pre-trained Transformer 3), are characterized by their extensive size, complexity, and ability to perform a wide range of natural language processing tasks.

27
Q

What key advances have enabled the creation of LLMs?

A

-deep neural networks
-transformers
-transfer learning
-increased computing capacity
-large text data sets
-advances in optimization and training

28
Q

What is the model size of LLMs?

A

The size of the model is measured by the number of parameters.
Parameters indicate the size and capacity of the model (the weights and biases of the neurons that are adjusted during the training process).

29
Q

What is a token?

A

A part of a word; the atomic unit that LLMs work with. Tokens can be characters, words, subwords, other segments of text, or code, depending on the chosen tokenization method or scheme.
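
A concrete illustration using OpenAI's tiktoken library (assumed installed via pip install tiktoken; other LLMs use different tokenization schemes, so counts will differ):

```python
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")  # tokenizer used by several OpenAI models
ids = enc.encode("Tokenization splits text into subwords.")

print(len(ids))                        # number of tokens
print([enc.decode([i]) for i in ids])  # the text of each token
```
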

30
Q

What are some of the most important LLMs?

A

GPT (OpenAI), Claude (Anthropic), Bard and Gemini (Google), Perplexity (Perplexity AI), Grok (xAI)

31
Q

What are potential risks of LLMs (at the corporate/social scale) and how can they be mitigated?

A
  1. Hallucinations (incorrect information given as true): *implement robust data validation and fact-checking protocols, *regularly update models with accurate data, *educate users about AI limitations
  2. Biased and unfair models (AI models can inherit or amplify biases present in their training data, leading to unfair or discriminatory outcomes): *use diverse and representative datasets for training, *apply fairness algorithms and bias detection tools, *continuously monitor for biases
  3. Data privacy (risk of capturing or releasing private information): *build a compliant technology architecture, *use anonymization techniques
  4. Loss of competitiveness (if LLMs are not adopted, efficiency could be lost to competitors): *foster a culture of continuous learning, *invest in AI research, *partner with AI innovation leaders
  5. Intellectual property (AI can generate content that blurs the lines of intellectual property ownership, leading to legal challenges): *develop clear guidelines on IP for AI-generated content, *stay informed about evolving IP laws, *use AI ethically and responsibly
32
Q

What are the risks of LLMs (at the government scale) and how can they be mitigated?

A
  1. Economic and employment impact (job displacement in sectors where tasks done by humans are automated): *develop workforce training programs, *support the sectors most affected by AI
  2. Misinformation and propaganda (propagation of fake news to manipulate people): *implement stringent laws against AI-generated misinformation, *develop AI tools to detect fake news, *educate the public on media literacy and critical thinking, *develop AI regulation and close collaboration with AI entities
33
Q

What is prompt engineering?

A

A discipline focused on designing and refining text inputs for LLMs to obtain optimal results.

34
Q

What are key points about raw data?

A
  • In its basic unstructured form, raw data is completely useless and meaningless
    -it needs to be cleaned, processed, organized, analyzed, and visualized in order to become meaningful or informative
  • If this information leads to a deeper understanding of a given situation (the how and why), then it becomes knowledge and allows you to make evidence-based decisions
35
Q

What is primary data?

A

It’s data collected by you, through surveys, focus groups, interviews, observations, and experiments. It can be either qualitative or quantitative, it is specific to your needs, and you control the quality. The disadvantage is that it usually costs more and takes more time.

36
Q

What is secondary data?

A

Data collected by someone else. It can be either qualitative or quantitative and is usually cheap and quick to obtain. The key disadvantage is that the data can be too old and/or not specific enough for your needs.

37
Q

What is an analysis framework?

A

An analysis framework is a structured approach or methodology used to systematically analyze data, information, processes, systems, or phenomena in order to gain insights, draw conclusions, and make informed decisions. It provides a framework or structure for organizing, categorizing, and interpreting data or observations within a specific context or domain.

38
Q

What are other types of data (besides qualitative and quantitative)?

A
  • Audiovisual data (neither qualitative nor quantitative)
  • Geospatial data
  • PII (personally identifiable information): any type of information relating to a physical person that can lead to their identification
39
Q

What are the primary data collection methods?

A

-Observations
-Key informant interviews
-Participatory approaches
-Household surveys

40
Q

What are field observations?

A

Field observations can be used to rapidly collect different types of information. They don't require costly resources or detailed training, which makes observation a quick data collection process that is easy to implement. Observations can be structured (the observer looks for a specific behavior, object, or event) or unstructured (the observer looks at how things are done and what issues exist).

41
Q

What are key informant interviews?

A

A qualitative data collection method used to gather in-depth information from individuals who have specialized knowledge of or insights about a particular topic and/or a particular group of people.

42
Q

What is a focus group discussion?

A

A participatory and qualitative research method involving a small, diverse group of people who are part of a target population. Participants are engaged in structured discussions to explore their perceptions, opinions, and experiences on specific development-related issues or interventions.

43
Q

What is a household survey?

A

A quantitative data collection method where information is gathered from a sample of households, typically regarding demographic, economic, and social characteristics.

44
Q

What is random sampling?

A

This method involves selecting individuals from a population entirely by chance, where each member has an equal probability of being chosen.
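
A minimal sketch with pandas (toy population):

```python
import pandas as pd

population = pd.DataFrame({"person_id": range(1, 101)})

# Every member has an equal probability of being chosen
sample = population.sample(n=10, random_state=42)
```
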

45
Q

What is systematic sampling?

A

This method involves selecting every nth individual from the population list.
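
A minimal sketch with pandas (toy population list):

```python
import pandas as pd

population = pd.DataFrame({"person_id": range(1, 101)})

n = 10                         # sampling interval
sample = population.iloc[::n]  # every nth individual from the list
```
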

46
Q

What is stratified sampling?

A

In this method, the population is divided into smaller groups, or strata, based on shared characteristics (like age, income, or location). A random sample is then taken from each stratum.
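
A minimal sketch with pandas (region is the made-up stratum variable):

```python
import pandas as pd

population = pd.DataFrame({
    "person_id": range(1, 9),
    "region": ["N", "N", "N", "N", "S", "S", "S", "S"],
})

# Take a random sample of 2 from each stratum
sample = population.groupby("region", group_keys=False).sample(n=2, random_state=0)
```
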

47
Q

What is cluster sampling?

A

Cluster sampling involves dividing the population into clusters and then randomly selecting entire clusters for study.
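
A minimal sketch with pandas (village is the made-up cluster variable):

```python
import pandas as pd

population = pd.DataFrame({
    "person_id": range(1, 9),
    "village": ["A", "A", "B", "B", "C", "C", "D", "D"],
})

# Randomly select entire clusters, then keep all of their members
chosen = pd.Series(population["village"].unique()).sample(n=2, random_state=0)
sample = population[population["village"].isin(chosen)]
```
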

48
Q

What is convenience sampling?

A

In convenience sampling, individuals are selected based on their availability and willingness to participate.

49
Q

What is snowball sampling?

A

Used primarily in qualitative research, snowball sampling involves existing study subjects recruiting future subjects from among their acquaintances.

50
Q

What are the data collection biases?

A

-Sampling bias: occurs when the sample isn't representative of the population. For example, only surveying accessible or certain demographic groups can lead to skewed results
-Response bias: can happen if respondents give answers they think the interviewer wants to hear, rather than their true opinion
-Nonresponse bias: occurs when a significant portion of selected participants doesn't respond, and their nonresponse is correlated with the outcome of interest
-Data entry errors: mistakes in data entry can occur, especially with large datasets and manual entry
-Cultural bias: not considering cultural nuances can lead to misunderstandings or misinterpretations of data
-Observer bias: the presence of an observer can sometimes influence the behavior of those being observed
-Recall bias: in surveys or interviews, participants might not accurately remember past events or experiences, leading to inaccurate responses
-Selection bias: happens when the procedure used to select participants leads to a sample that is not representative of the population