Gemini 4-18-25 Flashcards

1
Q

What is Gemini?

A

Google’s family of highly capable and general-purpose AI models. A key characteristic is its native multimodality.

[cite: 14, 15]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Native Multimodality?

A

Gemini models are designed from the ground up to understand, process, and combine different types of information seamlessly (text, code, images, audio, video).

[cite: 15, 16, 17]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Transformer Architecture?

A

The neural network architecture that underpins Gemini, enabling it to process input data in a highly parallel and context-aware manner.

[cite: 19, 20, 21]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are Encoders in the context of Gemini?

A

Components of the Transformer that convert input data (text, images, etc.) into numerical representations called embeddings.

[cite: 21, 22]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the Self-Attention Mechanism?

A

The core innovation of the Transformer that allows the model to weigh the importance of different parts of the input sequence relative to each other.

[cite: 23, 24, 25, 26, 27]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are Decoders in the context of Gemini?

A

Components of the Transformer that generate output (text, code, etc.) using the numerical representations from the encoders and the contextual understanding gained through self-attention.

[cite: 28, 29, 30, 31]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Vertex AI?

A

Google Cloud’s platform that allows enterprises to access and utilize Gemini models, providing the infrastructure, tools, and governance features for building and deploying generative AI applications.

[cite: 35, 36, 37]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the Model Garden?

A

The central hub within the Vertex AI platform where users can discover, test, and deploy Gemini models alongside other first-party, third-party, and open-source models.

[cite: 37, 38]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Gemini 1.0 Pro?

A

An initial flagship Gemini model focused on text and code generation, natural language tasks, and multi-turn chat.

[cite: 38, 39, 40, 41]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Gemini 1.0 Pro Vision?

A

A Gemini model that adds the ability to understand image and video inputs alongside text.

[cite: 38, 39, 40, 41]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Gemini 1.5 Pro?

A

A Gemini model that introduced a breakthrough 1 million token context window (expandable to 2 million tokens on Vertex AI), allowing it to process and reason over enormous amounts of information.

[cite: 42, 43, 44]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Gemini 1.5 Flash?

A

A lighter, faster, and more cost-effective counterpart to 1.5 Pro, retaining the large context window and multimodal input capabilities but optimized for speed and efficiency.

[cite: 44, 45, 46]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Gemini 2.0 Flash?

A

A Gemini model that incorporates next-generation features, enhanced speed, native tool use capabilities, and multimodal generation (outputting text, images, and audio).

[cite: 46, 47, 48, 49]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Gemini 2.0 Flash-Lite?

A

A variant of Gemini 2.0 Flash specifically optimized for cost-efficiency and low latency, suitable for high-throughput scenarios.

[cite: 49, 50, 51]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is Gemini 2.5 Pro?

A

A Gemini model that introduces a significant architectural evolution towards models that can perform explicit reasoning steps before generating a final response, designed for tasks demanding maximum quality and deep reasoning.

[cite: 50, 51, 52, 53, 54]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Gemini 2.5 Flash?

A

A Gemini model that brings the ‘thinking’ capability to a model optimized for a balance between performance, cost, and latency, ideal for high-volume applications that benefit from reasoning but also require efficiency.

[cite: 54, 55, 56, 57, 58]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is Function Calling?

A

A feature that allows Gemini models to interact with external tools, databases, and APIs to fulfill requests.

[cite: 75, 76, 77, 78]

18
Q

What is Grounding?

A

A feature that enhances factual accuracy and reduces hallucinations by connecting the Gemini model to authoritative external data sources during response generation.

[cite: 78, 79, 80, 81]

19
Q

What is Imagen?

A

Google’s family of models, accessible within Vertex AI, that specialize in generating high-quality images from text prompts.

[cite: 82, 83, 84]

20
Q

What is the Generative AI Evaluation Service?

A

A Vertex AI service that allows organizations to systematically assess the performance of their Gemini models and other generative AI applications.

[cite: 85, 86, 87, 88]

21
Q

What is the Vertex AI Agent Builder?

A

A tool to build AI agents and chatbots, powered by Gemini.

[cite: 103, 104]

22
Q

What are Vertex AI Pipelines?

A

Automates and orchestrates workflows involving Gemini, such as data preprocessing, model tuning, evaluation, and deployment.

[cite: 203, 204]

23
Q

What is Vertex AI Search / Vector Search?

A

Enables grounding Gemini responses in private enterprise data or builds semantic search applications using embeddings generated by Gemini.

[cite: 203, 204]

24
Q

What is Gemini in BigQuery?

A

Gemini capabilities embedded within BigQuery to assist with SQL generation, data exploration, and analysis.

[cite: 207, 208]

25
What is Vertex AI Studio?
The interactive environment for prompting, testing, and initiating tuning jobs for Gemini models. ## Footnote [cite: 200, 201, 202]
26
What is Gemini's Native Multimodality?
Gemini's ability to seamlessly process and combine different types of information (text, code, images, audio, video) allows it to grasp nuances and contexts that might be lost on models less adept at synthesizing information from multiple sources. This capability enables more comprehensive data analysis and richer, more intuitive user interactions. ## Footnote [cite: 15, 16, 17, 18]
27
What are Large Context Windows?
Gemini's large context windows (up to 2 million tokens) enable it to process and reason over enormous amounts of information simultaneously, allowing for more complex analysis, summarization, and reasoning tasks. ## Footnote [cite: 42, 43, 44]
28
What are Advanced Reasoning ('Thinking' Models)?
The Gemini 2.5 series introduces models that can perform explicit reasoning steps before generating a final response. This 'thinking' capability aims to improve accuracy, handle complexity more effectively, and provide greater transparency, which is crucial for enterprise applications requiring reliability and explainability. ## Footnote [cite: 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61]
29
What is Enterprise Readiness via Vertex AI?
Vertex AI provides a comprehensive suite of enterprise-grade features crucial for production deployments of Gemini, including MLOps tools, security and governance features, scalability, and seamless integration with the broader Google Cloud ecosystem. ## Footnote [cite: 94, 95, 96, 97]
30
What is Model Choice and Flexibility?
Vertex AI's Model Garden offers access to a wide range of models, including Gemini, other Google foundation models, open-source models, and third-party models. This provides enterprises with unparalleled flexibility to choose the best model for a specific task or budget, experiment across different model providers, and avoid vendor lock-in. ## Footnote [cite: 97, 98, 99]
31
How does Gemini automate content generation?
Gemini can automate the creation of marketing materials, draft emails and reports, generate product descriptions, and summarize lengthy documents, meetings, videos, or audio files, saving time and increasing efficiency. ## Footnote [cite: 102, 103]
32
How does Gemini enhance customer service?
Gemini, through Vertex AI Agent Builder, enables the building of sophisticated AI agents and chatbots that can provide personalized customer support, answer product inquiries, and guide users through processes, improving customer satisfaction and reducing support costs. ## Footnote [cite: 103, 104]
33
How does Gemini analyze data and extract insights?
Gemini's long context and reasoning abilities allow it to analyze large volumes of structured and unstructured data, including text documents, code repositories, images, and videos, to extract key insights, identify patterns, and generate analytical summaries or reports, supporting better decision-making. ## Footnote [cite: 105, 106, 107]
34
How does Gemini assist software development?
Gemini can assist developers throughout the software lifecycle with tasks like code generation, code completion, explaining complex code segments, debugging assistance, and generating unit tests, increasing developer productivity and code quality. ## Footnote [cite: 107, 108, 109]
35
How does Gemini analyze and generate media?
Gemini's multimodal capabilities can be leveraged to analyze and understand the content of images and videos, generate descriptive captions, classify media assets, or even generate new images or video content, enabling new forms of media creation and analysis. ## Footnote [cite: 109]
36
How does Gemini improve research and development?
Gemini can accelerate research by analyzing vast amounts of scientific literature, extracting critical information from dense technical documents or patents, and summarizing research findings, speeding up the research process. ## Footnote [cite: 109]
37
How does Gemini provide personalized gardening support?
Gemini can provide scalable, personalized gardening support, offering customers tailored advice and relevant product recommendations, enhancing customer engagement and providing helpful guidance. ## Footnote [cite: 110, 111, 112]
38
How does Gemini increase chatbot engagement?
Gemini's multimodal capabilities can increase the utility and engagement of chatbots, resulting in richer interactions and increased user engagement. ## Footnote [cite: 112, 113]
39
How does Gemini provide quick access to vehicle information?
Gemini's multimodal understanding enables drivers to ask natural language questions about their owner's manual or point their smartphone camera at the dashboard to get explanations for indicator lights, improving the user experience and potentially reducing calls to support centers. ## Footnote [cite: 113, 114]
40
How does Gemini process extensive legal and financial documents?
Gemini's large context window is crucial for processing extensive legal and financial documents, enabling faster analysis and the development of new capabilities that require understanding entire documents in context. ## Footnote [cite: 114, 115, 116, 117]
41
How does Gemini support decision-making in agriculture?
Gemini can analyze relevant data to provide timely insights in the field, contributing to more sustainable and efficient farming practices by improving the quality and speed of decision-making. ## Footnote [cite: 117, 118]
42
How does Gemini automate media captioning?
Gemini can automate the costly and time-consuming process of manually captioning media, resulting in significant cost and time reductions. ## Footnote [cite: 118, 119, 120, 121]