Paper 3 Flashcards
What is Latency in the context of Natural Language Processing?
The delay that occurs in processing user input and generating a response
Define Dependency Reduction in the Critical Path.
Strategies for streamlining processes by minimizing dependencies that cause delays
What is the Critical Path?
The shortest and most efficient sequence of linked machine learning models required to process a user’s message
What are the main types of analysis challenges in Natural Language Processing?
- Syntactic Analysis Failures
- Semantic Analysis Failures
- Pragmatic Analysis Failures
What is Syntactic Analysis Failures?
Problems with understanding sentence structure
What is Semantic Analysis Failures?
Challenges in understanding meanings, such as idioms and homonyms
What does Pragmatic Analysis Failures refer to?
Issues with understanding context, tone, or cultural differences
What is an example of a Syntactic Analysis Failure?
Misunderstanding a request due to poor sentence structure
What is Data Augmentation in NLP?
The process of artificially generating new data based on existing training data
What are some hardware types used in NLP?
- CPUs
- GPUs
- TPUs
What is the role of CPUs in machine learning?
They are cost-effective for non-parallel processing tasks
What are GPUs particularly good at?
Matrix operations and tasks requiring parallel processing
What is the von Neumann bottleneck?
The limitation of memory access speed compared to calculation speed in CPUs
What is a common challenge faced by Natural Language Processing models?
Biases such as Historical Bias, Linguistic Bias, and Sampling Bias
What is Historical Bias in NLP datasets?
Bias that arises from historical inequalities reflected in training data
What is an example of Sampling Bias?
Algorithms predicting healthcare needs favoring one demographic over another
What is an ethical challenge in AI?
Accountability for decisions made by AI systems
True or False: Data augmentation can help reduce bias in AI training datasets.
True
What is an example of a real-world scenario where NLP failed?
McDonald’s Drive Thru Chatbot Beta Test leading to incorrect orders
What strategies can be used to solve tokenization problems in NLP?
- Querying large tables of edge cases
- Using context-aware tokenization models
- Applying more complex heuristics
Fill in the blank: _______ are problems with understanding context, tone, or cultural differences.
Pragmatic Analysis Failures
What is the impact of cultural biases in NLP models?
Models trained on English datasets may not understand biases relevant to other cultures
What is a potential solution to mitigate biases in NLP models?
Data augmentation and training on diverse datasets
What is a common issue with tokenization in languages like Chinese?
Defining a ‘word’ is difficult due to the lack of spaces