03-Computer Vision Flashcards
What is Computer Vision
AI that can “see” the world and make sense of it
What is Image Classification
Train an ML model to classify images based on their contents.
What is Object Detection
Train an ML model to classify individual objects within an image and identify the location of each with a bounding box
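As a minimal sketch of what an object detection result might look like, the snippet below models each detected object as a class label plus a bounding box. The `Detection` class, its field names, the `filter_detections` helper, and the sample values are all hypothetical and chosen only for illustration; they are not the output format of any particular service.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Detection:
    """One detected object: a class label plus a bounding box (hypothetical layout)."""
    label: str
    confidence: float  # model's score for this label, between 0 and 1
    left: int          # x coordinate of the box's top-left corner, in pixels
    top: int           # y coordinate of the box's top-left corner, in pixels
    width: int
    height: int

    def contains(self, x: int, y: int) -> bool:
        """Return True if pixel (x, y) falls inside the bounding box."""
        return (self.left <= x < self.left + self.width
                and self.top <= y < self.top + self.height)


def filter_detections(detections: List[Detection],
                      min_confidence: float = 0.5) -> List[Detection]:
    """Keep only detections whose confidence meets the threshold."""
    return [d for d in detections if d.confidence >= min_confidence]


# Made-up results for a single image containing two kinds of object.
results = [
    Detection("apple", 0.92, left=10, top=20, width=50, height=40),
    Detection("banana", 0.31, left=100, top=30, width=80, height=25),
]
kept = filter_detections(results)
```

Unlike image classification, which yields one label for the whole image, each entry here pairs a label with a location in the image.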
What is Semantic Segmentation
Advanced ML technique in which INDIVIDUAL PIXELS in the image are CLASSIFIED according to the object to which they belong
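A minimal sketch of the per-pixel idea, assuming a model has already produced a list of class scores for every pixel: the mask is built by picking the highest-scoring class at each pixel. The class names, the `segment` function, and the 2x2 score grid are hypothetical and exist only for illustration.

```python
from typing import List

# Hypothetical class names; the index in this list is the class id used in the mask.
CLASS_NAMES = ["background", "cat", "dog"]


def segment(scores: List[List[List[float]]]) -> List[List[int]]:
    """Return a mask holding, for each pixel, the index of its highest-scoring class.

    scores[row][col] is the list of per-class scores for that pixel.
    """
    return [
        [max(range(len(pixel)), key=pixel.__getitem__) for pixel in row]
        for row in scores
    ]


# Made-up per-pixel scores for a 2x2 "image".
toy_scores = [
    [[0.9, 0.05, 0.05], [0.2, 0.7, 0.1]],
    [[0.1, 0.1, 0.8], [0.6, 0.3, 0.1]],
]
mask = segment(toy_scores)
labels = [[CLASS_NAMES[i] for i in row] for row in mask]
```

The result has the same height and width as the input, with one class per pixel rather than one label or one box per object.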
What is Image Analysis
Combine ML models with advanced image analysis techniques to extract information from images, including “tags” that could help catalog the image or even descriptive captions that summarize the scene shown in the image
What is Face Detection, Analysis, and Recognition
Specialized form of object detection that locates human faces in an image. It can be combined with classification and facial geometry analysis techniques to infer details such as age and emotional state, and even to recognize individuals based on their facial features
What is Optical Character Recognition (OCR)
Technique to detect and read text in images.
What 4 things does Cognitive Services include
Cognitive Services includes
- Decision
- Language
- Speech
- Vision
What are 4 Decision services
- Anomaly Detector
- Content Moderator
- Metrics Advisor (Preview)
- Personalizer
What is Anomaly Detector
Identify potential problems early on
What is Content Moderator
Detect potentially offensive or unwanted content
What is Metrics Advisor
Monitor metrics and diagnose issues
What is Personalizer
Create rich, personalized experiences for every user
What are 5 Language services
- Immersive Reader
- Language Understanding
- QnA Maker
- Text Analytics
- Translator
What are 4 Speech services
- Speech to Text
- Text to Speech
- Speech Translation
- Speaker Recognition (Preview)
What is Immersive Reader
Helps readers of all abilities comprehend text using audio and visual cues
What is Language Understanding
Build natural language understanding into apps, bots, and IoT devices
What is QnA Maker
Create a conversational question and answer layer over your data
What is Text Analytics
Detect sentiment, key phrases, and named entities
What is Translator
Detect and translate more than 90 supported languages
What is Speech to Text
Transcribe audible speech into readable, searchable text
What is Text to Speech
Convert text to life-like speech for more natural interfaces
What is Speech Translation
Integrate real-time speech translation into your apps
What is Speaker Recognition
Identify and verify the people speaking based on audio
What are 5 Vision services
- Computer Vision
- Custom Vision
- Face
- Form Recognizer
- Video Indexer
What is the Computer Vision service
Analyze content in images and video
What is Custom Vision
Customize image recognition to fit your business needs
What is Face
Detect and identify people and emotions in images
What is Form Recognizer
Extract text, key-value pairs, and tables from documents