Computer Vision Flashcards
Concepts related to Azure Computer Vision resources
Use this feature for general, unstructured documents with smaller amount of text, or images that contain text.
Azure AI Vision - OCR
Use this service to read small to large volumes of text from images and PDF documents.
Azure AI Document Intelligence
Which service do you use to read text from street signs, handwritten notes, and store signs?
OCR
Which service do you use to read receipts, and invoices?
Document intelligence
Which API would be best for this scenario? You need to read a large number of files with high accuracy. The text is short sections of handwritten text, some in English and some of it is in multiple languages.
Image Analysis service OCR feature
What levels of division are the OCR results returned?
Results contain blocks, words and lines, as well as bounding boxes for each word and line.
You’ve scanned a letter into PDF format and need to extract the text it contains. What should you do?
The Document Intelligence API can be used to process PDF formatted files.
What features exist for prebuilt Document Intelligence models?
Text extraction
Key-value pairs
Entities.
Selection marks
Tables.
Fields.
What specific forms exist as prebuilt models in Document intelligence?
Invoice
Receipt
W2
ID document model. US drivers’ licenses and international passports
Business card
Health insurance card
What generic prebuilt models exist in Document intelligence?
Read model.
General document model
Layout model.
What features are available in the Read model in Document Intelligence?
Text extraction
What features are available in the General document model in Document Intelligence?
Text extraction
Key-value pairs
Entities
Selection marks
Tables
What features are available in the Layout model in Document Intelligence?
Text extraction
Selection marks
Tables
What features are available in the Invoice model in Document Intelligence?
Text extraction
Key-value pairs
Selection marks
Tables
Fields
What features are available in the Receipt model in Document Intelligence?
Text extraction
Key-value pairs
Fields
What features are available in the W2 model in Document Intelligence?
Text extraction
Key-value pairs
Selection marks
Tables
Fields
What features are available in the ID document model in Document Intelligence?
Text extraction
Key-value pairs
Fields
What features are available in the Business card model in Document Intelligence?
Text extraction
Key-value pairs
Fields
Which file formats can be consumed by prebuilt Document Intelligence models?
JPEG
PNG
BMP
TIFF
PDF
What file size requirements exist for Document Intelligence documents?
The file must be smaller than 500 MB for the standard tier, and 4 MB for the free tier.
What image size requirements exist for Document Intelligence documents?
Images must have dimensions between 50 x 50 pixels and 10,000 x 10,000 pixels.
What limitations exist for PDF files in Document Intelligence?
PDF documents must have dimensions less than 17 x 17 inches or A3 paper size.
PDF documents must not be protected with a password.
What amount of pages are allowed for PDF and TIFF files in Document Intelligence?
PDF and TIFF files can have any number of pages but, in the standard tier, only the first 2000 pages are analyzed. In the free tier, only the first two pages are analyzed.
How do you use the Document Intelligence service?
For custom applications, use the REST API.
To explore the models and how they behavior with your forms visually, you can experiment in the Azure AI Document Intelligence Studio.
In the Read model in Doc intelligence, how can you select a page range for analysis?
Use the pages parameter
What is the purpose of the Read model in Document Intelligence?
The read model is ideal if you want to extract words and lines from documents with no fixed or predictable structure.
Which prebuilt Document Intelligence model supports Entity extraction?
general document model