Image and video processing solutions 15-20% Flashcards
Face app capabilities
identify, verify, find similar, group
What are the capabilities of the computer vision app?
Image analysis, Video analysis, Image classification, object detection, facial analysis
What are the different face detection models?
detection_01, detection_02, detection_03
What’s good about detection_02?
It has improved accuracy on small, side-view, and blurry faces, but it won’t return attributes or landmarks
What’s good about detection_03?
Better accuracy on small faces than 02, and will return attributes and landmarks
What computer vision tier supports OCR?
S2
Which document formats does form recognizer prebuilt read?
PDF, word, and excel
What does A1 and A2 mean in the image classification domain?
Improved accuracy
what formats can OCR read api ingest?
jpeg, png, bmp, pdf, and tiff
How many pages can PDFs and TIFFs be for OCR read api?
2000 pages, first 2 pages for free tier
How large can the image files be for the OCR read api?
<500MB, 4MB for the free tier
What are the image dimensions for the OCR read api?
50^2 to 10,000^2
How many print languages can the OCR read api handle?
164
How many handwritten languages can the OCR read api handle?
9
How does calling the OCR read api work?
You call it first and receive an async ID. You then call again for the results