Machine Learning Flashcards
AWS Rekognition
makes it easy to add image and video analysis to your applications using proven, highly scalable, deep learning technology that requires no machine learning expertise to use
can identify objects, people, text, scenes, and activities in images and videos, and detect any inappropriate content
also provides highly accurate facial analysis and facial search capabilities taht you can use to detect, analyze, and compare faces for a wide variety of user verification, people counting, and public safety use cases
Rekognition Video
processes a video stored in an Amazon S3 bucket
the completion status of the request is published to an Amazon Simple Notification Service topic
you can track each person w/in a shot and through the video across shots
in streaming mode, you can search faces against a collection with tens of millions of faces in real time
uses a Kinesis Video Stream as input, to process a video stream
Process:
User uploads image
upload to S3 triggers a Lambda function
Rekognition analyzes the image
Rekognition publishes result status to an SNS Topic
A Lambda function process the results and create an item in DynamoDB
Amazon Transcribe
enables you to add speech-to-text capability to applications
audio data is virtually impossible for computers to search and analyze. Therefore, recorded speech needs to be converted to text before it can be used in applications
uses a deep learning process called automatic speech recognition (ASR) to convert speech to text quickly and accurately
Can be used to transcribe customer service calls, to automate closed captioning and subtitling, and to generate metadata for media assets to create a fully searchable archive
Amazon Translate
neural machine translation service that deliver fast, high-quality, and affordable language translation
looks like the google service that offers same thing
Neural Machine Translation - a form of language translation automation that uses deep learning models to deliver more accurate and more natural sounding translation than traditional statistical and rule-based translation algorithms
Amazon Translate allows you to localize content (ie websites and applications) for international users, and to easily translate large volumes of text efficiently
Amazon Polly
service that turns text into lifelike speech, allowing you to create applications that talk, and build entirely new categories of speech-enabled products
Polly’s Text-to-Speech (TTS) service uses advanced deep learning technologies to synthesize natural sounding human speech