Tech Qs Flashcards
Is user uploaded data directly useable by AI? It hasn’t been diagnosed.
We’re reviewing the images uploaded with a dermatologist to get a diagnosis.
Is user uploaded data directly useable by AI? It hasn’t been diagnosed.
The digital scan is 1200 dpi which is pretty good quality. These slides were originally created by detail-oriented dermatologists, so they’re actually amazing quality.
What is the quality of the user uploaded images?
Getting a good photo requires 2 things: 1) Giving users appropriate guidance and 2) direct feedback on the quality of the image. A telemedicine company we know has gotten to where 99% of the images upload are usable, so it’s a known science in teaching how to take a good photo.
Are the digital scans useable by AI as is?
It depends on the image and the disease. We have internal tools that enables rapid data sorting and general cleanup. For AI usage, for some images we need to do additional steps like segmentation and background subtraction which we are working on.
How do you make sure the images you’re getting from doctors are correctly labeled and organized?
We have customized tools to ensure we have the appropriate other information associated with each image and we’re using standardized taxonomy Snomed for organizing and labeling each disease.
Why do you have a consumer tool? It seems like having fully diagnosed images from doctors is really the highest quality data, so this consumer product seems like a distraction.
The consumer tool allows us to build our own dataset of primarily healthy data, so we know what “healthy” looks like versus diseased from doctors.
What’s special about your AI model?
AI accuracy requires the right algorithms and the right data to drive high quality. There’s a new algorithm every 6 months, so we have the people on the team to ensure we’re using the right algorithm for our application, but it’s really all about the data.
If your AI isn’t special, what is protectable about what you’re doing?
The data we license from hospitals that’s digitized we have exclusive commercial rights to, in addition to the images uploaded from people at home. Our data is our protectable IP.
Does the AI work?
Yes, we have built an AI model resulting in 80% top 5 accuracy across 23 diseases, which means we identify the right disease in the top 5 matches 4 of 5 times. More data makes this better. Right now our accuracy is between a primary care doctor and a dermatologist.
What AI model are you using?
Tensorflow’s Inception-ResNet-V2 with transfer learning.
How were you able to build a high accuracy model on only 13,000 images?
We augmented the dataset, including rotating, shifting, flipping and zooming caused accuracy to increase by 5-10% top 5 accuracy
What off-the-shelf model structure are you using?
We manually compared VGG16, VGG19, ResNet50, Xception, InceptionV3, and InceptionResNetV2 model structures and found InceptionResNetV2 to have the highest performance.
How do you know the AI is training on the right data if you’re mixing user pics with dermatologist pics?
Each image within our database has been tagged with it’s original source, so we can keep track of levels of quality, and make sure we’re only testing on images with high quality verification.
What if someone uploads a picture of a disease your algorithm hasn’t been trained on?
We have an “other” option in the AI model, where it’s been trained on a variety of conditions not in our model. As we add diseases over time, the number of diseases in this category will reduce.
Can you improve the AI to be better than 80% top 5?
The accuracy relies on the quality, volume and variety of the dataset. That’s the main reason we are focusing on getting more data.