Interview Topics Flashcards
Probability and statistics
Conditional probability (Bayes' Theorem) Probability distribution Hypothesis testing (null hypothesis, p-values, confidence intervals) Covariance and correlation
Machine Learning
Supervised Learning (Linear Regression, k-NN, SVM, Random Forest, Gradient Boosting) Unsupervised Learning (k-means, hierarchical clustering) Deep Learning General Predictive Modelling (Choosing the right evaluation metrics, train and test sets, cross-validation)
Computer science
Coding (Python/R)
Data Structures (Lists, Hash Tables, Stacks, Queues, Trees, Graphs)
Algorithms (searching, sorting, graph traversals)
Databases (SQL, noSQL)
Distributed computing (mapReduce, spark, Hadoop)
Data Engineering
Data Wrangling, Cleaning and Visualisation
Feature Engineering
Domain Knowledge
Knowledge of the industry, key metrics of the field
Behavioural and “Fit”
Alignment to company culture
Teamwork: Worked with someone different to me
Ability to Adapt: Lots of pressure, when you failed and how you dealt with that
Communication: Time you successfully persuaded someone at work. Technical concept to a non-technical audience
STAR
Situation
Task
Action
Result