02 The Tech in FinTech Flashcards
What is Fintech?
Fintech is the application of modern technology by established financial institutions or by new players
—Massive increase in computing power, storage capacity and connectivity in past decade
—Especially connectivity has become mobile, since mobile phones are basically available anywhere.
–> wearables and samrt phones have become widespread
What is Cloud computing?
Is the delivery of on—demand, off-site computing and storage resources.
- provided by third paties via offsite servers hosted in advance
What are the benefits of Cloud computing?
General benefits: flexible, scalable, accesible
- reduces up-front investments
- allows established institutions to outsource infrastructure and launch new systems and services
attenuating legacy technonolgy debt issues
What are the factors contributoing to TechFin success?
- Already have a very broad userbase
- have the technological expertise
Large technilogy companies are increasingly entering the finance space
Reasons why the amount of data for FI is increasing?
- Digitization: from analog to digital makes collecting, transmitting, and analyzing data substainially easier
- APis and Open Banking Regulation: they forces FI to share data, thus making collecting data easier
- Computing and storage capacities: advanes in network bandwidth and processing power make working with large datasets feasible
Reasons why the amount of data for FI is increasing? Last two reasons
- Mobile devices: communication habits, health data, new mobile devices constantly collect data
- New “Data Awareness”:
Increasing awareness in the usefulness of data, combined with cheap storage space, lead institutions to collect rather than discard data
What are the three different characteristics of Big Data in Finane?
- Large in scale
- High dimensionality
- Complex structure
Preview of AI and ML?
What do they prodivide?
They potentially provide the means to analzyze the big datasets
–>Its all about finding correlations or complex relationships within data
–>(Weak or Narrow) AI: Machines mimicking human behavior when solving specific tasks
Drivers of recent advances in AI and ML?
- Computing Power: was necessary to tacle complex problems of machine learning
- Big Data: since many types rely on large datasets to train and evaluate
- Algortihms: using ML algorithm is much more accessible today than only a few years ago.
What is Machine Learning?
AI gets better with more experience and as you feed it more data, AI does not need this feature.
–> ML uses tools form computer science and statistics
General use cases of ML MEthods: Classifciation, Clustering, Regression, Prediction, Dimensionality reduction
— Training AI without ML is difficult and requires a lot of work by experts
–>Generally speaking machine learning performs well when looking at (or for) non-linear relationshios in the data.
Supervised Machine Learning
General description
Supervised learning maps an input to an output
–>it uses labeld data to lean about the mapping from input to output
Supervised Machine Learning
Some Statistical methods used
1. Classification: Predicting labels or classes of observations (discrete)
* Logistic regression
* Naive Bayes classifier
* Support-vector machines
* Decision trees
2. Regression: Predicting continous variables
* Linear and non-linear regression
* Ridge regression and least absolute shrinkage and selection operator
Supervised Machine learning
Short Process description
First dataset, the training data, is used to build a first model
* more data is used to validate and adjust the model
–>as more data is added, the algorithm learns by incorporating new information
–>The final model can then be applied to label unlabeld data
Unsupervised Machine Learning
Description
Uses unlabeld data, with the aim to discover hidden, potentially interesting structure within the data
* subgroups and clusters
* Patterns
Unsupervised Machine Learning
Advantages and Disadvantages
Advantages: (compared to Supervised Learning)
* unlabeld data is much easier to obtain, since no prior classification
* can identify patterns that may not be noticed by experts
Disadvantages: (compared to supervised learning)
* Usually requires even larger datasets
* Errors and anomalies that experts would have spooted might strongly impact the outcome
Unsupervised Learning
Some statistical methods used
-
Clustering
* Find objects that are similar by looking at distances (minimizing within distance, maximize between distance)
–>Example tools: Hierachical methods, K-means clustering
2. Dimensionality Reduction
* Reduce large dataset to small dataset by focusing on important features
* Output may be used in other analyses or visualization
–>Example tpols: Principal component analysis (PCA)