Topic 8 Flashcards
Big Data & Machine Learning in the Financial Industry
Big Data
used broadly to describe the storage and analysis of large and/or complicated data sets using a variety of techniques including AI
Artificial Intelligence (AI)
the theory and development of computer systems able to perform tasks that traditionally have required human intelligence
Natural Language Processing (NLP)
allows computers to ‘read’ and produce written text or, when combined with voice recognition, to read and produce spoken language.
Machine Learning (ML)
a method of designing a sequence of actions to solve a problem, known as algorithms, which optimize automatically through experience and with limited or no human intervention
Supervised learning
algorithm is fed a set of ‘training’ data that contains labels on some portion of the observations.
Unsupervised learning
situations where the data provided to the algorithm does not contain labels. The algorithm is asked to detect patterns in the data by identifying clusters of observations that depend on similar underlying characteristics
Deep learning
a form of machine learning that uses algorithms that work in ‘layers’ inspired by the structure and function of the brain. Deep learning algorithms, whose structure are called artificial neural networks, can be used for supervised, unsupervised, or reinforcement learning
Sentiment indicators
Unsupervised textual analysis method, measures of happy/unhappy or other emotion (sentiment) about a company or unit.
Trading signals
leading indicators that provide sufficiently high-quality information to trade based on
Fraud detection
Detecting fraud
RegTech
applications by financial institutions for regulatory compliance; or applications by financial regulators to monitor non-compliance
InsurTech
FinTech, for insurance
Chatbots
Provide human-like tech support, cutting costs (edit. and super annoying)
Know your customer (KYC)
Machine learning is increasingly used in remote KYC of financial services firms to perform identity and background pre-checks. It is predominantly used in two ways: (1) evaluating whether images in identifying documents match one another, and (2) calculating risk scores on which firms determine which individuals or applications need to receive additional scrutiny.
Reinforcement learning
the algorithm is fed an unlabeled set of data, chooses an action for each data point, and receives feedback (perhaps from a human) that helps the algorithm learn