AM3 - Exam Flashcards
What does IDENTIFY mean
Select and state a choice or piece of information
What does DESCRIBE mean
Give an account of something by saying what it is, what it does, what it looks like, its size and scale, or how it relates to something else.
What does COMPARE mean
Identify differences and similarities between two or more options.
What does ANALYSE mean
Provide a breakdown of the topic to show your understanding
What does EXPLAIN mean
Set out the reasons for, showing understanding of the process and reasoning behind it.
What does JUSTIFY mean
Show the validity of a choice or point of view by discussing and discounting alternatives and weighing positives and negatives.
What is uncertainty
The concept of working with imperfect or incomplete data
Name three types of uncertainty
Irreducible, reducible and prediction
What are some examples of error in data and how can they be mitigated?
Missing data, duplicate entries, inconsistent formats and erroneous entries. Can be mitigated by data cleaning, data imputation and data validation.
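As an illustration, a minimal stdlib-Python sketch of those three mitigations on hypothetical sensor readings (the sensor names, values and the -999 sentinel are made-up assumptions):

```python
# Sketch of cleaning a list of sensor readings (hypothetical data):
# validation drops erroneous entries, de-duplication removes repeats,
# and imputation fills missing values with the mean.
raw = [("s1", 20.5), ("s1", 20.5), ("s2", None), ("s3", 21.0), ("s4", -999)]

# Validation: discard clearly erroneous entries (sentinel value -999)
valid = [(k, v) for k, v in raw if v != -999]

# De-duplication: keep only the first occurrence of each sensor id
seen, deduped = set(), []
for k, v in valid:
    if k not in seen:
        seen.add(k)
        deduped.append((k, v))

# Imputation: replace missing values with the mean of the known ones
known = [v for _, v in deduped if v is not None]
mean = sum(known) / len(known)
cleaned = [(k, v if v is not None else mean) for k, v in deduped]
print(cleaned)  # s2 is imputed with the mean of 20.5 and 21.0
```

In practice a library such as pandas would handle each of these steps, but the logic is the same.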
What are 3 types of bias
Sampling, algorithmic and confirmation.
What is sampling bias
A sample that is too small, or one that oversamples a particular group (e.g. one gender), so the data does not represent the population.
What is algorithmic bias
The wrong choice of algorithm can lead to systematic bias in the model's predictions.
What is confirmation bias
Once we train our model and evaluate its predictions, we may tend to retain information that affirms our preconceived notions, and exclude or remove data that goes against our theory. This leads to bias in the data, and therefore in the application's predictions. While this may satisfy us as developers, it can significantly reduce the application's usability.
What is irreducible uncertainty?
An inherent property of any dataset: there will always be some noise and randomness present in the data, as is reflected in reality. Examples include measurement noise (imprecise measurements), intrinsic variability (variation in biological systems or unpredictable human behaviour) and environmental factors (e.g. weather conditions affecting sensor readings).
Can irreducible uncertainty be removed/reduced
No: it cannot be reduced, but it can be managed by building models that are robust to noise.
What is reducible uncertainty
Uncertainty that arises from incomplete domain coverage in the data, i.e. uncertainty in the model due to a lack of data. Alternatively, we could be data rich but information poor (a high quantity of low-quality data). It can be reduced (e.g. by collecting more or better data, or by improving model training through cross-validation or regularisation), although it cannot be removed entirely.
What is prediction uncertainty
Encompasses both reducible and irreducible uncertainty; it represents the total uncertainty in the model's predictions.
What is the difference between uncertainty in data collection and analysis
- Data collection – accuracy, reliability and representativeness of the raw data
- Data analysis – focuses on the model's ability to correctly interpret and predict from the data (e.g. is the choice of model correct for the data? Has the model overfit to the training data, and therefore won't generalise to new data?)
What is data exposure
When sensitive information is accessible to unintended or unauthorised parties. It indicates missing security controls or processes (e.g. a lack of encryption mechanisms). The exposed data may include PII (personally identifiable information).
What is data linking
Combining data from different sources or datasets to create a more comprehensive, enriched dataset. Involves identifying and merging records that refer to the same entity (e.g. the same person or product). Matching can be exact or fuzzy; fuzzy matching helps when names are not in the same format (e.g. "first last" vs "first middle last" vs "first initial and last name").
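As a sketch of fuzzy matching, stdlib Python's difflib can score name similarity; the `similar` helper, the example names and the 0.75 threshold are illustrative assumptions, not a standard API:

```python
from difflib import SequenceMatcher

def similar(a: str, b: str, threshold: float = 0.75) -> bool:
    """Fuzzy match: treat two names as the same entity if their
    character-level similarity ratio meets the threshold."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= threshold

# Exact string comparison fails across name formats; fuzzy matching links them.
print(similar("Jane Smith", "Jane A. Smith"))  # True  (same person, middle initial added)
print(similar("Jane Smith", "John Doe"))       # False (different entity)
```

Real record-linkage tools use more robust measures (e.g. token-based or phonetic matching), but the idea of scoring near-matches rather than requiring equality is the same.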
What are 3 different types of data storage
Relational database, data lake, data lakehouse
What are 4 different types of data storage LOCATIONS?
Local, cloud, remote, temporary
What is local storage and what are the advantages and disadvantages?
Storing data on physical devices (hard drives, SSDs) physically located within the organisation. Advantages: full control over data and storage devices; can be highly secure if proper measures are in place, since data is not transmitted over the internet; high-speed access without reliance on the internet. Disadvantages: limited by physical space and hardware capacity (scaling up can be costly and cumbersome); high upfront hardware costs; data is only accessible from the locations where the storage devices are located.
What is cloud storage and what are the advantages and disadvantages?
Storing data on servers managed by third-party providers and accessed over the internet. Advantages: scalable on demand; accessible from anywhere with an internet connection; pay-as-you-go models reduce upfront costs. Disadvantages: data breaches can occur, and security depends on the provider's measures; dependent on an internet connection, so outages can affect availability; less control over the physical storage infrastructure.
What is remote storage and what are the advantages and disadvantages?
Storing data on servers located offsite, typically within the same organisation but at a different physical location, accessed via network connections. Advantages: backup processes can provide disaster recovery in case of site-specific failures; security can be controlled by the organisation; often less expensive than local storage for large datasets. Disadvantages: higher latency compared to local storage due to network transmission; complex, requiring robust networking infrastructure and management; dependent on network reliability and bandwidth.
What is temporary storage and what are the advantages and disadvantages
Storage used for short-term data retention, often in memory (RAM) or cache solutions. Advantages: extremely fast access times; cost-effective for short-term data needs; reduces the load on persistent storage by offloading transient data. Disadvantages: volatile (data can be lost when power is turned off or the system restarts); limited in size compared with other storage methods; not suitable for long-term data retention.
What are the differences between supervised and unsupervised learning?
Supervised - input data is labelled
Unsupervised - data not labelled
Supervised - used for prediction
Unsupervised - used for analysis
Supervised - new data classified based on a labelled training set
Unsupervised - data grouped into clusters discovered from the data itself
Supervised - Divided into regression and classification
Unsupervised - mainly clustering
Name 4 types of supervised learning (2 for regression and 2 for classification)
Regression - linear and ridge
Classification - Logistic and decision tree
Name 2 types of unsupervised learning
K-means clustering, hierarchical clustering
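To make the K-means idea concrete, here is a minimal 1-D sketch in stdlib Python (toy data and a hand-picked initialisation; real use would call a library such as scikit-learn):

```python
# Minimal 1-D k-means sketch: assign each point to its nearest centroid,
# move each centroid to the mean of its cluster, repeat until stable.
points = [1.0, 1.5, 2.0, 10.0, 10.5, 11.0]
centroids = [1.0, 10.0]  # k = 2, simple initialisation

while True:
    # Assignment step: index of the nearest centroid for each point
    labels = [min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
              for p in points]
    # Update step: each centroid becomes the mean of its assigned points
    new_centroids = [
        sum(p for p, l in zip(points, labels) if l == i) / labels.count(i)
        for i in range(len(centroids))
    ]
    if new_centroids == centroids:
        break
    centroids = new_centroids

print(centroids)  # converges to [1.5, 10.5] for this toy data
```

The two steps (assign, then update) are exactly the loop an exam answer on K-means should describe; hierarchical clustering instead builds a tree of merges (a dendrogram).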
Mean absolute error
average of the absolute differences between predicted values and actual values
Mean squared error
average of the squared differences between predicted values and actual values
R-Squared
goodness of fit of the regression model (perfect fit would be 1)
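The three regression metrics above can be computed by hand; a short sketch with hypothetical predictions (the values are made up for illustration):

```python
# MAE, MSE and R-squared computed directly from their definitions.
actual = [3.0, 5.0, 7.0, 9.0]
pred   = [2.5, 5.0, 7.5, 9.0]
n = len(actual)

# MAE: average absolute difference between predictions and actuals
mae = sum(abs(a - p) for a, p in zip(actual, pred)) / n

# MSE: average squared difference (penalises large errors more)
mse = sum((a - p) ** 2 for a, p in zip(actual, pred)) / n

# R-squared: 1 - (residual sum of squares / total sum of squares);
# a perfect fit gives exactly 1.
mean_a = sum(actual) / n
ss_res = sum((a - p) ** 2 for a, p in zip(actual, pred))
ss_tot = sum((a - mean_a) ** 2 for a in actual)
r2 = 1 - ss_res / ss_tot

print(mae, mse, r2)  # 0.25, 0.125, 0.975
```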
Accuracy
How often the model is correct overall
Precision
How often the model is correct when predicting the target class (i.e. of all +ve predictions, how many are truly +ve)
Recall
Of all real positive cases, how many are predicted positive?
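The classification metrics follow directly from the confusion counts; a sketch with toy labels (1 = positive class, values invented for illustration):

```python
# Accuracy, precision and recall from true/false positive/negative counts.
actual = [1, 1, 1, 1, 0, 0, 0, 0]
pred   = [1, 1, 0, 0, 1, 0, 0, 0]

tp = sum(1 for a, p in zip(actual, pred) if a == 1 and p == 1)  # true positives
fp = sum(1 for a, p in zip(actual, pred) if a == 0 and p == 1)  # false positives
fn = sum(1 for a, p in zip(actual, pred) if a == 1 and p == 0)  # false negatives
tn = sum(1 for a, p in zip(actual, pred) if a == 0 and p == 0)  # true negatives

accuracy  = (tp + tn) / len(actual)  # how often correct overall
precision = tp / (tp + fp)           # of all +ve predictions, how many truly +ve
recall    = tp / (tp + fn)           # of all real +ve cases, how many were found
print(accuracy, precision, recall)
```

Note how the three can disagree: here the model is cautious, so precision (2/3) exceeds recall (1/2).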
Inertia
sum of squared distances between each data point and the centroid of its assigned cluster
Silhouette score
measures how similar each point is to its own cluster compared with other clusters (compactness vs separation); ranges from -1 (poor) to 1 (well clustered)
CH-Index
ratio of between-cluster dispersion to within-cluster dispersion; higher values indicate more compact, better-separated clusters
Cophenetic correlation coefficient
measures how faithfully the dendrogram preserves the pairwise distances between the original data points
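Of these clustering metrics, inertia is the simplest to compute by hand; a sketch on a toy 1-D clustering (points, labels and centroids are invented for illustration):

```python
# Inertia: sum of squared distances from each point to the centroid
# of its assigned cluster. Lower inertia = tighter clusters.
points    = [1.0, 2.0, 10.0, 11.0]
labels    = [0, 0, 1, 1]       # cluster assignment for each point
centroids = [1.5, 10.5]        # centroid of each cluster

inertia = sum((p - centroids[l]) ** 2 for p, l in zip(points, labels))
print(inertia)  # four squared distances of 0.25 each -> 1.0
```

Silhouette, CH-index and the cophenetic coefficient are normally taken from a library, as they compare distances across clusters rather than within one.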
What are 3 principles of ethics
- Contribute to society and human well-being – all people are stakeholders in computing.
- Avoid harm, be honest and trustworthy, be fair and take action not to discriminate.
- Respect privacy and honour confidentiality
What is GDPR
Governs the personal data of EU citizens and aims to safeguard it by enhancing privacy rights and imposing rules on data handling and processing by organisations, e.g. requiring explicit consent for data collection. Non-compliance can result in hefty fines (up to €20m or 4% of annual global turnover) and reputational damage.
What are the principles of the Government Data Ethics Framework (2018)
- Transparency – be transparent about data sources, methods, and intentions. Communicate clearly with the public and stakeholders.
- Accountability - Establish clear responsibilities and accountability mechanisms. Ensure all team members understand their ethical obligations
- Use data ethically - Share best practices and resources. Provide guidance and support to others on ethical data use
What are the Asilomar AI Principles
A set of guidelines developed to ensure that artificial intelligence (AI) technologies are beneficial and safe for humanity. They focus on three areas: research; ethics and values; and longer-term issues (e.g. risks, self-improvement and the common good).
What is the BCS Code of Conduct
A set of professional standards and ethical guidelines that members of BCS, The Chartered Institute for IT, are expected to follow. It outlines the principles and standards of behaviour required to maintain the integrity and reputation of the profession. The key elements are:
1. Public Interest: Members must have due regard for public health, privacy, security, and the well-being of others and the environment. They should avoid harm and ensure their work contributes to the public good.
2. Professional Competence and Integrity: They must keep their skills and knowledge up-to-date and act with honesty and integrity at all times.
3. Duty to Relevant Authority: Members must respect the rules and procedures of their employer or any relevant authority.
4. Duty to the Profession: Members should act in a manner that promotes trust and confidence in the profession.
What are the differences between agile and waterfall PM styles (for approach, flexibility, documentation and project size and scope)?
- Approach:
o Waterfall: A linear and sequential approach where each phase (requirements, design, implementation, verification, maintenance) is completed before the next one begins.
o Agile: An iterative and incremental approach where projects are divided into small cycles called sprints, allowing for continuous feedback and adjustments.
- Flexibility:
o Waterfall: Inflexible; changes are difficult to implement once the project is underway as each phase must be completed before moving to the next.
o Agile: Highly flexible; changes and improvements can be made throughout the project based on ongoing feedback.
- Documentation:
o Waterfall: Extensive documentation is required upfront and throughout each phase.
o Agile: Emphasises working software over comprehensive documentation, focusing more on collaboration and responsiveness to change.
- Project Size and Scope:
o Waterfall: Better suited for projects with well-defined scope and requirements that are unlikely to change.
o Agile: Ideal for projects with evolving requirements and scope, allowing for frequent reassessment and adjustment.
What are necessary resources and architecture that may be required for a ML project?
The necessary computational resources (CPU, memory, storage) and software tools (libraries, frameworks, development environments) for the machine learning models. These may need to scale with the size of the dataset and the complexity of the model, so consider increasing computational demand as the system grows. Distributed computing, parallel processing and optimisation techniques may be needed to improve performance and reduce latency. Appropriate security measures are also needed to protect sensitive data, ensure model robustness against adversarial attacks and comply with privacy regulations (GDPR); techniques such as encryption and access controls can enhance security and privacy.
What are the benefits of integrating new solutions with existing technology?
Maximizes investments, streamlines processes, and enhances data accessibility. Compatibility ensures seamless integration, leveraging existing infrastructure and workflows while minimising disruptions. By embedding machine learning capabilities into familiar interfaces, user adoption is enhanced, leading to improved efficiency. Leveraging existing resources also boosts scalability and performance, distributing computational tasks efficiently. Maintenance and support are simplified through centralized management, reducing operational overhead.
What to include in Q1
Uncertainty (different types), error and bias
What to include in Q2
Data Exposure, data Linkage, data Storage, data Analysis and data Visualisation (ELSAV)
What to include in Q3
Ethical, legal, regulatory and professional constraints, and the business impact of non-compliance
What to include in Q4
Resources and architecture needed to solve the problem. Project management workflows (compare 2) , UX.
What legal issues should be considered (Q3)
- Consent from people to use their data (GDPR)
- Anonymise data to ensure privacy
- Does the business have the right to use 3rd party data (data sharing agreement etc)
What social issues should be considered (Q3)
- Incorrect predictions could lead to customer dissatisfaction
- Is there an impact on employees (e.g. increased workload?)
- Need to be transparent about how data is used and predictions are made to maintain public trust in company
What ethical issues should be considered (Q3)
- Model should be free of bias (should not perpetuate inequalities)
- Clear accountability for model development, deployment and for monitoring impact
- Transparency about the model's limitations and the uncertainty in its predictions
What professional issues should be considered (Q3)
- Engage correct people in model development
- Adhere to any professional standards
- Regularly update model to reflect changes
- Provide training on how to use model and outputs