Version control systems Flashcards

Question

Git commit

Answer 1

Send changes from working copy to local repository. git commit -m "Completed printing feature." Send changed data to the local repository, which then makes an effort to integrate it into the current state of the repository (even if it may have changed since the last update). The data is, at this point, not yet available in the remote repository and needs to be pushed.

Answer 2

Send changes from local to remote repository. git push Send the data from the local repository to the remote one.

Answer 3

Retrieve changes from the remote repository to local repository and working copy. git pull

Answer 4

Put new/changed file(s) under version control. git add Example1.java The add command tells Git to have the specified file(s) under version control, i.e., when committing to the local repository and pushing to the remote directory, it checks if there were changes and, if so, stores them as a new version.

Answer 5

git checkout -b BRANCHNAME The checkout command switches to another branch of development. When used with -b, a new branch is created before immediately switching to it.

Answer 6

git checkout –b printing // development git commit –m "Realized new feature." git checkout master git merge printing The merge command merges the changes of the specified branch into the currently active branch, i.e. when wanting to merge into master (the default) one has to switch to it first.

Answer 7

The current version of the software as the majority of people would use it.

Answer 8

Is the version of the software that contains the newest features, but have not been tested properly for a general release.

Answer 9

For late adopters that cannot update frequently. Supposed to still receive critical updates over long periods of time, but no new functionality.

Answer 10

Major.minor.patch Major: significant new program functionality Minor: new program functionality that is compatible with old functionality Patch: bug fixes and minor internal changes.

Answer 11

1. Data Large, sparse, replicated data 2. Coordination Communication, query data, similar setup 3. Calculation Scaling, parallelism, distribution

Answer 12

Solve massive data problems with distributed computers - storage and processing of big data - compensates for hardware failures MapReduce HDFS distributed file system, many cheap computers

Answer 13

Distributed computing system. Especially supports iterative and interactive/exploratory programming models as, e.g. needed by training algorithms for machine learning. Apache Spark is designed for in-memory data processing, which makes it much faster than traditional data processing frameworks like Apache Hadoop.

Answer 14

Data storage tool. Distributed, fault tolerant, column oriented non-relational database on top of HDFS. Logo is shark.

Answer 15

Data storage tool Distributed relational database engine with SQL support using Hbase.

Answer 16

Data storage tool Distributed column-oriented data store for real time analytics

Answer 17

Data storage tool Distributed wide-column data store for big data

Answer 18

Data storage tool Data warehouse for simplified/unified data query and analysis

Answer 19

Data storage tool Standard SQL queries on Hadoop for big data

Answer 20

Coordination tool Centralised service for distributed access to a hierarchical key-value store. Apache ZooKeeper is an open-source distributed coordination service designed to manage and synchronize the configuration information, naming, and various other distributed services across a large distributed system

Answer 21

Calculation tool High-level platform for creating programs that run on Hadoop. Pig's intent is to make development of applications for hadoop easier.

Answer 22

Calculation tool Collect and distribute data streams in real-time from/to interested clients

Answer 23

calculation tool Develop applications that process streaming data, e.g. from Kafka

Answer 24

ML tool Collection of distributed, scalable machine learning algorithms Distributed linear algebra framework

Answer 25

ML tool ML library for Java

Answer 26

ML tool Distributed deep learning library for Java

Answer 27

ML tool Data mining through machine learning

Answer 28

Arguing for using a "pre-release" version: Access to New Features and Improvements: Benefit: Pre-release versions often include the latest features, enhancements, and bug fixes. By using a pre-release version, you can gain early access to these improvements, allowing you to take advantage of new functionality and optimizations. Early Testing and Feedback: Benefit: Adopting a pre-release version allows you to participate in early testing and provide feedback to the developers. This can be valuable for both you and the development team, as it helps identify and address issues before the stable release. Your input could contribute to a more robust and reliable final release. Arguing against using a "pre-release" version: Stability and Reliability Concerns: Drawback: Pre-release versions are inherently less stable than their stable counterparts. They may contain bugs, incomplete features, or undergo significant changes that could impact the reliability of your system. Depending on your project's requirements, relying on a pre-release version might introduce unnecessary risks. Compatibility Issues: Drawback: Pre-release versions may not be backward compatible with the stable releases or with other libraries/tools in your ecosystem. This could lead to integration challenges and increase the complexity of your development and deployment processes. Using a stable release ensures a more predictable and compatible environment.

Answer 29

Look at slides

Version control systems Flashcards

(54 cards)