Week 5 - Open Data, Reproducibility and Replicability Flashcards
What is verifiability
A statement is meaningful when it can be verified empirically
What is the induction problem
To establish a universal law like "All swans are white" we would have to observe all swans, which is impossible: a finite number of observations can never prove a universal statement
What is falsifiability
A statement is a valid scientific theory if it makes predictions that can be tested and falsified by a counterexample
What is a non-falsifiable theory
“One day there will be a human that can breathe underwater”
What is a Desideratum
In empirical work, we want to connect observations/measurements with a falsifiable hypothesis or theory
What is Hypothesis testing
For falsifiability, we establish a null hypothesis and an alternative hypothesis, and test whether the data lets us reject the null
What is transparency
In the ideal world, a study is fully transparent in terms of what hypothesis is being tested, what methodology was used, and what results were obtained.
What is Open data
All data should be available in order for other researchers to evaluate the study, or reuse materials
Open data is required or not required?
Required in some academic journals, but not all
What could be the reasons why data is not shared?
- No time
- No access
- Privacy
- Proprietary data (companies don't want to share their data)
How to combat data not being shared (no open data)?
Enforce open data as a journal policy and peer-review practice
Open data is necessary but not sufficient to guarantee good research
What is replicability
The ability of a researcher to duplicate the results of a prior study if the same procedures are followed but new data is collected.
What is reproducibility
The ability of a researcher to duplicate the results of a prior study using the same materials as were used by the original investigator.
What are research artifacts
Any concrete object that was used in the execution of a study and that is needed to reproduce the study. Examples:
* Paper/report
* Dataset
* Model
* Software
What are the best practices for the paper/report artifact
Peer review and checklists
What are the best practices for the dataset artifact
Data annotation
What are the best practices for the model and software artifacts
ML best practices
What problems can we encounter when writing an ML paper
- The data: the way we divide our data into splits may impact performance
- The network: many current models are DNNs, meaning they consist of many layers with low interpretability
How can we combat these problems? Give 2 methods
- Reproducibility from within: things researchers can do to increase the quality of their research.
- Reproducibility from outside: things reviewers should pay attention to
What is generalization
A model's ability to adapt properly to new, previously unseen data, drawn from the same distribution as the one used to create the model
What is overfitting
The model performs very well on the training data but poorly on the validation data.
what is underfitting
The model performs poorly on both the training data and the validation data.
How to prevent loss hacking
Require authors to include loss statistics
How to combat underspecification (not enough detail, so the study is not reproducible)
Provide the selected train/validation/test split, and ideally the code used to create the split. This makes sure that the report is reproducible
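The idea above can be sketched in a few lines: a minimal, hypothetical helper (names and split ratios are my assumptions, not from the course) that shuffles with a fixed seed so anyone running the same code recreates the exact same split.

```python
import random

def reproducible_split(items, seed=42, train_frac=0.8, val_frac=0.1):
    """Split items into train/val/test; a fixed seed makes the split recreatable."""
    rng = random.Random(seed)   # dedicated RNG, independent of global random state
    items = list(items)
    rng.shuffle(items)
    n_train = int(len(items) * train_frac)
    n_val = int(len(items) * val_frac)
    return (items[:n_train],
            items[n_train:n_train + n_val],
            items[n_train + n_val:])

train_set, val_set, test_set = reproducible_split(range(100))
```

Running the function again with the same seed yields the identical split, which is exactly what sharing the split-creation code buys you.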
How to combat label imbalance (a skewed distribution of labels in the dataset)
Make sure that the relevant statistical properties (e.g. label proportions) of the intended splits are the same.
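One common way to keep label proportions equal across splits is stratified splitting: split each label group separately with the same ratio. A minimal sketch (function name and ratios are my own, not from the course):

```python
import random
from collections import defaultdict

def stratified_split(labels, seed=0, train_frac=0.8):
    """Split indices per label so every split keeps the same label proportions."""
    by_label = defaultdict(list)
    for i, y in enumerate(labels):
        by_label[y].append(i)          # group example indices by their label
    rng = random.Random(seed)
    train_idx, test_idx = [], []
    for idxs in by_label.values():     # split each label group with the same ratio
        rng.shuffle(idxs)
        cut = int(len(idxs) * train_frac)
        train_idx += idxs[:cut]
        test_idx += idxs[cut:]
    return train_idx, test_idx

labels = ["spam"] * 20 + ["ham"] * 80   # 20% spam overall
train_idx, test_idx = stratified_split(labels)
```

Both the train and test splits end up with the same 20% spam proportion as the full dataset.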
What is cherry-picking
Choosing, accidentally or deliberately, particularly favourable seed values and reporting only those results
how to combat cherry picking
Seed averaging: a simple solution is to average performance results over multiple runs with different seeds
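Seed averaging can be sketched as below. The experiment function here is a hypothetical stand-in (its name and scores are my assumptions); the point is that we report the mean and spread over seeds rather than one possibly lucky run.

```python
import statistics

def seed_averaged(run_experiment, seeds=(0, 1, 2, 3, 4)):
    """Run the full experiment once per seed; report mean and std, not one run."""
    scores = [run_experiment(seed) for seed in seeds]
    return statistics.mean(scores), statistics.stdev(scores)

# Hypothetical experiment whose score fluctuates with the seed.
def run_experiment(seed):
    return 0.80 + 0.01 * seed

mean_acc, std_acc = seed_averaged(run_experiment)
```

Reporting "mean ± std over 5 seeds" makes cherry-picking a single favourable seed impossible.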
What could be the problem with a classification metric computed from a contingency table
The table could be imbalanced (far more items in one class), which makes accuracy misleading.
What metrics are preferred when the data in a contingency table is imbalanced?
Precision and recall
Give the accuracy formula
Acc = # correct predictions / # items
Give the precision formula (ham/spam example)
Prec = # correctly marked as spam / # marked as spam
How can we combine precision and recall
We can define a weighted combination using the F_beta score: F_beta = (1 + beta^2) * Prec * Rec / (beta^2 * Prec + Rec). With beta = 1 this is the harmonic mean of precision and recall (F1); beta > 1 weights recall more heavily.
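The metric cards above can be collected into one small sketch. The spam-filter counts in the example are hypothetical, chosen only to make the arithmetic easy to check.

```python
def accuracy(tp, fp, fn, tn):
    return (tp + tn) / (tp + fp + fn + tn)

def precision(tp, fp):
    return tp / (tp + fp)   # of everything marked as spam, how much really was spam

def recall(tp, fn):
    return tp / (tp + fn)   # of all actual spam, how much did we catch

def f_beta(tp, fp, fn, beta=1.0):
    # Weighted combination of precision and recall; beta=1 gives F1.
    p, r = precision(tp, fp), recall(tp, fn)
    return (1 + beta**2) * p * r / (beta**2 * p + r)

# Hypothetical spam filter on 100 mails:
# 40 spam caught (tp), 10 ham wrongly flagged (fp),
# 10 spam missed (fn), 40 ham correctly kept (tn).
p = precision(40, 10)   # 40/50 = 0.8
r = recall(40, 10)      # 40/50 = 0.8
f1 = f_beta(40, 10, 10)
```

With precision and recall both at 0.8, the F1 score (their harmonic mean) is also 0.8.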
What are Review Checklists
Review checklists in research methods are systematic tools used to evaluate the quality, rigor, and completeness of a research study. They ensure that all necessary aspects of the research process are addressed, including study design, data collection, analysis, and reporting. These checklists help maintain consistency and transparency, aiding in the replication and validation of research findings
What are the two issues why we need review checklists
Central issue: in practice, reviewing takes place on a pro bono basis, so little time is available for reviewing
Reviewing vs reproduction: there is not always time to reproduce or replicate a given study
What is the name of system where authors and reviewers are aware of best practices
Review checklists
What are the 5 questions asked in the review checklist
- General content: contributions, introduction, research questions
- Scientific artifacts: are they referenced? Under what licence?
- Computational experiments: is the environment described? Are detailed results given?
- Human participants: demographics, recruitment
- AI assistants: was AI used?
What is an academic sin
Plagiarism: copying text or data from other researchers and pretending it is your own.
What are code licenses
Licenses that specify how code may be used, modified, and redistributed; violating them might lead to lawsuits
What is a research proposal
- Motivation: main idea
- Application: thesis, for funding
There are different types of proposals: reproduce/replicate a study, or propose a new framework
What is the structure of a research proposal. Give 6 points
- Background/context
- Research question
- Contributions
- Methodology
- Planning: timeline
- Resources
How to set up a good proposal?
Name 3 possible things
- Get creative
- Write idea first
- Be SMART
Name the SMART attributes
- Specific
- Measurable
- Achievable
- Relevant
- Time-bound