Data Entry Flashcards
When collecting data with paper surveys, what data entry options might be available?
Manual data entry of data from paper surveys
Scanning paper surveys and having data entered remotely
Optical character recognition
Survey specifications include
- A well-designed, clean survey
- Unique IDs and coding, in this case automatic coding
- Variable values, skip patterns, logical checks etc.
Entering mock values to ensure responses can be entered…
What type of software check?
Bench testing
Entering mock values to ensure skips work properly…
What type of software check?
Bench testing
Checking that batteries have a long-enough life for field work
What type of software check?
Device testing
Checking that data entered are stored accurately
What type of software check?
Data flow testing
Why is it important to include special codes, and why in the form of: -999, -888, etc?
Special codes such as -999, etc, are for missing values. We want to know whether a value is missing because a respondent not knowing the answer suggests a very different interpretation from refusing to answer. Using a negatives allows us to not confuse this missing response with real data (eg age). The other options relate to unique IDs and questionnaire design.
Which process is used for creating digital data collection software but not for creating manual data entry software
Device testing (e.g. battery life, extreme conditions)
If the surveyor records a respondent’s response incorrectly on the questionnaire, at what point will this likely be caught in the data entry process
It won’t.
The double data entry process is designed to catch data entry error, not surveyor error.
After entering our “audit” sample, to which dataset should we compare the resulting audited data?
The reconciled dataset from the first two entries after correcting for errors
What are the benefits of outsourcing data entry operations?
We would not need to invest in hardware (computer, power backups, etc)
We would not need to invest time in learning how to program software (assuming we have no prior knowledge)
When conducting data entry operations in-house, we should always invest in a power backup solution (e.g. Uninterrupted Power Supply or UPS) so that, at a minimum…
We have enough time to save whatever data has been entered and we can safely close the program
If we need electricity for 24 hours/day or 10 hours/day, we want to select a location that has a reliable power supply, and should not rely on self-generated power, or a power backup. The power back up is primarily to ensure we do not lose data we’ve already entered.
What is the main reason it is unadvisable to pay data entry operators (DEOs) per survey entered?
It encourages speed over quality
For large scale data collection, why might we require paper surveys over digital data collection?
If we do not have enough time to work on digital data collection software prior to survey launch
If we are administering exams to children
If we are collecting data from within a factory that does not allow digital devices for fear of losing intellectual property
Questionnaire –> 1st Entry –> 2nd Entry –> XXXXX –> Reconciliation –> Complete Dataset
Which step is missing in the data entry process?
Identification of discrepancies