W06 - Database Concepts and Data Sources Flashcards

Question

what is a hierarchical database?

Answer 1

organizes its data at different levels and uses only one-to-many associations between levels (ex. zoning > parcel > owner)

Answer 2

builds connections across tables

Answer 3

the linkages between tables must be known in advance and built into the database at design time. could make the database complicated and inflexible

Answer 4

collection of tables (or relations) that can be connected to each other by keys

Answer 5

represents one or more attributes whose values can uniquely identify a record in a table cannot be null and should never change

Answer 6

one or more attributes that refer to a primary key in another table

Answer 7

primary and foreign key with the same name

Answer 8

simple and flexible each table in the database can be prepared, maintained and edited separately from the other tables tables can remain separate until a query or analysis requires attribute data from different tables to be linked together (efficient for data management and data processing)

Answer 9

the Soil Survey Geographic database, produced by the Natural Resources Conservation Service (NRCS) SSURGO data collected from field mapping, archiving data in 7.5 minute quadrangle units, organized by soil survey area, which may consist of a county, multiple counties, or part of multiple counties database consists of spatial and tabular data for each soil survey area, spatial data contained a detailed soil map, made of soil map units (which may be made of one or more noncontiguous polygons). ` a soil map unit represents a set of geographic areas for which a common land-use management strategy is suitable.

Answer 10

process of decomposition, taking a table with all the attribute data and breaking it down into small tables while maintaining the links between them

Answer 11

- avoid redundant data in tables that waste space and can cause data integrity problems - ensure attribute data in separate tables can be maintained and updated separately and linked when necessary - facilitate a distributed database

Answer 12

higher normal forms than the third can slow down data access and create higher maintenance costs.

Answer 13

one to one one to many many to one many to many origin and destination

Answer 14

one record in a table is related to only one record in another table

Answer 15

one record in a table may be related to many records in another table

Answer 16

many records in a table may be related to one record in another table (ex. several households may share the same street address)

Answer 17

many records in a table may be related to many records in another table

Answer 18

brings together 2 tables by using a common field or a primary key + foreign key ex. joining attribute data from a nonspatial data table to a feature attribute table recommended for one to one or many to one relationships doesn't work for one to many or many to many because only the first matching record from the destination will be assigned to the origin record

Answer 19

operation that temporarily connects 2 tables but keeps the tables physically separate works for all types of relationships, but slows down data access

Answer 20

relationships between objects, predefined and stored in a geodatabase. for the object-based data model can be one to one, many to one, one to many and many to many for the first 3, records in the origin are directly linked to records in the destination for many to many, an intermediate table sorts out the associations between records

Answer 21

# define each field in the table, usually include - field name - data width (# of spaces reserved for a field) - data type - number of decimal digits (part of the definition for the float type) field definition becomes a property of the field so it is important to consider how the field will be used before defining it

Answer 22

import attribute files, but if they don't already exist, then typing it in. for map unit symbols or feature IDs, best to enter them directly in a GIS. for nonspatial data, better to use word processing or spreadsheet packages (excel, notepad)

Answer 23

1) make sure that attribute data are properly linked to spatial data (feature ID should be unique and contain no null values) 2) verify the accuracy of attribute data

Answer 24

use attribute domains in the geodatabase attribute domains allows the user to define a valid range of values or a valid set of values for an attribute

Answer 25

adding or deleting fields and creating new attributes through classification and computation of existing attribute data

Answer 26

reduces confusion in using the data set and also saves computer time for data processing

Answer 27

data classification reduces a data set to a small number of classes (ex. reclassifying elevations into groups) 1) define a new field for saving the classification result 2) select a data subset using a query 3) assign a value to the selected data subset

Answer 28

1) define a new field | 2) compute the new field values from the values of existing attributes

Answer 29

allows you to examine the general trends in the data, take a look at subsets, focus on possible relationships between data sets purpose is to better understand the data and provide a starting point for formulating research questions and hypotheses

Answer 30

discipline that uses a variety of exploratory techniques and graphics to understand and gain insight into data

Answer 31

1) data exploration in GIS involves both spatial and attribute data 2) includes map and map features besides descriptive statistics and graphics, data exploration in GIS must also cover map-based data manipulation, attribute data query, and spatial data query

Answer 32

difference between the minimum and the maximum

Answer 33

the midpoint value (50th percentile)

Answer 34

the 25th percentile

Answer 35

the 75th percentile

Answer 36

average of data values

Answer 37

measure of the spread of the data about the mean sum of (value - mean) ^2 divided by # of values

Answer 38

square root of the variance

Answer 39

standardized score (x - mean) / standard deviation

Answer 40

line graph that plots the ordered data values against the cumulative distribution values the cumulative distribution value is (i - 0.5)/n the values fall between 0 and 1

Answer 41

a variation of scatterplots that uses varying-sized bubbles that represent a third variable

Answer 42

show min, first quartile, median, third quartile, max used to tell if the distribution is symmetric or skilled or if there are any outliers

Answer 43

quantile-quantile plots compare the cumulative distribution of a data set with some theoretical distribution (ex. a normal distribution) points in a QQ plot fall in a straight line if the data set follows the theoretical distribution

Answer 44

graphics displayed in multiple and dynamically linked windows where we can directly manipulate data points

Answer 45

allows the user to graphically select a subset of points from one chart and view related data points in other graphics

Answer 46

data visualization that focuses on geospatial data and the integration of cartography, GIS, image analysis, and exploratory data analysis

Answer 47

data classification, spatial aggregation, and map comparison

Answer 48

1) superimpose layers on top of one another and have them be represented on the map differently, or turn the layers on and off, or use transparency 2) use map symbols that can show two data sets ex. bivariate choropleth map ex. cartogram, where the unit areas are sized proportional to a variable (ex state population) and the area symbols are used to represent the second variable 3) temporal animation can be used if there is time-dependent data

Answer 49

process of retrieving data by working with attributes (ex. SQL commands)

Answer 50

data query language designed for manipulating relational databases, used in the GIS to communicate with a database select from where ex. select Parcel.Sale_date from Parcel where Parcel.PIN = 'P101' ex. select Parcel.Sale_date from Parcel, Owner where Parcel.PIN = Owner.PIN AND Owner_name = 'Costello' query joins the two tables and then actually queries it

Answer 51

1) only have to enter WHERE in the query expression box because typically the field and table have already been selected 2) an attribute query dialog is typically designed for a single table, so if the query involves attributes from two tables, they have to be joined first.

Answer 52

the where conditions with Boolean expressions and connectors

Answer 53

contains 2 operands and a logical operator operands can be a field, number, or text logical operators can be =, >, =, <> (not equal to) can also contain arithmetic operators

Answer 54

AND, OR, XOR, NOT XOR is the opposite of AND. only records that satisfy one and only one of the expressions are selected

Answer 55

add more records to a subset remove records from a subset select a smaller subset

Answer 56

works with a relational database, selects a data subset in the table and also selects records related to the subset in other tables

Answer 57

join operations combines the attribute data from 2 or more tables into a single table. relate dynamically links the tables but keeps the tables separate

Answer 58

process of retrieving a data subset from a layer by working directly with feature geometries. the results can be simultaneously inspected in the map, linked to the records in the table and displayed in charts can select features spatially using a cursor, a graphic or the spatial relationship between features

Answer 59

draw a shape (graphic) to select objects of interest (ex. restaurants within a 1 mile radius of a hotel)

Answer 60

selects features based on their spatial or topological relationships to other features ex. roadside rest areas within 50 mile radius of selected rest area; rest areas within each county spatial relationships used for querying include containment, intersect, and proximity

Answer 61

selects features that fall completely within features for selection

Answer 62

selects features that intersect features for selection

Answer 63

selects features that are within a specified distance of features for selection

Answer 64

features to be selected and features for selection share common boundaries and the specified distance is 0

Answer 65

use the raster instead of a field in the operand to query a feature can query multiple rasters, which may be integer, floating point, or a mix of both. querying multiple rasters directly is unique to raster data

Answer 66

features can be used to query a raster and it returns an output raster with values for cells that correspond to the query and no data in the other cells

W06 - Database Concepts and Data Sources Flashcards

(90 cards)