week 4: big data Flashcards

Question

if extracting from more natural, unstructured, human-written text, .... may help

Answer 1

some natural language processing

Answer 2

POS tagging, syntactic parsing, semantic word categories

Answer 3

POS or phrase tags (in prefillers/fillers)

Answer 4

prefiller pattern, filler pattern and postfiller pattern

Answer 5

independent maually-annotated test data not used during system development

Answer 6

N (total # correct extractions in solution template) E (total # of slot/value pairs extracted by the system) C (# of extracted slot/value pairs that are correct)

Answer 7

IE would be unnecesarry

Answer 8

difficult to format, difficult to mannualy annotate docs with good XML tags, commercial industry might be reluctant to provide it

Answer 9

automatically transforming semi structured or unstructured data into xml compatible format

Answer 10

first parsing web pages into dom trees

Answer 11

extraction patterns can be specified as paths from the root of the dom tree to the node containing the text to extract

Answer 12

regex patterns to idenitfy proper portion of final character data node

Answer 13

representational state transfer

Answer 14

collection of netwrok architecture principles which outline how resources are defined and addressed. Its not a standard, but uses several standarrds.

Answer 15

communications protocol that allows retrievin interlinked text documents (www)

Answer 16

to caputre characteristic of web that made web succesfull (make request, http protocol, URI)

Answer 17

nouns (resources) -> unconstrained (full website) verbs -> constraint (GET) representations -> constrained (XML

Answer 18

conceptual mapping to a set of entities represented with global identifier

Answer 19

represent actions to be performed on resources

Answer 20

how clients asked for info theyseek

Answer 21

updates a resources

Answer 22

creates resource

Answer 23

removes resource identified by uri

Answer 24

how data is represented/returned to the client for presentation (javascript or xml, can be multiple)

Answer 25

client application changes/transfers state with each resource representation

Answer 26

html to define content of web pages | css to specify layout of webpagesjavascript to program behavior of webpages

Answer 27

form validation, page embellishments and special effects, dynamic content manipulation, emerging web 2.0

Answer 28

increased responsivess and interactiveness of webpages. exchanging small amounts of data with server entire web page doesnt have to be reloaded each time user performs action

Answer 29

a technology itself but a term to refer to use of group of technologies

Answer 30

xmlhttprequest object (page doesnt need to refresh)

Answer 31

user driven, views defined by urls, simple user interaction model synchronous interacton

Answer 32

client event occurs an xmlhttprequest is created and configured asynchronous request made to server via xmlhttprequest object server processes request and returns data, client executes a callback in the xmlhttprequest object html dom updated based on response data

Answer 33

document object model, platform and language independent way to represent xml

Answer 34

hyp application development/maintenance cost behavior not weblike security issuesp

Answer 35

sisd, simd, misd, mimd (between control unit and processor unit)

Answer 36

single instruction stream, single datastream ->serial procoessor

Answer 37

single instruction stream, multiple data stream -> array processor

Answer 38

multiple instruction stream multiple data stream -> multiprocessor or multicomputer

Answer 39

multiple instruction stream, single data stream -> no examples

Answer 40

proccessors cant determine the state of the other processors (need message)

Answer 41

distributed algos, but they can determine state of other processors

Answer 42

fraction of time that a processor spends on perfoming useful work

Answer 43

a regular expression is a special string used to define string patterns

Answer 44

they are responsible to defining the context in which the filler patterns operates

Answer 45

text categorisation is usually used. You need to treat each of the possible values of the slot as a category and classify the entire document to determine the correct filler

week 4: big data Flashcards

(71 cards)