Option C — Web Science Flashcards

1
Q

WWW

A

The system used for accessing web pages and websites

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Web 1.0

A

A library where you can look for information, but cannot change anything

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Web 2.0

A
  • more interactive
  • more multimedia
  • more social
  • creating a web page where the user is able to change the information
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Web 3.0

A

All the data on the web is interconnected like a super database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Software agents

A

Programs that crawl through the Web, searching for relevant information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Ontology

A
  • a file that defines the relationships among a group of terms.
  • concept of linking context to content
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Semantic Web

A

Proposes to help computers “read” and use the web

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Metadata

A

Information included in the code of the web that are invisible to humans, but are readable to the computers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Internet

A

The entire network of connected computers and routers used for sending data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

HTTP

A
  • hyper text transfer protocol
  • client requests a HTTP request message
  • server returns a response message
  • the response contains completion status info. about request
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

HTTPS

A
  • hypertext transfer protocol secure
  • a communications protocol for secure communication over a computer network
  • the result of layering the HTTP on top of the SSL/TLS protocol
  • provides the authentication of the website
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

HTML

A
  • hypertext markup language

- uses tags to determine how the webpage will be displayed in the web browser

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

XML

A
  • extensive markup language
  • markup language that defines a set of rules for encoding document in a format that is both human and machine readable
  • it can create any set of tags
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

XSLT

A
  • extensible style sheet language transformations

- a language that transforms XML documents into other formats

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

JavaScript

A

an object orientated computer programming language used to create interactive effects within web browsers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

CSS

A
  • cascading style sheet
  • a style sheet language used for describing the presentation of a document in a markup language
  • designed primarily to separate document context with document presentation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

CSS Advantages

A
  • improve content accessibility
  • provide more flexibility and control in the specification of the characteristics
  • enables multiple HTTP pages to have the same CSS/style
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

URI

A
  • uniform resources identifier
  • the method in which you identify some points of content (on WWW)
  • the most common form of URI is URL
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

URL

A
  • uniform resource locator

- a specific character string that constitutes a reference to a Internet resource

20
Q

How does a DNS work

A

It turns a user friendly domain name into an IP address that allows the computer identify each other on he network

21
Q

IP

A

A numerical a label assigned to each computer on the network

22
Q

TCP

A

Connection is established and maintained until the two hosts have finished exchanging messages

23
Q

FTP

A

Most common protocol that is used to transfer files between two locations

24
Q

Meta tags

A

Snippets of text that describes a page’s content that’s only visible in the text’s code

25
Protocol
Enables the compatibility through a common language internationally
26
Standard
An agreed way of doing/measuring something
27
Static web page
A web page delivered to the user exactly as it is stored
28
Advantages and Disadvantages of Static Web pages
Adv: - cheap to host - quick and cheap to develop Disadv: - requires web development skills to update - not useful for user - content can get stagnant
29
Dynamic web page
A web page that displays trifle rent content each time you access it
30
Scripts + what are they used for
A set of instructions used mainly in a dynamic webpage to find your query results, placing an ad, display a list of products etc
31
Client side scripts
- interpreted by the browser - used to make the web page change AFTER it has arrived to the browser - relys on user's computer
32
Client side script process
- client requests for a web page to the server - server returns the web page - the page is displayed while the script is running after/during display
33
Server side scripts
- the script is customized to the user/user's occasion - allows a level of privacy - script is interpreted by the server and more scripts = more workload on server - script always works in the same way
34
Server side script process
- client requests a web page to server - the script in the page is interpreted by the server which creates/changes the page content to match the user's customization - the final form of the page is sent and the content CANNOT be changed with server side scripting
35
CGI + describe the process of the CGI
- common gateway interface | - the method of passing data back and forth between the server and application
36
Search engine
Software that allows the user to search for information on the WWW with specific key terms
37
PageRank
- da h page is given a score for a certain search - the most important pages have the most important inlinks - uses the probability of landing on a page after clicking on a specific number of links
38
HITS Algorithm
- link analysis algorithm that ranks web pages - uses hubs and authorities (define them) - a repetitive process that is executed at query time = slow
39
How does the HITS algorithm work?
Refer to notebook
40
Web Crawler + how does it function?
- computer programs that scan the web, 'reading' everything they find that is relevant to the search - those key terms are then indexed - reeder to notebook
41
What is the relationship between data in a meta tag?
The relationship is NOT always transitive. | Define transitive
42
Parallel Web Crawler + Goal and how does it achieve its goal
- A crawler that runs multiple processes simultaneously - maximize download rate while also minimizing overheads and avoid repeated downloads - system requires a policy that assigns new URLs discovered
43
Index + its Purpose
- where all the key terms are found and stored by the web crawler - to optimize speed and performance in finding relevant documents for a search query
44
Black hat techniques
- hidden texts - scraping - keyword stuffing - blog spam - link farms - paid links - doorway pages - parasite hosting - cloaking
45
White hat techniques
- guest blogging - link baiting - quality content - internal linking
46
Gray hat techniques
- 3 way link exchange - buying old/expired domains - article spinning - Google bombing