Chapter 40 Flashcards

1
Q

What are dimensions in dataware house

A

A dimension is a structure that categorizes facts and measures in order to enable users to answer business questions. Commonly used dimensions are people, products, place and time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are famous Web Warehouse Dimensions

A

Date, Time of day, Part Vendor, Transaction, Status

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What happen when definition of static page changes

A

When the definition of a static page changes because the Webmaster alters it, the row in the page dimension either can be overwritten or can be treated as a slowly changing dimension.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is clickstream

A

clickstream is every page event recorded by each of the company’s Web servers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are dimensions of clickstream

A

The clickstream contains a number of new dimensions such as page, session, and referrer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the issues with clickstream data

A

Clickstream data has many issues. like

  • Identifying the Visitor Origin
  • Identifying the Session
  • Identifying the Visitor
  • Proxy Servers.
  • Browser Caches
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How can we identify a session on web as HTTP is stateless

A

There are several ways to do this
Using Time-contiguous Log Entries
Using Transient Cookies
Using HTTP’s secure sockets layer (SSL)
Using session ID Ping-pong.
Using Persistent Cookies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are Transient Cookies

A

session-level cookie into the visitor’s Web browser. Using a transient cookie value as a temporary session ID for both the clickstream and application logging allows a straightforward approach to associating the data from both these sources during post session log processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is super session

A

cer­tain groups of Web sites can agree to store a common ID tag that would let these sites combine their separate notions of a visitor session into a super session.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are proxy servers

A

Proxy servers are used to cache frequently requested content at a location between its intended source and an end visitor.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are 3 problems caused by proxy servers

A
  1. A proxy may deliver outdated content
  2. Proxies may satisfy a content request without properly notifying the originating server that the request has been served by the proxy.
  3. If the visitor has come though a proxy, the Web site will not know who made the page request unless a cookie is present.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are 2 types of proxy servers

A
  1. Forward proxy

2. Reverse proxy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are challenges to web dataware house

A
  1. Teaming
  2. Beware of slow string functions (slow insertion and data manipulation)
  3. Large data so minimum loading
How well did you know this?
1
Not at all
2
3
4
5
Perfectly