Chapter 40 Flashcards
What are dimensions in a data warehouse
A dimension is a structure that categorizes facts and measures in order to enable users to answer business questions. Commonly used dimensions are people, products, place and time.
What are common Web warehouse dimensions
Date, Time of day, Part, Vendor, Transaction, Status
What happens when the definition of a static page changes
When the definition of a static page changes because the Webmaster alters it, the row in the page dimension either can be overwritten or can be treated as a slowly changing dimension.
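The two options can be sketched roughly as follows. This is only an illustration: the page dimension is modeled as a list of dictionaries, and the column names (page_key, page_id, title, is_current) and both helper functions are assumptions made for this example, not names from the chapter.

```python
def overwrite(rows, page_id, new_title):
    """Option 1: overwrite the attribute in place; the old value is lost."""
    for row in rows:
        if row["page_id"] == page_id:
            row["title"] = new_title
    return rows


def add_new_version(rows, page_id, new_title):
    """Option 2: slowly changing dimension; keep the old row and add a new one."""
    next_key = max(row["page_key"] for row in rows) + 1
    for row in rows:
        if row["page_id"] == page_id and row["is_current"]:
            row["is_current"] = False  # expire the previous version
    rows.append({"page_key": next_key, "page_id": page_id,
                 "title": new_title, "is_current": True})
    return rows


# Hypothetical one-row page dimension used for both options.
page_dim = [{"page_key": 1, "page_id": "/home",
             "title": "Welcome", "is_current": True}]

overwritten = overwrite([dict(r) for r in page_dim], "/home", "Home")
versioned = add_new_version([dict(r) for r in page_dim], "/home", "Home")
```

Overwriting leaves a single row with the new title, while the slowly changing approach leaves two rows so that older facts can still point at the old definition of the page.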
What is clickstream
The clickstream is the record of every page event logged by each of the company’s Web servers.
What are dimensions of clickstream
The clickstream contains a number of new dimensions, such as page, session, and referrer.
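As a small sketch of where the page and referrer values might come from, the code below parses one Web server log line. It assumes the server writes the Apache-style "combined" log format; the sample line, the pattern, and the function name parse_page_event are illustrative assumptions. The session is not present in the raw line and has to be derived separately.

```python
import re

# Assumed layout: Apache-style "combined" log format.
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<page>\S+) [^"]*" '
    r'(?P<status>\d{3}) \S+ "(?P<referrer>[^"]*)" "(?P<agent>[^"]*)"'
)


def parse_page_event(line):
    """Return a dict of fields for one page event, or None if the line does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    return match.groupdict()


# Made-up sample line for illustration only.
sample = ('192.0.2.10 - - [10/Oct/2000:13:55:36 -0700] '
          '"GET /products/index.html HTTP/1.0" 200 2326 '
          '"http://www.example.com/start.html" "Mozilla/4.08"')
event = parse_page_event(sample)
```

Here event["page"] would feed the page dimension and event["referrer"] the referrer dimension.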
What are the issues with clickstream data
Clickstream data has many issues, including:
- Identifying the visitor origin
- Identifying the session
- Identifying the visitor
- Proxy servers
- Browser caches
How can we identify a session on the Web, given that HTTP is stateless
There are several ways to do this:
- Using time-contiguous log entries
- Using transient cookies
- Using HTTP’s Secure Sockets Layer (SSL)
- Using session ID ping-pong
- Using persistent cookies
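The first technique, time-contiguous log entries, can be sketched as below. This is a minimal illustration under stated assumptions: each hit is reduced to a (host, timestamp) pair, the host stands in for the visitor, and a gap of more than 30 minutes starts a new session. The 30-minute threshold and the function name assign_sessions are choices made for this example, not values from the chapter.

```python
from datetime import datetime, timedelta

SESSION_TIMEOUT = timedelta(minutes=30)  # assumed inactivity threshold


def assign_sessions(events, timeout=SESSION_TIMEOUT):
    """Group (host, timestamp) hits into per-host sessions by time gaps.

    Returns a list of (host, timestamp, session_number) in time order.
    """
    last_seen = {}    # host -> timestamp of the previous hit
    session_no = {}   # host -> current session number
    result = []
    for host, ts in sorted(events, key=lambda e: e[1]):
        previous = last_seen.get(host)
        if previous is None or ts - previous > timeout:
            session_no[host] = session_no.get(host, 0) + 1  # start a new session
        last_seen[host] = ts
        result.append((host, ts, session_no[host]))
    return result


# Made-up hits for illustration only.
hits = [
    ("192.0.2.10", datetime(2000, 10, 10, 13, 0)),
    ("192.0.2.10", datetime(2000, 10, 10, 13, 10)),
    ("192.0.2.10", datetime(2000, 10, 10, 14, 30)),
    ("192.0.2.77", datetime(2000, 10, 10, 13, 5)),
]
sessions = assign_sessions(hits)
```

The third hit from 192.0.2.10 arrives 80 minutes after the second, so it is placed in a second session. Because this approach relies on the host alone, it becomes unreliable when many visitors share one address, which is one reason the cookie-based techniques are often preferred.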
What are Transient Cookies
A transient cookie is a session-level cookie that the Web server places in the visitor’s Web browser. Using the transient cookie value as a temporary session ID for both the clickstream and application logging gives a straightforward way to associate the data from both sources during post-session log processing.
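A minimal sketch of the difference between a transient and a persistent cookie, using Python's standard http.cookies module. The cookie names session_id and visitor_id, the one-year lifetime, and both helper functions are assumptions made for this example.

```python
import uuid
from http.cookies import SimpleCookie


def make_transient_cookie(name="session_id"):
    """Build a session-level cookie that serves as a temporary session ID."""
    cookie = SimpleCookie()
    cookie[name] = uuid.uuid4().hex
    cookie[name]["path"] = "/"
    # No "expires" or "max-age" attribute is set, so the browser
    # discards the cookie when it is closed.
    return cookie


def make_persistent_cookie(name="visitor_id", days=365):
    """Build a cookie that survives browser restarts for the given number of days."""
    cookie = SimpleCookie()
    cookie[name] = uuid.uuid4().hex
    cookie[name]["path"] = "/"
    cookie[name]["max-age"] = days * 24 * 60 * 60  # lifetime in seconds
    return cookie


transient_header = make_transient_cookie().output(header="Set-Cookie:")
persistent_header = make_persistent_cookie().output(header="Set-Cookie:")
```

The only difference in the resulting headers is the Max-Age attribute: without it the cookie lasts for one browser session, which is what makes its value usable as a session ID.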
What is a super session
Certain groups of Web sites can agree to store a common ID tag that lets them combine their separate notions of a visitor session into a super session.
What are proxy servers
Proxy servers are used to cache frequently requested content at a location between its intended source and an end visitor.
What are 3 problems caused by proxy servers
- A proxy may deliver outdated content
- Proxies may satisfy a content request without properly notifying the originating server that the request has been served by the proxy.
- If the visitor has come through a proxy, the Web site will not know who made the page request unless a cookie is present.
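The third problem can be illustrated with a small sketch. The request records, the field names ip and cookie, and the function count_visitors are made up for this example; the point is only that requests arriving through one proxy share an address, so counting by address undercounts visitors while a cookie value still distinguishes them.

```python
def count_visitors(page_requests):
    """Return (distinct addresses, distinct cookie values) for a list of requests."""
    by_ip = {r["ip"] for r in page_requests}
    by_cookie = {r["cookie"] for r in page_requests if r.get("cookie")}
    return len(by_ip), len(by_cookie)


# Hypothetical requests: two visitors behind the same proxy address.
page_requests = [
    {"ip": "203.0.113.5", "cookie": "a1"},
    {"ip": "203.0.113.5", "cookie": "b2"},
    {"ip": "203.0.113.5", "cookie": "a1"},
]
ip_count, cookie_count = count_visitors(page_requests)
```

With these records the address-based count is 1 and the cookie-based count is 2.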
What are 2 types of proxy servers
- Forward proxy
- Reverse proxy
What are the challenges of a Web data warehouse
- Teaming
- Beware of slow string functions (slow insertion and data manipulation)
- Large data volumes, so loading must be kept to a minimum