web analytics Flashcards
Web usage mining
discovering interesting patterns in how visitors use a
web site
- Web content mining
– extracting useful information or knowledge from web
page contents
- Web structure mining
– mining hyperlink structure of web
What is server log analysis?
The process of examining data captured by web servers when users interact with a website.
Every user request (e.g., visiting a page) creates an entry in a log file.
What information can be found in server logs?
User information (IP address)
Date and Time
HTTP request method (GET, POST)
Resource requested (page, image, etc.)
HTTP status code (success, error, redirect)
Number of bytes transferred
Referrer (previous webpage)
Browser and platform (user agent)
How can server log analysis be useful?
Understand user behavior (popular pages, broken links)
Identify traffic sources (search engines, social media)
Optimize website performance for different devices
Detect suspicious activities (security threats)
What are Cookies?
Cookies are small pieces of data that websites store on your computer’s hard drive (or similar storage) through your web browser. They act like a little note that the website leaves behind to remember you.
how cookies work ?
A web server sends a cookie to your browser.
Your browser stores the cookie.
Every time you visit the same website again, your browser sends the cookie back to the server.
What Information Do Cookies Contain?
A unique user ID (often random letters and numbers)
The website’s domain name (e.g., yoursite.com)
An expiration date (optional)
Additional customized data (optional)
Session Cookies vs. Persistent Cookies
Session Cookies: These are temporary and disappear when you close your browser. They’re used to track your activity during a single visit.
Persistent Cookies: These stay on your computer until they expire or you delete them. They’re used to remember you as a returning visitor.
First-Party vs. Third-Party Cookies:
First-Party Cookies: These are set by the website you’re directly visiting.
Third-Party Cookies: These are set by a different domain (e.g., an ad network) embedded on the website you’re visiting.
Zombie Cookies:
Zombie Cookies: These are cookies that reappear even after deletion, often a security breach.
Evercookie:
Evercookie: This is a notorious example of a “zombie cookie” program.
What is Page Tagging?
Page tagging is a method for collecting website usage data on the client-side (user’s browser) as opposed to the server-side. This is achieved by embedding a small snippet of JavaScript code into your web pages.
Page Views
Number of times page was viewed
visits
Identifying and counting visits (sessions)