6. Tracking and Surveillance Flashcards

Question

Data Brokers

Answer 1

A data broker (also known as an information product company) is an organization that makes money by collecting your personal information, analyzing it, and licensing it out to be used by other companies for things like marketing purposes. https://www.mcafee.com/blogs/tips-tricks/what-is-a-data-broker/

Answer 2

A pixel hack is a tracking technique in which a unique identifier is written into a minuscule image, generated on the fly, in the form of the color values for one or more pixels. Since images are often cached, or stored locally by the browser to avoid having to download the resource again in the future, these tracking values can often be retrieved later.
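The encoding step can be sketched as follows. This is a minimal illustration, assuming the identifier is a plain integer packed 24 bits per pixel into RGB triples; the function names id_to_pixels and pixels_to_id are hypothetical, and producing or caching a real image file is left out.

```python
def id_to_pixels(identifier, n_pixels=2):
    """Pack an integer identifier into RGB triples, 24 bits per pixel (assumed layout)."""
    if not 0 <= identifier < 1 << (24 * n_pixels):
        raise ValueError("identifier does not fit in the requested number of pixels")
    pixels = []
    # Most significant 24-bit chunk first, so decoding can read left to right.
    for i in reversed(range(n_pixels)):
        chunk = (identifier >> (24 * i)) & 0xFFFFFF
        pixels.append(((chunk >> 16) & 0xFF, (chunk >> 8) & 0xFF, chunk & 0xFF))
    return pixels


def pixels_to_id(pixels):
    """Recover the identifier from the color values of a (cached) image."""
    identifier = 0
    for r, g, b in pixels:
        identifier = (identifier << 24) | (r << 16) | (g << 8) | b
    return identifier
```

Because the image is served from the browser cache on later visits, a script that reads the pixel values back would recover the same identifier each time.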

Answer 3

Entity tags (ETags) are HTTP headers that allow a browser to permanently tag a previously viewed resource (a web page or an object contained in the page) with an identifier. They were originally designed to enhance performance when loading previously viewed websites. When a user views a website, browsers generally save a copy of objects viewed on the user’s hard drive so that identical content does not need to be downloaded multiple times. A site can tag content with an HTTP ETag identifier, which changes each time the content is updated on the server. As a result, a browser can request a resource from a website while specifying that the resource should be returned only if it has changed, based on the ETag. If the resource has not changed, the site only needs to confirm this fact so that the browser can use the local copy. To enable tracking, a web server need only change and track ETags during each transaction to reidentify a visitor across multiple transactions. ETags are generally not deleted when a user clears their cookies; rather, ETags may be deleted when a user clears the browser’s cache of previously viewed pages. Thus, ETags enable tracking even if cookies are deleted.
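The reidentification step described above can be sketched as a toy server-side handler. It is a simulation under stated assumptions, not a real web server: the class name ETagTracker is hypothetical, the if_none_match argument stands in for the If-None-Match request header, and the quoting that real ETag values carry is omitted.

```python
import uuid


class ETagTracker:
    """Toy model of a server that uses ETags as visitor identifiers."""

    def __init__(self):
        self.visits = {}  # ETag value -> number of requests seen with it

    def handle(self, if_none_match=None):
        """Return (status_code, etag) for one request."""
        if if_none_match in self.visits:
            # The browser sent back a tag we issued: same visitor as before.
            self.visits[if_none_match] += 1
            return 304, if_none_match
        # Unknown or missing tag: treat as a new visitor and issue a unique tag.
        etag = uuid.uuid4().hex
        self.visits[etag] = 1
        return 200, etag
```

Clearing cookies does not change what the browser sends in this exchange; only clearing the cache removes the stored tag.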

Answer 4

The Adobe Flash plug-in that is used to display videos and other interactive content on a number of sites has its own means of storing information, commonly called either local shared objects (LSOs) or “Flash cookies.” A particular plug-in will generally be configured to run in each web browser on a user’s computer. As a result, a website that utilizes that plug-in can access the same cookies regardless of which web browser is being used. Furthermore, LSOs are stored in a location on the hard drive separate from HTTP cookies, which means that hitting the “clear cookies” button in a web browser may not clear LSOs. While LSOs can be used for purposes such as remembering the volume setting for watching videos on a particular website, they can also be used for storing unique identifiers for users that may not be deleted when a user deletes their cookies.

Answer 5

Features of web browsers designed to enhance users’ experience on the web can also be misused for tracking purposes. By default, web browsers show links on a page that have already been visited in one color, while links that have not yet been visited are displayed in a different color. Although it cannot directly access a user’s browsing history, JavaScript can access the color of any element on a web page, including links. Therefore, in a technique known as browser history stealing or sniffing, an unscrupulous page can include thousands of invisible links to popular sites and then use JavaScript to query the color of those links and learn whether a particular page has been visited by the client browser.

Answer 6

Another technique that misuses features of JavaScript and HTML for tracking purposes is browser fingerprinting, which has become widely used in recent years. So that websites can adjust their pages to match the configuration of a particular user’s computer, there are JavaScript functions that reveal the time zone and screen resolution, as well as fonts and plug-ins that have been installed on a particular computer. A 2010 study found that, even among a sample of potentially privacy-conscious users, 94.2 percent of browser configurations with Flash or Java installed could be uniquely fingerprinted. That same study also captured an array of browser fingerprinting techniques in use. These fingerprinting techniques leverage the unique characteristics of an individual user’s browser (the fonts installed, the particular version of the browser running, the idiosyncrasies of a particular graphics card) as a semi-stable, unique identifier in place of cookies, but for much the same purpose. Measurement studies conducted in 2014 and 2016 observed increasing use of browser fingerprinting in the wild, establishing browser fingerprinting as a major frontier in future tracking efforts.
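How a set of browser attributes becomes an identifier can be sketched by hashing them. This is only an illustration: the attribute names are made up, the attributes are assumed to have already been collected by a script, and SHA-256 over canonical JSON is one possible choice rather than the method used in the studies mentioned.

```python
import hashlib
import json


def fingerprint(attributes):
    """Derive a stable identifier from a dict of browser attributes."""
    # Sort keys so the same configuration always serializes identically.
    canonical = json.dumps(attributes, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Hypothetical attribute set for illustration.
example = {
    "timezone": "UTC-5",
    "screen": "1920x1080",
    "fonts": ["Arial", "Courier New"],
    "plugins": ["Flash"],
}
example_id = fingerprint(example)
```

The identifier stays the same as long as the configuration does, which is why it is only semi-stable: installing a font or updating the browser changes the hash.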

Answer 7

Two common techniques for tracking email recipients are variants of the beacon and URL rewriting techniques used for web tracking. Popular email programs, such as Microsoft Outlook and Gmail, can display emails containing HTML code, the markup language used to format many websites. HTML code enables emails to contain different colors, advanced formatting and images, just like websites. Images can be attached to the email, or they can be downloaded automatically from a remote server. To determine whether a particular recipient has opened an email, the HTML code sent in an email to that user can request that content uniquely tied to that user be downloaded automatically from a remote server when the message is opened by the recipient. As on web pages, links to websites that are included in an email can also be customized to track whether or not a user has clicked on them. An email might contain a link that will eventually bring the email recipient to a specific website, such as www.big-sale.com, if they click on the link. However, rather than containing a direct link to that page, the email might contain a link to www.big-sale.com/174cx3a, where 174cx3a is an identifier sent only to Bob. Therefore, if big-sale.com receives a request for the page 174cx3a, it knows this request originated from the email that it sent to Bob. Alarmingly, a 2018 study found that personally identifiable information (PII), such as the recipient’s email address, is frequently leaked to these third-party email trackers. Unfortunately, the same study also observed that many existing defenses are insufficient for fully stopping email tracking. These defenses include filtering HTML, accessing content through a proxy, or blocking cookies or Referer headers.
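The URL rewriting example with Bob can be sketched as follows. The class name LinkTracker, the token length and the use of random hexadecimal tokens are assumptions made for illustration; the text does not say how identifiers such as 174cx3a are generated.

```python
import secrets


class LinkTracker:
    """Toy model of per-recipient link rewriting."""

    def __init__(self, base_url):
        self.base_url = base_url.rstrip("/")
        self.token_to_recipient = {}

    def link_for(self, recipient):
        """Return a link that is unique to this recipient."""
        token = secrets.token_hex(4)  # 8 random hex characters (assumed format)
        self.token_to_recipient[token] = recipient
        return f"{self.base_url}/{token}"

    def recipient_for(self, requested_url):
        """Map an incoming request back to the recipient, if the token is known."""
        token = requested_url.rsplit("/", 1)[-1]
        return self.token_to_recipient.get(token)
```

A beacon works the same way, except that the unique URL points to an image that the email client fetches automatically when the message is opened.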

Answer 8

Cross-device tracking is the process of tracking a user across multiple devices, such as computers, smartphones, tablets, and smart TVs. This can be useful to users when it allows them to suspend a video on one device and resume watching it on another or maintain state between other types of sessions across their devices. However, cross-device tracking can also be used to build rich user profiles across multiple devices, which companies may use for advertising or other purposes. Companies use both deterministic and probabilistic approaches to facilitate cross-device tracking. When users log in to a service on each of their devices, companies can use deterministic approaches to know that it is most likely the same user on each device. However, when users do not log in, companies can use probabilistic approaches, such as matching IP addresses, to infer that the same user is likely using two devices simultaneously. Cookies, location and behavioral data can also be used for probabilistic cross-device tracking. Companies build device graphs based on the inferences they have made about the devices used by a particular user. Users are largely unaware that this is occurring.
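A device graph built from both kinds of evidence can be sketched as below. The event format, the function name link_devices and the rule that a shared login overrides a shared IP address are assumptions for illustration; real systems weigh many more signals.

```python
from collections import defaultdict
from itertools import combinations


def link_devices(events):
    """Build device-graph edges from (device_id, account_or_None, ip_address) events.

    Returns a dict mapping frozenset({device_a, device_b}) to
    'deterministic' (same login) or 'probabilistic' (same IP address only).
    """
    by_account = defaultdict(set)
    by_ip = defaultdict(set)
    for device, account, ip in events:
        if account is not None:
            by_account[account].add(device)
        by_ip[ip].add(device)

    edges = {}
    # Weaker evidence first: devices seen behind the same IP address.
    for devices in by_ip.values():
        for a, b in combinations(sorted(devices), 2):
            edges[frozenset((a, b))] = "probabilistic"
    # Stronger evidence overrides: devices logged in to the same account.
    for devices in by_account.values():
        for a, b in combinations(sorted(devices), 2):
            edges[frozenset((a, b))] = "deterministic"
    return edges
```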

Answer 9

Platform for Privacy Preferences Project (P3P) tokens. P3P is a machine-readable language with which websites can express their privacy practices, such as the information that they collect and how this information is used.

Answer 10

A number of companies engaged in tracking also offer a system of opt-out cookies. Rather than being used for tracking, opt-out cookies are HTTP cookies indicating that a consumer has chosen to opt out of receiving targeted advertising. Although users who have opted out will not receive targeted ads from a particular company, some companies will still track those users’ online activities. Opt-out cookies are also problematic from a usability perspective since users who delete their cookies, as many privacy-conscious users might, also delete their opt-out cookies. Furthermore, setting opt-out cookies for each of the hundreds of tracking companies a user might encounter would take a long time. Centralized websites organized by industry groups offer a single place at which a user can opt out from many companies at once. However, research has identified major usability problems with these centralized websites.

Answer 11

A number of companies offer tools specifically designed to stop web tracking conducted by advertising networks, social networks and other companies interested in collecting what websites a user visits. For example, the partially open-source tool Disconnect is provided by a company of the same name. The company Cliqz offers Ghostery, which was formerly owned by Evidon. Similar tools include the open-source Privacy Badger from the nonprofit Electronic Frontier Foundation (EFF). These tools generally work by blocking, to varying extents, the mechanisms used for tracking. While some tools completely prevent the user’s browser from communicating with those domains or trying to download those resources, others allow the request to go through, yet prevent the request from including cookies. Additional subtle modifications to requests, such as removing the HTTP Referer field, can also protect the user’s privacy in limited ways. Some general-purpose browser add-ons can limit web tracking to an extent. For instance, the popular Firefox and Chrome extension Adblock Plus, designed to block large fractions of the advertising on the web, blocks requests to the domains of a number of advertisers and thereby limits the collection of tracking data by those particular advertisers. Similarly, NoScript, a Firefox add-on designed to prevent websites from executing JavaScript code and plug-ins like Flash, can prevent tracking that occurs using those plug-ins. Notably, HTTP cookies are sometimes created using JavaScript, and blocking the Flash plug-in can prevent LSOs from being set.

Answer 12

The term functional privacy refers to users’ willingness to aim for as much privacy as they can get without breaking the functionality of what they hope to accomplish on the web.

Answer 13

Search engines, such as DuckDuckGo, promise to neither collect nor share a user’s personal information. By default, DuckDuckGo does not use HTTP cookies except to save preferences about the page layout a user has chosen, nor does it allow the HTTP Referer field to contain information about the search query. However, users must trust DuckDuckGo and similar sites to fulfill their privacy promises. Users who wish to hide their search history can also download a tool to assist them, although few tools exist for this purpose. TrackMeNot, an add-on for Firefox and Chrome, protects a user’s privacy by issuing decoy queries to major search engines. As such, it operates by achieving security through obscurity, creating ambiguity about whether a particular query was issued by a user, or whether it was issued automatically by the program. The plug-in’s behavior is meant to mimic that of a real user. For example, it sometimes performs a large number of queries in a short amount of time, and it selectively chooses whether or not to click through to a link. A proxy or an anonymizing network such as Tor can also strip some or all of the identifying information from web traffic, making a user’s searches more difficult or impossible to track. However, it is possible for private information to leak even when using techniques such as the TrackMeNot plug-in and anonymizing services.

Answer 14

A number of modern email clients block beacons, images and other content loaded from external sites since this external content could be used for tracking. This behavior disables one of the most widespread techniques for determining whether or not an email has been read. Unfortunately, not all email clients block outgoing requests by default, nor do they all implement related privacy-protective measures. Since tracking can still be accomplished through URL rewriting, it is important that a privacy-conscious user also not follow links contained in emails. If a user does not follow links contained in emails, tracking using URL rewriting cannot succeed. Furthermore, due to the threat of phishing attacks, it is generally considered good practice not to follow links in emails whenever the user must enter information at the destination. Even if a link in an email does not seem to contain any type of unique identifier, users who follow the link or otherwise access that information on a site are subject to web-tracking techniques.

Answer 15

Wireless Triangulation is a method that measures the distance and angle from two or more known points as a cross reference to pinpoint a location. Wi-Fi and cellular signals can be used to allow a device that is enabled for Wi-Fi or cellular communications to determine its location. Cellular phones communicate with cellular towers that receive their signal and connect phones to a global network. The time it takes messages from a particular cell phone tower to arrive to a phone, the strength of the signal from that tower and, most simply, which towers a phone can communicate with all reveal information about the phone’s location. After determining the phone’s position relative to a handful of towers whose locations are known by the cellular provider, the position of the phone can then be determined geometrically through triangulation. In addition to signals from cell towers, the Wi-Fi signals a phone receives can help determine its location. Wi-Fi signals have a shorter range, allowing for more fine-grained location information. Cell towers provide a more permanent location marker but less granular location data.
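The geometric step can be illustrated with a small worked example. The sketch below uses distances only (strictly speaking trilateration, a simplification of the distance-and-angle method described above) and assumes a flat 2D plane, exact distance estimates and three towers that are not on one line; the function name locate is hypothetical.

```python
import math


def locate(towers, distances):
    """Estimate (x, y) from three tower positions and the distance to each.

    Subtracting the circle equation of the first tower from the other two
    gives two linear equations in x and y, solved here with Cramer's rule.
    """
    (x1, y1), (x2, y2), (x3, y3) = towers
    r1, r2, r3 = distances
    a, b = 2 * (x2 - x1), 2 * (y2 - y1)
    c = r1**2 - r2**2 - x1**2 + x2**2 - y1**2 + y2**2
    d, e = 2 * (x3 - x1), 2 * (y3 - y1)
    f = r1**2 - r3**2 - x1**2 + x3**2 - y1**2 + y3**2
    det = a * e - b * d
    if det == 0:
        raise ValueError("towers are collinear; position is ambiguous")
    return (c * e - b * f) / det, (a * f - c * d) / det


# A phone at (3, 4) measured from towers at (0, 0), (10, 0) and (0, 10).
towers = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
distances = [5.0, math.sqrt(65.0), math.sqrt(45.0)]
x, y = locate(towers, distances)
```

In practice the distances come from noisy signal timing or strength measurements, so real systems fit a best estimate rather than solving the equations exactly.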

Answer 16

Global Positioning System (GPS). Many consumer devices, including mobile phones, are equipped with GPS capabilities for location tracking. Cameras and similar devices can also include GPS capabilities for tagging the location of photographs taken, and automobile infotainment systems can include GPS capabilities to pull regional content, such as weather and news-related information, into the vehicle’s navigation system. GPS calculates a device’s location using signals received from at least four of a set of dozens of satellites orbiting the earth and run by the U.S. government. Based on the differences in the time it takes messages from these different satellites to arrive to a receiver, a GPS receiver can determine its position relative to the satellites. Since these satellites’ positions are known at any given time, the GPS receiver can determine its own position geometrically. Because devices receive and do not transmit any signals in the GPS process, devices do not automatically reveal their location by using GPS. However, devices with GPS can also include transmitters that can be used to reveal the device’s location to other parties and services. For example, a smartphone that determines its own location by receiving signals from GPS satellites might subsequently, and automatically, share that information with an app or the phone provider.

Answer 17

Global Navigation Satellite System (GNSS) refers to a constellation of satellites providing signals from space that transmit positioning and timing data to GNSS receivers. The receivers then use this data to determine location. By definition, GNSS provides global coverage.

Answer 18

Radio Frequency Identification (RFID) refers to a wireless system composed of two components: tags and readers. The reader is a device that has one or more antennas that emit radio waves and receive signals back from the RFID tag. RFID chips are tiny microchips that can be as small as a fraction of a millimeter. Each microchip is identified by a unique serial number and contains an antenna with which it transmits information, such as its serial number, to an RFID reader. RFID chips can be placed on products or cards or implanted in animals (such as household pets) for tracking purposes. They are commonly used in supply chain management to allow companies to track inventory. Passive RFID chips, which do not contain their own power source, are the most common. When power is applied by an RFID reader, these chips transmit a signal encoding their identifier. Active RFID chips contain their own power source, which allows them to transmit farther than passive chips. Depending on the type of chip and its power, particularly whether it contains its own power source, the signal can be picked up at varying distances. RFID chips transmitting at low frequencies have a range of about half a meter; those that transmit at ultrahigh frequencies can reach readers located dozens of meters away. The unique serial number associated with each RFID tag allows for location tracking. Tagged items are tracked as the readers pick up the tag IDs at different locations. If additional information is stored on the tag, the reader is also able to pick up that information and associate it with the tag’s location. Tracking through RFID chips can be physically blocked, or in some cases, the RFID chip can be physically removed. Because RFID chips rely on a radio wave signal for tracking, a protective sleeve can be placed over an item that contains an RFID chip to prevent the chip from being read until the user desires the chip to be readable. This is useful for items like passports, which include chips containing information that the user does not want to be accessible until a certain time. RFID chips can also be removed from items like clothing or other products to prevent tracking, although such techniques prevent the use of the information on the RFID chip at a later time.
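How reader sightings turn into a movement trail can be sketched as follows. The read format (timestamp, reader location, tag ID) and the function name build_trails are assumptions for illustration, not part of any RFID standard.

```python
from collections import defaultdict


def build_trails(reads):
    """Group reader sightings by tag ID, ordered by time.

    reads: iterable of (timestamp, reader_location, tag_id) tuples.
    Returns a dict mapping tag_id to a list of (timestamp, reader_location).
    """
    trails = defaultdict(list)
    for timestamp, location, tag_id in sorted(reads):
        trails[tag_id].append((timestamp, location))
    return dict(trails)


# Hypothetical sightings of two tagged items in a supply chain.
reads = [
    (30, "store shelf", "TAG-A"),
    (10, "warehouse dock", "TAG-A"),
    (20, "delivery truck", "TAG-A"),
    (15, "warehouse dock", "TAG-B"),
]
trails = build_trails(reads)
```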

Answer 19

The location of a mobile phone and the individual to whom the cell phone belongs can be tracked using receivers installed within a building complex. The FCC also requires that phone companies be able to track phones when an emergency (911) call is placed.

Answer 20

Location information can also be automatically stored in the metadata of content, like photos. Metadata is information that is automatically or manually added to content and that can be later accessed and used during processing or by applications. For photos taken with GPS-enabled devices, such as cell phones or GPS-capable cameras, location is often automatically stored in the camera metadata, sometimes without the user’s awareness. When the photos are loaded into photo-browsing or -editing applications, this information is then accessible to the user or application, potentially raising privacy concerns.

Answer 21

Near-field communication (NFC) is a set of communication protocols that enables communication between two electronic devices over a distance of 4 cm (1.57 in) or less. NFC offers a low-speed connection through a simple setup that can be used to bootstrap more capable wireless connections. Like other "proximity card" technologies, NFC is based on inductive coupling between two antennas present on NFC-enabled devices (for example, a smartphone and a printer) communicating in one or both directions, using a frequency of 13.56 MHz in the globally available unlicensed radio frequency ISM band using the ISO/IEC 18000-3 air interface standard at data rates ranging from 106 to 848 kbit/s. NFC-equipped mobile devices can also support location-based advertising. NFC allows devices in close proximity, or that are touching, to transmit information via radio waves. This allows consumers to access content when at a specific location.

Answer 22

A geographic information system (GIS), such as a computer database or imaging tool, is a technology used to view and manipulate stored geographic information. Such geographic content could relate to any quantities associated with a particular location, including maps, population or other census statistics, or data about a specific resource at a location. Uses for GIS are wide ranging. They can include logistics systems used for businesses, such as airlines, that need to track passengers, and utility companies, which need to direct crews, as well as agricultural applications for planting decisions.

Answer 23

Location tracking should be included in a system only if it provides a direct benefit, and, wherever possible, should be an opt-in rather than opt-out. Once data is collected, users should be able to easily see what has been stored about them and delete or update any past location data. Collected location data should be considered privacy-sensitive. Users should be informed, through a privacy policy or other means, of how their location information will be used. If it is going to be used in an unexpected manner, it is effective practice to ensure that users know about this ahead of time. Additionally, before making location data more publicly available, it is effective practice to carefully consider how it might be reused or combined with other datasets. When using location-based applications to track others, such as in a workplace setting, it is effective practice to limit such tracking to instances where there is a clear need and to inform employees about the tracking whenever possible. Additionally, tracking should take place only while the employee is working. If tracking is done through a mobile phone that an employee also carries during nonwork hours, tracking should be limited to the workday. Once tracking data is collected, it should be used only for necessary purposes and access should be minimized.

Answer 24

Malware can be uploaded onto a computer, take control of a user’s microphone for audio surveillance and simultaneously hide its own existence. These types of malware, often known as Remote Access Trojans (RATs), are controlled by a complex web of operators.

Answer 25

Many smart televisions employ automated content recognition (ACR) to determine what the user is watching. Consumers may not be aware that their detailed viewing habits are being transmitted outside their homes, potentially to entities ranging from the device manufacturer to advertisers.

Answer 26

Voice over IP (VoIP). Researchers have demonstrated that simply having access to the encrypted version of a message may be sufficient for using linguistic techniques to reconstruct the call if certain types of encryption are used. A number of intentional mechanisms can also be used to surveil VoIP communications. In the United States, the FCC has interpreted the 1994 Communications Assistance for Law Enforcement Act (CALEA), which requires companies to be able to intercept communications in response to authorized legal requests, to include VoIP services. Although Skype had made “affirmative promises to users about their inability to perform wiretaps,” Skype was among the services revealed to be accessible as part of the PRISM program in the United States. Even if the body of the communication is encrypted by the sender, the metadata about a communication can often leak a large amount of information about the communication. For example, a 2016 study empirically reidentified telephone metadata and used it to infer locations and relationships between individuals.

Answer 27

When performing audio or video surveillance, especially within a work environment, it is effective practice to ensure that the minimal amount of surveillance is being performed for the necessary objective and that it is conducted in a legal manner. Video and audio surveillance can be very privacy invasive and should not be performed unless a necessary objective (e.g., security, efficiency) outweighs the privacy drawbacks. Wherever possible, those under surveillance should be informed about the system to lower the impact of the privacy violation. Additionally, a group should check local privacy laws before putting surveillance in place. In the United States, a first step for employers is making sure that the surveillance is not taking place in an environment in which employees have an expectation of privacy (e.g., inside a bathroom). Once audio and video surveillance data has been gathered, it is effective practice to take proper measures to maintain data security and limit exposure of the content. Whenever possible, use automated systems to analyze the data or employ a separation of duties, where the analyst examines the audio or video and only reports the finding to others; this avoids exposing the raw audio and video to unauthorized repurposing or snooping. In these situations, it is important to securely retain the raw audio or video in the event the finding is ever challenged. To ensure that the data is not misused, one should track access to the data and limit it to necessary personnel. A clear policy should be put in place for who has access to the data and under what circumstances, and for how long the data will be retained. Data should be purged when no longer needed for the intended purpose.

Answer 28

Ubiquitous computing (also termed ubicomp) refers to the transition of computing from purpose-built, independent computing devices to computing that occurs at all times and in all places. As computing occurs ubiquitously, absent the visible boundaries present when interacting with a device like a traditional laptop computer, the types and amount of data that can be collected at scale raise important concerns about privacy, tracking and surveillance. Early research on ubiquitous computing highlighted important requirements for end-user acceptance of this paradigm from a privacy perspective: The system should have obvious value, the retention of data should be limited and users should be provided both feedback and control. While many of the domains discussed in the rest of this section arguably also fall under the umbrella of ubiquitous computing, in this subsection, we focus on two particular types of ubiquitous computing: smart cities and augmented reality.

Answer 29

Accelerometers are one type of sensor frequently found on mobile devices, yet infrequently encountered on other computing devices. Accelerometers enable phones to know when to rotate the screen, measure the speed of the device when it is in motion as the user walks or bikes, and permit physical interaction with smartphone games. However, the data from an accelerometer can also be used for surveillance. For example, researchers have shown how an accelerometer alone, even without persistent location awareness, can determine the distance traveled and therefore leak information about the user’s location relative to a previous position. It can also leak information about the passwords a user types into their phone.

Answer 30

Researchers systematically analyzed vehicles’ centralized control system, the Electronic Control Unit (ECU). Those researchers showed that controlling the ECU could lead to attacks that disable brakes, control acceleration and perform other nefarious acts. Over the last few years, car hacking has become even more sophisticated. Proof-of-concept exploits have taken over cars remotely, showing that attacks against cars are a threat on a large scale.

Answer 31

This use of a remotely activated smartphone microphone is called a “roving bug.”