6. Pointer Analysis Flashcards

Question 1

Q

Pointer Analysis

Answer

A

A dataflow analysis that tracks the flow of non-primitive data: pointers, objects, and references.

Question 2

Q

pointer aliasing

Answer

A

Expressions built using pointers, such as x.radius, allow the same memory address to be referred to in different ways. Two such expressions that refer to the same address are aliases.

Question 3

Q

pointer aliasing circle example

Answer

A

Circle x = new Circle()
Circle z = ?
x.radius = 1
z.radius = 2
y = x.radius
assert(y==1)

Because we do not know whether z refers to the same circle as x, we cannot tell whether x.radius is still 1 after z.radius = 2, so the analysis cannot prove the assertion.

Question 4

Q

May-Alias Analysis

Answer

A

An analysis dedicated to answering questions of the form “may x alias z?” is called a may-alias analysis. A “no” answer from a sound analysis proves that x and z never refer to the same object.

Question 5

Q

Is pointer analysis must-alias or may-alias analysis?

Answer

A

may-alias

Question 6

Q

May-Alias circle example

Answer

A

Circle x = new Circle()
Circle z = new Circle()
x.radius = 1
z.radius = 2
y = x.radius
assert(y==1)

Here x and z are distinct objects (x != z), so after z.radius = 2 executes we can say with confidence that x.radius is still 1 and the assertion holds.

Question 7

Q

Must-Alias circle example

Answer

A

Circle x = new Circle()
Circle z = x
x.radius = 1
z.radius = 2
y = x.radius
assert(y==1)

After the assignment Circle z = x, we know that x == z. x.radius = 1 sets the radius to 1, and z.radius = 2 then sets x.radius to 2, because x and z name the same object.
x and z MUST alias in this case, so the assertion fails.
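
The two circle examples (distinct objects versus z = x) can be checked by running them. This is a minimal Java sketch; the class name AliasDemo, the nested Circle class, and both method names are assumptions made for illustration.

```java
class AliasDemo {
    // Hypothetical stand-in for the Circle class used on the cards.
    static class Circle { int radius; }

    // z is a fresh object, so the write through z cannot affect x.
    static int distinctObjects() {
        Circle x = new Circle();
        Circle z = new Circle();
        x.radius = 1;
        z.radius = 2;
        return x.radius;
    }

    // z copies the reference held by x, so both names denote one object.
    static int sameObject() {
        Circle x = new Circle();
        Circle z = x;
        x.radius = 1;
        z.radius = 2;
        return x.radius;
    }

    public static void main(String[] args) {
        System.out.println(distinctObjects()); // prints 1
        System.out.println(sameObject());      // prints 2
    }
}
```

Only the first method would satisfy assert(y==1); the second shows the write through z being visible through x.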

Question 8

Q

May alias vs must alias

Answer

A

Must-alias analysis is more advanced, but it is less useful in practice.
May-alias analysis serves more practical dataflow analyses than must-alias analysis does.

Question 9

Q

Why is pointer analysis hard?

Answer

A

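There are many ways to refer to the same data, and the analysis has to keep track of all of them. In a doubly linked list, for example, h.data can also be reached as:
h.next.prev.data, h.next.next.prev.prev.data, etc.
Cycles in the data structure make the number of such access paths unbounded.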

Question 10

Q

Pointer analysis problem is undecidable. T/F

Answer

A

True. We must sacrifice some combination of soundness, completeness, and termination.

Question 11

Q

what does pointer analysis sacrifice to become decidable?

Answer

A

completeness. This means that we can expect false positives, but no false negatives.

Question 12

Q

False positive

Answer

A

The analysis returns yes when the correct answer is no.

Question 13

Q

How can a false positive manifest in the circle example?

Answer

A

Circle x = new Circle()
Circle z = new Circle()
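x.radius = 1
z.radius = 2
y = x.radius
assert(y==1)

Suppose the query “may x alias z?” returns YES even though x and z are distinct objects.
Then after z = new Circle() the analysis cannot establish x != z; it only knows that either x == z or x != z. Carrying this through the remaining statements, it concludes y == 1 or y == 2, so it cannot prove the assertion even though the assertion always holds.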

Question 14

Q

Approximate algorithms for pointer analysis have varying levels of precision. These algorithms differ in two key aspects:

Answer

A

1. How to abstract the heap (i.e., dynamically allocated data)

2. How to abstract control flow

Question 15

Q

Abstracting the heap (Elevator example): abstract objects based on the site where they are allocated

Answer

A

All objects allocated at the same site in the program are combined into a single node in the graph. For example, all objects allocated by one statement inside a for loop are represented by a single node.
The result is a points-to graph: http://imgur.com/a/nXlBb

Question 16

Q

Abstracting the control flow (Elevator example)

Answer

A

The analysis keeps only a single points-to graph for the entire program, so the control flow is abstracted away.

Recall the points-to graph from the heap abstraction. Instead of changing the graph representation, we change the code itself into the form from which that single graph is built. http://imgur.com/a/a18LY

Question 17

Q

flow insensitivity

Answer

A

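The program is reduced to an unordered set of pointer statements (see http://imgur.com/a/a18LY):
All control-flow constructs, such as for loops, are removed.
The ; separators are removed as well.
All statements that do not affect pointers are removed.
Array indices are replaced with nondeterministic *s.
The remaining statements have no order, even though they are still written in rough program order.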

Question 18

Q

Chaotic Iteration Algorithm

Answer

A

Start with an empty points-to graph.
Repeat: for each statement s in the set, apply the rule corresponding to s to the graph.
Stop when the graph no longer changes.
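
The loop above can be sketched in Java. This is only an illustration under stated assumptions: it handles just the two simplest statement kinds (allocation and copy), and the class name ChaoticIteration, the {kind, lhs, rhs} string-array encoding, and the site label "A" are all hypothetical choices, not part of the lesson.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

class ChaoticIteration {
    // Each statement is {kind, lhs, rhs}.
    //   {"new",  v,  site}  encodes  v = new   (site names the allocation site)
    //   {"copy", v1, v2}    encodes  v1 = v2
    static Map<String, Set<String>> solve(List<String[]> statements) {
        // Start with an empty points-to graph: variable -> abstract objects.
        Map<String, Set<String>> pointsTo = new HashMap<>();
        boolean changed = true;
        while (changed) {                      // repeat until nothing changes
            changed = false;
            for (String[] s : statements) {    // statement order is irrelevant
                Set<String> target =
                    pointsTo.computeIfAbsent(s[1], k -> new HashSet<>());
                if (s[0].equals("new")) {
                    changed |= target.add(s[2]);             // weak update
                } else {
                    Set<String> source = pointsTo.getOrDefault(s[2], Set.of());
                    changed |= target.addAll(new HashSet<>(source));
                }
            }
        }
        return pointsTo;
    }

    public static void main(String[] args) {
        // Deliberately listed out of program order.
        List<String[]> program = List.of(
            new String[] {"copy", "y", "x"},
            new String[] {"new", "x", "A"},
            new String[] {"copy", "z", "y"});
        System.out.println(solve(program).get("z")); // prints [A]
    }
}
```

Because arrows are only ever added and the sets of variables and sites are finite, the loop must terminate.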

Question 19

Q

Statement types

Answer

A

v = new      object allocation
v = v2       object copy
v2 = v.f     field read
v.f = v2     field write
v2 = v[*]    array read (treated as a read of field *)
v[*] = v2    array write (treated as a write of field *)

Question 20

Q

Convert v.events = new Object[] to the statement grammar

Answer

A

tmp = new Object[]
v.events = tmp

Question 21

Q

convert

v.events[*] = e

Answer

A

tmp = v.events
tmp[*] = e

Question 22

Q

convert

v1.f = v2.f

Answer

A

tmp = v2.f
v1.f = tmp

Question 23

Q

convert

v1.f.g = v2.h

Answer

A

tmp1 = v1.f
tmp2 = v2.h
tmp1.g = tmp2
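
The conversions on these cards follow one pattern: introduce temporaries until every statement has at most one field access. A rough Java sketch of that pattern, assuming both sides are plain dotted paths (no new, no [*]); the class Lowering and its methods are hypothetical names:

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class Lowering {
    private int counter = 0;                         // numbers the temporaries
    private final List<String> out = new ArrayList<>();

    // Shorten a dotted path to at most `keep` components by reading its
    // leading field accesses into fresh temporaries.
    private List<String> shorten(String path, int keep) {
        List<String> parts = new ArrayList<>(Arrays.asList(path.split("\\.")));
        while (parts.size() > keep) {
            String tmp = "tmp" + (++counter);
            out.add(tmp + " = " + parts.get(0) + "." + parts.get(1));
            parts.remove(0);       // drop the base variable
            parts.set(0, tmp);     // the temporary replaces base.field
        }
        return parts;
    }

    // Lower `lhs = rhs` into copies, field reads, and field writes.
    static List<String> lower(String lhs, String rhs) {
        Lowering l = new Lowering();
        List<String> left = l.shorten(lhs, 2);
        // A field write needs a plain variable on the right-hand side;
        // a plain variable on the left may take one field read directly.
        int keepRight = (left.size() == 2) ? 1 : 2;
        List<String> right = l.shorten(rhs, keepRight);
        l.out.add(String.join(".", left) + " = " + String.join(".", right));
        return l.out;
    }

    public static void main(String[] args) {
        for (String s : lower("v1.f.g", "v2.h")) {
            System.out.println(s);
        }
    }
}
```

For v1.f.g = v2.h this produces the same three statements as the card: tmp1 = v1.f, tmp2 = v2.h, tmp1.g = tmp2.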

Question 24

Q

Rule for object allocation sites: Weak update

Answer

A

If there is already an arrow from v to another allocation-site node, keep it and simply add the new arrow. Points-to information is accumulated, never replaced.

Question 25

Q

Rule for object allocation sites: Strong update

Answer

A

Replace the existing points-to information for v with the new arrow.

Question 26

Q

Rule for Object Copy

Answer

A

v1 = v2
Create a variable node for v1 if it does not exist yet.
Add a blue arrow from v1 to every node pointed to by v2.

Question 27

Q

Rule for field writes

Answer

A

v1.f = v2
If v1 points to A and v2 points to B,
add a red arrow from A to B, labeled with the field name f, or with * if it is an array index.

Question 28

Q

rule for field reads

Answer

A

v1 = v2.f
If v2 points to B and B has a red arrow labeled f (or *) to C,
add a blue arrow from v1 to C.
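
The field write and field read rules can be sketched as operations on two maps, one for blue arrows and one for red arrows. This is an illustrative sketch only; the class FieldRules, its method names, the "object.field" key encoding, and the site labels A and B are assumptions.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

class FieldRules {
    // Blue arrows: variable -> abstract objects it may point to.
    final Map<String, Set<String>> vars = new HashMap<>();
    // Red arrows: "object.field" -> abstract objects the field may point to.
    final Map<String, Set<String>> fields = new HashMap<>();

    private static Set<String> get(Map<String, Set<String>> m, String key) {
        return m.computeIfAbsent(key, k -> new HashSet<>());
    }

    // v = new, at the allocation site named `site` (weak update).
    boolean alloc(String v, String site) {
        return get(vars, v).add(site);
    }

    // v1.f = v2: for every A that v1 points to and every B that v2 points
    // to, add a red arrow A --f--> B.
    boolean write(String v1, String f, String v2) {
        boolean changed = false;
        for (String a : new HashSet<>(get(vars, v1))) {
            changed |= get(fields, a + "." + f).addAll(new HashSet<>(get(vars, v2)));
        }
        return changed;
    }

    // v1 = v2.f: for every B that v2 points to and every C reached from B
    // by a red arrow labeled f, add a blue arrow v1 --> C.
    boolean read(String v1, String v2, String f) {
        boolean changed = false;
        for (String b : new HashSet<>(get(vars, v2))) {
            changed |= get(vars, v1).addAll(new HashSet<>(get(fields, b + "." + f)));
        }
        return changed;
    }

    public static void main(String[] args) {
        FieldRules g = new FieldRules();
        g.alloc("x", "A");      // x = new   (site A)
        g.alloc("z", "B");      // z = new   (site B)
        g.write("x", "f", "z"); // x.f = z
        g.read("y", "x", "f");  // y = x.f
        System.out.println(g.vars.get("y")); // prints [B]
    }
}
```

Each method returns whether the graph changed, which is the signal a fixed-point loop would need to decide when to stop.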

Question 29

Q

Classifying Pointer Analysis Algorithms

Answer

A

Is it flow sensitive?
Is it context sensitive?
What heap abstraction scheme does it use?
How are aggregate data types modeled?

Question 30

Q

flow sensitivity

Answer

A

How control flow within a procedure is modeled.

An analysis is either flow insensitive or flow sensitive.

Question 31

Q

flow insensitive

Answer

A

Uses weak updates.
Only generates new facts; never kills old facts.
Suffices for may-alias analysis.

Question 32

Q

flow sensitive

Answer

A

Uses strong updates: both kills and generates facts.

Needed for must-alias analysis.

Question 33

Q

It is impractical for a flow-insensitive must-alias analysis to achieve a low false positive rate.

Question 34

Q

Context sensitivity

Answer

A

How control flow across procedures is modeled.
An analysis is either context insensitive or context sensitive.

Question 35

Q

context insensitive

Answer

A

Analyzes each procedure only once, regardless of how many times it is called in the program.
Imprecise but efficient.

Question 36

Q

context sensitive

Answer

A

analyze each procedure possibly multiple times, once per abstract calling context

Question 37

Q

heap abstraction scheme

Answer

A

A scheme to partition the unbounded set of concrete objects into finitely many abstract objects (the oval nodes in the points-to graph).
Ensures that pointer analysis terminates.
Many sound schemes exist.

Question 38

Q

Heap abstraction scheme: too few abstract objects ==

Answer

A

efficient but imprecise

Question 39

Q

Heap abstraction scheme: too many abstract objects ==

Answer

A

expensive but precise

Question 40

Q

Heap abstraction scheme #1: Allocation-Site Based

Answer

A

one abstract object per allocation site
Allocation site identified by: “new” keyword in Java/C++, malloc() call in C
Finitely many allocation sites in program, so guaranteed finitely many abstract objects
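
A small Java sketch of why the number of abstract objects stays bounded: however many times a loop runs, every object it creates comes from the same site. The class AllocationSiteDemo, the allocate helper, and the labels site1 and site2 are hypothetical instrumentation, not something a real analysis would execute.

```java
import java.util.LinkedHashSet;
import java.util.Set;

class AllocationSiteDemo {
    int concreteObjects = 0;                                   // grows with n
    final Set<String> abstractObjects = new LinkedHashSet<>(); // one per site

    // Hypothetical helper: allocate an object and record which site did it.
    Object allocate(String site) {
        concreteObjects++;
        abstractObjects.add(site);
        return new Object();
    }

    void run(int n) {
        Object container = allocate("site1"); // executed once
        Object last = container;
        for (int i = 0; i < n; i++) {
            last = allocate("site2");         // same site on every iteration
        }
    }

    public static void main(String[] args) {
        AllocationSiteDemo demo = new AllocationSiteDemo();
        demo.run(1000);
        System.out.println(demo.concreteObjects);        // prints 1001
        System.out.println(demo.abstractObjects.size()); // prints 2
    }
}
```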

Question 41

Q

Allocation site based downsides

Answer

A

It can be too costly in these situations:
large programs
clients needing quick turnaround time
clients for which the granularity of sites is overly fine

Question 42

Q

Heap abstraction scheme #2: Type Based

Answer

A

one abstract object per type

finitely many types in program, so finitely many abstract objects

Question 43

Q

Heap abstraction scheme #3: Heap Insensitive

Answer

A

Single abstract object representing the entire heap.

highly imprecise, but sound

Question 44

Q

When is heap insensitive scheme useful

Answer

A

popular for languages with primarily stack-directed pointers (C)

Question 45

Q

Do quiz #31 in lesson 6

Question 46

Q

Modeling Aggregate Data Types: Arrays

Answer

A

Common choice: single [*] field to represent all array elements

Question 47

Q

Modeling Aggregate Data Types: Records

Answer

A

structs or objects
Three common choices:
1. field insensitive - merge all fields of each record object
2. field based - merge each field of all record objects
3. field sensitive - keep each field of each record object separate
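
One way to picture the three choices is as three ways of naming the abstract location for field f of abstract object o. A minimal Java sketch, where the class FieldModel, the Mode enum, and the string encoding are all assumptions for illustration:

```java
class FieldModel {
    enum Mode { FIELD_INSENSITIVE, FIELD_BASED, FIELD_SENSITIVE }

    // Name of the abstract location standing for field `field` of the
    // abstract object `obj` under the given modeling choice.
    static String location(Mode mode, String obj, String field) {
        switch (mode) {
            case FIELD_INSENSITIVE:
                return obj + ".*";         // all fields of one object merged
            case FIELD_BASED:
                return "*." + field;       // one field merged across objects
            default:
                return obj + "." + field;  // every object/field pair separate
        }
    }

    public static void main(String[] args) {
        for (Mode m : Mode.values()) {
            System.out.println(m + ": " + location(m, "A", "f") + " "
                + location(m, "A", "g") + " " + location(m, "B", "f"));
        }
    }
}
```

Two accesses can interfere only when they map to the same location name, so field insensitive merges A.f with A.g, field based merges A.f with B.f, and field sensitive merges neither.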