Vorlesung 12 Flashcards
Intel’s single core assessment
power and area increase more rapidly than performance
Why multi-core processors?
POWER – Formeln auf Folie 60
What is the evolution of processor performance
Currently, end of the line?, because only 3% processor performance increase can be achieved within a year.
ARM LITTLE
most energy efficient application processor from ARM
- simple architecture
- in-order
- 8 stage pipeline
ARM big
highest performance in mobile power envelope
- complex architecture
- out-of-order
- multi-issue pipeline
When does one use ARM’s little?
“Always on, always connected” tasks like OS, UI activity
Maximum efficiency
When does one use ARM’s big
for best performance…
“demanding tasks” like browsers, gaming, content creation
What is SGEMM/W?
single-precision general matrix multiplication per Watt
What is the lesson learned from nature?
High power efficiency of human brain comes from specialization. More advanced societies have higher degree of specilization.
—–> dedicated architectures have a higher energy efficiency compared to general purpose chip types
what is the main task of the zFAS (zentrales Fahrassistenzsteuergerät)
Sensor fusion (signals from stereo cameras, radar, multi-axis acceleration sensors)
what does a Field Programmable Gate Array (FPGA) consist of?
A set of programmable macro cells
A programmable interconnection network
Programmable input/outputs
what are LUTs and what are they used for?
Look-up tables, used as function generators in memory FPGA
characteristics of software
Large flexibility, easy to modify
„Easy“ to learn, many teaching courses
Minimal infrastructure necessary
Large amount of free and open source software
characteristics of hardware
Any software requires underlying hardware
Some functions can‘t be implemented in software, e.g., amplifier
Hardware is inherently parallel
Hardware engineering is based on physical laws
where does the problem of concurrency in software come from and how can it be solved?
programming languages are usually based on sequential models…
can be solved by having parallelized code