26. RAID Flashcards
Describe workshop papers.
Short (5-6 pages), usually containing only a provocative argument, system design, or very preliminary results
Ex: “Hierarchical Filesystems Are Dead”
Describe conference papers
Long (12-14 pages), enough space to describe and evaluate a complete novel system
Example: The Scalable Commutativity Rule
Describe journal papers
Longer (>14 pages), usually a published conference paper with extra material (frequently all of the unnecessary results that they removed to make the conference page limit)
Advice: read the conference paper
What are the 5 kinds of papers discussed in class?
- (Big) idea papers
- Problem papers
- Data papers
- New technology papers
- Wrong way papers
What do (big) idea papers cover?
Presents a new approach to an existing problem or a new idea about how to build systems.
Should convince you that the solution is (1) new, (2) works, and (3) is useful
What do problem papers cover?
Presents a new problem and, usually, some ideas about how to solve it.
Should convince you that the problem is (1) new, (2) matters, and (3) that there are some ways to solve it
What do data papers cover?
Presents novel analysis or analysis of a novel data set that produces interesting insights.
Should convince you that the results are useful to the design of future systems
What do new technology papers cover?
Describe some new hardware capability or device feature and why it’s interesting.
Should convince you that the hardware can be used to build better systems
What do wrong way papers cover?
Argue that the community is solving an existing problem incorrectly. Frequently these are workshop-level papers and eventually lead to idea papers.
Should be able to convince you that everyone else is confused and misguided (Good luck!)
What does RAID stand for?
Redundant Arrays of Inexpensive Disks
What kind of paper is the RAID paper?
A big idea paper.
It spawned a commonly-used technology, an entire industry, and lots of similar approaches
What is the big idea of the RAID paper?
Several cheap things can be better than one expensive thing
Where else do we see this idea applied today?
Multicore processors, Google using thousands of Dell computers in network, and crowdsourcing
What is the problem that the RAID paper identifies?
Computer CPUs are getting faster, computer memory is getting faster, and hard drives are not keeping up
While we can imagine improvements in software file systems via buffering for near term I/O demands, we need innovation to avoid an I/O crisis
What is the problem with the RAID solution to the cost of SLEDs?
Many cheap things fail much more frequently than one expensive thing.
This means there needs to be a plan in place to handle failures