Lecture 2: Terminology and Basics Flashcards

1
Q

Describe what Tracking is.

A

Tracking provides data about some object, user, etc. that is being tracked.

With regards to MR, tracking usually refers to positional tracking. The amount of (positional) data being tracked may vary depending on the area of application.

Tracking is considered an integral part of modern VR systems, as it is often seen as the key to interactive environments.

Current AR/VR devices are bundled or equipped with a number of different tracking systems that vary in tracked data, precision, flexibility, latency, ease of use, etc.

2
Q

What type of data does tracking provide, and which body parts or devices may it include?

A

A tracking system provides (positional) data for devices or objects.

May include but is not limited to:
- Head-mounted displays
- Controllers
- Fingers
- Hands
- Full body
- …

3
Q

How fast do we track?

What does “latency” mean in the context of tracking? What should we keep in mind when we think about latency and tracking?

A

In tracking, latency is how long it takes from a change of the tracked object or device until we know about the change.

When tracking, keep in mind that we not only need to measure a change and send the data to our application; rendering a new frame based on the new information takes time as well (60 fps should be considered a minimum for VR)!

Even seemingly low latency values (15-20 ms) are clearly noticeable when it comes to movement in VR!
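
To get a feel for the numbers, here is a minimal latency-budget sketch; the tracking and transmission timings are illustrative assumptions, only the 60 fps minimum is from above:

```python
# Minimal sketch of a motion-to-photon latency budget.
# Tracking and transmission timings are illustrative assumptions.

def motion_to_photon_ms(tracking_ms: float, transmission_ms: float, fps: float) -> float:
    """Total latency: measuring the change, sending it, and rendering a frame."""
    render_ms = 1000.0 / fps  # one frame of rendering time
    return tracking_ms + transmission_ms + render_ms

# At the 60 fps minimum, rendering alone costs ~16.7 ms per frame, so even
# small tracking and transmission delays push the total towards 20 ms.
print(motion_to_photon_ms(tracking_ms=2.0, transmission_ms=1.0, fps=60.0))  # ~19.7
```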

4
Q

Differentiate Rotation and Translation

A

Rotation is a turning movement around an axis; it changes the orientation of an object. Translation is a movement along an axis; it changes the position of an object. Both can occur around/along each of the three spatial axes.

5
Q

What does DOF refer to? Give an example of what a DOF can be.

A

Degrees of Freedom (DOF)
DOF refers to the number of parameters that can change individually.

With regards to tracking, a degree of freedom may be:
* Rotation around a single axis
* Translation along a single axis

“Rotation” is NOT a single degree of freedom. To freely express rotation, 3 degrees of freedom (one per axis) are needed!
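
As a small illustration (the class and field names are my own, not from the lecture), a 6DOF pose can be sketched as six independently changeable parameters:

```python
from dataclasses import dataclass

@dataclass
class Pose6DOF:
    # Three translational degrees of freedom (position along each axis)
    x: float = 0.0
    y: float = 0.0
    z: float = 0.0
    # Three rotational degrees of freedom (one angle per axis, in degrees)
    pitch: float = 0.0  # rotation around the x axis
    yaw: float = 0.0    # rotation around the y axis
    roll: float = 0.0   # rotation around the z axis

# A 3DOF tracker would only update pitch, yaw and roll; a 6DOF tracker
# updates all six parameters, each of which can change individually.
head = Pose6DOF()
head.yaw = 45.0
```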

6
Q

Differentiate 3DOF and 6DOF

A

3DOF tracking provides only rotational data: rotation around the three axes (orientation). 6DOF tracking additionally provides translational data: translation along the three axes, i.e. both position and orientation.

7
Q

Illustrate the meaning of 3DOF and 6DOF with a sketch

A
8
Q

What is the goal of VR? Is it the same with AR?

A

When users enter a virtual world, ideally they should:
* Feel like they are actually in the virtual environment
* React to input like they would in the real world

These concepts apply only to VR, as AR does not try to remove users from the real world.

9
Q

Give a definition of Presence. What influences Presence?

A

Presence is the feeling of actually being in the virtual world and will cause the user to react to virtual input just as they would in the real world.

Convincing a user that they are present in a virtual environment is usually the goal of an interactive VR system!

Presence does not only depend on the technology used to “enter” a virtual environment; it is also influenced by:
* Content
* Interactivity
* …

10
Q

Describe the term Immersion

A

Slater:
Let’s reserve the term ‘immersion’ to stand simply for what the technology delivers from an objective point of view. The more that a system delivers displays (in all sensory modalities) and tracking that preserves fidelity in relation to their equivalent
real-world sensory modalities, the more that it is ‘immersive’.

–> By this definition, immersion refers solely to the technological aspects of a VR system.

Other definitions of immersion refer more to the sensation of being surrounded or enveloped by a virtual environment:

Witmer and Singer:
Immersion is a psychological state characterized by perceiving oneself to be enveloped by, included in, and interacting with an environment that provides a continuous stream of stimuli and experiences.

11
Q

What can be said about immersion and presence? Is there a clearly defined line between them?

A

Immersion and presence are connected.
There is no clearly defined line between immersion and presence. High immersion helps achieve good presence in a virtual environment!

12
Q

How can we measure immersion and presence?

A

Usually when we want to collect data about immersion and presence, we need to talk to the users! User Studies help us collect these types of data. Multiple questionnaires are available that you can build upon!

We can use:
* SUS
* Presence Questionnaire
* SSQ (Simulator Sickness Questionnaire)

13
Q

Describe what cybersickness is. What causes it?

A

Some people experience symptoms similar to motion sickness when using VR systems.
This is not as common with AR, as we are watching the real world! Cybersickness is usually caused by a discrepancy between senses. Although cybersickness is influenced by many factors like quality of the tracking system
and type of content, there is currently no cure or way to avoid it for everybody!

14
Q

Describe Binocular Vision

A

Front-facing eyes
Two eyes facing the same direction perceive two images from slightly different directions.
The resulting perception is a single three-dimensional image.
We can use this principle to create the illusion of depth when artificially creating images!

15
Q

Describe what parallax is

A

When we move a camera, the objects seem to move within the picture. When looking at the world through two eyes, the objects’ positions within the pictures shift for each eye. This shift is called parallax.
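
A toy calculation makes this concrete. Under a simple pinhole-camera assumption (all numbers below are illustrative), the shift of a point between the two images shrinks with distance:

```python
# Toy parallax calculation for a stereo pair under a pinhole-camera assumption:
# shift (disparity) = focal length * eye separation / depth.

def disparity_px(focal_px: float, baseline_m: float, depth_m: float) -> float:
    """Horizontal shift of a point between the left and right image, in pixels."""
    return focal_px * baseline_m / depth_m

for depth in (0.5, 1.0, 2.0, 10.0):
    # nearby objects shift a lot between the two views, distant ones barely at all
    print(depth, disparity_px(focal_px=800.0, baseline_m=0.064, depth_m=depth))
```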

16
Q

Binocular vision is a big part of depth perception. But are there more approaches to achieve an illusion of depth?

A

Our brains can gather depth information not only from binocular vision, but also from other sources: movement, occlusion, etc. (e.g. video games, “wiggle gifs”). A single, static two-dimensional image does not work well for conveying depth!

17
Q

Briefly explain Stereoscopic Rendering. What is called a stereogram?

A

Stereoscopic rendering produces two images from slightly different viewpoints. One image is presented to each eye, creating the illusion of depth – if done correctly.

A pair of images – one image for the left, the other for the right eye – is called a stereogram, though the term is not commonly used.
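
A minimal sketch of the camera setup behind a stereogram, assuming a 64 mm IPD and plain tuple maths (all names are illustrative):

```python
import math

# One virtual camera per eye, offset by half the IPD to each side along the
# head's local "right" direction. The 64 mm IPD is an example value.

def eye_positions(head, right, ipd_m=0.064):
    length = math.sqrt(sum(c * c for c in right))
    half = tuple(0.5 * ipd_m * c / length for c in right)
    left_eye = tuple(h - o for h, o in zip(head, half))
    right_eye = tuple(h + o for h, o in zip(head, half))
    return left_eye, right_eye

head = (0.0, 1.7, 0.0)        # head position at standing eye height
right_dir = (1.0, 0.0, 0.0)   # head's local right direction
left_eye, right_eye = eye_positions(head, right_dir)
print(left_eye, right_eye)    # (-0.032, 1.7, 0.0) (0.032, 1.7, 0.0)
# Render the scene once from each position and present the left image to the
# left eye and the right image to the right eye.
```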

18
Q

What does IPD stand for?

A

Interpupillary Distance
IPD is the distance between a human’s (or dog’s) pupils.
To achieve a good illusion of depth, the IPD is adjusted to the individual we are rendering a stereogram for.

19
Q

What does Vergence mean?

A

The viewing directions of our eyes are not parallel. When focusing on an object, the eyes move in opposite directions to aim at the target. This movement is called vergence.

20
Q

What does Accommodation mean?

A

Besides vergence, our eyes also focus so that we see a clear image despite varying distances. This is called accommodation. Accommodation is achieved by adjusting the lenses in our eyes.

21
Q

Describe the Problem with Vergence and Accommodation

A

Vergence-Accommodation Conflict:
Stereoscopy only creates the illusion of depth, and our eyes try to focus on the object that we are displaying. The distance to the display and to the virtual object may not be identical, resulting in a mismatch of vergence and accommodation.

The vergence-accommodation conflict can cause problems with focus, eyestrain, fatigue, etc.

It is a problem we currently face with HMDs, 3D displays/TVs, etc.
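
A small worked example, assuming an HMD whose optics focus at roughly 1.5 m and a 64 mm IPD (both illustrative numbers): the angle the eyes converge to for a near virtual object clearly differs from the angle that would match the display's focal distance:

```python
import math

def vergence_angle_deg(distance_m: float, ipd_m: float = 0.064) -> float:
    """Angle between the two eyes' viewing directions when fixating a point."""
    return math.degrees(2.0 * math.atan(ipd_m / (2.0 * distance_m)))

display_focus_m, virtual_object_m = 1.5, 0.4
print(vergence_angle_deg(virtual_object_m))  # eyes converge for 0.4 m: ~9.2 deg
print(vergence_angle_deg(display_focus_m))   # lenses accommodate for 1.5 m: ~2.4 deg
# The mismatch between these two cues is the vergence-accommodation conflict.
```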

22
Q

Describe what FOV stands for. What can limit the FOV?

A

Field of View (FOV):
The field of view denotes the area visible to a device or being. The FOV may be limited by a recording device (or eye) or by an output device (like an HMD).

23
Q

How do we measure FOV? What should we consider when reading about FOV?

A

For our purposes FOV is usually measured in degrees.
When reading about FOV, be mindful of the direction: FOV may be horizontal, vertical
or diagonal!
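
For illustration, the FOV covered by a flat display can be computed from its extent and viewing distance (the numbers below are assumptions):

```python
import math

# FOV covered by a flat display of a given extent seen from a given distance.

def fov_deg(extent_m: float, distance_m: float) -> float:
    return math.degrees(2.0 * math.atan(extent_m / (2.0 * distance_m)))

# The same formula yields horizontal, vertical or diagonal FOV, depending on
# whether the display's width, height or diagonal is passed in.
print(fov_deg(extent_m=1.2, distance_m=2.0))  # ~33.4 degrees horizontally
```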

24
Q

What is another term for Rendering?

A

image synthesis

25
Q

Is rendering for VR or AR different from “normal” computer graphics?

A

Rendering for VR or AR is not different from “normal” computer graphics (CG).

Stereoscopic rendering just means rendering a different 2D image for each eye.

26
Q

Input Data

How can we represent the data?

A

There is a wide variety of ways to describe input data, ranging from geometric data (like points or polygons) through volumes to mathematical descriptions (like surfaces).

27
Q

Name different Data Types. Are all Rendering Methods applicable to all Data Types?

A

Data types include points (point clouds), lines, polygons/triangle meshes and voxels (volume data). Not all rendering methods are applicable to all data types; some data may first need to be converted (e.g. a point cloud computed into a triangle mesh) before a given method can render it.

28
Q

Name three Primitives (rendering primitives). Why should we tell an API about the Rendering Primitive?

A

Common rendering primitives are points, lines and triangles. The API needs to know the rendering primitive to interpret the vertex data correctly, e.g. whether two consecutive points form a line or three consecutive points form a triangle.

29
Q

What is point-based rendering?

A

For our purpose a “point” is a rendering primitive that stores at least a position in space.

A set of points, or point cloud, can be used to render a continuous three-dimensional
surface.

Point-based rendering was proposed as early as the mid-1980s.

30
Q

Point-Based Rendering

What additional attributes may points store?

A

Besides a position, points may store additional attributes such as a colour, a normal vector or a size/radius.

31
Q

What negative effects can you encounter with Point-Based Rendering? Name one approach to counteract this side effect.

A

Given enough points we can render a seemingly continuous three-dimensional surface.

But: If we move the camera too close to the surface, holes may appear between points!

Various techniques have been proposed to use point clouds to render smooth surfaces.
One simple approach is to render small discs instead of just the points, to create a surface.

32
Q

Is point based rendering efficient? Justify your answer.

Where are point clouds commonly used today?

A

Rendering points is very efficient:
* Each point is rendered individually (including lighting)
* No connectivity information needed, all points are individual
* No interpolation between edges (like with polygons)
* Supported by GPUs and most APIs

Point clouds are commonly used today:
* Laser scanning
* Stereo-Vision
* Photogrammetry

33
Q

From Data to Image

Describe Line Rendering

A

Lines:
Building upon points, a line is a rendering primitive that is defined by its two endpoints.
Line rendering is not commonly used for surfaces, but may be useful at times and is
supported by most rendering APIs.

We only know the data stored in the endpoints. To generate data for any position on the line, the data from the endpoints is interpolated.

Line rendering is common in SciVis!
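
A minimal sketch of this endpoint interpolation (a plain linear blend; the function name is my own):

```python
# Any attribute stored in the two endpoints (position, colour, ...) is
# blended linearly along the line.

def lerp(a, b, t):
    """Interpolate component-wise between endpoint attributes a and b, t in [0, 1]."""
    return tuple(x + t * (y - x) for x, y in zip(a, b))

p0, p1 = (0.0, 0.0, 0.0), (2.0, 4.0, 0.0)  # endpoint positions
c0, c1 = (1.0, 0.0, 0.0), (0.0, 0.0, 1.0)  # endpoint colours (red -> blue)
print(lerp(p0, p1, 0.25))  # position a quarter of the way along the line
print(lerp(c0, c1, 0.25))  # colour at the same spot: still mostly red
```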

34
Q

What is a streamline?

A

A streamline is a line in 3D space representing the path of flow of a fluid, particle, etc.

Streamlines allow an easy-to-understand representation for visualising turbulence, paths of least resistance, etc.

35
Q

What are Polygons and triangles?

A

A polygon is a geometric figure that is described by a number of straight lines.

As a rendering primitive triangles are most commonly used. A triangle primitive consists
of three points forming three edges (lines) and a planar surface.

As seen with lines, data for any position on the surface is generated by interpolating data from the three points.
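
A small sketch of this interpolation using barycentric weights, the standard scheme for triangles (names and numbers are illustrative):

```python
# Each point on the triangle mixes the data stored in the three vertices using
# barycentric weights w0, w1, w2 (non-negative and summing to 1).

def interpolate(v0, v1, v2, w0, w1, w2):
    return tuple(w0 * a + w1 * b + w2 * c for a, b, c in zip(v0, v1, v2))

# Vertex colours of a triangle: red, green and blue.
c0, c1, c2 = (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)
print(interpolate(c0, c1, c2, 1/3, 1/3, 1/3))  # triangle centre: even grey mix
print(interpolate(c0, c1, c2, 1.0, 0.0, 0.0))  # exactly at vertex 0: pure red
```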

36
Q

What do we need to approximate complex surfaces with triangles?
What is a Triangle Mesh?

A

To approximate complex surfaces we need a large number of triangles plus connectivity information describing which points they share. A set of triangles connected via shared vertices and edges is called a triangle mesh.

37
Q

Polygons

What do modern APIs provide to improve storage efficiency?

What is required to form triangles from a set of points?

A

To improve storage efficiency modern APIs support vertex and index buffers to avoid redundancy:

  • Vertex buffers store just point data
  • Index buffers store the indices of points in the vertex buffer; three consecutive indices form a triangle

Forming triangles from a set of points requires information about connectivity: which groups of 3 points form a triangle?

Since triangles may be connected, a single point can be part of multiple triangles.
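
A minimal sketch of this buffer layout for a quad built from two triangles, with plain Python lists standing in for GPU buffers:

```python
# Four shared points are stored once in the vertex buffer; six indices in the
# index buffer describe two triangles without duplicating any point data.

vertex_buffer = [
    (0.0, 0.0, 0.0),  # 0: bottom left
    (1.0, 0.0, 0.0),  # 1: bottom right
    (1.0, 1.0, 0.0),  # 2: top right
    (0.0, 1.0, 0.0),  # 3: top left
]
index_buffer = [0, 1, 2,   0, 2, 3]  # three consecutive indices per triangle

triangles = [tuple(vertex_buffer[i] for i in index_buffer[t:t + 3])
             for t in range(0, len(index_buffer), 3)]
print(len(triangles))  # 2 triangles; vertices 0 and 2 are shared by both
```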

38
Q

Colours and Textures

Where may additional surface data (like colours and textures) be stored?

A

Additional surface data may be either stored in the mesh (vertex data) or in a separate
map/image/lookup table (texture).

39
Q

Vertex Colours

What is a simple way to add colours to meshes? How is this approach limited?

A

A simple way to add colours to meshes is to store colour data in the vertices.

Each vertex stores a colour value; for each point of the triangle surface the colour value is computed by interpolating the colours from the vertices.

Vertex colours are a rather simple approach that is quite limiting, since the amount of
detail in the surface colour depends on the size and number of triangles.

40
Q

What is a more sophisticated approach of storing colour values?

A

Ideally, vertex data should define the shape, but not limit detail in the colours. Instead of storing colour values in the mesh, we map an image onto the surface. The image is called a texture.

To know what part of the image goes where, each vertex stores a texture coordinate
(often denoted as uv coordinate), usually in the range [0-1].

As with vertex colours, interpolation across the triangle is used, but here for the texture
coordinates and not the colour values.
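
A tiny sketch of such a texture lookup with uv coordinates in the range [0-1], using nearest-neighbour sampling of a hand-made 2x2 "image" (real renderers typically filter between texels):

```python
# A 2x2 example texture: each entry is an RGB colour.
texture = [
    [(255, 0, 0), (0, 255, 0)],    # row 0: red, green
    [(0, 0, 255), (255, 255, 0)],  # row 1: blue, yellow
]

def sample(tex, u, v):
    """Nearest-neighbour lookup: map u, v in [0, 1] to a texel."""
    h, w = len(tex), len(tex[0])
    x = min(int(u * w), w - 1)  # clamp so u = 1.0 stays inside the image
    y = min(int(v * h), h - 1)
    return tex[y][x]

# During rendering, u and v are themselves interpolated across the triangle
# and then used for lookups like this one.
print(sample(texture, 0.1, 0.9))  # (0, 0, 255): the blue texel
```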

41
Q

What may occur when creating textures?

A

When creating textures, stretching and shrinking of parts of the image may occur to match the uv-mapping and the model.

Nowadays, textures can be more than
just images. They may store a wide variety of data that is read and mapped to a model during rendering (e.g. materials, normal vectors, etc.).

42
Q

What does MipMapping mean?

A

Mipmapping means storing a texture together with multiple pre-computed copies of progressively lower resolution (mipmaps), so that a suitably sized version can be chosen during rendering.

43
Q

How to create and use mipmaps? What can be a disadvantage?

A

Mipmapping takes a texture and creates multiple versions with progressively lower
resolutions: for each new mipmap of a texture the horizontal and vertical resolution is
halved.

During rendering a mipmap may be used instead of the original texture depending on the distance between the camera and a pixel (greater distance, lower-resolution mipmap).

Mipmapping can avoid artefacts and increase rendering speed, but at the cost of texture memory.
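
A small sketch of a mipmap chain and its memory cost (the 1024x1024 base resolution is an example):

```python
# Build the mipmap chain by halving width and height until reaching 1x1.

def mip_chain(width, height):
    levels = [(width, height)]
    while width > 1 or height > 1:
        width, height = max(width // 2, 1), max(height // 2, 1)
        levels.append((width, height))
    return levels

chain = mip_chain(1024, 1024)
print(chain[:4])  # [(1024, 1024), (512, 512), (256, 256), (128, 128)]

base = 1024 * 1024
extra = sum(w * h for w, h in chain[1:])
print(extra / base)  # ~0.333: the chain costs about a third more texture memory
```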

44
Q

What is a Voxel?

A

A voxel is a “three-dimensional pixel”. It represents a value for a cell within a grid in space.

Voxels are commonly used to represent volume data. A number of voxels, often as part
of a regular grid, describe the parameters of the volume for their given region.

Volume data are often used in scientific applications and can contain all kinds of data, e.g.: Pressure, moisture, flow speed and direction, temperature, …
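
A minimal sketch of a regular voxel grid (grid size, cell size and values are illustrative assumptions):

```python
# A 4x4x4 grid of voxels, each storing one scalar (say, a temperature).
NX, NY, NZ = 4, 4, 4
CELL_SIZE = 0.5  # metres per cell

grid = [[[20.0 for _ in range(NZ)] for _ in range(NY)] for _ in range(NX)]
grid[1][2][3] = 37.5  # set the value of a single voxel

def voxel_at(x_m, y_m, z_m):
    """Look up the voxel whose cell covers a position in space."""
    return grid[int(x_m / CELL_SIZE)][int(y_m / CELL_SIZE)][int(z_m / CELL_SIZE)]

print(voxel_at(0.6, 1.2, 1.7))  # position falls into cell (1, 2, 3) -> 37.5
```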

45
Q

Where is Volume data frequently encountered?

A
  • In medical applications: CT scans, medical imaging, …
  • Simulation and Supercomputing
46
Q

What is called “rendering”?

A

The process of creating an image from data is called “rendering”.

47
Q

How is image generation most commonly done?

A

Rasterisation is the process of mapping our objects to the pixels of an image.

Rasterisation is well supported in modern hardware and APIs and is commonly used in
video games, real-time graphics, etc.

Each primitive to be rendered is mapped to pixels of the resulting image. Due to its performance and hardware support, most VR rendering is done using rasterisation.

48
Q

What is Ray Tracing?

A

Ray tracing simulates individual rays of light.
This approach allows us to simulate many physical effects, especially when recursively
tracing rays. Ray tracing is very computationally intensive, though.

Ideally we would want to render everything through ray tracing. Due to the required
computational power this is, however, currently not possible. Current advancements in hard- and software try to integrate ray tracing with rasterisation for improved reflections, shadows, etc. (Nvidia RT cores, AMD ray accelerators).

49
Q

Can we simplify ray tracing?
If yes, what may be the trade-off?

A

Ray casting simplifies the idea of ray tracing by casting a ray from the camera through
each pixel of the image. When a ray hits an object, we have a colour value.

Ray casting is a lot simpler and faster than ray tracing; it does lose a lot of ray tracing features though, as it does not simulate the physical effects as well.
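
A minimal ray-casting sketch: one ray per pixel from the camera, tested against a single hard-coded sphere, with a hit mapped straight to a "colour" (here just an ASCII character). The scene and resolution are illustrative assumptions:

```python
import math

SPHERE_C, SPHERE_R = (0.0, 0.0, -3.0), 1.0  # a single sphere in front of the camera
W, H = 12, 6                                # a tiny "image"

def hit_sphere(ox, oy, oz, dx, dy, dz):
    # Solve |o + t*d - c|^2 = r^2 for t; a real solution means the ray hits.
    lx, ly, lz = ox - SPHERE_C[0], oy - SPHERE_C[1], oz - SPHERE_C[2]
    b = 2.0 * (dx * lx + dy * ly + dz * lz)
    c = lx * lx + ly * ly + lz * lz - SPHERE_R * SPHERE_R
    return b * b - 4.0 * c >= 0.0  # discriminant, with a = d.d = 1 for unit d

for j in range(H):
    row = ""
    for i in range(W):
        # Map the pixel to a direction through an image plane at z = -1.
        dx = (i + 0.5) / W * 2.0 - 1.0
        dy = 1.0 - (j + 0.5) / H * 2.0
        dz = -1.0
        n = math.sqrt(dx * dx + dy * dy + dz * dz)
        row += "#" if hit_sphere(0.0, 0.0, 0.0, dx / n, dy / n, dz / n) else "."
    print(row)
```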

50
Q

Can we simulate the real world?
What does PBR mean in this context?

A

Physically Based Rendering (PBR)

Physically Based Rendering tries to generate images by simulating what happens in the real world (light bouncing around, being reflected, refracted, etc.).

Usually aims to create photo-realistic results, but is computationally intensive!

PBR used to be for cinematic rendering only, but has now started to enter the real-time world as well!

51
Q

What is the difference between direct and indirect lighting?
What is the aim of Global Illumination?

A

Direct light is light directly from a light source hitting an object.

In reality, light bounces off surfaces. Every surface reflects some amount of light, usually changing the colour. This reflected light also lights the surfaces around it -> indirect lighting.

Global Illumination aims to add light from reflections to create a more realistic image.

52
Q

When rendering for MR, especially VR, what is considered crucial?

A

performance!

Not only do low framerates cause cybersickness, but we also need to render separate images for each eye, so twice the frames!

As a result, methods to increase performance, ideally without losing visual quality, are very important in VR!

53
Q

When rendering, everything gets mapped to…

What can we say about how many pixels of an image an object is mapped to?

A

pixels

How many pixels of an image an object is mapped to depends on scale, distance, etc.
Fewer pixels means fewer details are visible!

54
Q

What does LoD mean?

A

Level of Detail (LoD). Objects are rendered at varying levels of detail (e.g. meshes with fewer triangles, lower-resolution textures) depending on how many pixels they cover; small or distant objects do not need full detail.

55
Q

What does Culling mean?

A

Not all parts of a scene are always visible.

Some objects may be outside the area visible by the camera, others may just be hidden by some object and for non-transparent objects we usually cannot see the backside.

Culling refers to the removal of objects or parts of objects that are not visible. Usually we talk about frustum culling, occlusion culling and backface culling.
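
As an illustration, backface culling boils down to a dot-product test per triangle (the vector helpers and winding convention below are my own assumptions):

```python
# A triangle is culled when its normal (from the vertex winding order) points
# away from the camera.

def sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def is_backface(v0, v1, v2, camera):
    normal = cross(sub(v1, v0), sub(v2, v0))
    return dot(normal, sub(v0, camera)) >= 0.0  # facing away -> cull

cam = (0.0, 0.0, 5.0)
tri = ((0, 0, 0), (1, 0, 0), (0, 1, 0))          # counter-clockwise seen from cam
print(is_backface(*tri, cam))                    # False: front face, keep it
print(is_backface(tri[0], tri[2], tri[1], cam))  # True: reversed winding, cull it
```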

56
Q

What is called foveated rendering?

A

We mentioned that we render objects with varying detail, depending on the amount of detail we need.

Humans only see full detail if we fixate on some point. Anything in our field of vision that we do not focus on is not perceived as detailed.

Knowing where a user looks allows us to render with high detail only where needed
(= what the user looks at). This is called foveated rendering.

57
Q

When rendering a scene, the final picture is created…

Go in more detail regarding Drawcalls.

A

incrementally

Drawcalls:
In a scene we usually have many objects. Most often, each object is sent to the GPU individually and drawn onto the image. When all objects are drawn, the image is complete.

Drawcalls can be expensive and can cause CPU overhead, so fewer drawcalls are better/faster!

58
Q

What does Geometry Instancing describe?

A

In many scenes, objects are rendered many times: grass/vegetation, crowds, flocks of
birds, etc!

When rendering an object multiple times we can use geometry instancing!

Instead of drawing one object many times, we send the geometry to the GPU once together with a list of varying attributes (position, rotation, etc.)

Geometry Instancing can reduce overhead and improve performance (when CPU bound).
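
A conceptual sketch of the idea; draw_instanced below is a hypothetical stand-in for the single instanced drawcall a real graphics API would issue, not an actual API function:

```python
# The base geometry is stored once; per-instance attributes (here just a 2D
# position offset) are passed alongside it in a single submission.

base_blade = [(0.0, 0.0), (0.02, 0.3), (-0.02, 0.3)]      # one triangle of "grass"
instance_offsets = [(i * 0.5, 0.0) for i in range(1000)]  # 1000 placements

def draw_instanced(geometry, offsets):
    """Hypothetical stand-in: one submission producing many placed copies,
    instead of 1000 individual drawcalls."""
    return [[(vx + ox, vy + oy) for vx, vy in geometry] for ox, oy in offsets]

all_blades = draw_instanced(base_blade, instance_offsets)
print(len(all_blades))  # 1000 placed triangles from a single "drawcall"
```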

59
Q

What does Instanced Stereo mean?

A

In VR, we always render everything twice (once per eye)!

In essence, we render each object twice: once for the left and once for the right eye. Instanced stereo (also known as single pass stereo) replaces drawcalls with instanced drawcalls to render both eyes at once (into a single packed texture). Can help to increase VR performance!

60
Q

What are ways for Data Generation?

A

Manual Generation: time-consuming, requires artistic skill and creativity

Automated Generation

61
Q

What data will be created when using a laser scanner?

A

Tools like laser scanners usually scan the environment and create a point cloud for
any solid surface.

The resulting data can either be rendered directly or computed into a triangle mesh, volume data, …

62
Q

What is Photogrammetry?

A

One variety of scanning the environment is photogrammetry.

Photogrammetry uses a camera as a sensor.
Given many photos of a single object or environment, an automated algorithm can match the photos, find overlapping areas and compute how the images fit together. If a point is visible in multiple images, its position in 3D space can be reconstructed. This process is repeated to create a (coloured) point cloud.