vulkano/DESIGN.md

# Design document

The Vulkan specs are not public, but [the Mantle specs are](http://www.amd.com/Documents/Mantle-Programming-Guide-and-API-Reference.pdf). Many design decisions can be made already based on what Mantle shows us.

## Objectives of this library

From highest to lowest priority:

 - Be memory safe.
 - Don't hide what it's doing.
 - Be convenient to use.

## Extensions

Mantle has an extensions system for things such as DMA, occlusion query or platform-specific bindings.
It is unclear how Vulkan handles this.

Mantle's bindings to Windows is pretty simple: you need to create a special kind of resource that is similar to images, and this special kind of resource can be drawn on a `HWND`. The image isn't tied to the `HWND` ; you need tell which `HWND` to use each time you swap buffers. Deciding how to bind this library to platform-specific interfaces requires a broader picture.

## Thread safety

Some Mantle functions are marked as not being thread-safe. However we most likely want to store them in `Arc`s.

If we used `&mut` for non-thread-safe operations, we would need to wrap the entire object in a `Mutex`. Instead the best solution is to use an internal mutex that is used only for thread-unsafe operations:

```rust
pub struct Device {
    mutex: Mutex<()>,
    ...
}

impl Device {
    pub fn thread_safe_operation(&self) {
        // mutex not used
        vkDoSomething();
    }

    pub fn not_thread_safe_operation(&self) {
        let _lock = self.mutex.lock();
        vkDoSomethingElse();
    }
}
```

## Shaders introspection

When you use a descriptor set so that it's used by a shader, it is important for safety that you make sure that it matches what the shader expects.

However the Vulkan API will likely not provide any way to introspect a shader. Therefore a small SPIR-V analyzer will need to be included in the library that parses the input data.

This situation could be improved after Rust plugins are made stable, so that the analysis is done at compile-time.

## Resources borrowing management

*In this sections, "resources" describes everything that is used by a command buffer: buffers, images, pipelines, dynamic state, etc.*

The rule is that objects must not be written by the CPU nor destroyed while they are still used by the GPU. The Vulkan library doesn't automatically handle this for you, but this library should enforce it.

### Ownership by command buffers

Command buffers need to somehow make sure that the resources they use are still valid as long as they exist. This could be handled with regular Rust lifetimes, but since Rust doesn't allow structs to borrow themselves my personal opinion is that a `CommandBuffer<'a>` struct would be too cumbersome. You would end up with something like this:

```rust
pub struct GameResources {
    ...
}

pub struct GameCommandBuffers<'a> {
    command_buffer1: vulkano::CommandBuffer<'a>,
    ...
}

fn main() {
    let resources = GameResources::new();
    let command_buffers = GameCommandBuffers::new(&resources);
}
```

It could also be handled with `Arc`s or `Rc`s. In my opinion the best way is to give the choice to the user through a trait.

Example:

```rust
impl CommandBufferBuilder {
    pub fn add_buffer_copy_command<B>(&mut self, buffer: B) where B: SomeTrait<Buffer> {
        ...
    }
}
```

### When to destroy?

Resources should not be destroyed as long as they are still in use by the GPU.

The trick is that the only way that a resource could be in use by the GPU is through a command buffer. Command buffers too must be not destroyed as long as they are still in use. Command buffers already ensure that the resources they use stay alive, therefore the only tricky part is to know when to destroy command buffers.

If all command buffers that use a specific resource are destroyed, then we are sure that this resource is not in use by the GPU and we can delete it. The destructors of buffers, textures, etc. are trivial.

When it comes to command buffers, in my opinion it should look like this:

```rust
pub fn submit_command_buffer<'a>(cmd: &'a CommandsBuffer) -> FenceGuard<'a> {
    ...
}
```

The `FenceGuard` ensures that the command buffer is alive. Destroying a `FenceGuard` blocks until the vulkan fence is fulfilled.

### Mutability

*Only relevant for buffers and images.*

Since both a command buffer and the user have access to a borrow of the same resource, we have to ensure that a resource can't be read and written simultaneously (similarly to any other CPU object).

This is going to be done with a wrapper object, just like `RefCell` or `Mutex`.

Example:

```rust
pub struct Buffer {
    ...
}

impl Buffer {
    // always available
    pub fn modify(&mut self, ...) { ... }
}

pub struct GpuAccess<T> {
    content: T,
    in_use: Option<Fence>,
}

impl GpuAccess<T> {
    // waits for the fence if necessary
    pub fn lock(&self) -> &mut T { ... }
    pub fn try_lock(&self) -> Option<&mut T> { ... }
}
```

The command buffer writes the fence in the `GpuAccess` through a trait.

But should submitting a command buffer wait for fences in the resources they use? The answer would be no if all command buffers are submitted to the same queue, since you would be sure that the first command buffer is over when the second one starts. However we have multiple queues accessible to us (including the DMA and multiple GPUs). Mantle provides semaphore objects for this. It is unknown if Vulkan uses the same mechanism. **This point remains to be seen**.

### Fence or no fence

If you submit several command buffers in a row to the same queue, then you only need to create a fence at the last submission.

Creating a fence every time could become a big overhead. However you usually know too late whether you needed a fence or not at the previous submission. We can't predict the future, therefore the choice should be given to the user whether to create a fence or not.

Consequently, submitting a command buffer would look like this:

```rust
pub fn submit_command_buffer_fence<'a>(cmd: &'a CommandsBuffer) -> FenceGuard<'a> {
    ...
}

pub fn submit_command_buffer_nofence<'a>(cmd: &'a CommandsBuffer) -> NoFence<'a> {
    ...
}
```

`NoFence` would be a linear type. Rust doesn't have linear types, but this can be emulated by panicking in the destructor.

The only way to use a `NoFence` is to consume it by submitting another command buffer immediately after:

```rust
impl<'a> NoFence<'a> {
    pub fn submit_after_fence(self, cmd: &'a CommandsBuffer) -> FenceGuard<'a> { ... }
    pub fn submit_after_nofence(self, cmd: &'a CommandsBuffer) -> NoFence<'a> { ... }
}
```

If possible this could be made more convenient by merging the two methods and returning a trait implementation instead:

```rust
pub fn submit_command_buffer<'a, F>(cmd: &'a CommandsBuffer) -> F where F: GpuBlock<'a> { ... }

impl<'a> NoFence<'a> {
    pub fn submit_after<F>(self, cmd: &'a CommandsBuffer) -> F where F: GpuBlock<'a> { ... }
}
```

## Resources state management

With Mantle, all buffers and textures are in a certain state. This is likely going to be the same in Vulkan.
To do certain operations, a resource must be in a given state. For example copying data in RAM to a buffer requires the buffer to be in the `GR_MEMORY_STATE_DATA_TRANSFER` state.

Changing the state of a resource requires using a command in a command buffer, and requires to know what the state of the resource at the time of the execution is. For example to read the same buffer from a shader, you must switch it from the `GR_MEMORY_STATE_DATA_TRANSFER` state to the `GR_MEMORY_STATE_GRAPHICS_SHADER_READ_WRITE` state. This is done by explicitely stating that the buffer currently is in the `GR_MEMORY_STATE_DATA_TRANSFER` state.

The problem is that we need to know what the state of a resource is at the time *when the command buffer is used*, and not at the time when the command buffer is created. Let's say that we build the command buffers A and B. A leaves a given resource in a certain state. B needs to know in which state that resource was left in. Requiring the user to state in which order the command buffers will be executed would be a burden.

Instead to solve this problem, resources will have a "default state".

 - At initialization, a resource is switched to this default state.
 - Command buffers are free to switch the state of a resource, but must restore the resource to its default state at the end.

## Pipelines and dynamic state

There are five type of objects in Mantle that are "dynamic states": the rasterizer, the viewport/scissor, the blender, the depth-stencil, and the multisampling states. In addition to this, there is a pipeline object that holds the list of shaders and more states.

One instance of each of these object types must be binded by command buffers before drawing.

The dynamic state objects are annoying for the programmer to manage. Therefore I think it is a good idea to have the user pass a Rust `enum` that describes the dynamic state, and look for an existing object in a `HashMap`. This hash map would hold `Weak` pointers.

Pipelines, however, would still be created manually.

## Descriptor sets, samplers, memory views

Descriptor sets are the Vulkan equivalent of OpenGL uniforms. The Mantle specs seem to imply that different layouts can lead to tradeoffs between GPU and CPU performances. The best solution is to have a default system that just gives a basic layout, but make it possible for the user to use a custom layout instead.

Samplers should be handled internally by this library and never be destroyed. This simplifies a lot of things when it comes to synchronization.

The memory layout of each buffer access should be passed by value when adding a draw command to a command buffer.

**To do**

## Context loss

**To do**

## Examples of what it should look like

These examples are likely to change, but they should give a good overview:

### Uploading a buffer

Uploading to CPU-accessible memory, ie. slow write:

```rust
#[derive(Copy, Clone, BufferLayout)]      // `#[derive_BufferLayout]` is not possible until plugins are stable though
struct Data {
    position: (f32, f32, f32),
    tex_coords: (f32, f32),
}

let data: &[Data] = &[Data { ... }, Data { ... }, ...];
let mut buffer = AccessBuffer::cpu_accessible_with_data(&device, &data);

// let's modify a value of the sake of the example
{
    let mut mapping = buffer.write();       // same API as RwLock
    mapping[1].position.0 = 1.0;
}
```

Using the DMA to copy:

```rust
let local_buffer = AccessBuffer::pinned_memory_with_data(&device, &data);
let remote_buffer = Buffer::empty(&device, std::mem::size_of_val(&data));

let dma_queue = device.dma_queues().next().unwrap();
let command_buffer = CommandBufferBuilder::once(&device)            // `once` optimizes for buffers that trigger once
                        .copy_buffer(&local_buffer, &remote_buffer)
                        .build();
dma_queue.submit(&command_buffer);

assert!(local_buffer.try_write().is_none());        // buffer is in use, we can't access it because of safety
assert!(local_buffer.try_read().is_some());         // however we can read its content
```

### Hello triangle

**To do**
Initial commit 2015-07-06 08:54:29 +00:00			`# Design document`

			`The Vulkan specs are not public, but [the Mantle specs are](http://www.amd.com/Documents/Mantle-Programming-Guide-and-API-Reference.pdf). Many design decisions can be made already based on what Mantle shows us.`

			`## Objectives of this library`

Update goals 2015-11-01 10:06:25 +00:00			`From highest to lowest priority:`

			`- Be memory safe.`
Initial commit 2015-07-06 08:54:29 +00:00			`- Don't hide what it's doing.`
Update goals 2015-11-01 10:06:25 +00:00			`- Be convenient to use.`
Initial commit 2015-07-06 08:54:29 +00:00
			`## Extensions`

			`Mantle has an extensions system for things such as DMA, occlusion query or platform-specific bindings.`
			`It is unclear how Vulkan handles this.`

			Mantle's bindings to Windows is pretty simple: you need to create a special kind of resource that is similar to images, and this special kind of resource can be drawn on a `HWND`. The image isn't tied to the `HWND` ; you need tell which `HWND` to use each time you swap buffers. Deciding how to bind this library to platform-specific interfaces requires a broader picture.

			`## Thread safety`

			Some Mantle functions are marked as not being thread-safe. However we most likely want to store them in `Arc`s.

			If we used `&mut` for non-thread-safe operations, we would need to wrap the entire object in a `Mutex`. Instead the best solution is to use an internal mutex that is used only for thread-unsafe operations:

			```rust
			`pub struct Device {`
			`mutex: Mutex<()>,`
			`...`
			`}`

			`impl Device {`
			`pub fn thread_safe_operation(&self) {`
			`// mutex not used`
			`vkDoSomething();`
			`}`

			`pub fn not_thread_safe_operation(&self) {`
			`let _lock = self.mutex.lock();`
			`vkDoSomethingElse();`
			`}`
			`}`
			```

			`## Shaders introspection`

			`When you use a descriptor set so that it's used by a shader, it is important for safety that you make sure that it matches what the shader expects.`

			`However the Vulkan API will likely not provide any way to introspect a shader. Therefore a small SPIR-V analyzer will need to be included in the library that parses the input data.`

			`This situation could be improved after Rust plugins are made stable, so that the analysis is done at compile-time.`

Update DESIGN.md 2015-11-01 10:05:38 +00:00			`## Resources borrowing management`
Initial commit 2015-07-06 08:54:29 +00:00
			`In this sections, "resources" describes everything that is used by a command buffer: buffers, images, pipelines, dynamic state, etc.`

Update DESIGN.md 2015-11-01 10:05:38 +00:00			`The rule is that objects must not be written by the CPU nor destroyed while they are still used by the GPU. The Vulkan library doesn't automatically handle this for you, but this library should enforce it.`
Initial commit 2015-07-06 08:54:29 +00:00
Update DESIGN.md 2015-11-01 10:05:38 +00:00			`### Ownership by command buffers`
Initial commit 2015-07-06 08:54:29 +00:00
Update DESIGN.md 2015-11-01 10:05:38 +00:00			Command buffers need to somehow make sure that the resources they use are still valid as long as they exist. This could be handled with regular Rust lifetimes, but since Rust doesn't allow structs to borrow themselves my personal opinion is that a `CommandBuffer<'a>` struct would be too cumbersome. You would end up with something like this:
Initial commit 2015-07-06 08:54:29 +00:00
			```rust
			`pub struct GameResources {`
			`...`
			`}`

			`pub struct GameCommandBuffers<'a> {`
			`command_buffer1: vulkano::CommandBuffer<'a>,`
			`...`
			`}`

			`fn main() {`
			`let resources = GameResources::new();`
			`let command_buffers = GameCommandBuffers::new(&resources);`
			`}`
			```

Update DESIGN.md 2015-11-01 10:05:38 +00:00			It could also be handled with `Arc`s or `Rc`s. In my opinion the best way is to give the choice to the user through a trait.

			`Example:`

			```rust
			`impl CommandBufferBuilder {`
Update DESIGN.md 2015-11-01 10:18:55 +00:00			`pub fn add_buffer_copy_command<B>(&mut self, buffer: B) where B: SomeTrait<Buffer> {`
Update DESIGN.md 2015-11-01 10:05:38 +00:00			`...`
			`}`
			`}`
			```
Initial commit 2015-07-06 08:54:29 +00:00
			`### When to destroy?`

Update DESIGN.md 2015-11-01 10:05:38 +00:00			`Resources should not be destroyed as long as they are still in use by the GPU.`

Update DESIGN.md 2015-11-01 10:18:55 +00:00			`The trick is that the only way that a resource could be in use by the GPU is through a command buffer. Command buffers too must be not destroyed as long as they are still in use. Command buffers already ensure that the resources they use stay alive, therefore the only tricky part is to know when to destroy command buffers.`
Initial commit 2015-07-06 08:54:29 +00:00
Update DESIGN.md 2015-11-01 10:05:38 +00:00			`If all command buffers that use a specific resource are destroyed, then we are sure that this resource is not in use by the GPU and we can delete it. The destructors of buffers, textures, etc. are trivial.`
Initial commit 2015-07-06 08:54:29 +00:00
Update DESIGN.md 2015-11-01 10:05:38 +00:00			`When it comes to command buffers, in my opinion it should look like this:`

			```rust
			`pub fn submit_command_buffer<'a>(cmd: &'a CommandsBuffer) -> FenceGuard<'a> {`
			`...`
			`}`
			```
Initial commit 2015-07-06 08:54:29 +00:00
Update DESIGN.md 2015-11-01 10:05:38 +00:00			The `FenceGuard` ensures that the command buffer is alive. Destroying a `FenceGuard` blocks until the vulkan fence is fulfilled.
Initial commit 2015-07-06 08:54:29 +00:00
			`### Mutability`

			`Only relevant for buffers and images.`

Update DESIGN.md 2015-11-01 10:05:38 +00:00			`Since both a command buffer and the user have access to a borrow of the same resource, we have to ensure that a resource can't be read and written simultaneously (similarly to any other CPU object).`
Initial commit 2015-07-06 08:54:29 +00:00
Update DESIGN.md 2015-11-01 10:05:38 +00:00			This is going to be done with a wrapper object, just like `RefCell` or `Mutex`.

			`Example:`

			```rust
			`pub struct Buffer {`
			`...`
			`}`

			`impl Buffer {`
			`// always available`
			`pub fn modify(&mut self, ...) { ... }`
			`}`

			`pub struct GpuAccess<T> {`
			`content: T,`
			`in_use: Option<Fence>,`
			`}`

			`impl GpuAccess<T> {`
			`// waits for the fence if necessary`
			`pub fn lock(&self) -> &mut T { ... }`
			`pub fn try_lock(&self) -> Option<&mut T> { ... }`
			`}`
			```

			The command buffer writes the fence in the `GpuAccess` through a trait.
Initial commit 2015-07-06 08:54:29 +00:00
			`But should submitting a command buffer wait for fences in the resources they use? The answer would be no if all command buffers are submitted to the same queue, since you would be sure that the first command buffer is over when the second one starts. However we have multiple queues accessible to us (including the DMA and multiple GPUs). Mantle provides semaphore objects for this. It is unknown if Vulkan uses the same mechanism. This point remains to be seen.`

Update DESIGN.md 2015-11-01 10:18:55 +00:00			`### Fence or no fence`

			`If you submit several command buffers in a row to the same queue, then you only need to create a fence at the last submission.`

			`Creating a fence every time could become a big overhead. However you usually know too late whether you needed a fence or not at the previous submission. We can't predict the future, therefore the choice should be given to the user whether to create a fence or not.`

			`Consequently, submitting a command buffer would look like this:`

			```rust
			`pub fn submit_command_buffer_fence<'a>(cmd: &'a CommandsBuffer) -> FenceGuard<'a> {`
			`...`
			`}`

			`pub fn submit_command_buffer_nofence<'a>(cmd: &'a CommandsBuffer) -> NoFence<'a> {`
			`...`
			`}`
			```

			`NoFence` would be a linear type. Rust doesn't have linear types, but this can be emulated by panicking in the destructor.

			The only way to use a `NoFence` is to consume it by submitting another command buffer immediately after:

			```rust
			`impl<'a> NoFence<'a> {`
			`pub fn submit_after_fence(self, cmd: &'a CommandsBuffer) -> FenceGuard<'a> { ... }`
			`pub fn submit_after_nofence(self, cmd: &'a CommandsBuffer) -> NoFence<'a> { ... }`
			`}`
			```

			`If possible this could be made more convenient by merging the two methods and returning a trait implementation instead:`

			```rust
			`pub fn submit_command_buffer<'a, F>(cmd: &'a CommandsBuffer) -> F where F: GpuBlock<'a> { ... }`

			`impl<'a> NoFence<'a> {`
			`pub fn submit_after<F>(self, cmd: &'a CommandsBuffer) -> F where F: GpuBlock<'a> { ... }`
			`}`
			```

Initial commit 2015-07-06 08:54:29 +00:00			`## Resources state management`

			`With Mantle, all buffers and textures are in a certain state. This is likely going to be the same in Vulkan.`
			To do certain operations, a resource must be in a given state. For example copying data in RAM to a buffer requires the buffer to be in the `GR_MEMORY_STATE_DATA_TRANSFER` state.

			Changing the state of a resource requires using a command in a command buffer, and requires to know what the state of the resource at the time of the execution is. For example to read the same buffer from a shader, you must switch it from the `GR_MEMORY_STATE_DATA_TRANSFER` state to the `GR_MEMORY_STATE_GRAPHICS_SHADER_READ_WRITE` state. This is done by explicitely stating that the buffer currently is in the `GR_MEMORY_STATE_DATA_TRANSFER` state.

			`The problem is that we need to know what the state of a resource is at the time when the command buffer is used, and not at the time when the command buffer is created. Let's say that we build the command buffers A and B. A leaves a given resource in a certain state. B needs to know in which state that resource was left in. Requiring the user to state in which order the command buffers will be executed would be a burden.`

			`Instead to solve this problem, resources will have a "default state".`

			`- At initialization, a resource is switched to this default state.`
			`- Command buffers are free to switch the state of a resource, but must restore the resource to its default state at the end.`

			`## Pipelines and dynamic state`

			`There are five type of objects in Mantle that are "dynamic states": the rasterizer, the viewport/scissor, the blender, the depth-stencil, and the multisampling states. In addition to this, there is a pipeline object that holds the list of shaders and more states.`

			`One instance of each of these object types must be binded by command buffers before drawing.`

			The dynamic state objects are annoying for the programmer to manage. Therefore I think it is a good idea to have the user pass a Rust `enum` that describes the dynamic state, and look for an existing object in a `HashMap`. This hash map would hold `Weak` pointers.

			`Pipelines, however, would still be created manually.`

Minor work 2015-07-11 08:54:51 +00:00			`## Descriptor sets, samplers, memory views`

			`Descriptor sets are the Vulkan equivalent of OpenGL uniforms. The Mantle specs seem to imply that different layouts can lead to tradeoffs between GPU and CPU performances. The best solution is to have a default system that just gives a basic layout, but make it possible for the user to use a custom layout instead.`

			`Samplers should be handled internally by this library and never be destroyed. This simplifies a lot of things when it comes to synchronization.`

			`The memory layout of each buffer access should be passed by value when adding a draw command to a command buffer.`
Initial commit 2015-07-06 08:54:29 +00:00
			`To do`
Add some initial code draft 2015-07-06 09:13:19 +00:00
			`## Context loss`

			`To do`
Minor work 2015-07-11 08:54:51 +00:00
			`## Examples of what it should look like`

			`These examples are likely to change, but they should give a good overview:`

			`### Uploading a buffer`

			`Uploading to CPU-accessible memory, ie. slow write:`

			```rust
			#[derive(Copy, Clone, BufferLayout)] // `#[derive_BufferLayout]` is not possible until plugins are stable though
			`struct Data {`
			`position: (f32, f32, f32),`
			`tex_coords: (f32, f32),`
			`}`

			`let data: &[Data] = &[Data { ... }, Data { ... }, ...];`
			`let mut buffer = AccessBuffer::cpu_accessible_with_data(&device, &data);`

			`// let's modify a value of the sake of the example`
			`{`
			`let mut mapping = buffer.write(); // same API as RwLock`
			`mapping[1].position.0 = 1.0;`
			`}`
			```

			`Using the DMA to copy:`

			```rust
			`let local_buffer = AccessBuffer::pinned_memory_with_data(&device, &data);`
			`let remote_buffer = Buffer::empty(&device, std::mem::size_of_val(&data));`

			`let dma_queue = device.dma_queues().next().unwrap();`
			let command_buffer = CommandBufferBuilder::once(&device) // `once` optimizes for buffers that trigger once
			`.copy_buffer(&local_buffer, &remote_buffer)`
			`.build();`
			`dma_queue.submit(&command_buffer);`

			`assert!(local_buffer.try_write().is_none()); // buffer is in use, we can't access it because of safety`
			`assert!(local_buffer.try_read().is_some()); // however we can read its content`
			```

			`### Hello triangle`

Update DESIGN.md 2015-11-01 10:05:38 +00:00			`To do`