nordic-dev.net/wgpu - wgpu

mirror of https://github.com/gfx-rs/wgpu.git synced 2024-11-22 06:44:14 +00:00

Author	SHA1	Message	Date
Brad Werth	6cd387412f	Remove vertex_pulling_transfrom from PipelineCompilationOptions. This option was only evaluated for Metal backends, and now it's required there so the option is going away. It is still configurable for tests via the PipelineOptions struct, deserialized from .ron files. This also fixes some type problems with the unpack functions in writer.rs. Metal << operator extends operand to int-sized, which then has to be cast back down to the real size before as_type bit conversion. The math for the snorm values is corrected, in some cases using the metal unpack_snorm2x16_to_float function because we can't directly cast a bit-shifted ushort value to half.	2024-07-19 17:13:45 +02:00
Jim Blandy	aeb2067e81	[core] Make `poll(Wait)` not hang after bad command submission. Add `wgpu_core::device::Device::last_successful_submission_index`, which records the fence value that `Maintain::Wait` should actually wait for. See comments for details. Fixes #5969.	2024-07-17 16:11:04 -07:00
Teodor Tanasoaia	a3d2d31d3d	[test] allow WARP to run the zero-init workgroup memory test (#5968 ) I pinpointed this to `438d6394ef` (https://github.com/gfx-rs/wgpu/pull/3512). I'm not sure why I didn't remove this one in `30064ead9f` (https://github.com/gfx-rs/wgpu/pull/3515) as well, maybe I thought it was still failing due to early frees.	2024-07-16 12:30:53 -04:00
Teodor Tanasoaia	167f005c17	[tests] delete outdated comment on `DEVICE_DESTROY_THEN_MORE` test (#5967 ) This was fixed by `6e21f7a929`.	2024-07-16 12:30:35 -04:00
Jim Blandy	d02e2949b2	[core] Correctly check mipmap-filtering samplers against the layout. Ensure that samplers using non-`Nearest` mipmap filtering are considered "filtering samplers" when deciding bind group layout compatibility. Add tests for layout `NonFiltering` validation. Fixes #5948.	2024-07-15 10:03:49 +02:00
dependabot[bot]	586215ab2e	build(deps): bump crate-ci/typos from 1.22.9 to 1.23.1 (#5922 ) * build(deps): bump crate-ci/typos from 1.22.9 to 1.23.1 Bumps [crate-ci/typos](https://github.com/crate-ci/typos) from 1.22.9 to 1.23.1. - [Release notes](https://github.com/crate-ci/typos/releases) - [Changelog](https://github.com/crate-ci/typos/blob/master/CHANGELOG.md) - [Commits](https://github.com/crate-ci/typos/compare/v1.22.9...v1.23.1) --- updated-dependencies: - dependency-name: crate-ci/typos dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> * typo fixes --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Andreas Reich <r_andreas2@web.de>	2024-07-14 22:13:25 +02:00
Nicolas Silva	4c6318c0d2	Expose gpu allocation configuration options (#5875 ) * Expose gpu allocation configuration options This commit adds hints to control memory allocations strategies to the configuration options. These hints allow for automatic profiles such as optimizing for performance (the default, makes sense for a game), optimizing for memory usage (typically more useful for a web browser or UI library) and specifying settings manually. The details of gpu allocation are still in flux. The goal is to switch vulkan and metal to gpu_allocator which is currently used with d3d12. gpu_allocator will also likely receive more configuration options, in particular the ability to start with smaller memory block sizes and progressively grow the block size. So the manual settings already provision for this upcoming option. Another approach could be to wait and add the manual option after the dust settles. The reason for providing presets and defining values in the backends is that I am convinced that optimal fonigurations should take hardware capabilities into consideration. It's a deep rabbithole, though, so that will be an exercise for later. * changelog * Update CHANGELOG.md Co-authored-by: Andreas Reich <r_andreas2@web.de> * Add a comment about not entirely knowing what we are doing --------- Co-authored-by: Andreas Reich <r_andreas2@web.de>	2024-07-08 14:49:44 +02:00
Erich Gubler	f02ec0e4e8	test: fix `wgpu-test` building by itself (#5912 )	2024-07-04 23:31:37 -04:00
Erich Gubler	5c29ad548b	refactor: `vertex_index_common`: use `size_of_val` instead of magic number	2024-07-04 16:40:34 -06:00
Erich Gubler	764b15a556	refactor: `vertex_index_common`: elide `tests` alloc. w/ `Itertools::cartesian_product`	2024-07-04 16:40:34 -06:00
Erich Gubler	7600c61b72	refactor: `vertex_index_common`: use `strum` `enum` iter. Replace manual enumerations of various `enum`s with `derive`d ones via `strum::{EnumIter, IntoEnumIterator}`.	2024-07-04 16:40:34 -06:00
teoxoy	7910fd8059	[wgpu-hal] require a `Surface` to be passed to `Instance.enumerate_adapters()` on WebGL2 Also makes wgpu's `enumerate_adapters()` native only.	2024-07-04 10:17:07 +02:00
Imbris	3a6814770a	Allow unconsumed inputs in fragment shaders (#5531 ) * Allow unconsumed inputs in fragment shaders by removing them from vertex outputs when generating HLSL. Fixes https://github.com/gfx-rs/wgpu/issues/3748 * Add naga:🔙:hlsl::FragmentEntryPoint for providing information about the fragment entry point when generating vertex entry points via naga:🔙:hlsl::Writer::write. Vertex outputs not consumed by the fragment entry point are omitted in the final output struct. * Add naga snapshot test for this new feature, * Remove Features::SHADER_UNUSED_VERTEX_OUTPUT, StageError::InputNotConsumed, and associated validation logic. * Make wgpu dx12 backend pass fragment shader info when generating vertex HLSL. * Add wgpu regression test for allowing unconsumed inputs. * Address review * Add note that nesting structs for the inter-stage interface can't happen. * Remove new TODO notes (some addressed and some transferred to an issue https://github.com/gfx-rs/wgpu/issues/5577) * Changed issue that regression test refers to 3748 -> 5553 * Add debug_assert that binding.is_some() in hlsl writer * Fix typos caught in CI Also, fix compiling snapshot test when hlsl-out feature is not enabled.	2024-07-04 09:08:46 +02:00
Andreas Reich	0a76c0fa84	Renderpass take resource ownership (#5884 ) * share timestamp write struct * Make name of set_push_constants methods consistently plural * remove lifetime bounds of resources passed into render pass * first render pass resource ownership test * introduce dynrenderpass & immediately create ArcCommands and take ownership of resources passed on pass creation * Use of dynrenderpass in deno * Separate active occlusion & pipeline statitics query * resolve render/compute command is now behind `replay` feature * add vertex & index buffer to ownership test * test for pipeline statistics query * add occlusion query set to pass resource test * add tests for resource ownership of render pass query timestamps * RenderPass can now be made 'static just like ComputePass. Add respective test * Extend encoder_operations_fail_while_pass_alive test to also check encoder locking errors with render passes * improve changelog entry on lifetime bounds	2024-07-01 18:36:24 +02:00
Jim Blandy	b9b7050bb9	Give `wgpu_test::compute_pass_ownership` buffers unique labels. Give each buffer in the `wgpu_test::compute_pass_ownership::compute_pass_resource_ownership` test a unique label, for easier debugging.	2024-06-13 12:25:05 +02:00
Jim Blandy	93c6fbe8ba	Skip `compute_pass_resource_ownership` on GL/AMD Radeon Pro WX 3200. (#5801 ) Skip `wgpu_test::compute_pass_ownership::compute_pass_resource_ownership` on the GL backend on AMD Radeon Pro WX 3200, to avoid the kernel crash described in #5800.	2024-06-12 21:27:34 +00:00
Atlas Dostal	abba12ae4e	Add support for 64 bit integer atomic operations in shaders. Add the following flags to `wgpu_types::Features`: - `SHADER_INT64_ATOMIC_ALL_OPS` enables all atomic operations on `atomic<i64>` and `atomic<u64>` values. - `SHADER_INT64_ATOMIC_MIN_MAX` is a subset of the above, enabling only `AtomicFunction::Min` and `AtomicFunction::Max` operations on `atomic<i64>` and `atomic<u64>` values in the `Storage` address space. These are the only 64-bit atomic operations available on Metal as of 3.1. Add corresponding flags to `naga::valid::Capabilities`. These are supported by the WGSL front end, and all Naga backends. Platform support: - On Direct3d 12, in `D3D12_FEATURE_DATA_D3D12_OPTIONS9`, if `AtomicInt64OnTypedResourceSupported` and `AtomicInt64OnGroupSharedSupported` are both available, then both wgpu features described above are available. - On Metal, `SHADER_INT64_ATOMIC_MIN_MAX` is available on Apple9 hardware, and on hardware that advertises both Apple8 and Mac2 support. This also requires Metal Shading Language 2.4 or later. Metal does not yet support the more general `SHADER_INT64_ATOMIC_ALL_OPS`. - On Vulkan, if the `VK_KHR_shader_atomic_int64` extension is available with both the `shader_buffer_int64_atomics` and `shader_shared_int64_atomics` features, then both wgpu features described above are available.	2024-06-08 18:36:26 -07:00
Andreas Reich	9a27ba53ca	Fix QuerySet ownership of ComputePass (#5671 ) * add new tests for checking on query set lifetime * Fix ownership management of query sets on compute passes for write_timestamp, timestamp_writes (on desc) and pipeline statistic queries * changelog entry	2024-06-04 07:47:27 +00:00
Andreas Reich	aa2821bff6	Reintroduce computepass->encoder lifetime constraint and make it opt-out via `wgpu::ComputePass::forget_lifetime` (#5768 ) * Reintroduce computepass->encoder lifetime constraint and make it opt-out via `wgpu::ComputePass::make_static` * improve comments based on review feedback * use the same lifetime name for all usages of `ComputePass<'encoder>` * comment improvement that I missed earlier * more review based comment improvements * use suggested zero-overhead lifetime removal * rename make_static to forge_lifetime * missed comma	2024-06-03 20:04:12 +02:00
Brad Werth	9b7a965667	Add an experimental vertex pulling flag to Metal pipelines. This proves a flag in msl::PipelineOptions that attempts to write all Metal vertex entry points to use a vertex pulling technique. It does this by: 1) Forcing the _buffer_sizes structure to be generated for all vertex entry points. The structure has additional buffer_size members that contain the byte sizes of the vertex buffers. 2) Adding new args to vertex entry points for the vertex id and/or the instance id and for the bound buffers. If there is an existing @builtin(vertex_index) or @builtin(instance_index) param, then no duplicate arg is created. 3) Adding code at the beginning of the function for vertex entry points to compare the vertex id or instance id against the lengths of all the bound buffers, and force an early-exit if the bounds are violated. 4) Extracting the raw bytes from the vertex buffer(s) and unpacking those bytes into the bound attributes with the expected types. 5) Replacing the varyings input and instead using the unpacked attributes to fill any structs-as-args that are rebuilt in the entry point. A new naga test is added which exercises this flag and demonstrates the effect of the transform. The msl generated by this test passes validation. Eventually this transformation will be the default, always-on behavior for Metal pipelines, though the flag may remain so that naga translation tests can be run with and without the tranformation.	2024-05-30 13:08:59 +02:00
Andreas Reich	588950110a	Remove lifetime dependency of `ComputePass` to its parent command encoder (#5620 ) * lift encoder->computepass lifetime constraint and add now failing test * compute passes now take an arc to their parent command encoder, thus removing compile time dependency to it * Command encoder goes now into locked state while compute pass is open * changelog entry * share most of the code between get_encoder and lock_encoder	2024-05-29 22:43:24 +00:00
Erich Gubler	ccccffda19	test: migrate tests for specific validation err. msgs.	2024-05-21 17:16:06 -04:00
Erich Gubler	3b1e8da1b1	style: use `concat!(…)`'d string for `DIFFERENT_BGL_ORDER_BW_SHADER_AND_API`	2024-05-21 17:16:06 -04:00
Erich Gubler	94dba0b026	test: add expected msg. arg. to `wgpu_test::fail{,_if}`	2024-05-21 17:16:06 -04:00
Erich Gubler	18b758e388	test: ensure render pipelines have at least 1 target	2024-05-17 17:45:03 -04:00
Daniel McNab	4902e470ce	Pipeline cache API and implementation for Vulkan (#5319 ) Co-authored-by: Connor Fitzgerald <connorwadefitzgerald@gmail.com>	2024-05-16 13:52:56 +00:00
Andreas Reich	77a83fb0dd	Remove lifetime constraints from `wgpu::ComputePass` methods (#5570 ) * basic test setup * remove lifetime and drop resources on test - test fails now just as expected * compute pass recording is now hub dependent (needs gfx_select) * compute pass recording now bumps reference count of uses resources directly on recording TODO: * bind groups don't work because the Binder gets an id only * wgpu level error handling is missing * simplify compute pass state flush, compute pass execution no longer needs to lock bind_group storage * wgpu sided error handling * make ComputePass hal dependent, removing command cast hack. Introduce DynComputePass on wgpu side * remove stray repr(C) * changelog entry * fix deno issues -> move DynComputePass into wgc * split out resources setup from test	2024-05-14 20:05:17 +00:00
Vladislav	00456cfb37	Add parsing support for un/pack4xI/U8	2024-05-14 17:57:04 +02:00
Andreas Reich	452cf24fa1	Remove unnecessary Cargo.toml dependencies via `cargo machete` (#5692 )	2024-05-12 19:05:00 -04:00
Jim Blandy	e5201a7267	When a `#[gpu_test]` test fails, print its source location. Use `std::panic::Location` to record the source location of each `#[gpu_test]` test, and if it fails, include that in the error output. This is not essential, but it should make working with failures a bit more comfortable.	2024-05-01 17:08:22 -07:00
Connor Fitzgerald	90e7060d0d	Fix Failure Case for MacOS 14.3	2024-04-29 12:01:05 +02:00
stefnotch	f874ed061c	Add get_compilation_info (#5410 ) * Add get_compilation_info API * Rename glsl ParseError to ParseErrors * Document ParseError label order * Update line_position to count UTF-8 bytes	2024-04-29 11:35:36 +02:00
Connor Fitzgerald	c1291bd131	Fix Merge Issues Between #5301 and #5508 (#5549 )	2024-04-17 16:31:20 -04:00
Daniel McNab	965b00c06b	Allow configuring whether workgroup memory is zero initialised (#5508 )	2024-04-17 15:50:31 -04:00
Alexander Meißner	ea77d5674d	Subgroup Operations (#5301 ) Co-authored-by: Jacob Hughes <j@distanthills.org> Co-authored-by: Connor Fitzgerald <connorwadefitzgerald@gmail.com> Co-authored-by: atlas dostal <rodol@rivalrebels.com>	2024-04-17 15:25:52 -04:00
teoxoy	3bda381812	add pipeline constants plumbing	2024-04-05 18:07:41 +02:00
Erich Gubler	0c5bebca51	fix: unlock guard for `release_gpu_resources` call in `Device::maintain`	2024-04-02 16:12:28 -04:00
Erich Gubler	b842265528	test: don't depend on BG{,L} entry order in HAL	2024-04-01 17:51:36 -04:00
Andreas Reich	ed7d9de439	Fix indexed drawing with RenderBundle (#5441 ) * enhance vertex_indices test to also run with render bundles * fix render bundle index limit check * changelog entry	2024-03-30 10:19:17 +01:00
Brad Werth	00e0e72596	Invoke a DeviceLostClosure immediately if set on an invalid device. (#5358 ) Invoke a DeviceLostClosure immediately if set on an invalid device. To make the device invalid, this defines an explicit, test-only method make_invalid. It also modifies calls that expect to always retrieve a valid device. Co-authored-by: Erich Gubler <erichdongubler@gmail.com>	2024-03-21 13:33:25 -04:00
vero	4e6f873da5	Add shader I64 and U64 support (#5154 ) Co-authored-by: Connor Fitzgerald <connorwadefitzgerald@gmail.com>	2024-03-12 12:34:06 +01:00
Eshed Schacham	8e15707631	gles: fix crash when holding multiple devices on wayland/surfaceless. (#5351 )	2024-03-10 21:12:51 +01:00
Nathan Adams	f78e2f7800	GL actually supports DEPTH32FLOAT_STENCIL8 (#5370 )	2024-03-10 20:56:58 +01:00
dependabot[bot]	744454b9e2	Bump Many Dependencies and MSRV (#5241 ) Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Connor Fitzgerald <connorwadefitzgerald@gmail.com>	2024-02-27 14:43:05 -05:00
Nicolas Silva	c77b4d3f56	Use a unique tracker index per resource instead of the ID in trackers (#5244 ) Co-authored-by: Connor Fitzgerald <connorwadefitzgerald@gmail.com>	2024-02-26 20:25:12 +00:00
Erich Gubler	d6465702b6	fix: `command_encoder_clear_buffer`: err. on `offset + size > u64::MAX` Rust would have made this operation either an overflow in release mode, or a panic in debug mode. Neither seem appropriate for this context, where I suspect an error should be returned instead. Web browsers, for instance, shouldn't crash simply because of an issue of this nature. Users may, quite reasonably, have bad arguments to this in early stages of development!	2024-02-26 09:32:26 -05:00
Erich Gubler	9747a0ed23	fix: always check buffer clear `offset` for OOB Fuzz testing in Firefox encountered crashes for calls of `Global::command_encoder_clear_buffer` where: * `offset` is greater than `buffer.size`, but… * `size` is `None`. Oops! We should _always_ check this (i.e., even when `size` is `None`), because we have no guarantee that `offset` and the fallback value of `size` is in bounds. 😅 So, we change validation here to unconditionally compute `size` and run checks we previously gated behind `if let Some(size) = size { … }`. For convenience, the spec. link for this method: <https://gpuweb.github.io/gpuweb/#dom-gpucommandencoder-clearbuffer>	2024-02-26 09:32:26 -05:00
Connor Fitzgerald	910fb7c341	Remove GPU based validation in Tests (#5271 ) * Remove GPU based validation in Tests * Lol Typo	2024-02-19 17:04:03 -05:00
Connor Fitzgerald	df386dae02	Add vec3-in-array-test (#5264 )	2024-02-18 12:46:29 -05:00
Erich Gubler	2b487724f2	fix(dx12): discard cmd. enc. buf. on drop	2024-02-15 10:45:32 -05:00

1 2 3

128 Commits