diff options
author | Eric Engestrom <eric@engestrom.ch> | 2023-12-02 11:51:55 +0000 |
---|---|---|
committer | Eric Engestrom <eric@engestrom.ch> | 2023-12-03 07:52:19 +0000 |
commit | 519b8ee74e2f558a3c39150c8f01f5eeb64fdb3a (patch) | |
tree | 9c824df46c2244bf46702546ad851e55887c3777 /docs/relnotes | |
parent | 1fbdd37d4c1133ced5eb9812daa1fff04cbf5daa (diff) | |
download | mesa-519b8ee74e2f558a3c39150c8f01f5eeb64fdb3a.tar.gz mesa-519b8ee74e2f558a3c39150c8f01f5eeb64fdb3a.tar.bz2 mesa-519b8ee74e2f558a3c39150c8f01f5eeb64fdb3a.zip |
docs: add release notes for 23.3.0
Diffstat (limited to 'docs/relnotes')
-rw-r--r-- | docs/relnotes/23.3.0.rst | 6334 | ||||
-rw-r--r-- | docs/relnotes/new_features.txt | 21 |
2 files changed, 6334 insertions, 21 deletions
diff --git a/docs/relnotes/23.3.0.rst b/docs/relnotes/23.3.0.rst new file mode 100644 index 00000000000..cca59fdae70 --- /dev/null +++ b/docs/relnotes/23.3.0.rst @@ -0,0 +1,6334 @@ +Mesa 23.3.0 Release Notes / 2023-11-29 +====================================== + +Mesa 23.3.0 is a new development release. People who are concerned +with stability and reliability should stick with a previous release or +wait for Mesa 23.3.1. + +Mesa 23.3.0 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 23.3.0 implements the Vulkan 1.3 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + TBD. + + +New drivers +----------- +- NVK: A Vulkan driver for Nvidia hardware + +New features +------------ +- VK_EXT_pipeline_robustness on ANV +- VK_KHR_maintenance5 on RADV +- OpenGL ES 3.1 on Asahi +- GL_ARB_compute_shader on Asahi +- GL_ARB_shader_atomic_counters on Asahi +- GL_ARB_shader_image_load_store on Asahi +- GL_ARB_shader_image_size on Asahi +- GL_ARB_shader_storage_buffer_object on Asahi +- GL_ARB_sample_shading on Asahi +- GL_OES_sample_variables on Asahi +- GL_OES_shader_multisample_interpolation on Asahi +- GL_OES_gpu_shader5 on Asahi +- EGL_ANDROID_blob_cache works when disk caching is disabled +- VK_KHR_cooperative_matrix on RADV/GFX11+ + + +Bug fixes +--------- + +- crash in si_update_tess_io_layout_state during _mesa_ReadPixels (radeonsi_dri, mesa 23.2.1) +- mesa: vertex attrib regression +- [RADV] War Thunder has some grass flickering. +- radv: satisfactory broken shader +- RADV problem with R7 M440 in some games +- gpu driver crashes when opening ingame map playing dead space 2023 +- [anv] Valheim water misrendering +- EGL/v3d: EGL applications under a X compositor doesn't work +- RADV: trunc_coord breaks ambient occlusion in Dirt Rally and other games +- radv: Mass Effect Legendary Edition: a line going across the screen is visible in some areas with Ambient Occlusion enabled +- anv: DIRT5 gfx11_generated_draws_spv_source triggers "assert(!copy_value_is_divergent(src) || copy_value_is_divergent(dest));" +- panfrost: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12 buffers +- [RADV][TONGA] - BeamNG.drive (284160) - Artifacts are present when looking at the skybox. +- LEGO Star Wars: The Skywalker Saga graphical glitches (DXVK) on R9 380 +- [radv] Crypt not rendering properly +- Leaks of DescriptorSet debug names +- [Tracing flake] Missing geometry in trace\@freedreno-a630\@freedoom\@freedoom-phase2-gl-high.trace +- Unreal Engine 5.2 virtual shadow maps have glitchy/lazy tile updates +- RADV: Visual glitches in Unreal Engine 5.2.1 when using material with anisotropy and light channel 2 +- radv: Regression with UE5 test +- SIGSEGV with MESA_VK_TRACE=rgp and compute only queue +- [ANV] Corruptions in Battlefield 4 +- anv regression w/ commit e488773b29d97 ("anv: Fast clear depth/stencil surface in vkCmdClearAttachments") +- ir3: dEQP-GLES31.functional.synchronization.inter_invocation.image_atomic_read_write crash on a6xx gen4 +- Zink + Venus: driver can't handle INVALID<->LINEAR! +- Anv: Particles have black square artifacts on Counter Strike 2 on Skylake +- Lords of the Fallen 2023 Red Eye mode crashing game and desktop +- [radeonsi] [vulkan] [23.3-rc1 regression] Video output corrupted in QMplay2 with Vulkan renderer +- [BISECTED] ac/radeon commit somehow breaks nv12 surface from HEVC decode +- Parsec displays completely green screen with hardware decoder selected while using Mesa 23.3 and Mesa 24 +- H264 to H264 transcode output corruption with gst-vaapi +- opencl-jpeg-encoder does not work with nouveau/rusticl, works with nouveau/clover +- [R600] X-plane 11 demo (Linux Native) crashes upon launch on HD5870 and HD6970 +- Ubuntu 23.10 build error with rusticl_opencl_bindings.rs +- Rusticl fails to build +- ANV not handling VkMutableDescriptorTypeCreateInfoEXT::pMutableDescriptorTypeLists[i] being out of range +- tu: Wolfenstein: The New Order misrenders on a740 +- DRI_PRIME fails with ACO only radeonsi +- nir_to_tgsi: Incorrect handling of indirect array access +- ANV gen9 32 bit vulkan asserts on many cts tests +- GPU hang observed while launching 3DMark Wildlife Unlimited on MTL +- ac/gpu_info: Query maximum submitted IBs from the kernel +- RADV: regression in 23.2.1 causing GPU hang with RDNA1 in various UE5 games +- GPU page faults reported while playing Talos Principle 2 (demo) +- No CCS_E scanout on tgl+ with ANV +- anv: Modifier tests assert-fail on TGL+ +- ci: zink-tu jobs no longer included in manual pipelines +- [ANV][A770] GravityMark segfaults and buffer allocation errors +- etnaviv: gc2000 gles2 regression +- ci_run_n_monitor: pipeline finding unreliable +- nvk: Implement VK_EXT_dynamic_rendering_unused_attachments +- anv: jsl timeline semaphores flaky +- anv: OOB access in vkDestroyDevice? +- nvk: Implement VK_EXT_primitive_topology_list_restart +- nvk: Implement VK_EXT_image_sliced_view_of_3d +- nvk: Implement VK_KHR_workgroup_memory_explicit_layout +- util/macros: BITFIELD64_RANGE raises an error with mesa-clang if we try to set last bit +- r300/r400 regression; can't compile \`if/then` in shaders +- iris: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12/P010 buffers +- nvk: Implement VK_EXT_depth_bias_control +- ICL/zink: gpu hang on 'piglit.object namespace pollution.framebuffer with gldrawpixels' +- [R600] Wolfenstein: The New Order text glitch on menu +- need extension to request image/texture not use data dependent compression +- rusticl: segfault in clCreateKernel on AMD Instinct MI100 +- !25587 broke xserver +- GPU Hang in Deep Rock Galactic on DG2 +- intel: Wrong length for 3DSTATE_3D_MODE on gfx125 +- [radeonsi] Wargame: Red Dragon /w OpenGL stopped working with ACO +- traces job reference images missing again sometimes +- Vulkan Texture/Polygon Glitches in Games +- freedreno: dmabuf modify query ignores format +- virgl: removing PIPE_CAP_CLEAR_TEXTURE completely breaks virglrenderer +- Turnip build error on termux +- failiure in amd llvm helper +- failiure in amd llvm helper +- radv_amdgpu_cs_submit: Assertion \`chunk_data[request->number_of_ibs - 1].ib_data.ip_type == request->ip_type' failed. +- hasvk: subgroups regression +- radeonsi: broken hardware decoding (vaapi/vulkan) on RDNA2 gpu (bisected) +- aco: SwizzleInvocationsMaskedAMD behavior is not correct for reads from inactive lanes +- anv: dEQP-VK.ssbo.phys.layout.random.16bit.scalar.13 slow +- [RDNA3] CS:GO - excessive power consumption and lower performance in Vulkan while MSAA is set to 4x or 8x +- [ICL] piglit.spec.arb_gl_spirv.execution.ssbo.unsized-array regression +- radv: Counter Strike 2 has multiple bugs while rendering smoke grenade effect +- Doom Eternal freezing on NAVI31 with current git +- iris CTS blend test fail with MSAA config on DG2 +- anv: 32bit mesa asserts +- RADV: Randomly dissapearing objects in Starfield with RX 5xx and Vega graphics +- anv: missing barrier handling on video engines +- radv: Star Wars The Old Republic hang when DCC is enabled +- radv: Resident Evil 6 hangs 7900XTX GPU when DCC is enabled if in Options go to Display settings +- radv: Resident Evil 6 Benchmark Tool hangs 7900 XTX GPU when DCC is enabled immediately after splash screen +- ANV: fp64 shader leaked +- v3d: noop drm-shim raises some warnings +- freedreno: crashdec/etc chip_id support +- intel: compute dispatches with variable workgroup size have ralloc_asprintf CPU overhead +- ci build issues with builtin types +- freedreno: running angle perf traces with GALLIUM_THREAD=0 crashes +- RadeonSI: glClear() causes clear texture for some frames on RX580 +- radeonsi: corruption when seeking video decoded with vaapi in mpv +- Zink/HasVK regression bisected to "gallium: move vertex stride to CSO" +- [radv] [Path Of Exile] - one setting in the workaround file breaks shadows/lighting rendering. Other workaround settings seems obsolete. +- radv: images don't always have extents in RGP +- shader_test causing a crash in compiler +- D3D12: Video decoding requirements are too restrictive. ID3D12VideoDevice3 should not be required. +- Crash in st_ReadPixels +- [regression] intel build issue on i386 +- [ANV] [DG2/A770] The Spirit and The Mouse, miscellaneous issues with Mesa Git +- zink on hasvk regression: Assertion \`(dyn)->vi_binding_strides[first_binding + i] == (strides[i])' failed. +- Penumbra: Overture hangs on new game loading screen +- [r300, RV516] Some deqp-gles2\@performance\@shader\@control_statement vertex tests cause hard lockup & reboot in mesa 22.3.1 (regression over 22.1.7) on a Radeon X1550 +- v3dv: Add a feature that implicitly copies the linear image to the tiled image prior to sampling from it +- radv: Regression from 266b2cfe5bf3feda16747c50c1638fb5a0426958 +- h264 encoding picture showed randomly repeated frames. +- Mesa CI: NAVI10 hangs when running VKCTS on Linux 6.1 +- zink: no uniform buffer objects support for v3dv? +- v3dv: Request for VkImageDrmFormatModifierExplicitCreateInfoEXT::pPlaneLayouts support +- [ANV] [DG2/A770] The Spirit and The Mouse, occasional flickering geometry +- [Google][Rex][anv] GLES dEQP test fails in anv when run via ANGLE-on-Venus on ChromeOS ARCVM. +- VAAPI on VCN: bad stream may crash whole gfx system +- Crash after GPU reset +- Bifrost PanVK should not be in CI +- [Intel][Vulkan][Gen12] vkCmdCopyImage() generates garbage data when the destination texture is bound to a piece of used device memory +- mesa: new glcts fails +- tu: GPL support is broken +- lavapipe: ycbcr regression +- aco: Assertion when compiling CP2077 shader +- anv: flakiness on tgl+ with samplemask handling +- [RADV] Dead by Daylight memory leak (shader-related?) on 23.1.6 +- r300: optionally convert MULs into output modifier for the following MUL or DOT instructions +- r300: better 1-x presubtract pattern matching +- gpu hang on DG2 when running KHR-GLES31.core.texture_cube_map_array.image_op_tess* +- KHR-GLES31.core.texture_cube_map_array.image_op_tessellation_evaluation_sh fail on GFX12+ +- wsi: deadlocks when DISPLAY is changed +- hasvk: Incompatible with minigbm/gralloc4 on Android +- VAAPI: AMDGPU crash on RX 6900 XT on corrupted video +- lavapipe/llvmpipe: shader unregister crash +- [ANV] [DG2/A380] Corruption in Borderlands 3 +- blorp regression on dg2 +- decouple -Dshader-cache= from EGL_ANDROID_blob_cache +- radv: commit 81641b01555faa4dd1dfc7de2513ad8d63e77ab7 leaded to artifacts in Quake II RTX +- [radv] Colors are distorted in Cyberpunk 2077 with ray tracing enabled +- Forza Horizon 5 stuttering since mesa 23.1.4 / 9b008673 revert as a FIX +- ubsan + gtest build fails +- glCopyTexSubImage2D is very slow on Intel +- NVE4 (GeForce 710) fails to get vdpau in mesa git +- [RADV] red and pink tinted shadows in Overwatch 2 on 7900 XTX +- nouveau prevents hardware acceleration with Chromium (Wayland) +- Corrupt text rendering in Blender +- DRI2 gallium frontend is using bad format type +- regression - MR 23089 - Hellblade RT crashing +- Incorrect vlVaCreateBuffer/vlVaMapBuffer behavior for buffer type VAEncCodedBufferType in Gallium +- Issue with clang-format +- Follow-up from "Draft: intel: Disable color fast-clears for blorp_copy" +- nightly VA-API build: new timeout +- r600: retire the SB optimizer +- ci: do not download perfetto on-fly in build jobs +- Shared Memory Leak With Qt OpenGL Applications +- OpenGL, SIGSEGV when program pipeline objects has separated vertex shader progam and separated fragment shader progam with in/out +- vaDeriveImage returns VA_STATUS_ERROR_OPERATION_FAILED +- 975a8ecc881873744d851ab0ef45ad7698eaa0ef "frontends/va: use resources instead of views" cause radeonsi can't play video. +- zink: reduce pipeline hash size +- Rusticl,radeonsi: ac_rtld error(2): too much LDS +- aco, radv Rage 2 menu corruption - bisected +- radv, aco: World War Z character texture regression on 7900xtx +- android: De-stage drm_gralloc support from mesa3d +- Cyberpunk screen goes black at game launch on integrated Gfx +- lavapipe/llvmpipe: regressions since descriptor rewrite +- intel: State cache invalidation after BLORP binding table setup ought to be unnecessary on ICL. +- ci: HW job logs have spam at the end +- kernel crash seen on AMD Raven device +- crocus: regression crashing in doubles/ubo tests +- turnip: object management CTS crashes +- a618: multiple assertions with different kernel config on u_vector_add +- [anv] Death Stranding crashes +- Can no longer build Clover without llvmspirvlib +- [radeonsi][vaapi] segfault in vl_video_buffer_sampler_view_components() when using vaapisink receiving I420 format +- Baldurs Gate 3 (DX11) - Graphical corruption on RDNA3 (ACO regression) +- [AMDGPU] Compiling large Blender Eevee shader node trees is unusably slow +- Building llvmpipe with LP_USE_TEXTURE_CACHE set fails since 23.2.0-rc1: error C2039: dynamic_state is not member of lp_build_sampler_soa in lp_tex_sample.c +- r300: calculate some cycles estimate for shader-db +- intel: Deathloop and other DX12 games fail assert(validated) with invalid SEL instruction +- GTF-GL46.gtf21.GL.build.CorrectFull_vert regressed on intel platforms +- error message when encoding via VAAPI AMD +- gpu hangs on dg2 with mesh shading enabled on vkcts +- radeonsi: Deadlock when creating a new GL context in parallel with linking a shader on another GL context +- robustness2 raygen tests intermittently fail in Intel Mesa CI +- ci/ci_run_n_monitor.py: KeyError: 'clang-format' +- glthread: huge performance regression +- DirectX games do not launch on Intel HD Graphics 4000 (IVB GT2) [bisected] +- rusticl: fails to build for iris + radeonsi + + +Changes +------- + +Adam Jackson (3): + +- egl: Implement EGL_EXT_explicit_device +- mesa: Implement and advertise GL_MESA_sampler_objects +- docs: Mention 'meson devenv' in the pre-install test instructions + +Aditya Swarup (6): + +- isl: enable Tile64 for 3D images +- intel/isl: Unittest for linear to Ytile conversion +- intel/isl: Convert linear texture to Tile4 format +- intel/isl: Convert Tile4 texture to linear format +- intel/isl: Linear to Tile-4 conversion unittest +- Revert "iris: Disable tiled memcpy for Tile4" + +Alba Mendez (1): + +- meson: support installation tags + +Alejandro Piñeiro (61): + +- v3dv: re-enable sync_fd import/export on the simulator +- broadcom(cle,clif,common,simulator): add 7.1 version on the list of versions to build +- broadcom/cle: update the packet definitions for new generation v71 +- broadcom/common: add some common v71 helpers +- broadcom/qpu: add comments on waddr not used on V3D 7.x +- broadcom/qpu: set V3D 7.x names for some waddr aliasing +- broadcom/compiler: rename small_imm to small_imm_b +- broadcom/compiler: add small_imm a/c/d on v3d_qpu_sig +- broadcom/qpu: add v71 signal map +- broadcom/qpu: define v3d_qpu_input, use on v3d_qpu_alu_instr +- broadcom/qpu: add raddr on v3d_qpu_input +- broadcom/qpu: defining shift/mask for raddr_c/d +- broadcom/commmon: add has_accumulators field on v3d_device_info +- broadcom/qpu: add qpu_writes_rf0_implicitly helper +- broadcom/qpu: add pack/unpack support for v71 +- broadcom/compiler: phys index depends on hw version +- broadcom/compiler: don't favor/select accum registers for hw not supporting it +- broadcom/vir: implement is_no_op_mov for v71 +- broadcom/compiler: update vir_to_qpu::set_src for v71 +- broadcom/qpu_schedule: add process_raddr_deps +- broadcom/qpu: update disasm_raddr for v71 +- broadcom/qpu: return false on qpu_writes_accumulatorXX helpers for v71 +- broadcom/compiler: add support for varyings on nir to vir generation for v71 +- broadcom/compiler: payload_w is loaded on rf3 for v71 +- broadcom/qpu_schedule: update write deps for v71 +- broadcom/compiler: update register classes to not include accumulators on v71 +- broadcom/qpu: implement switch rules for fmin/fmax fadd/faddnf for v71 +- broadcom/compiler: update one TMUWT restriction for v71 +- broadcom/compiler: update ldunif/ldvary comment for v71 +- broadcom/compiler: update payload registers handling when computing live intervals +- broadcom/qpu: new packing/conversion v71 instructions +- v3dv/meson: add v71 hw generation +- v3dv: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71 +- v3dv/cmd_buffer: emit TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1 for v71 +- v3dvx/cmd_buffer: emit CLEAR_RENDER_TARGETS for v71 +- v3dv/cmd_buffer: emit CLIPPER_XY_SCALING for v71 +- v3dv/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71 +- v3dv/cmd_buffer: just don't fill up early-z fields for CFG_BITS for v71 +- v3dv: default vertex attribute values are gen dependant +- v3dv/pipeline: default vertex attributes values are not needed for v71 +- v3dv/pipeline: handle GL_SHADER_STATE_RECORD changed size on v71 +- v3dv: no specific separate_segments flag for V3D 7.1 +- v3dv: add support for TFU jobs in v71 +- v3d: add v71 hw generation +- v3d: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71 +- v3d: TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1 +- v3d: emit CLEAR_RENDER_TARGETS for v71 +- v3d: just don't fill up early-z fields for CFG_BITS for v71 +- v3d: emit CLIPPER_XY_SCALING for v71 +- v3d: no specific separate_segments flag for V3D 7.1 +- v3d: default vertex attributes values are not needed for v71 +- v3d/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71 +- v3d: handle new texture state transfer functions in v71 +- v3d: handle new TEXTURE_SHADER_STATE v71 YCbCr fields +- v3d: setup render pass color clears for any format bpp in v71 +- v3d: GFX-1461 does not affect V3D 7.x +- v3d: don't convert floating point border colors in v71 +- v3d: handle Z clipping in v71 +- v3d: add support for TFU blit in v71 +- v3dv: implement depthBounds support for v71 +- doc/features: update after last v3d changes + +Alex Denes (1): + +- virgl: link VA driver with build-id + +Alexander Orzechowski (1): + +- radeonsi: Set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET for auxiliary contexts + +Alyssa Rosenzweig (431): + +- zink: Switch to register intrinsics +- gallium/trace: Collect enums from multiple files +- gallium,util: Move blend enums to util/ +- gallium,util: Move util_blend_dst_alpha_to_one +- util/blend: Add helpers for normalizing inverts +- vulkan: Add helpers for blend enum translation +- lvp: Use common blend/logicop translation +- nir/lower_blend: Use util enums +- panfrost: Convert to PIPE_BLEND enums internally +- gallium: Remove pipe->compiler BLEND enum translation +- compiler: Remove blend enums duplicating util +- nir/legacy: Fix fneg(load_reg) case +- nir/legacy: Fix handling of fsat(fabs) +- ntt: Switch to new-style registers and modifiers +- ir3: Convert to register intrinsics +- nir: Add fence_{pbe,mem}_to_tex(_pixel)_agx intrinsics +- nir: Devendor load_sample_mask +- nir: Promote tess_coord_r600 to tess_coord_xy +- nir: Add nir_lower_tess_coord_z pass +- r600: Use nir_lower_tess_coord_xy +- ir3: Use nir_lower_tess_coord_z +- nir: Initialize workgroup_size in builder_init_simple_shader +- v3dv: Rely on nir_builder setting workgroup size +- radv: Rely on workgroup_size initialization +- panfrost: Fix transform feedback on v9 +- r600/sfn: Remove nir_register unit tests +- panfrost: Lower vertex_id for XFB +- panfrost: Fix transform feedback on v9 harder +- asahi: Augment fake drm_asahi_params_global +- asahi: Use nir_builder_at more +- asahi: Remove unused #define +- asahi: Refactor PBE upload routine +- asahi: Extract shader_initialize helper +- asahi: Serialize NIR in memory +- asahi: Identify background/EOT counts +- asahi,agx: Set coherency bit for clustered targets +- ail: Page-align layers for writable images +- asahi: Mark writeable images as such +- asahi: Reallocate to set the writeable image flag +- asahi: Add agx_batch_track_image helper +- asahi: Add texture/image indexing lowering pass +- asahi: Upload at most the max texture state registers +- asahi: Upload image descriptors +- asahi: Make clear the non-sRGBness of EOT images +- asahi: Don't restrict sampler views +- asahi: Forbid 2D Linear with images +- agx: Add try_coalesce_with helper +- agx: Try to allocate phis compatibly with sources +- agx: Try to allocate phi sources with phis +- agx: Try to allocate phi sources with loop phis +- agx: Vectorize 16-bit parallel copies +- agx: Reduce un/packs with mem access lowering +- agx: Fix bogus assert +- asahi: Augment PBE descriptor for software access +- asahi: Extend PBE packing for image support +- asahi: Use nir_lower_robust_access +- agx: Legalize image LODs to be 16-bit +- agx: Lower image size to txs +- agx: Generalize texture/PBE packing +- agx: Add image write instruction +- agx: Model texture bindless base +- agx: Handle bindless properly for txs lowering +- agx: Pack bindless textures +- agx: Translate texture bindless handles +- agx: Translate image_store from NIR +- agx: Handle frag side effects without render targets +- agx: Wait for outstanding stores before barriers +- agx: Implement image barriers +- agx: Handle early_fragment_tests +- agx: Add interleave opcode +- agx: Extract coords_for_buffer_texture helper +- agx: Extract texture_descriptor_ptr_for_* helpers +- agx: Lower image atomics +- agx: Lower buffer images +- asahi,agx: Fix txf sampler +- agx: Add image_load opcode +- agx: Extract texture write mask handling +- agx: Implement image_load +- agx: Emit global memory barriers for images +- agx: Don't emit silly barriers +- agx: Implement fence_*_to_tex_agx intrinsics +- agx: Add simple image fencing pass +- agx: Require tag writes with side effects +- agx: Plumb in coverage mask +- asahi: Extract sampler_view_for_surface +- asahi: Introduce concept of spilled render targets +- asahi: Add agx_tilebuffer_spills query +- asahi: Do not support masking with spilled RTs +- asahi: Ignore spilled render targets in EOT shaders +- asahi: Ignore spilled render targets with partial renders +- asahi: Extract some tilebuffer lowering code +- asahi: Lower tilebuffer access for spilled RTs +- asahi: Lower multisample image stores +- asahi: Permit meta shaders to use preambles +- asahi: Ignore spilled render targets for background load +- asahi: Offset clear colour uniform by 4 +- asahi: Execute preambles for background programs +- asahi: Advertise Z16_UNORM +- ir2: Switch to nir_legacy +- intel/fs: Don't read reg.base_offset +- panfrost: Remove unused helpers +- nir: Remove nir_lower_locals_to_regs +- nir: Rename lower_locals_to_reg_intrinsics back +- nir: Remove register arrays +- asahi: Don't depend on glibc to decode +- pan/bi: Remove leftover include +- nir/trivialize: Handle more RaW hazards +- panfrost: Disable blending for no-op logic ops +- nir/lower_blend: Fix 32-bit logicops +- nir/lower_blend: Optimize out PIPE_LOGICOP_NOOP +- clang-format: Ignore original panfrost commit +- nir/schedule: Assume no old-style registers +- gallium/u_simple_shaders: Optimize out ffloors +- gallium/u_transfer_helper: Remove dead forward decl +- nir/loop_analyze: Drop unused inverse_comparison +- nir/passthrough_gs: Drop unused array_size_for_prim +- panfrost: Add missing static inline annotation +- pan/decode: Drop unused debug function +- pan/mdg: Add missing static inline annotation +- panfrost: Drop unused decode_position for samples +- panfrost: Only define pan_blitter_get_blend_shaders for midgard +- panfrost: Add missing inline +- panfrost: Gate overdraw_alpha on Bifrost+ +- nir: Rename scoped_barrier -> barrier +- nir: Remove lower_to_source_mods +- nir: Remove lower_vec_to_movs +- nir: Remove reg_intrinsics parameter to convert_from_ssa +- nir: Remove register load/store builders +- r600/sfn: Stop referencing legacy functionality +- r600/sfn: Ignore instruction write masks +- nouveau/codegen: Drop writemask check +- vc4,broadcom/compiler: Drop write_mask handling +- zink: Collapse is_ssa check +- nir: Add {...} before case +- nir/from_ssa: Drop legacy reg support +- nir/schedule: Drop nir_schedule_dest_pressure +- nir: Drop NIR reg create/destroy +- nir: Remove nir_index_local_regs and callers +- nir/schedule: Drop more nir_register handling +- nir: Remove nir_foreach_register +- nir: remove nir_{src,dest}_for_reg +- ntt: Drop nir_register reference +- nir/print: Assume SSA +- nir/clone: Assume SSA +- nir/serialize: Drop legacy NIR +- nir/validate: Assume SSA +- nir: Remove impl->{registers,reg_alloc} +- nir: Remove nir_alu_dest::saturate +- treewide: Drop is_ssa asserts +- nir: Collapse some SSA checks +- treewide: Remove more is_ssa asserts +- nir: Remove reg-only dest manipulation +- nir: Remove stale todo +- nir/print: Drop legacy NIR +- nir: Drop nir_alu_src::{negate,abs} +- treewide: sed out more is_ssa +- pan/mdg: Assume SSA +- treewide: Drop some is_ssa if's +- nir: Drop trivial reg handling +- aco: Remove is_ssa check +- intel: Collapse is_ssa checks +- llvmpipe: Assume SSA +- ir3: Collapse is_ssa checks +- lima: Collapse is_ssa checks +- radeonsi: Collapse SSA check +- nir/gather_ssa_types: Collapse SSA checks +- nir/worklist: Assume SSA +- nir/range_analysis: Assume SSA +- treewide: Collapse more SSA checks +- nir/instr_set: Assume SSA +- nir: Collapse more SSA checks +- nir: Remove def_is_register +- nir: Do not init dests +- nir: Initialize source as a NULL SSA def +- nir: Collapse more SSA checks +- nir: Remove nir_{src,dest}::is_ssa +- nir: Drop nir_register +- nir/from_ssa: Remove pointless union +- ir3: Drop write_mask handling +- rogue: Stop reading write masks +- etnaviv: Don't use alu->dest.write_mask +- etnaviv: What if we just didn't have a compiler? +- intel/vec4: Don't use legacy write mask +- ntt: Evaluate write_mask check +- nir: Remove nir_alu_dest::write_mask +- nir: Remove nir_foreach_def +- lima: Clean up after deleting asserts +- nir: Remove no-op remove_def_cb +- nir: Drop no-op all_srcs_are_ssa +- nir: Simplify alu_instr_is_copy +- nir: Add load_coefficients_agx intrinsic +- agx: Implement nir_intrinsic_load_coefficients_agx +- agx: Allow more varying slots +- agx: Set lower_fisnormal +- agx: Forcibly vectorize pointcoord coeffs +- agx: Add interpolateAtOffset lowering pass +- agx: Lower flat shading in NIR +- asahi: Stub num_dies +- asahi: Move a bunch of helpers to common +- agx: Lower 8-bit ALU +- agx: Handle 8-bit vecs +- asahi,agx: Respect no16 even for I/O +- agx: Don't lower load_local_invocation_index +- agx/dce: Use the helper +- agx: Fix atomics with no destination +- agx: Fix shader info with sample mask writes +- agx: Do not move bindless handles +- agx: Put else instructions in the right block +- agx: Use unconditional else instruction +- agx: Optimize out pointless else instructions +- agx: Fix length bit confusion +- agx: Require an immediate for \`nest` +- agx: Use compressed fadd/fmul encodings +- agx: Optimize swaps of 2x16 channels +- agx: Optimize logical_end removal +- agx: Fix AGX_MESA_DEBUG=demand +- agx: Maintain ctx->max_reg while assigning regs +- agx: Allow 64-bit memory regs +- agx: Fix accounting for phis +- agx: Set phi sources in predecessors +- agx: Stop setting registers after the shader +- agx: Use agx_replace_src +- agx: Assert invariant stated in the comment +- agx: Don't use ssa_to_reg across blocks +- agx: Don't reuse ssa_to_reg across blocks +- agx: Remove unused allocation +- agx: Stop setting forwarding bit +- agx: Handle blocks with no predecessors +- agx: Lower f2u8/f2i8 +- agx: Handle conversions to 8-bit +- agx: Fix uadd_sat packing +- agx: Fix 64-bit immediate moves +- agx: Lower f2f16_rtz +- agx: Handle f2f16_rtne like f2f16 +- agx: Handle <32-bit local memory access +- agx: Do not allow creating vec8 +- asahi: Legalize compression before blitting +- nir: Drop "SSA" from NIR language +- agx: Stop passing nir_dest around +- agx: Remove agx_nir_ssa_index +- pan/mdg: Don't reference nir_dest +- pan/bi: Don't reference nir_dest +- asahi: Do not reference nir_dest +- panfrost: Do not reference nir_dest +- zink: Do not reference nir_dest +- ir3: Do not reference nir_dest +- dxil: Do not reference nir_dest +- nir: Drop nir_dest_init +- panfrost: Pack stride at CSO create time on v9 +- lvp,nir/lower_input_attachments: Use nir_trim_vector +- broadcom/compiler: Use nir_trim_vector explicitly +- nir: Assert that nir_ssa_for_src components matches +- nir: Add nir_shader_intrinsics_pass +- nir: Lower fquantize2f16 +- agx: Lower fquantize2f16 +- nir/lower_helper_writes: Consider bindless images +- nir/passthrough_gs: Correctly set vertices_in +- nir/passthrough_gs: Fix array size +- nir/print: Print access qualifiers for intrinsics +- nir/lower_gs_intrinsics: Remove end primitive for points +- panfrost/ci: Disable T720 +- nir: Add load_sysval_agx intrinsic +- agx: Fix extraneous bits with b2b32 +- agx: Use more barriers +- asahi: Copy CSO stride +- agx: Assert vertex_id, instance_id are VS-only +- asahi: Keep drawoverhead from OOMing itself +- agx: Don't blow up when lowering textures twice +- agx/lower_vbo: Handle nonzero component +- agx: Allow loop headers without later preds +- agx: Handle b2i8 +- agx: Convert 8-bit comparisons +- agx: Implement imul_high +- asahi: Advertise OpenGL ES 3.1! +- asahi/decode: Turn assert into error +- asahi: Report local_size from compiler +- asahi: Use local_size from compiler directly +- asahi: Pass layer stride in pixels, not elements +- agx: Clear sample count after lowering MSAA +- agx: Clear image_array after lowering +- asahi: Preserve atomic ops when rewriting image to bindless +- agx: Use 16-bit reg for pixel_coord +- asahi: Generalize query logic +- asahi: Simplify occlusion query batch tracking +- asahi: Refactor agx_get_query_result +- asahi: Only touch batch->occlusion_queries for occlusion +- asahi: Sync when beginning a query +- asahi: Add non-occlusion query tracking +- asahi: Add get_query_address helper +- agx/fence_images: Use intrinsics_pass +- agx: Do not fence write-only images +- asahi: Add missing LOD source for agx_meta's txfs +- agx: Do some texture lowering early +- agx: Add helper returning if a descriptor crawl is needed +- nir,asahi: Remove texture_base_agx +- asahi: Move UBO lowering into GL driver +- asahi: Add sysval tables for each shader stage +- asahi: Split out per-stage sysvals +- asahi: Collapse grid_info +- asahi: Extract agx_upload_textures +- asahi: Upload a single draw_uniforms per draw +- asahi: Add real per-stage dirty flags +- asahi: Extract sampler upload +- asahi: Put unuploaded uniforms on the batch +- asahi: Decouple sysval lowering from uniform assignment +- asahi: Use finer dirty tracking for blend constant +- asahi: Use proper dirty tracking for VBOs +- asahi: Dirty track VBOs + blend const separately +- asahi: Dirty the shader stage when the shader changes +- asahi: Fix shader stage dirtying +- treewide: Use nir_shader_intrinsic_pass sometimes +- treewide: Also handle struct nir_builder form +- nir/lower_shader_calls: Fix warning with clang +- nir: Add nir_before/after_impl cursors +- treewide: Use nir_before/after_impl in easy cases +- treewide: Use nir_before/after_impl for more elaborate cases +- radv: Use before/after_cf_list for entrypoints +- ci: Disable known broken Bifrost Vulkan job +- ci: Disable WHL jobs +- nir/opt_if: Simplify if's with general conditions +- asahi: Fixes for clang-warnings +- agx: Fix jmp_exec_none encoding +- agx/validate: Print to stderr +- agx: Annotate opcodes with a scheduling class +- agx: Add schedule-specialized get_sr variants +- agx: Include schedule class in the opcode info +- agx: Schedule for register pressure +- agx: Lower pack_32_4x8_split +- asahi: Force translucency for ignored render targets +- agx: Remove logical_end instructions +- agx: Lower pseudo-ops later +- agx: Expand nest +- agx: Lower nest later +- agx: Split nest instruction into begin_cf + break +- agx: Add break_if_*cmp instructions +- agx: Add agx_first/last_instr helpers +- agx: Use agx_first_instr +- agx: Detect conditional breaks +- agx: Omit push_exec at top level +- agx: Omit while_icmp without continue +- agx: Add helper to determine if a NIR loop uses continue +- agx: Only use nest by 1 for loops w/o continue +- agx: Add pseudo-instructions for icmp/fcmp +- agx: Generate unfused comparison pseudo ops +- agx: Fuse conditions into if's +- agx: Fuse compares into selects +- agx: Add unit test for if_cmp fusing +- agx: Add unit test for cmp+sel fusing +- asahi: Translate cube array dimension +- ail: Force page-alignment for layered attachments +- agx: Handle cube arrays when clamping arrays +- agx: Lower coordinates for cube map array images +- agx: Run opt_idiv_const after lowering texture +- asahi: Forbid linear 1D Array images +- asahi: Handle linear 1D Arrays +- asahi: Conditionally expose cube arrays +- gallium,mesa/st: Add PIPE_CONTEXT_NO_LOD_BIAS flag +- asahi: Skip LOD bias lowering for GLES +- nir: Add nir_function_instructions_pass helper +- nir: Add NIR_OP_IS_DERIVATIVE property +- nir: Hoist nir_op_is_derivative +- nir/opt_preamble: Use nir_op_is_derivative +- nir/opt_gcm: Use nir_op_is_derivative more +- nir/gather_info: Use nir_op_is_derivative +- nir/opt_sink: Sink load_constant_agx +- nir/opt_sink: Sink load_local_pixel_agx +- nir/opt_sink: Sink frag coord instructions +- nir/opt_sink: Do not move derivatives +- nir/opt_sink: Move ALU with constant sources +- nir/opt_sink: Also consider load_preamble as const +- agx: Enable sinking ALU +- treewide: Drop nir_ssa_for_src users +- treewide: Remove remaining nir_ssa_for_src +- nir: Remove nir_ssa_for_src +- asahi: Clamp index buffer extent to what's read +- agx: Align the reg file for 256-bit vectors +- agx: Hoist sample_mask/zs_emit +- agx: Set PIPE_SHADER_CAP_CONT_SUPPORTED +- agx: Augment if/else/while_cmp with a target +- agx: Add jumps to block ends +- agx: Add agx_prev_block helper +- agx: Insert jmp_exec_none instructions +- nir: Add layer_id_written_agx sysval +- nir: Support arrays in block_image_store_agx +- agx/nir_lower_texture: Allow disabling layer clamping +- agx: Pack block image store dim correctly +- agx: Handle layered block image stores +- agx: Add pass to lower layer ID writes +- asahi: Add helper to get layer id in internal program +- asahi,agx: Select layered rendering outputs +- agx: Support packed layered rendering writes +- agx/tilebuffer: Support layered layouts +- agx/lower_tilebuffer: Support spilled layered RTs +- asahi: Use layered layouts +- asahi: Expose VS_LAYER_VIEWPORT behind a flag +- asahi: Account for layering for attachment views +- asahi: Assume LAYER is flat-shaded +- asahi: Add pass to predicate layer ID reads +- asahi: Predicate layer ID reads +- asahi: Write to cubes/etc attachments as 2D array +- asahi: Use a 2D Array texture for array render targets +- asahi: Generate layered EOT programs +- asahi: Handle layered background programs +- lima/pp: Do not use union undefined behaviour +- nir: Add trivial nir_src_* getters +- nir: Use set_parent_instr internally +- nir: Use getters for nir_src::parent_* +- nir: Assert the nir_src union is used safely +- nir: Use a tagged pointer for nir_src parents +- nir: Add ACCESS_CAN_SPECULATE +- ir3: Set CAN_SPECULATE before opt_preamble +- ir3: Model cost of phi nodes for opt_preamble +- nir/opt_preamble: Walk cf_list manually +- nir/opt_preamble: Preserve IR when replacing phis +- nir/opt_preamble: Unify foreach_use logic +- nir/opt_preamble: Move phis for movable if's +- nir/opt_preamble: Respect ACCESS_CAN_SPECULATE +- freedreno/ci: Minetest +- r600/sfn: Handle load_global_constant +- nir/opt_phi_precision: Work with libraries +- nir/legalize_16bit_sampler_srcs: Use instr_pass +- nir/print: Handle KERNEL +- nir/lower_io: Use load_global_constant for OpenCL +- nir/opt_algebraic: Reduce int64 +- nir/opt_algebraic: Optimize LLVM booleans +- nir/trivialize_registers: Handle obscure load hazard +- hasvk: Support builiding on non-Intel +- crocus: Support building on non-Intel +- meson: Add vulkan-drivers=all option +- meson: Add gallium-drivers=all option +- agx: Fix fragment side effects scheduling + +Amber (7): + +- ir3: make wave_granularity configurable +- turnip: Add support for devices not supporting double thread size. +- turnip: make sampler_minmax support configurable. +- freedreno, turnip: set correct reg_size_vec4 for a6xx_gen1_low +- ir3: handle non-uniform case for atomic image/ssbo intrinsics +- freedreno: Add support for devices not supporting double thread size. +- turnip: Add debug option to allow non-conforming features. + +Andrew Randrianasulu (1): + +- nv50/ir: Remove few nvc0 specific defines from nv50-specific header. + +Antonio Gomes (9): + +- rusticl/kernel: Removing unnecessary clone in kernel launch +- rusticl/kernel: Add CsoWrapper +- rusticl/compiler: Add NirPrintfInfo +- rusticl: Move Cso to Program +- rusticl/compiler: Remove unnecessary functions +- rusticl: Move NirKernelBuild to ProgramDevBuild +- rusticl/program: New helper functions to NirKernelBuild +- rusticl/core: Delete KernelDevState and KernelDevStateInner +- rusticl/core: Make convert_spirv_to_nir output pair (KernelInfo, NirShader) + +Asahi Lina (29): + +- docs/tgsi: Specify that depth texture fetches are replicated +- asahi: Add synctvb debug flag +- asahi: Add smalltile debug option +- asahi: Add nomsaa debug flag +- asahi: decode: Add a params argument to pass through +- asahi: Add extra CDM header block for G14X +- asahi: wrap: Handle freeing shmems +- asahi: decode: Refactor to always copy GPU mem to local buffers +- asahi: decode: Add a function to construct decode_params from a chip_id +- asahi: Add a shared library interface for decode +- asahi: Add a noshadow debug flag +- asahi: Do not overallocate BOs by more than 2x +- asahi: Fix race in BO stats accounting +- asahi: Always use resource size, not BO size +- asahi: Print info about shadowed resources +- asahi: Impose limits on resource shadowing +- asahi: Force linear for SHARED buffers with no/implicit modifier +- asahi: Enable explicit coherency for G14D (multi-die) +- asahi: Handle non-written RTs correctly +- asahi: Fix incorrect BO bitmap reallocations +- asahi: Allocate staging resources as staging +- asahi: cmdbuf: Identify call/ret bits +- asahi: decode: Implement VDM call/ret +- asahi: decode: Do not assert on buffer overruns +- asahi: Fix VDM pipeline field width +- asahi: Add scaffolding for supporting driconf options +- asahi: Add and support the no_fp16 driconf flag +- driconf: Disable fp16 for browsers +- asahi: Allow no16 flag for disk cache + +Bas Nieuwenhuizen (16): + +- aco: fix nir_op_vec8/16 with 16-bit elements. +- aco: Fix some constant patterns in 16-bit vec4 construction with s_pack. +- nir: Fix 16-component nir_replicate. +- radv: Expose VK_EXT_external_memory_acquire_unmodified. +- util/perf: Add gpuvis integration. +- egl,venus,vulkan,turnip,freedreno: Update CPU trace init to init more than perfetto. +- vulkan: Add CPU tracing for vkWaitForFences. +- docs: Add documentation for gpuvis. +- vulkan: Add trace points for more Vulkan waiting functions. +- radv: Use a double jump to limit nops in DGC for dynamic sequence count. +- nir: Add AMD cooperative matrix intrinsics. +- aco: Add WMMA instructions. +- aco: Make RA understand WMMA instructions. +- radv: Don't transparently use wave32 with cooperative matrices. +- radv: Add cooperative matrix lowering. +- radv: Expose VK_KHR_cooperative_matrix. + +Benjamin Cheng (10): + +- radv/video: use app provided hevc scaling list order +- radv/video: copy from correct H264 scaling lists +- anv/video: copy from correct H264 scaling lists +- vulkan/video: add helper to derive H264 scaling lists +- radv/video: use vk_video_derive_h264_scaling_list +- anv/video: use vk_video_derive_h264_scaling_list +- util/vl: extract gallium vl scanning data to shared code +- radv/video: send h264 scaling list in raster order +- anv/video: send h264 scaling list in raster order +- radv/video: find SPS with pps_seq_parameter_set_id + +Benjamin Lee (1): + +- nvk: Fix segfault when opening DRI device file returns error + +Biswapriyo Nath (1): + +- radv/video: Match function definitions to declarations + +Boris Brezillon (1): + +- panfrost: Flag the right shader when updating images + +Boyuan Zhang (3): + +- virgl: Add vp9 picture desc +- virgl: Implement vp9 hardware decode +- radeonsi/vcn: disable tmz ctx buffer for VCN_2_2_0 + +Caio Oliveira (134): + +- nir: Use instructions_pass() for nir_fixup_deref_modes() +- meson: Ensure that LLVMSPIRVLib is not required for Clover +- nir: Let nir_fixup_deref_modes() fix deref_casts when possible +- nir: Add nir_opt_reuse_constants() +- radv: Use nir_opt_reuse_constants() +- compiler/types: Use ralloc for the key in array_types +- compiler/types: Use smaller keys for array_types table +- compiler/types: Extract get_explicit_matrix_instance() function +- compiler/types: Use smaller keys for explicit_matrix_types table +- anv/tests: Refactor state_pool_test_helper to not use macros for parametrization +- anv/tests: Link a single anv_tests binary using gtest +- anv/tests: Propagate failures to gtest +- hasvk/tests: Refactor state_pool_test_helper to not use macros for parametrization +- hasvk/tests: Link a single hasvk_tests binary using gtest +- hasvk/tests: Propagate failures to gtest +- util: Add convenience macros for linear allocator +- compiler/types: Use right hash for function types +- compiler/types: Don't duplicate empty string +- compiler/types: Constify a couple of pointers in glsl_type +- compiler/types: Remove unused GLSL_TYPE_FUNCTION and related functions +- compiler/types: Move GLSL specific builtin structs into glsl/ +- glsl: Add missing glsl_types initialization to test_optpass +- glsl: Don't create struct type builtins +- compiler/types: Add extra level of macro to builtin_macros +- compiler/types: Use designated initializer syntax to specify builtins +- compiler/types: Move local cache details to implementation file +- compiler/types: Add a mem_ctx for the glsl_type_cache +- compiler/types: Use type cache mem_ctx for hash tables +- compiler/types: Don't store a mem_ctx per type +- compiler/types: Simplify clearing the glsl_type_cache +- compiler/types: Move static asserts about glsl_type to a central place +- compiler/types: Store builtin types directly as data +- compiler/types: Use a linear (arena) allocator for glsl_types +- compiler/types: Make struct glsl_type visible to C code +- compiler/types: Add workaround to use builtin_type_macros.h in C +- compiler/types: Move builtin type initialization to C +- glsl: Annotate _mesa_glsl_error() with PRINTFLIKE +- compiler/types: Fix array name dimension flipping for unsized arrays +- compiler/types: Use Python to generate code for builtin types +- compiler/types: Use glsl_get_type_name() to access the type name +- compiler/types: Change glsl_type::name to be an uintptr_t +- compiler/types: Use a string table for builtin type names +- intel/compiler/xe2: Account for reg_unit() in TCS intrinsics +- intel/compiler/xe2: Account for reg_unit() in TES intrinsics +- intel/fs/xe2+: Update BS payload setup for Xe2 reg size. +- intel/fs/xe2+: Update TASK/MESH payload setup for Xe2 reg size. +- compiler: Use a meson dependency for libcompiler +- meson: Remove unnecessary inc_compiler mentions +- rusticl: Ensure NIR generated headers will be available +- clover: Hide SPIR-V related code behind HAVE_CLOVER_SPIRV +- clover: Only compile/depend libclspirv and libclnir when using SPIR-V support +- compiler: Only enable mesaclc helper if we have OpenCL SPIR-V support +- intel/compiler: Don't allocate memory for SIMD select error handling +- microsoft/compiler: Fix printf formatting string issues +- util: Add more PRINTFLIKE and MALLOCLIKE annotations +- util: Remove ralloc_parent from linear_header +- util: Use linear parent to (r)allocated extra nodes +- util: Remove size from linear_parent creation +- util: Make DECLARE_LINEAR_ALLOC_* macros assume no destructors +- util: Use an opaque type for linear context +- util: Remove usages of linear_realloc() +- util: Remove linear_realloc() +- util: Remove size information from child allocations +- util: Remove per-buffer header in linear alloc for release mode +- util: Add a few basic tests for linear_alloc +- util: Fix bookkeeping of linear node sizes +- intel/compiler: Don't store stage name and abbrev +- intel/compiler/xe2: URB fence uses LSC now +- intel/compiler/xe2: Fix URB writes in TCS +- intel/compiler/xe2: Update TCS ICP handle code to support SIMD16 +- compiler/types: Add support for Cooperative Matrix types +- nir: Add new intrinsics for Cooperative Matrix +- nir: Handle cooperative matrix in various passes +- spirv: Expose some memory related functions in vtn_private.h +- spirv: Let vtn_ssa_value hold references to variables +- spirv: Implement SPV_KHR_cooperative_matrix +- compiler/types: Remove private related declarations +- compiler/types: Remove use of new/delete +- compiler/types: Remove use of references +- compiler/types: Remove use of auto +- compiler/types: Use C compatible cast syntax +- compiler/types: Spell struct and enum in type names +- compiler/types: Add void parameter to ensure these are valid C prototypes +- intel/fs: Tweak default case of fs_inst::size_read() +- compiler/types: Move the C++ inline functions in glsl_type out of the struct body +- compiler/types: Move C declarations into glsl_types.h +- compiler/types: Flip wrapping of base_type checks +- compiler/types: Flip wrapping of various type identification checks +- compiler/types: Flip wrapping of convenience accessors for vector types +- compiler/types: Flip wrapping of basic "get type" functions +- rusticl: Add Rust bindings for inline glsl_types functions +- util: Add size to ralloc_header in debug mode +- util: Add a canary to identify gc_ctx in debug mode +- util: Add function print information about a ralloc tree +- util: Avoid waste space when linear alloc'ing large sizes +- spirv: Expose stage enum conversion in vtn_private.h +- spirv: Change spirv2nir to use the shorter shader name abbreviations +- spirv: List entry-points in spirv2nir when unsure what to use +- spirv: Let spirv2nir find out the shader to use +- intel/compiler: Don't emit calls to validate() in release build +- compiler/types: Flip wrapping of "type contains?" predicate functions +- compiler/types: Flip wrapping of array related functions +- compiler/types: Flip wrapping of cmat related functions +- compiler/types: Flip wrapping of CL related functions +- compiler/types: Flip wrapping of size related functions +- compiler/types: Flip wrapping of struct related functions +- compiler/types: Flip wrapping of interface related functions +- compiler/types: Flip wrapping of layout related functions +- compiler/types: Flip wrapping of record_compare +- compiler/types: Flip wrapping of get_instance() +- compiler/types: Flip wrapping of texture/sampler/image get instance functions +- compiler/types: Flip wrapping of various get instance functions +- compiler/types: Flip wrapping of get row/column type helpers +- compiler/types: Flip wrapping of remaining non-trivial type getters +- compiler/types: Flip wrapping of remaining small data getters +- compiler/types: Flip wrapping of numeric type conversion functions +- compiler/types: Move remaining code from nir_types to glsl_types +- rusticl: Add bindings for glsl_vector_type() +- compiler/types: Add more glsl_contains_*() functions and use them in C++ +- compiler/types: Add glsl_get_mul_type() and use it in C++ +- compiler/types: Add glsl_type_compare_no_precision() and use it in C++ +- compiler/types: Add glsl_type_uniform_locations() and use it in C++ +- compiler/types: Add glsl_get_std430_array_stride() and use it in C++ +- compiler/types: Add glsl_get_explicit_*() functions and use them in C++ +- compiler/types: Implement glsl_type::field_type() in terms of existing functions +- compiler/types: Add glsl_simple_explicit_type() and simplify glsl_simple_type() +- compiler/types: Add remaining type extraction functions and use them in C++ +- compiler/types: Use C instead of C++ constants for builtin types +- compiler/types: Remove usages of C++ members in glsl_types.cpp +- compiler/types: Annotate extern "C" only once in glsl_types.cpp +- compiler/types: Rename glsl_types.cpp to glsl_types.c +- compiler/types: Remove warnings about potential fallthrough +- compiler/types: Move comments and reorganize declarations +- anv: Fix leak when compiling internal kernels + +Carsten Haitzler (2): + +- kmsro: Add hdlcd DPU +- panfrost: Add GPU variant of G57 to the set of known ids + +Charles Giessen (1): + +- panvk: Use 1.0 in ICD Manifest json + +Charmaine Lee (8): + +- svga: set clear_texture to NULL for vgpu9 +- svga: fix stride used in vertex declaration +- svga: fix persistent mapped surface update to constant buffer +- svga: restrict use of rawbuf for constant buffer access to GL43 device +- svga: fix immediates used in rawbuf for constant buffer +- svga: use srv raw buffer for accessing readonly shader buffer +- svga: sync resource content from backing resource before image upload +- svga: ignore sampler view resource if not used by shaders + +Chia-I Wu (38): + +- radv: fix separate depth/stencil layouts in fb state +- radv: fix separate depth/stencil layouts in resolve meta +- radv: refactor depth clear in clear meta +- radv: fix separate depth/stencil layouts in clear meta +- amd/ci: update radv-stoney-aco-fails.txt for depth/stencil clear +- radv: disable tc-compat htile for layered images on gfx8 +- amd/ci: update radv-stoney-aco-fails.txt for depth/stencil resolve +- winsys/amdgpu: fix a race between import and destroy +- ac/surface: limit RADEON_SURF_NO_TEXTURE to color surfaces +- winsys/radeon: fix a race between bo import and destroy +- vulkan/runtime: add a helper for ETC2 emulation +- radv: use vk_tecompress_etc2 from the runtime +- vulkan/runtime: fix image type check for ETC2 emulation +- vulkan/runtime: fix a harmless typo for ETC2 emulation +- vulkan/runtime, radv: remove 1D support from ETC2 emulation +- radv: add radv_is_format_emulated +- radv: simplify view format override for emulated formats +- radv: hard code format features for emulated formats +- mesa: make astc_decoder.glsl vk-compatible +- radv, drirc: rename radv_require_{etc2,astc} +- anv: remove unused field from anv_image_view +- anv: add anv_image_view_{init,finish} +- anv: support image views with surface state stream +- anv: add anv_push_descriptor_set_{init,finish} +- anv: support alternative push descriptor sets +- anv: add anv_descriptor_set_write +- anv: add anv_cmd_buffer_{save,restore}_state +- anv: add anv_is_format_emulated +- anv: add a hidden plane for emulated formats +- anv: decompress on upload for emulated formats +- anv: fix up image views for emulated formats +- anv: fix up blit src for emulated formats +- anv: advertise emulated formats +- anv: add support for vk_require_astc driconf +- util: improve BITFIELD_MASK and BITFIELD64_MASK on clang +- anv: prep for gen9 astc workaround +- anv: add gen9 astc workaround +- radv: fix image view extent override for astc + +Chris Spencer (9): + +- radv: initialize result when pipeline cache creation fails +- anv/android: Fix importing hardware buffers with planar formats +- anv/android: Add support for AHARDWAREBUFFER_FORMAT_YV12 +- anv: Advertise Vulkan 1.3 on Android 13 +- anv: Don't reject Android image format if external props not supplied +- android: Add explanatory comment to u_gralloc +- anv/android: Enable shared presentable image support +- anv/video: use correct enum value for max level IDC +- radv/video: use correct enum value for max level IDC + +Christian Gmeiner (41): + +- nir/print: print instr pass_flags +- etnaviv: move nir texture lowerings into one pass +- nir: add enta specific intrinsic used for txs lowering +- etnaviv: nir: support intrinsic used for txs lowering +- etnaviv: nir: lower nir_texop_txs +- ci/etnaviv: update ci expectations +- etnaviv: make use of BITFIELD_BIT(..) macro +- etnaviv: name the enum used for pass_flags +- etnaviv: add is_dead_instruction(..) helper +- etnaviv: extend etna_pass_flags with source modifiers +- etnaviv: do not clear all pass_flags before RA +- etnaviv: nir: look at parent instr in lower_alu(..) +- etnaviv: nir: add etna_nir_lower_to_source_mods(..) +- etnaviv: nir: switch to etna_nir_lower_to_source_mods(..) +- etnaviv: nir: convert to new-style NIR registers +- freedreno/regs: remove double assignment of self.current_domain +- freedreno/regs: remove not used variable +- freedreno/regs: remove dead code +- freedreno/regs: python does not need ';' +- etnaviv: switch to log2f(..) +- etnaviv: switch to U_FIXED(..) macro +- etnaviv: switch to S_FIXED(..) macro +- etnaviv: fix null pointer dereference +- etnaviv: switch to float_to_ubyte(..) +- ci/etnaviv: update ci expectation +- etnaviv: unbreak cmdline compiler +- agx/lower_address: Use intrinsics_pass +- agx/lower_address: Remove not used has_offset +- isaspec: python does not need ';' +- docs: Move isaspec out of drivers/freedreno +- isaspec: Add support for templates +- isaspec: encode: Correct used regex +- isaspec: Add method to get all instrustions +- isaspec: Add support for custom meta information +- isaspec: Add BitSetEnumValue object +- spirv: Don't use libclc for rotate +- docs: update etnaviv extensions +- etnaviv: drm: Be able to mark end of context init +- etnaviv: Skip 'empty' cmd streams +- ci: Bump PyYAML to 6.0.1 +- etnaviv: Don't leak disk_cache + +Collabora's Gfx CI Team (2): + +- Uprev Piglit to ed58dfbd12be34fa3dab97a7a2987b890e0637f1 +- Uprev Piglit to f7db20b03de6896d013826c0a731bc4417c1a5a0 + +Cong Liu (2): + +- r300: Fix out-of-bounds access in ntr_emit_store_output() +- virgl:Fix ITEM_CPY macro pointer copy bug + +Connor Abbott (83): + +- afuc: Rework and significantly expand README.rst +- tu: Fix vk2tu_*_stage flag type +- tu: Fix and simplify execution dependency handling +- tu, freedreno/a6xx: Remove has_ccu_flush_bug +- ir3: Handle GS stream "mixing" with non-point output primitives +- tu: Disable transformFeedbackPreservesProvokingVertex +- isaspec: Add "displayname" for altering {NAME} when decoding +- isaspec: Add support for "absolute" branches +- isaspec: Add support for function and entrypoint labels +- isaspec: Add "custom" field type +- isaspec: Add callback after decoding an instruction +- isaspec: Rename isa_decode() to isa_disasm() +- isaspec: Add initial decoding support +- afuc: Fix xmov lexer typo +- afuc: Convert to isaspec +- afuc: Add setbit/clrbit +- afuc: Fix writing $00 +- freedreno/afuc: Initial a7xx support +- ir3: Parse (eq) flag +- ir3, freedreno, tu: Plumb through SP_FS_PREFETCH_CNTL::ENDOFQUAD +- tu: Add missing last_baryf statistic +- freedreno, tu, ir3: Add last_helper statistic +- ir3: Gather pixlod status earlier +- ir3: Implement helper invocation optimization +- vk/graphic_state, tu: Use dynamic blend count from subpass +- freedreno/a7xx: Add CP_RESET_CONTEXT_STATE +- vk/graphics_state: Fix copying MS locations pipeline state +- tu: Remove MSAA draw state +- tu: Merge SAMPLE_LOCATIONS and SAMPLE_LOCATIONS_ENABLE draw states +- tu: Merge PC_RASTER_CNTL into RAST draw state +- tu: Stop reusing base Vulkan dynamic state enums +- tu: Merge depth/stencil draw states +- tu: Rename PrimID-related registers +- tu, freedreno/a6xx: Don't use VS for PrimID passthru state +- tu: Pull entangled shader state into program config +- ir3: Add ir3_find_input_loc() helper +- tu: Split up tu6_emit_vpc() +- freedreno, ir3, tu: Constify various uses of ir3_shader_variant +- ir3: Add helper to determine when variant exceeds safe constlen +- tu: Split program draw state into per-shader states +- tu: Fix per-view viewport state propagation +- tu: Fix tu6_emit_*_fdm size call +- tu: Fix assert in FDM state emission +- tu: Actually emit patchpoint for viewports with FDM +- nir/lower_subgroups: Don't do multiple lowerings at once +- nir/spirv: Add inverse_ballot intrinsic +- amd: Use inverse ballot intrinsic if available +- tu: Create singleton "empty" shaders +- tu: Start tracking shaders independently of pipeline +- tu: Move FS-specific pipeline information to the shader +- tu: Use shader directly for VS/TCS output size and patch size +- tu: Rewrite tessellation modes handling +- tu: Rework passing shared consts +- tu: Decouple program state from the pipeline +- tu: Use pipeline feedback loop flag indirectly +- tu: Rewrite remaining pipeline LRZ handling +- tu: Don't reference pipeline for some draw states +- tu: Make compute dispatch use the shader +- tu: Don't use pipeline for dynamic draw states +- tu: Don't use pipeline for bandwidth validity +- tu: Don't use pipeline for per_view_viewport +- tu: Don't use pipeline for active stages +- tu: Remove pipeline from state +- zink: Rework color clamping and conversion +- freedreno/fdl: Use A8_UNORM HW format for sampling +- tu: Support clearing A8_UNORM +- freedreno/fdl: Support PIPE_FORMAT_R5G5B5A1_UNORM on a6xx +- tu/clear_blit: Fix staging image view layer count +- tu/clear_blit: Allow VK_REMAINING_ARRAY_LAYERS as layerCount +- tu: Allow VK_WHOLE_SIZE in tu_CmdBindVertexBuffers2EXT pSizes +- tu: Implement vkCmdBindIndexBuffer2KHR +- tu: Implement vkGetImageSubresourceLayout2KHR and vkGetDeviceImageSubresourceLayoutKHR +- tu: Implement vkGetRenderingAreaGranularityKHR +- tu: Use new buffer usage flags +- tu: Support VkPipelineCreateFlags2CreateInfoKHR +- tu: Check for DEVICE_LOST in vkGetEventStatus() +- tu: Add maintenance5 properties +- freedreno/ci: Skip dEQP-VK.info.device_extensions +- tu: Expose VK_KHR_maintenance5 +- freedreno/ci: Remove minetest trace +- v3d/ci: Remove minetest trace +- ir3/ra: Don't swap killed sources for early-clobber destination +- tu: Fix re-emitting VS param state after it is re-enabled + +Corentin Noël (16): + +- ci: Add locked flag to bindgen-cli installation +- virgl: Do not expose EXT_texture_mirror_clamp when using a GLES host +- ci: disable Collabora's LAVA lab for maintenance +- llvmpipe: make sure to initialize the lp_setup_context slots with the default values +- virgl: Cover all the formats defined in the virgl definition +- mesa: Ensure that the baselevel will never exceed the maximal supported number +- ci: Uprev virglrenderer +- freedreno/drm/virtio: Use MESA_TRACE_SCOPE instead of _BEGIN/_END +- tu: Use MESA_TRACE_SCOPE instead of _BEGIN/_END +- aux/tc: Use MESA_TRACE_SCOPE instead of _BEGIN/_END +- venus: Change the only occurrence of VN_TRACE_BEGIN/END to VN_TRACE_SCOPE +- util: Avoid the use of MESA_TRACE_BEGIN/END +- util/perf: Remove the tracing categories +- util: Remove MESA_TRACE_BEGIN/END +- mesa/bufferobj: ensure that very large width+offset are always rejected +- frontends/va: Remove wrong use of ProfileToPipe + +Daniel Schürmann (9): + +- nir/opt_move: fix handling of if-condition +- aco: append p_logical_end after monolithic RT shaders +- aco/insert_exec_mask: set Exact mode after p_discard_if when necessary +- aco: don't optimize cross-lane instructions across p_wqm +- aco: make p_wqm a marker instruction without Operands/Definitions +- aco: don't insert a copy when emitting p_wqm +- aco: insert a single p_end_wqm after the last derivative calculation +- aco/insert_exec_mask: Simplify WQM handling (1/2) +- aco/insert_exec_mask: Simplify WQM handling (2/2) + +Daniel Stone (23): + +- dri: Support 1555/4444 formats +- egl/dri2: Don't look up image extension twice +- egl/wayland: Always initialise fd_display_gpu +- egl/wayland: Add image loader extension for swrast +- egl/wayland: Never use DRI2_LOADER extension +- egl/wayland: Assume modern DRI interface versions +- egl/drm: Use IMAGE_DRIVER instead of DRI2_LOADER +- egl/drm: Assume modern DRI interface versions +- ci: Disable nouveau CI +- panfrost/vk: Use correct sampler dimensions for MSAA +- ci: Declare stages before jobs +- ci/radeonsi: Add new flake +- ci/d3d12: Add new flake +- ci/intel: Add new skqp flake +- ci/zink: Add new zink-lvp flakes +- ci/radeonsi: Skip more really slow tests +- ci/zink: Add another conversion fail on a618 +- ci: Move farm-disable rules before anything else +- ci: Always set user container jobs to manual +- ci: Use container rules for containers +- ci: Only look at file changes for MRs +- ci: Fix pre-merge pipelines with no code changes +- ci: Try really hard to print final result string + +Daniel van Vugt (1): + +- glx: Increment dpy->request before issuing an error that had no request + +Danylo Piliaiev (71): + +- freedreno/cffdec: Decode CP_DRAW_AUTO +- freedreno, turnip: Clarify some RB_CCU_CNTL fields +- freedreno,turnip: Make number of VSC pipes configurable +- freedreno,turnip: Make CS shared memory size configurable +- freedreno,turnip: Make VS input attr/binding count configurable +- freedreno: Add A605, A608, A610, A612 GPUs definition +- turnip: Make multiview support configurable per generation +- ir3: Make FS tex prefetch optimization optional +- ir3: Use NIR info to enable per sample shading +- freedreno/regs: Rename SP_FS_CTRL_REG0.DIFF_FINE into LODPIXMASK +- ir3: Fix FS quad ops returning wrong values from helper invocations +- tu,freedreno: Forbid blit event for R8G8_SRGB due to gpu faults +- radv: fix unused non-xfb shader outputs not being removed +- vulkan/nir: Add common helper to check if output is XFB +- radv: Use common nir_vk_is_not_xfb_output +- turnip: Use common nir_vk_is_not_xfb_output +- freedreno/regs: Define unknown SP_FS_PREFETCH_CNTL fields +- freedreno/registers: Refactor gen_header.py to allow more options +- freedreno/registers: Generate python files with reg offsets +- freedreno: Add a list of raw magic regs +- freedreno: Fully define a730 and a740 device properties +- ir3/tests: Use fd_dev_info to infer GPU generation +- freedreno/computerator: Fix remaining issues with A7XX +- isaspec: Make possible to obtain gpu_id in <expr> blocks +- ir3/a7xx: cat5 mode1 has swapped tex/samp ids +- ir3/a7xx: Don't multiply global mem instruction's offset by 4 +- ir3/a7xx: insert lock/unlock at the end of every compute shader +- ir3/a7xx: Add ccinv instruction +- ir3/a7xx: Use ccinv for data synchronization +- ir3/a7xx: Disable shared consts for a7xx +- tu/common: Generalize TU_GENX macro +- tu: Basic a7xx support +- freedreno/fdl: Set LOSSLESSCOMPEN for image when ubwc is enabled on a7xx +- tu/a7xx: Fix geometry shaders +- tu/a7xx: Fix tesselation shaders +- tu/a7xx: Fix multiview +- tu/a7xx: Fix flat shading +- tu/a7xx: Fix occlusion query +- tu/a7xx: Fix 3d blits after multiview usage +- tu/a7xx: Fix CmdDrawIndirectByteCountEXT +- tu/a7xx: Disable LRZ +- ir3/lower_tex_prefetch: Fix crash with lowered load_barycentric_at_offset +- tu: Exclude SP_UNKNOWN_AE73 from reg stomping +- tu: Call tu_cs_dbg_stomp_regs with appropriate GPU gen +- freedreno/replay: Add limited support for KGSL +- freedreno/rddecompiler: Update to handle a7xx +- freedreno/replay: Add "print" instr to ir3 asm to be used in replay +- freedreno/replay: Add "gpu_print" function for command streams +- tu/perfetto: Remove now unnecessary tu_perfetto_util +- tu/perfetto: Allow gpu time to be passed into tu_perfetto_submit +- tu/kgsl: Fix memory leak of tmp allocations during submissions +- tu/kgsl: Support u_trace and perfetto +- tu/a7xx: Correctly record timestamps for u_trace +- tu/virtio: Fix incorrect call to tu_perfetto_submit +- ci: Compile Turnip's virtio kmd in debian-arm64 +- freedreno/registers: Refine a7xx push consts registers +- ir3,tu: Refactor push consts info plumbing +- freedreno: Make possible to specify A7XX feature flags +- turnip,ir3: Implement A7XX push consts load via preamble +- tu: Add push_consts_per_stage debug option +- tu: Fix VK_FORMAT_A8_UNORM_KHR using UBWC when !has_8bpp_ubwc +- tu/kgsl: Fix field order in kgsl_command_object init +- tu: Fix stale tu_render_pass_attachment::store_stencil with dyn rendering +- tu: Zero init tu_render_pass and tu_subpass for dynamic rendering +- tu: Disable preamble push consts when they are not used +- ir3: Fix values of #wrmask not being compatible with ir3 parser +- tu: Count a whole push consts range in constlen for PREAMBLE push consts +- freedreno/rddecompiler: Use fd_dev_gen to pass gpu_id to ir3 disasm +- freedreno/rddecompiler: Decompile repeated IBs +- freedreno: Fix field size of A6XX_TEX_CONST[3].ARRAY_PITCH +- tu: Fix reading of stale (V)PC_PRIMITIVE_CNTL_0 + +Dave Airlie (163): + +- ci: remove binding model from the asan skips for lavapipe. +- gallivm: fix atomic global temporary storage. +- llvmpipe: fix fragdata/lastfragdata heuristic a bit more. +- nvk: add missing finish calls +- nvk: add some initial wsi framework. +- nvk: fix header guards to be less generic. +- nvk: add bind buffer memory +- nvk: Add initial queue +- nvk: add cmd buffer framework +- nvk: Reset pushbufs on command buffer reset +- nvk: reindent descriptor sets to mesa std. +- nvk: add initial descriptor pool framework. +- nvk: some boilerplate for descriptor sets +- nvk: add descriptor set bo allocation. +- nvk: implement buffer address. +- nvk: descriptor set freeing fix +- nvk: move to new command stream generator. +- nvk: port the blit and copy code to new command submission. +- nouveau/ws: drop the old push generators. +- nvk: link in codegen without gallium bits. +- nvk: Initial wiring in of the compiler +- nvk: Basic descriptor binding +- nouveau/vk: add support for compute classes to generator. +- nvk: retrieve gpc/mp counts from kernel. +- nvk: add support for preamble and tls allocation. +- nvk: add record result to cmd_buffer. +- nvk: add command stream upload buffer. +- nouveau/winsys: Add m2mf/compute objects +- nvk: add some basic format wrapping framework +- nvk: add some compute limits +- nvk: add basic nve4+ compute support. +- nvk: fix empty cmd submission. +- nouveau/ws: add a push reset just for references. +- nouveau/classes: add 906f header support. +- nvk: add initial 8/16 byte clears. +- nvk: fix pipeline pushbuf sizing +- nvk: increase graphics cpu push buffer +- nvk: fix depth emission ordering. +- nvk: add some limits/features from binary driver. +- nvk: add indexed draw support. +- nvk: assign vertex locations according to input attrib index +- nvk: lower io to temps to avoid output reads in vertex shaders +- nvk: handle NULL to destroy descriptor pool +- nvk: add basic primitive restart +- nvk: fix copy lower address extraction +- nvk: fix multiple pipelines failure allocation case. +- nvk: init dev->physical_device earlier. +- nvk/winsys: store device ptr into bo instead of ptr +- nvk: set the device fd +- nil: Fix image align and size constraints +- nvk: Report image alignments from NIL +- nouveau/winsys: allocate unique object handles across channels. +- nvk/nil: don't ask for compressed image kind +- nvk/barrier: handle host bit. +- nvk: add compute support for ampere +- nvk: add min_lod to spirv caps. +- nvk: fix r32_sint format support +- nvk: expose EXT_sampler_filter_minmax +- nvk: fix transform feedback crash when optimiser removes things. +- nvk: merge tess info between tcs/tes. +- nvk: introduce an optimisation loop. +- nvk: add support for D32_SFLOAT_S8_UINT +- nvk/query: fix push buffer size for copy pool results. +- nvk: init image fields for requirements +- nvk: handle alignments in device memory +- nvk/tess: don't emit patch control points in pipeline +- nvk: align geometry clip setting with nvc0 +- nvk: fix independent color write masks. +- nvk: enable rgb32 texel buffer support +- nvk: enable EXT_depth_clip_control +- nvk: enable EXT_depth_clip_enable +- nvk: always sync internal cmd bufs for vma lifetimes. +- nouveau/winsys: add support for the vma bind interfaces +- nvk: Add support for sparse buffers +- nvk: Add support for sparse images +- nvk/queue: add support for syncobjs and sparse binds +- nvk: Handle pre-turing indirect buffers with sparse +- nvk: enable sparse features +- nvk: enable a bunch of external fence/semaphore bits +- nvk: enable sparse residency buffer on maxwell+ +- nvk: add new internal bo allocation flag. +- docs: add two nvk exts to features.txt +- zink: use fprintf instead of printf to align the requirements warnings +- nvk: align sampler allocation counts with nvidia. +- zink: turn off threaded cpu access if not visible. +- nvk: add gart forced cmd pool side buffer. +- nvk: add cond render upload buffer. +- nvk: enable KHR_shader_clock. +- nvk: NOUVEAU_WS_BO_LOCAL is a trap. +- gallivm: drop unused info parameter +- llvmpipe/fs: drop cbuf 0 since it's lowered now. +- gallivm/nir: avoid using params->info +- llvmpipe/fs: move some tgsi checks in nir path to nir code. +- llvmpipe/cs: convert to using tgsi->nir +- llvmpipe/cs: drop tgsi for compute/mesh/task shader internals. +- lavapipe: use vk_buffer common code. +- lavapipe: use vk_buffer_range common code. +- llvmpipe/fs: switch to using tgsi->nir instead of handling tgsi +- llvmpipe/analyse: drop TGSI path. +- llvmpipe/fs: start using nir info in some places. +- llvmpipe/fs: drop the simple shader logic +- llvmpipe/fs: rewrite output finding using nir. +- nvk: add build_id linker argument. +- nir/gather: add support for fbfetch and bindless image loads. +- llvmpipe/cs: further cleanups after tgsi removal. +- llvmpipe: move to nir lowering for fquantize2f16 +- rusticl: don't store ptrs to nir_variables across opt passes. +- llvmpipe: enable f16 paths on aarch64. +- clover/llvm: move to modern pass manager. +- nir: use a _clone so users calling their variable clone don't get a warning +- nir: rename nir_inline_functions.c to nir_functions.c +- nir: use nir_function_instructions_pass in the inliner. +- nir: move the libclc lowering over to functions file. +- nir/functions: use helper to get function for a name. +- nir/functions: put link state into a struct +- nir/functions: move linker pass to new helper +- nir: add nir function clone +- nir: don't inline linked functions +- gallivm/nir: split prepasses out to make per-function work easier. +- gallivm: rework translator to allow per-impl work. +- spirv/nir: parse function control and store in nir. +- nir: add driver_functions option to avoid inlining. +- nir: add a function usage tracker +- rusticl: use cleanup funcs +- gallivm: add support for function calling +- llvmpipe/cs: add support for function calls. +- llvmpipe: enable driver functions. +- radv: don't emit event code on video queues. +- spirv: use a pointer sized int type for opencl event_t +- clover: fix parameter arguments since recent translator changes. +- radv/video: take db alignment into account when allocating images. +- ac,radeonsi: move vcn enc structs to common +- ac,radeonsi: move vcn enc av1 default cdf file to common +- nir: add a deref slot counter that handles compact +- llvmpipe/linear: drop tgsi path. +- gallivm: drop tgsi aos paths. +- llvmpipe/nir: call gather info to update inputs read properly +- llvmpipe/fs: start converting interp/input paths to nir. +- llvmpipe/fs: start converting dervied state to nir based. +- llvmpipe/linear: convert to using nir for output. +- llvmpipe/linear: move to nir inputs +- draw/mesh: reset some user state values on mesh draws. +- llvmpipe/fs: fix regression in sample mask handling from tgsi removal. +- llvmpipe: reset viewport_index_slot in fb bind +- llvmpipe/cs: migrate to generic jit texture from pipe code. +- llvmpipe/cs: migrate cs image handle to common jit code. +- lavapipe: fix some whitespace in advance of other changes. +- lavapipe: fix subresource layers asserts +- lavapipe: support host image copying on compressed texture formats +- llvmpipe: don't create texture functions for planar textures. +- lavapipe: don't emit blit src/dst for subsampled formats. +- llvmpipe: don't support planar formats for buffers. +- lavapipe: convert sampler to use vk base class. +- lavapipe: cleanup copy code to use a local region variable. +- lavapipe: start introducing planes structure. +- lavapipe: allocate image and image view planes. +- lavapipe: handle planes in copies +- lavapipe: handle planes in get image sub resource +- lavapipe: add descriptor sets bindings for planar images +- lavapipe: handle planes in texture lowering. +- lavapipe: expose planar ycbcr formats and new ycbcr features +- lavapipe + docs: update ycbcr extension enables. +- intel-clc: avoid using spirv-linker. + +David Heidelberg (82): + +- ci/freedreno: update a530 flakes +- ci: build kernel in gfx-ci/linux and just use binaries in Mesa3D CI +- ci: update kernel to 6.3.13 +- ci/freedreno: add fails introduced by upreving to 6.3.13 +- Revert "lima/ci: temporarily disable deqp-egl tests due to timeouts" +- ci/radeonsi: stoney arb_timer_query got fixed between kernel 6.3.1..13 +- ci/lima: EGL testing was disabled when fp16 fail was removed +- ci/freedreno: fix unexpectedpass flake on a630 +- ci/freedreno: add another a530 flakes +- ci: add quirk for GitLab assuming changes is always true for scheduled runs +- ci/microsoft: when re-enabling Windows Farm, always run the container +- ci/freedreno: add a530 flakes, remove one fail which recently started passing +- ci/panfrost: introduce OpenGL testing with Mali-G57 MP5 on Asurada chromebook +- ci/freedreno: cover all texture gather flakes +- ci/freedreno: add a530 flake vs-lessthanequal-uvec4-uvec4 +- ci/farms: always compare the code against main repository +- Revert "ci/farms: always compare the code against main repository" +- ci/kernel: add amd patch to prevent crashes when starting X +- ci/kdl: remove extra-verbose ls command +- ci/nouveau: add 20 minutes timeout to gk20a and align gm20b +- ci/freedreno: document another mapbuffer flake on a530 +- ci/amd: fix timeouting radeonsi-raven-va-full job +- docs/ci: default to port 80 for the caching proxy +- docs/ci: update to systemd and used version of the trace for testing +- docs/ci: remove default nginx config, which we don't need for proxy +- bin/ci: handle errors more gracefully in update_traces_checksum script +- ci/freedreno: document another flakes on Adreno 530 +- ci: add perfetto into mesa git-cache +- ci/panfrost: re-enable t760 and t860 traces as a nightly job +- CI: Re-enable G52 Vulkan testing +- ci/panfrost: t760-gles is nightly job, test also GLES 3 and 3.1 +- ci/zink: Add flake seen in the wild +- ci/build: limit debian-build-testing to 30 minutes +- ci/amd: add glx\@glx-visuals-depth flake to raven +- ci/freedreno: document vs-nested-return-sibling-loop2 flake on Adreno 530 +- ci/farms: enabled Microsoft job only when conditions are met +- ci/deqp: really remove the uncompressed results.csv file +- ci/baremetal: do not install curl, it's already there +- ci/baremetal: shorten BM_KERNEL to filename and BM_DTB to name only +- ci/freedreno: document another a530 flake batch +- ci: remove LAVA prefix from variables which can be used also elsewhere +- ci/zink: drop a630, which we currently have very low amount available +- ci/freedreno: the tag belongs to the apq8016 only +- ci/freedreno: switch references, the farm-rules takes care about this +- ci/freedreno: handle disabling farm properly for each FD/Collabora farm +- ci/freedreno: another batch of Adreno 530 flakes +- gtest: backport ansi color fix +- ci: disable Material Testers.x86_64_2020.04.08_13.38_frame799.rdc trace +- panfrost/ci: revert Disable T720 +- ci/piglit: add extra space on top to prevent single quote getting into URL +- ci/freedreno: There is only one King of Town. +- ci: switch to 6.4 kernel, improving Adreno 660 reliability +- ci/iris: add GL46.arrays_of_arrays_gl.SizedDeclarationsPrimitive timeout +- ci/panfrost: add G52 flakes +- ci/panfrost: we have enough device, parallelize Vulkan tests +- ci/virgl: flakes in functional.draw_buffers_indexed group +- ci/freedreno: add another a530 flake +- ci/panfrost: add G52 simple_tests.partial_image_pot_same_format_noclear flake +- panvk: architecture isn't invalid, just unsupported +- panvk: catch unsupported arch in the panvk_physical_device_init +- Revert "ci: disable a660 jobs" +- docs: add LAVA farm informations +- ci: disable Google Freedreno farm, currently timeouting on all jobs +- Revert "ci: disable Google Freedreno farm, currently timeouting on all jobs" +- ci/farms: no need to check RUNNER_TAG for Collabora farm +- ci/traces: extend no-output timeout by 5 minutes +- ci/venus: add fragment.32B_in_memory_with_vec4_s32 flake +- iris: do not mention specifically clover for OpenCL support +- ci/freedreno: disable broke cheza (Adreno 630) runners +- ci/bare-metal: correct workaround for R8152 issue while retrieving TFTP data +- ci/bare-metal: drop unused imports, sort, use SPDX license +- ci/lima: farm is down, disable for now +- ci: do not report failed job when flakes reporting fails +- ci/freedreno: re-enable Cheza (Adreno 630) runners +- ci/traces: upload only missing trace images +- ci/traces: keep images for every job except the performance testing +- ci/traces: rename upload function to reflect it works with S3 +- ci/traces: always export piglit EXTRA_ARGS +- ci: ci_marge_queue.py +- ci/freedreno: fix copy paste causing a618_gl being run only in manual pipeline +- ci/freedreno: disable Adreno 660 Vulkan pre-merge +- ci/traces: drop the freedoom-phase2-gl-high.trace + +David Rosca (70): + +- radeonsi: Use DIV_ROUND_UP instead of ALIGN_POT +- frontends/va: Skip processing buffers already converted with EFC +- frontends/va: Don't use EFC with scaling or filtering enabled +- radeonsi/vcn: Don't use chroma in AV1 encode with RGB input +- frontends/va: Parse H264 SPS for video signal parameters +- frontends/va: Parse HEVC SPS for video signal parameters +- frontends/va: Add postproc support for converting to full range +- radeonsi/vcn: Set H264 video signal parameters in bitstream +- radeonsi/vcn: Set HEVC video signal parameters in bitstream +- radeonsi/vcn: Enable full/limited range support for H264/HEVC/AV1 +- radeonsi/vcn: Fix setting color range in AV1 bitstream +- gallium/auxiliary/vl: Fix RGB->YCbCr full range matrix +- gallium/auxiliary/vl: Handle UV subsampling in compute_shader_yuv +- gallium/auxiliary/vl: Fix blurry output of compute_shader_yuv +- frontends/va: Add YUV420 to NV12 postproc conversion +- gallium/auxiliary/vl: Fix chroma and blurry output of cs video_buffer +- gallium/auxiliary/vl: Fix chroma offset of compute_shader_weave +- frontends/va: Also map VAImageBufferType for reading +- frontends/va: Alloc interlaced surface for interlaced pics +- frontends/vdpau: Alloc interlaced surface for interlaced pics +- radeonsi: Don't prefer interlaced for video decode +- ci/amd: Skip VAAPI CreateSurfacesWithConfigAttribs/1121 test +- frontends/va: Don't allow multi-plane derive without driver support +- frontends/va: Init view_resources array in vlVaPut/GetImage +- radeonsi: Copy all planes with multi-plane staging textures +- radeonsi: Enable PIPE_VIDEO_CAP_SUPPORTS_CONTIGUOUS_PLANES_MAP +- ci/amd: Skip all VAAPI tests that creates too many huge surfaces +- radeonsi/vcn: Update rate control when framerate changes with HEVC +- frontends/va: Ignore requested size when creating VAEncCodedBufferType +- gallium/auxiliary/vl: Set correct csc matrix in set_buffer_layer +- radeonsi/vcn: Fix leaking fences in decode +- gallium/auxiliary/vl: Add BT.709 full csc matrix +- frontends/va: Set csc matrix in postproc +- gallium/auxiliary/vl: Don't set csc matrix in video_buffer/rgb_to_yuv_layer +- frontends/va: Add BT.709 as supported postproc color standard +- Revert "radeonsi/vcn: add an exception of field case for h264 decoding" +- gallium/auxiliary/vl: Set vertex element src_stride in vl_deint_filter +- gallium/auxiliary: Fix util_compute_blit half texel offset with scaling +- gallium/auxiliary/vl: Map range when updating constants +- gallium/auxiliary/vl: Clamp coordinates in compute shaders +- gallium/auxiliary/vl: Support chroma sample location in compute shaders +- frontends/va: Support chroma sample location in postproc +- frontends/va: Flush after unmapping VAImageBufferType +- frontends/va: Parse chroma sample location in H264/HEVC SPS +- radeonsi/vcn: Set H264/HEVC chroma sample location in bitstream +- radeonsi/vcn: Don't hang GPU when using DCC surface as encoder input +- frontends/va: Track surfaces in context +- frontends/va: Destroy fences when destroying surface or context +- radeonsi/vcn: Implement destroy_fence vfunc +- frontends/va: Process VAEncSequenceParameterBufferType first in vaRenderPicture +- frontends/va: Set default rate control values once when creating encoder +- gallium/auxiliary/vl: Add RGB to YUV compute shader +- gallium/auxiliary/vl: Use chroma offset in YUV to RGB weave compute shader +- gallium/auxiliary/vl: Fix YUV to RGB bob compute shader deinterlacing +- gallium/auxiliary/vl: Only map the shader constants buffer in render +- frontends/va: Add High Quality preset mode +- radeonsi/vcn: Add High Quality encoding preset for AV1 +- radeonsi: Fix plane size in si_copy_multi_plane_texture +- frontends/va: Implement vaMapBuffer2 +- frontends/va: Fix locking in vlVaBeginPicture +- frontends/va: Parse H264 SPS for max_num_reorder_frames +- util/vl: Fix vl_rbsp parser with bitstreams without emulation bytes +- frontends/va: Fix parsing packed headers without emulation bytes +- radeonsi/vcn: Add encode support for H264 B-frames +- frontends/va: Map decoder and postproc surfaces for reading +- radeonsi: Fix offset for linear surfaces on GFX < 9 +- gallium/auxiliary/vl: Fix coordinates clamp in compute shaders +- gallium/auxiliary: Fix coordinates clamp in util_compute_blit +- gallium/auxiliary/vl: Scale dst_rect x0/y0 when rendering chroma plane +- util/rbsp: Fill bits twice if reading more than 16 bits + +Derek Foreman (2): + +- vulkan/wsi: Allow binding presentation_timing when software rendering +- vulkan/wsi: warn about unset present_mode in PresentModeCompatibilityExt + +Dmitry Baryshkov (3): + +- gallium: move kmsro definition to the bottom of the file +- gallium: unbreak kmsro/freedreno case +- tu: Pass real size of prime buffers to allocator + +Dmitry Osipenko (3): + +- util/cache_test: Re-add test for disabled cache +- util/cache_test: Fix disabled cache test using SHADER_CACHE_DISABLE_BY_DEFAULT +- util/cache_test: Add test for get/put() with disabled cache + +Dor Askayo (1): + +- nouveau: add exported GEM handles to the global list + +Dr. David Alan Gilbert (6): + +- rusticl/core: Add profiling time storage (queued) to event +- rusticl: Wire the 'queued' profiling time up +- rusticl: Wire the 'submit' profiling time up +- rusticl: Wrap pipe queries +- rusticl: Wrap pipe query reads +- rusticl: Wire the 'start' and 'end' profilng times up + +Dylan Baker (4): + +- VERSION: bump to 23.3.0-devel +- docs: Update release calendar for 23.2.0-rc1 +- docs: truncate feature list for 23.3-devel +- meson: use a single dependency call for lua + +Echo J (5): + +- nvk: Fix some cast defines +- nvk: Add A8B8G8R8_*_PACK32 format support +- nvk: Add bufferImageGranularity limit +- nvk: Reset offset value in ResetDescriptorPool +- nil: Add A4B4G4R4_UNORM format support + +Emma Anholt (111): + +- ci/radv: Clarify when the ANGLE GS failures started happening. +- ci: Uprev ANGLE to 0518a3ff4d4e ("Android: Simplify power metrics collection") +- ci/tgl: Improve the info for ANGLE's MSAA regression on TGL. +- ci/tu: Add more crash cases for the multithreading bugs caught on a630. +- ci/tu: Mark descriptor_buffer.basic.limits as failing in gmem too. +- ci/tu: Drop some xfails for !24086 +- tu: Fix data race in userspace VMA management. +- ci/a5xx: Add another GPU hanging piglit test to the skips. +- Revert "ci: Disable nouveau CI" +- nvk: Avoid strict aliasing warning in the pushbuffer encoding. +- nvk: Fix uninitialized result usage in NVK_DEBUG_ZERO_MEMORY. +- nvk: Fix unused result warnings in pushbuf resets. +- nvk: Remove duplicate (disabled) point sprite setup. +- nvk: Fix missing init of the stages to sync against. +- nvk: Use depth_clamp_enable to select PIXEL_*_Z_CLAMP. +- nouveau/winsys: Fix an undefined use in the error path. +- nvk: Quiet a compiler warning. +- nvk: Clean up redundant vendor checking for physical device creation. +- nvk: Add support for probing as a platform device. +- nvk: Disable shaderStorageImageReadWithoutFormat pre-Maxwell. +- freedreno/a5xx: Fix border color structure size. +- freedreno/a5xx: Skip emitting unused texture descriptors for images. +- freedreno/ir3: Move pvtmem per-fiber size alignment to the compiler. +- ci/freedreno: Drop a bunch of stale a530 xfails. +- ci/freedreno: Sort another a530 xfail with its friends. +- ci/freedreno: Update comments for some a530 xfails. +- ci/freedreno: Add some more db820c xfails. +- freedreno/devices: Move fibers_per_sp to the common info struct. +- freedreno/devices: Set num_sp_cores explicitly for pre-gen6. +- freedreno/a6xx: Move pvtmem allocation to ir3_gallium. +- freedreno/a3xx: Add the shift for MEMSIZEPERITEM according to db410c docs. +- freedreno/a5xx: Refactor SHADER_OBJ emit to a helper function. +- freedreno/a5xx: Set num_sp_cores and set PC/VFD_POWER_CNTL accordingly. +- freedreno/a5xx: Add private mem support. +- freedreno/cffdec: Fix decode on pixel 2 blob's COMPUTE_CHECKPOINT +- ci/freedreno: Add a regression test for decoding a540 blob's compute shaders. +- freedreno: Fix crashdec pre-a6xx. +- freedreno/a5xx: Skip SSBO emit when none are enabled. +- vulkan/util: Make multialloc succeed with 0 allocations. +- turnip: Track the first/last subpass an attachment is used in. +- turnip: Skip emitting empty CP_COND_REG_EXEC. +- turnip: Save the renderpass's clear values in the cmdbuf state. +- turnip: Move gmem clears and loads to the first subpass that uses them. +- turnip: Move sysmem clears to the first subpass that uses them. +- ci/freedreno: Skip some tests on a5xx that destabilize other tests. +- freedreno/a3-5xx: Don't try to emit ISAM for SSBO loads. +- ci/turnip: Add a660 VK coverage. +- disk_cache: Disable the "List" test for RO disk cache. +- blorp: Disable unaligned partial HIZ fast clears for HIZ_CCS too. +- intel/fs: Move defin/defout setup to the start of the loop. +- intel/fs: Move the defin[]/defout[] screening up to livein[]/liveout[] setup. +- intel/fs: Simplify compute_start_end(). +- ci/freedreno: Add another excessive-constlen UBO skip. +- ci/anv: Drop DEQP_VER:vk setting. +- ci/anv: Drop "-vk" from the job name. +- ci/anv: Add a manual full VK run for TGL. +- ci/anv: Add testing on JSL. +- freedreno: Build drm subdir before perfcntrs, which uses it. +- ci/intel: Add various updates from our nightly runs. +- ci/virgl: Disable virgl-iris-traces. +- ci/zink: Add a few updates for anv/tgl from the nightly runs. +- ci/fastboot: Use a case insensitive match for a fastboot line. +- ci/etnaviv: Skip some tests that hang the GPU and knock out other tests. +- ci/etnaviv: Drop some gc2k flakes that I think are resolved. +- ci/anv: Drop incorrect xfail addition for TGL +- ci/anv: Drop the 16bit.scalar.13 skip. +- ci/etnaviv: Minor xfail/flake polishing. +- ci/etnaviv: Skip a GLES2 test that times out the asan job. +- ci/zink: Skip more doubles tests on anv that flake at 3 minute timeouts. +- ci/docker: Clear the results file before starting a new deqp test run. +- ci/crocus: Add a related flake to a known one. +- ci/etnaviv: return gl-1.4-tex1d-2dborder as a known flake +- ci/crocus: Add known piglit flakes +- ci/hasvk: Add a bunch of new CTS border color fails. +- i915: Re-clang-format and enforce it in CI. +- i915: Print the relevant counts vs limits when throwing errors. +- i915: Don't log I915_DEBUG=fs output for blit shaders. +- i915: Save fragment program compile error messages in the fragment shader. +- i915: Do a test compile at glLinkShader() time. +- i915: Make exceeding tex indirect count fatal. +- i915: Use nir_group_loads() to reduce texture indirection phases. +- ci/crocus: Generalize the drawarrays-vertex-count flakes. +- ci/zink: Skip 3-minute-long glx-visuals timeouts. +- ci/zink: Skip dmat[34] op tests in general, as well +- ci/crocus: Disable flaky unvanquished-ultra trace +- nir/print: Decode system values in the variable declarations. +- ci/zink: Add a TGL flake that's showed up in nightlies recently. +- ci/radeonsi: Drop an xfail for vangogh. +- i915: Make I915_DEBUG=fs log shaders that fail to link due to CF. +- nir: Flatten ifs with discards in nir_opt_peephole_select for HW without CF. +- glsl: Remove lower_discard(). +- ci/zink: Only test half of piglit pre-merge on anv. +- ci: Stop doing internal retries in bare-metal. +- ci/bare-metal: Drop the 2 vs 1 exit code from poe_run. +- ci/bare-metal: Default our boards to a 20-minute timeout for the whole job. +- ci/iris: Drop parallel on kbl piglit to 2. +- ci/freedreno: Fold a630_egl into a630_gl. +- ci/freedreno: Move skqp testing to a618. +- ci/zink: Cut zink-lvp coverage in half. +- ci/freedreno: Generalize the implicit_unmap timeouts. +- ci_run_n_monitor: Poll mesa/mesa and user/mesa for pipelines at the same time. +- glx: Delete support for GLX_OML_swap_method. +- ci: drop skip for glx-swap-copy. +- dri: Drop a duplicate mesa vs pipe format table. +- docs/ci: Drop old instructions for farm disabling +- docs/ci: Add some links in the CI docs to how to track job flakes +- glsl: Remove int64 div/mod lowering. +- llvmpipe: Set nir_lower_dround_even. +- nir: Add nir_lower_dsign as 64-bit fsign lowering. +- glsl: Retire dround lowering. +- ci_run_n_monitor: Always resolve --rev arguments for looking up pipelines. + +Eric Engestrom (194): + +- ci: avoid running hardware jobs if lint fails - now on LAVA too! +- ci: avoid running hardware jobs if lint fails - now on Windows too! +- ci: replace copy of nouveau rules with reference +- ci: drop leftover kernel configs +- ci: use !reference for scheduled_pipeline retry rule +- ci: add .llvmpipe-manual-rules and use it +- ci: add .gallium-core-rules and use it instead of gallium_core_file_list anchor +- ci: replace llvmpipe_file_list anchor with reference +- ci: replace softpipe_file_list anchor with reference +- ci: replace lavapipe_file_list anchor with reference +- ci: replace iris_file_list anchor with reference +- ci: replace radv_file_list anchor with reference +- ci: replace radeonsi_file_list anchor with reference +- ci: replace virgl_file_list anchor with reference +- ci: move etnaviv files rules to src/etnaviv/ci/gitlab-ci.yml +- ci: move freedreno files rules to src/freedreno/ci/gitlab-ci.yml +- ci: move nouveau files rules to src/gallium/drivers/nouveau/ci/gitlab-ci.yml +- ci: move panfrost files rules to src/panfrost/ci/gitlab-ci.yml +- ci: move broadcom files rules to src/broadcom/ci/gitlab-ci.yml +- ci: move lima files rules to src/gallium/drivers/lima/ci/gitlab-ci.yml +- ci: move amd files rules to src/amd/ci/gitlab-ci.yml +- ci: move microsoft files rules to src/microsoft/ci/gitlab-ci.yml +- ci: move zink files rules to src/gallium/drivers/zink/ci/gitlab-ci.yml +- ci: move virtio files rules to src/virtio/ci/gitlab-ci.yml +- ci: move intel files rules to src/intel/ci/gitlab-ci.yml +- ci: move virgl files rules to src/gallium/drivers/virgl/ci/gitlab-ci.yml +- ci: move llvmpipe files rules to src/gallium/drivers/llvmpipe/ci/gitlab-ci.yml +- ci: move softpipe files rules to src/gallium/drivers/softpipe/ci/gitlab-ci.yml +- ci: move lavapipe files rules to src/gallium/drivers/lavapipe/ci/gitlab-ci.yml +- ci: delete install.tar after extracting it to avoid re-uploading it +- docs: add release notes for 23.1.4 +- docs: add sha256sum for 23.1.4 +- docs: update calendar for 23.1.4 +- asahi: drop unused include paths +- ci/lint: deduplicate formatting check jobs +- ci/lint: also print a diff for rust format issues +- ci: allow hw jobs even if lint jobs fail for non-Marge pipelines +- ci: print rustfmt's version +- ci: print clang-format's version +- bin/ci_run_n_monitor: get git sha from pipeline if specified, instead of requiring --rev to match +- lavapipe/ci: use tighter changes: rules +- ci: add a 10min job timeout to formatting checks +- ci: reduce bare-metal retries of poe_run to only 3 attempts +- broadcom/ci: reduce vc4-rpi3-gl timeout to 30min (instead of 1h) +- broadcom/ci: reduce v3d-rpi4-gl timeout to 30min (instead of 1h) +- broadcom/ci: reduce v3d-rpi4-traces timeout to 30min (instead of 1h) +- broadcom/ci: reduce v3dv-rpi4-vk timeout to 30min (instead of 1h) +- ci: add .core-rules to .gallium-core-rules +- ci: drop rule for non-existent src/include/ +- docs: add release notes for 23.1.5 +- docs: add sha256sum for 23.1.5 +- docs: update calendar for 23.1.5 +- ci: include some timing information in the git cache download script +- docs/ci: stop trying to enumerate drivers that are tested using VK-GL-CTS +- docs/ci: in paragraph about the CI being overwhelmed, mention our tool to help with that +- docs/ci: drop mention of build systems variants in the CI +- docs/ci: expand the description of test suites +- bin: add wrapper to run scripts in a python venv +- bin/ci/ci_run_n_monitor: use venv wrapper +- bin/ci/gitlab_gql: use venv wrapper +- bin/ci/update_traces_checksum: use venv wrapper +- bin/pick-ui: use venv wrapper +- ci: include mold in x86_64_test-base & rootfs images +- ci: use mold to build deqp +- zink/ci: set the default timeout for zink jobs to 30min instead of 1h +- egl: make _eglFilterConfigArray static +- egl: fixup _eglFilterConfigArray() params and drop _eglFallbackMatch() wrapper +- ci: build nvk +- ci: document max image tag length +- docs/radv: mark VK_EXT_tooling_info as implemented +- docs/radv: mark VK_INTEL_shader_integer_functions2 as implemented +- git-blame-ignore-revs: repeat instruction on how to enable to avoid having to look for it +- git-blame-ignore-revs: add radv formatting commit +- git-blame-ignore-revs: add pvr formatting commit +- meson: fix indentation +- docs/v3dv: mark direct display extensions as implemented +- ci: reorder vk drivers alphabetically in debian-vulkan job +- ci: build hasvk in debian-vulkan job +- ci/zink+radv: set a timeout of 2x the normal runtime +- amd/ci: drop duplicate test expectations +- panfrost: upcast uint8/uint16 before shifting them beyond their range +- ci/a530: document piglit flake +- docs: add release notes for 23.1.6 +- docs: add sha256sum for 23.1.6 +- docs: update calendar for 23.1.6 +- docs: add one more 23.1.x release +- ci: rename \*.log to \*.txt to work around gitlab bug +- ci/freedreno: reuse freedreno_gl_file_list instead of re-definining it +- egl: bump extension string length +- vc4: drop duplicate .lower_ldexp +- zink: fix format in zink_make_{image,texture}_handle_resident() +- v3dv: fix VK_PIPELINE_ROBUSTNESS_{BUFFER,IMAGE}_BEHAVIOR_DEVICE_DEFAULT_EXT copy/paste typo +- v3dv: fix copy/pasted type of \`sample` +- v3dv: fix shader stage name in error message +- v3d/qpu: fix type of function argument +- ci/deqp: backport fix for dEQP-EGL.functional.wide_color.*_888_colorspace_* +- ci/farm-rules: fix missing valve-infra jobs in scheduled pipelines +- bin/ci_run_n_monitor: error out if both --project and --pipeline-url are passed +- ci: document farm rules +- ci/b2c: skip install.tar extraction if the tarball is not present +- ci/b2c: don't allow failures in test script preparation +- ci/b2c: assert that install folder is present whether or not the tarball was extracted +- ci/amd: split the polaris10 rules into one for each farm +- ci: skip containers & build jobs when disabling a farm +- docs: add release notes for 23.1.7 +- docs: add sha256sum for 23.1.7 +- docs: update calendar for 23.1.7 +- docs: add one more 23.1.x release +- ci: taking igalia farm offline +- ci/b2c: drop logic to remove install.tar +- ci: drop clover leftover +- Revert "ci: taking igalia farm offline" +- bin/ci_run_n_monitor: print in which repo we're looking for the pipeline +- bin/ci_run_n_monitor: automatically pick MR pipelines when they exist +- ci: remove duplicate fork pipeline in MRs +- ci_run_n_monitor: add comment to explain "MR > fork" logic +- ci: don't run everything just because a farm gets re-enabled +- ci/windows: centralize definition of windows runners tags +- ci/windows: add windows docker runner tags to .windows-docker-vs2019 +- ci/windows: drop build rules from test jobs +- ci: document which image tags need to be bumped when updating piglit +- ci: document which image tags need to be bumped when updating {alpine,debian,fedora}/x86_64 +- ci/farm-rules: rename .disable-farm-mr-rules to make it clear it's only about MRs +- ci/farm-rules: re-add "run every container and build job when a farm gets re-enabled" +- ci/zink: drop redundant \`MESA_LOADER_DRIVER_OVERRIDE: zink` +- docs: add release notes for 23.1.8 +- docs: add sha256sum for 23.1.8 +- docs: update calendar for 23.1.8 +- docs: add another 23.1.x +- ci: limit build jobs to 30min so that they can retry when they go wrong +- docs: drop outdated and redundant note about the minimum meson version +- ci/zink+radv: specify that zink-radv-navi10-valve should run in the mupuf farm +- ci/zink+radv: bump the timeout of zink-radv-navi10-valve by 10 minutes +- docs: add calendar for 23.3 +- ci: unify container and build jobs rules +- docs/meson: drop mention that our meson is ready +- ci/docs: drop extra overwritten rules +- ci/zink+radv: document flake +- docs: document the merging process and what is allowed or not +- ci: drop unused shader-db clone + build from alpine image +- ci: drop unused shader-db clone + build from fedora image +- ci: move shader-db clone/build into its own script +- ci/deqp-runner: fix indentation +- ci/deqp-runner: restore exit-on-error after getting deqp-runner's exit code +- ci: fix shebang in build-deqp-runner.sh +- docs: add release notes for 23.1.9 +- docs: add sha256sum for 23.1.9 +- docs: update calendar for 23.1.9 +- ci: drop unused ephemeral packages in alpine image +- docs/ci: rewrite the "farm maintenance ^ other change" rule to mean what we actually meant +- ci: skip dEQP-VK.api.driver_properties.conformance_version for everyone +- pick-ui: use assignment expressions +- pick-ui: use more expressive variable names +- pick-ui: add \`Backport-to: XX.Y` nomination +- v3d/ci: move traces job to wayland +- ci: print deqp version in the job log +- ci/b2c: move to the shiny new \`gfx-ci/ci-tron` repo +- ci/b2c: use latest mesa-trigger image +- include/dri_interface.h: restore define mistakenly removed in !25587 +- ci_run_n_monitor: dependency jobs must always be started +- util/xmlconfig: drop driInjectDataDir() now that DRIRC_CONFIGDIR is always supported +- util/xmlconfig: inline datadir +- ci/b2c: change artifacts path to match baremetal and LAVA +- VERSION: bump for rc1 +- .pick_status.json: Update to e64a97694ac9dc97f65e1a8e91a5c9789109fd2c +- .pick_status.json: Update to 4cdd094ae1e97d857a6b9dbc291d7bbe6ea266ac +- .pick_status.json: Update to e4a1bc70dd739ca8addddc940af08312b038e288 +- .pick_status.json: Update to faed5d647f2416bb0ce3a9d33a3955169c70dc52 +- VERSION: bump for 23.3.0-rc2 +- .pick_status.json: Update to 1f1ec1c6bcc2a32a3c1df8c2cc7a2f4e7139b7ec +- .pick_status.json: Mark 8dda860f83ac30d042dc6beb4438cc925d1fd130 as denominated +- .pick_status.json: Update to 7d6f9ccfbeab050c26775d5e03578a01526cbfcb +- .pick_status.json: Update to aa33ca0a52591961f8ae01dc253354462ed17c18 +- .pick_status.json: Update to a77ea9555aa00cc12f3d1c440252e940ff552500 +- .pick_status.json: Mark 227300345ed38377190b0eaf08694d5c42ee7e60 as denominated +- VERSION: bump for 23.3.0-rc3 +- .pick_status.json: Update to 56451ce773c11094a8c08fdc6b500bb8bdcf37e1 +- .pick_status.json: Mark fa7ec4226bdf48bf63438e303af83ecd58ec95f2 as denominated +- .pick_status.json: Update to 08f851f4361cfbdb211dc70d03cf3ebff331c3ee +- .pick_status.json: Update to 03a7cb261828b350dd9b56bd74850197ca9eba33 +- .pick_status.json: Mark fcfa68a632e5711cc657b103c9a0384928e9bf49 as denominated +- VERSION: bump for 23.3.0-rc4 +- .pick_status.json: Update to f05688aa3299a27430119b27e45181a6f415bff8 +- egl/dri2: increase NUM_ATTRIBS to fit all the attributes +- .pick_status.json: Update to f39ed0063b4cd3e5a71efad2d43ce31f574c698d +- .pick_status.json: Update to b07a58157d0b110dbc09a42cffe7046c3200dd3b +- VERSION: bump for 23.3.0-rc5 +- .pick_status.json: Update to f843b14c171299e1696ca6d971ccaa496f60c3ab +- intel/perf: fix regex escaping +- intel/ci: fix .hasvk-manual-rules +- VERSION: bump for 23.3.0 +- Revert "VERSION: bump for 23.3.0" +- docs: add release notes for 23.3.0 +- Revert "docs: add release notes for 23.3.0" + +Erico Nunes (10): + +- lima/ppir: don't optimize loads with different block successors +- lima/ppir: convert to nir_legacy +- lima/gpir: switch to register intrinsics +- egl/drm: fix EGL_EXT_buffer_age with gbm contexts +- lima: fix plbu block stride calculation +- ci: disable lima LAVA lab for maintance +- Revert "ci: disable lima LAVA lab for maintance" +- v3dv: allow headless device without display device +- Revert "ci/lima: farm is down, disable for now" +- v3dv: Rework to remove drm authentication for wsi + +Erik Faye-Lund (30): + +- meson: report with_glvnd in summary +- docs: upgrade bootstrap to 5.3.1 +- docs: expand mobile-menu without js +- panfrost: delete stale editorconfig file +- docs/panfrost: link to lima +- docs/panfrost: use code-blocks with wrapping for long blocks +- docs/panfrost: use math-role to denote powers of two +- docs: fix linkcheck +- docs: update a few links to https +- docs: update anchor for link +- docs: update link to git-wiki +- docs: link to upstream etnaviv +- docs: apply some trivial redirects +- docs: use doc-role when linking to lists article +- docs: keep up with intels ever-moving documentation +- docs: mark some redirects as allowed +- docs: only link to old docs from html +- docs: use html_static_path for static files +- ci/etnaviv: update ci expectation +- ci/etnaviv: allow failure on failing test +- zink: fix wording of warning +- ci/etnaviv: move failure to flake +- meson: add wayland-protocols from meson wrapdb +- util/xmlconfig: add an env-var for overriding drirc search dir +- meson: add src/util to the drirc search path +- docs/relnotes: remove cruft from end of lines +- docs/ci: escape at-symbols +- docs/relnotes: escape some at-symbols +- bin/gen_release_notes: escape at-symbols +- panfrost: use perf_debug instead of open-coding + +Faith Ekstrand (809): + +- nv50/ir: Convert to new-style NIR registers +- nv50/ir: Support vector movs +- intel/fs: Add support for new-style registers +- intel/vec4: Assume get_nir_dest() provides a sane write-mask +- intel/vec4: Add support for new-style registers +- intel: Switch to intrinsic-based registers +- intel/fs: Drop support for nir_register +- intel/vec4: Drop support for nir_register +- anv,hasvk,iris: sampler_prog_key::swizzles is only used on crocus +- nir: Properly handle divergence for load_reg +- nir/trivialize: Maintain divergence information +- nir/trivialize: Trivialize cross-block loads +- vc4: Convert to new-style NIR registers +- nir/schedule: Support load/store_reg +- broadcom/compiler: Convert to new-style NIR registers +- intel/fs: Use write masks from store_reg intrinsics +- intel/fs: Rework the overlapping mov/vec case +- intel/fs: Assume NIR is in SSA form +- nir: Add a backend_flags field to nir_tex_instr +- intel/fs: Add a parameter to speed up register spilling +- nir/builder: Allow tex helpers on image types +- nir/builder: Add a nir_txs_deref() helper +- vulkan: Add a core vk_buffer_view struct +- vulkan: Add a more direct way to use a NIR shader +- vulkan: Add a vk_query_pool base object +- vulkan: Add common vkCmdBegin/EndQuery wrappers +- vulkan/format: Add the remaining 1-plane YCbCr formats +- vulkan: Add a core vk_sampler struct +- nv50/nir: Lower to scratch AFTER optimization +- nouveau: Allow GLSL_SAMPLER_DIM_SUBPASS* +- nouveau/nir: Implement support for compact arrays +- nouveau/codegen: Handle/indirect goes before sample index +- nouveau/codegen: Use a NULL format for PIPE_FORMAT_NONE for images +- nouveau/codegen: Don't convertSurfaceFormat for unknown formats +- nv50/ir: Run nir_divergence_analysis before out-of-SSA +- anv: Use vk_sampler +- anv: Use vk_buffer_view +- vulkan: Add init/finish helpers for vk_query_pool +- anv: Use vk_query_pool +- anv: Use the common versions of vkBegin/EndQuery() +- nir/builder: Don't assume we have compiler options +- Revert "mesa, compiler: Move gl_texture_index to glsl_types.h" +- Revert "compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h" +- vulkan: Use VkBufferUsageFlags2 in vk_buffer +- clang-format: Set ColumnLimit to 78 +- nvk: Implement EnumerateInstanceVersion +- nvk: Add stub implementations of VkImage and VkImageView +- nvk: Add stub implementation of VkSampler +- nvk: Add a stub implementation of VkBuffer +- nvk: Implement VkDescriptorSetLayout +- nvk: Implement VkPipelineLayout +- nvk: Add initial descriptor set lowering +- nvk: Implement vkUpdateDescriptorSets +- nvk: Expose nvk_descriptor_stride_align_for_type +- nvk: Re-format descriptor set layouts +- nvk: Re-format pipeline layouts +- nvk: Re-format descriptor sets some more +- nvk/buffer: Take an offset in nvk_buffer_address +- nvk/buffer: Add a push_buffer_ref helper +- nvk/copy: Use nvk_buffer_address in CmdCopyBuffer +- nvk/image: Add image address helpers +- nvk/copy: Use nvk_image_base_address() +- nvk: Add an nvk_device_physical helper +- nvk: Add a skeleton for pipelines +- nvk: Re-arrange nvk_descriptor_set.h a bit +- nvk: Reformat nvk_nir_lower_descriptors +- nvk: Add a couple descriptor set address helpers +- nvk: Move nvk_cmd_pool cast definitions +- nvk: Rework whitespace in nvk_cmd_buffer.c +- nvk: Add a root descriptor table +- nvk: Fetch descriptor set addresses from the root table +- nvk: Re-arrange nir_lower_explicit_io a bit +- nvk: Lower load_global_constant_offset +- nvk: Drop image_view_init +- nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable +- nvk: Allow R32_UINT +- nvk: Mark nvk_push_descriptor_set_ref() inline +- nvk: Add a descriptor table data structure +- nvk: Copy in the nouveau TIC format table +- nvk/image_view: Reformat and fix Create/DestroyImageView +- nvk: Add an image descriptor table to the device +- nvk: Fill out TIC table entries for image views +- nvk: Set b->cursor when lowering image intrinsics +- nvk: Unify descriptor loading in lower_descriptors +- nvk: Re-format nvk_image_view.h a bit +- nvk: Re-format nvk_buffer.c a bit +- nvk: Add a stub implementation of buffer views +- nvk: Make texture descriptors a bit more acceptable to codegen +- nvk: GART os host-cache-coherent +- nvk: Reserve a null image descriptor +- nvk: Rework descriptor writes +- nouveau: Add stubs for an image layout library called NIL +- nil: Create images +- nil: Add the TIC format table from nouveau +- nil: Add a nil_view and code to fill out TIC entries +- nvk: Add an nvk_get_format helper +- nvk: Use helpers for push_ref +- nvk: Align arguments consistently in copy/blit code +- nvk: Move Fill/UpdateBuffer to nvk_cmd_copy +- Revert "nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable" +- nvk: Manually offset for array layers in copy/blit +- nvk: Convert to using NIL for image layout +- nvk: Re-indent image entrypoints +- nvk: Implement VkGetImageSubresourceLyout +- nvk: Reset and properly clean up command buffer upload areas +- nvk: Rework format features queries +- nvk: Add a more competent GetPhysicalDeviceImageFormatProperties +- nvk: Support compressed images in copy commands +- nvk: Drop vk_sync BO refs after push_submit +- nil: Drop miptail support for now +- nil: Don't minify image dimensions when setting up TIC +- nil: Refactor TIC image extent setup +- nil: Fix image array layer alignments +- nvk: Teture pool sizes are maximums not sizes +- nvk: Re-format nvk_sampler.c +- nvk: Implement samplers +- nil: Add a helper for filling out buffer TIC entries +- nvk: Move is_storage_image_format to nvk_format.c +- nvk: Implement buffer views +- nvk: Advertise KHR_dedicated_allocation +- nvk: Use the correct root descriptor table size for CmdDispatch +- nvk: Add support for dynamic buffers +- nvk: Better advertise image format features +- nvk: Advertise descriptor array indexing +- nvk: Advertise non-zero descriptor set limits +- nvk: Use a descriptor type instead of a hand-rolled thing +- nvk: Handle cube storage images properly +- nvk: Load the requested descriptor size +- nvk: Implement push constants +- nvk: Properly indent a comment +- nvk: Fix descriptor offset alignment +- nvk: Use a switch for descriptor types in load_descriptor +- nvk: Support inline uniform blocks +- nvk: Delete the storage TIC in nvk_image_view_destroy +- nvk: Assert that we don't double-free descriptors +- nvk: Initial vkCmdClearImage support +- nvk: Unconditionally zero image format properties +- nvk: No-op sparse image format properties +- nvk: Advertise minUniformBufferOffsetAlignment +- nvk: Rework OOM handling for descriptor pools +- nvk: Bind immutable samplers on descriptor set creation +- nvk: Padd shader BOs by 4K to avoid I-cache overflow +- nvk: Include nvk_private.h in everything +- nvk: Make image/buffer address helpers const +- nouveau/push: Add a P_INLINE_FLOAT helper +- nvk: Init WSI after setting up supported_sync_types +- nouveau/parser: Fix an integer overflow and a typo +- nouveau/parser: Properly dump most arrays used by 3D +- nouveau/parser: Better dump float data +- nouveau/parser: Handle arrays properly in P_IMMD() +- nouveau/push: Make P_IMMD more versatile +- nouveau: Null terminate the debug flag list +- nouveau: Generate 3D headers +- nvk: Add graphics state to command buffers +- nvk: Split pipeline binding into helpers +- nvk: Switch to vk_pipeline_shader_stage_to_nir +- nvk: Don't free the NIR in nvk_compile_nir +- nvk: Add an nvk_shader_address helper +- nvk: Free pipeline shader BOs +- nvk: Expose pipeline alloc/free functions +- nvk: Make shader_upload take an nvk_device +- nvk/shader: Assign I/O locations and gather info +- nvk/shader: Populate headers for vertex and fragment shaders +- nvk: Add a nvk_cmd_buffer_device() helper +- nvk: Import 3D context init code from nouveau +- nil/format: Add helpers for render formats +- nvk: Add boilerplate for Begin/EndRendering +- nvk: Misc. additional state setup +- nvk: Emit dynamic graphics state +- nvk: Implement push constants and descriptors for graphics +- nouveau: Add CPU push buffers +- nvk: Graphics pipelines +- nvk: Implement vkCmdDraw() +- nvk: Color attachments clears via image clears +- vulkan/meta: Add the start of a meta framework +- vulkan/meta: Add an object tracking list +- vulkan/meta: Add a concept of rect pipelines +- vulkan/meta: Implement attachment clears +- vulkan/meta: Implement start-of-rendering clears +- vulkan/meta: Add implementations of Clear*Image +- nvk: Add an attachment format even for secondaries +- nvk: Add an addr field to nvk_buffer +- nvk: Expose a bind_vertex_buffer helper +- nvk: Use vk_meta for CmdClearAttachments +- nvk: Stop using vk_cmd_set_dynamic_graphics_state in meta_end() +- nvk: Enable all the dynamic state features +- nouveau: Fix pushbuf ref reset for user command buffers +- nvk: add linear image creation support. +- nvk: Use max alignment for descriptor pool sizes +- nil: Switch to using the new headers for TIC entries +- nvk: Use meta for CmdClear*Image +- nvk: Zero client memory objects +- nvk: Bind texture and sampler header pools for 3D +- nvk: Use the new headers for samplers +- nvk: Implement nir_intrinsic_load_frag_coord +- vulkan/meta_clear: Populate VkRenderingInfo::renderArea +- nvk: Don't assert when there are no attachments +- nvk: Track and reference all device memory objects +- vulkan: Allow scissors or viewports to be set without counts +- nvk/copy: Mape bpp part of nouveau_copy_buffer +- nvk: Implement copies for D24_UNORM_S8_UINT images +- nvk: Drop sample locations structs +- nvk/meta: Save and restore VI state +- nvk: Re-initialize dynamic_graphics_state.vi when recycling +- nvk: Move the vertex format table into nvk_format.h +- nvk: Advertise vertex buffer format featues +- nvk: Clean up try_create_physical_device error handling +- nouveau/parser: Dump more fields as float +- nvk: Depth bounds need fui() +- nouveau: Add class information to nouveau_ws_device +- nil: Properly depend on nouveau winsys and nvidia-headers +- nil: Use nvidia headers for texture format enums +- nil: Use the nvidia headers for render target format enums +- nil: Use nvidia headers for ZS format enums +- nil: Rename rt to czt in the format info struct +- nil: Rename rendering to color_target +- nil: Re-introduce the format capabilities +- nil: Add more format support helpers +- nvk: Advertise more format features +- nvk: Clear dynamic state dirty after flushing it all +- vulkan/meta: Make stencil reference dynamic for clears +- nvk: Depth buffers don't allow Z-tiling +- nvk: Disable sparse Z on Maxwell+ +- nil: Compute PTE kinds and tile modes for images +- nouveau: Add a function to allocate a tiled buffer +- nvk: Add internal helpers for device memory allocation +- nvk: Do internal dedicated allocations for ZS images +- nvk: Fix depth/stencil render pass clears +- nvk: Fix viewport Z scale +- nvk: Enable two-sided stencil +- nvk: Flip the front-face setting +- nvk: Advertise depth/stencil support +- nvk: Don't destroy NULL descriptor pool BOs +- nvk: Call nir_lower_input_attachments +- nvk: Set GEOMETRY_SHADER_SELECTS_LAYER properly +- nvk: Return OUT_OF_DEVICE_MEMORY if bo_new fails +- nil: Add a PTE kind for Z32_FLOAT +- nvk: Add nvk_queue_init/finish() helpers +- nvk: Align descriptor buffers to NVK_MIN_UBO_ALIGNMENT +- nvk: Re-flow a couple function prototypes +- nvk: Assert samples == 1 +- nvk: Allocate descriptors for input attachments +- nvk: Wire up early z and post depth coverage +- nvk: Save/restore push constants around meta ops +- nouveau/parser: Add array and float tags for clear values +- nvk: Use hardware clears for attachment clears +- nvk: Add image_view_init/finish functions +- nvk: Implement vkCmdClear*Image directly +- nvk: Use a UINT format to clear non-renderable images +- nvk: Don't advertise tiling on non-power-of-two formats +- nvk: Fix max anisotropy +- nvk: Assert on CmdExecuteCommands +- nvk: VkSamplerCreateInfo::mipLodBias is signed +- nvk: Fix border color alpha +- nil/format: Depth/stencil formats appear as red +- nil: Fix max mip level +- nil: Fix nonnormalized coordinates +- nvk: Set up clip and cull distances +- nvk: Fix dynamic buffer descriptor copies +- nvk: Inline nouveau_copy_linear +- nvk/copy: Rename push to p +- nvk/blit: Rename push to p +- nvk/dispatch: Rename push to p +- nvk: Drop most buffer tracking +- nvk: Rework TLS/SLM and image/sampler table handling +- nvk: Invalidate texture header and sampler caches each submit +- nvk/sampler: Free descriptor table entries +- nvk: Rework nvk_descriptor_table_add/remove +- nvk: Implement descriptor table growing +- nvk: Zero unused descriptors +- nvk: Add some asserts for nv50 compiler image restrictions +- nvk: Update to the new command buffer infrastructure +- nvk: Split nvk_queue into its own file +- nvk: Start every command buffer with a nop +- nvk: Initialize fixed draw/default state once +- nouveau/parser: Convert to mako +- nouveau/parser: Use more idiomatic python +- nouveau/parser: Put the dump helpers in C files +- nvk: Use f for extension features +- nvk: Drop a TODO +- nvk: Use VK_IMAGE_USAGE_*_ATTACHMENT_BIT for image clears +- nvk: Increase the graphics pipeline push space +- nil: Don't claim texture support for 2-bit SNORM +- nouveau/push: Fix a void pointer arithmetic bug +- nouveau/parser: Add more arrays +- nouveau/mme: Add basic structures for the Turing+ MME +- nouveau/mme: Add isaspec XML for the Turing+ MME +- nouveau/mme: Add an assembler and disassembler for the Turring+ MME +- nouveau/mme: Add a builder for the Turing+ MME +- nouveau/mme: Add a tiny simulator for the Turing+ MME +- nouveau/mme: Add an isaspec-based dumper +- nouveau/mme: Make the winsys headers C++ safe +- nouveau/mme: Add unit tests for the Turing+ MME simulator +- nvk: Add MME infrastructure +- nvk: Use MME for clears +- nouveau/mme: Add helper macros for setting fields +- nvk: Use MME for vkCmdDraw[Indexed]() +- nvk: Implement vkCmdDraw[Indexed]Indirect() +- nvk: Use p for the nouveau_ws_push_buffer in zero_vram +- nouveau: Add an nv_push struct +- nouveau: Rename the fields of vk_push +- nouveau: Move nv_push and helpers to their own header +- nouveau/parser: Take a FILE* in DUMP_*_MTHD_DATA +- nouveau: Move push validate to nv_push.c +- nouveau: Move push dumping to nv_push.c +- nvk: Use nv_push directly for graphics pipelines +- nouveau: Add a nouveau_ws_bo_new_mapped helper +- nvk: Use bo_new_mapped for the zero page +- nvk: Always allocate empty_push +- nvk: Move queue_sumbit to nvk_queue_drm_nouveau.c +- nvk: Submit pushbufs directly +- nvk: Use a regular BO for the empty push +- nvk: Use a regular BO for the queue state push +- nvk: Add an nvk_queue_submit_simple helper +- nvk: Initialize the queue later in device setup +- nvk: Use submit_simple for draw state init +- nvk: Use queue_submit_simple for zero_vram +- nvk: Break nvk_cmd_pool into its own file +- nvk: Use cmd instead of cmd_buffer +- nvk: Add BO recycling to the command pool +- nvk: Return VkResult from nvk_cmd_buffer_upload_alloc +- nvk: memcpy root descriptors for compute instead of doing a DMA +- nvk: Fully populate QMDs before uploading +- nvk: Constant buffer alignment is actually 64B +- nvk: Rework side-band data upload +- nvk: Add an nvk_cmd_buffer_push helper +- nvk: Add an nvk_cmd_buffer_ref_bo helper +- nvk: Allocate upload buffers from the command pool +- nvk: Use nvk_cmd_bo for push bufs +- nvk: Implement vkCmdExecuteCommands() +- nvk: Remove remaining references to nouveau_push.h +- nouveau: Use DRM interfaces directly in MME tests +- nouveau: Drop nouveau_ws_push +- nvk: Re-indent vk_instance.c +- nvk: Use vk_object_zalloc/free for descriptor pools/sets +- nvk: Fix up whitespace in nvk_descriptor_set.c +- nvk: Implement VK_KHR_push_descriptor +- nvk: Reference descriptor set layouts in the sets themselves +- nvk: Embed a nv_device_info in nvk_physical_device +- nvk: Add an nvk_queue_submit wrapper +- nvk: Also store the push BO map in nvk_queue_state +- nvk: Bring back push sync and dumping +- nvk: drop nvk_nir.h +- nvk: Add lowering for load_global_constant_bounded +- nvk: Properly implement robustBufferAccess +- vulkan/meta: Add key types +- vulkan/meta: Add a helper for image view types +- vulkan/meta: Add a create_sampler helper +- vulkan/meta: Fixes for clear +- vulkan/meta: Implement vkCmdBlitImage() +- nvk: Support load_layer_id +- nvk/meta: Save/restore descriptor set 0 +- nvk: Use meta for doing blits with the 3D hardware +- nvk: WFI in pipeline barriers +- util/vma: Allow initializing zero-size heaps +- nvk: Rework nvk_queue_submit_simple() +- nvk: Add a heap data structure +- nvk: Return a VkResult from nvk_shader_upload() +- nvk: Add a shader heap to nvk_device +- nvk: Allocate shaders from a heap +- nvk: Rework whitespace in nvk_device_memory.c +- nvk: Style fixes in nvk_physical_device.c +- nvk: Reset semaphore syncs on wait +- nvk/wsi: Style fixes +- nvk/wsi: Use the common present implementation +- nouveau/parser: Parse all fields in each method +- nvk: Add a query pool object +- nvk: Implement timestamp queries +- nvk: Implement pipeline statistics and occlusion queries +- nouveau/mme: Allow ZERO as the destinatio nof mme_load_to +- nouveau/mme: Assert on OOB registers +- nouveau/mme: Add support for freeing registers +- nouveau/mme: Add a couple helpers for working 64-bit addresses +- nouveau/mme: Add a helper for MME_DMA_READ_FIFOED +- nvk: Use mme_tu104_read_fifoed() +- nvk: Implement vkCmdCopyQueryPoolResults() +- nvk: Handle large command buffer uploads better +- nvk: Use a normal DMA for CmdUpdateBuffer +- nouveau/parser: Handle 6F methods +- nvk: Use mme_load_addr64() +- nvk: Use poll for BO waits +- nvk: Events +- nvk: Don't crash if we fail to allocate a push BO +- nvk: Stop leaking command pool BOs +- nvk: Enable VK_KHR_create_renderpass2 +- nvk: Advertise VK_KHR_imageless_framebuffer +- nvk: Flush the current pushbuf before allocating a new one +- nvk: Advertise VK_KHR_separate_depth_stencil_layout +- nvk: Tell WSI we don't support legacy scanout +- nouveau: Add PCI information to nv_device_info +- nvk: Implement VK_EXT_pci_bus_info +- nvk: Bind 3D images as 3D for clears +- nvk: Support copies between 3D and 2D images +- nil: Add a helper for getting 2D views of 3D images +- nvk: Support 2D views of 3D images +- nvk: Advertise VK_KHR_maintenance1 +- nvk: Use 2D array views for 3D storage images +- nil: Fix include guards in nil_image.h +- nvk: Advertise custom border color features +- vulkan: Add a helper for swizzling color values +- nvk: Implement VK_EXT_border_color_swizzle +- nvk: Advertise VK_EXT_extended_dynamic_state3 +- nvk: Move more states to dynamic +- nvk: Advertise VK_KHR_storage_buffer_storage_class +- nvk: Add a helper for pushing descriptors +- nouveau/headers: Add generated headers to dependencies +- nvk: Implement VK_EXT/KHR_buffer_device_address +- nvk: Break the guts of CmdDispatch into a helper +- nvk: Implement DispatchIndirect +- nouveau/mme: Add a mul64 helper +- nvk: Implement CS invocations statistics queries +- nil: Use ONE for the anixotropic coarse spread function +- nil: Properly support MSAA +- nil: Add an offset4d struct and some helpers +- nouveau/parser: Sort METHOD_ARRAY_SIZES +- nouveau/parser: Handle SET_ANTI_ALIAS_SAMPLE_POSITIONS +- nvk: Stop asserting on MSAA +- nvk: Handle zero color attachments better +- nvk: Handle multisampled render targets properly +- nvk: Support copies of MSAA images +- nvk: Use the right view format for stencil texturing +- nvk: Pass through a shader key for fragment shaders and MSAA +- nvk: Set correct multisample regs for graphics pipelines +- nvk: Stop creating a new upload BO every time +- nvk: Fill out sample locations on Maxwell B+ +- vulkan/meta: Bind whole LODs of 3D blit destinations +- vulkan/meta: Add a helper for building texture ops +- vulkan/meta: Break the guts of blit into a helper +- vulkan/meta: Support writing stencil as iterative discard +- vulkan/meta: Rename vk_meta_blit.c to vk_meta_blit_resolve.c +- vulkan/meta: Add support for MSAA resolves +- nvk/meta: Fix restore for descriptor set 0 +- nvk: Use meta for MSAA resolves +- nvk: Replace gl_SamplePosition with fract(gl_FragCoord.xy) +- nvk: Stop advertising higher framebufferNoAttachmentsSampleCounts +- nvk: Advertise MSAA via image format properties +- nvk: Advertise VK_KHR_depth_stencil_resolve +- nvk: Assert that descriptor buffer access stays in-bounds +- nvk: Add a bo size to nvk_descriptor_set +- nvk/format: Style fix for VkFormatProperties3KHR +- nvk: Support VK_FORMAT_B10G11R11_UFLOAT_PACK32 for vertex buffers +- nvk: Add a devenv ICD json file +- nvk: Advertise EXT_vertex_attribute_divisor +- nvk: Lower image_size to txs +- nvk: Fix a comment +- nvk: Add an nvk_buffer_addr_range helper +- nvk: Use nvk_buffer_addr_range for buffer descriptors +- nvk: Re-order Vulkan 1.0 feature bits +- nvk: Enable inheritedQueries +- nvk: Enable VK_EXT_provoking_vertex +- nvk: Advertise samplerMirrorClampToEdge via 1.2 features +- nvk: Advertise VK_KHR_bind_memory2 +- nvk: Enable KHR_dynamic_rendering +- nvk: Advertise KHR_uniform_buffer_standard_layout +- nvk: Advertise EXT_index_type_uint8 +- nvk: Advertise VK_EXT_separate_stencil_usage +- nvk: Capitalize NVK in user exposed strings +- nvk: Rename grid_size to group_count +- nvk: Lower load_num_workgroups ourselves +- nvk: Drop block_size from the root descriptor table +- nvk: Add a helper for loading resource_index-based descriptors +- nvk: Set maxMemoryAllocationCount +- nouveau/winsys: Take a drmDevicePtr in nouveau_ws_device_new() +- nouveau/winsys: Add an info to nouveau_ws_device +- nouveau/winsys: Move device type into nv_device_info +- nouveau/nil: Take an nv_device_info for image functions +- nouveau/nil: Use nv_device_info for format queries +- nouveau/mme: Invoke SET_OBJECT in the tests +- nouveau/mme: Make alu_op_to_str static +- nouveau/mme: Move mme_value into its own header +- nouveau/mme: Add a mme_reg_alloc struct +- nouveau/mme: Add an intermediate MME_ALU_OP enum +- nouveau/mme: Add an intermediate MME_CMP_OP enum +- nouveau/mme: Use mme_mov() for temp copies of register IMM32 sources +- nouveau/mme: Make helpers less Turing specific +- nouveau/mme: Break the Turing builder guts into a separate header +- nouveau/mme: Move the guts of mme_merge_to() into mme_tu104_builder.c +- nouveau/mme: Move the guts of mme_state_arr_to() into mme_tu104_builder.c +- nouveau/mme: Drop the implicit_imm parameter from mme_alu_to() +- nouveau/mme: Move the cf_stack struct to mme_builder.h +- nouveau/mme: Prepare the builder for multiple GPU generations +- nouveau/mme: Take an nv_device_info in mme_builder_init +- Support immediates in MERGE +- Add add immediate optimizations +- nvk: Add support for contiguous heaps to nvk_heap +- nvk: Use a contiguous shader heap pre-Volta +- nvk: Disable indirect draw/dispatch and query copy MMEs for now +- nvk: Free a couple regs in nvk_mme_build_draw_*() +- nvk: Properly align root descriptor tables for pre-Pascal +- nvk: Compile all NIR before running codegen +- vulkan/meta: Insert a geometry shader when needed +- nvk: Use a GS for layerered rendering pre-MaxwellB +- nvk: Handle zero-size index and vertex buffers pre-Turing +- nvk: Cosmetic clean-ups to Create/DestroyDevice +- nil: Only choose a PTE kind for tiled images +- nouveau/mme: Fix is_int18 for negative numbers +- nouveau/mme: Don't swap x and y in mme_fermi_merge_to() +- nouveau/mme: Take a const nv_device_info in mme_builder_init +- nouveau/mme: Unify some of the test framework +- nouveau/mme: Add some generic builder tests +- nouveau/mme: Add builder tests for SUB +- nouveau/mme: Use a uint32_t for size in mme_fermi_bfe() +- nouveau/mme: nouveau/mme: Add builder tests for SLL and SRL +- nvk/drm: Take a byte offset/range in push_add_push +- nvk: Rework nvk_cmd_push a bit +- nvk: Add a helper for pushing indirect data +- nvk: Make some MME builder names more consistent +- nouveau/mme: Don't allow WaW dependencies in the same Turing instruction +- nvk: Reduce register pressure in nvk_mme_build_draw*() +- nouveau/push: Add an NV_PUSH_MAX_COUNT #define +- nvk: Implement Draw*Indirect on pre-Turing +- vulkan/meta: Use the new NIR texture helpers +- nvk: Add a build test for MMEs +- nvk: Don't over-size push descriptor sets +- nvk: Return VK_ERROR_INCOMPATIBLE_DRIVER if the PCI vendor isn't NVIDIA +- nvk: Bump init context batch size +- nouveau/mme: Fix nested while instructions on Turing+ +- nouveau/mme: Add a helper to dump instructions +- nvk: Rework extension enables +- nvk: Rework features enables +- nvk: Advertise shaderImageGatherExtended +- nouveau/mme: Add a bfe helper +- nouveau/mme: Ensure that zero-initizlied mme_value is ZERO +- nvk: De-duplicate MME code for setting draw params +- nvk: Clamp viewport clip to max range +- nvk: Use the same lock for the submit and the memory objects list +- nvk: Advertise ICD/loader interface version 4 +- nvk: Add instace WSI entrypoints +- nouveau/mme: Use ADD for ine with an immediate +- nouveau/mme: Fix while loops pre-Turing +- nvk: Add begin to mme_scratch +- nvk: Use the new load/store_scratch helpers for DRAW_PAD_DW +- nouveau/mme: Add a helper for re-allocating registers +- nvk: Rework spill helpers and DRAW_COUNT spilling +- nvk: Spill DRAW_IDX pre-Turing +- nvk: Break the inner MME draw loop into a helper +- nvk: Increase the push runout to 512 dwords +- nil: Add a nil_image_for_level helper +- nil: Add an image_level_as_uncompressed helper +- nvk: Implement uncompressed views of compressed images +- nvk: Set pointClippingBehavior +- nvk: Expose VK_KHR_maintenance2 +- nvk: Add a separate #define for SSBO alignment +- nvk: Set spirv_to_nir_options::min_*_alignment +- nvk: Use vk_device_memory +- nvk: Implement VK_KHR_map_memory2 +- nvk: Sort SPIR-V caps +- nvk: Advertise EXT_shader_viewport_index_layer on MaxwellB+ +- nvk: Only use view_id for layer in multiview +- nvk/heap: Set the right pitch for heap resize copies +- nvk: Advertise shaderStorageImageReadWithoutFormat +- nvk: Fix the NO_PREFETCH assert for CmdDrawIndirect +- nvk: Advertise KHR_spirv_1_4 +- nvk: s/device/dev in nvk_image.c +- nvk: Add helpers for binding image planes +- nvk: Take an nvk_image_plane in nouveau_copy_rect_image +- nvk: Use the max descriptor alignemtn in GetDescriptorSetLayoutSupport +- nvk: Use NVIDIA_VENDOR_ID in pdev try_create() +- nvk: Use abbreviated names in nvk_device_memory.c +- nvk: Add device and driver UUIDs +- nvk: Add external memory queries +- nvk: Dedicated allocations override internal +- nvk: Require dedicated allocations for external images +- nouveau/winsys: Add dma-buf import support +- nvk: Support dma-buf import +- nvk: Support dma-buf export +- nvk: Enable external memory extensions +- nvk: Reformat nvk_buffer.c +- nvk: Add a buffer alignment helper +- nvk: Add an addr field to nvk_image_plane +- nvk: Use canonical variable names in nvk_physical_device.c +- nvk: Use canonical variable names in nvk_shader.c +- nvk: Use canonical variable names in nvk_bo_sync.c +- nvk: Use canonical variable names in nvk_sampler.c +- nvk: Drop nvk_physical_device::instance +- nvk: Only advertise EXT_pci_bus_info on discrete GPUs +- nouveau: Put PCI info in a pci substruct in nv_device_info +- nouveau: Stop using hex for SM numbers +- nvk: Set deviceType based on nv_device_info::type +- nouveau: Move more stuff into nv_device_info +- nouveau: Move gart_size to nv_device_info +- nvk: Use nv_device_info for class checks +- nvk: Rename nvk_device::ctx to ws_ctx +- nvk: Add a ws_dev to nvk_device and use it +- nvk: Move the winsys device to nvk_device +- nvk: Don't enumerate pre-Kepler GPUs +- nvk: Implement VK_EXT_physical_device_drm +- nvk: Require an environment variable for poorly tested hardware +- nvk: Use the new core vk_sampler struct +- Revert "vulkan: Allow scissors or viewports to be set without counts" +- vulkan/meta: Add a get_pipeline_layout helper +- vulkan/meta: Use vk_meta_get_pipeline_layout in blit/resolve +- nvk: Bind 3D depth/stencil images as 2D arrays +- nvk: Flush more state on VI_BINDINGS_VALID dirty +- nvk: Don't skip zero-size bindings in GetDescriptorSetLayoutSupport +- docs: Add a docs page for NVK +- docs: Add NVK to features.txt +- docs/relnotes: Stick something about NVK in new_features.txt +- nouveau: Drop GART size from nv_device_info +- nil: Add a nil_image_level_extent_px() helper +- nvk: Use the new NIL helper for image level extents for copies +- nvk: Improve image format properties and limits +- nvk: Rework multi-plane format features a bit +- nvk: Use nvk_root_descriptor_offset for drawInfoBase +- nvk: Add a root_desc_addr to the root descriptor table +- nvk: Add support for variable pointers +- nvk: Enable the SPIR-V DeviceGroup capability +- nvk: Separate the MME query copy code out a bit +- nvk: Implement CopyQueryPoolResults with a compute shader +- nvk: Misc. style nits +- nvk: Rework memory requirements to handle aspects correctly +- nvk: Implement the maintenance5 image layout queries +- nvk: Use VkBufferUsageFlags2 +- nvk: Implement CmdBindIndexBuffer2KHR +- nvk: Implement GetRenderingAreaGranularityKHR +- nvk: Decorate CmdBegin/EndRendering entrypoints +- nouveau: Move shader topology info to nv_device_info +- drm-uapi: Import nouveau_drm.h +- nouveau/winsys: Use the imported nouveau_drm.h headers +- nvk: Use the imported nouveau_drm.h headers +- nouveau/shim: Use the imported nouveau_drm.h headers +- nouveau/mme: Support the new UAPI +- nvk: Use an empty EXEC for the empty submit case +- nouveau/winsys: Allow nouveau_ws_device_new() without VM_BIND +- nvk: Print an error message if VM_BIND support is missing +- nvk: Enable the new UAPI +- nvk: Use more consistent device variable names +- nvk: Call nir_lower_int64 +- nir/gl: Move glsl_type::sampler_target() into a helper in its one caller +- nvk: Remove plane sources from tex instructions +- nvk: Use common physical device properties +- nv50/ir: Rework conversions for texture array indices +- clang-format: Add nir_foreach_reg_* +- clang-format: nir_foreach_src is not a foreach macro +- clang-format: Set the default ColumnLimit to 0 +- nir: Re-align a couple enums and add clang-format comments +- nir: Don't clang-format const_value helpers +- nir: Don't clang-format a couple typedefs +- nir: Don't clang-format debug print setup +- nir: More manual formatting +- nir: Pretty format type mapping helpers +- nir: Wrap pass macros in braces +- nir: Add a do to the do/while in nir_const_value_t_array() +- nir: Add a .clang-format file +- nir: clang-format src/compiler/nir/\*.[ch] +- nvk: Don't use nir_ssa_for_src() +- nir: Drop most instances of nir_ssa_dest_init() +- nir: Drop more instances of nir_ssa_dest_init() +- nir/clone: Clone nir_def nor nir_dest +- nir/serialize: [De]serialize nir_def nor nir_dest +- nir: Drop nir_ssa_dest_init() +- nir: Drop nir_ssa_dest_init_for_type() +- nir: nir_foreach_ssa_def() -> nir_foreach_def() +- st,zink,sfn: Use nir_foreach_def instead of nir_foreach_dest +- dxil: Use nir_foreach_def() instead of nir_foreach_dest() +- nir/from_ssa: Use nir_foreach_def() instead of nir_foreach_dest() +- nir: Drop nir_foreach_dest() +- intel/vec4: Stop passing around nir_dest +- intel/fs: Stop passing around nir_dest and nir_alu_dest +- broadcom: Stop using nir_dest directly +- vc4: Stop passing around nir_dest +- nir,ntt,a2xx,lima: Stop using nir_dest directly +- lima: Stop using nir_dest directly +- etnaviv: Stop passing around nir_dest +- r600/sfn: Stop passing around nir_dest and nir_alu_dest +- nv50/ir: Stop passing around nir_dest and nir_alu_dest +- nir/gather_types: Stop passing around nir_dest +- nir/dce: Stop passing around nir_dest +- nir/propagate_invariant: Stop passing around nir_dest +- nir/validate: Replace all dest validation with validate_def +- nir/print: Replace all dest printing with print_def +- nir: Get rid of nir_dest_bit_size() +- nir: Get rid of nir_dest_num_components() +- nir: Get rid of nir_dest_is_divergent() +- nir: Drop nir_alu_dest +- nir: Drop nir_dest +- util/format: 8-bit interleaved YUV formats are UNORM +- gallivm: Support G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM +- blorp: Use R8G8_UINT for YCRCB_* formats with CCS +- anv: Disable CCS_E for ISL_FORMAT_YCRCB_* +- vulkan/format: Use correct swizzle for 1-plane YCbCr formats +- gallivm: Drop the Vulkan YUV format hacks +- nir: Rename nir_instr_type_ssa_undef to nir_instr_type_undef +- nir s/nir_get_ssa_scalar/nir_get_scalar/ +- nir: s/live_ssa_def/live_def/ +- nir: s/nir_instr_ssa_def/nir_instr_def/ +- nir: Rework nir_scalar_chase_movs a bit +- nir: Fix nir_op_mov handling in nir_collect_src_uniforms +- nir: Handle nir_op_mov properly in opt_shrink_vectors +- nir: Don't handle nir_op_mov in get_undef_mask in opt_undef +- nir: Clean up nir_op_is_vec() and its callers +- nir/large_constants: Use nir_component_mask_t +- nir/large_constants: Add read/write_const_values helpers +- nir/opt_large_constants: Add Small constant handling +- spirv: Re-emit constants at their uses +- nir: Take a nir_def * in nir_tex_instr_add_src() +- nir: Take a nir_def * in nir_phi_instr_add_src() +- nir/opt_undef: Don't rewrite a bcsel to mov +- nir: Add a nir_instr_clear_src() helper and use it +- nir: Add and use a nir_instr_init_src() helper +- nir: Drop nir_if_rewrite_condition() +- nir: Drop most uses of nir_instr_rewrite_src_ssa() +- nir: Drop nir_instr_rewrite_src_ssa() +- nir: Drop most uses if nir_instr_rewrite_src() +- nir: Drop nir_instr_rewrite_src() +- nir: Drop nir_push_if_src() +- nir: Fix metadata in nir_lower_is_helper_invocation +- nir: Use nir_shader_intrinsic_pass() a few places +- drm-uapi: Sync nouveau_drm.h +- nvk: Plumb no_prefetch through to the DRM back-end +- nouveau/mme: Fix a compile warning +- intel/isl: Rename ISL_TILING_Yf/s to ISL_TILING_SKL_Yf/s +- intel/isl: Add ICL variants of Yf and Ys tiling +- intel/isl: Implement correct tile size calculations for Ys/Yf +- intel/isl: Use the depth field of phys_level0_sa for GFX4_2D 3D surfaces +- intel/isl: Fill out the correct phys_total_extent for Ys/Yf/Tile64 +- intel/isl: Indent uncompressed surface code +- intel/isl: Support Ys, Yf & Tile64 in isl_surf_get_uncompressed_surf +- intel/isl: Support Yf/Ys tiling in surf_fill_state +- intel/isl: Support Yf/Ys tiling in emit_depth_stencil_hiz +- intel/isl: Add initial data-structure support for miptails +- intel/isl: Add support for computing offsets with miptails +- intel/isl: Support miptails in isl_surf_get_uncompressed_surf +- intel/isl: Start using miptails +- intel/isl: Disallow CCS on 3D surfaces with miptails +- intel/isl: Allow Ys tiling +- anv: Align memory VA to support for Ys, Tile64 tiled images +- nvk: Clean up includes +- nvk: Add include guards to nvk_bo_sync.h +- nvk: SPDX everything +- nouveau/nil: SPDX everything +- nouveau/mme: SPDX everything +- nvk: Don't add a dummy attachment when gl_SampleMask is written +- nvk: Set the discard bit for Z/S self-deps +- nvk: Invalidate the texture cache in PipelineBarrier +- nvk: Lower interp_at_sample to interp_at_offset +- nvk: Disable statistics around meta ops +- nvk: Clean up viewport math +- nvk: Fix depth clipping parameters +- nvk: Enable dynamic clip/clamp enable +- nvk: Set GUARDBAND_Z_SCALE_1 when Z-clipping +- r600: Use more auto-generated nir_builder helpers +- r600: Use nir_builder helpers for load/store_shared_r600 +- nvk: Re-order physical device limits +- nvk: Advertise maxMemoryAllocationCount = 4096 +- nvk: Advertise discreteQueuePriorities = 2 +- nvk: Rip out old UAPI support +- nvk/drm: Drop the push_add_push_bo() helper +- nvk/drm: Drop the push_add_bo() helper +- nvk: Drop command buffer BO tracking +- nvk: Drop memory object tracking +- nvk: Drop the device-level mutex +- nvk: Get rid of the tiled memory allocation helpers +- nvk/drm: Restructure nvk_queue_submit_drm_nouveau() +- nvk/drm: Split exec as needed for large command buffers +- nvk: Don't store the descriptor pool BO in the set +- nvk: Store a 20-bit driver_build_sha in nvk_instance +- nvk: Hook up the disk cache +- nvk: Re-structure early shader compilation a bit +- nvk: Add a default pipeline cache +- nvk: Cache NIR shaders +- nvk: Init pipelineCacheUUID +- drm-uapi: Sync nouveau_drm.h +- nvk: Take GETPARAM_EXEC_PUSH_MAX into account +- nvk: Handle zero-sized sparse buffers +- nvk: Use align() and align64() instead of ALIGN_POT +- nouveau: Generate headers for Maxwell B compute +- nvk: Add a nvk_cmd_buffer_compute_cls() helper +- nvk: Invalidate sampler/texture header caches in BeginCommandBuffer() +- nvk: Invalidate SKED caches at the top of command buffers +- nvk: Advertise more inline uniform block limits +- nvk: Emit MME_DMA_SYSMEMBAR before indirect draw/dispatch +- nvk: Set max descriptors to 2^20 for most descriptor types +- nvk: Reset descriptor pool allocator when all sets are destroyed +- nil/format: Use A for alpha blend +- nil/format: Advertise R10G10B10A2_UINT texture buffer support +- nvk: Disable depth or stencil tests when unbound +- nvk: Always emit at least one color attachment +- nvk: Improve address space and buffer size limits +- nvk: Always set pixel_min/max_Z to CLAMP +- nvk: Use nouveau_ws_bo_unmap() instead of munmap() +- nvk: Free the disk cache +- nvk: Add an nvk_shader_finish() helper +- nvk: Handle unbinding images and buffers +- nvk: Clean up the disk cache on physical device create fail path +- vulkan/wsi: Allow for larger linear images +- nvk: Add a nvk_cmd_buffer_dirty_render_pass() helper +- nvk: Re-sort device features +- nvk: Implement VK_EXT_depth_bias_control +- nvk: Advertise VK_KHR_workgroup_memory_explicit_layout +- nvk: Implement VK_EXT_image_sliced_view_of_3d +- nvk: Advertise VK_EXT_primitive_topology_list_restart +- nvk: Advertise VK_EXT_attachment_feedback_loop_layout +- features: Mark VK_EXT_attachment_feedback_loop_layout done for NVK +- nvk: Re-arrange Vulkan 1.2 features to match the header +- nvk: Advertise shaderOutputLayer and shaderOutputViewportIndex +- nvk: Enable descriptorIndexing +- nvk: Implement VK_EXT_dynamic_rendering_unused_attachments +- nir: Add a nir_ssa_def_all_uses_are_fsat() helper +- nir: Add convert_alu_types to divergence analysis +- nir/lower_tex: Add a lower_txd_clamp option +- nir: Add a load_sysval_nv intrinsic +- nir: Add NV-specific texture opcodes +- nir: Add an load_barycentric_at_offset_nv intrinsic +- nir: Add a range to most I/O intrinsics +- nir: Add NVIDIA-specific I/O intrinsics +- nir/lower_bit_size: Fix subgroup lowering for floats +- nir: add deref follower builder for casts. +- nir: Handle wildcards with casts in copy_prop_vars + +Felix DeGrood (12): + +- anv: save a shader source uint32_t hash in gfx/compute pipelines +- anv: Add Source hash field to VkPipelineExecutableStatisticKHR +- iris: save shader source sha1 in ish +- mesa: propagate shader source sha1 from gl_shader to nir_shader +- intel: use shader source hash in INTEL_MEASURE +- intel/compiler: use shader source hash in shader dump code +- anv: add fake sparse support +- anv: enable fake sparse for Elden Ring +- anv: debug messaging for sparse texture usage +- anv: fix frame count reporting in INTEL_MEASURE +- anv: set ComputeMode.PixelAsyncComputeThreadLimit = 4 +- anv: remove CS_FLUSH from query regression + +Feng Jiang (9): + +- virgl: Only PIPE_BUFFER with VIRGL_BIND_CUSTOM flag is considered busy during creation +- meson: Export winsys function symbols for target va +- frontends/va: Add slice_count to AV1 slice_parameter +- virgl/video: Add definition of virgl_av1_picture_desc +- virgl/video: Add support for AV1 decoding +- virgl/video: Enable AV1 decoding +- meson: Rename dri-vdpau.dyn to dri.dyn +- CODEOWNERS: Add \@flynnjiang for VirGL video +- meson: Move video to separate section in meson configuration summary + +Filip Gawin (1): + +- crocus: Avoid fast-clear with incompatible view + +Flora Cui (1): + +- radeonsi: limit CP DMA to skip holes in sparse bo + +Francisco Jerez (29): + +- intel/fs/ra: Define REG_CLASS_COUNT constant specifying the number of register classes. +- intel/vec4/ra: Define REG_CLASS_COUNT constant specifying the number of register classes. +- intel/compiler: Make MAX_VGRF_SIZE macro depend on devinfo and update it for Xe2. +- intel/fs/ra/xe2: Scale up register allocation granularity by 2x on Xe2+ platforms. +- intel/eu/xe2+: Fix encoding of various message descriptors for change in register size. +- intel/fs: Fix signedness of payload_node_count argument of calculate_payload_ranges(). +- intel/fs/xe2+: Fix payload node live range calculations for change in register size. +- intel/fs/xe2+: Fix grf_count in post-RA scheduling for updated register file size. +- intel/fs/xe2+: Fixes for increased accumulator register width. +- intel/fs/xe2+: Scale MAX_SAMPLER_MESSAGE_SIZE by native register size. +- intel/eu/xe2+: Update validation of GRF region size to account for Xe2 reg size +- intel/fs/xe2+: Allow increased SIMD width for various get_fpu_lowered_simd_width() restrictions. +- intel/compiler/xe2+: Represent dispatch_grf_start_reg in native GRF units. +- intel/fs/xe2+: Update encoding of FB write message payload. +- intel/fs/xe2+: Round up fs_builder::vgrf() size calculation to HW register unit. +- intel/fs/xe2+: Scale BRW_MAX_MSG_LENGTH by native register size. +- intel/fs/xe2+: Fix payload layout of sampler messages for Xe2 reg size +- intel/fs/xe2+: Update GS payload setup for Xe2 reg size. +- intel/fs/xe2+: Update TCS payload setup for Xe2 reg size. +- intel/fs/xe2+: Update TES payload setup for Xe2 reg size. +- intel/fs: Lower unsupported regioning with non-trivial 2D regions on FIXED_GRFs. +- intel/fs/xe2+: Update regioning lowering offset alignment checks for Xe2 regs. +- intel/fs/xe2+: Fix execution width of SHADER_OPCODE_GET_BUFFER_SIZE for SIMD16 EU. +- intel/fs/xe2+: Fix calculation of spill message width for Xe2 regs. +- intel/xe2+: Round up size to reg_unit() in fs_reg_alloc::alloc_spill_reg(). +- intel/fs/xe2+: Fix URB writes with 0 data components. +- intel/fs: Specify number of data components of logical URB writes via control immediate. +- intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB writes. +- intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB reads. + +Frank Binns (10): + +- pvr: clang-format fixes +- pvr: skip setting up SPM consts buffer when no const shared regs are used +- pvr: cleanup SPM EOT dynarray after upload +- pvr: treat VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT as not supported +- pvr: remove pvr_pbe_get_src_pos() +- pvr: fix attachments segfault in pvr_is_stencil_store_load_needed() +- pvr: fix allocation size of clear colour consts shared regs buffer +- pvr: change a few places to use PVR_DW_TO_BYTES() +- pvr: fix setup of load op unresolved msaa mask +- pvr: emit PPP state when vis_test dirty bit is set + +Friedrich Vock (19): + +- radv/ci: Set DRIVER_NAME in LAVA raven vkcts jobs +- radv: Handle VK_SUBOPTIMAL_KHR in trace layers +- ac/msgpack: make fixstrs a const char +- ac/sqtt,radv: Split internal and API hash in PSO correlations +- ac/rgp: Write lds_size metadata +- ac/rgp: Add metadata for separate-compiled RT stages +- radv/sqtt: Move record filling to helper function +- radv/sqtt: Unregister records based on hash +- radv/sqtt: Write LDS size metadata in code objects +- radv/sqtt: Handle separately-compiled RT pipelines +- ac/sqtt,radv/sqtt: Add and use marker for separate RT compilation +- nir/load_store_vectorize: Handle intrinsics with constant base +- radv/rt: Pre-initialize instance address +- radv: Initialize shader freelist on allocation +- radv: Fix check in insert_block +- radv/rt: Reject hits within 10ULP of previous hits in emulated RT +- radv/rra: Recognize LPDDR memory +- radv/rmv: Recognize LPDDR memory +- vulkan: Don't use set_foreach_remove when destroying pipeline caches + +Ganesh Belgur Ramachandra (5): + +- radeonsi: stores bottom_edge_rule option in the rasterizer state +- radeonsi: sets OPTIMAL_BIN_SELECTION to 0 if using bottom_edge_rule +- radeonsi: "clear_render_target" shader in nir +- radeonsi: "clear_render_target_1d_array" shader in nir +- radeonsi: "clear_12bytes_buffer" shader in nir + +Georg Lehmann (39): + +- aco/gfx11: fix get_gfx11_true16_mask with v_cmp_class_f16 +- aco: improve get_gfx11_true16_mask description +- aco: combine a & ~b to bfi(b, 0, a) +- aco/gfx11: use v_cmp_class_f16 with opsel for bitnz/bitz +- aco: fix non constant 16bit bitnz/bitz +- ac/nir: handle more special cases in ac_nir_unpack_arg +- aco: use s_bitreplicate_b64_b32 to set exec to 0xffff0000ffff0000 +- nir/opt_intrinsics: optimize (exclusive_scan(op, a) op a) to inclusive scan +- aco: always use rtne for fquantize2f16 +- nir/opt_if: also rewrite uniform uses for read_invocation +- nir: unify lower_bitfield_insert with has_{bfm,bfi,bitfield_select} +- nir: unify lower_bitfield_extract with has_bfe +- nir: unify lower_find_msb with has_{find_msb_rev,uclz} +- aco: fix u2f16 with 32bit input +- aco: combine a | ~b to bfi(b, a, -1) +- aco: use v_cvt_f32_ubyte for signed casts too +- nir: add nir_scalar intrinsic helpers +- nir: add nir_scalar_equal +- aco: implement some exclusive scans with inclusive scans +- aco/gfx11: don't use bfe for local_invocation_id if the others are always 0 +- nir/opt_algebraic: remove broken fddx/fddy patterns +- aco: simplify masked swizzle dpp selection by removing or_mask first +- aco: fix p_extract with v1 dst and s1 operand +- aco: implement 64bit div find_lsb +- nir: scalarize masked_swizzle_amd created from shuffle_xor +- aco/optimizer: check if we can use omod before labeling it +- aco/optimizer: copy propagate to output modifier instructions +- aco: remove -0.0 for 32 bit fsign with mul_legacy/omod when denorms are flushed +- nir: make quad intrinsic dst bit size match src0 +- nir/lower_subgroups: use intrinsic builder more +- aco: assume new generations are unsupported by clrx +- aco: assume newer generation will use GFX11 wait_imm packing +- aco: print final ir instead if printing asm is unsupported +- aco/gfx11: optimize dual source export +- aco/gfx11: apply clamp/omod to vinterp +- aco: support v_fma_f32_dpp as fma_mix +- aco/gfx11: support vinterp as fma_mix +- aco: add missing scc def for SALU quad broadcast +- aco/sched: treat p_dual_src_export_gfx11 like export + +George Ouzounoudis (38): + +- nouveau/codegen: Support compact clip distances with arrayed_io +- nouveau/codegen: Handle nir op amul +- nouveau/codegen: Fix compact patch varyings in case of NIR +- nouveau/codegen: Add capability to pre-specify tessellation domain +- nvk: Do not increment instance id across draws +- nvk: Add a macro for root descriptor table byte offsets +- nvk: Set base vertex state in sequential mme draw +- nvk: Support base instance in instanced draw calls +- nvk: Switch point rasterization to point sprites +- nvk: Support large points +- nvk: Compile geometry shaders +- nouveau/mme: Keep device info in mme_builder +- nvk: Simplify mme build function argument +- nvk: Support VK_KHR_shader_draw_parameters +- nvk: Support for vertex shader transform feedback +- nvk: Support transform feedback indirect draws +- nvk: Support transform feedback geometry streams +- nvk: Support transform feedback queries +- nvk: Support vertex shader transform feedback on Fermi +- nvk: Disable PRIMITIVE_RESTART_VERTEX_ARRAY by default +- nvk: Fix geometry shader active stream mask +- nvk: Support geometry shaders +- nvk: Basic tessellation shader support +- nvk: Assign locations correctly for arrayed IO +- nvk: Enable multiview with tessellation shader +- nvk: Fix cases where execution mode is specified in the tesc shader. +- nvk: Respect tessellation domain origin state +- nvk: Lower io to temporaries for tessellation evaluation nir +- nvk: Support VkDescriptorSetVariableDescriptorCountLayoutSupport +- nvk: Handle cases of descriptor bindings with variable counts +- nvk: Add nir non-uniform optimization pass +- nvk: Enable descriptor indexing +- nvk: Do not keep redundant info for tessellation domain +- nouveau/codegen: Do not keep redundant info for tessellation domain +- nvk: Enable dynamic line rasterization mode state +- nvk: Fix support for VK_EXT_sample_locations +- nvk: Support dynamic state for enabling sample locations +- nouveau/codegen: Add a 4th optimization level for MemoryOpts + +Gert Wollny (63): + +- r600/sfn: Switch to register intrinsics +- r600/sfn/tests: add simple copy-prop test with register source +- r600/sfn: Allow for larger ALU CF's +- r600/sfn: Handle indirect array load/store dependencies better +- r600/sfn: Increase LDS fetch schedule priority +- r600/sfn: Add peephole optimization to move a dest to the previous op +- r600/sfn: reorder the value factory class member declaration a bit +- r600/sfn: Add some tests for proper register access +- r600/sfn: Print more info if scheduling fails +- r600/sfn: remove debug output leftovers +- r600/sfn: Fix use of multiple IDX with kcache +- r600/sfn: Always check arrays writes before allowing copy propagation +- r600/sfn: set block sizes based on chip class +- r600/sfn: Fix typo with block type +- r600/sfn: override slot count for IfInstr +- r600/sfn: Add method to convert to AluGroup directly +- r600/sfn: Add flags to check whether a group starts CF and can do that +- r600/sfn: make remaining slots a signed value +- r600/sfn: on Cayman loading an index register needs only one slot +- r600/sfn: Splizt ALU blocks in scheduler to fit into 128 slots +- r600/sfn: rework checks for ALU CF emission +- r600/sfn: Schedule AR uses befor possible groups +- r600: Explicitly force new CF in gs copy shader +- r600: Assert when backend wants to create a new ALU CF +- r600: don't check possible size of ALU CF +- r600: don't use sb disasm to disassamble copy shader +- r600: Force CF when emitting a NOP on R600 in gs copy shader +- r600/sfn: Don't try to propagate to vec4 with more than one use +- r600/sfn: Only switch to other CF if no AR uses are pending +- r600/sfn: AR loads should depend on all previous non ALU instructions +- r600/sfn: Renumber shader blocks in scheduler +- r600/sfn: Track whether a register is ALU clause local +- r600/sfn: Use clause local registers in RA +- r600/sfn: Take source uses into account when switching channels +- r600/sfn: take number of dest values into account +- r600: retire SB optimizer +- r600/sfn: work around injecting extra CF's to handle hardware bugs +- r600: use correct cso pointer for fetch shader +- r600/sfn: Make use of four clause local registers +- r600/sfn: drop unused ControlFlowInstr type enum +- r600/sfn: factor out resource as extra class +- r600/sfn: Simplify dependency chain for index loads on EG +- r600: print texture resource index mode separately +- r600/sfn: Make address split pass obligatory +- r600/sfn: rename method resource_base to resource_id +- r600/sfn: Add old address to update_indirect_addr +- r600/sfn: Sepeate resource and sampler in texture instructions +- r600/sfn: get rid of the method to get the index mode +- r600/sfn: sort the uniforms of the right shader +- r600/sfn: Fix use of scheduled_shader vs shader +- virgl: report MIRROR_CLAMP features better +- ci: Upref virglrenderer +- copyimage: check requested slice early when cube maps are involved +- mesa: check numlevels and numlayers when creating a texture view +- virgl: Use common clear_texture if host doesn't support the feature +- r600/sfn: don't remove texture sources by using the enum value +- r600: drop egcm_load_index_reg +- r600/sfn: Don't override a chgr pinning during copy propagation +- r600/sfn: When simplifying src vec4 pinnings, also check all uses +- virgl: Fix logic for reporting PIPE_MIRROR_CLAMP +- r600: Add callbacks for get_driver_uuid and get_device_uuid +- r600: Link with libgalliumvl, when enabling rusticl this is needed +- r600/sfn: Fixup component count only if intrinsic has it + +Guilherme Gallo (5): + +- bin/ci: Ensure that all jobs have nodes in DAG +- ci/radeonsi: Update flake list +- ci/freedreno: Add a new flake +- ci/zink: Found some flakes +- ci/anv: Catch some flakes + +Hannes Mann (1): + +- vulkan/wsi/wayland: Fix detection of tearing control protocol + +Hans-Kristian Arntzen (2): + +- wsi/x11: Fix potential deadlock in present ID. +- wsi/x11: Don't allow signal_present_id to rewind. + +Helen Koike (21): + +- ci: re-add EXTRA_LOCAL_PACKAGES to rootfs +- ci: add EXTRA_LOCAL_PACKAGES to apt-get install +- docs/ci: Add docs for EXTRA_LOCAL_PACKAGES +- ci: disable duplicated pipelines triggered by marge +- ci: add --project option to ci_run_n_monitor.py +- ci/android: remove strace output from cuttlefish-runner.sh +- ci: add locked flag to bindgen-cli on x86_64_build.sh +- ci: separate hiden jobs to -inc.yml files +- ci/ci_run_n_monitor: add docs for multiple targets +- ci/ci_run_n_monitor: print stress test results per job +- ci/ci_run_n_monitor: simplify with defaultdict +- ci/ci_run_n_monitor: merge print_job_status_change with print_job_status +- ci/ci_run_n_monitor: make --target mandatory +- ci/ci_run_n_monitor: merge enable_job with retry_job +- ci/ci_run_n_monitor: simplify enable/cancel logic in monitor_pipeline() +- ci/ci_run_n_monitor: allow <user>/<project> in --project +- ci/ci_run_n_monitor: limit repetitions on --stress +- ci/marge_queue: add missing python-dateutils to requirements.txt +- ci/ci_run_n_monitor: keep monitoring if a job is still running +- ci/marge_queue: add pretty_dutation() +- ci/ci_run_n_monitor: print job duration time + +Honglei Huang (7): + +- virgl/video: Add support for mpeg12 decoding +- virgl/video: Add support for vc1 decoding +- virgl/video: Add support for jpeg decoding +- virgl/video: Add support for hevc10bit decoding. +- virgl/video: Add more pipe type in virgl formats convert table +- virgl/video: Add jpeg buf start code check +- virgl: Enable vp9 hardware decode + +Hyunjun Ko (3): + +- anv: use ycbcr_info for P010 format +- anv: don't use cmd_buffer after destroyed. +- anv: don't flush_llc on gen9 + +Iago Toral Quiroga (100): + +- nir/trivialize: Move decl_reg to the start of the block +- v3dv: stop incrementing UBO indices by one +- nir/lower_robustness: drop skip_ubo_0 option +- v3dv: fix incorrect key setup +- broadcom/compiler: stop asserting on Vulkan environment +- broadcom/compiler: use NIR's lowering for dispatch base +- broadcom/compiler: move uniform offset lowering from compiler to GL driver +- broadcom/compiler: move vulkan's point coord lowering to the driver +- v3dv: don't set lower_wpos_pntc for Vulkan +- broadcom/compiler: always clamp results from logic ops +- broadcom/compiler: drop execution environment from the shader key +- v3dv: drop cpu path for buffer to image copies +- v3dv: remove unused code +- nir/lower_tex: copy backend_flags field when copying a tex instruction +- nir/lower_tex: use a callback to check sampler return size packing +- squash! v3dv,broadcom/compiler: don't abuse sampler index +- v3dv: assert that only tex instructions with sampler state have a sampler src +- v3d: fix texture packing lowering +- v3d,v3dv: use fquantize2f16 lowering in NIR +- v3dv: be more precise in vkGetImageSubresourceLayout +- v3dv: handle pPlaneLayouts in VkImageDrmFormatModifierExplicitCreateInfoEXT +- v3dv: bump up MAX_UNIFORM_BUFFERS to 16 +- v3dv: add support for sampling simple 2D linear textures +- v3dv: expand sampling from linear image hack to support multi-planar images +- v3dv: don't assume that bound descriptors have been written +- v3dv: only handle Android Hardware Buffer on Android +- v3dv: we can sample from 1D array too +- broadcom/compiler: add a couple of shader key helpers +- v3d: compute nir sha1 for uncompiled shader state +- v3d: use pre-computed shader sha1 for disk cache +- v3d: fix RAM shader cache +- v3d: get rid of shader_state pointer in v3d_key +- broadcom/simulator: reset CFG7 for compute dispatch in v71 +- broadcom/common: retrieve V3D revision number +- broadcom/compiler: update node/temp translation for v71 +- broadcom/compiler: implement "reads/writes too soon" checks for v71 +- broadcom/compiler: implement read stall check for v71 +- broadcom/compiler: add a v3d71_qpu_writes_waddr_explicitly helper +- broadcom/compiler: prevent rf2-3 usage in thread end delay slots for v71 +- broadcom/qpu: add new ADD opcodes for FMOV/MOV in v71 +- broadcom/qpu: fix packing/unpacking of fmov variants for v71 +- broadcom/compiler: make vir_write_rX return false on platforms without accums +- broadcom/compiler: rename vir_writes_rX to vir_writes_rX_implicitly +- broadcom/compiler: only handle accumulator classes if present +- broadcom/compiler: don't assign rf0 to temps across implicit rf0 writes +- broadcom/compiler: CS payload registers have changed in v71 +- broadcom/compiler: don't schedule rf0 writes right after ldvary +- broadcom/compiler: allow instruction merges in v71 +- broadcom/qpu: add MOV integer packing/unpacking variants +- broadcom/qpu: fail packing on unhandled mul pack/unpack +- broadcom/compiler: generalize check for shaders using pixel center W +- broadcom/compiler: v71 isn't affected by double-rounding of viewport X,Y coords +- broadcom/compiler: update peripheral access restrictions for v71 +- broadcom/qpu: add packing for fmov on ADD alu +- broadcom/compiler: handle rf0 flops storage restriction in v71 +- broadcom/compiler: enable ldvary pipelining on v71 +- broadcom/compiler: try to use ldunif(a) instead of ldunif(a)rf in v71 +- broadcom/compiler: don't assign rf0 to temps that conflict with ldvary +- broadcom/compiler: convert mul to add when needed to allow merge +- broadcom/compiler: implement small immediates for v71 +- broadcom/compiler: update thread end restrictions for v7.x +- broadcom/compiler: update ldvary thread switch delay slot restriction for v7.x +- broadcom/compiler: lift restriction for branch + msfign after setmsf for v7.x +- broadcom/compiler: start allocating from RF 4 in V7.x +- broadcom/compiler: validate restrictions after TLB Z write +- broadcom/compiler: lift restriction on vpmwt in last instruction for V3D 7.x +- broadcom/compiler: fix up copy propagation for v71 +- broadcom/compiler: don't allocate spill base to rf0 in V3D 7.x +- broadcom/compiler: improve allocation for final program instructions +- broadcom/compiler: don't assign registers to unused nodes/temps +- broadcom/compiler: only assign rf0 as last resort in V3D 7.x +- v3dv: expose V3D revision number in device name +- v3dv/device: handle new rpi5 device (bcm2712) +- v3dv: setup render pass color clears for any format bpp in v71 +- v3dv: setup TLB clear color for meta operations in v71 +- v3dv: fix up texture shader state for v71 +- v3dv: handle new texture state transfer functions in v71 +- v3dv: implement noop job for v71 +- v3dv: handle render pass global clear for v71 +- v3dv: GFX-1461 does not affect V3D 7.x +- broadcom/compiler: update thread end restrictions validation for v71 +- v3dv: handle early Z/S clears for v71 +- v3dv: handle RTs with no color targets in v71 +- v3dv: don't convert floating point border colors in v71 +- v3dv: handle Z clipping in v71 +- v3dv: make v3dv_viewport_compute_xform depend on the V3D version +- v3dv: fix depth clipping then Z scale is too small in V3D 7.x +- v3d/v3dv: fix texture state array stride packing for V3D 7.1.5 +- v3d,v3dv: support up to 8 render targets in v7.1+ +- v3d,v3dv: don't use max internal bpp for tile sizing in V3D 7.x +- v3d,v3dv: propagate NaNs bits in shader state records are reserved in v7.x +- v3dv: use new texture shader state rb_swap and reverse fields in v3d 7.x +- v3dv: fix color write mask for v3d 7.x +- v3d,v3dv: fix depth bias for v3d 7.x +- v3d,v3dv: fix compute for V3D 7.1.6+ +- v3dv: expose fullDrawIndexUint32 in V3D 7.x +- v3dv: expose depthClamp in V3D 7.x +- v3dv: expose scalarBlockLayout on V3D 7.x +- v3dv: fix confusing nomenclature about DRM nodes +- v3d,v3dv: fix MMU error from hardware prefetch after ldunifa + +Ian Douglas Scott (1): + +- egl/wayland: Don't segfault if \`create_wl_buffer` returns \`NULL` + +Ian Romanick (38): + +- intel/fs: Always do opt_algebraic after opt_copy_propagation makes progress +- intel/fs: Constant fold SHL +- intel/fs: Constant fold OR and AND +- util/rb-tree: Return the actual first node from rb_tree_search +- util/rb-tree: Fix typo in comment +- nir/builder: Add nir_extract_i8_imm and nir_extract_u8_imm helpers +- nir/algebraic: Remove redundant pack / unpack lowering patterns +- intel/fs: Completely re-write the combine constants pass +- intel/fs: Combine constants for SEL instructions too +- intel/fs: Combine constants for integer instructions too +- intel/fs: New VGRF packing scheme for constant combining +- intel/compiler: Combine control barriers with identical memory semantics +- intel/compiler: Don't evict for workgroup-scope fences +- glsl/list: Clean up an inappropriate comment +- util/rb-tree: Work around C++'s dislike of offsetof +- util/rb-tree: Inline rb_tree_init +- intel/fs: Don't continue fixed point iteration just because liveout changes +- intel/fs: Don't try to copy propagate into a source again after progress is made +- intel/fs: Make try_constant_propagate and try_copy_propagate file private +- intel/fs: Move src.file checks out of try_constant_propagate and try_copy_propagate +- intel/fs: Don't loop in try_constant_propagate +- intel/fs: Simplify check in can_propagate_from +- intel/fs: Make opt_copy_propagation_local file private +- intel/fs: Encapsulate per-block ACP in a structure +- intel/fs: Use rb_tree to store ACP entries by source +- intel/fs: Use rb_tree to store ACP entries by destination +- intel/fs: Use rb_tree for copy prop dataflow +- intel/fs: Merge copy prop dataflow loops +- intel/compiler/xe2: Update fs_visitor::setup_vs_payload to account for Xe2 reg size +- intel/compiler/xe2: Use SIMD16 for nir_intrinsic_image_size +- intel/compiler/xe2: TXD is lowered to SIMD16 in SIMD32 mode +- nir/rematerialize: Rematerialize ALUs used only by compares with zero +- intel/compiler/xe2: Handle new URB read messages +- intel/compiler/xe2: Handle new URB write messages +- intel/compiler/xe2: Update fs_visitor::emit_urb_writes to not assume SIMD8 +- spirv: Track when a shader has a cooperative matrix +- intel/fs: Add DP4A to get_lowered_simd_width +- nir/split_vars: Don't split arrays of cooperative matrix types + +Igor Torrente (4): + +- zink: Fix enumerate devices when running compositor +- zink: Removes \`disable_xcb_surface` +- zink: Fix one addicional case when running a compositor +- zink: fix for startup crash of weston running on top of zink + venus + +Illia Abernikhin (2): + +- state_tracker: moving initialisation of whandle out from if statement whandle initialization inside if statement but used also outside +- i915: change format in dbg string Actually, uintptr_t is of type unsigned long, but the debug line uses the %d format specifier, which expects an int. + +Illia Polishchuk (7): + +- iris: remove NULL check for already dereferenced pointer earlier +- s/Intel: fix/anv: fix: potentially overflowing expression in genX +- glx: fix dead code when gc var cannot be null due to earlier check +- state_tracker: fix dereference before null check +- anv, drirc: Add workaround to speed up Cyberpunk 2077 reg allocation +- zink: move find_sampler_var from zink to nir core +- nir: fix invalid sampler search by texture id + +Italo Nicola (24): + +- mesa/main: account for RTT samples when updating framebuffer +- mesa/main: allow readpix/teximage to read from implicitly multisampled fbos +- panfrost/genxml: fix Surface With Stride descriptor alignment +- panfrost/genxml: add Multiplanar Surface descriptor +- panfrost: refactor (un)packing of surface descriptors +- pan/decode: decode Multiplanar Surface descriptors +- panfrost: prepare pan_image_view for multiplanar formats +- panfrost: prepare the driver to support YUYV and variants +- panfrost: advertise support for YUYV and variants +- panfrost: mandate proper alignment requirement depending format and arch +- panfrost: add PAN_MESA_DEBUG=yuv for debugging yuv sampler +- gallium/st: add non-CSC lowering of I420 as PIPE_FORMAT_R8_G8_B8_420 +- gallium/st: add non-CSC lowering of YV12 as PIPE_FORMAT_R8_B8_G8_420 +- pan/bi: add support for I420 and YV12 sampling +- gallium/st: lower NV21 to R8_B8G8 instead of G8_B8R8 +- panfrost: fix invalid memory access in get_equation_str() +- pan/decode: handle more than one panfrost_device +- panfrost/ci: updated CI expectations +- egl: reenable partial redraw with a warning when using gallium hud +- pan/genxml: add Width/Height fields to v9+ Plane descriptor +- panfrost: rename _needs_multiplanar_descriptor to _is_yuv +- panfrost: prepare v9+ to support YUV sampling +- panfrost: use centered YUV chroma siting +- panfrost: advertise YUV formats for valhall + +Iván Briano (23): + +- anv: ensure CFE_STATE is emitted for ray tracing pipelines +- iris: ensure mesh is disabled on context init +- anv: ensure mesh is disabled on context init +- anv: implement Wa_14019750404 +- intel/compiler: call brw_nir_adjust_payload from brw_postprocess_nir +- anv,hasvk: respect provoking vertex setting on geometry shaders +- anv: fix missing 3DSTATE_SBE_CLIP emission +- anv: ensure pipelines have all state +- anv: tell blorp to do mesh stuff only if it's enabled +- blorp: fix hangs with mesh enabled +- anv: use a simpler MUE layout for fast linked libraries +- anv: track what kind of pipeline a fragment shader may be used with +- intel/fs: read viewport and layer from the FS payload +- intel/fs: handle URB setup for fast linked mesh pipelines +- anv: enable VK_EXT_mesh_shader where supported +- intel/fs: use ffsll so we don't explode on 32 bits +- vulkan/runtime: add internal parameter to vk_spirv_to_nir +- nir/lower_int64: respect rounding mode when casting to float +- intel/compiler: round f2f16 correctly for RTNE case +- util: add double_to_float16 helpers +- nir: round f2f16{_rtne/_rtz} correctly for constant expressions +- anv: advertise VK_KHR_global_priority_queue +- anv: use the right vertexOffset on CmdDrawMultiIndexed + +Jani Nikula (1): + +- docs/vulkan: fixup some typos + +Janne Grunau (4): + +- asahi: toggle more barrier bits after transform feedback +- asahi,agx: Fix stack buffer overflow in agx_link_varyings_vs_fs +- asahi,agx: Upload constant buffers immediately +- asahi: decode: Fix uint64_t format modifiers in agxdecode_stateful() + +Jesse Natalie (2): + +- nir_lower_mem_access_bit_sizes: Fix write-mask-constrained 3-byte stores as atomics +- d3d12: Fix multidimensional array ordering + +Jianxun Zhang (1): + +- intel/common: Only set op mask on instructions in decoder + +Jonathan Marek (2): + +- freedreno: move redump.h to common code + cleanup +- tu: add a TU_DEBUG=rd option for cmdstream dumping + +Jordan Justen (73): + +- isl: Add ISL_SURF_USAGE_STREAM_OUT_BIT +- anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS +- genxml/hsw: Add additional MOCS field enumerations +- genxml/chv: Add MEMORY_OBJECT_CONTROL_STATE_CHV to document compared to BDW +- isl/dev: Add uncached MOCS value +- isl: Set MOCS to uncached for MTL stream-out +- intel/isl: Use intel_needs_workaround() for MTL CCS WA +- intel/compiler: Use nir SUBGROUP_INVOCATION for RT TOPOLOGY_ID +- intel/dev: Add LNL platform enum +- intel/dev: Support xe2 device init (for intel_device_info_test) +- intel/tools: Use 'env bash' to find bash executable +- intel/decoder: Fix xml filename when verx10 % 10 is not 0 +- intel/decoder: Add intel_spec_load_common() +- intel/decoder: Make intel_spec_load_filename() have separate dir and name strings +- intel/genxml: Align "Texture Coordinate Mode" naming +- intel/genxml: Split some genxml sorting code into a intel_genxml module +- intel/genxml: Convert gen_bits_header to use ElementTree +- intel/genxml: Convert gen_pack_header to use ElementTree +- intel/genxml: Add GenXml class into intel_genxml module +- intel/genxml: Add filter_engines() to GenXml class +- intel/genxml: Move sorting & writing into GenXml class +- intel/genxml: Don't rewrite sorted xml if the contents didn't change +- intel/genxml: Add final newline to output when saving xml +- intel/genxml: Update xml with gen_sort_tags.py output +- intel/dev: Use RPL-U name on RPL-U devices +- intel/dev: Add more RPL PCI IDs +- anvil,hasvk: Rename need_clflush to need_flush +- intel/common: Move intel_clflush.h to intel_mem.h/intel_mem.c +- anvil,hasvk: Replace intel_clflush_range with intel_flush_range +- intel/common: Add intel_flush_range_no_fence +- anvil,hasvk: Use intel_flush_range_no_fence to flush command buffers +- util/u_cpu_detect: Drop unused has_tsc +- util/u_cpu_detect: Detect clflushopt support +- meson: Check for the __builtin_ia32_clflushopt function +- intel/clflush: Add support for clflushopt instruction +- intel/dev/xe: Move placeholder subslice info into XEHP_FEATURES +- intel/genxml: Ignore tail leading/trailing whitespace in node_validator() +- intel/genxml: Fix comparing xml when node counts differ +- intel/dev: Update device string for MTL PCI ID 0x7d55 +- intel/genxml: Support importing from another genxml file +- intel/genxml: Add support for excluding items when importing +- intel/genxml: Add all xml files as pack dependencies +- intel/genxml: Add GenXml.optimize_xml_import() +- intel/genxml: Drop assertion to allow for importing +- intel/genxml: Add GenXml.add_xml_imports method +- intel/genxml: Add GenXml.flatten_xml() method +- intel/genxml: Add genxml_import.py script +- intel/decoder: ralloc_steal() values from spec context for fields and enums +- intel/decoder: Implement support for importing genxml +- intel/genxml: Start Xe2 support +- intel/genxml: Auto-import genxml files using genxml_import.py +- intel/common: Add sse2_args for 32-bit build when -Dsse2=false was set +- intel/compiler/fs: Support Xe2 reg size in assign_curb_setup +- intel/compiler: Update opt_split_sends() for Xe2 reg size +- intel/compiler: Update emit_rt_lsc_fence() for Xe2 +- intel/compiler: Update lower_trace_ray_logical_send() for Xe2 +- intel/compiler: Update ray-tracing intrinsic lowering for Xe2 +- intel/compiler: Update RT stack_id access for Xe2 +- intel/fs: Update SSBO & shared uniform block loads for Xe2 +- intel/genxml: Build with gen20.xml +- intel/isl: Build for Xe2 +- iris: Build for Xe2 +- anv/blorp: Use anv_genX to set device->blorp.exec +- anv: Disable Ray Tracing on xe2 until our compiler supports Xe2 RT +- anv: Build for Xe2 +- anv: Print warning that Xe2 is not supported rather than failing +- intel/compiler: Add enum xe2_lsc_cache_store +- intel/compiler: Use enum xe2_lsc_cache_store on xe2 +- intel/compiler: Add enum xe2_lsc_cache_load +- intel/compiler: Use enum xe2_lsc_cache_load on xe2 +- anv/batch: Check if batch already has an error in anv_queue_submit_simple_batch() +- anv/batch: Assert that extend_cb is non-NULL if the batch is out of space +- intel/dev: Add 0x56ba-0x56bd DG2 PCI IDs + +Jose Maria Casanova Crespo (2): + +- vc4: mark buffers as initialized at vc4_texture_subdata +- vc4: Fix mask RGBA validation at YUV blit + +José Expósito (3): + +- zink: Fix crash on zink_create_screen error path +- zink: fix dereference before NULL check +- zink: allow software rendering only if selected + +José Roberto de Souza (51): + +- anv: Use workaround framework to Wa_14016118574 +- intel/aux_map: Nuke format_enum +- intel/aux_map: Use get_aux_entry() in remove_mapping() +- intel/aux_map: Replace magic number by INTEL_AUX_MAP_ENTRY_VALID_BIT +- intel/aux_map: Rename some variables to improve readability +- intel/aux_map: Mask out bits above index 47 in intel_aux_get_meta_address_mask() +- intel/aux_map: Convert l1_entry_addr_out to canonical +- intel/aux_map: Drop magic sub table size number +- intel/aux_map: Add function and macro to return l2 and l1 table masks +- anv: Add gem_create_userptr() to KMD backend +- anv: Replace handle by anv_bo in the gem_close() +- anv: Add support for userptr in Xe KMD +- intel: Sync xe_drm.h +- intel/dev/xe: Add support for small-bar setups +- anv: Request Xe KMD to place BOs to CPU visible VRAM when required +- iris: Request Xe KMD to place BOs to CPU visible VRAM when required +- iris/xe: Call iris_lost_context_state() when batch engine is replaced +- intel/dev: Port intel_dev_info tool to Xe KMD +- iris: Replace I915_EXEC_FENCE_SIGNAL by IRIS_BATCH_FENCE_SIGNAL in common code +- intel: Move i915_drm.h specific code from common/intel_gem.h to common/i915/intel_gem.h +- intel/common: Move functions inside of C++ ifdef +- intel: Rename intel_gem_add_ext() to intel_i915_gem_add_ext() +- iris: Move i915_gem_set_domain() call to i915 backend +- iris: Move iris_bufmgr_bo_close() to kmd backend +- iris: Add gem_create_userptr() to KMD backend +- iris: Add support for userptr in Xe KMD +- intel/genxml/gen125: Add missing fields in MI_MATH +- iris: Set MI_MATH MOCS field +- anv: Set MI_MATH MOCS field +- intel/tests/mi_builder: Set MI_MATH MOCS field +- intel/genxml/gen125: Set MI_MATH MOCS field as non-zero +- anv: Nuke unused READ_ONCE() from anv_batch_chain.c +- anv: Remove VkAllocationCallbacks parameter from reloc functions +- anv: Return earlier in anv_reloc_list functions +- intel: Sync xe_drm.h and rename engine to exec_queue +- anv: Override vendorID for Hogwarts Legacy +- intel/isl: Remove unknown workaround +- intel/isl: Remove Wa_22011186057 +- anv: Update Wa_16014390852 for MTL +- intel: Sync xe_drm.h +- anv: Move i915 specific gem_set_caching to backend +- anv: Move i915 specific code from common anv_gem.c +- anv: Move bo_alloc_flags_to_bo_flags() to backend +- anv: Move i915 handling of imported bos bo_flags +- anv: Remove i915_drm.h include from common code +- iris: Lock bufmgr->lock before call vma_free() in error path +- iris: Nuke useless flags from iris_fine_fence_new() +- intel: Prepare implementation of Wa_18019816803 and Wa_16013994831 for future platforms +- intel: Sync xe_drm.h +- anv: Switch Xe KMD vm bind to sync +- anv: Add missing ANV_BO_ALLOC_EXTERNAL flags when calling anv_device_import_bo() + +Juan A. Suarez Romero (7): + +- broadcom/ci: update expected results +- vc4/ci: update expected results +- v3d/shim: include new ioctl parameters +- v3dv/ci: update expected list +- broadcom: add performance counters for V3D 7.x +- broadcom/simulator: add per-hw version calls +- v3d/vc4/ci: add new fails/timeout + +Julia Tatz (10): + +- gallium/dri: fix dri2_from_names +- aux/trace: skip multi-line comments in enums2names +- aux/trace: deduplicate enum dump macro work +- aux/trace: move trace_sample_view logic +- aux/trace: fix set_hw_atomic_buffers method name +- aux/trace: add screen video methods +- aux/trace: add context video methods +- aux/trace: wrap video_codec & video_buffer +- aux/trace: unwrap refrence frames in picture_desc +- aux/trace: trace video_buffer method return vals + +Julia Zhang (1): + +- radeonsi: modify algorithm of skipping holes of sparse bo + +Julian Hagemeister (1): + +- Gallium: Fix shared memory segment leak + +Juston Li (10): + +- zink: remove venus from renderpass optimizations +- venus: sync protocol for VK_EXT_vertex_input_dynamic_state +- venus: implement VK_EXT_vertex_input_dynamic_state +- venus: set lvp queries as saturate on overflow +- venus: add helper function to get cmd handle +- venus: refactor out common cmd feedback functions +- venus: support deferred query feedback recording +- venus: track/recycle appended query feedback cmds +- venus: append query feedback at submission time +- venus: switch to unconditionally deferred query feedback + +Kai Wasserbäch (3): + +- fix: clover: LLVM 18 renamed/moved CGFT_*, update compat layer +- fix: clover: LLVM 18: s/CodeGenOpt::/CodeGenOptLevel::/ +- fix: clover: warning: ignoring return value of ‘int posix_memalign(…)’ [-Wunused-result] + +Karmjit Mahil (29): + +- pvr: Remove mrt setup from SPM EOT +- pvr: Compile SPM EOT shader +- pvr: Use the SPM EOT on barrier stores +- pvr: Remove some magic numbers and increments from km stream +- pvr: Restructure \`rogue_kmd_stream.xml` +- pvr: Submit PR commands +- pvr: Use the correct size for the unified store allocation +- pvr: Allow query stage for barrier sub cmds +- pvr: Fix occlusion query unaccounted for user fences +- pvr: Fix writing query availability write out +- pvr: Fix packing issue with max_{x,y}_clip +- pvr: Fix csb relocation status assert on \`pvr_csb_finish()` +- pvr: Fix \`for` loop itarator usage +- pvr: Fix dynamic desc offset storage +- pvr: Fix cubemap layer stride +- pvr: Use the render passes' attachments array to setup ISP state +- pvr: Adjust EOT PBE state to account for the iview's base array layer +- pvr: Fix MRT index in PBE state +- pvr: Fix pbe_emit assert +- pvr: Fix OOB access of pbe_{cs,reg}_words +- pvr: Order tile buffer EOT emits to be last +- pvr: Fix subpass sample count on ds attachment only +- pvr: Refactor subpass ds and sample count setup +- pvr: Fix SPM load shader sample rate +- pvr: Fix PPP_SCREEN sizes +- vulkan: Add \`vk_subpass_dependency_is_fb_local()` helper +- tu: Use common \`vk_subpass_dependency_is_fb_local()` +- pvr: Don't merge subpasses on framebuffer-global dependancy +- pvr: Only setup the bgobj to load if we have a load_op + +Karol Herbst (213): + +- nvc0: initial Ada enablement +- rusticl/mesa: make svm_migrate optional +- llvmpipe: enable system SVM +- nvc0: fix num_gprs for Volta+ +- rusticl: fix warnings with newer rustc +- gm107/ir: fix SULDP for loads without a known format +- nv50/ir/nir: fix txq emission on MS textures +- nv50/ir/nir: Fix zero source handling of tex instructions. +- rusticl/kernel: only handle function_temp memory before lowering printf +- meson,ci: bump meson req for rusticl to 1.2 +- rusticl/nir: add helper functions we need for a NIR_PASS macro +- rusticl/nir: add a nir_pass macro +- rusticl/nir: use the new nir_pass macro +- rusticl/kernel: rename res to internal_args inside lower_and_optimize_nir_late +- rusticl/kernel: merge lower_and_optimize_nir_pre_inputs and lower_and_optimize_nir_late +- rusticl/kernel: move things around in lower_and_optimize_nir +- rusticl/kernel: get rid of initial function_temp type lowering +- rusticl/kernel: mark can_remove_var as unsafe and document it +- n50/compute: submit initial compute state in nv50_screen_create +- nvk: add vulkan skeleton +- nouveau/winsys: add the new winsys implementation +- nvk: use winsys lib +- nvk: fix nvk_buffer include guards +- nouveau/headers: add script to sync in-tree headers with open-gpu-doc +- nouveau/headers: initial sync of headers +- nvk: implement GetPhysicalDeviceQueueFamilyProperties2 to make the CTS happy +- nvk: advertize memory heaps and types +- nouveau/ws: reorganize a little +- nouveau/ws: dup the fd +- nouveau/ws: add a field for the SM version +- nvk: set nonCoherentAtomSize as the CTS divides with this value +- nouveau/ws: add bo API +- nvk: add basic device memory support +- nouveau/headers: add nvtypes.h +- nouveau/headers: typedef Nv void types +- nouveau/headers: add host classes +- nouveau/ws: add context support +- nouveau/ws: add a cmd buffer +- novueau/bo: refcount it +- novueau/bo: add nouveau_ws_bo_wait +- nvk: allocate a GPU context for each VkDevice +- nvk: add nvk_bo_sync +- nvk: add nvk_CmdPipelineBarrier2 stub +- nvk: impl nvk_CmdCopyBuffer +- nouveau/ws: fix setting push bo domains +- nouveau/ws: PUSH_IMMD only works with 16 bit values +- nouveau/ws: set GPU object class +- nouveau/ws: bind 2D class +- nvk: use fermi class definitions +- nvk: add basic support for images +- nvk: simple format table +- nvk: add support for blits +- nvk: report maxMipLevels as 1 +- nvk: optimize blit command buffer gen +- nvk: implement CmdFillBuffer +- nvk: implement CmdUpdateBuffer +- nvk: implement CmdCopyBuffer2 +- nvk: advertise VK_KHR_copy_commands2 +- nvk: implicitly reset the command buffer +- nouveau/ws: handle 0inc inside nvk_push_val as well +- nvk: reduce pitch even further in CmdFillBuffer +- nvk: support multiple miplevels +- nvk: support array blits over multiple layers +- nvk: tiling prep work for VK_EXT_image_2d_view_of_3d +- nouveau/ws: make sure we don't submit nonsense +- nouveau/ws: assert on broken channel +- nvk/blit: assert that formats are supported +- nouveau/headers: Generate parser functions +- nouveau/ws: initial debugging options for command submissions +- nouveau/ws: depend on generated class header files +- nouveau/ws: get rid of libdrm +- nouveau/ws: use new NVIF interface to query oclasses +- nvk: set deviceName +- nouveau/headers: add path for 3D headers +- nouveau/headers: initial 3D headers import +- nouveau/ws: allocate 3D subchan +- nouveau/ws: allocate copy subchan as well +- nouveau/ws: add API to query if the context was killed +- nouveau/ws: add a bo unmap helper function +- nvk: clean up bo mappings +- nouveau/ws: bound check nouveau_ws_push_append +- nouveau/ws: rework refing push buffer bos +- nouveau/ws: push chaining +- nvk: fix OOB read inside nvk_get_va_format +- nvk: alloc a zero page and use it for vertex runouts +- nvk: fix zero page refing +- nvk: support exporting buffers +- nvk: fix some class version checks +- nvk: properly align shaders pre Turing +- nvk: rework QMD handling to support pre Turing +- nvk: align desc root table +- nvk: Use SET_PIPELINE_PROGRAM pre-Volta +- nvk: properly align slm size +- nvk: use remaps for image copies +- nvk: reduce pitch for FillBuffer +- nvk: bind more subchans in init_context_state +- nvk: support pre Maxwell Texture Headers +- nvk/device: fix order of error handling +- nvk: allocate VAB memory area +- nvk: wire up M2MF for Fermi +- nouveau/mme: add test for BEQ with magic exit offset +- nouveau/mme: add a macro exit helper +- nvk: Add a macro to set MMIO registers via falcons +- nouveau/winsys: fix SM value for Ada +- nvk: fix num_gprs for Volta+ +- nvk: replace mp with tpc +- nvk: properly calculate SLM region by taking per arch limits into account +- nouveau: fix max_warps_per_mp_for_sm for builds with asserts disabled +- nvk: enable fp helper invocations loads on more gens +- nv50/ir: use own info struct for sys vals +- nv50/ir: convert system values to gl_system_value +- nouveau/mme: fix OOB access inside while_ine builder test +- nouveau/mme: fix OOB inside tu104 simulator +- clc: use CLANG_RESOURCE_DIR for clang's resource path +- nv50: fix code uploads bigger than 0x10000 bytes +- nouveau: take glsl_type ref unconditionally +- rusticl/kernel: optimize nir between lowering io and explicit types +- nv50: limit max code uploads to 0x8000 +- zink: fix source type in load/store scratch +- zink: fix global stores +- zink: update some compute caps +- rusticl: add debug option to sync every event +- rusticl/device: _MAX_CONST_BUFFER0_SIZE is unsigned +- ci: disable a660 jobs +- nir: make workgroup_id 32 bit only +- nir: make num_workgroups 32 bit only +- ac: drop 64 bit handling for cl workgroup intrinsics +- gallivm/nir: drop 64 bit handling for cl workgroup intrinsics +- intel/compiler: drop 64 bit handling for cl workgroup intrinsics +- panfrost: drop 64 bit handling for cl workgroup intrinsics +- rusticl: reduce global_invocation_id_zero_base to 32 bit +- panfrost: drop pan_nir_lower_64bit_intrin +- rusticl/disk_cache: fix stack corruption +- rusticl/query: fix use-after-free, but also fix incorrect usage of unsafe +- rusticl/event: disable profiling for devices without timestamps +- rusticl/queue: properly implement clCreateCommandQueueWithProperties +- rusticl/memory: do not verify pitch for IMAGE1D_BUFFER +- rusticl/memory: only specify PIPE_BIND_SHADER_IMAGE where supported +- asahi: fetch available system memory +- asahi: lower hadd +- asahi: handle kernels +- asahi: handle load_workgroup_size +- asahi: handle load_global_invocation_id_zero_base +- asahi: implement get_compute_state_info +- asahi: implement set_global_binding +- asahi: implement clear_buffer +- asahi: gracefully handle allocating linear images +- asahi: handle images in is_format_supported +- rusticl/memory: fallback if allocating linear images fails +- rusticl: enable asahi +- rusticl/mesa: create contexts with PIPE_CONTEXT_NO_LOD_BIAS +- docs/features: cl_khr_3d_image_writes needs driver support +- rusticl/mesa: fix \`set_constant_buffer` when passing an empty buffer +- rusticl/kernel: skip adding global id offsets if not used +- meson/rusticl: add sha1_h +- rusticl/mesa/context: fix clear_sampler_views +- nir: add nir_lower_alu_vec8_16_srcs pass +- zink: lower vec8/16 +- rusticl/mesa: create COMPUTE_ONLY contexts +- rusticl: fix clippys bool_to_int_with_if +- rusticl/memory: fix potential use-after-free in clEnqueueSVMMemFill +- nir/load_libclc: fix libclc memory leak +- rusticl/kernel: Fix creation from programs not built for every device +- ci: add half-life 2 freedreno flake +- zink: implement get_compute_state_info +- zink: copy has_variable_shared_mem cs property +- zink: pass entire pipe_grid_info into zink_program_update_compute_pipeline_state +- zink: refactor spec constant handling +- zink: variable shared mem support +- zink: support more nir opcodes +- zink: make spirv_builder_emit_*op compatible with spec constants +- zink: support samplers with unnormalized_coords +- zink: implement remaining pack ops via bitcast +- zink: fix RA textures +- zink: fix load/store scratch offsets +- rusticl/mesa/screen,device: add driver_name +- rusticl: enable zink +- pipe-loader: allow to load multiple zink devices +- rusticl: bump rustc version to 1.66 +- rusticl/mesa/nir: mark more methods as mut +- rusticl/mesa/nir: Mark NirShader and NirPrintfInfo as Send and Sync +- rusticl/mesa: mark PipeResource as Send and Sync +- rusticl/mesa: mark PipeTransfer as Send +- rusticl/cl: mark _cl_image_desc as Send and Sync +- rusticl/queue: get rid of pointless Option around our worker thread handle +- rusticl/queue: make it Sync +- rusticl/kernel: get rid of Arcs in KernelDevStateVariant +- rusticl/memory: use get_mut instead of lock in drop +- zink: implement PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS +- rusticl/api: remove cl_closure macro +- zink: implement load_global_constant +- zink: properly emit PhysicalStorageBufferAddresses cap +- nir/lower_mem_access_bit_sizes: fix invalid shift bit_size +- rusticl/device: restrict 1Dbuffer images for RGB and RGBx +- rusticl/memory: use PIPE_BUFFER for IMAGE1D_BUFFER images +- rusticl/format: disable all sRGB formats +- asahi: flush denorms on exact fmin/fmax +- zink: wrap shared memory blocks in a struct +- zink: properly alias shared memory +- zink: fix zink_destroy_screen for early screen creation fails +- docs/features: remove empty lines confusing mesamatrix +- rusticl/device: restrict image_buffer_size +- rusticl/device: restrict param_max_size further +- rusticl/mem: properly set pipe_image_view::access +- zink: lower fisnormal as it requires the Kernel Cap +- radv: fix buffers in vkGetDescriptorEXT with size not aligned to 4 +- rusticl/queue: Only take a weak ref to the last Event +- rusticl/mesa: pass PIPE_BIND_LINEAR in resource_create_texture_from_user +- zink: deallocate global_bindings array +- rusticl/mesa/screen: do not derefence the entire pipe_screen struct +- nvc0: implement PIPE_CAP_TIMER_RESOLUTION +- rusticl/queue: do not send empty lists of event to worker queue +- rusticl/queue: fix implicit flushing of queue dependencies + +Kenneth Graunke (21): + +- iris: Re-emit 3DSTATE_DS for each primitive (workaround 14019750404) +- intel/compiler: Fix sparse cube map array coordinate lowering +- intel/compiler: Respect NIR_DEBUG_PRINT_INTERNAL for DEBUG_OPTIMIZER +- intel/fs: Account for payload GRFs when calculating register pressure +- intel/compiler: Move SCHEDULE_NONE handling into schedule_instructions() +- intel/fs: Index scheduler mode string table by mode enum +- intel/fs: Make helpers for saving/restoring instruction order +- intel/fs: Pick the lowest register pressure schedule when spilling +- intel/fs: Dump IR for pre-RA scheduler modes in DEBUG_OPTIMIZER +- iris: Check prog[] instead of uncompiled[] for BLORP state skipping +- nir: Fix function parameter indentation in nir_opt_barriers.c +- nir: Add an optimization pass to reduce barrier modes +- nir: Reduce the scope of shared memory barriers +- lavapipe: Don't delete control barriers +- virgl, nir_to_tgsi: Add a hack for promoting partial memory barriers +- dxil: Set UAV_FENCE_THREAD_GROUP any time global isn't required +- glsl: Use nir_opt_barrier_modes() to drop unnecessary barriers +- anv: Use nir_opt_barrier_modes() to drop unnecessary barriers +- mesa: Fix zeroing of new ParameterValues array entries when growing +- intel/fs: Fix Xe2 URB read/lowering with per-slot offsets +- anv: Add support for a transfer queue on Alchemist + +Kevron Rees (1): + +- Force vk vendor for spider-man remastered + +Konrad Dybcio (5): + +- freedreno: Set magic writes per-GPU, using existing data +- freedreno: Include speedbin fallback in 740 chipid to fix probing +- freedreno: Include speedbin fallback in 730 chipid to fix probing +- freedreno: Include speedbin fallback in 690 chipid to fix probing +- freedreno: Add Adreno 643 + +Konstantin Seurer (95): + +- radv: Stop using the misleading round_up_u* functions +- radv/meta_buffer: Stop setting RADV_META_SAVE_DESCRIPTORS +- radv/meta_buffer: Rename size_minus16 to max_offset +- llvmpipe: Fix compiling with LP_USE_TEXTURE_CACHE +- nir/tests: Refactor boilerplate into a common header +- nir/tests: Use a single binary +- draw: Do not restart the primitive_id at 0 +- gallivm: Fix subsampled format sampling under Vulkan +- gallivm: Ignore nir_tex_src_plane +- lavapipe: Remove dummy sampler ycbcr conversion +- lavapipe: Store immutable_samplers as lvp_sampler array +- lavapipe: Fix binding immutable samplers with desc buffers +- lavapipe: Implement samplerYcbcrConversion +- lavapipe: Advertise samplerYcbcrConversion +- llvmpipe: Zero extend vectors in widen_to_simd_width +- vulkan: Add a generated vk_properties struct +- radv: Use common physical device properties +- clang-format: Disable formatting by default +- lavapipe: Use common physical device properties +- nir/from_ssa: Don't insert store_reg instructions before phis +- gallivm: Run nir_convert_to_lcssa before nir_convert_from_ssa +- lavapipe/ci: Remove descriptor_indexing fails +- radv/rt: Rename shader_pc and next_shader +- radv/rt: Rename traversal_shader to traversal_shader_addr +- nir/opt_large_constants: Handle small float arrays +- bin: Update spirv sources +- vulkan: Allow beta extensions for physical device features +- vulkan: Allow beta extensions for physical device properties +- vulkan Add enqueue entrypoint for CmdDispatchGraphAMDX +- nir: Add shader enqueue data structures and handling +- spirv: Update headers and grammer JSON +- spirv: Implement SPV_AMDX_shader_enqueue +- lavapipe: Add lvp_pipeline_type +- lavapipe: Implement exec graph pipelines +- lavapipe: Implement AMDX_shader_enqueue commands +- lavapipe: Advertise AMDX_shader_enqueue +- radv: Add internal_nodes_offset to scratch_layout +- radv: Remove leaf_args::dst_offset +- radv/rt: Remove some dead code +- radv/rt: Do not apply stack_ptr for non-recursive stages +- radv/rt: Add and use radv_build_traversal +- radv/rt: Insert rt_return_amd before lowering shader calls +- radv/rt: Split stage initialization and hashing +- aco: Do not fixup registers if there are no shader calls +- radv: Stop updating the stack_size in insert_rt_case +- lavapipe: Lock around CSO destroys +- vulkan/wsi/x11: Implement capture hotkey using the keymap +- venus: Use the common GetPhysicalDeviceFeatures2 implementation +- nir/lower_shader_calls: Limit the remat chain length +- lavapipe: Avoid lowering shaders twice +- lavapipe: Fix the locking around cso destruction +- aco/validate: Handle p_wqm like p_parallelcopy +- aco: Use bytes() instead of size() in emit_wqm +- aco: Unify demote and demote_if selection +- radv: Only generate debug info if required +- aco/lower_to_cssa: Fix typo +- radv: Don't use the depth image view for depth bias emission +- radv/rt: Store NIR shaders separately +- radv/rt: Add monolithic raygen lowering +- radv/rt: Enable monolithic pipelines +- radv/ci: Document new flake +- vulkan/properties: Handle unsized arrays properly +- radv: Remove dead radix_sort_vk_get_memory_requirements call +- radv/radix_sort: Vendor the radix sort dispatch code +- radv: Perform multiple sorts in parallel +- radv/ci: Improve ray tracing skips +- ac/llvm: Fix typed loads with 16bit formats +- ac/llvm: Use the correct return type for uadd_carry and usub_borrow +- ac/llvm: Use float types for float atomics +- radv: Don't advertise features requiring PS epilogs with LLVM +- radv: Update navi21 llvm fails +- radv/rt: Handle stages without nir properly +- radv: Remove ray tracing shader module identifier skips +- radv/bvh: Treat instances with mask == 0 as inactive +- radv/ray_queries: Skip cull_mask handling if it is FF +- radv/rt: Skip cull_mask handling if it is FF +- aco/spill: Make sure that offset stays in bounds +- nir: Add nir_cf_node_cf_tree_prev +- nir: Add nir_foreach_block_in_cf_node_reverse +- nir: Add nir_rematerialize_deref_in_use_blocks +- nir/lcssa: Fix rematerializing derefs +- nir/deref: Layer rematerialization helpers +- lavapipe/ci: Fix asan expectations +- hasvk: Use the common GetPhysicalDeviceFeatures2 implementation +- vulkan: Remove vk_get_physical_device_core_1_*_feature_ext +- radv/bvh/ploc: Load child bounds from LDS +- radv: Merge the sync_data and header initialization +- radv: Do not sync after radv_update_buffer_cp +- zink: Initialize primitive types to an invalid value +- nir/passthrough_gs: Support edge flags with points +- zink: Enable edge flags with points +- mesa: Fix glBegin/End when LINE_LOOP is not supported +- llvmpipe: Compile a nop texture function for unsupported configurations +- radv/rt: Use nir_shader_instructions_pass for lower_rt_instructions +- radv/sqtt: Fix tracing acceleration structure commands + +Lang Yu (5): + +- amd/common: add AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 property +- radeonsi: use AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 to determine wave size +- radeonsi: use wave size to determine index stride +- amd/common: add missing stuff for gfx11.5 +- amd/radeonsi: add missing stuff for gfx11.5 + +Leandro Ribeiro (13): + +- egl: rewrite outdated comment in _eglFindDevice() +- egl: remove unused parameter from _eglAddDRMDevice() +- egl: simplify _eglAddDRMDevice() +- egl: make explicit that we don't support render nodes for software EGLDevice +- egl: move is_render_node flag to platform_wayland +- loader: rename loader_open_render_node() to loader_open_render_node_platform_device() +- loader: add driver list as parameter in loader_open_render_node_platform_device() +- pipe-loader: add pipe_loader_get_compatible_render_capable_device_fd() +- dri: add queryCompatibleRenderOnlyDeviceFd() to __DRI_MESA extension +- kmsro: try to use only compatible render-capable devices +- loader: add loader_is_device_render_capable() +- egl/drm: get compatible render-only device fd for kms-only device +- egl: error out if we can't find an EGLDevice in _eglFindDevice() + +Leo Liu (4): + +- radeonsi: add AV1 profile to supported profile +- radeonsi/vcn: fix the incorrect dt_size +- Revert "frontends/va: Also map VAImageBufferType for reading" +- ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG + +Lina Versace (14): + +- docs: Add row for VK_KHR_maintenance5 +- intel/pci_ids: Consistently use lowercase +- venus: Sync protocol for VK_EXT_graphics_pipeline_library +- venus: Erase pViewports and pScissors in fewer cases +- venus: Fix crash when VkGraphicsPipelineCreateInfo::layout is missing +- venus: Fix subpass attachments +- venus: Drop incorrectly-used always-true pipeline vars +- venus: Use VkImageAspectFlags in vn_subpass +- venus: Add enum vn_pipeline_type +- venus: Renames for VkGraphicsPipelineCreateInfo fixes +- venus: Refactor pipeline fixup into two stages +- venus: Do pipeline fixes for VK_EXT_graphics_pipeline_library +- venus: Enable VK_EXT_graphics_pipeline_library behind debug flag +- venus: Fix -Wmaybe-uninitialized + +LingMan (22): + +- rusticl/memory: fix potential use-after-free in clEnqueueSVMFree +- rusticl: Rename XyzCB aliases to FuncXyzCB +- rusticl: add structs to hold the C callbacks +- rusticl: use CreateContextCB +- rusticl: use DeleteContextCB +- rusticl: use EventCB +- rusticl: use MemCB +- rusticl: use ProgramCB +- rusticl: use SVMFreeCb +- rusticl: Make EventSig take ownership of its environment +- rusticl: add a safe abstraction to execute a DeleteContextCB +- rusticl: add a safe abstraction to execute an EventCB +- rusticl: add a safe abstraction to execute a MemCB +- rusticl: add a safe abstraction to execute an SVMFreeCb +- rusticl: add a safe abstraction to execute a CreateContextCB +- rusticl: add a safe abstraction to execute a ProgramCB +- rusticl/api: drop a few include paths +- rusticl: mark the fields of callback structs private +- rusticl: drop an \`#[allow(dead_code)]` marker +- rusticl/core: don't take a lock while dropping \`Context` +- rusticl: Show an error message if the build is attempted with an outdated bindgen version +- rusticl: Show an error message if the version of bindgen can't be detected + +Lionel Landwerlin (169): + +- anv: hide exec_flags selection inside the i915 backend +- isl: add a tool to query surface parameters +- intel/fs: fix missing predicate on SEL instruction +- intel/compiler: rework input parameters +- ci/a530: switch a few tests to flakes to unblock CI +- vulkan: bump header register to 1.3.258 +- intel/fs: don't try to rebuild sequences of non ssa values +- intel/vec4: fix log_data pointer +- intel/fs: consider UNDEF as non-partial write +- intel/fs: add more UNDEFs around SEND messages +- isl: add ability to store buffer size in unused RENDER_SURFACE_STATE fields +- anv: simplify buffer address+size loads from descriptor buffer +- intel/fs: add support for sparse accesses +- intel/nir: handle image_sparse_load in storage format lowering +- intel/nir: add lower for sparse images & textures +- anv: wire image sparse loads +- blorp: switch blorp_update_clear_color to early return +- blorp: update and move fast clear PIPE_CONTROLs to drivers +- anv: fix 3DSTATE_RASTER::APIMode field setting +- anv: enable EDS3 ConservativeRasterizationMode +- vulkan: skip non required extension structures +- vulkan/runtime: add a layered implementation of vkCmdBindIndexBuffer +- anv: enable INTEL_DEBUG=nofc +- anv: fake non intel vendorID for Death Stranding +- hasvk: fix null descriptor handling with A64 messages +- anv: remove descriptor array bounds checking +- hasvk: remove descriptor array bounds checking +- anv/hasvk: track robustness per pipeline stage +- anv: implement VK_EXT_pipeline_robustness +- intel/fs: track more steps with INTEL_DEBUG=optimizer +- intel/fs: add variable for output of debug backend optimizer +- intel/decoder: constify some input parameters +- blorp: drop programming of 3DSTATE_(MESH|TASK)_SHADER +- anv: emit 3DSTATE_GS only once per pipeline +- intel/decoder: add options to decode surfaces/samplers +- anv: get rid of genX(emit_multisample) +- anv: move genX(rasterization_mode) to gfx8_cmd_buffer.c +- anv: don't try to access dynamic buffers from surface states +- iris: ensure stalling pipe control before fast clear +- intel/compiler: disable per-sample interpolation modes with non-per-sample dispatch +- intel/compiler: fix dynamic alpha-to-coverage handling +- intel/fs: implement dynamic interpolation mode for dynamic persample shaders +- intel/fs: move lower of non-uniform at_sample barycentric to NIR +- zink+anv: add regression testing with pipeline libraries +- anv: implement vkCmdBindIndexBuffer2KHR +- anv: handle new VkBufferViewUsageCreateInfoKHR +- anv: add vkGetRenderingAreaGranularityKHR() +- anv: implement GetDeviceImageSubresourceLayoutKHR/GetImageSubresourceLayout2KHR +- anv: add maintenance5 A8_UNORM/A1B5G5R5_UNORM support +- anv: deal with new pipeline flags +- anv: enable KHR_maintenance5 +- anv: add missing ISL storage usage +- genxml/gfx11: remove Tiled Resource Mode field from HIER_DEPTH_BUFFER +- genxml/gfx12: rename Tiled Resource Mode +- isl: program 3DSTATE_HIER_DEPTH_BUFFER_BODY::TiledMode as documented +- intel/isl: Disallow Yf, Ys and Tile64 for 3D depth/stencil surfaces +- isl: disable Yf/Ys/Tile64 tilings for 1D images +- isl: add a usage flag to request 2D/3D compatible views +- isl: disallow TileYs/Yf on 3D storage images on Gfx9/11 +- intel/isl: Add a max_miptail_levels field to isl_tile_info +- isl: make isl_surf_get_uncompressed_surf robust to argument accesses +- isl: add Gfx12/12.5 restriction on 3D surfaces & compression +- isl: disallow miptails on planar formats +- isl: disable miptails on gfx12 with yuv formats +- isl: disable CCS on Ys/Yf +- blorp: allow 3D blits/copies on Ys/Yf/Tile64 tiling +- intel/aux_map: correctly program tiling mode for Ys +- isl: reorder tiling selection +- anv: enable standard Y tiles +- isl/tilememcpy_test: add multiple tile testing +- anv: rename total_batch_size +- anv: reuse cmd_buffer::total_batch_size +- intel/measure: track batch buffer sizes +- intel/nir: rerun lower_tex if it lowers something +- intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL +- hasvk: add state cache invalidation back before fast clears +- blorp: remove unused variable +- anv: remove ReorderMode from pipeline 3DSTATE_GS emission +- anv: change anv_batch_emit_merge to also do packing +- intel/anv: batch stats util +- intel/decoder: implement accumulated prints +- anv: move all dynamic state emission to cmd_buffer_flush_dynamic_state +- anv: rename files to represent their usage +- anv: categorize partial/final pipeline instruction +- anv: split 3DSTATE_TE packing between static & dynamic parts +- anv: split 3DSTATE_VFG emission +- anv: add a flag tracking occlusion query count change +- anv: split pipeline programming into instructions +- vulkan/runtime: add helper to name dirty states +- anv: add new low level emission & dirty state tracking +- anv: remove unused state emission +- anv: split BLEND_STATE packing from BLEND_STATE_POINTERS emit +- docs: update Anv documentation about dynamic state emission +- anv: create individual logical engines on i915 when possible +- anv: Copy/Clear MSAA images over companion RCS while we are on compute +- pps-producer: add ability to select device with DRI_PRIME +- anv: remove aux checking asserts +- anv: bound image usages to the associated queue family +- anv: fix 3DSTATE_VFG emission +- anv: emit 3DSTATE_URB_ALLOC_(MESH|TASK) only when mesh shaders are enabled +- anv: ensure mesh pipeline have all pre-rasterization stages disabled +- anv: ensure partially packed instructions are emitted in the pipeline +- anv: fix missing 3DSTATE_SBE_MESH emission +- anv: fix utrace timestamp buffer copies +- anv: add a memcpy compute internal kernel +- anv: add simple shader support without a command buffer +- anv: move simple shaders code to its own object +- anv: move utrace flush out of backends +- anv: enable utrace timestamp buffer copies on compute engine +- intel: don't assume Linux minor dev node +- intel/ds: lock submissions to u_trace_context +- util/u_trace: count number of tracepoints +- intel/ds: track number of tracepoint timestamp copies +- anv/utrace: trace CPU on timestamp buffer readiness +- intel/ds: avoid dropping traces when running out of shared memory +- anv/iris: widen Wa_14015946265 to Gfx11+ +- anv: add missing workaround for 3DSTATE_LINE_STIPPLE +- iris: add missing workaround for 3DSTATE_LINE_STIPPLE +- intel/fs: handle ishl in surface/sampler rematerialization +- intel/fs: handle add3 in surface/sampler rematerialization +- intel/fs: switch from SIMD 1 to 8 instructions surface/sampler rematerialization +- anv: fix internal compute copy shader build +- anv: reduce working temporary memory for BVH builds +- anv: move bo_pool allocation flags to init caller +- anv: use buffer pools for BVH build buffers +- intel/ds: track acceleration RT commands +- anv: fix index buffer size programming +- anv: implement INTEL_DEBUG=reemit +- anv: add missing workaround handling in simple shader +- anv: fix a couple of missing input for 3DSTATE_RASTER programming +- anv: flag 3DSTATE_RASTER as dirty after simple shader primitive +- vulkan: bump headers/registry to 1.3.267 +- anv: rename primary in container in ExecuteCommands() +- anv: add support for VK_EXT_nested_command_buffer +- anv: simplify push descriptors +- anv: fixup spirv cap for ImageReadWithoutFormat on Gfx12.5 +- Revert "intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL" +- anv: update batch chaining to Gfx9 commands +- anv: workaround Gfx11 with optimized state emission +- u_trace: generate tracepoint index parameter in perfetto callbacks +- u_trace: generate tracepoint name array in perfetto header +- intel/ds: provide names for different events of a timeline's row +- anv: reuse local variable for gfx state +- anv: track render targets & render area changes separately +- anv: don't uninitialize bvh_bo_pool is not initialized +- anv: uninitialize queues before utrace +- anv: move generation shader return instruction to last draw lane +- anv: fix generated draws gl_DrawID with more than 8192 indirect draws +- anv: extract out draw call generation +- anv: identify internal shader in NIR +- anv: avoid MI commands to copy draw indirect count +- anv: move generation batch fields to a sub-struct +- util/glsl2spirv: add ability to pass defines +- anv: factor out host/gpu internal shaders interfaces +- anv: index indirect data buffer with absolute offset +- anv: add ring buffer mode to generated draw optimization +- anv: merge gfx9/11 indirect draw generation shaders +- anv: document the draw indirect optimization ring mode +- anv: fixup 32bit build of internal shaders +- anv: fix uninitialized use of compute initialization batch +- intel/fs: fix dynamic interpolation mode selection +- anv/meson: add missing dependency on the interface header +- anv: fix corner case of mutable descriptor pool creation +- isl: disable MCS compression on R9G9B9E5 +- intel/fs: rerun divergence analysis prior to convert_from_ssa +- intel/nir/rt: fix reportIntersection() hitT handling +- anv: fix CC_VIEWPORT pointer dirty after blorp/simple-shaders +- anv: fix dirty state tracking for 3DSTATE_PUSH_CONSTANT_ALLOC +- intel/perf: fix querying of configurations + +Louis-Francis Ratté-Boulianne (15): + +- panfrost: Fix error in comment +- panfrost: Add methods to determine slice and body alignment +- panfrost: Add method to get size of AFBC subblocks +- panfrost: Precalculate stride and nr of blocks for AFBC layouts +- panfrost: Add panfrost_batch_write_bo +- panfrost: Make panfrost_resource_create_with_modifier public +- panfrost: Split out internal of \`panfrost_launch_grid` +- panfrost: Add infrastructure for internal AFBC compute shaders +- panfrost: Add method to get size of AFBC superblocks valid data +- panfrost: Add support for AFBC packing +- panfrost: Legalize resource when attaching to a batch +- panfrost: Don't force constant modifier after converting +- panfrost: Add debug flag to force packing of AFBC textures on upload +- panfrost: Add some debug utility methods for resources +- panfrost: Add env variable for max AFBC packing ratio + +Lucas Stach (33): + +- ci/etnaviv: update ci expectation +- etnaviv: move resource seqnos to level +- etnaviv: flush destination before executing blit +- etnaviv: optimize resource copies by skipping clean levels +- etnaviv: add helper to mark resource level as flushed +- etnaviv: add helper to mark resource level as changed +- etnaviv: add helper to transfer resource level age to another +- etnaviv: add helper to get TS validity +- etnaviv: add helper to set TS validity +- etnaviv: move TS meta into etna_resource_level +- etnaviv: add tile status buffer status into TS metadata +- etnaviv: optimize sampler source update +- etnaviv: allow sampler TS even if the resource is flushed +- etnaviv: keep blit destination tile status valid if possible +- etnaviv: optimize render resource update +- etnaviv: optimize transfers when whole resource level is discarded +- etnaviv: split etna_copy_resource_box levels parameter in src/dst +- etnaviv: don't allocate full resource as transfer staging +- etnaviv: check for valid TS as condition to create the staging resource +- etnaviv: reword comment about staging resource usage +- etnaviv: remove huge outdated comment +- etnaviv: move buffer range tracking into the PIPE_MAP_WRITE clause +- etnaviv: remove superfluous braces +- etnaviv: remove always true assert in etna_transfer_unmap +- etnaviv: remove bogus comment about replacing resource storage +- etnaviv: initialize VIVS_GL_BUG_FIXES +- etnaviv: fix read staging buffer leak +- Revert "ci/etnaviv: allow failure on failing test" +- mesa: enable NV_texture_barrier in GLES2+ (again) +- etnaviv: use correct blit box sizes when copying resource +- etnaviv: zero shared TS metadata block +- Revert "etnaviv: use correct blit box sizes when copying resource" +- mesa: add GL_APPLE_sync support + +Luigi Santivetti (1): + +- pvr: do not claim support for ASTC texture compression + +M Henning (31): + +- nv50/ir: Drop nir_jump_return handling +- nv50/ir: Remove ArgumentMovesPass +- nv50/ir: Remove Function.stackPtr +- nv50/ir: Remove dead loop from assignSlot +- nv50/ir: Remove SpillSlot +- nvc0: Keep nir directly in nvc0_program +- nv50: Keep nir directly in nv50_program +- nouveau: Delete nv50_ir_from_tgsi.cpp +- nouveau: Drop tgsi support from nv50_ir_prog_info +- nouveau: Drop ConverterCommon::Subroutine +- nouveau: Drop BuildUtil::DataArray +- nouveau: Drop BuildUtil::Location +- nouveau: Delete the nouveau_compiler tool +- nv/codegen: Call nir_shader_gather_info +- nv/codegen: Implement nir_op_fquantize2f16 +- nvk: Remove reference to genUserClip +- nv/codegen: Use nir_lower_clip +- nv50_ir_from_nir: Use nir's lower_fpow +- nv/codegen: Delete OP_POW +- nv/codegen: Fix an uninitialized variable warning +- nv/codegen: Delete OP_WRSV +- nv/codegen: Delete OP_EXP, OP_LOG +- nv/codegen: Remove fragCoord variable. +- nv/codegen: Merge from_common into from_nir +- nv/codegen: Remove unused clipVertexOutput var +- nv50_ir_ra: Delete unused functions +- nv/codegen: Delete unused OP_CONSTRAINT +- nv/codegen: Delete periodicMask32 +- nv/codegen: Remove Function::buildDefSets +- nv/codegen: Change copy-constructor call to assign +- nv/codegen: Delete copy and assign + +Maaz Mombasawala (2): + +- svga: Make surfaces shareable at creation. +- svga: Unify gmr and mob surface pool managers + +Marcin Ślusarz (16): + +- iris: avoid duplicating validation entries +- hasvk: remove dead code & comments related to mesh shading +- anv: drop support for VK_NV_mesh_shader +- intel/compiler: remove NV_mesh_shader support +- intel/compiler: remove redundant code +- anv: drop unused function +- anv: merge cases leading to the same code +- intel/compiler/mesh: compactify MUE layout +- intel/compiler,anv: put some vertex and primitive data in headers +- intel/compiler: load debug mesh compaction options once +- intel/compiler/test: fix crashes when TEST_DEBUG is set +- intel/compiler: add lsc_msg_desc_wcmask +- intel/compiler: add initial support for URB_LOGICAL_SRC_CHANNEL_MASK to lower_urb_write_logical_send_xe2 +- intel/compiler/mesh: fix position of output URB handle for xe2 +- intel/compiler/mesh: implement IO for xe2 +- intel/compiler: mask GS URB handles at thread payload construction + +Marek Olšák (125): + +- Revert "ac/nir/ngg: Follow intrinsic sources when analyzing before culling." +- glthread: determine global locking once every 64 batches to fix get_time perf +- mesa: fix 38% decrease in display list performance of Viewperf2020/NX8_StudioAA +- freedreno,lima,zink: update CI fixes and flakes +- util/u_queue: fix util_queue_finish deadlock by merging lock and finish_lock +- util/u_queue: always enable UTIL_QUEUE_INIT_SCALE_THREADS, remove the flag +- radeonsi: fix a CDNA regression breaking compute +- glthread: sync for VDPAU sync functions +- radeonsi: turn sh_base[PIPE_SHADER_VERTEX] into a constant in emit_draw_packets +- radeonsi: restructure the loop for non-indexed multi draws +- radeonsi: cosmetic changes to radeon_opt_* macros +- radeonsi: handle draw user SGPRs as tracked registers +- radeonsi: update obsolete comments about compiler queues +- radeonsi: remove si_compute.h, move the contents into si_pipe.h +- radeonsi: move si_update/emit_tess_io_layout_state into si_state_shaders.cpp +- radeonsi: move si_emit_spi_map into si_state_shaders.cpp +- radeonsi: move si_emit_rasterizer_prim_state out of si_emit_all_states +- radeonsi: remove splitting IBs that use too much memory +- radeonsi: add padding to si_resource to fix Viewperf2020/catiav5test1 perf +- radeonsi: remove unused check_mem parameter from si_sampler_view_add_buffer +- radeonsi: remove the draw counter with primitive restart from the HUD +- radeonsi: always inline si_prefetch_shaders +- radeonsi: specialize si_draw_rectangle using a C++ template +- radeonsi: add index parameter into si_atom::emit +- radeonsi: split direct pm4 emission from si_pm4_emit +- radeonsi: move code around si_pm4_emit_state into si_pm4_emit_state +- radeonsi: merge pm4 state and atom emit loops into one +- radeonsi: add a simple version of si_pm4_emit_state for non-shader states +- radeonsi: handle deferred cache flushes as a state (si_atom) +- radeonsi: remove render condition logic from si_draw by reordering atoms +- radeonsi: abort when failing to upload descriptors instead of skipping draws +- radeonsi: rename shader_pointers state -> gfx_shader_pointers +- radeonsi: merge si_upload_*_descriptors into si_emit_*_shader_pointers +- radeonsi: convert si_gfx_resources_add_all_to_bo_list to a state atom +- radeonsi/ci: update gfx11 failures +- radeonsi: move GE_CNTL emission from si_draw into si_emit_vgt_pipeline_state +- radeonsi: use num_patches_per_workgroup directly in si_get_ia_multi_vgt_param +- radeonsi: enable shader culling by default because it helps Viewperf +- radeonsi: rewrite how occlusion query precision is determined for performance +- radeonsi: set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET on aux_context explicitly +- radeon_winsys: move allow_context_lost from cs_create to ctx_create +- winsys/amdgpu: rework how SW reset status is generated and reported +- radeon_winsys: add a ctx_set_sw_reset_status callback +- radeonsi: don't abort for descriptor failures, let the winsys handle it +- radeonsi: don't use threadID.yz/blockID.yz for copy_image if those are always 0 +- radeonsi: don't use threadID.yz/blockID.yz for compute_blit if they're always 0 +- nir: fix constant evaluation of fddx/fddy sourcing Inf & NaN constant +- nir/algebraic: collapse ALU opcodes sourcing NaN +- ac/gpu_info: add the /dev/dri/ filename into radeon_info +- Revert "ac: don't call ac_query_pci_bus_info from ac_query_gpu_info" +- ac: implement AMD_FORCE_FAMILY properly, remove SI_FORCE_FAMILY +- ac: document ac_shader_args::gs_vtx_offset +- ac: minor updates to packet documentation and definitions +- ac: change offsets of DMA_DATA dwords to prevent reg offset conflicts +- ac: improve the IB parser +- ac: update gfx11 shadowed register tables +- ac: add a standalone IB parser program +- ac/surface: trivial non-functional changes +- ac/surface: add radeon_surf::u::gfx9::uses_custom_pitch +- radeonsi: allow setting any index in radeon_set_sh_reg_idx +- radeonsi: rename uses_subgroup_info to uses_tg_size +- radeonsi: improve the heuristic when to use Wave32 for compute shaders +- radeonsi: simplify/merge emit_shader_ngg functions +- radeonsi: don't pass gl_Layer to PS for blit shaders +- radeonsi/gfx11: pass attribute ring addr via SGPR instead of memory for blits +- radeonsi: fix templated si_draw_rectangle callback for Navi14 +- nir: replace undef only used by ALU opcodes with 0 or NaN +- nir: remove nir_op_unpack_64 handling from nir_opt_undef +- ac/llvm: don't convert undef to 0 because nir_opt_undef does it now +- meson: use llvm-config instead of cmake to fix linking errors with meson 1.2.1 +- gallivm: fix build with LLVM 18 +- amd/llvm: fix build with LLVM 18 +- radeonsi: fix compute-only contexts +- ac/llvm: replace removed amdgcn.ldexp for LLVM 18 +- ac/perfcounter: remove a bogus assert to fix an assertion failure on gfx11 +- ac/llvm: set !fpmath 3.0 for llvm.sqrt +- ac/gpu_info: don't align IBs to the GL2 cache line size +- ac/llvm: fix flat PS input corruption +- amd: rename GFX110x to NAVI31-33 +- ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments +- ac/gpu_info: pad IBs according to ib_size_alignment +- winsys/amdgpu: pad gfx and compute IBs with a single NOP packet +- Revert "radeonsi: specialize si_draw_rectangle using a C++ template" +- radeonsi/ci: update navi10 results +- gallium/util: fix GALLIUM_TESTS=1 by using cso_set_vertex_buffers_and_elements +- gallium/util: add more tests for compute-only contexts +- radeonsi: add another aux context for uploading shaders +- radeonsi: upload shaders via a staging buffer so as not to map VRAM directly +- ac/surface: don't require exact pitch for gfx6-8 tiled imports +- Revert "ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG" +- Revert "radv/amdgpu: fix alignment of command buffers" +- Revert "radv: fix alignment of DGC command buffers" +- Revert "winsys/amdgpu: pad gfx and compute IBs with a single NOP packet" +- Revert "ac/gpu_info: pad IBs according to ib_size_alignment" +- Revert "ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments" +- nir: sort variables by location in nir_lower_io_passes to work around a bug +- nir: recompute IO bases after DCE in nir_lower_io_passes +- nir: add dual-slot input information into load_input intrinsics +- nir: take dual slot input info into account when computing IO driver locations +- nir: gather dual slot input information +- nir: expose reusable linking helpers for cloning uniform loads +- nir: handle nir_var_mem_ubo in nir_clone_uniform_variable +- ac/gpu_info: split ib_alignment as ip[type].ib_alignment +- ac/gpu_info: move ib_pad_dw_mask into ip[] +- ac/gpu_info: drop the hack unifying all IB alignments +- ac/gpu_info: conservatively decrease IB alignment and padding to 256B +- ac/gpu_info: set gfx and compute IB padding to only 8 dwords +- winsys/amdgpu: properly pad the IB in amdgpu_submit_gfx_nop +- winsys/amdgpu: correctly pad noop IBs for RADEON_NOOP=1 +- winsys/amdgpu: pad gfx and compute IBs with only 1 NOP +- ac/gpu_info: don't allow register shadowing with SR-IOV due to bad performance +- radeonsi: disable register shadowing without SR-IOV to fix bad performance +- winsys/amdgpu: don't send CP_GFX_SHADOW chunk if shadow address is not set +- radeonsi/ci: update gfx1100 results +- nir: split FLOAT_CONTROLS_SIGNED_ZERO_INF_NAN_PRESERVE_FP* flags +- nir/algebraic: use only signed_zero_preserve_* for addition by 0 patterns, etc. +- mesa: don't pass Infs to the shader via gl_Fog.scale +- radeonsi/ci: update the runner for new build scripts +- radeonsi/ci: enable GTF tests in the runner +- radeonsi/ci: enable GLES CTS in the runner +- radeonsi/ci: update failures and flakes +- amd/common: update DCC for gfx11.5 +- radeonsi: initialize perfetto in the right place +- radeonsi/gfx11: don't set OREO_MODE to fix rare corruption +- nir: fix gathering TESS_LEVEL_INNER/OUTER usage with lowered IO + +Marek Vasut (1): + +- etnaviv: Fully replicate back stencil config + +Mark Collins (10): + +- tu/a7xx: Adapt r3d blits for A7xx +- freedreno/rnn: Remove %n usage in fprintf +- freedreno: Only add drm/computerator when system_has_kms_drm +- freedreno/decode: Support building replay for multiple KMDs +- freedreno+meson: Add lua+libarchive+libxml from Meson WrapDB +- meson: Warn about side-effects from DRM for FD KMDs +- meson: Update libarchive to v3.7.2-2 +- freedreno/common: Add max_sets property to A6xxGPUInfo +- tu: Support higher descriptor set count for A7XX +- tu,util/driconf: Add option to not reserve descriptor set + +Mark Janes (1): + +- intel: allow reduced memory usage for INTEL_MEASURE + +Martin Roukala (né Peres) (22): + +- radv/ci: drop the auto-reboot-on-hang for vkcts-navi10 +- radv/ci: use the default kernel on vkcts-navi10 +- zink/ci: automatically reboot when hitting a kernel BUG on vangogh +- zink/ci: document more flakes seen on vangogh +- radv/ci: move vkcts-navi10 testing to KWS +- radv/ci: add more tests to the navi10 vkcts flake list +- radv/ci: increase the parallelism of the vkcts-navi21 job +- radv/ci: add more tests to the navi21 vkcts flake list +- radv/ci/vkcts-navi21: catch all the line_stipple_(enable|params) flakes +- radv/ci/vkcts-navi21: document more flakes +- radv/ci/vkcts-navi10: catch all the line-related flakes +- radv/ci: update the vkcts gfx1100 flake/fail lists +- radv/ci: add a manual job to run vkcts on navi31 +- radv/ci: add a manual job for vkd3d-proton on navi31 +- ci/vkcts-vangogh: mark dEQP-VK.dynamic_rendering.primary_cmd_buff.basic.* as flake +- ci/vkcts-navi21: mark more of the RT handles checks as flakes +- ci: make B2C_JOB_VOLUME_EXCLUSIONS to all .b2c-test jobs +- zink/ci: remove 19 tests from the zink-radv-polaris10-fails list +- ci/b2c: switch containers to a back-up ahead of valve-infra renaming +- zink/ci: remove 42 tests from the zink-radv-polaris10-fails list +- radv/ci: tighten the vkcts-navi21 timeouts +- zink/ci: tighten the zink-radv-vangogh timeouts + +Martin Stransky (1): + +- llvmpipe: fix UAF in lp_scene_is_resource_referenced. + +Mary (6): + +- nouveau/mme: Add initial Fermi definition +- nouveau/mme: Add Fermi builder +- nouveau/mme: Add Fermi simulator +- nouveau/mme: Add Fermi hardware tests +- agx: Move nir_lower_fragcolor out of agx_preprocess_nir +- agx: Ensure to lower 1D image load/store to 2D + +Mary Guillemard (4): + +- nir: Add NVIDIA-specific geometry shader opcodes +- venus: skip bind sparse info when checking for feedback query +- zink: Check for VK_EXT_extended_dynamic_state3 before setting A2C +- venus: Do not submit batch manually when no feedback is required + +Matt Coster (21): + +- pvr: Pad rogue_regarray_cache_key union members to avoid UB +- pvr: Clean up extension tables +- pvr: Refactor pvr_GetPhysicalDeviceProperties2() +- docs: Fixup imagination/pvr extension support +- pvr: Add VK_KHR_get_display_properties2 +- pvr: Add VK_KHR_get_memory_requirements2 +- pvr: Add VK_KHR_get_surface_capabilities2 +- pvr: Print VkStructureType name on pvr_debug_ignored_stype() +- pvr: Add VK_KHR_copy_commands2 +- pvr: Don't override commands copied to new buffer when extending cs +- pvr: Do not require TA_STATE_HEADER.pres_ispctl_dbsc for {db,sc}enable +- pvr: Zero tail of cs buffers after linking when dumping cs +- pvr: Cleanup comments in pvr_physical_device_get_supported_*() +- pvr: Don't rely on GNU void pointer arithmetic +- pvr: Force compile error on GNU void pointer arithmetic +- pvr: Switch to common pipeline cache implementation +- pvr: Use vk_sampler base +- pvr: Clean up & fix sampler border color support +- pvr: Don't pass pvr_physical_device when only device info is needed +- pvr: Minor refactor of pvr_device.c +- pvr: Use common physical device properties + +Matt Turner (10): + +- Revert "intel/fs: only avoid SIMD32 if strictly inferior in throughput" +- intel: Rearrange for next commit +- intel: Consider with_intel_clc in with_any_intel +- intel: Only build blorp if drivers are enabled +- intel: Only build ds if drivers are enabled +- intel: Only build perf if drivers or tools are enabled +- intel: Allow using intel_clc from the system +- intel: Limit Intel Vulkan RT to x86_64 +- r600: Add missing dep on git_sha1.h +- util: Include stdint.h in libdrm.h + +Mauro Rossi (7): + +- nouveau/ws: fix building error in nouveau_ws_push_dump() +- vulkan/meta: fix gnu-empty-initializer build error +- nouveau/mme: fix print inst for case MME_FERMI_OP_MERGE +- anv/android: remove numFds check +- hasvk/android: remove numFds check +- Android.mk: filter out cflags to build with Android 14 bundled clang +- Android.mk: disable android-libbacktrace to build with Android 14 + +Mike Blumenkrantz (293): + +- ci: bump VVL to 1.3.257 +- zink: set pipeline dynamic state count after all dynamic states are set +- zink: set feedback attachments on batch init +- zink: be even dumber about buffer refs when replacing storage +- zink: emit SpvCapabilitySampleMaskPostDepthCoverage with SpvExecutionModePostDepthCoverage +- zink: fix the fix for separate shader program refcounting +- kopper: handle pixmap creation failure more gracefully +- glxsw: check geometry of drawables on creation +- kopper: move pixmap param for drawable creation to info struct +- glx/dri3: split out modifier check +- glx/sw: check for modifier support in the kopper path +- kopper: pass modifier availability to drawable creation +- kopper: determine modifier support per-drawable +- zink: don't clobber descriptor mode on multiple screen creation +- nir: fix slot calculations for compact variables with location_frac +- lavapipe: use the component offset directly for xfb +- nir: add a helper for calculating variable slots +- radv: bump max xfb output to 128 +- ir3: bump max xfb output to 128 +- gallium: bump PIPE_MAX_SO_OUTPUTS to 128 +- zink: add feedback loop exts to optimal profile +- glsl: only explicitly check GS components in PSIZ injection with output variables +- lavapipe: statically allocate fb attachment array +- lavapipe: zero fb attachment array at rp start +- lavapipe: don't check geometry for fb attachments +- lavapipe: be slightly more permissive for bad apps (and cts) with dynrender +- lavapipe: VK_EXT_host_image_copy +- zink: better handle separate shader dsl creation when no bindings exist +- zink: force image barriers after dmabuf import +- ci: bump VVL to 1.3.261 +- zink: use VK_WHOLE_SIZE when binding null db buffer descriptors +- zink: unset line stipple ds3 state flags when stipple not available +- nir/lower_io_to_scalar: fix 64bit io splitting +- nir/linking_helpers: force type matching in does_varying_match +- nir/print: print location names for (some) tess slots +- nir/print: always group variables by type when printing +- zink: add batch refs for transient images +- zink: fix zs resolve attachment indexing +- zink: don't add VK_IMAGE_USAGE_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT for transient images +- zink: don't append msrtss to dynamic render if not supported +- zink: set msrtss depth resolve mode when enabled +- zink: hook up VK_KHR_workgroup_memory_explicit_layout +- zink: propagate have_workgroup_memory_explicit_layout to ntv +- zink: use SPV_KHR_workgroup_memory_explicit_layout when available +- zink: add more locking for pipeline cache +- zink: add VK_PIPELINE_CACHE_CREATE_EXTERNALLY_SYNCHRONIZED_BIT_EXT +- aux/trace: fix winsys handle dumping +- zink: generated tcs is on the tes, not the vs +- zink: apply ZINK_DEBUG=noopt to linked separate shaders +- gallivm: handle A8_UNORM image stores +- llvmpipe: enable A8_UNORM for shader images +- llvmpipe: export PIPE_CAP_IMAGE_LOAD_FORMATTED +- lavapipe: GetRenderingAreaGranularityKHR +- llvmpipe: block weird uses of subsampled formats in buffers +- llvmpipe: fix early depth + alpha2coverage + occlusion query interaction +- lavapipe: fix BindVertexBuffers2 buffer size handling +- lavapipe: fix resolves where src image has a layer offset +- lavapipe: block yuv formats from getting blit feature flags +- lavapipe: BindIndexBuffer2 +- lavapipe: GetDeviceImageSubresourceLayoutKHR +- lavapipe: VK_REMAINING_ARRAY_LAYERS for copy ops +- lavapipe: maintenance5 +- zink: fix xfb buffer array sizing to use buffer limit, not output +- zink: move ZINK_DEBUG=nir printing to just before compile +- draw: fix so debug offset printing +- zink: reindex ssa defs before dumping debug shaders +- lavapipe: zero-init pipe_sampler_state +- zink: explicitly set non-optimal last_vertex_stage shader key on ctx create +- zink: fix big tcs output io +- zink: don't try to replace separate shader prog in noopt mode +- zink: pre-convert mode in fixup_io_locations +- zink: add a special separate shader i/o mode for legacy variables +- nir: minor fixes for io_to_scalar +- nir/lower_io: add a new doubles-only 64bit lowering option +- nir: add a filter cb to lower_io_to_scalar +- d3d10umd: use cso_context to set vertex buffers and elements +- virgl: move virgl_vertex_elements_state to header +- virgl: fix some indentation +- nouveau: calloc vertex csos +- gallium: move vertex stride to CSO +- zink: fix null config screen creation +- zink: fix crash in lower_pv_mode_gs_store +- u/draw: skip zero-sized indirect draws +- lavapipe: handle VkPipelineCreateFlagBits2KHR +- lavapipe: handle VkBufferUsageFlags2KHR +- zink: ci updates +- zink: track start/stop of a couple query types +- zink: require EDS1 for CWE usage +- zink: unset primgen suspended flag when ending a primgen query +- zink: rework rast-discard for primgen queries +- zink: rip out some awkward parts of the old non-cwe path +- zink: drop CWE requirement for renderpass tracking with primgen queries +- nir/zink: fix gs emulation xfb_info sizing +- zink: move fragcolor lowering further along the compile process +- zink: add a mode param to find_var_with_location_frac +- zink: use lowered io (kinda) for i/o vars +- zink: stop lowering indirect derefs +- ntt: handle interp intrinsics as derefs +- zink: delete split_blocks pass +- zink: delete lower_64bit_vertex_attribs pass +- zink: fix clip/cull dist xfb inlining +- zink: delete all the extra gross xfb handling +- zink: stop using pipe_stream_output +- zink: remove pipe_stream_output from function params +- zink: ci updates +- aux/trace: print bindless handles as pointers +- zink: remove unused param from create_ici +- zink: split create_ici to init and eval +- zink: add maintenance extensions to profile +- zink: use maintenance5 +- zink: use real A8_UNORM when possible +- vk/graphics: fix CWE handling with DS3 +- Revert "vk/wsi/x11: handle geometry updating more asynchronously" +- r600: store the mask of buffers used by a vertex state +- r600: better tracking for vertex buffer emission +- zink: wait on async fence during ctx program removal +- zink: handle patch variable locations for separate shaders better +- zink: don't start multiple cache jobs for the same program +- zink: use the "set" optimal key for prog last_variant_hash for consistency +- zink: sanitize optimal keys +- zink: copy some cs shader properties to the program struct +- zink: handle global atomic intrinsics +- zink: use Aligned with global load/store ops +- zink: fix rewrite_read_as_0 filtering +- rusticl: fixes for zink shader images +- zink: pass KERNEL shaders through successfully +- zink: add a618 flake +- zink: break out ds3 state resetting +- zink: be consistent with ds3 state resetting for blits +- zink: fix optimal_keys warning message +- zink: force-reset unordered flags for buffer barriers on non-matching batch access +- zink: reset unordered flags for image barriers on non-matching batch access +- zink: make image barrier init functions void return +- zink: simplify some image barrier conditionals +- zink: remove sync TODO +- zink: add lavapipe flake +- ci: disable nouveau shaderdb +- egl/dri3: only set driver_name if not already set +- egl: call dri3_x11_connect() for zink +- egl: bind dri2_set_WL_bind_wayland_display for zink when necessary +- zink: be more precise about flagging rp changes around unordered u_blitter +- zink: don't block reordering during ref updates in unordered blits +- lavapipe: update vbo indices before propagating stride +- lavapipe: fix pipeline stride propagation +- zink: fix linear modifier dmabuf imports +- zink: polaris ci updates +- aux/tc: handle stride mismatch during rp-optimized subdata +- zink: always add a per-prog ref for gpl libs +- zink: use a pointer to simplify submit struct mechanics +- zink: make zink_resource_image_barrier2_init public +- zink: add a third submitinfo (unused for now) +- zink: make submitinfo handling easier to manage with enum +- zink: add another submitinfo for fd semaphore waits +- zink: add a screen cache for fd semaphores +- zink: add a util for getting cached fd semaphores +- zink: hook up cached fd semaphore usage for batch signal/waits +- zink: handle implicit sync for dmabufs +- zink: handle multi-plane implicit sync +- zink: ci updates +- zink: set is_xfb=false for all i/o variables +- zink: reorder bindless io lowering +- zink: fix typing on bindless io lowering +- zink: delete some bindless io lowering code +- zink: use nir_io_semantics::num_slots for indirect var creation +- zink: simplify an arrayed io check during variable creation +- zink: use explicit stride from types instead of copying old_var stride +- zink: use MAX_PATCH_VERTICES directly for arrayed io var sizing +- zink: use explicit sizing for builtins when creating variables +- zink: create new vars without copying existing ones +- zink: add a new linker pass to handle mismatched i/o components +- zink: use right function to get src_type in eliminate_io_wrmasks +- zink: re-rework i/o variable handling to make having variables entirely optional +- ci: bump VVL to 1.3.263 +- zink: simplify redundant is_buffer check +- zink: use VkFormatProperties3 +- lavapipe: handle VkHostImageCopyDevicePerformanceQueryEXT +- lavapipe: don't advertise UNDEFINED layout for HIC +- zink: hook up VK_EXT_host_image_copy +- zink: move mem type detection up in file +- zink: disable HIC without resizable BAR +- zink: add a fixup method for extra driver props +- zink: fix some off-by-one indentation +- zink: use some return codes for check_ici errors +- zink: check/use suboptimal HIC during ici init +- zink: use HIC for image subdata when possible +- zink: slightly refactor psiz deletion during linking +- zink: delete all psiz=1.0 stores if maintenance5 is present +- nir/inline_uniforms: fix oob access with nir_find_inlinable_uniforms +- zink: add ZINK_DEBUG=quiet +- zink: imply ZINK_DEBUG=quiet if ZINK_DEBUG=optimal_keys is set on turnip +- zink: set optimal_keys for turnip jobs +- aux/tc: fix staging buffer sizing for texture_subdata +- aux/tc: fix address calc for segmented texture subdata +- zink: ci updates +- lavapipe: KHR_map_memory2 +- zink: slightly refactor pipeline compile selection +- zink: add a flag for combined pipeline compile for doing FAIL_ON_PIPELINE_COMPILE_REQUIRED +- zink: remove an intermediate variable in pipeline compile selection +- zink: use FAIL_ON_PIPELINE_COMPILE_REQUIRED for GPL path +- zink: pass a stage mask to pipeline create functions +- glsl: check for xfb setting xfb info +- zink: don't warn about missing scalarBlockLayout on v3dv +- aux/tc: fix renderpass tracking fb state clobber scenario +- vk/enum2str: add more max enum vendors +- aux/tc: fix rp info handling around tc_sync calls +- aux/tc: don't use pipe_buffer_create_with_data() for rp-optimized subdata +- zink: flag db maps as unsynchronized +- lavapipe: clamp cache uuid size +- lavapipe: EXT_load_store_op_none +- tu: handle unused color attachments without crashing +- zink: use much bigger dummy surfaces +- zink: propagate rp_tc_info_updated across unordered blits +- zink: use null attachments for null attachments with dynamic render +- egl/swrast: expose EXT_swap_buffers_with_damage and EXT_present_opaque +- egl/wayland: split out wl drm extension init +- egl/wayland: use more registry listeners to better handle device init +- egl/wayland: enable WL_bind_wayland_display for zink +- zink: delete injected pointsize during shader creation +- zink: require maintenance5 for shobj +- zink: delete a non-maintenance5 workaround for shobj use +- lavapipe: set separate_shaders for shader objects +- zink: set workgroup_memory_explicit_layout for shader validation +- zink: add a ZINK_DEBUG=validation alias +- zink: fix semaphore signal ordering +- zink: move swapchain fence to swapchain object +- zink: avoid UAF on wayland async present with to-be-retired swapchain +- zink: always trace_screen_unwrap in acquire +- lavapipe: fix variable descriptor count support handling +- lavapipe: always set independent blend +- lavapipe: more vertex stride fixups +- lavapipe: set default viewport and scissor count for cmdbufs +- lavapipe: set default min sample shading to 1 +- glx: XFree visual info +- radv: fix external handle type queries for dmabuf/fd +- zink: fix crashing in image rebinds +- zink: move push descriptor disable to driver workarounds +- zink: move v3dv scalarBlockLayout workaround +- zink: fix end-of-batch barrier pipeline stages +- zink: guarantee egl syncobj lifetime +- aux/trace: dump enum names for map usage +- gallium: add PIPE_MAP_NONE +- Revert "egl/wayland: Add image loader extension for swrast" +- egl/wayland: don't block in swrast when updating buffers for zink +- egl/wayland: return sooner from swrast_update_buffers() if zink +- zink: don't check submit count for unflushed usage +- egl: don't set ForceSoftware for all zink loading +- zink: error at handle export on missing EXT_image_drm_format_modifier +- gbm: delete some zink handling +- zink: apply ZINK_DEBUG=quiet to all missing feature warnings +- zink: set ZINK_DEBUG=quiet for polaris jobs +- lavapipe: don't block begin/end cmdbuf pipeline barriers +- ci: add a630 trace flakes +- zink: shrink vectors during optimization +- zink: always clamp shader stage in descriptor handling +- zink: add set_global_binding +- zink: eliminate samplers from no-sampler CL texops +- zink: add some checks to determine whether queue is init on screen destroy +- zink: don't destroy any simple_mtx_t objects during screen destroy +- zink: don't destroy uninitialized disk cache thread +- zink: reorder glsl_type_singleton_init_or_ref call +- zink: use screen destructor for creation fails +- zink: fix readback_present locking +- zink: add automatic swapchain readback using heuristics +- lavapipe: VK_EXT_nested_command_buffer +- zink: ignore unacquired swapchain images during end-of-frame flush +- nir/lower_fragcolor: preserve location_frac +- zink: update pointer for GPL pipeline cache entry formats +- zink: fix legacy depth texture rewriting for single component reads +- egl: unify dri2_egl_display creation +- egl: init dri3 version info during screen creation +- egl/glx: don't load non-sw zink without dri3 support +- egl: add automatic zink fallback loading between hw and sw drivers +- glx: add automatic zink fallback loading between hw and sw drivers +- ci: don't set GALLIUM_DRIVER for zink +- egl/wayland: only add more registry listeners for hardware devices +- zink: only increment image_rebind_counter on image export if binds exist +- zink: check for sampler view existence during zink_rebind_all_images() +- zink: use weston for anv ci +- zink: blow up broken xservers more reliably +- zink: delete some dead modifier handling +- ci: skip implicit modifier piglits for zink +- zink: don't block large vram allocations +- zink: add copy box locking +- zink: emit SpvCapabilitySampleRateShading with SampleId +- zink: always set VK_EXTERNAL_MEMORY_HANDLE_TYPE_HOST_ALLOCATION_BIT_EXT for usermem +- zink: clamp resolve extents to src/dst geometry +- zink: only emit xfb execution mode for last vertex stage +- aux/u_transfer_helper: set rendertarget bind for msaa staging resource +- zink: unset explicit_xfb_buffer for non-xfb shaders +- mesa/st/texture: match width+height for texture downloads of cube textures +- zink: add more locking for compute pipelines +- radv: correctly return oom from the device when failing to create a cs +- zink: check for cbuf0 writes before setting A2C + +Mohamed Ahmed (19): + +- vulkan/util: Support 10-bit and 12-bit color formats in ycbcr_info in vk_format.c +- vulkan/util: Support VK_EXT_ycbcr_2plane_444_formats color formats in vk_format.c +- vulkan/util: Use ycbcr_info for multiplane helpers in vk_format.c +- nvk: implement vkGetDeviceImageMemoryRequirementsKHR() +- nvk: add stub for vkGetDeviceImageSparseMemoryRequirementsKHR() +- nvk: implement vkGetDeviceBufferMemoryRequirementsKHR() +- nvk: advertise VK_KHR_maintenance4 +- nvk: advertise DemoteToHelperInvocation +- nvk: Enable multiplane images and image views +- nouveau/nvk: Add YCbCr sampler NIR lowering pass +- nouveau/nvk: Support multi-plane descriptors in nvk_nir_lower_descriptors.c +- nouveau/nvk: Create helper function for sampler creation +- nouveau/nvk: Add multiple sampler planes for CONVERSION_SEPARATE_RECONSTRUCTION_FILTER_BIT +- nouveau/nvk: Enable VK_KHR_sampler_ycbcr +- util/format: Add G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM formats +- vulkan/format: Translate G8B8G8R8_422_UNORM and B8G8R8G8_422_UNORM properly +- nvk: Enable SEPARATE_RECONSTRUCTION_FILTER_BIT for multi-planar formats only +- nvk: Enable MIDPOINT_CHROMA_SAMPLES_BIT for multi-planar formats only +- nil: Add support for G8B8_G8R8_UNORM and B8G8_R8G8_UNORM + +Nanley Chery (33): + +- iris: Remap DRM_FORMAT_MOD_INVALID more often during import +- anv: Don't support ASTC images with modifiers +- intel: Add and use isl_drm_modifier_get_plane_count +- anv: Handle explicit surface layout of DG2_RC_CCS +- anv: Reduce accesses of isl_mod_info->aux_usage +- iris: Reduce accesses of mod_info->aux_usage +- crocus: Delete modifier with aux code +- hasvk: Delete modifier with aux code +- iris: Swap stencil and modifier aux assignment order +- intel: Describe modifier compression with booleans +- intel/isl: Move the Tile4 modifier score case down +- intel/isl: Add a score for DG2_RC_CCS +- intel/blorp: Ambiguate after CCS resolves on gfx7-8 +- iris: Reorder render_aux_usage parameters +- iris: Pass the render format to prepare_render +- iris: Create BLORP surfaces after resource preparation +- iris: Handle clear color compatibility in prepare_render +- iris: Sample more texture view fast-clears on gfx11+ +- iris: Fix aux usage tracking in prepare_render +- iris: Fix iris_copy_region calls involving FCV_CCS_E +- iris: Drop get_copy_region_aux_settings +- iris: Inline iris_can_sample_mcs_with_clear +- anv: Initialize the clear color more often for FCV +- intel: Return a bool from intel_aux_map_add_mapping +- anv: Move scope of CCS binding determination +- anv: Allocate space for aux-map CCS in image bindings +- anv: Wrap aux surface image binding queries +- anv: Refactor CCS disabling at image bind time +- anv: Place images into the aux-map when safe to do so +- anv: Loosen anv_bo_allows_aux_map +- anv: Meet CCS alignment reqs with dedicated allocs +- anv: Delete implicit CCS code +- intel/isl: Add scores for GEN12_RC_CCS and MTL_RC_CCS + +Neal Gompa (1): + +- asahi: Fix 32-bit x86 build with correct data type for overflow error message + +Neha Bhende (1): + +- ntt: lower indirect tesslevels in ntt + +Paul Gofman (2): + +- driconf: add a workaround for Captain Lycop: Invasion of the Heters +- driconf: add a workaround for Rainbow Six Extraction + +Paulo Zanoni (15): + +- anv: rename the vm_bind vfuncs +- anv: add a new vm_bind vfunc +- anv/xe: make vm_binds async +- anv/xe: return failure in case waiting for the vm_bind syncobj fails +- anv: remove misleading comment about batch_len +- iris: assert bufmgr->bo_deps_lock is held +- iris: avoid stack overflow in iris_bo_wait_syncobj() +- iris: assert(bo->deps) after realloc() +- intel/isl: add ISL_SURF_USAGE_SPARSE_BIT +- intel/isl: simplify the check for maximum surface size +- anv/sparse: add the initial code for Sparse Resources +- anv/sparse: get ready to issue a single vm_bind ioctl per non-opaque bind +- anv/sparse: add INTEL_DEBUG=sparse +- anv: enable sparse resources by default +- vulkan: fix potential memory leak in create_rect_list_pipeline() + +Pavel Ondračka (44): + +- r300: update RV370 failures +- r300: check for index overflow when translating from TGSI +- r300: source register index is always unsigned +- r300: bump the RC_MAX_INDEX_BITS +- r300: normal instruction can't have presubtract op +- r300: add a helper for checking number of temporary sources +- r300: cycles estimate for shader-db +- r300: fix cycles calculation +- r300: don't abort on flow control when using draw for vs +- r300: add dEQP baseline for RV370 with forced swtcl +- r300: copy ntt to r300 compiler +- r300: add lower_sqrt to nir option +- r300: remove unused intrinsics in ntr +- r300: remove irrelevant opcodes in ntr +- r300: remove unused integer support in ntr +- r300: remove ntr_tgsi_usage_mask +- r300: remove more unused 64-bit pieces from ntr +- r300: simplify vectorization rules +- r300: remove more ntr unused helpers +- r300: remove the unneeded ntr_lower_vec_to_reg callback +- r300: remove unneeded 64bit and atomic lowering passes +- r300: remove unused ntr default settings +- r300: remove ntr default options +- r300: simplify ntr_emit_load_ubo +- r300: simplify ntr_emit_load_input +- r300: remove some virglrenderer specifics from ntr +- r300: simplify ntr_setup_uniforms +- r300: simplify ntr_output_decl +- r300: simplify ntr_try_store_in_tgsi_output +- r300: remove some unsupported texture opcodes +- r300: remove unused barrier code from ntr +- r300: simplify ntr_get_gl_varying_semantic +- r300: remove the nrt main optimization loop +- r300: reorder for easier presubtract 1-x pattern recognition +- r300: exit early in presubtract is not supported +- r300: implement bias presubtract +- r300: convert x * 2 into x + x for presubtract +- r300: move power of two multipliers down +- r300: there is no limitation on presubtract source file +- r300: use w channel for scalar opcodes if possible +- r300: reduce number of iterations for vertex shader loops +- r300: enable nir_move_vec_src_uses_to_dest +- nir/move_vec_src_uses_to_dest: skip reuse if vec is used only once in store_output +- nir/move_vec_src_uses_to_dest: allow to skip reuse of constant sources + +Philipp Zabel (1): + +- etnaviv: fix segfault after compile failure + +Pierre-Eric Pelloux-Prayer (18): + +- radeonsi/sdma: use multiple commands if required +- radv/sdma: use multiple commands if required +- radv/sdma: use correct limits for gfx10.3 +- glx: drop the 'libGL' log prefix +- loader: refactor DRI_PRIME handling code +- loader: extend DRI_PRIME to support =N +- loader: add DRI_PRIME_DEBUG env var +- device_select_layer: support DRI_PRIME=n +- docs: update DRI_PRIME documentation +- device_select: add shortcut for MESA_VK_DEVICE_SELECT_FORCE_DEFAULT_DEVICE +- st/mesa: check renderbuffer before using it +- radeonsi: emit framebuffer state after allocating cmask +- amd/common: update addrlib for gfx11.5 +- amd/common: add registers for gfx11.5 +- ac/nir: extract must_wait_attr_ring helper +- amd, radeonsi: Add code to enable gfx11.5 +- mesa: restore call to _mesa_set_varying_vp_inputs from set_vertex_processing_mode +- radeonsi: check sctx->tess_rings is valid before using it + +Piotr Kocia (2): + +- nir: Remove dead nir_const_value variables +- glsl: ir_function_param_visitor::visit_enter always true condition + +Qiang Yu (77): + +- aco,radv: replace tess_input_vertices shader info param +- radeonsi: aco does not pass LS outputs to HS by arg +- radeonsi: extract si_get_prev_stage_nir_shader to be shared with aco +- radeonsi: init aco shader info for merged LS/HS +- radeonsi: simplify si_build_wrapper_function +- radeonsi: move vertex shader vb desc input sgpr args to last +- radeonsi: remove param type check in wrapper function +- radeonsi: refine si_llvm_ls_build_end +- radeonsi: refine si_llvm_es_build_end +- radeonsi: aco compile support merged mono shader +- radeonsi: calculate lds size for merged shaders +- radeonsi: enable aco compile for mono merged LS/HS +- radeonsi: enable aco compile for mono merged ES/GS +- aco: extract aco_compile_shader_part from aco_compile_ps_epilog +- aco: add p_end_with_regs pseudo instruction +- aco: move jump to epilog out of ic_merged_wave_info +- aco: add tcs end regs for epilog usage +- aco: allow tcs with epilog to keep nir store output instruction +- aco: add pending_lds_access option for insert waitcnt +- aco: add tcs epilog generation for radeonsi +- aco: don't emit s_endpgm for tcs with epilog +- aco: skip scratch init when no scratch arg provide +- aco,radeonsi: save const addr to symbol +- ac/nir/tess: move tess factor output out of control flow +- aco: use semantic location as io temp index +- radeonsi: add exec_size to shader binary +- radeonsi: support upload multi part shader binary +- radeonsi: share si_get_tcs_out_patch_stride with aco +- radeonsi: fill part mode tcs aco shader info +- radeonsi: extract si_llvm_build_shader_part +- radeonsi: remove separate_prolog arg from prolog/epilog build +- radeonsi: add si_get_tcs_epilog_args +- radeonsi: change si_fill_aco_options args +- radeonsi: add si_aco_build_shader_part +- radeonsi: part mode standalone tcs support aco compile +- radeonsi: remove unused arg of get_tcs_tes_buffer_address +- aco: simplify setup_tcs_info +- aco: pass sw_stage when setup_isel_context +- aco: prepare fix_ls_vgpr_init_bug to be used by gl vs prolog +- aco: add vs prolog instruction selection for radeonsi +- aco: add aco compile interface for radeonsi vs prolog +- aco: do not fix_exports when program is prolog +- radeonsi: fill aco_shader_info->is_monolithic +- radeonsi: remove is_monolithic from vs prolog key +- radeonsi: extract si_get_vs_prolog_args to be shared with aco +- radeonsi: fix aco options has_ls_vgpr_init_bug setup +- radeonsi: add vs prolog aco build +- radeonsi: set vs has prolog aco shader info +- radeonsi: enable aco compile for part mode standalone vs +- aco,radv,radeonsi: rename is_monolithic to merged_shader_compiled_separately +- ac,radeonsi: move ps arg pos_fixed_pt to ac_shader_args +- aco: do not eliminate final exec write when p_end_with_regs block +- aco: remove p_end_with_regs from needs_exact() +- aco: add ps prolog generation for radeonsi +- aco: handle ps outputs from radeonsi +- aco: add create_fs_end_for_epilog for radeonsi +- aco,radv: remove unused ps epilog info fields +- aco,radv: rename ps epilog info inputs to colors +- aco: simplify export_fs_mrt_color +- aco,radv: add radeonsi spec ps epilog code +- aco: compact ps expilog color export for radeonsi +- aco,radv,radeonsi: pass spi ps input ena and addr +- aco: do not fix_exports when program has epilog +- aco: fix assertion fail when program contains empty block +- aco: create exit block for p_end_with_regs to branch to +- aco: wait memory ops done before go to next shader part +- radeonsi: reduce sgpr count for scratch_offset when aco +- radeonsi: init spi_ps_input_addr for part mode ps +- radeonsi: extract si_prolog_get_internal_binding_slot +- radeonsi: extract si_get_ps_prolog_args to be shared with aco +- ac,radeonsi: remove unused ps prolog key fields +- radeonsi: add ps prolog shader part build +- radeonsi: extract si_get_ps_epilog_args to be shared with aco +- radeonsi: fill aco shader info for ps part +- radeonsi: add ps epilog shader part build +- radeonsi: enable aco compile for part mode ps +- radeonsi: disable disk cache when use aco + +Rebecca Mckeever (32): + +- vulkan/runtime: Add helper functions for VK_EXT_host_image_copy +- nouveau/codegen: Support nir_intrinsic_load_workgroup_id_zero_base +- nouveau/codegen: Set lower_device_index_to_zero +- nvk: Convert system values for gl_PointCoord and PointCoord into inputs +- nvk: Add base_group to root descriptor table +- nvk: Lower base_workgroup_id +- nvk: Implement nvk_CmdDispatchBase and delete nvk_CmdDispatch +- nvk: Advertise KHR_device_group +- nvk: Add VK_FORMAT_B4G4R4A4_UNORM_PACK16 format to nil_format_info table +- nvk: Add A4B4G4R4 formats to nil_format_info table +- nvk: Advertise EXT_4444_formats +- nvk: Enable shadow sampling +- nvk: Implement VK_EXT_non_seamless_cube_map +- nouveau/nil: Add macros for ufixed +- nvk: Implement VK_EXT_image_view_min_lod +- nvk: Update mutable descriptor struct type +- nvk: Replace asserts with conditional that sets type_list = NULL +- nvk: Implement nvk_GetDescriptorSetLayoutSupport +- nvk: Enable VK_KHR_maintenance3 +- nvk: Advertise VK_EXT_mutable_descriptor_type +- nvk: Set image index to zero for NULL nvk_buffer_view +- nvk: Advertise VK_EXT_image_robustness +- nvk: Advertise VK_EXT_robustness2 +- nvk: Add view_index to root descriptor table +- nvk: Lower nir_intrinsic_load_view_index +- nvk: Add draw support for multiview +- nvk: Add query support for multiview +- nvk: Add input attachments support for multiview +- nvk: Advertise VK_KHR_multiview +- nvk: Load view_mask to shadow scratch in nvk_CmdBeginRendering +- nvk: Combine CLEAR_VIEWS and CLEAR_LAYERS MME macros +- nvk: Move code inside view mask loops to a helper function + +Rhys Perry (89): + +- ac/llvm: fix AC_TM_CHECK_IR +- radv: fix radv_get_ballot_bit_size with CS +- ac/llvm: fix wave32 ac_build_mbcnt_add with 64-bit mask +- ac/llvm: skip ballot zext for 32-bit dest with wave32-as-wave64 +- radv: add conformant_trunc_coord to cache UUID +- radv: don't unset TRUNC_COORD if conformant_trunc_coord=true +- ac/nir: always round cube array layers +- nir/unsigned_upper_bound: fix phi(bcsel) +- nir/tests: add test for unsigned_upper_bound with loop header phis +- nir/opt_dead_cf: remove nodes after a jump earlier +- nir/tests: add nir_opt_dead_cf_test.jump_before_constant_if +- aco: insert s_nop before VGPR deallocation +- nir/lower_shader_calls: vectorize stack access for all shaders +- radv: workaround WWZ exporting index=1 through location=1 +- radv: correctly skip MRT output NaN fixup for meta shaders +- radv: don't set vertex_attribute_strides on GFX8+ +- radv/ci: skip some mesh shader tests on GFX1100 +- aco: summarize register demand after handling branches +- aco: don't create sendmsg(dealloc_vgprs) if scratch is used +- radv: disable 64-bit color attachments +- radv: fix 128bpp comp-to-single clears +- radv: support 128bpp comp-to-single with all colors +- radv/gfx11: re-enable 0001/1110 clear values +- nir/lower_shader_calls: fix align_offset +- nir/opt_load_store_vectorize: support scratch access +- radv: vectorize RT stack access +- radv: vectorize scratch access +- aco: fix p_bpermute_gfx6 with input at non-zero byte +- aco: fix p_bpermute_gfx6's exec save/restore with wave32 +- aco: clarify bpermute pseudo opcode names +- aco: add adjust_bpermute_dst helper +- aco/spill: skip p_branch in process_block +- aco/spill: add all live-in to merge block spill candidates +- nir/lower_system_values change num_workgroups to uint32_t +- radv: optimize mesh workgroup ID using ts_mesh_dispatch_dimensions +- radv: use shortcut_1d_workgroup_id +- aco: remove fast path in insert_exec_mask's process_instructions +- aco/optimizer_postRA: check overwritten_subdword in is_overwritten_since() +- aco: check logical_phi_info at p_logical_end when eliminating exec writes +- aco: remove unused p_logical_end check when optimizing branching sequence +- radv: disable mesh dispatch XYZ_DIM when possible +- nir/deref: remove rematerialize_deref_in_block cache +- aco: reset prefetch in the correct block after removing the exit +- aco/waitcnt: replace wait_cnt::\*_cnt with booleans +- aco/waitcnt: add print helpers +- nir/lower_int64: fix find_lsb(0) +- nir/algebraic: optimize u2u32(a >> 32) +- aco/optimizer_postRA: don't combine DPP across exec on GFX8/9 +- aco: don't combine DPP into v_cmpx +- aco: disable zero offset optimization for strict WQM coords +- nir/constant_folding: remove zero texel offset +- aco: remove zero offset optimization +- aco: shrink DPP8_instruction +- aco: add fetch_inactive field to DPP instructions +- nir: add fetch inactive index to quad_swizzle_amd/masked_swizzle_amd +- aco: disable FI for quad/masked swizzle +- aco: fix LdsDirectVMEMHazard WaW with the wrong waitcnt +- aco: only mitigate VcmpxExecWARHazard when necessary +- aco: fix s_setreg hazards +- aco: consider exec_hi in reads_exec() +- aco: resolve all possible hazards at the end of shader parts +- aco/tests: test that hazards are resolved at the end of shader parts +- radv: skip zero-sized memcpy +- ac/nir: fix out-of-bounds access in ac_nir_export_position +- radv: fix signed integer overflow +- Revert "radv: pre-init surface info" +- nir: improve ms_cross_invocation_output_access with local_invocation_id +- aco,nir: add export_row_amd intrinsic +- ac/nir: add row parameter to helpers +- ac/nir: remove dead code +- ac/nir: refactor mesh vertex/primitive export +- ac/nir: implement mesh shader gs_fast_launch=2 +- ac/nir: optimize mesh shader local_invocation_index +- radv: implement mesh shader gs_fast_launch=2 +- ac/nir: add emit_ms_outputs helper +- ac/nir,radv: pass workgroup size to ac_nir_lower_ngg_ms +- ac/nir: implement mesh shader multi-row export +- radv: implement mesh shader multi-row export +- radv: enable mesh shader gs_fast_launch=2 and multi-row export +- nir/serialize: fix signed integer overflow +- nir/lower_shader_calls: skip zero-sized qsort +- util: skip zero-sized SHA1Update +- radv: call lower_array_deref_of_vec before lower_io_arrays_to_elements +- radv: skip radv_remove_varyings for mesh shaders +- radv: disable gs_fast_launch=2 by default +- docs: fix RADV_THREAD_TRACE_CACHE_COUNTERS default +- radv: add radv_disable_trunc_coord option +- radv: enable radv_disable_trunc_coord for vkd3d-proton/DXVK +- ac/nir: fix partial mesh shader output writes on GFX11 + +Rob Clark (60): + +- freedreno: move virtgpu msm_proto.h to common +- freedreno/drm/virtio: Remove unused header +- tu/msm: staticify a couple things +- tu/knl: Remove some random const'ness +- drm-uapi: Update virtgpu header +- freedreno: Update virtgpu proto +- freedreno/drm/virtio: Use global_faults +- tu: close submitqueues before device_finish() +- tu/drm: Factor out shared helpers +- tu/drm: Add missing error path cleanup +- tu/drm: Split out helper for iova alloc +- tu: Add virtgpu support +- util: Decouple disk cache from EGL_ANDROID_blob_cache +- docs: Followup to !24636 +- tu: Workaround bionic _SC_LEVEL1_DCACHE_LINESIZE +- ir3+tu: Simplify ir3_find_sysval_regid callers +- freedreno/a6xx: Drop unused screen args +- freedreno/a6xx: Re-work fd6_emit_shader +- freedreno/a6xx: Re-write the function-of-doom +- freedreno: Implement ATI_meminfo +- freedreno/a6xx: ARB_post_depth_coverage +- freedreno/a6xx: ARB_sample_locations +- freedreno/a6xx: ARB_texture_filter_minmax +- freedreno/a6xx: EXT_demote_to_helper_invocation +- freedreno/a6xx: EXT_shader_image_load_formatted +- freedreno/a6xx: EXT_depth_bounds_test +- freedreno/a6xx: Use pipe_blit_info::sample0_only +- freedreno/a6xx: Handle PIPE_BIND_BLENDABLE +- freedreno/a6xx: ARB_shader_viewport_layer_array +- tu: Fix heap size +- freedreno: Fix crash with debug msgs enabled +- freedreno/layout: Handle 565/etc MSAA special case +- freedreno/decode: Fix printing chip-id +- freedreno/a6xx: Add L8_SRGB +- freedreno: Add reformatting commits to .git-blame-ignore-revs +- freedreno/fence: Hold a strong ref to batch +- freedreno/decode: Lookup device info +- freedreno/decode: Use info->chip to decode +- freedreno/decode: Remove gpu_id +- freedreno: Indentation fix +- freedreno: Use explicit QCOM_TILED3 modifier +- freedreno/a6xx: Remove dummy packet for globals +- freedreno: Fix streamout offset_buf dirtiness +- freedreno: Fix user const buffer dirtiness +- freedreno/batch: Move query_buf allocation +- freedreno: Add private-BO tracking +- freedreno: Add missing indirect_draw_count tracking +- freedreno: Move/add some attach_bo() +- freedreno: Add attach-bo debugging +- freedreno: Rework supported-modifiers handling +- mesa: Introduce MESA_texture_const_bandwidth +- mesa: Implement MESA_texture_const_bandwidth +- freedreno: Add PIPE_CAP_HAS_CONST_BW support +- panfrost: Add PIPE_CAP_HAS_CONST_BW support +- iris: Add PIPE_CAP_HAS_CONST_BW support +- radeonsi: Add PIPE_CAP_HAS_CONST_BW support +- tu/msm: Fix timeline semaphore support +- tu/virtio: Fix timeline semaphore support +- freedreno/drm: Fix race in zombie import +- freedreno: Always attach bo to submit + +Robert Foss (9): + +- egl: Expose access to DeviceList +- egl: Rename _eglRefreshDeviceList() to _eglDeviceRefreshList() +- egl: Refresh DeviceList during eglInitialize() +- egl/surfaceless: Use EGL DeviceList instead of drmGetDevices2() +- egl/android: Use EGL DeviceList instead drmGetDevices2() +- egl: Rename _eglAddDevice() to _eglFindDevice() +- egl: Rename _eglAddDevice() to _eglFindDevice() +- egl: Fix attrib_list[0] == EGL_NONE check +- egl: Always set _EGLDisplay->Device during eglGetPlatformDisplay() + +Robert Mader (6): + +- egl/wayland: wait for compositor to release shm buffers +- iris: Support parameter queries for main planes +- util: Add new helpers for pipe resources +- panfrost: Support parameter queries for main planes +- vc4/resource: Support offset query for multi-planar planes +- v3d/resource: Support offset query for multi-planar planes + +Rohan Garg (33): + +- iris: migrate WA 14013910100 to use the WA framework +- iris: migrate WA 14016118574 to use the WA framework +- iris: fix iris for WA 16013000631 +- intel/perf: add perf query support for Intel Raptorlake +- intel/genxml: set a default value for "Pixel Position Offset Enable" in genxml +- anv: use the WA infrastructure where possible when generating state +- anv: use the correct GFX_VERx10 macro for WA +- anv,iris: program the maximum number of threads on compute queue init +- anv: drop CFE state validation checks +- iris: track reset signalling instead of replacing the context +- iris: allow for a unsynchronized device reset query +- anv: partially revert 2e8b1f6d +- anv: emitting 3DSTATE_PRIMITIVE_REPLICATION is required on Gen12+ +- anv: use the pre defined _3DPRIMITIVE_DIRECT macro +- anv: drop dead ifdef +- iris: use the correct WA macros and lineage numbers +- anv: use the lineage number for WA +- crocus: add a __gen_get_batch_address declaration +- crocus: fix GFX_VERx10 macro +- blorp: drop undefined macro +- iris: migrate preemption streamwout wa to WA infra +- intel/genxml: update PIPE_CONTROL instruction for dg2 +- anv: define clear color localy within can_fast_clear_color_att +- intel/compiler: Adjust CS payload registers for new register width on Xe2+ +- intel/compiler: Adjust fence message lengths for new register width on Xe2+ +- intel/compiler: Adjust barrier emission for Xe2+ +- intel/genxml: fix 3DSTATE_3D_MODE length to align with BSpec +- anv: ensure that FCV_CCS_E fast clears are properly tracked +- anv: enable FCV for Gen12.5 +- anv: fix debug string for PC flush +- anv: cleanup includes +- anv: turn off non zero fast clears for CCS_E +- anv: selectively enable FCV optimization for DG2 + +Roland Scheidegger (1): + +- lavapipe: further limit accurate_a0 hack + +Roman Stratiienko (22): + +- egl: android: Remove legacy name-based shared buffers support +- util: Add NONNULL macro +- android: Introduce the Android buffer info abstraction +- android: Fix num_planes assignment in u_gralloc_fallback +- v3dv/android: Use u_gralloc code +- v3dv/android: Enable shared presentable image support +- v3dv: Migrate to vk_device_memory +- v3dv/android: Skip swapchain binding +- v3dv: Rely on the internal tiled flag instead of the common vk structure +- v3dv/android: Add a helper function to support explicit layouts +- v3dv/android: Rework Android native buffer importing logic +- v3dv: Use format stored in vk_image and vk_image_view after init +- v3dv: Split v3dv_image_init to use layout setting logic separately +- v3dv/android: Add AHardwareBuffer support +- v3dv: Enable VK API v1.2 for Android +- panvk: Add Android ICD loader entry point +- u_gralloc: Remove inline modifiers from the functions +- u_gralloc: Remove usage of NONNULL macro +- Revert "util: Add NONNULL macro" +- u_gralloc: Add a function that returns gralloc type +- dri: Remove __driDriverExtensions leftovers +- v3d: Don't implicitly clear the content of the imported buffer + +Ruijing Dong (2): + +- frontends/va: checking va version for av1enc support +- radeonsi/vcn: change max_poc to fixed value for hevc encoder. + +Ryan Neph (1): + +- vulkan/android: add missed STACK_ARRAY_FINISH() + +Sagar Ghuge (34): + +- intel/compiler: Look at 2 register worth of data instead of 4 +- isl: Disable MCS compression just on ACM platform +- intel: Add env variable to add break point on/before draw +- anv: Add GPU breakpoint before/after specific draw call +- iris: Add GPU breakpoint before/after draw call +- blorp: Implement blorp hooks to emit breakpoint +- docs: Add INTEL_DEBUG_BKP_BEFORE/AFTER_DRAW_COUNT +- intel/isl: Enable INTEL_DEBUG=noccs/nohiz in ISL helpers +- anv,hasvk: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks +- iris,crocus: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks +- blorp: Drop unnecessary assertions in blorp_can_hiz_clear_depth +- anv: Add helper to create companion RCS command buffer +- anv: Split out End/Destroy/Reset cmd buffer code into helper +- anv: Handle companion RCS in end/destory/reset code path +- intel: Add helper to create/destroy i915 VM +- intel: Pass virtual memory address space ID while creating context +- anv: Create companion RCS engine +- anv: Move compute specfic bits under compute queue init +- anv: Execute RCS init batch on companion RCS context/engine +- anv: Setup companion RCS command buffer submission +- anv: Execute an empty batch to sync main and companion RCS batch +- anv: Add secondary companion RCS cmd buffer to primary +- anv: Skip layout transition on the compute queue +- anv: Extract batch print code to anv_print_batch helper +- iris: Enable always flush cache with DEBUG_STALL option +- intel/genxml: Add STATE_COMPUTE_MODE instruction +- anv: Program and emit STATE_COMPUTE_MODE +- anv: Enable barrier handling on video engines +- isl: Use 16-bit instead of 8-bits for surface format info fields +- anv: Handle end of pipe with MI_FLUSH_DW on transfer queue +- anv: Enable transfer queue only on ACM+ platforms +- blorp: Use the correct miptail start LOD for surfaces +- anv: Write timestamp using MI_FLUSH_DW on blitter +- anv: Flush data cache while clearing depth using HIZ_CCS_WT + +Saleemkhan Jamadar (1): + +- radeonsi/vcn: set jpeg reg version for gfx 1150 + +Samuel Holland (3): + +- Android.mk: Allow building only Vulkan drivers +- Android.mk: Explicitly enable/disable LLVM support +- Android.mk: Only link LLVM for radeonsi, not amd_vk + +Samuel Pitoiset (299): + +- radv: remove support for VK_INDIRECT_COMMANDS_TOKEN_TYPE_STATE_FLAGS_NV +- radv: make radv_get_pa_su_sc_mode_cntl() static +- zink/ci: update list of expected failures for NAVI10 +- radv: stop using a pipeline for emitting VGT_VERTEX_REUSE_BLOCK_CNTL +- radv: remove unused param in radv_pipeline_emit_vgt_gs_out() +- radv: pass a shaders array for computing ia_multi_vgt_param +- radv: bind the pre-compiled PS epilog to the cmdbuf state +- radv: stop using an array of binaries when compiling a compute shader +- radv: add radv_compile_cs() to compile a compute shader +- radv: remove the pipeline dependency for creating a GS copy shader +- radv: add a helper to compute the ESGS itemsize +- radv: use the number of GS linked inputs to compute the ESGS itemsize +- radv: determine ES info for VS/TES with GS earlier +- radv: determine as_ls earlier by using the next stage +- radv: simplify getting next VS stage for VS prologs +- radv: use next_stage for determining the stage to lower NGG +- radv/amdgpu: fix dumping CS with the chained IBs path +- radv/amdgpu: rename old_ib to ib in radv_amdgpu_winsys_cs_dump() +- radv: pass submit info to radv_check_gpu_hangs() +- radv: initialize stage/next_stage earlier +- radv: set next_stage to MESA_SHADER_NONE if there is no FS +- radv: rework considering force VRS without relying on graphics pipeline +- radv: stop passing radv_graphics_pipeline to radv_fill_shader_info() +- radv: move removing all varyings when the FS is a noop +- radv: rename graphics pipeline linking helpers +- radv: simplify lowering NGG GS intrinsics +- radv: rework determining the NGG stage without a graphics pipeline +- radv: cleanup pipeline compute emit helpers +- radv: rename radv_pipeline_stage to radv_shader_stage +- radv: rename NGG query state to be more generic +- radv: declare the shader query user SGPR for emulating GS counters +- radv: enable pipelinestat query emulation for legacy GS +- radv: simplify the NGG vs legacy pipelinestat query path +- radv: rename RADV_SHADER_QUERY_PIPELINE_STAT_OFFSET +- radv: implement nir_intrinsic_atomic_add_gs_invocation_count_amd +- radv: emulate GEOMETRY_SHADER_INVOCATIONS query on RDNA1-2 +- radv: track whether inputs/outputs are linked per shader stage +- radv: add support for VS/TES as ES without shaders IO linking +- radv: use next_stage to determine if the layer should be exported +- radv: use next stage to determine if primID/clip dist should be exported +- radv: compute the legacy GS info earlier +- radv: stop copying some NIR info fields from TES to TCS +- radv: stop lowering patch vertices for TES +- radv: do not always copy the number of tess patches to TES +- radv: initialize tcs.tes_{patch}_inputs_read to a default value +- radv: prevent linking TCS<->TES when TES is NULL +- radv: use a packed user SGPR for the TES state +- radv: stop checking if patch control points is dynamic everywhere +- radv: copy the number of TCS vertices out to TES shader info +- radv: add support for dynamic TCS vertices out for TES +- radv: remove radv_shader_info::tes::num_linked_patch_inputs +- amd,radeonsi: move si_shader_io_get_unique_index_patch() to common code +- radv: allow to use fixed IO locations for VS<->TCS<->TES without linking +- aco: add aco_shader_info::tcs::has_epilog +- aco: add infra for compiling TCS epilogs +- radv,aco: move has_epilog to radv_shader_info +- radv: assume a TCS needs an epilog unless it's linked with a TES +- radv: do not write tess factors in main TCS when it has an epilog +- radv: track if TES reads tess factors differently +- radv: declare new argument for the TCS epilog PC +- radv: add radv_tcs_epilog_key +- radv: add infra for creating TCS epilogs +- radv: add support for a TCS epilogs cache in the device +- radv: add support for emitting TCS epilogs in cmdbuf +- radv: remove unnecessary check in radv_pipeline_nir_to_asm() +- radv: stop passing a graphics pipeline to radv_pipeline_nir_to_asm() +- radv: inline radv_pipeline_get_nir() in radv_graphics_pipeline_compile() +- radv: add a struct for the retained shaders and GPL +- radv: add radv_graphics_shaders_compile() to compile graphics shaders +- radv: remove redundant check in radv_cmd_buffer_after_draw() +- radv: track if patch control points is dynamic from the cmdbuf state +- radv: re-emit binning state if the framebuffer is dirty +- radv: track if vertex binding stride is dynamic from the cmdbuf state +- vulkan: bump header register to 1.3.261 +- vulkan/runtime: add common implementation for GetImageSubresourceLayout() +- vulkan/format: add VK_FORMAT_{A8_UNORM,A1B5G5R5_UNORM_PACK16}_KHR +- radv: use the RT prolog scratch size directly for tracing rays +- radv: add a helper to get the maximum number of scratch waves per shader +- radv: update the number of scratch waves for RT prolog at bind time +- radv: update cmdbuf scratch size info when shaders are bound +- vulkan: add init/finish helpers for vk_buffer_view +- radv: use vk_buffer_view +- radv: use vk_sampler +- radv: use common vkCmdBegin/EndQuery wrappers +- radv: use vk_query +- zink: fix setting VkShaderCreateInfoEXT::nextStage +- radv/rt: fix capture/replay support +- vulkan/render_pass: add common vkGetRenderingAreaGranularityKHR() +- radv: implement vkCmdBindIndexBuffer2KHR() +- radv: allow VK_WHOLE_SIZE for pSizes in vkCmdBindVertexBuffers2() +- radv/rmv: remove unused pipeline create flags when logging pipelines +- radv: store pipeline create flags to radv_pipeline::create_flags +- radv: add support for VkPipelineCreateFlags2CreateInfoKHR +- radv: add support for VkBufferUsageFlags2CreateInfoKHR +- radv: allow VK_REMAINING_ARRAY_LAYERS with VkImageSubresourceLayers +- radv: implement radv_Get{Device}ImageSubresourceLayout2KHR() +- radv: advertise VK_KHR_maintenance5 +- radv: remove useless NULL for pipeline layout during shader info pass +- radv: introduce radv_shader_layout for per-stage descriptor layout +- radv: stop passing redundant stage to radv_shader_stage_init() +- radv: re-introduce radv_pipeline_stage_init() +- radv: add support for loading the LSHS vertex stride from a SGPR +- radv: use the number of VS outputs for computing the tessellation info +- vulkan: ignore VkPipelineColorWriteCreateInfoEXT if the state is dynamic +- radv: reduce TCS_OFFCHIP_LAYOUT_NUM_PATCHES to 6-bits +- radv: add missing comment about TCS_OFFCHIP_LAYOUT_LSHS_VERTEX_STRIDE +- radv: fix emitting TCS epilogs for GFX6-9 +- radv: remove radv_cmd_buffer::cached_vertex_formats +- radv: remove unused param from radv_pipeline_init_multisample_state() +- radv: simplify declaring VS specific input SGPRs +- radv: stop copying if VS or TES uses the InvocationID built-in +- Revert "radv/amdgpu: workaround a kernel bug when replacing sparse mappings" +- Revert "radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build" +- radv/amdgpu: allow to execute external IBs on the compute queue +- radv/amdgpu: add support for submitting external IBs with the chained path +- zink/ci: update list of expected failures for NAVI10 +- radv: use the maximum possible workgroup size for TCS epilogs +- radv: stop declaring the scratch offset argument for TCS epilogs +- radv: declare shader arguments for TCS epilogs +- radv: add tcs_out_patch_fits_subgroup to radv_tcs_epilog_key +- aco: fix jumping from main TCS to epilog on GFX9+ +- aco: adjust TCS epilogs for RADV +- aco: allow SGPRs operands with p_jump_to_epilog +- aco: implement create_tcs_jump_to_epilog() +- radv: track the pipeline bind point for indirect commands layout +- radv: prepare radv_get_sequence_size() for DGC compute +- radv: prepare radv_prepare_dgc() for DGC compute +- radv: implement NV_device_generated_commands_compute +- radv: allow DGC on the compute queue +- radv: advertise NV_device_generated_commands_compute +- aco: rework printing shader stages +- radv: fix the per-patch data offset when TES isn't linked with TCS +- radv: stop declaring unused SGPR arguments for PS epilogs +- radv: add radv_shader_info::is_monolithic +- radv: use info->uses_view_index directly when declaring shader arguments +- radv: do not inline push constants for non-monolithic shaders +- radv: force indirect descriptor sets for non-monolithic shaders +- radv: always declare some arguments for non-monolithic VS/TCS shaders +- radv: add a new shader argument for non-monolithic shaders PC +- ac: allow to mark shader arguments as preserved +- radv: preserve shader arguments for non-monolithic VS/TCS on GFX9+ +- aco: disable shared VGPRs for non-monolithic shaders on GFX9+ +- aco: ensure to initialize exec manually for VS as LS on GFX9+ +- aco: add support for compiling VS+TCS separately on GFX9+ +- radv: always declare some arguments for non-monolithic {VS,TES}/GS shaders +- radv: preserve shader arguments for non-monolithic {VS,TES}/GS on GFX9+ +- aco: ensure to initialize exec manually for non-monolithic {VS,TES}/GS on GFX9+ +- aco: add support for compiling {VS,TES}+GS separately on GFX9+ +- radv,aco: remove unused clip/cull distances variables +- radv: rename tcs_shader to tcs in radv_emit_tcs_epilog_state() +- radv: small cleanups in radv_emit_patch_control_points() +- radv: fix emitting TCS epilogs if TES and GS are linked on GFX9+ +- radv: remove the pipeline dependency for emitting VGT_GS_MODE +- aco: fix emitting TCS epilogs end on GFX9+ +- radv: re-order IO slot layout for stages that aren't linked +- amd/ci: update list of failures/flakes for glcts-vangogh-valve +- ci: uprev vkd3d-proton +- ci: uprev Fossilize +- ci: add comment explaining which image tags to update for Fossilize +- radv: preserve shader argument for separate compilation of NGG shaders +- aco: flag blocks with long-jump as export_end for separate compilation +- aco: adjust fix_exports() for VS/TES as NGG and non-monolithic shaders +- aco: allow separate compilation of NGG shaders +- zink/ci: add zink-radv-polaris10-valve +- radv/ci: re-enable vkcts-polaris10-valve +- radv: fix capturing indirect dispatches with SQTT +- radv/ci: re-enable vkd3d-polaris10-valve +- ci: do not fail vkd3d-proton job when the expectations match +- radv/amdgpu: fix executing secondaries without IB2 +- radv/amdgpu: do not copy the original chain link for IBs +- radv: avoid emitting SQTT markers for DGC calls +- radv: add support for DGC with SQTT +- zink/ci: merge GLCTS testing with GLESx for RADV +- zink/ci: merge piglit testing with deqp-runner for RADV +- radv: fix interactions with primitives generated queries and pipeline stats +- radv: skip DGC calls when the indirect sequence count is zero with a predicate +- radv: avoid emitting THREAD_TRACE_MARKER for predicated draws/dispatches +- radv: adjust next stage for VS prologs and merged shaders compiled separately +- radv: adjust emitted prolog regs for merged shaders compiled separately +- radv: do not use pre-compiled prologs when VS is compiled separately +- radv: remove useless PIPELINE_CREATE_2_LIBRARY_BIT check for retained shaders +- radv: fix enabling DGCC +- radv: fix emitting SQTT userdata when CAM is needed +- radv: fix capturing RGP on RDNA3 with more than one Shader Engine +- zink/ci: update list of expected failures for POLARIS10/NAVI10 +- radv: set THREAD_TRACE_TOKEN_MASK.BOP_EVENTS_TOKEN_INCLUDE on GFX10.3+ +- radv: disable unsupported hw shader stages for RGP on GFX11+ +- radv: fix instruction timing on GFX11 +- ac/rgp: use correct API stage string for mesh/task shaders +- radv: set THREAD_TRACE_MARKER_ENABLE for mesh/task draws +- radv: emit relocation for mesh/task shaders +- issue_templates/Bug Report: fix outdated URL for GFXReconstruct +- ac,radv,radeonsi: rework SPM counters configuration and share it +- ac/perfcounter: add new SQ_WGP block for GFX11+ +- ac/spm: add SPM counters configuration for GFX11 +- radv: enable the PKT3 CAM bit for some SPM register writes +- radv,radeonsi: use AC_SPM_SEGMENT_TYPE_xxx instead of magic values +- ac/spm: remove useless SPM block setting for GFX9 and older GPUs +- ac/spm: add SPM block definition for GFX10-GFX10.3 +- ac/gpu_info: init num_cu_per_sh from the kernel +- ac/perfcounter: set the number of instances of GL1C to 4 +- ac/perfcounter: compute the number of global instances of TCP,SQ,GL1C and GL2C +- ac/spm: fix checking if the counter instance is valid +- ac/spm: rework how segment muxsel RAM are filled +- ac/spm: initialize and set instance mapping for counters +- radv: reserve more CS space in SQTT/SPM paths +- ac/spm: use block flags to initialize instance mapping +- ac/spm: select correct segment type for per-SE blocks +- radv,radeonsi: make sure to emit GRBM_GFX_INDEX before SQ select registers +- ac/spm: fix number of instances of GL2C +- ac,radv,radeonsi: prepare support for multi-instance SPM SQ counters +- ac,radv,radeonsi: prepare support for multi-instance SPM generic counters +- ac/spm: move the counter instance to ac_spm_counter_create_info +- ac/spm: enable support for multi-instance counters +- radv: fix checking if RGP is enabled with others tracing tools +- radv: fix missing ISA with RGP and GPL +- ac/perfcounter: add SG_WQP group for GFX11 +- ac/perfcounter: add GFX11 groups +- drirc: remove Path of Exile workarounds +- radv: remove drirc workarounds for Path Of Exile +- radv: remove absolute_depth_bias workaround +- ac/gpu_info: define AMD_MAX_WGP +- ac/spm: add new segment types for GFX11 +- ac/spm: add support for GFX11 +- radv: add SPM support for GFX11 +- radv: enable cache counters for RGP on GFX11 +- ci: update to vulkan-cts-1.3.6.3 +- radv/ci: skip dEQP-VK.robustness.* on Vangogh due to weird GPU hangs +- nir: rename atomic_add_gs_invocation_count_amd to make it more generic +- ac/nir: add lowering for mesh shader queries +- ac/nir: add lowering for task shader queries +- radv: add GDS counters offset for mesh/task queries +- radv: adjust lowering of intrinsic queries for mesh/task shaders +- radv: enable lowering of mesh/task shader queries when enabled +- radv: declare shader_query_state for mesh/task shaders +- radv: stop skip emitting CB states when there is no color attachment +- radv: re-enable DCC with mipmaps on GFX11 +- radv: fix COMPUTE_SHADER_INVOCATIONS query on compute queue +- radv: emit missing PA_{SC,SU}_LINE_STIPPLE_xxx regs in gfx preamble +- radv: fix alignment of DGC command buffers +- radv/ci: update list of expected failures on PITCAIRN +- radv/ci: update list of flakes for NAVI10/VEGA10 +- radv/amdgpu: fix alignment of command buffers +- radv: enable DCC for MSAA images on GFX11 +- zink/ci: update list of expectations for zink-anv-tgl +- zink/ci: bump zink-anv-tgl-full timeout to 1h45m +- radv/ci: rename GFX1100 lists to NAVI31 +- radv: fix emulated geometry shader primitives/invocations queries +- radv/ci: remove duplicate skipped tests for RAVEN/STONEY +- radv/ci: exclude dEQP-VK.texture.explicit_lod.2d.sizes.128x128_* for all jobs +- radv: fix synchronization with emulated GS primitives/invocations queries +- radv/ci: remove no longer existing test for VANGOGH +- radv/ci: cleanup list of expected failures for NAVI10/NAVI21/VEGA10 +- radv: always write the sample positions when a new descriptor BO is created +- radv: fill the scratch BO in radv_fill_shader_rings() +- radv: fix gang submissions with chaining +- radv: fix re-emitting streamout descriptors for NGG streamout +- radv: fix IB alignment +- zink: use warn_missing_feature for missing modifier support +- radv: fix destroying GDS/OA BOs +- radv: allocate only 1 GDS OA counter for gfx10 NGG streamout +- ac/nir: only consider overflow for valid feedback buffers +- radv/ci: update list of expected failures on RAVEN +- radv/ci: update list of flakes for VANGOGH +- radv/ci: update list of flakes for STONEY +- radv: disable primitive restart for non-indexed draws on GFX11 +- radv: enable radv_disable_aniso_single_level=true for Zink too +- amd/llvm,aco,radv: implement NGG streamout with GDS_STRMOUT registers on GFX11 +- radv: mark GDS as needed for XFB queries with NGG streamout on GFX11 +- radv: skip GDS allocation for NGG streamout on GFX11 +- zink/ci: remove expected failures that are skipped for RADV +- ci: update CTS to vulkan-cts-1.3.7.0 +- ci: bump the number of tests per group from 500 to 5000 for Vulkan drivers +- ci: bump DEQP_FRACTION for some jobs +- radv: set ENABLE_PING_PONG_BIN_ORDER for GFX11.5 +- radv: initialize video decoder for GFX11.5 +- ac/gpu_info: query the maximum number of IBs per submit from the kernel +- Revert "radv: fix finding shaders by PC" +- radv: fix missing predicate bit for WRITE_DATA helper +- ac/gpu_info: fix querying the maximum number of IBs per ring +- radv: remove outdated RADV_DEBUG=vmfaults support +- amd: update amdgpu_drm.h +- amd: add has_gpuvm_fault_query +- radv/amdgpu: add support quering the last GPUVM fault +- radv: query and report the last GPUVM fault with RADV_DEBUG=hang +- radv: report the last GPUVM fault when a device lost is detected +- ac/gpu_info: remove bogus assertion about number of COMPUTE/SDMA queues +- radv: fix a synchronization issue with primitives generated query on RDNA1-2 +- radv: bind the non-dynamic graphics state from the pipeline unconditionally +- radv: fix compute shader invocations query on compute queue on GFX6 +- radv: emit COMPUTE_PIPELINESTAT_ENABLE for CS invocations on ACE +- nir: fix inserting the break instruction for partial loop unrolling +- radv: fix registering queues for RGP with compute only +- radv: set radv_zero_vram=true for Unreal Engine 4/5 +- radv: fix a descriptor leak with debug names and host base descriptor set +- radv: add a missing async compute workaround for Tonga/Iceland +- radv: disable TC-compatible HTILE on Tonga and Iceland +- radv: set radv_invariant_geom=true for War Thunder +- radv: do not set OREO_MODE to fix rare corruption on GFX11 + +Saroj Kumar (4): + +- radeonsi: Add perfetto support in radeonsi +- radeonsi: Add u_trace init code in radeonsi +- radeonsi: Add tracepoints in radeonsi driver +- radeonsi: fixes compilaton error when perfetto is disabled + +Sathishkumar S (2): + +- radeonsi/vcn: support variable number of bs_bufs +- radeonsi/vcn: num bs_bufs must be proportional to num jpeg engines + +Semjon Kravtsenko (1): + +- glx: Assign unique serial number to GLXBadFBConfig error + +Seppo Yli-Olli (1): + +- zink: Fix SyntaxWarning in zink_extensions script + +Sergi Blanch Torne (7): + +- Introduce ci-kdl builder and launcher. +- Integrate ci-kdl in the building process and launch process. +- ci: disable Collabora's LAVA lab for maintance +- Revert "ci: disable Collabora's LAVA lab for maintance" +- Revert "ci: disable Collabora's LAVA lab for maintance" +- ci: disable Collabora's LAVA lab for maintance +- Revert "ci: disable Collabora's LAVA lab for maintance" + +Sid Pranjale (1): + +- nvk: Enable VK_EXT_load_store_op_none + +Sil Vilerino (20): + +- util: Blake3 - Identify arm64ec as aarch64 instead of x64 +- d3d12: Fix Map/Unmap of YUV resources +- d3d12: Fix H264 interlaced decode +- d3d12: Video Decode - Remove unnecessary copy for texture array case +- util/vl_vlc: Use UINT64_MAX instead of ~0UL with MSVC compiler +- d3d12: Extend video screen AV1 encode tile support checking +- aux/tc: Add ASSERTED to unreferenced release build variable +- d3d12: Video - Relax ID3D12VideoDevice QI version for decode, process +- frontends/va: Add profile param when querying PIPE_VIDEO_CAP_ENC_QUALITY_LEVEL +- d3d12: Upgrade to D3D12 Agility SDK 1.611 Video interface +- d3d12: Fixes AV1 tx_mode_support reporting and unsupported tx_mode overriding +- d3d12: Video Decode - Wait for GPU completion before destroying decoder in-flight objects +- d3d12: Do not destroy codec when destroying video buffer +- d3d12: AV1 encode - Add lower resolution fallback check for uniform tile support +- d3d12: AV1 encode - add fallback for app passing unsupported pic_params.InterpolationFilter +- d3d12: AV1 Encode - Fix VAConfigAttribEncMaxRefFrames reporting +- frontend/va: Add support for VAConfigAttribEncMaxTileRows/Cols +- d3d12: Add support for PIPE_VIDEO_CAP_ENC_MAX_TILE_ROWS/COLS +- d3d12: Allocate d3d12_video_buffer with higher alignment for compatibility +- d3d12: d3d12_video_buffer_create_impl - Fix resource importing + +Simon Ser (7): + +- wayland: enable use of wayland-protocols as a subproject +- vulkan/wsi/wayland: add support for IMMEDIATE +- vulkan/wsi/wayland: fix unset present_mode +- radv/winsys: check amdgpu_create_bo_from_user_mem() for EINVAL +- egl: extract EGLDevice setup in dedicated function +- egl: move dri2_setup_device() after dri2_setup_extensions() +- egl: ensure a render node is passed to _eglFindDevice() + +Simon Zeni (1): + +- nouveau/winsys: use mmap instead of mmap64 in nouveau_bo + +SoroushIMG (1): + +- pvr: fix mipmap size calculation for bc formats + +Sviatoslav Peleshko (9): + +- dri: Use RGB internal formats for RGBX formats +- intel/isl: Don't over-allocate CLEAR_COLOR size to use whole cache line +- anv: Do fast clear color initialization more delicately +- zink: Change zink_vertex_elements_hw_state::b.strides to VkDeviceSize +- intel/fs: Check if the whole ubo load range is in the push const range +- zink: Store zink_vertex_elements_hw_state::b.strides by binding id +- intel/fs: Fix "packed word exception" condition for register regioning +- intel/eu/validate: Validate "packed word exception" stricter +- nir/loop_analyze: Fix inverted condition handling in iterations calculation + +Sylvain Munaut (9): + +- egl/dri2: Add a couple of missing mutex release in error path +- mesa: Enable ARB_texture_border_clamp in GL Core +- include: Fix the PFN declarations to be pointers as they should +- glx: Add missing MesaGLInteropGLXFlushObjects +- glx: Export the MESA GL Interop functions through glXGetProcAddress +- egl: Export the MESA GL Interop functions through eglGetProcAddress +- glx: Remove MESA_depth_float_bit from enum +- glx: Advertise GLX_MESA_gl_interop extension if support present +- egl: Advertise EGL_MESA_gl_interop extension if support present + +Tapani Pälli (34): + +- intel/blorp: add a new flag to communicate PSS sync need +- anv: implement required PSS sync for Wa_18019816803 +- iris: implement required PSS sync for Wa_18019816803 +- vulkan/runtime: change assert to match specification needs +- anv: remove assert, size is asserted in the runtime +- anv: refactor batch_set_preemption to use batch_emit_pipe_control +- anv: implement a dummy depth flush for Wa_14016712196 +- iris: implement a dummy depth flush for Wa_14016712196 +- mesa: fix some TexParameter and SamplerParameter cases +- mesa: remove GL_UNSIGNED_BYTE as supported for snorm reads +- ci: add a fix for KHR-GLES3.packed_pixels.*snorm tests +- anv: implement Wa_14018912822 +- iris: implement Wa_14018912822 +- driconf: use lower_depth_range_rate for The Spirit and The Mouse +- mesa: disable snorm readpix clamping with EXT_render_snorm +- iris: modify Wa_14014414195 to use intel_needs_workaround +- mesa: some cleanups for texparam extension checks +- iris: avoid issues with undefined clip distance +- crocus: avoid issues with undefined clip distance +- anv: refactor to fix pipe control debugging +- anv: fix a leak of fp64_nir shader +- iris: use intel_needs_workaround for Wa_14014414195 part 2 +- iris: correct dst alpha blend factor in Wa_14018912822 +- iris/anv: move Wa_14018912822 as a drirc workaround +- iris: flush data cache when flushing HDC on GFX < 12 +- anv: HDC flush is available only for GFX_VER 12+ +- iris: HDC flush is available only for GFX_VER 12+ +- intel/genxml: remove HDC from gen11.xml, it is not available +- mesa/st: ignore StencilSampling if stencil not part of the format +- intel/dev: expand existing fix for all gfx12 with small EU count +- egl: fix leaking drmDevicePtr in _eglFindDevice +- iris: add data cache flush for pre hiz op +- anv/drirc: add option to disable FCV optimization +- drirc: Set limit_trig_input_range option for Valheim + +Tatsuyuki Ishi (8): + +- radv/amdgpu: Remove unused bo_list variable from cs_submit. +- radv/winsys: Remove unused struct radv_winsys_bo_list. +- radv/amdgpu: Do not pass in a BO handle when clearing PRT VA region. +- radv: Fix IB size for RADV_DEBUG=hang. +- radv: Fix dumping vertex descriptors with RADV_DEBUG=hang. +- radv/amdgpu: Use rwlock to protect access to virtual BOs. +- zink: Fix missing sparse buffer bind synchronization. +- zink: Fix waiting for texture commit semaphores. + +Thomas H.P. Andersen (65): + +- tgsi: remove unused tgsi_shader_info.num_tokens +- tgsi: remove unused tgsi_shader_info.array_max +- tgsi: remove unused tgsi_shader_info.num_memory_instructions +- tgsi: remove unused tgsi_shader_info.colors_read +- tgsi: remove unused tgsi_shader_info.colors_written +- tgsi: remove unused tgsi_shader_info.reads_position +- tgsi: remove unused tgsi_shader_info.reads_samplemask +- svga: remove unused struct field +- tgsi: remove unused tgsi_shader_info.reads_tess_factors +- tgsi: remove unused tgsi_shader_info fields +- tgsi: remove unused tgsi_shader_info fields +- tgsi: remove unused tgsi_shader_info.uses_drawid +- tgsi: remove unused tgsi_shader_info fields +- tgsi: remove unused tgsi_shader_info.uses_subgroup_info +- tgsi: remove unused tgsi_shader_info.writes_primid +- tgsi: remove unused tgsi_shader_info.uses_doubles +- tgsi: remove unused tgsi_shader_info.uses_derivatives +- tgsi: remove unused tgsi_shader_info.uses_bindless_samplers +- tgsi: remove unused tgsi_shader_info.uses_bindless_images +- tgsi: remove unused tgsi_shader_info.clipdist_writemask +- tgsi: remove unused tgsi_shader_info.culldist_writemask +- tgsi: remove unused tgsi_shader_info.images_load +- tgsi: remove unused tgsi_shader_info.images_store +- tgsi: remove unused tgsi_shader_info.images_atomic +- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_load +- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_store +- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_atomic +- tgsi: remove unused tgsi_shader_info.uses_bindless_image_load +- tgsi: remove unused tgsi_shader_info.uses_bindless_image_store +- tgsi: remove unused tgsi_shader_info.uses_bindless_image_atomic +- tgsi: remove unused tgsi_shader_info.indirect_files_read +- tgsi: remove unused tgsi_shader_info.indirect_files_written +- tgsi: remove unused tgsi_shader_info.const_buffers_indirect +- tgsi: remove unused tgsi_shader_info.max_depth +- tgsi: drop two unused functions +- nvk: use common physical device enumeration +- nvk: fix implicit-fallthrough warnings with clang +- nvk: delete commented code +- nvk: fix mem leaks +- nvk: use common descriptor set layout code +- nvk: use common pipeline layout code +- nvk: advertise KHR_shader_non_semantic_info +- nvk: advertise KHR_image_format_list +- nvk: advertise EXT_private_data +- nvk: advertise KHR_sampler_mirror_clamp_to_edge +- nvk: KHR_descriptor_update_template +- nvk: CmdPushDescriptorSetWithTemplateKHR +- nvk: drop dead assignment +- nvk: drop dead assignment +- nvk: fix initialization override +- nvk: sort extensions +- nvk: advertize KHR_relaxed_block_layout +- nvk: add check for VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT +- nvk: advertise EXT_image_2d_view_of_3d +- nvk: fix maxPushDescriptors +- nvk: call correct macro to clear views +- nouveau/mme: use fermi enum in fermi builder +- nvk: add warning on non-nouveau drm driver +- nvk: Implement VK_KHR_draw_indirect_count on Turing+ +- nvk: set device info before use in nvk_get_device_extensions +- nvk: simplify code by using new helpers +- nvk: remove duplicated device features +- nvk: EXT_conditional_rendering +- nvk: advertise VK_EXT_tooling_info +- nvk: set optimization level to 3 + +Thong Thai (3): + +- radeonsi: enable vcn encoder rgb input support +- Update radeon_vcn_enc.c +- frontends/va/config: report max width and height for encoding/decoding + +Timothy Arceri (27): + +- glsl: fix validation of ES vertex attribs +- nir/opt_copy_prop_vars: don't clone copies if branch empty +- nir/opt_copy_prop_vars: speedup cloning of copy tables +- nir/opt_copy_prop_vars: remove var hash entry on kill alias +- nir/opt_copy_prop_vars: skip cloning of copies arrays until needed +- nir/opt_copy_prop_vars: drop reuse of dynamic arrays +- glsl: fix spirv sso validation +- glsl: mark structs containing images as bindless +- util: add radeonsi workaround for Nowhere Patrol +- glsl: fix out params in glsl to nir +- glsl_to_nir: add more unhandled function types +- nir: replace use of nir_src_copy() +- nir: remove unused nir_src_copy() +- nir: remove unused param from nir_alu_src_copy() +- glsl: remove field from gl_shader_program +- glsl: move get_varying_type() declaration earlier +- glsl: add nir version of validate_first_and_last_interface_explicit_locations() +- glsl: switch to nir validate_first_and_last_interface_explicit_locations() +- glsl: remove unused validate_first_and_last_interface_explicit_locations() +- nir: fix typo in comment +- nir: copy explicit_invariant flag to nir vars +- glsl: move interpolation_string() to linker_util +- glsl: move is_gl_identifier() to linker_util +- nir: add used field to nir variables +- glsl: implement cross_validate_outputs_to_inputs() in nir linker +- glsl: switch to nir linkers cross_validate_outputs_to_inputs() +- glsl: remove now unused varying linker code + +Timur Kristóf (39): + +- aco: Fix subgroup_id intrinsic on GFX10.3+. +- ac/nir: Simplify arg unpacking when shift is zero. +- ac/nir: Add new pass to lower intrinsics to shader args. +- radv: Move radv_select_hw_stage to radv_shader_info. +- radv: Use ac_nir_lower_intrinsics_to_args. +- radeonsi: Move si_select_hw_stage to si_shader_info. +- radeonsi: Use ac_nir_lower_intrinsics_to_args. +- aco: Remove subgroup_id and num_subgroups intrinsics. +- ac/llvm: Remove subgroup_id and num_subgroups intrinsics. +- aco: Refactor select_program to smaller functions. +- nir/opt_dead_cf: Remove if branches with undef condition. +- ac/nir: Add done arg to ac_nir_export_position. +- ac/nir: Slightly refactor how pos0 exports are added when missing. +- ac/nir/ngg: Wait for attribute stores before VS/TES/GS pos0 export. +- ac/nir/ngg: Refactor mesh shader primitive export. +- ac/nir/ngg: Wait for attribute ring stores in mesh shaders. +- ac/nir/ngg: Extract nogs_export_vertex_params function. +- ac/gpu_info: Add some SDMA related information. +- ac: Clarify SDMA opcode defines. +- ac: Add amd_ip_type argument to ac_parse_ib and ac_parse_ib_chunk. +- ac: Rename ac_do_parse_ib to parse_pkt3_ib. +- ac: Print IP type for IBs. +- ac: Add rudimentary implementation of printing SDMA IBs. +- radv: Rename SDMA file to radv_sdma.c +- radv: Use const device argument in radv_sdma_copy_buffer. +- radv: Use const on vi_alpha_is_on_msb arguments. +- radv: Only call si_cp_dma_wait_for_idle on GFX and ACE queues. +- radv: Move radv_cp_wait_mem to radv_cs.h and add queue family argument. +- radv: Refactor WRITE_DATA helper function. +- radv: Use new WRITE_DATA helper in more places. +- radv: Add queue family argument to some functions. +- radv: Wait for bottom of pipe in ACE gang wait postamble. +- radv: Simplify gang CS and semaphore initialization. +- radv: Allow gang submit use cases other than task shaders. +- radv: Slightly refactor gang semaphore functions. +- radv: Add gang follower semaphore functions. +- radv: Support SDMA in radv_cs_write_data_head. +- radv: Support SDMA in radv_cp_wait_mem. +- radv: Support SDMA in si_cs_emit_write_event_eop. + +Vignesh Raman (4): + +- ci: add Vignesh Raman into restricted traces access list +- Do explicit cast to suppress clang warnings +- ci: enforce -Wimplicit-const-int-float-conversion for clang +- ci: Uprev crosvm + +Vinson Lee (8): + +- nvk: Fix assert +- lavapipe: Fix struct initialization +- intel/decoder: Fix memory leak on error path +- nv50: Remove unused value +- vk/wsi/x11: Remove dead code +- freedreno/replay: Fix implicit-function-declaration error +- anv: Fix transfer type assert +- broadcom/qpu: Remove duplicate variable opcode + +Vitaliy Triang3l Kuzmin (3): + +- r600/asm: Fix AR force_add_cf setting if a clause is not open +- r600/asm: Make sure MOVA and SET_CF_IDX are in the same clause +- r600: Replace R600_BIG_ENDIAN with UTIL_ARCH_BIG_ENDIAN + +Vlad Schiller (15): + +- pvr: Implement VK_EXT_tooling_info +- pvr: Add 'info' PVR_DEBUG flag +- pvr: Implement VK_KHR_format_feature_flags2 +- pvr: Remove PVR_WINSYS_BO_FLAG_ZERO_ON_ALLOC flag +- pvr: Add VK_KHR_driver_properties +- pvr: Use correct index when writing query availability data +- pvr: Enable VK_EXT_scalar_block_layout +- pvr: Enable KHR_image_format_list +- pvr: Enable VK_KHR_uniform_buffer_standard_layout +- pvr: Implement VK_KHR_external_fence +- pvr: Implement VK_KHR_external_semaphore +- pvr: Enable VK_KHR_bind_memory2 extension +- pvr: Implement VK_EXT_texel_buffer_alignment +- pvr: Implement VK_EXT_host_query_reset +- pvr: Fix VK_EXT_texel_buffer_alignment + +WinLinux1028 (1): + +- radeonsi: prefix function with si\_ to prevent name collision + +Xaver Hugl (1): + +- vulkan wsi: add support for PresentOptionAsyncMayTear + +Yiwei Zhang (46): + +- venus: handle query feedback creation failure +- venus: ensure consistency of query overflow behavior +- venus: add a missing barrier before copying query feedback +- venus: refactor query feedback cmd record +- venus: reduce to use 4K mem suballoc align on platforms known to fit +- turnip: flush cache for dstBuffer in vkCmdCopyQueryPoolResults +- lvp: avoid reading immutable sampler from desc write info +- ci/venus: update venus-lavapipe expectations +- venus: fix a cmd builder render_pass state leak across reset +- venus: fix cmd state leak across implicit reset +- venus: log and doc the broken query feedback in suspended render pass +- venus: move transient storage from cmd to pool +- venus: remove redundant fb tracking from cmd builder +- venus: use tracked queue_family_index from the cmd pool +- venus: cleanup vn_cmd_begin_render_pass usage +- venus: add helpers to track subpass view mask +- venus: avoid redundant tracking of render pass +- venus: refactor more cmd states into cmd builder +- venus: use in_render_pass to skip present_src counting +- ci/venus: remove fixed tests that no longer run +- ci/venus: reenable pipeline cts +- venus: suppress a false logging +- venus: add no_sparse debug option to disable sparse resource support +- venus: set deviceMemoryReport feature +- venus: expose at least one cached memory type +- venus: expose KHR_external_fence/sempahore_fd extensions +- venus: fix a device memory report leak +- vulkan: remove a dup entry from vk_image_usage_to_ahb_usage +- vulkan/android: improve vkQueueSignalReleaseImageANDROID +- vulkan/android: add missing AHARDWAREBUFFER_USAGE_GPU_DATA_BUFFER usage +- vulkan/android: drop vk_buffer dependency from common AHB impl +- venus: use common vk_queue object +- venus: use common ANB implementation +- venus: use more common vk_queue related implementations +- venus: drop device, family, index, flags tracking from vn_queue +- venus: fix re-export of imported classic 3d resources +- venus: remove redundant bo roundtrip and add more docs +- venus: track VkPhysicalDeviceMemoryProperties instead +- venus: refactor vn_device_memory to prepare for async alloc +- venus: make device memory alloc async +- venus: enable Vulkan 1.3 for Android 13 and above +- zink: sync queue access for vkQueueWaitIdle +- venus: properly expose KHR_external_fence/sempahore_fd +- ci/venus: mark more flaky tests after recent cts uprev +- venus: fix query feedback batch leak and race upon submission +- zink: apply can_do_invalid_linear_modifier to Venus + +Yogesh Mohan Marimuthu (12): + +- gallium: remove start_slot parameter from pipe_context::set_vertex_buffers +- ac/surface: add astc block size to bpe_to_format() function +- util: move ASTCLutHolder from mesa/main to util +- vulkan/formats,zink: move vk_format_from_pipe_format() function +- vulkan/runtime: add compute astc decoder helper functions +- vulkan add 3D texture support for compute astc decoder +- radv: integrate meta astc compute decoder to radv +- radeonsi: add more documentation for dpbb debug env variable +- docs: remove document for unused variable dfsm from AMD_DEBUG +- radeonsi: correct old comment in si_emit_framebuffer_state() +- radeonsi: In gfx6_init_gfx_preamble_state() use gfx_level only from sctx +- radeonsi: add radeonsi to GL_RENDERER string + +Yonggang Luo (43): + +- lima: Convert to use nir_foreach_function_impl when possible +- freedreno: Switch to use nir_foreach_function_impl in tu_shader.cc +- zink: Convert to use nir_foreach_function_impl when possible +- lavapipe: Convert to use nir_foreach_function_impl +- lavapipe: fixes indent of function lvp_inline_uniforms +- microsoft/compiler: convert to use nir_foreach_function_with_impl in function emit_module +- microsoft/clc/compiler: Convert to use nir_foreach_function_impl when possible +- radeonsi: Convert to use nir_foreach_function_impl +- ac: Switch to use nir_foreach_function_impl in function analyze_shader_before_culling +- util: Move pipe_swizzle from p_defines.h to u_formats.h +- util: Move PIPE_MASK_* from p_defines.h to u_formats.h +- util: Move pipe_color_union from p_defines.h into u_formats.h +- util: Move u_pack_color.h and dbughelp.h into src/util from/src/gallium/auxiliary/util/ +- util: Remove include "pipe/\*.h" in src/util/* files +- util:Move only gallium used u_debug_refcnt.* and u_debug_describe.* into src/gallium/auxiliary/util/ +- util/meson: Getting mesa util core to be self contained +- pvr: decouple vulkan driver and compiler from gallium +- freedreno: decouple compiler and vulkan driver from gallium +- glx: decouple from gallium +- meson: Remove arm_neon_workaround +- nouveau/drm-shim: Decouple from gallium +- ac/radv: decouple radv vulkan driver and compiler from gallium +- etnaviv: decouple drm from gallium +- asahi: decouple layout from gallium +- compiler: Move WRITEMASK_* from prog_instruction.h into shader_enums.h +- intel/blorp: Use float directly to avoid #include "mesa/main/format_utils.h" +- intel/blorp: brw_sampler_prog_key_data::swizzles is only and should only accessed in crocus +- intel/brw: Define and use BRW_SWIZZLE_* instead of SWIZZLE_* +- crocus: #include "program/prog_instruction.h" for SWIZZLE_* +- intel/compiler,intel/blorp,intel/vulkan: decouple vulkan driver and compiler from gallium +- util/treewide: Use alignas(x) instead __attribute__((aligned(x))) +- v3dv: Use alignas(8) over 64 bit atomic value +- svga: use alignas over struct MKSGuestStatInfoEntry +- radv: Fixes mingw linkage error undefined reference to \`radv_GetCalibratedTimestampsEXT' +- v3d: Use DIV_ROUND_UP instead div_round_up +- freedreno: Use shared DIV_ROUND_UP instead div_round_up +- sfn: Use 4 instead of ATOMIC_COUNTER_SIZE +- intel/brw: use 4 instead of MAX_VERTEX_STREAMS to avoid #include "mesa/main/config.h" +- d3d12: replace use of MAX_VERTEX_STREAMS with PIPE_MAX_VERTEX_STREAMS +- compiler: use 4 instead ATOMIC_COUNTER_SIZE in glsl_types.h to avoid #include "mesa/main/config.h" +- compiler/glsl: Move glsl_print_type from glsl_types.* to ir_print_visitor.cpp +- util: Deduplicate macros between u_math.h and macros.h +- nvk: Should use alignment instead of align + +Yusuf Khan (4): + +- nouveau/ws: remove the drm.h header +- nvk: implement GetDeviceMemoryCommitment +- nvk: support GetImageSparseMemoryRequirements2 +- nvk: expose KHR_driver_properties + +Zhang Ning (1): + +- Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR" + +antonino (14): + +- virgl: add ci flake +- freedreno: add ci flake +- zink: remove unused indices from \`nir_load_push_constant` calls +- zink/nir: add a zink specific intrinsic for push constants +- vulkan/wsi: add \`vk_wsi_force_swapchain_to_current_extent` driconf +- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "The Talos Principle" +- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "Serious Sam Fusion" +- vulkan: Extend vkGet/SetPrivateDataEXT handling to all platforms +- vulkan: Extend vkGet/SetPrivateDataEXT handling to VkSurface +- vulkan: Handle vkSetDebugUtilsObjectNameEXT on WSI objects +- zink: store bindless var when creating it to avoid creating it again +- nir: fix several crashes in \`nir_lower_tex` +- nir: don't take the derivative of the array index in \`nir_lower_tex` +- vulkan: use instance allocator for \`object_name` in some objects + +cheyang (1): + +- isaspec : fix isaspec build error in aosp + +georgeouzou (1): + +- nvk: Support VK_EXT_line_rasterization + +jazzfool (1): + +- zink: Hash only first 32 bits of zink_gfx_pipeline_state with full DS3 + +lorn10 (1): + +- docs: Update Clover's env variable documentation + +norablackcat (2): + +- spirv/nir_to_spirv: add expect assume op codes +- rusticl: add cl_khr_expect_assume + +timmac-qmc (1): + +- glsl: fix potential crash with DisableUniformArrayResize + +twisted89 (1): + +- util/driconf: add workarounds for the Chronicles of Riddick + +wangra (1): + +- tu/kgsl: Fix bitfield of DITHER_MODE_MRT6 + +xurui (1): + +- glx: There is no need to psc++ diff --git a/docs/relnotes/new_features.txt b/docs/relnotes/new_features.txt deleted file mode 100644 index ca79cf2a745..00000000000 --- a/docs/relnotes/new_features.txt +++ /dev/null @@ -1,21 +0,0 @@ -New drivers ------------ - -- NVK: A Vulkan driver for Nvidia hardware - -New features ------------- -VK_EXT_pipeline_robustness on ANV -VK_KHR_maintenance5 on RADV -OpenGL ES 3.1 on Asahi -GL_ARB_compute_shader on Asahi -GL_ARB_shader_atomic_counters on Asahi -GL_ARB_shader_image_load_store on Asahi -GL_ARB_shader_image_size on Asahi -GL_ARB_shader_storage_buffer_object on Asahi -GL_ARB_sample_shading on Asahi -GL_OES_sample_variables on Asahi -GL_OES_shader_multisample_interpolation on Asahi -GL_OES_gpu_shader5 on Asahi -EGL_ANDROID_blob_cache works when disk caching is disabled -VK_KHR_cooperative_matrix on RADV/GFX11+ |