shadPS4

mirror of https://github.com/shadps4-emu/shadPS4.git synced 2026-06-02 21:05:13 -06:00

Author	SHA1	Message	Date
Marcin Mikołajczyk	0add6b8c1f	Neo: packed math for unsigned/signed integers (#4407 )	2026-05-13 20:33:50 -07:00
squidbus	ac61f4aee2	shader_recompiler: Strip out manual bounds checking (#4380 )	2026-05-09 10:05:18 -07:00
oltolm	442a07a707	Fix compilation with mingw-w64 (#4365 ) * cmake: apply sirit unused-command-line-argument flag only with Clang * shader_recompiler: move Opcode magic_enum range customization to opcodes.h Define the magic_enum range for Shader::Gcn::Opcode as a proper enum_range specialization in opcodes.h instead of relying on translation-unit macros in translate.cpp. This makes the customization visible where the enum is used and avoids the GCC linkage/build issue. * thread: use Windows thread naming path for MinGW-w64 Switch the thread naming guard from _MSC_VER to _WIN32 so MinGW-w64 builds use the Windows SetThreadDescription implementation instead of falling through the POSIX branch. This matches the platform rather than the compiler and avoids the MinGW-w64 build issue.	2026-05-08 22:36:55 -07:00
Marcin Mikołajczyk	34b35b526e	Neo: Float16 packed math (#4354 )	2026-05-04 15:21:20 -07:00
Marcin Mikołajczyk	a3e25efad5	Neo: V_MAD_MIX opcodes (#4338 )	2026-04-30 16:56:27 -07:00
Stephen Miller	475696c542	Bump enum range to fix unknown opcode logging (#4333 )	2026-04-29 07:11:41 +03:00
Marcin Mikołajczyk	99f2480e21	Neo: V_*_F16 arithmetic ops (#4311 )	2026-04-25 12:51:02 +02:00
Marcin Mikołajczyk	c1e496efcd	SDWA (#4203 )	2026-04-14 22:41:55 +03:00
TheTurtle	1f50aa3172	frontend: Add helper methods for thread bit getters and setters (#4243 ) Co-authored-by: georgemoralis <giorgosmrls@gmail.com>	2026-04-09 23:32:21 +03:00
georgemoralis	08168dc386	New config mode (part1 of 0.15.1 branch series) (#4145 ) * using new emulator_settings * the default user is now just player one * transfer install, addon dirs * fix load custom config issue --------- Co-authored-by: kalaposfos13 <153381648+kalaposfos13@users.noreply.github.com>	2026-03-21 22:26:36 +02:00
Kravickas	2ca342970a	MIP fixes (#4141 ) * int32-modifiers GCN VOP3 abs/neg modifier bits always operate on the sign bit (bit 31) regardless of instruction type. For integer operands this means: abs = clear bit 31 (x & 0x7FFFFFFF) neg = toggle bit 31 (x ^ 0x80000000) * int64-modifiers Previously GetSrc64<IR::U64> completely ignored input modifiers for integer operands. Now unpacks to two U32s, modifies the high dword's bit 31 (= bit 63 of the 64-bit value), and repacks. * V_MUL_LEGACY_F32 GCN V_MUL_LEGACY_F32: if either source is zero, result is +0.0 regardless of the other operand (even NaN or Inf). Standard IEEE multiply produces NaN for 0*Inf. The fix adds a zero-check select before the multiply.	2026-03-18 10:05:20 +02:00
psucien	a9f8eaf778	video_core: Initial implementation of pipeline cache (#3816 ) Some checks are pending Build and Release / reuse (push) Waiting to run Details Build and Release / clang-format (push) Waiting to run Details Build and Release / get-info (push) Waiting to run Details Build and Release / windows-sdl (push) Blocked by required conditions Details Build and Release / macos-sdl (push) Blocked by required conditions Details Build and Release / linux-sdl (push) Blocked by required conditions Details Build and Release / linux-sdl-gcc (push) Blocked by required conditions Details Build and Release / pre-release (push) Blocked by required conditions Details * Initial implementation * Fix for crash caused by stale stages data; cosmetics applied * Someone mentioned the assert * Async blob writer * Fix for memory leak * Remain stuff * Async changed to `packaged_task`	2025-11-29 11:52:08 +02:00
Stephen Miller	683e5f3b04	Core: Simulate write-only file access with read-write access (#3360 ) Some checks are pending Build and Release / reuse (push) Waiting to run Details Build and Release / clang-format (push) Waiting to run Details Build and Release / get-info (push) Waiting to run Details Build and Release / windows-sdl (push) Blocked by required conditions Details Build and Release / macos-sdl (push) Blocked by required conditions Details Build and Release / linux-sdl (push) Blocked by required conditions Details Build and Release / linux-sdl-gcc (push) Blocked by required conditions Details Build and Release / pre-release (push) Blocked by required conditions Details * Swap write access mode for read write Opening with access mode w will erase the opened file. We do not want this. * Create mode Opening with write access was previously the only way to create a file through open, so add a separate FileAccessMode that uses the write access mode to create files. * Update file_system.cpp Remove a hack added to posix_rename to bypass the file clearing behaviors of FileAccessMode::Write * Check access mode in read functions Write-only files cause the EBADF return on the various read functions. Now that we're opening files differently, properly handling this is necessary. * Separate appends into proper modes Fixes a potential regression from one of my prior PRs, and ensures the Write \| Append flag combo also behaves properly in read-related functions. * Move IsWriteOnly check after device/socket reads file->f is only valid for files, so checking this before checking for sockets/devices will cause access violations. * Fix issues Now that Write is identical to ReadWrite, internal uses of Write need to be swapped to my new Create mode * Fix remaining uses of FileAccessMode write to create files Missed these before. * Fix rebase * Add stubbed get_authinfo (#3722) * mostly stubbed get_authinfo * Return value observed on console if get_authinfo was called for the current thread, esrch otherwise --------- Co-authored-by: kalaposfos13 <153381648+kalaposfos13@users.noreply.github.com> Co-authored-by: georgemoralis <giorgosmrls@gmail.com>	2025-11-04 10:57:26 +02:00
TheTurtle	8f37cfb739	amdgpu: Split liverpool registers and cleanup (#3707 ) Some checks are pending Build and Release / reuse (push) Waiting to run Details Build and Release / clang-format (push) Waiting to run Details Build and Release / get-info (push) Waiting to run Details Build and Release / windows-sdl (push) Blocked by required conditions Details Build and Release / windows-qt (push) Blocked by required conditions Details Build and Release / macos-sdl (push) Blocked by required conditions Details Build and Release / macos-qt (push) Blocked by required conditions Details Build and Release / linux-sdl (push) Blocked by required conditions Details Build and Release / linux-qt (push) Blocked by required conditions Details Build and Release / linux-sdl-gcc (push) Blocked by required conditions Details Build and Release / linux-qt-gcc (push) Blocked by required conditions Details Build and Release / pre-release (push) Blocked by required conditions Details	2025-10-05 13:42:40 -07:00
TheTurtle	374c2194d4	video_core: Address various UE bugs (#3559 ) * vk_rasterizer: Reorder image query in fast clear elimination Fixes missing clears when a texture is being cleared using this method but never actually used for rendering purposes by ensuring the texture cache has at least a chance to register cmask * shader_recompiler: Partial support for ANCILLARY_ENA * pixel_format: Add number conversion of BC6 srgb format * texture_cache: Support aliases of 3D and 2D array images Used be UE to render its post processing LUT * pixel_format: Test BC6 srgb as unorm Still not sure what is up with snorm/unorm can be useful to have both actions to compare for now * video_core: Use attachment feedback layout instead of general if possible UE games often do mipgen passes where the previous mip of the image being rendered to is bound for reading. This appears to cause corruption issues so use attachment feedback loop extension to ensure correct output * renderer_vulkan: Improve feedback loop code * Set proper usage flag for feedback loop usage * Add dynamic state extension and enable it for color aspect when necessary * Check if image is bound instead of force_general for better code consistency * shader_recompiler: More proper depth export implementation * shader_recompiler: Fix bug in output modifiers * shader_recompiler: Fix sampling from MSAA images This is not allowed by any graphics API but seems hardware supports it somehow and it can be encountered. To avoid glitched output translate to to a texelFetch call on sample 0 * clang format * image: Add back missing code * shader_recompiler: Better ancillary implementation Now is implemented with a custom attribute that is constant propagated depending on which parts of it are extracted. It will assert if an unknown part is used or if the attribute itself is not removed by dead code elim * copy_shader: Ignore not enabled export channels * constant_propagation: Invalidate ancillary after successful elimination * spirv: Fix f11/f10 conversion to f32 --------- Co-authored-by: georgemoralis <giorgosmrls@gmail.com>	2025-09-12 19:29:16 +03:00
baggins183	df52585086	Allow vector and scalar offset in buffer address arg to LoadBuffer/StoreBuffer (#3439 ) Some checks are pending Build and Release / reuse (push) Waiting to run Details Build and Release / clang-format (push) Waiting to run Details Build and Release / get-info (push) Waiting to run Details Build and Release / windows-sdl (push) Blocked by required conditions Details Build and Release / windows-qt (push) Blocked by required conditions Details Build and Release / macos-sdl (push) Blocked by required conditions Details Build and Release / macos-qt (push) Blocked by required conditions Details Build and Release / linux-sdl (push) Blocked by required conditions Details Build and Release / linux-qt (push) Blocked by required conditions Details Build and Release / linux-sdl-gcc (push) Blocked by required conditions Details Build and Release / linux-qt-gcc (push) Blocked by required conditions Details Build and Release / pre-release (push) Blocked by required conditions Details * Allow vector and scalar offset in buffer address arg to LoadBuffer/StoreBuffer * remove is_ring check * fix atomics and update pattern matching for tess factor stores * remove old asserts about soffset * small fixes * copyright * Handle sgpr initialization for 2 special hull shader values, including tess factor buffer offset	2025-09-03 20:54:23 -07:00
Stephen Miller	c26f56ab02	Handle offsets and format overrides in fetch shaders (#3486 ) Co-authored-by: TheTurtle <47210458+raphaelthegreat@users.noreply.github.com>	2025-08-30 14:20:23 -07:00
squidbus	32244c097c	shader_recompiler: Implement V_ADD_F64 and loading 64-bit float from SGPR. (#3483 )	2025-08-29 18:11:21 -07:00
TheTurtle	93767ae31b	shader_recompiler: Rework sharp tracking for robustness (#3327 ) * shader_recompiler: Remove remnants of old discard Also constant propagate conditional discard if condition is constant * resource_tracking_pass: Rework sharp tracking for robustness * resource_tracking_pass: Add source dominance analysis When reachability is not enough to prune source list, check if a source dominates all other sources * resource_tracking_pass: Fix immediate check How did this work before * resource_tracking_pass: Remove unused template type * readlane_elimination_pass: Don't add phi when all args are the same New sharp tracking exposed some bad sources coming on sampler sharps with aniso disable pattern that also were part of readlane pattern, fix tracking by removing the unnecessary phis inbetween * resource_tracking_pass: Allow phi in disable aniso pattern * resource_tracking_pass: Handle not valid buffer sharp and more phi in aniso pattern	2025-07-28 13:32:16 -07:00
TheTurtle	93b06ba2da	translate: Correct instance id fetch in local shader (#3309 )	2025-07-23 23:27:11 +03:00
TheTurtle	4d3578edbe	shader_recompiler: Fix incorrect bary coord loads when unsupported (#3303 ) * shader_recompiler: Fix incorrect bary coord loads when unsupported Also properly set sample rate shading * emit_spirv: Implement BaryCoordSmoothCentroid	2025-07-23 19:46:06 +03:00
TheTurtle	4407ebdd9b	shader_recompiler: Implement guest barycentrics (#3245 ) * shader_recompiler: Implement guest barycentrics * Review comments and some cleanup	2025-07-15 18:49:12 +03:00
TheTurtle	399a725343	shader_recompiler: Replace buffer pulling with attribute divisor for instance step rates (#3238 ) * shader_recompiler: Replace buffer pulling with attribute divisor for instance step rates * flatten_extended_userdata: Remove special step rate buffer handling * Review comments * spirv_emit_context: Name all instance rate attribs properly * spirv: Merge ReadConstBuffer again template function only has 1 user now * attribute: Add missing attributes * translate: Reimplement step rate instance id * Resolve validation warnings * shader_recompiler: Separate vertex inputs from LS stage, cleanup tess	2025-07-14 00:32:02 +03:00
TheTurtle	8ffcfc87bd	shader_recompiler: Implement linear interpolation support (#3055 )	2025-06-08 22:46:34 +03:00
Marcin Mikołajczyk	790b54bf29	Misc opcodes fixes (#3009 ) Some checks are pending Build and Release / reuse (push) Waiting to run Details Build and Release / clang-format (push) Waiting to run Details Build and Release / get-info (push) Waiting to run Details Build and Release / windows-sdl (push) Blocked by required conditions Details Build and Release / windows-qt (push) Blocked by required conditions Details Build and Release / macos-sdl (push) Blocked by required conditions Details Build and Release / macos-qt (push) Blocked by required conditions Details Build and Release / linux-sdl (push) Blocked by required conditions Details Build and Release / linux-qt (push) Blocked by required conditions Details Build and Release / linux-sdl-gcc (push) Blocked by required conditions Details Build and Release / linux-qt-gcc (push) Blocked by required conditions Details Build and Release / pre-release (push) Blocked by required conditions Details	2025-05-29 18:51:36 -07:00
squidbus	5fd5b62539	shader_recompiler: Few fixes for buffer number conversions. (#2869 ) Some checks are pending Build and Release / reuse (push) Waiting to run Details Build and Release / clang-format (push) Waiting to run Details Build and Release / get-info (push) Waiting to run Details Build and Release / windows-sdl (push) Blocked by required conditions Details Build and Release / windows-qt (push) Blocked by required conditions Details Build and Release / macos-sdl (push) Blocked by required conditions Details Build and Release / macos-qt (push) Blocked by required conditions Details Build and Release / linux-sdl (push) Blocked by required conditions Details Build and Release / linux-qt (push) Blocked by required conditions Details Build and Release / linux-sdl-gcc (push) Blocked by required conditions Details Build and Release / linux-qt-gcc (push) Blocked by required conditions Details Build and Release / pre-release (push) Blocked by required conditions Details * liverpool: Pass correct color buffer number type for conversion mapping. * shader_recompiler: Apply number conversion to vertex inputs.	2025-04-30 20:46:16 -07:00
squidbus	81fa9b7fff	shader_recompiler: Add lowering pass for when 64-bit float is unsupported. (#2858 ) Some checks are pending Build and Release / reuse (push) Waiting to run Details Build and Release / clang-format (push) Waiting to run Details Build and Release / get-info (push) Waiting to run Details Build and Release / windows-sdl (push) Blocked by required conditions Details Build and Release / windows-qt (push) Blocked by required conditions Details Build and Release / macos-sdl (push) Blocked by required conditions Details Build and Release / macos-qt (push) Blocked by required conditions Details Build and Release / linux-sdl (push) Blocked by required conditions Details Build and Release / linux-qt (push) Blocked by required conditions Details Build and Release / linux-sdl-gcc (push) Blocked by required conditions Details Build and Release / linux-qt-gcc (push) Blocked by required conditions Details Build and Release / pre-release (push) Blocked by required conditions Details * shader_recompiler: Add lowering pass for when 64-bit float is unsupported. * shader_recompiler: Fix PackDouble2x32/UnpackDouble2x32 type. * shader_recompiler: Remove extra bit cast implementations.	2025-04-28 00:04:16 -07:00
squidbus	fd3d3c4158	shader_recompiler: Implement AMD buffer bounds checking behavior. (#2448 ) * shader_recompiler: Implement AMD buffer bounds checking behavior. * shader_recompiler: Use SRT flatbuf for bounds check size. * shader_recompiler: Fix buffer atomic bounds check. * buffer_cache: Prevent false image-to-buffer sync. Lowering vertex fetch to formatted buffer surfaced an issue where a CPU modified range may be overwritten with stale GPU modified image data. * Address review comments.	2025-02-17 16:13:39 +02:00
TheTurtle	82cacec8eb	shader_recompiler: Remove special case buffers and add support for aliasing (#2428 ) * shader_recompiler: Move shared mem lowering into emitter * IR can be quite verbose during first stages of translation, before ssa and constant prop passes have run that drastically simplify it. This lowering can also be done during emission so why not do it then to save some compilation time * runtime_info: Pack PsColorBuffer into 8 bytes * Drops the size of the total structure by half from 396 to 204 bytes. Also should make comparison of the array a bit faster, since its a hot path done every draw * emit_spirv_context: Add infrastructure for buffer aliases * Splits out the buffer creation function so it can be reused when defining multiple type aliases * shader_recompiler: Merge srt_flatbuf into buffers list * Its no longer a special case, yay * shader_recompiler: Complete buffer aliasing support * Add a bunch more types into buffers, such as F32 for float reads/writes and 8/16 bit integer types for formatted buffers * shader_recompiler: Remove existing shared memory emulation * The current impl relies on backend side implementaton and hooking into every shared memory access. It also doesnt handle atomics. Will be replaced by an IR pass that solves these issues * shader_recompiler: Reintroduce shared memory on ssbo emulation * Now it is performed with an IR pass, and combined with the previous commit cleanup, is fully transparent from the backend, other than requiring workgroup_index be provided as an attribute (computing this on every shared memory access is gonna be too verbose * clang format * buffer_cache: Reduce buffer sizes * vk_rasterizer: Cleanup resource binding code * Reduce noise in the functions, also remove some arguments which are class members * Fix gcc	2025-02-15 14:06:56 +02:00
squidbus	41d64a200d	shader_recompiler: Add swizzle support for unsupported formats. (#1869 ) Some checks are pending Build and Release / reuse (push) Waiting to run Details Build and Release / clang-format (push) Waiting to run Details Build and Release / get-info (push) Waiting to run Details Build and Release / windows-sdl (push) Blocked by required conditions Details Build and Release / windows-qt (push) Blocked by required conditions Details Build and Release / macos-sdl (push) Blocked by required conditions Details Build and Release / macos-qt (push) Blocked by required conditions Details Build and Release / linux-sdl (push) Blocked by required conditions Details Build and Release / linux-qt (push) Blocked by required conditions Details Build and Release / pre-release (push) Blocked by required conditions Details * shader_recompiler: Add swizzle support for unsupported formats. * renderer_vulkan: Rework MRT swizzles and add unsupported format swizzle support. * shader_recompiler: Clean up swizzle handling and handle ImageRead storage swizzle. * shader_recompiler: Fix type errors * liverpool_to_vk: Remove redundant clear color swizzles. * shader_recompiler: Reduce CompositeConstruct to constants where possible. * shader_recompiler: Fix ImageRead/Write and StoreBufferFormatF32 types. * amdgpu: Add a few more unsupported format remaps.	2024-12-31 06:14:47 +02:00
baggins183	62780e4e43	Initialize V0 to PrimitiveId in hull shader (#1985 )	2024-12-31 06:00:52 +02:00
Marcin Mikołajczyk	b1b4c8c487	Handle setting Vcc in Translator::SetDst64 (#1826 )	2024-12-18 21:57:58 +02:00
baggins183	3c0c921ef5	Tessellation (#1528 ) * shader_recompiler: Tessellation WIP * fix compiler errors after merge DONT MERGE set log file to /dev/null DONT MERGE linux pthread bb fix save work DONT MERGE dump ir save more work fix mistake with ES shader skip list add input patch control points dynamic state random stuff * WIP Tessellation partial implementation. Squash commits * test: make local/tcs use attr arrays * attr arrays in TCS/TES * dont define empty attr arrays * switch to special opcodes for tess tcs/tes reads and tcs writes * impl tcs/tes read attr insts * rebase fix * save some work * save work probably broken and slow * put Vertex LogicalStage after TCS and TES to fix bindings * more refactors * refactor pattern matching and optimize modulos (disabled) * enable modulo opt * copyright * rebase fixes * remove some prints * remove some stuff * Add TCS/TES support for shader patching and use LogicalStage * refactor and handle wider DS instructions * get rid of GetAttributes for special tess constants reads. Immediately replace some upon seeing readconstbuffer. Gets rid of some extra passes over IR * stop relying on GNMX HsConstants struct. Change runtime_info.hs_info and some regs * delete some more stuff * update comments for current implementation * some cleanup * uint error * more cleanup * remove patch control points dynamic state (because runtime_info already depends on it) * fix potential problem with determining passthrough --------- Co-authored-by: IndecisiveTurtle <47210458+raphaelthegreat@users.noreply.github.com>	2024-12-14 12:56:17 +02:00
¥IGA	2266622dcf	Support for Vulkan 1.4 (#1665 )	2024-12-07 19:41:41 +02:00
Vladislav Mikhalin	8eacb88a86	recompiler: fixed fragment shader built-in attribute access (#1676 ) Some checks are pending Build and Release / reuse (push) Waiting to run Details Build and Release / clang-format (push) Waiting to run Details Build and Release / get-info (push) Waiting to run Details Build and Release / windows-sdl (push) Blocked by required conditions Details Build and Release / windows-qt (push) Blocked by required conditions Details Build and Release / macos-sdl (push) Blocked by required conditions Details Build and Release / macos-qt (push) Blocked by required conditions Details Build and Release / linux-sdl (push) Blocked by required conditions Details Build and Release / linux-qt (push) Blocked by required conditions Details Build and Release / pre-release (push) Blocked by required conditions Details * recompiler: fixed fragment shader built-in attribute access * handle en/addr separately * handle other registers as well	2024-12-07 01:20:09 +02:00
squidbus	920acb8d8b	renderer_vulkan: Parse fetch shader per-pipeline (#1656 ) Some checks are pending Build and Release / reuse (push) Waiting to run Details Build and Release / clang-format (push) Waiting to run Details Build and Release / get-info (push) Waiting to run Details Build and Release / windows-sdl (push) Blocked by required conditions Details Build and Release / windows-qt (push) Blocked by required conditions Details Build and Release / macos-sdl (push) Blocked by required conditions Details Build and Release / macos-qt (push) Blocked by required conditions Details Build and Release / linux-sdl (push) Blocked by required conditions Details Build and Release / linux-qt (push) Blocked by required conditions Details Build and Release / pre-release (push) Blocked by required conditions Details * shader_recompiler: Read image format info directly from sharps instead of storing in shader info. * renderer_vulkan: Parse fetch shader per-pipeline * Few minor fixes. * shader_recompiler: Specialize on vertex attribute number types. * shader_recompiler: Move GetDrawOffsets to fetch shader	2024-12-04 13:03:47 +02:00
Daniel R.	17c47bcd96	shader_recompiler/frontend: Implement `bitcmp` instructions (#1550 ) Some checks are pending Build and Release / reuse (push) Waiting to run Details Build and Release / clang-format (push) Waiting to run Details Build and Release / get-info (push) Waiting to run Details Build and Release / windows-sdl (push) Blocked by required conditions Details Build and Release / windows-qt (push) Blocked by required conditions Details Build and Release / macos-sdl (push) Blocked by required conditions Details Build and Release / macos-qt (push) Blocked by required conditions Details Build and Release / linux-sdl (push) Blocked by required conditions Details Build and Release / linux-qt (push) Blocked by required conditions Details Build and Release / pre-release (push) Blocked by required conditions Details	2024-11-19 21:38:32 +01:00
baggins183	9ec75c3feb	Implement shader resource tables (#1165 ) * Implement shader resource tables * fix after rebase + squash * address some review comments * fix pipeline_common * cleanup debug stuff * switch to using single codegenerator	2024-11-01 08:55:53 +02:00
psucien	927bb0c175	Initial support of Geometry shaders (#1244 ) * video_core: initial GS support * fix for components mapping; missing prim type	2024-10-06 01:26:50 +03:00
TheTurtle	ee38eec7fe	shader_recompiler: Additional scope handling and user data as push constants (#1013 ) * shader_recompiler: Use push constants for user data regs * shader: Add some GR2 instructions * shader: Add some instructions * shader: Add instructions for knack * touchups * spirv: Better names * buffer_cache: Ignore non gpu modified images * clang format * Add log * more fixes	2024-09-23 08:55:43 +02:00
squidbus	a18419dd73	shader_recompiler: Exclude non-float results from output modifiers. (#1016 )	2024-09-22 15:03:17 +03:00
Daniel R.	dcf245b814	shader_recompiler: Implement basic 64-bit floating point support (#915 ) * shader_recompiler: Implement basic 64-bit floating point support * Fix formatting	2024-09-15 22:53:08 +02:00
TheTurtle	b0bbb16aae	video_core: Add fallback path for pipelines with more than 32 bindings (#837 ) * video_core: Small fixes * renderer_vulkan: Add fallback path for pipelines with more than 32 bindings * vk_resource_pool: Rewrite desc heap * work	2024-09-10 20:54:39 +03:00
baggins183	bb29224daf	Implement V_MOVREL variants (#745 ) * shader_recompiler: Implement V_MOVRELS_B32, V_MOVRELD_B32, V_MOVRELSD_B32 Generates a ton of OpSelects to hardcode reading or writing from each possible vgpr depending on the value of m0 Future work is to do range analysis to put an upper bound on m0 and check fewer registers. * fix runtime info after rebase	2024-09-06 23:47:47 +03:00
TheTurtle	f087f43736	shader_recompiler: Implement render target swizzles when no format is available (#739 ) * shader_recompiler: Use null image when shader is compiled with unbound sharp * video_core: Refactor and render target swizzles * liverpool_to_vk: Add missing swap format from RDR * video_core: Refactor shader recompiler interface * Makes it much easier to pass runtime information to the recompiler and have it treated as part of the shader key. Also pulls out most runtime state from Info struct * shader_recompiler: Avoid some asserts	2024-09-03 14:04:30 +03:00
IndecisiveTurtle	6fbbe3d79b	translator: Add missed flow instruction	2024-08-30 00:26:01 +03:00
TheTurtle	66e96dd944	video_core: Account of runtime state changes when compiling shaders (#575 ) * video_core: Compile shader permutations * spirv: Only specific storage image format for atomics * ir: Avoid cube coord patching for storage image * spirv: Fix default attributes * data_share: Add more instructions * video_core: Query storage flag with runtime state * kernel: Use std::list for semaphore * video_core: Use texture buffers for untyped format load/store * buffer_cache: Limit view usage * vk_pipeline_cache: Fix invalid iterator * image_view: Reduce log spam when alpha=1 in storage swizzle * video_core: More features and proper spirv feature detection * video_core: Attempt no2 for specialization * spirv: Remove conflict * vk_shader_cache: Small cleanup	2024-08-29 19:29:54 +03:00
Vinicius Rangel	9e4fc17e6c	shader_recompiler: handle fetch shader address offsets (#538 ) * shader_recompiler: handle fetch shader address offsets parse index & offset sgpr from fetch shader and propagate them to vkBindVertexBuffers * shader_recompiler: fix fetch_shader when offset is not present * video_core: propagate index/offset SGPRs to vkCmdDraw instead of offsetting the buffer address * video_core: add vertex_offset to non-indexed draw calls renamed fetch offset fields	2024-08-24 17:36:40 +02:00
TheTurtle	1d1c88ad31	control_flow_graph: Initial divergence handling (#434 ) * control_flow_graph: Initial divergence handling * cfg: Handle additional case * spirv: Handle tgid enable bits * clang format * spirv: Use proper format * translator: Add more instructions	2024-08-16 20:05:37 +03:00
TheTurtle	d8b9d82ffa	video_core: Various fixes (#423 ) * video_core: Various fixes * clang format	2024-08-13 20:05:10 +03:00

1 2

98 Commits