aco/gfx11: optimize dual source export
[platform/upstream/mesa.git] / src / amd / compiler / aco_lower_to_hw_instr.cpp
2023-10-05 Georg Lehmann aco/gfx11: optimize dual source export
2023-10-04 Rhys Perry aco: shrink DPP8_instruction
2023-09-27 Georg Lehmann aco: fix p_extract with v1 dst and s1 operand
2023-09-14 Daniel Schürmann aco: make p_wqm a marker instruction without Operands...
2023-08-23 Rhys Perry aco: add adjust_bpermute_dst helper
2023-08-23 Rhys Perry aco: clarify bpermute pseudo opcode names
2023-08-23 Rhys Perry aco: fix p_bpermute_gfx6's exec save/restore with wave32
2023-08-23 Rhys Perry aco: fix p_bpermute_gfx6 with input at non-zero byte
2023-08-09 Georg Lehmann aco: use s_bitreplicate_b64_b32 to set exec to 0xffff00...
2023-06-26 Vitaliy Triang3l... aco: Add Primitive Ordered Pixel Shading waitcnt rules
2023-06-26 Vitaliy Triang3l... aco: Send MSG_ORDERED_PS_DONE where necessary
2023-06-26 Vitaliy Triang3l... aco: Add Primitive Ordered Pixel Shading pseudo-instruc...
2023-06-23 Timur Kristóf aco: Use ac_hw_stage instead of aco-specific HWStage.
2023-06-21 Rhys Perry aco: don't set exec_hi for wave32 scan reductions
2023-06-20 Georg Lehmann aco/gfx10+: use v_cndmask with literal for reduction...
2023-06-16 Eric Engestrom aco: reformat according to its .clang-format
2023-06-08 Daniel Schürmann aco: implement nir_intrinsic_load_resume_shader_address_amd
2023-05-25 Rhys Perry aco: add MIMG_instruction::strict_wqm
2023-05-25 Rhys Perry aco: let p_start_linear_vgpr take an operand
2023-04-28 Qiang Yu aco: get scratch addr from symbol for radeonsi
2023-04-13 Harri Nieminen amd: fix typos
2023-04-10 Timur Kristóf aco: Don't use nir_selection_control in aco_ir.
2023-04-03 Timur Kristóf aco: Consider p_cbranch_nz as divergent branch too.
2023-03-30 Georg Lehmann aco/to_hw_instr: use VOP1 opsel for v_mov_b16
2023-03-16 Daniel Schürmann aco: remove aco::rt_stack variable
2023-03-16 Daniel Schürmann aco: create hw_init_scratch() function for p_init_scrat...
2023-03-08 Daniel Schürmann radv/rt: use terminate() when returning from raygen...
2023-03-07 Georg Lehmann aco: remove VOP[123C]P? structs
2023-03-07 Georg Lehmann aco: treat VINTERP_INREG as VALU
2023-02-16 Daniel Schürmann radv: unconditionally enable scratch for RT shaders
2023-02-08 Rhys Perry aco: don't modify exec in p_interp_gfx11
2023-02-01 Georg Lehmann aco: use s_pack_ll_b32_b16 for constant copies
2023-02-01 Georg Lehmann aco: use s_bfm_64 for constant copies
2023-01-10 Rhys Perry aco: allow Builder::Result to be dereferenced
2023-01-10 Rhys Perry aco/gfx11: fix discard early exit removal optimization
2023-01-03 Georg Lehmann aco: Use v_mov_b16 on GFX11.
2022-12-16 Rhys Perry aco/gfx11: export mrtz in discard early exit for non...
2022-12-14 Timur Kristóf aco: Emulate Wave64 bpermute on GFX11.
2022-12-14 Timur Kristóf aco: Stylistic changes to emit_gfx10_wave64_bpermute.
2022-12-14 Timur Kristóf aco: Split opcodes for GFX6 and GFX10 emulated bpermute.
2022-12-02 Bas Nieuwenhuizen aco: Don't use v_lshrrev_b64 for moves on GFX11.
2022-12-01 Rhys Perry aco: improve do_pack_2x16() with zero constants
2022-12-01 Rhys Perry aco/gfx11: use v_cvt_i32_i16/v_cvt_u32_u16
2022-11-22 Samuel Pitoiset aco: fix emitting DEALLOC_VGPRS in the discard block
2022-11-16 Samuel Pitoiset aco: add p_dual_src_export_gfx11 for dual source blendi...
2022-11-15 Daniel Schürmann aco: move statistics enum to aco_shader_info.h
2022-11-15 Samuel Pitoiset aco: fix p_interp_gfx11 to not overwrite SCC
2022-11-01 Georg Lehmann aco: Use s_pack_ll_b32_b16 for scalar zero extend.
2022-11-01 Rhys Perry aco/gfx11: fix FS input loads in quad-divergent control...
2022-11-01 Rhys Perry aco: fix typo in branch lowering
2022-10-24 Rhys Perry aco: swap v_perm_b32 operands
2022-10-11 Timur Kristóf aco: Allow explicitly removing jumps on GFX10+ when...
2022-09-30 Rhys Perry aco/gfx11: deallocate VGPRs at the end of the shader
2022-09-26 Rhys Perry aco: add VINTERP instruction format
2022-09-26 Rhys Perry aco: add LDSDIR instruction format
2022-09-01 Timur Kristóf aco: Fix p_init_scratch for task shaders.
2022-08-30 Daniel Schürmann aco: use std::vector::reserve() more often
2022-08-25 Rhys Perry aco: fix long-jump version of discard early exit
2022-07-18 Samuel Pitoiset aco: add new pseudo instruction p_jump_to_epilog
2022-07-08 Rhys Perry aco: initialize scratch base registers on GFX9-GFX10.3
2022-07-05 Rhys Perry aco: don't use 32-bit fp inline constants for fp16...
2022-07-05 Rhys Perry aco: fix single-alignbyte do_pack_2x16() path with...
2022-05-31 Rhys Perry aco: use v_perm_b32 to copy 0xff00/0x00ff/0xff/0x00
2022-05-31 Rhys Perry aco: disable sdwa on gfx11
2022-05-31 Rhys Perry aco: clarify a portion of do_pack_2x16
2022-05-23 Rhys Perry aco: only add/subtract low bits of program addresses
2022-05-23 Rhys Perry aco: fix p_constaddr with a non-zero offset
2022-05-13 Marek Olšák amd: change chip_class naming to "enum amd_gfx_level...
2022-05-12 Samuel Pitoiset aco: export MRT0 instead of NULL on GFX11
2022-03-04 Rhys Perry aco: rework removal of jumps over branches
2022-02-25 Daniel Schürmann aco: relax condition to remove branches in case of...
2022-02-08 Daniel Schürmann aco: remove block_kind_discard
2021-09-28 Timur Kristóf aco: Allow p_extract to have different definition and...
2021-09-17 Rhys Perry aco: implement linear vgpr copies
2021-09-17 Rhys Perry aco: add and use RegClass::resize helper
2021-09-03 Rhys Perry aco: don't coalesce constant copies into non-power...
2021-09-02 Daniel Schürmann aco: preserve subdword RC when lowering p_insert/p_extract
2021-09-02 Daniel Schürmann aco: remove explicit dst_preserve flag
2021-09-02 Daniel Schürmann aco: rewrite SDWA selector
2021-09-02 Daniel Schürmann aco: fix p_insert lowering with 16bit sources
2021-07-13 Tony Wasserka aco: Remove use of deprecated Operand constructors
2021-07-13 Daniel Schürmann aco: fix self-intersecting register swaps
2021-07-12 Daniel Schürmann aco: Format.
2021-07-12 Daniel Schürmann aco: add 'common/' and 'llvm/' prefix to #includes
2021-07-12 Daniel Schürmann aco: add missing Licenses and remove Authors from files
2021-07-12 Daniel Schürmann aco: reorder and cleanup #includes
2021-06-08 Rhys Perry aco: add p_extract/p_insert
2021-04-15 Michel Dänzer Convert most remaining free-form fall-through comments...
2021-04-13 Daniel Schürmann aco: lower p_spill with constants correctly
2021-03-11 Rhys Perry aco: lower p_constaddr into separate instructions earlier
2021-01-22 Rhys Perry aco: return references in instruction cast methods
2021-01-22 Rhys Perry aco: use format-check methods
2021-01-22 Rhys Perry aco: use instruction cast methods
2021-01-22 Rhys Perry aco: remove Format::{VOP3A,VOP3B}
2021-01-21 Daniel Schürmann aco: generalize subdword constant copy lowering
2021-01-13 Daniel Schürmann aco: remove divergent branches which only jump over...
2020-12-04 Rhys Perry aco: coalesce constant copies
2020-12-04 Rhys Perry aco: use v_lshrrev_b64 for 64-bit VGPR copies on GFX10+
2020-12-04 Rhys Perry aco: add get_const/is_constant_representable helpers
2020-12-04 Rhys Perry aco: don't assume src=lower when splitting self-interse...