Broadcast a register element selected by t_index across lane boundaries.
For single-lane registers, the function behaves identical to broadcast.
- Template Parameters
-
| t_index | Index of the element that should be broadcasted |
| T_RegisterType | The register type |
- Parameters
-
| [in] | src | The source register |
- Returns
- New register with broadcasted values
- Todo:
- : check if using permute_across lanes is faster in the last branch