VOOZH about

URL: https://github.com/topics/instruction-level-parallelism

⇱ instruction-level-parallelism · GitHub Topics · GitHub


Skip to content
#

instruction-level-parallelism

Here are 12 public repositories matching this topic...

IAA03_fast_math is a single-header math kernel(Atan2 only for now) designed to eliminate the "Trigonometry Tax" in high-throughput systems (Physics Engines, Audio DSP, and ML Pre-processing). Branchless ,ILP and SIMD (AVX2/SSE4.1), it achieves up to a ~186x per-element throughput speedup over std::atan2 while being IEEE 754 compliant

  • Updated
  • C++

Improve this page

Add a description, image, and links to the instruction-level-parallelism topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the instruction-level-parallelism topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.