Micro-Optimization

The Computer Architecture of AI (in 2024)

Over the last year, as a person with a hardware background, I have heard a lot of complaints about Nvidia’s dominance of the machine learning market, along with questions about whether I could build chips to make the situation better. While I expect it would take less than $7 trillion, hardware acceleration for this wave of AI is a much tougher problem than the last wave focused on CNNs, and there is a good reason that Nvidia has become the leader in this field with few competitors. Whereas CNN inference was largely a math problem, large language model inference has become a computer architecture problem: figuring out how to coordinate memory, I/O, and compute to get the best performance out of the system.

Introduction to Micro-Optimization

A modern CPU is an incredible machine. It can execute many instructions at the same time, it can re-order instructions so that memory accesses and dependency chains don’t impact performance too much, it renames a small set of architectural registers onto hundreds of physical ones, and it has huge areas of silicon devoted to predicting which branches your code will take. However, if you have a tight loop and you are interested in optimizing the hell out of it, the same mechanisms that make your code run fast can make your job very difficult. They add a lot of complexity that can make it hard to figure out how to optimize a function, and they can also create local optima that trap you in a less efficient solution.
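
To make the dependency-chain point concrete, here is a small sketch (an illustrative example I am adding, not code from the original): summing an array the obvious way serializes every addition through a single accumulator, while splitting the work across a few independent accumulators lets the out-of-order machinery keep several additions in flight at once.

    #include <stddef.h>

    /* Naive sum: each addition depends on the previous one through `sum`,
     * so the loop is limited by the latency of a single chain of adds,
     * no matter how wide the core is. */
    double sum_naive(const double *x, size_t n) {
        double sum = 0.0;
        for (size_t i = 0; i < n; i++) {
            sum += x[i];
        }
        return sum;
    }

    /* Same work with four independent accumulators: the chain is broken,
     * so several floating-point adds can execute in parallel.
     * (This reassociates the additions, so results can differ slightly
     * in the last bits for floating-point data.) */
    double sum_unrolled(const double *x, size_t n) {
        double s0 = 0.0, s1 = 0.0, s2 = 0.0, s3 = 0.0;
        size_t i = 0;
        for (; i + 4 <= n; i += 4) {
            s0 += x[i];
            s1 += x[i + 1];
            s2 += x[i + 2];
            s3 += x[i + 3];
        }
        for (; i < n; i++) {
            s0 += x[i];   /* remaining elements */
        }
        return (s0 + s1) + (s2 + s3);
    }

Which version wins, and by how much, depends on the exact core, and the compiler may already perform this transformation for you, which is part of why reasoning about these optimizations by hand is so tricky.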