The TMH consists of the pending queue and the scoreboard, as
shown in Figure 2. The pending queue holds the tag-missed memory
requests. Each scoreboard entry has a target tag, a counter, and an
arrival bit. The counter stores the number of memory requests in
the pending queue whose tag match the target tag. The arrival bit
indicates whether the data of the target tag are loaded on GPU
memory. By storing recent tags, the TMH exploits the scoreboard
to avoid a burst of data fetch requests to the host.