This project explores Thread-Level Parallelism (TLP) and its application in shared-memory multiprocessor systems using the gem5 simulator. The goal is to understand how different architectural ...
Abstract: Exploiting speculative thread-level parallelism across modules, e.g., methods, procedures, or functions, has shown promise. However, misspeculations and task creation overhead are known to ...
Abstract: High-performance fault simulation is one of the essential and preliminary tasks in the process of online and offline testing of machine learning (ML) hardware. Deep neural networks (DNN), as ...