CS4290/CS6290 High Performance Computer Architecture

Graphics processing is an application area with a high level of parallelism at both the data level and the task level. Therefore, graphics processing units (GPUs) are often implemented as multiprocessing systems with high-performance floating-point processing and application-specific hardware stages for maximizing graphics throughput. A transport triggered architecture (TTA) improves scalability over traditional VLIW-style architectures, making it interesting for computationally intensive applications. We show that TTA provides high floating-point processing performance while allowing more programming freedom than vector processors. Finally, one of the main features of the presented TTA-based GPU design is its fully programmable architecture, making it a suitable target for the general-purpose GPU computing APIs that have become popular in recent years.

NVIDIA Tesla: A Unified Graphics and Computing Architecture

Unlike OpenMP and MPI, CUDA implements parallelism by exporting the parallel portions of a program to a graphics processing unit for execution, where hundreds of threads and processors divide and conquer the problem. As computing technology increased in power and cost-efficiency, the demand for high-quality computer graphics skyrocketed, especially in the field of computer games. Thus, the graphics processing unit, or GPU, was born. It was originally meant to do intense graphics work in parallel, such as rendering pixels on a screen. Programmers soon tried to use the parallel computing power of the GPU for other work. Unfortunately, this was a difficult process: programmers needed to learn graphics APIs and the particular architecture of each GPU before they could even begin. In addition, most graphics processing units at this time had no support for double-precision floating-point numbers or for random reads and writes to memory.
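To make the "many threads divide the problem" idea concrete, here is a minimal CPU-only sketch in Python that imitates how a CUDA-style kernel assigns one array element to each logical thread. It is not CUDA code and runs sequentially; the names `vector_add_kernel`, `launch`, and `vector_add` are hypothetical and chosen only for illustration. It assumes both input lists have the same length.

```python
def vector_add_kernel(block_idx, thread_idx, block_dim, a, b, out):
    """Body executed by one logical thread (hypothetical stand-in for a GPU kernel)."""
    # Global element index, mirroring the usual block index * block size + thread index.
    i = block_idx * block_dim + thread_idx
    # Guard: the last block may contain more threads than remaining elements.
    if i < len(out):
        out[i] = a[i] + b[i]


def launch(kernel, grid_dim, block_dim, *args):
    """Sequential stand-in for a kernel launch.

    On a real GPU the iterations of these loops would run as parallel threads;
    here they run one after another so the sketch works on any CPU.
    """
    for block_idx in range(grid_dim):
        for thread_idx in range(block_dim):
            kernel(block_idx, thread_idx, block_dim, *args)


def vector_add(a, b, block_dim=256):
    """Add two equal-length lists by giving each element to one logical thread."""
    n = len(a)
    out = [0.0] * n
    # Round up so that every element is covered by some block.
    grid_dim = (n + block_dim - 1) // block_dim
    launch(vector_add_kernel, grid_dim, block_dim, a, b, out)
    return out
```

The key point is that the kernel contains no loop over the data: each logical thread computes its own index and handles one element, and the launch configuration decides how many threads exist.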

CUDA used to be an acronym that stood for Compute Unified Device Architecture, but Nvidia, its creator, rightly decided that such a definition was silly and stopped using it.


Platforms: CUDA on GPUs

Tesla is the codename for a GPU microarchitecture developed by Nvidia, and released in , as the successor to the Curie microarchitecture. It was named after the pioneering electrical engineer Nikola Tesla. Tesla replaced the old fixed-pipeline microarchitectures, represented at the time of introduction by the GeForce 7 series, and was followed by Fermi. Tesla is Nvidia's first microarchitecture implementing the unified shader model. The driver supports Direct3D 10 Shader Model 4. The design is a major shift for Nvidia in GPU functionality and capability. The most obvious change is the move from the separate functional units (pixel shaders, vertex shaders) of previous GPUs to a homogeneous collection of universal floating-point processors, called "stream processors", that can perform a more universal set of tasks.
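One way to see why a homogeneous pool of processors can be attractive is a toy load-balancing model, sketched below in Python. It is an illustration under strong assumptions, not a description of how Tesla schedules work: every unit of work is assumed to take one time step on any processor, work is assumed to be perfectly divisible, and the function names and workload numbers are hypothetical.

```python
import math


def frame_time_fixed(vertex_work, pixel_work, vertex_units, pixel_units):
    """Time steps when vertex and pixel work run on separate, dedicated units.

    The frame finishes only when the slower of the two groups finishes.
    """
    return max(
        math.ceil(vertex_work / vertex_units),
        math.ceil(pixel_work / pixel_units),
    )


def frame_time_unified(vertex_work, pixel_work, total_units):
    """Time steps when any unit can run either kind of work (unified model)."""
    return math.ceil((vertex_work + pixel_work) / total_units)


# Hypothetical split: 8 dedicated vertex units and 24 dedicated pixel units,
# compared with 32 unified processors.
pixel_heavy_fixed = frame_time_fixed(100, 900, 8, 24)
pixel_heavy_unified = frame_time_unified(100, 900, 32)
vertex_heavy_fixed = frame_time_fixed(900, 100, 8, 24)
vertex_heavy_unified = frame_time_unified(900, 100, 32)
```

In this model the fixed split is only efficient when the workload mix happens to match the hardware ratio, while the unified pool takes the same time for either mix because idle processors can pick up whichever kind of work remains.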

To enable flexible, programmable graphics and high-performance computing, NVIDIA has developed the Tesla scalable unified graphics and parallel computing architecture.

Tesla (microarchitecture)

