NVIDIA Tesla P4 Server Graphics Card

BH #NITP48GBGC • MFR #900-2G414-0000-000
NVIDIA Tesla P4 Server Graphics Card
Key Features
  • Pascal Architecture
  • 8GB of vRAM
  • 50/75W Max. Power
  • 5.5 TFLOPS Single-Precision Performance
Show More
Powered by Pascal architecture, the Tesla P4 from NVIDIA is a small-factor, 50/75W graphics card designed to boost the efficiency of scale-out servers running deep learning workloads, enabling smart responsive AI-based services. It reduces inference latency by up to 15x in hyperscale infrastructures and boosts energy efficiency. The hardware-decode engine is capable of transcoding and inferencing 35 HD video streams in real time. Additionally, the P4 uses a passive cooler for increased reliability and reduced power consumption.
No Longer Available
Boruch Berman, B&H Expert

True Know-How

Ask Our Experts

800.606.6969

NVIDIA Tesla P4 Overview

  • 1Description
  • 2
  • 3
  • 4
  • 5
  • 6Responsive Experience with Real-Time Inference
  • 7Efficiency for Low-Power Scale-Out Servers
  • 8Unlock AI-Based Video Services with a Dedicated Decode Engine
  • 9Faster Deployment with TensorRT and Deepstream SDK

Powered by Pascal architecture, the Tesla P4 from NVIDIA is a small-factor, 50/75W graphics card designed to boost the efficiency of scale-out servers running deep learning workloads, enabling smart responsive AI-based services. It reduces inference latency by up to 15x in hyperscale infrastructures and boosts energy efficiency. The hardware-decode engine is capable of transcoding and inferencing 35 HD video streams in real time. Additionally, the P4 uses a passive cooler for increased reliability and reduced power consumption.

Low-profile, plug-in card form factor
Enhanced programmability with page migration engine
Server-optimized for data center deployment
ECC protection

Responsive Experience with Real-Time Inference

The Tesla P4 delivers 22 TOPs of inference performance with INT8 operations to slash latency by 15x.

Efficiency for Low-Power Scale-Out Servers

The Tesla P4's small form factor and 50/75W power footprint design accelerates density-optimized, scale-out servers. It also provides 60x better energy efficiency than CPUs for deep learning inference workloads, letting customers meet the growth in demand for AI applications.

Unlock AI-Based Video Services with a Dedicated Decode Engine

Tesla P4 can transcode and infer up to 35 HD video streams in real time, powered by a dedicated hardware-accelerated decode engine that works in parallel with the GPU doing inference.

Faster Deployment with TensorRT and Deepstream SDK

TensorRT is a library created for optimizing deep learning models for production deployment. It takes trained neural nets—usually in 32- or 16-bit data—and optimizes them for reduced precision INT8 operations. NVIDIA DeepStream SDK taps into the power of Pascal GPUs to simultaneously decode and analyze video streams.

NVIDIA Tesla P4 Specs

vRAM8 GB GDDR5
Max Power50/75 W
Single Precision5.5 TeraFLOPS
Processor Core2560 CUDA
OperationsINT8: 22 TOPS
Memory Bandwidth192 GB/s
Hardware Acceleration1 x Decode Engine
2 x Encode Engines
System InterfaceLow-Profile PCI Express Form Factor
ECCYes
Dimensions6.6 x 2.1" / 16.8 x 5.3 cm
Weight8.46 oz / 239.84 g
Packaging Info
Package Weight0.6 lb
Box Dimensions (LxWxH)8 x 4 x 1"
See any errors on this page? Let us know

YOUR RECENTLY VIEWED ITEMS

Browsing History

Close

Close

Close