Skip to content

Performance

Vitek edited this page Dec 1, 2016 · 16 revisions

UltraGrid Performance

This site contains the results of performance tests of UltraGrid.

End-to-End Latency

The following table compares performance of individual cards. The results are measured as end-to-end frame-delay, i.e. the number of frames sent before the reciever outputs the original frame.

Linux

Setup:

  • Testing machine: hd2
  • Reference machine: hd4 with BlackMagic Decklink HD Extreme
  • For the test, we used 1080i HD video at 29.97Fps.
Card send recieve
DVS Centaurus II 5 3
BlackMagic DeckLink HD Extreme 3.75 3.75
BlackMagic DeckLink 4K Extreme 3.5 3
BlackMagic Decklink Quad 4.5 4
BlackMagic Decklink Intensity PRO 4.5 4
BlackMagic Decklink Intensity 4.5 4.5
Deltacast 3G 4.5 3
OpenGL - 2.5

MacOS X

Setup:

  • Testing machine: hd7
  • Reference machine: hd4 with BlackMagic Decklink HD Extreme
  • For the test, we used 1080i HD video at 29.97Fps.
Card send recieve
AJA Kona 3G 4 3.5
DeckLink HD Pro (Quicktime) 4.5 5.5
DeckLink HD Pro (native API) 4.5 4
OpenGL (with VSync) - 2.25
OpenGL (without VSync) - 1.75

Windows

Setup:

  • Testing machine: hd7
  • Reference machine: hd4 with BlackMagic Decklink HD Extreme
  • For the test, we used 1080i HD video at 29.97Fps.
Card send recieve
BlackMagic DeckLink HD Extreme 4.5 3.5
BlackMagic Decklink Quad 4.5 4
BlackMagic Decklink Intensity PRO 4.5 3.5
BlackMagic Decklink Intensity 4.5 4
Deltacast 3G 4 3

Compression

Performance

Here you can see the performance of individual compression modules (encoding). For the measurement purposes, we used machine hd2 running Ubuntu as receiver and hd7 as sender. We used Decklink HD Extreme as grabbing cards on both sides. We used a 4k video with increasing framerate to pin down the biggest achievable that still offers fluent video experience.

UYVY, 8bit

module

2k

4k

Content

HW

version

RTDXT:DXT1

150 Fps

52 Fps

RTDXT:DXT5_YCoCg

110 Fps

38 Fps

cuda_DXT

280 Fps

80 Fps

libavcodec:codec=H.264

50 Fps

16 Fps

libavcodec:codec=H.264

43 Fps

New Zeland 4K

Intel i7-4960X

libavcodec:codec=H.264

32 Fps

New Zeland 4K

Intel i7-980X

libavcodec:codec=VP8

28 Fps

New Zeland 4K

Intel i7-4960X

libavcodec:codec=MJPEG

65 Fps

New Zeland 4K

Intel i7-4960X

JPEG:90

170 Fps

45 Fps

JPEG:90

150 Fps

NZ (frame 2500)

kypowall0 (i7-4770S, NV GTX 980, 4x8G DDR3@1600)

UG v1.3-140-g08dba83

JPEG:90

137 Fps

NZ (frame 2500)

hd12 (i7-4960X, 32 GB 1866 MHz DDR3, NV GTX 960)

UG v1.3-140-g08dba83

JPEG:90

157 Fps

NZ (frame 2500)

i7-5960X, DDR4@2166, NV GTX 960

UG v1.3-140-g08dba83

JPEG:90

201 Fps

NZ (frame 2500)

i7-5960X, DDR4@2166, GeForce GTX Titan Black

UG v1.3-140-g08dba83

JPEG:90:8

207 Fps

NZ (frame 2500)

i7-5960X, DDR4@2166, GeForce GTX Titan Black

UG v1.3-140-g08dba83

JPEG:90:16

200 Fps

NZ (frame 2500)

i7-5960X, DDR4@2166, GeForce GTX Titan Black

UG v1.3-140-g08dba83

JPEG:90

178 + 146 Fps

NZ (frame 2500)

i7-5960X, DDR4@2166, GeForce GTX Titan Black + NV 960

UG v1.3-140-g08dba83

JPEG:90

92 Fps

NZ (frame 2500)

bunny(2x Xeon E5-2660 v2, [email protected], GeForce GTX Titan)

UG v1.3-140-g08dba83

JPEG:90

96 + 96 Fps

NZ (frame 2500)

bunny(2x Xeon E5-2660 v2, [email protected], 2 x GeForce GTX Titan)
(utilization of both cards around 50%,
the same in previous case)

UG v1.3-140-g08dba83

JPEG:90

130 Fps

NZ (frame 2500)

hdd1(i7-4930K, 780Ti,2x4GB DDR3@1333)

UG v1.3-140-g08dba83

Latency

We also measured the latency added by the compression modules. For the tests 1080p@30fps was used, compressing at [[hd7|UltraGrid_Testbed#hd7]]running Ubuntu and decompression done by hd2

module end-to-end latency
uncompressed 3.75
cuda_dxt 3.75 (+0)
RTDXT:DXT1 6 (+2)
RTDXT:DXT5 5.5 (+1.75)
JPEG:90:0 4 (+0.25)
JPEG:97:0 4 (+0.25)
H.264 5 (+1.25)

Bandwidth

Here you can see the measured bandwidth including overhead with 9000B Ethernet frames. Uncompressed signal was 8-bit YUV422.

module 1080i@30 2k@30 4k (4096 × 2160)@25fps
uncompressed 980 Mbps 1504 Mbps 3489 Mbps
DXT1 245 Mbps 376 Mbps 870 Mbps
DXT5 YCoCg 489 Mbps 752 Mbps - Mbps
JPEG:90 80 Mbps 85 Mbps 160 Mbps
H.264 22 Mbps 22 Mbps 60 Mbps
Clone this wiki locally