1.4.1. RTI Routing Service

The following tests have been performed by executing the RTI Perftest C++98 benchmark application between two nodes and RTI Routing Service in a third node, all of them connected to a switch via Ethernet. The communication has been restricted to a single interface, and the transport has been set to UDPv4 for the three applications.

In this test, communication between the RTI Perftest Publisher and Subscriber is not direct, all traffic is routed with Routing Service. The Publisher application sends the samples, which are received and routed using Routing Service to the Subscriber application. If the sample needs to be returned to the Publisher application, the Subscriber sends it back, again through Routing Service. The latency measured is half Round Trip Time (RTT).

Find information about the hardware, network, and command-line parameters after each of the tests.

1.4.1.1. UDPv4

The graph below shows the one-way latency without load between a Publisher, a Routing Service, and a Subscriber running in three Linux nodes in a 10Gbps network. The numbers are for best-effort as well as strict reliable reliability scenarios.

Note

We use the median (50th percentile) instead of the average in order to get a more stable measurement that does not account for spurious outliers. We also calculate the average value and other percentile values, which can be seen in the Detailed Statistics section below.

Detailed Statistics

The following tables contain the raw numbers presented by RTI Perftest. These numbers are the exact output with no further processing.

  • Best Effort

Sample Size (Bytes)

Avg (μs)

Std (μs)

Min (μs)

Max (μs)

50% (μs)

90% (μs)

99% (μs)

99.99% (μs)

99.9999% (μs)

CPU (%)

32

44

18.8

38

9344

41

57

63

97

9344

64

45

20.7

38

9822

42

57

64

100

9822

128

45

17.2

38

8913

41

58

63

92

8913

512

47

20.0

40

9046

43

59

68

124

9046

1024

51

22.9

43

9217

47

62

78

99

9217

4096

60

5.6

55

120

57

66

83

114

120

8192

90

8.3

74

160

94

96

104

133

160

16384

195

42.3

95

330

192

247

295

322

330

32768

336

17.4

153

398

348

355

359

364

398

63000

376

6.0

218

446

376

379

384

400

446

  • Reliable

Sample Size (Bytes)

Avg (μs)

Std (μs)

Min (μs)

Max (μs)

50% (μs)

90% (μs)

99% (μs)

99.99% (μs)

99.9999% (μs)

CPU (%)

32

45

8.9

36

133

40

60

63

70

133

64

45

9.0

36

121

41

60

63

70

121

128

46

8.2

36

91

43

60

62

69

91

512

46

357.6

38

100077

41

58

62

71

100077

1024

49

7.5

40

95

47

61

63

70

95

4096

58

1.8

55

125

58

61

63

75

125

8192

75

1.8

72

128

74

78

79

87

128

16384

189

30.2

99

346

186

214

336

342

346

32768

347

9.9

139

396

350

352

354

364

396

63000

371

4.7

206

396

371

373

375

383

396


Test Hardware

The following hardware was used to perform these tests:

Linux Nodes

Dell R340 Servers (13 Units)
Processor: Intel Xeon E-2278G (3.4-5GHz, 8c/16t, 16MB cache, 2 memory channels @2666MHz)
RAM: 4x 16GB 2666MHz DIMM (64GB RAM)
HD: 480GB SATA SSD
NIC 1: Intel 710 dual port 10Gbps SFP
OS: Ubuntu 20.04 -- gcc 9.3.0

Switch

Dell 2048 -- 10Gbps switch (10Gbps and 1Gbps interfaces)