Dell PowerEdge C4140 Deep Learning Performance Comparison - Scale-up vs. Scale - Page 4

PowerEdge C4140 Multi Node Training with Different CPU Models vs 8x V100-16GB-SXM2

Page 4 highlights

Deep Learning Performance: Scale-up vs Scale-out 7.1.10 Non-Dell EMC server: 8x V100-16GB-SXM2 - Single Node 30 7.2 Throughput images/s - Multi Node 31 7.2.1 PowerEdge C4130-P100 16GB PCIe- Multi Node 31 7.2.2 PowerEdge C4140-K-V100-16GB and V100-32GB: SXM2 Multi Node 33 7.2.3 PowerEdge C4140-M-V100-16GB-SXM2 Multi Node 35 7.2.4 PowerEdge C4140-K Multi Node Training vs Non-Dell EMC 8x V100-16GB-SXM2 36 7.2.5 PowerEdge C4140-M Multi Node Training vs Non-Dell EMC 8x V100-16GB-SXM2 39 7.2.6 PowerEdge C4140 Multi Node Training with Different CPU Models vs 8x V100-16GB-SXM2 40 7.3 Results Showing Training Time Vs Accuracy 41 7.3.1 Elapsed Training Time for Several Models 42 7.4 Other Explored Aspects ...46 7.4.1 Hyper-parameters tuning...47 7.4.2 Learning Rate Effect in Distributed Mode 48 7.4.3 Communication and Neural Networks Primitives 50 8 Conclusion and Future Work ...51 9 Citation...52 10 References ...52 Architectures & Technologies Dell EMC | Infrastructure Solutions Group 3

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53

Deep Learning Performance: Scale-up vs Scale-out
Architectures & Technologies
Dell
EMC
| Infrastructure Solutions Group
3
7.1.10
Non-Dell EMC server: 8x V100-16GB-SXM2
Single Node
................................................
30
7.2
Throughput images/s
Multi Node
............................................................................................
31
7.2.1
PowerEdge C4130-P100 16GB PCIe- Multi Node
................................................................
31
7.2.2
PowerEdge C4140-K-V100-16GB and V100-32GB: SXM2 Multi Node
................................
33
7.2.3
PowerEdge C4140-M-V100-16GB-SXM2 Multi Node
.........................................................
35
7.2.4
PowerEdge C4140-K Multi Node Training vs Non-Dell EMC 8x V100-16GB-SXM2
............
36
7.2.5
PowerEdge C4140-M Multi Node Training vs Non-Dell EMC 8x V100-16GB-SXM2
...........
39
7.2.6
PowerEdge C4140 Multi Node Training with Different CPU Models vs 8x V100-16GB-SXM2
40
7.3
Results Showing Training Time Vs Accuracy
...............................................................................
41
7.3.1
Elapsed Training Time for Several Models
..........................................................................
42
7.4
Other Explored Aspects
..............................................................................................................
46
7.4.1
Hyper-parameters tuning
....................................................................................................
47
7.4.2
Learning Rate Effect in Distributed Mode
...........................................................................
48
7.4.3
Communication and Neural Networks Primitives
...............................................................
50
8
Conclusion and Future Work
..............................................................................................................
51
9
Citation
................................................................................................................................................
52
10
References
......................................................................................................................................
52