A Principled Technologies report: Hands-on testing. Real-world results.

Accelerate 3D modeling and rendering. Higher Cinebench R23, Geekbench 6 Pro, Blender 3.5, and Mason Redshift benchmark scores. Expedite project cycles. Faster Blender 3.6 renders while processing more samples per minute. Tackle complex AI/ML problems. More 3D U-Net, Bert-99, RNN-T, and ResNet-50 samples per second.

Get results from demanding workflows in less time with the new HP Z8 Fury G5 Workstation Desktop PC

Compared to a Lenovo ThinkStation P620 Tower Workstation

Today’s creatives and technical professionals face a common challenge—demanding workflows that require significant processing power. And, as always, time is money.

At Principled Technologies, we compared CPU and GPU performance using benchmarks and several MLPerf machine learning (ML) model performance between an HP Z8 Fury G5 Workstation Desktop PC and a Lenovo® ThinkStation® P620 Tower Workstation. We equipped both workstations with 128 GB of RAM (plenty for our tests) and configured them with the most powerful CPU and GPU hardware available at the time of testing. We loaded the HP Z8 Fury G5 with an Intel® Xeon® w9-3495X CPU and four NVIDIA® RTX 6000 Ada Generation GPUs. And we loaded the Lenovo ThinkStation P620 with an AMD Ryzen Threadripper PRO 5995WX CPU and two NVIDIA RTX A6000 GPUs with NVLink.

These results are relevant to creatives and technical professionals who do such computationally intense work as generating photorealistic images; running advanced simulations and visualizations; depending on hardware-intensive computer-aided design (CAD); or working with complex datasets that include videos, images, and speech.

How we tested

Before we started testing, we set the Lenovo workstation power mode to “high performance” and the HP workstation power mode to “ultimate performance.” Other than those changes, we used out-of-box OEM performance settings. We tested the highest performance CPU and GPU configurations available for each workstation:

HP Z8 Fury G5 Workstation Desktop PC*
1x 56-core Intel Xeon w9-3495X CPU (1.9 - 4.8 GHz)
4x NVIDIA RTX 6000 Ada Generation GPUs
48GB VRAM per GPU
128GB DDR5-4800 memory
4x 1TB PCIe® storage

Lenovo ThinkStation P620 Tower Workstation*
1x 64-core AMD Ryzen Threadripper PRO 5995WX CPU (2.7 - 4.5 GHz)
2x NVIDIA RTX A6000 GPUs with NVLink
2x 48GB VRAM combined via NVLink high-speed interconnect
128GB DDR5-3200 memory
1TB PCIe storage

*A note on GPU and storage discrepancies: We could only have configured the Lenovo ThinkStation P620 with either two RTX A6000 GPUs with NVLink or a single RTX 6000 Ada—there were no other configuration options for top-performing cards. Although the Lenovo ThinkStation had only two GPUs, those two RTX A6000 GPUs also had an NVLink. NVLink is a high-speed processor interconnect that enabled the two RTX A6000 processors to “send and receive data from shared pools of memory at lightning speed,”1 which effectively doubles the amount of memory available to benchmarks that have large datasets. This configuration gave the Lenovo an advantage versus using a single RTX 6000 Ada. As for the storage discrepancy, our benchmark tests taxed each system’s primary 1TB high-performance PCIe storage, but we did not run any tests that we believe storage capacity would affect.

Because application performance can be limited by processor speeds, we chose punishing CPU- and GPU-intensive content creation benchmarks that stressed each workstation in different ways:

  • Blender to compare GPU rendering, multi-GPU acceleration, and ray-tracing performance
  • Cinebench R23 and Geekbench 6 Pro to compare CPU single-core performance
  • Geekbench 6 Pro to compare GPU performance with augmented reality (AR) and ML workloads
  • Maxon Redshift to compare production rendering performance
  • PugetBench for Lightroom Classic to compare photo-editing performance with the Adobe® Lightroom® Classic app

We also ran the following MLPerf Inference Benchmark Suite ML models in the offline scenario mode, where the workstations can process the data in any order, without latency constraints.4 These results enable readers to compare inference performance from multiple angles:

  • 3D U-Net to compare medical imaging and 3D image segmentation performance
  • BERT-99 to compare natural language processing performance
  • ResNet-50 to compare image classification and detection performance
  • RNN-T to compare speech recognition performance

We ran each test three times and report the median results. The benchmark scores and ML results we report reflect the specific configurations we tested. Any difference in the configurations you test, as well as network traffic or software additions, could generate results. For a deeper dive into our testing parameters and procedures, see the science behind the report.

Note: The graphs in this report use different x-axis scales to keep to a consistent size. Please be mindful of each graph’s data range as you compare.

Deliver measurable value to your business

The World Economic Forum says that, in the next five years, over 75 percent of businesses are planning to adopt big data solutions, make use of cloud computing technologies, and capitalize on AI. These goals make high-performance workstations a serious contender for many creative and professional endeavors.5

Accelerate 3D modeling and rendering

The Cinebench benchmark provides performance scores based on “Cinema 4D’s ability to take advantage of multiple CPU cores and modern processor features available to the average user.”6 Cinema 4D is real-world 3D computer animation, modeling, simulation, and rendering software.7 The Geekbench 6 Pro benchmark uses popular, real-world applications and realistic data sets to evaluate performance in cutting-edge areas such as AR and ML.8

Higher CPU single-score scores in our Cinebench R23 and Geekbench 6 Pro comparisons accentuate the advantages the Intel Xeon w9-3495X processor-powered system brings to the table.

Chart of Cinebench R23 CPU single-core benchmark results. Higher is better. HP Z8 Fury G5 Workstation has a 1,669 score and Lenovo ThinkStation P620 Tower Workstation has a 1,485 score. 12.3 percent higher score.
Cinebench R23 CPU single-core scores. Higher is better. Source: Principled Technologies.
Chart of Geekbench 6 Pro single-core benchmark results. Higher is better. HP Z8 Fury G5 Workstation has a 2,168 score and Lenovo ThinkStation P620 Tower Workstation has a 2,044 score. 6.0 percent higher score.
Geekbench 6 Pro CPU single-core scores. Higher is better. Source: Principled Technologies.

The higher GPU Compute score in our Geekbench 6 Pro comparison shows how much better the HP Z8 Fury G5 Workstation could handle computational tasks such as identifying objects, blurring backgrounds, running a particle physics simulation, and image synthesis.9

Chart of Geekbench 6 Pro GPU Compute OpenGL benchmark results. Higher is better. HP Z8 Fury G5 Workstation has a 279,364 score and Lenovo ThinkStation P620 Tower Workstation has a 191,089 score. 46.2 percent higher score.
Geekbench 6 Pro GPU Compute scores. Higher is better. Source: Principled Technologies.

The following Blender 3.5 and Maxon Redshift benchmarks provide a snapshot of how much better the HP Z8 Fury G5 with its 56-core Intel Xeon w9-3495X CPU and four NVIDIA RTX 6000 GPUs could handle resource-intensive applications and complex workflows compared to the Lenovo ThinkStation P620 with its 64-core AMD Ryzen Threadripper PRO 5995WX CPU and two NVIDIA RTX A6000 GPUs with NVLink.

A higher number of samples translates to a cleaner picture—but the compromise is often longer render times. In both the Blender 3.5 and Maxon Redshift GPU rendering benchmark comparisons, the HP Z8 Fury G5 Workstation Desktop PC blows past that possible bottleneck.

Chart of Blender 3.5 GPU rendered samples per minute. Higher is better. For the Monster workload, HP Z8 Fury G5 Workstation is 6,848.09 samples per minute and Lenovo ThinkStation P620 Tower Workstation is 2,910.87 samples per minute; 135.2 percent more samples per minute. For the Junkshop workload, HP Z8 Fury G5 Workstation is 3,194.10 samples per minute and Lenovo ThinkStation P620 Tower Workstation is 1,744.34 samples per minute; 83.1 percent more samples per minute. For the Classroom workload, HP Z8 Fury G5 Workstation is 3,212.68 samples per minute and Lenovo ThinkStation P620 Tower Workstation is 1,493.47 samples per minute; 115.1 percent more samples per minute.
Blender 3.5 samples per minute. Higher is better. Source: Principled Technologies.
Chart of Maxon Redshift render times. Lower is better. HP Z8 Fury G5 Workstation time is 41 seconds and Lenovo ThinkStation P620 Tower Workstation time is one minute and twenty-five seconds. 51.7 percent less time or 44 seconds faster.
Maxon Redshift render times. Lower is better. Source: Principled Technologies.

Expedite project cycles

The faster professional photographers and graphic designers can complete tasks with Adobe® Creative Cloud® applications, the easier it can be to meet deadlines.

PugetBench for Adobe Lightroom Classic overall scores. Higher is better. HP Z8 Fury G5 Workstation score is 1,510.5 and Lenovo ThinkStation P620 Tower Workstation is 1,195.0. 26.4 percent higher score.
PugetBench for Adobe Lightroom Classic overall scores. Higher is better. Source: Principled Technologies.

According to NVIDIA, the NVIDIA RTX 6000 Ada Generation graphics card “combines third-generation RT Cores, fourth-generation Tensor Cores, and next-generation CUDA® cores with 48 GB of graphics memory for unprecedented rendering, AI, graphics, and compute performance.”10 Much faster Blender 3.6 render times while processing significantly more samples per minute makes the dream reality.

Chart of Blender 3.6 multi-GPU rendering results. For CUDA render time (lower is better): HP Z8 Fury G5 Workstation render time is 44.17 seconds and Lenovo ThinkStation P620 Tower Workstation render time is 1 minute and 36.05 seconds. 51 seconds faster. For CUDA samples per minute (higher is better): HP Z8 Fury G5 Workstation is 3,129.73 samples per minute and Lenovo ThinkStation P620 Tower Workstation is 1,439.25 samples per minute. 117.4 percent more samples per minute. For OptiX render time (lower is better): HP Z8 Fury G5 Workstation render time is 51.72 seconds and Lenovo ThinkStation P620 Tower Workstation render time is 1 minute and 7.59 seconds. 15 seconds faster. For OptiX samples per minute (higher is better): HP Z8 Fury G5 Workstation is 2,672.85 samples per minute and Lenovo ThinkStation P620 Tower Workstation is 2,045.27 samples per minute. 30.6 percent more samples per minute.
Blender 3.6 ray-tracing render times and samples per minute with 200 percent resolution, 256 samples (2,304 total samples). Less time is better, and higher rates of samples per minute are better. Source: Principled Technologies.

Tackle complex AI/ML problems

According to Seed Scientific, at the beginning of 2020, the amount of data in the world was estimated at 44 zettabytes (a single zettabyte has 21 zeros). This humongous number includes data generated by “social media sites, financial institutions, medical facilities, shopping platforms, automakers, and many other activities online.”11 And, while your business may not want or need to process even 1 zettabyte of data, your digital transformation specialists, data analysts, business intelligence analysts, and scientists need powerful workstations that can nimbly crunch through your still-vast troves of valuable data and get usable results as quickly as possible.

The medical imaging, language processing, and computer vision scenarios we ran from the MLPerf Inference Benchmark Suite use trained models to measure how quickly each workstation processed inputs and produced results.12

Before we dive into our machine learning test results, it’s important to note that the final stage of the machine learning process is inference—that’s the time when your proven model has all the data, training, evaluation, and tuning your experts deem necessary for that machine learning model to make informed and (we hope) accurate predictions.13

Advance research endeavors and accurate medical treatment plans

The healthcare sector uses medical imaging (e.g., X-rays, ultrasounds, MRIs, and CT scans) for medical research, disease diagnosis, and drug discovery.14 The 3D U-Net model we ran “performs volumetric segmentation of dense 3D images for medical use cases.”15 A higher number of samples here equals a cleaner image, which gives medical professionals an edge when a speedy and correct diagnosis can be the deciding factor in life and death situations.

Chart of 3D U-Net model in the offline scenario results. For 3D U-Net samples per second (higher is better): HP Z8 Fury G5 Workstation is 9.97936 samples per second and Lenovo ThinkStation P620 Tower Workstation is 3.70833 samples per second. For 3D U-Net latency (lower is better): HP Z8 Fury G5 Workstation latency is 20 minutes 30 seconds and Lenovo ThinkStation P620 Tower Workstation is 55 minutes and 12 seconds.
Number of samples per second each workstation classified and latency using the 3D U-Net model in the offline scenario. Higher numbers of samples are better, and lower latency is better. Source: Principled Technologies.

Improve customer experiences

One of the earliest Natural Language Processing (NLP) applications was an email spam filter. Today, that recurrent neural network (RNN) aspect of NLP categorizes incoming emails as primary, social, promotional, or spam based on their contents.16 The RNN-T model we ran “recognizes and transcribes audio in real time.”17 Other examples of RNN applications are predicting stock prices, text mining, and language translation.18

Chart of RNN-T model in the offline scenario results. For RNN-T samples per second (higher is better): HP Z8 Fury G5 Workstation is 22,422.60 samples per second and Lenovo ThinkStation P620 Tower Workstation is 13,031.10 samples per second. For RNN-T latency (lower is better): HP Z8 Fury G5 Workstation latency is 6 minutes and 24 seconds and Lenovo ThinkStation P620 Tower Workstation is 6 minutes and 54 seconds.
Number of samples per second each workstation classified and mean latency using the RNN-T model in the offline scenario. Higher numbers of samples are better, and lower latency is better. Source: Principled Technologies.

Another NLP model is BERT (Bidirectional Encoder Representations from Transformers). BERT, unlike RNN, which takes its text context from word order and punctuation, can “capture the semantic and syntactic features of a text.”19 The BERT model we ran sorts and analyzes text to make accurate language predictions, answer questions correctly, and respond to conversations without errors 99 percent of the time.20 Real-world examples of BERT applications include virtual assistants (e.g., Amazon® Alexa®, Google Assistant, and Apple® Siri®), chatbots, image and video captioning, and sentiment analysis.21

Chart of BERT-99 model in the offline scenario results. For BERT-99 samples per second (higher is better): HP Z8 Fury G5 Workstation is 7,545,390 samples per second and Lenovo ThinkStation P620 Tower Workstation is 3,433.930 samples per second. For BERT-99 latency (lower is better): HP Z8 Fury G5 Workstation latency is 6 minutes and 44 seconds and Lenovo ThinkStation P620 Tower Workstation is 6 minutes and 45 seconds
Number of samples per second each workstation classified and latency using the BERT-99 model in the offline scenario. Higher numbers of samples are better, and lower latency is better. Source: Principled Technologies.

Achieve higher levels of productivity

The 50-layer ResNet model we ran “[a]ssigns a label from a fixed set of categories to an input image, i.e., applies to computer vision problems.”22 Computer vision is a catch-all term for machine learning that enables computers to mimic human vision to see, identify, and understand both objects and people in images and video. Computer vision applications include facial recognition, autonomous cars, production-line automation, plant species classification, edge computing, and sports performance analysis.23

Chart of ResNet-50 model in the offline scenario results. For ResNet-50 samples per second (higher is better): HP Z8 Fury G5 Workstation is 85,915.60 samples per second and Lenovo ThinkStation P620 Tower Workstation is 39,870.90 samples per second. For ResNet-50 latency (lower is better): HP Z8 Fury G5 Workstation latency is 5 seconds and Lenovo ThinkStation P620 Tower Workstation is 5 minutes and 30 seconds
Number of samples per second each workstation classified and mean latency using the ResNet-50 model in the offline scenario. Higher numbers of samples are better, and lower latency is better. Source: Principled Technologies.

Conclusion

By adopting big data solutions, making use of cloud computing technologies, and capitalizing on AI, companies are investing in their future success. With powerful workstations, your key personnel can get the results you need to electrify that bright future in less time. Our content creation and machine learning benchmark results show that the new HP Z8 Fury G5 Workstation powered by an Intel Xeon w9-3495X CPU and four NVIDIA RTX 6000 Ada-generation GPUs could be just the investment you need.

    1. NVIDIA, “What is NVLink?” accessed September 21, 2023, https://blogs.nvidia.com/blog/2023/03/06/what-is-nvidia-nvlink/.
    2. HP, “HP Z8 Fury,“ accessed August 22, 2023, https://www.hp.com/us-en/workstations/z8-fury.html.
    3. HP, “HP Anyware Remote System Controller,” accessed August 22, 2023, https://www.hp.com/us-en/solutions/anyware-remote-system-controller.html.
    4. Sally Ward-Foxton for the EE Times, “Understanding MLPerf Benchmark Scores,” accessed September 22, 2023, https://www.eetimes.com/understanding-mlperf-benchmark-scores/.
    5. World Economic Forum, “Future of Jobs Report 2023,” accessed July 31, 2023, https://www3.weforum.org/docs/WEF_Future_of_Jobs_2023.pdf.
    6. Maxon, “Cinebench,” accessed August 22, 2023, https://www.maxon.net/en/cinebench.
    7. Maxon, “Cinema 4D,” accessed August 22, 203, https://www.maxon.net/en/cinema-4d.
    8. Geekbench, “Introducing Geekbench 6,” accessed August 22, 2023, https://www.geekbench.com.
    9. Geekbench, “Geekbench 6 GPU Compute Workloads,” accessed August 23, 2023, https://www.geekbench.com/doc/geekbench6-gpu-compute-workloads.pdf.
    10. NVIDIA, “NVIDIA RTX 6000 Ada Generation Graphics Card,” accessed August 21, 2023, https://www.nvidia.com/en-us/design-visualization/rtx-6000/.
    11. Seed Scientific, “How Much Data Is Created Every Day? +27 Staggering Stats,’ accessed September 21, 2023, https://seedscientific.com/how-much-data-is-created-every-day/.
    12. NVIDIA, “What is MLPerf?” accessed September 22, 2023, https://www.nvidia.com/en-us/data-center/resources/mlperf-benchmarks/.
    13. Matthew Mayo, “Frameworks for Approaching the Machine Learning Process,” accessed September 20, 2023, https://www.kdnuggets.com/2018/05/general-approaches-machine-learning-process.html.
    14. Simplilearn, “Top 25 Deep Learning Application Industries,” accessed September 22, 2023, https://www.simplilearn.com/tutorials/deep-learning-tutorial/deep-learning-applications.
    15. NVIDIA, “What is MLPerf?”
    16. Tableau, “8 Natural Language Processing (NLP) Examples,” accessed September 22, 2023, https://www.tableau.com/learn/articles/natural-language-processing-examples#:.
    17. NVIDIA, “What is MLPerf?”
    18. Great Learning, “What Is Recurrent Neural Network | Introduction of Recurrent Neural Network,” accessed September 22, 2023, https://www.mygreatlearning.com/blog/recurrent-neural-network/.
    19. LinkedIn, “How do you compare and contrast BERT with other deep learning approaches for sentiment analysis?”Accessed October 16, 2023, https://www.linkedin.com/advice/0/how-do-you-compare-contrast-bert-other-deep-learning#:~:text=BERT.
    20. NVIDIA, “What is MLPerf?”
    21. Simplilearn, “Top 25 Deep Learning Application Industries.”
    22. NVIDIA, “What is MLPerf?”
    23. Built In, “What Is Computer Vision?” accessed September 22, 2023, https://builtin.com/machine-learning/computer-vision.
    24. Intel, “Intel Launches new Xeon Workstation Processors—the Ultimate Solution for Professionals,” accessed August 22, 2023, https://www.intel.com/content/www/us/en/newsroom/news/intel-launches-new-xeon-workstation-processors.html#gs.4quj4k.

This project was commissioned by HP.

November 2023

Principled Technologies is a registered trademark of Principled Technologies, Inc.

All other product names are the trademarks of their respective owners.

Forgot your password?