Category: BenchmarkXPRT development community

Up next for WebXPRT 4: A new AI-focused workload!

on May 30, 2024

We’re always thinking about ways to improve WebXPRT. In the past, we’ve discussed the potential benefits of auxiliary workloads and the role that such workloads might play in future WebXPRT updates and versions. Today, we’re very excited to announce that we’ve decided to move forward with the development of a new WebXPRT 4 workload focused on browser-side AI technology!

WebXPRT 4 already includes timed AI tasks in two of its workloads: the Organize Album using AI workload and the Encrypt Notes and OCR Scan workload. These two workloads reflect the types of light browser-side inference tasks that have been available for a while now, but most heavy-duty inference on the web has historically happened in on-prem servers or in the cloud. Now, localized AI technology is growing by leaps and bounds, and the integration of new AI capabilities with browser-based tasks is on the threshold of advancing rapidly.

Because of this growth, we believe now is the time to start work on giving WebXPRT 4 the ability to evaluate new browser-based AI capabilities—capabilities that are likely to become a part of everyday life in the next few years. We haven’t yet decided on a test scenario or software stack for the new workload, but we’ll be working to refine our plan in the coming months. There seems to be some initial promise in emerging frameworks such as ONNX Runtime Web, which allows users to run and deploy web-based machine learning models by using JavaScript APIs and libraries. In addition, new Web APIs like WebGPU (currently supported in Edge, Chrome, and tech preview in Safari) and WebNN (in development) may soon help facilitate new browser-side AI workloads.

We know that many longtime WebXPRT 4 users will have questions about how this new workload may affect their tests. We want to assure you that the workload will be an optional bonus workload and will not run by default during normal WebXPRT 4 tests. As you consider possibilities for the new workload, here are a few points to keep in mind:

The workload will be optional for users to run.
It will not affect the main WebXPRT 4 subtest or overall scores in any way.
It will run separately from the main test and will produce its own score(s).
Current and future WebXPRT 4 results will still be comparable to one another, so users who’ve already built a database of WebXPRT 4 scores will not have to retest their devices.
Because many of the available frameworks don’t currently run on all browsers, the workload may not run on every platform.

As we research available technologies and explore our options, we would love to hear from you. If you have ideas for an AI workload scenario that you think would be useful or thoughts on how we should implement it, please let us know! We’re excited about adding new technologies and new value to WebXPRT 4, and we look forward to sharing more information here in the blog as we make progress.

Justin

Posted in AI, benchmark, BenchmarkXPRT, BenchmarkXPRT development community, browser performance, Browser-based benchmarks, Chrome, Collaborative benchmark development, Future of performance evaluation, JavaScript, Microsoft Edge, on-device AI, ONNX Runtime Web, Performance benchmarking, Safari, WebGPU, WebNN, WebXPRT, WebXPRT 4 | Tagged AI, benchmark, BenchmarkXPRT, browser benchmark, browser performance, cross-platform, OCR, ONNX, WebGPU, WebNN, WebXPRT, WebXPRT 4 |

XPRT mentions in the tech press

By Justin Greene

on April 4, 2024

One of the ways we monitor the effectiveness of the XPRT family of benchmarks is to regularly track XPRT usage and reach in the global tech press. Many tech journalists invest a lot of time and effort into producing thorough device reviews, and relevant and reliable benchmarks such as the XPRTs often serve as indispensable parts of a reviewer’s toolkit. Trust is hard-earned and easily lost in the benchmarking community, so we’re happy when our benchmarks consistently achieve “go-to” status for a growing number of tech assessment professionals around the world.

Because some of our newer readers may be unaware of the wide variety of outlets that regularly use the XPRTs, we occasionally like to share an overview of recent XPRT-related tech press activity. For today’s blog, we want to give readers a sampling of the press mentions we’ve seen over the past few months.

Recent mentions include:

AnandTech used WebXPRT 4 to assess the performance of the ASRock Industrial 4X4 BOX-7840U and GEEKOM A5 mini-PCs.
Android Headlines used CrXPRT 2 to measure the performance of the ASUS Chromebook Plus CX34.
Mashable measured the performance and battery life of the HP Chromebook Plus x360 14c with CrXPRT 2.
Notebookcheck used WebXPRT 4 in dozens of device reviews, including evaluations of the Apple MacBook Air 15 (M3, 2024), the HP Omen 16 (2024), the HP Spectre x360 16, the Lenovo ThinkPad X1 Carbon G12, and the Valve Steam Deck.
PCMag used WebXPRT 4 in a review of the Apple MacBook Air 15 (M3, 2024).
TechPowerUp used WebXPRT 4 in a review of the Intel Core i9-14900KS processor.
Tom’s Guide used WebXPRT 4 to compare the performance of the Snapdragon and Exynos variants of the Samsung Galaxy S24.
Other outlets that have published articles, ads, or reviews mentioning the XPRTs in the last few months include: 3DNews (Russia), Android Authority, Benchlife.info, Delkom (Poland), DigitalWorld Italia, Digitec (Switzerland), Expert Reviews, Galaxus (Germany), Hardware.info, HIPC (Japan), ITC.ua (Ukraine), ITWorld (Korea), iXBT.com (Russia), PCMag, PC-Welt (Germany), QQ.com (China), SMZDM (China), and Tweakers.

Each month, we send out a BenchmarkXPRT Development Community newsletter that contains the latest updates from the XPRT world and provides a summary of the previous month’s XPRT-related activity, including new mentions of the XPRTs in the tech press. If you don’t currently receive the monthly BenchmarkXPRT newsletter but would like to join the mailing list, please let us know! There is no cost to join, and we will not publish or sell any of the contact information you provide. We will send only the monthly newsletter and occasional benchmark-related announcements, such as news about patches or new releases.

Justin

Posted in AnandTech, Apple, ASUS, battery life, benchmark, Benchmark metrics, Benchmarking, BenchmarkXPRT, BenchmarkXPRT development community, Chromebooks, CrXPRT, HP, Lenovo, Performance benchmarking, Performance of computing devices, Samsung, tech press, What makes a good benchmark? | Tagged AnandTech, Android, Apple, ASRock, ASUS, GEEKOM, HP, Intel, Lenovo, Mashable, Notebookcheck, PCMag, PCWorld, Samsung, TechPowerUp, Tom's Hardware, WebXPRT, WebXPRT 4 |

Working with the WebXPRT 4 source code

By Justin Greene

on March 21, 2024

In our last blog post, we discussed the WebXPRT 4 source code and how you can contact us to request free access to the build package. In this post, we’ll address two questions that users sometimes ask about code access. The first question is, “How do I build a local instance of WebXPRT?” The second is, “What can I do with it?”

How to build a local WebXPRT 4 instance

After we receive your request, we’ll send you a secure link to the current WebXPRT 4 build package, which contains all the necessary source code files and installation instructions. You will need a system to use as a server, and you will need to be familiar with Apache, PHP, and MySQL configuration to follow the build instructions. WebXPRT 4 uses a LAMP (Linux, Apache, MySQL, and PHP) setup on the “server” side, but it’s also possible to set up an instance with a WAMP or XAMPP stack.

The build instructions include a step-by-step methodology for setup. If you are familiar with LAMP stack configuration, the build and configuration process should take about two to three hours, depending on whether your LAMP-related extensions and libraries are current.

What you can do with a local WebXPRT 4 instance

We allow users to set up their own WebXPRT 4 instances for purposes of review, internal testing, or experimentation.

One use-case example is internal OEM lab testing. Some labs use WebXPRT to conduct extensive testing on preproduction hardware, and they follow stringent security guidelines to avoid the possibility of any hardware or test information leaving the lab. Even though we have our own strict policies about how we handle the little amount of data that WebXPRT gathers from tests, a local WebXPRT 4 instance provides those labs with an extra layer of security for sensitive tests.

We do ask that users publish results only from tests that they run on WebXPRT.com. As we mentioned in our most recent post, benchmarking requires a product that is consistent to enable valid comparisons over time. We allow people to download the source, but we reserve the right to control derivative works and which products can use the name “WebXPRT.” That way, when people see WebXPRT scores in tech press articles or vendor marketing materials, they can run their own tests on WebXPRT.com and be confident that they’re using the same standard for comparison.

If you have any questions about using the WebXPRT 4 source code, let us know!

Justin

Posted in Apache, benchmark, Benchmark metrics, Benchmarking, BenchmarkXPRT, BenchmarkXPRT development community, browser performance, Browser-based benchmarks, Cross-platform benchmarks, LAMP, Linux, MySQL, Performance benchmarking, PHP, Source code, tech press, WAMP, WebXPRT, WebXPRT 4, XAMPP | Tagged Apache, benchmark, browser benchmark, browser performance, LAMP, Linux, MySQL, PHP, source code, tech press, WebXPRT, WebXPRT 4 |

Accessing the WebXPRT 4 source code

By Justin Greene

on March 7, 2024

If you’re new to the XPRTs, you may not be aware that we provide free access to XPRT benchmark source code. Publishing XPRT source code is part of our commitment to making the XPRT development process as transparent as possible. By allowing interested parties to access and review our source code, we’re encouraging openness and honesty in the benchmarking industry. We’re also inviting constructive feedback that can help ensure that the XPRTs continue to improve and contribute to a level playing field for all the types of products they measure.

While we do offer free access to the XPRT source code, we’ve decided to offer the code upon request instead of using a permanent download link. This approach prevents bots or other malicious actors from downloading the code. It also has the benefit of allowing us to interact with users who are interested in the source code and answer any questions they may have. We’re always keen to learn more about what others are thinking about the XPRTs and the types of work they measure.

We recently received some questions about accessing the WebXPRT 4 source code, which made us realize that we needed to make a clearer way for people to ask for the code. In response, we added a “Request WebXPRT 4 source code” link to the gray Helpful Info box on WebXPRT.com (see it in the screenshot below). Clicking the link will allow you to email the BenchmarkXPRT Support team directly and request the code.

After we receive your request, we’ll send you a secure link to the current WebXPRT 4 build package. For those users who wish to set up a local instance of WebXPRT 4 for their own internal testbeds, the package will contain all the necessary files and installation instructions. We allow folks to set up their own instances for purposes of review, internal testing, or experimentation, but we ask that users publish only test results from the official WebXPRT 4 site.

While we offer free access to XPRT source code, our approach to derivative works differs from some traditional open-source models that encourage developers to change products and even take them in different directions. Because benchmarking requires a product that remains static to enable valid comparisons over time, we allow people to download the source, but we reserve the right to control derivative works. This discourages a situation where someone publishes an unauthorized version of the benchmark and calls it an “XPRT.”

If you have any questions about accessing the WebXPRT 4 source code, let us know!

Justin

Posted in Benchmarking, BenchmarkXPRT development community, Collaborative benchmark development, Open Source, Performance benchmarking, Source code, WebXPRT, WebXPRT 4 | Tagged benchmark, BenchmarkXPRT, BenchmarkXPRT Development Community, browser benchmark, browser performance, open source, source code, WebXPRT, WebXPRT 4 |

WebXPRT benchmarking tips from the XPRT lab

By Justin Greene

on February 8, 2024

Occasionally, we receive inquiries from XPRT users asking for help determining why two systems with the same hardware configuration are producing significantly different WebXPRT scores. This can happen for many reasons, including different software stacks, but score variability can also result from different testing behaviors and environments. While some degree of variability is normal, these types of questions provide us with an opportunity to talk about some of the basic benchmarking practices we follow in the XPRT lab to produce the most consistent and reliable scores.

Below, we list a few basic best practices you might find useful in your testing. Most of them relate to evaluating browser performance with WebXPRT, but several of these practices apply to other benchmarks as well.

Hardware is not the only important factor: Most people know that different browsers produce different performance scores on the same system. Testers are not, however, always aware of shifts in performance between different versions of the same browser. While most updates don’t have a large impact on performance, a few updates have increased (or even decreased) browser performance by a significant amount. For this reason, it’s always important to record and disclose the extended browser version number for each test run. The same principle applies to any other relevant software.
Keep a thorough record of system information: We record detailed information about a test system’s key hardware and software components, including full model and version numbers. This information is not only important for later disclosure if we choose to publish a result, it can also sometimes help to pinpoint system differences that explain why two seemingly identical devices are producing very different scores. We also want people to be able to reproduce our results to the closest extent possible, so that commitment involves recording and disclosing more detail than you’ll find in some tech articles and product reviews.
Test with clean images: We typically use an out-of-box (OOB) method for testing new devices in the XPRT lab. OOB testing means that other than running the initial OS and browser version updates that users are likely to run after first turning on the device, we change as little as possible before testing. We want to assess the performance that buyers are likely to see when they first purchase the device and before they install additional software. This is the best way to provide an accurate assessment of the performance retail buyers will experience from their new devices. That said, the OOB method is not appropriate for certain types of testing, such as when you want to compare as close to identical system images as possible, or when you want to remove as much pre-loaded software as possible.
Turn off automatic updates: We do our best to eliminate or minimize app and system updates after initial setup. Some vendors are making it more difficult to turn off updates completely, but you should always double-check update settings before testing.
Get a baseline for system processes: Depending on the system and the OS, a significant amount of system-level activity can be going on in the background after you turn it on. As much as possible, we like to wait for a stable baseline (idle time) of system activity before kicking off a test. If we start testing immediately after booting the system, we often see higher variance in the first run before the scores start to tighten up.
Use more than one data point: Because of natural variance, our standard practice in the XPRT lab is to publish a score that represents the median from three to five runs, if not more. If you run a benchmark only once and the score differs significantly from other published scores, your result could be an outlier that you would not see again under stable testing conditions or over the course of multiple runs.

We hope these tips will help make your testing more accurate. If you have any questions about WebXPRT, the other XPRTs, or benchmarking in general, feel free to ask!

Justin

Posted in benchmark, Benchmark metrics, Benchmarking, Benchmarking computing devices, BenchmarkXPRT, BenchmarkXPRT development community, browser performance, Browser-based benchmarks, Cross-platform benchmarks, Performance benchmarking, Performance of computing devices, WebXPRT, WebXPRT 4 | Tagged benchmark, BenchmarkXPRT, BenchmarkXPRT Development Community, browser benchmark, browser performance, WebXPRT, WebXPRT 4, XPRTs |

Looking back on 2023 with the XPRTs

By Justin Greene

on January 4, 2024

Around the beginning of each new year, we like to take the opportunity to look back and summarize the XPRT highlights from the previous year. Readers of our newsletter are familiar with the stats and updates we include each month, but for our blog readers who don’t receive the newsletter, we’ve compiled highlights from 2023 below.

Benchmarks
In March, we celebrated the 10-year anniversary of WebXPRT! WebXPRT 4 has now taken the lead as the most commonly-used version of WebXPRT, even as the overall number of runs has continued to grow.

XPRTs in the media
Journalists, advertisers, and analysts referenced the XPRTs thousands of times in 2023. It’s always rewarding to know that the XPRTs have proven to be useful and reliable assessment tools for technology publications around the world. Media sites that used the XPRTs in 2023 include 3DNews (Russia), AnandTech, Benchlife.info (China), CHIP.pl (Poland), ComputerBase (Germany), eTeknix, Expert Reviews, Gadgetrip (Japan), Gadgets 360, Gizmodo, Hardware.info, IT168.com (China), ITC.ua (Ukraine), ITWorld (Korea), iXBT.com (Russia), Lyd & Bilde (Norway), Notebookcheck, Onchrome (Germany), PCMag, PCWorld, QQ.com (China), Tech Advisor, TechPowerUp, TechRadar, Tom’s Guide, TweakTown, Yesky.com (China), and ZDNet.

Downloads and confirmed runs
In 2023, we had more than 16,800 benchmark downloads and 296,800 confirmed runs. Users have run our most popular benchmark, WebXPRT, more than 1,376,500 times since its debut in 2013! WebXPRT continues to be a go-to, industry-standard performance benchmark for OEM labs, vendors, and leading tech press outlets around the globe.

Trade shows
In January, Justin attended the 2023 Consumer Electronics Show (CES) Las Vegas. In March, Mark attended Mobile World Congress (MWC) 2023 in Barcelona. You can view Justin’s recap of CES here and Mark’s thoughts from MWC here.

We’re thankful for everyone who used the XPRTs and sent questions and suggestions throughout 2023. We’re excited to see what’s in store for the XPRTs in 2024!

Justin

Posted in Barcelona, benchmark, BenchmarkXPRT, BenchmarkXPRT development community, CES, Collaborative benchmark development, Consumer Electronics Show, History of benchmarking, Las Vegas, Mobile World Congress, Performance benchmarking, results, results submission, tech press, Trade Shows, WebXPRT, WebXPRT 4 | Tagged 3DNews, AnandTech, Barcelona, CES, ComputerBase, Gadgets360, Gizmodo, Hardware.info, Las Vegas, MWC, Notebookcheck, PCMag, PCWorld, QQ.com, results, tech press, TechPowerUp, TechRadar, Tom's Hardware, WebXPRT, WebXPRT 4, ZDNet |

Category: BenchmarkXPRT development community

Up next for WebXPRT 4: A new AI-focused workload!

XPRT mentions in the tech press

Working with the WebXPRT 4 source code

Accessing the WebXPRT 4 source code

WebXPRT benchmarking tips from the XPRT lab

Looking back on 2023 with the XPRTs

Check out the other XPRTs: