System Performance: Multi-Tasking - ASRock Industrial 4X4 BOX-7840U mini-PC Review: AMD Phoenix in a

One of the key drivers of advancements in computing systems is multi-tasking. On mobile devices, this is quite lightweight - cases such as background email checks while the user is playing a mobile game are quite common. Towards optimizing user experience in those types of scenarios, mobile SoC manufacturers started integrating heterogenous CPU cores -

System Performance: Multi-Tasking

One of the key drivers of advancements in computing systems is multi-tasking. On mobile devices, this is quite lightweight - cases such as background email checks while the user is playing a mobile game are quite common. Towards optimizing user experience in those types of scenarios, mobile SoC manufacturers started integrating heterogenous CPU cores - some with high performance for demanding workloads, while others were frugal in terms of both power consumption / die area and performance. This trend is now slowly making its way into the desktop PC space.

Multi-tasking in typical PC usage is much more demanding compared to phones and tablets. Desktop OSes allow users to launch and utilize a large number of demanding programs simultaneously. Responsiveness is dictated largely by the OS scheduler allowing different tasks to move to the background. The processor is required to work closely with the OS thread scheduler to optimize performance in these cases. Keeping these aspects in mind, the evaluation of multi-tasking performance is an interesting subject to tackle.

We have augmented our systems benchmarking suite to quantitatively analyze the multi-tasking performance of various platforms. The evaluation involves triggering a ffmpeg transcoding task to transform 1716 3840x1714 frames encoded as a 24fps AVC video (Blender Project's 'Tears of Steel' 4K version) into a 1080p HEVC version in a loop. The transcoding rate is monitored continuously. One complete transcoding pass is allowed to complete before starting the first multi-tasking workload - the PCMark 10 Extended bench suite. A comparative view of the PCMark 10 scores for various scenarios is presented in the graphs below. Also available for concurrent viewing are scores in the normal case where the benchmark was processed without any concurrent load, and a graph presenting the loss in performance.

Even at 28W, the Phoenix SoC has plenty of headroom to tackle multiple concurrent demanding workloads triggered by mainstream users (of the type benchmarked by PCMark 10). The 4X4 BOX-7840U configurations already had leading performance in the absence of concurrent loading in most segments. Activating the ffmpeg load reduces the raw score, but the performance loss is less than the competition and the Phoenix systems continue to stay on top.

Following the completion of the PCMark 10 benchmark, a short delay is introduced prior to the processing of Principled Technologies WebXPRT4 on MS Edge. Similar to the PCMark 10 results presentation, the graph below show the scores recorded with the transcoding load active. Available for comparison are the dedicated CPU power scores and a measure of the performance loss.

The observations made for the overall PCMark 10 scores continue to hold for WebXPRT 4 also. Interestingly, the handling by Phoenix is good enough to even prop up the systems to the top spot in the presence of the transcoding load despite being in the middle of the pack without the extra load.

The final workload tested as part of the multitasking evaluation routine is CINEBENCH R23.

Similar to the web browsing workload, the Intel systems lose their ST performance advantage in the presence of concurrent loading. As a result, the Phoenix systems rise to the top. In the MT version, they retain their top billing despite the drop in the raw scores.

After the completion of all the workloads, we let the transcoding routine run to completion. The monitored transcoding rate throughout the above evaluation routine (in terms of frames per second) is graphed below.


In the above scenario, the transcoding task is supposed to happen in the background. The user should ideally not care about the transcoding performance in that case, but it is still interesting to look into how that varies based on the concurrent load being applied.

ASRock Industrial 4X4 BOX-7840U (Ryzen 7 7840U @ 28W) ffmpeg Transcoding Rate (Multi-Tasking Test)
Task SegmentTranscoding Rate (FPS)
MinimumAverageMaximum
Transcode Start Pass4.514.544.5
PCMark 10012.7944.5
WebXPRT 44.512.6622.5
Cinebench R23013.5542
Transcode End Pass314.6545
ASRock Industrial 4X4 BOX-7840U (Ryzen 7 7840U @ 40W) ffmpeg Transcoding Rate (Multi-Tasking Test)
Task SegmentTranscoding Rate (FPS)
MinimumAverageMaximum
Transcode Start Pass5.517.1251.5
PCMark 10014.8149
WebXPRT 4614.7230.5
Cinebench R232.515.1659
Transcode End Pass617.0350.5

Despite the addition of heavy concurrent loading, we see that the drop in transcoding performance is not too worrisome outside of full saturation regions from the foreground tasks.

ncG1vNJzZmivp6x7orrAp5utnZOde6S7zGiqoaenZH9yfZhyZpqroqSwrHnIp5uuq6SntqK4jG2vbWWSpMV4hJNprGalmaO2sa%2BMq5yvoZWserG0zp6lorBdnrturc1mrJyelmKut63Tmqlobw%3D%3D

 Share!