Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents

Ubuntu 24.04 Linux

With 2x32GB 4800 MT/s CL38 DDR5 RAM

frameworkBackendDeviceSingleHalfQuant.Results
TensorFlow Lite
CPUIntel Core Ultra 9 185H197120371382https://browser.geekbench.com/ai/v1/259291
ONNX
CPUIntel Core Ultra 9 185H21386435634https://browser.geekbench.com/ai/v1/259319
OpenVINO
CPUIntel Core Ultra 9 185H4507454110765https://browser.geekbench.com/ai/v1/259321
OpenVINO
GPUIntel(R) Arc(TM) Graphics (iGPU)
Workload (TF Lite)AccuracyScoreWorkload (ONNX)AccuracyScoreWorkload (OpenVINO)AccuracyScoreImage Classification (SP)100%1510
280.9 IPSImage Classification (SP)100%1106
205.7 IPS Image Classification (HP)100%1428
265.6 IPSImage Classification (HP)100%133
24.6 IPS Image Classification (Q)99%993
185.1 IPSImage Classification (Q)97%4177
779.5 IPS Image Segmentation (SP)100%2253
36.5 IPSImage Segmentation (SP)100%1141
18.5 IPS Image Segmentation (HP)100%2243
36.4 IPSImage Segmentation (HP)100%209
3.38 IPS Image Segmentation (Q)98%1035
16.8 IPSImage Segmentation (Q)99%2658
43.2 IPS Pose Estimation (SP)100%2357
2.75 IPSPose Estimation (SP)100%3668
4.28 IPS Pose Estimation (HP)100%2309
2.69 IPSPose Estimation (HP)100%3007
3.51 IPS Pose Estimation (Q)96%3224
3.78 IPSPose Estimation (Q)94%20021
23.5 IPS Object Detection (SP)100%1654
131.2 IPSObject Detection (SP)100%1544
122.5 IPS Object Detection (HP)100%1648
130.7 IPSObject Detection (HP)100%269
21.3 IPS Object Detection (Q)85%1024
82.4 IPSObject Detection (Q)86%4605
370.1 IPS Face Detection (SP)100%3071
36.5 IPSFace Detection (SP)100%2807
33.4 IPS Face Detection (HP)100%3060
36.4 IPSFace Detection (HP)100%314
3.73 IPS Face Detection (Q)97%2278
27.2 IPSFace Detection (Q)97%12436
148.3 IPS Depth Estimation (SP)100%2317
17.9 IPSDepth Estimation (SP)100%4220
32.5 IPS Depth Estimation (HP)99%2507
19.3 IPSDepth Estimation (HP)99%1121
8.64 IPS Depth Estimation (Q)63%1964
18.4 IPSDepth Estimation (Q)78%13848
110.5 IPS Style Transfer (SP)100%2892
3.72 IPSStyle Transfer (SP)100%9110
11.7 IPS Style Transfer (HP)100%2928
3.76 IPSStyle Transfer (HP)100%7498
9.64 IPS Style Transfer (Q)98%5650
7.29 IPSStyle Transfer (Q)98%17976
23.2 IPS Image Super-Resolution (SP)100%1494
55.2 IPSImage Super-Resolution (SP)100%1774
65.5 IPS Image Super-Resolution (HP)100%1911
70.6 IPSImage Super-Resolution (HP)100%1166
43.1 IPS Image Super-Resolution (Q)97%1463
54.2 IPSImage Super-Resolution (Q)99%3013
111.6 IPS Text Classification (SP)100%1229
1.64 KIPSText Classification (SP)100%1105
1.48 KIPS Text Classification (HP)100%1105
1.47 KIPSText Classification (HP)100%333
444.7 IPS Text Classification (Q)92%390
524.3 IPSText Classification (Q)97%1083
1.45 KIPS Machine Translation (SP)100%1771
30.5 IPSMachine Translation (SP)100%1320
22.7 IPS Machine Translation (HP)100%2135
36.8 IPSMachine Translation (HP)100%530
9.14 IPS Machine Translation (Q)58%520
12.2 IPSMachine Translation (Q)65%3117
62.6 IPS

before some drivers added

Code Block
root@server6:/mnt/GeekbenchAI-1.3.0-Linux# ./banff --ai-list
Geekbench AI 1.3.0 : https://www.geekbench.com/ai/

Geekbench AI requires an active internet connection and automatically uploads
benchmark results to the Geekbench Browser.

Framework     | Backend       | Device
 1 TensorFlow Lite |  1 CPU        |  0 Intel Core Ultra 9 185H
 3 ONNX       |  1 CPU        |  0 Intel Core Ultra 9 185H
 4 OpenVINO   |  1 CPU        |  0 Intel(R) Core(TM) Ultra 9 185H

Install

Code Block
# OpenVino
wget https://apt.repos.intel.com/intel-gpg-keys/GPG-PUB-KEY-INTEL-SW-PRODUCTS.PUB
sudo apt-key add GPG-PUB-KEY-INTEL-SW-PRODUCTS.PUB
echo "deb https://apt.repos.intel.com/openvino/2025 ubuntu24 main" | sudo tee /etc/apt/sources.list.d/intel-openvino-2025.list
apt update
apt-cache search openvino
apt install openvino
python3 /usr/share/openvino/samples/python/hello_query_device/hello_query_device.py
 
# NPU driver
wget https://github.com/intel/linux-npu-driver/releases/download/v1.17.0/intel-driver-compiler-npu_1.17.0.20250508-14912879441_ubuntu24.04_amd64.deb
wget https://github.com/intel/linux-npu-driver/releases/download/v1.17.0/intel-fw-npu_1.17.0.20250508-14912879441_ubuntu24.04_amd64.deb
wget https://github.com/intel/linux-npu-driver/releases/download/v1.17.0/intel-level-zero-npu_1.17.0.20250508-14912879441_ubuntu24.04_amd64.deb
dpkg --purge --force-remove-reinstreq intel-driver-compiler-npu intel-fw-npu intel-level-zero-npu
apt update
apt install libtbb12
dpkg -i *.deb
wget https://github.com/oneapi-src/level-zero/releases/download/v1.21.9/level-zero_1.21.9+u24.04_amd64.deb
dpkg -i level-zero*.deb

Code Block
#Geekbench
mkdir Geekbench
cd Geekbench
wget https://cdn.geekbench.com/GeekbenchAI-1.3.0-Linux.tar.gz
tar xvf GeekbenchAI-1.3.0-Linux.tar.gz
cd GeekbenchAI-1.3.0-Linux/

Check Available frameworks

Ubuntu 24.04

With 2x64GB 5600 MT/s CL46 DDR5 RAM

governor: performance, PL1=70W, PL2=80W

frameworkBackendDeviceSingleHalfQuant.Results
TensorFlow Lite
CPUIntel Core Ultra 9 185H217421651413https://browser.geekbench.com/ai/v1/259993
ONNX
CPUIntel Core Ultra 9 185H23047165762https://browser.geekbench.com/ai/v1/260000
OpenVINO
CPUIntel Core Ultra 9 185H4631468211063https://browser.geekbench.com/ai/v1/260002
OpenVINO
GPUIntel(R) Arc(TM) Graphics (iGPU)6196934813454https://browser.geekbench.com/ai/v1/260003


Workload (TF Lite)AccuracyScoreWorkload (ONNX)AccuracyScoreWorkload (OpenVINO-CPU)AccuracyScoreWorkload (OpenVINO-GPU)AccuracyScore
Image Classification (SP)100%1510
280.9 IPS
Image Classification (SP)100%1106
205.7 IPS
Image Classification (SP)100%2587
481.0 IPS
Image Classification (SP)100%2587
481.0 IPS
Image Classification (HP)100%1428
265.6 IPS
Image Classification (HP)100%133
24.6 IPS
Image Classification (HP)100%2716
505.1 IPS
Image Classification (HP)100%4288
797.5 IPS
Image Classification (Q)99%993
185.1 IPS
Image Classification (Q)97%4177
779.5 IPS
Image Classification (Q)100%6637
1.23 KIPS
Image Classification (Q)100%5112
950.6 IPS
Image Segmentation (SP)100%2253
36.5 IPS
Image Segmentation (SP)100%1141
18.5 IPS
Image Segmentation (SP)100%3453
56.0 IPS
Image Segmentation (SP)100%3131
50.7 IPS
Image Segmentation (HP)100%2243
36.4 IPS
Image Segmentation (HP)100%209
3.38 IPS
Image Segmentation (HP)100%3468
56.2 IPS
Image Segmentation (HP)100%6965
112.9 IPS
Image Segmentation (Q)98%1035
16.8 IPS
Image Segmentation (Q)99%2658
43.2 IPS
Image Segmentation (Q)99%6945
112.6 IPS
Image Segmentation (Q)99%9262
150.6 IPS
Pose Estimation (SP)100%2357
2.75 IPS
Pose Estimation (SP)100%3668
4.28 IPS
Pose Estimation (SP)100%5030
5.87 IPS
Pose Estimation (SP)100%22411
26.1 IPS
Pose Estimation (HP)100%2309
2.69 IPS
Pose Estimation (HP)100%3007
3.51 IPS
Pose Estimation (HP)100%5100
5.95 IPS
Pose Estimation (HP)99%20212
23.7 IPS
Pose Estimation (Q)96%3224
3.78 IPS
Pose Estimation (Q)94%20021
23.5 IPS
Pose Estimation (Q)96%18872
22.1 IPS
Pose Estimation (Q)97%44712
52.3 IPS
Object Detection (SP)100%1654
131.2 IPS
Object Detection (SP)100%1544
122.5 IPS
Object Detection (SP)100%2648
210.0 IPS
Object Detection (SP)100%2429
192.7 IPS
Object Detection (HP)100%1648
130.7 IPS
Object Detection (HP)100%269
21.3 IPS
Object Detection (HP)100%2652
210.4 IPS
Object Detection (HP)100%3775
299.4 IPS
Object Detection (Q)85%1024
82.4 IPS
Object Detection (Q)86%4605
370.1 IPS
Object Detection (Q)88%6968
558.5 IPS
Object Detection (Q)88%6319
506.4 IPS
Face Detection (SP)100%3071
36.5 IPS
Face Detection (SP)100%2807
33.4 IPS
Face Detection (SP)100%7234
86.0 IPS
Face Detection (SP)100%5692
67.6 IPS
Face Detection (HP)100%3060
36.4 IPS
Face Detection (HP)100%314
3.73 IPS
Face Detection (HP)100%7248
86.1 IPS
Face Detection (HP)100%11061
131.4 IPS
Face Detection (Q)97%2278
27.2 IPS
Face Detection (Q)97%12436
148.3 IPS
Face Detection (Q)100%14581
173.3 IPS
Face Detection (Q)100%16836
200.0 IPS
Depth Estimation (SP)100%2317
17.9 IPS
Depth Estimation (SP)100%4220
32.5 IPS
Depth Estimation (SP)100%6716
51.7 IPS
Depth Estimation (SP)100%12001
92.5 IPS
Depth Estimation (HP)99%2507
19.3 IPS
Depth Estimation (HP)99%1121
8.64 IPS
Depth Estimation (HP)99%6752
52.0 IPS
Depth Estimation (HP)98%20362
157.4 IPS
Depth Estimation (Q)63%1964
18.4 IPS
Depth Estimation (Q)78%13848
110.5 IPS
Depth Estimation (Q)89%17848
138.8 IPS
Depth Estimation (Q)89%22566
175.4 IPS
Style Transfer (SP)100%2892
3.72 IPS
Style Transfer (SP)100%9110
11.7 IPS
Style Transfer (SP)100%15042
19.3 IPS
Style Transfer (SP)100%46431
59.7 IPS
Style Transfer (HP)100%2928
3.76 IPS
Style Transfer (HP)100%7498
9.64 IPS
Style Transfer (HP)100%14930
19.2 IPS
Style Transfer (HP)100%62424
80.2 IPS
Style Transfer (Q)98%5650
7.29 IPS
Style Transfer (Q)98%17976
23.2 IPS
Style Transfer (Q)98%52607
67.8 IPS
Style Transfer (Q)98%115671
149.2 IPS
Image Super-Resolution (SP)100%1494
55.2 IPS
Image Super-Resolution (SP)100%1774
65.5 IPS
Image Super-Resolution (SP)100%3022
111.6 IPS
Image Super-Resolution (SP)100%4275
157.9 IPS
Image Super-Resolution (HP)100%1911
70.6 IPS
Image Super-Resolution (HP)100%1166
43.1 IPS
Image Super-Resolution (HP)100%3020
111.5 IPS
Image Super-Resolution (HP)100%11896
439.2 IPS
Image Super-Resolution (Q)97%1463
54.2 IPS
Image Super-Resolution (Q)99%3013
111.6 IPS
Image Super-Resolution (Q)99%10997
407.3 IPS
Image Super-Resolution (Q)99%17460
646.6 IPS
Text Classification (SP)100%1229
1.64 KIPS
Text Classification (SP)100%1105
1.48 KIPS
Text Classification (SP)100%2778
3.71 KIPS
Text Classification (SP)71%1028
1.49 KIPS
Text Classification (HP)100%1105
1.47 KIPS
Text Classification (HP)100%333
444.7 IPS
Text Classification (HP)100%2783
3.71 KIPS
Text Classification (HP)71%1545
2.24 KIPS
Text Classification (Q)92%390
524.3 IPS
Text Classification (Q)97%1083
1.45 KIPS
Text Classification (Q)92%4816
6.47 KIPS
Text Classification (Q)92%1499
2.01 KIPS
Machine Translation (SP)100%1771
30.5 IPS
Machine Translation (SP)100%1320
22.7 IPS
Machine Translation (SP)100%4745
81.7 IPS
Machine Translation (SP)100%1463
25.2 IPS
Machine Translation (HP)100%2135
36.8 IPS
Machine Translation (HP)100%530
9.14 IPS
Machine Translation (HP)100%4772
82.2 IPS
Machine Translation (HP)96%2510
43.4 IPS
Machine Translation (Q)58%520
12.2 IPS
Machine Translation (Q)65%3117
62.6 IPS
Machine Translation (Q)100%4756
81.9 IPS
Machine Translation (Q)100%1814
31.3 IPS


Code Block
wget https://cdn.geekbench.com/GeekbenchAI-1.3.0-Linux.tar.gz
tar xvf GeekbenchAI-1.3.0-Linux.tar.gz
cd GeekbenchAI-1.3.0-Linux/
./banff


before some drivers added

Code Block
root@server6:/mnt/GeekbenchAI-1.3.0-Linux# ./banff --ai-list
Geekbench AI 1.3.0 : https://www.geekbench.com/ai/

Geekbench AI requires an active internet connection and automatically uploads
benchmark results to the Geekbench Browser.

Framework     | Backend       | Device
 1 TensorFlow Lite |  1 CPU        |  0 Intel Core Ultra 9 185H
 3 ONNX       |  1 CPU        |  0 Intel Core Ultra 9 185H
 4 OpenVINO   |  1 CPU        |  0 Intel(R) Core(TM) Ultra 9 185H


Install

Code Block
# OpenVino
wget https://apt.repos.intel.com/intel-gpg-keys/GPG-PUB-KEY-INTEL-SW-PRODUCTS.PUB
sudo apt-key add GPG-PUB-KEY-INTEL-SW-PRODUCTS.PUB
echo "deb https://apt.repos.intel.com/openvino/2025 ubuntu24 main" | sudo tee /etc/apt/sources.list.d/intel-openvino-2025.list
apt update
apt-cache search openvino
apt install openvino
python3 /usr/share/openvino/samples/python/hello_query_device/hello_query_device.py
 
# NPU driver
wget https://github.com/intel/linux-npu-driver/releases/download/v1.17.0/intel-driver-compiler-npu_1.17.0.20250508-14912879441_ubuntu24.04_amd64.deb
wget https://github.com/intel/linux-npu-driver/releases/download/v1.17.0/intel-fw-npu_1.17.0.20250508-14912879441_ubuntu24.04_amd64.deb
wget https://github.com/intel/linux-npu-driver/releases/download/v1.17.0/intel-level-zero-npu_1.17.0.20250508-14912879441_ubuntu24.04_amd64.deb
dpkg --purge --force-remove-reinstreq intel-driver-compiler-npu intel-fw-npu intel-level-zero-npu
apt update
apt install libtbb12
dpkg -i *.deb
wget https://github.com/oneapi-src/level-zero/releases/download/v1.21.9/level-zero_1.21.9+u24.04_amd64.deb
dpkg -i level-zero*.deb


Code Block
#Geekbench
mkdir Geekbench
cd Geekbench
wget https://cdn.geekbench.com/GeekbenchAI-1.3.0-Linux.tar.gz
tar xvf GeekbenchAI-1.3.0-Linux.tar.gz
cd GeekbenchAI-1.3.0-Linux/
./banff --ai-list
./banff --ai-framework 1 --ai-backend 1 --ai-device 0
./banff --ai-framework 3 --ai-backend 1 --ai-device 0
./banff --ai-framework 4 --ai-backend 1 --ai-device 0
./banff --ai-framework 4 --ai-backend 2 --ai-device 1


Check Available frameworks

Ubuntu 24.04

Code Block
root@server6:/mnt/GeekbenchAI-1.3.0-Linux# ./banff --ai-list
Geekbench AI 1.3.0 : https://www.geekbench.com/ai/

Geekbench AI requires an active internet connection and automatically uploads
benchmark results to the Geekbench Browser.

Framework     | Backend       | Device
 1 TensorFlow Lite |  1 CPU        |  0 Intel Core Ultra 9 185H
 3 ONNX       |  1 CPU        |  0 Intel Core Ultra 9 185H
 4 OpenVINO  
Code Block
root@server6:/mnt/GeekbenchAI-1.3.0-Linux# ./banff --ai-list
Geekbench AI 1.3.0 : https://www.geekbench.com/ai/

Geekbench AI requires an active internet connection and automatically uploads
benchmark results to the Geekbench Browser.

Framework     | Backend       | Device
 1 TensorFlow Lite |  1 CPU        |  0 Intel(R) Core(TM) Ultra 9 185H
 3 ONNX4 OpenVINO   |  2 GPU        |  1 Intel(R) Arc(TM) Graphics (iGPU)



Code Block
root@server6:/mnt/GeekbenchAI-1.3.0-Linux# CPU python3 /usr/share/openvino/samples/python/hello_query_device/hello_query_device.py

[ INFO ] Available devices:
[ INFO |]  0 Intel Core Ultra 9 185H
 4 OpenVINOCPU :
[ INFO ]   |  1 CPU  SUPPORTED_PROPERTIES:
[ INFO ]    |  0 Intel(R) Core(TM) Ultra 9 185H
 4 OpenVINO   |  2 GPU AVAILABLE_DEVICES:
[ INFO ]       |  1 Intel(R) Arc(TM) Graphics (iGPU)

Code Block
root@server6:/mnt/GeekbenchAI-1.3.0-Linux#  python3 /usr/share/openvino/samples/python/hello_query_device/hello_query_device.py

 RANGE_FOR_ASYNC_INFER_REQUESTS: 1, 1, 1
[ INFO ] Available devices:
[ INFO ] CPU :
[ INFO ]        SUPPORTEDRANGE_FOR_PROPERTIESSTREAMS: 1, 22
[ INFO ]                AVAILABLEEXECUTION_DEVICES: CPU
[ INFO ]                RANGEFULL_FOR_ASYNC_INFER_REQUESTS: 1, 1, 1DEVICE_NAME: Intel(R) Core(TM) Ultra 9 185H
[ INFO ]                RANGE_FOR_STREAMS: 1, 22OPTIMIZATION_CAPABILITIES: FP32, INT8, BIN, EXPORT_IMPORT
[ INFO ]                EXECUTIONDEVICE_DEVICESTYPE: CPUType.INTEGRATED
[ INFO ]                FULL_DEVICE_NAMEARCHITECTURE: Intel(R) Core(TM) Ultra 9 185Hintel64
[ INFO ]                OPTIMIZATIONNUM_CAPABILITIESSTREAMS: FP32, INT8, BIN, EXPORT_IMPORT1
[ INFO ]                DEVICEINFERENCE_NUM_TYPETHREADS: Type.INTEGRATED0
[ INFO ]                DEVICEPERF_ARCHITECTURECOUNT: intel64False
[ INFO ]                NUM_STREAMS: 1INFERENCE_PRECISION_HINT: <Type: 'float32'>
[ INFO ]                INFERENCEPERFORMANCE_NUM_THREADSHINT: 0PerformanceMode.LATENCY
[ INFO ]                PERFEXECUTION_MODE_COUNTHINT: FalseExecutionMode.PERFORMANCE
[ INFO ]                INFERENCEPERFORMANCE_HINT_PRECISIONNUM_HINTREQUESTS: <Type: 'float32'>0
[ INFO ]                PERFORMANCEENABLE_CPU_HINTPINNING: PerformanceMode.LATENCYTrue
[ INFO ]                EXECUTIONENABLE_MODECPU_HINTRESERVATION: ExecutionMode.PERFORMANCEFalse
[ INFO ]                PERFORMANCESCHEDULING_HINTCORE_NUM_REQUESTSTYPE: 0SchedulingCoreType.ANY_CORE
[ INFO ]                ENABLEMODEL_CPUDISTRIBUTION_PINNING: TruePOLICY: set()
[ INFO ]                ENABLE_CPUHYPER_RESERVATIONTHREADING: FalseTrue
[ INFO ]                SCHEDULINGDEVICE_CORE_TYPE: SchedulingCoreType.ANY_COREID:
[ INFO ]                MODELCPU_DISTRIBUTIONDENORMALS_POLICYOPTIMIZATION: set()False
[ INFO ]                ENABLELOG_HYPER_THREADINGLEVEL: TrueLevel.NO
[ INFO ]                DEVICE_ID:CPU_SPARSE_WEIGHTS_DECOMPRESSION_RATE: 1.0
[ INFO ]                CPUDYNAMIC_QUANTIZATION_DENORMALSGROUP_OPTIMIZATIONSIZE: False32
[ INFO ]                LOG_LEVEL: Level.NOKV_CACHE_PRECISION: <Type: 'uint8_t'>
[ INFO ]                CPUKEY_SPARSE_WEIGHTS_DECOMPRESSION_RATE: 1.0CACHE_PRECISION: <Type: 'uint8_t'>
[ INFO ]                DYNAMICVALUE_QUANTIZATION_GROUP_SIZE: 32CACHE_PRECISION: <Type: 'uint8_t'>
[ INFO ]                KVKEY_CACHE_GROUP_PRECISIONSIZE: <Type: 'uint8_t'>0
[ INFO ]                KEYVALUE_CACHE_GROUP_PRECISIONSIZE: <Type: 'uint8_t'>0
[ INFO ]
[ INFO ]   GPU :
[ INFO ]        VALUESUPPORTED_CACHE_PRECISION: <Type: 'uint8_t'>PROPERTIES:
[ INFO ]                KEY_CACHE_GROUP_SIZEAVAILABLE_DEVICES: 0
[ INFO ]                VALUERANGE_FOR_CACHEASYNC_GROUPINFER_SIZEREQUESTS: 01, 2, 1
[ INFO ]
[   INFO ] GPU :
[ INFO ]        SUPPORTEDRANGE_FOR_PROPERTIESSTREAMS: 1, 2
[ INFO ]                AVAILABLEOPTIMAL_BATCH_DEVICESSIZE: 01
[ INFO ]                RANGE_FOR_ASYNC_INFER_REQUESTS MAX_BATCH_SIZE: 1, 2, 1
[ INFO ]                RANGE_FOR_STREAMS: 1, 2DEVICE_ARCHITECTURE: GPU: vendor=0x8086 arch=v12.71.4
[ INFO ]                OPTIMALFULL_BATCHDEVICE_SIZE: 1NAME: Intel(R) Arc(TM) Graphics (iGPU)
[ INFO ]                MAXDEVICE_BATCH_SIZEUUID: 18680557d080000000002000000000000
[ INFO ]                DEVICE_ARCHITECTURE: GPU: vendor=0x8086 arch=v12.71.4LUID: 409a0000499a0000
[ INFO ]                FULL_DEVICE_NAME: Intel(R) Arc(TM) Graphics (iGPU)TYPE: Type.INTEGRATED
[ INFO ]                DEVICE_GOPS: {<Type: 'float16'>: 9625.599609375, <Type: 'float32'>: 4812.7998046875, <Type: 'int8_t'>: 19251.19921875, <Type:  DEVICE_UUID: 8680557d080000000002000000000000'uint8_t'>: 19251.19921875}
[ INFO ]                DEVICEOPTIMIZATION_LUIDCAPABILITIES: 409a0000499a0000FP32, BIN, FP16, INT8, EXPORT_IMPORT
[ INFO ]                GPU_DEVICE_TYPETOTAL_MEM_SIZE: Type.INTEGRATED62438608896
[ INFO ]                DEVICE_GOPS: {<Type: 'float16'>: 9625.599609375, <Type: 'float32'>: 4812.7998046875, <Type: 'int8_t'>: 19251.19921875, <Type: 'uint8_t'>: 19251.19921875}        GPU_UARCH_VERSION: 12.71.4
[ INFO ]                OPTIMIZATION_CAPABILITIES: FP32, BIN, FP16, INT8, EXPORT_IMPORTGPU_EXECUTION_UNITS_COUNT: 128
[ INFO ]                GPU_DEVICE_TOTAL_MEM_SIZEMEMORY_STATISTICS: 62438608896{}
[ INFO ]                GPUPERF_UARCH_VERSIONCOUNT: 12.71.4False
[ INFO ]                GPU_EXECUTION_UNITS_COUNTMODEL_PRIORITY: 128Priority.MEDIUM
[ INFO ]                GPU_HOST_MEMORYTASK_STATISTICSPRIORITY: {}Priority.MEDIUM
[ INFO ]                PERFGPU_QUEUE_COUNTPRIORITY: FalsePriority.MEDIUM
[ INFO ]                MODELGPU_QUEUE_PRIORITYTHROTTLE: Priority.MEDIUM
[ INFO ]                GPU_HOSTENABLE_TASKSDPA_PRIORITYOPTIMIZATION: Priority.MEDIUMTrue
[ INFO ]                GPU_QUEUEENABLE_LOOP_PRIORITYUNROLLING: Priority.MEDIUMTrue
[ INFO ]                GPU_QUEUEDISABLE_WINOGRAD_THROTTLECONVOLUTION: Priority.MEDIUMFalse
[ INFO ]                GPU_ENABLE_SDPA_OPTIMIZATION: TrueCACHE_DIR:
[ INFO ]                GPU_ENABLE_LOOP_UNROLLING: TrueCACHE_MODE: CacheMode.OPTIMIZE_SPEED
[ INFO ]                GPU_DISABLE_WINOGRAD_CONVOLUTIONPERFORMANCE_HINT: FalsePerformanceMode.LATENCY
[ INFO ]                CACHEEXECUTION_MODE_DIRHINT: ExecutionMode.PERFORMANCE
[ INFO ]                CACHECOMPILATION_NUM_MODETHREADS: CacheMode.OPTIMIZE_SPEED22
[ INFO ]                PERFORMANCENUM_HINTSTREAMS: PerformanceMode.LATENCY1
[ INFO ]                EXECUTIONPERFORMANCE_HINT_MODENUM_HINTREQUESTS: ExecutionMode.PERFORMANCE0
[ INFO ]                COMPILATIONINFERENCE_NUM_THREADSPRECISION_HINT: <Type: 22'float16'>
[ INFO ]                NUMENABLE_CPU_STREAMSPINNING: 1False
[ INFO ]                PERFORMANCEENABLE_HINTCPU_NUM_REQUESTSRESERVATION: 0False
[ INFO ]                INFERENCEDEVICE_PRECISION_HINTID: <Type: 'float16'>0
[ INFO ]                ENABLEDYNAMIC_QUANTIZATION_CPUGROUP_PINNINGSIZE: False0
[ INFO ]                ENABLEACTIVATIONS_CPUSCALE_RESERVATIONFACTOR: False-1.0
[ INFO ]                DEVICEWEIGHTS_IDPATH: 0
[ INFO ]                DYNAMICCACHE_QUANTIZATIONENCRYPTION_GROUP_SIZECALLBACKS: UNSUPPORTED 0TYPE
[ INFO ]                ACTIVATIONSKV_SCALE_FACTOR: -1.0CACHE_PRECISION: <Type: 'dynamic'>
[ INFO ]                WEIGHTSMODEL_PATHPTR: UNSUPPORTED TYPE
[ INFO ]
[ INFO ] NPU :
[ INFO ]          CACHE_ENCRYPTION_CALLBACKS: UNSUPPORTED TYPESUPPORTED_PROPERTIES:
[ INFO ]                KVAVAILABLE_CACHE_PRECISIONDEVICES: <Type: 'dynamic'>3720
[ INFO ]                MODELCACHE_PTRDIR: UNSUPPORTED TYPE
[ INFO ]
[   INFO ] NPU :
[ INFO ]        SUPPORTEDCOMPILATION_NUM_PROPERTIESTHREADS: 22
[ INFO ]                AVAILABLEDEVICE_DEVICESARCHITECTURE: 3720
[ INFO ]                CACHEDEVICE_DIRGOPS:
[ INFO ]                COMPILATION_NUM_THREADS: 22{<Type: 'bfloat16'>: 0.0, <Type: 'float16'>: 4300.7998046875, <Type: 'float32'>: 0.0, <Type: 'int8_t'>: 8601.599609375, <Type: 'uint8_t'>: 8601.599609375}
[ INFO ]                DEVICE_ARCHITECTUREID: 3720
[ INFO ]                DEVICE_PCI_GOPSINFO: {<Typedomain: 'bfloat16'>0 bus: 0.0, <Typedevice: 'float16'>: 4300.7998046875, <Type: 'float32'>: 0.0, <Type: 'int8_t'>: 8601.599609375, <Type: 'uint8_t'>: 8601.599609375}0xb function: 0}
[ INFO ]                DEVICE_TYPE: Type.INTEGRATED
[ INFO ]                DEVICE_IDUUID: 80d1d11eb73811eab3de0242ac130004
[ INFO ]                DEVICEENABLE_PCICPU_INFOPINNING: {domain: 0 bus: 0 device: 0xb function: 0}False
[ INFO ]                DEVICEEXECUTION_TYPEDEVICES: Type.INTEGRATEDNPU
[ INFO ]                DEVICEEXECUTION_MODE_UUIDHINT: 80d1d11eb73811eab3de0242ac130004ExecutionMode.PERFORMANCE
[ INFO ]                ENABLEFULL_CPUDEVICE_PINNING: FalseNAME: Intel(R) AI Boost
[ INFO ]                EXECUTION_DEVICES: NPUINFERENCE_PRECISION_HINT: <Type: 'float16'>
[ INFO ]                EXECUTIONLOG_MODE_HINTLEVEL: ExecutionModeLevel.PERFORMANCEERR
[ INFO ]                FULLMODEL_DEVICE_NAME: Intel(R) AI BoostPRIORITY: Priority.MEDIUM
[ INFO ]                INFERENCENPU_BYPASS_PRECISIONUMD_HINTCACHING: <Type: 'float16'>False
[ INFO ]                LOG_LEVEL: Level.ERRNPU_COMPILATION_MODE_PARAMS:
[ INFO ]                MODEL_PRIORITYNPU_COMPILER_DYNAMIC_QUANTIZATION: Priority.MEDIUMFalse
[ INFO ]                NPU_BYPASSCOMPILER_UMD_CACHINGVERSION: False458772
[ INFO ]                NPU_COMPILATIONDEFER_MODEWEIGHTS_PARAMSLOAD: False
[ INFO ]                NPU_COMPILERDEVICE_ALLOC_DYNAMICMEM_QUANTIZATIONSIZE: False0
[ INFO ]                NPU_DEVICE_TOTAL_COMPILERMEM_VERSIONSIZE: 45877266926030848
[ INFO ]                NPU_DEFERDRIVER_WEIGHTS_LOADVERSION: False1746727061
[ INFO ]                NPU_DEVICE_ALLOC_MEM_SIZEMAX_TILES: 02
[ INFO ]                NPU_DEVICE_TOTAL_MEM_SIZEQDQ_OPTIMIZATION: 66926030848False
[ INFO ]                NPU_DRIVER_VERSIONTILES: 1746727061-1
[ INFO ]                NPU_MAX_TILESTURBO: 2False
[ INFO ]                NPUNUM_QDQ_OPTIMIZATIONSTREAMS: False1
[ INFO ]                NPU_TILESOPTIMAL_NUMBER_OF_INFER_REQUESTS: -1
[ INFO ]                NPUOPTIMIZATION_TURBOCAPABILITIES: FalseFP16, INT8, EXPORT_IMPORT
[ INFO ]                NUMPERFORMANCE_STREAMSHINT: 1PerformanceMode.LATENCY
[ INFO ]                OPTIMALPERFORMANCE_NUMBERHINT_OF_INFERNUM_REQUESTS: 1
[ INFO ]                OPTIMIZATIONPERF_CAPABILITIESCOUNT: FP16, INT8, EXPORT_IMPORTFalse
[ INFO ]                PERFORMANCE_HINT: PerformanceMode.LATENCYRANGE_FOR_ASYNC_INFER_REQUESTS: 1, 10, 1
[ INFO ]                PERFORMANCERANGE_HINTFOR_NUM_REQUESTSSTREAMS: 1, 4
[ INFO ]                PERFWEIGHTS_COUNTPATH: False
[ INFO ]                RANGE_FOR_ASYNC_INFER_REQUESTS: 1, 10, 1
[ INFO ]                RANGE_FOR_STREAMS: 1, 4
[ INFO ]                WEIGHTS_PATH:
[ INFO ]                WORKLOAD_TYPE: WorkloadType.DEFAULT

WORKLOAD_TYPE: WorkloadType.DEFAULT

Windows 11

Intel Core Ultra 9 185H, 96GB DDR5

BIOS: Turbo Performance

Windows: Balanced

FrameworkBackendSingleHalfQuantizedURL
OpenVinoGPU84961337120718https://browser.geekbench.com/ai/v1/256798
OpenVINOCPU289630036822https://browser.geekbench.com/ai/v1/256805
ONNXDirectML504981993647https://browser.geekbench.com/ai/v1/256809
ONNXCPU371915086668https://browser.geekbench.com/ai/v1/256810