Ubuntu 24.04 Linux

With 2x32GB 4800 MT/s CL38 DDR5 RAM

Framework       | Backend | Device                           | Single | Half | Quant. | Results
TensorFlow Lite | CPU     | Intel Core Ultra 9 185H          | 1971   | 2037 | 1382   | https://browser.geekbench.com/ai/v1/259291
ONNX            | CPU     | Intel Core Ultra 9 185H          | 2138   | 643  | 5634   | https://browser.geekbench.com/ai/v1/259319
OpenVINO        | CPU     | Intel Core Ultra 9 185H          | 4507   | 4541 | 10765  | https://browser.geekbench.com/ai/v1/259321
OpenVINO        | GPU     | Intel(R) Arc(TM) Graphics (iGPU) |        |      |        |

Each cell lists accuracy / score / throughput.

Workload                    | TF Lite                   | ONNX
Image Classification (SP)   | 100% / 1510 / 280.9 IPS   | 100% / 1106 / 205.7 IPS
Image Classification (HP)   | 100% / 1428 / 265.6 IPS   | 100% / 133 / 24.6 IPS
Image Classification (Q)    | 99% / 993 / 185.1 IPS     | 97% / 4177 / 779.5 IPS
Image Segmentation (SP)     | 100% / 2253 / 36.5 IPS    | 100% / 1141 / 18.5 IPS
Image Segmentation (HP)     | 100% / 2243 / 36.4 IPS    | 100% / 209 / 3.38 IPS
Image Segmentation (Q)      | 98% / 1035 / 16.8 IPS     | 99% / 2658 / 43.2 IPS
Pose Estimation (SP)        | 100% / 2357 / 2.75 IPS    | 100% / 3668 / 4.28 IPS
Pose Estimation (HP)        | 100% / 2309 / 2.69 IPS    | 100% / 3007 / 3.51 IPS
Pose Estimation (Q)         | 96% / 3224 / 3.78 IPS     | 94% / 20021 / 23.5 IPS
Object Detection (SP)       | 100% / 1654 / 131.2 IPS   | 100% / 1544 / 122.5 IPS
Object Detection (HP)       | 100% / 1648 / 130.7 IPS   | 100% / 269 / 21.3 IPS
Object Detection (Q)        | 85% / 1024 / 82.4 IPS     | 86% / 4605 / 370.1 IPS
Face Detection (SP)         | 100% / 3071 / 36.5 IPS    | 100% / 2807 / 33.4 IPS
Face Detection (HP)         | 100% / 3060 / 36.4 IPS    | 100% / 314 / 3.73 IPS
Face Detection (Q)          | 97% / 2278 / 27.2 IPS     | 97% / 12436 / 148.3 IPS
Depth Estimation (SP)       | 100% / 2317 / 17.9 IPS    | 100% / 4220 / 32.5 IPS
Depth Estimation (HP)       | 99% / 2507 / 19.3 IPS     | 99% / 1121 / 8.64 IPS
Depth Estimation (Q)        | 63% / 1964 / 18.4 IPS     | 78% / 13848 / 110.5 IPS
Style Transfer (SP)         | 100% / 2892 / 3.72 IPS    | 100% / 9110 / 11.7 IPS
Style Transfer (HP)         | 100% / 2928 / 3.76 IPS    | 100% / 7498 / 9.64 IPS
Style Transfer (Q)          | 98% / 5650 / 7.29 IPS     | 98% / 17976 / 23.2 IPS
Image Super-Resolution (SP) | 100% / 1494 / 55.2 IPS    | 100% / 1774 / 65.5 IPS
Image Super-Resolution (HP) | 100% / 1911 / 70.6 IPS    | 100% / 1166 / 43.1 IPS
Image Super-Resolution (Q)  | 97% / 1463 / 54.2 IPS     | 99% / 3013 / 111.6 IPS
Text Classification (SP)    | 100% / 1229 / 1.64 KIPS   | 100% / 1105 / 1.48 KIPS
Text Classification (HP)    | 100% / 1105 / 1.47 KIPS   | 100% / 333 / 444.7 IPS
Text Classification (Q)     | 92% / 390 / 524.3 IPS     | 97% / 1083 / 1.45 KIPS
Machine Translation (SP)    | 100% / 1771 / 30.5 IPS    | 100% / 1320 / 22.7 IPS
Machine Translation (HP)    | 100% / 2135 / 36.8 IPS    | 100% / 530 / 9.14 IPS
Machine Translation (Q)     | 58% / 520 / 12.2 IPS      | 65% / 3117 / 62.6 IPS

Before the additional drivers were installed:

Code Block
root@server6:/mnt/GeekbenchAI-1.3.0-Linux# ./banff --ai-list
Geekbench AI 1.3.0 : https://www.geekbench.com/ai/

Geekbench AI requires an active internet connection and automatically uploads
benchmark results to the Geekbench Browser.

Framework     | Backend       | Device
 1 TensorFlow Lite |  1 CPU        |  0 Intel Core Ultra 9 185H
 3 ONNX       |  1 CPU        |  0 Intel Core Ultra 9 185H
 4 OpenVINO   |  1 CPU        |  0 Intel(R) Core(TM) Ultra 9 185H

Install

Code Block
# OpenVINO: add Intel's apt repository and install the runtime
wget https://apt.repos.intel.com/intel-gpg-keys/GPG-PUB-KEY-INTEL-SW-PRODUCTS.PUB
sudo apt-key add GPG-PUB-KEY-INTEL-SW-PRODUCTS.PUB
echo "deb https://apt.repos.intel.com/openvino/2025 ubuntu24 main" | sudo tee /etc/apt/sources.list.d/intel-openvino-2025.list
sudo apt update
apt-cache search openvino
sudo apt install openvino
# List the devices OpenVINO can see
python3 /usr/share/openvino/samples/python/hello_query_device/hello_query_device.py

# NPU driver (linux-npu-driver v1.17.0)
wget https://github.com/intel/linux-npu-driver/releases/download/v1.17.0/intel-driver-compiler-npu_1.17.0.20250508-14912879441_ubuntu24.04_amd64.deb
wget https://github.com/intel/linux-npu-driver/releases/download/v1.17.0/intel-fw-npu_1.17.0.20250508-14912879441_ubuntu24.04_amd64.deb
wget https://github.com/intel/linux-npu-driver/releases/download/v1.17.0/intel-level-zero-npu_1.17.0.20250508-14912879441_ubuntu24.04_amd64.deb
# Remove any previously installed NPU packages before installing the new ones
sudo dpkg --purge --force-remove-reinstreq intel-driver-compiler-npu intel-fw-npu intel-level-zero-npu
sudo apt update
sudo apt install libtbb12
sudo dpkg -i *.deb
# Level Zero loader
wget https://github.com/oneapi-src/level-zero/releases/download/v1.21.9/level-zero_1.21.9+u24.04_amd64.deb
sudo dpkg -i level-zero*.deb

Code Block
#Geekbench
mkdir Geekbench
cd Geekbench
wget https://cdn.geekbench.com/GeekbenchAI-1.3.0-Linux.tar.gz
tar xvf GeekbenchAI-1.3.0-Linux.tar.gz
cd GeekbenchAI-1.3.0-Linux/

Check Available frameworks

Ubuntu 24.04

...

With 2x64GB 5600 MT/s CL46 DDR5 RAM

governor: performance, PL1=70W, PL2=80W
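
The page does not record how the governor and power limits were set. As a minimal sketch for checking them before a run, the snippet below reads the standard cpufreq and powercap sysfs files. It assumes the `intel_rapl` powercap driver is loaded and that package 0 is exposed as `intel-rapl:0`; the helper name `uw_to_w` is made up for this example.

```shell
# Convert microwatts (the unit used by the powercap sysfs files) to whole watts.
uw_to_w() { echo $(( $1 / 1000000 )); }

gov=/sys/devices/system/cpu/cpu0/cpufreq/scaling_governor
rapl=/sys/class/powercap/intel-rapl:0   # assumed location of the package-0 RAPL domain

# Each file is read only if it exists, so the snippet is harmless on other machines.
if [ -r "$gov" ]; then
  echo "governor: $(cat "$gov")"
fi
for c in 0 1; do
  f="$rapl/constraint_${c}_power_limit_uw"
  if [ -r "$f" ]; then
    # constraint_0 is normally the long-term limit (PL1), constraint_1 the short-term limit (PL2).
    echo "$(cat "$rapl/constraint_${c}_name"): $(uw_to_w "$(cat "$f")") W"
  fi
done
```

On a machine configured as described above, the two constraint lines should report 70 W and 80 W.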

Framework       | Backend | Device                           | Single | Half | Quant. | Results
TensorFlow Lite | CPU     | Intel Core Ultra 9 185H          | 2174   | 2165 | 1413   | https://browser.geekbench.com/ai/v1/259993
ONNX            | CPU     | Intel Core Ultra 9 185H          | 2304   | 716  | 5762   | https://browser.geekbench.com/ai/v1/260000
OpenVINO        | CPU     | Intel Core Ultra 9 185H          | 4631   | 4682 | 11063  | https://browser.geekbench.com/ai/v1/260002
OpenVINO        | GPU     | Intel(R) Arc(TM) Graphics (iGPU) | 6196   | 9348 | 13454  | https://browser.geekbench.com/ai/v1/260003
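
To compare the frameworks at a glance, the overall scores can be normalised to the TensorFlow Lite CPU run. The sketch below does that with `awk`; the scores are as read from the summary table above, and the row labels and the function name `relative_scores` are made up for this example.

```shell
# Overall scores read from the summary table: label, Single, Half, Quantized.
scores='TFLite-CPU 2174 2165 1413
ONNX-CPU 2304 716 5762
OpenVINO-CPU 4631 4682 11063
OpenVINO-GPU 6196 9348 13454'

# Print every score as a multiple of the first row (TensorFlow Lite on CPU).
relative_scores() {
  awk 'NR == 1 { s = $2; h = $3; q = $4 }
       { printf "%-13s %.2fx %.2fx %.2fx\n", $1, $2 / s, $3 / h, $4 / q }'
}

printf '%s\n' "$scores" | relative_scores
```

With these inputs the OpenVINO GPU row comes out at roughly 2.85x (single), 4.32x (half) and 9.52x (quantized) of the TensorFlow Lite baseline.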


Each cell lists accuracy / score / throughput.

Workload                    | TF Lite                   | ONNX                      | OpenVINO-CPU              | OpenVINO-GPU
Image Classification (SP)   | 100% / 1510 / 280.9 IPS   | 100% / 1106 / 205.7 IPS   | 100% / 2587 / 481.0 IPS   | 100% / 2587 / 481.0 IPS
Image Classification (HP)   | 100% / 1428 / 265.6 IPS   | 100% / 133 / 24.6 IPS     | 100% / 2716 / 505.1 IPS   | 100% / 4288 / 797.5 IPS
Image Classification (Q)    | 99% / 993 / 185.1 IPS     | 97% / 4177 / 779.5 IPS    | 100% / 6637 / 1.23 KIPS   | 100% / 5112 / 950.6 IPS
Image Segmentation (SP)     | 100% / 2253 / 36.5 IPS    | 100% / 1141 / 18.5 IPS    | 100% / 3453 / 56.0 IPS    | 100% / 3131 / 50.7 IPS
Image Segmentation (HP)     | 100% / 2243 / 36.4 IPS    | 100% / 209 / 3.38 IPS     | 100% / 3468 / 56.2 IPS    | 100% / 6965 / 112.9 IPS
Image Segmentation (Q)      | 98% / 1035 / 16.8 IPS     | 99% / 2658 / 43.2 IPS     | 99% / 6945 / 112.6 IPS    | 99% / 9262 / 150.6 IPS
Pose Estimation (SP)        | 100% / 2357 / 2.75 IPS    | 100% / 3668 / 4.28 IPS    | 100% / 5030 / 5.87 IPS    | 100% / 22411 / 26.1 IPS
Pose Estimation (HP)        | 100% / 2309 / 2.69 IPS    | 100% / 3007 / 3.51 IPS    | 100% / 5100 / 5.95 IPS    | 99% / 20212 / 23.7 IPS
Pose Estimation (Q)         | 96% / 3224 / 3.78 IPS     | 94% / 20021 / 23.5 IPS    | 96% / 18872 / 22.1 IPS    | 97% / 44712 / 52.3 IPS
Object Detection (SP)       | 100% / 1654 / 131.2 IPS   | 100% / 1544 / 122.5 IPS   | 100% / 2648 / 210.0 IPS   | 100% / 2429 / 192.7 IPS
Object Detection (HP)       | 100% / 1648 / 130.7 IPS   | 100% / 269 / 21.3 IPS     | 100% / 2652 / 210.4 IPS   | 100% / 3775 / 299.4 IPS
Object Detection (Q)        | 85% / 1024 / 82.4 IPS     | 86% / 4605 / 370.1 IPS    | 88% / 6968 / 558.5 IPS    | 88% / 6319 / 506.4 IPS
Face Detection (SP)         | 100% / 3071 / 36.5 IPS    | 100% / 2807 / 33.4 IPS    | 100% / 7234 / 86.0 IPS    | 100% / 5692 / 67.6 IPS
Face Detection (HP)         | 100% / 3060 / 36.4 IPS    | 100% / 314 / 3.73 IPS     | 100% / 7248 / 86.1 IPS    | 100% / 11061 / 131.4 IPS
Face Detection (Q)          | 97% / 2278 / 27.2 IPS     | 97% / 12436 / 148.3 IPS   | 100% / 14581 / 173.3 IPS  | 100% / 16836 / 200.0 IPS
Depth Estimation (SP)       | 100% / 2317 / 17.9 IPS    | 100% / 4220 / 32.5 IPS    | 100% / 6716 / 51.7 IPS    | 100% / 12001 / 92.5 IPS
Depth Estimation (HP)       | 99% / 2507 / 19.3 IPS     | 99% / 1121 / 8.64 IPS     | 99% / 6752 / 52.0 IPS     | 98% / 20362 / 157.4 IPS
Depth Estimation (Q)        | 63% / 1964 / 18.4 IPS     | 78% / 13848 / 110.5 IPS   | 89% / 17848 / 138.8 IPS   | 89% / 22566 / 175.4 IPS
Style Transfer (SP)         | 100% / 2892 / 3.72 IPS    | 100% / 9110 / 11.7 IPS    | 100% / 15042 / 19.3 IPS   | 100% / 46431 / 59.7 IPS
Style Transfer (HP)         | 100% / 2928 / 3.76 IPS    | 100% / 7498 / 9.64 IPS    | 100% / 14930 / 19.2 IPS   | 100% / 62424 / 80.2 IPS
Style Transfer (Q)          | 98% / 5650 / 7.29 IPS     | 98% / 17976 / 23.2 IPS    | 98% / 52607 / 67.8 IPS    | 98% / 115671 / 149.2 IPS
Image Super-Resolution (SP) | 100% / 1494 / 55.2 IPS    | 100% / 1774 / 65.5 IPS    | 100% / 3022 / 111.6 IPS   | 100% / 4275 / 157.9 IPS
Image Super-Resolution (HP) | 100% / 1911 / 70.6 IPS    | 100% / 1166 / 43.1 IPS    | 100% / 3020 / 111.5 IPS   | 100% / 11896 / 439.2 IPS
Image Super-Resolution (Q)  | 97% / 1463 / 54.2 IPS     | 99% / 3013 / 111.6 IPS    | 99% / 10997 / 407.3 IPS   | 99% / 17460 / 646.6 IPS
Text Classification (SP)    | 100% / 1229 / 1.64 KIPS   | 100% / 1105 / 1.48 KIPS   | 100% / 2778 / 3.71 KIPS   | 71% / 1028 / 1.49 KIPS
Text Classification (HP)    | 100% / 1105 / 1.47 KIPS   | 100% / 333 / 444.7 IPS    | 100% / 2783 / 3.71 KIPS   | 71% / 1545 / 2.24 KIPS
Text Classification (Q)     | 92% / 390 / 524.3 IPS     | 97% / 1083 / 1.45 KIPS    | 92% / 4816 / 6.47 KIPS    | 92% / 1499 / 2.01 KIPS
Machine Translation (SP)    | 100% / 1771 / 30.5 IPS    | 100% / 1320 / 22.7 IPS    | 100% / 4745 / 81.7 IPS    | 100% / 1463 / 25.2 IPS
Machine Translation (HP)    | 100% / 2135 / 36.8 IPS    | 100% / 530 / 9.14 IPS     | 100% / 4772 / 82.2 IPS    | 96% / 2510 / 43.4 IPS
Machine Translation (Q)     | 58% / 520 / 12.2 IPS      | 65% / 3117 / 62.6 IPS     | 100% / 4756 / 81.9 IPS    | 100% / 1814 / 31.3 IPS


Code Block
wget https://cdn.geekbench.com/GeekbenchAI-1.3.0-Linux.tar.gz
tar xvf GeekbenchAI-1.3.0-Linux.tar.gz
cd GeekbenchAI-1.3.0-Linux/
./banff


Before the additional drivers were installed:

Code Block
root@server6:/mnt/GeekbenchAI-1.3.0-Linux# ./banff --ai-list
Geekbench AI 1.3.0 : https://www.geekbench.com/ai/

Geekbench AI requires an active internet connection and automatically uploads
benchmark results to the Geekbench Browser.

Framework     | Backend       | Device
 1 TensorFlow Lite |  1 CPU        |  0 Intel Core Ultra 9 185H
 3 ONNX       |  1 CPU        |  0 Intel Core Ultra 9 185H
 4 OpenVINO   |  1 CPU        |  0 Intel(R) Core(TM) Ultra 9 185H


Install

Code Block
# Geekbench AI: download and unpack
mkdir Geekbench
cd Geekbench
wget https://cdn.geekbench.com/GeekbenchAI-1.3.0-Linux.tar.gz
tar xvf GeekbenchAI-1.3.0-Linux.tar.gz
cd GeekbenchAI-1.3.0-Linux/
# List the framework, backend, and device IDs available on this machine
./banff --ai-list
# TensorFlow Lite, CPU
./banff --ai-framework 1 --ai-backend 1 --ai-device 0
# ONNX, CPU
./banff --ai-framework 3 --ai-backend 1 --ai-device 0
# OpenVINO, CPU
./banff --ai-framework 4 --ai-backend 1 --ai-device 0
# OpenVINO, GPU (Intel Arc iGPU)
./banff --ai-framework 4 --ai-backend 2 --ai-device 1


Check Available frameworks

Ubuntu 24.04

Code Block
root@server6:/mnt/GeekbenchAI-1.3.0-Linux# ./banff --ai-list
Geekbench AI 1.3.0 : https://www.geekbench.com/ai/

Geekbench AI requires an active internet connection and automatically uploads
benchmark results to the Geekbench Browser.

Framework     | Backend       | Device
 1 TensorFlow Lite |  1 CPU        |  0 Intel Core Ultra 9 185H
 3 ONNX       |  1 CPU        |  0 Intel Core Ultra 9 185H
 4 OpenVINO   |  1 CPU        |  0 Intel(R) Core(TM) Ultra 9 185H
 4 OpenVINO   |  2 GPU        |  1 Intel(R) Arc(TM) Graphics (iGPU)
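
The three leading numbers in each row of the listing are the values passed to `--ai-framework`, `--ai-backend` and `--ai-device`. As a rough sketch, the mapping can be automated with `awk`; the function name `list_to_commands` is made up for this example, and the sample listing is copied from the output above rather than produced by a live run.

```shell
# Sample listing in the format printed by ./banff --ai-list.
listing='Framework     | Backend       | Device
 1 TensorFlow Lite |  1 CPU        |  0 Intel Core Ultra 9 185H
 3 ONNX       |  1 CPU        |  0 Intel Core Ultra 9 185H
 4 OpenVINO   |  1 CPU        |  0 Intel(R) Core(TM) Ultra 9 185H
 4 OpenVINO   |  2 GPU        |  1 Intel(R) Arc(TM) Graphics (iGPU)'

# Turn every data row into the banff command line that selects it.
# The header row is skipped because its first field is not numeric.
list_to_commands() {
  awk -F'[|]' 'NF == 3 {
    split($1, fw, " "); split($2, be, " "); split($3, dev, " ")
    if (fw[1] ~ /^[0-9]+$/) { print "./banff --ai-framework " fw[1] " --ai-backend " be[1] " --ai-device " dev[1] }
  }'
}

printf '%s\n' "$listing" | list_to_commands
```

The function only prints the commands; piping `./banff --ai-list` into it and reviewing the result before running anything keeps the benchmark uploads deliberate.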



Code Block
root@server6:/mnt/GeekbenchAI-1.3.0-Linux# python3 /usr/share/openvino/samples/python/hello_query_device/hello_query_device.py

[ INFO ] Available devices:
[ INFO ] CPU :
[ INFO ]        SUPPORTED_PROPERTIES:
[ INFO ]                AVAILABLE_DEVICES:
[ INFO ]                RANGE_FOR_ASYNC_INFER_REQUESTS: 1, 1, 1
[ INFO ]                RANGE_FOR_STREAMS: 1, 22
[ INFO ]                EXECUTION_DEVICES: CPU
[ INFO ]                FULL_DEVICE_NAME: Intel(R) Core(TM) Ultra 9 185H
[ INFO ]                OPTIMIZATION_CAPABILITIES: FP32, INT8, BIN, EXPORT_IMPORT
[ INFO ]                DEVICE_TYPE: Type.INTEGRATED
[ INFO ]                DEVICE_ARCHITECTURE: intel64
[ INFO ]                NUM_STREAMS: 1
[ INFO ]                INFERENCE_NUM_THREADS: 0
[ INFO ]                PERF_COUNT: False
[ INFO ]                INFERENCE_PRECISION_HINT: <Type: 'float32'>
[ INFO ]                PERFORMANCE_HINT: PerformanceMode.LATENCY
[ INFO ]                EXECUTION_MODE_HINT: ExecutionMode.PERFORMANCE
[ INFO ]                PERFORMANCE_HINT_NUM_REQUESTS: 0
[ INFO ]                ENABLE_CPU_PINNING: True
[ INFO ]                ENABLE_CPU_RESERVATION: False
[ INFO ]                SCHEDULING_CORE_TYPE: SchedulingCoreType.ANY_CORE
[ INFO ]                MODEL_DISTRIBUTION_POLICY: set()
[ INFO ]                ENABLE_HYPER_THREADING: True
[ INFO ]                DEVICE_ID:
[ INFO ]                CPU_DENORMALS_OPTIMIZATION: False
[ INFO ]                LOG_LEVEL: Level.NO
[ INFO ]                CPU_SPARSE_WEIGHTS_DECOMPRESSION_RATE: 1.0
[ INFO ]                DYNAMIC_QUANTIZATION_GROUP_SIZE: 32
[ INFO ]                KV_CACHE_PRECISION: <Type: 'uint8_t'>
[ INFO ]                KEY_CACHE_PRECISION: <Type: 'uint8_t'>
[ INFO ]                VALUE_CACHE_PRECISION: <Type: 'uint8_t'>
[ INFO ]                KEY_CACHE_GROUP_SIZE: 0
[ INFO ]                VALUE_CACHE_GROUP_SIZE: 0
[ INFO ]
[ INFO ] GPU :
[ INFO ]        SUPPORTED_PROPERTIES:
[ INFO ]                AVAILABLE_DEVICES: 0
[ INFO ]                RANGE_FOR_ASYNC_INFER_REQUESTS: 1, 2, 1
[ INFO ]                RANGE_FOR_STREAMS: 1, 2
[ INFO ]                OPTIMAL_BATCH_SIZE: 1
[ INFO ]                MAX_BATCH_SIZE: 1
[ INFO ]                DEVICE_ARCHITECTURE: GPU: vendor=0x8086 arch=v12.71.4
[ INFO ]                FULL_DEVICE_NAME: Intel(R) Arc(TM) Graphics (iGPU)
[ INFO ]                DEVICE_UUID: 8680557d080000000002000000000000
[ INFO ]                DEVICE_LUID: 409a0000499a0000
[ INFO ]                DEVICE_TYPE: Type.INTEGRATED
[ INFO ]                DEVICE_GOPS: {<Type: 'float16'>: 9625.599609375, <Type: 'float32'>: 4812.7998046875, <Type: 'int8_t'>: 19251.19921875, <Type: 'uint8_t'>: 19251.19921875}
[ INFO ]                OPTIMIZATION_CAPABILITIES: FP32, BIN, FP16, INT8, EXPORT_IMPORT
[ INFO ]                GPU_DEVICE_TOTAL_MEM_SIZE: 62438608896
[ INFO ]                GPU_UARCH_VERSION: 12.71.4
[ INFO ]                GPU_EXECUTION_UNITS_COUNT: 128
[ INFO ]                GPU_MEMORY_STATISTICS: {}
[ INFO ]                PERF_COUNT: False
[ INFO ]                MODEL_PRIORITY: Priority.MEDIUM
[ INFO ]                GPU_HOST_TASK_PRIORITY: Priority.MEDIUM
[ INFO ]                GPU_QUEUE_PRIORITY: Priority.MEDIUM
[ INFO ]                GPU_QUEUE_THROTTLE: Priority.MEDIUM
[ INFO ]                GPU_ENABLE_SDPA_OPTIMIZATION: True
[ INFO ]                GPU_ENABLE_LOOP_UNROLLING: True
[ INFO ]                GPU_DISABLE_WINOGRAD_CONVOLUTION: False
[ INFO ]                CACHE_DIR:
[ INFO ]                CACHE_MODE: CacheMode.OPTIMIZE_SPEED
[ INFO ]                PERFORMANCE_HINT: PerformanceMode.LATENCY
[ INFO ]                EXECUTION_MODE_HINT: ExecutionMode.PERFORMANCE
[ INFO ]                COMPILATION_NUM_THREADS: 22
[ INFO ]                NUM_STREAMS: 1
[ INFO ]                PERFORMANCE_HINT_NUM_REQUESTS: 0
[ INFO ]                INFERENCE_PRECISION_HINT: <Type: 'float16'>
[ INFO ]                ENABLE_CPU_PINNING: False
[ INFO ]                ENABLE_CPU_RESERVATION: False
[ INFO ]                DEVICE_ID: 0
[ INFO ]                DYNAMIC_QUANTIZATION_GROUP_SIZE: 0
[ INFO ]                ACTIVATIONS_SCALE_FACTOR: -1.0
[ INFO ]                WEIGHTS_PATH:
[ INFO ]                CACHE_ENCRYPTION_CALLBACKS: UNSUPPORTED TYPE
[ INFO ]                KV_CACHE_PRECISION: <Type: 'dynamic'>
[ INFO ]                MODEL_PTR: UNSUPPORTED TYPE
[ INFO ]
[ INFO ] NPU :
[ INFO ]        SUPPORTED_PROPERTIES:
[ INFO ]                AVAILABLE_DEVICES: 3720
[ INFO ]                CACHE_DIR:
[ INFO ]                COMPILATION_NUM_THREADS: 22
[ INFO ]                DEVICE_ARCHITECTURE: 3720
[ INFO ]                DEVICE_GOPS: {<Type: 'bfloat16'>: 0.0, <Type: 'float16'>: 4300.7998046875, <Type: 'float32'>: 0.0, <Type: 'int8_t'>: 8601.599609375, <Type: 'uint8_t'>: 8601.599609375}
[ INFO ]                DEVICE_ID:
[ INFO ]                DEVICE_PCI_INFO: {domain: 0 bus: 0 device: 0xb function: 0}
[ INFO ]                DEVICE_TYPE: Type.INTEGRATED
[ INFO ]                DEVICE_UUID: 80d1d11eb73811eab3de0242ac130004
[ INFO ]                ENABLE_CPU_PINNING: False
[ INFO ]                EXECUTION_DEVICES: NPU
[ INFO ]                EXECUTION_MODE_HINT: ExecutionMode.PERFORMANCE
[ INFO ]                FULL_DEVICE_NAME: Intel(R) AI Boost
[ INFO ]                INFERENCE_PRECISION_HINT: <Type: 'float16'>
[ INFO ]                LOG_LEVEL: Level.ERR
[ INFO ]                MODEL_PRIORITY: Priority.MEDIUM
[ INFO ]                NPU_BYPASS_UMD_CACHING: False
[ INFO ]                NPU_COMPILATION_MODE_PARAMS:
[ INFO ]                NPU_COMPILER_DYNAMIC_QUANTIZATION: False
[ INFO ]                NPU_COMPILER_VERSION: 458772
[ INFO ]                NPU_DEFER_WEIGHTS_LOAD: False
[ INFO ]                NPU_DEVICE_ALLOC_MEM_SIZE: 0
[ INFO ]                NPU_DEVICE_TOTAL_MEM_SIZE: 66926030848
[ INFO ]                NPU_DRIVER_VERSION: 1746727061
[ INFO ]                NPU_MAX_TILES: 2
[ INFO ]                NPU_QDQ_OPTIMIZATION: False
[ INFO ]                NPU_TILES: -1
[ INFO ]                NPU_TURBO: False
[ INFO ]                NUM_STREAMS: 1
[ INFO ]                OPTIMAL_NUMBER_OF_INFER_REQUESTS: 1
[ INFO ]                OPTIMIZATION_CAPABILITIES: FP16, INT8, EXPORT_IMPORT
[ INFO ]                PERFORMANCE_HINT: PerformanceMode.LATENCY
[ INFO ]                PERFORMANCE_HINT_NUM_REQUESTS: 1
[ INFO ]                PERF_COUNT: False
[ INFO ]                RANGE_FOR_ASYNC_INFER_REQUESTS: 1, 10, 1
[ INFO ]                RANGE_FOR_STREAMS: 1, 4
[ INFO ]                WEIGHTS_PATH:
[ INFO ]                WORKLOAD_TYPE: WorkloadType.DEFAULT
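
The full property dump is long. If only the detected devices matter, it can be reduced to one line per device. The filter below is a minimal sketch that assumes the log layout shown above (a `[ INFO ] <DEVICE> :` header followed by a `FULL_DEVICE_NAME` line); the function name `device_names` is made up for this example.

```shell
# Print "<device>: <full name>" for each device section of the sample's output.
device_names() {
  awk '/^\[ INFO \] [A-Z]+ :/ { dev = $4 }
       /FULL_DEVICE_NAME:/    { sub(/.*FULL_DEVICE_NAME: /, ""); print dev ": " $0 }'
}

# Excerpt of the output above, used here as sample input.
device_names <<'EOF'
[ INFO ] Available devices:
[ INFO ] CPU :
[ INFO ]                FULL_DEVICE_NAME: Intel(R) Core(TM) Ultra 9 185H
[ INFO ] GPU :
[ INFO ]                FULL_DEVICE_NAME: Intel(R) Arc(TM) Graphics (iGPU)
[ INFO ] NPU :
[ INFO ]                FULL_DEVICE_NAME: Intel(R) AI Boost
EOF
```

Against a live system the same filter can be fed from the sample directly: `python3 /usr/share/openvino/samples/python/hello_query_device/hello_query_device.py 2>&1 | device_names`.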

Windows 11

Intel Core Ultra 9 185H, 96GB DDR5

BIOS: Turbo Performance

Windows: Balanced

Framework | Backend  | Single | Half  | Quantized | URL
OpenVINO  | GPU      | 8496   | 13371 | 20718     | https://browser.geekbench.com/ai/v1/256798
OpenVINO  | CPU      | 2896   | 3003  | 6822      | https://browser.geekbench.com/ai/v1/256805
ONNX      | DirectML | 5049   | 8199  | 3647      | https://browser.geekbench.com/ai/v1/256809
ONNX      | CPU      | 3719   | 1508  | 6668      | https://browser.geekbench.com/ai/v1/256810