[2026-04-08 07:40:39.810307 INFO duck_llm] This is an info log [2026-04-08 07:40:39.810342 WARN duck_llm] This is a warning log [2026-04-08 07:40:39.810344 ERROR duck_llm] This is an error log [2026-04-08 07:40:39.810713 INFO utils] Selected DPDK lcores: master=0, workers=[2, 4, 6, 8], all_performance_core_representatives=[0, 2, 4, 6, 8, 10, 12, 14] EAL: Detected CPU lcores: 32 EAL: Detected NUMA nodes: 1 EAL: Detected shared linkage of DPDK EAL: Multi-process socket /var/run/dpdk/rte/mp_socket EAL: Selected IOVA mode 'VA' EAL: VFIO support initialized EAL: Using IOMMU type 1 (Type 1) ICE_INIT: ice_load_pkg_type(): Active package is: 1.3.36.0, ICE OS Default Package (single VLAN mode) ICE_INIT: ice_load_pkg_type(): Active package is: 1.3.36.0, ICE OS Default Package (single VLAN mode) ICE_INIT: ice_load_pkg_type(): Active package is: 1.3.36.0, ICE OS Default Package (single VLAN mode) ICE_INIT: ice_load_pkg_type(): Active package is: 1.3.36.0, ICE OS Default Package (single VLAN mode) [2026-04-08 07:40:41.847267 INFO dpdk_workers] DPDK initialized successfully. Found 4 ports. 
[2026-04-08 07:40:41.847284 INFO dpdk_workers] Port 0 device name: 0000:01:00.0 [2026-04-08 07:40:41.847286 INFO dpdk_workers] Port 0 IP address: 10.21.1.1 [2026-04-08 07:40:41.847288 INFO dpdk_workers] Port 0 Broadcast address: 10.21.1.255 [2026-04-08 07:40:41.847290 INFO dpdk_workers] Port 1 device name: 0000:01:00.1 [2026-04-08 07:40:41.847292 INFO dpdk_workers] Port 1 IP address: 10.21.2.1 [2026-04-08 07:40:41.847293 INFO dpdk_workers] Port 1 Broadcast address: 10.21.2.255 [2026-04-08 07:40:41.847295 INFO dpdk_workers] Port 2 device name: 0000:01:00.2 [2026-04-08 07:40:41.847296 INFO dpdk_workers] Port 2 IP address: 10.21.3.1 [2026-04-08 07:40:41.847298 INFO dpdk_workers] Port 2 Broadcast address: 10.21.3.255 [2026-04-08 07:40:41.847299 INFO dpdk_workers] Port 3 device name: 0000:01:00.3 [2026-04-08 07:40:41.847301 INFO dpdk_workers] Port 3 IP address: 10.21.4.1 [2026-04-08 07:40:41.847302 INFO dpdk_workers] Port 3 Broadcast address: 10.21.4.255 [2026-04-08 07:40:41.847304 INFO dpdk_workers] Available netifs list: [(10.21.1.255, 0, 10.21.1.1), (10.21.2.255, 1, 10.21.2.1), (10.21.3.255, 2, 10.21.3.1), (10.21.4.255, 3, 10.21.4.1)] [2026-04-08 07:40:41.847309 INFO dpdk_workers] Starting worker #0: (bcast_ip: 10.21.1.255, port_id: 0, lcore_id: 2, host_ip: 10.21.1.1) [2026-04-08 07:40:41.847336 INFO dpdk_workers] Starting worker #1: (bcast_ip: 10.21.2.255, port_id: 1, lcore_id: 4, host_ip: 10.21.2.1) [2026-04-08 07:40:41.847362 INFO dpdk_workers] Initializing worker port 0 on lcore 2... [2026-04-08 07:40:41.848780 INFO dpdk_workers] Starting worker #2: (bcast_ip: 10.21.3.255, port_id: 2, lcore_id: 6, host_ip: 10.21.3.1) [2026-04-08 07:40:41.848804 INFO dpdk_workers] Starting worker #3: (bcast_ip: 10.21.4.255, port_id: 3, lcore_id: 8, host_ip: 10.21.4.1) [2026-04-08 07:40:41.848830 INFO dpdk_workers] Initializing worker port 1 on lcore 4... [2026-04-08 07:40:41.850811 INFO dpdk_workers] Initializing worker port 2 on lcore 6... 
[2026-04-08 07:40:41.852795 INFO dpdk_workers] Initializing worker port 3 on lcore 8... ICE_DRIVER: ice_set_rx_function(): Using Vector AVX2 (port 0). ICE_DRIVER: ice_set_rx_function(): Using Vector AVX2 (port 1). ICE_DRIVER: ice_set_rx_function(): Using Vector AVX2 (port 2). ICE_DRIVER: ice_set_rx_function(): Using Vector AVX2 (port 3). [2026-04-08 07:40:45.502875 INFO dpdk_workers] Worker port 1 initialized successfully. [2026-04-08 07:40:45.505696 INFO dpdk_workers] Worker port 3 initialized successfully. [2026-04-08 07:40:46.316961 INFO dpdk_workers] Worker port 0 initialized successfully. [2026-04-08 07:40:46.317866 INFO dpdk_workers] Worker port 2 initialized successfully. [2026-04-08 07:40:46.317884 INFO dpdk_workers] Workers initialized successfully. 4 workers running. [2026-04-08 07:40:46.318122 INFO utils] Binding master thread to cores (excluding workers): [0, 1, 3, 5, 7, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31] [2026-04-08 07:40:46.318133 INFO utils] set_thread_affinity(tid 1355235, cores [0, 1, 3, 5, 7, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31]): 0 [2026-04-08 07:40:46.319114 INFO dpdk_workers] Run command Ping all time: send 1.0 us, recv 972.4 us [2026-04-08 07:40:46.369190 INFO dpdk_workers] Run command Ping all time: send 0.4 us, recv 0.3 us [2026-04-08 07:40:46.419246 INFO dpdk_workers] Run command Ping all time: send 0.3 us, recv 0.4 us [2026-04-08 07:40:46.469302 INFO dpdk_workers] Run command Ping all time: send 0.3 us, recv 0.4 us [2026-04-08 07:40:46.519371 INFO dpdk_workers] Run command Ping all time: send 1.1 us, recv 1.1 us [2026-04-08 07:40:46.569440 INFO dpdk_workers] Run command Ping all time: send 0.3 us, recv 0.4 us [2026-04-08 07:40:46.619496 INFO dpdk_workers] Run command Ping all time: send 0.3 us, recv 0.4 us [2026-04-08 07:40:46.669552 INFO dpdk_workers] Run command Ping all time: send 0.2 us, recv 0.3 us [2026-04-08 07:40:46.719608 
INFO dpdk_workers] Run command Ping all time: send 0.2 us, recv 0.4 us [2026-04-08 07:40:46.769676 INFO dpdk_workers] Run command Ping all time: send 1.2 us, recv 1.3 us [2026-04-08 07:40:46.826059 INFO dpdk_workers] Found 32 ducks in duck-ips-multi-netifs.txt [2026-04-08 07:40:46.826063 INFO dpdk_workers] Duck #0: 10.21.1.101 (bcast_ip: 10.21.1.255) [2026-04-08 07:40:46.826065 INFO dpdk_workers] Duck #1: 10.21.1.102 (bcast_ip: 10.21.1.255) [2026-04-08 07:40:46.826067 INFO dpdk_workers] Duck #2: 10.21.1.103 (bcast_ip: 10.21.1.255) [2026-04-08 07:40:46.826069 INFO dpdk_workers] Duck #3: 10.21.1.104 (bcast_ip: 10.21.1.255) [2026-04-08 07:40:46.826071 INFO dpdk_workers] Duck #4: 10.21.1.105 (bcast_ip: 10.21.1.255) [2026-04-08 07:40:46.826073 INFO dpdk_workers] Duck #5: 10.21.1.106 (bcast_ip: 10.21.1.255) [2026-04-08 07:40:46.826075 INFO dpdk_workers] Duck #6: 10.21.1.107 (bcast_ip: 10.21.1.255) [2026-04-08 07:40:46.826077 INFO dpdk_workers] Duck #7: 10.21.1.108 (bcast_ip: 10.21.1.255) [2026-04-08 07:40:46.826079 INFO dpdk_workers] Duck #8: 10.21.2.101 (bcast_ip: 10.21.2.255) [2026-04-08 07:40:46.826081 INFO dpdk_workers] Duck #9: 10.21.2.102 (bcast_ip: 10.21.2.255) [2026-04-08 07:40:46.826083 INFO dpdk_workers] Duck #10: 10.21.2.103 (bcast_ip: 10.21.2.255) [2026-04-08 07:40:46.826085 INFO dpdk_workers] Duck #11: 10.21.2.104 (bcast_ip: 10.21.2.255) [2026-04-08 07:40:46.826087 INFO dpdk_workers] Duck #12: 10.21.2.105 (bcast_ip: 10.21.2.255) [2026-04-08 07:40:46.826089 INFO dpdk_workers] Duck #13: 10.21.2.106 (bcast_ip: 10.21.2.255) [2026-04-08 07:40:46.826091 INFO dpdk_workers] Duck #14: 10.21.2.107 (bcast_ip: 10.21.2.255) [2026-04-08 07:40:46.826092 INFO dpdk_workers] Duck #15: 10.21.2.108 (bcast_ip: 10.21.2.255) [2026-04-08 07:40:46.826094 INFO dpdk_workers] Duck #16: 10.21.3.101 (bcast_ip: 10.21.3.255) [2026-04-08 07:40:46.826096 INFO dpdk_workers] Duck #17: 10.21.3.102 (bcast_ip: 10.21.3.255) [2026-04-08 07:40:46.826098 INFO dpdk_workers] Duck #18: 10.21.3.103 
(bcast_ip: 10.21.3.255) [2026-04-08 07:40:46.826100 INFO dpdk_workers] Duck #19: 10.21.3.104 (bcast_ip: 10.21.3.255) [2026-04-08 07:40:46.826101 INFO dpdk_workers] Duck #20: 10.21.3.105 (bcast_ip: 10.21.3.255) [2026-04-08 07:40:46.826103 INFO dpdk_workers] Duck #21: 10.21.3.106 (bcast_ip: 10.21.3.255) [2026-04-08 07:40:46.826105 INFO dpdk_workers] Duck #22: 10.21.3.107 (bcast_ip: 10.21.3.255) [2026-04-08 07:40:46.826107 INFO dpdk_workers] Duck #23: 10.21.3.108 (bcast_ip: 10.21.3.255) [2026-04-08 07:40:46.826109 INFO dpdk_workers] Duck #24: 10.21.4.101 (bcast_ip: 10.21.4.255) [2026-04-08 07:40:46.826110 INFO dpdk_workers] Duck #25: 10.21.4.102 (bcast_ip: 10.21.4.255) [2026-04-08 07:40:46.826112 INFO dpdk_workers] Duck #26: 10.21.4.103 (bcast_ip: 10.21.4.255) [2026-04-08 07:40:46.826114 INFO dpdk_workers] Duck #27: 10.21.4.104 (bcast_ip: 10.21.4.255) [2026-04-08 07:40:46.826116 INFO dpdk_workers] Duck #28: 10.21.4.105 (bcast_ip: 10.21.4.255) [2026-04-08 07:40:46.826118 INFO dpdk_workers] Duck #29: 10.21.4.106 (bcast_ip: 10.21.4.255) [2026-04-08 07:40:46.826119 INFO dpdk_workers] Duck #30: 10.21.4.107 (bcast_ip: 10.21.4.255) [2026-04-08 07:40:46.826124 INFO dpdk_workers] Duck #31: 10.21.4.108 (bcast_ip: 10.21.4.255) [2026-04-08 07:40:47.327951 INFO dpdk_workers] [Worker 0]: 10.21.1.101 [2026-04-08 07:40:47.327966 INFO dpdk_workers] [Worker 0]: 10.21.1.102 [2026-04-08 07:40:47.327968 INFO dpdk_workers] [Worker 0]: 10.21.1.103 [2026-04-08 07:40:47.327970 INFO dpdk_workers] [Worker 0]: 10.21.1.104 [2026-04-08 07:40:47.327972 INFO dpdk_workers] [Worker 0]: 10.21.1.105 [2026-04-08 07:40:47.327974 INFO dpdk_workers] [Worker 0]: 10.21.1.106 [2026-04-08 07:40:47.327975 INFO dpdk_workers] [Worker 0]: 10.21.1.107 [2026-04-08 07:40:47.327977 INFO dpdk_workers] [Worker 0]: 10.21.1.108 [2026-04-08 07:40:47.327982 INFO dpdk_workers] [Worker 1]: 10.21.2.101 [2026-04-08 07:40:47.327985 INFO dpdk_workers] [Worker 1]: 10.21.2.102 [2026-04-08 07:40:47.327987 INFO dpdk_workers] [Worker 
1]: 10.21.2.103 [2026-04-08 07:40:47.327990 INFO dpdk_workers] [Worker 1]: 10.21.2.104 [2026-04-08 07:40:47.327992 INFO dpdk_workers] [Worker 1]: 10.21.2.105 [2026-04-08 07:40:47.327994 INFO dpdk_workers] [Worker 1]: 10.21.2.106 [2026-04-08 07:40:47.327996 INFO dpdk_workers] [Worker 1]: 10.21.2.107 [2026-04-08 07:40:47.327998 INFO dpdk_workers] [Worker 1]: 10.21.2.108 [2026-04-08 07:40:47.328078 INFO dpdk_workers] [Worker 2]: 10.21.3.101 [2026-04-08 07:40:47.328080 INFO dpdk_workers] [Worker 2]: 10.21.3.102 [2026-04-08 07:40:47.328082 INFO dpdk_workers] [Worker 2]: 10.21.3.103 [2026-04-08 07:40:47.328084 INFO dpdk_workers] [Worker 2]: 10.21.3.104 [2026-04-08 07:40:47.328085 INFO dpdk_workers] [Worker 2]: 10.21.3.105 [2026-04-08 07:40:47.328087 INFO dpdk_workers] [Worker 2]: 10.21.3.106 [2026-04-08 07:40:47.328089 INFO dpdk_workers] [Worker 2]: 10.21.3.107 [2026-04-08 07:40:47.328090 INFO dpdk_workers] [Worker 2]: 10.21.3.108 [2026-04-08 07:40:47.328093 INFO dpdk_workers] [Worker 3]: 10.21.4.101 [2026-04-08 07:40:47.328095 INFO dpdk_workers] [Worker 3]: 10.21.4.102 [2026-04-08 07:40:47.328097 INFO dpdk_workers] [Worker 3]: 10.21.4.103 [2026-04-08 07:40:47.328098 INFO dpdk_workers] [Worker 3]: 10.21.4.104 [2026-04-08 07:40:47.328100 INFO dpdk_workers] [Worker 3]: 10.21.4.105 [2026-04-08 07:40:47.328102 INFO dpdk_workers] [Worker 3]: 10.21.4.106 [2026-04-08 07:40:47.328103 INFO dpdk_workers] [Worker 3]: 10.21.4.107 [2026-04-08 07:40:47.328105 INFO dpdk_workers] [Worker 3]: 10.21.4.108 [2026-04-08 07:40:47.328257 INFO dpdk_workers] init_ducks done [2026-04-08 07:40:47.332112 INFO dpdk_ducks] Initialized 4 DPDK duck workers [2026-04-08 07:40:47.332115 INFO dpdk_ducks] DPDK duck worker 0: DpdkDuckWorker { worker_idx: 0, ducks: [DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { 
buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }], all_ranks: [0, 1, 2, 3, 4, 5, 6, 7], tp_rank_range: (0, 8) } [2026-04-08 07:40:47.332120 INFO dpdk_ducks] DPDK duck worker 1: DpdkDuckWorker { worker_idx: 1, ducks: [DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }], all_ranks: [0, 1, 2, 3, 4, 5, 6, 7], tp_rank_range: (8, 16) } [2026-04-08 07:40:47.332122 INFO dpdk_ducks] DPDK duck worker 2: DpdkDuckWorker { worker_idx: 2, ducks: [DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }], all_ranks: [0, 1, 2, 3, 4, 5, 6, 7], tp_rank_range: (16, 24) } [2026-04-08 07:40:47.332125 INFO dpdk_ducks] DPDK duck worker 3: DpdkDuckWorker { worker_idx: 3, ducks: [DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }, DpdkDuck { buffer_size: 32212254720 }], all_ranks: [0, 1, 2, 3, 4, 5, 6, 7], tp_rank_range: (24, 32) } [2026-04-08 07:40:47.332131 INFO buffer_manager] Initializing buffer manager [2026-04-08 07:40:47.332133 INFO buffer_manager] Buffer manager initialized: ELF BufferAllocator { begin: 0, end: 10485760, current: 0 }, input BufferAllocator { begin: 10485760, end: 104857600, current: 10485760 }, weights BufferAllocator { begin: 104923136, end: 32212254720, current: 104923136 } [2026-04-08 07:40:47.332136 INFO fp8_dpdk_common] 
fp9 persistent judge enabled by default; set DUCK_FP9_PERSISTENT_JUDGE=0 to disable [2026-04-08 07:40:47.332804 INFO buffer_manager] Added kernel fp9_kernels at (0, 91664) [2026-04-08 07:40:47.342188 INFO fp8_dpdk_common] fp9 persistent judge: opened 32 sessions [2026-04-08 07:40:47.342205 INFO fp8_dpdk_common] fp9 persistent judge: force-opened 32 fresh sessions for new init [2026-04-08 07:40:47.342217 INFO fp8_mlp_dpdk] fp8_mlp_dpdk: init(tp_size=32) [2026-04-08 07:40:47.342224 INFO fp8_moe_dpdk] fp8_moe_dpdk: init(tp_size=32) [2026-04-08 07:40:47.891151 INFO weight_cache] weight_cache: header hit tp_size=32 num_slots=62 finished_slots=62 [2026-04-08 07:40:48.586443 INFO buffer_manager] Allocated weights buffer at (104923136, 0) [2026-04-08 07:40:48.586477 INFO buffer_manager] Allocated weights buffer at (104923136, 4128768) [2026-04-08 07:40:48.586479 INFO buffer_manager] Allocated weights buffer at (109051904, 516096) [2026-04-08 07:40:48.586481 INFO buffer_manager] Allocated weights buffer at (109568000, 2016) [2026-04-08 07:40:48.586483 INFO buffer_manager] Allocated weights buffer at (109572096, 4128768) [2026-04-08 07:40:48.586484 INFO buffer_manager] Allocated weights buffer at (113700864, 516096) [2026-04-08 07:40:48.586486 INFO buffer_manager] Allocated weights buffer at (114216960, 2016) [2026-04-08 07:40:48.586487 INFO buffer_manager] Allocated weights buffer at (114221056, 4128768) [2026-04-08 07:40:48.586489 INFO buffer_manager] Allocated weights buffer at (118349824, 516096) [2026-04-08 07:40:48.586490 INFO buffer_manager] Allocated weights buffer at (118865920, 2016) [2026-04-08 07:40:48.586492 INFO buffer_manager] Allocated weights buffer at (118870016, 0) [2026-04-08 07:40:48.586493 INFO fp8_mlp_dpdk] fp8_mlp_dpdk: init_layer_cached(layer_idx=0, cache_slot=0) planned desc only [2026-04-08 07:40:48.785753 INFO buffer_manager] Allocated weights buffer at (118870016, 0) [2026-04-08 07:40:48.785781 INFO buffer_manager] Allocated weights buffer at 
(118870016, 4128768) [2026-04-08 07:40:48.785783 INFO buffer_manager] Allocated weights buffer at (122998784, 516096) [2026-04-08 07:40:48.785785 INFO buffer_manager] Allocated weights buffer at (123514880, 2016) [2026-04-08 07:40:48.785786 INFO buffer_manager] Allocated weights buffer at (123518976, 4128768) [2026-04-08 07:40:48.785788 INFO buffer_manager] Allocated weights buffer at (127647744, 516096) [2026-04-08 07:40:48.785789 INFO buffer_manager] Allocated weights buffer at (128163840, 2016) [2026-04-08 07:40:48.785790 INFO buffer_manager] Allocated weights buffer at (128167936, 4128768) [2026-04-08 07:40:48.785792 INFO buffer_manager] Allocated weights buffer at (132296704, 516096) [2026-04-08 07:40:48.785793 INFO buffer_manager] Allocated weights buffer at (132812800, 2016) [2026-04-08 07:40:48.785795 INFO buffer_manager] Allocated weights buffer at (132816896, 0) [2026-04-08 07:40:48.785796 INFO fp8_mlp_dpdk] fp8_mlp_dpdk: init_layer_cached(layer_idx=1, cache_slot=1) planned desc only [2026-04-08 07:40:48.973982 INFO buffer_manager] Allocated weights buffer at (132816896, 0) [2026-04-08 07:40:48.974006 INFO buffer_manager] Allocated weights buffer at (132816896, 4128768) [2026-04-08 07:40:48.974008 INFO buffer_manager] Allocated weights buffer at (136945664, 516096) [2026-04-08 07:40:48.974009 INFO buffer_manager] Allocated weights buffer at (137461760, 2016) [2026-04-08 07:40:48.974016 INFO buffer_manager] Allocated weights buffer at (137465856, 4128768) [2026-04-08 07:40:48.974017 INFO buffer_manager] Allocated weights buffer at (141594624, 516096) [2026-04-08 07:40:48.974019 INFO buffer_manager] Allocated weights buffer at (142110720, 2016) [2026-04-08 07:40:48.974021 INFO buffer_manager] Allocated weights buffer at (142114816, 4128768) [2026-04-08 07:40:48.974022 INFO buffer_manager] Allocated weights buffer at (146243584, 516096) [2026-04-08 07:40:48.974023 INFO buffer_manager] Allocated weights buffer at (146759680, 2016) [2026-04-08 07:40:48.974025 
INFO buffer_manager] Allocated weights buffer at (146763776, 0) [2026-04-08 07:40:48.974026 INFO fp8_mlp_dpdk] fp8_mlp_dpdk: init_layer_cached(layer_idx=2, cache_slot=2) planned desc only [2026-04-08 07:40:49.036986 INFO buffer_manager] Allocated weights buffer at (146763776, 0) [2026-04-08 07:40:49.037007 INFO buffer_manager] Allocated weights buffer at (146763776, 132120576) [2026-04-08 07:40:49.037009 INFO buffer_manager] Allocated weights buffer at (278884352, 57344) [2026-04-08 07:40:49.037010 INFO buffer_manager] Allocated weights buffer at (278941696, 132120576) [2026-04-08 07:40:49.037012 INFO buffer_manager] Allocated weights buffer at (411062272, 57344) [2026-04-08 07:40:49.037013 INFO buffer_manager] Allocated weights buffer at (411119616, 132120576) [2026-04-08 07:40:49.037015 INFO buffer_manager] Allocated weights buffer at (543240192, 57344) [2026-04-08 07:40:49.037017 INFO buffer_manager] Allocated weights buffer at (543297536, 0) [2026-04-08 07:40:49.037019 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=3, cache_slot=3) planned desc only [2026-04-08 07:40:49.118688 INFO buffer_manager] Allocated weights buffer at (543297536, 0) [2026-04-08 07:40:49.118711 INFO buffer_manager] Allocated weights buffer at (543297536, 132120576) [2026-04-08 07:40:49.118714 INFO buffer_manager] Allocated weights buffer at (675418112, 57344) [2026-04-08 07:40:49.118715 INFO buffer_manager] Allocated weights buffer at (675475456, 132120576) [2026-04-08 07:40:49.118717 INFO buffer_manager] Allocated weights buffer at (807596032, 57344) [2026-04-08 07:40:49.118718 INFO buffer_manager] Allocated weights buffer at (807653376, 132120576) [2026-04-08 07:40:49.118720 INFO buffer_manager] Allocated weights buffer at (939773952, 57344) [2026-04-08 07:40:49.118721 INFO buffer_manager] Allocated weights buffer at (939831296, 0) [2026-04-08 07:40:49.118723 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=4, cache_slot=4) planned desc only [2026-04-08 
07:40:49.199649 INFO buffer_manager] Allocated weights buffer at (939831296, 0) [2026-04-08 07:40:49.199672 INFO buffer_manager] Allocated weights buffer at (939831296, 132120576) [2026-04-08 07:40:49.199674 INFO buffer_manager] Allocated weights buffer at (1071951872, 57344) [2026-04-08 07:40:49.199676 INFO buffer_manager] Allocated weights buffer at (1072009216, 132120576) [2026-04-08 07:40:49.199677 INFO buffer_manager] Allocated weights buffer at (1204129792, 57344) [2026-04-08 07:40:49.199679 INFO buffer_manager] Allocated weights buffer at (1204187136, 132120576) [2026-04-08 07:40:49.199680 INFO buffer_manager] Allocated weights buffer at (1336307712, 57344) [2026-04-08 07:40:49.199682 INFO buffer_manager] Allocated weights buffer at (1336365056, 0) [2026-04-08 07:40:49.199683 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=5, cache_slot=5) planned desc only [2026-04-08 07:40:49.273706 INFO buffer_manager] Allocated weights buffer at (1336365056, 0) [2026-04-08 07:40:49.273727 INFO buffer_manager] Allocated weights buffer at (1336365056, 132120576) [2026-04-08 07:40:49.273729 INFO buffer_manager] Allocated weights buffer at (1468485632, 57344) [2026-04-08 07:40:49.273731 INFO buffer_manager] Allocated weights buffer at (1468542976, 132120576) [2026-04-08 07:40:49.273732 INFO buffer_manager] Allocated weights buffer at (1600663552, 57344) [2026-04-08 07:40:49.273734 INFO buffer_manager] Allocated weights buffer at (1600720896, 132120576) [2026-04-08 07:40:49.273740 INFO buffer_manager] Allocated weights buffer at (1732841472, 57344) [2026-04-08 07:40:49.273741 INFO buffer_manager] Allocated weights buffer at (1732898816, 0) [2026-04-08 07:40:49.273743 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=6, cache_slot=6) planned desc only [2026-04-08 07:40:49.348642 INFO buffer_manager] Allocated weights buffer at (1732898816, 0) [2026-04-08 07:40:49.348665 INFO buffer_manager] Allocated weights buffer at (1732898816, 132120576) [2026-04-08 
07:40:49.348667 INFO buffer_manager] Allocated weights buffer at (1865019392, 57344) [2026-04-08 07:40:49.348673 INFO buffer_manager] Allocated weights buffer at (1865076736, 132120576) [2026-04-08 07:40:49.348674 INFO buffer_manager] Allocated weights buffer at (1997197312, 57344) [2026-04-08 07:40:49.348676 INFO buffer_manager] Allocated weights buffer at (1997254656, 132120576) [2026-04-08 07:40:49.348677 INFO buffer_manager] Allocated weights buffer at (2129375232, 57344) [2026-04-08 07:40:49.348679 INFO buffer_manager] Allocated weights buffer at (2129432576, 0) [2026-04-08 07:40:49.348681 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=7, cache_slot=7) planned desc only [2026-04-08 07:40:49.422806 INFO buffer_manager] Allocated weights buffer at (2129432576, 0) [2026-04-08 07:40:49.422828 INFO buffer_manager] Allocated weights buffer at (2129432576, 132120576) [2026-04-08 07:40:49.422830 INFO buffer_manager] Allocated weights buffer at (2261553152, 57344) [2026-04-08 07:40:49.422832 INFO buffer_manager] Allocated weights buffer at (2261610496, 132120576) [2026-04-08 07:40:49.422833 INFO buffer_manager] Allocated weights buffer at (2393731072, 57344) [2026-04-08 07:40:49.422835 INFO buffer_manager] Allocated weights buffer at (2393788416, 132120576) [2026-04-08 07:40:49.422836 INFO buffer_manager] Allocated weights buffer at (2525908992, 57344) [2026-04-08 07:40:49.422838 INFO buffer_manager] Allocated weights buffer at (2525966336, 0) [2026-04-08 07:40:49.422839 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=8, cache_slot=8) planned desc only [2026-04-08 07:40:49.498512 INFO buffer_manager] Allocated weights buffer at (2525966336, 0) [2026-04-08 07:40:49.498535 INFO buffer_manager] Allocated weights buffer at (2525966336, 132120576) [2026-04-08 07:40:49.498537 INFO buffer_manager] Allocated weights buffer at (2658086912, 57344) [2026-04-08 07:40:49.498539 INFO buffer_manager] Allocated weights buffer at (2658144256, 132120576) 
[2026-04-08 07:40:49.498540 INFO buffer_manager] Allocated weights buffer at (2790264832, 57344) [2026-04-08 07:40:49.498542 INFO buffer_manager] Allocated weights buffer at (2790322176, 132120576) [2026-04-08 07:40:49.498543 INFO buffer_manager] Allocated weights buffer at (2922442752, 57344) [2026-04-08 07:40:49.498545 INFO buffer_manager] Allocated weights buffer at (2922500096, 0) [2026-04-08 07:40:49.498546 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=9, cache_slot=9) planned desc only [2026-04-08 07:40:49.572510 INFO buffer_manager] Allocated weights buffer at (2922500096, 0) [2026-04-08 07:40:49.572532 INFO buffer_manager] Allocated weights buffer at (2922500096, 132120576) [2026-04-08 07:40:49.572534 INFO buffer_manager] Allocated weights buffer at (3054620672, 57344) [2026-04-08 07:40:49.572536 INFO buffer_manager] Allocated weights buffer at (3054678016, 132120576) [2026-04-08 07:40:49.572538 INFO buffer_manager] Allocated weights buffer at (3186798592, 57344) [2026-04-08 07:40:49.572539 INFO buffer_manager] Allocated weights buffer at (3186855936, 132120576) [2026-04-08 07:40:49.572541 INFO buffer_manager] Allocated weights buffer at (3318976512, 57344) [2026-04-08 07:40:49.572542 INFO buffer_manager] Allocated weights buffer at (3319033856, 0) [2026-04-08 07:40:49.572544 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=10, cache_slot=10) planned desc only [2026-04-08 07:40:49.646986 INFO buffer_manager] Allocated weights buffer at (3319033856, 0) [2026-04-08 07:40:49.647013 INFO buffer_manager] Allocated weights buffer at (3319033856, 132120576) [2026-04-08 07:40:49.647019 INFO buffer_manager] Allocated weights buffer at (3451154432, 57344) [2026-04-08 07:40:49.647021 INFO buffer_manager] Allocated weights buffer at (3451211776, 132120576) [2026-04-08 07:40:49.647022 INFO buffer_manager] Allocated weights buffer at (3583332352, 57344) [2026-04-08 07:40:49.647024 INFO buffer_manager] Allocated weights buffer at (3583389696, 
132120576) [2026-04-08 07:40:49.647025 INFO buffer_manager] Allocated weights buffer at (3715510272, 57344) [2026-04-08 07:40:49.647027 INFO buffer_manager] Allocated weights buffer at (3715567616, 0) [2026-04-08 07:40:49.647028 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=11, cache_slot=11) planned desc only [2026-04-08 07:40:49.722947 INFO buffer_manager] Allocated weights buffer at (3715567616, 0) [2026-04-08 07:40:49.722970 INFO buffer_manager] Allocated weights buffer at (3715567616, 132120576) [2026-04-08 07:40:49.722973 INFO buffer_manager] Allocated weights buffer at (3847688192, 57344) [2026-04-08 07:40:49.722974 INFO buffer_manager] Allocated weights buffer at (3847745536, 132120576) [2026-04-08 07:40:49.722976 INFO buffer_manager] Allocated weights buffer at (3979866112, 57344) [2026-04-08 07:40:49.722977 INFO buffer_manager] Allocated weights buffer at (3979923456, 132120576) [2026-04-08 07:40:49.722979 INFO buffer_manager] Allocated weights buffer at (4112044032, 57344) [2026-04-08 07:40:49.722980 INFO buffer_manager] Allocated weights buffer at (4112101376, 0) [2026-04-08 07:40:49.722982 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=12, cache_slot=12) planned desc only [2026-04-08 07:40:49.798284 INFO buffer_manager] Allocated weights buffer at (4112101376, 0) [2026-04-08 07:40:49.798307 INFO buffer_manager] Allocated weights buffer at (4112101376, 132120576) [2026-04-08 07:40:49.798309 INFO buffer_manager] Allocated weights buffer at (4244221952, 57344) [2026-04-08 07:40:49.798311 INFO buffer_manager] Allocated weights buffer at (4244279296, 132120576) [2026-04-08 07:40:49.798312 INFO buffer_manager] Allocated weights buffer at (4376399872, 57344) [2026-04-08 07:40:49.798313 INFO buffer_manager] Allocated weights buffer at (4376457216, 132120576) [2026-04-08 07:40:49.798315 INFO buffer_manager] Allocated weights buffer at (4508577792, 57344) [2026-04-08 07:40:49.798316 INFO buffer_manager] Allocated weights buffer at 
(4508635136, 0) [2026-04-08 07:40:49.798318 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=13, cache_slot=13) planned desc only [2026-04-08 07:40:49.873001 INFO buffer_manager] Allocated weights buffer at (4508635136, 0) [2026-04-08 07:40:49.873022 INFO buffer_manager] Allocated weights buffer at (4508635136, 132120576) [2026-04-08 07:40:49.873024 INFO buffer_manager] Allocated weights buffer at (4640755712, 57344) [2026-04-08 07:40:49.873026 INFO buffer_manager] Allocated weights buffer at (4640813056, 132120576) [2026-04-08 07:40:49.873028 INFO buffer_manager] Allocated weights buffer at (4772933632, 57344) [2026-04-08 07:40:49.873029 INFO buffer_manager] Allocated weights buffer at (4772990976, 132120576) [2026-04-08 07:40:49.873030 INFO buffer_manager] Allocated weights buffer at (4905111552, 57344) [2026-04-08 07:40:49.873032 INFO buffer_manager] Allocated weights buffer at (4905168896, 0) [2026-04-08 07:40:49.873034 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=14, cache_slot=14) planned desc only [2026-04-08 07:40:49.950363 INFO buffer_manager] Allocated weights buffer at (4905168896, 0) [2026-04-08 07:40:49.950390 INFO buffer_manager] Allocated weights buffer at (4905168896, 132120576) [2026-04-08 07:40:49.950393 INFO buffer_manager] Allocated weights buffer at (5037289472, 57344) [2026-04-08 07:40:49.950396 INFO buffer_manager] Allocated weights buffer at (5037346816, 132120576) [2026-04-08 07:40:49.950398 INFO buffer_manager] Allocated weights buffer at (5169467392, 57344) [2026-04-08 07:40:49.950401 INFO buffer_manager] Allocated weights buffer at (5169524736, 132120576) [2026-04-08 07:40:49.950403 INFO buffer_manager] Allocated weights buffer at (5301645312, 57344) [2026-04-08 07:40:49.950410 INFO buffer_manager] Allocated weights buffer at (5301702656, 0) [2026-04-08 07:40:49.950413 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=15, cache_slot=15) planned desc only [2026-04-08 07:40:50.026280 INFO 
buffer_manager] Allocated weights buffer at (5301702656, 0) [2026-04-08 07:40:50.026302 INFO buffer_manager] Allocated weights buffer at (5301702656, 132120576) [2026-04-08 07:40:50.026305 INFO buffer_manager] Allocated weights buffer at (5433823232, 57344) [2026-04-08 07:40:50.026307 INFO buffer_manager] Allocated weights buffer at (5433880576, 132120576) [2026-04-08 07:40:50.026308 INFO buffer_manager] Allocated weights buffer at (5566001152, 57344) [2026-04-08 07:40:50.026310 INFO buffer_manager] Allocated weights buffer at (5566058496, 132120576) [2026-04-08 07:40:50.026311 INFO buffer_manager] Allocated weights buffer at (5698179072, 57344) [2026-04-08 07:40:50.026312 INFO buffer_manager] Allocated weights buffer at (5698236416, 0) [2026-04-08 07:40:50.026314 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=16, cache_slot=16) planned desc only [2026-04-08 07:40:50.103302 INFO buffer_manager] Allocated weights buffer at (5698236416, 0) [2026-04-08 07:40:50.103322 INFO buffer_manager] Allocated weights buffer at (5698236416, 132120576) [2026-04-08 07:40:50.103325 INFO buffer_manager] Allocated weights buffer at (5830356992, 57344) [2026-04-08 07:40:50.103326 INFO buffer_manager] Allocated weights buffer at (5830414336, 132120576) [2026-04-08 07:40:50.103328 INFO buffer_manager] Allocated weights buffer at (5962534912, 57344) [2026-04-08 07:40:50.103329 INFO buffer_manager] Allocated weights buffer at (5962592256, 132120576) [2026-04-08 07:40:50.103331 INFO buffer_manager] Allocated weights buffer at (6094712832, 57344) [2026-04-08 07:40:50.103332 INFO buffer_manager] Allocated weights buffer at (6094770176, 0) [2026-04-08 07:40:50.103334 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=17, cache_slot=17) planned desc only [2026-04-08 07:40:50.178533 INFO buffer_manager] Allocated weights buffer at (6094770176, 0) [2026-04-08 07:40:50.178554 INFO buffer_manager] Allocated weights buffer at (6094770176, 132120576) [2026-04-08 
07:40:50.178557 INFO buffer_manager] Allocated weights buffer at (6226890752, 57344) [2026-04-08 07:40:50.178558 INFO buffer_manager] Allocated weights buffer at (6226948096, 132120576) [2026-04-08 07:40:50.178560 INFO buffer_manager] Allocated weights buffer at (6359068672, 57344) [2026-04-08 07:40:50.178561 INFO buffer_manager] Allocated weights buffer at (6359126016, 132120576) [2026-04-08 07:40:50.178563 INFO buffer_manager] Allocated weights buffer at (6491246592, 57344) [2026-04-08 07:40:50.178564 INFO buffer_manager] Allocated weights buffer at (6491303936, 0) [2026-04-08 07:40:50.178566 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=18, cache_slot=18) planned desc only [2026-04-08 07:40:50.262421 INFO buffer_manager] Allocated weights buffer at (6491303936, 0) [2026-04-08 07:40:50.262444 INFO buffer_manager] Allocated weights buffer at (6491303936, 132120576) [2026-04-08 07:40:50.262446 INFO buffer_manager] Allocated weights buffer at (6623424512, 57344) [2026-04-08 07:40:50.262447 INFO buffer_manager] Allocated weights buffer at (6623481856, 132120576) [2026-04-08 07:40:50.262449 INFO buffer_manager] Allocated weights buffer at (6755602432, 57344) [2026-04-08 07:40:50.262450 INFO buffer_manager] Allocated weights buffer at (6755659776, 132120576) [2026-04-08 07:40:50.262452 INFO buffer_manager] Allocated weights buffer at (6887780352, 57344) [2026-04-08 07:40:50.262453 INFO buffer_manager] Allocated weights buffer at (6887837696, 0) [2026-04-08 07:40:50.262455 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=19, cache_slot=19) planned desc only [2026-04-08 07:40:50.339732 INFO buffer_manager] Allocated weights buffer at (6887837696, 0) [2026-04-08 07:40:50.339765 INFO buffer_manager] Allocated weights buffer at (6887837696, 132120576) [2026-04-08 07:40:50.339779 INFO buffer_manager] Allocated weights buffer at (7019958272, 57344) [2026-04-08 07:40:50.339785 INFO buffer_manager] Allocated weights buffer at (7020015616, 132120576) 
[2026-04-08 07:40:50.339788 INFO buffer_manager] Allocated weights buffer at (7152136192, 57344) [2026-04-08 07:40:50.339792 INFO buffer_manager] Allocated weights buffer at (7152193536, 132120576) [2026-04-08 07:40:50.339796 INFO buffer_manager] Allocated weights buffer at (7284314112, 57344) [2026-04-08 07:40:50.339798 INFO buffer_manager] Allocated weights buffer at (7284371456, 0) [2026-04-08 07:40:50.339801 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=20, cache_slot=20) planned desc only [2026-04-08 07:40:50.426557 INFO buffer_manager] Allocated weights buffer at (7284371456, 0) [2026-04-08 07:40:50.426581 INFO buffer_manager] Allocated weights buffer at (7284371456, 132120576) [2026-04-08 07:40:50.426584 INFO buffer_manager] Allocated weights buffer at (7416492032, 57344) [2026-04-08 07:40:50.426585 INFO buffer_manager] Allocated weights buffer at (7416549376, 132120576) [2026-04-08 07:40:50.426587 INFO buffer_manager] Allocated weights buffer at (7548669952, 57344) [2026-04-08 07:40:50.426588 INFO buffer_manager] Allocated weights buffer at (7548727296, 132120576) [2026-04-08 07:40:50.426590 INFO buffer_manager] Allocated weights buffer at (7680847872, 57344) [2026-04-08 07:40:50.426591 INFO buffer_manager] Allocated weights buffer at (7680905216, 0) [2026-04-08 07:40:50.426593 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=21, cache_slot=21) planned desc only [2026-04-08 07:40:50.511543 INFO buffer_manager] Allocated weights buffer at (7680905216, 0) [2026-04-08 07:40:50.511565 INFO buffer_manager] Allocated weights buffer at (7680905216, 132120576) [2026-04-08 07:40:50.511567 INFO buffer_manager] Allocated weights buffer at (7813025792, 57344) [2026-04-08 07:40:50.511568 INFO buffer_manager] Allocated weights buffer at (7813083136, 132120576) [2026-04-08 07:40:50.511570 INFO buffer_manager] Allocated weights buffer at (7945203712, 57344) [2026-04-08 07:40:50.511572 INFO buffer_manager] Allocated weights buffer at (7945261056, 
132120576) [2026-04-08 07:40:50.511573 INFO buffer_manager] Allocated weights buffer at (8077381632, 57344) [2026-04-08 07:40:50.511575 INFO buffer_manager] Allocated weights buffer at (8077438976, 0) [2026-04-08 07:40:50.511577 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=22, cache_slot=22) planned desc only [2026-04-08 07:40:50.587566 INFO buffer_manager] Allocated weights buffer at (8077438976, 0) [2026-04-08 07:40:50.587590 INFO buffer_manager] Allocated weights buffer at (8077438976, 132120576) [2026-04-08 07:40:50.587592 INFO buffer_manager] Allocated weights buffer at (8209559552, 57344) [2026-04-08 07:40:50.587594 INFO buffer_manager] Allocated weights buffer at (8209616896, 132120576) [2026-04-08 07:40:50.587595 INFO buffer_manager] Allocated weights buffer at (8341737472, 57344) [2026-04-08 07:40:50.587597 INFO buffer_manager] Allocated weights buffer at (8341794816, 132120576) [2026-04-08 07:40:50.587598 INFO buffer_manager] Allocated weights buffer at (8473915392, 57344) [2026-04-08 07:40:50.587600 INFO buffer_manager] Allocated weights buffer at (8473972736, 0) [2026-04-08 07:40:50.587602 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=23, cache_slot=23) planned desc only [2026-04-08 07:40:50.664014 INFO buffer_manager] Allocated weights buffer at (8473972736, 0) [2026-04-08 07:40:50.664041 INFO buffer_manager] Allocated weights buffer at (8473972736, 132120576) [2026-04-08 07:40:50.664043 INFO buffer_manager] Allocated weights buffer at (8606093312, 57344) [2026-04-08 07:40:50.664045 INFO buffer_manager] Allocated weights buffer at (8606150656, 132120576) [2026-04-08 07:40:50.664046 INFO buffer_manager] Allocated weights buffer at (8738271232, 57344) [2026-04-08 07:40:50.664048 INFO buffer_manager] Allocated weights buffer at (8738328576, 132120576) [2026-04-08 07:40:50.664049 INFO buffer_manager] Allocated weights buffer at (8870449152, 57344) [2026-04-08 07:40:50.664054 INFO buffer_manager] Allocated weights buffer at 
(8870506496, 0) [2026-04-08 07:40:50.664056 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=24, cache_slot=24) planned desc only [2026-04-08 07:40:50.740669 INFO buffer_manager] Allocated weights buffer at (8870506496, 0) [2026-04-08 07:40:50.740691 INFO buffer_manager] Allocated weights buffer at (8870506496, 132120576) [2026-04-08 07:40:50.740698 INFO buffer_manager] Allocated weights buffer at (9002627072, 57344) [2026-04-08 07:40:50.740699 INFO buffer_manager] Allocated weights buffer at (9002684416, 132120576) [2026-04-08 07:40:50.740701 INFO buffer_manager] Allocated weights buffer at (9134804992, 57344) [2026-04-08 07:40:50.740702 INFO buffer_manager] Allocated weights buffer at (9134862336, 132120576) [2026-04-08 07:40:50.740704 INFO buffer_manager] Allocated weights buffer at (9266982912, 57344) [2026-04-08 07:40:50.740705 INFO buffer_manager] Allocated weights buffer at (9267040256, 0) [2026-04-08 07:40:50.740707 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=25, cache_slot=25) planned desc only [2026-04-08 07:40:50.815972 INFO buffer_manager] Allocated weights buffer at (9267040256, 0) [2026-04-08 07:40:50.815997 INFO buffer_manager] Allocated weights buffer at (9267040256, 132120576) [2026-04-08 07:40:50.815999 INFO buffer_manager] Allocated weights buffer at (9399160832, 57344) [2026-04-08 07:40:50.816000 INFO buffer_manager] Allocated weights buffer at (9399218176, 132120576) [2026-04-08 07:40:50.816002 INFO buffer_manager] Allocated weights buffer at (9531338752, 57344) [2026-04-08 07:40:50.816003 INFO buffer_manager] Allocated weights buffer at (9531396096, 132120576) [2026-04-08 07:40:50.816005 INFO buffer_manager] Allocated weights buffer at (9663516672, 57344) [2026-04-08 07:40:50.816006 INFO buffer_manager] Allocated weights buffer at (9663574016, 0) [2026-04-08 07:40:50.816008 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=26, cache_slot=26) planned desc only [2026-04-08 07:40:50.898678 INFO 
buffer_manager] Allocated weights buffer at (9663574016, 0) [2026-04-08 07:40:50.898701 INFO buffer_manager] Allocated weights buffer at (9663574016, 132120576) [2026-04-08 07:40:50.898703 INFO buffer_manager] Allocated weights buffer at (9795694592, 57344) [2026-04-08 07:40:50.898705 INFO buffer_manager] Allocated weights buffer at (9795751936, 132120576) [2026-04-08 07:40:50.898706 INFO buffer_manager] Allocated weights buffer at (9927872512, 57344) [2026-04-08 07:40:50.898708 INFO buffer_manager] Allocated weights buffer at (9927929856, 132120576) [2026-04-08 07:40:50.898709 INFO buffer_manager] Allocated weights buffer at (10060050432, 57344) [2026-04-08 07:40:50.898710 INFO buffer_manager] Allocated weights buffer at (10060107776, 0) [2026-04-08 07:40:50.898712 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=27, cache_slot=27) planned desc only [2026-04-08 07:40:50.976490 INFO buffer_manager] Allocated weights buffer at (10060107776, 0) [2026-04-08 07:40:50.976512 INFO buffer_manager] Allocated weights buffer at (10060107776, 132120576) [2026-04-08 07:40:50.976514 INFO buffer_manager] Allocated weights buffer at (10192228352, 57344) [2026-04-08 07:40:50.976516 INFO buffer_manager] Allocated weights buffer at (10192285696, 132120576) [2026-04-08 07:40:50.976517 INFO buffer_manager] Allocated weights buffer at (10324406272, 57344) [2026-04-08 07:40:50.976519 INFO buffer_manager] Allocated weights buffer at (10324463616, 132120576) [2026-04-08 07:40:50.976520 INFO buffer_manager] Allocated weights buffer at (10456584192, 57344) [2026-04-08 07:40:50.976522 INFO buffer_manager] Allocated weights buffer at (10456641536, 0) [2026-04-08 07:40:50.976523 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=28, cache_slot=28) planned desc only [2026-04-08 07:40:51.051033 INFO buffer_manager] Allocated weights buffer at (10456641536, 0) [2026-04-08 07:40:51.051055 INFO buffer_manager] Allocated weights buffer at (10456641536, 132120576) [2026-04-08 
07:40:51.051061 INFO buffer_manager] Allocated weights buffer at (10588762112, 57344) [2026-04-08 07:40:51.051063 INFO buffer_manager] Allocated weights buffer at (10588819456, 132120576) [2026-04-08 07:40:51.051064 INFO buffer_manager] Allocated weights buffer at (10720940032, 57344) [2026-04-08 07:40:51.051066 INFO buffer_manager] Allocated weights buffer at (10720997376, 132120576) [2026-04-08 07:40:51.051067 INFO buffer_manager] Allocated weights buffer at (10853117952, 57344) [2026-04-08 07:40:51.051068 INFO buffer_manager] Allocated weights buffer at (10853175296, 0) [2026-04-08 07:40:51.051070 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=29, cache_slot=29) planned desc only [2026-04-08 07:40:51.129772 INFO buffer_manager] Allocated weights buffer at (10853175296, 0) [2026-04-08 07:40:51.129795 INFO buffer_manager] Allocated weights buffer at (10853175296, 132120576) [2026-04-08 07:40:51.129801 INFO buffer_manager] Allocated weights buffer at (10985295872, 57344) [2026-04-08 07:40:51.129803 INFO buffer_manager] Allocated weights buffer at (10985353216, 132120576) [2026-04-08 07:40:51.129804 INFO buffer_manager] Allocated weights buffer at (11117473792, 57344) [2026-04-08 07:40:51.129806 INFO buffer_manager] Allocated weights buffer at (11117531136, 132120576) [2026-04-08 07:40:51.129807 INFO buffer_manager] Allocated weights buffer at (11249651712, 57344) [2026-04-08 07:40:51.129808 INFO buffer_manager] Allocated weights buffer at (11249709056, 0) [2026-04-08 07:40:51.129810 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=30, cache_slot=30) planned desc only [2026-04-08 07:40:51.204375 INFO buffer_manager] Allocated weights buffer at (11249709056, 0) [2026-04-08 07:40:51.204397 INFO buffer_manager] Allocated weights buffer at (11249709056, 132120576) [2026-04-08 07:40:51.204399 INFO buffer_manager] Allocated weights buffer at (11381829632, 57344) [2026-04-08 07:40:51.204401 INFO buffer_manager] Allocated weights buffer at 
(11381886976, 132120576) [2026-04-08 07:40:51.204402 INFO buffer_manager] Allocated weights buffer at (11514007552, 57344) [2026-04-08 07:40:51.204404 INFO buffer_manager] Allocated weights buffer at (11514064896, 132120576) [2026-04-08 07:40:51.204405 INFO buffer_manager] Allocated weights buffer at (11646185472, 57344) [2026-04-08 07:40:51.204407 INFO buffer_manager] Allocated weights buffer at (11646242816, 0) [2026-04-08 07:40:51.204408 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=31, cache_slot=31) planned desc only [2026-04-08 07:40:51.280203 INFO buffer_manager] Allocated weights buffer at (11646242816, 0) [2026-04-08 07:40:51.280227 INFO buffer_manager] Allocated weights buffer at (11646242816, 132120576) [2026-04-08 07:40:51.280229 INFO buffer_manager] Allocated weights buffer at (11778363392, 57344) [2026-04-08 07:40:51.280231 INFO buffer_manager] Allocated weights buffer at (11778420736, 132120576) [2026-04-08 07:40:51.280233 INFO buffer_manager] Allocated weights buffer at (11910541312, 57344) [2026-04-08 07:40:51.280234 INFO buffer_manager] Allocated weights buffer at (11910598656, 132120576) [2026-04-08 07:40:51.280236 INFO buffer_manager] Allocated weights buffer at (12042719232, 57344) [2026-04-08 07:40:51.280237 INFO buffer_manager] Allocated weights buffer at (12042776576, 0) [2026-04-08 07:40:51.280239 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=32, cache_slot=32) planned desc only [2026-04-08 07:40:51.355597 INFO buffer_manager] Allocated weights buffer at (12042776576, 0) [2026-04-08 07:40:51.355619 INFO buffer_manager] Allocated weights buffer at (12042776576, 132120576) [2026-04-08 07:40:51.355621 INFO buffer_manager] Allocated weights buffer at (12174897152, 57344) [2026-04-08 07:40:51.355623 INFO buffer_manager] Allocated weights buffer at (12174954496, 132120576) [2026-04-08 07:40:51.355624 INFO buffer_manager] Allocated weights buffer at (12307075072, 57344) [2026-04-08 07:40:51.355626 INFO buffer_manager] 
Allocated weights buffer at (12307132416, 132120576) [2026-04-08 07:40:51.355627 INFO buffer_manager] Allocated weights buffer at (12439252992, 57344) [2026-04-08 07:40:51.355632 INFO buffer_manager] Allocated weights buffer at (12439310336, 0) [2026-04-08 07:40:51.355634 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=33, cache_slot=33) planned desc only [2026-04-08 07:40:51.431022 INFO buffer_manager] Allocated weights buffer at (12439310336, 0) [2026-04-08 07:40:51.431048 INFO buffer_manager] Allocated weights buffer at (12439310336, 132120576) [2026-04-08 07:40:51.431051 INFO buffer_manager] Allocated weights buffer at (12571430912, 57344) [2026-04-08 07:40:51.431052 INFO buffer_manager] Allocated weights buffer at (12571488256, 132120576) [2026-04-08 07:40:51.431054 INFO buffer_manager] Allocated weights buffer at (12703608832, 57344) [2026-04-08 07:40:51.431056 INFO buffer_manager] Allocated weights buffer at (12703666176, 132120576) [2026-04-08 07:40:51.431057 INFO buffer_manager] Allocated weights buffer at (12835786752, 57344) [2026-04-08 07:40:51.431058 INFO buffer_manager] Allocated weights buffer at (12835844096, 0) [2026-04-08 07:40:51.431060 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=34, cache_slot=34) planned desc only [2026-04-08 07:40:51.508554 INFO buffer_manager] Allocated weights buffer at (12835844096, 0) [2026-04-08 07:40:51.508578 INFO buffer_manager] Allocated weights buffer at (12835844096, 132120576) [2026-04-08 07:40:51.508580 INFO buffer_manager] Allocated weights buffer at (12967964672, 57344) [2026-04-08 07:40:51.508582 INFO buffer_manager] Allocated weights buffer at (12968022016, 132120576) [2026-04-08 07:40:51.508583 INFO buffer_manager] Allocated weights buffer at (13100142592, 57344) [2026-04-08 07:40:51.508585 INFO buffer_manager] Allocated weights buffer at (13100199936, 132120576) [2026-04-08 07:40:51.508586 INFO buffer_manager] Allocated weights buffer at (13232320512, 57344) [2026-04-08 
07:40:51.508588 INFO buffer_manager] Allocated weights buffer at (13232377856, 0) [2026-04-08 07:40:51.508589 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=35, cache_slot=35) planned desc only [2026-04-08 07:40:51.583244 INFO buffer_manager] Allocated weights buffer at (13232377856, 0) [2026-04-08 07:40:51.583268 INFO buffer_manager] Allocated weights buffer at (13232377856, 132120576) [2026-04-08 07:40:51.583270 INFO buffer_manager] Allocated weights buffer at (13364498432, 57344) [2026-04-08 07:40:51.583271 INFO buffer_manager] Allocated weights buffer at (13364555776, 132120576) [2026-04-08 07:40:51.583273 INFO buffer_manager] Allocated weights buffer at (13496676352, 57344) [2026-04-08 07:40:51.583274 INFO buffer_manager] Allocated weights buffer at (13496733696, 132120576) [2026-04-08 07:40:51.583276 INFO buffer_manager] Allocated weights buffer at (13628854272, 57344) [2026-04-08 07:40:51.583278 INFO buffer_manager] Allocated weights buffer at (13628911616, 0) [2026-04-08 07:40:51.583279 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=36, cache_slot=36) planned desc only [2026-04-08 07:40:51.656903 INFO buffer_manager] Allocated weights buffer at (13628911616, 0) [2026-04-08 07:40:51.656926 INFO buffer_manager] Allocated weights buffer at (13628911616, 132120576) [2026-04-08 07:40:51.656928 INFO buffer_manager] Allocated weights buffer at (13761032192, 57344) [2026-04-08 07:40:51.656929 INFO buffer_manager] Allocated weights buffer at (13761089536, 132120576) [2026-04-08 07:40:51.656931 INFO buffer_manager] Allocated weights buffer at (13893210112, 57344) [2026-04-08 07:40:51.656939 INFO buffer_manager] Allocated weights buffer at (13893267456, 132120576) [2026-04-08 07:40:51.656940 INFO buffer_manager] Allocated weights buffer at (14025388032, 57344) [2026-04-08 07:40:51.656942 INFO buffer_manager] Allocated weights buffer at (14025445376, 0) [2026-04-08 07:40:51.656944 INFO fp8_moe_dpdk] fp8_moe_dpdk: 
init_layer_cached(layer_idx=37, cache_slot=37) planned desc only [2026-04-08 07:40:51.730673 INFO buffer_manager] Allocated weights buffer at (14025445376, 0) [2026-04-08 07:40:51.730697 INFO buffer_manager] Allocated weights buffer at (14025445376, 132120576) [2026-04-08 07:40:51.730703 INFO buffer_manager] Allocated weights buffer at (14157565952, 57344) [2026-04-08 07:40:51.730705 INFO buffer_manager] Allocated weights buffer at (14157623296, 132120576) [2026-04-08 07:40:51.730706 INFO buffer_manager] Allocated weights buffer at (14289743872, 57344) [2026-04-08 07:40:51.730709 INFO buffer_manager] Allocated weights buffer at (14289801216, 132120576) [2026-04-08 07:40:51.730711 INFO buffer_manager] Allocated weights buffer at (14421921792, 57344) [2026-04-08 07:40:51.730712 INFO buffer_manager] Allocated weights buffer at (14421979136, 0) [2026-04-08 07:40:51.730714 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=38, cache_slot=38) planned desc only [2026-04-08 07:40:51.805803 INFO buffer_manager] Allocated weights buffer at (14421979136, 0) [2026-04-08 07:40:51.805826 INFO buffer_manager] Allocated weights buffer at (14421979136, 132120576) [2026-04-08 07:40:51.805828 INFO buffer_manager] Allocated weights buffer at (14554099712, 57344) [2026-04-08 07:40:51.805830 INFO buffer_manager] Allocated weights buffer at (14554157056, 132120576) [2026-04-08 07:40:51.805831 INFO buffer_manager] Allocated weights buffer at (14686277632, 57344) [2026-04-08 07:40:51.805833 INFO buffer_manager] Allocated weights buffer at (14686334976, 132120576) [2026-04-08 07:40:51.805834 INFO buffer_manager] Allocated weights buffer at (14818455552, 57344) [2026-04-08 07:40:51.805836 INFO buffer_manager] Allocated weights buffer at (14818512896, 0) [2026-04-08 07:40:51.805838 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=39, cache_slot=39) planned desc only [2026-04-08 07:40:51.879976 INFO buffer_manager] Allocated weights buffer at (14818512896, 0) [2026-04-08 
07:40:51.879998 INFO buffer_manager] Allocated weights buffer at (14818512896, 132120576) [2026-04-08 07:40:51.880000 INFO buffer_manager] Allocated weights buffer at (14950633472, 57344) [2026-04-08 07:40:51.880002 INFO buffer_manager] Allocated weights buffer at (14950690816, 132120576) [2026-04-08 07:40:51.880003 INFO buffer_manager] Allocated weights buffer at (15082811392, 57344) [2026-04-08 07:40:51.880005 INFO buffer_manager] Allocated weights buffer at (15082868736, 132120576) [2026-04-08 07:40:51.880006 INFO buffer_manager] Allocated weights buffer at (15214989312, 57344) [2026-04-08 07:40:51.880008 INFO buffer_manager] Allocated weights buffer at (15215046656, 0) [2026-04-08 07:40:51.880009 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=40, cache_slot=40) planned desc only [2026-04-08 07:40:51.954880 INFO buffer_manager] Allocated weights buffer at (15215046656, 0) [2026-04-08 07:40:51.954903 INFO buffer_manager] Allocated weights buffer at (15215046656, 132120576) [2026-04-08 07:40:51.954906 INFO buffer_manager] Allocated weights buffer at (15347167232, 57344) [2026-04-08 07:40:51.954907 INFO buffer_manager] Allocated weights buffer at (15347224576, 132120576) [2026-04-08 07:40:51.954909 INFO buffer_manager] Allocated weights buffer at (15479345152, 57344) [2026-04-08 07:40:51.954910 INFO buffer_manager] Allocated weights buffer at (15479402496, 132120576) [2026-04-08 07:40:51.954912 INFO buffer_manager] Allocated weights buffer at (15611523072, 57344) [2026-04-08 07:40:51.954913 INFO buffer_manager] Allocated weights buffer at (15611580416, 0) [2026-04-08 07:40:51.954915 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=41, cache_slot=41) planned desc only [2026-04-08 07:40:52.028873 INFO buffer_manager] Allocated weights buffer at (15611580416, 0) [2026-04-08 07:40:52.028896 INFO buffer_manager] Allocated weights buffer at (15611580416, 132120576) [2026-04-08 07:40:52.028898 INFO buffer_manager] Allocated weights buffer at 
(15743700992, 57344) [2026-04-08 07:40:52.028899 INFO buffer_manager] Allocated weights buffer at (15743758336, 132120576) [2026-04-08 07:40:52.028901 INFO buffer_manager] Allocated weights buffer at (15875878912, 57344) [2026-04-08 07:40:52.028903 INFO buffer_manager] Allocated weights buffer at (15875936256, 132120576) [2026-04-08 07:40:52.028905 INFO buffer_manager] Allocated weights buffer at (16008056832, 57344) [2026-04-08 07:40:52.028910 INFO buffer_manager] Allocated weights buffer at (16008114176, 0) [2026-04-08 07:40:52.028912 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=42, cache_slot=42) planned desc only [2026-04-08 07:40:52.108573 INFO buffer_manager] Allocated weights buffer at (16008114176, 0) [2026-04-08 07:40:52.108595 INFO buffer_manager] Allocated weights buffer at (16008114176, 132120576) [2026-04-08 07:40:52.108597 INFO buffer_manager] Allocated weights buffer at (16140234752, 57344) [2026-04-08 07:40:52.108599 INFO buffer_manager] Allocated weights buffer at (16140292096, 132120576) [2026-04-08 07:40:52.108600 INFO buffer_manager] Allocated weights buffer at (16272412672, 57344) [2026-04-08 07:40:52.108602 INFO buffer_manager] Allocated weights buffer at (16272470016, 132120576) [2026-04-08 07:40:52.108603 INFO buffer_manager] Allocated weights buffer at (16404590592, 57344) [2026-04-08 07:40:52.108605 INFO buffer_manager] Allocated weights buffer at (16404647936, 0) [2026-04-08 07:40:52.108607 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=43, cache_slot=43) planned desc only [2026-04-08 07:40:52.181307 INFO buffer_manager] Allocated weights buffer at (16404647936, 0) [2026-04-08 07:40:52.181329 INFO buffer_manager] Allocated weights buffer at (16404647936, 132120576) [2026-04-08 07:40:52.181331 INFO buffer_manager] Allocated weights buffer at (16536768512, 57344) [2026-04-08 07:40:52.181332 INFO buffer_manager] Allocated weights buffer at (16536825856, 132120576) [2026-04-08 07:40:52.181334 INFO buffer_manager] 
Allocated weights buffer at (16668946432, 57344) [2026-04-08 07:40:52.181335 INFO buffer_manager] Allocated weights buffer at (16669003776, 132120576) [2026-04-08 07:40:52.181337 INFO buffer_manager] Allocated weights buffer at (16801124352, 57344) [2026-04-08 07:40:52.181339 INFO buffer_manager] Allocated weights buffer at (16801181696, 0) [2026-04-08 07:40:52.181340 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=44, cache_slot=44) planned desc only [2026-04-08 07:40:52.252351 INFO buffer_manager] Allocated weights buffer at (16801181696, 0) [2026-04-08 07:40:52.252373 INFO buffer_manager] Allocated weights buffer at (16801181696, 132120576) [2026-04-08 07:40:52.252376 INFO buffer_manager] Allocated weights buffer at (16933302272, 57344) [2026-04-08 07:40:52.252378 INFO buffer_manager] Allocated weights buffer at (16933359616, 132120576) [2026-04-08 07:40:52.252379 INFO buffer_manager] Allocated weights buffer at (17065480192, 57344) [2026-04-08 07:40:52.252381 INFO buffer_manager] Allocated weights buffer at (17065537536, 132120576) [2026-04-08 07:40:52.252382 INFO buffer_manager] Allocated weights buffer at (17197658112, 57344) [2026-04-08 07:40:52.252384 INFO buffer_manager] Allocated weights buffer at (17197715456, 0) [2026-04-08 07:40:52.252385 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=45, cache_slot=45) planned desc only [2026-04-08 07:40:52.322566 INFO buffer_manager] Allocated weights buffer at (17197715456, 0) [2026-04-08 07:40:52.322586 INFO buffer_manager] Allocated weights buffer at (17197715456, 132120576) [2026-04-08 07:40:52.322588 INFO buffer_manager] Allocated weights buffer at (17329836032, 57344) [2026-04-08 07:40:52.322590 INFO buffer_manager] Allocated weights buffer at (17329893376, 132120576) [2026-04-08 07:40:52.322591 INFO buffer_manager] Allocated weights buffer at (17462013952, 57344) [2026-04-08 07:40:52.322593 INFO buffer_manager] Allocated weights buffer at (17462071296, 132120576) [2026-04-08 
07:40:52.322594 INFO buffer_manager] Allocated weights buffer at (17594191872, 57344) [2026-04-08 07:40:52.322596 INFO buffer_manager] Allocated weights buffer at (17594249216, 0) [2026-04-08 07:40:52.322597 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=46, cache_slot=46) planned desc only [2026-04-08 07:40:52.394573 INFO buffer_manager] Allocated weights buffer at (17594249216, 0) [2026-04-08 07:40:52.394594 INFO buffer_manager] Allocated weights buffer at (17594249216, 132120576) [2026-04-08 07:40:52.394600 INFO buffer_manager] Allocated weights buffer at (17726369792, 57344) [2026-04-08 07:40:52.394602 INFO buffer_manager] Allocated weights buffer at (17726427136, 132120576) [2026-04-08 07:40:52.394603 INFO buffer_manager] Allocated weights buffer at (17858547712, 57344) [2026-04-08 07:40:52.394605 INFO buffer_manager] Allocated weights buffer at (17858605056, 132120576) [2026-04-08 07:40:52.394606 INFO buffer_manager] Allocated weights buffer at (17990725632, 57344) [2026-04-08 07:40:52.394608 INFO buffer_manager] Allocated weights buffer at (17990782976, 0) [2026-04-08 07:40:52.394610 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=47, cache_slot=47) planned desc only [2026-04-08 07:40:52.466920 INFO buffer_manager] Allocated weights buffer at (17990782976, 0) [2026-04-08 07:40:52.466942 INFO buffer_manager] Allocated weights buffer at (17990782976, 132120576) [2026-04-08 07:40:52.466944 INFO buffer_manager] Allocated weights buffer at (18122903552, 57344) [2026-04-08 07:40:52.466945 INFO buffer_manager] Allocated weights buffer at (18122960896, 132120576) [2026-04-08 07:40:52.466947 INFO buffer_manager] Allocated weights buffer at (18255081472, 57344) [2026-04-08 07:40:52.466948 INFO buffer_manager] Allocated weights buffer at (18255138816, 132120576) [2026-04-08 07:40:52.466950 INFO buffer_manager] Allocated weights buffer at (18387259392, 57344) [2026-04-08 07:40:52.466951 INFO buffer_manager] Allocated weights buffer at 
(18387316736, 0) [2026-04-08 07:40:52.466953 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=48, cache_slot=48) planned desc only [2026-04-08 07:40:52.539953 INFO buffer_manager] Allocated weights buffer at (18387316736, 0) [2026-04-08 07:40:52.539975 INFO buffer_manager] Allocated weights buffer at (18387316736, 132120576) [2026-04-08 07:40:52.539977 INFO buffer_manager] Allocated weights buffer at (18519437312, 57344) [2026-04-08 07:40:52.539979 INFO buffer_manager] Allocated weights buffer at (18519494656, 132120576) [2026-04-08 07:40:52.539980 INFO buffer_manager] Allocated weights buffer at (18651615232, 57344) [2026-04-08 07:40:52.539982 INFO buffer_manager] Allocated weights buffer at (18651672576, 132120576) [2026-04-08 07:40:52.539983 INFO buffer_manager] Allocated weights buffer at (18783793152, 57344) [2026-04-08 07:40:52.539985 INFO buffer_manager] Allocated weights buffer at (18783850496, 0) [2026-04-08 07:40:52.539986 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=49, cache_slot=49) planned desc only [2026-04-08 07:40:52.611360 INFO buffer_manager] Allocated weights buffer at (18783850496, 0) [2026-04-08 07:40:52.611382 INFO buffer_manager] Allocated weights buffer at (18783850496, 132120576) [2026-04-08 07:40:52.611384 INFO buffer_manager] Allocated weights buffer at (18915971072, 57344) [2026-04-08 07:40:52.611385 INFO buffer_manager] Allocated weights buffer at (18916028416, 132120576) [2026-04-08 07:40:52.611387 INFO buffer_manager] Allocated weights buffer at (19048148992, 57344) [2026-04-08 07:40:52.611388 INFO buffer_manager] Allocated weights buffer at (19048206336, 132120576) [2026-04-08 07:40:52.611390 INFO buffer_manager] Allocated weights buffer at (19180326912, 57344) [2026-04-08 07:40:52.611391 INFO buffer_manager] Allocated weights buffer at (19180384256, 0) [2026-04-08 07:40:52.611393 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=50, cache_slot=50) planned desc only [2026-04-08 07:40:52.683472 
INFO buffer_manager] Allocated weights buffer at (19180384256, 0) [2026-04-08 07:40:52.683494 INFO buffer_manager] Allocated weights buffer at (19180384256, 132120576) [2026-04-08 07:40:52.683496 INFO buffer_manager] Allocated weights buffer at (19312504832, 57344) [2026-04-08 07:40:52.683497 INFO buffer_manager] Allocated weights buffer at (19312562176, 132120576) [2026-04-08 07:40:52.683499 INFO buffer_manager] Allocated weights buffer at (19444682752, 57344) [2026-04-08 07:40:52.683501 INFO buffer_manager] Allocated weights buffer at (19444740096, 132120576) [2026-04-08 07:40:52.683506 INFO buffer_manager] Allocated weights buffer at (19576860672, 57344) [2026-04-08 07:40:52.683508 INFO buffer_manager] Allocated weights buffer at (19576918016, 0) [2026-04-08 07:40:52.683510 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=51, cache_slot=51) planned desc only [2026-04-08 07:40:52.754841 INFO buffer_manager] Allocated weights buffer at (19576918016, 0) [2026-04-08 07:40:52.754868 INFO buffer_manager] Allocated weights buffer at (19576918016, 132120576) [2026-04-08 07:40:52.754870 INFO buffer_manager] Allocated weights buffer at (19709038592, 57344) [2026-04-08 07:40:52.754872 INFO buffer_manager] Allocated weights buffer at (19709095936, 132120576) [2026-04-08 07:40:52.754874 INFO buffer_manager] Allocated weights buffer at (19841216512, 57344) [2026-04-08 07:40:52.754875 INFO buffer_manager] Allocated weights buffer at (19841273856, 132120576) [2026-04-08 07:40:52.754876 INFO buffer_manager] Allocated weights buffer at (19973394432, 57344) [2026-04-08 07:40:52.754878 INFO buffer_manager] Allocated weights buffer at (19973451776, 0) [2026-04-08 07:40:52.754879 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=52, cache_slot=52) planned desc only [2026-04-08 07:40:52.795111 INFO buffer_manager] Allocated weights buffer at (19973451776, 0) [2026-04-08 07:40:52.795127 INFO buffer_manager] Allocated weights buffer at (19973451776, 132120576) 
[2026-04-08 07:40:52.795129 INFO buffer_manager] Allocated weights buffer at (20105572352, 57344) [2026-04-08 07:40:52.795131 INFO buffer_manager] Allocated weights buffer at (20105629696, 132120576) [2026-04-08 07:40:52.795132 INFO buffer_manager] Allocated weights buffer at (20237750272, 57344) [2026-04-08 07:40:52.795134 INFO buffer_manager] Allocated weights buffer at (20237807616, 132120576) [2026-04-08 07:40:52.795135 INFO buffer_manager] Allocated weights buffer at (20369928192, 57344) [2026-04-08 07:40:52.795137 INFO buffer_manager] Allocated weights buffer at (20369985536, 0) [2026-04-08 07:40:52.795138 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=53, cache_slot=53) planned desc only [2026-04-08 07:40:52.849286 INFO buffer_manager] Allocated weights buffer at (20369985536, 0) [2026-04-08 07:40:52.849303 INFO buffer_manager] Allocated weights buffer at (20369985536, 132120576) [2026-04-08 07:40:52.849305 INFO buffer_manager] Allocated weights buffer at (20502106112, 57344) [2026-04-08 07:40:52.849306 INFO buffer_manager] Allocated weights buffer at (20502163456, 132120576) [2026-04-08 07:40:52.849308 INFO buffer_manager] Allocated weights buffer at (20634284032, 57344) [2026-04-08 07:40:52.849309 INFO buffer_manager] Allocated weights buffer at (20634341376, 132120576) [2026-04-08 07:40:52.849310 INFO buffer_manager] Allocated weights buffer at (20766461952, 57344) [2026-04-08 07:40:52.849312 INFO buffer_manager] Allocated weights buffer at (20766519296, 0) [2026-04-08 07:40:52.849314 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=54, cache_slot=54) planned desc only [2026-04-08 07:40:52.906827 INFO buffer_manager] Allocated weights buffer at (20766519296, 0) [2026-04-08 07:40:52.906844 INFO buffer_manager] Allocated weights buffer at (20766519296, 132120576) [2026-04-08 07:40:52.906846 INFO buffer_manager] Allocated weights buffer at (20898639872, 57344) [2026-04-08 07:40:52.906848 INFO buffer_manager] Allocated weights buffer 
at (20898697216, 132120576) [2026-04-08 07:40:52.906849 INFO buffer_manager] Allocated weights buffer at (21030817792, 57344) [2026-04-08 07:40:52.906851 INFO buffer_manager] Allocated weights buffer at (21030875136, 132120576) [2026-04-08 07:40:52.906852 INFO buffer_manager] Allocated weights buffer at (21162995712, 57344) [2026-04-08 07:40:52.906854 INFO buffer_manager] Allocated weights buffer at (21163053056, 0) [2026-04-08 07:40:52.906856 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=55, cache_slot=55) planned desc only [2026-04-08 07:40:52.958934 INFO buffer_manager] Allocated weights buffer at (21163053056, 0) [2026-04-08 07:40:52.958960 INFO buffer_manager] Allocated weights buffer at (21163053056, 132120576) [2026-04-08 07:40:52.958962 INFO buffer_manager] Allocated weights buffer at (21295173632, 57344) [2026-04-08 07:40:52.958964 INFO buffer_manager] Allocated weights buffer at (21295230976, 132120576) [2026-04-08 07:40:52.958967 INFO buffer_manager] Allocated weights buffer at (21427351552, 57344) [2026-04-08 07:40:52.958968 INFO buffer_manager] Allocated weights buffer at (21427408896, 132120576) [2026-04-08 07:40:52.958969 INFO buffer_manager] Allocated weights buffer at (21559529472, 57344) [2026-04-08 07:40:52.958971 INFO buffer_manager] Allocated weights buffer at (21559586816, 0) [2026-04-08 07:40:52.958973 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=56, cache_slot=56) planned desc only [2026-04-08 07:40:52.998211 INFO buffer_manager] Allocated weights buffer at (21559586816, 0) [2026-04-08 07:40:52.998228 INFO buffer_manager] Allocated weights buffer at (21559586816, 132120576) [2026-04-08 07:40:52.998230 INFO buffer_manager] Allocated weights buffer at (21691707392, 57344) [2026-04-08 07:40:52.998232 INFO buffer_manager] Allocated weights buffer at (21691764736, 132120576) [2026-04-08 07:40:52.998233 INFO buffer_manager] Allocated weights buffer at (21823885312, 57344) [2026-04-08 07:40:52.998234 INFO 
buffer_manager] Allocated weights buffer at (21823942656, 132120576) [2026-04-08 07:40:52.998236 INFO buffer_manager] Allocated weights buffer at (21956063232, 57344) [2026-04-08 07:40:52.998238 INFO buffer_manager] Allocated weights buffer at (21956120576, 0) [2026-04-08 07:40:52.998239 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=57, cache_slot=57) planned desc only [2026-04-08 07:40:53.034588 INFO buffer_manager] Allocated weights buffer at (21956120576, 0) [2026-04-08 07:40:53.034603 INFO buffer_manager] Allocated weights buffer at (21956120576, 132120576) [2026-04-08 07:40:53.034605 INFO buffer_manager] Allocated weights buffer at (22088241152, 57344) [2026-04-08 07:40:53.034607 INFO buffer_manager] Allocated weights buffer at (22088298496, 132120576) [2026-04-08 07:40:53.034608 INFO buffer_manager] Allocated weights buffer at (22220419072, 57344) [2026-04-08 07:40:53.034610 INFO buffer_manager] Allocated weights buffer at (22220476416, 132120576) [2026-04-08 07:40:53.034611 INFO buffer_manager] Allocated weights buffer at (22352596992, 57344) [2026-04-08 07:40:53.034613 INFO buffer_manager] Allocated weights buffer at (22352654336, 0) [2026-04-08 07:40:53.034615 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=58, cache_slot=58) planned desc only [2026-04-08 07:40:53.071261 INFO buffer_manager] Allocated weights buffer at (22352654336, 0) [2026-04-08 07:40:53.071275 INFO buffer_manager] Allocated weights buffer at (22352654336, 132120576) [2026-04-08 07:40:53.071277 INFO buffer_manager] Allocated weights buffer at (22484774912, 57344) [2026-04-08 07:40:53.071279 INFO buffer_manager] Allocated weights buffer at (22484832256, 132120576) [2026-04-08 07:40:53.071280 INFO buffer_manager] Allocated weights buffer at (22616952832, 57344) [2026-04-08 07:40:53.071282 INFO buffer_manager] Allocated weights buffer at (22617010176, 132120576) [2026-04-08 07:40:53.071283 INFO buffer_manager] Allocated weights buffer at (22749130752, 57344) 
[2026-04-08 07:40:53.071285 INFO buffer_manager] Allocated weights buffer at (22749188096, 0) [2026-04-08 07:40:53.071288 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=59, cache_slot=59) planned desc only [2026-04-08 07:40:53.107787 INFO buffer_manager] Allocated weights buffer at (22749188096, 0) [2026-04-08 07:40:53.107802 INFO buffer_manager] Allocated weights buffer at (22749188096, 132120576) [2026-04-08 07:40:53.107804 INFO buffer_manager] Allocated weights buffer at (22881308672, 57344) [2026-04-08 07:40:53.107805 INFO buffer_manager] Allocated weights buffer at (22881366016, 132120576) [2026-04-08 07:40:53.107807 INFO buffer_manager] Allocated weights buffer at (23013486592, 57344) [2026-04-08 07:40:53.107808 INFO buffer_manager] Allocated weights buffer at (23013543936, 132120576) [2026-04-08 07:40:53.107814 INFO buffer_manager] Allocated weights buffer at (23145664512, 57344) [2026-04-08 07:40:53.107816 INFO buffer_manager] Allocated weights buffer at (23145721856, 0) [2026-04-08 07:40:53.107818 INFO fp8_moe_dpdk] fp8_moe_dpdk: init_layer_cached(layer_idx=60, cache_slot=60) planned desc only [2026-04-08 07:40:53.470665 INFO buffer_manager] Allocated weights buffer at (23145721856, 0) [2026-04-08 07:40:53.470688 INFO buffer_manager] Allocated weights buffer at (23145721856, 132120576) [2026-04-08 07:40:53.470690 INFO buffer_manager] Allocated weights buffer at (23277842432, 57344) [2026-04-08 07:40:53.470691 INFO buffer_manager] Allocated weights buffer at (23277899776, 132120576) [2026-04-08 07:40:53.470693 INFO buffer_manager] Allocated weights buffer at (23410020352, 57344) [2026-04-08 07:40:53.470694 INFO buffer_manager] Allocated weights buffer at (23410077696, 132120576) [2026-04-08 07:40:53.470696 INFO buffer_manager] Allocated weights buffer at (23542198272, 57344) [2026-04-08 07:40:53.470697 INFO buffer_manager] Allocated weights buffer at (23542255616, 0) [2026-04-08 07:40:53.470699 INFO fp8_moe_dpdk] fp8_moe_dpdk: 
init_layer_cached(layer_idx=61, cache_slot=61) planned desc only [2026-04-08 07:41:01.362643 INFO fp8_dpdk_common] fp9 fast path forced on by default in the current kernel build [2026-04-08 07:41:01.371492 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=37, expert_tiles=37, avg_tile_batch=1.08, prepare=108.252µs, send=2.095169ms, judge_wait=5.95198ms, fetch=478.183µs, reduce=23ns; duck time-ns stats: p50=5.819163ms, p90=5.832233ms, max=5.84885ms; kernel_model: matmul=0.110100 GFLOP (18.824 GFLOP/s @ duck_max), param_stream=0.050921G (8.706 Gparam/s @ duck_max), weight_stream=54.657 MiB (9.799 GB/s @ duck_max) [2026-04-08 07:41:01.379739 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=37, expert_tiles=37, avg_tile_batch=1.08, prepare=51.133µs, send=296.564µs, judge_wait=5.791575ms, fetch=336.546µs, reduce=23ns; duck time-ns stats: p50=5.671872ms, p90=5.693506ms, max=5.697422ms; kernel_model: matmul=0.110100 GFLOP (19.325 GFLOP/s @ duck_max), param_stream=0.050921G (8.938 Gparam/s @ duck_max), weight_stream=54.657 MiB (10.059 GB/s @ duck_max) [2026-04-08 07:41:01.387909 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=36, expert_tiles=36, avg_tile_batch=1.11, prepare=34.772µs, send=297.371µs, judge_wait=5.796081ms, fetch=309.438µs, reduce=22ns; duck time-ns stats: p50=5.690919ms, p90=5.699844ms, max=5.70525ms; kernel_model: matmul=0.110100 GFLOP (19.298 GFLOP/s @ duck_max), param_stream=0.049545G (8.684 Gparam/s @ duck_max), weight_stream=53.179 MiB (9.774 GB/s @ duck_max) [2026-04-08 07:41:01.396091 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=34, expert_tiles=34, avg_tile_batch=1.18, prepare=9.857µs, send=298.428µs, judge_wait=5.863207ms, fetch=310.438µs, reduce=22ns; duck time-ns stats: p50=5.74633ms, p90=5.756723ms, max=5.772377ms; kernel_model: matmul=0.110100 GFLOP (19.074 
GFLOP/s @ duck_max), param_stream=0.046793G (8.106 Gparam/s @ duck_max), weight_stream=50.225 MiB (9.124 GB/s @ duck_max) [2026-04-08 07:41:01.404141 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=35, expert_tiles=35, avg_tile_batch=1.14, prepare=9.571µs, send=297.212µs, judge_wait=5.748092ms, fetch=312.782µs, reduce=25ns; duck time-ns stats: p50=5.623009ms, p90=5.637573ms, max=5.653562ms; kernel_model: matmul=0.110100 GFLOP (19.475 GFLOP/s @ duck_max), param_stream=0.048169G (8.520 Gparam/s @ duck_max), weight_stream=51.702 MiB (9.589 GB/s @ duck_max) [2026-04-08 07:41:01.412185 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=33, expert_tiles=33, avg_tile_batch=1.21, prepare=9.093µs, send=296.455µs, judge_wait=5.724925ms, fetch=313.81µs, reduce=20ns; duck time-ns stats: p50=5.606097ms, p90=5.618178ms, max=5.631332ms; kernel_model: matmul=0.110100 GFLOP (19.551 GFLOP/s @ duck_max), param_stream=0.045416G (8.065 Gparam/s @ duck_max), weight_stream=48.748 MiB (9.077 GB/s @ duck_max) [2026-04-08 07:41:01.420219 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=35, expert_tiles=35, avg_tile_batch=1.14, prepare=8.999µs, send=296.692µs, judge_wait=5.728581ms, fetch=309.256µs, reduce=24ns; duck time-ns stats: p50=5.609807ms, p90=5.626478ms, max=5.633531ms; kernel_model: matmul=0.110100 GFLOP (19.544 GFLOP/s @ duck_max), param_stream=0.048169G (8.550 Gparam/s @ duck_max), weight_stream=51.702 MiB (9.623 GB/s @ duck_max) [2026-04-08 07:41:01.428229 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=34, expert_tiles=34, avg_tile_batch=1.18, prepare=9.162µs, send=296.949µs, judge_wait=5.719857ms, fetch=308.395µs, reduce=22ns; duck time-ns stats: p50=5.60913ms, p90=5.621373ms, max=5.626515ms; kernel_model: matmul=0.110100 GFLOP (19.568 GFLOP/s @ duck_max), param_stream=0.046793G (8.316 Gparam/s @ 
duck_max), weight_stream=50.225 MiB (9.360 GB/s @ duck_max) [2026-04-08 07:41:01.435759 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=31, expert_tiles=31, avg_tile_batch=1.29, prepare=8.903µs, send=297.467µs, judge_wait=5.222535ms, fetch=309.964µs, reduce=22ns; duck time-ns stats: p50=5.108442ms, p90=5.122396ms, max=5.1286ms; kernel_model: matmul=0.110100 GFLOP (21.468 GFLOP/s @ duck_max), param_stream=0.042664G (8.319 Gparam/s @ duck_max), weight_stream=45.793 MiB (9.363 GB/s @ duck_max) [2026-04-08 07:41:01.443546 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=34, expert_tiles=34, avg_tile_batch=1.18, prepare=9.194µs, send=297.918µs, judge_wait=5.485287ms, fetch=311.327µs, reduce=23ns; duck time-ns stats: p50=5.37141ms, p90=5.387157ms, max=5.391378ms; kernel_model: matmul=0.110100 GFLOP (20.422 GFLOP/s @ duck_max), param_stream=0.046793G (8.679 Gparam/s @ duck_max), weight_stream=50.225 MiB (9.768 GB/s @ duck_max) [2026-04-08 07:41:01.451186 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, expert_tiles=30, avg_tile_batch=1.33, prepare=8.887µs, send=297.633µs, judge_wait=5.341592ms, fetch=308.966µs, reduce=25ns; duck time-ns stats: p50=5.234644ms, p90=5.246578ms, max=5.248126ms; kernel_model: matmul=0.110100 GFLOP (20.979 GFLOP/s @ duck_max), param_stream=0.041288G (7.867 Gparam/s @ duck_max), weight_stream=44.316 MiB (8.854 GB/s @ duck_max) [2026-04-08 07:41:01.459227 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, expert_tiles=30, avg_tile_batch=1.33, prepare=8.953µs, send=297.387µs, judge_wait=5.720666ms, fetch=308.275µs, reduce=24ns; duck time-ns stats: p50=5.579694ms, p90=5.598213ms, max=5.626427ms; kernel_model: matmul=0.110100 GFLOP (19.568 GFLOP/s @ duck_max), param_stream=0.041288G (7.338 Gparam/s @ duck_max), weight_stream=44.316 MiB (8.259 GB/s @ duck_max) 
[2026-04-08 07:41:01.466856 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=25, expert_tiles=25, avg_tile_batch=1.60, prepare=8.584µs, send=297.724µs, judge_wait=5.293141ms, fetch=309.673µs, reduce=23ns; duck time-ns stats: p50=5.178185ms, p90=5.188749ms, max=5.199526ms; kernel_model: matmul=0.110100 GFLOP (21.175 GFLOP/s @ duck_max), param_stream=0.034406G (6.617 Gparam/s @ duck_max), weight_stream=36.930 MiB (7.448 GB/s @ duck_max) [2026-04-08 07:41:01.474593 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, expert_tiles=30, avg_tile_batch=1.33, prepare=8.994µs, send=297.246µs, judge_wait=5.463416ms, fetch=309.447µs, reduce=22ns; duck time-ns stats: p50=5.350024ms, p90=5.357891ms, max=5.37084ms; kernel_model: matmul=0.110100 GFLOP (20.500 GFLOP/s @ duck_max), param_stream=0.041288G (7.687 Gparam/s @ duck_max), weight_stream=44.316 MiB (8.652 GB/s @ duck_max) [2026-04-08 07:41:01.482735 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, expert_tiles=30, avg_tile_batch=1.33, prepare=8.738µs, send=298.331µs, judge_wait=5.837424ms, fetch=313.694µs, reduce=21ns; duck time-ns stats: p50=5.709747ms, p90=5.729842ms, max=5.743544ms; kernel_model: matmul=0.110100 GFLOP (19.169 GFLOP/s @ duck_max), param_stream=0.041288G (7.189 Gparam/s @ duck_max), weight_stream=44.316 MiB (8.091 GB/s @ duck_max) [2026-04-08 07:41:01.490006 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=28, expert_tiles=28, avg_tile_batch=1.43, prepare=8.855µs, send=298.86µs, judge_wait=4.961625ms, fetch=311.548µs, reduce=245ns; duck time-ns stats: p50=4.84461ms, p90=4.861357ms, max=4.868172ms; kernel_model: matmul=0.110100 GFLOP (22.616 GFLOP/s @ duck_max), param_stream=0.038535G (7.916 Gparam/s @ duck_max), weight_stream=41.362 MiB (8.909 GB/s @ duck_max) [2026-04-08 07:41:01.498036 INFO fp8_moe_dpdk] MoE prefill 
forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=32, expert_tiles=32, avg_tile_batch=1.25, prepare=9.436µs, send=297.77µs, judge_wait=5.726759ms, fetch=308.544µs, reduce=23ns; duck time-ns stats: p50=5.602338ms, p90=5.613279ms, max=5.632981ms; kernel_model: matmul=0.110100 GFLOP (19.546 GFLOP/s @ duck_max), param_stream=0.044040G (7.818 Gparam/s @ duck_max), weight_stream=47.271 MiB (8.799 GB/s @ duck_max) [2026-04-08 07:41:01.505690 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=31, expert_tiles=31, avg_tile_batch=1.29, prepare=8.862µs, send=298.127µs, judge_wait=5.359553ms, fetch=307.978µs, reduce=25ns; duck time-ns stats: p50=5.244395ms, p90=5.258316ms, max=5.265361ms; kernel_model: matmul=0.110100 GFLOP (20.910 GFLOP/s @ duck_max), param_stream=0.042664G (8.103 Gparam/s @ duck_max), weight_stream=45.793 MiB (9.120 GB/s @ duck_max) [2026-04-08 07:41:01.513089 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=28, expert_tiles=28, avg_tile_batch=1.43, prepare=8.927µs, send=296.127µs, judge_wait=5.34687ms, fetch=308.636µs, reduce=21ns; duck time-ns stats: p50=5.228435ms, p90=5.242017ms, max=5.256364ms; kernel_model: matmul=0.110100 GFLOP (20.946 GFLOP/s @ duck_max), param_stream=0.038535G (7.331 Gparam/s @ duck_max), weight_stream=41.362 MiB (8.251 GB/s @ duck_max) [2026-04-08 07:41:01.520508 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=33, expert_tiles=33, avg_tile_batch=1.21, prepare=8.845µs, send=297.6µs, judge_wait=5.364145ms, fetch=309.241µs, reduce=19ns; duck time-ns stats: p50=5.262524ms, p90=5.272981ms, max=5.27622ms; kernel_model: matmul=0.110100 GFLOP (20.867 GFLOP/s @ duck_max), param_stream=0.045416G (8.608 Gparam/s @ duck_max), weight_stream=48.748 MiB (9.688 GB/s @ duck_max) [2026-04-08 07:41:01.528524 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, 
unique_experts=31, expert_tiles=31, avg_tile_batch=1.29, prepare=8.629µs, send=297.043µs, judge_wait=5.999258ms, fetch=309.083µs, reduce=23ns; duck time-ns stats: p50=5.874753ms, p90=5.886601ms, max=5.907073ms; kernel_model: matmul=0.110100 GFLOP (18.639 GFLOP/s @ duck_max), param_stream=0.042664G (7.223 Gparam/s @ duck_max), weight_stream=45.793 MiB (8.129 GB/s @ duck_max) [2026-04-08 07:41:01.535925 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=26, expert_tiles=26, avg_tile_batch=1.54, prepare=8.729µs, send=297.422µs, judge_wait=5.338333ms, fetch=309.528µs, reduce=21ns; duck time-ns stats: p50=5.224542ms, p90=5.235741ms, max=5.248109ms; kernel_model: matmul=0.110100 GFLOP (20.979 GFLOP/s @ duck_max), param_stream=0.035783G (6.818 Gparam/s @ duck_max), weight_stream=38.407 MiB (7.674 GB/s @ duck_max) [2026-04-08 07:41:01.542941 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=29, expert_tiles=29, avg_tile_batch=1.38, prepare=8.458µs, send=297.779µs, judge_wait=4.966535ms, fetch=311.493µs, reduce=21ns; duck time-ns stats: p50=4.854679ms, p90=4.873919ms, max=4.880012ms; kernel_model: matmul=0.110100 GFLOP (22.562 GFLOP/s @ duck_max), param_stream=0.039911G (8.179 Gparam/s @ duck_max), weight_stream=42.839 MiB (9.205 GB/s @ duck_max) [2026-04-08 07:41:01.550294 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=29, expert_tiles=29, avg_tile_batch=1.38, prepare=8.8µs, send=298.007µs, judge_wait=5.310101ms, fetch=310.008µs, reduce=21ns; duck time-ns stats: p50=5.17728ms, p90=5.192206ms, max=5.204284ms; kernel_model: matmul=0.110100 GFLOP (21.156 GFLOP/s @ duck_max), param_stream=0.039911G (7.669 Gparam/s @ duck_max), weight_stream=42.839 MiB (8.631 GB/s @ duck_max) [2026-04-08 07:41:01.557330 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=27, expert_tiles=27, avg_tile_batch=1.48, 
prepare=8.858µs, send=297.405µs, judge_wait=4.974419ms, fetch=311.709µs, reduce=20ns; duck time-ns stats: p50=4.854076ms, p90=4.861229ms, max=4.868662ms; kernel_model: matmul=0.110100 GFLOP (22.614 GFLOP/s @ duck_max), param_stream=0.037159G (7.632 Gparam/s @ duck_max), weight_stream=39.884 MiB (8.590 GB/s @ duck_max) [2026-04-08 07:41:01.564718 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=27, expert_tiles=27, avg_tile_batch=1.48, prepare=8.676µs, send=297.194µs, judge_wait=5.340673ms, fetch=312.042µs, reduce=20ns; duck time-ns stats: p50=5.227152ms, p90=5.237184ms, max=5.249192ms; kernel_model: matmul=0.110100 GFLOP (20.975 GFLOP/s @ duck_max), param_stream=0.037159G (7.079 Gparam/s @ duck_max), weight_stream=39.884 MiB (7.967 GB/s @ duck_max) [2026-04-08 07:41:01.572057 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, expert_tiles=30, avg_tile_batch=1.33, prepare=8.511µs, send=297.144µs, judge_wait=5.296692ms, fetch=309.433µs, reduce=23ns; duck time-ns stats: p50=5.194786ms, p90=5.202018ms, max=5.204819ms; kernel_model: matmul=0.110100 GFLOP (21.154 GFLOP/s @ duck_max), param_stream=0.041288G (7.933 Gparam/s @ duck_max), weight_stream=44.316 MiB (8.928 GB/s @ duck_max) [2026-04-08 07:41:01.579414 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=29, expert_tiles=29, avg_tile_batch=1.38, prepare=8.842µs, send=297.152µs, judge_wait=5.303379ms, fetch=309.502µs, reduce=22ns; duck time-ns stats: p50=5.178314ms, p90=5.194133ms, max=5.214849ms; kernel_model: matmul=0.110100 GFLOP (21.113 GFLOP/s @ duck_max), param_stream=0.039911G (7.653 Gparam/s @ duck_max), weight_stream=42.839 MiB (8.614 GB/s @ duck_max) [2026-04-08 07:41:01.587192 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=32, expert_tiles=32, avg_tile_batch=1.25, prepare=8.649µs, send=297.242µs, judge_wait=5.728928ms, 
fetch=309.772µs, reduce=22ns; duck time-ns stats: p50=5.574374ms, p90=5.604985ms, max=5.621262ms; kernel_model: matmul=0.110100 GFLOP (19.586 GFLOP/s @ duck_max), param_stream=0.044040G (7.835 Gparam/s @ duck_max), weight_stream=47.271 MiB (8.818 GB/s @ duck_max) [2026-04-08 07:41:01.594185 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=29, expert_tiles=29, avg_tile_batch=1.38, prepare=8.559µs, send=297.027µs, judge_wait=4.932815ms, fetch=310.765µs, reduce=21ns; duck time-ns stats: p50=4.807896ms, p90=4.827818ms, max=4.828687ms; kernel_model: matmul=0.110100 GFLOP (22.801 GFLOP/s @ duck_max), param_stream=0.039911G (8.265 Gparam/s @ duck_max), weight_stream=42.839 MiB (9.303 GB/s @ duck_max) [2026-04-08 07:41:01.601316 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, expert_tiles=30, avg_tile_batch=1.33, prepare=8.746µs, send=297.451µs, judge_wait=5.06948ms, fetch=310.544µs, reduce=23ns; duck time-ns stats: p50=4.970805ms, p90=4.978035ms, max=4.981541ms; kernel_model: matmul=0.110100 GFLOP (22.102 GFLOP/s @ duck_max), param_stream=0.041288G (8.288 Gparam/s @ duck_max), weight_stream=44.316 MiB (9.328 GB/s @ duck_max) [2026-04-08 07:41:01.608668 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, expert_tiles=30, avg_tile_batch=1.33, prepare=8.772µs, send=297.059µs, judge_wait=5.279075ms, fetch=313.127µs, reduce=19ns; duck time-ns stats: p50=5.15877ms, p90=5.177948ms, max=5.190474ms; kernel_model: matmul=0.110100 GFLOP (21.212 GFLOP/s @ duck_max), param_stream=0.041288G (7.955 Gparam/s @ duck_max), weight_stream=44.316 MiB (8.953 GB/s @ duck_max) [2026-04-08 07:41:01.616236 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=31, expert_tiles=31, avg_tile_batch=1.29, prepare=8.55µs, send=296.765µs, judge_wait=5.511944ms, fetch=312.493µs, reduce=20ns; duck time-ns stats: 
p50=5.377035ms, p90=5.396253ms, max=5.399517ms; kernel_model: matmul=0.110100 GFLOP (20.391 GFLOP/s @ duck_max), param_stream=0.042664G (7.901 Gparam/s @ duck_max), weight_stream=45.793 MiB (8.893 GB/s @ duck_max) [2026-04-08 07:41:01.624123 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=35, expert_tiles=35, avg_tile_batch=1.14, prepare=8.841µs, send=297.024µs, judge_wait=5.755478ms, fetch=308.757µs, reduce=22ns; duck time-ns stats: p50=5.634899ms, p90=5.652224ms, max=5.662815ms; kernel_model: matmul=0.110100 GFLOP (19.443 GFLOP/s @ duck_max), param_stream=0.048169G (8.506 Gparam/s @ duck_max), weight_stream=51.702 MiB (9.574 GB/s @ duck_max) [2026-04-08 07:41:01.631671 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=33, expert_tiles=33, avg_tile_batch=1.21, prepare=8.444µs, send=297.286µs, judge_wait=5.492121ms, fetch=311.092µs, reduce=22ns; duck time-ns stats: p50=5.375387ms, p90=5.392811ms, max=5.401397ms; kernel_model: matmul=0.110100 GFLOP (20.384 GFLOP/s @ duck_max), param_stream=0.045416G (8.408 Gparam/s @ duck_max), weight_stream=48.748 MiB (9.463 GB/s @ duck_max) [2026-04-08 07:41:01.639070 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, expert_tiles=30, avg_tile_batch=1.33, prepare=8.651µs, send=296.945µs, judge_wait=5.338945ms, fetch=312.749µs, reduce=21ns; duck time-ns stats: p50=5.227842ms, p90=5.242612ms, max=5.246799ms; kernel_model: matmul=0.110100 GFLOP (20.984 GFLOP/s @ duck_max), param_stream=0.041288G (7.869 Gparam/s @ duck_max), weight_stream=44.316 MiB (8.857 GB/s @ duck_max) [2026-04-08 07:41:01.646433 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=33, expert_tiles=33, avg_tile_batch=1.21, prepare=8.81µs, send=297.006µs, judge_wait=5.325746ms, fetch=311.182µs, reduce=20ns; duck time-ns stats: p50=5.206603ms, p90=5.2193ms, max=5.234946ms; kernel_model: 
matmul=0.110100 GFLOP (21.032 GFLOP/s @ duck_max), param_stream=0.045416G (8.676 Gparam/s @ duck_max), weight_stream=48.748 MiB (9.764 GB/s @ duck_max) [2026-04-08 07:41:01.654282 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=33, expert_tiles=33, avg_tile_batch=1.21, prepare=8.536µs, send=298.707µs, judge_wait=5.753761ms, fetch=309.956µs, reduce=23ns; duck time-ns stats: p50=5.633179ms, p90=5.652916ms, max=5.656465ms; kernel_model: matmul=0.110100 GFLOP (19.465 GFLOP/s @ duck_max), param_stream=0.045416G (8.029 Gparam/s @ duck_max), weight_stream=48.748 MiB (9.037 GB/s @ duck_max) [2026-04-08 07:41:01.662161 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=28, expert_tiles=28, avg_tile_batch=1.43, prepare=9.752µs, send=297.164µs, judge_wait=5.485847ms, fetch=309.985µs, reduce=21ns; duck time-ns stats: p50=5.366046ms, p90=5.385364ms, max=5.391171ms; kernel_model: matmul=0.110100 GFLOP (20.422 GFLOP/s @ duck_max), param_stream=0.038535G (7.148 Gparam/s @ duck_max), weight_stream=41.362 MiB (8.045 GB/s @ duck_max) [2026-04-08 07:41:01.669784 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=33, expert_tiles=33, avg_tile_batch=1.21, prepare=8.537µs, send=297.726µs, judge_wait=5.327901ms, fetch=309.638µs, reduce=22ns; duck time-ns stats: p50=5.209928ms, p90=5.228971ms, max=5.236554ms; kernel_model: matmul=0.110100 GFLOP (21.025 GFLOP/s @ duck_max), param_stream=0.045416G (8.673 Gparam/s @ duck_max), weight_stream=48.748 MiB (9.761 GB/s @ duck_max) [2026-04-08 07:41:01.677564 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=32, expert_tiles=32, avg_tile_batch=1.25, prepare=8.632µs, send=296.88µs, judge_wait=5.494595ms, fetch=309.543µs, reduce=22ns; duck time-ns stats: p50=5.374733ms, p90=5.384945ms, max=5.39832ms; kernel_model: matmul=0.110100 GFLOP (20.395 GFLOP/s @ duck_max), 
param_stream=0.044040G (8.158 Gparam/s @ duck_max), weight_stream=47.271 MiB (9.182 GB/s @ duck_max) [2026-04-08 07:41:01.685531 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=33, expert_tiles=33, avg_tile_batch=1.21, prepare=8.435µs, send=296.012µs, judge_wait=5.683292ms, fetch=309.895µs, reduce=24ns; duck time-ns stats: p50=5.562998ms, p90=5.579879ms, max=5.593395ms; kernel_model: matmul=0.110100 GFLOP (19.684 GFLOP/s @ duck_max), param_stream=0.045416G (8.120 Gparam/s @ duck_max), weight_stream=48.748 MiB (9.139 GB/s @ duck_max) [2026-04-08 07:41:01.693516 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=31, expert_tiles=31, avg_tile_batch=1.29, prepare=8.595µs, send=297.241µs, judge_wait=5.712111ms, fetch=309.677µs, reduce=22ns; duck time-ns stats: p50=5.59152ms, p90=5.611238ms, max=5.618889ms; kernel_model: matmul=0.110100 GFLOP (19.595 GFLOP/s @ duck_max), param_stream=0.042664G (7.593 Gparam/s @ duck_max), weight_stream=45.793 MiB (8.546 GB/s @ duck_max) [2026-04-08 07:41:01.701171 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=34, expert_tiles=34, avg_tile_batch=1.18, prepare=8.678µs, send=297.396µs, judge_wait=5.37348ms, fetch=310.909µs, reduce=15ns; duck time-ns stats: p50=5.261968ms, p90=5.274164ms, max=5.281785ms; kernel_model: matmul=0.110100 GFLOP (20.845 GFLOP/s @ duck_max), param_stream=0.046793G (8.859 Gparam/s @ duck_max), weight_stream=50.225 MiB (9.971 GB/s @ duck_max) [2026-04-08 07:41:01.708932 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=32, expert_tiles=32, avg_tile_batch=1.25, prepare=8.469µs, send=296.752µs, judge_wait=5.492776ms, fetch=308.091µs, reduce=17ns; duck time-ns stats: p50=5.374944ms, p90=5.393142ms, max=5.398205ms; kernel_model: matmul=0.110100 GFLOP (20.396 GFLOP/s @ duck_max), param_stream=0.044040G (8.158 Gparam/s @ duck_max), 
weight_stream=47.271 MiB (9.182 GB/s @ duck_max) [2026-04-08 07:41:01.716955 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=33, expert_tiles=33, avg_tile_batch=1.21, prepare=8.53µs, send=296.33µs, judge_wait=5.732055ms, fetch=309.533µs, reduce=16ns; duck time-ns stats: p50=5.607011ms, p90=5.633052ms, max=5.640339ms; kernel_model: matmul=0.110100 GFLOP (19.520 GFLOP/s @ duck_max), param_stream=0.045416G (8.052 Gparam/s @ duck_max), weight_stream=48.748 MiB (9.063 GB/s @ duck_max) [2026-04-08 07:41:01.724982 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=32, expert_tiles=32, avg_tile_batch=1.25, prepare=8.565µs, send=297.246µs, judge_wait=5.742125ms, fetch=311.116µs, reduce=16ns; duck time-ns stats: p50=5.609921ms, p90=5.625514ms, max=5.647806ms; kernel_model: matmul=0.110100 GFLOP (19.494 GFLOP/s @ duck_max), param_stream=0.044040G (7.798 Gparam/s @ duck_max), weight_stream=47.271 MiB (8.776 GB/s @ duck_max) [2026-04-08 07:41:01.732731 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=32, expert_tiles=32, avg_tile_batch=1.25, prepare=8.508µs, send=298.111µs, judge_wait=5.48469ms, fetch=309.037µs, reduce=17ns; duck time-ns stats: p50=5.376377ms, p90=5.387304ms, max=5.393557ms; kernel_model: matmul=0.110100 GFLOP (20.413 GFLOP/s @ duck_max), param_stream=0.044040G (8.165 Gparam/s @ duck_max), weight_stream=47.271 MiB (9.190 GB/s @ duck_max) [2026-04-08 07:41:01.740741 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, expert_tiles=30, avg_tile_batch=1.33, prepare=8.875µs, send=298.105µs, judge_wait=5.720662ms, fetch=308.763µs, reduce=18ns; duck time-ns stats: p50=5.602956ms, p90=5.619523ms, max=5.626654ms; kernel_model: matmul=0.110100 GFLOP (19.568 GFLOP/s @ duck_max), param_stream=0.041288G (7.338 Gparam/s @ duck_max), weight_stream=44.316 MiB (8.259 GB/s @ duck_max) [2026-04-08 
07:41:01.747995 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=31, expert_tiles=31, avg_tile_batch=1.29, prepare=8.735µs, send=297.959µs, judge_wait=4.974442ms, fetch=309.909µs, reduce=18ns; duck time-ns stats: p50=4.865909ms, p90=4.876033ms, max=4.882505ms; kernel_model: matmul=0.110100 GFLOP (22.550 GFLOP/s @ duck_max), param_stream=0.042664G (8.738 Gparam/s @ duck_max), weight_stream=45.793 MiB (9.835 GB/s @ duck_max) [2026-04-08 07:41:01.755925 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=31, expert_tiles=31, avg_tile_batch=1.29, prepare=8.563µs, send=297.176µs, judge_wait=5.668789ms, fetch=310.812µs, reduce=17ns; duck time-ns stats: p50=5.551231ms, p90=5.571287ms, max=5.575318ms; kernel_model: matmul=0.110100 GFLOP (19.748 GFLOP/s @ duck_max), param_stream=0.042664G (7.652 Gparam/s @ duck_max), weight_stream=45.793 MiB (8.613 GB/s @ duck_max) [2026-04-08 07:41:01.763888 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=28, expert_tiles=28, avg_tile_batch=1.43, prepare=8.701µs, send=297.498µs, judge_wait=5.696816ms, fetch=308.759µs, reduce=16ns; duck time-ns stats: p50=5.552982ms, p90=5.577745ms, max=5.603283ms; kernel_model: matmul=0.110100 GFLOP (19.649 GFLOP/s @ duck_max), param_stream=0.038535G (6.877 Gparam/s @ duck_max), weight_stream=41.362 MiB (7.740 GB/s @ duck_max) [2026-04-08 07:41:01.771469 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, expert_tiles=30, avg_tile_batch=1.33, prepare=8.279µs, send=296.44µs, judge_wait=5.309648ms, fetch=309.971µs, reduce=18ns; duck time-ns stats: p50=5.197982ms, p90=5.208538ms, max=5.219713ms; kernel_model: matmul=0.110100 GFLOP (21.093 GFLOP/s @ duck_max), param_stream=0.041288G (7.910 Gparam/s @ duck_max), weight_stream=44.316 MiB (8.903 GB/s @ duck_max) [2026-04-08 07:41:01.779173 INFO fp8_moe_dpdk] MoE prefill forward 
(Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, expert_tiles=30, avg_tile_batch=1.33, prepare=8.661µs, send=297.802µs, judge_wait=5.432526ms, fetch=311.433µs, reduce=15ns; duck time-ns stats: p50=5.307035ms, p90=5.331601ms, max=5.343111ms; kernel_model: matmul=0.110100 GFLOP (20.606 GFLOP/s @ duck_max), param_stream=0.041288G (7.727 Gparam/s @ duck_max), weight_stream=44.316 MiB (8.697 GB/s @ duck_max) [2026-04-08 07:41:01.786825 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=32, expert_tiles=32, avg_tile_batch=1.25, prepare=8.405µs, send=297.729µs, judge_wait=5.381152ms, fetch=308.479µs, reduce=18ns; duck time-ns stats: p50=5.264117ms, p90=5.280102ms, max=5.288035ms; kernel_model: matmul=0.110100 GFLOP (20.821 GFLOP/s @ duck_max), param_stream=0.044040G (8.328 Gparam/s @ duck_max), weight_stream=47.271 MiB (9.373 GB/s @ duck_max) [2026-04-08 07:41:01.794810 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, expert_tiles=30, avg_tile_batch=1.33, prepare=8.461µs, send=297.812µs, judge_wait=5.711375ms, fetch=308.924µs, reduce=18ns; duck time-ns stats: p50=5.584377ms, p90=5.591746ms, max=5.616934ms; kernel_model: matmul=0.110100 GFLOP (19.602 GFLOP/s @ duck_max), param_stream=0.041288G (7.351 Gparam/s @ duck_max), weight_stream=44.316 MiB (8.273 GB/s @ duck_max) [2026-04-08 07:41:01.802811 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=32, expert_tiles=32, avg_tile_batch=1.25, prepare=8.36µs, send=299.198µs, judge_wait=5.737655ms, fetch=308.588µs, reduce=17ns; duck time-ns stats: p50=5.6031ms, p90=5.627035ms, max=5.641617ms; kernel_model: matmul=0.110100 GFLOP (19.516 GFLOP/s @ duck_max), param_stream=0.044040G (7.806 Gparam/s @ duck_max), weight_stream=47.271 MiB (8.786 GB/s @ duck_max) [2026-04-08 07:41:01.810422 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=5, top_k=8, tasks=40, unique_experts=30, 
expert_tiles=30, avg_tile_batch=1.33, prepare=8.292µs, send=298.142µs, judge_wait=5.333869ms, fetch=309.912µs, reduce=15ns; duck time-ns stats: p50=5.2184ms, p90=5.231495ms, max=5.241033ms; kernel_model: matmul=0.110100 GFLOP (21.007 GFLOP/s @ duck_max), param_stream=0.041288G (7.878 Gparam/s @ duck_max), weight_stream=44.316 MiB (8.866 GB/s @ duck_max) [2026-04-08 07:41:01.863820 INFO fp8_moe_dpdk] MoE prefill forward (Rust): batch_size=4, top_k=8, tasks=32, unique_experts=27, expert_tiles=27, avg_tile_batch=1.19, prepare=50.739µs, send=1.877334ms, judge_wait=4.710858ms, fetch=275.686µs, reduce=19ns; duck time-ns stats: p50=4.511849ms, p90=4.520456ms, max=4.528686ms; kernel_model: matmul=0.088080 GFLOP (19.449 GFLOP/s @ duck_max), param_stream=0.037159G (8.205 Gparam/s @ duck_max), weight_stream=39.884 MiB (9.235 GB/s @ duck_max) [2026-04-08 07:41:01.872001 INFO fp8_moe_dpdk] MoE forward e2e time (Rust): 1.114316ms; phases: prepare=4.776µs, send=61.95µs, judge_wait=898.255µs, fetch=93.113µs, reduce=20ns, writeback=452ns; duck time-ns stats: p50=801.195µs, p90=806.042µs, max=811.166µs; effective_read: activated_experts=8, params=0.011010G (13.573 Gparam/s @ duck_max), memory=11.818 MiB (15.276 GB/s @ duck_max), judge_gap=87.089µs, judge_ratio=1.107x [2026-04-08 07:41:02.599556 INFO fp8_moe_dpdk] MoE forward e2e time (Rust): 2.455753ms; phases: prepare=5.551µs, send=1.453056ms, judge_wait=863.088µs, fetch=96.371µs, reduce=20ns, writeback=467ns; duck time-ns stats: p50=781.236µs, p90=784.144µs, max=790.089µs; effective_read: activated_experts=8, params=0.011010G (13.935 Gparam/s @ duck_max), memory=11.818 MiB (15.684 GB/s @ duck_max), judge_gap=72.999µs, judge_ratio=1.092x Token # 1: 748.651ms; value: next_token_ids=tensor([303], device='cuda:0') mtp accept=1 prop=303 top1=303 accp=0.999 next=draft=6640 prop=6640 olap pair=683.2ms serial=1264.0ms gain=580.8ms ratio=0.46 s0=599.9ms s1=664.0ms wait=0.2/43.5ms pred gate=device [2026-04-08 07:41:02.603575 INFO 
fp8_moe_dpdk] MoE forward e2e time (Rust): 1.05843ms; phases: prepare=3.937µs, send=61.904µs, judge_wait=864.139µs, fetch=91.581µs, reduce=20ns, writeback=429ns; duck time-ns stats: p50=775.906µs, p90=781.947µs, max=786.039µs; effective_read: activated_experts=8, params=0.011010G (14.007 Gparam/s @ duck_max), memory=11.818 MiB (15.765 GB/s @ duck_max), judge_gap=78.1µs, judge_ratio=1.099x Token # 2: 3.881ms; value: next_token_ids=tensor([6640], device='cuda:0') mtp accept=1 prop=6640 top1=6640 accp=1.000 next=pair draft=1299 prop=1299 pred gate=device [2026-04-08 07:41:02.725008 INFO fp8_moe_dpdk] MoE forward e2e time (Rust): 1.06582ms; phases: prepare=3.333µs, send=61.55µs, judge_wait=872.166µs, fetch=91.16µs, reduce=19ns, writeback=607ns; duck time-ns stats: p50=791.624µs, p90=796.697µs, max=798.05µs; effective_read: activated_experts=8, params=0.011010G (13.796 Gparam/s @ duck_max), memory=11.818 MiB (15.527 GB/s @ duck_max), judge_gap=74.116µs, judge_ratio=1.093x Token # 3: 121.550ms; value: next_token_ids=tensor([1299], device='cuda:0') mtp accept=1 prop=1299 top1=740 accp=0.256 next=draft=740 prop=740 olap pair=116.1ms serial=207.8ms gain=91.7ms ratio=0.44 s0=4.4ms s1=203.4ms wait=0.1/50.3ms pred gate=device [2026-04-08 07:41:02.728957 INFO fp8_moe_dpdk] MoE forward e2e time (Rust): 1.050791ms; phases: prepare=3.13µs, send=60.486µs, judge_wait=858.546µs, fetch=91.427µs, reduce=19ns, writeback=532ns; duck time-ns stats: p50=774.264µs, p90=780.943µs, max=784.146µs; effective_read: activated_experts=8, params=0.011010G (14.041 Gparam/s @ duck_max), memory=11.818 MiB (15.803 GB/s @ duck_max), judge_gap=74.4µs, judge_ratio=1.095x Token # 4: 3.909ms; value: next_token_ids=tensor([740], device='cuda:0') mtp accept=1 prop=740 top1=740 accp=0.965 next=pair draft=9128 prop=9128 pred gate=device [2026-04-08 07:41:02.849440 INFO fp8_moe_dpdk] MoE forward e2e time (Rust): 1.055531ms; phases: prepare=3.532µs, send=61.738µs, judge_wait=862.051µs, fetch=91.117µs, 
reduce=20ns, writeback=401ns; duck time-ns stats: p50=778.221µs, p90=783.596µs, max=787.669µs; effective_read: activated_experts=8, params=0.011010G (13.978 Gparam/s @ duck_max), memory=11.818 MiB (15.732 GB/s @ duck_max), judge_gap=74.382µs, judge_ratio=1.094x Token # 5: 120.515ms; value: next_token_ids=tensor([32920], device='cuda:0') mtp accept=0 prop=9128 top1=32920 accp=0.320 next=draft=18341 prop=18341 olap pair=115.1ms serial=205.8ms gain=90.7ms ratio=0.44 s0=4.3ms s1=201.5ms wait=0.1/50.6ms pred gate=device [2026-04-08 07:41:02.968035 INFO fp8_moe_dpdk] MoE forward e2e time (Rust): 1.018585ms; phases: prepare=3.72µs, send=61.641µs, judge_wait=824.12µs, fetch=91.379µs, reduce=21ns, writeback=528ns; duck time-ns stats: p50=741.689µs, p90=747.773µs, max=749.878µs; effective_read: activated_experts=8, params=0.011010G (14.682 Gparam/s @ duck_max), memory=11.818 MiB (16.525 GB/s @ duck_max), judge_gap=74.242µs, judge_ratio=1.099x Token # 6: 118.656ms; value: next_token_ids=tensor([18341], device='cuda:0') mtp accept=1 prop=18341 top1=18341 accp=0.989 next=draft=56020 prop=56020 olap pair=113.2ms serial=201.8ms gain=88.6ms ratio=0.44 s0=4.8ms s1=197.0ms wait=0.1/49.9ms pred gate=device [2026-04-08 07:41:02.972052 INFO fp8_moe_dpdk] MoE forward e2e time (Rust): 1.059591ms; phases: prepare=2.935µs, send=61.189µs, judge_wait=867.589µs, fetch=91.164µs, reduce=20ns, writeback=474ns; duck time-ns stats: p50=783.327µs, p90=791.627µs, max=793.381µs; effective_read: activated_experts=8, params=0.011010G (13.877 Gparam/s @ duck_max), memory=11.818 MiB (15.619 GB/s @ duck_max), judge_gap=74.208µs, judge_ratio=1.094x Token # 7: 3.846ms; value: next_token_ids=tensor([56020], device='cuda:0') mtp accept=1 prop=56020 top1=56020 accp=0.717 next=pair draft=303 prop=303 pred gate=device [2026-04-08 07:41:03.091750 INFO fp8_moe_dpdk] MoE forward e2e time (Rust): 1.081921ms; phases: prepare=3.562µs, send=61.487µs, judge_wait=888.75µs, fetch=90.632µs, reduce=19ns, writeback=498ns; 
duck time-ns stats: p50=801.607µs, p90=806.142µs, max=813.395µs; effective_read: activated_experts=8, params=0.011010G (13.536 Gparam/s @ duck_max), memory=11.818 MiB (15.235 GB/s @ duck_max), judge_gap=75.355µs, judge_ratio=1.093x Token # 8: 119.797ms; value: next_token_ids=tensor([320], device='cuda:0') mtp accept=0 prop=303 top1=320 accp=0.341 next=draft=3279 prop=31854 olap pair=114.5ms serial=204.5ms gain=90.0ms ratio=0.44 s0=4.3ms s1=200.2ms wait=0.1/50.4ms pred gate=device [2026-04-08 07:41:03.210916 INFO fp8_moe_dpdk] MoE forward e2e time (Rust): 1.106818ms; phases: prepare=3.91µs, send=61.856µs, judge_wait=899.089µs, fetch=102.875µs, reduce=20ns, writeback=491ns; duck time-ns stats: p50=794.422µs, p90=801.04µs, max=811.181µs; effective_read: activated_experts=8, params=0.011010G (13.573 Gparam/s @ duck_max), memory=11.818 MiB (15.276 GB/s @ duck_max), judge_gap=87.908µs, judge_ratio=1.108x Token # 9: 119.182ms; value: next_token_ids=tensor([31854], device='cuda:0') mtp accept=1 prop=31854 top1=3279 accp=0.657 next=draft=1168 prop=1168 olap pair=113.7ms serial=203.0ms gain=89.3ms ratio=0.44 s0=4.3ms s1=198.7ms wait=0.1/50.3ms pred gate=device Token # 10: 3.895ms; value: next_token_ids=tensor([1168], device='cuda:0') mtp accept=1 prop=1168 top1=1168 accp=0.490 next=pair draft=13295 prop=13295 pred gate=device Token # 11: 121.360ms; value: next_token_ids=tensor([13295], device='cuda:0') mtp accept=1 prop=13295 top1=13295 accp=1.000 next=draft=303 prop=303 olap pair=116.0ms serial=207.3ms gain=91.3ms ratio=0.44 s0=4.4ms s1=202.9ms wait=0.1/50.4ms pred gate=device Token # 12: 3.845ms; value: next_token_ids=tensor([303], device='cuda:0') mtp accept=1 prop=303 top1=303 accp=1.000 next=pair draft=15206 prop=15206 pred gate=device Token # 13: 119.217ms; value: next_token_ids=tensor([15206], device='cuda:0') mtp accept=1 prop=15206 top1=15206 accp=0.688 next=draft=1043 prop=61852 olap pair=113.9ms serial=203.4ms gain=89.5ms ratio=0.44 s0=4.4ms s1=199.0ms 
wait=0.1/50.3ms pred gate=device Token # 14: 3.809ms; value: next_token_ids=tensor([1134], device='cuda:0') mtp accept=0 prop=61852 top1=1134 accp=0.059 next=pair draft=10251 prop=10251 pred gate=device Token # 15: 119.767ms; value: next_token_ids=tensor([10251], device='cuda:0') mtp accept=1 prop=10251 top1=10251 accp=0.849 next=draft=13503 prop=13503 olap pair=114.4ms serial=204.3ms gain=89.9ms ratio=0.44 s0=4.3ms s1=200.1ms wait=0.1/50.4ms pred gate=device Token # 16: 3.821ms; value: next_token_ids=tensor([20143], device='cuda:0') mtp accept=0 prop=13503 top1=18596 accp=0.004 next=pair draft=5402 prop=5402 pred gate=device Token # 17: 119.337ms; value: next_token_ids=tensor([6742], device='cuda:0') mtp accept=0 prop=5402 top1=6742 accp=0.074 next=draft=303 prop=303 olap pair=114.0ms serial=203.5ms gain=89.5ms ratio=0.44 s0=4.4ms s1=199.1ms wait=0.1/50.5ms pred gate=device Token # 18: 118.149ms; value: next_token_ids=tensor([303], device='cuda:0') mtp accept=1 prop=303 top1=303 accp=0.996 next=draft=4029 prop=4029 olap pair=112.7ms serial=200.9ms gain=88.3ms ratio=0.44 s0=4.8ms s1=196.1ms wait=0.1/49.7ms pred gate=device Token # 19: 3.834ms; value: next_token_ids=tensor([4029], device='cuda:0') mtp accept=1 prop=4029 top1=4029 accp=0.960 next=pair draft=3864 prop=3864 pred gate=device Token # 20: 118.612ms; value: next_token_ids=tensor([3864], device='cuda:0') mtp accept=1 prop=3864 top1=3864 accp=0.688 next=draft=20839 prop=20839 olap pair=113.2ms serial=202.2ms gain=88.9ms ratio=0.44 s0=4.2ms s1=198.0ms wait=0.1/51.0ms pred gate=device Token # 21: 3.752ms; value: next_token_ids=tensor([12073], device='cuda:0') mtp accept=0 prop=20839 top1=9992 accp=0.083 next=pair draft=27318 prop=27318 pred gate=device Token # 22: 119.475ms; value: next_token_ids=tensor([13503], device='cuda:0') mtp accept=0 prop=27318 top1=13503 accp=0.450 next=draft=1134 prop=1134 olap pair=114.1ms serial=203.7ms gain=89.6ms ratio=0.44 s0=4.3ms s1=199.4ms wait=0.1/50.4ms pred gate=device 
Token # 23: 118.200ms; value: next_token_ids=tensor([52334], device='cuda:0') mtp accept=0 prop=1134 top1=52334 accp=0.005 next=draft=320 prop=320 olap pair=112.9ms serial=201.3ms gain=88.4ms ratio=0.44 s0=4.3ms s1=197.0ms wait=0.1/50.6ms pred gate=device Token # 24: 116.885ms; value: next_token_ids=tensor([320], device='cuda:0') mtp accept=1 prop=320 top1=320 accp=0.925 next=draft=2386 prop=2386 olap pair=111.6ms serial=198.8ms gain=87.2ms ratio=0.44 s0=4.4ms s1=194.4ms wait=0.1/50.4ms pred gate=device Token # 25: 3.829ms; value: next_token_ids=tensor([19484], device='cuda:0') mtp accept=0 prop=2386 top1=19484 accp=0.223 next=pair draft=9701 prop=9701 pred gate=device Token # 26: 119.791ms; value: next_token_ids=tensor([9701], device='cuda:0') mtp accept=1 prop=9701 top1=9701 accp=0.979 next=draft=3803 prop=3803 olap pair=114.4ms serial=203.4ms gain=89.0ms ratio=0.44 s0=4.3ms s1=199.2ms wait=0.1/46.0ms pred gate=device Token # 27: 3.696ms; value: next_token_ids=tensor([3803], device='cuda:0') mtp accept=1 prop=3803 top1=3803 accp=1.000 next=pair draft=303 prop=303 pred gate=device Token # 28: 116.716ms; value: next_token_ids=tensor([303], device='cuda:0') mtp accept=1 prop=303 top1=303 accp=0.998 next=draft=4036 prop=4036 olap pair=111.6ms serial=199.0ms gain=87.4ms ratio=0.44 s0=4.3ms s1=194.7ms wait=0.1/44.6ms pred gate=device Token # 29: 3.699ms; value: next_token_ids=tensor([4036], device='cuda:0') mtp accept=1 prop=4036 top1=37366 accp=0.328 next=pair draft=37366 prop=37366 pred gate=device Token # 30: 116.550ms; value: next_token_ids=tensor([37366], device='cuda:0') mtp accept=1 prop=37366 top1=37366 accp=0.991 next=draft=27701 prop=27701 olap pair=111.3ms serial=198.6ms gain=87.3ms ratio=0.44 s0=4.3ms s1=194.3ms wait=0.1/45.0ms pred gate=device Token # 31: 3.745ms; value: next_token_ids=tensor([27701], device='cuda:0') mtp accept=1 prop=27701 top1=27701 accp=0.998 next=pair draft=29368 prop=29368 pred gate=device Token # 32: 117.285ms; value: 
next_token_ids=tensor([27867], device='cuda:0') mtp accept=0 prop=29368 top1=27867 accp=0.276 next=draft=478 prop=478 olap pair=112.0ms serial=199.0ms gain=87.1ms ratio=0.44 s0=4.2ms s1=194.8ms wait=0.1/44.9ms pred gate=device Token # 33: 116.468ms; value: next_token_ids=tensor([478], device='cuda:0') mtp accept=1 prop=478 top1=478 accp=0.998 next=draft=26260 prop=26260 olap pair=111.2ms serial=196.1ms gain=84.9ms ratio=0.43 s0=4.2ms s1=191.9ms wait=0.1/45.6ms pred gate=device Token # 34: 3.710ms; value: next_token_ids=tensor([26260], device='cuda:0') mtp accept=1 prop=26260 top1=26260 accp=0.636 next=pair draft=16690 prop=16690 pred gate=device Token # 35: 117.229ms; value: next_token_ids=tensor([16690], device='cuda:0') mtp accept=1 prop=16690 top1=4618 accp=0.254 next=draft=1207 prop=78978 olap pair=112.1ms serial=198.9ms gain=86.8ms ratio=0.44 s0=4.0ms s1=194.8ms wait=0.1/45.5ms pred gate=device Token # 36: 3.723ms; value: next_token_ids=tensor([1207], device='cuda:0') mtp accept=0 prop=78978 top1=1207 accp=0.626 next=pair draft=47321 prop=6710 pred gate=device Token # 37: 117.105ms; value: next_token_ids=tensor([4534], device='cuda:0') mtp accept=0 prop=6710 top1=4534 accp=0.042 next=draft=11053 prop=1029 olap pair=111.9ms serial=199.7ms gain=87.8ms ratio=0.44 s0=3.7ms s1=195.9ms wait=0.1/46.2ms pred gate=device Token # 38: 116.813ms; value: next_token_ids=tensor([11053], device='cuda:0') mtp accept=0 prop=1029 top1=11053 accp=0.788 next=draft=87546 prop=27316 olap pair=111.6ms serial=198.8ms gain=87.2ms ratio=0.44 s0=4.2ms s1=194.6ms wait=0.1/45.0ms pred gate=device Token # 39: 117.014ms; value: next_token_ids=tensor([87546], device='cuda:0') mtp accept=0 prop=27316 top1=87546 accp=0.609 next=draft=1882 prop=1882 olap pair=110.9ms serial=195.9ms gain=85.0ms ratio=0.43 s0=4.5ms s1=191.3ms wait=0.1/45.4ms pred gate=device Token # 40: 116.440ms; value: next_token_ids=tensor([1882], device='cuda:0') mtp accept=1 prop=1882 top1=1882 accp=0.996 next=draft=303 
prop=303 olap pair=111.0ms serial=196.7ms gain=85.8ms ratio=0.44 s0=5.3ms s1=191.5ms wait=0.1/44.4ms pred gate=device Token # 41: 3.648ms; value: next_token_ids=tensor([303], device='cuda:0') mtp accept=1 prop=303 top1=303 accp=1.000 next=pair draft=1735 prop=5469 pred gate=device Token # 42: 115.948ms; value: next_token_ids=tensor([12065], device='cuda:0') mtp accept=0 prop=5469 top1=12065 accp=0.098 next=draft=15253 prop=15253 olap pair=110.8ms serial=196.6ms gain=85.8ms ratio=0.44 s0=4.2ms s1=192.4ms wait=0.1/44.9ms pred gate=device Token # 43: 117.385ms; value: next_token_ids=tensor([10934], device='cuda:0') mtp accept=0 prop=15253 top1=10934 accp=0.308 next=draft=69746 prop=17733 olap pair=112.1ms serial=198.0ms gain=85.9ms ratio=0.43 s0=4.7ms s1=193.3ms wait=0.1/44.9ms pred gate=device Token # 44: 115.700ms; value: next_token_ids=tensor([17733], device='cuda:0') mtp accept=1 prop=17733 top1=17733 accp=0.427 next=draft=30638 prop=30638 olap pair=110.3ms serial=196.2ms gain=85.9ms ratio=0.44 s0=3.9ms s1=192.3ms wait=0.1/46.0ms pred gate=device Token # 45: 3.827ms; value: next_token_ids=tensor([2833], device='cuda:0') mtp accept=0 prop=30638 top1=30638 accp=0.668 next=pair draft=19614 prop=19614 pred gate=device Token # 46: 116.282ms; value: next_token_ids=tensor([19614], device='cuda:0') mtp accept=1 prop=19614 top1=19614 accp=0.987 next=draft=303 prop=320 olap pair=111.0ms serial=197.8ms gain=86.8ms ratio=0.44 s0=4.3ms s1=193.5ms wait=0.1/44.5ms pred gate=device Token # 47: 3.639ms; value: next_token_ids=tensor([320], device='cuda:0') mtp accept=1 prop=320 top1=320 accp=0.511 next=pair draft=2684 prop=2684 pred gate=device Token # 48: 115.733ms; value: next_token_ids=tensor([7608], device='cuda:0') mtp accept=0 prop=2684 top1=7157 accp=0.115 next=draft=642 prop=642 olap pair=110.6ms serial=196.7ms gain=86.2ms ratio=0.44 s0=4.2ms s1=192.6ms wait=0.1/45.4ms pred gate=device Token # 49: 116.391ms; value: next_token_ids=tensor([589], device='cuda:0') mtp accept=0 
prop=642 top1=589 accp=0.266 next=draft=6977 prop=12635 olap pair=111.2ms serial=197.5ms gain=86.3ms ratio=0.44 s0=4.6ms s1=192.9ms wait=0.1/44.6ms pred gate=device Token # 50: 114.924ms; value: next_token_ids=tensor([6977], device='cuda:0') mtp accept=0 prop=12635 top1=6977 accp=0.770 next=draft=13730 prop=13730 olap pair=109.7ms serial=195.2ms gain=85.5ms ratio=0.44 s0=4.8ms s1=190.4ms wait=0.1/43.9ms pred gate=device Token # 51: 115.006ms; value: next_token_ids=tensor([2922], device='cuda:0') mtp accept=0 prop=13730 top1=2922 accp=0.008 next=draft=10787 prop=10787 olap pair=109.8ms serial=195.2ms gain=85.4ms ratio=0.44 s0=4.5ms s1=190.7ms wait=0.1/44.8ms pred gate=device Token # 52: 115.900ms; value: next_token_ids=tensor([93404], device='cuda:0') mtp accept=0 prop=10787 top1=93404 accp=0.049 next=draft=8035 prop=8035 olap pair=110.6ms serial=197.0ms gain=86.4ms ratio=0.44 s0=4.4ms s1=192.6ms wait=0.1/45.2ms pred gate=device Token # 53: 115.720ms; value: next_token_ids=tensor([8035], device='cuda:0') mtp accept=1 prop=8035 top1=8035 accp=0.908 next=draft=303 prop=303 olap pair=110.5ms serial=196.9ms gain=86.5ms ratio=0.44 s0=3.7ms s1=193.2ms wait=0.1/46.1ms pred gate=device Token # 54: 3.744ms; value: next_token_ids=tensor([303], device='cuda:0') mtp accept=1 prop=303 top1=303 accp=1.000 next=pair draft=1700 prop=1700 pred gate=device Token # 55: 115.455ms; value: next_token_ids=tensor([1700], device='cuda:0') mtp accept=1 prop=1700 top1=1700 accp=0.981 next=draft=6640 prop=6640 olap pair=110.2ms serial=196.1ms gain=85.9ms ratio=0.44 s0=3.8ms s1=192.2ms wait=0.1/45.9ms pred gate=device Token # 56: 3.700ms; value: next_token_ids=tensor([6640], device='cuda:0') mtp accept=1 prop=6640 top1=6640 accp=1.000 next=pair draft=2516 prop=2516 pred gate=device Token # 57: 115.364ms; value: next_token_ids=tensor([2516], device='cuda:0') mtp accept=1 prop=2516 top1=2516 accp=0.790 next=draft=659 prop=659 olap pair=110.2ms serial=196.0ms gain=85.8ms ratio=0.44 s0=3.7ms 
s1=192.2ms wait=0.1/46.0ms pred gate=device Token # 58: 3.687ms; value: next_token_ids=tensor([659], device='cuda:0') mtp accept=1 prop=659 top1=659 accp=0.548 next=pair draft=3796 prop=3796 pred gate=device Token # 59: 115.458ms; value: next_token_ids=tensor([3796], device='cuda:0') mtp accept=1 prop=3796 top1=3796 accp=0.996 next=draft=11412 prop=11412 olap pair=110.1ms serial=196.0ms gain=85.9ms ratio=0.44 s0=3.7ms s1=192.3ms wait=0.1/46.3ms pred gate=device Token # 60: 3.824ms; value: next_token_ids=tensor([11412], device='cuda:0') mtp accept=1 prop=11412 top1=11412 accp=0.963 next=pair draft=6399 prop=6399 pred gate=device Token # 61: 115.947ms; value: next_token_ids=tensor([6399], device='cuda:0') mtp accept=1 prop=6399 top1=6399 accp=1.000 next=draft=303 prop=320 olap pair=110.8ms serial=197.4ms gain=86.6ms ratio=0.44 s0=3.9ms s1=193.5ms wait=0.1/46.0ms pred gate=device Token # 62: 3.780ms; value: next_token_ids=tensor([320], device='cuda:0') mtp accept=1 prop=320 top1=320 accp=0.410 next=pair draft=48912 prop=48912 pred gate=device Token # 63: 113.888ms; value: next_token_ids=tensor([48912], device='cuda:0') mtp accept=1 prop=48912 top1=48912 accp=0.902 next=draft=1107 prop=1107 olap pair=108.7ms serial=193.5ms gain=84.8ms ratio=0.44 s0=3.8ms s1=189.7ms wait=0.1/46.2ms pred gate=device Token # 64: 3.802ms; value: next_token_ids=tensor([1107], device='cuda:0') mtp accept=1 prop=1107 top1=1107 accp=0.936 next=pair draft=558 prop=558 pred gate=device Token # 65: 113.675ms; value: next_token_ids=tensor([558], device='cuda:0') mtp accept=1 prop=558 top1=558 accp=0.998 next=draft=15253 prop=15253 olap pair=108.5ms serial=192.9ms gain=84.4ms ratio=0.44 s0=3.8ms s1=189.1ms wait=0.1/46.2ms pred gate=device Token # 66: 3.750ms; value: next_token_ids=tensor([15253], device='cuda:0') mtp accept=1 prop=15253 top1=15253 accp=1.000 next=pair draft=19680 prop=19680 pred gate=device Token # 67: 113.409ms; value: next_token_ids=tensor([19680], device='cuda:0') mtp accept=1 
prop=19680 top1=19680 accp=0.999 next=draft=16703 prop=16703 olap pair=108.2ms serial=192.5ms gain=84.3ms ratio=0.44 s0=3.9ms s1=188.6ms wait=0.1/45.9ms pred gate=device Token # 68: 3.709ms; value: next_token_ids=tensor([16703], device='cuda:0') mtp accept=1 prop=16703 top1=16703 accp=0.936 next=pair draft=40781 prop=40781 pred gate=device Token # 69: 116.238ms; value: next_token_ids=tensor([1395], device='cuda:0') mtp accept=0 prop=40781 top1=1395 accp=0.445 next=draft=40781 prop=40781 olap pair=111.0ms serial=197.9ms gain=86.8ms ratio=0.44 s0=4.3ms s1=193.6ms wait=0.1/45.0ms pred gate=device Token # 70: 115.258ms; value: next_token_ids=tensor([40781], device='cuda:0') mtp accept=1 prop=40781 top1=40781 accp=0.978 next=draft=478 prop=478 olap pair=110.0ms serial=195.6ms gain=85.7ms ratio=0.44 s0=4.6ms s1=191.0ms wait=0.1/44.7ms pred gate=device Token # 71: 3.732ms; value: next_token_ids=tensor([478], device='cuda:0') mtp accept=1 prop=478 top1=478 accp=1.000 next=pair draft=4754 prop=39267 pred gate=device Token # 72: 113.249ms; value: next_token_ids=tensor([39267], device='cuda:0') mtp accept=1 prop=39267 top1=4754 accp=0.608 next=draft=6710 prop=9209 olap pair=108.1ms serial=192.1ms gain=83.9ms ratio=0.44 s0=4.8ms s1=187.3ms wait=0.1/44.1ms pred gate=device Token # 73: 3.706ms; value: next_token_ids=tensor([6710], device='cuda:0') mtp accept=0 prop=9209 top1=6710 accp=0.651 next=pair draft=47321 prop=27316 pred gate=device Token # 74: 114.008ms; value: next_token_ids=tensor([91410], device='cuda:0') mtp accept=0 prop=27316 top1=47321 accp=0.491 next=draft=548 prop=548 olap pair=108.8ms serial=193.1ms gain=84.3ms ratio=0.44 s0=4.8ms s1=188.3ms wait=0.1/44.1ms pred gate=device Token # 75: 114.926ms; value: next_token_ids=tensor([16154], device='cuda:0') mtp accept=0 prop=548 top1=16154 accp=0.060 next=draft=303 prop=78978 olap pair=109.7ms serial=195.3ms gain=85.5ms ratio=0.44 s0=4.4ms s1=190.9ms wait=0.1/44.6ms pred gate=device Token # 76: 114.944ms; value: 
next_token_ids=tensor([78978], device='cuda:0') mtp accept=1 prop=78978 top1=303 accp=0.920 next=draft=303 prop=303 olap pair=109.7ms serial=193.3ms gain=83.6ms ratio=0.43 s0=4.7ms s1=188.7ms wait=0.1/44.7ms pred gate=device Token # 77: 3.697ms; value: next_token_ids=tensor([303], device='cuda:0') mtp accept=1 prop=303 top1=303 accp=1.000 next=pair draft=9209 prop=9209 pred gate=device Token # 78: 114.525ms; value: next_token_ids=tensor([9209], device='cuda:0') mtp accept=1 prop=9209 top1=422 accp=0.582 next=draft=5293 prop=5293 olap pair=109.4ms serial=192.4ms gain=83.0ms ratio=0.43 s0=4.7ms s1=187.7ms wait=0.1/44.9ms pred gate=device Token # 79: 3.685ms; value: next_token_ids=tensor([28269], device='cuda:0') mtp accept=0 prop=5293 top1=28269 accp=0.054 next=pair draft=16690 prop=49571 pred gate=device Token # 80: 115.167ms; value: next_token_ids=tensor([22623], device='cuda:0') mtp accept=0 prop=49571 top1=22623 accp=0.007 next=draft=1316 prop=1316 olap pair=110.0ms serial=193.7ms gain=83.7ms ratio=0.43 s0=4.7ms s1=189.0ms wait=0.1/45.1ms pred gate=device Token # 81: 115.645ms; value: next_token_ids=tensor([320], device='cuda:0') mtp accept=0 prop=1316 top1=1316 accp=0.799 next=draft=642 prop=642 olap pair=110.4ms serial=194.4ms gain=84.0ms ratio=0.43 s0=4.2ms s1=190.2ms wait=0.1/45.8ms pred gate=device Token # 82: 114.179ms; value: next_token_ids=tensor([642], device='cuda:0') mtp accept=1 prop=642 top1=642 accp=0.809 next=draft=428 prop=428 olap pair=109.0ms serial=191.7ms gain=82.7ms ratio=0.43 s0=4.1ms s1=187.6ms wait=0.1/46.0ms pred gate=device Token # 83: 3.693ms; value: next_token_ids=tensor([428], device='cuda:0') mtp accept=1 prop=428 top1=428 accp=0.875 next=pair draft=5852 prop=5852 pred gate=device Token # 84: 115.156ms; value: next_token_ids=tensor([18912], device='cuda:0') mtp accept=0 prop=5852 top1=46889 accp=0.032 next=draft=46889 prop=46889 olap pair=110.0ms serial=195.4ms gain=85.4ms ratio=0.44 s0=4.3ms s1=191.0ms wait=0.1/44.6ms pred 
gate=device Token # 85: 114.394ms; value: next_token_ids=tensor([3795], device='cuda:0') mtp accept=0 prop=46889 top1=46889 accp=0.961 next=draft=531 prop=531 olap pair=109.1ms serial=194.1ms gain=85.0ms ratio=0.44 s0=4.3ms s1=189.8ms wait=0.1/44.6ms pred gate=device Token # 86: 116.359ms; value: next_token_ids=tensor([531], device='cuda:0') mtp accept=1 prop=531 top1=531 accp=0.997 next=draft=34432 prop=34432 olap pair=111.0ms serial=198.1ms gain=87.1ms ratio=0.44 s0=4.4ms s1=193.7ms wait=0.1/45.6ms pred gate=device Token # 87: 3.695ms; value: next_token_ids=tensor([34432], device='cuda:0') mtp accept=1 prop=34432 top1=34432 accp=1.000 next=pair draft=430 prop=430 pred gate=device Token # 88: 114.414ms; value: next_token_ids=tensor([430], device='cuda:0') mtp accept=1 prop=430 top1=430 accp=0.661 next=draft=6856 prop=637 olap pair=109.2ms serial=194.4ms gain=85.1ms ratio=0.44 s0=3.7ms s1=190.6ms wait=0.1/46.5ms pred gate=device Token # 89: 3.746ms; value: next_token_ids=tensor([6856], device='cuda:0') mtp accept=0 prop=637 top1=6856 accp=0.854 next=pair draft=38879 prop=38879 pred gate=device Token # 90: 114.843ms; value: next_token_ids=tensor([38879], device='cuda:0') mtp accept=1 prop=38879 top1=38879 accp=0.999 next=draft=4270 prop=4270 olap pair=109.5ms serial=195.0ms gain=85.5ms ratio=0.44 s0=3.7ms s1=191.3ms wait=0.1/46.4ms pred gate=device Token # 91: 3.752ms; value: next_token_ids=tensor([4270], device='cuda:0') mtp accept=1 prop=4270 top1=4270 accp=0.985 next=pair draft=16730 prop=16730 pred gate=device Token # 92: 114.757ms; value: next_token_ids=tensor([16730], device='cuda:0') mtp accept=1 prop=16730 top1=16730 accp=0.901 next=draft=320 prop=320 olap pair=109.5ms serial=195.0ms gain=85.5ms ratio=0.44 s0=4.0ms s1=191.0ms wait=0.1/45.9ms pred gate=device Token # 93: 3.675ms; value: next_token_ids=tensor([320], device='cuda:0') mtp accept=1 prop=320 top1=320 accp=0.992 next=pair draft=128799 prop=128799 pred gate=device Token # 94: 113.928ms; value: 
next_token_ids=tensor([128799], device='cuda:0') mtp accept=1 prop=128799 top1=128799 accp=0.861 next=draft=30594 prop=30594 olap pair=108.8ms serial=193.4ms gain=84.6ms ratio=0.44 s0=3.9ms s1=189.5ms wait=0.1/45.9ms pred gate=device Token # 95: 3.742ms; value: next_token_ids=tensor([30594], device='cuda:0') mtp accept=1 prop=30594 top1=30594 accp=1.000 next=pair draft=1175 prop=1175 pred gate=device Token # 96: 114.341ms; value: next_token_ids=tensor([1175], device='cuda:0') mtp accept=1 prop=1175 top1=1175 accp=1.000 next=draft=70037 prop=70037 olap pair=109.2ms serial=194.2ms gain=85.0ms ratio=0.44 s0=3.7ms s1=190.4ms wait=0.1/46.5ms pred gate=device Token # 97: 3.682ms; value: next_token_ids=tensor([70037], device='cuda:0') mtp accept=1 prop=70037 top1=70037 accp=0.948 next=pair draft=6871 prop=6871 pred gate=device Token # 98: 115.391ms; value: next_token_ids=tensor([17611], device='cuda:0') mtp accept=0 prop=6871 top1=17611 accp=0.179 next=draft=804 prop=804 olap pair=110.2ms serial=196.0ms gain=85.9ms ratio=0.44 s0=3.7ms s1=192.3ms wait=0.1/46.5ms pred gate=device Token # 99: 114.523ms; value: next_token_ids=tensor([804], device='cuda:0') mtp accept=1 prop=804 top1=804 accp=1.000 next=draft=1175 prop=1175 olap pair=108.7ms serial=192.1ms gain=83.5ms ratio=0.43 s0=7.6ms s1=184.6ms wait=0.2/41.9ms pred gate=device Token # 100: 3.855ms; value: next_token_ids=tensor([1175], device='cuda:0') mtp accept=1 prop=1175 top1=1175 accp=0.999 next=pair draft=28927 prop=28927 pred gate=device Token # 101: 114.344ms; value: next_token_ids=tensor([28927], device='cuda:0') mtp accept=1 prop=28927 top1=28927 accp=0.995 next=draft=235 prop=235 olap pair=109.1ms serial=194.0ms gain=84.9ms ratio=0.44 s0=3.7ms s1=190.3ms wait=0.1/46.3ms pred gate=device Token # 102: 3.851ms; value: next_token_ids=tensor([235], device='cuda:0') mtp accept=1 prop=235 top1=235 accp=1.000 next=pair draft=223 prop=223 pred gate=device Token # 103: 113.374ms; value: next_token_ids=tensor([223], 
device='cuda:0') mtp accept=1 prop=223 top1=223 accp=0.749 next=draft=10805 prop=10805 olap pair=108.2ms serial=192.1ms gain=83.9ms ratio=0.44 s0=3.7ms s1=188.4ms wait=0.1/46.3ms pred gate=device Token # 104: 3.691ms; value: next_token_ids=tensor([10805], device='cuda:0') mtp accept=1 prop=10805 top1=10805 accp=0.988 next=pair draft=53091 prop=53091 pred gate=device Token # 105: 116.237ms; value: next_token_ids=tensor([53091], device='cuda:0') mtp accept=1 prop=53091 top1=53091 accp=0.934 next=draft=4374 prop=4374 olap pair=110.8ms serial=196.2ms gain=85.4ms ratio=0.44 s0=3.8ms s1=192.4ms wait=0.1/46.3ms pred gate=device Token # 106: 3.891ms; value: next_token_ids=tensor([4374], device='cuda:0') mtp accept=1 prop=4374 top1=4374 accp=1.000 next=pair draft=1465 prop=1465 pred gate=device Token # 107: 114.696ms; value: next_token_ids=tensor([1465], device='cuda:0') mtp accept=1 prop=1465 top1=1465 accp=1.000 next=draft=303 prop=303 olap pair=109.5ms serial=194.7ms gain=85.2ms ratio=0.44 s0=4.0ms s1=190.7ms wait=0.1/46.0ms pred gate=device Token # 108: 3.710ms; value: next_token_ids=tensor([303], device='cuda:0') mtp accept=1 prop=303 top1=303 accp=1.000 next=pair draft=1516 prop=1516 pred gate=device Token # 109: 114.778ms; value: next_token_ids=tensor([1516], device='cuda:0') mtp accept=1 prop=1516 top1=1057 accp=0.962 next=draft=17180 prop=17180 olap pair=109.6ms serial=195.0ms gain=85.4ms ratio=0.44 s0=3.7ms s1=191.3ms wait=0.1/46.5ms pred gate=device Token # 110: 3.710ms; value: next_token_ids=tensor([17180], device='cuda:0') mtp accept=1 prop=17180 top1=17180 accp=1.000 next=pair draft=1645 prop=1645 pred gate=device Token # 111: 116.322ms; value: next_token_ids=tensor([1645], device='cuda:0') mtp accept=1 prop=1645 top1=1645 accp=1.000 next=draft=4568 prop=4568 olap pair=111.2ms serial=195.8ms gain=84.7ms ratio=0.43 s0=4.2ms s1=191.7ms wait=0.1/46.0ms pred gate=device Token # 112: 3.738ms; value: next_token_ids=tensor([4568], device='cuda:0') mtp accept=1 
prop=4568 top1=4568 accp=1.000 next=pair draft=2086 prop=2086 pred gate=device Token # 113: 115.568ms; value: next_token_ids=tensor([2086], device='cuda:0') mtp accept=1 prop=2086 top1=2086 accp=1.000 next=draft=81502 prop=81502 olap pair=110.3ms serial=194.2ms gain=83.9ms ratio=0.43 s0=4.2ms s1=190.0ms wait=0.1/45.9ms pred gate=device Token # 114: 3.923ms; value: next_token_ids=tensor([81502], device='cuda:0') mtp accept=1 prop=81502 top1=81502 accp=1.000 next=pair draft=13503 prop=13503 pred gate=device Token # 115: 113.709ms; value: next_token_ids=tensor([13503], device='cuda:0') mtp accept=1 prop=13503 top1=13503 accp=1.000 next=draft=52334 prop=52334 olap pair=108.4ms serial=192.0ms gain=83.6ms ratio=0.44 s0=4.0ms s1=188.0ms wait=0.1/46.0ms pred gate=device Token # 116: 3.770ms; value: next_token_ids=tensor([52334], device='cuda:0') mtp accept=1 prop=52334 top1=52334 accp=1.000 next=pair draft=320 prop=320 pred gate=device Token # 117: 113.513ms; value: next_token_ids=tensor([478], device='cuda:0') mtp accept=0 prop=320 top1=478 accp=0.361 next=draft=10457 prop=9414 olap pair=107.6ms serial=190.8ms gain=83.2ms ratio=0.44 s0=4.4ms s1=186.4ms wait=0.1/45.4ms pred gate=device Token # 118: 115.542ms; value: next_token_ids=tensor([9414], device='cuda:0') mtp accept=1 prop=9414 top1=531 accp=0.187 next=draft=15061 prop=86196 olap pair=109.3ms serial=193.8ms gain=84.5ms ratio=0.44 s0=5.2ms s1=188.6ms wait=0.1/44.6ms pred gate=device Token # 119: 4.270ms; value: next_token_ids=tensor([86196], device='cuda:0') mtp accept=1 prop=86196 top1=27625 accp=0.248 next=pair draft=2056 prop=2056 pred gate=device Token # 120: 114.376ms; value: next_token_ids=tensor([2056], device='cuda:0') mtp accept=1 prop=2056 top1=2056 accp=1.000 next=draft=410 prop=410 olap pair=109.1ms serial=194.0ms gain=84.9ms ratio=0.44 s0=3.7ms s1=190.3ms wait=0.1/46.4ms pred gate=device Token # 121: 3.699ms; value: next_token_ids=tensor([410], device='cuda:0') mtp accept=1 prop=410 top1=410 accp=0.962 
next=pair draft=2386 prop=2386 pred gate=device
Token # 122: 113.571ms; value: next_token_ids=tensor([2386], device='cuda:0') mtp accept=1 prop=2386 top1=2386 accp=0.976 next=draft=1710 prop=1710 olap pair=108.4ms serial=192.5ms gain=84.1ms ratio=0.44 s0=3.8ms s1=188.7ms wait=0.1/46.3ms pred gate=device
Token # 123: 3.705ms; value: next_token_ids=tensor([1710], device='cuda:0') mtp accept=1 prop=1710 top1=1710 accp=0.967 next=pair draft=6399 prop=6399 pred gate=device
Token # 124: 113.879ms; value: next_token_ids=tensor([6399], device='cuda:0') mtp accept=1 prop=6399 top1=6399 accp=1.000 next=draft=303 prop=303 olap pair=108.7ms serial=193.1ms gain=84.4ms ratio=0.44 s0=3.7ms s1=189.4ms wait=0.1/46.5ms pred gate=device
Token # 125: 3.853ms; value: next_token_ids=tensor([303], device='cuda:0') mtp accept=1 prop=303 top1=303 accp=1.000 next=pair draft=4029 prop=4029 pred gate=device
Token # 126: 113.145ms; value: next_token_ids=tensor([4029], device='cuda:0') mtp accept=1 prop=4029 top1=4029 accp=0.999 next=draft=107041 prop=107041 olap pair=107.9ms serial=191.6ms gain=83.7ms ratio=0.44 s0=4.3ms s1=187.3ms wait=0.1/45.0ms pred gate=device
Token # 127: 3.706ms; value: next_token_ids=tensor([107041], device='cuda:0') mtp accept=1 prop=107041 top1=107041 accp=0.999 next=pair draft=10250 prop=10250 pred gate=device
Token # 128: 113.252ms; value: next_token_ids=tensor([10250], device='cuda:0') mtp accept=1 prop=10250 top1=10250 accp=0.975 next=draft=27318 prop=27318 olap pair=107.9ms serial=191.6ms gain=83.7ms ratio=0.44 s0=4.3ms s1=187.3ms wait=0.1/45.1ms pred gate=device
Token # 129: 3.697ms; value: next_token_ids=tensor([27318], device='cuda:0') mtp accept=1 prop=27318 top1=27318 accp=1.000 next=pair draft=303 prop=303 pred gate=device
Token # 130: 113.758ms; value: next_token_ids=tensor([303], device='cuda:0') mtp accept=1 prop=303 top1=303 accp=1.000 next=draft=531 prop=531 olap pair=108.6ms serial=192.6ms gain=84.1ms ratio=0.44 s0=4.3ms s1=188.3ms wait=0.1/44.9ms pred gate=device
Token # 131: 3.915ms; value: next_token_ids=tensor([531], device='cuda:0') mtp accept=1 prop=531 top1=531 accp=0.967 next=pair draft=20885 prop=20885 pred gate=device
Token # 132: 114.193ms; value: next_token_ids=tensor([20885], device='cuda:0') mtp accept=1 prop=20885 top1=20885 accp=0.999 next=draft=70911 prop=70911 olap pair=108.9ms serial=193.6ms gain=84.7ms ratio=0.44 s0=4.3ms s1=189.4ms wait=0.1/44.9ms pred gate=device
Token # 133: 3.725ms; value: next_token_ids=tensor([70911], device='cuda:0') mtp accept=1 prop=70911 top1=70911 accp=0.999 next=pair draft=34448 prop=34448 pred gate=device
Token # 134: 113.557ms; value: next_token_ids=tensor([34448], device='cuda:0') mtp accept=1 prop=34448 top1=34448 accp=0.994 next=draft=3796 prop=3796 olap pair=108.4ms serial=192.7ms gain=84.3ms ratio=0.44 s0=3.9ms s1=188.7ms wait=0.1/46.0ms pred gate=device
Token # 135: 3.755ms; value: next_token_ids=tensor([2833], device='cuda:0') mtp accept=0 prop=3796 top1=3796 accp=0.872 next=pair draft=1175 prop=1175 pred gate=device
Token # 136: 112.696ms; value: next_token_ids=tensor([1175], device='cuda:0') mtp accept=1 prop=1175 top1=1175 accp=0.835 next=draft=34071 prop=34071 olap pair=107.5ms serial=190.8ms gain=83.3ms ratio=0.44 s0=3.7ms s1=187.1ms wait=0.1/46.5ms pred gate=device
Token # 137: 3.658ms; value: next_token_ids=tensor([34071], device='cuda:0') mtp accept=1 prop=34071 top1=34071 accp=1.000 next=pair draft=41103 prop=41103 pred gate=device
Token # 138: 113.266ms; value: next_token_ids=tensor([41103], device='cuda:0') mtp accept=1 prop=41103 top1=25313 accp=0.421 next=draft=20750 prop=20750 olap pair=108.1ms serial=191.1ms gain=83.0ms ratio=0.43 s0=4.0ms s1=187.1ms wait=0.1/45.9ms pred gate=device
Token # 139: 3.763ms; value: next_token_ids=tensor([20750], device='cuda:0') mtp accept=1 prop=20750 top1=20750 accp=0.969 next=pair draft=2056 prop=2056 pred gate=device
Token # 140: 113.915ms; value: next_token_ids=tensor([2056], device='cuda:0') mtp accept=1 prop=2056 top1=2056 accp=0.876 next=draft=410 prop=410 olap pair=108.6ms serial=193.1ms gain=84.5ms ratio=0.44 s0=3.8ms s1=189.3ms wait=0.1/46.3ms pred gate=device
Token # 141: 3.702ms; value: next_token_ids=tensor([410], device='cuda:0') mtp accept=1 prop=410 top1=410 accp=1.000 next=pair draft=25313 prop=18724 pred gate=device
Token # 142: 113.858ms; value: next_token_ids=tensor([25313], device='cuda:0') mtp accept=0 prop=18724 top1=25313 accp=0.583 next=draft=18724 prop=18724 olap pair=108.7ms serial=193.2ms gain=84.5ms ratio=0.44 s0=3.8ms s1=189.4ms wait=0.1/46.3ms pred gate=device
Token # 143: 112.687ms; value: next_token_ids=tensor([10655], device='cuda:0') mtp accept=0 prop=18724 top1=18724 accp=0.622 next=draft=410 prop=410 olap pair=107.4ms serial=190.7ms gain=83.3ms ratio=0.44 s0=3.7ms s1=187.0ms wait=0.1/46.4ms pred gate=device
Token # 144: 113.276ms; value: next_token_ids=tensor([410], device='cuda:0') mtp accept=1 prop=410 top1=410 accp=0.985 next=draft=18724 prop=18724 olap pair=108.0ms serial=191.9ms gain=83.9ms ratio=0.44 s0=3.7ms s1=188.2ms wait=0.1/46.3ms pred gate=device
Token # 145: 3.652ms; value: next_token_ids=tensor([4398], device='cuda:0') mtp accept=0 prop=18724 top1=4398 accp=0.090 next=pair draft=25728 prop=25728 pred gate=device
Token # 146: 113.308ms; value: next_token_ids=tensor([18804], device='cuda:0') mtp accept=0 prop=25728 top1=18804 accp=0.236 next=draft=410 prop=410 olap pair=108.1ms serial=192.0ms gain=84.0ms ratio=0.44 s0=3.8ms s1=188.2ms wait=0.1/46.2ms pred gate=device
Token # 147: 112.479ms; value: next_token_ids=tensor([410], device='cuda:0') mtp accept=1 prop=410 top1=410 accp=0.942 next=draft=14592 prop=14592 olap pair=107.3ms serial=190.4ms gain=83.2ms ratio=0.44 s0=3.7ms s1=186.7ms wait=0.1/46.2ms pred gate=device
Token # 148: 3.654ms; value: next_token_ids=tensor([3803], device='cuda:0') mtp accept=0 prop=14592 top1=3803 accp=0.031 next=pair draft=25728 prop=25728 pred gate=device
Token # 149: 114.140ms; value: next_token_ids=tensor([25728], device='cuda:0') mtp accept=1 prop=25728 top1=25728 accp=0.952 next=draft=10877 prop=10877 olap pair=108.1ms serial=190.8ms gain=82.7ms ratio=0.43 s0=8.0ms s1=182.8ms wait=0.2/41.4ms pred gate=device
Token # 150: 4.581ms; value: next_token_ids=tensor([10877], device='cuda:0') mtp accept=1 prop=10877 top1=10877 accp=1.000 next=pair draft=478 prop=478 pred gate=device
Token # 151: 114.060ms; value: next_token_ids=tensor([478], device='cuda:0') mtp accept=1 prop=478 top1=478 accp=0.966 next=draft=10457 prop=10457 olap pair=108.0ms serial=190.5ms gain=82.5ms ratio=0.43 s0=8.7ms s1=181.8ms wait=0.2/40.2ms pred gate=device
Token # 152: 4.561ms; value: next_token_ids=tensor([10457], device='cuda:0') mtp accept=1 prop=10457 top1=10457 accp=0.959 next=pair draft=34071 prop=34071 pred gate=device
Token # 153: 115.035ms; value: next_token_ids=tensor([34071], device='cuda:0') mtp accept=1 prop=34071 top1=34071 accp=0.999 next=draft=34448 prop=34448 olap pair=109.0ms serial=190.7ms gain=81.7ms ratio=0.43 s0=8.6ms s1=182.0ms wait=0.2/40.2ms pred gate=device
Token # 154: 4.533ms; value: next_token_ids=tensor([34448], device='cuda:0') mtp accept=1 prop=34448 top1=34448 accp=0.892 next=pair draft=14165 prop=14165 pred gate=device
Token # 155: 115.205ms; value: next_token_ids=tensor([14165], device='cuda:0') mtp accept=1 prop=14165 top1=14165 accp=1.000 next=draft=3467 prop=3467 olap pair=109.1ms serial=190.4ms gain=81.3ms ratio=0.43 s0=8.8ms s1=181.6ms wait=0.2/40.1ms pred gate=device
Token # 156: 4.592ms; value: next_token_ids=tensor([3467], device='cuda:0') mtp accept=1 prop=3467 top1=3467 accp=1.000 next=pair draft=1148 prop=1148 pred gate=device
Token # 157: 113.543ms; value: next_token_ids=tensor([1148], device='cuda:0') mtp accept=1 prop=1148 top1=1148 accp=1.000 next=draft=18912 prop=18912 olap pair=108.3ms serial=189.7ms gain=81.4ms ratio=0.43 s0=6.3ms s1=183.3ms wait=0.2/43.1ms pred gate=device
Token # 158: 3.644ms; value: next_token_ids=tensor([18912], device='cuda:0') mtp accept=1 prop=18912 top1=18912 accp=0.972 next=pair draft=3795 prop=3795 pred gate=device
Token # 159: 115.590ms; value: next_token_ids=tensor([3795], device='cuda:0') mtp accept=1 prop=3795 top1=3795 accp=0.930 next=draft=531 prop=531 olap pair=109.7ms serial=191.9ms gain=82.2ms ratio=0.43 s0=4.5ms s1=187.4ms wait=0.1/45.6ms pred gate=device
Token # 160: 4.673ms; value: next_token_ids=tensor([531], device='cuda:0') mtp accept=1 prop=531 top1=531 accp=1.000 next=pair draft=34432 prop=34432 pred gate=device
Token # 161: 114.701ms; value: next_token_ids=tensor([34432], device='cuda:0') mtp accept=1 prop=34432 top1=34432 accp=1.000 next=draft=34448 prop=34448 olap pair=108.6ms serial=190.4ms gain=81.9ms ratio=0.43 s0=8.8ms s1=181.6ms wait=0.2/40.2ms pred gate=device
Token # 162: 4.551ms; value: next_token_ids=tensor([16690], device='cuda:0') mtp accept=0 prop=34448 top1=34448 accp=0.716 next=pair draft=718 prop=718 pred gate=device
Token # 163: 113.687ms; value: next_token_ids=tensor([718], device='cuda:0') mtp accept=1 prop=718 top1=718 accp=0.988 next=draft=97233 prop=97233 olap pair=107.5ms serial=189.6ms gain=82.0ms ratio=0.43 s0=8.7ms s1=180.8ms wait=0.2/40.4ms pred gate=device
Token # 164: 4.590ms; value: next_token_ids=tensor([97233], device='cuda:0') mtp accept=1 prop=97233 top1=97233 accp=0.992 next=pair draft=1175 prop=1175 pred gate=device
Token # 165: 114.476ms; value: next_token_ids=tensor([1175], device='cuda:0') mtp accept=1 prop=1175 top1=1175 accp=1.000 next=draft=57398 prop=57398 olap pair=108.4ms serial=190.0ms gain=81.6ms ratio=0.43 s0=7.4ms s1=182.5ms wait=0.2/41.9ms pred gate=device
Token # 166: 4.595ms; value: next_token_ids=tensor([57398], device='cuda:0') mtp accept=1 prop=57398 top1=57398 accp=0.988 next=pair draft=1 prop=1 pred gate=device
Token # 167: 116.888ms; value: next_token_ids=tensor([1], device='cuda:0') mtp accept=1 prop=1 top1=1 accp=1.000 next=none olap pair=111.5ms serial=196.6ms gain=85.1ms ratio=0.43 s0=4.6ms s1=192.0ms wait=0.1/45.3ms pred cut=device