Commit cb1d0dd

perf(sglang.sh): bump qwen36 max-running-requests 2->4
1 parent c5aaf19 commit cb1d0dd

1 file changed

Lines changed: 1 addition & 1 deletion

sglang.sh

@@ -631,7 +631,7 @@ cmd_qwen36_35b_nvfp4() {
   --quantization modelopt_mixed \
   --mem-fraction-static 0.85 \
   --context-length "${ctx}" \
-  --max-running-requests 2 \
+  --max-running-requests 4 \
   --attention-backend flashinfer \
   --linear-attn-backend triton \
   --linear-attn-prefill-backend triton \