Commit fdac99d
fix(vlm): oom on default gpu_memory_utilization
Lower the tp=4 default from 0.95 to 0.9 to leave headroom for the
vision encoder.
Signed-off-by: chenht2022 <chenht2022@gmail.com>
1 parent d053d50 commit fdac99d
1 file changed
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 361 | 361 | |
| 362 | 362 | |
| 363 | 363 | |
| 364 | | - |
| | 364 | + |
| 365 | 365 | |
| 366 | 366 | |
| 367 | 367 | |
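The headroom argument in the commit message can be illustrated with a little arithmetic. The sketch below is not taken from the changed file: it assumes `gpu_memory_utilization` is the fraction of each GPU's memory the engine is allowed to claim, and it uses a hypothetical 80 GiB device purely as an example. The helper name `headroom_gib` is invented for illustration.

```python
def headroom_gib(total_gib: float, gpu_memory_utilization: float) -> float:
    """Return the memory (GiB) left outside the engine's budget on one GPU.

    Hypothetical helper: assumes the engine claims
    total_gib * gpu_memory_utilization and everything else
    (here, room for the vision encoder) must fit in the remainder.
    """
    if not 0.0 < gpu_memory_utilization <= 1.0:
        raise ValueError("gpu_memory_utilization must be in (0, 1]")
    return total_gib - total_gib * gpu_memory_utilization


if __name__ == "__main__":
    total = 80.0  # assumed per-GPU memory in GiB, not stated in the commit
    for util in (0.95, 0.9):  # old and new tp=4 defaults from the commit message
        print(f"utilization={util}: {headroom_gib(total, util):.1f} GiB left free")
```

Under these assumptions, moving from 0.95 to 0.9 doubles the memory left free on each GPU. Whether that is enough depends on the actual device size and the vision encoder's footprint, neither of which the commit states.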