@terry Finished testing. vLLM doesn't do well: around 18 tokens/s, so it's probably still my motherboard holding it back. Ollama holds steady at 29 tokens/s.
@terry I just learned that vLLM can also enable MTP, so I'll try a few more things and report back later.