Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.
Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).
@blackjack 说: @joker_chang 说: @rock-shi 那就对了,24G跑128K上下文+MTP资源不够 我27 q4量化,kv均q8_0量化,上下文128k,MTP, 5090laptop 24GRAM,开thinking,50+tps,快的起飞啊
@blackjack 说:
@joker_chang 说: @rock-shi 那就对了,24G跑128K上下文+MTP资源不够
@joker_chang 说:
@rock-shi 那就对了,24G跑128K上下文+MTP资源不够
我27 q4量化,kv均q8_0量化,上下文128k,MTP, 5090laptop 24GRAM,开thinking,50+tps,快的起飞啊
厉害!一样的卡,大哥能给个作业抄吗?14900k,32g内存,llama.cpp,感谢!