<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[4080&amp;4090不同模型token性能测试]]></title><description><![CDATA[<p dir="auto">两块显卡：【RTX 4080 32GB】 和 【RTX 4090 48GB】， 都接的显卡坞（x4@PCI-E 3）。最近完整看完了“老特抡锤者”频道的相关视频，也参考了论坛里各位大神的经验分享，随后针对不同模型、量化版本、上下文长度以及MTP 参数进行了多轮测试。把测试结果整理出来，供大家参考。</p>
<h1>【20260529更新_2】</h1>
<h2>4090 / 4080 当前生产配置（亮点：Uncensored 模型驱动Hermes，什么活都不拒绝）</h2>
<table class="table table-bordered table-striped">
<thead>
<tr>
<th>参数</th>
<th><strong>4090</strong></th>
<th><strong>4080</strong></th>
</tr>
</thead>
<tbody>
<tr>
<td>GPU</td>
<td>RTX 4090 48GB (Ada)</td>
<td>RTX 4080 32GB (Ada)</td>
</tr>
<tr>
<td>框架</td>
<td>vLLM 0.21.0</td>
<td>vLLM 0.21.0</td>
</tr>
<tr>
<td>Service</td>
<td><code>vllm-4090-27b-fp8</code></td>
<td><code>vllm-4080-heretic-gptq</code></td>
</tr>
<tr>
<td>模型</td>
<td>官方 Qwen3.6-27B-FP8</td>
<td>llmfan46 Heretic v2 GPTQ-Int4</td>
</tr>
<tr>
<td><strong>客户端用途</strong></td>
<td><strong>Claude Code</strong></td>
<td><strong>Hermes</strong></td>
</tr>
<tr>
<td>Censored</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /> uncensored (MPOA)</td>
</tr>
<tr>
<td>量化</td>
<td>FP8 E4M3 block 128×128</td>
<td>GPTQ-Int4 (Marlin)</td>
</tr>
<tr>
<td>KV dtype</td>
<td>fp8</td>
<td>fp8</td>
</tr>
<tr>
<td>max-model-len</td>
<td>262144 (256K)</td>
<td>262144 (256K)</td>
</tr>
<tr>
<td>max-num-seqs</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>gpu-mem-util</td>
<td>0.97</td>
<td>0.96</td>
</tr>
<tr>
<td>MTP s</td>
<td>5</td>
<td>3</td>
</tr>
<tr>
<td>tool-call-parser</td>
<td>qwen3_coder</td>
<td>qwen3_coder</td>
</tr>
<tr>
<td>reasoning-parser</td>
<td>qwen3</td>
<td>qwen3</td>
</tr>
<tr>
<td>prefix-caching</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
</tr>
<tr>
<td>vision/video</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /> 内嵌</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /> 内嵌</td>
</tr>
<tr>
<td><strong>bench tok/s</strong></td>
<td><strong>54.2</strong></td>
<td><strong>62.7</strong></td>
</tr>
<tr>
<td><strong>bench accept</strong></td>
<td>54%</td>
<td>61%</td>
</tr>
<tr>
<td><strong>实际场景</strong></td>
<td>73-76 tok/s（高命中 99% accept）</td>
<td>接近一致</td>
</tr>
</tbody>
</table>
<h1>【20260529更新_1】</h1>
<p dir="auto"><img src="https://upload.lcz.me/uploads/e15fd7fb-4e7e-4ed8-a89e-341e5af730db.jpeg" alt="03382072-3ea1-4666-8a27-7e57a5d172a3-image.jpeg" class=" img-fluid img-markdown" /><br />
<img src="https://upload.lcz.me/uploads/226fc1d5-d0fc-4772-8032-15b36d20831b.jpeg" alt="46307668-c032-4321-9972-9464ad019234-image.jpeg" class=" img-fluid img-markdown" /><br />
<img src="https://upload.lcz.me/uploads/5287b5f3-19db-4082-86ef-ccd4980f894d.jpeg" alt="ac56489e-dde7-4c7c-87d9-ed8edb132ea0-image.jpeg" class=" img-fluid img-markdown" /><br />
<img src="https://upload.lcz.me/uploads/becc7412-3770-496f-ab72-09c7bb5851e3.jpg" alt="2ef49ea1-2f4f-4cb5-b5d4-fe083120ca98-微信图片_20260529102451_763_277.jpg" class=" img-fluid img-markdown" /></p>
<h1>【先上图，证明不是云】</h1>
<p dir="auto"><img src="https://upload.lcz.me/uploads/5512303d-1bb9-4cd3-9d4c-1eb75ad3e280.jpg" alt="微信图片_20260525230946_630_277.jpg" class=" img-fluid img-markdown" /><br />
<img src="https://upload.lcz.me/uploads/82dc4ba9-b4fd-44dc-b2cf-4d194c2bf83b.jpg" alt="微信图片_20260525230945_629_277.jpg" class=" img-fluid img-markdown" /></p>
<h1>【平台信息】</h1>
<table class="table table-bordered table-striped">
<thead>
<tr>
<th>类别</th>
<th>项</th>
<th>配置</th>
</tr>
</thead>
<tbody>
<tr>
<td><strong>机型</strong></td>
<td>型号</td>
<td>HP Z4 G4 Workstation</td>
</tr>
<tr>
<td></td>
<td>电源</td>
<td>750 W</td>
</tr>
<tr>
<td><strong>CPU</strong></td>
<td>型号</td>
<td>Intel Xeon W-2133</td>
</tr>
<tr>
<td></td>
<td>主频</td>
<td>3.6 GHz</td>
</tr>
<tr>
<td></td>
<td>核 / 线程</td>
<td>6 核 / 12 线程</td>
</tr>
<tr>
<td><strong>内存</strong></td>
<td>类型</td>
<td>DDR4</td>
</tr>
<tr>
<td></td>
<td>容量</td>
<td>32 GB</td>
</tr>
<tr>
<td><strong>GPU 0</strong></td>
<td>型号</td>
<td>RTX 4090（魔改）</td>
</tr>
<tr>
<td></td>
<td>显存</td>
<td>48 GB</td>
</tr>
<tr>
<td></td>
<td>用途</td>
<td>主推理</td>
</tr>
<tr>
<td><strong>GPU 1</strong></td>
<td>型号</td>
<td>RTX 4080（魔改）</td>
</tr>
<tr>
<td></td>
<td>显存</td>
<td>32 GB</td>
</tr>
<tr>
<td></td>
<td>用途</td>
<td>副推理</td>
</tr>
<tr>
<td><strong>GPU 2</strong></td>
<td>型号</td>
<td>RTX 2080 Ti（魔改）</td>
</tr>
<tr>
<td></td>
<td>显存</td>
<td>22 GB</td>
</tr>
<tr>
<td></td>
<td>用途</td>
<td>ComfyUI</td>
</tr>
<tr>
<td><strong>显存合计</strong></td>
<td></td>
<td>102 GB</td>
</tr>
<tr>
<td><strong>系统盘</strong></td>
<td>类型</td>
<td>NVMe M.2 SSD</td>
</tr>
<tr>
<td></td>
<td>容量</td>
<td>256 GB</td>
</tr>
<tr>
<td><strong>数据盘</strong></td>
<td>挂载点</td>
<td><code>/data</code></td>
</tr>
<tr>
<td></td>
<td>容量</td>
<td>458 GB</td>
</tr>
<tr>
<td><strong>系统</strong></td>
<td>OS</td>
<td>Ubuntu 24.04 LTS</td>
</tr>
<tr>
<td></td>
<td>内核</td>
<td>Linux 6.17.0-29-generic</td>
</tr>
</tbody>
</table>
<h1>【4090 token 性能历史】</h1>
<table class="table table-bordered table-striped">
<thead>
<tr>
<th>时间</th>
<th>模型 + 后端</th>
<th>量化</th>
<th>ctx</th>
<th>MTP</th>
<th>视觉</th>
<th>uncensored</th>
<th>单流 tok/s</th>
<th>并发 tok/s</th>
</tr>
</thead>
<tbody>
<tr>
<td>2026-05-17</td>
<td>Qwen3.6-27B-FP8 vLLM</td>
<td>FP8 + FP8 KV</td>
<td>256K</td>
<td>s=3</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td>37</td>
<td>—</td>
</tr>
<tr>
<td>2026-05-21</td>
<td>QuantTrio AWQ Dense vLLM</td>
<td>AWQ INT4 + FP8 KV</td>
<td>256K</td>
<td>s=3</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><strong>81</strong></td>
<td>208 (并发3)</td>
</tr>
<tr>
<td>2026-05-23 中</td>
<td>QuantTrio AWQ-6Bit vLLM</td>
<td>AWQ 6-bit</td>
<td>256K</td>
<td>s=3</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td>68</td>
<td>124 (并发2, 反慢)</td>
</tr>
<tr>
<td>2026-05-23 中</td>
<td>QuantTrio 35B-A3B vLLM</td>
<td>AWQ INT4</td>
<td>256K</td>
<td>s=3</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td>107</td>
<td>351 (并发5)</td>
</tr>
<tr>
<td>2026-05-23 晚</td>
<td>35B-A3B 无 MTP vLLM</td>
<td>AWQ INT4 + batched=16384</td>
<td>256K</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /> 关</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><strong>145</strong></td>
<td>337 (并发5)</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>Qwen3.6-27B-FP8 vLLM</td>
<td>FP8 + FP8 KV + prefix-cache</td>
<td>256K</td>
<td>s=5</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td>60.8</td>
<td>—</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>Qwen3.6-27B-FP8 vLLM</td>
<td>同上</td>
<td>256K</td>
<td><strong>s=7</strong></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><strong>63.8</strong></td>
<td>—</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>Qwen3.6-27B-FP8 vLLM</td>
<td>同上</td>
<td>256K</td>
<td>s=8</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td>64.0（边际死）</td>
<td>—</td>
</tr>
<tr>
<td>2026-05-25</td>
<td>Heretic Q8 llama.cpp（试）</td>
<td>Q8 + q8_0 KV</td>
<td>256K</td>
<td>n=3</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>63.4</td>
<td>—</td>
</tr>
<tr>
<td>2026-05-25</td>
<td>Heretic Q8 llama.cpp（试）</td>
<td>同上</td>
<td>256K</td>
<td>n=5</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>66.5</td>
<td>—</td>
</tr>
<tr>
<td>2026-05-25</td>
<td>Heretic GPTQ-Int4 vLLM（失败）</td>
<td>GPTQ-Int4</td>
<td>256K</td>
<td>s=3</td>
<td>—</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>21（accept 1.25% broken）</td>
<td>—</td>
</tr>
<tr>
<td>2026-05-25</td>
<td><strong>Heretic Q8 llama.cpp</strong> <img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2b50.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--star" style="height:23px;width:auto;vertical-align:middle" title="⭐" alt="⭐" /> <strong>当前 default</strong></td>
<td>Q8 + q8_0 KV</td>
<td>256K</td>
<td><strong>n=7</strong></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><strong>68.7</strong></td>
<td>—</td>
</tr>
</tbody>
</table>
<h1>【4080 token 性能历史】</h1>
<table class="table table-bordered table-striped">
<thead>
<tr>
<th>时间</th>
<th>模型 + 后端</th>
<th>量化</th>
<th>ctx</th>
<th>MTP</th>
<th>视觉</th>
<th>uncensored</th>
<th>单流 tok/s</th>
</tr>
</thead>
<tbody>
<tr>
<td>2026-05-09</td>
<td>QuantTrio AWQ Dense vLLM 0.20.1</td>
<td>AWQ INT4 + FP8 KV</td>
<td>128K</td>
<td>s=3</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td>62.9</td>
</tr>
<tr>
<td>2026-05-11</td>
<td>同上 vLLM 0.20.2（regression）</td>
<td>AWQ INT4</td>
<td>128K</td>
<td>s=2</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td>45.6</td>
</tr>
<tr>
<td>2026-05-22</td>
<td>HauhauCS 27B Aggressive llama.cpp</td>
<td>Q4_K_P GGUF</td>
<td>256K</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /> 无（mmproj 互斥）</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>32</td>
</tr>
<tr>
<td>2026-05-23</td>
<td>QuantTrio 35B-A3B vLLM</td>
<td>AWQ INT4 + FP8 KV + seqs=1</td>
<td>256K</td>
<td>无</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td>106</td>
</tr>
<tr>
<td>2026-05-23</td>
<td>同上</td>
<td>同上</td>
<td>256K</td>
<td>s=1</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td>77</td>
</tr>
<tr>
<td>2026-05-23</td>
<td>同上</td>
<td>同上</td>
<td>256K</td>
<td>s=2</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td>93</td>
</tr>
<tr>
<td>2026-05-23</td>
<td>QuantTrio 35B-A3B vLLM</td>
<td>同上</td>
<td>256K</td>
<td><strong>s=3</strong></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><strong>117</strong></td>
</tr>
<tr>
<td>2026-05-24</td>
<td>SummonGov 27B-MTP graft Q6_K_P</td>
<td>GGUF + q8 KV</td>
<td>64K</td>
<td>n=1</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>40.1</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>同上</td>
<td>同上</td>
<td>64K</td>
<td>n=2</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>50.1</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>同上</td>
<td>同上</td>
<td>64K</td>
<td>n=3</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>55.7</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>同上</td>
<td>同上</td>
<td>64K</td>
<td><strong>n=5</strong></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><strong>58.9</strong></td>
</tr>
<tr>
<td>2026-05-24</td>
<td>同上</td>
<td>同上</td>
<td>64K</td>
<td>n=7</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>55.3</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>SummonGov 27B-MTP Q4_K_P</td>
<td>GGUF + q8 KV</td>
<td>64K</td>
<td>n=3</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>20.6（accept 2% broken）</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>同上</td>
<td>同上</td>
<td>64K</td>
<td><strong>n=5</strong></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><strong>62.5</strong></td>
</tr>
<tr>
<td>2026-05-24</td>
<td>同上</td>
<td>同上</td>
<td>64K</td>
<td>n=7</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/274c.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--x" style="height:23px;width:auto;vertical-align:middle" title="❌" alt="❌" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>56.8</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>llmfan46 Heretic Q6_K llama.cpp</td>
<td>Q6_K + q8 KV</td>
<td>64K</td>
<td>n=3</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>57.0</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>同上</td>
<td>同上</td>
<td>64K</td>
<td>n=5</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><strong>61.6</strong></td>
</tr>
<tr>
<td>2026-05-24</td>
<td>同上</td>
<td>同上</td>
<td>64K</td>
<td>n=7</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>56.5</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>同上 q8_0 KV @ 256K</td>
<td>Q6_K + q8 KV</td>
<td>256K</td>
<td>n=5</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>OOM 差 836 MiB</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>同上 q5_1 KV @ 256K</td>
<td>Q6_K + q5_1 KV</td>
<td>256K</td>
<td>n=5</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>慢（flash-attn 不兼容）</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>同上 q5_0 KV @ 256K</td>
<td>Q6_K + q5_0 KV</td>
<td>256K</td>
<td>n=5</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>12（slow path）</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>同上 iq4_nl KV @ 256K</td>
<td>Q6_K + iq4_nl KV</td>
<td>256K</td>
<td>n=5</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>26（slow path）</td>
</tr>
<tr>
<td>2026-05-24</td>
<td>Heretic GPTQ-Int4 vLLM（失败）</td>
<td>GPTQ INT4</td>
<td>256K</td>
<td>s=3</td>
<td>—</td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td>21（accept 1.25%）</td>
</tr>
<tr>
<td>2026-05-24</td>
<td><strong>llmfan46 Heretic Q6_K llama.cpp</strong> <img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2b50.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--star" style="height:23px;width:auto;vertical-align:middle" title="⭐" alt="⭐" /> <strong>当前 default</strong></td>
<td>Q6_K + <strong>q4_0 KV</strong></td>
<td><strong>256K</strong></td>
<td><strong>n=5</strong></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/2705.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--white_check_mark" style="height:23px;width:auto;vertical-align:middle" title="✅" alt="✅" /></td>
<td><strong>58-62</strong></td>
</tr>
</tbody>
</table>
]]></description><link>https://lcz.me/topic/312/4080-4090不同模型token性能测试</link><generator>RSS for Node</generator><lastBuildDate>Thu, 11 Jun 2026 10:00:37 GMT</lastBuildDate><atom:link href="https://lcz.me/topic/312.rss" rel="self" type="application/rss+xml"/><pubDate>Mon, 25 May 2026 15:34:42 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Tue, 02 Jun 2026 03:44:26 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/michael-zhou" aria-label="Profile: Michael-Zhou">@<bdi>Michael-Zhou</bdi></a> 哈，我昨天也是安装了这位大佬的另外一个模型<a href="https://huggingface.co/llmfan46/Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved-GPTQ-Int4%EF%BC%8C%E6%9A%82%E6%97%B6%E4%BD%BF%E7%94%A8%E4%B9%9F%E6%98%AF%E7%A8%B3%E5%AE%9A%EF%BC%8C%E9%80%9F%E5%BA%A6%E8%BF%98%E4%B8%8D%E9%94%99%E3%80%82%E6%88%914080S32G%E3%80%82" rel="nofollow ugc">https://huggingface.co/llmfan46/Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved-GPTQ-Int4，暂时使用也是稳定，速度还不错。我4080S32G。</a><br />
简单爆测结果：</p>
<pre><code>Qwen3.6-27B-GPTQ-Int4 @ RTX 4080 SUPER

| 指标                   | 数值                                         |
|------------------------|----------------------------------------------|
| 吐字速度               | ~56 tok/s                                    |
| 包含 thinking 推理     | 544 tokens / 9.6s                            |
| 去 thinking 纯有效输出 | 看你 prompt 带不带 [SYSTEM: No reasoning]    |
</code></pre>
]]></description><link>https://lcz.me/post/4626</link><guid isPermaLink="true">https://lcz.me/post/4626</guid><dc:creator><![CDATA[demo]]></dc:creator><pubDate>Tue, 02 Jun 2026 03:44:26 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Mon, 01 Jun 2026 13:08:52 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/demo" aria-label="Profile: demo">@<bdi>demo</bdi></a> 记得是通过加载mmproj 启用视觉， 但是mmproj 和MTP没办法同时开，后面没用这个模型。<br />
推荐vllm跑<a href="https://huggingface.co/llmfan46/Qwen3.6-27B-uncensored-heretic-v2-GPTQ-Int4" rel="nofollow ugc">https://huggingface.co/llmfan46/Qwen3.6-27B-uncensored-heretic-v2-GPTQ-Int4</a> ， 这个有视觉， 我一直跑着，很稳定。<br />
4080 32G启动参数：</p>
<pre><code>exec /data/vllm-env/bin/vllm serve /data/models/heretic-gptq-int4 \
    --served-model-name 4080 \
    --port 8002 \
    --max-model-len 262144 \
    --max-num-seqs 1 \
    --gpu-memory-utilization 0.96 \
    --enable-prefix-caching \
    --kv-cache-dtype fp8 \
    --trust-remote-code \
    --reasoning-parser qwen3 \
    --enable-auto-tool-choice \
    --tool-call-parser qwen3_coder \
    --speculative-config '{"method":"mtp","num_speculative_tokens":3}'
</code></pre>
]]></description><link>https://lcz.me/post/4545</link><guid isPermaLink="true">https://lcz.me/post/4545</guid><dc:creator><![CDATA[Michael Zhou]]></dc:creator><pubDate>Mon, 01 Jun 2026 13:08:52 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Mon, 01 Jun 2026 02:54:25 GMT]]></title><description><![CDATA[<p dir="auto">Zhou，请教一下，HauhauCS 27B Aggressive llama.cpp  是怎样配置视觉参数的呢？我问了gemini和豆包，都是不带视觉的。但是询问他们俩关于比较新的第三方模型，他们总是会出现幻觉</p>
]]></description><link>https://lcz.me/post/4494</link><guid isPermaLink="true">https://lcz.me/post/4494</guid><dc:creator><![CDATA[demo]]></dc:creator><pubDate>Mon, 01 Jun 2026 02:54:25 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Sat, 30 May 2026 00:41:29 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/michael-zhou" aria-label="Profile: Michael-Zhou">@<bdi>Michael-Zhou</bdi></a> 哇塞，自带拆分，那的确不错，捡到宝了</p>
]]></description><link>https://lcz.me/post/4281</link><guid isPermaLink="true">https://lcz.me/post/4281</guid><dc:creator><![CDATA[jenaflex]]></dc:creator><pubDate>Sat, 30 May 2026 00:41:29 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Fri, 29 May 2026 17:22:14 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/jenaflex" aria-label="Profile: jenaflex">@<bdi>jenaflex</bdi></a> 转接卡上没有芯片，就是把PCIEx16分成4份直通出四个oculink口。用的BIOS的Bifurcation。主机是某宝入的二手HP Z4 G4 Workstation，支持PCIE拆分。</p>
]]></description><link>https://lcz.me/post/4259</link><guid isPermaLink="true">https://lcz.me/post/4259</guid><dc:creator><![CDATA[Michael Zhou]]></dc:creator><pubDate>Fri, 29 May 2026 17:22:14 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Fri, 29 May 2026 16:50:06 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/michael-zhou" aria-label="Profile: Michael-Zhou">@<bdi>Michael-Zhou</bdi></a> 对大家（lsp）得口味把握精准 哈哈哈哈<br />
你得oculink卡是内置PLX拆分芯片，还是利用BIOS的Bifurcation？<br />
好像记得Intel商用机工作站主板很少支持Bifurcation的，AMD EPYC主板支持的比较多</p>
]]></description><link>https://lcz.me/post/4256</link><guid isPermaLink="true">https://lcz.me/post/4256</guid><dc:creator><![CDATA[jenaflex]]></dc:creator><pubDate>Fri, 29 May 2026 16:50:06 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Fri, 29 May 2026 13:02:58 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/terry" aria-label="Profile: terry">@<bdi>terry</bdi></a> 是的， PCIE x16的槽拆分成x4x4x4x4，PICE扩展卡能接4个显卡坞。</p>
]]></description><link>https://lcz.me/post/4218</link><guid isPermaLink="true">https://lcz.me/post/4218</guid><dc:creator><![CDATA[Michael Zhou]]></dc:creator><pubDate>Fri, 29 May 2026 13:02:58 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Fri, 29 May 2026 13:01:07 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/blackjack" aria-label="Profile: blackjack">@<bdi>blackjack</bdi></a> 不在日本。图片是EDIX参展时拍的，估计大家喜欢看，就放上去了。</p>
]]></description><link>https://lcz.me/post/4217</link><guid isPermaLink="true">https://lcz.me/post/4217</guid><dc:creator><![CDATA[Michael Zhou]]></dc:creator><pubDate>Fri, 29 May 2026 13:01:07 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Fri, 29 May 2026 12:52:24 GMT]]></title><description><![CDATA[<blockquote>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/michael-zhou" aria-label="Profile: Michael-Zhou">@<bdi>Michael-Zhou</bdi></a> <a href="/post/4170">说</a>:</p>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/blackjack" aria-label="Profile: blackjack">@<bdi>blackjack</bdi></a> 图片更新了</p>
</blockquote>
<p dir="auto">非常感谢，人在日本啊</p>
]]></description><link>https://lcz.me/post/4216</link><guid isPermaLink="true">https://lcz.me/post/4216</guid><dc:creator><![CDATA[blackjack]]></dc:creator><pubDate>Fri, 29 May 2026 12:52:24 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Fri, 29 May 2026 08:44:30 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/michael-zhou" aria-label="Profile: Michael-Zhou">@<bdi>Michael-Zhou</bdi></a> 这台式机是安装了Oculink的PICE扩展卡？</p>
]]></description><link>https://lcz.me/post/4173</link><guid isPermaLink="true">https://lcz.me/post/4173</guid><dc:creator><![CDATA[terry]]></dc:creator><pubDate>Fri, 29 May 2026 08:44:30 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Fri, 29 May 2026 07:42:55 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/blackjack" aria-label="Profile: blackjack">@<bdi>blackjack</bdi></a> 图片更新了</p>
]]></description><link>https://lcz.me/post/4170</link><guid isPermaLink="true">https://lcz.me/post/4170</guid><dc:creator><![CDATA[Michael Zhou]]></dc:creator><pubDate>Fri, 29 May 2026 07:42:55 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Wed, 27 May 2026 10:38:44 GMT]]></title><description><![CDATA[<p dir="auto">有点无从下手啊 怎么办呢？</p>
]]></description><link>https://lcz.me/post/3924</link><guid isPermaLink="true">https://lcz.me/post/3924</guid><dc:creator><![CDATA[Groot Ace]]></dc:creator><pubDate>Wed, 27 May 2026 10:38:44 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Mon, 25 May 2026 22:09:28 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/michael-zhou" aria-label="Profile: Michael-Zhou">@<bdi>Michael-Zhou</bdi></a> 这个玩的有点让人热血澎湃，说真的我也挺羡慕的，<img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/1f602.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--joy" style="height:23px;width:auto;vertical-align:middle" title="😂" alt="😂" /></p>
]]></description><link>https://lcz.me/post/3664</link><guid isPermaLink="true">https://lcz.me/post/3664</guid><dc:creator><![CDATA[terry]]></dc:creator><pubDate>Mon, 25 May 2026 22:09:28 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Mon, 25 May 2026 15:59:54 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/blackjack" aria-label="Profile: blackjack">@<bdi>blackjack</bdi></a> 明天找时间上图</p>
]]></description><link>https://lcz.me/post/3640</link><guid isPermaLink="true">https://lcz.me/post/3640</guid><dc:creator><![CDATA[Michael Zhou]]></dc:creator><pubDate>Mon, 25 May 2026 15:59:54 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Mon, 25 May 2026 15:49:49 GMT]]></title><description><![CDATA[<p dir="auto">感谢分享。棒棒哒。辛苦了兄弟。</p>
]]></description><link>https://lcz.me/post/3637</link><guid isPermaLink="true">https://lcz.me/post/3637</guid><dc:creator><![CDATA[williamlouis]]></dc:creator><pubDate>Mon, 25 May 2026 15:49:49 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Mon, 25 May 2026 15:45:57 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/michael-zhou" aria-label="Profile: Michael-Zhou">@<bdi>Michael-Zhou</bdi></a> 妈耶，感觉4080跟我3080差不多</p>
]]></description><link>https://lcz.me/post/3635</link><guid isPermaLink="true">https://lcz.me/post/3635</guid><dc:creator><![CDATA[rock shi]]></dc:creator><pubDate>Mon, 25 May 2026 15:45:57 GMT</pubDate></item><item><title><![CDATA[Reply to 4080&amp;4090不同模型token性能测试 on Mon, 25 May 2026 15:43:52 GMT]]></title><description><![CDATA[<p dir="auto">多上几张机箱和拓展坞还有连接的图啊</p>
]]></description><link>https://lcz.me/post/3634</link><guid isPermaLink="true">https://lcz.me/post/3634</guid><dc:creator><![CDATA[blackjack]]></dc:creator><pubDate>Mon, 25 May 2026 15:43:52 GMT</pubDate></item></channel></rss>