<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4]]></title><description><![CDATA[<p dir="auto">以下大部分是比较罗嗦的流水账，技术内容在分割线后。</p>
<p dir="auto">前阵子入坑了本地AI。手上有两张3090，但是自己用的机器机箱、电源只够跑一张。而且平时要打打游戏，用Adobe系列软件，需要占用显卡，来回释放显存也很麻烦。想着再配一台电脑吧，但是看了看现在内存、SSD的价钱，还是算了吧。</p>
<p dir="auto">后来看了牢特视频，了解到了洋垃圾的世界。果断某东搜索华南金牌，到官方店下单了一整套X10X99套餐，带两个U，4x32G ECC拆机条，一个2TB长城NVME，总共6000（机箱散热器这些重的大的东西就没买了）。说实话玩洋垃圾似乎有点小贵，但是比起在美国装新机器还是便宜太多了，而且将来淘汰下来还可以跑我那一堆乱七八糟的docker服务。虽然美国Aliexpress也有X99套装，但是买着感觉就是不如京东旗舰店放心。又花了700元子运到了美国，大概一个星期就到了。</p>
<p dir="auto"><img src="https://upload.lcz.me/uploads/2f3a08ad-02d8-4210-8aaf-78c5ad2f6ef1.jpg" alt="e4d5a08f-cafd-4db5-a017-7b15a298d241-eda36dbeb1f234fb7695b3f90b3eb730.jpg" class=" img-fluid img-markdown" /></p>
<p dir="auto">两个洋垃圾当年不远万里离开了北美机房温柔乡，被送到了深圳冰冷的仓库里，现在终于回到了家乡的温暖怀抱。我把几年前挖矿用的开放式机架和两个电源废物利用，给他搭了个窝...</p>
<p dir="auto"><img src="https://upload.lcz.me/uploads/f68dad4f-e52e-4f3d-bcc2-e5c32b87783b.jpg" alt="3e086e4c-6619-4f03-b2e1-850bed6ed23b-17ef967d0a90eb439414c6534c580730.jpg" class=" img-fluid img-markdown" /></p>
<p dir="auto"><img src="https://upload.lcz.me/uploads/7fb169c6-a6b8-43ae-a001-d76ec08764e1.jpg" alt="77d10cff-dfa1-4e4f-8123-4a340e91a469-99ab7faab906937db2d43e200019ccbc.jpg" class=" img-fluid img-markdown" /></p>
<p dir="auto">装机一切都算顺利，只是到最后就是不POST，主板报"Ad"码。售后小哥跟我一起排查了一遍，4条内存8个插槽排列组合搞了一遍也没点亮屏幕。最后发现是因为显示器接的是HDMI，主板上有个3针跳帽负责切换显示模式。插在AB上面是VGA，把它给换到BC上面屏幕就成功亮了... 华南金牌东西是挺好的，就是说明书完全没用，不知道将来会不会对其他人有所帮助。</p>
<p dir="auto"><img src="https://upload.lcz.me/uploads/71ab94cd-1f5d-43f6-b6f2-a3fe200a44cb.jpg" alt="d59dea89-6e5a-4186-b2f8-8a038942e430-1cbbc8acfc06938991fd0805583723f6.jpg" class=" img-fluid img-markdown" /></p>
<p dir="auto">接下来就是装Ubuntu，装各种必要的软件，然后就是跑分了。</p>
<p dir="auto">================================分割线================================</p>
<p dir="auto">直接跑Github上 <a href="https://github.com/noonghunna/club-3090" rel="nofollow ugc">club-3090</a> 的懒人包，以前我都是跑单卡的，现在终于能跑双卡了。用的是vllm，AutoRound INT4 量化，FP8 KV，MTP n=3，262K上下文。</p>
<p dir="auto">两张卡我都按推荐的限制到了290W。用的是最新的595驱动，13.2 CUDA。</p>
<pre><code>========== NARRATIVE (prompt=65 chars, max_tokens=1000) ==========
=== warmups (3) ===
  warm-1     wall= 19.64s  ttft=   184ms  toks=1000  wall_TPS= 50.91  decode_TPS= 51.40
  warm-2     wall= 19.46s  ttft=   188ms  toks=1000  wall_TPS= 51.39  decode_TPS= 51.89
  warm-3     wall= 19.02s  ttft=   188ms  toks=1000  wall_TPS= 52.58  decode_TPS= 53.10

=== measured (5) ===
  run-1      wall= 19.03s  ttft=   146ms  toks=1000  wall_TPS= 52.56  decode_TPS= 52.97
  run-2      wall= 19.25s  ttft=   187ms  toks=1000  wall_TPS= 51.94  decode_TPS= 52.45
  run-3      wall= 20.46s  ttft=   145ms  toks=1000  wall_TPS= 48.88  decode_TPS= 49.23
  run-4      wall= 19.59s  ttft=   193ms  toks=1000  wall_TPS= 51.06  decode_TPS= 51.57
  run-5      wall= 18.44s  ttft=   143ms  toks= 972  wall_TPS= 52.70  decode_TPS= 53.11

=== summary [narrative] (n=5) ===
  wall_TPS       mean=  51.43   std=  1.57   CV= 3.0%   min=48.88   max=52.70
  decode_TPS     mean=  51.86   std=  1.59   CV= 3.1%   min=49.23   max=53.11
  TTFT          mean=   163ms  std=   25ms  min=143ms  max=193ms
  PP tok/s       mean=   1.00   std=  1.37   CV=136.9%   min=0.00   max=2.50

========== CODE (prompt=78 chars, max_tokens=800) ==========
=== warmups (3) ===
  warm-1     wall= 10.25s  ttft=   152ms  toks= 691  wall_TPS= 67.39  decode_TPS= 68.41
  warm-2     wall=  7.23s  ttft=   183ms  toks= 478  wall_TPS= 66.11  decode_TPS= 67.82
  warm-3     wall=  8.92s  ttft=   183ms  toks= 556  wall_TPS= 62.36  decode_TPS= 63.67

=== measured (5) ===
  run-1      wall=  7.08s  ttft=   186ms  toks= 466  wall_TPS= 65.81  decode_TPS= 67.58
  run-2      wall= 11.91s  ttft=   185ms  toks= 784  wall_TPS= 65.85  decode_TPS= 66.89
  run-3      wall= 11.79s  ttft=   184ms  toks= 771  wall_TPS= 65.41  decode_TPS= 66.45
  run-4      wall= 11.58s  ttft=   188ms  toks= 746  wall_TPS= 64.43  decode_TPS= 65.49
  run-5      wall= 12.38s  ttft=   185ms  toks= 800  wall_TPS= 64.61  decode_TPS= 65.59

=== summary [code] (n=5) ===
  wall_TPS       mean=  65.22   std=  0.67   CV= 1.0%   min=64.43   max=65.85
  decode_TPS     mean=  66.40   std=  0.88   CV= 1.3%   min=65.49   max=67.58
  TTFT          mean=   186ms  std=    2ms  min=184ms  max=188ms
  PP tok/s       mean=   2.50   std=  1.77   CV=70.7%   min=0.00   max=5.00
</code></pre>
<p dir="auto">综合的看就是写作文能跑到50多T/s，写码大概65+T/s。虽然感觉已经很够用了，但是离项目上描述的69/89还有一定的距离，具体还要再多跑一下多调试一下。</p>
<p dir="auto"><img src="https://upload.lcz.me/uploads/26abdeb4-d58a-4d94-884e-c6c87e57eadd.jpeg" alt="ae32a004-3647-432e-b429-419a2fa9dd9e-image.jpeg" class=" img-fluid img-markdown" /></p>
]]></description><link>https://lcz.me/topic/498/洋垃圾的回乡路-2x3090-x99-2x2680v4</link><generator>RSS for Node</generator><lastBuildDate>Thu, 11 Jun 2026 13:42:57 GMT</lastBuildDate><atom:link href="https://lcz.me/topic/498.rss" rel="self" type="application/rss+xml"/><pubDate>Tue, 09 Jun 2026 23:55:53 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 16:43:51 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/kos-or" aria-label="Profile: kos-or">@<bdi>kos-or</bdi></a> 我之前看到过有个CAD harness，可以让ai做cad，但是具体还没研究过。我搞CAD不专业，打印机买来给儿子打印玩具的...</p>
]]></description><link>https://lcz.me/post/6175</link><guid isPermaLink="true">https://lcz.me/post/6175</guid><dc:creator><![CDATA[Ray Wang]]></dc:creator><pubDate>Wed, 10 Jun 2026 16:43:51 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 15:10:49 GMT]]></title><description><![CDATA[<p dir="auto">旁邊好像有一台3D打印機 能結合AI設計打印嗎？<br />
我沒用過3D打印機</p>
]]></description><link>https://lcz.me/post/6162</link><guid isPermaLink="true">https://lcz.me/post/6162</guid><dc:creator><![CDATA[kos or]]></dc:creator><pubDate>Wed, 10 Jun 2026 15:10:49 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 14:41:12 GMT]]></title><description><![CDATA[<blockquote>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/leon-y" aria-label="Profile: Leon-Y">@<bdi>Leon-Y</bdi></a> <a href="/post/6130">说</a>:</p>
<blockquote>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/passss" aria-label="Profile: passss">@<bdi>passss</bdi></a> <a href="/post/6124">说</a>:</p>
<p dir="auto">楼主双3090用多大的电源？我1200W跑27b-mtp不限制到300W以下就要重启</p>
</blockquote>
<p dir="auto">我用的Corsair RM1000X， 同时满载300W稳稳的，双3090卡换了新的硅脂和散热贴，温度压在75度左右，就是风扇噪音有点吓人</p>
</blockquote>
<p dir="auto">75度真低，我280w 都快80<br />
多几天装机箱前3个风扇 希望会帮助</p>
]]></description><link>https://lcz.me/post/6156</link><guid isPermaLink="true">https://lcz.me/post/6156</guid><dc:creator><![CDATA[applejuice]]></dc:creator><pubDate>Wed, 10 Jun 2026 14:41:12 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 14:28:40 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/kos-or" aria-label="Profile: kos-or">@<bdi>kos-or</bdi></a> <img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/1f644.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--face_with_rolling_eyes" style="height:23px;width:auto;vertical-align:middle" title=":face_with_rolling_eyes:" alt="🙄" /> 我太懒了，主要我有条件扔地下室，所以直接摆了一台风扇在那24小时吹着... 压力测试了一下，E5其实发热不大，然后GPU的话还有专门的机箱120mm风扇吹着，问题也不大。</p>
]]></description><link>https://lcz.me/post/6154</link><guid isPermaLink="true">https://lcz.me/post/6154</guid><dc:creator><![CDATA[Ray Wang]]></dc:creator><pubDate>Wed, 10 Jun 2026 14:28:40 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 14:24:28 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/agi" aria-label="Profile: AGI">@<bdi>AGI</bdi></a> 哎，我这台机器本来也是准备进柜的。但是几年前买这个机柜时候啥也不懂，买的是那种很浅的，只能放点Switch这些小设备...</p>
]]></description><link>https://lcz.me/post/6153</link><guid isPermaLink="true">https://lcz.me/post/6153</guid><dc:creator><![CDATA[Ray Wang]]></dc:creator><pubDate>Wed, 10 Jun 2026 14:24:28 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 14:22:31 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/applejuice" aria-label="Profile: applejuice">@<bdi>applejuice</bdi></a> 没有，感觉用不到，我也没有VGA线哈哈。过几天我把IPMI装上就完全满足我需求了。</p>
]]></description><link>https://lcz.me/post/6152</link><guid isPermaLink="true">https://lcz.me/post/6152</guid><dc:creator><![CDATA[Ray Wang]]></dc:creator><pubDate>Wed, 10 Jun 2026 14:22:31 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 14:21:05 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/passss" aria-label="Profile: passss">@<bdi>passss</bdi></a> 嗯，现实就是，哪怕限制到300w，瞬时峰值还是有可能冲到400w以上。</p>
<p dir="auto">我之前挖矿7、8台GPU一起跑，而且美国这边是110v家用电，上大瓦数不好上，所以我用的都是多电源模式。现在这个配置是两个850w电源，中间用一个add2psu（下图）同步启动器。2号电源的24针插到这上面，然后接一根molex到1号电源上，就可以做到两台电源同时启动了。</p>
<p dir="auto">两个电源分别带一个GPU，完全够用。这套东西是经过广大矿工考验的<img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/1f605.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--sweat_smile" style="height:23px;width:auto;vertical-align:middle" title=":sweat_smile:" alt="😅" /> ，安全可靠。你有条件的话可以也像我这样再加个电源，我觉得比折腾一个大电源省事一点。</p>
<p dir="auto"><img src="https://upload.lcz.me/uploads/90cdbaf9-b203-4a42-a327-3d1f0ee554c5.jpeg" alt="5be28026-d948-439d-8940-d7ec6cf45b01-image.jpeg" class=" img-fluid img-markdown" /></p>
]]></description><link>https://lcz.me/post/6151</link><guid isPermaLink="true">https://lcz.me/post/6151</guid><dc:creator><![CDATA[Ray Wang]]></dc:creator><pubDate>Wed, 10 Jun 2026 14:21:05 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 13:01:01 GMT]]></title><description><![CDATA[<blockquote>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/passss" aria-label="Profile: passss">@<bdi>passss</bdi></a> <a href="/post/6124">说</a>:</p>
<p dir="auto">楼主双3090用多大的电源？我1200W跑27b-mtp不限制到300W以下就要重启</p>
</blockquote>
<p dir="auto">我用的Corsair RM1000X， 同时满载300W稳稳的，双3090卡换了新的硅脂和散热贴，温度压在75度左右，就是风扇噪音有点吓人</p>
]]></description><link>https://lcz.me/post/6130</link><guid isPermaLink="true">https://lcz.me/post/6130</guid><dc:creator><![CDATA[Leon Y]]></dc:creator><pubDate>Wed, 10 Jun 2026 13:01:01 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 12:40:54 GMT]]></title><description><![CDATA[<p dir="auto">楼主，你的kmv 有视频输出吗？ 我的一直显示no signal，我没什么用所以没理他 但是始终放在心里</p>
]]></description><link>https://lcz.me/post/6128</link><guid isPermaLink="true">https://lcz.me/post/6128</guid><dc:creator><![CDATA[applejuice]]></dc:creator><pubDate>Wed, 10 Jun 2026 12:40:54 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 11:36:55 GMT]]></title><description><![CDATA[<p dir="auto">楼主双3090用多大的电源？我1200W跑27b-mtp不限制到300W以下就要重启</p>
]]></description><link>https://lcz.me/post/6124</link><guid isPermaLink="true">https://lcz.me/post/6124</guid><dc:creator><![CDATA[passss]]></dc:creator><pubDate>Wed, 10 Jun 2026 11:36:55 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 10:42:40 GMT]]></title><description><![CDATA[<blockquote>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/ray-wang" aria-label="Profile: Ray-Wang">@<bdi>Ray-Wang</bdi></a> <a href="/post/6080">说</a>:</p>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/applejuice" aria-label="Profile: applejuice">@<bdi>applejuice</bdi></a> 好像跟我这code成绩差不多，但你功耗低很多，明天我跑一下试试</p>
</blockquote>
<p dir="auto">测了3090 300w 跟250w 只差5-10%<br />
230w 跟 250w 好像也只有5%的样子</p>
<p dir="auto">所以限制230w-250w = 差不多就有90% 的效率</p>
]]></description><link>https://lcz.me/post/6120</link><guid isPermaLink="true">https://lcz.me/post/6120</guid><dc:creator><![CDATA[applejuice]]></dc:creator><pubDate>Wed, 10 Jun 2026 10:42:40 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 08:06:20 GMT]]></title><description><![CDATA[<p dir="auto">折腾硬件很是麻烦。两年前买的Dell T5810，没有买华南散件，因为dell的质量的确很牛，二是因为放到家里当homelab的，不需要显卡。没想到两年后，显卡和电源成了制约因素。所以前几天买了蓝宝石7900xtx和1000w电源套装，电源专供显卡供电，还需要买一个同步启动器，拼多多10块钱，只能这样凑合着，想入第二块7900xtx，还放不下了，不想外置。<br />
机箱盖子无法正常盖好，也挺好，llm满载的时候，把盖子挪走，温度降低10度...</p>
<p dir="auto"><img src="https://i.mji.rip/2026/06/10/d0c6a0e8b727829c66648fdf292a9fe0.jpeg" alt="d0c6a0e8b727829c66648fdf292a9fe0.jpeg" class=" img-fluid img-markdown" /><br />
<img src="https://i.mji.rip/2026/06/10/68bcbaad0c6426dd144aa99c4687a95a.jpeg" alt="68bcbaad0c6426dd144aa99c4687a95a.jpeg" class=" img-fluid img-markdown" /></p>
]]></description><link>https://lcz.me/post/6094</link><guid isPermaLink="true">https://lcz.me/post/6094</guid><dc:creator><![CDATA[AGI]]></dc:creator><pubDate>Wed, 10 Jun 2026 08:06:20 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 07:39:47 GMT]]></title><description><![CDATA[<p dir="auto">我想幫Open Rack 設計一個機殼 製造空氣流通airflow 也能保護設備 避免灰塵累積</p>
]]></description><link>https://lcz.me/post/6092</link><guid isPermaLink="true">https://lcz.me/post/6092</guid><dc:creator><![CDATA[kos or]]></dc:creator><pubDate>Wed, 10 Jun 2026 07:39:47 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 07:36:35 GMT]]></title><description><![CDATA[<p dir="auto">我們用這種Open Rack 熱氣肯定有的 (夏天快到了更受不了) ,<br />
除了Power Limit之外 , 就是用電風扇和窗型排風扇 產生空氣流通</p>
<p dir="auto"><img src="https://upload.lcz.me/uploads/bea1ee3b-18b9-4e6b-be01-8680aca1360b.jpeg" alt="5845f593-b077-4c3c-9702-6226e80dd53b-image.jpeg" class=" img-fluid img-markdown" /></p>
]]></description><link>https://lcz.me/post/6091</link><guid isPermaLink="true">https://lcz.me/post/6091</guid><dc:creator><![CDATA[kos or]]></dc:creator><pubDate>Wed, 10 Jun 2026 07:36:35 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 07:22:17 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/ray-wang" aria-label="Profile: Ray-Wang">@<bdi>Ray-Wang</bdi></a> 感謝建議 ！我等礦機架到了再實際量測 購買Riser<br />
我有其他礦機架小型的 大約6~8 GPU-sized 你的屬於挑高型 我剛實際量測加上照片目測 你的顯卡支撐條大概挑高了約10cm , 這樣的設計對於你主板CPU上的大型Heatsinnk 是有利的</p>
]]></description><link>https://lcz.me/post/6089</link><guid isPermaLink="true">https://lcz.me/post/6089</guid><dc:creator><![CDATA[kos or]]></dc:creator><pubDate>Wed, 10 Jun 2026 07:22:17 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 06:39:04 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/applejuice" aria-label="Profile: applejuice">@<bdi>applejuice</bdi></a> 好像跟我这code成绩差不多，但你功耗低很多，明天我跑一下试试</p>
]]></description><link>https://lcz.me/post/6080</link><guid isPermaLink="true">https://lcz.me/post/6080</guid><dc:creator><![CDATA[Ray Wang]]></dc:creator><pubDate>Wed, 10 Jun 2026 06:39:04 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 06:34:54 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/kos-or" aria-label="Profile: kos-or">@<bdi>kos-or</bdi></a> 我买的25cm的，一言难尽，放到最边上有点短会有点扯，放到中间又很长有点折，但好歹没什么大问题，建议你架子到手了以后，把gpu排好了，自己测量一下再买线。</p>
]]></description><link>https://lcz.me/post/6076</link><guid isPermaLink="true">https://lcz.me/post/6076</guid><dc:creator><![CDATA[Ray Wang]]></dc:creator><pubDate>Wed, 10 Jun 2026 06:34:54 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 05:30:26 GMT]]></title><description><![CDATA[<p dir="auto">謝謝樓主分享, 我的配置跟你很像 目前正在備貨,<br />
請問這PCIe x 16 Riser adapter 長度是多少cm ?<br />
我在考慮要買 15, 20, or 25 cm 哪種規格</p>
<p dir="auto"><img src="https://upload.lcz.me/uploads/476442e5-1c9f-4acd-9241-9bd805b94af4.jpeg" alt="54e2d2d0-986b-4e20-b781-fd5a6fe2ac99-image.jpeg" class=" img-fluid img-markdown" /></p>
]]></description><link>https://lcz.me/post/6073</link><guid isPermaLink="true">https://lcz.me/post/6073</guid><dc:creator><![CDATA[kos or]]></dc:creator><pubDate>Wed, 10 Jun 2026 05:30:26 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 04:17:56 GMT]]></title><description><![CDATA[<blockquote>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/ray-wang" aria-label="Profile: Ray-Wang">@<bdi>Ray-Wang</bdi></a> <a href="/post/6064">说</a>:</p>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/applejuice" aria-label="Profile: applejuice">@<bdi>applejuice</bdi></a> 话说你跑大模型大概能到多少TPS呢，我看你之前的作业，只有烤机，没有写跑分</p>
</blockquote>
<p dir="auto">VLLM, 上面那张限制230w，下面那张限制240w</p>
<h3>vLLM 启动参数 (docker-compose.yml)</h3>
<pre><code class="language-yaml">--model /models/heretic-gptq-int4
--served-model-name qwen3.6-27b-heretic
--quantization gptq_marlin
--dtype float16
--tensor-parallel-size 2
--max-model-len 262144
--gpu-memory-utilization 0.9        # via ~/vllm/.env: GPU_MEM_UTIL=0.9  - 之前设置0.932 但是用下来显存到了23.4x, 所以我把它降下来
--max-num-seqs 2
--max-num-batched-tokens 8192
--kv-cache-dtype fp8_e5m2
--trust-remote-code
--reasoning-parser qwen3
--enable-auto-tool-choice
--tool-call-parser qwen3_coder
--enable-prefix-caching
--enable-chunked-prefill
--disable-custom-all-reduce         
--host 0.0.0.0
--port 8000
</code></pre>
<h3>测试结果 NVLINK</h3>
<table class="table table-bordered table-striped">
<thead>
<tr>
<th>测试项目</th>
<th>数值</th>
</tr>
</thead>
<tbody>
<tr>
<td>首响应时间 TTFT（短 prompt，冷启动）</td>
<td><strong>163 ms</strong></td>
</tr>
<tr>
<td>总响应时间（10 tokens）</td>
<td>293 ms</td>
</tr>
<tr>
<td>Prefill 1K</td>
<td>1,991 tok/s</td>
</tr>
<tr>
<td>Prefill 4K</td>
<td><strong>2,036 tok/s</strong></td>
</tr>
<tr>
<td>Prefill 16K</td>
<td>1,985 tok/s</td>
</tr>
<tr>
<td><strong>Decode（单流）</strong></td>
<td><strong>69.3 tok/s</strong></td>
</tr>
<tr>
<td>50K prompt 冷启动 TTFT</td>
<td>25.06 秒</td>
</tr>
<tr>
<td>50K prompt 缓存命中 TTFT</td>
<td>0.69 秒</td>
</tr>
<tr>
<td>缓存加速比</td>
<td>36.5×</td>
</tr>
</tbody>
</table>
<p dir="auto">没有NVLINK</p>
<h3>Prefill 4K 重复测量 (5 次)</h3>
<table class="table table-bordered table-striped">
<thead>
<tr>
<th>run</th>
<th>prompt_tokens</th>
<th>ttft</th>
<th>tok/s</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>3 836</td>
<td>2 776 ms</td>
<td>1 382</td>
</tr>
<tr>
<td>2</td>
<td>3 836</td>
<td>2 735 ms</td>
<td>1 403</td>
</tr>
<tr>
<td>3</td>
<td>3 834</td>
<td>2 665 ms</td>
<td>1 439</td>
</tr>
<tr>
<td>4</td>
<td>3 833</td>
<td>2 770 ms</td>
<td>1 384</td>
</tr>
<tr>
<td>5</td>
<td>3 838</td>
<td>2 772 ms</td>
<td>1 384</td>
</tr>
</tbody>
</table>
<p dir="auto">mean=1 398, median=1 384, min=1 382, max=1 439. <strong>σ ≈ 22 tok/s</strong> (1.6% 变化) — 极稳，退化是稳态而非瞬态。</p>
<h3>Decode 单流 重复测量 (4 次)</h3>
<table class="table table-bordered table-striped">
<thead>
<tr>
<th>run</th>
<th>prompt_tokens</th>
<th>completion_tokens</th>
<th>ttft</th>
<th>decode tok/s</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>76</td>
<td>220</td>
<td>256 ms</td>
<td>66.2</td>
</tr>
<tr>
<td>2</td>
<td>79</td>
<td>220</td>
<td>278 ms</td>
<td>66.6</td>
</tr>
<tr>
<td>3</td>
<td>81</td>
<td>220</td>
<td>284 ms</td>
<td>66.7</td>
</tr>
<tr>
<td>4</td>
<td>80</td>
<td>220</td>
<td>284 ms</td>
<td>66.7</td>
</tr>
</tbody>
</table>
]]></description><link>https://lcz.me/post/6065</link><guid isPermaLink="true">https://lcz.me/post/6065</guid><dc:creator><![CDATA[applejuice]]></dc:creator><pubDate>Wed, 10 Jun 2026 04:17:56 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 04:01:45 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/applejuice" aria-label="Profile: applejuice">@<bdi>applejuice</bdi></a> 话说你跑大模型大概能到多少TPS呢，我看你之前的作业，只有烤机，没有写跑分</p>
]]></description><link>https://lcz.me/post/6064</link><guid isPermaLink="true">https://lcz.me/post/6064</guid><dc:creator><![CDATA[Ray Wang]]></dc:creator><pubDate>Wed, 10 Jun 2026 04:01:45 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 02:35:40 GMT]]></title><description><![CDATA[<p dir="auto">有地下室真好 我放在房间外面 整个空间都可以感觉热气</p>
]]></description><link>https://lcz.me/post/6061</link><guid isPermaLink="true">https://lcz.me/post/6061</guid><dc:creator><![CDATA[applejuice]]></dc:creator><pubDate>Wed, 10 Jun 2026 02:35:40 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 02:15:17 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/applejuice" aria-label="Profile: applejuice">@<bdi>applejuice</bdi></a> 地下室，随便它怎么叫唤我也听不到，延迟的事情我得研究一下。</p>
]]></description><link>https://lcz.me/post/6058</link><guid isPermaLink="true">https://lcz.me/post/6058</guid><dc:creator><![CDATA[Ray Wang]]></dc:creator><pubDate>Wed, 10 Jun 2026 02:15:17 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 02:12:54 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/terry" aria-label="Profile: terry">@<bdi>terry</bdi></a> 这是我家地下室<img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/1f602.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--joy" style="height:23px;width:auto;vertical-align:middle" title="😂" alt="😂" /></p>
]]></description><link>https://lcz.me/post/6057</link><guid isPermaLink="true">https://lcz.me/post/6057</guid><dc:creator><![CDATA[Ray Wang]]></dc:creator><pubDate>Wed, 10 Jun 2026 02:12:54 GMT</pubDate></item><item><title><![CDATA[Reply to 洋垃圾的回乡路 —— 2x3090 + X99 + 2x2680v4 on Wed, 10 Jun 2026 01:47:18 GMT]]></title><description><![CDATA[<p dir="auto">一样硬件<img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/1f91d.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--handshake" style="height:23px;width:auto;vertical-align:middle" title=":handshake:" alt="🤝" /><br />
但是这个东西放在房间不得热死？</p>
<p dir="auto">双路cpu 好像会有延迟？</p>
]]></description><link>https://lcz.me/post/6056</link><guid isPermaLink="true">https://lcz.me/post/6056</guid><dc:creator><![CDATA[applejuice]]></dc:creator><pubDate>Wed, 10 Jun 2026 01:47:18 GMT</pubDate></item></channel></rss>